src-kafka

Commit Graph

Author	SHA1	Message	Date
Mickael Maison	83503404e4	KAFKA-6770; Add New Protocol Versions to 1.1.0 documentation (#4847 ) Update 1.1 docs to include 2 new versions to existing APIs: - DescribeConfigs v1 - Fetch v7 Also fix a typo in FetchRequest.	7 years ago
Rajini Sivaram	e5de679d62	KAFKA-6765: Handle exception while reading throttle metric value in test (#4869 ) Quota tests wait for throttle metric to be updated without waiting for requests to complete to avoid waiting for potentially large throttle times. This requires the test to read metric values while a broker may be updating the value, resulting in exception in the test. Since this issue can also occur with JMX metrics reporter, change synchronization on metrics with sensors to use the sensor as lock.	7 years ago
Andy Coates	432c82d3bf	KAFKA-6727; Fix broken Config hashCode() and equals() (#4796 ) Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Ismael Juma	f3ed56b21f	MINOR: Mention that -1 disables retention by time (#4881 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Guozhang Wang	9871357086	KAFKA-6592: Follow-up (#4864 ) Do not require ConsoleConsumer to specify inner serde as a special property, but just a normal property of the message formatter.	7 years ago
Guozhang Wang	0dc7f0e66f	KAFKA-6611, PART II: Improve Streams SimpleBenchmark (#4854 ) SimpleBenchmark: 1.a Do not rely on manual num.records / bytes collection on atomic integers. 1.b Rely on config files for num.threads, bootstrap.servers, etc. 1.c Add parameters for key skewness and value size. 1.d Refactor the tests for loading phase, adding tumbling-windowed count. 1.e For consumer / consumeproduce, collect metrics on consumer instead. 1.f Force stop the test after 3 minutes, this is based on empirical numbers of 10M records. Other tests: use config for kafka bootstrap servers. streams_simple_benchmark.py: only use scale 1 for system test, remove yahoo from benchmark tests. Note that the JMX-based metrics are more accurate than the manually collected metrics. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	b599b395f3	KAFKA-6058: Refactor consumer API result return types (#4856 ) Refactored the return types in consumer group APIs in the following way: ``` Map<TopicPartition, KafkaFuture<Void>> DeleteConsumerGroupsResult#deletedGroups() Map<TopicPartition, KafkaFuture<ConsumerGroupDescription>> DescribeConsumerGroupsResult#describedGroups() KafkaFuture<Collection<ConsumerGroupListing>> ListConsumerGroupsResult#listings() KafkaFuture<Map<TopicPartition, OffsetAndMetadata>> ListConsumerGroupOffsetsResult#partitionsToOffsetAndMetadata() ``` * For DeleteConsumerGroupsResult and DescribeConsumerGroupsResult, for each group id we have two round-trips to get the coordinator, and then send the delete / describe request; I leave the potential optimization of batching requests for future work. * For ListConsumerGroupOffsetsResult, it is a simple single round-trip and hence the whole map is wrapped as a Future. * ListConsumerGroupsResult is the trickiest one: we would only know how many futures we should wait for after the first listNode returns, and hence I constructed the flattened future in the middle wrapped with the underlying map of futures; also added an iterator API to compensate for the "fail the whole future if any broker returns error" behavior. The iterator future will throw an exception on the failing brokers, while returning the consumer for other succeeded brokers. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Jason Gustafson	fb3a9485a8	MINOR: Disable failing testDescribeConsumerGroupOffsets test case (#4863 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
John Roesler	cc43e77bbb	MINOR: make Sensor#add idempotent (#4853 ) This change makes adding a metric to a sensor idempotent. That is, if the metric is already added to the sensor, the method returns with success. The current behavior is that any attempt to register a second metric with the same name is an error. Testing strategy: There is a new unit test covering this behavior Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Jorge Quilcate Otoya	6a99da87ab	KAFKA-6058: KIP-222; Add Consumer Group operations to Admin API KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-222+-+Add+Consumer+Group+operations+to+Admin+API Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com> Closes #4454 from jeqo/feature/admin-client-describe-consumer-group	7 years ago
Manikumar Reddy O	47918f2d79	KAFKA-6447: Add Delegation Token Operations to KafkaAdminClient (KIP-249) (#4427 ) Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	5e277e5579	KAFKA-4883: handle NullPointerException while parsing login module control flag (#4849 )	7 years ago
Magnus Edenhill	e490a90625	Make [Config]Resource.toString() consistent with existing code (#4845 ) The toString() for ConfigResource was using { } instead of ( ) which is inconsistent with the existing toStrings in the code, while toString for Resource was using a mix of ( and }.	7 years ago
Jason Gustafson	0a8f35b684	KAFKA-6768; Transactional producer may hang in close with pending requests (#4842 ) This patch fixes an edge case in producer shutdown which prevents `close()` from completing due to a pending request which will never be sent due to shutdown initiation. I have added a test case which reproduces the scenario. Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Rajini Sivaram	77ebd32016	KAFKA-6576: Configurable Quota Management (KIP-257) (#4699 ) Enable quota calculation to be customized using a configurable callback. See KIP-257 for details. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	77c79df396	KAFKA-6741: Disable Selector's idle connection timeout in testNetworkThreadTimeRecorded() test (#4824 ) Reviewers: Jason Gustafson <jason@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Rajini Sivaram	9f8c3167eb	KAFKA-4292: Configurable SASL callback handlers (KIP-86) (#2022 ) Implementation of KIP-86. Client, server and login callback handlers have been made configurable for both brokers and clients. Reviewers: Jun Rao <junrao@gmail.com>, Ron Dagostino <rndgstn@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Chia-Ping Tsai	53d4267c59	MINOR: Don’t send the DeleteTopicsRequest for invalid topic names (#4763 ) The invalid topic name is already handled locally so it is unnecessary to send the DeleteTopicsRequest. This PR adds a count to MockClient for testing. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Dhruvil Shah	719a21f7c9	KAFKA-6739; Ignore headers when down-converting from V2 to V0/V1 (#4813 ) Ignore headers when down-converting to V0/V1 since they are not supported. Added a test-case to verify down-conversion sanity in presence of headers. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Manikumar Reddy O	e4c8e3e758	MINOR: Add Timed wait to SslTransportLayerTest.testNetworkThreadTimeRecorded (#4811 ) Avoid test hanging when there is a failure by limiting wait time.	7 years ago
Jason Gustafson	8662a022c4	MINOR: Fix partition loading checks in GroupCoordinator (#4788 ) In the group coordinator, we currently check whether the partition is owned before checking whether it is loading. Since loading is a prerequisite for partition ownership, it means that it is not actually possible to see the COORDINATOR_LOAD_IN_PROGRESS error. The impact is mostly harmless: while loading the group, the client may send unnecessary FindCoordinator requests to rediscover the coordinator. I've fixed the bug and restructured the code to enable testing. In the process of fixing this bug, the following improvements have been made: 1. We now verify valid groupId in all request handlers. 2. Currently if the coordinator is loading when a SyncGroup is received, we'll return NOT_COORDINATOR. I've changed this to return REBALANCE_IN_PROGRESS since the rebalance state will have been lost on coordinator failover. This effectively forces the consumer to rejoin the group, which seems preferable over unnecessarily rediscovering the coordinator. 3. I added a check for the COORDINATOR_LOAD_IN_PROGRESS handler in SyncGroup. Although we do not currently return this error, it seems reasonable that we might want to some day, so it seems better to get the check in now. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
JieFang.He	cb7cf7c5a7	KAFKA-6702: Wrong className in LoggerFactory.getLogger method (#4772 ) Reviewers: Manikumar Reddy, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
huxi	5d5a2ce4bb	KAFKA-6716: Should close the `discardChannel` in MockSelector#completeSend (#4783 )	7 years ago
huxi	9eb32eaad5	KAFKA-6446; KafkaProducer initTransactions() should timeout after max.block.ms (#4563 ) Currently the `initTransactions()` API blocks indefinitely if the broker cannot be reached. This patch changes the behavior to raise a `TimeoutException` after waiting for `max.block.ms`. Reviewers: Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io>	7 years ago
Rajini Sivaram	2307314432	MINOR: Fix encoder config to make DynamicBrokerReconfigurationTest stable (#4764 ) DynamicBrokerReconfigurationTest currently assumes that passwords encoded with one secret will fail with an exception if decoded with another secret and configures an old.secret in setUp. This could potentially cause test failures if a password was incorrectly decoded with the wrong secret, since the test writes passwords encoded with the new secret directly to ZooKeeper. Since old.secret is only used in one test for verifying secret rotation, this config can be moved to that test to avoid transient failures. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Anna Povzner	5c24295d44	Trogdor's ProducerBench does not fail if topics exist (#4673 ) Added configs to ProducerBenchSpec: topicPrefix: name of topics will be of format topicPrefix + topic index. If not provided, default is "produceBenchTopic". partitionsPerTopic: number of partitions per topic. If not provided, default is 1. replicationFactor: replication factor per topic. If not provided, default is 3. The behavior of producer bench is changed such that if some or all topics already exist (with topic names = topicPrefix + topic index), and they have the same number of partitions as requested, the worker uses those topics and does not fail. The producer bench fails if one or more existing topics have a number of partitions that differs from the expected number of partitions. Added unit test for WorkerUtils -- for existing methods and new methods. Fixed bug in MockAdminClient, where createTopics() would over-write existing topic's replication factor and number of partitions while correctly completing the appropriate futures exceptionally with TopicExistsException. Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Guozhang Wang	0f364cd53a	MINOR: Pass a streams config to replace the single state dir (#4714 ) This is a general change and is a prerequisite to allow the streams benchmark test to run with different streams tests. For the streams benchmark itself I will have a separate PR for switching configs. Details: 1. Create a "streams.properties" file under PERSISTENT_ROOT before all the streams tests. For now it will only contain a single config of state.dir pointing to PERSISTENT_ROOT. 2. For all the system test related code, replace the main function parameter of state.dir with propsFilename, then inside the function load the props from the file and apply overrides if necessary. 3. Minor fixes. Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Colin Patrick McCabe	27bb3ccace	MINOR: KafkaFutureImpl#addWaiter should be protected (#4734 ) KafkaFutureImpl#addWaiter should be protected, just like KafkaFuture#addWaiter. As described in KIP-218, whenComplete is the public API, not addWaiter.	7 years ago
Dhruvil Shah	ae31ee63dc	KAFKA-6530: Use actual first offset of message set when rolling log segment (#4660 ) Use the exact first offset of message set when rolling log segment. This is possible to do for message format V2 and beyond without any performance penalty, because we have the first offset stored in the header. This augments the fix made in KAFKA-4451 to avoid using the heuristic for V2 and beyond messages. Added unit tests to simulate cases where segment needs to roll because of overflow in index offsets. Verified that the new segment created in these cases uses the first offset, instead of the heuristic in use previously.	7 years ago
Sandor Murakozi	2afac71566	MINOR: Remove unnecessary null checks (#4708 ) Remove unnecessary null check in StringDeserializer, MockProducerInterceptor and KStreamImpl. Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Dong Lin	d935699486	KAFKA-6640; Improve efficiency of KafkaAdminClient.describeTopics() (#4694 ) Currently in KafkaAdminClient.describeTopics(), for each topic in the request, a complete map of cluster and errors will be constructed for every topic and partition. This unnecessarily increases the complexity of describeTopics() to O(n^2). This patch improves the complexity to O(n). Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Siva Santhalingam	0bb8e66184	KAFKA-6024; Move arg validation in KafkaConsumer ahead of `acquireAndEnsureOpen` (#4617 )	7 years ago
Vitaly Pushkar	b1aa1912f0	KAFKA-4831: Extract WindowedSerde to public APIs (#3307 ) Now that we have augmented WindowSerde with non-arg parameters, extract it out as part of the public APIs so that users who want to I/O windowed streams can use it. This was originally introduced by @vitaly-pushkar. This PR grew into a much larger one, as I found some tech debt and a few bugs while working on it. Here is a summary of the PR: Public API changes (I will propose a KIP after a first round of reviews): Add TimeWindowedSerializer, TimeWindowedDeserializer, SessionWindowedSerializer, SessionWindowedDeserializer into o.a.k.streams.kstream. The serializers would implement an internal WindowedSerializer interface for the serializeBaseKey function used in 3) below. Add WindowedSerdes into o.a.k.streams.kstream. The reason not to add them into o.a.k.clients's Serdes is that it would then need a dependency on streams. Add "default.windowed.key.serde.inner" and "default.windowed.value.serde.inner" into StreamsConfig, used when "default.key.serde" is specified to use time or session windowed serde. Note this requires the serde class, not the type class. Consolidated serde format from multiple classes, including SessionKeySerde.java for session, and WindowStoreUtils for time window, into SessionKeySchema and WindowKeySchema. Bug fix: WindowedStreamPartitioner needs to consider both time window and session window serdes. Removed RocksDBWindowBytesStore etc optimization since after KIP-182 all the serde now happens on metered store, hence this optimization is not worth it. Bug fix: for time window, the serdes used for store and the serdes used for piping (source and sink node) are different: the former needs to append a sequence number but the latter does not. Other minor cleanups: remove unnecessary throws, etc. Authors: Guozhang Wang <wangguoz@gmail.com>, Vitaly Pushkar <vitaly.pushkar@gmail.com> Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bill@confluent.io>, Xi Hu	7 years ago
wushujames	c5ba0da993	MINOR: Fix incorrect references to the max transaction timeout config (#4664 )	7 years ago
Jason Gustafson	925d6a2ef3	MINOR: Skip sending fetches/offset lookups when awaiting the reconnect backoff (#4644 ) Logging can get spammy during the reconnect blackout period because any requests we send to ConsumerNetworkClient will immediately be failed when poll() returns. This patch checks for connection failures prior to sending fetches and offset lookups and skips sending to any failed nodes. Test cases added for both. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Guozhang Wang	e5d6c9a79a	MINOR: Do not start processor for bounce-at-start (#4639 ) Only start it after the broker has been shut down.	7 years ago
Jason Gustafson	8f2c087166	MINOR: Complete inflight requests in order on disconnect (#4642 ) NetworkClient should use FIFO order when completing inflight requests following a disconnect. I've added new unit tests for `InFlightRequests` and `NetworkClient` which verify completion order. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Jason Gustafson	604b93cfde	KAFKA-6606; Ensure consumer awaits auto-commit interval after sending… (#4641 ) We need to reset the auto-commit deadline after sending the offset commit request so that we do not resend it while the request is still inflight. Added unit tests ensuring this behavior and proper backoff in the case of a failure. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Jason Gustafson	6cfcc9d553	KAFKA-6593; Fix livelock with consumer heartbeat thread in commitSync (#4625 ) Contention for the lock in ConsumerNetworkClient can lead to a livelock situation in which an active commitSync is unable to make progress because its completion is blocked in the heartbeat thread. The fix is twofold: 1) We change ConsumerNetworkClient to use a fair lock to reduce the chance of each thread getting starved. 2) We eliminate the dependence on the lock in ConsumerNetworkClient for callback completion so that callbacks will not be blocked by an active poll(). Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Thomas Leplus	031f522a2d	MINOR: Fix javadoc typo in Headers (#4627 )	7 years ago
Guozhang Wang	97ad549d56	KAFKA-6534: Enforce a rebalance in the next poll call when encounter task migration (#4544 ) The fix is twofold: For tasks that are closed in closeZombieTask, their corresponding partitions are still in runningByPartition so those closed tasks may still be returned in activeTasks and standbyTasks. Adding guards on the returned tasks and, if they are closed, notifying the thread to trigger a rebalance immediately. When triggering a rebalance, un-subscribe and re-subscribe immediately to make sure we are not dependent on the background heartbeat thread timing. Some minor changes on log4j. More specifically, I moved the log entry of closeZombieTask to its callers with more context information and the action to be taken. I can reproduce the issue with EosIntegrationTest by hand-coding the heartbeat thread to GC, and confirmed this patch fixed the issue. Unfortunately this test cannot be added to AK since currently we do not have ways to manipulate the heartbeat thread in unit tests. Reviewers: Jason Gustafson <jason@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Matthias J. Sax	5df535e8a3	MINOR: fixes lgtm.com warnings (#4582 ) fixes lgtm.com warnings cleanup PrintForeachAction and Printed Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Sebastian Bauersfeld <sebastianbauersfeld@gmx.de>, Damian Guy <damian@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Igor Kostiakov	99d650c2c8	KAFKA-6590; Fix bug in aggregation of consumer fetch bytes and counts metrics (#4278 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Daniel Shuy	4a027a0af1	Fix typo in MockProducer JavaDoc (#4606 )	7 years ago
Colin Patrick McCabe	66039b1312	MINOR: Fix ConcurrentModificationException in TransactionManager (#4608 )	7 years ago
Jason Gustafson	1d8ed875db	MINOR: Fix javadoc for consumer offsets lookup APIs which do not block indefinitely (#4613 ) The blocking time for these APIs is bounded by the request timeout.	7 years ago
Jason Gustafson	660c0c0aa3	KAFKA-6238; Fix inter-broker protocol message format compatibility check This patch fixes a bug in the validation of the inter-broker protocol and the message format version. We should allow the configured message format api version to be greater than the inter-broker protocol api version as long as the actual message format versions are equal. For example, if the message format version is set to 1.0, it is fine for the inter-broker protocol version to be 0.11.0 because they both use message format v2. I have added a unit test which checks compatibility for all combinations of the message format version and the inter-broker protocol version. Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #4583 from hachikuji/KAFKA-6328-REOPENED	7 years ago
Manikumar Reddy O	ac2536e77e	KAFKA-5624; Add expiry check to sensor.add() methods (#4404 )	7 years ago
Jason Gustafson	1547cf6de8	KAFKA-6554; Missing lastOffsetDelta validation before log append (#4585 ) Add validation checks that the offset range is valid and aligned with the batch count prior to appending to the log. Several unit tests have been added to verify the various invalid cases.	7 years ago
ying-zheng	13caded15e	KAFKA-6430: Add buffer for gzip streams (#4537 ) As described in the JIRA ticket, this can double throughput.	7 years ago


1281 Commits (530d951cbd47e7d27d5e687881f3dffe8b880377)