Summary:
1. Revert GroupMetadata.members to private
2. Add back a wrongly removed comment
3. In GroupMetadata.remove(), update supportedProtocols and awaitingJoinCallbackMembers only when the remove succeeded
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Sriharsha Chintalapani <sriharsha@apache.org>
We currently do a lot of bookkeeping for timeouts which is both error-prone and distracting. This patch adds a new `Timer` class to simplify this logic and control unnecessary calls to system time. In particular, this helps with nested timeout operations. The consumer has been updated to use the new class.
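A minimal sketch of the idea, with illustrative names rather than the actual Kafka API: the timer caches one clock reading per update so nested timeout-bounded operations can share it instead of each calling system time.

```java
import java.util.function.LongSupplier;

// Illustrative sketch only: cache the current time so nested
// timeout-bounded operations share one clock reading, and hit the
// system clock only when update() is called.
public class SimpleTimer {
    private final LongSupplier clock; // e.g. System::currentTimeMillis
    private long currentTimeMs;
    private final long deadlineMs;

    public SimpleTimer(LongSupplier clock, long timeoutMs) {
        this.clock = clock;
        this.currentTimeMs = clock.getAsLong();
        this.deadlineMs = currentTimeMs + timeoutMs;
    }

    // Refresh the cached time; call once per loop iteration.
    public void update() {
        currentTimeMs = clock.getAsLong();
    }

    public boolean isExpired() {
        return currentTimeMs >= deadlineMs;
    }

    // Remaining time, handy to bound a nested blocking call.
    public long remainingMs() {
        return Math.max(0, deadlineMs - currentTimeMs);
    }
}
```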
Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>
Currently Fetcher.getTopicMetadata() will not include offline partitions. Thus
KafkaConsumer.partitionsFor(topic) will not return all partitions of a topic
if any partition of the topic is offline. This causes problems if a user
tries to query the total number of partitions of the given topic.
Author: radai-rosenblatt <radai.rosenblatt@gmail.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
Closes#4679 from radai-rosenblatt/partition_shenanigans
1. In each iteration, decide whether a task is processable based on whether all of its partitions contain data, so it can decide which record to process next.
1.a Add one exception: if the task has data on some but not all of its partitions, we only consider it not processable for a finite number of iterations (see the sketch after this list).
1.b Add a task-level metric to record whenever we are forced to process a task whose data is only partially available, since this may lead to non-determinism.
2. Break the main loop into put-raw-data and process-them phases, since now not all data put into the queue will be processed completely within a single iteration.
3. NOTE that within an iteration, if a task has exhausted one of its queues it will still be processed, since we only update the processable list once per iteration; I'm improving on this in the follow-up part III PR.
4. Found and fixed a bug in metrics recording: the taskName and sensorName parameters were swapped.
5. Optimized task stream time computation again since our current partition stream time reasoning has been simplified.
6. Added unit tests.
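An illustrative sketch of the rule in items 1 and 1.a (hypothetical names and bound, not the actual Streams code): a task is processable when every input partition has buffered data, except that after a bounded number of partial-data rounds it is force-enabled, which is where the metric from 1.b would be recorded.

```java
import java.util.Collection;
import java.util.Queue;

class ProcessableCheck {
    private static final int MAX_PARTIAL_ROUNDS = 5; // assumed bound, not the real value

    private int partialRounds = 0;

    boolean isProcessable(Collection<Queue<?>> partitionQueues) {
        long nonEmpty = partitionQueues.stream().filter(q -> !q.isEmpty()).count();
        if (nonEmpty == partitionQueues.size()) {
            partialRounds = 0;
            return true;                    // data buffered on all partitions
        }
        if (nonEmpty > 0 && ++partialRounds > MAX_PARTIAL_ROUNDS) {
            partialRounds = 0;
            // item 1.b: record the enforced-processing metric here, since
            // processing with partial data may lead to non-determinism
            return true;
        }
        return false;                       // wait for the remaining partitions
    }
}
```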
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Bill Bejeck <bbejeck@gmail.com>
DLQ reporter does not get an `errorHandlingMetrics` object when created by the worker. This results in an NPE.
Signed-off-by: Arjun Satish <arjun@confluent.io>
Author: Arjun Satish <arjun@confluent.io>
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes#5440 from wicknicks/KAFKA-7228
Fixed incorrect use of default timeout instead of the argument explicitly passed to `newClientRequest`.
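A minimal illustration of the bug pattern (all names hypothetical, not the actual `NetworkClient` code): a stored default silently wins over the argument the caller passed.

```java
// Hypothetical names; illustrates the bug pattern only.
class RequestFactory {
    private final int defaultRequestTimeoutMs = 30_000;

    ClientRequest newClientRequest(String destination, int requestTimeoutMs) {
        // Before the fix: new ClientRequest(destination, defaultRequestTimeoutMs)
        return new ClientRequest(destination, requestTimeoutMs);
    }

    static final class ClientRequest {
        final String destination;
        final int timeoutMs;

        ClientRequest(String destination, int timeoutMs) {
            this.destination = destination;
            this.timeoutMs = timeoutMs;
        }
    }
}
```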
Reviewers: Ron Dagostino <rndgstn@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Co-authored-by: Mickael Maison <mickael.maison@gmail.com>
Co-authored-by: Simon Clark <simonc6r@gmail.com>
Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>
The specific changes in this PR from the second PR include:
1. Changed the types of graph nodes to names conveying more context
2. Built the entire physical plan from the graph after StreamsBuilder.build() is called.
Other changes are addressed directly as review comments on the PR.
Testing consists of using all existing streams tests to validate building the physical plan with the graph.
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>
Use delivery timeout instead of retries when possible and remove various TODOs associated with completion of KIP-91.
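For reference, a sketch of the producer settings involved; `delivery.timeout.ms` is the KIP-91 config, and the values below are illustrative rather than recommendations.

```java
import java.util.Properties;

public class ProducerConfigExample {
    public static void main(String[] args) {
        // With KIP-91, the time to report success or failure of a send is
        // bounded by delivery.timeout.ms, so callers no longer need to
        // reason in terms of a retries count.
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        props.put("delivery.timeout.ms", "120000");                // overall bound on a send
        props.put("retries", Integer.toString(Integer.MAX_VALUE)); // bounded by the timeout instead
        System.out.println(props);
    }
}
```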
Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>
* new minimum is 0, just like window size
* refactor tests to use smaller segment sizes as well
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Fixed by waiting until server1 has joined the ISR before shutting down server2.
Reran the test method many times after the code change, and there is no flakiness anymore.
Author: Lucas Wang <luwang@linkedin.com>
Reviewers: Mayuresh Gharat <gharatmayuresh15@gmail.com>, Dong Lin <lindong28@gmail.com>
Closes#5387 from gitlw/fixing_flacky_logrecevorytest
1. When we reinitialize the state store due to no CHECKPOINT with EOS turned on, we should update the checkpoint to consumer.seekToBeginning() / consumer.position() to avoid falling into endless iterations (see the sketch after this list).
2. Fixed a few other logic bugs around needsInitializing and needsRestoring.
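A sketch of the idea in item 1 (a hypothetical helper, not the actual restore code): after wiping the store under EOS, seek to the beginning and record the resulting positions as the new checkpoint, so the next pass no longer sees a missing checkpoint.

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.common.TopicPartition;

class CheckpointOnReinit {
    // Hypothetical helper: checkpoint at the log start after a wipe so we
    // do not fall into an endless reinitialize/restore loop.
    static Map<TopicPartition, Long> checkpointAtLogStart(
            Consumer<byte[], byte[]> restoreConsumer,
            Collection<TopicPartition> partitions) {
        restoreConsumer.seekToBeginning(partitions);
        Map<TopicPartition, Long> checkpoints = new HashMap<>();
        for (TopicPartition tp : partitions)
            checkpoints.put(tp, restoreConsumer.position(tp)); // position() resolves the seek
        return checkpoints;
    }
}
```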
Reviewers: Jason Gustafson <jason@confluent.io>, Bill Bejeck <bbejeck@gmail.com>
KAFKA-6432: Make index lookup more cache friendly
For each topic-partition, the Kafka broker maintains two indices: one for message offset, one for message timestamp. By default, a new index entry is appended to each index for every 4KB of messages. The lookup of the indices is a simple binary search. The indices are mmaped files, cached by the Linux page cache.
Both consumer fetch and follower fetch have to do an offset lookup, before accessing the actual message data. The simple binary search algorithm used for looking up the index is not cache friendly, and may cause page faults even on high QPS topic-partitions.
In a normal Kafka broker, all the follower fetch requests, and most consumer fetch requests should only look up the last few entries of the index. We can make the index lookup more cache friendly, by searching in the last one or two pages of the index first.
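A simplified sketch of the strategy, assuming a plain int array stands in for the mmaped fixed-size index entries: binary-search the warm tail pages first, and fall back to a full search only for cold lookups.

```java
import java.util.Arrays;

class CacheFriendlyIndex {
    // A 4KB page of 8-byte entries would hold 512; illustrative value.
    private static final int WARM_ENTRIES = 512;

    // Returns the index of the first entry >= target.
    static int lookup(int[] index, int target) {
        int warmStart = Math.max(0, index.length - WARM_ENTRIES);
        if (index.length > 0 && target >= index[warmStart]) {
            // Hot path: most fetches are near the tail, so only the last
            // page or two of the index are touched and stay in page cache.
            return search(index, warmStart, index.length, target);
        }
        // Cold path: full binary search over the whole index.
        return search(index, 0, index.length, target);
    }

    private static int search(int[] index, int from, int to, int target) {
        int pos = Arrays.binarySearch(index, from, to, target);
        return pos >= 0 ? pos : -pos - 1; // insertion point when not found
    }
}
```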
Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com>, Ted Yu <yuzhihong@gmail.com>, Ismael Juma <github@juma.me.uk>, Sriharsha Chintalapani <sriharsha@apache.org>
1. As titled and as described in comments.
2. Modified the unit test slightly to insert records for new keys in the committed data to expose this issue.
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>
This patch forces metadata update for consumers with pattern subscription at the beginning of rebalance (retry.backoff.ms is respected). This is to prevent such consumers from detecting subscription changes (e.g., new topic creation) independently and triggering multiple unnecessary rebalances. KAFKA-7126 contains detailed scenarios and rationale.
Author: Jon Lee <jonlee@linkedin.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Dong Lin <lindong28@gmail.com>
Closes#5408 from jonlee2/KAFKA-7126
This PR now just removes the check in TaskPairs.hasNewPair that was causing the task assignment issue.
This was done because we need to further refine the task assignment strategy; that refinement needs to account for the statefulness of tasks and is best done in one pass rather than via a "patchy" approach.
Updated current tests and ran them locally.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
An untimely wakeup can cause ConsumerCoordinator.onJoinComplete to throw a WakeupException before completion. On the next poll(), it will be retried, but this leads to an underflow error because the buffer containing the assignment data will already have been advanced. The solution is to duplicate the buffer passed to onJoinComplete.
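The fix amounts to handing the callee a duplicate of the buffer, so a retried call starts again from the original position; a minimal illustration:

```java
import java.nio.ByteBuffer;

public class DuplicateBufferExample {
    public static void main(String[] args) {
        ByteBuffer assignment = ByteBuffer.wrap(new byte[] {0, 1, 2, 3});

        // The callee advances only its duplicate's position; if an
        // exception forces a retry, the original is still at position 0
        // instead of underflowing on the second read.
        consume(assignment.duplicate());
        consume(assignment.duplicate()); // safe retry

        System.out.println("original position: " + assignment.position()); // 0
    }

    private static void consume(ByteBuffer buf) {
        while (buf.hasRemaining())
            buf.get();
    }
}
```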
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
ZooKeeper client from version 3.4.13 doesn't handle connections to localhost very well. If ZooKeeper is started on 127.0.0.1 on a machine that has both ipv4 and ipv6 and a client is created using localhost rather than the IP address in the connection string, ZooKeeper client attempts to connect to ipv4 or ipv6 randomly with a fixed one second backoff if connection fails. Use 127.0.0.1 instead of localhost in streams tests to avoid intermittent test failures due to ZK client connection timeouts if ipv6 is chosen in consecutive address selections. Also add note to upgrade docs for 2.0.0.
Reviewers: Ismael Juma <github@juma.me.uk>, Matthias J. Sax <matthias@confluent.io>
This has always been an issue, but the recent upgrade to ZooKeeper
3.4.13 means it is also an issue when an unresolvable ZK
address is used, causing some tests to leak threads.
The change in behaviour in ZK 3.4.13 is that no exception is thrown
from the ZooKeeper constructor in case of an unresolvable address.
Instead, ZooKeeper tries to re-resolve the address hoping it becomes
resolvable again. We eventually throw a
`ZooKeeperClientTimeoutException`, which is similar to the case
where the address is resolvable but ZooKeeper is not
reachable.
Reviewers: Ismael Juma <ismael@juma.me.uk>
When there are many inactive partitions in the cluster, we observed constant churn of URPs in the cluster even if followers could catch up with the leader's byte-in-rate, because the leader broker frequently moves replicas of inactive partitions out of the ISR. This PR mitigates the issue by not moving a replica out of the ISR if the follower's LEO == the leader's LEO.
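An illustrative version of the condition (hypothetical names; the broker code itself is Scala): a follower whose log end offset equals the leader's is never considered out of sync, no matter how long ago it last fetched, which is exactly the situation on inactive partitions.

```java
class IsrCheck {
    // Hypothetical sketch of the shrink check described above.
    static boolean isOutOfSync(long followerLeo, long leaderLeo,
                               long lastCaughtUpTimeMs, long nowMs, long maxLagMs) {
        if (followerLeo == leaderLeo)
            return false; // fully caught up: nothing left to replicate, keep in ISR
        return nowMs - lastCaughtUpTimeMs > maxLagMs;
    }
}
```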
Author: Zhanxiang (Patrick) Huang <hzxa21@hotmail.com>
Reviewers: Dong Lin <lindong28@gmail.com>
Closes#5412 from hzxa21/KAFKA-7152
After successful completion of KafkaProducer#close, it is possible that an application calls KafkaProducer#send. If the send is invoked for a topic for which we do not have any metadata, the producer will block until `max.block.ms` elapses - we do not expect to receive any metadata update in this case because Sender (and NetworkClient) has already exited. It is only when RecordAccumulator#append is invoked that we notice that the producer has already been closed and throw an exception. If `max.block.ms` is set to Long.MaxValue (or a sufficiently high value in general), the producer could block awaiting metadata indefinitely.
This patch makes sure `Metadata#awaitUpdate` periodically checks if the network client has been closed, and if so bails out as soon as possible.
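A simplified sketch of that pattern, with hypothetical names rather than the actual `Metadata` code: wait in bounded slices and re-check a closed flag on every wakeup, so close() can interrupt the wait promptly.

```java
class AwaitUpdateSketch {
    private final Object lock = new Object();
    private boolean closed = false;
    private int version = 0;

    void awaitUpdate(int lastVersion, long maxWaitMs) throws InterruptedException {
        long deadlineMs = System.currentTimeMillis() + maxWaitMs;
        synchronized (lock) {
            while (version <= lastVersion) {
                if (closed)
                    throw new IllegalStateException("producer already closed");
                long remainingMs = deadlineMs - System.currentTimeMillis();
                if (remainingMs <= 0)
                    throw new IllegalStateException("timed out waiting for metadata");
                lock.wait(Math.min(remainingMs, 100)); // bounded slice, then re-check
            }
        }
    }

    void update() {  // called when a metadata response arrives
        synchronized (lock) { version++; lock.notifyAll(); }
    }

    void close() {   // called from KafkaProducer#close
        synchronized (lock) { closed = true; lock.notifyAll(); }
    }
}
```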
Change KafkaController to use the newly introduced method.
Also remove redundant `InZk` postfixes from `registerBrokerInZk` and
`updateBrokerInfoInZk`.
As `checkedEphemeralCreate` is not used outside of `KafkaZkClient`
any longer, reduce its visibility.
ControllerIntegrationTest already covers this functionality well; it validates the
refactor.
Reviewers: Ismael Juma <ismael@juma.me.uk>
The log line says `ms`, but the actual value could represent a
different time unit depending on what the user provided.
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Updated the 2.0 document for changed quota behaviors.
Author: Jon Lee <jonlee@linkedin.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Dong Lin <lindong28@gmail.com>
Closes#5384 from jonlee2/KAFKA-7177
A recent change from `new FileOutputStream` to `Files.newOutputStream` missed the `CREATE` flag (which is necessary in addition to `APPEND`).
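For reference, a minimal example of the corrected call (the path is illustrative):

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import static java.nio.charset.StandardCharsets.UTF_8;

public class AppendExample {
    public static void main(String[] args) throws IOException {
        Path path = Paths.get("example.log"); // illustrative path

        // APPEND alone fails with NoSuchFileException when the file does
        // not exist; new FileOutputStream(file, true) would have created
        // it, and adding CREATE restores that behavior.
        try (OutputStream out = Files.newOutputStream(
                path, StandardOpenOption.CREATE, StandardOpenOption.APPEND)) {
            out.write("hello\n".getBytes(UTF_8));
        }
    }
}
```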
Reviewers: Ismael Juma <ismael@juma.me.uk>
In system tests, it is useful to have the thread dumps if a broker cannot be stopped using SIGTERM.
Reviewers: Xavier Léauté <xl+github@xvrl.net>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>