ACL updates currently get `(currentAcls, currentVersion)` for the resource from ZK and do a conditional update using `(currentAcls+newAcl, currentVersion)`. This supports concurrent atomic updates if the resource path already exists in ZK. If the path doesn't exist, we currently do a conditional createOrUpdate using `(newAcl, -1)`. But `-1` has a special meaning in ZooKeeper for update operations - it means match any version. So two brokers adding acls using `(newAcl1, -1)` and `(newAcl2, -1)` will result in one broker creating the path and setting newAcl1, while the other broker can potentially update the path with `(newAcl2, -1)`, losing newAcl1. The timing window is very small, but we have seen intermittent failures in `SimpleAclAuthorizerTest.testHighConcurrencyModificationOfResourceAcls` as a result of this window.
This commit fixes the version used for conditional updates in ZooKeeper. It also replaces the confusing `ZkVersion.NoVersion=-1` used for `set(any-version)` and `get(return not-found)` with `ZkVersion.MatchAnyVersion` for `set(any-version)` and `ZkVersion.UnknownVersion` for `get(return not-found)` to avoid the return value from `get` matching arbitrary values in `set`.
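A minimal sketch of the race and the fix, using the raw ZooKeeper Java client rather than Kafka's actual `KafkaZkClient` (the helper shapes are illustrative):

```java
import org.apache.zookeeper.CreateMode;
import org.apache.zookeeper.KeeperException;
import org.apache.zookeeper.ZooDefs;
import org.apache.zookeeper.ZooKeeper;

public class ConditionalAclUpdate {
    // Unsafe: -1 means "match any version" for setData, so this can silently
    // overwrite a node that another broker created a moment earlier.
    static void updateUnsafe(ZooKeeper zk, String path, byte[] acls) throws Exception {
        try {
            zk.setData(path, acls, -1); // lost-update window
        } catch (KeeperException.NoNodeException e) {
            // OPEN_ACL_UNSAFE is the znode ACL, unrelated to the Kafka ACL payload
            zk.create(path, acls, ZooDefs.Ids.OPEN_ACL_UNSAFE, CreateMode.PERSISTENT);
        }
    }

    // Safer: create first; if the node already exists, re-read its real version
    // and do a conditional update, so a concurrent writer surfaces as a
    // BadVersionException (which the caller can handle by re-reading and
    // retrying) instead of a silent overwrite. A real implementation would
    // merge the existing ACLs with the new one after re-reading.
    static void updateSafe(ZooKeeper zk, String path, byte[] acls) throws Exception {
        try {
            zk.create(path, acls, ZooDefs.Ids.OPEN_ACL_UNSAFE, CreateMode.PERSISTENT);
        } catch (KeeperException.NodeExistsException e) {
            int version = zk.exists(path, false).getVersion();
            zk.setData(path, acls, version); // throws BadVersionException on conflict
        }
    }
}
```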
Added a system test which creates a file sink with json converter and attempts to feed it bad records. The bad records should land in the DLQ if it is enabled, and the task should be killed or bad records skipped based on test parameters.
Signed-off-by: Arjun Satish <arjun@confluent.io>
Author: Arjun Satish <arjun@confluent.io>
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5456 from wicknicks/error-handling-sys-test
In AK's documentation, the config props for connectors are not listed (https://kafka.apache.org/documentation/#connectconfigs). This PR adds these sink and source connector configs to the html site-docs.
Signed-off-by: Arjun Satish <arjun@confluent.io>
Author: Arjun Satish <arjun@confluent.io>
Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5469 from wicknicks/add-connector-configs-to-docs
If a property requires validation, it should be pretransformed if it is a variable reference, in order to have a value that will properly pass the validation.
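A hedged sketch of the idea; the variable-reference pattern and helper are illustrative, not Connect's actual config-transformer code:

```java
import java.util.Map;
import java.util.regex.Pattern;

public class PreTransformExample {
    // Matches externalized references such as ${file:/path/secrets:key}
    private static final Pattern VARIABLE = Pattern.compile("\\$\\{[^}]+\\}");

    // Validators (enum, range, class-exists, ...) should see the resolved
    // value; the raw "${...}" literal would fail them spuriously.
    static String preTransformIfVariable(String key, String raw,
                                         Map<String, String> resolved) {
        if (raw != null && VARIABLE.matcher(raw).matches())
            return resolved.getOrDefault(key, raw); // substitute before validating
        return raw;
    }
}
```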
Author: Robert Yokota <rayokota@gmail.com>
Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5445 from rayokota/KAFKA-7225-pretransform-validated-props
The original way of stopping the minikdc process sometimes misfires because the process arg string is very long, and `ps` is not able to find the correct process. Using the `kill_java_processes` method is more reliable for finding and killing java processes.
Some sections of the Monitoring metrics documentation list out the -total metrics, and some sections do not list them out. Make them consistent and list out the missing -total metrics.
Summary:
1. Revert GroupMetadata.members to private
2. Add back a wrongly removed comment
3. In GroupMetadata.remove(), update supportedProtocols and awaitingJoinCallbackMembers only when the removal succeeds
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Sriharsha Chintalapani <sriharsha@apache.org>
We currently do a lot of bookkeeping for timeouts which is both error-prone and distracting. This patch adds a new `Timer` class to simplify this logic and control unnecessary calls to system time. In particular, this helps with nested timeout operations. The consumer has been updated to use the new class.
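A minimal sketch of the idea behind such a class (the class added here lives in `org.apache.kafka.common.utils` and is richer than this):

```java
import java.util.concurrent.TimeUnit;

public class Timer {
    private final long deadlineNanos;
    private long currentNanos;

    public Timer(long timeoutMs) {
        this.currentNanos = System.nanoTime();
        this.deadlineNanos = currentNanos + TimeUnit.MILLISECONDS.toNanos(timeoutMs);
    }

    // Refresh the cached clock once per loop iteration instead of calling
    // System.nanoTime() at every expiry check.
    public void update() {
        currentNanos = System.nanoTime();
    }

    public boolean isExpired() {
        return currentNanos >= deadlineNanos;
    }

    // Nested blocking calls receive remainingMs() so the overall deadline is
    // shared rather than each call re-deriving its own timeout.
    public long remainingMs() {
        return Math.max(0, TimeUnit.NANOSECONDS.toMillis(deadlineNanos - currentNanos));
    }
}
```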
Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>
Currently Fetcher.getTopicMetadata() will not include offline partitions. Thus
KafkaConsumer.partitionsFor(topic) will not return all partitions of a topic if
any partition of the topic is offline. This causes problems if the user tries
to query the total number of partitions of the given topic.
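For example, a caller counting partitions (hypothetical helper; with this fix the returned list also includes partitions that currently have no live leader):

```java
import java.util.List;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.common.PartitionInfo;

public class PartitionCount {
    // Previously, offline partitions were omitted from partitionsFor(), so
    // this count could come up short while any partition had no leader.
    static int totalPartitions(Consumer<?, ?> consumer, String topic) {
        List<PartitionInfo> partitions = consumer.partitionsFor(topic);
        return partitions == null ? 0 : partitions.size();
    }
}
```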
Author: radai-rosenblatt <radai.rosenblatt@gmail.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
Closes #4679 from radai-rosenblatt/partition_shenanigans
1. In each iteration, decide that a task is processable only if all of its partitions contain data, so we can decide which record to process next (see the sketch after this list).
1.a Add one exception: if a task has data on some but not all of its partitions, we consider it not processable only for a finite number of iterations.
1.b Add a task-level metric that records whenever we are forced to process a task whose data is only partially available, since that may lead to non-determinism.
2. Break the main loop into put-raw-data and process-them phases, since not all data put into the queues will now be processed completely within a single iteration.
3. NOTE that within an iteration, a task that has exhausted one of its queues will still be processed, since we only update the processable list once per iteration; I'm improving on this in the follow-up part III PR.
4. Found and fixed a bug in metrics recording: the taskName and sensorName parameters were swapped.
5. Optimized task stream time computation again, since our partition stream time reasoning has been simplified.
6. Added unit tests.
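A hedged sketch of the processability rule from items 1 and 1.a/1.b; the round bound and the sensor hook are illustrative, not Streams' actual internals:

```java
import java.util.Collection;

public class ProcessabilityCheck {
    private static final int MAX_PARTIAL_ROUNDS = 5; // illustrative bound
    private int partialRounds = 0;

    // A task is processable when every one of its partition queues has
    // buffered data. If only some queues have data, hold off for a bounded
    // number of rounds, then force processing and record a metric, since
    // forced processing may pick records non-deterministically.
    boolean isProcessable(Collection<Integer> queueSizes) {
        boolean allHaveData = queueSizes.stream().allMatch(size -> size > 0);
        boolean someHaveData = queueSizes.stream().anyMatch(size -> size > 0);
        if (allHaveData) {
            partialRounds = 0;
            return true;
        }
        if (someHaveData && ++partialRounds > MAX_PARTIAL_ROUNDS) {
            partialRounds = 0;
            // enforcedProcessingSensor.record(); // hypothetical metric hook
            return true;
        }
        return false;
    }
}
```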
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Bill Bejeck <bbejeck@gmail.com>
The DLQ reporter does not get an `errorHandlingMetrics` object when created by the worker. This results in an NPE.
Signed-off-by: Arjun Satish <arjun@confluent.io>
Author: Arjun Satish <arjun@confluent.io>
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5440 from wicknicks/KAFKA-7228
Fixed incorrect use of default timeout instead of the argument explicitly passed to `newClientRequest`.
Reviewers: Ron Dagostino <rndgstn@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Co-authored-by: Mickael Maison <mickael.maison@gmail.com>
Co-authored-by: Simon Clark <simonc6r@gmail.com>
Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>
The specific changes in this PR from the second PR include:
1. Changed the types of graph nodes to names conveying more context
2. Built the entire physical plan from the graph after StreamsBuilder.build() is called.
Other changes are addressed directly as review comments on the PR.
Testing consists of using all existing streams tests to validate building the physical plan with the graph.
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>
Use delivery timeout instead of retries when possible and remove various TODOs associated with completion of KIP-91.
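For example, with KIP-91 the total time a record may spend in the producer is bounded by a single setting (values illustrative):

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.ProducerConfig;

public class DeliveryTimeoutConfig {
    static Properties producerProps() {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        // One bound on the whole send (batching, awaiting acks, and retries)
        // instead of reasoning about a retry count:
        props.put(ProducerConfig.DELIVERY_TIMEOUT_MS_CONFIG, 120000);
        return props;
    }
}
```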
Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>
* new minimum is 0, just like window size
* refactor tests to use smaller segment sizes as well
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Fixed by waiting until server1 has joined the ISR before shutting down server2.
Reran the test method many times after the code change, and there is no flakiness any more.
Author: Lucas Wang <luwang@linkedin.com>
Reviewers: Mayuresh Gharat <gharatmayuresh15@gmail.com>, Dong Lin <lindong28@gmail.com>
Closes #5387 from gitlw/fixing_flacky_logrecevorytest
1. When we reinitialize the state store because there is no CHECKPOINT with EOS turned on, we should update the checkpoint to the offset obtained from consumer.seekToBeginning() / consumer.position() to avoid falling into endless iterations (see the sketch after this list).
2. Fixed a few other logic bugs around needsInitializing and needsRestoring.
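A hedged sketch of the fix in item 1; the helper is hypothetical, not the actual restoration code in Streams:

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.common.TopicPartition;

public class ReinitializeCheckpoint {
    // After wiping the store, record the restore consumer's actual starting
    // positions so the next restore attempt does not see "no checkpoint"
    // again and loop forever.
    static Map<TopicPartition, Long> checkpointAtBeginning(
            Consumer<byte[], byte[]> restoreConsumer,
            Collection<TopicPartition> partitions) {
        restoreConsumer.seekToBeginning(partitions);
        Map<TopicPartition, Long> checkpoint = new HashMap<>();
        for (TopicPartition tp : partitions)
            checkpoint.put(tp, restoreConsumer.position(tp));
        return checkpoint;
    }
}
```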
Reviewers: Jason Gustafson <jason@confluent.io>, Bill Bejeck <bbejeck@gmail.com>
KAFKA-6432: Make index lookup more cache friendly
For each topic-partition, Kafka broker maintains two indices: one for message offset, one for message timestamp. By default, a new index entry is appended to each index for every 4KB messages. The lookup of the indices is a simple binary search. The indices are mmaped files, and cached by Linux page cache.
Both consumer fetch and follower fetch have to do an offset lookup, before accessing the actual message data. The simple binary search algorithm used for looking up the index is not cache friendly, and may cause page faults even on high QPS topic-partitions.
In a normal Kafka broker, all the follower fetch requests, and most consumer fetch requests should only look up the last few entries of the index. We can make the index lookup more cache friendly, by searching in the last one or two pages of the index first.
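A sketch of the "warm section first" idea on a plain sorted array; Kafka's actual change is in the Scala `AbstractIndex` over an mmapped buffer:

```java
public class WarmIndexSearch {
    // offsets is sorted ascending; warmEntries is roughly one page worth of
    // index entries at the tail. Assumes target >= offsets[0].
    static int lookup(long[] offsets, long target, int warmEntries) {
        int begin = Math.max(0, offsets.length - warmEntries);
        // Most lookups (follower fetches, tail consumers) hit the log end, so
        // first search only the last page, which stays hot in the page cache.
        if (target >= offsets[begin])
            return binarySearch(offsets, target, begin, offsets.length - 1);
        // Cold path: fall back to a binary search over the rest.
        return binarySearch(offsets, target, 0, begin);
    }

    // Returns the largest index in [lo, hi] whose entry is <= target.
    static int binarySearch(long[] offsets, long target, int lo, int hi) {
        while (lo < hi) {
            int mid = (lo + hi + 1) >>> 1;
            if (offsets[mid] <= target)
                lo = mid;
            else
                hi = mid - 1;
        }
        return lo;
    }
}
```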
Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com>, Ted Yu <yuzhihong@gmail.com>, Ismael Juma <github@juma.me.uk>, Sriharsha Chintalapani <sriharsha@apache.org>
1. As titled and as described in comments.
2. Modified unit test slightly to insert new keys in committed data to expose this issue.
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>
This patch forces metadata update for consumers with pattern subscription at the beginning of rebalance (retry.backoff.ms is respected). This is to prevent such consumers from detecting subscription changes (e.g., new topic creation) independently and triggering multiple unnecessary rebalances. KAFKA-7126 contains detailed scenarios and rationale.
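An illustrative sketch of the gating logic; the names are hypothetical, as the real change is internal to `ConsumerCoordinator`:

```java
public class RebalanceMetadataRefresh {
    private final long retryBackoffMs;
    private long lastRefreshMs = Long.MIN_VALUE;

    public RebalanceMetadataRefresh(long retryBackoffMs) {
        this.retryBackoffMs = retryBackoffMs;
    }

    // At the start of a rebalance, refresh metadata so all members of a
    // pattern-subscribed group agree on the topic set, instead of each member
    // noticing a new topic on its own and triggering another rebalance.
    public boolean shouldRefresh(boolean hasPatternSubscription, long nowMs) {
        if (!hasPatternSubscription)
            return false;
        if (nowMs - lastRefreshMs < retryBackoffMs) // respect retry.backoff.ms
            return false;
        lastRefreshMs = nowMs;
        return true;
    }
}
```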
Author: Jon Lee <jonlee@linkedin.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Dong Lin <lindong28@gmail.com>
Closes #5408 from jonlee2/KAFKA-7126
This PR just removes the check in TaskPairs.hasNewPair that was causing the task assignment issue.
This was done because we need to further refine the task assignment strategy; that refinement needs to account for the statefulness of tasks and is best done in one pass rather than with a "patchy" approach.
Updated current tests and ran them locally.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
An untimely wakeup can cause ConsumerCoordinator.onJoinComplete to throw a WakeupException before completion. On the next poll(), it will be retried, but this leads to an underflow error because the buffer containing the assignment data will already have been advanced. The solution is to duplicate the buffer passed to onJoinComplete.
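A sketch of the fix; the parsing is elided and the method names are illustrative (the real code is in `ConsumerCoordinator.onJoinComplete`):

```java
import java.nio.ByteBuffer;

public class AssignmentBufferExample {
    static void invokeOnJoinComplete(ByteBuffer memberAssignment) {
        // duplicate() shares the underlying bytes but has an independent
        // position, so a read that is interrupted by a WakeupException and
        // retried on the next poll() starts from the beginning again.
        ByteBuffer copy = memberAssignment.duplicate();
        deserializeAssignment(copy);
    }

    static void deserializeAssignment(ByteBuffer buffer) {
        short version = buffer.getShort(); // advances only the duplicate
        // ... remaining assignment parsing elided
    }
}
```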
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
ZooKeeper client from version 3.4.13 doesn't handle connections to localhost very well. If ZooKeeper is started on 127.0.0.1 on a machine that has both ipv4 and ipv6 and a client is created using localhost rather than the IP address in the connection string, ZooKeeper client attempts to connect to ipv4 or ipv6 randomly with a fixed one second backoff if connection fails. Use 127.0.0.1 instead of localhost in streams tests to avoid intermittent test failures due to ZK client connection timeouts if ipv6 is chosen in consecutive address selections. Also add note to upgrade docs for 2.0.0.
Reviewers: Ismael Juma <github@juma.me.uk>, Matthias J. Sax <matthias@confluent.io>
This has always been an issue, but the recent upgrade to ZooKeeper
3.4.13 means it is also an issue when an unresolvable ZK
address is used, causing some tests to leak threads.
The change in behaviour in ZK 3.4.13 is that no exception is thrown
from the ZooKeeper constructor in case of an unresolvable address.
Instead, ZooKeeper tries to re-resolve the address hoping it becomes
resolvable again. We eventually throw a
`ZooKeeperClientTimeoutException`, which is similar to the case
where the address is resolvable but ZooKeeper is not
reachable.
Reviewers: Ismael Juma <ismael@juma.me.uk>