src-kafka

Author	SHA1	Message	Date
Lee Dongjin	05cba28ca7	MINOR: A few cleanups and compiler warning fixes (#6986 ) Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
David Arthur	23beeea34b	KAFKA-8443; Broker support for fetch from followers (#6832 ) Follow on to #6731, this PR adds broker-side support for [KIP-392](https://cwiki.apache.org/confluence/display/KAFKA/KIP-392%3A+Allow+consumers+to+fetch+from+closest+replica) (fetch from followers). Changes: * All brokers will handle FetchRequest regardless of leadership * Leaders can compute a preferred replica to return to the client * New ReplicaSelector interface for determining the preferred replica * Incremental fetches will include partitions with no records if the preferred replica has been computed * Adds new JMX to expose the current preferred read replica of a partition in the consumer Two new conditions were added for completing a delayed fetch. They both relate to communicating the high watermark to followers without waiting for a timeout: * For regular fetches, if the high watermark changes within a single fetch request * For incremental fetch sessions, if the follower's high watermark is lower than the leader A new JMX attribute `preferred-read-replica` was added to the `kafka.consumer:type=consumer-fetch-manager-metrics,client-id=some-consumer,topic=my-topic,partition=0` object. This was added to support the new system test which verifies that the fetch from follower behavior works end-to-end. This attribute could also be useful in the future when debugging problems with the consumer. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Colin Patrick McCabe	822abe47db	MINOR: WorkerUtils#topicDescriptions must unwrap exceptions properly (#6937 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	5 years ago
Kamal Chandraprakash	3750898e20	MINOR: Improve group metadata unknown key version exception message (#7006 ) The patch clarifies the exception message for unknown key versions when loading from the group metadata topic. The patch also makes a trivial change in `KafkaAdminClient` to use `Map.computeIfAbsent`. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Colin P. Mccabe	711c817254	KAFKA-8560; The Kafka protocol generator should support common structures Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Gwen Shapira Closes #6966 from cmccabe/KAFKA-8560	5 years ago
Boyang Chen	f8db022b08	KAFKA-8538 (part of KIP-345): add group.instance.id to DescribeGroup (#6957 ) Include group.instance.id in the describe group result for better visibility. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Boyang Chen	fbf6a76fc4	KAFKA-8356: add static membership info to round robin assignor (#6815 ) The purpose here is to leverage static membership information during round robin consumer assignment, because persistent member id could help make the assignment remain the same during rebalance. The comparison logic is changed to: 1. If member A and member B both have group.instance.id, then compare their group.instance.id 2. If member A has group.instance.id, while member B doesn't, then A < B 3. If both member A and B don't have group.instance.id, compare their member.id In round robin assignor, we use ephemeral member.id to sort the members in order for assignment. This semantic is not stable and could trigger unnecessary shuffle of tasks. By leveraging group.instance.id the static member assignment shall be persist when satisfying following conditions: 1. number of members remain the same across generation 2. static members' identities persist across generation Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Mickael Maison	14d854936e	KAFKA-8390: Use automatic RPC generation in CreateDelegationToken (#6828 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Nathan Murthy	55707adaee	MINOR: add unit test for Utils.murmur2 (#5926 )	5 years ago
Guozhang Wang	e5c4ebdd74	KAFKA-8179: Part 2, ConsumerCoordinator Algorithm (#6778 ) 1. In ConsumerCoordinator, select the protocol as the common protocol from all configured assignor instances' supported protocols with the highest number. 1.b. In onJoinPrepare: only call onPartitionRevoked with EAGER. 1.a. In onJoinComplete: call onPartitionAssigned with EAGER; call onPartitionRevoked following onPartitionAssigned with COOPERATIVE, and then request re-join if the error indicates so. 1.c. In performAssignment: update the user's assignor returned assignments by excluding all partitions that are still owned by some other members. 2. I've refactored the Subscription / Assignment such that: assigned partitions, error codes, and group instance id are not-final anymore, instead they can be updated. For the last one, it is directly related to the logic of this PR but I felt it is more convienent to go with other fields. 3. Testing: primarily in ConsumerCoordinatorTest, make it parameterized with protocol, and add necessary scenarios for COOPERATIVE protocol. I intentionally omitted the documentation change since there are some behavioral updates that needs to be finalized in later PRs, and hence I will also only add the docs in later PRs. Reviewers: Bill Bejeck <bbejeck@gmail.com>, Boyang Chen <boyang@confluent.io>, Sophie Blee-Goldman <sophie@confluent.io>	5 years ago
Rajini Sivaram	7cb0a1ef4f	MINOR: Reinstate info-level log for dynamic update of SSL keystores (#6925 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Ismael Juma	11641c7f53	MINOR: Reflection free implementation of `defaultKerberosRealm` (#6978 ) The existing implementation triggers warnings in Java 9+ and relies on internal classes that vary depending on the JDK provider. The proposed implementation fixes these issues and it's more concise. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	3e9d1c1411	KAFKA-8106: Skipping ByteBuffer allocation of key / value / headers in logValidator (#6785 ) * KAFKA-8106:Reducing the allocation and copying of ByteBuffer when logValidator do validation. * KAFKA-8106:Reducing the allocation and copying of ByteBuffer when logValidator do validation. * github comments * use batch.skipKeyValueIterator * cleanups * no need to skip kv for uncompressed iterator * checkstyle fixes * fix findbugs * adding unit tests * reuse decompression buffer; and using streaming iterator * checkstyle * add unit tests * remove reusing buffer supplier * fix unit tests * add unit tests * use streaming iterator * minor refactoring * rename * github comments * github comments * reuse buffer at DefaultRecord caller * some further optimization * major refactoring * further refactoring * update comment * github comments * minor fix * add jmh benchmarks * update jmh * github comments * minor fix * github comments	5 years ago
Karan Kumar	4d1d995a11	KAFKA-8563: Remove redundant `NetworkSend.sizeDelimit()` method (#6967 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Dhruvil Shah	5f8b2898ce	KAFKA-8570; Grow buffer to hold down converted records if it was insufficiently sized (#6974 ) When the log contains out of order message formats (for example v2 message followed by v1 message) and consists of compressed batches typically greater than 1kB in size, it is possible for down-conversion to fail. With compressed batches, we estimate the size of down-converted batches using: ``` private static int estimateCompressedSizeInBytes(int size, CompressionType compressionType) { return compressionType == CompressionType.NONE ? size : Math.min(Math.max(size / 2, 1024), 1 << 16); } ``` This almost always underestimates size of down-converted records if the batch is between 1kB-64kB in size. In general, this means we may under estimate the total size required for compressed batches. Because of an implicit assumption in the code that messages with a lower message format appear before any with a higher message format, we do not grow the buffer we copy the down converted records into when we see a message <= the target message format. This assumption becomes incorrect when the log contains out of order message formats, for example because of leaders flapping while upgrading the message format. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	03d61ebfb9	KAFKA-8569: integrate warning message under static membership (#6972 ) Static members never leave the group, so potentially we could log a flooding number of warning messages in the hb thread. The solution is to only log as warning when we are on dynamic membership. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
wenhoujx	93bf965894	KAFKA-8559: Allocate ArrayList with correct size in PartitionStates (#6964 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Boyang Chen	c7db82b59a	MINOR: rename subscription construction function (#6954 ) Per discussion on #6936, some nit fixes to the Subscription initialization path. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Boyang Chen	1ae92914e2	HOTFIX: Fix optional import in ConsumerCoordinator (#6953 ) This was caused by back-to-back merging of #6854 (which removed the Optional import) and #6936 (which needed the import). Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	47f908fa73	KAFKA-8539; Add group.instance.id to Subscription (#6936 ) This PR is part of KIP-345's effort to utilize this new field for more stable topic partition assignment. We add the group instance id to the `Subscription` object to allow partition assignors to make stickier assignments. More details [here](https://cwiki.apache.org/confluence/display/KAFKA/KIP-345%3A+Introduce+static+membership+protocol+to+reduce+consumer+rebalances#KIP-345:Introducestaticmembershipprotocoltoreduceconsumerrebalances-ClientBehaviorChanges). Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	1b9e107388	KAFKA-7853: Refactor coordinator config (#6854 ) An attempt to refactor current coordinator logic. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Konstantine Karantasis <konstantine@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Colin P. Mccabe	e047864f30	MINOR: fix some warnings in the broker Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Gwen Shapira Closes #6942 from cmccabe/fix-scala-warnings	6 years ago
Guozhang Wang	2ef02f111e	KAFKA-8179: Part I, Bump up consumer protocol to v2 (#6528 ) 1. Add new fields of subscription / assignment and bump up consumer protocol to v2. 2. Update tests to make sure old versioned protocol can be successfully deserialized, and new versioned protocol can be deserialized by old byte code. Reviewers: Boyang Chen <boyang@confluent.io>, Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
wenhoujx	35814298e1	KAFKA-8488: Reduce logging-related string allocation in FetchSessionHandler Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	8dd4fb5ebe	KAFKA-8530; Check for topic authorization errors in OffsetFetch response (#6928 ) The OffsetFetch requires Topic Describe permission. If a client does not have this, we return TOPIC_AUTHORIZATION_FAILED at the partition level. Currently the consumer does not handle this error explicitly, but raises it as a generic `KafkaException`. For consistency with other APIs and to fix transient test failures in `PlaintextEndToEndAuthorizationTest`, we should raise `TopicAuthorizationFailedException` instead. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	af2801031c	KAFKA-8483/KAFKA-8484; Ensure safe handling of producerId resets (#6883 ) The idempotent producer attempts to detect spurious UNKNOWN_PRODUCER_ID errors and handle them by reassigning sequence numbers to the inflight batches. The inflight batches are tracked in a PriorityQueue. The problem is that the reassignment of sequence numbers depends on the iteration order of PriorityQueue, which does not guarantee any ordering. So this can result in sequence numbers being assigned in the wrong order. This patch fixes the problem by using a sorted set instead of a priority queue so that the iteration order preserves the sequence order. Note that resetting sequence numbers is an exceptional case. This patch also fixes KAFKA-8484, which can cause an IllegalStateException when the producerId is reset while there are pending produce requests inflight. The solution is to ensure that sequence numbers are only reset if the producerId of a failed batch corresponds to the current producerId. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
highluck	e2c15e0eeb	MINOR: Remove uncommitted code (#6919 )	6 years ago
Guozhang Wang	bebcbe3a04	KAFKA-8487: Only request re-join on REBALANCE_IN_PROGRESS in CommitOffsetResponse (#6894 ) Plus some minor cleanups on AbstractCoordinator. Reviewers: Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	cca05cace4	KAFKA-8331: stream static membership system test (#6877 ) As title suggested, we boost 3 stream instances stream job with one minute session timeout, and once the group is stable, doing couple of rolling bounces for the entire cluster. Every rejoin based on restart should have no generation bump on the client side. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
Almog Gavra	8e161580b8	KAFKA-8305; Support default partitions & replication factor in AdminClient#createTopic (KIP-464) (#6728 ) This commit makes three changes: - Adds a constructor for NewTopic(String, Optional<Integer>, Optional<Short>) which allows users to specify Optional.empty() for numPartitions or replicationFactor in order to use the broker default. - Changes AdminManager to accept -1 as valid options for replication factor and numPartitions (resolving to broker defaults). - Makes --partitions and --replication-factor optional arguments when creating topics using kafka-topics.sh. - Adds a dependency on scalaJava8Compat library to make it simpler to convert Scala Option to Java Optional Reviewers: Ismael Juma <ismael@juma.me.uk>, Ryanne Dolan <ryannedolan@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
David Arthur	264d1d8a8b	Improve logging in the consumer for epoch updates (#6879 )	6 years ago
Boyang Chen	055c9c7bd6	KAFKA 8311: better handle timeout exception on Stream thread (#6662 ) The goals for this small diff are: 1. Give user guidance if they want to relax commit timeout threshold 2. Indicate the code path where timeout exception was caught Reviewers: John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>	6 years ago
Guozhang Wang	573152dfa8	HOTFIX: Allow multi-batches for old format and no compression (#6871 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Randall Hauch	ce008e72de	KAFKA-8475: Temporarily restore SslFactory.sslContext() helper Temporarily restore the SslFactory.sslContext() function, which some connectors use. This function is not a public API and it will be removed eventually. For now, we will mark it as deprecated.	6 years ago
tadsul	b042b36674	KAFKA-8426; Fix for keeping the ConfigProvider configs consistent with KIP-297 (#6750 ) According to KIP-297 a parameter is passed to ConfigProvider with syntax "config.providers.{name}.param.{param-name}". Currently AbstractConfig allows parameters of the format "config.providers.{name}.{param-name}". With this fix AbstractConfig will be consistent with KIP-297 syntax. Reviewers: Robert Yokota <rayokota@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
tadsul	2c810e4afb	KAFKA-8425: Fix for correctly handling immutable maps (KIP-421 bug) (#6795 ) Since the originals map passed to AbstractConfig constructor may be immutable, avoid updating this map while resolving indirect config variables. Instead a new ResolvingMap instance is now used to store resolved configs. Reviewers: Randall Hauch <rhauch@gmail.com>, Boyang Chen <bchen11@outlook.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Lifei Chen	5795675599	MINOR:Replace duplicated code with common function in utils (#6819 ) Reviewers: Ivan Yurchenko <ivanyu@aiven.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Jason Gustafson	fd9a20e416	KAFKA-8429; Handle offset change when OffsetForLeaderEpoch inflight (#6811 ) It is possible for the offset of a partition to be changed while we are in the middle of validation. If the OffsetForLeaderEpoch request is in-flight and the offset changes, we need to redo the validation after it returns. We had a check for this situation previously, but it was only checking if the current leader epoch had changed. This patch fixes this and moves the validation in `SubscriptionState` where it can be protected with a lock. Additionally, this patch adds test cases for the SubscriptionState validation API. We fix a small bug handling broker downgrades. Basically we should skip validation if the latest metadata does not include leader epoch information. Reviewers: David Arthur <mumrah@gmail.com>	6 years ago
Viktor Somogyi	e82e2e723a	KAFKA-7703; position() may return a wrong offset after seekToEnd (#6407 ) When poll is called which resets the offsets to the beginning, followed by a seekToEnd and a position, it could happen that the "reset to earliest" call in poll overrides the "reset to latest" initiated by seekToEnd in a very delicate way: 1. both request has been issued and returned to the client side (listOffsetResponse has happened) 2. in Fetcher.resetOffsetIfNeeded(TopicPartition, Long, OffsetData) the thread scheduler could prefer the heartbeat thread with the "reset to earliest" call, overriding the offset to the earliest and setting the SubscriptionState with that position. 3. The thread scheduler continues execution of the thread (application thread) with the "reset to latest" call and discards it as the "reset to earliest" already set the position - the wrong one. 4. The blocking position call returns with the earliest offset instead of the latest, despite it wasn't expected. The fix makes SubscriptionState synchronized so that we can verify that the reset is expected while holding the lock. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
José Armando García Sancio	121308cc7a	KAFKA-8286; Generalized Leader Election Admin RPC (KIP-460) (#6686 ) Implements KIP-460: https://cwiki.apache.org/confluence/display/KAFKA/KIP-460%3A+Admin+Leader+Election+RPC. Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	051379ea5d	KAFKA-8430: unit test to make sure null `group.id` and valid `group.instance.id` are valid combo (#6830 ) As title suggests, this unit test is just a double check. No need to push in 2.3 Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>	6 years ago
Boyang Chen	901eb36883	MINOR: Set default `group.instance.id` in JoinGroupResponse to null (#6831 ) As we are planning to add on more supporting features for rebalancing under static membership, we need to make sure the behavior for `group.instance.id` is consistent throughout the whole stack. This patch ensures that the default value is null in the JoinGroup response. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Colin Patrick McCabe	24f664aa16	MINOR: Auth operations must be null when talking to a pre-KIP-430 broker (#6812 ) Authorized operations must be null when talking to a pre-KIP-430 broker. If we present this as the empty set instead, it is impossible for clients to know if they have no permissions, or are talking to an old broker. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Jason Gustafson	e6057e5038	KAFKA-8437; Await node api versions before checking if offset validation is possible (#6823 ) The consumer should await api version information before determining whether the broker supports offset validation. In KAFKA-8422, we skip the validation if we don't have api version information, which means we always skip validation the first time we connect to a node. This bug was detected by the failing system test `tests/client/truncation_test.py`. The test passes again with this fix. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	a1808962e5	KAFKA-8422; Client should send OffsetForLeaderEpoch only if broker supports latest version (#6806 ) In the olden days, OffsetForLeaderEpoch was exclusively an inter-broker protocol and required Cluster level permission. With KIP-320, clients can use this API as well and so we lowered the required permission to Topic Describe. The only way the client can be sure that the new permissions are in use is to require version 3 of the protocol which was bumped for 2.3. If the broker does not support this version, we skip the validation and revert to the old behavior. Additionally, this patch fixes a problem with the newly added replicaId field when parsed from older versions which did not have it. If the field was not present, then we used the consumer's sentinel value, but this would limit the range of visible offsets by the high watermark. To get around this problem, this patch adds a separate "debug" sentinel similar to APIs like Fetch and ListOffsets. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
soondenana	46a02f3231	KAFKA-8341. Retry Consumer group operation for NOT_COORDINATOR error (#6723 ) An API call for consumer groups must send a FindCoordinatorRequest to find the consumer group coordinator, and then send a follow-up request to that node. But the coordinator might move after the FindCoordinatorRequest but before the follow-up request is sent. In that case we currently fail. This change fixes that by detecting this error and then retrying. This fixes listConsumerGroupOffsets, deleteConsumerGroups, and describeConsumerGroups. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Boyang Chen <bchen11@outlook.com>	6 years ago
Guozhang Wang	4574b2438a	MINOR: Remove checking on original joined subscription within handleAssignmentMismatch (#6782 ) When consumer coordinator realize the subscription may have changed, today we check again against the joinedSubscription within handleAssignmentMismatch. This checking however is a bit fishy and over-kill as well. It's better just simplifying it to always request re-join. The joinedSubscription object itself however still need to be maintained for potential augment to avoid extra re-joining the group. Since testOutdatedCoordinatorAssignment already cover the normal case we also remove the other invalidAssignment test case. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	cafdc1e7df	KAFKA-8399: bring back internal.leave.group.on.close config for KStream (#6779 ) As title states. We plan to merge this to both trunk and 2.3 if it could fix the stream system tests globally. Reference implementation: #6673 Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>	6 years ago
Jason Gustafson	4f11090597	HOTFIX: Fix recent protocol breakage from KIP-345 and KIP-392 (#6780 ) KIP-345 and KIP-392 introduced a couple breaking changes for old versions of bumped protocols. This patch fixes them. Reviewers: Colin Patrick McCabe <cmccabe@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Boyang Chen <bchen11@outlook.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
David Arthur	bacb45e044	MINOR: Set `replicaId` for OffsetsForLeaderEpoch from followers (#6775 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago

1 2 3 4 5 ...

1572 Commits (05cba28ca7aafd3974e9e818be08f239b6162855)