src-kafka

Commit Graph

Author	SHA1	Message	Date
Bob Barrett	ea814d7869	KAFKA-8614; Consistent naming for IncrementalAlterConfig and AlterConfig responses (#7022 ) This patch changes the name of the `Resources` field of AlterConfigsResponse to `Responses`. This makes it consistent with AlterConfigsResponse, which has a differently-named but structurally-identical field. Tested with unit tests. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	25f4e3c7d4	KAFKA-8643; Bring back public MemberDescription constructor (#7060 ) This patch fixes a compatibility breaking `MemberDescription` constructor change in #6957. It also updates `equals` and `hashCode` for the new `groupInstanceId` field that was added in the same patch. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	5 years ago
Jason Gustafson	ebb80f568d	KAFKA-8653; Default rebalance timeout to session timeout for JoinGroup v0 (#7072 ) The rebalance timeout was added to the JoinGroup protocol in version 1. Prior to 2.3, we handled version 0 JoinGroup requests by setting the rebalance timeout to be equal to the session timeout. We lost this logic when we converted the API to use the generated schema definition (#6419) which uses the default value of -1. The impact of this is that the group rebalance timeout becomes 0, so rebalances finish immediately after we enter the PrepareRebalance state and kick out all old members. This causes consumer groups to enter an endless rebalance loop. This patch restores the old behavior. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Boyang Chen	4f5a5eb579	KAFKA-8424: replace ListGroups request/response with automated protocol (#6805 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	5 years ago
Boyang Chen	65b044b200	KAFKA-8636; Add documentation change for max poll interval with static members (#7048 ) Clarify the behavior of `max.poll.interval.ms` for static consumers since it is slightly different from dynamic members. Reviewers: Bill Bejeck <bbejeck@gmail.com>, Matthias J. Sax <mjsax@apache.org>, Jason Gustafson <jason@confluent.io>	5 years ago
Vikas Singh	38f86d139c	MINOR: Use `Topic::isInternalTopic` instead of directly checking (#7047 ) We don't allow changing the number of partitions for internal topics. To enforce this, we check directly whether the topic name belongs to the set of internal topics instead of using the "isInternalTopic" method. This breaks encapsulation by making the client aware that internal topics have special names. This is a simple change to use the `Topic::isInternalTopic` method instead of checking directly in the "alterTopic" command. We also reduce the visibility of `Topic::INTERNAL_TOPICS` to avoid unnecessary reliance on it in the future. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
mmanna-sapfgl	99737588b6	KAFKA-3333: Adds RoundRobinPartitioner with tests (#6771 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Sriharsha Chintalapani <sriharsha@apache.org>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Joel Hamill	3bb126dfbf	MINOR: Fixes AK config typos (#7046 )	5 years ago
Lee Dongjin	05cba28ca7	MINOR: A few cleanups and compiler warning fixes (#6986 ) Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
David Arthur	23beeea34b	KAFKA-8443; Broker support for fetch from followers (#6832 ) Follow on to #6731, this PR adds broker-side support for [KIP-392](https://cwiki.apache.org/confluence/display/KAFKA/KIP-392%3A+Allow+consumers+to+fetch+from+closest+replica) (fetch from followers). Changes: * All brokers will handle FetchRequest regardless of leadership * Leaders can compute a preferred replica to return to the client * New ReplicaSelector interface for determining the preferred replica * Incremental fetches will include partitions with no records if the preferred replica has been computed * Adds new JMX to expose the current preferred read replica of a partition in the consumer Two new conditions were added for completing a delayed fetch. They both relate to communicating the high watermark to followers without waiting for a timeout: * For regular fetches, if the high watermark changes within a single fetch request * For incremental fetch sessions, if the follower's high watermark is lower than the leader A new JMX attribute `preferred-read-replica` was added to the `kafka.consumer:type=consumer-fetch-manager-metrics,client-id=some-consumer,topic=my-topic,partition=0` object. This was added to support the new system test which verifies that the fetch from follower behavior works end-to-end. This attribute could also be useful in the future when debugging problems with the consumer. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Colin Patrick McCabe	822abe47db	MINOR: WorkerUtils#topicDescriptions must unwrap exceptions properly (#6937 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	5 years ago
Kamal Chandraprakash	3750898e20	MINOR: Improve group metadata unknown key version exception message (#7006 ) The patch clarifies the exception message for unknown key versions when loading from the group metadata topic. The patch also makes a trivial change in `KafkaAdminClient` to use `Map.computeIfAbsent`. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Colin P. Mccabe	711c817254	KAFKA-8560; The Kafka protocol generator should support common structures Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Gwen Shapira Closes #6966 from cmccabe/KAFKA-8560	5 years ago
Boyang Chen	f8db022b08	KAFKA-8538 (part of KIP-345): add group.instance.id to DescribeGroup (#6957 ) Include group.instance.id in the describe group result for better visibility. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Boyang Chen	fbf6a76fc4	KAFKA-8356: add static membership info to round robin assignor (#6815 ) The purpose here is to leverage static membership information during round robin consumer assignment, because a persistent member id could help the assignment remain the same during rebalance. The comparison logic is changed to: 1. If member A and member B both have group.instance.id, then compare their group.instance.id 2. If member A has group.instance.id, while member B doesn't, then A < B 3. If both member A and B don't have group.instance.id, compare their member.id In round robin assignor, we use the ephemeral member.id to sort the members in order for assignment. This ordering is not stable and could trigger unnecessary shuffling of tasks. By leveraging group.instance.id, the static member assignment persists when the following conditions are satisfied: 1. the number of members remains the same across generations 2. static members' identities persist across generations Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Mickael Maison	14d854936e	KAFKA-8390: Use automatic RPC generation in CreateDelegationToken (#6828 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Nathan Murthy	55707adaee	MINOR: add unit test for Utils.murmur2 (#5926 )	5 years ago
Guozhang Wang	e5c4ebdd74	KAFKA-8179: Part 2, ConsumerCoordinator Algorithm (#6778 ) 1. In ConsumerCoordinator, select the protocol as the common protocol from all configured assignor instances' supported protocols with the highest number. 1.b. In onJoinPrepare: only call onPartitionRevoked with EAGER. 1.a. In onJoinComplete: call onPartitionAssigned with EAGER; call onPartitionRevoked following onPartitionAssigned with COOPERATIVE, and then request re-join if the error indicates so. 1.c. In performAssignment: update the user's assignor returned assignments by excluding all partitions that are still owned by some other members. 2. I've refactored the Subscription / Assignment such that: assigned partitions, error codes, and group instance id are no longer final and can instead be updated. For the last one, it is directly related to the logic of this PR but I felt it is more convenient to go with other fields. 3. Testing: primarily in ConsumerCoordinatorTest, make it parameterized with protocol, and add necessary scenarios for COOPERATIVE protocol. I intentionally omitted the documentation change since there are some behavioral updates that need to be finalized in later PRs, and hence I will also only add the docs in later PRs. Reviewers: Bill Bejeck <bbejeck@gmail.com>, Boyang Chen <boyang@confluent.io>, Sophie Blee-Goldman <sophie@confluent.io>	5 years ago
Rajini Sivaram	7cb0a1ef4f	MINOR: Reinstate info-level log for dynamic update of SSL keystores (#6925 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Ismael Juma	11641c7f53	MINOR: Reflection free implementation of `defaultKerberosRealm` (#6978 ) The existing implementation triggers warnings in Java 9+ and relies on internal classes that vary depending on the JDK provider. The proposed implementation fixes these issues and it's more concise. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	3e9d1c1411	KAFKA-8106: Skipping ByteBuffer allocation of key / value / headers in logValidator (#6785 ) * KAFKA-8106:Reducing the allocation and copying of ByteBuffer when logValidator do validation. * KAFKA-8106:Reducing the allocation and copying of ByteBuffer when logValidator do validation. * github comments * use batch.skipKeyValueIterator * cleanups * no need to skip kv for uncompressed iterator * checkstyle fixes * fix findbugs * adding unit tests * reuse decompression buffer; and using streaming iterator * checkstyle * add unit tests * remove reusing buffer supplier * fix unit tests * add unit tests * use streaming iterator * minor refactoring * rename * github comments * github comments * reuse buffer at DefaultRecord caller * some further optimization * major refactoring * further refactoring * update comment * github comments * minor fix * add jmh benchmarks * update jmh * github comments * minor fix * github comments	5 years ago
Karan Kumar	4d1d995a11	KAFKA-8563: Remove redundant `NetworkSend.sizeDelimit()` method (#6967 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Dhruvil Shah	5f8b2898ce	KAFKA-8570; Grow buffer to hold down converted records if it was insufficiently sized (#6974 ) When the log contains out of order message formats (for example v2 message followed by v1 message) and consists of compressed batches typically greater than 1kB in size, it is possible for down-conversion to fail. With compressed batches, we estimate the size of down-converted batches using: ``` private static int estimateCompressedSizeInBytes(int size, CompressionType compressionType) { return compressionType == CompressionType.NONE ? size : Math.min(Math.max(size / 2, 1024), 1 << 16); } ``` This almost always underestimates the size of down-converted records if the batch is between 1kB-64kB in size. In general, this means we may underestimate the total size required for compressed batches. Because of an implicit assumption in the code that messages with a lower message format appear before any with a higher message format, we do not grow the buffer we copy the down-converted records into when we see a message <= the target message format. This assumption becomes incorrect when the log contains out of order message formats, for example because of leaders flapping while upgrading the message format. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	03d61ebfb9	KAFKA-8569: integrate warning message under static membership (#6972 ) Static members never leave the group, so potentially we could log a flooding number of warning messages in the hb thread. The solution is to only log as warning when we are on dynamic membership. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
wenhoujx	93bf965894	KAFKA-8559: Allocate ArrayList with correct size in PartitionStates (#6964 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Boyang Chen	c7db82b59a	MINOR: rename subscription construction function (#6954 ) Per discussion on #6936, some nit fixes to the Subscription initialization path. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Boyang Chen	1ae92914e2	HOTFIX: Fix optional import in ConsumerCoordinator (#6953 ) This was caused by back-to-back merging of #6854 (which removed the Optional import) and #6936 (which needed the import). Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	47f908fa73	KAFKA-8539; Add group.instance.id to Subscription (#6936 ) This PR is part of KIP-345's effort to utilize this new field for more stable topic partition assignment. We add the group instance id to the `Subscription` object to allow partition assignors to make stickier assignments. More details [here](https://cwiki.apache.org/confluence/display/KAFKA/KIP-345%3A+Introduce+static+membership+protocol+to+reduce+consumer+rebalances#KIP-345:Introducestaticmembershipprotocoltoreduceconsumerrebalances-ClientBehaviorChanges). Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	1b9e107388	KAFKA-7853: Refactor coordinator config (#6854 ) An attempt to refactor current coordinator logic. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Konstantine Karantasis <konstantine@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Colin P. Mccabe	e047864f30	MINOR: fix some warnings in the broker Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Gwen Shapira Closes #6942 from cmccabe/fix-scala-warnings	6 years ago
Guozhang Wang	2ef02f111e	KAFKA-8179: Part I, Bump up consumer protocol to v2 (#6528 ) 1. Add new fields of subscription / assignment and bump up consumer protocol to v2. 2. Update tests to make sure old versioned protocol can be successfully deserialized, and new versioned protocol can be deserialized by old byte code. Reviewers: Boyang Chen <boyang@confluent.io>, Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
wenhoujx	35814298e1	KAFKA-8488: Reduce logging-related string allocation in FetchSessionHandler Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	8dd4fb5ebe	KAFKA-8530; Check for topic authorization errors in OffsetFetch response (#6928 ) The OffsetFetch requires Topic Describe permission. If a client does not have this, we return TOPIC_AUTHORIZATION_FAILED at the partition level. Currently the consumer does not handle this error explicitly, but raises it as a generic `KafkaException`. For consistency with other APIs and to fix transient test failures in `PlaintextEndToEndAuthorizationTest`, we should raise `TopicAuthorizationFailedException` instead. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	af2801031c	KAFKA-8483/KAFKA-8484; Ensure safe handling of producerId resets (#6883 ) The idempotent producer attempts to detect spurious UNKNOWN_PRODUCER_ID errors and handle them by reassigning sequence numbers to the inflight batches. The inflight batches are tracked in a PriorityQueue. The problem is that the reassignment of sequence numbers depends on the iteration order of PriorityQueue, which does not guarantee any ordering. So this can result in sequence numbers being assigned in the wrong order. This patch fixes the problem by using a sorted set instead of a priority queue so that the iteration order preserves the sequence order. Note that resetting sequence numbers is an exceptional case. This patch also fixes KAFKA-8484, which can cause an IllegalStateException when the producerId is reset while there are pending produce requests inflight. The solution is to ensure that sequence numbers are only reset if the producerId of a failed batch corresponds to the current producerId. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
highluck	e2c15e0eeb	MINOR: Remove uncommitted code (#6919 )	6 years ago
Guozhang Wang	bebcbe3a04	KAFKA-8487: Only request re-join on REBALANCE_IN_PROGRESS in CommitOffsetResponse (#6894 ) Plus some minor cleanups on AbstractCoordinator. Reviewers: Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	cca05cace4	KAFKA-8331: stream static membership system test (#6877 ) As the title suggests, we start a stream job with 3 stream instances and a one minute session timeout, and once the group is stable, we do a couple of rolling bounces of the entire cluster. Every rejoin based on restart should have no generation bump on the client side. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
Almog Gavra	8e161580b8	KAFKA-8305; Support default partitions & replication factor in AdminClient#createTopic (KIP-464) (#6728 ) This commit makes four changes: - Adds a constructor for NewTopic(String, Optional<Integer>, Optional<Short>) which allows users to specify Optional.empty() for numPartitions or replicationFactor in order to use the broker default. - Changes AdminManager to accept -1 as valid options for replication factor and numPartitions (resolving to broker defaults). - Makes --partitions and --replication-factor optional arguments when creating topics using kafka-topics.sh. - Adds a dependency on scalaJava8Compat library to make it simpler to convert Scala Option to Java Optional Reviewers: Ismael Juma <ismael@juma.me.uk>, Ryanne Dolan <ryannedolan@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
David Arthur	264d1d8a8b	Improve logging in the consumer for epoch updates (#6879 )	6 years ago
Boyang Chen	055c9c7bd6	KAFKA-8311: better handle timeout exception on Stream thread (#6662 ) The goals for this small diff are: 1. Give users guidance if they want to relax the commit timeout threshold 2. Indicate the code path where the timeout exception was caught Reviewers: John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>	6 years ago
Guozhang Wang	573152dfa8	HOTFIX: Allow multi-batches for old format and no compression (#6871 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Randall Hauch	ce008e72de	KAFKA-8475: Temporarily restore SslFactory.sslContext() helper Temporarily restore the SslFactory.sslContext() function, which some connectors use. This function is not a public API and it will be removed eventually. For now, we will mark it as deprecated.	6 years ago
tadsul	b042b36674	KAFKA-8426; Fix for keeping the ConfigProvider configs consistent with KIP-297 (#6750 ) According to KIP-297 a parameter is passed to ConfigProvider with syntax "config.providers.{name}.param.{param-name}". Currently AbstractConfig allows parameters of the format "config.providers.{name}.{param-name}". With this fix AbstractConfig will be consistent with KIP-297 syntax. Reviewers: Robert Yokota <rayokota@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
tadsul	2c810e4afb	KAFKA-8425: Fix for correctly handling immutable maps (KIP-421 bug) (#6795 ) Since the originals map passed to AbstractConfig constructor may be immutable, avoid updating this map while resolving indirect config variables. Instead a new ResolvingMap instance is now used to store resolved configs. Reviewers: Randall Hauch <rhauch@gmail.com>, Boyang Chen <bchen11@outlook.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Lifei Chen	5795675599	MINOR:Replace duplicated code with common function in utils (#6819 ) Reviewers: Ivan Yurchenko <ivanyu@aiven.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Jason Gustafson	fd9a20e416	KAFKA-8429; Handle offset change when OffsetForLeaderEpoch inflight (#6811 ) It is possible for the offset of a partition to be changed while we are in the middle of validation. If the OffsetForLeaderEpoch request is in-flight and the offset changes, we need to redo the validation after it returns. We had a check for this situation previously, but it was only checking if the current leader epoch had changed. This patch fixes this and moves the validation in `SubscriptionState` where it can be protected with a lock. Additionally, this patch adds test cases for the SubscriptionState validation API. We fix a small bug handling broker downgrades. Basically we should skip validation if the latest metadata does not include leader epoch information. Reviewers: David Arthur <mumrah@gmail.com>	6 years ago
Viktor Somogyi	e82e2e723a	KAFKA-7703; position() may return a wrong offset after seekToEnd (#6407 ) When poll is called and resets the offsets to the beginning, followed by a seekToEnd and a position call, the "reset to earliest" call in poll can override the "reset to latest" initiated by seekToEnd in a subtle way: 1. Both requests have been issued and have returned to the client side (listOffsetResponse has happened). 2. In Fetcher.resetOffsetIfNeeded(TopicPartition, Long, OffsetData) the thread scheduler could prefer the heartbeat thread with the "reset to earliest" call, overriding the offset to the earliest and setting the SubscriptionState with that position. 3. The thread scheduler continues execution of the application thread with the "reset to latest" call and discards it because the "reset to earliest" already set the position, which is the wrong one. 4. The blocking position call returns the earliest offset instead of the latest, which is not what the caller expects. The fix makes SubscriptionState synchronized so that we can verify that the reset is expected while holding the lock. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
José Armando García Sancio	121308cc7a	KAFKA-8286; Generalized Leader Election Admin RPC (KIP-460) (#6686 ) Implements KIP-460: https://cwiki.apache.org/confluence/display/KAFKA/KIP-460%3A+Admin+Leader+Election+RPC. Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	051379ea5d	KAFKA-8430: unit test to make sure null `group.id` and valid `group.instance.id` are valid combo (#6830 ) As title suggests, this unit test is just a double check. No need to push in 2.3 Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>	6 years ago
Boyang Chen	901eb36883	MINOR: Set default `group.instance.id` in JoinGroupResponse to null (#6831 ) As we are planning to add on more supporting features for rebalancing under static membership, we need to make sure the behavior for `group.instance.id` is consistent throughout the whole stack. This patch ensures that the default value is null in the JoinGroup response. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
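
The member ordering described in KAFKA-8356 (fbf6a76fc4) can be sketched as a comparator. This is only an illustration of the three comparison rules listed in that commit message, not the code from the patch; the `MemberOrdering` class, its `Member` type, and the sample ids are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Optional;

public class MemberOrdering {
    /** Hypothetical stand-in for a consumer group member; not a Kafka class. */
    public static final class Member {
        final String memberId;                  // ephemeral id, changes on rejoin
        final Optional<String> groupInstanceId; // present only for static members

        public Member(String memberId, Optional<String> groupInstanceId) {
            this.memberId = memberId;
            this.groupInstanceId = groupInstanceId;
        }
    }

    /** Orders members by the three rules quoted in the commit message. */
    public static final Comparator<Member> ORDER = (a, b) -> {
        boolean aStatic = a.groupInstanceId.isPresent();
        boolean bStatic = b.groupInstanceId.isPresent();
        if (aStatic && bStatic) {
            // Rule 1: both static, compare group.instance.id
            return a.groupInstanceId.get().compareTo(b.groupInstanceId.get());
        }
        if (aStatic) {
            return -1; // Rule 2: static member sorts before dynamic member
        }
        if (bStatic) {
            return 1;  // Rule 2, mirrored
        }
        // Rule 3: both dynamic, fall back to member.id
        return a.memberId.compareTo(b.memberId);
    };

    /** Returns the member ids in assignment order without mutating the input. */
    public static List<String> sortedMemberIds(List<Member> members) {
        List<Member> copy = new ArrayList<>(members);
        copy.sort(ORDER);
        List<String> ids = new ArrayList<>();
        for (Member m : copy) {
            ids.add(m.memberId);
        }
        return ids;
    }

    public static void main(String[] args) {
        List<Member> members = Arrays.asList(
            new Member("m-3", Optional.empty()),
            new Member("m-9", Optional.of("instance-b")),
            new Member("m-1", Optional.empty()),
            new Member("m-7", Optional.of("instance-a")));
        System.out.println(sortedMemberIds(members));
    }
}
```

Because static members are ordered by `group.instance.id` rather than by the ephemeral `member.id`, their relative order in this sketch does not change when a restart gives them a new `member.id`.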


1580 Commits (3e48bdbc333602a042b6b0fb7fb9e14625ab4ece)
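
The size estimate quoted in KAFKA-8570 (5f8b2898ce) can be evaluated for a few batch sizes to see how it is clamped. The method body below is copied from that commit message; the `DownConversionEstimate` wrapper class and the two-value `CompressionType` enum are minimal stand-ins assumed for this sketch, not Kafka's own classes.

```java
public class DownConversionEstimate {
    /** Minimal stand-in for Kafka's CompressionType; only what the sketch needs. */
    public enum CompressionType { NONE, GZIP }

    // Body copied from the KAFKA-8570 commit message: for compressed batches
    // the estimate is half the batch size, clamped to the range [1024, 65536].
    public static int estimateCompressedSizeInBytes(int size, CompressionType compressionType) {
        return compressionType == CompressionType.NONE
            ? size
            : Math.min(Math.max(size / 2, 1024), 1 << 16);
    }

    public static void main(String[] args) {
        int[] sizes = {512, 4096, 32768, 1 << 20};
        for (int size : sizes) {
            System.out.println(size + " -> "
                + estimateCompressedSizeInBytes(size, CompressionType.GZIP));
        }
    }
}
```

For a 4096-byte compressed batch the estimate is 2048 bytes, and for any compressed batch larger than 128 KiB it is capped at 65536 bytes. A buffer allocated from this estimate therefore has to be grown whenever the down-converted records turn out larger, which is the case the commit addresses.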