* KAFKA-7236: Add --under-min-isr option to describe topics command (KIP-351); see the sketch after this list
* Minor changes to description and make test consistent with others
* Fix option, and add additional test with mixed partition status
* Add fully-replicated-topic to test case
* Address review nits
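For context, the new flag reports partitions whose in-sync replica set has shrunk below the topic's min.insync.replicas. A minimal AdminClient sketch of the same check (the topic name is illustrative; the real command implements this inside TopicCommand):

```java
import java.util.*;
import org.apache.kafka.clients.admin.*;
import org.apache.kafka.common.config.ConfigResource;

public class UnderMinIsrCheck {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            String topic = "fully-replicated-topic";  // illustrative topic name
            TopicDescription desc = admin.describeTopics(Collections.singleton(topic))
                    .all().get().get(topic);
            ConfigResource res = new ConfigResource(ConfigResource.Type.TOPIC, topic);
            int minIsr = Integer.parseInt(admin.describeConfigs(Collections.singleton(res))
                    .all().get().get(res).get("min.insync.replicas").value());
            desc.partitions().stream()
                    .filter(p -> p.isr().size() < minIsr)  // the --under-min-isr condition
                    .forEach(p -> System.out.println(topic + "-" + p.partition()));
        }
    }
}
```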
When an older message format is in use, we should disable the leader epoch cache so that we resort to truncation by high watermark. Previously we updated the cache for all versions when a broker became leader for a partition. This can cause large and unnecessary truncations after leader changes because we relied on the presence of _any_ cached epoch in order to tell whether to use the improved truncation logic possible with the OffsetsForLeaderEpoch API.
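A minimal sketch of the gating this implies (type and method names are illustrative, not the broker's actual Scala internals):

```java
import java.util.Optional;

// Hypothetical stand-in for the broker's epoch cache, for illustration only.
class LeaderEpochCache {}

class EpochCacheGate {
    // Message format v2 is the first to carry leader epoch information.
    static final byte MAGIC_VALUE_V2 = 2;

    // Expose an epoch cache only when the format can store epochs; an empty
    // result makes followers fall back to truncation by high watermark instead
    // of trusting stale cached epochs via OffsetsForLeaderEpoch.
    static Optional<LeaderEpochCache> epochCacheFor(byte recordVersion) {
        return recordVersion < MAGIC_VALUE_V2
                ? Optional.empty()
                : Optional.of(new LeaderEpochCache());
    }
}
```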
Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Viktor Somogyi-Vass <viktorsomogyi@gmail.com>, Jun Rao <junrao@gmail.com>
Add logic in ConsumerBounceTest to check the error code in FindCoordinator responses and retry if needed. This should help with transient failures or at least get us closer to the actual problem.
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
This PR adds upgrade notes and changes the examples to use the bootstrap-server option.
Author: Viktor Somogyi-Vass <viktorsomogyi@gmail.com>
Reviewers: Srinivas <srinivas96alluri@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #6118 from viktorsomogyi/topiccommand-adminclient-doc
The default backoff of 1000ms when there are no partitions to fetch can cause `shouldExecuteThrottledReassignment` to fail because it takes too long, so we reduce it to 100ms.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>
This patch introduces a new config, "group.max.size", which caps the maximum size any group can reach. It has a default value of Int.MAX_VALUE. Once a group is at its maximum size, subsequent JoinGroup requests receive a GROUP_MAX_SIZE_REACHED error.
In the case where the config is changed and a coordinator broker with the new config loads an old group that is over the threshold, members are kicked out of the group and a rebalance is forced.
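A minimal sketch of the coordinator-side capacity check (names are illustrative; the real logic lives in the group coordinator):

```java
class GroupSizeGuard {
    private final int groupMaxSize; // backed by the new "group.max.size" config

    GroupSizeGuard(int groupMaxSize) { this.groupMaxSize = groupMaxSize; }

    // A JoinGroup from a new member is rejected with GROUP_MAX_SIZE_REACHED
    // once the group is at capacity; existing members can still rejoin.
    boolean acceptNewMember(int currentMemberCount) {
        return currentMemberCount < groupMaxSize;
    }
}
```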
Reviewers: Vahid Hashemian <vahid.hashemian@gmail.com>, Boyang Chen <bchen11@outlook.com>, Gwen Shapira <cshapi@gmail.com>, Jason Gustafson <jason@confluent.io>
The PR adds --bootstrap-server and --admin.config options to TopicCommand and implements an alternative, AdminClient-based way of topic management.
For testing, I've duplicated the existing tests and made them work with the AdminClient options.
Author: Viktor Somogyi-Vass <viktorsomogyi@gmail.com>
Reviewers: Andras Katona <41361962+akatona84@users.noreply.github.com>, Sandor Murakozi <smurakozi@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>
Closes #5683 from viktorsomogyi/topiccommand-adminclient
Limit the number of new connections processed in each iteration of each Processor. Block the Acceptor if the connection queue is full on all Processors.
Added a metric to track the percentage of time the Acceptor is blocked. See KIP-402 for details.
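A rough sketch of the queueing scheme (queue sizes and names are illustrative; see KIP-402 for the actual design):

```java
import java.net.Socket;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// Each Processor has a bounded queue; the Acceptor tries all Processors
// round-robin and blocks only when every queue is full.
class BoundedConnectionQueues {
    private final BlockingQueue<Socket>[] queues;

    @SuppressWarnings("unchecked")
    BoundedConnectionQueues(int processors, int queueSize) {
        queues = new BlockingQueue[processors];
        for (int i = 0; i < processors; i++)
            queues[i] = new ArrayBlockingQueue<>(queueSize);
    }

    void assign(Socket connection, int startIndex) throws InterruptedException {
        for (int i = 0; i < queues.length; i++)               // non-blocking pass first
            if (queues[(startIndex + i) % queues.length].offer(connection))
                return;
        long start = System.nanoTime();
        queues[startIndex % queues.length].put(connection);   // all full: block; this is
        recordAcceptBlockedNanos(System.nanoTime() - start);  // the time the new metric tracks
    }

    private void recordAcceptBlockedNanos(long nanos) { /* feed blocked-time-percent metric */ }
}
```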
Reviewers: Ismael Juma <ismael@juma.me.uk>
Reviewers: Guozhang Wang <guozhang@confluent.io>, Ismael Juma <ismael@confluent.io>, Jorge Quilcate Otoya <quilcate.jorge@gmail.com>, John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>
See also KIP-183.
This implements the following algorithm (a client-side usage sketch follows the steps):

1. AdminClient sends ElectPreferredLeadersRequest.
2. KafkaApis receives ElectPreferredLeadersRequest and delegates to ReplicaManager.electPreferredLeaders().
3. ReplicaManager delegates to KafkaController.electPreferredLeaders().
4. KafkaController adds a PreferredReplicaLeaderElection to the EventManager.
5. ReplicaManager.electPreferredLeaders()'s callback uses the delayedElectPreferredReplicasPurgatory to wait for the results of the election to appear in the metadata cache. If there are no results because of errors, or because the preferred leaders are already leading the partitions, then a response is returned immediately.

In the EventManager work thread the preferred leader is elected as follows:

1. The EventManager runs PreferredReplicaLeaderElection.process().
2. process() calls KafkaController.onPreferredReplicaElectionWithResults().
3. KafkaController.onPreferredReplicaElectionWithResults() calls PartitionStateMachine.handleStateChangesWithResults() to perform the election (asynchronously the PSM will send LeaderAndIsrRequest to the new and old leaders and UpdateMetadataRequest to all brokers), then invokes the callback.
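A client-side usage sketch, assuming the electPreferredLeaders API this change exposes on AdminClient (broker address and topic are illustrative):

```java
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.common.TopicPartition;

public class ElectPreferredLeaderExample {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            // Ask the controller to move leadership of my-topic-0 back to its preferred replica.
            admin.electPreferredLeaders(Collections.singleton(new TopicPartition("my-topic", 0)))
                 .all().get();
        }
    }
}
```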
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Jun Rao <junrao@gmail.com>
This patch fixes a few overflow issues with wrapping sequence numbers in the broker's producer state tracking.
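A sketch of the wrap-around-safe arithmetic involved (method names are illustrative): sequence numbers occupy the full non-negative int range and wrap to 0 after Int.MaxValue, so naive addition overflows:

```java
class SequenceMath {
    // Advance a sequence number, wrapping past Integer.MAX_VALUE back to 0.
    static int incrementSequence(int sequence, int increment) {
        if (sequence > Integer.MAX_VALUE - increment)
            return increment - (Integer.MAX_VALUE - sequence) - 1;  // wrapped
        return sequence + increment;
    }

    // True when nextSeq immediately follows lastSeq, including across the wrap point.
    static boolean inSequence(int lastSeq, int nextSeq) {
        return nextSeq == incrementSequence(lastSeq, 1);
    }
}
```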
Reviewers: Jason Gustafson <jason@confluent.io>
KIP-291 implementation: added code to separate controller connections and requests from the data plane.
Tested with a local deployment that controller requests are handled by the control plane and other requests are handled by the data plane.
Also added unit tests to cover the functionality.
Author: Lucas Wang <luwang@linkedin.com>
Author: Mayuresh Gharat <gharatmayuresh15@gmail.com>
Reviewers: Joel Koshy <jjkoshy@gmail.com>, Jun Rao <junrao@gmail.com>
Using AdminClient#alterConfigs, the topic `retention.ms` property can be assigned a value less than -1. This leads to inconsistency when describing the topic configuration. We should not allow values less than -1.
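A minimal sketch of the added bound check (the real validation is wired into the broker's config definitions):

```java
class RetentionValidation {
    // -1 means "retain forever"; values below -1 have no defined meaning.
    static void validateRetentionMs(long retentionMs) {
        if (retentionMs < -1)
            throw new IllegalArgumentException(
                "retention.ms must be -1 (unlimited) or a non-negative duration, got " + retentionMs);
    }
}
```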
Author: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Matthias J. Sax <matthias@confluent.io>
Closes #6082 from kamalcph/KAFKA-7781
* KAFKA-6627: Prevent config default values from overriding ones specified through --producer-property on the command line.
In Console{Producer,Consumer}, extraProducerProps (options specified via --producer-property) is applied first and then overridden unconditionally, even when the overriding value is not explicitly specified (so the default value is used). This patch fixes it so that a default does not override an existing value set by --producer-property.
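A minimal sketch of the corrected precedence (names are illustrative):

```java
import java.util.Properties;

class ProducerPropsMerge {
    // Defaults must not clobber values the user passed via --producer-property.
    static Properties merge(Properties userProps, String key, String defaultValue) {
        Properties props = new Properties();
        props.putAll(userProps);               // user-specified options first
        props.putIfAbsent(key, defaultValue);  // defaults only fill the gaps
        return props;
    }
}
```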
The contribution is my original work and I license the work to the
project under the project's open source license.
Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>
There is a race condition in ReplicaFetcherThread: we can update PartitionFetchState with a new leader epoch (same leader) before handling an OffsetsForLeaderEpoch response carrying a FENCED_LEADER_EPOCH error. Handling that stale error removes the partition from partitionStates, which in turn stops fetching until the next LeaderAndIsr.
This patch adds logic to ensure that the leader epoch doesn't change while an OffsetsForLeaderEpoch request is in flight (which could happen with back-to-back leader elections). If it has changed, we ignore the response.
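A sketch of the guard (names are illustrative; the real code lives in the fetcher thread):

```java
class EpochFencedResponseGuard {
    private volatile int currentLeaderEpoch;

    // Called when a LeaderAndIsr update bumps the epoch for this partition.
    void onLeaderAndIsr(int newEpoch) { currentLeaderEpoch = newEpoch; }

    // A back-to-back election may have bumped the epoch while the
    // OffsetsForLeaderEpoch request was in flight; processing a stale
    // FENCED_LEADER_EPOCH error would wrongly remove the partition.
    boolean shouldProcessResponse(int requestEpoch) {
        return requestEpoch == currentLeaderEpoch;
    }
}
```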
Also added a toString() implementation to PartitionData, because some log messages did not show useful info, which I noticed while investigating the above system test failure.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
This patch changes the behavior of KafkaProducer.waitOnMetadata to wait up to max.block.ms when the partition specified in the produce request is out of the range of partitions present in the metadata. This improves the user experience in the case when partitions are added to a topic and a client attempts to produce to one of the new partitions before the metadata has propagated to the brokers. Tested with unit tests.
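A simplified sketch of the wait loop (names are illustrative; the real code is in KafkaProducer.waitOnMetadata and refreshes metadata rather than sleeping):

```java
import java.util.concurrent.TimeoutException;
import java.util.function.IntSupplier;

class PartitionWait {
    // Poll the (refreshing) partition count until it covers the requested
    // partition or the max.block.ms budget is exhausted.
    static void awaitPartition(IntSupplier partitionCount, int partition, long maxBlockMs)
            throws TimeoutException, InterruptedException {
        long deadline = System.currentTimeMillis() + maxBlockMs;
        while (partitionCount.getAsInt() <= partition) {
            if (System.currentTimeMillis() >= deadline)
                throw new TimeoutException("Partition " + partition
                        + " not present in metadata after " + maxBlockMs + " ms");
            Thread.sleep(100);  // in the producer this is a metadata refresh, not a sleep
        }
    }
}
```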
Reviewers: Arjun Satish <arjun@confluent.io>, Jason Gustafson <jason@confluent.io>
- This commit sets the ACL on /kafka-acl-extended
- Extended ZkAuthorizationTest to check the ACL on /kafka-acl-extended
- Using the zookeeper-security-migration.sh tool on a Kerberized test cluster, I verified the changes: secured and unsecured the Kafka znodes and examined the ACL on /kafka-acl-extended with a ZooKeeper client
Author: Attila Sasvari <asasvari@apache.org>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Andras Katona <41361962+akatona84@users.noreply.github.com>
Closes #6072 from asasvari/KAFKA-7752
* Shard purgatory to reduce lock contention (see the sketch after this list)
* put constant into Object, use foldLeft instead of for loop
* watchersForKey -> watchersByKey
* Incorporate Jun's comments: use named arguments instead of _, and remove
an unnecessary lock
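A minimal sketch of the sharding idea referenced above (shard count and names are illustrative):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.CopyOnWriteArrayList;

class ShardedWatchers<K, W> {
    private final List<ConcurrentMap<K, List<W>>> shards = new ArrayList<>();

    ShardedWatchers(int shardCount) {
        for (int i = 0; i < shardCount; i++)
            shards.add(new ConcurrentHashMap<>());
    }

    // Operations on different shards never touch the same map, so contention
    // is limited to keys that happen to hash to the same shard.
    void watch(K key, W watcher) {
        shardFor(key).computeIfAbsent(key, k -> new CopyOnWriteArrayList<>()).add(watcher);
    }

    private ConcurrentMap<K, List<W>> shardFor(K key) {
        return shards.get(Math.floorMod(key.hashCode(), shards.size()));
    }
}
```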
Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>, Jun Rao <junrao@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>
**User Interface Improvement:** If a topic doesn't exist, the Kafka describe command should throw a topic-doesn't-exist exception, as the alter and delete commands do.
Author: Manohar Vanam <manohar.crazy09@gmail.com>
Reviewers: Vahid Hashemian <vahid.hashemian@gmail.com>, Jason Gustafson <jason@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #5211 from ManoharVanam/KAFKA-7054
After a recent leader election, the leader's high watermark might lag behind the offset at the beginning of the new epoch (as well as the previous leader's HW). This can lead to offsets going backwards from a client perspective, which is confusing and leads to strange behavior in some clients.
This change causes Partition#fetchOffsetForTimestamp to throw an exception to indicate that the offsets are not yet available from the leader. For new clients, a new OFFSET_NOT_AVAILABLE error is added. For existing clients, a LEADER_NOT_AVAILABLE error is thrown.
This is an implementation of [KIP-207](https://cwiki.apache.org/confluence/display/KAFKA/KIP-207%3A+Offsets+returned+by+ListOffsetsResponse+should+be+monotonically+increasing+even+during+a+partition+leader+change).
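A sketch of the guard (names are illustrative; the API layer maps the empty result to OFFSET_NOT_AVAILABLE or LEADER_NOT_AVAILABLE depending on the client version):

```java
import java.util.OptionalLong;

class OffsetLookupGuard {
    // Right after an election, the leader's high watermark may still be below
    // the start offset of its new epoch, so lookups bounded by the high
    // watermark cannot yet be answered monotonically; empty means "not yet available".
    static OptionalLong fetchOffsetForTimestamp(long highWatermark, long epochStartOffset,
                                                long resolvedOffset) {
        if (highWatermark < epochStartOffset)
            return OptionalLong.empty();
        return OptionalLong.of(resolvedOffset);
    }
}
```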
Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Dhruvil Shah <dhruvil@confluent.io>, Jason Gustafson <jason@confluent.io>
When a consumer first joins a group, it doesn't have an assigned memberId. If the rebalance is delayed for some reason, the client may disconnect after a request timeout and retry. Since the client never received its memberId, we have no way to detect the retry and expire the previously generated member id. This can lead to unbounded growth in the size of the group until the rebalance has completed.
This patch fixes the problem by proactively completing all JoinGroup requests for new members after a timeout of 5 minutes. If the client is still around, we expect it to retry.
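A sketch of the timeout mechanism (names are illustrative; the real coordinator uses its own delayed-operation machinery rather than an executor):

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

class NewMemberJoinTimeout {
    static final long NEW_MEMBER_JOIN_TIMEOUT_MS = TimeUnit.MINUTES.toMillis(5);
    private final ScheduledExecutorService timer = Executors.newSingleThreadScheduledExecutor();

    // A JoinGroup from a member without an id is answered after a fixed timeout
    // even if the rebalance is still pending. If the client is still alive it
    // retries with its assigned member id; if not, the member is expired
    // instead of accumulating in the group.
    void onNewMemberJoin(String pendingMemberId, Runnable completeJoin) {
        timer.schedule(completeJoin, NEW_MEMBER_JOIN_TIMEOUT_MS, TimeUnit.MILLISECONDS);
    }
}
```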
Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Boyang Chen <bchen11@outlook.com>, Guozhang Wang <wangguoz@gmail.com>
Older versions of the Produce API should return an error if zstd is used. This validation existed, but it was done during request parsing, which means that instead of returning an error code, the broker disconnected. This patch fixes the issue by moving the validation outside of the parsing logic. It also fixes several other record validations which had the same problem.
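A sketch of the validate-after-parse ordering (names are illustrative; zstd requires Produce v7+ per KIP-110):

```java
import java.util.Optional;

class ProduceValidation {
    // zstd compression requires Produce request v7 or above (KIP-110).
    static final short MIN_ZSTD_PRODUCE_VERSION = 7;

    // Runs after the request has been fully parsed, so a violation becomes an
    // error code in the response instead of an abrupt disconnect mid-parse.
    static Optional<String> validate(short apiVersion, String compressionType) {
        if ("zstd".equals(compressionType) && apiVersion < MIN_ZSTD_PRODUCE_VERSION)
            return Optional.of("UNSUPPORTED_COMPRESSION_TYPE");
        return Optional.empty();
    }
}
```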
Reviewers: Jason Gustafson <jason@confluent.io>
This test sometimes fails with
```
kafka.tools.MirrorMaker$NoRecordsException
at kafka.tools.MirrorMaker$ConsumerWrapper.receive(MirrorMaker.scala:483)
at kafka.tools.MirrorMakerIntegrationTest$$anonfun$testCommaSeparatedRegex$1.apply$mcZ$sp(MirrorMakerIntegrationTest.scala:92)
at kafka.utils.TestUtils$.waitUntilTrue(TestUtils.scala:738)
```
The test should catch `NoRecordsException` instead of `TimeoutException`.
Reviewers: Ismael Juma <ismael@juma.me.uk>
On the follower side, when an empty `LogAppendInfo` is retrieved from the leader, the fetcher sets the wrong lag in fetcherLagStats because `nextOffset` is zero.
There were several reported incidents where the log is rolled to a new segment with the same base offset as an active segment, causing `KafkaException: Trying to roll a new log segment for topic partition X-N with start offset M while it already exists`. In the cases we have seen, this happens with an empty log segment where there is a long idle time before the next append, and somehow we get into a state where offsetIndex.isFull() returns true because _maxEntries == 0. This PR recovers from this state by deleting and recreating the segment and all of its associated index files.
Reviewers: Jason Gustafson <jason@confluent.io>
Delayed fetch operations acquire the leaderIsrUpdate read lock of one or more Partitions from the fetch request when attempting to complete the fetch operation. While appending new records, complete fetch requests only after releasing the leaderIsrUpdate lock of the Partition to which records were appended, to avoid deadlocks in request handler threads.
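A sketch of the pattern (names are illustrative): collect completable operations while holding the lock, complete them after releasing it:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

class AppendThenComplete {
    private final ReadWriteLock leaderIsrUpdate = new ReentrantReadWriteLock();

    void appendRecords(Runnable doAppend, List<Runnable> delayedFetches) {
        List<Runnable> toComplete;
        leaderIsrUpdate.readLock().lock();
        try {
            doAppend.run();
            toComplete = new ArrayList<>(delayedFetches); // collect only, do not run yet
        } finally {
            leaderIsrUpdate.readLock().unlock();
        }
        toComplete.forEach(Runnable::run); // completing here cannot deadlock on partition locks
    }
}
```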
Reviewers: Jason Gustafson <jason@confluent.io>, Jun Rao <junrao@gmail.com>
EndToEndLatency tool produces a dummy record if the topic does not exist. This behavior was introduced in https://github.com/apache/kafka/pull/5319 as part of updating the tool to use the latest consumer API. However, if we run the tool with producer acks == 1, the high watermark may not be updated before we reset consumer offsets to latest. In rare cases when this happens, the tool throws an exception in the for loop because the consumer unexpectedly consumes the dummy record. As a result, we occasionally see Benchmark.test_end_to_end_latency system test failures.
This PR checks if the topic exists, and creates it using AdminClient if it does not.
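A minimal AdminClient sketch of the check-then-create step (topic name, partition count, and replication factor are illustrative):

```java
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewTopic;

public class EnsureTopic {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            // Create the topic only if it is missing, instead of producing a dummy record.
            if (!admin.listTopics().names().get().contains("latency-test"))
                admin.createTopics(Collections.singleton(
                        new NewTopic("latency-test", 1, (short) 1))).all().get();
        }
    }
}
```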
Author: Anna Povzner <anna@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5950 from apovzner/fix-EndToEndLatency
- Remove ZKUtils usage from various tests
Author: Manikumar Reddy <manikumar.reddy@gmail.com>
Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>, Ismael Juma <ismael@juma.me.uk>, Satish Duggana <satishd@apache.org>, Jun Rao <junrao@gmail.com>, Ryanne Dolan <ryannedolan@gmail.com>
Closes #5480 from omkreddy/zkutils