Rewrite ReassignPartitionsCommand to use the KIP-455 API when possible, rather
than direct communication with ZooKeeper. Direct ZK access is still supported,
but deprecated, as described in KIP-455.
As specified in KIP-455, the tool has several new flags. --cancel stops
an assignment which is in progress. --preserve-throttle causes the
--verify and --cancel commands to leave the throttles alone.
--additional allows users to execute another partition assignment even
if there is already one in progress. Finally, --show displays all of
the current partition reassignments.
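For context, the KIP-455 reassignment API that the tool now calls is part of the Java Admin client. A minimal sketch of driving it directly (topic name, broker ids, and bootstrap server below are illustrative):
```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Map;
import java.util.Optional;
import java.util.Properties;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewPartitionReassignment;
import org.apache.kafka.clients.admin.PartitionReassignment;
import org.apache.kafka.common.TopicPartition;

public class Kip455Sketch {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // illustrative
        try (AdminClient admin = AdminClient.create(props)) {
            TopicPartition tp = new TopicPartition("my-topic", 0); // illustrative topic

            // Start moving the partition to brokers 1, 2, 3 (what --execute drives).
            admin.alterPartitionReassignments(Collections.singletonMap(
                    tp, Optional.of(new NewPartitionReassignment(Arrays.asList(1, 2, 3)))))
                .all().get();

            // List every reassignment currently in progress (what --show surfaces).
            Map<TopicPartition, PartitionReassignment> ongoing =
                admin.listPartitionReassignments().reassignments().get();
            System.out.println("In progress: " + ongoing.keySet());

            // Cancel the in-progress reassignment (what --cancel uses): an empty target.
            admin.alterPartitionReassignments(
                    Collections.singletonMap(tp, Optional.<NewPartitionReassignment>empty()))
                .all().get();
        }
    }
}
```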
Reorganize the reassignment code and tests somewhat to rely more on unit
testing using the MockAdminClient and less on integration testing. Each
integration test where we bring up a cluster seems to take about 5 seconds, so
it's good when we can get similar coverage from unit tests. To enable this,
MockAdminClient now supports incrementalAlterConfigs, alterReplicaLogDirs,
describeReplicaLogDirs, and some other APIs. MockAdminClient is also now
thread-safe, to match the real AdminClient implementation.
In DeleteTopicTest, use the KIP-455 API rather than invoking the reassignment
command.
Currently when there is a leader change with a log dir reassignment in progress, we do not update the leader epoch in the partition state maintained by `ReplicaAlterLogDirsThread`. This can lead to a FENCED_LEADER_EPOCH error, which results in the partition being marked as failed, which is a permanent failure until the broker is restarted. This patch fixes the problem by updating the epoch in `ReplicaAlterLogDirsThread` after receiving a new LeaderAndIsr request from the controller.
Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>
- part of KIP-447
- commit all tasks at once using non-eos (with eos-beta in follow-up work)
- unified commit logic into TaskManager
- split existing methods of the Task interface into pre/post parts
Reviewers: Boyang Chen <boyang@confluent.io>, Guozhang Wang <guozhang@confluent.io>
Add 4 new assignor configs in preparation for the new assignment algorithm (see the sketch after this list):
1. acceptable.recovery.lag
2. balance.factor
3. max.warmup.replicas
4. probing.rebalance.interval.ms
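A minimal sketch of setting them on a Streams app via their literal key strings; the application id, bootstrap servers, and chosen values are illustrative, and the inline descriptions follow KIP-441:
```java
import java.util.Properties;

import org.apache.kafka.streams.StreamsConfig;

public class AssignorConfigSketch {
    public static Properties streamsProps() {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "example-app");       // illustrative
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // illustrative
        // The new assignor configs, set via their literal key names:
        props.put("acceptable.recovery.lag", 10_000L);       // max lag for a client to be considered caught up
        props.put("balance.factor", 1);                      // allowed difference in task counts between instances
        props.put("max.warmup.replicas", 2);                 // extra replicas used only to warm up state before a move
        props.put("probing.rebalance.interval.ms", 600_000L); // how often to trigger a probing rebalance
        return props;
    }
}
```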
Implements: KIP-441
Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>
Since the assignment info includes a map with all members' host info, we can just check the received map to make sure our endpoint is contained. If not, we need to force the group to rebalance and get our updated endpoint info.
Reviewers: Boyang Chen <boyang@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
The issue itself was fixed a while ago on the producer side, so we can just remove this TODO marker now (we've already removed the isZombie flag anyway).
Reviewers: John Roesler <vvcephei@apache.org>
The `TxnOffsetCommit` API suffers from a bug affecting older client versions which treat `COORDINATOR_LOAD_IN_PROGRESS` errors as fatal. This PR changes the handling on the broker to instead return `COORDINATOR_NOT_AVAILABLE` in this case so that clients won't crash upon doing txn commit.
Reviewers: Jason Gustafson <jason@confluent.io>
Previously, `AdminClient` group operations did not respect a `Call`'s configured number of tries and retry backoff. This could lead to tight retry loops that put a lot of pressure on the broker. This PR ensures that for all group operations the `AdminClient` respects the number of tries and the backoff configured for a given `Call`.
Reviewers: Vikas Singh <vikas@confluent.io>, Jason Gustafson <jason@confluent.io>
Highlights:
* Performance improvements in the collections
library: algorithmic improvements and
changes to avoid unnecessary allocations.
* Performance improvements in the compiler.
* ASM was upgraded to 7.3.1, allowing the
optimizer to run on JDK 13+.
Full release notes: https://github.com/scala/scala/releases/tag/v2.12.11
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
* Broker throttles were incorrectly marked as sensitive configurations. Fix this, so that their values can be returned via DescribeConfigs as expected (see the sketch after this list).
* Previously, changes to broker configs that consisted only of deletions were ignored by the brokers because the delta calculation logic treated only alterations, not deletions, as changes. Fix this and add a regression test.
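After the fix, the throttle value should be readable with a plain DescribeConfigs call against the broker resource. A minimal sketch; the broker id and bootstrap server are illustrative, and leader.replication.throttled.rate is the standard dynamic throttle config:
```java
import java.util.Collections;
import java.util.Properties;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.Config;
import org.apache.kafka.common.config.ConfigResource;

public class DescribeThrottleSketch {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // illustrative
        try (AdminClient admin = AdminClient.create(props)) {
            ConfigResource broker = new ConfigResource(ConfigResource.Type.BROKER, "0"); // illustrative broker id
            Config config = admin.describeConfigs(Collections.singleton(broker))
                .all().get().get(broker);
            // With the fix, the throttle is no longer treated as sensitive, so its value is returned.
            System.out.println(config.get("leader.replication.throttled.rate"));
        }
    }
}
```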
Reviewers: Colin P. McCabe <cmccabe@apache.org>
No logical or behavioral changes, just a bit of cleanup in this class before we have to write and fix a lot of these tests for KIP-441:
* Moved creation of streamsMetadata mock to setUp (in exactly one test it will be overwritten with a strict mock)
* Tried to clean up the use of helper methods for configuring the assignor.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
StateDirectoryTest.shouldReturnEmptyArrayIfListFilesReturnsNull always moves the state dir to /tmp/state-renamed, so it fails whenever that folder already exists (for example, left over from a previous test).
Reviewers: Boyang Chen <boyang@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
In prepareAddPartitions the txnStartTimestamp could be updated to the updateTimestamp, which is assumed to always be larger than the original startTimestamp. However, due to NTP time shift the clock may go backwards, so the new start timestamp can end up smaller than the original one. Later, the time check in completeTransitionTo fails with an IllegalStateException, and the txn never transitions to Ongoing.
An indirect result is that this txn would never be expired, because only Ongoing transactions are checked for expiration.
We should do the same as in #3286 to remove this check.
Also added test coverage for both KAFKA-5415 and KAFKA-8803.
Reviewers: Jason Gustafson <jason@confluent.io>
I have seen an increased incidence of StackOverflowErrors when compiling Scala. This
change doubles the max stack size to 4m.
```
> Task :core:compileScala FAILED
FAILURE: Build failed with an exception.
* What went wrong:
Execution failed for task ':core:compileScala'.
> java.lang.StackOverflowError (no error message)
```
Reviewers: Andrew Choi <a24choi@edu.uwaterloo.ca>, Ismael Juma <ismael@juma.me.uk>
Adds tests for edge conditions of listAllTaskDirectories
Also includes some minor cleanup of the StateDirectoryTest class
Reviewers: Guozhang Wang <wangguoz@gmail.com>
The test was broken by commit 227a7322b77840e08924b9486e4bda2f3dfc1f1a.
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Colin P. McCabe <cmccabe@apache.org>
This commit works around a bug in version v0.9.12 of the upstream `reflections` library by catching and handling the exception thrown.
The reflections issue is tracked by:
https://github.com/ronmamo/reflections/issues/273
New unit tests were introduced to test the behavior.
* KAFKA-9712: Catch and handle exception thrown by reflections scanner
* Update connect/runtime/src/main/java/org/apache/kafka/connect/runtime/isolation/DelegatingClassLoader.java
Co-Authored-By: Konstantine Karantasis <konstantine@confluent.io>
* Move result initialization back to right before it is used
* Use `java.io.File` in tests
* Fix checkstyle
Co-authored-by: Konstantine Karantasis <konstantine@confluent.io>
Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>
This PR removes intermediate conversions between `MetadataResponse.TopicMetadata` => `MetadataResponseTopic` and `MetadataResponse.PartitionMetadata` => `MetadataResponsePartition` objects.
There is 15-20% reduction in object allocations and 5-10% improvement in metadata request performance.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
While discussing KIP-441 we realized we don't strictly enforce that all checkpointed offset sums are positive (or 0, though there's not much point in checkpointing a 0 offset).
Rather than awkwardly trying to handle this within every user/reader of the checkpoint file, we should just make a guarantee that all returned checkpointed offsets are positive.
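A minimal sketch of what such a guarantee could look like at the read path; the method name and exception choice are illustrative, not the actual checkpoint-file code:
```java
import java.util.HashMap;
import java.util.Map;

import org.apache.kafka.common.TopicPartition;

public class CheckpointSanitySketch {
    /**
     * Hypothetical guard: reject any checkpointed offset that is negative so that
     * readers never have to special-case invalid values themselves.
     */
    public static Map<TopicPartition, Long> requireNonNegative(Map<TopicPartition, Long> raw) {
        Map<TopicPartition, Long> sanitized = new HashMap<>(raw.size());
        for (Map.Entry<TopicPartition, Long> entry : raw.entrySet()) {
            Long offset = entry.getValue();
            if (offset == null || offset < 0) {
                throw new IllegalStateException(
                    "Read an invalid checkpointed offset " + offset + " for " + entry.getKey());
            }
            sanitized.put(entry.getKey(), offset);
        }
        return sanitized;
    }
}
```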
Reviewers: Guozhang Wang <wangguoz@gmail.com>
As a developer, it would be convenient if the generated
{request,response}HeaderVersion case statements in ApiMessageType.java
included a comment to remind me which type each of them is so I don't
need to manually cross-reference the newer/rarer ones.
Also include commented lines for the two special cases around
ApiVersionsResponse and ControlledShutdownRequest which are hardcoded in
the ApiMessageTypeGenerator.java and not covered by the message format
json files.
Before:
```java
public short requestHeaderVersion(short _version) {
    switch (apiKey) {
        case 0:
            return (short) 1;
        case 1:
            return (short) 1;
        case 2:
            return (short) 1;
        case 3:
            if (_version >= 9) {
                return (short) 2;
            } else {
                return (short) 1;
            }
        // ...etc
```
After:
```java
public short requestHeaderVersion(short _version) {
    switch (apiKey) {
        case 0: // Produce
            return (short) 1;
        case 1: // Fetch
            return (short) 1;
        case 2: // ListOffset
            return (short) 1;
        case 3: // Metadata
            if (_version >= 9) {
                return (short) 2;
            } else {
                return (short) 1;
            }
        // ...etc
```
Signed-off-by: Dominic Evans <dominic.evans@uk.ibm.com>
Reviewers: Mickael Maison <mickael.maison@gmail.com>
1. Inside StateDirectory#cleanRemovedTasks, skip deleting the lock file (and hence the parent directory) until the lock is released. After the lock is released, only go ahead and delete the parent directory if manualUserCall == true, i.e. when the cleanup is triggered from KafkaStreams#cleanUp, where users are responsible for making sure the Streams instance is not started and hence no other threads are trying to grab that lock.
2. As a result, during scheduled cleanup the corresponding task.dir would not be empty but would be left with only the lock file, so effectively we still achieve the goal of releasing disk space. For callers of listTaskDirectories like KIP-441 (cc @ableegoldman to take a look) I've introduced a new listNonEmptyTaskDirectories which excludes such dummy task.dirs that contain only the lock file.
3. Also fixed KAFKA-8999 along the way to expose the exception while traversing the directory.
Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, John Roesler <vvcephei@apache.org>
When we changed quota communication with KIP-219, fetch requests get throttled by returning an empty response with the delay in throttle_time_ms, and the Kafka consumer retries after the delay. With default configs, the maximum fetch size could be as big as 50MB (or 10MB per partition). The default broker config (1-second window, 10 full windows of tracked bandwidth/thread utilization usage) means that a consumer quota below 5MB/s (per broker) may block consumers from fetching any data.
This PR ensures that consumers cannot get blocked by quota by capping fetchMaxBytes in KafkaApis.handleFetchRequest() to quota window * consume bandwidth quota. In the example of default configs (10-second quota window) and 1MB/s consumer bandwidth quota, fetchMaxBytes would be capped to 10MB.
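The cap itself is simple arithmetic; a small sketch of the idea (names are illustrative, the real change lives in KafkaApis, which is Scala):
```java
public class FetchQuotaCapSketch {
    /**
     * Cap the fetch size so a single response can never exceed what the quota
     * window allows, preventing throttled consumers from being starved forever.
     */
    static int capFetchMaxBytes(int requestedFetchMaxBytes,
                                double quotaBytesPerSecond,
                                int quotaWindowSeconds) {
        long quotaWindowBytes = (long) (quotaBytesPerSecond * quotaWindowSeconds);
        return (int) Math.min(requestedFetchMaxBytes, quotaWindowBytes);
    }

    public static void main(String[] args) {
        // Example from the description: 10-second quota window, 1MB/s quota -> cap at 10MB.
        System.out.println(capFetchMaxBytes(50 * 1024 * 1024, 1024 * 1024, 10)); // 10485760
    }
}
```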
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Reuse the same pseudo-topic for serializing the LHS value in the foreign-key join resolver as
we originally used to serialize it before sending the subscription request.
Reviewers: Boyang Chen <boyang@confluent.io>
KIP-441 Pt. 2: Compute sum of offsets across all stores/changelogs in a task and include them in the subscription.
Previously each thread would just encode every task on disk, but we now need to read the checkpoint file, which is unsafe to do without a lock on the task directory. So, each thread now encodes only its assigned active and standby tasks, and ignores any already-locked tasks.
In some cases there may be unowned and unlocked tasks on disk that were reassigned to another instance and haven't been cleaned up yet by the background thread. Each StreamThread makes a weak effort to lock any such task directories it finds, and if successful is then responsible for computing and reporting that task's offset sum (based on reading the checkpoint file)
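A rough sketch of the per-task offset sum described above, assuming the checkpoint file has already been read into a map; the class and method names here are illustrative, not the actual Streams internals:
```java
import java.util.HashMap;
import java.util.Map;

import org.apache.kafka.common.TopicPartition;

public class OffsetSumSketch {
    /**
     * Sum the checkpointed offsets of all changelog partitions belonging to one task.
     * A hypothetical stand-in for what each StreamThread reports in its subscription.
     */
    public static long offsetSumForTask(Map<TopicPartition, Long> checkpointedOffsets) {
        long sum = 0L;
        for (Long offset : checkpointedOffsets.values()) {
            if (offset == null || offset < 0) {
                continue; // skip sentinel/unknown offsets rather than corrupting the sum
            }
            // Guard against overflow when a task has many large changelog offsets.
            if (sum > Long.MAX_VALUE - offset) {
                return Long.MAX_VALUE;
            }
            sum += offset;
        }
        return sum;
    }

    public static void main(String[] args) {
        Map<TopicPartition, Long> checkpoint = new HashMap<>();
        checkpoint.put(new TopicPartition("app-store-changelog", 0), 42L);
        checkpoint.put(new TopicPartition("app-store-changelog", 1), 100L);
        System.out.println(offsetSumForTask(checkpoint)); // 142
    }
}
```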
This PR therefore also addresses two orthogonal issues:
1. Prevent background cleaner thread from deleting unowned stores during a rebalance
2. Deduplicate standby tasks in subscription: each thread used to include every (non-active) task found on disk in its "standby task" set, which meant every active, standby, and unowned task was encoded by every thread.
Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>
Avoid using the ++ operation on Set, which is slow when a Set has many entries. This PR introduces a new class 'AclSets' which takes multiple Sets as parameters and does 'find' against them one by one. For more details about perf and benchmark, refer to [KAFKA-9685](https://issues.apache.org/jira/browse/KAFKA-9685)
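A simplified sketch of the idea (not the actual AclSets class): search the backing sets one by one and stop at the first match, instead of allocating a combined Set with ++.
```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;

public class AclSetsSketch<T> {
    private final List<Set<T>> sets;

    @SafeVarargs
    public AclSetsSketch(Set<T>... sets) {
        this.sets = Arrays.asList(sets);
    }

    /**
     * Return the first element matching the predicate, searching each backing set
     * in turn, without ever materializing a union of the sets.
     */
    public T find(Predicate<T> predicate) {
        for (Set<T> set : sets) {
            for (T element : set) {
                if (predicate.test(element)) {
                    return element;
                }
            }
        }
        return null;
    }
}
```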
Author: jiao <jiao.zhang@linecorp.com>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #8261 from jiao-zhangS/jira-9685
This PR adds an internal flag to throw if we hit an unexpected protocol version for offset fetch. It can be used together with the EOS_BETA flag so that if the server side downgrades unexpectedly, we fail the application ASAP.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Problem
----
The `incrementalAlterConfigs` API supports OpType.APPEND and OpType.SUBTRACT for configuration properties of LIST type. If an APPEND or SUBTRACT OpType is submitted for a config property which currently has no value, then the operation fails with a NullPointerException on the broker side (conveyed as an "unknown server error" to the client).
This is because the alter code does a `getProperty` of the existing configuration value
with no concern as to whether or not the property actually exists.
This change handles the case of existing null properties.
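For reference, this is the shape of call that used to trip the NPE: an APPEND to a LIST-type property that has no current value. The bootstrap server, topic name, and config choice below are illustrative:
```java
import java.util.Collections;
import java.util.Properties;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.AlterConfigOp;
import org.apache.kafka.clients.admin.ConfigEntry;
import org.apache.kafka.common.config.ConfigResource;

public class IncrementalAlterSketch {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // illustrative
        try (AdminClient admin = AdminClient.create(props)) {
            ConfigResource topic = new ConfigResource(ConfigResource.Type.TOPIC, "my-topic");
            // cleanup.policy is a LIST-type config; APPEND adds a value to whatever is there,
            // which previously caused an NPE on the broker when no value was set yet.
            AlterConfigOp append = new AlterConfigOp(
                new ConfigEntry("cleanup.policy", "compact"), AlterConfigOp.OpType.APPEND);
            admin.incrementalAlterConfigs(
                Collections.singletonMap(topic, Collections.singleton(append))).all().get();
        }
    }
}
```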
Testing
-----
This change includes 2 test cases in the unit test that demonstrate the issue for OpType.SUBTRACT and OpType.APPEND.
Author: Steve Rodrigues <srodrigues@confluent.io>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Bob Barrett <bob.barrett@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
Closes #8216 from steverod/steverod.kafka-9644
Throw InvalidRequestException if null configs are specified for CreateTopics, AlterConfigs or IncrementalAlterConfigs.
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
We detected a bug in soak testing where producer batches could be failed in the sender loop before the produce response callback, triggering an IllegalStateException on the producer batch because it was already aborted.
The impact is not severe since the sender runs on its own thread, but it should be fixed to avoid an unnecessary critical exception.
Reviewers: Bob Barrett <bob.barrett@confluent.io>, Jason Gustafson <jason@confluent.io>