src-kafka

Commit Graph

Author	SHA1	Message	Date
Kowshik Prakasam	cdf725828b	KAFKA-10832: Fix Log to use the correct ProducerStateManager instance when updating producers (#9718 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	4 years ago
Walker Carlson	d5dc7dfe00	KAFKA-10810: Replace stream threads (#9697 ) StreamThreads can now be replaced in the streams uncaught exception handler Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>, Leah Thomas <lthomas@confluent.io>	4 years ago
Ismael Juma	8cabd57612	MINOR: Update jmh to 1.27 for async profiler support (#9129 ) Also updated the jmh readme to make it easier for new people to know what's possible and best practices. There were some changes in the generated benchmarking code that required adjusting `spotbugs-exclude.xml` and for a `javac` warning to be suppressed for the benchmarking module. I took the chance to make the spotbugs exclusion mode maintainable via a regex pattern. Tested the commands on Linux and macOS with zsh. JMH highlights: * async-profiler integration. Can be used with -prof async, pass -prof async:help to look for the accepted options. * perf c2c [2] integration. Can be used with -prof perfc2c, if available. * JFR profiler integration. Can be used with -prof jfr, pass -prof jfr:help to look for the accepted options. Full details: * 1.24: https://mail.openjdk.java.net/pipermail/jmh-dev/2020-August/002982.html * 1.25: https://mail.openjdk.java.net/pipermail/jmh-dev/2020-August/002987.html * 1.26: https://mail.openjdk.java.net/pipermail/jmh-dev/2020-October/003024.html * 1.27: https://mail.openjdk.java.net/pipermail/jmh-dev/2020-December/003096.html Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Bill Bejeck <bbejeck@gmail.com>, Lucas Bradstreet <lucasbradstreet@gmail.com>	4 years ago
Matthias J. Sax	567a2ec737	KAFKA-10017: fix flaky EOS-beta upgrade test (#9688 ) Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <guozhang@confluent.io>	4 years ago
dengziming	125d5ea0fb	KAFKA-10677; Complete fetches in purgatory immediately after resigning (#9639 ) This patch adds logic to complete fetches immediately after resigning by returning the BROKER_NOT_AVAILABLE error. This ensures that the new election cannot be delayed by fetches which are stuck in purgatory. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago
David Mao	b44d32dffe	KAFKA-10748: Add IP connection rate throttling metric (KIP-612) (#9685 ) This PR adds the IP throttling metric as described in KIP-612. Reviewers: Anna Povzner <anna@confluent.io>, David Jacot <djacot@confluent.io>	4 years ago
David Mao	404062d2b6	KAFKA-10747: Extend DescribeClientQuotas and AlterClientQuotas APIs to support IP connection rate quota (KIP-612) (#9628 ) This PR adds support for IP entities to the `DescribeClientQuotas` and `AlterClientQuotas` APIs. This PR also adds support for describing/altering IP quotas via `kafka-configs` tooling. Reviewers: Brian Byrne <bbyrne@confluent.io>, Anna Povzner <anna@confluent.io>, David Jacot <djacot@confluent.io>	4 years ago
Boyang Chen	310e240abd	throw corresponding invalid producer epoch (#9700 ) As suggested, ensure InvalidProducerEpoch gets caught properly on stream side. Reviewers: Guozhang Wang <wangguoz@gmail.com>, A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	4 years ago
dengziming	8e82eaa711	MINOR: Fix some java docs of ReplicaStateMachine (#8552 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
bertber	db79f86025	MINOR: remove duplicate code from resetByDuration (#9699 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Brajesh Kumar	fa93982d3a	KAFKA-9892; Producer state snapshot should be forced to disk (#9621 ) FileChannel.close() does not guarantee modified buffer would be written on the file system. We are changing it with force() semantics to enforce file buffer and metadata written to filesystem (FileChannel.force(true) updates buffer and metadata). Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	4 years ago
Chia-Ping Tsai	6e15937feb	KAFKA-10289; Fix failed connect_distributed_test.py (ConnectDistributedTest.test_bounce) (#9673 ) In Python 3, `filter` functions return iterators rather than `list` so it can traverse only once. Hence, the following loop will only see "empty" and then validation fails. ```python src_messages = self.source.committed_messages() # return iterator sink_messages = self.sink.flushed_messages()) # return iterator for task in range(num_tasks): # only first task can "see" the result. following tasks see empty result src_seqnos = [msg['seqno'] for msg in src_messages if msg['task'] == task] ``` Reference: https://portingguide.readthedocs.io/en/latest/iterators.html#new-behavior-of-map-and-filter. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago
Jason Gustafson	a8b668b37c	KAFKA-10826; Ensure raft io thread respects linger timeout (#9716 ) When there are no pending operations, the raft IO thread can block indefinitely waiting for a network event. We rely on asynchronous wakeups in order to break the blocking wait in order to respond to a scheduled append. The current logic already does this, but only for the case when the linger time has been completed during the call to `scheduleAppend`. It is possible instead that after making one call to `scheduleAppend` to start the linger timer, the application does not do any additional appends. In this case, we still need the IO thread to wakeup when the linger timer expires. This patch fixes the problem by ensuring that the IO thread gets woken up after the first append which begins the linger timer. Reviewers: Guozhang Wang <wangguoz@gmail.com>	4 years ago
Chia-Ping Tsai	1cf9ce95ad	MINOR: add "flush=True" to all print in system tests (#9711 ) That makes the behavior of print equal to pyhton2. Reviewers: Guozhang Wang <wangguoz@gmail.com>	4 years ago
Ismael Juma	1f98112e99	MINOR: Remove connection id from Send and consolidate request/message utils (#9714 ) Connection id is now only present in `NetworkSend`, which is now the class used by `Selector`/`NetworkClient`/`KafkaChannel` (which works well since `NetworkReceive` is the class used for received data). The previous `NetworkSend` was also responsible for adding a size prefix. This logic is already present in `SendBuilder`, but for the minority of cases where `SendBuilder` is not used (including a number of tests), we now have `ByteBufferSend.sizePrefixed()`. With regards to the request/message utilities: * Renamed `toByteBuffer`/`toBytes` in `MessageUtil` to `toVersionPrefixedByteBuffer`/`toVersionPrefixedBytes` for clarity. * Introduced new `MessageUtil.toByteBuffer` that does not include the version as the prefix. * Renamed `serializeBody` in `AbstractRequest/Response` to `serialize` for symmetry with `parse`. * Introduced `RequestTestUtils` and moved relevant methods from `TestUtils`. * Moved `serializeWithHeader` methods that were only used in tests to `RequestTestUtils`. * Deleted `MessageTestUtil`. Finally, a couple of changes to simplify coding patterns: * Added `flip()` and `buffer()` to `ByteBufferAccessor`. * Added `MessageSizeAccumulator.sizeExcludingZeroCopy`. * Used lambdas instead of `TestCondition`. * Used `Arrays.copyOf` instead of `System.arraycopy` in `MessageUtil`. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Jason Gustafson <jason@confluent.io>	4 years ago
Lincong Li	ff88874e0d	KAFKA-10606: Disable auto topic creation for fetch-all-topic-metadata request (#9435 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Ismael Juma	00f7341a82	Revert "KAFKA-10713: Stricter protocol parsing in hostnames (#9593 )" This reverts commit `8a59a22881` since it breaks client configurations like `bootstrap.servers=SASL_PLAINTEXT://localhost:49767`. A KIP will be submitted to discuss the details and an adjusted change will be submitted depending on the outcome of that.	4 years ago
mowczare	cd95ce4ace	MINOR: fix typo "intervall" to "interval" (#5435 ) Co-authored-by: Chia-Ping Tsai <chia7712@gmail.com> Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
leah	78a986bf59	MINOR: Clean up streams metric sensors (#9696 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
APaMio	c5575801b7	MINOR: Using primitive data types for loop index (#9705 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
vamossagar12	99b5e4f4ab	KAFKA-10634; Adding LeaderId to voters list in LeaderChangeMessage along with granting voters (#9539 ) This patch ensures that the leader is included among the voters in the `LeaderChangeMessage`. It also adds an additional field for the set of granting voters, which was originally specified in KIP-595. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>	4 years ago
Boyang Chen	41ea0775e0	KAFKA-10667: add timeout for forwarding requests (#9564 ) add total timeout for forwarding, including the underlying broker-to-controller channel timeout setting. Reviewers: David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>	4 years ago
dengziming	3e5a22cefa	KAFKA-10756; Add missing unit test for `UnattachedState` (#9635 ) This patch adds a unit test for `UnattachedState`, similar to `ResignedStateTest` and `VotedStateTest`. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago
Jason Gustafson	153bbb8ac0	MINOR: Configure reconnect backoff in `BrokerToControllerChannelManager` (#9709 ) We should configure a reconnect backoff for controller connections to prevent tight reconnect loops when the controller cannot be reached. I have borrowed the same configuration we use in `TransactionMarkerChannelManager`. Reviewers: David Arthur <mumrah@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Boyang Chen <boyang@confluent.io>	4 years ago
Chia-Ping Tsai	aebb0e3394	KAFKA-10264; Fix Flaky Test TransactionsTest.testBumpTransactionalEpoch (#9291 ) The test case sends two records before killing broker. The failure is caused when both records are NOT sent in a single batch. The failure of first record can abort second batch and then produces `KafkaException` rather than `TimeoutException`. The patch removes the second record send. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago
Kowshik Prakasam	1d84f54367	MINOR: Remove redundant default parameter values in call to LogSegment.open (#9710 ) Reviewers: Jun Rao <junrao@gmail.com>	4 years ago
Ismael Juma	6f27bb02da	KAFKA-10818: Skip conversion to `Struct` when serializing generated requests/responses (#7409 ) Generated request/response classes have code to serialize/deserialize directly to `ByteBuffer` so the intermediate conversion to `Struct` can be skipped for them. We have recently completed the transition to generated request/response classes, so we can also remove the `Struct` based fallbacks. Additional noteworthy changes: * `AbstractRequest.parseRequest` has a more efficient computation of request size that relies on the received buffer instead of the parsed `Struct`. * Use `SendBuilder` for `AbstractRequest/Response` `toSend`, made the superclass implementation final and removed the overrides that are no longer necessary. * Removed request/response constructors that assume latest version as they are unsafe outside of tests. * Removed redundant version fields in requests/responses. * Removed unnecessary work in `OffsetFetchResponse`'s constructor when version >= 2. * Made `AbstractResponse.throttleTimeMs()` abstract. * Using `toSend` in `SaslClientAuthenticator` instead of `serialize`. * Various changes in Request/Response classes to make them more consistent and to rely on the Data classes as much as possible when it comes to their state. * Remove the version argument from `AbstractResponse.toString`. * Fix `getErrorResponse` for `ProduceRequest` and `DescribeClientQuotasRequest` to use `ApiError` which processes the error message sent back to the clients. This was uncovered by an accidental fix to a `RequestResponseTest` test (it was calling `AbstractResponse.toString` instead of `AbstractResponse.toString(short)`). Rely on existing protocol tests to ensure this refactoring does not change observed behavior (aside from improved performance). Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
José Armando García Sancio	ab0807dd85	KAFKA-10394: Add classes to read and write snapshot for KIP-630 (#9512 ) This PR adds support for generating snapshot for KIP-630. 1. Adds the interfaces `RawSnapshotWriter` and `RawSnapshotReader` and the implementations `FileRawSnapshotWriter` and `FileRawSnapshotReader` respectively. These interfaces and implementations are low level API for writing and reading snapshots. They are internal to the Raft implementation and are not exposed to the users of `RaftClient`. They operation at the `Record` level. These types are exposed to the `RaftClient` through the `ReplicatedLog` interface. 2. Adds a buffered snapshot writer: `SnapshotWriter<T>`. This type is a higher-level type and it is exposed through the `RaftClient` interface. A future PR will add the related `SnapshotReader<T>`, which will be used by the state machine to load a snapshot. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago
Rajini Sivaram	b8ebcc2a93	KAFKA-10798; Ensure response is delayed for failed SASL authentication with connection close delay (#9678 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	4 years ago
Randall Hauch	8db3b1a09a	KAFKA-10811: Correct the MirrorConnectorsIntegrationTest to correctly mask the exit procedures (#9698 ) Normally the `EmbeddedConnectCluster` class masks the `Exit` procedures using within the Connect worker. This normally works great when a single instance of the embedded cluster is used. However, the `MirrorConnectorsIntegrationTest` uses two `EmbeddedConnectCluster` instances, and when the first one is stopped it would reset the (static) exit procedures, and any problems during shutdown of the second embedded Connect cluster would cause the worker to shut down the JVM running the tests. Instead, the `MirrorConnectorsIntegrationTest` class should mask the `Exit` procedures and instruct the `EmbeddedConnectClusters` instances (via the existing builder method) to not mask the procedures. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Guozhang Wang	a57486e750	MINOR: Do not print log4j for memberId required (#9667 ) For MemberIdRequiredException, we would not print the exception at INFO with a full exception message since it may introduce more confusion that clearance. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Boyang Chen <boyang@confluent.io>	4 years ago
high.lee	88c8180957	KAFKA-8147: Update upgrade notes for KIP-446 (#8965 ) Reviewer: Matthias J. Sax <matthias@confluent.io>, John Roesler <john@confluent.io>	4 years ago
Walker Carlson	9ece7fe372	KAFKA-10500: Allow people to add new StreamThread at runtime (#9615 ) Part of KIP-663. Reviewers: Bruno Cadonna <bruno@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	4 years ago
Chia-Ping Tsai	b9640a71c4	HOTFIX: fix failed build caused by StreamThreadTest (#9691 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	4 years ago
Rohit Deshpande	4e9c7fc8a5	KAFKA-10629: TopologyTestDriver should not require a Properties argument (#9660 ) Implements KIP-680. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Matthias J. Sax <matthias@confluent.io>	4 years ago
Chris Egerton	4f2f08eb00	KAFKA-10792: Prevent source task shutdown from blocking herder thread (#9669 ) Changes the `WorkerSourceTask` class to only call `SourceTask::stop` from the task thread when the task is actually stopped (via `Source:task::close` just before `WorkerTask::run` completes), and only if an attempt has been made to start the task (which will not be the case if it was created in the paused state and then shut down before being started). This prevents `SourceTask::stop` from being indirectly invoked on the herder's thread, which can have adverse effects if the task is unable to shut down promptly. Unit tests are tweaked where necessary to account for this new logic, which covers some edge cases mentioned in PR #5020 that were unaddressed up until now. The existing integration tests for blocking connectors are expanded to also include cases for blocking source and sink tasks. Full coverage of every source/sink task method is intentionally omitted from these expanded tests in order to avoid inflating test runtime (each one adds an extra 5 seconds at minimum) and because the tests that are added here were sufficient to reproduce the bug with source task shutdown. Author: Chris Egerton <chrise@confluent.io> Reviewers: Nigel Liang <nigel@nigelliang.com>, Tom Bentley <tbentley@redhat.com>, Randall Hauch <rhauch@gmail.com>	4 years ago
Geordie	cc0247bf53	MINOR: Leaves lock() outside the try block (#9687 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Luke Chen	20ae73b051	KAFKA-10665: close all kafkaStreams before purgeLocalStreamsState (#9674 ) The flaky tests are because we forgot to close the kafkaStreams before purgeLocalStreamsState, so that sometimes there will be some tmp files be created/deleted during streams running(ex: checkpoint.tmp), and caused the DirectoryNotEmptyException or NoSuchFileException be thrown. Reviewers: Levani Kokhreidze, Bill Bejeck <bbejeck@apache.org>	4 years ago
APaMio	df0c52e7fd	MINOR: a small refactor for LogManage#shutdown (#9680 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
leah	4cc6d204ec	KAFKA-10500: Add failed-stream-threads metric for adding + removing stream threads (#9614 ) Part of KIP-663. Reviewer: Bruno Cadonna <bruno@confluent.io>, Walker Carlson <wcarlson@confluent.io>, Matthias J. Sax <matthias@confluent.io>	4 years ago
Prateek Agarwal	155f2c06fb	KAFKA-10803: Fix improper removal of bad dynamic config (#9682 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Geordie	b18ecad90e	MINOR: Make Histogram#clear more readable (#9679 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	4 years ago
Bruno Cadonna	7c68531a1f	MINOR: Fix flaky test shouldQueryOnlyActivePartitionStoresByDefault (#9681 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	4 years ago
David Arthur	633f7cff19	KAFKA-10799 AlterIsr utilizes ReplicaManager ISR metrics (#9677 ) Add small interface to Partition.scala that allows AlterIsr and ZK code paths to update the ISR metrics managed by ReplicaManager. This opens the door for consolidating even more code between the two ISR update code paths.	4 years ago
Jim Galasyn	03cff6cb59	MINOR: Fix KTable-KTable foreign-key join example (#9683 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	4 years ago
James Cheng	16eb1f5cd1	KAFKA-10473: Add docs on partition size-on-disk, and other log-related metrics (#9276 ) kafka.log,type=Log,name=Size kafka.log,type=Log,name=NumLogSegments kafka.log,type=Log,name=LogStartOffset kafka.log,type=Log,name=LogEndOffset Reviewers: Guozhang Wang <wangguoz@gmail.com>	4 years ago
David Jacot	10364e4b0c	KAFKA-10739; Replace EpochEndOffset with automated protocol (#9630 ) This patch follows up https://github.com/apache/kafka/pull/9547. It refactors KafkaApis, ReplicaManager and Partition to use `OffsetForLeaderEpochResponseData.EpochEndOffset` instead of `EpochEndOffset`. In the mean time, it removes `OffsetsForLeaderEpochRequest#epochsByTopicPartition` and `OffsetsForLeaderEpochResponse#responses` and replaces their usages to use the automated protocol directly. Finally, it removes old constructors in `OffsetsForLeaderEpochResponse`. The patch relies on existing tests. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Jason Gustafson <jason@confluent.io>	4 years ago
Ankit Kumar	9de16bd2e6	KAFKA-10460: ReplicaListValidator format checking is incomplete (#9326 ) Co-authored-by: akumar <akumar@cloudera.com> Reviewers: Mickael Maison <mickael.maison@gmail.com>, Viktor Somogyi-Vass <viktorsomogyi@gmail.com>	4 years ago
Rajini Sivaram	7ecc3a579a	KAFKA-10554; Perform follower truncation based on diverging epochs in Fetch response (#9382 ) From IBP 2.7 onwards, fetch responses include diverging epoch and offset in fetch responses if lastFetchedEpoch is provided in the fetch request. This PR uses that information for truncation and avoids the additional OffsetForLeaderEpoch requests in followers when lastFetchedEpoch is known. Co-authored-by: Jason Gustafson <jason@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io>, Nikhil Bhatia <rite2nikhil@gmail.com>	4 years ago
Chia-Ping Tsai	abb8ff61cc	MINOR: Align the UID inside/outside container (#9652 ) Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago

... 2 3 4 5 6 ...

8449 Commits (976e6ea0f77c16816b068720abd476005073f39d) All Branches Search

8449 Commits (976e6ea0f77c16816b068720abd476005073f39d)

All Branches