When a suspended task is re-assigned under the eager rebalance protocol, we have to add the task back to the state updater so that it has a chance to catch up with its changelog.
This was prevented by a check in Tasks, which disallows removing SUSPENDED tasks from the task registry. I couldn't find a reason why this must be an invariant of the task registry, so this weakens the check.
The error happens in the integration between TaskRegistry and TaskManager, but this change also adds unit tests to more closely specify the intended behavior of the two modules.
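A hedged sketch of the weakened check, with hypothetical names (the actual registry code differs):

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: removal of SUSPENDED tasks used to be rejected;
// the weakened check only guards states that must stay registered.
final class TaskRegistrySketch {
    enum State { CREATED, RESTORING, RUNNING, SUSPENDED, CLOSED }

    private final Map<String, State> tasks = new HashMap<>();

    void remove(final String taskId) {
        final State state = tasks.get(taskId);
        // Before: state == State.SUSPENDED was also rejected here.
        if (state == State.RUNNING) {
            throw new IllegalStateException("Cannot remove running task " + taskId);
        }
        tasks.remove(taskId); // SUSPENDED tasks may now be removed and
                              // re-added to the state updater
    }
}
```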
Reviewers: Bruno Cadonna <bruno@confluent.io>
We embrace immutability and thus should return a new object instead of
`this`, similar to other config classes we use in the DSL.
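For illustration, a minimal sketch of the copy-on-write pattern using a hypothetical config class:

```java
// Hypothetical immutable config class: modifiers return a new
// instance rather than mutating and returning `this`.
public final class ExampleConfig {
    private final String name;

    private ExampleConfig(final String name) {
        this.name = name;
    }

    public static ExampleConfig as(final String name) {
        return new ExampleConfig(name);
    }

    public ExampleConfig withName(final String newName) {
        return new ExampleConfig(newName); // a new object, not `this`
    }
}
```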
Side JavaDocs cleanup for a bunch of classes.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Spotbugs was temporarily disabled as part of KAFKA-15485 to support building Kafka with JDK 21. This PR upgrades the spotbugs version to 4.8.0, which adds support for JDK 21, and enables its usage in the build again.
Reviewers: Divij Vaidya <diviv@amazon.com>
When tasks are found corrupted, Kafka Streams tries to commit
the non-corrupted tasks before closing and reviving the corrupted
active tasks. Besides active running tasks, Kafka Streams tries
to commit restoring active tasks and standby tasks. However,
restoring active tasks do not need to be committed since they
do not have offsets to commit and the current code does not
write a checkpoint. Furthermore, trying to commit restoring
active tasks with the state updater enabled results in the
following error:
java.lang.UnsupportedOperationException: This task is read-only
at org.apache.kafka.streams.processor.internals.ReadOnlyTask.commitNeeded(ReadOnlyTask.java:209)
...
since commitNeeded() is not a read-only method for active tasks.
In the future, we can consider writing a checkpoint for restoring
active tasks in this situation. Additionally, we should
fix commitNeeded() in active tasks to be read-only.
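A hedged sketch of the intended filtering (illustrative names, not the actual TaskManager code):

```java
import java.util.List;
import java.util.stream.Collectors;

final class CommitFilterSketch {
    enum State { RESTORING, RUNNING }
    record Task(String id, State state, boolean active) { }

    // Restoring active tasks have no offsets to commit, so they are
    // excluded before offsets are collected for the commit.
    static List<Task> tasksToCommit(final List<Task> tasks) {
        return tasks.stream()
            .filter(t -> !(t.active() && t.state() == State.RESTORING))
            .collect(Collectors.toList());
    }
}
```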
Reviewers: Matthias J. Sax <matthias@confluent.io>, Lucas Brutschy <lbrutschy@confluent.io>
Fixes logging for KafkaStreams#streamThreadLeaveConsumerGroup.
To avoid losing the stack trace of the whole exception, the message is pre-formatted and passed as a string as the first argument, while Exception e is passed as the second argument.
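For illustration (method and message are made up), SLF4J only logs the stack trace when the throwable is passed as its own trailing argument:

```java
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

final class LoggingSketch {
    private static final Logger log = LoggerFactory.getLogger(LoggingSketch.class);

    static void logLeaveGroupFailure(final String threadName, final Exception e) {
        // Pre-formatted message first, exception last: SLF4J logs the
        // full stack trace instead of just the exception's toString().
        log.error(String.format("Thread %s failed to leave the consumer group", threadName), e);

        // By contrast, concatenating the exception into the message
        // would drop the stack trace:
        // log.error("Failed to leave the consumer group: " + e);
    }
}
```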
Reviewers: Anna Sophie Blee-Goldman <sophie@responsive.dev>
With https://issues.apache.org/jira/browse/KAFKA-10575, StateRestoreListener#onRestoreSuspended was added. However, local tests show that it is never called, because DelegatingStateRestoreListener was not updated to call the new method.
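A hedged sketch of the missing delegation (the surrounding class is illustrative; onRestoreSuspended is the real callback):

```java
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.streams.processor.StateRestoreListener;

// Sketch of the delegating listener: the fix is to forward the new
// callback to the user-provided listener instead of dropping it.
final class DelegatingListenerSketch {
    private final StateRestoreListener userListener;

    DelegatingListenerSketch(final StateRestoreListener userListener) {
        this.userListener = userListener;
    }

    public void onRestoreSuspended(final TopicPartition topicPartition,
                                   final String storeName,
                                   final long totalRestored) {
        if (userListener != null) {
            userListener.onRestoreSuspended(topicPartition, storeName, totalRestored);
        }
    }
}
```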
Reviewers: Anna Sophie Blee-Goldman <sophie@responsive.dev>, Bruno Cadonna <cadonna@confluent.io>
Now that the implementation for the state updater is done, we can enable it by default.
This PR enables the state updater by default and fixes code that made assumptions that are not true when the state updater is enabled (mainly tests).
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Walker Carlson <wcarlson@confluent.io>
* Implements start and stop of task executors
* Introduces a flush operation to keep consumer operations out of the processing threads
* Fixes corner case: handle requested unassignment during shutdown
* Fixes corner case: handle race between voluntary unassignment and requested unassignment
* Fixes corner case: task locking future completes for the empty set
* Fixes corner case: we should not reassign a task with an uncaught exception to a task executor
* Improves logging
* The number of threads is controlled from outside the TaskManager
Reviewers: Bruno Cadonna <bruno@confluent.io>
When Streams completes a rebalance, it unlocks the state directories of
all unassigned tasks. Unfortunately, when the state updater is enabled,
Streams does not look into the state updater to determine the
unassigned tasks.
This commit corrects this by adding the check.
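A hedged sketch of the corrected computation (illustrative names):

```java
import java.util.HashSet;
import java.util.Set;

final class UnlockSketch {
    // A task counts as assigned if it is owned by the thread OR still
    // sits in the state updater; only the remainder may be unlocked.
    static Set<String> tasksToUnlock(final Set<String> lockedTasks,
                                     final Set<String> ownedTasks,
                                     final Set<String> stateUpdaterTasks) {
        final Set<String> unlock = new HashSet<>(lockedTasks);
        unlock.removeAll(ownedTasks);
        unlock.removeAll(stateUpdaterTasks); // the previously missing check
        return unlock;
    }
}
```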
Reviewer: Lucas Brutschy <lbrutschy@confluent.io>
The process method inside the tasks needs to be called from within
the processing threads. However, it currently interacts with the
consumer in two ways:
* It resumes processing when the PartitionGroup buffers are empty
* It fetches the lag from the consumer
We introduce updateLags() and
resumePollingForPartitionsWithAvailableSpace() methods that call into
the task from the polling thread, in order to set up the consumer
correctly for the next poll, and extract metadata from the consumer
after the poll.
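For illustration, a sketch of how the polling thread brackets poll() with the two new calls (the loop structure and helper interface are made up):

```java
import java.time.Duration;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerRecords;

final class PollLoopSketch {
    interface Tasks {
        void resumePollingForPartitionsWithAvailableSpace();
        void updateLags();
        void addRecords(ConsumerRecords<byte[], byte[]> records);
    }

    // All consumer interaction stays on the polling thread; the
    // processing threads never touch the consumer.
    static void pollOnce(final Consumer<byte[], byte[]> consumer, final Tasks tasks) {
        // Before the poll: resume partitions whose PartitionGroup
        // buffers have space again.
        tasks.resumePollingForPartitionsWithAvailableSpace();
        final ConsumerRecords<byte[], byte[]> records = consumer.poll(Duration.ofMillis(100));
        tasks.addRecords(records);
        // After the poll: extract lag metadata from the consumer.
        tasks.updateLags();
    }
}
```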
Reviewer: Bruno Cadonna <bruno@confluent.io>
When a Streams application is subscribed with a pattern to
input topics and an input topic is deleted, the stream thread
transitions to PARTITIONS_REVOKED and a rebalance is triggered.
This happens inside the poll call. Sometimes, the poll call
returns before a new assignment is received. That means, Streams
executes the poll loop in state PARTITIONS_REVOKED.
With the state updater enabled, processing is also executed in states
other than RUNNING, and hence also when the stream thread is in state
PARTITIONS_REVOKED. However, that triggers
an IllegalStateException with error message:
No current assignment for partition TEST-TOPIC-A-0
which is a fatal error.
This commit prevents processing when the stream thread is in state
PARTITIONS_REVOKED.
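A minimal sketch of the guard (illustrative; the real check lives in the stream thread's loop):

```java
final class ProcessingGuardSketch {
    enum State { STARTING, PARTITIONS_REVOKED, PARTITIONS_ASSIGNED, RUNNING }

    // With the state updater, processing may run in states other than
    // RUNNING, but never while the assignment has been revoked.
    static boolean mayProcess(final State state) {
        return state != State.PARTITIONS_REVOKED;
    }
}
```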
Reviewer: Lucas Brutschy <lbrutschy@confluent.io>
State updater can get into a busy loop when all tasks are paused, because the changelogReader will never report that all changelogs have been read completely. Fix this by waiting if updatingTasks is empty.
Related and included: if we are restoring and all tasks are paused, we should return immediately from StoreChangelogReader.
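A hedged sketch of the fix (illustrative names): instead of spinning, the updater thread blocks on a condition until a task is added or resumed.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

final class StateUpdaterSketch {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition tasksAvailable = lock.newCondition();
    private final Set<String> updatingTasks = new HashSet<>();

    // No updating tasks (e.g. all paused): block instead of
    // busy-looping on the changelog reader.
    void awaitUpdatingTasks() throws InterruptedException {
        lock.lock();
        try {
            while (updatingTasks.isEmpty()) {
                tasksAvailable.await();
            }
        } finally {
            lock.unlock();
        }
    }

    void resume(final String taskId) {
        lock.lock();
        try {
            updatingTasks.add(taskId);
            tasksAvailable.signalAll(); // wake the updater thread
        } finally {
            lock.unlock();
        }
    }
}
```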
Reviewer: Bruno Cadonna <cadonna@apache.org>
The state directory throws a lock exception during initialization if a task state directory is still locked by the stream thread that previously owned the task. When this happens, Streams catches the lock exception, ignores it, and retries the initialization of the task in the next iteration.
In the state updater code path, we missed catching the lock exception when Streams recycles a task. That leads to the lock exception being thrown to the exception handler, which is unexpected and leads to test failures.
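A hedged sketch of the missing handling (LockException is the real Streams exception type; the surrounding code is illustrative):

```java
import org.apache.kafka.streams.errors.LockException;

final class RecycleSketch {
    interface Task {
        void initializeIfNeeded();
    }

    // A LockException during recycling just means the previous owner
    // has not released the state directory yet: swallow it and retry
    // in the next iteration instead of surfacing it to the handler.
    static boolean tryInitialize(final Task task) {
        try {
            task.initializeIfNeeded();
            return true;
        } catch (final LockException retriable) {
            return false; // keep the task and retry later
        }
    }
}
```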
Reviewer: Lucas Brutschy <lbrutschy@confluent.io>
LogCaptureAppender sets the log level in various tests to check if a certain log message is produced. The log level, however, is never reverted, which changes the log level across the board and introduces flakiness due to non-determinism, since the effective log level depends on execution order. Some log messages change the timing inside tests significantly.
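For illustration, a sketch of the save-and-restore pattern (assuming the log4j 1.x API that LogCaptureAppender builds on):

```java
import org.apache.log4j.Level;
import org.apache.log4j.Logger;

final class LogLevelSketch {
    // Capture the old level, change it for the test, and always
    // revert, so the level does not leak into later tests.
    static void withLogLevel(final Class<?> clazz, final Level level, final Runnable test) {
        final Logger logger = Logger.getLogger(clazz);
        final Level previous = logger.getLevel();
        logger.setLevel(level);
        try {
            test.run();
        } finally {
            logger.setLevel(previous);
        }
    }
}
```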
Reviewer: Bruno Cadonna <cadonna@apache.org>
This is one of the steps required for Kafka to compile with Java 21.
For each case, one of the following fixes was applied:
1. Suppress the warning if fixing it would potentially result in an incompatible change (for public classes)
2. Add final to one or more methods so that the 'this' escape is not possible
3. Replace method calls with direct field access.
In addition, we also fix a couple of compiler warnings related to deprecated references in the `core` module.
See the following for more details regarding the new lint warning:
https://www.oracle.com/java/technologies/javase/21-relnote-issues.html#JDK-8015831
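For illustration, the pattern behind the new lint warning: a constructor that calls an overridable method lets `this` escape before a subclass is fully initialized. Fix 2 from the list above closes the escape:

```java
class Base {
    Base() {
        init(); // 'this' would escape here if init() were overridable
    }

    // Declaring the method final means no subclass can observe a
    // partially constructed object through this call.
    final void init() {
        // safe: cannot be overridden
    }
}
```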
Reviewers: Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>, Chris Egerton <chrise@aiven.io>
All block cache metrics are being multiplied by the total number of
column families. In a `RocksDBTimestampedStore`, we have 2 column
families (the default, and the timestamped values), which causes all
block cache metrics in these stores to become doubled.
The cause is that our metrics recorder uses `getAggregatedLongProperty`
to fetch block cache metrics. `getAggregatedLongProperty` queries the
property on each column family in the database, and sums the results.
Since we always configure all column families to share the same block
cache, that causes the same block cache to be queried multiple times for
its metrics, with the results added together, effectively multiplying
the real value by the total number of column families.
To fix this, we should simply use `getLongProperty`, which queries a
single column family (the default one). Since all column families share
the same block cache, querying just one of them will give us the correct
metrics for that shared block cache.
Note: the same block cache is shared among all column families of a store
irrespective of whether the user has configured a shared block cache
across multiple stores.
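For illustration, the two RocksDB APIs side by side ("rocksdb.block-cache-usage" is one of the affected standard properties):

```java
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

final class BlockCacheMetricsSketch {
    static long blockCacheUsage(final RocksDB db) throws RocksDBException {
        // Sums the property over every column family; with a shared
        // block cache this counts the same cache once per family:
        // db.getAggregatedLongProperty("rocksdb.block-cache-usage");

        // Queries only the default column family, which suffices
        // because all column families share the same block cache.
        return db.getLongProperty("rocksdb.block-cache-usage");
    }
}
```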
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bruno Cadonna <cadonna@apache.org>
Avoid busy waiting for processable tasks. We need to be a bit careful here not to have the task executors sleep when work is available. We have to make sure to signal on the condition variable any time a task becomes "processable". Here are some situations where a task becomes processable:
- Task is unassigned from another TaskExecutor.
- Task state is changed (should only happen when a task is locked inside the polling phase).
- When tasks are unlocked.
- When tasks are added.
- New records available.
- A task is resumed.
In summary (see the sketch below):
- We should probably lock tasks when they are paused and unlock them when they are resumed. We should also wake the task executors after every polling phase. This belongs to the StreamThread integration work (separate PR). We add DefaultTaskManager.signalProcessableTasks for this.
- We need to wake the task executors in DefaultTaskManager.unassignTask, DefaultTaskManager.unlockTasks and DefaultTaskManager.add.
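A hedged sketch of the signaling (illustrative; DefaultTaskManager's actual synchronization differs):

```java
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;
import java.util.function.BooleanSupplier;

final class ProcessableTasksSketch {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition processable = lock.newCondition();

    // Called from unassignTask, unlockTasks, add, and
    // signalProcessableTasks: wake any sleeping task executor.
    void signalProcessableTasks() {
        lock.lock();
        try {
            processable.signalAll();
        } finally {
            lock.unlock();
        }
    }

    // Task executors wait here instead of busy-polling for work.
    void awaitProcessableTask(final BooleanSupplier hasWork) throws InterruptedException {
        lock.lock();
        try {
            while (!hasWork.getAsBoolean()) {
                processable.await();
            }
        } finally {
            lock.unlock();
        }
    }
}
```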
Reviewers: Walker Carlson <wcarlson@confluent.io>, Bruno Cadonna <cadonna@apache.org>
A mocked method is executed unexpectedly when we enable DEBUG
log level, leading to confusing test failures during debugging.
Since the log message itself seems useful, we adapt the test
to take the additional mocked method call into account.
Reviewer: Bruno Cadonna <cadonna@apache.org>
Resets the value of transactionInFlight to false when closing the
StreamsProducer. This ensures we don't try to commit against a
closed producer.
Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>
Preliminary fix for KAFKA-15429 which updates StreamThread.completeShutdown to
catch-and-log errors from consumer.unsubscribe. Though this does not prevent
the exception, it does preserve the original exception that caused the stream
thread to exit.
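A hedged sketch of the change (the surrounding code is illustrative):

```java
import org.apache.kafka.clients.consumer.Consumer;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

final class ShutdownSketch {
    private static final Logger log = LoggerFactory.getLogger(ShutdownSketch.class);

    // Swallow and log unsubscribe failures during shutdown so they do
    // not mask the exception that originally killed the stream thread.
    static void leaveGroup(final Consumer<?, ?> consumer) {
        try {
            consumer.unsubscribe();
        } catch (final RuntimeException e) {
            log.error("Failed to unsubscribe during shutdown", e);
        }
    }
}
```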
Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>
Implement setting and clearing task timeouts, as well as changing the output on exceptions to make
it similar to the existing code path.
Reviewer: Walker Carlson <wcarlson@apache.org>
Minor fix to avoid creating unnecessary standby tasks, especially when these may be surprising or unexpected as in the case of an application with num.standby.replicas = 0 and warmup replicas disabled.
The "bug" here was introduced during the fix for an issue with cooperative rebalancing and in-memory stores. The fundamental problem is that in-memory stores cannot be unassigned from a consumer for any period, however temporary, without being closed and losing all the accumulated state. This caused some grief when the new HA task assignor would assign an active task to a node based on the readiness of the standby version of that task, but would have to remove the active task from the initial assignment so it could first be revoked from its previous owner, as per the cooperative rebalancing protocol. This temporary gap in any version of that task among the consumer's assignment for that one intermediate rebalance would end up causing the consumer to lose all state for it, in the case of in-memory stores.
To fix this, we simply began to place standby tasks on the intended recipient of an active task awaiting revocation by another consumer. However, the fix was a bit of an overreach, as we assigned these temporary standby tasks in all cases, regardless of whether there had previously been a standby version of that task. We can narrow this down without sacrificing any of the intended functionality by only assigning this kind of standby task where the consumer had previously owned some version of it that would otherwise potentially be lost.
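A hedged sketch of the narrowed condition (illustrative names):

```java
final class TransientStandbySketch {
    // Only place a temporary standby on the intended owner of an
    // active task awaiting revocation if that consumer previously
    // owned some version of the task; otherwise there is no
    // accumulated state that could be lost in the interim rebalance.
    static boolean shouldAssignTransientStandby(final boolean activeAwaitingRevocation,
                                                final boolean previouslyOwnedSomeVersion) {
        return activeAwaitingRevocation && previouslyOwnedSomeVersion;
    }
}
```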
Also breaks up some of the long log lines in the StreamsPartitionAssignor and expands the summary info while moving it all to the front of the line (following reports of missing info due to truncation of long log lines in larger applications).
assertThrows makes the verification of exceptions clearer and more intuitive, improving code readability compared to the annotation approach, which is considered a test smell in the research literature. One possible reason given is developers not keeping up to date with recent versions of testing frameworks.
All such patterns in streams have been refactored.
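For illustration, the refactoring in JUnit 4.13 terms:

```java
import static org.junit.Assert.assertThrows;

import org.junit.Test;

public class AssertThrowsExampleTest {
    // Before (the test smell): the annotation cannot pin down which
    // statement is expected to throw.
    // @Test(expected = IllegalStateException.class)
    // public void shouldThrow() { doSomething(); }

    // After: assertThrows scopes the expectation to one lambda and
    // returns the exception so its message can be verified too.
    @Test
    public void shouldThrow() {
        final IllegalStateException e =
            assertThrows(IllegalStateException.class, AssertThrowsExampleTest::doSomething);
        // further assertions on e.getMessage() are possible here
    }

    private static void doSomething() {
        throw new IllegalStateException("boom");
    }
}
```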
Reviewers: vamossagar12 <sagarmeansocean@gmail.com>, Justine Olshan <jolshan@confluent.io>
A stream thread should only change to RUNNING if there are no
active tasks in restoration in the state updater and if there
are no pending tasks to recycle and to init.
Usually, all pending tasks to init are added to the state updater
in the same poll iteration that handles the assignment. However,
if a LockException occurs during the initialization of a task, the
task is re-added to the tasks to init and initialization is retried
in the next poll iteration.
A LockException might occur when a state directory is still locked
by another thread because the rebalance just happened.
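A hedged sketch of the transition condition (illustrative names):

```java
final class RunningTransitionSketch {
    // The thread may only transit to RUNNING once nothing is left to
    // restore, recycle, or initialize.
    static boolean canTransitToRunning(final boolean activeTasksRestoringInStateUpdater,
                                       final int pendingTasksToRecycle,
                                       final int pendingTasksToInit) {
        return !activeTasksRestoringInStateUpdater
            && pendingTasksToRecycle == 0
            && pendingTasksToInit == 0;
    }
}
```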
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>, Walker Carlson <wcarlson@confluent.io>
Currently, Kafka Streams only tries to purge records whose
offset are committed from a repartition topic when at
least one offset was committed in the current commit.
The coupling between committing some offsets and purging
records is not needed and might delay purging of records.
For example, if an in-flight call for purging records has not
completed yet when a commit happens, a new call
is not issued.
If then the earlier in-flight call for purging records
finally completes but the next commit does not commit any
offsets, Streams does not issue the call for purging records
whose offset were committed in the previous commit
because the purging call was still in-flight.
This change issues calls for purging records during any commit
if the purge interval has passed, even if no offsets were committed
in the current commit.
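A hedged sketch of the decoupled decision (illustrative names):

```java
final class PurgeSketch {
    // Purging no longer depends on whether the current commit
    // committed any offsets: it only requires that the purge interval
    // has passed and that no purge call is still in flight.
    static boolean shouldPurge(final long nowMs,
                               final long lastPurgeMs,
                               final long purgeIntervalMs,
                               final boolean purgeInFlight) {
        return !purgeInFlight && nowMs - lastPurgeMs >= purgeIntervalMs;
    }
}
```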
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>, Walker Carlson <wcarlson@confluent.io>
KIP-904 introduced a backward incompatible change that requires a 2-bounce rolling upgrade.
The new "3.4" upgrade config value is not recognized by `AssignorConfiguration` though and thus crashed Kafka Streams if use.
Reviewers: Farooq Qaiser <fqaiser94@gmail.com>, Bruno Cadonna <bruno@confluent.io>
Reusing an admin client across tests can cause false positives in leak checkers, so don't do it.
Reviewers: Divij Vaidya <diviv@amazon.com>, Matthias J. Sax <matthias@confluent.io>