src-kafka

Commit Graph

Author	SHA1	Message	Date
Bruno Cadonna	e3c2148b20	KAFKA-8964: Rename tag client-id for thread-level metrics and below (#7429 ) * Renamed tag client-id to thread-id for thread-level metrics and below * Corrected metrics tag keys for state store that had suffix "-id" instead of "state-id" Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Jukka Karvanen	7e3f8895d6	MINOR: Modified Exception handling for KIP-470 (#7461 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Omkar Mestry	cfa10678bd	KAFKA-7245: Deprecate WindowStore#put(key, value) (#7105 ) Implements KIP-474. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
A. Sophie Blee-Goldman	d88f1048da	KAFKA-8179: Part 7, cooperative rebalancing in Streams (#7386 ) Key improvements with this PR: * tasks will remain available for IQ during a rebalance (but not during restore) * continue restoring and processing standby tasks during a rebalance * continue processing active tasks during rebalance until the RecordQueue is empty* * only revoked tasks must be suspended/closed * StreamsPartitionAssignor tries to return tasks to their previous consumers within a client * but do not try to commit, for now (pending KAFKA-7312) Reviewers: John Roesler <john@confluent.io>, Boyang Chen <boyang@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Manikumar Reddy	4c2bd567b1	MINOR: Bump version to 2.5.0-SNAPSHOT (#7455 )	5 years ago
Jukka Karvanen	a5a6938c69	KAFKA-8233: TopologyTestDriver test input and output usability improvements (#7378 ) Implements KIP-470 Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Bruno Cadonna	52007e878a	KAFKA-8934: Introduce instance-level metrics for streams applications (#7416 ) 1. Moves StreamsMetricsImpl from StreamThread to KafkaStreams 2. Adds instance-level metrics as specified in KIP-444, i.e.: -- version -- commit-id -- application-id -- topology-description -- state Reviewers: Guozhang Wang <wangguoz@gmail.com>, John Roesler <john@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Adam Bellemare	c87fe9402c	KAFKA-3705 Added a foreignKeyJoin implementation for KTable. (#5527 ) https://issues.apache.org/jira/browse/KAFKA-3705 Allows for a KTable to map its value to a given foreign key and join on another KTable keyed on that foreign key. Applies the joiner, then returns the tuples keyed on the original key. This supports updates from both sides of the join. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>, John Roesler <john@confluent.io>, Boyang Chen <boyang@confluent.io>, Christopher Pettitt <cpettitt@confluent.io>, Bill Bejeck <bbejeck@gmail.com>, Jan Filipiak <Jan.Filipiak@trivago.com>, pgwhalen, Alexei Daniline	5 years ago
Bill Bejeck	6925775e63	KAFKA-8558: Add StreamJoined config object to join (#7285 ) Reviewer: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Guozhang Wang	11ab6e7d8f	HOTFIX: remove unused StreamsConfig from StreamsPartitionAssignor	5 years ago
A. Sophie Blee-Goldman	c7efc3613c	HOTFIX: don't throw if upgrading from very old versions (#7436 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
A. Sophie Blee-Goldman	8da69936a7	KAFKA-8649: Send latest commonly supported version in assignment (#7423 ) Instead of sending the leader's version and having older members try to blindly upgrade. The only other real change here is that we will also set the VERSION_PROBING error code and return early from onAssignment when we are upgrading our used subscription version (not just downgrading it) since this implies the whole group has finished the rolling upgrade and all members should rejoin with the new subscription version. Also piggy-backing on a fix for a potentially dangerous edge case, where every thread of an instance is assigned the same set of active tasks. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	3ca204b427	MINOR: Shutdown RocksDB metrics recording trigger thread (#7417 ) * added shutdown for thread that triggers recording of RocksDBMetrics * added unit tests to verify the start and shutdown of the thread * refactored a bit of code Reviewers: Christopher Pettitt <cpettitt@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Bill Bejeck	9e294cbca2	KAFKA-8807: Flaky GlobalStreamThread test (#7418 ) A minor refactor to explicitly verify that Processor#close is only called once. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Sophie Blee-Goldman <sophie@confluent.io>, Bruno Cadonna <bruno@confluent.io>,	5 years ago
Ismael Juma	422687148e	MINOR: Mark RocksDBStoreTest as integration test (#7412 ) shouldNotThrowExceptionOnRestoreWhenThereIsPreExistingRocksDbFiles takes 1m30s, which is too long for a unit test. `RocksDBTimestampedStoreTest` inherits from `RocksDBStoreTest` and it's implicitly considered an integration test too. Reviewers: Guozhang Wang <guozhang@confluent.io>	5 years ago
Matthias J. Sax	9fbb0de5fc	KAFKA-8927: Deprecate PartitionGrouper interface (#7376 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bill Bejeck	d53eab16b2	MINOR: Adjust logic of conditions to set number of partitions in step zero of assignment. (#7419 ) A minor change in logic to account for repartition topics where we might not have the num partitions yet in the metadata. Ran all existing tests plus all streams system tests. Reviewers: John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Michał Siatkowski	45c800ff01	KAFKA-8911: Using proper WindowSerdes constructors in their implicit definitions (#7352 ) Detailed info is available in the ticket: https://issues.apache.org/jira/browse/KAFKA-8911 Briefly, implicit defs are calling empty constructors, which exist only for reflection object creation. Therefore, while using the implicit definitions, an NPE occurs when Serde is called. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Ismael Juma	66183f730f	KAFKA-8471: Replace control requests/responses with automated protocol (#7353 ) Replaced UpdateMetadata{Request, Response}, LeaderAndIsr{Request, Response} and StopReplica{Request, Response} with the automated protocol classes. Updated the JSON schema for the 3 request types to be more consistent and less strict (if needed to avoid duplication). The general approach is to avoid generating new collections in the request classes. Normalization happens in the constructor to make this possible. Builders still have to group by topic to maintain the external ungrouped view. Introduced new tests for LeaderAndIsrRequest and UpdateMetadataRequest to verify that the new logic is correct. A few other clean-ups/fixes in code that was touched due to these changes: * KAFKA-8956: Refactor DelayedCreatePartitions#updateWaiting to avoid modifying collection in foreach. * Avoid unnecessary allocation for state change trace logging if trace logging is not enabled * Use `toBuffer` instead of `toList`, `toIndexedSeq` or `toSeq` as it generally performs better and it matches the performance characteristics of `java.util.ArrayList`. This is particularly important when passing such instances to Java code. * Minor refactoring for clarity and readability. * Removed usage of deprecated `/:`, unused imports and unnecessary `var`s. * Include exception in `AdminClientIntegrationTest` failure message. * Move StopReplicaRequest verification in `AuthorizerIntegrationTest` to the end to match the comment. Reviewers: Colin Patrick McCabe <cmccabe@apache.org>	5 years ago
Guozhang Wang	22434e6535	KAFKA-8319: Make KafkaStreamsTest a non-integration test class (#7382 ) Previous KafkaStreamsTest takes 2min20s on my local laptop, because lots of its integration test which is producing / consuming records, and checking state directory file system takes lots of time. On the other hand, these tests should be well simplified with mocks. This test reduces the test from a clumsy integration test class into a unit tests with mocks of its internal modules. And some other test functions should not be in KafkaStreamsTest actually and have been moved to other modular test classes. Now it takes 2s. Also it helps removing the potential flakiness of the following (some of them are claimed resolved only because we have not seen them recently, but after looking at the test code I can verify they are still flaky): * KAFKA-5818 (the original JIRA ticket indeed exposed a real issue that has been fixed, but the test itself remains flaky) * KAFKA-6215 * KAFKA-7921 * KAFKA-7990 * KAFKA-8319 * KAFKA-8427 Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Bruno Cadonna <bruno@confluent.io>	5 years ago
A. Sophie Blee-Goldman	74f8ae1303	KAFKA-8179: do not suspend standby tasks during rebalance (#7321 ) Some work needs to be done in Streams before we can incorporate cooperative rebalancing. This PR lays the groundwork for it by doing some refactoring, including a behavioral change that affects eager ("normal") rebalancing as well: will no longer suspend standbys in onPartitionsRevoked, instead we just close any that were reassigned in onPartitionsAssigned Reviewers: Bruno Cadonna <bruno@confluent.io>, Boyang Chen <boyang@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	ad3b8437fd	KAFKA-8580: Compute RocksDB metrics (#7263 ) A metric recorder runs in its own thread and regularly records RocksDB metrics from RocksDB's statistics. For segmented state stores the metrics are aggregated over the segments. Reviewers: John Roesler <vvcephei@users.noreply.github.com>, A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	bcc023773f	KAFKA-8880: Add overloaded function of Consumer.committed (#7304 ) 1. Add the overloaded functions. 2. Update the code in Streams to use the batch API for better latency (this applies to both active StreamsTask for initialize the offsets, as well as the StandbyTasks for updating offset limits). 3. Also update all unit test to replace the deprecated APIs. Reviewers: Christopher Pettitt <cpettitt@confluent.io>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Bill Bejeck <bill@confluent.io>	5 years ago
Florian Hussonnois	beac4c7534	KAFKA-6958: Overload methods for group and windowed stream to allow to name operation name using the new Named class (#6413 ) This is the last PR for the KIP-307. NOTE : PR 6412 should be merge first Thanks a lot for the review. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Bruno Cadonna	e98e239a0c	KAFKA-8859: Refactor cache-level metrics (#7367 ) Cache-level metrics are refactored according to KIP-444: * tag client-id changed to thread-id * name hitRatio changed to hit-ratio * made backward compatible by using streams config built.in.metrics.version Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Matthias J. Sax	e85d671dee	MINOR: replace `late` with `out-of-order` in JavaDocs and docs (#7274 ) Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>	5 years ago
Guozhang Wang	a0470726c4	MINOR: Move Murmur3 to Streams	5 years ago
Richard Yu	73c6bd8ac9	[KAFKA-7994] Improve Stream time accuracy for restarts and rebalances (#6694 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Boyang Chen <boyang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
A. Sophie Blee-Goldman	9ba898edc7	remove unused import (#7345 ) Remove unused import that's slipping past checkstyle somehow Reviewers: Matthias J. Sax <mjsax@apache.org>, Christopher Pettitt <cpettitt@confluent.io>	5 years ago
vinoth chandar	4962c8193e	KAFKA-8839 : Improve streams debug logging (#7258 ) * log lock acquisition failures on the state store * Document required uniqueness of state.dir path * Move bunch of log calls around task state changes to DEBUG * More readable log messages during partition assignment Reviewers: Matthias J. Sax <mjsax@apache.org>, A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Stanislav Kozlovski	935b280540	MINOR: Default to 5 partitions of the __consumer_offsets topic in Streams integration tests (#7331 ) Given that the tests do not create clusters larger than 3, we do not gain much by creating 50 partitions for that topic. Reducing it should slightly increase test startup and shutdown speed. Reviewers: Matthias J. Sax <mjsax@apache.org>, Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Bruno Cadonna	bab3e082dc	KAFKA-8859: Expose built-in streams metrics version in `StreamsMetricsImpl` (#7323 ) The streams config built.in.metrics.version is needed to add metrics in a backward-compatible way. However, not in every location where metrics are added a streams config is available to check built.in.metrics.version. Thus, the config value needs to be exposed through the StreamsMetricsImpl object. Reviewers: John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Mickael Maison	ac385c4c3a	KAFKA-8474; Use HTML lists for config layout (#6870 ) Replace the `<table>` elements by `<ul>` so the full page width can be used for the configuration descriptions instead of only a very narrow column. I moved the other fields (Type, Default Value, etc) below each entry. Reviewers: Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
cpettitt-confluent	83c7c0158f	KAFKA-8755: Fix state restore for standby tasks with optimized topology (#7238 ) Key changes include: 1. Moves general offset limit updates down to StandbyTask. 2. Updates offsets for StandbyTask at most once per commit and only when we need an updated offset limit to make progress. 3. Avoids writing a 0 checkpoint when StandbyTask.update is called but we cannot apply any of the records. 4. Avoids going into a restoring state in the case that the last checkpoint is greater than or equal to the offset limit (consumer committed offset). This needs special attention please. Code is in StoreChangelogReader. 5. Does update offset limits initially for StreamTask because it provides a way to prevent playing too many records from the changelog (also the input topic with optimized topology). NOTE: this PR depends on KAFKA-8816, which is under review separately. Fortunately the changes involved are few. You can focus just on the KAFKA-8755 commit if you prefer. Reviewers: Matthias J. Sax <mjsax@apache.org>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	6a3a580399	KAFKA-8856: Add Streams config for backward-compatible metrics (#7279 ) Reviewers: John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
cpettitt-confluent	18246e509e	KAFKA-8878: Fix flaky test AssignedStreamsTasksTest#shouldCloseCleanlyWithSuspendedTaskAndEOS (#7302 ) The previous approach to testing KAFKA-8412 was to look at the logs and determine if an error occurred during close. There was no direct way to detect that an exception occurred because the exception was eaten in AssignedTasks.close. In the PR for that ticket (#7207) it was acknowledged that this was a brittle way to test for the exception. We now see occasional failures because an unrelated ERROR level log entry is made while closing the task. This change eliminates the brittle log checking by rethrowing any time an exception occurs in close, even when a subsequent unclean close succeeds. This has the potential benefit of uncovering other suppressed exceptions down the road. I've verified that even with us rethrowing on closeUnclean that all tests pass. Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
John Roesler	0f177ea6b8	MINOR: Clean up partition assignment logic (#7249 ) These are just some "tidying up" changes I made when I was preparing to start working on KIP-441. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
vinoth chandar	ffef0871c2	KAFKA-7149 : Reducing streams assignment data size (#7185 ) * Leader instance uses dictionary encoding on the wire to send topic partitions * Topic names (most expensive component) are mapped to an integer using the dictionary * Follower instances receive the dictionary, decode topic names back * Purely an on-the-wire optimization, no in-memory structures changed * Test case added for version 5 AssignmentInfo Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Chia-Ping Tsai	18e6bb251b	KAFKA-8861 Fix flaky RegexSourceIntegrationTest.testMultipleConsumersCanReadFromPartitionedTopic (#7281 ) similar to https://issues.apache.org/jira/browse/KAFKA-8011 and https://issues.apache.org/jira/browse/KAFKA-8026 Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
khairy	caaf253b63	MINOR: remove unnecessary nullity check (#7282 ) Minor code enhancement: remove unnecessary check of nullity. Reviewers: Bill Bejeck <bbejeck@gmail.com>	5 years ago
Omar Al-Safi	8dc80e2297	KAFKA-7849: Fix the warning when using GlobalKTable (#7104 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	5 years ago
Guozhang Wang	40432e31f7	MINOR: Check for NULL in case of version probing (#7275 ) In case of version probing we would skip the logic for setting cluster / assigned tasks; since these values are initialized as null they are vulnerable to NPE when code changes. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bill@confluent.io>	5 years ago
Bruno Cadonna	d18d6b033e	MINOR: Refactor tag key for store level metrics (#7257 ) The tag key for store level metrics specified in StreamsMetricsImpl is unified with the tag keys on thread and task level. Reviewers: Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
Bruno Cadonna	d2741e5cbf	MINOR: Remove `activeTaskCheckpointableOffsets` from `AbstractTask` (#7253 ) Reviewers: cpettitt-confluent <53191309+cpettitt-confluent@users.noreply.github.com>, A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bill Bejeck	fcfee618ee	MINOR: Only send delete request if there are offsets in map (#7256 ) Currently on commit streams will attempt to delete offsets from repartition topics. However, if a topology does not have any repartition topics, then the recordsToDelete map will be empty. This PR adds a check that the recordsToDelete is not empty before executing the AdminClient#deleteRecords() method. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
A. Sophie Blee-Goldman	cf32a1a6a0	KAFKA-8179: Part 4, add CooperativeStickyAssignor (#7130 ) Splits the existing StickyAssignor logic into an AbstractStickyAssignor class, which is extended by the existing (eager) StickyAssignor and by the new CooperativeStickyAssignor which supports incremental cooperative rebalancing. There is no actual change to the logic -- most methods from StickyAssignor were moved to AbstractStickyAssignor to be shared with CooperativeStickyAssignor, and the abstract MemberData memberData(Subscription) method converts the Subscription to the embedded list of owned partitions for each assignor. The "generation" logic is left in, however this is always Optional.empty() for the CooperativeStickyAssignor as onPartitionsLost should always be called when a generation is missed. Reviewers: Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	24547b810c	KAFKA-8579: Expose RocksDB metrics (#7209 ) RocksDB metrics are added to the Kafka metrics. For each segmented state store only one set of metrics is exposed rather than one set of metrics for each segment. The metrics are not computed yet. Reviewers: John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>	5 years ago
cpettitt-confluent	6b24b2e836	KAFKA-8816: Make offsets immutable to users of RecordCollector.offsets (#7223 ) Make offsets immutable to users of RecordCollector.offsets. Fix up an existing case where offsets could be modified in this way. Add a simple test to verify offsets cannot be changed externally. Reviewers: Bruno Cadonna <bruno@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
cpettitt-confluent	7334222a71	KAFKA-8412: Fix nullpointer exception thrown on flushing before closing producers (#7207 ) Prior to this change an NPE is raised when calling AssignedTasks.close under the following conditions: 1. EOS is enabled 2. The task was in a suspended state The cause for the NPE is that when a clean close is requested for a StreamTask the StreamTask tries to commit. However, in the suspended state there is no producer so ultimately an NPE is thrown for the contained RecordCollector in flush. The fix put forth in this commit is to have AssignedTasks call closeSuspended when it knows the underlying StreamTask is suspended. Note also that this test is quite involved. I could have just tested that AssignedTasks calls closeSuspended when appropriate, but that is testing, IMO, a detail of the implementation and doesn't actually verify we reproduced the original problem as it was described. I feel much more confident that we are reproducing the behavior - and we can test exactly the conditions that lead to it - when testing across AssignedTasks and StreamTask. I believe this is an additional support for the argument of eventually consolidating the state split across classes. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	c6664e1d08	MINOR: Move the resetting from revoked to the thread loop (#7243 ) Move the error code resetting logic from the onPartitionsRevoked callback into the streamthread directly after we've decided to rejoin the group, since onPartitionsRevoked are not guaranteed to be triggered. Ran system tests on the originally failed StreamsUpgradeTest 10 times and passed. Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Jun Rao <junrao@gmail.com>	5 years ago

1556 Commits (eb8e2a8e3b3e2e7f5c097593faf2c651f92f2caf)