src-kafka

Commit Graph

Author	SHA1	Message	Date
John Roesler	e16859dc48	KAFKA-9390: Make serde pseudo-topics unique (#8054 ) During the discussion for KIP-213, we decided to pass "pseudo-topics" to the internal serdes we use to construct the wrapper serdes for CombinedKey and hashing the left-hand-side value. However, during the implementation, this strategy wasn't fully implemented, and we wound up using the same topic name for a few different data types. Reviewers: Guozhang Wang <guozhang@confluent.io>	5 years ago
high.lee	dc89c86d43	KAFKA-9483: Add Scala KStream#toTable to the Streams DSL (#8024 ) Part of KIP-523 Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>	5 years ago
Matthias J. Sax	50aead64b9	MINOR: fix and improve StreamsConfig JavaDocs (#8086 ) Reviewer: John Roesler <john@confluent.io>	5 years ago
John Roesler	520a76155c	KAFKA-9517: Fix default serdes with FK join (#8061 ) During the KIP-213 implementation and verification, we neglected to test the code path for falling back to default serdes if none are given in the topology. Reviewer: Bill Bejeck <bbejeck@gmail.com>	5 years ago
Boyang Chen	ff8c40ccb6	KAFKA-9523: Migrate BranchedMultiLevelRepartitionConnectedTopologyTest into a unit test (#8081 ) Relying on integration test to catch an algorithm bug introduces more flakiness, reduce the test into a unit test to reduce the flakiness until we upgrade Java/Scala libs. Checked the test shall fail with older version of StreamsPartitionAssignor. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	3dfc6c15e4	KAFKA-9480: Fix bug that prevented to measure task-level process-rate (#8018 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Guozhang Wang	e70e5d913a	KAFKA-9505: Only loop over topics-to-validate in retries (#8039 ) Found this bug from the repeated flaky runs of system tests, it seems to be long lurking but also would only happen if there are frequent rebalances / topic creation within a short time, which is exactly the case in some of our smoke system tests. Also added a unit test. Reviewers: Boyang Chen <boyang@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Navinder Pal Singh Brar	d76fa1b22d	KAFKA-9487: Follow-up PR of Kafka-9445 (#8033 ) Follows up on the original PR for KAFKA-9445 to address a final round of feedback Reviewers: John Roesler <vvcephei@apache.org>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Matthias J. Sax	059a81e3c9	KAFKA-7658: Follow up to original PR (#8027 ) Follow up to original PR #7985 for KIP-523 (adding `KStream#toTable()` operator) - improve JavaDocs - add more unit tests - fix bug for auto-repartitioning - some code cleanup Reviewers: High Lee <yello1109@daum.net>, John Roesler <john@confluent.io>	5 years ago
Guozhang Wang	a6c9e96bd3	HOTFIX: Fix two test failures in JDK11 (#8063 ) 1. StoreChangelogReaderTest.shouldRequestCommittedOffsetsAndHandleTimeoutException[1] This is due to stricter ternary operator type casting 2. KStreamImplTest.shouldSupportTriggerMaterializedWithKTableFromKStream This is added recently where String typed values for <String, Integer>, in J8 it is allowed but in J11 it is not allowed. Reviewers: John Roesler <john@confluent.io>	5 years ago
A. Sophie Blee-Goldman	f698f3f840	MINOR: further InternalTopologyBuilder cleanup (#8046 ) Followup to KAFKA-7317 and KAFKA-9113, there's some additional cleanup we can do in InternalTopologyBuilder. Mostly refactors the subscription code to make the initialization more explicit and reduce some duplicated code in the update logic. Also some minor cleanup of the build method. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	5380938f8b	MINOR: Add timer for update limit offsets (#8047 ) Instead of always try to update committed offset limits as long as there are buffered records for standby tasks, we leverage on the commit interval to reduce our consumer.committed frequency. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, John Roesler <john@confluent.io>	5 years ago
Guozhang Wang	7ea636c661	HOTFIX: checkstyle for newly added unit test	5 years ago
Daniel Beskin	bdd0a9299f	MINOR: Fixing null handling in ValueAndTimestampSerializer (#7679 ) Since ValueAndTimestampSerializer wraps an unknown Serializer, the output of that Serializer can be null. In which case the line .allocate(rawTimestamp.length + rawValue.length) will throw a NullPointerException. This pull request returns null instead. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	4090f9a2b0	KAFKA-9113: Clean up task management and state management (#7997 ) This PR is collaborated by Guozhang Wang and John Roesler. It is a significant tech debt cleanup on task management and state management, and is broken down by several sub-tasks listed below: Extract embedded clients (producer and consumer) into RecordCollector from StreamTask. guozhangwang#2 guozhangwang#5 Consolidate the standby updating and active restoring logic into ChangelogReader and extract out of StreamThread. guozhangwang#3 guozhangwang#4 Introduce Task state life cycle (created, restoring, running, suspended, closing), and refactor the task operations based on the current state. guozhangwang#6 guozhangwang#7 Consolidate AssignedTasks into TaskManager and simplify the logic of changelog management and task management (since they are already moved in step 2) and 3)). guozhangwang#8 guozhangwang#9 Also simplified the StreamThread logic a bit as the embedded clients / changelog restoration logic has been moved into step 1) and 2). guozhangwang#10 Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Bruno Cadonna <bruno@confluent.io>, Boyang Chen <boyang@confluent.io>	5 years ago
Matthias J. Sax	8bb962d66f	KAFKA-9490: Fix generics for Grouped (#8028 ) Reviewers: Andrew Choi <andchoi@linkedin.com>, John Roesler <john@confluent.io>	5 years ago
David Arthur	7e776b0462	Bump trunk to 2.6.0-SNAPSHOT (#8026 )	5 years ago
Charles Feduke	5ddab1b60c	MINOR: updated documentation where RocksDBStore was being used as the sample class for byte[] versus Bytes examples (#5884 ) Co-authored-by: Guozhang Wang <wangguoz@gmail.com>	5 years ago
high.lee	6b86af3a27	KAFKA-7658: Add KStream#toTable to the Streams DSL (#7985 ) Implements KIP-523. Reviewer: Matthias J. Sax <matthias@confluent.io>	5 years ago
Navinder Pal Singh Brar	05b2361c04	KAFKA-9445: Allow adding changes to allow serving from a specific partition (#7984 ) Implements KIP-562. Reviewers: Vinoth Chandar <vchandar@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
highluck	31ef2b9add	MINOR: Remove unused fields in StreamsMetricsImpl (#7992 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Ted Yu	b50e213eeb	MINOR: Fix topology builder debug log message (#8005 ) Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Ron Dagostino	a3509c0870	MINOR: MiniKdc JVM shutdown hook fix (#7946 ) Also made all shutdown hooks consistent and added tests Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>	5 years ago
highluck	2e351e06b3	KAFKA-9152; Improve Sensor Retrieval (#7928 ) This ticket shall improve two aspects of the retrieval of sensors: https://issues.apache.org/jira/browse/KAFKA-9152 Currently, when a sensor is retrieved with Metrics.Sensor() (e.g. ThreadMetrics.createTaskSensor()) after it was created with the same method Metrics.Sensor(), the sensor is added again to the corresponding queue in Sensors (e.g. threadLevelSensors) in StreamsMetricsImpl. Those queues are used to remove the sensors when removeAllLevelSensors() is called. Having multiple times the same sensors in this queue is not an issue from a correctness point of view. However, it would reduce the footprint to only store a sensor once in those queues. When a sensor is retrieved, the current code attempts to create a new sensor and to add to it again the corresponding metrics. This could be avoided. Both aspects could be improved by checking whether a sensor already exists by calling getSensor() on the Metrics object and checking the return value. Reviewers: Bruno Cadonna <bruno@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	5 years ago
A. Sophie Blee-Goldman	57b2f6807d	KAFKA-7317: Use collections subscription for main consumer to reduce metadata (#7969 ) Also addresses KAFKA-8821 Note that we still have to fall back to using pattern subscription if the user has added any regex-based source nodes to the topology. Includes some minor cleanup on the side Reviewers: Bill Bejeck <bbejeck@gmail.com>	5 years ago
Levani Kokhreidze	21df6eeda9	MINOR: Use Math.min for StreamsPartitionAssignor#updateMinReceivedVersion method (#7954 ) Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
John Roesler	cd4ab4189e	MINOR: fix flaky StreamsUpgradeTestIntegrationTest (#7974 ) Make the test resilient to rebalance timing. Reviewed-by: Guozhang Wang <wangguoz@gmail.com>	5 years ago
vinoth chandar	bbd3348dcb	KAFKA-9431: Expose API in KafkaStreams to fetch all local offset lags (#7961 ) Add a new method to KafkaStreams to return an estimate of the lags for all partitions of all local stores. Implements: KIP-535 Co-authored-by: Navinder Pal Singh Brar <navinder_brar@yahoo.com> Reviewed-by: John Roesler <vvcephei@apache.org>	5 years ago
vinoth chandar	0c76fbbbed	KAFKA-6144: IQ option to query standbys (#7962 ) Add a new overload of KafkaStreams#store that allows users to query standby and restoring stores in addition to active ones. Closes: #7962 Implements: KIP-535 Co-authored-by: Navinder Pal Singh Brar <navinder_brar@yahoo.com> Reviewed-by: John Roesler <vvcephei@apache.org>	5 years ago
vinoth chandar	71c5729a41	KAFKA-6144: Add KeyQueryMetadata APIs to KafkaStreams (#7960 ) Deprecate existing metadata query APIs in favor of new ones that include standby hosts as well as partition information. Closes: #7960 Implements: KIP-535 Co-authored-by: Navinder Pal Singh Brar <navinder_brar@yahoo.com> Reviewed-by: John Roesler <vvcephei@apache.org>	5 years ago
Matthias J. Sax	81fcb80924	KAFKA-9294: Add tests for Named parameter (#7927 ) Part 2 -- tests for stateful KStream operators Reviewers: Bill Bejeck <bill@confluent.io>	5 years ago
Matthias J. Sax	be4f50e7fd	MINOR: JavaDoc cleanup (#7873 ) Reviewers: Bill Bejeck <bill@confluent.io>	5 years ago
Guozhang Wang	505e8240cd	KAFKA-8421: Still return data during rebalance (#7312 ) Not wait until updateAssignmentMetadataIfNeeded returns true, but only call it once with 0 timeout. Also do not return empty if in rebalance. Trim the pre-fetched records after long polling since assignment may have been changed. Also need to update SubscriptionState to retain the state in assignFromSubscribed if it already exists (similar to assignFromUser), so that we do not need the transition of INITIALIZING to FETCHING. Unit test: this actually took me the most time :) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Bruno Cadonna <bruno@confluent.io>, Sophie Blee-Goldman <sophie@confluent.io>, Jason Gustafson <jason@confluent.io>, Richard Yu <yohan.richard.yu@gmail.com>, dengziming <dengziming1993@gmail.com>	5 years ago
Matthias J. Sax	e94f5dcc80	KAFKA-9294: Add tests for Named parameter (#7874 ) Part 1 -- tests for stateless KStream operators only Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	5 years ago
David Kim	163c5e7ac2	KAFKA-9068: Fix javadoc of Stores.{persistent,inMemory}SessionStore (#7908 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	5 years ago
highluck	bbbf431d6e	KAFKA-9384: Loop improvements (#7907 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Ismael Juma	a024e679c7	MINOR: Update dependencies for Kafka 2.5 (#7909 ) Noteworthy: * zstd decompression speed improvement of ~10%: https://github.com/facebook/zstd/releases/tag/v1.4.4 * EasyMock, PowerMock and Mockito: improved support for Java 13. * Replace usage of method deprecated by Mockito. * Gradle plugins updated to versions that require Gradle 5.x, this is fine since we no longer depend on the installed Gradle version. * Fixed build not to depend on methods deprecated in Gradle 5.x (fixes KAFKA-8786). * Reflections 0.9.12 no longer depends on Guava (fixes KAFKA-3061). * Updated `OptimizedKTableIntegrationTest` to pass with new version of Hamcrest. * Several Jetty improvements and bug fixes: - https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.21.v20190926 - https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.22.v20191022 - https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.23.v20191118 - https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.24.v20191120 - https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.25.v20191220 Note that I did not upgrade lz4 due to https://github.com/lz4/lz4-java/issues/156. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Co-authored-by: Ismael Juma <ismael@juma.me.uk> Co-authored-by: Aljoscha <aljoscha.poertner@posteo.de>	5 years ago
Matthias J. Sax	1ccca5c6a9	KAFKA-6049: extend Kafka Streams Scala API for cogroup (KIP-150) (#7847 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	5 years ago
Boyang Chen	1adf0ee889	KAFKA-9335: Fix StreamPartitionAssignor regression in repartition topics counts (#7904 ) This PR fixes the regression introduced in 2.4 from 2 refactoring PRs: #7249 #7419 The bug was introduced by having a logical path leading numPartitionsCandidate to be 0, which is assigned to numPartitions and later being checked by setNumPartitions. In the subsequent check we will throw illegal argument if the numPartitions is 0. This bug is both impacting new 2.4 application and upgrades to 2.4 in certain types of topology. The example in original JIRA was imported as a new integration test to guard against such regression. We also verify that without the bug fix application will still fail by running this integration test. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
A. Sophie Blee-Goldman	1513c817d4	KAFKA-6614: configure internal topics with message.timestamp.type=CreateTime by default (#7889 ) Reviewers: Matthias J. Sax <matthias@confluent.io>	5 years ago
Ismael Juma	6dc6f6a60d	KAFKA-9324: Drop support for Scala 2.11 (KIP-531) (#7859 ) * Adjust build and documentation. * Use lambda syntax for SAM types in `core`, `streams-scala` and `connect-runtime` modules. * Remove `runnable` and `newThread` from `CoreUtils` as lambda syntax for SAM types make them unnecessary. * Remove stale comment in `FunctionsCompatConversions`, `KGroupedStream`, `KGroupedTable` and `KStream` about Scala 2.11, the conversions are needed for Scala 2.12 too. * Deprecate `org.apache.kafka.streams.scala.kstream.Suppressed` and use `org.apache.kafka.streams.kstream.Suppressed` instead. * Use `Admin.create` instead of `AdminClient.create`. Static methods in Java interfaces can be invoked since Scala 2.12. I noticed that MirrorMaker 2 uses `AdminClient.create`, but I did not change them as Connectors have restrictions on newer client APIs. * Improve efficiency in a few `Gauge` implementations by avoiding unnecessary intermediate collections. * Remove pointless `Option.apply` in `ZookeeperClient` `SessionState` metric. * Fix unused import/variable and other compiler warnings. * Reduce visibility of some vals/defs. Reviewers: Manikumar Reddy <manikumar@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Gwen Shapira <gwen@confluent.io>	5 years ago
A. Sophie Blee-Goldman	3453e9e2ee	HOTFIX: fix system test race condition (#7836 ) In some system tests a Streams app is started and then prints a message to stdout, which the system test waits for to confirm the node has successfully been brought up. It then greps for certain log messages in a retriable loop. But waiting on the Streams app to start/print to stdout does not mean the log file has been created yet, so the grep may return an error. Although this occurs in a retriable loop it is assumed that grep will not fail, and the result is piped to wc and then blindly converted to an int in the python function, which fails since the error message is a string (throws ValueError) We should catch the ValueError and return a 0 so it can try again rather than immediately crash Reviewers: Bill Bejeck <bbejeck@gmail.com>, John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
sainath batthala	65d97762ab	KAFKA-9334: Added more unit tests for Materialized class (#7871 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	5 years ago
John Roesler	cdbf40d572	KAFKA-9310: Handle UnknownProducerId from RecordCollector.send (#7845 ) Reviewers: Matthias J. Sax <mjsax@apache.org>	5 years ago
Bruno Cadonna	1d21cf166a	KAFKA-9305: Add version 2.4 to Streams system tests (#7841 ) Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Matthias J. Sax	5a65da5fe9	MINOR: Kafka Streams Scala API cleanup (#7852 ) Reviewers: Bill Bejeck <bill@confluent.io>	5 years ago
Bruno Cadonna	c1351c34a9	MINOR: Refactor versions in `FutureSubscriptionInfo` (#7849 ) Simplify `FutureSubscriptionInfo` Reviewers: John Roesler <vvcephei@apache.org>	5 years ago
Lee Dongjin	8c64aa080a	MINOR: trivial cleanups - Reformat header: `CustomDeserializerTest`, `ReplicaVerificationToolTest` - Remove unused constructor: `ConsumerGroupDescription` - Remove unused variables in `TimeOrderedKeyValueBufferTest#shouldRestoreV2Format` - Remove deprecated `Number` constructor calls; use `Number#valueOf` instead. Author: Lee Dongjin <dongjin@apache.org> Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #7202 from dongjinleekr/cleanup/201908	5 years ago
Guozhang Wang	a87decb9e4	KAFKA-9113: Extract clients from tasks to record collectors (#7833 ) This is part1 of a series of PRs for task management cleanup: 1. Primarily cleanup MockRecordCollectors: remove unnecessary anonymous inheritance but just consolidate on the NoOpRecordCollector -> renamed to MockRecordCollector. Most relevant changes are unit tests that would be relying on this MockRecordCollector. 2. Let StandbyContextImpl#recordCollector() to return null instead of returning a no-op collector, since in standby tasks we should ALWAYS bypass the logging logic and only use the inner store for restoreBatch. Returning null helps us to realize this assertion failed as NPE as early as possible whereas a no-op collector just hides the bug. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	dbafa07be3	MINOR: Improve javadoc of user-customizable metrics API (#7810 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>	5 years ago

1582 Commits (e16859dc48c679b3c7d9735438df046479b8ec4a)
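
The HOTFIX commit 3453e9e2ee in the table describes a system-test race: the output of grep piped to wc is converted to an int, and the conversion raises ValueError when the text is an error message because the log file does not exist yet. The following is a minimal sketch of the fix that message proposes (catch the ValueError and return 0 so the retry loop can try again). It is not the actual Kafka system-test code; the names `parse_count`, `count_with_retries` and `fetch_output` are hypothetical and introduced only for illustration.

```python
import time


def parse_count(raw: str) -> int:
    """Convert command output to an int; treat non-numeric text as 0.

    Non-numeric text stands in for an error message printed when the
    log file has not been created yet.
    """
    try:
        return int(raw.strip())
    except ValueError:
        # Not a number: report 0 matches so the caller can retry.
        return 0


def count_with_retries(fetch_output, attempts: int = 3, delay_s: float = 0.0) -> int:
    """Call fetch_output() until it yields a positive count or attempts run out.

    fetch_output is a hypothetical callable returning the raw text of a
    count command.
    """
    for _ in range(attempts):
        count = parse_count(fetch_output())
        if count > 0:
            return count
        time.sleep(delay_s)
    return 0
```

Returning 0 instead of raising keeps a missing log file indistinguishable from "no matches yet", which is the behavior the commit message asks for inside a retriable loop.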