Reviewers: Guozhang Wang <guozhang@confluent.io>, Nikolay Izhikov <nIzhikov@gmail.com>, Ismael Juma <ismael@confluent.io>, Bill Bejeck <bill@confluent.io>
This is a follow-up to the previous PR #5779, where KTableSource always gets old values from the store even when sendOldValues is not enabled. It prompted me to make a pass over all the KTable/KStreamXXX processors to push sendOldValues to the callers, in order to avoid unnecessary store reads.
More details: ForwardingCacheFlushListener and TupleForwarder both currently take sendOldValues as a parameter.
a. For ForwardingCacheFlushListener the parameter is not needed at all, since its callers (the XXXCachedStore classes) already use the sendOldValues value passed from TupleForwarder to avoid reading old values from the underlying stores.
b. For TupleForwarder, it only needs to pass the boolean flag to the cached store; it does not need to keep the flag as its own field, since the cached store already respects the flag when deciding whether to forward null or the actual old value.
The only other minor bug found during this pass is in KTableJoinMerge, where we always pass old values and ignore sendOldValues.
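For illustration, a minimal self-contained sketch of the pattern (class and method names are made up, not the actual Streams internals): the flag lives with the caller, and the store read only happens when a downstream processor has actually asked for old values.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not the real KTableSource code.
class Change<V> {
    final V newValue;
    final V oldValue; // stays null unless a downstream processor requested old values
    Change(final V newValue, final V oldValue) {
        this.newValue = newValue;
        this.oldValue = oldValue;
    }
}

class TableSourceSketch<K, V> {
    private final Map<K, V> store = new HashMap<>();
    private boolean sendOldValues = false;

    // Downstream processors call this only if they really need old values.
    void enableSendingOldValues() {
        sendOldValues = true;
    }

    Change<V> update(final K key, final V newValue) {
        // The store read is skipped entirely unless old values were requested.
        final V oldValue = sendOldValues ? store.get(key) : null;
        store.put(key, newValue);
        return new Change<>(newValue, oldValue);
    }
}
```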
Reviewers: Matthias J. Sax <mjsax@apache.org>
Refactor the materialization of source KTables as follows:
If Materialized.as(queryableName) is specified, materialize;
If a downstream operator needs to fetch from this KTable via ValueGetters, materialize;
If a downstream operator requires old values to be sent, materialize.
Otherwise do not materialize the KTable. E.g. builder.table("topic").filter().toStream().to("topic") would not create any state stores.
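For illustration, a small DSL example of the rules above (topic and store names are invented):

```java
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.Materialized;

public class SourceTableMaterializationExample {
    public static void main(final String[] args) {
        final StreamsBuilder builder = new StreamsBuilder();

        // No queryable name, no ValueGetters, no old values needed downstream:
        // this source KTable does not need a state store.
        builder.table("input-topic")
               .filter((key, value) -> value != null)
               .toStream()
               .to("output-topic");

        // Materialized.as supplies a queryable name, so this source KTable is materialized.
        builder.table("other-topic", Materialized.as("other-topic-store"));

        System.out.println(builder.build().describe());
    }
}
```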
There are a couple of minor changes in this PR as well:
KTableImpl's queryableStoreName and isQueryable are merged into queryableStoreName only; if it is null, the table is not queryable, and as long as it is non-null it is queryable (i.e. internally generated names are no longer used as queryable names).
To achieve this, MaterializedInternal.storeName() and MaterializedInternal.queryableName() are split. The former can be internally generated and is not exposed to users. The queryableName can be set to the internal store name if we decide to materialize the table during the DSL parsing / physical topology generation phase, and only if queryableName is specified is the corresponding KTable treated as materialized.
Found some overlapping unit tests among KTableImplTest and the KTableXXTest classes, and removed them.
A few typing bugs were found along the way and fixed as well.
-----------------------
This PR is an illustration of a POC experiment towards logical materialization.
Today we logically materialize the KTable for filter / mapValues / transformValues if no queryableName is specified via Materialized, but whenever users specify a queryableName we still always materialize. My original goal was to also consider logical materialization for queryable stores, but when implementing it via a wrapped store that applies the transformations on the fly, I realized it is tougher than I thought: we not only need to support fetch or get, but also range queries, approximateNumEntries, isOpen, etc., which are not efficient to support. So in the end I'd suggest we stick with the rule of always materializing if a queryableName is specified, and only consider logical materialization otherwise.
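To make the concern concrete, here is a rough sketch of what such a wrapped store could look like (a hypothetical class, assuming a simple filter transformation): get can apply the transformation lazily, but range, all, and approximateNumEntries have no efficient lazy equivalent.

```java
import java.util.function.Predicate;

import org.apache.kafka.streams.state.KeyValueIterator;
import org.apache.kafka.streams.state.ReadOnlyKeyValueStore;

// Hypothetical "logically materialized" view over a parent store; not part of the PR.
class FilteredViewSketch<K, V> implements ReadOnlyKeyValueStore<K, V> {
    private final ReadOnlyKeyValueStore<K, V> parent;
    private final Predicate<V> predicate;

    FilteredViewSketch(final ReadOnlyKeyValueStore<K, V> parent, final Predicate<V> predicate) {
        this.parent = parent;
        this.predicate = predicate;
    }

    @Override
    public V get(final K key) {
        // Point lookups are easy: read the parent and apply the transformation on the fly.
        final V value = parent.get(key);
        return value != null && predicate.test(value) ? value : null;
    }

    @Override
    public KeyValueIterator<K, V> range(final K from, final K to) {
        // Would have to wrap the parent iterator and filter every record on the way out.
        throw new UnsupportedOperationException("not efficiently supported without materialization");
    }

    @Override
    public KeyValueIterator<K, V> all() {
        throw new UnsupportedOperationException("not efficiently supported without materialization");
    }

    @Override
    public long approximateNumEntries() {
        // The parent's count does not reflect the filter, so there is no cheap answer here.
        throw new UnsupportedOperationException("not efficiently supported without materialization");
    }
}
```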
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <mjsax@apache.org>
We saw a log statement in which the cause of the failure to write a checkpoint was not properly logged.
This change logs the exception properly and also verifies the log message.
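As a general illustration (the logger, file name, and surrounding code are hypothetical, not the exact Streams code path): passing the exception as the last argument lets SLF4J print the full stack trace rather than only the message text.

```java
import java.io.IOException;
import java.nio.file.Path;

import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

public class CheckpointLoggingSketch {
    private static final Logger log = LoggerFactory.getLogger(CheckpointLoggingSketch.class);

    // Stand-in for the real checkpoint-writing code path.
    static void writeCheckpoint(final Path checkpointFile) {
        try {
            throw new IOException("disk full"); // stand-in for the actual write failure
        } catch (final IOException e) {
            // Passing the Throwable as the final argument logs its stack trace as the cause.
            log.warn("Failed to write offset checkpoint file to {}", checkpointFile, e);
        }
    }
}
```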
Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Changes made as part of this commit:
- Improved error message readability in the millis validation utility.
- Corrected Java documentation on the `AdvanceInterval` check.
- Added a caller-specific prefix to make the error message clearer to developers/users.
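A rough, self-contained sketch of the idea (names are illustrative, not the actual utility): the caller supplies a prefix identifying the parameter, so the failure message tells users exactly which argument was at fault.

```java
import java.time.Duration;

final class MillisValidationSketch {
    // Validates that the duration is non-null and fits into a long millisecond value,
    // prefixing any error with caller-supplied context such as "advance of TimeWindows".
    static long validateMillis(final Duration duration, final String messagePrefix) {
        if (duration == null) {
            throw new IllegalArgumentException(messagePrefix + " should not be null");
        }
        try {
            return duration.toMillis();
        } catch (final ArithmeticException e) {
            throw new IllegalArgumentException(messagePrefix + " cannot be converted to milliseconds", e);
        }
    }
}
```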
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Jacek Laskowski <jacek@japila.pl>
This is a system test for a rolling upgrade of a topology with a named repartition topic.
1. An initial Kafka Streams application is started on 3 nodes. The topology has one operation forcing a repartition, and the repartition topic is explicitly named (see the sketch after this list).
2. Each node is started and processing of data is validated.
3. Then one node is stopped (full stop is verified).
4. A property is set signaling the node to add operations to the topology before the repartition node, which forces a renumbering of all operators (except the repartition node).
5. Restart the node and confirm it processes records.
6. Repeat the steps for the other 2 nodes, completing the rolling upgrade.
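A brief sketch of the kind of topology under test (topic and repartition names are invented): naming the grouping pins the repartition topic name, so adding operators upstream later does not change it.

```java
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.Grouped;

public class NamedRepartitionExample {
    public static void main(final String[] args) {
        final StreamsBuilder builder = new StreamsBuilder();

        builder.<String, String>stream("input-topic")
               // selectKey forces a repartition before the aggregation
               .selectKey((key, value) -> value)
               // naming the grouping fixes the repartition topic name across topology changes
               .groupByKey(Grouped.as("named-repartition"))
               .count();

        System.out.println(builder.build().describe());
    }
}
```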
I ran two runs of the system test with 25 repeats in each run for a total of 50 test runs.
All test runs passed
Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>
In StreamsMetricsImpl, the parentSensors map was keeping references to Sensors after the sensors themselves had been removed.
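A simplified stand-in for the bookkeeping (the real StreamsMetricsImpl keys differ; this only illustrates the leak): removing the sensor without also removing its parentSensors entry keeps the reference alive.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch only, not the actual StreamsMetricsImpl.
class SensorBookkeepingSketch {
    private final Map<String, String> parentSensors = new HashMap<>(); // child sensor name -> parent sensor name

    void registerSensor(final String name, final String parentName) {
        parentSensors.put(name, parentName);
    }

    void removeSensor(final String name) {
        // ... remove the sensor from the underlying metrics registry ...
        // Without the next line the map entry outlives the sensor and pins the reference.
        parentSensors.remove(name);
    }
}
```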
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
This PR fixes an issue reported by a user. When we join a KStream with a GlobalKTable, we should not reset the repartition flag, as the stream may have previously changed its key, and the resulting stream could be used in an aggregation or joined with another stream, which may require a repartition for correct results.
I've added a test which fails without the fix.
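An illustration of the scenario (topic names are invented): the key changes before the global-table join, so the later groupByKey must still trigger a repartition; clearing the flag at the join would silently skip it.

```java
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.GlobalKTable;
import org.apache.kafka.streams.kstream.KStream;

public class GlobalTableJoinRepartitionExample {
    public static void main(final String[] args) {
        final StreamsBuilder builder = new StreamsBuilder();

        final KStream<String, String> orders = builder.stream("orders");
        final GlobalKTable<String, String> customers = builder.globalTable("customers");

        orders
            // key changed: anything key-based downstream needs a repartition
            .selectKey((orderId, customerId) -> customerId)
            .join(customers,
                  (customerId, order) -> customerId,             // map stream record to the global-table key
                  (order, customer) -> order + " / " + customer) // join the values
            // without the fix, the repartition flag was reset by the join and this count could be wrong
            .groupByKey()
            .count();

        System.out.println(builder.build().describe());
    }
}
```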
Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
* KAFKA-7367: Ensure stateless topologies don't require disk access
* KAFKA-7367: Streams should not create state store directories unless they are needed.
* Addressed the review comments.
* Addressed the review-2 comments.
* Fixed FileAlreadyExistsException
* Addressed the review-3 comments.
* Resolved the conflicts.
This is a new system test for optimizing an existing topology. The test takes the following steps:
1. Start a Kafka Streams application that uses a selectKey, then performs 3 groupByKey() operations and 1 join, creating four repartition topics.
2. Verify all instances start and process data.
3. Stop all instances and verify they have stopped.
4. For each stopped instance, update the TOPOLOGY_OPTIMIZATION config to all (see the config sketch below), then restart the instance and verify it has started successfully, also verifying Kafka Streams reduced the number of repartition topics from 4 to 1.
5. Verify that each instance is processing data from the aggregation, reduce, and join operations.
6. Stop all instances and verify the shutdown is complete.
For testing I ran two passes of the system test with 25 repeats each, for a total of 50 test runs.
All test runs passed
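For reference, a sketch of the config change each instance picks up on restart (other required configs are omitted); this is what lets Streams merge the repartition topics when the topology is built with these properties.

```java
import java.util.Properties;

import org.apache.kafka.streams.StreamsConfig;

public class OptimizationConfigSketch {
    public static void main(final String[] args) {
        final Properties props = new Properties();
        // ... application id, bootstrap servers, serdes, etc. ...
        // "all" (StreamsConfig.OPTIMIZE) turns on topology optimization,
        // which collapses the four repartition topics into one.
        props.put(StreamsConfig.TOPOLOGY_OPTIMIZATION, StreamsConfig.OPTIMIZE);
        System.out.println(props);
    }
}
```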
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Add the final batch of metrics from KIP-328
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
In the TopologyTestDriver constructor, set a non-null topic; and in the unit test, intentionally turn on caching to verify this case.
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>
This pull request removes the final reference to KStreamWindowReducer and replaces it with KStreamWindowAggregate
Signed-off-by: Samuel Hawker sam.b.hawker@gmail.com
This contribution is my original work and I license the work to the project under the project's open source license.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
The removed tests have counterparts covered by SuppressScenarioTest using the TopologyTestDriver.
This will speed up the build and improve stability in the CPU-constrained Jenkins environment.
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Callers of 1) Windows#until, 2) Windows#of, and 3) Serialized are replaced with the new APIs where possible.
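A small before/after sketch (serdes, topic, and durations are arbitrary) of what the replacements look like: Grouped replaces Serialized, and Duration-based TimeWindows.of plus an explicit grace period replace the long-based of/until pair.

```java
import java.time.Duration;

import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.Grouped;
import org.apache.kafka.streams.kstream.KStream;
import org.apache.kafka.streams.kstream.TimeWindows;

public class NewApiCallersExample {
    public static void main(final String[] args) {
        final StreamsBuilder builder = new StreamsBuilder();
        final KStream<String, String> stream = builder.stream("clicks");

        // Old style (deprecated):
        //   stream.groupByKey(Serialized.with(Serdes.String(), Serdes.String()))
        //         .windowedBy(TimeWindows.of(60_000L).until(300_000L));

        // New style: Grouped instead of Serialized, Duration-based window size plus grace.
        stream.groupByKey(Grouped.with(Serdes.String(), Serdes.String()))
              .windowedBy(TimeWindows.of(Duration.ofMinutes(1)).grace(Duration.ofMinutes(4)))
              .count();

        System.out.println(builder.build().describe());
    }
}
```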
Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bill@confluent.io>
- Use Xlint:all with 3 exclusions (filed KAFKA-7613 to remove the exclusions)
- Use the same javac options when compiling tests (seems accidental that
we didn't do this before)
- Replaced several deprecated method calls with non-deprecated ones:
- `KafkaConsumer.poll(long)` and `KafkaConsumer.close(long)` (see the sketch after this list)
- `Class.newInstance` and `new Integer/Long` (deprecated since Java 9)
- `scala.Console` (deprecated in Scala 2.11)
- `PartitionData` taking a timestamp (one of them seemingly a bug)
- `JsonMappingException` single parameter constructor
- Fix unnecessary usage of raw types in several places.
- Add @SuppressWarnings for deprecations, unchecked and switch fallthrough in
several places.
- Scala clean-ups (var -> val, ETA expansion warnings, avoid reflective calls)
- Use lambdas to simplify code in a few places
- Add @SafeVarargs, fix varargs usage and remove unnecessary `Utils.mkList` method
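For example, the consumer replacements look roughly like this (the wrapper method is made up; the timeout values are arbitrary):

```java
import java.time.Duration;

import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerRecords;

final class ConsumerPollMigrationSketch {
    // The deprecated long-based poll and close calls are replaced by the Duration overloads.
    static void pollOnceAndClose(final Consumer<String, String> consumer) {
        final ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(100));
        System.out.println("fetched " + records.count() + " records");
        consumer.close(Duration.ofSeconds(5));
    }
}
```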
Reviewers: Matthias J. Sax <mjsax@apache.org>, Manikumar Reddy <manikumar.reddy@gmail.com>, Randall Hauch <rhauch@gmail.com>, Bill Bejeck <bill@confluent.io>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>
This patch makes two improvements to internal metadata handling logic and testing:
1. It reduces the dependence on the public `Cluster` object for internal metadata propagation, since it is not easy to evolve. As an example, we need to propagate leader epochs from the metadata response to `Metadata`, but it is not straightforward to do this without exposing them in `PartitionInfo`, since that is what `Cluster` uses internally. With this change, we are able to remove some redundant `Cluster` building logic.
2. It makes the metadata handling in `MockClient` simpler and more consistent. Currently we have a mix of metadata update mechanisms which are internally inconsistent with each other and do not match the implementation in `NetworkClient`.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Sets StreamsConfig.STATE_DIR_CONFIG to a temp directory in
SuppressionIntegrationTest, to match StreamsTestUtils.
This is a similar fix to #5826.
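The change amounts to something like the following in the test setup (treat the exact helper call as illustrative of the pattern rather than the verbatim diff):

```java
import java.util.Properties;

import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.test.TestUtils;

final class StateDirTestConfigSketch {
    static Properties testProps() {
        final Properties props = new Properties();
        // Point the state directory at a fresh temp dir so the test does not
        // depend on the default state directory existing on the build machine.
        props.setProperty(StreamsConfig.STATE_DIR_CONFIG, TestUtils.tempDirectory().getAbsolutePath());
        return props;
    }
}
```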
Reviewers: Ismael Juma <ismael@juma.me.uk>
Set `StreamsConfig.STATE_DIR_CONFIG` in `SuppressScenarioTest`, as in
`StreamsTestUtils`. I have deliberately avoided using `StreamsTestUtils`, as
this test sets bogus config parameters but still fails if the default
`STATE_DIR_CONFIG` does not exist.
Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, John Roesler <john@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Just a doc change
Author: John Eismeier <john.eismeier@gmail.com>
Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>
Closes #4573 from jeis2497052/trunk
#5804 removed `Windows#segmentInterval`, but did not remove all references to it.
Author: John Roesler <john@confluent.io>
Reviewers: Damian Guy <damian.guy@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes #5806 from vvcephei/fix-missing-segment-interval
While working on the documentation updates I realized the Streams Scala API needs
to be updated for the addition of Grouped.
Added a test for Grouped.scala and ran all streams-scala tests and streams tests.
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>
Stop using current system time by default, as it introduces non-determinism.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Nikolay Izhikov <nizhikov@apache.org>
Reviewers: Satish Duggana <sduggana@hortonworks.com>, Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>