Segmented state stores turn on bulk loading of the underlying RocksDB
when restoring. This is correct for segmented state stores that are
restoring on active tasks, where onRestoreStart() and onRestoreEnd() in
RocksDBSegmentsBatchingRestoreCallback take care of toggling bulk loading
mode on and off. However, restoreAll() in RocksDBSegmentsBatchingRestoreCallback
might also turn on bulk loading mode. When this happens on a standby task,
bulk loading mode is never turned off. That leads to a steadily increasing
number of open file descriptors in RocksDB, because in bulk loading mode
RocksDB continuously creates new files but never compacts them (which is
the intended behaviour of bulk loading).
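For context, a minimal sketch of what toggling bulk loading involves, using only the plain RocksDB Java API (illustrative, not the actual RocksDBSegmentsBatchingRestoreCallback code):

```java
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

public class BulkLoadToggleSketch {

    // Options used while restoring: auto-compaction is disabled, so SST files
    // accumulate until compaction is explicitly triggered again.
    public static Options bulkLoadOptions() {
        return new Options()
            .setCreateIfMissing(true)
            .prepareForBulkLoad();
    }

    // Must run once restoration finishes; if it never runs (the standby case
    // described above), the files created during bulk loading are never merged
    // and open file descriptors keep growing.
    public static void finishBulkLoad(final RocksDB db) throws RocksDBException {
        db.compactRange();
    }
}
```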
Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
* MINOR: Fix typo in RecordAccumulator
* MINOR: Fix typo in several files
Reviewers: Ron Dagostino <rdagostino@confluent.io>, Konstantine Karantasis <konstantine@confluent.io>
A standby task could also be at risk of getting into an illegal state when not being closed during handleLostAll:
1. The standby task was initializing in CREATED state, and a task corrupted exception was thrown from registerStateStores
2. The task corrupted exception was caught, and a commit was attempted for the non-affected tasks
3. The task commit failed due to a task migrated exception
4. handleLostAll didn't close the standby task, leaving it in CREATED state
5. After the next rebalance completed, the same task was assigned back as a standby task
6. An illegal argument exception was thrown because the state store was already registered
Reviewers: A. Sophie Blee-Goldman <ableegoldman@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
As stated, we couldn't wait for handleRebalanceComplete in the case of handleLostAll, as we already closed the active task as dirty, and could potentially require its offset in the next thread.runOnce call.
Co-authored-by: Guozhang Wang <wangguoz@gmail.com>
Reviewers: A. Sophie Blee-Goldman <ableegoldman@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
Fixes EmbeddedKafkaCluster.deleteTopicAndWait for use with kafka_2.13
Reviewers: Boyang Chen <boyang@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, John Roesler <vvcephei@apache.org>
This is a prerequisite for KAFKA-9501 and will also be useful for KAFKA-9603.
There should be no logical changes here: the main difference is the removal of StandbyContextImpl in preparation for contexts to transition between active and standby.
Also includes some minor cleanup, e.g. pulling the ReadOnly/ReadWrite decorators out into a separate file.
Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>, Guozhang Wang <wangguoz@gmail.com>
The store's registered callback could also be a restore listener, in which case it should be triggered along with the user-specified global listener.
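A minimal sketch of such a callback, using only the public StateRestoreCallback and StateRestoreListener interfaces (the class itself is hypothetical):

```java
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.streams.processor.StateRestoreCallback;
import org.apache.kafka.streams.processor.StateRestoreListener;

public class ListeningRestoreCallback implements StateRestoreCallback, StateRestoreListener {

    @Override
    public void restore(final byte[] key, final byte[] value) {
        // apply the restored record to the store
    }

    @Override
    public void onRestoreStart(final TopicPartition partition, final String storeName,
                               final long startingOffset, final long endingOffset) {
        // with this fix, invoked in addition to the global restore listener
    }

    @Override
    public void onBatchRestored(final TopicPartition partition, final String storeName,
                                final long batchEndOffset, final long numRestored) { }

    @Override
    public void onRestoreEnd(final TopicPartition partition, final String storeName,
                             final long totalRestored) { }
}
```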
Reviewers: Boyang Chen <boyang@confluent.io>, Matthias J. Sax <matthias@confluent.io>
A small hotfix to avoid an extra probing rebalance the first time an application is launched.
This should particularly improve the testing experience.
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@apache.org>
Validate that the assignment is always balanced with respect to the following (a sketch of one such check appears after the list):
* active assignment balance
* stateful assignment balance
* task-parallel balance
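A rough sketch of a balance check of this kind, with hypothetical helper names (the real assertions live in the assignor test code):

```java
import java.util.Collection;

final class BalanceCheckSketch {

    // An assignment is considered balanced if the per-client task counts do
    // not differ by more than the allowed balance factor.
    static boolean isBalanced(final Collection<Integer> tasksPerClient, final int balanceFactor) {
        final int max = tasksPerClient.stream().mapToInt(Integer::intValue).max().orElse(0);
        final int min = tasksPerClient.stream().mapToInt(Integer::intValue).min().orElse(0);
        return max - min <= balanceFactor;
    }
}
```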
Reviewers: Bruno Cadonna <bruno@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>
Drops idempotent updates from KTable source operators.
Specifically, it drops updates in which the value is unchanged
and the timestamp is the same or larger.
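A minimal sketch of the check described above (the real logic lives inside the KTable source processor; the class and method names here are hypothetical):

```java
import java.util.Arrays;

final class IdempotentUpdateCheck {

    // An update is idempotent, and can be dropped, if the serialized value is
    // unchanged and the timestamp does not move backwards.
    static boolean canDrop(final byte[] oldValue, final long oldTimestamp,
                           final byte[] newValue, final long newTimestamp) {
        return Arrays.equals(oldValue, newValue) && newTimestamp >= oldTimestamp;
    }
}
```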
Implements: KIP-557
Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>
We spotted a case in the soak test where a standby task could be in CREATED state during a commit, which causes an IllegalStateException. To prevent this from happening, the fix is to always enforce a state check.
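A minimal sketch of such a guard, assuming state names that mirror the task lifecycle (the actual check lives in the task code):

```java
// Hypothetical enum mirroring the task lifecycle states.
enum TaskState { CREATED, RESTORING, RUNNING, SUSPENDED, CLOSED }

final class CommitGuard {

    // Only tasks that have completed initialization may be committed.
    static void ensureCommittable(final TaskState state) {
        if (state == TaskState.CREATED || state == TaskState.CLOSED) {
            throw new IllegalStateException("Cannot commit a task in state " + state);
        }
    }
}
```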
Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@apache.org>, Guozhang Wang <wangguoz@gmail.com>
Currently we add "Data" to all generated classnames in order to avoid naming collisions with existing Request/Response objects. Generated classes for other persistent schema definitions (such as those used in `GroupCoordinator` and `TransactionCoordinator`) will not necessarily have the same problem, so it would be nice if the generated types could use the name defined in the schema directly.
Reviewers: Boyang Chen <boyang@confluent.io>, Colin P. McCabe <cmccabe@apache.org>
Compared with all other test cases, shouldAllowConcurrentAccesses starts an async producer that sends records throughout the test, rather than just synchronously sending and acking a few records before we start the Streams application. Right after the Streams app is started, we check that at least one record has been sent to the output topic (i.e. has completed processing). However, since only this test starts the producer asynchronously and does not wait for it to complete, it is possible that the async producer takes too long to produce some records, causing the test to fail.
To follow what the other tests do, I let this test first send one round of records synchronously before starting the async producer.
I also fixed some new Scala warnings along with this PR.
Reviewers: Matthias J. Sax <matthias@confluent.io>
1. Added a recordInternal function that all other public record functions delegate to, so that shouldRecord is only checked once.
2. In Streams, pass the current wall-clock time along inside InternalProcessorContext during process / punctuate, so it can be passed in to the record function and reduce the calling frequency of SystemTime.milliseconds().
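A minimal sketch of the pattern in (1), with hypothetical names rather than the actual Sensor implementation:

```java
public class SensorSketch {

    private volatile boolean shouldRecord = true;

    public void record(final double value) {
        recordInternal(value, System.currentTimeMillis());
    }

    public void record(final double value, final long timeMs) {
        recordInternal(value, timeMs);
    }

    // Single gate for all public overloads: shouldRecord is checked exactly once.
    private void recordInternal(final double value, final long timeMs) {
        if (!shouldRecord) {
            return;
        }
        // ... update the underlying stats with (value, timeMs)
    }
}
```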
Reviewers: John Roesler <vvcephei@apache.org>
This reverts commit 29e08fd2c2.
There turned out to be more problems than expected with adding the generic parameters.
Reviewers: Matthias J. Sax <matthias@confluent.io>
Since we cannot guarantee to reassign the correct number of
standby tasks when reusing the previous assignment, and the
reassignment is rather a micro-optimization, it is removed
to keep the algorithm correct and simple.
Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, John Roesler <vvcephei@apache.org>
This PR fixes two major issues:
1. When calling KafkaStreams#store we can always get an InvalidStateStoreException, and even waiting for the Streams state to become RUNNING is not sufficient (this is also how OptimizedKTableIntegrationTest failed). So I wrapped all such calls with a util wrapper that catches that exception and retries.
2. While troubleshooting this issue, I also realized a potential bug in test-util's produceKeyValuesSynchronously, which creates a new producer for each record to send in that batch --- i.e. if you are sending N records with a single call, within that call it will create N producers used to send one record each, which is very slow and costly.
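A minimal sketch of a retry wrapper like the one described in (1); the helper name and retry policy are illustrative, not the actual test utility:

```java
import java.util.function.Supplier;
import org.apache.kafka.streams.errors.InvalidStateStoreException;

final class StoreQueryRetry {

    // Retries the store query until it succeeds or the timeout is reached.
    static <T> T retryOnInvalidStateStore(final Supplier<T> query,
                                          final long timeoutMs) throws InterruptedException {
        final long deadline = System.currentTimeMillis() + timeoutMs;
        while (true) {
            try {
                return query.get();
            } catch (final InvalidStateStoreException e) {
                if (System.currentTimeMillis() > deadline) {
                    throw e;
                }
                Thread.sleep(100L); // store not queryable yet (e.g. rebalancing)
            }
        }
    }
}
```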
Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, John Roesler <john@confluent.io>
* add a config to set the TaskAssignor
* set the default assignor to HighAvailabilityTaskAssignor
* fix broken tests (with some TODOs in the system tests)
Implements: KIP-441
Reviewers: Bruno Cadonna <bruno@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>
These two options are essentially incompatible, as caching will do nothing to reduce downstream traffic and writes when it has to allow non-unique keys (skipping records where the value is also the same is a separate issue, see KIP-557). But enabling caching on a store that's configured to retain duplicates is actually more than just ineffective, and currently causes incorrect results.
We should just log a warning and disable caching whenever a store is retaining duplicates to avoid introducing a regression. Maybe when 3.0 comes around we should consider throwing an exception instead to alert the user more aggressively.
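A minimal sketch of the problematic combination, using only public APIs (store name and parameters are illustrative):

```java
import java.time.Duration;
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.state.StoreBuilder;
import org.apache.kafka.streams.state.Stores;
import org.apache.kafka.streams.state.WindowStore;

public class DuplicatesWithCachingSketch {

    public static StoreBuilder<WindowStore<String, String>> build() {
        return Stores.windowStoreBuilder(
                Stores.persistentWindowStore(
                        "join-store",
                        Duration.ofHours(1),    // retention
                        Duration.ofMinutes(5),  // window size
                        true),                  // retainDuplicates
                Serdes.String(),
                Serdes.String())
            // Ineffective with retained duplicates and, before this change,
            // could produce incorrect results; it is now disabled with a warning.
            .withCachingEnabled();
    }
}
```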
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, John Roesler <john@confluent.io>
When debugging KAFKA-9388, I found the reason the second test method takes much longer (10s) than the previous one (~500ms) is that they used the same app.id. When the previous clients are shut down, they do not send a leave-group request, and hence we are still depending on the session timeout (10s) for the members to be removed from the group.
When the second test is triggered, its members join the same group because of the same application id, and the prepare-rebalance phase would wait for the full rebalance timeout before it kicks out the previous members.
Setting different application ids resolves such issues for integration tests --- I did a quick search and found some other integration tests have the same issue. And after this PR my local unit test runtime reduced from about 14min to 7min.
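A minimal sketch of giving each test its own application.id (config values are illustrative):

```java
import java.util.Properties;
import org.apache.kafka.streams.StreamsConfig;

public class UniqueAppIdSketch {

    public static Properties streamsConfig(final String testName, final String bootstrapServers) {
        final Properties props = new Properties();
        // Unique per test method, so each test forms a fresh consumer group
        // instead of waiting out the previous group's session timeout.
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "app-" + testName + "-" + System.currentTimeMillis());
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, bootstrapServers);
        return props;
    }
}
```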
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, John Roesler <john@confluent.io>
1. In both the RocksDBMetrics and Metrics integration tests, we do not need to wait for the consumer to consume records from the output topics, since the sensors / metrics are registered upon task creation.
2. Merged the two RocksDB test cases into one app that creates two state stores (non-segmented and segmented).
With these two changes, local runtime of these two tests reduced from 2min+ and 3min+ to under a minute.
Reviewers: Bruno Cadonna <bruno@confluent.io>, Matthias J. Sax <matthias@confluent.io>
One of the new RocksDB unit tests creates a non-temporary RocksDB directory wherever the test is run from, with some RocksDB files left behind after the test(s) are done. We should use the tempDirectory utility for this test.
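A minimal sketch of opening the test DB under a temp directory instead of the working directory (using the JDK temp dir here; Kafka's own tempDirectory test utility serves the same purpose):

```java
import java.io.File;
import java.nio.file.Files;
import org.rocksdb.Options;
import org.rocksdb.RocksDB;

public class TempDirRocksDbSketch {

    public static RocksDB openTestDb() throws Exception {
        // Create the store under a temp directory so nothing is left behind
        // in the working directory after the test finishes.
        final File dbDir = Files.createTempDirectory("rocksdb-test").toFile();
        final Options options = new Options().setCreateIfMissing(true);
        return RocksDB.open(options, dbDir.getAbsolutePath());
    }
}
```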
Reviewers: Guozhang Wang <wangguoz@gmail.com>