src-kafka

Commit Graph

Author	SHA1	Message	Date
Jason Gustafson	9da6823a3a	HOTFIX: Fix breakage in `ConsumerPerformanceTest` (#8113 ) Test cases in `ConsumerPerformanceTest` were failing and causing a system exit rather than throwing the expected exception following #8023. We didn't catch this because the tests were marked as skipped and not failed. Reviewers: Guozhang Wang <guozhang@confluent.io>	5 years ago
Mitch	96c69da8c1	KAFKA-8507; Unify connection name flag for command line tool [KIP-499] (#8023 ) This change updates ConsoleProducer, ConsumerPerformance, VerifiableProducer, and VerifiableConsumer classes to add and prefer the --bootstrap-server flag for defining the connection point of the Kafka cluster. This change is part of KIP-499: https://cwiki.apache.org/confluence/display/KAFKA/KIP-499+-+Unify+connection+name+flag+for+command+line+tool. Reviewers: Ron Dagostino <rdagostino@confluent.io>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Chia-Ping Tsai <chia7712@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Xavier Léauté	7e1c39f75a	KAFKA-9106 make metrics exposed via jmx configurable (#7674 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Rajini Sivaram <rajinisivaram@googlemail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Stanislav Kozlovski	ea72edebf2	MINOR: Do not override retries for idempotent producers (#8097 ) The KafkaProducer code would set infinite retries (MAX_INT) if the producer was configured with idempotence and no retries were configured by the user. This is superfluous because KIP-91 changed the retry functionality to both be time-based and the default retries config to be MAX_INT. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
huxi	46e80dbd20	KAFKA-9538; Fix flaky test `testResetOffsetsExportImportPlan` (#6561 ) This patch adds logic to the test case to ensure that consumer groups are in a valid state prior to attempting offset reset. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Konstantine Karantasis	97d2c726f1	MINOR: Small Connect integration test fixes (#8100 ) Author: Konstantine Karantasis <konstantine@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	5 years ago
Lev Zemlyanov	f51e1e6548	allow ReplaceField SMT to handle tombstone records (#7731 ) Signed-off-by: Lev Zemlyanov <lev@confluent.io>	5 years ago
Lev Zemlyanov	c8f1ee9cd9	KAFKA-9192: fix NPE when for converting optional json schema in structs (#7733 ) Author: Lev Zemlyanov <lev@confluent.io> Reviewers: Greg Harris <gregh@confluent.io>, Randall Hauch <rhauch@gmail.com>	5 years ago
Boyang Chen	07db26c20f	KAFKA-9417: New Integration Test for KIP-447 (#8000 ) This change mainly have 2 components: 1. extend the existing transactions_test.py to also try out new sendTxnOffsets(groupMetadata) API to make sure we are not introducing any regression or compatibility issue a. We shrink the time window to 10 seconds for the txn timeout scheduler on broker so that we could trigger expiration earlier than later 2. create a completely new system test class called group_mode_transactions_test which is more complicated than the existing system test, as we are taking rebalance into consideration and using multiple partitions instead of one. For further breakdown: a. The message count was done on partition level, instead of global as we need to visualize the per partition order throughout the test. For this sake, we extend ConsoleConsumer to print out the data partition as well to help message copier interpret the per partition data. b. The progress count includes the time for completing the pending txn offset expiration c. More visibility and feature improvements on TransactionMessageCopier to better work under either standalone or group mode. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Matthias J. Sax	aa0d0ec32a	KAFKA-6607: Commit correct offsets for transactional input data (#8091 ) Reviewers: Guozhang Wang <guozhang@confluent.io>	5 years ago
David Jacot	2cbd3d7519	KAFKA-9499; Improve deletion process by batching more aggressively (#8053 ) This PR speeds up the deletion process by doing the following: - Batch whenever possible to minimize the number of requests sent out to other brokers; - Refactor `onPartitionDeletion` to remove the usage of `allLiveReplicas`. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Jason Gustafson	0a5dec0b3a	MINOR: Fix unnecessary metadata fetch before group assignment (#8095 ) The recent increase in the flakiness of one of the offset reset tests (KAFKA-9538) traces back to https://github.com/apache/kafka/pull/7941. After investigation, we found that following this patch, the consumer was sending an additional metadata request prior to performing the group assignment. This slight timing difference was enough to trigger the test failures. The problem turned out to be due to a bug in `SubscriptionState.groupSubscribe`, which no longer counted the local subscription when determining if there were new topics to fetch metadata for. Hence the extra metadata update. This patch restores the old logic. Without the fix, we saw 30-50% test failures locally. With it, I could no longer reproduce the failure. However, #6561 is probably still needed to improve the resilience of this test. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	5 years ago
John Roesler	1681c78f60	KAFKA-9500: Fix FK Join Topology (#8015 ) Corrects a flaw leading to an exception while building topologies that include both: * A foreign-key join with the result not explicitly materialized * An operation after the join that requires source materialization Also corrects a flaw in TopologyTestDriver leading to output records being enqueued in the wrong order under some (presumably rare) circumstances. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
John Roesler	998f1520f9	KAKFA-9503: Fix TopologyTestDriver output order (#8065 ) Migrates TopologyTestDriver processing to be closer to the same processing/ordering semantics as KafkaStreams. This corrects the output order for recursive topologies as reported in KAFKA-9503, and also works similarly in the case of task idling.	5 years ago
Bruno Cadonna	cde6d18983	KAFKA-9355: Fix bug that removed RocksDB metrics after failure in EOS (#7996 ) * Added init() method to RocksDBMetricsRecorder * Added call to init() of RocksDBMetricsRecorder to init() of RocksDB store * Added call to init() of RocksDBMetricsRecorder to openExisting() of segmented state stores * Adapted unit tests * Added integration test that reproduces the situation in which the bug occurred Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
John Roesler	e16859dc48	KAFKA-9390: Make serde pseudo-topics unique (#8054 ) During the discussion for KIP-213, we decided to pass "pseudo-topics" to the internal serdes we use to construct the wrapper serdes for CombinedKey and hashing the left-hand-side value. However, during the implementation, this strategy wasn't fully implemented, and we wound up using the same topic name for a few different data types. Reviewers: Guozhang Wang <guozhang@confluent.io>	5 years ago
Colin Patrick McCabe	a149c3effa	MINOR: improve error reporting in DescribeConsumerGroupTest (#8080 ) Reviewers: David Arthur <mumrah@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	5 years ago
high.lee	dc89c86d43	KAFKA-9483: Add Scala KStream#toTable to the Streams DSL (#8024 ) Part of KIP-523 Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>	5 years ago
Matthias J. Sax	50aead64b9	MINOR: fix and improve StreamsConfig JavaDocs (#8086 ) Reviewer: John Roesler <john@confluent.io>	5 years ago
Konstantine Karantasis	0f14aef8cb	MINOR: Start using Response and replace IOException in EmbeddedConnectCluster for failures (#8055 ) Changed `EmbeddedConnectCluster` to add utility methods that return `Response`, throw `ConnectException` instead of `IOException` for failures, and deprecate the old methods that returned primitive types rather than `Response`. Also introduce common assertions for embedded clusters under `EmbeddedConnectClusterAssertions`. Author: Konstantine Karantasis <konstantine@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	5 years ago
Gunnar Morling	5727b24509	KAFKA-7052 Avoiding NPE in ExtractField SMT in case of non-existent fields (#8059 ) Author: Gunnar Morling <gunnar.morling@googlemail.com> Reviewer: Randall Hauch <rhauch@gmail.com>	5 years ago
John Roesler	520a76155c	KAFKA-9517: Fix default serdes with FK join (#8061 ) During the KIP-213 implementation and verification, we neglected to test the code path for falling back to default serdes if none are given in the topology. Reviewer: Bill Bejeck <bbejeck@gmail.com>	5 years ago
Boyang Chen	ff8c40ccb6	KAFKA-9523: Migrate BranchedMultiLevelRepartitionConnectedTopologyTest into a unit test (#8081 ) Relying on integration test to catch an algorithm bug introduces more flakiness, reduce the test into a unit test to reduce the flakiness until we upgrade Java/Scala libs. Checked the test shall fail with older version of StreamsPartitionAssignor. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Bruno Cadonna	3dfc6c15e4	KAFKA-9480: Fix bug that prevented to measure task-level process-rate (#8018 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Guozhang Wang	e70e5d913a	KAFKA-9505: Only loop over topics-to-validate in retries (#8039 ) Found this bug from the repeated flaky runs of system tests, it seems to be long lurking but also would only happen if there are frequent rebalances / topic creation within a short time, which is exactly the case in some of our smoke system tests. Also added a unit test. Reviewers: Boyang Chen <boyang@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Manikumar Reddy	41fdae35df	MINOR: Update schema field names in DescribeAcls Request/Response Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin Patrick McCabe <cmccabe@apache.org> Closes #8075 from omkreddy/KAFKA-9026-Fix	5 years ago
Navinder Pal Singh Brar	d76fa1b22d	KAFKA-9487: Follow-up PR of Kafka-9445 (#8033 ) Follows up on the original PR for KAFKA-9445 to address a final round of feedback Reviewers: John Roesler <vvcephei@apache.org>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Sönke Liebau	3b1c61385b	KAFKA-9423: Refine layout of configuration options on website and make individual settings directly linkable (#7955 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	5 years ago
Brian Byrne	0f8698a329	KAFKA-8904: Improve producer's topic metadata fetching. (#7781 ) When the producer encouteres new topic(s), it now only fetches the metadata for the new topics. For cases where a producer interacts with a lot of topics, this reduces the cost for the topic being evicted from the cache, and during startup when populating the topic cache. Additionally adds a new producer configuration variable 'metadata.max.idle.ms', which controls how long topic metadata may be idle (i.e. not produced to) before it's finally discarded from the metadata cache. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, dengziming <dengziming1993@gmail.com>	5 years ago
Matthias J. Sax	059a81e3c9	KAFKA-7658: Follow up to original PR (#8027 ) Follow up to original PR #7985 for KIP-523 (adding `KStream#toTable()` operator) - improve JavaDocs - add more unit tests - fix bug for auto-repartitioning - some code cleanup Reviewers: High Lee <yello1109@daum.net>, John Roesler <john@confluent.io>	5 years ago
Ron Dagostino	342f13a838	KAFKA-8843: KIP-515: Zookeeper TLS support Signed-off-by: Ron Dagostino <rdagostinoconfluent.io> Author: Ron Dagostino <rdagostino@confluent.io> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #8003 from rondagostino/KAFKA-8843	5 years ago
Zach Zhang	7b5b15e2f8	MINOR: Add missing quote for malformed line content (#8070 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	5 years ago
Kun Song	87eaa5396d	MINOR: Simplify KafkaProducerTest (#8044 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ron Dagostino <rndgstn@gmail.com>	5 years ago
David Mao	7a2a198d1e	KAFKA-9507; AdminClient should check for missing committed offsets (#8057 ) Addresses exception being thrown by `AdminClient` when `listConsumerGroupOffsets` returns a negative offset. A negative offset indicates the absence of a committed offset for a requested partition, and should result in a null in the returned offset map. Reviewers: Anna Povzner <anna@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Sanjana Kaundinya	be4a6ddebb	KAFKA-9519: Deprecate the --zookeeper flag in ConfigCommand (#8056 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ron Dagostino <rndgstn@gmail.com>	5 years ago
Sanjana Kaundinya	4d86b191c0	KAFKA-9509; Fixing flakiness of MirrorConnectorsIntegrationTest.testReplication (#8048 ) The test case `org.apache.kafka.connect.mirror.MirrorConnectorsIntegrationTest.testReplication` has shown to be increasingly flaky recently. This PR aims to make this test more deterministic. Specifically, the flakiness was due to a timing issue between the tasks not starting up in time for the test to start running. This PR remediates that by introducing a status check after every connector is started up. These status checks include that the connector is found on the connect cluster as well as there are tasks created and up and running for that connector. These checks are introduced before the test starts running so that there is a confidence that the connectors and tasks are started up correctly before the test runs. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Guozhang Wang	a6c9e96bd3	HOTFIX: Fix two test failures in JDK11 (#8063 ) 1. StoreChangelogReaderTest.shouldRequestCommittedOffsetsAndHandleTimeoutException[1] This is due to stricter ternary operator type casting 2. KStreamImplTest.shouldSupportTriggerMaterializedWithKTableFromKStream This is added recently where String typed values for <String, Integer>, in J8 it is allowed but in J11 it is not allowed. Reviewers: John Roesler <john@confluent.io>	5 years ago
Joel Hamill	83e1a8d71c	DOCS - clarify transactionalID and idempotent behavior (#7821 ) If transactional.id is set without setting enable.idempotence, the producer will set enable.idempotence to true implicitly. The docs should reflect this. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
A. Sophie Blee-Goldman	f698f3f840	MINOR: further InternalTopologyBuilder cleanup (#8046 ) Followup to KAFKA-7317 and KAFKA-9113, there's some additional cleanup we can do in InternalTopologyBuilder. Mostly refactors the subscription code to make the initialization more explicit and reduce some duplicated code in the update logic. Also some minor cleanup of the build method. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	5380938f8b	MINOR: Add timer for update limit offsets (#8047 ) Instead of always try to update committed offset limits as long as there are buffered records for standby tasks, we leverage on the commit interval to reduce our consumer.committed frequency. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, John Roesler <john@confluent.io>	5 years ago
Boyang Chen	f48946f572	HOTFIX: Fix spotsbug failure in Kafka examples (#8051 ) Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Boyang Chen	9d17bf98b6	KAFKA-9447: Add new customized EOS model example (#8031 ) With the improvement of 447, we are now offering developers a better experience on writing their customized EOS apps with group subscription, instead of manual assignments. With the demo, user should be able to get started more quickly on writing their own EOS app, and understand the processing logic much better. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Viktor Somogyi	987f0eeb31	KAFKA-8164: Add support for retrying failed (#8019 ) Disabled by default, but enabled for Jenkins PR builds (maximum of 1 retry per test with up to 5 retries for the test run). Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Guozhang Wang	7ea636c661	HOTFIX: checkstyle for newly added unit test	5 years ago
Jason Gustafson	ae0c6e58e5	KAFKA-9261; Client should handle unavailable leader metadata (#7770 ) The client caches metadata fetched from Metadata requests. Previously, each metadata response overwrote all of the metadata from the previous one, so we could rely on the expectation that the broker only returned the leaderId for a partition if it had connection information available. This behavior changed with KIP-320 since having the leader epoch allows the client to filter out partition metadata which is known to be stale. However, because of this, we can no longer rely on the request-level guarantee of leader availability. There is no mechanism similar to the leader epoch to track the staleness of broker metadata, so we still overwrite all of the broker metadata from each response, which means that the partition metadata can get out of sync with the broker metadata in the client's cache. Hence it is no longer safe to validate inside the `Cluster` constructor that each leader has an associated `Node` Fixing this issue was unfortunately not straightforward because the cache was built to maintain references to broker metadata through the `Node` object at the partition level. In order to keep the state consistent, each `Node` reference would need to be updated based on the new broker metadata. Instead of doing that, this patch changes the cache so that it is structured more closely with the Metadata response schema. Broker node information is maintained at the top level in a single collection and cached partition metadata only references the id of the broker. To accommodate this, we have removed `PartitionInfoAndEpoch` and we have altered `MetadataResponse.PartitionMetadata` to eliminate its `Node` references. Note that one of the side benefits of the refactor here is that we virtually eliminate one of the hotspots in Metadata request handling in `MetadataCache.getEndpoints` (which was renamed to `maybeFilterAliveReplicas`). The only reason this was expensive was because we had to build a new collection for the `Node` representations of each of the replica lists. This information was doomed to just get discarded on serialization, so the whole effort was wasteful. Now, we work with the lower level id lists and no copy of the replicas is needed (at least for all versions other than 0). Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago
David Jacot	5db02ead60	MINOR: Fix typos introduced in KIP-559 (#8042 ) A few references to KIP-559 in the schema definitions needed to be fixed. Reviewers: Brajesh Kumar <bristy@users.noreply.github.com>, Ron Dagostino <rdagostino@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Daniel Beskin	bdd0a9299f	MINOR: Fixing null handilg in ValueAndTimestampSerializer (#7679 ) Since ValueAndTimestampSerializer wraps an unknown Serializer, the output of that Serializer can be null. In which case the line .allocate(rawTimestamp.length + rawValue.length) will throw a NullPointerException. This pull request returns null instead. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
Guozhang Wang	4090f9a2b0	KAFKA-9113: Clean up task management and state management (#7997 ) This PR is collaborated by Guozhang Wang and John Roesler. It is a significant tech debt cleanup on task management and state management, and is broken down by several sub-tasks listed below: Extract embedded clients (producer and consumer) into RecordCollector from StreamTask. guozhangwang#2 guozhangwang#5 Consolidate the standby updating and active restoring logic into ChangelogReader and extract out of StreamThread. guozhangwang#3 guozhangwang#4 Introduce Task state life cycle (created, restoring, running, suspended, closing), and refactor the task operations based on the current state. guozhangwang#6 guozhangwang#7 Consolidate AssignedTasks into TaskManager and simplify the logic of changelog management and task management (since they are already moved in step 2) and 3)). guozhangwang#8 guozhangwang#9 Also simplified the StreamThread logic a bit as the embedded clients / changelog restoration logic has been moved into step 1) and 2). guozhangwang#10 Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Bruno Cadonna <bruno@confluent.io>, Boyang Chen <boyang@confluent.io>	5 years ago
Colin Patrick McCabe	a16dfe6739	MINOR: fix checkstyle issue in ConsumerConfig.java (#8038 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Jason Gustafson	b029902b12	KAFKA-9491; Increment high watermark after full log truncation (#8037 ) When a follower's fetch offset is behind the leader's log start offset, the follower will do a full log truncation. When it does so, it must update both its log start offset and high watermark. The previous code did the former, but not the latter. Failure to update the high watermark in this case can lead to out of range errors if the follower becomes leader before getting the latest high watermark from the previous leader. The out of range errors occur when we attempt to resolve the log position of the high watermark in DelayedFetch in order to determine if a fetch is satisfied. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago

1 2 3 4 5 ...

7108 Commits (9da6823a3a5cef7305d72788c295ada2fd1c0c88) All Branches Search

7108 Commits (9da6823a3a5cef7305d72788c295ada2fd1c0c88)

All Branches