src-kafka

Commit Graph

Author	SHA1	Message	Date
John Roesler	e38e3a66ab	MINOR: Fix standby streamTime (#5288 ) #5253 broke standby restoration for windowed stores. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	8250738ae4	KAFKA-7101: Consider session store for windowed store default configs (#5298 ) 1. extend isWindowStore to consider session store as well. 2. extend the existing unit test accordingly. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
John Roesler	64fff8bfcc	KAFKA-7080: replace numSegments with segmentInterval (#5257 ) See also KIP-319. Replace number-of-segments parameters with segment-interval-ms parameters in various places. The latter was always the parameter that several components needed, and we accidentally supplied the former because it was the one available. Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Chia-Ping Tsai	57320981bb	Minor: fix javadocs of StreamsConfig and ValueTransformerWithKey (#5157 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	6 years ago
Yishun Guan	d44d5d7520	KAFKA-6986: Export Admin Client metrics through Stream Threads (#5210 ) KAFKA-6986:Export Admin Client metrics through Stream Threads We already exported producer and consumer metrics through KafkaStreams class: #4998 It makes sense to also export the Admin client metrics. I didn't add a separate unittest case for this. Let me know if it's needed. This is my first contribution, feel free to point out any mistakes that I did. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	7947c94140	MINOR: Upgrade RocksDB to 5.13.4 (#5309 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	6bfaf4dc60	MINOR: Store metrics scope, total metrics (#5290 ) 1. Rename metrics scope of rocksDB window and session stores; also modify the store metrics accordingly with guidance on its correlations to metricsScope. 2. Add the missing total metrics for per-thread, per-task, per-node and per-store sensors. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	be0f10e190	MINOR: KAFKA-7112: Only resume restoration if state is still PARTITIONS_ASSIGNED after poll (#5306 ) Before KIP-266, consumer.poll(0) would call updateAssignmentMetadataIfNeeded(Long.MAX_VALUE), which makes sure that the rebalance is definitely completed, i.e. both onPartitionRevoked and onPartitionAssigned called within this poll(0). After KIP-266, however, it is possible that only onPartitionRevoked will be called if timeout is elapsed. And hence we need to double check that state is still PARTITIONS_ASSIGNED after the consumer.poll(duration) call. Reviewers: Ted Yu <yuzhihong@gmail.com>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Manikumar Reddy O	51935ee2e6	KAFKA-7091; AdminClient should handle FindCoordinatorResponse errors (#5278 ) - Update KafkaAdminClient implementation to handle FindCoordinatorResponse errors - Remove scala AdminClient usage from core and streams tests Reviewers: Matthias J. Sax <matthias@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	7a74ec62d2	MINOR: Avoid FileInputStream/FileOutputStream (#5281 ) They rely on finalizers (before Java 11), which create unnecessary GC load. The alternatives are as easy to use and don't have this issue. Also use FileChannel directly instead of retrieving it from RandomAccessFile whenever possible since the indirection is unnecessary. Finally, add a few try/finally blocks. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
xinzhg	b054789d69	MINOR: Fix comment in quick union (#5244 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
John Roesler	954be11bf2	KAFKA-6978: make window retention time strict (#5218 ) Enforce window retention times strictly: * records for windows that are expired get dropped * queries for timestamps old enough to be expired immediately answered with null Reviewers: Bill Bejeck <bill@confluent.io>, Damian Guy <damian@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	d3e264e773	MINOR: update web docs and examples of Streams with Java8 syntax (#5249 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Damian Guy <damian@confluent.io>	6 years ago
John Roesler	6732593bba	KAFKA-7072: clean up segments only after they expire (#5253 ) Significant refactor of Segments to use stream-time as the basis of segment expiration. Previously Segments assumed that the current record time was representative of stream time. In the event of a "future" event (one whose record time is greater than the stream time), this would inappropriately drop live segments. Now, Segments will provision the new segment to house the future event and drop old segments only after they expire. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stephane Maarek	410e00cbcb	KAFKA-7066 added better logging in case of Serialisation issue (#5239 ) Following the error message of: https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/SinkNode.java#L93 Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Ismael Juma	cc4dce94af	KAFKA-2983: Remove Scala consumers and related code (#5230 ) - Removed Scala consumers (`SimpleConsumer` and `ZooKeeperConsumerConnector`) and their tests. - Removed Scala request/response/message classes. - Removed any mention of new consumer or new producer in the code with the exception of MirrorMaker where the new.consumer option was never deprecated so we have to keep it for now. The non-code documentation has not been updated either, that will be done separately. - Removed a number of tools that only made sense in the context of the Scala consumers (see upgrade notes). - Updated some tools that worked with both Scala and Java consumers so that they only support the latter (see upgrade notes). - Removed `BaseConsumer` and related classes apart from `BaseRecord` which is used in `MirrorMakerMessageHandler`. The latter is a pluggable interface so effectively public API. - Removed `ZkUtils` methods that were only used by the old consumers. - Removed `ZkUtils.registerBroker` and `ZKCheckedEphemeral` since the broker now uses the methods in `KafkaZkClient` and no-one else should be using that method. - Updated system tests so that they don't use the Scala consumers except for multi-version tests. - Updated LogDirFailureTest so that the consumer offsets topic would continue to be available after all the failures. This was necessary for it to work with the Java consumer. - Some multi-version system tests had not been updated to include recently released Kafka versions, fixed it. - Updated findBugs and checkstyle configs not to refer to deleted classes and packages. Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Bill Bejeck	1354371d4f	KAFKA-6761: Construct logical Streams Graph in DSL Parsing (#4983 ) This version is a WIP and intentionally leaves out some additional required changes to keep the reviewing effort more manageable. This version of the process includes 1. Cleaning up the graph objects to reduce the number of parameters and make the naming conventions more clear. 2. Intercepting all calls to the InternalToplogyBuilder and capturing all details required for possible optimizations and building the final topology. This PR does not include writing out the current physical plan, so no tests included. The next PR will include additional changes to building the graph and writing the topology out without optimizations, using the current streams tests. Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
nixsticks	339fc2379d	KAFKA-7055: Update InternalTopologyBuilder to throw TopologyException if a processor or sink is added with no upstream node attached (#5215 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Matthias J. Sax	0dfa53c47a	KAFKA-6711: GlobalStateManagerImpl should not write offsets of in-memory stores in checkpoint file (#5219 )	7 years ago
Matthias J. Sax	ff96d57437	KAFKA-6860: Fix NPE in Kafka Streams with EOS enabled (#5187 ) Reviewers: John Roesler <john@confluent.io>, Ko Byoung Kwon, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
John Roesler	ce7fe8fe5f	MINOR: Use new consumer API timeout in test (#5217 ) The old timeout configs no longer take effect, as of `53ca52f855`. They are replaced by the new one. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Jagadesh Adireddi	c903d5767e	KAFKA-6749: Fixed TopologyTestDriver to process stream processing guarantee as exactly once (#4912 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Ted Yu <yuzhihong@gmail.com>	7 years ago
Matthias J. Sax	301474f0ba	MINOR: code cleanup follow up for KAFKA-6906 (#5196 ) Reviewers: Ted Yu <yuzhihong@gmail.com>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Filipe Agapito	de4f4f530a	KAFKA-6474: Rewrite tests to use new public TopologyTestDriver [part 2] (#4986 ) * KAFKA-6474: Rewrite tests to use new public TopologyTestDriver [part 2] * Refactor: -KTableFilterTest.java -KTableImplTest.java -KTableMapValuesTest.java -KTableSourceTest.java * Add access to task, processorTopology, and globalTopology in TopologyTestDriver via TopologyTestDriverWrapper * Remove unnecessary constructor in TopologyTestDriver * Change how TopologyTestDriverWrapper#getProcessorContext sets the current node Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Gitomain	40f63eb9c1	KAFKA-6782: solved the bug of restoration of aborted messages for GlobalStateStore and KGlobalTable (#4900 ) Reviewer: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Guozhang Wang	7a59061252	KAFKA-7023: Add unit test (#5197 ) Add a unit test that validates after restoreStart, the options are set with bulk loading configs; and after restoreEnd, it resumes to the customized configs Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	d98ec33364	KAFKA-7021: Reuse source based on config (#5163 ) This PR actually contains two changes: 1. leverage on the TOPOLOGY_OPTIMIZATION config to "adjust" the topology internally to reuse the source topic. 2. fixed a long dangling bug that whenever source topic is reused as changelog topic, write the checkpoint file for the consumed offset, this is done by union the ackedOffset from the producer, plus the consumed offset from the consumer, note we will priori ackedOffset since the same topic may show up in both (think about repartition topic), by doing this the consumed offset from source topics can be treated as checkpointed offset when reuse happens. 3. added a few unit and integration tests with / wo the reusing, and make sure the restoration, standby task, and internal topic creation behaviors are all correct. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Jagadesh Adireddi	ee5cc974d2	KAFKA-6906: Fixed to commit transactions if data is produced via wall clock punctuation (#5105 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Liquan Pei	cc4157d877	KAFKA-7023: Move prepareForBulkLoad() call after customized RocksDBConfigSetter (#5166 ) *Summary options.prepareForBulkLoad() and then use the configs from the customized customized RocksDBConfigSetter. This may overwrite the configs set in prepareBulkLoad call. The fix is to move prepareBulkLoad call after applying configs customized RocksDBConfigSetter. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
John Roesler	74bdafe386	KAFKA-5697: Use nonblocking poll in Streams (#5107 ) Make use of the new Consumer#poll(Duration) to avoid getting stuck in poll when the broker is unavailable. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Matthias J. Sax	bb260e924f	MINOR: remove duplicate map in StoreChangelogReader (#5143 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Jagadesh Adireddi	150967994a	KAFKA-6538: Changes to enhance ByteStore exceptions thrown from RocksDBStore with more human readable info (#5103 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Bill Bejeck	f54acdbb13	KAFKA-6935: Add config for allowing optional optimization (#5071 ) Adding configuration to StreamsConfig allowing for making topology optimization optional. Added unit tests are verifying default values, setting correct value and failure on invalid values. Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Matthias J. Sax	0eddddb82b	KAFKA-6967: TopologyTestDriver does not allow pre-populating state stores that have change logging (#5096 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, James Cheng <jylcheng@yahoo.com>, Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>	7 years ago
Lee Dongjin	594a0e1a07	KAFKA-6993: Fix defective documentations for KStream/KTable methods (#5136 ) * KAFKA-6993: Fix defective documentations for KStream/KTable methods 1. Fix the documentation of following methods, e.g., making more detailed description for the overloaded methods: - KStream#join - KStream#leftJoin - KStream#outerJoin - KTable#filter - KTable#filterNot - KTable#mapValues - KTable#transformValues - KTable#join - KTable#leftJoin - KTable#outerJoin 2. (trivial) with possible new type -> with possibly new type. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Rajini Sivaram	a1ca07d316	MINOR: Bump version to 2.1.0-SNAPSHOT (#5153 )	7 years ago
ConcurrencyPractitioner	ba0ebca7a5	[KAFKA-6730] Simplify State Store Recovery (#5013 ) Reviewer: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Bill Bejeck	ef413699b6	KAFKA-6704: InvalidStateStoreException from IQ when StreamThread closes store (#4801 ) While using an iterator from IQ, it's possible to get an InvalidStateStoreException if the StreamThread closes the store during a range query. Added a unit test to SegmentIteratorTest for this condition. Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
John Roesler	ba5fd3c8a4	MINOR: Add regression tests for KTable mapValues and filter (#5134 ) Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Rajini Sivaram	9df3872fbd	KAFKA-3665: Enable TLS hostname verification by default (KIP-294) (#4956 ) Make HTTPS the default ssl.endpoint.identification.algorithm. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
John Roesler	6f9f365573	KAFKA-6813: return to double-counting for count topology names (#5075 ) #4919 unintentionally changed the topology naming scheme. This change returns to the prior scheme. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	718d6f2475	MINOR: Remove deprecated KafkaStreams constructors in docs (#5118 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Vahid Hashemian	0cacbcf30e	MINOR: Remove usages of JavaConversions and fix some typos (#5115 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Bill Bejeck	cb2f024f87	MINOR: Use thread name and task for sensor name (#5111 ) Changes to keep the operation name as is and make the sensor name unique. Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
emmanuel Harel	e24916a68f	MINOR:Fix table outer join test (#5099 )	7 years ago
Joan Goyeau	ad56f04af9	KAFKA-6936: Implicit materialized for aggregate, count and reduce (#5066 ) In #4919 we propagate the SerDes for each of these aggregation operators. As @guozhangwang mentioned in that PR: ``` reduce: inherit the key and value serdes from the parent XXImpl class. count: inherit the key serdes, enforce setting the Serdes.Long() for value serdes. aggregate: inherit the key serdes, do not set for value serdes internally. ``` Although it's all good for reduce and count, it is quiet unsafe to have aggregate without Materialized given. In fact I don't see why we would not give a Materialized for the aggregate since the result type will always be different (otherwise use reduce) and also the value Serde is simply not propagated. This has been discussed previously in a broader PR before but I believe for aggregate we could pass implicitly a Materialized the same way we pass a Joined, just to avoid the stupid case. Then if the user wants to specialize, he can give his own Materialized. Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Matthias J. Sax	d166485be1	KAFKA-6054: Add 'version probing' to Kafka Streams rebalance (#4636 ) implements KIP-268 Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Guozhang Wang	f33e9a346e	KAFKA-4936: Add dynamic routing in Streams (#5018 ) implements KIP-303 Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Florian Hussonnois	14171fa8b4	KAFKA-6957 make InternalTopologyBuilder accessible from AbstractStream subclasses (#5085 ) Currently, the AbstractStream class defines a copy-constructor that allow to extend KStream and KTable APIs with new methods without impacting the public interface. However adding new processor or/and store to the topology is made throught the internalTopologyBuilder that is not accessible from AbstractStream subclasses defined outside of the package (package visibility). Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Dark	2b6630b518	Remove duplicate code which is invoked twice (#5039 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	7 years ago

1 2 3 4 5 ...

1017 Commits (e38e3a66ab099996ecb156ec9105869f3d9b9228)