src-kafka

Author	SHA1	Message	Date
nprad	79a2f892ca	KAFKA-6966: Extend TopologyDescription to better represent Source and (#5284 ) Implements KIP-321 Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
John Roesler	b1539ff62d	KAFKA-7250: switch scala transform to TransformSupplier (#5481 ) #5468 introduced a breaking API change that was actually avoidable. This PR re-introduces the old API as deprecated and alters the API introduced by #5468 to be consistent with the other methods also, fixed misc syntax problems	6 years ago
Kamal Chandraprakash	13a7544418	MINOR: Fixed log in Topology Builder. (#5477 ) - fix log statement in Topology Builder. - addressed some warnings shown by Intellij Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Satish Duggana <satishd@apache.org>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Bill Bejeck	59ae73482d	MINOR: Follow up for KAFKA-6761 graph should add stores for consistency (#5453 ) While working on 4th PR, I noticed that I had missed adding stores via the graph vs. directly via the InternalStreamsBuilder. Probably ok to do so, but we should be consistent. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	d12ceacd7a	KAFKA-7158: Add unit test for window store range queries (#5466 ) While debugging the reported issue, I found that our current unit test lacks coverage to actually expose the underlying root cause. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Manikumar Reddy O	a9d7f8a1fd	MINOR: Fix Streams scala format violations (#5472 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Michal Dziemianko	ed13d7eebb	KAFKA-7250: fix transform function in scala DSL to accept TranformerSupplier (#5468 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Manikumar Reddy O	e75048d3e5	MINOR: increase timeout values in streams tests (#5461 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
John Roesler	b9f1179694	MINOR: clean up window store interface to avoid confusion (#5359 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
Manikumar Reddy O	924466ad62	MINOR: close producer instance in AbstractJoinIntegrationTest (#5459 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
John Roesler	cf2c5e9ffc	MINOR: clean up node and store sensors (#5450 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
John Roesler	3637b2c374	MINOR: Require final variables in Streams (#5452 ) Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	afe00effe2	KAFKA-3514: Part II, Choose tasks with data on all partitions to process (#5398 ) 1. In each iteration, decide if a task is processable if all of its partitions contains data, so it can decide which record to process next. 1.a Add one exception that, if the task indeed have data on some but not all of its partitions, we only consider as not processable for some finite round of iterations. 1.b Add a task-level metric to record whenever we are forced to process a task that is only "partially data available", since it may leads to non-determinism. 2. Break the main loop on put-raw-data and process-them. Since now not all data put into the queue would be processed completely within a single iteration. 3. NOTE that within an iteration, if a task has exhausted one of its queue it will still be processed, since we only update processable list once in each iteration, I'm improving on this on the follow-up part III PR. 4. Found and fixed a bug in metrics recording: the taskName and sensorName parameters were exchanged. 5. Optimized task stream time computation again since our current partition stream time reasoning has been simplified. 6. Added unit tests. Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
Matthias J. Sax	b083ed66b9	MINOR: improve JavaDocs for Streams PAPI WordCountExample (#5442 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
Bill Bejeck	c19213ab41	KAFKA-6761: Construct Physical Plan using Graph, Reduce streams footprint part III (#5201 ) The specific changes in this PR from the second PR include: 1. Changed the types of graph nodes to names conveying more context 2. Build the entire physical plan from the graph, after StreamsBuilder.build() is called. Other changes are addressed directly as review comments on the PR. Testing consists of using all existing streams tests to validate building the physical plan with graph Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Jason Gustafson	c3e7c0bcb2	MINOR: Producers should set delivery timeout instead of retries (#5425 ) Use delivery timeout instead of retries when possible and remove various TODOs associated with completion of KIP-91. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
John Roesler	aa48791297	KAFKA-7161: check invariant: oldValue is in the state (#5366 ) Reviewers: Vasily Sulatskov <redvasily@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
John Roesler	814fbe0fea	MINOR: Remove 1 minute minimum segment interval (#5323 ) * new minimum is 0, just like window size * refactor tests to use smaller segment sizes as well Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Bill Bejeck	e09d6d796f	KAFKA-7027: Add overloaded build method to StreamsBuilder (#5437 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Lee Dongjin	495c78db6f	KAFKA-6999: Add description on read-write lock vulnerability of ReadOnlyKeyValueStore (#5351 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	c8c3a7dc48	KAFKA-7192 Follow-up: update checkpoint to the reset beginning offset (#5430 ) 1. When we reinitialize the state store due to no CHECKPOINT with EOS turned on, we should update the checkpoint to consumer.seekToBeginnning() / consumer.position() to avoid falling into endless iterations. 2. Fixed a few other logic bugs around needsInitializing and needsRestoring. Reviewers: Jason Gustafson <jason@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
Guozhang Wang	061885e9f1	KAFKA-7192: Wipe out if EOS is turned on and checkpoint file does not exist (#5421 ) 1. As titled and as described in comments. 2. Modified unit test slightly to insert for new keys in committed data to expose this issue. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Matthias J. Sax	42af41d5fc	MINOR: Caching layer should forward record timestamp (#5423 ) Reviewer: Guozhang Wang <guozhang@confluent.io>	6 years ago
Bill Bejeck	1d9a427225	KAFKA-7144: Fix task assignment to be even (#5390 ) This PR now justs removes the check in TaskPairs.hasNewPair that was causing the task assignment issue. This was done as we need to further refine task assignment strategy and this approach needs to include the statefulness of tasks and is best done in one pass vs taking a "patchy" approach. Updated current tests and ran locally Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Matthias J. Sax	487b954542	MINOR: internal config objects should not be logged (#5389 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
Rajini Sivaram	4b60ed3247	KAFKA-7193: Use ZooKeeper IP address in streams tests to avoid timeouts (#5414 ) ZooKeeper client from version 3.4.13 doesn't handle connections to localhost very well. If ZooKeeper is started on 127.0.0.1 on a machine that has both ipv4 and ipv6 and a client is created using localhost rather than the IP address in the connection string, ZooKeeper client attempts to connect to ipv4 or ipv6 randomly with a fixed one second backoff if connection fails. Use 127.0.0.1 instead of localhost in streams tests to avoid intermittent test failures due to ZK client connection timeouts if ipv6 is chosen in consecutive address selections. Also add note to upgrade docs for 2.0.0. Reviewers: Ismael Juma <github@juma.me.uk>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	75825caee4	KAFKA-5037 Follow-up: move Scala test to Java (#5399 ) Reviewers: Ted Yu <yuzhihong@gmail.com>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Manikumar Reddy O	9089fb2d82	MINOR: Fix format violations streams scala tests (#5402 ) @guozhangwang @mjsax hot fix for streams scala test format violations Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Ted Yu	82f124ae30	KAFKA-5037: Fix infinite loop if all input topics are unknown at startup 1. At the beginning of assign, we first check that all the non-repartition source topics are included in the metadata. If not, we log an error at the leader and set an error in the Assignment userData bytes, indicating that leader cannot complete assignment and the error code would indicate the root cause of it. 2. Upon receiving the assignment, if the error is not NONE the streams will shutdown itself with a log entry re-stating the root cause interpreted from the error code. Author: tedyu <yuzhihong@gmail.com> Reviewers: Matthias J. Sax <mjsax@apache.org>, Guozhang Wang <wangguoz@gmail.com> Closes #5322 from tedyu/trunk	6 years ago
Manikumar Reddy O	96c53e96b8	MINOR: Remove deprecated ZkUtils usage from EmbeddedKafkaCluster (#5324 ) Reviewers: Matthias J. Sax <mjsax@apache.org>, Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	2f6240ac94	KAFKA-3514: Remove min timestamp tracker (#5382 ) 1. Remove MinTimestampTracker and its TimestampTracker interface. 2. In RecordQueue, keep track of the head record (deserialized) while put the rest raw bytes records in the fifo queue, the head record as well as the partition timestamp will be updated accordingly. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Matthias J. Sax	06d96628f0	MINOR: remove unused MeteredKeyValueStore (#5380 ) Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	6 years ago
Liquan Pei	08fe24b46a	KAFKA-7103: Use bulkloading for RocksDBSegmentedBytesStore during init (#5276 ) This PR uses bulk loading for recovering RocksDBWindowStore, same as RocksDBStore. Reviewers: Boyang Chen <bchen11@outlook.com>, Shawn Nguyen <shnguyen@pinterest.com>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
hashangayasri	07647c2a4c	MINOR: make the constructor of InMemoryKeyValueStore public so that it can be re-used by custom (in-memory) stores (#5310 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Joan Goyeau	05c5854d1f	MINOR: Add Scalafmt to Streams Scala API (#4965 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
John Roesler	e38e3a66ab	MINOR: Fix standby streamTime (#5288 ) #5253 broke standby restoration for windowed stores. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	8250738ae4	KAFKA-7101: Consider session store for windowed store default configs (#5298 ) 1. extend isWindowStore to consider session store as well. 2. extend the existing unit test accordingly. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
John Roesler	64fff8bfcc	KAFKA-7080: replace numSegments with segmentInterval (#5257 ) See also KIP-319. Replace number-of-segments parameters with segment-interval-ms parameters in various places. The latter was always the parameter that several components needed, and we accidentally supplied the former because it was the one available. Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Chia-Ping Tsai	57320981bb	Minor: fix javadocs of StreamsConfig and ValueTransformerWithKey (#5157 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	6 years ago
Yishun Guan	d44d5d7520	KAFKA-6986: Export Admin Client metrics through Stream Threads (#5210 ) KAFKA-6986:Export Admin Client metrics through Stream Threads We already exported producer and consumer metrics through KafkaStreams class: #4998 It makes sense to also export the Admin client metrics. I didn't add a separate unittest case for this. Let me know if it's needed. This is my first contribution, feel free to point out any mistakes that I did. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	7947c94140	MINOR: Upgrade RocksDB to 5.13.4 (#5309 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	6bfaf4dc60	MINOR: Store metrics scope, total metrics (#5290 ) 1. Rename metrics scope of rocksDB window and session stores; also modify the store metrics accordingly with guidance on its correlations to metricsScope. 2. Add the missing total metrics for per-thread, per-task, per-node and per-store sensors. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Guozhang Wang	be0f10e190	MINOR: KAFKA-7112: Only resume restoration if state is still PARTITIONS_ASSIGNED after poll (#5306 ) Before KIP-266, consumer.poll(0) would call updateAssignmentMetadataIfNeeded(Long.MAX_VALUE), which makes sure that the rebalance is definitely completed, i.e. both onPartitionRevoked and onPartitionAssigned called within this poll(0). After KIP-266, however, it is possible that only onPartitionRevoked will be called if timeout is elapsed. And hence we need to double check that state is still PARTITIONS_ASSIGNED after the consumer.poll(duration) call. Reviewers: Ted Yu <yuzhihong@gmail.com>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Manikumar Reddy O	51935ee2e6	KAFKA-7091; AdminClient should handle FindCoordinatorResponse errors (#5278 ) - Update KafkaAdminClient implementation to handle FindCoordinatorResponse errors - Remove scala AdminClient usage from core and streams tests Reviewers: Matthias J. Sax <matthias@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	7a74ec62d2	MINOR: Avoid FileInputStream/FileOutputStream (#5281 ) They rely on finalizers (before Java 11), which create unnecessary GC load. The alternatives are as easy to use and don't have this issue. Also use FileChannel directly instead of retrieving it from RandomAccessFile whenever possible since the indirection is unnecessary. Finally, add a few try/finally blocks. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
xinzhg	b054789d69	MINOR: Fix comment in quick union (#5244 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
John Roesler	954be11bf2	KAFKA-6978: make window retention time strict (#5218 ) Enforce window retention times strictly: * records for windows that are expired get dropped * queries for timestamps old enough to be expired immediately answered with null Reviewers: Bill Bejeck <bill@confluent.io>, Damian Guy <damian@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Guozhang Wang	d3e264e773	MINOR: update web docs and examples of Streams with Java8 syntax (#5249 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Damian Guy <damian@confluent.io>	7 years ago
John Roesler	6732593bba	KAFKA-7072: clean up segments only after they expire (#5253 ) Significant refactor of Segments to use stream-time as the basis of segment expiration. Previously Segments assumed that the current record time was representative of stream time. In the event of a "future" event (one whose record time is greater than the stream time), this would inappropriately drop live segments. Now, Segments will provision the new segment to house the future event and drop old segments only after they expire. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Stephane Maarek	410e00cbcb	KAFKA-7066 added better logging in case of Serialisation issue (#5239 ) Following the error message of: https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/SinkNode.java#L93 Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago

1 2 3 4 5 ...

1102 Commits (5ba9cade7b066cc26842aeaac5662a57c502ffcb)