Moved the shutdown of GlobalStreamThread to after all StreamThread instances have stopped.
There was a race condition: shutdown is called on a StreamThread and then on the GlobalStreamThread, but if the StreamThread is delayed in shutting down, the GlobalStreamThread can shut down first.
If the StreamThread then tries to access a GlobalStateStore before closing, the user can get an exception stating "Store xxx is currently closed".
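As a rough illustration of the ordering fix (not the actual KafkaStreams internals; the plain Thread types below are stand-ins for StreamThread and GlobalStreamThread):
```
import java.util.List;

// Illustrative sketch only: plain Threads stand in for Kafka Streams' internal
// StreamThread and GlobalStreamThread.
final class OrderedShutdownSketch {

    static void shutdown(final List<Thread> streamThreads, final Thread globalStreamThread)
            throws InterruptedException {
        // Signal and join all regular stream threads first, so none of them can
        // touch a global state store after it has been closed...
        for (final Thread t : streamThreads) {
            t.interrupt();              // stand-in for StreamThread#shutdown()
        }
        for (final Thread t : streamThreads) {
            t.join();
        }
        // ...and only then shut down the global thread, which owns the global state stores.
        globalStreamThread.interrupt(); // stand-in for GlobalStreamThread#shutdown()
        globalStreamThread.join();
    }
}
```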
Tested by running all current streams tests.
Reviewers: Ted Yu <yuzhihong@gmail.com>, John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Wake up consumers during shutdown to break them out of any internally blocking calls.
Semantically, it should be fine to treat a WakeupException as "no work to do", which will then continue the threads' polling loops, leading them to discover that they are supposed to shut down, which they will do gracefully.
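For illustration, a minimal sketch of such a poll loop, assuming a shutdown flag and a consumer whose wakeup() is called from the closing thread (not the actual StreamThread code):
```
import java.time.Duration;
import java.util.concurrent.atomic.AtomicBoolean;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.common.errors.WakeupException;

// Illustrative poll loop: a WakeupException from poll() is treated as "no work to do",
// so the loop simply re-checks its shutdown flag and exits cleanly.
final class WakeupAwarePollLoop {

    static <K, V> void run(final Consumer<K, V> consumer, final AtomicBoolean running) {
        while (running.get()) {
            final ConsumerRecords<K, V> records;
            try {
                records = consumer.poll(Duration.ofMillis(100));
            } catch (final WakeupException e) {
                // consumer.wakeup() was called (e.g. during shutdown); continue so the
                // loop condition can discover that it should stop, and stop gracefully.
                continue;
            }
            records.forEach(record -> {
                // process the record...
            });
        }
        consumer.close();
    }
}
```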
The existing tests should be sufficient to verify no regressions.
Author: John Roesler <john@confluent.io>
Reviewers: Bill Bejeck <bbejeck@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
Closes #4930 from vvcephei/streams-client-wakeup-on-shutdown
minor javadoc updates
While working on this, I also refactored the MockProcessor out of the MockProcessorSupplier to clean up the unit test paths.
Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>
This PR supersedes PR #4654 as it was growing too large. All comments in that PR should be addressed here.
I will attempt to break the PRs for the topology optimization effort into 3 PRs total and will follow this general plan:
1. This PR only adds the graph nodes and the graph. The graph nodes hold the information used to make calls to the InternalTopologyBuilder when using the DSL. Graph nodes are stored in the StreamsTopologyGraph until the final topology needs to be built; at that point the graph is traversed and optimizations are made. There are no tests in this PR; it relies on the follow-up PR, where all current streams tests should suffice.
2. PR 2 will intercept all DSL calls and build the graph. The InternalStreamsBuilder uses the graph to provide the required info to the InternalTopologyBuilder and build a topology. The condition of satisfaction for this PR is that all current unit, integration and system tests pass using the graph.
3. PR 3 adds some optimizations, mainly automatic repartitioning for operations that may modify a key and have child operations that would each normally create a separate repartition topic, saving possibly unnecessary repartition topics. For example, the following topology:
```
KStream<String, String> mappedStreamOther = inputStream.map(new KeyValueMapper<String, String, KeyValue<? extends String, ? extends String>>() {
    @Override
    public KeyValue<? extends String, ? extends String> apply(String key, String value) {
        return KeyValue.pair(key.substring(0, 3), value);
    }
});
mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(5000)).count().toStream().to("count-one-out");
mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(10000)).count().toStream().to("count-two-out");
mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(15000)).count().toStream().to("count-three-out");
```
would create 3 repartition topics, but after applying an optimization strategy, only one is created.
Reviewers: John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
The leader must explicitly check whether the requested leader epoch is undefined and return an undefined offset, so that the follower can fall back to truncating to its high watermark. Otherwise, if the leader is also not tracking leader epochs, it may return its LEO, which will cause the follower to truncate to an incorrect offset.
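A hedged sketch of the leader-side check described above; the constants and helper below are illustrative stand-ins, not the actual broker code (which is written in Scala):
```
import java.util.function.IntToLongFunction;

// Illustrative stand-in for the leader-side OffsetsForLeaderEpoch handling.
final class LeaderEpochCheckSketch {

    static final int UNDEFINED_EPOCH = -1;
    static final long UNDEFINED_EPOCH_OFFSET = -1L;

    static long endOffsetFor(final int requestedEpoch, final IntToLongFunction endOffsetForEpoch) {
        if (requestedEpoch == UNDEFINED_EPOCH) {
            // The follower is not tracking leader epochs: return an undefined offset so it
            // falls back to truncating to its high watermark, rather than receiving the
            // leader's LEO and truncating to an incorrect offset.
            return UNDEFINED_EPOCH_OFFSET;
        }
        return endOffsetForEpoch.applyAsLong(requestedEpoch);
    }
}
```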
The log cleaner grows its buffers when result.messagesRead is zero. This field contains the number of filtered messages read from the source, which can be zero when transactions are used because entire batches may be discarded. The log cleaner incorrectly assumes the messages were not read because the buffer was too small and unnecessarily attempts to double the buffer size, failing with an exception if the buffer is already at max.message.bytes. An additional check for discarded batches has been added so that buffers are not grown when batches are discarded.
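A minimal sketch of the guard described above, with hypothetical names (the actual LogCleaner is written in Scala):
```
// Illustrative sketch with hypothetical names, not the actual LogCleaner code.
final class CleanerBufferGrowthSketch {

    // Grow (double) the read buffer only when nothing was read AND nothing was discarded,
    // i.e. the buffer really was too small to hold a single batch.
    static boolean shouldGrowBuffer(final int messagesRead,
                                    final int batchesDiscarded,
                                    final int currentBufferSize,
                                    final int maxMessageBytes) {
        final boolean madeNoProgress = messagesRead == 0 && batchesDiscarded == 0;
        return madeNoProgress && currentBufferSize < maxMessageBytes;
    }
}
```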
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
For the UNKNOWN_TOPIC_OR_PARTITION error, we could change the consumer's behavior to retry after this error. This is a rare case, since the user would not commit offsets for topics unless they had been able to fetch from them, but that does not handle the situation where the broker has not yet received any metadata updates.
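For illustration, a hedged sketch of classifying UNKNOWN_TOPIC_OR_PARTITION as retriable alongside the usual coordinator errors; this is not the actual consumer-coordinator code:
```
import org.apache.kafka.common.protocol.Errors;

// Illustrative classification only; not the actual consumer-coordinator code.
final class CommitErrorClassifierSketch {

    static boolean isRetriableCommitError(final Errors error) {
        switch (error) {
            case COORDINATOR_LOAD_IN_PROGRESS:
            case COORDINATOR_NOT_AVAILABLE:
            case NOT_COORDINATOR:
            case UNKNOWN_TOPIC_OR_PARTITION: // the change discussed above: treat as retriable
                return true;
            default:
                return false;
        }
    }
}
```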
Reviewers: Jason Gustafson <jason@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
This pull request is for JIRA KAFKA-6657, covering KIP-276.
Added unit tests for the new getGlobalConsumerConfigs API and made sure the existing restore consumer tests pass.
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Enable dynamic updates of the brokers' default unclean leader election config. A new controller event has been added to process unclean leader election when the config is enabled dynamically.
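As a usage illustration (not part of this change), the cluster-wide default can be updated dynamically through the AdminClient; the bootstrap address below is a placeholder:
```
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.Config;
import org.apache.kafka.clients.admin.ConfigEntry;
import org.apache.kafka.common.config.ConfigResource;

// Usage illustration only: dynamically set the cluster-wide default for
// unclean.leader.election.enable.
public final class UncleanElectionConfigSketch {
    public static void main(final String[] args) throws Exception {
        final Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            // An empty broker id addresses the cluster-wide ("default") broker config.
            final ConfigResource clusterDefault = new ConfigResource(ConfigResource.Type.BROKER, "");
            final Config update = new Config(Collections.singleton(
                    new ConfigEntry("unclean.leader.election.enable", "true")));
            admin.alterConfigs(Collections.singletonMap(clusterDefault, update)).all().get();
        }
    }
}
```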
Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>
Prevent exceptions thrown by metric reporters from impacting request processing and other reporters.
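A minimal sketch of the isolation described above, assuming a hypothetical helper that notifies reporters (not the actual Metrics registry code):
```
import java.util.List;
import org.apache.kafka.common.metrics.KafkaMetric;
import org.apache.kafka.common.metrics.MetricsReporter;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

// Illustrative helper: each reporter callback is wrapped so that one misbehaving
// reporter cannot affect the caller or the other reporters.
final class SafeReporterNotifier {

    private static final Logger log = LoggerFactory.getLogger(SafeReporterNotifier.class);

    static void notifyMetricChange(final List<MetricsReporter> reporters, final KafkaMetric metric) {
        for (final MetricsReporter reporter : reporters) {
            try {
                reporter.metricChange(metric);
            } catch (final Exception e) {
                // Log and continue: a broken reporter must not break request processing.
                log.error("Error notifying metrics reporter {}", reporter.getClass().getName(), e);
            }
        }
    }
}
```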
Co-authored-by: Mickael Maison <mickael.maison@gmail.com>
Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com>
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
This PR does the following:
* Remove the StreamsRepeatingIntegerKeyProducerService and the associated Java class
* Add a parameter to VerifiableProducer.java to enable sending keys when specified
* Update the corresponding Python file verifiable_producer.py to support the new parameter.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
If the pip version is not pinned, the following error will happen:
```
Traceback (most recent call last):
  File "/usr/bin/pip", line 9, in <module>
    from pip import main
ImportError: cannot import name main
```
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Removed the following: "zookeeper.connect", "key.serde", "value.serde", "timestamp.extractor"
Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Jason Gustafson <jason@confluent.io>
Debian installer packages are no longer available for Java 7.
Also upgrade AMI to latest ubuntu/trusty 14 amd64 as the older
one is no longer available.
Note that this only changes the JDK used to build and run
the system tests. We still have Jenkins jobs that compile
and run the JUnit tests with Java 7 so that we don't use
features that are only available in newer Java versions.
This PR fixes some regressions introduced into the Streams system tests and sets the upgrade tests to ignored until PR #4636, which contains the fixes for the upgrade tests, is merged.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
* Fixes a bug in which all NamedCache instances in a process shared
one parent metric.
* Also fixes a bug which incorrectly computed the per-cache metric tag
(which was undetected due to the former bug).
* Drop the StreamsMetricsConventions#xLevelSensorName convention
in favor of StreamsMetricsImpl#xLevelSensor to allow StreamsMetricsImpl
to track thread- and cache-level metrics, so that they may be cleanly declared
from anywhere but still unloaded at the appropriate time. This was necessary
now so that the NamedCache could register a thread-level parent sensor
to be unloaded when the thread, not the cache, is closed (see the sketch after this list).
* The above changes made it mostly unnecessary for the StreamsMetricsImpl to
expose a reference to the underlying Metrics registry, so I did a little extra work
to remove that reference, including removing inconsistently-used and unnecessary
calls to Metrics#close() in the tests.
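For illustration, a hedged sketch of the thread-level parent / cache-level child sensor relationship using the public Metrics and Sensor API; sensor names, groups, and tags are hypothetical, not the actual StreamsMetricsImpl code:
```
import java.util.Collections;
import org.apache.kafka.common.metrics.Metrics;
import org.apache.kafka.common.metrics.Sensor;
import org.apache.kafka.common.metrics.stats.Avg;
import org.apache.kafka.common.metrics.stats.Max;

// Illustrative sketch of a thread-level parent sensor with a cache-level child sensor.
final class CacheSensorSketch {

    public static void main(final String[] args) {
        final Metrics metrics = new Metrics();

        // Thread-level parent: unloaded when the thread closes, not when a single cache closes.
        final Sensor threadHitRatio = metrics.sensor("thread-1.hit-ratio");
        threadHitRatio.add(
                metrics.metricName("hit-ratio-avg", "stream-record-cache-metrics",
                        "average cache hit ratio over all caches on the thread",
                        Collections.singletonMap("thread-id", "thread-1")),
                new Avg());

        // Cache-level child: records into its own metric and also rolls up into the parent.
        final Sensor cacheHitRatio = metrics.sensor("thread-1.cache-a.hit-ratio", threadHitRatio);
        cacheHitRatio.add(
                metrics.metricName("hit-ratio-max", "stream-record-cache-metrics",
                        "max cache hit ratio for one cache",
                        Collections.singletonMap("cache-id", "cache-a")),
                new Max());

        cacheHitRatio.record(0.8); // recorded on both the cache sensor and its thread-level parent

        metrics.close();
    }
}
```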
The existing tests should be sufficient to verify this change.
Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
The current Iterator-based ListConsumerGroups API is synchronous. The API should be asynchronous to fit in with the other AdminClient APIs. Also fix some error handling corner cases.
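For illustration, the asynchronous shape of the API can be used roughly as follows (the bootstrap address is a placeholder):
```
import java.util.Collection;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.ConsumerGroupListing;
import org.apache.kafka.common.KafkaFuture;

// Usage illustration only.
public final class ListGroupsSketch {
    public static void main(final String[] args) throws Exception {
        final Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            // The result wraps KafkaFutures instead of a blocking iterator.
            final KafkaFuture<Collection<ConsumerGroupListing>> groups = admin.listConsumerGroups().all();
            for (final ConsumerGroupListing listing : groups.get()) {
                System.out.println(listing.groupId());
            }
        }
    }
}
```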
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>
Added unit tests for ReplicaAlterLogDirsThread. Mostly focused on unit tests for truncating logic.
Fixed ReplicaAlterLogDirsThread.buildLeaderEpochRequest() to use the future replica's latest epoch (not the latest epoch of the replica it is fetching from). This follows the logic that the offset-for-leader-epoch request should be based on the leader epoch of the follower (in this case, the future local replica).
Also fixed the PartitionFetchState constructor that takes an offset and a delay: the code ignored the delay parameter and used 0 for the delay. This constructor is used only by another constructor that passes delay = 0, which happens to work.
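A tiny hedged sketch of the constructor bug pattern described above; the class below is a stand-in, not the actual Kafka code:
```
// Illustrative stand-in: the two-argument constructor used to hard-code the delay
// to 0, silently ignoring the parameter.
final class PartitionFetchStateSketch {

    final long offset;
    final long delayMs;

    PartitionFetchStateSketch(final long offset, final long delayMs) {
        this.offset = offset;
        this.delayMs = delayMs; // previously hard-coded to 0, ignoring the delay parameter
    }

    PartitionFetchStateSketch(final long offset) {
        this(offset, 0L); // the only caller happened to pass a delay of 0, masking the bug
    }
}
```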
Author: Anna Povzner <anna@confluent.io>
Reviewers: Dong Lin <lindong28@gmail.com>
Closes #4918 from apovzner/kafka-6795
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Currently, if the client sends a produce request or a fetch request to a broker that isn't a replica, we return UNKNOWN_TOPIC_OR_PARTITION. This is surprising to see when the topic actually exists. It would be better to return NOT_LEADER to avoid confusion. Clients typically handle both errors by refreshing metadata and retrying, so changing this should not cause any change in behavior on the client. This case can be hit following a partition reassignment, after the leader is moved and the local replica is deleted.
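For illustration, a hedged sketch of the decision described above (error names as plain strings; not the actual broker code):
```
// Illustrative decision sketch only.
final class PartitionErrorSketch {

    static String errorForRequest(final boolean topicExistsInMetadata, final boolean brokerHostsReplica) {
        if (!topicExistsInMetadata) {
            return "UNKNOWN_TOPIC_OR_PARTITION"; // the topic really is unknown
        }
        if (!brokerHostsReplica) {
            // Previously UNKNOWN_TOPIC_OR_PARTITION, which was confusing for an existing topic;
            // clients handle NOT_LEADER the same way: refresh metadata and retry.
            return "NOT_LEADER";
        }
        return "NONE";
    }
}
```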
To validate the current behavior and the fix, I've added integration tests for the fetch and produce APIs.
This PR implements a Scala wrapper library for Kafka Streams. The library is implemented as a project under streams, namely `:streams:streams-scala`. The PR contains the following:
* the library implementation of the wrapper abstractions
* the test suite
* the changes in `build.gradle` to build the library jar
The library has been tested running the tests as follows:
```
$ ./gradlew -Dtest.single=StreamToTableJoinScalaIntegrationTestImplicitSerdes streams:streams-scala:test
$ ./gradlew -Dtest.single=StreamToTableJoinScalaIntegrationTestImplicitSerdesWithAvro streams:streams-scala:test
$ ./gradlew -Dtest.single=WordCountTest streams:streams-scala:test
```
Author: Debasish Ghosh <ghosh.debasish@gmail.com>
Author: Sean Glover <seglo@randonom.com>
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Ismael Juma <ismael@juma.me.uk>, John Roesler <john@confluent.io>, Damian Guy <damian@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes #4756 from debasishg/scala-streams
* unify skipped-records metering
* log warnings when records are skipped
* tighten up metrics usage a bit
### Testing strategy:
Unit testing of the metrics and the logs should be sufficient.
Author: John Roesler <john@confluent.io>
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#4812 from vvcephei/kip-274-streams-skip-metrics
There are a couple of minor additions in this PR:
1. Add a new test for the window store that performs a range query upon receiving each record (see the sketch after this list).
2. In the non-windowed state store case, add a get call before the put call.
3. Enable caching by default, to be consistent with the other join / aggregate cases where caching is enabled by default.
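For illustration, a hedged sketch of the kind of window-store range query mentioned in item 1; the store, key, and time range are placeholders, not the actual test code:
```
import java.time.Duration;
import java.time.Instant;
import org.apache.kafka.streams.state.ReadOnlyWindowStore;
import org.apache.kafka.streams.state.WindowStoreIterator;

// Illustrative range query over a window store: fetch all windows for a key in a recent range.
final class WindowRangeQuerySketch {

    static long sumRecentCounts(final ReadOnlyWindowStore<String, Long> store,
                                final String key,
                                final Instant now) {
        long sum = 0L;
        try (WindowStoreIterator<Long> iter = store.fetch(key, now.minus(Duration.ofMinutes(5)), now)) {
            while (iter.hasNext()) {
                sum += iter.next().value; // KeyValue<Long /* window start */, Long /* count */>
            }
        }
        return sum;
    }
}
```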
Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>