src-kafka

Commit Graph

Author	SHA1	Message	Date
Guozhang Wang	1a324d784c	KAFKA-6729: Reuse source topics for source KTable's materialized store's changelog (#5017 ) 1. In InternalTopologyBuilder#topicGroups, which is used in StreamsPartitionAssignor, look for book-kept storeToChangelogTopic map before creating a new internal changelog topics. In this way if the source KTable is created, its source topic stored in storeToChangelogTopic will be used. 2. Added unit test (confirmed that without 1) it will fail). 3. MINOR: removed TODOs that are related to removed KStreamBuilder. 4. MINOR: removed TODOs in StreamsBuilderTest util functions and replaced with TopologyWrapper. 5. MINOR: removed StreamsBuilderTest#testFrom as it is already covered by TopologyTest#shouldNotAllowToAddSourcesWithSameName, plus it requires KStreamImpl.SOURCE_NAME which should be a package private field of the KStreamImpl. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Joan Goyeau	ac9de822b2	MINOR: Use Set instead of List for multiple topics (#5024 ) Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Matthias J. Sax	0b3712d8a5	MINOR: add missing parameter `processing.guaratees` to Streams docs (#5023 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Guozhang Wang	d4204e8b14	MINOR: fix broken links in streams doc (#5025 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
David Glasser	e9154b7960	KAFKA-6905: Document that Processors may be re-used by Streams (#5022 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	c9161afda9	MINOR: doc change for deprecate removal (#5006 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Boyang Chen	1e207b2ef8	KAFKA-6896: Add producer metrics exporting in KafkaStreams (#4998 ) We would like to also export the producer metrics from StreamThread just like consumer metrics, so that we could gain more visibility of stream application. The approach is to pass in the threadProducer into the StreamThread so that we could export its metrics in dynamic. Note that this is a pure internal change that doesn't require a KIP, and in the future we also want to export admin client metrics. A followup KIP for admin client will be created once this is merged. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Dong Lin	0bb48a1669	KAFKA-3473; More Controller Health Metrics (KIP-237) This patch adds a few metrics that are useful for monitoring controller health. See KIP-237 for more detail. Author: Dong Lin <lindong28@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #4392 from lindong28/KAFKA-3473	7 years ago
Matthias J. Sax	9947cd40c6	MINOR: Ensure sensor names are unique in Kafka Streams (#5009 ) Reviewer: Guozhang Wang <guozhang@confluent.io>	7 years ago
Matthias J. Sax	adeced2997	HOTFIX: RegexSourceIntegrationTest needs to cleanup shared output topic (#5008 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	caca1fdc90	KAFKA-6813: Remove deprecated APIs in KIP-182, Part III (#4991 ) 1. Remove TopologyBuilder, TopologyBuilderException, KStreamBuilder, 2. Completed the leftover work of https://issues.apache.org/jira/browse/KAFKA-5660, when we remove TopologyBuilderException. 3. Added MockStoreBuilder to replace MockStateStoreSupplier, remove all XXStoreSupplier except StateStoreSupplier as it is still referenced in the logical streams graph. 4. Minor: rename KStreamsFineGrainedAutoResetIntegrationTest.java to FineGrainedAutoResetIntegrationTest.java. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Joel Hamill	c14b0ad9ee	MINOR - Fix typo in Streams Dev Guide (#4972 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Joan Goyeau	40d191b563	MINOR: Count fix and Type alias refactor in Streams Scala API (#4966 ) Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Manikumar Reddy O	ec7ba32af6	KAFKA-6394; Add a check to prevent misconfiguration of advertised listeners (#4897 ) Do not allow server startup if one of its configured advertised listeners has already been registered by another broker.	7 years ago
fedosov-alexander	6eb7cf1300	KAFKA-5965: Remove Deprecated AdminClient from Streams Resetter Tool (#4968 ) Removed usage of deprecated AdminClient from StreamsResetter No additional tests are required. Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Ismael Juma	c3921d489f	MINOR: Rename RecordFormat to RecordVersion (#4809 ) Also include a few clean-ups: * Method/variable/parameter renames to make them consistent with the class name * Return `ApiVersion` from `minSupportedFor` * Use `values` to remove some code duplication * Reduce duplication in `ApiVersion` by introducing the `shortVersion` method and building the versions map programatically * Avoid unnecessary `regex` in `ApiVersion.apply` * Added scaladoc to a few methods Some of these were originally discussed in: https://github.com/apache/kafka/pull/4583#pullrequestreview-98089400 Added a test for `ApiVersion.shortVersion`. Relying on existing tests for the rest since there is no change in behaviour. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Jason Gustafson	a5ea6d10a8	MINOR: A few small cleanups in AdminClient from KAFKA-6299 (#4989 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Robert Yokota	f69900cd1e	KAFKA-6894: Improve err msg when connecting processor with global store (#5000 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Rajini Sivaram	7ed7cca4c9	KAFKA-6893; Create processors before starting acceptor in SocketServer (#4999 )	7 years ago
Gunju Ko	c90bbc2749	MINOR: Fix typo in ConsumerRebalanceListener JavaDoc (#4996 )	7 years ago
Guozhang Wang	fa1702fece	MINOR: Remove deprecated valueTransformer.punctuate (#4993 ) Also removed the InternalValueTransformerWithKey / Supplier which is used to mock away the deprecated punctuate function. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Rajini Sivaram	830ee16d0d	MINOR: Update dynamic broker configuration doc for truststore update (#4954 ) Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Chia-Ping Tsai	4f7c11a1df	KAFKA-6870 Concurrency conflicts in SampledStat (#4985 ) Make `KafkaMetric.measurableValue` thread-safe Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Anna Povzner	9679c44d2b	KAFKA-6361: Fix log divergence between leader and follower after fast leader fail over (#4882 ) Implementation of KIP-279 as described here: https://cwiki.apache.org/confluence/display/KAFKA/KIP-279%3A+Fix+log+divergence+between+leader+and+follower+after+fast+leader+fail+over In summary: - Added leader_epoch to OFFSET_FOR_LEADER_EPOCH_RESPONSE - Leader replies with the pair( largest epoch less than or equal to the requested epoch, the end offset of this epoch) - If Follower does not know about the leader epoch that leader replies with, it truncates to the end offset of largest leader epoch less than leader epoch that leader replied with, and sends another OffsetForLeaderEpoch request. That request contains the largest leader epoch less than leader epoch that leader replied with. Reviewers: Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Guozhang Wang	0b1a118f45	KAFKA-6813: Remove deprecated APIs in KIP-182, Part II (#4976 ) 1. Remove the deprecated StateStoreSuppliers, and the corresponding Stores.create() functions and factories: only the base StateStoreSupplier and MockStoreSupplier were still preserved as they are needed by the deprecated TopologyBuilder and KStreamBuilder. Will remove them in a follow-up PR. 2. Add TopologyWrapper.java as the original InternalTopologyBuilderAccessor was removed, but I realized it is still needed as of now. 3. Minor: removed StateStoreTestUtils.java and inline its logic in its callers since now with StoreBuilder it is just a one-liner. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
tedyu	8fb5b37013	KAFKA-6878 Switch the order of underlying.init and initInternal (#4988 ) This is continuation of #4978. From Guozhang: I think to fix this issue, in init we could consider switching the steps of 1 and 2: initInternal(context); underlying.init(context, root); since volatile boolean open = false; it should be sufficient. In this case the check on step 3) will fail if underlying.init is not completed and we will throw InvalidStateStoreException. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Colin Patrick McCabe	abbd53da4a	KAFKA-6299; Fix AdminClient error handling when metadata changes (#4295 ) When AdminClient gets a NOT_CONTROLLER error, it should refresh its metadata and retry the request, rather than making the end-user deal with NotControllerException. Move AdminClient's metadata management outside of NetworkClient and into AdminMetadataManager. This will make it easier to do more sophisticated metadata management in the future, such as implementing a NodeProvider which fetches the leaders for topics. Rather than manipulating newCalls directly, the AdminClient service thread now drains it directly into pendingCalls. This minimizes the amount of locking we have to do, since pendingCalls is only accessed from the service thread.	7 years ago
tedyu	e32dcb9a66	KAFKA-6878: NPE when querying global state store not in READY state (#4978 ) Check whether cache is null before retrieving from cache. Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
asutosh936	5ca9ed5ede	KAFKA 6673: Implemented missing override equals method (#4745 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Rajini Sivaram	0ecb72f59d	KAFKA-6834: Handle compaction with batches bigger than max.message.bytes (#4953 ) Grow buffers in log cleaner to hold one message set after sanity check even if message set is bigger than max.message.bytes. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Colin Patrick McCabe	b27e098a7d	MINOR: Fix trace logging in ReplicaManager (#4916 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Adem Efe Gencer	7afcb3a64c	KAFKA-6877; Remove completedFetch upon a failed parse if it contains no records. This patch removed a completedFetch from the completedFetches queue upon a failed parse if it contains no records. The following scenario explains why this is needed for an instance of this case – i.e. in TopicAuthorizationException. 0. Let's assume a scenario, in which the consumer is attempting to read from a topic without the necessary read permission. 1. In Fetcher#fetchedRecords(), after peeking the completedFetches, the Fetcher#parseCompletedFetch(CompletedFetch) throws a TopicAuthorizationException (as expected). 2. Fetcher#fetchedRecords() passes the TopicAuthorizationException up without having a chance to poll completedFetches. So, the same completedFetch remains at the completedFetches queue. 3. Upon following calls to Fetcher#fetchedRecords(), peeking the completedFetches will always return the same completedFetch independent of any updates to the ACL that the topic is trying to read from. 4. Hence, despite the creation of an ACL with correct permissions, once the consumer sees the TopicAuthorizationException, it will be unable to recover without a bounce. Author: Adem Efe Gencer <agencer@linkedin.com> Reviewers: Jiangjie (Becket) Qin <becket.qin@gmail.com> Closes #4974 from efeg/fix/parseCompletedFetchRemainsInQueue	7 years ago
Roman Khlebnov	fcb15e357c	KAFKA-6292; Improve FileLogInputStream batch position checks to avoid type overflow (#4928 ) Switch from sum operations to subtraction to avoid type casting in checks and type overflow during `FlieLogInputStream` work, especially in cases where property `log.segment.bytes` was set close to the `Integer.MAX_VALUE` and used as a `position` inside `nextBatch()` function. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
dan norwood	b328fc729b	MINOR: add equals()/hashCode() for Produced/Consumed (#4979 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	7 years ago
Jason Gustafson	bce10794a0	KAFKA-6879; Invoke session init callbacks outside lock to avoid Controller deadlock (#4977 ) Fixes a deadlock between the controller's beforeInitializingSession callback which holds the zookeeper client initialization lock while awaiting completion of an asynchronous event which itself depends on the same lock. Also catch and log callback exceptions to ensure the ZooKeeper reconnection takes place. Finally, configure KafkaScheduler in ZooKeeperClient to have at least 1 thread. Added tests that fail or hang without the changes in this PR. Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Joan Goyeau	b88d70b532	MINOR: Make Serdes less confusing in Scala (#4963 ) Serdes are confusing in the Scala wrapper: * We have wrappers around Serializer, Deserializer and Serde which are not very useful. * We have Serdes in 2 places org.apache.kafka.common.serialization.Serde and in DefaultSerdes, instead we should be having only one place where to find all the Serdes. I wanted to do this PR before the release as this is a breaking change. This shouldn't add more so the current tests should be enough. Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Guozhang Wang	2b5a594066	KAFKA-6813: Remove deprecated APIs in KIP-182, Part I (#4919 ) I'm breaking KAFKA-6813 into a couple of "smaller" PRs and this is the first one. It focused on: Remove deprecated APIs in KStream, KTable, KGroupedStream, KGroupedTable, SessionWindowedKStream, TimeWindowedKStream. Also found a couple of overlooked bugs while working on them: 2.a) In KTable.filter / mapValues without the additional parameter indicating the materialized stores, originally we will not materialize the store. After KIP-182 we mistakenly diverge the semantics: for KTable.mapValues it is still the case, for KTable.filter we will always materialize. 2.b) In XXStream/Table.reduce/count, we used to try to reuse the serdes since their types are pre-known (for reduce it is the same types for both key / value, for count it is the same types for key, and Long for value). This was somehow lost in the past refactoring. 2.c) We are enforcing to cast a Serde<V> to Serde<VR> for XXStream / Table.aggregate, for which the returned value type is NOT known, such the enforced casting should not be applied and we should require users to provide us the value serde if they believe the default ones are not applicable. 2.d) Whenever we are creating a new MaterializedInternal we are effectively incrementing the suffix index for the store / processor-node names. However in some places this MaterializedInternal is only used for validation, so the resulted processor-node / store suffix is not monotonic. Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Liju John	55dd97097f	KAFKA-6628: RocksDBSegmentedBytesStoreTest does not cover time window serdes (#4836 ) Updated RocksDBSegmentedBytesStoreTest class to include time window serdes. Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Guozhang Wang	42771eb37d	MINOR: Remove deprecated KTable#writeAs, print, foreach, to, through (#4910 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Michael G. Noll	00d1137570	KAFKA-6871: KStreams Scala API: incorrect Javadocs and misleading parameter name (#4971 ) Reviewer: Matthias J. Sax <matthias@confluent.io>, Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Guozhang Wang	32e97b1d9d	MINOR: Remove deprecated parameter in ProcessorContext#register (#4911 ) Updated the upgrade doc as well since we do not have an overloaded function without the deprecated parameter before. Also renamed the 1.2 release version to 2.0. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Filipe Agapito	6f641fef6a	KAFKA-6474: Rewrite tests to use new public TopologyTestDriver [cleanup] (#4939 ) * Add method to create test properties to StreamsTestUtils * Make TopologyTestDriver protected constructor package-private * Add comment suggesting the use of TopologyTestDriver to KStreamTestDriver * Cleanup: - GlobalKTableJoinsTest - KGroupedStreamImplTest - KGroupedTableImplTest - KStreamBranchTest - KStreamFilterTest - KStreamFlatMapTest - KStreamFlatMapValuesTest - KStreamForeachTest - KStreamGlobalKTableJoinTest - KStreamGlobalKTableLeftJoinTest - KStreamImplTest - KStreamKStreamJoinTest - KStreamKStreamLeftJoinTest - KStreamGlobalKTableLeftJoinTest - KStreamKTableJoinTest - KStreamKTableLeftJoinTest - KStreamMapTest - KStreamMapValuesTest - KStreamPeekTest - StreamsBuilderTest - KStreamSelectKeyTest - KStreamTransformTest - KStreamTransformValuesTest - KStreamWindowAggregateTest - KTableForeachTest Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Sean Glover	893e044515	MINOR: Build and code sample updates for Kafka Streams DSL for Scala (#4949 ) Several build and documentation updates were required after the merge of KAFKA-6670: Implement a Scala wrapper library for Kafka Streams. Encode Scala major version into streams-scala artifacts. To differentiate versions of the kafka-streams-scala artifact across Scala major versions it's required to encode the version into the artifact name before its published to a maven repository. This is accomplished by following a similar release process as kafka core, which encodes the Scala major version and then runs the build for each major version of Scala supported. This is considered standard practice when releasing Scala libraries, but is not handled for us automatically with the basic Scala for Gradle support. After this change you can generate and install the kafka-streams-scala artifact into the local maven repository: $ ./gradlew -PscalaVersion=2.11 install $ ./gradlew -PscalaVersion=2.12 install Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	564311f5cd	MINOR: Remove KafkaStreams#toString (#4909 ) Remove the deprecated KafkaStreams#toString function. Also override toString() for internal classes for debugging purposes. Reviewers: Bill Bejeck <bill@confluent.io>, Damian Guy <damian@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Surabhi Dixit	03a2d8243d	KAFKA-6867; Corrected the typos in upgrade.html (#4970 ) Reviewers: Jakob Homan <jghoman@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Bill Bejeck	f448e49fbe	KAFKA-6844: Call shutdown on GlobalStreamThread after all StreamThreads have stopped (#4950 ) Moved the shutdown of GlobalStreamThread to after all StreamThread instances have stopped. There can be a race condition where shut down is called on a StreamThread then shut down is called on a GlobalStreamThread, but if StreamThread is delayed in shutting down, the GlobalStreamThread can shutdown first. If the StreamThread tries to access a GlobalStateStore before closing the user can get an exception stating "..Store xxx is currently closed " Tested by running all current streams tests. Reviewers: Ted Yu <yuzhihong@gmail.com>, John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	de8a67b565	HOTFIX: fix streams tutorial code example	7 years ago
John Roesler	2d8049b713	KAFKA-5697: issue Consumer#wakeup during Streams shutdown Wakeup consumers during shutdown to break them out of any internally blocking calls. Semantically, it should be fine to treat a WakeupException as "no work to do", which will then continue the threads' polling loops, leading them to discover that they are supposed to shut down, which they will do gracefully. The existing tests should be sufficient to verify no regressions. Author: John Roesler <john@confluent.io> Reviewers: Bill Bejeck <bbejeck@gmail.com>, Guozhang Wang <wangguoz@gmail.com> Closes #4930 from vvcephei/streams-client-wakeup-on-shutdown minor javadoc updates	7 years ago
Guozhang Wang	af983267be	MINOR: Removed deprecated schedule function (#4908 ) While working on this, I also refactored the MockProcessor out of the MockProcessorSupplier to cleanup the unit test paths. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Bill Bejeck	515ce21c74	KAFKA-6761: Part 1 of 3; Graph nodes (#4923 ) This PR supersedes PR #4654 as it was growing too large. All comments in that PR should be addressed here. I will attempt to break the PRs for the topology optimization effort into 3 PRs total and will follow this general plan: 1. This PR only adds the graph nodes and graph. The graph nodes will hold the information used to make calls to the InternalTopologyBuilder when using the DSL. Graph nodes are stored in the StreamsTopologyGraph until the final topology needs building then the graph is traversed and optimizations are made at that point. There are no tests in this PR relying on the follow-up PR to use all current streams tests, which should suffice. 2. PR 2 will intercept all DSL calls and build the graph. The InternalStreamsBuilder uses the graph to provide the required info to the InternalTopologyBuilder and build a topology. The condition of satisfaction for this PR is that all current unit, integration and system tests pass using the graph. 3. PR 3 adds some optimizations mainly automatically repartitioning for operations that may modify a key and have child operations that would normally create a separate repartition topic, saving possible unnecessary repartition topics. For example the following topology: ``` KStream<String, String> mappedStreamOther = inputStream.map(new KeyValueMapper<String, String, KeyValue<? extends String, ? extends String>>() { @Override public KeyValue<? extends String, ? extends String> apply(String key, String value) { return KeyValue.pair(key.substring(0, 3), value); } }); mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(5000)).count().toStream().to("count-one-out"); mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(10000)).count().toStream().to("count-two-out"); mappedStreamOther.groupByKey().windowedBy(TimeWindows.of(15000)).count().toStream().to("count-three-out"); ``` would create 3 repartion topics, but after applying an optimization strategy, only one is created. Reviewers: John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago

... 2 3 4 5 6 ...

5163 Commits (d904058c5fc321ea6d29dee41f74ea44cdbad3ea) All Branches Search

5163 Commits (d904058c5fc321ea6d29dee41f74ea44cdbad3ea)

All Branches