src-kafka

Commit Graph

Author	SHA1	Message	Date
Jagadesh Adireddi	95b46a12e5	KAFKA-6685: Added Exception to distinguish message Key from Value during deserializing. https://issues.apache.org/jira/browse/KAFKA-6685 Added Exception message in `WorkerSinkTask.convertMessages` to distinguish message Key from Value during deserialization to Kafka connect format. Author: Jagadesh Adireddi <adireddijagadesh@gmail.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4765 from jadireddi/KAFKA-6685---log-message-should-distinguish-key-from-value	7 years ago
Guozhang Wang	70a506b983	MINOR: Ignore test_broker_type_bounce_at_start system test (#5055 ) test_broker_type_bounce_at_start tries to validate that when the controller is down, the streams client will always fail trying to create the topic; with the current behavior of admin client it is actually not always true: the actual behavior depends on the admin client internals as well as when the controller becomes unavailable during the leader assign partitions phase. I'd suggest at least ignoring this test for now until the admin client is more stable (personally I'd even suggest removing this test, as its coverage benefits are smaller than the issues it introduces). Also adding a few more log4j entries as a result of investigating this issue. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	cbce95d9a5	MINOR: Reduce required occurrence from 100 to 10 (#5048 ) Due to #4644 the consumer connector logs will be much cleaner, with fewer "broker may not be available" entries. We need to reduce the required frequency from 100 to a smaller number. I've thought about reducing to just 1, but it may still be transient (i.e. even if broker is starting up you may see a few entries) so I reduced it to 10. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Colin Patrick McCabe	16ad358d64	KAFKA-6868; Fix buffer underflow and expose group state in the consumer groups API (#4980 ) * The consumer groups API should expose group state and coordinator information. This information is needed by administrative tools and scripts that access consumer groups. * The partition assignment will be empty when the group is rebalancing. Fix an issue where the adminclient attempted to deserialize this empty buffer. * Remove nulls from the API and make all collections immutable. * DescribeConsumerGroupsResult#all should return a result as expected, rather than Void * Fix exception text for GroupIdNotFoundException, GroupNotEmptyException. It was being filled in as "The group id The group id does not exist was not found" and similar. Reviewers: Attila Sasvari <asasvari@apache.org>, Andras Beni <andrasbeni@cloudera.com>, Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Joan Goyeau	96cda0e07a	MINOR: Fix type inference on joins and aggregates (#5019 ) The type inference doesn't currently work for the join functions in Scala as it doesn't know yet the types of the given KStream[K, V] or KTable[K, V]. The fix here is to curry the joiner function. I personally prefer this notation but this also means it differs more from the Java API. I believe the diff with the Java API is worth in this case as it's not only solving the type inference but also fits better the Scala way of coding (ex: fold). Moreover any Scala dev will bug and spend little time on these functions trying to understand why the type inference is not working and then get frustrated to be obliged to be explicit here where it's not harmful to be inferred. Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Guozhang Wang	9752ccad55	KAFKA-6729: Follow up; disable logging for source KTable. (#5038 ) Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
David Glasser	f65f3a878f	KAFKA-6905: Document that Transformers may be re-used by Streams (#5026 ) This is a follow-up to #5022 which added documentation to the Processor interface. This commit adds similar documentation to Transformer and ValueTransformer. Also, s/processor/transformer/ in the close() docs. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Andy Coates	4e1c8ffd0d	KAFKA-6849: add transformValues methods to KTable. (#4959 ) See the KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-292%3A+Add+transformValues%28%29+method+to+KTable This PR adds the transformValues method to the KTable interface. The semantics of the call are the same as the methods of the same name on the KStream interface. Fixes KAFKA-6849 Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Rajini Sivaram	c53e274d31	KAFKA-6917; Process txn completion asynchronously to avoid deadlock (#5036 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Guozhang Wang	05ea580091	MINOR: Remove unused class (#5037 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Robert Yokota	ee8abb2f70	KAFKA-6566: Improve Connect Resource Cleanup This is a change to improve resource cleanup for sink tasks and source tasks. Now `Task.stop()` is called from both `WorkerSinkTask.close()` and `WorkerSourceTask.close()`. It is called from `WorkerXXXTask.close()` since this method is called in the `finally` block of `WorkerTask.run()`, and Connect developers use `stop()` to clean up resources. Author: Robert Yokota <rayokota@gmail.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5020 from rayokota/K6566-improve-connect-resource-cleanup	7 years ago
John Roesler	58a910f0a7	KAFKA-5697: revert wakeup-based impl (#5035 ) The wakeup-based strategy caused more problems than it solved, so we'll instead focus on KIP-266. Revert commit `2d8049b`. Keep the metrics addition and the new test util. Also keep the tests for shutdown, although they must be ignored until poll(Duration) is done in the scope of KIP-266. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	ba237c5d21	HOTFIX: use ConsumedInternal in StreamsBuilder	7 years ago
Guozhang Wang	6b8e79b137	HOTFIX: move Consumed to o.a.k.streams.kstream (#5033 ) Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	1a324d784c	KAFKA-6729: Reuse source topics for source KTable's materialized store's changelog (#5017 ) 1. In InternalTopologyBuilder#topicGroups, which is used in StreamsPartitionAssignor, look for book-kept storeToChangelogTopic map before creating a new internal changelog topics. In this way if the source KTable is created, its source topic stored in storeToChangelogTopic will be used. 2. Added unit test (confirmed that without 1) it will fail). 3. MINOR: removed TODOs that are related to removed KStreamBuilder. 4. MINOR: removed TODOs in StreamsBuilderTest util functions and replaced with TopologyWrapper. 5. MINOR: removed StreamsBuilderTest#testFrom as it is already covered by TopologyTest#shouldNotAllowToAddSourcesWithSameName, plus it requires KStreamImpl.SOURCE_NAME which should be a package private field of the KStreamImpl. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Joan Goyeau	ac9de822b2	MINOR: Use Set instead of List for multiple topics (#5024 ) Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Matthias J. Sax	0b3712d8a5	MINOR: add missing parameter `processing.guarantee` to Streams docs (#5023 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Guozhang Wang	d4204e8b14	MINOR: fix broken links in streams doc (#5025 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
David Glasser	e9154b7960	KAFKA-6905: Document that Processors may be re-used by Streams (#5022 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	c9161afda9	MINOR: doc change for deprecate removal (#5006 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Boyang Chen	1e207b2ef8	KAFKA-6896: Add producer metrics exporting in KafkaStreams (#4998 ) We would like to also export the producer metrics from StreamThread just like consumer metrics, so that we could gain more visibility of stream application. The approach is to pass in the threadProducer into the StreamThread so that we could export its metrics in dynamic. Note that this is a pure internal change that doesn't require a KIP, and in the future we also want to export admin client metrics. A followup KIP for admin client will be created once this is merged. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Dong Lin	0bb48a1669	KAFKA-3473; More Controller Health Metrics (KIP-237) This patch adds a few metrics that are useful for monitoring controller health. See KIP-237 for more detail. Author: Dong Lin <lindong28@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #4392 from lindong28/KAFKA-3473	7 years ago
Matthias J. Sax	9947cd40c6	MINOR: Ensure sensor names are unique in Kafka Streams (#5009 ) Reviewer: Guozhang Wang <guozhang@confluent.io>	7 years ago
Matthias J. Sax	adeced2997	HOTFIX: RegexSourceIntegrationTest needs to cleanup shared output topic (#5008 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Guozhang Wang	caca1fdc90	KAFKA-6813: Remove deprecated APIs in KIP-182, Part III (#4991 ) 1. Remove TopologyBuilder, TopologyBuilderException, KStreamBuilder, 2. Completed the leftover work of https://issues.apache.org/jira/browse/KAFKA-5660, when we remove TopologyBuilderException. 3. Added MockStoreBuilder to replace MockStateStoreSupplier, remove all XXStoreSupplier except StateStoreSupplier as it is still referenced in the logical streams graph. 4. Minor: rename KStreamsFineGrainedAutoResetIntegrationTest.java to FineGrainedAutoResetIntegrationTest.java. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Joel Hamill	c14b0ad9ee	MINOR - Fix typo in Streams Dev Guide (#4972 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Joan Goyeau	40d191b563	MINOR: Count fix and Type alias refactor in Streams Scala API (#4966 ) Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Manikumar Reddy O	ec7ba32af6	KAFKA-6394; Add a check to prevent misconfiguration of advertised listeners (#4897 ) Do not allow server startup if one of its configured advertised listeners has already been registered by another broker.	7 years ago
fedosov-alexander	6eb7cf1300	KAFKA-5965: Remove Deprecated AdminClient from Streams Resetter Tool (#4968 ) Removed usage of deprecated AdminClient from StreamsResetter. No additional tests are required. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Ismael Juma	c3921d489f	MINOR: Rename RecordFormat to RecordVersion (#4809 ) Also include a few clean-ups: * Method/variable/parameter renames to make them consistent with the class name * Return `ApiVersion` from `minSupportedFor` * Use `values` to remove some code duplication * Reduce duplication in `ApiVersion` by introducing the `shortVersion` method and building the versions map programmatically * Avoid unnecessary `regex` in `ApiVersion.apply` * Added scaladoc to a few methods Some of these were originally discussed in: https://github.com/apache/kafka/pull/4583#pullrequestreview-98089400 Added a test for `ApiVersion.shortVersion`. Relying on existing tests for the rest since there is no change in behaviour. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Jason Gustafson	a5ea6d10a8	MINOR: A few small cleanups in AdminClient from KAFKA-6299 (#4989 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Robert Yokota	f69900cd1e	KAFKA-6894: Improve err msg when connecting processor with global store (#5000 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Rajini Sivaram	7ed7cca4c9	KAFKA-6893; Create processors before starting acceptor in SocketServer (#4999 )	7 years ago
Gunju Ko	c90bbc2749	MINOR: Fix typo in ConsumerRebalanceListener JavaDoc (#4996 )	7 years ago
Guozhang Wang	fa1702fece	MINOR: Remove deprecated valueTransformer.punctuate (#4993 ) Also removed the InternalValueTransformerWithKey / Supplier which is used to mock away the deprecated punctuate function. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Rajini Sivaram	830ee16d0d	MINOR: Update dynamic broker configuration doc for truststore update (#4954 ) Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Chia-Ping Tsai	4f7c11a1df	KAFKA-6870 Concurrency conflicts in SampledStat (#4985 ) Make `KafkaMetric.measurableValue` thread-safe Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Anna Povzner	9679c44d2b	KAFKA-6361: Fix log divergence between leader and follower after fast leader fail over (#4882 ) Implementation of KIP-279 as described here: https://cwiki.apache.org/confluence/display/KAFKA/KIP-279%3A+Fix+log+divergence+between+leader+and+follower+after+fast+leader+fail+over In summary: - Added leader_epoch to OFFSET_FOR_LEADER_EPOCH_RESPONSE - Leader replies with the pair( largest epoch less than or equal to the requested epoch, the end offset of this epoch) - If Follower does not know about the leader epoch that leader replies with, it truncates to the end offset of largest leader epoch less than leader epoch that leader replied with, and sends another OffsetForLeaderEpoch request. That request contains the largest leader epoch less than leader epoch that leader replied with. Reviewers: Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Guozhang Wang	0b1a118f45	KAFKA-6813: Remove deprecated APIs in KIP-182, Part II (#4976 ) 1. Remove the deprecated StateStoreSuppliers, and the corresponding Stores.create() functions and factories: only the base StateStoreSupplier and MockStoreSupplier were still preserved as they are needed by the deprecated TopologyBuilder and KStreamBuilder. Will remove them in a follow-up PR. 2. Add TopologyWrapper.java as the original InternalTopologyBuilderAccessor was removed, but I realized it is still needed as of now. 3. Minor: removed StateStoreTestUtils.java and inline its logic in its callers since now with StoreBuilder it is just a one-liner. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
tedyu	8fb5b37013	KAFKA-6878 Switch the order of underlying.init and initInternal (#4988 ) This is continuation of #4978. From Guozhang: I think to fix this issue, in init we could consider switching the steps of 1 and 2: initInternal(context); underlying.init(context, root); since volatile boolean open = false; it should be sufficient. In this case the check on step 3) will fail if underlying.init is not completed and we will throw InvalidStateStoreException. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Colin Patrick McCabe	abbd53da4a	KAFKA-6299; Fix AdminClient error handling when metadata changes (#4295 ) When AdminClient gets a NOT_CONTROLLER error, it should refresh its metadata and retry the request, rather than making the end-user deal with NotControllerException. Move AdminClient's metadata management outside of NetworkClient and into AdminMetadataManager. This will make it easier to do more sophisticated metadata management in the future, such as implementing a NodeProvider which fetches the leaders for topics. Rather than manipulating newCalls directly, the AdminClient service thread now drains it directly into pendingCalls. This minimizes the amount of locking we have to do, since pendingCalls is only accessed from the service thread.	7 years ago
tedyu	e32dcb9a66	KAFKA-6878: NPE when querying global state store not in READY state (#4978 ) Check whether cache is null before retrieving from cache. Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
asutosh936	5ca9ed5ede	KAFKA-6673: Implemented missing override equals method (#4745 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Rajini Sivaram	0ecb72f59d	KAFKA-6834: Handle compaction with batches bigger than max.message.bytes (#4953 ) Grow buffers in log cleaner to hold one message set after sanity check even if message set is bigger than max.message.bytes. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Colin Patrick McCabe	b27e098a7d	MINOR: Fix trace logging in ReplicaManager (#4916 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Adem Efe Gencer	7afcb3a64c	KAFKA-6877; Remove completedFetch upon a failed parse if it contains no records. This patch removed a completedFetch from the completedFetches queue upon a failed parse if it contains no records. The following scenario explains why this is needed for an instance of this case – i.e. in TopicAuthorizationException. 0. Let's assume a scenario, in which the consumer is attempting to read from a topic without the necessary read permission. 1. In Fetcher#fetchedRecords(), after peeking the completedFetches, the Fetcher#parseCompletedFetch(CompletedFetch) throws a TopicAuthorizationException (as expected). 2. Fetcher#fetchedRecords() passes the TopicAuthorizationException up without having a chance to poll completedFetches. So, the same completedFetch remains at the completedFetches queue. 3. Upon following calls to Fetcher#fetchedRecords(), peeking the completedFetches will always return the same completedFetch independent of any updates to the ACL that the topic is trying to read from. 4. Hence, despite the creation of an ACL with correct permissions, once the consumer sees the TopicAuthorizationException, it will be unable to recover without a bounce. Author: Adem Efe Gencer <agencer@linkedin.com> Reviewers: Jiangjie (Becket) Qin <becket.qin@gmail.com> Closes #4974 from efeg/fix/parseCompletedFetchRemainsInQueue	7 years ago
Roman Khlebnov	fcb15e357c	KAFKA-6292; Improve FileLogInputStream batch position checks to avoid type overflow (#4928 ) Switch from sum operations to subtraction to avoid type casting in checks and type overflow during `FileLogInputStream` work, especially in cases where property `log.segment.bytes` was set close to the `Integer.MAX_VALUE` and used as a `position` inside `nextBatch()` function. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
dan norwood	b328fc729b	MINOR: add equals()/hashCode() for Produced/Consumed (#4979 ) Reviewer: Matthias J. Sax <matthias@confluent.io>	7 years ago
Jason Gustafson	bce10794a0	KAFKA-6879; Invoke session init callbacks outside lock to avoid Controller deadlock (#4977 ) Fixes a deadlock between the controller's beforeInitializingSession callback which holds the zookeeper client initialization lock while awaiting completion of an asynchronous event which itself depends on the same lock. Also catch and log callback exceptions to ensure the ZooKeeper reconnection takes place. Finally, configure KafkaScheduler in ZooKeeperClient to have at least 1 thread. Added tests that fail or hang without the changes in this PR. Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Joan Goyeau	b88d70b532	MINOR: Make Serdes less confusing in Scala (#4963 ) Serdes are confusing in the Scala wrapper: * We have wrappers around Serializer, Deserializer and Serde which are not very useful. * We have Serdes in 2 places org.apache.kafka.common.serialization.Serde and in DefaultSerdes, instead we should be having only one place where to find all the Serdes. I wanted to do this PR before the release as this is a breaking change. This shouldn't add more so the current tests should be enough. Reviewers: Debasish Ghosh <dghosh@acm.org>, Guozhang Wang <guozhang@confluent.io>	7 years ago

5027 Commits (95b46a12e5a74da2699b3472c639cc82b28cea96)