The toString() for ConfigResource was using { } instead of ( ), which is inconsistent with the existing toString implementations in the code, while the toString() for Resource was using a mix of ( and }.
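For illustration, a consistent format might look like the following sketch (field names are assumptions, not necessarily the exact ConfigResource implementation):

```java
@Override
public String toString() {
    // parentheses, matching the other toString implementations in the codebase
    return "ConfigResource(type=" + type + ", name='" + name + "')";
}
```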
This patch fixes an edge case in producer shutdown in which `close()` cannot complete because of a pending request that will never be sent once shutdown has been initiated. I have added a test case which reproduces the scenario.
Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Implementation of KIP-86. Client, server and login callback handlers have been made configurable for both brokers and clients.
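As a sketch of how a client might plug in custom handlers under KIP-86 (the handler class names here are hypothetical examples):

```java
Properties saslProps = new Properties();
// KIP-86 client-side configs; the handler classes are placeholders for user implementations
saslProps.put("sasl.client.callback.handler.class", "com.example.MyClientCallbackHandler");
saslProps.put("sasl.login.callback.handler.class", "com.example.MyLoginCallbackHandler");
```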
Reviewers: Jun Rao <junrao@gmail.com>, Ron Dagostino <rndgstn@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>
The invalid topic name is already handled locally, so it is unnecessary to send the DeleteTopicsRequest. This PR also adds a count to MockClient for testing.
Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>
Ignore headers when down-converting to V0/V1 since they are not supported. Added a test case to verify down-conversion sanity in the presence of headers.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
In the group coordinator, we currently check whether the partition is owned before checking whether it is loading. Since loading is a prerequisite for partition ownership, it means that it is not actually possible to see the COORDINATOR_LOAD_IN_PROGRESS error. The impact is mostly harmless: while loading the group, the client may send unnecessary FindCoordinator requests to rediscover the coordinator. I've fixed the bug and restructured the code to enable testing.
In the process of fixing this bug, the following improvements have been made:
1. We now verify valid groupId in all request handlers.
2. Currently if the coordinator is loading when a SyncGroup is received, we'll return NOT_COORDINATOR. I've changed this to return REBALANCE_IN_PROGRESS since the rebalance state will have been lost on coordinator failover. This effectively forces the consumer to rejoin the group, which seems preferable to unnecessarily rediscovering the coordinator.
3. I added a check for the COORDINATOR_LOAD_IN_PROGRESS handler in SyncGroup. Although we do not currently return this error, it seems reasonable that we might want to some day, so it seems better to get the check in now.
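A minimal sketch of the corrected check order (Java-style pseudocode; the helper method names are assumptions, not the actual GroupCoordinator code):

```java
Errors validateGroupStatus(String groupId) {
    if (isInvalidGroupId(groupId))               // improvement 1: validate the groupId in all request handlers
        return Errors.INVALID_GROUP_ID;
    if (isCoordinatorLoadInProgress(groupId))    // check loading before ownership so this error can actually be returned
        return Errors.COORDINATOR_LOAD_IN_PROGRESS;
    if (!isCoordinatorFor(groupId))
        return Errors.NOT_COORDINATOR;
    return Errors.NONE;
}
```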
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Currently the `initTransactions()` API blocks indefinitely if the broker cannot be reached. This patch changes the behavior to raise a `TimeoutException` after waiting for `max.block.ms`.
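For example (a sketch; the broker address and transactional id are placeholders):

```java
Properties props = new Properties();
props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
props.put(ProducerConfig.TRANSACTIONAL_ID_CONFIG, "demo-txn");
props.put(ProducerConfig.MAX_BLOCK_MS_CONFIG, 10000);
try (Producer<String, String> producer =
         new KafkaProducer<>(props, new StringSerializer(), new StringSerializer())) {
    producer.initTransactions(); // now raises TimeoutException if the broker is unreachable for max.block.ms
} catch (org.apache.kafka.common.errors.TimeoutException e) {
    // the caller can retry or fail fast instead of blocking forever
}
```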
Reviewers: Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io>
DynamicBrokerReconfigurationTest currently assumes that passwords encoded with one secret will fail with an exception if decoded with another secret, and it configures an old.secret in setUp. This could potentially cause test failures if a password were incorrectly decoded with the wrong secret, since the test writes passwords encoded with the new secret directly to ZooKeeper. Since old.secret is only used in one test for verifying secret rotation, this config can be moved to that test to avoid transient failures.
Reviewers: Ismael Juma <ismael@juma.me.uk>
Added configs to ProducerBenchSpec:
topicPrefix: topic names will be of the form topicPrefix + topic index. If not provided, the default is "produceBenchTopic".
partitionsPerTopic: number of partitions per topic. If not provided, the default is 1.
replicationFactor: replication factor per topic. If not provided, the default is 3.
The behavior of the producer bench is changed such that if some or all topics already exist (with topic names = topicPrefix + topic index) and they have the same number of partitions as requested, the worker uses those topics and does not fail. The producer bench fails if one or more existing topics has a number of partitions different from the expected number of partitions.
Added unit test for WorkerUtils -- for existing methods and new methods.
Fixed a bug in MockAdminClient where createTopics() would overwrite an existing topic's replication factor and number of partitions even while correctly completing the appropriate futures exceptionally with TopicExistsException.
Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
This is a general change and is a prerequisite for running the Streams benchmark with different Streams tests. For the Streams benchmark itself I will have a separate PR for switching configs. Details:
1. Create a "streams.properties" file under PERSISTENT_ROOT before all the Streams tests. For now it will only contain a single config, state.dir, pointing to PERSISTENT_ROOT.
2. For all the system test related code, replace the main function parameter of state.dir with propsFilename, then inside the function load the props from the file and apply overrides if necessary.
3. Minor fixes.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>
KafkaFutureImpl#addWaiter should be protected, just like KafkaFuture#addWaiter. As described in KIP-218, whenComplete is the public API, not addWaiter.
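For example, client code should rely on the public API (a sketch, assuming an existing AdminClient instance named adminClient):

```java
KafkaFuture<Set<String>> topicNames = adminClient.listTopics().names();
// whenComplete is the supported public hook (KIP-218); addWaiter is an internal detail
topicNames.whenComplete((names, error) -> {
    if (error != null)
        System.err.println("listTopics failed: " + error);
    else
        System.out.println("Topics: " + names);
});
```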
Use the exact first offset of the message set when rolling a log segment. This is possible for message format V2 and beyond without any performance penalty, because the first offset is stored in the header. This augments the fix made in KAFKA-4451 to avoid using the heuristic for V2 and beyond messages.
Added unit tests to simulate cases where segment needs to roll because of overflow in index offsets. Verified that the new segment created in these cases uses the first offset, instead of the heuristic in use previously.
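A sketch of the idea (illustrative only, not the actual log-roll code):

```java
// for message format v2, the batch header carries the exact base offset, so the new
// segment can be rolled at that offset instead of relying on the old heuristic
static long exactFirstOffset(MemoryRecords records) {
    RecordBatch firstBatch = records.batches().iterator().next();
    return firstBatch.baseOffset();
}
```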
Remove unnecessary null check in StringDeserializer, MockProducerInterceptor and KStreamImpl.
Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Jason Gustafson <jason@confluent.io>
Currently in KafkaAdminClient.describeTopics(), a complete map of cluster state and errors is constructed for every topic and partition in the request, which unnecessarily increases the complexity of describeTopics() to O(n^2). This patch improves the complexity to O(n).
Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>
Now that we have augmented the windowed serdes with no-arg constructors, extract them out as part of the public API so that users who want to read and write windowed streams can use them. This was originally introduced by @vitaly-pushkar
This PR grew into a much larger one, as I found a few tech debts and bugs while working on it. Here is a summary of the PR:
Public API changes (I will propose a KIP after a first round of reviews):
Add TimeWindowedSerializer, TimeWindowedDeserializer, SessionWindowedSerializer, SessionWindowedDeserializer into o.a.k.streams.kstream. The serializers implement an internal WindowedSerializer interface exposing the serializeBaseKey function used in 3) below.
Add WindowedSerdes into o.a.k.streams.kstream. The reason not to add them into o.a.k.clients' Serdes is that clients would then need a dependency on streams.
Add "default.windowed.key.serde.inner" and "default.windowed.value.serde.inner" into StreamsConfig, used when "default.key.serde" is set to a time or session windowed serde. Note these configs take the serde class, not the type class (see the usage sketch below).
Consolidated the serde formats from multiple classes, including SessionKeySerde.java for session windows and WindowStoreUtils for time windows, into SessionKeySchema and WindowKeySchema.
Bug fix: WindowedStreamPartitioner needs to consider both time window and session window serdes.
Removed the RocksDBWindowBytesStore etc. optimization since after KIP-182 all the serde work now happens in the metered store, hence this optimization is no longer worthwhile.
Bug fix: for time windows, the serdes used for the store and the serdes used for piping (source and sink nodes) are different: the former needs to append a sequence number, but the latter does not.
Other minor cleanups: remove unnecessary throws, etc.
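For example, using the new public serdes and configs (a sketch based on the API described above; the exact constant and method names are assumptions):

```java
// construct a windowed serde directly ...
Serde<Windowed<String>> windowedKeySerde = WindowedSerdes.timeWindowedSerdeFrom(String.class);

// ... or configure the defaults via StreamsConfig
Properties config = new Properties();
config.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, WindowedSerdes.TimeWindowedSerde.class);
config.put("default.windowed.key.serde.inner", Serdes.String().getClass());
```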
Authors: Guozhang Wang <wangguoz@gmail.com>, Vitaly Pushkar <vitaly.pushkar@gmail.com>
Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bill@confluent.io>, Xi Hu
Logging can get spammy during the reconnect blackout period because any requests we send to ConsumerNetworkClient will immediately be failed when poll() returns. This patch checks for connection failures prior to sending fetches and offset lookups and skips sending to any failed nodes. Test cases added for both.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
NetworkClient should use FIFO order when completing inflight requests following a disconnect.
I've added new unit tests for `InFlightRequests` and `NetworkClient` which verify completion order.
Reviewers: Jun Rao <junrao@gmail.com>
We need to reset the auto-commit deadline after sending the offset commit request so that we do not resend it while the request is still inflight.
Added unit tests ensuring this behavior and proper backoff in the case of a failure.
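The intended pattern, sketched in simplified form (not the actual ConsumerCoordinator code; names are illustrative):

```java
// reset the deadline when the commit request is sent, not when its response arrives,
// so an inflight auto-commit is never re-sent
if (autoCommitEnabled && now >= nextAutoCommitDeadline) {
    sendAutoCommitOffsetsAsync();
    nextAutoCommitDeadline = now + autoCommitIntervalMs;
}
```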
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Contention for the lock in ConsumerNetworkClient can lead to a livelock situation in which an active commitSync is unable to make progress because its completion is blocked in the heartbeat thread. The fix is twofold:
1) We change ConsumerNetworkClient to use a fair lock (sketched below) to reduce the chance of either thread getting starved.
2) We eliminate the dependence on the lock in ConsumerNetworkClient for callback completion so that callbacks will not be blocked by an active poll().
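For reference, part (1) amounts to something like the following (a sketch; the field name is an assumption):

```java
// java.util.concurrent.locks.ReentrantLock with fairness enabled: the longest-waiting
// thread acquires the lock next, so the heartbeat thread cannot repeatedly starve an
// application thread blocked in commitSync()
private final ReentrantLock lock = new ReentrantLock(true);
```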
Reviewers: Guozhang Wang <wangguoz@gmail.com>
The fix is twofold:
For tasks that are closed in closeZombieTask, their corresponding partitions are still in runningByPartition, so those closed tasks may still be returned from activeTasks and standbyTasks. We add guards on the returned tasks: if they are closed, notify the thread to trigger a rebalance immediately.
When triggering a rebalance, unsubscribe and re-subscribe immediately to make sure we are not dependent on the background heartbeat thread's timing.
Some minor log4j changes. More specifically, I moved the closeZombieTask log entry to its callers, with more context information and the action about to be taken.
I can reproduce the issue by hand-coding the heartbeat thread in EosIntegrationTest to simulate a GC pause, and have confirmed this patch fixes the issue. Unfortunately this test cannot be added to AK since we currently do not have a way to manipulate the heartbeat thread in unit tests.
Reviewers: Jason Gustafson <jason@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>
Fixes lgtm.com warnings.
Cleans up PrintForeachAction and Printed.
Author: Matthias J. Sax <matthias@confluent.io>
Reviewers: Sebastian Bauersfeld <sebastianbauersfeld@gmx.de>, Damian Guy <damian@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>
This patch fixes a bug in the validation of the inter-broker protocol and the message format version. We should allow the configured message format api version to be greater than the inter-broker protocol api version as long as the actual message format versions are equal. For example, if the message format version is set to 1.0, it is fine for the inter-broker protocol version to be 0.11.0 because they both use message format v2.
I have added a unit test which checks compatibility for all combinations of the message format version and the inter-broker protocol version.
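Concretely, a combination like the following is now accepted (a sketch of the two broker configs; both versions map to message format v2):

```java
Properties brokerProps = new Properties();
brokerProps.put("inter.broker.protocol.version", "0.11.0");
brokerProps.put("log.message.format.version", "1.0"); // allowed: 1.0 and 0.11.0 both use message format v2
```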
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes #4583 from hachikuji/KAFKA-6328-REOPENED
Add validation checks that the offset range is valid and aligned with the batch count prior to appending to the log. Several unit tests have been added to verify the various invalid cases.
Make it clear in the docs that the rebalance listener is only invoked during an active call to `poll()`. Plus a few additional doc cleanups.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
Currently we hold onto all Records references in a multi-partition fetch response until the full response has completed. This can be a problem when the records have been down-converted since they will be occupying a (potentially large) chunk of memory. This patch changes the behavior in MultiSend so that once a Send is completed, we no longer keep a reference to it, which will allow the Records objects to be freed sooner.
I have added a simple unit test to verify that sends are removed as the MultiSend progresses.
Reviewers: Ismael Juma <ismael@juma.me.uk>
`Node` is immutable so this is safe.
With 100 brokers, 150 topics and 350 partitions, `HashSet.contains` in `RecordAccumulator.ready` took about 40% of the application time. This was caused by re-calculating the hash code of the leader (a Node instance) for every batch entry. Caching the hashCode reduced the time spent in `HashSet.contains` in `RecordAccumulator.ready` to ~2%. The measurements were taken with Flight Recorder.
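A minimal sketch of the caching pattern (illustrative; not the exact `Node` fields or implementation, and equals is omitted for brevity):

```java
public final class Node {
    private final int id;
    private final String host;
    private final int port;
    private Integer hash; // caching is safe because all fields are immutable

    public Node(int id, String host, int port) {
        this.id = id;
        this.host = host;
        this.port = port;
    }

    @Override
    public int hashCode() {
        Integer h = this.hash;
        if (h == null) {
            h = java.util.Objects.hash(id, host, port);
            this.hash = h; // benign race: at worst the hash is computed more than once
        }
        return h;
    }
}
```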
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ted Yu <yuzhihong@gmail.com>, Ismael Juma <ismael@juma.me.uk>
ProducerBatch retains references to MemoryRecordsBuilder and cannot be freed until acks are received. Removing references to buffers used for compression after records are built will enable these to be garbage collected sooner, reducing the risk of OOM.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>, Lothsahn <Lothsahn@gmail.com>
This fixes two alerts flagged on lgtm.com for Apache Kafka.
The first commit addresses a dead-code alert: InvalidTypeIdException indirectly extends JsonMappingException, and the flagged type test appears after the type test for the latter, which makes its body dead. I opted to change the order of the tests. Please let me know if this is the intended behavior.
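The pattern of the fix, sketched (not the exact code):

```java
import com.fasterxml.jackson.databind.JsonMappingException;
import com.fasterxml.jackson.databind.exc.InvalidTypeIdException;

static String classify(Throwable cause) {
    // InvalidTypeIdException is a subtype of JsonMappingException, so it must be tested first;
    // testing the supertype first makes the subtype branch unreachable (the dead code flagged by lgtm.com)
    if (cause instanceof InvalidTypeIdException)
        return "invalid type id";
    else if (cause instanceof JsonMappingException)
        return "other mapping error";
    return "unknown";
}
```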
The second commit addresses this out-of-bounds alert.
More alerts can be found here. Note that my colleague Aditya Sharad addressed some of those in the now outdated #2939.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
Added a second check for the race condition where the store changelog topic is updated during restore, but not if it is a KTable changelog topic. This will be tricky to test, but I wanted to push the PR to get feedback on the approach.
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <matthias@confluent.io>
Prior to this patch, the consumer always blocks in poll() if there are any partitions which are awaiting their initial positions. This behavior was inconsistent with normal fetch behavior since we allow fetching on available partitions even if one or more of the assigned partitions becomes unavailable _after_ initial offset lookup. With this patch, the consumer will do offset resets asynchronously, which allows other partitions to make progress even if the initial positions for some partitions cannot be found.
I have added several new unit tests in `FetcherTest` and `KafkaConsumerTest` to verify the new behavior. One minor compatibility implication worth mentioning is apparent from the change I made in `DynamicBrokerReconfigurationTest`. Previously it was possible to assume that all partitions had a fetch position after `poll()` completed with a non-empty assignment. This assumption is no longer generally true, but you can force the positions to be updated using the `position()` API which still blocks indefinitely until a position is available.
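For example (a sketch; the topic name and timeout are placeholders):

```java
consumer.subscribe(Collections.singletonList("my-topic"));
// poll() no longer blocks until every assigned partition has an initial position
ConsumerRecords<String, String> records = consumer.poll(100);
for (TopicPartition tp : consumer.assignment()) {
    long position = consumer.position(tp); // position() still blocks until a position is available
}
```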
Note that this patch also removes the logic to cache committed offsets in `SubscriptionState` since it was no longer needed (the consumer's `committed()` API always does an offset lookup anyway). In addition to avoiding the complexity of maintaining the cache, this avoids wasteful offset lookups to refresh the cache when `commitAsync()` is used.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
* do not use static properties
* use new object to take appID
* capture timeout exception inside condition
Reviewers: Matthias J. Sax <matthias@confluent.io>