src-kafka

Commit Graph

Author	SHA1	Message	Date
Rajini Sivaram	a6691fb79e	KAFKA-8091; Use commitSync to check connection failure in listener update test (#6450 ) The use of consumer.poll() made the test flaky since in some cases, it doesn't wait for coordinator connection. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
huxi	ee3b9c57fd	MINOR: Fix typos in LogValidator (#6449 )	6 years ago
Rajini Sivaram	3ec2ca5e33	KAFKA-7730; Limit number of active connections per listener in brokers (KIP-402) Adds a new listener config `max.connections` to limit the number of active connections on each listener. The config may be prefixed with listener prefix. This limit may be dynamically reconfigured without restarting the broker. This is one of the PRs for KIP-402 (https://cwiki.apache.org/confluence/display/KAFKA/KIP-402%3A+Improve+fairness+in+SocketServer+processors). Note that this is currently built on top of PR #6022 Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Gwen Shapira <cshapi@gmail.com> Closes #6034 from rajinisivaram/KAFKA-7730-max-connections	6 years ago
Rajini Sivaram	d436f32e6b	KAFKA-8091; Remove unsafe produce from dynamic listener update test (#6443 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Bill Bejeck	9ecadc4df4	MINOR: Use Java 8 lambdas in KStreamImplTest (#6430 ) Just a minor cleanup to use Java 8 lambdas vs anonymous classes in this test. I ran all tests in the streams test suite Reviewers: Matthias J. Sax <mjsax@apache.org>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Rajini Sivaram	0f83c4cdb7	KAFKA-7976; Update config before notifying controller of unclean leader update (#6426 ) When unclean leader election is enabled dynamically on brokers, we notify controller of the update before updating KafkaConfig. When processing this event, controller's decision to elect unclean leaders is based on the current KafkaConfig, so there is a small timing window when the controller may not elect unclean leader because KafkaConfig of the server was not yet updated. The commit fixes this timing window by using the existing BrokerReconfigurable interface used by other classes which rely on the current value of KafkaConfig. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
huxihx	2903606921	KAFKA-7801: TopicCommand should not be able to alter transaction topic partition count To align with the way it handles the offset topic, TopicCommand should not be able to alter transaction topic partition count. Author: huxihx <huxi_2b@hotmail.com> Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6109 from huxihx/KAFKA-7801	6 years ago
Rajini Sivaram	9aaa32b64d	KAFKA-8091; Wait for processor shutdown before testing removed listeners (#6425 ) DynamicBrokerReconfigurationTest.testAddRemoveSaslListeners removes a listener, waits for the config to be propagated to all brokers and then validates that connections to the removed listener fail. But there is a small timing window between config update and Processor shutdown. Before validating that connections to a removed listener fail, this commit waits for all metrics of the removed listener to be deleted, ensuring that the Processors of the listener have been shutdown. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
huxihx	1e1b669e9d	MINOR: Update delete topics zk path in assertion error messages - Update delete topics zk path from /admin/delete_topic to /admin/delete_topics in assertion error messages Author: huxihx <huxi_2b@hotmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6422 from huxihx/delete-topics	6 years ago
Manikumar Reddy	a42f16f980	KAFKA-7922: Return authorized operations in Metadata request response (KIP-430 Part-2) - Use automatic RPC generation in Metadata Request/Response classes - https://cwiki.apache.org/confluence/display/KAFKA/KIP-430+-+Return+Authorized+Operations+in+Describe+Responses Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #6352 from omkreddy/KIP-430-METADATA	6 years ago
Suman BN	65aea1f362	MINOR: Print usage when parse fails during console producer Handle OptionException while parsing options when using console producer and print usage before exiting. Author: Suman BN <sumannewton@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6386 from sumannewton/console-producer-parse-printusage	6 years ago
Zhanxiang (Patrick) Huang	44be5d2221	KAFKA-8069; Fix early expiration of offsets due to invalid loading of expire timestamp (#6401 ) After the 2.1 release, if the broker hasn't been upgraded to the latest inter-broker protocol version, the committed offsets stored in the __consumer_offsets topic will get cleaned up far earlier than they should be when the offsets are loaded back from the __consumer_offsets topic in GroupCoordinator, which will happen during leadership transition or after broker bounce. This patch fixes the bug by setting expireTimestamp to None if it is the default value after loading v1 offset records from __consumer_offsets. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	47bc85f2e7	KAFKA-7980 - Fix timing issue in SocketServerTest.testConnectionRateLimit (#6391 ) Test currently checks that there were at least 5 polls when 5 connections are established with connectionQueueSize=1. But we could be doing the check just after the 5th connection before the 5th poll, so updated the check to verify that there were at least 4 polls. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	460e46c3bb	KAFKA-7831; Do not modify subscription state from background thread (#6221 ) Metadata may be updated from the background thread, so we need to protect access to SubscriptionState. This patch restructures the metadata handling so that we only check pattern subscriptions in the foreground. Additionally, it improves the following: 1. SubscriptionState is now the source of truth for the topics that will be fetched. We had a lot of messy logic previously to try and keep the topic set in Metadata consistent with the subscription, so this simplifies the logic. 2. The metadata needs for the producer and consumer are quite different, so it made sense to separate the custom logic into separate extensions of Metadata. For example, only the producer requires topic expiration. 3. We've always had an edge case in which a metadata change with an inflight request may cause us to effectively miss an expected update. This patch implements a separate version inside Metadata which is bumped when the needed topics change. 4. This patch removes the MetadataListener, which was the cause of https://issues.apache.org/jira/browse/KAFKA-7764. Reviewers: David Arthur <mumrah@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
KartikVK	f708e78294	KAFKA-7950; Update description for the "time" parameter for GetOffsetShell Added additional description for the "time" parameter for GetOffsetShell which adds " No offset is returned if timestamp provided is greater than recently committed record timestamp." in the description. Author: KartikVK <karthikkalaghatgi123@gmail.com> Reviewers: huxi <huxi_2b@hotmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6357 from Kartikvk1996/kartik-branch	6 years ago
Jun Rao	73737892d5	KAFKA-8018: Flaky Test SaslSslAdminClientIntegrationTest#testLegacyAclOpsNeverAffectOrReturnPrefixed Disable forceSync in EmbeddedZookeeper. Increase ZK tick to allow longer maxSessionTimeout in tests. Increase ZK client session timeout in tests. Handle transient ZK session expiration exception in test utils for createTopic. Author: Jun Rao <junrao@gmail.com> Reviewers: Guozhang Wang <wangguoz@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #6354 from junrao/KAFKA-8018	6 years ago
Rajini Sivaram	2bb74bfc3b	KAFKA-7979 - Clean up threads and increase timeout in PartitionTest (#6378 ) Stack trace generated from the test failure shows that test failed even though threads were runnable and making progress, indicating that the timeout may be too small when test machine is slow. Increasing timeout from 10 to 15 seconds, consistent with the default wait in other tests. Thread dump also showed a lot of left over threads from other tests, so added clean up of those as well. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Rajini Sivaram	3747f55336	KAFKA-7976 - Fix DynamicBrokerReconfigurationTest.testUncleanLeaderElectionEnable (#6374 ) Ensure that controller is not shutdown in the test. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Manikumar Reddy	9ee5f920d5	KAFKA-7312: Change broker port used in testMinimumRequestTimeouts and testForceClose Port 22 is used by ssh, which causes the AdminClient to throw an OOM: > java.lang.OutOfMemoryError: Java heap space > at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57) > at java.nio.ByteBuffer.allocate(ByteBuffer.java:335) > at org.apache.kafka.common.memory.MemoryPool$1.tryAllocate(MemoryPool.java:30) > at org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:112) > at org.apache.kafka.common.network.KafkaChannel.receive(KafkaChannel.java:424) > at org.apache.kafka.common.network.KafkaChannel.read(KafkaChannel.java:385) > at org.apache.kafka.common.network.Selector.attemptRead(Selector.java:640) > at org.apache.kafka.common.network.Selector.pollSelectionKeys(Selector.java:561) > at org.apache.kafka.common.network.Selector.poll(Selector.java:472) > at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:535) > at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.run(KafkaAdminClient.java:1140) > at java.lang.Thread.run(Thread.java:748) > > Author: Manikumar Reddy <manikumar.reddy@gmail.com> Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #6360 from omkreddy/KAFKA-7312	6 years ago
Mickael Maison	0d56f14135	KAFKA-7997: Use automatic RPC generation in SaslAuthenticate Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6324 from mimaison/sasl-authenticate	6 years ago
Bob Barrett	57604c2331	KAFKA-8002; Log dir reassignment stalls if future replica has different segment base offset (#6346 ) This patch fixes a bug in log dir reassignment where Partition.maybeReplaceCurrentWithFutureReplica would compare the entire LogEndOffsetMetadata of each replica to determine whether the reassignment has completed. If the active segments of both replicas have different base segments (for example, if the current replica had previously been cleaned and the future replica rolled segments at different points), the reassignment will never complete. The fix is to compare only the LogEndOffsetMetadata.messageOffset for each replica. Tested with a unit test that simulates the compacted current replica case. Reviewers: Anna Povzner <anna@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Manikumar Reddy	f11fa5ef40	KAFKA-7922: Return authorized operations in describe consumer group responses (KIP-430 Part-1) - Use automatic RPC generation in DescribeGroups Request/Response classes Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #6322 from omkreddy/KIP-430-Return-Ops	6 years ago
Colin Hicks	70828cea49	KAFKA-8012; Ensure partitionStates have not been removed before truncating. (#6333 ) This patch fixes a regression in the replica fetcher which occurs when the replica fetcher manager simultaneously calls `removeFetcherForPartitions`, removing the corresponding partitionStates, while a replica fetcher thread attempts to truncate the same partition(s) in `truncateToHighWatermark`. This causes an NPE which causes the fetcher to crash. This change simply checks that the `partitionState` is not null first. Note that a similar guard exists in `truncateToEpochEndOffsets`. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
lambdaliu	33e005994d	MINOR: Skip quota check when replica is in sync (#6344 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Bob Barrett	18b3a878a0	MINOR: Improve logging for alter log dirs (#6302 ) This patch adds several new log messages to provide more information about errors during log dir movement and to make it clear when each partition movement is finished. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Stanislav Kozlovski	f667f573ff	Address flakiness of CustomQuotaCallbackTest#testCustomQuotaCallback (#6330 )	6 years ago
Gardner Vickers	bd6520a22a	KAFKA-7956 In ShutdownableThread, immediately complete the shutdown if the thread has not been started (#6218 ) In some test cases it's desirable to instantiate a subclass of `ShutdownableThread` without starting it. Since most subclasses of `ShutdownableThread` put cleanup logic in `ShutdownableThread.shutdown()`, being able to call `shutdown()` on the non-running thread would be useful. This change allows us to avoid blocking in `ShutdownableThread.shutdown()` if the thread's `run()` method has not been called. We also add a check that `initiateShutdown()` was called before `awaitShutdown()`, to protect against the case where a user calls `awaitShutdown()` before the thread has been started, and unexpectedly is not blocked on the thread shutting down. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	66a6fc7204	MINOR: Refactor replica log dir fetching for improved logging (#6313 ) In order to debug problems with log directory reassignments, it is helpful to know when the fetcher thread begins moving a particular partition. This patch refactors the fetch logic so that we stick to a selected partition as long as it is available and log a message when a different partition is selected. Reviewers: Viktor Somogyi-Vass <viktorsomogyi@gmail.com>, Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	6 years ago
Mickael Maison	4824dc994d	KAFKA-7972: Use automatic RPC generation in SaslHandshake Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6301 from mimaison/sasl-handshake	6 years ago
Gwen (Chen) Shapira	e82cc50ddb	KAFKA-7938: Fix test flakiness in DeleteConsumerGroupsTest (#6312 )	6 years ago
Gwen (Chen) Shapira	0150dbc1d0	KAFKA-7937: Fix Flaky Test ResetConsumerGroupOffsetTest.testResetOffsetsNotExistingGroup (#6311 )	6 years ago
Ryan Chen	217f45ed55	KAFKA-7864; validate partitions are 0-based (#6246 ) Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>, Jun Rao <junrao@gmail.com>	6 years ago
huxi	201da05427	KAFKA-7763; Calls to commitTransaction and abortTransaction should not block indefinitely (#6066 ) Currently, commitTransaction and abortTransaction wait indefinitely for the respective operation to be completed. This patch uses the producer's max block time to limit the time that we will wait. If the timeout elapses, we raise a TimeoutException, which allows the user to either close the producer or retry the operation. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	f775be0514	MINOR: Handle Metadata v0 all topics requests during parsing (#6300 ) Use of `MetadataRequest.isAllTopics` is not consistently defined for all versions of the api. For v0, it evaluates to false. This patch makes the behavior consistent for all versions. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Chia-Ping Tsai	35a0de32ee	KAFKA-6161 Add default implementation to close() and configure() for Serdes (#5348 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Murad	8de3092b05	KAFKA-7930: topic is not internal if explicitly listed in args (#6267 ) Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Zhanxiang (Patrick) Huang	2932d32afb	KAFKA-7283: Enable lazy mmap on index files and skip sanity check for segments below recovery point (#5498 ) Per the KIP-263 discussion, we think we can improve broker restart time by avoiding performing costly disk operations when sanity checking index files for segments below recovery point on broker startup. This PR includes the following changes: 1. Mmap the index file and populate fields of the index file on-demand rather than performing costly disk operations when creating the index object on broker startup. 2. Skip sanity checks on the time index and offset index of segments. 1. For segments with offset below the flushed point (recovery point), these segments are safely flushed so we don't need to sanity check the index files. If there is indeed data corruption on disk, given that we don't sanity check the segment file, sanity checking only the indexes adds little benefit. 2. For segments with offset above the flushed point (recovery point), we will recover these segments in `recoveryLog()` (Log.scala) in any case so sanity checking the index files for these segments is redundant. We did experiments on a cluster with 15 brokers, each of which has ~3k segments (and there are 31.8k partitions with RF=3 which are evenly distributed across brokers; total bytes-in-rate is around 400 MBps). The results show that rolling bounce time reduces from 135 minutes to 55 minutes. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	6 years ago
Lee Dongjin	71a7219dfd	KAFKA-7920; Do not permit zstd produce requests until IBP is updated to 2.1 (#6256 ) Fail produce requests using zstd until the inter.broker.protocol.version is large enough that replicas are ensured to support it. Otherwise, followers receive the `UNSUPPORTED_COMPRESSION_TYPE` when fetching zstd data and ISRs shrink. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Arjun Satish	4cb8f56b45	KAFKA-7909; Ensure timely rebalance completion after pending members rejoin or fail (#6251 ) Fix the following situations, where pending members (one that has a member-id, but hasn't joined the group) can cause rebalance operations to fail: - In AbstractCoordinator, a pending consumer should be allowed to leave. - A rebalance operation must successfully complete if a pending member either joins or times out. - During a rebalance operation, a pending member must be able to leave a group. Reviewers: Boyang Chen <bchen11@outlook.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	e17352f03e	KAFKA-7487: DumpLogSegments misreports offset mismatches (#5756 ) - Compare last offset of first batch (instead of first offset) with index offset - Early exit from loop due to zero entries must happen before checking for mismatch - {TimeIndex,OffsetIndex}.entry should return absolute offset like other methods. These methods are only used by DumpLogSegments. - DumpLogSegments now calls `closeHandlers` on OffsetIndex, TimeIndex and FileRecords. - Add OffsetIndex, TimeIndex and DumpLogSegments tests - Remove unnecessary casts by using covariant returns in OffsetIndex and TimeIndex - Minor clean-ups - Fix `checkArgs` so that it does what it says only. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Sriharsha Chintalapani <sriharsha@apache.org>	6 years ago
Ismael Juma	45a896e741	KAFKA-7935: UNSUPPORTED_COMPRESSION_TYPE if ReplicaManager.getLogConfig returns None (#6274 ) Replaced `forall` with `exists`. Added a unit test to `KafkaApisTest` that failed before the change. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Kyle Ambroff-Kao	decc09b012	KAFKA-6569: Move OffsetIndex/TimeIndex logger to companion object (#4586 ) We identified that we spend a lot of time in the creation of Logger instances when creating OffsetIndex/TimeIndex due to the Logging mixin. When the broker is bootstrapping it's just doing this in a tight loop, so the time adds up. This patch moves the logger to the companion objects of OffsetIndex, TimeIndex and AbstractIndex resolving this issue. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Co-authored-by: Kyle Ambroff <kyle@ambroff.com> Co-authored-by: Ismael Juma <ismael@juma.me.uk>	6 years ago
Lee Dongjin	44257f2937	KAFKA-7884; Docs for message.format.version should display valid values (#6209 ) The config docs for message.format.version and log.message.format.version show invalid (corrupt?) "valid values". The problem is that `ApiVersionValidator#toString` is missing. In contrast, all other Validators like `ThrottledReplicaListValidator` or `Range` have their own `toString` method. This patch solves this problem by adding `ApiVersionValidator#toString`. It also provides a unit test for it. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	a421dd2a26	MINOR: Fix bugs identified by compiler warnings (#6258 ) - Add missing string interpolation - Fix and simplify testElectPreferredLeaders - Remove unused code - Replace deprecated usage of JUnit `assertThat` - Change var to val and fix non-exhaustive pattern match - Fix eta warning - Simplify code - Remove commented out code Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Manikumar Reddy	8b97c3d8a9	MINOR: Add missing Alter Operation to Topic supported operations list in AclCommand - Update the AclCommandTest Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #6263 from omkreddy/aclcommand	6 years ago
Mickael Maison	62b22496c2	MINOR: Fixed a couple of typos in Config docs Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Gwen Shapira Closes #6259 from mimaison/config-typos	6 years ago
Ismael Juma	c7f99bc2bd	MINOR: Update JUnit to 4.13 and annotate log cleaner integration test (#6248 ) JUnit 4.13 fixes the issue where `Category` and `Parameterized` annotations could not be used together. It also deprecates `ExpectedException` and `assertThat`. Given this, we: - Replace `ExpectedException` with the newly introduced `assertThrows`. - Replace `Assert.assertThat` with `MatcherAssert.assertThat`. - Annotate `AbstractLogCleanerIntegrationTest` with `IntegrationTest` category. Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>, David Arthur <mumrah@gmail.com>	6 years ago
Kevin Lu	3fb1d70b6b	KAFKA-7236: Add --under-min-isr option to describe topics command (KIP-351) (#6224 ) * KAFKA-7236: Add --under-min-isr option to describe topics command (KIP-351) * Minor changes to description and make test consistent with others * Fix option, and add additional test with mixed partition status * Add fully-replicated-topic to test case * Address review nits	6 years ago
Jason Gustafson	d152989f26	KAFKA-7897; Disable leader epoch cache when older message formats are used (#6232 ) When an older message format is in use, we should disable the leader epoch cache so that we resort to truncation by high watermark. Previously we updated the cache for all versions when a broker became leader for a partition. This can cause large and unnecessary truncations after leader changes because we relied on the presence of _any_ cached epoch in order to tell whether to use the improved truncation logic possible with the OffsetsForLeaderEpoch API. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Viktor Somogyi-Vass <viktorsomogyi@gmail.com>, Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	8147846acb	KAFKA-7540; Retry coordinator lookup to fix transient failure in ConsumerBounceTest (#6235 ) Add logic in ConsumerBounceTest to check the error code in FindCoordinator responses and retry if needed. This should help with transient failures or at least get us closer to the actual problem. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago

2611 Commits (c758122ce59674ec3e33618d896e4e5cdbb45e87)