A broker can have more than one instance of ZooKeeperClient. For example, SimpleAclAuthorizer creates a separate ZooKeeperClient instance when configured.
This commit makes it possible to optionally specify a name for a ZooKeeperClient instance. The name is specified for a broker's ZooKeeperClient instances, but not for those created by commands and tests.
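A minimal, hypothetical sketch of the idea (this is not the real ZooKeeperClient constructor): thread an optional name into the client's log prefix so that log lines from the broker's different instances can be told apart.

```scala
// Hypothetical, simplified sketch -- not the actual ZooKeeperClient signature.
object NamedZkClientSketch {
  class ZkClientLike(connectString: String, name: Option[String]) {
    // Prefix every log line with the instance name, if one was given.
    private val logIdent = name.map(n => s"[ZooKeeperClient $n] ").getOrElse("[ZooKeeperClient] ")
    def info(msg: String): Unit = println(logIdent + msg)
  }

  def main(args: Array[String]): Unit = {
    val brokerClient = new ZkClientLike("localhost:2181", Some("Kafka server"))
    val aclClient    = new ZkClientLike("localhost:2181", Some("ACL authorizer"))
    brokerClient.info("connected")
    aclClient.info("connected")
  }
}
```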
Reviewers: Jun Rao <junrao@gmail.com>
`map` was being used to convert `Iterable[Integer]` to `Iterable[Int]`. That operation accounted for 11% of the total CPU time we measured under load.
We also expect a positive impact on GC.
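A hedged illustration of the cost (collection names are hypothetical; Scala 2.13 converters assumed): the `map`-based conversion allocates a new collection and unboxes every element on each call, while a direct membership check boxes only the key.

```scala
import scala.jdk.CollectionConverters._

object BoxedIsrSketch {
  // What the hot path effectively did: build a new Scala collection and unbox
  // every element on each call.
  def isrAsInts(javaIsr: java.util.List[Integer]): Iterable[Int] =
    javaIsr.asScala.map(_.toInt)

  // Cheaper membership check: no intermediate collection; only the key is boxed.
  def isrContains(javaIsr: java.util.List[Integer], brokerId: Int): Boolean =
    javaIsr.contains(brokerId: Integer)
}
```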
Reviewers: Joel Koshy <jjkoshy@gmail.com>, Ismael Juma <ismael@juma.me.uk>
The flaky failure is caused by the main thread sometimes issuing the DescribeConsumerGroup request before the consumer's assignment takes effect. Added a latch to make sure this situation cannot happen.
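A hedged sketch of the latch (topic name, timeout, and consumer construction are placeholders): the consumer thread counts the latch down once partitions are assigned, and the main thread awaits it before issuing the DescribeConsumerGroup request.

```scala
import java.time.Duration
import java.util.Collections
import java.util.concurrent.{CountDownLatch, TimeUnit}
import org.apache.kafka.clients.consumer.{ConsumerRebalanceListener, KafkaConsumer}
import org.apache.kafka.common.TopicPartition

object AssignmentLatchSketch {
  val assigned = new CountDownLatch(1)

  // Run in the consumer thread of the test.
  def subscribeAndPoll(consumer: KafkaConsumer[String, String]): Unit = {
    consumer.subscribe(Collections.singletonList("test-topic"), new ConsumerRebalanceListener {
      override def onPartitionsRevoked(partitions: java.util.Collection[TopicPartition]): Unit = ()
      override def onPartitionsAssigned(partitions: java.util.Collection[TopicPartition]): Unit =
        assigned.countDown()
    })
    while (!Thread.currentThread.isInterrupted)
      consumer.poll(Duration.ofMillis(100))
  }

  // Run in the main test thread, before describing the group.
  def awaitAssignment(): Unit =
    assert(assigned.await(30, TimeUnit.SECONDS), "consumer assignment did not take effect")
}
```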
Author: huxihx <huxi_2b@hotmail.com>
Author: huxi <huxi_2b@hotmail.com>
Author: Manikumar Reddy <manikumar.reddy@gmail.com>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #6441 from huxihx/KAFKA-8098
This patch adds additional DEBUG statements in AbstractIndex.scala, OffsetIndex.scala, and TimeIndex.scala. It also changes the logging on append from DEBUG to TRACE to make DEBUG logging less disruptive, and it ensures that exceptions raised from index classes include file/offset information.
Reviewers: Jason Gustafson <jason@confluent.io>
Shut down the session expiry thread prior to closing the ZooKeeper client to ensure that new clients are not created by the expiry thread and left active after ZooKeeperClient.close() returns.
Reviewers: Ismael Juma <ismael@juma.me.uk>
We verify that ZK clients are closed in tests since leftover clients can affect subsequent tests, which makes test failures hard to debug. But because of changes to the ZooKeeper client, we were checking the wrong thread name. The thread name used now is <creatorThreadName>-EventThread, where creatorThreadName varies depending on the test. Fixed ZooKeeperTestHarness to check this format and fixed tests that were leaving ZK clients behind. Also added a test to make sure we can detect changes to the thread name when we update ZK clients in the future.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy <manikumar.reddy@gmail.com>
https://issues.apache.org/jira/browse/KAFKA-7813
Running the JMX tool without the --object-name parameter results in a NullPointerException.
Author: huxihx <huxi_2b@hotmail.com>
Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>
Closes #6139 from huxihx/KAFKA-7813
The use of consumer.poll() made the test flaky since, in some cases, it does not wait for the coordinator connection.
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Adds a new listener config `max.connections` to limit the number of active connections on each listener. The config may be prefixed with the listener prefix. This limit may be dynamically reconfigured without restarting the broker.
This is one of the PRs for KIP-402 (https://cwiki.apache.org/confluence/display/KAFKA/KIP-402%3A+Improve+fairness+in+SocketServer+processors). Note that this is currently built on top of PR #6022
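A hedged configuration example (the listener-prefixed property name shown here is an assumption based on KIP-402): a broker-wide cap plus a tighter override for one listener.

```scala
import java.util.Properties

object ConnectionLimitConfigSketch {
  def brokerProps(): Properties = {
    val props = new Properties()
    props.put("max.connections", "1000")                       // default cap for every listener
    props.put("listener.name.internal.max.connections", "100") // override for the INTERNAL listener
    props
  }
}
```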
Author: Rajini Sivaram <rajinisivaram@googlemail.com>
Reviewers: Gwen Shapira <cshapi@gmail.com>
Closes #6034 from rajinisivaram/KAFKA-7730-max-connections
Just a minor cleanup to use Java 8 lambdas instead of anonymous classes in this test.
I ran all tests in the streams test suite.
Reviewers: Matthias J. Sax <mjsax@apache.org>, Guozhang Wang <wangguoz@gmail.com>
When unclean leader election is enabled dynamically on brokers, we notify the controller of the update before updating KafkaConfig. When processing this event, the controller's decision to elect unclean leaders is based on the current KafkaConfig, so there is a small timing window during which the controller may not elect an unclean leader because the server's KafkaConfig has not yet been updated. This commit fixes the timing window by using the existing BrokerReconfigurable interface used by other classes that rely on the current value of KafkaConfig.
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
To stay consistent with the way it handles the offsets topic, TopicCommand should not be able to alter the transaction topic's partition count.
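A hedged sketch of the guard (method name hypothetical), mirroring the existing offsets-topic behaviour:

```scala
import org.apache.kafka.common.internals.Topic

object TopicCommandGuardSketch {
  // Refuse to change the partition count of the internal transaction state topic.
  def ensurePartitionCountNotAltered(topic: String): Unit = {
    if (topic == Topic.TRANSACTION_STATE_TOPIC_NAME)
      throw new IllegalArgumentException(
        s"The number of partitions for the internal topic $topic cannot be changed.")
  }
}
```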
Author: huxihx <huxi_2b@hotmail.com>
Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #6109 from huxihx/KAFKA-7801
DynamicBrokerReconfigurationTest.testAddRemoveSaslListeners removes a listener, waits for the config to be propagated to all brokers and then validates that connections to the removed listener fail. But there is a small timing window between the config update and Processor shutdown. Before validating that connections to a removed listener fail, this commit waits for all metrics of the removed listener to be deleted, ensuring that the listener's Processors have been shut down.
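A hedged sketch of the added wait (the metric tag name and timeout are assumptions): poll until no metric tagged with the removed listener remains, which implies its Processors have shut down.

```scala
import scala.jdk.CollectionConverters._
import org.apache.kafka.common.metrics.Metrics

object ListenerMetricsWaitSketch {
  def waitForListenerMetricsRemoved(metrics: Metrics, listenerName: String, timeoutMs: Long = 15000L): Unit = {
    val deadline = System.currentTimeMillis() + timeoutMs
    def stillPresent: Boolean =
      metrics.metrics.keySet.asScala.exists(_.tags.asScala.get("listener").contains(listenerName))
    while (stillPresent) {
      if (System.currentTimeMillis() > deadline)
        throw new AssertionError(s"Metrics for listener $listenerName were not removed in time")
      Thread.sleep(100)
    }
  }
}
```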
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Handle OptionException while parsing options in the console producer and print usage before exiting.
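A hedged sketch of the fix (helper name hypothetical): catch OptionException during parsing, print the error and the usage text, and exit instead of dying with a raw stack trace.

```scala
import joptsimple.{OptionException, OptionParser, OptionSet}

object ParseOrExitSketch {
  def parseOrExit(parser: OptionParser, args: Array[String]): OptionSet = {
    try parser.parse(args: _*)
    catch {
      case e: OptionException =>
        System.err.println(e.getMessage)
        parser.printHelpOn(System.err)   // print usage before exiting
        sys.exit(1)
    }
  }
}
```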
Author: Suman BN <sumannewton@gmail.com>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #6386 from sumannewton/console-producer-parse-printusage
After the 2.1 release, if the broker hasn't been upgraded to the latest inter-broker protocol version, committed offsets stored in the __consumer_offsets topic get cleaned up much earlier than they should be when the offsets are loaded back from the __consumer_offsets topic in GroupCoordinator, which happens during a leadership transition or after a broker bounce. This patch fixes the bug by setting expireTimestamp to None if it is the default value after loading v1 offset records from __consumer_offsets.
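An illustrative sketch only (names simplified from the real GroupMetadataManager logic): when a v1 offset record carries the default/sentinel expire timestamp, store None so the offset falls under the retention-based expiration rather than expiring almost immediately.

```scala
object OffsetExpireTimestampSketch {
  def expireTimestampFromV1(rawExpireTimestamp: Long, defaultExpireTimestamp: Long): Option[Long] =
    if (rawExpireTimestamp == defaultExpireTimestamp) None  // no explicit expiration was requested
    else Some(rawExpireTimestamp)
}
```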
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
The test currently checks that there were at least 5 polls when 5 connections are established with connectionQueueSize=1. But the check could run just after the 5th connection and before the 5th poll, so the check was updated to verify that there were at least 4 polls.
Reviewers: Ismael Juma <ismael@juma.me.uk>
Metadata may be updated from the background thread, so we need to protect access to SubscriptionState. This patch restructures the metadata handling so that we only check pattern subscriptions in the foreground. Additionally, it improves the following:
1. SubscriptionState is now the source of truth for the topics that will be fetched. We previously had a lot of messy logic to try to keep the topic set in Metadata consistent with the subscription, so this simplifies the logic.
2. The metadata needs for the producer and consumer are quite different, so it made sense to separate the custom logic into separate extensions of Metadata. For example, only the producer requires topic expiration.
3. We've always had an edge case in which a metadata change with an in-flight request may cause us to effectively miss an expected update. This patch implements a separate version inside Metadata which is bumped when the set of needed topics changes (see the sketch after this list).
4. This patch removes the MetadataListener, which was the cause of https://issues.apache.org/jira/browse/KAFKA-7764.
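A conceptual sketch of the version described in point 3 (all names are hypothetical): a counter separate from the cluster metadata version, bumped whenever the set of needed topics changes, so a response to an in-flight request for the old set cannot be mistaken for an update that satisfies the new need.

```scala
object MetadataNeedVersionSketch {
  private var requestVersion = 0
  private var neededTopics = Set.empty[String]

  def requestUpdateForTopics(topics: Set[String]): Unit = synchronized {
    if (topics != neededTopics) {
      neededTopics = topics
      requestVersion += 1
    }
  }

  // Tag an outgoing metadata request with the current version.
  def versionForNewRequest(): Int = synchronized { requestVersion }

  // A response only satisfies the current need if no newer need arrived in the meantime.
  def responseSatisfiesNeed(versionAtSend: Int): Boolean = synchronized {
    versionAtSend == requestVersion
  }
}
```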
Reviewers: David Arthur <mumrah@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>
Added additional description for the "time" parameter of GetOffsetShell, noting that no offset is returned if the provided timestamp is greater than the timestamp of the most recently committed record.
Author: KartikVK <karthikkalaghatgi123@gmail.com>
Reviewers: huxi <huxi_2b@hotmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #6357 from Kartikvk1996/kartik-branch
Disable forceSync in EmbeddedZookeeper.
Increase ZK tick to allow longer maxSessionTimeout in tests.
Increase ZK client session timeout in tests.
Handle transient ZK session expiration exception in test utils for createTopic.
Author: Jun Rao <junrao@gmail.com>
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes #6354 from junrao/KAFKA-8018
The stack trace generated from the test failure shows that the test failed even though threads were runnable and making progress, indicating that the timeout may be too small when the test machine is slow. Increased the timeout from 10 to 15 seconds, consistent with the default wait in other tests. The thread dump also showed a lot of leftover threads from other tests, so cleanup of those was added as well.
Reviewers: Ismael Juma <ismael@juma.me.uk>
Port 22 is used by ssh, which causes the AdminClient to throw an OutOfMemoryError:
> java.lang.OutOfMemoryError: Java heap space
> at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
> at java.nio.ByteBuffer.allocate(ByteBuffer.java:335)
> at org.apache.kafka.common.memory.MemoryPool$1.tryAllocate(MemoryPool.java:30)
> at org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:112)
> at org.apache.kafka.common.network.KafkaChannel.receive(KafkaChannel.java:424)
> at org.apache.kafka.common.network.KafkaChannel.read(KafkaChannel.java:385)
> at org.apache.kafka.common.network.Selector.attemptRead(Selector.java:640)
> at org.apache.kafka.common.network.Selector.pollSelectionKeys(Selector.java:561)
> at org.apache.kafka.common.network.Selector.poll(Selector.java:472)
> at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:535)
> at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.run(KafkaAdminClient.java:1140)
> at java.lang.Thread.run(Thread.java:748)
Author: Manikumar Reddy <manikumar.reddy@gmail.com>
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes #6360 from omkreddy/KAFKA-7312
This patch fixes a bug in log dir reassignment where Partition.maybeReplaceCurrentWithFutureReplica would compare the entire LogEndOffsetMetadata of each replica to determine whether the reassignment has completed. If the active segments of the two replicas have different base offsets (for example, if the current replica had previously been cleaned and the future replica rolled segments at different points), the reassignment will never complete. The fix is to compare only the LogEndOffsetMetadata.messageOffset of each replica. Tested with a unit test that simulates the compacted current replica case.
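A simplified sketch of the corrected check (the type is reduced to the relevant fields): compare only messageOffset, since the full metadata also carries the active segment's base offset, which can legitimately differ between the two replicas indefinitely.

```scala
object LogDirCatchUpSketch {
  case class LogEndOffsetMetadata(messageOffset: Long, segmentBaseOffset: Long, relativePositionInSegment: Int)

  def futureReplicaCaughtUp(current: LogEndOffsetMetadata, future: LogEndOffsetMetadata): Boolean =
    future.messageOffset == current.messageOffset   // previously: future == current
}
```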
Reviewers: Anna Povzner <anna@confluent.io>, Jason Gustafson <jason@confluent.io>
This patch fixes a regression in the replica fetcher that occurs when the replica fetcher manager simultaneously calls `removeFetcherForPartitions`, removing the corresponding partitionStates, while a replica fetcher thread attempts to truncate the same partition(s) in `truncateToHighWatermark`. This causes an NPE which crashes the fetcher.
This change simply checks that the `partitionState` is not null first. Note that a similar guard exists in `truncateToEpochEndOffsets`.
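A minimal illustration of the guard (the state type is simplified): a partition whose state was concurrently removed by `removeFetcherForPartitions` yields null here, so it is skipped instead of being dereferenced and crashing the fetcher thread.

```scala
import org.apache.kafka.common.TopicPartition

object TruncateGuardSketch {
  case class PartitionFetchState(fetchOffset: Long, highWatermark: Long)

  def truncateToHighWatermark(partitions: Iterable[TopicPartition],
                              stateOf: TopicPartition => PartitionFetchState): Unit = {
    partitions.foreach { tp =>
      val state = stateOf(tp)
      if (state != null) {
        // truncate the local log for tp to state.highWatermark ...
      }
    }
  }
}
```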
Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jason Gustafson <jason@confluent.io>
This patch adds several new log messages to provide more information about errors during log dir movement and to make it clear when each partition movement is finished.
Reviewers: Jason Gustafson <jason@confluent.io>
In some test cases it's desirable to instantiate a subclass of `ShutdownableThread` without starting it. Since most subclasses of `ShutdownableThread` put cleanup logic in `ShutdownableThread.shutdown()`, being able to call `shutdown()` on the non-running thread would be useful.
This change allows us to avoid blocking in `ShutdownableThread.shutdown()` if the thread's `run()` method has not been called. We also add a check that `initiateShutdown()` was called before `awaitShutdown()`, to protect against the case where a user calls `awaitShutdown()` before the thread has been started and, unexpectedly, does not block until the thread shuts down.
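A heavily simplified sketch of the behaviour described above (this is not the real class): shutdown() must not block when the thread was never started, and awaitShutdown() refuses to wait unless initiateShutdown() was called first.

```scala
import java.util.concurrent.CountDownLatch

abstract class ShutdownableThreadSketch(name: String) extends Thread(name) {
  @volatile private var started = false
  @volatile private var shutdownInitiated = false
  private val shutdownComplete = new CountDownLatch(1)

  def doWork(): Unit

  override def start(): Unit = { started = true; super.start() }

  override def run(): Unit =
    try { while (!shutdownInitiated) doWork() }
    finally shutdownComplete.countDown()

  def initiateShutdown(): Unit = shutdownInitiated = true

  def awaitShutdown(): Unit = {
    if (!shutdownInitiated)
      throw new IllegalStateException("initiateShutdown() must be called before awaitShutdown()")
    if (started) shutdownComplete.await()   // don't block if run() was never executed
  }

  def shutdown(): Unit = { initiateShutdown(); awaitShutdown() }
}
```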
Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Jun Rao <junrao@gmail.com>
In order to debug problems with log directory reassignments, it is helpful to know when the fetcher thread begins moving a particular partition. This patch refactors the fetch logic so that we stick to a selected partition as long as it is available and log a message when a different partition is selected.
Reviewers: Viktor Somogyi-Vass <viktorsomogyi@gmail.com>, Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>
Currently, commitTransaction and abortTransaction wait indefinitely for the respective operation to be completed. This patch uses the producer's max block time to limit the time that we will wait. If the timeout elapses, we raise a TimeoutException, which allows the user to either close the producer or retry the operation.
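A usage sketch (addresses, ids, and topic name are placeholders): with this change the commit is bounded by max.block.ms, so the caller can catch TimeoutException and decide whether to retry the commit or close the producer.

```scala
import java.util.Properties
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}
import org.apache.kafka.common.errors.TimeoutException
import org.apache.kafka.common.serialization.StringSerializer

object BoundedCommitSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "localhost:9092")
    props.put("transactional.id", "example-txn")
    props.put("max.block.ms", "10000")
    props.put("key.serializer", classOf[StringSerializer].getName)
    props.put("value.serializer", classOf[StringSerializer].getName)

    val producer = new KafkaProducer[String, String](props)
    producer.initTransactions()
    producer.beginTransaction()
    producer.send(new ProducerRecord("example-topic", "key", "value"))
    try producer.commitTransaction()
    catch {
      case _: TimeoutException =>
        // The commit may still complete on the broker; either retry commitTransaction()
        // or close the producer and let the transaction be aborted.
        producer.close()
    }
  }
}
```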
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>
Use of `MetadataRequest.isAllTopics` is not consistently defined for all versions of the API. For v0, it evaluates to false. This patch makes the behavior consistent for all versions.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Per the KIP-263 discussion, we think we can improve broker restart time by avoiding performing costly disk operations when sanity checking index files for segments below recovery point on broker startup.
This PR includes the following changes:
1. Mmap the index file and populate its fields on demand, rather than performing costly disk operations when creating the index object on broker startup (see the sketch after this list).
2. Skip sanity checks on the time index and offset index of segments.
   1. For segments with offsets below the flushed point (recovery point), these segments have already been safely flushed, so we don't need to sanity check the index files. If there is indeed data corruption on disk, then given that we don't sanity check the segment file, sanity checking only the indexes adds little benefit.
   2. For segments with offsets above the flushed point (recovery point), we will recover these segments in `recoverLog()` (Log.scala) in any case, so sanity checking their index files is redundant.
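A conceptual sketch of the on-demand loading from point 1 (these are not Kafka's actual index classes): the mmap and derived fields are populated lazily on first access rather than eagerly for every segment at startup.

```scala
import java.io.File
import java.nio.MappedByteBuffer
import java.nio.channels.FileChannel
import java.nio.file.StandardOpenOption

object LazyIndexSketch {
  final class LazyMappedIndex(file: File) {
    lazy val mmap: MappedByteBuffer = {
      val channel = FileChannel.open(file.toPath, StandardOpenOption.READ, StandardOpenOption.WRITE)
      try channel.map(FileChannel.MapMode.READ_WRITE, 0, file.length())
      finally channel.close()
    }
    // Assumes 8-byte offset-index entries (4-byte relative offset + 4-byte position).
    lazy val entryCount: Int = mmap.limit() / 8
  }
}
```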
We did experiments on a cluster with 15 brokers, each of which has ~3k segments (and there are 31.8k partitions with RF=3 which are evenly distributed across brokers; total bytes-in-rate is around 400 MBps). The results show that rolling bounce time reduces from 135 minutes to 55 minutes.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Fail produce requests using zstd until the inter.broker.protocol.version is high enough that replicas are guaranteed to support it. Otherwise, followers receive `UNSUPPORTED_COMPRESSION_TYPE` errors when fetching zstd-compressed data and ISRs shrink.
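An illustrative sketch of the gate (version handling reduced to an ordinal): reject zstd-compressed produce requests until the configured inter.broker.protocol.version guarantees follower support.

```scala
import org.apache.kafka.common.errors.UnsupportedCompressionTypeException

object ZstdGateSketch {
  val MinZstdInterBrokerProtocol = 21 // stand-in ordinal for version 2.1

  def validateProduceCompression(codec: String, interBrokerProtocolOrdinal: Int): Unit = {
    if (codec == "zstd" && interBrokerProtocolOrdinal < MinZstdInterBrokerProtocol)
      throw new UnsupportedCompressionTypeException(
        "Produce requests with zstd compression are not allowed until inter.broker.protocol.version is at least 2.1")
  }
}
```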
Reviewers: Jason Gustafson <jason@confluent.io>