src-kafka

Commit Graph

Author	SHA1	Message	Date
Jason Gustafson	6c2e7005ba	MINOR: Remove unused IteratorTemplate (#5903 ) There seems to be no reason to keep this around since it is not used outside of testing and AbstractIterator is basically the same thing. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
huxi	3eaf44ba8e	KAFKA-7557: optimize LogManager.truncateFullyAndStartAt() (#5848 ) Instead of calling deleteSnapshotsAfterRecoveryPointCheckpoint for allLogs, invoking it only for the logs being truncated. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	6 years ago
Ismael Juma	af2e6fb548	MINOR: Update zstd, easymock, powermock, zkclient and build plugins (#5846 ) EasyMock 4.0.x includes a change that relies on the caller for inferring the return type of mock creator methods. Updated a number of Scala tests for compilation and execution to succeed. The versions of EasyMock and PowerMock in this PR include full support for Java 11. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Jonathan Santilli	eb3335ef59	KAFKA-7165: Retry the BrokerInfo registration into ZooKeeper (#5575 ) * Add logic to retry the BrokerInfo registration into ZooKeeper. In case the ZooKeeper session has been regenerated and the broker tries to register the BrokerInfo into ZooKeeper, this code deletes the current BrokerInfo from ZooKeeper and creates it again, but only if the znode ephemeral owner belongs to the broker which tries to register itself again into ZooKeeper * Add test to validate the BrokerInfo re-registration into ZooKeeper	6 years ago
Zhanxiang (Patrick) Huang	7b5ffa0a07	KAFKA-7537: Avoid sending full UpdateMetadataRequest to existing brokers in the cluster on broker changes to reduce controller memory footprint (#5869 ) This PR avoids sending out full UpdateMetadataRequest in the following scenarios: 1. On broker startup, send out full UpdateMetadataRequest to newly added brokers and only send out UpdateMetadataRequest with empty partition states to existing brokers. 2. On broker failure, if it doesn't require leader election, only include the states of partitions that are hosted by the dead broker(s) in the UpdateMetadataRequest instead of including all partition states. This PR also introduces a minor optimization in the MetadataCache update to avoid copying the previous partition states upon receiving UpdateMetadataRequest with no partition states. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Dong Lin	6f83d05131	KAFKA-7313; StopReplicaRequest should attempt to remove future replica for the partition only if future replica exists This patch fixes two issues: 1) Currently if a broker receives StopReplicaRequest with delete=true twice for the same offline replica, the first StopReplicaRequest will show KafkaStorageException and the second StopReplicaRequest will show ReplicaNotAvailableException. This is because the first StopReplicaRequest will remove the mapping (tp -> ReplicaManager.OfflinePartition) from ReplicaManager.allPartitions before returning KafkaStorageException, thus the second StopReplicaRequest will not find this partition as offline. This result appears to be inconsistent. And since the replica is already offline and the broker will not be able to delete files for this replica, the StopReplicaRequest should fail without making any change and the broker should still remember that this replica is offline. 2) Currently if the broker receives StopReplicaRequest with delete=true, the broker will attempt to remove the future replica for the partition, which will cause KafkaStorageException in the StopReplicaResponse if this replica does not have a future replica. It is problematic to always return KafkaStorageException in the response if the future replica does not exist. Author: Dong Lin <lindong28@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #5533 from lindong28/KAFKA-7313	6 years ago
Jason Gustafson	fc1dc358ee	KAFKA-7568; Return leader epoch in ListOffsets response (#5855 ) As part of KIP-320, the ListOffsets API should return the leader epoch of any fetched offset. We either get this epoch from the log itself for a timestamp query or from the epoch cache if we are searching the earliest or latest offset in the log. When handling queries for the latest offset, we have elected to choose the current leader epoch, which is consistent with other handling (e.g. OffsetsForTimes). Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Manikumar Reddy O	b2c67e4b9a	MINOR: Add try/finally blocks to close adminclient in DelegationTokenEndToEndAuthorizationTest (#5861 )	6 years ago
Jason Gustafson	8065a0bef4	MINOR: Fix a few blocking calls in PlaintextConsumerTest (#5859 ) We've been seeing some hanging builds recently (see KAFKA-7553). Consistently the culprit seems to be a test case in PlaintextConsumerTest. This patch doesn't fix the underlying issue, but it eliminates a few places where these test cases could block: 1. It replaces several calls to the deprecated `poll(long)` which can block indefinitely in the worst case in order to join the group with `poll(Duration)` which respects the timeout. 2. It also fixes a consume utility in `TestUtils` which can block for a long time depending on the number of records that are expected to be consumed. Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin Patrick McCabe <colin@cmccabe.xyz>	6 years ago
Stanislav Kozlovski	fc1eea127a	MINOR: Improve ReplicationQuotasTest#shouldThrottleOldSegments resiliency (#5849 ) I've seen this test fail with ``` java.lang.AssertionError: Throttled replication of 6352ms should be < 6000ms ``` A contributing factor is that it starts counting the time it took for replication before the replication itself has started. `createServer()` initializes ZK and other systems before it starts up the replication thread. I ran the test 25 times locally both ways. Average `throttledTook` before the change: 5341.75 Mean `throttledTook` after the change: 5256.92 Note that those are the results from `./gradlew core:test --tests kafka.server.ReplicationQuotasTest.shouldThrottleOldSegments`. I've noticed that if I run the whole test class `ReplicationQuotasTest`, the `throttledTook` is close ~4100. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Jason Gustafson	d71cb54672	KAFKA-7567; Clean up internal metadata usage for consistency and extensibility (#5813 ) This patch makes two improvements to internal metadata handling logic and testing: 1. It reduces dependence on the public object `Cluster` for internal metadata propagation since it is not easy to evolve. As an example, we need to propagate leader epochs from the metadata response to `Metadata`, but it is not straightforward to do this without exposing it in `PartitionInfo` since that is what `Cluster` uses internally. By doing this change, we are able to remove some redundant `Cluster` building logic. 2. We want to make the metadata handling in `MockClient` simpler and more consistent. Currently we have a mix of metadata update mechanisms which are internally inconsistent with each other and do not match the implementation in `NetworkClient`. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Mickael Maison	9a0ea25fee	MINOR: Use string/log interpolation instead of string concat in core and clients (#5850 ) Also removed a few unused imports and tweaked the log message slightly. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
huxi	5d7cb438a5	MINOR: Remove duplicate `subscribe` call in ConsumerPerformance (#5828 ) In the `consume` method, the consumer subscribes to the topic, so there is no need to do the same thing before the method call. Also include minor clean-up in `consume`. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Stanislav Kozlovski	ccfcbfd13f	MINOR: Make log cleaner tests more efficient and less flaky (#5836 ) `testMarksPartitionsAsOfflineAndPopulatesUncleanableMetrics` sometimes fails because the 15 second timeout expires. Inspecting the error message from the build failure, we see that this timeout happens in the writeDups() calls which call roll(). ```text [2018-10-23 15:18:51,018] ERROR Error while flushing log for log-1 in dir /tmp/kafka-8190355063195903574 with offset 74 (kafka.server.LogDirFailureChannel:76) java.nio.channels.ClosedByInterruptException ... at kafka.log.Log.roll(Log.scala:1550) ... at kafka.log.AbstractLogCleanerIntegrationTest.writeDups(AbstractLogCleanerIntegrationTest.scala:132) ... ``` After investigating, I saw that this test would call Log#roll() around 60 times every run. Increasing the segmentSize config to `2048` reduces the number of Log#roll() calls while ensuring that there are multiple rolls still. I saw that most other LogCleaner tests also call roll() ~90 times, so I've changed the default to be `2048`. I've also made the one test which requires a smaller segmentSize to set it via the args. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Ron Dagostino	e8a3bc7425	KAFKA-7352; Allow SASL Connections to Periodically Re-Authenticate (KIP-368) (#5582 ) KIP-368 implementation to enable periodic re-authentication of SASL clients. Also adds a broker configuration option to terminate client connections that do not re-authenticate within the configured interval.	6 years ago
Ismael Juma	51061792ca	MINOR: Remove unused commitSync in ConsoleConsumer (#5845 ) Dead code is confusing. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>	6 years ago
Stanislav Kozlovski	eda4a2904a	KAFKA-7532: Clean-up controller log when shutting down brokers (#5831 ) This line prints out (when empty): ``` [2018-10-23 12:19:59,977] INFO [Controller id=0] Removed ArrayBuffer() from list of shutting down brokers. (kafka.controller.KafkaController) ``` Use `mkString` to eliminate `ArrayBuffer` and only log if not empty. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
lambdaliu	73da591916	KAFKA-7535; KafkaConsumer doesn't report records-lag if isolation.level is read_committed FetchResponse should return the partitionData's lastStableOffset Author: lambdaliu <lambdaliu@tencent.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Dhruvil Shah <dhruvil@confluent.io>, Dong Lin <lindong28@gmail.com> Closes #5835 from lambdaliu/KAFKA-7535	6 years ago
Manikumar Reddy	32e1da570a	KAFKA-5462: Add configuration to build custom SSL principal name (KIP-371) Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Sriharsha Chintalapani <sriharsha@apache.org> Closes #5684 from omkreddy/KAFKA-5462-SSL-Name	6 years ago
Stanislav Kozlovski	d2c870b468	MINOR: Fix flaky assertion in ControllerIntegrationTest (#5829 ) `ControllerIntegrationTest#waitUntilControllerEpoch` sometimes fails with the following error: ``` java.util.NoSuchElementException: None.get at scala.None$.get(Option.scala:347) at scala.None$.get(Option.scala:345) at kafka.controller.ControllerIntegrationTest$$anonfun$waitUntilControllerEpoch$1.apply$mcZ$sp(ControllerIntegrationTest.scala:312) at kafka.utils.TestUtils$.waitUntilTrue(TestUtils.scala:779) at kafka.controller.ControllerIntegrationTest.waitUntilControllerEpoch(ControllerIntegrationTest.scala:312) at kafka.controller.ControllerIntegrationTest.testEmptyCluster(ControllerIntegrationTest.scala:51) ``` We should retry until the value is defined or it times out. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Bridger Howell	ed94cf4d31	KAFKA-7519 Clear pending transaction state when expiration fails (#5820 ) Make sure that the transaction state is properly cleared when the `transactionalId-expiration` task fails. Operations on that transactional id would otherwise return a `CONCURRENT_TRANSACTIONS` error and appear "untouchable" to transaction state changes, preventing transactional producers from operating until a broker restart or transaction coordinator change. Unit tested by verifying that having the `transactionalId-expiration` task won't leave the transaction metadata in a pending state if the replica manager returns an error. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
John Eismeier	83c3996974	MINOR: Fix some typos Just a doc change Author: John Eismeier <john.eismeier@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #4573 from jeis2497052/trunk	6 years ago
Dhruvil Shah	9e088eb120	MINOR: Remove redundant try block in LogCleaner (#5776 ) Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Zhanxiang (Patrick) Huang	928f45f61f	KAFKA-7464; catch exceptions in "leaderEndpoint.close()" when shutting down ReplicaFetcherThread After KAFKA-6051, we close leaderEndPoint in replica fetcher thread initiateShutdown to try to preempt in-progress fetch request and accelerate replica fetcher thread shutdown. However, leaderEndpoint can throw an Exception when the replica fetcher thread is still actively fetching, which can cause ReplicaManager to fail to shut down cleanly. This PR catches the exceptions thrown in "leaderEndpoint.close()" instead of letting them propagate up the call stack. Author: Zhanxiang (Patrick) Huang <hzxa21@hotmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5808 from hzxa21/KAFKA-7464	6 years ago
Jason Gustafson	1e92b70306	MINOR: Ensure initial topic configs and updates are logged This patch adds logging of topic config overrides during creation or during the handling of alter config requests. Also did some minor cleanup to avoid redundant validation logic when adding partitions. Author: Jason Gustafson <jason@confluent.io> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #5812 from hachikuji/minor-log-topic-creation-configs	6 years ago
Suman	5681309094	KAFKA-6764: Improve the whitelist command-line option for console-consumer (#5637 )	6 years ago
Rajini Sivaram	4c602e6130	KAFKA-7498: Remove references from `common.requests` to `clients` (#5784 ) Add CreatePartitionsRequest.PartitionDetails similar to CreateTopicsRequest.TopicDetails to avoid references from `common.requests` package to `clients`. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
jonathanskrzypek	a947fe8da8	KAFKA-6195: Resolve DNS aliases in bootstrap.server (KIP-235) (#4485 ) Adds `client.dns.lookup=resolve_canonical_bootstrap_servers_only` option to perform full dns resolution of bootstrap addresses Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Sriharsha Chintalapani <sriharsha@apache.org>, Edoardo Comar <ecomar@uk.ibm.com>, Mickael Maison <mickael.maison@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	6f5e37347f	KAFKA-7485: Wait for truststore update request to complete in test (#5791 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jun Rao	aaf8e02403	KAFKA-7482: LeaderAndIsrRequest should be sent to the shutting down broker (#5745 ) Reviewers: Dong Lin <lindong28@gmail.com>	6 years ago
Edoardo Comar	f393b2f7dd	KAFKA-6863 Kafka clients should try to use multiple DNS resolved IP (#4987 ) Implementation of KIP-302: Based on the new client configuration `client.dns.lookup`, a NetworkClient can use InetAddress.getAllByName to find all IPs and iterate over them when they fail to connect. Only uses either IPv4 or IPv6 addresses similar to the default mode. Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com> Co-authored-by: Mickael Maison <mickael.maison@gmail.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Ismael Juma	8d52b7ee0b	MINOR: AbstractIndex.close should unmap (#5757 ) Reviewers: Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	6 years ago
Ismael Juma	adb3a950ee	MINOR: Fix remaining core, connect and clients tests to pass with Java 11 (#5771 ) - SslFactoryTest should use SslFactory to create SSLEngine - Use Mockito instead of EasyMock in `ConsoleConsumerTest` as one of the tests mocks a standard library class and the latest released EasyMock version can't do that when Java 11 is used. - Avoid mocking `ConcurrentMap` in `SourceTaskOffsetCommitterTest` for similar reasons. As it happens, mocking is not actually needed here. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Manikumar Reddy O	34f029e3dc	MINOR: Fix broken standalone ReplicationQuotasTestRig test (#5773 ) * Fix `ZkUtils.getReassignmentJson` to pass Java map to `Json.encodeAsString` * Allow new file creation in ReplicationQuotasTestRig test Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Lee Dongjin	741cb761c5	KAFKA-4514; Add Codec for ZStandard Compression (#2267 ) This patch adds support for zstandard compression to Kafka as documented in KIP-110: https://cwiki.apache.org/confluence/display/KAFKA/KIP-110%3A+Add+Codec+for+ZStandard+Compression. Reviewers: Ivan Babrou <ibobrik@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Manikumar Reddy O	0848b78881	KAFKA-7366: Make topic configs segment.bytes and segment.ms to take effect immediately (#5728 ) Reviewers: Ismael Juma <ismael@juma.me.uk> and Jun Rao <junrao@gmail.com>	6 years ago
Gardner Vickers	6165b43744	MINOR: Fix LogDirFailureTest flake Ensure that `TestUtils.waitUntilTrue(..)` is blocked on both send completed and a new leader being assigned Author: Gardner Vickers <gardner@vickers.me> Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Dong Lin <lindong28@gmail.com> Closes #5695 from gardnervickers/log-dir-failure-test-fix	6 years ago
Stanislav Kozlovski	13379af17d	KAFKA-7215: Improve LogCleaner Error Handling (#5439 ) The thread no longer dies. When encountering an unexpected error, it marks the partition as "uncleanable" which means it will not try to clean its logs in subsequent runs. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	5e9208fc05	HOTFIX: Compilation error in GroupMetadataManagerTest (#5752 ) Accidentally broke after merging KAFKA-7395 which had not been updated for #5727. Reviewers: Matthias J. Sax <matthias@confluent.io>	6 years ago
Jason Gustafson	ed3bd79633	KAFKA-7395; Add fencing to replication protocol (KIP-320) (#5661 ) This patch contains the broker-side support for the fencing improvements from KIP-320. This includes the leader epoch validation in the ListOffsets, OffsetsForLeaderEpoch, and Fetch APIs as well as the changes needed in the fetcher threads to maintain and use the current leader epoch. The client changes from KIP-320 will be left for a follow-up. One notable change worth mentioning is that we now require the read lock in `Partition` in order to read from the log or to query offsets. This is necessary to ensure the safety of the leader epoch validation. Additionally, we forward all leader epoch changes to the replica fetcher thread and go through the truncation phase. This is needed to ensure the fetcher always has the latest epoch and to guarantee that we cannot miss needed truncation if we missed an epoch change. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Bob Barrett	79757c90df	KAFKA-7467; NoSuchElementException is raised because controlBatch is empty (#5727 ) This patch adds checks before reading the first record of a control batch. If the batch is empty, it is treated as having already been cleaned. In the case of LogCleaner this means it is safe to discard. In the case of ProducerStateManager it means it shouldn't cause state to be stored because the relevant transaction has already been cleaned. In the case of Fetcher, it just preempts the check for an abort. In the case of GroupMetadataManager, it doesn't process the offset as a commit. The patch also adds isControl to the output of DumpLogSegments. Changes were tested with unit tests, except the DumpLogSegments change which was tested manually.	6 years ago
Jason Gustafson	62f9b64f11	MINOR: Ensure consumers are closed in DynamicBrokerReconfigurationTest (#5750 ) In `ConsumerBuilder.build`, if `awaitInitialPositions` raises an exception, the consumer will not be closed properly. We should add the consumer instance to the `consumers` collection immediately after construction. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Xiongqi Wesley Wu	7ea0655711	KAFKA-7441; Allow LogCleanerManager.resumeCleaning() to be used concurrently Author: Xiongqi Wesley Wu <xiongqi.wu@gmail.com> Reviewers: Dong Lin <lindong28@gmail.com> Closes #5694 from xiowu0/fixrace2	6 years ago
Jason Gustafson	f2dd6aa269	KAFKA-7415; Persist leader epoch and start offset on becoming a leader (#5678 ) This patch ensures that the leader epoch cache is updated when a broker becomes leader with the latest epoch and the log end offset as its starting offset. This guarantees that the leader will be able to provide the right truncation point even if the follower has data from leader epochs which the leader itself does not have. This situation can occur when there are back to back leader elections. Additionally, we have made the following changes: 1. The leader epoch cache enforces monotonically increasing epochs and starting offsets among its entries. Whenever a new entry is appended which violates this requirement, we remove the conflicting entries from the cache. 2. Previously we returned an unknown epoch and offset if an epoch is queried which comes before the first entry in the cache. Now we return the requested epoch together with the start offset of the earliest entry in the cache. For example, if the earliest entry in the cache is (epoch=5, startOffset=10), then a query for epoch 4 will return (epoch=4, endOffset=10). This ensures that followers (and consumers in KIP-320) can always determine where the correct starting point is for the active log range on the leader. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Lincong Li	260b07a6da	KAFKA-7196; Remove heartbeat delayed operation for those removed consumers at the end of each rebalance During the consumer group rebalance, when the joining group phase finishes, the heartbeat delayed operation of the consumer that fails to rejoin the group should be removed from the purgatory. Otherwise, even though the member ID of the consumer has been removed from the group, its heartbeat delayed operation is still registered in the purgatory, so it will eventually time out and trigger another unnecessary rebalance. Author: Lincong Li <lcli@linkedin.com> Reviewers: Dong Lin <lindong28@gmail.com> Closes #5556 from Lincong/remove_heartbeat_delayedOperation	6 years ago
Rajini Sivaram	8fb5e63aa8	KAFKA-7429: Enable key/truststore update with same filename/password (#5699 )	6 years ago
Zhanxiang (Patrick) Huang	b35e97125f	KAFKA-7459: Use thread-safe Pool for RequestMetrics.requestRateInternal (#5717 ) As part of KAFKA-6514, the `apiVersion` tag was added to the `RequestsPerSec` metric. A thread unsafe `HashMap` was used in the implementation even though it can be accessed by multiple threads. Fix it by replacing it with the thread-safe `Pool`. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
huxi	70d90c3718	KAFKA-7409; Validate message format version before creating topics or altering configs (#5651 ) Values for `message.format.version` and `log.message.format.version` should be verified before topic creation or config change. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	9f7267dd2f	KAFKA-7437; Persist leader epoch in offset commit metadata (#5689 ) This commit implements the changes described in KIP-320 for the persistence of leader epoch information in the offset commit protocol. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Vahid Hashemian	2933f21374	KAFKA-7403; Use default timestamp if no expire timestamp set in offset commit value (#5690 ) This fixes a regression caused by KAFKA-4682 (KIP-211) which caused offset commit failures after upgrading from an older version which used the v1 inter-broker format.	6 years ago
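
The leader epoch cache rules described in the KAFKA-7415 commit (f2dd6aa269) above can be illustrated with a minimal sketch. This is not the Kafka implementation: the class `LeaderEpochCacheSketch`, its method names, the `{-1, -1}` "unknown" result for an empty cache, and the handling of epochs at or after the first entry are all assumptions made for illustration. Only the two behaviours named in the commit message are taken from the source: conflicting entries are removed on append, and a query for an epoch before the first entry returns the requested epoch with the earliest start offset.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the cache rules from the KAFKA-7415 commit message.
public class LeaderEpochCacheSketch {
    // Each entry is {epoch, startOffset}; kept strictly increasing in both fields.
    private final List<long[]> entries = new ArrayList<>();

    /** Append an entry, first dropping cached entries that conflict with it. */
    public void assign(long epoch, long startOffset) {
        // An entry conflicts if its epoch or start offset is not smaller than the new one.
        entries.removeIf(e -> e[0] >= epoch || e[1] >= startOffset);
        entries.add(new long[] {epoch, startOffset});
    }

    /** Returns {epoch, endOffset} for the requested epoch. */
    public long[] endOffsetFor(long requestedEpoch, long logEndOffset) {
        if (entries.isEmpty()) {
            return new long[] {-1L, -1L}; // "unknown" marker (assumption)
        }
        long[] first = entries.get(0);
        if (requestedEpoch < first[0]) {
            // Rule from the commit: requested epoch with the earliest start offset.
            return new long[] {requestedEpoch, first[1]};
        }
        // Assumed general case: take the largest cached epoch <= requested;
        // it ends where the next entry starts, or at the log end offset.
        for (int i = entries.size() - 1; i >= 0; i--) {
            if (entries.get(i)[0] <= requestedEpoch) {
                long end = (i + 1 < entries.size()) ? entries.get(i + 1)[1] : logEndOffset;
                return new long[] {entries.get(i)[0], end};
            }
        }
        throw new IllegalStateException("unreachable: first entry epoch <= requestedEpoch");
    }

    public int size() {
        return entries.size();
    }

    public static void main(String[] args) {
        LeaderEpochCacheSketch cache = new LeaderEpochCacheSketch();
        cache.assign(5, 10);
        long[] result = cache.endOffsetFor(4, 100);
        // Prints "epoch=4, endOffset=10", matching the example in the commit message.
        System.out.println("epoch=" + result[0] + ", endOffset=" + result[1]);
    }
}
```

In this sketch, appending (epoch=6, startOffset=15) after (epoch=7, startOffset=20) removes the epoch 7 entry, which keeps the list strictly increasing in both fields.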


2454 Commits (dc634f18f7ea2ef24d202d6a2380365754005b60)