src-kafka

Author	SHA1	Message	Date
Mickael Maison	855f899bb5	KAFKA-8256; Replace Heartbeat request/response with automated protocol (#6691 ) Reviewers: Boyang Chen <bchen11@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	6e6dcceb93	KAFKA-8220; Avoid kicking out static group members through rebalance timeout (#6666 ) To make static consumer group members more persistent, we want to avoid kicking out unjoined members through rebalance timeout. Essentially we allow static members to participate in a rebalance using their old subscription without sending a JoinGroup. The only catch is that an unjoined static member might be the current group leader, and we may need to elect a different leader. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	2208f9966d	KAFKA-8354; Replace Sync group request/response with automated protocol (#6729 ) Update SyncGroup API to use the generated protocol classes. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Colin Patrick McCabe	0494cd329f	MINOR: Refactor SslFactory (#6674 ) SslFactory: split the part of SslFactory that creates SSLEngine instances into SslEngineBuilder. When (re)configuring, we simply create a new SslEngineBuilder. This allows us to make all the builder fields immutable. It also simplifies the logic for reconfiguring. Because we sometimes need to test old SslEngine instances against new ones, being able to use both the old and the new builder at once is useful. Create an enum named SslClientAuth which encodes the possible values for ssl.client.auth. This will simplify the handling of this configuration. SslTransportLayer#maybeProcessHandshakeFailure should treat an SSLHandshakeException with a "Received fatal alert" message as a handshake error (and therefore an authentication error.) SslFactoryTest: add some line breaks for very long lines. ConfigCommand#main: when terminating the command due to an uncaught exception, log the exception using debug level in slf4j, in addition to printing it to stderr. This makes it easier to debug failing junit tests, where stderr may not be kept, or may be reordered with respect to other slf4j messages. The use of debug level is consistent with how we handle other types of exceptions in ConfigCommand#main. StateChangeLogMerger#main: spell out the full name of scala.io.Source rather than abbreviating it as io.Source. This makes it clearer that it is part of the Scala standard library. It also avoids compiler errors when other libraries whose groupId starts with "io" are used in the broker. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	050fdd6537	KAFKA-8336; Enable dynamic reconfiguration of broker's client-side certs (#6721 ) Enable reconfiguration of SSL keystores and truststores in client-side channel builders used by brokers for controller, transaction coordinator and replica fetchers. This enables brokers using TLS mutual authentication for inter-broker listener to use short-lived certs that may be updated before expiry without restarting brokers. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Dhruvil Shah	16d4d8cafc	MINOR: Fix flaky ConsumerTopicCreationTest (#6727 ) `ConsumerTopicCreationTest` relied on `KafkaConsumer#poll` to send a `MetadataRequest` within 100ms to verify if a topic is auto created or not. This is brittle and does not guarantee if the request made it to the broker or was processed successfully. This PR fixes the flaky test by adding another topic; we wait until we consume a previously produced record to this topic. This ensures MetadataRequest was processed and we could then check if the topic we're interested in was created or not. Reviewers: Boyang Chen <bchen11@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Xiongqi Wu	b86e8c1ea9	MINOR: add docs for KIP-354 KAFKA-7321 (#6724 ) MINOR: update documentation for the log cleaner max compaction lag feature (KIP-354) implemented in KAFKA-7321 Author: Xiongqi Wu <xiowu@linkedin.com> Reviewer: Joel Koshy <jjkoshy@gmail.com>	6 years ago
Jason Gustafson	e4007a6408	KAFKA-8294; Batch StopReplica requests when possible and improve test coverage (#6642 ) The main problem we are trying to solve here is the batching of StopReplica requests and the lack of test coverage for `ControllerChannelManager`. Addressing the first problem was straightforward, but the second problem required quite a bit of work because of the dependence on `KafkaController` for all of the events. It seemed to make sense to separate the events from the processing of events so that we could remove this dependence and improve testability. With the refactoring, I was able to add test cases covering most of the logic in `ControllerChannelManager` including the generation of requests and the expected response handling logic. Note that I have not actually changed any of the event handling logic in `KafkaController`. While refactoring this logic, I found that the event queue time metric was not being correctly computed. The problem is that many of the controller events were singleton objects which inherited the `enqueueTimeMs` field from the `ControllerEvent` trait. This would never get updated, so queue time would be skewed. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Xiongqi Wu	1fdc853301	KAFKA-7321: Add a Maximum Log Compaction Lag (KIP-354) (#6009 ) KAFKA-7321: Add a Maximum Log Compaction Lag (KIP-354) Records become eligible for compaction after the specified time interval. Author: Xiongqi Wu <xiowu@linkedin.com> Reviewer: Joel Koshy <jjkoshy@gmail.com>	6 years ago
Jason Gustafson	d798dbf497	KAFKA-8335; Clean empty batches when sequence numbers are reused (#6715 ) The log cleaner attempts to preserve the last entry for each producerId in order to ensure that sequence/epoch state is not lost. The current validation checks only the last sequence number for each producerId in order to decide whether a batch should be retained. There are two problems with this: 1. Sequence numbers are not unique alone. It is the tuple of sequence number and epoch which is uniquely defined. 2. The group coordinator always writes batches beginning with sequence number 0, which means there could be many batches which have the same sequence number. The complete fix for the second issue would probably add proper sequence number bookkeeping in the coordinator. For now, we have left the coordinator implementation unchanged and changed the cleaner logic to use the last offset written by a producer instead of the last sequence number. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Kengo Seki	511c8e2c9e	MINOR: Remove unnecessary OptionParser#accepts method call from PreferredReplicaLeaderElectionCommand (#6710 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Bob Barrett	a97e55b838	KAFKA-8332: Refactor ImplicitLinkedHashSet to avoid losing ordering when converting to Scala Because of how conversions between Java collections and Scala collections work, ImplicitLinkedHashMultiSet objects were being treated as unordered in some contexts where they shouldn't be. This broke JOIN_GROUP handling. This patch renames ImplicitLinkedHashMultiSet to ImplicitLinkedHashMultCollection. The order of Collection objects will be preserved when converting to scala. Adding Set and List "views" to the Collection gives us a more elegant way of accessing that functionality when needed. Reviewers: Colin P. McCabe <cmccabe@apache.org>	6 years ago
Ismael Juma	c09e25fac2	MINOR: Fix bug in Struct.equals and use Objects.equals/Long.hashCode (#6680 ) * Fixed bug in Struct.equals where we returned prematurely and added tests * Update RequestResponseTest to check that `equals` and `hashCode` of the struct is the same after serialization/deserialization only when possible. * Use `Objects.equals` and `Long.hashCode` to simplify code * Removed deprecated usages of `JUnitTestSuite` Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Dhruvil Shah	e6cff21fd8	KAFKA-7320; Add consumer configuration to disable auto topic creation [KIP-361] (#5542 ) Implements KIP-361 to provide a consumer configuration to specify whether subscribing or assigning a non-existent topic would result in it being automatically created or not. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Lee Dongjin	4eadaff6b2	MINOR: Remove unused field in `ListenerConnectionQuota` Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Sönke Liebau	2bf153f6a7	KAFKA-8131; Move --version implementation into CommandLineUtils (#6481 ) This patch refactors the implementation of the --version option and moves it into the default command options. This has the benefit of automatically including it in the usage output of the command line tools. Several tools had to be manually updated because they did not use the common options. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Mickael Maison	407bcdf78e	KAFKA-8056; Use automatic RPC generation for FindCoordinator (#6408 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Fangbin Sun	0c62f5e664	KAFKA-7455: Support JmxTool to connect to a secured RMI port. (#5968 ) Reviewers: Attila Sasvari <asasvari@apache.org>, Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
huxi	aaf2345386	MINOR: Fix ThrottledReplicaListValidator doc error. (#6537 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Dhruvil Shah	56b92a5504	KAFKA-8306; Initialize log end offset accurately when start offset is non-zero (#6652 ) This patch ensures that the log end offset of each partition is initialized consistently with the checkpointed log start offset. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	a37282415e	MINOR: Upgrade dependencies for Kafka 2.3 (#6665 ) Many patch and minor updates. Scalatest and Jetty deprecated classes that we use. I removed usages for the former and filed KAFKA-8316 for the latter (I suppressed the relevant deprecation warnings until the JIRA is fixed). As part of the scalatest fixes, I also removed `TestUtils.fail` since it duplicates `Assertions.fail`. I also fixed a few compiler warnings that have crept in since my last sweep. Updates of note: - Jetty: 9.4.14 -> 9.4.18 * https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.15.v20190215 * https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.16.v20190411 * https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.17.v20190418 * https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.17.v20190418 * https://github.com/eclipse/jetty.project/releases/tag/jetty-9.4.18.v20190429 - zstd: 1.3.8-1 -> 1.4.0-1 * https://github.com/facebook/zstd/releases/tag/v1.4.0 * zstd's fastest strategy, 6-8% faster in most scenarios - zookeeper: 3.4.13 -> 3.4.14 * https://zookeeper.apache.org/doc/r3.4.14/releasenotes.html ### Committer Checklist (excluded from commit message) - [ ] Verify design and implementation - [ ] Verify test coverage and CI build status - [ ] Verify documentation (including upgrade notes) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Jason Gustafson	3ba4686d4d	KAFKA-7601; Clear leader epoch cache on downgraded format in append (#6568 ) During a partial message format upgrade, it is possible for the message format to flap between new and old versions. If we detect that data appended to the log is on an old format, we can clear the leader epoch cache so that we revert to truncation by high watermark. Once the upgrade completes and all replicas are on the same format, we will append to the epoch cache as usual. Note this is related to KAFKA-7897, which handles message format downgrades through configuration. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	c34330c548	KAFKA-8248; Ensure time updated before sending transactional request (#6613 ) This patch fixes a bug in the sending of transactional requests. We need to call `KafkaClient.send` with an updated current time. Failing to do so can result in an `IllegalStateExcepton` which leaves the producer effectively dead since the in-flight correlation id has been set, but no request has been sent. To avoid the same problem in the future, we update the in flight correlationId only after sending the request. Reviewers: Matthias J. Sax <matthias@confluent.io>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stanislav Kozlovski	191f2faae0	KAFKA-7992: Introduce start-time-ms metric (#6318 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Dhruvil Shah	b4532a65f7	KAFKA-8134: `linger.ms` must be a long Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin P. McCabe <cmccabe@apache.org>	6 years ago
Boyang Chen	0f995ba6be	KAFKA-7862 & KIP-345 part-one: Add static membership logic to JoinGroup protocol (#6177 ) This is the first diff for the implementation of JoinGroup logic for static membership. The goal of this diff contains: * Add group.instance.id to be unique identifier for consumer instances, provided by end user; Modify group coordinator to accept JoinGroupRequest with/without static membership, refactor the logic for readability and code reusability. * Add client side support for incorporating static membership changes, including new config for group.instance.id, apply stream thread client id by default, and new join group exception handling. * Increase max session timeout to 30 min for more user flexibility if they are inclined to tolerate partial unavailability than burdening rebalance. * Unit tests for each module changes, especially on the group coordinator logic. Crossing the possibilities like: 6.1 Dynamic/Static member 6.2 Known/Unknown member id 6.3 Group stable/unstable 6.4 Leader/Follower The rest of the 345 change will be broken down to 4 separate diffs: * Avoid kicking out members through rebalance.timeout, only do the kick out through session timeout. * Changes around LeaveGroup logic, including version bumping, broker logic, client logic, etc. * Admin client changes to add ability to batch remove static members * Deprecate group.initial.rebalance.delay Reviewers: Liquan Pei <liquanpei@gmail.com>, Stanislav Kozlovski <familyguyuser192@windowslive.com>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Boyang Chen	0c2d829249	KAFKA-7903: automatically generate OffsetCommitRequest (#6583 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>	6 years ago
Shaobo Liu	26a001d133	MINOR: Fix log message error of loadTransactionMetadata (#6571 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Kengo Seki	25ea9246be	MINOR: Remove an unnecessary character from broker's startup log Author: Kengo Seki <sekikn@apache.org> Reviewers: Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6628 from sekikn/remove-unnecessary-character	6 years ago
Lysss	ad4a7c3436	MINOR: Make LogCleaner.shouldRetainRecord more readable (#6590 ) Reviewers: Bob Barrett <bob.barrett@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Sönke Liebau	3eaccb3eff	MINOR: Remove implicit return statement (#6629 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	17c8016646	KAFKA-8237; Untangle TopicDeleteManager and add test cases (#6588 ) The controller maintains state across `ControllerContext`, `PartitionStateMachine`, `ReplicaStateMachine`, and `TopicDeletionManager`. None of this state is actually isolated from the rest. For example, topics undergoing deletion are intertwined with the partition and replica states. As a consequence of this, each of these components tends to be dependent on all the rest, which makes testing and reasoning about the system difficult. This is a first step toward untangling all the state. This patch moves it all into `ControllerContext` and removes many of the circular dependencies. So far, this is mostly a direct translation, but in the future we can add additional validation in `ControllerContext` to make sure that state is maintained consistently. Additionally, this patch adds several mock objects to enable easier testing: `MockReplicaStateMachine` and `MockPartitionStateMachine`. These have simplified logic for updating the current state. This is used to create some new test cases for `TopicDeletionManager`. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	e32d33cd4f	KAFKA-7965; Fix flaky test ConsumerBounceTest We suspect the problem might be a race condition after broker startup where the consumer has yet to find the coordinator and rebalance. The fix here rolls all the brokers first and then waits for the expected exception. Author: Jason Gustafson <jason@confluent.io> Reviewers: Gwen Shapira Closes #6608 from hachikuji/KAFKA-7965	6 years ago
John Roesler	7b4b298edd	HOTFIX: Fix compilation error in `ProducerStateManagerTest` (#6603 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	6964c356aa	KAFKA-7866; Ensure no duplicate offsets after txn index append failure (#6570 ) This patch fixes a bug in the append logic which can cause duplicate offsets to be appended to the log when the append to the transaction index fails. Rather than incrementing the log end offset after the index append, we do it immediately after the records are written to the log. If the index append later fails, we do two things: 1) We ensure that the last stable offset cannot advance. This guarantees that the aborted data will not be returned to the user until the transaction index contains the corresponding entry. 2) We skip updating the end offset of the producer state. When recovering the log, we will have to reprocess the log and write the index entries. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	48179677a7	MINOR: Ensure producer state append exceptions areuseful (#6591 ) We should include partition/offset information when we raise exceptions during producer state validation. This saves a lot of the discovery work to figure out where the problem occurred. This patch also includes a new test case to verify additional coordinator fencing cases. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
huxi	a05eaaa8f4	KAFKA-7965; Fix testRollingBrokerRestartsWithSmallerMaxGroupSizeConfigDisruptsBigGroup (#6557 ) Most of the time, the group coordinator runs on broker 1. Occasionally the group coordinator will be placed on broker 2. If that's the case, the loop starting at line 320 have no chance to check and update `kickedOutConsumerIdx`. A quick fix is to safely do another round of loop to ensure `kickedOutConsumerIdx` always be checked after the last broker restart. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Manikumar Reddy	3b1524c5df	KAFKA-7466: Add IncrementalAlterConfigs API (KIP-339) (#6247 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Viktor Somogyi <viktorsomogyi@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Alex Dunayevsky	47a9871ef6	KAFKA-7471: Multiple Consumer Group Management Feature (#5726 ) * Describe/Delete/Reset offsets on multiple consumer groups at a time (including each group by repeating `--group` parameter) * Describe/Delete/Reset offsets on ALL consumer groups at a time (add new `--all-groups` option similar to `--all-topics`) * Reset plan CSV file generation reworked: structure updated to support multiple consumer groups and make sure that CSV file generation is done properly since there are no restrictions on consumer group names and symbols like commas and quotes are allowed. * Extending data output table format by adding `GROUP` column for all `--describe` queries	6 years ago
Sönke Liebau	9495b5f991	MINOR: Mention in configuration of broker setting log.retention.ms that -1 disables retention by time (#6464 ) Includes an update to the relevant configuration doc.	6 years ago
Rajini Sivaram	51a67d52cb	KAFKA-8232; Test topic delete completion rather than intermediate state (#6581 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Jason Gustafson	53e95ffcdb	MINOR: Use generated InitProducerId RPC (#6538 ) This patch updates the InitProducerId request API to use the generated sources. It also fixes a small bug in the DescribeAclsRequest class where we were using the wrong api key. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Colin McCabe <cmccabe@apache.org>	6 years ago
Jason Gustafson	db338ef67c	MINOR: Move common consumer tests out of abstract consumer class (#6548 ) ConsumerBounceTest redundantly executes a couple test cases which were included in the abstract class `BaseConsumerTest`. We should try to keep a cleaner separation of testing logic and utility logic so that this does not happen (the build time is long enough without doing unnecessary work). This PR moves the cluster initialization and consumer utilities out of BaseConsumerTest and into a new class AbstractConsumerTest. We then let ConsumerBounceTest extend AbstractConsumerTest. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stanislav Kozlovski	cc4fde35c9	KAFKA-7893; Refactor ConsumerBounceTest to reuse functionality from BaseConsumerTest (#6238 ) This PR should help address the flakiness in the ConsumerBounceTest#testRollingBrokerRestartsWithSmallerMaxGroupSizeConfigDisruptsBigGroup test (https://issues.apache.org/jira/browse/KAFKA-7965). I tested this locally and have verified it significantly reduces flakiness - 25/25 tests now pass. Running the test 25 times in trunk, I'd get `18/25` passes. It does so by reusing the less-flaky consumer integration testing functionality inside `BaseConsumerTest`. Most notably, the test now makes use of the `ConsumerAssignmentPoller` class - each consumer now polls non-stop rather than the more batch-oriented polling we had in `ConsumerBounceTest#waitForRebalance()`. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	844120c601	KAFKA-8190; Don't update keystore modification time during validation (#6539 ) Ensure that modification time is checked against the file used to create the SSLContext that is in-use so that SSLContext is updated whenever file is modified and a config update request is received. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Mickael Maison	825fa3fa09	MINOR: Fixed a few warning in core and connects (#6545 ) - var -> val - unused imports - Javadoc fix Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Kevin Lu	31d191fc85	KAFKA-7904; Add AtMinIsr partition metric and TopicCommand option (KIP-427) - Add `AtMinIsrPartitionCount` metric to `ReplicaManager` - Add `AtMinIsr` metric to `Partition` - Add `--at-min-isr-partitions` describe `TopicCommand` option https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=103089398 Author: Kevin Lu <lu.kevin@berkeley.edu> Author: lu.kevin@berkeley.edu <kelu@paypal.com> Reviewers: Gwen Shapira Closes #6421 from KevinLiLu/KAFKA-7904	6 years ago
Mickael Maison	c301025484	KAFKA-8090: Use automatic RPC generation in ControlledShutdown Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6423 from mimaison/controlled-shutdown	6 years ago
Viktor Somogyi-Vass	e560bae22a	KAFKA-8030: Fix flaky tests in TopicCommandWithAdminClientTest This change adds waits for metadata updates after killing the broker in order to make the tests more stable. Author: Viktor Somogyi-Vass <viktorsomogyi@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6505 from viktorsomogyi/flaky-min-isr-test	6 years ago
Mickael Maison	981815c8d1	KAFKA-8034: Use automatic RPC generation in DeleteTopics Reviewers: Colin P. McCabe <cmccabe@apache.org>	6 years ago

1 2 3 4 5 ...

2623 Commits (f74cedb985bc573e3638dd272ebbe9a87f5482ec)