src-kafka

Commit Graph

Author	SHA1	Message	Date
dengziming	33c8774ace	KAFKA-9353; Add groupInstanceId to DescribeGroup for better visibility (#7886 ) Kafka-8538(#6957) has already added `group.instance.id` to `MemberDescription` but didn't print it in the describe group output, so this patch adds the logic to do so. Before the change, the describe command prints as follows: ``` GROUP CONSUMER-ID HOST CLIENT-ID #PARTITIONS DemoConsumer consumer-DemoConsumer-2-89251f12-f0ae-4dc1-a118-bda49f2a6e86 /127.0.0.1 consumer-DemoConsumer-2 0 DemoConsumer consumer-DemoConsumer-1-72221c6b-f3d9-4c68-96db-ffffa12ddf93 /127.0.0.1 consumer-DemoConsumer-1 1 ``` After the change, the describe command prints as follows: ``` GROUP CONSUMER-ID GROUP-INSTANCE-ID HOST CLIENT-ID #PARTITIONS DemoConsumer groupIns2-f050379c-9c0d-433c-bbe0-44de6177b60d groupIns2 /127.0.0.1 consumer-DemoConsumer-groupIns2 0 DemoConsumer groupIns1-44805ba9-ae6f-49d3-89af-44a4b95aff8d groupIns1 /127.0.0.1 consumer-DemoConsumer-groupIns1 1 ``` If all the `GROUP-INSTANCE-ID` is null, just as the previous: ``` GROUP CONSUMER-ID HOST CLIENT-ID #PARTITIONS DemoConsumer consumer-DemoConsumer-2-89251f12-f0ae-4dc1-a118-bda49f2a6e86 /127.0.0.1 consumer-DemoConsumer-2 0 DemoConsumer consumer-DemoConsumer-1-72221c6b-f3d9-4c68-96db-ffffa12ddf93 /127.0.0.1 consumer-DemoConsumer-1 1 ``` Reviewers: Alice <WheresAlice@users.noreply.github.com>, Matthias J. Sax <matthias@confluent.io>, Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Ismael Juma	ac3043cff0	MINOR: Remove unused `Json.legacyEncodeAsString` (#8726 ) Updated a couple of test usages not to rely on it and removed the tests for the removed method. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Jason Gustafson	c95b45d04f	MINOR: Add reason to log message when incrementing the log start offset (#8701 ) Sometimes logging leaves us guessing at the cause of an increment to the log start offset. Since this results in deletion of user data, we should provide the reason explicitly. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago
Lucas Bradstreet	c6adcca95f	MINOR: avoid unnecessary list iteration in ApiVersion.lastVersion (#8708 ) We unnecessarily iterate the versions list each time we lookup lastVersion, including in the hotpath Log.appendAsFollower. Given that allVersions is a constant, this is unnecessary. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago
阿洋	45383f75b3	KAFKA-10022:console-producer supports the setting of client.id (#8698 ) "console-producer" supports the setting of "client.id", which is a reasonable requirement, and the way "console consumer" and "console producer" handle "client.id" can be unified. "client.id" defaults to "console-producer" Co-authored-by: xinzhuxiansheng <xinzhuxiansheng@autohome.com.cn> Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Ismael Juma	5302efb2d1	MINOR: Improve broker registration and Log logging (#8714 ) Broker registration previously: > INFO Registered broker 0 at path /brokers/ids/0 with addresses: ArraySeq(EndPoint(localhost,9092,ListenerName(PLAINTEXT),PLAINTEXT),EndPoint(localhost,9093,ListenerName(SSL),SSL)), czxid (broker epoch): 4294967320 (kafka.zk.KafkaZkClient) Now: > INFO Registered broker 0 at path /brokers/ids/0 with addresses: PLAINTEXT://localhost:9092,SSL://localhost:9093, czxid (broker epoch): 4294967320 (kafka.zk.KafkaZkClient) The second improvement is to avoid logging messages like: > "Deleting segments List()" Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Matthias J. Sax	27824baa21	KAFKA-10003: Mark KStream.through() as deprecated and update Scala API (#8679 ) - part of KIP-221 Co-authored-by: John Roesler <john@confluent.io>	5 years ago
Brian Byrne	d9e9a18a19	KAFKA-9980: Fix bug where alterClientQuotas could not set default client quotas (#8658 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>	5 years ago
Anna Povzner	f6781f42ff	MINOR: Added unit tests for ConnectionQuotas (#8650 ) Reviiewers: David Jacot <djacot@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	5 years ago
Levani Kokhreidze	67770072da	KAFKA-9859 / kafka-streams-application-reset tool doesn't take into account topics generated by KTable foreign key join operation (#8671 ) This PR fixes kafka-streams-application-reset tool. Before, kafka-streams-application-reset tool wasn't taking into account topics generated by KTable foreign key join operation. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	5 years ago
showuon	ad0850659f	KAFKA-10004: ConfigCommand fails to find default broker configs without ZK (#8675 ) Reviewers: Brian Byrne <bbyrne@confluent.io>, Colin P. McCabe <cmccabe@apache.org>	5 years ago
Colin Patrick McCabe	1f2ff73b28	KIP-551: Expose disk read and write metrics (#8569 ) Reviewers: David Arthur <mumrah@gmail.com>, Mickael Maison <mickael.maison@gmail.com>	5 years ago
Chia-Ping Tsai	78e18b575c	KAFKA-9617 Replica Fetcher can mark partition as failed when max.message.bytes is changed (#8659 ) Skip to check the size of record if the record is already accepted by leader. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
Jason Gustafson	81cf3fa5f2	KAFKA-9669; Loosen validation of inner offsets for older message formats (#8647 ) Prior to KAFKA-8106, we allowed the v0 and v1 message formats to contain non-consecutive inner offsets. Inside `LogValidator`, we would detect this case and rewrite the batch. After KAFKA-8106, we changed the logic to raise an error in the case of the v1 message format (v0 was still expected to be rewritten). This caused an incompatibility for older clients which were depending on the looser validation. This patch reverts the old logic of rewriting the batch to fix the invalid inner offsets. Note that the v2 message format has always had stricter validation. This patch also adds a test case for this. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago
Ismael Juma	391ad90112	KAFKA-9956: Authorizer APIs may be invoked more than once for a given request (#8643 ) * Fix describeConfigs and alterConfigs not to invoke authorizer more than once * Add tests to KafkaApisTest to verify the fixes * Rename `filterAuthorized` to `filterByAuthorized` * Tweak `filterByAuthorized` to take resources instead of resource names and improve implementation * Introduce `partitionMapByAuthorized` and `partitionSeqByAuthorized` and simplify code by using it * Replace List with Seq in some AdminManager methods * Remove stray `println` in `KafkaApisTest` Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Ismael Juma	847ff8f557	MINOR: Use `forEach` and `ifPresent` to simplify Scala code (#8642 ) * Use `forEach` instead of `asScala.foreach` for Java Iterables. * Use `ifPresent` instead of `asScala.foreach` for Java Optionals. * Use `forEach` instead of `entrySet.forEach` for Java maps. * Keep `asScala.foreach` for `Properties` as the Scala implementation has a better interface (keys and values are of type `String`). * A few clean-ups: unnecessary `()`, `{}`, `new`, etc. Reviewers: Manikumar Reddy <manikumar@confluent.io>	5 years ago
Brian Byrne	5583089df0	KAFKA-9942: ConfigCommand fails to set client quotas for default users with --bootstrap-server. (#8628 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>	5 years ago
Jason Gustafson	849b65a13a	KAFKA-9947; Ensure proper shutdown of services in `TransactionsBounceTest` (#8602 ) This patch ensures that both clients and the bounce schedule get shutdown properly in this test. Additionally, it fixes the surprising behavior of using the passed delivery timeout to override the request timeout in `createTransactionalProducer`. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Viktor Somogyi	d70dacb54a	KAFKA-6342; Remove unused workaround for JSON parsing of non-escaped strings (#8591 ) Previously we had fallback logic when parsing ACLs to handle older entries which may contain non-escaped characters. This code became dead after 1.1 since it was no longer used in the parsing of ACLs. This patch removes the fallback logic. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	5 years ago
David Jacot	3b6bf80530	KAFKA-9946; Partition deletion event should only be sent if deletion was requested in the StopReplica request (#8609 ) This patch fixes a regression in the `StopReplica` response handling. We should only send the event on receiving the `StopReplica` response if we had requested deletion in the request. Reviewers: Lucas Bradstreet <lucas@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Ismael Juma	fbfda2c4ad	KAFKA-9731: Disable immediate fetch response for hw propagation if replica selector is not defined (#8607 ) In the case described in the JIRA, there was a 50%+ increase in the total fetch request rate in 2.4.0 due to this change. I included a few additional clean-ups: * Simplify `findPreferredReadReplica` and avoid unnecessary collection copies. * Use `LongSupplier` instead of `Supplier<Long>` in `SubscriptionState` to avoid unnecessary boxing. Added a unit test to ReplicaManagerTest and cleaned up the test class a bit including consistent usage of Time in MockTimer and other components. Reviewers: Gwen Shapira <gwen@confluent.io>, David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
THREE LEVEL HELMET	7cb1600d6a	MINOR: Clean up some test dependencies on ConfigCommand and TopicCommand (#8527 ) Avoid calling into ConfigCommand and TopicCommand from tests that are not related to these commands. It's better to just invoke the admin APIs. Change a few cases where we were testing the deprecated --zookeeper flag to testing the --bootstrap-server flag instead. Unless we're explicitly testing the deprecated code path, we should be using the non-deprecated flags. Move testCreateWithUnspecifiedReplicationFactorAndPartitionsWithZkClient from TopicCommandWithAdminClientTest.scala into TopicCommandWithZKClientTest.scala, since it makes more sense in the latter. Reviewers: Colin P. McCabe <cmccabe@apache.org>	5 years ago
Guozhang Wang	34824b7bff	KAFKA-9798: Send one round synchronously before starting the async producer (#8565 ) Comparing all other test cases, the shouldAllowConcurrentAccesses starts an async producer sending records throughout the test other than just synchronously sent and acked a few records before we start the streams application. Right after the streams app is started, we check that at least one record is sent to the output topic (i.e. completed processing). However since only this test starts the producer async and did not wait for it to complete, it is possible that the async producer gets too longer to produce some records and causing it to fail. To follow what other tests did, I let this test to first send one round of records synchronously before starting the async producing. Also encountered some new scala warnings that I fixed along with this PR. Reviewers: Matthias J. Sax <matthias@confluent.io>	5 years ago
Leonard Ge	2aecb089af	KAFKA-9589: Enable testLogAppendTimeNonCompressedV2 and fix bug in helper method (#8533 ) Adjust `checkLogAppendTimeNonCompressed` to assert `shallowOffsetOfMaxTimestamp` correctly for message format 2. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Jason Gustafson	794648aa55	KAFKA-9939; Fix overcounting delayed fetches in request rate metrics (#8586 ) Fetches which hit purgatory are currently counted twice in fetch request rate metrics. This patch moves the metric update into `fetchMessages` so that they are only counted once. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Chia-Ping Tsai	6bb4bbf874	HOTFIX: Avoid ambiguity error of Properties#putAll in Java 11 and scala 2.12 (#8599 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Ismael Juma	322b10964c	KAFKA-9652: Fix throttle metric in RequestChannel and request log due to KIP-219 (#8567 ) After KIP-219, responses are sent immediately and we rely on a combination of clients and muting of the channel to throttle. The result of this is that we need to track `apiThrottleTimeMs` as an explicit value instead of inferring it. On the other hand, we no longer need `apiRemoteCompleteTimeNanos`. Extend `BaseQuotaTest` to verify that throttle time in the request channel metrics are being set. Given the nature of the throttling numbers, the test is not particularly precise. I included a few clean-ups: * Pass KafkaMetric to QuotaViolationException so that the caller doesn't have to retrieve it from the metrics registry. * Inline Supplier in SocketServer (use SAM). * Reduce redundant `time.milliseconds` and `time.nanoseconds`calls. * Use monotonic clock in ThrottledChannel and simplify `compareTo` method. * Simplify `TimerTaskList.compareTo`. * Consolidate the number of places where we update `apiLocalCompleteTimeNanos` and `responseCompleteTimeNanos`. * Added `toString` to ByteBufferSend` and `MultiRecordsSend`. * Restrict access to methods in `QuotaTestClients` to expose only what we need to. Reviewers: Jun Rao <junrao@gmail.com>	5 years ago
Ismael Juma	7edbff3394	KAFKA-9932: Don't load configs from ZK when the log has already been loaded (#8582 ) If a broker contains 8k replicas, we would previously issue 8k ZK calls to retrieve topic configs when processing the first LeaderAndIsr request. That should translate to 0 after these changes. Credit to @junrao for identifying the problem. Reviewers: Jun Rao <junrao@gmail.com>	5 years ago
Leonard Ge	77ac06f3f1	Minor: remove redundant check in auto preferred leader election (#8566 ) This is a minor follower up PR of #8524 Reviewer: Jun Rao <junrao@gmail.com>	5 years ago
David Jacot	c5d13dcb6c	KAFKA-9885; Evict last members of a group when the maximum allowed is reached (#8525 ) This PR updates the algorithm which limits the number of members within a group (`group.max.size`) to fix the following two issues: 1. As described in KAFKA-9885, we found out that multiple members of a group can be evicted if the leader of the consumer offset partition changes before the group is persisted. This happens because the current eviction logic always evict the first member rejoining the group. 2. We also found out that dynamic members, when required to have a known member id, are not always limited. The caveat is that the current logic only considers unknown members and uses the group size, which does not include the so called pending members, to accept or reject a member. In this case, when they rejoins, they are not unknown member anymore and thus could bypass the limit. See `testDynamicMembersJoinGroupWithMaxSizeAndRequiredKnownMember` for the whole scenario. This PR changes the logic to address the above two issues and extends the tests coverage to cover all the member types. Reviewers: Jason Gustafson <jason@confluent.io>	5 years ago
Leonard Ge	db9e55a50f	KAFKA-9866: Avoid election for topics where preferred leader is not in ISR (#8524 ) In this commit we made sure that the auto leader election only happens after the newly starter broker is in the isr. No accompany tests are added due to the fact that: this is a change to the private method and no public facing change is made it is hard to create tests for this change without considerable effort Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jun Rao <junrao@gmail.com>	5 years ago
Anna Povzner	bd17085ec1	KAFKA-9839; Broker should accept control requests with newer broker epoch (#8509 ) A broker throws IllegalStateException if the broker epoch in the LeaderAndIsr/UpdateMetadataRequest/StopReplicaRequest is larger than its current broker epoch. However, there is no guarantee that the broker would receive the latest broker epoch before the controller: when the broker registers with ZK, there are few more instructions to process before this broker "knows" about its epoch, while the controller may already get notified and send UPDATE_METADATA request (as an example) with the new epoch. This will result in clients getting stale metadata from this broker. With this PR, a broker accepts LeaderAndIsr/UpdateMetadataRequest/StopReplicaRequest if the broker epoch is newer than the current epoch. Reviewers: David Jacot <djacot@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Aneel Nazareth	2ca19cf603	KAKFA-9612: Add an option to kafka-configs.sh to add configs from a prop file (KIP-574) Add an option to kafka-configs.sh `--add-config-file` that adds the configs from a properties file. Testing: Added new tests to ConfigCommandTest.scala Author: Aneel Nazareth <aneel@confluent.io> Reviewers: David Jacot <djacot@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #8184 from WanderingStar/KAFKA-9612	5 years ago
José Armando García Sancio	d63eaaaa01	MINOR: Partition is under reassignment when adding and removing (#8364 ) A partition is under reassignment if the either the set of adding replicas or set removing replicas is non-empty. Fix the test assertion such that it prints stdout on failure. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
Lucas Bradstreet	cfc34cace5	MINOR: reduce allocations in log start and recovery checkpoints (#8467 ) For brokers with replica counts > 4000, allocations from logsByDir become substantial. logsByDir is called often by LogManager.checkpointLogRecoveryOffsets and LogManager.checkpointLogStartOffsets. The approach used is similar to the one from the checkpointHighwatermarks change in https://github.com/apache/kafka/pull/6741. Are there better ways to structure out data structure to avoid creating logsByDir on demand for each checkpoint iteration? This micro-optimization will help as is, but if we can avoid doing this completely it'd be better. JMH benchmark results: ``` Before: Benchmark (numPartitions) (numTopics) Mode Cnt Score Error Units CheckpointBench.measureCheckpointLogStartOffsets 3 100 thrpt 15 2.233 ± 0.013 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 100 thrpt 15 477.097 ± 49.731 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 100 thrpt 15 246083.007 ± 33.052 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 100 thrpt 15 475.683 ± 55.569 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 100 thrpt 15 245474.040 ± 14968.328 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 100 thrpt 15 0.001 ± 0.001 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 100 thrpt 15 0.341 ± 0.268 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 100 thrpt 15 129.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 100 thrpt 15 52.000 ms CheckpointBench.measureCheckpointLogStartOffsets 3 1000 thrpt 15 0.572 ± 0.004 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 1000 thrpt 15 1360.240 ± 150.539 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 1000 thrpt 15 2750221.257 ± 891.024 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 1000 thrpt 15 1362.908 ± 148.799 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 1000 thrpt 15 2756395.092 ± 44671.843 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 1000 thrpt 15 0.017 ± 0.008 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 1000 thrpt 15 33.611 ± 14.401 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 1000 thrpt 15 273.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 1000 thrpt 15 186.000 ms CheckpointBench.measureCheckpointLogStartOffsets 3 2000 thrpt 15 0.266 ± 0.002 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 2000 thrpt 15 1342.557 ± 171.260 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 2000 thrpt 15 5877881.729 ± 3695.086 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 2000 thrpt 15 1343.965 ± 186.069 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 2000 thrpt 15 5877788.561 ± 168540.343 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 2000 thrpt 15 0.081 ± 0.043 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 2000 thrpt 15 351.277 ± 167.006 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 2000 thrpt 15 253.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 2000 thrpt 15 231.000 ms JMH benchmarks done After: CheckpointBench.measureCheckpointLogStartOffsets 3 100 thrpt 15 2.809 ± 0.129 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 100 thrpt 15 211.248 ± 25.953 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 100 thrpt 15 86533.838 ± 3763.989 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 100 thrpt 15 211.512 ± 38.669 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 100 thrpt 15 86228.552 ± 9590.781 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 100 thrpt 15 ≈ 10⁻³ MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 100 thrpt 15 0.140 ± 0.111 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 100 thrpt 15 57.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 100 thrpt 15 25.000 ms CheckpointBench.measureCheckpointLogStartOffsets 3 1000 thrpt 15 1.046 ± 0.030 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 1000 thrpt 15 524.597 ± 74.793 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 1000 thrpt 15 582898.889 ± 37552.262 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 1000 thrpt 15 519.675 ± 89.754 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 1000 thrpt 15 576371.150 ± 55972.955 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 1000 thrpt 15 0.009 ± 0.005 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 1000 thrpt 15 9.920 ± 5.375 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 1000 thrpt 15 111.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 1000 thrpt 15 56.000 ms CheckpointBench.measureCheckpointLogStartOffsets 3 2000 thrpt 15 0.617 ± 0.007 ops/ms CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate 3 2000 thrpt 15 573.061 ± 95.931 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.alloc.rate.norm 3 2000 thrpt 15 1092098.004 ± 75140.633 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space 3 2000 thrpt 15 572.448 ± 97.960 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Eden_Space.norm 3 2000 thrpt 15 1091290.460 ± 85946.164 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen 3 2000 thrpt 15 0.010 ± 0.012 MB/sec CheckpointBench.measureCheckpointLogStartOffsets:·gc.churn.G1_Old_Gen.norm 3 2000 thrpt 15 19.990 ± 24.407 B/op CheckpointBench.measureCheckpointLogStartOffsets:·gc.count 3 2000 thrpt 15 109.000 counts CheckpointBench.measureCheckpointLogStartOffsets:·gc.time 3 2000 thrpt 15 67.000 ms JMH benchmarks done ``` For the 2000 topic, 3 partition case, we see a reduction in normalized allocations from 5877881B/op to 1284190.774B/op, a reduction of 78%. Some allocation profiles from a mid sized broker follow. I have seen worse, but these add up to around 3.8% on a broker that saw GC overhead in CPU time of around 30%. You could argue that this is relatively small, but it seems worthwhile for a low risk change. ![image](https://user-images.githubusercontent.com/252189/79058104-33e91d80-7c1e-11ea-99c9-0cf2e3571e1f.png) ![image](https://user-images.githubusercontent.com/252189/79058105-38add180-7c1e-11ea-8bfd-6e6eafb0c794.png) Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago
zshuo	62b2eac4e1	KAFKA-9704: Fix the issue z/OS won't let us resize file when mmap. (#8224 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	5 years ago
Boyang Chen	f3c8bff311	KAFKA-8639: Replace AddPartitionsToTxn with Automated Protocol (#8326 ) Part of the protocol automation effort. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
David Jacot	85e81d48c8	KAFKA-9844; Fix race condition which allows more than maximum number of members(#8454 ) This patch fixes a race condition in the join group request handling which sometimes results in not enforcing the maximum number of members allowed in a group. Reviewers: Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Guozhang Wang	5c0fd36ee5	KAFKA-9823: Remember the sent generation for the coordinator request (#8445 ) For join / sync / commit / heartbeat request, we would remember the sent generation in the created handler object, and then upon getting the error code, we could check whether the sent generation still matches the current generation. If not, it means that the member has already reset its generation or has participated in a new rebalance already. This means: 1. For join / sync-group request, we do not need to call reset-generation any more for illegal-generation / unknown-member. But we would still set the error since at a given time only one join/sync round-trip would be in flight, and hence we should not be participating in a new rebalance. Also for fenced instance error we still treat it as fatal since we should not be participating in a new rebalance, so this is still not expected. 2. For commit request, we do not set the corresponding error for illegal-generation / unknown-member / fenced-instance but raise rebalance-in-progress. For commit-sync it would be still thrown to user, while for commit-async it would be logged and swallowed. 3. For heartbeat request, we do not treat illegal-generation / unknown-member / fenced-instance errors and just consider it as succeeded since this should be a stale heartbeat which can be ignored. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	5 years ago
Ismael Juma	c5ae154a3f	MINOR: Enable fatal warnings with scala 2.13 (#8429 ) * Upgrade to Scala 2.13.2 which introduces the ability to suppress warnings. * Upgrade to scala-collection-compat 2.1.6 as it introduces the @nowarn annotation for Scala 2.12. * While at it, also update scala-java8-compat to 0.9.1. * Fix compiler warnings and add @nowarn for the unfixed ones. Scala 2.13.2 highlights (besides @nowarn): * Rewrite Vector (using "radix-balanced finger tree vectors"), for performance. Small vectors are now more compactly represented. Some operations are now drastically faster on large vectors. A few operations may be a little slower. * Matching strings makes switches in bytecode. https://github.com/scala/scala/releases/tag/v2.13.2 Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	5 years ago
Ismael Juma	065415e524	MINOR: Upgrade gradle plugins and test libraries for Java 14 support (#8519 ) Also: * Remove deprecated `=` in resolutionStrategy. * Replace `AES/GCM/PKCS5Padding` with `AES/GCM/NoPadding` in `PasswordEncoderTest`. The former is invalid and JDK 14 rejects it, see https://bugs.openjdk.java.net/browse/JDK-8229043. With these changes, the build works with Java 14 and Scala 2.12. The same will apply to Scala 2.13 when Scala 2.13.2 is released (should happen within 1-2 weeks). Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Matthias J. Sax <matthias@confluent.io>	5 years ago
Lucas Bradstreet	851b45c842	MINOR: reduce impact of trace logging in replica hot path (#8468 ) The impact of trace logging is normally small, on the order of 40ns per getEffectiveLevel check, however this adds up with trace is called multiple times per partition in the replica fetch hot path. This PR removes some trace logs that are not very useful and reduces cases where the level is checked over and over for one fetch request. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	5 years ago
Lucas Bradstreet	00a59b392d	MINOR: improve test coverage for dynamic LogConfig(s) (#7616 ) Adding a dynamically updatable log config is currently error prone, as it is easy to set them up as a val not a def and this would result in a dynamically updated broker default not applying to a LogConfig after broker restart. This PR adds a guard against introducing these issues by ensuring that all log configs are exhaustively checked via a test. For example, if the following line was a val and not a def, there would be a problem with dynamically updating broker defaults for the config. `4bde9bb3cc/core/src/main/scala/kafka/server/KafkaConfig.scala (L1216)` Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>	5 years ago
David Jacot	9a36d9f913	KAFKA-9796; Ensure broker shutdown is not stuck when Acceptor is waiting on connection queue (#8448 ) This commit reworks the SocketServer to always start the acceptor threads after the processor threads and to always stop the acceptor threads before the processor threads. It ensures that the acceptor shutdown is not blocked waiting on the processors to be fully shutdown by decoupling the shutdown signal and the awaiting. It also ensure that the processor threads drain its newConnection queue to unblock acceptors that may be waiting. However, the acceptors still bind during the startup, only the processing of new connections and requests is further delayed. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	5 years ago
Ismael Juma	0e46dd4a1d	MINOR: Use streaming iterator with decompression buffer when building offset map (#8494 ) This makes it consistent with the `filterTo` methods.	5 years ago
Jason Gustafson	413c4b55b5	KAFKA-9838; Add log concurrency test and fix minor race condition (#8476 ) The patch adds a new test case for validating concurrent read/write behavior in the `Log` implementation. In the process of verifying this, we found a race condition in `read`. The previous logic checks whether the start offset is equal to the end offset before collecting the high watermark. It is possible that the log is truncated in between these two conditions which could cause the high watermark to be equal to the log end offset. When this happens, `LogSegment.read` fails because it is unable to find the starting position to read from. Reviewers: Guozhang Wang <wangguoz@gmail.com>	5 years ago
David Jacot	f646c9c0bc	MINOR: KafkaApis#handleOffsetDeleteRequest does not group result correctly (#8485 ) `KafkaApis#handleOffsetDeleteRequest` does not build the response correctly because `topics.add` is not in the correct loop. Fortunately, due to how the response is processed by the admin client, it works but sends redundant information on the wire. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Jason Gustafson <jason@confluent.io>	5 years ago
Stanislav Kozlovski	bd427346a4	MINOR: Serialize state change logs for handling LeaderAndIsr and StopReplica requests (#8493 ) This patch moves the state change logger logs for handling a LeaderAndIsr/StopReplica request inside the replicaStateChangeLock in order to serialize the logs. This helps to tell apart per-partition actions of concurrent LAIR/StopReplica requests in cases where requests pile up waiting on the lock. Reviewer: Jun Rao <junrao@gmail.com>	5 years ago
Lucas Bradstreet	0a5097323b	KAFKA-9864: Avoid expensive QuotaViolationException usage (#8477 ) QuotaViolationException generates an exception message via String.format in the constructor even though the message is often not used, e.g. https://github.com/apache/kafka/blob/trunk/core/src/main/scala/kafka/server/ClientQuotaManager.scala#L258. We now override `toString` instead. It also generates an unnecessary stack trace, which is now avoided using the same pattern as in ApiException. I have also avoided use of QuotaViolationException for control flow in ReplicationQuotaManager which is another hotspot that we have seen in practice. Reviewers: Gwen Shapira <gwen@confluent.io>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Ismael Juma <ismael@juma.me.uk>	5 years ago
Lucas Bradstreet	4ac2ad3a2b	MINOR: Eliminate unnecessary partition lookups (#8484 ) There are two cases in the fetch pass where a partition is unnecessarily looked up from the partition Pool, when one is already accessible. This will be a fairly minor improvement on high partition count clusters, but could be worth 1% from some profiles I have seen. More importantly, the code is cleaner this way. Reviewers: Ismael Juma <ismael@juma.me.uk>	5 years ago

1 2 3 4 5 ...

2990 Commits (1672a75e1f04ce3b7cd4fa202942a8887cf811e1)