We should not use StreamsMetricsImpl.threadLevelSensor directly, which would only retrieve the sensor but would not add any metrics to it. Generally speaking, we should always use the corresponding-level metrics class (e.g. ThreadMetrics) to get the sensors, which are populated with their metrics.
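For illustration, a minimal sketch of the intended pattern, assuming a ThreadMetrics.commitSensor helper of roughly this shape (exact method names and signatures are an assumption and differ across Streams versions):

```java
import org.apache.kafka.common.metrics.Sensor;
import org.apache.kafka.streams.processor.internals.metrics.StreamsMetricsImpl;
import org.apache.kafka.streams.processor.internals.metrics.ThreadMetrics;

class ThreadSensorSketch {
    static Sensor commitSensor(final String threadId, final StreamsMetricsImpl streamsMetrics) {
        // Discouraged: only creates/looks up a bare sensor; no rate, latency, or
        // total metrics are registered on it:
        //   streamsMetrics.threadLevelSensor(threadId, "commit", Sensor.RecordingLevel.INFO);

        // Preferred: the thread-level metrics class both creates the sensor and
        // populates it with its metrics (signature shown here is an assumption).
        return ThreadMetrics.commitSensor(threadId, streamsMetrics);
    }
}
```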
Reviewers: Guozhang Wang <wangguoz@gmail.com>, John Roesler <john@confluent.io>, Bill Bejeck <bbejeck@gmail.com>, Matthias J. Sax <matthias@confluent.io>
Rather than maintain hand coded protocol serialization code, Streams could use the same code-generation framework as Clients/Core.
There isn't a perfect match, since the code generation framework includes an assumption that you're generating "protocol messages", rather than just arbitrary blobs, but I think it's close enough to justify using it, and improving it over time.
Using the code generation allows us to drop a lot of detail-oriented, brittle, and hard-to-maintain serialization logic in favor of a schema spec.
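As a rough sketch of what the generated code buys us, assuming a generated Streams class such as SubscriptionInfoData (class and package names are assumptions here), serialization collapses to the framework's generic size/write/read path instead of hand-written parsing:

```java
import java.nio.ByteBuffer;
import org.apache.kafka.common.protocol.ByteBufferAccessor;
import org.apache.kafka.common.protocol.ObjectSerializationCache;
import org.apache.kafka.streams.internals.generated.SubscriptionInfoData;

class GeneratedMessageSerde {
    // Serialize a generated message at a given schema version.
    static ByteBuffer encode(final SubscriptionInfoData data, final short version) {
        final ObjectSerializationCache cache = new ObjectSerializationCache();
        final ByteBuffer buffer = ByteBuffer.allocate(data.size(cache, version));
        data.write(new ByteBufferAccessor(buffer), cache, version);
        buffer.rewind();
        return buffer;
    }

    // Deserialize it back; versioning and field defaults are handled by the
    // generated code rather than by hand-maintained logic.
    static SubscriptionInfoData decode(final ByteBuffer buffer, final short version) {
        return new SubscriptionInfoData(new ByteBufferAccessor(buffer), version);
    }
}
```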
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Boyang Chen <boyang@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
We would mistakenly increment the `OffsetCommits` metric instead
Author: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>
Reviewers: David Jacot <djacot@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #7624 from stanislavkozlovski/minor-fix-group-coordinator-offset-expiry-metric
`log` in KafkaConsumer does not get initialized if an invalid value for group.instance.id is given during consumer construction. In this case we should skip the catch block's close procedure, since no internal objects have been initialized yet.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Refactors metrics according to KIP-444
Introduces StateStoreMetrics as a central provider for state store metrics
Adds metric scope (a.k.a. store type) to the in-memory suppression buffer
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>
In 18d4e57f6e (diff-394389922df5210adf43a8b7064cc4ffL61) we unintentionally renamed the metric as part of the previous changes to reassignments.
Author: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #7606 from stanislavkozlovski/minor-partition-reassignment-metric
Third bugfix for the failing broker bounce system test with cooperative rebalancing:
tl;dr We need to remove everything associated with a task when it is closed, but in some cases (e.g. AssignedTasks#commit) on a `TaskMigratedException` we would close it as a zombie and then (only) remove the taskId from the `running` map. This left its partitions, restorers, state stores, etc. around and in an undefined state, causing exceptions when closing and/or opening the stores again.
Longer explanation:
In AssignedTasks (the abstract class from which the standby and active task variations extend), a commit failure (even one due to the broker being down/unavailable) is treated as a TaskMigratedException, after which the failed task is closed as a zombie and removed from `running` -- the remaining tasks (i.e. those still in `running`) are then also closed as zombies in the subsequent onPartitionsLost.
However we do not remove the closed task from runningByPartition nor do we remove the corresponding changelogs, if restoring, from the StoreChangelogReader since that applies only to active tasks, and AssignedTasks is generic/abstract. The changelog reader then retains a mapping from the closed task's changelog partition to its CompositeRestoreListener (and does not replace this when the new one comes along after the rebalance). The restore listener has a reference to a specific RocksDBStore instance, one which was closed when the task was closed as a zombie, so it accidentally tries to restore to the "old" RocksDBStore instance rather than the new one that was just opened.
Although technically this bug existed before KIP-429, it was only uncovered now that we remove tasks and clear their state/partitions/etc. one at a time. We don't technically need to cherry-pick the fix back to earlier branches, since before this change we just blindly cleared all data structures entirely during an eager rebalance.
Reviewers: Boyang Chen <boyang@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
With KIP-444 the metrics definitions are refactored. Thus, Streams' SimpleBenchmark needs to be updated to correctly access the refactored metrics.
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>
Inside onLeavePrepare we would look into the assignment and try to revoke the owned tasks and notify users via RebalanceListener#onPartitionsRevoked, and then clear the assignment.
However, the subscription's assignment is already cleared in this.subscriptions.unsubscribe(), which means the user's rebalance listener would never be triggered. In other words, from the consumer client's point of view nothing is owned after unsubscribe, but from the user caller's point of view the partitions have not been revoked yet. For callers like Kafka Streams, which rely on the rebalance listener to maintain their internal state, this leads to inconsistent state management and failure cases.
Before KIP-429 this issue was hidden away, since every time the consumer re-joined the group later it would still revoke everything anyway, regardless of the parameters passed in to the rebalance listener; with KIP-429 this is now easier to reproduce.
Our fixes are as follows:
• Inside unsubscribe, first do onLeavePrepare / maybeLeaveGroup and then subscription.unsubscribe. This way we are guaranteed that the Streams tasks are all closed as revoked by then.
• [Optimization] If the generation is reset due to a fatal error from a join / heartbeat response etc., then we know that all partitions are lost, and we should not trigger onPartitionsRevoked, but instead just onPartitionsLost inside onLeavePrepare. This is because we don't want to commit for lost partitions during a rebalance, which is doomed to fail as we don't have any generation info.
Reviewers: Matthias J. Sax <matthias@confluent.io>, A. Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>
The truncateHead method was removed from ProducerStateManager by github.com/apache/kafka/commit/c49775b. This meant that snapshots were no longer removed when the log start offset increased, even though the intent of that change was to remove snapshots but preserve the in-memory mapping. This patch adds the required functionality back.
Reviewers: Jason Gustafson <jason@confluent.io>
Introduces TaskMetrics class
Introduces dropped-records
Replaces skipped-records with dropped-records for the latest built-in metrics version
Does not add standby-process-ratio and active-process-ratio
Does not refactor parent sensors for processor node metrics
Reviewers: Guozhang Wang <wangguoz@gmail.com>, John Roesler <john@confluent.io>, Bill Bejeck <bbejeck@gmail.com>
This PR fixes the inconsistencies in the `removeMembersFromGroup` admin API calls (a usage sketch follows the list below):
1. Fail the `all()` request when there is a sub-level error (either partition or member)
2. Change getMembers() to members()
3. Hide the actual Errors from user
4. Do not expose generated MemberIdentity type
5. Use more consistent naming for Options and Result types
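A hedged usage sketch of the resulting API shape; the group id, instance ids, and bootstrap address below are placeholders, and the error handling reflects the new behavior where all() fails on any member-level error:

```java
import java.util.Arrays;
import java.util.Properties;
import java.util.concurrent.ExecutionException;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.MemberToRemove;
import org.apache.kafka.clients.admin.RemoveMembersFromConsumerGroupOptions;
import org.apache.kafka.clients.admin.RemoveMembersFromConsumerGroupResult;

public class RemoveStaticMembersExample {
    public static void main(String[] args) throws InterruptedException {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            MemberToRemove memberA = new MemberToRemove("instance-a");
            MemberToRemove memberB = new MemberToRemove("instance-b");

            RemoveMembersFromConsumerGroupResult result = admin.removeMembersFromConsumerGroup(
                "my-group",
                new RemoveMembersFromConsumerGroupOptions(Arrays.asList(memberA, memberB)));

            try {
                // all() now fails whenever any member-level removal failed.
                result.all().get();
            } catch (ExecutionException e) {
                // Per-member futures can be inspected to find which removal failed.
                System.err.println("Group-level removal failed: " + e.getCause());
            }
        }
    }
}
```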
Reviewers: Guozhang Wang <wangguoz@gmail.com>, David Jacot <djacot@confluent.io>, Jason Gustafson <jason@confluent.io>
Prior to this patch, partition creation was not allowed for any topic while a reassignment was in progress. This PR makes it a topic-level check: as long as a particular topic is not being reassigned, we allow its partition count to be increased.
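For example, with the topic-level check a partition increase like the one below is expected to succeed as long as the target topic itself is not being reassigned (topic name, partition count, and bootstrap address are placeholders):

```java
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewPartitions;

public class IncreasePartitionsExample {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            // Succeeds even while other topics are being reassigned, as long as
            // "my-topic" itself has no reassignment in progress.
            admin.createPartitions(
                    Collections.singletonMap("my-topic", NewPartitions.increaseTo(6)))
                .all()
                .get();
        }
    }
}
```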
Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jason Gustafson <jason@confluent.io>
In the case of an unclean close, we still need to make sure all the stores are flushed before closing any of them.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Boyang Chen <boyang@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>
Add a paragraph explaining the expected behavior change for producer callers in the record validation failure scenarios that are improved by KIP-467.
Reviewers: Tu V. Tran <tu@confluent.io>, Jason Gustafson <jason@confluent.io>
The purpose of this patch is to make the reassignment algorithm simpler and more resilient to unexpected errors. Specifically, it has the following improvements:
1. Remove `ReassignedPartitionContext`. We no longer need to track the previous reassignment through the context and we now use the assignment state as the single source of truth for the target replicas in a reassignment.
2. Remove the intermediate assignment state when overriding a previous reassignment. Instead, an overriding reassignment directly updates the assignment state and shuts down any unneeded replicas. Reassignments are _always_ persisted in Zookeeper before being updated in the controller context.
3. To fix race conditions with concurrent submissions, reassignment completion for a partition always checks whether there is a zk partition reassignment that needs to be removed. This means the controller no longer needs to track the source of the reassignment.
4. API reassignments explicitly remove reassignment state from zk prior to beginning the new reassignment. This fixes an inconsistency in precedence: upon controller failover, zookeeper reassignments always take precedence over any active reassignment, so if we did not have the logic to remove the zk reassignment when an API reassignment is triggered, we could revert to the older zk reassignment.
Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jun Rao <junrao@gmail.com>
Re-implement NewPartitionReassignment#of. It now takes a list rather than a variable-length argument list (varargs).
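A hedged usage sketch; in the released admin API this shape is exposed as a constructor taking a List<Integer> of target replicas, which is what is shown below (topic name, partition, and broker ids are placeholders):

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Optional;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewPartitionReassignment;
import org.apache.kafka.common.TopicPartition;

public class ReassignPartitionExample {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            // The target replica set is passed as a single List<Integer>,
            // rather than as varargs.
            admin.alterPartitionReassignments(
                    Collections.singletonMap(
                        new TopicPartition("my-topic", 0),
                        Optional.of(new NewPartitionReassignment(Arrays.asList(1, 2, 3)))))
                .all()
                .get();
        }
    }
}
```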
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Vikas Singh <vikas@confluent.io>
This is not guaranteed to actually fix queryOnRebalance, since the failure could never be reproduced locally. I did not bump timeouts because it looks like that has been done in the past for this test without success. Instead this change makes the following improvements:
* It waits for the application to be in a RUNNING state before proceeding with the test.
* It waits for the remaining instance to return to RUNNING state within a timeout after rebalance. I observed once that we were able to do the KV queries but the instance was still in REBALANCING, so this should reduce some opportunity for flakiness.
* The meat of this change: we now iterate over all keys in one shot (vs. one at a time with a timeout) and collect various failures, all of which are reported at the end. This should help us to narrow down the cause of flakiness if it shows up again.
Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
The RPC code generator should support using the ByteBuffer class in addition to byte arrays. By using the ByteBuffer class, we can avoid performing a copy in many situations. Also modify TestByteBufferDataTest to test the new feature.
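The copy-avoidance argument is the usual byte[]-versus-ByteBuffer trade-off; a minimal, generic JDK illustration (not the generator's actual output):

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class ByteBufferVsArrayCopy {
    public static void main(String[] args) {
        ByteBuffer received =
            ByteBuffer.wrap("some-zero-copy-payload".getBytes(StandardCharsets.UTF_8));

        // byte[] field: forces copying the payload out of the receive buffer.
        byte[] copied = new byte[received.remaining()];
        received.duplicate().get(copied);

        // ByteBuffer field: a slice shares the underlying memory, so no copy is made.
        ByteBuffer sliced = received.slice();

        System.out.println(Arrays.equals(
            copied, "some-zero-copy-payload".getBytes(StandardCharsets.UTF_8)));
        System.out.println(sliced.remaining());
    }
}
```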
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Guozhang Wang <wangguoz@gmail.com>
Currently we only record completed sends and receives in the selector metrics. If there is a disconnect in the middle of the respective operation, then it is not counted. The metrics will be more accurate if we take into account partial sends and receives.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
MM2 added a few connector classes to Connect's classpath, and given that, the assertions in the Connect REST system tests need to be adjusted to account for these additions.
This fix makes sure that the loaded Connect plugins are a superset of the connectors expected by the test.
Testing: The change is straightforward. The fix was tested with local system test runs.
Author: Konstantine Karantasis <konstantine@confluent.io>
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>
Closes #7578 from kkonstantine/minor-fix-connect-test-after-mm2-classes
This patch adds a `toString()` implementation to `PartitionReassignment`. It also makes the `ListPartitionReassignmentsResult` constructor use default access, which is the standard for the admin client *Result classes.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com>, Vikas Singh <vikas@confluent.io>, Colin P. McCabe <cmccabe@apache.org>
Do not allow an empty replica set to be passed into the reassignment API.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, José Armando García Sancio <jsancio@gmail.com>
For scenarios where the incoming traffic on all input partitions is small, there's a pitfall that the enforced processing timer is not reset after we have enforce-processed ALL records. The fix itself is pretty simple: we just reset the timer when there are no buffered records.
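An illustrative-only sketch of the timer-reset idea, not the actual StreamTask/PartitionGroup code (names, fields, and structure below are assumptions):

```java
// Track when enforced processing may kick in, and reset the deadline whenever
// the buffer is fully drained so a new idle period must elapse first.
class EnforcedProcessingTimer {
    private final long maxTaskIdleMs;
    private long enforcedDeadlineMs = Long.MAX_VALUE;

    EnforcedProcessingTimer(final long maxTaskIdleMs) {
        this.maxTaskIdleMs = maxTaskIdleMs;
    }

    boolean readyToProcess(final int bufferedRecords,
                           final boolean allPartitionsBuffered,
                           final long nowMs) {
        if (bufferedRecords == 0) {
            // The fix: once everything buffered has been (enforce-)processed,
            // forget the old deadline instead of letting it linger.
            enforcedDeadlineMs = Long.MAX_VALUE;
            return false;
        }
        if (allPartitionsBuffered) {
            return true;                       // normal processing, no enforcement needed
        }
        if (enforcedDeadlineMs == Long.MAX_VALUE) {
            enforcedDeadlineMs = nowMs + maxTaskIdleMs;  // start the idle clock
        }
        return nowMs >= enforcedDeadlineMs;    // enforce processing after the idle period
    }
}
```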
Reviewers: Javier Holguera <javier.holguera@gmail.com>, Boyang Chen <boyang@confluent.io>, Bill Bejeck <bill@confluent.io>
After many runs of reproducing the failure (on my local machine it takes about 100-200 runs to get one) I think it is more likely a flaky test and not exposing a real bug in the rebalance protocol.
What I've observed is that, when the verifying consumer is trying to fetch from the output topics (there are 11 of them), it polls for 1 second each time and retries up to 30 times if there's no more data to fetch, then stops. That means that if no data is fetched from the output topics for 30 * 1 = 30 seconds, the verification stops (potentially too early). And for the failure cases, we observe consistent rebalancing among the closing / newly created clients, since the closing is async, i.e. while new clients are added it is possible that the rebalances triggered by the closing clients have not completed yet (note that each instance is configured with 3 threads, and in the worst case there are 6 instances running / pending shutdown at the same time, so a group of 3 * 6 = 18 members is possible).
However, there's still a possible bug in KIP-429 where the rebalance somehow never stabilizes and members keep re-joining, causing the test to fail. We have another unit test by @ableegoldman that has bumped the expected rebalances up to 3, and if that fails again it may be a better confirmation that such a bug exists.
So what I've done so far for this test:
1. When closing a client, wait for it to complete the closure before moving on to the next iteration and starting a new instance, to reduce rebalance churn.
2. Poll for 5 seconds instead of 1 to wait longer: 5 * 30 = 150 seconds in total, while locally my laptop finishes this test in about 50 seconds.
3. Minor debug logging improvements; in fact some of them reduce redundant debug logging, since it is too long and sometimes hides the key information.
Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>
* Made commit-over-tasks sensor and skipped-records sensor optional since they are removed in the latest version
* Refactored methods for sensor creation
* Adapted unit and integration tests
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Aims to fix the flaky LogCleanerIntegrationTest#testIsThreadFailed by changing how metrics are cleaned.
Reviewers: Jason Gustafson <jason@confluent.io>
Allow routing of `AdminClient#describeTopics` to any broker in the cluster rather than just the controller, so that we don't create a hotspot for this API call. `AdminClient#describeTopics` uses the broker's metadata cache, which is asynchronously maintained, so routing to brokers other than the controller is not expected to make a significant difference in terms of metadata consistency; all metadata requests are eventually consistent.
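Nothing changes for callers of the API; a plain usage example for reference (topic name and bootstrap address are placeholders):

```java
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.TopicDescription;

public class DescribeTopicsExample {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            // With this change the request may be served by any broker's metadata
            // cache instead of always going to the controller.
            TopicDescription description =
                admin.describeTopics(Collections.singleton("my-topic"))
                     .all()
                     .get()
                     .get("my-topic");
            System.out.println(description.partitions().size() + " partitions");
        }
    }
}
```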
This patch also fixes a few flaky test failures.
Reviewers: Ismael Juma <ismael@juma.me.uk>, José Armando García Sancio <jsancio@gmail.com>, Jason Gustafson <jason@confluent.io>