src-kafka

Author	SHA1	Message	Date
Rajini Sivaram	9d2efd83a6	KAFKA-6810; Enable dynamic update of SSL truststores (#4904 ) Enable broker's SSL truststores to be dynamically updated using ConfigCommand in the same way as keystores are updated.	7 years ago
Jason Gustafson	f467c9c243	MINOR: Ensure exception messages include partition/segment info when possible (#4907 ) Reviewers: Anna Povzner <anna@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Mickael Maison	902009ea98	KAFKA-3417: Wrap metric reporter calls in try/catch blocks (#3635 ) Prevent exceptions thrown by metric reporters from impacting request processing and other reporters. Co-authored-by: Mickael Maison <mickael.maison@gmail.com> Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Manikumar Reddy O	ff1875fce0	KAFKA-6778; AdminClient.describeConfigs() should return error for non-existent topics (#4866 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Anna Povzner	cbb5b51475	KAFKA-6795; Added unit tests for ReplicaAlterLogDirsThread Added unit tests for ReplicaAlterLogDirsThread. Mostly focused on unit tests for truncating logic. Fixed ReplicaAlterLogDirsThread.buildLeaderEpochRequest() to use future replica's latest epoch (not the latest epoch of replica it is fetching from). This follows the logic that offset for leader epoch request should be based on leader epoch of the follower (in this case it's the future local replica). Also fixed PartitionFetchState constructor that takes offset and delay. The code ignored the delay parameter and used 0 for the delay. This constructor is used only by another constructor which passes delay = 0, which luckily works. Author: Anna Povzner <anna@confluent.io> Reviewers: Dong Lin <lindong28@gmail.com> Closes #4918 from apovzner/kafka-6795	7 years ago
Ismael Juma	c853ef75a1	MINOR: Bump version to 2.0.0-SNAPSHOT (#4804 )	7 years ago
Jason Gustafson	acd669e424	KAFKA-6796; Fix surprising UNKNOWN_TOPIC error from requests to non-replicas (#4883 ) Currently if the client sends a produce request or a fetch request to a broker which isn't a replica, we return UNKNOWN_TOPIC_OR_PARTITION. This is a bit surprising to see when the topic actually exists. It would be better to return NOT_LEADER to avoid confusion. Clients typically handle both errors by refreshing metadata and retrying, so changing this should not cause any change in behavior on the client. This case can be hit following a partition reassignment after the leader is moved and the local replica is deleted. To validate the current behavior and the fix, I've added integration tests for the fetch and produce APIs.	7 years ago
Anna Povzner	3bc2575dfc	MINOR: Disabled flaky DynamicBrokerReconfigurationTest.testAddRemoveSslListener until fixed (#4924 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Patrik Erdes	35c75ea503	MINOR: Fix formatting in --new-consumer deprecation warning (#4903 )	7 years ago
Rajini Sivaram	9e062b3e65	MINOR: Use distinct consumer groups in dynamic listener tests (#4870 )	7 years ago
Rajini Sivaram	98bb75a58f	KAFKA-6772: Load credentials from ZK before accepting connections (#4867 ) Start processing client connections only after completing KafkaServer initialization to ensure that credentials are loaded from ZK into cache before authentications are processed. Acceptors are started earlier so that bound port is known for registering in ZK. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
gitlw	341db990dc	KAFKA-6650: Allowing transition to OfflineReplica state for replicas without leadership info (#4825 ) A partially deleted topic can end up with some partitions having no leadership info. For the partially deleted topic, a new controller should be able to finish the topic deletion by transitioning the rogue partition's replicas to OfflineReplica state. This patch adds logic to transition replicas to OfflineReplica state whose partitions have no leadership info. Added a new test method to cover the partially deleted topic case. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Chia-Ping Tsai	4013767d86	MINOR: Log the exception thrown by Selector.poll (#4873 )	7 years ago
Allen Wang	19418fc86a	KAFKA-6514; Add API version as a tag for the RequestsPerSec metric (#4506 ) Updated `RequestChannel` to include `version` as a tag for all RequestsPerSec metrics (KIP-272). Updated tests to verify that the extra tag exists.	7 years ago
Guozhang Wang	9871357086	KAFKA-6592: Follow-up (#4864 ) Do not require ConsoleConsumer to specify inner serde as a special property, but just a normal property of the message formatter.	7 years ago
Sönke Liebau	886daf5fca	KAFKA-6234; Increased timeout value for lowWatermark response to fix transient failures (#4238 ) Removed timeout from get call that caused the test to fail occasionally, this will instead fall back to the wrapping waitUntilTrue timeout. Also added unnesting of exceptions from ExecutionException that was originally missing and put the retrieved value for lowWatermark in the fail message for better readability in case of test failure. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
huxi	4e35a2bfb7	KAFKA-6592: ConsoleConsumer should support WindowedSerdes (#4797 ) Have Console consumer support TimeWindowedDeserializer/SessionWindowedDeserializer. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Jason Gustafson	7421f9dce2	KAFKA-6773; Allow offset commit/fetch/describe/delete with empty groupId (#4851 ) We had a regression in #4788 which caused the offset commit/fetch/describe APIs to fail if the groupId was empty. This should be allowed for backwards compatibility. Additionally, I have modified DeleteGroups to allow removal of the empty group, which was missed in the initial implementation. I've added a test case to ensure that we do not miss this again in the future. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Colin Patrick McCabe	e4d652befe	MINOR: Fix AsyncProducerTest bug that hits when logging is turned up (#4450 ) AsyncProducerTest gets an error about an incorrect mock when the logging level is turned up. Instead of using a mock, just create a real SyncProducerConfig object, since the object is simple to create. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jimin Hsieh	83a9e04c19	MINOR: Fix doc - `FileMessageSet` was replaced by `FileRecords` (#4852 ) Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Manikumar Reddy O	47918f2d79	KAFKA-6447: Add Delegation Token Operations to KafkaAdminClient (KIP-249) (#4427 ) Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	e29fa9a4ca	KAFKA-6752: Enable unclean leader election metric (#4838 ) Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Rajini Sivaram	79c6f7cd9a	MINOR: Move creation of quota callback to ensure single instance (#4848 ) Move creation of quota callback instance out of KafkaConfig constructor to QuotaFactory.instantiate to avoid creating a callback instance for every KafkaConfig since we create temporary KafkaConfigs during dynamic config updates. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Ismael Juma	fedac0cea7	MINOR: Mention leader in a few follower/controller log messages (#4835 )	7 years ago
Rajini Sivaram	77ebd32016	KAFKA-6576: Configurable Quota Management (KIP-257) (#4699 ) Enable quota calculation to be customized using a configurable callback. See KIP-257 for details. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	ed2f10e050	MINOR: Update max.connections.per.ip.overrides config docs (#4819 ) Add a validation check to make sure max.connections.per.ip.overrides is configured when max.connections.per.ip is set to zero. Also clean up the config description.	7 years ago
Rajini Sivaram	9f8c3167eb	KAFKA-4292: Configurable SASL callback handlers (KIP-86) (#2022 ) Implementation of KIP-86. Client, server and login callback handlers have been made configurable for both brokers and clients. Reviewers: Jun Rao <junrao@gmail.com>, Ron Dagostino <rndgstn@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Viktor Somogyi	1dc30272e1	KAFKA-5674; Reduce max.connections.per.ip minimum to 0 (#3610 ) By allowing `max.connections.per.ip` to be 0, Kafka can support IP-based filtering using `max.connections.per.ip.overrides`.	7 years ago
Jason Gustafson	8662a022c4	MINOR: Fix partition loading checks in GroupCoordinator (#4788 ) In the group coordinator, we currently check whether the partition is owned before checking whether it is loading. Since loading is a prerequisite for partition ownership, it means that it is not actually possible to see the COORDINATOR_LOAD_IN_PROGRESS error. The impact is mostly harmless: while loading the group, the client may send unnecessary FindCoordinator requests to rediscover the coordinator. I've fixed the bug and restructured the code to enable testing. In the process of fixing this bug, the following improvements have been made: 1. We now verify valid groupId in all request handlers. 2. Currently if the coordinator is loading when a SyncGroup is received, we'll return NOT_COORDINATOR. I've changed this to return REBALANCE_IN_PROGRESS since the rebalance state will have been lost on coordinator failover. This effectively forces the consumer to rejoin the group, which seems preferable over unnecessarily rediscovering the coordinator. 3. I added a check for the COORDINATOR_LOAD_IN_PROGRESS handler in SyncGroup. Although we do not currently return this error, it seems reasonable that we might want to some day, so it seems better to get the check in now. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Nick Travers	4106cb1db1	KAFKA-4914: Partition reassignment tool should check types before persisting state in ZooKeeper (#2708 ) Prior to this, there have been instances where invalid data was allowed to be persisted in ZooKeeper, which causes ClassCastExceptions when a broker is restarted and reads this type-unsafe data. Adds basic structural and type validation for the reassignment JSON via introduction of Scala case classes that map to the expected JSON structure. Also use the Scala case classes to deserialize the JSON to avoid duplication. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Viktor Somogyi <viktor.somogyi@cloudera.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
gitlw	2ef6ee2338	KAFKA-6630: Speed up the processing of TopicDeletionStopReplicaResponseReceived events on the controller (#4668 ) Reviewed by Jun Rao <junrao@gmail.com>	7 years ago
Alex D	dd7011783f	KAFKA-6724; ConsumerPerformance should not always reset to earliest offsets (#4787 ) Remove the explicit `seekToBeginning` on startup and instead rely on the consumer's auto offset reset strategy to set the initial position.	7 years ago
Ismael Juma	281dbfd981	MINOR: LogCleaner.validateReconfiguration fixes (#4770 ) Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Manikumar Reddy O	395c7e0f09	MINOR: Fix ReassignPartitionsClusterTest.testHwAfterPartitionReassignment test (#4781 ) Reviewers: Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
huxi	9eb32eaad5	KAFKA-6446; KafkaProducer initTransactions() should timeout after max.block.ms (#4563 ) Currently the `initTransactions()` API blocks indefinitely if the broker cannot be reached. This patch changes the behavior to raise a `TimeoutException` after waiting for `max.block.ms`. Reviewers: Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io>	7 years ago
Vahid Hashemian	3f611044cc	KAFKA-6052; Fix WriteTxnMarkers request retry issue in InterBrokerSendThread (#4705 ) This resolves the issue found when running the brokers on Windows which prevents the coordinator from sending WriteTxnMarkers requests to complete a transaction.	7 years ago
Rajini Sivaram	f66aebff36	KAFKA-6710: Remove Thread.sleep from LogManager.deleteLogs (#4771 ) `Thread.sleep` in `LogManager.deleteLogs` potentially blocks a scheduler thread for up to `log.segment.delete.delay.ms` with a default value of a minute. To avoid this, `deleteLogs` now deletes the logs for which `currentDefaultConfig.fileDeleteDelayMs` has elapsed after the delete was scheduled. Logs for which this interval has not yet elapsed are considered for deletion in the next iteration of `deleteLogs`, which is scheduled sooner if required. Reviewers: Jun Rao <junrao@gmail.com>, Dong Lin <lindong28@gmail.com>, Ted Yu <yuzhihong@gmail.com>	7 years ago
huxi	dd78b9fa26	KAFKA-6637; Avoid divide by zero error with segment.ms set to zero (#4698 ) Require a minimum value of 1 for `segment.ms` to avoid division by zero when computing random jitter. Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Lucas Wang	685fd03dda	KAFKA-6612: Added logic to prevent increasing partition counts during topic deletion This patch adds logic in handling the PartitionModifications event, so that if the partition count is increased when a topic deletion is still in progress, the controller will restore the data of the path /brokers/topics/"topic" to remove the added partitions. Testing done: Added a new test method to cover the bug Author: Lucas Wang <luwang@linkedin.com> Reviewers: Jiangjie (Becket) Qin <becket.qin@gmail.com> Closes #4666 from gitlw/prevent_increasing_partition_count_during_topic_deletion	7 years ago
Rajini Sivaram	2307314432	MINOR: Fix encoder config to make DynamicBrokerReconfigurationTest stable (#4764 ) DynamicBrokerReconfigurationTest currently assumes that passwords encoded with one secret will fail with an exception if decoded with another secret and configures an old.secret in setUp. This could potentially cause test failures if a password was incorrectly decoded with the wrong secret, since the test writes passwords encoded with the new secret directly to ZooKeeper. Since old.secret is only used in one test for verifying secret rotation, this config can be moved to that test to avoid transient failures. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jason Gustafson	fcf8781602	KAFKA-6683; Ensure producer state not mutated prior to append (#4755 ) We were unintentionally mutating the cached queue of batches prior to appending to the log. This could have several bad consequences if the append ultimately failed or was truncated. In the reporter's case, it caused the snapshot to be invalid after a segment roll. The snapshot contained producer state at offsets higher than the snapshot offset. If we ever had to load from that snapshot, the state was left inconsistent, which led to an error that ultimately crashed the replica fetcher. The fix required some refactoring to avoid sharing the same underlying queue inside ProducerAppendInfo. I have added test cases which reproduce the invalid snapshot state. I have also made an effort to clean up logging since it was not easy to track this problem down. One final note: I have removed the duplicate check inside ProducerStateManager since it was both redundant and incorrect. The redundancy was in the checking of the cached batches: we already check these in Log.analyzeAndValidateProducerState. The incorrectness was the handling of sequence number overflow: we were only handling one very specific case of overflow, but others would have resulted in an invalid assertion. Instead, we now throw OutOfOrderSequenceException. Reviewers: Apurva Mehta <apurva@confluent.io>, Jun Rao <junrao@gmail.com>	7 years ago
Guozhang Wang	f2fbfaaccc	KAFKA-6611: PART I, Use JMXTool in SimpleBenchmark (#4650 ) 1. Use JmxMixin for SimpleBenchmark (will remove the self reporting in #4744), only when loading phase is false (i.e. we are in fact starting the streams app). 2. Reported the full jmx reported metrics in log files, and in the returned data only return the max values: this is because we want to skip the warming up and cooling down periods that will have lower rate numbers, while max represents the actual rate at full speed. 3. Incorporates two other improvements to JMXTool: #1241 and #2950 Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Rohan Desai <desai.p.rohan@gmail.com>	7 years ago
Rajini Sivaram	57b1c28d60	MINOR: Fix AdminClient.describeConfigs() of listener configs (#4747 ) Don't return config values from `describeConfigs` if the config type cannot be determined. Obtain config types correctly for listener configs for `describeConfigs` and password encryption. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Matthias J. Sax	f0a29a6935	MINOR: remove obsolete warning in StreamsResetter (#4749 ) Reviewer: Guozhang Wang <guozhang@confluent.io>	7 years ago
Rajini Sivaram	2f90cb86c1	MINOR: Remove acceptor creation in network thread update code (#4742 ) Fix dynamic addition of network threads to only create new Processor threads and not the Acceptor.	7 years ago
Colin Patrick McCabe	f5287ccad2	MINOR: Fix flaky TestUtils functions (#4743 ) TestUtils#produceMessages should always close the KafkaProducer, even when there is an exception. Otherwise, the test will leak threads when there is an error. TestUtils#createNewProducer should create a producer with a requestTimeoutMs of 30 seconds by default, not around 10 seconds. This should avoid tests that flake when the load on Jenkins climbs. Fix two cases where a very short timeout of 2 seconds was getting set. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Manikumar Reddy O	aefe35e493	KAFKA-6680: Fix issues related to Dynamic Broker configs (#4731 ) - Fix kafkaConfig initialization if there are no dynamic configs defined in ZK. - Update DynamicListenerConfig.validateReconfiguration() to check that new listeners are a subset of the listener map Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Ismael Juma	6fab286da2	MINOR: Fix some compiler warnings (#4726 )	7 years ago
Jason Gustafson	7041e76bd6	MINOR: Some logging improvements for debugging delayed produce status (#4691 ) A few small logging improvements which help debugging replication issues.	7 years ago
Dong Lin	4391a4214d	MINOR: Use log start offset as high watermark if current value is out of range (#4722 ) Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago

2379 Commits (9951f8fee145ce10b5dccde665e160a5f4ff6d03)