src-kafka

Author	SHA1	Message	Date
Dhruvil Shah	837f31dd18	KAFKA-6927; Chunked down-conversion to prevent out of memory errors on broker [KIP-283] (#4871 ) Implementation for lazy down-conversion in a chunked manner for efficient memory usage during down-conversion. This pull request is mainly to get initial feedback on the direction of the patch. The patch includes all the main components from KIP-283. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Jon Lee	1facab387f	KAFKA-6028: Improve the quota throttle communication (KIP-219) This implements KIP-219, where a broker returns a response with throttle time on quota violation immediately after processing the corresponding request. After the response is sent out, the broker will keep the channel muted until the throttle time is over. Also, on receiving a response with throttle time, client will block outgoing communication to the broker for the specified throttle time. See PR 4830, 5064 and 5094 for all the review history Author: Jon Lee <jonlee@jonlee-ld1.linkedin.biz> Reviewers: Jun Rao <junrao@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>, Dong Lin <lindong28@gmail.com> Closes #5064 from jonlee2/kip-219	7 years ago
Dong Lin	d99f4a0ffa	KAFKA-6617; Improve controller performance by batching reassignment znode write operation KafkaController currently writes reassignment znode once for every partition that has been successfully reassigned. This is unnecessary and controller should be able to update reassignment znode once to remove all partitions that have been reassigned from the reassignment znode. Author: Dong Lin <dolin@linkedin.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #4659 from lindong28/KAFKA-6617	7 years ago
Rajini Sivaram	3a8d3a7927	KAFKA-6916; Refresh metadata in admin client if broker connection fails (#5050 ) Refresh metadata if broker connection fails so that new calls are sent only to nodes that are alive and requests to controller are sent to the new controller if controller changes due to broker failure. Also reassign calls that could not be sent. Reviewers: Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Manikumar Reddy O	8bf20bb586	MINOR: Update consumer javadoc for invalid operations on unassigned partitions (#5005 ) Document cases where `IllegalStateException` is raised when attempting an invalid operation on an unassigned partition. Also change `position()` to raise `IllegalStateException` when called on an unassigned partition for consistency.	7 years ago
John Roesler	c470ff70d3	KAFKA-5697; Implement new consumer poll API from KIP-266 (#4855 ) Add the new stricter-timeout version of `poll` proposed in KIP-266. The pre-existing variant `poll(long timeout)` would block indefinitely for metadata updates if they were needed, then it would issue a fetch and poll for `timeout` ms for new records. The initial indefinite metadata block caused applications to become stuck when the brokers became unavailable. The existence of the timeout parameter made the indefinite block especially unintuitive. This PR adds `poll(Duration timeout)` with the semantics: 1. iff a metadata update is needed: 1. send (asynchronous) metadata requests 2. poll for metadata responses (counts against timeout) - if no response within timeout, return an empty collection immediately 2. if there is fetch data available, return it immediately 3. if there is no fetch request in flight, send fetch requests 4. poll for fetch responses (counts against timeout) - if no response within timeout, return an empty collection (leaving async fetch request for the next poll) - if we get a response, return the response The old method, `poll(long timeout)` is deprecated, but we do not change its semantics, so it remains: 1. iff a metadata update is needed: 1. send (asynchronous) metadata requests 2. poll for metadata responses indefinitely until we get it 2. if there is fetch data available, return it immediately 3. if there is no fetch request in flight, send fetch requests 4. poll for fetch responses (counts against timeout) - if no response within timeout, return an empty collection (leaving async fetch request for the next poll) - if we get a response, return the response One notable usage is prohibited by the new `poll`: previously, you could call `poll(0)` to block for metadata updates, for example to initialize the client, supposedly without fetching records. 
Note, though, that this behavior is not according to any contract, and there is no guarantee that `poll(0)` won't return records the first time it's called. Therefore, it has always been unsafe to ignore the response.	7 years ago
Chia-Ping Tsai	8d1e96181d	MINOR: Replace unused variables by underscore (#5003 ) And remove one unused expression. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Ron Dagostino	8c5d7e0408	KAFKA-6562: OAuth Authentication via SASL/OAUTHBEARER (KIP-255) (#4994 ) This KIP adds the following functionality related to SASL/OAUTHBEARER: 1) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to flexibly retrieve an access token from an OAuth 2 authorization server based on the declaration of a custom login CallbackHandler implementation and have that access token transparently and automatically transmitted to a broker for authentication. 2) Allow brokers to flexibly validate provided access tokens when a client establishes a connection based on the declaration of a custom SASL Server CallbackHandler implementation. 3) Provide implementations of the above retrieval and validation features based on an unsecured JSON Web Token that function out-of-the-box with minimal configuration required (i.e. implementations of the two types of callback handlers mentioned above will be used by default with no need to explicitly declare them). 4) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to transparently retrieve a new access token in the background before the existing access token expires in case the client has to open new connections.	7 years ago
Manikumar Reddy O	d45d7ec781	KAFKA-2951; Add a test to verify produce, consume with ACLs for topic/group wildcard resources (#5054 )	7 years ago
maytals	39fe105dfd	Minor: Fixed ConsumerOffset#path (#5060 ) consumer offset path in zookeeper should be /consumers/${group}/offsets/${topic}/${partition} instead of /consumers/${group}/offset/${topic}/${partition}. Added `s` to the word `offset`. Reviewers: Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy O <manikumar.reddy@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
yaphet	ab3ff7101c	KAFKA-6930: Convert byte array to string in KafkaZkClient debug log (#5061 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jun Rao	345db59650	KAFKA-6937: In-sync replica delayed during fetch if replica throttle is exceeded (#5074 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Dong Lin <lindong28@gmail.com>, Ben Stopford <benstopford@gmail.com>	7 years ago
Ismael Juma	70f0d0bd3f	MINOR: Use reflection for signal handler and do not enable it for IBM JDK (#5047 ) The Signal classes are not available in the compile classpath if --release is used so we use reflection as a workaround. As part of that moved the code to Java and added a simple unit test. Also disabled the signal handler if the IBM JDK is being used due to KAFKA-6918. Manually tested shutdown via ctrl+c and verified that the message is printed.	7 years ago
Sasaki Toru	440445e7c5	KAFKA-2061; Offer a --version flag to print the kafka version [KIP-278] (#639 ) Reviewers: Andy Lindeman, Jeremy Donahue, Jason Gustafson <jason@confluent.io>	7 years ago
Ismael Juma	7132a85fc3	KAFKA-6921; Remove old Scala producer and related code * Removed Scala producers, request classes, kafka.tools.ProducerPerformance, encoders, tests. * Updated ConsoleProducer to remove Scala producer support (removed `BaseProducer` and several options that are not used by the Java producer). * Updated a few Scala consumer tests to use the new producer (including a minor refactor of `produceMessages` methods in `TestUtils`). * Updated `ClientUtils.fetchTopicMetadata` to use `SimpleConsumer` instead of `SyncProducer`. * Removed `TestKafkaAppender` as it looks useless and it defined an `Encoder`. * Minor import clean-ups No new tests added since behaviour should remain the same after these changes. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5045 from ijuma/kafka-6921-remove-old-producer	7 years ago
Rajini Sivaram	ff9f928c16	KAFKA-6911; Fix dynamic keystore/truststore update check (#5029 ) Fix the check, add unit test to verify the change, update `DynamicBrokerReconfigurationTest` to avoid dynamic keystore update in tests which are not expected to update keystores.	7 years ago
Jason Gustafson	e8847205f9	MINOR: Fix transiently failing consumer group admin integration test (#5067 ) Since the producer is using retries=0, we need to await topic creation before sending any records. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Ismael Juma	a30ecc6755	MINOR: Remove o.a.kafka.common.utils.Base64 and IS_JAVA8_COMPATIBLE We no longer need them since we now require Java 8. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Andras Beni <andrasbeni@cloudera.com>, Manikumar Reddy O <manikumar.reddy@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5049 from ijuma/remove-base64	7 years ago
Colin Patrick McCabe	16ad358d64	KAFKA-6868; Fix buffer underflow and expose group state in the consumer groups API (#4980 ) * The consumer groups API should expose group state and coordinator information. This information is needed by administrative tools and scripts that access consumer groups. * The partition assignment will be empty when the group is rebalancing. Fix an issue where the adminclient attempted to deserialize this empty buffer. * Remove nulls from the API and make all collections immutable. * DescribeConsumerGroupsResult#all should return a result as expected, rather than Void * Fix exception text for GroupIdNotFoundException, GroupNotEmptyException. It was being filled in as "The group id The group id does not exist was not found" and similar. Reviewers: Attila Sasvari <asasvari@apache.org>, Andras Beni <andrasbeni@cloudera.com>, Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Rajini Sivaram	c53e274d31	KAFKA-6917; Process txn completion asynchronously to avoid deadlock (#5036 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Dong Lin	0bb48a1669	KAFKA-3473; More Controller Health Metrics (KIP-237) This patch adds a few metrics that are useful for monitoring controller health. See KIP-237 for more detail. Author: Dong Lin <lindong28@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #4392 from lindong28/KAFKA-3473	7 years ago
Manikumar Reddy O	ec7ba32af6	KAFKA-6394; Add a check to prevent misconfiguration of advertised listeners (#4897 ) Do not allow server startup if one of its configured advertised listeners has already been registered by another broker.	7 years ago
fedosov-alexander	6eb7cf1300	KAFKA-5965: Remove Deprecated AdminClient from Streams Resetter Tool (#4968 ) Removed usage of deprecated AdminClient from StreamsResetter. No additional tests are required. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Ismael Juma	c3921d489f	MINOR: Rename RecordFormat to RecordVersion (#4809 ) Also include a few clean-ups: * Method/variable/parameter renames to make them consistent with the class name * Return `ApiVersion` from `minSupportedFor` * Use `values` to remove some code duplication * Reduce duplication in `ApiVersion` by introducing the `shortVersion` method and building the versions map programmatically * Avoid unnecessary `regex` in `ApiVersion.apply` * Added scaladoc to a few methods Some of these were originally discussed in: https://github.com/apache/kafka/pull/4583#pullrequestreview-98089400 Added a test for `ApiVersion.shortVersion`. Relying on existing tests for the rest since there is no change in behaviour. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Rajini Sivaram	7ed7cca4c9	KAFKA-6893; Create processors before starting acceptor in SocketServer (#4999 )	7 years ago
Anna Povzner	9679c44d2b	KAFKA-6361: Fix log divergence between leader and follower after fast leader fail over (#4882 ) Implementation of KIP-279 as described here: https://cwiki.apache.org/confluence/display/KAFKA/KIP-279%3A+Fix+log+divergence+between+leader+and+follower+after+fast+leader+fail+over In summary: - Added leader_epoch to OFFSET_FOR_LEADER_EPOCH_RESPONSE - Leader replies with the pair( largest epoch less than or equal to the requested epoch, the end offset of this epoch) - If Follower does not know about the leader epoch that leader replies with, it truncates to the end offset of largest leader epoch less than leader epoch that leader replied with, and sends another OffsetForLeaderEpoch request. That request contains the largest leader epoch less than leader epoch that leader replied with. Reviewers: Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Rajini Sivaram	0ecb72f59d	KAFKA-6834: Handle compaction with batches bigger than max.message.bytes (#4953 ) Grow buffers in log cleaner to hold one message set after sanity check even if message set is bigger than max.message.bytes. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Colin Patrick McCabe	b27e098a7d	MINOR: Fix trace logging in ReplicaManager (#4916 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jason Gustafson	bce10794a0	KAFKA-6879; Invoke session init callbacks outside lock to avoid Controller deadlock (#4977 ) Fixes a deadlock between the controller's beforeInitializingSession callback which holds the zookeeper client initialization lock while awaiting completion of an asynchronous event which itself depends on the same lock. Also catch and log callback exceptions to ensure the ZooKeeper reconnection takes place. Finally, configure KafkaScheduler in ZooKeeperClient to have at least 1 thread. Added tests that fail or hang without the changes in this PR. Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Anna Povzner	a5318722c7	KAFKA-6857; Leader should reply with undefined offset if undefined leader epoch requested (#4967 ) The leader must explicitly check if requested leader epoch is undefined, and return undefined offset so that the follower can fall back to truncating to high watermark. Otherwise, if the leader also is not tracking leader epochs, it may return its LEO, which will cause the follower to truncate to the incorrect offset.	7 years ago
Rajini Sivaram	6dbd9b59e6	KAFKA-6854; Handle batches deleted during log cleaning of logs with txns (#4962 ) Log cleaner grows buffers when result.messagesRead is zero. This contains the number of filtered messages read from source which can be zero when transactions are used because batches may be discarded. Log cleaner incorrectly assumes that messages were not read because the buffer was too small and attempts to double the buffer size unnecessarily, failing with an exception if the buffer is already max.message.bytes. Additional check for discarded batches has been added to avoid growing buffers when batches are discarded. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Fedor Bobin	de147837dd	KAFKA-6853: ZooKeeperRequestLatencyMs is incorrect (#4961 ) ResponseMetadata.responseTimeMs is always 0 or negative. Reviewers: Rajini Sivaram <rajinisivaram@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Ismael Juma	55cdc934fb	Upgrade ZooKeeper to 3.4.12 and Scala to 2.12.6 (#4940 ) Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Guozhang Wang	a7746dd5b4	HOTFIX: Simplify ConsoleConsumer stripWithPrefix function	7 years ago
Mickael Maison	b82252b1e0	MINOR: Removed unused imports in a few tests (#4938 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Rajini Sivaram	b4d8552218	KAFKA-6526: Enable unclean leader election without controller change (#4920 ) Enable dynamic update of default unclean leader election config of brokers. A new controller event has been added to process unclean leader election when the config is enabled dynamically. Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Rajini Sivaram	9d2efd83a6	KAFKA-6810; Enable dynamic update of SSL truststores (#4904 ) Enable broker's SSL truststores to be dynamically updated using ConfigCommand in the same way as keystores are updated.	7 years ago
Jason Gustafson	f467c9c243	MINOR: Ensure exception messages include partition/segment info when possible (#4907 ) Reviewers: Anna Povzner <anna@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Mickael Maison	902009ea98	KAFKA-3417: Wrap metric reporter calls in try/catch blocks (#3635 ) Prevent exceptions thrown by metric reporters from impacting request processing and other reporters. Co-authored-by: Mickael Maison <mickael.maison@gmail.com> Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Manikumar Reddy O	ff1875fce0	KAFKA-6778; AdminClient.describeConfigs() should return error for non-existent topics (#4866 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Anna Povzner	cbb5b51475	KAFKA-6795; Added unit tests for ReplicaAlterLogDirsThread Added unit tests for ReplicaAlterLogDirsThread. Mostly focused on unit tests for truncating logic. Fixed ReplicaAlterLogDirsThread.buildLeaderEpochRequest() to use future replica's latest epoch (not the latest epoch of replica it is fetching from). This follows the logic that offset for leader epoch request should be based on leader epoch of the follower (in this case it's the future local replica). Also fixed PartitionFetchState constructor that takes offset and delay. The code ignored the delay parameter and used 0 for the delay. This constructor is used only by another constructor which passes delay = 0, which luckily works. Author: Anna Povzner <anna@confluent.io> Reviewers: Dong Lin <lindong28@gmail.com> Closes #4918 from apovzner/kafka-6795	7 years ago
Ismael Juma	c853ef75a1	MINOR: Bump version to 2.0.0-SNAPSHOT (#4804 )	7 years ago
Jason Gustafson	acd669e424	KAFKA-6796; Fix surprising UNKNOWN_TOPIC error from requests to non-replicas (#4883 ) Currently if the client sends a produce request or a fetch request to a broker which isn't a replica, we return UNKNOWN_TOPIC_OR_PARTITION. This is a bit surprising to see when the topic actually exists. It would be better to return NOT_LEADER to avoid confusion. Clients typically handle both errors by refreshing metadata and retrying, so changing this should not cause any change in behavior on the client. This case can be hit following a partition reassignment after the leader is moved and the local replica is deleted. To validate the current behavior and the fix, I've added integration tests for the fetch and produce APIs.	7 years ago
Anna Povzner	3bc2575dfc	MINOR: Disabled flaky DynamicBrokerReconfigurationTest.testAddRemoveSslListener until fixed (#4924 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Patrik Erdes	35c75ea503	MINOR: Fix formatting in --new-consumer deprecation warning (#4903 )	7 years ago
Rajini Sivaram	9e062b3e65	MINOR: Use distinct consumer groups in dynamic listener tests (#4870 )	7 years ago
Rajini Sivaram	98bb75a58f	KAFKA-6772: Load credentials from ZK before accepting connections (#4867 ) Start processing client connections only after completing KafkaServer initialization to ensure that credentials are loaded from ZK into cache before authentications are processed. Acceptors are started earlier so that bound port is known for registering in ZK. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	7 years ago
gitlw	341db990dc	KAFKA-6650: Allowing transition to OfflineReplica state for replicas without leadership info (#4825 ) A partially deleted topic can end up with some partitions having no leadership info. For the partially deleted topic, a new controller should be able to finish the topic deletion by transitioning the rogue partition's replicas to OfflineReplica state. This patch adds logic to transition replicas to OfflineReplica state whose partitions have no leadership info. Added a new test method to cover the partially deleted topic case. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Chia-Ping Tsai	4013767d86	MINOR: Log the exception thrown by Selector.poll (#4873 )	7 years ago
Allen Wang	19418fc86a	KAFKA-6514; Add API version as a tag for the RequestsPerSec metric (#4506 ) Updated `RequestChannel` to include `version` as a tag for all RequestsPerSec metrics (KIP-272). Updated tests to verify that the extra tag exists.	7 years ago
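
The KAFKA-6361 and KAFKA-6857 entries in this log describe how a leader answers an OffsetForLeaderEpoch request: it replies with the largest epoch less than or equal to the requested epoch together with that epoch's end offset, and it replies with an undefined offset when the requested epoch is undefined. The following is a minimal sketch of that lookup, not Kafka's actual implementation. The function name `end_offset_for`, the `(epoch, start_offset)` list layout, and the use of -1 as the undefined marker are assumptions made for illustration.

```python
from bisect import bisect_right

# Assumed sentinel values for "undefined"; chosen for this sketch only.
UNDEFINED_EPOCH = -1
UNDEFINED_OFFSET = -1


def end_offset_for(epochs, log_end_offset, requested_epoch):
    """Return (epoch, end_offset) for the largest known epoch <= requested_epoch.

    epochs: list of (epoch, start_offset) pairs sorted by epoch (hypothetical layout).
    log_end_offset: the leader's current log end offset (LEO).
    """
    # An undefined request, or a leader that tracks no epochs, gets an
    # undefined reply so the follower can fall back to the high watermark.
    if requested_epoch == UNDEFINED_EPOCH or not epochs:
        return (UNDEFINED_EPOCH, UNDEFINED_OFFSET)

    known = [epoch for epoch, _ in epochs]
    i = bisect_right(known, requested_epoch) - 1
    if i < 0:
        # Every known epoch is larger than the requested one.
        return (UNDEFINED_EPOCH, UNDEFINED_OFFSET)

    # An epoch ends where the next one starts; the latest epoch ends at the LEO.
    end = epochs[i + 1][1] if i + 1 < len(epochs) else log_end_offset
    return (known[i], end)


if __name__ == "__main__":
    leader_epochs = [(0, 0), (2, 10), (5, 25)]
    print(end_offset_for(leader_epochs, 40, 3))
    print(end_offset_for(leader_epochs, 40, UNDEFINED_EPOCH))
```

In this sketch a follower asking about epoch 3 is answered with epoch 2, which it may not know about either. Per the KAFKA-6361 summary, it would then truncate and send another request with a smaller epoch.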


2315 Commits (bf0675ac5f31d67a1c75baf211475b8fc98f1ef1)