src-kafka

Commit Graph

Author	SHA1	Message	Date
Tom Bentley	269b65279c	KAFKA-5692: Change PreferredReplicaLeaderElectionCommand to use Admin… (#3848 ) See also KIP-183. This implements the following algorithm: AdminClient sends ElectPreferredLeadersRequest. KafakApis receives ElectPreferredLeadersRequest and delegates to ReplicaManager.electPreferredLeaders() ReplicaManager delegates to KafkaController.electPreferredLeaders() KafkaController adds a PreferredReplicaLeaderElection to the EventManager, ReplicaManager.electPreferredLeaders()'s callback uses the delayedElectPreferredReplicasPurgatory to wait for the results of the election to appear in the metadata cache. If there are no results because of errors, or because the preferred leaders are already leading the partitions then a response is returned immediately. In the EventManager work thread the preferred leader is elected as follows: The EventManager runs PreferredReplicaLeaderElection.process() process() calls KafkaController.onPreferredReplicaElectionWithResults() KafkaController.onPreferredReplicaElectionWithResults() calls the PartitionStateMachine.handleStateChangesWithResults() to perform the election (asynchronously the PSM will send LeaderAndIsrRequest to the new and old leaders and UpdateMetadataRequest to all brokers) then invokes the callback. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Jun Rao <junrao@gmail.com>	6 years ago
mingaliu	0f926f0c1e	KAFKA-7693; Fix SequenceNumber overflow in producer (#5989 ) The problem is that the sequence number is an Int and should wrap around when it reaches the Int.MaxValue. The bug here is it doesn't wrap around and become negative and raises an error. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
mingaliu	e4b54a5d97	KAFKA-7692; Fix ProducerStateManager SequenceNumber overflow (#5990 ) This patch fixes a few overflow issues with wrapping sequence numbers in the broker's producer state tracking. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	523465b3c1	MINOR: Cleanup handling of mixed transactional/idempotent records (#6172 ) Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>, Colin Patrick McCabe <colin@cmccabe.xyz>	6 years ago
Stanislav Kozlovski	fb0db7602a	KAFKA-7844: Use regular subproject for generator to fix *All targets (#6182 ) The presence of the buildSrc subproject is causing problems when we try to run installAll, jarAll, and the other "all" targets. It's easier just to make the generator code a regular subproject and use the JavaExec gradle task to run the code. This also makes it more straightforward to run the generator unit tests. Reviewers: David Arthur <mumrah@gmail.com>, Ismael Juma <ismael@juma.me.uk> Co-authored-by: Colin P. Mccabe <cmccabe@confluent.io> Co-authored-by: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	6 years ago
Lee Dongjin	07d2cf2fdb	Fix Documentation for cleanup.policy is out of date (#6181 )	6 years ago
ryannatesmith	e75e4732c9	MINOR: Rejoin split ssl principal mapping rules (#6099 ) * Join ssl principal mapping rules correctly before evaluating. Java properties splits the configuration array on commas, and that leads to rules containing commas being split before being evaluated. This commit adds a code change to re-join those strings into full rules before evaluating them. The function assumes every rule is either DEFAULT or begins with the prefix RULE:	6 years ago
Lee Dongjin	e87e3f2cb2	MINOR: Remove unused imports, exceptions, and values (#6117 ) 1. Remove unthrown exceptions from MemoryRecordsBuilderTest 2. Remove unused imports from ReplicaFetcherThread, ZooKeeperClient, ApiVersionTest, PartitionTest 3. Remove unused value from PartitionTest	6 years ago
Lars Francke	6cae2577ba	Fix Javadoc of KafkaConsumer (#6155 ) The Javadoc is using Properties.put which should never be used because it allows putting non-strings into a Properties object which is designed to only handle strings. Two other minor fixes so the examples actually work	6 years ago
Dong Lin	6a7eebe891	KAFKA-7829; Javadoc should show that AdminClient.alterReplicaLogDirs() is supported in Kafka 1.1.0 or later (#6157 ) Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
David Arthur	2c44e77e2f	KAFKA-7738; Track leader epochs in client Metadata (#6045 ) Track the last seen partition epoch in the Metadata class. When handling metadata updates, check that the partition info being received is for the last seen epoch or a newer one. This prevents stale metadata from being loaded into the client. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	9a9310d074	KAFKA-7824; Require member.id for initial join group request [KIP-394] (#6058 ) This patch implements KIP-394 as documented in https://cwiki.apache.org/confluence/display/KAFKA/KIP-394%3A+Require+member.id+for+initial+join+group+request. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Chia-Ping Tsai	af634a4a98	KAFKA-7391; Introduce close(Duration) to Producer and AdminClient instead of close(long, TimeUnit) (#5667 ) See KIP-367: https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=89070496. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Tom Bentley	d8f126d70a	Fix KAFKA-7789 by increasing the key size for the RSA keys generated for (#6096 ) Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Colin Patrick McCabe	71e85f5e84	KAFKA-7609; Add Protocol Generator for Kafka (#5893 ) This patch adds a framework to automatically generate the request/response classes for Kafka's protocol. The code will be updated to use the generated classes in follow-up patches. Below is a brief summary of the included components: buildSrc/src The message generator code is here. This code is automatically re-run by gradle when one of the schema files changes. The entire directory is processed at once to minimize the number of times we have to start a new JVM. We use Jackson to translate the JSON files into Java objects. clients/src/main/java/org/apache/kafka/common/protocol/Message.java This is the interface implemented by all automatically generated messages. clients/src/main/java/org/apache/kafka/common/protocol/MessageUtil.java Some utility functions used by the generated message code. clients/src/main/java/org/apache/kafka/common/protocol/Readable.java, Writable.java, ByteBufferAccessor.java The generated message code uses these classes for writing to a buffer. clients/src/main/message/README.md This README file explains how the JSON schemas work. *clients/src/main/message/\.json The JSON files in this directory implement every supported version of every Kafka API. The unit tests automatically validate that the generated schemas match the hand-written schemas in our code. Additionally, there are some things like request and response headers that have schemas here. clients/src/main/java/org/apache/kafka/common/utils/ImplicitLinkedHashSet.java** I added an optimization here for empty sets. This is useful here because I want all messages to start with empty sets by default prior to being loaded with data. This is similar to the "empty list" optimizations in the `java.util.ArrayList` class. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Ismael Juma <ismael@juma.me.uk>, Bob Barrett <bob.barrett@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Lee Dongjin	7df3e8cd38	KAFKA-7808: AdminClient#describeTopics should not throw InvalidTopic if topic name is not found (#6124 ) * Update KafkaAdminClient#describeTopics to throw UnknownTopicOrPartitionException. * Remove unused method: WorkerUtils#getMatchingTopicPartitions. * Add some JavaDoc. Reviewed-by: Colin P. McCabe <cmccabe@apache.org>, Ryanne Dolan <ryannedolan@gmail.com>	6 years ago
Anna Povzner	b2b79c4f0e	KAFKA-7786; Ignore OffsetsForLeaderEpoch response if epoch changed while request in flight (#6101 ) There is a race condition in ReplicaFetcherThread, where we can update PartitionFetchState with the new leader epoch (same leader) before handling the OffsetsForLeaderEpoch response with FENCED_LEADER_EPOCH error which causes removing partition from partitionStates, which in turn causes no fetching until the next LeaderAndIsr. This patch adds logic to ensure that the leader epoch doesn't change while an OffsetsForLeaderEpoch request is in flight (which could happen with back-to-back leader elections). If it has changed, we ignore the response. Also added toString() implementation to PartitionData, because some log messages did not show useful info which I found while investigating the above system test failure. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Stanislav Kozlovski	66a9416e38	MINOR: Log successful/failed authentications with socket information (#5856 ) Use `info` for failed authentications and `debug` for successful ones. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Bob Barrett	e325676994	KAFKA-6833; Producer should await metadata for unknown partitions (#6073 ) This patch changes the behavior of KafkaProducer.waitOnMetadata to wait up to max.block.ms when the partition specified in the produce request is out of the range of partitions present in the metadata. This improves the user experience in the case when partitions are added to a topic and a client attempts to produce to one of the new partitions before the metadata has propagated to the brokers. Tested with unit tests. Reviewers: Arjun Satish <arjun@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
hackerwin7	e4f233ed33	KAFKA-7755; Look up client host name since DNS entry may have changed (#6049 ) Lookup client host name after every full iteration through the addresses returned. Reviewers: Loïc Monney <loicmonney@github.com>, Edoardo Comar <ecomar@uk.ibm.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Jason Gustafson	fcdde2c604	MINOR: Use functional patterns in PartitionStates (#6089 ) * MINOR: Use functional patterns in PartitionStates * Can't use fluent consumers yet in scala 2.11	6 years ago
Guozhang Wang	b16afbb77b	KAFKA-6928: Refactor StreamsPartitionAssignor retry logic (#6085 ) 1. The retry loop of the InternalTopicManager would just be: a) describe topics, and exclude those which already exist with the right num.partitions, b) for the remaining topics, try to create them. Remove any inner loops. 2. In CreateTopicResponse and MetadataResponse (for describe topic), handle the special error code of TopicExist and UnknownTopicOrPartition in order to retry in the next loop. 3. Do not handle TimeoutException since it should already been handled inside AdminClient. Add corresponding unit tests for a) topic marked for deletion but not complete yet, in which case metadata response would not contain this topic, but create topic would return error TopicExists; b) request keep getting timed out. Reviewers: Matthias J. Sax <matthias@confluent.io>	6 years ago
lambdaliu	6ea5474e4c	KAFKA-7734: Metrics tags should use LinkedHashMap to guarantee ordering (#6032 ) This pull request replaces HashMap with LinkedHashMap to guarantee ordering of metrics tags. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <guozhang@confluent.io>, John Roesler <vvcephei@users.noreply.github.com>	6 years ago
layfe	d086e83fec	KAFKA-5503; Idempotent producer ignores shutdown while fetching ProducerId (#5881 ) Check `running` in `Sender.maybeWaitForProducerId` to ensure that the producer can be closed while awaiting initialization of the producerId. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Flavien Raynaud	9295444d48	MINOR: Improve exception messages in FileChannelRecordBatch (#6068 ) Replace `channel` by `fileRecords` in potentially thrown KafkaException descriptions when loading/writing `FileChannelRecordBatch`. This makes exception messages more readable (channel only shows an object hashcode, fileRecords shows the path of the file being read and start/end positions in the file). Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Viktor Somogyi	684184973e	MINOR: Hygiene fixes in KafkaFutureImpl (#5098 ) Change-Id: Ia44c6c659418bbed5367645b814725365daba820	6 years ago
Matthias Wessendorf	d413117769	KAFKA-7762; Update KafkaConsumer Javadoc examples to use poll(Duration timeout) API Author: Matthias Wessendorf <mwessend@redhat.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6052 from matzew/use_new_poll_api	6 years ago
Satish Duggana	b23bf41e84	KAFKA-7742; Fixed removing hmac entry for a token being removed from DelegationTokenCache Author: Satish Duggana <satishd@apache.org> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6037 from satishd/KAFKA-7742	6 years ago
Jason Gustafson	40392266aa	MINOR: Include additional detail in fetch error message (#6036 ) This patch adds additional information in the log message after a fetch failure to make debugging easier. Reviewers: David Arthur <mumrah@gmail.com>	6 years ago
David Arthur	152292994e	KAFKA-2334; Guard against non-monotonic offsets in the client (#5991 ) After a recent leader election, the leaders high-water mark might lag behind the offset at the beginning of the new epoch (as well as the previous leader's HW). This can lead to offsets going backwards from a client perspective, which is confusing and leads to strange behavior in some clients. This change causes Partition#fetchOffsetForTimestamp to throw an exception to indicate the offsets are not yet available from the leader. For new clients, a new OFFSET_NOT_AVAILABLE error is added. For existing clients, a LEADER_NOT_AVAILABLE is thrown. This is an implementation of [KIP-207](https://cwiki.apache.org/confluence/display/KAFKA/KIP-207%3A+Offsets+returned+by+ListOffsetsResponse+should+be+monotonically+increasing+even+during+a+partition+leader+change). Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Dhruvil Shah <dhruvil@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	46e8081f9c	KAFKA-7712; Remove channel from Selector before propagating exception (#6023 ) Ensure that channel and selection keys are removed from `Selector` collections before propagating connect exceptions. They are currently cleared on the next `poll()`, but we can't ensure that callers (NetworkClient for example) wont try to connect again before the next `poll` and hence we should clear the collections before re-throwing exceptions from `connect()`. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
hackerwin7	975b680bcd	KAFKA-7705; Fix and simplify producer config in javadoc example (#6000 ) The example in the producer's javadoc contained an inconsistent value for `delivery.timeout.ms`. This patch removes the inconsistent config and several unnecessary overrides in order to simplify the example. Reviewers: huxi <huxi_2b@hotmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
linyli001	a94c8da508	KAFKA-7443: OffsetOutOfRangeException in restoring state store from changelog topic when start offset of local checkpoint is smaller than that of changelog topic (#5946 ) Reviewer: Matthias J. Sax <matthias@confluent.io>, John Roesler <john@confluent.io>	6 years ago
Lee Dongjin	7a3dffb0ca	KAFKA-7549; Old ProduceRequest with zstd compression does not return error to client (#5925 ) Older versions of the Produce API should return an error if zstd is used. This validation existed, but it was done during request parsing, which means that instead of returning an error code, the broker disconnected. This patch fixes the issue by moving the validation outside of the parsing logic. It also fixes several other record validations which had the same problem. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Gardner Vickers	ac35ef6242	MINOR: Specify character encoding in NetworkTestUtils (#5965 ) This attempts to address the flaky test `SaslAuthenticatorTest.testCannotReauthenticateWithDifferentPrincipal()` I was not able to reproduce locally even after 150 test runs in a loop, but given the error message: ``` org.junit.ComparisonFailure: expected: <[6QBJiMZ6o5AqbNAjDTDjWtQSa4alfuUWsYKIy2tt7dz5heDaWZlz21yr8Gl4uEJkQABQXeEL0UebdpufDb5k8SvReSK6wYwQ9huP-9]> but was:<[????ï¿½ï¿½ï¿½ï¿½????OAUTHBEARER]> ``` `????ï¿½ï¿½ï¿½ï¿½????` seems to mean invalid UTF-8. We now specify the charset when writing out and reading in bytes. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Mark Cho	c050503464	KAFKA-7709: Fix ConcurrentModificationException when retrieving expired inflight batches on multiple partitions. (#6005 ) Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Zhanxiang (Patrick) Huang	2155c6d54b	KAFKA-7235: Detect outdated control requests and bounced brokers using broker generation (#5821 ) * KAFKA-7235: Detect outdated control requests and bounced brokers using broker generation * Add broker_epoch in controlled shutdown request * Move broker epoch check into controller for ControlledShutdownRequest * Refactor schema definition for controler requests/responses * Address comments * Address comments * Address comments * Send back STALE_BROKER_EPOCH error in ControlledShutdown response * Fix build issue * Address comments * Address comments * Address comments * Address comments * Fix tests after rebase * Address comments * Address comments	6 years ago
Viktor Somogyi	c4822648ef	MINOR: hygene cleanup in TransactionManagerTest (#5951 ) Reviewers: Andras Katona <41361962+akatona84@users.noreply.github.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
John Roesler	b7d95da88d	KAFKA-7660: Fix child sensor memory leak (#5974 ) A heap dump provided by Patrik Kleindl in https://issues.apache.org/jira/browse/KAFKA-7660 identifies the childrenSensors map in Metrics as keeping references to sensors alive after they have been removed. This PR fixes it and adds a test to be sure. Reviewers: Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Mickael Maison	4e90af34c6	MINOR: Various javadoc improvement in clients and connect (#5878 ) Fixed formatting issues and added links in a few classes	6 years ago
Stanislav Kozlovski	068ab9cefa	KAFKA-7528: Standardize on Min/Avg/Max Kafka metrics' default value - NaN (#5908 ) While metrics like Min, Avg and Max make sense to respective use Double.MAX_VALUE, 0.0 and Double.MIN_VALUE as default values to ease computation logic, exposing those values makes reading them a bit misleading. For instance, how would you differentiate whether your -avg metric has a value of 0 because it was given samples of 0 or no samples were fed to it? It makes sense to standardize on the output of these metrics with something that clearly denotes that no values have been recorded. Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stig Rohde Døssing	1d4ca5adf3	KAFKA-7616; Make MockConsumer only add entries to the partition map returned by poll() if there are any records to return …eturned by poll() if there are any records to return The MockConsumer behaves unlike the real consumer in that it can return a non-empty ConsumerRecords from poll, that also has a count of 0. This change makes the MockConsumer only add partitions to the ConsumerRecords if there are records to return for those partitions. A unit test in MockConsumerTest demonstrates the issue. Author: Stig Rohde Døssing <stigdoessing@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #5901 from srdo/KAFKA-7616	6 years ago
Yishun Guan	9646602d68	KAFKA-7402: Implement KIP-376 AutoCloseable additions	6 years ago
Vahid Hashemian	c3e7d6252c	KAFKA-6774; Improve the default group id behavior in KafkaConsumer (KIP-289) (#5877 ) Improve the default group id behavior by: * changing the default consumer group to null, where no offset commit or fetch, or group management operations are allowed * deprecating the use of empty (`""`) consumer group on the client Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	1a4d44f206	KAFKA-7576; Fix shutdown of replica fetcher threads (#5875 ) ReplicaFetcherThread.shutdown attempts to close the fetcher's Selector while the thread is running. This in unsafe and can result in `Selector.close()` failing with an exception. The exception is caught and logged at debug level, but this can lead to socket leak if the shutdown is due to dynamic config update rather than broker shutdown. This PR changes the shutdown logic to close Selector after the replica fetcher thread is shutdown, with a wakeup() and flag to terminate blocking sends first. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	12f310d50e	KAFKA-7612: Fix javac warnings and enable warnings as errors (#5900 ) - Use Xlint:all with 3 exclusions (filed KAFKA-7613 to remove the exclusions) - Use the same javac options when compiling tests (seems accidental that we didn't do this before) - Replaced several deprecated method calls with non-deprecated ones: - `KafkaConsumer.poll(long)` and `KafkaConsumer.close(long)` - `Class.newInstance` and `new Integer/Long` (deprecated since Java 9) - `scala.Console` (deprecated in Scala 2.11) - `PartitionData` taking a timestamp (one of them seemingly a bug) - `JsonMappingException` single parameter constructor - Fix unnecessary usage of raw types in several places. - Add @SuppressWarnings for deprecations, unchecked and switch fallthrough in several places. - Scala clean-ups (var -> val, ETA expansion warnings, avoid reflective calls) - Use lambdas to simplify code in a few places - Add @SafeVarargs, fix varargs usage and remove unnecessary `Utils.mkList` method Reviewers: Matthias J. Sax <mjsax@apache.org>, Manikumar Reddy <manikumar.reddy@gmail.com>, Randall Hauch <rhauch@gmail.com>, Bill Bejeck <bill@confluent.io>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	6 years ago
Andras Katona	1c1e5ee979	KAFKA-7518: Fix FutureRecordMetadata.get when TimeUnit is not ms (#5815 ) Also check for timeout before calling `nextRecordMetadata.get`. Added unit test validating the fix. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
huxi	895c83f88d	KAFKA-7412: clarify the doc for producer callback (#5798 ) The metadata in the callback is not null with non-null exception. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Jason Gustafson	29383d6d6a	KAFKA-7604; Fix flaky unit test `testRebalanceAfterTopicUnavailableWithPatternSubscribe` (#5889 ) The problem is the concurrent metadata updates in the foreground and in the heartbeat thread. Changed the code to use ConsumerNetworkClient.poll, which enforces mutual exclusion when accessing the underlying client.	6 years ago
Jason Gustafson	fc1dc358ee	KAFKA-7568; Return leader epoch in ListOffsets response (#5855 ) As part of KIP-320, the ListOffsets API should return the leader epoch of any fetched offset. We either get this epoch from the log itself for a timestamp query or from the epoch cache if we are searching the earliest or latest offset in the log. When handling queries for the latest offset, we have elected to choose the current leader epoch, which is consistent with other handling (e.g. OffsetsForTimes). Reviewers: Jun Rao <junrao@gmail.com>	6 years ago

1 2 3 4 5 ...

1458 Commits (1f692bdf53af4a80b7fd256de4e94ff1d17fc861)