src-kafka

Commit Graph

Author	SHA1	Message	Date
Jason Gustafson	7f19df29ac	MINOR: AdminClient should respect retry backoff AdminClient should back off when retrying a Call. Fixed and added a unit test Author: Jason Gustafson <jason@confluent.io> Reviewers: Dong Lin <lindong28@gmail.com> Closes #5077 from hachikuji/admin-client-retry-backoff	7 years ago
Radai Rosenblatt	c9ec292135	Improve kafka client sensor registration performance by lazily calculating JMX attributes When any metric (e.g. per-partition metric) is created or deleted, registerMBean() is called which in turn calls getMBeanInfo().getClassName(). However, KafkaMbean.getMBeanInfo() instantiates an array of all sensors even though we only need the class name. This costs a lot of CPU to register sensors when a consumer with a large partition assignment starts. For example, it takes 5 minutes to start a consumer with 35k partitions. This patch reduces the consumer startup time to seconds. Author: radai-rosenblatt <radai.rosenblatt@gmail.com> Reviewers: Satish Duggana <satish.duggana@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5011 from radai-rosenblatt/fun-with-jmx	7 years ago
Ismael Juma	70f0d0bd3f	MINOR: Use reflection for signal handler and do not enable it for IBM JDK (#5047 ) The Signal classes are not available in the compile classpath if --release is used so we use reflection as a workaround. As part of that moved the code to Java and added a simple unit test. Also disabled the signal handler if the IBM JDK is being used due to KAFKA-6918. Manually tested shutdown via ctrl+c and verified that the message is printed.	7 years ago
Rajini Sivaram	ff9f928c16	KAFKA-6911; Fix dynamic keystore/truststore update check (#5029 ) Fix the check, add unit test to verify the change, update `DynamicBrokerReconfigurationTest` to avoid dynamic keystore update in tests which are not expected to update keystores.	7 years ago
Jason Gustafson	0f86e68840	MINOR: Remove dependence on __consumer_offsets in AdminClient listConsumerGroups Avoid dependence on the internal __consumer_offsets topic to handle `listConsumerGroups()` since it unnecessarily requires users to have Describe access on an internal topic. Instead we query each broker independently. For most clusters, this amounts to the same thing since the default number of partitions for __consumer_offsets is 50. This also provides better encapsulation since it avoids exposing the use of __consumer_offsets, which gives us more flexibility in the future. Author: Jason Gustafson <jason@confluent.io> Reviewers: Dong Lin <lindong28@gmail.com> Closes #5007 from hachikuji/remove-admin-use-of-offsets-topic	7 years ago
Jason Gustafson	5be47a2f26	MINOR: AdminClient consumer group domain objects should have public constructors (#5063 ) These constructors should be public to allow users to write test cases using them. We follow a similar pattern for the other domain objects that we expose in `AdminClient` (e.g. `TopicDescription`). Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jason Gustafson	c1b30a12b1	MINOR: AdminClient metadata manager should reset state on failure If the internal metadata request fails, we must reset the state inside `AdminClientMetadataManager` or we will be stuck indefinitely in the `UPDATE_PENDING` state and have no way to fetch new metadata. Author: Jason Gustafson <jason@confluent.io> Reviewers: Dong Lin <lindong28@gmail.com> Closes #5057 from hachikuji/fix-admin-client-metadata-update-failure	7 years ago
Ismael Juma	a30ecc6755	MINOR: Remove o.a.kafka.common.utils.Base64 and IS_JAVA8_COMPATIBLE We no longer need them since we now require Java 8. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Andras Beni <andrasbeni@cloudera.com>, Manikumar Reddy O <manikumar.reddy@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5049 from ijuma/remove-base64	7 years ago
Ismael Juma	193c779682	MINOR: Remove unnecessary conditional in KafkaAdminClient to fix checkstyle (#5058 )	7 years ago
Ismael Juma	e70a191d30	KAFKA-4423: Drop support for Java 7 (KIP-118) and update deps (#5046 ) * Set --source, --target and --release to 1.8. * Build Scala 2.12 by default. * Remove some conditionals in the build file now that Java 8 is the minimum version. * Bump the version of Jetty, Jersey and Checkstyle (the newer versions require Java 8). * Fixed issues uncovered by the new version of Checkstyle. * A couple of minor updates to handle an incompatible source change in the new version of Jetty. * Add dependency to jersey-hk2 to fix failing tests caused by Jersey upgrade. * Update release script to use Java 8 and to take into account that Scala 2.12 is now built by default. * While we're at it, bump the version of Gradle, Gradle plugins, ScalaLogging, JMH and apache directory api. * Minor documentation updates including the readme and upgrade notes. A number of Streams Java 7 examples can be removed subsequently.	7 years ago
Guozhang Wang	70a506b983	MINOR: Ignore test_broker_type_bounce_at_start system test (#5055 ) test_broker_type_bounce_at_start tries to validate that when the controller is down, the streams client will always fail trying to create the topic; with the current behavior of admin client it is actually not always true: the actual behavior depends on the admin client internals as well as when the controller becomes unavailable during the leader assign partitions phase. I'd suggest at least ignoring this test for now until the admin client is more stable (personally I'd even suggest removing this test, as its coverage benefits are smaller than the issues it introduces, to me). Also adding a few more log4j entries as a result of investigating this issue. Reviewers: Matthias J. Sax <matthias@confluent.io>	7 years ago
Colin Patrick McCabe	16ad358d64	KAFKA-6868; Fix buffer underflow and expose group state in the consumer groups API (#4980 ) * The consumer groups API should expose group state and coordinator information. This information is needed by administrative tools and scripts that access consumer groups. * The partition assignment will be empty when the group is rebalancing. Fix an issue where the adminclient attempted to deserialize this empty buffer. * Remove nulls from the API and make all collections immutable. * DescribeConsumerGroupsResult#all should return a result as expected, rather than Void * Fix exception text for GroupIdNotFoundException, GroupNotEmptyException. It was being filled in as "The group id The group id does not exist was not found" and similar. Reviewers: Attila Sasvari <asasvari@apache.org>, Andras Beni <andrasbeni@cloudera.com>, Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Boyang Chen	1e207b2ef8	KAFKA-6896: Add producer metrics exporting in KafkaStreams (#4998 ) We would like to also export the producer metrics from StreamThread just like consumer metrics, so that we could gain more visibility of stream application. The approach is to pass in the threadProducer into the StreamThread so that we could export its metrics dynamically. Note that this is a pure internal change that doesn't require a KIP, and in the future we also want to export admin client metrics. A followup KIP for admin client will be created once this is merged. Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
Matthias J. Sax	9947cd40c6	MINOR: Ensure sensor names are unique in Kafka Streams (#5009 ) Reviewer: Guozhang Wang <guozhang@confluent.io>	7 years ago
Ismael Juma	c3921d489f	MINOR: Rename RecordFormat to RecordVersion (#4809 ) Also include a few clean-ups: * Method/variable/parameter renames to make them consistent with the class name * Return `ApiVersion` from `minSupportedFor` * Use `values` to remove some code duplication * Reduce duplication in `ApiVersion` by introducing the `shortVersion` method and building the versions map programmatically * Avoid unnecessary `regex` in `ApiVersion.apply` * Added scaladoc to a few methods Some of these were originally discussed in: https://github.com/apache/kafka/pull/4583#pullrequestreview-98089400 Added a test for `ApiVersion.shortVersion`. Relying on existing tests for the rest since there is no change in behaviour. Reviewers: Jason Gustafson <jason@confluent.io>	7 years ago
Jason Gustafson	a5ea6d10a8	MINOR: A few small cleanups in AdminClient from KAFKA-6299 (#4989 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Gunju Ko	c90bbc2749	MINOR: Fix typo in ConsumerRebalanceListener JavaDoc (#4996 )	7 years ago
Chia-Ping Tsai	4f7c11a1df	KAFKA-6870 Concurrency conflicts in SampledStat (#4985 ) Make `KafkaMetric.measurableValue` thread-safe Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Anna Povzner	9679c44d2b	KAFKA-6361: Fix log divergence between leader and follower after fast leader fail over (#4882 ) Implementation of KIP-279 as described here: https://cwiki.apache.org/confluence/display/KAFKA/KIP-279%3A+Fix+log+divergence+between+leader+and+follower+after+fast+leader+fail+over In summary: - Added leader_epoch to OFFSET_FOR_LEADER_EPOCH_RESPONSE - Leader replies with the pair( largest epoch less than or equal to the requested epoch, the end offset of this epoch) - If Follower does not know about the leader epoch that leader replies with, it truncates to the end offset of largest leader epoch less than leader epoch that leader replied with, and sends another OffsetForLeaderEpoch request. That request contains the largest leader epoch less than leader epoch that leader replied with. Reviewers: Dong Lin <lindong28@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Colin Patrick McCabe	abbd53da4a	KAFKA-6299; Fix AdminClient error handling when metadata changes (#4295 ) When AdminClient gets a NOT_CONTROLLER error, it should refresh its metadata and retry the request, rather than making the end-user deal with NotControllerException. Move AdminClient's metadata management outside of NetworkClient and into AdminMetadataManager. This will make it easier to do more sophisticated metadata management in the future, such as implementing a NodeProvider which fetches the leaders for topics. Rather than manipulating newCalls directly, the AdminClient service thread now drains it directly into pendingCalls. This minimizes the amount of locking we have to do, since pendingCalls is only accessed from the service thread.	7 years ago
Rajini Sivaram	0ecb72f59d	KAFKA-6834: Handle compaction with batches bigger than max.message.bytes (#4953 ) Grow buffers in log cleaner to hold one message set after sanity check even if message set is bigger than max.message.bytes. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Adem Efe Gencer	7afcb3a64c	KAFKA-6877; Remove completedFetch upon a failed parse if it contains no records. This patch removed a completedFetch from the completedFetches queue upon a failed parse if it contains no records. The following scenario explains why this is needed for an instance of this case – i.e. in TopicAuthorizationException. 0. Let's assume a scenario, in which the consumer is attempting to read from a topic without the necessary read permission. 1. In Fetcher#fetchedRecords(), after peeking the completedFetches, the Fetcher#parseCompletedFetch(CompletedFetch) throws a TopicAuthorizationException (as expected). 2. Fetcher#fetchedRecords() passes the TopicAuthorizationException up without having a chance to poll completedFetches. So, the same completedFetch remains at the completedFetches queue. 3. Upon following calls to Fetcher#fetchedRecords(), peeking the completedFetches will always return the same completedFetch independent of any updates to the ACL that the topic is trying to read from. 4. Hence, despite the creation of an ACL with correct permissions, once the consumer sees the TopicAuthorizationException, it will be unable to recover without a bounce. Author: Adem Efe Gencer <agencer@linkedin.com> Reviewers: Jiangjie (Becket) Qin <becket.qin@gmail.com> Closes #4974 from efeg/fix/parseCompletedFetchRemainsInQueue	7 years ago
Roman Khlebnov	fcb15e357c	KAFKA-6292; Improve FileLogInputStream batch position checks to avoid type overflow (#4928 ) Switch from sum operations to subtraction to avoid type casting in checks and type overflow during `FileLogInputStream` work, especially in cases where property `log.segment.bytes` was set close to the `Integer.MAX_VALUE` and used as a `position` inside `nextBatch()` function. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Rajini Sivaram	6dbd9b59e6	KAFKA-6854; Handle batches deleted during log cleaning of logs with txns (#4962 ) Log cleaner grows buffers when result.messagesRead is zero. This contains the number of filtered messages read from source which can be zero when transactions are used because batches may be discarded. Log cleaner incorrectly assumes that messages were not read because the buffer was too small and attempts to double the buffer size unnecessarily, failing with an exception if the buffer is already max.message.bytes. Additional check for discarded batches has been added to avoid growing buffers when batches are discarded. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Bill Bejeck	04a70bd3fe	KAFKA-6829: retry commits on unknown topic or partition (#4948 ) For the UNKNOWN_TOPIC_OR_PARTITION error, we could change the consumer's behavior to retry after this error. While this is a rare case, since the user would not commit offsets for topics unless they had been able to fetch from them, this doesn't really handle the situation where the broker hasn't received any metadata updates. Reviewers: Jason Gustafson <jason@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Rajini Sivaram	9d2efd83a6	KAFKA-6810; Enable dynamic update of SSL truststores (#4904 ) Enable broker's SSL truststores to be dynamically updated using ConfigCommand in the same way as keystores are updated.	7 years ago
Jason Gustafson	f467c9c243	MINOR: Ensure exception messages include partition/segment info when possible (#4907 ) Reviewers: Anna Povzner <anna@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Manikumar Reddy O	be5b0fd2a9	MINOR: Fix sasl.jaas.config doc string (#4921 )	7 years ago
Mickael Maison	902009ea98	KAFKA-3417: Wrap metric reporter calls in try/catch blocks (#3635 ) Prevent exceptions thrown by metric reporters from impacting request processing and other reporters. Co-authored-by: Mickael Maison <mickael.maison@gmail.com> Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Jason Gustafson	459efb02ad	HOTFIX: ListConsumerGroupsResult should use KafkaFuture (#4933 )	7 years ago
Colin Patrick McCabe	6be908a829	MINOR: Refactor AdminClient ListConsumerGroups API (#4884 ) The current Iterator-based ListConsumerGroups API is synchronous. The API should be asynchronous to fit in with the other AdminClient APIs. Also fix some error handling corner cases. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Arjun Satish	d9e804b889	MINOR: Clarify meaning of end offset in consumer javadocs (#4885 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Andras Beni	462342210f	KAFKA-3365; Add documentation method for protocol types and update doc generation (#4735 ) Reviewers: Sandor Murakozi <smurakozi@gmail.com>, Magnus Edenhill <magnus@edenhill.se>, Jason Gustafson <jason@confluent.io>	7 years ago
John Roesler	ac9c3ed0b4	KAFKA-6376: preliminary cleanup (#4872 ) General cleanup of Streams code, mostly resolving compiler warnings and re-formatting. The regular testing suite should be sufficient. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Mickael Maison	83503404e4	KAFKA-6770; Add New Protocol Versions to 1.1.0 documentation (#4847 ) Update 1.1 docs to include 2 new versions to existing APIs: - DescribeConfigs v1 - Fetch v7 Also fix a typo in FetchRequest.	7 years ago
Rajini Sivaram	e5de679d62	KAFKA-6765: Handle exception while reading throttle metric value in test (#4869 ) Quota tests wait for throttle metric to be updated without waiting for requests to complete to avoid waiting for potentially large throttle times. This requires the test to read metric values while a broker may be updating the value, resulting in exception in the test. Since this issue can also occur with JMX metrics reporter, change synchronization on metrics with sensors to use the sensor as lock.	7 years ago
Andy Coates	432c82d3bf	KAFKA-6727; Fix broken Config hashCode() and equals() (#4796 ) Reviewers: Manikumar Reddy O <manikumar.reddy@gmail.com>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Ismael Juma	f3ed56b21f	MINOR: Mention that -1 disables retention by time (#4881 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Guozhang Wang	9871357086	KAFKA-6592: Follow-up (#4864 ) Do not require ConsoleConsumer to specify inner serde as a special property, but just a normal property of the message formatter.	7 years ago
Guozhang Wang	0dc7f0e66f	KAFKA-6611, PART II: Improve Streams SimpleBenchmark (#4854 ) SimpleBenchmark: 1.a Do not rely on manual num.records / bytes collection on atomic integers. 1.b Rely on config files for num.threads, bootstrap.servers, etc. 1.c Add parameters for key skewness and value size. 1.d Refactor the tests for loading phase, adding tumbling-windowed count. 1.e For consumer / consumeproduce, collect metrics on consumer instead. 1.f Force stop the test after 3 minutes, this is based on empirical numbers of 10M records. Other tests: use config for kafka bootstrap servers. streams_simple_benchmark.py: only use scale 1 for system test, remove yahoo from benchmark tests. Note that the JMX-based metrics are more accurate than the manually collected metrics. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	b599b395f3	KAFKA-6058: Refactor consumer API result return types (#4856 ) Refactored the return types in consumer group APIs the following way: ``` Map<TopicPartition, KafkaFuture<Void>> DeleteConsumerGroupsResult#deletedGroups() Map<TopicPartition, KafkaFuture<ConsumerGroupDescription>> DescribeConsumerGroupsResult#describedGroups() KafkaFuture<Collection<ConsumerGroupListing>> ListConsumerGroupsResult#listings() KafkaFuture<Map<TopicPartition, OffsetAndMetadata>> ListConsumerGroupOffsetsResult#partitionsToOffsetAndMetadata() ``` * For DeleteConsumerGroupsResult and DescribeConsumerGroupsResult, for each group id we have two round-trips to get the coordinator, and then send the delete / describe request; I leave the potential optimization of batching requests for future work. * For ListConsumerGroupOffsetsResult, it is a simple single round-trip and hence the whole map is wrapped as a Future. * ListConsumerGroupsResult, it is the most tricky one: we would only know how many futures we should wait for after the first listNode returns, and hence I constructed the flattened future in the middle wrapped with the underlying map of futures; also added an iterator API to compensate the "fail the whole future if any broker returns error" behavior. The iterator future will throw exception on the failing brokers, while return the consumer for other succeeded brokers. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jason Gustafson <jason@confluent.io>	7 years ago
Jason Gustafson	fb3a9485a8	MINOR: Disable failing testDescribeConsumerGroupOffsets test case (#4863 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	7 years ago
John Roesler	cc43e77bbb	MINOR: make Sensor#add idempotent (#4853 ) This change makes adding a metric to a sensor idempotent. That is, if the metric is already added to the sensor, the method returns with success. The current behavior is that any attempt to register a second metric with the same name is an error. Testing strategy: There is a new unit test covering this behavior Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Jorge Quilcate Otoya	6a99da87ab	KAFKA-6058: KIP-222; Add Consumer Group operations to Admin API KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-222+-+Add+Consumer+Group+operations+to+Admin+API Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com> Closes #4454 from jeqo/feature/admin-client-describe-consumer-group	7 years ago
Manikumar Reddy O	47918f2d79	KAFKA-6447: Add Delegation Token Operations to KafkaAdminClient (KIP-249) (#4427 ) Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	5e277e5579	KAFKA-4883: handle NullPointerException while parsing login module control flag (#4849 )	7 years ago
Magnus Edenhill	e490a90625	Make [Config]Resource.toString() consistent with existing code (#4845 ) The toString() for ConfigResource was using { } instead of ( ) which is inconsistent with the existing toStrings in the code, while toString for Resource was using a mix of ( and }.	7 years ago
Jason Gustafson	0a8f35b684	KAFKA-6768; Transactional producer may hang in close with pending requests (#4842 ) This patch fixes an edge case in producer shutdown which prevents `close()` from completing due to a pending request which will never be sent due to shutdown initiation. I have added a test case which reproduces the scenario. Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Rajini Sivaram	77ebd32016	KAFKA-6576: Configurable Quota Management (KIP-257) (#4699 ) Enable quota calculation to be customized using a configurable callback. See KIP-257 for details. Reviewers: Jun Rao <junrao@gmail.com>	7 years ago
Manikumar Reddy O	77c79df396	KAFKA-6741: Disable Selector's idle connection timeout in testNetworkThreadTimeRecorded() test (#4824 ) Reviewers: Jason Gustafson <jason@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
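
The KAFKA-6292 entry above (fcb15e357c) describes switching bounds checks from sums to subtraction so that positions near `Integer.MAX_VALUE` do not overflow. The following is only a minimal sketch of that general idea; the class and method names are hypothetical and are not taken from Kafka's `FileLogInputStream`.

```java
// Hypothetical illustration only; not the code from the Kafka commit.
final class BatchBoundsSketch {
    // Overflow-prone form: position + batchSize can wrap past Integer.MAX_VALUE
    // and become negative, so the comparison may wrongly succeed.
    static boolean fitsUsingSum(int position, int batchSize, int end) {
        return position + batchSize <= end;
    }

    // Subtraction form: assuming 0 <= position <= end, end - position cannot
    // overflow, so the comparison stays correct near Integer.MAX_VALUE.
    static boolean fitsUsingSubtraction(int position, int batchSize, int end) {
        return batchSize <= end - position;
    }

    public static void main(String[] args) {
        int end = Integer.MAX_VALUE;
        int position = Integer.MAX_VALUE - 10; // only 10 bytes remain
        int batchSize = 100;                   // batch does not actually fit
        System.out.println(fitsUsingSum(position, batchSize, end));         // true (wrapped, wrong)
        System.out.println(fitsUsingSubtraction(position, batchSize, end)); // false (correct)
    }
}
```

The subtraction form relies on the stated precondition that `position` is non-negative and no greater than `end`; a caller that cannot guarantee this would need a separate range check first.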

1565 Commits (cfea95343dfe9f297bceb98676b7bc04e5035776)