src-kafka

Author	SHA1	Message	Date
Jason Gustafson	596c6c0c0b	KAFKA-7231; Ensure NetworkClient uses overridden request timeout (#5444 ) Fixed incorrect use of default timeout instead of the argument explicitly passed to `newClientRequest`. Reviewers: Ron Dagostino <rndgstn@gmail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Manikumar Reddy O	ac79d49047	MINOR: Implement toString() in config validator classes (#5401 )	6 years ago
Jason Gustafson	c3e7c0bcb2	MINOR: Producers should set delivery timeout instead of retries (#5425 ) Use delivery timeout instead of retries when possible and remove various TODOs associated with completion of KIP-91. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Dhruvil Shah	08a4cda34e	[MINOR] Improve consumer logging on LeaveGroup (#5420 ) * Improve consumer logging on LeaveGroup * Add GroupCoordinator logging, and address review comments Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Jon Lee	a932520135	KAFKA-7126; Reduce number of rebalance for large consumer group after a topic is created This patch forces metadata update for consumers with pattern subscription at the beginning of rebalance (retry.backoff.ms is respected). This is to prevent such consumers from detecting subscription changes (e.g., new topic creation) independently and triggering multiple unnecessary rebalances. KAFKA-7126 contains detailed scenarios and rationale. Author: Jon Lee <jonlee@linkedin.com> Reviewers: Jason Gustafson <jason@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5408 from jonlee2/KAFKA-7126	6 years ago
Yu Yang	7fc7136ffd	KAFKA-5886; Introduce delivery.timeout.ms producer config (KIP-91) (#5270 ) Co-authored-by: Sumant Tambe <sutambe@yahoo.com> Co-authored-by: Yu Yang <yuyang@pinterest.com> Reviewers: Ted Yu <yuzhihong@gmail.com>, Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Matthias J. Sax	487b954542	MINOR: internal config objects should not be logged (#5389 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
Jason Gustafson	c83ecf4c55	KAFKA-7194; Fix buffer underflow if onJoinComplete is retried after failure (#5417 ) An untimely wakeup can cause ConsumerCoordinator.onJoinComplete to throw a WakeupException before completion. On the next poll(), it will be retried, but this leads to an underflow error because the buffer containing the assignment data will already have been advanced. The solution is to duplicate the buffer passed to onJoinComplete. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Dhruvil Shah	d11f6f26b7	KAFKA-6897; Prevent KafkaProducer.send from blocking when producer is closed (#5027 ) After successful completion of KafkaProducer#close, it is possible that an application calls KafkaProducer#send. If the send is invoked for a topic for which we do not have any metadata, the producer will block until `max.block.ms` elapses - we do not expect to receive any metadata update in this case because Sender (and NetworkClient) has already exited. It is only when RecordAccumulator#append is invoked that we notice that the producer has already been closed and throw an exception. If `max.block.ms` is set to Long.MaxValue (or a sufficiently high value in general), the producer could block awaiting metadata indefinitely. This patch makes sure `Metadata#awaitUpdate` periodically checks if the network client has been closed, and if so bails out as soon as possible.	6 years ago
wangshao	ee0c2cee21	MINOR: Fix logged timeout in KafkaProducer.close() (#4623 ) The log line says `ms`, but the actual value could represent a different time unit depending on what the user provided. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Rajini Sivaram	980b725bb0	KAFKA-3702: Change log level of SSL close_notify failure (#5397 ) SslTransportLayer currently closes the SSL engine and logs a warning if close_notify message canot be sent because the remote end closed its connection. This tends to fill up broker logs, especially when using clients which close connections immediately. Since this log entry is not very useful anyway, it would be better to log at debug level. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Rajini Sivaram	be02dbe287	MINOR: Fix transient test failure in SslTransportLayerTest (#5396 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Ron Dagostino	284b46b7ce	KAFKA-7182: SASL/OAUTHBEARER client response missing %x01 seps (#5391 ) The SASL/OAUTHBEARER client response as currently implemented in OAuthBearerSaslClient sends the valid gs2-header "n,," but then sends the "auth" key and value immediately after it. This does not conform to the specification because there is no %x01 after the gs2-header, no %x01 after the auth value, and no terminating %x01. Fixed this and the parsing of the client response in OAuthBearerSaslServer, which currently allows the malformed text. Also updated to accept and ignore unknown properties as required by the spec. Reviewers: Stanislav Kozlovski <familyguyuser192@windowslive.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	0875ddeb23	KAFKA-7168: Treat connection close during SSL handshake as retriable (#5371 ) SSL `close_notify` from broker connection close was processed as a handshake failure in clients while unwrapping the message if a handshake is in progress. Updated to handle this as a retriable IOException rather than a non-retriable SslAuthenticationException to avoid authentication exceptions in clients during rolling restart of brokers. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Ahmed Al Mehdi	8c47a3e52f	KAFKA-5098; KafkaProducer should reject sends to invalid topics …egal char and generates InvalidTopicException If config parameter max.block.ms config parameter is set to a non-zero value, KafkaProducer.send() blocks for the max.block.ms time if topic name has illegal char or is invalid. Wrote a unit test that verifies the appropriate exception is returned when performing a get on the returned future by KafkaProducer.send(). Author: Ahmed Al Mehdi <aalmehdi@aalmehdi-ld1.linkedin.biz> Reviewers: Ismael Juma <ismael@juma.me.uk>, Joel Koshy <jjkoshy@gmail.com>, Manikumar Reddy O <manikumar.reddy@gmail.com> Closes #5247 from ahmedha/KAFKA-5098	6 years ago
Matthias Wessendorf	ba8fb6ec70	Fixing incorrect JavaDoc for METRICS_RECORDING_LEVEL_CONFIG key Reviewers: Sriharsha Chintalapani <sriharsha@apache.org>	6 years ago
Jason Gustafson	8119683a23	MINOR: Tighten FileRecords size checks to prevent overflow (#5332 ) Add some additional size validation to prevent overflows when using `FileRecords`. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Rajini Sivaram	1f8527b331	KAFKA-7136: Avoid deadlocks in synchronized metrics reporters (#5341 ) We need to use the same lock for metric update and read to avoid NPE and concurrent modification exceptions. Sensor add/remove/update are synchronized on Sensor since they access lists and maps that are not thread-safe. Reporters are notified of metrics add/remove while holding (Sensor, Metrics) locks and reporters may synchronize on the reporter lock. Metric read may be invoked by metrics reporters while holding a reporter lock. So read/update cannot be synchronized using Sensor since that could lead to deadlock. This PR introduces a new lock in Sensor for update/read. Locking order: - Sensor#add: Sensor -> Metrics -> MetricsReporter - Metrics#removeSensor: Sensor -> Metrics -> MetricsReporter - KafkaMetric#metricValue: MetricsReporter -> Sensor#metricLock - Sensor#record: Sensor -> Sensor#metricLock Reviewers: Jun Rao <junrao@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Yishun Guan	d44d5d7520	KAFKA-6986: Export Admin Client metrics through Stream Threads (#5210 ) KAFKA-6986:Export Admin Client metrics through Stream Threads We already exported producer and consumer metrics through KafkaStreams class: #4998 It makes sense to also export the Admin client metrics. I didn't add a separate unittest case for this. Let me know if it's needed. This is my first contribution, feel free to point out any mistakes that I did. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stanislav Kozlovski	053d831992	KAFKA-7111: Log error connecting to node at a higher log level (#5312 ) There are cases where the broker would return an unresolve-able address (e.g broker inside a docker network while client is outside) and the client would not log any information as to why it is timing out, since the default log level does not print DEBUG messages. Changing this log level will enable easier troubleshooting in such circumstances. This change does not change the logs shown on transient failures like a broker failure.	6 years ago
Jason Gustafson	919409bb23	MINOR: Ensure heartbeat last poll time always updated (#5308 ) We need to ensure that the last poll time is always updated when the user call poll(Duration). This patch fixes a bug in the new KIP-266 timeout behavior which would cause this to be skipped if the coordinator could not be found while the consumer was in an active group. Note that I've also fixed some type inconsistencies for various timeouts. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stanislav Kozlovski	daa8082d67	KAFKA-6809: Count inbound connections in the connection-creation metric (#5301 ) Previously, the connection-creation metric only accounted for opened connections from the broker. This change extends it to account for received connections.	6 years ago
Manikumar Reddy O	b2ed7d80de	MINOR: Update consumer javadoc for position method (#5100 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Manikumar Reddy O	51935ee2e6	KAFKA-7091; AdminClient should handle FindCoordinatorResponse errors (#5278 ) - Update KafkaAdminClient implementation to handle FindCoordinatorResponse errors - Remove scala AdminClient usage from core and streams tests Reviewers: Matthias J. Sax <matthias@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	7a74ec62d2	MINOR: Avoid FileInputStream/FileOutputStream (#5281 ) They rely on finalizers (before Java 11), which create unnecessary GC load. The alternatives are as easy to use and don't have this issue. Also use FileChannel directly instead of retrieving it from RandomAccessFile whenever possible since the indirection is unnecessary. Finally, add a few try/finally blocks. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
tedyu	2f36da1747	Minor: add exception to debug log for Sender#maybeSendTransactionalRequest (#5282 ) Reviewer: Ismael Juma <ismael@confluent.io>, Matthias J. Sax <matthias@confluent.io>	6 years ago
Rajini Sivaram	624acc63da	MINOR: Cleanup threads in integration tests (#5269 ) Leftover threads doing network I/O can interfere with subsequent tests. Add missing shutdown in tests and include admin client in the check for leftover threads. Reviewers: Anna Povzner <anna@confluent.io>, Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy O <manikumar.reddy@gmail.com>	6 years ago
Vahid Hashemian	418a91b5d4	KAFKA-4682; Revise expiration semantics of consumer group offsets (KIP-211 - Part 1) (#4896 ) This patch contains the improved offset expiration semantics proposed in KIP-211. Committed offsets will not be expired as long as a group is active. Once all members have left the group, then offsets will be expired after the timeout configured by `offsets.retention.minutes`. Note that the optimization for early expiration of unsubscribed topics will be implemented in a separate patch.	6 years ago
Rajini Sivaram	e5ec3f55c9	KAFKA-6546: Use LISTENER_NOT_FOUND_ON_LEADER error for missing listener (#5189 ) For metadata request version 6 and above, use a different error code to indicate missing listener on leader broker to enable diagnosis of listener configuration issues. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Andy Coates	d2b84bd4b0	Kafka_7064 - bug introduced when switching config commands to ConfigResource (#5245 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Jason Gustafson	3afe2ed8e3	MINOR: Handle segment splitting edge cases and fix recovery bug (#5169 ) This patch fixes the following issues in the log splitting logic added to address KAFKA-6264: 1. We were not handling the case when all messages in the segment overflowed the index. In this case, there is only one resulting segment following the split. 2. There was an off-by-one error in the recovery logic when completing a swap operation which caused an unintended segment deletion. Additionally, this patch factors out of `splitOverflowedSegment` a method to write to a segment using from with an instance of `FileRecords`. This allows for future reuse and isolated testing. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Ismael Juma	cc4dce94af	KAFKA-2983: Remove Scala consumers and related code (#5230 ) - Removed Scala consumers (`SimpleConsumer` and `ZooKeeperConsumerConnector`) and their tests. - Removed Scala request/response/message classes. - Removed any mention of new consumer or new producer in the code with the exception of MirrorMaker where the new.consumer option was never deprecated so we have to keep it for now. The non-code documentation has not been updated either, that will be done separately. - Removed a number of tools that only made sense in the context of the Scala consumers (see upgrade notes). - Updated some tools that worked with both Scala and Java consumers so that they only support the latter (see upgrade notes). - Removed `BaseConsumer` and related classes apart from `BaseRecord` which is used in `MirrorMakerMessageHandler`. The latter is a pluggable interface so effectively public API. - Removed `ZkUtils` methods that were only used by the old consumers. - Removed `ZkUtils.registerBroker` and `ZKCheckedEphemeral` since the broker now uses the methods in `KafkaZkClient` and no-one else should be using that method. - Updated system tests so that they don't use the Scala consumers except for multi-version tests. - Updated LogDirFailureTest so that the consumer offsets topic would continue to be available after all the failures. This was necessary for it to work with the Java consumer. - Some multi-version system tests had not been updated to include recently released Kafka versions, fixed it. - Updated findBugs and checkstyle configs not to refer to deleted classes and packages. Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Rajini Sivaram	e8955f731e	KAFKA-7012: Don't process SSL channels without data to process (#5237 ) Avoid unnecessary processing of SSL channels when there are some bytes buffered, but not enough to make progress. Reviewers: Radai Rosenblatt <radai.rosenblatt@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Jason Gustafson	347b37e319	MINOR: Do not require request timeout be larger than session timeout (#5246 ) This check was left over from the old consumer logic in which the join group was bound by the session timeout. Since we use a custom timeout for JoinGroup, this restriction no longer makes sense. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Robert Yokota	d06da1b7f4	KAFKA-7068: Handle null config values during transform (KIP-297) Fix NPE when processing null config values during transform. Author: Robert Yokota <rayokota@gmail.com> Reviewers: Magesh Nandakumar <magesh.n.kumar@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5241 from rayokota/KIP-297-null-config-values	7 years ago
Chia-Ping Tsai	f2cb8523d1	KAFKA-7032 The TimeUnit is neglected by KakfaConsumer#close(long, Tim… (#5182 )	7 years ago
Dhruvil Shah	a8c17e36c3	MINOR: Fix chunked down-conversion behavior when no valid batch exists after conversion (#5173 ) We might decide to drop certain message batches during down-conversion because older clients might not be able to interpret them. One such example is control batches which are typically removed by the broker if down-conversion to V0 or V1 is required. This patch makes sure the chunked down-conversion implementation is able to handle such cases.	7 years ago
Andy Coates	642a97783d	KAFKA-7010: Rename ResourceNameType to PatternType (#5205 ) The initial PR for KIP-290 #5117 added a new `ResourceNameType`, which was initially a field on `Resource` and `ResourceFilter`. However, follow on PRs have now moved the name type fields to new `ResourcePattern` and `ResourcePatternFilter` classes. This means the old name is no longer valid and may be confusing. The PR looks to rename the class to a more intuitive `resource.PatternType`. @cmccabe also requested that the current `ANY` value for this class be renamed to avoid confusion. `PatternType.ANY` currently causes `ResourcePatternFilter` to bring back all ACLs that would affect the supplied resource, i.e. it brings back literal, wildcard ACLs, and also does pattern matching to work out which prefix acls would affect the resource. This is very different from the behaviour of `ResourceType.ANY`, which just means the filter ignores the type of resources. `ANY` is to be renamed to `MATCH` to disambiguate it from other `ANY` filter types. A new `ANY` will be added that works in the same way as others, i.e. it will cause the filter to ignore the pattern type, (but won't do any pattern matching). Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Dhruvil Shah	9a71bfb9d6	KAFKA-7030; Add configuration to disable message down-conversion (KIP-283) (#5192 ) Add support for the topic-level `message.downconversion.enable` config as part of KIP-283.	7 years ago
Jason Gustafson	443091b844	KAFKA-7050; Decrease default consumer request timeout to 30s (#5203 ) This patch changes the default `request.timeout.ms` of the consumer to 30 seconds. Additionally, it adds logic to `NetworkClient` and related to components to support timeouts at the request level. We use this to handle the special case of the JoinGroup request, which may block for as long as the value configured by `max.poll.interval.ms`. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Dhruvil Shah	53ca52f855	KAFKA-6979; Add `default.api.timeout.ms` to KafkaConsumer (KIP-266) (#5122 ) Adds a configuration that specifies the default timeout for KafkaConsumer APIs that could block. This was introduced in KIP-266. Reviewers: Satish Duggana <satish.duggana@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Andy Coates	a592402512	KAFKA-7007: Use JSON for /kafka-acl-extended-changes path (#5161 ) Keep Literal ACLs on the old paths, using the old formats, to maintain backwards compatibility. Have Prefixed, and any latter types, go on new paths, using JSON, (old brokers are not aware of them) Add checks to reject any adminClient requests to add prefixed acls before the cluster is fully upgraded. Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Robert Yokota	16190e9bfd	MINOR: Move FileConfigProvider to provider subpackage (#5194 ) This moves FileConfigProvider to the org.apache.common.config.provider package to more easily isolate provider implementations going forward. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Dong Lin	4580d9f16a	MINOR: Remove deprecated per-partition lag metrics It takes O(n^2) time to instantiate a mbean with n attributes which can be very slow if the number of attributes of this mbean is large. This PR removes metrics whose number of attributes can grow with the number of partitions in the cluster to fix the performance issue. These metrics have already been marked for removal in 2.0 by KIP-225. Author: Dong Lin <lindong28@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #5172 from lindong28/remove-deprecated-metrics	7 years ago
Dhruvil Shah	f0b1b46486	MINOR: Use SL4J string interpolation instead of string concatenation (#5113 ) Also tweak logging message slightly and use Records.LOG_OVERHEAD definition. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Andy Coates	49db5a63c0	KAFKA-7005: Remove duplicate resource class. (#5184 ) This is a follow-on change requested as part of the initial PR for KIP-290 #5117. @cmccabe requested that the `resource.Resource` class be factored out in favour of `ConfigResource` to avoid confusion between all the `Resource` implementations. Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Ismael Juma	a4c2921736	MINOR: Remove APIs deprecated in 0.11.0 for core and clients (#5158 ) Not included: old consumers and checksum methods Reviewers: Dong Lin <lindong28@gmail.com>	7 years ago
Dhruvil Shah	d2b2fbdf94	KAFKA-6264; Split log segments as needed if offsets overflow the indexes (#4975 ) This patch adds logic to detect and fix segments which have overflowed offsets as a result of bugs in older versions of Kafka. Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Andy Coates	0b3989fd72	KAFKA-7006 - remove duplicate Scala ResourceNameType in preference to… (#5152 ) remove duplicate Scala ResourceNameType in preference to in preference to Java ResourceNameType. This is follow on work for KIP-290 and PR #5117, which saw the Scala ResourceNameType class introduced. I've added tests to ensure AclBindings can't be created with ResourceNameType.ANY or UNKNOWN. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Andy Coates	b4fa87cc51	KAFKA-7011 - Remove ResourceNameType field from Java Resource class. (#5160 ) The initial PR for KIP-290 #5117 added a `ResourceNameType` field to the Java and Scala `Resource` classes to introduce the concept of Prefixed ACLS. This does not make a lot of sense as these classes are meant to represent cluster resources, which would not have a concept of 'name type'. This work has not been released yet, so we have time to change it. This PR looks to refactor the code to remove the name type field from the Java `Resource` class. (The Scala one will age out once KIP-290 is done, and removing it would involve changes to the `Authorizer` interface, so this class was not touched). This is achieved by replacing the use of `Resource` with `ResourcePattern` and `ResourceFilter` with `ResourceFilterPattern`. A `ResourcePattern` is a combination of resource type, name and name type, where each field needs to be defined. A `ResourcePatternFilter` is used to select patterns during describe and delete operations. The adminClient uses `AclBinding` and `AclBindingFilter`. These types have been switched over to use the new pattern types. The AclCommands class, used by Kafka-acls.sh, has been converted to use the new pattern types. The result is that the original `Resource` and `ResourceFilter` classes are not really used anywhere, except deprecated methods. However, the `Resource` class will be used if/when KIP-50 is done. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago

... 2 3 4 5 6 ...

1433 Commits (0d461e4ea0a8353c358ae661837f471995943bb0)