src-kafka

Author	SHA1	Message	Date
Viktor Somogyi	b1090e52a3	MINOR: Eliminate warnings from KafkaProducerTest (#5548 ) And code clean-ups in the same file. Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Rohan	ea4078e72a	KAFKA-7311: Reset next batch expiry time on each poll loop Sender/RecordAccumulator never resets the next batch expiry time. It's always computed as the min of the current value and the expiry time for all batches being processed. This means that it's always set to the expiry time of the first batch, and once that time has passed Sender starts spinning on epoll with a timeout of 0, which consumes a lot of CPU. This patch updates Sender to reset the next batch expiry time on each poll loop so that a new value reflecting the expiry time for the current set of batches is computed. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Jason Gustafson	4f25f1fb71	KAFKA-7296; Handle coordinator loading error in TxnOffsetCommit (#5514 ) We should check TxnOffsetCommit responses for the COORDINATOR_LOADING_IN_PROGRESS error code and retry if we see it. Additionally, if we encounter an abortable error, we need to ensure that pending transaction offset commits are cleared. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Rajini Sivaram	e4de328dab	KAFKA-7288: Fix for SslSelectorTest.testCloseConnectionInClosingState (#5504 ) Ensure that sends are completed before waiting for channel to be closed based on idle expiry, since channel will not be expired if added to ready keys in the next poll as a result of pending sends. Reviewers: Jun Rao <junrao@gmail.com>	6 years ago
Stanislav Kozlovski	ad84eb5e76	KAFKA-7169: Validate SASL extensions through callback on server side (#5497 ) Reviewers: Ron Dagostino <rndgstn@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
John Roesler	6d1685fa45	KAFKA-7284: streams should unwrap fenced exception (#5499 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <guozhang@confluent.io>	6 years ago
Rajini Sivaram	a3dc10f1d9	KAFKA-7261: Record 1.0 for total metric when Count stat is used for rate (#5484 ) Reviewers: Jun Rao <junrao@gmail.com>, John Roesler <john@confluent.io>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Manikumar Reddy O	92004fa21a	KAFKA-6751; Support dynamic configuration of max.connections.per.ip/max.connections.per.ip.overrides configs (KIP-308) (#5334 ) KIP-308 implementation. See https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=85474993. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Sébastien Launay	811c6433bd	KAFKA-4950; Fix ConcurrentModificationException on assigned-partitions metric update (#3907 ) Use a volatile field to track the size of the set of assigned partitions to avoid the concurrent access to the underlying linked hash map. Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Stanislav Kozlovski	518e9d3eee	KAFKA-7169: Custom SASL extensions for OAuthBearer authentication mechanism (KIP-342) (#5379 ) Reviewers: Ron Dagostino <rndgstn@gmail.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Jason Gustafson	fc5f6b0e46	MINOR: Add Timer to simplify timeout bookkeeping and use it in the consumer (#5087 ) We currently do a lot of bookkeeping for timeouts which is both error-prone and distracting. This patch adds a new `Timer` class to simplify this logic and control unnecessary calls to system time. In particular, this helps with nested timeout operations. The consumer has been updated to use the new class. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
radai-rosenblatt	09fe51f3eb	KAFKA-6648; Fetcher.getTopicMetadata() should return all partitions for each requested topic Currently Fetcher.getTopicMetadata() will not include offline partitions. Thus KafkaConsumer.partitionsFor(topic) will not return all partitions of a topic if any partition of the topic is offline. This causes a problem if the user tries to query the total number of partitions of the given topic. Author: radai-rosenblatt <radai.rosenblatt@gmail.com> Reviewers: Jason Gustafson <jason@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4679 from radai-rosenblatt/partition_shenanigans	6 years ago
Jason Gustafson	596c6c0c0b	KAFKA-7231; Ensure NetworkClient uses overridden request timeout (#5444 ) Fixed incorrect use of default timeout instead of the argument explicitly passed to `newClientRequest`. Reviewers: Ron Dagostino <rndgstn@gmail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Jon Lee	a932520135	KAFKA-7126; Reduce number of rebalance for large consumer group after a topic is created This patch forces metadata update for consumers with pattern subscription at the beginning of rebalance (retry.backoff.ms is respected). This is to prevent such consumers from detecting subscription changes (e.g., new topic creation) independently and triggering multiple unnecessary rebalances. KAFKA-7126 contains detailed scenarios and rationale. Author: Jon Lee <jonlee@linkedin.com> Reviewers: Jason Gustafson <jason@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5408 from jonlee2/KAFKA-7126	6 years ago
Yu Yang	7fc7136ffd	KAFKA-5886; Introduce delivery.timeout.ms producer config (KIP-91) (#5270 ) Co-authored-by: Sumant Tambe <sutambe@yahoo.com> Co-authored-by: Yu Yang <yuyang@pinterest.com> Reviewers: Ted Yu <yuzhihong@gmail.com>, Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	c83ecf4c55	KAFKA-7194; Fix buffer underflow if onJoinComplete is retried after failure (#5417 ) An untimely wakeup can cause ConsumerCoordinator.onJoinComplete to throw a WakeupException before completion. On the next poll(), it will be retried, but this leads to an underflow error because the buffer containing the assignment data will already have been advanced. The solution is to duplicate the buffer passed to onJoinComplete. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Dhruvil Shah	d11f6f26b7	KAFKA-6897; Prevent KafkaProducer.send from blocking when producer is closed (#5027 ) After successful completion of KafkaProducer#close, it is possible that an application calls KafkaProducer#send. If the send is invoked for a topic for which we do not have any metadata, the producer will block until `max.block.ms` elapses - we do not expect to receive any metadata update in this case because Sender (and NetworkClient) has already exited. It is only when RecordAccumulator#append is invoked that we notice that the producer has already been closed and throw an exception. If `max.block.ms` is set to Long.MaxValue (or a sufficiently high value in general), the producer could block awaiting metadata indefinitely. This patch makes sure `Metadata#awaitUpdate` periodically checks if the network client has been closed, and if so bails out as soon as possible.	6 years ago
Rajini Sivaram	be02dbe287	MINOR: Fix transient test failure in SslTransportLayerTest (#5396 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Ron Dagostino	284b46b7ce	KAFKA-7182: SASL/OAUTHBEARER client response missing %x01 seps (#5391 ) The SASL/OAUTHBEARER client response as currently implemented in OAuthBearerSaslClient sends the valid gs2-header "n,," but then sends the "auth" key and value immediately after it. This does not conform to the specification because there is no %x01 after the gs2-header, no %x01 after the auth value, and no terminating %x01. Fixed this and the parsing of the client response in OAuthBearerSaslServer, which currently allows the malformed text. Also updated to accept and ignore unknown properties as required by the spec. Reviewers: Stanislav Kozlovski <familyguyuser192@windowslive.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	0875ddeb23	KAFKA-7168: Treat connection close during SSL handshake as retriable (#5371 ) SSL `close_notify` from broker connection close was processed as a handshake failure in clients while unwrapping the message if a handshake is in progress. Updated to handle this as a retriable IOException rather than a non-retriable SslAuthenticationException to avoid authentication exceptions in clients during rolling restart of brokers. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Ahmed Al Mehdi	8c47a3e52f	KAFKA-5098; KafkaProducer should reject sends to invalid topics …egal char and generates InvalidTopicException If the max.block.ms config parameter is set to a non-zero value, KafkaProducer.send() blocks for the max.block.ms time if the topic name has an illegal char or is invalid. Wrote a unit test that verifies the appropriate exception is returned when performing a get on the future returned by KafkaProducer.send(). Author: Ahmed Al Mehdi <aalmehdi@aalmehdi-ld1.linkedin.biz> Reviewers: Ismael Juma <ismael@juma.me.uk>, Joel Koshy <jjkoshy@gmail.com>, Manikumar Reddy O <manikumar.reddy@gmail.com> Closes #5247 from ahmedha/KAFKA-5098	6 years ago
Jason Gustafson	8119683a23	MINOR: Tighten FileRecords size checks to prevent overflow (#5332 ) Add some additional size validation to prevent overflows when using `FileRecords`. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Rajini Sivaram	1f8527b331	KAFKA-7136: Avoid deadlocks in synchronized metrics reporters (#5341 ) We need to use the same lock for metric update and read to avoid NPE and concurrent modification exceptions. Sensor add/remove/update are synchronized on Sensor since they access lists and maps that are not thread-safe. Reporters are notified of metrics add/remove while holding (Sensor, Metrics) locks and reporters may synchronize on the reporter lock. Metric read may be invoked by metrics reporters while holding a reporter lock. So read/update cannot be synchronized using Sensor since that could lead to deadlock. This PR introduces a new lock in Sensor for update/read. Locking order: - Sensor#add: Sensor -> Metrics -> MetricsReporter - Metrics#removeSensor: Sensor -> Metrics -> MetricsReporter - KafkaMetric#metricValue: MetricsReporter -> Sensor#metricLock - Sensor#record: Sensor -> Sensor#metricLock Reviewers: Jun Rao <junrao@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Yishun Guan	d44d5d7520	KAFKA-6986: Export Admin Client metrics through Stream Threads (#5210 ) KAFKA-6986:Export Admin Client metrics through Stream Threads We already exported producer and consumer metrics through KafkaStreams class: #4998 It makes sense to also export the Admin client metrics. I didn't add a separate unittest case for this. Let me know if it's needed. This is my first contribution, feel free to point out any mistakes that I did. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Jason Gustafson	919409bb23	MINOR: Ensure heartbeat last poll time always updated (#5308 ) We need to ensure that the last poll time is always updated when the user calls poll(Duration). This patch fixes a bug in the new KIP-266 timeout behavior which would cause this to be skipped if the coordinator could not be found while the consumer was in an active group. Note that I've also fixed some type inconsistencies for various timeouts. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Stanislav Kozlovski	daa8082d67	KAFKA-6809: Count inbound connections in the connection-creation metric (#5301 ) Previously, the connection-creation metric only accounted for opened connections from the broker. This change extends it to account for received connections.	6 years ago
Manikumar Reddy O	51935ee2e6	KAFKA-7091; AdminClient should handle FindCoordinatorResponse errors (#5278 ) - Update KafkaAdminClient implementation to handle FindCoordinatorResponse errors - Remove scala AdminClient usage from core and streams tests Reviewers: Matthias J. Sax <matthias@confluent.io>, Jason Gustafson <jason@confluent.io>	6 years ago
Ismael Juma	7a74ec62d2	MINOR: Avoid FileInputStream/FileOutputStream (#5281 ) They rely on finalizers (before Java 11), which create unnecessary GC load. The alternatives are as easy to use and don't have this issue. Also use FileChannel directly instead of retrieving it from RandomAccessFile whenever possible since the indirection is unnecessary. Finally, add a few try/finally blocks. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	624acc63da	MINOR: Cleanup threads in integration tests (#5269 ) Leftover threads doing network I/O can interfere with subsequent tests. Add missing shutdown in tests and include admin client in the check for leftover threads. Reviewers: Anna Povzner <anna@confluent.io>, Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy O <manikumar.reddy@gmail.com>	6 years ago
Vahid Hashemian	418a91b5d4	KAFKA-4682; Revise expiration semantics of consumer group offsets (KIP-211 - Part 1) (#4896 ) This patch contains the improved offset expiration semantics proposed in KIP-211. Committed offsets will not be expired as long as a group is active. Once all members have left the group, then offsets will be expired after the timeout configured by `offsets.retention.minutes`. Note that the optimization for early expiration of unsubscribed topics will be implemented in a separate patch.	6 years ago
Andy Coates	d2b84bd4b0	Kafka_7064 - bug introduced when switching config commands to ConfigResource (#5245 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Jason Gustafson	3afe2ed8e3	MINOR: Handle segment splitting edge cases and fix recovery bug (#5169 ) This patch fixes the following issues in the log splitting logic added to address KAFKA-6264: 1. We were not handling the case when all messages in the segment overflowed the index. In this case, there is only one resulting segment following the split. 2. There was an off-by-one error in the recovery logic when completing a swap operation which caused an unintended segment deletion. Additionally, this patch factors out of `splitOverflowedSegment` a method to write to a segment from an instance of `FileRecords`. This allows for future reuse and isolated testing. Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	7 years ago
Rajini Sivaram	e8955f731e	KAFKA-7012: Don't process SSL channels without data to process (#5237 ) Avoid unnecessary processing of SSL channels when there are some bytes buffered, but not enough to make progress. Reviewers: Radai Rosenblatt <radai.rosenblatt@gmail.com>, Jun Rao <junrao@gmail.com>	7 years ago
Robert Yokota	d06da1b7f4	KAFKA-7068: Handle null config values during transform (KIP-297) Fix NPE when processing null config values during transform. Author: Robert Yokota <rayokota@gmail.com> Reviewers: Magesh Nandakumar <magesh.n.kumar@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5241 from rayokota/KIP-297-null-config-values	7 years ago
Chia-Ping Tsai	f2cb8523d1	KAFKA-7032 The TimeUnit is neglected by KafkaConsumer#close(long, Tim… (#5182 )	7 years ago
Dhruvil Shah	a8c17e36c3	MINOR: Fix chunked down-conversion behavior when no valid batch exists after conversion (#5173 ) We might decide to drop certain message batches during down-conversion because older clients might not be able to interpret them. One such example is control batches which are typically removed by the broker if down-conversion to V0 or V1 is required. This patch makes sure the chunked down-conversion implementation is able to handle such cases.	7 years ago
Andy Coates	642a97783d	KAFKA-7010: Rename ResourceNameType to PatternType (#5205 ) The initial PR for KIP-290 #5117 added a new `ResourceNameType`, which was initially a field on `Resource` and `ResourceFilter`. However, follow on PRs have now moved the name type fields to new `ResourcePattern` and `ResourcePatternFilter` classes. This means the old name is no longer valid and may be confusing. The PR looks to rename the class to a more intuitive `resource.PatternType`. @cmccabe also requested that the current `ANY` value for this class be renamed to avoid confusion. `PatternType.ANY` currently causes `ResourcePatternFilter` to bring back all ACLs that would affect the supplied resource, i.e. it brings back literal, wildcard ACLs, and also does pattern matching to work out which prefix acls would affect the resource. This is very different from the behaviour of `ResourceType.ANY`, which just means the filter ignores the type of resources. `ANY` is to be renamed to `MATCH` to disambiguate it from other `ANY` filter types. A new `ANY` will be added that works in the same way as others, i.e. it will cause the filter to ignore the pattern type, (but won't do any pattern matching). Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Jason Gustafson	443091b844	KAFKA-7050; Decrease default consumer request timeout to 30s (#5203 ) This patch changes the default `request.timeout.ms` of the consumer to 30 seconds. Additionally, it adds logic to `NetworkClient` and related components to support timeouts at the request level. We use this to handle the special case of the JoinGroup request, which may block for as long as the value configured by `max.poll.interval.ms`. Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Dhruvil Shah	53ca52f855	KAFKA-6979; Add `default.api.timeout.ms` to KafkaConsumer (KIP-266) (#5122 ) Adds a configuration that specifies the default timeout for KafkaConsumer APIs that could block. This was introduced in KIP-266. Reviewers: Satish Duggana <satish.duggana@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Robert Yokota	16190e9bfd	MINOR: Move FileConfigProvider to provider subpackage (#5194 ) This moves FileConfigProvider to the org.apache.kafka.common.config.provider package to more easily isolate provider implementations going forward. Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	7 years ago
Dong Lin	4580d9f16a	MINOR: Remove deprecated per-partition lag metrics It takes O(n^2) time to instantiate a mbean with n attributes which can be very slow if the number of attributes of this mbean is large. This PR removes metrics whose number of attributes can grow with the number of partitions in the cluster to fix the performance issue. These metrics have already been marked for removal in 2.0 by KIP-225. Author: Dong Lin <lindong28@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #5172 from lindong28/remove-deprecated-metrics	7 years ago
Andy Coates	49db5a63c0	KAFKA-7005: Remove duplicate resource class. (#5184 ) This is a follow-on change requested as part of the initial PR for KIP-290 #5117. @cmccabe requested that the `resource.Resource` class be factored out in favour of `ConfigResource` to avoid confusion between all the `Resource` implementations. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Dhruvil Shah	d2b2fbdf94	KAFKA-6264; Split log segments as needed if offsets overflow the indexes (#4975 ) This patch adds logic to detect and fix segments which have overflowed offsets as a result of bugs in older versions of Kafka. Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	7 years ago
Andy Coates	0b3989fd72	KAFKA-7006 - remove duplicate Scala ResourceNameType in preference to… (#5152 ) Remove duplicate Scala ResourceNameType in preference to Java ResourceNameType. This is follow-on work for KIP-290 and PR #5117, which saw the Scala ResourceNameType class introduced. I've added tests to ensure AclBindings can't be created with ResourceNameType.ANY or UNKNOWN. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Andy Coates	b4fa87cc51	KAFKA-7011 - Remove ResourceNameType field from Java Resource class. (#5160 ) The initial PR for KIP-290 #5117 added a `ResourceNameType` field to the Java and Scala `Resource` classes to introduce the concept of Prefixed ACLs. This does not make a lot of sense as these classes are meant to represent cluster resources, which would not have a concept of 'name type'. This work has not been released yet, so we have time to change it. This PR looks to refactor the code to remove the name type field from the Java `Resource` class. (The Scala one will age out once KIP-290 is done, and removing it would involve changes to the `Authorizer` interface, so this class was not touched). This is achieved by replacing the use of `Resource` with `ResourcePattern` and `ResourceFilter` with `ResourcePatternFilter`. A `ResourcePattern` is a combination of resource type, name and name type, where each field needs to be defined. A `ResourcePatternFilter` is used to select patterns during describe and delete operations. The adminClient uses `AclBinding` and `AclBindingFilter`. These types have been switched over to use the new pattern types. The AclCommands class, used by kafka-acls.sh, has been converted to use the new pattern types. The result is that the original `Resource` and `ResourceFilter` classes are not really used anywhere, except in deprecated methods. However, the `Resource` class will be used if/when KIP-50 is done. Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com>	7 years ago
Andy Coates	b3aa655a70	KAFKA-6841: Support Prefixed ACLs (KIP-290) (#5117 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com> Co-authored-by: Piyush Vijay <pvijay@apple.com> Co-authored-by: Andy Coates <big-andy-coates@users.noreply.github.com>	7 years ago
Rajini Sivaram	7c69de42df	MINOR: Rename package `internal` to `internals` for consistency (#5137 )	7 years ago
Mickael Maison	8a166f8c28	KAFKA-6750: Add listener name to authentication context (KIP-282) (#4829 ) PrincipalBuilder implementations can now take the listener into account when creating the Principal. This is especially interesting in deployments where inter-broker traffic is on a different listener than client traffic or when the same protocol is used by multiple listeners. The change in itself is mostly "plumbing" as the listener name needs to be passed from ChannelBuilders all the way down to all classes implementing AuthenticationContext. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk> Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com> Co-authored-by: Mickael Maison <mickael.maison@gmail.com>	7 years ago
Rajini Sivaram	9df3872fbd	KAFKA-3665: Enable TLS hostname verification by default (KIP-294) (#4956 ) Make HTTPS the default ssl.endpoint.identification.algorithm. Reviewers: Ismael Juma <ismael@juma.me.uk>	7 years ago
Jason Gustafson	d02f02130e	MINOR: Fix bug in AdminClient node reassignment following connection failure (#5112 ) We added logic to reassign nodes in callToSend after a connection failure, but we do not handle the case when there is no node currently available to reassign the request to. This can happen when using MetadataUpdateNodeIdProvider if all of the known nodes are blacked out awaiting the retry backoff. To fix this, we need to ensure that the call is added to pendingCalls if a new node cannot be found.	7 years ago
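
As a rough illustration of the KAFKA-7194 entry above (c83ecf4c55), the sketch below shows why re-reading an already advanced buffer underflows on retry, and how handing out a duplicate avoids that. It is not the Kafka code: `AssignmentBufferRetry`, `readAssignment`, and the single-int payload are hypothetical stand-ins, and only standard `java.nio.ByteBuffer` behaviour is relied on.

```java
import java.nio.BufferUnderflowException;
import java.nio.ByteBuffer;

class AssignmentBufferRetry {
    // Hypothetical stand-in for deserializing an assignment: reads one int
    // and advances the position of whichever buffer it is given.
    static int readAssignment(ByteBuffer buffer) {
        return buffer.getInt();
    }

    // Builds a 4-byte buffer holding the value 42, ready for reading.
    static ByteBuffer newAssignmentBuffer() {
        ByteBuffer buffer = ByteBuffer.allocate(4);
        buffer.putInt(42);
        buffer.flip();
        return buffer;
    }

    // Passes the original buffer both times; the retry finds no bytes left.
    static boolean retryUnderflowsWithoutDuplicate() {
        ByteBuffer assignment = newAssignmentBuffer();
        readAssignment(assignment);
        try {
            readAssignment(assignment);
            return false;
        } catch (BufferUnderflowException e) {
            return true;
        }
    }

    // Passes a duplicate each time; the original position never moves,
    // so the retry reads the same value again.
    static int retryWithDuplicate() {
        ByteBuffer assignment = newAssignmentBuffer();
        readAssignment(assignment.duplicate());
        return readAssignment(assignment.duplicate());
    }

    public static void main(String[] args) {
        System.out.println("underflow without duplicate: " + retryUnderflowsWithoutDuplicate());
        System.out.println("value read on retry with duplicate: " + retryWithDuplicate());
    }
}
```

`duplicate()` shares the underlying bytes but keeps its own position and limit, so nothing is copied and the caller's buffer stays readable for a later retry.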

710 Commits (b8559de23d120ca07daa6f66de6bba253d16a74a)