src-kafka

Commit Graph

Author	SHA1	Message	Date
sdreynolds	89f331eac3	KAFKA-8229; Reset WorkerSinkTask offset commit interval after task commit (#6579 ) Prior to this change, the next commit time advances _each_ time a commit happens -- including when a commit happens because it was requested by the `Task`. When a `Task` requests a commit several times, the clock advances far into the future which prevents expected periodic commits from happening. This commit changes the behavior, we reset `nextCommit` relative to the time of the commit. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	5f06999cf3	MINOR: Remove ControllerEventManager metrics on close (#6788 ) Remove created metrics when shutting down `ControllerEventManager`. This fixes transient failures in `ControllerEventManagerTest.testEventQueueTime` and is generally good hygiene. Reviewers: José Armando García Sancio <jsancio@gmail.com>, Ismael Juma <ismael@juma.me.uk>	6 years ago
Boyang Chen	cafdc1e7df	KAFKA-8399: bring back internal.leave.group.on.close config for KStream (#6779 ) As title states. We plan to merge this to both trunk and 2.3 if it could fix the stream system tests globally. Reference implementation: #6673 Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>	6 years ago
Manikumar Reddy	5ca6a2ee94	MINOR: Use `jps` cmd to find out the pid of TransactionalMessageCopier Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #6787 from omkreddy/transaction_test	6 years ago
Jason Gustafson	4f11090597	HOTFIX: Fix recent protocol breakage from KIP-345 and KIP-392 (#6780 ) KIP-345 and KIP-392 introduced a couple breaking changes for old versions of bumped protocols. This patch fixes them. Reviewers: Colin Patrick McCabe <cmccabe@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Boyang Chen <bchen11@outlook.com>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
David Arthur	bacb45e044	MINOR: Set `replicaId` for OffsetsForLeaderEpoch from followers (#6775 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	ce5ce2d569	MINOR: A few logging improvements in the broker (#6773 ) Reviewers: Boyang Chen <bchen11@outlook.com>, Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Rajini Sivaram	012880d424	KAFKA-8052; Ensure fetch session epoch is updated before new request (#6582 ) Reviewers: Jason Gustafson <jason@confluent.io>, Colin Patrick McCabe <cmccabe@confluent.io>, Andrew Olson <aolson1@cerner.com>, José Armando García Sancio <jsancio@users.noreply.github.com>	6 years ago
John Roesler	3b5d7aee6c	KAFKA-8315: fix the JoinWindows retention deprecation doc (#6664 ) Fix a javadoc mistake introduced in https://github.com/apache/kafka/pull/5911/files#diff-35e3523474fa277a63e36a3fe9e22af8. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Magesh Nandakumar	126230dad0	KAFKA-8265: Fix override config name to match KIP-458. (#6776 ) Author: Magesh Nandakumar <magesh.n.kumar@gmail.com> Reviewer: Randall Hauch <rhauch@gmail.com>	6 years ago
Manikumar Reddy	d77bac1c93	KAFKA-3143: Controller should transition offline replicas on startup Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io> Closes #5041 from omkreddy/KAFKA-3143	6 years ago
Colin Hicks	a420abf2db	MINOR: Work around OpenJDK 11 javadocs issue. (#6747 ) Some versions of OpenJDK 11 do not properly handle external javadocs links referencing previous Java versions. See: https://bugs.openjdk.java.net/browse/JDK-8212233. Failure symptom: `> Task :connect:api:javadoc javadoc: error - The code being documented uses modules but the packages defined in https://docs.oracle.com/javase/8/docs/api/ are in the unnamed module. 1 error` This PR conditionally sets the Java api docs link for the affected Gradle tasks. I verified that the links render correctly in the generated documentation when building with `1.8.0_181` and `11.0.3`. For example, in `build/docs/javadoc/org/apache/kafka/connect/source/SourceTask.html` the hyperlink to `java.nio.channels.Selector` points to a valid page on Oracle's site in both cases. Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Colin Patrick McCabe	87ff83a82e	MINOR: Bump version to 2.4.0-SNAPSHOT (#6774 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Magesh Nandakumar	7d70133b75	KAFKA-8265: Fix config name to match KIP-458. (#6755 ) Return a copy of the ConfigDef in Client Configs. Related to KIP-458. Author: Magesh Nandakumar <magesh.n.kumar@gmail.com Reviewer: Randall Hauch <rhauch@gmail.com>	6 years ago
Lee Dongjin	b43f5446ac	KAFKA-8316; Remove deprecated usage of Slf4jRequestLog, SslContextFactory (#6668 ) * Remove deprecated class Slf4jRequestLog: use Slf4jRequestLogWriter, CustomRequestLog instread. 1. Remove '@SuppressWarnings("deprecation")' from RestServer#initializeResources, JsonRestServer#start. 2. Remove unused JsonRestServer#httpRequest. * Fix deprecated class usage: SslContextFactory -> SslContextFactory.[Server, Client] 1. Split SSLUtils#createSslContextFactory into SSLUtils#create[Server, Client]SideSslContextFactory: each method instantiates SslContextFactory.[Server, Client], respectively. 2. SSLUtils#configureSslContextFactoryAuthentication is called from SSLUtils#createServerSideSslContextFactory only. 3. Update SSLUtilsTest following splittion; for client-side SSL Context Factory, SslContextFactory#get[Need, Want]ClientAuth is always false. (SSLUtilsTest#testCreateClientSideSslContextFactory) Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	614ea55ad7	KAFKA-8381; Disable hostname validation when verifying inter-broker SSL (#6757 ) - Make endpoint validation configurable on SslEngineBuilder when creating an engine - Disable endpoint validation for engines created for inter-broker SSL validation since it is unsafe to use `localhost` - Use empty hostname in validation engine to ensure tests fail if validation is re-enabled by mistake - Add tests to verify inter-broker SSL validation Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	6 years ago
Stanislav Kozlovski	5a30a806ec	MINOR: Add log when the consumer does not send an offset commit due to not being part of an active group (#6404 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Bill Bejeck	11a8a8d274	KAFKA-8290: Close producer for zombie task (#6636 ) When we close a task and EOS is enabled we should always close the producer regardless if the task is in a zombie state (the broker fenced the producer) or not. I've added tests that fail without this change. Reviewers: Matthias J. Sax <mjsax@apache.org>, Jason Gustafson <jason@confluent.io>	6 years ago
Kengo Seki	fc616cb521	MINOR: Update command options for kafka-console-consumer.sh in vagrant/README.md (#6689 ) The following command in vagrant/README.md doesn't work, since `--zookeeper` option has been unsuppored from v2.0.0. This PR updates its command options to fix it. ``` bin/kafka-console-consumer.sh --zookeeper zk1:2181 --topic sandbox --from-beginning ``` Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
John Roesler	c140f09406	KAFKA-6474: remove KStreamTestDriver (#6732 ) The implementation of KIP-258 broke the state store methods in KStreamTestDriver. These methods were unused in this project, so the breakage was not detected. Since this is an internal testing utility, and it was deprecated and partially removed in favor of TopologyTestDriver, I opted to just complete the removal of the class. Reviewers: A. Sophie Blee-Goldman <ableegoldman@gmail.com>, Boyang Chen <boyang@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Jason Gustafson	b52170372b	MINOR: Increase security test timeouts for transient failures (#6760 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Boyang Chen	e00c0d316d	MINOR: Fix typo in heartbeat request protocol definition (#6759 ) This changes the field "generationid" to "generationId" to be consistent with other uses. Reviewers: Shaobo Liu <lambda.tencent@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Kengo Seki	a29a005316	KAFKA-8349; Add Windows batch files corresponding to kafka-delete-records.sh and kafka-log-dirs.sh (#6709 ) Some shell scripts don't have corresponding batch files in bin\windows. For improving Windows platform support, This PR adds the following batch files: - bin\windows\kafka-delete-records.bat - bin\windows\kafka-log-dirs.bat Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
commandini	51c72ad025	MINOR: Fixed broken link to the IBM article about j-zerocopy (#6749 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	9fa331b811	KAFKA-8225 & KIP-345 part-2: fencing static member instances with conflicting group.instance.id (#6650 ) For static members join/rejoin, we encode the current timestamp in the new member.id. The format looks like group.instance.id-timestamp. During consumer/broker interaction logic (Join, Sync, Heartbeat, Commit), we shall check the whether group.instance.id is known on group. If yes, we shall match the member.id stored on static membership map with the request member.id. If mismatching, this indicates a conflict consumer has used same group.instance.id, and it will receive a fatal exception to shut down. Right now the only missing part is the system test. Will work on it offline while getting the major logic changes reviewed. Reviewers: Ryanne Dolan <ryannedolan@gmail.com>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
David Arthur	e2847e8603	KAFKA-8365; Consumer and protocol support for follower fetching (#6731 ) This patch includes API changes for follower fetching per [KIP-392](https://cwiki.apache.org/confluence/display/KAFKA/KIP-392%3A+Allow+consumers+to+fetch+from+closest+replica) as well as the consumer implementation. After this patch, consumers will continue to fetch only from the leader, since the broker implementation to select an alternate read replica is not included here. Adds new `client.rack` consumer configuration property is added which allows the consumer to indicate its rack. This is just an arbitrary string to indicate some relative location, it doesn't have to actually represent a physical rack. We are keeping the naming consistent with the broker property (`broker.rack`). FetchRequest now includes `rack_id` which can optionally be specified by the consumer. FetchResponse includes an optional `preferred_read_replica` field for each partition in the response. OffsetForLeaderEpochRequest also adds new `replica_id` field which is similar to the same field in FetchRequest. When the consumer sees a `preferred_read_replica` in a fetch response, it will use the Node with that ID for the next fetch. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Jason Gustafson	d6057a9fe4	MINOR: Remove spammy log message during topic deletion Deletion of a large number of topics can cause a ton of log spam. In a test case on 2.2, deletion of 50 topics with 100 partitions each caused about 158 Mb of data in the controller log. With the improvements to batch StopReplica and the patch here, we reduce that to about 1.5 Mb. Kudos to gwenshap for spotting these spammy messages. Author: Jason Gustafson <jason@confluent.io> Reviewers: Gwen Shapira Closes #6738 from hachikuji/remove-verbose-topic-deletion-log-message	6 years ago
Matthias J. Sax	6a2749faa6	KAFKA-6455: Improve DSL operator timestamp semantics (#6725 ) Basic idea: KTable-KTable join: set max(left-ts,right-ts) for result #agg(...) (stream/table windowed/non-windowed): set max(ts1, ts2, ts3,...) of all input records that contribute to the aggregation result for all stateless transformation: input-ts -> output-ts Reviewers: Guozhang Wang <wangguoz@gmail.com>, John Roesler <john@confluent.io>, Andy Coates <andy@confluent.io>, Bill Bejeck <bbejeck@gmail.com	6 years ago
Matthias J. Sax	e018bfe2b0	KAFKA-3522: TopologyTestDriver should only return custom stores via untyped getStateStore() method (#6756 ) Reviewers: Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
Aishwarya Gune	414852c701	KAFKA-8346; Improve replica fetcher behavior for handling partition failure [KIP-461] (#6716 ) The replica fetcher thread is terminated in case a partition crashes which leads to under replication. This behavior can be improved by dropping the failed partition. The thread can continue monitoring the rest of the partitions. If all partitions of a thread have failed, the thread would be shut down. This is documented in KIP-461: https://cwiki.apache.org/confluence/display/KAFKA/KIP-461+-+Improve+Replica+Fetcher+behavior+at+handling+partition+failure. Reviewers: Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Bill Bejeck	9077d83672	MINOR: Add select changes from 3rd KIP-307 PR for incrementing name index counter (#6754 ) When users provide a name for operation via the Streams DSL, we need to increment the counter used for auto-generated names to make sure any operators downstream of a named operator still produce a compatible name. This PR is a subset of #6411 by @fhussonnois. We need to merge this PR now because it covers cases when users name repartition topics or state stores. Updated tests to reflect the counter produces expected number even when the user provides a name. Matthias J. Sax <mjsax@apache.org>, John Roesler <john@confluent.io>	6 years ago
A. Sophie Blee-Goldman	16769d263e	KAFKA-8215: Upgrade Rocks to v5.18.3 (#6743 ) This upgrade exposes a number of new options, including the WriteBufferManager which -- along with existing TableConfig options -- allows users to limit the total memory used by RocksDB across instances. This can alleviate some cascading OOM potential when, for example, a large number of stateful tasks are suddenly migrated to the same host. The RocksDB docs guarantee backwards format compatibility across versions Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>,	6 years ago
Matthias J. Sax	a1286edb04	MINOR: Fix race condition in Streams tests (#6748 ) Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
Shaobo Liu	64c2d49cf5	MINOR: Add test for ConsumerNetworkClient.trySend (#6739 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Rajini Sivaram	8de7d37724	KAFKA-8379; Fix KafkaAdminClientTest.testUnreachableBootstrapServer (#6753 ) Initiate `unreachable server` scenario before starting admin client to avoid timing issues if node is disconnected from the test thread while admin client network thread is processing a metadata request. Reviewers: Ismael Juma <ismael@juma.me.uk>	6 years ago
Magesh Nandakumar	2e91a310d7	KAFKA-8265: Initial implementation for ConnectorClientConfigPolicy to enable overrides (KIP-458) (#6624 ) Implementation to enable policy for Connector Client config overrides. This is implemented per the KIP-458. Reviewers: Randall Hauch <rhauch@gmail.com>	6 years ago
Konstantine Karantasis	ce584a01ff	KAFKA-5505: Incremental cooperative rebalancing in Connect (KIP-415) (#6363 ) Added the incremental cooperative rebalancing in Connect to avoid global rebalances on all connectors and tasks with each new/changed/removed connector. This new protocol is backward compatible and will work with heterogeneous clusters that exist during a rolling upgrade, but once the clusters consist of new workers only some affected connectors and tasks will be rebalanced: connectors and tasks on existing nodes still in the cluster and not added/changed/removed will continue running while the affected connectors and tasks are rebalanced. This commit attempted to minimize the changes to the existing V0 protocol logic, though that was not entirely possible. This commit adds extensive unit and integration tests for both the old V0 protocol and the new v1 protocol. Soak testing has been performed multiple times to verify behavior while connectors and added, changed, and removed and while workers are added and removed from the cluster. Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <me@ewencp.org>, Robert Yokota <rayokota@gmail.com>, David Arthur <mumrah@gmail.com>, Ryanne Dolan <ryannedolan@gmail.com>	6 years ago
dan norwood	5a95c2e1cd	Add '?expand' query param for additional info on '/connectors'. (#6658 ) Per KIP-465, kept existing behavior of `/connectors` resource in the Connect's REST API, but added the ability to specify `?expand` query parameter to get list of connectors with status details on each connector. Added unit tests, and verified passing existing system tests (which use the older list form). See https://cwiki.apache.org/confluence/display/KAFKA/KIP-465%3A+Add+Consolidated+Connector+Endpoint+to+Connect+REST+API. Author: Dan Norwood <norwood@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	6 years ago
Jason Gustafson	26814e060e	KAFKA-8376; Least loaded node should consider connections which are being prepared (#6746 ) This fixes a regression caused by KAFKA-8275. The least loaded node selection should take into account nodes which are currently being connect to. This includes both the CONNECTING and CHECKING_API_VERSIONS states since `canSendRequest` would return false in either case. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago
Mickael Maison	855f899bb5	KAFKA-8256; Replace Heartbeat request/response with automated protocol (#6691 ) Reviewers: Boyang Chen <bchen11@outlook.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Matthias J. Sax	16b408898e	KAFAK-3522: Add TopologyTestDriver unit tests (#6179 ) Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	6 years ago
A. Sophie Blee-Goldman	8078427104	KAFKA-8347: Choose next record to process by timestamp (#6719 ) When choosing the next record to process, we should look at the head record's timestamp of each partition and choose the lowest rather than choosing the lowest of the partition's streamtime. This change effectively makes RecordQueue return the timestamp of the head record rather than its streamtime. Streamtime is removed (replaced) from RecordQueue as it was only being tracked in order to choose the next partition to poll from. Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>	6 years ago
sandmannn	b96aa003b6	MINOR: Added missing method parameter to `performAssignment` javadoc (#6744 ) Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Boyang Chen	6e6dcceb93	KAFKA-8220; Avoid kicking out static group members through rebalance timeout (#6666 ) To make static consumer group members more persistent, we want to avoid kicking out unjoined members through rebalance timeout. Essentially we allow static members to participate in a rebalance using their old subscription without sending a JoinGroup. The only catch is that an unjoined static member might be the current group leader, and we may need to elect a different leader. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Jason Gustafson <jason@confluent.io>	6 years ago
Randall Hauch	b395ef4182	KAFKA-3816: Add MDC logging to Connect runtime (#5743 ) See https://cwiki.apache.org/confluence/display/KAFKA/KIP-449%3A+Add+connector+contexts+to+Connect+worker+logs Added LoggingContext as a simple mechanism to set and unset Mapped Diagnostic Contexts (MDC) in the loggers to provide for each thread useful parameters that can be used within the logging configuration. MDC avoids having to modify lots of log statements, since the parameters are available to all log statements issued by the thread, no matter what class makes those calls. The design intentionally minimizes the number of changes to any existing classes, and does not use Java 8 features so it can be easily backported if desired, although per this KIP it will be applied initially only in AK 2.3 and later and must be enabled via the Log4J configuration. Reviewers: Jason Gustafson <jason@conflent.io>, Guozhang Wang <wangguoz@gmail.com>	6 years ago
Konstantine Karantasis	2327b35558	MINOR: Enable console logs in Connect tests (#6745 ) Author: Konstantine Karantasis <konstantine@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	6 years ago
Magesh Nandakumar	5928ffd0dc	KAFKA-8320 : fix retriable exception package for source connectors (#6675 ) WorkerSourceTask is catching the exception from wrong package org.apache.kafka.common.errors. It is not clear from the API standpoint as to which package the connect framework supports - the one from common or connect. The safest thing would be to support both the packages even though it's less desirable. Author: Magesh Nandakumar <magesh.n.kumar@gmail.com> Reviewers: Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com>	6 years ago
Boyang Chen	2208f9966d	KAFKA-8354; Replace Sync group request/response with automated protocol (#6729 ) Update SyncGroup API to use the generated protocol classes. Reviewers: Jason Gustafson <jason@confluent.io>	6 years ago
Paul Davidson	fbb09952ac	KAFKA-5061 - Make default Worker Task client IDs distinct (#6097 ) Use the task ID to make the default client IDs used by Worker Tasks distinct and stable. This is avoids name conflicts on JMX MBeans and enables useful monitoring. This implements https://cwiki.apache.org/confluence/display/KAFKA/KIP-411%3A+Make+default+Kafka+Connect+worker+task+client+IDs+distinct. See: https://issues.apache.org/jira/browse/KAFKA-5061 Author: Paul Davidson <> Reviewer: Cyrus Vafadari <cyrusv@alum.mit.edu>, Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com>	6 years ago
Colin Patrick McCabe	0494cd329f	MINOR: Refactor SslFactory (#6674 ) SslFactory: split the part of SslFactory that creates SSLEngine instances into SslEngineBuilder. When (re)configuring, we simply create a new SslEngineBuilder. This allows us to make all the builder fields immutable. It also simplifies the logic for reconfiguring. Because we sometimes need to test old SslEngine instances against new ones, being able to use both the old and the new builder at once is useful. Create an enum named SslClientAuth which encodes the possible values for ssl.client.auth. This will simplify the handling of this configuration. SslTransportLayer#maybeProcessHandshakeFailure should treat an SSLHandshakeException with a "Received fatal alert" message as a handshake error (and therefore an authentication error.) SslFactoryTest: add some line breaks for very long lines. ConfigCommand#main: when terminating the command due to an uncaught exception, log the exception using debug level in slf4j, in addition to printing it to stderr. This makes it easier to debug failing junit tests, where stderr may not be kept, or may be reordered with respect to other slf4j messages. The use of debug level is consistent with how we handle other types of exceptions in ConfigCommand#main. StateChangeLogMerger#main: spell out the full name of scala.io.Source rather than abbreviating it as io.Source. This makes it clearer that it is part of the Scala standard library. It also avoids compiler errors when other libraries whose groupId starts with "io" are used in the broker. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	6 years ago

... 3 4 5 6 7 ...

6434 Commits (e5f7220b23ba556352d80a0575fcb6cbfe2d576d) All Branches Search

6434 Commits (e5f7220b23ba556352d80a0575fcb6cbfe2d576d)

All Branches