Small follow-up to trunk PR #7423
While debugging the 2.3 VP PR we realized we should remove the leader-tracking from the VP system test altogether. We'd already merged the corresponding trunk PR, so I made a quick new PR for trunk (this also fixes a missed version bump in one of the log messages).
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Implemented KIP-507 to secure the internal Connect REST endpoints that are only for intra-cluster communication. A new V2 of the Connect subprotocol enables this feature, where the leader generates a new session key, shares it with the other workers via the configuration topic, and workers send and validate requests to these internal endpoints using the shared key.
Currently the internal `POST /connectors/<connector>/tasks` endpoint is the only one that is secured.
This change adds unit tests and makes some small alterations to system tests to target the new `sessioned` Connect subprotocol. A new integration test ensures that the endpoint is actually secured (i.e., requests with missing/invalid signatures are rejected with a 400 Bad Request status).
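For illustration, here is a minimal sketch of how a request body could be signed and verified with a shared session key using standard JCA primitives; the class, method, and key names are hypothetical and do not reflect the exact Connect internals:
```
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import javax.crypto.Mac;
import javax.crypto.SecretKey;
import javax.crypto.spec.SecretKeySpec;

public class InternalRequestSignerSketch {

    // Sign the request body with the session key the leader distributed
    // via the config topic (the algorithm choice here is illustrative).
    static String sign(SecretKey sessionKey, byte[] requestBody) throws Exception {
        Mac mac = Mac.getInstance("HmacSHA256");
        mac.init(sessionKey);
        return Base64.getEncoder().encodeToString(mac.doFinal(requestBody));
    }

    // A worker receiving an internal request recomputes the signature and compares;
    // a missing or non-matching signature would be rejected with a 400 Bad Request.
    static boolean verify(SecretKey sessionKey, byte[] requestBody, String signature) throws Exception {
        return sign(sessionKey, requestBody).equals(signature);
    }

    public static void main(String[] args) throws Exception {
        SecretKey key = new SecretKeySpec("example-session-key".getBytes(StandardCharsets.UTF_8), "HmacSHA256");
        byte[] body = "{\"tasks\":[]}".getBytes(StandardCharsets.UTF_8);
        System.out.println("valid=" + verify(key, body, sign(key, body)));
    }
}
```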
Author: Chris Egerton <chrise@confluent.io>
Reviewed: Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>
Instead of sending the leader's version and having older members try to blindly upgrade.
The only other real change here is that we will also set the VERSION_PROBING error code and return early from onAssignment when we are upgrading our used subscription version (not just downgrading it), since this implies the whole group has finished the rolling upgrade and all members should rejoin with the new subscription version.
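As a rough illustration of that early return (a simplified sketch, not the actual StreamsPartitionAssignor code; the field and constant names are made up):
```
// Sketch: if the assignment carries a different subscription version than the
// one this member used, record the new version, flag VERSION_PROBING, and
// trigger a rejoin instead of applying the assignment.
public class VersionProbingSketch {
    static final int NONE = 0;
    static final int VERSION_PROBING = 1; // illustrative error codes

    int usedSubscriptionVersion = 4;
    int assignmentErrorCode = NONE;

    void onAssignment(int versionFromLeader) {
        if (versionFromLeader != usedSubscriptionVersion) {
            // Covers both downgrade and upgrade: the group has settled on a
            // common version, so adopt it and rejoin with the new subscription.
            usedSubscriptionVersion = versionFromLeader;
            assignmentErrorCode = VERSION_PROBING;
            return;
        }
        // ... normal assignment handling would go here ...
    }
}
```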
Also piggy-backing on a fix for a potentially dangerous edge case, where every thread of an instance is assigned the same set of active tasks.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
This change adds a command line option to the `ducker-ak up` command to enable exposing ports from docker containers. The exposed ports are mapped to ephemeral ports on the host. The option is called `expose-ports` and can take either a single value (like 5005) or a range (like 5005-5009). Each such port is then exposed from every docker container that ducker-ak sets up.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, José Armando García Sancio <jsancio@users.noreply.github.com>
* Leader instance uses dictionary encoding on the wire to send topic partitions
* Topic names (the most expensive component) are mapped to an integer using the dictionary (a minimal sketch of the idea follows after this list)
* Follower instances receive the dictionary, decode topic names back
* Purely an on-the-wire optimization, no in-memory structures changed
* Test case added for version 5 AssignmentInfo
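A hedged sketch of the dictionary-encoding idea (not the actual AssignmentInfo wire format; the names are illustrative):
```
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class TopicDictionarySketch {
    private final Map<String, Integer> topicToId = new HashMap<>();
    private final List<String> idToTopic = new ArrayList<>();

    // Leader side: assign each distinct topic name a small integer id,
    // so partitions can be written to the wire as (id, partition) pairs.
    public int encode(String topic) {
        return topicToId.computeIfAbsent(topic, t -> {
            idToTopic.add(t);
            return idToTopic.size() - 1;
        });
    }

    // Follower side: decode the integer id back into the topic name
    // using the dictionary shipped alongside the assignment.
    public String decode(int id) {
        return idToTopic.get(id);
    }
}
```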
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Fix a bug where ClientCompatibilityFeaturesTest fails when running multiple iterations.
Also, fix a typo in tests/docker/Dockerfile.
Reviewers: Ismael Juma <ismael@juma.me.uk>
As part of commit 4d1ee26a136997d31dbd6ddca07e09b34c41c77d the streams version 2.3.0 test jar was added, but there was a simple typo in the path that specified the version. `ducker-ak up` was failing because of that. Fixed that.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
This adds a basic system test that enables rack-aware brokers with the rack-aware replica selector for fetch from followers (KIP-392). The test asserts that the follower was read from at least once and that all the messages that were produced were successfully consumed.
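For reference, the rack-aware setup behind this (per KIP-392) involves the broker configs `broker.rack` and `replica.selector.class` and the consumer config `client.rack`; a minimal sketch, with placeholder values:
```
import java.util.Properties;

public class FetchFromFollowerConfigSketch {
    public static void main(String[] args) {
        // Broker side: advertise the broker's rack and enable the rack-aware selector.
        Properties broker = new Properties();
        broker.put("broker.rack", "rack-a");
        broker.put("replica.selector.class",
                "org.apache.kafka.common.replica.RackAwareReplicaSelector");

        // Consumer side: tell the broker which rack this client lives in so a
        // nearby follower can be chosen as the preferred read replica.
        Properties consumer = new Properties();
        consumer.put("client.rack", "rack-a");
        System.out.println(broker + " / " + consumer);
    }
}
```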
Reviewers: Jason Gustafson <jason@confluent.io>
Corrected the AbstractHerder so that it correctly identifies task configs that contain variables for externalized secrets. The original method incorrectly used `matcher.matches()` instead of `matcher.find()`. The former expects the entire string to match the regex, whereas the latter can find the pattern anywhere within the input string (which fits this use case more correctly).
Added unit tests to cover various cases of a config with externalized secrets, and updated system tests to cover the case where a config value contains additional characters besides the secret, which requires the regex pattern to be found anywhere in the string (as opposed to a complete match).
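The distinction is easy to see with a small standalone example (the pattern below is a simplified stand-in, not the exact one used in AbstractHerder):
```
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class MatchesVsFind {
    // Simplified config-variable pattern of the form ${provider:key}.
    private static final Pattern VARIABLE = Pattern.compile("\\$\\{(.+?):(.+?)\\}");

    public static void main(String[] args) {
        String value = "jdbc:mysql://host/db?password=${file:db-pass}";
        Matcher m = VARIABLE.matcher(value);
        // matches() requires the whole value to be a variable reference -> false here.
        System.out.println(m.matches());
        m.reset();
        // find() locates a variable reference anywhere in the value -> true here.
        System.out.println(m.find());
    }
}
```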
Author: Arjun Satish <arjun@confluent.io>
Reviewer: Randall Hauch <rhauch@gmail.com>
To avoid transient system test failures, tolerate a small amount of data loss due to truncation in upgrade system tests that use an older message format prior to KIP-101, where data loss was possible.
Reviewers: Ismael Juma <ismael@juma.me.uk>
ZooKeeper 3.5.5 is the first stable release in the 3.5.x series. The key new feature is TLS support, but there are a few more noteworthy features:
* Dynamic reconfiguration
* Local sessions
* New node types: Container, TTL
* Ability to remove watchers
* Multi-threaded commit processor
* Upgraded to Netty 4.1
See the release notes for more detail:
https://zookeeper.apache.org/doc/r3.5.5/releasenotes.html
In addition to the version bump, we:
* Add `commons-cli` dependency as it's required by `ZooKeeperMain`, but specified as
`provided` in their pom.
* Remove the unnecessary `ZooKeeperMainWrapper`; the bug it worked around was fixed
upstream a long time ago.
* Ignore non-zero exit in one system test invocation of `ZooKeeperMain`.
`ZooKeeperMainWrapper` always returned `0` and `ZooKeeperService.query` relies
on that for correct behavior.
Reviewers: Jason Gustafson <jason@confluent.io>
ZkUtils was removed, so the zkclient dependency is no longer needed.
Also:
* Fix ZkSecurityMigrator and ReplicaManagerTest not to
reference ZkClient classes.
* Remove references to zkclient in various `log4j.properties`
and `import-control.xml`.
Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>
Follow on to #6731, this PR adds broker-side support for [KIP-392](https://cwiki.apache.org/confluence/display/KAFKA/KIP-392%3A+Allow+consumers+to+fetch+from+closest+replica) (fetch from followers).
Changes:
* All brokers will handle FetchRequest regardless of leadership
* Leaders can compute a preferred replica to return to the client
* New ReplicaSelector interface for determining the preferred replica (a simplified sketch of the idea follows after this list)
* Incremental fetches will include partitions with no records if the preferred replica has been computed
* Adds a new JMX attribute to expose the current preferred read replica of a partition in the consumer
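As referenced above, here is a simplified sketch of what a rack-aware selection strategy could look like; the types and method signature are stand-ins and do not match the actual broker-side `ReplicaSelector` plugin API:
```
import java.util.Optional;
import java.util.Set;

// Simplified stand-in for broker replica metadata.
record ReplicaInfo(int brokerId, String rack, boolean isLeader) {}

public class RackAwareSelectionSketch {
    // Prefer an in-sync replica in the same rack as the client; otherwise fall back to the leader.
    static Optional<ReplicaInfo> select(String clientRack, Set<ReplicaInfo> inSyncReplicas) {
        Optional<ReplicaInfo> sameRack = inSyncReplicas.stream()
                .filter(r -> clientRack != null && clientRack.equals(r.rack()))
                .findFirst();
        if (sameRack.isPresent()) {
            return sameRack;
        }
        return inSyncReplicas.stream().filter(ReplicaInfo::isLeader).findFirst();
    }
}
```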
Two new conditions were added for completing a delayed fetch. They both relate to communicating the high watermark to followers without waiting for a timeout:
* For regular fetches, if the high watermark changes within a single fetch request
* For incremental fetch sessions, if the follower's high watermark is lower than the leader's
A new JMX attribute `preferred-read-replica` was added to the `kafka.consumer:type=consumer-fetch-manager-metrics,client-id=some-consumer,topic=my-topic,partition=0` object. This was added to support the new system test which verifies that the fetch from follower behavior works end-to-end. This attribute could also be useful in the future when debugging problems with the consumer.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jun Rao <junrao@gmail.com>, Jason Gustafson <jason@confluent.io>
Connect tests were using a String version for KafkaService instead of the expected KafkaVersion object. This broke due to recent changes to KafkaVersion. It turns out that the tests with the String version were running compatibility tests against `dev` brokers rather than the older broker versions they were expected to run against. When the version was fixed, tests using 0.9.0.1 brokers started failing, since new clients are not compatible with 0.9.0.1 brokers. So this PR fixes the version parameter and removes the two tests against 0.9.0.1 brokers.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>
This adds a new Trogdor fault spec for inducing network latency on a network device for system testing. It operates very similarly to the existing network partition spec, by executing the `tc` Linux utility.
This PR fixes a bug in static group membership. Previously we limited the `member.id` replacement in JoinGroup to cases where the group is in the Stable state. This is error-prone and could potentially allow duplicate consumers reading from the same topic. For example, imagine a case where two unknown members join in the `PrepareRebalance` stage at the same time.
The PR fixes the following things:
1. Replace the `member.id` any time a known static member rejoins the group with an unknown member.id.
2. Immediately fence any ongoing join/sync group callback to terminate the duplicate member early.
3. Clearly handle the Dead/Empty cases as exceptional.
4. Return the old leader id when a static member leader rejoins, to avoid triggering a trivial member assignment.
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>
We've seen `ReplicaVerificationToolTest.test_replica_lags` fail occasionally due to errors such as the following:
```
RemoteCommandError: ubuntuworker7: Command 'kill -15 2896' returned non-zero exit status 1. Remote error message: bash: line 0: kill: (2896) - No such process
```
The problem seems to be a shutdown race condition when using `max_messages` with the producer: the process may already be gone, which causes the signal to fail.
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Gwen Shapira
Closes #6906 from hachikuji/fix-failing-replicat-verification-test
We see the upgrade test failing from time to time. I looked into it and found that the root cause is basically that the test throughput can be too high for the 0.9 producer to make progress. Eventually it reaches a point where it has a huge backlog of timed out requests in the accumulator which all have to be expired. We see a long run of messages like this in the output:
```
{"exception":"class org.apache.kafka.common.errors.TimeoutException","time_ms":1559907386132,"name":"producer_send_error","topic":"test_topic","message":"Batch Expired","class":"class org.apache.kafka.tools.VerifiableProducer","value":"335160","key":null}
{"exception":"class org.apache.kafka.common.errors.TimeoutException","time_ms":1559907386132,"name":"producer_send_error","topic":"test_topic","message":"Batch Expired","class":"class org.apache.kafka.tools.VerifiableProducer","value":"335163","key":null}
{"exception":"class org.apache.kafka.common.errors.TimeoutException","time_ms":1559907386133,"name":"producer_send_error","topic":"test_topic","message":"Batch Expired","class":"class org.apache.kafka.tools.VerifiableProducer","value":"335166","key":null}
{"exception":"class org.apache.kafka.common.errors.TimeoutException","time_ms":1559907386133,"name":"producer_send_error","topic":"test_topic","message":"Batch Expired","class":"class org.apache.kafka.tools.VerifiableProducer","value":"335169","key":null}
```
This can continue for a long time (I have observed up to 1 min) and prevents the producer from successfully writing any new data. While it is busy expiring the batches, no data is getting delivered to the consumer, which causes it to eventually raise a timeout.
```
kafka.consumer.ConsumerTimeoutException
at kafka.consumer.NewShinyConsumer.receive(BaseConsumer.scala:50)
at kafka.tools.ConsoleConsumer$.process(ConsoleConsumer.scala:109)
at kafka.tools.ConsoleConsumer$.run(ConsoleConsumer.scala:69)
at kafka.tools.ConsoleConsumer$.main(ConsoleConsumer.scala:47)
at kafka.tools.ConsoleConsumer.main(ConsoleConsumer.scala)
```
The fix here is to reduce the throughput, which seems reasonable since the purpose of the test is to verify the upgrade, which does not demand heavy load. Note that I investigated several failing instances of this test going back to 1.0 and saw a similar pattern, so there does not appear to be a regression.
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Gwen Shapira
Closes #6907 from hachikuji/lower-throughput-for-upgrade-test
As the title suggests, we bring up a streams job with three stream instances and a one-minute session timeout, and once the group is stable, we do a couple of rolling bounces of the entire cluster. Every rejoin triggered by a restart should cause no generation bump on the client side.
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bbejeck@gmail.com>
Reviewers: Bill Bejeck <bill@confluent.io>, Boyang Chen <boyang@confluent.io>, Bruno Cadonna <bruno@confluent.io>, Guozhang Wang <guozhang@confuent.io>
These are important to ensure we don't break compatibility.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Gwen Shapira
Closes #6794 from ijuma/update-version-compat-tests
For static member join/rejoin, we encode the current timestamp in the new member.id; the format looks like group.instance.id-timestamp.
During consumer/broker interaction (Join, Sync, Heartbeat, Commit), we check whether the group.instance.id is known to the group. If it is, we match the member.id stored in the static membership map against the request's member.id. A mismatch indicates that a conflicting consumer has used the same group.instance.id, and it will receive a fatal exception telling it to shut down.
Right now the only missing part is the system test. Will work on it offline while getting the major logic changes reviewed.
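A minimal sketch of that scheme (names and structure are illustrative, not the actual group coordinator code):
```
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class StaticMembershipSketch {
    // group.instance.id -> the member.id currently registered for that instance.
    private final Map<String, String> staticMembers = new ConcurrentHashMap<>();

    // On join/rejoin of a static member: derive a fresh member.id of the form
    // group.instance.id-timestamp and record it.
    String registerStaticMember(String groupInstanceId) {
        String memberId = groupInstanceId + "-" + System.currentTimeMillis();
        staticMembers.put(groupInstanceId, memberId);
        return memberId;
    }

    // On Join/Sync/Heartbeat/Commit: the request's member.id must match the one
    // on record; a mismatch means a conflicting consumer reused the same
    // group.instance.id and should be fenced with a fatal error.
    boolean isCurrentMember(String groupInstanceId, String requestMemberId) {
        return requestMemberId.equals(staticMembers.get(groupInstanceId));
    }
}
```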
Reviewers: Ryanne Dolan <ryannedolan@gmail.com>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Corrects the system tests to check for either a 404 or a 409 error and to sleep until the Connect REST API becomes available. This accounts for a previous change to how REST extensions are initialized (#6651), which made it possible for Connect to return a 404 if the resources are not yet started. The integration tests were already looking for a 409.
Author: Magesh Nandakumar <magesh.n.kumar@gmail.com>
Reviewer: Randall Hauch <rhauch@gmail.com>
This is the first diff for the implementation of the JoinGroup logic for static membership. This diff contains:
* Add group.instance.id as a unique identifier for consumer instances, provided by the end user.
* Modify the group coordinator to accept JoinGroupRequests with/without static membership, and refactor the logic for readability and code reusability.
* Add client-side support for incorporating static membership changes, including a new config for group.instance.id, applying the stream thread client id by default, and new join group exception handling (see the config sketch after this list).
* Increase the max session timeout to 30 min to give users more flexibility if they are inclined to tolerate partial unavailability rather than the burden of a rebalance.
* Unit tests for each module change, especially the group coordinator logic, crossing possibilities like:
  * Dynamic/static member
  * Known/unknown member id
  * Group stable/unstable
  * Leader/follower
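As mentioned in the client-side item above, here is a hedged sketch of how the new config might be set on a consumer; the broker address, ids, and timeout value are placeholders, while the config constants are the standard ConsumerConfig keys:
```
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerConfig;

public class StaticMembershipConfigSketch {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder
        props.put(ConsumerConfig.GROUP_ID_CONFIG, "my-group");                // placeholder
        // Static membership: a unique, user-provided identifier for this consumer instance.
        props.put(ConsumerConfig.GROUP_INSTANCE_ID_CONFIG, "consumer-instance-1");
        // A longer session timeout lets a bounced instance return without triggering a rebalance.
        props.put(ConsumerConfig.SESSION_TIMEOUT_MS_CONFIG, 120_000);
        System.out.println(props);
    }
}
```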
The rest of the KIP-345 change will be broken down into 4 separate diffs:
* Avoid kicking out members via rebalance.timeout; only do the kick-out via session timeout.
* Changes around the LeaveGroup logic, including version bumping, broker logic, client logic, etc.
* Admin client changes to add the ability to batch-remove static members.
* Deprecate group.initial.rebalance.delay.
Reviewers: Liquan Pei <liquanpei@gmail.com>, Stanislav Kozlovski <familyguyuser192@windowslive.com>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
As titled, this PR fixes the default reset policy for system tests, which had accidentally been changed to latest when it should in fact be earliest.
Reviewers: Guozhang Wang <wangguoz@gmail.com>