src-kafka

Commit Graph

Author	SHA1	Message	Date
Bill Bejeck	286216b56e	MINOR: Rolling bounce upgrade fixed broker system test (#4690 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	0f364cd53a	MINOR: Pass a streams config to replace the single state dir (#4714 ) This is a general change and is re-requisite to allow streams benchmark test with different streams tests. For the streams benchmark itself I will have a separate PR for switching configs. Details: 1. Create a "streams.properties" file under PERSISTENT_ROOT before all the streams test. For now it will only contain a single config of state.dir pointing to PERSISTENT_ROOT. 2. For all the system test related code, replace the main function parameter of state.dir with propsFilename, then inside the function load the props from the file and apply overrides if necessary. 3. Minor fixes. Matthias J. Sax <matthias@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Ewen Cheslack-Postava	f264bfa296	KAFKA-6676: Ensure Kafka chroot exists in system tests and use chroot on one test with security parameterizations (#4729 ) Ensures Kafka chroot exists in ZK when starting KafkaService so commands that use ZK and are executed before the first Kafka broker starts do not fail due to the missing chroot. Also uses chroot with one test that also has security parameterizations so Kafka's test suite exercises these combinations. Previously no tests were exercising chroots. Changes were validated using sanity_checks which include the chroot-ed test as well as some non-chroot-ed tests.	7 years ago
Matthias J. Sax	7fe06a8666	MINOR: fix flaky Streams EOS system test (#4728 ) Reviewer: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Chris Egerton	b6ae51b108	Allow users to specify 'if-not-exists' when creating topics while testing (#4715 ) Author: Chris Egerton <chrise@confluent.io>	7 years ago
John Roesler	7006d0f58b	MINOR: Streams system tests fixes/updates (#4689 ) Some changes required to get the Streams system tests working via Docker To test: TC_PATHS="tests/kafkatest/tests/streams" bash tests/docker/run_tests.sh That command will take about 3.5 hours, and should pass. Note there are a couple of ignored tests. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Bill Bejeck <bill@confluent.io>	7 years ago
Guozhang Wang	e5d6c9a79a	MINOR: Do not start processor for bounce-at-start (#4639 ) Only start it after the broker has been shutdown.	7 years ago
Bill Bejeck	8a7d7e7955	MINOR: Add System test for standby task-rebalancing (#4554 ) Author: Bill Bejeck <bill@confluent.io> Reviewers: Damian Guy <damian@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Matthias J. Sax	0cd83e997d	MINOR: wait for broker startup for system tests (#4363 ) ensure that brokers are registered at ZK before start() returns Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>, Damian Guy <damian@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Randall Hauch	fc19c3e6f2	KAFKA-6577: Fix Connect system tests and add debug messages NOTE: This should be backported to the `1.1` branch, and is currently a blocker for 1.1. The `connect_test.py::ConnectStandaloneFileTest.test_file_source_and_sink` system test is failing with the SASL configuration without a sufficient explanation. During the test, the Connect worker fails to start, but the Connect log contains no useful information. There are actual several things compounding to cause the failure and make it difficult to understand the problem. First, the `tests/kafkatest/tests/connect/templates/connect_standalone.properties` is only adding in the broker's security configuration with the `producer.` and `consumer.` prefixes, but is not adding them with no prefix. The worker uses the AdminClient to connect to the broker to get the Kafka cluster ID and to manage the three internal topics, and the AdminClient is configured via top-level properties. Because the SASL test requires the clients all connect using SASL, the lack of broker security configs means the AdminClient was attempting and failing to connect to the broker. This is corrected by adding the broker's security configuration to the Connect worker configuration file at the top-level. (This was already being done in the `connect_distributed.properties` file.) Second, the default `request.timeout.ms` for the AdminClient (and the other clients) is 120 seconds, so the AdminClient was retrying for 120 seconds before it would give up and thrown an error. However, the test was only waiting for 60 seconds before determining that the service failed to start. This can be corrected by setting `request.timeout.ms=10000` in the Connect distributed and standalone worker configurations. Third, the Connect workers were recently changed to lookup the Kafka cluster ID before it started the herder. This is unlike the older uses of the AdminClient to find and manage the internal topics, where failure to connect was not necessarily logged correctly but nevertheless still skipped over, relying upon broker auto-topic creation to create the internal topics. (This may be why the test did not fail prior to the recent change to always require a successful AdminClient connection.) Although the worker never got this far in its startup process, the fact that we missed such an error since the prior releases means that failure to connect with the AdminClient was not being properly reported. The `ConnectStandaloneFileTest.test_file_source_and_sink` system tests were run locally prior to this fix, and they failed as with the nightlies. Once these fixes were made, the locally run system tests passed. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <me@ewencp.org> Closes #4610 from rhauch/kafka-6577-trunk	7 years ago
Matthias J. Sax	1715e6eac0	MINOR: Fix Streams-Broker-Compatibility system test (#4594 ) fixes error message handling for test consumer client and KafkaStreams instance updates expected error message fixes race condition in system test code and avoids starting Streams processor twice Author: Matthias J. Sax <matthias@confluent.io.> Reviewer: Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>	7 years ago
Ewen Cheslack-Postava	da379c95e4	MINOR: Cancel port forwarding for HttpMetricsCollector during cleanup Currently port forwarding is setup for HttpMetricsCollector when the Service's start_node method is called, but not canceled during stop. This hasn't presented a problem so far because we don't have tests that use this and restart the service. However, if a test/service does that, it will throw an exception since the port is already bound. This just does the cleanup when stopping so a subsequent attempt to start again will succeed. https://jenkins.confluent.io/job/system-test-kafka-branch-builder/1320 is a test run for a Test that uses ProducerPerformanceService, which in turn uses HttpMetricsCollector to validate the change. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ismael Juma <ismael@juma.me.uk>, Apurva Mehta <apurva@confluent.io> Closes #4604 from ewencp/cleanup-reverse-port-forward	7 years ago
Damian Guy	57059d4022	MINOR: Fix streams broker compatibility test. Change the string in the test condition to the one that is logged Author: Damian Guy <damian.guy@gmail.com> Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4599 from dguy/broker-compatibility	7 years ago
Konstantine Karantasis	f10c0d3863	MINOR: Fix file source task configs in system tests. Another fall-through of `headers.converter` and `batch.size` properties. Here in `FileStreamSourceConnector` tests Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Damian Guy <damian.guy@gmail.com> Closes #4590 from kkonstantine/MINOR-Fix-file-source-task-config-in-system-tests	7 years ago
Matthias J. Sax	0b3b6049f0	MINOR: Fix Streams EOS system tests (#4572 ) Avoid loosing log/stdout/stderr files on restart Reenables tests Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>	7 years ago
Attila Sasvari	4837d44673	MINOR: Fix typos in kafka.py (#4581 )	7 years ago
Guozhang Wang	962bc638f9	MINOR: Add a new system test for resilience (#4560 ) * Rolling kill-restart Streams instances with brokers unavailable temporarily, and validate that the streams can still complete the restart process Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Bill Bejeck	67803384d9	MINOR: adding system tests for how streams functions with broker faiures (#4513 ) System test for two cases: * Starting a multi-node streams application with the broker down initially, broker starts and confirm rebalance completes and streams application still able to process records. * Multi-node streams app running, broker goes down, stop stream instance(s) confirm after broker comes back remaining streams instance(s) still function. Reviewers: Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Damian Guy	ca01711c0e	Bump trunk versions to 1.2-SNAPSHOT (#4505 )	7 years ago
Konstantine Karantasis	83cc138e0c	MINOR: Add async and different sync startup modes in connect service test class Allow Connect Service in system tests to start asynchronously. Specifically, allow for three startup conditions: 1. No condition - start async and return immediately. 2. Semi-async - start immediately after plugins have been discovered successfully. 3. Sync - start returns after the worker has completed startup. This is the current mode, but its condition is improved by checking that the port of Connect's REST interface is open, rather than that a log line has appeared in the logs. An associated system test run has been started here: https://jenkins.confluent.io/job/system-test-confluent-platform-branch-builder/586/ ewencp rhauch, I'd appreciate your review. Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4423 from kkonstantine/MINOR-Add-async-and-different-sync-startup-modes-in-ConnectService-test-class	7 years ago
Matthias J. Sax	2cefe3f0d0	MINOR: Temporarily disable flaky Streams EOS system tests (#4355 ) Reviewers: Bill Bejeck <bill@confluent.io>, Ismael Juma <ismael@juma.me.uk>	7 years ago
Guozhang Wang	7d6f6f7320	MINOR: Fix race condition in Streams EOS system test We should start the process only within the `with` block, otherwise the bytes parameter would cause a race condition that result in false alarms of system test failures. Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Ewen Cheslack-Postava <me@ewencp.org> Closes #4348 from guozhangwang/KMinor-fix-eos-test	7 years ago
Colin P. Mccabe	760d86a970	KAFKA-5849; Add process stop, round trip workload, partitioned test * Implement process stop faults via SIGSTOP / SIGCONT * Implement RoundTripWorkload, which both sends messages, and confirms that they are received at least once. * Allow Trogdor tasks to block until other Trogdor tasks are complete. * Add CreateTopicsWorker, which can be a building block for a lot of tests. * Simplify how TaskSpec subclasses in ducktape serialize themselves to JSON. * Implement some fault injection tests in round_trip_workload_test.py Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4323 from cmccabe/KAFKA-5849	7 years ago
Bill Bejeck	f3b9afe622	MINOR: Broker down for significant amt of time system test System test where a broker is offline more than the configured timeouts. In this case: - Max poll interval set to 45 secs - Retries set to 2 - Request timeout set to 15 seconds - Max block ms set to 30 seconds The broker was taken off-line for 70 seconds or more than double request timeout * num retries [passing system test results](http://confluent-kafka-branch-builder-system-test-results.s3-us-west-2.amazonaws.com/2017-12-11--001.1513034559--bbejeck--KSTREAMS_1179_broker_down_for_significant_amt_of_time--6ab4802/report.html) Author: Bill Bejeck <bill@confluent.io> Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4313 from bbejeck/KSTREAMS_1179_broker_down_for_significant_amt_of_time	7 years ago
Rajini Sivaram	e5741b90cd	MINOR: Increase number of messages in replica verification tool test Increase the number of messages produced to make the test more reliable. The test failed in a recent build and also fails intermittently when run locally. Since the producer uses acks=0 and the test stops as soon as a lag is observed, the change shouldn't have a big impact on the time taken to run when lag is observed sooner. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4312 from rajinisivaram/MINOR-replicaverification-test	7 years ago
Matthias J. Sax	234ec8a8af	KAFKA-4857: Replace StreamsKafkaClient with AdminClient in Kafka Streams Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Bill Bejeck <bbejeck@gmail.com>, Guozhang Wang <wangguoz@gmail.com> Closes #4242 from mjsax/kafka-4857-admit-client	7 years ago
Bill Bejeck	9204197abf	MINOR: Fix broker compatibility tests Updated the System test `stream_broker_compatibility_test.py` to address system test failures as we have removed explicit broker version checking - Ignore the `0.8.2.2` and `0.9.0.0` tests because the `NetworkClient` only logs `UnsupportedVersionException`s that occur and will continue to retry connecting. Once issue https://issues.apache.org/jira/browse/KAFKA-6297 is addressed, we may re-enable these tests. - Updated existing tests expected error messages - Updated Streams code in test for to make sure we fail fast for incompatible brokers Author: Bill Bejeck <bill@confluent.io> Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4286 from bbejeck/MINOR_fix_broker_compatibility_tests	7 years ago
Mikkin	18e34482e6	KAFKA-6284: Fixed system test for Connect REST API `topics.regex` was added in KAFKA-3073. This change fixes the test that invokes `/validate` to ensure that all the configdefs are returned as expected. Author: Mikkin <mikkin@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Jason Gustafson <jason@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4279 from mikkin/KAFKA-6284	7 years ago
Colin P. Mccabe	58877a0dea	KAFKA-6255; Add ProduceBench to Trogdor Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4245 from cmccabe/KAFKA-6255	7 years ago
Colin P. Mccabe	a133e69b45	KAFKA-6247; Install Kibosh on Vagrant and fix release downloads in Docker Fix an omission where Kibosh was not getting installed on Vagrant instances running in AWS. Fix an issue where the Dockerfile was unable to download old Apache Kafka releases. See the discussion on KAFKA-6233. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #4240 from cmccabe/KAFKA-6247	7 years ago
Colin P. Mccabe	d9cbc6b1a2	KAFKA-5811; Add Kibosh integration for Trogdor and Ducktape For ducktape: add Kibosh to the testing Dockerfile. Create files_unreadable_fault_spec.py. For trogdor: create FilesUnreadableFaultSpec.java. Add a unit test of using the Kibosh service. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4195 from cmccabe/KAFKA-5811	7 years ago
Ewen Cheslack-Postava	718dda1144	MINOR: Add HttpMetricsReporter for system tests Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #4072 from ewencp/http-metrics	7 years ago
Colin P. Mccabe	4fac83ba1f	KAFKA-6060; Add workload generation capabilities to Trogdor Previously, Trogdor only handled "Faults." Now, Trogdor can handle "Tasks" which may be either faults, or workloads to execute in the background. The Agent and Coordinator have been refactored from a mutexes-and-condition-variables paradigm into a message passing paradigm. No locks are necessary, because only one thread can access the task state or worker state. This makes them a lot easier to reason about. The MockTime class can now handle mocking deferred message passing (adding a message to an ExecutorService with a delay). I added a MockTimeTest. MiniTrogdorCluster now starts up Agent and Coordinator classes in paralle in order to minimize junit test time. RPC messages now inherit from a common Message.java class. This class handles implementing serialization, equals, hashCode, etc. Remove FaultSet, since it is no longer necessary. Previously, if CoordinatorClient or AgentClient hit a networking problem, they would throw an exception. They now retry several times before giving up. Additionally, the REST RPCs to the Coordinator and Agent have been changed to be idempotent. If a response is lost, and the request is resent, no harm will be done. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk> Closes #4073 from cmccabe/KAFKA-6060	7 years ago
Matthias J. Sax	c7ab3efcbe	MINOR: Code cleanup and JavaDoc improvements for clients and Streams Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Bill Bejeck <bill@confluent.io>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4128 from mjsax/minor-cleanup minor fix	7 years ago
Xavier Léauté	f7f8e11213	MINOR: reset state in cleanup, fixes jmx mixin flakiness ewencp ijuma Author: Xavier Léauté <xl+github@xvrl.net> Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #4123 from xvrl/fix-jmx-flakiness (cherry picked from commit `91eb178e95`) Signed-off-by: Ewen Cheslack-Postava <me@ewencp.org>	7 years ago
Magnus Edenhill	60c36b0984	MINOR: Fix var typo in verifiable_consumer assertion Author: Magnus Edenhill <magnus@edenhill.se> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4098 from edenhill/verfcons_var_fix	7 years ago
Apurva Mehta	90b5ce3f04	KAFKA-6016; Make the reassign partitions system test use the idempotent producer With these changes, we are ensuring that the partitions being reassigned are from non-zero offsets. We also ensure that every message in the log has producerId and sequence number. This means that it successfully reproduces https://issues.apache.org/jira/browse/KAFKA-6003. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4029 from apurvam/KAFKA-6016-add-idempotent-producer-to-reassign-partitions	7 years ago
Matthias J. Sax	51063441d3	KAFKA-5362; Follow up to Streams EOS system test - improve tests to get rid of calls to `sleep` in Python - fixed some flaky test conditions - improve debugging Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #3542 from mjsax/failing-eos-system-tests	7 years ago
Guozhang Wang	6bcbd17d34	Bump up version to 1.1.0-SNAPSHOT	7 years ago
Apurva Mehta	dd6347a5df	KAFKA-5888; System test to check ordering of messages with transactions and max.in.flight > 1 To check ordering, we augment the existing transactions test to read and write from topics with one partition. Since we are writing monotonically increasing numbers, the topics should always be sorted, making it very easy to check for out of order messages. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3969 from apurvam/KAFKA-5888-system-test-which-check-ordering	7 years ago
Randall Hauch	afaaea8093	KAFKA-5954; Correct Connect REST API system test Author: Randall Hauch <rhauch@gmail.com> Reviewers: tedyu <yuzhihong@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #3934 from rhauch/kafka-5954	7 years ago
Ismael Juma	52d7b6763b	MINOR: Fix replica_verification_tool.py to handle slight change in output format The string representation of TopicPartition was changed to be {topic}-{partitition} consistently in the following commit: `f6f56a645b` Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Damian Guy <damian.guy@gmail.com> Closes #3890 from ijuma/fix-replica-verification-test	7 years ago
Ismael Juma	439050816b	MINOR: Tweak detection of kafka server start-up in system tests Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #3834 from ijuma/tweak-system-test-regex-for-detecting-server-start-up	7 years ago
Xavier Léauté	6e40455862	KAFKA-5742; Fix incorrect method name follow-up Author: Xavier Léauté <xl+github@xvrl.net> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3818 from xvrl/fix-startswith	7 years ago
Ismael Juma	07a428e0c8	MINOR: Always specify the keystore type in system tests Also throw an exception if a null keystore type is seen in `SecurityStore`. This should never happen. The default keystore type has changed in Java 9 ( http://openjdk.java.net/jeps/229), so we need to be explicit to have consistent behaviour across Java versions. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3808 from ijuma/set-jks-explicitly-in-system-tests	7 years ago
Colin P. Mccabe	4065ffb3e1	KAFKA-5777; Add ducktape integration for Trogdor Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3726 from cmccabe/KAFKA-5777	7 years ago
Randall Hauch	75070bdb5d	MINOR: Increase timeout of Zookeeper service in system tests The previous timeout was 10 seconds, but system test failures have occurred when Zookeeper has started after about 11 seconds. Increasing the timeout to 30 seconds, since most of the time this extra time will not be required, and when it is it will prevent a failed system test. In addition to merging to `trunk`, please backport to the `0.11.x` and `0.10.2.x` branches. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #3774 from rhauch/MINOR-Increase-timeout-of-zookeeper-service-in-system-tests	7 years ago
Colin P. Mccabe	949577ca77	KAFKA-5768; Upgrade to ducktape 0.7.1 Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3721 from cmccabe/KAFKA-5768	7 years ago
Colin P. Mccabe	14f6ecd915	MINOR: KafkaService should print node hostname on failure Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #3715 from cmccabe/kafka_service_print_node_hostname_on_failure	7 years ago
Damian Guy	99eebc8404	HOTFIX: reduce streams benchmark input records to 10 million We are occasionally hitting some timeouts due to processing not finishing. So rather than failing the build for these reasons it would be better to reduce the runtime. Author: Damian Guy <damian.guy@gmail.com> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #3725 from dguy/fix-system-test	7 years ago

1 2 3 4 5 ...

329 Commits (9951f8fee145ce10b5dccde665e160a5f4ff6d03)