src-kafka

Commit Graph

Author	SHA1	Message	Date
Colin Patrick McCabe	8577632b3a	MINOR: Fix Trogdor tests, partition assignments (#4892 )	7 years ago
Bill Bejeck	c6fd3d488e	MINOR: update VerifiableProducer to send keys if configured and removed StreamsRepeatingKeyProducerService (#4841 ) This PR does the following: * Remove the StreamsRepeatingIntegerKeyProducerService and the associated Java class * Add a parameter to VerifiableProducer.java to enable sending keys when specified * Update the corresponding Python file verifiable_producer.py to support the new parameter. Reviewers: Matthias J Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	7 years ago
Colin Patrick McCabe	93e03414f7	KAFKA-6771. Make specifying partitions more flexible (#4850 )	7 years ago
Colin Patrick McCabe	832b096f4f	KAFKA-6696 Trogdor should support destroying tasks (#4759 ) Implement destroying tasks and workers. This means erasing all record of them on the Coordinator and the Agent. Workers should be identified by unique 64-bit worker IDs, rather than by the names of the tasks they are implementing. This ensures that when a task is destroyed and re-created with the same task ID, the old workers will not be treated as part of the new task instance. Fix some return results from RPCs. In some cases RPCs were returning values that were never used. Attempting to re-create the same task ID with different arguments should fail. Add RequestConflictException to represent HTTP error code 409 (CONFLICT) for this scenario. If only one worker in a task stops, don't stop all the other workers for that task, unless the worker that stopped had an error. Reviewers: Anna Povzner <anna@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Colin Patrick McCabe	4223ef6106	MINOR: Add NullPayloadGenerator to Trogdor (#4844 )	7 years ago
Anna Povzner	989fe0497e	KAFKA-6693: Added consumer workload to Trogdor (#4775 ) Added consumer-only workload to Trogdor. The topics must already be pre-populated. The spec lets the user request topic pattern and range of partitions to assign to [startPartition, endPartition]. Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Colin Patrick McCabe	40183e3156	KAFKA-6688. The Trogdor coordinator should track task statuses (#4737 ) Reviewers: Anna Povzner <anna@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Anna Povzner	da32db9f34	Trogdor: Added commonClientConf and adminClientConf to workload specs (#4757 ) Currently, WorkerUtils will be able to create topics when there is no security. To be able to work with secure kafka, WorkerUtils.createTopic() needs to be able to take security configs. This PR adds commonClientConf field to both producer bench and roundtrip workload specs so that users can specify security and other common configs once for producer/consumer and adminClient. Also added adminClientConf field to workload specs so that users can specify adminClient specific configs if they want to. For completeness, added consumerConf and producerConf to roundtrip workload spec. Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Colin Patrick McCabe	63642d6051	KAFKA-6694: The Trogdor Coordinator should support filtering task responses (#4741 )	7 years ago
cburroughs	514936af6f	MINOR: Remove redundant initialization of `Stats.index` (#4751 )	7 years ago
Anna Povzner	5c24295d44	Trogdor's ProducerBench does not fail if topics exist (#4673 ) Added configs to ProducerBenchSpec: topicPrefix: name of topics will be of format topicPrefix + topic index. If not provided, default is "produceBenchTopic". partitionsPerTopic: number of partitions per topic. If not provided, default is 1. replicationFactor: replication factor per topic. If not provided, default is 3. The behavior of producer bench is changed such that if some or all topics already exist (with topic names = topicPrefix + topic index), and they have the same number of partitions as requested, the worker uses those topics and does not fail. The producer bench fails if one or more existing topics has a number of partitions that is different from the expected number of partitions. Added unit test for WorkerUtils -- for existing methods and new methods. Fixed bug in MockAdminClient, where createTopics() would over-write existing topic's replication factor and number of partitions while correctly completing the appropriate futures exceptionally with TopicExistsException. Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Colin Patrick McCabe	0560193706	MINOR: improve trogdor commandline (#4721 ) Allow -c as a synonym for --agent.config and --coordinator.config. Allow -n as a synonym for --node-name. Add an example trogdor.conf file.	7 years ago
Colin Patrick McCabe	a70e4f95d7	KAFKA-6658; Fix RoundTripWorkload and make k/v generation configurable (#4710 ) Make PayloadGenerator an interface which can have multiple implementations: constant, uniform random, sequential. Allow different payload generators to be used for keys and values. This change fixes RoundTripWorkload. Previously RoundTripWorkload was unable to get the sequence number of the keys that it produced.	7 years ago
Colin Patrick McCabe	9e0e6e43a7	MINOR: Trogdor should not assume an agent co-located with the controller (#4712 )	7 years ago
Colin Patrick McCabe	8c10e06007	MINOR: Avoid nulls when deserializing Trogdor JSON (#4688 )	7 years ago
Colin Patrick McCabe	bf8a4c2ce7	MINOR: Improve Trogdor client logging. (#4675 ) AgentClient and CoordinatorClient should have the option of logging failures to custom log4j objects. There should also be builders for these objects, to make them easier to extend in the future. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Colin Patrick McCabe	ec9e8110e3	MINOR: add DEFAULT_PORT for Trogdor Agent and Coordinator (#4674 ) Add a DEFAULT_PORT constant for the Trogdor Agent and Coordinator. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Anna Povzner	f1c112c63d	MINOR: Add PayloadGenerator to Trogdor (#4640 ) It generates the producer payload (key and value) and makes sure that the values are populated to target a realistic compression rate (0.3 - 0.4) if compression is used. The generated payload is deterministic and can be replayed from a given position. For now, all generated values are constant size, and key types can be configured to be either null or 8 bytes. Added messageSize parameter to producer spec, that specifies produced key + message size.	7 years ago
Steven Aerts	ae42cc8030	KAFKA-6018: Make KafkaFuture.Future an interface (KIP-218) Changing KafkaFuture.Future and KafkaFuture.BiConsumer into an interface makes them a functional interface. This makes them Java 8 lambda compatible. Author: Colin P. Mccabe <cmccabe@confluent.io> Author: Steven Aerts <steven.aerts@gmail.com> Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Xavier Léauté <xl+github@xvrl.net>, Tom Bentley <tbentley@redhat.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4033 from steven-aerts/KAFKA-6018	7 years ago
Romain Hardouin	a7e49027b2	MINOR: Catch JsonMappingException subclass (#3821 ) Handle InvalidTypeIdException as NOT_IMPLEMENTED and add unit tests for all exceptions. Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	7 years ago
Rajini Sivaram	3de910319e	KAFKA-6415; Use WARN log level for Metadata in system test When a log entry is appended to a Kafka topic using `KafkaLog4jAppender`, the producer.send operation may block waiting for metadata. This can result in deadlocks in a couple of scenarios if a log entry from the producer network thread is also at a log level that results in the entry being appended to a Kafka topic. 1. Producer's network thread will attempt to send data to a Kafka topic and this is unsafe since producer.send may block waiting for metadata, causing a deadlock since the thread will not process the metadata request/response. 2. `KafkaLog4jAppender#append` is invoked while holding the lock of the logger. So the thread waiting for metadata in the initial send will be holding the logger lock. If the producer network thread has a log entry that needs to be appended, it will attempt to acquire the logger lock and deadlock. This is a temporary workaround to avoid deadlocks in system tests by setting log level to WARN for `Metadata` in `VerifiableLog4jAppender`. The fix has been verified using the system tests log4j_appender_test.py which started failing when the info-level log entry was introduced. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Satish Duggana <satish.duggana@gmail.com>, tedyu <yuzhihong@gmail.com> Closes #4375 from rajinisivaram/KAFKA-6415-log4jappender	7 years ago
Colin P. Mccabe	760d86a970	KAFKA-5849; Add process stop, round trip workload, partitioned test * Implement process stop faults via SIGSTOP / SIGCONT * Implement RoundTripWorkload, which both sends messages, and confirms that they are received at least once. * Allow Trogdor tasks to block until other Trogdor tasks are complete. * Add CreateTopicsWorker, which can be a building block for a lot of tests. * Simplify how TaskSpec subclasses in ducktape serialize themselves to JSON. * Implement some fault injection tests in round_trip_workload_test.py Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4323 from cmccabe/KAFKA-5849	7 years ago
Colin P. Mccabe	58877a0dea	KAFKA-6255; Add ProduceBench to Trogdor Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4245 from cmccabe/KAFKA-6255	7 years ago
Colin P. Mccabe	d9cbc6b1a2	KAFKA-5811; Add Kibosh integration for Trogdor and Ducktape For ducktape: add Kibosh to the testing Dockerfile. Create files_unreadable_fault_spec.py. For trogdor: create FilesUnreadableFaultSpec.java. Add a unit test of using the Kibosh service. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4195 from cmccabe/KAFKA-5811	7 years ago
Ewen Cheslack-Postava	54371e63d3	MINOR: Make PushHttpMetricsReporter API compatible with releases back to 0.8.2.2 This is follow up to #4072 which added the PushHttpMetricsReporter and converted some services to use it. We somehow missed some compatibility issues that made the ProducerPerformance tool fail when using a newer tools jar with older common/clients jar, which we do with some system tests so we have all the features we need in the tool but can build compatibility tests for older releases. This just adjusts some API usage to make the tool compatible with all previous releases. I have a full run of the tests starting [here](https://jenkins.confluent.io/job/system-test-kafka-branch-builder/1122/) Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #4214 from ewencp/fix-compatibility-sanity-check-tests	7 years ago
Ewen Cheslack-Postava	718dda1144	MINOR: Add HttpMetricsReporter for system tests Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #4072 from ewencp/http-metrics	7 years ago
Adem Efe Gencer	86062e9a78	KAFKA-6157; Fix repeated words words in JavaDoc and comments. Author: Adem Efe Gencer <agencer@linkedin.com> Reviewers: Jiangjie Qin <becket.qin@gmail.com> Closes #4170 from efeg/bug/typoFix	7 years ago
Colin P. Mccabe	4fac83ba1f	KAFKA-6060; Add workload generation capabilities to Trogdor Previously, Trogdor only handled "Faults." Now, Trogdor can handle "Tasks" which may be either faults, or workloads to execute in the background. The Agent and Coordinator have been refactored from a mutexes-and-condition-variables paradigm into a message passing paradigm. No locks are necessary, because only one thread can access the task state or worker state. This makes them a lot easier to reason about. The MockTime class can now handle mocking deferred message passing (adding a message to an ExecutorService with a delay). I added a MockTimeTest. MiniTrogdorCluster now starts up Agent and Coordinator classes in parallel in order to minimize junit test time. RPC messages now inherit from a common Message.java class. This class handles implementing serialization, equals, hashCode, etc. Remove FaultSet, since it is no longer necessary. Previously, if CoordinatorClient or AgentClient hit a networking problem, they would throw an exception. They now retry several times before giving up. Additionally, the REST RPCs to the Coordinator and Agent have been changed to be idempotent. If a response is lost, and the request is resent, no harm will be done. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk> Closes #4073 from cmccabe/KAFKA-6060	7 years ago
Tom Bentley	6118ecb590	KAFKA-6130; Ensure VerifiableConsumer halts when --max-messages is reached Author: Tom Bentley <tbentley@redhat.com> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4157 from tombentley/KAFKA-6130-verifiable-consumer-max-messages	7 years ago
Apurva Mehta	34188b4cc4	MINOR: Bump the request timeout for the transactional message copier Multiple inflights means that when there are rolling bounces or other cluster instability, there is an increased likelihood of having previously tried batch expire in the accumulator. This is a fatal error for a transactional producer, causing the `TransactionalMessageCopier` to exit. To work around this, we bump the request timeout. We can get rid of this when KIP-91 is merged. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4039 from apurvam/MINOR-bump-request-timeout-in-transactional-message-copier	7 years ago
Apurva Mehta	bdf8e211ec	KAFKA-6053; Fix NoSuchMethodError when creating ProducerRecords with older client versions Author: Apurva Mehta <apurva@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #4057 from apurvam/KAFKA-6053-fix-no-such-method-error-in-producer-record	7 years ago
Apurva Mehta	90b5ce3f04	KAFKA-6016; Make the reassign partitions system test use the idempotent producer With these changes, we are ensuring that the partitions being reassigned are from non-zero offsets. We also ensure that every message in the log has producerId and sequence number. This means that it successfully reproduces https://issues.apache.org/jira/browse/KAFKA-6003. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #4029 from apurvam/KAFKA-6016-add-idempotent-producer-to-reassign-partitions	7 years ago
Jason Gustafson	5383f9bed0	MINOR: Use SecurityProtocol in AuthenticationContext Since we removed the unused `TRACE` option from `SecurityProtocol`, it now seems safer to expose it from `AuthenticationContext`. Additionally this patch exposes javadocs under security.auth and relocates the `Login` and `AuthCallbackHandler` to a non-public package. Author: Jason Gustafson <jason@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #3863 from hachikuji/use-security-protocol-in-auth-context	7 years ago
Rajini Sivaram	021d8a8e96	KAFKA-5746; Add new metrics to support health checks (KIP-188) Adds new metrics to support health checks: 1. Error rates for each request type, per-error code 2. Request size and temporary memory size 3. Message conversion rate and time 4. Successful and failed authentication rates 5. ZooKeeper latency and status 6. Client version Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3705 from rajinisivaram/KAFKA-5746-new-metrics	7 years ago
Apurva Mehta	5d2422258c	KAFKA-5494; Enable idempotence with max.in.flight.requests.per.connection > 1 Here we introduce client and broker changes to support multiple inflight requests while still guaranteeing idempotence. Two major problems to be solved: 1. Sequence number management on the client when there are request failures. When a batch fails, future inflight batches will also fail with `OutOfOrderSequenceException`. This must be handled on the client with intelligent sequence reassignment. We must also deal with the fatal failure of some batch: the future batches must get different sequence numbers when they come back. 2. On the broker, when we have multiple inflights, we can get duplicates of multiple old batches. With this patch, we retain the record metadata for 5 older batches. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3743 from apurvam/KAFKA-5494-increase-max-in-flight-for-idempotent-producer	7 years ago
Colin P. Mccabe	4065ffb3e1	KAFKA-5777; Add ducktape integration for Trogdor Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3726 from cmccabe/KAFKA-5777	7 years ago
Colin P. Mccabe	ded8741173	KAFKA-5806; Fix transient unit test failure in trogdor coordinator shutdown In the coordinator, we should check that 'shutdown' is not true before going to sleep waiting for the condition. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Jason Gustafson <jason@confluent.io> Closes #3755 from cmccabe/KAFKA-5806	7 years ago
Colin P. Mccabe	0772fde562	KAFKA-5776; Add the Trogdor fault injection daemon Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3699 from cmccabe/trogdor-review	7 years ago
ppatierno	f15cdc73dd	KAFKA-5516: Formatting verifiable producer/consumer output in a similar fashion Author: ppatierno <ppatierno@live.com> Author: Paolo Patierno <ppatierno@live.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #3434 from ppatierno/verifiable-consumer-producer	7 years ago
Vahid Hashemian	f87d58b796	MINOR: Code Cleanup Clean up includes: - Switching try-catch-finally blocks to try-with-resources when possible - Removing some seemingly unnecessary `SuppressWarnings` annotations - Resolving some Java warnings - Closing unclosed Closeable objects - Removing unused code Author: Vahid Hashemian <vahidhashemian@us.ibm.com> Reviewers: Balint Molnar <balintmolnar91@gmail.com>, Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <matthias@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #3222 from vahidhashemian/minor/code_cleanup_1706	7 years ago
Apurva Mehta	bc47e9d6ca	KAFKA-5491; Enable transactions in ProducerPerformance Tool With this patch, the `ProducerPerformance` tool can create transactions of differing durations. This patch was used to collect the initial set of benchmarks for transaction performance, documented here: https://docs.google.com/spreadsheets/d/1dHY6M7qCiX-NFvsgvaE0YoVdNq26uA8608XIh_DUpI4/edit#gid=282787170 Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jun Rao <junrao@gmail.com> Closes #3400 from apurvam/MINOR-add-transaction-size-to-producre-perf	8 years ago
Ismael Juma	0f60617fab	KAFKA-5275; AdminClient API consistency Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Jason Gustafson <jason@confluent.io> Closes #3339 from ijuma/kafka-5275-admin-client-api-consistency	8 years ago
Jason Gustafson	005b86ecf3	MINOR: Add random aborts to system test transactional copier service Author: Jason Gustafson <jason@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #3340 from hachikuji/add-random-aborts-to-system-test	8 years ago
Colin P. Mccabe	7d1ef63bec	KAFKA-5404; Add more AdminClient checks to ClientCompatibilityTest Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3263 from cmccabe/KAFKA-5404	8 years ago
Apurva Mehta	79db393ffa	KAFKA-5385; ProducerBatch expiry should go through Sender.failBatch Before this patch, we would call `producerBatch.done` directly from the accumulator when expiring batches. This meant that we would not transition to the `ABORTABLE_ERROR` state in the transaction manager, allowing other transactional requests (including Commits!) to go through, even though the produce failed. This patch modifies the logic so that we call `Sender.failBatch` on every expired batch, thus ensuring that the transaction state is accurate. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io> Closes #3252 from apurvam/KAFKA-5385-fail-transaction-if-batches-expire	8 years ago
Apurva Mehta	202cb8ea89	KAFKA-5366; Add concurrent reads to transactions system test This currently fails in multiple ways. One of which is most likely KAFKA-5355, where the concurrent consumer reads duplicates. During broker bounces, the concurrent consumer misses messages completely. This is another bug. Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3217 from apurvam/KAFKA-5366-add-concurrent-reads-to-transactions-system-test	8 years ago
Colin P. Mccabe	f389b71570	KAFKA-5374; Set allow auto topic creation to false when requesting node information only It avoids the need to handle protocol downgrades and it's safe (i.e. it will never cause the auto creation of topics). Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3220 from ijuma/kafka-5374-admin-client-metadata	8 years ago
Apurva Mehta	1959835d9e	KAFKA-5281; System tests for transactions Author: Apurva Mehta <apurva@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3149 from apurvam/KAFKA-5281-transactions-system-tests	8 years ago
Jun Rao	b154221774	MINOR: ProducerPerformance should work with older client jars Author: Jun Rao <junrao@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2896 from junrao/minor	8 years ago
Jun Rao	80e0548e2b	KAFKA-5100; ProducerPerformanceService failing due to parsing error Author: Jun Rao <junrao@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2890 from junrao/kafka-5100	8 years ago

88 Commits (fffb9c5b5cac1a669f22dd99860774d6c0fdb94b)