src-kafka

Commit Graph

Author	SHA1	Message	Date
John Roesler	74502710b3	MINOR: report streams benchmarks separately (#5275) Specify each benchmark as a separate test so that we can see the results reported independently. Reviewers: Guozhang Wang <wangguoz@gmail.com>	6 years ago
Ismael Juma	cc4dce94af	KAFKA-2983: Remove Scala consumers and related code (#5230) - Removed Scala consumers (`SimpleConsumer` and `ZooKeeperConsumerConnector`) and their tests. - Removed Scala request/response/message classes. - Removed any mention of new consumer or new producer in the code with the exception of MirrorMaker where the new.consumer option was never deprecated so we have to keep it for now. The non-code documentation has not been updated either, that will be done separately. - Removed a number of tools that only made sense in the context of the Scala consumers (see upgrade notes). - Updated some tools that worked with both Scala and Java consumers so that they only support the latter (see upgrade notes). - Removed `BaseConsumer` and related classes apart from `BaseRecord` which is used in `MirrorMakerMessageHandler`. The latter is a pluggable interface so effectively public API. - Removed `ZkUtils` methods that were only used by the old consumers. - Removed `ZkUtils.registerBroker` and `ZKCheckedEphemeral` since the broker now uses the methods in `KafkaZkClient` and no-one else should be using that method. - Updated system tests so that they don't use the Scala consumers except for multi-version tests. - Updated LogDirFailureTest so that the consumer offsets topic would continue to be available after all the failures. This was necessary for it to work with the Java consumer. - Some multi-version system tests had not been updated to include recently released Kafka versions, fixed it. - Updated findBugs and checkstyle configs not to refer to deleted classes and packages. Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	7 years ago
Guozhang Wang	b2e0812f69	HOTFIX: rename run_test to execute in streams simple benchmark (#4941)	7 years ago
Guozhang Wang	1f523d9d72	MINOR: add window store range query in simple benchmark (#4894) There are a couple minor additions in this PR: 1. Add a new test for window store, to range query upon receiving each record. 2. In the non-windowed state store case, add a get call before the put call. 3. Enable caching by default to be consistent with other Join / Aggregate cases, where caching is enabled by default. Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	0dc7f0e66f	KAFKA-6611, PART II: Improve Streams SimpleBenchmark (#4854) SimpleBenchmark: 1.a Do not rely on manual num.records / bytes collection on atomic integers. 1.b Rely on config files for num.threads, bootstrap.servers, etc. 1.c Add parameters for key skewness and value size. 1.d Refactor the tests for loading phase, adding tumbling-windowed count. 1.e For consumer / consumeproduce, collect metrics on consumer instead. 1.f Force stop the test after 3 minutes, this is based on empirical numbers of 10M records. Other tests: use config for kafka bootstrap servers. streams_simple_benchmark.py: only use scale 1 for system test, remove yahoo from benchmark tests. Note that the JMX based metrics is more accurate than the manually collected metrics. Reviewers: John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>	7 years ago
Guozhang Wang	f2fbfaaccc	KAFKA-6611: PART I, Use JMXTool in SimpleBenchmark (#4650) 1. Use JmxMixin for SimpleBenchmark (will remove the self reporting in #4744), only when loading phase is false (i.e. we are in fact starting the streams app). 2. Reported the full jmx reported metrics in log files, and in the returned data only return the max values: this is because we want to skip the warming up and cooling down periods that will have lower rate numbers, while max represents the actual rate at full speed. 3. Incorporates two other improves to JMXTool: #1241 and #2950 Reviewers: John Roesler <john@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Rohan Desai <desai.p.rohan@gmail.com>	7 years ago
Damian Guy	99eebc8404	HOTFIX: reduce streams benchmark input records to 10 million We are occasionally hitting some timeouts due to processing not finishing. So rather than failing the build for these reasons it would be better to reduce the runtime. Author: Damian Guy <damian.guy@gmail.com> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #3725 from dguy/fix-system-test	7 years ago
Eno Thereska	55a90938a1	MINOR: add Yahoo benchmark to nightly runs Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy <damian.guy@gmail.com> Closes #3289 from enothereska/yahoo-benchmark	8 years ago
Eno Thereska	5c3d7ca711	MINOR: log4j template should accept log_level The log_level parameter is used in system tests in kafka.py. However the log4j template accepted that parameter in only one place. This led to a large number of DEBUG lines printed even when the intention was to capture only INFO lines. Led to huge log files. Thanks to ijuma for noticing this. Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3247 from enothereska/minor-log4j-template-fix	8 years ago
Eno Thereska	84a14fec29	KAFKA-4843: More efficient round-robin scheduler - Improves streams efficiency by more than 200K requests/second (small 100 byte requests) - Gets streams efficiency very close to pure consumer (see results in https://jenkins.confluent.io/job/system-test-kafka-branch-builder/746/console) - Maintains same fairness across tasks - Schedules all records in the queue in-between poll() calls, not just one per task. Author: Eno Thereska <eno@confluent.io> Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy, Matthias J. Sax, Guozhang Wang Closes #2643 from enothereska/minor-schedule-round-robin	8 years ago
Eno Thereska	b7378d567f	MINOR: Standardised benchmark params for consumer and streams There were some minor differences in the basic consumer config and streams config that are now rectified. In addition, in AWS environments the socket size makes a big difference to performance and I've tuned it up accordingly. I've also increased the number of records now that perf is higher. Author: Eno Thereska <eno@confluent.io> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #2634 from enothereska/minor-standardize-params	8 years ago
Eno Thereska	b865a8b1dc	KAFKA-4716: send create topics to controller in internaltopicmanager This PR fixes a blocker issue, where the streams client code cannot talk to the controller. It also enables a system test that was previously failing. This PR is for trunk only. A separate PR with just the fix (but not the tests) will be created for 0.10.2. Author: Eno Thereska <eno@confluent.io> Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy, Ismael Juma, Matthias J. Sax, Guozhang Wang Closes #2522 from enothereska/KAFKA-4716-metadata	8 years ago
Eno Thereska	13a82b48ca	KAFKA-4702: Parametrize streams benchmarks to run at scale Author: Eno Thereska <eno.thereska@gmail.com> Author: Eno Thereska <eno@confluent.io> Author: Ubuntu <ubuntu@ip-172-31-22-146.us-west-2.compute.internal> Reviewers: Matthias J. Sax, Guozhang Wang Closes #2478 from enothereska/minor-benchmark-args	8 years ago
Ewen Cheslack-Postava	6264cc1557	KAFKA-4450; Add upgrade tests for 0.10.1 and rename TRUNK to DEV_BRANCH to reduce confusion Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2457 from ewencp/kafka-4450-upgrade-tests	8 years ago
Matthias J. Sax	448c1a4114	HOTIFX: streams system test do not start up correctly Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Guozhang Wang, Damian Guy, Eno Thereska Closes #2428 from mjsax/hotfixSystemTests	8 years ago
Geoff Anderson	62e043a865	KAFKA-4140: Upgrade to ducktape 0.6.0 and make system tests parallel friendly Updates to take advantage of soon-to-be-released ducktape features. Author: Geoff Anderson <geoff@confluent.io> Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #1834 from granders/systest-parallel-friendly	8 years ago
Eno Thereska	c5d26c4829	KAFKA-4016: Added join benchmarks Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Ismael Juma, Damian Guy, Guozhang Wang Closes #1700 from enothereska/join-benchmarks	8 years ago
Geoff Anderson	801fee89d8	MINOR: cleanup apache license in python files ijuma As discussed in https://github.com/apache/kafka/pull/1645, this patch removes an extraneous line from several __init__.py files, and a few others as well Author: Geoff Anderson <geoff@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #1659 from granders/minor-cleanup-init-files	8 years ago
Eno Thereska	f1b37eec74	HOTFIX: Adding init file so streams benchmark is autodiscovered Without this file the benchmark does not run nightly. Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Geoff Anderson <geoff@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #1645 from enothereska/hotfix-streams-test	8 years ago
Eno Thereska	61c568d839	MINOR: Added simple streams benchmark to system tests Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Geoff Anderson, Guozhang Wang, Ismael Juma Closes #1621 from enothereska/simple-benchmark-streams-system-tests	8 years ago
Geoff Anderson	54092c12ed	KAFKA-3592: System test - configurable paths This patch adds logic for the following: - remove hard-coded paths to various scripts and jars in kafkatest service classes - provide a mechanism for overriding path resolution logic with a "pluggable" path resolver class Author: Geoff Anderson <geoff@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #1245 from granders/configurable-install-path	9 years ago
Ismael Juma	a5f1158c31	KAFKA-3558; Add compression_type parameter to benchmarks in benchmark_test.py * Use a fixed `Random` seed in `EndToEndLatency.scala` for determinism * Add `compression_type` to and remove `consumer_fetch_max_wait` from `end_to_end_latency.py`. The latter was never used. * Tweak logging of `end_to_end_latency.py` to be similar to `consumer_performance.py`. * Add `compression_type` to `benchmark_test.py` methods and add `snappy` to `matrix` annotation * Use randomly generated bytes from a restricted range for `ProducerPerformance` payload. This is a simple fix for now. It can be improved in the PR for KAFKA-3554. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #1225 from ijuma/kafka-3558-add-compression_type-benchmark_test.py	9 years ago
Ismael Juma	c1694833d5	KAFKA-3490; Multiple version support for ducktape performance tests Author: Ismael Juma <ismael@juma.me.uk> Author: Geoff Anderson <geoff@confluent.io> Reviewers: Geoff Anderson <geoff@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #1173 from ijuma/kafka-3490-multiple-version-support-perf-tests	9 years ago
Grant Henke	45c585b4f7	KAFKA-3483: Restructure ducktape tests to simplify running subsets of tests … tests Author: Grant Henke <granthenke@gmail.com> Reviewers: Geoff Anderson <geoff@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #1162 from granthenke/ducktape-structure	9 years ago

24 Commits (ec501f305e53a09072580fb3824048c170d32a48)