src-kafka

Commit Graph

Author	SHA1	Message	Date
Matthias J. Sax	afeadbef50	KAFKA-5003; StreamThread should catch InvalidTopicException We should catch `InvalidTopicException` and not just `NoOffsetForPartitionException`. Also, we need to step through all partitions that might be affected and reset those. Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Bill Bejeck <bbejeck@gmail.com>, Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2747 from mjsax/minor-fix-reset	8 years ago
Colin P. Mccabe	d5fb7364ae	KAFKA-4993; Fix findbugs warnings in kafka-clients Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2779 from cmccabe/KAFKA-4993	8 years ago
Armin Braun	97e61d4ae2	MINOR: Fix multiple KafkaStreams.StreamStateListener being instantiated There should only be a single `KafkaStreams.StreamStateListener` to ensure synchronization of operations on `KafkaStreams.StreamStateListener#threadState`. Author: Armin Braun <me@obrown.io> Reviewers: Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2801 from original-brownbear/fix-stream-state-listener	8 years ago
Ewen Cheslack-Postava	d4c4bcf017	MINOR: Make ConfigDef safer by not using empty string for NO_DEFAULT_VALUE. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2660 from ewencp/minor-make-configdef-safer	8 years ago
Matthias J. Sax	0df910c034	MINOR: Fix flaky StateDirectoryTest This fixes: ``` java.lang.AssertionError: expected:<2> but was:<3> at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:834) at org.junit.Assert.assertEquals(Assert.java:645) at org.junit.Assert.assertEquals(Assert.java:631) at org.apache.kafka.streams.processor.internals.StateDirectoryTest.shouldCleanUpTaskStateDirectoriesThatAreNotCurrentlyLocked(StateDirectoryTest.java:145) ``` While running test in infinite loop, hit other problems: - fixed file management (release all locks and close everything) - increased sleep time for `shouldCleanupStateDirectoriesWhenLastModifiedIsLessThanNowMinusCleanupDelay` too (was flaky as well) Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2781 from mjsax/minor-fix-stateDirectoryTest	8 years ago
Matthias J. Sax	aea1465118	HOTFIX: WindowedStreamPartitioner does not provide topic name to serializer Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2777 from mjsax/hotfix-window-serdes-trunk	8 years ago
Eno Thereska	49f80b2360	KAFKA-4916: test streams with brokers failing Several fixes for handling broker failures: - default replication value for internal topics is now 3 in test itself (not in streams code, that will require a KIP. - streams producer waits for acks from all replicas in test itself (not in streams code, that will require a KIP. - backoff time for streams client to try again after a failure to contact controller. - fix bug related to state store locks (this helps in multi-threaded scenarios) - fix related to catching exceptions property for network errors. - system test for all the above Author: Eno Thereska <eno@confluent.io> Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Matthias J. Sax <matthias@confluent.io>, Damian Guy <damian.guy@gmail.com>, Guozhang Wang <wangguoz@gmail.com>, Dan Norwood <norwood@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #2719 from enothereska/KAFKA-4916-broker-bounce-test	8 years ago
Konstantine Karantasis	9160810072	KAFKA-4837: Fix class name comparison in connector-plugins REST endpoint Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #2798 from kkonstantine/KAFKA-4837-Config-validation-in-Connector-plugins-need-to-compare-against-both-canonical-and-simple-class-names	8 years ago
Vitaly Pushkar	54bf2fb5ff	KAFKA-4810: Make Kafka Connect SchemaBuilder more lax about checking that fields are unset https://issues.apache.org/jira/browse/KAFKA-4810 > Currently SchemaBuilder is strict when checking that certain fields have not been set yet (e.g. version, name, doc). It just checks that the field is null. This is intended to protect the user from buggy code that overwrites a field with different values, but it's a bit too strict currently. In generic code for converting schemas (e.g. Converters) you will sometimes initialize a builder with these values (e.g. because you get a SchemaBuilder for a logical type, which sets name & version), but then have generic code for setting name & version from the source schema. Changed the validation method to not only check if a field is null but also to check if the new value that is being set is the same as the current value of the field. ewencp Author: Vitaly Pushkar <vitaly.pushkar@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #2806 from vitaly-pushkar/KAFKA-4810-schema-builder-default-fields-validation	8 years ago
Balint Molnar	75e213e550	KAFKA-4855: Struct SchemaBuilder should not allow duplicate fields ewencp can you please review. Author: Balint Molnar <balintmolnar91@gmail.com> Reviewers: Gwen Shapira <cshapi@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #2732 from baluchicken/KAFKA-4855	8 years ago
Colin P. Mccabe	f812a8fd93	KAFKA-4977: Fix findbugs issues in connect/runtime Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #2763 from cmccabe/KAFKA-4977	8 years ago
Shun Takebayashi	ca2979f847	MINOR: Suppress ProducerConfig warning in MirrorMaker Though MirrorMaker uses the `producer.type` value of the producer properties, ProducerConfig show the warning: `The configuration 'producer.type' was supplied but isn't a known config.` Author: Shun Takebayashi <shun@takebayashi.asia> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2676 from takebayashi/suppress-mirrormaker-warning	8 years ago
Apurva Mehta	b9b2cfc28c	MINOR: Close the producer batch append stream when the batch gets full to free up resources Of particular importance are compression buffers (64 KB for LZ4, for example). Author: Apurva Mehta <apurva@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2796 from apurvam/idempotent-producer-close-data-stream	8 years ago
Armin Braun	040fde8ec1	KAFKA-4878: Improved Invalid Connect Config Error Message Addresses for https://issues.apache.org/jira/browse/KAFKA-4878 * Adjusted the error message to explicitly state errors and their number * Dried up the logic for generating the message between standalone and distributed Example messed up two config keys in the file source config: ```` namse=local-file-source connector.class=FileStreamSource tasks.max=1 fisle=test.txt topic=connect-test ``` Produces: ``` [2017-03-22 08:57:11,896] ERROR Stopping after connector error (org.apache.kafka.connect.cli.ConnectStandalone:99) java.util.concurrent.ExecutionException: org.apache.kafka.connect.runtime.rest.errors.BadRequestException: Connector configuration is invalid and contains the following 2 error(s): Missing required configuration "file" which has no default value. Missing required configuration "name" which has no default value. You can also find the above list of errors at the endpoint `/{connectorType}/config/validate` ``` Author: Armin Braun <me@obrown.io> Reviewers: Gwen Shapira, Konstantine Karantasis, Ewen Cheslack-Postava Closes #2722 from original-brownbear/KAFKA-4878	8 years ago
Matthias J. Sax	800d29648b	MINOR: fix cleanup phase for KStreamWindowAggregateTest fixes: ``` java.nio.file.NoSuchFileException: /tmp/test7863510415433793941/topic2-Canonized/topic2-Canonized-197001010000/000015.sst at sun.nio.fs.UnixException.translateToIOException(UnixException.java:86) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:107) at sun.nio.fs.UnixFileAttributeViews$Basic.readAttributes(UnixFileAttributeViews.java:55) at sun.nio.fs.UnixFileSystemProvider.readAttributes(UnixFileSystemProvider.java:144) at sun.nio.fs.LinuxFileSystemProvider.readAttributes(LinuxFileSystemProvider.java:97) at java.nio.file.Files.readAttributes(Files.java:1686) at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:105) at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:199) at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:199) at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:199) at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:69) at java.nio.file.Files.walkFileTree(Files.java:2602) at java.nio.file.Files.walkFileTree(Files.java:2635) at org.apache.kafka.common.utils.Utils.delete(Utils.java:555) at org.apache.kafka.streams.kstream.internals.KStreamWindowAggregateTest.testJoin(KStreamWindowAggregateTest.java:320) ``` Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Jun Rao <junrao@gmail.com> Closes #2778 from mjsax/minor-fix-kstreamWindowAggregateTest	8 years ago
Ismael Juma	f54b61909d	HOTFIX: Set `baseOffset` correctly in `RecordAccumulator` The bug meant that the base offset was the same as the batch size instead of 0 so the broker would always recompress batches. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jun Rao <junrao@gmail.com> Closes #2794 from ijuma/fix-records-builder-construction	8 years ago
Rajini Sivaram	1ba8b40b34	MINOR: Fix potential deadlock in consumer close test Fixes deadlock scenario found during local test run: The main thread was waiting for the coordinator lock. The thread performing close() was holding the coordinator lock and polling to find coordinator. The test expected close() to timeout, but for timing out, the main thread had to update time, which it couldn't since it was waiting for the lock. This fix avoids using coordinator in the main thread during the close task. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2792 from rajinisivaram/MINOR-closetest-deadlock	8 years ago
Armin Braun	3364f12bc2	MINOR: Fix deadlock between StreamThread and KafkaStreams This may be a reason why we see Jenkins jobs time out at times. I can reproduce it locally. With current trunk there is a possibility to run into this: ```sh "kafka-streams-close-thread" #585 daemon prio=5 os_prio=0 tid=0x00007f66d052d800 nid=0x7e02 waiting for monitor entry [0x00007f66ae2e5000] java.lang.Thread.State: BLOCKED (on object monitor) at org.apache.kafka.streams.processor.internals.StreamThread.close(StreamThread.java:345) - waiting to lock <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread) at org.apache.kafka.streams.KafkaStreams$1.run(KafkaStreams.java:474) at java.lang.Thread.run(Thread.java:745) "appId-bd262a91-5155-4a35-bc46-c6432552c2c5-StreamThread-97" #583 prio=5 os_prio=0 tid=0x00007f66d052f000 nid=0x7e01 waiting for monitor entry [0x00007f66ae4e6000] java.lang.Thread.State: BLOCKED (on object monitor) at org.apache.kafka.streams.KafkaStreams.setState(KafkaStreams.java:219) - waiting to lock <0x000000077d335760> (a org.apache.kafka.streams.KafkaStreams) at org.apache.kafka.streams.KafkaStreams.access$100(KafkaStreams.java:117) at org.apache.kafka.streams.KafkaStreams$StreamStateListener.onChange(KafkaStreams.java:259) - locked <0x000000077d42f138> (a org.apache.kafka.streams.KafkaStreams$StreamStateListener) at org.apache.kafka.streams.processor.internals.StreamThread.setState(StreamThread.java:168) - locked <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread) at org.apache.kafka.streams.processor.internals.StreamThread.setStateWhenNotInPendingShutdown(StreamThread.java:176) - locked <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread) at org.apache.kafka.streams.processor.internals.StreamThread.access$1600(StreamThread.java:70) at org.apache.kafka.streams.processor.internals.StreamThread$RebalanceListener.onPartitionsRevoked(StreamThread.java:1321) at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.onJoinPrepare(ConsumerCoordinator.java:406) at org.apache.kafka.clients.consumer.internals.AbstractCoordinator.joinGroupIfNeeded(AbstractCoordinator.java:349) at org.apache.kafka.clients.consumer.internals.AbstractCoordinator.ensureActiveGroup(AbstractCoordinator.java:310) at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.poll(ConsumerCoordinator.java:296) at org.apache.kafka.clients.consumer.KafkaConsumer.pollOnce(KafkaConsumer.java:1037) at org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1002) at org.apache.kafka.streams.processor.internals.StreamThread.pollRequests(StreamThread.java:531) at org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:669) at org.apache.kafka.streams.processor.internals.StreamThread.run(StreamThread.java:326) ``` In a nutshell: `KafkaStreams` and `StreamThread` are both waiting for each other since another intermittent `close` (eg. from a test) comes along also trying to lock on `KafkaStreams` : ```sh "main" #1 prio=5 os_prio=0 tid=0x00007f66d000c800 nid=0x78bb in Object.wait() [0x00007f66d7a15000] java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Thread.join(Thread.java:1249) - locked <0x000000077d45a590> (a java.lang.Thread) at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:503) - locked <0x000000077d335760> (a org.apache.kafka.streams.KafkaStreams) at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:447) at org.apache.kafka.streams.KafkaStreamsTest.testCannotStartOnceClosed(KafkaStreamsTest.java:115) ``` => causing a deadlock. Fixed this by softer locking on the state change, that guarantees atomic changes to the state but does not lock on the whole object (I at least could not find another method that would require more than atomicly-locked access except for `setState`). Also qualified the state listeners with their outer-class to make the whole code-flow around this more readable (having two interfaces with the same naming for interface and method and then using them between their two outer classes is crazy hard to read imo :)). Easy to reproduced yourself by running `org.apache.kafka.streams.KafkaStreamsTest` in a loop for a bit (save yourself some time by running 2-4 in parallel :)). Eventually it will lock on one of the tests (for me this takes less than 1 min with 4 parallel runs). Author: Armin Braun <me@obrown.io> Author: Armin <me@obrown.io> Reviewers: Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2791 from original-brownbear/fix-streams-deadlock	8 years ago
Apurva Mehta	bdf4cba047	KAFKA-4817; Add idempotent producer semantics This is from the KIP-98 proposal. The main points of discussion surround the correctness logic, particularly the Log class where incoming entries are validated and duplicates are dropped, and also the producer error handling to ensure that the semantics are sound from the users point of view. There is some subtlety in the idempotent producer semantics. This patch only guarantees idempotent production upto the point where an error has to be returned to the user. Once we hit a such a non-recoverable error, we can no longer guarantee message ordering nor idempotence without additional logic at the application level. In particular, if an application wants guaranteed message order without duplicates, then it needs to do the following in the error callback: 1. Close the producer so that no queued batches are sent. This is important for guaranteeing ordering. 2. Read the tail of the log to inspect the last message committed. This is important for avoiding duplicates. Author: Apurva Mehta <apurva@confluent.io> Author: hachikuji <jason@confluent.io> Author: Apurva Mehta <apurva.1618@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Author: fpj <fpj@apache.org> Author: Jason Gustafson <jason@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com> Closes #2735 from apurvam/exactly-once-idempotent-producer	8 years ago
shuguo zheng	1ce6aa5503	KAFKA-4964; Use correct keystore/trustore name in documentation Author: shuguo zheng <zheng.shuguo@zte.com.cn> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2749 from zhengsg/local	8 years ago
Eno Thereska	5f88cf79fb	MINOR: Increase max.poll time for streams consumers Author: Eno Thereska <eno@confluent.io> Reviewers: Damian Guy, Matthias J. Sax, Guozhang Wang Closes #2770 from enothereska/minor-increase-max-poll	8 years ago
Jason Gustafson	93d451ceeb	KAFKA-4689; Disable system tests for consumer hard failures See the JIRA for the full details. Essentially the test assertions depend on receiving reliable events from the consumer processes, but this is not generally possible in the presence of a hard failure (i.e. `kill -9`). Until we solve this problem, the hard failure scenarios will be turned off. Author: Jason Gustafson <jason@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2771 from hachikuji/KAFKA-4689	8 years ago
Bill Bejeck	15e0234a5f	KAFKA-4791: unable to add state store with regex matched topics Fix for adding state stores with regex defined sources Author: bbejeck <bbejeck@gmail.com> Reviewers: Matthias J. Sax, Damian Guy, Guozhang Wang Closes #2618 from bbejeck/KAFKA-4791_unable_to_add_statestore_regex_topics	8 years ago
Magnus Edenhill	4e92fd5f74	MINOR: Vagrant provisioning fixes Author: Magnus Edenhill <magnus@edenhill.se> Reviewers: Jason Gustafson <jason@confluent.io> Closes #2767 from edenhill/harden_provision	8 years ago
Jason Gustafson	dd71e4a8d8	MINOR: Ensure streaming iterator is closed by Fetcher Author: Jason Gustafson <jason@confluent.io> Author: Ismael Juma <github@juma.me.uk> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2762 from hachikuji/ensure-decompression-stream-closed	8 years ago
Eno Thereska	f9772d5fb2	MINOR: reduce amount of verbose printing Author: Eno Thereska <eno@confluent.io> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #2764 from enothereska/minor-remove-verboseprint	8 years ago
Matthias J. Sax	92b7d75700	KAFKA-4980: testReprocessingFromScratch unit test failure We got test error `org.apache.kafka.common.errors.TopicExistsException: Topic 'inputTopic' already exists.` in some builds. Can reproduce reliably at local machine. Root cause it async "topic delete" that might not be finished before topic gets re-created. Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Ismael Juma, Damian Guy, Guozhang Wang Closes #2757 from mjsax/minor-fix-resetintegrationtest	8 years ago
Vahid Hashemian	2e075fe6a4	MINOR: Update possible errors in OffsetFetchResponse Note: None of the use cases for offset fetch would lead to a `TOPIC_AUTHORIZATION_FAILED` error (fetching offset of an unauthorized partition would return an `UNKNOWN_TOPIC_OR_PARTITION` error). That is why it is being removed from the `PARTITION_ERRORS` list. Author: Vahid Hashemian <vahidhashemian@us.ibm.com> Reviewers: Jason Gustafson <jason@confluent.io> Closes #2653 from vahidhashemian/minor/update_possible_errors_in_offset_fetch_response	8 years ago
Dong Lin	4b3ea062be	KAFKA-4973; Fix transient failure of AdminClientTest.testDeleteRecordsWithException Author: Dong Lin <lindong28@gmail.com> Reviewers: Jiangjie Qin <becket.qin@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2760 from lindong28/KAFKA-4973	8 years ago
Vahid Hashemian	a3e13776e6	MINOR: Fix typos in javadoc and code comments Author: Vahid Hashemian <vahidhashemian@us.ibm.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2595 from vahidhashemian/minor/fix_typos_1702	8 years ago
Manikumar Reddy O	81721f8c53	MINOR: Doc change related to ZK sasl configs Author: Manikumar Reddy O <manikumar.reddy@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2620 from omkreddy/MINOR-ZK-CHANGE	8 years ago
Colin P. Mccabe	d345d53e4e	KAFKA-4902; Utils#delete should correctly handle I/O errors and symlinks Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2691 from cmccabe/KAFKA-4902	8 years ago
Kamal C	43fb2df7a4	MINOR: Map `mkString` format updated to default java format This is a minor change but it helps to improve the log readability. Author: Kamal C <kamal.chandraprakash@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2709 from Kamal15/util	8 years ago
Edoardo Comar	c808e8955f	MINOR: FetchRequest.Builder maxBytes for version <3 The maxBytes field should be set to DEFAULT_RESPONSE_MAX_BYTES, the same way as the constructor using the Struct does. codeveloped with mimaison Author: Edoardo Comar <ecomar@uk.ibm.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2694 from edoardocomar/MINOR-FetchRequest	8 years ago
Eno Thereska	84a14fec29	KAFKA-4843: More efficient round-robin scheduler - Improves streams efficiency by more than 200K requests/second (small 100 byte requests) - Gets streams efficiency very close to pure consumer (see results in https://jenkins.confluent.io/job/system-test-kafka-branch-builder/746/console) - Maintains same fairness across tasks - Schedules all records in the queue in-between poll() calls, not just one per task. Author: Eno Thereska <eno@confluent.io> Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy, Matthias J. Sax, Guozhang Wang Closes #2643 from enothereska/minor-schedule-round-robin	8 years ago
Ismael Juma	6feaa8a581	KAFKA-1449; Use CRC32C for checksum of V2 message format I manually tested that Crc32CTest and AbstractChecksums pass with JDK 9. I also verified that `Java9ChecksumFactory` is used in that case. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #2739 from ijuma/kafka-1449-crc32c	8 years ago
Colin P. Mccabe	aea5989d98	KAFKA-4944; Fix an "unread field" findbugs warning in streams examples Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Michael G. Noll <michael@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2727 from cmccabe/KAFKA-4944	8 years ago
Colin P. Mccabe	7adf1e4148	KAFKA-4945; Suppress findbugs warnings about machine-generated code in jmh-benchmarks Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2729 from cmccabe/KAFKA-4945	8 years ago
Colin P. Mccabe	42284960da	KAFKA-4903; Remove dead code in Shell Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2692 from cmccabe/KAFKA-4903	8 years ago
Ismael Juma	4ce65d65df	KAFKA-4574; Ignore test_zk_security_upgrade until KIP-101 lands The transient failures make it harder to spot real failures and we can live without what is being tested (adding security to ZK via a rolling upgrade) until KIP-101 lands. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Apurva Mehta <apurva@confluent.io>, Jun Rao <junrao@gmail.com> Closes #2742 from ijuma/disable-zk-upgrade-test	8 years ago
Jason Gustafson	a0b8e435c9	MINOR: Support streaming decompression of fetched records for new format Author: Jason Gustafson <jason@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #2738 from hachikuji/streaming-compressed-iterator	8 years ago
Onur Karaman	edb372dcaf	KAFKA-4959; Remove controller concurrent access to non-threadsafe NetworkClient, Selector, and SSLEngine This brought down a cluster by causing continuous controller moves. ZkClient's ZkEventThread and a RequestSendThread can concurrently use objects that aren't thread-safe: * Selector * NetworkClient * SSLEngine (this was the big one for us. We turn on SSL for interbroker communication). As per the "Concurrency Notes" section from https://docs.oracle.com/javase/7/docs/api/javax/net/ssl/SSLEngine.html: > two threads must not attempt to call the same method (either wrap() or unwrap()) concurrently SSLEngine.wrap gets called in: * SslTransportLayer.write * SslTransportLayer.handshake * SslTransportLayer.close It turns out that the ZkEventThread and RequestSendThread can concurrently call SSLEngine.wrap: * ZkEventThread calls SslTransportLayer.close from ControllerChannelManager.removeExistingBroker * RequestSendThread can call SslTransportLayer.write or SslTransportLayer.handshake from NetworkClient.poll Suppose the controller moves for whatever reason. The former controller could have had a RequestSendThread who was in the middle of sending out messages to the cluster while the ZkEventThread began executing KafkaController.onControllerResignation, which calls ControllerChannelManager.shutdown, which sequentially cleans up the controller-to-broker queue and connection for every broker in the cluster. This cleanup includes the call to ControllerChannelManager.removeExistingBroker as mentioned earlier, causing the concurrent call to SSLEngine.wrap. This concurrent call throws a BufferOverflowException which ControllerChannelManager.removeExistingBroker catches so the ControllerChannelManager.shutdown moves onto cleaning up the next controller-to-broker queue and connection, skipping the cleanup steps such as clearing the queue, stopping the RequestSendThread, and removing the entry from its brokerStateInfo map. By failing out of the Selector.close, the sensors corresponding to the broker connection has not been cleaned up. Any later attempt at initializing an identical Selector will result in a sensor collision and therefore cause Selector initialization to throw an exception. In other words, any later attempts by this broker to become controller again will fail on initialization. When controller initialization fails, the controller deletes the /controller znode and lets another broker take over. Now suppose the controller moves enough times such that every broker hits the BufferOverflowException concurrency issue. We're now guaranteed to fail controller initialization due to the sensor collision on every controller transition, so the controller will move across brokers continuously. This patch avoids the concurrent use of non-threadsafe classes in ControllerChannelManager.removeExistingBroker by shutting down the RequestSendThread before closing the NetworkClient. Author: Onur Karaman <okaraman@linkedin.com> Reviewers: Joel Koshy <jjkoshy.w@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2746 from onurkaraman/KAFKA-4959	8 years ago
Jiangjie Qin	23a0f09863	Minor: Remove the accidentally checked in file which broke checkStyle.	8 years ago
Dong Lin	8b05ad406d	KAFKA-4586; Add purgeDataBefore() API (KIP-107) Author: Dong Lin <lindong28@gmail.com> Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jiangjie Qin <becket.qin@gmail.com> Closes #2476 from lindong28/KAFKA-4586	8 years ago
Armin Braun	f3f9a9eafb	KAFKA-4569; Check for wakeup on every call to KafkaConsumer.poll Author: Armin Braun <me@obrown.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #2699 from original-brownbear/KAFKA-4569	8 years ago
Damian Guy	1abed91bd2	KAFKA-4881: add internal leave.group.on.close config to consumer Author: Damian Guy <damian.guy@gmail.com> Reviewers: Ismael Juma, Guozhang Wang Closes #2650 from dguy/consumer-leave-group-config	8 years ago
Ismael Juma	d27e09e60c	MINOR: Use method handles instead of reflection for creating Snappy and LZ4 streams 1. Use Initialization-on-demand holder idiom that relies on JVM lazy-loading instead of explicit initialization check. 2. Method handles were designed to be faster than Core Reflection, particularly if the method handle can be stored in a static final field (the JVM can then optimise the call as if it was a regular method call). Since the code is of similar complexity (and simpler if we consider the whole PR), I am treating this as a clean-up instead of a performance improvement (which would require doing benchmarks). 3. Remove unused `ByteBufferReceive`. 4. I removed the snappy library from the classpath and verified that `CompressionTypeTest` (which uses LZ4) still passes. This shows that the right level of laziness is achieved even if we use one of the lazily loaded compression algorithms. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #2740 from ijuma/use-method-handles-for-compressed-stream-supplier	8 years ago
Ismael Juma	d348ac92c8	MINOR: Fix deserialization of abortedTransactions and lastStableOffset in FetchResponse Thanks to Dong Lin for finding the lastStableOffset issue. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Dong Lin <lindong28@gmail.com>, Jason Gustafson <jason@confluent.io> Closes #2737 from ijuma/fix-fetch-response-lso	8 years ago
Jason Gustafson	462767660b	HOTFIX: Fix unsafe dependence on class name in VerifiableClientJava Author: Jason Gustafson <jason@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #2736 from hachikuji/hotfix-verifiable-clients	8 years ago
Jason Gustafson	5bd06f1d54	KAFKA-4816; Message format changes for idempotent/transactional producer (KIP-98) Author: Jason Gustafson <jason@confluent.io> Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2614 from hachikuji/exactly-once-message-format	8 years ago

1 2 3 4 5 ...

3395 Commits (afeadbef50ee8cb5c23de26c1b2a5ad2c7ad941e) All Branches Search

3395 Commits (afeadbef50ee8cb5c23de26c1b2a5ad2c7ad941e)

All Branches