junrao added a config `--print.metrics` to control whether ProducerPerformance prints out metrics at the end of the test. If it's okay, will add the code counterpart for the consumer.
Author: huxi <huxi@zhenrongbao.com>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2860 from amethystic/kafka-5068_print_metrics_in_perf_tests
Test the various controller protocols by observing ZooKeeper and broker state.
Author: Onur Karaman <okaraman@linkedin.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Closes#2853 from onurkaraman/KAFKA-5069
This PR covers point (2) and point (5) from KAFKA-5036:
**Commit 1:**
2. Currently, we update the leader epoch in epochCache after log append in the follower but before log append in the leader. It would be more consistent to always do this after log append. This also avoids issues related to failure in log append.
5. The constructor of LeaderEpochFileCache has the following:
```scala
lock synchronized { ListBuffer(checkpoint.read(): _*) }
```
But everywhere else uses a read or write lock. We should use consistent locking.
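A minimal sketch of what consistent locking could look like (names here are illustrative, not the actual LeaderEpochFileCache code): the constructor-time load takes the same write lock as later updates, and reads take the read lock.
```scala
import java.util.concurrent.locks.ReentrantReadWriteLock
import scala.collection.mutable.ListBuffer

// Illustrative sketch only: one ReentrantReadWriteLock guards every access to
// the cached epochs, including the initial load in the constructor.
class EpochCacheSketch(load: () => Seq[Int]) {
  private val lock = new ReentrantReadWriteLock()

  private def inWriteLock[T](body: => T): T = {
    lock.writeLock().lock()
    try body finally lock.writeLock().unlock()
  }

  private def inReadLock[T](body: => T): T = {
    lock.readLock().lock()
    try body finally lock.readLock().unlock()
  }

  // Constructor-time load now uses the write lock, consistent with assign().
  private val epochs: ListBuffer[Int] = inWriteLock(ListBuffer(load(): _*))

  def latestEpoch: Option[Int] = inReadLock(epochs.lastOption)

  def assign(epoch: Int): Unit = inWriteLock { epochs += epoch }
}
```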
This is a refactor of the way epochs are cached: the code that cached the latest epoch in the LeaderEpochFileCache is replaced by reusing the cached value in Partition. There is no functional change.
**Commit 2:**
Adds an `assert(epoch >= 0)` as epochs are written. Refactors the tests so they never hit this assert.
Author: Ben Stopford <benstopford@gmail.com>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2831 from benstopford/KAFKA-5036-part2-second-try
Author: Dong Lin <lindong28@gmail.com>
Author: Dong Lin <lindong28@users.noreply.github.com>
Reviewers: Jiangjie Qin <becket.qin@gmail.com>
Closes#2859 from lindong28/KAFKA-5075
Also include a few code readability improvements.
Author: jozi-k <jozef.koval@protonmail.ch>
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2731 from jozi-k/immutable_LeaderAndIsr
This property is mentioned in the quickstart.
Author: huxi <huxi@zhenrongbao.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2661 from amethystic/kafka4866_consoleconsumer_ignore_printvalue
Also:
1. FindCoordinator is more general and takes a coordinator_type
so that it can be used for the group and transaction coordinators.
2. Include an error message in FindCoordinatorResponse to make the
errors at the client side more informative. We have just added the
field to the protocol in this PR, a subsequent PR will update the
code to use it.
3. Rename `Errors` names for FindCoordinator to be more generic. This
is a compatible change as the ids remain the same.
4. Since the exception classes for the error codes are in a public
package, we introduce new ones and deprecate the old ones.
The classes were not thrown back to the user (KAFKA-5052 aside),
so this is a compatible change.
5. Update InitPidRequest for transactions. Since this protocol API
was introduced recently and is not used by default, we did not bump
its version.
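As a rough illustration of points 1 and 2 above, the request and response could be modelled along the following lines (field and type names are illustrative sketches, not the exact wire-protocol schema):
```scala
// Sketch only: a coordinator_type distinguishes group from transaction lookups,
// and the response carries an error message alongside the error code.
object CoordinatorType extends Enumeration {
  val Group = Value(0)       // find the group coordinator
  val Transaction = Value(1) // find the transaction coordinator
}

case class FindCoordinatorRequestSketch(
  coordinatorKey: String,                // group id or transactional id
  coordinatorType: CoordinatorType.Value
)

case class FindCoordinatorResponseSketch(
  errorCode: Short,
  errorMessage: Option[String], // newly added field for more informative errors
  nodeId: Int,
  host: String,
  port: Int
)
```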
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2825 from apurvam/exactly-once-rpc-stubs
In 67fc2a91a6, we are using an empty collection and comparing via
value equality, so if a user passes an empty collection, they will
get the default ACLs instead of no ACLs. We fix that issue here.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Rajini Sivaram
Closes#2829 from ijuma/zk-utils-default-acls-improvement and squashes the following commits:
0846172 [Ismael Juma] Add missing import
2dc84f3 [Ismael Juma] Simplify logic in `sensitivePath`
8122f27 [Ismael Juma] Use a true sentinel instead of an empty collection for `UseDefaultAcls`
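A minimal sketch of the sentinel idea from the last commit above (types and names are illustrative, not the actual ZkUtils API): an empty ACL list must mean "no ACLs", so "use the defaults" gets its own marker value rather than an empty collection compared by value equality.
```scala
object ZkAclSentinelSketch {
  final case class Acl(scheme: String, id: String, perms: Int)

  sealed trait AclSpec
  case object UseDefaultAcls extends AclSpec              // explicit sentinel
  final case class ExplicitAcls(acls: Seq[Acl]) extends AclSpec

  def resolveAcls(spec: AclSpec, defaults: Seq[Acl]): Seq[Acl] = spec match {
    case UseDefaultAcls     => defaults
    case ExplicitAcls(acls) => acls // an empty list here really means no ACLs
  }
}
```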
Also add a test and refactor things a little to make testing easier.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Ben Stopford <benstopford@gmail.com>, Jun Rao <junrao@gmail.com>
Closes#2822 from ijuma/hotfix-checkpoint-file
- Consistent validation across different code paths in LogValidator
- Validate baseOffset for message format V2
- Flesh out LogValidatorTest to check producerId, baseSequence, producerEpoch and partitionLeaderEpoch.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2802 from ijuma/validate-base-offset
Author: Colin P. Mccabe <cmccabe@confluent.io>
Reviewers: Jozef Koval <jozef.koval@protonmail.ch>, Ismael Juma <ismael@juma.me.uk>
Closes#2687 from cmccabe/KAFKA-4899
Author: Matthias J. Sax <matthias@confluent.io>
Author: Guozhang Wang <wangguoz@gmail.com>
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Apurva Mehta <apurva@confluent.io>, Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2799 from mjsax/kafka-4990-add-api-stub-config-parameters-request-types
This PR replaces https://github.com/apache/kafka/pull/2743 (just raising from Confluent repo)
This PR describes the addition of Partition Level Leader Epochs to messages in Kafka as a mechanism for fixing some known issues in the replication protocol. Full details can be found here:
[KIP-101 Reference](https://cwiki.apache.org/confluence/display/KAFKA/KIP-101+-+Alter+Replication+Protocol+to+use+Leader+Epoch+rather+than+High+Watermark+for+Truncation)
*The key elements are*:
- Epochs are stamped on messages as they enter the leader.
- Epochs are tracked in both leader and follower in a new checkpoint file.
- A new API allows followers to retrieve the leader's latest offset for a particular epoch.
- The logic for truncating the log, when a replica becomes a follower, has been moved from Partition into the ReplicaFetcherThread.
- When partitions are added to the ReplicaFetcherThread they are added in an initialising state. Initialising partitions request leader epochs and then truncate their logs appropriately.
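As a rough, simplified sketch of the truncation decision this enables (illustrative names, not the actual ReplicaFetcherThread code): the follower asks the leader for the end offset of its latest epoch and truncates to the smaller of that offset and its own log end offset.
```scala
object EpochTruncationSketch {
  // The leader's answer to "what is the last offset for this epoch?"
  case class EpochEndOffset(leaderEpoch: Int, endOffset: Long)

  def truncationOffset(followerLogEndOffset: Long, leaderAnswer: EpochEndOffset): Long =
    math.min(followerLogEndOffset, leaderAnswer.endOffset)
}
```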
The test `EpochDrivenReplicationProtocolAcceptanceTest.shouldFollowLeaderEpochBasicWorkflow()` provides a good overview of the workflow.
The corrupted log use case is covered by the test
`EpochDrivenReplicationProtocolAcceptanceTest.offsetsShouldNotGoBackwards()`
Remaining work: there is a to-do list here: https://docs.google.com/document/d/1edmMo70MfHEZH9x38OQfTWsHr7UGTvg-NOxeFhOeRew/edit?usp=sharing
Author: Ben Stopford <benstopford@gmail.com>
Author: Jun Rao <junrao@gmail.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Closes#2808 from benstopford/kip-101-v2
Though MirrorMaker uses the `producer.type` value from the producer properties, ProducerConfig shows the warning: `The configuration 'producer.type' was supplied but isn't a known config.`
Author: Shun Takebayashi <shun@takebayashi.asia>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2676 from takebayashi/suppress-mirrormaker-warning
Of particular importance are compression buffers (64 KB for LZ4, for example).
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2796 from apurvam/idempotent-producer-close-data-stream
This is from the KIP-98 proposal.
The main points of discussion surround the correctness logic, particularly the Log class where incoming entries are validated and duplicates are dropped, and also the producer error handling to ensure that the semantics are sound from the user's point of view.
There is some subtlety in the idempotent producer semantics. This patch only guarantees idempotent production up to the point where an error has to be returned to the user. Once we hit such a non-recoverable error, we can no longer guarantee message ordering or idempotence without additional logic at the application level.
In particular, if an application wants guaranteed message order without duplicates, then it needs to do the following in the error callback:
1. Close the producer so that no queued batches are sent. This is important for guaranteeing ordering.
2. Read the tail of the log to inspect the last message committed. This is important for avoiding duplicates.
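An illustrative sketch of that application-level handling (not part of the patch; the class name is made up): on a non-recoverable send error, close the producer immediately so queued batches are not sent, then inspect the tail of the log before resuming.
```scala
import java.util.concurrent.TimeUnit
import org.apache.kafka.clients.producer.{Callback, KafkaProducer, RecordMetadata}

// Sketch only: fail fast in the send callback to preserve ordering.
class FailFastCallback(producer: KafkaProducer[String, String]) extends Callback {
  override def onCompletion(metadata: RecordMetadata, exception: Exception): Unit = {
    if (exception != null) {
      producer.close(0, TimeUnit.MILLISECONDS) // 1. stop sending queued batches
      // 2. the application would now read the tail of the log (e.g. with a
      //    consumer) to find the last committed message and avoid duplicates
    }
  }
}
```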
Author: Apurva Mehta <apurva@confluent.io>
Author: hachikuji <jason@confluent.io>
Author: Apurva Mehta <apurva.1618@gmail.com>
Author: Guozhang Wang <wangguoz@gmail.com>
Author: fpj <fpj@apache.org>
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Closes#2735 from apurvam/exactly-once-idempotent-producer
Author: Dong Lin <lindong28@gmail.com>
Reviewers: Jiangjie Qin <becket.qin@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2760 from lindong28/KAFKA-4973
This brought down a cluster by causing continuous controller moves.
ZkClient's ZkEventThread and a RequestSendThread can concurrently use objects that aren't thread-safe:
* Selector
* NetworkClient
* SSLEngine (this was the big one for us. We turn on SSL for interbroker communication).
As per the "Concurrency Notes" section from https://docs.oracle.com/javase/7/docs/api/javax/net/ssl/SSLEngine.html:
> two threads must not attempt to call the same method (either wrap() or unwrap()) concurrently
SSLEngine.wrap gets called in:
* SslTransportLayer.write
* SslTransportLayer.handshake
* SslTransportLayer.close
It turns out that the ZkEventThread and RequestSendThread can concurrently call SSLEngine.wrap:
* ZkEventThread calls SslTransportLayer.close from ControllerChannelManager.removeExistingBroker
* RequestSendThread can call SslTransportLayer.write or SslTransportLayer.handshake from NetworkClient.poll
Suppose the controller moves for whatever reason. The former controller could have had a RequestSendThread that
was in the middle of sending out messages to the cluster while the ZkEventThread began executing
KafkaController.onControllerResignation, which calls ControllerChannelManager.shutdown, which sequentially cleans
up the controller-to-broker queue and connection for every broker in the cluster. This cleanup includes the call
to ControllerChannelManager.removeExistingBroker as mentioned earlier, causing the concurrent call to SSLEngine.wrap.
This concurrent call throws a BufferOverflowException, which ControllerChannelManager.removeExistingBroker catches, so
ControllerChannelManager.shutdown moves on to cleaning up the next controller-to-broker queue and connection,
skipping the cleanup steps such as clearing the queue, stopping the RequestSendThread, and removing the entry from its
brokerStateInfo map.
By failing out of the Selector.close, the sensors corresponding to the broker connection have not been cleaned up. Any
later attempt at initializing an identical Selector will result in a sensor collision and therefore cause Selector
initialization to throw an exception. In other words, any later attempts by this broker to become controller again
will fail on initialization. When controller initialization fails, the controller deletes the /controller znode and
lets another broker take over.
Now suppose the controller moves enough times such that every broker hits the BufferOverflowException concurrency
issue. We're now guaranteed to fail controller initialization due to the sensor collision on every controller
transition, so the controller will move across brokers continuously.
This patch avoids the concurrent use of non-threadsafe classes in ControllerChannelManager.removeExistingBroker
by shutting down the RequestSendThread before closing the NetworkClient.
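A rough sketch of that reordering (illustrative names, not the actual ControllerChannelManager code): join the sender thread first, so the ZkEventThread and the RequestSendThread can never call SSLEngine.wrap concurrently.
```scala
object RemoveBrokerSketch {
  trait SenderThread { def shutdown(): Unit } // joins the thread before returning

  final case class BrokerState(sender: SenderThread, networkClient: AutoCloseable)

  def removeExistingBroker(state: BrokerState): Unit = {
    state.sender.shutdown()     // stop the sender first: no more poll()/write() calls
    state.networkClient.close() // now only one thread can touch the SSLEngine
  }
}
```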
Author: Onur Karaman <okaraman@linkedin.com>
Reviewers: Joel Koshy <jjkoshy.w@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2746 from onurkaraman/KAFKA-4959
Author: Dong Lin <lindong28@gmail.com>
Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jiangjie Qin <becket.qin@gmail.com>
Closes#2476 from lindong28/KAFKA-4586
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2614 from hachikuji/exactly-once-message-format
This uses JUnit Categories to identify integration tests. Adds 2 new build targets:
`integrationTest` and `unitTest`.
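For illustration, a test tagged for the new `integrationTest` target might look roughly like this (the marker trait and test class names are assumptions, not taken from the PR):
```scala
import org.junit.Test
import org.junit.experimental.categories.Category

trait IntegrationTest // marker interface the Gradle category filter would reference

@Category(Array(classOf[IntegrationTest]))
class BrokerRoundTripIntegrationTest {
  @Test
  def roundTrip(): Unit = {
    // spin up a broker, produce, consume, assert ...
  }
}
```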
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Eno Thereska <eno@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Closes#2695 from dguy/junit-categories
Fixes related to the handling of MAX_POLL_INTERVAL_MS_CONFIG during deadlock, and to CommitFailedException when partitions are revoked.
Author: Sachin Mittal <sjmittal@gmail.com>
Reviewers: Matthias J. Sax, Damian Guy, Guozhang Wang
Closes#2642 from sjmittal/trunk
This problem is hard to debug otherwise, as the error
returned to the client (“Coordinator not available”) is not
very informative.
Author: Eno Thereska <eno.thereska@gmail.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2652 from enothereska/minor-warning-enforcement-offset
If request logging is enabled, `ProduceRequest` can be accessed
and mutated concurrently from a network thread (which calls
`toString`) and a request handler thread (which calls
`clearPartitionRecords()`).
That can lead to a `ConcurrentModificationException` when iterating
the `partitionRecords` map.
The underlying thread-safety issue has existed since the server
started using the Java implementation of ProduceRequest in 0.10.0.
However, we were incorrectly not clearing the underlying struct until
0.10.2, so `toString` itself was thread-safe until that change. In 0.10.2,
`toString` is no longer thread-safe and we could potentially see a
`NullPointerException` given the right set of interleavings between
`toString` and `clearPartitionRecords` although we haven't seen that
happen yet.
In trunk, we changed the requests to have a `toStruct` method
instead of creating a struct in the constructor and `toString` was
no longer printing the contents of the `Struct`. This accidentally
fixed the race condition, but it meant that request logging was less
useful.
A couple of days ago, `AbstractRequest.toString` was changed to
print the contents of the request by calling `toStruct().toString()`
and reintroduced the race condition. The impact is more visible
because we iterate over a `HashMap`, which proactively
checks for concurrent modification (unlike arrays).
We will need a separate PR for 0.10.2.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jiangjie Qin <becket.qin@gmail.com>, Onur Karaman <okaraman@linkedin.com>, Jun Rao <junrao@gmail.com>
Closes#2689 from ijuma/produce-request-thread-safety
The record should be created with CreateTime (like in the producer). The conversion to
LogAppendTime happens automatically (if necessary).
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#2657 from ijuma/kafka-4861-log-append-time-breaks-group-data-manager
Increase the reliability of the one temporal comparison in ReassignPartitionsClusterTest by imposing a delay after ZK is updated. This should be more reliable than just increasing the amount of data.
This relates to a previous PR: https://github.com/apache/kafka/pull/1982
Author: Ben Stopford <benstopford@gmail.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#1997 from benstopford/KAFKA-4266
A couple of updates were missed in the [PR](https://github.com/apache/kafka/pull/2475) that replaced the use of error codes with Errors objects.
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2635 from vahidhashemian/minor/Errors_refactoring_leftover
* Turned off Nagle on the sending sockets to force the socket to physically acknowledge after the first write in `sendRequest`
* Added a `200ms` delay between write attempts (necessary on Linux, but not Mac)
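For illustration, the two changes amount to something like the following (a sketch, not the exact test code):
```scala
import java.net.Socket

object SendRequestSketch {
  def sendRequest(socket: Socket, payload: Array[Byte]): Unit = {
    socket.setTcpNoDelay(true)            // turn off Nagle on the sending socket
    socket.getOutputStream.write(payload) // first write is flushed immediately
    socket.getOutputStream.flush()
    Thread.sleep(200)                     // delay between write attempts
  }
}
```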
Author: Armin Braun <armin.braun@1und1.de>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2632 from original-brownbear/KAFKA-3182
This applies to new-consumer-based groups and would avoid scenarios in which a user issues a `--describe` query while the group is initializing.
Example: The following could occur for a newly created group.
```
kafkakafka:~/workspace/kafka$ bin/kafka-consumer-groups.sh --bootstrap-server localhost:9092 --describe --group g
Note: This will only show information about consumers that use the Java consumer API (non-ZooKeeper-based consumers).
Error: Executing consumer group command failed due to The group coordinator is not available.
```
With this PR, the group is queried repeatedly at specific intervals within a preset (and configurable) timeout `group-init-timeout` to circumvent unfortunate situations like the one above.
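The retry behaviour can be sketched as a simple loop (names, signature, and intervals here are illustrative, not the actual tool code):
```scala
import scala.annotation.tailrec

object DescribeGroupRetrySketch {
  // Keep asking until the coordinator answers or the timeout expires.
  def describeWithRetry[T](timeoutMs: Long, intervalMs: Long)(describe: () => Option[T]): Option[T] = {
    val deadline = System.currentTimeMillis() + timeoutMs
    @tailrec
    def loop(): Option[T] = describe() match {
      case found @ Some(_) => found
      case None if System.currentTimeMillis() < deadline =>
        Thread.sleep(intervalMs)
        loop()
      case None => None
    }
    loop()
  }
}
```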
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#2538 from vahidhashemian/KAFKA-2857
If the leader node of one or more partitions in a consumer subscription is temporarily unavailable, request a metadata refresh so that partitions skipped for assignment don't have to wait for metadata expiry before reassignment. A metadata refresh is also requested if a subscribed topic or an assigned partition doesn't exist.
Author: Rajini Sivaram <rajinisivaram@googlemail.com>
Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
Closes#2622 from rajinisivaram/KAFKA-4631
The fetcherThreadMap was keyed off brokerId + fetcherId instead of broker + fetcherId, which did not consider the case where the port changes.
Author: huxi <huxi@zhenrongbao.com>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2606 from amethystic/kafka4811_ReplicaFetchThread_fail_create
Author: Matthias J. Sax <matthias@confluent.io>
Reviewers: Jason Gustafson <jason@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io>
Closes#2303 from mjsax/licenseHeader
It’s a simple matter of creating the internal topic before trying to send
to it. Otherwise, we could get an `UnknownTopicOrPartitionException`
in some cases.
Without the change, I could reproduce a failure in less than
5 runs. With the change, 30 consecutive runs succeeded.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Apurva Mehta <apurva.1618@gmail.com>, Jason Gustafson <jason@confluent.io>
Closes#2584 from ijuma/test-cannot-send-to-internal-topic-transient-failure
It contained this step:
```scala
val canShutdown = isShuttingDown.compareAndSet(false, true)
if (canShutdown && shutdownLatch.getCount > 0) {
```
without any fallback for the case of `shutdownLatch.getCount == 0`. So in the case
of `shutdownLatch.getCount == 0` (when a previous call to the shutdown method
was right about to finish) you would set `isShuttingDown` to true again without any
possibility of ever getting the server started (since `startup` will check
`isShuttingDown` before setting up a new latch with count 1).
Long story short: concurrent calls to shutdown can get the server locked in a broken state.
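One way to avoid the lock-up, sketched here for illustration (the actual fix may differ in detail), is to check the latch before flipping the flag, so a late concurrent call becomes a no-op:
```scala
import java.util.concurrent.CountDownLatch
import java.util.concurrent.atomic.AtomicBoolean

object ShutdownGuardSketch {
  private val isShuttingDown = new AtomicBoolean(false)
  @volatile private var shutdownLatch = new CountDownLatch(1)

  def shutdown(): Unit = {
    // Only mark the server as shutting down if there is still a live instance
    // to shut down; otherwise leave the flag untouched so startup() can run.
    if (shutdownLatch.getCount > 0 && isShuttingDown.compareAndSet(false, true)) {
      // ... perform the shutdown ...
      shutdownLatch.countDown()
    }
  }
}
```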
This fixes the reported error:
```
java.lang.IllegalStateException: Kafka server is still shutting down, cannot re-start!
    at kafka.server.KafkaServer.startup(KafkaServer.scala:184)
    at kafka.integration.KafkaServerTestHarness$$anonfun$restartDeadBrokers$2.apply$mcVI$sp(KafkaServerTestHarness.scala:117)
    at kafka.integration.KafkaServerTestHarness$$anonfun$restartDeadBrokers$2.apply(KafkaServerTestHarness.scala:116)
    at kafka.integration.KafkaServerTestHarness$$anonfun$restartDeadBrokers$2.apply(KafkaServerTestHarness.scala:116)
    at scala.collection.TraversableLike$WithFilter$$anonfun$foreach$1.apply(TraversableLike.scala:733)
    at scala.collection.immutable.Range.foreach(Range.scala:160)
    at scala.collection.TraversableLike$WithFilter.foreach(TraversableLike.scala:732)
    at kafka.integration.KafkaServerTestHarness$class.restartDeadBrokers(KafkaServerTestHarness.scala:116)
    at kafka.api.ConsumerBounceTest.restartDeadBrokers(ConsumerBounceTest.scala:34)
    at kafka.api.ConsumerBounceTest$BounceBrokerScheduler.doWork(ConsumerBounceTest.scala:158)
```
Author: Armin Braun <me@obrown.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#2568 from original-brownbear/KAFKA-4198
The intent is good, but it needs to take into account broker configs as well.
See KAFKA-4788 for more details.
This reverts commit 4ca5abe8ee.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2588 from ijuma/kafka-4788
Also move the `requireTimestamp` to `minVersion` logic from `Fetcher` to
`ListOffsetRequest.Builder.forConsumer()`.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Jason Gustafson <jason@confluent.io>
Closes#2580 from ijuma/move-proto-utils-to-api-keys
Author: Colin P. Mccabe <cmccabe@confluent.io>
Reviewers: Jason Gustafson <jason@confluent.io>, Dong Lin <lindong28@gmail.com>, Ismael Juma <ismael@juma.me.uk>
Closes#2489 from cmccabe/KAFKA-4708
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Closes#2543 from hachikuji/remove-message-writer
Fixed ClassCastException resulting from missing type hint in request logging.
Author: Armin Braun <me@obrown.io>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#2571 from original-brownbear/fix-logging-err-response
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Ewen Cheslack-Postava <me@ewencp.org>
Closes#2566 from hachikuji/hotfix-request-logging