In the `AddPartitionsToTxn` request handling, if even one partition fails authorization, the entire request effectively fails. However, today's `AddPartitionsToTxnResponse` contains error codes only for the partitions that failed authorization; the partitions that passed get no error code at all, making it inconsistent with other APIs.
This patch adds a new error code `OPERATION_NOT_ATTEMPTED` which is returned for the successful partitions to indicate that they were not added to the transaction.
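For illustration, a minimal sketch of the per-partition error assignment, with a hypothetical `authorized` predicate standing in for the broker's authorization check (this is not the actual broker code):
```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Predicate;

import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.common.protocol.Errors;

public class AddPartitionsErrorSketch {
    // Hypothetical helper: map every partition in the request to an error code.
    static Map<TopicPartition, Errors> partitionErrors(Iterable<TopicPartition> requested,
                                                       Predicate<TopicPartition> authorized) {
        Map<TopicPartition, Errors> errors = new HashMap<>();
        for (TopicPartition tp : requested) {
            if (!authorized.test(tp)) {
                errors.put(tp, Errors.TOPIC_AUTHORIZATION_FAILED);
            } else {
                // Previously these partitions were simply absent from the
                // response; now they are reported explicitly as not attempted.
                errors.put(tp, Errors.OPERATION_NOT_ATTEMPTED);
            }
        }
        return errors;
    }
}
```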
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
Closes#3204 from apurvam/KAFKA-5322-add-operation-not-attempted-for-add-partitions
We had originally increased Snappy’s block size as part of KAFKA-3704. However,
we had some issues with excessive memory usage in the producer and we reverted
it in 7c6ee8d5e.
After more investigation, we fixed the underlying reason why memory usage seemed
to grow much more than expected via KAFKA-3747 (included in 0.10.0.1).
In 0.10.2, we changed the broker to use the same classes as the producer and the
broker’s block size for Snappy was changed from 32 KB to 1 KB. As reported in
KAFKA-5236, the on-disk size is, in some cases, 50% larger when the data is compressed
with a 1 KB block size instead of 32 KB.
As discussed in KAFKA-3704, it may be worth making this configurable and/or allocating
the compression buffers from the producer pool. However, for 0.11.0.0, I think the
simplest thing to do is to default to 32 KB for Snappy (the default if no block size
is provided).
I also increased the Gzip buffer size. 1 KB is too small and the default is smaller
still (512 bytes). 8 KB (which is the default buffer size for BufferedOutputStream)
seemed like a reasonable default.
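For reference, these sizes map directly onto the underlying stream constructors (snappy-java's `SnappyOutputStream` and `java.util.zip.GZIPOutputStream`); the surrounding setup here is illustrative, not the actual producer code:
```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.zip.GZIPOutputStream;

import org.xerial.snappy.SnappyOutputStream;

public class CompressionBufferSizes {
    public static void main(String[] args) throws IOException {
        // Snappy: pass the 32 KB block size explicitly (also snappy-java's
        // own default when no block size is given).
        OutputStream snappy = new SnappyOutputStream(new ByteArrayOutputStream(), 32 * 1024);
        // Gzip: an 8 KB buffer instead of java.util.zip's 512-byte default.
        OutputStream gzip = new GZIPOutputStream(new ByteArrayOutputStream(), 8 * 1024);
        snappy.close();
        gzip.close();
    }
}
```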
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#3205 from ijuma/kafka-5236-snappy-block-size
This PR updates the processing of the console consumer's input properties.
For both the old and new consumer, a value provided for `auto.offset.reset` indirectly through the `consumer.config` or `consumer.property` arguments will now take effect.
For the new consumer, the precedence order for the `key.deserializer` and `value.deserializer` properties is fixed: first the value provided directly as an argument, then the value provided indirectly via `consumer.property`, then via `consumer.config`, and finally the default value.
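A sketch of that precedence chain, using a hypothetical `resolve` helper rather than the actual console consumer code:
```java
import java.util.Properties;

public class DeserializerPrecedence {
    // Hypothetical helper showing the fixed precedence for a deserializer
    // property: direct argument, then consumer.property, then consumer.config,
    // and finally the default.
    static String resolve(String key, String directArg,
                          Properties consumerProperty, Properties consumerConfig,
                          String defaultValue) {
        if (directArg != null)
            return directArg;
        String fromProperty = consumerProperty.getProperty(key);
        if (fromProperty != null)
            return fromProperty;
        String fromConfig = consumerConfig.getProperty(key);
        if (fromConfig != null)
            return fromConfig;
        return defaultValue;
    }
}
```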
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#1655 from vahidhashemian/KAFKA-3982
KAFKA-5327: ConsoleConsumer should manually commit offsets only for those records it has actually consumed. Currently it leaves this job to the automatic offset commit scheme, which can commit offsets for unread messages when `--max-messages` is set.
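A minimal sketch of the manual-commit approach, assuming `enable.auto.commit=false` in the consumer config; `consumeAtMost` is a hypothetical helper, not the actual ConsoleConsumer code:
```java
import java.util.Collections;

import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

public class ManualCommitSketch {
    static void consumeAtMost(KafkaConsumer<byte[], byte[]> consumer,
                              String topic, long maxMessages) {
        consumer.subscribe(Collections.singletonList(topic));
        long consumed = 0;
        while (consumed < maxMessages) {
            ConsumerRecords<byte[], byte[]> records = consumer.poll(1000L);
            for (ConsumerRecord<byte[], byte[]> record : records) {
                System.out.println(new String(record.value()));
                // Commit exactly what was printed: the committed offset is
                // the offset of the next record to read.
                consumer.commitSync(Collections.singletonMap(
                        new TopicPartition(record.topic(), record.partition()),
                        new OffsetAndMetadata(record.offset() + 1)));
                if (++consumed >= maxMessages)
                    return;
            }
        }
    }
}
```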
Author: amethystic <huxi_2b@hotmail.com>
Author: huxi <huxi_2b@hotmail.com>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#3148 from amethystic/KAFKA-5327_ConsoleConsumer_distable_autocommit
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
This patch had conflicts when merged, resolved by
Committer: Ismael Juma <ismael@juma.me.uk>
Closes#2328 from vahidhashemian/KAFKA-3264
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>
This patch had conflicts when merged, resolved by
Committer: Ismael Juma <ismael@juma.me.uk>
Closes#3129 from vahidhashemian/KAFKA-5282
- Producer sequence numbers should wrap around
- Generate a new producerId if the producer epoch would overflow
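A sketch of the two rules in plain arithmetic; the helper names are illustrative, but the wraparound below is one way to express how 31-bit sequence numbers behave:
```java
public class ProducerIdAndSequence {
    // Sequence numbers are 31-bit and wrap around rather than failing.
    static int incrementSequence(int sequence, int increment) {
        if (sequence > Integer.MAX_VALUE - increment)
            return increment - (Integer.MAX_VALUE - sequence) - 1; // wrapped
        return sequence + increment;
    }

    // The epoch is a short; once it reaches its maximum it cannot be bumped,
    // so a fresh producerId must be generated instead.
    static boolean epochWouldOverflow(short producerEpoch) {
        return producerEpoch == Short.MAX_VALUE;
    }
}
```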
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#3183 from hachikuji/KAFKA-5283
Author: Matthias J. Sax <matthias@confluent.io>
Reviewers: Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#3193 from mjsax/kafka-5361-add-eos-integration-tests-for-streams-api
More specifically, V2 messages are always batched (whether compressed or not) while
V0/V1 are only batched if they are compressed.
Clients like librdkafka expect to receive messages starting from the fetch offset when dealing with uncompressed V0/V1 messages. When converting from V2 to V0/V1, we were returning all the
messages in the V2 batch.
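Conceptually, the fix amounts to filtering out records below the fetch offset when unpacking a V2 batch; a hypothetical sketch (not the actual down-conversion code):
```java
import java.util.ArrayList;
import java.util.List;

import org.apache.kafka.common.record.Record;
import org.apache.kafka.common.record.RecordBatch;

public class DownConversionSketch {
    // Keep only the records at or beyond the fetch offset, so the resulting
    // V0/V1 message set starts where the client asked it to.
    static List<Record> recordsFromOffset(RecordBatch batch, long fetchOffset) {
        List<Record> result = new ArrayList<>();
        for (Record record : batch) {
            if (record.offset() >= fetchOffset)
                result.add(record);
        }
        return result;
    }
}
```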
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#3191 from ijuma/kafka-5360-down-converted-uncompressed-respect-offset
Without this patch, future client retries would get the `CONCURRENT_TRANSACTIONS` error code indefinitely, since the pending state wouldn't be cleared when the append to the log failed.
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#3184 from apurvam/KAFKA-5351-clear-pending-state-on-retriable-error
- Added a boolean `allow_auto_topic_creation` to MetadataRequest and
bumped the protocol version to V4.
- When connecting to brokers older than 0.11.0.0, the `allow_auto_topic_creation`
field won't be considered, so we send a metadata request for all topics
to keep the behavior consistent.
- Set `allow_auto_topic_creation` to false in the new AdminClient and
StreamsKafkaClient (which exists for the purpose of creating topics
manually); set it to true everywhere else for now. Other clients will eventually
rely on client-side auto topic creation, but that’s not there yet.
- Add `allowAutoTopicCreation` field to `Metadata`, which is used by
`DefaultMetadataUpdater`. This is not strictly needed for the new
`AdminClient`, but it avoids surprises if it ever adds a topic to `Metadata`
via `setTopics` or `addTopic`.
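For example, a client wanting to opt out of broker-side auto topic creation would build the request roughly like this; these are internal request classes, and the constructor shape shown follows the description above, so treat it as an assumption:
```java
import java.util.Collections;

import org.apache.kafka.common.requests.MetadataRequest;

public class MetadataWithoutAutoCreate {
    static MetadataRequest.Builder forTopic(String topic) {
        // v4+ of the Metadata API carries this flag; older brokers never see
        // it, which is why the client falls back to an all-topics request.
        return new MetadataRequest.Builder(Collections.singletonList(topic),
                false /* allowAutoTopicCreation */);
    }
}
```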
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#3098 from ijuma/kafka-5291-admin-client-no-auto-topic-creation
This makes the case where we build the records from scratch consistent
with the case where we update the batch header "in place". Thanks to
edenhill who found the issue while testing librdkafka.
The reason our tests don’t catch this is that we rely on the maxTimestamp
to compute the record level timestamps if log append time is used.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#3177 from ijuma/set-base-sequence-for-log-append-time
Without it, it's possible that the assertion is checked before the exception
is thrown in the callback.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Closes#3182 from ijuma/fix-controller-failover-flakiness
Also introduce TopicConfig.
Author: Colin P. Mccabe <cmccabe@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#3120 from cmccabe/KAFKA-5265
This should be less flaky as it has a higher timeout. I also increased the timeout
in a couple of other tests that had very low (100 ms) timeouts.
The failure would manifest itself as:
```text
java.net.SocketTimeoutException
at sun.nio.ch.SocketAdaptor$SocketInputStream.read(SocketAdaptor.java:229)
at sun.nio.ch.ChannelInputStream.read(ChannelInputStream.java:103)
at java.nio.channels.Channels$ReadableByteChannelImpl.read(Channels.java:385)
at org.apache.kafka.common.network.NetworkReceive.readFromReadableChannel(NetworkReceive.java:85)
at kafka.network.BlockingChannel.readCompletely(BlockingChannel.scala:129)
at kafka.network.BlockingChannel.receive(BlockingChannel.scala:120)
at kafka.consumer.SimpleConsumer.liftedTree1$1(SimpleConsumer.scala:100)
at kafka.consumer.SimpleConsumer.kafka$consumer$SimpleConsumer$$sendRequest(SimpleConsumer.scala:84)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SimpleConsumer.scala:133)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:133)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:133)
at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:31)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply$mcV$sp(SimpleConsumer.scala:132)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:132)
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:132)
at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:31)
at kafka.consumer.SimpleConsumer.fetch(SimpleConsumer.scala:131)
at kafka.api.test.ProducerCompressionTest.testCompression(ProducerCompressionTest.scala:97)
```
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Closes#3178 from ijuma/producer-compression-test-flaky
It sometimes fails in Jenkins like:
```text
java.lang.AssertionError: IllegalStateException was not thrown
at org.junit.Assert.fail(Assert.java:88)
at org.junit.Assert.assertTrue(Assert.java:41)
at kafka.controller.ControllerFailoverTest.testHandleIllegalStateException(ControllerFailoverTest.scala:86)
```
I ran it locally 100 times with no failure.
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>
Closes#3176 from ijuma/improve-controller-failover-assert
Return UNSUPPORTED_MESSAGE_FORMAT in handleWriteTxnMarkers when a topic does not have the correct message format.
Remove any TopicPartitions that have the same error from those waiting for markers.
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#3152 from dguy/kafka-5308
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Closes#3161 from hachikuji/KAFKA-5251
Also update the test to be simpler since we can use a mock event to simulate the issue
more easily (thanks Jun for the suggestion). This should fix two issues:
1. A transient test failure due to an NPE in ControllerFailoverTest.testMetadataUpdate:
```text
Caused by: java.lang.NullPointerException
at kafka.controller.ControllerBrokerRequestBatch.addUpdateMetadataRequestForBrokers(ControllerChannelManager.scala:338)
at kafka.controller.KafkaController.sendUpdateMetadataRequest(KafkaController.scala:975)
at kafka.controller.ControllerFailoverTest.testMetadataUpdate(ControllerFailoverTest.scala:141)
```
The test was creating an additional thread and it does not seem like it was doing the
appropriate synchronization (perhaps this became more of an issue after we changed
the Controller to be single-threaded and changed the locking).
2. Setting `activeControllerId.set(-1)` in `triggerControllerMove` causes `Reelect` not to invoke `onControllerResignation`. Among other things, this causes an `IllegalStateException` to be thrown when `KafkaScheduler.startup` is invoked for the second time without the corresponding `shutdown`. We now simply call `onControllerResignation` as part of `triggerControllerMove`.
Finally, I included a few clean-ups:
1. No longer update the broker state in `onControllerFailover`. This is no longer needed
since we removed the `RunningAsController` state (KAFKA-3761).
2. Trivial clean-ups in KafkaController
3. Removed unused parameter in `ZkUtils.getPartitionLeaderAndIsrForTopics`
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#2935 from ijuma/on-controller-resignation-if-trigger-controller-move
Here is the sketch of this proposal:
1. When it is time to send the txn markers, look up the leader node of the partition only once instead of retrying; if that information is not available, the partition has very likely been removed, since it was in the cache before. In that case, just remove the partition from the metadata object and skip putting it into the corresponding queue, and if no partition has an available leader broker, complete this delayed operation to proceed to write the complete txn log entry.
2. If the leader id is known from the cache but the corresponding node object with the listener name is not available, the leader is likely unavailable right now. Put the partition into a separate queue and let the sender thread retry fetching its metadata each time it drains the queue.
One caveat of this approach is the delete-and-recreate case; the argument is that since all the messages are deleted anyway when the topic-partition is deleted, it does not matter whether the markers end up on the log partitions or not.
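A hedged pseudo-Java rendering of the draining rules above; `PendingMarker`, `leaderId`, `nodeFor`, and `sendTo` are all hypothetical stand-ins, not the actual sender code:
```java
import java.util.Optional;
import java.util.Queue;

public class MarkerDrainSketch {
    interface PendingMarker {
        String partition();
        void complete(); // drop the partition from the pending set
    }

    Optional<Integer> leaderId(String partition) { return Optional.empty(); }
    Object nodeFor(int brokerId) { return null; }
    void sendTo(int brokerId, PendingMarker marker) {}

    void drain(Queue<PendingMarker> queue, Queue<PendingMarker> retryQueue) {
        while (!queue.isEmpty()) {
            PendingMarker marker = queue.poll();
            Optional<Integer> leader = leaderId(marker.partition());
            if (!leader.isPresent()) {
                // No leader in the cache: the partition was very likely
                // deleted, so drop it rather than retrying forever.
                marker.complete();
            } else if (nodeFor(leader.get()) == null) {
                // Leader id known but node unresolved: park the marker and
                // retry fetching metadata on the next drain pass.
                retryQueue.add(marker);
            } else {
                sendTo(leader.get(), marker);
            }
        }
    }
}
```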
Author: Guozhang Wang <wangguoz@gmail.com>
Reviewers: Apurva Mehta <apurva@confluent.io>, Damian Guy <damian.guy@gmail.com>, Jason Gustafson <jason@confluent.io>
Closes#3130 from guozhangwang/K5202-handle-topic-deletion
KAFKA-4603: fix command parse errors.
Using `new OptionParser()` might result in parse errors.
Change all the OptionParser constructor calls in Kafka to `new OptionParser(false)`.
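For example (jopt-simple's boolean constructor argument controls whether unambiguous abbreviations of long options are recognized):
```java
import joptsimple.OptionParser;
import joptsimple.OptionSet;

public class ParserExample {
    public static void main(String[] args) {
        // Passing false tells jopt-simple not to recognize abbreviations of
        // long options, so e.g. "--top" no longer silently matches "--topic".
        OptionParser parser = new OptionParser(false);
        parser.accepts("topic").withRequiredArg();
        OptionSet options = parser.parse(args);
        System.out.println(options.valueOf("topic"));
    }
}
```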
Author: xinlihua <xin.lihua1@zte.com.cn>
Author: unknown <00067310@A23338408.zte.intra>
Author: auroraxlh <xin.lihua1@zte.com.cn>
Author: xin <xin.lihua1@zte.com.cn>
Reviewers: Damian Guy, Guozhang Wang
Closes#2349 from auroraxlh/fix_OptionParser_bug
In the original implementation, the console consumer fails to honor the `--value-deserializer` config.
Author: amethystic <huxi_2b@hotmail.com>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#3100 from amethystic/KAFKA-5278
Keep track of when a transaction has begun by setting a flag, `transactionStarted`, when a successful `AddPartitionsToTxnResponse` or `AddOffsetsToTxnResponse` has been received. If an `AbortTxnRequest` is about to be sent and `transactionStarted` is false, don't send the request and transition the state to `READY`.
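A self-contained sketch of the flag's life cycle; names and states are illustrative, not the actual client code:
```java
public class TransactionFlagSketch {
    enum State { READY, IN_TRANSACTION, ABORTING }

    private boolean transactionStarted = false;
    private State state = State.IN_TRANSACTION;

    // Called on a successful AddPartitionsToTxn/AddOffsetsToTxn response.
    void onPartitionsAdded() {
        transactionStarted = true;
    }

    // Returns whether an abort request actually needs to go to the broker.
    boolean beginAbort() {
        if (!transactionStarted) {
            state = State.READY; // nothing was added, nothing to abort
            return false;
        }
        state = State.ABORTING;
        return true;
    }
}
```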
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>
Closes#3126 from dguy/kafka-5260
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Closes#3142 from hachikuji/KAFKA-5316
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#3147 from vahidhashemian/minor/remove_unsed_method_parameter_simpleaclauthorizer
Add a check in `KafkaApis` that the inter-broker protocol version is at least `KAFKA_0_11_0_IV0`, i.e., that it supports transactions.
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
Closes#3103 from dguy/kafka-5128
The previous code did not handle this correctly if a batch was
compacted more than once.
Also add a test case for the duplicate check after log cleaning and
improve various comments.
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Ismael Juma <ismael@juma.me.uk>
Closes#3145 from hachikuji/minor-improve-base-sequence-docs
Remove transactions that have not been updated for at least `transactional.id.expiration.ms`.
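A simplified illustration of the expiration sweep; the real coordinator also checks the transaction's state and writes tombstones to the transaction log, which is omitted here:
```java
import java.util.Iterator;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class TransactionalIdExpiration {
    private final Map<String, Long> lastUpdateMs = new ConcurrentHashMap<>();
    private final long expirationMs; // transactional.id.expiration.ms

    TransactionalIdExpiration(long expirationMs) {
        this.expirationMs = expirationMs;
    }

    void touch(String transactionalId, long nowMs) {
        lastUpdateMs.put(transactionalId, nowMs);
    }

    // Periodic sweep: forget ids that have been idle past the expiration.
    void expireIdleTransactionalIds(long nowMs) {
        Iterator<Map.Entry<String, Long>> it = lastUpdateMs.entrySet().iterator();
        while (it.hasNext()) {
            if (nowMs - it.next().getValue() >= expirationMs)
                it.remove();
        }
    }
}
```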
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Apurva Mehta, Guozhang Wang
Closes#3101 from dguy/kafka-5279
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Closes#3123 from hachikuji/KAFKA-4935
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#3133 from hachikuji/minor-replica-manager-append-refactor
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Apurva Mehta <apurva@confluent.io>, Jun Rao <junrao@gmail.com>
Closes#3075 from hachikuji/KAFKA-5259-FIXED
Clarify the consumer group command help message around `zookeeper`, `bootstrap-server`, and `new-consumer` options.
Author: Vahid Hashemian <vahidhashemian@us.ibm.com>
Reviewers: Jason Gustafson <jason@confluent.io>
Closes#2046 from vahidhashemian/minor/improve_consumer_group_command_doc
Before this patch the consumer would return the cached offsets for partitions in its current assignment. This worked when all the offset commits went through the consumer.
With KIP-98, offsets can be committed transactionally through the producer. This means that relying on cached positions in the consumer returns incorrect information: since commits go through the producer, the cache is never updated.
Hence we need to update the `KafkaConsumer.committed` method to always look up the last committed offset on the server, ensuring it gets the correct information every time.
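Usage stays the same on the caller's side; only the lookup behavior changes:
```java
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

public class CommittedLookup {
    // committed() now always round-trips to the coordinator, so offsets
    // committed transactionally through a producer are also visible here.
    static long lastCommitted(KafkaConsumer<?, ?> consumer, TopicPartition tp) {
        OffsetAndMetadata committed = consumer.committed(tp);
        return committed == null ? -1L : committed.offset();
    }
}
```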
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Jason Gustafson, Guozhang Wang
Closes#3119 from apurvam/KAFKA-5273-kafkaconsumer-committed-should-always-hit-server
Author: Ismael Juma <ismael@juma.me.uk>
Reviewers: Jun Rao <junrao@gmail.com>, Onur Karaman <okaraman@linkedin.com>
Closes#2983 from ijuma/kafka-5135-controller-health-metrics-kip-143
This ticket is all about ControllerContext initialization and teardown. The key points are:
1. We should tear down ControllerContext during resignation instead of waiting for election to fix it up. A heap dump shows that the former controller keeps pretty much all of its ControllerContext state lying around.
2. We don't properly tear down/reset ControllerContext.partitionsBeingReassigned. This can cause problems when the former controller becomes re-elected as controller at a later point in time.
Suppose a partition assignment is initially R0. Now suppose a reassignment R1 gets stuck during controller C0 and an admin tries to "undo" R1 (by deleting /admin/partitions_reassigned, deleting /controller, and submitting another reassignment specifying R0). The new controller C1 may succeed with R0. If the controller moves back to C0, it will then reattempt R1 even though that partition reassignment has been cleared from zookeeper prior to shifting the controller back to C0. This results in the actual partition reassignment in zookeeper being unexpectedly changed back to R1.
Author: Onur Karaman <okaraman@linkedin.com>
Reviewers: Jun Rao <junrao@gmail.com>
Closes#3122 from onurkaraman/KAFKA-5310
Two major changes plus one minor change:
0. Change stateLock to a read-write lock.
1. Put the check of "isCoordinator" and "coordinatorLoading" together with the return of the metadata under one read-lock block, since otherwise we can get incorrect behavior if the metadata cache changes after the check but before the metadata is accessed.
2. Grab the read lock right before trying to append to the local txn log and hold it until the local append returns; this avoids the scenario where the epoch has actually changed while we are appending to the local log (e.g. emigration followed by immigration).
3. Only watch on txnId instead of txnId plus txnPartitionId in the txn marker purgatory, and disable the reaper thread, as we can now safely clear all the delayed operations by traversing the marker queues.
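A rough Java rendering of change 1 above; the actual coordinator is Scala, and all names here are illustrative:
```java
import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

public class CoordinatorLockSketch {
    static class TxnMetadata {}

    private final ReadWriteLock stateLock = new ReentrantReadWriteLock();
    private volatile boolean loading = false;

    // Hypothetical cache lookup.
    private TxnMetadata lookup(String transactionalId) { return new TxnMetadata(); }

    // The coordinator/loading checks and the metadata read share one read
    // lock, so they observe a single consistent snapshot of the cache.
    TxnMetadata transactionState(String transactionalId) {
        stateLock.readLock().lock();
        try {
            if (loading)
                throw new IllegalStateException("coordinator is loading");
            return lookup(transactionalId);
        } finally {
            stateLock.readLock().unlock();
        }
    }
}
```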
Author: Guozhang Wang <wangguoz@gmail.com>
Reviewers: Jason Gustafson, Jun Rao
Closes#3082 from guozhangwang/K5231-read-write-lock
`TransactionCoordinatorIntegrationTest` is not covering anything that isn't already covered by the more complete `TransactionsTest`.
Author: Damian Guy <damian.guy@gmail.com>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#3128 from dguy/minor-remove-test
With this patch, offset commits are always materialized according to the order of the commit records in the offsets topic.
Before this patch, transactional offset commits were materialized in transaction order. However, the log cleaner will always preserve the record with the greatest offset. This meant that if there was a mix of offset commits from a consumer and a transactional producer, then we would switch from transactional order to offset order after cleaning, resulting in an inconsistent state.
Author: Apurva Mehta <apurva@confluent.io>
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>
Closes#3108 from apurvam/KAFKA-5247-materialize-committed-offsets-in-offset-order
Author: Jason Gustafson <jason@confluent.io>
Reviewers: Guozhang Wang <wangguoz@gmail.com>
Closes#3118 from hachikuji/disallow-transactional-idempotent-downconversion