src-kafka

Commit Graph

Author	SHA1	Message	Date
Jason Gustafson	b2cb6caa1e	MINOR: Move `KafkaYammerMetrics` to server-common (#11970 ) With major server components like the new quorum controller being moved outside of the `core` module, it is useful to have shared dependencies moved into `server-common`. An example of this is Yammer metrics which server components still rely heavily upon. All server components should have access to the default registry used by the broker so that new metrics can be registered and metric naming conventions should be standardized. This is particularly important in KRaft where we are attempting to recreate identically named metrics in the controller context. This patch takes a step in this direction. It moves `KafkaYammerMetrics` into `server-common` and it implements standard metric naming utilities there. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	3 years ago
Idan Kamara	eddb98df67	MINOR: Fix class comparison in `AlterConfigPolicy.RequestMetadata.equals()` (#11900 ) This patch fixes a bug in the `AlterConfigPolicy.RequestMetadata.equals` method where we were not comparing the class correctly. Co-authored-by: David Jacot <djacot@confluent.io> Reviewers: David Jacot <djacot@confluent.io>	3 years ago
Colin Patrick McCabe	07553d13f7	MINOR: create KafkaConfigSchema and TimelineObject (#11809 ) Create KafkaConfigSchema to encapsulate the concept of determining the types of configuration keys. This is useful in the controller because we can't import KafkaConfig, which is part of core. Also introduce the TimelineObject class, which is a more generic version of TimelineInteger / TimelineLong. Reviewers: David Arthur <mumrah@gmail.com>	3 years ago
Chris Egerton	6bef673197	KAFKA-10000: Add new metrics for source task transactions (#11772 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	3 years ago
Wenjun Ruan	760e6f3741	Add license header in suppressions.xml (#11753 ) Add license header in suppressions.xml Reviewers: Luke Chen <showuon@gmail.com>	3 years ago
Mickael Maison	0269edfc80	KAFKA-13577: Replace easymock with mockito in kafka:core - part 3 (#11674 ) Reviewers: Tom Bentley <tbentley@redhat.com>	3 years ago
Colin Patrick McCabe	d35283f011	KAFKA-13646; Implement KIP-801: KRaft authorizer (#11649 ) Currently, when using KRaft mode, users still have to have an Apache ZooKeeper instance if they want to use AclAuthorizer. We should have a built-in Authorizer for KRaft mode that does not depend on ZooKeeper. This PR introduces such an authorizer, called StandardAuthorizer. See KIP-801 for a full description of the new Authorizer design. Authorizer.java: add aclCount API as described in KIP-801. StandardAuthorizer is currently the only authorizer that implements it, but eventually we may implement it for AclAuthorizer and others as well. ControllerApis.scala: fix a bug where createPartitions was authorized using CREATE on the topic resource rather than ALTER on the topic resource as it should have been. QuorumTestHarness: rename the controller endpoint to CONTROLLER for consistency (the brokers already called it that). This is relevant in AuthorizerIntegrationTest where we are examining endpoint names. Also add the controllerServers call. TestUtils.scala: adapt the ACL functions to be usable from KRaft, by ensuring that they use the Authorizer from the current active controller. BrokerMetadataPublisher.scala: add broker-side ACL application logic. Controller.java: add ACL APIs. Also add a findAllTopicIds API in order to make junit tests that use KafkaServerTestHarness#getTopicNames and KafkaServerTestHarness#getTopicIds work smoothly. AuthorizerIntegrationTest.scala: convert over testAuthorizationWithTopicExisting (more to come soon) QuorumController.java: add logic for replaying ACL-based records. This means storing them in the new AclControlManager object, and integrating them into controller snapshots. It also means applying the changes in the Authorizer, if one is configured. In renounce, when reverting to a snapshot, also set newBytesSinceLastSnapshot to 0. Reviewers: YeonCheol Jang <YeonCheolGit@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>	3 years ago
John Roesler	7ef8701cca	KAFKA-13553: add PAPI KV store tests for IQv2 (#11624 ) During some recent reviews, @mjsax pointed out that StateStore layers are constructed differently the stores are added via the PAPI vs. the DSL. This PR adds KeyValueStore PAPI construction to the IQv2StoreIntegrationTest so that we can ensure IQv2 works on every possible state store. Reviewers: Patrick Stuedi <pstuedi@apache.org>, Guozhang Wang <guozhang@apache.org>	3 years ago
Colin Patrick McCabe	454d63c158	KAFKA-13515: Fix KRaft config validation issues (#11577 ) Require that topics exist before topic configurations can be created for them. Merge the code from ConfigurationControlManager#checkConfigResource into ControllerConfigurationValidator to avoid duplication. Add KRaft support to DynamicConfigChangeTest. Split out tests in DynamicConfigChangeTest that don't require a cluster into DynamicConfigChangeUnitTest to save test time. Reviewers: David Arthur <mumrah@gmail.com>	3 years ago
Mickael Maison	d565c969f3	MINOR: Refactor RequestResponseTest (#11393 ) - Ensure we test all requests and responses - Code cleanups Reviewers: Tom Bentley <tbentley@redhat.com>, Luke Chen <showuon@gmail.com>	3 years ago
John Roesler	14c2449050	KAFKA-13491: IQv2 framework (#11557 ) Implements the major part of the IQv2 framework as proposed in KIP-796. Reviewers: Patrick Stuedi <pstuedi@apache.org>, Vicky Papavasileiou <vpapavasileiou@confluent.io>, Bruno Cadonnna <cadonna@apache.org>	3 years ago
Justine Olshan	e8818e234a	KAFKA-13111: Re-evaluate Fetch Sessions when using topic IDs (#11331 ) With the changes for topic IDs, we have a different flow. When a broker receives a request, it uses a map to convert the topic ID to topic names. If the topic ID is not found in the map, we return a top level error and close the session. This decision was motivated by the difficulty to store “unresolved” partitions in the session. In earlier iterations we stored an “unresolved” partition object in the cache, but it was somewhat hard to reason about and required extra logic to try to resolve the topic ID on each incremental request and add to the session. It also required extra logic to forget the topic (either by topic ID if the topic name was never known or by topic name if it was finally resolved when we wanted to remove from the session.) One helpful simplifying factor is that we only allow one type of request (uses topic ID or does not use topic ID) in the session. That means we can rely on a session continuing to have the same information. We don’t have to worry about converting topics only known by name to topic ID for a response and we won’t need to convert topics only known by ID to name for a response. This PR introduces a change to store the "unresolved partitions" in the cached partition object. If a version 13+ request is sent with a topic ID that is unknown, a cached partition will be created with that fetch request data and a null topic name. On subsequent incremental requests, unresolved partitions may be resolved with the new IDs found in the metadata cache. When handling the request, getting all partitions will return a TopicIdPartition object that will be used to handle the request and build the response. Since we can rely on only one type of request (with IDs or without), the cached partitions map will have different keys depending on what fetch request version is being used. This PR involves changes both in FetchSessionHandler and FetchSession. Some major changes are outlined below. 1. FetchSessionHandler: Forgetting a topic and adding a new topic with the same name - We may have a case where there is a topic foo with ID 1 in the session. Upon a subsequent metadata update, we may have topic foo with ID 2. This means that topic foo has been deleted and recreated. When sending fetch requests version 13+ we will send a request to add foo ID 2 to the session and remove foo ID 1. Otherwise, we will fall back to the same behavior for versions 12 and below 2. FetchSession: Resolving in Incremental Sessions - Incremental sessions contain two distinct sets of partitions. Partitions that are sent in the latest request that are new/updates/forgotten partitions and the partitions already in the session. If we want to resolve unknown topic IDs we will need to handle both cases. * Partitions in the request - These partitions are either new or updating/forgetting previous partitions in the session. The new partitions are trivial. We either have a resolved partition or create a partition that is unresolved. For the other cases, we need to be a bit more careful. * For updated partitions we have a few cases – keep in mind, we may not programmatically know if a partition is an update: 1. partition in session is resolved, update is resolved: trivial  2. partition in session is unresolved, update is unresolved: in code, this is equivalent to the case above, so trivial as well  3. partition in session is unresolved, update is resolved: this means the partition in the session does not have a name, but the metadata cache now contains the name – to fix this we can check if there exists a cached partition with the given ID and update it both with the partition update and with the topic name.  4. partition in session is resolved, update is unresolved: this means the partition in the session has a name, but the update was unable to be resolved (ie, the topic is deleted) – this is the odd case. We will look up the partition using the ID. We will find the old version with a name but will not replace the name. This will lead to an UNKNOWN_TOPIC_OR_PARTITION or INCONSISTENT_TOPIC_ID error which will be handled with a metadata update. Likely a future request will forget the partition, and we will be able to do so by ID.  5. Two partitions in the session have IDs, but they are different: only one topic ID should exist in the metadata at a time, so likely only one topic ID is in the fetch set. The other one should be in the toForget. We will be able to remove this partition from the session. If for some reason, we don't try to forget this partition — one of the partitions in the session will cause an inconsistent topic ID error and the metadata for this partition will be refreshed — this should result in the old ID being removed from the session. This should not happen if the FetchSessionHandler is correctly in sync.  * For the forgotten partitions we have the same cases: 1. partition in session is resolved, forgotten is resolved: trivial  2. partition in session is unresolved, forgotten is unresolved: in code, this is equivalent to the case above, so trivial as well  3. partition in session is unresolved, forgotten is resolved: this means the partition in the session does not have a name, but the metadata cache now contains the name – to fix this we can check if there exists a cached partition with the given ID and try to forget it before we check the resolved name case.  4. partition in session is resolved, update is unresolved: this means the partition in the session has a name, but the update was unable to be resolved (ie, the topic is deleted) We will look up the partition using the ID. We will find the old version with a name and be able to delete it.  5. both partitions in the session have IDs, but they are different: This should be the same case as described above. If we somehow do not have the ID in the session, no partition will be removed. This should not happen unless the Fetch Session Handler is out of sync.  * Partitions in the session - there may be some partitions in the session already that are unresolved. We can resolve them in forEachPartition using a method that checks if the partition is unresolved and tries to resolve it using a topicName map from the request. The partition will be resolved before the function using the cached partition is applied. Reviewers: David Jacot <djacot@confluent.io>	3 years ago
Kirk True	7b379539a5	KAFKA-13202: KIP-768: Extend SASL/OAUTHBEARER with Support for OIDC (#11284 ) This task is to provide a concrete implementation of the interfaces defined in KIP-255 to allow Kafka to connect to an OAuth/OIDC identity provider for authentication and token retrieval. While KIP-255 provides an unsecured JWT example for development, this will fill in the gap and provide a production-grade implementation. The OAuth/OIDC work will allow out-of-the-box configuration by any Apache Kafka users to connect to an external identity provider service (e.g. Okta, Auth0, Azure, etc.). The code will implement the standard OAuth client credentials grant type. The proposed change is largely composed of a pair of AuthenticateCallbackHandler implementations: one to login on the client and one to validate on the broker. See the following for more detail: KIP-768 KAFKA-13202 Reviewers: Yi Ding <dingyi.zj@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	3 years ago
dengziming	37b3f8c15b	MINOR: MetadataShell should handle ClientQuotaRecord (#11339 ) MetadataShell should handle ClientQuotaRecord. Also, add MetadataNodeManagerTest. Reviewers: Colin P. McCabe <cmccabe@apache.org>	3 years ago
José Armando García Sancio	da58d75c43	MINOR: Fix highest offset when loading KRaft metadata snapshots (#11386 ) When loading a snapshot the broker BrokerMetadataListener was using the batch's append time, offset and epoch. These are not the same as the append time, offset and epoch from the log. This PR fixes it to instead use the lastContainedLogTimeStamp, lastContainedLogOffset and lastContainedLogEpoch from the SnapshotReader. This PR refactors the MetadataImage and MetadataDelta to include an offset and epoch. It also swaps the order of the arguments for ReplicaManager.applyDelta, in order to be more consistent with MetadataPublisher.publish. Reviewers: Colin P. McCabe <cmccabe@apache.org>	3 years ago
Satish Duggana	34d56dc8d0	KAFKA-12802 Added a file based cache for consumed remote log metadata for each partition to avoid consuming again incase of broker restarts. (#11058 ) Added snapshots for consumed remote log metadata for each partition to avoid consuming again in case of broker restarts. These snapshots are stored in the respective topic partition log directories. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Cong Ding <cong@ccding.com>, Jun Rao <junrao@gmail.com>	3 years ago
Colin Patrick McCabe	3f3a0e0d9e	KAFKA-13280: Avoid O(N) behavior in KRaftMetadataCache#topicNamesToIds (#11311 ) Avoid O(N) behavior in KRaftMetadataCache#topicNamesToIds and KRaftMetadataCache#topicIdsToNames by returning a map subclass that exposes the TopicsImage data structures without copying them. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>	3 years ago
Colin Patrick McCabe	85548acafb	KAFKA-13279: allow CreateTopicsPolicy, AlterConfigsPolicy in KRaft mode (#11310 ) Add support for CreateTopicsPolicy and AlterConfigsPolicy when running in KRaft mode. Reviewers: David Arthur <mumrah@gmail.com>, Niket Goel <ngoel@confluent.io>	3 years ago
Colin Patrick McCabe	074a3dacca	MINOR: Make ReplicaManager, LogManager, KafkaApis easier to construct (#11320 ) The ReplicaManager, LogManager, and KafkaApis class all have many constructor parameters. It is often difficult to add or remove a parameter, since there are so many locations that need to be updated. In order to address this problem, we should use named parameters when constructing these objects from Scala code. This will make it easy to add new optional parameters without modifying many test cases. It will also make it easier to read git diffs and PRs, since the parameters will have names next to them. Since Java does not support named paramters, this PR adds several Builder classes which can be used to achieve the same effect. ReplicaManager also had a secondary constructor, which this PR removes. The function of the secondary constructor was just to provide some default parameters for the main constructor. However, it is simpler just to actually use default parameters. Reviewers: David Arthur <mumrah@gmail.com>	3 years ago
Matthew Wong	6c80643009	[KAFKA-8522] Streamline tombstone and transaction marker removal (#10914 ) This PR aims to remove tombstones that persist indefinitely due to low throughput. Previously, deleteHorizon was calculated from the segment's last modified time. In this PR, the deleteHorizon will now be tracked in the baseTimestamp of RecordBatches. After the first cleaning pass that finds a record batch with tombstones, the record batch is recopied with deleteHorizon flag and a new baseTimestamp that is the deleteHorizonMs. The records in the batch are rebuilt with relative timestamps based on the deleteHorizonMs that is recorded. Later cleaning passes will be able to remove tombstones more accurately on their deleteHorizon due to the individual time tracking on record batches. KIP 534: https://cwiki.apache.org/confluence/display/KAFKA/KIP-534%3A+Retain+tombstones+and+transaction+markers+for+approximately+delete.retention.ms+milliseconds Co-authored-by: Ted Yu <yuzhihong@gmail.com> Co-authored-by: Richard Yu <yohan.richard.yu@gmail.com>	3 years ago
Rohan	01ab888dbd	KAFKA-13229: add total blocked time metric to streams (KIP-761) (#11149 ) * Add the following producer metrics: flush-time-total: cumulative sum of time elapsed during in flush. txn-init-time-total: cumulative sum of time elapsed during in initTransactions. txn-begin-time-total: cumulative sum of time elapsed during in beginTransaction. txn-send-offsets-time-total: cumulative sum of time elapsed during in sendOffsetsToTransaction. txn-commit-time-total: cumulative sum of time elapsed during in commitTransaction. txn-abort-time-total: cumulative sum of time elapsed during in abortTransaction. * Add the following consumer metrics: commited-time-total: cumulative sum of time elapsed during in committed. commit-sync-time-total: cumulative sum of time elapsed during in commitSync. * Add a total-blocked-time metric to streams that is the sum of: consumer’s io-waittime-total consumer’s iotime-total consumer’s committed-time-total consumer’s commit-sync-time-total restore consumer’s io-waittime-total restore consumer’s iotime-total admin client’s io-waittime-total admin client’s iotime-total producer’s bufferpool-wait-time-total producer's flush-time-total producer's txn-init-time-total producer's txn-begin-time-total producer's txn-send-offsets-time-total producer's txn-commit-time-total producer's txn-abort-time-total Reviewers: Bruno Cadonna <cadonna@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	3 years ago
Josep Prat	83f0ae3821	KAFKA-12862: Update Scala fmt library and apply fixes (#10784 ) Updates the scala fmt to the latest stable version. Applies all the style fixes (all source code changes are done by scala fmt). Removes setting about dangling parentheses as `true` is already the default. Reviewer: John Roesler <john@confluent.io>	3 years ago
A. Sophie Blee-Goldman	6854eb8332	KAFKA-12648: Pt. 3 - addNamedTopology API (#10788 ) Pt. 1: #10609 Pt. 2: #10683 Pt. 3: #10788 In Pt. 3 we implement the addNamedTopology API. This can be used to update the processing topology of a running Kafka Streams application without resetting the app, or even pausing/restarting the process. It's up to the user to ensure that this API is called on every instance of an application to ensure all clients are able to run the newly added NamedTopology. Reviewers: Guozhang Wang <guozhang@confluent.io>, Walker Carlson <wcarlson@confluent.io>	3 years ago
A. Sophie Blee-Goldman	4710a49146	KAFKA-12648: Pt. 2 - Introduce TopologyMetadata to wrap InternalTopologyBuilders of named topologies (#10683 ) Pt. 1: #10609 Pt. 2: #10683 Pt. 3: #10788 The TopologyMetadata is next up after Pt. 1 #10609. This PR sets up the basic architecture for running an app with multiple NamedTopologies, though the APIs to add/remove them dynamically are not implemented until Pt. 3 Reviewers: Guozhang Wang <guozhang@confluent.io>, Walker Carlson <wcarlson@confluent.io>	3 years ago
José Armando García Sancio	55d9acad65	KAFKA-13113; Support unregistering Raft listeners (#11109 ) This patch adds support for unregistering listeners to `RaftClient`. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Jason Gustafson <jason@confluent.io>	3 years ago
Satish Duggana	e8ce93bd53	KAFKA-9555 Added default RLMM implementation based on internal topic storage. (#10579 ) KAFKA-9555 Added default RLMM implementation based on internal topic storage. This is the initial version of the default RLMM implementation. This includes changes containing default RLMM configs, RLMM implementation, producer/consumer managers. Introduced TopicBasedRemoteLogMetadataManagerHarness which takes care of bringing up a Kafka cluster and create remote log metadata topic and initializes TopicBasedRemoteLogMetadataManager. Refactored existing RemoteLogMetadataCacheTest to RemoteLogSegmentLifecycleTest to have parameterized tests to run both RemoteLogMetadataCache and also TopicBasedRemoteLogMetadataManager. Refactored existing InmemoryRemoteLogMetadataManagerTest, RemoteLogMetadataManagerTest to have parameterized tests to run both InmemoryRemoteLogMetadataManager and also TopicBasedRemoteLogMetadataManager. This is part of tiered storage KIP-405 efforts. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Cong Ding <cong@ccding.com>, Jun Rao <junrao@gmail.com>	3 years ago
Colin Patrick McCabe	e07de97a4c	KAFKA-12803: Support reassigning partitions when in KRaft mode (#10753 ) Support the KIP-455 reassignment API when in KRaft mode. Reassignments which merely rearrange partitions complete immediately. Those that only remove a partition complete immediately if the ISR would be non-empty after the specified removals. Reassignments that add one or more partitions follow the KIP-455 pattern of adding all the adding replicas to the replica set, and then waiting for the ISR to include all the new partitions before completing. Changes to the partition sets are accomplished via PartitionChangeRecord. Reviewers: Jun Rao <junrao@gmail.com>	3 years ago
A. Sophie Blee-Goldman	37d086fa2a	KAFKA-12984: make AbstractStickyAssignor resilient to invalid input, utilize generation in cooperative, and fix assignment bug (#10985 ) 1) Bring the generation field back to the CooperativeStickyAssignor so we don't need to rely so heavily on the ConsumerCoordinator properly updating its SubscriptionState after eg falling out of the group. The plain StickyAssignor always used the generation since it had to, so we just make sure the CooperativeStickyAssignor has this tool as well 2) In case of unforeseen problems or further bugs that slip past the generation field safety net, the assignor will now explicitly look out for partitions that are being claimed by multiple consumers as owned in the same generation. Such a case should never occur, but if it does, we have to invalidate this partition from the ownedPartitions of both consumers, since we can't tell who, if anyone, has the valid claim to this partition. 3) Fix a subtle bug that I discovered while writing tests for the above two fixes: in the constrained algorithm, we compute the exact number of partitions each consumer should end up with, and keep track of the "unfilled" members who must -- or might -- require more partitions to hit their quota. The problem was that members at the minQuota were being considered as "unfilled" even after we had already hit the maximum number of consumers allowed to go up to the maxQuota, meaning those minQuota members could/should not accept any more partitions beyond that. I believe this was introduced in #10509, so it shouldn't be in any released versions and does not need to be backported. Reviewers: Guozhang Wang <guozhang@apache.org>, Luke Chen <showuon@gmail.com>	3 years ago
Justine Olshan	2b8aff58b5	KAFKA-10580: Add topic ID support to Fetch request (#9944 ) Updated FetchRequest and FetchResponse to use topic IDs rather than topic names. Some of the complicated code is found in FetchSession and FetchSessionHandler. We need to be able to store topic IDs and maintain a cache on the broker for IDs that may not have been resolved. On incremental fetch requests, we will try to resolve them or remove them if in toForget. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>	3 years ago
Sanjana Kaundinya	e00c0f3719	KAFKA-12234: Implement request/response for offsetFetch batching (KIP-709) (#10962 ) This implements the request and response portion of KIP-709. It updates the OffsetFetch request and response to support fetching offsets for multiple consumer groups at a time. If the broker does not support the new OffsetFetch version, clients can revert to the previous behaviour and use a request for each coordinator. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Konstantine Karantasis <konstantine@confluent.io>	3 years ago
Colin Patrick McCabe	b4e45cd0d2	KAFKA-13019: Add MetadataImage and MetadataDelta classes for KRaft Snapshots (#10949 ) Create the image/ module for storing, reading, and writing broker metadata images. Metadata images are immutable. New images are produced from existing images using delta classes. Delta classes are mutable, and represent changes to a base image. MetadataImage objects can be converted to lists of KRaft metadata records. This is essentially writing a KRaft snapshot. The resulting snapshot can be read back into a MetadataDelta object. In practice, we will typically read the snapshot, and then read a few more records to get fully up to date. After that, the MetadataDelta can be converted to a MetadataImage as usual. Sometimes, we have to load a snapshot even though we already have an existing non-empty MetadataImage. We would do this if the broker fell too far behind and needed to receive a snapshot to catch up. This is handled just like the normal snapshot loading process. Anything that is not in the snapshot will be marked as deleted in the MetadataDelta once finishSnapshot() is called. In addition to being used for reading and writing snapshots, MetadataImage also serves as a cache for broker information in memory. A follow-up PR will replace MetadataCache, CachedConfigRepository, and the client quotas cache with the corresponding Image classes. TopicsDelta also replaces the "deferred partition" state that the RaftReplicaManager currently implements. (That change is also in a follow-up PR.) Reviewers: Jason Gustafson <jason@confluent.io>, David Arthur <mumrah@gmail.com>	3 years ago
kpatelatwork	527ba111c7	KAFKA-4793: Connect API to restart connector and tasks (KIP-745) (#10822 ) Implements KIP-745 https://cwiki.apache.org/confluence/display/KAFKA/KIP-745%3A+Connect+API+to+restart+connector+and+tasks to change connector REST API to restart a connector and its tasks as a whole. Testing strategy - [x] Unit tests added for all possible combinations of onlyFailed and includeTasks - [x] Integration tests added for all possible combinations of onlyFailed and includeTasks - [x] System tests for happy path Reviewers: Randall Hauch <rhauch@gmail.com>, Diego Erdody <erdody@gmail.com>, Konstantine Karantasis <k.karantasis@gmail.com>	3 years ago
Niket	d3ec9f940c	KAFKA-12952 Add header and footer records for raft snapshots (#10899 ) Add header and footer records for raft snapshots. This helps identify when the snapshot starts and ends. The header also contains a time. The time field is currently set to 0. KAFKA-12997 will add in the necessary wiring to use the correct timestamp. Reviewers: Jose Sancio <jsancio@gmail.com>, Colin P. McCabe <cmccabe@apache.org>	3 years ago
Jason Gustafson	299eea88a5	KAFKA-12870; Flush in progress not cleared after transaction completion (#10880 ) We had been using `RecordAccumulator.beginFlush` in order to force the `RecordAccumulator` to flush pending batches when a transaction was being completed. Internally, `RecordAccumulator` has a simple counter for the number of flushes in progress. The count gets incremented in `beginFlush` and it is expected to be decremented by `awaitFlushCompletion`. The second call to decrement the counter never happened in the transactional path, so the counter could get stuck at a positive value, which means that the linger time would effectively be ignored. This patch fixes the problem by removing the use of `beginFlush` in `Sender`. Instead, we now add an additional condition in `RecordAccumulator` to explicitly check when a transaction is being completed. Reviewers: Guozhang Wang <wangguoz@gmail.com>	3 years ago
Satish Duggana	56250f446a	KAFKA-12816 Added tiered storage related configs including remote log manager configs. (#10733 ) Added tiered storage related configs including remote log manager configs. Added local log retention configs to LogConfig. Added tests for the added configs. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	3 years ago
Ismael Juma	d27a84f70c	KAFKA-12945: Remove port, host.name and related configs in 3.0 (#10872 ) They have been deprecated since 0.10.0. Full list of removes configs: * port * host.name * advertised.port * advertised.host.name Also adjust tests to take the removals into account. Some tests were no longer relevant and have been removed. Finally, took the chance to: * Clean up unnecessary usage of `KafkaConfig$.MODULE$` in related files. * Add missing `Test` annotations to `AdvertiseBrokerTest` and make necessary changes for the tests to pass. Reviewers: David Jacot <djacot@confluent.io>, Luke Chen <showuon@gmail.com>	3 years ago
José Armando García Sancio	b67a77d5b9	KAFKA-12787; Integrate controller snapshoting with raft client (#10786 ) Directly use `RaftClient.Listener`, `SnapshotWriter` and `SnapshotReader` in the quorum controller. 1. Allow `RaftClient` users to create snapshots by specifying the last committed offset and last committed epoch. These values are validated against the log and leader epoch cache. 2. Remove duplicate classes in the metadata module for writing and reading snapshots. 3. Changed the logic for comparing snapshots. The old logic was assuming a certain batch grouping. This didn't match the implementation of the snapshot writer. The snapshot writer is free to merge batches before writing them. 4. Improve `LocalLogManager` to keep track of multiple snapshots. 5. Improve the documentation and API for the snapshot classes to highlight the distinction between the offset of batches in the snapshot vs the offset of batches in the log. These two offsets are independent of one another. `SnapshotWriter` and `SnapshotReader` expose a method called `lastOffsetFromLog` which represents the last inclusive offset from the log that is represented in the snapshot. Reviewers: dengziming <swzmdeng@163.com>, Jason Gustafson <jason@confluent.io>	3 years ago
A. Sophie Blee-Goldman	48379bd6e5	KAFKA-12648: Pt. 1 - Add NamedTopology to protocol and state directory structure (#10609 ) This PR includes adding the NamedTopology to the Subscription/AssignmentInfo, and to the StateDirectory so it can place NamedTopology tasks within the hierarchical structure with task directories under the NamedTopology parent dir. Reviewers: Walker Carlson <wcarlson@confluent.io>, Guozhang Wang <guozhang@confluent.io>	3 years ago
José Armando García Sancio	f50f13d781	KAFKA-12342: Remove MetaLogShim and use RaftClient directly (#10705 ) This patch removes the temporary shim layer we added to bridge the interface differences between MetaLogManager and RaftClient. Instead, we now use the RaftClient directly from the metadata module. This also means that the metadata gradle module now depends on raft, rather than the other way around. Finally, this PR also consolidates the handleResign and handleNewLeader APIs into a single handleLeaderChange API. Co-authored-by: Jason Gustafson <jason@confluent.io>	4 years ago
Colin Patrick McCabe	9e5b77fb96	KAFKA-12788: improve KRaft replica placement (#10494 ) Implement a striped replica placement algorithm for KRaft. This also means implementing rack awareness. Previously, KRraft just chose replicas randomly in a non-rack-aware fashion. Also, allow replicas to be placed on fenced brokers if there are no other choices. This was specified in KIP-631 but previously not implemented. Reviewers: Jun Rao <junrao@gmail.com>	4 years ago
Colin Patrick McCabe	f20fdbd839	KAFKA-12778: Fix QuorumController request timeouts and electLeaders (#10688 ) The QuorumController should honor the timeout for RPC requests which feature a timeout. For electLeaders, attempt to trigger a leader election for all partitions when the request specifies null for the topics argument. Reviewers: David Arthur <mumrah@gmail.com>	4 years ago
Daniyar Yeralin	6d1ae8bc00	KAFKA-8326: Introduce List Serde (#6592 ) Introduce List serde for primitive types or custom serdes with a serializer and a deserializer according to KIP-466 Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>, Matthias J. Sax <mjsax@conflunet.io>, John Roesler <roesler@confluent.io>, Michael Noll <michael@confluent.io>	4 years ago
Satish Duggana	7ef3879429	KAFKA-12758 Added `server-common` module to have server side common classes. (#10638 ) Added server-common module to have server side common classes. Moved ApiMessageAndVersion, RecordSerde, AbstractApiMessageSerde, and BytesApiMessageSerde to server-common module. Reivewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	4 years ago
Chris Egerton	9ba583f6d6	KAFKA-12252 and KAFKA-12262: Fix session key rotation when leadership changes (#10014 ) Author: Chris Egerton <chrise@confluent.io> Reviewers: Greg Harris <gregh@confluent.io>, Randall Hauch <rhauch@gmail.com>	4 years ago
Satish Duggana	a1367f57f5	KAFKA-12429: Added serdes for the default implementation of RLMM based on an internal topic as storage. (#10271 ) KAFKA-12429: Added serdes for the default implementation of RLMM based on an internal topic as storage. This topic will receive events of RemoteLogSegmentMetadata, RemoteLogSegmentUpdate, and RemotePartitionDeleteMetadata. These events are serialized into Kafka protocol message format. Added tests for all the event types for that topic. This is part of the tiered storaqe implementation KIP-405. Reivewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	4 years ago
Vito Jeng	816f5c3b86	KAFKA-5876: KIP-216 Part 3, Apply StreamsNotStartedException for Interactive Queries (#10597 ) KIP-216 Part 3: Throw StreamsNotStartedException if KafkaStreams state is CREATED Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>	4 years ago
José Armando García Sancio	6203bf8b94	KAFKA-12154; Raft Snapshot Loading API (#10085 ) Implement Raft Snapshot loading API. 1. Adds a new method `handleSnapshot` to `raft.Listener` which is called whenever the `RaftClient` determines that the `Listener` needs to load a new snapshot before reading the log. This happens when the `Listener`'s next offset is less than the log start offset also known as the earliest snapshot. 2. Adds a new type `SnapshotReader<T>` which provides a `Iterator<Batch<T>>` interface and de-serializes records in the `RawSnapshotReader` into `T`s 3. Adds a new type `RecordsIterator<T>` that implements an `Iterator<Batch<T>>` by scanning a `Records` object and deserializes the batches and records into `Batch<T>`. This type is used by both `SnapshotReader<T>` and `RecordsBatchReader<T>` internally to implement the `Iterator` interface that they expose. 4. Changes the `MockLog` implementation to read one or two batches at a time. The previous implementation always read from the given offset to the high-watermark. This made it impossible to test interesting snapshot loading scenarios. 5. Removed `throws IOException` from some methods. Some of types were inconsistently throwing `IOException` in some cases and throwing `RuntimeException(..., new IOException(...))` in others. This PR improves the consistent by wrapping `IOException` in `RuntimeException` in a few more places and replacing `Closeable` with `AutoCloseable`. 6. Updated the Kafka Raft simulation test to take into account snapshot. `ReplicatedCounter` was updated to generate snapshot after 10 records get committed. This means that the `ConsistentCommittedData` validation was extended to take snapshots into account. Also added a new invariant to ensure that the log start offset is consistently set with the earliest snapshot. Reviewers: dengziming <swzmdeng@163.com>, David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>	4 years ago
Sergio Peña	bf359f8e29	KAFKA-10847: Fix spurious results on left/outer stream-stream joins (#10462 ) Fixes the issue with https://issues.apache.org/jira/browse/KAFKA-10847. To fix the above problem, the left/outer stream-stream join processor uses a buffer to hold non-joined records for some time until the window closes, so they are not processed if a join is found during the join window time. If the window of a record closes and a join was not found, then this should be emitted and processed by the consequent topology processor. A new time-ordered window store is used to temporary hold records that do not have a join and keep the records keys ordered by time. The KStreamStreamJoin has a reference to this new store . For every non-joined record seen, the processor writes it to this new state store without processing it. When a joined record is seen, the processor deletes the joined record from the new state store to prevent further processing. Records that were never joined at the end of the window + grace period are emitted to the next topology processor. I use the stream time to check for the expiry time for determinism results . The KStreamStreamJoin checks for expired records and emit them every time a new record is processed in the join processor. The new state store is shared with the left and right join nodes. The new store needs to serialize the record keys using a combined key of <joinSide-recordKey>. This key combination helps to delete the records from the other join if a joined record is found. Two new serdes are created for this, KeyAndJoinSideSerde which serializes a boolean value that specifies the side where the key is found, and ValueOrOtherValueSerde that serializes either V1 or V2 based on where the key was found. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	4 years ago
A. Sophie Blee-Goldman	3805f3706f	KAFKA-12574: KIP-732, Deprecate eos-alpha and replace eos-beta with eos-v2 (#10573 ) Deprecates the following 1. StreamsConfig.EXACTLY_ONCE 2. StreamsConfig.EXACTLY_ONCE_BETA 3. Producer#sendOffsetsToTransaction(Map offsets, String consumerGroupId) And introduces a new StreamsConfig.EXACTLY_ONCE_V2 config. Additionally, this PR replaces usages of the term "eos-beta" throughout the code with the term "eos-v2" Reviewers: Matthias J. Sax <mjsax@confluent.io>	4 years ago
Colin Patrick McCabe	a8a6952e4a	KAFKA-12471: Implement createPartitions in KIP-500 mode (#10343 ) Implement the createPartitions RPC which adds more partitions to a topic in the KIP-500 controller. Factor out some of the logic for validating manual partition assignments, so that it can be shared between createTopics and createPartitions. Add a startPartition argument to the replica placer. Reviewers: Jason Gustafson <jason@confluent.io>	4 years ago

... 2 3 4 5 6 ...

476 Commits (trunk)