src-kafka

Commit Graph

Author	SHA1	Message	Date
maniekes	987609404c	KAFKA-15685: Add support for MinGW and MSYS2 (windows OS) (#13321 ) Kafka class runner does not work with MINGW/Git Bash on Windows. This commit adds support for MinGW and MSYS2 development environments. Reviewers: Divij Vaidya <diviv@amazon.com>	1 year ago
Owen Leung	9989b68d0d	KAFKA-15200: Add pre-requisite check in release.py (#14636 ) Reviewers: Divij Vaidya <diviv@amazon.com>	1 year ago
Gaurav Narula	abd104a606	MINOR: avoid blocking for randomness in DefaultRecordBatchTest (#14625 ) Using `SecureRandom.getInstanceStrong()` results in using `/dev/random` which is known to block in Linux when the OS runs low on entropy. This was noticable when running tests in containerised CI environments. This commit avoids using a CSPRNG altogether since the tests do not need cryptographically secure random numbers. Reviewers: Divij Vaidya <diviv@amazon.com>, Igor Soarez <soarez@apple.com> --------- Co-authored-by: Igor Soarez <soarez@apple.com>	1 year ago
hudeqi	b559942c17	KAFKA-15671: Fix flaky test RemoteIndexCacheTest.testClearCacheAndIndexFilesWhenResizeCache (#14622 ) Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>	1 year ago
dengziming	03ea24aa1d	MINOR: Fix flaky testFollowerCompleteDelayedFetchesOnReplication (#14616 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	1 year ago
Kirk True	2b233bfa5f	KAFKA-14274 [6, 7]: Introduction of fetch request manager (#14406 ) Changes: 1. Introduces FetchRequestManager that implements the RequestManager API for fetching messages from brokers. Unlike Fetcher, record decompression and deserialization is performed on the application thread inside CompletedFetch. 2. Restructured the code so that objects owned by the background thread are not instantiated until the background thread runs (via Supplier) to ensure that there are no references available to the application thread. 3. Ensuring resources are properly using Closeable and using IdempotentCloser to ensure they're only closed once. 4. Introduces ConsumerTestBuilder to reduce a lot of inconsistency in the way the objects were built up for tests. Reviewers: Philip Nee <pnee@confluent.io>, Lianet Magrans <lianetmr@gmail.com>, Jun Rao<junrao@gmail.com>	1 year ago
Lucas Brutschy	d144b7ee38	KAFKA-15326: [10/N] Integrate processing thread (#14193 ) - Introduce a new internal config flag to enable processing threads - If enabled, create a scheduling task manager inside the normal task manager (renamings will be added on top of this), and use it from the stream thread - All operations inside the task manager that change task state, lock the corresponding tasks if processing threads are enabled. - Adds a new abstract class AbstractPartitionGroup. We can modify the underlying implementation depending on the synchronization requirements. PartitionGroup is the unsynchronized subclass that is going to be used by the original code path. The processing thread code path uses a trivially synchronized SynchronizedPartitionGroup that uses object monitors. Further down the road, there is the opportunity to implement a weakly synchronized alternative. The details are complex, but since the implementation is essentially a queue + some other things, it should be feasible to implement this lock-free. - Refactorings in StreamThreadTest: Make all tests use the thread member variable and add tearDown in order avoid thread leaks and simplify debugging. Make the test parameterized on two internal flags: state updater enabled and processing threads enabled. Use JUnit's assume to disable all tests that do not apply. Enable some integration tests with processing threads enabled. Reviewer: Bruno Cadonna <bruno@confluent.io>	1 year ago
Nikolay	e0121a38b1	MINOR: Deduplicating ConsumerGroupCommand print formating (#14610 ) ConsumerGroupCommand contains code duplications for table row format. This PR reduces code duplication and make it more clear and easy to understand. Reviewers: Luke Chen <showuon@gmail.com>, hudeqi <1217150961@qq.com>	1 year ago
Jotaniya Jeel	4612fe42af	KAFKA-15481: Fix concurrency bug in RemoteIndexCache (#14483 ) RemoteIndexCache has a concurrency bug which leads to IOException while fetching data from remote tier. The bug could be reproduced as per the following order of events:- Thread 1 (cache thread): invalidates the entry, removalListener is invoked async, so the files have not been renamed to "deleted" suffix yet. Thread 2: (fetch thread): tries to find entry in cache, doesn't find it because it has been removed by 1, fetches the entry from S3, writes it to existing file (using replace existing) Thread 1: async removalListener is invoked, acquires a lock on old entry (which has been removed from cache), it renames the file to "deleted" and starts deleting it Thread 2: Tries to create in-memory/mmapped index, but doesn't find the file and hence, creates a new file of size 2GB in AbstractIndex constructor. JVM returns an error as it won't allow creation of 2GB random access file. This commit fixes the bug by using EvictionListener instead of RemovalListener to perform the eviction atomically with the file rename. It handles the manual removal (not handled by EvictionListener) by using computeIfAbsent() and enforcing atomic cache removal & file rename. Reviewers: Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Arpit Goyal <goyal.arpit.91@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	1 year ago
Mickael Maison	8b9f6d17f2	KAFKA-15093: Add 3.5 Streams upgrade system tests (#14602 ) Reviewers: Matthias J. Sax <mjsax@apache.org>	1 year ago
shuoer86	27a155c80a	MINOR: Fix typos in build.gradle, tests and trogdor (#14574 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, hudeqi <1217150961@qq.com>	1 year ago
vamossagar12	1a3aca305e	KAFKA-15457: Add support for OffsetFetch version 9 in admin client (#14611 ) This patch adds support for OffsetFetch version 9 in the admin client. It mainly allows handling two new error codes `STALE_MEMBER_EPOCH` and `UNKNOWN_MEMBER_ID` introduced as part of KIP-848. Reviewers: David Jacot <djacot@confluent.io>	1 year ago
Mickael Maison	9c77c17c4e	KAFKA-15664: Add 3.4 Streams upgrade system tests (#14601 ) Reviewers: Luke Chen <showuon@gmail.com>, Matthias J. Sax <mjsax@apache.org>	1 year ago
Gantigmaa Selenge	84a58d75bb	KAFKA-15566: Fix test FetchRequestTest.testLastFetchedEpochValidation for KRaft mode (#14563 ) Fix test FetchRequestTest.testLastFetchedEpochValidation for KRaft mode The test fails due to unexpected error (OFFSET_OUT_OF_RANGE) when enabled with KRaft mode. The reason it takes longer to set the leader epoch in KRaft mode is because of the way the topic partitions are created differently than Zookeeper. In Zookeeper mode, we create the topic partitions directly with Zookeeper therefore seem to take less time to create the logs and set leader epoch on broker. In KRaft mode, we use Admin client to create topic partitions. Even though the test waits for topic partitions to get created and appear in metadata cache, it doesn’t seem to be sufficient time for leader epoch to get set on the brokers. Reviewers: Luke Chen <showuon@gmail.com>, dengziming <dengziming1993@gmail.com>	1 year ago
Greg Harris	ffcb6d4a1a	KAFKA-14767: Fix missing commitId build error after git gc (#13315 ) git gc moves commit hashes from individual .git/refs/heads/ to .git/packed-refs which is not read by the determineCommitId function. Replace the existing lookup within the .git directory with a GrGit lookup that handles packed and unpacked refs transparently. Reviewers: Ismael Juma <ismael@juma.me.uk>	1 year ago
Matthias J. Sax	4371214fbe	KAFKA-15378: fix streams upgrade system test (#14539 ) Fixing bad test setup. We tried to fix an upgrade bug for FK-joins in 3.1 release, but it later turned out that the PR was not sufficient to fix it. We finally fixed in 3.4 release. This PR updates the system test matrix to only test working versions with FK-joins, limited to available test versions. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Hao Li <hli@confluent.io>, Mickael Maison <mickael.maison@gmail.com>	1 year ago
Justine Olshan	e8c8969330	KAFKA-15626: Replace verification guard object with an specific type (#14568 ) I've added a new class with an incrementing atomic long to represent the verification guard. Upon creation of verification guard, we will increment this value and assign it to the guard. The expected behavior is the same as the object guard, but with better debuggability with the string value and type safety (I found a type safety issue in the current code when implementing this) Reviewers: Ismael Juma <ismael@juma.me.uk>, Artem Livshits <alivshits@confluent.io>	1 year ago
Josep Prat	eed5e68880	MINOR: Server-Commons cleanup (#14572 ) MINOR: Server-Commons cleanup Fixes Javadoc and minor issues in the Java files of Server-Commons modules. Javadoc is now formatted as intended by the author of the doc itself. Signed-off-by: Josep Prat <josep.prat@aiven.io> Reviewers: Mickael Maison <mickael.maison@gmail.com>	1 year ago
hudeqi	4083cd627e	KAFKA-15607: Fix NPE in MirrorCheckpointTask::syncGroupOffset (#14587 ) Reviewers: Chris Egerton <chrise@aiven.io>	1 year ago
Christo Lolov	b5ec6e8a0d	KAFKA-14133: Move RocksDBGenericOptionsToDbOptionsColumnFamilyOptionsAdapterTest to Mockito (#14586 ) Reviewers: Divij Vaidya <diviv@amazon.com>	1 year ago
Chris Egerton	091eb9b349	KAFKA-15428: Cluster-wide dynamic log adjustments for Connect (#14538 ) Reviewers: Greg Harris <greg.harris@aiven.io>, Yang Yang <yayang@uber.com>, Yash Mayya <yash.mayya@gmail.com>	1 year ago
Philip Nee	c81a725219	KAFKA-15534: Inject request completion time when the request failed (#14532 ) Currently, we aren't able to access the request completion time if the request is completed exceptionally, which results in many system calls. This is not ideal because these system calls can add up. Instead, time is already retrieved on the top of the background thread event loop, which is then propagated into the NetworkClientDelegate.poll. In this PR - I store the completion time in the handler, so that it becomes accessible in the callbacks. Reviewer: Bruno Cadonna <cadonna@apache.org>	1 year ago
hudeqi	21ebbe6b28	MINOR:Remove unused method parameter in ConsumerGroupCommand (#14585 ) In ConsumerGroupCommand, there are two methods: getLogEndOffsets and getLogStartOffsets, the first parameter groupId is not used, so remove it. Reviewers: Luke Chen <showuon@gmail.com>	1 year ago
Gantigmaa Selenge	486d5f6c64	KAFKA-15566: Fix flaky tests in FetchRequestTest.scala in KRaft mode (#14573 ) Fixed some of the failing tests in FetchRequestTest. testFetchWithPartitionsWithIdError and testCreateIncrementalFetchWithPartitionsInErrorV12 fail with the following error when enabled with KRaft mode. These tests only fail sometimes when running locally but consistently failed when running in the Jenkins Pipeline. Tests will call the utility function TestUtils.waitUntilLeaderIsKnown after creating the topic partitions so that they wait for the logs to be created on the leader before sending fetch requests. Enabled all tests except checkLastFetchedEpochValidation with KRaft mode. Looking at the build history in Jenkins, all the other tests except these 2 tests and checkLastFetchedEpochValidation were passing when they were enabled with KRaft mode. Therefore enabled them with KRaft mode again but left checkLastFetchedEpochValidation to be investigated further. Reviewers: Luke Chen <showuon@gmail.com>, dengziming <dengziming1993@gmail.com>	1 year ago
Calvin Liu	af747fbfed	KAFKA-15581: Introduce ELR (#14312 ) This patch introduces preliminary changes for Eligible Leader Replicas (KIP-966) * New MetadataVersion 16 (3.7-IV1) * New record versions for PartitionRecord and PartitionChangeRecord * New tagged fields on PartitionRecord and PartitionChangeRecord * New static config "eligible.leader.replicas.enable" to gate the whole feature Reviewers: Artem Livshits <alivshits@confluent.io>, David Arthur <mumrah@gmail.com>, Colin P. McCabe <cmccabe@apache.org>	1 year ago
Calvin Liu	14029e2ddd	KAFKA-15582: Identify clean shutdown broker (#14465 ) The PR includes: * Added a new class of CleanShutdownFile which helps write and read from a clean shutdown file. * Updated the BrokerRegistration API. * Client side handling for the broker epoch. * Minimum work on the controller side. Reviewers: Jun Rao <junrao@gmail.com>	1 year ago
Hanyu Zheng	bbdf6de88a	KAFKA-15527: Add reverseRange and reverseAll query over kv-store in IQv2 (#14477 ) Implements KIP-985. Reviewers: Matthias J. Sax <matthias@confluent.io>	1 year ago
Apoorv Mittal	36abc8dcea	KAFKA-15604: Telemetry API request and response schemas and classes (KIP-714) (#14554 ) Initial PR for [KIP-714](https://cwiki.apache.org/confluence/display/KAFKA/KIP-714%3A+Client+metrics+and+observability) - [KAFKA-15601](https://issues.apache.org/jira/browse/KAFKA-15601). This PR defines json request and response schemas for the new Telemetry APIs and implements the corresponding java classes. Reviewers: Andrew Schofield <andrew_schofield@uk.ibm.com>, Kirk True <ktrue@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Walker Carlson <wcarlson@apache.org>	1 year ago
vamossagar12	8f3731e2bd	KAFKA-15454: Add support for OffsetCommit version 9 in admin client (#14571 ) This patch adds support for OffsetCommit version 9 in the admin client. It mainly allows handling two new error codes `STALE_MEMBER_EPOCH` and `GROUP_ID_NOT_FOUND ` introduced as part of KIP-848. Reviewers: David Jacot <djacot@confluent.io>	1 year ago
Apoorv Mittal	26aa353dc1	KAFKA-15616: Client telemetry states and transition (KIP-714) (#14566 ) Part of KIP-714. Reviewers: Andrew Schofield <aschofield@confluent.io>, Philip Nee <pnee@confluent.io>, Kirk True <ktrue@confluent.io>, Walker Carlson <wcarlson@confluent.io>, Matthias J. Sax <matthias@confluent.io>	1 year ago
Apoorv Mittal	78166101eb	KAFKA-15613: Client API definition and configurations (KIP-714) (#14560 ) Part of KIP-714. Reviewers: Andrew Schofield <aschofield@confluent.io>, Walker Carlson <wcarlson@confluent.io>, Matthias J. Sax <matthias@confluent.io>	1 year ago
Matthias J. Sax	72fdd9f62a	MINOR: add KIP-941 to Kafka Streams upgrade docs (#14577 ) Reviewers: Hao Li <hli@confluent.io>, Walker Carlson <wcarlson@confluent.io>, Bill Bejeck <bill@confluent.io>	1 year ago
Lianet Magrans	48449b68fd	KAFKA-15554: Client state changes for handling one assignment at a time & minor improvements (#14413 ) This patch includes: - target assignment changes : accepting only one at a time according to the updated protocol. - changes for error handling, leaving responsibility in the heartbeatManager and exposing only the functionality for when the state needs to be updated (on successful HB, on fencing, on fatal failure) - allow transitions for failures when joining - tests & minor improvements/fixes addressing initial version review Reviewers: Kirk True <ktrue@confluent.io>, Philip Nee <pnee@confluent.io>, David Jacot <djacot@confluent.io>	1 year ago
Arpit Goyal	dc6a53e196	MINOR: Rename lock variable of the entry class (#14569 ) The RemoteIndexCache has a variable lock and the child class also have a variable lock in the same class file. Renaming lock of the entry(child class) to avoid confusion. Reviewers: Luke Chen <showuon@gmail.com>, hudeqi <1217150961@qq.com>	1 year ago
Mickael Maison	8aee297669	MINOR: Various Java cleanups in core (#14561 ) Reviewers: Josep Prat <josep.prat@aiven.io>	1 year ago
Matthias J. Sax	9b468fb278	MINOR: Do not end Javadoc comments with `**/` (#14540 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, Bill Bejeck <bill@confluent.io>, Hao Li <hli@confluent.io>, Josep Prat <josep.prat@aiven.io>	1 year ago
Jeff Kim	abee8f711c	KAFKA-14519; [1/N] Implement coordinator runtime metrics (#14417 ) Implements the following metrics: kafka.server:type=group-coordinator-metrics,name=num-partitions,state=loading kafka.server:type=group-coordinator-metrics,name=num-partitions,state=active kafka.server:type=group-coordinator-metrics,name=num-partitions,state=failed kafka.server:type=group-coordinator-metrics,name=event-queue-size kafka.server:type=group-coordinator-metrics,name=partition-load-time-max kafka.server:type=group-coordinator-metrics,name=partition-load-time-avg kafka.server:type=group-coordinator-metrics,name=thread-idle-ratio-min kafka.server:type=group-coordinator-metrics,name=thread-idle-ratio-avg The PR makes these metrics generic so that in the future the transaction coordinator runtime can implement the same metrics in a similar fashion. Also, CoordinatorLoaderImpl#load will now return LoadSummary which encapsulates the start time, end time, number of records/bytes. Co-authored-by: David Jacot <djacot@confluent.io> Reviewers: Ritika Reddy <rreddy@confluent.io>, Calvin Liu <caliu@confluent.io>, David Jacot <djacot@confluent.io>, Justine Olshan <jolshan@confluent.io>	1 year ago
Lucas Brutschy	e7e399b940	MINOR: allow removing a suspended task from task registry. (#14555 ) When we get a suspended task re-assigned in the eager rebalance protocol, we have to add the task back to the state updater so that it has a chance to catch up with its change log. This was prevented by a check in Tasks, which disallows removing SUSPENDED tasks from the task registry. I couldn't find a reason why this must be an invariant of the task registry, so this weakens the check. The error happens in the integration between TaskRegistry and TaskManager. However, this change anyway adds unit tests to more closely specify the intended behavior of the two modules. Reviewers: Bruno Cadonna <bruno@confluent.io>	1 year ago
Mickael Maison	9d04c7a045	MINOR: Various Scala cleanups in core (#14558 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	1 year ago
Omnia G.H Ibrahim	9af1e74b5e	KAFKA-14596: Move TopicCommand to tools (#13201 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>	1 year ago
Ismael Juma	69e591db3a	MINOR: Rewrite/Move KafkaNetworkChannel to the `raft` module (#14559 ) This is now possible since `InterBrokerSend` was moved from `core` to `server-common`. Also rewrite/move `KafkaNetworkChannelTest`. The scala version of `KafkaNetworkChannelTest` passed with the changes here (before I deleted it). Reviewers: Justine Olshan <jolshan@confluent.io>, José Armando García Sancio <jsancio@users.noreply.github.com>	1 year ago
Luke Chen	7376d2c5b1	MINOR: add quick start for tiered storage feature (#14528 ) Some users complained they don't have a way to determine if there is something wrong in the RSM plug-in they implemented, or there's something wrong in Kafka itself. Also, if there are users who just want to try the tiered storage feature out before implementing anything, it would be good we have an RSM implementation by default. Per the discussion in the KIP, there will be no default RSM implementation in Kafka, but we can use the LocalTieredStorage implemented for integration test, to resolve the issues above. Reviewers: Christo Lolov <lolovc@amazon.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Satish Duggana <satishd@apache.org>	1 year ago
Hanyu Zheng	732bffcae6	KAFKA-15569: test and add test cases in IQv2StoreIntegrationTest (#14523 ) Reviewers: Matthias J. Sax <matthias@confluent.io>	1 year ago
mannoopj	da314ee48c	KAFKA-15532: non active controllers return 0 for ZkWriteBeforelag (#14478 ) Since only the active controller is performing the dual-write to ZK during a migration, it should be the only controller to report the ZkWriteBehindLag metric. Currently, if the controller fails over during a migration, the previous active controller will incorrectly report its last value for ZkWriteBehindLag forever. Instead, it should report zero. Reviewers: Colin P. McCabe <cmccabe@apache.org>, David Arthur <mumrah@gmail.com>	1 year ago
dengziming	5c9db5e735	KAFKA-15390: Do not return fenced broker in FetchResponse.preferredReplica (#14272 ) Do not return fenced brokers from metadataCache.getPartitionReplicaEndpoints, since that could lead to them getting used as preferred read replicas. Reviewers: Colin P. McCabe <cmccabe@apache.org>	1 year ago
Ismael Juma	1073d434ec	KAFKA-14481: Move LogSegment/LogSegments to storage module (#14529 ) A few notes: * Delete a few methods from `UnifiedLog` that were simply invoking the related method in `LogFileUtils` * Fix `CoreUtils.swallow` to use the passed in `logging` * Fix `LogCleanerParameterizedIntegrationTest` to close `log` before reopening * Minor tweaks in `LogSegment` for readability For broader context on this change, please check: * KAFKA-14470: Move log layer to storage module Reviewers: Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>	1 year ago
bachmanity1	eb187745cd	MINOR: Fix docs for ReplicationBytes(Out\|In)PerSec metrics (#14228 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Taras Ledkov	1 year ago
Hector Geraldino	4150595b0a	KAFKA-14684: Replace EasyMock/PowerMock with Mockito in WorkerSinkTaskThreadedTest (#14505 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Christo Lolov <christololov@gmail.com>	1 year ago
hudeqi	b0b8693c72	KAFKA-15536: Dynamically resize remoteIndexCache (#14511 ) Dynamically resize remoteIndexCache Reviewers: Christo Lolov <lolovc@amazon.com>, Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	1 year ago
Matthias J. Sax	d4c661c017	MINOR: cleanup warnings in Kafka Streams code base (#14549 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, A. Sophie Blee-Goldman <sophie@responsive.dev>	1 year ago

1 2 3 4 5 ...

11825 Commits (987609404c1d3f65f4d0cc642982b898c28121ac) All Branches Search

11825 Commits (987609404c1d3f65f4d0cc642982b898c28121ac)

All Branches