KAFKA-18230: handle not controller or not leader error in admin client by showuon · Pull Request #18165 · apache/kafka

showuon · 2024-12-13T10:27:05Z

When admin client starts up, it'll get the metadata of the cluster. And when the admin client sends request directly to the controller (via --bootstrap-controller), it'll send the request to the active controller. But if there is a leadership change in the controller after the metadata request and before the target request sent, the request will fail immediately with NOT_CONTROLLER error or NOT_LEADER_OR_FOLLOWER error. It's because the requests that need metadata log change must need to do on the active controller. Instead of failing immediately, the admin client should catch the error and retry the metadata update to send the request again. Note, in some application, the admin client could exist for a long time to send multiple requests when needed, this case could happen more often.

Take describeMetadataQuorum for example, we'll use LeastLoadedBrokerOrActiveKController(here) to get the active controller via describeCluster/Metadata API, then send describeMetadataQuorum to the active controller. You can see, it's possible that the active controller changed right after describeCluster/Metadata call and before describeMetadataQuorum call, and even worse if the application creates an long running adminClient to handle any calls in the lifecycle. And you can see how we handle the describeMetadataQuorum response here. We don't handle NOT_CONTROLLER nor NOT_LEADER_OR_FOLLOWER errors. The error response will look like this:

DescribeQuorumResponseData(errorCode=0, errorMessage='', topics=[TopicData(topicName='__cluster_metadata', partitions=[PartitionData(partitionIndex=0, errorCode=6, errorMessage='For requests intended only for the leader, this error indicates that the broker is not the current leader. For requests intended for any replica, this error indicates that the broker is not a replica of the topic partition.', leaderId=0, leaderEpoch=0, highWatermark=0, currentVoters=[], observers=[])])], nodes=[])

Since the NOT_LEADER_OR_FOLLOWER and NOT_CONTROLLER are both retriable errors, when receiving them, the admin client will keep retrying until request time out or metadata expired in config metadata.max.age.ms (default is 5 mins).

Comparably, when we invoke createTopic, deleteTopic, alterPartitionReassignment, ... we'll invoke handleNotControllerError to handle controller change, because we know when brokers receive these calls, they will forward to the active controller, but when there's controller leadership change, we need to re-fetch the metadata and retry.

Please note that when talking directly to the controller, we might get NOT_LEADER_OR_FOLLOWER( ex: here) because in the controller quorum's perspective, this controller is not a leader. That's why I added this in handleNotControllerError:
metadataManager.usingBootstrapControllers() && response.errorCounts().containsKey(Errors.NOT_LEADER_OR_FOLLOWER))

I think this is just a miss when we were implementing KIP-919: Allow AdminClient to Talk Directly with the KRaft Controller Quorum. And that's why in this PR, I handled NOT_CONTROLLER and NOT_LEADER_OR_FOLLOWER not just for describeMetadataQuorum, but also for the requests that talk to controller directly, and the request must need the active controller to handle (ex: controller will modify the metadata log, requesting raft update,...). The APIs are:

createAcls
deleteAcls
alterConfigs
describeMetadataQuorum
addRaftVoter
removeRaftVoter

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

mumrah · 2024-12-13T14:19:04Z

@showuon #17881 adds a "triage" label to PRs from non-committers. Turns out this also affect committers if their membership visibility in the ASF GitHub org is not public. I added instructions for setting your membership visibility to public https://github.com/apache/kafka/blob/trunk/.github/workflows/README.md#pr-triage

showuon · 2025-01-16T09:59:15Z

@chia7712 @cmccabe , could you help take a look? Thanks.

mimaison · 2025-01-20T10:58:22Z

@dajac @jolshan Can you take a look?
I think it's a good candidate for 4.0 too, as this really impacts Admin clients when the controller quorum rolls.

dajac · 2025-01-23T13:52:47Z

@showuon @mimaison Thanks for the patch. I'd like to better understand the impact. My understanding is that the controller is not cleared on NOT_CONTROLLER or NOT_LEADER_OR_FOLLOWER errors and hence the admin client can no longer communicate to the active controller because it keeps sending requests to the old one. Is my understanding correct?

showuon · 2025-01-24T06:34:19Z

@showuon @mimaison Thanks for the patch. I'd like to better understand the impact. My understanding is that the controller is not cleared on NOT_CONTROLLER or NOT_LEADER_OR_FOLLOWER errors and hence the admin client can no longer communicate to the active controller because it keeps sending requests to the old one. Is my understanding correct?

@dajac , sorry that I didn't make it clear in the description. Yes, you're right, the root cause is that the admin client didn't handle the NOT_CONTROLLER or NOT_LEADER_OR_FOLLOWER error when talking directly to the controller, which causes the admin client can only get the exception.

Take describeMetadataQuorum for example, we'll use LeastLoadedBrokerOrActiveKController(here) to get the active controller via describeCluster/Metadata API, then send describeMetadataQuorum to the active controller. You can see, it's possible that the active controller changed right after describeCluster/Metadata call and before describeMetadataQuorum call, and even worse if the application creates an long running adminClient to handle any calls in the lifecycle. And you can see how we handle the describeMetadataQuorum response here. We don't handle NOT_CONTROLLER nor NOT_LEADER_OR_FOLLOWER errors. The error response will look like this:

DescribeQuorumResponseData(errorCode=0, errorMessage='', topics=[TopicData(topicName='__cluster_metadata', partitions=[PartitionData(partitionIndex=0, errorCode=6, errorMessage='For requests intended only for the leader, this error indicates that the broker is not the current leader. For requests intended for any replica, this error indicates that the broker is not a replica of the topic partition.', leaderId=0, leaderEpoch=0, highWatermark=0, currentVoters=[], observers=[])])], nodes=[])

Since the NOT_LEADER_OR_FOLLOWER and NOT_CONTROLLER are both retriable errors, when receiving them, the admin client will keep retrying until request time out or metadata expired in config metadata.max.age.ms (default is 5 mins).

Comparably, when we invoke createTopic, deleteTopic, alterPartitionReassignment, ... we'll invoke handleNotControllerError to handle controller change, because we know when brokers receive these calls, they will forward to the active controller, but when there's controller leadership change, we need to re-fetch the metadata and retry.

Please note that when talking directly to the controller, we might get NOT_LEADER_OR_FOLLOWER( ex: here) because in the controller quorum's perspective, this controller is not a leader. That's why I added this in handleNotControllerError:
metadataManager.usingBootstrapControllers() && response.errorCounts().containsKey(Errors.NOT_LEADER_OR_FOLLOWER))

I think this is just a miss when we were implementing KIP-919: Allow AdminClient to Talk Directly with the KRaft Controller Quorum. And that's why in this PR, I handled NOT_CONTROLLER and NOT_LEADER_OR_FOLLOWER not just for describeMetadataQuorum, but also for the requests that supports to talk to controller directly, and the request must need the active controller to handle (ex: controller will modify the metadata log, requesting raft update,...). The APIs are:

createAcls
deleteAcls
alterConfigs
describeMetadataQuorum
addRaftVoter
removeRaftVoter

Hope that's clear. I've also updated the PR description. Thanks.

dajac · 2025-01-30T10:17:48Z

@showuon Thanks for the explanation. I am OK with getting this one into 4.0. However, I don't have time for reviewing it. @mimaison Could you review it?

mimaison · 2025-01-31T07:44:52Z

I'll try to take a look today. @AndrewJSchofield if you have time, can you take a look too? Thanks

mimaison

LGTM

jolshan · 2025-01-31T18:07:52Z

Sorry, for some reason I only get tags when someone approves/merges the PR. Thanks for taking a look @mimaison!

chia7712

@showuon thanks for this patch, and I have only some small comment. otherwise, LGTM

chia7712 · 2025-01-31T18:11:58Z

+        // When sending requests directly to the follower controller, it might return NOT_LEADER_OR_FOLLOWER error.
+        if (response.errorCounts().containsKey(Errors.NOT_CONTROLLER) ||
+                metadataManager.usingBootstrapControllers() && response.errorCounts().containsKey(Errors.NOT_LEADER_OR_FOLLOWER)) {
            handleNotControllerError(Errors.NOT_CONTROLLER);


Should we pass NOT_LEADER_OR_FOLLOWER instead of NOT_CONTROLLER when it encounters the error NOT_LEADER_OR_FOLLOWER?

Agree. Updated.

chia7712 · 2025-01-31T18:19:00Z

        AdminMetadataManager metadataManager = new AdminMetadataManager(new LogContext(),
                adminClientConfig.getLong(AdminClientConfig.RETRY_BACKOFF_MS_CONFIG),
-                adminClientConfig.getLong(AdminClientConfig.METADATA_MAX_AGE_CONFIG), false);
+                adminClientConfig.getLong(AdminClientConfig.METADATA_MAX_AGE_CONFIG), usingBootstrapController);


we can replace usingBootstrapController by config.containsKey(AdminClientConfig.BOOTSTRAP_CONTROLLERS_CONFIG) to streamline it

Good suggestion. Updated.

#18165) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>

mimaison · 2025-02-04T16:05:31Z

Applied to 4.0 too: 8026d6b

apache#18165) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>

KAFKA-18230: handle not controller or not leader error in admin client

d270ae3

github-actions bot added triage PRs from the community core Kafka Broker clients labels Dec 13, 2024

This comment was marked as outdated.

Sign in to view

github-actions bot added the needs-attention label Dec 21, 2024

mumrah removed triage PRs from the community needs-attention labels Dec 23, 2024

mimaison approved these changes Jan 31, 2025

View reviewed changes

chia7712 reviewed Jan 31, 2025

View reviewed changes

KAFKA-18230: refactor

a92f45b

chia7712 approved these changes Feb 4, 2025

View reviewed changes

mimaison merged commit 612e129 into apache:trunk Feb 4, 2025

mimaison pushed a commit that referenced this pull request Feb 4, 2025

KAFKA-18230: Handle not controller or not leader error in admin client (

8026d6b

#18165) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>

pdruley pushed a commit to pdruley/kafka that referenced this pull request Feb 12, 2025

KAFKA-18230: Handle not controller or not leader error in admin client (

c7129ae

apache#18165) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>

manoj-mathivanan pushed a commit to manoj-mathivanan/kafka that referenced this pull request Feb 19, 2025

KAFKA-18230: Handle not controller or not leader error in admin client (

9ef7928

apache#18165) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>

Conversation

showuon commented Dec 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Committer Checklist (excluded from commit message)

Uh oh!

mumrah commented Dec 13, 2024

Uh oh!

This comment was marked as outdated.

showuon commented Jan 16, 2025

Uh oh!

mimaison commented Jan 20, 2025

Uh oh!

dajac commented Jan 23, 2025

Uh oh!

showuon commented Jan 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dajac commented Jan 30, 2025

Uh oh!

mimaison commented Jan 31, 2025

Uh oh!

mimaison left a comment

Choose a reason for hiding this comment

Uh oh!

jolshan commented Jan 31, 2025

Uh oh!

chia7712 left a comment

Choose a reason for hiding this comment

Uh oh!

chia7712 Jan 31, 2025

Choose a reason for hiding this comment

Uh oh!

showuon Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

chia7712 Jan 31, 2025

Choose a reason for hiding this comment

Uh oh!

showuon Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

mimaison commented Feb 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

showuon commented Dec 13, 2024 •

edited

Loading

showuon commented Jan 24, 2025 •

edited

Loading