KAFKA-6791: Add CoordinatorNodeProvider in KafkaAdminClient by huxihx · Pull Request #4902 · apache/kafka

huxihx · 2018-04-20T03:41:11Z

KAFKA-6791: Add CoordinatorNodeProvider in KafkaAdminClient
https://issues.apache.org/jira/browse/KAFKA-6791

Add CoordinatorNodeProvider interface to support batch retrieval for group coordinators.

https://issues.apache.org/jira/browse/KAFKA-6791 Add `CoordinatorNodeProvider` interface and its implementor `ConsumerGroupCoordinatorNodeProvider` to support batch retrieval for the group coordinators.

huxihx · 2018-04-20T04:58:08Z

retest it please

huxihx · 2018-04-20T06:01:01Z

@guozhangwang Please kindly review. Thanks

guozhangwang · 2018-04-20T15:55:53Z

cc @hachikuji @cmccabe for reviews as well.

hachikuji · 2018-04-20T20:22:17Z

+            int nodeId = coordinator == null ? -1 : coordinator.id(); // leave null-handling to the next NodeProvider

-            runnable.call(new Call("findCoordinator", deadline, new LeastLoadedNodeProvider()) {
+            runnable.call(new Call("describeConsumerGroups", deadline, new ConstantNodeIdProvider(nodeId)) {


The ideal would be to use the CoordinatorNodeProvider here. There is not much benefit in having it if we just invoke it inline. The problem is that the provide() method is called by the send thread, so we cannot have it block on an operation which itself depends on the send thread. To make it work nicely in this way, we probably need an asynchronous NodeProvider API which effectively lets us chain the DescribeGroup request on to its completion. For example, maybe something like this could work:

interface AsyncNodeProvider { KafkaFuture<Node> provide(); }

cc @cmccabe (who may have some ideas as well)
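To make the chaining idea concrete, here is a minimal sketch, not the AdminClient implementation. It assumes `CompletableFuture` as a stand-in for `KafkaFuture`, and the `Node` class, `coordinatorProvider`, and `describeGroup` are hypothetical names invented for illustration. The follow-up request is attached to the lookup's completion, so the send thread never waits on itself.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class AsyncNodeProviderSketch {
    // Hypothetical stand-in for org.apache.kafka.common.Node.
    static final class Node {
        final int id;
        Node(int id) { this.id = id; }
    }

    // Shape proposed in the review, with CompletableFuture standing in for KafkaFuture.
    interface AsyncNodeProvider {
        CompletableFuture<Node> provide();
    }

    // Hypothetical: pretends a FindCoordinator response is completed on the send thread.
    static AsyncNodeProvider coordinatorProvider(ExecutorService sendThread, int coordinatorId) {
        return () -> CompletableFuture.supplyAsync(() -> new Node(coordinatorId), sendThread);
    }

    // Chains the follow-up request onto the lookup instead of blocking for its result.
    static CompletableFuture<String> describeGroup(AsyncNodeProvider provider, String groupId) {
        return provider.provide()
            .thenApply(node -> "describeConsumerGroups(" + groupId + ") -> node " + node.id);
    }

    public static void main(String[] args) {
        ExecutorService sendThread = Executors.newSingleThreadExecutor();
        try {
            // Only the caller's thread waits here; the send thread stays free.
            String result = describeGroup(coordinatorProvider(sendThread, 3), "my-group").join();
            System.out.println(result);
        } finally {
            sendThread.shutdown();
        }
    }
}
```

The only blocking call, `join()`, runs on the caller's thread, which is the property the review asks for.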

huxihx · 2018-04-23

@hachikuji Correct me if I am wrong. Coordinator lookup must always finish before any group-related task runs, whether we use an async or a sync interface, so we have to wait for it in either case. What we should do is ensure the blocking time is bounded.

cmccabe · 2018-04-23

@hachikuji is correct. We can't do blocking operations in the admin client service thread. We certainly can't do blocking operations that wait for the service thread itself. This will deadlock.

I think it's a good idea to have a coordinator node provider, but we need to build out a little more infrastructure to make it possible. I have a change which should help with that, at #4295
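The deadlock described here can be reproduced outside Kafka with a single-threaded executor. The sketch below is an illustration under that assumption, not AdminClient code: `serviceThread` and `blocksForever` are hypothetical names, and the executor merely plays the role of the admin client service thread.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class ServiceThreadDeadlockSketch {
    // Returns true if the outer task fails to finish within the timeout.
    static boolean blocksForever(long timeoutMs) {
        ExecutorService serviceThread = Executors.newSingleThreadExecutor();
        try {
            Future<Integer> outer = serviceThread.submit(() -> {
                // The inner task is queued behind the task that is waiting for it.
                Future<Integer> inner = serviceThread.submit(() -> 3);
                return inner.get(); // blocks the only thread that could run `inner`
            });
            try {
                outer.get(timeoutMs, TimeUnit.MILLISECONDS);
                return false;
            } catch (TimeoutException e) {
                return true;
            } catch (ExecutionException | InterruptedException e) {
                throw new IllegalStateException(e);
            }
        } finally {
            // Interrupts the stuck task so the JVM can exit.
            serviceThread.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println("deadlocked: " + blocksForever(200));
    }
}
```

A timeout on the inner wait would bound the stall but not remove it: the inner task still cannot run until the outer one gives up the thread.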

huxihx · 2018-04-27

@hachikuji Do you think it's okay to convert this async operation into a sync one, similar to the following:

TopicPartition tp = new TopicPartition("__consumer_offsets", Math.abs(groupID.hashCode() % 50));
return metadata.fetch().leaderFor(tp);

hachikuji · 2018-04-28

It's an interesting thought, but users may override the number of partitions for __consumer_offsets, so I don't think it will work. More generally, we are trying to avoid dependence in the clients on the __consumer_offsets topic since it ties the behavior of the client to what is more properly an implementation detail.
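A small sketch of why a hard-coded partition count breaks. It assumes the group-to-partition rule is the non-negative hash of the group id modulo the topic's partition count; `partitionFor` is a hypothetical helper written for this illustration, and the group id "A" is chosen only because its `hashCode()` is known to be 65.

```java
class OffsetsPartitionSketch {
    // Assumption: non-negative hash of the group id, modulo the topic's partition count.
    static int partitionFor(String groupId, int partitionCount) {
        int hash = groupId.hashCode();
        int nonNegative = (hash == Integer.MIN_VALUE) ? 0 : Math.abs(hash);
        return nonNegative % partitionCount;
    }

    public static void main(String[] args) {
        String groupId = "A"; // "A".hashCode() is 65

        // A client that hard-codes 50 partitions.
        int assumed = partitionFor(groupId, 50);
        // A cluster where the operator configured 100 partitions instead.
        int actual = partitionFor(groupId, 100);

        System.out.println("client assumes partition " + assumed);
        System.out.println("broker uses partition " + actual);
        System.out.println("same partition: " + (assumed == actual));
    }
}
```

When the two partitions differ, the leader the client computes need not be the group's coordinator, which is why the lookup has to go through a request to the cluster rather than through client-side arithmetic.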

huxihx · 2018-05-02T11:21:40Z

@hachikuji Please review again. Thanks.

huxihx · 2018-05-06T11:45:24Z

retest it please

cmccabe · 2018-05-15T17:21:41Z

@huxihx: thanks for the PR. I don't think this is needed any more, though, now that we merged KAFKA-6299.

huxihx · 2018-05-22T00:42:06Z

@cmccabe Sorry for the late response. Okay, will cancel this PR soon.

huxihx · 2018-05-22T01:03:07Z

Closed this PR since it was already fixed by KAFKA-6299.

huxi-2b added 2 commits April 20, 2018 11:28

KAFKA-6791: Add a CoordinatorNodeProvider in KafkaAdminClient

a9a89c8

Batch retrieval coordinators for multiple groups ahead of time

151d777

guozhangwang requested a review from hachikuji April 20, 2018 15:55

hachikuji reviewed Apr 20, 2018

huxi-2b added 3 commits April 27, 2018 11:28

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-6791

22abb5b

Merge remote-tracking branch 'upstream/trunk' into KAFKA-6791

7cbb7d7

addressed Jason's comments to enable an async node provider.

c9a98aa

huxihx closed this May 22, 2018

jeffwidman mentioned this pull request Sep 17, 2020

Feature delete consumergroups dpkp/kafka-python#2040

Merged

Conversation

huxihx commented Apr 20, 2018

huxihx commented Apr 20, 2018

huxihx commented Apr 20, 2018

guozhangwang commented Apr 20, 2018

hachikuji Apr 20, 2018

huxihx Apr 23, 2018

cmccabe Apr 23, 2018

huxihx Apr 27, 2018

hachikuji Apr 28, 2018

huxihx commented May 2, 2018

huxihx commented May 6, 2018

cmccabe commented May 15, 2018

huxihx commented May 22, 2018

huxihx commented May 22, 2018

5 participants