KAFKA-12257: Consumer mishandles topics deleted and recreated with the same name (trunk) by jolshan · Pull Request #11004 · apache/kafka

jolshan · 2021-07-08T21:37:32Z

Trunk version of #10952

This PR slightly cleans up some of the changes made in #9944

Store topic ID info in consumer metadata. We will always take the topic ID from the latest metadata response and remove any topic IDs from the cache if the metadata response did not return a topic ID for the topic.

With the addition of topic IDs, when we encounter a new topic ID (recreated topic) we can choose to get the topic's metadata even if the epoch is lower than the deleted topic.

The idea is that when we update from no topic IDs to using topic IDs, we will not count the topic as new (It could be the same topic but with a new ID). We will only take the update if the topic ID changed.

Added tests for this scenario as well as some tests for storing the topic IDs. Also added tests for topic IDs in metadata cache.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

jolshan · 2021-07-08T21:39:22Z

                }

-                builder.add(partition, topicIds.getOrDefault(partition.topic(), Uuid.ZERO_UUID), new FetchRequest.PartitionData(position.offset,
+                Uuid topicId = metadata.topicId(partition.topic());


One change we are making for this PR is to just get the topic ID for a single provided topic name. I want to double check that the metadata (and underlying map) can not change when adding these partitions to the builder since the builder assumes IDs do not change.

For my understanding -- we won't update the metadata during this method, correct? Or is there something like another thread that could update it?

It could be updated in a separate thread. I cannot see how that would be a problem though. We do have synchronization in Metadata.

It would have been a problem before KAFKA-13111 when we assumed only one topic ID per build for a given topic name (we had a mapping), but maybe it is ok now that we store the ID in the data and use it to build the request.

Apologies. I was being a bit slow here. I had not considered the possibility of the id of a given topic changing while we were building the fetch request. I had forgotten that the fetch builder logic does allow the same topic to be included multiple times. It do agree that it is probably better to not allow this. So reverting this change makes sense.

jolshan · 2021-11-11T00:30:34Z

I pushed some of the changes that I missed from the 3.0 branch. We'll see how the build goes. Tests seemed to look ok for me locally.

hachikuji · 2021-11-15T20:44:15Z

                }

-                builder.add(partition, topicIds.getOrDefault(partition.topic(), Uuid.ZERO_UUID), new FetchRequest.PartitionData(position.offset,
+                Uuid topicId = metadata.topicId(partition.topic());


It could be updated in a separate thread. I cannot see how that would be a problem though. We do have synchronization in Metadata.

hachikuji · 2021-11-17T01:46:03Z

-     * @return the topic ID for the given topic name or null if the ID does not exist or is not known
+     * @return a mapping from topic names to topic IDs for all topics with valid IDs in the cache
     */
-    public synchronized Uuid topicId(String topicName) {


Any harm keeping this one? Seems like it simplified some of the uses, especially in tests.

Hmmm...this is the version that could have a new cache value? Only thing I might worry about is misuse.

hachikuji

LGTM. Thanks for the patch!

…e same name (#11004) Store topic ID info in consumer metadata. We will always take the topic ID from the latest metadata response and remove any topic IDs from the cache if the metadata response did not return a topic ID for the topic. The benefit of this is that it lets us detect topic recreations. This allows the client to update metadata even if the leader epoch is lower than what was seen previously. Reviewers: Jason Gustafson <jason@confluent.io>

…e same name (apache#11004) Store topic ID info in consumer metadata. We will always take the topic ID from the latest metadata response and remove any topic IDs from the cache if the metadata response did not return a topic ID for the topic. The benefit of this is that it lets us detect topic recreations. This allows the client to update metadata even if the leader epoch is lower than what was seen previously. Reviewers: Jason Gustafson <jason@confluent.io>

broke here apache#11004

jolshan added 3 commits July 1, 2021 12:28

Don't ignore metadata if new topic ID seen.

b5ed86b

Change method to return single name or ID, minor cleanups

6308c0c

Merge branch 'trunk' of github.com:apache/kafka into KAFKA-12257-trunk

240e489

jolshan commented Jul 8, 2021

View reviewed changes

hachikuji reviewed Nov 9, 2021

View reviewed changes

Comment thread clients/src/main/java/org/apache/kafka/clients/Metadata.java Outdated

Comment thread clients/src/main/java/org/apache/kafka/clients/Metadata.java Outdated

Comment thread clients/src/main/java/org/apache/kafka/clients/Metadata.java Outdated

jolshan added 4 commits November 10, 2021 15:30

Addressing comments

e128b00

Better error message

def79c3

More style and comment fixes

9bafd9f

Reorder checks so topic ID is first

6c83107

Merge branch 'trunk' of github.com:apache/kafka into KAFKA-12257-trunk

6e43c3c

hachikuji reviewed Nov 15, 2021

View reviewed changes

Addressing comments

92d6ef1

hachikuji reviewed Nov 16, 2021

View reviewed changes

Comment thread clients/src/main/java/org/apache/kafka/clients/Metadata.java Outdated

Put map access back in metadata

0a7e6c0

hachikuji reviewed Nov 17, 2021

View reviewed changes

hachikuji approved these changes Nov 17, 2021

View reviewed changes

hachikuji merged commit 06dfa54 into apache:trunk Nov 17, 2021

prat0318 mentioned this pull request Nov 30, 2021

KAFKA-13488: Producer fails to recover if topic gets deleted midway #11552

Merged

3 tasks

msn-tldr added a commit to msn-tldr/kafka that referenced this pull request Nov 1, 2023

Fix Metadatacache where pre-existing topicids wouldn't be retained

4553fe6

broke here apache#11004

Conversation

jolshan commented Jul 8, 2021

Committer Checklist (excluded from commit message)

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jolshan commented Nov 11, 2021

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hachikuji left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants