KAFKA-8335; Clean empty batches when sequence numbers are reused by hachikuji · Pull Request #6715 · apache/kafka

hachikuji · 2019-05-10T22:26:26Z

The log cleaner attempts to preserve the last entry for each producerId in order to ensure that sequence/epoch state is not lost. The current validation checks only the last sequence number for each producerId in order to decide whether a batch should be retained. There are two problems with this:

1. Sequence numbers alone are not unique. Only the tuple of sequence number and epoch is uniquely defined.
2. The group coordinator always writes batches beginning with sequence number 0, which means there could be many batches with the same sequence number.

The complete fix for the second issue is probably to do the proper sequence number bookkeeping in the coordinator. For now, I have left the coordinator implementation unchanged and changed the cleaner logic to use the last offset written by a producer instead of the last sequence number.
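
As a rough illustration of the difference between the two checks, here is a simplified sketch. It is not the code in this patch: the `Batch` case class, `retainBySequence`, and `retainByOffset` are hypothetical names, and the maps stand in for whatever producer state the cleaner actually consults.

```scala
object RetentionSketch {
  // Hypothetical, simplified view of a batch header; not Kafka's RecordBatch.
  final case class Batch(producerId: Long, producerEpoch: Short, lastSequence: Int, lastOffset: Long)

  // Sequence-based check: retain a batch if its last sequence number matches
  // the last sequence number known for the producer.
  def retainBySequence(batch: Batch, lastSequences: Map[Long, Int]): Boolean =
    lastSequences.get(batch.producerId).contains(batch.lastSequence)

  // Offset-based check: retain a batch only if it ends at the last offset
  // written by the producer. Offsets are unique within a partition.
  def retainByOffset(batch: Batch, lastOffsets: Map[Long, Long]): Boolean =
    lastOffsets.get(batch.producerId).contains(batch.lastOffset)

  // Three batches from one producer that all reuse sequence number 0,
  // as happens when every write starts again at sequence 0.
  val epoch: Short = 0
  val batches: Seq[Batch] = Seq(
    Batch(1L, epoch, 0, 10L),
    Batch(1L, epoch, 0, 20L),
    Batch(1L, epoch, 0, 30L)
  )

  // The sequence-based check cannot tell the batches apart and keeps all of them.
  val keptBySequence: Seq[Long] =
    batches.filter(retainBySequence(_, Map(1L -> 0))).map(_.lastOffset)

  // The offset-based check keeps only the batch that is really the last one.
  val keptByOffset: Seq[Long] =
    batches.filter(retainByOffset(_, Map(1L -> 30L))).map(_.lastOffset)
}
```

In this toy scenario the sequence-based check would retain every batch, while the offset-based check retains only the one ending at offset 30.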

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

guozhangwang

LGTM! Just a question about the second condition.

guozhangwang · 2019-05-11T19:25:14Z


        try {
          cleanInto(log.topicPartition, currentSegment.log, cleaned, map, retainDeletes, log.config.maxMessageSize,
-            transactionMetadata, log.activeProducersWithLastSequence, stats)


We can remove this activeProducersWithLastSequence function now, right?

hachikuji · 2019-05-13

It's still used pretty heavily in test cases.

guozhangwang · 2019-05-11T19:29:17Z

+          lastRecordsOfActiveProducers.get(batch.producerId).exists { lastRecord =>
+            lastRecord.lastDataOffset match {
+              case Some(offset) => batch.lastOffset == offset
+              case None => batch.isControlBatch && batch.producerEpoch == lastRecord.producerEpoch


Just being paranoid here: a txn marker should always follow some data (from the same producer-id) on the partition -- i.e. if there is no data produced to this partition, there should be no txn marker.

With that, I'm not sure whether the second condition can really happen: a producer has never produced any data, hence has no latest offset, but still has a control batch. Even with compaction, the producer's latest offset should still be preserved, right?

hachikuji · 2019-05-12

A producer can abort after adding the partition to the transaction but before sending any data. In this case, there could be markers without any data.

guozhangwang · 2019-05-13

Ack, makes sense.
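
To make the exchange above concrete, the following is a self-contained sketch of the two-branch check from the quoted diff. The `LastRecord` and `Batch` case classes and the `isLastBatchOfProducer` name are hypothetical stand-ins that model only the fields the snippet reads; the surrounding cleaner code is not reproduced here.

```scala
object LastRecordSketch {
  // Hypothetical stand-in for the per-producer state consulted by the cleaner.
  // lastDataOffset is None when the producer has written no data to the partition.
  final case class LastRecord(lastDataOffset: Option[Long], producerEpoch: Short)

  // Hypothetical, simplified batch view with only the fields used below.
  final case class Batch(producerId: Long, producerEpoch: Short, lastOffset: Long, isControlBatch: Boolean)

  def isLastBatchOfProducer(batch: Batch, lastRecords: Map[Long, LastRecord]): Boolean =
    lastRecords.get(batch.producerId).exists { lastRecord =>
      lastRecord.lastDataOffset match {
        // The producer has written data: only the batch ending at that offset qualifies.
        case Some(offset) => batch.lastOffset == offset
        // No data at all (for example, an abort before any send): fall back to
        // matching a control batch from the producer's current epoch.
        case None => batch.isControlBatch && batch.producerEpoch == lastRecord.producerEpoch
      }
    }

  // Producer 7 aborted a transaction in epoch 3 without sending any data.
  val state: Map[Long, LastRecord] = Map(7L -> LastRecord(None, 3.toShort))

  val currentMarker: Batch = Batch(7L, 3.toShort, 100L, isControlBatch = true)
  val olderMarker: Batch = Batch(7L, 2.toShort, 50L, isControlBatch = true)
  val unknownProducer: Batch = Batch(8L, 3.toShort, 100L, isControlBatch = true)
}
```

Under these assumptions, `isLastBatchOfProducer(currentMarker, state)` is true, while the marker from the older epoch and the batch from a producer with no recorded state are both rejected.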

guozhangwang · 2019-05-11T19:32:57Z

retest this please

The log cleaner attempts to preserve the last entry for each producerId in order to ensure that sequence/epoch state is not lost. The current validation checks only the last sequence number for each producerId in order to decide whether a batch should be retained. There are two problems with this:

1. Sequence numbers are not unique alone. It is the tuple of sequence number and epoch which is uniquely defined.
2. The group coordinator always writes batches beginning with sequence number 0, which means there could be many batches which have the same sequence number.

The complete fix for the second issue would probably add proper sequence number bookkeeping in the coordinator. For now, we have left the coordinator implementation unchanged and changed the cleaner logic to use the last offset written by a producer instead of the last sequence number.

Reviewers: Guozhang Wang <wangguoz@gmail.com>

…es-14-May

* AK_REPO/trunk: (24 commits)
  KAFKA-7321: Add a Maximum Log Compaction Lag (KIP-354) (apache#6009)
  KAFKA-8335; Clean empty batches when sequence numbers are reused (apache#6715)
  KAFKA-6455: Session Aggregation should use window-end-time as record timestamp (apache#6645)
  KAFKA-6521: Use timestamped stores for KTables (apache#6667)
  [MINOR] Consolidate in-memory/rocksdb unit tests for window & session store (apache#6677)
  MINOR: Include StickyAssignor in system tests (apache#5223)
  KAFKA-7633: Allow Kafka Connect to access internal topics without cluster ACLs (apache#5918)
  MINOR: Align KTableAgg and KTableReduce (apache#6712)
  MINOR: Fix code section formatting in TROGDOR.md (apache#6720)
  MINOR: Remove unnecessary OptionParser#accepts method call from PreferredReplicaLeaderElectionCommand (apache#6710)
  KAFKA-8352 : Fix Connect System test failure 404 Not Found (apache#6713)
  KAFKA-8348: Fix KafkaStreams JavaDocs (apache#6707)
  MINOR: Add missing option for running vagrant-up.sh with AWS to vagrant/README.md
  KAFKA-8344; Fix vagrant-up.sh to work with AWS properly
  MINOR: docs typo in '--zookeeper myhost:2181--execute'
  MINOR: Remove header and key/value converter config value logging (apache#6660)
  KAFKA-8231: Expansion of ConnectClusterState interface (apache#6584)
  KAFKA-8324: Add close() method to RocksDBConfigSetter (apache#6697)
  KAFKA-6789; Handle retriable group errors in AdminClient API (apache#5578)
  KAFKA-8332: Refactor ImplicitLinkedHashSet to avoid losing ordering when converting to Scala
  ...

junrao

@hachikuji : Thanks for the PR. LGTM too.

KAFKA-8335; Clean empty batches when sequence numbers are reused

dab0403

guozhangwang approved these changes May 11, 2019

hachikuji merged commit d798dbf into apache:trunk May 13, 2019

junrao reviewed May 13, 2019

hachikuji merged 1 commit into apache:trunk from hachikuji:KAFKA-8335
