KAFKA-2055; Fix transient ConsumerBounceTest.testSeekAndCommitWithBro… #98
lvfangmin wants to merge 1 commit into apache:trunk from
…kerFailures failure;
Do you think it's worthwhile to mention the JIRA for the issue that this check gets around (i.e. KAFKA-1211)?
Yes, I think it's better to track this workaround, as we may have to remove it in the future.
I don't think KAFKA-1211 is related to this check. Isn't it KAFKA-2334 instead?
@guozhangwang Perhaps I was mistaken, but I thought the issue was that the writes were getting lost before the first seekToEnd, which means that it fails to find the expected position. I don't think it was because of inconsistency in the ordering of offsets visible to the user.
@hachikuji The write was not lost. Rather, the HW on the new leader was older than on the old leader, and seekToEnd is bounded by the leader's HW. I think it is unrelated to the messages on the replicas.
@guozhangwang That makes sense, thanks for the explanation. But is it not also possible that the writes could get lost in a scenario like that of KAFKA-1211 where the log gets truncated to the HW after a second leader failure? That would also reveal itself in the first seekToEnd returning an earlier than expected offset.
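For anyone following the thread, the scenario described above (the new leader's HW lagging the old leader's, with seekToEnd bounded by the leader's HW) can be sketched with a toy model. This is only an illustration under stated assumptions, not Kafka code: `Replica`, `seek_to_end`, and the offsets 10 and 7 are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical stand-in for a partition replica (not a Kafka class)."""
    log: list = field(default_factory=list)  # messages appended so far
    high_watermark: int = 0                  # offsets below this are visible to consumers

    def log_end_offset(self) -> int:
        return len(self.log)


def seek_to_end(leader: Replica) -> int:
    # Assumption taken from the discussion: the end position a consumer
    # sees is bounded by the current leader's high watermark.
    return leader.high_watermark


old_leader = Replica()
follower = Replica()

# Ten messages are produced and fully replicated to the follower.
for i in range(10):
    old_leader.log.append(i)
    follower.log.append(i)

# The old leader has advanced its HW to 10, but the follower has only
# learned an older HW value (7 is an arbitrary illustrative number).
old_leader.high_watermark = 10
follower.high_watermark = 7

# The old leader fails and the follower takes over.
new_leader = follower
position_after_failover = seek_to_end(new_leader)

# Nothing was lost from the log, yet the visible end offset moved backwards.
messages_retained = new_leader.log_end_offset()

# Once the new leader's HW catches up, the expected position is visible again.
new_leader.high_watermark = new_leader.log_end_offset()
position_after_catch_up = seek_to_end(new_leader)
```

In this model the log on the new leader still holds all ten messages, but a seekToEnd issued right after failover lands at 7 rather than 10, which is the kind of earlier-than-expected position the test has to tolerate.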
kafka-trunk-git-pr #50 FAILURE
@lvfangmin I saw the following test failure from git-pr #50: kafka.api.ProducerSendTest > testCloseWithZeroTimeoutFromCallerThread FAILED. Could you rerun the tests and see if the failure is transient? If so, we can file a separate JIRA for it.
@guozhangwang Running kafka.api.ProducerSendTest.testCloseWithZeroTimeoutFromCallerThread now; I will file a separate JIRA if it's a transient failure.
Encountered another error after running this test for 15 minutes: kafka.api.ProducerSendTest > testCloseWithZeroTimeoutFromCallerThread STANDARD_OUT kafka.api.ProducerSendTest > testCloseWithZeroTimeoutFromCallerThread FAILED. Will run once more to verify.
@lvfangmin I saw you filed another JIRA for this error. Did you find it to be transient both with and without your patch?
@guozhangwang The transient error occurred both with and without my patch. It is not related to my patch, so it's safe to merge.
LGTM
…nse when the request is to fetch metadata for all topics (apache#98) TICKET = KAFKA-10606 LI_DESCRIPTION = LIKAFKA-32857 EXIT_CRITERIA = Once we cherry-pick the fix for KAFKA-10606
CSMDS-649: Checkstyle error fixes (apache#98)
Otherwise, there's room for flakiness.
Cl3 refactor