[chery-pick](branch-2.1) Pick "[Fix](group commit) Fix group commit block queue mem estimate fault" #37379

Yukang-Lian · 2024-07-07T02:46:29Z

Pick [Fix](group commit) Fix group commit block queue mem estimate faule #35314

Proposed changes

Issue Number: close #xxx

Problem: When group commit=async_mode and NULL data is imported into a variant type column, it causes incorrect memory statistics for group commit backpressure, leading to a stuck issue. Cause: In group commit mode, blocks are first added to a queue in batches using add block, and then blocks are retrieved from the queue using get block. To track memory usage during backpressure, we add the block size to the memory statistics during add block and subtract the block size from the memory statistics during get block. However, for variant types, during the add block write to WAL, serialization occurs, which can merge types (e.g., merging int and bigint into bigint), thereby changing the block size. This results in a discrepancy between the block size during get block and add block, causing memory statistics to overflow.
Solution: Record the block size at the time of add block and use this recorded size during get block instead of the actual block size. This ensures consistency in the memory addition and subtraction.

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

Proposed changes

Issue Number: close #xxx

…pache#35314) ## Proposed changes Issue Number: close #xxx  **Problem:** When `group commit=async_mode` and NULL data is imported into a `variant` type column, it causes incorrect memory statistics for group commit backpressure, leading to a stuck issue. **Cause:** In group commit mode, blocks are first added to a queue in batches using `add block`, and then blocks are retrieved from the queue using `get block`. To track memory usage during backpressure, we add the block size to the memory statistics during `add block` and subtract the block size from the memory statistics during `get block`. However, for `variant` types, during the `add block` write to WAL, serialization occurs, which can merge types (e.g., merging `int` and `bigint` into `bigint`), thereby changing the block size. This results in a discrepancy between the block size during `get block` and `add block`, causing memory statistics to overflow. **Solution:** Record the block size at the time of `add block` and use this recorded size during `get block` instead of the actual block size. This ensures consistency in the memory addition and subtraction. ## Further comments If this is a relatively large or complex change, kick off the discussion at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc...

Yukang-Lian · 2024-07-07T02:46:50Z

run buildall

github-actions · 2024-07-07T02:52:47Z

clang-tidy review says "All clean, LGTM! 👍"

doris-robot · 2024-07-07T04:20:47Z

TeamCity be ut coverage result:
Function Coverage: 36.32% (9151/25194)
Line Coverage: 27.86% (74703/268115)
Region Coverage: 26.74% (38512/144033)
Branch Coverage: 23.45% (19521/83252)
Coverage Report: http://coverage.selectdb-in.cc/coverage/c01253eb4b4d2ffd8b1987dda27c81fb7464124f_c01253eb4b4d2ffd8b1987dda27c81fb7464124f/report/index.html

dataroaring merged commit 7d423b3 into apache:branch-2.1 Jul 7, 2024

yiguolei mentioned this pull request Jul 19, 2024

2.1.5 Release Notes #38111

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[chery-pick](branch-2.1) Pick "[Fix](group commit) Fix group commit block queue mem estimate fault" #37379

[chery-pick](branch-2.1) Pick "[Fix](group commit) Fix group commit block queue mem estimate fault" #37379

Uh oh!

Yukang-Lian commented Jul 7, 2024

Uh oh!

Yukang-Lian commented Jul 7, 2024

Uh oh!

github-actions bot commented Jul 7, 2024

Uh oh!

doris-robot commented Jul 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[chery-pick](branch-2.1) Pick "[Fix](group commit) Fix group commit block queue mem estimate fault" #37379

[chery-pick](branch-2.1) Pick "[Fix](group commit) Fix group commit block queue mem estimate fault" #37379

Uh oh!

Conversation

Yukang-Lian commented Jul 7, 2024

Proposed changes

Further comments

Proposed changes

Uh oh!

Yukang-Lian commented Jul 7, 2024

Uh oh!

github-actions bot commented Jul 7, 2024

Uh oh!

doris-robot commented Jul 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants