Skip to content

Conversation

@kaijchen
Copy link
Member

Proposed changes

#40912 has changed meaning of write_mem in memtable memory limiter.
This PR is a followup to change the active memtable flush policy accordingly.

It also changed:

  1. The amount of memtable writers selected in one flush.
  2. The memtable writers are selected in orders of its size.

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@kaijchen
Copy link
Member Author

run buildall

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 37.32% (9591/25699)
Line Coverage: 28.71% (79276/276109)
Region Coverage: 28.17% (41038/145657)
Branch Coverage: 24.80% (20910/84318)
Coverage Report: http://coverage.selectdb-in.cc/coverage/44b63b632c25b76a91fa28f5d4e687dad4dd33cb_44b63b632c25b76a91fa28f5d4e687dad4dd33cb/report/index.html

Copy link
Contributor

@xinyiZzz xinyiZzz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Sep 19, 2024
@github-actions
Copy link
Contributor

PR approved by at least one committer and no changes requested.

@github-actions
Copy link
Contributor

PR approved by anyone and no changes requested.

}
int64_t mem = w->active_memtable_mem_consumption();
if (mem < sort_mem * 0.9) {
// if the memtable writer just got flushed, don't flush it again
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

when will reach this code?

@yiguolei yiguolei merged commit 34ab55f into apache:master Sep 20, 2024
yiguolei pushed a commit that referenced this pull request Sep 20, 2024
## Proposed changes

#40912 has changed meaning of `write_mem` in memtable memory limiter.
This PR is a followup to change the active memtable flush policy
accordingly.

It also changed:
1. The amount of memtable writers selected in one flush.
2. The memtable writers are selected in orders of its size.
kaijchen added a commit to kaijchen/doris that referenced this pull request Sep 24, 2024
liaoxin01 pushed a commit that referenced this pull request Sep 26, 2024
## Proposed changes

#41018 used priority queue when selecting memtables to flush.
But the compare function is wrong and causing the order to be the
opposite.

> Note that the Compare parameter is defined such that it
returns true if its first argument comes before its second argument in a
weak ordering. But because the priority queue outputs largest elements
first, the elements that "come before" are actually output last. That
is, the front of the queue contains the "last" element according to the
weak ordering imposed by Compare.

This PR fixes the compare function to make larger memtables come front.
kaijchen added a commit to kaijchen/doris that referenced this pull request Oct 21, 2024
## Proposed changes

apache#40912 has changed meaning of `write_mem` in memtable memory limiter.
This PR is a followup to change the active memtable flush policy
accordingly.

It also changed:
1. The amount of memtable writers selected in one flush.
2. The memtable writers are selected in orders of its size.
kaijchen added a commit to kaijchen/doris that referenced this pull request Oct 21, 2024
…he#41278)

## Proposed changes

apache#41018 used priority queue when selecting memtables to flush.
But the compare function is wrong and causing the order to be the
opposite.

> Note that the Compare parameter is defined such that it
returns true if its first argument comes before its second argument in a
weak ordering. But because the priority queue outputs largest elements
first, the elements that "come before" are actually output last. That
is, the front of the queue contains the "last" element according to the
weak ordering imposed by Compare.

This PR fixes the compare function to make larger memtables come front.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
This PR is a followup to change the active memtable flush policy
accordingly.

It also changed:
1. The amount of memtable writers selected in one flush.
2. The memtable writers are selected in orders of its size.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
…he#41278)

## Proposed changes

apache#41018 used priority queue when selecting memtables to flush.
But the compare function is wrong and causing the order to be the
opposite.

> Note that the Compare parameter is defined such that it
returns true if its first argument comes before its second argument in a
weak ordering. But because the priority queue outputs largest elements
first, the elements that "come before" are actually output last. That
is, the front of the queue contains the "last" element according to the
weak ordering imposed by Compare.

This PR fixes the compare function to make larger memtables come front.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
This PR is a followup to change the active memtable flush policy
accordingly.

It also changed:
1. The amount of memtable writers selected in one flush.
2. The memtable writers are selected in orders of its size.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
…he#41278)

## Proposed changes

apache#41018 used priority queue when selecting memtables to flush.
But the compare function is wrong and causing the order to be the
opposite.

> Note that the Compare parameter is defined such that it
returns true if its first argument comes before its second argument in a
weak ordering. But because the priority queue outputs largest elements
first, the elements that "come before" are actually output last. That
is, the front of the queue contains the "last" element according to the
weak ordering imposed by Compare.

This PR fixes the compare function to make larger memtables come front.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
This PR is a followup to change the active memtable flush policy
accordingly.

It also changed:
1. The amount of memtable writers selected in one flush.
2. The memtable writers are selected in orders of its size.
kaijchen added a commit to kaijchen/doris that referenced this pull request Nov 18, 2024
…he#41278)

## Proposed changes

apache#41018 used priority queue when selecting memtables to flush.
But the compare function is wrong and causing the order to be the
opposite.

> Note that the Compare parameter is defined such that it
returns true if its first argument comes before its second argument in a
weak ordering. But because the priority queue outputs largest elements
first, the elements that "come before" are actually output last. That
is, the front of the queue contains the "last" element according to the
weak ordering imposed by Compare.

This PR fixes the compare function to make larger memtables come front.
@kaijchen kaijchen deleted the limiter-policy branch July 8, 2025 02:30
dataroaring pushed a commit that referenced this pull request Jul 11, 2025
…52906)

Related PR: #41018

Problem Summary:

Update the `_need_flush()` function to subtract both `_queue_mem_usage`
and `_flush_mem_usage` when deciding how much memory needs to be
flushed.

Previously, we only subtracted `_queue_mem_usage`, which could lead to
flushing more active memtables than necessary.

This change ensures that only the required amount of active memtable
memory is flushed, avoiding premature flushes.
This helps prevent creating small segments, which can hurt performance
and storage efficiency.
kaijchen added a commit to kaijchen/doris that referenced this pull request Jul 25, 2025
…pache#52906)

Related PR: apache#41018

Problem Summary:

Update the `_need_flush()` function to subtract both `_queue_mem_usage`
and `_flush_mem_usage` when deciding how much memory needs to be
flushed.

Previously, we only subtracted `_queue_mem_usage`, which could lead to
flushing more active memtables than necessary.

This change ensures that only the required amount of active memtable
memory is flushed, avoiding premature flushes.
This helps prevent creating small segments, which can hurt performance
and storage efficiency.
kaijchen added a commit to kaijchen/doris that referenced this pull request Jul 25, 2025
…pache#52906)

Related PR: apache#41018

Problem Summary:

Update the `_need_flush()` function to subtract both `_queue_mem_usage`
and `_flush_mem_usage` when deciding how much memory needs to be
flushed.

Previously, we only subtracted `_queue_mem_usage`, which could lead to
flushing more active memtables than necessary.

This change ensures that only the required amount of active memtable
memory is flushed, avoiding premature flushes.
This helps prevent creating small segments, which can hurt performance
and storage efficiency.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. dev/3.0.3-merged reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants