[fix](auto bucket)Fix auto bucket calc bucketnum err when partition size is invalid #52801
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
wait case
fe/fe-core/src/main/java/org/apache/doris/clone/DynamicPartitionScheduler.java
run buildall
yujun777
left a comment
LGTM
PR approved by anyone and no changes requested.
run buildall
TPC-H: Total hot run time: 33590 ms
TPC-DS: Total hot run time: 185158 ms
ClickBench: Total hot run time: 29.58 s
dataroaring
left a comment
LGTM
PR approved by at least one committer and no changes requested.
…ize is invalid (apache#52801)
…ize invalid
1. Fixed auto bucket calculating a wrong bucket number when the partition size is inaccurate:
- If `replica.size == 0`, filter out that replica. `tablet.getDataSize` computes the size as the average over the replicas, so a replica with size 0 distorts the average heavily. Replicas with size 0 are therefore excluded.
- If a partition's size equals 0, do not include it in the partition size estimate.
- If every partition that has data reports a size of 0, the bucket number calculated for the new partition equals the bucket number of the previous version with a size greater than 0. The actual partition sizes are unknown because the stats thread has not collected them yet, so the new partition is assumed to be the same size as that previous version, and its bucket number naturally equals that of the previous partition.
2. Added a warning log when the bucket number calculated by auto bucket exceeds the threshold.
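The rules in this commit message can be sketched roughly as follows. This is a minimal illustration, not the code from `DynamicPartitionScheduler.java`: the class `AutoBucketSketch`, its method names, the `bytesPerBucket` and `warnThreshold` parameters, and the use of a plain mean over non-zero partition sizes are all assumptions made for the example.

```java
import java.util.List;

/** Hypothetical sketch of the zero-size handling described above; not Doris code. */
final class AutoBucketSketch {
    private AutoBucketSketch() {}

    /** Averages replica sizes, skipping replicas that report 0 bytes. */
    static long averageReplicaSize(List<Long> replicaSizes) {
        long sum = 0;
        int counted = 0;
        for (long size : replicaSizes) {
            if (size > 0) { // a 0-byte replica would drag the average down
                sum += size;
                counted++;
            }
        }
        return counted == 0 ? 0 : sum / counted;
    }

    /**
     * Estimates the next partition size from history, ignoring partitions
     * whose size is 0. Assumption: a plain mean stands in for the real estimator.
     * Returns 0 when no partition has a usable size.
     */
    static long estimatePartitionSize(List<Long> partitionSizes) {
        return averageReplicaSize(partitionSizes); // same "skip zeros" averaging
    }

    /**
     * Picks a bucket number. Falls back to previousBucketNum (hypothetical
     * parameter) when every historical partition size is 0.
     */
    static int chooseBucketNum(List<Long> partitionSizes, int previousBucketNum,
                               long bytesPerBucket, int warnThreshold) {
        long estimated = estimatePartitionSize(partitionSizes);
        if (estimated == 0) {
            // Sizes not collected yet: reuse the previous bucket number.
            return previousBucketNum;
        }
        // Ceiling division, with at least one bucket.
        int buckets = (int) Math.max(1, (estimated + bytesPerBucket - 1) / bytesPerBucket);
        if (buckets > warnThreshold) {
            System.err.println("auto bucket: calculated bucket num " + buckets
                    + " exceeds threshold " + warnThreshold);
        }
        return buckets;
    }
}
```

For example, with replica sizes of 0, 100 and 300 bytes the filtered average is 200, whereas a naive average over all three replicas would be about 133.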
What problem does this PR solve?
Issue Number: close #xxx
Related PR: #xxx
Problem Summary:
Release note
None
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)