[opt](split) generate and get split batch concurrently #36045
clang-tidy review says "All clean, LGTM! 👍" |
|
run buildall |
|
TeamCity be ut coverage result: |
TPC-H: Total hot run time: 41508 ms |
TPC-DS: Total hot run time: 172672 ms |
ClickBench: Total hot run time: 30.64 s |
|
TeamCity be ut coverage result: |
TPC-H: Total hot run time: 39650 ms |
TPC-DS: Total hot run time: 173388 ms |
ClickBench: Total hot run time: 30.69 s |
morningman
left a comment
LGTM
|
PR approved by at least one committer and no changes requested. |
|
PR approved by anyone and no changes requested. |
TPC-H: Total hot run time: 39718 ms |
TPC-DS: Total hot run time: 174369 ms |
ClickBench: Total hot run time: 30.16 s |
TPC-H: Total hot run time: 39985 ms |
TPC-DS: Total hot run time: 172755 ms |
ClickBench: Total hot run time: 31.01 s |
morningman
left a comment
LGTM
|
PR approved by at least one committer and no changes requested. |
Proposed changes
Generate and get split batches concurrently.
Remove the synchronization from `SplitSource.getNextBatch` so that each caller gets its splits concurrently, and make `SplitAssignment` generate splits asynchronously.
Obtaining partitions and splits is not stable: when an unstable situation occurs, the time consumed increases several times. The batch interface retrieves split data in batches, which greatly increases the number of unstable situations encountered. The batch interface is therefore temporarily turned off.
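The general pattern described above (a producer that generates split batches asynchronously while several consumers fetch batches without a shared lock) can be sketched as follows. This is only an illustration under stated assumptions, not the code in this PR: the class `SplitBatchSketch`, its methods, the use of integers as stand-ins for splits, and the queue capacity are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical sketch: names and structure are assumptions, not Doris APIs.
class SplitBatchSketch {
    // Sentinel batch (compared by identity) that marks the end of the split stream.
    private static final List<Integer> END = new ArrayList<>();

    // Bounded queue so the generator cannot run arbitrarily far ahead of consumers.
    private final BlockingQueue<List<Integer>> queue = new LinkedBlockingQueue<>(8);

    // Generates splits on a background thread; integers stand in for real splits.
    // Assumes batchSize >= 1.
    void startGenerating(int totalSplits, int batchSize) {
        Thread producer = new Thread(() -> {
            try {
                List<Integer> batch = new ArrayList<>(batchSize);
                for (int i = 0; i < totalSplits; i++) {
                    batch.add(i);
                    if (batch.size() == batchSize) {
                        queue.put(batch);
                        batch = new ArrayList<>(batchSize);
                    }
                }
                if (!batch.isEmpty()) {
                    queue.put(batch);
                }
                queue.put(END);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        producer.setDaemon(true);
        producer.start();
    }

    // Not synchronized: the queue handles concurrent callers.
    // Returns an empty list once all splits have been handed out.
    List<Integer> getNextBatch() throws InterruptedException {
        List<Integer> batch = queue.take();
        if (batch == END) {
            queue.put(END); // re-publish the sentinel so other callers also stop
        }
        return batch;
    }

    // Runs several consumers concurrently and returns how many splits they received in total.
    static int run(int consumers, int totalSplits, int batchSize) {
        SplitBatchSketch source = new SplitBatchSketch();
        source.startGenerating(totalSplits, batchSize);
        ExecutorService pool = Executors.newFixedThreadPool(consumers);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (int c = 0; c < consumers; c++) {
                Callable<Integer> task = () -> {
                    int count = 0;
                    while (true) {
                        List<Integer> batch = source.getNextBatch();
                        if (batch.isEmpty()) {
                            return count;
                        }
                        count += batch.size();
                    }
                };
                results.add(pool.submit(task));
            }
            int total = 0;
            for (Future<Integer> f : results) {
                total += f.get();
            }
            return total;
        } catch (Exception e) {
            throw new RuntimeException(e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println("splits received: " + run(4, 1000, 64));
    }
}
```

In this sketch the only shared state is the blocking queue, so consumers wait on each other only inside the queue operations rather than on a method-level lock, and generation overlaps with consumption.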
Improvements
Query a big table with 60591 partitions and 1209172 splits:
set `max_external_file_cache_num=0` in `fe/fe.conf` to turn off the cache;
set `num_partitions_in_batch_mode=-1` in session variables to turn off the batch mode.