-
Notifications
You must be signed in to change notification settings - Fork 3.7k
branch-3.0 [fix] (inverted index) Fix UTF-8 4-byte truncation issue and add configuration to control correct term writing (#48657) #48792
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
|
run buildall |
TPC-H: Total hot run time: 41039 ms |
TPC-DS: Total hot run time: 197820 ms |
ClickBench: Total hot run time: 32.75 s |
…iguration to control correct term writing (apache#48657) Problem Summary: Added the `enable_inverted_index_correct_term_write` configuration as a compatibility option to fix the issue of term truncation caused by special characters. In some situations, a rollback may be necessary after an upgrade. To accommodate this, the initial goal is to set `enable_inverted_index_correct_term_write=false`, prioritizing compatibility with correct term reading before enabling correct term writing. Once reading is confirmed to be working correctly, the configuration can be switched to `enable_inverted_index_correct_term_write=true` to write non-truncated terms properly.
|
run buildall |
TPC-H: Total hot run time: 39838 ms |
TPC-DS: Total hot run time: 196403 ms |
ClickBench: Total hot run time: 32.39 s |
dataroaring
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
PR approved by at least one committer and no changes requested. |
|
PR approved by anyone and no changes requested. |
…iguration to control correct term writing (apache#48657) (apache#48792) (apache#3803) cherry pick from apache#48657
cherry pick from #48657