airborne12 (Member) commented Feb 24, 2025

cherry pick from #47846 #48231

…ache#47846)

This pull request focuses on improving the handling of null values in
the inverted index writer and simplifying the codebase by removing
redundant null map checks. The most important changes include removing
unnecessary null map handling in several methods and ensuring proper
null bitmap updates.

Improvements to null value handling and code simplification:

* [`be/src/olap/rowset/segment_v2/column_writer.cpp`](diffhunk://#diff-db6023c6e1df0c3616055f02e769cc20fcef7ee083cb3755cec1b661bb7b42ffL952-L958): Removed redundant null map handling in `Status ArrayColumnWriter::append_nullable` method.
* [`be/src/olap/rowset/segment_v2/inverted_index_writer.cpp`](diffhunk://#diff-97781916b276f771710ab520c79ca29d5e4e331296fad7573fc9933a376dc165L328-R328): Simplified `add_array_nulls` method to always return `Status::OK()`.
* [`be/src/olap/rowset/segment_v2/inverted_index_writer.cpp`](diffhunk://#diff-97781916b276f771710ab520c79ca29d5e4e331296fad7573fc9933a376dc165L429-R426): Added null map check before accessing elements in the loop to prevent potential null pointer dereference. [[1]](diffhunk://#diff-97781916b276f771710ab520c79ca29d5e4e331296fad7573fc9933a376dc165L429-R426) [[2]](diffhunk://#diff-97781916b276f771710ab520c79ca29d5e4e331296fad7573fc9933a376dc165L525-R531)
* [`be/src/olap/rowset/segment_v2/inverted_index_writer.cpp`](diffhunk://#diff-97781916b276f771710ab520c79ca29d5e4e331296fad7573fc9933a376dc165R513): Updated `_null_bitmap` in the `add_null_document` method to ensure proper null bitmap updates.
* [`be/src/olap/task/index_builder.cpp`](diffhunk://#diff-df38b3b177cd231676ce7a405526b3419c543e29171143ddec02960a84a930c6L645-R645): Removed redundant null map handling in `Status IndexBuilder::_add_nullable` method.

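
For readers unfamiliar with the pattern, the two behavioral changes listed above (checking the null map before touching an element, and recording the row in the null bitmap whenever a null document is added) can be sketched roughly as follows. This is a minimal, self-contained illustration and not the Doris implementation: `SketchIndexWriter`, `add_values`, `null_rows()`, `indexed()` and `next_row_id()` are hypothetical names, and a `std::set` stands in for whatever bitmap type backs `_null_bitmap` in the real writer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an inverted index writer (not the Doris class).
class SketchIndexWriter {
public:
    // Indexes `count` values. `null_map` may be nullptr, meaning the
    // column carries no null information and every row is non-null.
    void add_values(const std::string* values, const uint8_t* null_map, size_t count) {
        assert(values != nullptr || count == 0);
        for (size_t i = 0; i < count; ++i) {
            // Consult the null map before reading values[i]; a null row
            // never touches the value slot.
            if (null_map != nullptr && null_map[i] != 0) {
                add_null_document();
                continue;
            }
            _indexed.emplace_back(_rid, values[i]);
            ++_rid;
        }
    }

    // Adds an empty document for a null row and records the row id in
    // the null set, so the two can never drift apart.
    void add_null_document() {
        _null_rows.insert(_rid);
        ++_rid;
    }

    const std::set<uint32_t>& null_rows() const { return _null_rows; }
    const std::vector<std::pair<uint32_t, std::string>>& indexed() const { return _indexed; }
    uint32_t next_row_id() const { return _rid; }

private:
    uint32_t _rid = 0;                                       // next row id to assign
    std::set<uint32_t> _null_rows;                           // stand-in for a null bitmap
    std::vector<std::pair<uint32_t, std::string>> _indexed;  // (row id, value) pairs
};
```

Keeping the null bookkeeping inside `add_null_document` in this sketch means callers do not need their own null map handling, which is the kind of simplification the description above attributes to `append_nullable` and `_add_nullable`.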
@airborne12
Member Author

run buildall

@Thearas
Contributor

Thearas commented Feb 24, 2025

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@airborne12
Member Author

run buildall

@airborne12
Member Author

run buildall

@airborne12
Member Author

run buildall

@doris-robot

TeamCity be ut coverage result:
Function Coverage: 36.60% (9636/26328)
Line Coverage: 28.20% (79976/283639)
Region Coverage: 26.82% (40977/152809)
Branch Coverage: 23.57% (20782/88154)
Coverage Report: http://coverage.selectdb-in.cc/coverage/6cfcad231d13a91741f95b2efb193150135c4027_6cfcad231d13a91741f95b2efb193150135c4027/report/index.html

@yiguolei yiguolei merged commit 1aa57a3 into apache:branch-2.1 Feb 25, 2025
19 of 20 checks passed