Skip to content

Conversation

@kaka11chen
Copy link

  1. Implements selection vector for ORC lazy materialization. From the test, currently implements float/double, date/timestamp, decimal, string dict types for better performance, and other types have performance penalty.
  2. Decrease loadStripIndex() call count.
  3. Adjust code format.

1. Implements selection vector for ORC lazy materialization.
From the test, currently implements float/double, date/timestamp, decimal, string dict types for better performance,
and other types have performance penalty.
2. Decrease `loadStripIndex()` call count.
3. Adjust code format.
@morningman morningman merged commit 0e53506 into apache:orc May 6, 2023
kaka11chen added a commit to kaka11chen/doris that referenced this pull request May 7, 2023
- Implements ORC lazy materialization, integrate with the implementation of apache/doris-thirdparty#56 and apache/doris-thirdparty#62.
- Refactor code: Move `execute_conjuncts()` and `execute_conjuncts_and_filter_block()` in `parquet_group_reader `to `VExprContext`, used by parquet reader and orc reader.
- Add session variable `enable_parquet_lazy_materialization` and `enable_orc_lazy_materialization` to control whether enable lazy materialization.
morningman pushed a commit to apache/doris that referenced this pull request May 9, 2023
- Implements ORC lazy materialization, integrate with the implementation of apache/doris-thirdparty#56 and apache/doris-thirdparty#62.
- Refactor code: Move `execute_conjuncts()` and `execute_conjuncts_and_filter_block()` in `parquet_group_reader `to `VExprContext`, used by parquet reader and orc reader.
- Add session variables `enable_parquet_lazy_materialization` and `enable_orc_lazy_materialization` to control whether enable lazy materialization.
- Modify `build.sh` to update apache-orc submodule or download package every time.
Reminiscent pushed a commit to Reminiscent/doris that referenced this pull request May 15, 2023
…#18615)

- Implements ORC lazy materialization, integrate with the implementation of apache/doris-thirdparty#56 and apache/doris-thirdparty#62.
- Refactor code: Move `execute_conjuncts()` and `execute_conjuncts_and_filter_block()` in `parquet_group_reader `to `VExprContext`, used by parquet reader and orc reader.
- Add session variables `enable_parquet_lazy_materialization` and `enable_orc_lazy_materialization` to control whether enable lazy materialization.
- Modify `build.sh` to update apache-orc submodule or download package every time.
Reminiscent pushed a commit to Reminiscent/doris that referenced this pull request May 15, 2023
…#18615)

- Implements ORC lazy materialization, integrate with the implementation of apache/doris-thirdparty#56 and apache/doris-thirdparty#62.
- Refactor code: Move `execute_conjuncts()` and `execute_conjuncts_and_filter_block()` in `parquet_group_reader `to `VExprContext`, used by parquet reader and orc reader.
- Add session variables `enable_parquet_lazy_materialization` and `enable_orc_lazy_materialization` to control whether enable lazy materialization.
- Modify `build.sh` to update apache-orc submodule or download package every time.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants