Skip to content

Conversation

@kaka11chen
Copy link

No description provided.

@morningman morningman merged commit f10bb3d into apache:orc Jan 10, 2025
morningman pushed a commit to apache/doris that referenced this pull request Mar 3, 2025
…#45966)

### What problem does this PR solve?

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
koarz pushed a commit to koarz/doris that referenced this pull request Jun 4, 2025
…apache#45966)

### What problem does this PR solve?

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
kaka11chen added a commit to kaka11chen/doris that referenced this pull request Jun 20, 2025
…apache#45966)

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
kaka11chen added a commit to kaka11chen/doris that referenced this pull request Jun 20, 2025
…apache#45966)

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
kaka11chen added a commit to kaka11chen/doris that referenced this pull request Jun 20, 2025
…apache#45966)

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
kaka11chen added a commit to kaka11chen/doris that referenced this pull request Jun 24, 2025
…apache#45966)

related: apache/doris-thirdparty#270

Problem Summary:

The original merge io mechanism `MergeRangeFileReader` requires that the
range must be read in order, and the ranges can be out of order, so the
range cannot be read back.
And if you turn on delayed materialization of orc complex types, you
will need to present a stream readback scenario, such as `select
struct_element(info, 'age'), id from test_orc_struct, where
struct_element(info, 'name') = 'Alice'`.
When late materialization is turned on, the present stream of the parent
node `info` will be read first after `name` is read. When reading `age`,
the parent node `info` needs to be read back. So the late
materialization of the orc complex type cannot be turned on at present.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants