Support for _pos metadata column #1765

@vustef

Is your feature request related to a problem or challenge?

The spec defines metadata columns, one of which is the _pos column. This column seems instrumental for future write operations such as positional deletes, and it would also give users a way to identify rows (by the combination of file path and position). There's also a related DataFusion issue: apache/datafusion#13261.

Describe the solution you'd like

The solution should allow building a TableScan with an option to return metadata columns (in this case _pos). When that option is set, the library should return batches that include this extra column. The column should preserve gaps when rows are skipped by filter pushdown or removed by merge-on-read deletes, so that each returned row still carries its original position in the data file.
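
To illustrate what "preserving gaps" means, here is a minimal, std-only sketch. It is not the library's API: `RowWithPos` and `scan_with_pos` are hypothetical names, a single `i32` column stands in for the real data, and `i64` is assumed for `_pos` because the spec types it as a long. The point is only that positions are assigned before filtering and delete application, not renumbered afterwards.

```rust
/// Hypothetical row type: one data column plus the `_pos` metadata column.
#[derive(Debug, PartialEq)]
struct RowWithPos {
    /// Ordinal position of the row in its source data file (`_pos`).
    pos: i64,
    /// A single data column, for illustration only.
    value: i32,
}

/// Hypothetical scan over the rows of one data file.
///
/// Each row is tagged with its original position first; only then are the
/// pushed-down predicate and the positional deletes applied. Surviving rows
/// therefore keep their original `_pos`, leaving gaps where rows were dropped.
fn scan_with_pos(
    file_rows: &[i32],
    predicate: impl Fn(i32) -> bool,
    deleted_positions: &[i64],
) -> Vec<RowWithPos> {
    file_rows
        .iter()
        .enumerate()
        // Assign positions before any row is removed.
        .map(|(idx, &value)| RowWithPos { pos: idx as i64, value })
        // Filter pushdown and merge-on-read deletes both drop rows here.
        .filter(|row| predicate(row.value) && !deleted_positions.contains(&row.pos))
        .collect()
}
```

For a file containing `[10, 25, 7, 40, 33]`, the predicate `value > 20`, and a positional delete at position 3, this sketch returns rows with `_pos` 1 and 4, not 0 and 1.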

Willingness to contribute

I can contribute to this feature independently

Labels: enhancement
