Skip to content

Schema issue between Arrow and PyIceberg #8913

@asheeshgarg

Description

@asheeshgarg

Apache Iceberg version

1.4.1 (latest release)

Query engine

Other

Please describe the bug 🐞

@Fokko we have a table in iceberg which has some of the column names begin with numbers. We are able to scan the table using PyIceberg. When try to bind it to Arrow or DuckDB we see its Arrow invalid FieldRef.Name no match for field.

What we observe in in Arrow the field name beginning with number like 2030_ABC is renamed to _2030_ABC while the schema on iceberg is correct to define it as 2030_ABC which is in original data. Which trigger this issue.

Seem more of Arrow Bug happy to open it at Arrow project. Let me know

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions