fix: incorrect Parquet INT96 timestamp values from ArrowReader #2301
blackmwk merged 13 commits into apache:main
Conversation
Thanks for the feedback @emkornfield! You gave me a good idea on how I might be able to restructure this to address several of your comments. If you don't mind, I'll re-request a review probably later today.
Sounds good, please take comments with a grain of salt, I'm fairly new to the code base.
Rewrote using … This also eliminates the … Made …
```rust
impl ArrowSchemaVisitor for Int96CoercionVisitor<'_> {
    type T = Field;
```
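As a rough, self-contained illustration of the visitor pattern the snippet above implements (this is a hypothetical stand-in, not the crate's actual `ArrowSchemaVisitor` trait): a visitor walks the schema's fields and may rewrite each one.

```rust
// Hypothetical, simplified stand-in for the schema-visitor pattern used in
// this PR (not the crate's real API): walk a flat list of fields and let
// the visitor rewrite each one.
#[derive(Debug, PartialEq)]
enum FieldType {
    TimestampNs,
    TimestampUs,
    Other,
}

struct Field {
    name: String,
    ty: FieldType,
}

trait SchemaVisitor {
    fn visit_field(&mut self, field: Field) -> Field;
}

struct Int96Coercion;

impl SchemaVisitor for Int96Coercion {
    fn visit_field(&mut self, mut field: Field) -> Field {
        // Coerce nanosecond timestamps to microsecond; leave everything else.
        if field.ty == FieldType::TimestampNs {
            field.ty = FieldType::TimestampUs;
        }
        field
    }
}

fn visit_schema<V: SchemaVisitor>(fields: Vec<Field>, visitor: &mut V) -> Vec<Field> {
    fields.into_iter().map(|f| visitor.visit_field(f)).collect()
}
```

The real visitor additionally recurses into structs, lists, and maps; this sketch only shows the rewrite-on-visit shape.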
Most recent changes:
Thanks for the patience so far @blackmwk!
```rust
/// Files with embedded field IDs (branch 1 of schema resolution).
#[tokio::test]
async fn test_read_int96_timestamps_with_field_ids() {
```
As unit tests, I'd expect to see tests for the schema visitor only. I think it would be better to put these tests into the reader module?
Thanks for the review @blackmwk!

Moved integration tests from `int96.rs` to `reader.rs`
- The 5 tests that write Parquet files and read through `ArrowReader` now live in the `reader` test module where they belong.
- Converted `///` doc comments to `//` to match codebase test style.

Added 11 unit tests for the schema visitor in `int96.rs`
- `test_coerce_timestamp_ns_to_us` — `Timestamp` field ID → microsecond
- `test_coerce_timestamptz_ns_to_us` — `Timestamptz` field ID → microsecond, preserves timezone
- `test_no_coercion_when_iceberg_is_timestamp_ns` — `TimestampNs` → no change
- `test_no_coercion_when_iceberg_is_timestamptz_ns` — `TimestamptzNs` → no change
- `test_no_coercion_when_already_microsecond` — already microsecond → no change
- `test_defaults_to_us_without_field_ids` — no field ID metadata → falls back to microsecond (Iceberg Java behavior)
- `test_defaults_to_us_when_iceberg_type_is_not_timestamp` — field ID maps to a non-timestamp Iceberg type → falls back to microsecond
- `test_coerce_preserves_field_metadata` — field metadata survives coercion
- `test_coerce_timestamp_in_struct` — nested struct coercion
- `test_coerce_timestamp_in_list` — nested list coercion
- `test_coerce_timestamp_in_map_value` — nested map value coercion
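The rule the unit tests above exercise can be sketched with a small self-contained model. `IcebergTs` and `int96_target_unit` here are hypothetical stand-ins for illustration, not the crate's real types or functions.

```rust
// Self-contained sketch of the target-unit selection described above.
#[derive(Debug, PartialEq, Clone, Copy)]
enum TimeUnit {
    Microsecond,
    Nanosecond,
}

#[derive(Clone, Copy)]
enum IcebergTs {
    Timestamp,
    Timestamptz,
    TimestampNs,
    TimestamptzNs,
}

// Pick the Arrow time unit an INT96 column should be read at. `None` models
// the case where no field ID resolves (e.g. migrated files): fall back to
// microsecond, matching Iceberg Java's TimestampInt96Reader behavior.
fn int96_target_unit(iceberg_type: Option<IcebergTs>) -> TimeUnit {
    match iceberg_type {
        Some(IcebergTs::TimestampNs) | Some(IcebergTs::TimestamptzNs) => TimeUnit::Nanosecond,
        Some(IcebergTs::Timestamp) | Some(IcebergTs::Timestamptz) | None => TimeUnit::Microsecond,
    }
}
```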
# Conflicts:
#	crates/iceberg/src/arrow/reader.rs
blackmwk left a comment:
Thanks @mbutrovich for this PR!
## Which issue does this PR close?
- Closes #2299.

## What changes are included in this PR?
- Add `coerce_int96_timestamps()` to patch the Arrow schema before reading, using arrow-rs's schema hint mechanism (`ArrowReaderOptions::with_schema`) to read INT96 columns at the resolution specified by the Iceberg table schema
- `timestamp`/`timestamptz` → microsecond, `timestamp_ns`/`timestamptz_ns` → nanosecond, per the [Iceberg spec](https://iceberg.apache.org/spec/#primitive-types)
- Falls back to microsecond when no field ID is available (matching Iceberg Java's `TimestampInt96Reader` behavior)
- Applied after all three schema resolution branches (with field IDs, name mapping, positional fallback) so the fix covers both native and migrated tables
- Handles INT96 inside nested types (structs, lists, maps) via `ArrowSchemaVisitor` traversal
- Visitor and tests live in a standalone `arrow/int96.rs` module to keep `reader.rs` manageable
- Made `visit_schema` in `arrow/schema.rs` `pub(crate)` so the coercion visitor can reuse the existing traversal

## Are these changes tested?
- `test_read_int96_timestamps_with_field_ids` — files with embedded field IDs (branch 1)
- `test_read_int96_timestamps_without_field_ids` — migrated files without field IDs (branches 2/3)
- `test_read_int96_timestamps_in_struct` — INT96 inside a struct field
- `test_read_int96_timestamps_in_list` — INT96 inside a list field (3-level Parquet LIST encoding)
- `test_read_int96_timestamps_in_map` — INT96 as map values
- All tests use dates outside the i64 nanosecond range (~1677-2262) to confirm the overflow is avoided
- [Apache DataFusion Comet](https://github.com/apache/datafusion-comet) used the repro test in [apache/datafusion-comet#3856](apache/datafusion-comet#3856) and it passes with this change: apache/datafusion-comet#3857

(cherry picked from commit a2f067d)
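A back-of-the-envelope check of the overflow window the tests target: an i64 count of nanoseconds since the Unix epoch only reaches to roughly the year 2262, while i64 microseconds reach hundreds of millennia further. The function name below is illustrative, not part of the codebase.

```rust
// Approximate the latest representable year for an i64 epoch timestamp at a
// given resolution (nanos_per_unit = 1 for nanoseconds, 1000 for microseconds).
fn max_year_for_unit(nanos_per_unit: i64) -> i64 {
    // Units per (365-day) year: seconds per year times units per second.
    let units_per_year = 365 * 24 * 3600 * (1_000_000_000 / nanos_per_unit);
    1970 + i64::MAX / units_per_year
}
```

This is why reading INT96 values as nanoseconds overflows for the test dates, while microseconds do not.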