
[Rust] [Parquet] ArrowReader fails on some timestamp types #24454

@asfimport

Description

I discovered this bug with the following query:

> SELECT tpep_pickup_datetime FROM taxi LIMIT 1;
General("InvalidArgumentError(\"column types must match schema types, expected Timestamp(Microsecond, None) but found UInt64 at column index 0\")") 

The parquet reader detects this schema when reading from the file:

Schema { 
  fields: [
    Field { name: "tpep_pickup_datetime", data_type: Timestamp(Microsecond, None), nullable: true, dict_id: 0, dict_is_ordered: false }
  ], 
  metadata: {} 
} 

The array read from the file contains:

PrimitiveArray<UInt64>
[
  1567318008000000,
  1567319357000000,
  1567320092000000,
  1567321151000000,
  ...
]

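As a sanity check on the data itself, here is a standalone sketch (not arrow code) showing that the raw `u64` values are plausible epoch timestamps in microseconds: dividing by 1,000,000 yields Unix seconds in early September 2019, consistent with NYC taxi data. So the stored values are fine; only the reported Arrow data type (UInt64 instead of Timestamp(Microsecond, None)) is wrong.

```rust
fn main() {
    // Raw values as read from the file, interpreted as microseconds
    // since the Unix epoch.
    let raw_micros: [u64; 4] = [
        1_567_318_008_000_000,
        1_567_319_357_000_000,
        1_567_320_092_000_000,
        1_567_321_151_000_000,
    ];
    for micros in raw_micros {
        let secs = micros / 1_000_000;
        println!("{micros} us = {secs} s since the Unix epoch");
    }
    // 1_567_318_008 s falls on 2019-09-01 UTC, so the payload itself
    // is a valid microsecond timestamp.
    assert_eq!(raw_micros[0] / 1_000_000, 1_567_318_008);
}
```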
When the Parquet Arrow reader creates the record batch, the following validation logic fails:

for i in 0..columns.len() {
    if columns[i].len() != len {
        return Err(ArrowError::InvalidArgumentError(
            "all columns in a record batch must have the same length".to_string(),
        ));
    }
    if columns[i].data_type() != schema.field(i).data_type() {
        return Err(ArrowError::InvalidArgumentError(format!(
            "column types must match schema types, expected {:?} but found {:?} at column index {}",
            schema.field(i).data_type(),
            columns[i].data_type(),
            i)));
    }
}
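The failing type check can be modeled with a minimal, self-contained sketch. The `DataType` enum and `validate` function below are hypothetical stand-ins for the arrow crate's types; they only reproduce the comparison that rejects the batch:

```rust
// Hypothetical stand-in for arrow's DataType, reduced to the two
// variants involved in this report.
#[derive(Debug, PartialEq)]
enum DataType {
    UInt64,
    TimestampMicrosecond,
}

// Mirrors the schema-vs-column type comparison in the quoted
// validation loop.
fn validate(schema_types: &[DataType], column_types: &[DataType]) -> Result<(), String> {
    for (i, (expected, found)) in schema_types.iter().zip(column_types).enumerate() {
        if expected != found {
            return Err(format!(
                "column types must match schema types, expected {expected:?} \
                 but found {found:?} at column index {i}"
            ));
        }
    }
    Ok(())
}

fn main() {
    // The schema promises a timestamp, but the reader produced a
    // UInt64 array, so validation fails exactly as in the report.
    let err = validate(&[DataType::TimestampMicrosecond], &[DataType::UInt64]).unwrap_err();
    assert!(err.contains("column index 0"));
    println!("{err}");
}
```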
 

Reporter: Andy Grove / @andygrove
Assignee: Renjie Liu / @liurenjie1024

Note: This issue was originally created as ARROW-8258. Please see the migration documentation for further details.
