Right now, I can:

```r
ds <- open_dataset("some.parquet")
ds %>%
  mutate(
    o_orderdate = cast(o_orderdate, date32())
  ) %>%
  write_dataset(path = "new.parquet")
```

but I can't:

```r
tab <- read_parquet("some.parquet", as_data_frame = FALSE)
tab %>%
  mutate(
    o_orderdate = cast(o_orderdate, date32())
  ) %>%
  write_parquet("new.parquet")
```

In this case, I can cast the column as a separate command and then call write_parquet() afterwards, but it would be nice to be able to use write_parquet() in a pipeline.
This will require a libarrow addition: another version of WriteParquet that takes a RecordBatchReader instead of a fully-instantiated Table.
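Once such an overload exists, the R-level usage might look something like the sketch below. This is purely hypothetical: as_record_batch_reader() is used only to illustrate streaming the query, and write_parquet() accepting a reader is exactly the missing piece this issue asks for.

```r
# Hypothetical once WriteParquet can consume a stream of batches:
reader <- tab %>%
  mutate(o_orderdate = cast(o_orderdate, date32())) %>%
  as_record_batch_reader()

# Would write batch by batch without materializing the whole Table.
write_parquet(reader, "new.parquet")
```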
Reporter: Jonathan Keane / @jonkeane
Related issues:
- write_parquet() / write_csv_arrow() cannot stream a dataset object back to S3 (is duplicated by)
- [C++] Allow ParquetWriter to take a RecordBatchReader as input (depends upon)
Note: This issue was originally created as ARROW-14428. Please see the migration documentation for further details.