Skip to content

[R] String columns read lazily from readr error when transferred to an arrow table #43349

@jonkeane

Description

@jonkeane

Describe the bug, including details regarding any error messages, version, and platform.

On dev, here is a reprex:

arrow::as_arrow_table(
    readr::read_csv(readr::readr_example("chickens.csv"), lazy = TRUE), 
    sink = tempfile(fileext=".parquet")
)
#> Rows: 5 Columns: 4
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr (3): chicken, sex, motto
#> dbl (1): eggs_laid
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#> Error in Table__from_dots(dots, schema, option_use_threads()): No Set_elt found for ALTSTRING class [class: vroom_chr, pkg: vroom]

This wasn't a problem in our last release:

arrow::as_arrow_table(
    readr::read_csv(readr::readr_example("chickens.csv"), lazy = TRUE), 
    sink = tempfile(fileext=".parquet")
)
#> Rows: 5 Columns: 4
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr (3): chicken, sex, motto
#> dbl (1): eggs_laid
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#> Table
#> 5 rows x 4 columns
#> $chicken <string>
#> $sex <string>
#> $eggs_laid <double>
#> $motto <string>
#> 
#> See $metadata for additional Schema metadata

Component(s)

R

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions