Vectorized read using Arrow #319

Merged
danielcweeks merged 27 commits into apache:vectorized-read from prodeezy:vectorized-read
Jul 26, 2019

Conversation

@prodeezy
Contributor

@prodeezy prodeezy commented Jul 26, 2019

This is just a WIP POC right now, so it is not expected to be merged. It is up for comments only.

Things I intend to add:

  • a separate module, iceberg-arrow, that will house the code for reading Parquet into Arrow.
    Iceberg's Reader needs to choose row-wise or vectorized reading based on config
    (issue#311).
  • batch sizing as config (issue#312)
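
To make the two config items above concrete, here is a minimal sketch of how a reader might pick row-wise or vectorized reading and a batch size from table properties. Everything in it is an assumption for illustration: the class `ReadConfigSketch`, the property keys `read.vectorized.enabled` and `read.vectorized.batch-size`, and the default of 1024 are hypothetical and are not taken from this PR or from the Iceberg codebase.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch only: these names and property keys are NOT from Iceberg.
public class ReadConfigSketch {
  public enum ReadMode { ROW_WISE, VECTORIZED }

  // Assumed property keys, invented for this example.
  public static final String VECTORIZED_ENABLED = "read.vectorized.enabled";
  public static final String BATCH_SIZE = "read.vectorized.batch-size";
  public static final int DEFAULT_BATCH_SIZE = 1024; // assumed default

  // Row-wise reading stays the default unless vectorization is switched on.
  public static ReadMode readMode(Map<String, String> props) {
    boolean enabled = Boolean.parseBoolean(props.getOrDefault(VECTORIZED_ENABLED, "false"));
    return enabled ? ReadMode.VECTORIZED : ReadMode.ROW_WISE;
  }

  // Batch size (rows per Arrow batch) comes from config and must be positive.
  public static int batchSize(Map<String, String> props) {
    String raw = props.getOrDefault(BATCH_SIZE, String.valueOf(DEFAULT_BATCH_SIZE));
    int size = Integer.parseInt(raw);
    if (size <= 0) {
      throw new IllegalArgumentException("Batch size must be positive: " + size);
    }
    return size;
  }

  public static void main(String[] args) {
    Map<String, String> props = new HashMap<>();
    props.put(VECTORIZED_ENABLED, "true");
    props.put(BATCH_SIZE, "4096");
    System.out.println(readMode(props) + " with batch size " + batchSize(props));
  }
}
```

Keeping row-wise as the default in this sketch means existing readers behave the same until the vectorized path is explicitly enabled.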

/cc @danielcweeks @anjalinorwood @samarthjain

prodeezy and others added 27 commits June 4, 2019 17:48
…-2.0-SNAPSHOT` (snapshot prefix is required by snapshot repo)
@prodeezy
Contributor Author

prodeezy commented Jul 26, 2019

Tests fail, but the checkstyle checks pass.

@danielcweeks
Contributor

@prodeezy I'm going to go ahead and commit this now, which will hopefully unblock others so they can start making pull requests.

@danielcweeks danielcweeks merged commit 9578d06 into apache:vectorized-read Jul 26, 2019
@prodeezy prodeezy changed the title from "[WIP] Vectorized read using Arrow" to "Vectorized read using Arrow" Oct 29, 2020