ability to append partitions to existing arrow files #160
Also ref #105 which seems related.
Codecov Report
@@ Coverage Diff @@
## main #160 +/- ##
==========================================
+ Coverage 81.32% 81.94% +0.62%
==========================================
Files 25 26 +1
Lines 3015 3119 +104
==========================================
+ Hits 2452 2556 +104
Misses 563 563
57d13c6 to ce68fd0
This adds an `append` method for appending partitions to existing arrow files. The partitions to append are supplied as any [Tables.jl](https://github.com/JuliaData/Tables.jl)-compatible table. One record batch is written for each partition provided by `Tables.partitions(tbl)`. Each partition being appended must have the same `Tables.Schema` as the destination arrow file. The other parameters that `append` accepts are similar to those of `write`.
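
A minimal usage sketch of the description above, not taken from the PR itself. The file path and column data are hypothetical. It also assumes the behavior documented for released versions of Arrow.jl, where the destination must have been written in the IPC stream format (`file = false`); the description here does not state that requirement.

```julia
using Arrow, Tables

# Hypothetical destination path, used only for illustration.
path = joinpath(mktempdir(), "example.arrow")

# Write the initial data. Assumption: append targets the IPC stream
# format, so the file is written with `file = false`.
Arrow.write(path, (a = [1, 2, 3], b = ["x", "y", "z"]); file = false)

# Two partitions with the same schema as the destination file.
# `Tables.partitioner` wraps an iterable of tables so that
# `Tables.partitions` yields each element as its own partition.
more = Tables.partitioner([
    (a = [4, 5], b = ["u", "v"]),
    (a = [6], b = ["w"]),
])

# Append both partitions to the existing file.
Arrow.append(path, more)

# Read the file back; the appended rows follow the original ones.
tbl = Arrow.Table(path)
```

Passing a partition whose column names or types differ from the destination should be rejected, since the schema must match.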
Hi @quinnj, does this look okay, or is anything more needed here?
Sorry, I've been trying to catch up on a bunch of stuff since coming back from vacation; planning on reviewing this more in-depth in the next 24 hours.
store few additional stream properties in the `Stream` data type and avoid duplicating code for append functionality
@quinnj I have now added commits to allow appends to
bump!
quinnj
left a comment
Thanks for all the work on this @tanmaykm!