Codecov Report
@@            Coverage Diff             @@
##           master    #3608      +/-   ##
==========================================
+ Coverage   98.24%   98.25%   +0.01%
==========================================
  Files          67       69       +2
  Lines       12972    13088     +116
==========================================
+ Hits        12744    12860     +116
  Misses        228      228
jangorecki
left a comment
Some initial comments.
Add some Rraw with examples, or put them in comments next to the R code, so it is easier to reproduce.
We can think about what we could parallelize here... we could eventually run all columns passed to ... in parallel.
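A rough sketch of what running the columns in parallel could look like at the C level. This is only an illustration: `count_na_columns` and `count_na_integer` are hypothetical names, counting NAs is a stand-in for whatever per-column work the PR actually does, and `NA_INT` is redefined locally (R stores NA_integer_ as INT_MIN) so the snippet builds without R headers. The OpenMP pragma is guarded, so the loop simply runs serially when OpenMP is not enabled.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>

/* R represents NA_integer_ as INT_MIN; redefined here so the sketch needs no R headers. */
#define NA_INT INT_MIN

/* Hypothetical per-column work: count the NA entries of one integer column. */
static size_t count_na_integer(const int *col, size_t n) {
    size_t k = 0;
    for (size_t i = 0; i < n; i++) {
        k += (col[i] == NA_INT);
    }
    return k;
}

/* Each column is processed independently and writes only to out[j],
 * so the loop over columns can be split across threads. */
void count_na_columns(const int *const *cols, const size_t *lens,
                      int ncol, size_t *out) {
    assert(ncol >= 0);
#ifdef _OPENMP
    #pragma omp parallel for schedule(dynamic)
#endif
    for (int j = 0; j < ncol; j++) {
        out[j] = count_na_integer(cols[j], lens[j]);
    }
}
```

Parallelizing over columns only pays off when there are several columns of comparable size; with a single long column the alternative would be to split the rows instead.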
I will push some helpers to a new branch, which are likely to be useful here.
#3609
cc(F)
x = sample(1e8)
any_na_integer(x)
cc(F)
x = list(sample(1e8), c(sample(1e8-1), NA), sample(1e8))
list_any_na_integer(x)
anyNA would be best to incorporate into the core loop; update: #3608 (comment)
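To illustrate what incorporating the NA check into the core loop could mean, here is a minimal C sketch. It is not the PR's code: `fill_na_single_pass` is a hypothetical function whose "core work" (filling NA entries of `x` from `y`) is chosen only for illustration, and `NA_INT` is redefined locally as INT_MIN, which is how R represents NA_integer_. The point is that the "any NA left?" answer falls out of the same pass that does the work, instead of a separate scan over the vector afterwards.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>
#include <stddef.h>

/* R represents NA_integer_ as INT_MIN; redefined here so the sketch needs no R headers. */
#define NA_INT INT_MIN

/* Hypothetical core loop: fill NA entries of x from y and, in the same pass,
 * record whether any NA remains in x. Returns true if x still contains NA. */
bool fill_na_single_pass(int *x, const int *y, size_t n) {
    assert(x != NULL && y != NULL);
    bool any_na = false;
    for (size_t i = 0; i < n; i++) {
        if (x[i] == NA_INT) {
            x[i] = y[i];              /* the actual work of the loop */
        }
        any_na |= (x[i] == NA_INT);   /* NA tracking folded into the same pass */
    }
    return any_na;
}
```

A caller can use the returned flag to decide whether another fill step is needed at all, without re-reading `x`.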
jangorecki
left a comment
some more detailed feedback
I reduced number of passes over
Should long vectors be supported?
@HughParsonage in the long run they should but I was not able to code |
some benchmarking of Matt's approach (strictly single pass handling everything) vs single pass |
Closes #3424
Timings using script https://gist.github.com/MichaelChirico/3ed5276838e9114f89a607594e101ab2
logical column:
integer column:
numeric column:
character column: