data.table could collect more statistics about data while processing. This allows potential optimizations, not limited to internal data.table code. Users can use them to speed up their code and design more data-driven functions.
List of measures to collect:
- `haskey(x)`
- `!is.null(idx<-attr(attr(x, "index"), idx_name))`
- `anyNA`
- `length(attr(idx, "starts"))`
- `attr(idx, "maxgrpn")`
- `attr(idx, "maxgrpn")==1L`
- `x[c(idx[1L], idx[length(idx)])]`
- `{{hasna}} && length(attr(idx, "starts"))==1L`

optionally, as I don't see obvious optimizations coming from those:
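To make the required measures listed earlier (not the optional ones) more concrete, here is a rough base R sketch of how each could be derived from a single ordering pass over one column. It does not use data.table internals: `collect_stats()` is a hypothetical helper, `starts` and `maxgrpn` only mirror the attribute names used in the expressions, and the measure names in the returned list are my reading of what each expression computes.

```r
# Hypothetical helper, NOT part of data.table: derives the listed measures
# for one vector from a single ordering pass, using base R only.
collect_stats <- function(x) {
  n <- length(x)
  idx <- order(x, na.last = FALSE)   # ordering permutation, NAs placed first
  sorted <- x[idx]

  # TRUE where two values are equal, treating NA as equal to NA
  same <- function(a, b) (is.na(a) & is.na(b)) | (!is.na(a) & !is.na(b) & a == b)
  is_start <- if (n == 0L) logical(0) else c(TRUE, !same(sorted[-1L], sorted[-n]))

  starts  <- which(is_start)                    # first position of each group
  sizes   <- diff(c(starts, n + 1L))            # size of each group
  maxgrpn <- if (length(sizes)) max(sizes) else 0L
  has_na  <- anyNA(x)

  # Names below are assumed labels for the expressions in the list
  list(
    is_sorted      = identical(idx, seq_len(n)),       # already in order
    has_na         = has_na,
    n_unique       = length(starts),                   # NA counts as one value
    max_group_size = maxgrpn,
    all_unique     = maxgrpn == 1L,
    range          = x[c(idx[1L], idx[length(idx)])],  # first and last in order
    all_na         = has_na && length(starts) == 1L
  )
}

stats <- collect_stats(c(3L, NA, 1L, 3L, 2L))
str(stats)
```

Because NAs sort first in this sketch, the first element of `range` is `NA` whenever the column contains missing values, so a real implementation would probably need to combine it with the NA measure.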