Upgrade from data.frame to data.table syntax wherever possible. I think I've already found the areas where this was creating performance issues, but it would be good to do elsewhere for the sake of consistency. Should pay particular attention to the underlying stats and utils, etc.
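
For reference, a minimal sketch of the kind of conversion meant here. The data and column names (`group`, `value`, `value_scaled`) are hypothetical and not taken from the codebase; the real call sites will differ.

```r
library(data.table)

# Hypothetical example data; column names are illustrative only.
df <- data.frame(group = c("a", "a", "b", "b"), value = c(1, 2, 3, 4))

# data.frame style: subset first, then aggregate by group.
df_sub <- df[df$value > 1, ]
df_means <- aggregate(value ~ group, data = df_sub, FUN = mean)

# data.table style: filter (i), compute (j), and group (by) in one call.
dt <- as.data.table(df)
dt_means <- dt[value > 1, .(value = mean(value)), by = group]

# Add a column by reference with := instead of copying the whole table.
dt[, value_scaled := value / max(value)]
```

One thing to watch when converting: `:=` modifies the table in place, so any function that previously relied on data.frame copy semantics may need an explicit `copy()` first.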