I have several thousand .gz files containing data in CSV format, about 60 GB in total in compressed form. Having to decompress them first and then load some pieces via fread is a huge pain. I'm wondering whether fread could be improved so that it can read compressed file formats directly, just as read.table does?
The file connection issues mentioned in #341, #543, and #561 are probably closely related.
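For reference, here is a minimal sketch of the read.table-style behaviour I mean, using only base R. The temporary file and its two columns (id, value) are made up for illustration; this only shows the connection-based approach that already works with read.csv, not how fread would or should implement it.

```r
# Write a small gzip-compressed CSV to a temporary file (illustrative data only).
tmp <- tempfile(fileext = ".csv.gz")
con <- gzfile(tmp, open = "wt")
write.csv(data.frame(id = 1:3, value = c(2.5, 3.5, 4.5)), con, row.names = FALSE)
close(con)

# read.csv (a read.table wrapper) accepts a connection, so gzfile() lets it
# decompress on the fly without creating an unpacked copy on disk.
df <- read.csv(gzfile(tmp))

# Clean up the temporary file.
unlink(tmp)
```

This works, but it goes through read.table's parser, which is what makes it slow on large inputs; the request is for fread to accept the same kind of input.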
Some other references:
http://stackoverflow.com/questions/5764499/decompress-gz-file-using-r
http://blog.revolutionanalytics.com/2009/12/r-tip-save-time-and-space-by-compressing-data-files.html