Bugs (critical)
- … `nrows` argument is too small relative to the size of the file (fread fails when max_nrows argument is too small #2243)
- … `skip` with files whose size is a multiple of 4096 (fread crashes when using option skip with some data files #2201)
- … `%` character ([fread] Error message is not displayed when a field ends with percent sign #2287)
- … `fill` is false (Segfault with fread #2299)

Bugs (core)
- If `na.strings` is a number, it may fail to work properly (na.strings may fail to convert numbers into NAs in dev #2222)
- … `fill=True` (fread fails on file with inconsistent # columns #2267)
- … `nrows` argument is used (fread warning on "stopped reading" should interact with nrows #1671)
- … `skip` and `nrows` ([fread] broken functionality in 1.9.6 #1267)
- `select` should advise using `drop` if receiving negative indices (Negative select in fread #2423)
- … `NUL`s (fread error reading Latin-1 file containing NUL byte <0x00> #2435, is it possible to ignore error and coninue reading from file #2496)
- … `select` (Error in fread when selecting a subset of columns, and one of the fields contains a sep #2464)
- `strip.white=FALSE` is not respected (fread's skip.white=FALSE behavior #2492)
- `\r` newlines are squashed in a single-column file (\r newlines are squashed in a single-column file #2542)
- `sep` may sometimes be ignored (fread fill=true and sep= provided could still read as 1-column #2666)

Bugs (R)
- `select = NULL` causes an error (fread error with explicit 'select = NULL' #2007)
- `integer64` parameter does not work out-of-sample (in fread integer64 argument is ignored if int64 type-bump occurs out-of-sample #2749, fread(integer64 = "double") not working for some data #2607)
- `colClasses="logical"` may produce a spurious error (colClasses=logical is no longer working #2766)

Feature requests (core)
- `fread` should distinguish between empty string and n/a string (fread/fwrite default options for NA mismatch #2281, fread should distinguish between quoted and unquoted empty strings #2217, Irreversible empty string handling by fread() and fwrite() #2214, fread reads in empty fields as logical NA #1159)
- … `write.table`) (fread row.names option #1818, fread does not handle row.names produced by write.table #763)
- … `1.#INF`, `1.#IND`, `1.#QNAN`, and `1.#SNAN` (Handle 1.#INF, 1.#IND, 1.#QNAN and 1.#SNAN #1800)
- allow fread to process input file in chunks, i.e. add ability to stop/resume scanning (Chunked fread #1721)
- allow `nrow` argument to be negative, meaning skip that many lines from the end (fread's nrow argument could accept -ve values to skip last 'n' rows #1643)
- … (`fread` to support thousand separator #1636, [Request] Strip commas from columns declared numeric in fread #1723)
- allow reading part of a file >4Gb on a 32-bit machine ([Request] With fread, do not throw 32 bit large file support error if nrows is specified #949)
- Using `nrows=` could be parallel again by just issuing a (rare) warning if any out-of-sample type bumps occurred, since they could occur after the nrow'th row, as mentioned in PR Better skip= and nrow= #2623. This warning would be acceptable and would avoid extra code complexity and memory usage.

Feature requests (R)
- If `select` is a list, then `colClasses` should accept a list of same length (FR for `fread`: if `select` is used, `colClasses` need only correspond to the columns in `select` #1426)
- allow `stringsAsFactors` parameter to be a fraction between 0 and 1 (fread stringsAsFactors=0.2 #2025)
- … `select`/`col.names`/`colClasses` ([Request] Ability to use OR statements within fread #2185, select via pattern in fread #2066, [R-Forge #5323] fread: colClasses does not convert to non-builtin types #491)
- … `sep2` (when will sep2 in fread be implemented? #1162)
- … `.gz` files directly (Support .gz file format for fread #717)
- `drop` could drop all duplicated column names (e.g. many `"filler"` or `""` column names) (Improve how fread's drop/select deal with duplicate column names #1899)

Miscellaneous