-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-16000: [C++][Python] Dataset: Added transcoding function option to CSV scanner #13709
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from all commits
Commits
Show all changes
27 commits
Select commit
Hold shift + click to select a range
5d7c2bd
Added field to CsvFragmentScanOptions that holds an optional transfor…
joosthooz a82239d
WIP wrapping a trancoder around all input streams of a dataset
joosthooz f67590c
Added input stream wrapping to CsvFileFormat::CountRows too
joosthooz a717ddc
Moved make_streamwrap_func into io.pxi, removed duplicated code
joosthooz 9284eb6
Use UpperCamelCase in function name
joosthooz b2061bf
Additional name change, formatting fix
joosthooz bbdaa07
Use GetReadOptions(), because it overrides the use_threads field
joosthooz c959112
Only add a transcoder for csv files
joosthooz 7fceb84
Processed some review comments regarding documentation
joosthooz 401f67d
Moved encoding parameter from dataset() into CsvFileFormat
joosthooz 9816e46
Removed a left-over occurrence of the workaround encoding parameter
joosthooz 888b3e8
Setting default encoding to utf8
joosthooz 0f67e8c
Always creating a default_fragment_scan_options
joosthooz 47c536d
Using a different way of checking the file format
joosthooz a539dd0
Added documentation about added encoding parameter
joosthooz 22eff73
Implementaed an alternative way of passing the encoding.
joosthooz 4d9802b
Added python-specific encoding field to CsvFileformat.
joosthooz 4d819aa
Removed encoding from CsvFragmentScanOptions
joosthooz 1534bd1
Added transcoding functionality by dlopening libiconv
joosthooz d2c9a06
Removed encoding library wrapper code, now returning error if no tran…
joosthooz a82a32a
Changed default for csv encoding to a constant
joosthooz 21f1202
Generating an error in C++ when the encoding is not UTF-8
joosthooz e06f03f
In python, setting the encoding back to utf8 after creating a transco…
joosthooz 24b1cc5
'UTF-8' -> 'utf8' to make it consistent with python
joosthooz c2c4b22
Disabled ReadOptions encoding validation
joosthooz 30124c2
Formatting
joosthooz 9a5bf20
Moved the creation of the stream wrapper function into the readoption…
joosthooz File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.