Skip to content

Conversation

@hubgeter
Copy link
Contributor

bp #39926

Proposed changes

Since Doris does not have an unsigned type, we convert parquet uint32 type to doris bigint (int64) type.
When reading the parquet file, the byte size stored in parquet and the byte size of the data type mapped by doris are inconsistent, resulting in be core.
Fix:
When reading, we read according to the byte size stored in parquet, and then convert it to the data type mapped by doris.

Mapping relationship description:
parquet -> doris
UInt8 -> Int16
UInt16 -> Int32
UInt32 -> Int64
UInt64 -> Int128.

Proposed changes

Issue Number: close #xxx

…es. (apache#39926)

## Proposed changes
Since Doris does not have an unsigned type, we convert parquet uint32
type to doris bigint (int64) type.
When reading the parquet file, the byte size stored in parquet and the
byte size of the data type mapped by doris are inconsistent, resulting
in be core.
Fix:
When reading, we read according to the byte size stored in parquet, and
then convert it to the data type mapped by doris.

Mapping relationship description:
parquet -> doris  
UInt8 -> Int16
UInt16 -> Int32
UInt32 -> Int64
UInt64 -> Int128.
@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@hubgeter
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@morningman morningman merged commit a7156ee into apache:branch-2.1 Aug 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants