branch-3.0: [Fix](Serde) Support hive compatible output format #49036 #49830
run buildall
TeamCity cloud ut coverage result:
TPC-H: Total hot run time: 40399 ms
TPC-DS: Total hot run time: 191061 ms
ClickBench: Total hot run time: 32.01 s
Problem Summary:
The output format of complex data types such as array, map, and struct
differs between Hive and Doris.
Users migrating from Hive to Doris expect the same format so
that they don't need to modify their business code.
This PR mainly changes:
Add a new option to the session variable `serde_dialect`: if it is set to `hive`,
the output format returned to the MySQL client changes for some data
types:
Array
Doris: ["abc", "def", "", null, 1]
Hive: ["abc","def","",null,true]
Map
Doris: {"k1":null, "k2":"v3"}
Hive: {"k1":null,"k2":"v3"}
Struct
Doris: {"s_id":100, "s_name":"abc , "", "s_address":null}
Hive: {"s_id":100,"s_name":"abc ,"","s_address":null}
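The difference shown in the examples above can be sketched in a few lines of Python. This is only an illustration inferred from those examples, not the Doris implementation: the function name `render` and its `dialect` parameter are hypothetical, and the handling of `false` (as `0` in Doris and `false` in Hive) is assumed by symmetry with the `1` / `true` pair shown above.

```python
def render(value, dialect="doris"):
    """Render a nested value in the style of the PR's examples (illustrative only)."""
    if dialect not in ("doris", "hive"):
        raise ValueError(f"unknown dialect: {dialect}")
    # Per the examples: Doris separates elements with ", ", Hive with ",".
    sep = "," if dialect == "hive" else ", "
    if value is None:
        return "null"
    # bool must be checked before the numeric fallback, since bool is an int subclass.
    if isinstance(value, bool):
        if dialect == "hive":
            return "true" if value else "false"  # "false" is an assumption
        return "1" if value else "0"  # "0" is an assumption
    if isinstance(value, str):
        # The examples show strings wrapped in double quotes without escaping.
        return '"' + value + '"'
    if isinstance(value, (list, tuple)):
        return "[" + sep.join(render(v, dialect) for v in value) + "]"
    if isinstance(value, dict):
        # Used for both map and struct; key and value are joined by ":" in both dialects.
        pairs = (f"{render(k, dialect)}:{render(v, dialect)}" for k, v in value.items())
        return "{" + sep.join(pairs) + "}"
    return str(value)


array_value = ["abc", "def", "", None, True]
map_value = {"k1": None, "k2": "v3"}

print(render(array_value, "doris"))  # ["abc", "def", "", null, 1]
print(render(array_value, "hive"))   # ["abc","def","",null,true]
print(render(map_value, "doris"))    # {"k1":null, "k2":"v3"}
print(render(map_value, "hive"))     # {"k1":null,"k2":"v3"}
```

The only differences the sketch models are the element separator and the boolean rendering; everything else is identical between the two dialects in the examples given.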
Related #37039
Force-pushed from b5d4c6b to dab8723
run buildall
TeamCity cloud ut coverage result:
TPC-H: Total hot run time: 40691 ms
BE UT Coverage Report: Increment line coverage (Increment coverage report)
TPC-DS: Total hot run time: 197477 ms
ClickBench: Total hot run time: 32.92 s
Cherry-picked from #49036