@BePPPower BePPPower commented Jun 21, 2024

Proposed changes

Issue Number: close #xxx

Previously, Doris ignored empty rows in CSV files.
This PR adds a session variable that controls whether empty rows in CSV files are read as NULL values.
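
The intended behavior can be illustrated with a minimal sketch. This is not the Doris reader. The function `read_csv_rows` and the flag `empty_line_as_null` are hypothetical names chosen for illustration, since the description above does not name the session variable. The sketch also assumes that "read as NULL values" means one row in which every column is NULL.

```python
from typing import List, Optional

Row = List[Optional[str]]


def read_csv_rows(text: str, num_columns: int,
                  empty_line_as_null: bool = False,
                  sep: str = ",") -> List[Row]:
    """Split CSV text into rows (naive split, no quoting support).

    empty_line_as_null is a hypothetical stand-in for the session variable:
    - False: empty lines are skipped (the previous behavior).
    - True:  each empty line becomes a row of num_columns NULLs (assumed).
    """
    rows: List[Row] = []
    for line in text.splitlines():
        if line == "":
            if empty_line_as_null:
                rows.append([None] * num_columns)
            continue
        rows.append(line.split(sep))
    return rows


sample = "1,alice\n\n2,bob\n"
print(read_csv_rows(sample, 2))                           # empty line skipped
print(read_csv_rows(sample, 2, empty_line_as_null=True))  # empty line kept as NULLs
```

Defaulting the flag to off keeps the earlier skip-empty-lines behavior unless a session opts in. That default is an assumption of this sketch.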

@doris-robot

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the documentation has been moved to doris-website.
See the Doris documentation.

@github-actions
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@BePPPower
Contributor Author

run buildall

@BePPPower BePPPower force-pushed the fixReadCsvEmptyLine branch from 4d07e5e to fe4fe65 Compare June 21, 2024 09:18

@doris-robot

TeamCity be ut coverage result:
Function Coverage: 36.46% (9006/24701)
Line Coverage: 28.01% (73867/263759)
Region Coverage: 27.48% (38367/139610)
Branch Coverage: 24.18% (19556/80876)
Coverage Report: http://coverage.selectdb-in.cc/coverage/fe4fe65c2b10e5a0f4e92179f7360d067744c3d1_fe4fe65c2b10e5a0f4e92179f7360d067744c3d1/report/index.html

@BePPPower BePPPower force-pushed the fixReadCsvEmptyLine branch from 7a12a72 to dc1077b Compare June 24, 2024 05:06

@doris-robot

TPC-H: Total hot run time: 40895 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit dfe28b37b65d2ed7243e308a6457d014cf23aeec, data reload: false

------ Round 1 ----------------------------------
q1	18285	4727	4446	4446
q2	2593	196	182	182
q3	12193	1181	1190	1181
q4	10619	764	932	764
q5	7524	2778	2714	2714
q6	224	138	140	138
q7	959	615	619	615
q8	9219	2091	2086	2086
q9	9078	6532	6440	6440
q10	8936	3707	3738	3707
q11	456	244	242	242
q12	422	240	237	237
q13	17751	2977	3011	2977
q14	267	228	218	218
q15	528	493	492	492
q16	492	379	372	372
q17	978	713	703	703
q18	8103	7464	7372	7372
q19	7674	1463	1589	1463
q20	668	320	325	320
q21	4999	3876	4059	3876
q22	413	350	350	350
Total cold run time: 122381 ms
Total hot run time: 40895 ms
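
The two Round 1 totals can be reproduced from the per-query rows. The sketch below assumes the unlabeled columns are: query, cold run, first hot run, second hot run, and the faster of the two hot runs. That reading is inferred from the numbers, not stated in the report. The helper `totals` is a hypothetical name.

```python
# Round 1 rows: query, cold run (ms), hot run 1 (ms), hot run 2 (ms).
# The report's fourth timing column is omitted; it is recomputed as min(hot1, hot2).
ROUND_1 = """\
q1 18285 4727 4446
q2 2593 196 182
q3 12193 1181 1190
q4 10619 764 932
q5 7524 2778 2714
q6 224 138 140
q7 959 615 619
q8 9219 2091 2086
q9 9078 6532 6440
q10 8936 3707 3738
q11 456 244 242
q12 422 240 237
q13 17751 2977 3011
q14 267 228 218
q15 528 493 492
q16 492 379 372
q17 978 713 703
q18 8103 7464 7372
q19 7674 1463 1589
q20 668 320 325
q21 4999 3876 4059
q22 413 350 350
"""


def totals(table: str) -> tuple:
    """Return (total cold ms, total hot ms) for a whitespace-separated table."""
    cold_total = 0
    hot_total = 0
    for line in table.strip().splitlines():
        _name, cold, hot1, hot2 = line.split()
        cold_total += int(cold)
        # Assumed rule: the hot time for a query is its faster hot run.
        hot_total += min(int(hot1), int(hot2))
    return cold_total, hot_total


print(totals(ROUND_1))  # (122381, 40895)
```

Under that assumption the result matches the reported cold total of 122381 ms and hot total of 40895 ms.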

----- Round 2, with runtime_filter_mode=off -----
q1	4366	4257	4207	4207
q2	384	280	277	277
q3	2955	2783	2744	2744
q4	1917	1617	1615	1615
q5	5290	5338	5304	5304
q6	215	130	131	130
q7	2170	1730	1694	1694
q8	3209	3340	3349	3340
q9	8318	8340	8338	8338
q10	3857	3694	3573	3573
q11	570	499	484	484
q12	770	607	601	601
q13	17418	2952	3013	2952
q14	286	260	266	260
q15	523	470	483	470
q16	500	425	407	407
q17	1778	1499	1471	1471
q18	7595	7501	7383	7383
q19	1710	1464	1547	1464
q20	1997	1797	1769	1769
q21	4853	4655	4844	4655
q22	640	563	539	539
Total cold run time: 71321 ms
Total hot run time: 53677 ms

@doris-robot

TPC-DS: Total hot run time: 173315 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit dfe28b37b65d2ed7243e308a6457d014cf23aeec, data reload: false

query1	908	383	394	383
query2	6456	2316	2336	2316
query3	6646	210	215	210
query4	19215	17317	17228	17228
query5	4166	477	477	477
query6	262	166	167	166
query7	4610	296	293	293
query8	326	298	287	287
query9	8677	2356	2341	2341
query10	584	304	276	276
query11	10391	10044	9865	9865
query12	126	84	90	84
query13	1624	366	369	366
query14	9352	7632	7889	7632
query15	252	193	189	189
query16	7818	267	263	263
query17	1913	547	521	521
query18	1481	278	265	265
query19	199	153	155	153
query20	90	86	87	86
query21	208	128	127	127
query22	4362	4054	4097	4054
query23	33726	32859	32914	32859
query24	11863	2927	2891	2891
query25	646	348	357	348
query26	1609	153	151	151
query27	2765	313	322	313
query28	7019	2031	2018	2018
query29	946	643	604	604
query30	288	151	147	147
query31	960	733	734	733
query32	94	55	59	55
query33	758	278	281	278
query34	948	465	484	465
query35	730	628	634	628
query36	1082	910	916	910
query37	168	70	72	70
query38	2884	2749	2743	2743
query39	838	810	811	810
query40	278	130	122	122
query41	57	51	51	51
query42	117	95	101	95
query43	578	552	559	552
query44	1152	734	735	734
query45	202	161	163	161
query46	1082	727	710	710
query47	1871	1756	1755	1755
query48	363	309	304	304
query49	1165	402	400	400
query50	765	383	388	383
query51	6964	6829	6780	6780
query52	111	93	92	92
query53	356	296	288	288
query54	938	459	431	431
query55	77	75	75	75
query56	279	258	254	254
query57	1144	1052	1056	1052
query58	262	238	255	238
query59	3404	3010	3042	3010
query60	289	274	297	274
query61	89	91	94	91
query62	646	445	457	445
query63	320	292	296	292
query64	9871	2250	1729	1729
query65	3197	3151	3119	3119
query66	1245	365	329	329
query67	15378	14973	14978	14973
query68	4581	530	543	530
query69	521	430	329	329
query70	1193	1176	1154	1154
query71	403	270	275	270
query72	7655	5537	5549	5537
query73	757	320	326	320
query74	5856	5528	5404	5404
query75	3360	2668	2698	2668
query76	2780	996	898	898
query77	444	312	294	294
query78	10349	9844	9777	9777
query79	1987	515	522	515
query80	833	473	483	473
query81	589	217	223	217
query82	967	105	100	100
query83	263	169	167	167
query84	233	86	93	86
query85	1224	271	323	271
query86	436	319	330	319
query87	3369	3117	3088	3088
query88	3425	2460	2438	2438
query89	473	399	397	397
query90	1822	192	188	188
query91	125	101	102	101
query92	71	49	50	49
query93	1717	509	503	503
query94	1247	195	184	184
query95	408	312	306	306
query96	601	277	269	269
query97	3204	3082	3093	3082
query98	212	201	190	190
query99	1265	847	855	847
Total cold run time: 271560 ms
Total hot run time: 173315 ms

@doris-robot

ClickBench: Total hot run time: 31.73 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit dfe28b37b65d2ed7243e308a6457d014cf23aeec, data reload: false

query1	0.04	0.03	0.03
query2	0.09	0.04	0.04
query3	0.22	0.05	0.06
query4	1.69	0.08	0.08
query5	0.50	0.49	0.48
query6	1.14	0.73	0.72
query7	0.02	0.02	0.01
query8	0.05	0.04	0.04
query9	0.57	0.49	0.50
query10	0.54	0.55	0.55
query11	0.15	0.11	0.11
query12	0.14	0.11	0.12
query13	0.58	0.58	0.58
query14	0.76	0.80	0.80
query15	0.85	0.80	0.81
query16	0.36	0.36	0.37
query17	1.04	1.01	1.02
query18	0.21	0.24	0.26
query19	1.84	1.78	1.77
query20	0.01	0.01	0.01
query21	15.41	0.65	0.65
query22	3.49	6.72	2.88
query23	18.29	1.32	1.30
query24	2.15	0.23	0.23
query25	0.15	0.09	0.08
query26	0.26	0.17	0.17
query27	0.07	0.07	0.08
query28	13.17	1.02	0.99
query29	12.65	3.36	3.38
query30	0.26	0.06	0.06
query31	2.86	0.40	0.39
query32	3.25	0.47	0.47
query33	2.88	2.97	2.91
query34	17.02	4.38	4.46
query35	4.44	4.48	4.44
query36	0.66	0.48	0.46
query37	0.20	0.16	0.15
query38	0.16	0.16	0.15
query39	0.05	0.04	0.04
query40	0.16	0.14	0.13
query41	0.10	0.05	0.05
query42	0.06	0.06	0.05
query43	0.05	0.04	0.04
Total cold run time: 108.59 s
Total hot run time: 31.73 s

@AshinGau
Member

LGTM

@github-actions
Contributor

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added the approved label Jun 25, 2024
@github-actions
Contributor

PR approved by anyone and no changes requested.

@morningman morningman merged commit 7287fdd into apache:master Jun 25, 2024
dataroaring pushed a commit that referenced this pull request Jun 26, 2024
…s in CSV files are read as NULL values. (#36668)

BePPPower added a commit to BePPPower/doris that referenced this pull request Jul 2, 2024
…s in CSV files are read as NULL values. (apache#36668)

morningman pushed a commit that referenced this pull request Jul 2, 2024
morningman pushed a commit to morningman/doris that referenced this pull request Jul 9, 2024

Labels

approved (Indicates a PR has been approved by one committer.), dev/2.1.5-merged, dev/3.0.0-merged, meta-change, reviewed
