Skip to content

Conversation

@dataroaring
Copy link
Contributor

@dataroaring dataroaring commented Jun 27, 2024

When a lot of tablets fail when loading, then detailed information would cause oom.

pick #36873

Proposed changes

Issue Number: close #xxx

When a lot of tablets fail when loading, then detailed information would cause oom.
@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@dataroaring
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 49834 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit fd607546372ebe1fe651f78c43b875fedd99277e, data reload: false

------ Round 1 ----------------------------------
q1	17694	4378	4329	4329
q2	2072	157	145	145
q3	10419	1901	1894	1894
q4	10363	1261	1341	1261
q5	8721	3878	3890	3878
q6	232	145	124	124
q7	2058	1620	1596	1596
q8	9283	2723	2700	2700
q9	10604	10343	10338	10338
q10	8626	3528	3527	3527
q11	424	253	240	240
q12	470	298	303	298
q13	18361	3940	4030	3940
q14	347	335	336	335
q15	504	455	461	455
q16	688	573	565	565
q17	1130	985	933	933
q18	7319	6929	6860	6860
q19	1777	1678	1676	1676
q20	543	303	287	287
q21	4454	4074	4016	4016
q22	530	446	437	437
Total cold run time: 116619 ms
Total hot run time: 49834 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4330	4285	4311	4285
q2	322	223	224	223
q3	4142	4140	4129	4129
q4	2763	2745	2750	2745
q5	7141	7077	7049	7049
q6	237	120	120	120
q7	3182	2845	2846	2845
q8	4359	4476	4490	4476
q9	16916	16791	16871	16791
q10	4242	4274	4288	4274
q11	756	691	664	664
q12	1030	831	872	831
q13	6818	3734	3709	3709
q14	445	421	424	421
q15	513	454	460	454
q16	741	694	673	673
q17	3849	3860	3937	3860
q18	8724	8790	8701	8701
q19	1726	1702	1619	1619
q20	2390	2082	2096	2082
q21	8450	8463	8501	8463
q22	1034	1005	978	978
Total cold run time: 84110 ms
Total hot run time: 79392 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 203901 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit fd607546372ebe1fe651f78c43b875fedd99277e, data reload: false

query1	941	421	379	379
query2	6551	2631	2723	2631
query3	6925	208	202	202
query4	21093	17982	17829	17829
query5	19731	6494	6546	6494
query6	287	213	226	213
query7	4160	298	311	298
query8	429	465	402	402
query9	3155	2697	2650	2650
query10	417	305	291	291
query11	11355	10728	10677	10677
query12	127	78	73	73
query13	5597	688	695	688
query14	17580	13612	13680	13612
query15	360	244	239	239
query16	6477	282	267	267
query17	1705	1432	869	869
query18	2346	413	408	408
query19	221	152	147	147
query20	78	78	84	78
query21	189	99	93	93
query22	5217	5062	5053	5053
query23	32595	31816	32041	31816
query24	6966	6485	6478	6478
query25	515	436	428	428
query26	513	159	159	159
query27	1795	294	293	293
query28	6150	2359	2333	2333
query29	2910	2670	2795	2670
query30	242	162	168	162
query31	902	732	737	732
query32	71	62	62	62
query33	405	269	274	269
query34	864	475	475	475
query35	1127	852	919	852
query36	1339	1164	1239	1164
query37	91	64	60	60
query38	3061	2929	2912	2912
query39	1367	1341	1322	1322
query40	215	95	97	95
query41	46	44	47	44
query42	84	85	80	80
query43	702	686	742	686
query44	1170	714	746	714
query45	256	241	234	234
query46	1227	961	957	957
query47	1875	1813	1785	1785
query48	1009	723	699	699
query49	627	376	382	376
query50	868	619	626	619
query51	4799	4627	4723	4627
query52	92	84	89	84
query53	443	324	325	324
query54	2644	2475	2461	2461
query55	99	76	87	76
query56	260	222	232	222
query57	1121	1102	1056	1056
query58	219	194	196	194
query59	4208	4121	4214	4121
query60	221	216	212	212
query61	99	93	96	93
query62	820	485	446	446
query63	483	345	344	344
query64	2534	1524	1450	1450
query65	3624	3556	3544	3544
query66	752	381	388	381
query67	17441	16132	15635	15635
query68	8492	660	664	660
query69	579	357	357	357
query70	1668	1345	1276	1276
query71	412	299	309	299
query72	6702	3490	3498	3490
query73	739	320	337	320
query74	6207	5844	5871	5844
query75	4653	3749	3641	3641
query76	4896	1138	1168	1138
query77	675	253	262	253
query78	12345	11912	11628	11628
query79	12187	650	653	650
query80	1601	394	396	394
query81	499	235	228	228
query82	1658	96	105	96
query83	164	128	127	127
query84	257	71	70	70
query85	945	315	317	315
query86	348	299	315	299
query87	3242	3031	3012	3012
query88	5221	2328	2313	2313
query89	490	271	299	271
query90	1973	204	219	204
query91	181	141	151	141
query92	59	53	54	53
query93	7155	599	592	592
query94	800	216	215	215
query95	1111	1061	1047	1047
query96	636	324	327	324
query97	6400	6304	6411	6304
query98	201	171	171	171
query99	2963	879	830	830
Total cold run time: 319251 ms
Total hot run time: 203901 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 31.01 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit fd607546372ebe1fe651f78c43b875fedd99277e, data reload: false

query1	0.02	0.02	0.02
query2	0.07	0.02	0.02
query3	0.25	0.05	0.05
query4	1.79	0.07	0.10
query5	0.54	0.52	0.51
query6	1.28	0.62	0.61
query7	0.02	0.00	0.01
query8	0.04	0.02	0.02
query9	0.53	0.49	0.49
query10	0.53	0.53	0.53
query11	0.11	0.08	0.08
query12	0.12	0.08	0.09
query13	0.62	0.62	0.61
query14	0.79	0.81	0.79
query15	0.78	0.76	0.76
query16	0.38	0.37	0.37
query17	1.01	0.96	1.02
query18	0.24	0.25	0.23
query19	1.92	1.84	1.86
query20	0.02	0.01	0.01
query21	15.47	0.63	0.56
query22	2.11	2.43	1.88
query23	17.48	1.16	0.96
query24	8.37	1.03	0.85
query25	0.42	0.12	0.05
query26	0.84	0.16	0.15
query27	0.04	0.03	0.04
query28	4.90	0.81	0.77
query29	12.69	2.34	2.36
query30	0.62	0.59	0.53
query31	2.81	0.39	0.38
query32	3.36	0.49	0.48
query33	3.04	3.10	3.10
query34	15.25	4.81	4.78
query35	4.85	4.86	4.84
query36	1.05	1.03	1.03
query37	0.06	0.04	0.04
query38	0.04	0.02	0.02
query39	0.02	0.02	0.01
query40	0.15	0.13	0.14
query41	0.06	0.02	0.01
query42	0.02	0.01	0.01
query43	0.03	0.01	0.01
Total cold run time: 104.74 s
Total hot run time: 31.01 s

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit fd607546372ebe1fe651f78c43b875fedd99277e with default session variables
Stream load json:         20 seconds loaded 2358488459 Bytes, about 112 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      32 seconds loaded 861443392 Bytes, about 25 MB/s
Insert into select:       21.1 seconds inserted 10000000 Rows, about 473K ops/s

@dataroaring dataroaring merged commit f95221e into apache:branch-2.0 Jun 28, 2024
mongo360 pushed a commit to mongo360/doris that referenced this pull request Aug 16, 2024
When a lot of tablets fail when loading, then detailed information would
cause oom.

## Proposed changes

Issue Number: close #xxx

<!--Describe your changes.-->
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants