Skip to content

[fix](csv) escape quote with double quote for csv format table#50101

Merged
morningman merged 17 commits intoapache:masterfrom
suxiaogang223:fix_escape_quote
Jun 4, 2025
Merged

[fix](csv) escape quote with double quote for csv format table#50101
morningman merged 17 commits intoapache:masterfrom
suxiaogang223:fix_escape_quote

Conversation

@suxiaogang223
Copy link
Contributor

@suxiaogang223 suxiaogang223 commented Apr 16, 2025

What problem does this PR solve?

Problem Summary:
According to the CSV standard format definition, quote characters inside a string should be escaped using a pair of quote characters. However, the current implementation does not handle this case correctly, which may lead to incorrect parsing results when the input string contains quote characters.

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@hello-stephen
Copy link
Contributor

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@suxiaogang223
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 34145 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 60f020efd1abeec41ada1d6cdcc51d495eeb4436, data reload: false

------ Round 1 ----------------------------------
q1	26311	5086	5059	5059
q2	2075	304	183	183
q3	10356	1242	717	717
q4	10226	1003	578	578
q5	7526	2323	2346	2323
q6	179	162	131	131
q7	919	731	623	623
q8	9313	1281	1083	1083
q9	6736	5135	5133	5133
q10	6866	2315	1879	1879
q11	487	288	276	276
q12	354	346	219	219
q13	17793	3686	3066	3066
q14	230	227	210	210
q15	521	488	491	488
q16	450	452	405	405
q17	611	868	380	380
q18	7636	7208	7188	7188
q19	1564	949	557	557
q20	339	332	218	218
q21	4037	3389	2457	2457
q22	1079	1029	972	972
Total cold run time: 115608 ms
Total hot run time: 34145 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5124	5136	5101	5101
q2	243	328	229	229
q3	2118	2655	2264	2264
q4	1447	1819	1461	1461
q5	4422	4436	4365	4365
q6	220	167	127	127
q7	1965	1900	1736	1736
q8	2582	2551	2484	2484
q9	7217	7179	6978	6978
q10	3023	3167	2723	2723
q11	588	511	482	482
q12	676	756	628	628
q13	3505	3903	3246	3246
q14	307	293	291	291
q15	536	489	484	484
q16	458	491	472	472
q17	1147	1577	1398	1398
q18	7769	7629	7369	7369
q19	781	784	860	784
q20	1952	2071	1811	1811
q21	5125	4843	4820	4820
q22	1093	1072	992	992
Total cold run time: 52298 ms
Total hot run time: 50245 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 192379 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 60f020efd1abeec41ada1d6cdcc51d495eeb4436, data reload: false

query1	1401	1070	1050	1050
query2	6141	1838	1851	1838
query3	11009	4478	4430	4430
query4	55193	26071	23231	23231
query5	5032	544	454	454
query6	360	210	200	200
query7	4896	505	288	288
query8	305	254	241	241
query9	5624	2587	2578	2578
query10	451	342	248	248
query11	15082	15033	14758	14758
query12	147	111	102	102
query13	1037	503	405	405
query14	10142	6144	6168	6144
query15	194	189	178	178
query16	7217	656	521	521
query17	1085	724	598	598
query18	1633	400	343	343
query19	199	206	153	153
query20	123	128	124	124
query21	203	119	113	113
query22	4688	4695	4492	4492
query23	33975	33348	33617	33348
query24	6499	2468	2414	2414
query25	472	450	402	402
query26	687	290	174	174
query27	2137	502	338	338
query28	3115	2155	2172	2155
query29	617	596	460	460
query30	287	224	195	195
query31	899	871	823	823
query32	72	64	58	58
query33	473	355	307	307
query34	748	876	496	496
query35	816	822	740	740
query36	929	1013	914	914
query37	117	103	74	74
query38	4170	4200	4184	4184
query39	1518	1441	1450	1441
query40	222	123	105	105
query41	53	50	50	50
query42	129	106	109	106
query43	480	514	490	490
query44	1301	809	824	809
query45	182	181	164	164
query46	839	1022	632	632
query47	1858	1897	1812	1812
query48	382	410	303	303
query49	682	519	395	395
query50	645	685	403	403
query51	4200	4310	4257	4257
query52	113	108	102	102
query53	221	261	185	185
query54	580	574	510	510
query55	89	82	84	82
query56	297	289	312	289
query57	1149	1182	1144	1144
query58	266	257	263	257
query59	2657	2813	2725	2725
query60	351	336	335	335
query61	160	155	152	152
query62	769	742	690	690
query63	225	192	188	188
query64	2015	1077	695	695
query65	4343	4255	4220	4220
query66	713	396	301	301
query67	15832	15818	15431	15431
query68	7023	882	515	515
query69	542	301	254	254
query70	1181	1195	1096	1096
query71	523	311	301	301
query72	5774	4745	4831	4745
query73	1504	635	339	339
query74	9390	8786	8918	8786
query75	3914	3189	2725	2725
query76	4184	1186	758	758
query77	641	363	281	281
query78	10034	10171	9321	9321
query79	1992	805	566	566
query80	652	499	427	427
query81	467	252	214	214
query82	415	127	98	98
query83	246	246	233	233
query84	288	101	82	82
query85	773	351	304	304
query86	349	290	272	272
query87	4427	4416	4332	4332
query88	3352	2163	2161	2161
query89	404	310	275	275
query90	1951	201	207	201
query91	146	145	116	116
query92	74	67	56	56
query93	1099	949	578	578
query94	674	413	290	290
query95	364	286	289	286
query96	488	571	274	274
query97	3165	3219	3189	3189
query98	230	204	197	197
query99	1688	1361	1263	1263
Total cold run time: 298400 ms
Total hot run time: 192379 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.5 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 60f020efd1abeec41ada1d6cdcc51d495eeb4436, data reload: false

query1	0.04	0.03	0.03
query2	0.11	0.11	0.12
query3	0.26	0.19	0.19
query4	1.59	0.20	0.19
query5	0.58	0.57	0.59
query6	1.20	0.72	0.72
query7	0.02	0.02	0.01
query8	0.04	0.03	0.04
query9	0.57	0.54	0.50
query10	0.56	0.58	0.58
query11	0.15	0.11	0.11
query12	0.14	0.12	0.11
query13	0.61	0.59	0.59
query14	1.17	1.16	1.21
query15	0.87	0.86	0.86
query16	0.38	0.38	0.37
query17	1.04	1.02	1.04
query18	0.22	0.20	0.20
query19	1.90	1.82	1.80
query20	0.01	0.02	0.01
query21	15.43	0.91	0.56
query22	0.75	1.06	0.67
query23	15.08	1.38	0.62
query24	6.97	1.44	0.86
query25	0.51	0.17	0.08
query26	0.46	0.16	0.13
query27	0.05	0.05	0.05
query28	10.49	0.90	0.43
query29	12.56	4.04	3.29
query30	0.25	0.10	0.07
query31	2.82	0.61	0.39
query32	3.23	0.53	0.48
query33	3.04	3.07	3.07
query34	15.69	5.06	4.45
query35	4.50	4.51	4.42
query36	0.65	0.50	0.48
query37	0.08	0.06	0.06
query38	0.05	0.04	0.03
query39	0.03	0.02	0.02
query40	0.16	0.14	0.12
query41	0.08	0.02	0.02
query42	0.03	0.02	0.02
query43	0.04	0.03	0.02
Total cold run time: 104.41 s
Total hot run time: 29.5 s

@doris-robot
Copy link

BE UT Coverage Report

Increment line coverage 0.00% (0/46) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 53.15% (14422/27137)
Line Coverage 42.00% (124972/297537)
Region Coverage 40.81% (63851/156472)
Branch Coverage 35.44% (32111/90608)

@suxiaogang223
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 34185 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit a3537fffb434cf019496af69b8e09d26ce89a661, data reload: false

------ Round 1 ----------------------------------
q1	26249	5062	5054	5054
q2	2072	281	203	203
q3	10357	1256	716	716
q4	10218	1011	555	555
q5	7510	2357	2405	2357
q6	186	164	135	135
q7	1016	749	610	610
q8	9306	1284	1117	1117
q9	6724	5132	5097	5097
q10	6842	2314	1894	1894
q11	481	288	280	280
q12	362	362	226	226
q13	17784	3722	3116	3116
q14	221	223	217	217
q15	544	489	487	487
q16	449	451	408	408
q17	605	876	363	363
q18	7489	7127	7127	7127
q19	1926	976	569	569
q20	340	343	227	227
q21	4091	3443	2466	2466
q22	1036	1030	961	961
Total cold run time: 115808 ms
Total hot run time: 34185 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5228	5128	5145	5128
q2	243	330	230	230
q3	2184	2685	2297	2297
q4	1427	1917	1503	1503
q5	4507	4453	4419	4419
q6	224	171	128	128
q7	1992	1939	1758	1758
q8	2643	2691	2615	2615
q9	7352	7179	7113	7113
q10	2996	3180	2739	2739
q11	573	502	483	483
q12	700	760	632	632
q13	3540	3906	3346	3346
q14	285	328	295	295
q15	537	487	467	467
q16	475	514	459	459
q17	1191	1529	1407	1407
q18	7866	7544	7419	7419
q19	822	854	982	854
q20	2036	2095	1829	1829
q21	5298	4944	4798	4798
q22	1087	1088	1042	1042
Total cold run time: 53206 ms
Total hot run time: 50961 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 191935 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit a3537fffb434cf019496af69b8e09d26ce89a661, data reload: false

query1	1399	1105	1072	1072
query2	6346	1785	1775	1775
query3	10975	4540	4520	4520
query4	53063	26217	22935	22935
query5	5090	529	449	449
query6	341	215	186	186
query7	4899	497	286	286
query8	334	274	244	244
query9	5832	2598	2614	2598
query10	427	319	256	256
query11	15054	15020	14796	14796
query12	159	109	103	103
query13	1048	492	408	408
query14	9953	6392	6322	6322
query15	194	210	168	168
query16	7050	639	521	521
query17	1070	714	577	577
query18	1549	407	332	332
query19	189	186	168	168
query20	128	127	122	122
query21	205	123	113	113
query22	4353	4380	4268	4268
query23	34290	33679	33485	33485
query24	6303	2445	2444	2444
query25	492	498	435	435
query26	672	286	160	160
query27	2135	508	345	345
query28	3039	2167	2166	2166
query29	637	550	436	436
query30	269	218	189	189
query31	899	862	772	772
query32	77	62	66	62
query33	461	365	312	312
query34	753	878	512	512
query35	789	845	776	776
query36	955	982	915	915
query37	119	105	76	76
query38	4261	4290	4373	4290
query39	1486	1436	1447	1436
query40	210	120	107	107
query41	55	53	49	49
query42	122	105	109	105
query43	505	507	493	493
query44	1391	833	842	833
query45	189	174	168	168
query46	848	1030	652	652
query47	1819	1875	1794	1794
query48	392	431	307	307
query49	707	505	431	431
query50	672	711	432	432
query51	4197	4289	4149	4149
query52	112	115	107	107
query53	234	268	183	183
query54	591	568	534	534
query55	83	87	86	86
query56	324	319	281	281
query57	1151	1171	1124	1124
query58	277	260	258	258
query59	2667	2746	2545	2545
query60	330	327	315	315
query61	129	125	129	125
query62	741	757	674	674
query63	229	188	191	188
query64	1482	1030	668	668
query65	4430	4348	4284	4284
query66	724	400	298	298
query67	15923	15453	15160	15160
query68	7154	875	505	505
query69	536	298	264	264
query70	1211	1094	1144	1094
query71	495	320	305	305
query72	5560	4988	4880	4880
query73	1430	630	346	346
query74	9211	8829	8872	8829
query75	3794	3201	2693	2693
query76	4261	1181	757	757
query77	635	370	294	294
query78	9944	10123	9212	9212
query79	2562	806	572	572
query80	655	501	439	439
query81	487	250	216	216
query82	426	127	98	98
query83	376	248	285	248
query84	292	106	86	86
query85	792	355	304	304
query86	373	308	282	282
query87	4422	4347	4264	4264
query88	3220	2233	2242	2233
query89	403	325	283	283
query90	1907	205	212	205
query91	144	149	108	108
query92	76	65	55	55
query93	1310	952	591	591
query94	689	411	291	291
query95	374	288	338	288
query96	480	574	277	277
query97	3145	3205	3084	3084
query98	236	207	218	207
query99	1444	1405	1245	1245
Total cold run time: 295873 ms
Total hot run time: 191935 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.78 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit a3537fffb434cf019496af69b8e09d26ce89a661, data reload: false

query1	0.04	0.04	0.03
query2	0.12	0.11	0.11
query3	0.26	0.20	0.19
query4	1.60	0.19	0.20
query5	0.60	0.58	0.60
query6	1.19	0.71	0.71
query7	0.02	0.02	0.01
query8	0.05	0.04	0.03
query9	0.59	0.52	0.53
query10	0.56	0.57	0.57
query11	0.15	0.11	0.10
query12	0.14	0.12	0.11
query13	0.61	0.59	0.59
query14	1.16	1.17	1.20
query15	0.87	0.84	0.85
query16	0.43	0.40	0.39
query17	1.00	1.01	1.06
query18	0.21	0.19	0.20
query19	1.90	1.82	1.79
query20	0.01	0.01	0.01
query21	15.39	0.92	0.55
query22	0.77	1.17	0.75
query23	14.86	1.40	0.67
query24	6.94	1.00	1.33
query25	0.49	0.22	0.09
query26	0.63	0.17	0.13
query27	0.05	0.05	0.05
query28	9.13	0.91	0.43
query29	12.56	4.03	3.33
query30	0.26	0.08	0.06
query31	2.84	0.59	0.39
query32	3.23	0.54	0.46
query33	3.07	3.00	3.10
query34	15.74	5.08	4.46
query35	4.56	4.48	4.50
query36	0.66	0.48	0.49
query37	0.09	0.06	0.06
query38	0.05	0.04	0.04
query39	0.03	0.02	0.03
query40	0.16	0.14	0.12
query41	0.08	0.03	0.02
query42	0.04	0.03	0.02
query43	0.04	0.03	0.03
Total cold run time: 103.18 s
Total hot run time: 29.78 s

@hello-stephen
Copy link
Contributor

BE UT Coverage Report

Increment line coverage 0.00% (0/46) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 53.14% (14419/27135)
Line Coverage 42.00% (124956/297489)
Region Coverage 40.80% (63821/156405)
Branch Coverage 35.44% (32096/90552)

@suxiaogang223
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 34096 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit c666cd1766f656978cd4ad71291ad299d4d36297, data reload: false

------ Round 1 ----------------------------------
q1	26489	5137	5022	5022
q2	2090	278	188	188
q3	10381	1251	719	719
q4	10234	1008	540	540
q5	7565	2399	2427	2399
q6	181	164	133	133
q7	924	748	618	618
q8	9339	1339	1167	1167
q9	6764	5152	5086	5086
q10	6829	2324	1880	1880
q11	472	288	279	279
q12	352	356	229	229
q13	17769	3694	3057	3057
q14	234	225	208	208
q15	536	480	484	480
q16	457	452	410	410
q17	604	861	358	358
q18	7605	7201	7095	7095
q19	1733	973	568	568
q20	330	337	221	221
q21	4000	3393	2445	2445
q22	1033	1029	994	994
Total cold run time: 115921 ms
Total hot run time: 34096 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5154	5083	5044	5044
q2	234	337	233	233
q3	2190	2645	2314	2314
q4	1401	1818	1489	1489
q5	4469	4391	4408	4391
q6	213	190	127	127
q7	1987	1907	1781	1781
q8	2592	2598	2766	2598
q9	7254	7211	7155	7155
q10	2993	3218	2757	2757
q11	581	512	483	483
q12	705	752	599	599
q13	3522	3862	3308	3308
q14	308	306	291	291
q15	549	501	494	494
q16	477	509	476	476
q17	1153	1592	1382	1382
q18	7914	7578	7567	7567
q19	840	807	897	807
q20	2016	1991	1883	1883
q21	5259	4881	4795	4795
q22	1093	1067	1047	1047
Total cold run time: 52904 ms
Total hot run time: 51021 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 193103 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit c666cd1766f656978cd4ad71291ad299d4d36297, data reload: false

query1	1382	1102	1065	1065
query2	6222	1885	1826	1826
query3	10980	4493	4548	4493
query4	54541	26078	23293	23293
query5	5238	565	455	455
query6	340	197	186	186
query7	4999	510	288	288
query8	319	264	255	255
query9	5854	2551	2576	2551
query10	434	331	267	267
query11	15035	14981	14784	14784
query12	162	108	102	102
query13	1078	497	401	401
query14	10026	6289	6329	6289
query15	202	202	178	178
query16	7086	680	532	532
query17	1108	777	602	602
query18	1552	421	328	328
query19	213	204	161	161
query20	129	126	125	125
query21	215	130	110	110
query22	4540	4620	4518	4518
query23	34101	33424	33578	33424
query24	6545	2449	2432	2432
query25	516	459	400	400
query26	691	281	154	154
query27	2257	512	333	333
query28	2969	2168	2139	2139
query29	595	563	435	435
query30	280	235	199	199
query31	856	840	778	778
query32	77	60	61	60
query33	484	384	316	316
query34	767	885	522	522
query35	790	894	767	767
query36	962	987	900	900
query37	116	103	79	79
query38	4236	4176	4204	4176
query39	1642	1441	1430	1430
query40	207	116	104	104
query41	54	54	76	54
query42	126	112	104	104
query43	505	500	486	486
query44	1324	828	817	817
query45	178	183	169	169
query46	860	1040	651	651
query47	1929	1885	1800	1800
query48	377	417	304	304
query49	691	519	427	427
query50	680	714	421	421
query51	4135	4236	4255	4236
query52	117	110	103	103
query53	234	257	182	182
query54	604	578	540	540
query55	84	82	83	82
query56	289	308	295	295
query57	1148	1199	1130	1130
query58	264	258	253	253
query59	2762	2816	2789	2789
query60	327	330	303	303
query61	162	132	129	129
query62	751	740	675	675
query63	225	189	188	188
query64	1731	1042	726	726
query65	4429	4424	4256	4256
query66	722	396	302	302
query67	15914	15622	15565	15565
query68	7029	883	512	512
query69	545	299	260	260
query70	1162	1106	1081	1081
query71	511	315	293	293
query72	5982	4849	4870	4849
query73	1426	678	336	336
query74	8942	9166	9006	9006
query75	3954	3256	2654	2654
query76	4212	1191	777	777
query77	643	365	285	285
query78	10058	10118	9244	9244
query79	2712	809	567	567
query80	627	505	445	445
query81	470	251	222	222
query82	454	125	95	95
query83	321	245	233	233
query84	297	100	83	83
query85	848	350	315	315
query86	367	306	304	304
query87	4379	4434	4247	4247
query88	3484	2169	2218	2169
query89	394	313	275	275
query90	1925	209	211	209
query91	142	140	110	110
query92	77	56	57	56
query93	1527	940	566	566
query94	665	410	296	296
query95	368	293	281	281
query96	497	564	274	274
query97	3162	3281	3126	3126
query98	233	216	204	204
query99	1443	1411	1285	1285
Total cold run time: 299224 ms
Total hot run time: 193103 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.76 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit c666cd1766f656978cd4ad71291ad299d4d36297, data reload: false

query1	0.04	0.04	0.03
query2	0.12	0.11	0.11
query3	0.24	0.20	0.20
query4	1.59	0.20	0.20
query5	0.60	0.59	0.60
query6	1.20	0.72	0.71
query7	0.02	0.02	0.02
query8	0.05	0.04	0.03
query9	0.57	0.53	0.52
query10	0.58	0.59	0.58
query11	0.16	0.10	0.11
query12	0.15	0.11	0.12
query13	0.62	0.59	0.60
query14	1.15	1.17	1.19
query15	0.87	0.85	0.85
query16	0.39	0.38	0.39
query17	1.00	1.06	1.01
query18	0.22	0.20	0.19
query19	1.87	1.85	1.79
query20	0.01	0.01	0.01
query21	15.41	0.92	0.57
query22	0.76	1.26	0.97
query23	14.68	1.39	0.66
query24	6.55	1.85	0.38
query25	0.35	0.26	0.24
query26	0.67	0.15	0.14
query27	0.05	0.04	0.06
query28	9.34	0.88	0.44
query29	12.56	4.02	3.35
query30	0.25	0.09	0.07
query31	2.82	0.60	0.39
query32	3.22	0.55	0.47
query33	3.05	3.10	3.03
query34	15.83	5.10	4.50
query35	4.54	4.56	4.53
query36	0.69	0.51	0.48
query37	0.08	0.06	0.06
query38	0.05	0.04	0.04
query39	0.03	0.02	0.02
query40	0.16	0.14	0.13
query41	0.08	0.03	0.02
query42	0.03	0.02	0.02
query43	0.04	0.02	0.02
Total cold run time: 102.69 s
Total hot run time: 29.76 s

@suxiaogang223
Copy link
Contributor Author

run buildall

2 similar comments
@suxiaogang223
Copy link
Contributor Author

run buildall

@suxiaogang223
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 34089 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 66de029eb59b37aae8819494771f5123e6a77144, data reload: false

------ Round 1 ----------------------------------
q1	26422	5165	5178	5165
q2	2086	292	199	199
q3	10494	1282	690	690
q4	10242	1019	542	542
q5	7727	2488	2339	2339
q6	184	166	133	133
q7	920	739	627	627
q8	9334	1355	1108	1108
q9	6895	5149	5155	5149
q10	6832	2325	1884	1884
q11	460	289	280	280
q12	352	361	215	215
q13	17772	3714	3052	3052
q14	226	239	217	217
q15	531	485	487	485
q16	450	451	404	404
q17	601	876	377	377
q18	7573	7263	7065	7065
q19	1229	973	565	565
q20	342	356	216	216
q21	4007	3376	2409	2409
q22	1033	1025	968	968
Total cold run time: 115712 ms
Total hot run time: 34089 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5163	5240	5130	5130
q2	229	331	238	238
q3	2162	2636	2293	2293
q4	1409	1812	1499	1499
q5	4583	4450	4412	4412
q6	223	177	127	127
q7	1967	1888	1737	1737
q8	2610	2519	2513	2513
q9	7102	7136	7179	7136
q10	3003	3176	2725	2725
q11	572	520	486	486
q12	704	773	592	592
q13	3498	3849	3291	3291
q14	294	293	267	267
q15	532	477	479	477
q16	467	508	456	456
q17	1215	1577	1395	1395
q18	7796	7561	7395	7395
q19	824	830	878	830
q20	1991	2052	1804	1804
q21	5136	4716	4540	4540
q22	1064	1041	999	999
Total cold run time: 52544 ms
Total hot run time: 50342 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 185219 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 66de029eb59b37aae8819494771f5123e6a77144, data reload: false

query1	988	469	497	469
query2	6558	1815	1820	1815
query3	6744	229	216	216
query4	26391	23326	23091	23091
query5	4285	603	461	461
query6	295	200	229	200
query7	4633	488	275	275
query8	293	245	255	245
query9	8625	2527	2555	2527
query10	491	320	274	274
query11	15424	15077	14834	14834
query12	162	111	106	106
query13	1665	513	391	391
query14	9390	6133	5954	5954
query15	219	190	168	168
query16	7447	637	461	461
query17	1190	717	555	555
query18	1966	390	312	312
query19	189	183	153	153
query20	118	122	115	115
query21	216	125	106	106
query22	4158	4498	4192	4192
query23	33847	32834	32917	32834
query24	8498	2350	2358	2350
query25	513	456	407	407
query26	1227	266	145	145
query27	2763	495	318	318
query28	4356	2080	2056	2056
query29	718	538	421	421
query30	275	215	186	186
query31	942	856	756	756
query32	73	66	62	62
query33	573	380	314	314
query34	785	835	509	509
query35	781	827	739	739
query36	965	1040	942	942
query37	110	102	73	73
query38	4163	4273	4097	4097
query39	1477	1426	1387	1387
query40	214	115	102	102
query41	56	53	53	53
query42	120	106	115	106
query43	497	500	455	455
query44	1260	790	780	780
query45	179	176	178	176
query46	839	1023	620	620
query47	1781	1822	1784	1784
query48	380	414	301	301
query49	811	513	443	443
query50	649	684	417	417
query51	4063	4186	4073	4073
query52	107	107	99	99
query53	232	259	197	197
query54	580	577	529	529
query55	80	79	80	79
query56	305	293	274	274
query57	1160	1161	1097	1097
query58	267	252	257	252
query59	2558	2686	2555	2555
query60	318	324	309	309
query61	133	145	128	128
query62	796	748	668	668
query63	219	181	180	180
query64	4288	1016	670	670
query65	4343	4254	4247	4247
query66	1134	415	309	309
query67	15851	15820	15371	15371
query68	8373	882	507	507
query69	461	301	264	264
query70	1247	1183	1092	1092
query71	481	327	307	307
query72	5549	4763	4764	4763
query73	762	607	340	340
query74	8902	9129	8740	8740
query75	3956	3214	2725	2725
query76	3739	1193	753	753
query77	771	367	280	280
query78	10068	9909	9270	9270
query79	5586	804	542	542
query80	710	522	432	432
query81	462	259	218	218
query82	729	127	100	100
query83	286	250	236	236
query84	291	107	82	82
query85	769	346	307	307
query86	333	308	270	270
query87	4473	4359	4325	4325
query88	3248	2178	2204	2178
query89	463	313	286	286
query90	1955	208	206	206
query91	143	141	110	110
query92	81	63	56	56
query93	2980	957	578	578
query94	668	391	306	306
query95	374	292	275	275
query96	481	563	266	266
query97	3149	3221	3158	3158
query98	228	203	208	203
query99	1443	1417	1254	1254
Total cold run time: 279894 ms
Total hot run time: 185219 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.21 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 66de029eb59b37aae8819494771f5123e6a77144, data reload: false

query1	0.05	0.04	0.04
query2	0.12	0.11	0.11
query3	0.25	0.19	0.19
query4	1.59	0.20	0.19
query5	0.59	0.59	0.57
query6	1.21	0.71	0.73
query7	0.02	0.02	0.01
query8	0.05	0.03	0.04
query9	0.57	0.54	0.51
query10	0.56	0.57	0.57
query11	0.16	0.10	0.11
query12	0.15	0.11	0.12
query13	0.61	0.59	0.59
query14	1.16	1.21	1.15
query15	0.88	0.86	0.85
query16	0.38	0.38	0.39
query17	1.02	1.02	1.02
query18	0.21	0.19	0.20
query19	1.89	1.73	1.71
query20	0.02	0.01	0.02
query21	15.40	0.88	0.55
query22	0.76	1.30	0.78
query23	14.71	1.36	0.64
query24	7.07	1.96	0.43
query25	0.50	0.27	0.15
query26	0.67	0.17	0.13
query27	0.05	0.05	0.05
query28	9.31	0.87	0.45
query29	12.52	3.95	3.30
query30	0.27	0.10	0.06
query31	2.82	0.58	0.38
query32	3.22	0.54	0.46
query33	3.07	3.02	3.13
query34	15.72	5.07	4.50
query35	4.55	4.51	4.48
query36	0.66	0.49	0.48
query37	0.09	0.06	0.06
query38	0.05	0.04	0.03
query39	0.03	0.03	0.02
query40	0.16	0.14	0.12
query41	0.08	0.03	0.03
query42	0.04	0.03	0.02
query43	0.03	0.03	0.03
Total cold run time: 103.27 s
Total hot run time: 29.21 s

@hello-stephen
Copy link
Contributor

BE UT Coverage Report

Increment line coverage 27.66% (13/47) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 53.63% (14560/27151)
Line Coverage 42.44% (126220/297376)
Region Coverage 41.25% (64497/156369)
Branch Coverage 35.83% (32431/90518)

@hello-stephen
Copy link
Contributor

BE Regression P0 && UT Coverage Report

Increment line coverage 44.68% (21/47) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage
Line Coverage
Region Coverage
Branch Coverage

@suxiaogang223
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 33879 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit dacd8801c526d1afcb24ec6d9a4746861dd68179, data reload: false

------ Round 1 ----------------------------------
q1	25913	5160	5058	5058
q2	2074	277	184	184
q3	10462	1239	680	680
q4	10265	991	541	541
q5	8747	2434	2330	2330
q6	268	161	129	129
q7	948	770	602	602
q8	9324	1277	1107	1107
q9	6914	5057	5118	5057
q10	6865	2316	1900	1900
q11	479	283	259	259
q12	343	355	222	222
q13	17757	3999	3155	3155
q14	222	225	221	221
q15	533	486	485	485
q16	445	451	396	396
q17	595	859	369	369
q18	7684	7201	7029	7029
q19	1513	943	555	555
q20	351	343	222	222
q21	3912	3354	2419	2419
q22	1019	1019	959	959
Total cold run time: 116633 ms
Total hot run time: 33879 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5119	5093	5063	5063
q2	242	326	229	229
q3	2147	2646	2280	2280
q4	1419	1806	1507	1507
q5	4560	4406	4382	4382
q6	208	163	124	124
q7	1987	1905	1731	1731
q8	2604	2492	2484	2484
q9	7051	7069	7094	7069
q10	2971	3136	2749	2749
q11	569	500	480	480
q12	694	759	615	615
q13	3413	3851	3242	3242
q14	280	298	281	281
q15	537	481	483	481
q16	456	499	464	464
q17	1140	1599	1371	1371
q18	7625	7457	7375	7375
q19	798	760	826	760
q20	2042	2009	1906	1906
q21	5240	4719	4695	4695
q22	1025	994	944	944
Total cold run time: 52127 ms
Total hot run time: 50232 ms

@github-actions github-actions bot added approved Indicates a PR has been approved by one committer. reviewed labels Jun 4, 2025
@github-actions
Copy link
Contributor

github-actions bot commented Jun 4, 2025

PR approved by anyone and no changes requested.

@morningman morningman merged commit 084c802 into apache:master Jun 4, 2025
29 of 32 checks passed
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 27, 2025
…e#50101)

Problem Summary:
According to the CSV standard format definition, quote characters inside
a string should be escaped using a pair of quote characters. However,
the current implementation does not handle this case correctly, which
may lead to incorrect parsing results when the input string contains
quote characters.
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 27, 2025
…e#50101)

Problem Summary:
According to the CSV standard format definition, quote characters inside
a string should be escaped using a pair of quote characters. However,
the current implementation does not handle this case correctly, which
may lead to incorrect parsing results when the input string contains
quote characters.
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 30, 2025
…e#50101)

Problem Summary:
According to the CSV standard format definition, quote characters inside
a string should be escaped using a pair of quote characters. However,
the current implementation does not handle this case correctly, which
may lead to incorrect parsing results when the input string contains
quote characters.
koarz pushed a commit to koarz/doris that referenced this pull request Jul 3, 2025
@suxiaogang223 suxiaogang223 deleted the fix_escape_quote branch July 10, 2025 09:01
morningman pushed a commit that referenced this pull request Nov 6, 2025
### What problem does this PR solve?

Fix wrong result when escape same as enclose, introduced by
#50101

data:
```
50,"{""a"": 1}"
60,"{""a"": 2}"
```

query
```
select * from local( "backend_id" = "1760087225568", "file_path" = "test.csv", "format" = "csv", "column_separator" = ",", "enclose" = "\"", "escape" = "\""
```

expectation:
```
+------+------------+
| k1   | k2         |
+------+------------+
|   50 | {"a": 1} |
|   60 | {"a": 2} |
+------+------------+
```

real:
```
+------+------------------------+
| k1   | k2                     |
+------+------------------------+
|   50 | {"a": 1}
60,{"a": 2}

 |
+------+------------------------+
```
github-actions bot pushed a commit that referenced this pull request Nov 6, 2025
### What problem does this PR solve?

Fix wrong result when escape same as enclose, introduced by
#50101

data:
```
50,"{""a"": 1}"
60,"{""a"": 2}"
```

query
```
select * from local( "backend_id" = "1760087225568", "file_path" = "test.csv", "format" = "csv", "column_separator" = ",", "enclose" = "\"", "escape" = "\""
```

expectation:
```
+------+------------+
| k1   | k2         |
+------+------------+
|   50 | {"a": 1} |
|   60 | {"a": 2} |
+------+------------+
```

real:
```
+------+------------------------+
| k1   | k2                     |
+------+------------------------+
|   50 | {"a": 1}
60,{"a": 2}

 |
+------+------------------------+
```
github-actions bot pushed a commit that referenced this pull request Nov 6, 2025
### What problem does this PR solve?

Fix wrong result when escape same as enclose, introduced by
#50101

data:
```
50,"{""a"": 1}"
60,"{""a"": 2}"
```

query
```
select * from local( "backend_id" = "1760087225568", "file_path" = "test.csv", "format" = "csv", "column_separator" = ",", "enclose" = "\"", "escape" = "\""
```

expectation:
```
+------+------------+
| k1   | k2         |
+------+------------+
|   50 | {"a": 1} |
|   60 | {"a": 2} |
+------+------------+
```

real:
```
+------+------------------------+
| k1   | k2                     |
+------+------------------------+
|   50 | {"a": 1}
60,{"a": 2}

 |
+------+------------------------+
```
wyxxxcat pushed a commit to wyxxxcat/doris that referenced this pull request Nov 18, 2025
…e#57632)

### What problem does this PR solve?

Fix wrong result when escape same as enclose, introduced by
apache#50101

data:
```
50,"{""a"": 1}"
60,"{""a"": 2}"
```

query
```
select * from local( "backend_id" = "1760087225568", "file_path" = "test.csv", "format" = "csv", "column_separator" = ",", "enclose" = "\"", "escape" = "\""
```

expectation:
```
+------+------------+
| k1   | k2         |
+------+------------+
|   50 | {"a": 1} |
|   60 | {"a": 2} |
+------+------------+
```

real:
```
+------+------------------------+
| k1   | k2                     |
+------+------------------------+
|   50 | {"a": 1}
60,{"a": 2}

 |
+------+------------------------+
```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. dev/3.0.7-merged dev/3.1.0-merged reviewed usercase Important user case type label

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants

Comments