Re: Speed up COPY FROM text/CSV parsing using SIMD
Nazir Bilal Yavuz <byavuz81@gmail.com>
From: Nazir Bilal Yavuz <byavuz81@gmail.com>
To: Manni Wood <manni.wood@enterprisedb.com>
Cc: KAZAR Ayoub <ma_kazar@esi.dz>, Nathan Bossart <nathandbossart@gmail.com>, Andrew Dunstan <andrew@dunslane.net>, Shinya Kato <shinya11.kato@gmail.com>, PostgreSQL-development <pgsql-hackers@postgresql.org>
Date: 2025-12-10T11:59:41Z
Lists: pgsql-hackers
Attachments
- v4.2-0001-Speed-up-COPY-FROM-text-CSV-parsing-using-SIMD.patch (text/x-patch)
- v4.2-0002-Speed-up-COPY-FROM-text-CSV-parsing-using-SIMD.patch (text/x-patch)
- v4.2-0003-Feedback-Changes.patch (text/x-patch)
Hi, On Wed, 10 Dec 2025 at 01:13, Manni Wood <manni.wood@enterprisedb.com> wrote: > > Bilal Yavuz (Nazir Bilal Yavuz?), It is Nazir Bilal Yavuz, I changed some settings on my phone and it seems that it affected my mail account, hopefully it should be fixed now. > I did not get a chance to do any work on this today, but wanted to thank you for finding my logic errors in counting special chars for CSV, and hacking on my naive solution to make it faster. By attempting Andrew Dunstan's suggestion, I got a better feel for the reality that the "housekeeping" code produces a significant amount of overhead. You are welcome! v4.1 has some problems with in_quote case in SIMD handling code and counting cstate->chars_processed variable. I fixed them in v4.2. -- Regards, Nazir Bilal Yavuz Microsoft