Thread
-
Reject HEADER with binary and json COPY formats by option presence
Chao Li <li.evan.chao@gmail.com> — 2026-05-31T01:57:06Z
Hi, While testing “file_fdw: Support multi-line HEADER option”, I noticed a small issue. The doc says that the HEADER option cannot be used with the “binary" or “json" format: ``` <varlistentry id="sql-copy-params-header"> <term><literal>HEADER</literal></term> <listitem> <para> On output, if this option is set to <literal>true</literal> (or an equivalent Boolean value), the first line of the output will contain the column names from the table. Integer values <literal>0</literal> and <literal>1</literal> are accepted as Boolean values, but other integers are not allowed for <command>COPY TO</command> commands. </para> <para> On input, if this option is set to <literal>true</literal> (or an equivalent Boolean value), the first line of the input is discarded. If set to a non-negative integer, that number of lines are discarded. If set to <literal>MATCH</literal>, the first line is discarded, and it must contain column names that exactly match the table's columns, in both number and order; otherwise, an error is raised. The <literal>MATCH</literal> value is only valid for <command>COPY FROM</command> commands. </para> <para> This option is not allowed when using <literal>binary</literal> or <literal>json</literal> format. </para> </listitem> </varlistentry> ``` However, when I specified "header ‘0", the command did not fail. That means the current behavior depends on the value of the “header" option, not on presence: ``` evantest=# create foreign table ft (i int) server fs options (format 'binary', filename '/tmp/ft.bin', header '1'); ERROR: cannot specify HEADER in BINARY mode evantest=# create foreign table ft (i int) server fs options (format 'binary', filename '/tmp/ft.bin', header '0'); CREATE FOREIGN TABLE ``` As we can see, "header 1" fails, but header 0" is silently accepted. I don't think this behavior matches what the documentation describes. For comparison, VACUUM has a similar option. “BUFFER_USAGE_LIMIT" is not allowed with "VACUUM FULL", and a value of 0 means disabling the buffer access strategy: ``` <varlistentry> <term><literal>BUFFER_USAGE_LIMIT</literal></term> <listitem> <para> Specifies the <glossterm linkend="glossary-buffer-access-strategy">Buffer Access Strategy</glossterm> ring buffer size for <command>VACUUM</command>. This size is used to calculate the number of shared buffers which will be reused as part of this strategy. <literal>0</literal> disables use of a <literal>Buffer Access Strategy</literal>. If <option>ANALYZE</option> is also specified, the <option>BUFFER_USAGE_LIMIT</option> value is used for both the vacuum and analyze stages. This option can't be used with the <option>FULL</option> option except if <option>ANALYZE</option> is also specified. When this option is not specified, <command>VACUUM</command> uses the value from <xref linkend="guc-vacuum-buffer-usage-limit"/>. Higher settings can allow <command>VACUUM</command> to run more quickly, but having too large a setting may cause too many other useful pages to be evicted from shared buffers. The minimum value is <literal>128 kB</literal> and the maximum value is <literal>16 GB</literal>. </para> </listitem> </varlistentry> ``` Using BUFFER_USAGE_LIMIT 0 with FULL is still rejected: ``` evantest=# vacuum (full, BUFFER_USAGE_LIMIT 0) t; ERROR: BUFFER_USAGE_LIMIT cannot be specified for VACUUM FULL ``` So VACUUM rejects BUFFER_USAGE_LIMIT based on the presence of the option, not its value. I think we should keep the behavior consistent here, and VACUUM's behavior better matches the documentation. Otherwise, I am afraid this could encourage more inconsistencies in the future. The fix is straightforward. Since we already have the “header_specified" variable to indicate whether the option is present, we can check “header_specified" instead. I reported a similar issue for the COPY command earlier in thread [1]. If this patch is accepted, then that one may be worth considering as well. [1] https://www.postgresql.org/message-id/C1D2509E-E5D1-46B0-932C-B57AA7B963A1%40gmail.com -- Chao Li (Evan) HighGo Software Co., Ltd. https://www.highgo.com/