Re: Support UTF-8 files with BOM in COPY FROM
Itagaki Takahiro <itagaki.takahiro@gmail.com>
From: Itagaki Takahiro <itagaki.takahiro@gmail.com>
To: Magnus Hagander <magnus@hagander.net>
Cc: PostgreSQL Hackers <pgsql-hackers@postgresql.org>
Date: 2011-09-26T11:36:11Z
Lists: pgsql-hackers
On Mon, Sep 26, 2011 at 20:12, Magnus Hagander <magnus@hagander.net> wrote: > I like it in general. But if we're looking at the BOM, shouldn't we > also look and *reject* the file if it's a BOM for a non-UTF8 file? Say > if the BOM claims it's UTF16? -1 because we're depending on manual configuration for now. It would be reasonable if we had used automatic detection of character encoding, but we don't. In addition, some crazy encoding might use BOM codes as a valid character. -- Itagaki Takahiro