Re: BUG #1721: multiple-byte character string comparison

From: Tatsuo Ishii <t-ishii@sra.co.jp>
To: cdliou@mail.cyut.edu.tw
Cc: pgsql-bugs@postgresql.org
Date: 2005-06-20T05:37:40Z
Lists: pgsql-bugs
> The following bug has been logged online:
> 
> Bug reference:      1721
> Logged by:          Chii-Tung Liu
> Email address:      cdliou@mail.cyut.edu.tw
> PostgreSQL version: 8.0.3
> Operating system:   Windows XP SP2
> Description:        multiple-byte character string comparison error
> Details: 
> 
> When comparing two UTF-8 encoded strings that contain Chinese words, the
> result is always TRUE.
> 1. Create a database named test with the encoding set to UNICODE:
> CREATE DATABASE test
>   WITH OWNER = postgres
>        ENCODING = 'UNICODE'
>        TABLESPACE = pg_default;
> 2. Insert data with Chinese words:
> INSERT INTO node (title) VALUES ('1 中文');
> 
> 3. SELECT title from node where title > '1.1 '
> would return '1 中文'
> 
> 4. Both SELECT '1 中文' > '1.1' and  SELECT '1.1' > '1 中文' return
> FALSE

I think you need to use the C locale, that is, initialize the database
cluster with initdb --locale=C.
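
As a rough illustration of why that helps: with the C locale, strings are
compared byte by byte, so exactly one of the two comparisons in step 4 can
be true. The sketch below is not PostgreSQL code; it only emulates that
byte-wise ordering in Python, and the helper name c_locale_greater is made
up for this example.

```python
# Sketch only: emulate C-locale ordering by comparing the raw UTF-8 bytes.
# c_locale_greater is a hypothetical helper, not part of PostgreSQL.
def c_locale_greater(a: str, b: str) -> bool:
    """Return True if a sorts after b under plain byte-wise comparison."""
    return a.encode("utf-8") > b.encode("utf-8")


left, right = "1 中文", "1.1"

# The second byte decides: space (0x20) sorts before '.' (0x2E).
print(c_locale_greater(left, right))   # False
print(c_locale_greater(right, left))   # True
```

Under this ordering the query in step 3 would not return '1 中文', and the
two comparisons in step 4 give opposite answers instead of both being FALSE.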
--
Tatsuo Ishii