Re: Remaining dependency on setlocale()
Peter Eisentraut <peter@eisentraut.org>
Commits
GET /api/v1/messages/:b64id/commits
the thread's linked commits as JSON, with link sources.
API reference →
-
fuzzystrmatch: use pg_ascii_toupper().
- b96a9fd76f32 19 (unreleased) landed
-
Avoid global LC_CTYPE dependency in pg_locale_icu.c.
- 0a90df58cf38 19 (unreleased) landed
-
downcase_identifier(): use method table from locale provider.
- 87b2968df0f8 19 (unreleased) landed
-
ltree: fix case-insensitive matching.
- 806555e3000d 18.2 landed
- 7f007e4a044a 19 (unreleased) landed
-
Fix multibyte issue in ltree_strncasecmp().
- 898991966bc9 14.21 landed
- 335b2f30b468 15.16 landed
- b80227c0a54c 16.12 landed
- b8cfe9dc2e7f 17.8 landed
- f79e239e0bc6 18.2 landed
- 84d5efa7e3eb 19 (unreleased) landed
-
Use multibyte-aware extraction of pattern prefixes.
- 9c8de1596912 19 (unreleased) landed
-
Add pg_iswcased().
- 630706ced04e 19 (unreleased) landed
-
Remove char_tolower() API.
- 1e493158d3d2 19 (unreleased) landed
-
Make regex "max_chr" depend on encoding, not provider.
- 19b966243c38 19 (unreleased) landed
-
Change some callers to use pg_ascii_toupper().
- 99cd8890beca 19 (unreleased) landed
-
Allow pg_locale_t APIs to work when ctype_is_c.
- 147602822597 19 (unreleased) landed
-
Add #define for UNICODE_CASEMAP_BUFSZ.
- 8d299052fe58 19 (unreleased) landed
-
Inline pg_ascii_tolower() and pg_ascii_toupper().
- ec4997a9d733 19 (unreleased) landed
-
Avoid global LC_CTYPE dependency in pg_locale_libc.c.
- f81bf78ce12b 19 (unreleased) landed
-
Force LC_COLLATE to C in postmaster.
- 5e6e42e44fe1 19 (unreleased) landed
-
Change wchar2char() and char2wchar() to accept a locale_t.
- 53cd0b71ee2e 19 (unreleased) landed
-
Use pg_ascii_tolower()/pg_ascii_toupper() where appropriate.
- d81dcc8d6243 19 (unreleased) landed
-
inet_net_pton.c: use pg_ascii_tolower() rather than tolower().
- 8898082a5d3e 18.0 landed
-
isn.c: use pg_ascii_toupper() instead of toupper().
- 7a6880fadc17 18.0 landed
-
contrib/spi/refint.c: use pg_ascii_tolower() instead.
- 78bd364ee39c 18.0 landed
-
copyfromparse.c: use pg_ascii_tolower() rather than tolower().
- 4c787a24e7e2 18.0 landed
-
Revert "Tidy up locale thread safety in ECPG library."
- 3c8e463b0d88 18.0 cited
-
Tidy up locale thread safety in ECPG library.
- 8e993bff5326 18.0 cited
-
All supported systems have locale_t.
- 8d9a9f034e92 17.0 cited
On 29.10.25 01:19, Jeff Davis wrote: > On Wed, 2025-07-23 at 19:11 -0700, Jeff Davis wrote: >> On Fri, 2025-07-11 at 11:48 +1200, Thomas Munro wrote: >>> On Fri, Jul 11, 2025 at 6:22 AM Jeff Davis <pgsql@j-davis.com> >>> wrote: >>>> I don't have a great windows development environment, and it >>>> appears CI >>>> and the buildfarm don't offer great coverage either. Can I ask >>>> for >>>> a >>>> volunteer to do the windows side of this work? >>> >>> Me neither but I'm willing to help with that, and have done lots of >>> closely related things through trial-by-CI... > > Attached a new patch series, v6. > > Rather than creating new global locale_t objects, this series (along > with a separate patch for NLS[1]) removes the dependency on the global > LC_CTYPE entirely. It's a bunch of small patches that replace direct > calls to tolower()/toupper() with calls into the provider. > > An assumption of these patches is that, in the UTF-8 encoding, the > logic in pg_tolower()/pg_toupper() is equivalent to > pg_ascii_tolower()/pg_ascii_toupper(). I'm getting a bit confused by all these different variant function names. Like we have now tolower TOLOWER char_tolower pg_tolower pg_strlower pg_ascii_tolower downcase_identifier and maybe more, and upper versions. This patch set makes changes like - else if (IS_HIGHBIT_SET(ch2) && isupper(ch2)) - ch2 = tolower(ch2); + else if (IS_HIGHBIT_SET(ch2)) + ch2 = TOLOWER(ch2); So there is apparently some semantic difference between tolower() and TOLOWER(), which is represented by the fact that the function name is all upper case? Actually, it's a macro and could mean different things in different contexts. And there is very little documentation accompanying all these different functions. For example, struct collate_methods and struct ctype_methods contain barely any documentation at all. Many of these issues are pre-existing, but I just figured it has reached a point where we need to do something about it.