Re: Remaining dependency on setlocale()
Peter Eisentraut <peter@eisentraut.org>
Commits
GET /api/v1/messages/:b64id/commits
the thread's linked commits as JSON, with link sources.
API reference →
-
fuzzystrmatch: use pg_ascii_toupper().
- b96a9fd76f32 19 (unreleased) landed
-
Avoid global LC_CTYPE dependency in pg_locale_icu.c.
- 0a90df58cf38 19 (unreleased) landed
-
downcase_identifier(): use method table from locale provider.
- 87b2968df0f8 19 (unreleased) landed
-
ltree: fix case-insensitive matching.
- 806555e3000d 18.2 landed
- 7f007e4a044a 19 (unreleased) landed
-
Fix multibyte issue in ltree_strncasecmp().
- 898991966bc9 14.21 landed
- 335b2f30b468 15.16 landed
- b80227c0a54c 16.12 landed
- b8cfe9dc2e7f 17.8 landed
- f79e239e0bc6 18.2 landed
- 84d5efa7e3eb 19 (unreleased) landed
-
Use multibyte-aware extraction of pattern prefixes.
- 9c8de1596912 19 (unreleased) landed
-
Add pg_iswcased().
- 630706ced04e 19 (unreleased) landed
-
Remove char_tolower() API.
- 1e493158d3d2 19 (unreleased) landed
-
Make regex "max_chr" depend on encoding, not provider.
- 19b966243c38 19 (unreleased) landed
-
Change some callers to use pg_ascii_toupper().
- 99cd8890beca 19 (unreleased) landed
-
Allow pg_locale_t APIs to work when ctype_is_c.
- 147602822597 19 (unreleased) landed
-
Add #define for UNICODE_CASEMAP_BUFSZ.
- 8d299052fe58 19 (unreleased) landed
-
Inline pg_ascii_tolower() and pg_ascii_toupper().
- ec4997a9d733 19 (unreleased) landed
-
Avoid global LC_CTYPE dependency in pg_locale_libc.c.
- f81bf78ce12b 19 (unreleased) landed
-
Force LC_COLLATE to C in postmaster.
- 5e6e42e44fe1 19 (unreleased) landed
-
Change wchar2char() and char2wchar() to accept a locale_t.
- 53cd0b71ee2e 19 (unreleased) landed
-
Use pg_ascii_tolower()/pg_ascii_toupper() where appropriate.
- d81dcc8d6243 19 (unreleased) landed
-
inet_net_pton.c: use pg_ascii_tolower() rather than tolower().
- 8898082a5d3e 18.0 landed
-
isn.c: use pg_ascii_toupper() instead of toupper().
- 7a6880fadc17 18.0 landed
-
contrib/spi/refint.c: use pg_ascii_tolower() instead.
- 78bd364ee39c 18.0 landed
-
copyfromparse.c: use pg_ascii_tolower() rather than tolower().
- 4c787a24e7e2 18.0 landed
-
Revert "Tidy up locale thread safety in ECPG library."
- 3c8e463b0d88 18.0 cited
-
Tidy up locale thread safety in ECPG library.
- 8e993bff5326 18.0 cited
-
All supported systems have locale_t.
- 8d9a9f034e92 17.0 cited
On 29.11.25 21:50, Jeff Davis wrote: > All fixed, thank you! (I apologize for posting a patch in that state to > begin with...) > > I also reorganized slightly to separate out the pg_iswcased() API into > its own patch, and moved the like_support.c changes from the ctype_is_c > patch (already committed: 1476028225) into the pattern prefixes patch. I reviewed the v11 patches. But I wasn't able to apply them locally (couldn't find a starting commit where they applied cleanly), so I haven't tested them. Patches 0001 through 0006 seem generally ok, with some small comments: v11-0003-Fix-inconsistency-between-ltree_strncasecmp-and-.patch The function comment reads "Check if b has a prefix of a." -- Is that the same as "Check if a is a prefix of b."? The latter might be clearer. v11-0004-Remove-char_tolower-API.patch The updated comment reads + * For efficiency reasons, in the C locale we don't call lower() on the + * pattern and text, but instead call SB_lower_char on each character. but the patch removes SB_lower_char(). v11-0006-Use-multibyte-aware-extraction-of-pattern-prefix.patch Might have a small typo in the commit message: ; and preserve and char-at-a-time logic for bytea. For the remaining patches I have some more substantial questions. v11-0007-fuzzystrmatch-use-pg_ascii_toupper.patch dmetaphone.c has a comment case '\xc7': /* C with cedilla */ so the premise that "fuzzystrmatch is designed for ASCII" does not appear to be correct. Needs more analysis. (But apparently it's not multibyte aware at all, so I don't know what to do about that.) v11-0008-downcase_identifier-use-method-table-from-locale.patch I'm confused here about the name of the function pg_strfold_ident(). In general, case "folding" results in an opaque string that is really only useful for comparing against other case-folded strings. But for identifiers we are actually interested lower-casing. I think this should be corrected in the API naming. v11-0009-Control-LC_COLLATE-with-GUC.patch I know there were some complaints about compatibility with extensions, but I don't think anything concrete was presented. I would like to see more evidence that we need this. Also, recall that we used to have a lc_collate GUC, and in the end people got confused that it didn't actually show a meaningful value when you used ICU. So we removed that. It seems adding this back in would create a similar kind of confusion. So to avoid that, maybe this should be called fallback_lc_collate or something like that. If we were to proceed with this patch, it should have some documentation and tests.