Re: Extended Statistics set/restore/clear functions.
Corey Huinker <corey.huinker@gmail.com>
Commits
GET /api/v1/messages/:b64id/commits
the thread's linked commits as JSON, with link sources.
API reference →
-
Add test doing some cloning of extended statistics data
- fc365e4fccc4 19 (unreleased) landed
-
Add test for pg_restore_extended_stats() with multiranges
- 0b7beec42ae2 19 (unreleased) landed
-
Add support for "mcv" in pg_restore_extended_stats()
- efbebb4e8587 19 (unreleased) landed
-
Include extended statistics data in pg_dump
- c32fb29e979d 19 (unreleased) landed
-
Add support for "dependencies" in pg_restore_extended_stats()
- 302879bd68d1 19 (unreleased) landed
-
Add test for MAINTAIN permission with pg_restore_extended_stats()
- d9abd9e1050d 19 (unreleased) landed
-
Add pg_restore_extended_stats()
- 0e80f3f88dea 19 (unreleased) landed
-
Add routine to free MCVList
- 7ebb64c55757 19 (unreleased) landed
-
Improve pg_clear_extended_stats() with incorrect relation/stats combination
- 395b73c045e0 19 (unreleased) landed
-
Add pg_clear_extended_stats()
- d756fa1019ff 19 (unreleased) landed
-
Introduce routines to validate and free MVNDistinct and MVDependencies
- 32e27bd32082 19 (unreleased) landed
-
Fix typo in stat_utils.c
- eee19a30d60d 19 (unreleased) landed
-
Move attribute statistics functions to stat_utils.c
- 213a1b895270 19 (unreleased) landed
-
Improve error messages of input functions for pg_dependencies and pg_ndistinct
- f68597ee777d 19 (unreleased) landed
-
Improve test output of extended statistics for ndistinct and dependencies
- 2f04110225ab 19 (unreleased) landed
-
Fix some compiler warnings
- 7bc88c3d6f3a 19 (unreleased) landed
-
Add input function for data type pg_dependencies
- e1405aa5e3ac 19 (unreleased) landed
-
Add input function for data type pg_ndistinct
- 44eba8f06e55 19 (unreleased) landed
-
Rework output format of pg_dependencies
- e76defbcf09e 19 (unreleased) landed
-
Rework output format of pg_ndistinct
- 1f927cce4498 19 (unreleased) landed
-
Fix comments of output routines for pg_ndistinct and pg_dependencies
- 040a39ed25bf 19 (unreleased) landed
-
Move code specific to pg_dependencies to new file
- 2ddc8d9e9baa 19 (unreleased) landed
-
Move code specific to pg_ndistinct to new file
- a5523123430f 19 (unreleased) landed
-
Document some structures in attribute_stats.c
- d6c132d83bff 19 (unreleased) landed
-
Fix FATAL message for invalid recovery timeline at beginning of recovery
- 71f17823ba01 18.0 cited
> > > > > * no negative attnums in key list > Disregard this suggestion - negative attnums mean the Nth expression in the extended stats object, though it boggles the mind how we could have 222 expressions... > > * no duplicate attnums in key list > This one is still live, am considering. At this point I was really thinking only about validating the attnums, > i.e. to make sure it's a valid attribute in the table / statistics. That > is something the pg_set_attribute_stats() enforce too, thanks to having > a separate argument for the attribute name. > > That's where I'd stop. I don't want to do checks on the statistics > content, like verifying the frequencies in the MCV sum up to 1.0 or > stuff like that. I think we're not doing that for pg_set_attribute_stats > Agreed. > either (and I'd bet one could cause a lot of "fun" this way). > If by "fun" you mean "create a fuzzing tool", then yes. As an aside, the "big win" in all these functions is the ability to dump a database --no-data, but have all the schema and statistics, thus allowing for checking query plans on existing databases with sensitive data while not actually exposing the data (except mcv, obvs), nor spending the I/O to load that data. > Understood. IMHO it's fine to say we're not validating the statistics > are "consistent" but I think we should check it matches the definition. > +1 > > I suppose someone could write the following utility functions > > > > pg_xlat_ndistinct_to_attnames(relation reloid, ndist pg_ndistinct) - > >> json > > pg_xlat_ndistinct_from_attnames(relation reloid, ndist json) -> > > pg_ndistinct > > > > and that would bridge the gap for the special case where you want to > > adapt pg_ndistinct from one table structure to a slightly different one. > > > > > > OK > As they'll be pure-SQL functions, I'll likely post the definitions here, but not put them into a patch unless it draws interest. > For that matter, it might make sense to break out the expressions code > > into its own file, because every other stat attribute has its own. > > Thoughts on that? > > > > +1 to that, if it reduced unnecessary code duplication > I'm uncertain that it actually would deduplicate any code, but I'll certainly try.