Re: Extended Statistics set/restore/clear functions.
Corey Huinker <corey.huinker@gmail.com>
Commits
GET /api/v1/messages/:b64id/commits
the thread's linked commits as JSON, with link sources.
API reference →
-
Add test doing some cloning of extended statistics data
- fc365e4fccc4 19 (unreleased) landed
-
Add test for pg_restore_extended_stats() with multiranges
- 0b7beec42ae2 19 (unreleased) landed
-
Add support for "mcv" in pg_restore_extended_stats()
- efbebb4e8587 19 (unreleased) landed
-
Include extended statistics data in pg_dump
- c32fb29e979d 19 (unreleased) landed
-
Add support for "dependencies" in pg_restore_extended_stats()
- 302879bd68d1 19 (unreleased) landed
-
Add test for MAINTAIN permission with pg_restore_extended_stats()
- d9abd9e1050d 19 (unreleased) landed
-
Add pg_restore_extended_stats()
- 0e80f3f88dea 19 (unreleased) landed
-
Add routine to free MCVList
- 7ebb64c55757 19 (unreleased) landed
-
Improve pg_clear_extended_stats() with incorrect relation/stats combination
- 395b73c045e0 19 (unreleased) landed
-
Add pg_clear_extended_stats()
- d756fa1019ff 19 (unreleased) landed
-
Introduce routines to validate and free MVNDistinct and MVDependencies
- 32e27bd32082 19 (unreleased) landed
-
Fix typo in stat_utils.c
- eee19a30d60d 19 (unreleased) landed
-
Move attribute statistics functions to stat_utils.c
- 213a1b895270 19 (unreleased) landed
-
Improve error messages of input functions for pg_dependencies and pg_ndistinct
- f68597ee777d 19 (unreleased) landed
-
Improve test output of extended statistics for ndistinct and dependencies
- 2f04110225ab 19 (unreleased) landed
-
Fix some compiler warnings
- 7bc88c3d6f3a 19 (unreleased) landed
-
Add input function for data type pg_dependencies
- e1405aa5e3ac 19 (unreleased) landed
-
Add input function for data type pg_ndistinct
- 44eba8f06e55 19 (unreleased) landed
-
Rework output format of pg_dependencies
- e76defbcf09e 19 (unreleased) landed
-
Rework output format of pg_ndistinct
- 1f927cce4498 19 (unreleased) landed
-
Fix comments of output routines for pg_ndistinct and pg_dependencies
- 040a39ed25bf 19 (unreleased) landed
-
Move code specific to pg_dependencies to new file
- 2ddc8d9e9baa 19 (unreleased) landed
-
Move code specific to pg_ndistinct to new file
- a5523123430f 19 (unreleased) landed
-
Document some structures in attribute_stats.c
- d6c132d83bff 19 (unreleased) landed
-
Fix FATAL message for invalid recovery timeline at beginning of recovery
- 71f17823ba01 18.0 cited
Attachments
- v11-0001-Make-pg_ndinstinct-a-proper-adt.patch (text/x-patch) patch v11-0001
- v11-0002-Make-pg_dependencies-a-proper-adt.patch (text/x-patch) patch v11-0002
- v11-0003-Refactor-output-format-of-pg_ndistinct.patch (text/x-patch) patch v11-0003
- v11-0004-Refactor-output-format-of-pg_dependencies.patch (text/x-patch) patch v11-0004
- v11-0005-Add-working-input-function-for-pg_ndistinct.patch (text/x-patch) patch v11-0005
- v11-0006-Add-working-input-function-for-pg_dependencies.patch (text/x-patch) patch v11-0006
- v11-0007-Expose-attribute-statistics-functions-for-use-in.patch (text/x-patch) patch v11-0007
- v11-0008-Add-extended-statistics-support-functions.patch (text/x-patch) patch v11-0008
- v11-0009-Include-Extended-Statistics-in-pg_dump.patch (text/x-patch) patch v11-0009
> > Thanks for the new patch. And FWIW I disagree with this approach: > cleanup and refactoring pieces make more sense if done first, as these > lead to less code churn in the final result. So... I've begun to put > my hands on the patch set. The whole has been restructured a bit, as > per the attached. Patch 0001 to 0004 feel OK here, these include two > code moves and the two output functions: > - Two new files for adt/, that I'm planning to apply soon as a > separate cleanup. > - New output functions, with keys added to a new header named > statistics_format.h, for frontend and backend consumption. > Agreed, 0001-0004 all look good. > Next comes the input functions. First, I am unhappy with the amount > of testing that has been put into ndistinct, first and only input > facility I've looked at in details for the moment. I have quickly > spotted a couple a few issues while testing buggy input, like this one > that crashes on pointer dereference, not good obviously: > SELECT '[]'::pg_ndistinct; > - I put some work into more specific error messages for invalid values for both pg_ndistinct and pg_dependencies. - The check for empty attribute lists and item lists now occur in the array-end event handler. - Also tried to standardize conventions between the two data types (switch statements, similar utility functions, etc). > > These are checked in the patches that introduce the functions like > with pg_ndistinct_validate_items(), based on the list of stxkeys we > have. However, I think that this is not enough by itself. Shouldn't > we check that the list of items in the array is what we expect based > on the longest "attributes" array at least, even after a JSON that was > parsed? That would be cheap to check in the output function itself, > at least as a first layer of checks before trying something with the > import function and cross-checking the list of attributes for the > extended statistics object. I added tests for both duplicate attribute sequences as well as making the first-longest attribute sequence the template by which all later and shorter sequences are checked. I had been reluctant to add checks like this, because so many similar validations were removed from the earlier statistics code like histograms and the like. > > I suspect a similar family of issues with pg_dependencies, and it > would be nice to move the tests with the input function into a new > regression file, like the other one. > Did so. 0001-0004,0007,0009 unchanged. Significant modification of the stats_import.sql regression tests in 0008 to conform to stricter datatype rules enacted in 0005, 0006.