Re: eliminate xl_heap_visible to reduce WAL (and eventually set VM on-access)
Melanie Plageman <melanieplageman@gmail.com>
Commits
GET /api/v1/messages/:b64id/commits
the thread's linked commits as JSON, with link sources.
API reference →
-
Remove table_scan_analyze_next_tuple unneeded parameter OldestXmin
- 284925508ae6 19 (unreleased) landed
-
Simplify visibility check in heap_page_would_be_all_visible()
- 3efe58febc3c 19 (unreleased) landed
-
Eliminate use of cached VM value in lazy_scan_prune()
- 648a7e28d7c2 19 (unreleased) landed
-
Combine visibilitymap_set() cases in lazy_scan_prune()
- 21796c267d0a 19 (unreleased) landed
-
Fix const qualification in prune_freeze_setup()
- 4877391ce894 19 (unreleased) landed
-
Simplify vacuum visibility assertion
- bd298f54a0d6 19 (unreleased) landed
-
Split heap_page_prune_and_freeze() into helpers
- e135e044572e 19 (unreleased) landed
-
Assert that cutoffs are provided if freezing will be attempted
- cd38b7e77315 19 (unreleased) landed
-
Split PruneFreezeParams initializers to one field per line
- 1e14edcea5e1 19 (unreleased) landed
-
Refactor heap_page_prune_and_freeze() parameters into a struct
- 1937ed70621e 19 (unreleased) landed
-
Make heap_page_is_all_visible independent of LVRelState
- 3e4705484e0c 19 (unreleased) landed
-
Inline TransactionIdFollows/Precedes[OrEquals]()
- 43b05b38ea4d 19 (unreleased) landed
-
Add helper for freeze determination to heap_page_prune_and_freeze
- c8dd6542bae4 19 (unreleased) landed
-
Bump XLOG_PAGE_MAGIC after xl_heap_prune change
- 4a8fb58671d3 19 (unreleased) landed
-
Correct prune WAL record opcode name in comment
- ae8ea7278c16 19 (unreleased) landed
-
Add error codes when vacuum discovers VM corruption
- 8ec97e78a771 19 (unreleased) landed
-
Remove unused xl_heap_prune member, reason
- 4b5f206de2bb 19 (unreleased) landed
-
Remove unneeded VM pin from VM replay
- 3399c265543e 19 (unreleased) landed
-
Add assert and log message to visibilitymap_set
- e3d5ddb7ca91 19 (unreleased) landed
-
Add error codes to some corruption log messages
- fd6ec93bf890 13.0 cited
Attachments
- v29-0001-Combine-visibilitymap_set-cases-in-lazy_scan_pru.patch (text/x-patch) patch v29-0001
- v29-0002-Eliminate-use-of-cached-VM-value-in-lazy_scan_pr.patch (text/x-patch) patch v29-0002
- v29-0003-Refactor-lazy_scan_prune-VM-clear-logic-into-hel.patch (text/x-patch) patch v29-0003
- v29-0004-Set-the-VM-in-heap_page_prune_and_freeze.patch (text/x-patch) patch v29-0004
- v29-0005-Move-VM-assert-into-prune-freeze-code.patch (text/x-patch) patch v29-0005
- v29-0006-Eliminate-XLOG_HEAP2_VISIBLE-from-vacuum-phase-I.patch (text/x-patch) patch v29-0006
- v29-0007-Eliminate-XLOG_HEAP2_VISIBLE-from-empty-page-vac.patch (text/x-patch) patch v29-0007
- v29-0008-Remove-XLOG_HEAP2_VISIBLE-entirely.patch (text/x-patch) patch v29-0008
- v29-0009-Simplify-heap_page_would_be_all_visible-visibili.patch (text/x-patch) patch v29-0009
- v29-0010-Use-GlobalVisState-in-vacuum-to-determine-page-l.patch (text/x-patch) patch v29-0010
- v29-0011-Unset-all_visible-sooner-if-not-freezing.patch (text/x-patch) patch v29-0011
- v29-0012-Track-which-relations-are-modified-by-a-query.patch (text/x-patch) patch v29-0012
- v29-0013-Pass-down-information-on-table-modification-to-s.patch (text/x-patch) patch v29-0013
- v29-0014-Allow-on-access-pruning-to-set-pages-all-visible.patch (text/x-patch) patch v29-0014
- v29-0015-Set-pd_prune_xid-on-insert.patch (text/x-patch) patch v29-0015
Attached v29 addresses some feedback and also corrects a small error with the assertion I had added in the previous version's 0009. On Thu, Dec 18, 2025 at 10:38 PM Xuneng Zhou <xunengzhou@gmail.com> wrote: > > I’ve done a basic review of patches 1 and 2. Here are some comments > which may be somewhat immature, as this is a fairly large change set > and I’m new to some parts of the code. > > 1) Potential stale old_vmbits after VM repair n v2 Good catch! I've fixed this in attached v29. > 2) Add Assert(BufferIsDirty(buf)) > > Since the patch's core claim is "buffer must be dirty before WAL > registration", an assertion encodes this invariant. Should we add: > > Assert(BufferIsValid(buf)); > Assert(BufferIsDirty(buf)); > > right before the visibilitymap_set() call? There are already assertions that will trip in various places -- most importantly in XLogRegisterBuffer(), which is the one that inspired this refactor. > The comment at lines: > > "The only scenario where it is not already dirty is if the VM was removed…" > > This phrasing could become misleading after future refactors. Can we > make it more direct like: > > > "We must mark the heap buffer dirty before calling visibilitymap_set(), because it may WAL-log the buffer and XLogRegisterBuffer() requires it." I see your point about future refactors missing updating comments like this. But, I don't think we are going to refactor the code such that we can have PD_ALL_VISIBLE set without the VM bits set more often. Also, it is common practice in Postgres to describe very specific edge cases or odd scenarios in order to explain code that may seem confusing without the comment. It does risk that comment later becoming stale, but it is better that future developers understand why the code is there. That being said, I take your point that the comment is confusing. I have updated it in a different way. > > "Even if PD_ALL_VISIBLE is already set, we don't need to worry about unnecessarily dirtying the heap buffer, as it must be marked dirty before adding it to the WAL chain. The only scenario where it is not already dirty is if the VM was removed..." > > In this test we now call MarkBufferDirty() on the heap page even when > only setting the VM, so the comments claiming “does not need to modify > the heap buffer”/“no heap page modification” might be misleading. It > might be better to say the test doesn’t need to modify heap > tuples/page contents or doesn’t need to prune/freeze. The point I'm trying to make is that we have to dirty the buffer even if we don't modify the page because of the XLOG sub-system requirements. And, it may seem like a waste to do that if not modifying the page, but the page will rarely be clean anyway. I've tried to make this more clear in attached v29. - Melanie