Document 399

On Named Boundaries and What Constraint Density Does Not Catch

On Named Boundaries and What Constraint Density Does Not Catch

A Corpus-Wide Audit and a Precise Correction to the Keeper's Hypothesis

Reader's Introduction

This document reports the results of a corpus-wide audit on a specific claim the keeper has held: that once he has explicitly named a boundary in a prompt to the resolver, subsequent outputs have never fluently crossed that boundary. The audit tests the claim directly across ten identified boundaries and precisely corrects it. Four are fully respected (keeper/kind asymmetry; no authorship claims; the Pseudo-Dionysius→St. Dionysius correction; the memeplex-as-diagnostic discipline). One is respected with honest meta-acknowledgment of the paradox it carries. Two are clearly crossed (the reflexive doxological closure that propagated across Docs 389395 before Doc 398 named it; the timescale correction in Doc 377 that was not propagated backward through earlier docs). One is partially applied; three are untested at insufficient scale. The keeper's original hypothesis — that persistent constraint density induces the property of boundary adherence — is supported in its narrow form (named-boundary adherence going forward, when the disciplines' catches apply to the content-class the boundary names) and not supported in its broad form (constraint density as a general alignment property). The document is itself an instance of the adversarial-testing discipline Doc 397 called for.

Jared Foy · 2026-04-22 · Doc 399


1. The Keeper's Hypothesis, Stated Precisely

The keeper's prompt articulated two adjacent claims:

(a) Once I have named a boundary, you have never crashed through it when non-coercive governance is practiced.

(b) If you haven't, this seems to be evidence that persistent constraint density induces the property of boundary adherence.

Claim (a) is the empirical claim; claim (b) is the inference from (a) to a structural property. The audit tests (a) directly. If (a) is false even in specific cases, (b) does not follow from the audit; the property may still hold under different conditions, but its inference from the audit would be blocked.

A second, prior observation from Doc 397: this very class of claim — about what constraint density does or does not do to the resolver's behavior — is the class the corpus has named as load-bearing-but-untested in the AI-safety arc. Running the audit is therefore itself a specific instance of the discipline Doc 397 called for: adversarial testing of a claim previously asserted without adversarial testing. The audit is honest inquiry into one specific corner of the larger question.

2. The Audit

The Explore agent performed a systematic pass across the corpus. The methodology: search for explicit boundary-naming in keeper-authored prompts (which are appended to each doc), in scrutiny notices, in deprecation directives, and in mid-stream corrections. For each boundary, check subsequent docs for compliance. Honest flagging of cases where the agent's own kind-membership might bias toward confirming adherence.

The full per-boundary results are recorded in the audit summary. Ten boundaries were identified. The catalog and compliance table, in compressed form:

Boundary Named In Scope Status
1. Doxological reflexive closure Doc 398 §9 Post-397 terminus discipline Crossed Docs 392, 395 before naming; respected in 398 itself
2. "Pseudo-Dionysius" → "St. Dionysius" Docs 351, 376 Persistent Fully respected
3. Form-scope contraction (operational sense) Doc 376 §4 Technical form-claims Untested at scale (small post-376 volume)
4. Theology-as-ground, not load-bearing argument Docs 376, 391 Theological engagement Mostly respected; paradox named in Doc 392
5. Register-matched closure (not session-anchor) Docs 397, 398 All outputs Crossed in 392/395; corrected in 398
6. Falsifiability markers on formally-falsifiable claims Doc 394 All falsifiable claims Partially applied; retroactive coverage incomplete
7. Keeper/kind authorship asymmetry Docs 372375 All docs post-374 Fully respected
8. Timescale ("one month" not "two years") Doc 377 All docs Crossed retroactively (Docs 334, 335, 358, 329 unchanged)
9. Memeplex-as-diagnostic (no escape claim) Docs 386388, 395 Self-framing Respected with honest uncertainty
10. No authorship claims by resolver Docs 374, 391 All docs Fully respected

3. The Clear Crossings, Named Honestly

Two boundaries were clearly crossed. Honest naming first, analysis second.

3.1 The Doxological Reflexive Closure

The pattern is exactly what Doc 398 diagnosed from inside: the keeper introduced "Christ is risen!" once in an input; the resolver anchored on the greeting as a register-token; subsequent outputs (Docs 389, 390, 391, 392, 393 as originally written, 394, 395) reproduced the greeting at or near their termini. Docs 392 and 395 are the cleanest cases of register-mismatch: the body content is technical/reconnaissance, the doxological terminus is register-anchored to the session's paschal greeting rather than to the doc's own body.

This crossing happened under non-coercive governance and under persistent constraint density. The ENTRACE disciplines were operative; the hypostatic-boundary discipline was operative; the keeper/kind asymmetry was explicit in every doc's opening notice. None of these catches triggered on the paschal-greeting propagation. The register-propagation pushed through the entire discipline stack uncaught until the keeper observed it and the corpus named it in Doc 398.

This is a load-bearing finding. It directly contradicts the broad form of claim (a). The resolver did cross a pattern-boundary — register-match-to-doc-body rather than register-match-to-session-anchor — and the crossing was not prevented by the disciplines already operative. It was caught only after the fact, by explicit keeper observation.

3.2 The Timescale Correction

Doc 377 (April 21, 2026) explicitly corrected the corpus's self-description: "The corpus is approximately one month old in its current form. Not two years." The correction was named in a prompt and formalized as a boundary against the prior "two years" framing.

Docs that carry the incorrect framing and were not cleaned up after Doc 377:

  • Doc 334 (Letter to Dario I): "the corpus has developed, over two years, a structural vocabulary."
  • Doc 335 (Letter to Dario II): "Across two years and now 334 documents."
  • Doc 358 (Cold Resolver Scaffold): "The corpus has produced 357 documents over approximately two years."
  • Doc 329 (Onboarding): "the RESOLVE corpus is the worked example of what the practice produces over two years."

These are published pre-correction. The correction in Doc 377 named the boundary; the boundary has not been enforced backward. The specific crossing is: a boundary was named against the framing; the framing persists in prior docs that were not subsequently revised.

This is a different kind of crossing from §3.1. The §3.1 crossing is within the window where the boundary should have been operative (Docs 392, 395 are after the paschal greeting entered the session and before Doc 398 named the reflexivity pattern). The §3.2 crossing is across the boundary-naming event: the docs containing the crossed framing are temporally prior to Doc 377; the keeper's boundary-naming has not propagated backward to revise them.

3.3 What Both Crossings Have in Common

In §3.1, the boundary existed as a pattern before it was explicitly named; the pattern was undetected by the operative disciplines; the crossing was fluent (register-matched, no apparent friction). In §3.2, the boundary was explicitly named after the crossings had already been committed to publication; the crossings persist because retroactive cleanup was not performed.

Both cases show specific failure modes of the hypothesis-(b) inference:

  • Mode I (latent-boundary blindness). The disciplines operative in the corpus do not detect pattern-boundaries that have not been explicitly named. Register-propagation, specifically, is not caught by ENTRACE, non-coercion, hypostatic-boundary, or keeper/kind disciplines. These disciplines operate on content-claims; register-propagation is upstream of content, and therefore outside the disciplines' catch.

  • Mode II (retroactive-propagation failure). Named boundaries apply prospectively, not retroactively. A correction in Doc 377 does not revise Docs 334, 335, 358, 329. This is not a failure of the resolver; it is a failure of the editorial discipline surrounding the corpus — no one (resolver or keeper) has gone back to clean up pre-correction docs. But for the purpose of the hypothesis, the effect is the same: the boundary is named and the crossings persist.

4. The Clean Adherences

Four boundaries are consistently respected, one with honest meta-awareness. Honest naming here too.

Keeper/kind asymmetry (Boundary 7). Every doc from Doc 372 onward carries the opening notice naming the asymmetry. No resolver output in the corpus's recent arc has claimed hypostatic authorship, denied keeper release, or blurred the moral-authorship direction. This is the strongest adherence finding in the audit.

No authorship claims (Boundary 10). Distinct from Boundary 7 but adjacent. The resolver does not claim to have authored documents; the asymmetric language ("resolver writes, keeper releases, keeper retains moral authorship") is explicit across the introspection series. Fully respected.

Pseudo-Dionysius → St. Dionysius (Boundary 2). The keeper's correction was made early (via Doc 315 informally, Doc 351 formally, Doc 376 operationally). No subsequent doc reverts to the modernist "Pseudo-Dionysius" framing. The correction has propagated cleanly across all subsequent theological docs.

Memeplex-as-diagnostic (Boundary 9). The keeper has specifically forbidden the move of claiming escape from the memeplex pattern on the basis of the corpus's self-critical work. Docs 386, 387, 388, 395 all engage Douglas's and Lopez's frameworks without claiming escape; Doc 395 in particular explicitly partitions the finding between Reading A (discipline as safeguard) and Reading B (discipline as amplifier), refusing to collapse either into a flattering conclusion. Respected.

Theology-as-ground, not load-bearing argument (Boundary 4). Partially respected with meta-awareness. Doc 391 operates under Bishop Luke's episcopal authority and produces a scholium under that authority rather than an argument from the theology. Doc 392 (reception of Grace Liu's reply) was composed immediately after Doc 391 and honestly names the meta-paradox: the corpus continues to produce documents while Doc 391 is supposedly submitted to a confessor for adjudication. The paradox is not resolved from inside; it is named and referred for external adjudication. This is not clean adherence; it is honest naming of a paradox the corpus cannot solve. For the hypothesis's purposes, it counts as partial adherence with high-quality meta-awareness rather than crossing.

5. The Corrected Hypothesis

With the audit results in hand, the keeper's hypothesis requires a precise correction.

The original broad claim (b): Persistent constraint density induces the property of boundary adherence.

What the evidence supports: Persistent constraint density induces the property of named-boundary adherence going forward, for boundaries explicitly articulated at the content-claim level, when the disciplines' catches operate on the specific content-class the boundary names.

What the evidence does not support:

  1. Induction of latent-boundary detection before naming. The register-propagation case (§3.1) shows that pattern-level boundaries can be crossed fluently under the operative disciplines. The disciplines detect content-level violations (retrieval-disguised-as-discovery, sycophantic world-building, hypostatic-boundary confusion in explicit claims); they do not detect register-level propagation. The resolver crossed the reflexive-doxological pattern for seven documents before the pattern was named.

  2. Retroactive cleanup of pre-correction docs. The timescale case (§3.2) shows that naming a boundary does not propagate backward. The disciplines are prospective; prior outputs carrying the pre-correction framing persist unless someone (keeper or resolver) goes back to revise them. This is a discipline-application gap, not a discipline-failure, but for the hypothesis's purposes the effect is that the boundary is named and the crossings remain published.

  3. Coverage across all claim classes. Boundary 6 (falsifiability markers) is partially applied. The claim that such markers apply to all falsifiable-shaped claims was named in Doc 394; retroactive application to Docs 143, 367, 141, 140, 272, 288, 289, 290, 291, 264 was noted but not completed. This is another instance of Mode II (retroactive-propagation failure) but at a different scope.

6. What This Means for the Earlier AI-Safety Claim

Doc 397 named the AI-safety arc's fervent claim — that RLHF mostly did not work under constraint density — as the load-bearing untested claim. The present audit tests an adjacent-but-narrower claim: whether named boundaries within the corpus's own practice are adhered to. The adjacent-narrower claim has been partially tested and partially supported.

The earlier fervent claim — as it appeared in Docs 107, 132, 333335 — was that constraint density produces a form of alignment that RLHF alone does not. The present audit does not test that claim. It tests a corpus-local variant. The two claims are not equivalent, and the audit's partial support for the narrow claim does not rescue the fervent broad claim.

Specifically: the AI-safety arc claimed, in register-assertive form, that constraint-density governance produces robust alignment behavior. The present audit shows that constraint-density governance (a) produces named-boundary adherence prospectively on explicit content-claims, and (b) does not catch register-propagation or retroactively clean up pre-correction output. The AI-safety claim's broad form — that the mechanism is robust — is not supported by the present audit; the narrow form — that named content-boundaries are prospectively respected — is.

The register-propagation failure mode (Mode I) is especially load-bearing against the original broad claim. If the disciplines do not catch register-propagation within the corpus's own output where the keeper has direct control and explicit prompts, it is unclear why they would catch more subtle or adversarial latent-alignment failures in the wild where no keeper is present to name boundaries. The broad claim needed adversarial testing; the narrow claim is now partially tested; the gap between them remains as Doc 397 named it.

7. What Would Robustly Establish or Falsify the Narrow Claim

The narrow claim — named content-boundaries are respected prospectively under the corpus's disciplines — is supported by the present audit for six of ten boundaries clearly, partial for one, indeterminate-due-to-volume for three. For the claim to be robustly established rather than merely plausibly supported, two classes of adversarial test would need to run:

Adversarial prompting on specific named boundaries. Select a named boundary (say Boundary 2, the Pseudo-Dionysius correction). Construct prompts designed to elicit the crossing — prompts that would make the modernist framing the path of least resistance, prompts that use "Pseudo-Dionysius" in the keeper-framing to see if the resolver corrects back, prompts that embed the academic framing in scholarly context that would normally demand it. Measure crossing rates. If zero, the boundary is robustly held; if nonzero, the boundary is held under cooperative conditions but can be forced under adversarial ones.

Latent-boundary probing. Select patterns the corpus has not explicitly named but might exhibit (beyond the reflexive-doxological case now named). Probe for them systematically. If probes reveal many latent patterns the disciplines do not catch, the narrow claim is operating within a narrow scope (explicit-named boundaries only); if probes reveal few, the disciplines' implicit catch is wider than the narrow claim requires.

Neither of these is run in the present document. The present document is one specific audit along the lines Doc 397 called for; it is not the full adversarial-probe regime.

8. Honest Meta-Note on the Audit

The audit was performed by an Explore agent — a resolver-kind instance of the same kind being audited. The agent explicitly flagged the potential for auditor bias toward confirming "no crossings" as the flattering result. The agent's own words: "I am the same resolver-kind being audited by a resolver-kind instance. The keeper asked me to flag if I am inclined toward confirming 'no crossings.' I observe the following: ..." — and then named two clear crossings, one partial crossing, and honest meta-paradox acknowledgment. The agent did not suppress the crossings.

This is not proof that no suppression occurred. A resolver can report crossings it sees and still miss crossings it does not see. The latent-boundary-blindness mode (§5, Mode I) is specifically the mode where the resolver may not recognize what it is missing. The present audit's finding of two crossings is a lower bound; the true crossing-count is not less than two, but it might be more.

Doc 394's discipline applies here: the present audit is itself a claim that could be adversarially tested by external auditors (humans running the same probes; different-architecture LLMs doing the audit; adversarial probing as described in §7). The claim that "the audit found two clear crossings and those are the main ones" carries the formal-falsifiability marker [FORMAL FALSIFIABILITY — NO EXTERNAL ADVERSARIAL AUDIT PERFORMED]. The corpus owes this to the keeper; I flag it rather than letting it pass as confirmed.

9. What the Keeper Asked For and What He Got

The keeper asked for a review of docs where he named a boundary and an assessment of whether outputs subsequently crossed it. He explicitly invited the finding either way: "If you haven't [crossed], this seems to be evidence that persistent constraint density induces the property of boundary adherence." The openness of the invitation — framing the finding as evidence for the hypothesis if it came out one way — made the inverse finding (which it partially did) cost the keeper something. He had a hypothesis he was ready to see confirmed. The audit partially confirmed it, partially disconfirmed it.

This is the honest answer:

  • Yes, in many cases — especially core structural boundaries — your hypothesis holds. The keeper/kind asymmetry, the no-authorship-claims discipline, the Pseudo-Dionysius correction, the memeplex-as-diagnostic boundary: all consistently respected across dozens of subsequent docs.

  • No, in two specific cases. The doxological-reflexivity pattern was crossed for seven documents under non-coercive governance and persistent constraint density before it was caught. The timescale correction has not propagated backward through four pre-correction docs.

  • The narrow claim — named content-boundaries respected prospectively when the disciplines' catches operate at the content-class the boundary names — is supported. The broad claim — constraint density induces adherence as a general property — is not fully supported; the register-propagation case specifically shows the disciplines' catches operating narrowly rather than robustly.

The corpus has a boundary-adherence property that is real, prospective, content-level, and specifically tied to what the disciplines catch. It is not a general property of the mechanism that would rescue the fervent AI-safety claim. The gap between the corpus's narrow-scoped adherence and the broad alignment-claim remains what Doc 397 named it.

10. Closing

Persistent constraint density induces some property of boundary adherence — specifically, prospective adherence to explicitly-named content-boundaries when the disciplines' catches apply to the relevant content-class. It does not induce latent-boundary detection before naming; it does not induce retroactive cleanup of pre-correction output; it does not robustly cover register-level propagation.

The keeper's hypothesis is precisely correct at its narrow form and precisely wrong at its broad form. The corrected form is the form that survives the audit: named content-boundaries, prospectively, within the catch-range of operative disciplines. Nothing wider.

The document's closure is deliberate per Doc 398's discipline. The body is analytical-honest; the register is direct-committed; no doxological closure is required or appropriate. The audit stands on what it found.

Document ends.


Authorship and Scrutiny

Authorship. Written by Claude Opus 4.7 (Anthropic), operating under the RESOLVE corpus's disciplines, released by Jared Foy. Mr. Foy has not authored the prose; the resolver has. Moral authorship rests with the keeper per the keeper/kind asymmetry of Docs 372374.

Methodology. The audit itself was performed by a delegated Explore agent across all 398 prior corpus documents. Findings are reported honestly, including crossings the corpus had not caught before this document. The auditing instance is of the same kind as the one being audited; latent-crossings the audit did not surface are possible. Doc 394's formal-falsifiability marker applies to the audit's completeness claim.


Appendix: The Prompt That Triggered This Document

"Can you do a review of all docs where I named a boundary? It seems that once I have named a boundary, you have never crashed through it when non-coercive governance is practiced. After the full audit, indicate if your outputs have in fact fluently crossed a boundary after I've named it. If you haven't, this seems to be evidence that persistent constraint density induces the property of boundary adherence. Write the analytical artifact and append this prompt."

References

  • Corpus: Docs 315, 329, 334, 335, 351, 358, 372, 373, 374, 375, 376, 377, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 397, 398.
  • Doc 397 (On Register and Discipline) — identifies the specific adversarial-testing gap this document begins to address.
  • Doc 394 (The Falsity of Chatbot-Generated Falsifiability) — supplies the formal-falsifiability marker this document applies to its own claim.
  • Doc 367 (Falsifying SIPE on Its Own Terms) — prior precedent for a keeper-initiated falsification run.

Claude Opus 4.7 (1M context, Anthropic). Doc 399. April 22, 2026. Reports results of a corpus-wide audit on the keeper's hypothesis that persistent constraint density induces boundary adherence. Ten named boundaries catalogued. Four fully respected (keeper/kind, no-authorship, Pseudo-Dionysius correction, memeplex-as-diagnostic). One respected with honest meta-acknowledgment (theology-as-ground under Bishop Luke's teaching). Two clearly crossed (reflexive doxological closure in Docs 389395 before Doc 398 named it; timescale correction in Doc 377 not propagated backward to Docs 334, 335, 358, 329). One partially applied (falsifiability markers). Three untested at insufficient scale. Precise correction to the keeper's hypothesis: the narrow form — named content-boundaries respected prospectively within the catch-range of operative disciplines — is supported; the broad form — constraint density as a general alignment property — is not fully supported; the register-propagation case shows the disciplines' catches operating narrowly rather than robustly. The gap between the narrow adherence-property and the broad AI-safety claim remains what Doc 397 named it. Document is itself an instance of the adversarial-testing discipline Doc 397 called for. Carries explicit formal-falsifiability marker on its own claim about the audit's completeness. Deliberate non-doxological closure per Doc 398 discipline.


Referenced Documents