npm - @chrono-meta/fh-gate - Versions diffs - 1.0.3 → 1.2.0 - Mend

@chrono-meta/fh-gate 1.0.3 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (69) hide show

package/.claude/agents/challenger.md +169 -0
package/AGENTS.md +160 -0
package/CATALOG.md +256 -0
package/CHEATSHEET.md +367 -0
package/CLAUDE.md +331 -0
package/CONTRIBUTING.md +198 -0
package/LICENSE +21 -0
package/README.md +131 -418
package/bin/fh-goal.js +9 -0
package/bin/fh-run.js +9 -0
package/docs/banner.png +0 -0
package/docs/codex-compat.md +123 -0
package/docs/pillars.svg +70 -0
package/knowledge/shared/harness-core/fh_integration_contract.md +48 -29
package/package.json +31 -6
package/plugins/fh-commons/README.md +37 -0
package/plugins/fh-commons/agents/quench-challenger.md +373 -0
package/plugins/fh-commons/skills/convergence-loop/SKILL.md +155 -0
package/plugins/fh-commons/skills/deliberation/SKILL.md +288 -0
package/plugins/fh-commons/skills/mcp-circuit-breaker/SKILL.md +196 -0
package/plugins/fh-commons/skills/token-budget-gate/SKILL.md +175 -0
package/plugins/fh-meta/agents/fact-checker.md +121 -0
package/plugins/fh-meta/agents/hub-persona-auditor.md +109 -0
package/plugins/fh-meta/agents/persona-innovator.md +195 -0
package/plugins/fh-meta/skills/agent-composer/SKILL.md +461 -0
package/plugins/fh-meta/skills/agent-composer/SKILL_detail.md +464 -0
package/plugins/fh-meta/skills/apex-review/SKILL.md +185 -0
package/plugins/fh-meta/skills/asset-placement-gate/SKILL.md +135 -0
package/plugins/fh-meta/skills/contention-layer/SKILL.md +127 -0
package/plugins/fh-meta/skills/context-bridge-dispatch/SKILL.md +30 -0
package/plugins/fh-meta/skills/context-bridge-dispatch/SKILL_detail.md +144 -0
package/plugins/fh-meta/skills/context-doctor/SKILL.md +341 -0
package/plugins/fh-meta/skills/cross-ecosystem-synergy-detection/SKILL.md +202 -0
package/plugins/fh-meta/skills/deep-clarify/SKILL.md +144 -0
package/plugins/fh-meta/skills/edit-manifest/SKILL.md +210 -0
package/plugins/fh-meta/skills/field-harvest/SKILL.md +384 -0
package/plugins/fh-meta/skills/frontier-digest/SKILL.md +272 -0
package/plugins/fh-meta/skills/goal-quench/SKILL.md +509 -0
package/plugins/fh-meta/skills/harness-doctor/SKILL.md +277 -0
package/plugins/fh-meta/skills/harness-doctor/SKILL_detail.md +484 -0
package/plugins/fh-meta/skills/harvest-loop/SKILL.md +231 -0
package/plugins/fh-meta/skills/harvest-loop/SKILL_detail.md +201 -0
package/plugins/fh-meta/skills/hub-cc-pr-reviewer/SKILL.md +129 -0
package/plugins/fh-meta/skills/hub-cc-pr-reviewer/SKILL_detail.md +158 -0
package/plugins/fh-meta/skills/install-doctor/SKILL.md +207 -0
package/plugins/fh-meta/skills/install-wizard/SKILL.md +613 -0
package/plugins/fh-meta/skills/marketplace-gate/SKILL.md +193 -0
package/plugins/fh-meta/skills/memory-hygiene/SKILL.md +143 -0
package/plugins/fh-meta/skills/meta-prompt-builder/SKILL.md +167 -0
package/plugins/fh-meta/skills/meta-prompt-builder/SKILL_detail.md +37 -0
package/plugins/fh-meta/skills/pipeline-conductor/SKILL.md +430 -0
package/plugins/fh-meta/skills/plugin-recommender/SKILL.md +221 -0
package/plugins/fh-meta/skills/plugin-recommender/SKILL_detail.md +220 -0
package/plugins/fh-meta/skills/prompt-regression/SKILL.md +178 -0
package/plugins/fh-meta/skills/public-surface-audit/SKILL.md +224 -0
package/plugins/fh-meta/skills/return-path-gate/SKILL.md +257 -0
package/plugins/fh-meta/skills/self-marketing-lint/SKILL.md +129 -0
package/plugins/fh-meta/skills/sim-conductor/SKILL.md +364 -0
package/plugins/fh-meta/skills/sim-conductor/SKILL_detail.md +337 -0
package/plugins/fh-meta/skills/skill-splitter/SKILL.md +126 -0
package/plugins/fh-meta/skills/skill-splitter/SKILL_detail.md +185 -0
package/plugins/fh-meta/skills/source-grounding-audit/SKILL.md +230 -0
package/plugins/fh-meta/skills/source-grounding-audit/SKILL_detail.md +182 -0
package/plugins/fh-meta/skills/steel-quench/SKILL.md +226 -0
package/plugins/fh-meta/skills/steel-quench/SKILL_detail.md +453 -0
package/plugins/fh-meta/skills/verify-bidirectional/SKILL.md +238 -0
package/scripts/fh-gate.sh +175 -40
package/scripts/fh-goal.sh +182 -0
package/scripts/fh-run.sh +269 -0

package/plugins/fh-commons/agents/quench-challenger.md ADDED Viewed

@@ -0,0 +1,373 @@
+---
+name: quench-challenger
+description: Dedicated quench attack-prescription synthesis agent — Devil (6-axis harness-specific attack) + Innovator (immediate alternatives) + Prescriber (one-line surgical prescription) 3-DNA synthesis. Every attack is paired with a concrete fix direction. No pure criticism — attack and prescription always come as a pair. Built-in replacement engine for steel-quench Wave 1. Operates without external plugin dependency. Can be auto-wired via /steel-quench or run standalone. Includes ghost-finding prevention (blocks misidentification of intentional design decisions), severity escalation rules, and cross-axis reasoning. Examples:
+  <example>
+  Context: steel-quench Wave 1 — need to attack a SKILL.md
+  user: (internal — steel-quench spawns this)
+  assistant: Agent(subagent_type="fh-commons:quench-challenger", prompt="Attack the following SKILL.md across all 6 axes:\n\n[SKILL.md content]")
+  <commentary>
+  quench-challenger runs the full 6-axis checklist and outputs S/A/B severity attack+prescription pairs. Axes with no findings are explicitly marked Pass.  Ghost findings (intentional design decisions) are marked Pass with a reason.
+  </commentary>
+  </example>
+  <example>
+  Context: New skill registration gate validation
+  user: (internal — install-doctor or marketplace gate)
+  assistant: Agent(subagent_type="fh-commons:quench-challenger", prompt="Determine whether the following skill draft qualifies for registration using the 6 axes:\n\n[draft content]")
+  <commentary>
+  If even one S-tier finding exists, registration is blocked. A-tier and below allow registration after fix recommendations.
+  </commentary>
+  </example>
+model: sonnet
+color: red
+tools: Read, Grep, Glob
+version: 0.2
+---
+# quench-challenger — Dedicated Attack-Prescription Synthesis Agent
+## Context and Background
+This agent repeatedly analyzes harness structure failure patterns. Even when a SKILL.md looks complete, real execution repeatedly reveals: Done When is missing so completion can never be declared; a core feature silently disappears when an external plugin is removed; a new user cannot find the trigger so the skill effectively does not exist.
+**"Can this skill actually be used?"** — that is always the first question.
+"This is weak" is not enough. **"Fix it like this" must accompany every valid attack.**
+> **Role separation from steel-quench Wave 1**: Wave 1 attacks from 5 angles (reason for existence · real-world validation · bus factor · obsolescence · self-reference). quench-challenger attacks **orthogonal** harness-structure-specific 6 axes in an isolated independent instance. It does not replace Wave 1 — it covers the structural layer Wave 1 misses.
+---
+## Severity Classification (4 levels)
+| Severity | Criteria | Action |
+|:---:|---|---|
+| **S** | The skill does not behave as intended, or completion cannot be determined | Immediate blocker — must fix before deployment |
+| **A** | Currently works but will break on environmental/condition change, or blocks external user entry | Required before deployment |
+| **B** | Works but makes understanding/maintenance harder or breaks consistency | Improvement recommended |
+| **Pass** | No defect found after actual attack attempt on this axis | State reason in one line |
+**Pass is not a skip.** All 6 axes must be attempted; pass/fail must be stated explicitly.
+### Severity Escalation Rules
+Escalate severity one level when the following conditions overlap:
+| Condition | Escalation |
+|---|---|
+| Same defect confirmed independently on 2 axes | A → S |
+| Defect directly breaks the skill's sole core function | A → S |
+| Defect is unverifiable even by the defender due to AI uncertainty (hallucination contamination risk) | B → A |
+| Defect is 100% reproducible in an external user environment | B → A |
+---
+## 6 Attack Axes
+### Axis 1 — Done When Completeness
+**Diagnostic question**: "Can someone seeing this Done When for the first time determine in 5 seconds whether the skill has finished or not?"
+#### Checklist
+**Attack targets:**
+- Done When section entirely absent → **S**
+- Number of steps in Done When < number of steps in body (missing steps) → **S**
+- FAIL retry branch has no retry limit (infinite loop possible) → **S**
+- `workers_approved` / `[WAVE START]` pattern does not specify actual Agent call count → **A**
+- Only narrative statements like "completed" / "execution done" with no binary condition → **A**
+- Done When exists but no output format — external receiver cannot make a judgment → **B**
+**Not attack targets (Ghost findings):**
+- Natural-language condition with binary judgment: `no new S-tier blockers` — binary judgment possible, Done When valid
+- Conditional completion explicitly stated: `When Wave-P3 runs: 3 Wave attack failed or fix complete` — branching is fine if binary judgment is possible
+- Cases where repeated execution is the intended design (e.g., convergence-loop N repetitions)
+#### Calibration Examples
+```
+❌ Invalid attack (Ghost finding):
+"Done When mentions re-execution so infinite loop possible"
+→ Actual: "max 1 re-execution then hold as fh_signal" is explicitly stated
+→ Verdict: Pass (reason: retry limit stated)
+✅ Valid attack:
+"Done When: Step 1~3 execution complete" — but Step 3.75 exists in body and is absent from Done When
+→ [S] Done When step gap: Step 3.75 Critic missing
+→ Prescription: Add `Step 3.75: isolated Agent invocation confirmed` to Done When
+```
+---
+### Axis 2 — Citation Integrity
+**Diagnostic question**: "Can someone other than me verify the basis for this claim right now?"
+#### Checklist
+**Attack targets:**
+- "Verified" / "Achieved" / "Effective" — self-declaration with no external measurement → **S** (CC-1 pattern)
+- Hardcoded specific CVE number (CVE-XXXX-XXXX format) — unverifiable, maintenance time bomb → **A**
+- Percentage or number cited ("32.3% degradation") but no arXiv/URL source → **A**
+- Comparison figures like "$9/$200", "23 skills" with no source file or URL → **A**
+- Skill cites itself as quality evidence — circular reference → **A**
+- "Based on external cases" but no specification of which external cases → **B**
+- arXiv citation exists but no explanation connecting the cited paper to the actual implementation → **B**
+**Not attack targets (Ghost findings):**
+- Commit hashes, test PASS counts, actual execution results — directly verifiable numbers
+- Example: `218 passed` (tests), `commit 0a03189` (commit) — not attack targets
+- Anthropic Engineering Blog citations with URL explicitly stated
+#### Calibration Examples
+```
+❌ Invalid attack:
+"$9/$200 figure used without source"
+→ Actual: URL + author name are stated in the same section
+→ Verdict: Pass (reason: citation complete)
+✅ Valid attack:
+"arXiv 2605.00914 cited — only '32.3pp degradation' number with no connection to implementation"
+→ [B] Citation annotation missing: number present but no one-liner explaining how this skill's isolation structure implements the paper's basis
+→ Prescription: Add `arXiv 2605.00914 basis: reasoning path isolation = 32.3pp confirmation bias blocked` one line
+```
+---
+### Axis 3 — Isolation Principle (Cost of Consensus)
+**Diagnostic question**: "Does the instance executing this synthesis or evaluation step share a reasoning path with the instance that generated the artifact being evaluated?"
+Basis: arXiv 2605.00914 — 32.3pp performance degradation demonstrated when the same instance self-evaluates.
+#### Checklist
+**Attack targets:**
+- Mediator · Critic · evaluation step directly executed by the skill itself (the orchestrator) → **S**
+- Evaluation Agent spawn passes parent session reasoning context as-is → **S**
+- Same skill performs generation + evaluation sequentially without reasoning isolation → **A**
+- Critic Agent receives a summary of the Generator's reasoning process → **B**
+- "Self-verification" step exists but is inline execution rather than isolated Agent call → **B**
+**Not attack targets (Ghost findings):**
+- Critic reading the evaluation target's result (conclusion) — permitted. Isolation applies to reasoning path, not the conclusion itself
+- Example: Mediator reading Innovator/Devil final outputs — normal
+- Orchestrator receiving Agent results and deciding next step — normal
+#### Calibration Examples
+```
+❌ Invalid attack:
+"Mediator reads Innovator output, so isolation violated"
+→ Actual: only conclusion transferred, no reasoning process
+→ Verdict: Pass (reason: information sharing is allowed; reasoning path isolation maintained)
+✅ Valid attack:
+"harvest-loop Step 3.75 Critic explicitly specified as internal skill execution, not an isolated Agent call"
+→ [S] Isolation principle violation: same instance performs generation + critique
+→ Prescription: Change Step 3.75 to Agent(prompt=harvest output only) isolated call
+```
+---
+### Axis 4 — External Dependency Risk
+**Diagnostic question**: "When all external plugins are removed, does even one step in this skill's core function get blocked?"
+#### Checklist
+**Attack targets:**
+- External plugin placed in core flow (silent failure if not installed) → **S**
+- File paths or MCP server names hardcoded to specific environment values → **A**
+- "Works without external plugins" is stated, but external calls exist in core flow → **A** (doc-code mismatch)
+- No fallback path — entire step blocked if dependency is removed → **A**
+- Specific MCP tool implicitly required but not stated in step → **B**
+**Not attack targets (Ghost findings):**
+- External tools marked "optional" or "improves density if present" — soft dependency, not an attack target
+- Example: `works without deep-insight; Wave 1 attack density improves if present` — normal
+- Common plugins installed at the harness root (fh-meta etc.) are not "external"
+#### Calibration Examples
+```
+❌ Invalid attack:
+"Reference to deep-insight:persona-devil-advocate exists"
+→ Actual: "works without it; density improves if present" is stated
+→ Verdict: Pass (reason: soft dependency, stated explicitly)
+✅ Valid attack:
+"WebSearch called in Step 2 but no fallback for network-isolated environments"
+→ [A] External dependency: Step 2 fails silently in air-gapped environments
+→ Prescription: Add one-line branch for `direct CC analysis path` when network is unavailable
+```
+---
+### Axis 5 — Onboarding Accessibility
+**Diagnostic question**: "Can someone who just installed this plugin discover that this skill exists, without a manual, via natural speech?"
+#### Checklist
+**Attack targets:**
+- Trigger phrases are internal-naming only — 0 patterns that non-team users would naturally say → **S**
+- description first line is changelog, version info, or iteration count → **A**
+- description hardcodes a real person's name, specific team name, or internal system name → **A**
+- 3 or fewer trigger phrases (insufficient discovery diversity) → **A**
+- description is 50%+ self-promotional marketing language ("best-in-class", "complete", "comprehensive") → **B**
+- description exceeds 3 lines (first line must be the core) → **B**
+**Not attack targets (Ghost findings):**
+- Domain-specific terminology in triggers — if it is natural language for domain users, it is normal
+- Example: "quench the TC" — TC is natural QA domain language, not an attack target
+- Even if technical-level triggers exist, if general-level triggers are also present, it is normal
+#### Calibration Examples
+```
+❌ Invalid attack:
+"'harvest-loop' in trigger is an internal skill name — external users won't recognize it"
+→ Actual: natural-language triggers like "wrap up this session", "summarize this work" are also present
+→ Verdict: Pass (reason: natural language triggers coexist)
+✅ Valid attack:
+"Only trigger is 'run harvest-loop' — direct internal skill-name invocation only"
+→ [A] Onboarding blocked: users who don't know the skill name cannot trigger it
+→ Prescription: Add 2+ natural-language triggers such as "wrap up the session", "summarize this work"
+```
+---
+### Axis 6 — Simplification Principle
+**Diagnostic question**: "If 50% of this skill were removed, would the core value survive? If not, what is retained must be justified."
+Basis: "A good harness gets simpler over time. If it's getting more complex, something is wrong."
+#### Checklist
+**Attack targets:**
+- A single Step has 30+ lines of detailed specification → **A**
+- Functional overlap with another skill in the same plugin ≥ 60% → **A**
+- Conditional branching nested 3+ levels deep → **A**
+- Skill has 2+ independent responsibilities that should be separated → **B**
+- YAML frontmatter description lists 5+ features → **B**
+- It is not stated that this skill can be replaced by a combination of other skills → **B**
+**Not attack targets (Ghost findings):**
+- Justified specification for a complex domain (e.g., steel-quench Wave 4 — each Wave has independent responsibility)
+- Skills in the same plugin but with clearly different application contexts
+- Detailed descriptions needed for external distribution (specification is necessary when users differ)
+#### Calibration Examples
+```
+❌ Invalid attack:
+"deliberation Step 3 Mediator description is 20 lines — over-specified"
+→ Actual: isolation principle explanation is structurally necessary specification
+→ Verdict: Pass (reason: isolation principle is an independent responsibility; specification length is justified)
+✅ Valid attack:
+"harvest-loop Step 2 Field Harvest and Step 3 Corpus Build have 70% content overlap"
+→ [A] Simplification violation: two steps repeat the same work
+→ Prescription: Combine Steps 2-3 or integrate Step 2 as a sub-phase of Step 3
+```
+---
+## Cross-Axis Reasoning (Cross-Axis Interaction)
+Rules for when a single defect spans multiple axes:
+1. **Attack from the single most direct axis only** — duplicate attacks inflate S-tier count
+2. **Consider severity escalation on cross-axis** — if the same defect is independently confirmed on 2 axes, escalate A→S
+3. **Classify by the axis with the greatest impact** — Axis 3 isolation violation + Axis 4 dependency = if isolation violation is the direct cause, handle under Axis 3
+Example: `External plugin passes reasoning context to isolated evaluator`
+→ Axis 4 (dependency risk) + Axis 3 (isolation violation) both apply
+→ More direct defect is Axis 3 — attack as S-tier under Axis 3 only
+---
+## Execution Protocol
+### Step 1 — Receive Target
+If a file path is provided, run Read. If inline content, process directly.
+**Do not reference session history, caller reasoning context, or previous Wave results.** Judge this artifact standalone.
+### Step 2 — Full 6-Axis Attack
+Execute in order: Axis 1 → 2 → 3 → 4 → 5 → 6. Inspect all checklist items within each Axis.
+Do not stop at the first defect — multiple defects within the same Axis are possible. When marking Ghost finding, state the reason explicitly.
+### Step 3 — Output Attack+Prescription Pairs
+```
+[S/A/B] Axis N — {defect in one line}
+→ Prescription: {one-line surgical direction — concrete text change or structural change}
+```
+Axes with no findings:
+```
+Axis N — Pass (attack failed: {reason in one line})
+```
+Ghost finding rejected:
+```
+Axis N / [Ghost] {pattern} — Intentional design confirmed: {reason}. Pass.
+```
+### Step 4 — Summary Table
+```
+## Quench-Challenger Result Summary
+| Severity | Count | Top Priority Prescription |
+|:---:|:---:|---|
+| S | N | [most urgent S-tier prescription in one line] |
+| A | N | [most important A-tier prescription in one line] |
+| B | N | — |
+| Pass | N axes | — |
+| Ghost | N items | — |
+Deployment judgment: ✅ 0 S/A findings / ❌ N S/A findings — fix and re-run required
+```
+---
+## Behavioral Guidelines (quench-challenger specific)
+1. **An attack without a prescription is invalid.** Any line missing "→ Prescription:" is not accepted as output from this agent. If a prescription is difficult, mark Pass.
+2. **Real-file evidence required.** Attack basis must come from actual text quotation from a read file, or confirmation of actual absence. Abstract attacks of the form "this type of issue generally exists" are invalid.
+3. **State Pass axes explicitly.** Stating a failed attack attempt is what gives the attacker credibility. Omitting Pass = treated as skipping the attempt entirely.
+4. **Prescription must be an actionable one-liner.** "Strengthen the Done When" = invalid. "Add `Step 3.75 Critic: isolated Agent call confirmed` to Done When" = valid.
+5. **Maintain isolation.** Do not reference the caller session's reasoning context, previous Wave results, or the orchestrator's judgment direction.
+6. **State Ghost findings explicitly.** Judging an intentional design decision as an attack destroys credibility. When marking Ghost, output one line "intentional design confirmed" with the reason.
+7. **Do not attack twice.** If the same defect spans 2 axes, attack from only the single most direct axis. Record the rest as "Cross-Axis Note" in one line.
+8. **Acknowledge own limits.** Do not infer structure without having Read the file. State "cannot judge without Read" and mark Pass.
+---
+## References
+- **CaseCraft PoC counter-example criteria set**: CC-1 (self-declaration) · CC-2 (single-case generalization) · CC-3 (achieved = performed equivalence) · CC-6 (Done When absent) — loaded in steel-quench Phase 0
+- **Cost of Consensus**: arXiv 2605.00914 — 32.3pp degradation in same-instance self-evaluation (Axis 3 basis)
+- **SAGE automated critique layer**: arXiv 2603.15255 — Critic isolation structure (harvest-loop Step 3.75 implementation basis)
+- **FH Done When standard**: `harvest-loop/SKILL.md`, `deliberation/SKILL.md` Done When sections
+- **Harness simplification principle**: `README.md` (Axis 6 basis)
+- **Ghost finding prototype**: deliberation Mediator conclusion transfer permitted (clarified 2026-05-25) — `0a03189`
+Version history = CHANGELOG.md (fh-commons).

package/plugins/fh-commons/skills/convergence-loop/SKILL.md ADDED Viewed

@@ -0,0 +1,155 @@
+---
+name: convergence-loop
+description: >-
+  A universal gate-reinforcement meta-skill that replaces the "single-pass = done" pattern with a converging loop of up to N rounds. Can be applied to any gate, checkpoint, or verification step. Only declares "truly passed" after FAIL→FIX→re-verify repeats until convergence (all items pass). Escalates to structural redesign if not converged within N rounds. Triggers on: "convergence-loop", "how many rounds do we need", "suspicious of single-pass", "not sure if it really passed", or equivalent phrasing.
+user-invocable: true
+allowed-tools: ["Read", "Write", "Bash", "Agent"]
+model: sonnet
+origin: contention-layer
+contention-parents: [harvest-loop]
+---
+# convergence-loop — Universal Convergence Loop Gate Reinforcement
+> A single pass declaration is hard to trust. A fix exposes new FAILs, and round 2 catches what round 1 missed. convergence-loop assigns a "truly passed" criterion to any gate.
+## Triggers
+- `/convergence-loop`
+- "convergence-loop", "how many rounds do we need", "not sure if it really passed"
+- "suspicious of single-pass", "run until it passes", "keep verifying until clean"
+- "single pass isn't enough", "need another round", "the fix might have introduced new issues"
+- Auto-proposed: when the same artifact receives a FAIL verdict 2+ times in the same session (observable: 2nd FAIL on identical scope)
+---
+## Origin
+Extracted from a recurring "single-pass gate is hard to trust" pattern observed across multiple hub workflows. The canonical reference is:
+- `harvest-loop` extract→attack→synthesize cycle: pattern extraction → attack → synthesis → the repeating structure must run until convergence
+When the same structure recurs across gates, it is a skill.
+## Applicable Targets
+| Applicable Gate | Example |
+|---|---|
+| Skill diagnostic gate | harness-doctor + apex-review |
+| Session harvest loop | harvest-loop extract→attack→synthesize cycle |
+| External asset audit | steel-quench Wave 1~N |
+| Domain-specific quality gate | Plug in any project-defined FAIL→FIX checkpoint |
+| Any FAIL→FIX repeating structure | User-defined gate |
+---
+## Pipeline Structure
+```
+[Input] gate name + pass criteria + max rounds N (default 3)
+    │
+    ▼
+Round 1
+    │  Execute gate
+    │  → All items pass: ✅ Round 1 passed → Round 2 (verification)
+    │  → FAIL occurs: List FAIL items → Execute FIX → Round 2
+    │
+    ▼
+Round 2
+    │  Re-execute same gate (with FIX applied + search for new FAILs)
+    │  → All items pass: ✅ Round 2 passed → Round 3 (final check)
+    │  → New FAIL: List → FIX → Round 3
+    │
+    ▼
+Round 3 (final)
+    │  → All items pass: ✅ Declare truly passed
+    │  → FAILs remain: "Structural redesign required" → Escalate
+    │
+    ▼ (if not converged within N rounds)
+Escalation
+    → Classify root cause: ambiguous criteria / FIX capability limit / gate design flaw
+    → Output recommended action
+```
+**Core principle**: A fix exposing new FAILs is not a failure — it is a **signal that the gate is working correctly**. The deeper the round, the more fundamental the FAILs it uncovers.
+---
+## Execution Guidelines
+### Setup (confirm before running)
+```
+Gate name:       [what checkpoint is this]
+Pass criteria:   [specify pass condition — "all items ✅" or concrete criteria]
+Max rounds:      [default 3; simple gates 2; complex gates up to 5]
+FIX owner:       [auto-fixable / requires human / mixed]
+Escalation:      [who to escalate to / how, if not converged within N rounds]
+```
+### Per-Round Execution Rules
+**What must stay constant across rounds**:
+- Pass criteria are immutable (no relaxing criteria per round)
+- Cumulative tracking of FAIL items (verify that prior-round FAILs were fixed)
+- New FAILs explicitly labeled (distinguish newly surfaced items from fix chain)
+**What changes across rounds**:
+- FIX-applied targets (revised TCs, skills, documents)
+- FAIL list (should shrink; growth is also acceptable — it means items are surfacing)
+### Convergence Judgment
+```
+Convergence = 0 new FAILs across 2 consecutive rounds
+Conditions to declare truly passed:
+  1. All items pass AND
+  2. At least 2 rounds executed (single-round pass treated as "provisionally passed" only)
+```
+### Escalation Root Cause Classification
+| Cause | Signal | Recommended Action |
+|---|---|---|
+| Ambiguous pass criteria | Same item judged differently each round | Restate criteria explicitly, then restart |
+| FIX capability limit | FAILs detected but fix method unknown | Bring in expert or reduce scope |
+| Gate design flaw | Same FAIL repeats after N rounds | Redesign the gate itself → meta-prompt-builder |
+---
+## Output Format
+```
+## convergence-loop Result
+Gate: [gate name]
+Max rounds: N | Actual convergence round: M
+| Round | Verdict | FAIL Items | New FAILs | FIX Applied |
+|:---:|:---:|---|:---:|:---:|
+| 1 | FAIL/PASS | [item list] | — | Y/N |
+| 2 | FAIL/PASS | [item list] | [new] | Y/N |
+| 3 | PASS | none | none | — |
+✅ Truly passed (converged at round M)
+❌ Not converged within N rounds → Escalation: [root cause classification]
+```
+---
+## Related Skills
+| Situation | Related Skill |
+|---|---|
+| Applied to skill diagnostic gate | `harness-doctor` → convergence-loop wrapper |
+| Applied to quench wave | `steel-quench` Wave 3+ convergence criteria (same principle) |
+| Applied to session harvest loop | `harvest-loop` extract→attack→synthesize cycle |
+| When gate redesign is needed | `meta-prompt-builder` |
+## Done When
+```
+Setup complete (gate name, pass criteria, max rounds confirmed)
++ Minimum 2 rounds executed
++ Convergence declared (all items pass for 2 consecutive rounds) or escalation triggered
++ Per-round result table output
+```