npm - wogiflow - Versions diffs - 2.24.0 → 2.25.1 - Mend

wogiflow 2.24.0 → 2.25.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/.claude/commands/wogi-debug-hypothesis.md +1 -1
package/.claude/commands/wogi-decide.md +40 -0
package/.claude/commands/wogi-init.md +29 -0
package/.claude/commands/wogi-learn.md +46 -0
package/.claude/commands/wogi-onboard.md +26 -0
package/.claude/commands/wogi-peer-review.md +5 -3
package/.claude/commands/wogi-triage.md +57 -0
package/package.json +2 -2
package/scripts/flow-completion-truth-gate.js +130 -0
package/scripts/flow-config-defaults.js +25 -0
package/scripts/flow-extraction-review.js +18 -3
package/scripts/flow-morning.js +12 -3
package/scripts/flow-session-end.js +12 -3

package/.claude/commands/wogi-debug-hypothesis.md CHANGED Viewed

@@ -176,7 +176,7 @@ After all agents complete, display the consolidated results:
 ### Step 4: Hypothesis Adversary (v2.23.0+ — MANDATORY unless `--no-adversary`)
-After consolidation, spawn a single Agent (different `model` param if `config.hybrid.enabled`, else same) with this prompt:
+After consolidation, spawn a single Agent on a DIFFERENT model (default `sonnet` via `config.researchReasoningGate.tier3.adversaryModel` — canonical cross-command adversary key, same as `/wogi-peer-review`, `/wogi-learn`, `/wogi-decide`) with this prompt:
 ```
 You are the hypothesis adversary.

package/.claude/commands/wogi-decide.md CHANGED Viewed

@@ -304,6 +304,46 @@ In `config.json`:
 }
 ```
+## Rule-Creation Adversary (v2.25.0+ — OPTIONAL but recommended for ambiguous rules)
+When creating a non-trivial rule (anything beyond pure preference-setting like "always use semicolons"), spawn an adversary on a different model (default `sonnet` via `config.researchReasoningGate.tier3.adversaryModel`) to stress-test the proposed rule BEFORE it lands in `decisions.md`.
+```
+Spawn Agent (subagent_type: general-purpose, model: <adversaryModel>):
+Input:
+  Proposed rule title: <title>
+  Proposed rule body: <body>
+  User's original phrasing: <literal request>
+Prompt:
+  You are the rule-creation adversary.
+  1. Edge cases: name 3 situations where following this rule would produce
+     worse outcomes than NOT following it.
+  2. Interpretation: are there 2+ reasonable interpretations? If yes, list
+     them and pick the one the user most likely meant.
+  3. Scope creep: could this rule be over-applied to situations the user
+     didn't intend? Suggest scope qualifiers.
+  4. Verdict:
+     - ACCEPT   — ship as-is
+     - CLARIFY  — multiple interpretations; ask user
+     - NARROW   — over-application risk; add scope qualifiers
+     - REJECT   — edge cases dominate; more harm than good
+  Output JSON: {
+    "verdict", "edge_cases", "interpretations",
+    "scope_qualifiers", "suggested_revision"
+  }
+```
+Process:
+- **ACCEPT** → proceed with rule creation
+- **CLARIFY** → ask user to pick interpretation
+- **NARROW** → show scope qualifier; ask user to approve
+- **REJECT** → surface edge cases; require explicit override
+Fail-open: adversary unavailable → proceed with standard flow. User confirmation is still present.
 ## Files
 | Action | File |

package/.claude/commands/wogi-init.md CHANGED Viewed

@@ -1209,3 +1209,32 @@ Say "show me the rules" or "what patterns are we using?" anytime.
 ### If user cancels mid-wizard
 - Save progress to `.workflow/state/setup-progress.json`
 - Next run can offer to resume
+## v2.25.0+ — Modern Config Scaffolding (MANDATORY)
+New projects MUST be initialized with the following modern-stack config blocks explicitly written to `.workflow/config.json` so users can see + tune them (defaults-only inheritance is fine for behavior, but visibility matters for learning):
+```json
+{
+  "intentGroundedReasoning": { "enabled": true },
+  "taskBoundaryReset": {
+    "enabled": true,
+    "maxRestartsPerSession": 50
+  },
+  "storyFlow": {
+    "consumerImpactAnalysis": { "enabled": true, "breakingThreshold": 5 },
+    "scopeConfidenceAudit": { "enabled": true },
+    "itemReconciliation": { "enabled": true, "minItems": 3 }
+  },
+  "longInputGate": { "enabled": true, "lineThreshold": 40 },
+  "researchReasoningGate": {
+    "enabled": true,
+    "tier2": { "enabled": true },
+    "tier3": { "enabled": true, "adversaryModel": "sonnet" }
+  }
+}
+```
+These capabilities (IGR, task-boundary restart, P0 story gates, long-input routing, research reasoning gate) have proven out in 2.22+ releases. New users should NOT have to manually enable them via `flow migrate-igr` or equivalent — they are active from the first session.
+If onboarding a workspace (multi-repo), also ensure `workspace.autoPickupChannelDispatches: true` and the 2.22.x restart-handoff settings are present.

package/.claude/commands/wogi-learn.md CHANGED Viewed

@@ -275,6 +275,52 @@ In `config.json`:
 }
 ```
+## Promotion Adversary (v2.25.0+ — MANDATORY)
+Before promoting a pattern from `feedback-patterns.md` to `decisions.md`, run a **Promotion Adversary** on a different model. Rationale: same-model self-critique rubber-stamps. The adversary checks whether the N events that triggered promotion share an actual root cause (genuine recurrence) vs. superficial similarity with different underlying causes (false recurrence — common when the pattern detector just matched keywords).
+```
+Spawn Agent (subagent_type: general-purpose,
+             model: config.researchReasoningGate.tier3.adversaryModel, default 'sonnet'):
+Input:
+  Proposed rule: <title + body>
+  Triggering events: [
+    { date, request, correction },
+    { date, request, correction },
+    { date, request, correction }
+  ]
+Prompt:
+  You are the rule-promotion adversary.
+  Do these N events actually share a root cause, or are they superficially
+  similar events with different underlying issues?
+  1. For each event: describe the root cause in your own words.
+  2. List what's common to all N root causes.
+  3. List what's different between them.
+  4. Verdict:
+     - SAME_PATTERN  — genuine recurrence; rule is well-founded
+     - MIXED         — N-1 match but one event has a different root cause
+     - DIFFERENT     — surface-similar only; no unifying pattern
+  Output JSON:
+  {
+    "verdict": "SAME_PATTERN" | "MIXED" | "DIFFERENT",
+    "root_causes": [...],
+    "commonalities": [...],
+    "differences": [...],
+    "suggested_rule_scope": "as_proposed" | "narrower" | "split_into_N"
+  }
+```
+Process the verdict:
+- **SAME_PATTERN** → proceed with promotion as-is
+- **MIXED** → ask the user: "Adversary flags event #X as different root cause. Promote rule anyway, narrow scope, or split into multiple rules?"
+- **DIFFERENT** → DO NOT auto-promote. Surface adversary output; require explicit user confirmation.
+Fail-open: if adversary cannot be spawned (missing API key, network), proceed with standard promotion and log a warning. The threshold check + user confirmation still apply.
 ## Files
 | Action | File |

package/.claude/commands/wogi-onboard.md CHANGED Viewed

@@ -1077,3 +1077,29 @@ AskUserQuestion({
   }]
 });
 ```
+## v2.25.0+ — Modern Config Scaffolding (MANDATORY)
+When generating `.workflow/config.json` for a fresh project, include these 2.22+ capability blocks so new users inherit the current-best defaults:
+```json
+{
+  "intentGroundedReasoning": { "enabled": true },
+  "taskBoundaryReset": { "enabled": true, "maxRestartsPerSession": 50 },
+  "storyFlow": {
+    "consumerImpactAnalysis": { "enabled": true, "breakingThreshold": 5 },
+    "scopeConfidenceAudit": { "enabled": true },
+    "itemReconciliation": { "enabled": true, "minItems": 3 }
+  },
+  "longInputGate": { "enabled": true, "lineThreshold": 40 },
+  "researchReasoningGate": {
+    "enabled": true,
+    "tier2": { "enabled": true },
+    "tier3": { "enabled": true, "adversaryModel": "sonnet" }
+  }
+}
+```
+These drive IGR (Architect + Adversary + Truth Gate), task-boundary context reset via the `wogi-claude` wrapper, `/wogi-story` P0 spec gates, auto-routing of long inputs to `/wogi-extract-review`, and the research reasoning gate's assumption-surfacing + cross-model adversary. All have proven out across the 2.22.x release series; new users should not have to discover them one at a time.
+For multi-repo workspaces, also scaffold `workspace.autoPickupChannelDispatches: true` and leave the other `workspace.*` defaults intact — they include the 2.22.2 restart-handoff protocol.

package/.claude/commands/wogi-peer-review.md CHANGED Viewed

@@ -47,9 +47,11 @@ Models are selected once per session and remembered for subsequent runs.
 ├─────────────────────────────────────────────────────────┤
 │  1. Collect code changes (git diff or specified files)   │
 │  2. Classify change size → effort tier:                  │
-│     L0/L1 (>10 files)  → opus-4-7 xhigh                  │
+│     L0/L1 (>10 files)  → opus (latest) xhigh             │
 │     L2 (3-10 files)    → sonnet medium                   │
 │     L3 (<3 files)      → haiku medium                    │
+│     (Model IDs resolve from config.models — avoid        │
+│      hardcoding model version in this doc.)              │
 │  3. Generate improvement-focused prompt                  │
 │  4. If includeClaude enabled:                            │
 │     - Launch Claude review (Task agent, Explore type)    │
@@ -96,7 +98,7 @@ analysis, EACH carrying an explicit evidence tier.
 ## Synthesis Adversary (v2.23.0+ — MANDATORY unless `--no-adversary`)
-After initial synthesis, spawn a single adversary agent on a DIFFERENT model from the synthesizer (default: if synthesizer is Opus, adversary is Sonnet; config via `peerReview.adversaryModel`). Prompt:
+After initial synthesis, spawn a single adversary agent on a DIFFERENT model from the synthesizer (default `sonnet`; override via the canonical `config.researchReasoningGate.tier3.adversaryModel` — same key used by `/wogi-debug-hypothesis`, `/wogi-learn`, `/wogi-decide`). Prompt:
 ```
 You are the synthesis adversary.
@@ -199,7 +201,7 @@ For manual review (no API keys needed): `/wogi-peer-review --manual`
 | `--verbose` | Show full model responses |
 | `--create-tasks` | Auto-create tasks for strong agreements |
 | `--no-adversary` | Skip the v2.23.0 synthesis adversary (not recommended for L0/L1 diffs) |
-| `--adversary-model <id>` | Override adversary model (default: cross-model from synthesizer) |
+| `--adversary-model <id>` | Override adversary model (default: `config.researchReasoningGate.tier3.adversaryModel`, usually `sonnet`) |
 | `--effort <level>` | Override effort tier (low/medium/high/xhigh/max) — otherwise derived from diff size |
 ARGUMENTS: {args}

package/.claude/commands/wogi-triage.md CHANGED Viewed

@@ -356,3 +356,60 @@ Each finding is displayed using these fields from `last-review.json`:
 | File | `finding.file` + `finding.line` | "src/api.ts:45" |
 | Issue | `finding.issue` | "Raw JSON.parse without try-catch" |
 | Recommendation | `finding.recommendation` | "Use safeJsonParse from flow-utils.js" |
+## Anti-Deferral Enforcement (v2.25.0+ — two layers)
+The **Review-Findings Anti-Deferral Rule** (`.workflow/state/decisions.md`, 2026-04-15) gets two complementary enforcement layers. One mechanical (an actual gate in the codebase), one AI-followed (a protocol documented here that the triage flow honors).
+### Layer 1 — Mechanical gate (v2.25.1+)
+`scripts/flow-completion-truth-gate.js` exports `parseCommitMessageClaims()` and `verifyCommitMessageAgainstDiff()`. Callers pass a commit message and the staged diff (or changed-files list); the function parses finding IDs (`F1`/`M1`/`SEC-001`), task IDs (`wf-XXXXXXXX` after fix/close/resolve verbs), and file-path mentions, then checks each against the diff. Any unverified claim surfaces as a blocking prompt with three remediation options. This is real code, callable from pre-commit hooks, `flow-done.js`, or the triage flow itself.
+Example usage:
+```javascript
+const { verifyCommitMessageAgainstDiff, formatMissingClaimsMessage } =
+  require('wogiflow/scripts/flow-completion-truth-gate');
+const result = verifyCommitMessageAgainstDiff(commitMsg, { diffText, changedFiles });
+if (!result.ok) {
+  console.error(formatMissingClaimsMessage(result));
+  // Block + remediate
+}
+```
+### Layer 2 — AI-followed protocol (documentation)
+The rest of the triage flow is a protocol the AI follows. It is NOT automatically enforced by a hook — the historical v2.17.4 incident showed that doc-only protocols can be violated. The mechanical gate above closes the most damaging failure mode (commit message / diff mismatch). The AI-followed rules below cover the earlier stages:
+1. **Defer requires explicit user confirmation + reason.** The triage flow prompts when proposing to defer:
+   ```
+   Defer finding wf-review-XXXX?
+     Severity: HIGH
+     Reason required: [user input]
+     [Confirm defer] [Cancel — fix now]
+   ```
+   Auto-defer without reason is forbidden by this protocol.
+2. **"Fix all" / "Option 1" means fix ALL.** If the user requests bulk processing:
+   - Ship a fix for every finding with evidence-tier ≥ 1
+   - If any finding is too large, STOP and ask: "Finding X requires ~Y minutes of work. Ship now, split to its own release, or defer (needs reason)?"
+   - Never silently convert a finding to "deferred" in commit messages or release notes
+3. **Triage output includes a Deferral Audit Trail**:
+   ```
+   ━━━ TRIAGE SUMMARY ━━━
+   Fixed: 12
+   Deferred (with reasons): 2
+     • M3 — "requires restructure, tracked as wf-XXXXXXXX" (user-confirmed)
+     • L5 — "out of scope for current release" (user-confirmed)
+   Silently dropped: 0 ← MUST be 0
+   ━━━━━━━━━━━━━━━━━━━━━━
+   ```
+### Honest tradeoff
+Layer 1 is genuinely mechanical — impossible for an AI to bypass without explicitly disabling the gate. Layer 2 is a protocol the AI can fail to follow if prompted poorly, distracted, or confused about priorities. Both matter; calling the whole system "architecturally impossible to bypass" would be inaccurate. The mechanical gate at least ensures that WHEN the AI writes a commit message, claimed fixes must actually appear in the diff.
+Historical incident (v2.17.4 release, 2026-04-15): commit claimed "fix all findings" but M1 and M3 were silently dropped. Layer 1 would have caught that — the commit message mentioned M1 + M3 but the diff didn't. Layer 2 is the human-protocol reinforcement.
+Skip via `config.triage.antiDeferralEnforcement.enabled: false` — note that this is currently a surface flag only (read by AI-followed protocol, not by the Layer 1 gate); to disable Layer 1 set `config.commitClaimsGate.enabled: false`.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "wogiflow",
-  "version": "2.24.0",
+  "version": "2.25.1",
   "description": "AI-powered development workflow management system with multi-model support",
   "main": "lib/index.js",
   "bin": {
@@ -10,7 +10,7 @@
   },
   "scripts": {
     "flow": "./scripts/flow",
-    "test": "NODE_ENV=test node --test tests/auto-compact-prompt.test.js tests/flow-paths.test.js tests/flow-io.test.js tests/flow-config-loader.test.js tests/flow-damage-control.test.js tests/flow-output.test.js tests/flow-constants.test.js tests/flow-session-state.test.js tests/flow-hooks-integration.test.js tests/flow-utils.test.js tests/flow-security.test.js tests/flow-memory-db.test.js tests/flow-durable-session.test.js tests/flow-skill-matcher.test.js tests/flow-bridge.test.js tests/flow-proactive-compact.test.js tests/flow-cascade-completion.test.js tests/flow-capture-gate.test.js tests/flow-correction-detector-hybrid.test.js tests/flow-promote.test.js tests/flow-archive-runs.test.js tests/flow-memory.test.js tests/flow-hooks-pre-tool-helpers.test.js tests/flow-hooks-bugfix-scope-gate.test.js tests/flow-hooks-routing-gate.test.js tests/flow-hooks-phase-read-gate.test.js tests/flow-hooks-commit-log-gate.test.js tests/flow-hooks-deploy-gate.test.js tests/flow-hooks-todowrite-gate.test.js tests/flow-hooks-git-safety-gate.test.js tests/flow-hooks-scope-mutation-gate.test.js tests/flow-hooks-strike-gate.test.js tests/flow-hooks-component-check.test.js tests/flow-hooks-scope-gate.test.js tests/flow-hooks-implementation-gate.test.js tests/flow-hooks-research-gate.test.js tests/flow-hooks-loop-check.test.js tests/flow-hooks-manager-boundary-gate.test.js tests/flow-hooks-phase-gate.test.js tests/flow-hooks-pre-tool-orchestrator.test.js tests/flow-hooks-observation-capture.test.js tests/flow-hooks-task-gate.test.js tests/flow-durable-session-suspension.test.js tests/flow-health-mcp-scopes.test.js tests/flow-lean-config.test.js tests/flow-workspace-autopickup.test.js tests/flow-worker-boundary-gate.test.js tests/flow-worker-question-classifier.test.js tests/flow-completion-truth-gate-contradictions.test.js tests/flow-structure-sensor.test.js tests/flow-workspace-dispatch-tracking.test.js tests/flow-story-gates.test.js tests/flow-workspace-restart-handoff.test.js tests/flow-wogi-claude-wrapper.test.js tests/flow-wave1-integrations.test.js tests/flow-wave2-integrations.test.js && NODE_ENV=test node tests/run-quality-gates.test.js",
+    "test": "NODE_ENV=test node --test tests/auto-compact-prompt.test.js tests/flow-paths.test.js tests/flow-io.test.js tests/flow-config-loader.test.js tests/flow-damage-control.test.js tests/flow-output.test.js tests/flow-constants.test.js tests/flow-session-state.test.js tests/flow-hooks-integration.test.js tests/flow-utils.test.js tests/flow-security.test.js tests/flow-memory-db.test.js tests/flow-durable-session.test.js tests/flow-skill-matcher.test.js tests/flow-bridge.test.js tests/flow-proactive-compact.test.js tests/flow-cascade-completion.test.js tests/flow-capture-gate.test.js tests/flow-correction-detector-hybrid.test.js tests/flow-promote.test.js tests/flow-archive-runs.test.js tests/flow-memory.test.js tests/flow-hooks-pre-tool-helpers.test.js tests/flow-hooks-bugfix-scope-gate.test.js tests/flow-hooks-routing-gate.test.js tests/flow-hooks-phase-read-gate.test.js tests/flow-hooks-commit-log-gate.test.js tests/flow-hooks-deploy-gate.test.js tests/flow-hooks-todowrite-gate.test.js tests/flow-hooks-git-safety-gate.test.js tests/flow-hooks-scope-mutation-gate.test.js tests/flow-hooks-strike-gate.test.js tests/flow-hooks-component-check.test.js tests/flow-hooks-scope-gate.test.js tests/flow-hooks-implementation-gate.test.js tests/flow-hooks-research-gate.test.js tests/flow-hooks-loop-check.test.js tests/flow-hooks-manager-boundary-gate.test.js tests/flow-hooks-phase-gate.test.js tests/flow-hooks-pre-tool-orchestrator.test.js tests/flow-hooks-observation-capture.test.js tests/flow-hooks-task-gate.test.js tests/flow-durable-session-suspension.test.js tests/flow-health-mcp-scopes.test.js tests/flow-lean-config.test.js tests/flow-workspace-autopickup.test.js tests/flow-worker-boundary-gate.test.js tests/flow-worker-question-classifier.test.js tests/flow-completion-truth-gate-contradictions.test.js tests/flow-structure-sensor.test.js tests/flow-workspace-dispatch-tracking.test.js tests/flow-story-gates.test.js tests/flow-workspace-restart-handoff.test.js tests/flow-wogi-claude-wrapper.test.js tests/flow-wave1-integrations.test.js tests/flow-wave2-integrations.test.js tests/flow-wave3-integrations.test.js tests/flow-commit-claims-gate.test.js && NODE_ENV=test node tests/run-quality-gates.test.js",
     "test:syntax": "find scripts/ lib/ -name '*.js' -not -path '*/node_modules/*' -exec node --check {} +",
     "lint": "eslint scripts/ lib/ tests/",
     "lint:ci": "eslint scripts/ lib/ tests/ --max-warnings 0",

package/scripts/flow-completion-truth-gate.js CHANGED Viewed

@@ -614,6 +614,133 @@ function collectArrayEntries(obj, keys) {
   return out;
 }
+// ============================================================
+// Commit-vs-diff consistency scanner (v2.25.1 — H2b from Waves 1-3 review)
+// ============================================================
+/**
+ * Parse a commit message for "fixes X" / "closes X" / "F1, F2, M1" style claims
+ * that should be verifiable against the diff.
+ *
+ * Heuristics — conservative to avoid false positives:
+ *   1. Bracketed finding IDs: `F1`, `F2`, `M1`, `H3`, `L5`, or `SEC-001`/`PERF-002`
+ *   2. Task IDs: `wf-XXXXXXXX` that appear as "fixes wf-...", "closes wf-...", etc.
+ *   3. File paths mentioned in fix-context: "fixes `path/to/file.js`"
+ *
+ * Returns the structured claims a diff-consistency check can verify.
+ *
+ * @param {string} commitMessage
+ * @returns {{claims: Array<{kind: 'finding-id'|'task-id'|'file', value: string, raw: string}>}}
+ */
+function parseCommitMessageClaims(commitMessage) {
+  const claims = [];
+  if (typeof commitMessage !== 'string' || commitMessage.trim().length === 0) {
+    return { claims };
+  }
+  // Finding IDs: F1, F2, M1, H3, L5, SEC-001, PERF-002, etc.
+  //   - Single-letter + digits: match on word boundary
+  //   - ALLCAPS-dashnum: SEC-001, PERF-002
+  const findingRe = /\b(?:F\d+|H\d+|M\d+|L\d+|[A-Z]{2,6}-\d+)\b/g;
+  for (const m of commitMessage.matchAll(findingRe)) {
+    claims.push({ kind: 'finding-id', value: m[0], raw: m[0] });
+  }
+  // Task IDs (wf-XXXXXXXX) — only count if preceded by fix/close/resolve verb
+  const taskRe = /\b(?:fix(?:es|ed)?|clos(?:es|ed)?|resolv(?:es|ed)?|address(?:es|ed)?)\s+(wf-[0-9a-f]{8})\b/gi;
+  for (const m of commitMessage.matchAll(taskRe)) {
+    claims.push({ kind: 'task-id', value: m[1], raw: m[0] });
+  }
+  // File paths in backticks after fix/address verbs: `fixes \`path/to/file.js\``
+  const fileRe = /(?:fix(?:es|ed)?|address(?:es|ed)?|updat(?:es|ed)?)\s+`([^`\n]{3,120})`/gi;
+  for (const m of commitMessage.matchAll(fileRe)) {
+    // Only count values that look like file paths (have an extension or a slash)
+    const val = m[1];
+    if (/[./]/.test(val) && !val.includes(' ')) {
+      claims.push({ kind: 'file', value: val, raw: m[0] });
+    }
+  }
+  // Dedup
+  const seen = new Set();
+  return {
+    claims: claims.filter(c => {
+      const k = `${c.kind}::${c.value.toLowerCase()}`;
+      if (seen.has(k)) return false;
+      seen.add(k);
+      return true;
+    })
+  };
+}
+/**
+ * Check commit message claims against the staged diff. Each claim must appear
+ * somewhere in the diff (a file path in the changed-files list OR the token
+ * appearing as-is in the diff body).
+ *
+ * @param {string} commitMessage
+ * @param {Object} [opts]
+ * @param {string} [opts.diffText] — raw `git diff --staged` output
+ * @param {string[]} [opts.changedFiles] — staged file list (alternative input)
+ * @returns {{ok: boolean, totalClaims: number, missingClaims: Array, verifiedClaims: Array}}
+ */
+function verifyCommitMessageAgainstDiff(commitMessage, opts = {}) {
+  const { claims } = parseCommitMessageClaims(commitMessage);
+  if (claims.length === 0) return { ok: true, totalClaims: 0, missingClaims: [], verifiedClaims: [] };
+  const diffText = typeof opts.diffText === 'string' ? opts.diffText : '';
+  const changedFiles = Array.isArray(opts.changedFiles) ? opts.changedFiles : [];
+  const haystack = [diffText, ...changedFiles].join('\n');
+  const missingClaims = [];
+  const verifiedClaims = [];
+  for (const claim of claims) {
+    let found = false;
+    if (claim.kind === 'file') {
+      // File claims verify by exact path match (or suffix) in changed-files list
+      found = changedFiles.some(f => f === claim.value || f.endsWith('/' + claim.value) || f.endsWith(claim.value));
+      if (!found) found = diffText.includes(claim.value);
+    } else {
+      // finding-id + task-id: plain substring search in the haystack
+      found = haystack.includes(claim.value);
+    }
+    (found ? verifiedClaims : missingClaims).push(claim);
+  }
+  return {
+    ok: missingClaims.length === 0,
+    totalClaims: claims.length,
+    missingClaims,
+    verifiedClaims
+  };
+}
+/**
+ * Human-readable message when claims are missing from the diff.
+ *
+ * @param {Object} result — from verifyCommitMessageAgainstDiff
+ * @returns {string|null}
+ */
+function formatMissingClaimsMessage(result) {
+  if (!result || result.ok || !Array.isArray(result.missingClaims) || result.missingClaims.length === 0) {
+    return null;
+  }
+  const lines = [
+    `Commit message claims ${result.missingClaims.length} item(s) that do not appear in the staged diff:`
+  ];
+  for (const c of result.missingClaims) {
+    lines.push(`  • ${c.kind === 'finding-id' ? 'Finding' : c.kind === 'task-id' ? 'Task' : 'File'} "${c.value}" — not found`);
+  }
+  lines.push('');
+  lines.push('Options:');
+  lines.push('  1. Add the missing fix to the commit now (git add + amend)');
+  lines.push('  2. Remove the unverified claim from the commit message');
+  lines.push('  3. Acknowledge + proceed (use --force-commit-claims if blocking from a gate)');
+  return lines.join('\n');
+}
 // ============================================================
 // Exports
 // ============================================================
@@ -627,6 +754,9 @@ module.exports = {
   isTruthGateDisabled,
   getMinTierForDone,
   scanForClaimContradictions,
+  parseCommitMessageClaims,
+  verifyCommitMessageAgainstDiff,
+  formatMissingClaimsMessage,
   TIER_NAMES,
   DONE_WORDS,
   DISAGREEMENT_WORDS,

package/scripts/flow-config-defaults.js CHANGED Viewed

@@ -818,6 +818,31 @@ const CONFIG_DEFAULTS = {
   // --- Gate Confidence ---
   gateConfidence: { enabled: false },
+  // --- Intent-Grounded Reasoning (IGR) ---
+  // Master flag for the IGR pipeline: Intent Framing (Step 1.15), Architect
+  // Pass (Step 1.55), Logic Adversary (Step 1.57), Scope-Confidence Audit
+  // (Step 1.45), Completion Truth Gate (Step 3.9). Default-on so new projects
+  // inherit the full reasoning pipeline. See .claude/docs/intent-grounded-reasoning.md.
+  intentGroundedReasoning: {
+    enabled: true,
+    _comment: 'IGR pipeline: architect + logic adversary + truth gate. See .claude/docs/intent-grounded-reasoning.md'
+  },
+  // --- Research Reasoning Gate ---
+  // Tiered classification for conversation-mode questions. Tier 1 = factual,
+  // direct answer. Tier 2 = domain/recommendation, surface assumptions and
+  // wait for user confirmation. Tier 3 = architecture, tier 2 flow + spawn
+  // cross-model adversary. See wogi-start.md § Research Reasoning Gate.
+  researchReasoningGate: {
+    enabled: true,
+    tier2: { enabled: true },
+    tier3: {
+      enabled: true,
+      adversaryModel: 'sonnet',
+      _comment_adversaryModel: 'Model used for Tier-3 cross-model adversary. Reused by /wogi-peer-review, /wogi-debug-hypothesis, /wogi-learn, /wogi-decide — single canonical key.'
+    }
+  },
   // --- Long Input Gate ---
   longInputGate: {
     enabled: true,

package/scripts/flow-extraction-review.js CHANGED Viewed

@@ -104,8 +104,13 @@ function loadReviewSession() {
       return null;
     }
-    // Check for prototype pollution keys
-    if ('__proto__' in parsed || 'constructor' in parsed || 'prototype' in parsed) {
+    // Check for prototype pollution keys. Use Object.prototype.hasOwnProperty
+    // rather than `key in parsed` — the latter also returns true for inherited
+    // properties, and EVERY plain object inherits `constructor` from
+    // Object.prototype, which made this guard falsely trip on every valid
+    // session file (pre-existing bug, found via v2.25.1 wave2 test).
+    const hasOwn = Object.prototype.hasOwnProperty;
+    if (hasOwn.call(parsed, '__proto__') || hasOwn.call(parsed, 'constructor') || hasOwn.call(parsed, 'prototype')) {
       console.error('Review session file contains unsafe keys');
       return null;
     }
@@ -414,11 +419,21 @@ function exportAsItemManifest() {
   // Coordinate with Intent Bootstrap (see flow-story-gates.coordinateIntentBootstrap)
   // so /wogi-start doesn't re-prompt if the user already scheduled bootstrap via
   // /wogi-story during this session.
+  //
+  // v2.25.1: Semantics corrected (nit from Waves 1-3 review). The flag
+  // represents "is IGR bootstrap active/scheduled for this session?", NOT
+  // "did THIS call schedule it?". `result.active` is true when IGR is enabled
+  // and bootstrap has been scheduled — whether by this call or a prior one.
   let intentBootstrapScheduled = false;
   try {
     const gates = require('./flow-story-gates');
     const result = gates.coordinateIntentBootstrap();
-    intentBootstrapScheduled = !!(result && result.scheduled);
+    if (result && result.active) {
+      // Scheduled in this call OR already-scheduled from a prior call = active
+      intentBootstrapScheduled = result.scheduled === true ||
+                                 result.reason === 'already-scheduled' ||
+                                 result.reason === 'artifacts-exist';
+    }
   } catch (_err) { /* non-critical */ }
   return {

package/scripts/flow-morning.js CHANGED Viewed

@@ -385,22 +385,31 @@ function collectBriefingData() {
   // v2.23.0 — Workspace dispatch surfacing (manager mode only).
   // If the user is working inside a workspace manager session, surface any
   // overdue or restart-gap-lost dispatches so the morning briefing catches
-  // what the last manager turn would have caught. Fail-open.
+  // what the last manager turn would have caught. Fail-open; DEBUG-logged.
   try {
     if (process.env.WOGI_WORKSPACE_ROOT) {
       const { buildOverdueContext } = require('./hooks/core/overdue-dispatches');
       const ctx = buildOverdueContext();
       if (ctx) briefing.workspaceOverdue = ctx;
     }
-  } catch (_err) { /* non-critical */ }
+  } catch (err) {
+    if (process.env.DEBUG) {
+      console.error(`[morning] Workspace overdue check failed (fail-open): ${err.message}`);
+    }
+  }
   // v2.23.0 — Completion-claim honesty scan.
   // Catches done-word-in-notes-while-status-partial and similar
   // contradictions across ready.json (uses the honesty-infra from 2026-04-16).
+  // Fail-open; DEBUG-logged.
   try {
     const { checkCompletionClaimHonesty } = require('./flow-health');
     briefing.honestyHits = checkCompletionClaimHonesty();
-  } catch (_err) { /* non-critical */ }
+  } catch (err) {
+    if (process.env.DEBUG) {
+      console.error(`[morning] Honesty scan failed (fail-open): ${err.message}`);
+    }
+  }
   // Generate suggested prompt if enabled
   if (morningConfig.generatePrompt !== false) {

package/scripts/flow-session-end.js CHANGED Viewed

@@ -596,9 +596,18 @@ function writeWorkspaceSessionEndMessage() {
   const workspaceRoot = process.env.WOGI_WORKSPACE_ROOT;
   if (!workspaceRoot) return;
   const repo = process.env.WOGI_REPO_NAME;
-  // Only manager-mode sessions emit this signal. Workers use their own
-  // Stop-hook worker-stopped message (see lib/workspace-messages.js).
-  if (repo && repo !== 'manager') return;
+  // Only emit this signal from EXPLICIT manager-mode sessions.
+  // v2.25.1 (M2 from Waves 1-3 review): tightened to require
+  // WOGI_REPO_NAME === 'manager' explicitly. Previously we let
+  // unset-repo sessions fall through, which could emit a spurious
+  // "manager session ended" broadcast from a mis-env'd worker shell.
+  // Workers use their own Stop-hook worker-stopped message.
+  if (repo !== 'manager') {
+    if (repo && process.env.DEBUG) {
+      console.error(`[session-end] Skipping workspace message — WOGI_REPO_NAME is '${repo}', not 'manager'`);
+    }
+    return;
+  }
   try {
     const messagesLib = path.resolve(__dirname, '..', 'lib', 'workspace-messages.js');