npm - helixevo - Versions diffs - 0.6.1 → 0.8.0 - Mend

helixevo 0.6.1 → 0.8.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (22) hide show

package/CHANGELOG.md +26 -0
package/README.md +22 -7
package/dashboard/app/api/proof/route.ts +71 -0
package/dashboard/app/api/run/route.ts +20 -1
package/dashboard/app/coevolution/client.tsx +6 -1
package/dashboard/app/coevolution/page.tsx +3 -1
package/dashboard/app/commands/page.tsx +59 -8
package/dashboard/app/guide/page.tsx +77 -25
package/dashboard/app/ontology/client.tsx +8 -1
package/dashboard/app/ontology/page.tsx +3 -1
package/dashboard/app/page.tsx +172 -6
package/dashboard/app/proof/client.tsx +348 -0
package/dashboard/app/proof/page.tsx +9 -0
package/dashboard/app/topology/client.tsx +48 -0
package/dashboard/app/topology/page.tsx +3 -1
package/dashboard/components/sidebar-nav.tsx +1 -0
package/dashboard/lib/data.ts +177 -0
package/dashboard/lib/loop-map.ts +23 -3
package/dashboard/lib/proof.ts +577 -0
package/dashboard/lib/release-spotlight.ts +10 -10
package/dist/cli.js +1744 -220
package/package.json +2 -2

package/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,32 @@ All notable changes to HelixEvo are documented here.
 ## [Unreleased]
+## [0.8.0] - 2026-03-25
+### Added
+- Persisted `topology-optimize-status.json` so the dashboard and CLI can distinguish full optimize refresh from partial/degraded conflict enrichment
+- Persisted `llm-runtime-state.json` so HelixEvo can track default provider, per-provider health, last execution, and explicit fallback truth across Claude Code, Codex, and Ollama
+- New provider-control layer that keeps Claude Code as default while adding optional Codex and Ollama support for shared prompt-in / text-out operations
+### Changed
+- `graph --optimize` now refreshes the topology review queue first and reports partial-vs-full enrichment truthfully instead of hiding useful structural backlog behind brittle enrichment failures
+- Dashboard run actions now prefer the local built CLI when available, improving live control coherence during local execution
+- Overview, Topology, and Proof now provide stronger next-step guidance around degraded optimize runs and measuring/regressed proof states
+- `status`, Overview, Commands, Guide, and README now expose provider-control truth, including Claude default state, optional Codex/Ollama support, and explicit fallback/degraded behavior
+- Claude-backed web search and research remain explicitly Claude-scoped rather than pretending provider symmetry where it does not exist yet
+## [0.7.0] - 2026-03-25
+### Added
+- New `helixevo proof` command for bounded outcome attribution and explicit proof review across interventions, transfer, topology execution, semantic adoption, and legacy evolution impact
+- New dashboard `Proof` route and `/api/proof` operator control surface for first-class prove-stage review
+- New `~/.helix/proof-reviews.jsonl` ledger for verify / defer / contest decisions on derived proof records
+### Changed
+- The dashboard operator loop now routes the Prove stage to `/proof` instead of only the Guide metrics anchor
+- Overview, Co-Evolution, Ontology, Topology, Guide, Commands, and README now surface the new proof layer and the broader prove-stage framing
+- `metrics`, `status`, and `report` now point operators toward `helixevo proof --status` for broader post-action review
 ## [0.6.1] - 2026-03-24
 ### Added

package/README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # HelixEvo
-Co-evolving skill and project brain for AI agents. HelixEvo captures failures, traces activations, models pressure, routes governed responses, promotes cross-project transfer, reviews structural topology changes, safely executes accepted topology transitions with rollback, and now lets approved ontology concepts become active semantic consumers inside the live control loop.
+Co-evolving skill and project brain for AI agents. HelixEvo captures failures, traces activations, models pressure, routes governed responses, promotes cross-project transfer, reviews structural topology changes, safely executes accepted topology transitions with rollback, lets approved ontology concepts become active semantic consumers inside the live control loop, and now exposes a first-class proof layer for bounded outcome attribution across the brain loop.
 ## How it works
@@ -21,15 +21,21 @@ Every proposed change goes through:
 - **[Bun](https://bun.sh)** — used for building (`curl -fsSL https://bun.sh/install | bash`)
 - **[Claude CLI](https://docs.anthropic.com/en/docs/claude-code)** — installed and authenticated
   - Requires a **Claude Max plan** subscription
-  - HelixEvo uses `claude --print` for all LLM operations (no API key needed)
+  - Claude Code remains the **default provider** for HelixEvo
   - Prefer `claude auth login` managed credentials over exporting a hardcoded `CLAUDE_CODE_OAUTH_TOKEN`
   - HelixEvo now retries once without an inherited `CLAUDE_CODE_OAUTH_TOKEN` if that override is stale but local Claude auth is valid
+- **Optional providers**
+  - **Codex CLI** (`codex`) for GPT Codex on shared prompt-in / text-out paths
+  - **Ollama** (`ollama` + local daemon) for shared local-model prompt-in / text-out paths
+  - Claude-only web-search and research tooling remain explicitly Claude-scoped
 Verify prerequisites:
 ```bash
 node --version    # v18+
 bun --version     # any
-claude --version  # any
+claude --version  # default provider
+codex --version   # optional
+ollama --version  # optional
 ```
 ## Install
@@ -81,6 +87,7 @@ helixevo dashboard
 |---------|-------------|
 | `helixevo watch` | Always-on learning: auto-capture + auto-evolve |
 | `helixevo metrics` | Correction rates, skill trends, evolution impact |
+| `helixevo proof` | Outcome attribution and proof review across interventions, transfer, topology, ontology, and evolution |
 | `helixevo health` | Network health: cohesion, coverage, balance, transfer |
 | `helixevo init` | Import existing skills + generate skill tests |
 | `helixevo capture <session>` | Extract failures from a session file |
@@ -91,9 +98,9 @@ helixevo dashboard
 | `helixevo graph` | View skill network in terminal |
 | `helixevo ontology` | Refresh, review, adopt, and inspect ontology concepts plus semantic control coverage |
 | `helixevo topology` | Prepare, apply, roll back, and inspect reviewed topology execution |
-| `helixevo research` | Proactive web research for skill improvement |
+| `helixevo research` | Proactive web research for skill improvement (Claude-scoped web-tool path) |
 | `helixevo dashboard [--port <n>]` | Open web dashboard, preferring localhost:3847 and falling forward if occupied |
-| `helixevo status` | Show system health |
+| `helixevo status` | Show system health plus provider-control truth |
 | `helixevo report` | Generate evolution report |
 ### Common options
@@ -109,7 +116,7 @@ helixevo graph                    # TUI view (instant, cached)
 helixevo graph --mermaid          # Open in browser as Mermaid diagram
 helixevo graph --obsidian ~/vault # Sync to Obsidian vault
 helixevo graph --rebuild          # Re-infer relationships (LLM call)
-helixevo graph --optimize         # Detect structural candidates + refresh topology review queue
+helixevo graph --optimize         # Refresh topology review queue first, then report full vs partial conflict enrichment
 helixevo ontology --status        # Show ontology kernel / frontier / extension / adoption state
 helixevo ontology --status --verbose
                                    # Show top active concepts, unused extensions, and deprecation-sensitive concepts
@@ -120,6 +127,9 @@ helixevo topology --status        # Show reviewed topology execution state
 helixevo topology --prepare <id>  # Prepare an accepted topology candidate
 helixevo topology --apply <id>    # Apply a safe prepared topology plan
 helixevo topology --rollback <id> # Roll back an applied topology plan
+helixevo proof --status           # Review proof state across the live loop
+helixevo proof --review <id> --decision verify
+                                  # Verify a proof record after operator review
 ```
 ### Research options
@@ -144,13 +154,16 @@ All data is stored in `~/.helix/`:
 ├── pressure-interventions.jsonl # Routed intervention ledger across response lanes
 ├── transfer-events.jsonl    # Promotion / transfer evidence across motifs and projects
 ├── governance-state.json    # Operator steering for active governance mode
+├── llm-runtime-state.json   # Default provider, per-provider health, last execution, and fallback truth
 ├── topology-review-candidates.json # Persisted structural review queue
 ├── topology-review-decisions.jsonl # Operator accept/reject/defer decision ledger
+├── topology-optimize-status.json # Last full/partial optimize refresh status + queue/enrichment summary
 ├── topology-overrides.json   # Applied safe structural topology overrides
 ├── topology-snapshots.json   # Snapshot refs for reviewed execution and rollback
 ├── topology-apply-plans.json # Prepared reviewed topology plans
 ├── topology-executions.jsonl # Prepared/applied/rolled-back execution ledger
 ├── topology-artifacts.jsonl  # Evidence artifacts for reviewed structural execution
+├── proof-reviews.jsonl      # Operator verify/defer/contest ledger for derived proof records
 ├── evolution-artifacts.jsonl # Evolution + ontology-review evidence artifacts
 ├── ontology/
 │   ├── kernel.json          # Materialized ontology kernel snapshot
@@ -184,11 +197,12 @@ helixevo dashboard --port 3900
 ```
 **Tabs:**
-- **Overview** — Premium control cockpit with frontier signals, brain foundation, semantic backbone, ontology adoption visibility, pressure counts, topology review visibility, and prepared/applied structural state
+- **Overview** — Premium control cockpit with frontier signals, brain foundation, provider-control truth, semantic backbone, ontology adoption visibility, proof review visibility, pressure counts, topology review visibility, and prepared/applied structural state
 - **Skill Network** — Interactive graph, premium inspector, co-evolution routing signals, and topology review/execution handoff links
 - **Co-Evolution** — Operator cockpit for routed pressure response, governance mode visibility, promotion queues, transfer evidence, semantic route influence, and topology handoff
 - **Ontology** — Semantic control surface for kernel visibility, frontier concept review, approved ontology extensions, adoption coverage, deprecation risk, and native ontology change events
 - **Topology** — Governance steering plus a persistent operator pipeline for review → prepare → apply → rollback across merge / split / promote / rewire / consolidate candidates
+- **Proof** — Outcome-attribution and proof-review cockpit for bounded effectiveness review across interventions, transfer, topology execution, semantic adoption, and evolution impact
 - **Projects** — Project intake studio, live project analysis, gap routing, per-project pressure hotspots, and promotion feeders
 - **Evolution** — Timeline of evolution runs with judge scores, artifact provenance, and activation-aware context
 - **Research** — Knowledge buffer plus a live “why research now” handoff from current pressure, governed routing, and recurring gaps
@@ -235,6 +249,7 @@ Failures → Cluster → Propose → Replay → Multi-Judge → Regression → C
 - **Governance steering** lets the operator pin or release the active adaptation mode rather than relying only on derived routing.
 - **Topology review** persists merge / split / promote / rewire / consolidate candidates so manual review is a real workflow.
 - **Reviewed topology execution** turns accepted safe candidates into prepared plans, snapshot-backed applies, and rollbackable structural transitions.
+- **Proof control** turns bounded outcome attribution into an explicit operator layer where interventions, transfer, topology execution, semantic adoption, and evolution impact can be verified, deferred, or contested.
 - **Evolution artifacts** preserve proposal-level evidence so the dashboard can show what changed, why, and with what provenance.
 **Three-layer hierarchy:**

package/dashboard/app/api/proof/route.ts ADDED Viewed

@@ -0,0 +1,71 @@
+import { NextResponse } from 'next/server'
+import { spawn } from 'child_process'
+import { existsSync } from 'fs'
+import { join } from 'path'
+import { loadProofDashboardSummary } from '@/lib/proof'
+import type { ProofReviewDecisionStatus } from '@/lib/proof'
+export const dynamic = 'force-dynamic'
+function resolveProofRunner(): { cmd: string; argsPrefix: string[] } {
+  const candidates = [
+    join(process.cwd(), '..', 'dist', 'cli.js'),
+    join(process.cwd(), 'dist', 'cli.js'),
+  ]
+  for (const candidate of candidates) {
+    if (existsSync(candidate)) return { cmd: process.execPath, argsPrefix: [candidate] }
+  }
+  return { cmd: 'helixevo', argsPrefix: [] }
+}
+function runProofCommand(args: string[]): Promise<{ success: boolean; output: string }> {
+  return new Promise((resolve) => {
+    const runner = resolveProofRunner()
+    const child = spawn(runner.cmd, [...runner.argsPrefix, 'proof', ...args], {
+      env: { ...process.env },
+      stdio: ['ignore', 'pipe', 'pipe'],
+    })
+    let output = ''
+    child.stdout?.on('data', (chunk: Buffer) => { output += chunk.toString() })
+    child.stderr?.on('data', (chunk: Buffer) => { output += chunk.toString() })
+    child.on('close', (code) => resolve({ success: code === 0, output }))
+    child.on('error', (err) => resolve({ success: false, output: `Error: ${err.message}` }))
+  })
+}
+export async function GET() {
+  return NextResponse.json(loadProofDashboardSummary())
+}
+export async function POST(request: Request) {
+  const body = await request.json() as {
+    action?: 'review'
+    recordId?: string
+    decision?: ProofReviewDecisionStatus
+    rationale?: string
+  }
+  if (body.action !== 'review') {
+    return NextResponse.json({ success: false, error: 'action must be review' }, { status: 400 })
+  }
+  if (!body.recordId || !body.decision) {
+    return NextResponse.json({ success: false, error: 'recordId and decision are required' }, { status: 400 })
+  }
+  const args = ['--review', body.recordId, '--decision', body.decision]
+  if (body.rationale?.trim()) args.push('--rationale', body.rationale.trim())
+  const result = await runProofCommand(args)
+  if (!result.success) {
+    return NextResponse.json({ success: false, error: result.output || 'Proof command failed' }, { status: 500 })
+  }
+  return NextResponse.json({
+    success: true,
+    output: result.output,
+    dashboard: loadProofDashboardSummary(),
+  })
+}

package/dashboard/app/api/run/route.ts CHANGED Viewed

@@ -1,5 +1,7 @@
 import { NextResponse } from 'next/server'
 import { spawn, type ChildProcess } from 'child_process'
+import { existsSync } from 'fs'
+import { join } from 'path'
 export const dynamic = 'force-dynamic'
@@ -7,6 +9,7 @@ const ALLOWED_COMMANDS: Record<string, { cmd: string; args: string[]; timeout: n
   'status':         { cmd: 'helixevo', args: ['status'],                           timeout: 15000 },
   'health':         { cmd: 'helixevo', args: ['health', '--verbose'],              timeout: 120000 },
   'metrics':        { cmd: 'helixevo', args: ['metrics', '--verbose'],             timeout: 15000 },
+  'proof':          { cmd: 'helixevo', args: ['proof', '--verbose'],               timeout: 20000 },
   'evolve':         { cmd: 'helixevo', args: ['evolve', '--verbose'],              timeout: 300000 },
   'evolve-dry':     { cmd: 'helixevo', args: ['evolve', '--dry-run', '--verbose'], timeout: 300000 },
   'generalize':     { cmd: 'helixevo', args: ['generalize', '--verbose'],          timeout: 300000 },
@@ -45,6 +48,21 @@ function buildCommandEntry(body: { command: string; project?: string; path?: str
   return null
 }
+function resolveRunRunner(): { cmd: string; argsPrefix: string[] } {
+  const candidates = [
+    join(process.cwd(), '..', 'dist', 'cli.js'),
+    join(process.cwd(), 'dist', 'cli.js'),
+  ]
+  for (const candidate of candidates) {
+    if (existsSync(candidate)) {
+      return { cmd: process.execPath, argsPrefix: [candidate] }
+    }
+  }
+  return { cmd: 'helixevo', argsPrefix: [] }
+}
 let activeProcess: ChildProcess | null = null
 let activeCommand: string | null = null
@@ -70,7 +88,8 @@ export async function POST(request: Request) {
   const stream = new ReadableStream({
     start(controller) {
-      const child = spawn(entry.cmd, entry.args, {
+      const runner = resolveRunRunner()
+      const child = spawn(runner.cmd, [...runner.argsPrefix, ...entry.args], {
         env: { ...process.env },
         stdio: ['ignore', 'pipe', 'pipe'],
       })

package/dashboard/app/coevolution/client.tsx CHANGED Viewed

@@ -9,6 +9,7 @@ import { SectionFrame } from '@/components/section-frame'
 import { OperatorLoopTrail } from '@/components/operator-loop-trail'
 import { SurfaceJumpLinks } from '@/components/surface-jump-links'
 import { NextStepEmptyState } from '@/components/next-step-empty-state'
+import type { ProofDashboardSummary } from '@/lib/proof'
 type RunState = 'idle' | 'running' | 'success' | 'error' | 'stopped'
 type CommandName = 'evolve' | 'research' | 'generalize'
@@ -127,6 +128,7 @@ interface Props {
       }
     }
   }
+  proof: ProofDashboardSummary
 }
 function consoleTone(state: RunState): 'neutral' | 'green' | 'red' | 'yellow' {
@@ -169,7 +171,7 @@ function formatMode(mode: Summary['governance']['activeMode']) {
   return mode.split('-').join(' ')
 }
-export default function CoEvolutionClient({ summary, ontology }: Props) {
+export default function CoEvolutionClient({ summary, ontology, proof }: Props) {
   const [runState, setRunState] = useState<RunState>('idle')
   const [activeCommand, setActiveCommand] = useState<CommandName | null>(null)
   const [output, setOutput] = useState('')
@@ -272,6 +274,7 @@ export default function CoEvolutionClient({ summary, ontology }: Props) {
           { label: `${summary.topologyReviews.open} topology reviews`, tone: summary.topologyReviews.open > 0 ? 'yellow' : 'green' },
           { label: `${ontology.ontologyLoop.frontier} ontology frontier`, tone: ontology.ontologyLoop.reviewOpen > 0 ? 'blue' : 'neutral' },
           { label: `${ontology.ontologyLoop.adoption.activeConcepts} active concepts`, tone: ontology.ontologyLoop.adoption.activeConcepts > 0 ? 'green' : 'neutral' },
+          { label: `${proof.summary.reviewOpen} proof review`, tone: proof.summary.reviewOpen > 0 ? 'yellow' : proof.summary.effective > 0 ? 'green' : 'neutral' },
           { label: `${summary.recentTransfers.length} recent transfers`, tone: summary.recentTransfers.length > 0 ? 'green' : 'neutral' },
           { label: formatMode(summary.governance.activeMode), tone: toneForMode(summary.governance.activeMode) },
         ]}
@@ -284,6 +287,7 @@ export default function CoEvolutionClient({ summary, ontology }: Props) {
               <div style={{ marginTop: 8, display: 'flex', gap: 6, flexWrap: 'wrap' }}>
                 <span className="badge badge-gray">source: {summary.governance.source}</span>
                 <span className="badge badge-gray">review threshold {(summary.governance.profile.reviewThreshold * 100).toFixed(0)}%</span>
+                <Link href="/proof" className="badge badge-gray" style={{ textDecoration: 'none' }}>open proof</Link>
               </div>
             </div>
             <div style={{ display: 'grid', gap: 10 }}>
@@ -307,6 +311,7 @@ export default function CoEvolutionClient({ summary, ontology }: Props) {
         <MetricCard label="Prepared topology" value={summary.topologyExecution.prepared} sublabel={`${summary.topologyExecution.applied} applied • ${summary.topologyExecution.rolledBack} rolled back`} tone={summary.topologyExecution.prepared > 0 ? 'blue' : summary.topologyExecution.applied > 0 ? 'green' : 'neutral'} icon="↑" />
         <MetricCard label="Active semantics" value={ontology.ontologyLoop.adoption.activeConcepts} sublabel={`${ontology.ontologyLoop.adoption.totalBindings} bindings • ${ontology.ontologyLoop.adoption.routesInfluenced} influenced routes`} tone={ontology.ontologyLoop.adoption.activeConcepts > 0 ? 'green' : 'neutral'} icon="◎" />
         <MetricCard label="Recorded interventions" value={summary.pressureInterventions.total} sublabel={`${summary.pressureInterventions.completed} completed • ${summary.pressureInterventions.dryRun} dry-run`} tone="blue" icon="↺" />
+        <MetricCard label="Proof review" value={proof.summary.reviewOpen} sublabel={`${proof.summary.effective} effective • ${proof.summary.regressed} regressed`} tone={proof.summary.reviewOpen > 0 ? 'yellow' : proof.summary.effective > 0 ? 'green' : 'neutral'} icon="◇" />
         <MetricCard label="Realized transfers" value={summary.recentTransfers.filter((event) => event.status === 'realized').length} sublabel={`${summary.pressureMotifs.addressed} motifs now addressed`} tone="green" icon="↑" />
       </div>

package/dashboard/app/coevolution/page.tsx CHANGED Viewed

@@ -1,4 +1,5 @@
 import { loadCoEvolutionSummary, loadOntologySummary } from '@/lib/data'
+import { loadProofDashboardSummary } from '@/lib/proof'
 import CoEvolutionClient from './client'
 export const dynamic = 'force-dynamic'
@@ -6,5 +7,6 @@ export const dynamic = 'force-dynamic'
 export default function CoEvolutionPage() {
   const summary = loadCoEvolutionSummary()
   const ontology = loadOntologySummary()
-  return <CoEvolutionClient summary={summary} ontology={ontology} />
+  const proof = loadProofDashboardSummary()
+  return <CoEvolutionClient summary={summary} ontology={ontology} proof={proof} />
 }

package/dashboard/app/commands/page.tsx CHANGED Viewed

@@ -111,7 +111,7 @@ const COMMANDS: CommandInfo[] = [
   },
   {
     name: 'graph',
-    description: 'Visualize and manage the skill network graph. Shows relationships between skills (depends, enhances, conflicts, co-evolves), and graph optimize now persists a topology review queue for merge, split, promote, and rewire candidates.',
+    description: 'Visualize and manage the skill network graph. Shows relationships between skills (depends, enhances, conflicts, co-evolves), and graph optimize now refreshes a truthful topology review queue first, then reports whether conflict enrichment completed fully or only partially.',
     usage: 'helixevo graph [options]',
     examples: [
       { cmd: 'helixevo graph', desc: 'Show skill network in terminal (instant)' },
@@ -130,6 +130,7 @@ const COMMANDS: CommandInfo[] = [
     category: 'network',
     needsLLM: true,
     runnable: { command: 'graph-rebuild', label: 'Rebuild Graph', icon: 'M13.828 10.172a4 4 0 00-5.656 0l-4 4a4 4 0 105.656 5.656l1.102-1.101m-.758-4.899a4 4 0 005.656 0l4-4a4 4 0 00-5.656-5.656l-1.1 1.1', color: 'var(--purple)' },
+    note: 'graph --optimize now distinguishes queue refresh from conflict enrichment. In degraded mode it can still surface a real review queue while clearly marking enrichment as partial rather than silently pretending full optimize succeeded.',
   },
   {
     name: 'ontology',
@@ -178,7 +179,7 @@ const COMMANDS: CommandInfo[] = [
   },
   {
     name: 'research',
-    description: 'Proactive skill discovery via web research. Identifies gaps in your skill network, generates hypotheses, searches the web for solutions, and creates draft skills from discoveries.',
+    description: 'Proactive skill discovery via web research. Identifies gaps in your skill network, generates hypotheses, searches the web for solutions, and creates draft skills from discoveries. This lane remains explicitly Claude-scoped because it depends on Claude tool-enabled web search rather than provider-neutral prompting.',
     usage: 'helixevo research [options]',
     examples: [
       { cmd: 'helixevo research', desc: 'Run proactive research' },
@@ -212,7 +213,7 @@ const COMMANDS: CommandInfo[] = [
   },
   {
     name: 'metrics',
-    description: 'Show correction rates, skill improvement trends, and evolution impact over time. Helps you understand how your skills are improving and where attention is needed.',
+    description: 'Show correction rates, skill improvement trends, and legacy evolution impact over time. Metrics remains the quantitative prove surface, while the newer proof layer now expands outcome review across interventions, topology, transfer, and semantic adoption.',
     usage: 'helixevo metrics [options]',
     examples: [
       { cmd: 'helixevo metrics', desc: 'Show summary metrics' },
@@ -225,9 +226,30 @@ const COMMANDS: CommandInfo[] = [
     needsLLM: false,
     runnable: { command: 'metrics', label: 'Show Metrics', icon: 'M9 19v-6a2 2 0 00-2-2H5a2 2 0 00-2 2v6a2 2 0 002 2h2a2 2 0 002-2zm0 0V9a2 2 0 012-2h2a2 2 0 012 2v10m-6 0a2 2 0 002 2h2a2 2 0 002-2m0 0V5a2 2 0 012-2h2a2 2 0 012 2v14a2 2 0 01-2 2h-2a2 2 0 01-2-2z', color: 'var(--text-secondary)' },
   },
+  {
+    name: 'proof',
+    description: 'Review bounded outcome attribution across interventions, transfer, topology execution, semantic adoption, and legacy evolution impact. Proof is where the newer brain loop becomes operator-reviewable instead of relying only on passive heuristics.',
+    usage: 'helixevo proof [options]',
+    examples: [
+      { cmd: 'helixevo proof --status', desc: 'Show proof summary plus the current open review queue' },
+      { cmd: 'helixevo proof --status --verbose', desc: 'Show detailed proof records, reasons, and next actions' },
+      { cmd: 'helixevo proof --review <recordId> --decision verify', desc: 'Verify a derived proof record after operator review' },
+    ],
+    options: [
+      { flag: '--status', desc: 'Show proof summary and open review state' },
+      { flag: '--review <recordId>', desc: 'Review a derived proof record' },
+      { flag: '--decision <verify|defer|contest>', desc: 'Decision for --review' },
+      { flag: '--rationale <text>', desc: 'Optional rationale for the proof review decision' },
+      { flag: '--verbose', desc: 'Show detailed proof records and derived reasons' },
+    ],
+    category: 'analysis',
+    needsLLM: false,
+    runnable: { command: 'proof', label: 'Open Proof State', icon: 'M9 17v-2m3 2v-4m3 4v-6m2 10H7a2 2 0 01-2-2V5a2 2 0 012-2h5.586a1 1 0 01.707.293l5.414 5.414a1 1 0 01.293.707V19a2 2 0 01-2 2z', color: 'var(--blue)' },
+    note: 'Proof stays bounded and reviewable. Measuring means live but not yet proven, regressed means explicit negative evidence, and verified strengthens review trust without pretending stronger causal certainty than the evidence supports.',
+  },
   {
     name: 'status',
-    description: 'Quick overview of system state: total skills, frontier size, failure count, skill tests, and network health. Like a health check but without LLM analysis.',
+    description: 'Quick overview of system state: total skills, frontier size, failure count, skill tests, provider control health, and the last recorded provider execution. Like a health check but without deep model analysis.',
     usage: 'helixevo status',
     examples: [
       { cmd: 'helixevo status', desc: 'Show system status' },
@@ -313,7 +335,7 @@ const WORKFLOW = [
   { label: 'evolve', desc: 'Improve skills', tone: 'green' as const },
   { label: 'generalize', desc: 'Abstract patterns', tone: 'purple' as const },
   { label: 'graph --rebuild', desc: 'Map relationships', tone: 'yellow' as const },
-  { label: 'health', desc: 'Assess quality', tone: 'blue' as const },
+  { label: 'proof --status', desc: 'Review outcomes', tone: 'blue' as const },
 ]
 const WORKFLOW_RECIPES = [
@@ -341,6 +363,12 @@ const WORKFLOW_RECIPES = [
     summary: 'Refresh structural review candidates, prepare accepted safe plans, apply them, and keep rollback available.',
     steps: ['helixevo graph --optimize', 'helixevo topology --prepare <candidateId>', 'helixevo topology --apply <planId>'],
   },
+  {
+    title: 'Proof review loop',
+    tone: 'blue' as const,
+    summary: 'Inspect outcome attribution across the live loop, then verify, defer, or contest proof records explicitly.',
+    steps: ['helixevo proof --status', 'helixevo proof --status --verbose', 'helixevo proof --review <recordId> --decision verify'],
+  },
 ]
 const CATEGORIES: Array<{
@@ -384,13 +412,14 @@ export default function CommandsPage() {
           { label: `${COMMANDS.length} commands`, tone: 'blue' },
           { label: `${stats.llmCommands} need LLM access`, tone: 'purple' },
           { label: `${stats.runnableCommands} runnable here`, tone: 'green' },
+          { label: 'Claude default • Codex + Ollama optional', tone: 'blue' },
           { label: `${stats.optionFlags} documented flags`, tone: 'yellow' },
         ]}
         actions={
           <div className="hero-note-card">
             <div className="hero-note-label">Recommended operating loop</div>
-            <div className="hero-note-title">Project Setup → Watch → Co-Evolution → Ontology → Topology</div>
-            <div className="hero-note-copy">Use the Commands page as the practical bridge between HelixEvo’s CLI, semantic control loop, and the premium dashboard cockpit.</div>
+            <div className="hero-note-title">Project Setup → Watch → Co-Evolution → Ontology → Topology → Proof</div>
+            <div className="hero-note-copy">Use the Commands page as the practical bridge between HelixEvo’s CLI, semantic control loop, structural control, and the new prove surface.</div>
           </div>
         }
       />
@@ -402,6 +431,28 @@ export default function CommandsPage() {
         <MetricCard label="Flags documented" value={stats.optionFlags} sublabel="Total option flags surfaced across the current command reference." tone="yellow" />
       </div>
+      <SectionFrame
+        eyebrow="Provider control"
+        title="Claude default, Codex and Ollama optional"
+        description="Shared prompt-in / text-out operations now run through provider control. Claude Code remains the default provider, while Codex and Ollama can be enabled for supported operations. Claude-only web search and research tooling stay explicitly Claude-scoped."
+        tone="blue"
+      >
+        <div className="grid-3" style={{ gap: 12 }}>
+          <div className="card" style={{ padding: '18px 18px 16px' }}>
+            <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--blue)', marginBottom: 8 }}>Default path</div>
+            <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.65 }}>Claude Code stays the default provider. Existing CLI flows keep working without forcing a provider switch.</div>
+          </div>
+          <div className="card" style={{ padding: '18px 18px 16px' }}>
+            <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--purple)', marginBottom: 8 }}>Optional providers</div>
+            <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.65 }}>GPT Codex and Ollama can now be enabled for shared chat / JSON / judge-style paths when you want alternate cloud or local execution.</div>
+          </div>
+          <div className="card" style={{ padding: '18px 18px 16px' }}>
+            <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--yellow)', marginBottom: 8 }}>Truthfulness rule</div>
+            <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.65 }}>Fallback is explicit, not silent. If a command is Claude-scoped or a fallback path was used, status and dashboard surfaces now record that truth explicitly.</div>
+          </div>
+        </div>
+      </SectionFrame>
       <SectionFrame
         eyebrow="Workflow framing"
         title="Typical operating sequence"
@@ -433,7 +484,7 @@ export default function CommandsPage() {
       <SectionFrame
         eyebrow="Operator recipes"
         title="Fast command loops for the live product"
-        description="These compact sequences make the M9 dashboard and CLI feel like one coordinated operating surface instead of separate references."
+        description="These compact sequences make the current dashboard, CLI, and prove surface feel like one coordinated operating system instead of separate references."
         tone="blue"
       >
         <div className="grid-2" style={{ gap: 14 }}>