helixevo 0.4.0 → 0.4.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -4,6 +4,11 @@ All notable changes to HelixEvo are documented here.
4
4
 
5
5
  ## [Unreleased]
6
6
 
7
+ ## [0.4.1] - 2026-03-24
8
+
9
+ ### Changed
10
+ - Rebuilt the dashboard Guide into a clearer operator-and-theory manual with a brain-stack explanation, end-to-end adaptation loop, dashboard surface map, grouped memory model, explicit maturity/safety boundaries, glossary, and current command coverage including `project-setup` and `topology`
11
+
7
12
  ## [0.4.0] - 2026-03-24
8
13
 
9
14
  ### Added
@@ -9,6 +9,9 @@ const TOC = [
9
9
  { id: 'quickstart', label: 'Quick Start', icon: '▸' },
10
10
  { id: 'commands', label: 'Commands', icon: '⌘' },
11
11
  { id: 'architecture', label: 'Architecture', icon: '◎' },
12
+ { id: 'brainstack', label: 'Brain Stack', icon: '◈' },
13
+ { id: 'loop', label: 'Adaptation Loop', icon: '↺' },
14
+ { id: 'surfaces', label: 'Surface Map', icon: '▦' },
12
15
  { id: 'watch', label: 'Always-On Learning', icon: '⚡' },
13
16
  { id: 'evolution', label: 'Evolution Pipeline', icon: '⟳' },
14
17
  { id: 'judges', label: 'Multi-Judge System', icon: '⚖' },
@@ -21,8 +24,10 @@ const TOC = [
21
24
  { id: 'network', label: 'Skill Network', icon: '⬡' },
22
25
  { id: 'config', label: 'Configuration', icon: '⚙' },
23
26
  { id: 'data', label: 'Data & Storage', icon: '◫' },
27
+ { id: 'maturity', label: 'Maturity & Safety', icon: '⛉' },
24
28
  { id: 'manage', label: 'Skill Management', icon: '✎' },
25
29
  { id: 'craft', label: 'Craft Agent', icon: '⚡' },
30
+ { id: 'glossary', label: 'Glossary', icon: '✦' },
26
31
  { id: 'faq', label: 'FAQ', icon: '?' },
27
32
  ]
28
33
 
@@ -229,7 +234,7 @@ export default function GuidePage() {
229
234
  <div>
230
235
  <div className="guide-toc-title">Field Guide</div>
231
236
  <div style={{ fontSize: 11, color: 'var(--text-muted)', marginTop: 4, lineHeight: 1.45 }}>
232
- CLI, evolution, research, and network operations in one manual.
237
+ Operator manual and brain architecture map for the current HelixEvo release.
233
238
  </div>
234
239
  </div>
235
240
  <div className="guide-toc-version">v{VERSION}</div>
@@ -240,8 +245,8 @@ export default function GuidePage() {
240
245
  </div>
241
246
  <div style={{ display: 'flex', gap: 6, flexWrap: 'wrap' }}>
242
247
  <a href="#quickstart" className="hero-chip hero-chip-blue" style={{ textDecoration: 'none' }}>Quick Start</a>
243
- <a href="#watch" className="hero-chip hero-chip-green" style={{ textDecoration: 'none' }}>Always-On</a>
244
- <a href="#evolution" className="hero-chip hero-chip-purple" style={{ textDecoration: 'none' }}>Evolution</a>
248
+ <a href="#brainstack" className="hero-chip hero-chip-green" style={{ textDecoration: 'none' }}>Brain Stack</a>
249
+ <a href="#surfaces" className="hero-chip hero-chip-purple" style={{ textDecoration: 'none' }}>Surface Map</a>
245
250
  </div>
246
251
  </div>
247
252
  {TOC.map(item => (
@@ -261,227 +266,244 @@ export default function GuidePage() {
261
266
  <div className="guide-content">
262
267
  {/* Hero */}
263
268
  <div className="guide-hero">
264
- <div className="guide-hero-badge">Documentation</div>
265
- <h1 className="guide-hero-title">HelixEvo Guide</h1>
269
+ <div className="guide-hero-badge">Operator Manual</div>
270
+ <h1 className="guide-hero-title">HelixEvo Brain Guide</h1>
266
271
  <p className="guide-hero-desc">
267
- A comprehensive guide to the self-evolving skill ecosystem for AI agents.
268
- Capture failures, evolve skills through multi-judge evaluation, and maintain
269
- a Pareto frontier of optimal configurations.
272
+ A structured guide to the current HelixEvo brain: semantic kernel, observation memory,
273
+ pressure, governed response, transfer, topology review, and rollbackable reviewed execution.
274
+ Use it as both an operator manual and a theory-to-product map.
270
275
  </p>
271
276
  <div className="hero-chip-row" style={{ marginTop: 18 }}>
272
277
  <span className="hero-chip hero-chip-blue">v{VERSION}</span>
273
278
  <span className="hero-chip hero-chip-purple">{TOC.length} sections</span>
274
- <span className="hero-chip hero-chip-green">CLI + dashboard + network</span>
275
- <span className="hero-chip hero-chip-yellow">operator reference</span>
279
+ <span className="hero-chip hero-chip-green">operator path + theory path</span>
280
+ <span className="hero-chip hero-chip-yellow">governed plasticity reference</span>
276
281
  </div>
277
282
 
278
283
  <div className="grid-3" style={{ marginTop: 24, marginBottom: 24 }}>
279
284
  <div className="card" style={{ padding: '18px 18px 16px' }}>
280
- <div style={{ fontSize: 10, fontWeight: 700, color: 'var(--text-muted)', textTransform: 'uppercase', letterSpacing: 0.7, marginBottom: 8 }}>Start here</div>
281
- <div style={{ fontSize: 15, fontWeight: 700, color: 'var(--text)', marginBottom: 6 }}>Quick Start → Watch → Evolve</div>
282
- <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.6 }}>If you are new, this path gets HelixEvo initialized, learning from corrections, and producing measurable skill improvements.</div>
285
+ <div style={{ fontSize: 10, fontWeight: 700, color: 'var(--text-muted)', textTransform: 'uppercase', letterSpacing: 0.7, marginBottom: 8 }}>Start operating</div>
286
+ <div style={{ fontSize: 15, fontWeight: 700, color: 'var(--text)', marginBottom: 6 }}>Project setup → Watch / Capture Co-Evolution → Topology</div>
287
+ <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.6 }}>This is the shortest path to seeing pressure, governed response, and structural control in the live product.</div>
283
288
  </div>
284
289
  <div className="card" style={{ padding: '18px 18px 16px' }}>
285
- <div style={{ fontSize: 10, fontWeight: 700, color: 'var(--text-muted)', textTransform: 'uppercase', letterSpacing: 0.7, marginBottom: 8 }}>Core mental model</div>
286
- <div style={{ fontSize: 15, fontWeight: 700, color: 'var(--text)', marginBottom: 6 }}>Capture failures, mutate skills, validate aggressively</div>
287
- <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.6 }}>The system only gets better when real corrections are fed into evaluation, regression testing, and monitored deployment.</div>
290
+ <div style={{ fontSize: 10, fontWeight: 700, color: 'var(--text-muted)', textTransform: 'uppercase', letterSpacing: 0.7, marginBottom: 8 }}>Understand the brain</div>
291
+ <div style={{ fontSize: 15, fontWeight: 700, color: 'var(--text)', marginBottom: 6 }}>Read the stack, then trace one signal through the loop</div>
292
+ <div style={{ fontSize: 12.5, color: 'var(--text-dim)', lineHeight: 1.6 }}>The current system is best understood as layered cognition: semantic kernel observation pressure response → transfer → governance → topology.</div>
288
293
  </div>
289
294
  <div className="card" style={{ padding: '18px 18px 16px' }}>
290
295
  <div style={{ fontSize: 10, fontWeight: 700, color: 'var(--text-muted)', textTransform: 'uppercase', letterSpacing: 0.7, marginBottom: 8 }}>Fast jumps</div>
291
296
  <div style={{ display: 'flex', gap: 8, flexWrap: 'wrap' }}>
292
297
  <a href="#quickstart" className="hero-chip hero-chip-blue" style={{ textDecoration: 'none' }}>Quick Start</a>
293
- <a href="#commands" className="hero-chip hero-chip-purple" style={{ textDecoration: 'none' }}>Commands</a>
294
- <a href="#architecture" className="hero-chip hero-chip-green" style={{ textDecoration: 'none' }}>Architecture</a>
295
- <a href="#network" className="hero-chip hero-chip-yellow" style={{ textDecoration: 'none' }}>Skill Network</a>
298
+ <a href="#brainstack" className="hero-chip hero-chip-purple" style={{ textDecoration: 'none' }}>Brain Stack</a>
299
+ <a href="#surfaces" className="hero-chip hero-chip-green" style={{ textDecoration: 'none' }}>Surface Map</a>
300
+ <a href="#maturity" className="hero-chip hero-chip-yellow" style={{ textDecoration: 'none' }}>Safety</a>
296
301
  </div>
297
302
  </div>
298
303
  </div>
299
304
 
300
305
  <div className="guide-hero-features">
301
- <FeatureCard icon="⚡" title="Always-On" desc="Auto-captures corrections and triggers evolution in real-time" />
302
- <FeatureCard icon="" title="Network Evolution" desc="Skills and projects co-evolve as an interconnected organism" />
303
- <FeatureCard icon="📊" title="Closed-Loop Proof" desc="Tracks whether evolution actually reduces correction rates" />
304
- <FeatureCard icon="" title="Auto-Generalize" desc="Cross-project patterns automatically become abstract skills" />
306
+ <FeatureCard icon="⚡" title="Pressure-Aware" desc="Failures and project gaps become explicit adaptation demand rather than hidden friction" />
307
+ <FeatureCard icon="" title="Governed Routing" desc="Response lanes stay visible: research, specialize, evolve, generalize, and manual review" />
308
+ <FeatureCard icon="" title="Topology Control" desc="Reviewed structural changes can now move through prepare, apply, and rollback" />
309
+ <FeatureCard icon="📊" title="Closed-Loop Proof" desc="Backtesting and operator visibility show whether the brain is actually improving" />
305
310
  </div>
306
311
  </div>
307
312
 
308
313
  {/* ─── Overview ─── */}
309
- <Section id="overview" title="Overview" subtitle="What HelixEvo does and why it exists.">
314
+ <Section id="overview" title="Overview" subtitle="HelixEvo is now best understood as a governed co-evolving brain.">
310
315
  <p className="guide-text">
311
- HelixEvo is a self-improving system that manages SKILL.md files for AI agents. When an agent makes a mistake
312
- and gets corrected, HelixEvo captures that failure, clusters similar failures together, and proposes skill
313
- improvements. Every proposed change goes through rigorous multi-judge evaluation and regression testing before
314
- being deployed with a 3-day canary period.
316
+ HelixEvo still captures failures, proposes skill mutations, evaluates them with judges, and deploys improvements carefully.
317
+ What changed over the recent milestone arc is that these mutation mechanics now live inside a larger architecture that senses pressure,
318
+ routes intervention under governance, records transfer evidence, reviews topology, and can execute a safe reviewed subset of structural change with rollback.
315
319
  </p>
316
320
  <p className="guide-text">
317
- Built on ideas from <strong>EvoSkill</strong> and <strong>AutoResearch</strong>, HelixEvo implements a
318
- three-directional evolution model:
321
+ That means the current product should not be explained as only “capture → evolve → validate.” The more truthful frame is:
322
+ <strong> semantic kernel → observation → pressure → response → transfer → governance → topology review → topology execution → operator surfaces.</strong>
319
323
  </p>
320
324
  <div className="guide-directions">
321
325
  <div className="guide-direction">
322
- <div className="guide-direction-arrow" style={{ color: 'var(--purple)' }}>↑</div>
326
+ <div className="guide-direction-arrow" style={{ color: 'var(--blue)' }}>◎</div>
323
327
  <div>
324
- <strong>Generalize</strong>
325
- <span>Detect cross-project patterns and promote them to abstract, reusable skills</span>
328
+ <strong>Sense</strong>
329
+ <span>Failures, project analysis, and activation traces reveal where the system is under demand</span>
326
330
  </div>
327
331
  </div>
328
332
  <div className="guide-direction">
329
- <div className="guide-direction-arrow" style={{ color: 'var(--green)' }}>↓</div>
333
+ <div className="guide-direction-arrow" style={{ color: 'var(--purple)' }}>⇄</div>
330
334
  <div>
331
- <strong>Specialize</strong>
332
- <span>Create project-specific skills from domain skills combined with local failures</span>
335
+ <strong>Respond</strong>
336
+ <span>Governed routes choose whether pressure should drive research, specialize, evolve, generalize, or manual review</span>
333
337
  </div>
334
338
  </div>
335
339
  <div className="guide-direction">
336
- <div className="guide-direction-arrow" style={{ color: 'var(--blue)' }}>↔</div>
340
+ <div className="guide-direction-arrow" style={{ color: 'var(--green)' }}>⛉</div>
337
341
  <div>
338
- <strong>Lateral</strong>
339
- <span>Merge, split, and resolve conflicts between skills at the same level</span>
342
+ <strong>Restructure</strong>
343
+ <span>Accepted topology review can now become prepared, applied, and rollbackable structural change</span>
340
344
  </div>
341
345
  </div>
342
346
  </div>
343
347
  <Callout type="tip">
344
- HelixEvo now exposes a visible <strong>brain foundation</strong> in the dashboard: ontology-backed summaries,
345
- activation traces, evolution artifacts, pressure signals, governance steering, topology review, and reviewed topology execution all feed the Overview,
346
- Skill Network, Topology, Projects, Research, and Evolution surfaces.
348
+ Read this guide in two modes: <strong>operator path</strong> if you want to know what to run and which tab to use,
349
+ or <strong>theory path</strong> if you want to understand how ontology, pressure, governance, transfer, and topology now fit together.
347
350
  </Callout>
348
351
  </Section>
349
352
 
350
353
  {/* ─── Quick Start ─── */}
351
- <Section id="quickstart" title="Quick Start" subtitle="Get up and running in under 5 minutes.">
354
+ <Section id="quickstart" title="Quick Start" subtitle="The shortest operator path through the current HelixEvo brain loop.">
352
355
  <Callout type="info">
353
356
  <strong>Prerequisites:</strong> Node.js 18+, <a href="https://bun.sh">Bun</a> (for building),
354
357
  and <a href="https://docs.anthropic.com/en/docs/claude-code">Claude CLI</a> with a Claude Max plan.
355
358
  Prefer <code>claude auth login</code> managed credentials over exporting a hardcoded <code>CLAUDE_CODE_OAUTH_TOKEN</code>.
356
359
  </Callout>
357
360
 
358
- <Step n={1} title="Install HelixEvo">
359
- <Code title="Terminal">{`# From npm (recommended)
361
+ <Step n={1} title="Install and initialize HelixEvo">
362
+ <Code title="Terminal">{`# Install from npm
360
363
  npm install -g helixevo
361
364
 
362
- # Or from source
363
- git clone https://github.com/danielchen26/helixevo.git
364
- cd helixevo && npm install && npm run build && npm link`}</Code>
365
- </Step>
366
-
367
- <Step n={2} title="Initialize your skill ecosystem">
368
- <Code title="Terminal">{`helixevo init`}</Code>
365
+ # Initialize your skill ecosystem
366
+ helixevo init`}</Code>
369
367
  <p className="guide-text-sm">
370
- This scans your existing SKILL.md files (from <code>~/.agents/skills/</code>), imports them into HelixEvo,
371
- and generates skill tests for each skill. It also creates the data directory at <code>~/.helix/</code>.
368
+ Initialization imports existing skills, generates regression tests, and creates the runtime memory under <code>~/.helix/</code>.
372
369
  </p>
373
370
  </Step>
374
371
 
375
- <Step n={3} title="Capture failures from a session">
376
- <Code title="Terminal">{`helixevo capture path/to/session.json --project myapp`}</Code>
372
+ <Step n={2} title="Analyze a real project so the brain has context">
373
+ <Code title="Terminal">{`# Analyze the current repository
374
+ helixevo project-setup .
375
+
376
+ # Or analyze another project path
377
+ helixevo project-setup ~/myapp --verbose`}</Code>
377
378
  <p className="guide-text-sm">
378
- Extracts corrections, mode switches, retries, and manual edits from Claude session files and records them
379
- as structured failure records.
379
+ <code>project-setup</code> is now a key operator entry point because it persists project context, activation traces, and pressure signals instead of leaving the dashboard blind to real project demand.
380
380
  </p>
381
381
  </Step>
382
382
 
383
- <Step n={4} title="Evolve skills">
384
- <Code title="Terminal">{`helixevo evolve --verbose`}</Code>
383
+ <Step n={3} title="Feed the observation layer">
384
+ <Code title="Terminal">{`# Continuous observation
385
+ helixevo watch --project myapp
386
+
387
+ # Or one-off capture
388
+ helixevo capture path/to/session.json --project myapp`}</Code>
385
389
  <p className="guide-text-sm">
386
- Clusters failures, generates proposals, runs 3-judge evaluation and regression tests, then deploys
387
- improvements as canaries. Use <code>--dry-run</code> to preview without applying.
390
+ Observation is where corrections become structured memory. Without this layer, the response and topology layers have nothing truthful to operate on.
388
391
  </p>
389
392
  </Step>
390
393
 
391
- <Step n={5} title="Explore the results">
392
- <Code title="Terminal">{`# View the skill network
393
- helixevo graph
394
+ <Step n={4} title="Run the response loop">
395
+ <Code title="Terminal">{`# Proposal-level evolution
396
+ helixevo evolve --verbose
394
397
 
395
- # Open this dashboard
396
- helixevo dashboard
397
-
398
- # Prefer a different port if you want
399
- helixevo dashboard --port 3900
398
+ # Structural review + execution status
399
+ helixevo graph --optimize
400
+ helixevo topology --status`}</Code>
401
+ <p className="guide-text-sm">
402
+ <code>evolve</code> mutates skills directly, while <code>graph --optimize</code> and <code>topology</code> expose the higher-level structural review and execution loop.
403
+ </p>
404
+ </Step>
400
405
 
401
- # Check system health
402
- helixevo status`}</Code>
406
+ <Step n={5} title="Use the dashboard as the operator cockpit">
407
+ <Code title="Terminal">{`helixevo dashboard
408
+ # Then visit Overview → Co-Evolution → Topology → Network`}</Code>
403
409
  <p className="guide-text-sm">
404
- The dashboard prefers port <code>3847</code>. If that port is already occupied, HelixEvo now reuses a known managed dashboard or falls forward to the next open port instead of failing immediately.
410
+ The dashboard prefers port <code>3847</code>, reuses a known managed dashboard when possible, and otherwise falls forward to the next open port.
411
+ Use <strong>Co-Evolution</strong> for routed pressure and <strong>Topology</strong> for reviewed structural control.
405
412
  </p>
406
413
  </Step>
407
414
  </Section>
408
415
 
409
416
  {/* ─── Commands ─── */}
410
- <Section id="commands" title="Commands" subtitle="Complete CLI reference for every HelixEvo command.">
417
+ <Section id="commands" title="Commands" subtitle="Command surfaces grouped by the job they do in the current brain loop.">
418
+ <p className="guide-text">
419
+ The CLI is now easier to reason about if you group commands by role: <strong>observe</strong>, <strong>respond</strong>,
420
+ <strong>restructure</strong>, and <strong>prove</strong>. The Guide now explicitly includes the current release-critical additions
421
+ such as <code>project-setup</code> and <code>topology</code>.
422
+ </p>
411
423
  <div className="guide-command-grid">
412
424
  {[
413
425
  {
414
- cmd: 'helixevo watch',
415
- desc: 'Always-on learning mode. Watches for corrections in real-time, auto-captures failures, and triggers evolution when thresholds are met.',
416
- flags: ['--project <name>', '--events <path>', '--verbose', '--no-evolve'],
417
- },
418
- {
419
- cmd: 'helixevo metrics',
420
- desc: 'Show correction rates per skill, trends over time, and whether each evolution actually reduced corrections. The proof that HelixEvo works.',
421
- flags: ['--verbose'],
426
+ cmd: 'helixevo init',
427
+ desc: 'Initialize the skill ecosystem, generate tests, and build the first local brain memory under ~/.helix/.',
428
+ flags: ['--skills-paths <paths...>', '--skip-tests'],
422
429
  },
423
430
  {
424
- cmd: 'helixevo health',
425
- desc: 'Assess network health: cohesion, coverage, balance, cross-project transfer rate. Detects gaps, orphans, and generalization candidates.',
426
- flags: ['--verbose'],
431
+ cmd: 'helixevo project-setup <path>',
432
+ desc: 'Analyze a project, persist a project profile, emit activation traces, and turn missing capability into project-level pressure.',
433
+ flags: ['--dry-run', '--verbose'],
427
434
  },
428
435
  {
429
- cmd: 'helixevo init',
430
- desc: 'Import existing skills and generate skill tests. Scans ~/.agents/skills/ and creates the HelixEvo data directory.',
431
- flags: ['--verbose'],
436
+ cmd: 'helixevo watch',
437
+ desc: 'Always-on observation mode. Watches for corrections in real time, captures failures automatically, and triggers evolution when thresholds are met.',
438
+ flags: ['--project <name>', '--events <path>', '--verbose', '--no-evolve'],
432
439
  },
433
440
  {
434
441
  cmd: 'helixevo capture <session>',
435
- desc: 'Extract failure records from a Claude session JSON file. Detects verbal corrections, manual edits, mode switches, and retries.',
442
+ desc: 'Extract failure records from a session file when you want one-off observation instead of continuous watch mode.',
436
443
  flags: ['--project <name>', '--verbose'],
437
444
  },
438
445
  {
439
446
  cmd: 'helixevo evolve',
440
- desc: 'Run the full evolution cycle: cluster failures, generate proposals, evaluate with 3 judges, regression test, and deploy canaries.',
447
+ desc: 'Run the proposal-level mutation pipeline: cluster failures, propose changes, judge them, regression test them, and admit winners.',
441
448
  flags: ['--dry-run', '--verbose', '--max-proposals <n>'],
442
449
  },
443
450
  {
444
- cmd: 'helixevo generalize',
445
- desc: 'Promote cross-project patterns upward. Detects skills that solve similar problems across projects and creates abstract parent skills.',
446
- flags: ['--dry-run', '--verbose'],
451
+ cmd: 'helixevo research',
452
+ desc: 'Run the discovery lane for missing capability, frontier practices, and experimental skill opportunities.',
453
+ flags: ['--project <path>', '--max-hypotheses <n>', '--dry-run', '--verbose'],
447
454
  },
448
455
  {
449
456
  cmd: 'helixevo specialize --project <name>',
450
- desc: 'Create project-specific skills by combining domain skills with project-level failure patterns.',
457
+ desc: 'Push knowledge downward into project-specific variants when local context matters more than shared abstraction.',
451
458
  flags: ['--dry-run', '--verbose'],
452
459
  },
453
460
  {
454
- cmd: 'helixevo research',
455
- desc: 'Proactive web research to discover new skill opportunities. Searches for best practices, analyzes gaps, and generates experimental skills.',
456
- flags: ['--project <path>', '--max-hypotheses <n>', '--dry-run', '--verbose'],
461
+ cmd: 'helixevo generalize',
462
+ desc: 'Push recurring multi-project motifs upward into reusable shared knowledge with transfer evidence.',
463
+ flags: ['--dry-run', '--verbose'],
457
464
  },
458
465
  {
459
466
  cmd: 'helixevo graph',
460
- desc: 'Visualize the skill network. Shows nodes (skills), edges (relationships), and clusters.',
461
- flags: ['--mermaid', '--obsidian <vault>', '--rebuild', '--optimize'],
467
+ desc: 'Inspect the network and, with --optimize, refresh topology review candidates for structural issues and opportunities.',
468
+ flags: ['--mermaid', '--obsidian <vault>', '--rebuild', '--optimize', '--verbose'],
469
+ },
470
+ {
471
+ cmd: 'helixevo topology',
472
+ desc: 'Move accepted topology review through prepare, apply, and rollback. This is the explicit control surface for reviewed structural execution.',
473
+ flags: ['--status', '--prepare <candidateId>', '--apply <planId>', '--rollback <planId>', '--verbose'],
474
+ },
475
+ {
476
+ cmd: 'helixevo health',
477
+ desc: 'Assess cohesion, coverage, balance, and transfer so you can tell whether the network is learning in a reusable and connected way.',
478
+ flags: ['--verbose'],
479
+ },
480
+ {
481
+ cmd: 'helixevo metrics',
482
+ desc: 'Measure whether evolution actually reduces corrections over time. This is the primary proof command.',
483
+ flags: ['--verbose'],
462
484
  },
463
485
  {
464
486
  cmd: 'helixevo dashboard',
465
- desc: 'Open the interactive web dashboard. HelixEvo prefers localhost:3847, reuses a known managed dashboard if one is already there, and otherwise falls forward to the next open port. Before launch, it also checks npm for a newer version and auto-updates itself unless you opt out.',
487
+ desc: 'Open the premium operator dashboard. It prefers localhost:3847, reuses a known managed dashboard, falls forward if needed, and can auto-update before launch.',
466
488
  flags: ['--background', '--stop', '--port <port>', '--no-auto-update'],
467
489
  },
468
490
  {
469
491
  cmd: 'helixevo status',
470
- desc: 'Show system health: skill count, failure stats, frontier status, canary deployments, and recent evolution.',
492
+ desc: 'Show fast system state without deeper LLM analysis.',
471
493
  flags: [],
472
494
  },
473
495
  {
474
496
  cmd: 'helixevo report',
475
- desc: 'Generate a detailed evolution report. Can output to file, Obsidian vault, or Craft Agent session.',
476
- flags: ['--verbose'],
497
+ desc: 'Generate a human-readable summary of evolution activity over a time window.',
498
+ flags: ['--days <n>', '--output <path>'],
477
499
  },
478
- ].map(c => (
500
+ ].map((c) => (
479
501
  <div key={c.cmd} className="guide-command-card">
480
502
  <code className="guide-command-name">{c.cmd}</code>
481
503
  <p className="guide-command-desc">{c.desc}</p>
482
504
  {c.flags.length > 0 && (
483
505
  <div className="guide-command-flags">
484
- {c.flags.map(f => <code key={f} className="guide-command-flag">{f}</code>)}
506
+ {c.flags.map((f) => <code key={f} className="guide-command-flag">{f}</code>)}
485
507
  </div>
486
508
  )}
487
509
  </div>
@@ -489,46 +511,179 @@ helixevo status`}</Code>
489
511
  </div>
490
512
 
491
513
  <Callout type="tip">
492
- Most commands support <code>--dry-run</code> to preview changes without applying them,
493
- and <code>--verbose</code> to see detailed LLM interactions and scoring.
514
+ A good mental grouping is: <strong>observe</strong> with <code>project-setup</code>, <code>watch</code>, and <code>capture</code>;
515
+ <strong>respond</strong> with <code>research</code>, <code>specialize</code>, <code>evolve</code>, and <code>generalize</code>;
516
+ <strong>restructure</strong> with <code>graph --optimize</code> plus <code>topology</code>; and <strong>prove</strong> with <code>metrics</code>, <code>health</code>, and <code>report</code>.
494
517
  </Callout>
495
518
  </Section>
496
519
 
497
520
  {/* ─── Architecture ─── */}
498
- <Section id="architecture" title="Architecture" subtitle="How the system is structured and how data flows through it.">
499
- <h3 className="guide-h3">Pipeline</h3>
521
+ <Section id="architecture" title="Architecture" subtitle="How the classic evolution pipeline now sits inside a broader co-evolving brain.">
522
+ <h3 className="guide-h3">Classic mutation pipeline</h3>
500
523
  <p className="guide-text">
501
- Every evolution cycle follows this pipeline. Each stage acts as a quality gate proposals must
502
- pass all stages to be deployed.
524
+ The original HelixEvo mutation pipeline is still real and still important. Capture, clustering, proposal generation,
525
+ judging, regression, and canary deployment remain the core mutation engine. What changed in the recent milestone arc is that
526
+ this engine now operates inside a bigger system that senses pressure, routes intervention, persists governance,
527
+ reviews topology, and executes a safe subset of structural changes with rollback.
503
528
  </p>
504
529
  <ArchitectureDiagram />
505
530
 
506
- <h3 className="guide-h3">Brain Foundation</h3>
531
+ <h3 className="guide-h3">Brain foundation</h3>
507
532
  <div className="grid-2" style={{ gap: 12, marginBottom: 20 }}>
508
- <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--blue)' }}>
509
- <div style={{ fontSize: 11, fontWeight: 700, color: 'var(--blue)', textTransform: 'uppercase', letterSpacing: 1 }}>Ontology</div>
510
- <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>Defines the stable semantic kernel for skills, projects, tasks, capabilities, artifacts, and mutations.</div>
511
- </div>
512
- <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--green)' }}>
513
- <div style={{ fontSize: 11, fontWeight: 700, color: 'var(--green)', textTransform: 'uppercase', letterSpacing: 1 }}>Activation traces</div>
514
- <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>Capture which skills and gaps were active during failure analysis and project analysis.</div>
515
- </div>
516
- <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--purple)' }}>
517
- <div style={{ fontSize: 11, fontWeight: 700, color: 'var(--purple)', textTransform: 'uppercase', letterSpacing: 1 }}>Evolution artifacts</div>
518
- <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>Preserve proposal-level evidence so the dashboard can show what changed, why, and with which provenance.</div>
519
- </div>
520
- <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--yellow)' }}>
521
- <div style={{ fontSize: 11, fontWeight: 700, color: 'var(--yellow)', textTransform: 'uppercase', letterSpacing: 1 }}>Pressure signals</div>
522
- <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>Turn failures and project gaps into explicit adaptation demand, routing attention into Network, Projects, and Research.</div>
523
- </div>
533
+ {[
534
+ ['Ontology', 'var(--blue)', 'Stable semantic kernel for skills, projects, tasks, capabilities, artifacts, and mutations.'],
535
+ ['Activation traces', 'var(--green)', 'Observation memory showing which skills and gaps were active when capture or project analysis occurred.'],
536
+ ['Pressure signals', 'var(--yellow)', 'Explicit adaptation demand derived from failures and project gaps.'],
537
+ ['Interventions', 'var(--purple)', 'The response ledger across research, specialize, evolve, generalize, and manual-review lanes.'],
538
+ ['Governance', 'var(--blue)', 'Derived or operator-selected routing bias that explains why the system is steering one way rather than another.'],
539
+ ['Topology memory', 'var(--green)', 'Review candidates, decisions, apply plans, snapshots, artifacts, and rollbackable structural state.'],
540
+ ].map(([label, color, desc]) => (
541
+ <div key={String(label)} className="guide-dimension-card" style={{ borderLeftColor: String(color) }}>
542
+ <div style={{ fontSize: 11, fontWeight: 700, color: String(color), textTransform: 'uppercase', letterSpacing: 1 }}>{label}</div>
543
+ <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>{desc}</div>
544
+ </div>
545
+ ))}
524
546
  </div>
525
547
 
526
- <h3 className="guide-h3">Three-Layer Hierarchy</h3>
548
+ <h3 className="guide-h3">Three-layer skill hierarchy</h3>
527
549
  <p className="guide-text">
528
- Skills are organized into three layers. The <code>generalize</code> command promotes patterns upward,
529
- while <code>specialize</code> creates project-specific variants downward.
550
+ The skill hierarchy still matters. <code>specialize</code> pushes knowledge downward into project context,
551
+ while <code>generalize</code> promotes reusable knowledge upward into shared layers. The newer pressure, transfer, and topology work does not replace this hierarchy — it makes the hierarchy more governable.
530
552
  </p>
531
553
  <HierarchyDiagram />
554
+
555
+ <Callout type="tip">
556
+ The next three sections are the real bridge from theory to product: the <strong>9-layer conceptual stack</strong>,
557
+ a concrete <strong>end-to-end adaptation loop</strong>, and a <strong>dashboard surface map</strong> showing where each part of the brain becomes visible.
558
+ </Callout>
559
+ </Section>
560
+
561
+ {/* ─── Brain Stack ─── */}
562
+ <Section id="brainstack" title="Brain Stack" subtitle="The clearest current mental model for HelixEvo is a layered stack, not a flat feature list.">
563
+ <div className="grid-2" style={{ gap: 12 }}>
564
+ {[
565
+ {
566
+ title: '1. Semantic layer',
567
+ color: 'var(--blue)',
568
+ desc: 'Ontology and invariants define what kinds of things HelixEvo can talk about and persist truthfully.',
569
+ where: 'Visible in: ontology-aware graph summaries, skill metadata, and brain foundation summaries.'
570
+ },
571
+ {
572
+ title: '2. Observation layer',
573
+ color: 'var(--green)',
574
+ desc: 'Failures, project analysis, and activation traces record what happened, where it happened, and which skills were active.',
575
+ where: 'Visible in: capture, project-setup, Projects, and Evolution.'
576
+ },
577
+ {
578
+ title: '3. Pressure layer',
579
+ color: 'var(--yellow)',
580
+ desc: 'Pressure signals convert raw observation into explicit adaptation demand at local, project, and network scope. Motifs aggregate recurring patterns above single signals.',
581
+ where: 'Visible in: Overview, Co-Evolution, Network, Projects, and Research.'
582
+ },
583
+ {
584
+ title: '4. Response layer',
585
+ color: 'var(--purple)',
586
+ desc: 'Interventions route pressure into research, specialize, evolve, generalize, or manual review. This is where the system decides how to act.',
587
+ where: 'Visible in: Co-Evolution backlog, recent interventions, and route recommendations.'
588
+ },
589
+ {
590
+ title: '5. Transfer layer',
591
+ color: 'var(--blue)',
592
+ desc: 'Transfer events record when reusable knowledge was actually promoted, not just suggested. This is how motif-level closure becomes truthful.',
593
+ where: 'Visible in: Co-Evolution promotion queue, transfer evidence, and Research / Network transfer summaries.'
594
+ },
595
+ {
596
+ title: '6. Governance layer',
597
+ color: 'var(--green)',
598
+ desc: 'Governance selects the current routing posture: derived automatically from conditions or pinned manually by the operator.',
599
+ where: 'Visible in: Co-Evolution and Topology governance chips, rationale, and profile explanation.'
600
+ },
601
+ {
602
+ title: '7. Topology review layer',
603
+ color: 'var(--yellow)',
604
+ desc: 'Graph optimization and network health produce structural review candidates. Operator decisions persist review memory instead of disappearing into an ephemeral suggestion list.',
605
+ where: 'Visible in: graph --optimize, Topology review queue, and decision ledger.'
606
+ },
607
+ {
608
+ title: '8. Topology execution layer',
609
+ color: 'var(--purple)',
610
+ desc: 'Accepted safe candidates can now become apply plans with snapshots, artifacts, apply state, and rollback. Riskier classes remain bounded.',
611
+ where: 'Visible in: Topology accepted-ready queue, prepared queue, applied history, and /api/topology-apply.'
612
+ },
613
+ {
614
+ title: '9. Surface layer',
615
+ color: 'var(--blue)',
616
+ desc: 'The dashboard is not decoration; each tab exposes a different slice of the brain so operators can inspect, steer, and verify it.',
617
+ where: 'Visible in: Overview, Co-Evolution, Network, Topology, Projects, Research, Evolution, Commands, and Guide.'
618
+ },
619
+ ].map((layer) => (
620
+ <div key={layer.title} className="guide-dimension-card" style={{ borderLeftColor: layer.color }}>
621
+ <div style={{ fontSize: 12, fontWeight: 700, color: layer.color, letterSpacing: 0.2 }}>{layer.title}</div>
622
+ <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 6, lineHeight: 1.6 }}>{layer.desc}</div>
623
+ <div style={{ fontSize: 11, color: 'var(--text-muted)', marginTop: 8, lineHeight: 1.55 }}>{layer.where}</div>
624
+ </div>
625
+ ))}
626
+ </div>
627
+ </Section>
628
+
629
+ {/* ─── Adaptation Loop ─── */}
630
+ <Section id="loop" title="End-to-End Adaptation Loop" subtitle="One realistic signal can now travel through observation, pressure, response, review, and safe structural execution.">
631
+ <p className="guide-text">
632
+ This example is the easiest way to connect the theory to the product. It shows how a real capability gap can move through the current HelixEvo brain.
633
+ </p>
634
+ <Code title="Worked example">{`Project setup spots repeated UI architecture gaps
635
+ → activation trace records the active skills + missing capability
636
+ → pressure signal opens for the project and contributes to a recurring motif
637
+ → governance biases the route toward research / specialize / generalize
638
+ → an intervention is recorded in the response ledger
639
+ → transfer events accumulate if reusable knowledge is promoted
640
+ → graph --optimize surfaces a topology review candidate if the structure itself needs cleanup
641
+ → operator accepts the candidate in /topology
642
+ → helixevo topology --prepare creates an apply plan + snapshot + artifact
643
+ → helixevo topology --apply changes the effective graph through the override layer
644
+ → helixevo topology --rollback restores the before-apply snapshot if needed`}</Code>
645
+
646
+ <div className="guide-pipeline" style={{ marginTop: 18 }}>
647
+ <PipelineStep icon="1" title="Observe" color="var(--blue)" desc="Project analysis or failure capture persists activation-aware observation." />
648
+ <div className="guide-pipeline-connector" />
649
+ <PipelineStep icon="2" title="Pressurize" color="var(--yellow)" desc="Observation becomes explicit demand through pressure signals and recurring motifs." />
650
+ <div className="guide-pipeline-connector" />
651
+ <PipelineStep icon="3" title="Route" color="var(--purple)" desc="Governance biases the next lane: research, specialize, evolve, generalize, or manual review." />
652
+ <div className="guide-pipeline-connector" />
653
+ <PipelineStep icon="4" title="Restructure" color="var(--green)" desc="Topology review and execution handle cases where the structure itself, not just a skill rule, must change." />
654
+ <div className="guide-pipeline-connector" />
655
+ <PipelineStep icon="5" title="Prove" color="var(--blue)" desc="Metrics, transfer evidence, artifacts, and rollback discipline show whether the system improved truthfully." />
656
+ </div>
657
+
658
+ <Callout type="info">
659
+ A key current semantic rule is that <strong>motif-level transfer can address recurring network pressure without falsely claiming that every local signal is closed</strong>.
660
+ That rule is one of the biggest maturity improvements in the recent theory and implementation arc.
661
+ </Callout>
662
+ </Section>
663
+
664
+ {/* ─── Surface Map ─── */}
665
+ <Section id="surfaces" title="Dashboard Surface Map" subtitle="Each tab is a different control or observability surface for the same brain.">
666
+ <div className="grid-2" style={{ gap: 12 }}>
667
+ {[
668
+ ['Overview', 'var(--blue)', 'Top-level cockpit for frontier state, brain foundation, pressure totals, topology review counts, and prepared/applied structural state.'],
669
+ ['Co-Evolution', 'var(--purple)', 'The response cockpit. Use it to inspect routed pressure, governance mode, promotion queue, interventions, and transfer evidence.'],
670
+ ['Skill Network', 'var(--green)', 'Graph-level understanding: relationships, co-evolution signals, inspector context, and structural handoff links.'],
671
+ ['Topology', 'var(--yellow)', 'Governed plasticity surface for review decisions, accepted-ready queue, prepared plans, apply, rollback, and execution history.'],
672
+ ['Projects', 'var(--blue)', 'Project intake and project-aware pressure surface. Best for capability gaps, activation traces, and promotion feeders.'],
673
+ ['Research', 'var(--purple)', 'Discovery-oriented view grounded in current pressure and routed recommendations rather than disconnected idea generation.'],
674
+ ['Evolution', 'var(--green)', 'Proposal-centric evidence view: judge scores, artifact provenance, and iteration history.'],
675
+ ['Commands + Guide', 'var(--yellow)', 'Reference surfaces: one for command execution semantics, one for operator mental model and architecture.'],
676
+ ].map(([title, color, desc]) => (
677
+ <div key={String(title)} className="guide-dimension-card" style={{ borderLeftColor: String(color) }}>
678
+ <div style={{ fontSize: 12, fontWeight: 700, color: String(color), letterSpacing: 0.2 }}>{title}</div>
679
+ <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 6, lineHeight: 1.6 }}>{desc}</div>
680
+ </div>
681
+ ))}
682
+ </div>
683
+ <Callout type="tip">
684
+ If you are debugging current state, the best sequence is usually: <strong>Overview → Co-Evolution → Topology → Skill Network → Projects / Research</strong>.
685
+ That path mirrors the stack from summary → routed demand → structural review/execution → graph context → project or discovery detail.
686
+ </Callout>
532
687
  </Section>
533
688
 
534
689
  {/* ─── Always-On Learning ─── */}
@@ -902,7 +1057,7 @@ generation: 3
902
1057
  </Section>
903
1058
 
904
1059
  {/* ─── Data & Storage ─── */}
905
- <Section id="data" title="Data & Storage" subtitle="Where everything lives and how it's structured.">
1060
+ <Section id="data" title="Data & Storage" subtitle="The runtime memory is now easier to understand as grouped cognitive layers, not just a flat folder tree.">
906
1061
  <Code title="~/.helix/ directory structure">{`~/.helix/
907
1062
  ├── config.json # Configuration
908
1063
  ├── failures.jsonl # Captured failure records (append-only)
@@ -921,7 +1076,7 @@ generation: 3
921
1076
  ├── evolution-artifacts.jsonl # Proposal/iteration evidence artifacts
922
1077
  ├── frontier.json # Pareto frontier (top-K programs)
923
1078
  ├── evolution-history.json # All evolution iterations + proposals
924
- ├── skill-tests.jsonl # Regression test cases (append-only)
1079
+ ├── skill-tests.jsonl # Regression test cases (append-only)
925
1080
  ├── skill-graph.json # Cached network (nodes + edges + ontology version)
926
1081
  ├── canary-registry.json # Active canary deployments
927
1082
  ├── knowledge-buffer.json # Research discoveries + drafts
@@ -931,35 +1086,100 @@ generation: 3
931
1086
  ├── backups/ # Pre-canary skill backups
932
1087
  └── reports/ # Generated evolution reports`}</Code>
933
1088
 
934
- <h3 className="guide-h3">Key Data Formats</h3>
935
- <div className="grid-2" style={{ gap: 12 }}>
1089
+ <h3 className="guide-h3">Memory groups</h3>
1090
+ <div className="grid-2" style={{ gap: 12, marginBottom: 20 }}>
1091
+ {[
1092
+ ['Observation memory', 'var(--blue)', 'failures.jsonl + activation-traces.jsonl capture what happened, where it happened, and which skills were active.'],
1093
+ ['Pressure & response memory', 'var(--yellow)', 'pressure-signals.jsonl + pressure-interventions.jsonl + transfer-events.jsonl describe demand, routing, and reusable promotion evidence.'],
1094
+ ['Governance & review memory', 'var(--purple)', 'governance-state.json + topology-review-candidates.json + topology-review-decisions.jsonl preserve why structural decisions are being made.'],
1095
+ ['Topology execution memory', 'var(--green)', 'topology-overrides.json + topology-snapshots.json + topology-apply-plans.json + topology-executions.jsonl + topology-artifacts.jsonl preserve reviewed structural execution and rollback.'],
1096
+ ['Evaluation & frontier memory', 'var(--blue)', 'evolution-history.json + evolution-artifacts.jsonl + skill-tests.jsonl + canary-registry.json + frontier.json preserve proof, guardrails, and best configurations.'],
1097
+ ['Discovery memory', 'var(--purple)', 'knowledge-buffer.json keeps research discoveries and drafts so failed experiments can be iterated instead of lost.'],
1098
+ ].map(([title, color, desc]) => (
1099
+ <div key={String(title)} className="guide-dimension-card" style={{ borderLeftColor: String(color) }}>
1100
+ <div style={{ fontSize: 12, fontWeight: 700, color: String(color), letterSpacing: 0.2 }}>{title}</div>
1101
+ <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 6, lineHeight: 1.6 }}>{desc}</div>
1102
+ </div>
1103
+ ))}
1104
+ </div>
1105
+
1106
+ <h3 className="guide-h3">Representative data formats</h3>
1107
+ <div className="grid-3" style={{ gap: 12 }}>
936
1108
  <div className="guide-data-card">
937
1109
  <div className="guide-data-title">Failure Record</div>
938
1110
  <Code>{`{
939
1111
  "id": "f_abc123",
940
1112
  "sessionId": "session-001",
941
1113
  "project": "myapp",
942
- "userRequest": "Add a login page",
943
- "agentAction": "Created login.tsx",
944
- "correction": "Use the existing auth hook",
945
1114
  "correctionType": "verbal",
946
1115
  "skillsActive": ["react-patterns"],
947
1116
  "resolved": false
948
1117
  }`}</Code>
949
1118
  </div>
950
1119
  <div className="guide-data-card">
951
- <div className="guide-data-title">Skill Test</div>
1120
+ <div className="guide-data-title">Pressure Intervention</div>
952
1121
  <Code>{`{
953
- "id": "gc_react_42",
954
- "skill": "react-patterns",
955
- "input": "Create a form component",
956
- "context": "React 19, TypeScript",
957
- "expectedBehavior": "Uses useActionState for form submission...",
958
- "lastResult": "pass",
959
- "consecutivePasses": 5
1122
+ "id": "pi_123",
1123
+ "interventionType": "generalize",
1124
+ "status": "completed",
1125
+ "impact": "structural",
1126
+ "projectId": "website-a",
1127
+ "reasonSummary": "Recurring motif now promotes shared UI guidance"
960
1128
  }`}</Code>
961
1129
  </div>
1130
+ <div className="guide-data-card">
1131
+ <div className="guide-data-title">Topology Apply Plan</div>
1132
+ <Code>{`{
1133
+ "id": "topology_plan_abc",
1134
+ "candidateId": "topology_rewire_xyz",
1135
+ "executionMode": "apply",
1136
+ "safeToApply": true,
1137
+ "beforeSnapshotId": "topology_snapshot_001",
1138
+ "plannedGraphChanges": [
1139
+ "Suppress 1 conflict edge covered by the reviewed rewire decision"
1140
+ ]
1141
+ }`}</Code>
1142
+ </div>
1143
+ </div>
1144
+ </Section>
1145
+
1146
+ {/* ─── Maturity & Safety ─── */}
1147
+ <Section id="maturity" title="Current Maturity & Safety" subtitle="What is real now, what is bounded, and what should not be overstated.">
1148
+ <div className="grid-3" style={{ gap: 12 }}>
1149
+ <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--green)' }}>
1150
+ <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--green)' }}>Operational now</div>
1151
+ <ul className="guide-list" style={{ marginTop: 8 }}>
1152
+ <li>pressure sensing and motifs</li>
1153
+ <li>governed routing with operator-visible rationale</li>
1154
+ <li>transfer evidence through generalized promotion</li>
1155
+ <li>persisted governance steering</li>
1156
+ <li>topology review queue + decisions</li>
1157
+ <li>prepare / apply / rollback for the safe reviewed subset</li>
1158
+ </ul>
1159
+ </div>
1160
+ <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--yellow)' }}>
1161
+ <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--yellow)' }}>Bounded by design</div>
1162
+ <ul className="guide-list" style={{ marginTop: 8 }}>
1163
+ <li>only accepted review candidates can enter apply planning</li>
1164
+ <li>only safe plans are directly applied</li>
1165
+ <li>snapshots and artifacts preserve execution truth</li>
1166
+ <li>rollback restores before-apply state instead of guessing</li>
1167
+ </ul>
1168
+ </div>
1169
+ <div className="guide-dimension-card" style={{ borderLeftColor: 'var(--purple)' }}>
1170
+ <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--purple)' }}>Not yet claimed</div>
1171
+ <ul className="guide-list" style={{ marginTop: 8 }}>
1172
+ <li>full-spectrum autonomous destructive self-restructuring</li>
1173
+ <li>broad automatic merge / split / prune application</li>
1174
+ <li>fully operational dynamic ontology evolution</li>
1175
+ <li>complete autonomous structural governance without operator oversight</li>
1176
+ </ul>
1177
+ </div>
962
1178
  </div>
1179
+ <Callout type="warning">
1180
+ The most truthful way to describe the current product is: <strong>governed co-evolving brain with bounded safe structural execution</strong>.
1181
+ That is stronger and more credible than implying unrestricted autonomous self-restructuring.
1182
+ </Callout>
963
1183
  </Section>
964
1184
 
965
1185
  {/* ─── Skill Management ─── */}
@@ -1019,15 +1239,58 @@ generation: 3
1019
1239
  </Callout>
1020
1240
  </Section>
1021
1241
 
1242
+ {/* ─── Glossary ─── */}
1243
+ <Section id="glossary" title="Glossary" subtitle="Short definitions for the new core terms in the current HelixEvo brain model.">
1244
+ <div className="grid-2" style={{ gap: 12 }}>
1245
+ {[
1246
+ ['Ontology', 'The stable semantic kernel that defines what kinds of things HelixEvo can talk about and persist truthfully.'],
1247
+ ['Activation trace', 'Observation memory describing which skills, gaps, and contexts were active when analysis happened.'],
1248
+ ['Pressure signal', 'An explicit representation of adaptation demand derived from failures or project gaps.'],
1249
+ ['Motif', 'A recurring family of pressure above the single-signal level. Motifs matter because network-level transfer should respond to patterns, not isolated anecdotes.'],
1250
+ ['Intervention', 'A recorded response action such as research, specialize, evolve, generalize, or manual review.'],
1251
+ ['Transfer event', 'Evidence that reusable knowledge was actually promoted or transferred across projects or layers.'],
1252
+ ['Governance mode', 'The current routing posture — derived automatically or pinned manually by the operator.'],
1253
+ ['Topology review', 'Persisted structural candidate memory plus explicit operator decision state.'],
1254
+ ['Apply plan', 'A prepared reviewed topology change with a before snapshot, planned graph changes, and execution mode.'],
1255
+ ['Snapshot', 'Saved structural state used to prove what changed and to support rollback.'],
1256
+ ['Topology artifact', 'Evidence created around reviewed structural execution, similar to how evolution artifacts preserve proposal-level change evidence.'],
1257
+ ['Rollback', 'Restoring the before-apply structural state for a previously applied safe topology plan.'],
1258
+ ].map(([term, desc]) => (
1259
+ <div key={String(term)} className="guide-dimension-card" style={{ borderLeftColor: 'var(--blue)' }}>
1260
+ <div style={{ fontSize: 12, fontWeight: 700, color: 'var(--blue)' }}>{term}</div>
1261
+ <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 6, lineHeight: 1.6 }}>{desc}</div>
1262
+ </div>
1263
+ ))}
1264
+ </div>
1265
+ </Section>
1266
+
1022
1267
  {/* ─── FAQ ─── */}
1023
- <Section id="faq" title="FAQ" subtitle="Frequently asked questions.">
1024
- <FAQItem q="How many failures do I need before evolution works?">
1025
- By default, 5 unresolved failures are required (<code>minFailuresForEvolution</code>).
1026
- This ensures enough signal for meaningful pattern detection.
1268
+ <Section id="faq" title="FAQ" subtitle="Frequently asked questions for the current theory and operator model.">
1269
+ <FAQItem q="What is governance steering?">
1270
+ Governance steering lets the operator pin the current response posture instead of relying only on derived routing.
1271
+ In practice, that means HelixEvo can stay <code>balanced</code>, become <code>transfer-focused</code>, shift toward <code>project-critical</code>, or stay more <code>conservative</code> depending on what the operator wants and what the system is seeing.
1272
+ </FAQItem>
1273
+ <FAQItem q="What is the difference between topology review and topology execution?">
1274
+ <strong>Topology review</strong> is the memory and decision layer: candidates, reasons, risk, and accept/defer/reject state.
1275
+ <strong>Topology execution</strong> starts only after acceptance and adds prepare/apply/rollback, snapshots, artifacts, and execution records.
1027
1276
  </FAQItem>
1028
- <FAQItem q="What LLM model does HelixEvo use?">
1029
- HelixEvo uses <code>claude --print</code> with configurable models (default: <code>sonnet</code>).
1030
- No API key is needed it requires a Claude Max plan subscription. Judges and proposals can use different models.
1277
+ <FAQItem q="What does prepare vs apply vs rollback mean?">
1278
+ <code>prepare</code> creates an explicit apply plan with a before snapshot and planned graph changes.
1279
+ <code>apply</code> executes the safe reviewed subset against the effective graph state.
1280
+ <code>rollback</code> restores the before-apply snapshot if the applied structural change should be reversed.
1281
+ </FAQItem>
1282
+ <FAQItem q="What is a motif and why does it matter?">
1283
+ A motif is a recurring pattern of pressure across multiple signals or projects. It matters because network-level actions like <code>generalize</code> should respond to real recurring pressure families, not pretend that one local signal justifies system-wide promotion.
1284
+ </FAQItem>
1285
+ <FAQItem q="What is a transfer event?">
1286
+ A transfer event is evidence that reusable knowledge was actually promoted or reused across layers or projects. This is how HelixEvo distinguishes a recommendation from a realized knowledge transfer.
1287
+ </FAQItem>
1288
+ <FAQItem q="How do I prove HelixEvo's brain is working?">
1289
+ Use multiple proof surfaces together: <code>metrics</code> for correction reduction, Co-Evolution for routed interventions and transfer evidence, Topology for reviewed structural execution state, and the verification reports under <code>reports/verification/</code> for milestone-level backtesting.
1290
+ </FAQItem>
1291
+ <FAQItem q="How many failures do I need before evolution works?">
1292
+ By default, 5 unresolved failures are required (<code>minFailuresForEvolution</code>) for the standard evolution trigger.
1293
+ Always-on mode can also trigger earlier through burst or cross-project conditions.
1031
1294
  </FAQItem>
1032
1295
  <FAQItem q="What if Claude says my OAuth token expired?">
1033
1296
  First run <code>claude auth login</code> to refresh local Claude credentials. Avoid exporting a hardcoded
@@ -1035,42 +1298,18 @@ generation: 3
1035
1298
  token override when local Claude auth is healthy.
1036
1299
  </FAQItem>
1037
1300
  <FAQItem q="Can I use HelixEvo with other AI agents?">
1038
- Yes. HelixEvo manages standard SKILL.md files with YAML frontmatter. Any agent that reads SKILL.md
1039
- files can benefit from HelixEvo&apos;s evolution pipeline.
1040
- </FAQItem>
1041
- <FAQItem q="What happens during canary rollback?">
1042
- If the failure rate for a canary skill exceeds <code>autoRollbackThreshold</code> (default: 1.5x),
1043
- the skill is automatically reverted from the backup. The failed evolution is recorded in history.
1044
- </FAQItem>
1045
- <FAQItem q="How does cross-skill regression work?">
1046
- When Skill A evolves, HelixEvo checks the skill graph for co-evolved, dependent, and enhancing
1047
- partners. It tests their skill tests against Skill A&apos;s changes. If partner pass rate drops below 95%,
1048
- the proposal is rejected.
1049
- </FAQItem>
1050
- <FAQItem q="How does the knowledge buffer work?">
1051
- All research discoveries are saved, even if the resulting hypothesis fails. Failed experiments above
1052
- a minimum score are saved as drafts. When the same hypothesis appears again, HelixEvo iterates on
1053
- the draft rather than starting from scratch.
1054
- </FAQItem>
1055
- <FAQItem q="How does helixevo watch detect corrections?">
1056
- Two-stage detection: (1) Fast regex pre-filter checks for correction signals in English and Chinese
1057
- (e.g., &quot;wrong&quot;, &quot;not like that&quot;, &quot;不对&quot;, &quot;改成&quot;). (2) If a signal is detected, an LLM analyzes
1058
- the recent conversation window to extract structured failure records with confidence scores (&gt;0.7 required).
1301
+ Yes. HelixEvo manages standard SKILL.md files with YAML frontmatter. Any agent that reads SKILL.md files can benefit from HelixEvo&apos;s evolution, pressure, transfer, and topology workflows.
1059
1302
  </FAQItem>
1060
- <FAQItem q="How do I prove HelixEvo is working?">
1061
- Run <code>helixevo metrics</code>. It tracks correction rates per skill over 7-day rolling windows
1062
- and compares before/after rates for each evolution. The verdict shows how many evolutions actually
1063
- reduced corrections. This is closed-loop measurement — not LLM scores judging LLM output.
1303
+ <FAQItem q="What happens during canary rollback versus topology rollback?">
1304
+ Canary rollback restores a skill change that degraded live performance during monitored deployment.
1305
+ Topology rollback restores the before-apply structural state for a reviewed topology plan. They protect different layers: one protects skill behavior, the other protects graph structure.
1064
1306
  </FAQItem>
1065
1307
  <FAQItem q="What is network health?">
1066
- Network health scores the entire skill network as a unit across 4 dimensions: cohesion (connectivity),
1067
- coverage (failure mapping), balance (hierarchy structure), and transfer rate (cross-project reuse).
1068
- It runs after every evolution and is shown in the summary. Run <code>helixevo health</code> to see the full report.
1308
+ Network health scores the entire skill network as a unit across cohesion, coverage, balance, and transfer rate.
1309
+ It remains one of the best structural diagnostics because it tells you whether the network is learning in a connected and reusable way rather than just accumulating local fixes.
1069
1310
  </FAQItem>
1070
1311
  <FAQItem q="When does auto-generalization trigger?">
1071
- Automatically during <code>helixevo evolve</code> when the network health assessment detects cross-project
1072
- patterns — the same failure type appearing in 2+ projects. It creates a domain-level abstract skill,
1073
- validates with regression tests, and sets up parent/child inheritance with the source project skills.
1312
+ It triggers when recurring multi-project patterns become strong enough to justify promotion. The newer theory frame is that <code>generalize</code> responds to recurring motifs and should create transfer evidence rather than merely appearing active.
1074
1313
  </FAQItem>
1075
1314
  </Section>
1076
1315
 
@@ -1079,7 +1318,7 @@ generation: 3
1079
1318
  <div className="guide-footer-content">
1080
1319
  <div style={{ fontSize: 13, fontWeight: 600 }}>HelixEvo v{VERSION}</div>
1081
1320
  <div style={{ fontSize: 12, color: 'var(--text-dim)', marginTop: 4 }}>
1082
- Self-evolving skill ecosystem for AI agents · MIT License
1321
+ Co-evolving brain for AI agents · MIT License
1083
1322
  </div>
1084
1323
  </div>
1085
1324
  </div>
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "helixevo",
3
- "version": "0.4.0",
3
+ "version": "0.4.1",
4
4
  "description": "Co-evolving skill and project brain for AI agents, with ontology-aware learning, governed response, rollbackable topology control, and a premium dashboard.",
5
5
  "type": "module",
6
6
  "bin": {