murmur8 4.3.4 → 4.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -211,6 +211,12 @@ The pipeline includes validation, smart routing, feedback loops, and history tra
211
211
 
212
212
 
213
213
  ┌─────────────────────────────────────────────────────────────────┐
214
+ │ Feedback micro-Task: Cass reviews Alex's spec │
215
+ │ • Quality gate: proceed / pause / revise │
216
+ └─────────────────────────────────────────────────────────────────┘
217
+
218
+
219
+ ┌─────────────────────────────────────────────────────────────────┐
214
220
  │ Smart Routing (v2.7) │
215
221
  │ • Classify feature as technical or user-facing │
216
222
  │ • Technical → skip Cass (saves ~25-40k tokens) │
@@ -229,13 +235,25 @@ The pipeline includes validation, smart routing, feedback loops, and history tra
229
235
 
230
236
 
231
237
  ┌─────────────────────────────────────────────────────────────────┐
232
- │ Nigel (Tests) + Handoff Summary
233
- │ │ │
234
- ▼ │
235
- │ Codey rates Nigel → Quality Gate │
236
- │ │ │
237
-
238
- Codey (Plan Implement)
238
+ Feedback micro-Task: Nigel reviews Cass's stories
239
+ └─────────────────────────────────────────────────────────────────┘
240
+
241
+
242
+ ┌─────────────────────────────────────────────────────────────────┐
243
+ Nigel (Test Spec + Handoff) → Nigel (Executable Tests)
244
+ Split into two atomic calls for token efficiency
245
+ └─────────────────────────────────────────────────────────────────┘
246
+
247
+
248
+ ┌─────────────────────────────────────────────────────────────────┐
249
+ │ Feedback micro-Task: Codey reviews Nigel's tests │
250
+ └─────────────────────────────────────────────────────────────────┘
251
+
252
+
253
+ ┌─────────────────────────────────────────────────────────────────┐
254
+ │ Codey (Plan) → Codey (Implement per-step) │
255
+ │ • Plan uses strict parseable format │
256
+ │ • Orchestrator spawns one Task per implementation step │
239
257
  └─────────────────────────────────────────────────────────────────┘
240
258
 
241
259
  On Failure│
@@ -309,6 +327,24 @@ analyzes: recommends: calibrates:
309
327
  • Trends • And feedback issues
310
328
  ```
311
329
 
330
+ ### Accessing Module Data
331
+
332
+ Data is collected from both invocation methods and accessible via CLI commands:
333
+
334
+ | Data | `/implement-feature` (skill) | `npx murmur8 murm` (CLI) | How to access |
335
+ |------|------------------------------|--------------------------|---------------|
336
+ | **Per-stage timing** (alex, cass, nigel, codey) | Recorded by orchestrating agent | Merged from worktree on successful merge | `npx murmur8 history` |
337
+ | **Feedback ratings** (agent-to-agent) | Recorded by feedback micro-Tasks | Merged from worktree on successful merge | `npx murmur8 history`, `npx murmur8 insights --feedback` |
338
+ | **Token cost per stage** | Recorded by orchestrating agent | Merged from worktree on successful merge | `npx murmur8 history --cost` |
339
+ | **Batch summary** (total duration, feature outcomes) | N/A (single feature) | Recorded at batch completion | `npx murmur8 history` |
340
+ | **Success/failure status** | Recorded per run | Recorded per feature + batch | `npx murmur8 history --stats` |
341
+ | **Retry attempts & strategies** | Recorded on failure | Merged from worktree on successful merge | `npx murmur8 insights --failures` |
342
+ | **Bottleneck analysis** | Derived from history | Derived from history | `npx murmur8 insights --bottlenecks` |
343
+ | **Smart retry recommendations** | Used live during pipeline | Used live during pipeline | Automatic on failure |
344
+ | **Diff preview** | Shown before commit | Shown per worktree before merge | Interactive during pipeline |
345
+
346
+ **How worktree history merging works:** When `npx murmur8 murm` runs, each feature pipeline executes `/implement-feature` inside an isolated git worktree. The skill records per-stage data to `.claude/pipeline-history.json` within that worktree. After a successful merge, murmur8 reads this file and appends its entries to the main project's history before cleaning up the worktree. Failed/conflicted worktrees preserve their history for debugging.
347
+
312
348
  ## Directory Structure
313
349
 
314
350
  ```
@@ -320,13 +356,15 @@ your-project/
320
356
  │ │ ├── AGENT_TESTER_NIGEL.md
321
357
  │ │ ├── AGENT_DEVELOPER_CODEY.md
322
358
  │ │ └── GUARDRAILS.md # Shared guardrails (v2.7)
323
- │ ├── prompts/ # Slim runtime prompts (v2.7)
359
+ │ ├── prompts/ # Self-contained runtime prompts (v4.4)
324
360
  │ │ ├── TEMPLATE.md
325
361
  │ │ ├── alex-runtime.md
326
362
  │ │ ├── cass-runtime.md
327
363
  │ │ ├── nigel-runtime.md
328
364
  │ │ ├── codey-plan-runtime.md
329
- │ │ └── codey-implement-runtime.md
365
+ │ │ ├── codey-implement-runtime.md
366
+ │ │ ├── skill-murm-mode.md # Murmuration steps (loaded on demand)
367
+ │ │ └── skill-error-recovery.md # Error handling (loaded on failure)
330
368
  │ ├── templates/ # Spec and output templates
331
369
  │ │ ├── SYSTEM_SPEC.md
332
370
  │ │ ├── FEATURE_SPEC.md
@@ -357,15 +395,15 @@ your-project/
357
395
 
358
396
  ## Agent Guardrails
359
397
 
360
- All agents follow strict guardrails to ensure quality:
398
+ All agents follow strict guardrails enforced via inlined rules in each self-contained runtime prompt. The authoritative source is `.blueprint/agents/GUARDRAILS.md`, with critical rules inlined directly into agent prompts so sub-agents never need to load external files at runtime.
361
399
 
362
400
  | Guardrail | Description |
363
401
  |-----------|-------------|
364
402
  | **Source Restrictions** | Only use provided inputs (specs, code, business_context) |
365
403
  | **Prohibited Sources** | No social media, forums, external APIs, training data for domain facts |
366
- | **Citation Requirements** | All claims must cite source files |
404
+ | **Assumption Labeling** | All assumptions must be explicitly labeled |
367
405
  | **Confidentiality** | Business context treated as confidential |
368
- | **Escalation Protocol** | Clear rules for when to ask vs assume |
406
+ | **Escalation Protocol** | Clear rules for when to ask vs assume — flag ambiguity, don't guess |
369
407
 
370
408
  ## Self-Improvement Loop
371
409
 
@@ -423,21 +461,21 @@ npx murmur8 init
423
461
 
424
462
  Both CLIs execute the same pipeline: Alex → Cass → Nigel → Codey. The skill uses each CLI's native agent/task mechanism.
425
463
 
426
- ## Token Efficiency (v2.7)
464
+ ## Token Efficiency (v4.4)
427
465
 
428
- Version 2.7 introduces several optimizations to reduce token usage:
466
+ The pipeline is optimised to operate within strict token limits (4096 output, 1024 thinking per sub-agent):
429
467
 
430
- | Optimization | Savings | Description |
431
- |--------------|---------|-------------|
432
- | **Shared Guardrails** | ~1,200 tokens | Single GUARDRAILS.md instead of duplicated in each agent spec |
433
- | **Slim Runtime Prompts** | ~5,200 tokens | 30-50 line prompts instead of 200-400 line full specs |
434
- | **Upstream Summaries** | ~2,000-4,000 tokens | Handoff summaries between agents instead of full artifacts |
435
- | **Template Extraction** | ~800 tokens | Templates moved to separate files, loaded on demand |
436
- | **Lazy Business Context** | Variable | Only loaded when feature spec references it |
437
- | **Compressed Feedback** | ~400 tokens | 3-line feedback prompts instead of 7-line |
468
+ | Optimization | Impact | Description |
469
+ |--------------|--------|-------------|
470
+ | **Self-contained prompts** | ~660 lines/call eliminated | Agents never read full specs, guardrails, manifesto, or rituals at runtime |
471
+ | **Trusted handoff chain** | ~200-400 tokens/call saved | Each agent reads only the upstream handoff + its immediate inputs |
472
+ | **Split Nigel** | Prevents truncation | Test spec and executable tests are separate atomic calls |
473
+ | **Orchestrator-driven Codey** | Prevents truncation | One Task per implementation step instead of monolithic implement call |
474
+ | **Feedback micro-Tasks** | ~50 output tokens each | Separate review calls don't consume the main agent's output budget |
475
+ | **Hybrid SKILL.md** | 48% smaller | Murmuration and error recovery loaded on demand, not always |
476
+ | **Strict plan format** | Zero parsing overhead | Machine-parseable one-liner per step, orchestrator splits on regex |
438
477
  | **Smart Story Routing** | ~25,000-40,000 tokens | Skip Cass for technical features |
439
-
440
- **Total estimated savings: 10,000+ tokens per pipeline run** (more for technical features)
478
+ | **Lazy Business Context** | Variable | Only loaded when feature spec references it |
441
479
 
442
480
  ## Cost Tracking
443
481
 
@@ -517,8 +555,8 @@ murmur8 murm <slug-a> <slug-b> <slug-c>
517
555
  │ Spawn Pipelines │
518
556
  │ (max 3 concurrent by default) │
519
557
  │ │
520
- │ Each runs: Alex Nigel → Codey
521
- in its isolated worktree
558
+ │ Each runs full pipeline in isolation:
559
+ Alex [Cass] → Nigel → Codey
522
560
  └───────────────────────────────────────┘
523
561
 
524
562