murmur8 4.3.4 → 4.5.0
- package/.blueprint/agents/AGENT_BA_CASS.md +4 -12
- package/.blueprint/agents/AGENT_DEVELOPER_CODEY.md +1 -9
- package/.blueprint/agents/AGENT_SPECIFICATION_ALEX.md +0 -9
- package/.blueprint/agents/AGENT_TESTER_NIGEL.md +0 -9
- package/.blueprint/agents/GUARDRAILS.md +1 -1
- package/.blueprint/prompts/TEMPLATE.md +6 -4
- package/.blueprint/prompts/alex-runtime.md +3 -5
- package/.blueprint/prompts/cass-runtime.md +5 -7
- package/.blueprint/prompts/codey-implement-runtime.md +2 -6
- package/.blueprint/prompts/codey-plan-runtime.md +21 -21
- package/.blueprint/prompts/nigel-runtime.md +4 -6
- package/.blueprint/prompts/skill-error-recovery.md +86 -0
- package/.blueprint/prompts/skill-murm-mode.md +143 -0
- package/README.md +64 -26
- package/SKILL.md +214 -800
- package/package.json +1 -1
- package/src/murm.js +50 -0
package/README.md
CHANGED
@@ -211,6 +211,12 @@ The pipeline includes validation, smart routing, feedback loops, and history tra
 │
 ▼
 ┌─────────────────────────────────────────────────────────────────┐
+│ Feedback micro-Task: Cass reviews Alex's spec │
+│ • Quality gate: proceed / pause / revise │
+└─────────────────────────────────────────────────────────────────┘
+│
+▼
+┌─────────────────────────────────────────────────────────────────┐
 │ Smart Routing (v2.7) │
 │ • Classify feature as technical or user-facing │
 │ • Technical → skip Cass (saves ~25-40k tokens) │
@@ -229,13 +235,25 @@ The pipeline includes validation, smart routing, feedback loops, and history tra
 │
 ▼
 ┌─────────────────────────────────────────────────────────────────┐
-│ Nigel
-
-│
-
-
-│
-│
+│ Feedback micro-Task: Nigel reviews Cass's stories │
+└─────────────────────────────────────────────────────────────────┘
+│
+▼
+┌─────────────────────────────────────────────────────────────────┐
+│ Nigel (Test Spec + Handoff) → Nigel (Executable Tests) │
+│ • Split into two atomic calls for token efficiency │
+└─────────────────────────────────────────────────────────────────┘
+│
+▼
+┌─────────────────────────────────────────────────────────────────┐
+│ Feedback micro-Task: Codey reviews Nigel's tests │
+└─────────────────────────────────────────────────────────────────┘
+│
+▼
+┌─────────────────────────────────────────────────────────────────┐
+│ Codey (Plan) → Codey (Implement per-step) │
+│ • Plan uses strict parseable format │
+│ • Orchestrator spawns one Task per implementation step │
 └─────────────────────────────────────────────────────────────────┘
 │
 On Failure│
@@ -309,6 +327,24 @@ analyzes: recommends: calibrates:
 • Trends • And feedback issues
 ```

+### Accessing Module Data
+
+Data is collected from both invocation methods and accessible via CLI commands:
+
+| Data | `/implement-feature` (skill) | `npx murmur8 murm` (CLI) | How to access |
+|------|------------------------------|--------------------------|---------------|
+| **Per-stage timing** (alex, cass, nigel, codey) | Recorded by orchestrating agent | Merged from worktree on successful merge | `npx murmur8 history` |
+| **Feedback ratings** (agent-to-agent) | Recorded by feedback micro-Tasks | Merged from worktree on successful merge | `npx murmur8 history`, `npx murmur8 insights --feedback` |
+| **Token cost per stage** | Recorded by orchestrating agent | Merged from worktree on successful merge | `npx murmur8 history --cost` |
+| **Batch summary** (total duration, feature outcomes) | N/A (single feature) | Recorded at batch completion | `npx murmur8 history` |
+| **Success/failure status** | Recorded per run | Recorded per feature + batch | `npx murmur8 history --stats` |
+| **Retry attempts & strategies** | Recorded on failure | Merged from worktree on successful merge | `npx murmur8 insights --failures` |
+| **Bottleneck analysis** | Derived from history | Derived from history | `npx murmur8 insights --bottlenecks` |
+| **Smart retry recommendations** | Used live during pipeline | Used live during pipeline | Automatic on failure |
+| **Diff preview** | Shown before commit | Shown per worktree before merge | Interactive during pipeline |
+
+**How worktree history merging works:** When `npx murmur8 murm` runs, each feature pipeline executes `/implement-feature` inside an isolated git worktree. The skill records per-stage data to `.claude/pipeline-history.json` within that worktree. After a successful merge, murmur8 reads this file and appends its entries to the main project's history before cleaning up the worktree. Failed/conflicted worktrees preserve their history for debugging.
+
 ## Directory Structure

 ```
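Reviewer note on the hunk above: the worktree history merge it describes (read `.claude/pipeline-history.json` from the worktree, append its entries to the main project's history) might look roughly like the sketch below. This is not the code added to `package/src/murm.js`, which this diff does not show. The helper names `readHistory` and `mergeWorktreeHistory` are hypothetical, and the sketch assumes the history file holds a JSON array of entries.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Relative path quoted in the README text above.
const HISTORY_FILE = path.join(".claude", "pipeline-history.json");

// Assumption: the history file is a JSON array of run entries.
// A missing file or a non-array payload is treated as an empty history.
function readHistory(rootDir) {
  const file = path.join(rootDir, HISTORY_FILE);
  if (!fs.existsSync(file)) return [];
  const parsed = JSON.parse(fs.readFileSync(file, "utf8"));
  return Array.isArray(parsed) ? parsed : [];
}

// Hypothetical helper: append a merged worktree's entries to the main
// project's history and return the new total number of entries.
function mergeWorktreeHistory(mainDir, worktreeDir) {
  const merged = readHistory(mainDir).concat(readHistory(worktreeDir));
  const target = path.join(mainDir, HISTORY_FILE);
  fs.mkdirSync(path.dirname(target), { recursive: true });
  fs.writeFileSync(target, JSON.stringify(merged, null, 2) + "\n");
  return merged.length;
}
```

In this sketch the caller would invoke `mergeWorktreeHistory` only after a successful merge and before removing the worktree, so that failed or conflicted worktrees keep their own history file, as the README text states.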
@@ -320,13 +356,15 @@ your-project/
 │ │ ├── AGENT_TESTER_NIGEL.md
 │ │ ├── AGENT_DEVELOPER_CODEY.md
 │ │ └── GUARDRAILS.md # Shared guardrails (v2.7)
-│ ├── prompts/ #
+│ ├── prompts/ # Self-contained runtime prompts (v4.4)
 │ │ ├── TEMPLATE.md
 │ │ ├── alex-runtime.md
 │ │ ├── cass-runtime.md
 │ │ ├── nigel-runtime.md
 │ │ ├── codey-plan-runtime.md
-│ │
+│ │ ├── codey-implement-runtime.md
+│ │ ├── skill-murm-mode.md # Murmuration steps (loaded on demand)
+│ │ └── skill-error-recovery.md # Error handling (loaded on failure)
 │ ├── templates/ # Spec and output templates
 │ │ ├── SYSTEM_SPEC.md
 │ │ ├── FEATURE_SPEC.md
@@ -357,15 +395,15 @@ your-project/

 ## Agent Guardrails

-All agents follow strict guardrails to
+All agents follow strict guardrails enforced via inlined rules in each self-contained runtime prompt. The authoritative source is `.blueprint/agents/GUARDRAILS.md`, with critical rules inlined directly into agent prompts so sub-agents never need to load external files at runtime.

 | Guardrail | Description |
 |-----------|-------------|
 | **Source Restrictions** | Only use provided inputs (specs, code, business_context) |
 | **Prohibited Sources** | No social media, forums, external APIs, training data for domain facts |
-| **
+| **Assumption Labeling** | All assumptions must be explicitly labeled |
 | **Confidentiality** | Business context treated as confidential |
-| **Escalation Protocol** | Clear rules for when to ask vs assume |
+| **Escalation Protocol** | Clear rules for when to ask vs assume — flag ambiguity, don't guess |

 ## Self-Improvement Loop

@@ -423,21 +461,21 @@ npx murmur8 init

 Both CLIs execute the same pipeline: Alex → Cass → Nigel → Codey. The skill uses each CLI's native agent/task mechanism.

-## Token Efficiency (
+## Token Efficiency (v4.4)

-
+The pipeline is optimised to operate within strict token limits (4096 output, 1024 thinking per sub-agent):

-| Optimization |
-
-| **
-| **
-| **
-| **
-| **
-| **
+| Optimization | Impact | Description |
+|--------------|--------|-------------|
+| **Self-contained prompts** | ~660 lines/call eliminated | Agents never read full specs, guardrails, manifesto, or rituals at runtime |
+| **Trusted handoff chain** | ~200-400 tokens/call saved | Each agent reads only the upstream handoff + its immediate inputs |
+| **Split Nigel** | Prevents truncation | Test spec and executable tests are separate atomic calls |
+| **Orchestrator-driven Codey** | Prevents truncation | One Task per implementation step instead of monolithic implement call |
+| **Feedback micro-Tasks** | ~50 output tokens each | Separate review calls don't consume the main agent's output budget |
+| **Hybrid SKILL.md** | 48% smaller | Murmuration and error recovery loaded on demand, not always |
+| **Strict plan format** | Zero parsing overhead | Machine-parseable one-liner per step, orchestrator splits on regex |
 | **Smart Story Routing** | ~25,000-40,000 tokens | Skip Cass for technical features |
-
-**Total estimated savings: 10,000+ tokens per pipeline run** (more for technical features)
+| **Lazy Business Context** | Variable | Only loaded when feature spec references it |

 ## Cost Tracking

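Reviewer note on the "Strict plan format" row in the hunk above: the README says each plan step is a machine-parseable one-liner that the orchestrator splits on a regex, but this diff does not show the format itself. The sketch below only illustrates the idea. The `STEP <n>: <file> | <action>` layout, `STEP_PATTERN`, and `parsePlan` are all hypothetical and should not be read as murmur8's actual format or code.

```javascript
// Hypothetical one-liner format, NOT taken from murmur8:
//   STEP <n>: <file> | <action>
const STEP_PATTERN = /^STEP (\d+): (\S+) \| (.+)$/;

// Turn plan text into step objects, ignoring any line that does not match.
function parsePlan(planText) {
  return planText
    .split("\n")
    .map((line) => STEP_PATTERN.exec(line.trim()))
    .filter((match) => match !== null)
    .map(([, index, file, action]) => ({ index: Number(index), file, action }));
}

// Illustrative plan text; the file names are made up for the example.
const steps = parsePlan(
  [
    "## Plan",
    "STEP 1: src/parser.js | add tokenizer",
    "STEP 2: test/parser.test.js | cover empty input",
  ].join("\n")
);
// Under the README's description, the orchestrator would spawn one Task
// per entry in `steps`.
```

Keeping each step on a single line is what would let an orchestrator dispatch steps without asking a model to re-read the plan, which is presumably the "zero parsing overhead" the table refers to.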
@@ -517,8 +555,8 @@ murmur8 murm <slug-a> <slug-b> <slug-c>
 │ Spawn Pipelines │
 │ (max 3 concurrent by default) │
 │ │
-│ Each runs
-│
+│ Each runs full pipeline in isolation: │
+│ Alex → [Cass] → Nigel → Codey │
 └───────────────────────────────────────┘
 │
 ▼