maestro-flow 0.3.34 → 0.3.35

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -17,261 +17,127 @@
17
17
 
18
18
  ---
19
19
 
20
- Maestro-Flow is a workflow orchestration framework for multi-agent development with Claude Code, Codex, Gemini, and other AI agents. It automates the most time-consuming part of AI-assisted engineering — deciding which agents to use, in what order, with what context. Describe your intent, and Maestro-Flow routes to the optimal command chain, drives parallel agent execution, and closes the loop through a real-time dashboard, self-healing issue pipeline, and evolving knowledge graph.
20
+ Maestro-Flow is a workflow orchestration framework for multi-agent development with Claude Code, Codex, Gemini, and other AI agents. Describe your intent, and Maestro-Flow routes to the optimal command chain, drives parallel agent execution, and closes the loop through adaptive decision-making, a real-time dashboard, and an evolving knowledge graph.
21
21
 
22
22
  ---
23
23
 
24
- ## What It Does
25
-
26
- You describe what you want. Maestro-Flow figures out which agents to use, in what order, with what context — and drives it to completion.
24
+ ## Install
27
25
 
28
26
  ```bash
29
- # Natural language — Maestro-Flow routes to the optimal command chain
30
- /maestro "implement OAuth2 authentication with refresh tokens"
31
-
32
- # Or step by step
33
- /maestro-init # Set up project workspace
34
- /maestro-roadmap # Create phased roadmap interactively
35
- /maestro-analyze # Multi-dimensional analysis
36
- /maestro-plan # Generate execution plan
37
- /maestro-execute # Wave-based parallel agent execution
38
- /maestro-verify # Goal-backward verification
39
- ```
40
-
41
- ### The Pipeline
42
-
43
- ```mermaid
44
- graph LR
45
- A[init] --> B[brainstorm]
46
- B --> C[roadmap]
47
- C --> D[analyze]
48
- D --> E[plan]
49
- E --> F[execute]
50
- F --> G[verify]
51
- G --> H[review]
52
- H --> I[milestone-audit]
53
- I --> J[milestone-complete]
54
- J -->|next milestone| D
27
+ npm install -g maestro-flow
28
+ maestro install
55
29
  ```
56
30
 
57
- All work artifacts live in `.workflow/scratch/`, tracked by `state.json` artifact registry. Phases are labels in the roadmap, not directories.
58
-
59
- ### Quick Channels
60
-
61
- | Channel | Flow | When |
62
- |---------|------|------|
63
- | `/maestro-quick` | analyze > plan > execute | Quick fixes, small features |
64
- | Scratch mode | `analyze -q` > `plan --dir` > `execute --dir` | No roadmap, just get it done |
65
- | `/maestro "..."` | AI-routed command chain | Describe intent, let Maestro-Flow decide |
31
+ **Prerequisites**: Node.js >= 18, Claude Code CLI. Optional: Codex CLI, Gemini CLI for multi-agent workflows.
66
32
 
67
33
  ---
68
34
 
69
- ## Four Pillars
70
-
71
- ### 1. Structured Pipeline
72
-
73
- Scratch-based milestone workflow with artifact registry tracking. Each milestone moves through analyze > plan > execute > verify > review > milestone-audit > milestone-complete. All artifacts live in `.workflow/scratch/`, registered in `state.json`. The dashboard shows what's happening and what to do next.
35
+ ## Quick Start
74
36
 
75
- 49 slash commands across 6 categories power every stage from project initialization to quality retrospective.
37
+ **`/maestro-ralph`** is the primary entry point a closed-loop lifecycle engine that reads project state, infers your position, builds an adaptive command chain, and drives it to completion:
76
38
 
77
- ### 2. Autonomous Autopilot
78
-
79
- **Commander Agent** — a background supervisor that runs a tick loop:
80
-
81
- ```
82
- assess → decide → dispatch → wait → assess → ...
39
+ ```bash
40
+ /maestro-ralph "implement OAuth2 authentication with refresh tokens"
83
41
  ```
84
42
 
85
- It reads project state (phases, tasks, issues, agent slots), decides what needs attention, and dispatches agents automatically. Three profiles: `conservative`, `balanced`, `aggressive`.
43
+ Ralph automatically determines where you are in the lifecycle (brainstorm roadmap analyze plan execute → verify → review → test → milestone-complete) and builds the appropriate chain. Decision nodes at key checkpoints evaluate results and dynamically insert debug+fix loops when needed.
86
44
 
87
- **Issue Closed-Loop** — issues aren't just tickets, they're a self-healing pipeline:
88
-
89
- ```mermaid
90
- graph LR
91
- A[Discover] --> B[Analyze]
92
- B --> C[Plan]
93
- C --> D[Execute]
94
- D --> E[Close]
45
+ ```bash
46
+ /maestro-ralph status # View session progress
47
+ /maestro-ralph continue # Resume after decision pause
48
+ /maestro-ralph -y "build a REST API" # Full auto mode — no pauses
95
49
  ```
96
50
 
97
- | Stage | Command | What Happens |
98
- |-------|---------|-------------|
99
- | **Discover** | `/manage-issue-discover` | 8-perspective scan: bugs, UX, tech debt, security, performance, testing gaps, code quality, documentation |
100
- | **Analyze** | `/maestro-analyze --gaps` | Root cause analysis via CLI exploration, writes `issue.analysis` |
101
- | **Plan** | `/maestro-plan --gaps` | Generate TASK files linked to issues via `task_refs` |
102
- | **Execute** | `/maestro-execute` | Wave-based parallel execution with automatic issue status sync |
103
- | **Close** | Automatic | All linked tasks completed > resolved > closed |
104
-
105
- Quality commands (`review`, `test`, `verify`) automatically create issues for problems they find. Issue fixes flow back into the phase pipeline. The loop closes itself.
106
-
107
- ### 3. Visual Control Plane
51
+ ### Other Entry Points
108
52
 
109
- Real-time project dashboard at `http://127.0.0.1:3001`. Built with React 19, Tailwind CSS 4, and WebSocket live updates.
110
-
111
- | View | Key | What You See |
112
- |------|-----|-------------|
113
- | **Board** | `K` | Kanban columns — Backlog, In Progress, Review, Done |
114
- | **Timeline** | `T` | Gantt-style phase timeline with progress bars |
115
- | **Table** | `L` | Every phase and issue in a sortable table |
116
- | **Center** | `C` | Command center — active executions, issue queue, quality metrics |
117
-
118
- Pick an agent on any issue card, hit play. Batch-select issues, dispatch them all in parallel. Watch agents work in a real-time streaming panel.
119
-
120
- ### 4. Smart Knowledge Base
121
-
122
- The project builds intelligence over time through two systems:
123
-
124
- **Wiki Knowledge Graph** — structured entries (specs, phases, decisions, lessons) connected by semantic links. BM25 search, backlink traversal, health scoring. `/wiki-connect` discovers hidden connections; `/wiki-digest` generates themed digests with coverage heatmaps and gap analysis.
125
-
126
- **Learning Toolkit** — 5 commands that turn code and history into reusable knowledge:
127
-
128
- | Command | What It Does |
53
+ | Command | When to Use |
129
54
  |---------|-------------|
130
- | `/learn-retro` | Unified retrospective git metrics + decision evaluation via `--lens git\|decision\|all` |
131
- | `/learn-follow` | Guided reading with forcing questions extracts patterns and builds understanding |
132
- | `/learn-decompose` | 4-dimension parallel pattern extraction, saves to specs/wiki |
133
- | `/learn-second-opinion` | Multi-perspective analysis: review, challenge, or consult modes |
134
- | `/learn-investigate` | Systematic question investigation with hypothesis testing |
135
-
136
- All learning commands share `lessons.jsonl` — a unified knowledge store queryable via `/manage-learn`. Specs, retrospectives, and manual insights all flow into the same pool.
55
+ | `/maestro "..."` | Describe intent, let AI route to the optimal command chain |
56
+ | `/maestro-quick` | Quick fixes, small features (analyze plan execute) |
57
+ | `/maestro-*` | Step-by-step: init, roadmap, analyze, plan, execute, verify |
137
58
 
138
59
  ---
139
60
 
140
- ## Under the Hood
141
-
142
- ### Multi-Agent Engine
143
-
144
- Maestro-Flow coordinates multiple AI agents in parallel:
145
-
146
- ```
147
- ┌────────────────────────────────┐
148
- │ ExecutionScheduler │
149
- │ (wave-based parallel engine) │
150
- └───────────┬────────────────────┘
151
-
152
- ┌──────────────┼──────────────┐
153
- │ │ │
154
- ┌─────┴─────┐ ┌─────┴──────┐ ┌────┴──────┐
155
- │ Claude │ │ Codex │ │ Gemini │
156
- │ Agent SDK │ │ CLI │ │ CLI │
157
- └───────────┘ └────────────┘ └───────────┘
158
- ```
159
-
160
- - **Wave execution** — independent tasks run in parallel, dependent tasks wait for predecessors
161
- - **Agent SDK** — native Claude Agent SDK for Claude Code processes
162
- - **CLI adapters** — Codex, Gemini, Qwen, OpenCode all accessible through `maestro delegate`
163
- - **Workspace isolation** — each agent gets a clean execution context
164
-
165
- ### Hook System
166
-
167
- 11 context-aware hooks across 3 installation levels:
61
+ ## Key Features
168
62
 
169
- | Hook | Purpose |
170
- |------|---------|
171
- | `context-monitor` | Monitors context usage, injects warnings when running low |
172
- | `spec-injector` | Auto-injects project specs into subagent prompts by agent type + keyword |
173
- | `keyword-spec-injector` | Scans user prompts for keywords, injects matching `<spec-entry>` entries |
174
- | `spec-validator` | Validates `<spec-entry>` format on Write/Edit to `.workflow/specs/` |
175
- | `delegate-monitor` | Tracks async delegate task completion |
176
- | `team-monitor` | Collab heartbeat — reports activity to `.workflow/collab/activity.jsonl` for teammate awareness |
177
- | `telemetry` | Execution telemetry collection |
178
- | `session-context` | Injects workflow state at session start |
179
- | `skill-context` | Injects workflow state when invoking workflow skills |
180
- | `coordinator-tracker` | Tracks coordinator chain progress |
181
- | `workflow-guard` | Protects critical files and enforces workflow constraints |
63
+ ### 1. Adaptive Lifecycle Engine (`maestro-ralph`)
182
64
 
183
- The `spec-injector` routes project specs to agents based on category coding agents get coding conventions, planning agents get architecture constraints. The `keyword-spec-injector` provides entry-level precision when a user prompt mentions "auth", only auth-related spec entries are injected. Session dedup prevents re-injection within the same session. A 4-tier context budget (full > reduced > minimal > skip) adapts injection volume to remaining context.
65
+ Reads project state infers lifecycle position builds command chain with decision nodes. At each checkpoint, Ralph reads actual execution results and decides: continue, or insert a debug fix retry loop. The chain grows and shrinks dynamically based on outcomes.
184
66
 
185
- ```bash
186
- maestro hooks install --level minimal # context-monitor + spec-injector
187
- maestro hooks install --level standard # + delegate/team/telemetry + session/skill-context + coordinator-tracker
188
- maestro hooks install --level full # + workflow-guard
189
67
  ```
190
-
191
- ### Overlay System
192
-
193
- Non-invasive patches for `.claude/commands/*.md` — add steps, reading requirements, quality gates without editing originals. Overlays survive `maestro install` upgrades.
194
-
195
- ```bash
196
- /maestro-overlay "add CLI verification after maestro-execute"
197
- maestro overlay list # Interactive TUI management
198
- maestro overlay bundle -o team.json # Pack for sharing
68
+ brainstorm → init → roadmap → analyze → plan → execute → verify
69
+ post-verify
70
+ business-test
71
+ ◆ post-business-test
72
+ review
73
+ ◆ post-review
74
+ test
75
+ post-test
76
+ milestone-audit milestone-complete
77
+ ◆ post-milestone → next milestone
199
78
  ```
200
79
 
201
- ---
80
+ **Three quality modes** — control how thorough each phase is:
202
81
 
203
- ## 49 Commands, 21 Agents
82
+ | Mode | Stages | When |
83
+ |------|--------|------|
84
+ | `full` | verify → business-test → review → test-gen → test | Production features, security-critical code |
85
+ | `standard` | verify → review → test | Default, balanced quality |
86
+ | `quick` | verify → CLI-review | Quick fixes, prototyping |
204
87
 
205
- ### Commands
88
+ **Full pipeline explained** — each stage serves a distinct quality gate:
206
89
 
207
- | Category | Count | Prefix | Purpose |
208
- |----------|-------|--------|---------|
209
- | **Core Workflow** | 18 | `maestro-*` | Full lifecycle init, brainstorm, roadmap, analyze, plan, execute, verify, coordinate, milestones, overlays, UI design |
210
- | **Management** | 12 | `manage-*` | Issue lifecycle, codebase docs, knowledge capture, memory, harvest, status |
211
- | **Quality** | 9 | `quality-*` | Review, test, debug, test-gen, integration-test, business-test, refactor, retrospective, sync |
212
- | **Learning** | 5 | `learn-*` | Unified retro, follow-along, pattern decompose, investigate, second opinion |
213
- | **Specification** | 3 | `spec-*` | Setup, add, load |
214
- | **Wiki** | 2 | `wiki-*` | Connection discovery, knowledge digest |
90
+ 1. **verify** goal-backward verification: checks that all plan requirements are implemented, validates architectural constraints, anti-pattern scan, Nyquist test coverage
91
+ 2. **business-test** — PRD-forward business testing: requirement traceability, fixture generation, multi-layer execution against acceptance criteria
92
+ 3. **review** — multi-dimensional code review: correctness, readability, performance, security, testing, architecture
93
+ 4. **test-gen** coverage gap analysis and automatic test generation (TDD/E2E classification, L0-L3 progressive layers)
94
+ 5. **test** conversational UAT: interactive exploratory testing with session persistence and gap-plan closure
215
95
 
216
- ### Agents
96
+ At each `◆` decision node, Ralph evaluates outcomes and decides: pass through, or insert a debug → fix → retry loop. Max retries configurable per decision point.
217
97
 
218
- 21 specialized agent definitions in `.claude/agents/` — each a focused Markdown file that Claude Code loads on demand. Includes `workflow-planner`, `workflow-executor`, `issue-discover-agent`, `workflow-debugger`, `workflow-verifier`, `team-worker`, and more.
98
+ ### 2. Structured Pipeline
219
99
 
220
- ---
100
+ 49 slash commands across 6 categories power every stage — from project initialization to quality retrospective. All artifacts live in `.workflow/scratch/`, tracked by `state.json`.
221
101
 
222
- ## Getting Started
102
+ ### 3. Issue Closed-Loop
223
103
 
224
- ### Prerequisites
104
+ Issues aren't just tickets — they're a self-healing pipeline: discover → analyze → plan → execute → close. Quality commands automatically create issues for problems they find. Issue fixes flow back into the phase pipeline.
225
105
 
226
- - Node.js >= 18
227
- - [Claude Code](https://claude.com/code) CLI
228
- - (Optional) Codex CLI, Gemini CLI for multi-agent workflows
106
+ ### 4. Visual Dashboard
229
107
 
230
- ### Install
108
+ Real-time dashboard at `http://127.0.0.1:3001` with Kanban board, Gantt timeline, sortable table, and command center. Pick an agent on any issue card and dispatch. Built with React 19, Tailwind CSS 4, WebSocket live updates.
231
109
 
232
110
  ```bash
233
- npm install -g maestro-flow
234
-
235
- # Install workflows, commands, agents, templates
236
- maestro install
111
+ maestro serve # http://127.0.0.1:3001
112
+ maestro view # Terminal TUI alternative
237
113
  ```
238
114
 
239
- ### First Run
115
+ ### 5. Multi-Agent Engine
240
116
 
241
- ```bash
242
- /maestro-init # Initialize project
243
- /maestro-roadmap # Create roadmap
244
- /maestro-analyze # Analyze current milestone
245
- /maestro-plan # Plan (outputs to scratch/)
246
- /maestro-execute # Execute all pending plans
247
-
248
- # Or just:
249
- /maestro "build a REST API for user management"
250
- ```
117
+ Coordinates Claude Code, Codex, Gemini, Qwen, and OpenCode in parallel via wave-based execution. Independent tasks run concurrently; dependent tasks wait for predecessors.
251
118
 
252
- ### Dashboard
119
+ ### 6. Smart Knowledge Base
253
120
 
254
- ```bash
255
- maestro serve # → http://127.0.0.1:3001
256
- maestro view # Terminal TUI alternative
257
- ```
121
+ Wiki knowledge graph with BM25 search, backlink traversal, and health scoring. Learning toolkit (retro, follow, decompose, investigate, second-opinion) feeds into a unified `lessons.jsonl` store.
258
122
 
259
- ### MCP Server
123
+ ### 7. Hook & Overlay System
260
124
 
261
- ```bash
262
- # Claude Code — load as development MCP server
263
- claude --dangerously-load-development-channels server:maestro --dangerously-skip-permissions
125
+ 11 context-aware hooks inject project specs into agent prompts, monitor context usage, and track delegate execution. The overlay system enables non-invasive patches for `.claude/commands/*.md` that survive upgrades.
264
126
 
265
- # stdio transport (for Claude Desktop, other MCP clients)
266
- npm run mcp
267
- ```
127
+ ---
268
128
 
269
- ### Workflow Launcher
129
+ ## Commands & Agents
270
130
 
271
- ```bash
272
- maestro launcher # Interactive workflow + settings picker
273
- maestro launcher list # Show registered workflows
274
- ```
131
+ | Category | Count | Prefix | Purpose |
132
+ |----------|-------|--------|---------|
133
+ | **Core Workflow** | 18 | `maestro-*` | Full lifecycle ralph, init, roadmap, analyze, plan, execute, verify, milestones, overlays |
134
+ | **Management** | 12 | `manage-*` | Issue lifecycle, codebase docs, knowledge capture, memory, status |
135
+ | **Quality** | 9 | `quality-*` | Review, test, debug, test-gen, integration-test, business-test, refactor, sync |
136
+ | **Learning** | 5 | `learn-*` | Retro, follow-along, pattern decompose, investigate, second opinion |
137
+ | **Specification** | 3 | `spec-*` | Setup, add, load |
138
+ | **Wiki** | 2 | `wiki-*` | Connection discovery, knowledge digest |
139
+
140
+ 21 specialized agent definitions in `.claude/agents/` — each a focused Markdown file that Claude Code loads on demand.
275
141
 
276
142
  ---
277
143
 
@@ -288,9 +154,6 @@ maestro/
288
154
  │ └── src/
289
155
  │ ├── client/ # React 19 + Zustand + Tailwind CSS 4
290
156
  │ ├── server/ # Hono API + WebSocket + SSE
291
- │ │ ├── agents/ # AgentManager + adapters
292
- │ │ ├── commander/ # Autonomous Commander Agent
293
- │ │ └── execution/ # ExecutionScheduler + WaveExecutor
294
157
  │ └── shared/ # Shared types
295
158
  ├── .claude/
296
159
  │ ├── commands/ # 49 slash commands (.md)
@@ -300,8 +163,6 @@ maestro/
300
163
  └── extensions/ # Plugin system
301
164
  ```
302
165
 
303
- ### Tech Stack
304
-
305
166
  | Layer | Technology |
306
167
  |-------|-----------|
307
168
  | CLI | Commander.js, TypeScript, ESM |
@@ -315,22 +176,23 @@ maestro/
315
176
 
316
177
  ## Documentation
317
178
 
318
- - **[Command Usage Guide](guide/command-usage-guide.md)** — All 51 commands with workflow diagrams, pipeline chaining, Issue closed-loop, and quick channels
319
- - **[Spec System Guide](guide/spec-system-guide.md)** — Project specs with `<spec-entry>` closed-tag format, keyword-based loading, validation hooks, session dedup injection
320
- - **[Delegate Async Guide](guide/delegate-async-guide.md)** — Async task delegation: CLI & MCP usage, message injection, chaining, broker lifecycle
321
- - **[Overlay Guide](guide/overlay-guide.md)** — Non-invasive command extensions: overlay format, section injection, bundle/import, interactive TUI management
322
- - **[Hooks Guide](guide/hooks-guide.md)** — Hook system architecture, 11 hooks, spec injection, context budget, configuration
323
- - **[Worktree Parallel Dev Guide](guide/worktree-guide.md)** — Milestone-level worktree parallelism: fork, sync, merge, scope enforcement, dashboard integration
324
- - **[Collab — User Guide](guide/team-lite-guide.md)** — Multi-person collaboration for 2-8 person teams: join, sync, activity awareness, conflict preflight, task management, namespace isolation
325
- - **[Collab — Design](guide/team-lite-design.md)** — Architecture, data model, namespace boundary between human-collab (`.workflow/collab/`) and agent-pipeline (`.workflow/.team/`) domains
326
- - **[MCP Tools Reference](guide/mcp-tools-guide.en.md)** — All 9 MCP endpoint tools: file ops (edit/write/read), team collaboration (msg/mailbox/task/agent), and persistent memory
327
- - **[CLI Commands Reference](guide/cli-commands-guide.en.md)** — All 21 terminal commands — install, delegate, coordinate, wiki, hooks, overlay, collab, and more
179
+ - **[Maestro Ralph Guide](guide/maestro-ralph-guide.md)** — Adaptive lifecycle engine: position inference, decision nodes, quality modes, retry escalation
180
+ - **[Command Usage Guide](guide/command-usage-guide.md)** — All 51 commands with workflow diagrams, pipeline chaining, Issue closed-loop
181
+ - **[CLI Commands Reference](guide/cli-commands-guide.en.md)** — All 21 terminal commands: install, delegate, coordinate, wiki, hooks, overlay, collab
182
+ - **[Spec System Guide](guide/spec-system-guide.md)** — Project specs with `<spec-entry>` format, keyword-based loading, validation hooks
183
+ - **[Delegate Async Guide](guide/delegate-async-guide.md)** — Async task delegation: CLI & MCP usage, message injection, chaining
184
+ - **[Overlay Guide](guide/overlay-guide.md)** — Non-invasive command extensions: format, section injection, bundle/import
185
+ - **[Hooks Guide](guide/hooks-guide.md)** — Hook system architecture, 11 hooks, spec injection, context budget
186
+ - **[Worktree Guide](guide/worktree-guide.md)** — Milestone-level parallel development: fork, sync, merge, dashboard integration
187
+ - **[Collab User Guide](guide/team-lite-guide.md)** — Multi-person collaboration for 2-8 person teams
188
+ - **[Collab Design](guide/team-lite-design.md)** — Architecture, data model, namespace boundaries
189
+ - **[MCP Tools Reference](guide/mcp-tools-guide.en.md)** — All 9 MCP endpoint tools
328
190
 
329
191
  ---
330
192
 
331
193
  ## Acknowledgments
332
194
 
333
- - **[GET SHIT DONE](https://github.com/gsd-build/get-shit-done)** by TACHES — The spec-driven development model and context engineering philosophy that shaped Maestro-Flow's pipeline design.
195
+ - **[GET SHIT DONE](https://github.com/gsd-build/get-shit-done)** by TACHES — The spec-driven development model and context engineering philosophy.
334
196
  - **[Claude-Code-Workflow](https://github.com/catlog22/Claude-Code-Workflow)** — The predecessor that pioneered multi-CLI orchestration and skill-based workflow routing.
335
197
 
336
198
  ## Contributors