@sulhadin/orchestrator 3.0.0-beta.14 → 3.0.0-beta.15

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -62,6 +62,8 @@ PM challenges scope, creates M1-user-auth with 3 phases
62
62
  | `/orchestra start --auto` | Fully autonomous — warns once, then auto-push |
63
63
  | `/orchestra hotfix {desc}` | Ultra-fast fix: implement → verify → commit → push |
64
64
  | `/orchestra status` | Milestone status report (PM only) |
65
+ | `/orchestra verifier [N]` | Verify milestones match PRD/RFC requirements (PM only) |
66
+ | `/orchestra rewind [N]` | Review execution history: decisions, metrics, insights (PM only) |
65
67
  | `/orchestra blueprint {name}` | Generate milestones from template |
66
68
  | `/orchestra blueprint add` | Save current work as reusable template |
67
69
  | `/orchestra create-role` | Create a new role interactively (Orchestrator only) |
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@sulhadin/orchestrator",
3
- "version": "3.0.0-beta.14",
3
+ "version": "3.0.0-beta.15",
4
4
  "description": "AI Team Orchestration System — multi-role coordination for Claude Code",
5
5
  "bin": "bin/index.js",
6
6
  "scripts": {
@@ -221,13 +221,8 @@ Behavior after milestone completion depends on `pipeline.milestone_isolation`:
221
221
  ### Inline Mode (default)
222
222
 
223
223
  After push and retro:
224
- 1. Clear context.md: remove `## Status`, `## Phases`, `## Decisions`, `## Metrics` sections, keep only `## Codebase Map`
225
- 2. **STOP.** Print: "Milestone {id} complete and pushed. Run `/compact` then `/orchestra start` for next milestone."
226
- 3. Do NOT loop to next milestone — user manages context manually.
227
-
228
- **Why stop?** Conductor accumulates ~5-8k tokens per milestone from phase
229
- results, review cycles, and commit logs. In inline mode, the user controls
230
- when to compact and restart, keeping quality high across milestones.
224
+ 1. **STOP.** Print: "Milestone {id} complete and pushed."
225
+ 2. Do NOT loop to next milestone.
231
226
 
232
227
  ### Agent Mode
233
228
 
@@ -367,7 +362,7 @@ pipeline: {quick | standard | full}
367
362
  - **Phase failed:** Set status to `failed`, add error summary and last-error
368
363
  - **Decisions:** Append key decisions from sub-agent's `notes` field — only non-obvious choices that affect later phases
369
364
  - **Metrics:** Record approximate phase duration and verification_retries from sub-agent result
370
- - **Milestone complete (inline mode):** Clear all sections except `## Codebase Map`
365
+ - **Milestone complete:** Retro is written to knowledge.md (see Milestone Completion)
371
366
 
372
367
  ### On Resume
373
368
 
@@ -11,6 +11,8 @@ COMMANDS:
11
11
  /orchestra start --auto Fully autonomous (warns once, then auto-push)
12
12
  /orchestra hotfix {desc} Ultra-fast fix: implement → verify → commit → push
13
13
  /orchestra status Milestone status report (PM only)
14
+ /orchestra verifier [N] Verify milestones match requirements (PM only)
15
+ /orchestra rewind [N] Review milestone execution history (PM only)
14
16
  /orchestra help Show this help
15
17
  /orchestra blueprint {name} Generate milestones from template (PM only)
16
18
  /orchestra blueprint add Save current work as blueprint (PM only)
@@ -0,0 +1,60 @@
1
+ Review milestone execution history for actionable insights. PM role only.
2
+
3
+ **Usage:**
4
+ - `/orchestra rewind` — rewind all `done` milestones
5
+ - `/orchestra rewind 1,2,3` — rewind only specified milestone numbers
6
+
7
+ 1. Read `.orchestra/roles/product-manager.md` to activate PM.
8
+ 2. Scan `.orchestra/milestones/` — collect milestones to review:
9
+ - No arguments: all milestones with `status: done`
10
+ - With numbers: only milestones matching those numbers (e.g., `1` matches `M1-*`)
11
+ 3. For each milestone, read execution artifacts:
12
+ - `context.md` — structured sections:
13
+ - `## Decisions` — key choices made during implementation
14
+ - `## Metrics` — phase duration and verification retries
15
+ - `## Phases` — status, commits, errors per phase
16
+ - `knowledge.md` — retro entry for this milestone
17
+ - `grooming.md` — original scope vs what actually happened
18
+ - Review verdict and comments (from context.md or git log)
19
+ 4. Extract and present — focus on **what the user needs to know**, not execution mechanics:
20
+
21
+ ```
22
+ ## Rewind: M1-user-auth
23
+
24
+ ### Key Decisions Made During Execution
25
+ - phase-1: Used Stripe SDK v4 instead of raw API (architect RFC recommendation)
26
+ - phase-2: Split webhook handler into separate file for testability
27
+ - phase-3: Chose CSS modules over Tailwind (frontend preference)
28
+
29
+ ### Performance
30
+ - Total phases: 5 | Completed: 5 | Failed: 0
31
+ - Longest phase: phase-3 (~12min) — complex UI with form validation
32
+ - Verification retries: 3 total (phase-2: 2, phase-4: 1)
33
+ - Stuck: No
34
+
35
+ ### Review Findings
36
+ - Verdict: approved-with-comments
37
+ - Comments:
38
+ - "Consider adding index on user_email for login query" (non-blocking)
39
+ - "Error messages expose internal details" (non-blocking, logged)
40
+
41
+ ### Scope Changes
42
+ - Original grooming planned 4 phases, executed 5 (phase-3 was split during implementation)
43
+ - phase-2 scope expanded: webhook handler was not in original PRD, added during RFC
44
+
45
+ ### Unresolved Items
46
+ - 🔧 DB index on user_email — reviewer flagged, not addressed
47
+ - 🔧 Error message sanitization — reviewer flagged, not addressed
48
+ - 🔧 phase-2 workaround: hardcoded timeout — flagged as tech debt in Decisions
49
+
50
+ ### What We Learned
51
+ - 📝 Webhook handler pattern — reusable for future integrations
52
+ - ⏱️ Form validation phases consistently slow — consider a form-validation skill
53
+ - 💡 Splitting phase-3 mid-execution worked well — complex UI benefits from smaller phases
54
+ ```
55
+
56
+ 5. After all milestones, present a cross-milestone summary:
57
+ - **Unresolved items** — review comments and flagged workarounds never addressed, across all milestones
58
+ - **Recurring patterns** — same review comments, same slow phase types, same failure modes
59
+ - **Skill gaps** — missing skills that would have helped
60
+ - **Strategic suggestions** — new skills to create, process improvements, items to fix in upcoming work
@@ -0,0 +1,52 @@
1
+ Verify that implemented milestones match their requirements. PM role only.
2
+
3
+ **Usage:**
4
+ - `/orchestra verifier` — verify all `done` milestones
5
+ - `/orchestra verifier 1,2,3` — verify only specified milestone numbers
6
+
7
+ 1. Read `.orchestra/roles/product-manager.md` to activate PM.
8
+ 2. Scan `.orchestra/milestones/` — collect milestones to verify:
9
+ - No arguments: all milestones with `status: done`
10
+ - With numbers: only milestones matching those numbers (e.g., `1` matches `M1-*`)
11
+ 3. For each milestone, read:
12
+ - `prd.md` — product requirements and acceptance criteria
13
+ - `rfc.md` — technical design decisions (if exists)
14
+ - `milestone.md` — summary and acceptance criteria
15
+ - `grooming.md` — scope decisions and phase breakdown
16
+ - All `phases/*.md` — phase acceptance criteria
17
+ 4. For each milestone, read execution context:
18
+ - `context.md` — `## Decisions` section (why specific approaches were chosen)
19
+ - `context.md` — `## Phases` section (which phases completed, which failed)
20
+ 5. For each milestone, read the actual implementation:
21
+ - Run `git log --oneline` filtered to commits from that milestone's phases
22
+ - Run `git diff` for those commits to see what changed
23
+ - Read the current state of modified files — diff shows changes, but current code shows completeness
24
+ 6. Compare requirements vs implementation. For each requirement/acceptance criterion:
25
+ - **met** — implementation satisfies the requirement
26
+ - **partial** — partially implemented, missing aspects noted
27
+ - **missed** — not implemented at all
28
+ - **deviated** — implemented differently than specified
29
+ 6. Report:
30
+
31
+ ```
32
+ ## Verification: M1-user-auth
33
+
34
+ ### Requirements Coverage
35
+ - ✅ met: JWT authentication endpoint (phase-1, commit abc123)
36
+ - ⚠️ partial: Rate limiting — implemented but no Redis backing (phase-2)
37
+ - ❌ missed: Password reset flow — not in any commit
38
+ - 🔀 deviated: Token refresh — RFC said rotating tokens, implemented static expiry
39
+
40
+ ### Summary
41
+ 4 requirements: 1 met, 1 partial, 1 missed, 1 deviated
42
+
43
+ ### Severity
44
+ - 🔴 critical: Password reset flow missing (core auth feature)
45
+ - 🟡 moderate: Rate limiting without Redis (works but won't scale)
46
+ - 🟡 moderate: Token refresh deviation (security concern)
47
+ ```
48
+
49
+ 8. After reporting all milestones, if there are critical or moderate gaps:
50
+ - List gaps grouped by severity
51
+ - Suggest: "Use `/orchestra pm` to plan fix milestones for these gaps."
52
+ - Do NOT create milestones directly — PM decides scope and priority
@@ -386,7 +386,7 @@ sequenceDiagram
386
386
  C->>C: reviewer → approved
387
387
  C->>C: Push → M1 done
388
388
 
389
- Note over C: STOP. "Run /compact then /orchestra start"
389
+ Note over C: STOP. "Run /compact or /clear then /orchestra start"
390
390
  ```
391
391
 
392
392
  ### 3. Conductor Execution Loop (Agent Mode)