npm - opencode-swarm-plugin - Versions diffs - 0.23.6 → 0.24.0 - Mend

opencode-swarm-plugin 0.23.6 → 0.24.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (32) hide show

package/.turbo/turbo-build.log +4 -4
package/CHANGELOG.md +27 -0
package/README.md +155 -3
package/bin/swarm.ts +497 -187
package/dist/index.d.ts +27 -0
package/dist/index.d.ts.map +1 -1
package/dist/index.js +620 -89
package/dist/plugin.js +548 -89
package/dist/schemas/bead-events.d.ts +698 -0
package/dist/schemas/bead-events.d.ts.map +1 -0
package/dist/schemas/index.d.ts +1 -0
package/dist/schemas/index.d.ts.map +1 -1
package/dist/skills.d.ts.map +1 -1
package/dist/swarm-decompose.d.ts +74 -0
package/dist/swarm-decompose.d.ts.map +1 -1
package/dist/swarm-orchestrate.d.ts.map +1 -1
package/dist/swarm-prompts.d.ts +1 -1
package/dist/swarm-prompts.d.ts.map +1 -1
package/dist/swarm.d.ts +27 -0
package/dist/swarm.d.ts.map +1 -1
package/docs/testing/context-recovery-test.md +470 -0
package/examples/commands/swarm.md +92 -20
package/global-skills/swarm-coordination/SKILL.md +380 -10
package/package.json +1 -1
package/src/schemas/bead-events.test.ts +341 -0
package/src/schemas/bead-events.ts +583 -0
package/src/schemas/index.ts +51 -0
package/src/skills.ts +10 -3
package/src/swarm-decompose.ts +337 -0
package/src/swarm-orchestrate.ts +15 -51
package/src/swarm-prompts.ts +144 -42
package/src/swarm.integration.test.ts +581 -31

package/docs/testing/context-recovery-test.md ADDED Viewed

@@ -0,0 +1,470 @@
+# Context Recovery Manual Test Scenario
+## Overview
+This test scenario verifies that the swarm coordination system can survive context death and recover from checkpoints. It proves that work-in-progress state is preserved across session boundaries, enabling agents to resume work after catastrophic context loss.
+**What this tests:**
+- Automatic checkpoint creation at progress milestones (25%, 50%, 75%)
+- State persistence to swarm-mail event store
+- Recovery mechanism that restores agent state
+- Continuity of work across session boundaries
+**Success criteria:**
+- Agent can resume work from exact state before context loss
+- All file modifications are tracked
+- Progress percentage is preserved
+- Coordinator context and directives are restored
+---
+## Prerequisites
+### Required Setup
+1. **Project with swarm-mail initialized**
+   ```bash
+   cd /path/to/your/project
+   # Ensure swarm-mail database exists
+   ```
+2. **OpenCode Swarm Plugin installed**
+   ```bash
+   npm install opencode-swarm-plugin
+   # or
+   bun add opencode-swarm-plugin
+   ```
+3. **Test bead structure**
+   - Epic bead with at least one subtask
+   - Example:
+     ```bash
+     beads_create_epic(
+       epic_title: "Test Context Recovery",
+       subtasks: [
+         { title: "Modify test files", files: ["test/file1.ts", "test/file2.ts"] }
+       ]
+     )
+     ```
+4. **Two terminal windows/sessions**
+   - Session A: For initial work (will be killed)
+   - Session B: For recovery
+---
+## Test Procedure
+### Phase 1: Start Initial Work Session
+**Session A - Terminal 1**
+1. **Initialize swarm mail**
+   ```typescript
+   swarmmail_init(
+     project_path: "/absolute/path/to/project",
+     task_description: "bd-123.1: Test context recovery feature"
+   )
+   ```
+   **Expected result:**
+   ```json
+   {
+     "success": true,
+     "data": {
+       "agent_name": "BlueLake",  // Random agent name
+       "project_key": "/absolute/path/to/project"
+     }
+   }
+   ```
+   **Verify:**
+   - ✅ Agent name assigned (e.g., "BlueLake")
+   - ✅ Project key matches your path
+2. **Reserve files for work**
+   ```typescript
+   swarmmail_reserve(
+     paths: ["test/file1.ts", "test/file2.ts"],
+     reason: "bd-123.1: Context recovery test",
+     ttl_seconds: 3600
+   )
+   ```
+   **Expected result:**
+   ```json
+   {
+     "success": true,
+     "data": {
+       "reservation_ids": [1, 2],
+       "agent_name": "BlueLake",
+       "expires_at": 1234567890
+     }
+   }
+   ```
+   **Verify:**
+   - ✅ Reservation IDs returned
+   - ✅ Files locked to this agent
+3. **Make some file modifications**
+   ```bash
+   # Modify test/file1.ts
+   echo "// First change" >> test/file1.ts
+   ```
+   **Expected result:**
+   - File modified on disk
+   **Verify:**
+   - ✅ File contains new content
+4. **Report 50% progress (triggers auto-checkpoint)**
+   ```typescript
+   swarm_progress(
+     project_key: "/absolute/path/to/project",
+     agent_name: "BlueLake",
+     bead_id: "bd-123.1",
+     status: "in_progress",
+     progress_percent: 50,
+     message: "Completed first file modification",
+     files_touched: ["test/file1.ts"]
+   )
+   ```
+   **Expected result:**
+   ```json
+   {
+     "success": true,
+     "data": {
+       "checkpoint_created": true,
+       "message": "Progress reported and checkpoint saved"
+     }
+   }
+   ```
+   **Verify:**
+   - ✅ Checkpoint creation confirmed
+   - ✅ Progress percentage is 50
+   - ✅ Files touched recorded
+5. **Verify checkpoint was created in swarm-mail**
+   ```typescript
+   // Query the event store directly (if you have access)
+   // Or check via beads metadata
+   beads_query(status: "in_progress")
+   ```
+   **Expected result:**
+   - Bead shows 50% progress
+   - Checkpoint event exists in event store
+   **Verify:**
+   - ✅ Checkpoint event type: "swarm_checkpoint_created"
+   - ✅ Recovery data includes: epic_id, bead_id, files, progress_percent, files_modified
+---
+### Phase 2: Simulate Context Death
+**Session A - Terminal 1**
+6. **Kill the session abruptly**
+   ```bash
+   # Press Ctrl+C or kill the terminal
+   # DO NOT gracefully close - simulate crash
+   ```
+   **Expected result:**
+   - Session terminates immediately
+   - No cleanup runs
+   **Verify:**
+   - ✅ Session ended ungracefully
+   - ✅ Agent did NOT release reservations
+   - ✅ Work state is "frozen" in event store
+---
+### Phase 3: Recover State in New Session
+**Session B - Terminal 2**
+7. **Start fresh session (simulate new agent)**
+   ```typescript
+   swarmmail_init(
+     project_path: "/absolute/path/to/project",
+     task_description: "Recovering from context death"
+   )
+   ```
+   **Expected result:**
+   ```json
+   {
+     "success": true,
+     "data": {
+       "agent_name": "CrimsonPeak",  // DIFFERENT agent name
+       "project_key": "/absolute/path/to/project"
+     }
+   }
+   ```
+   **Verify:**
+   - ✅ New agent name (different from Session A)
+   - ✅ Fresh session started
+8. **Attempt recovery**
+   ```typescript
+   swarm_recover(
+     project_key: "/absolute/path/to/project",
+     bead_id: "bd-123.1"
+   )
+   ```
+   **Expected result:**
+   ```json
+   {
+     "success": true,
+     "data": {
+       "recovered": true,
+       "checkpoint": {
+         "epic_id": "bd-123",
+         "bead_id": "bd-123.1",
+         "strategy": "file-based",
+         "files": ["test/file1.ts", "test/file2.ts"],
+         "recovery": {
+           "last_checkpoint": 1234567890,
+           "files_modified": ["test/file1.ts"],
+           "progress_percent": 50,
+           "last_message": "Completed first file modification"
+         },
+         "directives": {
+           "shared_context": "Test context recovery feature",
+           "coordinator_notes": "Resume from 50% completion"
+         }
+       },
+       "message": "State recovered from checkpoint at 50%"
+     }
+   }
+   ```
+   **Verify:**
+   - ✅ Recovery successful
+   - ✅ Progress is 50% (matches last checkpoint)
+   - ✅ Files modified list is correct
+   - ✅ Last message preserved
+   - ✅ Strategy and directives restored
+9. **Verify file reservations were transferred**
+   ```typescript
+   // Check inbox for reservation status
+   swarmmail_inbox(limit: 5)
+   ```
+   **Expected result:**
+   - Reservations still exist (orphaned from BlueLake)
+   - OR recovery automatically transferred ownership to CrimsonPeak
+   **Verify:**
+   - ✅ Files are either still reserved or available for new reservation
+   - ✅ No reservation conflicts
+10. **Resume work with recovered state**
+    ```bash
+    # Modify test/file2.ts (continue where Session A left off)
+    echo "// Second change" >> test/file2.ts
+    ```
+    **Expected result:**
+    - File modified successfully
+    **Verify:**
+    - ✅ Agent can continue work
+    - ✅ File modifications build on previous state
+11. **Report completion**
+    ```typescript
+    swarm_complete(
+      project_key: "/absolute/path/to/project",
+      agent_name: "CrimsonPeak",
+      bead_id: "bd-123.1",
+      summary: "Completed context recovery test - survived session death",
+      files_touched: ["test/file1.ts", "test/file2.ts"]
+    )
+    ```
+    **Expected result:**
+    ```json
+    {
+      "success": true,
+      "data": {
+        "bead_closed": true,
+        "reservations_released": true,
+        "ubs_scan_passed": true
+      }
+    }
+    ```
+    **Verify:**
+    - ✅ Bead marked complete
+    - ✅ Reservations released
+    - ✅ All files touched recorded (both sessions combined)
+---
+## Verification Checklist
+### Checkpoint Creation
+- [ ] Auto-checkpoint triggered at 50% progress
+- [ ] Checkpoint includes epic_id, bead_id, strategy
+- [ ] Files list preserved
+- [ ] Progress percentage stored
+- [ ] Files modified list accurate
+- [ ] Last message captured
+### Recovery Mechanism
+- [ ] New session can query checkpoint by bead_id
+- [ ] All checkpoint data restored correctly
+- [ ] Directives and context preserved
+- [ ] Recovery returns actionable state object
+### State Continuity
+- [ ] Work can resume from exact checkpoint state
+- [ ] File modifications from Session A are visible
+- [ ] Progress percentage matches last checkpoint (50%)
+- [ ] Completion acknowledges full file list (both sessions)
+### Edge Cases
+- [ ] Recovery fails gracefully if no checkpoint exists
+- [ ] Recovery handles multiple checkpoints (returns latest)
+- [ ] Orphaned reservations don't block recovery
+- [ ] Recovery works across different agent names
+---
+## Expected Failure Modes (Negative Testing)
+### Test 1: Recovery with No Checkpoint
+```typescript
+swarm_recover(
+  project_key: "/path/to/project",
+  bead_id: "bd-999.1"  // Non-existent bead
+)
+```
+**Expected result:**
+```json
+{
+  "success": false,
+  "error": "No checkpoint found for bead bd-999.1"
+}
+```
+### Test 2: Recovery Before Any Progress
+```typescript
+// Create bead but never report progress
+swarm_recover(
+  project_key: "/path/to/project",
+  bead_id: "bd-123.2"
+)
+```
+**Expected result:**
+```json
+{
+  "success": false,
+  "error": "No checkpoint found - agent never reported progress"
+}
+```
+### Test 3: Manual Checkpoint Creation
+```typescript
+// Agent can force checkpoint at any time
+swarm_checkpoint(
+  project_key: "/path/to/project",
+  bead_id: "bd-123.1",
+  checkpoint_data: {
+    progress_percent: 33,
+    files_modified: ["test/file1.ts"],
+    message: "Manual checkpoint before risky operation"
+  }
+)
+```
+**Expected result:**
+```json
+{
+  "success": true,
+  "data": {
+    "checkpoint_id": 42,
+    "message": "Manual checkpoint created"
+  }
+}
+```
+---
+## Troubleshooting
+### Issue: Recovery returns empty checkpoint
+**Cause:** Checkpoint event not committed to event store
+**Fix:** Verify `swarm_progress` was called with `progress_percent >= 25`
+### Issue: Files modified in Session A not visible in Session B
+**Cause:** File changes not committed to git or filesystem
+**Fix:** Ensure file writes are flushed before killing session
+### Issue: Reservation conflicts after recovery
+**Cause:** Orphaned reservations from dead agent
+**Fix:** Implement TTL-based reservation expiry or manual release by project_key
+### Issue: Multiple checkpoints confuse recovery
+**Cause:** Recovery not selecting latest checkpoint
+**Fix:** Verify recovery queries `ORDER BY timestamp DESC LIMIT 1`
+---
+## Advanced Scenarios
+### Scenario A: Coordinator Death
+1. Coordinator spawns 5 worker agents
+2. Coordinator dies at 60% overall completion
+3. New coordinator recovers state for all workers
+4. Workers continue reporting to new coordinator
+### Scenario B: Cascading Recovery
+1. Worker A checkpoints at 50%
+2. Worker A dies
+3. Worker B recovers Worker A's state
+4. Worker B checkpoints at 75%
+5. Worker B dies
+6. Worker C recovers Worker B's state (which includes Worker A's progress)
+### Scenario C: Partial File Reservation
+1. Agent reserves 10 files
+2. Modifies 3 files
+3. Dies at 30%
+4. Recovery agent only needs to work on remaining 7 files
+---
+## Success Metrics
+| Metric | Target | Actual |
+|--------|--------|--------|
+| Recovery accuracy | 100% state match | _____ |
+| Time to recover | < 5 seconds | _____ |
+| Data loss | 0 bytes | _____ |
+| Checkpoint overhead | < 100ms per checkpoint | _____ |
+| Storage per checkpoint | < 10KB | _____ |
+---
+## Conclusion
+This manual test proves that:
+1. ✅ Agents can survive catastrophic context loss
+2. ✅ Work state is preserved in event-sourced storage
+3. ✅ Recovery is deterministic and accurate
+4. ✅ Multi-session workflows are possible
+**Sign-off:** If all verification checkboxes are marked and success metrics met, the context recovery feature is production-ready.

package/examples/commands/swarm.md CHANGED Viewed

@@ -10,10 +10,29 @@ $ARGUMENTS
 ## Flags (parse from task above)
+### Planning Modes
+- `--fast` - Skip brainstorming, go straight to decomposition
+- `--auto` - Use best recommendations, minimal questions
+- `--confirm-only` - Show decomposition, single yes/no, then execute
+- (default) - Full Socratic planning with questions and alternatives
+### Workflow Options
 - `--to-main` - Push directly to main, skip PR
 - `--no-sync` - Skip mid-task context sharing
-**Default: Feature branch + PR with context sync.**
+**Defaults: Socratic planning, feature branch + PR, context sync enabled.**
+### Example Usage
+```bash
+/swarm "task description"              # Full Socratic (default)
+/swarm --fast "task description"       # Skip brainstorming
+/swarm --auto "task description"       # Auto-select, minimal Q&A
+/swarm --confirm-only "task"           # Show plan, yes/no only
+/swarm --fast --to-main "quick fix"    # Fast mode + push to main
+```
 ## MANDATORY: Swarm Mail
@@ -126,11 +145,61 @@ git checkout -b swarm/<short-task-name>
 git push -u origin HEAD
 ```
-### 4. Decompose Task (DELEGATE TO SUBAGENT)
+### 4. Interactive Planning (MANDATORY)
+**Parse planning mode from flags:**
+- `--fast` → mode="fast"
+- `--auto` → mode="auto"
+- `--confirm-only` → mode="confirm-only"
+- No flag → mode="socratic" (default)
+**Use swarm_plan_interactive for ALL planning:**
+```bash
+# Start interactive planning session
+swarm_plan_interactive(
+  task="<task description>",
+  mode="socratic",  # or "fast", "auto", "confirm-only"
+  context="<synthesized knowledge from step 2>",
+  max_subtasks=5
+)
+```
+**Multi-turn conversation flow:**
+The tool returns:
+```json
+{
+  "ready_to_decompose": false,  // or true when planning complete
+  "follow_up": "What approach do you prefer: A) file-based or B) feature-based?",
+  "options": ["A) File-based...", "B) Feature-based..."],
+  "recommendation": "I recommend A because..."
+}
+```
+**Continue conversation until ready_to_decompose=true:**
+```bash
+# User responds to follow-up question
+# You call swarm_plan_interactive again with:
+swarm_plan_interactive(
+  task="<same task>",
+  mode="socratic",
+  context="<synthesized knowledge>",
+  user_response="A - file-based approach"
+)
+# Repeat until ready_to_decompose=true
+# Then tool returns final decomposition prompt
+```
+**When ready_to_decompose=true:**
 > **⚠️ CRITICAL: Context Preservation**
 >
-> **DO NOT decompose inline in the coordinator thread.** This consumes massive context with file reading, CASS queries, and reasoning. You will hit context limits on long swarms.
+> **DO NOT decompose inline in the coordinator thread.** This consumes massive context with file reading, CASS queries, and reasoning.
 >
 > **ALWAYS delegate to a `swarm/planner` subagent** that returns only the validated BeadTree JSON.
@@ -138,11 +207,8 @@ git push -u origin HEAD
 ```bash
 # This pollutes your main thread context
-swarm_select_strategy(task="<the task>")
-swarm_plan_prompt(task="<the task>", ...)
 # ... you reason about decomposition inline ...
 # ... context fills with file contents, analysis ...
-swarm_validate_decomposition(response="...")
 ```
 **✅ Do this (delegate to subagent):**
@@ -151,36 +217,42 @@ swarm_validate_decomposition(response="...")
 # 1. Create planning bead
 beads_create(title="Plan: <task>", type="task", description="Decompose into subtasks")
-# 2. Delegate to swarm/planner subagent
+# 2. Get final prompt from swarm_plan_interactive (when ready_to_decompose=true)
+# final_prompt = <from last swarm_plan_interactive call>
+# 3. Delegate to swarm/planner subagent
 Task(
   subagent_type="swarm/planner",
   description="Decompose task: <task>",
   prompt="
 You are a swarm planner. Generate a BeadTree for this task.
-## Task
-<task description>
-## Synthesized Context
-<from knowledge gathering step 2>
+<final_prompt from swarm_plan_interactive>
 ## Instructions
-1. Use swarm_select_strategy(task=\"...\")
-2. Use swarm_plan_prompt(task=\"...\", max_subtasks=5, query_cass=true)
-3. Reason about decomposition strategy
-4. Generate BeadTree JSON
-5. Validate with swarm_validate_decomposition
-6. Return ONLY the validated BeadTree JSON (no analysis)
+1. Reason about decomposition strategy
+2. Generate BeadTree JSON
+3. Validate with swarm_validate_decomposition
+4. Return ONLY the validated BeadTree JSON (no analysis)
 Output: Valid BeadTree JSON only.
   "
 )
-# 3. Subagent returns validated JSON, parse it
+# 4. Subagent returns validated JSON, parse it
 # beadTree = <result from subagent>
 ```
-**Why?**
+**Planning Mode Behavior:**
+| Mode            | Questions | User Input | Confirmation |
+| --------------- | --------- | ---------- | ------------ |
+| `socratic`      | Multiple  | Yes        | Yes          |
+| `fast`          | None      | No         | Yes          |
+| `auto`          | Minimal   | Rare       | No           |
+| `confirm-only`  | None      | Yes (1x)   | Yes (1x)     |
+**Why delegate?**
 - Main thread stays clean (only receives final JSON)
 - Subagent context is disposable (garbage collected after planning)