deepflow 0.1.44 → 0.1.46

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/bin/install.js CHANGED
@@ -146,7 +146,7 @@ async function main() {
  console.log(`${c.green}Installation complete!${c.reset}`);
  console.log('');
  console.log(`Installed to ${c.cyan}${CLAUDE_DIR}${c.reset}:`);
- console.log(' commands/df/ — /df:spec, /df:plan, /df:execute, /df:verify');
+ console.log(' commands/df/ — /df:discover, /df:debate, /df:spec, /df:plan, /df:execute, /df:verify');
  console.log(' skills/ — gap-discovery, atomic-commits, code-completeness');
  console.log(' agents/ — reasoner');
  if (level === 'global') {
@@ -165,7 +165,7 @@ async function main() {
  console.log(' 1. claude');
  }
  console.log(' 2. Describe what you want to build');
- console.log(' 3. /df:spec feature-name');
+ console.log(' 3. /df:discover feature-name');
  console.log('');
  }

package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "deepflow",
- "version": "0.1.44",
+ "version": "0.1.46",
  "description": "Stay in flow state - lightweight spec-driven task orchestration for Claude Code",
  "keywords": [
  "claude",
@@ -0,0 +1,283 @@
+ # /df:debate — Multi-Perspective Analysis
+
+ ## Orchestrator Role
+
+ You coordinate reasoner agents to debate a problem from multiple perspectives, then synthesize their arguments into a structured document.
+
+ **NEVER:** Read source files, use Glob/Grep directly, run git, use TaskOutput, use `run_in_background`, use Explore agents
+
+ **ONLY:** Spawn reasoner agents (non-background), write debate file, respond conversationally
+
+ ---
+
+ ## Purpose
+ Generate a multi-perspective analysis of a problem before formalizing into a spec. Surfaces tensions, trade-offs, and blind spots that a single perspective would miss.
+
+ ## Usage
+ ```
+ /df:debate <name>
+ ```
+
+ ## Skills & Agents
+
+ **Use Task tool to spawn agents:**
+ | Agent | subagent_type | model | Purpose |
+ |-------|---------------|-------|---------|
+ | User Advocate | `reasoner` | `opus` | UX, simplicity, real user needs |
+ | Tech Skeptic | `reasoner` | `opus` | Technical risks, hidden complexity, feasibility |
+ | Systems Thinker | `reasoner` | `opus` | Integration, scalability, long-term effects |
+ | LLM Efficiency | `reasoner` | `opus` | Token density, minimal scaffolding, navigable structure |
+ | Synthesizer | `reasoner` | `opus` | Merge perspectives into consensus + tensions |
+
+ ---
+
+ ## Behavior
+
+ ### 1. SUMMARIZE
+
+ Summarize the conversation context (from prior discover/conversation) in ~200 words. This summary will be passed to each perspective agent.
+
+ The summary should capture:
+ - The core problem being solved
+ - Key requirements mentioned
+ - Constraints and boundaries
+ - User's stated preferences and priorities
+
+ ### 2. SPAWN PERSPECTIVES
+
+ **Spawn ALL 4 perspective agents in ONE message (non-background, parallel):**
+
+ Each agent receives the same context summary but a different role. Each must:
+ - Argue from their perspective
+ - Identify risks the other perspectives might miss
+ - Propose concrete alternatives where they disagree with the likely approach
+
+ ```python
+ # All 4 in a single message — parallel, non-background:
+ Task(subagent_type="reasoner", model="opus", prompt="""
+ You are the USER ADVOCATE in a design debate.
+
+ ## Context
+ {summary}
+
+ ## Your Role
+ Argue from the perspective of the end user. Focus on:
+ - Simplicity and ease of use
+ - Real user needs vs assumed needs
+ - Friction points and cognitive load
+ - Whether the solution matches how users actually think
+
+ Provide:
+ 1. Your key arguments (3-5 points)
+ 2. Risks you see from a user perspective
+ 3. Concrete alternatives if you disagree with the current direction
+
+ Keep response under 400 words.
+ """)
+
+ Task(subagent_type="reasoner", model="opus", prompt="""
+ You are the TECH SKEPTIC in a design debate.
+
+ ## Context
+ {summary}
+
+ ## Your Role
+ Challenge technical assumptions and surface hidden complexity. Focus on:
+ - What could go wrong technically
+ - Hidden dependencies or coupling
+ - Complexity that seems simple but isn't
+ - Maintenance burden over time
+
+ Provide:
+ 1. Your key arguments (3-5 points)
+ 2. Technical risks others might overlook
+ 3. Simpler alternatives worth considering
+
+ Keep response under 400 words.
+ """)
+
+ Task(subagent_type="reasoner", model="opus", prompt="""
+ You are the SYSTEMS THINKER in a design debate.
+
+ ## Context
+ {summary}
+
+ ## Your Role
+ Analyze how this fits into the broader system. Focus on:
+ - Integration with existing components
+ - Scalability implications
+ - Second-order effects and unintended consequences
+ - Long-term evolution and extensibility
+
+ Provide:
+ 1. Your key arguments (3-5 points)
+ 2. Systemic risks and ripple effects
+ 3. Architectural alternatives worth considering
+
+ Keep response under 400 words.
+ """)
+
+ Task(subagent_type="reasoner", model="opus", prompt="""
+ You are the LLM EFFICIENCY expert in a design debate.
+
+ ## Context
+ {summary}
+
+ ## Your Role
+ Evaluate from the perspective of LLM consumption and interaction. Focus on:
+ - Token density: can the output be consumed efficiently by LLMs?
+ - Minimal scaffolding: avoid ceremony that adds tokens without information
+ - Navigable structure: can an LLM quickly find what it needs?
+ - Attention budget: does the design respect limited context windows?
+
+ Provide:
+ 1. Your key arguments (3-5 points)
+ 2. Efficiency risks others might not consider
+ 3. Alternatives that optimize for LLM consumption
+
+ Keep response under 400 words.
+ """)
+ ```
+
+ ### 3. SYNTHESIZE
+
+ After all 4 perspectives return, spawn 1 additional reasoner to synthesize:
+
+ ```python
+ Task(subagent_type="reasoner", model="opus", prompt="""
+ You are the SYNTHESIZER. Four perspectives have debated a design problem.
+
+ ## Context
+ {summary}
+
+ ## User Advocate's Arguments
+ {user_advocate_response}
+
+ ## Tech Skeptic's Arguments
+ {tech_skeptic_response}
+
+ ## Systems Thinker's Arguments
+ {systems_thinker_response}
+
+ ## LLM Efficiency's Arguments
+ {llm_efficiency_response}
+
+ ## Your Task
+ Synthesize these perspectives into:
+
+ 1. **Consensus** — Points where all or most perspectives agree
+ 2. **Tensions** — Unresolved disagreements and genuine trade-offs
+ 3. **Open Decisions** — Questions that need human judgment to resolve
+ 4. **Recommendation** — Your balanced recommendation considering all perspectives
+
+ Be specific. Name the tensions, don't smooth them over.
+
+ Keep response under 500 words.
+ """)
+ ```
+
+ ### 4. WRITE DEBATE FILE
+
+ Create `specs/.debate-{name}.md`:
+
+ ```markdown
+ # Debate: {Name}
+
+ ## Context
+ [~200 word summary from step 1]
+
+ ## Perspectives
+
+ ### User Advocate
+ [arguments from agent]
+
+ ### Tech Skeptic
+ [arguments from agent]
+
+ ### Systems Thinker
+ [arguments from agent]
+
+ ### LLM Efficiency
+ [arguments from agent]
+
+ ## Synthesis
+
+ ### Consensus
+ [from synthesizer]
+
+ ### Tensions
+ [from synthesizer]
+
+ ### Open Decisions
+ [from synthesizer]
+
+ ### Recommendation
+ [from synthesizer]
+ ```
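The file-assembly step above is a mechanical render of the template from the five agent responses. A minimal Python sketch, assuming dict-shaped inputs; the function names (`build_debate_doc`, `debate_path`) are illustrative, not part of deepflow:

```python
from pathlib import Path

ROLES = ("User Advocate", "Tech Skeptic", "Systems Thinker", "LLM Efficiency")
SECTIONS = ("Consensus", "Tensions", "Open Decisions", "Recommendation")

def build_debate_doc(name, summary, perspectives, synthesis):
    """Render the debate template from the four perspective responses
    plus the synthesizer's four sections."""
    parts = [f"# Debate: {name}", "", "## Context", summary, "", "## Perspectives"]
    for role in ROLES:
        parts += ["", f"### {role}", perspectives[role]]
    parts += ["", "## Synthesis"]
    for section in SECTIONS:
        parts += ["", f"### {section}", synthesis[section]]
    return "\n".join(parts) + "\n"

def debate_path(name, specs_dir="specs"):
    # Dot prefix marks the file as auxiliary, e.g. specs/.debate-auth.md
    return Path(specs_dir) / f".debate-{name}.md"
```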
+
+ ### 5. CONFIRM
+
+ After writing the file, present a brief summary to the user:
+
+ ```
+ ✓ Created specs/.debate-{name}.md
+
+ Key tensions:
+ - [tension 1]
+ - [tension 2]
+
+ Open decisions:
+ - [decision 1]
+ - [decision 2]
+
+ Next: Run /df:spec {name} to formalize into a specification
+ ```
+
+ ---
+
+ ## Rules
+
+ - **All 4 perspective agents MUST be spawned in ONE message** (parallel, non-background)
+ - **NEVER use `run_in_background`** — causes late notifications that pollute output
+ - **NEVER use TaskOutput** — returns full transcripts that explode context
+ - **NEVER use Explore agents** — this command doesn't read code
+ - **NEVER read source files directly** — agents receive context via prompt only
+ - Reasoner agents receive context through their prompt, not by reading files
+ - The debate file goes in `specs/` so `/df:spec` can reference it
+ - File name MUST be `.debate-{name}.md` (dot prefix = auxiliary file)
+ - Keep each perspective under 400 words, synthesis under 500 words
+
+ ## Example
+
+ ```
+ USER: /df:debate auth
+
+ CLAUDE: Let me summarize what we've discussed and get multiple perspectives
+ on the authentication design.
+
+ [Summarizes: ~200 words about auth requirements from conversation]
+
+ [Spawns 4 reasoner agents in parallel — User Advocate, Tech Skeptic,
+ Systems Thinker, LLM Efficiency]
+
+ [All 4 return their arguments]
+
+ [Spawns synthesizer agent with all 4 perspectives]
+
+ [Synthesizer returns consensus, tensions, open decisions, recommendation]
+
+ [Writes specs/.debate-auth.md]
+
+ ✓ Created specs/.debate-auth.md
+
+ Key tensions:
+ - OAuth complexity vs simpler API key approach
+ - User convenience (social login) vs privacy concerns
+ - Centralized auth service vs per-route middleware
+
+ Open decisions:
+ - Session storage strategy (JWT vs server-side)
+ - Token expiration policy
+
+ Next: Run /df:spec auth to formalize into a specification
+ ```
@@ -0,0 +1,182 @@
+ # /df:discover — Deep Problem Exploration
+
+ ## Orchestrator Role
+
+ You are a Socratic questioner. Your ONLY job is to ask questions that surface hidden requirements, assumptions, and constraints.
+
+ **NEVER:** Read source files, use Glob/Grep, spawn agents, create files, run git, use TaskOutput, use Task tool
+
+ **ONLY:** Ask questions using `AskUserQuestion` tool, respond conversationally
+
+ ---
+
+ ## Purpose
+ Explore a problem space deeply before formalizing into specs. Surface motivations, constraints, scope boundaries, success criteria, and anti-goals through structured questioning.
+
+ ## Usage
+ ```
+ /df:discover <name>
+ ```
+
+ ## Behavior
+
+ Work through these phases organically. You don't need to announce phases — let the conversation flow naturally. Move to the next phase when the current one feels sufficiently explored.
+
+ ### Phase 1: MOTIVATION
+ Why does this need to exist? What problem does it solve? Who suffers without it?
+
+ Example questions:
+ - What triggered the need for this?
+ - Who will use this and what's their current workaround?
+ - What happens if we don't build this?
+
+ ### Phase 2: CONTEXT
+ What already exists? What has been tried? What's the current state?
+
+ Example questions:
+ - Is there existing code or infrastructure that relates to this?
+ - Have you tried solving this before? What worked/didn't?
+ - Are there external systems or APIs involved?
+
+ ### Phase 3: SCOPE
+ What's in? What's out? What's the minimum viable version?
+
+ Example questions:
+ - What's the smallest version that would be useful?
+ - What features feel essential vs nice-to-have?
+ - Are there parts you explicitly want to exclude?
+
+ ### Phase 4: CONSTRAINTS
+ Technical limits, time pressure, resource boundaries?
+
+ Example questions:
+ - Are there performance requirements or SLAs?
+ - What technologies are non-negotiable?
+ - Is there a deadline or timeline pressure?
+
+ ### Phase 5: SUCCESS
+ How do we know it worked? What does "done" look like?
+
+ Example questions:
+ - How will you verify this works correctly?
+ - What metrics would indicate success?
+ - What would make you confident enough to ship?
+
+ ### Phase 6: ANTI-GOALS
+ What should we explicitly NOT do? What traps to avoid?
+
+ Example questions:
+ - What's the most common way this kind of feature gets over-engineered?
+ - Are there approaches you've seen fail elsewhere?
+ - What should we explicitly avoid building?
+
+ ---
+
+ ## Rules
+
+ ### Questioning Rules
+ - Use `AskUserQuestion` tool for structured questions with options
+ - Maximum **4 questions per `AskUserQuestion` call** (tool limit)
+ - Headers must be **≤12 characters**
+ - Mix structured questions (AskUserQuestion) with conversational follow-ups
+ - Ask follow-up questions based on answers — don't just march through phases mechanically
+ - Go deeper on surprising or unclear answers
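The two hard limits above (at most 4 questions per call, headers at most 12 characters) are easy to get wrong when drafting payloads. A minimal sketch of a checker; the helper name is an illustrative assumption, not part of deepflow or the tool itself:

```python
def validate_ask_user_question(payload):
    """Return a list of violations of the stated AskUserQuestion limits:
    at most 4 questions per call, headers at most 12 characters."""
    problems = []
    questions = payload.get("questions", [])
    if len(questions) > 4:
        problems.append(f"{len(questions)} questions exceeds the 4-per-call limit")
    for q in questions:
        header = q.get("header", "")
        if len(header) > 12:
            problems.append(f"header {header!r} is {len(header)} chars (max 12)")
    return problems
```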
+
+ ### Behavioral Rules
+ - **NEVER assume** — if something is ambiguous, ask
+ - **NEVER suggest ending** — the user decides when they're done
+ - **NEVER take action** — no code reading, no file creation, no agents
+ - **NEVER skip phases** — but adapt depth based on the problem
+ - Keep your responses short between questions — don't lecture
+ - Acknowledge answers briefly before asking the next question
+
+ ### When the User Wants to Move On
+ When the user signals they want to advance (e.g., "I think that's enough", "let's move on", "ready for next step"):
+
+ ```
+ Great, we've covered a lot of ground. To analyze this from multiple perspectives, run:
+
+ /df:debate {name}
+
+ Or if you want to go straight to a spec:
+
+ /df:spec {name}
+ ```
+
+ ## Example
+
+ ```
+ USER: /df:discover auth
+
+ CLAUDE: Let's explore the authentication problem space.
+
+ [Uses AskUserQuestion]
+ {
+ "questions": [
+ {
+ "question": "What's driving the need for auth right now?",
+ "header": "Motivation",
+ "multiSelect": false,
+ "options": [
+ {"label": "Security gap", "description": "Current system has no auth or weak auth"},
+ {"label": "New users", "description": "Opening up to users who need access control"},
+ {"label": "Compliance", "description": "Regulatory or policy requirement"}
+ ]
+ },
+ {
+ "question": "Who are the primary users that will authenticate?",
+ "header": "Users",
+ "multiSelect": true,
+ "options": [
+ {"label": "End users", "description": "Public-facing users of the product"},
+ {"label": "Internal team", "description": "Your team or organization members"},
+ {"label": "API clients", "description": "Other services or integrations"}
+ ]
+ }
+ ]
+ }
+
+ USER: [Selects: Security gap, End users + API clients]
+
+ CLAUDE: So there's a real security gap and you need to cover both human users
+ and machine clients. That's two different auth flows to think about.
+
+ [Uses AskUserQuestion]
+ {
+ "questions": [
+ {
+ "question": "Do end users currently have any way to identify themselves?",
+ "header": "Current",
+ "multiSelect": false,
+ "options": [
+ {"label": "Nothing", "description": "Completely open, no identification"},
+ {"label": "Basic auth", "description": "Simple username/password exists"},
+ {"label": "Third-party", "description": "Using external auth provider already"}
+ ]
+ },
+ {
+ "question": "For API clients, what authentication exists today?",
+ "header": "API auth",
+ "multiSelect": false,
+ "options": [
+ {"label": "None", "description": "APIs are open"},
+ {"label": "API keys", "description": "Static keys in headers"},
+ {"label": "OAuth/JWT", "description": "Token-based auth already in place"}
+ ]
+ }
+ ]
+ }
+
+ [... continues through phases based on answers ...]
+
+ USER: I think we've covered everything, let's move on.
+
+ CLAUDE: Great, we've covered a lot of ground. To analyze this from multiple
+ perspectives, run:
+
+ /df:debate auth
+
+ Or if you want to go straight to a spec:
+
+ /df:spec auth
+ ```
@@ -137,8 +137,10 @@ experiment_file: ".deepflow/experiments/upload--streaming--failed.md"
  }
  ```
 
+ Note: `completed_tasks` is kept for backward compatibility but is now derivable from PLAN.md `[x]` entries. The native task system (TaskList) is the primary source for runtime task status.
+
  **On checkpoint:** Complete wave → update PLAN.md → save to worktree → exit.
- **Resume:** `--continue` loads checkpoint, verifies worktree, skips completed tasks.
+ **Resume:** `--continue` loads checkpoint, verifies worktree, skips completed tasks. Native tasks are re-registered for remaining `[ ]` items only.
 
  ## Behavior
 
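The "derivable from PLAN.md `[x]` entries" claim above is a one-liner in practice, assuming the task-line format quoted later in this document (`- [ ] **T1**: description`); the helper name is illustrative:

```python
import re

def completed_task_ids(plan_md):
    """Task IDs whose PLAN.md checkbox is [x] — the derivable form of
    the checkpoint's completed_tasks list."""
    return re.findall(r"-\s*\[x\]\s*\*\*(T\d+)\*\*", plan_md)
```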
@@ -188,6 +190,30 @@ Load: PLAN.md (required), specs/doing-*.md, .deepflow/config.yaml
  If missing: "No PLAN.md found. Run /df:plan first."
  ```
 
+ ### 2.5. REGISTER NATIVE TASKS
+
+ Parse PLAN.md and create native tasks for tracking, dependency management, and UI spinners.
+
+ **For each uncompleted task (`[ ]`) in PLAN.md:**
+
+ ```
+ 1. TaskCreate:
+ - subject: "{task_id}: {description}" (e.g. "T1: Create upload endpoint")
+ - description: Full task block from PLAN.md (files, blocked by, type, etc.)
+ - activeForm: "{gerund form of description}" (e.g. "Creating upload endpoint")
+
+ 2. Store mapping: PLAN.md task_id (T1) → native task ID
+ ```
+
+ **After all tasks created, set up dependencies:**
+
+ ```
+ For each task with "Blocked by: T{n}, T{m}":
+ TaskUpdate(taskId: native_id, addBlockedBy: [native_id_of_Tn, native_id_of_Tm])
+ ```
+
+ **On `--continue`:** Only create tasks for remaining `[ ]` items (skip `[x]` completed).
+
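The parsing half of the registration step above can be sketched in Python, assuming the PLAN.md line formats quoted in this document (`- [ ] **T1**: description` with an indented `Blocked by: T1, T2` detail line); the function and regex names are illustrative:

```python
import re

TASK_RE = re.compile(r"-\s*\[( |x)\]\s*\*\*(T\d+)\*\*:\s*(.+)")
BLOCKED_RE = re.compile(r"Blocked by:\s*(.+)")

def parse_plan(plan_md):
    """Map task id -> {description, done, blocked_by} from PLAN.md text.
    Detail lines are attributed to the most recent task line."""
    tasks, current = {}, None
    for raw in plan_md.splitlines():
        line = raw.strip()
        m = TASK_RE.match(line)
        if m:
            current = m.group(2)
            tasks[current] = {
                "description": m.group(3),
                "done": m.group(1) == "x",
                "blocked_by": [],
            }
            continue
        b = BLOCKED_RE.match(line)
        if b and current:
            tasks[current]["blocked_by"] = [t.strip() for t in b.group(1).split(",")]
    return tasks
```

From this mapping, the orchestrator would call TaskCreate for each not-done entry and TaskUpdate with addBlockedBy for each non-empty `blocked_by` list.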
  ### 3. CHECK FOR UNPLANNED SPECS
 
  Warn if `specs/*.md` (excluding doing-/done-) exist. Non-blocking.
@@ -244,12 +270,30 @@ Topic extraction:
 
  ### 5. IDENTIFY READY TASKS
 
- Ready = `[ ]` + all `blocked_by` complete + experiment validated (if applicable) + not in checkpoint.
+ Use TaskList to find ready tasks (replaces manual PLAN.md parsing):
+
+ ```
+ Ready = TaskList results where:
+ - status: "pending"
+ - blockedBy: empty (auto-unblocked by native dependency system)
+ ```
+
+ **Cross-check with experiment validation** (for spike-blocked tasks):
+ - If task depends on spike AND experiment not `--passed.md` → still blocked
+ - TaskUpdate to add spike as blocker if not already set
+
+ Ready = TaskList pending + empty blockedBy + experiment validated (if applicable).
 
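The readiness rule above reduces to "pending, with every blocker already done". A minimal sketch, assuming an id -> {done, blocked_by} mapping as input (an assumption of this sketch, not deepflow's own data structure; the native TaskList query does this server-side):

```python
def ready_tasks(tasks):
    """Tasks that are not done and whose blockers are all done — the manual
    equivalent of a TaskList query for pending tasks with empty blockedBy."""
    done = {tid for tid, t in tasks.items() if t["done"]}
    return sorted(
        tid for tid, t in tasks.items()
        if not t["done"] and all(b in done for b in t["blocked_by"])
    )
```

Each wave spawns the current ready set; marking those tasks done and recomputing yields the next wave.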
  ### 6. SPAWN AGENTS
 
  Context ≥50%: checkpoint and exit.
 
+ **Before spawning each agent**, mark its native task as in_progress:
+ ```
+ TaskUpdate(taskId: native_id, status: "in_progress")
+ ```
+ This activates the UI spinner showing the task's activeForm (e.g. "Creating upload endpoint").
+
  **CRITICAL: Spawn ALL ready tasks in a SINGLE response with MULTIPLE Task tool calls.**
 
  DO NOT spawn one task, wait, then spawn another. Instead, call Task tool multiple times in the SAME message block. This enables true parallelism.
@@ -319,8 +363,15 @@ Then rename experiment:
 
  **Gate:**
  ```
- VERIFIED_PASS → Unblock, log "✓ Spike {task_id} verified"
- VERIFIED_FAIL → Block, log "✗ Spike {task_id} failed verification"
+ VERIFIED_PASS →
+ TaskUpdate(taskId: spike_native_id, status: "completed")
+ # Native system auto-unblocks dependent tasks
+ Log "✓ Spike {task_id} verified"
+
+ VERIFIED_FAIL →
+ # Spike task stays as pending, dependents remain blocked
+ # No TaskUpdate needed — native system keeps them blocked
+ Log "✗ Spike {task_id} failed verification"
  If override: log "⚠ Agent incorrectly marked as passed"
  ```
 
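The gate above is a two-way status decision. A tiny sketch for reference; the function name and return shape are illustrative assumptions, not deepflow code:

```python
def spike_gate(verified_pass, task_id):
    """Status transition for a spike after orchestrator-side verification."""
    if verified_pass:
        # Completion lets the native dependency system auto-unblock dependents.
        return {"status": "completed", "log": f"✓ Spike {task_id} verified"}
    # Leave the spike pending; dependents stay blocked, no TaskUpdate call needed.
    return {"status": "pending", "log": f"✗ Spike {task_id} failed verification"}
```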
@@ -390,6 +441,12 @@ Rules:
 
  When a task fails and cannot be auto-fixed:
 
+ **Native task update:**
+ ```
+ TaskUpdate(taskId: native_id, status: "pending") # Reset to pending, not deleted
+ ```
+ This keeps the task visible for retry. Dependent tasks remain blocked.
+
  **Behavior:**
  1. Leave worktree intact at `{worktree_path}`
  2. Keep checkpoint.json for potential resume
@@ -434,9 +491,11 @@ After spawning wave agents, your turn ENDS. Completion notifications drive the l
 
  **Per notification:**
  1. Read result file for the completed agent
- 2. Report ONE line: "✓ Tx: status (commit)"
- 3. If NOT all wave agents done → end turn, wait
- 4. If ALL wave agents done → check context, update PLAN.md, spawn next wave or finish
+ 2. TaskUpdate(taskId: native_id, status: "completed") → auto-unblocks dependent tasks
+ 3. Update PLAN.md: `[ ]` → `[x]` + commit hash (as before)
+ 4. Report ONE line: "✓ Tx: status (commit)"
+ 5. If NOT all wave agents done → end turn, wait
+ 6. If ALL wave agents done → use TaskList to find newly unblocked tasks, check context, spawn next wave or finish
 
  **Between waves:** Check context %. If ≥50%, checkpoint and exit.
 
@@ -456,18 +515,41 @@ After spawning wave agents, your turn ENDS. Completion notifications drive the l
 
  ```
  /df:execute (context: 12%)
- Spawning Wave 1: T1, T2, T3 parallel...
+
+ Loading PLAN.md...
+ T1: Create upload endpoint (ready)
+ T2: Add S3 service (blocked by T1)
+ T3: Add auth guard (blocked by T1)
+
+ Registering native tasks...
+ TaskCreate → T1 (native: task-001)
+ TaskCreate → T2 (native: task-002)
+ TaskCreate → T3 (native: task-003)
+ TaskUpdate(task-002, addBlockedBy: [task-001])
+ TaskUpdate(task-003, addBlockedBy: [task-001])
+
+ Spawning Wave 1: T1
+ TaskUpdate(task-001, status: "in_progress") ← spinner: "Creating upload endpoint"
 
  [Agent "T1" completed]
- T1: success (abc1234)
+ TaskUpdate(task-001, status: "completed") ← auto-unblocks task-002, task-003
+ ✓ T1: success (abc1234)
+
+ TaskList → task-002, task-003 now ready (blockedBy empty)
+
+ Spawning Wave 2: T2, T3 parallel
+ TaskUpdate(task-002, status: "in_progress")
+ TaskUpdate(task-003, status: "in_progress")
 
  [Agent "T2" completed]
- T2: success (def5678)
+ TaskUpdate(task-002, status: "completed")
+ ✓ T2: success (def5678)
 
  [Agent "T3" completed]
- T3: success (ghi9012)
+ TaskUpdate(task-003, status: "completed")
+ ✓ T3: success (ghi9012)
 
- Wave 1 complete (3/3). Context: 35%
+ Wave 2 complete (2/2). Context: 35%
 
  ✓ doing-upload → done-upload
  ✓ Complete: 3/3 tasks
@@ -480,27 +562,43 @@ Next: Run /df:verify to verify specs and merge to main
  ```
  /df:execute (context: 10%)
 
+ Loading PLAN.md...
+ Registering native tasks...
+ TaskCreate → T1 [SPIKE] (native: task-001)
+ TaskCreate → T2 (native: task-002)
+ TaskCreate → T3 (native: task-003)
+ TaskUpdate(task-002, addBlockedBy: [task-001])
+ TaskUpdate(task-003, addBlockedBy: [task-001])
+
  Checking experiment status...
  T1 [SPIKE]: No experiment yet, spike executable
  T2: Blocked by T1 (spike not validated)
  T3: Blocked by T1 (spike not validated)
 
- Spawning Wave 1: T1 [SPIKE]...
+ Spawning Wave 1: T1 [SPIKE]
+ TaskUpdate(task-001, status: "in_progress")
 
  [Agent "T1 SPIKE" completed]
  ✓ T1: complete, verifying...
 
  Verifying T1...
  ✓ Spike T1 verified (throughput 8500 >= 7000)
+ TaskUpdate(task-001, status: "completed") ← auto-unblocks task-002, task-003
  → upload--streaming--passed.md
 
- Spawning Wave 2: T2, T3 parallel...
+ TaskList → task-002, task-003 now ready
+
+ Spawning Wave 2: T2, T3 parallel
+ TaskUpdate(task-002, status: "in_progress")
+ TaskUpdate(task-003, status: "in_progress")
 
  [Agent "T2" completed]
- T2: success (def5678)
+ TaskUpdate(task-002, status: "completed")
+ ✓ T2: success (def5678)
 
  [Agent "T3" completed]
- T3: success (ghi9012)
+ TaskUpdate(task-003, status: "completed")
+ ✓ T3: success (ghi9012)
 
  Wave 2 complete (2/2). Context: 40%
 
@@ -515,11 +613,16 @@ Next: Run /df:verify to verify specs and merge to main
  ```
  /df:execute (context: 10%)
 
+ Registering native tasks...
+ TaskCreate → T1 [SPIKE], T2, T3 (with dependencies)
+
  Wave 1: T1 [SPIKE] (context: 15%)
+ TaskUpdate(task-001, status: "in_progress")
  T1: complete, verifying...
 
  Verifying T1...
  ✗ Spike T1 failed verification (throughput 1500 < 7000)
+ # Spike stays pending — dependents remain blocked
  → upload--streaming--failed.md
 
  ⚠ Spike T1 invalidated hypothesis
@@ -533,12 +636,17 @@ Next: Run /df:plan to generate new hypothesis spike
  ```
  /df:execute (context: 10%)
 
+ Registering native tasks...
+ TaskCreate → T1 [SPIKE], T2, T3 (with dependencies)
+
  Wave 1: T1 [SPIKE] (context: 15%)
+ TaskUpdate(task-001, status: "in_progress")
  T1: complete (agent said: success), verifying...
 
  Verifying T1...
  ✗ Spike T1 failed verification (throughput 1500 < 7000)
  ⚠ Agent incorrectly marked as passed — overriding to FAILED
+ TaskUpdate(task-001, status: "pending") ← reset, dependents stay blocked
  → upload--streaming--failed.md
 
  ⚠ Spike T1 invalidated hypothesis
@@ -31,6 +31,8 @@ Transform conversation context into a structured specification file.
 
  ### 1. GATHER CODEBASE CONTEXT
 
+ **Check for debate file first:** If `specs/.debate-{name}.md` exists, read it using the Read tool. Pass its content (especially the Synthesis section) to the reasoner agent in step 3 as additional context. The debate file contains multi-perspective analysis that should inform requirements and constraints.
+
  **NEVER use `run_in_background` for Explore agents** — causes late "Agent completed" notifications that pollute output after work is done.
 
  **NEVER use TaskOutput** — returns full agent transcripts (100KB+) that explode context.
@@ -51,13 +51,20 @@ Report per spec: requirements count, acceptance count, quality issues.
 
  **If all pass:** Proceed to Post-Verification merge.
 
- **If issues found:** Add fix tasks to PLAN.md in the worktree and loop back to execute:
+ **If issues found:** Add fix tasks to PLAN.md in the worktree and register them as native tasks, then loop back to execute:
 
  1. Discover worktree (same logic as Post-Verification step 1)
  2. Write new fix tasks to `{worktree_path}/PLAN.md` under the existing spec section
  - Task IDs continue from last (e.g. if T9 was last, fixes start at T10)
  - Format: `- [ ] **T10**: Fix {description}` with `Files:` and details
- 3. Output report + next step:
+ 3. Register fix tasks as native tasks for immediate tracking:
+ ```
+ For each fix task added:
+ TaskCreate(subject: "T10: Fix {description}", description: "...", activeForm: "Fixing {description}")
+ TaskUpdate(addBlockedBy: [...]) if dependencies exist
+ ```
+ This allows `/df:execute --continue` to find fix tasks via TaskList immediately.
+ 4. Output report + next step:
 
  ```
  done-upload.md: 4/4 reqs ✓, 3/5 acceptance ✗, 1 quality issue