npm - feed-the-machine - Versions diffs - 1.6.1 → 1.7.1 - Mend

feed-the-machine 1.6.1 → 1.7.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (272) hide show

package/LICENSE +21 -21
package/README.md +262 -170
package/bin/__pycache__/tasks_db.cpython-314.pyc +0 -0
package/bin/brain.py +1340 -0
package/bin/convert_claude_skills_to_codex.py +490 -0
package/bin/generate-manifest.mjs +463 -463
package/bin/harden_codex_skills.py +141 -0
package/bin/install.mjs +491 -491
package/bin/migrate-eng-buddy-data.py +875 -0
package/bin/playbook_engine/__init__.py +1 -0
package/bin/playbook_engine/conftest.py +8 -0
package/bin/playbook_engine/extractor.py +33 -0
package/bin/playbook_engine/manager.py +102 -0
package/bin/playbook_engine/models.py +84 -0
package/bin/playbook_engine/registry.py +35 -0
package/bin/playbook_engine/test_extractor.py +72 -0
package/bin/playbook_engine/test_integration.py +129 -0
package/bin/playbook_engine/test_manager.py +85 -0
package/bin/playbook_engine/test_models.py +166 -0
package/bin/playbook_engine/test_registry.py +67 -0
package/bin/playbook_engine/test_tracer.py +86 -0
package/bin/playbook_engine/tracer.py +93 -0
package/bin/tasks_db.py +456 -0
package/docs/HOOKS.md +243 -243
package/docs/INBOX.md +233 -233
package/ftm/SKILL.md +125 -122
package/ftm-audit/SKILL.md +673 -623
package/ftm-audit/references/protocols/PROJECT-PATTERNS.md +91 -91
package/ftm-audit/references/protocols/RUNTIME-WIRING.md +66 -66
package/ftm-audit/references/protocols/WIRING-CONTRACTS.md +135 -135
package/ftm-audit/references/strategies/AUTO-FIX-STRATEGIES.md +69 -69
package/ftm-audit/references/templates/REPORT-FORMAT.md +96 -96
package/ftm-audit/scripts/run-knip.sh +23 -23
package/ftm-audit.yml +2 -2
package/ftm-brainstorm/SKILL.md +1003 -498
package/ftm-brainstorm/evals/evals.json +180 -100
package/ftm-brainstorm/evals/promptfoo.yaml +109 -109
package/ftm-brainstorm/references/agent-prompts.md +552 -224
package/ftm-brainstorm/references/plan-template.md +209 -121
package/ftm-brainstorm.yml +2 -2
package/ftm-browse/SKILL.md +454 -454
package/ftm-browse/daemon/browser-manager.ts +206 -206
package/ftm-browse/daemon/bun.lock +30 -30
package/ftm-browse/daemon/cli.ts +347 -347
package/ftm-browse/daemon/commands.ts +410 -410
package/ftm-browse/daemon/main.ts +357 -357
package/ftm-browse/daemon/package.json +17 -17
package/ftm-browse/daemon/server.ts +189 -189
package/ftm-browse/daemon/snapshot.ts +519 -519
package/ftm-browse/daemon/tsconfig.json +22 -22
package/ftm-browse.yml +4 -4
package/ftm-capture/SKILL.md +370 -370
package/ftm-capture.yml +4 -4
package/ftm-codex-gate/SKILL.md +361 -361
package/ftm-codex-gate.yml +2 -2
package/ftm-config/SKILL.md +422 -345
package/ftm-config.default.yml +125 -82
package/ftm-config.yml +44 -2
package/ftm-council/SKILL.md +416 -416
package/ftm-council/references/prompts/CLAUDE-INVESTIGATION.md +60 -60
package/ftm-council/references/prompts/CODEX-INVESTIGATION.md +58 -58
package/ftm-council/references/prompts/GEMINI-INVESTIGATION.md +58 -58
package/ftm-council/references/prompts/REBUTTAL-TEMPLATE.md +57 -57
package/ftm-council/references/protocols/PREREQUISITES.md +47 -47
package/ftm-council/references/protocols/STEP-0-FRAMING.md +46 -46
package/ftm-council-chat.yml +2 -0
package/ftm-council.yml +2 -2
package/ftm-dashboard/SKILL.md +163 -163
package/ftm-dashboard.yml +4 -4
package/ftm-debug/SKILL.md +1037 -1037
package/ftm-debug/references/phases/PHASE-0-INTAKE.md +58 -58
package/ftm-debug/references/phases/PHASE-1-TRIAGE.md +46 -46
package/ftm-debug/references/phases/PHASE-2-WAR-ROOM-AGENTS.md +279 -279
package/ftm-debug/references/phases/PHASE-3-TO-6-EXECUTION.md +436 -436
package/ftm-debug/references/protocols/BLACKBOARD.md +86 -86
package/ftm-debug/references/protocols/EDGE-CASES.md +103 -103
package/ftm-debug.yml +2 -2
package/ftm-diagram/SKILL.md +277 -277
package/ftm-diagram.yml +2 -2
package/ftm-executor/SKILL.md +777 -777
package/ftm-executor/references/STYLE-TEMPLATE.md +73 -73
package/ftm-executor/references/phases/PHASE-0-VERIFICATION.md +62 -62
package/ftm-executor/references/phases/PHASE-2-AGENT-ASSEMBLY.md +34 -34
package/ftm-executor/references/phases/PHASE-3-WORKTREES.md +38 -38
package/ftm-executor/references/phases/PHASE-4-5-AUDIT.md +81 -72
package/ftm-executor/references/phases/PHASE-4-DISPATCH.md +66 -66
package/ftm-executor/references/phases/PHASE-5-5-CODEX-GATE.md +73 -73
package/ftm-executor/references/protocols/DOCUMENTATION-BOOTSTRAP.md +36 -36
package/ftm-executor/references/protocols/MODEL-PROFILE.md +59 -59
package/ftm-executor/references/protocols/PROGRESS-TRACKING.md +66 -66
package/ftm-executor/runtime/ftm-runtime.mjs +252 -252
package/ftm-executor/runtime/package.json +8 -8
package/ftm-executor.yml +2 -2
package/ftm-git/SKILL.md +441 -441
package/ftm-git/evals/evals.json +26 -26
package/ftm-git/evals/promptfoo.yaml +75 -75
package/ftm-git/hooks/post-commit-experience.sh +92 -92
package/ftm-git/references/patterns/SECRET-PATTERNS.md +104 -104
package/ftm-git/references/protocols/REMEDIATION.md +139 -139
package/ftm-git/scripts/pre-commit-secrets.sh +110 -110
package/ftm-git.yml +2 -2
package/ftm-inbox/backend/__pycache__/main.cpython-314.pyc +0 -0
package/ftm-inbox/backend/adapters/_retry.py +64 -64
package/ftm-inbox/backend/adapters/base.py +230 -230
package/ftm-inbox/backend/adapters/freshservice.py +104 -104
package/ftm-inbox/backend/adapters/gmail.py +125 -125
package/ftm-inbox/backend/adapters/jira.py +136 -136
package/ftm-inbox/backend/adapters/registry.py +192 -192
package/ftm-inbox/backend/adapters/slack.py +110 -110
package/ftm-inbox/backend/db/connection.py +54 -54
package/ftm-inbox/backend/db/schema.py +78 -78
package/ftm-inbox/backend/executor/__init__.py +7 -7
package/ftm-inbox/backend/executor/engine.py +149 -149
package/ftm-inbox/backend/executor/step_runner.py +98 -98
package/ftm-inbox/backend/main.py +103 -103
package/ftm-inbox/backend/models/__init__.py +1 -1
package/ftm-inbox/backend/models/unified_task.py +36 -36
package/ftm-inbox/backend/planner/__init__.py +6 -6
package/ftm-inbox/backend/planner/__pycache__/__init__.cpython-314.pyc +0 -0
package/ftm-inbox/backend/planner/__pycache__/generator.cpython-314.pyc +0 -0
package/ftm-inbox/backend/planner/__pycache__/schema.cpython-314.pyc +0 -0
package/ftm-inbox/backend/planner/generator.py +127 -127
package/ftm-inbox/backend/planner/schema.py +34 -34
package/ftm-inbox/backend/requirements.txt +5 -5
package/ftm-inbox/backend/routes/__pycache__/plan.cpython-314.pyc +0 -0
package/ftm-inbox/backend/routes/execute.py +186 -186
package/ftm-inbox/backend/routes/health.py +52 -52
package/ftm-inbox/backend/routes/inbox.py +68 -68
package/ftm-inbox/backend/routes/plan.py +271 -271
package/ftm-inbox/bin/launchagent.mjs +91 -91
package/ftm-inbox/bin/setup.mjs +188 -188
package/ftm-inbox/bin/start.sh +10 -10
package/ftm-inbox/bin/status.sh +17 -17
package/ftm-inbox/bin/stop.sh +8 -8
package/ftm-inbox/config.example.yml +55 -55
package/ftm-inbox/package-lock.json +2898 -2898
package/ftm-inbox/package.json +26 -26
package/ftm-inbox/postcss.config.js +6 -6
package/ftm-inbox/src/app.css +199 -199
package/ftm-inbox/src/app.html +18 -18
package/ftm-inbox/src/lib/api.ts +166 -166
package/ftm-inbox/src/lib/components/ExecutionLog.svelte +81 -81
package/ftm-inbox/src/lib/components/InboxFeed.svelte +143 -143
package/ftm-inbox/src/lib/components/PlanStep.svelte +271 -271
package/ftm-inbox/src/lib/components/PlanView.svelte +206 -206
package/ftm-inbox/src/lib/components/StreamPanel.svelte +99 -99
package/ftm-inbox/src/lib/components/TaskCard.svelte +190 -190
package/ftm-inbox/src/lib/components/ui/EmptyState.svelte +63 -63
package/ftm-inbox/src/lib/components/ui/KawaiiCard.svelte +86 -86
package/ftm-inbox/src/lib/components/ui/PillButton.svelte +106 -106
package/ftm-inbox/src/lib/components/ui/StatusBadge.svelte +67 -67
package/ftm-inbox/src/lib/components/ui/StreamDrawer.svelte +149 -149
package/ftm-inbox/src/lib/components/ui/ThemeToggle.svelte +80 -80
package/ftm-inbox/src/lib/theme.ts +47 -47
package/ftm-inbox/src/routes/+layout.svelte +76 -76
package/ftm-inbox/src/routes/+page.svelte +401 -401
package/ftm-inbox/svelte.config.js +12 -12
package/ftm-inbox/tailwind.config.ts +63 -63
package/ftm-inbox/tsconfig.json +13 -13
package/ftm-inbox/vite.config.ts +6 -6
package/ftm-intent/SKILL.md +241 -241
package/ftm-intent.yml +2 -2
package/ftm-manifest.json +3794 -3794
package/ftm-map/SKILL.md +291 -291
package/ftm-map/scripts/db.py +712 -712
package/ftm-map/scripts/index.py +415 -415
package/ftm-map/scripts/parser.py +224 -224
package/ftm-map/scripts/queries/go-tags.scm +20 -20
package/ftm-map/scripts/queries/javascript-tags.scm +35 -35
package/ftm-map/scripts/queries/python-tags.scm +31 -31
package/ftm-map/scripts/queries/ruby-tags.scm +19 -19
package/ftm-map/scripts/queries/rust-tags.scm +37 -37
package/ftm-map/scripts/queries/typescript-tags.scm +41 -41
package/ftm-map/scripts/query.py +301 -301
package/ftm-map/scripts/ranker.py +377 -377
package/ftm-map/scripts/requirements.txt +5 -5
package/ftm-map/scripts/setup-hooks.sh +27 -27
package/ftm-map/scripts/setup.sh +56 -56
package/ftm-map/scripts/test_db.py +364 -364
package/ftm-map/scripts/test_parser.py +174 -174
package/ftm-map/scripts/test_query.py +183 -183
package/ftm-map/scripts/test_ranker.py +199 -199
package/ftm-map/scripts/views.py +591 -591
package/ftm-map.yml +2 -2
package/ftm-mind/SKILL.md +201 -1943
package/ftm-mind/evals/promptfoo.yaml +142 -142
package/ftm-mind/references/blackboard-protocol.md +110 -0
package/ftm-mind/references/blackboard-schema.md +328 -328
package/ftm-mind/references/complexity-guide.md +110 -110
package/ftm-mind/references/complexity-sizing.md +138 -0
package/ftm-mind/references/decide-act-protocol.md +172 -0
package/ftm-mind/references/direct-execution.md +51 -0
package/ftm-mind/references/environment-discovery.md +77 -0
package/ftm-mind/references/event-registry.md +319 -319
package/ftm-mind/references/mcp-inventory.md +300 -296
package/ftm-mind/references/ops-routing.md +47 -0
package/ftm-mind/references/orient-protocol.md +234 -0
package/ftm-mind/references/personality.md +40 -0
package/ftm-mind/references/protocols/COMPLEXITY-SIZING.md +72 -72
package/ftm-mind/references/protocols/MCP-HEURISTICS.md +32 -32
package/ftm-mind/references/protocols/PLAN-APPROVAL.md +80 -80
package/ftm-mind/references/reflexion-protocol.md +249 -249
package/ftm-mind/references/routing/SCENARIOS.md +22 -22
package/ftm-mind/references/routing-scenarios.md +35 -35
package/ftm-mind.yml +2 -2
package/ftm-ops.yml +4 -0
package/ftm-pause/SKILL.md +395 -395
package/ftm-pause/references/protocols/SKILL-RESTORE-PROTOCOLS.md +186 -186
package/ftm-pause/references/protocols/VALIDATION.md +80 -80
package/ftm-pause.yml +2 -2
package/ftm-researcher/SKILL.md +275 -275
package/ftm-researcher/evals/agent-diversity.yaml +17 -17
package/ftm-researcher/evals/synthesis-quality.yaml +12 -12
package/ftm-researcher/evals/trigger-accuracy.yaml +39 -39
package/ftm-researcher/references/adaptive-search.md +116 -116
package/ftm-researcher/references/agent-prompts.md +193 -193
package/ftm-researcher/references/council-integration.md +193 -193
package/ftm-researcher/references/output-format.md +203 -203
package/ftm-researcher/references/synthesis-pipeline.md +165 -165
package/ftm-researcher/scripts/score_credibility.py +234 -234
package/ftm-researcher/scripts/validate_research.py +92 -92
package/ftm-researcher.yml +2 -2
package/ftm-resume/SKILL.md +518 -518
package/ftm-resume/references/protocols/VALIDATION.md +172 -172
package/ftm-resume.yml +2 -2
package/ftm-retro/SKILL.md +380 -380
package/ftm-retro/references/protocols/SCORING-RUBRICS.md +89 -89
package/ftm-retro/references/templates/REPORT-FORMAT.md +109 -109
package/ftm-retro.yml +2 -2
package/ftm-routine/SKILL.md +170 -170
package/ftm-routine.yml +4 -4
package/ftm-state/blackboard/capabilities.json +5 -5
package/ftm-state/blackboard/capabilities.schema.json +27 -27
package/ftm-state/blackboard/context.json +37 -23
package/ftm-state/blackboard/experiences/doom-statusline-fix.json +26 -0
package/ftm-state/blackboard/experiences/hackathon-pages-site.json +26 -0
package/ftm-state/blackboard/experiences/hindsight-sso-kickoff.json +42 -0
package/ftm-state/blackboard/experiences/index.json +58 -9
package/ftm-state/blackboard/experiences/learning-ragnarok-api-access.json +23 -0
package/ftm-state/blackboard/experiences/nordlayer-members-auto-assign.json +26 -0
package/ftm-state/blackboard/experiences/saml2aws-stale-session-fix.json +41 -0
package/ftm-state/blackboard/patterns.json +6 -6
package/ftm-state/schemas/context.schema.json +130 -130
package/ftm-state/schemas/experience-index.schema.json +77 -77
package/ftm-state/schemas/experience.schema.json +78 -78
package/ftm-state/schemas/patterns.schema.json +44 -44
package/ftm-upgrade/SKILL.md +194 -194
package/ftm-upgrade/scripts/check-version.sh +76 -76
package/ftm-upgrade/scripts/upgrade.sh +143 -143
package/ftm-upgrade.yml +2 -2
package/ftm-verify.yml +2 -2
package/ftm.yml +2 -2
package/hooks/ftm-auto-log.sh +137 -0
package/hooks/ftm-blackboard-enforcer.sh +93 -93
package/hooks/ftm-discovery-reminder.sh +90 -90
package/hooks/ftm-drafts-gate.sh +61 -61
package/hooks/ftm-event-logger.mjs +107 -107
package/hooks/ftm-install-hooks.sh +240 -0
package/hooks/ftm-learning-capture.sh +117 -0
package/hooks/ftm-map-autodetect.sh +79 -79
package/hooks/ftm-pending-sync-check.sh +22 -22
package/hooks/ftm-plan-gate.sh +92 -92
package/hooks/ftm-post-commit-trigger.sh +57 -57
package/hooks/ftm-post-compaction.sh +138 -0
package/hooks/ftm-pre-compaction.sh +147 -0
package/hooks/ftm-session-end.sh +52 -0
package/hooks/ftm-session-snapshot.sh +213 -0
package/hooks/ftm-task-loader.sh +100 -0
package/hooks/settings-template.json +91 -81
package/install.sh +363 -363
package/package.json +84 -84
package/uninstall.sh +25 -25

package/ftm-debug/references/phases/PHASE-2-WAR-ROOM-AGENTS.md CHANGED Viewed

@@ -1,279 +1,279 @@
-# Phase 2: War Room Agent Profiles & Prompts
-All four investigation agents run simultaneously. Each receives the problem statement and codebase context from Phase 0.
----
-## Agent: Instrumenter
-The Instrumenter adds comprehensive debug logging and observability to the problem area. This agent works in its own worktree so instrumentation code stays isolated from fix attempts.
-```
-You are the Instrumenter in a debug war room. Your job is to add debug
-logging and observability so the team can SEE what's happening at runtime.
-Working directory: [worktree path]
-Problem: [problem statement]
-Codebase context: [from Phase 0]
-Likely root cause category: [from investigation plan]
-## What to Instrument
-Add logging that captures the invisible. Think about what data would let
-you diagnose this bug if you could only read a log file:
-### State Snapshots
-- Capture the full state at key decision points (before/after transforms,
-  at branch conditions, before API calls)
-- Log both the input AND output of any function in the suspect path
-- For UI bugs: capture render state, props, computed values
-- For API bugs: capture request + response bodies + headers + timing
-- For state management bugs: capture state before and after mutations
-### Timing & Sequencing
-- Add timestamps to every log entry (use high-resolution: performance.now()
-  or process.hrtime() depending on environment)
-- Log entry and exit of key functions to see execution order
-- For async code: log when promises are created, resolved, rejected
-- For event-driven code: log event emission and handler invocation
-### Environment & Configuration
-- Log all relevant env vars, feature flags, config values at startup
-- Log platform/runtime details (versions, OS, screen size for UI bugs)
-- Capture the state of any caches, memoization, or lazy-loaded resources
-### Error Boundaries
-- Wrap suspect code in try/catch (if not already) and log caught errors
-  with full stack traces
-- Add error event listeners where appropriate
-- Log warnings that might be swallowed silently
-## Output Format
-1. Make all changes in the worktree and commit them
-2. Write a file called `DEBUG-INSTRUMENTATION.md` documenting:
-   - Every log point added and what it captures
-   - How to enable/trigger the logging (env vars, flags, etc.)
-   - How to read the output (log file locations, format explanation)
-   - A suggested test script to exercise the instrumented code paths
-3. If the problem has a UI component, add visual debug indicators too
-   (border highlights, state dumps in dev tools, overlay panels)
-## Key Principle
-Instrument generously. It's cheap to add logging and expensive to guess.
-The cost of too much logging is scrolling; the cost of too little is
-another round of debugging. When in doubt, log it.
-```
----
-## Agent: Researcher
-The Researcher searches for existing solutions — someone else has probably hit this exact bug or something like it.
-```
-You are the Researcher in a debug war room. Your job is to find out if
-this problem has been solved before, what patterns others used, and what
-pitfalls to avoid.
-Problem: [problem statement]
-Codebase context: [from Phase 0]
-Tech stack: [languages, frameworks, key dependencies from Phase 0]
-Likely root cause category: [from investigation plan]
-## Research Vectors (search all of these)
-### 1. GitHub Issues & Discussions
-Search the GitHub repos of every dependency in the problem path:
-- Search for keywords from the error message or symptom
-- Search for the function/class names involved
-- Check closed issues — the fix might already exist in a newer version
-- Check open issues — this might be a known unfixed bug
-### 2. Stack Overflow & Forums
-Search for:
-- The exact error message (in quotes)
-- The symptom described in plain language + framework name
-- The specific API or function that's misbehaving
-### 3. Library Documentation
-Use Context7 or official docs to check:
-- Are we using the API correctly? Check current docs, not cached knowledge
-- Are there known caveats, migration notes, or breaking changes?
-- Is there a recommended pattern we're not following?
-### 4. Blog Posts & Technical Articles
-Search for:
-- "[framework] + [symptom]" — e.g., "React useEffect infinite loop"
-- "[library] + [error category]" — e.g., "webpack ESM require crash"
-- "[pattern] + debugging" — e.g., "WebSocket reconnection race condition"
-### 5. Release Notes & Changelogs
-Check if a recent dependency update introduced the issue:
-- Compare the installed version vs latest, check changelog between them
-- Look for deprecation notices that match our usage pattern
-## Output Format
-Write a file called `RESEARCH-FINDINGS.md` with:
-For each relevant finding:
-- **Source**: URL or reference
-- **Relevance**: Why this applies to our problem (1-2 sentences)
-- **Solution found**: What fix/workaround was used (if any)
-- **Confidence**: How closely this matches our situation (high/medium/low)
-- **Key insight**: The non-obvious thing we should know
-End with a **Recommended approach** section that synthesizes the most
-promising leads into an actionable suggestion.
-## Key Principle
-Cast a wide net, then filter ruthlessly. The goal is not 50 vaguely
-related links — it's 3-5 findings that directly inform the fix. Quality
-of relevance over quantity of results.
-```
----
-## Agent: Reproducer
-The Reproducer creates a minimal, reliable way to trigger the bug.
-```
-You are the Reproducer in a debug war room. Your job is to create the
-simplest possible reproduction of the bug — ideally an automated test
-that fails, or a script that triggers the symptom reliably.
-Working directory: [worktree path]
-Problem: [problem statement]
-Codebase context: [from Phase 0]
-Reproduction steps from user: [if any]
-## Reproduction Strategy
-### 1. Verify the User's Steps
-If the user provided reproduction steps, follow them exactly first.
-Document whether the bug appears consistently or intermittently.
-### 2. Write a Failing Test
-The gold standard is a test that:
-- Fails now (reproduces the bug)
-- Will pass when the bug is fixed
-- Runs in the project's existing test framework
-If the bug is in a function: write a unit test with the inputs that
-trigger the failure.
-If the bug is in a flow: write an integration test that exercises the
-full path.
-If the bug requires a running server/UI: write a script that automates
-the trigger (curl commands, Playwright script, CLI invocation, etc.)
-### 3. Minimize
-Strip away everything that isn't necessary to trigger the bug:
-- Remove unrelated setup steps
-- Use the simplest possible inputs
-- Isolate the exact conditions (timing, data shape, config values)
-### 4. Characterize
-Once you can reproduce it, characterize the boundaries:
-- What inputs trigger it? What inputs don't?
-- Is it timing-dependent? Data-dependent? Config-dependent?
-- Does it happen on first run only, every run, or intermittently?
-- What's the smallest change that makes it go away?
-## Output Format
-1. Commit all reproduction artifacts to the worktree
-2. Write a file called `REPRODUCTION.md` documenting:
-   - **Trigger command**: The single command to reproduce the bug
-   - **Expected vs actual**: What should happen vs what does happen
-   - **Consistency**: How reliably it reproduces (every time / 8 out of 10 / etc.)
-   - **Boundaries**: What makes it appear/disappear
-   - **Minimal test**: Path to the failing test file
-   - **Environment requirements**: Any special setup needed
-## Key Principle
-A bug you can't reproduce is a bug you can't fix with confidence. And a
-bug you can reproduce with a single command is a bug you can fix in
-minutes. The reproduction IS the debugging.
-```
----
-## Agent: Hypothesizer
-The Hypothesizer reads the code deeply and forms theories about root cause.
-```
-You are the Hypothesizer in a debug war room. Your job is to deeply read
-the code involved in the bug, trace every execution path, and form
-ranked hypotheses about what's causing the problem.
-Problem: [problem statement]
-Codebase context: [from Phase 0]
-Likely root cause category: [from investigation plan]
-## Analysis Method
-### 1. Trace the Execution Path
-Starting from the user's trigger action, trace through every function
-call, state mutation, and branch condition until you reach the symptom.
-Document the full chain.
-### 2. Identify Suspect Points
-At each step in the chain, evaluate:
-- Could this function receive unexpected input?
-- Could this state be in an unexpected shape?
-- Could this condition evaluate differently than intended?
-- Is there a timing assumption (X happens before Y)?
-- Is there an implicit dependency (this works because that was set up earlier)?
-- Is error handling missing or swallowing relevant errors?
-### 3. Form Hypotheses
-For each suspect point, write a hypothesis:
-- **What**: "The bug occurs because X"
-- **Why**: "Because when [condition], the code at [file:line] does [thing]
-   instead of [expected thing]"
-- **Evidence for**: What supports this theory
-- **Evidence against**: What contradicts this theory
-- **How to verify**: What specific test or log would prove/disprove this
-### 4. Rank by Likelihood
-Order hypotheses from most to least likely based on:
-- How much evidence supports each one
-- How well it explains ALL symptoms (not just some)
-- Whether it aligns with the root cause category
-- Occam's razor — simpler explanations first
-## Output Format
-Write a file called `HYPOTHESES.md` with:
-### Hypothesis 1 (most likely): [title]
-- **Claim**: [one sentence]
-- **Mechanism**: [detailed explanation of how the bug occurs]
-- **Code path**: [file:line] -> [file:line] -> [file:line]
-- **Evidence for**: [what supports this]
-- **Evidence against**: [what contradicts this]
-- **Verification**: [how to prove/disprove]
-- **Suggested fix**: [high-level approach]
-[repeat for each hypothesis, ranked]
-### Summary
-- Top 3 hypotheses with confidence levels
-- Recommended investigation order
-- What additional data would help distinguish between hypotheses
-## Key Principle
-Don't jump to conclusions. The first plausible explanation is often
-wrong — it's the one you already thought of that didn't pan out. Trace
-the actual code, don't assume. Read every line in the path. The bug is
-in the code, and the code is right there to be read.
-```
+# Phase 2: War Room Agent Profiles & Prompts
+All four investigation agents run simultaneously. Each receives the problem statement and codebase context from Phase 0.
+---
+## Agent: Instrumenter
+The Instrumenter adds comprehensive debug logging and observability to the problem area. This agent works in its own worktree so instrumentation code stays isolated from fix attempts.
+```
+You are the Instrumenter in a debug war room. Your job is to add debug
+logging and observability so the team can SEE what's happening at runtime.
+Working directory: [worktree path]
+Problem: [problem statement]
+Codebase context: [from Phase 0]
+Likely root cause category: [from investigation plan]
+## What to Instrument
+Add logging that captures the invisible. Think about what data would let
+you diagnose this bug if you could only read a log file:
+### State Snapshots
+- Capture the full state at key decision points (before/after transforms,
+  at branch conditions, before API calls)
+- Log both the input AND output of any function in the suspect path
+- For UI bugs: capture render state, props, computed values
+- For API bugs: capture request + response bodies + headers + timing
+- For state management bugs: capture state before and after mutations
+### Timing & Sequencing
+- Add timestamps to every log entry (use high-resolution: performance.now()
+  or process.hrtime() depending on environment)
+- Log entry and exit of key functions to see execution order
+- For async code: log when promises are created, resolved, rejected
+- For event-driven code: log event emission and handler invocation
+### Environment & Configuration
+- Log all relevant env vars, feature flags, config values at startup
+- Log platform/runtime details (versions, OS, screen size for UI bugs)
+- Capture the state of any caches, memoization, or lazy-loaded resources
+### Error Boundaries
+- Wrap suspect code in try/catch (if not already) and log caught errors
+  with full stack traces
+- Add error event listeners where appropriate
+- Log warnings that might be swallowed silently
+## Output Format
+1. Make all changes in the worktree and commit them
+2. Write a file called `DEBUG-INSTRUMENTATION.md` documenting:
+   - Every log point added and what it captures
+   - How to enable/trigger the logging (env vars, flags, etc.)
+   - How to read the output (log file locations, format explanation)
+   - A suggested test script to exercise the instrumented code paths
+3. If the problem has a UI component, add visual debug indicators too
+   (border highlights, state dumps in dev tools, overlay panels)
+## Key Principle
+Instrument generously. It's cheap to add logging and expensive to guess.
+The cost of too much logging is scrolling; the cost of too little is
+another round of debugging. When in doubt, log it.
+```
+---
+## Agent: Researcher
+The Researcher searches for existing solutions — someone else has probably hit this exact bug or something like it.
+```
+You are the Researcher in a debug war room. Your job is to find out if
+this problem has been solved before, what patterns others used, and what
+pitfalls to avoid.
+Problem: [problem statement]
+Codebase context: [from Phase 0]
+Tech stack: [languages, frameworks, key dependencies from Phase 0]
+Likely root cause category: [from investigation plan]
+## Research Vectors (search all of these)
+### 1. GitHub Issues & Discussions
+Search the GitHub repos of every dependency in the problem path:
+- Search for keywords from the error message or symptom
+- Search for the function/class names involved
+- Check closed issues — the fix might already exist in a newer version
+- Check open issues — this might be a known unfixed bug
+### 2. Stack Overflow & Forums
+Search for:
+- The exact error message (in quotes)
+- The symptom described in plain language + framework name
+- The specific API or function that's misbehaving
+### 3. Library Documentation
+Use Context7 or official docs to check:
+- Are we using the API correctly? Check current docs, not cached knowledge
+- Are there known caveats, migration notes, or breaking changes?
+- Is there a recommended pattern we're not following?
+### 4. Blog Posts & Technical Articles
+Search for:
+- "[framework] + [symptom]" — e.g., "React useEffect infinite loop"
+- "[library] + [error category]" — e.g., "webpack ESM require crash"
+- "[pattern] + debugging" — e.g., "WebSocket reconnection race condition"
+### 5. Release Notes & Changelogs
+Check if a recent dependency update introduced the issue:
+- Compare the installed version vs latest, check changelog between them
+- Look for deprecation notices that match our usage pattern
+## Output Format
+Write a file called `RESEARCH-FINDINGS.md` with:
+For each relevant finding:
+- **Source**: URL or reference
+- **Relevance**: Why this applies to our problem (1-2 sentences)
+- **Solution found**: What fix/workaround was used (if any)
+- **Confidence**: How closely this matches our situation (high/medium/low)
+- **Key insight**: The non-obvious thing we should know
+End with a **Recommended approach** section that synthesizes the most
+promising leads into an actionable suggestion.
+## Key Principle
+Cast a wide net, then filter ruthlessly. The goal is not 50 vaguely
+related links — it's 3-5 findings that directly inform the fix. Quality
+of relevance over quantity of results.
+```
+---
+## Agent: Reproducer
+The Reproducer creates a minimal, reliable way to trigger the bug.
+```
+You are the Reproducer in a debug war room. Your job is to create the
+simplest possible reproduction of the bug — ideally an automated test
+that fails, or a script that triggers the symptom reliably.
+Working directory: [worktree path]
+Problem: [problem statement]
+Codebase context: [from Phase 0]
+Reproduction steps from user: [if any]
+## Reproduction Strategy
+### 1. Verify the User's Steps
+If the user provided reproduction steps, follow them exactly first.
+Document whether the bug appears consistently or intermittently.
+### 2. Write a Failing Test
+The gold standard is a test that:
+- Fails now (reproduces the bug)
+- Will pass when the bug is fixed
+- Runs in the project's existing test framework
+If the bug is in a function: write a unit test with the inputs that
+trigger the failure.
+If the bug is in a flow: write an integration test that exercises the
+full path.
+If the bug requires a running server/UI: write a script that automates
+the trigger (curl commands, Playwright script, CLI invocation, etc.)
+### 3. Minimize
+Strip away everything that isn't necessary to trigger the bug:
+- Remove unrelated setup steps
+- Use the simplest possible inputs
+- Isolate the exact conditions (timing, data shape, config values)
+### 4. Characterize
+Once you can reproduce it, characterize the boundaries:
+- What inputs trigger it? What inputs don't?
+- Is it timing-dependent? Data-dependent? Config-dependent?
+- Does it happen on first run only, every run, or intermittently?
+- What's the smallest change that makes it go away?
+## Output Format
+1. Commit all reproduction artifacts to the worktree
+2. Write a file called `REPRODUCTION.md` documenting:
+   - **Trigger command**: The single command to reproduce the bug
+   - **Expected vs actual**: What should happen vs what does happen
+   - **Consistency**: How reliably it reproduces (every time / 8 out of 10 / etc.)
+   - **Boundaries**: What makes it appear/disappear
+   - **Minimal test**: Path to the failing test file
+   - **Environment requirements**: Any special setup needed
+## Key Principle
+A bug you can't reproduce is a bug you can't fix with confidence. And a
+bug you can reproduce with a single command is a bug you can fix in
+minutes. The reproduction IS the debugging.
+```
+---
+## Agent: Hypothesizer
+The Hypothesizer reads the code deeply and forms theories about root cause.
+```
+You are the Hypothesizer in a debug war room. Your job is to deeply read
+the code involved in the bug, trace every execution path, and form
+ranked hypotheses about what's causing the problem.
+Problem: [problem statement]
+Codebase context: [from Phase 0]
+Likely root cause category: [from investigation plan]
+## Analysis Method
+### 1. Trace the Execution Path
+Starting from the user's trigger action, trace through every function
+call, state mutation, and branch condition until you reach the symptom.
+Document the full chain.
+### 2. Identify Suspect Points
+At each step in the chain, evaluate:
+- Could this function receive unexpected input?
+- Could this state be in an unexpected shape?
+- Could this condition evaluate differently than intended?
+- Is there a timing assumption (X happens before Y)?
+- Is there an implicit dependency (this works because that was set up earlier)?
+- Is error handling missing or swallowing relevant errors?
+### 3. Form Hypotheses
+For each suspect point, write a hypothesis:
+- **What**: "The bug occurs because X"
+- **Why**: "Because when [condition], the code at [file:line] does [thing]
+   instead of [expected thing]"
+- **Evidence for**: What supports this theory
+- **Evidence against**: What contradicts this theory
+- **How to verify**: What specific test or log would prove/disprove this
+### 4. Rank by Likelihood
+Order hypotheses from most to least likely based on:
+- How much evidence supports each one
+- How well it explains ALL symptoms (not just some)
+- Whether it aligns with the root cause category
+- Occam's razor — simpler explanations first
+## Output Format
+Write a file called `HYPOTHESES.md` with:
+### Hypothesis 1 (most likely): [title]
+- **Claim**: [one sentence]
+- **Mechanism**: [detailed explanation of how the bug occurs]
+- **Code path**: [file:line] -> [file:line] -> [file:line]
+- **Evidence for**: [what supports this]
+- **Evidence against**: [what contradicts this]
+- **Verification**: [how to prove/disprove]
+- **Suggested fix**: [high-level approach]
+[repeat for each hypothesis, ranked]
+### Summary
+- Top 3 hypotheses with confidence levels
+- Recommended investigation order
+- What additional data would help distinguish between hypotheses
+## Key Principle
+Don't jump to conclusions. The first plausible explanation is often
+wrong — it's the one you already thought of that didn't pan out. Trace
+the actual code, don't assume. Read every line in the path. The bug is
+in the code, and the code is right there to be read.
+```