npm - @alxyrgin/agent-forge - Versions diffs - 3.0.0 → 3.2.0 - Mend

@alxyrgin/agent-forge 3.0.0 → 3.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (14) hide show

package/README.md +353 -118
package/dist/index.js +262 -19
package/dist/index.js.map +1 -1
package/package.json +1 -1
package/templates/config/linear-mapping.json.ejs +27 -0
package/templates/root/CLAUDE.md.ejs +2 -0
package/templates/rules/linear-sync.md.ejs +65 -0
package/templates/skills/core/complete-task/SKILL.md.ejs +17 -0
package/templates/skills/core/done/SKILL.md.ejs +18 -0
package/templates/skills/core/end-session/SKILL.md.ejs +18 -0
package/templates/skills/core/plan/SKILL.md.ejs +9 -0
package/templates/skills/core/take-task/SKILL.md.ejs +9 -0
package/templates/skills/extra/decompose/SKILL.md.ejs +9 -0
package/templates/skills/extra/sync-linear/SKILL.md.ejs +87 -0

package/README.md CHANGED Viewed

@@ -1,8 +1,10 @@
 # agent-forge
-AI-driven Development Framework for Claude Code.
+AI-driven Development Framework for Claude Code. Generates a complete development infrastructure with 20 specialized agents, quality gates, and TDD pipelines.
-Scaffold a complete development infrastructure with Memory Bank, specialized agents, skills, hooks, checkpoint system, and development rules in any project.
+[![npm version](https://img.shields.io/npm/v/@alxyrgin/agent-forge.svg)](https://www.npmjs.com/package/@alxyrgin/agent-forge)
+[![license](https://img.shields.io/npm/l/@alxyrgin/agent-forge.svg)](https://github.com/alxyrgin/agent-forge/blob/main/LICENSE)
+[![node](https://img.shields.io/node/v/@alxyrgin/agent-forge.svg)](https://nodejs.org)
 ## Quick Start
@@ -10,186 +12,419 @@ Scaffold a complete development infrastructure with Memory Bank, specialized age
 npx @alxyrgin/agent-forge init
 ```
-This creates a full AI-driven development infrastructure in your project:
+The interactive wizard asks for your project name, tech stack, team, and agent preset — then scaffolds the entire AI infrastructure in seconds. Start working immediately with `/start-session`.
-- **`.claude/`** — CLAUDE.md (Team Lead instructions), 5-20 agents, 10-21 skills, 8 rules, hooks
-- **`dev-infra/memory/`** — 9 Memory Bank files for persistent context (incl. checkpoint)
-- **`dev-infra/tasks/`** — Task tracking system (tasks.json)
-- **`dev-infra/sessions/`** — Session logs
-- **`dev-infra/tests/`** — Test structure (acceptance, PMI, results)
+```bash
+npx @alxyrgin/agent-forge init --yes   # non-interactive, use defaults
+```
-## How It Works
+## What Gets Generated
+| Directory | Contents |
+|-----------|----------|
+| `.claude/CLAUDE.md` | Team Lead instructions — the orchestrator prompt |
+| `.claude/agents/` | 5–20 specialized agents across 4 categories |
+| `.claude/skills/` | 10–21 slash commands for development workflows |
+| `.claude/rules/` | 8 development standards enforced automatically |
+| `.claude/hooks/` | Git/tool hooks (protect-docs, stop hook) |
+| `dev-infra/memory/` | 9 Memory Bank files for persistent context |
+| `dev-infra/tasks/` | Task tracking system (`tasks.json`) |
+| `dev-infra/sessions/` | Session logs |
+| `dev-infra/tests/` | Test structure (acceptance criteria, PMI scenarios, results) |
+## Architecture
+The Team Lead (defined in `CLAUDE.md`) orchestrates four categories of specialized agents:
+```mermaid
+graph TD
+    subgraph "Orchestrator"
+        TL["Team Lead<br/>(CLAUDE.md)"]
+    end
+    subgraph "Pipeline — 8 agents"
+        analyst["analyst"]
+        architect["architect"]
+        skeptic["skeptic"]
+        developer["developer"]
+        tester["tester"]
+        inspector["inspector"]
+        reviewer["reviewer"]
+        planner["planner"]
+    end
+    subgraph "Planning — 4 agents"
+        researcher["researcher"]
+        interviewer["interviewer"]
+        validator["validator"]
+        decomposer["decomposer"]
+    end
+    subgraph "Security — 4 agents"
+        auditor["auditor"]
+        prompter["prompter"]
+        deployer["deployer"]
+        scaffolder["scaffolder"]
+    end
+    subgraph "Documentation — 4 agents"
+        librarian["librarian"]
+        writer["writer"]
+        gatekeeper["gatekeeper"]
+        verifier["verifier"]
+    end
+    TL --> analyst
+    TL --> researcher
+    TL --> auditor
+    TL --> librarian
+    analyst --> architect --> skeptic
+    developer <--> tester
+    tester --> inspector --> reviewer
+```
-### Memory Bank
+Each agent has a defined role, a set of allowed tools, a model assignment, and structured JSON output with verdicts. The Team Lead reads each verdict and routes the pipeline accordingly.
-9 markdown files that persist context across sessions:
+## Pipelines
-| File | Purpose |
-|------|---------|
-| `active-context.md` | Current session state, what's done, next steps |
-| `progress.md` | Milestone progress, task statuses |
-| `project-brief.md` | Project overview, team, stack |
-| `decisions.md` | Architectural Decision Records (ADR) |
-| `tech-stack.md` | Technology stack details |
-| `tech-debt.md` | Technical debt registry with lifecycle tracking |
-| `patterns.md` | Code patterns and conventions |
-| `troubleshooting.md` | Problem solutions log |
-| `checkpoint.yml` | Recovery checkpoint for interrupted sessions |
+Every task is classified by size — **S**, **M**, or **L** — and routed through the appropriate pipeline. Larger tasks get more validation steps.
+### S-Pipeline. Small tasks (1 file, < 50 lines)
+```mermaid
+graph LR
+    S1["checkpoint"] --> S2["developer"] --> S3["tester +<br/>inspector"] --> S4["quick-review"] --> S5["tech-debt"] --> S6["fixation"]
+    style S1 fill:#f0f0f0,stroke:#999
+    style S6 fill:#f0f0f0,stroke:#999
+```
+### M-Pipeline. Medium tasks (2–5 files, new module)
-### Agents
+```mermaid
+graph LR
+    M1["checkpoint"] --> M2["analyst"] --> M3["TDD RED"] --> M4["developer ↔<br/>tester +<br/>inspector"] --> M5["quality<br/>gates"] --> M6["reviewer"] --> M7["tech-debt"] --> M8["fixation"]
-20 specialized AI agents organized in 4 categories:
+    style M1 fill:#f0f0f0,stroke:#999
+    style M8 fill:#f0f0f0,stroke:#999
+```
-| Category | Agents | Count |
-|----------|--------|-------|
-| Pipeline | analyst, architect, skeptic, developer, tester, inspector, reviewer, planner | 8 |
-| Planning | researcher, validator, interviewer, decomposer | 4 |
-| Security | auditor, prompter, deployer, scaffolder | 4 |
-| Documentation | librarian, writer, gatekeeper, verifier | 4 |
+### L-Pipeline. Large tasks (6+ files, architecture changes)
-**Preset coverage:**
-- **minimal** (5 agents) — analyst, developer, tester, inspector, reviewer
-- **core** (8 agents) — full pipeline category
-- **full** (20 agents) — all categories
+```mermaid
+graph LR
+    L1["checkpoint"] --> L2["analyst"] --> L3["architect +<br/>reviewer<br/>(plan)"] --> L4["skeptic"] --> L5["TDD RED"] --> L6["developer ↔<br/>tester +<br/>inspector"] --> L7["quality<br/>gates"] --> L8["reviewer"] --> L9["tech-debt"] --> L10["fixation"]
-### Skills (Slash Commands)
+    style L1 fill:#f0f0f0,stroke:#999
+    style L10 fill:#f0f0f0,stroke:#999
+```
-#### Core skills (all presets) — 10
+Key pipeline features:
+- **TDD RED phase** (M/L) — tester writes failing tests *before* developer writes code
+- **Per-feature loops** (L) — developer and tester iterate on each feature independently
+- **Inspector gate** — validates test quality after tester, before reviewer
+- **Multi-round review** — reviewer runs up to 3 rounds; CRITICAL/HIGH issues go back to developer
+- **Tech-debt is mandatory** for all sizes — never skipped
+## Agent Presets
+Three presets control how many agents are scaffolded:
+| Preset | Agents | Skills | Best for |
+|--------|--------|--------|----------|
+| **minimal** | 5 | 10 | Solo developer, small projects |
+| **core** (default) | 8 | 10 | Teams, production projects |
+| **full** | 20 | 21 | Complex systems, enterprise |
+### Preset coverage
+| Agent | minimal | core | full |
+|-------|:-------:|:----:|:----:|
+| analyst | x | x | x |
+| architect | | x | x |
+| skeptic | | x | x |
+| developer | x | x | x |
+| tester | x | x | x |
+| inspector | x | x | x |
+| reviewer | x | x | x |
+| planner | | x | x |
+| researcher | | | x |
+| interviewer | | | x |
+| validator | | | x |
+| decomposer | | | x |
+| auditor | | | x |
+| prompter | | | x |
+| deployer | | | x |
+| scaffolder | | | x |
+| librarian | | | x |
+| writer | | | x |
+| gatekeeper | | | x |
+| verifier | | | x |
+## Agents
+### Pipeline (8 agents)
+Core development cycle — from analysis to code review.
+| Agent | Description | Verdicts |
+|-------|-------------|----------|
+| **analyst** | Analyzes task requirements from documentation, extracts acceptance criteria and PMI scenarios | `COMPLETE`, `NEEDS_DISCOVERY` |
+| **architect** | Designs module architecture — structure, API contracts, data schemas | `READY`, `NEEDS_INPUT` |
+| **skeptic** | Reality checker — verifies plans against actual codebase, finds "mirages" (non-existent files, APIs, modules) | `PASS`, `PASS_WITH_WARNINGS`, `FAIL` |
+| **developer** | Writes code following project patterns, makes failing TDD tests green | `DONE`, `BLOCKED` |
+| **tester** | Parametric testing agent — unit, integration, acceptance, smoke. Supports TDD mode | `PASS`, `FAIL` |
+| **inspector** | Validates test quality — coverage, naming, assertions, mocking, isolation, edge cases | `APPROVE`, `REQUEST_CHANGES` |
+| **reviewer** | Code review with iterations and escalation. Modes: default, plan_review, quick | `APPROVE`, `REQUEST_CHANGES`, `ESCALATE` |
+| **planner** | Project-level planning — milestones, tasks, dependencies, completeness validation | `VALID`, `ISSUES_FOUND` |
+### Planning (4 agents)
+Deep analysis and task decomposition. Available in the **full** preset.
+| Agent | Description |
+|-------|-------------|
+| **researcher** | Codebase exploration — entry points, patterns, dependencies, integrations |
+| **interviewer** | Structured discovery interview — 3 cycles (general, code-informed, edge cases) |
+| **validator** | Specification validation — 4 modes: userspec, techspec, task, completeness |
+| **decomposer** | Task decomposition — generates atomic tasks with TDD anchors, acceptance criteria, and verify steps |
+### Security (4 agents)
+Security audits and infrastructure review. Available in the **full** preset.
+| Agent | Description |
+|-------|-------------|
+| **auditor** | Security analysis — OWASP Top 10, hardcoded secrets, threat modeling, access control |
+| **prompter** | LLM prompt review — clarity, few-shot quality, output format, injection safety, token efficiency |
+| **deployer** | CI/CD review — workflow correctness, secrets management, platform config, deploy scripts |
+| **scaffolder** | Project infrastructure review — structure, Docker, pre-commit hooks, .gitignore, dependency management |
+### Documentation (4 agents)
+Documentation quality and deployment validation. Available in the **full** preset.
+| Agent | Description |
+|-------|-------------|
+| **librarian** | Documentation review — completeness, freshness, absence of bloat, consistency |
+| **writer** | Generates stakeholder-facing reports and internal documentation |
+| **gatekeeper** | Pre-deploy QA — runs tests, verifies acceptance criteria, checks deferred criteria |
+| **verifier** | Post-deploy QA — live environment verification, manual verification plans |
+## Skills (Slash Commands)
+### Core skills (all presets) — 10 commands
 | Command | Description |
 |---------|-------------|
-| `/start-session` | Begin work: sync repo, check checkpoint, load context, show progress |
-| `/end-session` | Save context, checkpoint, create session log, commit & push |
+| `/start-session` | Begin work — sync repo, check checkpoint, load context, show progress |
+| `/end-session` | Save context, checkpoint, create session log, commit and push |
 | `/take-task [id]` | Full development cycle with feature-size routing (S/M/L) |
 | `/complete-task [id]` | Verify task, smoke test, update progress, clear checkpoint |
 | `/status` | Show project status, deadlines, blockers |
-| `/plan [mode]` | Plan/replan/validate tasks from documentation |
-| `/review [file]` | Code review for file or task |
+| `/plan [mode]` | Plan, replan, or validate tasks from documentation |
+| `/review [file]` | Code review for a file or task |
 | `/code [task]` | Direct code generation for a specific task |
 | `/test [target]` | Run or generate tests for a target |
 | `/done [id]` | Quick-complete a task with minimal ceremony |
-#### Extra skills (full preset only) — 11
+### Extra skills (full preset only) — 11 commands
 | Command | Description |
 |---------|-------------|
 | `/interview` | Structured discovery interview (3 cycles, completeness >= 85%) |
 | `/audit-wave` | Comprehensive pre-milestone audit with GO/NO-GO verdict |
 | `/write-report` | Generate non-technical progress report for stakeholders |
-| `/dashboard` | Project dashboard: progress, health, tech debt, activity |
+| `/dashboard` | Project dashboard — progress, health, tech debt, activity |
 | `/skill-master [name]` | Create a new custom skill from template |
-| `/decompose [task]` | Break down a task into subtasks |
+| `/decompose [task]` | Break down a task into subtasks with TDD anchors |
 | `/feature [name]` | Scaffold a new feature end-to-end |
 | `/security [target]` | Run security analysis on a target |
 | `/spec [feature]` | Generate specification for a feature |
 | `/techspec [module]` | Generate technical specification for a module |
 | `/prompts [agent]` | Manage and optimize agent prompts |
-### Feature-size Routing
-Tasks are automatically classified and routed through the appropriate pipeline:
-| Size | Criteria | Steps |
-|------|----------|-------|
-| **S** | 1 file, < 50 lines | checkpoint → code → tester+inspector → quick-review → tech-debt → fixation (6 steps) |
-| **M** | 2-5 files, new module | checkpoint → analysis → TDD(RED) → code+tester+inspector → review → tech-debt → fixation (8 steps) |
-| **L** | 6+ files, architecture changes | full cycle with architect, skeptic, per-feature loops, inspector, multi-round review (10 steps) |
-### Checkpoint System
+## Rules
-The checkpoint system (`dev-infra/memory/checkpoint.yml`) enables recovery after session interruptions:
+8 development standards that are loaded automatically and enforced across all agents:
-- **Automatic saving** — checkpoint is updated after each pipeline step
-- **Recovery on start** — `/start-session` detects active checkpoint and offers to resume
-- **Cleanup on completion** — `/complete-task` clears the checkpoint
-### Hooks
+| Rule | Purpose |
+|------|---------|
+| `commit-conventions` | Commit message format — `[type](scope): description` |
+| `development-cycle` | Feature-size routing (S/M/L) and pipeline step definitions |
+| `testing-standards` | Test coverage >= 80%, edge cases, access control testing |
+| `shared-resources` | Singleton resource registry — no duplicate DB connections or API clients |
+| `context-loading` | Just-in-time context loading — pass data, not file references |
+| `agent-output-format` | JSON output standard for all agents with structured verdicts |
+| `quality-gates` | Verdict-based routing between pipeline steps |
+| `rollback-protocol` | Rollback procedures for failed deployments |
-- **`protect-docs.sh`** — PreToolUse hook that blocks Edit/Write operations in `docs/` directory
-- **Stop hook** — Reminds to save checkpoint and run `/end-session` before exiting
+## Quality Gates
-### Rules
+Every agent returns a structured verdict. The Team Lead reads the verdict and routes the pipeline:
-8 development standards enforced automatically:
+```mermaid
+graph TD
+    A["Agent returns verdict"] --> B{"Verdict type?"}
+    B -->|"PASS / APPROVE / DONE"| C["Continue pipeline"]
+    B -->|"WARNINGS / ATTENTION"| D["Show to user,<br/>continue"]
+    B -->|"FAIL / BLOCKED"| E{"Retry count < 3?"}
+    E -->|"Yes"| F["Return to<br/>previous step"]
+    E -->|"No"| G["Escalate to user"]
+    F --> A
-| Rule | Purpose |
-|------|---------|
-| `commit-conventions` | Commit message format and style |
-| `development-cycle` | Feature-size routing and pipeline steps |
-| `testing-standards` | Test coverage and quality requirements |
-| `shared-resources` | Singleton resource registry and patterns |
-| `context-loading` | Just-in-time context loading, anti-patterns |
-| `agent-output-format` | JSON output standard for all agents |
-| `quality-gates` | Verdict-based routing and quality checkpoints |
-| `rollback-protocol` | Rollback procedures for failed deployments |
+    style C fill:#d4edda,stroke:#28a745
+    style D fill:#fff3cd,stroke:#ffc107
+    style G fill:#f8d7da,stroke:#dc3545
+```
-## Configuration
+### Verdict matrix
+```mermaid
+graph LR
+    subgraph "Analysis"
+        A1["analyst"] -->|COMPLETE| A2["architect"]
+        A1 -->|NEEDS_DISCOVERY| A3["ask user"]
+        A3 --> A1
+    end
+    subgraph "Architecture"
+        A2 -->|READY| SK["skeptic"]
+        A2 -->|NEEDS_INPUT| A4["ask user"]
+        A4 --> A2
+    end
+    subgraph "Reality Check"
+        SK -->|PASS| DEV["developer"]
+        SK -->|FAIL| A2
+    end
+    subgraph "Code + Tests"
+        DEV -->|DONE| TST["tester"]
+        TST -->|PASS| INS["inspector"]
+        TST -->|FAIL| DEV
+        INS -->|APPROVE| REV["reviewer"]
+        INS -->|REQUEST_CHANGES| TST
+    end
+    subgraph "Review"
+        REV -->|APPROVE| FIN["finalize"]
+        REV -->|REQUEST_CHANGES| DEV
+    end
+    style FIN fill:#d4edda,stroke:#28a745
+```
-### Agent Presets
+## CLI Commands
-| Preset | Agents | Skills | Description |
-|--------|--------|--------|-------------|
-| **minimal** | 5 | 10 | Essentials + inspector |
-| **core** (default) | 8 | 10 | Full development pipeline |
-| **full** | 20 | 21 | All categories + extra skills |
+### `agent-forge init`
-### Interactive Setup
+Initialize AI-driven development infrastructure in the current directory.
 ```bash
-npx @alxyrgin/agent-forge init
+npx @alxyrgin/agent-forge init           # interactive setup
+npx @alxyrgin/agent-forge init --yes     # use defaults (TypeScript, core preset)
+npx @alxyrgin/agent-forge init --overwrite  # overwrite existing files
 ```
-Prompts for:
+The wizard prompts for:
 - Project name and description
-- Technology stack (Python/TypeScript/Go/Rust)
+- Technology stack (Python / TypeScript / Go / Rust)
 - Framework and test framework
 - Team members (names, roles, emails)
 - Milestones (optional)
-- Agent preset (core/full/minimal)
-- Commit style (standard/conventional)
+- Agent preset (minimal / core / full)
+- Commit style (standard / conventional)
+### `agent-forge update`
-### Non-interactive
+Update framework files while preserving your data.
 ```bash
-npx @alxyrgin/agent-forge init --yes  # Use defaults
+npx @alxyrgin/agent-forge update
 ```
-## Commands
+**Overwritten** (updated to latest version):
+- `.claude/CLAUDE.md`
+- `.claude/agents/*`
+- `.claude/skills/*`
+- `.claude/rules/*`
+- `.claude/hooks/*`
+- `.claude/settings.json`
-### `agent-forge init`
+**Preserved** (your data stays intact):
+- `dev-infra/memory/*` — your Memory Bank
+- `dev-infra/tasks/*` — your task tracking
+- `dev-infra/sessions/*` — your session logs
+- `dev-infra/tests/*` — your test structure
-Initialize AI-driven development infrastructure.
+### `agent-forge doctor`
-Options:
-- `--yes, -y` — skip prompts, use defaults
-- `--overwrite` — overwrite existing files
+Check integrity of the generated structure. Verifies that all expected files exist and are not empty.
-### `agent-forge doctor`
+```bash
+npx @alxyrgin/agent-forge doctor
+```
+## Memory Bank
+9 files that persist context across sessions. The Team Lead reads and updates these automatically.
+| File | Purpose |
+|------|---------|
+| `active-context.md` | Current session state — what is done, what is next |
+| `progress.md` | Milestone progress, task statuses |
+| `project-brief.md` | Project overview, team, stack |
+| `decisions.md` | Architectural Decision Records (ADR) |
+| `tech-stack.md` | Technology stack details |
+| `tech-debt.md` | Technical debt registry with lifecycle tracking (open / in_progress / resolved) |
+| `patterns.md` | Code patterns and conventions |
+| `troubleshooting.md` | Problem solutions log |
+| `checkpoint.yml` | Recovery checkpoint for interrupted sessions |
-Check integrity of the generated structure.
+### Checkpoint System
+The checkpoint (`dev-infra/memory/checkpoint.yml`) enables recovery after session interruptions:
+- **Automatic saving** — updated after each pipeline step
+- **Recovery on start** — `/start-session` detects an active checkpoint and offers to resume
+- **Cleanup on completion** — `/complete-task` clears the checkpoint
-Verifies all expected files exist and are not empty.
+```mermaid
+graph LR
+    A["Session interrupted"] --> B["checkpoint.yml<br/>saved automatically"]
+    B --> C["Next session:<br/>/start-session"]
+    C --> D{"Active<br/>checkpoint?"}
+    D -->|"Yes"| E["Offer to resume<br/>from last step"]
+    D -->|"No"| F["Fresh start"]
+    style B fill:#fff3cd,stroke:#ffc107
+    style E fill:#d4edda,stroke:#28a745
+```
 ## Generated Structure
 ```
 your-project/
 ├── .claude/
-│   ├── CLAUDE.md              # Team Lead instructions
-│   ├── settings.json          # Claude Code hooks & env
+│   ├── CLAUDE.md                  # Team Lead instructions
+│   ├── settings.json              # Claude Code hooks and env
 │   ├── hooks/
-│   │   └── protect-docs.sh    # PreToolUse hook
-│   ├── agents/                # 5-20 specialized agents (4 categories)
-│   │   ├── pipeline/          # analyst, architect, skeptic, developer,
-│   │   │                      # tester, inspector, reviewer, planner
-│   │   ├── planning/          # researcher, validator, interviewer, decomposer
-│   │   ├── security/          # auditor, prompter, deployer, scaffolder
-│   │   └── documentation/     # librarian, writer, gatekeeper, verifier
-│   ├── skills/                # 10-21 slash commands
+│   │   └── protect-docs.sh        # PreToolUse hook — blocks edits in docs/
+│   ├── agents/
+│   │   ├── pipeline/              # analyst, architect, skeptic, developer,
+│   │   │                          # tester, inspector, reviewer, planner
+│   │   ├── planning/              # researcher, validator, interviewer, decomposer
+│   │   ├── security/              # auditor, prompter, deployer, scaffolder
+│   │   └── documentation/         # librarian, writer, gatekeeper, verifier
+│   ├── skills/
 │   │   ├── start-session/SKILL.md
 │   │   ├── take-task/SKILL.md
-│   │   └── ...
-│   └── rules/                 # 8 development standards
+│   │   ├── code/SKILL.md
+│   │   ├── test/SKILL.md
+│   │   └── ...                    # 10–21 slash commands
+│   └── rules/
 │       ├── commit-conventions.md
 │       ├── development-cycle.md
 │       ├── testing-standards.md
@@ -199,19 +434,19 @@ your-project/
 │       ├── quality-gates.md
 │       └── rollback-protocol.md
 ├── dev-infra/
-│   ├── memory/                # 9 Memory Bank files
+│   ├── memory/                    # 9 Memory Bank files
 │   │   ├── active-context.md
 │   │   ├── progress.md
 │   │   ├── checkpoint.yml
 │   │   └── ...
 │   ├── tasks/
-│   │   └── tasks.json         # Task tracking
-│   ├── sessions/              # Session logs
-│   └── tests/                 # Test structure
-│       ├── acceptance/
-│       ├── pmi/
-│       └── results/
-└── .claude-forge.json         # Manifest for doctor
+│   │   └── tasks.json             # Task tracking
+│   ├── sessions/                  # Session logs
+│   └── tests/
+│       ├── acceptance/            # Acceptance criteria
+│       ├── pmi/                   # PMI scenarios
+│       └── results/               # Test results
+└── .claude-forge.json             # Manifest for doctor and update
 ```
 ## License