@alxyrgin/agent-forge 1.0.0 → 3.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (64) hide show
  1. package/README.md +384 -96
  2. package/dist/index.js +355 -58
  3. package/dist/index.js.map +1 -1
  4. package/package.json +12 -2
  5. package/templates/agents/documentation/gatekeeper.md.ejs +83 -0
  6. package/templates/agents/documentation/librarian.md.ejs +80 -0
  7. package/templates/agents/documentation/verifier.md.ejs +92 -0
  8. package/templates/agents/documentation/writer.md.ejs +189 -0
  9. package/templates/agents/pipeline/analyst.md.ejs +65 -0
  10. package/templates/agents/{core → pipeline}/architect.md.ejs +38 -1
  11. package/templates/agents/pipeline/developer.md.ejs +75 -0
  12. package/templates/agents/pipeline/inspector.md.ejs +123 -0
  13. package/templates/agents/pipeline/planner.md.ejs +211 -0
  14. package/templates/agents/pipeline/reviewer.md.ejs +158 -0
  15. package/templates/agents/pipeline/skeptic.md.ejs +105 -0
  16. package/templates/agents/pipeline/tester.md.ejs +134 -0
  17. package/templates/agents/planning/decomposer.md.ejs +103 -0
  18. package/templates/agents/planning/interviewer.md.ejs +104 -0
  19. package/templates/agents/planning/researcher.md.ejs +96 -0
  20. package/templates/agents/planning/validator.md.ejs +97 -0
  21. package/templates/agents/security/auditor.md.ejs +105 -0
  22. package/templates/agents/security/deployer.md.ejs +81 -0
  23. package/templates/agents/security/prompter.md.ejs +87 -0
  24. package/templates/agents/security/scaffolder.md.ejs +75 -0
  25. package/templates/hooks/protect-docs.sh.ejs +25 -0
  26. package/templates/memory/checkpoint.yml.ejs +21 -0
  27. package/templates/memory/tech-debt.md.ejs +9 -1
  28. package/templates/root/CLAUDE.md.ejs +228 -30
  29. package/templates/root/settings.json.ejs +15 -1
  30. package/templates/rules/agent-output-format.md.ejs +80 -0
  31. package/templates/rules/context-loading.md.ejs +70 -0
  32. package/templates/rules/development-cycle.md.ejs +163 -29
  33. package/templates/rules/quality-gates.md.ejs +80 -0
  34. package/templates/rules/rollback-protocol.md.ejs +60 -0
  35. package/templates/rules/shared-resources.md.ejs +73 -0
  36. package/templates/skills/core/code/SKILL.md.ejs +85 -0
  37. package/templates/skills/core/complete-task/SKILL.md.ejs +22 -9
  38. package/templates/skills/core/done/SKILL.md.ejs +66 -0
  39. package/templates/skills/core/end-session/SKILL.md.ejs +14 -6
  40. package/templates/skills/core/review/SKILL.md.ejs +1 -1
  41. package/templates/skills/core/start-session/SKILL.md.ejs +73 -10
  42. package/templates/skills/core/take-task/SKILL.md.ejs +279 -30
  43. package/templates/skills/core/test/SKILL.md.ejs +200 -0
  44. package/templates/skills/extra/audit-wave/SKILL.md.ejs +58 -0
  45. package/templates/skills/extra/dashboard/SKILL.md.ejs +64 -0
  46. package/templates/skills/extra/decompose/SKILL.md.ejs +72 -0
  47. package/templates/skills/extra/feature/SKILL.md.ejs +83 -0
  48. package/templates/skills/extra/interview/SKILL.md.ejs +66 -0
  49. package/templates/skills/extra/prompts/SKILL.md.ejs +65 -0
  50. package/templates/skills/extra/security/SKILL.md.ejs +77 -0
  51. package/templates/skills/extra/skill-master/SKILL.md.ejs +51 -0
  52. package/templates/skills/extra/spec/SKILL.md.ejs +111 -0
  53. package/templates/skills/extra/techspec/SKILL.md.ejs +97 -0
  54. package/templates/skills/extra/write-report/SKILL.md.ejs +26 -0
  55. package/templates/agents/core/analyst.md.ejs +0 -56
  56. package/templates/agents/core/developer.md.ejs +0 -54
  57. package/templates/agents/core/doc-writer.md.ejs +0 -50
  58. package/templates/agents/core/progress-tracker.md.ejs +0 -56
  59. package/templates/agents/core/reviewer.md.ejs +0 -52
  60. package/templates/agents/core/security-auditor.md.ejs +0 -51
  61. package/templates/agents/core/unit-tester.md.ejs +0 -56
  62. package/templates/agents/extra/acceptance-tester.md.ejs +0 -48
  63. package/templates/agents/extra/integration-tester.md.ejs +0 -49
  64. package/templates/agents/extra/planner.md.ejs +0 -89
package/README.md CHANGED
@@ -1,164 +1,452 @@
1
1
  # agent-forge
2
2
 
3
- AI-driven Development Framework for Claude Code.
3
+ AI-driven Development Framework for Claude Code. Generates a complete development infrastructure with 20 specialized agents, quality gates, and TDD pipelines.
4
4
 
5
- Scaffold a complete development infrastructure with Memory Bank, specialized agents, skills, and development rules in any project.
5
+ [![npm version](https://img.shields.io/npm/v/@alxyrgin/agent-forge.svg)](https://www.npmjs.com/package/@alxyrgin/agent-forge)
6
+ [![license](https://img.shields.io/npm/l/@alxyrgin/agent-forge.svg)](https://github.com/alxyrgin/agent-forge/blob/main/LICENSE)
7
+ [![node](https://img.shields.io/node/v/@alxyrgin/agent-forge.svg)](https://nodejs.org)
6
8
 
7
9
  ## Quick Start
8
10
 
9
11
  ```bash
10
- npx agent-forge init
12
+ npx @alxyrgin/agent-forge init
11
13
  ```
12
14
 
13
- This creates a full AI-driven development infrastructure in your project:
15
+ The interactive wizard asks for your project name, tech stack, team, and agent preset — then scaffolds the entire AI infrastructure in seconds. Start working immediately with `/start-session`.
14
16
 
15
- - **`.claude/`** — CLAUDE.md (Team Lead instructions), 8 agents, 7 skills, 3 rules
16
- - **`dev-infra/memory/`** 8 Memory Bank files for persistent context
17
- - **`dev-infra/tasks/`** — Task tracking system (tasks.json)
18
- - **`dev-infra/sessions/`** — Session logs
19
- - **`dev-infra/tests/`** — Test structure (acceptance, PMI, results)
17
+ ```bash
18
+ npx @alxyrgin/agent-forge init --yes # non-interactive, use defaults
19
+ ```
20
20
 
21
- ## How It Works
21
+ ## What Gets Generated
22
+
23
+ | Directory | Contents |
24
+ |-----------|----------|
25
+ | `.claude/CLAUDE.md` | Team Lead instructions — the orchestrator prompt |
26
+ | `.claude/agents/` | 5–20 specialized agents across 4 categories |
27
+ | `.claude/skills/` | 10–21 slash commands for development workflows |
28
+ | `.claude/rules/` | 8 development standards enforced automatically |
29
+ | `.claude/hooks/` | Git/tool hooks (protect-docs, stop hook) |
30
+ | `dev-infra/memory/` | 9 Memory Bank files for persistent context |
31
+ | `dev-infra/tasks/` | Task tracking system (`tasks.json`) |
32
+ | `dev-infra/sessions/` | Session logs |
33
+ | `dev-infra/tests/` | Test structure (acceptance criteria, PMI scenarios, results) |
34
+
35
+ ## Architecture
36
+
37
+ The Team Lead (defined in `CLAUDE.md`) orchestrates four categories of specialized agents:
38
+
39
+ ```mermaid
40
+ graph TD
41
+ subgraph "Orchestrator"
42
+ TL["Team Lead<br/>(CLAUDE.md)"]
43
+ end
44
+
45
+ subgraph "Pipeline — 8 agents"
46
+ analyst["analyst"]
47
+ architect["architect"]
48
+ skeptic["skeptic"]
49
+ developer["developer"]
50
+ tester["tester"]
51
+ inspector["inspector"]
52
+ reviewer["reviewer"]
53
+ planner["planner"]
54
+ end
55
+
56
+ subgraph "Planning — 4 agents"
57
+ researcher["researcher"]
58
+ interviewer["interviewer"]
59
+ validator["validator"]
60
+ decomposer["decomposer"]
61
+ end
62
+
63
+ subgraph "Security — 4 agents"
64
+ auditor["auditor"]
65
+ prompter["prompter"]
66
+ deployer["deployer"]
67
+ scaffolder["scaffolder"]
68
+ end
69
+
70
+ subgraph "Documentation — 4 agents"
71
+ librarian["librarian"]
72
+ writer["writer"]
73
+ gatekeeper["gatekeeper"]
74
+ verifier["verifier"]
75
+ end
76
+
77
+ TL --> analyst
78
+ TL --> researcher
79
+ TL --> auditor
80
+ TL --> librarian
81
+
82
+ analyst --> architect --> skeptic
83
+ developer <--> tester
84
+ tester --> inspector --> reviewer
85
+ ```
22
86
 
23
- ### Memory Bank
87
+ Each agent has a defined role, a set of allowed tools, a model assignment, and structured JSON output with verdicts. The Team Lead reads each verdict and routes the pipeline accordingly.
24
88
 
25
- 8 markdown files that persist context across sessions:
89
+ ## Pipelines
26
90
 
27
- | File | Purpose |
28
- |------|---------|
29
- | `active-context.md` | Current session state, what's done, next steps |
30
- | `progress.md` | Milestone progress, task statuses |
31
- | `project-brief.md` | Project overview, team, stack |
32
- | `decisions.md` | Architectural Decision Records (ADR) |
33
- | `tech-stack.md` | Technology stack details |
34
- | `tech-debt.md` | Technical debt registry |
35
- | `patterns.md` | Code patterns and conventions |
36
- | `troubleshooting.md` | Problem solutions log |
91
+ Every task is classified by size — **S**, **M**, or **L** — and routed through the appropriate pipeline. Larger tasks get more validation steps.
92
+
93
+ ### S-Pipeline. Small tasks (1 file, < 50 lines)
37
94
 
38
- ### Agents
95
+ ```mermaid
96
+ graph LR
97
+ S1["checkpoint"] --> S2["developer"] --> S3["tester +<br/>inspector"] --> S4["quick-review"] --> S5["tech-debt"] --> S6["fixation"]
39
98
 
40
- Specialized AI agents, each with a specific role:
99
+ style S1 fill:#f0f0f0,stroke:#999
100
+ style S6 fill:#f0f0f0,stroke:#999
101
+ ```
102
+
103
+ ### M-Pipeline. Medium tasks (2–5 files, new module)
104
+
105
+ ```mermaid
106
+ graph LR
107
+ M1["checkpoint"] --> M2["analyst"] --> M3["TDD RED"] --> M4["developer ↔<br/>tester +<br/>inspector"] --> M5["quality<br/>gates"] --> M6["reviewer"] --> M7["tech-debt"] --> M8["fixation"]
41
108
 
42
- | Agent | Role |
43
- |-------|------|
44
- | `analyst` | Requirement analysis from docs |
45
- | `architect` | Module architecture design |
46
- | `developer` | Code implementation |
47
- | `unit-tester` | Unit test writing and running |
48
- | `reviewer` | Code quality review |
49
- | `security-auditor` | Security and access control audit |
50
- | `doc-writer` | Documentation generation |
51
- | `progress-tracker` | Memory bank and task updates |
109
+ style M1 fill:#f0f0f0,stroke:#999
110
+ style M8 fill:#f0f0f0,stroke:#999
111
+ ```
112
+
113
+ ### L-Pipeline. Large tasks (6+ files, architecture changes)
52
114
 
53
- **Full preset** adds: `planner`, `integration-tester`, `acceptance-tester`
115
+ ```mermaid
116
+ graph LR
117
+ L1["checkpoint"] --> L2["analyst"] --> L3["architect +<br/>reviewer<br/>(plan)"] --> L4["skeptic"] --> L5["TDD RED"] --> L6["developer ↔<br/>tester +<br/>inspector"] --> L7["quality<br/>gates"] --> L8["reviewer"] --> L9["tech-debt"] --> L10["fixation"]
54
118
 
55
- **Minimal preset** includes only: `analyst`, `developer`, `unit-tester`, `reviewer`
119
+ style L1 fill:#f0f0f0,stroke:#999
120
+ style L10 fill:#f0f0f0,stroke:#999
121
+ ```
56
122
 
57
- ### Skills (Slash Commands)
123
+ Key pipeline features:
124
+ - **TDD RED phase** (M/L) — tester writes failing tests *before* developer writes code
125
+ - **Per-feature loops** (L) — developer and tester iterate on each feature independently
126
+ - **Inspector gate** — validates test quality after tester, before reviewer
127
+ - **Multi-round review** — reviewer runs up to 3 rounds; CRITICAL/HIGH issues go back to developer
128
+ - **Tech-debt is mandatory** for all sizes — never skipped
129
+
130
+ ## Agent Presets
131
+
132
+ Three presets control how many agents are scaffolded:
133
+
134
+ | Preset | Agents | Skills | Best for |
135
+ |--------|--------|--------|----------|
136
+ | **minimal** | 5 | 10 | Solo developer, small projects |
137
+ | **core** (default) | 8 | 10 | Teams, production projects |
138
+ | **full** | 20 | 21 | Complex systems, enterprise |
139
+
140
+ ### Preset coverage
141
+
142
+ | Agent | minimal | core | full |
143
+ |-------|:-------:|:----:|:----:|
144
+ | analyst | x | x | x |
145
+ | architect | | x | x |
146
+ | skeptic | | x | x |
147
+ | developer | x | x | x |
148
+ | tester | x | x | x |
149
+ | inspector | x | x | x |
150
+ | reviewer | x | x | x |
151
+ | planner | | x | x |
152
+ | researcher | | | x |
153
+ | interviewer | | | x |
154
+ | validator | | | x |
155
+ | decomposer | | | x |
156
+ | auditor | | | x |
157
+ | prompter | | | x |
158
+ | deployer | | | x |
159
+ | scaffolder | | | x |
160
+ | librarian | | | x |
161
+ | writer | | | x |
162
+ | gatekeeper | | | x |
163
+ | verifier | | | x |
164
+
165
+ ## Agents
166
+
167
+ ### Pipeline (8 agents)
168
+
169
+ Core development cycle — from analysis to code review.
170
+
171
+ | Agent | Description | Verdicts |
172
+ |-------|-------------|----------|
173
+ | **analyst** | Analyzes task requirements from documentation, extracts acceptance criteria and PMI scenarios | `COMPLETE`, `NEEDS_DISCOVERY` |
174
+ | **architect** | Designs module architecture — structure, API contracts, data schemas | `READY`, `NEEDS_INPUT` |
175
+ | **skeptic** | Reality checker — verifies plans against actual codebase, finds "mirages" (non-existent files, APIs, modules) | `PASS`, `PASS_WITH_WARNINGS`, `FAIL` |
176
+ | **developer** | Writes code following project patterns, makes failing TDD tests green | `DONE`, `BLOCKED` |
177
+ | **tester** | Parametric testing agent — unit, integration, acceptance, smoke. Supports TDD mode | `PASS`, `FAIL` |
178
+ | **inspector** | Validates test quality — coverage, naming, assertions, mocking, isolation, edge cases | `APPROVE`, `REQUEST_CHANGES` |
179
+ | **reviewer** | Code review with iterations and escalation. Modes: default, plan_review, quick | `APPROVE`, `REQUEST_CHANGES`, `ESCALATE` |
180
+ | **planner** | Project-level planning — milestones, tasks, dependencies, completeness validation | `VALID`, `ISSUES_FOUND` |
181
+
182
+ ### Planning (4 agents)
183
+
184
+ Deep analysis and task decomposition. Available in the **full** preset.
185
+
186
+ | Agent | Description |
187
+ |-------|-------------|
188
+ | **researcher** | Codebase exploration — entry points, patterns, dependencies, integrations |
189
+ | **interviewer** | Structured discovery interview — 3 cycles (general, code-informed, edge cases) |
190
+ | **validator** | Specification validation — 4 modes: userspec, techspec, task, completeness |
191
+ | **decomposer** | Task decomposition — generates atomic tasks with TDD anchors, acceptance criteria, and verify steps |
192
+
193
+ ### Security (4 agents)
194
+
195
+ Security audits and infrastructure review. Available in the **full** preset.
196
+
197
+ | Agent | Description |
198
+ |-------|-------------|
199
+ | **auditor** | Security analysis — OWASP Top 10, hardcoded secrets, threat modeling, access control |
200
+ | **prompter** | LLM prompt review — clarity, few-shot quality, output format, injection safety, token efficiency |
201
+ | **deployer** | CI/CD review — workflow correctness, secrets management, platform config, deploy scripts |
202
+ | **scaffolder** | Project infrastructure review — structure, Docker, pre-commit hooks, .gitignore, dependency management |
203
+
204
+ ### Documentation (4 agents)
205
+
206
+ Documentation quality and deployment validation. Available in the **full** preset.
207
+
208
+ | Agent | Description |
209
+ |-------|-------------|
210
+ | **librarian** | Documentation review — completeness, freshness, absence of bloat, consistency |
211
+ | **writer** | Generates stakeholder-facing reports and internal documentation |
212
+ | **gatekeeper** | Pre-deploy QA — runs tests, verifies acceptance criteria, checks deferred criteria |
213
+ | **verifier** | Post-deploy QA — live environment verification, manual verification plans |
214
+
215
+ ## Skills (Slash Commands)
216
+
217
+ ### Core skills (all presets) — 10 commands
58
218
 
59
219
  | Command | Description |
60
220
  |---------|-------------|
61
- | `/start-session` | Begin work: sync repo, load context, show progress |
62
- | `/end-session` | Save context, create session log, commit & push |
63
- | `/take-task [id]` | Full development cycle: analysis -> code -> test -> review |
64
- | `/complete-task [id]` | Verify task completion, update progress |
221
+ | `/start-session` | Begin work sync repo, check checkpoint, load context, show progress |
222
+ | `/end-session` | Save context, checkpoint, create session log, commit and push |
223
+ | `/take-task [id]` | Full development cycle with feature-size routing (S/M/L) |
224
+ | `/complete-task [id]` | Verify task, smoke test, update progress, clear checkpoint |
65
225
  | `/status` | Show project status, deadlines, blockers |
66
- | `/plan [mode]` | Plan/replan/validate tasks from documentation |
67
- | `/review [file]` | Code review for file or task |
226
+ | `/plan [mode]` | Plan, replan, or validate tasks from documentation |
227
+ | `/review [file]` | Code review for a file or task |
228
+ | `/code [task]` | Direct code generation for a specific task |
229
+ | `/test [target]` | Run or generate tests for a target |
230
+ | `/done [id]` | Quick-complete a task with minimal ceremony |
68
231
 
69
- ### Development Cycle
232
+ ### Extra skills (full preset only) — 11 commands
70
233
 
71
- When you run `/take-task`, the system automatically orchestrates:
72
-
73
- ```
74
- [1] ANALYSIS — analyst reads docs, finds requirements
75
- [2] DISCOVERY — clarifying questions if unclear
76
- [3] PLAN architect designs module structure
77
- [4] CODE — developer writes code
78
- [5] TESTS — unit-tester writes tests (>=80% coverage)
79
- [6] REVIEW — reviewer checks quality and security
80
- [7] FIXATION — update tasks, progress, propose commit
81
- [8] TECH-DEBT — record any deviations
234
+ | Command | Description |
235
+ |---------|-------------|
236
+ | `/interview` | Structured discovery interview (3 cycles, completeness >= 85%) |
237
+ | `/audit-wave` | Comprehensive pre-milestone audit with GO/NO-GO verdict |
238
+ | `/write-report` | Generate non-technical progress report for stakeholders |
239
+ | `/dashboard` | Project dashboard progress, health, tech debt, activity |
240
+ | `/skill-master [name]` | Create a new custom skill from template |
241
+ | `/decompose [task]` | Break down a task into subtasks with TDD anchors |
242
+ | `/feature [name]` | Scaffold a new feature end-to-end |
243
+ | `/security [target]` | Run security analysis on a target |
244
+ | `/spec [feature]` | Generate specification for a feature |
245
+ | `/techspec [module]` | Generate technical specification for a module |
246
+ | `/prompts [agent]` | Manage and optimize agent prompts |
247
+
248
+ ## Rules
249
+
250
+ 8 development standards that are loaded automatically and enforced across all agents:
251
+
252
+ | Rule | Purpose |
253
+ |------|---------|
254
+ | `commit-conventions` | Commit message format — `[type](scope): description` |
255
+ | `development-cycle` | Feature-size routing (S/M/L) and pipeline step definitions |
256
+ | `testing-standards` | Test coverage >= 80%, edge cases, access control testing |
257
+ | `shared-resources` | Singleton resource registry — no duplicate DB connections or API clients |
258
+ | `context-loading` | Just-in-time context loading — pass data, not file references |
259
+ | `agent-output-format` | JSON output standard for all agents with structured verdicts |
260
+ | `quality-gates` | Verdict-based routing between pipeline steps |
261
+ | `rollback-protocol` | Rollback procedures for failed deployments |
262
+
263
+ ## Quality Gates
264
+
265
+ Every agent returns a structured verdict. The Team Lead reads the verdict and routes the pipeline:
266
+
267
+ ```mermaid
268
+ graph TD
269
+ A["Agent returns verdict"] --> B{"Verdict type?"}
270
+ B -->|"PASS / APPROVE / DONE"| C["Continue pipeline"]
271
+ B -->|"WARNINGS / ATTENTION"| D["Show to user,<br/>continue"]
272
+ B -->|"FAIL / BLOCKED"| E{"Retry count < 3?"}
273
+ E -->|"Yes"| F["Return to<br/>previous step"]
274
+ E -->|"No"| G["Escalate to user"]
275
+ F --> A
276
+
277
+ style C fill:#d4edda,stroke:#28a745
278
+ style D fill:#fff3cd,stroke:#ffc107
279
+ style G fill:#f8d7da,stroke:#dc3545
82
280
  ```
83
281
 
84
- ## Configuration
282
+ ### Verdict matrix
283
+
284
+ ```mermaid
285
+ graph LR
286
+ subgraph "Analysis"
287
+ A1["analyst"] -->|COMPLETE| A2["architect"]
288
+ A1 -->|NEEDS_DISCOVERY| A3["ask user"]
289
+ A3 --> A1
290
+ end
291
+
292
+ subgraph "Architecture"
293
+ A2 -->|READY| SK["skeptic"]
294
+ A2 -->|NEEDS_INPUT| A4["ask user"]
295
+ A4 --> A2
296
+ end
297
+
298
+ subgraph "Reality Check"
299
+ SK -->|PASS| DEV["developer"]
300
+ SK -->|FAIL| A2
301
+ end
302
+
303
+ subgraph "Code + Tests"
304
+ DEV -->|DONE| TST["tester"]
305
+ TST -->|PASS| INS["inspector"]
306
+ TST -->|FAIL| DEV
307
+ INS -->|APPROVE| REV["reviewer"]
308
+ INS -->|REQUEST_CHANGES| TST
309
+ end
310
+
311
+ subgraph "Review"
312
+ REV -->|APPROVE| FIN["finalize"]
313
+ REV -->|REQUEST_CHANGES| DEV
314
+ end
315
+
316
+ style FIN fill:#d4edda,stroke:#28a745
317
+ ```
85
318
 
86
- ### Agent Presets
319
+ ## CLI Commands
87
320
 
88
- - **Core** (default, 8 agents) — balanced set for most projects
89
- - **Full** (11 agents) — adds planner, integration tester, acceptance tester
90
- - **Minimal** (4 agents) — analyst, developer, unit-tester, reviewer
321
+ ### `agent-forge init`
91
322
 
92
- ### Interactive Setup
323
+ Initialize AI-driven development infrastructure in the current directory.
93
324
 
94
325
  ```bash
95
- npx agent-forge init
326
+ npx @alxyrgin/agent-forge init # interactive setup
327
+ npx @alxyrgin/agent-forge init --yes # use defaults (TypeScript, core preset)
328
+ npx @alxyrgin/agent-forge init --overwrite # overwrite existing files
96
329
  ```
97
330
 
98
- Prompts for:
331
+ The wizard prompts for:
99
332
  - Project name and description
100
- - Technology stack (Python/TypeScript/Go/Rust)
333
+ - Technology stack (Python / TypeScript / Go / Rust)
101
334
  - Framework and test framework
102
335
  - Team members (names, roles, emails)
103
336
  - Milestones (optional)
104
- - Agent preset (core/full/minimal)
105
- - Commit style (standard/conventional)
337
+ - Agent preset (minimal / core / full)
338
+ - Commit style (standard / conventional)
339
+
340
+ ### `agent-forge update`
106
341
 
107
- ### Non-interactive
342
+ Update framework files while preserving your data.
108
343
 
109
344
  ```bash
110
- npx agent-forge init --yes # Use defaults
345
+ npx @alxyrgin/agent-forge update
111
346
  ```
112
347
 
113
- ## Commands
348
+ **Overwritten** (updated to latest version):
349
+ - `.claude/CLAUDE.md`
350
+ - `.claude/agents/*`
351
+ - `.claude/skills/*`
352
+ - `.claude/rules/*`
353
+ - `.claude/hooks/*`
354
+ - `.claude/settings.json`
114
355
 
115
- ### `agent-forge init`
356
+ **Preserved** (your data stays intact):
357
+ - `dev-infra/memory/*` — your Memory Bank
358
+ - `dev-infra/tasks/*` — your task tracking
359
+ - `dev-infra/sessions/*` — your session logs
360
+ - `dev-infra/tests/*` — your test structure
116
361
 
117
- Initialize AI-driven development infrastructure.
362
+ ### `agent-forge doctor`
118
363
 
119
- Options:
120
- - `--yes, -y` — skip prompts, use defaults
121
- - `--overwrite` — overwrite existing files
364
+ Check integrity of the generated structure. Verifies that all expected files exist and are not empty.
122
365
 
123
- ### `agent-forge doctor`
366
+ ```bash
367
+ npx @alxyrgin/agent-forge doctor
368
+ ```
369
+
370
+ ## Memory Bank
371
+
372
+ 9 files that persist context across sessions. The Team Lead reads and updates these automatically.
373
+
374
+ | File | Purpose |
375
+ |------|---------|
376
+ | `active-context.md` | Current session state — what is done, what is next |
377
+ | `progress.md` | Milestone progress, task statuses |
378
+ | `project-brief.md` | Project overview, team, stack |
379
+ | `decisions.md` | Architectural Decision Records (ADR) |
380
+ | `tech-stack.md` | Technology stack details |
381
+ | `tech-debt.md` | Technical debt registry with lifecycle tracking (open / in_progress / resolved) |
382
+ | `patterns.md` | Code patterns and conventions |
383
+ | `troubleshooting.md` | Problem solutions log |
384
+ | `checkpoint.yml` | Recovery checkpoint for interrupted sessions |
124
385
 
125
- Check integrity of the generated structure.
386
+ ### Checkpoint System
126
387
 
127
- Verifies all expected files exist and are not empty.
388
+ The checkpoint (`dev-infra/memory/checkpoint.yml`) enables recovery after session interruptions:
389
+
390
+ - **Automatic saving** — updated after each pipeline step
391
+ - **Recovery on start** — `/start-session` detects an active checkpoint and offers to resume
392
+ - **Cleanup on completion** — `/complete-task` clears the checkpoint
393
+
394
+ ```mermaid
395
+ graph LR
396
+ A["Session interrupted"] --> B["checkpoint.yml<br/>saved automatically"]
397
+ B --> C["Next session:<br/>/start-session"]
398
+ C --> D{"Active<br/>checkpoint?"}
399
+ D -->|"Yes"| E["Offer to resume<br/>from last step"]
400
+ D -->|"No"| F["Fresh start"]
401
+
402
+ style B fill:#fff3cd,stroke:#ffc107
403
+ style E fill:#d4edda,stroke:#28a745
404
+ ```
128
405
 
129
406
  ## Generated Structure
130
407
 
131
408
  ```
132
409
  your-project/
133
410
  ├── .claude/
134
- │ ├── CLAUDE.md # Team Lead instructions
135
- │ ├── settings.json # Claude Code hooks
136
- │ ├── agents/ # 4-11 specialized agents
137
- │ │ ├── analyst.md
138
- ├── architect.md
139
- │ │ ├── developer.md
140
- │ │ └── ...
141
- │ ├── skills/ # 7 slash commands
411
+ │ ├── CLAUDE.md # Team Lead instructions
412
+ │ ├── settings.json # Claude Code hooks and env
413
+ │ ├── hooks/
414
+ │ │ └── protect-docs.sh # PreToolUse hook — blocks edits in docs/
415
+ │ ├── agents/
416
+ │ │ ├── pipeline/ # analyst, architect, skeptic, developer,
417
+ │ │ │ # tester, inspector, reviewer, planner
418
+ ├── planning/ # researcher, validator, interviewer, decomposer
419
+ │ │ ├── security/ # auditor, prompter, deployer, scaffolder
420
+ │ │ └── documentation/ # librarian, writer, gatekeeper, verifier
421
+ │ ├── skills/
142
422
  │ │ ├── start-session/SKILL.md
143
423
  │ │ ├── take-task/SKILL.md
144
- │ │ └── ...
145
- └── rules/ # 3 development standards
424
+ │ │ ├── code/SKILL.md
425
+ │ ├── test/SKILL.md
426
+ │ │ └── ... # 10–21 slash commands
427
+ │ └── rules/
146
428
  │ ├── commit-conventions.md
147
429
  │ ├── development-cycle.md
148
- └── testing-standards.md
430
+ ├── testing-standards.md
431
+ │ ├── shared-resources.md
432
+ │ ├── context-loading.md
433
+ │ ├── agent-output-format.md
434
+ │ ├── quality-gates.md
435
+ │ └── rollback-protocol.md
149
436
  ├── dev-infra/
150
- │ ├── memory/ # 8 Memory Bank files
437
+ │ ├── memory/ # 9 Memory Bank files
151
438
  │ │ ├── active-context.md
152
439
  │ │ ├── progress.md
440
+ │ │ ├── checkpoint.yml
153
441
  │ │ └── ...
154
442
  │ ├── tasks/
155
- │ │ └── tasks.json # Task tracking
156
- │ ├── sessions/ # Session logs
157
- │ └── tests/ # Test structure
158
- │ ├── acceptance/
159
- │ ├── pmi/
160
- │ └── results/
161
- └── .claude-forge.json # Manifest for doctor
443
+ │ │ └── tasks.json # Task tracking
444
+ │ ├── sessions/ # Session logs
445
+ │ └── tests/
446
+ │ ├── acceptance/ # Acceptance criteria
447
+ │ ├── pmi/ # PMI scenarios
448
+ │ └── results/ # Test results
449
+ └── .claude-forge.json # Manifest for doctor and update
162
450
  ```
163
451
 
164
452
  ## License