@alxyrgin/agent-forge 1.0.0 → 3.1.0
- package/README.md +384 -96
- package/dist/index.js +355 -58
- package/dist/index.js.map +1 -1
- package/package.json +12 -2
- package/templates/agents/documentation/gatekeeper.md.ejs +83 -0
- package/templates/agents/documentation/librarian.md.ejs +80 -0
- package/templates/agents/documentation/verifier.md.ejs +92 -0
- package/templates/agents/documentation/writer.md.ejs +189 -0
- package/templates/agents/pipeline/analyst.md.ejs +65 -0
- package/templates/agents/{core → pipeline}/architect.md.ejs +38 -1
- package/templates/agents/pipeline/developer.md.ejs +75 -0
- package/templates/agents/pipeline/inspector.md.ejs +123 -0
- package/templates/agents/pipeline/planner.md.ejs +211 -0
- package/templates/agents/pipeline/reviewer.md.ejs +158 -0
- package/templates/agents/pipeline/skeptic.md.ejs +105 -0
- package/templates/agents/pipeline/tester.md.ejs +134 -0
- package/templates/agents/planning/decomposer.md.ejs +103 -0
- package/templates/agents/planning/interviewer.md.ejs +104 -0
- package/templates/agents/planning/researcher.md.ejs +96 -0
- package/templates/agents/planning/validator.md.ejs +97 -0
- package/templates/agents/security/auditor.md.ejs +105 -0
- package/templates/agents/security/deployer.md.ejs +81 -0
- package/templates/agents/security/prompter.md.ejs +87 -0
- package/templates/agents/security/scaffolder.md.ejs +75 -0
- package/templates/hooks/protect-docs.sh.ejs +25 -0
- package/templates/memory/checkpoint.yml.ejs +21 -0
- package/templates/memory/tech-debt.md.ejs +9 -1
- package/templates/root/CLAUDE.md.ejs +228 -30
- package/templates/root/settings.json.ejs +15 -1
- package/templates/rules/agent-output-format.md.ejs +80 -0
- package/templates/rules/context-loading.md.ejs +70 -0
- package/templates/rules/development-cycle.md.ejs +163 -29
- package/templates/rules/quality-gates.md.ejs +80 -0
- package/templates/rules/rollback-protocol.md.ejs +60 -0
- package/templates/rules/shared-resources.md.ejs +73 -0
- package/templates/skills/core/code/SKILL.md.ejs +85 -0
- package/templates/skills/core/complete-task/SKILL.md.ejs +22 -9
- package/templates/skills/core/done/SKILL.md.ejs +66 -0
- package/templates/skills/core/end-session/SKILL.md.ejs +14 -6
- package/templates/skills/core/review/SKILL.md.ejs +1 -1
- package/templates/skills/core/start-session/SKILL.md.ejs +73 -10
- package/templates/skills/core/take-task/SKILL.md.ejs +279 -30
- package/templates/skills/core/test/SKILL.md.ejs +200 -0
- package/templates/skills/extra/audit-wave/SKILL.md.ejs +58 -0
- package/templates/skills/extra/dashboard/SKILL.md.ejs +64 -0
- package/templates/skills/extra/decompose/SKILL.md.ejs +72 -0
- package/templates/skills/extra/feature/SKILL.md.ejs +83 -0
- package/templates/skills/extra/interview/SKILL.md.ejs +66 -0
- package/templates/skills/extra/prompts/SKILL.md.ejs +65 -0
- package/templates/skills/extra/security/SKILL.md.ejs +77 -0
- package/templates/skills/extra/skill-master/SKILL.md.ejs +51 -0
- package/templates/skills/extra/spec/SKILL.md.ejs +111 -0
- package/templates/skills/extra/techspec/SKILL.md.ejs +97 -0
- package/templates/skills/extra/write-report/SKILL.md.ejs +26 -0
- package/templates/agents/core/analyst.md.ejs +0 -56
- package/templates/agents/core/developer.md.ejs +0 -54
- package/templates/agents/core/doc-writer.md.ejs +0 -50
- package/templates/agents/core/progress-tracker.md.ejs +0 -56
- package/templates/agents/core/reviewer.md.ejs +0 -52
- package/templates/agents/core/security-auditor.md.ejs +0 -51
- package/templates/agents/core/unit-tester.md.ejs +0 -56
- package/templates/agents/extra/acceptance-tester.md.ejs +0 -48
- package/templates/agents/extra/integration-tester.md.ejs +0 -49
- package/templates/agents/extra/planner.md.ejs +0 -89
package/README.md
CHANGED
|
@@ -1,164 +1,452 @@
|
|
|
1
1
|
# agent-forge
|
|
2
2
|
|
|
3
|
-
AI-driven Development Framework for Claude Code.
|
|
3
|
+
AI-driven Development Framework for Claude Code. Generates a complete development infrastructure with 20 specialized agents, quality gates, and TDD pipelines.
|
|
4
4
|
|
|
5
|
-
|
|
5
|
+
[npm package](https://www.npmjs.com/package/@alxyrgin/agent-forge)
|
|
6
|
+
[License](https://github.com/alxyrgin/agent-forge/blob/main/LICENSE)
|
|
7
|
+
[Node.js](https://nodejs.org)
|
|
6
8
|
|
|
7
9
|
## Quick Start
|
|
8
10
|
|
|
9
11
|
```bash
|
|
10
|
-
npx agent-forge init
|
|
12
|
+
npx @alxyrgin/agent-forge init
|
|
11
13
|
```
|
|
12
14
|
|
|
13
|
-
|
|
15
|
+
The interactive wizard asks for your project name, tech stack, team, and agent preset — then scaffolds the entire AI infrastructure in seconds. Start working immediately with `/start-session`.
|
|
14
16
|
|
|
15
|
-
|
|
16
|
-
|
|
17
|
-
|
|
18
|
-
- **`dev-infra/sessions/`** — Session logs
|
|
19
|
-
- **`dev-infra/tests/`** — Test structure (acceptance, PMI, results)
|
|
17
|
+
```bash
|
|
18
|
+
npx @alxyrgin/agent-forge init --yes # non-interactive, use defaults
|
|
19
|
+
```
|
|
20
20
|
|
|
21
|
-
##
|
|
21
|
+
## What Gets Generated
|
|
22
|
+
|
|
23
|
+
| Directory | Contents |
|
|
24
|
+
|-----------|----------|
|
|
25
|
+
| `.claude/CLAUDE.md` | Team Lead instructions — the orchestrator prompt |
|
|
26
|
+
| `.claude/agents/` | 5–20 specialized agents across 4 categories |
|
|
27
|
+
| `.claude/skills/` | 10–21 slash commands for development workflows |
|
|
28
|
+
| `.claude/rules/` | 8 development standards enforced automatically |
|
|
29
|
+
| `.claude/hooks/` | Git/tool hooks (protect-docs, stop hook) |
|
|
30
|
+
| `dev-infra/memory/` | 9 Memory Bank files for persistent context |
|
|
31
|
+
| `dev-infra/tasks/` | Task tracking system (`tasks.json`) |
|
|
32
|
+
| `dev-infra/sessions/` | Session logs |
|
|
33
|
+
| `dev-infra/tests/` | Test structure (acceptance criteria, PMI scenarios, results) |
|
|
34
|
+
|
|
35
|
+
## Architecture
|
|
36
|
+
|
|
37
|
+
The Team Lead (defined in `CLAUDE.md`) orchestrates four categories of specialized agents:
|
|
38
|
+
|
|
39
|
+
```mermaid
|
|
40
|
+
graph TD
|
|
41
|
+
subgraph "Orchestrator"
|
|
42
|
+
TL["Team Lead<br/>(CLAUDE.md)"]
|
|
43
|
+
end
|
|
44
|
+
|
|
45
|
+
subgraph "Pipeline — 8 agents"
|
|
46
|
+
analyst["analyst"]
|
|
47
|
+
architect["architect"]
|
|
48
|
+
skeptic["skeptic"]
|
|
49
|
+
developer["developer"]
|
|
50
|
+
tester["tester"]
|
|
51
|
+
inspector["inspector"]
|
|
52
|
+
reviewer["reviewer"]
|
|
53
|
+
planner["planner"]
|
|
54
|
+
end
|
|
55
|
+
|
|
56
|
+
subgraph "Planning — 4 agents"
|
|
57
|
+
researcher["researcher"]
|
|
58
|
+
interviewer["interviewer"]
|
|
59
|
+
validator["validator"]
|
|
60
|
+
decomposer["decomposer"]
|
|
61
|
+
end
|
|
62
|
+
|
|
63
|
+
subgraph "Security — 4 agents"
|
|
64
|
+
auditor["auditor"]
|
|
65
|
+
prompter["prompter"]
|
|
66
|
+
deployer["deployer"]
|
|
67
|
+
scaffolder["scaffolder"]
|
|
68
|
+
end
|
|
69
|
+
|
|
70
|
+
subgraph "Documentation — 4 agents"
|
|
71
|
+
librarian["librarian"]
|
|
72
|
+
writer["writer"]
|
|
73
|
+
gatekeeper["gatekeeper"]
|
|
74
|
+
verifier["verifier"]
|
|
75
|
+
end
|
|
76
|
+
|
|
77
|
+
TL --> analyst
|
|
78
|
+
TL --> researcher
|
|
79
|
+
TL --> auditor
|
|
80
|
+
TL --> librarian
|
|
81
|
+
|
|
82
|
+
analyst --> architect --> skeptic
|
|
83
|
+
developer <--> tester
|
|
84
|
+
tester --> inspector --> reviewer
|
|
85
|
+
```
|
|
22
86
|
|
|
23
|
-
|
|
87
|
+
Each agent has a defined role, a set of allowed tools, a model assignment, and structured JSON output with verdicts. The Team Lead reads each verdict and routes the pipeline accordingly.
|
|
24
88
|
|
|
25
|
-
|
|
89
|
+
## Pipelines
|
|
26
90
|
|
|
27
|
-
|
|
28
|
-
|
|
29
|
-
|
|
30
|
-
| `progress.md` | Milestone progress, task statuses |
|
|
31
|
-
| `project-brief.md` | Project overview, team, stack |
|
|
32
|
-
| `decisions.md` | Architectural Decision Records (ADR) |
|
|
33
|
-
| `tech-stack.md` | Technology stack details |
|
|
34
|
-
| `tech-debt.md` | Technical debt registry |
|
|
35
|
-
| `patterns.md` | Code patterns and conventions |
|
|
36
|
-
| `troubleshooting.md` | Problem solutions log |
|
|
91
|
+
Every task is classified by size — **S**, **M**, or **L** — and routed through the appropriate pipeline. Larger tasks get more validation steps.
|
|
92
|
+
|
|
93
|
+
### S-Pipeline. Small tasks (1 file, < 50 lines)
|
|
37
94
|
|
|
38
|
-
|
|
95
|
+
```mermaid
|
|
96
|
+
graph LR
|
|
97
|
+
S1["checkpoint"] --> S2["developer"] --> S3["tester +<br/>inspector"] --> S4["quick-review"] --> S5["tech-debt"] --> S6["fixation"]
|
|
39
98
|
|
|
40
|
-
|
|
99
|
+
style S1 fill:#f0f0f0,stroke:#999
|
|
100
|
+
style S6 fill:#f0f0f0,stroke:#999
|
|
101
|
+
```
|
|
102
|
+
|
|
103
|
+
### M-Pipeline. Medium tasks (2–5 files, new module)
|
|
104
|
+
|
|
105
|
+
```mermaid
|
|
106
|
+
graph LR
|
|
107
|
+
M1["checkpoint"] --> M2["analyst"] --> M3["TDD RED"] --> M4["developer ↔<br/>tester +<br/>inspector"] --> M5["quality<br/>gates"] --> M6["reviewer"] --> M7["tech-debt"] --> M8["fixation"]
|
|
41
108
|
|
|
42
|
-
|
|
43
|
-
|
|
44
|
-
|
|
45
|
-
|
|
46
|
-
|
|
47
|
-
| `unit-tester` | Unit test writing and running |
|
|
48
|
-
| `reviewer` | Code quality review |
|
|
49
|
-
| `security-auditor` | Security and access control audit |
|
|
50
|
-
| `doc-writer` | Documentation generation |
|
|
51
|
-
| `progress-tracker` | Memory bank and task updates |
|
|
109
|
+
style M1 fill:#f0f0f0,stroke:#999
|
|
110
|
+
style M8 fill:#f0f0f0,stroke:#999
|
|
111
|
+
```
|
|
112
|
+
|
|
113
|
+
### L-Pipeline. Large tasks (6+ files, architecture changes)
|
|
52
114
|
|
|
53
|
-
|
|
115
|
+
```mermaid
|
|
116
|
+
graph LR
|
|
117
|
+
L1["checkpoint"] --> L2["analyst"] --> L3["architect +<br/>reviewer<br/>(plan)"] --> L4["skeptic"] --> L5["TDD RED"] --> L6["developer ↔<br/>tester +<br/>inspector"] --> L7["quality<br/>gates"] --> L8["reviewer"] --> L9["tech-debt"] --> L10["fixation"]
|
|
54
118
|
|
|
55
|
-
|
|
119
|
+
style L1 fill:#f0f0f0,stroke:#999
|
|
120
|
+
style L10 fill:#f0f0f0,stroke:#999
|
|
121
|
+
```
|
|
56
122
|
|
|
57
|
-
|
|
123
|
+
Key pipeline features:
|
|
124
|
+
- **TDD RED phase** (M/L) — tester writes failing tests *before* developer writes code
|
|
125
|
+
- **Per-feature loops** (L) — developer and tester iterate on each feature independently
|
|
126
|
+
- **Inspector gate** — validates test quality after tester, before reviewer
|
|
127
|
+
- **Multi-round review** — reviewer runs up to 3 rounds; CRITICAL/HIGH issues go back to developer
|
|
128
|
+
- **Tech-debt is mandatory** for all sizes — never skipped
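
The size-based routing described in this section can be sketched in Python. This is only an illustration and not agent-forge's implementation: the function name `classify_task`, its parameters, and the tie-breaking rule (a single-file change of 50 lines or more is treated as M) are assumptions. The step lists are transcribed from the three pipeline diagrams.

```python
# Hypothetical sketch of S/M/L routing; names and tie-breaking are assumptions.
PIPELINES = {
    "S": ["checkpoint", "developer", "tester + inspector", "quick-review",
          "tech-debt", "fixation"],
    "M": ["checkpoint", "analyst", "TDD RED", "developer <-> tester + inspector",
          "quality gates", "reviewer", "tech-debt", "fixation"],
    "L": ["checkpoint", "analyst", "architect + reviewer (plan)", "skeptic",
          "TDD RED", "developer <-> tester + inspector", "quality gates",
          "reviewer", "tech-debt", "fixation"],
}


def classify_task(files_changed: int, lines_changed: int,
                  new_module: bool = False,
                  architecture_change: bool = False) -> str:
    """Return 'S', 'M', or 'L' using the thresholds from the pipeline headings."""
    if architecture_change or files_changed >= 6:
        return "L"
    if new_module or files_changed >= 2 or lines_changed >= 50:
        return "M"
    return "S"


size = classify_task(files_changed=3, lines_changed=120)
print(size, "->", " -> ".join(PIPELINES[size]))
```

Every pipeline in the sketch ends with the `tech-debt` and `fixation` steps, which matches the statement that tech-debt is never skipped.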
|
|
129
|
+
|
|
130
|
+
## Agent Presets
|
|
131
|
+
|
|
132
|
+
Three presets control how many agents are scaffolded:
|
|
133
|
+
|
|
134
|
+
| Preset | Agents | Skills | Best for |
|
|
135
|
+
|--------|--------|--------|----------|
|
|
136
|
+
| **minimal** | 5 | 10 | Solo developer, small projects |
|
|
137
|
+
| **core** (default) | 8 | 10 | Teams, production projects |
|
|
138
|
+
| **full** | 20 | 21 | Complex systems, enterprise |
|
|
139
|
+
|
|
140
|
+
### Preset coverage
|
|
141
|
+
|
|
142
|
+
| Agent | minimal | core | full |
|
|
143
|
+
|-------|:-------:|:----:|:----:|
|
|
144
|
+
| analyst | x | x | x |
|
|
145
|
+
| architect | | x | x |
|
|
146
|
+
| skeptic | | x | x |
|
|
147
|
+
| developer | x | x | x |
|
|
148
|
+
| tester | x | x | x |
|
|
149
|
+
| inspector | x | x | x |
|
|
150
|
+
| reviewer | x | x | x |
|
|
151
|
+
| planner | | x | x |
|
|
152
|
+
| researcher | | | x |
|
|
153
|
+
| interviewer | | | x |
|
|
154
|
+
| validator | | | x |
|
|
155
|
+
| decomposer | | | x |
|
|
156
|
+
| auditor | | | x |
|
|
157
|
+
| prompter | | | x |
|
|
158
|
+
| deployer | | | x |
|
|
159
|
+
| scaffolder | | | x |
|
|
160
|
+
| librarian | | | x |
|
|
161
|
+
| writer | | | x |
|
|
162
|
+
| gatekeeper | | | x |
|
|
163
|
+
| verifier | | | x |
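
The coverage table can also be read as a lookup from preset name to agent list. The Python sketch below uses the agent names from the table. The data layout and the `agents_for` helper are hypothetical and say nothing about how agent-forge stores presets internally.

```python
# Agent names come from the coverage table; the data layout is an assumption.
PIPELINE_MINIMAL = ["analyst", "developer", "tester", "inspector", "reviewer"]
PIPELINE_CORE = PIPELINE_MINIMAL + ["architect", "skeptic", "planner"]
PLANNING = ["researcher", "interviewer", "validator", "decomposer"]
SECURITY = ["auditor", "prompter", "deployer", "scaffolder"]
DOCUMENTATION = ["librarian", "writer", "gatekeeper", "verifier"]

PRESETS = {
    "minimal": PIPELINE_MINIMAL,
    "core": PIPELINE_CORE,
    "full": PIPELINE_CORE + PLANNING + SECURITY + DOCUMENTATION,
}


def agents_for(preset: str) -> list[str]:
    """Return the agents scaffolded for a preset, or raise on an unknown name."""
    try:
        return PRESETS[preset]
    except KeyError:
        raise ValueError(f"unknown preset: {preset!r}") from None


for name in PRESETS:
    print(f"{name}: {len(agents_for(name))} agents")
```

The resulting counts (5, 8, and 20) agree with the preset summary table.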
|
|
164
|
+
|
|
165
|
+
## Agents
|
|
166
|
+
|
|
167
|
+
### Pipeline (8 agents)
|
|
168
|
+
|
|
169
|
+
Core development cycle — from analysis to code review.
|
|
170
|
+
|
|
171
|
+
| Agent | Description | Verdicts |
|
|
172
|
+
|-------|-------------|----------|
|
|
173
|
+
| **analyst** | Analyzes task requirements from documentation, extracts acceptance criteria and PMI scenarios | `COMPLETE`, `NEEDS_DISCOVERY` |
|
|
174
|
+
| **architect** | Designs module architecture — structure, API contracts, data schemas | `READY`, `NEEDS_INPUT` |
|
|
175
|
+
| **skeptic** | Reality checker — verifies plans against actual codebase, finds "mirages" (non-existent files, APIs, modules) | `PASS`, `PASS_WITH_WARNINGS`, `FAIL` |
|
|
176
|
+
| **developer** | Writes code following project patterns, makes failing TDD tests green | `DONE`, `BLOCKED` |
|
|
177
|
+
| **tester** | Parametric testing agent — unit, integration, acceptance, smoke. Supports TDD mode | `PASS`, `FAIL` |
|
|
178
|
+
| **inspector** | Validates test quality — coverage, naming, assertions, mocking, isolation, edge cases | `APPROVE`, `REQUEST_CHANGES` |
|
|
179
|
+
| **reviewer** | Code review with iterations and escalation. Modes: default, plan_review, quick | `APPROVE`, `REQUEST_CHANGES`, `ESCALATE` |
|
|
180
|
+
| **planner** | Project-level planning — milestones, tasks, dependencies, completeness validation | `VALID`, `ISSUES_FOUND` |
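
As a rough illustration of how an orchestrator might consume these verdicts, the Python sketch below parses an agent's JSON output and checks the verdict against the table. The verdict sets are copied from the table. The `agent` and `verdict` field names are assumptions, since the README does not show the JSON schema.

```python
import json

# Allowed verdicts per pipeline agent, copied from the table.
ALLOWED_VERDICTS = {
    "analyst": {"COMPLETE", "NEEDS_DISCOVERY"},
    "architect": {"READY", "NEEDS_INPUT"},
    "skeptic": {"PASS", "PASS_WITH_WARNINGS", "FAIL"},
    "developer": {"DONE", "BLOCKED"},
    "tester": {"PASS", "FAIL"},
    "inspector": {"APPROVE", "REQUEST_CHANGES"},
    "reviewer": {"APPROVE", "REQUEST_CHANGES", "ESCALATE"},
    "planner": {"VALID", "ISSUES_FOUND"},
}


def parse_verdict(raw: str) -> tuple[str, str]:
    """Parse an agent's JSON output and check the verdict is valid for that agent.

    The "agent" and "verdict" keys are hypothetical field names.
    """
    data = json.loads(raw)
    agent, verdict = data["agent"], data["verdict"]
    if verdict not in ALLOWED_VERDICTS.get(agent, set()):
        raise ValueError(f"{agent!r} may not return {verdict!r}")
    return agent, verdict


print(parse_verdict('{"agent": "skeptic", "verdict": "PASS_WITH_WARNINGS"}'))
```

Rejecting a verdict that an agent is not allowed to return keeps a malformed response from silently advancing the pipeline.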
|
|
181
|
+
|
|
182
|
+
### Planning (4 agents)
|
|
183
|
+
|
|
184
|
+
Deep analysis and task decomposition. Available in the **full** preset.
|
|
185
|
+
|
|
186
|
+
| Agent | Description |
|
|
187
|
+
|-------|-------------|
|
|
188
|
+
| **researcher** | Codebase exploration — entry points, patterns, dependencies, integrations |
|
|
189
|
+
| **interviewer** | Structured discovery interview — 3 cycles (general, code-informed, edge cases) |
|
|
190
|
+
| **validator** | Specification validation — 4 modes: userspec, techspec, task, completeness |
|
|
191
|
+
| **decomposer** | Task decomposition — generates atomic tasks with TDD anchors, acceptance criteria, and verify steps |
|
|
192
|
+
|
|
193
|
+
### Security (4 agents)
|
|
194
|
+
|
|
195
|
+
Security audits and infrastructure review. Available in the **full** preset.
|
|
196
|
+
|
|
197
|
+
| Agent | Description |
|
|
198
|
+
|-------|-------------|
|
|
199
|
+
| **auditor** | Security analysis — OWASP Top 10, hardcoded secrets, threat modeling, access control |
|
|
200
|
+
| **prompter** | LLM prompt review — clarity, few-shot quality, output format, injection safety, token efficiency |
|
|
201
|
+
| **deployer** | CI/CD review — workflow correctness, secrets management, platform config, deploy scripts |
|
|
202
|
+
| **scaffolder** | Project infrastructure review — structure, Docker, pre-commit hooks, .gitignore, dependency management |
|
|
203
|
+
|
|
204
|
+
### Documentation (4 agents)
|
|
205
|
+
|
|
206
|
+
Documentation quality and deployment validation. Available in the **full** preset.
|
|
207
|
+
|
|
208
|
+
| Agent | Description |
|
|
209
|
+
|-------|-------------|
|
|
210
|
+
| **librarian** | Documentation review — completeness, freshness, absence of bloat, consistency |
|
|
211
|
+
| **writer** | Generates stakeholder-facing reports and internal documentation |
|
|
212
|
+
| **gatekeeper** | Pre-deploy QA — runs tests, verifies acceptance criteria, checks deferred criteria |
|
|
213
|
+
| **verifier** | Post-deploy QA — live environment verification, manual verification plans |
|
|
214
|
+
|
|
215
|
+
## Skills (Slash Commands)
|
|
216
|
+
|
|
217
|
+
### Core skills (all presets) — 10 commands
|
|
58
218
|
|
|
59
219
|
| Command | Description |
|
|
60
220
|
|---------|-------------|
|
|
61
|
-
| `/start-session` | Begin work
|
|
62
|
-
| `/end-session` | Save context, create session log, commit
|
|
63
|
-
| `/take-task [id]` | Full development cycle
|
|
64
|
-
| `/complete-task [id]` | Verify task
|
|
221
|
+
| `/start-session` | Begin work — sync repo, check checkpoint, load context, show progress |
|
|
222
|
+
| `/end-session` | Save context, checkpoint, create session log, commit and push |
|
|
223
|
+
| `/take-task [id]` | Full development cycle with feature-size routing (S/M/L) |
|
|
224
|
+
| `/complete-task [id]` | Verify task, smoke test, update progress, clear checkpoint |
|
|
65
225
|
| `/status` | Show project status, deadlines, blockers |
|
|
66
|
-
| `/plan [mode]` | Plan
|
|
67
|
-
| `/review [file]` | Code review for file or task |
|
|
226
|
+
| `/plan [mode]` | Plan, replan, or validate tasks from documentation |
|
|
227
|
+
| `/review [file]` | Code review for a file or task |
|
|
228
|
+
| `/code [task]` | Direct code generation for a specific task |
|
|
229
|
+
| `/test [target]` | Run or generate tests for a target |
|
|
230
|
+
| `/done [id]` | Quick-complete a task with minimal ceremony |
|
|
68
231
|
|
|
69
|
-
###
|
|
232
|
+
### Extra skills (full preset only) — 11 commands
|
|
70
233
|
|
|
71
|
-
|
|
72
|
-
|
|
73
|
-
|
|
74
|
-
|
|
75
|
-
|
|
76
|
-
|
|
77
|
-
[
|
|
78
|
-
[
|
|
79
|
-
[
|
|
80
|
-
[
|
|
81
|
-
[
|
|
234
|
+
| Command | Description |
|
|
235
|
+
|---------|-------------|
|
|
236
|
+
| `/interview` | Structured discovery interview (3 cycles, completeness >= 85%) |
|
|
237
|
+
| `/audit-wave` | Comprehensive pre-milestone audit with GO/NO-GO verdict |
|
|
238
|
+
| `/write-report` | Generate non-technical progress report for stakeholders |
|
|
239
|
+
| `/dashboard` | Project dashboard — progress, health, tech debt, activity |
|
|
240
|
+
| `/skill-master [name]` | Create a new custom skill from template |
|
|
241
|
+
| `/decompose [task]` | Break down a task into subtasks with TDD anchors |
|
|
242
|
+
| `/feature [name]` | Scaffold a new feature end-to-end |
|
|
243
|
+
| `/security [target]` | Run security analysis on a target |
|
|
244
|
+
| `/spec [feature]` | Generate specification for a feature |
|
|
245
|
+
| `/techspec [module]` | Generate technical specification for a module |
|
|
246
|
+
| `/prompts [agent]` | Manage and optimize agent prompts |
|
|
247
|
+
|
|
248
|
+
## Rules
|
|
249
|
+
|
|
250
|
+
8 development standards that are loaded automatically and enforced across all agents:
|
|
251
|
+
|
|
252
|
+
| Rule | Purpose |
|
|
253
|
+
|------|---------|
|
|
254
|
+
| `commit-conventions` | Commit message format — `[type](scope): description` |
|
|
255
|
+
| `development-cycle` | Feature-size routing (S/M/L) and pipeline step definitions |
|
|
256
|
+
| `testing-standards` | Test coverage >= 80%, edge cases, access control testing |
|
|
257
|
+
| `shared-resources` | Singleton resource registry — no duplicate DB connections or API clients |
|
|
258
|
+
| `context-loading` | Just-in-time context loading — pass data, not file references |
|
|
259
|
+
| `agent-output-format` | JSON output standard for all agents with structured verdicts |
|
|
260
|
+
| `quality-gates` | Verdict-based routing between pipeline steps |
|
|
261
|
+
| `rollback-protocol` | Rollback procedures for failed deployments |
|
|
262
|
+
|
|
263
|
+
## Quality Gates
|
|
264
|
+
|
|
265
|
+
Every agent returns a structured verdict. The Team Lead reads the verdict and routes the pipeline:
|
|
266
|
+
|
|
267
|
+
```mermaid
|
|
268
|
+
graph TD
|
|
269
|
+
A["Agent returns verdict"] --> B{"Verdict type?"}
|
|
270
|
+
B -->|"PASS / APPROVE / DONE"| C["Continue pipeline"]
|
|
271
|
+
B -->|"WARNINGS / ATTENTION"| D["Show to user,<br/>continue"]
|
|
272
|
+
B -->|"FAIL / BLOCKED"| E{"Retry count < 3?"}
|
|
273
|
+
E -->|"Yes"| F["Return to<br/>previous step"]
|
|
274
|
+
E -->|"No"| G["Escalate to user"]
|
|
275
|
+
F --> A
|
|
276
|
+
|
|
277
|
+
style C fill:#d4edda,stroke:#28a745
|
|
278
|
+
style D fill:#fff3cd,stroke:#ffc107
|
|
279
|
+
style G fill:#f8d7da,stroke:#dc3545
|
|
82
280
|
```
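
The decision flow in the diagram can be sketched as a small Python function. The verdict groups and the retry limit of 3 come from the diagram. The function name `route` and the returned action strings are hypothetical labels chosen for this example.

```python
MAX_RETRIES = 3  # from the "Retry count < 3?" decision in the diagram

CONTINUE = {"PASS", "APPROVE", "DONE"}
WARN = {"WARNINGS", "ATTENTION"}
FAILED = {"FAIL", "BLOCKED"}


def route(verdict: str, retry_count: int) -> str:
    """Map a verdict and the current retry count to the next action."""
    if verdict in CONTINUE:
        return "continue"
    if verdict in WARN:
        return "show_to_user_and_continue"
    if verdict in FAILED:
        if retry_count < MAX_RETRIES:
            return "return_to_previous_step"
        return "escalate_to_user"
    raise ValueError(f"unrecognized verdict: {verdict!r}")


# A step that keeps failing is retried three times, then escalated.
for attempt in range(4):
    print(attempt, route("FAIL", attempt))
```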
|
|
83
281
|
|
|
84
|
-
|
|
282
|
+
### Verdict matrix
|
|
283
|
+
|
|
284
|
+
```mermaid
|
|
285
|
+
graph LR
|
|
286
|
+
subgraph "Analysis"
|
|
287
|
+
A1["analyst"] -->|COMPLETE| A2["architect"]
|
|
288
|
+
A1 -->|NEEDS_DISCOVERY| A3["ask user"]
|
|
289
|
+
A3 --> A1
|
|
290
|
+
end
|
|
291
|
+
|
|
292
|
+
subgraph "Architecture"
|
|
293
|
+
A2 -->|READY| SK["skeptic"]
|
|
294
|
+
A2 -->|NEEDS_INPUT| A4["ask user"]
|
|
295
|
+
A4 --> A2
|
|
296
|
+
end
|
|
297
|
+
|
|
298
|
+
subgraph "Reality Check"
|
|
299
|
+
SK -->|PASS| DEV["developer"]
|
|
300
|
+
SK -->|FAIL| A2
|
|
301
|
+
end
|
|
302
|
+
|
|
303
|
+
subgraph "Code + Tests"
|
|
304
|
+
DEV -->|DONE| TST["tester"]
|
|
305
|
+
TST -->|PASS| INS["inspector"]
|
|
306
|
+
TST -->|FAIL| DEV
|
|
307
|
+
INS -->|APPROVE| REV["reviewer"]
|
|
308
|
+
INS -->|REQUEST_CHANGES| TST
|
|
309
|
+
end
|
|
310
|
+
|
|
311
|
+
subgraph "Review"
|
|
312
|
+
REV -->|APPROVE| FIN["finalize"]
|
|
313
|
+
REV -->|REQUEST_CHANGES| DEV
|
|
314
|
+
end
|
|
315
|
+
|
|
316
|
+
style FIN fill:#d4edda,stroke:#28a745
|
|
317
|
+
```
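
The verdict matrix is effectively a transition table, which the following Python sketch writes out as a dictionary. The edges are transcribed from the diagram. The `walk` helper is hypothetical, and the return edges from "ask user" back to the asking agent are left out for brevity.

```python
# (agent, verdict) -> next step, transcribed from the verdict matrix diagram.
TRANSITIONS = {
    ("analyst", "COMPLETE"): "architect",
    ("analyst", "NEEDS_DISCOVERY"): "ask user",
    ("architect", "READY"): "skeptic",
    ("architect", "NEEDS_INPUT"): "ask user",
    ("skeptic", "PASS"): "developer",
    ("skeptic", "FAIL"): "architect",
    ("developer", "DONE"): "tester",
    ("tester", "PASS"): "inspector",
    ("tester", "FAIL"): "developer",
    ("inspector", "APPROVE"): "reviewer",
    ("inspector", "REQUEST_CHANGES"): "tester",
    ("reviewer", "APPROVE"): "finalize",
    ("reviewer", "REQUEST_CHANGES"): "developer",
}


def walk(start: str, verdicts: list[str]) -> list[str]:
    """Follow the table from `start`, consuming one verdict per step."""
    path = [start]
    for verdict in verdicts:
        path.append(TRANSITIONS[(path[-1], verdict)])
    return path


# Happy path with one failed test run along the way.
print(walk("analyst", ["COMPLETE", "READY", "PASS", "DONE", "FAIL",
                       "DONE", "PASS", "APPROVE", "APPROVE"]))
```

In this run the failed test sends control back to the developer once before the path reaches `finalize`.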
|
|
85
318
|
|
|
86
|
-
|
|
319
|
+
## CLI Commands
|
|
87
320
|
|
|
88
|
-
-
|
|
89
|
-
- **Full** (11 agents) — adds planner, integration tester, acceptance tester
|
|
90
|
-
- **Minimal** (4 agents) — analyst, developer, unit-tester, reviewer
|
|
321
|
+
### `agent-forge init`
|
|
91
322
|
|
|
92
|
-
|
|
323
|
+
Initialize AI-driven development infrastructure in the current directory.
|
|
93
324
|
|
|
94
325
|
```bash
|
|
95
|
-
npx agent-forge init
|
|
326
|
+
npx @alxyrgin/agent-forge init # interactive setup
|
|
327
|
+
npx @alxyrgin/agent-forge init --yes # use defaults (TypeScript, core preset)
|
|
328
|
+
npx @alxyrgin/agent-forge init --overwrite # overwrite existing files
|
|
96
329
|
```
|
|
97
330
|
|
|
98
|
-
|
|
331
|
+
The wizard prompts for:
|
|
99
332
|
- Project name and description
|
|
100
|
-
- Technology stack (Python/TypeScript/Go/Rust)
|
|
333
|
+
- Technology stack (Python / TypeScript / Go / Rust)
|
|
101
334
|
- Framework and test framework
|
|
102
335
|
- Team members (names, roles, emails)
|
|
103
336
|
- Milestones (optional)
|
|
104
|
-
- Agent preset (core/full
|
|
105
|
-
- Commit style (standard/conventional)
|
|
337
|
+
- Agent preset (minimal / core / full)
|
|
338
|
+
- Commit style (standard / conventional)
|
|
339
|
+
|
|
340
|
+
### `agent-forge update`
|
|
106
341
|
|
|
107
|
-
|
|
342
|
+
Update framework files while preserving your data.
|
|
108
343
|
|
|
109
344
|
```bash
|
|
110
|
-
npx agent-forge
|
|
345
|
+
npx @alxyrgin/agent-forge update
|
|
111
346
|
```
|
|
112
347
|
|
|
113
|
-
|
|
348
|
+
**Overwritten** (updated to latest version):
|
|
349
|
+
- `.claude/CLAUDE.md`
|
|
350
|
+
- `.claude/agents/*`
|
|
351
|
+
- `.claude/skills/*`
|
|
352
|
+
- `.claude/rules/*`
|
|
353
|
+
- `.claude/hooks/*`
|
|
354
|
+
- `.claude/settings.json`
|
|
114
355
|
|
|
115
|
-
|
|
356
|
+
**Preserved** (your data stays intact):
|
|
357
|
+
- `dev-infra/memory/*` — your Memory Bank
|
|
358
|
+
- `dev-infra/tasks/*` — your task tracking
|
|
359
|
+
- `dev-infra/sessions/*` — your session logs
|
|
360
|
+
- `dev-infra/tests/*` — your test structure
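
The split between overwritten and preserved paths can be expressed as pattern matching. The Python sketch below copies the patterns from the two lists. The `update_action` helper is hypothetical, and it assumes that files matching neither list are left untouched.

```python
from fnmatch import fnmatch

# Patterns copied from the two lists.
OVERWRITTEN = [".claude/CLAUDE.md", ".claude/agents/*", ".claude/skills/*",
               ".claude/rules/*", ".claude/hooks/*", ".claude/settings.json"]
PRESERVED = ["dev-infra/memory/*", "dev-infra/tasks/*",
             "dev-infra/sessions/*", "dev-infra/tests/*"]


def update_action(path: str) -> str:
    """Return 'overwrite', 'preserve', or 'untouched' for a project-relative path."""
    if any(fnmatch(path, pattern) for pattern in PRESERVED):
        return "preserve"
    if any(fnmatch(path, pattern) for pattern in OVERWRITTEN):
        return "overwrite"
    return "untouched"  # assumption: files outside both lists are left alone


for p in [".claude/agents/pipeline/analyst.md",
          "dev-infra/memory/progress.md",
          "src/app.ts"]:
    print(p, "->", update_action(p))
```

Python's `fnmatch` lets `*` match path separators, so nested files such as `.claude/agents/pipeline/analyst.md` fall under `.claude/agents/*`.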
|
|
116
361
|
|
|
117
|
-
|
|
362
|
+
### `agent-forge doctor`
|
|
118
363
|
|
|
119
|
-
|
|
120
|
-
- `--yes, -y` — skip prompts, use defaults
|
|
121
|
-
- `--overwrite` — overwrite existing files
|
|
364
|
+
Check integrity of the generated structure. Verifies that all expected files exist and are not empty.
|
|
122
365
|
|
|
123
|
-
|
|
366
|
+
```bash
|
|
367
|
+
npx @alxyrgin/agent-forge doctor
|
|
368
|
+
```
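
A check of this kind (every expected file exists and is not empty) might look like the Python sketch below. It does not reproduce the `doctor` command: the `find_problems` helper and the `EXPECTED` list are hypothetical, and the real list of expected files is not shown in this README.

```python
from pathlib import Path


def find_problems(root: Path, expected: list[str]) -> dict[str, str]:
    """Report expected files that are missing or empty under `root`."""
    problems = {}
    for rel in expected:
        path = root / rel
        if not path.is_file():
            problems[rel] = "missing"
        elif path.stat().st_size == 0:
            problems[rel] = "empty"
    return problems


# Hypothetical subset of expected files, chosen only for illustration.
EXPECTED = [".claude/CLAUDE.md", ".claude/settings.json",
            "dev-infra/memory/checkpoint.yml", "dev-infra/tasks/tasks.json"]

if __name__ == "__main__":
    for rel, reason in find_problems(Path.cwd(), EXPECTED).items():
        print(f"{reason}: {rel}")
```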
|
|
369
|
+
|
|
370
|
+
## Memory Bank
|
|
371
|
+
|
|
372
|
+
9 files that persist context across sessions. The Team Lead reads and updates these automatically.
|
|
373
|
+
|
|
374
|
+
| File | Purpose |
|
|
375
|
+
|------|---------|
|
|
376
|
+
| `active-context.md` | Current session state — what is done, what is next |
|
|
377
|
+
| `progress.md` | Milestone progress, task statuses |
|
|
378
|
+
| `project-brief.md` | Project overview, team, stack |
|
|
379
|
+
| `decisions.md` | Architectural Decision Records (ADR) |
|
|
380
|
+
| `tech-stack.md` | Technology stack details |
|
|
381
|
+
| `tech-debt.md` | Technical debt registry with lifecycle tracking (open / in_progress / resolved) |
|
|
382
|
+
| `patterns.md` | Code patterns and conventions |
|
|
383
|
+
| `troubleshooting.md` | Problem solutions log |
|
|
384
|
+
| `checkpoint.yml` | Recovery checkpoint for interrupted sessions |
|
|
124
385
|
|
|
125
|
-
|
|
386
|
+
### Checkpoint System
|
|
126
387
|
|
|
127
|
-
|
|
388
|
+
The checkpoint (`dev-infra/memory/checkpoint.yml`) enables recovery after session interruptions:
|
|
389
|
+
|
|
390
|
+
- **Automatic saving** — updated after each pipeline step
|
|
391
|
+
- **Recovery on start** — `/start-session` detects an active checkpoint and offers to resume
|
|
392
|
+
- **Cleanup on completion** — `/complete-task` clears the checkpoint
|
|
393
|
+
|
|
394
|
+
```mermaid
|
|
395
|
+
graph LR
|
|
396
|
+
A["Session interrupted"] --> B["checkpoint.yml<br/>saved automatically"]
|
|
397
|
+
B --> C["Next session:<br/>/start-session"]
|
|
398
|
+
C --> D{"Active<br/>checkpoint?"}
|
|
399
|
+
D -->|"Yes"| E["Offer to resume<br/>from last step"]
|
|
400
|
+
D -->|"No"| F["Fresh start"]
|
|
401
|
+
|
|
402
|
+
style B fill:#fff3cd,stroke:#ffc107
|
|
403
|
+
style E fill:#d4edda,stroke:#28a745
|
|
404
|
+
```
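
The recovery flow can be modelled in a few lines of Python. In this sketch a plain dictionary stands in for the parsed `checkpoint.yml`. The keys `active`, `task_id`, and `last_step`, and both function names, are assumptions because the file's schema is not shown here.

```python
from typing import Optional


def start_session(checkpoint: Optional[dict]) -> str:
    """Decide what /start-session should do given the loaded checkpoint.

    `checkpoint` stands in for the parsed contents of checkpoint.yml; the
    "active", "task_id", and "last_step" keys are hypothetical.
    """
    if checkpoint and checkpoint.get("active"):
        task, step = checkpoint["task_id"], checkpoint["last_step"]
        return f"offer to resume {task} from step {step!r}"
    return "fresh start"


def complete_task(checkpoint: dict) -> dict:
    """Model /complete-task clearing the checkpoint."""
    return {**checkpoint, "active": False}


interrupted = {"active": True, "task_id": "T-12", "last_step": "tester"}
print(start_session(interrupted))
print(start_session(complete_task(interrupted)))
print(start_session(None))
```

Only the first call offers to resume. A cleared checkpoint and a missing one both lead to a fresh start.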
|
|
128
405
|
|
|
129
406
|
## Generated Structure
|
|
130
407
|
|
|
131
408
|
```
|
|
132
409
|
your-project/
|
|
133
410
|
├── .claude/
|
|
134
|
-
│ ├── CLAUDE.md
|
|
135
|
-
│ ├── settings.json
|
|
136
|
-
│ ├──
|
|
137
|
-
│ │
|
|
138
|
-
│
|
|
139
|
-
│ │ ├── developer
|
|
140
|
-
│ │
|
|
141
|
-
│ ├──
|
|
411
|
+
│ ├── CLAUDE.md # Team Lead instructions
|
|
412
|
+
│ ├── settings.json # Claude Code hooks and env
|
|
413
|
+
│ ├── hooks/
|
|
414
|
+
│ │ └── protect-docs.sh # PreToolUse hook — blocks edits in docs/
|
|
415
|
+
│ ├── agents/
|
|
416
|
+
│ │ ├── pipeline/ # analyst, architect, skeptic, developer,
|
|
417
|
+
│ │ │ # tester, inspector, reviewer, planner
|
|
418
|
+
│ │ ├── planning/ # researcher, validator, interviewer, decomposer
|
|
419
|
+
│ │ ├── security/ # auditor, prompter, deployer, scaffolder
|
|
420
|
+
│ │ └── documentation/ # librarian, writer, gatekeeper, verifier
|
|
421
|
+
│ ├── skills/
|
|
142
422
|
│ │ ├── start-session/SKILL.md
|
|
143
423
|
│ │ ├── take-task/SKILL.md
|
|
144
|
-
│ │
|
|
145
|
-
│
|
|
424
|
+
│ │ ├── code/SKILL.md
|
|
425
|
+
│ │ ├── test/SKILL.md
|
|
426
|
+
│ │ └── ... # 10–21 slash commands
|
|
427
|
+
│ └── rules/
|
|
146
428
|
│ ├── commit-conventions.md
|
|
147
429
|
│ ├── development-cycle.md
|
|
148
|
-
│
|
|
430
|
+
│ ├── testing-standards.md
|
|
431
|
+
│ ├── shared-resources.md
|
|
432
|
+
│ ├── context-loading.md
|
|
433
|
+
│ ├── agent-output-format.md
|
|
434
|
+
│ ├── quality-gates.md
|
|
435
|
+
│ └── rollback-protocol.md
|
|
149
436
|
├── dev-infra/
|
|
150
|
-
│ ├── memory/
|
|
437
|
+
│ ├── memory/ # 9 Memory Bank files
|
|
151
438
|
│ │ ├── active-context.md
|
|
152
439
|
│ │ ├── progress.md
|
|
440
|
+
│ │ ├── checkpoint.yml
|
|
153
441
|
│ │ └── ...
|
|
154
442
|
│ ├── tasks/
|
|
155
|
-
│ │ └── tasks.json
|
|
156
|
-
│ ├── sessions/
|
|
157
|
-
│ └── tests/
|
|
158
|
-
│ ├── acceptance/
|
|
159
|
-
│ ├── pmi/
|
|
160
|
-
│ └── results/
|
|
161
|
-
└── .claude-forge.json
|
|
443
|
+
│ │ └── tasks.json # Task tracking
|
|
444
|
+
│ ├── sessions/ # Session logs
|
|
445
|
+
│ └── tests/
|
|
446
|
+
│ ├── acceptance/ # Acceptance criteria
|
|
447
|
+
│ ├── pmi/ # PMI scenarios
|
|
448
|
+
│ └── results/ # Test results
|
|
449
|
+
└── .claude-forge.json # Manifest for doctor and update
|
|
162
450
|
```
|
|
163
451
|
|
|
164
452
|
## License
|