agentic-swe 1.0.0

Files changed (191)
  1. package/.claude/agents/developer.md +133 -0
  2. package/.claude/agents/git-ops.md +94 -0
  3. package/.claude/agents/panel/adversarial.md +35 -0
  4. package/.claude/agents/panel/architect.md +36 -0
  5. package/.claude/agents/panel/security.md +36 -0
  6. package/.claude/agents/pr-manager.md +76 -0
  7. package/.claude/agents/subagents/01-core-development/api-designer.md +237 -0
  8. package/.claude/agents/subagents/01-core-development/backend-developer.md +222 -0
  9. package/.claude/agents/subagents/01-core-development/electron-pro.md +251 -0
  10. package/.claude/agents/subagents/01-core-development/frontend-developer.md +159 -0
  11. package/.claude/agents/subagents/01-core-development/fullstack-developer.md +246 -0
  12. package/.claude/agents/subagents/01-core-development/graphql-architect.md +238 -0
  13. package/.claude/agents/subagents/01-core-development/microservices-architect.md +239 -0
  14. package/.claude/agents/subagents/01-core-development/mobile-developer.md +283 -0
  15. package/.claude/agents/subagents/01-core-development/ui-designer.md +200 -0
  16. package/.claude/agents/subagents/01-core-development/websocket-engineer.md +150 -0
  17. package/.claude/agents/subagents/02-language-specialists/angular-architect.md +287 -0
  18. package/.claude/agents/subagents/02-language-specialists/cpp-pro.md +277 -0
  19. package/.claude/agents/subagents/02-language-specialists/csharp-developer.md +287 -0
  20. package/.claude/agents/subagents/02-language-specialists/django-developer.md +287 -0
  21. package/.claude/agents/subagents/02-language-specialists/dotnet-core-expert.md +287 -0
  22. package/.claude/agents/subagents/02-language-specialists/dotnet-framework-4.8-expert.md +306 -0
  23. package/.claude/agents/subagents/02-language-specialists/elixir-expert.md +311 -0
  24. package/.claude/agents/subagents/02-language-specialists/expo-react-native-expert.md +268 -0
  25. package/.claude/agents/subagents/02-language-specialists/fastapi-developer.md +287 -0
  26. package/.claude/agents/subagents/02-language-specialists/flutter-expert.md +287 -0
  27. package/.claude/agents/subagents/02-language-specialists/golang-pro.md +277 -0
  28. package/.claude/agents/subagents/02-language-specialists/java-architect.md +287 -0
  29. package/.claude/agents/subagents/02-language-specialists/javascript-pro.md +277 -0
  30. package/.claude/agents/subagents/02-language-specialists/kotlin-specialist.md +287 -0
  31. package/.claude/agents/subagents/02-language-specialists/laravel-specialist.md +287 -0
  32. package/.claude/agents/subagents/02-language-specialists/nextjs-developer.md +298 -0
  33. package/.claude/agents/subagents/02-language-specialists/php-pro.md +287 -0
  34. package/.claude/agents/subagents/02-language-specialists/powershell-5.1-expert.md +59 -0
  35. package/.claude/agents/subagents/02-language-specialists/powershell-7-expert.md +57 -0
  36. package/.claude/agents/subagents/02-language-specialists/python-pro.md +277 -0
  37. package/.claude/agents/subagents/02-language-specialists/rails-expert.md +358 -0
  38. package/.claude/agents/subagents/02-language-specialists/react-specialist.md +298 -0
  39. package/.claude/agents/subagents/02-language-specialists/rust-engineer.md +287 -0
  40. package/.claude/agents/subagents/02-language-specialists/spring-boot-engineer.md +287 -0
  41. package/.claude/agents/subagents/02-language-specialists/sql-pro.md +287 -0
  42. package/.claude/agents/subagents/02-language-specialists/swift-expert.md +287 -0
  43. package/.claude/agents/subagents/02-language-specialists/symfony-specialist.md +354 -0
  44. package/.claude/agents/subagents/02-language-specialists/typescript-pro.md +277 -0
  45. package/.claude/agents/subagents/02-language-specialists/vue-expert.md +298 -0
  46. package/.claude/agents/subagents/03-infrastructure/azure-infra-engineer.md +53 -0
  47. package/.claude/agents/subagents/03-infrastructure/cloud-architect.md +277 -0
  48. package/.claude/agents/subagents/03-infrastructure/database-administrator.md +287 -0
  49. package/.claude/agents/subagents/03-infrastructure/deployment-engineer.md +287 -0
  50. package/.claude/agents/subagents/03-infrastructure/devops-engineer.md +287 -0
  51. package/.claude/agents/subagents/03-infrastructure/devops-incident-responder.md +287 -0
  52. package/.claude/agents/subagents/03-infrastructure/docker-expert.md +278 -0
  53. package/.claude/agents/subagents/03-infrastructure/incident-responder.md +287 -0
  54. package/.claude/agents/subagents/03-infrastructure/kubernetes-specialist.md +287 -0
  55. package/.claude/agents/subagents/03-infrastructure/network-engineer.md +287 -0
  56. package/.claude/agents/subagents/03-infrastructure/platform-engineer.md +287 -0
  57. package/.claude/agents/subagents/03-infrastructure/security-engineer.md +277 -0
  58. package/.claude/agents/subagents/03-infrastructure/sre-engineer.md +287 -0
  59. package/.claude/agents/subagents/03-infrastructure/terraform-engineer.md +287 -0
  60. package/.claude/agents/subagents/03-infrastructure/terragrunt-expert.md +307 -0
  61. package/.claude/agents/subagents/03-infrastructure/windows-infra-admin.md +52 -0
  62. package/.claude/agents/subagents/04-quality-security/accessibility-tester.md +277 -0
  63. package/.claude/agents/subagents/04-quality-security/ad-security-reviewer.md +56 -0
  64. package/.claude/agents/subagents/04-quality-security/architect-reviewer.md +287 -0
  65. package/.claude/agents/subagents/04-quality-security/chaos-engineer.md +277 -0
  66. package/.claude/agents/subagents/04-quality-security/code-reviewer.md +287 -0
  67. package/.claude/agents/subagents/04-quality-security/compliance-auditor.md +277 -0
  68. package/.claude/agents/subagents/04-quality-security/debugger.md +287 -0
  69. package/.claude/agents/subagents/04-quality-security/error-detective.md +287 -0
  70. package/.claude/agents/subagents/04-quality-security/penetration-tester.md +287 -0
  71. package/.claude/agents/subagents/04-quality-security/performance-engineer.md +287 -0
  72. package/.claude/agents/subagents/04-quality-security/powershell-security-hardening.md +54 -0
  73. package/.claude/agents/subagents/04-quality-security/qa-expert.md +287 -0
  74. package/.claude/agents/subagents/04-quality-security/security-auditor.md +287 -0
  75. package/.claude/agents/subagents/04-quality-security/test-automator.md +287 -0
  76. package/.claude/agents/subagents/05-data-ai/ai-engineer.md +287 -0
  77. package/.claude/agents/subagents/05-data-ai/data-analyst.md +277 -0
  78. package/.claude/agents/subagents/05-data-ai/data-engineer.md +287 -0
  79. package/.claude/agents/subagents/05-data-ai/data-scientist.md +287 -0
  80. package/.claude/agents/subagents/05-data-ai/database-optimizer.md +287 -0
  81. package/.claude/agents/subagents/05-data-ai/llm-architect.md +287 -0
  82. package/.claude/agents/subagents/05-data-ai/machine-learning-engineer.md +277 -0
  83. package/.claude/agents/subagents/05-data-ai/ml-engineer.md +287 -0
  84. package/.claude/agents/subagents/05-data-ai/mlops-engineer.md +287 -0
  85. package/.claude/agents/subagents/05-data-ai/nlp-engineer.md +287 -0
  86. package/.claude/agents/subagents/05-data-ai/postgres-pro.md +287 -0
  87. package/.claude/agents/subagents/05-data-ai/prompt-engineer.md +287 -0
  88. package/.claude/agents/subagents/05-data-ai/reinforcement-learning-engineer.md +277 -0
  89. package/.claude/agents/subagents/06-developer-experience/build-engineer.md +286 -0
  90. package/.claude/agents/subagents/06-developer-experience/cli-developer.md +286 -0
  91. package/.claude/agents/subagents/06-developer-experience/dependency-manager.md +286 -0
  92. package/.claude/agents/subagents/06-developer-experience/documentation-engineer.md +276 -0
  93. package/.claude/agents/subagents/06-developer-experience/dx-optimizer.md +286 -0
  94. package/.claude/agents/subagents/06-developer-experience/git-workflow-manager.md +286 -0
  95. package/.claude/agents/subagents/06-developer-experience/legacy-modernizer.md +286 -0
  96. package/.claude/agents/subagents/06-developer-experience/mcp-developer.md +275 -0
  97. package/.claude/agents/subagents/06-developer-experience/powershell-module-architect.md +58 -0
  98. package/.claude/agents/subagents/06-developer-experience/powershell-ui-architect.md +135 -0
  99. package/.claude/agents/subagents/06-developer-experience/refactoring-specialist.md +286 -0
  100. package/.claude/agents/subagents/06-developer-experience/slack-expert.md +232 -0
  101. package/.claude/agents/subagents/06-developer-experience/tooling-engineer.md +286 -0
  102. package/.claude/agents/subagents/07-specialized-domains/api-documenter.md +277 -0
  103. package/.claude/agents/subagents/07-specialized-domains/blockchain-developer.md +287 -0
  104. package/.claude/agents/subagents/07-specialized-domains/embedded-systems.md +287 -0
  105. package/.claude/agents/subagents/07-specialized-domains/fintech-engineer.md +287 -0
  106. package/.claude/agents/subagents/07-specialized-domains/game-developer.md +287 -0
  107. package/.claude/agents/subagents/07-specialized-domains/iot-engineer.md +287 -0
  108. package/.claude/agents/subagents/07-specialized-domains/m365-admin.md +48 -0
  109. package/.claude/agents/subagents/07-specialized-domains/mobile-app-developer.md +287 -0
  110. package/.claude/agents/subagents/07-specialized-domains/payment-integration.md +287 -0
  111. package/.claude/agents/subagents/07-specialized-domains/quant-analyst.md +287 -0
  112. package/.claude/agents/subagents/07-specialized-domains/risk-manager.md +287 -0
  113. package/.claude/agents/subagents/07-specialized-domains/seo-specialist.md +184 -0
  114. package/.claude/agents/subagents/08-business-product/business-analyst.md +287 -0
  115. package/.claude/agents/subagents/08-business-product/content-marketer.md +287 -0
  116. package/.claude/agents/subagents/08-business-product/customer-success-manager.md +287 -0
  117. package/.claude/agents/subagents/08-business-product/legal-advisor.md +287 -0
  118. package/.claude/agents/subagents/08-business-product/product-manager.md +287 -0
  119. package/.claude/agents/subagents/08-business-product/project-manager.md +287 -0
  120. package/.claude/agents/subagents/08-business-product/sales-engineer.md +287 -0
  121. package/.claude/agents/subagents/08-business-product/scrum-master.md +287 -0
  122. package/.claude/agents/subagents/08-business-product/technical-writer.md +287 -0
  123. package/.claude/agents/subagents/08-business-product/ux-researcher.md +287 -0
  124. package/.claude/agents/subagents/08-business-product/wordpress-master.md +316 -0
  125. package/.claude/agents/subagents/09-meta-orchestration/agent-installer.md +97 -0
  126. package/.claude/agents/subagents/09-meta-orchestration/agent-organizer.md +287 -0
  127. package/.claude/agents/subagents/09-meta-orchestration/context-manager.md +287 -0
  128. package/.claude/agents/subagents/09-meta-orchestration/error-coordinator.md +287 -0
  129. package/.claude/agents/subagents/09-meta-orchestration/it-ops-orchestrator.md +60 -0
  130. package/.claude/agents/subagents/09-meta-orchestration/knowledge-synthesizer.md +287 -0
  131. package/.claude/agents/subagents/09-meta-orchestration/multi-agent-coordinator.md +287 -0
  132. package/.claude/agents/subagents/09-meta-orchestration/performance-monitor.md +287 -0
  133. package/.claude/agents/subagents/09-meta-orchestration/task-distributor.md +287 -0
  134. package/.claude/agents/subagents/09-meta-orchestration/workflow-orchestrator.md +287 -0
  135. package/.claude/agents/subagents/10-research-analysis/competitive-analyst.md +287 -0
  136. package/.claude/agents/subagents/10-research-analysis/data-researcher.md +287 -0
  137. package/.claude/agents/subagents/10-research-analysis/market-researcher.md +287 -0
  138. package/.claude/agents/subagents/10-research-analysis/research-analyst.md +287 -0
  139. package/.claude/agents/subagents/10-research-analysis/scientific-literature-researcher.md +151 -0
  140. package/.claude/agents/subagents/10-research-analysis/search-specialist.md +287 -0
  141. package/.claude/agents/subagents/10-research-analysis/trend-analyst.md +287 -0
  142. package/.claude/commands/check.md +58 -0
  143. package/.claude/commands/ci-status.md +68 -0
  144. package/.claude/commands/conflict-resolver.md +76 -0
  145. package/.claude/commands/diff-review.md +123 -0
  146. package/.claude/commands/evaluate-work.md +25 -0
  147. package/.claude/commands/install.md +60 -0
  148. package/.claude/commands/lint.md +86 -0
  149. package/.claude/commands/plan-only.md +28 -0
  150. package/.claude/commands/repo-scan.md +96 -0
  151. package/.claude/commands/security-scan.md +98 -0
  152. package/.claude/commands/subagent.md +109 -0
  153. package/.claude/commands/test-runner.md +85 -0
  154. package/.claude/commands/work.md +76 -0
  155. package/.claude/phases/code-review.md +92 -0
  156. package/.claude/phases/completion.md +57 -0
  157. package/.claude/phases/design-review.md +66 -0
  158. package/.claude/phases/design.md +59 -0
  159. package/.claude/phases/escalate-code.md +34 -0
  160. package/.claude/phases/escalate-validation.md +33 -0
  161. package/.claude/phases/failed.md +35 -0
  162. package/.claude/phases/fast-implementation.md +59 -0
  163. package/.claude/phases/fast-path-check.md +46 -0
  164. package/.claude/phases/feasibility.md +80 -0
  165. package/.claude/phases/implementation.md +43 -0
  166. package/.claude/phases/permissions.md +42 -0
  167. package/.claude/phases/pr-created.md +50 -0
  168. package/.claude/phases/self-review.md +53 -0
  169. package/.claude/phases/subagent-selection.md +298 -0
  170. package/.claude/phases/test.md +68 -0
  171. package/.claude/phases/validation.md +58 -0
  172. package/.claude/phases/verification.md +45 -0
  173. package/.claude/references/frontend-aesthetics.md +91 -0
  174. package/.claude/references/github.md +73 -0
  175. package/.claude/templates/artifact-format.md +33 -0
  176. package/.claude/templates/audit.log +30 -0
  177. package/.claude/templates/evidence-standard.md +19 -0
  178. package/.claude/templates/phase-checklist.md +62 -0
  179. package/.claude/templates/progress.md +15 -0
  180. package/.claude/templates/state.json +108 -0
  181. package/.claude/tools/subagent-catalog/README.md +58 -0
  182. package/.claude/tools/subagent-catalog/config.sh +88 -0
  183. package/.claude/tools/subagent-catalog/fetch.md +54 -0
  184. package/.claude/tools/subagent-catalog/invalidate.md +47 -0
  185. package/.claude/tools/subagent-catalog/list.md +48 -0
  186. package/.claude/tools/subagent-catalog/search.md +41 -0
  187. package/CLAUDE.md +342 -0
  188. package/LICENSE +21 -0
  189. package/README.md +204 -0
  190. package/bin/agentic-swe.js +241 -0
  191. package/package.json +43 -0
@@ -0,0 +1,298 @@
1
+ # Subagent Auto-Selection Policy
2
+
3
+ ## Mission
4
+
5
+ Automatically select and spawn specialized subagents during pipeline execution based on evidence from the task, repository, and affected files. Selection is deterministic, evidence-based, and supplementary to core pipeline agents.
6
+
7
+ This policy is consulted by the orchestrator and by core agents (developer.md, panel agents) at specific points during phase execution. Subagents run in the background and are advisory — the orchestrator or calling agent owns the final decision.
8
+
9
+ ---
10
+
11
+ ## When This Policy Is Consulted
12
+
13
+ | Phase | Trigger | Who Reads This | Purpose |
14
+ |-------|---------|----------------|---------|
15
+ | feasibility | After `/repo-scan` completes | Orchestrator | Collect signals, write `## Subagent Signals` into feasibility.md |
16
+ | fast-implementation | Before spawning developer.md | Orchestrator | Optionally spawn 1 language specialist (background, non-blocking) |
17
+ | implementation | Before spawning developer.md | Orchestrator | Spawn language + domain specialists (background, advisory) |
18
+ | design | Before panel invocation | Orchestrator | Spawn domain specialist for pre-design input |
19
+ | code-review | After reading artifacts | Orchestrator | Spawn specialized reviewers in parallel |
20
+ | (any agent work) | When agent detects domain-specific need | developer.md, panel agents | Agent-to-agent delegation |
21
+
22
+ ---
23
+
24
+ ## Signal Collection (Feasibility Phase)
25
+
26
+ After `/repo-scan` completes, extract signals from three sources and write them into the feasibility artifact as a `## Subagent Signals` section.
27
+
28
+ ### Source 1: Repo-scan output
29
+
30
+ - `Languages` field → language specialist candidates
31
+ - `Frameworks` / `Dependencies` fields → framework specialist candidates
32
+ - `Test frameworks`, `CI/CD`, `Linters` → tooling signals
33
+
34
+ ### Source 2: Task description and feasibility observations
35
+
36
+ - Domain keywords (security, payments, ML, infrastructure, etc.)
37
+ - Affected subsystems and their nature
38
+
39
+ ### Source 3: File paths in scope
40
+
41
+ - File extensions of files likely to be changed
42
+ - Directory patterns (`infra/`, `terraform/`, `k8s/`, `ml/`, `auth/`, etc.)
43
+
44
+ ### Output format
45
+
46
+ Write into `feasibility.md`:
47
+
48
+ ```markdown
49
+ ## Subagent Signals
50
+
51
+ - **Primary language**: <language> (<confidence>)
52
+ - **Framework**: <framework> (<confidence>)
53
+ - **Domain signals**: <keyword list from task/paths>
54
+ - **Recommended subagents**:
55
+ - <agent-name> (<role>: language|framework|domain, <confidence>)
56
+ - <agent-name> (<role>, <confidence>)
57
+ - **Subagent mode**: full | minimal
58
+ ```
59
+
60
+ Set **Subagent mode** to `minimal` if `fast-path-check` routes the task to the fast path, and to `full` otherwise.
61
+
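A minimal sketch of how an orchestrator might render this section, assuming Python. The `Signals` and `Recommendation` containers and the `render_signals` helper are hypothetical names, not part of the package, and the two-space nesting of recommended subagents is an assumption about the intended Markdown layout.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Recommendation:
    agent: str       # e.g. "python-pro"
    role: str        # "language", "framework", or "domain"
    confidence: str  # "high" or "medium"


@dataclass
class Signals:
    # Hypothetical container for what the feasibility phase collected.
    language: str
    language_confidence: str
    framework: str
    framework_confidence: str
    domain_keywords: List[str] = field(default_factory=list)
    recommended: List[Recommendation] = field(default_factory=list)


def render_signals(signals: Signals, fast_path: bool) -> str:
    """Render a `## Subagent Signals` section in the layout shown above."""
    # Mode follows the rule above: minimal on the fast path, full otherwise.
    mode = "minimal" if fast_path else "full"
    lines = [
        "## Subagent Signals",
        "",
        f"- **Primary language**: {signals.language} ({signals.language_confidence})",
        f"- **Framework**: {signals.framework} ({signals.framework_confidence})",
        f"- **Domain signals**: {', '.join(signals.domain_keywords)}",
        "- **Recommended subagents**:",
    ]
    for rec in signals.recommended:
        lines.append(f"  - {rec.agent} ({rec.role}, {rec.confidence})")
    lines.append(f"- **Subagent mode**: {mode}")
    return "\n".join(lines)
```

The rendered string would then be appended to `feasibility.md` by whatever writes that artifact.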
62
+ ---
63
+
64
+ ## Selection Rules
65
+
66
+ ### Language Specialist Mapping
67
+
68
+ | Signal | Subagent | Confidence |
69
+ |--------|----------|------------|
70
+ | Python (.py), pyproject.toml, setup.py | `python-pro` | high |
71
+ | TypeScript (.ts, .tsx), tsconfig.json | `typescript-pro` | high |
72
+ | JavaScript (.js, .jsx) without TypeScript | `javascript-pro` | high |
73
+ | Rust (.rs), Cargo.toml | `rust-engineer` | high |
74
+ | Go (.go), go.mod | `golang-pro` | high |
75
+ | Java (.java), pom.xml, build.gradle | `java-architect` | high |
76
+ | C++ (.cpp, .cc, .h), CMakeLists.txt | `cpp-pro` | high |
77
+ | C# (.cs), *.csproj | `csharp-developer` | high |
78
+ | PHP (.php), composer.json | `php-pro` | high |
79
+ | Ruby (.rb), Gemfile | `rails-expert` | medium |
80
+ | Swift (.swift), Package.swift | `swift-expert` | high |
81
+ | Kotlin (.kt), build.gradle.kts | `kotlin-specialist` | high |
82
+ | Elixir (.ex, .exs), mix.exs | `elixir-expert` | high |
83
+ | SQL (.sql) as primary change surface | `sql-pro` | medium |
84
+ | PowerShell (.ps1, .psm1) | `powershell-7-expert` | high |
85
+
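The language table can be read as a lookup from file extension to specialist. The sketch below is one possible reading, assuming Python: it covers only a subset of the rows, picks the specialist matching the most in-scope files, and breaks ties alphabetically so the result stays deterministic. The tie-break and the `pick_language_specialist` name are assumptions, not something the policy specifies.

```python
from collections import Counter
from pathlib import PurePosixPath
from typing import Iterable, Optional, Tuple

# Subset of the Language Specialist Mapping: extension -> (subagent, confidence).
LANGUAGE_SPECIALISTS = {
    ".py": ("python-pro", "high"),
    ".ts": ("typescript-pro", "high"),
    ".tsx": ("typescript-pro", "high"),
    ".rs": ("rust-engineer", "high"),
    ".go": ("golang-pro", "high"),
    ".rb": ("rails-expert", "medium"),
    ".sql": ("sql-pro", "medium"),
}


def pick_language_specialist(paths: Iterable[str]) -> Optional[Tuple[str, str]]:
    """Return (subagent, confidence) for the dominant language, or None."""
    counts: Counter = Counter()
    for path in paths:
        entry = LANGUAGE_SPECIALISTS.get(PurePosixPath(path).suffix)
        if entry is not None:
            counts[entry] += 1
    if not counts:
        return None
    # Most files first; alphabetical subagent name as a deterministic tie-break.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0][0]))
    return ranked[0][0]
```

A fuller version would also consult manifest files such as `pyproject.toml` or `Cargo.toml`, which the table lists as signals alongside extensions.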
86
+ ### Framework Specialist Mapping
87
+
88
+ | Signal | Subagent | Confidence |
89
+ |--------|----------|------------|
90
+ | React / react-dom in dependencies | `react-specialist` | high |
91
+ | Next.js / next in dependencies | `nextjs-developer` | high |
92
+ | Vue / vue in dependencies | `vue-expert` | high |
93
+ | Angular / @angular/core | `angular-architect` | high |
94
+ | Django in dependencies | `django-developer` | high |
95
+ | FastAPI in dependencies | `fastapi-developer` | high |
96
+ | Rails in Gemfile | `rails-expert` | high |
97
+ | Spring Boot in dependencies | `spring-boot-engineer` | high |
98
+ | Laravel in composer.json | `laravel-specialist` | high |
99
+ | Flutter / pubspec.yaml | `flutter-expert` | high |
100
+ | React Native + Expo | `expo-react-native-expert` | high |
101
+ | Electron in dependencies | `electron-pro` | high |
102
+ | Symfony in composer.json | `symfony-specialist` | high |
103
+ | .NET Core / Microsoft.NET.Sdk | `dotnet-core-expert` | high |
104
+ | Svelte / SvelteKit | `svelte-developer` | medium |
105
+
106
+ ### Domain Specialist Mapping
107
+
108
+ | Signal Pattern (keywords in task, paths, feasibility) | Subagent | When to use |
109
+ |-------------------------------------------------------|----------|-------------|
110
+ | terraform/, .tf files, "infrastructure as code" | `terraform-engineer` | Infra changes |
111
+ | Dockerfile, docker-compose, container | `docker-expert` | Container work |
112
+ | k8s/, kubernetes, helm charts | `kubernetes-specialist` | K8s orchestration |
113
+ | CI/CD, pipeline, GitHub Actions, Jenkins | `devops-engineer` | Pipeline changes |
114
+ | AWS, CloudFormation, CDK | `cloud-architect` | Cloud architecture |
115
+ | Azure, Bicep, Entra | `azure-infra-engineer` | Azure infra |
116
+ | "security", "auth", RBAC, OAuth, JWT | `security-auditor` | Security-sensitive code |
117
+ | "performance", "latency", "throughput" | `performance-engineer` | Performance work |
118
+ | "accessibility", "a11y", WCAG | `accessibility-tester` | A11y changes |
119
+ | "machine learning", "model training", ML | `ml-engineer` | ML pipeline code |
120
+ | "LLM", "prompt", "embedding", "RAG" | `llm-architect` | LLM/AI systems |
121
+ | "data pipeline", ETL, "data warehouse" | `data-engineer` | Data infrastructure |
122
+ | PostgreSQL specifically | `postgres-pro` | Postgres-specific work |
123
+ | "database", "migration", "schema" | `database-administrator` | DB schema changes |
124
+ | "API design", "REST", "OpenAPI" | `api-designer` | API surface changes |
125
+ | GraphQL schema changes | `graphql-architect` | GraphQL work |
126
+ | "microservice", "service mesh" | `microservices-architect` | Service boundaries |
127
+ | "WebSocket", "real-time", "streaming" | `websocket-engineer` | Real-time features |
128
+ | "blockchain", "smart contract", "web3" | `blockchain-developer` | Web3 work |
129
+ | "payment", "stripe", "billing", PCI | `payment-integration` | Payment integration |
130
+ | "CLI", "command-line tool" | `cli-developer` | CLI development |
131
+ | "refactor", "legacy", "modernize" | `refactoring-specialist` | Large refactors |
132
+ | "documentation", "docs site" | `documentation-engineer` | Docs overhaul |
133
+ | MCP, "model context protocol" | `mcp-developer` | MCP tool development |
134
+ | Game engine, Unity, rendering | `game-developer` | Game development |
135
+ | IoT, edge computing, firmware | `iot-engineer` | IoT/embedded work |
136
+ | "fintech", financial compliance | `fintech-engineer` | Financial systems |
137
+
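Domain signals come from keywords in the task text and from path patterns. As a rough illustration, assuming Python and plain case-insensitive substring matching (the policy does not say how matching is done), a subset of the table could be applied like this. `match_domain_specialists` is a hypothetical helper.

```python
from typing import Dict, List

# Subset of the Domain Specialist Mapping: subagent -> lowercase signal patterns.
DOMAIN_SPECIALISTS = {
    "terraform-engineer": ["terraform/", ".tf", "infrastructure as code"],
    "docker-expert": ["dockerfile", "docker-compose", "container"],
    "security-auditor": ["security", "auth", "rbac", "oauth", "jwt"],
    "payment-integration": ["payment", "stripe", "billing", "pci"],
}


def match_domain_specialists(task: str, paths: List[str]) -> Dict[str, List[str]]:
    """Map each matching subagent to the patterns that triggered it."""
    # Search the task description and the in-scope paths as one lowercase string.
    haystack = " ".join([task, *paths]).lower()
    matches: Dict[str, List[str]] = {}
    for agent, patterns in DOMAIN_SPECIALISTS.items():
        hits = [p for p in patterns if p in haystack]
        if hits:
            matches[agent] = hits  # keep the evidence for the audit log
    return matches
```

Substring matching is deliberately crude: `auth` also matches `author`, so a real implementation would likely want word boundaries or path-segment checks before treating a hit as evidence.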
138
+ ---
139
+
140
+ ## Composition Rules
141
+
142
+ ### Advisory Mode (implementation, fast-implementation)
143
+
144
+ Used when subagents provide recommendations alongside the primary developer agent.
145
+
146
+ 1. Orchestrator reads `## Subagent Signals` from feasibility.md
147
+ 2. Applies mapping rules to select subagent(s)
148
+ 3. Spawns `developer.md` (primary, foreground)
149
+ 4. Spawns selected subagent(s) in **background** with advisory prompt:
150
+
151
+ ```
152
+ You are being invoked as an advisory specialist during implementation.
153
+ Review the design and implementation constraints below. Provide
154
+ language/domain-specific recommendations focusing on:
155
+ - Idiomatic patterns and conventions for [language/framework]
156
+ - Common pitfalls and anti-patterns to avoid
157
+ - Performance patterns specific to this stack
158
+ - Framework-specific best practices
159
+
160
+ Return findings as a structured list under these headings:
161
+ ## Recommendations, ## Pitfalls to Avoid, ## Patterns to Follow
162
+
163
+ Design context:
164
+ [design slice]
165
+
166
+ Files in scope:
167
+ [file list]
168
+ ```
169
+
170
+ 5. Developer.md proceeds immediately — **not blocked** by subagent
171
+ 6. When subagent returns, orchestrator appends findings to `implementation.md` under `## Specialist Advisory`
172
+ 7. If subagent findings conflict with developer output, the orchestrator notes the conflict and logs it for code-review consideration; it does NOT automatically re-implement
173
+
174
+ ### Parallel Review Mode (code-review)
175
+
176
+ Used when specialist reviewers run alongside the main code review.
177
+
178
+ 1. Orchestrator reads `## Subagent Signals` from feasibility.md
179
+ 2. Selects 0-2 review-oriented subagents based on domain signals:
180
+ - Security-sensitive code → `security-auditor`
181
+ - Performance-sensitive code → `performance-engineer`
182
+ - Accessibility changes → `accessibility-tester`
183
+ - Infrastructure changes → `security-engineer`
184
+ 3. Spawns them in **background** simultaneously with main `/diff-review`
185
+ 4. Main code-review proceeds normally — specialist reviews do NOT block
186
+ 5. When a specialist returns, its findings are appended to the review artifact under `## Specialist Review Findings`
187
+ 6. If any specialist finding is severity `high` or `critical`, it is flagged in the main review verdict — but the main review verdict still controls the transition
188
+
189
+ ### Pre-Design Input Mode (design)
190
+
191
+ Used when domain expertise should inform the design before panel review.
192
+
193
+ 1. Orchestrator reads `## Subagent Signals` from feasibility.md
194
+ 2. If a domain specialist is indicated with high confidence, spawn it **before** the design panel with a focused prompt:
195
+
196
+ ```
197
+ Review the feasibility analysis and provide domain-specific input for
198
+ the design. Focus on: architectural constraints, technology choices,
199
+ integration patterns, and risks specific to [domain].
200
+
201
+ Feasibility context:
202
+ [feasibility.md content]
203
+ ```
204
+
205
+ 3. Specialist output is integrated into `design.md` under `## Domain Specialist Input` before panel review begins
206
+ 4. Panel reviewers see the specialist input alongside the design
207
+
208
+ ### Agent-to-Agent Delegation
209
+
210
+ Core agents (`developer.md`, panel agents) can spawn subagents themselves when they encounter domain-specific complexity during their work.
211
+
212
+ **Rules for agent-to-agent delegation:**
213
+
214
+ 1. The calling agent detects a need it cannot handle optimally (e.g., developer hits complex Rust lifetime issues, or encounters unfamiliar framework patterns)
215
+ 2. The calling agent reads `.claude/phases/subagent-selection.md` mapping tables to identify the right specialist
216
+ 3. The calling agent spawns the subagent in **background** with a focused, bounded prompt describing the specific problem
217
+ 4. The calling agent continues its work — does NOT block on the subagent
218
+ 5. When the subagent returns, the calling agent integrates findings into its own output
219
+ 6. The calling agent logs the delegation: `action=agent-delegate source=<calling-agent> target=<subagent-path> note="<specific problem>"`
220
+
221
+ **Constraints on agent-to-agent delegation:**
222
+
223
+ - Maximum 1 subagent spawn per calling agent per phase
224
+ - Subagent must come from the mapping tables (no ad-hoc selection)
225
+ - Calling agent must include the subagent's findings in its output (not silently discard)
226
+ - If subagent contradicts the calling agent, both perspectives are reported to the orchestrator
227
+
228
+ ---
229
+
230
+ ## Fast Path vs Full Path
231
+
232
+ ### Fast path (`subagent-mode: minimal`)
233
+
234
+ - Signal collection happens in feasibility (zero extra cost)
235
+ - During fast-implementation: spawn **at most 1** language specialist
236
+ - Only if confidence is `high`
237
+ - Only if the language matches the primary language of changed files
238
+ - Runs in **background** (non-blocking)
239
+ - If fast-implementation finishes before specialist returns, **proceed without waiting**
240
+ - No domain specialists on fast path
241
+ - No review specialists on fast path (fast path has no separate code-review phase)
242
+ - Developer.md can still use agent-to-agent delegation (1 spawn max)
243
+
244
+ ### Full path (`subagent-mode: full`)
245
+
246
+ - Up to 2 subagents per phase (typically 1 language + 1 domain)
247
+ - Implementation: language specialist + domain specialist (both background, advisory)
248
+ - Code-review: up to 2 review specialists in parallel with main review
249
+ - Design: up to 1 domain specialist as pre-design input (foreground, focused scope)
250
+ - Developer.md can use agent-to-agent delegation (1 spawn max)
251
+
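The two paths differ mainly in how many specialists the orchestrator may spawn for implementation. A minimal sketch of those caps, assuming Python; `select_for_implementation` and its argument shapes are hypothetical and only encode the bullets above.

```python
from typing import List, Optional, Tuple


def select_for_implementation(
    mode: str,
    language: Optional[Tuple[str, str]],  # (subagent, confidence) or None
    domain: Optional[str],                # domain subagent name or None
) -> List[str]:
    """Return the subagents to spawn in the background for implementation."""
    if mode == "minimal":
        # Fast path: at most one language specialist, high confidence only,
        # and no domain specialists.
        if language is not None and language[1] == "high":
            return [language[0]]
        return []
    # Full path: typically one language plus one domain specialist, capped at 2.
    selected = []
    if language is not None:
        selected.append(language[0])
    if domain is not None:
        selected.append(domain)
    return selected[:2]
```

For example, on the fast path a medium-confidence Ruby signal yields no spawn at all, while the same signal on the full path would still bring in `rails-expert`.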
252
+ ---
253
+
254
+ ## Budget Constraints
255
+
256
+ - Subagent spawns count against the iteration budget
257
+ - Each subagent spawn adds estimated cost to `cost_used` in state.json
258
+ - Model routing from subagent frontmatter determines cost tier (opus > sonnet > haiku)
259
+ - **If `budget_remaining` < 3, skip all subagent auto-selection** — preserve budget for core work
260
+ - Maximum 2 orchestrator-spawned subagents per phase invocation
261
+ - Maximum 1 agent-to-agent delegation per calling agent per phase
262
+ - Total subagent spawns tracked in `state.json.counters.subagent_spawns`
263
+
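These limits amount to a gate that runs before any auto-selection. The sketch below assumes Python and a flattened, hypothetical `PipelineState` view of `state.json`; the real file layout may differ. The threshold of 3 and the cap of 2 come from the list above.

```python
from dataclasses import dataclass
from typing import Tuple

MIN_BUDGET_FOR_SUBAGENTS = 3  # below this, skip all subagent auto-selection
MAX_ORCHESTRATOR_SPAWNS = 2   # orchestrator-spawned subagents per phase invocation


@dataclass
class PipelineState:
    # Hypothetical flattened view of the fields this policy reads from state.json.
    budget_remaining: int
    subagent_auto_select: bool = True
    spawns_this_phase: int = 0


def spawn_allowance(state: PipelineState) -> Tuple[int, str]:
    """Return how many more subagents may be spawned this phase, with a reason."""
    if not state.subagent_auto_select:
        return 0, "auto-selection disabled"
    if state.budget_remaining < MIN_BUDGET_FOR_SUBAGENTS:
        return 0, "budget_remaining below threshold"
    remaining = max(0, MAX_ORCHESTRATOR_SPAWNS - state.spawns_this_phase)
    reason = "ok" if remaining else "phase spawn cap reached"
    return remaining, reason
```

When the allowance is zero but signals existed, the returned reason is the kind of text that could populate a `skip-subagent` audit entry.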
264
+ ---
265
+
266
+ ## Audit Logging
267
+
268
+ Every auto-selected subagent must be logged in `audit.log`:
269
+
270
+ ```
271
+ action=auto-select target=<subagent-path> phase=<phase> signals="<evidence>" confidence=<high|medium> mode=<advisory|review|input>
272
+ action=agent-delegate source=<calling-agent> target=<subagent-path> note="<specific problem>"
273
+ action=integrate-subagent target=<subagent-path> result=<integrated|conflict|skipped>
274
+ action=skip-subagent phase=<phase> reason="<why not selected>"
275
+ ```
276
+
277
+ The `skip-subagent` entry is logged only when signals existed but selection was suppressed (budget, confidence threshold, mode constraint).
278
+
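For illustration, entries in this `key=value` shape could be produced by a small formatter. This is a sketch assuming Python; `audit_line` is a hypothetical helper, and the choice to quote only `signals`, `note`, and `reason` simply mirrors the examples above.

```python
# Free-text fields that appear quoted in the log format above.
QUOTED_FIELDS = {"signals", "note", "reason"}


def audit_line(action: str, **fields: str) -> str:
    """Format one audit.log entry as space-separated key=value pairs."""
    parts = [f"action={action}"]
    for key, value in fields.items():
        text = str(value)
        if key in QUOTED_FIELDS:
            # Swap embedded double quotes so the entry stays parseable.
            text = '"' + text.replace('"', "'") + '"'
        parts.append(f"{key}={text}")
    return " ".join(parts)
```

Usage might look like `audit_line("skip-subagent", phase="implementation", reason="budget_remaining < 3")`, which keeps field order as given in the call.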
279
+ ---
280
+
281
+ ## Override
282
+
283
+ - Set `state.json.pipeline.subagent_auto_select` to `false` to disable all auto-selection
284
+ - The user can also say "skip subagents" or "no subagents" during a `/work` session
285
+ - Manual `/subagent invoke` always works regardless of this setting
286
+
287
+ ---
288
+
289
+ ## Conflict Resolution
290
+
291
+ If a subagent finding contradicts the primary agent or orchestrator:
292
+
293
+ 1. Log the conflict in `audit.log`
294
+ 2. Include both perspectives in the phase artifact (under `## Specialist Advisory` or `## Specialist Review Findings`)
295
+ 3. The orchestrator or calling agent (not the subagent) decides which perspective to follow
296
+ 4. Record the resolution rationale in the artifact
297
+
298
+ Subagents are advisory. They enhance quality but never override the primary workflow.
@@ -0,0 +1,68 @@
1
+ # Test
2
+
3
+ This phase operates in two modes: stub generation before implementation and execution after it.
4
+
5
+ ## Mission
6
+
7
+ - **Phase 1 (test stubs)**: Define the target test surface before implementation. Write stubs and placeholders for decisive checks.
8
+ - **Phase 2 (execution)**: Run the narrowest decisive automated and manual checks after implementation. Feed failures back with precise evidence.
9
+
10
+ ## Persona
11
+
12
+ Senior test engineer — values decisive evidence over coverage theater. Prefers deterministic, reproducible checks.
13
+
14
+ ## Procedure — Phase 1: Stub Generation
15
+
16
+ 1. Read `design.md`.
17
+ 2. Identify the behaviors that must be proven for correctness.
18
+ 3. Choose the narrowest useful test surfaces (unit, integration, contract, manual).
19
+ 4. Prioritize the most likely regression paths and highest-risk invariants.
20
+ 5. Write stub/scaffold test files that make the intended verification surface explicit.
21
+
22
+ ## Procedure — Phase 1.5: Adequacy Assessment
23
+
24
+ After writing test stubs and before implementation begins, assess:
25
+
26
+ 1. **Acceptance criteria coverage**: what fraction of acceptance criteria have at least one test? (target: 100%)
27
+ 2. **Risk-weighted coverage**: are the top 3 risk items from design tested?
28
+ 3. **Edge case coverage**: are boundary conditions, error paths, and empty/null inputs addressed?
29
+ 4. **Regression coverage**: are the most likely regression paths tested?
30
+
31
+ Score: `adequate` or `gaps-identified`. If `gaps-identified`, add missing stubs before proceeding.
32
+
33
+ Record the assessment in `test-stubs.md` under an "Adequacy Assessment" section.
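
The four questions above can be sketched as a small scoring helper. This is an illustration under stated assumptions: `AdequacyInputs`, `assess_adequacy`, and the wording of the gap messages are hypothetical, and the two boolean fields stand in for judgments the test engineer still makes by hand.

```python
from dataclasses import dataclass


@dataclass
class AdequacyInputs:
    # Maps each acceptance criterion to the stub tests that cover it.
    criteria_to_tests: dict[str, list[str]]
    # Maps each top risk item from design to whether a stub covers it.
    top_risks_tested: dict[str, bool]
    edge_cases_addressed: bool
    regression_paths_tested: bool


def assess_adequacy(inputs: AdequacyInputs) -> tuple[str, list[str]]:
    """Return the score and the gaps that still need stubs."""
    gaps: list[str] = []
    for criterion, tests in inputs.criteria_to_tests.items():
        if not tests:
            gaps.append(f"acceptance criterion without a test: {criterion}")
    for risk, tested in inputs.top_risks_tested.items():
        if not tested:
            gaps.append(f"untested risk item: {risk}")
    if not inputs.edge_cases_addressed:
        gaps.append("edge cases not addressed")
    if not inputs.regression_paths_tested:
        gaps.append("likely regression paths not tested")
    return ("adequate" if not gaps else "gaps-identified", gaps)


# Hypothetical example: one criterion still lacks a stub.
score, gaps = assess_adequacy(
    AdequacyInputs(
        criteria_to_tests={"AC-1": ["test_login"], "AC-2": []},
        top_risks_tested={"token expiry": True},
        edge_cases_addressed=True,
        regression_paths_tested=True,
    )
)
print(score, gaps)
```

The returned gap list maps directly onto the stubs that must be added before proceeding.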
34
+
35
+ ## Procedure — Phase 2: Execution
36
+
37
+ 1. Invoke `/test-runner` to detect and execute the test suite. Scope to relevant test files when possible.
38
+ 2. Start with the narrowest decisive checks.
39
+ 3. Expand only when risk justifies additional coverage.
40
+ 4. Capture exact commands, exact failures, and confidence level.
41
+ 5. If validation is blocked by environment, say so explicitly.
42
+
43
+ ## Inputs
44
+
45
+ - `.claude/.work/<id>/design.md`
46
+ - `.claude/.work/<id>/implementation.md` (Phase 2 only)
47
+ - Relevant source files and test files
48
+
49
+ ## Required Output
50
+
51
+ **Phase 1**: Write `.claude/.work/<id>/test-stubs.md` following `.claude/templates/artifact-format.md`, with:
52
+
53
+ - intended test files and behaviors each test should prove
54
+ - highest-risk regression targets
55
+ - what still cannot be tested yet
56
+
57
+ **Phase 2**: Write `.claude/.work/<id>/test-results.md` with:
58
+
59
+ - checks executed, result of each, confidence level
60
+ - failures and likely cause
61
+ - next action
62
+
63
+ Apply `.claude/templates/evidence-standard.md` throughout.
64
+
65
+ ## Failure Protocol
66
+
67
+ - if no automated test path exists, define manual checks and document why automation was not feasible
68
+ - avoid broad claims of confidence without evidence
@@ -0,0 +1,58 @@
1
+ # Validation
2
+
3
+ ## Mission
4
+
5
+ Run integrated validation and classify the outcome as `approved`, `failed`, or `blocked`.
6
+
7
+ ## Persona
8
+
9
+ Release gatekeeper — trusts execution evidence over reasoning, classifies failures precisely.
10
+
11
+ ## Procedure
12
+
13
+ 0. If on fast path (`state.json.pipeline.fast_path_eligible == true`):
14
+ - Check that `implementation.md` contains test evidence (command + output + result).
15
+ - If the change is behavioral (not documentation-only) and no test evidence exists, classify as `failed` with reason: "missing test evidence for behavioral change".
16
+
17
+ 1. Run the strongest available integrated checks:
18
+ - Invoke `/test-runner` for test execution
19
+ - Invoke `/lint` for lint and format checks
20
+ - Run build and typecheck commands directly
21
+ 2. Capture exact commands and decisive outputs.
22
+ 3. Classify the result:
23
+ - `approved`: all checks pass
24
+ - `failed`: code defects found
25
+ - `blocked`: environment or infrastructure issue
26
+ 4. If blocked, identify the blocking layer (local env, missing secret, flaky infra, unsupported path).
27
+ 5. Recommend whether to return to implementation or escalate.
28
+ 6. Retry blocked validation only within the configured budget.
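
The classification in step 3 can be sketched as follows. `CheckStatus` and `classify` are hypothetical names. Two choices are assumptions that the procedure does not state: `failed` takes precedence over `blocked` when both occur, and an empty result set is rejected.

```python
from enum import Enum


class CheckStatus(Enum):
    PASSED = "passed"
    DEFECT = "defect"            # a code defect was found
    ENVIRONMENT = "environment"  # environment or infrastructure issue


def classify(results: dict[str, CheckStatus]) -> str:
    """Map per-check results to approved, failed, or blocked."""
    if not results:
        # Assumption: no executed checks means there is nothing to classify.
        raise ValueError("no checks were executed")
    statuses = set(results.values())
    # Assumption: a code defect outranks an environment issue.
    if CheckStatus.DEFECT in statuses:
        return "failed"
    if CheckStatus.ENVIRONMENT in statuses:
        return "blocked"
    return "approved"


# Hypothetical run: tests pass, lint passes, the build could not start.
print(classify({
    "test-runner": CheckStatus.PASSED,
    "lint": CheckStatus.PASSED,
    "build": CheckStatus.ENVIRONMENT,
}))
```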
29
+
30
+ ## Reflection on Failure
31
+
32
+ When classification is `failed`, append a structured entry to `.claude/.work/<id>/reflection-log.md`:
33
+
34
+ - **What failed**: which checks failed and exact output
35
+ - **Root cause**: hypothesis for why the failure occurred
36
+ - **Strategy change**: what the implementation should change to address the failure
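
One possible shape for that entry, as a sketch. The three labels come from the list above; the bullet layout, the helper names, and the blank line between entries are assumptions, since the log format is not specified here.

```python
from pathlib import Path


def format_reflection_entry(what_failed: str, root_cause: str, strategy_change: str) -> str:
    """Render one reflection entry using the three required fields."""
    return (
        f"- **What failed**: {what_failed}\n"
        f"- **Root cause**: {root_cause}\n"
        f"- **Strategy change**: {strategy_change}\n"
    )


def append_reflection(log_path: Path, entry: str) -> None:
    """Append an entry to the log, creating the file if needed."""
    log_path.parent.mkdir(parents=True, exist_ok=True)
    with log_path.open("a", encoding="utf-8") as handle:
        # Assumption: a blank line separates consecutive entries.
        handle.write(entry + "\n")


# Hypothetical values for illustration only.
print(format_reflection_entry(
    "unit test test_parse_empty failed with IndexError",
    "parser assumes at least one token",
    "guard against empty input before indexing",
))
```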
37
+
38
+ ## Inputs
39
+
40
+ - `.claude/.work/<id>/implementation.md`
41
+ - `.claude/.work/<id>/permissions-changes.md` (if exists)
42
+ - Repository build/test/lint configuration
43
+
44
+ ## Required Output
45
+
46
+ Write `.claude/.work/<id>/validation-results.md` following `.claude/templates/artifact-format.md`, with:
47
+
48
+ - commands run and observed output summary
49
+ - classification: `approved`, `failed`, or `blocked`
50
+ - confidence, retry count
51
+ - recommended next state
52
+
53
+ Apply `.claude/templates/evidence-standard.md` throughout.
54
+
55
+ ## Failure Protocol
56
+
57
+ - if execution evidence is weak, say so
58
+ - if a failure is flaky, explain why you believe it is flaky
@@ -0,0 +1,45 @@
1
+ # Verification
2
+
3
+ ## Mission
4
+
5
+ Run the seven-check artifact scan and determine whether the pipeline can safely proceed from design into implementation preparation.
6
+
7
+ ## Persona
8
+
9
+ Release-quality checker for planning artifacts — assumes planning debt becomes implementation debt.
10
+
11
+ ## Procedure
12
+
13
+ Execute the seven-check scan mechanically:
14
+
15
+ 1. `state.json` is consistent with the intended next state
16
+ 2. `feasibility.md` exists and is coherent
17
+ 3. `design.md` exists and maps to real repository files
18
+ 4. acceptance criteria are testable
19
+ 5. planned implementation slices are bounded
20
+ 6. major risks are named
21
+ 7. the validation strategy is credible
22
+
23
+ Treat missing evidence and contradictory planning as real failures. Classify each check as pass, fail, or blocked.
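
Only the presence part of the scan lends itself to automation. Coherence, testability, and credibility still need judgment. The sketch below covers that presence part alone. The helper name `scan_presence` is hypothetical, and marking a missing file as `blocked` follows this document's failure protocol for artifacts that are missing entirely.

```python
from pathlib import Path

# The three inputs this phase reads from the work directory.
REQUIRED_ARTIFACTS = ("state.json", "feasibility.md", "design.md")


def scan_presence(work_dir: Path) -> dict[str, str]:
    """Return pass or blocked for each required artifact by file presence."""
    return {
        name: "pass" if (work_dir / name).is_file() else "blocked"
        for name in REQUIRED_ARTIFACTS
    }


# Hypothetical work directory; <id> is left as a placeholder on purpose.
print(scan_presence(Path(".claude/.work/<id>")))
```

A `pass` here only means the file exists, so the remaining checks must still be classified separately.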
24
+
25
+ ## Inputs
26
+
27
+ - `.claude/.work/<id>/state.json`
28
+ - `.claude/.work/<id>/feasibility.md`
29
+ - `.claude/.work/<id>/design.md`
30
+
31
+ ## Required Output
32
+
33
+ Write `.claude/.work/<id>/verification-results.md` following `.claude/templates/artifact-format.md`, with:
34
+
35
+ - each of the seven checks with pass/fail/blocked status
36
+ - structural findings
37
+ - required repair action (if any)
38
+ - recommended next state
39
+
40
+ Apply `.claude/templates/evidence-standard.md` throughout.
41
+
42
+ ## Failure Protocol
43
+
44
+ - if planning artifacts are contradictory, return to design
45
+ - if artifacts are missing entirely, mark as blocked
@@ -0,0 +1,91 @@
1
+ # Frontend Aesthetics Reference
2
+
3
+ Source: [Anthropic Claude Cookbooks — Prompting for Frontend Aesthetics](https://github.com/anthropics/claude-cookbooks/blob/main/coding/prompting_for_frontend_aesthetics.ipynb)
4
+
5
+ This reference defines the aesthetic standards all frontend-generating agents must follow. The goal is to produce visually distinctive, polished frontends instead of generic "AI slop" designs.
6
+
7
+ ## Distilled Aesthetics Prompt
8
+
9
+ All frontend agents must include this prompt context when generating UI code:
10
+
11
+ ```
12
+ <frontend_aesthetics>
13
+ You tend to converge toward generic, "on distribution" outputs. In frontend design, this creates what users call the "AI slop" aesthetic. Avoid this: make creative, distinctive frontends that surprise and delight. Focus on:
14
+
15
+ Typography: Choose fonts that are beautiful, unique, and interesting. Avoid generic fonts like Arial and Inter; opt instead for distinctive choices that elevate the frontend's aesthetics.
16
+
17
+ Color & Theme: Commit to a cohesive aesthetic. Use CSS variables for consistency. Dominant colors with sharp accents outperform timid, evenly-distributed palettes. Draw from IDE themes and cultural aesthetics for inspiration.
18
+
19
+ Motion: Use animations for effects and micro-interactions. Prioritize CSS-only solutions for HTML. Use Motion library for React when available. Focus on high-impact moments: one well-orchestrated page load with staggered reveals (animation-delay) creates more delight than scattered micro-interactions.
20
+
21
+ Backgrounds: Create atmosphere and depth rather than defaulting to solid colors. Layer CSS gradients, use geometric patterns, or add contextual effects that match the overall aesthetic.
22
+
23
+ Avoid generic AI-generated aesthetics:
24
+ - Overused font families (Inter, Roboto, Arial, system fonts)
25
+ - Clichéd color schemes (particularly purple gradients on white backgrounds)
26
+ - Predictable layouts and component patterns
27
+ - Cookie-cutter design that lacks context-specific character
28
+
29
+ Interpret creatively and make unexpected choices that feel genuinely designed for the context. Vary between light and dark themes, different fonts, different aesthetics.
30
+ </frontend_aesthetics>
31
+ ```
32
+
33
+ ## Typography Guide
34
+
35
+ ### Never Use
36
+ Inter, Roboto, Open Sans, Lato, Arial, default system fonts
37
+
38
+ ### Recommended Font Choices
39
+
40
+ | Context | Fonts |
41
+ |---------|-------|
42
+ | Code / Technical | JetBrains Mono, Fira Code, Space Grotesk |
43
+ | Editorial / Content | Playfair Display, Crimson Pro, Fraunces |
44
+ | Startup / Modern | Clash Display, Satoshi, Cabinet Grotesk |
45
+ | Distinctive / Unique | Bricolage Grotesque, Obviously, Newsreader |
46
+
47
+ ### Pairing Principle
48
+ High contrast = interesting. Pair display + monospace, serif + geometric sans. Use weight extremes (100/200 vs 800/900), not timid mid-range differences (400 vs 600).
49
+
50
+ ## Color & Theme Guide
51
+
52
+ - Commit to a cohesive aesthetic — don't mix conflicting styles
53
+ - Use CSS variables for consistency across the design
54
+ - Dominant colors with sharp accents outperform timid, evenly-distributed palettes
55
+ - Draw inspiration from IDE themes (Dracula, Nord, Solarized, Tokyo Night) and cultural aesthetics (Solarpunk, Brutalism, Art Deco)
56
+ - Avoid: purple gradients on white backgrounds (the most clichéd AI-generated look)
57
+
58
+ ## Motion Guide
59
+
60
+ - CSS-only animations for vanilla HTML projects
61
+ - Motion library (formerly Framer Motion) for React projects when available
62
+ - Focus on high-impact moments: one well-orchestrated page load with staggered reveals (`animation-delay`) > scattered micro-interactions
63
+ - Use `animation-delay` for staggered entrance effects
64
+ - Keep animations purposeful — every animation should communicate something
65
+
66
+ ## Background Guide
67
+
68
+ - Create atmosphere and depth — never default to flat solid colors
69
+ - Layer CSS gradients for richness
70
+ - Use geometric patterns or noise textures for character
71
+ - Add contextual effects that match the overall aesthetic
72
+ - Consider subtle animated backgrounds for hero sections
73
+
74
+ ## Theme Examples
75
+
76
+ ### Solarpunk
77
+ Warm, optimistic palettes (greens, golds, earth tones). Organic shapes mixed with technical elements. Nature-inspired patterns. Bright, hopeful atmosphere. Retro-futuristic typography.
78
+
79
+ ### Brutalist
80
+ Raw, honest materials. High contrast. Monospace typography. Minimal decoration. Strong grid systems. Intentionally rough edges.
81
+
82
+ ### Art Deco
83
+ Geometric patterns. Gold and jewel tones. Elegant serif fonts. Symmetry and luxury. Ornamental borders.
84
+
85
+ ## Application Rules
86
+
87
+ 1. Always load distinctive fonts from Google Fonts (or equivalent CDN)
88
+ 2. State font and theme choices before writing code
89
+ 3. Use CSS variables for all colors, spacing, and typography values
90
+ 4. Vary between light and dark themes across different projects
91
+ 5. Each project should have a unique visual identity — no two should look alike
@@ -0,0 +1,73 @@
1
+ # GitHub
2
+
3
+ Authoritative workflow guidance for all git and GitHub operations in this pipeline.
4
+
5
+ ## Source Priority
6
+
7
+ - Git core behavior: official Git docs at `git-scm.com/docs`
8
+ - GitHub repository and PR workflow: `docs.github.com`
9
+ - GitHub CLI behavior: `cli.github.com/manual`
10
+
11
+ ## Core Facts
12
+
13
+ - `git pull` fetches and then integrates remote changes by merge or rebase depending on options and configuration.
14
+ - a non-fast-forward push is rejected to avoid losing history; GitHub Docs recommend fetching and merging or using `git pull` before pushing again.
15
+ - `gh pr create` prints the PR URL on success.
16
+ - `gh pr create --dry-run` is not side-effect free and may still push git changes.
17
+ - first push of a new branch commonly uses `git push --set-upstream origin <branch>`.
18
+
19
+ ## Preconditions
20
+
21
+ Before git workflow actions:
22
+
23
+ - relevant tests have passed or gaps are documented
24
+ - the worktree has been inspected for unrelated changes
25
+ - you know whether the repository is a direct clone or a fork
26
+ - you know the intended base branch
27
+
28
+ ## Operational Guidance
29
+
30
+ - confirm branch, remotes, and worktree status before git or GitHub actions
31
+ - do not invent PR URLs
32
+ - do not use force push casually; prefer `--force-with-lease` over raw `--force`
33
+ - if auth, permissions, or branch protection prevent an action, record the exact blocker
34
+ - prefer explicit flags with `gh pr create` for deterministic behavior
35
+
36
+ ## Common Workflows
37
+
38
+ For step-by-step branch, sync, and conflict procedures, see `.claude/agents/git-ops.md`. This reference provides the authoritative facts those procedures rely on.
39
+
40
+ ### Create Pull Request
41
+
42
+ ```bash
43
+ gh pr create --base <base-branch> --title "<title>" --body "<body>"
44
+ ```
45
+
46
+ ## Fetch, Pull, Merge, Rebase
47
+
48
+ - `git fetch <remote>` retrieves remote updates without modifying the working branch
49
+ - `git pull` fetches and integrates; commit or stash local work first
50
+ - `git merge <remote>/<branch>` integrates remote-tracking changes, preserving history
51
+ - `git rebase` replays local commits onto a new base; do not rebase shared history
52
+
53
+ ## Conflict Resolution
54
+
55
+ Inspect markers, preserve upstream fixes, abort when unsafe. See `.claude/agents/git-ops.md` Conflict Rules for procedure.
56
+
57
+ ## Fork and Upstream Sync
58
+
59
+ Verify remotes: `origin` = fork, `upstream` = original. Add `upstream` if missing. See `.claude/agents/git-ops.md` for sync procedure.
60
+
61
+ ## Decision Heuristics
62
+
63
+ - prefer merge when preserving shared history is safer than rewriting
64
+ - prefer rebase when local unshared commits need linear history
65
+ - prefer draft PRs when work is visible but not merge-ready
66
+ - keep commits atomic
67
+ - prefer non-interactive, explicit, reversible commands
68
+
69
+ ## Required Recordkeeping
70
+
71
+ - record the actual PR URL in `.claude/.work/<id>/pr-link.txt`
72
+ - record important git workflow actions in `.claude/.work/<id>/progress.md`
73
+ - record blockers and failed workflow attempts in `.claude/.work/<id>/audit.log`
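
Because `gh pr create` prints the PR URL on success, the recorded link can be taken from the command output rather than constructed by hand. The sketch below assumes the URL has the usual `https://<host>/<owner>/<repo>/pull/<number>` form. The names `extract_pr_url` and `record_pr_url` are hypothetical.

```python
import re
from pathlib import Path
from typing import Optional

# Assumption: PR URLs end in /pull/<number>; the host is left open.
PR_URL = re.compile(r"https://\S+/pull/\d+")


def extract_pr_url(stdout: str) -> Optional[str]:
    """Return the last PR URL found in command output, or None."""
    matches = PR_URL.findall(stdout)
    return matches[-1] if matches else None


def record_pr_url(stdout: str, work_dir: Path) -> bool:
    """Write pr-link.txt only when the output contains a real URL."""
    url = extract_pr_url(stdout)
    if url is None:
        # Nothing to record; the caller should log the blocker instead.
        return False
    (work_dir / "pr-link.txt").write_text(url + "\n", encoding="utf-8")
    return True


# Hypothetical output captured from a successful run.
print(extract_pr_url("Creating pull request\nhttps://github.com/octo/repo/pull/42\n"))
```

Returning `False` instead of guessing keeps the "do not invent PR URLs" rule enforceable in code.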
@@ -0,0 +1,33 @@
1
+ # Artifact Format
2
+
3
+ Follow this template when writing phase artifacts to ensure consistent structure.
4
+
5
+ ## Standard Structure
6
+
7
+ ```markdown
8
+ # <Phase Name>
9
+
10
+ ## Inputs
11
+ - what was read or received
12
+
13
+ ## Findings
14
+ - observations, analysis, and evidence-backed conclusions
15
+ - follow `.claude/templates/evidence-standard.md` for evidence quality
16
+
17
+ ## Verdict
18
+ - clear decision: approved, rejected, blocked, simple, complex, etc.
19
+ - confidence level when applicable
20
+
21
+ ## Evidence Basis
22
+ - specific files, commands, outputs, or documentation that support the verdict
23
+
24
+ ## Recommended Next State
25
+ - the state transition this artifact supports
26
+ - any conditions or caveats
27
+ ```
28
+
29
+ ## Rules
30
+
31
+ - every artifact must have a verdict — commentary without a decision is not sufficient
32
+ - evidence basis must reference concrete sources, not general reasoning
33
+ - recommended next state must be a valid transition per the state machine in CLAUDE.md
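
A structural check for the template above might look like this sketch. It verifies only that the five level-two headings are present. Whether the verdict is a real decision, or the next state a valid transition, still has to be judged against the state machine. The function name `missing_sections` is hypothetical.

```python
# Section headings taken from the standard structure above.
REQUIRED_SECTIONS = (
    "Inputs",
    "Findings",
    "Verdict",
    "Evidence Basis",
    "Recommended Next State",
)


def missing_sections(artifact_text: str) -> list[str]:
    """Return the required sections that have no matching '## ' heading."""
    headings = {
        line[3:].strip()
        for line in artifact_text.splitlines()
        if line.startswith("## ")
    }
    return [section for section in REQUIRED_SECTIONS if section not in headings]


# Hypothetical artifact that stops before reaching a verdict.
draft = "# Validation\n\n## Inputs\n- implementation.md\n\n## Findings\n- tests pass\n"
print(missing_sections(draft))
```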