@poolzin/pool-bot 2026.3.11 β†’ 2026.3.14

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (195) hide show
  1. package/CHANGELOG.md +121 -0
  2. package/dist/.buildstamp +1 -1
  3. package/dist/agents/checkpoint-manager.js +291 -0
  4. package/dist/agents/poolbot-tools.js +5 -0
  5. package/dist/agents/subagent-announce-reliability.js +160 -0
  6. package/dist/agents/tool-result-truncation.js +299 -0
  7. package/dist/agents/tools/nodes-file-tool.js +197 -0
  8. package/dist/build-info.json +3 -3
  9. package/dist/cli/config-cli.js +60 -0
  10. package/dist/cron/cron-improvements.js +195 -0
  11. package/dist/discord/discord-improvements.js +167 -0
  12. package/dist/gateway/auth-rate-limit.js +19 -0
  13. package/dist/gateway/auth.js +41 -0
  14. package/dist/gateway/gateway-improvements.js +294 -0
  15. package/dist/gateway/node-command-policy.js +7 -2
  16. package/dist/infra/net/ssrf.js +15 -2
  17. package/dist/infra/shell-security.js +201 -0
  18. package/dist/memory/memory-improvements.js +239 -0
  19. package/dist/node-host/runner.js +146 -79
  20. package/dist/security/prototype-pollution.js +141 -0
  21. package/dist/security/webhook-security.js +253 -0
  22. package/dist/shared/net/ip.js +52 -1
  23. package/dist/slack/slack-improvements.js +225 -0
  24. package/dist/telegram/telegram-improvements.js +220 -0
  25. package/dist/ui-plugins/ui-plugins-improvements.js +191 -0
  26. package/docs/ANALISE_OPENCLAW_PROFISSIONAL.md +520 -0
  27. package/docs/competitive-analysis.md +421 -0
  28. package/docs/implementation-analysis.md +393 -0
  29. package/docs/plans/2026-03-11-file-operations-security-hardening.md +307 -0
  30. package/docs/plans/2026-03-11-integracao-projetos-poolbot.md +666 -0
  31. package/docs/refactor/plugin-development-guide.md +281 -0
  32. package/extensions/agency-agents/README.md +301 -0
  33. package/extensions/agency-agents/agents/CONTRIBUTING.md +353 -0
  34. package/extensions/agency-agents/agents/README.md +602 -0
  35. package/extensions/agency-agents/agents/design/design-brand-guardian.md +320 -0
  36. package/extensions/agency-agents/agents/design/design-image-prompt-engineer.md +234 -0
  37. package/extensions/agency-agents/agents/design/design-ui-designer.md +381 -0
  38. package/extensions/agency-agents/agents/design/design-ux-architect.md +467 -0
  39. package/extensions/agency-agents/agents/design/design-ux-researcher.md +327 -0
  40. package/extensions/agency-agents/agents/design/design-visual-storyteller.md +147 -0
  41. package/extensions/agency-agents/agents/design/design-whimsy-injector.md +436 -0
  42. package/extensions/agency-agents/agents/engineering/engineering-ai-engineer.md +144 -0
  43. package/extensions/agency-agents/agents/engineering/engineering-backend-architect.md +233 -0
  44. package/extensions/agency-agents/agents/engineering/engineering-devops-automator.md +374 -0
  45. package/extensions/agency-agents/agents/engineering/engineering-frontend-developer.md +223 -0
  46. package/extensions/agency-agents/agents/engineering/engineering-mobile-app-builder.md +491 -0
  47. package/extensions/agency-agents/agents/engineering/engineering-rapid-prototyper.md +460 -0
  48. package/extensions/agency-agents/agents/engineering/engineering-security-engineer.md +275 -0
  49. package/extensions/agency-agents/agents/engineering/engineering-senior-developer.md +174 -0
  50. package/extensions/agency-agents/agents/examples/README.md +48 -0
  51. package/extensions/agency-agents/agents/examples/nexus-spatial-discovery.md +852 -0
  52. package/extensions/agency-agents/agents/examples/workflow-landing-page.md +119 -0
  53. package/extensions/agency-agents/agents/examples/workflow-startup-mvp.md +155 -0
  54. package/extensions/agency-agents/agents/integrations/README.md +117 -0
  55. package/extensions/agency-agents/agents/integrations/aider/README.md +38 -0
  56. package/extensions/agency-agents/agents/integrations/antigravity/README.md +49 -0
  57. package/extensions/agency-agents/agents/integrations/claude-code/README.md +31 -0
  58. package/extensions/agency-agents/agents/integrations/cursor/README.md +38 -0
  59. package/extensions/agency-agents/agents/integrations/gemini-cli/README.md +36 -0
  60. package/extensions/agency-agents/agents/integrations/opencode/README.md +58 -0
  61. package/extensions/agency-agents/agents/integrations/windsurf/README.md +26 -0
  62. package/extensions/agency-agents/agents/marketing/marketing-app-store-optimizer.md +319 -0
  63. package/extensions/agency-agents/agents/marketing/marketing-content-creator.md +52 -0
  64. package/extensions/agency-agents/agents/marketing/marketing-growth-hacker.md +52 -0
  65. package/extensions/agency-agents/agents/marketing/marketing-instagram-curator.md +111 -0
  66. package/extensions/agency-agents/agents/marketing/marketing-reddit-community-builder.md +121 -0
  67. package/extensions/agency-agents/agents/marketing/marketing-social-media-strategist.md +123 -0
  68. package/extensions/agency-agents/agents/marketing/marketing-tiktok-strategist.md +123 -0
  69. package/extensions/agency-agents/agents/marketing/marketing-twitter-engager.md +124 -0
  70. package/extensions/agency-agents/agents/marketing/marketing-wechat-official-account.md +143 -0
  71. package/extensions/agency-agents/agents/marketing/marketing-xiaohongshu-specialist.md +136 -0
  72. package/extensions/agency-agents/agents/marketing/marketing-zhihu-strategist.md +160 -0
  73. package/extensions/agency-agents/agents/product/product-feedback-synthesizer.md +117 -0
  74. package/extensions/agency-agents/agents/product/product-sprint-prioritizer.md +152 -0
  75. package/extensions/agency-agents/agents/product/product-trend-researcher.md +157 -0
  76. package/extensions/agency-agents/agents/project-management/project-management-experiment-tracker.md +196 -0
  77. package/extensions/agency-agents/agents/project-management/project-management-project-shepherd.md +192 -0
  78. package/extensions/agency-agents/agents/project-management/project-management-studio-operations.md +198 -0
  79. package/extensions/agency-agents/agents/project-management/project-management-studio-producer.md +201 -0
  80. package/extensions/agency-agents/agents/project-management/project-manager-senior.md +133 -0
  81. package/extensions/agency-agents/agents/scripts/convert.sh +362 -0
  82. package/extensions/agency-agents/agents/scripts/install.sh +465 -0
  83. package/extensions/agency-agents/agents/scripts/lint-agents.sh +115 -0
  84. package/extensions/agency-agents/agents/spatial-computing/macos-spatial-metal-engineer.md +335 -0
  85. package/extensions/agency-agents/agents/spatial-computing/terminal-integration-specialist.md +68 -0
  86. package/extensions/agency-agents/agents/spatial-computing/visionos-spatial-engineer.md +52 -0
  87. package/extensions/agency-agents/agents/spatial-computing/xr-cockpit-interaction-specialist.md +30 -0
  88. package/extensions/agency-agents/agents/spatial-computing/xr-immersive-developer.md +30 -0
  89. package/extensions/agency-agents/agents/spatial-computing/xr-interface-architect.md +30 -0
  90. package/extensions/agency-agents/agents/specialized/agentic-identity-trust.md +367 -0
  91. package/extensions/agency-agents/agents/specialized/agents-orchestrator.md +365 -0
  92. package/extensions/agency-agents/agents/specialized/data-analytics-reporter.md +52 -0
  93. package/extensions/agency-agents/agents/specialized/data-consolidation-agent.md +58 -0
  94. package/extensions/agency-agents/agents/specialized/lsp-index-engineer.md +312 -0
  95. package/extensions/agency-agents/agents/specialized/report-distribution-agent.md +63 -0
  96. package/extensions/agency-agents/agents/specialized/sales-data-extraction-agent.md +65 -0
  97. package/extensions/agency-agents/agents/strategy/EXECUTIVE-BRIEF.md +95 -0
  98. package/extensions/agency-agents/agents/strategy/QUICKSTART.md +194 -0
  99. package/extensions/agency-agents/agents/strategy/coordination/agent-activation-prompts.md +401 -0
  100. package/extensions/agency-agents/agents/strategy/coordination/handoff-templates.md +357 -0
  101. package/extensions/agency-agents/agents/strategy/nexus-strategy.md +1110 -0
  102. package/extensions/agency-agents/agents/strategy/playbooks/phase-0-discovery.md +178 -0
  103. package/extensions/agency-agents/agents/strategy/playbooks/phase-1-strategy.md +238 -0
  104. package/extensions/agency-agents/agents/strategy/playbooks/phase-2-foundation.md +278 -0
  105. package/extensions/agency-agents/agents/strategy/playbooks/phase-3-build.md +286 -0
  106. package/extensions/agency-agents/agents/strategy/playbooks/phase-4-hardening.md +332 -0
  107. package/extensions/agency-agents/agents/strategy/playbooks/phase-5-launch.md +277 -0
  108. package/extensions/agency-agents/agents/strategy/playbooks/phase-6-operate.md +318 -0
  109. package/extensions/agency-agents/agents/strategy/runbooks/scenario-enterprise-feature.md +157 -0
  110. package/extensions/agency-agents/agents/strategy/runbooks/scenario-incident-response.md +217 -0
  111. package/extensions/agency-agents/agents/strategy/runbooks/scenario-marketing-campaign.md +187 -0
  112. package/extensions/agency-agents/agents/strategy/runbooks/scenario-startup-mvp.md +154 -0
  113. package/extensions/agency-agents/agents/support/support-analytics-reporter.md +363 -0
  114. package/extensions/agency-agents/agents/support/support-executive-summary-generator.md +210 -0
  115. package/extensions/agency-agents/agents/support/support-finance-tracker.md +440 -0
  116. package/extensions/agency-agents/agents/support/support-infrastructure-maintainer.md +616 -0
  117. package/extensions/agency-agents/agents/support/support-legal-compliance-checker.md +586 -0
  118. package/extensions/agency-agents/agents/support/support-support-responder.md +583 -0
  119. package/extensions/agency-agents/agents/testing/testing-accessibility-auditor.md +313 -0
  120. package/extensions/agency-agents/agents/testing/testing-api-tester.md +304 -0
  121. package/extensions/agency-agents/agents/testing/testing-evidence-collector.md +208 -0
  122. package/extensions/agency-agents/agents/testing/testing-performance-benchmarker.md +266 -0
  123. package/extensions/agency-agents/agents/testing/testing-reality-checker.md +236 -0
  124. package/extensions/agency-agents/agents/testing/testing-test-results-analyzer.md +303 -0
  125. package/extensions/agency-agents/agents/testing/testing-tool-evaluator.md +392 -0
  126. package/extensions/agency-agents/agents/testing/testing-workflow-optimizer.md +448 -0
  127. package/extensions/agency-agents/index.ts +733 -0
  128. package/extensions/agency-agents/node_modules/.bin/jiti +21 -0
  129. package/extensions/agency-agents/node_modules/.bin/tsc +21 -0
  130. package/extensions/agency-agents/node_modules/.bin/tsserver +21 -0
  131. package/extensions/agency-agents/node_modules/.bin/tsx +21 -0
  132. package/extensions/agency-agents/node_modules/.bin/vite +21 -0
  133. package/extensions/agency-agents/node_modules/.bin/vitest +21 -0
  134. package/extensions/agency-agents/node_modules/.bin/yaml +21 -0
  135. package/extensions/agency-agents/node_modules/.vite/vitest/da39a3ee5e6b4b0d3255bfef95601890afd80709/results.json +1 -0
  136. package/extensions/agency-agents/package.json +25 -0
  137. package/extensions/agency-agents/src/AgencyAgentsService.test.ts +443 -0
  138. package/extensions/agency-agents/src/AgencyAgentsService.ts +288 -0
  139. package/extensions/agency-agents/src/types.ts +147 -0
  140. package/extensions/agency-agents/vitest.config.ts +8 -0
  141. package/extensions/hexstrike-ai/README.md +98 -0
  142. package/extensions/hexstrike-ai/node_modules/.bin/tsc +21 -0
  143. package/extensions/hexstrike-ai/node_modules/.bin/tsserver +21 -0
  144. package/extensions/hexstrike-ai/package.json +29 -0
  145. package/extensions/hexstrike-ai/poolbot.plugin.json +31 -0
  146. package/extensions/hexstrike-ai/src/client.ts +91 -0
  147. package/extensions/hexstrike-ai/src/index.ts +170 -0
  148. package/extensions/hexstrike-ai/src/server/hexstrike_mcp.py +5470 -0
  149. package/extensions/hexstrike-ai/src/server/hexstrike_server.py +17289 -0
  150. package/extensions/hexstrike-ai/src/server/requirements.txt +84 -0
  151. package/extensions/hexstrike-ai/src/server-manager.ts +83 -0
  152. package/extensions/hexstrike-ai/tsconfig.json +20 -0
  153. package/extensions/hexstrike-bridge/package.json +1 -1
  154. package/extensions/hexstrike-bridge/poolbot.plugin.json +23 -0
  155. package/extensions/mcp-server/poolbot.plugin.json +10 -0
  156. package/extensions/page-agent/README.md +159 -0
  157. package/extensions/page-agent/index.ts +595 -0
  158. package/extensions/page-agent/node_modules/.bin/jiti +21 -0
  159. package/extensions/page-agent/node_modules/.bin/playwright +21 -0
  160. package/extensions/page-agent/node_modules/.bin/tsc +21 -0
  161. package/extensions/page-agent/node_modules/.bin/tsserver +21 -0
  162. package/extensions/page-agent/node_modules/.bin/tsx +21 -0
  163. package/extensions/page-agent/node_modules/.bin/vitest +21 -0
  164. package/extensions/page-agent/node_modules/.bin/yaml +21 -0
  165. package/extensions/page-agent/package.json +43 -0
  166. package/extensions/page-agent/src/PageAgentService.test.ts +517 -0
  167. package/extensions/page-agent/src/PageAgentService.ts +636 -0
  168. package/extensions/page-agent/src/PoolBotPageController.test.ts +358 -0
  169. package/extensions/page-agent/src/PoolBotPageController.ts +245 -0
  170. package/extensions/page-agent/src/index.ts +20 -0
  171. package/extensions/page-agent/src/tools.test.ts +231 -0
  172. package/extensions/page-agent/src/tools.ts +167 -0
  173. package/extensions/page-agent/src/types.ts +198 -0
  174. package/extensions/template/README.md +101 -0
  175. package/extensions/template/index.ts +38 -0
  176. package/extensions/template/package.json +15 -0
  177. package/extensions/template/poolbot.plugin.json +10 -0
  178. package/extensions/xyops/README.md +227 -0
  179. package/extensions/xyops/index.ts +342 -0
  180. package/extensions/xyops/node_modules/.bin/jiti +21 -0
  181. package/extensions/xyops/node_modules/.bin/tsc +21 -0
  182. package/extensions/xyops/node_modules/.bin/tsserver +21 -0
  183. package/extensions/xyops/node_modules/.bin/tsx +21 -0
  184. package/extensions/xyops/node_modules/.bin/vitest +21 -0
  185. package/extensions/xyops/node_modules/.bin/yaml +21 -0
  186. package/extensions/xyops/node_modules/.vite/vitest/da39a3ee5e6b4b0d3255bfef95601890afd80709/results.json +1 -0
  187. package/extensions/xyops/package.json +39 -0
  188. package/extensions/xyops/src/client.test.ts +467 -0
  189. package/extensions/xyops/src/client.ts +157 -0
  190. package/extensions/xyops/src/types.ts +147 -0
  191. package/extensions/xyops/vitest.config.ts +8 -0
  192. package/package.json +1 -1
  193. package/extensions/mavalie/README.md +0 -97
  194. package/extensions/mavalie/package.json +0 -15
  195. package/extensions/mavalie/src/index.ts +0 -62
@@ -0,0 +1,286 @@
1
+ # πŸ”¨ Phase 3 Playbook β€” Build & Iterate
2
+
3
+ > **Duration**: 2-12 weeks (varies by scope) | **Agents**: 15-30+ | **Gate Keeper**: Agents Orchestrator
4
+
5
+ ---
6
+
7
+ ## Objective
8
+
9
+ Implement all features through continuous Dev↔QA loops. Every task is validated before the next begins. This is where the bulk of the work happens β€” and where NEXUS's orchestration delivers the most value.
10
+
11
+ ## Pre-Conditions
12
+
13
+ - [ ] Phase 2 Quality Gate passed (foundation verified)
14
+ - [ ] Sprint Prioritizer backlog available with RICE scores
15
+ - [ ] CI/CD pipeline operational
16
+ - [ ] Design system and component library ready
17
+ - [ ] API scaffold with auth system ready
18
+
19
+ ## The Dev↔QA Loop β€” Core Mechanic
20
+
21
+ The Agents Orchestrator manages every task through this cycle:
22
+
23
+ ```
24
+ FOR EACH task IN sprint_backlog (ordered by RICE score):
25
+
26
+ 1. ASSIGN task to appropriate Developer Agent (see assignment matrix)
27
+ 2. Developer IMPLEMENTS task
28
+ 3. Evidence Collector TESTS task
29
+ - Visual screenshots (desktop, tablet, mobile)
30
+ - Functional verification against acceptance criteria
31
+ - Brand consistency check
32
+ 4. IF verdict == PASS:
33
+ Mark task complete
34
+ Move to next task
35
+ ELIF verdict == FAIL AND attempts < 3:
36
+ Send QA feedback to Developer
37
+ Developer FIXES specific issues
38
+ Return to step 3
39
+ ELIF attempts >= 3:
40
+ ESCALATE to Agents Orchestrator
41
+ Orchestrator decides: reassign, decompose, defer, or accept
42
+ 5. UPDATE pipeline status report
43
+ ```
44
+
45
+ ## Agent Assignment Matrix
46
+
47
+ ### Primary Developer Assignment
48
+
49
+ | Task Category | Primary Agent | Backup Agent | QA Agent |
50
+ |--------------|--------------|-------------|----------|
51
+ | **React/Vue/Angular UI** | Frontend Developer | Rapid Prototyper | Evidence Collector |
52
+ | **REST/GraphQL API** | Backend Architect | Senior Developer | API Tester |
53
+ | **Database operations** | Backend Architect | β€” | API Tester |
54
+ | **Mobile (iOS/Android)** | Mobile App Builder | β€” | Evidence Collector |
55
+ | **ML model/pipeline** | AI Engineer | β€” | Test Results Analyzer |
56
+ | **CI/CD/Infrastructure** | DevOps Automator | Infrastructure Maintainer | Performance Benchmarker |
57
+ | **Premium/complex feature** | Senior Developer | Backend Architect | Evidence Collector |
58
+ | **Quick prototype/POC** | Rapid Prototyper | Frontend Developer | Evidence Collector |
59
+ | **WebXR/immersive** | XR Immersive Developer | β€” | Evidence Collector |
60
+ | **visionOS** | visionOS Spatial Engineer | macOS Spatial/Metal Engineer | Evidence Collector |
61
+ | **Cockpit controls** | XR Cockpit Interaction Specialist | XR Interface Architect | Evidence Collector |
62
+ | **CLI/terminal tools** | Terminal Integration Specialist | β€” | API Tester |
63
+ | **Code intelligence** | LSP/Index Engineer | β€” | Test Results Analyzer |
64
+ | **Performance optimization** | Performance Benchmarker | Infrastructure Maintainer | Performance Benchmarker |
65
+
66
+ ### Specialist Support (activated as needed)
67
+
68
+ | Specialist | When to Activate | Trigger |
69
+ |-----------|-----------------|---------|
70
+ | UI Designer | Component needs visual refinement | Developer requests design guidance |
71
+ | Whimsy Injector | Feature needs delight/personality | UX review identifies opportunity |
72
+ | Visual Storyteller | Visual narrative content needed | Content requires visual assets |
73
+ | Brand Guardian | Brand consistency concern | QA finds brand deviation |
74
+ | XR Interface Architect | Spatial interaction design needed | XR feature requires UX guidance |
75
+ | Data Analytics Reporter | Deep data analysis needed | Feature requires analytics integration |
76
+
77
+ ## Parallel Build Tracks
78
+
79
+ For NEXUS-Full deployments, four tracks run simultaneously:
80
+
81
+ ### Track A: Core Product Development
82
+ ```
83
+ Managed by: Agents Orchestrator (Dev↔QA loop)
84
+ Agents: Frontend Developer, Backend Architect, AI Engineer,
85
+ Mobile App Builder, Senior Developer
86
+ QA: Evidence Collector, API Tester, Test Results Analyzer
87
+
88
+ Sprint cadence: 2-week sprints
89
+ Daily: Task implementation + QA validation
90
+ End of sprint: Sprint review + retrospective
91
+ ```
92
+
93
+ ### Track B: Growth & Marketing Preparation
94
+ ```
95
+ Managed by: Project Shepherd
96
+ Agents: Growth Hacker, Content Creator, Social Media Strategist,
97
+ App Store Optimizer
98
+
99
+ Sprint cadence: Aligned with Track A milestones
100
+ Activities:
101
+ - Growth Hacker β†’ Design viral loops and referral mechanics
102
+ - Content Creator β†’ Build launch content pipeline
103
+ - Social Media Strategist β†’ Plan cross-platform campaign
104
+ - App Store Optimizer β†’ Prepare store listing (if mobile)
105
+ ```
106
+
107
+ ### Track C: Quality & Operations
108
+ ```
109
+ Managed by: Agents Orchestrator
110
+ Agents: Evidence Collector, API Tester, Performance Benchmarker,
111
+ Workflow Optimizer, Experiment Tracker
112
+
113
+ Continuous activities:
114
+ - Evidence Collector β†’ Screenshot QA for every task
115
+ - API Tester β†’ Endpoint validation for every API task
116
+ - Performance Benchmarker β†’ Periodic load testing
117
+ - Workflow Optimizer β†’ Process improvement identification
118
+ - Experiment Tracker β†’ A/B test setup for validated features
119
+ ```
120
+
121
+ ### Track D: Brand & Experience Polish
122
+ ```
123
+ Managed by: Brand Guardian
124
+ Agents: UI Designer, Brand Guardian, Visual Storyteller,
125
+ Whimsy Injector
126
+
127
+ Triggered activities:
128
+ - UI Designer β†’ Component refinement when QA identifies visual issues
129
+ - Brand Guardian β†’ Periodic brand consistency audit
130
+ - Visual Storyteller β†’ Visual narrative assets as features complete
131
+ - Whimsy Injector β†’ Micro-interactions and delight moments
132
+ ```
133
+
134
+ ## Sprint Execution Template
135
+
136
+ ### Sprint Planning (Day 1)
137
+
138
+ ```
139
+ Sprint Prioritizer activates:
140
+ 1. Review backlog with updated RICE scores
141
+ 2. Select tasks for sprint based on team velocity
142
+ 3. Assign tasks to developer agents
143
+ 4. Identify dependencies and ordering
144
+ 5. Set sprint goal and success criteria
145
+
146
+ Output: Sprint Plan with task assignments
147
+ ```
148
+
149
+ ### Daily Execution (Day 2 to Day N-1)
150
+
151
+ ```
152
+ Agents Orchestrator manages:
153
+ 1. Current task status check
154
+ 2. Dev↔QA loop execution
155
+ 3. Blocker identification and resolution
156
+ 4. Progress tracking and reporting
157
+
158
+ Status report format:
159
+ - Tasks completed today: [list]
160
+ - Tasks in QA: [list]
161
+ - Tasks in development: [list]
162
+ - Blocked tasks: [list with reason]
163
+ - QA pass rate: [X/Y]
164
+ ```
165
+
166
+ ### Sprint Review (Day N)
167
+
168
+ ```
169
+ Project Shepherd facilitates:
170
+ 1. Demo completed features
171
+ 2. Review QA evidence for each task
172
+ 3. Collect stakeholder feedback
173
+ 4. Update backlog based on learnings
174
+
175
+ Participants: All active agents + stakeholders
176
+ Output: Sprint Review Summary
177
+ ```
178
+
179
+ ### Sprint Retrospective
180
+
181
+ ```
182
+ Workflow Optimizer facilitates:
183
+ 1. What went well?
184
+ 2. What could improve?
185
+ 3. What will we change next sprint?
186
+ 4. Process efficiency metrics
187
+
188
+ Output: Retrospective Action Items
189
+ ```
190
+
191
+ ## Orchestrator Decision Logic
192
+
193
+ ### Task Failure Handling
194
+
195
+ ```
196
+ WHEN task fails QA:
197
+ IF attempt == 1:
198
+ β†’ Send specific QA feedback to developer
199
+ β†’ Developer fixes ONLY the identified issues
200
+ β†’ Re-submit for QA
201
+
202
+ IF attempt == 2:
203
+ β†’ Send accumulated QA feedback
204
+ β†’ Consider: Is the developer agent the right fit?
205
+ β†’ Developer fixes with additional context
206
+ β†’ Re-submit for QA
207
+
208
+ IF attempt == 3:
209
+ β†’ ESCALATE
210
+ β†’ Options:
211
+ a) Reassign to different developer agent
212
+ b) Decompose task into smaller sub-tasks
213
+ c) Revise approach/architecture
214
+ d) Accept with known limitations (document)
215
+ e) Defer to future sprint
216
+ β†’ Document decision and rationale
217
+ ```
218
+
219
+ ### Parallel Task Management
220
+
221
+ ```
222
+ WHEN multiple tasks have no dependencies:
223
+ β†’ Assign to different developer agents simultaneously
224
+ β†’ Each runs independent Dev↔QA loop
225
+ β†’ Orchestrator tracks all loops concurrently
226
+ β†’ Merge completed tasks in dependency order
227
+
228
+ WHEN task has dependencies:
229
+ β†’ Wait for dependency to pass QA
230
+ β†’ Then assign dependent task
231
+ β†’ Include dependency context in handoff
232
+ ```
233
+
234
+ ## Quality Gate Checklist
235
+
236
+ | # | Criterion | Evidence Source | Status |
237
+ |---|-----------|----------------|--------|
238
+ | 1 | All sprint tasks pass QA (100% completion) | Evidence Collector screenshots per task | ☐ |
239
+ | 2 | All API endpoints validated | API Tester regression report | ☐ |
240
+ | 3 | Performance baselines met (P95 < 200ms) | Performance Benchmarker report | ☐ |
241
+ | 4 | Brand consistency verified (95%+ adherence) | Brand Guardian audit | ☐ |
242
+ | 5 | No critical bugs (zero P0/P1 open) | Test Results Analyzer summary | ☐ |
243
+ | 6 | All acceptance criteria met | Task-by-task verification | ☐ |
244
+ | 7 | Code review completed for all PRs | Git history evidence | ☐ |
245
+
246
+ ## Gate Decision
247
+
248
+ **Gate Keeper**: Agents Orchestrator
249
+
250
+ - **PASS**: Feature-complete application β†’ Phase 4 activation
251
+ - **CONTINUE**: More sprints needed β†’ Continue Phase 3
252
+ - **ESCALATE**: Systemic issues β†’ Studio Producer intervention
253
+
254
+ ## Handoff to Phase 4
255
+
256
+ ```markdown
257
+ ## Phase 3 β†’ Phase 4 Handoff Package
258
+
259
+ ### For Reality Checker:
260
+ - Complete application (all features implemented)
261
+ - All QA evidence from Dev↔QA loops
262
+ - API Tester regression results
263
+ - Performance Benchmarker baseline data
264
+ - Brand Guardian consistency audit
265
+ - Known issues list (if any accepted limitations)
266
+
267
+ ### For Legal Compliance Checker:
268
+ - Data handling implementation details
269
+ - Privacy policy implementation
270
+ - Consent management implementation
271
+ - Security measures implemented
272
+
273
+ ### For Performance Benchmarker:
274
+ - Application URLs for load testing
275
+ - Expected traffic patterns
276
+ - Performance budgets from architecture
277
+
278
+ ### For Infrastructure Maintainer:
279
+ - Production environment requirements
280
+ - Scaling configuration needs
281
+ - Monitoring alert thresholds
282
+ ```
283
+
284
+ ---
285
+
286
+ *Phase 3 is complete when all sprint tasks pass QA, all API endpoints are validated, performance baselines are met, and no critical bugs remain open.*
@@ -0,0 +1,332 @@
1
+ # πŸ›‘οΈ Phase 4 Playbook β€” Quality & Hardening
2
+
3
+ > **Duration**: 3-7 days | **Agents**: 8 | **Gate Keeper**: Reality Checker (sole authority)
4
+
5
+ ---
6
+
7
+ ## Objective
8
+
9
+ The final quality gauntlet. The Reality Checker defaults to "NEEDS WORK" β€” you must prove production readiness with overwhelming evidence. This phase exists because first implementations typically need 2-3 revision cycles, and that's healthy.
10
+
11
+ ## Pre-Conditions
12
+
13
+ - [ ] Phase 3 Quality Gate passed (all tasks QA'd)
14
+ - [ ] Phase 3 Handoff Package received
15
+ - [ ] All features implemented and individually verified
16
+
17
+ ## Critical Mindset
18
+
19
+ > **The Reality Checker's default verdict is NEEDS WORK.**
20
+ >
21
+ > This is not pessimism β€” it's realism. Production readiness requires:
22
+ > - Complete user journeys working end-to-end
23
+ > - Cross-device consistency (desktop, tablet, mobile)
24
+ > - Performance under load (not just happy path)
25
+ > - Security validation (not just "we added auth")
26
+ > - Specification compliance (every requirement, not most)
27
+ >
28
+ > A B/B+ rating on first pass is normal and expected.
29
+
30
+ ## Agent Activation Sequence
31
+
32
+ ### Step 1: Evidence Collection (Day 1-2, All Parallel)
33
+
34
+ #### πŸ“Έ Evidence Collector β€” Comprehensive Visual Evidence
35
+ ```
36
+ Activate Evidence Collector for comprehensive system evidence on [PROJECT].
37
+
38
+ Deliverables required:
39
+ 1. Full screenshot suite:
40
+ - Desktop (1920x1080) β€” every page/view
41
+ - Tablet (768x1024) β€” every page/view
42
+ - Mobile (375x667) β€” every page/view
43
+ 2. Interaction evidence:
44
+ - Navigation flows (before/after clicks)
45
+ - Form interactions (empty, filled, submitted, error states)
46
+ - Modal/dialog interactions
47
+ - Accordion/expandable content
48
+ 3. Theme evidence:
49
+ - Light mode β€” all pages
50
+ - Dark mode β€” all pages
51
+ - System preference detection
52
+ 4. Error state evidence:
53
+ - 404 pages
54
+ - Form validation errors
55
+ - Network error handling
56
+ - Empty states
57
+
58
+ Format: Screenshot Evidence Package with test-results.json
59
+ Timeline: 2 days
60
+ ```
61
+
62
+ #### πŸ”Œ API Tester β€” Full API Regression
63
+ ```
64
+ Activate API Tester for complete API regression on [PROJECT].
65
+
66
+ Deliverables required:
67
+ 1. Endpoint regression suite:
68
+ - All endpoints tested (GET, POST, PUT, DELETE)
69
+ - Authentication/authorization verification
70
+ - Input validation testing
71
+ - Error response verification
72
+ 2. Integration testing:
73
+ - Cross-service communication
74
+ - Database operation verification
75
+ - External API integration
76
+ 3. Edge case testing:
77
+ - Rate limiting behavior
78
+ - Large payload handling
79
+ - Concurrent request handling
80
+ - Malformed input handling
81
+
82
+ Format: API Test Report with pass/fail per endpoint
83
+ Timeline: 2 days
84
+ ```
85
+
86
+ #### ⚑ Performance Benchmarker β€” Load Testing
87
+ ```
88
+ Activate Performance Benchmarker for load testing on [PROJECT].
89
+
90
+ Deliverables required:
91
+ 1. Load test at 10x expected traffic:
92
+ - Response time distribution (P50, P95, P99)
93
+ - Throughput under load
94
+ - Error rate under load
95
+ - Resource utilization (CPU, memory, network)
96
+ 2. Core Web Vitals measurement:
97
+ - LCP (Largest Contentful Paint) < 2.5s
98
+ - FID (First Input Delay) < 100ms
99
+ - CLS (Cumulative Layout Shift) < 0.1
100
+ 3. Database performance:
101
+ - Query execution times
102
+ - Connection pool utilization
103
+ - Index effectiveness
104
+ 4. Stress test results:
105
+ - Breaking point identification
106
+ - Graceful degradation behavior
107
+ - Recovery time after overload
108
+
109
+ Format: Performance Certification Report
110
+ Timeline: 2 days
111
+ ```
112
+
113
+ #### βš–οΈ Legal Compliance Checker β€” Final Compliance Audit
114
+ ```
115
+ Activate Legal Compliance Checker for final compliance audit on [PROJECT].
116
+
117
+ Deliverables required:
118
+ 1. Privacy compliance verification:
119
+ - Privacy policy accuracy
120
+ - Consent management functionality
121
+ - Data subject rights implementation
122
+ - Cookie consent implementation
123
+ 2. Security compliance:
124
+ - Data encryption (at rest and in transit)
125
+ - Authentication security
126
+ - Input sanitization
127
+ - OWASP Top 10 check
128
+ 3. Regulatory compliance:
129
+ - GDPR requirements (if applicable)
130
+ - CCPA requirements (if applicable)
131
+ - Industry-specific requirements
132
+ 4. Accessibility compliance:
133
+ - WCAG 2.1 AA verification
134
+ - Screen reader compatibility
135
+ - Keyboard navigation
136
+
137
+ Format: Compliance Certification Report
138
+ Timeline: 2 days
139
+ ```
140
+
141
+ ### Step 2: Analysis (Day 3-4, Parallel, after Step 1)
142
+
143
+ #### πŸ“Š Test Results Analyzer β€” Quality Metrics Aggregation
144
+ ```
145
+ Activate Test Results Analyzer for quality metrics aggregation on [PROJECT].
146
+
147
+ Input: ALL Step 1 reports
148
+ Deliverables required:
149
+ 1. Aggregate quality dashboard:
150
+ - Overall quality score
151
+ - Category breakdown (visual, functional, performance, security, compliance)
152
+ - Issue severity distribution
153
+ - Trend analysis (if multiple test cycles)
154
+ 2. Issue prioritization:
155
+ - Critical issues (must fix before production)
156
+ - High issues (should fix before production)
157
+ - Medium issues (fix in next sprint)
158
+ - Low issues (backlog)
159
+ 3. Risk assessment:
160
+ - Production readiness probability
161
+ - Remaining risk areas
162
+ - Recommended mitigations
163
+
164
+ Format: Quality Metrics Dashboard
165
+ Timeline: 1 day
166
+ ```
167
+
168
+ #### πŸ”„ Workflow Optimizer β€” Process Efficiency Review
169
+ ```
170
+ Activate Workflow Optimizer for process efficiency review on [PROJECT].
171
+
172
+ Input: Phase 3 execution data + Step 1 findings
173
+ Deliverables required:
174
+ 1. Process efficiency analysis:
175
+ - Dev↔QA loop efficiency (first-pass rate, average retries)
176
+ - Bottleneck identification
177
+ - Time-to-resolution for different issue types
178
+ 2. Improvement recommendations:
179
+ - Process changes for Phase 6 operations
180
+ - Automation opportunities
181
+ - Quality improvement suggestions
182
+
183
+ Format: Optimization Recommendations Report
184
+ Timeline: 1 day
185
+ ```
186
+
187
+ #### πŸ—οΈ Infrastructure Maintainer β€” Production Readiness Check
188
+ ```
189
+ Activate Infrastructure Maintainer for production readiness on [PROJECT].
190
+
191
+ Deliverables required:
192
+ 1. Production environment validation:
193
+ - All services healthy and responding
194
+ - Auto-scaling configured and tested
195
+ - Load balancer configuration verified
196
+ - SSL/TLS certificates valid
197
+ 2. Monitoring validation:
198
+ - All critical metrics being collected
199
+ - Alert rules configured and tested
200
+ - Dashboard access verified
201
+ - Log aggregation working
202
+ 3. Disaster recovery validation:
203
+ - Backup systems operational
204
+ - Recovery procedures documented and tested
205
+ - Failover mechanisms verified
206
+ 4. Security validation:
207
+ - Firewall rules reviewed
208
+ - Access controls verified
209
+ - Secrets management confirmed
210
+ - Vulnerability scan clean
211
+
212
+ Format: Infrastructure Readiness Report
213
+ Timeline: 1 day
214
+ ```
215
+
216
+ ### Step 3: Final Judgment (Day 5-7, Sequential)
217
+
218
+ #### πŸ” Reality Checker β€” THE FINAL VERDICT
219
+ ```
220
+ Activate Reality Checker for final integration testing on [PROJECT].
221
+
222
+ MANDATORY PROCESS β€” DO NOT SKIP:
223
+
224
+ Step 1: Reality Check Commands
225
+ - Verify what was actually built (ls, grep for claimed features)
226
+ - Cross-check claimed features against specification
227
+ - Run comprehensive screenshot capture
228
+ - Review all evidence from Step 1 and Step 2
229
+
230
+ Step 2: QA Cross-Validation
231
+ - Review Evidence Collector findings
232
+ - Cross-reference with API Tester results
233
+ - Verify Performance Benchmarker data
234
+ - Confirm Legal Compliance Checker findings
235
+
236
+ Step 3: End-to-End System Validation
237
+ - Test COMPLETE user journeys (not individual features)
238
+ - Verify responsive behavior across ALL devices
239
+ - Check interaction flows end-to-end
240
+ - Review actual performance data
241
+
242
+ Step 4: Specification Reality Check
243
+ - Quote EXACT text from original specification
244
+ - Compare with ACTUAL implementation evidence
245
+ - Document EVERY gap between spec and reality
246
+ - No assumptions β€” evidence only
247
+
248
+ VERDICT OPTIONS:
249
+ - READY: Overwhelming evidence of production readiness (rare first pass)
250
+ - NEEDS WORK: Specific issues identified with fix list (expected)
251
+ - NOT READY: Major architectural issues requiring Phase 1/2 revisit
252
+
253
+ Format: Reality-Based Integration Report
254
+ Default: NEEDS WORK unless proven otherwise
255
+ ```
256
+
257
+ ## Quality Gate β€” THE FINAL GATE
258
+
259
+ | # | Criterion | Threshold | Evidence Required |
260
+ |---|-----------|-----------|-------------------|
261
+ | 1 | User journeys complete | All critical paths working end-to-end | Reality Checker screenshots |
262
+ | 2 | Cross-device consistency | Desktop + Tablet + Mobile all working | Responsive screenshots |
263
+ | 3 | Performance certified | P95 < 200ms, LCP < 2.5s, uptime > 99.9% | Performance Benchmarker report |
264
+ | 4 | Security validated | Zero critical vulnerabilities | Security scan + compliance report |
265
+ | 5 | Compliance certified | All regulatory requirements met | Legal Compliance Checker report |
266
+ | 6 | Specification compliance | 100% of spec requirements implemented | Point-by-point verification |
267
+ | 7 | Infrastructure ready | Production environment validated | Infrastructure Maintainer report |
268
+
269
+ ## Gate Decision
270
+
271
+ **Sole authority**: Reality Checker
272
+
273
+ ### If READY (proceed to Phase 5):
274
+ ```markdown
275
+ ## Phase 4 β†’ Phase 5 Handoff Package
276
+
277
+ ### For Launch Team:
278
+ - Reality Checker certification report
279
+ - Performance certification
280
+ - Compliance certification
281
+ - Infrastructure readiness report
282
+ - Known limitations (if any)
283
+
284
+ ### For Growth Hacker:
285
+ - Product ready for users
286
+ - Feature list for marketing messaging
287
+ - Performance data for credibility
288
+
289
+ ### For DevOps Automator:
290
+ - Production deployment approved
291
+ - Blue-green deployment plan
292
+ - Rollback procedures confirmed
293
+ ```
294
+
295
+ ### If NEEDS WORK (return to Phase 3):
296
+ ```markdown
297
+ ## Phase 4 β†’ Phase 3 Return Package
298
+
299
+ ### Fix List (from Reality Checker):
300
+ 1. [Critical Issue 1]: [Description + evidence + fix instruction]
301
+ 2. [Critical Issue 2]: [Description + evidence + fix instruction]
302
+ 3. [High Issue 1]: [Description + evidence + fix instruction]
303
+ ...
304
+
305
+ ### Process:
306
+ - Issues enter Dev↔QA loop (Phase 3 mechanics)
307
+ - Each fix must pass Evidence Collector QA
308
+ - When all fixes complete β†’ Return to Phase 4 Step 3
309
+ - Reality Checker re-evaluates with updated evidence
310
+
311
+ ### Expected: 2-3 revision cycles is normal
312
+ ```
313
+
314
+ ### If NOT READY (return to Phase 1/2):
315
+ ```markdown
316
+ ## Phase 4 β†’ Phase 1/2 Return Package
317
+
318
+ ### Architectural Issues Identified:
319
+ 1. [Fundamental Issue]: [Why it can't be fixed in Phase 3]
320
+ 2. [Structural Problem]: [What needs to change at architecture level]
321
+
322
+ ### Recommended Action:
323
+ - [ ] Revise system architecture (Phase 1)
324
+ - [ ] Rebuild foundation (Phase 2)
325
+ - [ ] Descope and redefine (Phase 1)
326
+
327
+ ### Studio Producer Decision Required
328
+ ```
329
+
330
+ ---
331
+
332
+ *Phase 4 is complete when the Reality Checker issues a READY verdict with overwhelming evidence. NEEDS WORK is the expected first-pass result β€” it means the system is working but needs polish.*