agentic-sdlc 1.5.1 → 1.8.1

Files changed (149)
  1. package/.agent/rules/agent-execution.md +55 -0
  2. package/.agent/rules/ai-enforcement.md +4 -3
  3. package/.agent/rules/artifacts.md +79 -77
  4. package/.agent/rules/auto-learning.md +78 -0
  5. package/.agent/rules/code-quality.md +40 -0
  6. package/.agent/rules/git-workflow.md +44 -24
  7. package/.agent/rules/global.md +10 -6
  8. package/.agent/rules/naming-conventions.md +55 -0
  9. package/.agent/skills/role-ba.md +6 -2
  10. package/.agent/skills/role-brain.md +5 -1
  11. package/.agent/skills/role-cloud.md +38 -0
  12. package/.agent/skills/role-dev.md +31 -5
  13. package/.agent/skills/role-devops.md +9 -0
  14. package/.agent/skills/role-game.md +35 -0
  15. package/.agent/skills/role-mobile.md +55 -0
  16. package/.agent/skills/role-orchestrator.md +4 -0
  17. package/.agent/skills/role-pm.md +4 -0
  18. package/.agent/skills/role-po.md +4 -0
  19. package/.agent/skills/role-reporter.md +4 -0
  20. package/.agent/skills/role-research.md +78 -0
  21. package/.agent/skills/role-sa.md +4 -0
  22. package/.agent/skills/role-seca.md +4 -0
  23. package/.agent/skills/role-stakeholder.md +4 -0
  24. package/.agent/skills/role-tester.md +15 -3
  25. package/.agent/skills/role-uiux.md +4 -0
  26. package/.agent/templates/CHANGELOG-Template.md +2 -2
  27. package/.agent/templates/Design-Verification-Report-Template.md +4 -4
  28. package/.agent/templates/DevOps-Plan-Template.md +7 -0
  29. package/.agent/templates/Specification-Template.md +38 -0
  30. package/.agent/templates/ab-comparison-report.md +175 -0
  31. package/.agent/templates/observer-report.md +131 -0
  32. package/.agent/templates/quality-score-report.md +197 -0
  33. package/.agent/templates/self-learning-digest.md +268 -0
  34. package/.agent/templates/system-health-report.md +330 -0
  35. package/.agent/workflows/ab.md +101 -0
  36. package/.agent/workflows/autogen.md +65 -0
  37. package/.agent/workflows/brain.md +52 -42
  38. package/.agent/workflows/commit.md +61 -0
  39. package/.agent/workflows/cycle.md +36 -15
  40. package/.agent/workflows/debug.md +123 -0
  41. package/.agent/workflows/deep-search.md +82 -0
  42. package/.agent/workflows/docs.md +144 -0
  43. package/.agent/workflows/emergency.md +17 -15
  44. package/.agent/workflows/explore.md +15 -9
  45. package/.agent/workflows/housekeeping.md +24 -11
  46. package/.agent/workflows/metrics.md +14 -12
  47. package/.agent/workflows/monitor.md +98 -0
  48. package/.agent/workflows/observe.md +84 -0
  49. package/.agent/workflows/onboarding.md +135 -0
  50. package/.agent/workflows/orchestrator.md +21 -14
  51. package/.agent/workflows/planning.md +126 -0
  52. package/.agent/workflows/refactor.md +132 -0
  53. package/.agent/workflows/release.md +19 -12
  54. package/.agent/workflows/review.md +99 -0
  55. package/.agent/workflows/score.md +104 -0
  56. package/.agent/workflows/sprint.md +16 -14
  57. package/.agent/workflows/validate.md +13 -11
  58. package/.agent/workflows/worktree.md +154 -0
  59. package/CHANGELOG.md +88 -0
  60. package/README.md +12 -4
  61. package/bin/cli.js +143 -13
  62. package/docs/.brain-health-history.json +42 -0
  63. package/docs/.brain-improvements.json +53 -0
  64. package/docs/.brain-learner-log.json +27 -0
  65. package/docs/.brain-scores.json +310 -0
  66. package/docs/architecture/system-flow.mermaid +81 -0
  67. package/docs/artifacts/2026-01-05-enforcement-gates-plan.md +80 -0
  68. package/docs/artifacts/2026-01-05-workflow-analysis.md +231 -0
  69. package/docs/artifacts/README.md +26 -0
  70. package/docs/guides/MCP-GUIDE.md +1 -0
  71. package/docs/reports/2026-01-05-autogen-evaluation.md +64 -0
  72. package/docs/reports/2026-01-05-brain-layer-analysis.md +109 -0
  73. package/docs/reports/2026-01-05-repository-audit.md +253 -0
  74. package/docs/reports/Metrics-Dashboard-2026-01-08.md +29 -0
  75. package/docs/reports/Metrics-Dashboard-Final.md +29 -0
  76. package/docs/reports/Validation-Report-2026-01-05.md +40 -0
  77. package/docs/reports/Validation-Report-2026-01-08.md +40 -0
  78. package/docs/reports/worktrunk-audit.md +94 -0
  79. package/docs/solutions/README.md +96 -0
  80. package/docs/walkthroughs/2026-01-05-audit-implementation.md +36 -0
  81. package/docs/walkthroughs/2026-01-05-autonomy-release.md +54 -0
  82. package/docs/walkthroughs/2026-01-05-enforcement-gates.md +33 -0
  83. package/docs/walkthroughs/2026-01-05-judge-enhancement.md +30 -0
  84. package/docs/walkthroughs/2026-01-05-landing-page-orchestrator.md +52 -0
  85. package/docs/walkthroughs/2026-01-05-validation.md +32 -0
  86. package/docs/walkthroughs/2026-01-05-workflow-audit.md +89 -0
  87. package/docs/walkthroughs/2026-01-05-workflow-refactoring.md +44 -0
  88. package/docs/walkthroughs/2026-01-06-worktrunk-integration.md +41 -0
  89. package/docs/walkthroughs/README.md +25 -0
  90. package/package.json +33 -19
  91. package/.agent/knowledge-base/AUTO-LEARNING-GUIDE.md +0 -327
  92. package/.agent/knowledge-base/HOW-IT-WORKS.md +0 -365
  93. package/.agent/knowledge-base/INDEX.md +0 -43
  94. package/.agent/knowledge-base/README.md +0 -242
  95. package/.agent/knowledge-base/architecture/.gitkeep +0 -1
  96. package/.agent/knowledge-base/architecture/KB-2026-01-01-003-neo4j-graph-database-skills.md +0 -1146
  97. package/.agent/knowledge-base/architecture/README.md +0 -98
  98. package/.agent/knowledge-base/bugs/.gitkeep +0 -1
  99. package/.agent/knowledge-base/bugs/KB-2026-01-02-yaml-special-character-escaping.md +0 -56
  100. package/.agent/knowledge-base/bugs/medium/KB-2026-01-01-001-example-auto-learned.md +0 -198
  101. package/.agent/knowledge-base/features/.gitkeep +0 -1
  102. package/.agent/knowledge-base/features/KB-2026-01-01-001-landing-page-design-trends-2026.md +0 -646
  103. package/.agent/knowledge-base/features/KB-2026-01-01-004-uiux-design-skills-2026.md +0 -945
  104. package/.agent/knowledge-base/features/KB-2026-01-01-005-modern-ai-landing-page-ui.md +0 -310
  105. package/.agent/knowledge-base/features/KB-2026-01-01-006-award-winning-landing-page-patterns.md +0 -324
  106. package/.agent/knowledge-base/features/KB-2026-01-02-001-cleanup-workflow.md +0 -242
  107. package/.agent/knowledge-base/features/KB-2026-01-02-002-landing-page-monorepo-architecture.md +0 -148
  108. package/.agent/knowledge-base/features/KB-2026-01-02-003-premium-glassmorphism-patterns.md +0 -58
  109. package/.agent/knowledge-base/features/KB-2026-01-04-ai-agent-enforcement.md +0 -46
  110. package/.agent/knowledge-base/features/README.md +0 -83
  111. package/.agent/knowledge-base/features/figma-landing-page-workflow.md +0 -311
  112. package/.agent/knowledge-base/features/figma-mcp-sa-guide.md +0 -673
  113. package/.agent/knowledge-base/features/figma-mcp-uiux-guide.md +0 -459
  114. package/.agent/knowledge-base/performance/.gitkeep +0 -1
  115. package/.agent/knowledge-base/performance/KB-2026-01-02-lazy-loading-optimization.md +0 -80
  116. package/.agent/knowledge-base/platform-specific/.gitkeep +0 -1
  117. package/.agent/knowledge-base/platform-specific/KB-2026-01-02-windows-console-encoding.md +0 -56
  118. package/.agent/knowledge-base/role-guides/DEV-KB-Guide.md +0 -527
  119. package/.agent/knowledge-base/role-guides/DEVOPS-KB-Guide.md +0 -491
  120. package/.agent/knowledge-base/role-guides/PM-KB-Guide.md +0 -299
  121. package/.agent/knowledge-base/role-guides/SECA-KB-Guide.md +0 -555
  122. package/.agent/knowledge-base/role-guides/TESTER-KB-Guide.md +0 -519
  123. package/.agent/knowledge-base/security/.gitkeep +0 -1
  124. package/.agent/knowledge-base/security/KB-2026-01-02-input-validation-sanitization.md +0 -74
  125. package/.agent/rules/AUTO-LEARNING.md +0 -418
  126. package/.agent/rules/KNOWLEDGE-BASE.md +0 -45
  127. package/.agent/skills/role-qa.md +0 -81
  128. package/.agent/workflows/compound.md +0 -51
  129. package/.agent/workflows/preflight.md +0 -35
  130. package/.agent/workflows/route.md +0 -160
  131. package/bin/kb +0 -34
  132. package/bin/kb.bat +0 -28
  133. package/bin/kb_cli.py +0 -226
  134. package/bin/lib/README.md +0 -411
  135. package/bin/lib/__init__.py +0 -7
  136. package/bin/lib/__pycache__/kb_add.cpython-313.pyc +0 -0
  137. package/bin/lib/__pycache__/kb_common.cpython-313.pyc +0 -0
  138. package/bin/lib/__pycache__/kb_compound.cpython-313.pyc +0 -0
  139. package/bin/lib/__pycache__/kb_index.cpython-313.pyc +0 -0
  140. package/bin/lib/__pycache__/kb_list.cpython-313.pyc +0 -0
  141. package/bin/lib/__pycache__/kb_search.cpython-313.pyc +0 -0
  142. package/bin/lib/__pycache__/kb_stats.cpython-313.pyc +0 -0
  143. package/bin/lib/kb_add.py +0 -203
  144. package/bin/lib/kb_common.py +0 -224
  145. package/bin/lib/kb_compound.py +0 -250
  146. package/bin/lib/kb_index.py +0 -193
  147. package/bin/lib/kb_list.py +0 -144
  148. package/bin/lib/kb_search.py +0 -121
  149. package/bin/lib/kb_stats.py +0 -153
@@ -0,0 +1,81 @@
1
+ graph TD
2
+ %% Actors
3
+ User([User])
4
+
5
+ %% Brain Layer
6
+ Brain[Brain Meta-Controller]
7
+ Judge[Judge]
8
+ Learner[Learner]
9
+
10
+ %% Workflows
11
+ Cycle["/cycle"]
12
+ Explore["/explore"]
13
+ Emergency["/emergency"]
14
+ Sprint["/sprint"]
15
+ Metrics["/metrics"]
16
+
17
+ %% Roles
18
+ PM[Project Manager]
19
+ BA[Business Analyst]
20
+ SA[System Analyst]
21
+ UIUX[UI/UX Designer]
22
+ PO[Product Owner]
23
+ QA[Tester]
24
+ SecA[Security Analyst]
25
+ Dev[Developer]
26
+ DevOps[DevOps]
27
+ Reporter[Reporter]
28
+ Stakeholder[Stakeholder]
29
+
30
+ %% Main Flow
31
+ User -->|Directives| Brain
32
+ Brain -->|Route| Cycle
33
+ Brain -->|Route| Explore
34
+ Brain -->|Route| Emergency
35
+ Brain -->|Route| Sprint
36
+
37
+ %% Sprint Flow
38
+ Sprint -->|Start| PM
39
+ PM -->|Plan| BA
40
+ BA -->|Reqs| SA
41
+ BA -->|Reqs| UIUX
42
+ BA -->|Reqs| PO
43
+
44
+ PO -->|Backlog| SA
45
+ PO -->|Backlog| UIUX
46
+
47
+ SA -->|Design| QA
48
+ UIUX -->|Design| QA
49
+
50
+ QA -->|Verify| SecA
51
+ SecA -->|Verify| Dev
52
+
53
+ Dev -->|Code| DevOps
54
+ DevOps -->|Deploy| Tester["Tester/QA"]
55
+
56
+ Tester -->|Pass| Reporter
57
+ Tester -->|Fail| Dev
58
+
59
+ Reporter -->|Report| Stakeholder
60
+ Stakeholder -->|Approve| Brain
61
+ Stakeholder -->|Reject| PM
62
+
63
+ %% Sub-flows
64
+ Cycle -->|Task| Dev
65
+ Explore -->|Analysis| PM
66
+ Emergency -->|Hotfix| Dev
67
+
68
+ %% Feedback Loops
69
+ Brain -->|Score| Judge
70
+ Judge -->|Feedback| Brain
71
+ Brain -->|Learn| Learner
72
+ Learner -->|Update| Brain
73
+
74
+ %% Next Steps (Explicit)
75
+ PM -.->|Next| SA
76
+ SA -.->|Next| QA
77
+ QA -.->|Next| Dev
78
+ Dev -.->|Next| Tester
79
+ Tester -.->|Next| Reporter
80
+ Reporter -.->|Next| Stakeholder
81
+ Stakeholder -.->|Next| Brain
@@ -0,0 +1,80 @@
1
+ # Implementation Plan: Strengthen Brain Protocol Enforcement
2
+
3
+ **Date:** 2026-01-05
4
+ **Issue:** Brain tools exist but not being called during agent sessions
5
+
6
+ ---
7
+
8
+ ## 🔴 User Identified Gaps
9
+
10
+ | # | Gap | Current State | Required State |
11
+ |---|-----|---------------|----------------|
12
+ | 1 | **Observer** | Not halting on errors | MUST halt, fix, resume |
13
+ | 2 | **A/B Testing** | Not used | Use for small tasks |
14
+ | 3 | **Planning** | Jump to implementation | MUST plan first |
15
+ | 4 | **Self-Improve** | Not running | Run after each session |
16
+ | 5 | **Reports** | No artifacts | MUST create walkthrough |
17
+ | 6 | **Housekeeping** | Not triggered | Run after task completion |
18
+
19
+ ---
20
+
21
+ ## Proposed Changes
22
+
23
+ ### [MODIFY] [GEMINI.md](file:///d:/dev/agentic-sdlc/GEMINI.md)
24
+
25
+ Add **CRITICAL ENFORCEMENT GATES** section with mandatory checkpoints:
26
+
27
+ ```markdown
28
+ ## 🚨 CRITICAL ENFORCEMENT GATES
29
+
30
+ ### Gate 1: Pre-Task (BEFORE ANYTHING)
31
+ ```bash
32
+ python tools/brain/observer.py --status
33
+ python tools/brain/model_optimizer.py --recommend "[task]"
34
+ ```
35
+ Decision: If task is small, consider A/B testing.
36
+
37
+ ### Gate 2: Planning (BEFORE CODE)
38
+ - Create implementation_plan.md
39
+ - Get user approval before execution
40
+
41
+ ### Gate 3: Error Handling
42
+ If ANY script fails:
43
+ 1. STOP immediately
44
+ 2. Call: `python tools/brain/observer.py --halt "[error]"`
45
+ 3. Fix the issue
46
+ 4. Call: `python tools/brain/observer.py --resume`
47
+
48
+ ### Gate 4: Post-Task (AFTER COMPLETION)
49
+ ```bash
50
+ python tools/brain/learner.py --learn "[task]"
51
+ python tools/brain/judge.py --score "[artifact]"
52
+ python tools/brain/self_improver.py --analyze
53
+ python bin/kb_cli.py compound sync
54
+ ```
55
+
56
+ ### Gate 5: Reporting
57
+ - Create walkthrough.md
58
+ - Save to docs/walkthroughs/
59
+
60
+ ### Gate 6: Cleanup
61
+ ```bash
62
+ python tools/workflows/housekeeping.py
63
+ ```
64
+ ```
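The halt, fix, resume sequence in Gate 3 can be sketched as a small wrapper. This is a minimal illustration, not the package's implementation: the script path and the `--halt` / `--resume` flags are copied from the plan above, while the helper names (`halt_command`, `resume_command`, `run_gated`) and the injectable `run` parameter are hypothetical and exist only to make the sketch testable.

```python
import subprocess
from typing import Callable, List, Sequence

# Path and flags as written in the plan; whether observer.py accepts them is assumed.
OBSERVER = ["python", "tools/brain/observer.py"]


def halt_command(error: str) -> List[str]:
    """Build the Gate 3 'halt' call for a given error message."""
    return OBSERVER + ["--halt", error]


def resume_command() -> List[str]:
    """Build the Gate 3 'resume' call, to be issued after the fix."""
    return OBSERVER + ["--resume"]


def run_gated(cmd: Sequence[str], run: Callable = subprocess.run) -> bool:
    """Run cmd; on a non-zero exit, report a halt and return False.

    The caller is expected to fix the issue and then run resume_command().
    """
    result = run(list(cmd), capture_output=True, text=True)
    if result.returncode == 0:
        return True
    # Prefer the script's stderr as the error text, fall back to the exit code.
    error = (result.stderr or "").strip() or f"exit code {result.returncode}"
    run(halt_command(error), capture_output=True, text=True)
    return False
```

A caller would stop at the first `False`, apply the fix, and only then execute `resume_command()` before continuing.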
65
+
66
+ ---
67
+
68
+ ## Verification Plan
69
+
70
+ After implementation:
71
+ 1. Test a small task with A/B testing
72
+ 2. Intentionally cause an error to test halt
73
+ 3. Verify planning step is enforced
74
+ 4. Check report generation
75
+
76
+ ---
77
+
78
+ ## ❓ Awaiting Approval
79
+
80
+ Proceed with strengthening GEMINI.md enforcement?
@@ -0,0 +1,231 @@
1
+ # 🔬 Workflow Analysis Report: Add/Remove Recommendations
2
+
3
+ **Date:** 2026-01-05
4
+ **Purpose:** Deep analysis of `.agent/workflows/` to recommend additions and removals
5
+
6
+ ---
7
+
8
+ ## 📊 Current Workflow Inventory (13 workflows)
9
+
10
+ | Workflow | Type | Size | Purpose |
11
+ |----------|------|------|---------|
12
+ | `brain.md` | Support | 2.6KB | Meta-level controller, sync, learning |
13
+ | `compound.md` | Support | 1.0KB | Knowledge capture after tasks |
14
+ | `cycle.md` | Process | 2.0KB | Task lifecycle (plan→work→review) |
15
+ | `emergency.md` | Process | 3.3KB | Hotfix/incident response |
16
+ | `explore.md` | Process | 3.5KB | Deep investigation |
17
+ | `housekeeping.md` | Support | 2.9KB | Cleanup and maintenance |
18
+ | `metrics.md` | Utility | 3.9KB | Project statistics |
19
+ | `orchestrator.md` | Process | 2.2KB | Full SDLC automation |
20
+ | `preflight.md` | Support | 1.0KB | Pre-task checks |
21
+ | `release.md` | Support | 3.5KB | Changelog & versioning |
22
+ | `route.md` | Support | 3.5KB | Workflow selection helper |
23
+ | `sprint.md` | Process | 3.3KB | Sprint lifecycle |
24
+ | `validate.md` | Utility | 3.7KB | Workflow compliance check |
25
+
26
+ ---
27
+
28
+ ## 🔴 RECOMMEND REMOVAL (3 workflows)
29
+
30
+ ### 1. ❌ REMOVE: `preflight.md`
31
+
32
+ **Reason:**
33
+ - **Redundancy:** This workflow duplicates the "Enforcement Reminder" that already exists at the bottom of EVERY workflow file
34
+ - **Not automated:** Contains manual steps that AI agents already follow naturally
35
+ - **Low value:** The GEMINI.md already enforces pre-flight checks via the "Pre-Flight Checklist" section
36
+ - **Confusion:** Having both `preflight.md` AND enforcement reminders in each workflow creates duplication
37
+
38
+ **Evidence:**
39
+ ```markdown
40
+ # Every workflow ends with:
41
+ ## ENFORCEMENT REMINDER
42
+ Before executing, complete /preflight checks.
43
+ ```
44
+
45
+ **Alternative:** The enforcement is already embedded. Remove this standalone workflow.
46
+
47
+ ---
48
+
49
+ ### 2. ❌ REMOVE: `route.md`
50
+
51
+ **Reason:**
52
+ - **Redundancy with GEMINI.md:** The routing logic is already documented in `GEMINI.md` under "Role Activation Matrix" and "Slash Command Interpretation"
53
+ - **Static content:** Contains no executable commands - it's purely reference documentation
54
+ - **Better placement:** This should be reference documentation in `.agent/rules/` or `GEMINI.md`, not a workflow
55
+ - **No /route command exists:** The routing happens automatically via `/orchestrator` and brain
56
+
57
+ **Evidence:**
58
+ - GEMINI.md already has:
59
+ ```markdown
60
+ ### Role Activation Matrix
61
+ | Task Type | Required Roles | Workflow |
62
+ |-----------|---------------|----------|
63
+ | New Feature/Project | @PM → @SA → @UIUX → @DEV → @TESTER | /orchestrator |
64
+ ```
65
+
66
+ **Alternative:** Merge key content into `GEMINI.md` or `.agent/rules/global.md`
67
+
68
+ ---
69
+
70
+ ### 3. ❌ CONSIDER REMOVING: `compound.md`
71
+
72
+ **Reason:**
73
+ - **Already embedded in other workflows:** Both `/cycle` and `/emergency` already include compound learning steps (Step 7 in cycle, Step 7 in emergency)
74
+ - **Very short (1KB):** Not enough value as standalone workflow
75
+ - **Rarely invoked directly:** Users should use `/cycle` or `/emergency` which include compound learning
76
+
77
+ **Evidence from cycle.md:**
78
+ ```markdown
79
+ ### 7. Self-Learning (MANDATORY)
80
+ agentic-sdlc kb compound sync
81
+ agentic-sdlc learn --record-success "TASK-ID" --task-type "feature"
82
+ ```
83
+
84
+ **Alternative:** Keep as reference but mark as "called automatically by other workflows"
85
+
86
+ ---
87
+
88
+ ## 🟢 RECOMMEND ADDING (5 new workflows)
89
+
90
+ ### 1. ✅ ADD: `/review.md` - Code Review Workflow
91
+
92
+ **Rationale:**
93
+ - **Gap identified:** No dedicated workflow for PR reviews
94
+ - **Current state:** `@TESTER` does design verification but no code review workflow
95
+ - **High frequency task:** Code reviews happen daily
96
+
97
+ **Proposed content:**
98
+ - Quick PR review checklist
99
+ - Integration with GitHub PR comments
100
+ - Calling `@TESTER` and `@SECA` for specialized reviews
101
+ - Link to KB for similar code patterns
102
+
103
+ ---
104
+
105
+ ### 2. ✅ ADD: `/debug.md` - Debugging Workflow
106
+
107
+ **Rationale:**
108
+ - **Gap identified:** No workflow for systematic debugging
109
+ - **Different from /emergency:** Emergency is for production issues; debug is for local development
110
+ - **High complexity task:** Debugging often takes 3+ hours, needs structure
111
+
112
+ **Proposed content:**
113
+ - Systematic debug steps (reproduce → isolate → identify → fix → verify)
114
+ - Log analysis commands
115
+ - Common debugging tools
116
+ - KB search for similar bugs
117
+ - Integration with `/compound` for learning
118
+
119
+ ---
120
+
121
+ ### 3. ✅ ADD: `/refactor.md` - Refactoring Workflow
122
+
123
+ **Rationale:**
124
+ - **Gap identified:** No workflow for safe refactoring
125
+ - **High-risk activity:** Refactoring can break existing functionality
126
+ - **Quality focus:** Needs verification steps
127
+
128
+ **Proposed content:**
129
+ - Scope definition (what's being refactored)
130
+ - Test verification before/after
131
+ - Atomic commits
132
+ - Code review integration
133
+ - **Key:** Run tests before AND after refactoring
134
+
135
+ ---
136
+
137
+ ### 4. ✅ ADD: `/onboarding.md` - New Agent Onboarding
138
+
139
+ **Rationale:**
140
+ - **Gap identified:** No workflow for new AI agents joining project
141
+ - **Context needed:** New agents need to understand project structure
142
+ - **Accelerate productivity:** Quick ramp-up for new sessions
143
+
144
+ **Proposed content:**
145
+ - Project structure overview
146
+ - Key files to read first (`GEMINI.md`, `README.md`)
147
+ - Current sprint status
148
+ - KB search for relevant context
149
+ - Active issues/tasks
150
+
151
+ ---
152
+
153
+ ### 5. ✅ ADD: `/docs.md` - Documentation Workflow
154
+
155
+ **Rationale:**
156
+ - **Gap identified:** No dedicated documentation workflow
157
+ - **Current state:** `/cycle` mentions docs but no structure
158
+ - **Quality:** Documentation often neglected
159
+
160
+ **Proposed content:**
161
+ - Types of docs (API, user guide, KB entry)
162
+ - Template selection
163
+ - Review process
164
+ - Integration with `/release` for changelog
165
+
166
+ ---
167
+
168
+ ## 🟡 RECOMMEND IMPROVEMENTS (Existing workflows)
169
+
170
+ ### 1. 🔧 IMPROVE: `orchestrator.md`
171
+
172
+ **Current issues:**
173
+ - Very lightweight (2.2KB) for "Full SDLC Automation"
174
+ - Missing detailed phase transitions
175
+ - No artifact checklists per phase
176
+
177
+ **Recommendation:**
178
+ - Expand with detailed steps per phase
179
+ - Add artifact requirements per phase
180
+ - Add time estimates
181
+
182
+ ---
183
+
184
+ ### 2. 🔧 IMPROVE: `brain.md`
185
+
186
+ **Current issues:**
187
+ - References non-existent tools: `tools/brain/observer.py`, `tools/brain/judge.py`, etc.
188
+ - Only `tools/brain/brain_cli.py` exists
189
+
190
+ **Recommendation:**
191
+ - Update to match actual tool inventory
192
+ - Either create missing tools or remove references
193
+
194
+ ---
195
+
196
+ ### 3. 🔧 IMPROVE: `cycle.md`
197
+
198
+ **Current issues:**
199
+ - Team Communication step references tool that may not exist: `tools/communication/cli.py`
200
+ - Missing explicit test requirements
201
+
202
+ **Recommendation:**
203
+ - Verify tool existence
204
+ - Add explicit "run tests" step
205
+
206
+ ---
207
+
208
+ ## 📋 Summary
209
+
210
+ | Action | Count | Workflows |
211
+ |--------|-------|-----------|
212
+ | **Remove** | 2-3 | `preflight.md`, `route.md`, (optionally `compound.md`) |
213
+ | **Add** | 5 | `review.md`, `debug.md`, `refactor.md`, `onboarding.md`, `docs.md` |
214
+ | **Improve** | 3 | `orchestrator.md`, `brain.md`, `cycle.md` |
215
+
216
+ ---
217
+
218
+ ## 🎯 Priority Order
219
+
220
+ 1. **P0 - Critical:** Add `/review.md` and `/debug.md` (most common use cases)
221
+ 2. **P1 - Important:** Remove `preflight.md` and `route.md` (reduce confusion)
222
+ 3. **P2 - Nice to have:** Add `/refactor.md`, `/onboarding.md`, `/docs.md`
223
+ 4. **P3 - Backlog:** Improve `orchestrator.md`, fix tool references
224
+
225
+ ---
226
+
227
+ ## ❓ Open Questions for User
228
+
229
+ 1. **Compound workflow:** Keep as standalone or merge into cycle/emergency?
230
+ 2. **Tool references:** Should we create missing brain tools or remove references?
231
+ 3. **Priority:** Which new workflows should we implement first?
@@ -0,0 +1,26 @@
1
+ # Artifacts Directory
2
+
3
+ This folder stores IDE-generated artifacts that must be persisted for self-learning.
4
+
5
+ ## What Goes Here
6
+
7
+ | Artifact Type | Example |
8
+ |---------------|---------|
9
+ | Analysis reports | `2026-01-05-workflow-analysis.md` |
10
+ | Task summaries | `2026-01-05-task-refactoring.md` |
11
+ | Investigation reports | `2026-01-05-explore-auth.md` |
12
+ | Gap analysis | `2026-01-05-gap-analysis.md` |
13
+
14
+ ## Naming Convention
15
+
16
+ ```
17
+ [YYYY-MM-DD]-[task-name].md
18
+ ```
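As an illustration of the naming convention, the sketch below builds a file name from a task name and a date. The function name `artifact_filename` is hypothetical, and the slug rule (lowercase, runs of other characters collapsed to a single hyphen) is an assumption inferred from the examples in the table, not something the README states.

```python
import re
from datetime import date


def artifact_filename(task_name: str, day: date) -> str:
    """Return '[YYYY-MM-DD]-[task-name].md' for the given task and date."""
    # Assumed slug rule: lowercase, non-alphanumeric runs become one hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", task_name.lower()).strip("-")
    if not slug:
        raise ValueError("task name must contain at least one letter or digit")
    return f"{day.isoformat()}-{slug}.md"


print(artifact_filename("Workflow Analysis", date(2026, 1, 5)))
# 2026-01-05-workflow-analysis.md
```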
19
+
20
+ ## Sync to Neo4j
21
+
22
+ After adding artifacts:
23
+
24
+ ```bash
25
+ agentic-sdlc kb compound sync
26
+ ```
@@ -14,6 +14,7 @@ The following servers are integrated into the team roles. Ensure these are confi
14
14
  | **GitIngest** | Codebase snapshots | @ORCHESTRATOR, @REPORTER |
15
15
  | **Apidog** | API Testing & Design | @SA, @TESTER |
16
16
  | **Brave Search** | External Research | @PM, @PO |
17
+ | **Deep Search** | Technical Research (DDG + GitHub + StackOverflow) | @RESEARCH, @SA, @DEV |
17
18
  | **Firecrawl** | Web Scraper / Log research | @SECA, @DEVOPS |
18
19
  | **Playwright** | E2E / Browser Testing | @QA, @TESTER |
19
20
  | **Context7** | Architecture Analysis | @SA, @DEV |
@@ -0,0 +1,64 @@
1
+ # Evaluation Report: Microsoft AutoGen Integration
2
+
3
+ **Date:** 2026-01-05
4
+ **Status:** Draft
5
+ **Author:** @BRAIN (Research)
6
+
7
+ ## 1. Executive Summary
8
+ This report evaluates the applicability of **Microsoft AutoGen** (specifically v0.4+) to the **Agentic SDLC** project.
9
+
10
+ **Conclusion:** AutoGen represents a significant paradigm shift from the current `CLI + Workflow` architecture to a `Runtime + Event-Driven` architecture. While it offers powerful capabilities for autonomous multi-agent collaboration and state management, a full migration would require substantial refactoring.
11
+ **Recommendation:** We recommend a **Phased Adoption (Hybrid Approach)**, starting with a pilot implementation for the `@Orchestrator` role or a specific complex workflow (e.g., `/sprint`), while maintaining the existing stable CLI tools for atomic tasks.
12
+
13
+ ---
14
+
15
+ ## 2. Microsoft AutoGen Overview
16
+ AutoGen is a framework for building event-driven, distributed, agentic applications.
17
+ * **Core Unit:** `ConversableAgent` (an object that can send/receive messages).
18
+ * **Key Features:**
19
+ * **Multi-Agent Conversation:** Built-in patterns for Two-Agent Chat, Group Chat, and Hierarchical Chat.
20
+ * **Human-in-the-loop:** `UserProxyAgent` allows seamless human intervention.
21
+ * **Code Execution:** Native support for executing code (Docker/Local) within conversations.
22
+ * **Tool Use:** Agents can be equipped with functions (Tools) to interact with the environment.
23
+ * **Ecosystem:** v0.4 introduces an event-driven architecture, enabling distributed agents and better scalability.
24
+
25
+ ## 3. Current "Agentic SDLC" Architecture Analysis
26
+ The current system acts as a **Meta-Level Controller** using a "Brain" workflow.
27
+ * **Architecture:** `CLI-First`. Interactions are discrete tool calls driven by prompt engineering and static Markdown definitions (`.agent/skills/`, `.agent/workflows/`).
28
+ * **Execution Model:** "Run & Stop". Scripts in `tools/` run, perform an action, and exit. State is persisted in files (Markdown, JSON) or Neo4j.
29
+ * **Pros:** Simple, transparent, stateless (easy to debug), strongly typed workflows (Markdown).
30
+ * **Cons:** Limited "autonomy" between steps; rigid workflow adherence; limited inter-agent negotiation (requires user as relay).
31
+
32
+ ## 4. Gap Analysis
33
+
34
+ | Feature | Agentic SDLC (Current) | Microsoft AutoGen (Target) | Gap/Bridge |
35
+ | :--- | :--- | :--- | :--- |
36
+ | **Agent Definition** | Markdown Prompts + CLI Tools | Python Classes (`AssistantAgent`) | Requires wrapping Prompts into Class metadata. |
37
+ | **Communication** | Invisible (Prompt -> Tool -> Output) | Explicit Message Passing | Needs a message loop (runtime). |
38
+ | **Orchestration** | User / Static Workflow Files | Dynamic GroupChat Manager | AutoGen excels here. |
39
+ | **Human Inputs** | `notify_user` / Interrupts | `UserProxyAgent` | Direct replacement possible. |
40
+ | **Tools** | `tools/` directory (Python scripts) | `autogen.tools` | Existing tools can be registered easily. |
41
+
42
+ ## 5. Integration Scenarios
43
+
44
+ ### Scenario A: The "Super-Tool" (Recommended Pilot)
45
+ Treat AutoGen as a *Tool* within the existing SDLC.
46
+ * **Concept:** Create a new tool `tools/autogen/runner.py`.
47
+ * **Usage:** The current Brain invokes this tool to spin up a simpler sub-team (e.g., "Solver Team: Dev + Tester") to solve a specific hard problem autonomously.
48
+ * **Pros:** Low risk, high value for complex tasks.
49
+ * **Cons:** Context switching between "System Agent" and "AutoGen Sub-agents".
50
+
51
+ ### Scenario B: The "Brain Replacement" (Long Term)
52
+ Refactor the entire `bin/agentic-sdlc` CLI to wrap an AutoGen runtime.
53
+ * **Concept:** When the user types `/orchestrator`, it launches a persistent AutoGen `GroupChat` involving `@PM`, `@Dev`, etc.
54
+ * **Pros:** True agentic autonomy, dynamic planning.
55
+ * **Cons:** Complete rewrite of the Supervisor layer.
56
+
57
+ ## 6. Proposed Pilot: "The Auto-Coder"
58
+ We propose building a pilot module using AutoGen to handle the `/emergency` or `/debug` workflow.
59
+ **Objective:** Give an AutoGen "Debugger Agent" access to `grep`, `read_file`, and `run_test` tools and let it autonomously find root causes without constant user prompting.
60
+
61
+ ## 7. Next Steps
62
+ 1. **Prototype:** Create a `tools/experiment/autogen_pilot.py`.
63
+ 2. **Define:** Map the `@DEV` and `@TESTER` roles to AutoGen definitions.
64
+ 3. **Evaluate:** Measure if the AutoGen loop resolves bugs faster than the manual `/debug` workflow.
@@ -0,0 +1,109 @@
1
+ # 🧠 Brain Root Layer Analysis
2
+
3
+ **Date:** 2026-01-05
4
+ **Issue:** Brain components not working during agent chat
5
+
6
+ ---
7
+
8
+ ## Current State: All Tools EXIST ✅
9
+
10
+ | Component | Script | Lines | Status |
11
+ |-----------|--------|-------|--------|
12
+ | Observer | `tools/brain/observer.py` | 297 | ✅ Implemented |
13
+ | Judge | `tools/brain/judge.py` | 341 | ✅ Implemented |
14
+ | Learner | `tools/brain/learner.py` | 298 | ✅ Implemented |
15
+ | A/B Tester | `tools/brain/ab_tester.py` | 353 | ✅ Implemented |
16
+ | Model Optimizer | `tools/brain/model_optimizer.py` | 341 | ✅ Implemented |
17
+ | Self-Improver | `tools/brain/self_improver.py` | 372 | ✅ Implemented |
18
+
19
+ ---
20
+
21
+ ## 🔴 The Problem
22
+
23
+ The brain tools are **standalone CLI scripts** that must be called explicitly. They do NOT:
24
+ - Auto-run when agent starts a session
25
+ - Monitor chat in real-time
26
+ - Intercept agent actions
27
+ - Auto-trigger learning after tasks
28
+
29
+ **Current reality:**
30
+ ```
31
+ User Chat → Agent → Executes Task
32
+
33
+ (Brain tools NOT called)
34
+ ```
35
+
36
+ **Expected:**
37
+ ```
38
+ User Chat → Agent → Brain Observer watches
39
+ → Agent Executes Task
40
+ → Judge scores result
41
+ → Learner records patterns
42
+ → Self-Improver updates rules
43
+ ```
44
+
45
+ ---
46
+
47
+ ## 🟢 Solution Options
48
+
49
+ ### Option 1: IDE Integration (Best but Hard)
50
+ **How:** IDE hooks call brain tools before/after each agent action.
51
+ - **Cursor:** Custom MCP server
52
+ - **Windsurf:** Cascade plugin
53
+ - **Antigravity:** Extension hooks
54
+
55
+ **Pros:** Fully automatic, no agent changes needed
56
+ **Cons:** Requires IDE-specific development
57
+
58
+ ---
59
+
60
+ ### Option 2: Agent Protocol (Recommended)
61
+ **How:** Add mandatory steps to GEMINI.md that agents MUST follow:
62
+
63
+ ```markdown
64
+ ## Brain Protocol (MANDATORY)
65
+
66
+ Before EVERY task:
67
+ 1. Check observer status: `python tools/brain/observer.py --status`
68
+ 2. Get model recommendation: `python tools/brain/model_optimizer.py --recommend "[task]"`
69
+
70
+ After EVERY task:
71
+ 1. Score result: `python tools/brain/judge.py --score "[artifact]"`
72
+ 2. Trigger learning: `python tools/brain/learner.py --learn "[description]"`
73
+ 3. Record A/B if applicable
74
+ ```
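One way to make the before/after ordering of this protocol harder to skip is to wrap a task in a context manager. The sketch below is an assumption-laden illustration: the script paths and flags come from the protocol text above, while `brain_protocol` and its `run` parameter are hypothetical names, and the optional A/B recording step is left out.

```python
import subprocess
from contextlib import contextmanager
from typing import Callable


@contextmanager
def brain_protocol(task: str, artifact: str, run: Callable = subprocess.run):
    """Run the 'before' steps, yield to the task, then run the 'after' steps."""
    # Before EVERY task (steps 1 and 2 of the protocol).
    run(["python", "tools/brain/observer.py", "--status"], check=True)
    run(["python", "tools/brain/model_optimizer.py", "--recommend", task], check=True)
    yield
    # After EVERY task; reached only if the task body did not raise.
    run(["python", "tools/brain/judge.py", "--score", artifact], check=True)
    run(["python", "tools/brain/learner.py", "--learn", task], check=True)
```

Used as `with brain_protocol("fix login bug", "docs/walkthroughs/example.md"): ...`, the scoring and learning calls run only when the body completes without an exception. This is a design choice of the sketch; it still relies on the agent choosing to use the wrapper, which is the compliance risk noted for this option.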
75
+
76
+ **Pros:** Works now, no IDE changes
77
+ **Cons:** Relies on agent compliance
78
+
79
+ ---
80
+
81
+ ### Option 3: Batch/Scheduled (Easiest)
82
+ **How:** Run brain analysis periodically, not per-task.
83
+
84
+ ```bash
85
+ # Daily brain sync (add to workflow)
86
+ python tools/brain/observer.py --watch
87
+ python tools/brain/self_improver.py --analyze
88
+ python tools/brain/self_improver.py --plan
89
+ ```
90
+
91
+ **Pros:** Simple, low overhead
92
+ **Cons:** Not real-time
93
+
94
+ ---
95
+
96
+ ## 📋 Recommendation: Option 2 + Option 3
97
+
98
+ 1. **Update GEMINI.md** with mandatory brain protocol
99
+ 2. **Add brain check to `/onboarding`**
100
+ 3. **Add brain sync to `/housekeeping`**
101
+ 4. **Future:** Build MCP server for full integration
102
+
103
+ ---
104
+
105
+ ## ❓ Questions for User
106
+
107
+ 1. Implement Option 2 (add brain protocol to GEMINI.md)?
108
+ 2. Add brain hooks to existing workflows?
109
+ 3. Build MCP server for Cursor/Windsurf (future)?