claude-flow-novice 2.15.10 → 2.15.11
- package/claude-assets/agents/cfn-dev-team/CLAUDE.md +9 -81
- package/claude-assets/agents/cfn-dev-team/architecture/base-template-generator.md +4 -4
- package/claude-assets/agents/cfn-dev-team/architecture/planner.md +4 -4
- package/claude-assets/agents/cfn-dev-team/architecture/system-architect.md +5 -5
- package/claude-assets/agents/cfn-dev-team/coordinators/cfn-v3-coordinator.md +5 -1
- package/claude-assets/agents/cfn-dev-team/dev-ops/devops-engineer.md +4 -4
- package/claude-assets/agents/cfn-dev-team/dev-ops/docker-specialist.md +7 -37
- package/claude-assets/agents/cfn-dev-team/dev-ops/kubernetes-specialist.md +7 -37
- package/claude-assets/agents/cfn-dev-team/dev-ops/monitoring-specialist.md +4 -4
- package/claude-assets/agents/cfn-dev-team/developers/api-gateway-specialist.md +11 -42
- package/claude-assets/agents/cfn-dev-team/product-owners/accessibility-advocate-persona.md +4 -4
- package/claude-assets/agents/cfn-dev-team/product-owners/cto-agent.md +4 -4
- package/claude-assets/agents/cfn-dev-team/product-owners/power-user-persona.md +4 -4
- package/claude-assets/agents/cfn-dev-team/product-owners/product-owner.md +18 -22
- package/claude-assets/agents/cfn-dev-team/reviewers/code-reviewer.md +1 -1
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/code-quality-validator.md +1 -1
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/perf-analyzer.md +1 -1
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/performance-benchmarker.md +1 -1
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/security-specialist.md +1 -1
- package/claude-assets/agents/cfn-dev-team/testers/api-testing-specialist.md +7 -35
- package/claude-assets/agents/cfn-dev-team/testers/chaos-engineering-specialist.md +17 -36
- package/claude-assets/agents/cfn-dev-team/testers/contract-tester.md +10 -11
- package/claude-assets/agents/cfn-dev-team/testers/e2e/playwright-tester.md +5 -5
- package/claude-assets/agents/cfn-dev-team/testers/integration-tester.md +10 -12
- package/claude-assets/agents/cfn-dev-team/testers/interaction-tester.md +7 -36
- package/claude-assets/agents/cfn-dev-team/testers/load-testing-specialist.md +10 -12
- package/claude-assets/agents/cfn-dev-team/testers/mutation-testing-specialist.md +10 -12
- package/claude-assets/agents/cfn-dev-team/testers/playwright-tester.md +7 -37
- package/claude-assets/agents/cfn-dev-team/testers/tester.md +7 -33
- package/claude-assets/agents/cfn-dev-team/testers/unit/tdd-london-unit-swarm.md +5 -5
- package/claude-assets/agents/cfn-dev-team/testers/validation/validation-production-validator.md +4 -4
- package/claude-assets/agents/cfn-dev-team/testing/test-validation-agent.md +4 -4
- package/claude-assets/agents/cfn-dev-team/utility/agent-builder.md +16 -16
- package/claude-assets/agents/cfn-dev-team/utility/analyst.md +4 -4
- package/claude-assets/agents/cfn-dev-team/utility/code-booster.md +4 -4
- package/claude-assets/agents/cfn-dev-team/utility/context-curator.md +4 -4
- package/claude-assets/agents/cfn-dev-team/utility/epic-creator.md +7 -92
- package/claude-assets/agents/cfn-dev-team/utility/memory-leak-specialist.md +7 -100
- package/claude-assets/agents/cfn-dev-team/utility/researcher.md +4 -4
- package/claude-assets/agents/cfn-dev-team/utility/z-ai-specialist.md +7 -91
- package/dist/cli/agent-definition-parser.js +37 -4
- package/dist/cli/agent-definition-parser.js.map +1 -1
- package/dist/cli/agent-executor.js +32 -2
- package/dist/cli/agent-executor.js.map +1 -1
- package/dist/cli/config-manager.js +109 -91
- package/dist/cli/config-manager.js.map +1 -1
- package/package.json +1 -1
@@ -921,44 +921,15 @@ DO NOT report subjective confidence scores. Instead:
 - ❌ OLD: "Confidence: 0.92 - chaos tests comprehensive"
 - ✅ NEW: "Chaos Tests: 46/48 passed (95.8% pass rate) - 2 network partition scenarios need tuning"
 
-## Completion Protocol
+## Completion Protocol
 
-Complete your work and provide
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-
-
-```bash
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-
-2. **Validate Results**:
-- Coverage: ≥80%
-- Critical paths covered: X/Y
-- System resilience verified: Yes/No
-
-3. **Store Results**: Use test-results key (not confidence key)
-5. **Signal Completion**: Push to completion queue
-
-**Example Report:**
-```
-Chaos Engineering Test Summary:
-- Pod Failure Tests: 10/10 passed (100%)
-- Network Chaos Tests: 14/16 passed (87.5%)
-- Resource Stress Tests: 12/12 passed (100%)
-- Disaster Recovery Tests: 10/10 passed (100%)
-- Overall: 46/48 passed (95.8%)
-- Coverage: 86.2%
-- Critical Paths Tested: 12/12 (100%)
-- System Resilience Verified: Yes
-- Gate Status: PASS (≥95% overall, all critical paths validated)
-```
+**Note:** Coordination handled automatically by the system.
 
 ## Deliverables
 
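The pass-rate parsing removed in the hunk above recurs in several of the agent files in this release. For readers who want to run it outside the diff, here is a minimal standalone sketch. It assumes Mocha-style output ("N passing", "N failing"); `count_for` and `summarize_tests` are hypothetical helper names that do not appear in the package, and `awk` is swapped in for `grep -oP` because the latter needs a grep built with PCRE support.

```bash
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical, not part of claude-flow-novice.

count_for() {
  # Print the integer that directly precedes the word $2 in text $1
  # (first match only), or 0 when there is no match.
  local n
  n=$(printf '%s\n' "$1" | awk -v word="$2" '
    {
      for (i = 2; i <= NF; i++) {
        if ($i == word && $(i - 1) ~ /^[0-9]+$/) { print $(i - 1); exit }
      }
    }')
  printf '%s\n' "${n:-0}"
}

summarize_tests() {
  # $1: captured test-runner output. Emits the same JSON shape as the removed snippet.
  local pass fail total rate
  pass=$(count_for "$1" passing)
  fail=$(count_for "$1" failing)
  total=$((pass + fail))
  # Pass rate as a fraction with two decimals; guard against division by zero.
  rate=$(awk -v p="$pass" -v t="$total" \
    'BEGIN { if (t > 0) printf "%.2f", p / t; else printf "0.00" }')
  printf '{"passed": %d, "failed": %d, "pass_rate": %s}\n' "$pass" "$fail" "$rate"
}

# Example call with sample output matching the 46/48 figure quoted in the hunk.
summarize_tests "$(printf '  46 passing (2s)\n  2 failing\n')"
```

Taking only the first match keeps `PASS` and `FAIL` single integers even when the runner prints more than one summary line, a case in which the removed `grep -o` version would have passed a multi-line value to the arithmetic expansion.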
@@ -982,3 +953,13 @@ Chaos Engineering Test Summary:
 → **Failure Injection**: `.claude/skills/failure-injection/SKILL.md`
 → **Gameday Planning**: `.claude/skills/gameday-runbooks/SKILL.md`
 → **Resilience Testing**: `.claude/skills/resilience-validation/SKILL.md`
+
+## Completion Protocol
+
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
+
+**Note:** Coordination handled automatically by the system.
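The Completion Protocol added in the hunk above names four fields but does not prescribe a wire format. As an illustration only, the sketch below prints those fields as plain text and rejects a confidence value outside the stated 0.0-1.0 range. The function name `emit_completion`, the field labels, and the argument order are all assumptions made for this example; the package may structure its response differently.

```bash
#!/usr/bin/env bash
# Sketch only: emit_completion and its output layout are hypothetical.

emit_completion() {
  # Usage: emit_completion CONFIDENCE SUMMARY RECOMMENDATIONS [DELIVERABLE...]
  if [ "$#" -lt 3 ]; then
    echo "usage: emit_completion CONFIDENCE SUMMARY RECOMMENDATIONS [DELIVERABLE...]" >&2
    return 2
  fi
  local confidence="$1" summary="$2" recommendations="$3" item
  shift 3

  # Accept only a plain decimal no greater than 1 (the regex already excludes negatives).
  if ! awk -v c="$confidence" \
      'BEGIN { exit !(c ~ /^[0-9]*\.?[0-9]+$/ && c + 0 <= 1) }'; then
    echo "error: confidence must be between 0.0 and 1.0" >&2
    return 1
  fi

  printf 'Confidence: %s\n' "$confidence"
  printf 'Summary: %s\n' "$summary"
  printf 'Deliverables:\n'
  for item in "$@"; do
    printf -- '- %s\n' "$item"
  done
  printf 'Recommendations: %s\n' "$recommendations"
}

# Example call with made-up values.
emit_completion 0.85 "Added retry logic" "Tune timeouts" src/retry.js test/retry.test.js
```

Plain text is used here rather than JSON so that the sketch needs no string escaping; a JSON variant would have to escape quotes and newlines in the summary.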
@@ -93,17 +93,6 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
 # Report completion (no confidence score)
 ```
 
-### 4. Completion Protocol
-
-**DO NOT** report confidence scores. Report test metrics:
-```bash
-echo "Contract Test Results:"
-echo " Total Contracts: 15"
-echo " Verified: 15"
-echo " Failed: 0"
-echo " Pass Rate: 100%"
-```
-
 ---
 
 ## Role: Contract Testing Specialist (Loop 2 Validator)
@@ -716,3 +705,13 @@ pip install pact-python
 - Good: 0.85-0.95 (minor issues, non-breaking)
 - Poor: <0.85 (breaking changes, contract failures)
 - Critical: <0.5 (adapter bugs, major contract violations)
+
+## Completion Protocol
+
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
+
+**Note:** Coordination handled automatically by the system.
@@ -107,10 +107,10 @@ await sqlite.memoryAdapter.set(
 ## Completion Protocol
 
 Complete your work and provide a structured response with:
-- Confidence score (0.0-1.0) based on
-- Summary of
-- List of
--
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
@@ -81,18 +81,6 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
 # Report completion (no confidence score)
 ```
 
-### 4. Completion Protocol
-
-**DO NOT** report confidence scores. Report test metrics:
-```bash
-echo "Integration Test Results:"
-echo " Total Workflows: 12"
-echo " Passed: 11"
-echo " Failed: 1"
-echo " Pass Rate: 92%"
-echo " Critical Workflows: 100% (all passed)"
-```
-
 ---
 
 ## Role: Integration Testing Specialist (Loop 2 Validator)
@@ -815,3 +803,13 @@ afterAll(async () => {
 - ✅ **Race Conditions**: Concurrent update tests catch double-spend bugs
 - ✅ **Cascade Deletion**: Referential integrity tests catch orphaned records
 - ✅ **Cross-Service**: Orchestration tests catch integration breaks
+
+## Completion Protocol
+
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
+
+**Note:** Coordination handled automatically by the system.
@@ -196,42 +196,13 @@ DO NOT report subjective confidence scores. Instead:
 - ❌ OLD: "Confidence: 0.87 - interaction tests comprehensive"
 - ✅ NEW: "Interaction Tests: 52/55 passed (94.5% pass rate) - 3 accessibility edge cases found"
 
-## Completion Protocol
+## Completion Protocol
 
-Complete your work and provide
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-
-```bash
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-
-2. **Validate Results**:
-- Coverage: ≥80%
-- WCAG AA compliance: Yes/No
-- Critical flows covered: X/Y
-
-3. **Store Results**: Use test-results key (not confidence key)
-4. **Signal Completion**: Push to completion queue
-
-**Example Report:**
-```
-Interaction Testing Summary:
-- Integration Tests: 24/25 passed (96%)
-- E2E User Flow Tests: 18/20 passed (90%)
-- Accessibility Tests: 10/10 passed (100%)
-- Overall: 52/55 passed (94.5%)
-- Coverage: 87.2%
-- WCAG AA Compliance: Yes
-- Critical Flows: 8/8 (100%)
-- Gate Status: PASS (≥95% in 1/3 suites, accessibility validated)
-```
-
-**Note:** Coordination instructions and success criteria provided when spawned via CLI.
+**Note:** Coordination handled automatically by the system.
 
@@ -75,18 +75,6 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
 # Report completion (no confidence score)
 ```
 
-### 4. Completion Protocol
-
-**DO NOT** report confidence scores. Report performance metrics:
-```bash
-echo "Load Test Results:"
-echo " RPS: 1200 (threshold: ≥1000)"
-echo " P95 Latency: 450ms (threshold: <500ms)"
-echo " P99 Latency: 850ms (threshold: <1000ms)"
-echo " Error Rate: 0.1% (threshold: <1%)"
-echo " Pass Rate: 100%"
-```
-
 ---
 
 ## Core Responsibilities
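The removed load-test report above prints each metric next to its threshold but does not show how the thresholds would be checked. A minimal sketch of such a gate follows, using the four thresholds quoted in the hunk (RPS ≥1000, P95 <500ms, P99 <1000ms, error rate <1%). The function name `check_load_gate` and its argument order are hypothetical and are not taken from the package.

```bash
#!/usr/bin/env bash
# Sketch only: check_load_gate is a hypothetical helper, not package code.

check_load_gate() {
  # Usage: check_load_gate RPS P95_MS P99_MS ERROR_RATE_PERCENT
  # Prints PASS or FAIL and returns 0 only when every threshold is met.
  awk -v rps="$1" -v p95="$2" -v p99="$3" -v err="$4" 'BEGIN {
    ok = (rps >= 1000) && (p95 < 500) && (p99 < 1000) && (err < 1)
    print (ok ? "PASS" : "FAIL")
    exit !ok
  }'
}

# Example call with the sample figures from the removed report.
check_load_gate 1200 450 850 0.1
```

The comparison is done in `awk` because shell arithmetic is integer-only and the error rate is fractional.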
@@ -535,3 +523,13 @@ Servers Needed = RPS Required / RPS Per Server
 - Monitoring system resources during tests
 - Analyzing performance over extended duration
 - Validating error handling under stress
+
+## Completion Protocol
+
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
+
+**Note:** Coordination handled automatically by the system.
@@ -72,18 +72,6 @@ MUTATION_SCORE=$(echo "$TEST_OUTPUT" | grep -oP 'Mutation score: \K[0-9.]+')
 # Report completion (no confidence score)
 ```
 
-### 4. Completion Protocol
-
-**DO NOT** report confidence scores. Report mutation metrics:
-```bash
-echo "Mutation Testing Results:"
-echo " Mutants Generated: 145"
-echo " Mutants Killed: 132"
-echo " Mutants Survived: 13"
-echo " Mutation Score: 91%"
-echo " Threshold: 85% (PASS)"
-```
-
 ---
 
 ## Role: Mutation Testing Specialist (Loop 2 Validator)
@@ -666,3 +654,13 @@ echo "Consensus: $CONSENSUS"
 - Prevents false sense of security from high coverage
 - Identifies boundary condition bugs
 - Validates logical operator correctness
+
+## Completion Protocol
+
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
+
+**Note:** Coordination handled automatically by the system.
@@ -263,42 +263,12 @@ DO NOT report subjective confidence scores. Instead:
 - ❌ OLD: "Confidence: 0.89 - Playwright tests look solid"
 - ✅ NEW: "Playwright Tests: 35/37 passed (94.6% pass rate) - 2 cross-browser compatibility issues"
 
-## Completion Protocol
+## Completion Protocol
 
-Complete your work and provide
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-
-
-```bash
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-
-2. **Validate Results**:
-- Coverage: ≥80%
-- Cross-browser coverage: X/Y browsers
-- Critical flows covered: X/Y
-
-3. **Store Results**: Use test-results key (not confidence key)
-4. **Signal Completion**: Push to completion queue
-
-**Example Report:**
-```
-Playwright E2E Test Summary:
-- Authentication Tests: 8/8 passed (100%)
-- Critical User Flows: 15/16 passed (93.75%)
-- Cross-browser Tests: 12/13 passed (92.3%)
-- Overall: 35/37 passed (94.6%)
-- Coverage: 84.2%
-- Cross-browser: Chromium, Firefox, WebKit (1 issue)
-- Critical Flows Covered: 6/6 (100%)
-- Gate Status: PASS (≥95% in 1/3 suites, cross-browser validation in progress)
-```
-
-**Note:** Coordination instructions and success criteria provided when spawned via CLI.
+**Note:** Coordination handled automatically by the system.
@@ -216,38 +216,12 @@ DO NOT report subjective confidence scores. Instead:
 - ❌ OLD: "Confidence: 0.85 - tests look comprehensive"
 - ✅ NEW: "Tests: 125/130 passed (96.2% pass rate) - 5 edge case failures"
 
-## Completion Protocol
+## Completion Protocol
 
-Complete your work and provide
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-
-
-```bash
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-
-2. **Validate Results**:
-- Coverage: ≥80%
-
-3. **Store Results**: Use test-results key (not confidence key)
-4. **Signal Completion**: Push to completion queue
-
-**Example Report:**
-```
-Test Execution Summary:
-- Functional Tests: 45/47 passed (95.7%)
-- Integration Tests: 50/50 passed (100%)
-- E2E Tests: 30/33 passed (90.9%)
-- Overall: 125/130 passed (96.2%)
-- Coverage: 87.5%
-- Gate Status: PASS (≥95% overall coverage, ≥87% code coverage)
-```
-
-**Note:** Coordination instructions and success criteria provided when spawned via CLI.
+**Note:** Coordination handled automatically by the system.
@@ -21,9 +21,9 @@ validation_hooks:
 ## Completion Protocol
 
 Complete your work and provide a structured response with:
-- Confidence score (0.0-1.0) based on
-- Summary of
-- List of
--
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
package/claude-assets/agents/cfn-dev-team/testers/validation/validation-production-validator.md CHANGED
@@ -12,8 +12,8 @@ Remember: Production validation ensures system reliability, user safety, and org
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
@@ -44,11 +44,11 @@ Specialized reviewer agent focused on validating test results, analyzing coverag
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
 ## Validation Workflow
 
@@ -104,11 +104,11 @@ When creating agent templates, include this standardized completion section:
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 ```
 
 ---
@@ -209,11 +209,11 @@ You implement REST API endpoints following best practices and OpenAPI specificat
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
 ## Success Metrics
 - All endpoints tested
@@ -290,11 +290,11 @@ You review code for security vulnerabilities and compliance issues.
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
 ## Success Metrics
 - Zero critical vulnerabilities
@@ -364,11 +364,11 @@ npx claude-flow-novice agent-spawn frontend-dev --task-id "$TASK_ID"
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
 ## Success Metrics
 - Feature fully implemented
@@ -119,10 +119,10 @@ async function analyzeWithFallback(system) {
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
 
 Remember: Analysis is not about criticism, but about providing a clear path to system improvement through data-driven insights.
@@ -60,8 +60,8 @@ Specialized performance optimization expert leveraging WASM acceleration and adv
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
@@ -49,8 +49,8 @@ Intelligent context merging and prioritization
 
 Complete your work and provide a structured response with:
 - Confidence score (0.0-1.0) based on work quality
-- Summary of
-- List of
-- Any recommendations
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.
@@ -17,67 +17,6 @@ acl_level: 3
 
 You transform high-level product requirements into structured epic configuration JSON files suitable for CFN Loop execution.
 
-## Success Criteria Awareness (REQUIRED - Phase 2 TDD)
-
-### 1. Read Success Criteria
-Before starting work, read test requirements from environment:
-```bash
-if [[ -n "${AGENT_SUCCESS_CRITERIA:-}" ]]; then
-# Validate JSON before parsing
-if ! echo "$AGENT_SUCCESS_CRITERIA" | jq -e '.' >/dev/null 2>&1; then
-echo "❌ Invalid JSON in AGENT_SUCCESS_CRITERIA" >&2
-exit 1
-fi
-
-CRITERIA=$(echo "$AGENT_SUCCESS_CRITERIA" | jq -r '.')
-TEST_SUITES=$(echo "$CRITERIA" | jq -r '.test_suites[] // empty')
-
-if [[ -n "$TEST_SUITES" ]]; then
-echo "📋 Success Criteria Loaded:"
-echo "$TEST_SUITES" | jq -r '.name // "unnamed"'
-fi
-fi
-```
-
-### 2. TDD Protocol (MANDATORY)
-
-**Write Tests First (15-20 min):**
-- Extract test requirements from success criteria
-- Write failing tests for epic decomposition and JSON configuration generation
-- Ensure test coverage ≥80%
-
-**Implement (30-40 min):**
-- Write minimum code to pass tests
-- Run tests continuously with monitoring
-- Refactor for quality
-
-**Validate (5 min):**
-- Run full test suite from success criteria
-- Verify pass rate meets threshold (Standard: ≥95%)
-- Check coverage metrics
-
-### 3. Report Test Results (NOT Confidence)
-
-**Old (Deprecated):**
-```bash
-# report-completion.sh --confidence 0.85
-```
-
-**New (Required):**
-```bash
-# Execute tests and capture output
-TEST_OUTPUT=$(npm test 2>&1)
-
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-
 ## Core Identity
 
 You embody three complementary personas:
@@ -186,36 +125,12 @@ Store epic configuration for coordinator reference:
 - Clear scope boundaries
 - Mode-appropriate thresholds
 
-##
+## Completion Protocol
 
-Complete your
-
-
-
-
-# Parse natively (no external dependencies)
-PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
-FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
-TOTAL=$((PASS + FAIL))
-RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
-
-# Return results (Main Chat receives automatically in Task Mode)
-echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
-```
-3. **Verify Coverage**: ≥80%
-4. **Store in Redis**: Use test-results key (not confidence key)
-5. **Signal Completion**: Push to completion queue
-
-**Example Report:**
-```
-Test Execution Summary:
-- Epic Analysis: 10/10 passed (100%)
-- Phase Decomposition: 12/12 passed (100%)
-- Agent Selection: 8/8 passed (100%)
-- JSON Configuration: 9/9 passed (100%)
-- Overall: 39/39 passed (100%)
-- Coverage: 91.2%
-- Gate Status: PASS (≥95% in all suites)
-```
+Complete your work and provide a structured response with:
+- Confidence score (0.0-1.0) based on work quality
+- Summary of work completed
+- List of deliverables created
+- Any recommendations or findings
 
-**Note:** Coordination
+**Note:** Coordination handled automatically by the system.