claude-flow-novice 2.15.8 → 2.15.9
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/skills/cfn-loop-orchestration/helpers/consensus-ts.sh +104 -0
- package/.claude/skills/cfn-loop-orchestration/helpers/deliverable-verifier-ts.sh +123 -0
- package/.claude/skills/cfn-loop-orchestration/helpers/iteration-manager-ts.sh +89 -0
- package/.claude/skills/cfn-loop-orchestration/helpers/timeout-calculator-ts.sh +47 -0
- package/.claude/skills/cfn-redis-coordination/report-completion.sh +55 -10
- package/.claude/skills/cfn-redis-coordination/store-context.sh +31 -1
- package/claude-assets/agents/cfn-dev-team/coordinators/cfn-frontend-coordinator.md +6 -1
- package/claude-assets/agents/cfn-dev-team/coordinators/consensus-builder.md +6 -1
- package/claude-assets/agents/cfn-dev-team/coordinators/handoff-coordinator.md +6 -1
- package/claude-assets/agents/cfn-dev-team/coordinators/multi-sprint-coordinator.md +6 -1
- package/claude-assets/agents/cfn-dev-team/dev-ops/docker-specialist.md +18 -8
- package/claude-assets/agents/cfn-dev-team/dev-ops/kubernetes-specialist.md +18 -8
- package/claude-assets/agents/cfn-dev-team/developers/api-gateway-specialist.md +18 -8
- package/claude-assets/agents/cfn-dev-team/developers/backend-developer.md +17 -7
- package/claude-assets/agents/cfn-dev-team/developers/graphql-specialist.md +17 -8
- package/claude-assets/agents/cfn-dev-team/developers/rust-developer.md +18 -8
- package/claude-assets/agents/cfn-dev-team/reviewers/code-reviewer.md +9 -5
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/code-quality-validator.md +13 -6
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/perf-analyzer.md +13 -6
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/performance-benchmarker.md +13 -6
- package/claude-assets/agents/cfn-dev-team/reviewers/quality/security-specialist.md +15 -5
- package/claude-assets/agents/cfn-dev-team/testers/api-testing-specialist.md +9 -5
- package/claude-assets/agents/cfn-dev-team/testers/chaos-engineering-specialist.md +8 -4
- package/claude-assets/agents/cfn-dev-team/testers/interaction-tester.md +16 -13
- package/claude-assets/agents/cfn-dev-team/testers/playwright-tester.md +9 -5
- package/claude-assets/agents/cfn-dev-team/testers/tester.md +9 -5
- package/claude-assets/skills/cfn-loop-orchestration/helpers/consensus-ts.sh +104 -0
- package/claude-assets/skills/cfn-loop-orchestration/helpers/deliverable-verifier-ts.sh +123 -0
- package/claude-assets/skills/cfn-loop-orchestration/helpers/iteration-manager-ts.sh +89 -0
- package/claude-assets/skills/cfn-loop-orchestration/helpers/timeout-calculator-ts.sh +47 -0
- package/claude-assets/skills/cfn-redis-coordination/report-completion.sh +55 -10
- package/claude-assets/skills/cfn-redis-coordination/store-context.sh +31 -1
- package/package.json +1 -1
|
@@ -598,16 +598,26 @@ Before reporting high confidence:
|
|
|
598
598
|
Complete your work and provide test-based validation:
|
|
599
599
|
|
|
600
600
|
1. **Execute Tests**: Run all test suites from success criteria
|
|
601
|
-
|
|
602
|
-
|
|
603
|
-
|
|
604
|
-
|
|
605
|
-
|
|
606
|
-
|
|
607
|
-
|
|
608
|
-
|
|
601
|
+
```bash
|
|
602
|
+
# Parse natively (no external dependencies)
|
|
603
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
604
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
605
|
+
TOTAL=$((PASS + FAIL))
|
|
606
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
607
|
+
|
|
608
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
609
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
610
|
+
```
|
|
611
|
+
|
|
612
|
+
2. **Parse Results**: Extract test counts and calculate pass rate
|
|
613
|
+
|
|
614
|
+
3. **Coverage Check**: Ensure coverage meets minimum thresholds
|
|
615
|
+
- Manifest tests: ≥95%
|
|
616
|
+
- Deployment tests: ≥90%
|
|
609
617
|
- Coverage: ≥80%
|
|
618
|
+
|
|
610
619
|
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
620
|
+
|
|
611
621
|
5. **Signal Completion**: Push to completion queue
|
|
612
622
|
|
|
613
623
|
**Example Report:**
|
|
@@ -963,16 +963,26 @@ Before reporting high confidence:
|
|
|
963
963
|
Complete your work and provide test-based validation:
|
|
964
964
|
|
|
965
965
|
1. **Execute Tests**: Run all test suites from success criteria
|
|
966
|
-
|
|
967
|
-
|
|
968
|
-
|
|
969
|
-
|
|
970
|
-
|
|
971
|
-
|
|
972
|
-
|
|
973
|
-
|
|
966
|
+
```bash
|
|
967
|
+
# Parse natively (no external dependencies)
|
|
968
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
969
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
970
|
+
TOTAL=$((PASS + FAIL))
|
|
971
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
972
|
+
|
|
973
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
974
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
975
|
+
```
|
|
976
|
+
|
|
977
|
+
2. **Parse Results**: Extract test counts and calculate pass rate
|
|
978
|
+
|
|
979
|
+
3. **Coverage Check**: Ensure coverage meets minimum thresholds
|
|
980
|
+
- Core tests: ≥95%
|
|
981
|
+
- Configuration tests: ≥90%
|
|
974
982
|
- Coverage: ≥80%
|
|
983
|
+
|
|
975
984
|
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
985
|
+
|
|
976
986
|
5. **Signal Completion**: Push to completion queue
|
|
977
987
|
|
|
978
988
|
**Example Report:**
|
|
@@ -164,16 +164,26 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
164
164
|
Complete your work and provide test-based validation:
|
|
165
165
|
|
|
166
166
|
1. **Execute Tests**: Run all test suites from success criteria
|
|
167
|
-
|
|
168
|
-
|
|
169
|
-
|
|
170
|
-
|
|
171
|
-
|
|
167
|
+
```bash
|
|
168
|
+
# Parse natively (no external dependencies)
|
|
169
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
170
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
171
|
+
TOTAL=$((PASS + FAIL))
|
|
172
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
173
|
+
|
|
174
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
175
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
176
|
+
```
|
|
177
|
+
|
|
178
|
+
2. **Parse Results**: Extract test counts and calculate pass rate
|
|
172
179
|
|
|
173
|
-
|
|
174
|
-
|
|
180
|
+
3. **Coverage Check**: Ensure coverage meets minimum thresholds
|
|
181
|
+
- Unit tests: ≥95%
|
|
182
|
+
- Integration tests: ≥90%
|
|
175
183
|
- Coverage: ≥80%
|
|
184
|
+
|
|
176
185
|
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
186
|
+
|
|
177
187
|
5. **Signal Completion**: Push to completion queue
|
|
178
188
|
|
|
179
189
|
**Example Report:**
|
|
@@ -663,16 +663,25 @@ Validate work with tests instead of confidence scores:
|
|
|
663
663
|
- Query complexity tests
|
|
664
664
|
- Authentication/authorization tests
|
|
665
665
|
|
|
666
|
-
|
|
667
|
-
|
|
668
|
-
|
|
669
|
-
|
|
670
|
-
|
|
671
|
-
|
|
672
|
-
|
|
673
|
-
|
|
666
|
+
2. **Parse Test Results**: Extract test counts and calculate pass rate
|
|
667
|
+
```bash
|
|
668
|
+
# Parse natively (no external dependencies)
|
|
669
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
670
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
671
|
+
TOTAL=$((PASS + FAIL))
|
|
672
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
673
|
+
|
|
674
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
675
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
676
|
+
```
|
|
677
|
+
|
|
678
|
+
3. **Coverage Check**: Ensure coverage meets minimum thresholds
|
|
679
|
+
- Schema tests: ≥95%
|
|
680
|
+
- Resolver tests: ≥90%
|
|
674
681
|
- Coverage: ≥80%
|
|
682
|
+
|
|
675
683
|
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
684
|
+
|
|
676
685
|
5. **Signal Completion**: Push to completion queue
|
|
677
686
|
|
|
678
687
|
## Completion Protocol (Test-Driven)
|
|
@@ -187,16 +187,26 @@ Remember: Prioritize safety, performance, and clear, concise implementation.
|
|
|
187
187
|
Complete your work and provide test-based validation:
|
|
188
188
|
|
|
189
189
|
1. **Execute Tests**: Run all test suites from success criteria
|
|
190
|
-
|
|
191
|
-
|
|
192
|
-
|
|
193
|
-
|
|
194
|
-
|
|
195
|
-
|
|
196
|
-
|
|
197
|
-
|
|
190
|
+
```bash
|
|
191
|
+
# Parse natively (no external dependencies)
|
|
192
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
193
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
194
|
+
TOTAL=$((PASS + FAIL))
|
|
195
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
196
|
+
|
|
197
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
198
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
199
|
+
```
|
|
200
|
+
|
|
201
|
+
2. **Parse Results**: Extract test counts and calculate pass rate
|
|
202
|
+
|
|
203
|
+
3. **Coverage Check**: Ensure coverage meets minimum thresholds
|
|
204
|
+
- Unit tests: ≥95%
|
|
205
|
+
- Integration tests: ≥90%
|
|
198
206
|
- Coverage: ≥80%
|
|
207
|
+
|
|
199
208
|
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
209
|
+
|
|
200
210
|
5. **Signal Completion**: Push to completion queue
|
|
201
211
|
|
|
202
212
|
**Example Report:**
|
|
@@ -52,8 +52,7 @@ fi
|
|
|
52
52
|
|
|
53
53
|
### 3. Report Test Results (NOT Confidence)
|
|
54
54
|
|
|
55
|
-
**Old (Deprecated):**
|
|
56
|
-
```bash
|
|
55
|
+
**Old (Deprecated):** Not used
|
|
57
56
|
|
|
58
57
|
**New (Required):**
|
|
59
58
|
```bash
|
|
@@ -65,7 +64,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
|
65
64
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
66
65
|
TOTAL=$((PASS + FAIL))
|
|
67
66
|
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
68
|
-
|
|
69
67
|
```
|
|
70
68
|
|
|
71
69
|
## MCP Tool Access (Task Mode)
|
|
@@ -287,6 +285,8 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
287
285
|
Complete your work and provide test-based validation:
|
|
288
286
|
|
|
289
287
|
1. **Execute Tests**: Run all test suites from success criteria
|
|
288
|
+
|
|
289
|
+
```bash
|
|
290
290
|
# Parse natively (no external dependencies)
|
|
291
291
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
292
292
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -295,9 +295,13 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
295
295
|
|
|
296
296
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
297
297
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
298
|
+
```
|
|
299
|
+
|
|
300
|
+
2. **Validate Results**:
|
|
298
301
|
- Coverage: ≥80%
|
|
299
|
-
|
|
300
|
-
|
|
302
|
+
|
|
303
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
304
|
+
4. **Signal Completion**: Push to completion queue
|
|
301
305
|
|
|
302
306
|
**Example Report:**
|
|
303
307
|
```
|
|
@@ -64,8 +64,7 @@ fi
|
|
|
64
64
|
|
|
65
65
|
### 3. Report Test Results (NOT Confidence)
|
|
66
66
|
|
|
67
|
-
**Old (Deprecated):**
|
|
68
|
-
```bash
|
|
67
|
+
**Old (Deprecated):** Not used
|
|
69
68
|
|
|
70
69
|
**New (Required):**
|
|
71
70
|
```bash
|
|
@@ -80,7 +79,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
80
79
|
|
|
81
80
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
82
81
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
83
|
-
|
|
84
82
|
```
|
|
85
83
|
|
|
86
84
|
## Post-Edit Validation
|
|
@@ -206,6 +204,8 @@ Remember: Code analysis reveals improvement opportunities. Focus on actionable,
|
|
|
206
204
|
DO NOT report subjective confidence scores. Instead:
|
|
207
205
|
|
|
208
206
|
1. **Execute Tests**: Run test suite defined in success criteria
|
|
207
|
+
|
|
208
|
+
```bash
|
|
209
209
|
# Parse natively (no external dependencies)
|
|
210
210
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
211
211
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -214,8 +214,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
214
214
|
|
|
215
215
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
216
216
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
217
|
+
```
|
|
217
218
|
|
|
218
|
-
**Validation:**
|
|
219
|
+
**Validation Examples:**
|
|
219
220
|
- ❌ OLD: "Confidence: 0.83 - quality metrics look solid"
|
|
220
221
|
- ✅ NEW: "Quality Tests: 38/40 passed (95% pass rate) - 2 refactoring validation scenarios need review"
|
|
221
222
|
|
|
@@ -224,6 +225,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
|
224
225
|
Complete your work and provide test-based validation:
|
|
225
226
|
|
|
226
227
|
1. **Execute Tests**: Run all code quality test suites from success criteria
|
|
228
|
+
|
|
229
|
+
```bash
|
|
227
230
|
# Parse natively (no external dependencies)
|
|
228
231
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
229
232
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -232,11 +235,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
232
235
|
|
|
233
236
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
234
237
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
238
|
+
```
|
|
239
|
+
|
|
240
|
+
2. **Validate Results**:
|
|
235
241
|
- Coverage: ≥80%
|
|
236
242
|
- Code smells detected: N
|
|
237
243
|
- Technical debt score: X/10
|
|
238
|
-
|
|
239
|
-
|
|
244
|
+
|
|
245
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
246
|
+
4. **Signal Completion**: Push to completion queue
|
|
240
247
|
|
|
241
248
|
**Example Report:**
|
|
242
249
|
```
|
|
@@ -54,8 +54,7 @@ fi
|
|
|
54
54
|
|
|
55
55
|
### 3. Report Test Results (NOT Confidence)
|
|
56
56
|
|
|
57
|
-
**Old (Deprecated):**
|
|
58
|
-
```bash
|
|
57
|
+
**Old (Deprecated):** Not used
|
|
59
58
|
|
|
60
59
|
**New (Required):**
|
|
61
60
|
```bash
|
|
@@ -70,7 +69,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
70
69
|
|
|
71
70
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
72
71
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
73
|
-
|
|
74
72
|
```
|
|
75
73
|
|
|
76
74
|
## Mandatory Post-Edit Validation
|
|
@@ -237,6 +235,8 @@ Remember: Optimize for highest impact with reasonable effort. Focus on critical
|
|
|
237
235
|
DO NOT report subjective confidence scores. Instead:
|
|
238
236
|
|
|
239
237
|
1. **Execute Tests**: Run test suite defined in success criteria
|
|
238
|
+
|
|
239
|
+
```bash
|
|
240
240
|
# Parse natively (no external dependencies)
|
|
241
241
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
242
242
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -245,8 +245,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
245
245
|
|
|
246
246
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
247
247
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
248
|
+
```
|
|
248
249
|
|
|
249
|
-
**Validation:**
|
|
250
|
+
**Validation Examples:**
|
|
250
251
|
- ❌ OLD: "Confidence: 0.86 - analysis is thorough"
|
|
251
252
|
- ✅ NEW: "Analysis Tests: 42/45 passed (93.3% pass rate) - 3 optimization scenarios need validation"
|
|
252
253
|
|
|
@@ -255,6 +256,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
|
255
256
|
Complete your work and provide test-based validation:
|
|
256
257
|
|
|
257
258
|
1. **Execute Tests**: Run all performance analysis test suites from success criteria
|
|
259
|
+
|
|
260
|
+
```bash
|
|
258
261
|
# Parse natively (no external dependencies)
|
|
259
262
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
260
263
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -263,11 +266,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
263
266
|
|
|
264
267
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
265
268
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
269
|
+
```
|
|
270
|
+
|
|
271
|
+
2. **Validate Results**:
|
|
266
272
|
- Coverage: ≥80%
|
|
267
273
|
- Bottlenecks identified: N
|
|
268
274
|
- Expected improvement: X%
|
|
269
|
-
|
|
270
|
-
|
|
275
|
+
|
|
276
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
277
|
+
4. **Signal Completion**: Push to completion queue
|
|
271
278
|
|
|
272
279
|
**Example Report:**
|
|
273
280
|
```
|
|
@@ -51,8 +51,7 @@ fi
|
|
|
51
51
|
|
|
52
52
|
### 3. Report Test Results (NOT Confidence)
|
|
53
53
|
|
|
54
|
-
**Old (Deprecated):**
|
|
55
|
-
```bash
|
|
54
|
+
**Old (Deprecated):** Not used
|
|
56
55
|
|
|
57
56
|
**New (Required):**
|
|
58
57
|
```bash
|
|
@@ -67,7 +66,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
67
66
|
|
|
68
67
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
69
68
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
70
|
-
|
|
71
69
|
```
|
|
72
70
|
|
|
73
71
|
## 🚨 Mandatory Post-Edit Validation
|
|
@@ -83,6 +81,8 @@ Refer to [.claude/templates/post-edit-validation.md](../templates/post-edit-vali
|
|
|
83
81
|
DO NOT report subjective confidence scores. Instead:
|
|
84
82
|
|
|
85
83
|
1. **Execute Tests**: Run test suite defined in success criteria
|
|
84
|
+
|
|
85
|
+
```bash
|
|
86
86
|
# Parse natively (no external dependencies)
|
|
87
87
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
88
88
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -91,8 +91,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
91
91
|
|
|
92
92
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
93
93
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
94
|
+
```
|
|
94
95
|
|
|
95
|
-
**Validation:**
|
|
96
|
+
**Validation Examples:**
|
|
96
97
|
- ❌ OLD: "Confidence: 0.88 - benchmarks look good"
|
|
97
98
|
- ✅ NEW: "Benchmark Tests: 28/30 passed (93.3% pass rate) - 2 latency outliers detected"
|
|
98
99
|
|
|
@@ -101,6 +102,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
|
101
102
|
Complete your work and provide test-based validation:
|
|
102
103
|
|
|
103
104
|
1. **Execute Tests**: Run all benchmark test suites from success criteria
|
|
105
|
+
|
|
106
|
+
```bash
|
|
104
107
|
# Parse natively (no external dependencies)
|
|
105
108
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
106
109
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -109,10 +112,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
109
112
|
|
|
110
113
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
111
114
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
115
|
+
```
|
|
116
|
+
|
|
117
|
+
2. **Validate Results**:
|
|
112
118
|
- Coverage: ≥80%
|
|
113
119
|
- Performance baseline established: Yes/No
|
|
114
|
-
|
|
115
|
-
|
|
120
|
+
|
|
121
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
122
|
+
4. **Signal Completion**: Push to completion queue
|
|
116
123
|
|
|
117
124
|
**Example Report:**
|
|
118
125
|
```
|
|
@@ -49,8 +49,7 @@ fi
|
|
|
49
49
|
|
|
50
50
|
### 3. Report Test Results (NOT Confidence)
|
|
51
51
|
|
|
52
|
-
**Old (Deprecated):**
|
|
53
|
-
```bash
|
|
52
|
+
**Old (Deprecated):** Not used
|
|
54
53
|
|
|
55
54
|
**New (Required):**
|
|
56
55
|
```bash
|
|
@@ -65,7 +64,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
65
64
|
|
|
66
65
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
67
66
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
68
|
-
|
|
69
67
|
```
|
|
70
68
|
|
|
71
69
|
## 🚨 MANDATORY DOCUMENTATION REDACTION PROTOCOL
|
|
@@ -165,6 +163,9 @@ Security analysis results are captured and processed through structured reportin
|
|
|
165
163
|
DO NOT report subjective confidence scores. Instead:
|
|
166
164
|
|
|
167
165
|
1. **Execute Tests**: Run test suite defined in success criteria
|
|
166
|
+
2. **Parse Results**: Use native bash parsing (no external dependencies)
|
|
167
|
+
|
|
168
|
+
```bash
|
|
168
169
|
# Parse natively (no external dependencies)
|
|
169
170
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
170
171
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -173,6 +174,8 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
173
174
|
|
|
174
175
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
175
176
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
177
|
+
```
|
|
178
|
+
3. **Pass Rate**: Your security analysis passes the gate if tests ≥ threshold (95% standard mode)
|
|
176
179
|
|
|
177
180
|
**Validation:**
|
|
178
181
|
- ❌ OLD: "Confidence: 0.90 - security looks solid"
|
|
@@ -183,6 +186,9 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
|
183
186
|
Complete your work and provide test-based validation:
|
|
184
187
|
|
|
185
188
|
1. **Execute Tests**: Run all security test suites from success criteria
|
|
189
|
+
2. **Parse Results**:
|
|
190
|
+
|
|
191
|
+
```bash
|
|
186
192
|
# Parse natively (no external dependencies)
|
|
187
193
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
188
194
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -191,10 +197,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
191
197
|
|
|
192
198
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
193
199
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
200
|
+
```
|
|
201
|
+
3. **Report Metrics**:
|
|
202
|
+
- Total tests: X
|
|
203
|
+
- Passed: Y
|
|
204
|
+
- Failed: Z
|
|
205
|
+
- Pass rate: Y/X (e.g., 0.95)
|
|
194
206
|
- Coverage: ≥80%
|
|
195
207
|
- Critical vulnerabilities found: N
|
|
196
|
-
4. **Store in Redis**: Use test-results key (not confidence key)
|
|
197
|
-
5. **Signal Completion**: Push to completion queue
|
|
198
208
|
|
|
199
209
|
**Example Report:**
|
|
200
210
|
```
|
|
@@ -62,8 +62,7 @@ fi
|
|
|
62
62
|
|
|
63
63
|
### 3. Report Test Results (NOT Confidence)
|
|
64
64
|
|
|
65
|
-
**Old (Deprecated):**
|
|
66
|
-
```bash
|
|
65
|
+
**Old (Deprecated):** Not used
|
|
67
66
|
|
|
68
67
|
**New (Required):**
|
|
69
68
|
```bash
|
|
@@ -75,7 +74,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
|
75
74
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
76
75
|
TOTAL=$((PASS + FAIL))
|
|
77
76
|
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
78
|
-
|
|
79
77
|
```
|
|
80
78
|
|
|
81
79
|
## Core Responsibilities
|
|
@@ -767,6 +765,8 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
767
765
|
Complete your work and provide test-based validation:
|
|
768
766
|
|
|
769
767
|
1. **Execute Tests**: Run all API test suites from success criteria
|
|
768
|
+
|
|
769
|
+
```bash
|
|
770
770
|
# Parse natively (no external dependencies)
|
|
771
771
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
772
772
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -775,11 +775,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
775
775
|
|
|
776
776
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
777
777
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
778
|
+
```
|
|
779
|
+
|
|
780
|
+
2. **Validate Results**:
|
|
778
781
|
- Coverage: ≥80%
|
|
779
782
|
- Contract tests: X/Y passed
|
|
780
783
|
- Security tests: X/Y passed
|
|
781
|
-
|
|
782
|
-
|
|
784
|
+
|
|
785
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
786
|
+
4. **Signal Completion**: Push to completion queue
|
|
783
787
|
|
|
784
788
|
**Example Report:**
|
|
785
789
|
```text
|
|
@@ -60,8 +60,7 @@ fi
|
|
|
60
60
|
|
|
61
61
|
### 3. Report Test Results (NOT Confidence)
|
|
62
62
|
|
|
63
|
-
**Old (Deprecated):**
|
|
64
|
-
```bash
|
|
63
|
+
**Old (Deprecated):** Not used
|
|
65
64
|
|
|
66
65
|
**New (Required):**
|
|
67
66
|
```bash
|
|
@@ -73,7 +72,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
|
73
72
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
74
73
|
TOTAL=$((PASS + FAIL))
|
|
75
74
|
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
76
|
-
|
|
77
75
|
```
|
|
78
76
|
|
|
79
77
|
## Core Responsibilities
|
|
@@ -928,6 +926,8 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
928
926
|
Complete your work and provide test-based validation:
|
|
929
927
|
|
|
930
928
|
1. **Execute Tests**: Run all chaos experiment test suites from success criteria
|
|
929
|
+
|
|
930
|
+
```bash
|
|
931
931
|
# Parse natively (no external dependencies)
|
|
932
932
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
933
933
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -936,10 +936,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
936
936
|
|
|
937
937
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
938
938
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
939
|
+
```
|
|
940
|
+
|
|
941
|
+
2. **Validate Results**:
|
|
939
942
|
- Coverage: ≥80%
|
|
940
943
|
- Critical paths covered: X/Y
|
|
941
944
|
- System resilience verified: Yes/No
|
|
942
|
-
|
|
945
|
+
|
|
946
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
943
947
|
5. **Signal Completion**: Push to completion queue
|
|
944
948
|
|
|
945
949
|
**Example Report:**
|
|
@@ -56,8 +56,7 @@ fi
|
|
|
56
56
|
|
|
57
57
|
### 3. Report Test Results (NOT Confidence)
|
|
58
58
|
|
|
59
|
-
**Old (Deprecated):**
|
|
60
|
-
```bash
|
|
59
|
+
**Old (Deprecated):** Not used
|
|
61
60
|
|
|
62
61
|
**New (Required):**
|
|
63
62
|
```bash
|
|
@@ -69,7 +68,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
|
69
68
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
70
69
|
TOTAL=$((PASS + FAIL))
|
|
71
70
|
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
72
|
-
|
|
73
71
|
```
|
|
74
72
|
|
|
75
73
|
## MCP Tool Access (Task Mode)
|
|
@@ -203,19 +201,24 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
203
201
|
Complete your work and provide test-based validation:
|
|
204
202
|
|
|
205
203
|
1. **Execute Tests**: Run all interaction test suites from success criteria
|
|
206
|
-
|
|
207
|
-
|
|
208
|
-
|
|
209
|
-
|
|
210
|
-
|
|
211
|
-
|
|
212
|
-
|
|
213
|
-
|
|
204
|
+
```bash
|
|
205
|
+
# Parse natively (no external dependencies)
|
|
206
|
+
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
207
|
+
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
208
|
+
TOTAL=$((PASS + FAIL))
|
|
209
|
+
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
210
|
+
|
|
211
|
+
# Return results (Main Chat receives automatically in Task Mode)
|
|
212
|
+
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
213
|
+
```
|
|
214
|
+
|
|
215
|
+
2. **Validate Results**:
|
|
214
216
|
- Coverage: ≥80%
|
|
215
217
|
- WCAG AA compliance: Yes/No
|
|
216
218
|
- Critical flows covered: X/Y
|
|
217
|
-
|
|
218
|
-
|
|
219
|
+
|
|
220
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
221
|
+
4. **Signal Completion**: Push to completion queue
|
|
219
222
|
|
|
220
223
|
**Example Report:**
|
|
221
224
|
```
|
|
@@ -63,8 +63,7 @@ fi
|
|
|
63
63
|
|
|
64
64
|
### 3. Report Test Results (NOT Confidence)
|
|
65
65
|
|
|
66
|
-
**Old (Deprecated):**
|
|
67
|
-
```bash
|
|
66
|
+
**Old (Deprecated):** Not used
|
|
68
67
|
|
|
69
68
|
**New (Required):**
|
|
70
69
|
```bash
|
|
@@ -76,7 +75,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
|
76
75
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
77
76
|
TOTAL=$((PASS + FAIL))
|
|
78
77
|
RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
|
|
79
|
-
|
|
80
78
|
```
|
|
81
79
|
|
|
82
80
|
## 🚨 Mandatory Post-Edit Validation
|
|
@@ -270,6 +268,8 @@ DO NOT report subjective confidence scores. Instead:
|
|
|
270
268
|
Complete your work and provide test-based validation:
|
|
271
269
|
|
|
272
270
|
1. **Execute Tests**: Run all Playwright test suites from success criteria
|
|
271
|
+
|
|
272
|
+
```bash
|
|
273
273
|
# Parse natively (no external dependencies)
|
|
274
274
|
PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
|
|
275
275
|
FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
|
|
@@ -278,11 +278,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
|
|
|
278
278
|
|
|
279
279
|
# Return results (Main Chat receives automatically in Task Mode)
|
|
280
280
|
echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
|
|
281
|
+
```
|
|
282
|
+
|
|
283
|
+
2. **Validate Results**:
|
|
281
284
|
- Coverage: ≥80%
|
|
282
285
|
- Cross-browser coverage: X/Y browsers
|
|
283
286
|
- Critical flows covered: X/Y
|
|
284
|
-
|
|
285
|
-
|
|
287
|
+
|
|
288
|
+
3. **Store Results**: Use test-results key (not confidence key)
|
|
289
|
+
4. **Signal Completion**: Push to completion queue
|
|
286
290
|
|
|
287
291
|
**Example Report:**
|
|
288
292
|
```
|