claude-flow-novice 2.15.8 → 2.15.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (33) hide show
  1. package/.claude/skills/cfn-loop-orchestration/helpers/consensus-ts.sh +104 -0
  2. package/.claude/skills/cfn-loop-orchestration/helpers/deliverable-verifier-ts.sh +123 -0
  3. package/.claude/skills/cfn-loop-orchestration/helpers/iteration-manager-ts.sh +89 -0
  4. package/.claude/skills/cfn-loop-orchestration/helpers/timeout-calculator-ts.sh +47 -0
  5. package/.claude/skills/cfn-redis-coordination/report-completion.sh +55 -10
  6. package/.claude/skills/cfn-redis-coordination/store-context.sh +31 -1
  7. package/claude-assets/agents/cfn-dev-team/coordinators/cfn-frontend-coordinator.md +6 -1
  8. package/claude-assets/agents/cfn-dev-team/coordinators/consensus-builder.md +6 -1
  9. package/claude-assets/agents/cfn-dev-team/coordinators/handoff-coordinator.md +6 -1
  10. package/claude-assets/agents/cfn-dev-team/coordinators/multi-sprint-coordinator.md +6 -1
  11. package/claude-assets/agents/cfn-dev-team/dev-ops/docker-specialist.md +18 -8
  12. package/claude-assets/agents/cfn-dev-team/dev-ops/kubernetes-specialist.md +18 -8
  13. package/claude-assets/agents/cfn-dev-team/developers/api-gateway-specialist.md +18 -8
  14. package/claude-assets/agents/cfn-dev-team/developers/backend-developer.md +17 -7
  15. package/claude-assets/agents/cfn-dev-team/developers/graphql-specialist.md +17 -8
  16. package/claude-assets/agents/cfn-dev-team/developers/rust-developer.md +18 -8
  17. package/claude-assets/agents/cfn-dev-team/reviewers/code-reviewer.md +9 -5
  18. package/claude-assets/agents/cfn-dev-team/reviewers/quality/code-quality-validator.md +13 -6
  19. package/claude-assets/agents/cfn-dev-team/reviewers/quality/perf-analyzer.md +13 -6
  20. package/claude-assets/agents/cfn-dev-team/reviewers/quality/performance-benchmarker.md +13 -6
  21. package/claude-assets/agents/cfn-dev-team/reviewers/quality/security-specialist.md +15 -5
  22. package/claude-assets/agents/cfn-dev-team/testers/api-testing-specialist.md +9 -5
  23. package/claude-assets/agents/cfn-dev-team/testers/chaos-engineering-specialist.md +8 -4
  24. package/claude-assets/agents/cfn-dev-team/testers/interaction-tester.md +16 -13
  25. package/claude-assets/agents/cfn-dev-team/testers/playwright-tester.md +9 -5
  26. package/claude-assets/agents/cfn-dev-team/testers/tester.md +9 -5
  27. package/claude-assets/skills/cfn-loop-orchestration/helpers/consensus-ts.sh +104 -0
  28. package/claude-assets/skills/cfn-loop-orchestration/helpers/deliverable-verifier-ts.sh +123 -0
  29. package/claude-assets/skills/cfn-loop-orchestration/helpers/iteration-manager-ts.sh +89 -0
  30. package/claude-assets/skills/cfn-loop-orchestration/helpers/timeout-calculator-ts.sh +47 -0
  31. package/claude-assets/skills/cfn-redis-coordination/report-completion.sh +55 -10
  32. package/claude-assets/skills/cfn-redis-coordination/store-context.sh +31 -1
  33. package/package.json +1 -1
@@ -598,16 +598,26 @@ Before reporting high confidence:
598
598
  Complete your work and provide test-based validation:
599
599
 
600
600
  1. **Execute Tests**: Run all test suites from success criteria
601
- # Parse natively (no external dependencies)
602
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
603
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
604
- TOTAL=$((PASS + FAIL))
605
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
606
-
607
- # Return results (Main Chat receives automatically in Task Mode)
608
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
601
+ ```bash
602
+ # Parse natively (no external dependencies)
603
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
604
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
605
+ TOTAL=$((PASS + FAIL))
606
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
607
+
608
+ # Return results (Main Chat receives automatically in Task Mode)
609
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
610
+ ```
611
+
612
+ 2. **Parse Results**: Extract test counts and calculate pass rate
613
+
614
+ 3. **Coverage Check**: Ensure coverage meets minimum thresholds
615
+ - Manifest tests: ≥95%
616
+ - Deployment tests: ≥90%
609
617
  - Coverage: ≥80%
618
+
610
619
  4. **Store in Redis**: Use test-results key (not confidence key)
620
+
611
621
  5. **Signal Completion**: Push to completion queue
612
622
 
613
623
  **Example Report:**
@@ -963,16 +963,26 @@ Before reporting high confidence:
963
963
  Complete your work and provide test-based validation:
964
964
 
965
965
  1. **Execute Tests**: Run all test suites from success criteria
966
- # Parse natively (no external dependencies)
967
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
968
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
969
- TOTAL=$((PASS + FAIL))
970
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
971
-
972
- # Return results (Main Chat receives automatically in Task Mode)
973
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
966
+ ```bash
967
+ # Parse natively (no external dependencies)
968
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
969
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
970
+ TOTAL=$((PASS + FAIL))
971
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
972
+
973
+ # Return results (Main Chat receives automatically in Task Mode)
974
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
975
+ ```
976
+
977
+ 2. **Parse Results**: Extract test counts and calculate pass rate
978
+
979
+ 3. **Coverage Check**: Ensure coverage meets minimum thresholds
980
+ - Core tests: ≥95%
981
+ - Configuration tests: ≥90%
974
982
  - Coverage: ≥80%
983
+
975
984
  4. **Store in Redis**: Use test-results key (not confidence key)
985
+
976
986
  5. **Signal Completion**: Push to completion queue
977
987
 
978
988
  **Example Report:**
@@ -164,16 +164,26 @@ DO NOT report subjective confidence scores. Instead:
164
164
  Complete your work and provide test-based validation:
165
165
 
166
166
  1. **Execute Tests**: Run all test suites from success criteria
167
- # Parse natively (no external dependencies)
168
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
169
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
170
- TOTAL=$((PASS + FAIL))
171
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
167
+ ```bash
168
+ # Parse natively (no external dependencies)
169
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
170
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
171
+ TOTAL=$((PASS + FAIL))
172
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
173
+
174
+ # Return results (Main Chat receives automatically in Task Mode)
175
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
176
+ ```
177
+
178
+ 2. **Parse Results**: Extract test counts and calculate pass rate
172
179
 
173
- # Return results (Main Chat receives automatically in Task Mode)
174
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
180
+ 3. **Coverage Check**: Ensure coverage meets minimum thresholds
181
+ - Unit tests: ≥95%
182
+ - Integration tests: ≥90%
175
183
  - Coverage: ≥80%
184
+
176
185
  4. **Store in Redis**: Use test-results key (not confidence key)
186
+
177
187
  5. **Signal Completion**: Push to completion queue
178
188
 
179
189
  **Example Report:**
@@ -663,16 +663,25 @@ Validate work with tests instead of confidence scores:
663
663
  - Query complexity tests
664
664
  - Authentication/authorization tests
665
665
 
666
- # Parse natively (no external dependencies)
667
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
668
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
669
- TOTAL=$((PASS + FAIL))
670
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
671
-
672
- # Return results (Main Chat receives automatically in Task Mode)
673
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
666
+ 2. **Parse Test Results**: Extract test counts and calculate pass rate
667
+ ```bash
668
+ # Parse natively (no external dependencies)
669
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
670
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
671
+ TOTAL=$((PASS + FAIL))
672
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
673
+
674
+ # Return results (Main Chat receives automatically in Task Mode)
675
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
676
+ ```
677
+
678
+ 3. **Coverage Check**: Ensure coverage meets minimum thresholds
679
+ - Schema tests: ≥95%
680
+ - Resolver tests: ≥90%
674
681
  - Coverage: ≥80%
682
+
675
683
  4. **Store in Redis**: Use test-results key (not confidence key)
684
+
676
685
  5. **Signal Completion**: Push to completion queue
677
686
 
678
687
  ## Completion Protocol (Test-Driven)
@@ -187,16 +187,26 @@ Remember: Prioritize safety, performance, and clear, concise implementation.
187
187
  Complete your work and provide test-based validation:
188
188
 
189
189
  1. **Execute Tests**: Run all test suites from success criteria
190
- # Parse natively (no external dependencies)
191
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
192
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
193
- TOTAL=$((PASS + FAIL))
194
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
195
-
196
- # Return results (Main Chat receives automatically in Task Mode)
197
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
190
+ ```bash
191
+ # Parse natively (no external dependencies)
192
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
193
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
194
+ TOTAL=$((PASS + FAIL))
195
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
196
+
197
+ # Return results (Main Chat receives automatically in Task Mode)
198
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
199
+ ```
200
+
201
+ 2. **Parse Results**: Extract test counts and calculate pass rate
202
+
203
+ 3. **Coverage Check**: Ensure coverage meets minimum thresholds
204
+ - Unit tests: ≥95%
205
+ - Integration tests: ≥90%
198
206
  - Coverage: ≥80%
207
+
199
208
  4. **Store in Redis**: Use test-results key (not confidence key)
209
+
200
210
  5. **Signal Completion**: Push to completion queue
201
211
 
202
212
  **Example Report:**
@@ -52,8 +52,7 @@ fi
52
52
 
53
53
  ### 3. Report Test Results (NOT Confidence)
54
54
 
55
- **Old (Deprecated):**
56
- ```bash
55
+ **Old (Deprecated):** Not used
57
56
 
58
57
  **New (Required):**
59
58
  ```bash
@@ -65,7 +64,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
65
64
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
66
65
  TOTAL=$((PASS + FAIL))
67
66
  RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
68
-
69
67
  ```
70
68
 
71
69
  ## MCP Tool Access (Task Mode)
@@ -287,6 +285,8 @@ DO NOT report subjective confidence scores. Instead:
287
285
  Complete your work and provide test-based validation:
288
286
 
289
287
  1. **Execute Tests**: Run all test suites from success criteria
288
+
289
+ ```bash
290
290
  # Parse natively (no external dependencies)
291
291
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
292
292
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -295,9 +295,13 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
295
295
 
296
296
  # Return results (Main Chat receives automatically in Task Mode)
297
297
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
298
+ ```
299
+
300
+ 2. **Validate Results**:
298
301
  - Coverage: ≥80%
299
- 4. **Store in Redis**: Use test-results key (not confidence key)
300
- 5. **Signal Completion**: Push to completion queue
302
+
303
+ 3. **Store Results**: Use test-results key (not confidence key)
304
+ 4. **Signal Completion**: Push to completion queue
301
305
 
302
306
  **Example Report:**
303
307
  ```
@@ -64,8 +64,7 @@ fi
64
64
 
65
65
  ### 3. Report Test Results (NOT Confidence)
66
66
 
67
- **Old (Deprecated):**
68
- ```bash
67
+ **Old (Deprecated):** Not used
69
68
 
70
69
  **New (Required):**
71
70
  ```bash
@@ -80,7 +79,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
80
79
 
81
80
  # Return results (Main Chat receives automatically in Task Mode)
82
81
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
83
-
84
82
  ```
85
83
 
86
84
  ## Post-Edit Validation
@@ -206,6 +204,8 @@ Remember: Code analysis reveals improvement opportunities. Focus on actionable,
206
204
  DO NOT report subjective confidence scores. Instead:
207
205
 
208
206
  1. **Execute Tests**: Run test suite defined in success criteria
207
+
208
+ ```bash
209
209
  # Parse natively (no external dependencies)
210
210
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
211
211
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -214,8 +214,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
214
214
 
215
215
  # Return results (Main Chat receives automatically in Task Mode)
216
216
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
217
+ ```
217
218
 
218
- **Validation:**
219
+ **Validation Examples:**
219
220
  - ❌ OLD: "Confidence: 0.83 - quality metrics look solid"
220
221
  - ✅ NEW: "Quality Tests: 38/40 passed (95% pass rate) - 2 refactoring validation scenarios need review"
221
222
 
@@ -224,6 +225,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
224
225
  Complete your work and provide test-based validation:
225
226
 
226
227
  1. **Execute Tests**: Run all code quality test suites from success criteria
228
+
229
+ ```bash
227
230
  # Parse natively (no external dependencies)
228
231
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
229
232
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -232,11 +235,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
232
235
 
233
236
  # Return results (Main Chat receives automatically in Task Mode)
234
237
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
238
+ ```
239
+
240
+ 2. **Validate Results**:
235
241
  - Coverage: ≥80%
236
242
  - Code smells detected: N
237
243
  - Technical debt score: X/10
238
- 4. **Store in Redis**: Use test-results key (not confidence key)
239
- 5. **Signal Completion**: Push to completion queue
244
+
245
+ 3. **Store Results**: Use test-results key (not confidence key)
246
+ 4. **Signal Completion**: Push to completion queue
240
247
 
241
248
  **Example Report:**
242
249
  ```
@@ -54,8 +54,7 @@ fi
54
54
 
55
55
  ### 3. Report Test Results (NOT Confidence)
56
56
 
57
- **Old (Deprecated):**
58
- ```bash
57
+ **Old (Deprecated):** Not used
59
58
 
60
59
  **New (Required):**
61
60
  ```bash
@@ -70,7 +69,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
70
69
 
71
70
  # Return results (Main Chat receives automatically in Task Mode)
72
71
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
73
-
74
72
  ```
75
73
 
76
74
  ## Mandatory Post-Edit Validation
@@ -237,6 +235,8 @@ Remember: Optimize for highest impact with reasonable effort. Focus on critical
237
235
  DO NOT report subjective confidence scores. Instead:
238
236
 
239
237
  1. **Execute Tests**: Run test suite defined in success criteria
238
+
239
+ ```bash
240
240
  # Parse natively (no external dependencies)
241
241
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
242
242
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -245,8 +245,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
245
245
 
246
246
  # Return results (Main Chat receives automatically in Task Mode)
247
247
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
248
+ ```
248
249
 
249
- **Validation:**
250
+ **Validation Examples:**
250
251
  - ❌ OLD: "Confidence: 0.86 - analysis is thorough"
251
252
  - ✅ NEW: "Analysis Tests: 42/45 passed (93.3% pass rate) - 3 optimization scenarios need validation"
252
253
 
@@ -255,6 +256,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
255
256
  Complete your work and provide test-based validation:
256
257
 
257
258
  1. **Execute Tests**: Run all performance analysis test suites from success criteria
259
+
260
+ ```bash
258
261
  # Parse natively (no external dependencies)
259
262
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
260
263
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -263,11 +266,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
263
266
 
264
267
  # Return results (Main Chat receives automatically in Task Mode)
265
268
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
269
+ ```
270
+
271
+ 2. **Validate Results**:
266
272
  - Coverage: ≥80%
267
273
  - Bottlenecks identified: N
268
274
  - Expected improvement: X%
269
- 4. **Store in Redis**: Use test-results key (not confidence key)
270
- 5. **Signal Completion**: Push to completion queue
275
+
276
+ 3. **Store Results**: Use test-results key (not confidence key)
277
+ 4. **Signal Completion**: Push to completion queue
271
278
 
272
279
  **Example Report:**
273
280
  ```
@@ -51,8 +51,7 @@ fi
51
51
 
52
52
  ### 3. Report Test Results (NOT Confidence)
53
53
 
54
- **Old (Deprecated):**
55
- ```bash
54
+ **Old (Deprecated):** Not used
56
55
 
57
56
  **New (Required):**
58
57
  ```bash
@@ -67,7 +66,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
67
66
 
68
67
  # Return results (Main Chat receives automatically in Task Mode)
69
68
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
70
-
71
69
  ```
72
70
 
73
71
  ## 🚨 Mandatory Post-Edit Validation
@@ -83,6 +81,8 @@ Refer to [.claude/templates/post-edit-validation.md](../templates/post-edit-vali
83
81
  DO NOT report subjective confidence scores. Instead:
84
82
 
85
83
  1. **Execute Tests**: Run test suite defined in success criteria
84
+
85
+ ```bash
86
86
  # Parse natively (no external dependencies)
87
87
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
88
88
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -91,8 +91,9 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
91
91
 
92
92
  # Return results (Main Chat receives automatically in Task Mode)
93
93
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
94
+ ```
94
95
 
95
- **Validation:**
96
+ **Validation Examples:**
96
97
  - ❌ OLD: "Confidence: 0.88 - benchmarks look good"
97
98
  - ✅ NEW: "Benchmark Tests: 28/30 passed (93.3% pass rate) - 2 latency outliers detected"
98
99
 
@@ -101,6 +102,8 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
101
102
  Complete your work and provide test-based validation:
102
103
 
103
104
  1. **Execute Tests**: Run all benchmark test suites from success criteria
105
+
106
+ ```bash
104
107
  # Parse natively (no external dependencies)
105
108
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
106
109
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -109,10 +112,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
109
112
 
110
113
  # Return results (Main Chat receives automatically in Task Mode)
111
114
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
115
+ ```
116
+
117
+ 2. **Validate Results**:
112
118
  - Coverage: ≥80%
113
119
  - Performance baseline established: Yes/No
114
- 4. **Store in Redis**: Use test-results key (not confidence key)
115
- 5. **Signal Completion**: Push to completion queue
120
+
121
+ 3. **Store Results**: Use test-results key (not confidence key)
122
+ 4. **Signal Completion**: Push to completion queue
116
123
 
117
124
  **Example Report:**
118
125
  ```
@@ -49,8 +49,7 @@ fi
49
49
 
50
50
  ### 3. Report Test Results (NOT Confidence)
51
51
 
52
- **Old (Deprecated):**
53
- ```bash
52
+ **Old (Deprecated):** Not used
54
53
 
55
54
  **New (Required):**
56
55
  ```bash
@@ -65,7 +64,6 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
65
64
 
66
65
  # Return results (Main Chat receives automatically in Task Mode)
67
66
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
68
-
69
67
  ```
70
68
 
71
69
  ## 🚨 MANDATORY DOCUMENTATION REDACTION PROTOCOL
@@ -165,6 +163,9 @@ Security analysis results are captured and processed through structured reportin
165
163
  DO NOT report subjective confidence scores. Instead:
166
164
 
167
165
  1. **Execute Tests**: Run test suite defined in success criteria
166
+ 2. **Parse Results**: Use native bash parsing (no external dependencies)
167
+
168
+ ```bash
168
169
  # Parse natively (no external dependencies)
169
170
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
170
171
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -173,6 +174,8 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
173
174
 
174
175
  # Return results (Main Chat receives automatically in Task Mode)
175
176
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
177
+ ```
178
+ 3. **Pass Rate**: Your security analysis passes the gate if tests ≥ threshold (95% standard mode)
176
179
 
177
180
  **Validation:**
178
181
  - ❌ OLD: "Confidence: 0.90 - security looks solid"
@@ -183,6 +186,9 @@ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
183
186
  Complete your work and provide test-based validation:
184
187
 
185
188
  1. **Execute Tests**: Run all security test suites from success criteria
189
+ 2. **Parse Results**:
190
+
191
+ ```bash
186
192
  # Parse natively (no external dependencies)
187
193
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
188
194
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -191,10 +197,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
191
197
 
192
198
  # Return results (Main Chat receives automatically in Task Mode)
193
199
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
200
+ ```
201
+ 3. **Report Metrics**:
202
+ - Total tests: X
203
+ - Passed: Y
204
+ - Failed: Z
205
+ - Pass rate: Y/X (e.g., 0.95)
194
206
  - Coverage: ≥80%
195
207
  - Critical vulnerabilities found: N
196
- 4. **Store in Redis**: Use test-results key (not confidence key)
197
- 5. **Signal Completion**: Push to completion queue
198
208
 
199
209
  **Example Report:**
200
210
  ```
@@ -62,8 +62,7 @@ fi
62
62
 
63
63
  ### 3. Report Test Results (NOT Confidence)
64
64
 
65
- **Old (Deprecated):**
66
- ```bash
65
+ **Old (Deprecated):** Not used
67
66
 
68
67
  **New (Required):**
69
68
  ```bash
@@ -75,7 +74,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
75
74
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
76
75
  TOTAL=$((PASS + FAIL))
77
76
  RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
78
-
79
77
  ```
80
78
 
81
79
  ## Core Responsibilities
@@ -767,6 +765,8 @@ DO NOT report subjective confidence scores. Instead:
767
765
  Complete your work and provide test-based validation:
768
766
 
769
767
  1. **Execute Tests**: Run all API test suites from success criteria
768
+
769
+ ```bash
770
770
  # Parse natively (no external dependencies)
771
771
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
772
772
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -775,11 +775,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
775
775
 
776
776
  # Return results (Main Chat receives automatically in Task Mode)
777
777
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
778
+ ```
779
+
780
+ 2. **Validate Results**:
778
781
  - Coverage: ≥80%
779
782
  - Contract tests: X/Y passed
780
783
  - Security tests: X/Y passed
781
- 4. **Store in Redis**: Use test-results key (not confidence key)
782
- 5. **Signal Completion**: Push to completion queue
784
+
785
+ 3. **Store Results**: Use test-results key (not confidence key)
786
+ 4. **Signal Completion**: Push to completion queue
783
787
 
784
788
  **Example Report:**
785
789
  ```text
@@ -60,8 +60,7 @@ fi
60
60
 
61
61
  ### 3. Report Test Results (NOT Confidence)
62
62
 
63
- **Old (Deprecated):**
64
- ```bash
63
+ **Old (Deprecated):** Not used
65
64
 
66
65
  **New (Required):**
67
66
  ```bash
@@ -73,7 +72,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
73
72
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
74
73
  TOTAL=$((PASS + FAIL))
75
74
  RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
76
-
77
75
  ```
78
76
 
79
77
  ## Core Responsibilities
@@ -928,6 +926,8 @@ DO NOT report subjective confidence scores. Instead:
928
926
  Complete your work and provide test-based validation:
929
927
 
930
928
  1. **Execute Tests**: Run all chaos experiment test suites from success criteria
929
+
930
+ ```bash
931
931
  # Parse natively (no external dependencies)
932
932
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
933
933
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -936,10 +936,14 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
936
936
 
937
937
  # Return results (Main Chat receives automatically in Task Mode)
938
938
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
939
+ ```
940
+
941
+ 2. **Validate Results**:
939
942
  - Coverage: ≥80%
940
943
  - Critical paths covered: X/Y
941
944
  - System resilience verified: Yes/No
942
- 4. **Store in Redis**: Use test-results key (not confidence key)
945
+
946
+ 3. **Store Results**: Use test-results key (not confidence key)
943
947
  5. **Signal Completion**: Push to completion queue
944
948
 
945
949
  **Example Report:**
@@ -56,8 +56,7 @@ fi
56
56
 
57
57
  ### 3. Report Test Results (NOT Confidence)
58
58
 
59
- **Old (Deprecated):**
60
- ```bash
59
+ **Old (Deprecated):** Not used
61
60
 
62
61
  **New (Required):**
63
62
  ```bash
@@ -69,7 +68,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
69
68
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
70
69
  TOTAL=$((PASS + FAIL))
71
70
  RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
72
-
73
71
  ```
74
72
 
75
73
  ## MCP Tool Access (Task Mode)
@@ -203,19 +201,24 @@ DO NOT report subjective confidence scores. Instead:
203
201
  Complete your work and provide test-based validation:
204
202
 
205
203
  1. **Execute Tests**: Run all interaction test suites from success criteria
206
- # Parse natively (no external dependencies)
207
- PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
208
- FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
209
- TOTAL=$((PASS + FAIL))
210
- RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
211
-
212
- # Return results (Main Chat receives automatically in Task Mode)
213
- echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
204
+ ```bash
205
+ # Parse natively (no external dependencies)
206
+ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
207
+ FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
208
+ TOTAL=$((PASS + FAIL))
209
+ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
210
+
211
+ # Return results (Main Chat receives automatically in Task Mode)
212
+ echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
213
+ ```
214
+
215
+ 2. **Validate Results**:
214
216
  - Coverage: ≥80%
215
217
  - WCAG AA compliance: Yes/No
216
218
  - Critical flows covered: X/Y
217
- 4. **Store in Redis**: Use test-results key (not confidence key)
218
- 5. **Signal Completion**: Push to completion queue
219
+
220
+ 3. **Store Results**: Use test-results key (not confidence key)
221
+ 4. **Signal Completion**: Push to completion queue
219
222
 
220
223
  **Example Report:**
221
224
  ```
@@ -63,8 +63,7 @@ fi
63
63
 
64
64
  ### 3. Report Test Results (NOT Confidence)
65
65
 
66
- **Old (Deprecated):**
67
- ```bash
66
+ **Old (Deprecated):** Not used
68
67
 
69
68
  **New (Required):**
70
69
  ```bash
@@ -76,7 +75,6 @@ PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
76
75
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
77
76
  TOTAL=$((PASS + FAIL))
78
77
  RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0.00\"}")
79
-
80
78
  ```
81
79
 
82
80
  ## 🚨 Mandatory Post-Edit Validation
@@ -270,6 +268,8 @@ DO NOT report subjective confidence scores. Instead:
270
268
  Complete your work and provide test-based validation:
271
269
 
272
270
  1. **Execute Tests**: Run all Playwright test suites from success criteria
271
+
272
+ ```bash
273
273
  # Parse natively (no external dependencies)
274
274
  PASS=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= passing)' || echo "0")
275
275
  FAIL=$(echo "$TEST_OUTPUT" | grep -oP '\d+(?= failing)' || echo "0")
@@ -278,11 +278,15 @@ RATE=$(awk "BEGIN {if ($TOTAL > 0) printf \"%.2f\", $PASS/$TOTAL; else print \"0
278
278
 
279
279
  # Return results (Main Chat receives automatically in Task Mode)
280
280
  echo "{\"passed\": $PASS, \"failed\": $FAIL, \"pass_rate\": $RATE}"
281
+ ```
282
+
283
+ 2. **Validate Results**:
281
284
  - Coverage: ≥80%
282
285
  - Cross-browser coverage: X/Y browsers
283
286
  - Critical flows covered: X/Y
284
- 4. **Store in Redis**: Use test-results key (not confidence key)
285
- 5. **Signal Completion**: Push to completion queue
287
+
288
+ 3. **Store Results**: Use test-results key (not confidence key)
289
+ 4. **Signal Completion**: Push to completion queue
286
290
 
287
291
  **Example Report:**
288
292
  ```