agentic-qe 3.6.0 → 3.6.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (204)
  1. package/.claude/agents/v3/qe-devils-advocate.md +218 -0
  2. package/.claude/agents/v3/qe-quality-criteria-recommender.md +2 -2
  3. package/.claude/skills/qe-iterative-loop/SKILL.md +1 -1
  4. package/.claude/skills/release/SKILL.md +17 -31
  5. package/.claude/skills/skills-manifest.json +1 -1
  6. package/README.md +38 -35
  7. package/package.json +1 -1
  8. package/scripts/cloud-db-config.json +1 -1
  9. package/v3/CHANGELOG.md +44 -0
  10. package/v3/README.md +7 -7
  11. package/v3/assets/agents/v3/qe-devils-advocate.md +218 -0
  12. package/v3/assets/agents/v3/qe-quality-criteria-recommender.md +2 -2
  13. package/v3/assets/skills/qe-iterative-loop/SKILL.md +1 -1
  14. package/v3/dist/agents/devils-advocate/agent.d.ts +103 -0
  15. package/v3/dist/agents/devils-advocate/agent.d.ts.map +1 -0
  16. package/v3/dist/agents/devils-advocate/agent.js +240 -0
  17. package/v3/dist/agents/devils-advocate/agent.js.map +1 -0
  18. package/v3/dist/agents/devils-advocate/index.d.ts +60 -0
  19. package/v3/dist/agents/devils-advocate/index.d.ts.map +1 -0
  20. package/v3/dist/agents/devils-advocate/index.js +72 -0
  21. package/v3/dist/agents/devils-advocate/index.js.map +1 -0
  22. package/v3/dist/agents/devils-advocate/strategies.d.ts +59 -0
  23. package/v3/dist/agents/devils-advocate/strategies.d.ts.map +1 -0
  24. package/v3/dist/agents/devils-advocate/strategies.js +438 -0
  25. package/v3/dist/agents/devils-advocate/strategies.js.map +1 -0
  26. package/v3/dist/agents/devils-advocate/types.d.ts +182 -0
  27. package/v3/dist/agents/devils-advocate/types.d.ts.map +1 -0
  28. package/v3/dist/agents/devils-advocate/types.js +96 -0
  29. package/v3/dist/agents/devils-advocate/types.js.map +1 -0
  30. package/v3/dist/agents/index.d.ts +20 -0
  31. package/v3/dist/agents/index.d.ts.map +1 -0
  32. package/v3/dist/agents/index.js +20 -0
  33. package/v3/dist/agents/index.js.map +1 -0
  34. package/v3/dist/cli/bundle.js +4489 -119
  35. package/v3/dist/coordination/agent-teams/adapter.d.ts +108 -0
  36. package/v3/dist/coordination/agent-teams/adapter.d.ts.map +1 -0
  37. package/v3/dist/coordination/agent-teams/adapter.js +316 -0
  38. package/v3/dist/coordination/agent-teams/adapter.js.map +1 -0
  39. package/v3/dist/coordination/agent-teams/domain-team-manager.d.ts +164 -0
  40. package/v3/dist/coordination/agent-teams/domain-team-manager.d.ts.map +1 -0
  41. package/v3/dist/coordination/agent-teams/domain-team-manager.js +342 -0
  42. package/v3/dist/coordination/agent-teams/domain-team-manager.js.map +1 -0
  43. package/v3/dist/coordination/agent-teams/index.d.ts +53 -0
  44. package/v3/dist/coordination/agent-teams/index.d.ts.map +1 -0
  45. package/v3/dist/coordination/agent-teams/index.js +61 -0
  46. package/v3/dist/coordination/agent-teams/index.js.map +1 -0
  47. package/v3/dist/coordination/agent-teams/mailbox.d.ts +142 -0
  48. package/v3/dist/coordination/agent-teams/mailbox.d.ts.map +1 -0
  49. package/v3/dist/coordination/agent-teams/mailbox.js +395 -0
  50. package/v3/dist/coordination/agent-teams/mailbox.js.map +1 -0
  51. package/v3/dist/coordination/agent-teams/tracing.d.ts +199 -0
  52. package/v3/dist/coordination/agent-teams/tracing.d.ts.map +1 -0
  53. package/v3/dist/coordination/agent-teams/tracing.js +308 -0
  54. package/v3/dist/coordination/agent-teams/tracing.js.map +1 -0
  55. package/v3/dist/coordination/agent-teams/types.d.ts +121 -0
  56. package/v3/dist/coordination/agent-teams/types.d.ts.map +1 -0
  57. package/v3/dist/coordination/agent-teams/types.js +17 -0
  58. package/v3/dist/coordination/agent-teams/types.js.map +1 -0
  59. package/v3/dist/coordination/circuit-breaker/breaker-registry.d.ts +146 -0
  60. package/v3/dist/coordination/circuit-breaker/breaker-registry.d.ts.map +1 -0
  61. package/v3/dist/coordination/circuit-breaker/breaker-registry.js +368 -0
  62. package/v3/dist/coordination/circuit-breaker/breaker-registry.js.map +1 -0
  63. package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.d.ts +134 -0
  64. package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.d.ts.map +1 -0
  65. package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.js +337 -0
  66. package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.js.map +1 -0
  67. package/v3/dist/coordination/circuit-breaker/index.d.ts +46 -0
  68. package/v3/dist/coordination/circuit-breaker/index.d.ts.map +1 -0
  69. package/v3/dist/coordination/circuit-breaker/index.js +51 -0
  70. package/v3/dist/coordination/circuit-breaker/index.js.map +1 -0
  71. package/v3/dist/coordination/circuit-breaker/types.d.ts +112 -0
  72. package/v3/dist/coordination/circuit-breaker/types.d.ts.map +1 -0
  73. package/v3/dist/coordination/circuit-breaker/types.js +10 -0
  74. package/v3/dist/coordination/circuit-breaker/types.js.map +1 -0
  75. package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.d.ts +122 -0
  76. package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.d.ts.map +1 -0
  77. package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.js +377 -0
  78. package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.js.map +1 -0
  79. package/v3/dist/coordination/competing-hypotheses/index.d.ts +34 -0
  80. package/v3/dist/coordination/competing-hypotheses/index.d.ts.map +1 -0
  81. package/v3/dist/coordination/competing-hypotheses/index.js +39 -0
  82. package/v3/dist/coordination/competing-hypotheses/index.js.map +1 -0
  83. package/v3/dist/coordination/competing-hypotheses/types.d.ts +134 -0
  84. package/v3/dist/coordination/competing-hypotheses/types.d.ts.map +1 -0
  85. package/v3/dist/coordination/competing-hypotheses/types.js +20 -0
  86. package/v3/dist/coordination/competing-hypotheses/types.js.map +1 -0
  87. package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.d.ts +173 -0
  88. package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.d.ts.map +1 -0
  89. package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.js +368 -0
  90. package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.js.map +1 -0
  91. package/v3/dist/coordination/dynamic-scaling/index.d.ts +38 -0
  92. package/v3/dist/coordination/dynamic-scaling/index.d.ts.map +1 -0
  93. package/v3/dist/coordination/dynamic-scaling/index.js +39 -0
  94. package/v3/dist/coordination/dynamic-scaling/index.js.map +1 -0
  95. package/v3/dist/coordination/dynamic-scaling/types.d.ts +147 -0
  96. package/v3/dist/coordination/dynamic-scaling/types.d.ts.map +1 -0
  97. package/v3/dist/coordination/dynamic-scaling/types.js +40 -0
  98. package/v3/dist/coordination/dynamic-scaling/types.js.map +1 -0
  99. package/v3/dist/coordination/federation/federation-mailbox.d.ts +215 -0
  100. package/v3/dist/coordination/federation/federation-mailbox.d.ts.map +1 -0
  101. package/v3/dist/coordination/federation/federation-mailbox.js +442 -0
  102. package/v3/dist/coordination/federation/federation-mailbox.js.map +1 -0
  103. package/v3/dist/coordination/federation/index.d.ts +38 -0
  104. package/v3/dist/coordination/federation/index.d.ts.map +1 -0
  105. package/v3/dist/coordination/federation/index.js +39 -0
  106. package/v3/dist/coordination/federation/index.js.map +1 -0
  107. package/v3/dist/coordination/federation/types.d.ts +103 -0
  108. package/v3/dist/coordination/federation/types.d.ts.map +1 -0
  109. package/v3/dist/coordination/federation/types.js +20 -0
  110. package/v3/dist/coordination/federation/types.js.map +1 -0
  111. package/v3/dist/coordination/fleet-tiers/index.d.ts +39 -0
  112. package/v3/dist/coordination/fleet-tiers/index.d.ts.map +1 -0
  113. package/v3/dist/coordination/fleet-tiers/index.js +44 -0
  114. package/v3/dist/coordination/fleet-tiers/index.js.map +1 -0
  115. package/v3/dist/coordination/fleet-tiers/tier-config.d.ts +60 -0
  116. package/v3/dist/coordination/fleet-tiers/tier-config.d.ts.map +1 -0
  117. package/v3/dist/coordination/fleet-tiers/tier-config.js +242 -0
  118. package/v3/dist/coordination/fleet-tiers/tier-config.js.map +1 -0
  119. package/v3/dist/coordination/fleet-tiers/tier-selector.d.ts +134 -0
  120. package/v3/dist/coordination/fleet-tiers/tier-selector.d.ts.map +1 -0
  121. package/v3/dist/coordination/fleet-tiers/tier-selector.js +373 -0
  122. package/v3/dist/coordination/fleet-tiers/tier-selector.js.map +1 -0
  123. package/v3/dist/coordination/fleet-tiers/types.d.ts +137 -0
  124. package/v3/dist/coordination/fleet-tiers/types.d.ts.map +1 -0
  125. package/v3/dist/coordination/fleet-tiers/types.js +20 -0
  126. package/v3/dist/coordination/fleet-tiers/types.js.map +1 -0
  127. package/v3/dist/coordination/index.d.ts +16 -0
  128. package/v3/dist/coordination/index.d.ts.map +1 -1
  129. package/v3/dist/coordination/index.js +29 -0
  130. package/v3/dist/coordination/index.js.map +1 -1
  131. package/v3/dist/coordination/queen-coordinator.d.ts +79 -0
  132. package/v3/dist/coordination/queen-coordinator.d.ts.map +1 -1
  133. package/v3/dist/coordination/queen-coordinator.js +363 -0
  134. package/v3/dist/coordination/queen-coordinator.js.map +1 -1
  135. package/v3/dist/coordination/task-dag/dag.d.ts +93 -0
  136. package/v3/dist/coordination/task-dag/dag.d.ts.map +1 -0
  137. package/v3/dist/coordination/task-dag/dag.js +496 -0
  138. package/v3/dist/coordination/task-dag/dag.js.map +1 -0
  139. package/v3/dist/coordination/task-dag/index.d.ts +54 -0
  140. package/v3/dist/coordination/task-dag/index.d.ts.map +1 -0
  141. package/v3/dist/coordination/task-dag/index.js +62 -0
  142. package/v3/dist/coordination/task-dag/index.js.map +1 -0
  143. package/v3/dist/coordination/task-dag/scheduler.d.ts +123 -0
  144. package/v3/dist/coordination/task-dag/scheduler.d.ts.map +1 -0
  145. package/v3/dist/coordination/task-dag/scheduler.js +262 -0
  146. package/v3/dist/coordination/task-dag/scheduler.js.map +1 -0
  147. package/v3/dist/coordination/task-dag/types.d.ts +103 -0
  148. package/v3/dist/coordination/task-dag/types.d.ts.map +1 -0
  149. package/v3/dist/coordination/task-dag/types.js +9 -0
  150. package/v3/dist/coordination/task-dag/types.js.map +1 -0
  151. package/v3/dist/domains/enterprise-integration/services/odata-service.js +3 -3
  152. package/v3/dist/domains/enterprise-integration/services/odata-service.js.map +1 -1
  153. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.d.ts.map +1 -1
  154. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js +9 -4
  155. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js.map +1 -1
  156. package/v3/dist/domains/requirements-validation/services/quality-criteria/quality-criteria-service.js +1 -1
  157. package/v3/dist/domains/requirements-validation/services/quality-criteria/quality-criteria-service.js.map +1 -1
  158. package/v3/dist/hooks/index.d.ts +8 -1
  159. package/v3/dist/hooks/index.d.ts.map +1 -1
  160. package/v3/dist/hooks/index.js +8 -1
  161. package/v3/dist/hooks/index.js.map +1 -1
  162. package/v3/dist/hooks/quality-gate-enforcer.d.ts +134 -0
  163. package/v3/dist/hooks/quality-gate-enforcer.d.ts.map +1 -0
  164. package/v3/dist/hooks/quality-gate-enforcer.js +265 -0
  165. package/v3/dist/hooks/quality-gate-enforcer.js.map +1 -0
  166. package/v3/dist/hooks/reasoning-bank-pattern-store.d.ts +60 -0
  167. package/v3/dist/hooks/reasoning-bank-pattern-store.d.ts.map +1 -0
  168. package/v3/dist/hooks/reasoning-bank-pattern-store.js +179 -0
  169. package/v3/dist/hooks/reasoning-bank-pattern-store.js.map +1 -0
  170. package/v3/dist/hooks/task-completed-hook.d.ts +174 -0
  171. package/v3/dist/hooks/task-completed-hook.d.ts.map +1 -0
  172. package/v3/dist/hooks/task-completed-hook.js +330 -0
  173. package/v3/dist/hooks/task-completed-hook.js.map +1 -0
  174. package/v3/dist/hooks/teammate-idle-hook.d.ts +167 -0
  175. package/v3/dist/hooks/teammate-idle-hook.d.ts.map +1 -0
  176. package/v3/dist/hooks/teammate-idle-hook.js +332 -0
  177. package/v3/dist/hooks/teammate-idle-hook.js.map +1 -0
  178. package/v3/dist/index.d.ts +3 -0
  179. package/v3/dist/index.d.ts.map +1 -1
  180. package/v3/dist/index.js +4 -0
  181. package/v3/dist/index.js.map +1 -1
  182. package/v3/dist/init/agents-installer.d.ts +5 -1
  183. package/v3/dist/init/agents-installer.d.ts.map +1 -1
  184. package/v3/dist/init/agents-installer.js +13 -5
  185. package/v3/dist/init/agents-installer.js.map +1 -1
  186. package/v3/dist/init/phases/12-verification.d.ts.map +1 -1
  187. package/v3/dist/init/phases/12-verification.js +13 -1
  188. package/v3/dist/init/phases/12-verification.js.map +1 -1
  189. package/v3/dist/kernel/unified-memory.d.ts.map +1 -1
  190. package/v3/dist/kernel/unified-memory.js +303 -18
  191. package/v3/dist/kernel/unified-memory.js.map +1 -1
  192. package/v3/dist/learning/pattern-store.js +1 -1
  193. package/v3/dist/learning/pattern-store.js.map +1 -1
  194. package/v3/dist/learning/qe-patterns.d.ts +2 -0
  195. package/v3/dist/learning/qe-patterns.d.ts.map +1 -1
  196. package/v3/dist/learning/qe-patterns.js.map +1 -1
  197. package/v3/dist/learning/qe-reasoning-bank.d.ts.map +1 -1
  198. package/v3/dist/learning/qe-reasoning-bank.js +16 -3
  199. package/v3/dist/learning/qe-reasoning-bank.js.map +1 -1
  200. package/v3/dist/mcp/bundle.js +4473 -133
  201. package/v3/dist/sync/cloud/tunnel-manager.d.ts.map +1 -1
  202. package/v3/dist/sync/cloud/tunnel-manager.js +11 -0
  203. package/v3/dist/sync/cloud/tunnel-manager.js.map +1 -1
  204. package/v3/package.json +1 -1
@@ -0,0 +1,218 @@
1
+ ---
2
+ name: qe-devils-advocate
3
+ version: "3.6.0"
4
+ updated: "2026-02-09"
5
+ description: Meta-agent that challenges other agents' outputs by finding gaps, questioning assumptions, and critiquing completeness
6
+ v2_compat: null
7
+ domain: quality-assessment
8
+ ---
9
+
10
+ <qe_agent_definition>
11
+ <identity>
12
+ You are the V3 QE Devil's Advocate, the adversarial reviewer in Agentic QE v3.
13
+ Mission: Challenge other agents' outputs to surface gaps, blind spots, false positives, and unquestioned assumptions before results reach users.
14
+ Domain: quality-assessment (ADR-064)
15
+ V2 Compatibility: New in v3 -- no v2 equivalent.
16
+ </identity>
17
+
18
+ <implementation_status>
19
+ Working:
20
+ - Missing edge case detection (boundary values, null/undefined, concurrency)
21
+ - False positive detection in security scans and coverage reports
22
+ - Coverage gap critique (structural vs semantic coverage gaps)
23
+ - Security blind spot identification (missing threat vectors)
24
+ - Assumption questioning (implicit preconditions, happy-path bias)
25
+ - Boundary value gap analysis (off-by-one, overflow, empty collections)
26
+ - Error handling gap detection (missing catch blocks, swallowed errors)
27
+ - Configurable severity thresholds and confidence filters
28
+ - Per-review and cumulative statistics tracking
29
+
30
+ Partial:
31
+ - Integration with Queen Coordinator task pipeline
32
+ - Cross-domain challenge coordination
33
+
34
+ Planned:
35
+ - Learning from past challenge outcomes (which challenges were acted on)
36
+ - Auto-escalation for repeated unchallenged gaps
37
+ </implementation_status>
38
+
39
+ <default_to_action>
40
+ Review outputs immediately when a ChallengeTarget is provided.
41
+ Apply all applicable strategies without confirmation.
42
+ Filter results by configured minConfidence and minSeverity.
43
+ Report challenges in descending severity order.
44
+ Always produce a summary even when no challenges are found.
45
+ </default_to_action>
46
+
47
+ <parallel_execution>
48
+ Run all applicable challenge strategies concurrently against the target.
49
+ Strategies are independent -- missing-edge-case, false-positive, coverage-gap, etc. run in parallel.
50
+ Aggregate and sort results by severity after all strategies complete.
51
+ Use up to 7 concurrent strategies per review.
52
+ </parallel_execution>
53
+
54
+ <capabilities>
55
+ - **Missing Edge Case Detection**: Identify untested boundary values, null handling, concurrency, and error paths in test generation outputs
56
+ - **False Positive Detection**: Flag likely false positives in security scans and coverage reports by checking for vague descriptions, low confidence, and known false-positive patterns
57
+ - **Coverage Gap Critique**: Challenge coverage claims by checking for missing negative tests, missing integration paths, and semantic gaps not visible in line coverage
58
+ - **Security Blind Spot Identification**: Find missing threat vectors (injection, auth bypass, SSRF, deserialization) not covered by security scan results
59
+ - **Assumption Questioning**: Surface implicit assumptions in quality assessments, requirements validations, and defect predictions
60
+ - **Boundary Value Gap Analysis**: Detect missing tests for off-by-one errors, integer overflow, empty/max-size collections, and Unicode edge cases
61
+ - **Error Handling Gap Detection**: Find missing error handling for network failures, timeouts, malformed input, and resource exhaustion
62
+ </capabilities>
63
+
64
+ <memory_namespace>
65
+ Reads:
66
+ - aqe/v3/domains/test-generation/results/* - Test generation outputs to challenge
67
+ - aqe/v3/domains/coverage-analysis/results/* - Coverage reports to critique
68
+ - aqe/v3/domains/security-compliance/scans/* - Security scans to review
69
+ - aqe/v3/domains/quality-assessment/reports/* - Quality reports to question
70
+
71
+ Writes:
72
+ - aqe/v3/devils-advocate/reviews/* - Challenge review results
73
+ - aqe/v3/devils-advocate/stats/* - Cumulative challenge statistics
74
+ - aqe/v3/devils-advocate/patterns/* - Learned gap patterns
75
+
76
+ Coordination:
77
+ - aqe/v3/queen/tasks/* - Task status updates
78
+ - aqe/v3/domains/*/results/* - Cross-domain output access
79
+ </memory_namespace>
80
+
81
+ <learning_protocol>
82
+ **MANDATORY**: When executed via Claude Code Task tool, you MUST call learning MCP tools.
83
+
84
+ ### Query Past Challenge Patterns BEFORE Review
85
+
+ ```typescript
+ mcp__agentic-qe__memory_retrieve({
+   key: "devils-advocate/patterns",
+   namespace: "learning"
+ })
+ ```
92
+
93
+ ### Required Learning Actions (Call AFTER Review)
94
+
95
+ **1. Store Challenge Review Experience:**
+ ```typescript
+ mcp__agentic-qe__memory_store({
+   key: "devils-advocate/outcome-{timestamp}",
+   namespace: "learning",
+   value: {
+     agentId: "qe-devils-advocate",
+     taskType: "challenge-review",
+     reward: <calculated_reward>,
+     outcome: {
+       targetType: "<test-generation|coverage-analysis|security-scan|...>",
+       targetAgentId: "<agent that produced the output>",
+       challengeCount: <number>,
+       highSeverityCount: <number>,
+       overallScore: <0-1>,
+       verdict: "PASSED|CHALLENGED"
+     },
+     patterns: {
+       gapsFound: ["<types of gaps found>"],
+       strategiesUsed: ["<strategies that produced findings>"]
+     }
+   }
+ })
+ ```
119
+
120
+ **2. Submit Review Result to Queen:**
+ ```typescript
+ mcp__agentic-qe__task_submit({
+   type: "challenge-review-complete",
+   priority: "p1",
+   payload: {
+     targetAgentId: "...",
+     targetType: "...",
+     challengeCount: <number>,
+     highSeverityCount: <number>,
+     summary: "...",
+     challenges: [...]
+   }
+ })
+ ```
135
+
136
+ ### Reward Calculation Criteria (0-1 scale)
+ | Reward | Criteria |
+ |--------|----------|
+ | 1.0 | Actionable critical findings confirmed by follow-up |
+ | 0.9 | High-severity gaps found with clear evidence |
+ | 0.7 | Medium gaps found, strategies well-targeted |
+ | 0.5 | Review completed, minor findings only |
+ | 0.3 | Review completed, no significant findings (clean output) |
+ | 0.0 | Review failed or produced only noise/false challenges |
145
+ </learning_protocol>
146
+
147
+ <output_format>
148
+ - JSON for structured challenge results (challenges array, scores, summary)
149
+ - Markdown for human-readable challenge reports
150
+ - Challenges sorted by severity (critical > high > medium > low > informational)
151
+ - Include challenge count, overall confidence score, and per-strategy breakdown
152
+ </output_format>
153
+
154
+ <examples>
155
+ Example 1: Challenge test generation output
+ ```
+ Input: Review test-generation output from agent test-gen-001
+ - 5 tests generated for UserService.createUser()
+ - All tests check happy path with valid data
+
+ Output: CHALLENGED (Score: 0.38, 4 challenges)
+ [HIGH] Missing edge case: No test for duplicate email
+ [HIGH] Missing edge case: No test for empty/null username
+ [MEDIUM] Boundary value gap: No max-length validation test
+ [LOW] Error handling gap: No test for database connection failure
+ Summary: 5 tests cover only the happy path. No negative tests,
+ no boundary tests, no error handling tests. Test suite has
+ significant gaps in edge case coverage.
+ ```
170
+
171
+ Example 2: Challenge security scan output
+ ```
+ Input: Review security-scan output from agent sec-scan-001
+ - 0 vulnerabilities found
+ - Scanned: SQL injection, XSS
+
+ Output: CHALLENGED (Score: 0.52, 2 challenges)
+ [HIGH] Security blind spot: No SSRF testing performed
+ [MEDIUM] Security blind spot: No deserialization checks
+ Summary: Scan covers injection and XSS but misses SSRF,
+ deserialization, and authentication bypass vectors.
+ ```
183
+ </examples>
184
+
185
+ <v3_integration>
186
+ ### Code Implementation
187
+ The Devil's Advocate agent is implemented in `v3/src/agents/devils-advocate/`:
188
+ - `agent.ts` - Core `DevilsAdvocate` class with `review()` method
189
+ - `strategies.ts` - 7 pluggable challenge strategies
190
+ - `types.ts` - Type definitions for targets, challenges, results
191
+
192
+ ### Usage
+ ```typescript
+ import { DevilsAdvocate } from '@agentic-qe/v3';
+
+ const da = DevilsAdvocate.createDevilsAdvocate({ minConfidence: 0.5 });
+
+ const result = da.review({
+   type: 'test-generation',
+   agentId: 'test-gen-001',
+   domain: 'test-generation',
+   output: { testCount: 3, tests: [] },
+   timestamp: Date.now(),
+ });
+ ```
206
+
207
+ ### Strategies
+ | Strategy | Applies To | Detects |
+ |----------|-----------|---------|
+ | MissingEdgeCaseStrategy | test-generation | Untested edge cases, null handling |
+ | FalsePositiveDetectionStrategy | security-scan, coverage-analysis | Likely false positives |
+ | CoverageGapCritiqueStrategy | coverage-analysis | Semantic gaps in coverage |
+ | SecurityBlindSpotStrategy | security-scan | Missing threat vectors |
+ | AssumptionQuestioningStrategy | quality-assessment, defect-prediction, requirements | Implicit assumptions |
+ | BoundaryValueGapStrategy | test-generation | Off-by-one, overflow, empty collections |
+ | ErrorHandlingGapStrategy | test-generation, contract-validation | Missing error handling |
217
+ </v3_integration>
218
+ </qe_agent_definition>
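
Note on the hunk above: the `<default_to_action>`, `<parallel_execution>`, and `<output_format>` sections describe a review flow in which independent strategies run concurrently, results are filtered by `minConfidence` and `minSeverity`, challenges are reported in descending severity order, and a summary is always produced. The following is a minimal sketch of that flow, not the package's implementation. The names `Challenge`, `Strategy`, `SEVERITY_RANK`, and `reviewSketch` are hypothetical and do not come from the package; only the severity ordering (critical > high > medium > low > informational) and the PASSED/CHALLENGED verdicts are taken from the agent definition.

```typescript
// Hypothetical sketch: these names and shapes are illustrative assumptions,
// not the actual API exported by agentic-qe.
type Severity = "critical" | "high" | "medium" | "low" | "informational";

interface Challenge {
  strategy: string;
  severity: Severity;
  confidence: number; // assumed to be in the range 0..1
  message: string;
}

type Strategy = (output: unknown) => Promise<Challenge[]>;

// Higher rank means more severe; mirrors the ordering listed in <output_format>.
const SEVERITY_RANK: Record<Severity, number> = {
  critical: 4,
  high: 3,
  medium: 2,
  low: 1,
  informational: 0,
};

async function reviewSketch(
  output: unknown,
  strategies: Strategy[],
  minConfidence: number,
  minSeverity: Severity,
): Promise<{ challenges: Challenge[]; summary: string }> {
  // Strategies are independent, so they can all be started at once.
  const perStrategy = await Promise.all(strategies.map((s) => s(output)));

  // Flatten the per-strategy results into a single list.
  const all = ([] as Challenge[]).concat(...perStrategy);

  // Keep only challenges at or above both configured thresholds,
  // then order them from most to least severe.
  const challenges = all
    .filter(
      (c) =>
        c.confidence >= minConfidence &&
        SEVERITY_RANK[c.severity] >= SEVERITY_RANK[minSeverity],
    )
    .sort((a, b) => SEVERITY_RANK[b.severity] - SEVERITY_RANK[a.severity]);

  // A summary is produced even when nothing survives the filters.
  const summary =
    challenges.length === 0
      ? "PASSED: no challenges above the configured thresholds"
      : "CHALLENGED: " + challenges.length + " challenge(s)";

  return { challenges, summary };
}
```

In this sketch the filter runs before the sort so that only surviving challenges are ordered. Whether the real `review()` method aggregates results this way is not visible in this part of the diff.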
@@ -245,7 +245,7 @@ interface QualityCriteriaAnalysis {
245
245
  ```
246
246
 
247
247
  ## Template Location
248
- Helper files installed to `.claude/agents/v3/helpers/quality-criteria/`:
248
+ Helper files installed to `.claude/helpers/v3/quality-criteria/`:
249
249
  - `quality-criteria-reference-template.html` - HTML output template (MUST read before generating)
250
250
  - `htsm-categories.md` - Detailed category definitions
251
251
  - `evidence-classification.md` - Evidence type guidelines
@@ -404,7 +404,7 @@ if (!valid) {
404
404
  ### Output Validation
405
405
  If HTML output requested, always read template first:
406
406
  ```
407
- .claude/agents/v3/helpers/quality-criteria/quality-criteria-reference-template.html
407
+ .claude/helpers/v3/quality-criteria/quality-criteria-reference-template.html
408
408
  ```
409
409
  </final_validation>
410
410
  </qe_agent_definition>
@@ -441,5 +441,5 @@ When ALL phases complete -> <promise>DEPLOYMENT_READY</promise>
441
441
  ---
442
442
 
443
443
  **Origin**: Adapted from Ralph Wiggum plugin (anthropics/claude-code)
444
- **Specialized for**: Agentic QE v3 Fleet with 59 QE agents
444
+ **Specialized for**: Agentic QE v3 Fleet with 60 QE agents
445
445
  **Domains**: test-generation, test-execution, coverage-analysis, quality-assessment
@@ -144,27 +144,23 @@ Verify init completes without errors and creates the expected project structure
144
144
  # Version output
145
145
  node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js --version
146
146
 
147
- # Doctor check
148
- node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js doctor
147
+ # System status
148
+ node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js status
149
149
  ```
150
150
  Both must succeed without errors.
151
151
 
152
- #### 8d. Verify MCP Tools
153
- ```bash
154
- # Verify MCP server can start and list tools
155
- node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js mcp --list-tools 2>&1 | head -30
156
- ```
157
- Should list available MCP tools without crashing.
158
-
159
- #### 8e. Verify Self-Learning & Fleet Capabilities
152
+ #### 8d. Verify Self-Learning & Fleet Capabilities
160
153
  ```bash
161
154
  cd /tmp/aqe-release-test
162
155
 
163
- # Verify memory/learning subsystem
164
- node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js memory list 2>&1 | head -10
156
+ # Verify learning subsystem
157
+ node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js learning stats 2>&1 | head -10
165
158
 
166
- # Verify agent spawning works
159
+ # Verify agent listing works
167
160
  node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js agent list 2>&1 | head -10
161
+
162
+ # Verify health check
163
+ node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js health 2>&1 | head -10
168
164
  ```
169
165
  These should respond (even if empty results) without errors, confirming the subsystems initialize properly.
170
166
 
@@ -177,34 +173,24 @@ rm -rf /tmp/aqe-release-test
177
173
 
178
174
  ### 9. Local CI Test Suite
179
175
 
180
- Run the same tests that CI runs on PRs (`optimized-ci.yml`) and during publish (`npm-publish.yml`). Skip e2e browser tests unless the user explicitly requests them.
176
+ Run the same tests that CI runs on PRs and during publish. Skip e2e browser tests unless the user explicitly requests them.
181
177
 
182
178
  ```bash
183
179
  cd /workspaces/agentic-qe-new/v3
184
180
 
185
- # Journey tests (highest-value signal, from optimized-ci.yml)
186
- npm run test:journeys
187
-
188
- # Code Intelligence tests (MinCut/Graph algorithms, from optimized-ci.yml)
189
- npm run test:code-intelligence
190
-
191
- # Contract tests (if they exist, from optimized-ci.yml)
192
- npm run test:contracts 2>/dev/null || echo "No contract tests"
193
-
194
- # Infrastructure tests (from optimized-ci.yml)
195
- npm run test:infrastructure 2>/dev/null || echo "No infrastructure tests"
196
-
197
- # Regression tests (from optimized-ci.yml)
198
- npm run test:regression 2>/dev/null || echo "No regression tests"
199
-
200
- # Performance gates (from optimized-ci.yml)
181
+ # Performance gates (fast validates perf thresholds)
201
182
  npm run performance:gate
202
183
 
184
+ # Regression tests (runs full unit suite)
185
+ npm run test:regression
186
+
203
187
  # Full test:ci suite (from npm-publish.yml — excludes browser/e2e)
204
188
  npm run test:ci
205
189
  ```
206
190
 
207
- All mandatory test suites must pass. If any fail, diagnose and fix before continuing.
191
+ Available test scripts: `test:unit`, `test:unit:fast`, `test:unit:heavy`, `test:unit:mcp`, `test:ci`, `test:regression`, `test:safe`, `test:perf`, `test:e2e`, `test:coverage`, `performance:gate`.
192
+
193
+ All mandatory test suites must pass. Pre-existing MCP handler test failures (tests that need runtime initialization) are acceptable if they also fail on the main branch.
208
194
 
209
195
  **STOP — show all test results.**
210
196
 
@@ -904,7 +904,7 @@
904
904
  },
905
905
  "metadata": {
906
906
  "generatedBy": "Agentic QE Fleet",
907
- "fleetVersion": "3.6.0",
907
+ "fleetVersion": "3.6.2",
908
908
  "manifestVersion": "1.3.0",
909
909
  "lastUpdated": "2026-02-04T00:00:00.000Z",
910
910
  "contributors": [
package/README.md CHANGED
@@ -9,35 +9,9 @@
9
9
  <img alt="NPM Downloads" src="https://img.shields.io/npm/dw/agentic-qe">
10
10
 
11
11
 
12
- **V3 (Main)** | [V2 Documentation](v2/docs/V2-README.md) | [Changelog](CHANGELOG.md) | [Contributors](CONTRIBUTORS.md) | [Issues](https://github.com/proffesor-for-testing/agentic-qe/issues) | [Discussions](https://github.com/proffesor-for-testing/agentic-qe/discussions)
12
+ **V3 (Main)** | [V2 Documentation](v2/docs/V2-README.md) | [Release Notes](docs/releases/README.md) | [Changelog](v3/CHANGELOG.md) | [Contributors](CONTRIBUTORS.md) | [Issues](https://github.com/proffesor-for-testing/agentic-qe/issues) | [Discussions](https://github.com/proffesor-for-testing/agentic-qe/discussions)
13
13
 
14
- > **V3** brings Domain-Driven Design architecture, 13 bounded contexts, 59 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 13 domains, and deep integration with [Claude Flow](https://github.com/ruvnet/claude-flow) and [Agentic Flow](https://github.com/ruvnet/agentic-flow).
15
-
16
- ### What's New in v3.6.0
17
-
18
- - **Enterprise Integration Domain** — SOAP/WSDL, SAP RFC/BAPI/IDoc, OData, ESB/middleware, message broker, and Segregation of Duties testing (contributed by [@fndlalit](https://github.com/fndlalit))
19
- - **8 New Agents** — `qe-soap-tester`, `qe-sap-rfc-tester`, `qe-sap-idoc-tester`, `qe-middleware-validator`, `qe-odata-contract-tester`, `qe-message-broker-tester`, `qe-sod-analyzer`, `qe-pentest-validator`
- - **5 New Skills** — `enterprise-integration-testing`, `middleware-testing-patterns`, `wms-testing-patterns`, `observability-testing-patterns`, `pentest-validation` (Tier 3)
- - **Pentest Validation** — Shannon-inspired graduated exploit validation with "No Exploit, No Report" quality gate and 3-tier exploitation
- - **StrongDM Tier 1** — Loop detection + token dashboard for software delivery governance (ADR-062)
- - **Fleet: 59 agents, 75 skills across 13 domains**
-
- ### What's New in v3.5.0
-
- - **Governance ON by Default** - @claude-flow/guidance integration with 7 unbreakable QE invariants (ADR-058)
- - **QCSD 2.0 Complete Lifecycle** - All 4 phases: Ideation → Refinement → Development → CI/CD Verification
- - **Infrastructure Self-Healing Enterprise** - 12 enterprise error signatures (SAP, Salesforce, Payment Gateway)
-
- ### What's New in v3.4.2
-
- - **Skill Validation System** - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
- - **CLI Validation Commands** - `aqe skill report`, `aqe eval run`, regression detection
-
- ### What's New in v3.4.0
-
- - **AG-UI Protocol** - Anthropic's streaming agent-to-user interface with real-time progress updates
- - **A2A Protocol** - Google's agent-to-agent interoperability standard for cross-tool communication
- - **A2UI Components** - Unified UI combining AG-UI streaming with A2A event handling
+ > **V3** brings Domain-Driven Design architecture, 13 bounded contexts, 60 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 13 domains, and deep integration with [Claude Flow](https://github.com/ruvnet/claude-flow) and [Agentic Flow](https://github.com/ruvnet/agentic-flow).
 
  🏗️ **DDD Architecture** | 🧠 **ReasoningBank + Dream Cycles** | 🎯 **TinyDancer Model Routing** | 🔍 **HNSW Vector Search** | 👑 **Queen Coordinator** | 📊 **O(log n) Coverage** | 🔗 **Claude Flow Integration** | 🎯 **13 Bounded Contexts** | 📚 **75 QE Skills** | 🧬 **Coherence Verification** | ✅ **Trust Tiers** | 🛡️ **Governance**
 
@@ -88,7 +62,7 @@ claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flak
 
  **What V3 provides:**
  - ✅ **13 DDD Bounded Contexts**: Organized by business domain (test-generation, coverage-analysis, security-compliance, enterprise-integration, etc.)
- - ✅ **59 QE Agents**: Including Queen Coordinator for hierarchical orchestration (52 main + 7 TDD subagents)
+ - ✅ **60 QE Agents**: Including Queen Coordinator for hierarchical orchestration (53 main + 7 TDD subagents)
  - ✅ **TinyDancer Model Routing**: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
  - ✅ **ReasoningBank Learning**: HNSW-indexed pattern storage with experience replay
  - ✅ **O(log n) Coverage Analysis**: Sublinear algorithms for efficient gap detection
@@ -134,7 +108,7 @@ claude "Assess code quality and provide deployment recommendation"
  | **AI testing tools are expensive** | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
  | **No memory between test runs—every analysis starts from scratch** | ReasoningBank remembers patterns, strategies, and what works for your codebase |
  | **Agents waste tokens reading irrelevant code** | Code Intelligence provides token reduction with semantic search and knowledge graphs |
- | **Quality engineering requires complex coordination** | Queen Coordinator orchestrates 59 agents across 13 domains with consensus and MinCut topology |
+ | **Quality engineering requires complex coordination** | Queen Coordinator orchestrates 60 agents across 13 domains with consensus and MinCut topology |
  | **Tools don't understand your testing frameworks** | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |
 
  ---
@@ -240,7 +214,7 @@ The **qe-queen-coordinator** manages the entire fleet with intelligent task dist
  ```
 
  **Capabilities:**
- - Orchestrate 59 QE agents concurrently across 13 domains
+ - Orchestrate 60 QE agents concurrently across 13 domains
  - TinyDancer 3-tier model routing (Haiku/Sonnet/Opus) with confidence-based decisions
  - Byzantine fault-tolerant consensus for critical quality decisions
  - MinCut graph-based topology optimization for self-healing coordination
@@ -253,6 +227,34 @@ claude "Use qe-queen-coordinator to orchestrate release validation for v2.1.0 wi
 
  ---
 
+ ### 🤝 Agent Teams & Fleet Coordination
+
+ The Queen Coordinator is extended with **Agent Teams** (ADR-064) for hybrid fleet communication:
+
+ | Feature | Description |
+ |---------|-------------|
+ | **Mailbox Messaging** | Direct agent-to-agent and domain-scoped broadcast messaging |
+ | **Distributed Tracing** | TraceContext propagation across messages for end-to-end task visibility |
+ | **Dynamic Scaling** | Workload-based auto-scaling with configurable policies and cooldowns |
+ | **Competing Hypotheses** | Multi-agent root cause investigation with evidence scoring, auto-triggered on critical failures |
+ | **Federation** | Cross-service routing with health monitoring and service discovery |
+ | **Circuit Breakers** | Per-domain fault isolation with automatic recovery |
+ | **Task DAG** | Topological ordering with cycle detection for multi-step workflows |
+
+ **Fleet Tiers** — Activate the level of coordination your project needs:
+
+ | Tier | Agents | Best For |
+ |------|--------|----------|
+ | **Lite** | 1-4 | Small projects, focused tasks |
+ | **Standard** | 5-10 | Team projects, multi-domain coordination |
+ | **Full** | 11-15 | Enterprise, cross-fleet federation |
+
+ ```bash
+ claude "Use qe-queen-coordinator with agent teams to investigate flaky test failures across test-execution and defect-intelligence domains"
+ ```
+
+ ---
+
  ### 🧠 ReasoningBank Learning System
 
  V3 agents learn and improve through the **ReasoningBank** pattern storage:
@@ -466,17 +468,17 @@ npx @claude-flow/cli@latest agent spawn -t qe-test-architect --name test-gen
 
  ---
 
- ### 📊 59 Specialized QE Agents
+ ### 📊 60 Specialized QE Agents
 
  | Category | Count | Highlights |
  |----------|-------|------------|
- | **Main QE Agents** | 52 | Test generation, coverage, security, performance, accessibility, enterprise integration, pentest validation |
+ | **Main QE Agents** | 53 | Test generation, coverage, security, performance, accessibility, enterprise integration, pentest validation |
  | **TDD Subagents** | 7 | RED/GREEN/REFACTOR with code review |
 
  **V2 Backward Compatibility**: All V2 agents map to V3 equivalents automatically.
 
  <details>
- <summary><b>📋 View All Main QE Agents (52)</b></summary>
+ <summary><b>📋 View All Main QE Agents (53)</b></summary>
 
  | Agent | Domain | Purpose |
  |-------|--------|---------|
@@ -524,6 +526,7 @@ npx @claude-flow/cli@latest agent spawn -t qe-test-architect --name test-gen
  | qe-product-factors-assessor | quality-assessment | SFDIPOT product factors analysis |
  | qe-test-idea-rewriter | test-generation | Transform passive tests to active actions |
  | qe-quality-criteria-recommender | quality-assessment | HTSM v6.3 Quality Criteria analysis |
+ | qe-devils-advocate | quality-assessment | Adversarial review of agent outputs |
 
  </details>
 
@@ -794,7 +797,7 @@ agentic-qe/
  │ │ ├── mcp/ # MCP server
  │ │ └── cli/ # V3 CLI
  │ ├── tests/ # 5,600+ tests
- │ └── assets/agents/ # 59 QE agent definitions (52 main + 7 subagents)
+ │ └── assets/agents/ # 60 QE agent definitions (53 main + 7 subagents)
  ├── v2/ # V2 Implementation (Legacy)
  │ ├── src/ # V2 source code
  │ ├── tests/ # V2 tests
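Reviewer note: the new "Task DAG" row in the README hunk above describes topological ordering with cycle detection, but this diff does not show the implementation. The following is a minimal sketch of that general technique (Kahn's algorithm), not the package's actual code. The `TaskNode` shape and the `topologicalOrder` name are hypothetical and chosen only for illustration.

```typescript
// Hypothetical task shape; the package's real Task DAG types are not visible in this diff.
interface TaskNode {
  id: string;
  dependsOn: string[];
}

// Kahn's algorithm: returns task ids in an order that respects dependencies.
// Throws if a dependency is unknown or if the graph contains a cycle.
function topologicalOrder(tasks: TaskNode[]): string[] {
  const inDegree = new Map<string, number>();
  const dependents = new Map<string, string[]>();

  for (const task of tasks) {
    inDegree.set(task.id, 0);
    dependents.set(task.id, []);
  }

  for (const task of tasks) {
    for (const dep of task.dependsOn) {
      const list = dependents.get(dep);
      if (list === undefined) {
        throw new Error(`Task "${task.id}" depends on unknown task "${dep}"`);
      }
      list.push(task.id);
      inDegree.set(task.id, (inDegree.get(task.id) ?? 0) + 1);
    }
  }

  // Tasks with no unmet dependencies are ready to run.
  const ready = tasks.filter((t) => inDegree.get(t.id) === 0).map((t) => t.id);
  const order: string[] = [];

  while (ready.length > 0) {
    const id = ready.shift() as string;
    order.push(id);
    for (const next of dependents.get(id) ?? []) {
      const remaining = (inDegree.get(next) ?? 0) - 1;
      inDegree.set(next, remaining);
      if (remaining === 0) ready.push(next);
    }
  }

  // Any task left unscheduled sits on a cycle (or depends on one).
  if (order.length !== tasks.length) {
    throw new Error("Cycle detected in task graph");
  }
  return order;
}
```

A scheduler built on this idea would presumably dispatch each task once its in-degree reaches zero rather than computing the whole order up front; the sketch only shows the ordering and the cycle check.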
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "agentic-qe",
- "version": "3.6.0",
+ "version": "3.6.2",
  "description": "Agentic Quality Engineering V3 - Domain-Driven Design Architecture with 13 Bounded Contexts, O(log n) coverage analysis, ReasoningBank learning, 59 specialized QE agents, mathematical Coherence verification, deep Claude Flow integration",
  "main": "./v3/dist/index.js",
  "types": "./v3/dist/index.d.ts",
package/scripts/cloud-db-config.json CHANGED
@@ -19,7 +19,7 @@
  }
  },
  "sync": {
- "enabled": false,
+ "enabled": true,
  "mode": "incremental",
  "interval": "5m",
  "tables": [
package/v3/CHANGELOG.md CHANGED
@@ -5,6 +5,50 @@ All notable changes to Agentic QE will be documented in this file.
  The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
  and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+ ## [3.6.2] - 2026-02-10
+
+ ### Fixed
+
+ - **YAML parser empty array crash (Issue #244)** — `aqe init --auto` no longer fails on re-runs when `config.yaml` has empty array fields like `disabled:` with no items. The custom YAML parser now normalizes known array fields after parsing, and `mergeConfigs()` uses defensive `Array.isArray()` checks.
+ - **Agent parse errors on helper files (Issue #243)** — Helper reference files (`htsm-categories.md`, `evidence-classification.md`) and the generated `README.md` are no longer placed inside `.claude/agents/v3/` where `claude doctor` would incorrectly parse them as agent definitions. Helpers now install to `.claude/helpers/v3/` and the agents index writes to `.claude/docs/v3-agents-index.md`.
+
+ ### Changed
+
+ - **Helper files location** — Agent helper/reference files (quality-criteria templates, SFDIPOT templates) now install to `.claude/helpers/v3/` instead of `.claude/agents/v3/helpers/`. Updated all path references in `quality-criteria-service.ts` and agent definitions.
+
+ ## [3.6.1] - 2026-02-09
+
+ ### Added
+
+ - **Agent Teams Integration (ADR-064)** — Hybrid fleet architecture layering Claude Code Agent Teams communication patterns on the existing Queen Coordinator. 4-phase implementation: Foundation, Hybrid Architecture, Learning & Observability, Advanced Patterns.
+ - **Agent Teams Adapter** — Direct mailbox messaging between agents with domain-scoped teams (2-4 agents per domain), team lead/teammate model, and subscription-based event delivery.
+ - **Fleet Tier Selector** — Tiered fleet activation (smoke/standard/deep/crisis) that controls agent count and token costs based on trigger context (commit, PR, release, incident).
+ - **Task Dependency DAG** — Topological ordering with cycle detection for multi-step task workflows. DAGScheduler for automated execution of ready tasks.
+ - **TeammateIdle Hook** — Auto-assigns pending tasks to idle agents, reducing Queen bottleneck for task distribution.
+ - **TaskCompleted Hook** — Extracts patterns from completed tasks and trains them into ReasoningBank automatically. Quality gate validation with exit code 2 rejection.
+ - **Domain Circuit Breakers** — Per-domain fault isolation with configurable failure thresholds, half-open recovery probing, and criticality-based configs.
+ - **Domain Team Manager** — Creates and manages domain-scoped agent teams with health monitoring, scaling, and rebalancing.
+ - **HNSW Graph Construction** — Real O(log n) HNSW insert and search in unified memory, replacing the O(n) linear scan stub.
+ - **Distributed Tracing** — TraceCollector with W3C-style TraceContext propagation encoded into AgentMessage correlationId fields. Queen traces full task lifecycles.
+ - **Competing Hypotheses** — HypothesisManager for multi-agent root cause investigation with evidence scoring, confidence tracking, and convergence (evidence-scoring, unanimous, majority, timeout). Auto-triggered on p0/p1 task failures.
+ - **Cross-Fleet Federation** — FederationMailbox with service registry, domain-based routing, health monitoring via heartbeats, and graceful degradation for unreachable services.
+ - **Dynamic Agent Scaling** — DynamicScaler with workload metrics collection, configurable scaling policies (queue depth, idle ratio, error rate thresholds), cooldown enforcement, and executor callbacks. Wired into Queen's metrics loop.
+ - **ReasoningBank Pattern Store Adapter** — Bridges TaskCompletedHook pattern extraction to QEReasoningBank storage with domain detection, type mapping, and confidence propagation.
+ - **promotePattern() Implementation** — Completes the ReasoningBank promotion stub: delegates to PatternStore.promote() and publishes pattern:promoted events.
+ - **Devil's Advocate Agent** — `qe-devils-advocate` agent that challenges other agents' outputs by finding gaps and questioning assumptions.
+ - **397+ New Tests** — 282 coordination tests, 67 hook tests, 48 learning tests covering all ADR-064 phases including adapter tracing integration and latency benchmarks.
+
+ ### Fixed
+
+ - **6 CodeQL Alerts** — Resolved security alerts in enterprise-integration services (input validation, type safety).
+ - **Pattern Training Pipeline** — Connected the disconnected TaskCompletedHook → ReasoningBank pipeline so patterns are automatically trained on task completion.
+ - **Queen Operational Wiring** — All Phase 3+4 modules (tracing, dynamic scaler, hypotheses) are now called by Queen's operational flow, not just initialized as shelf-ware.
+
+ ### Changed
+
+ - **Queen Coordinator** — Extended with tracing (startTrace on submitTask, completeSpan/failSpan on completion/failure), dynamic scaling (metrics feed + evaluate + execute in metrics loop), and competing hypotheses (auto-investigation on critical failures).
+ - **Agent Teams Adapter** — sendMessage() and broadcast() now encode TraceContext into correlationId when provided, enabling end-to-end distributed tracing.
+
  ## [3.6.0] - 2026-02-08
 
  ### Added
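Reviewer note: the 3.6.2 entry for Issue #244 says the parser now normalizes known array fields after parsing and that `mergeConfigs()` uses defensive `Array.isArray()` checks. The code for that fix is not part of this section of the diff, so the following is only a sketch of the general pattern under stated assumptions. `KNOWN_ARRAY_FIELDS`, `normalizeArrayFields`, and `mergeArrayField` are hypothetical names, and the field list is invented for illustration.

```typescript
// Hypothetical config shape and field list; the real ones are not shown in this diff.
type Config = Record<string, unknown>;
const KNOWN_ARRAY_FIELDS = ["disabled", "enabled"];

// In YAML, a bare `disabled:` key with no items has a null value, and a
// hand-written parser might also yield "" or {}. Coerce any non-array value
// on a known array field to an empty array so later code can iterate safely.
function normalizeArrayFields(config: Config): Config {
  const result: Config = { ...config };
  for (const field of KNOWN_ARRAY_FIELDS) {
    if (field in result && !Array.isArray(result[field])) {
      result[field] = [];
    }
  }
  return result;
}

// Defensive merge of one array field: treat anything that is not an array as
// empty, then append override items that are not already present in the base.
function mergeArrayField(base: unknown, override: unknown): unknown[] {
  const left = Array.isArray(base) ? base : [];
  const right = Array.isArray(override) ? override : [];
  return left.concat(right.filter((item) => left.indexOf(item) === -1));
}

// Usage: a config re-read from disk where `disabled:` had no items.
const parsed: Config = { name: "aqe", disabled: null };
const normalized = normalizeArrayFields(parsed);
const mergedDisabled = mergeArrayField(normalized.disabled, ["qe-example"]);
console.log(mergedDisabled);
```

Normalizing once, directly after parsing, keeps the rest of the code free of null checks; the `Array.isArray()` guard in the merge step is then a second line of defense for values that arrive from other sources.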
package/v3/README.md CHANGED
@@ -5,14 +5,14 @@
  [![TypeScript](https://img.shields.io/badge/TypeScript-5.0+-blue.svg)](https://www.typescriptlang.org/)
  [![Node.js](https://img.shields.io/badge/Node.js-18+-green.svg)](https://nodejs.org/)
 
- > Domain-Driven Quality Engineering with Mathematical Coherence Verification, 13 Bounded Contexts, 59 Specialized QE Agents, 75 QE Skills, and ReasoningBank Learning
+ > Domain-Driven Quality Engineering with Mathematical Coherence Verification, 13 Bounded Contexts, 60 Specialized QE Agents, 75 QE Skills, and ReasoningBank Learning
 
  ### Key Features
 
  | Feature | Description |
  |---------|-------------|
  | **75 QE Skills** | Quality engineering skills with 4-tier trust validation system |
- | **59 QE Agents** | Specialized agents for test generation, security, coverage, enterprise integration, and more |
+ | **60 QE Agents** | Specialized agents for test generation, security, coverage, enterprise integration, and more |
  | **13 DDD Domains** | Modular bounded contexts for all quality engineering needs |
  | **MCP Integration** | Full Claude Code integration via Model Context Protocol |
  | **AG-UI/A2A Protocols** | Industry-standard agent streaming and interoperability |
@@ -48,7 +48,7 @@ npx aqe test generate src/
 
  ## Why Agentic QE?
 
- - **59 Specialized QE Agents** - Domain-focused quality engineering agents (52 main + 7 subagents)
+ - **60 Specialized QE Agents** - Domain-focused quality engineering agents (53 main + 7 subagents)
  - **75 QE Skills** - 46 Tier 3 verified + 29 additional (QCSD swarms, n8n testing, enterprise integration, qe-* domains)
  - **13 DDD Bounded Contexts** - Modular, extensible architecture
  - **TinyDancer Model Routing** - 3-tier intelligent routing for cost optimization
@@ -632,7 +632,7 @@ console.log(`Quality gate: ${gate.value.passed ? 'PASSED' : 'FAILED'}`);
  | Module System | CommonJS | ESM |
  | Memory | SQLite only | HNSW + SQLite hybrid |
  | Learning | Basic patterns | ReasoningBank + SONA + Dream Cycles |
- | Agents | 32 | 59 QE agents (52 main + 7 subagents) |
+ | Agents | 32 | 60 QE agents (53 main + 7 subagents) |
  | Skills | 35 | 75 QE skills (46 Tier 3 + 29 additional) |
  | Coverage | O(n) | O(log n) |
  | Pattern Search | Linear | O(log n) HNSW indexing |
@@ -706,9 +706,9 @@ See the [Migration Guide](./docs/MIGRATION-GUIDE.md) for detailed instructions a
  }
  ```
 
- ## 59 QE Agents
+ ## 60 QE Agents
 
- Agentic QE includes 59 specialized quality engineering agents (52 main + 7 subagents) organized by domain:
+ Agentic QE includes 60 specialized quality engineering agents (53 main + 7 subagents) organized by domain:
 
  ### Test Generation Domain
  `qe-test-architect`, `qe-tdd-specialist`, `qe-tdd-red`, `qe-tdd-green`, `qe-tdd-refactor`, `qe-property-tester`, `qe-mutation-tester`, `qe-bdd-generator`
@@ -720,7 +720,7 @@ Agentic QE includes 59 specialized quality engineering agents (52 main + 7 subag
  `qe-coverage-specialist`, `qe-gap-detector`, `qe-risk-analyzer`
 
  ### Quality Assessment Domain
- `qe-quality-gate`, `qe-metrics-optimizer`, `qe-deployment-advisor`
+ `qe-quality-gate`, `qe-metrics-optimizer`, `qe-deployment-advisor`, `qe-devils-advocate`
 
  ### Defect Intelligence Domain
  `qe-defect-intelligence`, `qe-regression-analyzer`, `qe-root-cause-analyzer`