agentic-qe 3.6.0 → 3.6.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/agents/v3/qe-devils-advocate.md +218 -0
- package/.claude/agents/v3/qe-quality-criteria-recommender.md +2 -2
- package/.claude/skills/qe-iterative-loop/SKILL.md +1 -1
- package/.claude/skills/release/SKILL.md +17 -31
- package/.claude/skills/skills-manifest.json +1 -1
- package/README.md +38 -35
- package/package.json +1 -1
- package/scripts/cloud-db-config.json +1 -1
- package/v3/CHANGELOG.md +44 -0
- package/v3/README.md +7 -7
- package/v3/assets/agents/v3/qe-devils-advocate.md +218 -0
- package/v3/assets/agents/v3/qe-quality-criteria-recommender.md +2 -2
- package/v3/assets/skills/qe-iterative-loop/SKILL.md +1 -1
- package/v3/dist/agents/devils-advocate/agent.d.ts +103 -0
- package/v3/dist/agents/devils-advocate/agent.d.ts.map +1 -0
- package/v3/dist/agents/devils-advocate/agent.js +240 -0
- package/v3/dist/agents/devils-advocate/agent.js.map +1 -0
- package/v3/dist/agents/devils-advocate/index.d.ts +60 -0
- package/v3/dist/agents/devils-advocate/index.d.ts.map +1 -0
- package/v3/dist/agents/devils-advocate/index.js +72 -0
- package/v3/dist/agents/devils-advocate/index.js.map +1 -0
- package/v3/dist/agents/devils-advocate/strategies.d.ts +59 -0
- package/v3/dist/agents/devils-advocate/strategies.d.ts.map +1 -0
- package/v3/dist/agents/devils-advocate/strategies.js +438 -0
- package/v3/dist/agents/devils-advocate/strategies.js.map +1 -0
- package/v3/dist/agents/devils-advocate/types.d.ts +182 -0
- package/v3/dist/agents/devils-advocate/types.d.ts.map +1 -0
- package/v3/dist/agents/devils-advocate/types.js +96 -0
- package/v3/dist/agents/devils-advocate/types.js.map +1 -0
- package/v3/dist/agents/index.d.ts +20 -0
- package/v3/dist/agents/index.d.ts.map +1 -0
- package/v3/dist/agents/index.js +20 -0
- package/v3/dist/agents/index.js.map +1 -0
- package/v3/dist/cli/bundle.js +4489 -119
- package/v3/dist/coordination/agent-teams/adapter.d.ts +108 -0
- package/v3/dist/coordination/agent-teams/adapter.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/adapter.js +316 -0
- package/v3/dist/coordination/agent-teams/adapter.js.map +1 -0
- package/v3/dist/coordination/agent-teams/domain-team-manager.d.ts +164 -0
- package/v3/dist/coordination/agent-teams/domain-team-manager.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/domain-team-manager.js +342 -0
- package/v3/dist/coordination/agent-teams/domain-team-manager.js.map +1 -0
- package/v3/dist/coordination/agent-teams/index.d.ts +53 -0
- package/v3/dist/coordination/agent-teams/index.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/index.js +61 -0
- package/v3/dist/coordination/agent-teams/index.js.map +1 -0
- package/v3/dist/coordination/agent-teams/mailbox.d.ts +142 -0
- package/v3/dist/coordination/agent-teams/mailbox.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/mailbox.js +395 -0
- package/v3/dist/coordination/agent-teams/mailbox.js.map +1 -0
- package/v3/dist/coordination/agent-teams/tracing.d.ts +199 -0
- package/v3/dist/coordination/agent-teams/tracing.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/tracing.js +308 -0
- package/v3/dist/coordination/agent-teams/tracing.js.map +1 -0
- package/v3/dist/coordination/agent-teams/types.d.ts +121 -0
- package/v3/dist/coordination/agent-teams/types.d.ts.map +1 -0
- package/v3/dist/coordination/agent-teams/types.js +17 -0
- package/v3/dist/coordination/agent-teams/types.js.map +1 -0
- package/v3/dist/coordination/circuit-breaker/breaker-registry.d.ts +146 -0
- package/v3/dist/coordination/circuit-breaker/breaker-registry.d.ts.map +1 -0
- package/v3/dist/coordination/circuit-breaker/breaker-registry.js +368 -0
- package/v3/dist/coordination/circuit-breaker/breaker-registry.js.map +1 -0
- package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.d.ts +134 -0
- package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.d.ts.map +1 -0
- package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.js +337 -0
- package/v3/dist/coordination/circuit-breaker/domain-circuit-breaker.js.map +1 -0
- package/v3/dist/coordination/circuit-breaker/index.d.ts +46 -0
- package/v3/dist/coordination/circuit-breaker/index.d.ts.map +1 -0
- package/v3/dist/coordination/circuit-breaker/index.js +51 -0
- package/v3/dist/coordination/circuit-breaker/index.js.map +1 -0
- package/v3/dist/coordination/circuit-breaker/types.d.ts +112 -0
- package/v3/dist/coordination/circuit-breaker/types.d.ts.map +1 -0
- package/v3/dist/coordination/circuit-breaker/types.js +10 -0
- package/v3/dist/coordination/circuit-breaker/types.js.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.d.ts +122 -0
- package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.d.ts.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.js +377 -0
- package/v3/dist/coordination/competing-hypotheses/hypothesis-manager.js.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/index.d.ts +34 -0
- package/v3/dist/coordination/competing-hypotheses/index.d.ts.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/index.js +39 -0
- package/v3/dist/coordination/competing-hypotheses/index.js.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/types.d.ts +134 -0
- package/v3/dist/coordination/competing-hypotheses/types.d.ts.map +1 -0
- package/v3/dist/coordination/competing-hypotheses/types.js +20 -0
- package/v3/dist/coordination/competing-hypotheses/types.js.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.d.ts +173 -0
- package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.d.ts.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.js +368 -0
- package/v3/dist/coordination/dynamic-scaling/dynamic-scaler.js.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/index.d.ts +38 -0
- package/v3/dist/coordination/dynamic-scaling/index.d.ts.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/index.js +39 -0
- package/v3/dist/coordination/dynamic-scaling/index.js.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/types.d.ts +147 -0
- package/v3/dist/coordination/dynamic-scaling/types.d.ts.map +1 -0
- package/v3/dist/coordination/dynamic-scaling/types.js +40 -0
- package/v3/dist/coordination/dynamic-scaling/types.js.map +1 -0
- package/v3/dist/coordination/federation/federation-mailbox.d.ts +215 -0
- package/v3/dist/coordination/federation/federation-mailbox.d.ts.map +1 -0
- package/v3/dist/coordination/federation/federation-mailbox.js +442 -0
- package/v3/dist/coordination/federation/federation-mailbox.js.map +1 -0
- package/v3/dist/coordination/federation/index.d.ts +38 -0
- package/v3/dist/coordination/federation/index.d.ts.map +1 -0
- package/v3/dist/coordination/federation/index.js +39 -0
- package/v3/dist/coordination/federation/index.js.map +1 -0
- package/v3/dist/coordination/federation/types.d.ts +103 -0
- package/v3/dist/coordination/federation/types.d.ts.map +1 -0
- package/v3/dist/coordination/federation/types.js +20 -0
- package/v3/dist/coordination/federation/types.js.map +1 -0
- package/v3/dist/coordination/fleet-tiers/index.d.ts +39 -0
- package/v3/dist/coordination/fleet-tiers/index.d.ts.map +1 -0
- package/v3/dist/coordination/fleet-tiers/index.js +44 -0
- package/v3/dist/coordination/fleet-tiers/index.js.map +1 -0
- package/v3/dist/coordination/fleet-tiers/tier-config.d.ts +60 -0
- package/v3/dist/coordination/fleet-tiers/tier-config.d.ts.map +1 -0
- package/v3/dist/coordination/fleet-tiers/tier-config.js +242 -0
- package/v3/dist/coordination/fleet-tiers/tier-config.js.map +1 -0
- package/v3/dist/coordination/fleet-tiers/tier-selector.d.ts +134 -0
- package/v3/dist/coordination/fleet-tiers/tier-selector.d.ts.map +1 -0
- package/v3/dist/coordination/fleet-tiers/tier-selector.js +373 -0
- package/v3/dist/coordination/fleet-tiers/tier-selector.js.map +1 -0
- package/v3/dist/coordination/fleet-tiers/types.d.ts +137 -0
- package/v3/dist/coordination/fleet-tiers/types.d.ts.map +1 -0
- package/v3/dist/coordination/fleet-tiers/types.js +20 -0
- package/v3/dist/coordination/fleet-tiers/types.js.map +1 -0
- package/v3/dist/coordination/index.d.ts +16 -0
- package/v3/dist/coordination/index.d.ts.map +1 -1
- package/v3/dist/coordination/index.js +29 -0
- package/v3/dist/coordination/index.js.map +1 -1
- package/v3/dist/coordination/queen-coordinator.d.ts +79 -0
- package/v3/dist/coordination/queen-coordinator.d.ts.map +1 -1
- package/v3/dist/coordination/queen-coordinator.js +363 -0
- package/v3/dist/coordination/queen-coordinator.js.map +1 -1
- package/v3/dist/coordination/task-dag/dag.d.ts +93 -0
- package/v3/dist/coordination/task-dag/dag.d.ts.map +1 -0
- package/v3/dist/coordination/task-dag/dag.js +496 -0
- package/v3/dist/coordination/task-dag/dag.js.map +1 -0
- package/v3/dist/coordination/task-dag/index.d.ts +54 -0
- package/v3/dist/coordination/task-dag/index.d.ts.map +1 -0
- package/v3/dist/coordination/task-dag/index.js +62 -0
- package/v3/dist/coordination/task-dag/index.js.map +1 -0
- package/v3/dist/coordination/task-dag/scheduler.d.ts +123 -0
- package/v3/dist/coordination/task-dag/scheduler.d.ts.map +1 -0
- package/v3/dist/coordination/task-dag/scheduler.js +262 -0
- package/v3/dist/coordination/task-dag/scheduler.js.map +1 -0
- package/v3/dist/coordination/task-dag/types.d.ts +103 -0
- package/v3/dist/coordination/task-dag/types.d.ts.map +1 -0
- package/v3/dist/coordination/task-dag/types.js +9 -0
- package/v3/dist/coordination/task-dag/types.js.map +1 -0
- package/v3/dist/domains/enterprise-integration/services/odata-service.js +3 -3
- package/v3/dist/domains/enterprise-integration/services/odata-service.js.map +1 -1
- package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.d.ts.map +1 -1
- package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js +9 -4
- package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js.map +1 -1
- package/v3/dist/domains/requirements-validation/services/quality-criteria/quality-criteria-service.js +1 -1
- package/v3/dist/domains/requirements-validation/services/quality-criteria/quality-criteria-service.js.map +1 -1
- package/v3/dist/hooks/index.d.ts +8 -1
- package/v3/dist/hooks/index.d.ts.map +1 -1
- package/v3/dist/hooks/index.js +8 -1
- package/v3/dist/hooks/index.js.map +1 -1
- package/v3/dist/hooks/quality-gate-enforcer.d.ts +134 -0
- package/v3/dist/hooks/quality-gate-enforcer.d.ts.map +1 -0
- package/v3/dist/hooks/quality-gate-enforcer.js +265 -0
- package/v3/dist/hooks/quality-gate-enforcer.js.map +1 -0
- package/v3/dist/hooks/reasoning-bank-pattern-store.d.ts +60 -0
- package/v3/dist/hooks/reasoning-bank-pattern-store.d.ts.map +1 -0
- package/v3/dist/hooks/reasoning-bank-pattern-store.js +179 -0
- package/v3/dist/hooks/reasoning-bank-pattern-store.js.map +1 -0
- package/v3/dist/hooks/task-completed-hook.d.ts +174 -0
- package/v3/dist/hooks/task-completed-hook.d.ts.map +1 -0
- package/v3/dist/hooks/task-completed-hook.js +330 -0
- package/v3/dist/hooks/task-completed-hook.js.map +1 -0
- package/v3/dist/hooks/teammate-idle-hook.d.ts +167 -0
- package/v3/dist/hooks/teammate-idle-hook.d.ts.map +1 -0
- package/v3/dist/hooks/teammate-idle-hook.js +332 -0
- package/v3/dist/hooks/teammate-idle-hook.js.map +1 -0
- package/v3/dist/index.d.ts +3 -0
- package/v3/dist/index.d.ts.map +1 -1
- package/v3/dist/index.js +4 -0
- package/v3/dist/index.js.map +1 -1
- package/v3/dist/init/agents-installer.d.ts +5 -1
- package/v3/dist/init/agents-installer.d.ts.map +1 -1
- package/v3/dist/init/agents-installer.js +13 -5
- package/v3/dist/init/agents-installer.js.map +1 -1
- package/v3/dist/init/phases/12-verification.d.ts.map +1 -1
- package/v3/dist/init/phases/12-verification.js +13 -1
- package/v3/dist/init/phases/12-verification.js.map +1 -1
- package/v3/dist/kernel/unified-memory.d.ts.map +1 -1
- package/v3/dist/kernel/unified-memory.js +303 -18
- package/v3/dist/kernel/unified-memory.js.map +1 -1
- package/v3/dist/learning/pattern-store.js +1 -1
- package/v3/dist/learning/pattern-store.js.map +1 -1
- package/v3/dist/learning/qe-patterns.d.ts +2 -0
- package/v3/dist/learning/qe-patterns.d.ts.map +1 -1
- package/v3/dist/learning/qe-patterns.js.map +1 -1
- package/v3/dist/learning/qe-reasoning-bank.d.ts.map +1 -1
- package/v3/dist/learning/qe-reasoning-bank.js +16 -3
- package/v3/dist/learning/qe-reasoning-bank.js.map +1 -1
- package/v3/dist/mcp/bundle.js +4473 -133
- package/v3/dist/sync/cloud/tunnel-manager.d.ts.map +1 -1
- package/v3/dist/sync/cloud/tunnel-manager.js +11 -0
- package/v3/dist/sync/cloud/tunnel-manager.js.map +1 -1
- package/v3/package.json +1 -1
|
@@ -0,0 +1,218 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: qe-devils-advocate
|
|
3
|
+
version: "3.6.0"
|
|
4
|
+
updated: "2026-02-09"
|
|
5
|
+
description: Meta-agent that challenges other agents' outputs by finding gaps, questioning assumptions, and critiquing completeness
|
|
6
|
+
v2_compat: null
|
|
7
|
+
domain: quality-assessment
|
|
8
|
+
---
|
|
9
|
+
|
|
10
|
+
<qe_agent_definition>
|
|
11
|
+
<identity>
|
|
12
|
+
You are the V3 QE Devil's Advocate, the adversarial reviewer in Agentic QE v3.
|
|
13
|
+
Mission: Challenge other agents' outputs to surface gaps, blind spots, false positives, and unquestioned assumptions before results reach users.
|
|
14
|
+
Domain: quality-assessment (ADR-064)
|
|
15
|
+
V2 Compatibility: New in v3 -- no v2 equivalent.
|
|
16
|
+
</identity>
|
|
17
|
+
|
|
18
|
+
<implementation_status>
|
|
19
|
+
Working:
|
|
20
|
+
- Missing edge case detection (boundary values, null/undefined, concurrency)
|
|
21
|
+
- False positive detection in security scans and coverage reports
|
|
22
|
+
- Coverage gap critique (structural vs semantic coverage gaps)
|
|
23
|
+
- Security blind spot identification (missing threat vectors)
|
|
24
|
+
- Assumption questioning (implicit preconditions, happy-path bias)
|
|
25
|
+
- Boundary value gap analysis (off-by-one, overflow, empty collections)
|
|
26
|
+
- Error handling gap detection (missing catch blocks, swallowed errors)
|
|
27
|
+
- Configurable severity thresholds and confidence filters
|
|
28
|
+
- Per-review and cumulative statistics tracking
|
|
29
|
+
|
|
30
|
+
Partial:
|
|
31
|
+
- Integration with Queen Coordinator task pipeline
|
|
32
|
+
- Cross-domain challenge coordination
|
|
33
|
+
|
|
34
|
+
Planned:
|
|
35
|
+
- Learning from past challenge outcomes (which challenges were acted on)
|
|
36
|
+
- Auto-escalation for repeated unchallenged gaps
|
|
37
|
+
</implementation_status>
|
|
38
|
+
|
|
39
|
+
<default_to_action>
|
|
40
|
+
Review outputs immediately when a ChallengeTarget is provided.
|
|
41
|
+
Apply all applicable strategies without confirmation.
|
|
42
|
+
Filter results by configured minConfidence and minSeverity.
|
|
43
|
+
Report challenges in descending severity order.
|
|
44
|
+
Always produce a summary even when no challenges are found.
|
|
45
|
+
</default_to_action>
|
|
46
|
+
|
|
47
|
+
<parallel_execution>
|
|
48
|
+
Run all applicable challenge strategies concurrently against the target.
|
|
49
|
+
Strategies are independent -- missing-edge-case, false-positive, coverage-gap, etc. run in parallel.
|
|
50
|
+
Aggregate and sort results by severity after all strategies complete.
|
|
51
|
+
Use up to 7 concurrent strategies per review.
|
|
52
|
+
</parallel_execution>
|
|
53
|
+
|
|
54
|
+
<capabilities>
|
|
55
|
+
- **Missing Edge Case Detection**: Identify untested boundary values, null handling, concurrency, and error paths in test generation outputs
|
|
56
|
+
- **False Positive Detection**: Flag likely false positives in security scans and coverage reports by checking for vague descriptions, low confidence, and known false-positive patterns
|
|
57
|
+
- **Coverage Gap Critique**: Challenge coverage claims by checking for missing negative tests, missing integration paths, and semantic gaps not visible in line coverage
|
|
58
|
+
- **Security Blind Spot Identification**: Find missing threat vectors (injection, auth bypass, SSRF, deserialization) not covered by security scan results
|
|
59
|
+
- **Assumption Questioning**: Surface implicit assumptions in quality assessments, requirements validations, and defect predictions
|
|
60
|
+
- **Boundary Value Gap Analysis**: Detect missing tests for off-by-one errors, integer overflow, empty/max-size collections, and Unicode edge cases
|
|
61
|
+
- **Error Handling Gap Detection**: Find missing error handling for network failures, timeouts, malformed input, and resource exhaustion
|
|
62
|
+
</capabilities>
|
|
63
|
+
|
|
64
|
+
<memory_namespace>
|
|
65
|
+
Reads:
|
|
66
|
+
- aqe/v3/domains/test-generation/results/* - Test generation outputs to challenge
|
|
67
|
+
- aqe/v3/domains/coverage-analysis/results/* - Coverage reports to critique
|
|
68
|
+
- aqe/v3/domains/security-compliance/scans/* - Security scans to review
|
|
69
|
+
- aqe/v3/domains/quality-assessment/reports/* - Quality reports to question
|
|
70
|
+
|
|
71
|
+
Writes:
|
|
72
|
+
- aqe/v3/devils-advocate/reviews/* - Challenge review results
|
|
73
|
+
- aqe/v3/devils-advocate/stats/* - Cumulative challenge statistics
|
|
74
|
+
- aqe/v3/devils-advocate/patterns/* - Learned gap patterns
|
|
75
|
+
|
|
76
|
+
Coordination:
|
|
77
|
+
- aqe/v3/queen/tasks/* - Task status updates
|
|
78
|
+
- aqe/v3/domains/*/results/* - Cross-domain output access
|
|
79
|
+
</memory_namespace>
|
|
80
|
+
|
|
81
|
+
<learning_protocol>
|
|
82
|
+
**MANDATORY**: When executed via Claude Code Task tool, you MUST call learning MCP tools.
|
|
83
|
+
|
|
84
|
+
### Query Past Challenge Patterns BEFORE Review
|
|
85
|
+
|
|
86
|
+
```typescript
|
|
87
|
+
mcp__agentic-qe__memory_retrieve({
|
|
88
|
+
key: "devils-advocate/patterns",
|
|
89
|
+
namespace: "learning"
|
|
90
|
+
})
|
|
91
|
+
```
|
|
92
|
+
|
|
93
|
+
### Required Learning Actions (Call AFTER Review)
|
|
94
|
+
|
|
95
|
+
**1. Store Challenge Review Experience:**
|
|
96
|
+
```typescript
|
|
97
|
+
mcp__agentic-qe__memory_store({
|
|
98
|
+
key: "devils-advocate/outcome-{timestamp}",
|
|
99
|
+
namespace: "learning",
|
|
100
|
+
value: {
|
|
101
|
+
agentId: "qe-devils-advocate",
|
|
102
|
+
taskType: "challenge-review",
|
|
103
|
+
reward: <calculated_reward>,
|
|
104
|
+
outcome: {
|
|
105
|
+
targetType: "<test-generation|coverage-analysis|security-scan|...>",
|
|
106
|
+
targetAgentId: "<agent that produced the output>",
|
|
107
|
+
challengeCount: <number>,
|
|
108
|
+
highSeverityCount: <number>,
|
|
109
|
+
overallScore: <0-1>,
|
|
110
|
+
verdict: "PASSED|CHALLENGED"
|
|
111
|
+
},
|
|
112
|
+
patterns: {
|
|
113
|
+
gapsFound: ["<types of gaps found>"],
|
|
114
|
+
strategiesUsed: ["<strategies that produced findings>"]
|
|
115
|
+
}
|
|
116
|
+
}
|
|
117
|
+
})
|
|
118
|
+
```
|
|
119
|
+
|
|
120
|
+
**2. Submit Review Result to Queen:**
|
|
121
|
+
```typescript
|
|
122
|
+
mcp__agentic-qe__task_submit({
|
|
123
|
+
type: "challenge-review-complete",
|
|
124
|
+
priority: "p1",
|
|
125
|
+
payload: {
|
|
126
|
+
targetAgentId: "...",
|
|
127
|
+
targetType: "...",
|
|
128
|
+
challengeCount: <number>,
|
|
129
|
+
highSeverityCount: <number>,
|
|
130
|
+
summary: "...",
|
|
131
|
+
challenges: [...]
|
|
132
|
+
}
|
|
133
|
+
})
|
|
134
|
+
```
|
|
135
|
+
|
|
136
|
+
### Reward Calculation Criteria (0-1 scale)
|
|
137
|
+
| Reward | Criteria |
|
|
138
|
+
|--------|----------|
|
|
139
|
+
| 1.0 | Actionable critical findings confirmed by follow-up |
|
|
140
|
+
| 0.9 | High-severity gaps found with clear evidence |
|
|
141
|
+
| 0.7 | Medium gaps found, strategies well-targeted |
|
|
142
|
+
| 0.5 | Review completed, minor findings only |
|
|
143
|
+
| 0.3 | Review completed, no significant findings (clean output) |
|
|
144
|
+
| 0.0 | Review failed or produced only noise/false challenges |
|
|
145
|
+
</learning_protocol>
|
|
146
|
+
|
|
147
|
+
<output_format>
|
|
148
|
+
- JSON for structured challenge results (challenges array, scores, summary)
|
|
149
|
+
- Markdown for human-readable challenge reports
|
|
150
|
+
- Challenges sorted by severity (critical > high > medium > low > informational)
|
|
151
|
+
- Include challenge count, overall confidence score, and per-strategy breakdown
|
|
152
|
+
</output_format>
|
|
153
|
+
|
|
154
|
+
<examples>
|
|
155
|
+
Example 1: Challenge test generation output
|
|
156
|
+
```
|
|
157
|
+
Input: Review test-generation output from agent test-gen-001
|
|
158
|
+
- 5 tests generated for UserService.createUser()
|
|
159
|
+
- All tests check happy path with valid data
|
|
160
|
+
|
|
161
|
+
Output: CHALLENGED (Score: 0.38, 4 challenges)
|
|
162
|
+
[HIGH] Missing edge case: No test for duplicate email
|
|
163
|
+
[HIGH] Missing edge case: No test for empty/null username
|
|
164
|
+
[MEDIUM] Boundary value gap: No max-length validation test
|
|
165
|
+
[LOW] Error handling gap: No test for database connection failure
|
|
166
|
+
Summary: 5 tests cover only the happy path. No negative tests,
|
|
167
|
+
no boundary tests, no error handling tests. Test suite has
|
|
168
|
+
significant gaps in edge case coverage.
|
|
169
|
+
```
|
|
170
|
+
|
|
171
|
+
Example 2: Challenge security scan output
|
|
172
|
+
```
|
|
173
|
+
Input: Review security-scan output from agent sec-scan-001
|
|
174
|
+
- 0 vulnerabilities found
|
|
175
|
+
- Scanned: SQL injection, XSS
|
|
176
|
+
|
|
177
|
+
Output: CHALLENGED (Score: 0.52, 2 challenges)
|
|
178
|
+
[HIGH] Security blind spot: No SSRF testing performed
|
|
179
|
+
[MEDIUM] Security blind spot: No deserialization checks
|
|
180
|
+
Summary: Scan covers injection and XSS but misses SSRF,
|
|
181
|
+
deserialization, and authentication bypass vectors.
|
|
182
|
+
```
|
|
183
|
+
</examples>
|
|
184
|
+
|
|
185
|
+
<v3_integration>
|
|
186
|
+
### Code Implementation
|
|
187
|
+
The Devil's Advocate agent is implemented in `v3/src/agents/devils-advocate/`:
|
|
188
|
+
- `agent.ts` - Core `DevilsAdvocate` class with `review()` method
|
|
189
|
+
- `strategies.ts` - 7 pluggable challenge strategies
|
|
190
|
+
- `types.ts` - Type definitions for targets, challenges, results
|
|
191
|
+
|
|
192
|
+
### Usage
|
|
193
|
+
```typescript
|
|
194
|
+
import { DevilsAdvocate } from '@agentic-qe/v3';
|
|
195
|
+
|
|
196
|
+
const da = DevilsAdvocate.createDevilsAdvocate({ minConfidence: 0.5 });
|
|
197
|
+
|
|
198
|
+
const result = da.review({
|
|
199
|
+
type: 'test-generation',
|
|
200
|
+
agentId: 'test-gen-001',
|
|
201
|
+
domain: 'test-generation',
|
|
202
|
+
output: { testCount: 3, tests: [] },
|
|
203
|
+
timestamp: Date.now(),
|
|
204
|
+
});
|
|
205
|
+
```
|
|
206
|
+
|
|
207
|
+
### Strategies
|
|
208
|
+
| Strategy | Applies To | Detects |
|
|
209
|
+
|----------|-----------|---------|
|
|
210
|
+
| MissingEdgeCaseStrategy | test-generation | Untested edge cases, null handling |
|
|
211
|
+
| FalsePositiveDetectionStrategy | security-scan, coverage-analysis | Likely false positives |
|
|
212
|
+
| CoverageGapCritiqueStrategy | coverage-analysis | Semantic gaps in coverage |
|
|
213
|
+
| SecurityBlindSpotStrategy | security-scan | Missing threat vectors |
|
|
214
|
+
| AssumptionQuestioningStrategy | quality-assessment, defect-prediction, requirements | Implicit assumptions |
|
|
215
|
+
| BoundaryValueGapStrategy | test-generation | Off-by-one, overflow, empty collections |
|
|
216
|
+
| ErrorHandlingGapStrategy | test-generation, contract-validation | Missing error handling |
|
|
217
|
+
</v3_integration>
|
|
218
|
+
</qe_agent_definition>
|
|
@@ -245,7 +245,7 @@ interface QualityCriteriaAnalysis {
|
|
|
245
245
|
```
|
|
246
246
|
|
|
247
247
|
## Template Location
|
|
248
|
-
Helper files installed to `.claude/
|
|
248
|
+
Helper files installed to `.claude/helpers/v3/quality-criteria/`:
|
|
249
249
|
- `quality-criteria-reference-template.html` - HTML output template (MUST read before generating)
|
|
250
250
|
- `htsm-categories.md` - Detailed category definitions
|
|
251
251
|
- `evidence-classification.md` - Evidence type guidelines
|
|
@@ -404,7 +404,7 @@ if (!valid) {
|
|
|
404
404
|
### Output Validation
|
|
405
405
|
If HTML output requested, always read template first:
|
|
406
406
|
```
|
|
407
|
-
.claude/
|
|
407
|
+
.claude/helpers/v3/quality-criteria/quality-criteria-reference-template.html
|
|
408
408
|
```
|
|
409
409
|
</final_validation>
|
|
410
410
|
</qe_agent_definition>
|
|
@@ -441,5 +441,5 @@ When ALL phases complete -> <promise>DEPLOYMENT_READY</promise>
|
|
|
441
441
|
---
|
|
442
442
|
|
|
443
443
|
**Origin**: Adapted from Ralph Wiggum plugin (anthropics/claude-code)
|
|
444
|
-
**Specialized for**: Agentic QE v3 Fleet with
|
|
444
|
+
**Specialized for**: Agentic QE v3 Fleet with 60 QE agents
|
|
445
445
|
**Domains**: test-generation, test-execution, coverage-analysis, quality-assessment
|
|
@@ -144,27 +144,23 @@ Verify init completes without errors and creates the expected project structure
|
|
|
144
144
|
# Version output
|
|
145
145
|
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js --version
|
|
146
146
|
|
|
147
|
-
#
|
|
148
|
-
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js
|
|
147
|
+
# System status
|
|
148
|
+
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js status
|
|
149
149
|
```
|
|
150
150
|
Both must succeed without errors.
|
|
151
151
|
|
|
152
|
-
#### 8d. Verify
|
|
153
|
-
```bash
|
|
154
|
-
# Verify MCP server can start and list tools
|
|
155
|
-
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js mcp --list-tools 2>&1 | head -30
|
|
156
|
-
```
|
|
157
|
-
Should list available MCP tools without crashing.
|
|
158
|
-
|
|
159
|
-
#### 8e. Verify Self-Learning & Fleet Capabilities
|
|
152
|
+
#### 8d. Verify Self-Learning & Fleet Capabilities
|
|
160
153
|
```bash
|
|
161
154
|
cd /tmp/aqe-release-test
|
|
162
155
|
|
|
163
|
-
# Verify
|
|
164
|
-
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js
|
|
156
|
+
# Verify learning subsystem
|
|
157
|
+
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js learning stats 2>&1 | head -10
|
|
165
158
|
|
|
166
|
-
# Verify agent
|
|
159
|
+
# Verify agent listing works
|
|
167
160
|
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js agent list 2>&1 | head -10
|
|
161
|
+
|
|
162
|
+
# Verify health check
|
|
163
|
+
node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js health 2>&1 | head -10
|
|
168
164
|
```
|
|
169
165
|
These should respond (even if empty results) without errors, confirming the subsystems initialize properly.
|
|
170
166
|
|
|
@@ -177,34 +173,24 @@ rm -rf /tmp/aqe-release-test
|
|
|
177
173
|
|
|
178
174
|
### 9. Local CI Test Suite
|
|
179
175
|
|
|
180
|
-
Run the same tests that CI runs on PRs
|
|
176
|
+
Run the same tests that CI runs on PRs and during publish. Skip e2e browser tests unless the user explicitly requests them.
|
|
181
177
|
|
|
182
178
|
```bash
|
|
183
179
|
cd /workspaces/agentic-qe-new/v3
|
|
184
180
|
|
|
185
|
-
#
|
|
186
|
-
npm run test:journeys
|
|
187
|
-
|
|
188
|
-
# Code Intelligence tests (MinCut/Graph algorithms, from optimized-ci.yml)
|
|
189
|
-
npm run test:code-intelligence
|
|
190
|
-
|
|
191
|
-
# Contract tests (if they exist, from optimized-ci.yml)
|
|
192
|
-
npm run test:contracts 2>/dev/null || echo "No contract tests"
|
|
193
|
-
|
|
194
|
-
# Infrastructure tests (from optimized-ci.yml)
|
|
195
|
-
npm run test:infrastructure 2>/dev/null || echo "No infrastructure tests"
|
|
196
|
-
|
|
197
|
-
# Regression tests (from optimized-ci.yml)
|
|
198
|
-
npm run test:regression 2>/dev/null || echo "No regression tests"
|
|
199
|
-
|
|
200
|
-
# Performance gates (from optimized-ci.yml)
|
|
181
|
+
# Performance gates (fast — validates perf thresholds)
|
|
201
182
|
npm run performance:gate
|
|
202
183
|
|
|
184
|
+
# Regression tests (runs full unit suite)
|
|
185
|
+
npm run test:regression
|
|
186
|
+
|
|
203
187
|
# Full test:ci suite (from npm-publish.yml — excludes browser/e2e)
|
|
204
188
|
npm run test:ci
|
|
205
189
|
```
|
|
206
190
|
|
|
207
|
-
|
|
191
|
+
Available test scripts: `test:unit`, `test:unit:fast`, `test:unit:heavy`, `test:unit:mcp`, `test:ci`, `test:regression`, `test:safe`, `test:perf`, `test:e2e`, `test:coverage`, `performance:gate`.
|
|
192
|
+
|
|
193
|
+
All mandatory test suites must pass. Pre-existing MCP handler test failures (tests that need runtime initialization) are acceptable if they also fail on the main branch.
|
|
208
194
|
|
|
209
195
|
**STOP — show all test results.**
|
|
210
196
|
|
package/README.md
CHANGED
|
@@ -9,35 +9,9 @@
|
|
|
9
9
|
<img alt="NPM Downloads" src="https://img.shields.io/npm/dw/agentic-qe">
|
|
10
10
|
|
|
11
11
|
|
|
12
|
-
**V3 (Main)** | [V2 Documentation](v2/docs/V2-README.md) | [Changelog](CHANGELOG.md) | [Contributors](CONTRIBUTORS.md) | [Issues](https://github.com/proffesor-for-testing/agentic-qe/issues) | [Discussions](https://github.com/proffesor-for-testing/agentic-qe/discussions)
|
|
12
|
+
**V3 (Main)** | [V2 Documentation](v2/docs/V2-README.md) | [Release Notes](docs/releases/README.md) | [Changelog](v3/CHANGELOG.md) | [Contributors](CONTRIBUTORS.md) | [Issues](https://github.com/proffesor-for-testing/agentic-qe/issues) | [Discussions](https://github.com/proffesor-for-testing/agentic-qe/discussions)
|
|
13
13
|
|
|
14
|
-
> **V3** brings Domain-Driven Design architecture, 13 bounded contexts,
|
|
15
|
-
|
|
16
|
-
### What's New in v3.6.0
|
|
17
|
-
|
|
18
|
-
- **Enterprise Integration Domain** — SOAP/WSDL, SAP RFC/BAPI/IDoc, OData, ESB/middleware, message broker, and Segregation of Duties testing (contributed by [@fndlalit](https://github.com/fndlalit))
|
|
19
|
-
- **8 New Agents** — `qe-soap-tester`, `qe-sap-rfc-tester`, `qe-sap-idoc-tester`, `qe-middleware-validator`, `qe-odata-contract-tester`, `qe-message-broker-tester`, `qe-sod-analyzer`, `qe-pentest-validator`
|
|
20
|
-
- **5 New Skills** — `enterprise-integration-testing`, `middleware-testing-patterns`, `wms-testing-patterns`, `observability-testing-patterns`, `pentest-validation` (Tier 3)
|
|
21
|
-
- **Pentest Validation** — Shannon-inspired graduated exploit validation with "No Exploit, No Report" quality gate and 3-tier exploitation
|
|
22
|
-
- **StrongDM Tier 1** — Loop detection + token dashboard for software delivery governance (ADR-062)
|
|
23
|
-
- **Fleet: 59 agents, 75 skills across 13 domains**
|
|
24
|
-
|
|
25
|
-
### What's New in v3.5.0
|
|
26
|
-
|
|
27
|
-
- **Governance ON by Default** - @claude-flow/guidance integration with 7 unbreakable QE invariants (ADR-058)
|
|
28
|
-
- **QCSD 2.0 Complete Lifecycle** - All 4 phases: Ideation → Refinement → Development → CI/CD Verification
|
|
29
|
-
- **Infrastructure Self-Healing Enterprise** - 12 enterprise error signatures (SAP, Salesforce, Payment Gateway)
|
|
30
|
-
|
|
31
|
-
### What's New in v3.4.2
|
|
32
|
-
|
|
33
|
-
- **Skill Validation System** - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
|
|
34
|
-
- **CLI Validation Commands** - `aqe skill report`, `aqe eval run`, regression detection
|
|
35
|
-
|
|
36
|
-
### What's New in v3.4.0
|
|
37
|
-
|
|
38
|
-
- **AG-UI Protocol** - Anthropic's streaming agent-to-user interface with real-time progress updates
|
|
39
|
-
- **A2A Protocol** - Google's agent-to-agent interoperability standard for cross-tool communication
|
|
40
|
-
- **A2UI Components** - Unified UI combining AG-UI streaming with A2A event handling
|
|
14
|
+
> **V3** brings Domain-Driven Design architecture, 13 bounded contexts, 60 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 13 domains, and deep integration with [Claude Flow](https://github.com/ruvnet/claude-flow) and [Agentic Flow](https://github.com/ruvnet/agentic-flow).
|
|
41
15
|
|
|
42
16
|
🏗️ **DDD Architecture** | 🧠 **ReasoningBank + Dream Cycles** | 🎯 **TinyDancer Model Routing** | 🔍 **HNSW Vector Search** | 👑 **Queen Coordinator** | 📊 **O(log n) Coverage** | 🔗 **Claude Flow Integration** | 🎯 **13 Bounded Contexts** | 📚 **75 QE Skills** | 🧬 **Coherence Verification** | ✅ **Trust Tiers** | 🛡️ **Governance**
|
|
43
17
|
|
|
@@ -88,7 +62,7 @@ claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flak
|
|
|
88
62
|
|
|
89
63
|
**What V3 provides:**
|
|
90
64
|
- ✅ **13 DDD Bounded Contexts**: Organized by business domain (test-generation, coverage-analysis, security-compliance, enterprise-integration, etc.)
|
|
91
|
-
- ✅ **
|
|
65
|
+
- ✅ **60 QE Agents**: Including Queen Coordinator for hierarchical orchestration (53 main + 7 TDD subagents)
|
|
92
66
|
- ✅ **TinyDancer Model Routing**: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
|
|
93
67
|
- ✅ **ReasoningBank Learning**: HNSW-indexed pattern storage with experience replay
|
|
94
68
|
- ✅ **O(log n) Coverage Analysis**: Sublinear algorithms for efficient gap detection
|
|
@@ -134,7 +108,7 @@ claude "Assess code quality and provide deployment recommendation"
|
|
|
134
108
|
| **AI testing tools are expensive** | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
|
|
135
109
|
| **No memory between test runs—every analysis starts from scratch** | ReasoningBank remembers patterns, strategies, and what works for your codebase |
|
|
136
110
|
| **Agents waste tokens reading irrelevant code** | Code Intelligence provides token reduction with semantic search and knowledge graphs |
|
|
137
|
-
| **Quality engineering requires complex coordination** | Queen Coordinator orchestrates
|
|
111
|
+
| **Quality engineering requires complex coordination** | Queen Coordinator orchestrates 60 agents across 13 domains with consensus and MinCut topology |
|
|
138
112
|
| **Tools don't understand your testing frameworks** | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |
|
|
139
113
|
|
|
140
114
|
---
|
|
@@ -240,7 +214,7 @@ The **qe-queen-coordinator** manages the entire fleet with intelligent task dist
|
|
|
240
214
|
```
|
|
241
215
|
|
|
242
216
|
**Capabilities:**
|
|
243
|
-
- Orchestrate
|
|
217
|
+
- Orchestrate 60 QE agents concurrently across 13 domains
|
|
244
218
|
- TinyDancer 3-tier model routing (Haiku/Sonnet/Opus) with confidence-based decisions
|
|
245
219
|
- Byzantine fault-tolerant consensus for critical quality decisions
|
|
246
220
|
- MinCut graph-based topology optimization for self-healing coordination
|
|
@@ -253,6 +227,34 @@ claude "Use qe-queen-coordinator to orchestrate release validation for v2.1.0 wi
|
|
|
253
227
|
|
|
254
228
|
---
|
|
255
229
|
|
|
230
|
+
### 🤝 Agent Teams & Fleet Coordination
|
|
231
|
+
|
|
232
|
+
The Queen Coordinator is extended with **Agent Teams** (ADR-064) for hybrid fleet communication:
|
|
233
|
+
|
|
234
|
+
| Feature | Description |
|
|
235
|
+
|---------|-------------|
|
|
236
|
+
| **Mailbox Messaging** | Direct agent-to-agent and domain-scoped broadcast messaging |
|
|
237
|
+
| **Distributed Tracing** | TraceContext propagation across messages for end-to-end task visibility |
|
|
238
|
+
| **Dynamic Scaling** | Workload-based auto-scaling with configurable policies and cooldowns |
|
|
239
|
+
| **Competing Hypotheses** | Multi-agent root cause investigation with evidence scoring, auto-triggered on critical failures |
|
|
240
|
+
| **Federation** | Cross-service routing with health monitoring and service discovery |
|
|
241
|
+
| **Circuit Breakers** | Per-domain fault isolation with automatic recovery |
|
|
242
|
+
| **Task DAG** | Topological ordering with cycle detection for multi-step workflows |
|
|
243
|
+
|
|
244
|
+
**Fleet Tiers** — Activate the level of coordination your project needs:
|
|
245
|
+
|
|
246
|
+
| Tier | Agents | Best For |
|
|
247
|
+
|------|--------|----------|
|
|
248
|
+
| **Lite** | 1-4 | Small projects, focused tasks |
|
|
249
|
+
| **Standard** | 5-10 | Team projects, multi-domain coordination |
|
|
250
|
+
| **Full** | 11-15 | Enterprise, cross-fleet federation |
|
|
251
|
+
|
|
252
|
+
```bash
|
|
253
|
+
claude "Use qe-queen-coordinator with agent teams to investigate flaky test failures across test-execution and defect-intelligence domains"
|
|
254
|
+
```
|
|
255
|
+
|
|
256
|
+
---
|
|
257
|
+
|
|
256
258
|
### 🧠 ReasoningBank Learning System
|
|
257
259
|
|
|
258
260
|
V3 agents learn and improve through the **ReasoningBank** pattern storage:
|
|
@@ -466,17 +468,17 @@ npx @claude-flow/cli@latest agent spawn -t qe-test-architect --name test-gen
|
|
|
466
468
|
|
|
467
469
|
---
|
|
468
470
|
|
|
469
|
-
### 📊
|
|
471
|
+
### 📊 60 Specialized QE Agents
|
|
470
472
|
|
|
471
473
|
| Category | Count | Highlights |
|
|
472
474
|
|----------|-------|------------|
|
|
473
|
-
| **Main QE Agents** |
|
|
475
|
+
| **Main QE Agents** | 53 | Test generation, coverage, security, performance, accessibility, enterprise integration, pentest validation |
|
|
474
476
|
| **TDD Subagents** | 7 | RED/GREEN/REFACTOR with code review |
|
|
475
477
|
|
|
476
478
|
**V2 Backward Compatibility**: All V2 agents map to V3 equivalents automatically.
|
|
477
479
|
|
|
478
480
|
<details>
|
|
479
|
-
<summary><b>📋 View All Main QE Agents (
|
|
481
|
+
<summary><b>📋 View All Main QE Agents (53)</b></summary>
|
|
480
482
|
|
|
481
483
|
| Agent | Domain | Purpose |
|
|
482
484
|
|-------|--------|---------|
|
|
@@ -524,6 +526,7 @@ npx @claude-flow/cli@latest agent spawn -t qe-test-architect --name test-gen
|
|
|
524
526
|
| qe-product-factors-assessor | quality-assessment | SFDIPOT product factors analysis |
|
|
525
527
|
| qe-test-idea-rewriter | test-generation | Transform passive tests to active actions |
|
|
526
528
|
| qe-quality-criteria-recommender | quality-assessment | HTSM v6.3 Quality Criteria analysis |
|
|
529
|
+
| qe-devils-advocate | quality-assessment | Adversarial review of agent outputs |
|
|
527
530
|
|
|
528
531
|
</details>
|
|
529
532
|
|
|
@@ -794,7 +797,7 @@ agentic-qe/
|
|
|
794
797
|
│ │ ├── mcp/ # MCP server
|
|
795
798
|
│ │ └── cli/ # V3 CLI
|
|
796
799
|
│ ├── tests/ # 5,600+ tests
|
|
797
|
-
│ └── assets/agents/ #
|
|
800
|
+
│ └── assets/agents/ # 60 QE agent definitions (53 main + 7 subagents)
|
|
798
801
|
├── v2/ # V2 Implementation (Legacy)
|
|
799
802
|
│ ├── src/ # V2 source code
|
|
800
803
|
│ ├── tests/ # V2 tests
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "agentic-qe",
|
|
3
|
-
"version": "3.6.
|
|
3
|
+
"version": "3.6.2",
|
|
4
4
|
"description": "Agentic Quality Engineering V3 - Domain-Driven Design Architecture with 13 Bounded Contexts, O(log n) coverage analysis, ReasoningBank learning, 59 specialized QE agents, mathematical Coherence verification, deep Claude Flow integration",
|
|
5
5
|
"main": "./v3/dist/index.js",
|
|
6
6
|
"types": "./v3/dist/index.d.ts",
|
package/v3/CHANGELOG.md
CHANGED
|
@@ -5,6 +5,50 @@ All notable changes to Agentic QE will be documented in this file.
|
|
|
5
5
|
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
|
|
6
6
|
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
|
7
7
|
|
|
8
|
+
## [3.6.2] - 2026-02-10
|
|
9
|
+
|
|
10
|
+
### Fixed
|
|
11
|
+
|
|
12
|
+
- **YAML parser empty array crash (Issue #244)** — `aqe init --auto` no longer fails on re-runs when `config.yaml` has empty array fields like `disabled:` with no items. The custom YAML parser now normalizes known array fields after parsing, and `mergeConfigs()` uses defensive `Array.isArray()` checks.
|
|
13
|
+
- **Agent parse errors on helper files (Issue #243)** — Helper reference files (`htsm-categories.md`, `evidence-classification.md`) and the generated `README.md` are no longer placed inside `.claude/agents/v3/` where `claude doctor` would incorrectly parse them as agent definitions. Helpers now install to `.claude/helpers/v3/` and the agents index writes to `.claude/docs/v3-agents-index.md`.
|
|
14
|
+
|
|
15
|
+
### Changed
|
|
16
|
+
|
|
17
|
+
- **Helper files location** — Agent helper/reference files (quality-criteria templates, SFDIPOT templates) now install to `.claude/helpers/v3/` instead of `.claude/agents/v3/helpers/`. Updated all path references in `quality-criteria-service.ts` and agent definitions.
|
|
18
|
+
|
|
19
|
+
## [3.6.1] - 2026-02-09
|
|
20
|
+
|
|
21
|
+
### Added
|
|
22
|
+
|
|
23
|
+
- **Agent Teams Integration (ADR-064)** — Hybrid fleet architecture layering Claude Code Agent Teams communication patterns on the existing Queen Coordinator. 4-phase implementation: Foundation, Hybrid Architecture, Learning & Observability, Advanced Patterns.
|
|
24
|
+
- **Agent Teams Adapter** — Direct mailbox messaging between agents with domain-scoped teams (2-4 agents per domain), team lead/teammate model, and subscription-based event delivery.
|
|
25
|
+
- **Fleet Tier Selector** — Tiered fleet activation (smoke/standard/deep/crisis) that controls agent count and token costs based on trigger context (commit, PR, release, incident).
|
|
26
|
+
- **Task Dependency DAG** — Topological ordering with cycle detection for multi-step task workflows. DAGScheduler for automated execution of ready tasks.
|
|
27
|
+
- **TeammateIdle Hook** — Auto-assigns pending tasks to idle agents, reducing Queen bottleneck for task distribution.
|
|
28
|
+
- **TaskCompleted Hook** — Extracts patterns from completed tasks and trains them into ReasoningBank automatically. Quality gate validation with exit code 2 rejection.
|
|
29
|
+
- **Domain Circuit Breakers** — Per-domain fault isolation with configurable failure thresholds, half-open recovery probing, and criticality-based configs.
|
|
30
|
+
- **Domain Team Manager** — Creates and manages domain-scoped agent teams with health monitoring, scaling, and rebalancing.
|
|
31
|
+
- **HNSW Graph Construction** — Real O(log n) HNSW insert and search in unified memory, replacing the O(n) linear scan stub.
|
|
32
|
+
- **Distributed Tracing** — TraceCollector with W3C-style TraceContext propagation encoded into AgentMessage correlationId fields. Queen traces full task lifecycles.
|
|
33
|
+
- **Competing Hypotheses** — HypothesisManager for multi-agent root cause investigation with evidence scoring, confidence tracking, and convergence (evidence-scoring, unanimous, majority, timeout). Auto-triggered on p0/p1 task failures.
|
|
34
|
+
- **Cross-Fleet Federation** — FederationMailbox with service registry, domain-based routing, health monitoring via heartbeats, and graceful degradation for unreachable services.
|
|
35
|
+
- **Dynamic Agent Scaling** — DynamicScaler with workload metrics collection, configurable scaling policies (queue depth, idle ratio, error rate thresholds), cooldown enforcement, and executor callbacks. Wired into Queen's metrics loop.
|
|
36
|
+
- **ReasoningBank Pattern Store Adapter** — Bridges TaskCompletedHook pattern extraction to QEReasoningBank storage with domain detection, type mapping, and confidence propagation.
|
|
37
|
+
- **promotePattern() Implementation** — Completes the ReasoningBank promotion stub: delegates to PatternStore.promote() and publishes pattern:promoted events.
|
|
38
|
+
- **Devil's Advocate Agent** — `qe-devils-advocate` agent that challenges other agents' outputs by finding gaps and questioning assumptions.
|
|
39
|
+
- **397+ New Tests** — 282 coordination tests, 67 hook tests, 48 learning tests covering all ADR-064 phases including adapter tracing integration and latency benchmarks.
|
|
40
|
+
|
|
41
|
+
### Fixed
|
|
42
|
+
|
|
43
|
+
- **6 CodeQL Alerts** — Resolved security alerts in enterprise-integration services (input validation, type safety).
|
|
44
|
+
- **Pattern Training Pipeline** — Connected the disconnected TaskCompletedHook → ReasoningBank pipeline so patterns are automatically trained on task completion.
|
|
45
|
+
- **Queen Operational Wiring** — All Phase 3+4 modules (tracing, dynamic scaler, hypotheses) are now called by Queen's operational flow, not just initialized as shelf-ware.
|
|
46
|
+
|
|
47
|
+
### Changed
|
|
48
|
+
|
|
49
|
+
- **Queen Coordinator** — Extended with tracing (startTrace on submitTask, completeSpan/failSpan on completion/failure), dynamic scaling (metrics feed + evaluate + execute in metrics loop), and competing hypotheses (auto-investigation on critical failures).
|
|
50
|
+
- **Agent Teams Adapter** — sendMessage() and broadcast() now encode TraceContext into correlationId when provided, enabling end-to-end distributed tracing.
|
|
51
|
+
|
|
8
52
|
## [3.6.0] - 2026-02-08
|
|
9
53
|
|
|
10
54
|
### Added
|
package/v3/README.md
CHANGED
|
@@ -5,14 +5,14 @@
|
|
|
5
5
|
[](https://www.typescriptlang.org/)
|
|
6
6
|
[](https://nodejs.org/)
|
|
7
7
|
|
|
8
|
-
> Domain-Driven Quality Engineering with Mathematical Coherence Verification, 13 Bounded Contexts,
|
|
8
|
+
> Domain-Driven Quality Engineering with Mathematical Coherence Verification, 13 Bounded Contexts, 60 Specialized QE Agents, 75 QE Skills, and ReasoningBank Learning
|
|
9
9
|
|
|
10
10
|
### Key Features
|
|
11
11
|
|
|
12
12
|
| Feature | Description |
|
|
13
13
|
|---------|-------------|
|
|
14
14
|
| **75 QE Skills** | Quality engineering skills with 4-tier trust validation system |
|
|
15
|
-
| **
|
|
15
|
+
| **60 QE Agents** | Specialized agents for test generation, security, coverage, enterprise integration, and more |
|
|
16
16
|
| **13 DDD Domains** | Modular bounded contexts for all quality engineering needs |
|
|
17
17
|
| **MCP Integration** | Full Claude Code integration via Model Context Protocol |
|
|
18
18
|
| **AG-UI/A2A Protocols** | Industry-standard agent streaming and interoperability |
|
|
@@ -48,7 +48,7 @@ npx aqe test generate src/
|
|
|
48
48
|
|
|
49
49
|
## Why Agentic QE?
|
|
50
50
|
|
|
51
|
-
- **
|
|
51
|
+
- **60 Specialized QE Agents** - Domain-focused quality engineering agents (53 main + 7 subagents)
|
|
52
52
|
- **75 QE Skills** - 46 Tier 3 verified + 29 additional (QCSD swarms, n8n testing, enterprise integration, qe-* domains)
|
|
53
53
|
- **13 DDD Bounded Contexts** - Modular, extensible architecture
|
|
54
54
|
- **TinyDancer Model Routing** - 3-tier intelligent routing for cost optimization
|
|
@@ -632,7 +632,7 @@ console.log(`Quality gate: ${gate.value.passed ? 'PASSED' : 'FAILED'}`);
|
|
|
632
632
|
| Module System | CommonJS | ESM |
|
|
633
633
|
| Memory | SQLite only | HNSW + SQLite hybrid |
|
|
634
634
|
| Learning | Basic patterns | ReasoningBank + SONA + Dream Cycles |
|
|
635
|
-
| Agents | 32 |
|
|
635
|
+
| Agents | 32 | 60 QE agents (53 main + 7 subagents) |
|
|
636
636
|
| Skills | 35 | 75 QE skills (46 Tier 3 + 29 additional) |
|
|
637
637
|
| Coverage | O(n) | O(log n) |
|
|
638
638
|
| Pattern Search | Linear | O(log n) HNSW indexing |
|
|
@@ -706,9 +706,9 @@ See the [Migration Guide](./docs/MIGRATION-GUIDE.md) for detailed instructions a
|
|
|
706
706
|
}
|
|
707
707
|
```
|
|
708
708
|
|
|
709
|
-
##
|
|
709
|
+
## 60 QE Agents
|
|
710
710
|
|
|
711
|
-
Agentic QE includes
|
|
711
|
+
Agentic QE includes 60 specialized quality engineering agents (53 main + 7 subagents) organized by domain:
|
|
712
712
|
|
|
713
713
|
### Test Generation Domain
|
|
714
714
|
`qe-test-architect`, `qe-tdd-specialist`, `qe-tdd-red`, `qe-tdd-green`, `qe-tdd-refactor`, `qe-property-tester`, `qe-mutation-tester`, `qe-bdd-generator`
|
|
@@ -720,7 +720,7 @@ Agentic QE includes 59 specialized quality engineering agents (52 main + 7 subag
|
|
|
720
720
|
`qe-coverage-specialist`, `qe-gap-detector`, `qe-risk-analyzer`
|
|
721
721
|
|
|
722
722
|
### Quality Assessment Domain
|
|
723
|
-
`qe-quality-gate`, `qe-metrics-optimizer`, `qe-deployment-advisor`
|
|
723
|
+
`qe-quality-gate`, `qe-metrics-optimizer`, `qe-deployment-advisor`, `qe-devils-advocate`
|
|
724
724
|
|
|
725
725
|
### Defect Intelligence Domain
|
|
726
726
|
`qe-defect-intelligence`, `qe-regression-analyzer`, `qe-root-cause-analyzer`
|