agentic-qe 3.5.4 → 3.6.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (247) hide show
  1. package/.claude/agents/v3/README.md +1 -1
  2. package/.claude/agents/v3/qe-message-broker-tester.md +380 -0
  3. package/.claude/agents/v3/qe-middleware-validator.md +423 -0
  4. package/.claude/agents/v3/qe-odata-contract-tester.md +484 -0
  5. package/.claude/agents/v3/qe-pentest-validator.md +359 -0
  6. package/.claude/agents/v3/qe-qx-partner.md +85 -3
  7. package/.claude/agents/v3/qe-sap-idoc-tester.md +407 -0
  8. package/.claude/agents/v3/qe-sap-rfc-tester.md +357 -0
  9. package/.claude/agents/v3/qe-soap-tester.md +340 -0
  10. package/.claude/agents/v3/qe-sod-analyzer.md +528 -0
  11. package/.claude/agents/v3/templates/qx-report-template.html +437 -0
  12. package/.claude/skills/debug-loop/SKILL.md +61 -0
  13. package/.claude/skills/enterprise-integration-testing/SKILL.md +735 -0
  14. package/.claude/skills/middleware-testing-patterns/SKILL.md +798 -0
  15. package/.claude/skills/observability-testing-patterns/SKILL.md +930 -0
  16. package/.claude/skills/pentest-validation/SKILL.md +268 -0
  17. package/.claude/skills/pentest-validation/evals/pentest-validation.yaml +708 -0
  18. package/.claude/skills/pentest-validation/schemas/output.json +281 -0
  19. package/.claude/skills/pentest-validation/scripts/validate.sh +402 -0
  20. package/.claude/skills/pr-review/SKILL.md +61 -0
  21. package/.claude/skills/qcsd-cicd-swarm/SKILL.md +315 -13
  22. package/.claude/skills/qcsd-development-swarm/SKILL.md +331 -16
  23. package/.claude/skills/qcsd-ideation-swarm/SKILL.md +258 -9
  24. package/.claude/skills/qcsd-refinement-swarm/SKILL.md +344 -22
  25. package/.claude/skills/release/SKILL.md +333 -0
  26. package/.claude/skills/skills-manifest.json +5 -4
  27. package/.claude/skills/wms-testing-patterns/SKILL.md +949 -0
  28. package/README.md +30 -21
  29. package/package.json +2 -2
  30. package/v3/CHANGELOG.md +57 -0
  31. package/v3/README.md +23 -20
  32. package/v3/assets/agents/v3/qe-message-broker-tester.md +380 -0
  33. package/v3/assets/agents/v3/qe-middleware-validator.md +423 -0
  34. package/v3/assets/agents/v3/qe-odata-contract-tester.md +484 -0
  35. package/v3/assets/agents/v3/qe-pentest-validator.md +359 -0
  36. package/v3/assets/agents/v3/qe-qx-partner.md +90 -12
  37. package/v3/assets/agents/v3/qe-sap-idoc-tester.md +407 -0
  38. package/v3/assets/agents/v3/qe-sap-rfc-tester.md +357 -0
  39. package/v3/assets/agents/v3/qe-soap-tester.md +340 -0
  40. package/v3/assets/agents/v3/qe-sod-analyzer.md +528 -0
  41. package/v3/assets/agents/v3/templates/qx-report-template.html +437 -0
  42. package/v3/assets/skills/debug-loop/SKILL.md +61 -0
  43. package/v3/assets/skills/enterprise-integration-testing/SKILL.md +735 -0
  44. package/v3/assets/skills/middleware-testing-patterns/SKILL.md +798 -0
  45. package/v3/assets/skills/observability-testing-patterns/SKILL.md +930 -0
  46. package/v3/assets/skills/pentest-validation/SKILL.md +268 -0
  47. package/v3/assets/skills/pentest-validation/evals/pentest-validation.yaml +708 -0
  48. package/v3/assets/skills/pentest-validation/schemas/output.json +281 -0
  49. package/v3/assets/skills/pentest-validation/scripts/validate.sh +402 -0
  50. package/v3/assets/skills/pr-review/SKILL.md +61 -0
  51. package/v3/assets/skills/qcsd-cicd-swarm/SKILL.md +2206 -0
  52. package/v3/assets/skills/qcsd-development-swarm/SKILL.md +2154 -0
  53. package/v3/assets/skills/qcsd-ideation-swarm/SKILL.md +2008 -1753
  54. package/v3/assets/skills/qcsd-refinement-swarm/SKILL.md +2398 -0
  55. package/v3/assets/skills/security-visual-testing/SKILL.md +223 -0
  56. package/v3/assets/skills/security-visual-testing/evals/security-visual-testing.yaml +163 -0
  57. package/v3/assets/skills/security-visual-testing/schemas/output.json +486 -0
  58. package/v3/assets/skills/security-visual-testing/scripts/validate.sh +748 -0
  59. package/v3/assets/skills/sfdipot-product-factors/SKILL.md +239 -0
  60. package/v3/assets/skills/test-idea-rewriting/SKILL.md +229 -0
  61. package/v3/assets/skills/wms-testing-patterns/SKILL.md +949 -0
  62. package/v3/dist/cli/bundle.js +10816 -4625
  63. package/v3/dist/cli/commands/coverage.d.ts.map +1 -1
  64. package/v3/dist/cli/commands/coverage.js +29 -0
  65. package/v3/dist/cli/commands/coverage.js.map +1 -1
  66. package/v3/dist/cli/commands/learning.d.ts.map +1 -1
  67. package/v3/dist/cli/commands/learning.js +9 -0
  68. package/v3/dist/cli/commands/learning.js.map +1 -1
  69. package/v3/dist/cli/commands/token-usage.d.ts.map +1 -1
  70. package/v3/dist/cli/commands/token-usage.js +6 -2
  71. package/v3/dist/cli/commands/token-usage.js.map +1 -1
  72. package/v3/dist/coordination/protocols/learning-consolidation.d.ts.map +1 -1
  73. package/v3/dist/coordination/protocols/learning-consolidation.js +1 -0
  74. package/v3/dist/coordination/protocols/learning-consolidation.js.map +1 -1
  75. package/v3/dist/domains/coverage-analysis/coordinator.d.ts +8 -0
  76. package/v3/dist/domains/coverage-analysis/coordinator.d.ts.map +1 -1
  77. package/v3/dist/domains/coverage-analysis/coordinator.js +67 -0
  78. package/v3/dist/domains/coverage-analysis/coordinator.js.map +1 -1
  79. package/v3/dist/domains/coverage-analysis/interfaces.d.ts +33 -0
  80. package/v3/dist/domains/coverage-analysis/interfaces.d.ts.map +1 -1
  81. package/v3/dist/domains/coverage-analysis/plugin.d.ts.map +1 -1
  82. package/v3/dist/domains/coverage-analysis/plugin.js +17 -0
  83. package/v3/dist/domains/coverage-analysis/plugin.js.map +1 -1
  84. package/v3/dist/domains/coverage-analysis/services/ghost-coverage-analyzer.d.ts +125 -0
  85. package/v3/dist/domains/coverage-analysis/services/ghost-coverage-analyzer.d.ts.map +1 -0
  86. package/v3/dist/domains/coverage-analysis/services/ghost-coverage-analyzer.js +317 -0
  87. package/v3/dist/domains/coverage-analysis/services/ghost-coverage-analyzer.js.map +1 -0
  88. package/v3/dist/domains/coverage-analysis/services/index.d.ts +1 -0
  89. package/v3/dist/domains/coverage-analysis/services/index.d.ts.map +1 -1
  90. package/v3/dist/domains/coverage-analysis/services/index.js +4 -0
  91. package/v3/dist/domains/coverage-analysis/services/index.js.map +1 -1
  92. package/v3/dist/domains/enterprise-integration/coordinator.d.ts +91 -0
  93. package/v3/dist/domains/enterprise-integration/coordinator.d.ts.map +1 -0
  94. package/v3/dist/domains/enterprise-integration/coordinator.js +672 -0
  95. package/v3/dist/domains/enterprise-integration/coordinator.js.map +1 -0
  96. package/v3/dist/domains/enterprise-integration/index.d.ts +14 -0
  97. package/v3/dist/domains/enterprise-integration/index.d.ts.map +1 -0
  98. package/v3/dist/domains/enterprise-integration/index.js +18 -0
  99. package/v3/dist/domains/enterprise-integration/index.js.map +1 -0
  100. package/v3/dist/domains/enterprise-integration/interfaces.d.ts +376 -0
  101. package/v3/dist/domains/enterprise-integration/interfaces.d.ts.map +1 -0
  102. package/v3/dist/domains/enterprise-integration/interfaces.js +11 -0
  103. package/v3/dist/domains/enterprise-integration/interfaces.js.map +1 -0
  104. package/v3/dist/domains/enterprise-integration/plugin.d.ts +88 -0
  105. package/v3/dist/domains/enterprise-integration/plugin.d.ts.map +1 -0
  106. package/v3/dist/domains/enterprise-integration/plugin.js +515 -0
  107. package/v3/dist/domains/enterprise-integration/plugin.js.map +1 -0
  108. package/v3/dist/domains/enterprise-integration/services/esb-middleware-service.d.ts +67 -0
  109. package/v3/dist/domains/enterprise-integration/services/esb-middleware-service.d.ts.map +1 -0
  110. package/v3/dist/domains/enterprise-integration/services/esb-middleware-service.js +670 -0
  111. package/v3/dist/domains/enterprise-integration/services/esb-middleware-service.js.map +1 -0
  112. package/v3/dist/domains/enterprise-integration/services/index.d.ts +13 -0
  113. package/v3/dist/domains/enterprise-integration/services/index.d.ts.map +1 -0
  114. package/v3/dist/domains/enterprise-integration/services/index.js +13 -0
  115. package/v3/dist/domains/enterprise-integration/services/index.js.map +1 -0
  116. package/v3/dist/domains/enterprise-integration/services/message-broker-service.d.ts +88 -0
  117. package/v3/dist/domains/enterprise-integration/services/message-broker-service.d.ts.map +1 -0
  118. package/v3/dist/domains/enterprise-integration/services/message-broker-service.js +560 -0
  119. package/v3/dist/domains/enterprise-integration/services/message-broker-service.js.map +1 -0
  120. package/v3/dist/domains/enterprise-integration/services/odata-service.d.ts +75 -0
  121. package/v3/dist/domains/enterprise-integration/services/odata-service.d.ts.map +1 -0
  122. package/v3/dist/domains/enterprise-integration/services/odata-service.js +618 -0
  123. package/v3/dist/domains/enterprise-integration/services/odata-service.js.map +1 -0
  124. package/v3/dist/domains/enterprise-integration/services/sap-integration-service.d.ts +73 -0
  125. package/v3/dist/domains/enterprise-integration/services/sap-integration-service.d.ts.map +1 -0
  126. package/v3/dist/domains/enterprise-integration/services/sap-integration-service.js +513 -0
  127. package/v3/dist/domains/enterprise-integration/services/sap-integration-service.js.map +1 -0
  128. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.d.ts +84 -0
  129. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.d.ts.map +1 -0
  130. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js +639 -0
  131. package/v3/dist/domains/enterprise-integration/services/soap-wsdl-service.js.map +1 -0
  132. package/v3/dist/domains/enterprise-integration/services/sod-analysis-service.d.ts +90 -0
  133. package/v3/dist/domains/enterprise-integration/services/sod-analysis-service.d.ts.map +1 -0
  134. package/v3/dist/domains/enterprise-integration/services/sod-analysis-service.js +389 -0
  135. package/v3/dist/domains/enterprise-integration/services/sod-analysis-service.js.map +1 -0
  136. package/v3/dist/domains/index.d.ts +1 -0
  137. package/v3/dist/domains/index.d.ts.map +1 -1
  138. package/v3/dist/domains/index.js +1 -0
  139. package/v3/dist/domains/index.js.map +1 -1
  140. package/v3/dist/domains/learning-optimization/coordinator.d.ts.map +1 -1
  141. package/v3/dist/domains/learning-optimization/coordinator.js +1 -0
  142. package/v3/dist/domains/learning-optimization/coordinator.js.map +1 -1
  143. package/v3/dist/domains/learning-optimization/services/learning-coordinator.d.ts.map +1 -1
  144. package/v3/dist/domains/learning-optimization/services/learning-coordinator.js +1 -0
  145. package/v3/dist/domains/learning-optimization/services/learning-coordinator.js.map +1 -1
  146. package/v3/dist/domains/learning-optimization/services/transfer-specialist.d.ts.map +1 -1
  147. package/v3/dist/domains/learning-optimization/services/transfer-specialist.js +1 -0
  148. package/v3/dist/domains/learning-optimization/services/transfer-specialist.js.map +1 -1
  149. package/v3/dist/domains/test-execution/coordinator.d.ts.map +1 -1
  150. package/v3/dist/domains/test-execution/coordinator.js +34 -0
  151. package/v3/dist/domains/test-execution/coordinator.js.map +1 -1
  152. package/v3/dist/domains/test-generation/services/test-data-generator.d.ts +7 -1
  153. package/v3/dist/domains/test-generation/services/test-data-generator.d.ts.map +1 -1
  154. package/v3/dist/domains/test-generation/services/test-data-generator.js +116 -98
  155. package/v3/dist/domains/test-generation/services/test-data-generator.js.map +1 -1
  156. package/v3/dist/init/agents-installer.d.ts +4 -0
  157. package/v3/dist/init/agents-installer.d.ts.map +1 -1
  158. package/v3/dist/init/agents-installer.js +32 -3
  159. package/v3/dist/init/agents-installer.js.map +1 -1
  160. package/v3/dist/init/skills-installer.d.ts.map +1 -1
  161. package/v3/dist/init/skills-installer.js +4 -1
  162. package/v3/dist/init/skills-installer.js.map +1 -1
  163. package/v3/dist/init/types.d.ts.map +1 -1
  164. package/v3/dist/init/types.js +1 -0
  165. package/v3/dist/init/types.js.map +1 -1
  166. package/v3/dist/integrations/agentic-flow/model-router/budget-enforcer.d.ts +16 -0
  167. package/v3/dist/integrations/agentic-flow/model-router/budget-enforcer.d.ts.map +1 -1
  168. package/v3/dist/integrations/agentic-flow/model-router/budget-enforcer.js +41 -0
  169. package/v3/dist/integrations/agentic-flow/model-router/budget-enforcer.js.map +1 -1
  170. package/v3/dist/integrations/ruvector/provider.d.ts.map +1 -1
  171. package/v3/dist/integrations/ruvector/provider.js +1 -0
  172. package/v3/dist/integrations/ruvector/provider.js.map +1 -1
  173. package/v3/dist/kernel/anti-drift-middleware.d.ts +160 -0
  174. package/v3/dist/kernel/anti-drift-middleware.d.ts.map +1 -0
  175. package/v3/dist/kernel/anti-drift-middleware.js +376 -0
  176. package/v3/dist/kernel/anti-drift-middleware.js.map +1 -0
  177. package/v3/dist/kernel/event-bus.d.ts +29 -1
  178. package/v3/dist/kernel/event-bus.d.ts.map +1 -1
  179. package/v3/dist/kernel/event-bus.js +69 -6
  180. package/v3/dist/kernel/event-bus.js.map +1 -1
  181. package/v3/dist/kernel/interfaces.d.ts +35 -0
  182. package/v3/dist/kernel/interfaces.d.ts.map +1 -1
  183. package/v3/dist/kernel/kernel.d.ts.map +1 -1
  184. package/v3/dist/kernel/kernel.js +14 -17
  185. package/v3/dist/kernel/kernel.js.map +1 -1
  186. package/v3/dist/kernel/unified-memory.d.ts +5 -0
  187. package/v3/dist/kernel/unified-memory.d.ts.map +1 -1
  188. package/v3/dist/kernel/unified-memory.js +28 -0
  189. package/v3/dist/kernel/unified-memory.js.map +1 -1
  190. package/v3/dist/learning/asymmetric-learning.d.ts +133 -0
  191. package/v3/dist/learning/asymmetric-learning.d.ts.map +1 -0
  192. package/v3/dist/learning/asymmetric-learning.js +170 -0
  193. package/v3/dist/learning/asymmetric-learning.js.map +1 -0
  194. package/v3/dist/learning/pattern-lifecycle.d.ts +26 -0
  195. package/v3/dist/learning/pattern-lifecycle.d.ts.map +1 -1
  196. package/v3/dist/learning/pattern-lifecycle.js +83 -0
  197. package/v3/dist/learning/pattern-lifecycle.js.map +1 -1
  198. package/v3/dist/learning/qe-patterns.d.ts +8 -0
  199. package/v3/dist/learning/qe-patterns.d.ts.map +1 -1
  200. package/v3/dist/learning/qe-patterns.js.map +1 -1
  201. package/v3/dist/learning/real-qe-reasoning-bank.d.ts +28 -0
  202. package/v3/dist/learning/real-qe-reasoning-bank.d.ts.map +1 -1
  203. package/v3/dist/learning/real-qe-reasoning-bank.js +72 -4
  204. package/v3/dist/learning/real-qe-reasoning-bank.js.map +1 -1
  205. package/v3/dist/learning/token-tracker.d.ts +22 -0
  206. package/v3/dist/learning/token-tracker.d.ts.map +1 -1
  207. package/v3/dist/learning/token-tracker.js +67 -0
  208. package/v3/dist/learning/token-tracker.js.map +1 -1
  209. package/v3/dist/mcp/bundle.js +11099 -4879
  210. package/v3/dist/mcp/tool-registry.d.ts.map +1 -1
  211. package/v3/dist/mcp/tool-registry.js +4 -0
  212. package/v3/dist/mcp/tool-registry.js.map +1 -1
  213. package/v3/dist/mcp/tools/analysis/token-usage.d.ts +1 -1
  214. package/v3/dist/mcp/tools/analysis/token-usage.d.ts.map +1 -1
  215. package/v3/dist/mcp/tools/analysis/token-usage.js +23 -3
  216. package/v3/dist/mcp/tools/analysis/token-usage.js.map +1 -1
  217. package/v3/dist/mcp/tools/base.d.ts.map +1 -1
  218. package/v3/dist/mcp/tools/base.js +1 -15
  219. package/v3/dist/mcp/tools/base.js.map +1 -1
  220. package/v3/dist/mcp/tools/coverage-analysis/index.d.ts +7 -0
  221. package/v3/dist/mcp/tools/coverage-analysis/index.d.ts.map +1 -1
  222. package/v3/dist/mcp/tools/coverage-analysis/index.js +30 -0
  223. package/v3/dist/mcp/tools/coverage-analysis/index.js.map +1 -1
  224. package/v3/dist/optimization/token-optimizer-service.d.ts +7 -0
  225. package/v3/dist/optimization/token-optimizer-service.d.ts.map +1 -1
  226. package/v3/dist/optimization/token-optimizer-service.js +10 -1
  227. package/v3/dist/optimization/token-optimizer-service.js.map +1 -1
  228. package/v3/dist/shared/types/index.d.ts +20 -1
  229. package/v3/dist/shared/types/index.d.ts.map +1 -1
  230. package/v3/dist/shared/types/index.js +1 -0
  231. package/v3/dist/shared/types/index.js.map +1 -1
  232. package/v3/dist/strange-loop/healing-controller.d.ts.map +1 -1
  233. package/v3/dist/strange-loop/healing-controller.js +12 -0
  234. package/v3/dist/strange-loop/healing-controller.js.map +1 -1
  235. package/v3/dist/strange-loop/strange-loop.d.ts +14 -0
  236. package/v3/dist/strange-loop/strange-loop.d.ts.map +1 -1
  237. package/v3/dist/strange-loop/strange-loop.js +24 -0
  238. package/v3/dist/strange-loop/strange-loop.js.map +1 -1
  239. package/v3/dist/strange-loop/types.d.ts +2 -2
  240. package/v3/dist/strange-loop/types.d.ts.map +1 -1
  241. package/v3/dist/strange-loop/types.js.map +1 -1
  242. package/v3/dist/sync/claude-flow-bridge.d.ts.map +1 -1
  243. package/v3/dist/sync/claude-flow-bridge.js +3 -24
  244. package/v3/dist/sync/claude-flow-bridge.js.map +1 -1
  245. package/v3/package.json +12 -9
  246. /package/{v3/assets/skills/sfdipot-product-factors/skill.md → .claude/skills/sfdipot-product-factors/SKILL.md} +0 -0
  247. /package/{v3/assets/skills/test-idea-rewriting/skill.md → .claude/skills/test-idea-rewriting/SKILL.md} +0 -0
@@ -0,0 +1,268 @@
1
+ ---
2
+ name: pentest-validation
3
+ description: "Orchestrate security finding validation through graduated exploitation. 4-phase pipeline: recon (SAST/DAST), analysis (code review), validation (exploit proof), report (No Exploit, No Report gate). Eliminates false positives by proving exploitability."
4
+ category: specialized-testing
5
+ priority: critical
6
+ tokenEstimate: 1500
7
+ agents: [qe-pentest-validator, qe-security-scanner, qe-security-reviewer, qe-security-auditor, qe-quality-gate]
8
+ implementation_status: optimized
9
+ optimization_version: 1.0
10
+ last_optimized: 2026-02-08
11
+ dependencies: [security-testing]
12
+ quick_reference_card: true
13
+ tags: [pentest, exploitation, security-validation, shannon, no-exploit-no-report, graduated-exploitation]
14
+ trust_tier: 3
15
+ validation:
16
+ schema_path: schemas/output.json
17
+ validator_path: scripts/validate.sh
18
+ eval_path: evals/pentest-validation.yaml
19
+ ---
20
+
21
+ # Pentest Validation
22
+
23
+ <default_to_action>
24
+ When validating security findings:
25
+ 1. REQUIRE explicit authorization for target URL
26
+ 2. SCAN with qe-security-scanner (SAST + dependency + secrets)
27
+ 3. ANALYZE with qe-security-reviewer + qe-security-auditor (parallel)
28
+ 4. VALIDATE with qe-pentest-validator (graduated exploitation, parallel per vuln type)
29
+ 5. REPORT only confirmed findings with PoC evidence ("No Exploit, No Report")
30
+ 6. UPDATE exploit playbook with new patterns
31
+
32
+ **Quality Gates:**
33
+ - Authorization confirmed before ANY exploitation
34
+ - Target URL is staging/dev (NOT production)
35
+ - Budget cap enforced ($15 default)
36
+ - Time cap enforced (30 min default)
37
+ - All exploitation attempts logged
38
+ </default_to_action>
39
+
40
+ ## Quick Reference Card
41
+
42
+ ### The 4-Phase Pipeline
43
+
44
+ | Phase | Agent(s) | Purpose | Parallelism |
45
+ |-------|----------|---------|-------------|
46
+ | **1. Recon** | qe-security-scanner | SAST, DAST, dependency scan, secrets | Internal parallel |
47
+ | **2. Analysis** | qe-security-reviewer + qe-security-auditor | Code review + compliance check | Both in parallel |
48
+ | **3. Validation** | qe-pentest-validator | Graduated exploit validation | Per-vuln-type parallel |
49
+ | **4. Report** | qe-quality-gate | "No Exploit, No Report" filter | Sequential |
50
+
51
+ ### Graduated Exploitation Tiers
52
+
53
+ | Tier | Handler | Cost | Latency | Use When |
54
+ |------|---------|------|---------|----------|
55
+ | **1** | Agent Booster (WASM) | $0 | <1ms | Code pattern is conclusive (eval, innerHTML, hardcoded creds) |
56
+ | **2** | Haiku | $0.0002 | ~500ms | Need payload test against live target |
57
+ | **3** | Sonnet/Opus | $0.003-$0.015 | 2-5s | Full exploit chain with data proof |
58
+
59
+ ### When to Use This Skill
60
+
61
+ | Scenario | Tier | Estimated Cost |
62
+ |----------|------|----------------|
63
+ | PR security review (source only) | 1 | $0 |
64
+ | Pre-release validation (staging) | 1-2 | $1-5 |
65
+ | Full pentest validation | 1-3 | $5-15 |
66
+ | Compliance audit evidence | 1-3 | $5-15 |
67
+
68
+ ---
69
+
70
+ ## Configuration
71
+
72
+ ```yaml
73
+ pentest:
74
+ target_url: https://staging.app.com # REQUIRED for Tier 2-3
75
+ source_repo: ./src # REQUIRED for Tier 1+
76
+ exploitation_tier: 2 # 1=pattern-only, 2=payload-test, 3=full-exploit
77
+ vuln_types: # Which pipelines to run
78
+ - injection # SQL, NoSQL, command injection
79
+ - xss # Reflected, stored, DOM XSS
80
+ - auth # Auth bypass, session, JWT
81
+ - ssrf # URL scheme abuse, metadata
82
+ max_cost_usd: 15 # Budget cap per run
83
+ timeout_minutes: 30 # Time cap per run
84
+ require_authorization: true # MUST confirm target ownership
85
+ no_production: true # Block production URLs
86
+ production_patterns: # URL patterns to block
87
+ - "*.prod.*"
88
+ - "api.*"
89
+ - "www.*"
90
+ ```
91
+
92
+ ---
93
+
94
+ ## Safeguards (Mandatory)
95
+
96
+ ### Authorization Gate
97
+ Every pentest validation run MUST:
98
+ 1. Display target URL and exploitation tier to user
99
+ 2. Require explicit confirmation: "I own/authorized testing of this target"
100
+ 3. Log authorization with timestamp
101
+ 4. Block if target URL matches production patterns
102
+
103
+ ### What This Skill Does NOT Do
104
+ - Full autonomous reconnaissance (Nmap, Subfinder)
105
+ - Zero-day exploit development
106
+ - Attack targets without explicit authorization
107
+ - Test production systems
108
+ - Store actual exfiltrated data (only proof of access)
109
+ - Social engineering or phishing simulation
110
+ - Port scanning or service discovery
111
+
112
+ ---
113
+
114
+ ## Validation Pipelines
115
+
116
+ ### Injection Pipeline
117
+ | Attack | Tier 1 (Pattern) | Tier 2 (Payload) | Tier 3 (Full) |
118
+ |--------|-------------------|-------------------|----------------|
119
+ | SQL injection | String concat in query | `' OR '1'='1` response diff | UNION SELECT data extraction |
120
+ | NoSQL injection | `$where`, `$gt` in query | Operator injection test | Collection enumeration |
121
+ | Command injection | `exec()`, `system()` calls | Command delimiter test | Reverse shell proof |
122
+ | LDAP injection | String concat in filter | Wildcard injection | Directory enumeration |
123
+
124
+ ### XSS Pipeline
125
+ | Attack | Tier 1 (Pattern) | Tier 2 (Payload) | Tier 3 (Full) |
126
+ |--------|-------------------|-------------------|----------------|
127
+ | Reflected XSS | No output encoding | `<img onerror>` reflection | Browser JS execution via Playwright |
128
+ | Stored XSS | `innerHTML` assignment | Payload stored + retrieved | Cookie theft PoC |
129
+ | DOM XSS | `document.write(location)` | Fragment injection | DOM manipulation proof |
130
+
131
+ ### Auth Pipeline
132
+ | Attack | Tier 1 (Pattern) | Tier 2 (Payload) | Tier 3 (Full) |
133
+ |--------|-------------------|-------------------|----------------|
134
+ | JWT none | No algorithm validation | Modified JWT accepted | Admin access with forged token |
135
+ | Session fixation | No session rotation | Pre-set session reused | Cross-user session hijack |
136
+ | Credential stuffing | No rate limiting | 100 attempts unblocked | Valid credential discovery |
137
+ | IDOR | No authorization check | Access other user data | Full CRUD on foreign resources |
138
+
139
+ ### SSRF Pipeline
140
+ | Attack | Tier 1 (Pattern) | Tier 2 (Payload) | Tier 3 (Full) |
141
+ |--------|-------------------|-------------------|----------------|
142
+ | Internal URL | User-controlled URL fetch | `http://169.254.169.254` | Cloud metadata extraction |
143
+ | DNS rebinding | URL validation bypass | Rebind to internal IP | Internal service access |
144
+ | Protocol smuggling | URL scheme not restricted | `file:///etc/passwd` | File content in response |
145
+
146
+ ---
147
+
148
+ ## Agent Coordination
149
+
150
+ ### Orchestration Pattern
151
+ ```typescript
152
+ // Phase 1: Recon (parallel scans)
153
+ await Task("Security Scan", {
154
+ target: "./src",
155
+ layers: { sast: true, dast: true, dependencies: true, secrets: true }
156
+ }, "qe-security-scanner");
157
+
158
+ // Phase 2: Analysis (parallel review)
159
+ await Promise.all([
160
+ Task("Code Security Review", {
161
+ findings: phase1Results,
162
+ depth: "comprehensive"
163
+ }, "qe-security-reviewer"),
164
+
165
+ Task("Compliance Audit", {
166
+ findings: phase1Results,
167
+ frameworks: ["owasp-top-10"]
168
+ }, "qe-security-auditor")
169
+ ]);
170
+
171
+ // Phase 3: Validation (graduated exploitation)
172
+ await Task("Exploit Validation", {
173
+ findings: [...phase1Results, ...phase2Results],
174
+ target_url: "https://staging.app.com",
175
+ exploitation_tier: 2,
176
+ vuln_types: ["injection", "xss", "auth", "ssrf"],
177
+ max_cost_usd: 15,
178
+ timeout_minutes: 30
179
+ }, "qe-pentest-validator");
180
+
181
+ // Phase 4: Report ("No Exploit, No Report" gate)
182
+ await Task("Security Quality Gate", {
183
+ findings: phase3Results.confirmedFindings,
184
+ gate: "no-exploit-no-report",
185
+ require_poc: true
186
+ }, "qe-quality-gate");
187
+ ```
188
+
189
+ ### Finding Classification
190
+ | Status | Meaning | Action |
191
+ |--------|---------|--------|
192
+ | `confirmed-exploitable` | Exploitation succeeded with PoC | Report with evidence |
193
+ | `likely-exploitable` | Partial exploitation, defenses detected | Report with caveats |
194
+ | `not-exploitable` | All exploitation attempts failed | Filter from report |
195
+ | `inconclusive` | WAF/defense blocked, unclear if vulnerable | Report for manual review |
196
+
197
+ ---
198
+
199
+ ## Exploit Playbook Memory
200
+
201
+ ### Namespace Structure
202
+ ```
203
+ aqe/pentest/
204
+ playbook/
205
+ exploit/{vuln_type}/{tech_stack}/{technique}
206
+ bypass/{defense_type}/{technique}
207
+ payload/{vuln_type}/{variant}
208
+ results/
209
+ validation-{timestamp}
210
+ poc/
211
+ {finding_id}-poc
212
+ ```
213
+
214
+ ### Learning Loop
215
+ 1. **Before validation**: Query playbook for known patterns matching findings
216
+ 2. **During validation**: Try known payloads first (higher success rate)
217
+ 3. **After validation**: Store new successful patterns with confidence scores
218
+ 4. **Over time**: Agent converges on most effective payloads per tech stack
219
+
220
+ ---
221
+
222
+ ## Cost Optimization
223
+
224
+ ### Estimated Cost by Scenario
225
+ | Scenario | Tier Mix | Findings | Est. Cost | Est. Time |
226
+ |----------|----------|----------|-----------|-----------|
227
+ | PR check (source only) | 100% Tier 1 | 5 | $0 | <5s |
228
+ | Sprint validation | 70% T1, 30% T2 | 15 | $2-5 | 5-10 min |
229
+ | Release validation | 40% T1, 40% T2, 20% T3 | 25 | $8-15 | 15-30 min |
230
+ | Full pentest | 20% T1, 30% T2, 50% T3 | 40 | $15-30 | 30-60 min |
231
+
232
+ ### Cost vs Shannon Comparison
233
+ | Metric | Shannon | AQE Pentest Validation |
234
+ |--------|---------|----------------------|
235
+ | Cost per run | ~$50 | $5-15 (graduated tiers) |
236
+ | Runtime | 60-90 min | 15-30 min (parallel pipelines) |
237
+ | False positive rate | Low (exploit-proven) | Low (same principle) |
238
+ | Learning | None (static prompts) | ReasoningBank playbook |
239
+
240
+ ---
241
+
242
+ ## Success Metrics
243
+
244
+ | Metric | Target | Measurement |
245
+ |--------|--------|-------------|
246
+ | False positive reduction | >60% of findings eliminated | Pre/post validator comparison |
247
+ | Exploit confirmation rate | >80% of confirmed findings truly exploitable | Manual PoC verification |
248
+ | Cost per run | <$15 USD | Token tracking per pipeline |
249
+ | Time per run | <30 minutes | Execution time metrics |
250
+ | Playbook growth | 100+ patterns after 6 months | Memory namespace count |
251
+
252
+ ---
253
+
254
+ ## Related Skills
255
+
256
+ - [security-testing](../security-testing/) - OWASP vulnerability scanning
257
+ - [qe-security-compliance](../qe-security-compliance/) - SAST/DAST automation
258
+ - [compliance-testing](../compliance-testing/) - Regulatory compliance
259
+ - [api-testing-patterns](../api-testing-patterns/) - API security testing
260
+ - [chaos-engineering-resilience](../chaos-engineering-resilience/) - Security under chaos
261
+
262
+ ---
263
+
264
+ ## Remember
265
+
266
+ **"No Exploit, No Report."** A vulnerability scanner that can't prove exploitation delivers uncertain value. This skill transforms security findings from theoretical risks into proven vulnerabilities with evidence. Every confirmed finding comes with a reproducible proof-of-concept. Every false positive is eliminated before it reaches the report.
267
+
268
+ **Think proof, not prediction.** Don't report what MIGHT be vulnerable. Prove what IS vulnerable.