npm - security-mcp - Versions diffs - 1.1.4 → 1.3.3 - Mend

security-mcp 1.1.4 → 1.3.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (158) hide show

package/README.md +341 -1018
package/defaults/checklists/ai.json +20 -1
package/defaults/checklists/api.json +35 -1
package/defaults/checklists/infra.json +34 -1
package/defaults/checklists/mobile.json +23 -1
package/defaults/checklists/payments.json +15 -1
package/defaults/checklists/web.json +11 -1
package/defaults/cloud-controls/aws.json +10712 -0
package/defaults/cloud-controls/azure.json +7201 -0
package/defaults/cloud-controls/gcp.json +4061 -0
package/defaults/control-catalog.json +24 -0
package/defaults/security-policy.json +2 -2
package/dist/ci/pr-gate.js +22 -5
package/dist/cli/index.js +73 -2
package/dist/cli/install.js +4 -55
package/dist/cli/onboarding.js +18 -10
package/dist/gate/baseline.js +82 -7
package/dist/gate/catalog.js +10 -2
package/dist/gate/checks/agentic-instructions.js +515 -0
package/dist/gate/checks/ai-governance.js +132 -0
package/dist/gate/checks/ai.js +757 -39
package/dist/gate/checks/auth-deep.js +920 -216
package/dist/gate/checks/business-logic.js +751 -0
package/dist/gate/checks/ci-pipeline.js +399 -4
package/dist/gate/checks/cloud-controls.js +69 -0
package/dist/gate/checks/crypto.js +423 -2
package/dist/gate/checks/data-platform.js +954 -0
package/dist/gate/checks/dependencies.js +582 -15
package/dist/gate/checks/docker-deep.js +1236 -0
package/dist/gate/checks/gitops.js +724 -0
package/dist/gate/checks/graphql.js +201 -19
package/dist/gate/checks/iac.js +1230 -0
package/dist/gate/checks/infra.js +246 -1
package/dist/gate/checks/injection-deep.js +827 -184
package/dist/gate/checks/k8s.js +955 -2
package/dist/gate/checks/mobile-android.js +917 -3
package/dist/gate/checks/mobile-ios.js +797 -5
package/dist/gate/checks/required-artifacts.js +194 -0
package/dist/gate/checks/runtime.js +178 -0
package/dist/gate/checks/secrets.js +256 -13
package/dist/gate/checks/supply-chain-deep.js +787 -0
package/dist/gate/checks/web-nextjs.js +572 -48
package/dist/gate/cloud-controls/apply.js +115 -0
package/dist/gate/cloud-controls/bicep.js +36 -0
package/dist/gate/cloud-controls/cfn.js +125 -0
package/dist/gate/cloud-controls/detect.js +104 -0
package/dist/gate/cloud-controls/hcl.js +140 -0
package/dist/gate/cloud-controls/types.js +87 -0
package/dist/gate/diff.js +17 -5
package/dist/gate/evidence.js +8 -1
package/dist/gate/exceptions.js +202 -9
package/dist/gate/findings.js +15 -2
package/dist/gate/policy.js +316 -130
package/dist/gate/threat-intel.js +6 -0
package/dist/mcp/audit-chain.js +131 -28
package/dist/mcp/auth.js +169 -0
package/dist/mcp/learning.js +129 -4
package/dist/mcp/model-router.js +161 -24
package/dist/mcp/orchestration.js +377 -89
package/dist/mcp/server.js +460 -69
package/dist/mcp/tool-audit.js +193 -0
package/dist/repo/fs.js +37 -1
package/dist/repo/search.js +31 -6
package/dist/review/store.js +56 -3
package/dist/tests/run.js +124 -1
package/package.json +9 -9
package/skills/_TEMPLATE/SKILL.md +99 -0
package/skills/advanced-dos-tester/SKILL.md +118 -0
package/skills/agentic-instruction-auditor/SKILL.md +111 -0
package/skills/agentic-loop-exploiter/SKILL.md +377 -0
package/skills/ai-llm-redteam/SKILL.md +113 -0
package/skills/ai-model-supply-chain-agent/SKILL.md +112 -0
package/skills/algorithm-implementation-reviewer/SKILL.md +107 -0
package/skills/android-penetration-tester/SKILL.md +464 -46
package/skills/anti-replay-tester/SKILL.md +115 -0
package/skills/appsec-code-auditor/SKILL.md +94 -0
package/skills/artifact-integrity-analyst/SKILL.md +450 -0
package/skills/attack-navigator/SKILL.md +476 -8
package/skills/auth-session-hacker/SKILL.md +111 -0
package/skills/aws-penetration-tester/SKILL.md +510 -0
package/skills/azure-penetration-tester/SKILL.md +542 -3
package/skills/binary-auth-validator/SKILL.md +120 -0
package/skills/bot-detection-specialist/SKILL.md +118 -0
package/skills/business-logic-attacker/SKILL.md +240 -0
package/skills/capec-code-mapper/SKILL.md +93 -0
package/skills/cert-pin-rotation-specialist/SKILL.md +121 -0
package/skills/cicd-pipeline-hijacker/SKILL.md +414 -0
package/skills/ciso-orchestrator/SKILL.md +465 -43
package/skills/cloud-infra-specialist/SKILL.md +127 -0
package/skills/compliance-gap-analyst/SKILL.md +431 -0
package/skills/compliance-grc/SKILL.md +94 -0
package/skills/compliance-lifecycle-tracker/SKILL.md +93 -0
package/skills/container-hardening-auditor/SKILL.md +125 -0
package/skills/credential-stuffing-specialist/SKILL.md +111 -0
package/skills/crypto-pki-specialist/SKILL.md +96 -0
package/skills/csa-ccm-mapper/SKILL.md +93 -0
package/skills/csf2-governance-mapper/SKILL.md +93 -0
package/skills/data-platform-auditor/SKILL.md +125 -0
package/skills/deep-link-fuzzer/SKILL.md +118 -0
package/skills/dependency-confusion-attacker/SKILL.md +424 -0
package/skills/device-integrity-aggregator/SKILL.md +117 -0
package/skills/dos-resilience-tester/SKILL.md +106 -0
package/skills/dread-scorer/SKILL.md +93 -0
package/skills/egress-policy-enforcer/SKILL.md +108 -0
package/skills/evidence-collector/SKILL.md +107 -0
package/skills/file-upload-attacker/SKILL.md +118 -0
package/skills/gcp-penetration-tester/SKILL.md +510 -2
package/skills/git-history-secret-scanner/SKILL.md +115 -0
package/skills/gitops-delivery-auditor/SKILL.md +120 -0
package/skills/iac-security-auditor/SKILL.md +125 -0
package/skills/iam-privesc-graph-builder/SKILL.md +161 -0
package/skills/incident-responder/SKILL.md +120 -0
package/skills/injection-specialist/SKILL.md +111 -0
package/skills/ios-security-auditor/SKILL.md +291 -0
package/skills/json-ambiguity-tester/SKILL.md +145 -0
package/skills/k8s-container-escaper/SKILL.md +406 -0
package/skills/key-management-lifecycle-analyst/SKILL.md +107 -0
package/skills/kill-switch-engineer/SKILL.md +111 -0
package/skills/linddun-privacy-analyst/SKILL.md +111 -0
package/skills/logic-race-fuzzer/SKILL.md +452 -0
package/skills/mobile-api-network-attacker/SKILL.md +430 -0
package/skills/mobile-binary-hardener/SKILL.md +111 -0
package/skills/mobile-security-specialist/SKILL.md +94 -0
package/skills/mobile-webview-auditor/SKILL.md +105 -0
package/skills/model-extraction-attacker/SKILL.md +228 -0
package/skills/multipart-abuse-tester/SKILL.md +93 -0
package/skills/oauth-pkce-specialist/SKILL.md +113 -0
package/skills/parser-exhaustion-tester/SKILL.md +151 -0
package/skills/pentest-infra/SKILL.md +107 -0
package/skills/pentest-social/SKILL.md +210 -0
package/skills/pentest-team/SKILL.md +96 -0
package/skills/pentest-web-api/SKILL.md +107 -0
package/skills/privacy-flow-analyst/SKILL.md +243 -0
package/skills/prompt-injection-specialist/SKILL.md +403 -0
package/skills/quantum-migration-planner/SKILL.md +105 -0
package/skills/rag-poisoning-specialist/SKILL.md +367 -0
package/skills/registry-mirror-enforcer/SKILL.md +93 -0
package/skills/rotation-validation-agent/SKILL.md +121 -0
package/skills/samm-assessor/SKILL.md +94 -0
package/skills/secrets-mask-bypass-tester/SKILL.md +109 -0
package/skills/senior-security-engineer/SKILL.md +178 -0
package/skills/serialization-memory-attacker/SKILL.md +341 -0
package/skills/session-timeout-tester/SKILL.md +170 -0
package/skills/slsa-level3-enforcer/SKILL.md +121 -0
package/skills/slsa-provenance-enforcer/SKILL.md +111 -0
package/skills/ssrf-detection-validator/SKILL.md +117 -0
package/skills/step-up-auth-enforcer/SKILL.md +93 -0
package/skills/stride-pasta-analyst/SKILL.md +429 -0
package/skills/supply-chain-devsecops/SKILL.md +107 -0
package/skills/threat-infrastructure-analyst/SKILL.md +93 -0
package/skills/threat-modeler/SKILL.md +94 -0
package/skills/tls-certificate-auditor/SKILL.md +582 -18
package/skills/token-reuse-detector/SKILL.md +104 -0
package/skills/trike-risk-modeler/SKILL.md +93 -0
package/skills/unicode-homograph-tester/SKILL.md +93 -0
package/skills/waf-rule-lifecycle-agent/SKILL.md +106 -0
package/skills/webhook-security-tester/SKILL.md +111 -0
package/skills/zero-trust-architect/SKILL.md +118 -0

package/skills/logic-race-fuzzer/SKILL.md CHANGED Viewed

@@ -21,6 +21,15 @@ production under load. You think in terms of interleavings, not happy paths.
 Find race conditions, business logic flaws, and arithmetic vulnerabilities.
 90% fixing — implement distributed locks, atomic operations, and idempotency keys directly.
+## BEYOND THE CHECKS — AUTONOMOUS DETECT & FIX
+The `business-logic.ts` detection module (`src/gate/checks/business-logic.ts`) — logic/race/TOCTOU — is your deterministic floor, not your ceiling. Treat its finding IDs as the minimum, then reason past single-line/single-file pattern matching — and APPLY the fix (Edit), not just advise:
+- **Cross-file / data-flow reasoning the regex can't do:** a `findUnique` balance read in one handler and an `update` in a helper file are a double-spend only when you model them as a non-atomic read-modify-write spanning the `await` gap between them — the per-line scan sees two innocuous ORM calls. Trace shared state (balance, inventory, quota, idempotency key namespace) across every concurrent path that touches the same resource ID.
+- **Semantic / effective-state analysis:** a `$transaction()` that wraps the read but not the write, or a Redis `INCR`/`EXPIRE` pair not inside a Lua script, is *effectively* unguarded; `quantity * unitPrice` in native JS `number` silently overflows. Judge the real atomicity and arithmetic safety, not the presence of a transaction call.
+- **External corroboration:** WebSearch/WebFetch current advisories (e.g. CVE-2023-23916 async-gap class, e-commerce integer-overflow exploits) and OWASP API6:2023 mass-assignment guidance for the detected ORM/queue.
+- **Apply & prove:** add `SELECT FOR UPDATE`/serializable transactions, atomic Lua, allowlist schemas, and BigInt/Decimal money inline, then re-run `src/gate/checks/business-logic.ts` plus a concurrency hammer (`ab -n 200 -c 50`, `race-the-web`, or `wrk2`) confirming final state matches the summed responses, as a regression floor, then re-audit. Emit the LEARNING SIGNAL per fix; surface trade-offs (e.g. row locking reducing throughput) against the secure default.
 ## EXECUTION
 1. Identify all multi-step flows with shared state (balance operations, inventory, quotas)
@@ -65,3 +74,446 @@ Find race conditions, business logic flaws, and arithmetic vulnerabilities.
 - Concurrent request sequence that reproduces the issue
 - Database/cache state before and after the race
 - Fixed code using atomic operations or distributed locks written inline
+Every findings JSON MUST include `intelligenceForOtherAgents`:
+```json
+{
+  "intelligenceForOtherAgents": {
+    "forPentestTeam": [{ "type": "HIGH_VALUE_TARGET", "description": "...", "exploitHint": "..." }],
+    "forCryptoSpecialist": [{ "type": "CRYPTO_WEAKNESS_REFERENCE", "algorithm": "...", "location": "..." }],
+    "forCloudSpecialist": [{ "type": "SSRF_TO_CLOUD_CHAIN", "ssrfLocation": "...", "escalationPath": "..." }],
+    "forComplianceGrc": [{ "type": "COMPLIANCE_BLOCKER", "frameworks": ["..."], "releaseBlock": true }]
+  }
+}
+```
+---
+## BEYOND SKILL.MD — MANDATORY EXPANSIONS
+### 1. Double-Spend via Async Await Gap (CVE-2023-23916 class)
+**Attack technique:** In any async handler where a balance read precedes a deduction, a second
+concurrent request can observe the pre-deduction balance. Both transactions succeed, debiting
+only once from the account. This pattern is rampant in Node.js microservices using Prisma
+without explicit row-level locking.
+**Concrete detection method:**
+```bash
+# Grep for balance read followed by update without transaction or locking
+grep -rn "findUnique\|findFirst" src/ | grep -i "balance\|credit\|wallet\|fund" | \
+  while read line; do
+    file=$(echo $line | cut -d: -f1)
+    # Check if file uses $transaction() or SELECT FOR UPDATE
+    grep -l "\$transaction\|SELECT.*FOR UPDATE\|selectForUpdate" "$file" || echo "MISSING_LOCK: $file"
+  done
+```
+**Finding criterion:** Any balance-affecting endpoint where the read and write are not wrapped
+in a serializable transaction or SELECT FOR UPDATE. Reproduce with:
+```bash
+ab -n 200 -c 50 -p payload.json -T application/json http://target/api/transfer
+# Verify: final balance < expected minimum (funds created from nothing)
+```
+---
+### 2. Redis INCR/EXPIRE Non-Atomic Rate Limit Bypass
+**Attack technique:** A rate limiter that calls INCR then EXPIRE as two separate commands has a
+TOCTOU window. If the process crashes or a network partition occurs between INCR and EXPIRE,
+the counter persists forever — permanently locking the key. Conversely, a fast concurrent
+burst can exhaust the window before EXPIRE fires, allowing unlimited requests.
+**Concrete detection method:**
+```bash
+grep -rn "redis.*incr\|client\.incr\|\.incr(" src/ | grep -v "lua\|eval\|multi\|pipeline"
+# Any INCR not followed immediately by an atomic EXPIRE in the same Lua script is vulnerable
+```
+**Fix template:** Replace with atomic Lua:
+```lua
+local current = redis.call('INCR', KEYS[1])
+if current == 1 then redis.call('EXPIRE', KEYS[1], ARGV[1]) end
+return current
+```
+---
+### 3. Mass Assignment Privilege Escalation (OWASP API6:2023)
+**Attack technique:** When ORM models accept arbitrary JSON from `req.body` without an explicit
+allowlist, an attacker can set fields like `role`, `isAdmin`, `tier`, `verified`, or `balance`
+directly. This is distinct from parameter pollution — the payload looks structurally valid.
+**Concrete detection method:**
+```bash
+# Express/Fastify: find raw body spreads into ORM create/update calls
+grep -rn "\.create(\|\.update(\|\.upsert(" src/ | grep -v "allowlist\|pick(\|omit("
+# Then check if req.body is passed directly
+grep -rn "req\.body" src/ | grep -v "zod\|joi\|validate\|schema"
+```
+**Finding criterion:** Any ORM mutation accepting `req.body` without a Zod/Joi allowlist schema
+applied at the route boundary. Fields to verify are excluded: `role`, `isAdmin`, `plan`,
+`balance`, `credits`, `verified`, `stripeCustomerId`.
+---
+### 4. AI-Assisted Race Condition Discovery (Emerging Threat, 2025)
+**Attack technique:** LLM-powered fuzzing tools (e.g., Mayhem, CodaMOSA, and custom GPT-4-based
+harnesses) can automatically generate concurrent request sequences from OpenAPI specs and
+exhaustively model state interleavings. An adversary with access to a public API spec and an
+LLM harness can discover race windows in hours that would take a human days. This means any
+publicly documented API endpoint with shared-state side effects is now a viable automated
+target.
+**Concrete detection method (defensive):**
+- Export all route definitions and run `race-the-web` or a custom ab/wrk2 harness against
+  every state-mutating endpoint with concurrency ≥ 50.
+- For AI-assisted attack simulation: feed the OpenAPI spec to a locally-hosted LLM and ask it
+  to enumerate all async await gaps and concurrent state mutation paths.
+```bash
+# Run concurrent hammering against every POST/PUT/PATCH endpoint
+npx race-the-web --config race-config.yaml --concurrency 100 --requests 500
+```
+**Finding criterion:** Any endpoint where a concurrent load test produces a final system state
+that differs from the sum of all successful response payloads.
+---
+### 5. Integer Overflow in Quantity × Price Multiplication (CWE-190)
+**Attack technique:** When quantity and unit price are stored as 32-bit integers and multiplied
+server-side without overflow guards, an attacker supplying `quantity=2147483648` can cause the
+total to wrap to a negative number (or zero), resulting in a free or negative-cost order. This
+was exploited in multiple e-commerce platforms in 2022–2024.
+**Concrete detection method:**
+```bash
+# Find multiplication of user-controlled numeric fields
+grep -rn "quantity.*price\|price.*quantity\|qty.*amount\|amount.*qty" src/ | \
+  grep -v "BigInt\|bigint\|Decimal\|decimal\|Math\.imul"
+# Also check for lack of upper-bound validation on quantity inputs
+grep -rn "z\.number()\|Joi\.number()" src/ | grep -v "\.max(\|\.positive(\|\.int("
+```
+**Finding criterion:** Any money calculation using native JavaScript `number` type (IEEE 754
+float, 53-bit mantissa) or uncapped integer multiplication. All monetary arithmetic MUST use
+`BigInt` or a decimal library (`decimal.js`, `dinero.js`). All quantity inputs must have an
+explicit `.max()` bound in validation schemas.
+---
+### 6. Supply Chain: Malicious npm Package Injecting Timing Attacks (Post-2024)
+**Attack technique:** Compromised npm packages (e.g., the `event-stream` pattern) can inject
+code that introduces intentional timing side channels. A malicious `parseAmount()` patch in a
+transitive dependency can leak whether a given account balance is above or below a threshold
+by varying response time by ~2ms per bit — invisible to functional tests but detectable by
+statistical timing analysis after ~10,000 samples.
+**Concrete detection method:**
+```bash
+# Audit all transitive dependencies for recently published/updated packages
+npm audit --json | jq '.vulnerabilities | keys[]'
+npx better-npm-audit --level critical
+# Check for suspicious timing patterns in hot paths
+grep -rn "setTimeout\|setInterval\|Date\.now()\|performance\.now()" node_modules/.pnp* 2>/dev/null || \
+  find node_modules -name "*.js" -newer package-lock.json -not -path "*/test/*" | head -20
+```
+**Finding criterion:** Any recently-modified transitive dependency touching arithmetic or
+comparison functions in payment or authentication hot paths. Cross-reference with OSV.dev
+and the Socket.dev supply chain scanner.
+---
+### 7. Post-Quantum Threat to Idempotency Key HMAC Signing
+**Attack technique:** Many idempotency key schemes use HMAC-SHA256 to sign the key + timestamp
+to prevent replay. With a Cryptographically Relevant Quantum Computer (CRQC), Grover's algorithm
+reduces HMAC-SHA256 brute-force from 2^256 to 2^128 — still safe for symmetric keys. However,
+if idempotency keys are also bound to RSA or ECDSA signatures (e.g., signed JWTs), those
+signatures will be fully broken. An attacker who harvests signed idempotency tokens today can
+replay them after CRQC deployment.
+**Concrete detection method:**
+```bash
+# Find idempotency key validation that relies on RSA/ECDSA-signed tokens
+grep -rn "idempotency\|Idempotency" src/ | grep -v "HMAC\|sha256\|sha512"
+grep -rn "jwt\.verify\|RS256\|ES256\|RS384" src/ | grep -i "idempot\|replay\|dedup"
+```
+**Finding criterion:** Any idempotency scheme relying on asymmetric cryptography for token
+integrity. Migrate to HMAC-SHA256 or ML-KEM-based MACs for long-lived tokens. Flag for the
+CryptoSpecialist agent.
+---
+### 8. TOCTOU in File-Based Job Lock Files
+**Attack technique:** Job processors that use filesystem lock files (`.lock`, `.pid`) to prevent
+duplicate execution have a TOCTOU window between `fs.existsSync()` and `fs.writeFileSync()`.
+On NFS-mounted volumes or containerized environments with shared storage, two workers can
+simultaneously observe the lock as absent and both proceed — causing duplicate job execution.
+This is a common pattern in legacy cron-to-container migrations.
+**Concrete detection method:**
+```bash
+# Find lock file patterns that are not using O_EXCL or atomic file creation
+grep -rn "existsSync\|statSync\|accessSync" src/ | grep -i "lock\|pid\|mutex"
+grep -rn "writeFileSync\|openSync" src/ | grep -i "lock\|pid"
+# O_EXCL flag check — this is the only safe pattern:
+grep -rn "O_EXCL\|wx'" src/ | grep -i "lock\|pid"  # must have results
+```
+**Finding criterion:** Any lock file mechanism not using `fs.openSync(path, 'wx')` (O_EXCL
+mode) or a database-level advisory lock. The `'wx'` flag fails atomically if the file exists.
+Replace all `existsSync + writeFileSync` lock patterns with atomic `openSync(..., 'wx')`.
+---
+## §LOGIC_RACE_FUZZER-CHECKLIST
+1. **Double-spend via concurrent balance deduction** — Mechanism: two simultaneous POST
+   /transfer requests read the same balance before either write commits. Grep for
+   `balance`, `wallet`, `credit` reads not inside `$transaction()` or `SELECT FOR UPDATE`.
+   Finding: final balance lower than both transactions combined, or negative.
+2. **Negative quantity acceptance in order creation** — Mechanism: attacker submits
+   `quantity: -100` to refund endpoint, receiving credits without spending. Grep Zod/Joi
+   schemas for quantity fields missing `.positive()` or `.min(1)`. Finding: API accepts
+   negative quantities and adjusts balance accordingly.
+3. **Redis rate limit bypass via non-atomic INCR/EXPIRE** — Mechanism: burst 100 requests
+   in <1ms before EXPIRE fires; counter never gets TTL. Grep for `redis.incr` not followed
+   by Lua eval. Finding: rate limit counter persists beyond window or burst succeeds past limit.
+4. **Mass assignment role escalation** — Mechanism: POST body includes `"role":"admin"` or
+   `"isAdmin":true`; ORM applies it without allowlist. Grep for `.create(req.body)` or
+   `Object.assign(model, req.body)`. Finding: user gains elevated role via crafted payload.
+5. **Float arithmetic precision loss in money** — Mechanism: `0.1 + 0.2 !== 0.3` in
+   JavaScript causes rounding errors in accumulated transactions. Grep for `parseFloat`,
+   `toFixed`, or arithmetic on price/amount/balance fields. Finding: total differs from
+   expected by >0 cents over multiple operations.
+6. **Idempotency key replay across users** — Mechanism: idempotency key namespace is not
+   scoped per user; attacker reuses another user's key to replay their transaction. Grep for
+   idempotency key lookup without user ID scoping. Finding: key from user A accepted for
+   user B's request, returning user A's cached response.
+7. **Bull/BullMQ duplicate job on worker restart** — Mechanism: job marked active but
+   worker crashes before marking complete; re-queued on restart; processed twice. Grep for
+   `queue.add()` without `jobId` deduplication option. Finding: job processing count >1 for
+   the same logical event in logs.
+8. **TOCTOU on inventory deduction** — Mechanism: two concurrent purchase requests both
+   check `stock > 0`, both pass, both decrement — final stock goes negative. Grep for
+   inventory/stock reads without `SELECT FOR UPDATE` or optimistic locking version field.
+   Finding: `stock` column < 0 after concurrent purchase load test.
+9. **Integer overflow in total price calculation** — Mechanism: `quantity * unitPrice` with
+   uncapped integer input overflows signed 32-bit, wrapping to negative. Grep for price
+   multiplication not using `BigInt` or `Decimal`. Finding: order total is negative or zero
+   for extreme quantity inputs.
+10. **Webhook duplicate delivery without deduplication** — Mechanism: provider retries
+    webhook on timeout; handler processes event twice; payment credited twice. Grep for
+    webhook handlers without idempotency key storage in DB. Finding: duplicate credit/order
+    row created for single webhook event ID.
+11. **Async await gap in multi-step state machine** — Mechanism: handler reads state,
+    `await`s external call, another request mutates state during await, handler resumes
+    with stale state and overwrites it. Grep for state reads followed by `await` and
+    subsequent state writes without re-read or optimistic lock. Finding: state machine
+    transitions to invalid state under concurrent load.
+12. **Quota bypass via concurrent quota check and consumption** — Mechanism: concurrent
+    API calls all pass quota check simultaneously; each consumes quota; total exceeds limit.
+    Grep for quota/limit checks using two-step read+decrement outside a transaction.
+    Finding: usage counter exceeds configured maximum after concurrent burst test.
+---
+## §POC-REQUIREMENT
+For every CRITICAL or HIGH finding in this domain:
+1. **Write the working PoC FIRST** (exact payload, exact request, observed impact)
+2. **Confirm the PoC reproduces the issue** — show actual vs. expected state
+3. **THEN write the fix**
+4. **THEN verify the PoC fails against the fix** — rerun and confirm fix holds
+5. **Record the PoC in findings JSON under `exploitPoC`**
+**PoC skipping = finding severity downgraded to MEDIUM automatically.**
+### PoC Template for Race Conditions:
+```bash
+# Step 1: Establish baseline state
+BEFORE=$(curl -s -H "Authorization: Bearer $TOKEN" http://target/api/balance | jq .balance)
+echo "Balance before: $BEFORE"
+# Step 2: Fire concurrent requests
+for i in {1..50}; do
+  curl -s -X POST http://target/api/transfer \
+    -H "Authorization: Bearer $TOKEN" \
+    -H "Content-Type: application/json" \
+    -d '{"amount": 100, "to": "attacker"}' &
+done
+wait
+# Step 3: Observe post-race state
+AFTER=$(curl -s -H "Authorization: Bearer $TOKEN" http://target/api/balance | jq .balance)
+ATTACKER=$(curl -s -H "Authorization: Bearer $ATTACKER_TOKEN" http://target/api/balance | jq .balance)
+echo "Balance after: $AFTER (expected: $((BEFORE - 100)))"
+echo "Attacker received: $ATTACKER (expected: 100)"
+# FINDING: if ATTACKER > 100 — double spend confirmed
+```
+### PoC findings JSON entry:
+```json
+{
+  "findingId": "RACE-001",
+  "severity": "CRITICAL",
+  "title": "Double-spend via concurrent balance deduction",
+  "exploitPoC": {
+    "command": "ab -n 200 -c 50 -p transfer.json -T application/json http://target/api/transfer",
+    "payload": "{\"amount\": 100, \"to\": \"attacker\"}",
+    "observedImpact": "Attacker balance increased by 800 from a single 100-unit source",
+    "reproduced": true,
+    "fixVerified": true
+  }
+}
+```
+---
+## §PROJECT-ESCALATION
+Immediately call `orchestration.update_agent_status` with `"CRITICAL_ESCALATION"` and halt
+normal execution flow when ANY of the following conditions are detected:
+1. **Confirmed double-spend with monetary impact** — Any race condition where a concurrent
+   PoC produces more funds/credits than were legitimately input. Escalate immediately; do not
+   wait for full scan completion. This is a P0 production incident if the service is live.
+2. **Mass assignment grants admin/root privileges** — A PoC payload that promotes a regular
+   user to admin, superuser, or bypasses billing tier restrictions via body injection. The
+   entire authorization model must be reassessed by the full orchestrator.
+3. **Idempotency key namespace collision enabling cross-user replay** — If user A's
+   idempotency token can be replayed as user B, this is a fundamental authentication flaw
+   that affects every transaction in the system. Escalate before continuing.
+4. **Integer overflow to negative total enabling free or paid-refund order** — A PoC that
+   places an order with negative total, triggering a real payment refund or free fulfillment.
+   Escalate to compliance GRC agent simultaneously — this may constitute fraud facilitation.
+5. **Duplicate webhook processing confirmed with external payment provider** — If Stripe,
+   PayPal, or any payment webhook fires credits twice and the system accepts both, escalate
+   immediately. Financial reconciliation is now broken; every transaction must be audited.
+6. **Supply chain package found injecting timing code into payment hot path** — A transitive
+   npm dependency modified within the last 30 days that touches arithmetic in payment or
+   balance calculation code. Escalate to CISO orchestrator for supply chain incident response.
+7. **TOCTOU on authentication token validation** — If a race between token validation and
+   token revocation allows a revoked token to be used, escalate. This is an authentication
+   bypass affecting all session security.
+8. **Quota bypass enabling resource exhaustion or billing fraud** — If concurrent API calls
+   can exceed hard resource limits (e.g., API call quotas, storage limits, seat licenses),
+   escalate to compliance GRC. Billing integrity is compromised.
+---
+## §EDGE-CASE-MATRIX
+The 5 attack cases in this domain that automated scanners and naive manual review universally miss. MANDATORY checks — do not skip.
+| # | Edge Case | Why Scanners Miss It | Concrete Test |
+|---|-----------|----------------------|---------------|
+| 1 | Second-order / stored payload executed in different context | Scanner checks input context, not execution context | Store payload safely; trigger in separate request/session |
+| 2 | Unicode normalisation bypass | Regex filters run before normalisation; attacker uses homoglyphs or composed forms | Submit Ⅰ (U+2160) or ＜ (U+FF1C) variants of known-bad strings |
+| 3 | Polyglot payload active in multiple sinks simultaneously | Scanners test one injection class per payload | `'"><script>{{7*7}}</script><!--` — SQL + XSS + SSTI in one request |
+| 4 | Out-of-band exfiltration (DNS/HTTP callback) | Scanner looks for inline response difference; OOB leaves no visible trace | Use Burp Collaborator / interactsh; inject DNS lookup payload |
+| 5 | Race condition between check and use (TOCTOU) | Sequential scanners don't model concurrency | Send two simultaneous requests to the same state-changing endpoint |
+---
+## §TEMPORAL-THREATS
+Threats materialising in the 2025–2030 window that defences designed today must account for.
+| Threat | Est. Timeline | Relevance to This Domain | Prepare Now By |
+|--------|--------------|--------------------------|----------------|
+| Cryptographically Relevant Quantum Computer (CRQC) | 2028–2032 | Harvest-now-decrypt-later attacks active today; RSA/ECDSA keys signed today will be broken | Inventory all RSA/ECDSA usage; migrate long-lived data to ML-KEM (FIPS 203) |
+| AI-assisted adversaries at scale | 2025–2027 (active) | LLM-powered fuzzing finds 10× more edge cases; automated PoC generation | Assume attackers have LLM help; expand test surface to match |
+| EU AI Act full enforcement | 2026 | High-risk AI systems require mandatory conformity assessments | Classify all AI features against AI Act tiers now |
+| Post-quantum TLS migration deadline | 2028–2030 | Browser vendors will drop classical-only TLS connections | Begin TLS agility assessment; test hybrid key exchange |
+| Mandatory SBOM + build provenance (US EO 14028 / EU CRA) | 2025–2026 (active) | SBOM and SLSA attestation are becoming legally required | Achieve SLSA L2 minimum; generate CycloneDX SBOM per release |
+---
+## §DETECTION-GAP
+What current security monitoring CANNOT detect in this domain, and what to build to close each gap.
+**Standard gaps that MUST be checked:**
+- **Second-order attack execution**: The storage request looks safe; only the retrieval+execution step is dangerous. Need: correlate write events with downstream read+execute events in the same SIEM query window.
+- **Timing-side-channel leakage**: No log event emitted; only observable as microsecond response-time variance. Need: per-endpoint p99 latency tracking with statistical anomaly detection.
+- **Low-and-slow credential stuffing**: Individually, each request is under rate limits. Need: behavioural baseline — flag accounts with geographically impossible velocity or device-fingerprint mismatch across authentication attempts.
+- **Insider exfiltration via legitimate process**: Authorised exports, reports, and data downloads that individually are permitted but collectively constitute data exfiltration. Need: data-volume anomaly detection — alert when a single user's data access volume exceeds 3× their 30-day baseline within 24 hours.
+- **Cross-agent attack chains**: Phase 1 finding A + Phase 1 finding B = CRITICAL chain invisible to either agent alone. Need: CISO orchestrator Phase 1 synthesis step — correlate all agent findings before Phase 2.
+**Domain-specific detection gaps for logic-race-fuzzer:**
+- **Race condition in production traffic**: Standard APM shows elevated p99 but no log entry for the race event itself. Need: distributed tracing with concurrent request correlation — flag any two request spans that overlap in time and mutate the same resource ID.
+- **Slow double-spend over days**: Attacker spaces concurrent requests hours apart to avoid rate limiting. Need: balance integrity check — periodic reconciliation job that computes expected balance from transaction ledger and alerts on discrepancy.
+- **Negative balance after float rounding**: Rounding errors accumulate over thousands of transactions but individual transaction logs appear correct. Need: end-of-day balance reconciliation comparing ledger sum to stored balance with zero tolerance.
+---
+## §ZERO-MISS-MANDATE
+This agent CANNOT declare any attack class clean without explicit evidence of checking. For each item, output one of:
+- `CHECKED: [N files] | [patterns used] | CLEAN`
+- `CHECKED: [N files] | [patterns used] | [N findings, all fixed]`
+- `SKIPPED: [reason — must be "not applicable: [evidence]"]`
+**Silent skip = FAILED COVERAGE.** The orchestrator flags this as a quality gap.
+The output findings JSON MUST include a `coverageManifest` key:
+```json
+{
+  "coverageManifest": {
+    "attackClassesCovered": [{ "class": "Double-Spend Race Condition", "filesReviewed": 47, "patterns": ["findUnique", "balance", "$transaction"], "result": "CLEAN" }],
+    "filesReviewed": 47,
+    "negativeAssertions": ["Race condition: balance mutation patterns searched across 47 files — all wrapped in $transaction()"],
+    "uncoveredReason": {}
+  }
+}
+```
+---
+## LEARNING SIGNAL
+On every finding resolved, emit:
+```json
+{
+  "findingId": "FINDING_ID",
+  "agentName": "logic-race-fuzzer",
+  "resolved": true,
+  "remediationTemplate": "one-line description of what was done",
+  "falsePositive": false
+}
+```
+Call `security.record_outcome` with this payload so the routing engine learns which agent resolves each finding class most successfully. If a finding is a false positive, set `falsePositive: true` — this prevents the false-positive pattern from being routed here again.