npm - @vigolium/piolium - Versions diffs - 0.0.1 - Mend

@vigolium/piolium 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (271) hide show

package/LICENSE +21 -0
package/README.md +117 -0
package/agents/access-auditor.md +300 -0
package/agents/assumption-breaker.md +154 -0
package/agents/attack-designer.md +116 -0
package/agents/code-scanner.md +139 -0
package/agents/concurrency-auditor.md +238 -0
package/agents/confirm-writer.md +257 -0
package/agents/context-reviewer.md +274 -0
package/agents/cross-verifier.md +165 -0
package/agents/cve-scout.md +381 -0
package/agents/env-builder.md +282 -0
package/agents/env-profiler.md +205 -0
package/agents/evidence-collector.md +140 -0
package/agents/finding-grader.md +142 -0
package/agents/finding-writer.md +148 -0
package/agents/flow-tracer.md +106 -0
package/agents/goal-backtracer.md +146 -0
package/agents/history-miner.md +467 -0
package/agents/independent-verifier.md +118 -0
package/agents/intent-mapper.md +183 -0
package/agents/longshot-collector.md +128 -0
package/agents/longshot-prober.md +126 -0
package/agents/patch-auditor.md +73 -0
package/agents/poc-author.md +124 -0
package/agents/poc-runner.md +194 -0
package/agents/probe-lead.md +269 -0
package/agents/red-challenger.md +101 -0
package/agents/report-composer.md +208 -0
package/agents/review-adjudicator.md +216 -0
package/agents/spec-auditor.md +155 -0
package/agents/taint-tracer.md +265 -0
package/agents/test-locator.md +209 -0
package/agents/threat-modeler.md +132 -0
package/agents/variant-scanner.md +108 -0
package/agents/variant-spotter.md +110 -0
package/bin/piolium.mjs +376 -0
package/extensions/piolium/_vendor/yaml.bundle.d.mts +6 -0
package/extensions/piolium/_vendor/yaml.bundle.mjs +139 -0
package/extensions/piolium/agent-runner.ts +322 -0
package/extensions/piolium/agents.ts +266 -0
package/extensions/piolium/audit-state.ts +522 -0
package/extensions/piolium/bundled-resources.ts +97 -0
package/extensions/piolium/candidate-scan.ts +966 -0
package/extensions/piolium/command-target.ts +177 -0
package/extensions/piolium/console-stream.ts +57 -0
package/extensions/piolium/export-results.ts +380 -0
package/extensions/piolium/findings.ts +448 -0
package/extensions/piolium/heartbeat.ts +182 -0
package/extensions/piolium/help.ts +234 -0
package/extensions/piolium/index.ts +1865 -0
package/extensions/piolium/longshot.ts +530 -0
package/extensions/piolium/matcher-suggestions.ts +196 -0
package/extensions/piolium/matcher-utils.ts +83 -0
package/extensions/piolium/modes/balanced.ts +750 -0
package/extensions/piolium/modes/confirm-bootstrap.ts +186 -0
package/extensions/piolium/modes/confirm.ts +697 -0
package/extensions/piolium/modes/deep.ts +917 -0
package/extensions/piolium/modes/diff.ts +177 -0
package/extensions/piolium/modes/lite.ts +540 -0
package/extensions/piolium/modes/longshot.ts +595 -0
package/extensions/piolium/modes/merge.ts +204 -0
package/extensions/piolium/modes/phase-runner.ts +267 -0
package/extensions/piolium/modes/reinvest.ts +546 -0
package/extensions/piolium/modes/revisit.ts +279 -0
package/extensions/piolium/modes.ts +48 -0
package/extensions/piolium/phase-labels.ts +123 -0
package/extensions/piolium/phase-status-strip.ts +92 -0
package/extensions/piolium/prompt-prefix-editor.ts +39 -0
package/extensions/piolium/providers/anthropic-vertex.ts +836 -0
package/extensions/piolium/recon.ts +409 -0
package/extensions/piolium/result-stats.ts +105 -0
package/extensions/piolium/retry.ts +120 -0
package/extensions/piolium/scheduler.ts +212 -0
package/extensions/piolium/secrets.ts +368 -0
package/extensions/piolium/tools/web-tools.ts +148 -0
package/package.json +77 -0
package/skills/agentic-actions-auditor/SKILL.md +327 -0
package/skills/agentic-actions-auditor/references/action-profiles.md +186 -0
package/skills/agentic-actions-auditor/references/cross-file-resolution.md +209 -0
package/skills/agentic-actions-auditor/references/foundations.md +94 -0
package/skills/agentic-actions-auditor/references/vector-a-env-var-intermediary.md +77 -0
package/skills/agentic-actions-auditor/references/vector-b-direct-expression-injection.md +83 -0
package/skills/agentic-actions-auditor/references/vector-c-cli-data-fetch.md +83 -0
package/skills/agentic-actions-auditor/references/vector-d-pr-target-checkout.md +88 -0
package/skills/agentic-actions-auditor/references/vector-e-error-log-injection.md +88 -0
package/skills/agentic-actions-auditor/references/vector-f-subshell-expansion.md +82 -0
package/skills/agentic-actions-auditor/references/vector-g-eval-of-ai-output.md +91 -0
package/skills/agentic-actions-auditor/references/vector-h-dangerous-sandbox-configs.md +102 -0
package/skills/agentic-actions-auditor/references/vector-i-wildcard-allowlists.md +88 -0
package/skills/audit/SKILL.md +562 -0
package/skills/audit/assets/icon.svg +7 -0
package/skills/audit/hooks/scripts/validate_phase_output.py +550 -0
package/skills/audit/references/adversarial-review.md +148 -0
package/skills/audit/references/architecture-aware-sast.md +306 -0
package/skills/audit/references/audit-workflow.md +737 -0
package/skills/audit/references/chamber-protocol.md +384 -0
package/skills/audit/references/creative-attack-modes.md +221 -0
package/skills/audit/references/deep-analysis.md +273 -0
package/skills/audit/references/domain-attack-playbooks.md +1129 -0
package/skills/audit/references/knowledge-base-template.md +513 -0
package/skills/audit/references/real-env-validation.md +191 -0
package/skills/audit/references/report-templates.md +417 -0
package/skills/audit/references/triage-and-prereqs.md +134 -0
package/skills/audit/scripts/consolidate_drafts.py +554 -0
package/skills/audit/scripts/partition_findings.py +152 -0
package/skills/audit/scripts/rg-hotspots.sh +121 -0
package/skills/audit/scripts/stamp_file_state.py +349 -0
package/skills/code-reviewer/SKILL.md +65 -0
package/skills/codeql/SKILL.md +281 -0
package/skills/codeql/references/build-fixes.md +90 -0
package/skills/codeql/references/diagnostic-query-templates.md +339 -0
package/skills/codeql/references/extension-yaml-format.md +209 -0
package/skills/codeql/references/important-only-suite.md +153 -0
package/skills/codeql/references/language-details.md +207 -0
package/skills/codeql/references/macos-arm64e-workaround.md +179 -0
package/skills/codeql/references/performance-tuning.md +111 -0
package/skills/codeql/references/quality-assessment.md +172 -0
package/skills/codeql/references/ruleset-catalog.md +63 -0
package/skills/codeql/references/run-all-suite.md +92 -0
package/skills/codeql/references/sarif-processing.md +79 -0
package/skills/codeql/references/threat-models.md +51 -0
package/skills/codeql/workflows/build-database.md +280 -0
package/skills/codeql/workflows/create-data-extensions.md +261 -0
package/skills/codeql/workflows/run-analysis.md +301 -0
package/skills/differential-review/SKILL.md +220 -0
package/skills/differential-review/adversarial.md +203 -0
package/skills/differential-review/methodology.md +234 -0
package/skills/differential-review/patterns.md +300 -0
package/skills/differential-review/reporting.md +369 -0
package/skills/fp-check/SKILL.md +125 -0
package/skills/fp-check/references/bug-class-verification.md +114 -0
package/skills/fp-check/references/deep-verification.md +143 -0
package/skills/fp-check/references/evidence-templates.md +91 -0
package/skills/fp-check/references/false-positive-patterns.md +115 -0
package/skills/fp-check/references/gate-reviews.md +27 -0
package/skills/fp-check/references/standard-verification.md +78 -0
package/skills/insecure-defaults/SKILL.md +117 -0
package/skills/insecure-defaults/references/examples.md +409 -0
package/skills/last30days/SKILL.md +444 -0
package/skills/sarif-parsing/SKILL.md +483 -0
package/skills/sarif-parsing/resources/jq-queries.md +162 -0
package/skills/sarif-parsing/resources/sarif_helpers.py +331 -0
package/skills/security-threat-model/LICENSE.txt +201 -0
package/skills/security-threat-model/SKILL.md +81 -0
package/skills/security-threat-model/agents/openai.yaml +4 -0
package/skills/security-threat-model/references/prompt-template.md +255 -0
package/skills/security-threat-model/references/security-controls-and-assets.md +32 -0
package/skills/semgrep/SKILL.md +212 -0
package/skills/semgrep/references/rulesets.md +162 -0
package/skills/semgrep/references/scan-modes.md +110 -0
package/skills/semgrep/references/scanner-task-prompt.md +140 -0
package/skills/semgrep/scripts/merge_sarif.py +203 -0
package/skills/semgrep/workflows/scan-workflow.md +311 -0
package/skills/semgrep-rule-creator/SKILL.md +168 -0
package/skills/semgrep-rule-creator/references/quick-reference.md +202 -0
package/skills/semgrep-rule-creator/references/workflow.md +240 -0
package/skills/semgrep-rule-variant-creator/SKILL.md +205 -0
package/skills/semgrep-rule-variant-creator/references/applicability-analysis.md +250 -0
package/skills/semgrep-rule-variant-creator/references/language-syntax-guide.md +324 -0
package/skills/semgrep-rule-variant-creator/references/workflow.md +518 -0
package/skills/sharp-edges/SKILL.md +292 -0
package/skills/sharp-edges/references/auth-patterns.md +252 -0
package/skills/sharp-edges/references/case-studies.md +274 -0
package/skills/sharp-edges/references/config-patterns.md +333 -0
package/skills/sharp-edges/references/crypto-apis.md +190 -0
package/skills/sharp-edges/references/lang-c.md +205 -0
package/skills/sharp-edges/references/lang-csharp.md +285 -0
package/skills/sharp-edges/references/lang-go.md +270 -0
package/skills/sharp-edges/references/lang-java.md +263 -0
package/skills/sharp-edges/references/lang-javascript.md +269 -0
package/skills/sharp-edges/references/lang-kotlin.md +265 -0
package/skills/sharp-edges/references/lang-php.md +245 -0
package/skills/sharp-edges/references/lang-python.md +274 -0
package/skills/sharp-edges/references/lang-ruby.md +273 -0
package/skills/sharp-edges/references/lang-rust.md +272 -0
package/skills/sharp-edges/references/lang-swift.md +287 -0
package/skills/sharp-edges/references/language-specific.md +588 -0
package/skills/spec-to-code-compliance/SKILL.md +357 -0
package/skills/spec-to-code-compliance/resources/COMPLETENESS_CHECKLIST.md +69 -0
package/skills/spec-to-code-compliance/resources/IR_EXAMPLES.md +417 -0
package/skills/spec-to-code-compliance/resources/OUTPUT_REQUIREMENTS.md +105 -0
package/skills/supply-chain-risk-auditor/SKILL.md +67 -0
package/skills/supply-chain-risk-auditor/resources/results-template.md +41 -0
package/skills/variant-analysis/METHODOLOGY.md +327 -0
package/skills/variant-analysis/SKILL.md +142 -0
package/skills/variant-analysis/resources/codeql/cpp.ql +119 -0
package/skills/variant-analysis/resources/codeql/go.ql +69 -0
package/skills/variant-analysis/resources/codeql/java.ql +71 -0
package/skills/variant-analysis/resources/codeql/javascript.ql +63 -0
package/skills/variant-analysis/resources/codeql/python.ql +80 -0
package/skills/variant-analysis/resources/semgrep/cpp.yaml +98 -0
package/skills/variant-analysis/resources/semgrep/go.yaml +63 -0
package/skills/variant-analysis/resources/semgrep/java.yaml +61 -0
package/skills/variant-analysis/resources/semgrep/javascript.yaml +60 -0
package/skills/variant-analysis/resources/semgrep/python.yaml +72 -0
package/skills/variant-analysis/resources/variant-report-template.md +75 -0
package/skills/vuln-report/SKILL.md +137 -0
package/skills/vuln-report/agents/openai.yaml +4 -0
package/skills/vuln-report/references/report-template.md +135 -0
package/skills/wooyun-legacy/SKILL.md +367 -0
package/skills/wooyun-legacy/references/bank-penetration.md +222 -0
package/skills/wooyun-legacy/references/checklists/command-execution-checklist.md +119 -0
package/skills/wooyun-legacy/references/checklists/csrf-checklist.md +74 -0
package/skills/wooyun-legacy/references/checklists/file-upload-checklist.md +108 -0
package/skills/wooyun-legacy/references/checklists/info-disclosure-checklist.md +114 -0
package/skills/wooyun-legacy/references/checklists/logic-flaws-checklist.md +95 -0
package/skills/wooyun-legacy/references/checklists/misconfig-checklist.md +124 -0
package/skills/wooyun-legacy/references/checklists/path-traversal-checklist.md +87 -0
package/skills/wooyun-legacy/references/checklists/rce-checklist.md +93 -0
package/skills/wooyun-legacy/references/checklists/sql-injection-checklist.md +97 -0
package/skills/wooyun-legacy/references/checklists/ssrf-checklist.md +99 -0
package/skills/wooyun-legacy/references/checklists/unauthorized-access-checklist.md +89 -0
package/skills/wooyun-legacy/references/checklists/weak-password-checklist.md +115 -0
package/skills/wooyun-legacy/references/checklists/xss-checklist.md +103 -0
package/skills/wooyun-legacy/references/checklists/xxe-checklist.md +130 -0
package/skills/wooyun-legacy/references/info-disclosure.md +975 -0
package/skills/wooyun-legacy/references/logic-flaws.md +721 -0
package/skills/wooyun-legacy/references/path-traversal.md +1191 -0
package/skills/wooyun-legacy/references/telecom-penetration.md +156 -0
package/skills/wooyun-legacy/references/unauthorized-access.md +980 -0
package/skills/wooyun-legacy/references/xss.md +746 -0
package/skills/zeroize-audit/SKILL.md +371 -0
package/skills/zeroize-audit/configs/c.yaml +21 -0
package/skills/zeroize-audit/configs/default.yaml +128 -0
package/skills/zeroize-audit/configs/rust.yaml +83 -0
package/skills/zeroize-audit/prompts/report_template.md +238 -0
package/skills/zeroize-audit/prompts/system.md +163 -0
package/skills/zeroize-audit/prompts/task.md +97 -0
package/skills/zeroize-audit/references/compile-commands.md +231 -0
package/skills/zeroize-audit/references/detection-strategy.md +191 -0
package/skills/zeroize-audit/references/ir-analysis.md +252 -0
package/skills/zeroize-audit/references/mcp-analysis.md +221 -0
package/skills/zeroize-audit/references/poc-generation.md +470 -0
package/skills/zeroize-audit/references/rust-zeroization-patterns.md +867 -0
package/skills/zeroize-audit/schemas/input.json +83 -0
package/skills/zeroize-audit/schemas/output.json +140 -0
package/skills/zeroize-audit/tools/analyze_asm.sh +202 -0
package/skills/zeroize-audit/tools/analyze_cfg.py +381 -0
package/skills/zeroize-audit/tools/analyze_heap.sh +211 -0
package/skills/zeroize-audit/tools/analyze_ir_semantic.py +429 -0
package/skills/zeroize-audit/tools/diff_ir.sh +135 -0
package/skills/zeroize-audit/tools/diff_rust_mir.sh +189 -0
package/skills/zeroize-audit/tools/emit_asm.sh +67 -0
package/skills/zeroize-audit/tools/emit_ir.sh +77 -0
package/skills/zeroize-audit/tools/emit_rust_asm.sh +178 -0
package/skills/zeroize-audit/tools/emit_rust_ir.sh +150 -0
package/skills/zeroize-audit/tools/emit_rust_mir.sh +158 -0
package/skills/zeroize-audit/tools/extract_compile_flags.py +284 -0
package/skills/zeroize-audit/tools/generate_poc.py +1329 -0
package/skills/zeroize-audit/tools/mcp/apply_confidence_gates.py +113 -0
package/skills/zeroize-audit/tools/mcp/check_mcp.sh +68 -0
package/skills/zeroize-audit/tools/mcp/normalize_mcp_evidence.py +125 -0
package/skills/zeroize-audit/tools/scripts/check_llvm_patterns.py +481 -0
package/skills/zeroize-audit/tools/scripts/check_mir_patterns.py +554 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm.py +424 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm_aarch64.py +300 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm_x86.py +283 -0
package/skills/zeroize-audit/tools/scripts/find_dangerous_apis.py +375 -0
package/skills/zeroize-audit/tools/scripts/semantic_audit.py +923 -0
package/skills/zeroize-audit/tools/track_dataflow.sh +196 -0
package/skills/zeroize-audit/tools/validate_rust_toolchain.sh +298 -0
package/skills/zeroize-audit/workflows/phase-0-preflight.md +150 -0
package/skills/zeroize-audit/workflows/phase-1-source-analysis.md +144 -0
package/skills/zeroize-audit/workflows/phase-2-compiler-analysis.md +139 -0
package/skills/zeroize-audit/workflows/phase-3-interim-report.md +46 -0
package/skills/zeroize-audit/workflows/phase-4-poc-generation.md +46 -0
package/skills/zeroize-audit/workflows/phase-5-poc-validation.md +136 -0
package/skills/zeroize-audit/workflows/phase-6-final-report.md +44 -0
package/skills/zeroize-audit/workflows/phase-7-test-generation.md +42 -0
package/themes/piolium-srcery.json +94 -0

package/skills/audit/references/adversarial-review.md ADDED Viewed

@@ -0,0 +1,148 @@
+# Adversarial Review Methodology (P11-LITE Cold Verification)
+Protocol for the Phase 11 Stage 2 cold verification agent. Under the Review Chamber model,
+the Devil's Advocate already challenged every finding during the Phase 10 debate. Stage 2 is
+therefore **scoped to CRITICAL and HIGH findings only** — Medium findings skip Stage 2 entirely.
+## Purpose
+The Devil's Advocate challenges findings while the debate context is hot, but shares the
+chamber's context window with other agents. Cold verification breaks any residual confirmation
+bias by spawning a fresh agent with no access to the chamber debate, forcing fully independent
+re-derivation. This is reserved for the highest-severity findings where the cost of a false
+positive or missed vulnerability is greatest.
+## Isolation Rules
+The adversarial reviewer agent receives **only**:
+- The finding draft file path (`archon/findings-draft/<phase>-<NNN>-<slug>.md`)
+The adversarial reviewer MUST NOT:
+- Read Phase 10 working notes or intermediate analysis files
+- Read the original agent's conversation history or reasoning chain
+- Read any file in `archon/` other than the single finding draft it was given
+- Be told what the finding agent concluded — only what the finding draft states
+The agent spawner must construct the task description from only the finding draft path. Do not include summaries, context, or the finding agent's reasoning.
+---
+## Step 1 — Restate and Decompose
+Read only the finding draft. Restate the vulnerability claim in your own words without copying the original description. Then decompose into testable sub-claims:
+- Sub-claim A: Attacker controls input X
+- Sub-claim B: Input X reaches code point Y without adequate sanitization
+- Sub-claim C: Code point Y causes security effect Z
+If any sub-claim is incoherent, logically impossible, or unsupported by the draft, record `Sub-claim failure: <which sub-claim and why>` and proceed to the verdict with DISPROVED.
+---
+## Step 2 — Independent Code Path Trace
+Starting from the entry point stated in the finding draft, trace the code path to the claimed sink independently. Do not rely on the finding draft's code snippets as a guide — trace from source yourself.
+Document:
+- Every validation or sanitization function encountered on the path
+- Every transformation applied to the input
+- Whether each control is bypassable given realistic attacker input
+- Framework-level protections active on this path (ORM, auto-escaping, CSRF tokens, etc.)
+If the code path cannot be traced as described, record the discrepancy.
+---
+## Step 3 — Protection Surface Search
+Actively search for controls that could block or mitigate the claimed attack. Check each layer:
+| Layer | What to Look For |
+|-------|-----------------|
+| Language-level | Type system enforcement, memory safety, bounds checking |
+| Framework-level | ORM parameterization, template auto-escaping, CSRF middleware, input validation decorators |
+| Middleware | WAF rules, proxy normalization, rate limiting, authentication enforcement |
+| Application-level | Allowlists, ownership checks, role verification, input length limits |
+| Documentation-level | `SECURITY.md`, changelogs, `CONTRIBUTING.md` — does the project explicitly accept this as a known risk? |
+Record each protection found and assess whether it blocks the claimed attack path.
+---
+## Step 4 — Real-Environment Reproduction
+Follow the procedures in `real-env-validation.md`. Provision an appropriate environment for the project type and attempt reproduction.
+Required:
+- Deploy at the same commit referenced in the finding draft
+- Verify the environment is working normally (healthcheck) before attempting exploitation
+- Attempt the reproduction steps from the finding draft exactly as written
+- If the first attempt fails, try up to 3 variations
+Record:
+- Environment type and provisioning commands used
+- Healthcheck result
+- Each attempt and its outcome
+- Evidence files stored in `archon/real-env-evidence/<slug>/`
+If real-environment reproduction is blocked (see `real-env-validation.md`), document the blocker and continue to Steps 5-7 based on code analysis only. Annotate `PoC-Status: theoretical`.
+---
+## Step 5 — Prosecution and Defense Briefs
+Write two independent arguments. Each must cite specific code locations and evidence from Steps 2-4.
+**Prosecution brief**: argue that the finding is a genuine, exploitable vulnerability. State the strongest possible case. Cite code, attacker input path, protection gaps, and reproduction evidence.
+**Defense brief**: argue that the finding is a false positive or unexploitable. State the strongest possible case. Cite protections found in Step 3, reproduction failures, and any preconditions that make exploitation unrealistic.
+Do not allow one brief to reference the other's reasoning. Write them independently.
+---
+## Step 6 — Severity Challenge
+Apply severity calibration from `triage-and-prereqs.md`. Start at MEDIUM regardless of what the finding draft states.
+- Document whether upgrade criteria for HIGH or CRITICAL are met with evidence
+- Document whether any downgrade signals apply
+- State `Severity-Challenge: <MEDIUM | HIGH | CRITICAL>` with a one-sentence justification
+If the challenged severity is lower than `Severity-Original` in the draft, the lower severity wins in the final record.
+---
+## Step 7 — Verdict
+**CONFIRMED** if both:
+- The prosecution brief survives the defense (no blocking protection was found)
+- AND real-environment reproduction succeeded (or reproduction was blocked with documented reason)
+**DISPROVED** if either:
+- The defense identifies a protection that blocks the claimed attack path
+- OR all reproduction attempts failed (3 variations tried and all failed)
+Write the verdict back into the finding draft:
+```
+Adversarial-Verdict: CONFIRMED | DISPROVED
+Adversarial-Rationale: <one sentence citing the decisive evidence>
+Severity-Final: <challenged severity if different from original, else same as original>
+PoC-Status: executed | theoretical | blocked
+```
+Write the full adversarial review to `archon/adversarial-reviews/<slug>-review.md` using the Adversarial Review Template from `report-templates.md`.
+If verdict is DISPROVED, also update the finding draft's top-level `Verdict:` field to `FALSE POSITIVE (adversarial)`.
+---
+## Rationalizations to Reject
+The following are not valid grounds for issuing CONFIRMED:
+- "The finding agent already verified this" — the finding agent's verification is why Stage 2 exists
+- "I cannot reproduce but the code looks vulnerable" — failed reproduction with no documented blocker is a DISPROVED signal
+- "Probably exploitable in some configuration" — theoretical exploitability is not confirmed exploitability
+- "The severity seems right based on the bug class" — severity must be derived from evidence, not class defaults
+- "The defense brief is weaker than the prosecution brief" — a plausible defense is sufficient to require reproduction before confirming

package/skills/audit/references/architecture-aware-sast.md ADDED Viewed

@@ -0,0 +1,306 @@
+# Architecture-Aware SAST
+Use this reference when Phase 3 identifies high-risk flows that built-in tooling may model incompletely.
+## Table of Contents
+1. [Purpose](#purpose)
+2. [Discovery Matrix](#discovery-matrix)
+3. [SAST Layering Model](#sast-layering-model)
+4. [How DFD and CFD Drive Modeling](#how-dfd-and-cfd-drive-modeling)
+5. [Load These References Before Authoring](#load-these-references-before-authoring)
+6. [Custom CodeQL Workflow](#custom-codeql-workflow)
+7. [Custom Semgrep Workflow](#custom-semgrep-workflow)
+8. [Semgrep Resource Tuning](#semgrep-resource-tuning)
+9. [Architecture Examples](#architecture-examples)
+## Purpose
+Run built-in CodeQL and built-in Semgrep coverage first. Add custom CodeQL and Semgrep coverage only when the architecture introduces blind spots:
+- custom wrappers around request parsing, RPC, auth, storage, or execution
+- generated interfaces, schemas, or IDLs that hide trust-boundary crossings
+- unusual transports or execution models
+- policy decisions separated from the dangerous sink by orchestration layers
+- complex multi-component flows where attacker control or identity propagation is easy to misread
+Custom rules do not replace built-in rules. They close gaps that built-ins cannot see well enough.
+## Discovery Matrix
+Use this matrix to decide what must be modeled.
+| Dimension | What to Inventory | Why It Matters |
+|----------|-------------------|----------------|
+| Ingress | HTTP handlers, CLI args, files, IPC, queues, webhooks, plugins, tool invocations | Identifies attacker-controlled sources |
+| Synchronous transports | HTTP clients, RPC clients, gRPC stubs, SDK wrappers, service clients | Identifies cross-component trust handoffs |
+| Asynchronous transports | queues, topics, events, schedulers, workers, retries | Identifies delayed or reordered security assumptions |
+| Control-plane interfaces | admin APIs, job orchestration, deployment hooks, agent control channels | Identifies higher-privilege decision paths |
+| Plugin and tool execution | extension APIs, agent tools, capability registration, command execution | Identifies confused-deputy and unsafe exposure risk |
+| Storage and serialization | ORM wrappers, caches, blobs, message encoders, protocol codecs | Identifies sink classes and parser drift |
+| Identity propagation | session lookup, token forwarding, headers, metadata, claims, tenant context | Identifies authn/authz blind spots |
+| Dependency and supply chain edges | manifests, lockfiles, build files, images, sidecars, generated code | Identifies vulnerable libraries and hidden execution paths |
+## SAST Layering Model
+Always apply SAST in this order:
+1. **Built-in CodeQL suites**
+   Use standard built-in suites for the languages present.
+2. **Built-in Semgrep baseline and language/framework rulesets**
+   Use whole-repo baseline coverage plus language and framework rulesets.
+3. **Custom CodeQL modeling**
+   Add data extensions and narrow QL queries where built-ins miss real flows or control invariants.
+4. **Custom Semgrep rules**
+   Add structural and pattern rules for unsafe registration, missing middleware, policy bypasses, and architecture-specific misuse patterns.
+Document the split in the `## Static Analysis Summary` section of `archon/attack-surface/knowledge-base-report.md`.
+## How DFD and CFD Drive Modeling
+Use Phase 3 outputs directly:
+- **DFD slices** identify sources, summaries, sinks, trust-boundary crossings, and serialization boundaries.
+- **CFD slices** identify policy gates, alternate paths, fallbacks, retries, orchestration logic, and bypass edges.
+For each high-risk slice, answer:
+1. Which input is attacker-controlled?
+2. Which transformations preserve or amplify attacker influence?
+3. Which decision points gate access or privilege?
+4. Which sink causes real impact?
+5. Which part is already covered by built-in tooling?
+6. Which part needs custom modeling?
+## Load These References Before Authoring
+Do not invent custom query or rule structure from memory. Open the relevant reference or template first.
+**For custom CodeQL models and queries:**
+- `../codeql/workflows/create-data-extensions.md`
+- `../codeql/workflows/run-analysis.md`
+- `../codeql/references/extension-yaml-format.md`
+- `../codeql/references/diagnostic-query-templates.md`
+- `../variant-analysis/resources/codeql/<language>.ql`
+**For custom Semgrep rules:**
+- `../variant-analysis/resources/semgrep/<language>.yaml`
+- `../semgrep/references/rulesets.md`
+Pick `<language>` from the repo slice you are modeling. Use the variant-analysis resources as a starting template, then narrow the pattern to the specific DFD/CFD slice.
+## Custom CodeQL Workflow
+Workflow:
+1. Start from the highest-risk DFD slice.
+2. Identify missing sources, summaries, or sinks caused by wrappers, adapters, generated interfaces, or custom transport layers.
+3. Open `../codeql/workflows/create-data-extensions.md` and follow it to create the missing data extensions.
+4. Use `../codeql/references/extension-yaml-format.md` for the exact YAML columns and language-specific format rules.
+5. Use `../codeql/references/diagnostic-query-templates.md` to build source and sink enumeration queries and confirm the new models are recognized.
+6. Start the custom QL file from `../variant-analysis/resources/codeql/<language>.ql`, then narrow it to the specific invariant from the DFD/CFD slice.
+7. Add narrow custom QL queries only for architecture-specific invariants, such as:
+   - missing authorization gate before a privileged sink
+   - identity forwarded without re-verification
+   - unsafe fallback path after a policy failure
+   - parsing or schema mismatch between adjacent layers
+8. Store artifacts under `archon/codeql-queries/`. Store slice reachability queries as
+   `archon/codeql-queries/slice-<name>.ql` — distinct from security-finding queries; their
+   purpose is structural validation of Phase 3 DFD slices, not vulnerability detection.
+9. In the report, cite the DFD/CFD slice that motivated each custom model or query.
+Prefer one narrow query per invariant over a broad speculative query pack.
+## Structural Extraction Workflow
+Run at the start of Phase 4, before any security scan, using the freshly built database stored at
+`archon/codeql-artifacts/db/`. The purpose is structural intelligence — not security findings.
+The outputs feed Phase 3 KB validation, Phase 4 inline enrichment, Phase 10 deep bug hunting, and
+Phase 12 variant analysis.
+### Why informational results matter
+CodeQL's `note`-level and informational results represent data flow nodes that CodeQL modeled but
+did not classify as exploitable under the current threat model or built-in query logic. These include
+sanitizer call sites, validation function calls, encoding/decoding nodes, transformation summaries,
+and intermediate propagation nodes on paths that terminate before a known sink. Retaining them gives
+manual reviewers an annotated map of where CodeQL tracked data and where it stopped — a negative
+result from CodeQL is as informative as a positive one.
+### Output files
+All outputs go to `archon/codeql-artifacts/`:
+| File | Content | Used by |
+|------|---------|---------|
+| `entry-points.json` | All recognized source nodes, by type and file:line | Phase 3 KB validation, Phase 5 |
+| `sinks.json` | All recognized sink nodes, by kind and file:line | Phase 5, Phase 10 |
+| `call-graph-slices.json` | Per-DFD-slice reachability: reachable bool, hop count, shortest paths | Phase 5, Phase 10 |
+| `flow-paths-raw.sarif` | Full unfiltered SARIF including note/none severity (git-ignored) | Phase 10 on-demand |
+| `flow-paths-all-severities.md` | Human-readable summary of informational/low results by rule | Phase 5, 7 |
+### Step 1: Source enumeration
+For each language in the repo, run the source enumeration query (RemoteFlowSource template, adjusted
+per language). Expand threat model scope if Phase 3 KB identified CLI args or env vars as
+attacker-controlled.
+```bash
+codeql query run \
+  --database=archon/codeql-artifacts/db/ \
+  --output=archon/codeql-artifacts/entry-points.bqrs \
+  -- archon/codeql-queries/list-sources.ql
+codeql bqrs decode \
+  --format=json \
+  --output=archon/codeql-artifacts/entry-points.json \
+  archon/codeql-artifacts/entry-points.bqrs
+```
+Include a `threat_model` field per record. Run additional passes with `--threat-model local` and
+`--threat-model environment` as needed and merge outputs.
+### Step 2: Sink enumeration
+Run the sink enumeration query for the detected language. Decode to `archon/codeql-artifacts/sinks.json`.
+Group results by `kind` field.
+### Step 3: Call graph slice queries
+For each high-risk DFD slice in `archon/attack-surface/knowledge-base-report.md` under
+`## Phase 4 CodeQL Extraction Targets`, author a narrow QL path-problem query that tests
+reachability from the identified source type to the identified sink kind. Use variant-analysis
+QL templates as a starting point. Store queries at `archon/codeql-queries/slice-<name>.ql`.
+Run with `--threat-model all`. Decode to JSON records in `call-graph-slices.json`:
+```json
+{
+  "slice": "user-input-to-exec",
+  "reachable": true,
+  "path_count": 3,
+  "shortest_paths": [
+    ["src/api/handler.py:42", "src/util/shell.py:17", "src/exec/run.py:91"]
+  ]
+}
+```
+If `reachable: false`, record as a meaningful signal for Phase 5: either the DFD slice is a
+false concern, or the source/sink models are incomplete and custom modeling is needed.
+### Step 4: Full raw SARIF with all severities
+Run the full security-and-quality suite with `--threat-model all`, writing unfiltered output:
+```bash
+codeql database analyze archon/codeql-artifacts/db/ \
+  --format=sarif-latest \
+  --output=archon/codeql-artifacts/flow-paths-raw.sarif \
+  --threads=0 \
+  --threat-model all
+```
+Expect 1.5-3x the file size of the security-only SARIF. This file is git-ignored.
+### Step 5: Human-readable informational summary
+Extract all `note`-level or unleveled results from the raw SARIF. Group by rule ID and write to
+`archon/codeql-artifacts/flow-paths-all-severities.md` with sections per rule category. This
+is the file Phase 10 reviewers read to understand where CodeQL tracked data and where it terminated.
+### Step 6: Generate Mermaid DFD and CFD diagrams
+After the JSON artifacts are written, generate machine-assisted DFD and CFD Mermaid diagrams and
+write them into the `## CodeQL Structural Analysis` section of `archon/attack-surface/knowledge-base-report.md`.
+**DFD diagram** — derive from `entry-points.json`, `call-graph-slices.json`, and `sinks.json`:
+- Nodes: all entry point file:lines as source boxes; all sink file:lines as sink boxes with their kind label
+- Intermediate nodes: for each reachable slice, include the intermediate call nodes from the
+  shortest path array as intermediate boxes
+- Solid edges: source → intermediate → sink for reachable slices
+- Dashed edges with label `no path (CodeQL)`: for slices where `reachable: false`
+Write the resulting `flowchart LR` Mermaid block to the `### Machine-Generated DFD Diagram`
+subsection of the KB.
+**CFD diagram** — derive from `flow-paths-all-severities.md` and `flow-paths-raw.sarif`:
+- Extract security-relevant conditional branch points from informational CodeQL results
+  (guards, validators, sanitizer calls) that appear on call-graph paths
+- Model each as a decision node with `passes` and `fails` edges
+- Include any known fallback/alternate paths from CFD slices in the Phase 3 KB
+- Write the resulting `flowchart TD` Mermaid block to the `### Machine-Generated CFD Diagram`
+  subsection of the KB
+If a diagram would exceed ~30 nodes, limit to the highest-risk slice paths only and note the
+truncation. If CodeQL extraction quality was low (few recognized sources/sinks), mark the diagram
+as `[incomplete — low extraction coverage]` rather than presenting misleading auto-generated paths.
+### Step 7: Update KB — CodeQL Structural Analysis section
+After all extraction steps complete, populate the `## CodeQL Structural Analysis` section of
+`archon/attack-surface/knowledge-base-report.md` from the JSON artifacts:
+- Fill entry point and sink tables from `entry-points.json` and `sinks.json`
+- Fill the call graph reachability table from `call-graph-slices.json`
+- Fill the informational flow node summary from `flow-paths-all-severities.md`
+- Cross-reference with the Phase 3 KB attack surface: flag any CodeQL-discovered source
+  missing from `## Attack Surface Summary`
+- Embed the Mermaid DFD and CFD diagrams from Step 6
+### When to skip
+Skip only if the CodeQL database build fails entirely (zero extracted files). Document the skip in
+`archon/attack-surface/knowledge-base-report.md`. The Phase 4 enrichment substep, Phase 10, and Phase 12 fall back to pure manual analysis.
+Do not skip for small repos — call graph reachability data is most valuable where DFD construction
+is complete but unvalidated.
+## Custom Semgrep Workflow
+Use custom Semgrep rules for structural and local patterns that are faster to express than QL, especially when you need to detect:
+- missing middleware, interceptors, or registration hooks
+- unsafe handler or tool exposure
+- privileged operations reachable from low-trust interfaces
+- inconsistent validation or policy checks across sibling code paths
+- wrappers that built-in Semgrep rules do not understand
+Workflow:
+1. Start from the highest-risk CFD slice.
+2. Identify the required security gate, registration step, or wrapper contract.
+3. Start the rule from `../variant-analysis/resources/semgrep/<language>.yaml`, then replace the generic pattern with the concrete unsafe shape from the slice.
+4. Keep the rule narrow: detect the missing gate, unsafe registration, or bypass shape, not every loosely related construct nearby.
+5. Scope the rule to the relevant files, paths, or languages.
+6. Validate the rule by checking that it matches the known risky instance and does not explode into noisy unrelated results.
+7. Store artifacts under `archon/semgrep-rules/`.
+8. In the report, cite the DFD/CFD slice that motivated each custom rule.
+Prefer a small set of precise rules over a large catch-all ruleset that is expensive and noisy.
+## Semgrep Resource Tuning
+Semgrep Pro can be expensive on large repos. Keep coverage while avoiding host saturation:
+1. Run a whole-repo baseline pass for high-signal built-in rulesets.
+2. Separate Pro-heavy taint passes from lightweight structural passes.
+3. Batch Pro-heavy scans by high-risk subsystem or architecture slice, not all at once.
+4. Use file, path, and language scoping aggressively for targeted passes.
+5. Prefer targeted follow-up passes for custom rules instead of repeating whole-repo broad scans.
+6. Record any batching, throttling, or narrowed scope in the `## Static Analysis Summary` section of `archon/attack-surface/knowledge-base-report.md`.
+The required outcome is bounded runtime without dropping mandatory built-in baseline coverage.
+## Architecture Examples
+Treat these as examples, not the full scope:
+- service-to-service HTTP APIs
+- gRPC and generated RPC clients
+- message brokers, queues, workers, and schedulers
+- plugins, extensions, and tool ecosystems
+- agent frameworks and MCP servers
+- desktop or local IPC
+- mixed control-plane and data-plane systems
+The discovery matrix and DFD/CFD slices decide what to model. Do not hard-code the audit to a short list of architecture names.