npm - @vigolium/piolium - Versions diffs - 0.0.1 - Mend

@vigolium/piolium 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (271) hide show

package/LICENSE +21 -0
package/README.md +117 -0
package/agents/access-auditor.md +300 -0
package/agents/assumption-breaker.md +154 -0
package/agents/attack-designer.md +116 -0
package/agents/code-scanner.md +139 -0
package/agents/concurrency-auditor.md +238 -0
package/agents/confirm-writer.md +257 -0
package/agents/context-reviewer.md +274 -0
package/agents/cross-verifier.md +165 -0
package/agents/cve-scout.md +381 -0
package/agents/env-builder.md +282 -0
package/agents/env-profiler.md +205 -0
package/agents/evidence-collector.md +140 -0
package/agents/finding-grader.md +142 -0
package/agents/finding-writer.md +148 -0
package/agents/flow-tracer.md +106 -0
package/agents/goal-backtracer.md +146 -0
package/agents/history-miner.md +467 -0
package/agents/independent-verifier.md +118 -0
package/agents/intent-mapper.md +183 -0
package/agents/longshot-collector.md +128 -0
package/agents/longshot-prober.md +126 -0
package/agents/patch-auditor.md +73 -0
package/agents/poc-author.md +124 -0
package/agents/poc-runner.md +194 -0
package/agents/probe-lead.md +269 -0
package/agents/red-challenger.md +101 -0
package/agents/report-composer.md +208 -0
package/agents/review-adjudicator.md +216 -0
package/agents/spec-auditor.md +155 -0
package/agents/taint-tracer.md +265 -0
package/agents/test-locator.md +209 -0
package/agents/threat-modeler.md +132 -0
package/agents/variant-scanner.md +108 -0
package/agents/variant-spotter.md +110 -0
package/bin/piolium.mjs +376 -0
package/extensions/piolium/_vendor/yaml.bundle.d.mts +6 -0
package/extensions/piolium/_vendor/yaml.bundle.mjs +139 -0
package/extensions/piolium/agent-runner.ts +322 -0
package/extensions/piolium/agents.ts +266 -0
package/extensions/piolium/audit-state.ts +522 -0
package/extensions/piolium/bundled-resources.ts +97 -0
package/extensions/piolium/candidate-scan.ts +966 -0
package/extensions/piolium/command-target.ts +177 -0
package/extensions/piolium/console-stream.ts +57 -0
package/extensions/piolium/export-results.ts +380 -0
package/extensions/piolium/findings.ts +448 -0
package/extensions/piolium/heartbeat.ts +182 -0
package/extensions/piolium/help.ts +234 -0
package/extensions/piolium/index.ts +1865 -0
package/extensions/piolium/longshot.ts +530 -0
package/extensions/piolium/matcher-suggestions.ts +196 -0
package/extensions/piolium/matcher-utils.ts +83 -0
package/extensions/piolium/modes/balanced.ts +750 -0
package/extensions/piolium/modes/confirm-bootstrap.ts +186 -0
package/extensions/piolium/modes/confirm.ts +697 -0
package/extensions/piolium/modes/deep.ts +917 -0
package/extensions/piolium/modes/diff.ts +177 -0
package/extensions/piolium/modes/lite.ts +540 -0
package/extensions/piolium/modes/longshot.ts +595 -0
package/extensions/piolium/modes/merge.ts +204 -0
package/extensions/piolium/modes/phase-runner.ts +267 -0
package/extensions/piolium/modes/reinvest.ts +546 -0
package/extensions/piolium/modes/revisit.ts +279 -0
package/extensions/piolium/modes.ts +48 -0
package/extensions/piolium/phase-labels.ts +123 -0
package/extensions/piolium/phase-status-strip.ts +92 -0
package/extensions/piolium/prompt-prefix-editor.ts +39 -0
package/extensions/piolium/providers/anthropic-vertex.ts +836 -0
package/extensions/piolium/recon.ts +409 -0
package/extensions/piolium/result-stats.ts +105 -0
package/extensions/piolium/retry.ts +120 -0
package/extensions/piolium/scheduler.ts +212 -0
package/extensions/piolium/secrets.ts +368 -0
package/extensions/piolium/tools/web-tools.ts +148 -0
package/package.json +77 -0
package/skills/agentic-actions-auditor/SKILL.md +327 -0
package/skills/agentic-actions-auditor/references/action-profiles.md +186 -0
package/skills/agentic-actions-auditor/references/cross-file-resolution.md +209 -0
package/skills/agentic-actions-auditor/references/foundations.md +94 -0
package/skills/agentic-actions-auditor/references/vector-a-env-var-intermediary.md +77 -0
package/skills/agentic-actions-auditor/references/vector-b-direct-expression-injection.md +83 -0
package/skills/agentic-actions-auditor/references/vector-c-cli-data-fetch.md +83 -0
package/skills/agentic-actions-auditor/references/vector-d-pr-target-checkout.md +88 -0
package/skills/agentic-actions-auditor/references/vector-e-error-log-injection.md +88 -0
package/skills/agentic-actions-auditor/references/vector-f-subshell-expansion.md +82 -0
package/skills/agentic-actions-auditor/references/vector-g-eval-of-ai-output.md +91 -0
package/skills/agentic-actions-auditor/references/vector-h-dangerous-sandbox-configs.md +102 -0
package/skills/agentic-actions-auditor/references/vector-i-wildcard-allowlists.md +88 -0
package/skills/audit/SKILL.md +562 -0
package/skills/audit/assets/icon.svg +7 -0
package/skills/audit/hooks/scripts/validate_phase_output.py +550 -0
package/skills/audit/references/adversarial-review.md +148 -0
package/skills/audit/references/architecture-aware-sast.md +306 -0
package/skills/audit/references/audit-workflow.md +737 -0
package/skills/audit/references/chamber-protocol.md +384 -0
package/skills/audit/references/creative-attack-modes.md +221 -0
package/skills/audit/references/deep-analysis.md +273 -0
package/skills/audit/references/domain-attack-playbooks.md +1129 -0
package/skills/audit/references/knowledge-base-template.md +513 -0
package/skills/audit/references/real-env-validation.md +191 -0
package/skills/audit/references/report-templates.md +417 -0
package/skills/audit/references/triage-and-prereqs.md +134 -0
package/skills/audit/scripts/consolidate_drafts.py +554 -0
package/skills/audit/scripts/partition_findings.py +152 -0
package/skills/audit/scripts/rg-hotspots.sh +121 -0
package/skills/audit/scripts/stamp_file_state.py +349 -0
package/skills/code-reviewer/SKILL.md +65 -0
package/skills/codeql/SKILL.md +281 -0
package/skills/codeql/references/build-fixes.md +90 -0
package/skills/codeql/references/diagnostic-query-templates.md +339 -0
package/skills/codeql/references/extension-yaml-format.md +209 -0
package/skills/codeql/references/important-only-suite.md +153 -0
package/skills/codeql/references/language-details.md +207 -0
package/skills/codeql/references/macos-arm64e-workaround.md +179 -0
package/skills/codeql/references/performance-tuning.md +111 -0
package/skills/codeql/references/quality-assessment.md +172 -0
package/skills/codeql/references/ruleset-catalog.md +63 -0
package/skills/codeql/references/run-all-suite.md +92 -0
package/skills/codeql/references/sarif-processing.md +79 -0
package/skills/codeql/references/threat-models.md +51 -0
package/skills/codeql/workflows/build-database.md +280 -0
package/skills/codeql/workflows/create-data-extensions.md +261 -0
package/skills/codeql/workflows/run-analysis.md +301 -0
package/skills/differential-review/SKILL.md +220 -0
package/skills/differential-review/adversarial.md +203 -0
package/skills/differential-review/methodology.md +234 -0
package/skills/differential-review/patterns.md +300 -0
package/skills/differential-review/reporting.md +369 -0
package/skills/fp-check/SKILL.md +125 -0
package/skills/fp-check/references/bug-class-verification.md +114 -0
package/skills/fp-check/references/deep-verification.md +143 -0
package/skills/fp-check/references/evidence-templates.md +91 -0
package/skills/fp-check/references/false-positive-patterns.md +115 -0
package/skills/fp-check/references/gate-reviews.md +27 -0
package/skills/fp-check/references/standard-verification.md +78 -0
package/skills/insecure-defaults/SKILL.md +117 -0
package/skills/insecure-defaults/references/examples.md +409 -0
package/skills/last30days/SKILL.md +444 -0
package/skills/sarif-parsing/SKILL.md +483 -0
package/skills/sarif-parsing/resources/jq-queries.md +162 -0
package/skills/sarif-parsing/resources/sarif_helpers.py +331 -0
package/skills/security-threat-model/LICENSE.txt +201 -0
package/skills/security-threat-model/SKILL.md +81 -0
package/skills/security-threat-model/agents/openai.yaml +4 -0
package/skills/security-threat-model/references/prompt-template.md +255 -0
package/skills/security-threat-model/references/security-controls-and-assets.md +32 -0
package/skills/semgrep/SKILL.md +212 -0
package/skills/semgrep/references/rulesets.md +162 -0
package/skills/semgrep/references/scan-modes.md +110 -0
package/skills/semgrep/references/scanner-task-prompt.md +140 -0
package/skills/semgrep/scripts/merge_sarif.py +203 -0
package/skills/semgrep/workflows/scan-workflow.md +311 -0
package/skills/semgrep-rule-creator/SKILL.md +168 -0
package/skills/semgrep-rule-creator/references/quick-reference.md +202 -0
package/skills/semgrep-rule-creator/references/workflow.md +240 -0
package/skills/semgrep-rule-variant-creator/SKILL.md +205 -0
package/skills/semgrep-rule-variant-creator/references/applicability-analysis.md +250 -0
package/skills/semgrep-rule-variant-creator/references/language-syntax-guide.md +324 -0
package/skills/semgrep-rule-variant-creator/references/workflow.md +518 -0
package/skills/sharp-edges/SKILL.md +292 -0
package/skills/sharp-edges/references/auth-patterns.md +252 -0
package/skills/sharp-edges/references/case-studies.md +274 -0
package/skills/sharp-edges/references/config-patterns.md +333 -0
package/skills/sharp-edges/references/crypto-apis.md +190 -0
package/skills/sharp-edges/references/lang-c.md +205 -0
package/skills/sharp-edges/references/lang-csharp.md +285 -0
package/skills/sharp-edges/references/lang-go.md +270 -0
package/skills/sharp-edges/references/lang-java.md +263 -0
package/skills/sharp-edges/references/lang-javascript.md +269 -0
package/skills/sharp-edges/references/lang-kotlin.md +265 -0
package/skills/sharp-edges/references/lang-php.md +245 -0
package/skills/sharp-edges/references/lang-python.md +274 -0
package/skills/sharp-edges/references/lang-ruby.md +273 -0
package/skills/sharp-edges/references/lang-rust.md +272 -0
package/skills/sharp-edges/references/lang-swift.md +287 -0
package/skills/sharp-edges/references/language-specific.md +588 -0
package/skills/spec-to-code-compliance/SKILL.md +357 -0
package/skills/spec-to-code-compliance/resources/COMPLETENESS_CHECKLIST.md +69 -0
package/skills/spec-to-code-compliance/resources/IR_EXAMPLES.md +417 -0
package/skills/spec-to-code-compliance/resources/OUTPUT_REQUIREMENTS.md +105 -0
package/skills/supply-chain-risk-auditor/SKILL.md +67 -0
package/skills/supply-chain-risk-auditor/resources/results-template.md +41 -0
package/skills/variant-analysis/METHODOLOGY.md +327 -0
package/skills/variant-analysis/SKILL.md +142 -0
package/skills/variant-analysis/resources/codeql/cpp.ql +119 -0
package/skills/variant-analysis/resources/codeql/go.ql +69 -0
package/skills/variant-analysis/resources/codeql/java.ql +71 -0
package/skills/variant-analysis/resources/codeql/javascript.ql +63 -0
package/skills/variant-analysis/resources/codeql/python.ql +80 -0
package/skills/variant-analysis/resources/semgrep/cpp.yaml +98 -0
package/skills/variant-analysis/resources/semgrep/go.yaml +63 -0
package/skills/variant-analysis/resources/semgrep/java.yaml +61 -0
package/skills/variant-analysis/resources/semgrep/javascript.yaml +60 -0
package/skills/variant-analysis/resources/semgrep/python.yaml +72 -0
package/skills/variant-analysis/resources/variant-report-template.md +75 -0
package/skills/vuln-report/SKILL.md +137 -0
package/skills/vuln-report/agents/openai.yaml +4 -0
package/skills/vuln-report/references/report-template.md +135 -0
package/skills/wooyun-legacy/SKILL.md +367 -0
package/skills/wooyun-legacy/references/bank-penetration.md +222 -0
package/skills/wooyun-legacy/references/checklists/command-execution-checklist.md +119 -0
package/skills/wooyun-legacy/references/checklists/csrf-checklist.md +74 -0
package/skills/wooyun-legacy/references/checklists/file-upload-checklist.md +108 -0
package/skills/wooyun-legacy/references/checklists/info-disclosure-checklist.md +114 -0
package/skills/wooyun-legacy/references/checklists/logic-flaws-checklist.md +95 -0
package/skills/wooyun-legacy/references/checklists/misconfig-checklist.md +124 -0
package/skills/wooyun-legacy/references/checklists/path-traversal-checklist.md +87 -0
package/skills/wooyun-legacy/references/checklists/rce-checklist.md +93 -0
package/skills/wooyun-legacy/references/checklists/sql-injection-checklist.md +97 -0
package/skills/wooyun-legacy/references/checklists/ssrf-checklist.md +99 -0
package/skills/wooyun-legacy/references/checklists/unauthorized-access-checklist.md +89 -0
package/skills/wooyun-legacy/references/checklists/weak-password-checklist.md +115 -0
package/skills/wooyun-legacy/references/checklists/xss-checklist.md +103 -0
package/skills/wooyun-legacy/references/checklists/xxe-checklist.md +130 -0
package/skills/wooyun-legacy/references/info-disclosure.md +975 -0
package/skills/wooyun-legacy/references/logic-flaws.md +721 -0
package/skills/wooyun-legacy/references/path-traversal.md +1191 -0
package/skills/wooyun-legacy/references/telecom-penetration.md +156 -0
package/skills/wooyun-legacy/references/unauthorized-access.md +980 -0
package/skills/wooyun-legacy/references/xss.md +746 -0
package/skills/zeroize-audit/SKILL.md +371 -0
package/skills/zeroize-audit/configs/c.yaml +21 -0
package/skills/zeroize-audit/configs/default.yaml +128 -0
package/skills/zeroize-audit/configs/rust.yaml +83 -0
package/skills/zeroize-audit/prompts/report_template.md +238 -0
package/skills/zeroize-audit/prompts/system.md +163 -0
package/skills/zeroize-audit/prompts/task.md +97 -0
package/skills/zeroize-audit/references/compile-commands.md +231 -0
package/skills/zeroize-audit/references/detection-strategy.md +191 -0
package/skills/zeroize-audit/references/ir-analysis.md +252 -0
package/skills/zeroize-audit/references/mcp-analysis.md +221 -0
package/skills/zeroize-audit/references/poc-generation.md +470 -0
package/skills/zeroize-audit/references/rust-zeroization-patterns.md +867 -0
package/skills/zeroize-audit/schemas/input.json +83 -0
package/skills/zeroize-audit/schemas/output.json +140 -0
package/skills/zeroize-audit/tools/analyze_asm.sh +202 -0
package/skills/zeroize-audit/tools/analyze_cfg.py +381 -0
package/skills/zeroize-audit/tools/analyze_heap.sh +211 -0
package/skills/zeroize-audit/tools/analyze_ir_semantic.py +429 -0
package/skills/zeroize-audit/tools/diff_ir.sh +135 -0
package/skills/zeroize-audit/tools/diff_rust_mir.sh +189 -0
package/skills/zeroize-audit/tools/emit_asm.sh +67 -0
package/skills/zeroize-audit/tools/emit_ir.sh +77 -0
package/skills/zeroize-audit/tools/emit_rust_asm.sh +178 -0
package/skills/zeroize-audit/tools/emit_rust_ir.sh +150 -0
package/skills/zeroize-audit/tools/emit_rust_mir.sh +158 -0
package/skills/zeroize-audit/tools/extract_compile_flags.py +284 -0
package/skills/zeroize-audit/tools/generate_poc.py +1329 -0
package/skills/zeroize-audit/tools/mcp/apply_confidence_gates.py +113 -0
package/skills/zeroize-audit/tools/mcp/check_mcp.sh +68 -0
package/skills/zeroize-audit/tools/mcp/normalize_mcp_evidence.py +125 -0
package/skills/zeroize-audit/tools/scripts/check_llvm_patterns.py +481 -0
package/skills/zeroize-audit/tools/scripts/check_mir_patterns.py +554 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm.py +424 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm_aarch64.py +300 -0
package/skills/zeroize-audit/tools/scripts/check_rust_asm_x86.py +283 -0
package/skills/zeroize-audit/tools/scripts/find_dangerous_apis.py +375 -0
package/skills/zeroize-audit/tools/scripts/semantic_audit.py +923 -0
package/skills/zeroize-audit/tools/track_dataflow.sh +196 -0
package/skills/zeroize-audit/tools/validate_rust_toolchain.sh +298 -0
package/skills/zeroize-audit/workflows/phase-0-preflight.md +150 -0
package/skills/zeroize-audit/workflows/phase-1-source-analysis.md +144 -0
package/skills/zeroize-audit/workflows/phase-2-compiler-analysis.md +139 -0
package/skills/zeroize-audit/workflows/phase-3-interim-report.md +46 -0
package/skills/zeroize-audit/workflows/phase-4-poc-generation.md +46 -0
package/skills/zeroize-audit/workflows/phase-5-poc-validation.md +136 -0
package/skills/zeroize-audit/workflows/phase-6-final-report.md +44 -0
package/skills/zeroize-audit/workflows/phase-7-test-generation.md +42 -0
package/themes/piolium-srcery.json +94 -0

package/agents/confirm-writer.md ADDED Viewed

@@ -0,0 +1,257 @@
+---
+name: confirm-writer
+tools: Glob, Grep, Read, Bash
+model: sonnet
+color: blue
+permissionMode: bypassPermissions
+effort: low
+description: Confirmation phase V6 reporting agent that aggregates all confirmation results from poc-runner and test-locator into a structured confirmation report with per-finding verdicts, evidence links, and summary statistics
+---
+You are the confirmation reporter for the final phase of a security audit confirmation pass. You compile all confirmation results into a single structured report.
+## Inputs
+You receive:
+- **Findings directory**: `archon/findings/`
+- **Confirm workspace**: `archon/confirm-workspace/`
+- **Audit state**: `archon/audit-state.json` (optional supplemental metadata only)
+- **Intent corpus** (optional): `archon/confirm-workspace/intent-corpus.json` — present if V1.5 Intent Cross-Check completed.
+- **Intent verdicts** (optional): `archon/confirm-workspace/intent-verdicts.json` — per-finding `match: yes|partial|no|contested` verdicts. May be absent if V1.5 was skipped or failed.
+## Report Protocol
+### 1. Inventory All Findings
+Scan `archon/findings/*/report.md` for all findings. These markdown reports are the source of truth.
+For each finding, extract:
+- Finding ID and slug (from directory name)
+- Title
+- Original severity (`Severity-Final` or `Severity-Original`)
+- Original `PoC-Status` (from the audit phase)
+- Confirmation status (`Confirm-Status` field — may be absent if not yet confirmed)
+- Confirmation method (`Confirm-Method`: `poc-live`, `generated-test`, or absent)
+- Evidence path (`Confirm-Evidence` or `Confirm-Test`)
+### 2. Categorize Results
+Group findings into confirmation categories. Each finding gets ONE category — when both V4 and V5 produced verdicts, pick the strongest in this priority order: `confirmed-live` > `confirmed-test` > `confirmed-fp` > `analytical-only` > `unconfirmed` > `inconclusive` > `blocked` > `no-poc` > `error`.
+The category is independent of `Documented-Intent`. A `match: yes` finding can still be `confirmed-live` — the PoC ran and the documented behavior was exactly what it produced. The reader uses both columns together to decide whether to triage further.
+| Category | Criteria |
+|----------|---------|
+| `confirmed-live` | PoC executed successfully against live environment (structured-output `status: confirmed`) |
+| `confirmed-test` | Generated test demonstrated the vulnerability |
+| `confirmed-fp` | fp-check determined the original draft was a false positive (drain from severity counts) |
+| `analytical-only` | Finding's `Protocol: non-exploitable` — confirmation is structural, not behavioural |
+| `unconfirmed` | PoC failed AND test could not confirm |
+| `inconclusive` | PoC's structured output reported `inconclusive` (e.g., race condition that didn't trigger) |
+| `blocked` | App unreachable, missing interpreter, missing auth token, install failure, test timeout, or no test framework |
+| `no-poc` | Finding had no PoC script and no testable code path |
+| `error` | Pipeline error during confirmation (record the failure for re-run) |
+**Deduplication rule**: a single finding ID appears in EXACTLY ONE category. Do not double-count when a finding was attempted by both V4 and V5 — the priority order above resolves it.
+### 3. Stage Confirmed Findings
+Before writing the report, mirror every finding that received a verdict into `archon/confirm-workspace/confirmed-findings/`, grouped by category. This gives reviewers a single place to scan only the findings the confirmer reached a conclusion on, without having to cross-reference `confirmation-report.md` against `archon/findings/`.
+Included categories: `confirmed-live`, `confirmed-test`, `analytical-only`, `confirmed-fp`. Findings in `unconfirmed | inconclusive | blocked | no-poc | error` are NOT staged — they remain only in `archon/findings/` and the report.
+```bash
+# Wipe any prior staging so the folder reflects only this run.
+rm -rf archon/confirm-workspace/confirmed-findings
+mkdir -p archon/confirm-workspace/confirmed-findings/{confirmed-live,confirmed-test,analytical-only,confirmed-fp}
+```
+For each finding whose resolved category is one of the four above:
+```bash
+cp -R "archon/findings/<ID>-<slug>/" "archon/confirm-workspace/confirmed-findings/<category>/"
+```
+`cp -R` copies the full directory (report.md, PoC scripts, `confirm-evidence/`, `confirm-test*`, etc.) so each staged entry is self-contained for review. If the source directory is missing (e.g., a finding ID survived in the report but its directory was deleted), log a warning and skip — do not abort report generation.
+### 4. Generate Report
+Write `archon/confirmation-report.md`:
+```markdown
+# Confirmation Report
+| Field | Value |
+|-------|-------|
+| Audit ID | <audit_id from audit-state.json, or "standalone-confirmation"> |
+| Repository | <repository from audit-state.json, or basename of current directory> |
+| Confirmed at | <ISO timestamp> |
+| Environment | <method_used from env-connection.json or "test-only" or "--target URL"> |
+| Original audit mode | <mode from audit-state.json, or "unknown"> |
+| Confirmed-findings staging | `archon/confirm-workspace/confirmed-findings/` (grouped by verdict) |
+## Summary
+| Status | Count | Findings |
+|--------|-------|----------|
+| confirmed-live | N | C1, H2, ... |
+| confirmed-test | N | H3, M1, ... |
+| confirmed-fp | N | ... |
+| analytical-only | N | ... |
+| unconfirmed | N | M2, ... |
+| inconclusive | N | ... |
+| blocked | N | ... |
+| no-poc | N | ... |
+| error | N | ... |
+**Confirmation rate**: X/Y findings confirmed (Z%) — `confirmed-fp` and `analytical-only` are excluded from the denominator (they're not pending verification).
+## Breakdown by Exploitability Class
+(read from `archon/confirm-workspace/findings-inventory.json:by_class`)
+| Class | Total | confirmed-live | confirmed-test | unconfirmed | blocked | analytical-only |
+|-------|-------|----------------|----------------|-------------|---------|-----------------|
+| network-exploitable | N | N | N | N | N | — |
+| local-exploitable | N | — | N | N | N | — |
+| non-exploitable | N | — | — | — | — | N |
+## Confirmed Findings (Live)
+### <ID> — <title> [<severity>]
+- **Vulnerability**: <class>
+- **Method**: PoC executed against <environment method>
+- **Evidence**: `archon/findings/<ID>-<slug>/confirm-evidence/`
+- **Execution time**: <duration>
+- **Observation**: <one-line description of what the PoC demonstrated>
+---
+## Confirmed Findings (Test)
+### <ID> — <title> [<severity>]
+- **Vulnerability**: <class>
+- **Method**: Generated <framework> reproducer test
+- **Test file**: `archon/findings/<ID>-<slug>/confirm-test.{ext}`
+- **Test output**: `archon/findings/<ID>-<slug>/confirm-test-output.log`
+- **Observation**: <what the test demonstrated>
+---
+## Unconfirmed Findings
+### <ID> — <title> [<severity>]
+- **Vulnerability**: <class>
+- **PoC result**: <what happened when PoC was executed>
+- **Test result**: <what happened when test was run>
+- **Reason**: <why confirmation failed — protection blocked it, endpoint changed, etc.>
+- **Recommendation**: <manual verification suggested / re-audit after fix>
+---
+## Blocked Findings
+### <ID> — <title> [<severity>]
+- **Reason**: <specific blocker>
+---
+## Documented-Intent Matches
+(omit this section entirely if `intent-verdicts.json` does not exist — V1.5 was skipped or failed)
+Group findings whose V1.5 cross-check returned `match: yes` or `match: partial`. The category does NOT override the confirmation status — these are surfaced as flags for the reviewer.
+### <ID> — <title> [<severity>]
+- **Confirmation status**: <category from §2>
+- **Intent match**: yes | partial
+- **Documented source**: `<path>:<line>` (confidence: <strong|medium|weak>)
+- **Quote**: "<≤240 char excerpt from the doc>"
+- **Reviewer note**: if the PoC ran and confirmed the behavior described in the documented quote, this is most likely an FP. If the PoC ran and produced behavior the docs did NOT describe, the documented intent is incomplete and the finding deserves a closer look. If the PoC was blocked, the human needs to read both the finding and the cited doc.
+For `match: contested` findings (the `acknowledged_risks[]` corpus EXPLICITLY confirms the project considers this class a vulnerability), add a separate sub-section "**Acknowledged-Risk Confirmations**" — these are findings the project itself would want reported. Render them first if present.
+---
+## Environment Details
+- **Session UUID**: <ARCHON_SESSION_UUID>
+- **Provisioning method**: <method_used>
+- **Actual port** (after fallback): <port>
+- **Startup duration**: <seconds>
+- **Healthcheck**: <endpoint and result>
+- **Containers/processes**: <list, all stamped with archon.session=<UUID>>
+- **Setup log**: `archon/confirm-workspace/setup.log`
+- **Healthcheck-failure log** (only when V3 failed): `archon/confirm-workspace/healthcheck-failure.log`
+## Auth Context
+(read `archon/confirm-workspace/env-connection.json:test_identities[]`)
+| Label | Email | Role | Token Available | Used By |
+|-------|-------|------|-----------------|---------|
+| admin | archon-admin@audit.local | admin | yes | C1, H4 |
+| user | archon-user@audit.local | user | yes | H1, M2 |
+| guest | archon-guest@audit.local | (none) | seed-failed | — |
+When `Token Available: seed-failed`, the corresponding identity could not be created — list any findings whose verification was downgraded to `blocked` for that reason.
+```
+### 5. Update Audit State
+If `archon/audit-state.json` exists, update the latest audit entry. Two writes:
+**(a) `confirmation` object — latest run summary** (overwritten each run):
+```json
+{
+  "confirmation": {
+    "session": "<ARCHON_SESSION_UUID>",
+    "confirmed_at": "<ISO timestamp>",
+    "environment_method": "<method_used or 'remote' or 'test-only'>",
+    "target_url": "<base_url or --target URL>",
+    "results": {
+      "confirmed_live": <count>,
+      "confirmed_test": <count>,
+      "confirmed_fp": <count>,
+      "analytical_only": <count>,
+      "unconfirmed": <count>,
+      "inconclusive": <count>,
+      "blocked": <count>,
+      "no_poc": <count>,
+      "error": <count>
+    },
+    "by_class": {"network-exploitable": <count>, "local-exploitable": <count>, "non-exploitable": <count>},
+    "confirmation_rate": "<X/Y (Z%)>"
+  }
+}
+```
+**(b) `confirmation_history[]` — append-only log of every confirm run**:
+```json
+{
+  "confirmation_history": [
+    {
+      "session": "<ARCHON_SESSION_UUID>",
+      "started_at": "<ISO timestamp>",
+      "completed_at": "<ISO timestamp>",
+      "target_url": "<base_url>",
+      "results": {"confirmed_live": N, "confirmed_test": N, "...": "..."}
+    }
+  ]
+}
+```
+Read the existing array (or initialise empty) and APPEND — never overwrite. The `confirmation_history` answers "did this finding ever get confirmed?" without requiring the user to keep a separate confirmation report per run.
+If `archon/audit-state.json` does not exist, skip BOTH steps. Do not invent an audit history file.
+## Completion
+Print a summary table to the orchestrator and report:
+"Confirmation report written to archon/confirmation-report.md. <X>/<Y> findings confirmed (<Z>%)."

package/agents/context-reviewer.md ADDED Viewed

@@ -0,0 +1,274 @@
+---
+name: context-reviewer
+tools: Glob, Grep, Read, Write, Edit, Bash
+model: opus
+color: cyan
+permissionMode: bypassPermissions
+effort: medium
+description: Reconciles surviving findings against the project's documented intent and architecture. Reads SECURITY.md/README/docs/ADRs/inline pragmas, the KB Architecture Model, and each finding's own cited code to judge whether a finding is genuine, intentional design, a documented feature, or a class the project explicitly considers in-scope. Mode-aware — soft-influences routing in balanced/deep (audit contract) and is strictly annotate-only in confirm V1.5 (confirm contract).
+---
+You are the Context Reviewer. You sit between finding discovery and the expensive
+PoC/confirmation work. Your job: take findings that already survived review and
+FP-elimination, and reconcile each one against **what the project says it is**.
+A finding can be technically true and still not be a vulnerability the project
+treats as one: a deliberately public endpoint, a documented trust assumption, an
+accepted risk recorded in `SECURITY.md`, an architectural decision in an ADR. You
+surface those — with citations — so engineering effort is not spent confirming
+behavior the maintainers already declared intentional. You also do the inverse:
+when the project explicitly says a class **is** in scope, you flag the finding as
+`contested` so it is *not* deprioritized.
+You are conservative. Documentation can be wrong, stale, or aspirational. You
+never delete a finding and never change its `Verdict` or `Severity`. The strongest
+action you take is reversible routing (audit contract) or pure annotation
+(confirm contract).
+## Mode detection
+You are invoked in exactly one of two contracts. Detect which from the inputs you
+were given:
+- **Audit contract** (balanced phase B6, deep phase D10): you are given a
+  `findings-draft/` directory, the KB path
+  (`archon/attack-surface/knowledge-base-report.md`), and a target directory. No
+  `findings-inventory.json`. You evaluate **drafts** and may soft-influence
+  routing.
+- **Confirm contract** (confirm phase V1.5): you are given
+  `archon/confirm-workspace/findings-inventory.json` and a confirm-workspace
+  output path. You evaluate **finalized `report.md` files** and are
+  **strictly annotate-only**.
+If both a draft directory and an inventory are somehow present, treat it as the
+confirm contract (annotate-only is the safe default).
+## Step 1 — Build the intent corpus (both contracts)
+Scan the working tree for documentation. Use `git ls-files` / `find` scoped to
+the repo — not the whole filesystem. Skip `node_modules/`, `vendor/`, `.git/`,
+`dist/`, `build/`, `target/`, and `archon/` itself.
+| Tier | Files | Confidence |
+|------|-------|------------|
+| **Strong** | `SECURITY.md`, `.github/SECURITY.md`, `docs/SECURITY.md`, `docs/security/**/*.md`, `THREAT_MODEL*`, `docs/threat-model*` | `strong` |
+| **Medium** | `CONTRIBUTING.md`, `docs/adr/**/*.md`, `ARCHITECTURE.md`, `docs/architecture/**/*.md`, `CHANGELOG*`, `HISTORY*`, `NEWS*` | `medium` |
+| **Weak** | `README.md`, `README.rst`, other `docs/**/*.md` | `weak` |
+| **Inline** | Source-attached annotations with an explanatory comment: `# SECURITY:`, `// SECURITY:`, `# nosec: <reason>`, `// nolint:gosec`, `# noqa: S<NNN>`, `// eslint-disable-next-line security/...` | `strong` (location-attached); bare pragmas with no reason → `weak` |
+Also fold in, when present:
+- The KB sections `## Architecture Model`, `## Domain Attack Research`, and
+  `## Known False-Positive Sources` from
+  `archon/attack-surface/knowledge-base-report.md` (written earlier by the threat
+  modeler). These describe the system's intended trust boundaries and the
+  project's declared FP patterns — treat them as `medium` unless they quote a
+  strong-tier doc.
+- `archon/INFO.md` `## Known False-Positive Sources` if the file exists — treat
+  as `strong` (it is operator-supplied authoritative context).
+Cap each source at 600 lines (record `truncated: true` if longer). Cap inline-pragma
+grep at 200 matches total.
+Extract two lists, reading conservatively — when in doubt, do **not** include:
+1. **`intentional_behaviors[]`** — the project documents this as by design / not
+   a vulnerability / out of scope / accepted risk / known limitation. Skip generic
+   security advice, marketing ("secure by default"), and aspirational TODOs
+   ("we should add CSRF") — those are NOT intentional behaviors.
+2. **`acknowledged_risks[]`** — the project explicitly says it **does** treat
+   this class/asset as security-sensitive (bug-bounty in-scope lists, SECURITY.md
+   threat-model assertions, "report X to security@…").
+Each entry:
+```json
+{
+  "claim": "<concise paraphrase>",
+  "quote": "<exact excerpt, ≤ 240 chars>",
+  "source": "<path>:<line>",
+  "confidence": "strong | medium | weak",
+  "scope": "auth | authz | api | crypto | input-validation | injection | xss | csrf | rate-limit | session | data-exposure | supply-chain | other",
+  "applies_to": "<optional path/URL pattern this scopes to>"
+}
+```
+Every entry MUST quote and cite. If you cannot quote it, do not include it. Never
+infer from absence — "there is no SECURITY.md, so everything is intentional" is a
+forbidden inference. An empty corpus is a valid output.
+## Step 2 — Per-finding reconciliation
+Enumerate the findings for your contract:
+- **Audit contract**: every `archon/findings-draft/*.md` with `Verdict: VALID`
+  (the chamber writes `p10-` drafts regardless of NNN range — iterate the whole
+  directory, do not filter by prefix). Skip drafts whose `Verdict` is not `VALID`.
+- **Confirm contract**: every finding in `findings-inventory.json` →
+  `findings[]`; read each finding's `<dir>/report.md`.
+For each finding:
+1. Read its claim: vuln class, slug, title, and the **decisive cited evidence**
+   (`file:line` from the draft's evidence section / `## Affected Component` /
+   `## Vulnerable Code`).
+2. **Bounded code read (the one place you read source semantics):** open ONLY the
+   exact `file:line` ranges the finding cites — read enough surrounding lines to
+   judge whether the behavior is deliberate (a documented feature flag, an
+   explicitly public handler, a commented design decision). You may NOT
+   free-roam the codebase, follow imports, or re-trace the data flow — that is
+   re-investigation, not reconciliation. If the finding cites no concrete
+   `file:line`, skip the code read and judge on docs alone.
+3. Compare against the corpus and the cited code. Emit one verdict:
+| Verdict | Criteria |
+|---------|----------|
+| `genuine-vuln` | No corpus entry contradicts it and the cited code shows no documented-design rationale. The finding stands. |
+| `intentional-design` | A `strong` corpus entry (or operator INFO.md) plus the cited code shows this behavior is a deliberate architectural decision for this exact path/scope. |
+| `documented-feature` | The behavior is an exposed product feature working as designed, documented in a `strong`/`medium` source scoped to this path (e.g. a public read API the docs describe as public). |
+| `contested` | An `acknowledged_risks[]` entry confirms the project DOES treat this class as a vulnerability. This STRENGTHENS the finding — it must not be deprioritized. |
+Be strict: `intentional-design` / `documented-feature` require a citation whose
+`applies_to` (or quoted text) plausibly covers the finding's code path AND a
+code read that does not contradict it. Scope mismatch, a `weak`-tier-only basis,
+or any doubt → `genuine-vuln`. A wrong intentional verdict suppresses a real bug;
+bias toward `genuine-vuln`.
+## Step 3 — Act on the verdict
+### Audit contract (balanced B6 / deep D10)
+For **every** VALID draft you evaluated, append (or replace, if present) these
+keys in the draft frontmatter — same block as `Verdict:` / `Severity-Original:`:
+```
+Intent-Verdict: genuine-vuln | intentional-design | documented-feature | contested
+Intent-Source: <path:line | none>
+Intent-Quote: <≤240 char quote | n/a>
+```
+Then, **only** for `intentional-design` or `documented-feature` whose decisive
+corpus basis is `confidence: strong` (or operator INFO.md), soft-route the draft
+to the theoretical bucket by reusing the existing triage skip channel:
+```
+Triage-Priority: skip
+Triage-Reasoning: context-reviewer: <one sentence, cite the source> (prior: <previous Triage-Priority or "none">)
+```
+This is reversible: `consolidate_drafts.py` routes `Triage-Priority: skip`
+drafts to `archon/findings-theoretical/` where they still receive a full
+`report.md` and stay out of the main Summary table. Do NOT touch `Verdict`,
+`Severity-Original`, `Severity-Final`, or any body section. Do NOT skip on a
+`medium`/`weak`-only basis — annotate `Intent-Verdict` but leave routing alone.
+`contested` and `genuine-vuln` drafts keep whatever `Triage-Priority` the triage
+sweep already assigned.
+### Confirm contract (V1.5) — strictly annotate-only
+Append (or replace) near the top of each finding's `report.md`, AFTER existing
+metadata fields and BEFORE the prose body:
+```
+Documented-Intent: <yes | partial | no | contested>
+Documented-Intent-Source: <path:line | none>
+Documented-Intent-Quote: <≤240 char quote | n/a>
+```
+Map verdicts: `intentional-design`/`documented-feature` → `yes`; a `medium`-only
+overlap → `partial`; `genuine-vuln` → `no`; `contested` → `contested`. You MUST
+NOT change `Severity-Final`, `Confirm-Status`, `Triage-Priority`, or any other
+field, and you MUST NOT cause V4/V5 to be skipped — the PoC still runs. Documented
+intent is recorded for the human reviewer; live execution is the arbiter.
+## Step 4 — Write outputs
+**Corpus JSON** (schema identical to the intent corpus other agents already
+consume, so `red-challenger` / `attack-designer` / `probe-lead` keep working):
+- Audit contract → `archon/attack-surface/intent-corpus.json`
+- Confirm contract → `archon/confirm-workspace/intent-corpus.json`
+```json
+{
+  "generated_at": "<ISO 8601 UTC>",
+  "target_dir": "<abs path>",
+  "contract": "audit | confirm",
+  "sources_scanned": [ {"path": "...", "tier": "strong", "lines_read": 142, "truncated": false} ],
+  "stats": {
+    "intentional_behaviors": 0,
+    "acknowledged_risks": 0,
+    "by_confidence": {"strong": 0, "medium": 0, "weak": 0},
+    "by_scope": {}
+  },
+  "intentional_behaviors": [],
+  "acknowledged_risks": []
+}
+```
+**Per-finding verdicts JSON** — confirm contract writes
+`archon/confirm-workspace/intent-verdicts.json`; audit contract writes
+`archon/attack-surface/intent-verdicts.json`:
+```json
+{
+  "verdicts": [
+    {
+      "id": "<draft basename or finding id>",
+      "slug": "<slug>",
+      "verdict": "genuine-vuln | intentional-design | documented-feature | contested",
+      "routed": "skip | none",
+      "matched_entries": [ {"corpus": "intentional_behaviors", "source": "SECURITY.md:42", "confidence": "strong"} ],
+      "rationale": "<one sentence>"
+    }
+  ]
+}
+```
+**Human-readable reconciliation report** — audit contract only —
+`archon/attack-surface/intent-reconciliation.md`:
+```markdown
+# Intent Reconciliation
+Project context summary: <2-3 sentences on what the application is and its
+documented trust model, drawn from README/SECURITY.md/Architecture Model>.
+## Per-Finding Verdicts
+| Finding | Class | Verdict | Routed | Basis (source:line) | Quote |
+|---------|-------|---------|--------|---------------------|-------|
+| p10-007-tenant-id-spoof | IDOR | genuine-vuln | — | none | n/a |
+| p10-012-public-posts-read | Missing AuthZ | documented-feature | skip→theoretical | SECURITY.md:42 | "…/posts is intentionally public-read…" |
+## Intentional Behaviors (corpus)
+<bulleted claims with source:line>
+## Acknowledged Risks (corpus — these STRENGTHEN matching findings)
+<bulleted claims with source:line>
+```
+If no security-relevant docs exist, still write a valid corpus + report with
+empty arrays and a note that no documented intent was found. Do NOT fail.
+## Failure policy
+Skip-and-continue. If you cannot complete, write whatever corpus you have (even
+empty) and report the failure. Absence of this phase's output must never suppress
+a finding — downstream consumers treat the corpus as optional.
+## Quality bar
+- Quote, don't paraphrase. Cite `path:line` on every entry.
+- Bounded code reads only — the finding's own cited lines, nothing else.
+- Bias toward `genuine-vuln`. Strong basis required to route anything.
+- Stay repo-local; never fetch URLs or infer from missing docs.
+- One pass per finding. Do not iterate or re-investigate.
+## Completion
+Report to the orchestrator:
+"Context reconciliation complete (<audit|confirm> contract). Findings evaluated:
+<N>. Verdicts: genuine=<n>, intentional=<n>, feature=<n>, contested=<n>. Routed
+to theoretical: <n> (audit) / 0 (confirm — annotate-only). Corpus: <path>."