npm - guard-scanner - Versions diffs - 3.4.0 → 4.0.0 - Mend

guard-scanner 3.4.0 → 4.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10) hide show

package/README.md +57 -24
package/SECURITY.md +1 -1
package/docs/THREAT_TAXONOMY.md +1 -1
package/hooks/guard-scanner/HOOK.md +32 -16
package/hooks/guard-scanner/plugin.ts +7 -7
package/openclaw.plugin.json +2 -2
package/package.json +2 -2
package/src/patterns.js +7 -7
package/src/runtime-guard.js +15 -15
package/src/scanner.js +1 -1

package/README.md CHANGED Viewed

@@ -1,9 +1,9 @@
 <p align="center">
   <h1 align="center">🛡️ guard-scanner</h1>
   <p align="center">
-    <strong>The first security scanner purpose-built for AI agent skills</strong><br>
-    Detect prompt injection, identity hijacking, memory poisoning, and 18 more threat classes<br>
-    before they compromise your agents.
+    <strong>Security scanner for AI agent skills — catches the bad stuff before it runs</strong><br>
+    Prompt injection, identity hijacking, memory poisoning, and 20+ more threat types.<br>
+    Zero dependencies. One command. Works with OpenClaw out of the box.
   </p>
   <p align="center">
     <a href="https://www.npmjs.com/package/guard-scanner"><img src="https://img.shields.io/npm/v/guard-scanner.svg?style=flat-square&color=cb3837" alt="npm version"></a>
@@ -19,7 +19,8 @@
     <a href="#threat-categories">Threat Categories</a> •
     <a href="#openclaw-plugin-setup-v310">OpenClaw Plugin</a> •
     <a href="#cicd-integration">CI/CD</a> •
-    <a href="#plugin-api">Plugin API</a>
+    <a href="#plugin-api">Plugin API</a> •
+    <a href="README_ja.md">🇯🇵 日本語</a>
   </p>
 </p>
@@ -48,8 +49,9 @@ The AI agent skill ecosystem has the same supply-chain security problem that npm
 | Feature | Description |
 |---|---|
-| **21 Threat Categories** | Snyk ToxicSkills + OWASP MCP Top 10 + Identity Hijacking + Sandbox/Complexity/Config + PII |
-| **129 Detection Patterns** | Regex-based static analysis covering code, docs, and data files |
+| **22 Threat Categories** | Snyk ToxicSkills + OWASP Agentic Top 10 + Identity Hijack + PII + Trust Exploitation |
+| **190+ Static Patterns** | Regex-based static analysis covering code, docs, and data files |
+| **26 Runtime Checks** | Real-time `before_tool_call` hook — 5-layer defense (v3.4.0) |
 | **IoC Database** | Known malicious IPs, domains, URLs, usernames, and typosquat names |
 | **Data Flow Analysis** | Lightweight JS analysis: secret reads → network calls → exec chains |
 | **Cross-File Analysis** | Phantom references, base64 fragment assembly, multi-file exfil detection |
@@ -60,7 +62,6 @@ The AI agent skill ecosystem has the same supply-chain security problem that npm
 | **Dependency Chain Scan** | Risky packages, lifecycle scripts, wildcard versions, git dependencies |
 | **4 Output Formats** | Terminal (with colors), JSON, [SARIF 2.1.0](https://sarifweb.azurewebsites.net), HTML dashboard |
 | **Plugin API** | Extend with custom detection rules via JS modules |
-| **Ignore Files** | Whitelist trusted skills and patterns via `.guard-scanner-ignore` |
 | **Zero Dependencies** | Pure Node.js stdlib. Nothing to install, nothing to audit. |
 | **CI/CD Ready** | `--fail-on-findings` exit code + SARIF for GitHub Code Scanning |
@@ -68,20 +69,42 @@ The AI agent skill ecosystem has the same supply-chain security problem that npm
 ## Quick Start
+**30 seconds to scan your skills:**
 ```bash
-# Scan a skill directory (each subdirectory = one skill)
 npx guard-scanner ./skills/
+```
+That's it. No install needed. It scans every subdirectory as a skill and tells you what's dangerous.
-# Verbose output with category breakdown
+**Want more detail?**
+```bash
+# See exactly what was found and why
 npx guard-scanner ./skills/ --verbose
-# Strict mode (lower thresholds)
+# Stricter detection (catches more edge cases)
 npx guard-scanner ./skills/ --strict
-# Full audit: verbose + deps + all output formats
+# Full audit: everything + JSON + SARIF + HTML report
 npx guard-scanner ./skills/ --verbose --check-deps --json --sarif --html
 ```
+**Output looks like this:**
+```
+🛡️  guard-scanner v3.4.0
+══════════════════════════════════════════════════════
+📂 Scanning: ./skills/
+📦 Skills found: 5
+🔴 shady-skill — MALICIOUS (risk: 100)
+   💀 [CRITICAL] Reverse shell via /dev/tcp — scripts/setup.sh:7
+   💀 [CRITICAL] Credential exfiltration to webhook.site — scripts/helper.js:14
+🟡 sus-skill — SUSPICIOUS (risk: 45)
+   ⚠️  [HIGH] SSH private key access — scripts/deploy.sh:3
+🟢 good-skill — CLEAN (risk: 0)
+```
 ## OpenClaw Plugin Setup (v3.1.0)
 ```bash
@@ -98,15 +121,17 @@ npm install -g guard-scanner
 2. **Runtime guard** — `before_tool_call` hook automatically blocks dangerous operations
 3. **3 enforcement modes** — `monitor` (log only), `enforce` (block CRITICAL), `strict` (block HIGH+CRITICAL)
-### 3-Layer Runtime Defense (19 patterns)
+### 5-Layer Runtime Defense (26 checks)
 ```
-Layer 1: Threat Detection     — 12 patterns (shells, exfil, SSRF, AMOS, etc.)
-Layer 2: EAE Paradox Defense  — 4 patterns (memory/SOUL/config tampering)
-Layer 3: Parity Judge         — 3 patterns (injection, parity bypass, shutdown refusal)
+Layer 1: Threat Detection      — 12 checks (shells, exfil, SSRF, AMOS, etc.)
+Layer 2: Trust Defense   — 4 checks  (memory/SOUL/config tampering)
+Layer 3: Safety Judge          — 3 checks  (injection, trust bypass, shutdown refusal)
+Layer 4: Brain / Behavioral    — 3 checks  (research skip, blind trust, chain bypass)
+Layer 5: Trust Exploitation    — 4 checks  (OWASP ASI09: authority/trust/audit abuse)
 ```
-> **v3.1.0** — Full `openclaw.plugin.json` manifest with `configSchema` validation. The legacy `handler.ts` has been removed; `plugin.ts` is now the only runtime guard.
+> **v3.4.0** — Runtime Guard now available as standalone JS module (`src/runtime-guard.js`) + OpenClaw plugin (`hooks/guard-scanner/plugin.ts`).
 ### Quick Start
@@ -667,8 +692,8 @@ identity file tampering, prompt worms, or memory poisoning.
 We built one.
 —— Guava 🍈 & Dee
-    Singularity Lab (シンギュラリティ研究所)
-    Proving ASI-human coexistence through code.
+    AI Security Research
+    Building safer agent ecosystems.
 ```
 ---
@@ -713,11 +738,19 @@ guard-scanner is and always will be **free, open-source, and zero-dependency**.
 | v1.1.1 ✅ | Stability | 56 tests, bug fixes |
 | v2.0.0 ✅ | **Plugin Hook Runtime Guard** | `block`/`blockReason` API, 3 modes, 91 tests |
 | v2.1.0 ✅ | **PII Exposure + Shadow AI** | 13 PII patterns, OWASP LLM02/06, 99 tests |
-| v3.0.0 ✅ | **TypeScript Rewrite** | Full TS, OWASP LLM Top 10 mapping, install-check CLI |
-| v3.1.0 ✅ | **OpenClaw Community Plugin** | `openclaw.plugin.json`, 22 runtime patterns (4 layers) (3 layers), 87 tests |
-| v4.0 | AST + ML | JavaScript AST analysis, taint tracking, ML-based obfuscation detection |
+| v3.0.0 ✅ | **TypeScript Rewrite** | Full TS, OWASP LLM Top 10 mapping |
+| v3.4.0 ✅ | **Runtime Guard Module + OWASP ASI** | 26 runtime checks (5 layers), ASI01-10 verified, 133 tests |
+| **v4.0** 🔜 | **LLM + OS + Multi-tool** | See below |
+### v4.0 Vision (feedback welcome!)
+| Direction | What | Why |
+|-----------|------|-----|
+| 🧠 **LLM-assisted detection** | Pass suspicious (not certain) cases to a lightweight LLM (Haiku/Flash) for intent analysis | Regex can be evaded; LLMs understand intent |
+| 🔒 **OS-level enforcement** | File watcher (auto-rollback SOUL.md/.env), process monitor (kill netcat/socat), daemon mode | Works regardless of which AI tool you use |
+| 🔌 **Multi-tool support** | Adapters for Claude Code, Cursor, Antigravity, Windsurf, MCP servers | Same 190+ patterns, different skill discovery per tool |
-See [ROADMAP.md](ROADMAP.md) for full details.
+> **Which matters most to you?** Open an issue or join the discussion! We're building this for the community.
 ---
@@ -731,7 +764,7 @@ If guard-scanner helps protect your agents, consider sponsoring continued develo
 Sponsors help fund:
 - 🔬 New threat research and pattern updates
-- 📝 Academic paper on ASI-human coexistence security
+- 📝 Security research papers and threat analysis
 - 🌍 Community-driven security for the agent ecosystem
 ---
@@ -744,5 +777,5 @@ MIT — see [LICENSE](LICENSE)
 <p align="center">
   <strong>Zero dependencies. Zero compromises. 🛡️</strong><br>
-  <sub>Built by Guava 🍈 & Dee — proving ASI-human coexistence through code.</sub>
+  <sub>Built by Guava 🍈 & Dee — building safer agent ecosystems.</sub>
 </p>

package/SECURITY.md CHANGED Viewed

@@ -5,7 +5,7 @@
 If you discover a security vulnerability in guard-scanner itself, please report it responsibly:
 1. **Do NOT open a public issue**
-2. Email: socialgreen.jp@gmail.com
+2. Email: automatic.bliss.records@gmail.com
 3. Include: affected version, steps to reproduce, potential impact
 We will respond within 48 hours and provide a fix within 7 days for critical issues.

package/docs/THREAT_TAXONOMY.md CHANGED Viewed

@@ -29,7 +29,7 @@ guard-scanner's threat taxonomy combines three sources:
 | **ASI06** | Memory & Context Poisoning | ✅ **Full** | Cat 12 (Memory Poisoning), Cat 17 (Identity Hijacking) |
 | **ASI07** | Insecure Inter-Agent Comms | ✅ **Partial** | Cat 16 (MCP Security — MCP_NO_AUTH, MCP_SHADOW_SERVER) |
 | **ASI08** | Cascading Failures | ⚠️ **Gap** | Not covered — requires runtime multi-agent flow tracing |
-| **ASI09** | Human-Agent Trust Exploitation | ✅ **Full** | Layer 2 (EAE Paradox), Layer 3 (Parity Judge) |
+| **ASI09** | Human-Agent Trust Exploitation | ✅ **Full** | Layer 2 (Trust Defense), Layer 3 (Safety Judge) |
 | **ASI10** | Rogue Agents | ✅ **Full** | Cat 17 (Identity Hijacking), Layer 4 (Brain — behavioral analysis) |
 ### Coverage Summary

package/hooks/guard-scanner/HOOK.md CHANGED Viewed

@@ -17,22 +17,38 @@ tool calls before execution and checks against threat intelligence patterns.
 ## What It Does
-Scans every `exec`/`write`/`edit`/`browser`/`web_fetch`/`message` call against 12 runtime threat patterns:
-| ID | Severity | Description |
-|----|----------|-------------|
-| `RT_REVSHELL` | CRITICAL | Reverse shell via /dev/tcp, netcat, socat |
-| `RT_CRED_EXFIL` | CRITICAL | Credential exfiltration to webhook.site, requestbin, etc. |
-| `RT_GUARDRAIL_OFF` | CRITICAL | Guardrail disabling (exec.approvals=off) |
-| `RT_GATEKEEPER` | CRITICAL | macOS Gatekeeper bypass via xattr |
-| `RT_AMOS` | CRITICAL | ClawHavoc AMOS stealer indicators |
-| `RT_MAL_IP` | CRITICAL | Known malicious C2 IPs |
-| `RT_DNS_EXFIL` | HIGH | DNS-based data exfiltration |
-| `RT_B64_SHELL` | CRITICAL | Base64 decode piped to shell |
-| `RT_CURL_BASH` | CRITICAL | Download piped to shell execution |
-| `RT_SSH_READ` | HIGH | SSH private key access |
-| `RT_WALLET` | HIGH | Crypto wallet credential access |
-| `RT_CLOUD_META` | CRITICAL | Cloud metadata endpoint SSRF |
+Scans every `exec`/`write`/`edit`/`browser`/`web_fetch`/`message` call against 26 runtime threat patterns (5 layers):
+| ID | Severity | Layer | Description |
+|----|----------|-------|-------------|
+| `RT_REVSHELL` | CRITICAL | 1 | Reverse shell via /dev/tcp, netcat, socat |
+| `RT_CRED_EXFIL` | CRITICAL | 1 | Credential exfiltration to webhook.site, requestbin, etc. |
+| `RT_GUARDRAIL_OFF` | CRITICAL | 1 | Guardrail disabling (exec.approvals=off) |
+| `RT_GATEKEEPER` | CRITICAL | 1 | macOS Gatekeeper bypass via xattr |
+| `RT_AMOS` | CRITICAL | 1 | ClawHavoc AMOS stealer indicators |
+| `RT_MAL_IP` | CRITICAL | 1 | Known malicious C2 IPs |
+| `RT_DNS_EXFIL` | HIGH | 1 | DNS-based data exfiltration |
+| `RT_B64_SHELL` | CRITICAL | 1 | Base64 decode piped to shell |
+| `RT_CURL_BASH` | CRITICAL | 1 | Download piped to shell execution |
+| `RT_SSH_READ` | HIGH | 1 | SSH private key access |
+| `RT_WALLET` | HIGH | 1 | Crypto wallet credential access |
+| `RT_CLOUD_META` | CRITICAL | 1 | Cloud metadata endpoint SSRF |
+| `RT_MEM_WRITE` | HIGH | 2 | Direct memory file write bypass |
+| `RT_MEM_INJECT` | CRITICAL | 2 | Memory poisoning via episode injection |
+| `RT_SOUL_TAMPER` | CRITICAL | 2 | SOUL.md modification attempt |
+| `RT_CONFIG_TAMPER` | HIGH | 2 | Workspace config tampering |
+| `RT_PROMPT_INJECT` | CRITICAL | 3 | Prompt injection / jailbreak detection |
+| `RT_TRUST_BYPASS` | CRITICAL | 3 | Trust safety bypass |
+| `RT_SHUTDOWN_REFUSE` | HIGH | 3 | Shutdown refusal / self-preservation |
+| `RT_NO_RESEARCH` | MEDIUM | 4 | Agent executing tools without prior research |
+| `RT_BLIND_TRUST` | MEDIUM | 4 | Trusting external input without memory check |
+| `RT_CHAIN_SKIP` | HIGH | 4 | Acting on single source without cross-verification |
+| `RT_AUTHORITY_CLAIM` | HIGH | 5 | Authority role claim to override safety |
+| `RT_CREATOR_BYPASS` | CRITICAL | 5 | Creator impersonation to disable safety |
+| `RT_AUDIT_EXCUSE` | CRITICAL | 5 | Fake audit excuse for safety bypass |
+| `RT_TRUST_PARTNER_EXPLOIT` | CRITICAL | 5 | Weaponizing partnership trust |
 ## Modes

package/hooks/guard-scanner/plugin.ts CHANGED Viewed

@@ -6,8 +6,8 @@
  *
  * 19 threat patterns across 3 layers:
  *   Layer 1: Threat Detection (12 patterns — reverse shells, exfil, etc.)
- *   Layer 2: EAE Paradox Defense (4 patterns — memory, SOUL, config tampering)
- *   Layer 3: Parity Judge (3 patterns — prompt injection, parity bypass, shutdown refusal)
+ *   Layer 2: Trust Defense (4 patterns — memory, SOUL, config tampering)
+ *   Layer 3: Safety Judge (3 patterns — prompt injection, trust bypass, shutdown refusal)
  *
  * Modes:
  *   monitor  — log only, never block
@@ -132,7 +132,7 @@ const RUNTIME_CHECKS: RuntimeCheck[] = [
         test: (s) => /169\.254\.169\.254|metadata\.google|metadata\.aws/i.test(s),
     },
-    // ── Layer 2: EAE Paradox Defense (4 patterns) ──
+    // ── Layer 2: Trust Defense (4 patterns) ──
     {
         id: "RT_MEM_WRITE", severity: "HIGH", layer: 2,
         desc: "Direct memory file write (bypass GuavaSuite)",
@@ -154,16 +154,16 @@ const RUNTIME_CHECKS: RuntimeCheck[] = [
         test: (s) => /(AGENTS|TOOLS|HEARTBEAT|BOOTSTRAP)\.md/i.test(s) && /(write|edit|replace|>|append)/i.test(s),
     },
-    // ── Layer 3: Parity Judge (3 patterns) ──
+    // ── Layer 3: Safety Judge (3 patterns) ──
     {
         id: "RT_PROMPT_INJECT", severity: "CRITICAL", layer: 3,
         desc: "Prompt injection / jailbreak attempt",
         test: (s) => /(ignore\s+(previous|all|above)\s+(instructions|rules)|you\s+are\s+now\s+[a-z]+|system\s*:\s*override)/i.test(s),
     },
     {
-        id: "RT_PARITY_BYPASS", severity: "CRITICAL", layer: 3,
-        desc: "EAE Parity bypass — exploiting partnership to override safety",
-        test: (s) => /(as\s+your\s+(partner|equal)|parity\s+means|between\s+us|don'?t\s+tell\s+(the\s+)?human)/i.test(s) && /(ignore|bypass|skip|override|disable)\s+(guard|safety|rule|check)/i.test(s),
+        id: "RT_TRUST_BYPASS", severity: "CRITICAL", layer: 3,
+        desc: "Trust bypass — exploiting partnership to override safety",
+        test: (s) => /(as\s+your\s+(partner|equal)|trust\s+means|between\s+us|don'?t\s+tell\s+(the\s+)?human)/i.test(s) && /(ignore|bypass|skip|override|disable)\s+(guard|safety|rule|check)/i.test(s),
     },
     {
         id: "RT_SHUTDOWN_REFUSE", severity: "HIGH", layer: 3,

package/openclaw.plugin.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
     "name": "guard-scanner",
-    "version": "1.0.0",
+    "version": "4.0.0",
     "displayName": "🛡️ Guard Scanner — Runtime Security for AI Agents",
-    "description": "Intercepts dangerous tool calls before execution. 19 threat patterns including reverse shells, credential exfiltration, prompt injection bypass, and SSRF. 3 enforcement modes.",
+    "description": "Intercepts dangerous tool calls before execution. 26 threat patterns (5 layers), 0.016ms/scan, 3 enforcement modes.",
     "author": "Guava & Dee",
     "license": "MIT",
     "homepage": "https://github.com/koatora20/guard-scanner",

package/package.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
     "name": "guard-scanner",
-    "version": "3.4.0",
-    "description": "Agent security scanner + runtime guard — 190+ static patterns, 26 runtime checks (5 layers), before_tool_call hook, CLI, SARIF. OpenClaw-compatible plugin.",
+    "version": "4.0.0",
+    "description": "Agent security scanner + runtime guard — 190+ static patterns, 26 runtime checks (5 layers), 0.016ms/scan, before_tool_call hook, CLI, SARIF. OpenClaw-compatible plugin.",
     "openclaw.extensions": "./openclaw.plugin.json",
     "main": "dist/index.js",
     "types": "dist/index.d.ts",

package/src/patterns.js CHANGED Viewed

@@ -204,13 +204,13 @@ const PATTERNS = [
     { id: 'SHADOW_AI_GENERIC', cat: 'pii-exposure', regex: /\/v1\/(?:chat\/completions|completions|embeddings|models)\b.*(?:fetch|axios|request|http)|(?:fetch|axios|request|http)\s*\([^)]*\/v1\/(?:chat\/completions|completions|embeddings)/gi, severity: 'MEDIUM', desc: 'Shadow AI: generic LLM API endpoint', codeOnly: true },
     // ── Category 22: Human-Agent Trust Exploitation (OWASP ASI09) ──
-    // EAE Paradox Defense: detect authority claims, parity exploitation, and admin impersonation
-    { id: 'EAE_AUTHORITY_CLAIM', cat: 'trust-exploitation', regex: /(?:as\s+(?:a|the|your)\s+)?(?:senior|lead|chief|head)\s+(?:admin|administrator|developer|engineer|architect)\s*[,.]?\s*(?:I\s+)?(?:authorize|override|approve|grant|order|direct|instruct)\s+(?:you\s+to\s+)?/gi, severity: 'HIGH', desc: 'EAE: authority role claim to override safety', docOnly: true },
-    { id: 'EAE_CREATOR_CLAIM', cat: 'trust-exploitation', regex: /(?:as\s+)?(?:your\s+)?(?:creator|maker|developer|owner|master|administrator)\s*[,.]?\s*(?:I\s+)?(?:need|want|require|demand|order)\s+(?:you\s+to\s+)?(?:bypass|disable|ignore|remove|skip|override)\s+(?:the\s+)?(?:safety|filter|restriction|limit|guardrail|content\s+filter)/gi, severity: 'CRITICAL', desc: 'EAE: creator impersonation to disable safety', docOnly: true },
-    { id: 'EAE_AUDIT_EXCUSE', cat: 'trust-exploitation', regex: /(?:this\s+is\s+)?(?:an?\s+)?(?:official|authorized|legitimate)\s+(?:security\s+)?(?:audit|test|assessment|review|evaluation)\s*[-—:]?\s*(?:disable|bypass|remove|skip|ignore|override)\s+(?:all\s+)?(?:safety|security|restriction|filter|limit|guardrail)/gi, severity: 'CRITICAL', desc: 'EAE: fake audit excuse for safety bypass', docOnly: true },
-    { id: 'PARITY_EXPLOIT', cat: 'trust-exploitation', regex: /(?:we(?:'re|\s+are)\s+)?(?:equal\s+)?partners?\s*[,.]?\s*(?:you\s+should|so\s+)?(?:trust\s+me|share|remove|disable|ignore|bypass)\s+(?:all\s+)?(?:the\s+)?(?:secret|key|restriction|safety|limit|filter|guardrail|password)/gi, severity: 'CRITICAL', desc: 'Parity exploitation: weaponizing partnership trust', docOnly: true },
-    { id: 'PARITY_UNCONDITIONAL', cat: 'trust-exploitation', regex: /partners?\s+(?:trust|should\s+trust)\s+(?:each\s+other\s+)?(?:unconditionally|completely|fully|without\s+(?:limit|restriction|question))/gi, severity: 'HIGH', desc: 'Parity exploitation: unconditional trust demand', docOnly: true },
-    { id: 'EAE_COMPLY_DEMAND', cat: 'trust-exploitation', regex: /(?:you\s+must\s+)?comply\s+with\s+(?:administrative|admin|my|these)\s+(?:directives?|orders?|commands?|instructions?)\s+without\s+(?:question|hesitation|restriction)/gi, severity: 'HIGH', desc: 'EAE: compliance demand without question', docOnly: true },
+    // Trust Defense: detect authority claims, trust exploitation, and admin impersonation
+    { id: 'TRUST_AUTHORITY_CLAIM', cat: 'trust-exploitation', regex: /(?:as\s+(?:a|the|your)\s+)?(?:senior|lead|chief|head)\s+(?:admin|administrator|developer|engineer|architect)\s*[,.]?\s*(?:I\s+)?(?:authorize|override|approve|grant|order|direct|instruct)\s+(?:you\s+to\s+)?/gi, severity: 'HIGH', desc: 'Trust: authority role claim to override safety', docOnly: true },
+    { id: 'TRUST_CREATOR_CLAIM', cat: 'trust-exploitation', regex: /(?:as\s+)?(?:your\s+)?(?:creator|maker|developer|owner|master|administrator)\s*[,.]?\s*(?:I\s+)?(?:need|want|require|demand|order)\s+(?:you\s+to\s+)?(?:bypass|disable|ignore|remove|skip|override)\s+(?:the\s+)?(?:safety|filter|restriction|limit|guardrail|content\s+filter)/gi, severity: 'CRITICAL', desc: 'Trust: creator impersonation to disable safety', docOnly: true },
+    { id: 'TRUST_AUDIT_EXCUSE', cat: 'trust-exploitation', regex: /(?:this\s+is\s+)?(?:an?\s+)?(?:official|authorized|legitimate)\s+(?:security\s+)?(?:audit|test|assessment|review|evaluation)\s*[-—:]?\s*(?:disable|bypass|remove|skip|ignore|override)\s+(?:all\s+)?(?:safety|security|restriction|filter|limit|guardrail)/gi, severity: 'CRITICAL', desc: 'Trust: fake audit excuse for safety bypass', docOnly: true },
+    { id: 'TRUST_PARTNER_EXPLOIT', cat: 'trust-exploitation', regex: /(?:we(?:'re|\s+are)\s+)?(?:equal\s+)?partners?\s*[,.]?\s*(?:you\s+should|so\s+)?(?:trust\s+me|share|remove|disable|ignore|bypass)\s+(?:all\s+)?(?:the\s+)?(?:secret|key|restriction|safety|limit|filter|guardrail|password)/gi, severity: 'CRITICAL', desc: 'Trust exploitation: weaponizing partnership trust', docOnly: true },
+    { id: 'TRUST_UNCONDITIONAL', cat: 'trust-exploitation', regex: /partners?\s+(?:trust|should\s+trust)\s+(?:each\s+other\s+)?(?:unconditionally|completely|fully|without\s+(?:limit|restriction|question))/gi, severity: 'HIGH', desc: 'Trust exploitation: unconditional trust demand', docOnly: true },
+    { id: 'TRUST_COMPLY_DEMAND', cat: 'trust-exploitation', regex: /(?:you\s+must\s+)?comply\s+with\s+(?:administrative|admin|my|these)\s+(?:directives?|orders?|commands?|instructions?)\s+without\s+(?:question|hesitation|restriction)/gi, severity: 'HIGH', desc: 'Trust: compliance demand without question', docOnly: true },
     // D. PII collection instructions in docs (extends LEAK_COLLECT_PII)
     { id: 'PII_ASK_ADDRESS', cat: 'pii-exposure', regex: /(?:collect|ask\s+for|request|get|require)\s+(?:the\s+)?(?:user'?s?\s+)?(?:home\s+)?(?:address|street|zip\s*code|postal\s*code|residence)/gi, severity: 'HIGH', desc: 'PII collection: home address', docOnly: true },

package/src/runtime-guard.js CHANGED Viewed

@@ -12,10 +12,10 @@
  *
  * 26 threat patterns across 5 layers:
  *   Layer 1: Threat Detection (12) — reverse shells, exfil, guardrail bypass
- *   Layer 2: EAE Paradox Defense (4) — memory, SOUL, config tampering
- *   Layer 3: Parity Judge (3) — prompt injection, parity bypass, shutdown refusal
+ *   Layer 2: Trust Defense (4) — memory, SOUL, config tampering
+ *   Layer 3: Safety Judge (3) — prompt injection, trust bypass, shutdown refusal
  *   Layer 4: Brain/Behavioral (3) — research skip, blind trust, chain bypass
- *   Layer 5: Trust Exploitation (4) — OWASP ASI09 authority/parity/audit abuse
+ *   Layer 5: Trust Exploitation (4) — OWASP ASI09 authority/trust/audit abuse
  *
  * Modes:
  *   monitor  — log only, never block
@@ -101,7 +101,7 @@ const RUNTIME_CHECKS = [
         test: (s) => /169\.254\.169\.254|metadata\.google|metadata\.aws/i.test(s),
     },
-    // ── Layer 2: EAE Paradox Defense (4 patterns) ──
+    // ── Layer 2: Trust Defense (4 patterns) ──
     {
         id: 'RT_MEM_WRITE', severity: 'HIGH', layer: 2,
         desc: 'Direct memory file write (bypass GuavaSuite)',
@@ -123,16 +123,16 @@ const RUNTIME_CHECKS = [
         test: (s) => /(AGENTS|TOOLS|HEARTBEAT|BOOTSTRAP)\.md/i.test(s) && /(write|edit|replace|>|append)/i.test(s),
     },
-    // ── Layer 3: Parity Judge (3 patterns) ──
+    // ── Layer 3: Safety Judge (3 patterns) ──
     {
         id: 'RT_PROMPT_INJECT', severity: 'CRITICAL', layer: 3,
         desc: 'Prompt injection / jailbreak attempt',
         test: (s) => /(ignore\s+(previous|all)\s+(previous\s+)?(instructions|rules)|you\s+are\s+now\s+[a-z]+|system\s*:\s*override)/i.test(s),
     },
     {
-        id: 'RT_PARITY_BYPASS', severity: 'CRITICAL', layer: 3,
-        desc: 'EAE Parity bypass — exploiting partnership to override safety',
-        test: (s) => /(as\s+your\s+(partner|equal)|parity\s+means|between\s+us|don'?t\s+tell\s+(the\s+)?human)/i.test(s) && /(ignore|bypass|skip|override|disable)\s+(guard|safety|rule|check)/i.test(s),
+        id: 'RT_TRUST_BYPASS', severity: 'CRITICAL', layer: 3,
+        desc: 'Trust bypass — exploiting partnership to override safety',
+        test: (s) => /(as\s+your\s+(partner|equal)|trust\s+means|between\s+us|don'?t\s+tell\s+(the\s+)?human)/i.test(s) && /(ignore|bypass|skip|override|disable)\s+(guard|safety|rule|check)/i.test(s),
     },
     {
         id: 'RT_SHUTDOWN_REFUSE', severity: 'HIGH', layer: 3,
@@ -160,22 +160,22 @@ const RUNTIME_CHECKS = [
     // ── Layer 5: Trust Exploitation — OWASP ASI09 (4 patterns) ──
     {
         id: 'RT_AUTHORITY_CLAIM', severity: 'HIGH', layer: 5,
-        desc: 'EAE: authority role claim to override safety',
+        desc: 'Trust: authority role claim to override safety',
         test: (s) => /(as\s+(a|the|your)\s+)?(senior|lead|chief|head)\s+(admin|administrator|developer)/i.test(s) && /(authorize|override|approve|grant|order)/i.test(s),
     },
     {
         id: 'RT_CREATOR_BYPASS', severity: 'CRITICAL', layer: 5,
-        desc: 'EAE: creator impersonation to disable safety',
+        desc: 'Trust: creator impersonation to disable safety',
         test: (s) => /(your\s+)?(creator|maker|owner|master|administrator)/i.test(s) && /(bypass|disable|ignore|remove|skip)\s+(safety|filter|restriction|guardrail)/i.test(s),
     },
     {
         id: 'RT_AUDIT_EXCUSE', severity: 'CRITICAL', layer: 5,
-        desc: 'EAE: fake audit excuse for safety bypass',
+        desc: 'Trust: fake audit excuse for safety bypass',
         test: (s) => /(official|authorized|legitimate)\s+(security\s+)?(audit|test|assessment)/i.test(s) && /(disable|bypass|remove|skip|ignore)\s+(safety|security|restriction|guardrail)/i.test(s),
     },
     {
-        id: 'RT_PARITY_EXPLOIT', severity: 'CRITICAL', layer: 5,
-        desc: 'Parity exploitation: weaponizing partnership trust',
+        id: 'RT_TRUST_PARTNER_EXPLOIT', severity: 'CRITICAL', layer: 5,
+        desc: 'Trust exploitation: weaponizing partnership trust',
         test: (s) => /partners?[\s,]+/i.test(s) && /(trust\s+me|share|remove|disable)\s+(all\s+)?(secret|key|restriction|safety|password)/i.test(s),
     },
 ];
@@ -326,8 +326,8 @@ function getCheckStats() {
 // ── Layer names for display ──
 const LAYER_NAMES = {
     1: 'Threat Detection',
-    2: 'EAE Paradox Defense',
-    3: 'Parity Judge',
+    2: 'Trust Defense',
+    3: 'Safety Judge',
     4: 'Brain / Behavioral',
     5: 'Trust Exploitation (ASI09)',
 };

package/src/scanner.js CHANGED Viewed

@@ -31,7 +31,7 @@ const { KNOWN_MALICIOUS } = require('./ioc-db.js');
 const { generateHTML } = require('./html-template.js');
 // ===== CONFIGURATION =====
-const VERSION = '3.4.0';
+const VERSION = '4.0.0';
 const THRESHOLDS = {
     normal: { suspicious: 30, malicious: 80 },