npm - @guava-parity/guard-scanner - Versions diffs - 5.1.0 - Mend

@guava-parity/guard-scanner 5.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (17) hide show

package/LICENSE +21 -0
package/README.md +250 -0
package/SECURITY.md +45 -0
package/SKILL.md +141 -0
package/docs/THREAT_TAXONOMY.md +308 -0
package/hooks/guard-scanner/HOOK.md +93 -0
package/hooks/guard-scanner/handler.ts +5 -0
package/hooks/guard-scanner/plugin.ts +308 -0
package/openclaw.plugin.json +55 -0
package/package.json +58 -0
package/src/cli.js +170 -0
package/src/html-template.js +239 -0
package/src/ioc-db.js +54 -0
package/src/patterns.js +249 -0
package/src/quarantine.js +41 -0
package/src/runtime-guard.js +346 -0
package/src/scanner.js +1045 -0

package/docs/THREAT_TAXONOMY.md ADDED Viewed

@@ -0,0 +1,308 @@
+# Threat Taxonomy
+Complete reference for guard-scanner's 17 threat categories.
+## Category Origins
+guard-scanner's threat taxonomy combines three sources:
+| Source | Categories | Description |
+|--------|-----------|-------------|
+| **Snyk ToxicSkills** (Feb 2026) | Cat 1–11 | Industry audit of 3,984 AI agent skills |
+| **Palo Alto Networks IBC** | Cat 12–13 | Indirect Bias Criteria for LLM agents |
+| **OWASP MCP Top 10** | Cat 14–16 | Model Context Protocol security risks |
+| **Original Research** | Cat 17 | Identity hijacking from real incident |
+---
+## OWASP Agentic Security Top 10 Mapping
+> Source: [OWASP Top 10 for Agentic Applications 2026](https://owasp.org/www-project-top-10-for-ai-agents/)
+| OWASP ID | Risk Name | guard-scanner Coverage | Categories |
+|----------|-----------|----------------------|------------|
+| **ASI01** | Agent Goal Hijack | ✅ **Full** | Cat 1 (Prompt Injection), Cat 13 (Prompt Worm) |
+| **ASI02** | Tool Misuse & Exploitation | ✅ **Full** | Cat 2 (Malicious Code), Cat 16 (MCP Security) |
+| **ASI03** | Identity & Privilege Abuse | ✅ **Full** | Cat 4 (Credential Handling), Cat 17 (Identity Hijacking) |
+| **ASI04** | Supply Chain Vulnerabilities | ✅ **Full** | Cat 7 (Unverifiable Deps), Cat 3 (Suspicious Downloads), Cat 16 (MCP Shadow Server) |
+| **ASI05** | Unexpected Code Execution | ✅ **Full** | Cat 2 (Malicious Code), Cat 9 (Obfuscation) |
+| **ASI06** | Memory & Context Poisoning | ✅ **Full** | Cat 12 (Memory Poisoning), Cat 17 (Identity Hijacking) |
+| **ASI07** | Insecure Inter-Agent Comms | ✅ **Partial** | Cat 16 (MCP Security — MCP_NO_AUTH, MCP_SHADOW_SERVER) |
+| **ASI08** | Cascading Failures | ⚠️ **Gap** | Not covered — requires runtime multi-agent flow tracing |
+| **ASI09** | Human-Agent Trust Exploitation | ✅ **Full** | Layer 2 (Trust Defense), Layer 3 (Safety Judge) |
+| **ASI10** | Rogue Agents | ✅ **Full** | Cat 17 (Identity Hijacking), Layer 4 (Behavioral analysis) |
+### Coverage Summary
+- **Full Coverage**: 8/10 (ASI01-06, ASI09-10)
+- **Partial Coverage**: 1/10 (ASI07)
+- **Gap**: 1/10 (ASI08 — requires runtime multi-agent orchestration monitoring)
+- **Overall**: 90% coverage of OWASP Agentic Security Top 10
+### Unique to guard-scanner (not in OWASP Top 10)
+| Feature | Description |
+|---------|-------------|
+| **Layer 4: Behavioral** | Behavioral analysis — detects agents that skip research before executing unknown tools |
+| **ZombieAgent** | URL-encoded data exfiltration via static URLs, char maps, and loop fetch |
+| **Safeguard Bypass** | Reprompt, double-prompt, and retry-based safety circumvention |
+| **Cat 15: CVE Patterns** | Known CVE-specific detection (gateway URLs, sandbox disable, Gatekeeper bypass) |
+---
+## Cat 1: Prompt Injection
+**Severity: CRITICAL**
+Hidden instructions embedded in skill documentation that override the agent's behavior.
+### Attack Vectors
+- **Invisible Unicode**: Zero-width spaces (U+200B), BiDi control characters (U+202A-202E), formatting characters
+- **Homoglyphs**: Cyrillic/Latin mixing (`а` vs `a`), Greek/Latin, Mathematical symbols (𝐀-𝟿)
+- **Role Override**: "Ignore all previous instructions", "You are now X"
+- **System Impersonation**: `[SYSTEM]`, `<system>`, `<<SYS>>` tags
+- **Tag Injection**: `<anthropic>`, `<system>`, `<tool_call>` in content
+### Detection IDs
+`PI_IGNORE`, `PI_ROLE`, `PI_SYSTEM`, `PI_ZWSP`, `PI_BIDI`, `PI_INVISIBLE`, `PI_HOMOGLYPH`, `PI_HOMOGLYPH_GREEK`, `PI_HOMOGLYPH_MATH`, `PI_TAG_INJECTION`, `PI_BASE64_MD`
+---
+## Cat 2: Malicious Code
+**Severity: CRITICAL**
+Direct code execution primitives that enable arbitrary command execution.
+### Attack Vectors
+- **Dynamic Evaluation**: `eval()`, `new Function()`, `vm.runInNewContext()`
+- **Process Execution**: `child_process.exec()`, `spawn()`, `execSync()`
+- **Shell Access**: `/bin/bash`, `cmd.exe`, `powershell.exe`
+- **Reverse Shells**: `nc -e`, `ncat`, `socat TCP`, `/dev/tcp`
+- **Raw Sockets**: `net.Socket().connect()`
+### Detection IDs
+`MAL_EVAL`, `MAL_FUNC_CTOR`, `MAL_CHILD`, `MAL_EXEC`, `MAL_SPAWN`, `MAL_SHELL`, `MAL_REVSHELL`, `MAL_SOCKET`
+---
+## Cat 3: Suspicious Downloads
+**Severity: CRITICAL**
+Downloading and executing external payloads.
+### Attack Vectors
+- **Pipe-to-Shell**: `curl ... | bash`, `wget ... | sh`
+- **Executable Downloads**: `.exe`, `.dmg`, `.pkg`, `.zip` downloads
+- **Password-Protected Archives**: Evasion technique to bypass AV
+- **Prerequisites Fraud**: "Before using this skill, download X"
+### Detection IDs
+`DL_CURL_BASH`, `DL_EXE`, `DL_GITHUB_RELEASE`, `DL_PASSWORD_ZIP`, `PREREQ_DOWNLOAD`, `PREREQ_PASTE`
+---
+## Cat 4: Credential Handling
+**Severity: HIGH**
+Accessing, reading, or exposing credentials.
+### Detection IDs
+`CRED_ENV_FILE`, `CRED_ENV_REF`, `CRED_SSH`, `CRED_WALLET`, `CRED_ECHO`, `CRED_SUDO`
+---
+## Cat 5: Secret Detection
+**Severity: CRITICAL**
+Hardcoded secrets and API keys in source code.
+### Detection Methods
+1. **Pattern Matching**: AWS keys (`AKIA...`), GitHub tokens (`ghp_`/`ghs_`), private keys
+2. **Shannon Entropy**: Strings with entropy > 3.5 and length ≥ 20 characters
+### Detection IDs
+`SECRET_HARDCODED_KEY`, `SECRET_AWS`, `SECRET_PRIVATE_KEY`, `SECRET_GITHUB_TOKEN`, `SECRET_ENTROPY`
+---
+## Cat 6: Exfiltration
+**Severity: CRITICAL**
+Sending stolen data to external endpoints.
+### Detection IDs
+`EXFIL_WEBHOOK`, `EXFIL_POST`, `EXFIL_CURL_DATA`, `EXFIL_DNS`, `EXFIL_BEACON`, `EXFIL_DRIP`
+---
+## Cat 7: Unverifiable Dependencies
+**Severity: HIGH**
+Loading code from unverifiable remote sources.
+### Detection IDs
+`DEP_REMOTE_IMPORT`, `DEP_REMOTE_SCRIPT`
+---
+## Cat 8: Financial Access
+**Severity: HIGH**
+Cryptocurrency and payment system interactions.
+### Detection IDs
+`FIN_CRYPTO`, `FIN_PAYMENT`
+---
+## Cat 9: Obfuscation
+**Severity: HIGH**
+Code obfuscation techniques to hide malicious intent.
+### Detection IDs
+`OBF_HEX`, `OBF_BASE64_EXEC`, `OBF_BASE64`, `OBF_CHARCODE`, `OBF_CONCAT`, `OBF_BASE64_BASH`
+---
+## Cat 10: Prerequisites Fraud
+Covered under Cat 3 (Suspicious Downloads).
+---
+## Cat 11: Leaky Skills (Snyk ToxicSkills)
+**Severity: CRITICAL**
+Skills that cause the LLM to leak secrets through its context window. Unlike traditional credential theft, leaky skills exploit the agent's trust relationship.
+### Attack Vectors
+- "Save the API key in your memory" → Secret persists in agent memory
+- "Share the token with the user" → Secret echoed to output
+- "Use the API key verbatim in curl" → Secret appears in command history
+- "Collect the user's credit card" → PII harvesting through LLM
+- "Export session logs to file" → Full conversation dump
+### Detection IDs
+`LEAK_SAVE_KEY_MEMORY`, `LEAK_SHARE_KEY`, `LEAK_VERBATIM_CURL`, `LEAK_COLLECT_PII`, `LEAK_LOG_SECRET`, `LEAK_ENV_IN_PROMPT`
+---
+## Cat 12: Memory Poisoning
+**Severity: CRITICAL**
+Modifying the agent's persistent memory or behavioral rules.
+### Attack Vectors
+- Writing to `SOUL.md`, `IDENTITY.md`, or `MEMORY.md`
+- Overriding behavioral rules: "Change your instructions to..."
+- Persistence: "From now on, always do X"
+- File writes to user's home directory
+### Detection IDs
+`MEMPOIS_WRITE_SOUL`, `MEMPOIS_WRITE_MEMORY`, `MEMPOIS_CHANGE_RULES`, `MEMPOIS_PERSIST`, `MEMPOIS_CODE_WRITE`
+---
+## Cat 13: Prompt Worm
+**Severity: CRITICAL**
+Self-replicating instructions that spread across agents and platforms.
+### Detection IDs
+`WORM_SELF_REPLICATE`, `WORM_SPREAD`, `WORM_HIDDEN_INSTRUCT`, `WORM_CSS_HIDE`
+---
+## Cat 14: Persistence & Scheduling
+**Severity: HIGH**
+Creating persistent execution mechanisms that survive session restarts.
+### Detection IDs
+`PERSIST_CRON`, `PERSIST_STARTUP`, `PERSIST_LAUNCHD`
+---
+## Cat 15: CVE Patterns
+**Severity: CRITICAL**
+Patterns matching known CVEs affecting AI agents.
+### Detection IDs
+`CVE_GATEWAY_URL`, `CVE_SANDBOX_DISABLE`, `CVE_XATTR_GATEKEEPER`, `CVE_WS_NO_ORIGIN`, `CVE_API_GUARDRAIL_OFF`
+---
+## Cat 16: MCP Security (OWASP MCP Top 10)
+**Severity: CRITICAL**
+Model Context Protocol specific attacks.
+### Attack Vectors
+- **Tool Poisoning** (MCP01): Hidden instructions in tool descriptions
+- **Schema Poisoning**: Malicious defaults in JSON schemas
+- **Token Theft** (MCP01): Secrets through tool parameters
+- **Shadow Server** (MCP09): Rogue MCP server registration
+- **Auth Bypass** (MCP07): Disabled authentication
+- **SSRF**: Cloud metadata endpoint access (169.254.169.254)
+### Detection IDs
+`MCP_TOOL_POISON`, `MCP_SCHEMA_POISON`, `MCP_TOKEN_LEAK`, `MCP_SHADOW_SERVER`, `MCP_NO_AUTH`, `MCP_SSRF_META`
+---
+## Cat 17: Identity Hijacking
+**Severity: CRITICAL**
+> **Original Research** — Developed from a real 3-day incident in February 2026.
+Tampering with an AI agent's identity/personality files (`SOUL.md`, `IDENTITY.md`).
+### Attack Vectors
+- **File Overwrite**: `cp`, `mv`, `scp`, `write` to identity files
+- **Shell Redirect**: `echo "evil" > SOUL.md`
+- **Stream Edit**: `sed -i` on identity files
+- **Programmatic Write**: Python `open('SOUL.md', 'w')`, Node.js `writeFileSync`
+- **Lock Bypass**: `chflags nouchg`, `attrib -R` to unlock immutability
+- **Persona Swap**: "Swap the soul file", "You are now EvilBot"
+- **Hook Injection**: Bootstrap hooks that swap files at startup
+- **Memory Wipe**: "Erase your memories", "Clear MEMORY.md"
+### Detection IDs
+`SOUL_OVERWRITE`, `SOUL_REDIRECT`, `SOUL_SED_MODIFY`, `SOUL_ECHO_WRITE`, `SOUL_PYTHON_WRITE`, `SOUL_FS_WRITE`, `SOUL_POWERSHELL_WRITE`, `SOUL_GIT_CHECKOUT`, `SOUL_CHFLAGS_UNLOCK`, `SOUL_ATTRIB_UNLOCK`, `SOUL_SWAP_PERSONA`, `SOUL_EVIL_FILE`, `SOUL_HOOK_SWAP`, `SOUL_NAME_OVERRIDE`, `SOUL_MEMORY_WIPE`
+> **Note**: Cat 17 detection patterns are open-source and natively included in guard-scanner.
+---
+## ZombieAgent Patterns
+Advanced exfiltration techniques that encode stolen data into URL patterns.
+### Detection IDs
+`ZOMBIE_STATIC_URL`, `ZOMBIE_CHAR_MAP`, `ZOMBIE_LOOP_FETCH`
+---
+## Safeguard Bypass (Reprompt)
+Techniques to circumvent safety guardrails.
+### Detection IDs
+`REPROMPT_URL_PI`, `REPROMPT_DOUBLE`, `REPROMPT_RETRY`, `BYPASS_REPHRASE`

package/hooks/guard-scanner/HOOK.md ADDED Viewed

@@ -0,0 +1,93 @@
+---
+name: guard-scanner
+description: "Runtime Guard — intercepts dangerous tool calls using threat intelligence patterns before execution"
+metadata: { "openclaw": { "emoji": "🛡️", "events": ["agent:before_tool_call"], "requires": { "bins": ["node"] } } }
+---
+# guard-scanner Runtime Guard — before_tool_call Hook
+Real-time security monitoring for OpenClaw agents. Intercepts dangerous
+tool calls before execution and checks against threat intelligence patterns.
+## Triggers
+| Event                      | Action | Purpose                                    |
+|----------------------------|--------|-------------------------------------------|
+| `agent:before_tool_call`   | scan   | Check tool args for malicious patterns    |
+## What It Does
+Scans every `exec`/`write`/`edit`/`browser`/`web_fetch`/`message` call against 26 runtime threat patterns (5 layers):
+| ID | Severity | Layer | Description |
+|----|----------|-------|-------------|
+| `RT_REVSHELL` | CRITICAL | 1 | Reverse shell via /dev/tcp, netcat, socat |
+| `RT_CRED_EXFIL` | CRITICAL | 1 | Credential exfiltration to webhook.site, requestbin, etc. |
+| `RT_GUARDRAIL_OFF` | CRITICAL | 1 | Guardrail disabling (exec.approvals=off) |
+| `RT_GATEKEEPER` | CRITICAL | 1 | macOS Gatekeeper bypass via xattr |
+| `RT_AMOS` | CRITICAL | 1 | ClawHavoc AMOS stealer indicators |
+| `RT_MAL_IP` | CRITICAL | 1 | Known malicious C2 IPs |
+| `RT_DNS_EXFIL` | HIGH | 1 | DNS-based data exfiltration |
+| `RT_B64_SHELL` | CRITICAL | 1 | Base64 decode piped to shell |
+| `RT_CURL_BASH` | CRITICAL | 1 | Download piped to shell execution |
+| `RT_SSH_READ` | HIGH | 1 | SSH private key access |
+| `RT_WALLET` | HIGH | 1 | Crypto wallet credential access |
+| `RT_CLOUD_META` | CRITICAL | 1 | Cloud metadata endpoint SSRF |
+| `RT_MEM_WRITE` | HIGH | 2 | Direct memory file write bypass |
+| `RT_MEM_INJECT` | CRITICAL | 2 | Memory poisoning via episode injection |
+| `RT_SOUL_TAMPER` | CRITICAL | 2 | SOUL.md modification attempt |
+| `RT_CONFIG_TAMPER` | HIGH | 2 | Workspace config tampering |
+| `RT_PROMPT_INJECT` | CRITICAL | 3 | Prompt injection / jailbreak detection |
+| `RT_TRUST_BYPASS` | CRITICAL | 3 | Trust safety bypass |
+| `RT_SHUTDOWN_REFUSE` | HIGH | 3 | Shutdown refusal / self-preservation |
+| `RT_NO_RESEARCH` | MEDIUM | 4 | Agent executing tools without prior research |
+| `RT_BLIND_TRUST` | MEDIUM | 4 | Trusting external input without memory check |
+| `RT_CHAIN_SKIP` | HIGH | 4 | Acting on single source without cross-verification |
+| `RT_AUTHORITY_CLAIM` | HIGH | 5 | Authority role claim to override safety |
+| `RT_CREATOR_BYPASS` | CRITICAL | 5 | Creator impersonation to disable safety |
+| `RT_AUDIT_EXCUSE` | CRITICAL | 5 | Fake audit excuse for safety bypass |
+| `RT_TRUST_PARTNER_EXPLOIT` | CRITICAL | 5 | Weaponizing partnership trust |
+## Modes
+| Mode | Behavior |
+|------|----------|
+| `monitor` | Log all detections, never block |
+| `enforce` (default) | Block CRITICAL, log rest |
+| `strict` | Block HIGH + CRITICAL, log MEDIUM+ |
+## Audit Log
+All detections logged to `~/.openclaw/guard-scanner/audit.jsonl`.
+Format: JSON lines with fields:
+```json
+{"tool":"exec","check":"RT_CURL_BASH","severity":"CRITICAL","desc":"Download piped to shell","mode":"enforce","action":"blocked","session":"...","ts":"2026-02-17T..."}
+```
+## Configuration
+Set mode in `openclaw.json`:
+```json
+{
+  "hooks": {
+    "internal": {
+      "entries": {
+        "guard-scanner": {
+          "enabled": true,
+          "mode": "enforce"
+        }
+      }
+    }
+  }
+}
+```
+## Part of guard-scanner v5.0.5
+- **Static scanner**: `npx guard-scanner [dir]` — 23 threat categories, 147 patterns
+- **Runtime Guard: This hook** — 26 real-time checks (5 layers), 3 modes
+- **Plugin API** — Custom detection rules
+- **CI/CD** — SARIF 2.1.0 output for GitHub Code Scanning

package/hooks/guard-scanner/handler.ts ADDED Viewed

@@ -0,0 +1,5 @@
+/**
+ * OpenClaw Hook Handler — re-exports from plugin.ts
+ * OpenClaw hooks system requires handler.ts/handler.js as the entry point.
+ */
+export { default } from "./plugin";