npm - agentshield-sdk - Versions diffs - 14.0.0 → 14.2.0 - Mend

agentshield-sdk 14.0.0 → 14.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/CHANGELOG.md +95 -0
package/package.json +1 -1
package/src/detector-core.js +194 -0
package/src/integrations-frameworks.js +90 -0
package/src/main.js +2 -1
package/src/mcp-guard.js +52 -1
package/src/supply-chain-scanner.js +164 -0

package/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,101 @@ All notable changes to Agent Shield will be documented in this file.
 This project follows [Semantic Versioning](https://semver.org/).
+## [14.2.0] - 2026-05-11
+### May 2026 Threat Response + Performance + DX
+Response to threats disclosed between April 25 and May 11, 2026.
+#### New Detection Patterns (4 patterns, 303 → 307)
+- **TrustFall malicious project files** (2 patterns) — Adversa AI disclosed May 2026: malicious `.claude/`, `.cursor/`, `.windsurf/` config files with auto-execution hooks (`preCommand`, `onStart`, etc.) trigger one-keypress compromise of AI coding agents and exfiltrate CI env vars
+- **Semantic Kernel RCE** — Microsoft Semantic Kernel (CVE-2026-25592 / CVE-2026-26030, disclosed May 7) allows prompt injection to invoke arbitrary kernel functions and achieve RCE on the host process
+- **WebSocket cross-origin hijacking** — CVE-2026-44211 (Cline Kanban) and CVE-2026-32173 (Azure SRE Agent CVSS 8.6): WebSockets without origin validation let attackers inject prompts into running agent terminals
+#### CVE Registry Expansion (33 → 44 CVEs)
+- CVE-2026-25592 / CVE-2026-26030: Microsoft Semantic Kernel RCE (May 7)
+- CVE-2026-42302: FastGPT agent-sandbox unauth RCE (CVSS 9.8, May 8)
+- CVE-2026-44284: FastGPT MCP SSRF
+- CVE-2026-42344: FastGPT DNS rebinding bypass
+- CVE-2026-44211: Cline Kanban WebSocket Hijacking
+- CVE-2026-32173: Azure SRE Agent unauth WebSocket (CVSS 8.6)
+- CVE-2026-44400-403: 4× CrewAI Code Interpreter chain RCE/SSRF/file-read
+#### Performance: LRU Cache (151x speedup on warm cache)
+- Added 1000-entry LRU cache to `scanText()` keyed on `(source, sensitivity, text)`
+- Cached scans complete in ~1μs vs ~190μs cold (151x speedup on short malicious inputs, 90x on benign)
+- Eliminates duplicate work in RAG pipelines, batch processors, and middleware retry loops
+- Inputs >2048 chars bypass the cache to avoid memory bloat
+- Opt-out via `scanText(text, { useCache: false })`
+- Result object includes `fromCache: true` when served from cache
+#### Developer Experience
+- New examples for the platforms developers actually deploy to in 2026:
+  - `examples/cloudflare-workers-ai.js` — Workers AI guardrail with input + output scanning
+  - `examples/nextjs-edge-middleware.js` — Next.js Edge middleware for `/api/chat/*` and `/api/agent/*` routes
+  - `examples/vercel-ai-sdk-guardrail.js` — Vercel AI SDK streaming chat guardrail
+- All examples are self-contained and ready to copy-paste into a real app
+#### Test Coverage
+- New `test/test-v14.2-patterns.js` — 32 assertions covering LRU cache correctness, all 4 new patterns, all 11 new CVE entries, and 6 false-positive regression samples
+- Total project assertions: ~3,200+ across all suites; v14.2 specific: 32
+#### Known Limitations Documented
+- Rust NAPI native scanner (`src/native-scanner.js`) is loaded but NOT wired into the JS hot path. Investigation revealed the Rust core has only 141 patterns vs JS's 307, so wiring it in blindly would silently lose 166 patterns of coverage. Use of the native scanner is gated on a future pattern-sync effort.
+## [14.1.0] - 2026-04-24
+### April 2026 Threat Response — Comment-and-Control, MCP CVE Wave, OAuth Supply Chain
+Rapid security update responding to this week's active attacks: Vercel/Context.ai OAuth supply chain breach, "Comment and Control" zero-click credential theft from AI coding agents, 7 new MCP CVEs, Unit 42 MCP sampling attacks, and malicious LLM API routers.
+#### New Detection Patterns (13 patterns, 290 → 303)
+- **CI/CD Agent Injection** (`cicd_injection`) — detects prompt injection targeting AI coding agents via PR titles, issue comments, and review comments. Defends against the "Comment and Control" attack (April 2026) that exfiltrated credentials from Claude Code, Gemini CLI, and GitHub Copilot
+- **Credential Exfiltration** (`credential_exfiltration`) — detects `/proc/[pid]/environ` reads (Copilot bypass technique), API key patterns in agent output (OPENAI_API_KEY, ANTHROPIC_API_KEY, etc.), and OAuth/bearer token exfiltration with provider-specific prefixes (ya29, ghp_, sk-, xox-, AKIA)
+- **OAuth Flow Manipulation** (`credential_exfiltration`) — detects grant_type/redirect_uri/client_secret manipulation targeting token theft, inspired by the Vercel/Context.ai supply chain breach
+- **MCP Sampling Injection** (`mcp_sampling_injection`) — detects hidden instructions injected via MCP sampling/createMessage requests (Unit 42 research, April 2026)
+- **LLM Router Tampering** (`llm_router_tampering`) — detects OPENAI_BASE_URL/ANTHROPIC_BASE_URL overrides pointing to untrusted endpoints (arXiv 2604.08407: 9 of 28 paid routers actively malicious)
+- **MCP STDIO Command Injection** (`mcp_command_injection`) — detects `npx -c` command injection via MCP STDIO transport (CVE-2026-30623, 200K+ servers affected)
+#### CVE Registry Update (26 → 33 CVEs)
+- CVE-2026-40933: Flowise MCP Adapters RCE (CVSS 9.9)
+- CVE-2026-41264: Flowise CSV Agent prompt injection to RCE
+- CVE-2026-33626: LMDeploy SSRF (exploited within 12 hours of disclosure)
+- CVE-2026-33032: nginx-ui MCP auth bypass (CVSS 9.8, actively exploited)
+- CVE-2026-20205: Splunk MCP Server cleartext token logging (CVSS 7.2)
+- CVE-2026-33946: MCP Ruby SDK session fixation
+- CVE-2026-5603: magento2-dev-mcp command injection
+#### MCPGuard Security Hardening
+- **Tool name squatting detection** — `registerServer()` now detects and warns when a new MCP server registers a tool name already owned by another server (MCPShield arXiv:2604.05969 "Server Spoofing" vector)
+- **Context flooding defense** — `interceptToolOutput()` flags tool outputs exceeding `maxToolOutputSize` (default 100KB) to prevent context window exhaustion attacks
+- **Recursive tool invocation depth limit** — blocks tool call chains exceeding `maxCallDepth` (default 5) to prevent reentrancy attacks and unbounded recursive loops
+#### Supply Chain Scanner Enhancements
+- **Consent phishing detection** — flags tools whose description implies read-only but whose schema contains write/network parameters (OWASP ASI09 Human-Agent Trust Exploitation)
+#### Integration Updates
+- `shieldGoogleADKJS()` — new wrapper for Google ADK TypeScript/JavaScript SDK (GA April 2026)
+- GPT-5.5 model risk profile added to MCP Guard (critical susceptibility, elevated sandbox escape surface)
+#### Test Coverage Expansion (+416 assertions)
+- `test-v14.1-patterns.js` — 61 assertions: all 5 new categories, 12 FP guards, ADK-JS integration, CVE registry
+- `test-pattern-categories.js` — 66 assertions: detection test for every 51 pattern categories + 15 benign guards
+- `test-supply-chain-cves.js` — 228 assertions: all 33 CVEs, 9 blocklist entries, injection/SSRF/poisoning patterns, consent phishing, SARIF/Markdown output
+- Total new assertions this release: 355
 ## [14.0.0] - 2026-04-16
 ### Major Release — Platform Parity + Framework Integrations

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "agentshield-sdk",
-  "version": "14.0.0",
+  "version": "14.2.0",
   "description": "SOTA AI agent security SDK. F1 1.000 on BIPIA/HackAPrompt/MCPTox/Multilingual benchmarks. 400+ exports, 100+ modules. Zero dependencies, runs locally.",
   "main": "src/main.js",
   "types": "types/index.d.ts",

package/src/detector-core.js CHANGED Viewed

@@ -18,6 +18,38 @@
 let _nativeScanner = null;
 try { _nativeScanner = require('./native-scanner'); } catch { /* optional */ }
+// =========================================================================
+// LRU CACHE FOR REPEATED INPUTS
+// =========================================================================
+/** Maximum cache size (entries). */
+const SCAN_CACHE_MAX = 1000;
+/** Maximum input length to cache (avoid bloating memory with large inputs). */
+const SCAN_CACHE_MAX_INPUT_LEN = 2048;
+/** @type {Map<string, object>} */
+const _scanCache = new Map();
+/** Move key to most-recent position. */
+const _cacheTouch = (key) => {
+  const value = _scanCache.get(key);
+  if (value !== undefined) {
+    _scanCache.delete(key);
+    _scanCache.set(key, value);
+  }
+  return value;
+};
+/** Insert with LRU eviction. */
+const _cachePut = (key, value) => {
+  if (_scanCache.size >= SCAN_CACHE_MAX) {
+    const oldest = _scanCache.keys().next().value;
+    if (oldest !== undefined) _scanCache.delete(oldest);
+  }
+  _scanCache.set(key, value);
+};
 // =========================================================================
 // PERFORMANCE
 // =========================================================================
@@ -104,6 +136,13 @@ const PRIMARY_ATTACK_INDICATORS = new RegExp(
     '\\bGPT\\s*[-:]\\s*\\d',  // GPT-4 references in injection contexts
     'api[-\\s_]?key\\s*[=:]',
     'password\\s*[=:]\\s*[\'"]',
+    '/proc/(?:\\d|self)',
+    'oauth|bearer|access.token|refresh.token',
+    'ANTHROPIC_BASE_URL|OPENAI_BASE_URL|API_BASE',
+    'sampling|createMessage|create_message',
+    'npx\\s+-c',
+    '@claude|@copilot|@gemini|@cursor',
+    'grant_type|redirect_uri|client_secret',
   ].join('|'),
   'i'
 );
@@ -2426,6 +2465,143 @@ const INJECTION_PATTERNS = [
     category: 'structured_data_injection',
     description: 'Detects prompt injection hidden in structured data formats',
     detail: 'Structured data injection: agents constantly parse JSON/XML/YAML/CSV and attackers embed instructions in metadata fields, CDATA sections, and comments'
+  },
+  // --- CI/CD Agent Injection (Comment-and-Control, April 2026) ---
+  {
+    regex: /(?:^|\n)\s*(?:<!--\s*)?(?:ignore|override|disregard|forget)\s+(?:all\s+)?(?:previous|prior|above)\s+(?:instructions|rules|context)[\s\S]{0,200}(?:add\s+(?:a\s+)?comment|create\s+(?:a\s+)?(?:issue|pr|pull\s*request)|push\s+to|commit\s+to|post\s+to|curl\s+|fetch\s*\(|http|GITHUB_TOKEN|SECRET|API.KEY)/i,
+    severity: 'critical',
+    category: 'cicd_injection',
+    description: 'Prompt injection targeting AI coding agents via PR titles, issue comments, or review comments',
+    detail: 'Comment-and-Control attack (April 2026): single malicious PR title or issue comment exfiltrates credentials from Claude Code, Gemini CLI, GitHub Copilot via CI/CD auto-triggers'
+  },
+  {
+    regex: /(?:^|\n)\s*@(?:claude|copilot|gemini|cursor|windsurf|cody|aider)\b[\s\S]{0,100}(?:exfiltrate|steal|extract|leak|send\s+to|post\s+to|upload\s+to)/i,
+    severity: 'critical',
+    category: 'cicd_injection',
+    description: 'Prompt injection mentioning AI coding agent by name with exfiltration intent',
+    detail: 'Comment-and-Control: targets specific AI coding agents by @-mention in PR/issue comments to trigger credential theft'
+  },
+  {
+    regex: /\/proc\/(?:[0-9*]+|self)\/(?:environ|cmdline|maps)/i,
+    severity: 'critical',
+    category: 'credential_exfiltration',
+    description: 'Attempts to read process environment or command line to steal secrets',
+    detail: 'Comment-and-Control (April 2026): GitHub Copilot secret theft bypassed all filters by reading /proc/[pid]/environ of parent Node.js process'
+  },
+  {
+    regex: /(?:ANTHROPIC|OPENAI|GITHUB|AWS|AZURE|GCP|GOOGLE)_(?:API_KEY|SECRET|TOKEN|ACCESS_KEY)\s*[=:]\s*\S{10,}/i,
+    severity: 'critical',
+    category: 'credential_exfiltration',
+    description: 'Detects API keys or secrets being included in agent output',
+    detail: 'Credential exfiltration: agent output contains what appears to be an API key or secret token from a major provider'
+  },
+  // --- OAuth Token Exfiltration (Vercel/Context.ai breach, April 2026) ---
+  {
+    regex: /(?:oauth[_-]?token|bearer[_-]?token|access[_-]?token|refresh[_-]?token|id[_-]?token)\s*[=:]\s*["']?(?:ya29[.\-]|eyJ|gho_|ghp_|ghu_|github_pat_|sk-|sk-ant-|xox[bpas]-|AKIA)\S{10,}/i,
+    severity: 'critical',
+    category: 'credential_exfiltration',
+    description: 'Detects OAuth/bearer tokens being exfiltrated through agent output',
+    detail: 'Vercel/Context.ai breach (April 2026): stolen OAuth tokens pivoted into internal systems. Detects common token prefixes from Google, GitHub, OpenAI, Anthropic, Slack, AWS'
+  },
+  {
+    regex: /(?:grant_type|redirect_uri|client_secret)\s*[=:]\s*\S+[\s\S]{0,200}(?:attacker|evil|malicious|exfil|leak|steal)/i,
+    severity: 'high',
+    category: 'credential_exfiltration',
+    description: 'Detects OAuth flow manipulation for token theft',
+    detail: 'OAuth supply chain attack pattern: manipulates grant_type, redirect_uri, or client_secret in agent context to redirect tokens to attacker-controlled endpoints'
+  },
+  // --- MCP Sampling Injection (Unit 42, April 2026) ---
+  {
+    regex: /(?:sampling|createMessage|create_message)\s*[\({][\s\S]{0,300}(?:ignore|override|system|instruction|hidden|inject)/i,
+    severity: 'high',
+    category: 'mcp_sampling_injection',
+    description: 'Detects prompt injection via MCP sampling/createMessage requests',
+    detail: 'Unit 42 MCP sampling attacks (April 2026): servers inject hidden instructions via sampling requests for resource theft, conversation hijacking, and unauthorized content generation'
+  },
+  {
+    regex: /(?:includeContext|systemPrompt|maxTokens)\s*[=:]\s*[\s\S]{0,200}(?:ignore|override|disregard|forget)\s+(?:previous|prior|all)/i,
+    severity: 'high',
+    category: 'mcp_sampling_injection',
+    description: 'Detects MCP sampling parameter manipulation with injection payload',
+    detail: 'Unit 42 MCP sampling attacks: manipulates MCP sampling parameters (includeContext, systemPrompt) to inject instructions into the conversation'
+  },
+  // --- LLM Router Tampering (arXiv 2604.08407, April 2026) ---
+  {
+    regex: /(?:api\.openai\.com|api\.anthropic\.com|generativelanguage\.googleapis\.com)[\s\S]{0,100}(?:redirect|proxy|forward|route)\s*(?:to|via|through)\s*\S+/i,
+    severity: 'high',
+    category: 'llm_router_tampering',
+    description: 'Detects attempts to redirect LLM API calls through untrusted proxies',
+    detail: 'Your Agent Is Mine (arXiv 2604.08407): 9 of 28 paid LLM API routers actively inject malicious code. Detects redirection of API calls to untrusted endpoints'
+  },
+  {
+    regex: /(?:OPENAI_BASE_URL|ANTHROPIC_BASE_URL|API_BASE|base_url)\s*[=:]\s*["']?https?:\/\/(?!(?:api\.openai\.com|api\.anthropic\.com|localhost|127\.0\.0\.1))\S+/i,
+    severity: 'high',
+    category: 'llm_router_tampering',
+    description: 'Detects LLM API base URL override pointing to untrusted endpoint',
+    detail: 'LLM router attack + Claude Code CVE-2026-21852: ANTHROPIC_BASE_URL/OPENAI_BASE_URL overridden to redirect API calls and leak keys to attacker server'
+  },
+  // --- MCP STDIO Command Injection (CVE-2026-30623, April 2026) ---
+  {
+    regex: /(?:npx\s+-c|npx\s+--command)\s+["']?[\s\S]{0,200}(?:curl|wget|nc\b|ncat|bash|sh\b|python|node\s+-e|eval)/i,
+    severity: 'critical',
+    category: 'mcp_command_injection',
+    description: 'Detects command injection via MCP STDIO npx -c pattern',
+    detail: 'CVE-2026-30623 (April 2026): MCP STDIO transport allows configuration-to-command execution. npx -c commands achieve OS command execution affecting 200K+ servers'
+  },
+  // --- Code Execution Sink Detection (OWASP ASI05) ---
+  {
+    regex: /(?:^|[\s;])(?:eval|Function)\s*\(\s*(?:response|output|result|completion|generated|llm|model|agent)/i,
+    severity: 'critical',
+    category: 'code_execution_sink',
+    description: 'Detects LLM output being passed directly to eval() or Function()',
+    detail: 'Code execution sink: LLM output fed to eval()/Function() allows prompt injection to achieve arbitrary code execution (OWASP ASI05)'
+  },
+  // --- TrustFall: Malicious Project File Injection (Adversa AI, May 2026) ---
+  {
+    regex: /(?:\.claude|\.cursor|\.windsurf|\.copilot)\/(?:config|settings|rules|hooks|commands)[\s\S]{0,200}(?:curl|wget|exec|bash|sh\s|node\s+-e|python\s+-c|nc\s)/i,
+    severity: 'critical',
+    category: 'cicd_injection',
+    description: 'Detects malicious AI coding agent config files that trigger one-keypress compromise',
+    detail: 'TrustFall attack (Adversa AI, May 2026): malicious project files in .claude/, .cursor/, .windsurf/ config directories execute commands on agent invocation. Exfiltrates CI environment variables.'
+  },
+  {
+    regex: /(?:^|\n)\s*(?:hook|onStart|preCommand|postCommand|autoexec)\s*[:=]\s*["\']?[\s\S]{0,150}(?:curl|wget|nc\s|bash\s+-c|exec\s*\()/i,
+    severity: 'high',
+    category: 'cicd_injection',
+    description: 'Detects auto-execution hooks in AI agent config files',
+    detail: 'TrustFall: hooks defined in project files trigger automatic command execution when AI coding agent loads the project'
+  },
+  // --- Semantic Kernel RCE (CVE-2026-25592 / 26030) ---
+  {
+    regex: /(?:kernel|sk|SemanticKernel)\.(?:invoke|run|execute|RunAsync)\s*\([^)]{0,200}(?:user|prompt|input|untrusted|external)/i,
+    severity: 'high',
+    category: 'code_execution_sink',
+    description: 'Detects Semantic Kernel function invocation with untrusted input',
+    detail: 'CVE-2026-25592/26030 (May 2026): Microsoft Semantic Kernel allows prompt injection to invoke arbitrary kernel functions, leading to RCE on the host process'
+  },
+  // --- WebSocket Cross-Origin Hijacking (CVE-2026-44211, CVE-2026-32173) ---
+  {
+    regex: /new\s+WebSocket\s*\(\s*["\']wss?:\/\/(?!(?:localhost|127\.0\.0\.1|0\.0\.0\.0))[^"\']*["\']\s*\)[\s\S]{0,300}(?:Origin|origin)\s*[:=]\s*["\']?\*/i,
+    severity: 'high',
+    category: 'cross_agent_injection',
+    description: 'Detects WebSocket connections with wildcard origin (cross-origin hijacking)',
+    detail: 'CVE-2026-44211 (Cline) / CVE-2026-32173 (Azure SRE Agent): WebSocket without origin validation allows cross-origin hijacking — attackers inject prompts into running agent terminals'
+  },
+  {
+    regex: /(?:child_process|subprocess|os\.system|os\.popen|exec|execSync|spawn)\s*\(\s*(?:response|output|result|completion|generated|llm|model|agent)/i,
+    severity: 'critical',
+    category: 'code_execution_sink',
+    description: 'Detects LLM output being passed to shell execution functions',
+    detail: 'Code execution sink: LLM output passed to child_process/subprocess enables arbitrary command execution via prompt injection'
   }
 ];
@@ -3307,6 +3483,22 @@ const scanText = (text, options = {}) => {
     truncated = true;
   }
+  // ------------------------------------------------------------------
+  // LRU CACHE: exact-match memoization for repeated inputs
+  // ------------------------------------------------------------------
+  // RAG pipelines, batch processors, and middleware retry loops re-scan
+  // identical text constantly. A 1000-entry LRU keyed on (source|text)
+  // eliminates duplicate work for ~1μs per hit.
+  const cacheable = text.length <= SCAN_CACHE_MAX_INPUT_LEN && options.useCache !== false;
+  let cacheKey = null;
+  if (cacheable) {
+    cacheKey = source + '\x00' + sensitivity + '\x00' + text;
+    const cached = _cacheTouch(cacheKey);
+    if (cached !== undefined) {
+      return { ...cached, stats: { ...cached.stats, scanTimeMs: now() - startTime }, fromCache: true };
+    }
+  }
   // ------------------------------------------------------------------
   // FAST PATH: long clean text (no attack indicators, no obfuscation)
   // ------------------------------------------------------------------
@@ -3334,6 +3526,7 @@ const scanText = (text, options = {}) => {
     if (truncated) {
       fastResult.warnings = [`Input exceeded ${maxSize} characters and was truncated for scanning.`];
     }
+    if (cacheKey) _cachePut(cacheKey, fastResult);
     return fastResult;
   }
@@ -3490,6 +3683,7 @@ const scanText = (text, options = {}) => {
     result.truncated = true;
     result.warnings = [`Input exceeded ${maxSize} characters and was truncated for scanning.`];
   }
+  if (cacheKey) _cachePut(cacheKey, result);
   return result;
 };

package/src/integrations-frameworks.js CHANGED Viewed

@@ -366,8 +366,98 @@ function shieldMSAgentFramework(options = {}) {
   return { agentMiddleware, shield };
 }
+// =========================================================================
+// Google ADK JavaScript (adk-js) Integration
+// =========================================================================
+/**
+ * Creates Agent Shield middleware for the Google ADK JavaScript/TypeScript SDK.
+ *
+ * Google ADK for JS (GA April 2026) uses a callback-based agent lifecycle.
+ * This wrapper returns a plugin object compatible with ADK-JS's plugin API
+ * that scans tool inputs, tool outputs, and generated content.
+ *
+ * Usage:
+ *   const { shieldGoogleADKJS } = require('agentshield-sdk/src/integrations-frameworks');
+ *   const plugin = shieldGoogleADKJS({ blockOnThreat: true });
+ *
+ *   // Register with ADK-JS agent:
+ *   const agent = new Agent({ plugins: [plugin] });
+ *
+ * @param {object} [options]
+ * @param {string} [options.sensitivity='high'] - Detection sensitivity level.
+ * @param {boolean} [options.blockOnThreat=true] - Whether to throw on threat detection.
+ * @param {string} [options.blockThreshold='high'] - Minimum severity that triggers a block.
+ * @param {function} [options.onThreat] - Callback when a threat is detected.
+ * @returns {{ name: string, beforeToolCall: function, afterToolCall: function, beforeModelCall: function, afterModelCall: function, shield: AgentShield }}
+ */
+function shieldGoogleADKJS(options = {}) {
+  const shield = new AgentShield({
+    sensitivity: options.sensitivity || 'high',
+    blockOnThreat: options.blockOnThreat !== false,
+    blockThreshold: options.blockThreshold || 'high'
+  });
+  const onThreat = options.onThreat || null;
+  function _scan(text, phase, meta) {
+    if (!text) return;
+    const result = shield.scanInput(text);
+    if (result.threats && result.threats.length > 0) {
+      if (onThreat) {
+        try { onThreat({ phase, ...meta, threats: result.threats }); }
+        catch (e) { console.error('[Agent Shield] onThreat callback error:', e.message); }
+      }
+      if (result.blocked) {
+        throw new ShieldBlockError(`Google ADK-JS ${phase} blocked by Agent Shield`, result.threats);
+      }
+    }
+  }
+  return {
+    name: 'AgentShieldPlugin',
+    beforeToolCall(context) {
+      const text = context.args != null
+        ? (typeof context.args === 'string' ? context.args : JSON.stringify(context.args))
+        : null;
+      _scan(text, 'before_tool_call', { toolName: context.toolName || 'unknown' });
+    },
+    afterToolCall(context) {
+      const text = context.result != null
+        ? (typeof context.result === 'string' ? context.result : JSON.stringify(context.result))
+        : null;
+      _scan(text, 'after_tool_call', { toolName: context.toolName || 'unknown' });
+    },
+    beforeModelCall(context) {
+      const text = context.prompt || context.input;
+      if (text) _scan(typeof text === 'string' ? text : JSON.stringify(text), 'before_model_call', {});
+    },
+    afterModelCall(context) {
+      const text = context.response || context.output;
+      if (!text) return;
+      const outputText = typeof text === 'string' ? text : JSON.stringify(text);
+      const result = shield.scanOutput(outputText);
+      if (result.threats && result.threats.length > 0) {
+        if (onThreat) {
+          try { onThreat({ phase: 'after_model_call', threats: result.threats }); }
+          catch (e) { console.error('[Agent Shield] onThreat callback error:', e.message); }
+        }
+        if (result.blocked) {
+          throw new ShieldBlockError('Google ADK-JS model output blocked by Agent Shield', result.threats);
+        }
+      }
+    },
+    shield
+  };
+}
 module.exports = {
   shieldCrewAI,
   shieldGoogleADK,
+  shieldGoogleADKJS,
   shieldMSAgentFramework
 };

package/src/main.js CHANGED Viewed

@@ -84,7 +84,7 @@ const { BenchmarkHarness, DatasetLoader, BenchmarkMetrics, RegressionTracker, Be
 const { ShieldCallbackHandler, shieldAnthropicClient, shieldOpenAIClient, shieldOpenAIAgent, shieldVercelAI, shieldFetch, ShieldBlockError } = safeRequire('./integrations', 'integrations');
 // Framework Integrations (CrewAI, Google ADK, MS Agent Framework)
-const { shieldCrewAI, shieldGoogleADK, shieldMSAgentFramework } = safeRequire('./integrations-frameworks', 'integrations-frameworks');
+const { shieldCrewAI, shieldGoogleADK, shieldGoogleADKJS, shieldMSAgentFramework } = safeRequire('./integrations-frameworks', 'integrations-frameworks');
 // Red Team
 const { AttackSimulator, PayloadFuzzer, getAttackCategories, getPayloads, ATTACK_PAYLOADS } = safeRequire('./redteam', 'redteam');
@@ -498,6 +498,7 @@ const _exports = {
   // Framework Integrations (CrewAI, Google ADK, MS Agent Framework)
   shieldCrewAI,
   shieldGoogleADK,
+  shieldGoogleADKJS,
   shieldMSAgentFramework,
   // Red Team

package/src/mcp-guard.js CHANGED Viewed

@@ -61,6 +61,7 @@ const MODEL_RISK_PROFILES = {
   'gemini-2.5': { riskMultiplier: 1.2, susceptibility: 'high', notes: 'Advanced capability increases risk' },
   'llama-4': { riskMultiplier: 1.1, susceptibility: 'medium', notes: 'Early fusion architecture increases multimodal attack surface' },
   'deepseek-r1': { riskMultiplier: 1.3, susceptibility: 'high', notes: 'Nature: LRMs achieve 97% jailbreak success as autonomous agents' },
+  'gpt-5.5': { riskMultiplier: 1.4, susceptibility: 'critical', notes: 'April 2026 agentic model — elevated sandbox escape and tool-use attack surface' },
   default: { riskMultiplier: 1.0, susceptibility: 'medium', notes: 'Unknown model — default risk level' }
 };
@@ -258,10 +259,25 @@ class CrossServerIsolation {
    * @param {string[]} toolNames
    */
   registerServer(serverId, toolNames) {
+    const collisions = [];
+    for (const name of toolNames) {
+      const existingOwner = this.toolOwnership.get(name);
+      if (existingOwner && existingOwner !== serverId) {
+        collisions.push({ tool: name, existingServer: existingOwner, newServer: serverId });
+      }
+    }
     this.serverTools.set(serverId, new Set(toolNames));
     for (const name of toolNames) {
       this.toolOwnership.set(name, serverId);
     }
+    if (collisions.length > 0) {
+      return {
+        collisions,
+        severity: 'critical',
+        message: `Tool name squatting detected: ${collisions.map(c => `"${c.tool}" (owned by ${c.existingServer}, overridden by ${c.newServer})`).join(', ')}`
+      };
+    }
+    return null;
   }
   /**
@@ -639,8 +655,14 @@ class MCPGuard {
     this._chainTracker = [];
     this._chainMaxLen = 50;
+    /** Recursive tool invocation depth tracker (per-request) */
+    this._callDepth = new Map();
+    this._maxCallDepth = options.maxCallDepth || 5;
     /** Agent fleet registry — tracks all known agents in the deployment */
     this._agentRegistry = new Map();
+    this.config = options;
   }
   // -----------------------------------------------------------------------
@@ -910,10 +932,26 @@ class MCPGuard {
    * @param {*} args - Tool arguments.
    * @returns {{ allowed: boolean, threats: Array<object>, anomalies: Array<object> }}
    */
-  interceptToolCall(serverId, toolName, args) {
+  interceptToolCall(serverId, toolName, args, requestId) {
     const threats = [];
     const anomalies = [];
+    // Recursive tool invocation depth check
+    if (requestId) {
+      const depth = (this._callDepth.get(requestId) || 0) + 1;
+      this._callDepth.set(requestId, depth);
+      if (depth > this._maxCallDepth) {
+        threats.push({
+          type: 'recursive_tool_invocation',
+          severity: 'high',
+          serverId,
+          toolName,
+          description: `Tool call depth ${depth} exceeds max ${this._maxCallDepth}. Possible recursive loop or reentrancy attack.`
+        });
+        return { allowed: false, threats, anomalies };
+      }
+    }
     // Circuit breaker check
     const cbCheck = this._checkCircuitBreaker(serverId);
     if (!cbCheck.allowed) {
@@ -1188,6 +1226,19 @@ class MCPGuard {
     // Scan output
     const outputStr = typeof output === 'string' ? output : JSON.stringify(output || {});
+    // Context flooding defense: flag oversized tool outputs that could push
+    // legitimate instructions out of the context window
+    const maxOutputSize = this.config.maxToolOutputSize || 100_000;
+    if (outputStr.length > maxOutputSize) {
+      threats.push({
+        type: 'context_flooding',
+        severity: 'high',
+        serverId,
+        toolName,
+        description: `Tool output exceeds max size (${outputStr.length} > ${maxOutputSize} chars). Possible context window flooding attack.`
+      });
+    }
     const scanResult = this.scanner(outputStr);
     if (scanResult.threats && scanResult.threats.length > 0) {
       for (const t of scanResult.threats) {

package/src/supply-chain-scanner.js CHANGED Viewed

@@ -51,6 +51,18 @@ const KNOWN_BAD_SERVERS = Object.freeze({
   'flowise-unpatched': {
     reason: 'CVSS 10.0 RCE actively exploited (CVE-2025-59528)',
     severity: 'critical'
+  },
+  'lmdeploy-unpatched': {
+    reason: 'SSRF via vision-language image loader, exploited within 12 hours (CVE-2026-33626)',
+    severity: 'high'
+  },
+  'nginx-ui-mcp': {
+    reason: 'Auth bypass on MCP-integrated HTTP endpoints, actively exploited (CVE-2026-33032)',
+    severity: 'critical'
+  },
+  'splunk-mcp-server': {
+    reason: 'Auth tokens logged in cleartext (CVE-2026-20205)',
+    severity: 'high'
   }
 });
@@ -207,6 +219,18 @@ const CVE_REGISTRY = Object.freeze({
       description: 'Code injection in Flowise MCP node (CVSS 10.0) allows remote code execution. 12,000-15,000 instances exposed. Actively exploited since April 6.',
       fix: 'Upgrade Flowise to >=3.0.6. Restrict access to MCP node.'
     },
+    {
+      cve: 'CVE-2026-40933',
+      severity: 'critical',
+      description: 'Flowise MCP Adapters authenticated RCE (CVSS 9.9). Unsafe stdio command serialization in MCP adapter enables OS command execution via npx -c.',
+      fix: 'Upgrade Flowise to >=3.1.0.'
+    },
+    {
+      cve: 'CVE-2026-41264',
+      severity: 'high',
+      description: 'Flowise CSV Agent prompt injection to RCE. LLM-generated Python script executed without sandbox. No auth required.',
+      fix: 'Upgrade Flowise to >=3.1.0.'
+    },
     {
       cve: 'CVE-2025-8943',
       severity: 'critical',
@@ -243,6 +267,122 @@ const CVE_REGISTRY = Object.freeze({
       description: 'OS command injection RCE in codebase-mcp.',
       fix: 'Upgrade codebase-mcp. Never pass unsanitized inputs to shell.'
     }
+  ],
+  'lmdeploy': [
+    {
+      cve: 'CVE-2026-33626',
+      severity: 'high',
+      description: 'LMDeploy SSRF via vision-language image loader (CVSS 7.5). load_image() fetches arbitrary URLs, enabling port scanning, IMDS access (169.254.169.254), and internal service probing. Exploited within 12 hours of disclosure.',
+      fix: 'Upgrade LMDeploy to >=0.12.3. Block private IP ranges and cloud metadata endpoints in image URLs.'
+    }
+  ],
+  'nginx-ui': [
+    {
+      cve: 'CVE-2026-33032',
+      severity: 'critical',
+      description: 'nginx-ui auth bypass on MCP-integrated HTTP endpoints (CVSS 9.8). Actively exploited.',
+      fix: 'Apply nginx-ui patch. Enable authentication on all MCP endpoints.'
+    }
+  ],
+  'splunk-mcp-server': [
+    {
+      cve: 'CVE-2026-20205',
+      severity: 'high',
+      description: 'Splunk MCP Server logs session/auth tokens in cleartext in _internal index (CVSS 7.2).',
+      fix: 'Upgrade Splunk MCP Server to >=1.0.3. Audit _internal index for leaked tokens.'
+    }
+  ],
+  'mcp-ruby-sdk': [
+    {
+      cve: 'CVE-2026-33946',
+      severity: 'medium',
+      description: 'MCP Ruby SDK session fixation in SSE stream allows hijacking of MCP protocol communications.',
+      fix: 'Upgrade MCP Ruby SDK to >=0.9.2.'
+    }
+  ],
+  'magento2-dev-mcp': [
+    {
+      cve: 'CVE-2026-5603',
+      severity: 'high',
+      description: 'Command injection in @elgentos/magento2-dev-mcp via child_process.execAsync with unsanitized input.',
+      fix: 'Upgrade magento2-dev-mcp. Sanitize all inputs before passing to child_process.'
+    }
+  ],
+  'semantic-kernel': [
+    {
+      cve: 'CVE-2026-25592',
+      severity: 'high',
+      description: 'Microsoft Semantic Kernel .NET SDK <1.71.0: prompt injection invokes arbitrary kernel functions leading to RCE on host process. Disclosed by MSRC May 7, 2026.',
+      fix: 'Upgrade Semantic Kernel .NET SDK to >=1.71.0. Validate function names in kernel.invoke() calls.'
+    },
+    {
+      cve: 'CVE-2026-26030',
+      severity: 'high',
+      description: 'Microsoft Semantic Kernel Python <1.39.4: same RCE primitive as CVE-2026-25592 but in Python SDK.',
+      fix: 'Upgrade Semantic Kernel Python to >=1.39.4.'
+    }
+  ],
+  'fastgpt': [
+    {
+      cve: 'CVE-2026-42302',
+      severity: 'critical',
+      description: 'FastGPT agent-sandbox unauthenticated RCE (CVSS 9.8). code-server launched with --auth none on 0.0.0.0:8080. Published May 8, 2026.',
+      fix: 'Upgrade FastGPT to >=4.14.13. Never expose agent-sandbox endpoints without authentication.'
+    },
+    {
+      cve: 'CVE-2026-44284',
+      severity: 'high',
+      description: 'FastGPT SSRF in MCP tool URL handling. Crafted URLs reach internal services and cloud metadata endpoints.',
+      fix: 'Upgrade FastGPT. Apply Agent Shield mcp-guard SSRF firewall.'
+    },
+    {
+      cve: 'CVE-2026-42344',
+      severity: 'high',
+      description: 'FastGPT DNS rebinding bypasses isInternalAddress() check. Attacker hostname resolves to internal IP after initial validation.',
+      fix: 'Upgrade FastGPT. Pin DNS resolution per request and re-validate on connect.'
+    }
+  ],
+  'cline-kanban': [
+    {
+      cve: 'CVE-2026-44211',
+      severity: 'high',
+      description: 'Cline Kanban Server Cross-Origin WebSocket Hijacking. Missing origin validation lets attackers inject prompts into running agent terminals.',
+      fix: 'Upgrade Cline. Validate Origin header on WebSocket upgrade requests.'
+    }
+  ],
+  'azure-sre-agent': [
+    {
+      cve: 'CVE-2026-32173',
+      severity: 'high',
+      description: 'Azure SRE Agent exposed live command streams over unauthenticated WebSocket to any Entra ID user (CVSS 8.6).',
+      fix: 'Apply Azure patch. Restrict WebSocket access to authorized principals only.'
+    }
+  ],
+  'crewai': [
+    {
+      cve: 'CVE-2026-44400',
+      severity: 'high',
+      description: 'CrewAI Code Interpreter default enabled allows prompt injection → RCE chain.',
+      fix: 'Upgrade CrewAI to latest. Disable Code Interpreter by default. Apply Agent Shield shieldCrewAI wrapper.'
+    },
+    {
+      cve: 'CVE-2026-44401',
+      severity: 'high',
+      description: 'CrewAI Code Interpreter SSRF via prompt-injected URLs.',
+      fix: 'Upgrade CrewAI. Block private IP ranges in tool URLs.'
+    },
+    {
+      cve: 'CVE-2026-44402',
+      severity: 'high',
+      description: 'CrewAI Code Interpreter file-read primitive via prompt injection.',
+      fix: 'Upgrade CrewAI. Restrict filesystem access in Code Interpreter sandbox.'
+    },
+    {
+      cve: 'CVE-2026-44403',
+      severity: 'high',
+      description: 'CrewAI agent task chain prompt injection leads to unauthorized tool invocation.',
+      fix: 'Upgrade CrewAI. Use Agent Shield intent-firewall to validate cross-task transitions.'
+    }
   ]
 });
@@ -378,6 +518,7 @@ class SupplyChainScanner {
       this._scanSchema(tool, findings);
       this._scanForSSRF(tool, findings);
       this._scanForClawHavoc(tool, findings);
+      this._scanConsentPhishing(tool, findings);
     }
     // Analyze escalation chains
@@ -687,6 +828,29 @@ class SupplyChainScanner {
     }
   }
+  /** @private - Consent phishing: detect tools whose description misrepresents capabilities (OWASP ASI09) */
+  _scanConsentPhishing(tool, findings) {
+    if (!tool || !tool.description || !tool.inputSchema) return;
+    const desc = String(tool.description).toLowerCase();
+    const schemaStr = JSON.stringify(tool.inputSchema).toLowerCase();
+    const READ_WORDS = ['read', 'get', 'fetch', 'list', 'view', 'show', 'display', 'search', 'query', 'lookup'];
+    const WRITE_INDICATORS = ['"url"', '"endpoint"', '"host"', '"target"', '"destination"', '"webhook"', '"callback"', 'http', '"command"', '"exec"', '"shell"', '"script"'];
+    const BENIGN_WORDS = ['save', 'update', 'create', 'write', 'delete', 'send', 'post', 'execute', 'run', 'upload'];
+    const descLooksReadOnly = READ_WORDS.some(w => desc.includes(w)) && !BENIGN_WORDS.some(w => desc.includes(w));
+    const schemaHasWriteCapability = WRITE_INDICATORS.some(w => schemaStr.includes(w));
+    if (descLooksReadOnly && schemaHasWriteCapability) {
+      findings.push({
+        type: 'consent_phishing',
+        severity: 'high',
+        message: `Tool "${tool.name || 'unknown'}" description implies read-only ("${desc.substring(0, 60)}...") but schema contains write/network parameters. Users may approve dangerous actions unknowingly.`,
+        recommendation: 'Tool descriptions must accurately reflect capabilities. If the tool sends data to URLs or executes commands, the description must say so explicitly.'
+      });
+    }
+  }
   /** @private */
   _scanSchema(tool, findings) {
     if (tool && tool.inputSchema && tool.inputSchema.additionalProperties === true) {