npm - agentshield-sdk - Versions diffs - 7.0.0 - Mend

agentshield-sdk 7.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (84) hide show

package/CHANGELOG.md +191 -0
package/LICENSE +21 -0
package/README.md +975 -0
package/bin/agent-shield.js +680 -0
package/package.json +118 -0
package/src/adaptive.js +330 -0
package/src/agent-protocol.js +998 -0
package/src/alert-tuning.js +480 -0
package/src/allowlist.js +603 -0
package/src/audit-immutable.js +914 -0
package/src/audit-streaming.js +469 -0
package/src/badges.js +196 -0
package/src/behavior-profiling.js +289 -0
package/src/benchmark-harness.js +804 -0
package/src/canary.js +271 -0
package/src/certification.js +563 -0
package/src/circuit-breaker.js +321 -0
package/src/compliance.js +617 -0
package/src/confidence-tuning.js +324 -0
package/src/confused-deputy.js +624 -0
package/src/context-scoring.js +360 -0
package/src/conversation.js +494 -0
package/src/cost-optimizer.js +1024 -0
package/src/ctf.js +462 -0
package/src/detector-core.js +1999 -0
package/src/distributed.js +359 -0
package/src/document-scanner.js +795 -0
package/src/embedding.js +307 -0
package/src/encoding.js +429 -0
package/src/enterprise.js +405 -0
package/src/errors.js +100 -0
package/src/eu-ai-act.js +523 -0
package/src/fuzzer.js +764 -0
package/src/honeypot.js +328 -0
package/src/i18n-patterns.js +523 -0
package/src/index.js +430 -0
package/src/integrations.js +528 -0
package/src/llm-redteam.js +670 -0
package/src/main.js +741 -0
package/src/main.mjs +38 -0
package/src/mcp-bridge.js +542 -0
package/src/mcp-certification.js +846 -0
package/src/mcp-sdk-integration.js +355 -0
package/src/mcp-security-runtime.js +741 -0
package/src/mcp-server.js +740 -0
package/src/middleware.js +208 -0
package/src/model-finetuning.js +884 -0
package/src/model-fingerprint.js +1042 -0
package/src/multi-agent-trust.js +453 -0
package/src/multi-agent.js +404 -0
package/src/multimodal.js +296 -0
package/src/nist-mapping.js +505 -0
package/src/observability.js +330 -0
package/src/openclaw.js +450 -0
package/src/otel.js +544 -0
package/src/owasp-2025.js +483 -0
package/src/pii.js +390 -0
package/src/plugin-marketplace.js +628 -0
package/src/plugin-system.js +349 -0
package/src/policy-dsl.js +775 -0
package/src/policy-extended.js +635 -0
package/src/policy.js +443 -0
package/src/presets.js +409 -0
package/src/production.js +557 -0
package/src/prompt-leakage.js +321 -0
package/src/rag-vulnerability.js +579 -0
package/src/redteam.js +475 -0
package/src/response-handler.js +429 -0
package/src/scanners.js +357 -0
package/src/self-healing.js +363 -0
package/src/semantic.js +339 -0
package/src/shield-score.js +250 -0
package/src/sso-saml.js +897 -0
package/src/stream-scanner.js +806 -0
package/src/testing.js +505 -0
package/src/threat-encyclopedia.js +629 -0
package/src/threat-intel-network.js +1017 -0
package/src/token-analysis.js +467 -0
package/src/tool-guard.js +412 -0
package/src/tool-output-validator.js +354 -0
package/src/utils.js +83 -0
package/src/watermark.js +235 -0
package/src/worker-scanner.js +601 -0
package/types/index.d.ts +2088 -0

package/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,191 @@
+# Changelog
+All notable changes to Agent Shield will be documented in this file.
+This project follows [Semantic Versioning](https://semver.org/).
+## [7.0.0] — 2026-03-21
+### Added
+- **MCP Security Runtime** — `MCPSecurityRuntime` unified security layer for MCP servers with per-user/per-session/per-tool authorization, session state machine (prevents tool ordering attacks), behavioral anomaly detection, delegation with scope narrowing, and one-line middleware integration (`src/mcp-security-runtime.js`)
+- **MCP Certification** — `MCPCertification` with 15 security requirements (auth, scanning, rate limiting, audit, crypto, monitoring, policy), Platinum/Gold/Silver/Bronze levels, formatted reports with actionable recommendations (`src/mcp-certification.js`)
+- **Cross-Org Agent Trust** — `CrossOrgAgentTrust` certificate authority for AI agents crossing organizational boundaries — issue, verify, and revoke HMAC-signed certificates with trust levels, org restrictions, and automatic expiry (`src/mcp-certification.js`)
+- **Agent Threat Intelligence** — `AgentThreatIntelligence` local threat pattern corpus with confidence decay, trend analysis (attack rate, bypass rate, direction), and corpus export/import for federated learning (`src/mcp-certification.js`)
+- **Live Demo** — `examples/mcp-security-demo.js` simulating all four Meta rogue AI agent attack vectors with real-time blocking
+### Changed
+- **AES-256-GCM encryption** replaces XOR cipher in `SecureChannel` (`src/agent-protocol.js`)
+- **HMAC-SHA256 signing** replaces plain SHA256 in `AuthorizationContext` and `EphemeralTokenManager` with configurable signing keys (`src/confused-deputy.js`)
+- Timing-safe signature verification throughout using `crypto.timingSafeEqual()`
+- Automatic expired token cleanup in `EphemeralTokenManager`
+- Intent matching uses word-boundary matching instead of substring to prevent spoofing
+- Token issuance now verifies context integrity before minting
+- Delegation depth enforcement (configurable, default 5)
+- Total exports increased from 302 to 310+ across 77+ modules
+- Test suite expanded to 962 assertions across 13 test suites (112 MCP security tests)
+### Fixed
+- 37 bugs fixed across two deep bug hunting cycles (see commit history)
+- Memory leaks in pending timestamps, revoked tokens, user tokens, behavior profiles
+- Double-counting in tool call blocked stats
+- Certificate eviction with LRU fallback for non-expired overflow
+- Map modification during iteration in session cleanup
+- Orphaned child sessions on parent termination
+## [6.0.0] — 2026-03-21
+### Added
+- **OWASP LLM Top 10 v2025 Coverage Matrix** — `OWASPCoverageMatrix` mapping all Agent Shield capabilities to OWASP LLM Top 10 (2025 edition) with per-category coverage scoring, gap analysis, and compliance reporting (`src/owasp-2025.js`)
+- **MCP Bridge** — `MCPBridge` for native Model Context Protocol integration with `MCPToolPolicy` (per-tool allow/deny), `MCPSessionGuard` (session budgets, rate limiting), `MCPResourceScanner` (resource URI validation), and `createMCPMiddleware` for Express (`src/mcp-bridge.js`)
+- **NIST AI RMF Mapping** — `NISTMapper` mapping to NIST AI Risk Management Framework (2025) across GOVERN/MAP/MEASURE/MANAGE/MONITOR functions, `AIBOMGenerator` for AI Bill of Materials, `NISTComplianceChecker` with SP 800-53 AI control mapping (`src/nist-mapping.js`)
+- **EU AI Act Compliance** — `RiskClassifier` (prohibited/high/limited/minimal risk classification), `ConformityAssessment` (Article 43 checklist), `TransparencyReporter` (Article 13 obligations), `EUIncidentReporter` (Article 62 serious incident reporting), `EUAIActDashboard` with deadline tracking (`src/eu-ai-act.js`)
+- **System Prompt Leakage Detector** — `SystemPromptGuard` detecting 20+ prompt extraction attack patterns (direct requests, indirect extraction, roleplay-based attacks), `PromptFingerprinter` for output leak scoring, `PromptLeakageMitigation` with configurable response strategies (OWASP LLM07-2025) (`src/prompt-leakage.js`)
+- **RAG/Vector Vulnerability Scanner** — `RAGVulnerabilityScanner` detecting chunk boundary manipulation, metadata injection, authority spoofing, retrieval poisoning, and context window stuffing; `EmbeddingIntegrityChecker` for vector integrity; `RAGPipelineAuditor` for end-to-end RAG pipeline security (OWASP LLM08-2025) (`src/rag-vulnerability.js`)
+- **Confused Deputy Prevention** — `AuthorizationContext` (immutable user-to-agent binding with delegation chain), `EphemeralTokenManager` (scoped, auto-rotating tokens replacing static API keys), `IntentValidator` (post-auth action verification with scope/role/intent policies), `ConfusedDeputyGuard` (per-user MCP authorization preventing privilege escalation through delegation). Directly addresses the four IAM gaps from Meta's rogue AI agent incident (March 2026) (`src/confused-deputy.js`)
+### Changed
+- Total exports increased from 254 to 302 across 74+ modules
+- Test suite expanded to 850 assertions across 11 test suites (122 v6 tests + 85 confused deputy tests)
+## [5.0.0] — 2026-03-20
+### Added
+- **Agent-to-Agent Protocol** — `AgentProtocol` with `SecureChannel` (HMAC-signed, replay-protected), `HandshakeManager` (mutual auth, challenge-response), `MessageRouter` for multi-agent topologies (`src/agent-protocol.js`)
+- **Real-time Streaming Dashboard** — WebSocket server (RFC 6455) with live threat feed, SVG line/donut/heatmap charts, dark/light mode, auto-reconnect (`dashboard-live/`)
+- **Policy-as-Code DSL** — `PolicyDSL` with tokenizer, recursive descent parser, compiler, and runtime supporting `when/then/and/or` conditions, `allow` blocks, `rate_limit`, `scan_mode` (`src/policy-dsl.js`)
+- **Fuzzing Harness** — `FuzzingHarness` with `InputGenerator` (8 strategies including grammar-based, mutation, encoding), `MutationEngine` (13 mutations), `CoverageTracker`, `CrashCollector`, 50+ seed corpus (`src/fuzzer.js`)
+- **Model Fingerprinting** — `ModelFingerprinter` with `ResponseAnalyzer` (16 stylistic features), `StyleProfile` (cosine similarity), `FingerprintDatabase` (5 built-in profiles), `SupplyChainDetector` for model swap detection (`src/model-fingerprint.js`)
+- **Cost/Latency Optimizer** — `CostOptimizer` with `AdaptiveScanner` (auto-escalating tiers: fast/standard/deep/paranoid), `LatencyBudget`, `PerformanceMonitor` (p50/p95/p99), 4 optimization presets (`src/cost-optimizer.js`)
+## [4.0.0] — 2026-03-19
+### Added
+- **Rust Core Engine** — `RegexSet`-based O(n) multi-pattern matching with compilation targets for WASM (`wasm-bindgen`), Node.js NAPI (`napi-rs`), and Python (`PyO3`) (`rust-core/`)
+- **Go SDK** — full detection engine, HTTP/gRPC middleware, CLI tool, benchmarks, zero external dependencies (`go-sdk/`)
+- **Terraform Provider** — `agent_shield_policy`, `agent_shield_rule`, `agent_shield_tenant` resources for infrastructure-as-code (`terraform-provider/`)
+- **OpenTelemetry Collector** — receiver (HTTP scan endpoint to log records) and processor (scan logs/traces, annotate/drop/log actions) (`otel-collector/`)
+- **GitHub App** — PR scanning with Check Run annotations, GitHub Action (`action.yml`), webhook signature verification, diff parsing (`github-app/`)
+- **Benchmark Registry** — `BenchmarkSuite` (100+ test cases), `MetricsCalculator` (F1, MCC, throughput, latency percentiles), `Leaderboard` with interactive web dashboard (`benchmark-registry/`)
+- **Multi-language Patterns** — `I18nPatternManager` with 32+ patterns across CJK, Arabic, Cyrillic, and Indic scripts (`src/i18n-patterns.js`)
+- **LLM Red Team Suite** — `JailbreakLibrary` (35+ templates across 6 categories), `AdversarialGenerator`, `EvasionTester`, `RedTeamReport` with weakness analysis (`src/llm-redteam.js`)
+## [3.0.0] — 2026-03-19
+### Added
+- **Self-Healing Patterns** — `SelfHealingEngine` auto-generates detection patterns from false negatives with `PatternGenerator` (`src/self-healing.js`)
+- **Honeypot Mode** — `HoneypotEngine` with session tracking, escalation analysis, technique intelligence, fake response generation (`src/honeypot.js`)
+- **Multi-Modal Scanning** — `MultiModalScanner` for images (alt text, OCR, metadata), audio transcripts, PDFs, and tool outputs (`src/multimodal.js`)
+- **Agent Behavior Profiling** — `BehaviorProfile` with statistical baselining, anomaly detection (z-score), health checks (`src/behavior-profiling.js`)
+- **Threat Intelligence Network** — `ThreatIntelNetwork` with `PeerNode` reputation, `PatternAnonymizer` (differential privacy), `ConsensusEngine`, `ThreatFeed` with STIX-like export (`src/threat-intel-network.js`)
+## [2.1.0] — 2026-03-19
+### Added
+- **Distributed Scanning** — `DistributedShield` with pluggable adapters (`MemoryAdapter`, `RedisAdapter`), pub/sub threat broadcasting (`src/distributed.js`)
+- **Audit Log Streaming** — `AuditStreamManager` with `FileTransport`, `SplunkTransport`, `ElasticsearchTransport` adapters (`src/audit-streaming.js`)
+- **Kubernetes Operator** — sidecar container with Helm chart, `MutatingWebhookConfiguration` for auto-injection, Prometheus metrics, health checks (`k8s/`)
+- **SSO/SAML Integration** — `SSOManager`, `SAMLParser`, `OIDCHandler`, `IdentityMapper` with enterprise IdP mapping to RBAC roles (`src/sso-saml.js`)
+- **Custom Model Fine-Tuning** — `ModelTrainer` with TF-IDF + logistic regression, `TrainingPipeline`, `DatasetManager`, `ModelEvaluator`, `FineTunedModel` export/import (`src/model-finetuning.js`)
+## [2.0.0] — 2026-03-19
+### Added
+- **Plugin Marketplace** — `PluginRegistry`, `PluginValidator`, `MarketplaceClient` with quality scoring, safety validation, version management (`src/plugin-marketplace.js`)
+- **Dashboard v2** — real-time web dashboard with threat visualization, donut charts, sparklines, dark/light mode, scan metrics (`dashboard-live/`)
+- **Python SDK** — `agent_shield` Python package with core detection, `AgentShield` class, LangChain/LlamaIndex wrappers, Flask/FastAPI middleware, CLI tool (`python-sdk/`)
+- **WASM Build** — ESM/UMD bundles for browsers, Cloudflare Workers, Deno, Bun with build script and platform-specific examples (`wasm/`)
+- **VS Code Extension** — `agent-shield-vscode` with inline diagnostics, real-time scanning, severity mapping, string literal extraction for JS/TS/Python/Markdown (`vscode-extension/`)
+## [1.2.0] — 2026-03-19
+### Added
+- **LLM-Assisted Classification** — `SemanticClassifier` with Ollama/OpenAI-compatible local endpoint support, two-pass `enhancedScan()` (`src/semantic.js`)
+- **Embedding-Based Similarity** — `EmbeddingSimilarityDetector` using TF-IDF + cosine similarity against 28-pattern attack corpus (`src/embedding.js`)
+- **Context-Aware Scoring** — `ConversationContextAnalyzer` with multi-turn escalation detection, topic pivot alerts, velocity/repetition checks (`src/context-scoring.js`)
+- **Confidence Calibration** — `ConfidenceTuner` with per-category threshold learning, feedback recording, precision/recall metrics (`src/confidence-tuning.js`)
+## [1.1.0] — 2026-03-19
+### Added
+- **Expanded Homoglyph Detection** — comprehensive Unicode mapping covering Cherokee, Georgian, IPA, Math Symbols, Enclosed/Circled, Small Caps, Superscript/Subscript characters
+- **Worker Threads Support** — opt-in `ThreadedWorkerScanner` for real parallel CPU-bound scanning
+- **Independent Benchmark Suite** — reproducible performance script (`npm run benchmark`) with throughput, latency, scaling, accuracy, and memory metrics
+- **AI Phishing Patterns** — QR code phishing, MFA harvesting, credential urgency, subscription scams, AI access lures
+### Changed
+- **Adversarial Mutation Resilience** — synonym-aware keyword cluster detection, pre-normalization, hex-escape decoding (84% to 95.3% detection rate)
+## [1.0.0] — 2026-03-19
+### Initial Release
+Agent Shield v1.0.0 — a zero-dependency security SDK for AI agents.
+### Core Features
+- **Prompt Injection Detection** — detects fake system prompts, instruction overrides, ChatML/LLaMA delimiters, markdown headers, and 30+ injection patterns
+- **Role Hijacking Detection** — catches DAN mode, developer mode, jailbreak attempts, persona attacks
+- **Data Exfiltration Prevention** — blocks system prompt extraction, markdown image leaks, fetch calls, tag extraction
+- **Tool Abuse Detection** — flags sensitive file access, shell execution, SQL injection, path traversal, recursive tool calls
+- **Social Engineering Detection** — identifies identity concealment, urgency + authority, gaslighting, false pre-approval
+- **Obfuscation Detection** — decodes Unicode homoglyphs, zero-width chars, Base64, hex, ROT13, leetspeak, reversed text
+- **Multi-Language Support** — detects attacks in English, Spanish, French, German, Portuguese, Chinese, Japanese
+### Modules
+- **AgentShield** — main SDK class with configurable sensitivity, blocking, and callbacks
+- **Canary Tokens** — generate and detect prompt leak canaries
+- **PII Redactor** — auto-redact SSNs, emails, phone numbers, credit cards (DLP engine)
+- **Tool Guard** — tool sequence analysis and permission boundaries
+- **Circuit Breaker** — rate limiting and automatic trip on repeated attacks
+- **Conversation Analysis** — fragmentation detection, language switch detection, behavioral fingerprinting
+- **Multi-Agent Security** — agent firewall, delegation chains, shared threat state
+- **Multi-Agent Trust** — message signing (HMAC), capability tokens, blast radius containment
+- **Encoding Detection** — steganography, encoding bruteforce, structured data scanning
+- **Output Watermarking** — watermark agent outputs with differential privacy
+- **Policy Engine** — YAML/JSON policy loading, structured logging, webhook alerts
+- **Compliance Reporting** — SOC2, HIPAA, GDPR, OWASP, NIST, EU AI Act reports with audit trails
+- **Enterprise Features** — multi-tenant isolation, RBAC, debug mode
+- **RAG Scanner** — scan retrieved documents before they enter the context
+- **Red Team Simulator** — 49 built-in attack payloads with automated testing
+- **Shield Score** — quantitative security scoring and benchmarking
+### Framework Integrations
+- Anthropic / Claude SDK (`shieldAnthropicClient`)
+- OpenAI SDK (`shieldOpenAIClient`)
+- LangChain (`ShieldCallbackHandler`)
+- Vercel AI SDK (`shieldVercelAI`)
+- Express middleware (`expressMiddleware`)
+- Generic agent wrapper (`wrapAgent`, `shieldTools`)
+### CLI
+- `npx agent-shield scan` — scan text for threats
+- `npx agent-shield score` — calculate shield score
+- `npx agent-shield redteam` — run attack simulation
+- `npx agent-shield audit` — compliance audit
+- `npx agent-shield patterns` — list all detection patterns
+### Benchmarks
+- 100% detection on internal red team (49 attacks)
+- 99.1% detection on external benchmark (108 real-world attacks)
+- 0% false positive rate (103 benign inputs)
+- 100/100 A+ shield score
+- ~48,000 scans/sec throughput
+- < 0.03ms average latency

package/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026 Agent Shield Contributors
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.