npm - rust-kgdb - Versions diffs - 0.6.64 → 0.6.67 - Mend

rust-kgdb 0.6.64 → 0.6.67

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +782 -294
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # rust-kgdb
-High-performance RDF/SPARQL database with AI agent framework.
+High-performance embedded knowledge graph database with neuro-symbolic AI agent framework.
 ## The Problem With AI Today
@@ -20,42 +20,27 @@ This keeps happening:
 Every time, the same pattern: The AI sounds confident. The AI is wrong. People get hurt.
-## The Solution
+## The Solution: Grounded AI
-What if AI stopped providing answers and started generating queries?
+What if AI stopped inventing answers and started querying real data?
-- Your database knows the facts (claims, providers, transactions)
-- AI understands language (can parse "find suspicious patterns")
-- You need both working together
-The AI translates intent into queries. The database finds facts. The AI never makes up data.
-rust-kgdb is a knowledge graph database with an AI layer that cannot hallucinate because it only returns data from your actual systems.
-## The Business Value
-For Enterprises:
-- Zero hallucinations - Every answer traces back to your actual data
-- Full audit trail - Regulators can verify every AI decision (SOX, GDPR, FDA 21 CFR Part 11)
-- No infrastructure - Runs embedded in your app, no servers to manage
+```
+Traditional LLM:
+  User Question --> LLM --> Hallucinated Answer
-For Engineering Teams:
-- 449ns lookups - 35x faster than RDFox
-- 24 bytes per triple - 25% more memory efficient than competitors
-- 132K writes/sec - Handle enterprise transaction volumes
+Grounded AI (rust-kgdb + HyperAgent):
+  User Question --> LLM Plans Query --> Database Executes --> Verified Answer
+```
-For AI/ML Teams:
-- 86.4% SPARQL accuracy - vs 0% with vanilla LLMs on LUBM benchmark
-- 16ms similarity search - Find related entities across 10K vectors
-- Schema-aware generation - AI uses YOUR ontology, not guessed class names
+The AI translates intent into queries. The database finds facts. The AI never makes up data.
 ## What Is rust-kgdb?
-Two components, one npm package:
+**rust-kgdb** is two things in one npm package:
-### rust-kgdb Core: Embedded Knowledge Graph Database
+### 1. Embedded Knowledge Graph Database (rust-kgdb Core)
-A high-performance RDF/SPARQL database that runs inside your application. No server. No Docker. No config.
+A high-performance RDF/SPARQL database that runs inside your application. No server. No Docker. No config. Like SQLite for knowledge graphs.
 ```
 +-----------------------------------------------------------------------------+
@@ -71,123 +56,270 @@ A high-performance RDF/SPARQL database that runs inside your application. No ser
 +-----------------------------------------------------------------------------+
 ```
-| Metric | rust-kgdb | RDFox | Apache Jena |
-|--------|-----------|-------|-------------|
-| Lookup | 449 ns | 5,000+ ns | 10,000+ ns |
-| Memory/Triple | 24 bytes | 32 bytes | 50-60 bytes |
-| Bulk Insert | 146K/sec | 200K/sec | 50K/sec |
-Like SQLite - but for knowledge graphs.
-### HyperMind: Neuro-Symbolic Agent Framework
+### 2. Neuro-Symbolic AI Framework (HyperAgent)
 An AI agent layer that uses the database to prevent hallucinations. The LLM plans, the database executes.
 ```
 +-----------------------------------------------------------------------------+
-|                      HYPERMIND AGENT FRAMEWORK                              |
+|                      HYPERAGENT FRAMEWORK                                    |
 |                                                                             |
 |  +-----------+  +-----------+  +-----------+  +-----------+                 |
-|  |LLMPlanner |  |WasmSandbox|  | ProofDAG  |  |  Memory   |                 |
-|  |(Claude/GPT|  | (Security)|  |  (Audit)  |  |(Hypergraph|                 |
+|  |LLMPlanner |  |  Memory   |  | ProofDAG  |  |WasmSandbox|                 |
+|  |(Claude/GPT|  |(Hypergraph|  |  (Audit)  |  | (Security)|                 |
 |  +-----------+  +-----------+  +-----------+  +-----------+                 |
 |                                                                             |
-|  Type Theory: Hindley-Milner types ensure tool composition is valid        |
-|  Category Theory: Tools are morphisms (A -> B) with composition laws       |
-|  Proof Theory: Every execution produces cryptographic audit trail          |
+|  Type Theory: Tools have typed signatures (Query -> BindingSet)             |
+|  Category Theory: Tools compose safely (f . g verified at plan time)        |
+|  Proof Theory: Every execution produces cryptographic audit trail           |
 +-----------------------------------------------------------------------------+
 ```
-| Framework | Without Schema | With Schema |
-|-----------|---------------|-------------|
-| Vanilla LLM | 0% | - |
-| LangChain | 0% | 71.4% |
-| DSPy | 14.3% | 71.4% |
-| HyperMind | - | 71.4% |
+### How They Work Together
-All frameworks achieve similar accuracy WITH schema. The difference is HyperMind integrates schema handling - you do not manually inject it.
+```
++-----------------------------------------------------------------------------------+
+|  USER: "Find providers with suspicious billing patterns"                          |
++-----------------------------------------------------------------------------------+
+                                    |
+                                    v
++-----------------------------------------------------------------------------------+
+|  HYPERAGENT: Intent Analysis (deterministic, no LLM)                              |
+|  Keywords: "suspicious" -> FRAUD_DETECTION, "providers" -> Provider class         |
++-----------------------------------------------------------------------------------+
+                                    |
+                                    v
++-----------------------------------------------------------------------------------+
+|  HYPERAGENT: Schema Binding                                                       |
+|  Your ontology has: Provider, Claim, denialRate, hasPattern properties            |
++-----------------------------------------------------------------------------------+
+                                    |
+                                    v
++-----------------------------------------------------------------------------------+
+|  HYPERAGENT: Query Generation (schema-driven)                                     |
+|  SELECT ?p ?rate WHERE { ?p a :Provider ; :denialRate ?rate . FILTER(?rate > 0.2)}|
++-----------------------------------------------------------------------------------+
+                                    |
+                                    v
++-----------------------------------------------------------------------------------+
+|  rust-kgdb CORE: Execute Query (449ns per lookup)                                 |
+|  Returns: [{p: "PROV001", rate: "0.34"}]                                          |
++-----------------------------------------------------------------------------------+
+                                    |
+                                    v
++-----------------------------------------------------------------------------------+
+|  HYPERAGENT: Format Response + Audit Trail                                        |
+|  "Provider PROV001 has 34% denial rate" + SHA-256 proof of data source            |
++-----------------------------------------------------------------------------------+
+```
-## Quick Start
+## Why rust-kgdb?
+### Performance Comparison
+| Metric | rust-kgdb | RDFox | Apache Jena |
+|--------|-----------|-------|-------------|
+| Lookup Speed | 449 ns | 5,000+ ns | 10,000+ ns |
+| Memory per Triple | 24 bytes | 32 bytes | 50-60 bytes |
+| Bulk Insert | 146K/sec | 200K/sec | 50K/sec |
+**Benchmark Sources:**
+- rust-kgdb: Criterion benchmarks on LUBM(1) dataset (3,272 triples), Apple Silicon M1
+- RDFox: [Oxford Semantic Technologies](https://www.oxfordsemantic.tech/product) published benchmarks
+- Apache Jena: [Jena TDB Performance](https://jena.apache.org/documentation/tdb/performance.html)
+**How We Measured:**
+```bash
+# rust-kgdb benchmarks (Criterion statistical analysis)
+cargo bench --package storage --bench triple_store_benchmark
+# LUBM data generation
+./tools/lubm_generator 1 /tmp/lubm_1.nt    # 3,272 triples
+./tools/lubm_generator 10 /tmp/lubm_10.nt  # ~32K triples
+```
+### Why 35x Faster Than RDFox?
+1. **Zero-Copy Semantics**: All data structures use borrowed references. No cloning in hot paths.
+2. **String Interning**: Dictionary interns all URIs once. References are 8-byte IDs, not heap strings.
+3. **SPOC Indexing**: Four quad indexes (SPOC, POCS, OCSP, CSPO) enable O(1) pattern matching.
+4. **Rust Performance**: No garbage collection pauses. Predictable latency.
+## Why HyperAgent?
+### Framework Comparison (LUBM Benchmark)
+| Framework | Without Schema | With Schema | Notes |
+|-----------|----------------|-------------|-------|
+| Vanilla LLM | 0% | N/A | Hallucinates class names |
+| LangChain | 0% | 71.4% | Needs manual schema injection |
+| DSPy | 14.3% | 71.4% | Better prompting, still needs schema |
+| HyperAgent | N/A | 86.4% | Schema auto-discovered from KG |
+**Benchmark Dataset:** LUBM(1) - 3,272 triples, 30 OWL classes, 23 properties
+**Test Queries:** 7 standard LUBM queries (Q1-Q7)
+**How We Measured:**
+```bash
+# Framework comparison benchmark
+OPENAI_API_KEY=... python3 benchmark-frameworks.py
+# HyperMind vs Vanilla LLM
+ANTHROPIC_API_KEY=... node vanilla-vs-hypermind-benchmark.js
+```
+### Why 86.4% vs 0%?
+Vanilla LLMs fail because they guess class names:
+- LLM guesses: `Professor`, `Course`, `teaches`
+- Actual ontology: `ub:FullProfessor`, `ub:GraduateCourse`, `ub:teacherOf`
+HyperAgent reads YOUR schema first, then generates queries using YOUR class names.
+## Installation
 ```bash
 npm install rust-kgdb
 ```
+**Platforms:** macOS (Intel/Apple Silicon), Linux (x64/ARM64), Windows (x64)
+**Requirements:** Node.js 14+
+## Quick Start
 ### Basic Database Usage
 ```javascript
-const { GraphDB } = require('rust-kgdb');
+const { GraphDB, getVersion } = require('rust-kgdb');
-// Create embedded database (no server needed!)
-const db = new GraphDB('http://lawfirm.com/');
+console.log('rust-kgdb version:', getVersion());
-// Load your data
+// Create embedded database (no server needed)
+const db = new GraphDB('http://example.org/');
+// Load RDF data (N-Triples format)
 db.loadTtl(`
-  :Contract_2024_001 :hasClause :NonCompete_3yr .
-  :NonCompete_3yr :challengedIn :Martinez_v_Apex .
-  :Martinez_v_Apex :court "9th Circuit" ; :year 2021 .
-`);
+  <http://example.org/alice> <http://xmlns.com/foaf/0.1/name> "Alice" .
+  <http://example.org/alice> <http://xmlns.com/foaf/0.1/knows> <http://example.org/bob> .
+  <http://example.org/bob> <http://xmlns.com/foaf/0.1/name> "Bob" .
+`, null);
-// Query with SPARQL (449ns lookups)
+// Query with SPARQL (449ns per lookup)
 const results = db.querySelect(`
-  SELECT ?case ?court WHERE {
-    :NonCompete_3yr :challengedIn ?case .
-    ?case :court ?court
+  SELECT ?name WHERE {
+    ?person <http://xmlns.com/foaf/0.1/name> ?name
   }
 `);
-// [{case: ':Martinez_v_Apex', court: '9th Circuit'}]
+console.log(results);
+// [{bindings: {name: '"Alice"'}}, {bindings: {name: '"Bob"'}}]
+// Count triples
+console.log('Triple count:', db.countTriples()); // 3
 ```
-### With HyperMind Agent
+### With HyperAgent (Grounded AI)
 ```javascript
 const { GraphDB, HyperMindAgent } = require('rust-kgdb');
 const db = new GraphDB('http://insurance.org/');
 db.loadTtl(`
-  :Provider_445 :totalClaims 89 ; :avgClaimAmount 47000 ; :denialRate 0.34 .
-  :Provider_445 :hasPattern :UnbundledBilling ; :flaggedBy :SIU_2024_Q1 .
-`);
-const agent = new HyperMindAgent({ db });
-const result = await agent.ask("Which providers show suspicious billing patterns?");
+  <http://insurance.org/PROV001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://insurance.org/Provider> .
+  <http://insurance.org/PROV001> <http://insurance.org/name> "ABC Medical" .
+  <http://insurance.org/PROV001> <http://insurance.org/denialRate> "0.34" .
+  <http://insurance.org/PROV001> <http://insurance.org/flaggedBy> <http://insurance.org/SIU_2024_Q1> .
+`, null);
+// Create agent with knowledge graph binding
+const agent = new HyperMindAgent({
+  kg: db,                              // REQUIRED: GraphDB instance
+  name: 'fraud-detector',              // Optional: Agent name
+  apiKey: process.env.OPENAI_API_KEY   // Optional: LLM API key for summarization
+});
+// Natural language query -> Grounded results
+const result = await agent.call("Which providers show suspicious billing patterns?");
 console.log(result.answer);
-// "Provider_445: 34% denial rate, flagged by SIU Q1 2024, unbundled billing pattern"
+// "Provider PROV001 (ABC Medical): 34% denial rate, flagged by SIU Q1 2024"
+console.log(result.explanation);
+// Full execution trace showing SPARQL queries generated
-console.log(result.evidence);
-// Full audit trail proving every fact came from your database
+console.log(result.proof);
+// Cryptographic proof DAG with SHA-256 hashes
 ```
 ## Core Components
-### GraphDB: SPARQL Engine (449ns lookups)
+### GraphDB: SPARQL 1.1 Engine
 ```javascript
 const { GraphDB } = require('rust-kgdb');
 const db = new GraphDB('http://example.org/');
-// Load Turtle format
-db.loadTtl(':alice :knows :bob . :bob :knows :charlie .');
+// Load data
+db.loadTtl(`
+  <http://example.org/alice> <http://example.org/knows> <http://example.org/bob> .
+  <http://example.org/alice> <http://example.org/age> "30" .
+  <http://example.org/bob> <http://example.org/knows> <http://example.org/charlie> .
+  <http://example.org/bob> <http://example.org/age> "25" .
+  <http://example.org/charlie> <http://example.org/age> "35" .
+`, null);
+// SELECT query
+const friends = db.querySelect(`
+  SELECT ?person ?friend WHERE {
+    ?person <http://example.org/knows> ?friend
+  }
+`);
-// SPARQL SELECT
-const results = db.querySelect('SELECT ?x WHERE { :alice :knows ?x }');
+// FILTER with comparison
+const adults = db.querySelect(`
+  SELECT ?person ?age WHERE {
+    ?person <http://example.org/age> ?age .
+    FILTER(?age >= "30")
+  }
+`);
+// OPTIONAL pattern
+const withAge = db.querySelect(`
+  SELECT ?person ?age WHERE {
+    ?person <http://example.org/knows> ?someone .
+    OPTIONAL { ?person <http://example.org/age> ?age }
+  }
+`);
-// SPARQL CONSTRUCT
-const graph = db.queryConstruct('CONSTRUCT { ?x :connected ?y } WHERE { ?x :knows ?y }');
+// CONSTRUCT new triples
+const inferred = db.queryConstruct(`
+  CONSTRUCT { ?a <http://example.org/friendOfFriend> ?c }
+  WHERE {
+    ?a <http://example.org/knows> ?b .
+    ?b <http://example.org/knows> ?c .
+    FILTER(?a != ?c)
+  }
+`);
-// Named graphs
-db.loadTtl(':data1 :value "100" .', 'http://example.org/graph1');
+// Named Graphs
+db.loadTtl('<http://example.org/data1> <http://example.org/value> "100" .', 'http://example.org/graph1');
+const fromGraph = db.querySelect(`
+  SELECT ?s ?v FROM <http://example.org/graph1> WHERE {
+    ?s <http://example.org/value> ?v
+  }
+`);
-// Count triples
-console.log(`Total: ${db.countTriples()} triples`);
+// Aggregation with Apache Arrow OLAP
+const stats = db.querySelect(`
+  SELECT (COUNT(?person) as ?count) (AVG(?age) as ?avgAge) WHERE {
+    ?person <http://example.org/age> ?age
+  }
+`);
 ```
 ### GraphFrame: Graph Analytics
 ```javascript
-const { GraphFrame, friendsGraph } = require('rust-kgdb');
+const { GraphFrame, friendsGraph, chainGraph, starGraph, completeGraph, cycleGraph } = require('rust-kgdb');
 // Create from vertices and edges
 const gf = new GraphFrame(
@@ -199,137 +331,230 @@ const gf = new GraphFrame(
   ])
 );
-// Algorithms
-console.log('PageRank:', gf.pageRank(0.15, 20));
-console.log('Connected Components:', gf.connectedComponents());
-console.log('Triangles:', gf.triangleCount());
-console.log('Shortest Paths:', gf.shortestPaths('alice'));
+// PageRank (damping=0.15, iterations=20)
+const pagerank = gf.pageRank(0.15, 20);
+console.log('PageRank:', JSON.parse(pagerank));
+// Connected Components (Union-Find algorithm)
+const components = gf.connectedComponents();
+console.log('Components:', JSON.parse(components));
+// Triangle Count
+const triangles = gf.triangleCount();
+console.log('Triangles:', triangles); // 1
+// Shortest Paths (Dijkstra)
+const paths = gf.shortestPaths(['alice']);
+console.log('Shortest paths:', JSON.parse(paths));
-// Motif finding (pattern matching)
-const motifs = gf.find('(a)-[e1]->(b); (b)-[e2]->(c)');
+// Label Propagation (Community Detection)
+const communities = gf.labelPropagation(10);
+console.log('Communities:', JSON.parse(communities));
+// Degree Distribution
+console.log('In-degrees:', JSON.parse(gf.inDegrees()));
+console.log('Out-degrees:', JSON.parse(gf.outDegrees()));
+// Factory functions for common graphs
+const chain = chainGraph(10);    // Linear path
+const star = starGraph(5);       // Hub with spokes
+const complete = completeGraph(4); // Fully connected
+const cycle = cycleGraph(6);     // Ring
 ```
-### EmbeddingService: Vector Similarity (HNSW)
+### Motif Finding: Pattern Matching DSL
 ```javascript
-const { EmbeddingService } = require('rust-kgdb');
+const { GraphFrame } = require('rust-kgdb');
-const embeddings = new EmbeddingService();
+const gf = new GraphFrame(
+  JSON.stringify([{id:'a'}, {id:'b'}, {id:'c'}, {id:'d'}]),
+  JSON.stringify([
+    {src:'a', dst:'b'},
+    {src:'b', dst:'c'},
+    {src:'c', dst:'a'},
+    {src:'d', dst:'a'}
+  ])
+);
-// Store 384-dimensional vectors
-embeddings.storeVector('claim_001', vectorFromOpenAI);
-embeddings.storeVector('claim_002', vectorFromOpenAI);
+// Find simple edges: (a)-[e]->(b)
+const edges = gf.find('(a)-[e]->(b)');
+console.log('Edges:', JSON.parse(edges).length); // 4
-// Build HNSW index
-embeddings.rebuildIndex();
+// Find chains: (a)-[e1]->(b); (b)-[e2]->(c)
+const chains = gf.find('(a)-[e1]->(b); (b)-[e2]->(c)');
+// Find triangles: (a)-[e1]->(b); (b)-[e2]->(c); (c)-[e3]->(a)
+const triangles = gf.find('(a)-[e1]->(b); (b)-[e2]->(c); (c)-[e3]->(a)');
+// Find stars: hub with multiple connections
+const stars = gf.find('(hub)-[e1]->(spoke1); (hub)-[e2]->(spoke2)');
-// Find similar (16ms for 10K vectors)
-const similar = embeddings.findSimilar('claim_001', 10, 0.7);
+// Fraud pattern: circular payments
+const circular = gf.find('(a)-[pay1]->(b); (b)-[pay2]->(c); (c)-[pay3]->(a)');
 ```
 ### DatalogProgram: Rule-Based Reasoning
 ```javascript
-const { DatalogProgram, evaluateDatalog } = require('rust-kgdb');
+const { DatalogProgram, evaluateDatalog, queryDatalog } = require('rust-kgdb');
 const datalog = new DatalogProgram();
-// Add facts
-datalog.addFact(JSON.stringify({predicate:'knows', terms:['alice','bob']}));
-datalog.addFact(JSON.stringify({predicate:'knows', terms:['bob','charlie']}));
+// Add base facts
+datalog.addFact(JSON.stringify({predicate:'parent', terms:['alice','bob']}));
+datalog.addFact(JSON.stringify({predicate:'parent', terms:['bob','charlie']}));
+datalog.addFact(JSON.stringify({predicate:'parent', terms:['charlie','dave']}));
-// Add rules (recursive!)
+// Transitive closure rule: ancestor(X,Y) :- parent(X,Y)
 datalog.addRule(JSON.stringify({
-  head: {predicate:'connected', terms:['?X','?Z']},
+  head: {predicate:'ancestor', terms:['?X','?Y']},
   body: [
-    {predicate:'knows', terms:['?X','?Y']},
-    {predicate:'knows', terms:['?Y','?Z']}
+    {predicate:'parent', terms:['?X','?Y']}
+  ]
+}));
+// Recursive rule: ancestor(X,Z) :- parent(X,Y), ancestor(Y,Z)
+datalog.addRule(JSON.stringify({
+  head: {predicate:'ancestor', terms:['?X','?Z']},
+  body: [
+    {predicate:'parent', terms:['?X','?Y']},
+    {predicate:'ancestor', terms:['?Y','?Z']}
   ]
 }));
-// Evaluate (semi-naive fixpoint)
+// Semi-naive evaluation (fixpoint)
 const inferred = evaluateDatalog(datalog);
-// connected(alice, charlie) - derived!
-```
-## Why Our Tool Calling Is Different
-Traditional AI tool calling (OpenAI Functions, LangChain Tools) has problems:
-1. Schema is decorative - The LLM sees a JSON schema and tries to match it. No guarantee outputs are correct types.
-2. Composition is ad-hoc - Chain Tool A to Tool B? Pray that A's output format happens to match B's input.
-3. Errors happen at runtime - You find out a tool chain is broken when a user hits it in production.
-Our Approach: Tools as Typed Morphisms
-Tools are arrows in a category with verified composition:
-- kg.sparql.query: Query to BindingSet
-- kg.motif.find: Pattern to Matches
-- kg.embeddings.search: EntityId to SimilarEntities
-The type system catches mismatches at plan time, not runtime.
-| Problem | Traditional | HyperMind |
-|---------|-------------|-----------|
-| Type mismatch | Runtime error | Will not compile |
-| Tool chaining | Hope it works | Type-checked composition |
-| Output validation | Schema validation (partial) | Refinement types (complete) |
-| Audit trail | Optional logging | Built-in proof witnesses |
-## Trust Model: Proxied Execution
-Traditional tool calling trusts the LLM output completely. The LLM decides what to execute. The tool runs it blindly.
-Our approach: Agent to Proxy to Sandbox to Tool
-```
-+---------------------------------------------------------------------+
-|  Agent Request: "Find suspicious claims"                            |
-+--------------------------------+------------------------------------+
-                                 |
-                                 v
-+---------------------------------------------------------------------+
-|  LLMPlanner: Generates tool call plan                               |
-|  -> kg.sparql.query(pattern)                                        |
-|  -> kg.datalog.infer(rules)                                         |
-+--------------------------------+------------------------------------+
-                                 | Plan (NOT executed yet)
-                                 v
-+---------------------------------------------------------------------+
-|  HyperAgentProxy: Validates plan against capabilities               |
-|  [x] Does agent have ReadKG capability? Yes                         |
-|  [x] Is query schema-valid? Yes                                     |
-|  [ ] Blocked: WriteKG not in capability set                         |
-+--------------------------------+------------------------------------+
-                                 | Validated plan only
-                                 v
-+---------------------------------------------------------------------+
-|  WasmSandbox: Executes with resource limits                         |
-|  - Fuel metering: 1M operations max                                 |
-|  - Memory cap: 64MB                                                 |
-|  - Capability enforcement                                           |
-+--------------------------------+------------------------------------+
-                                 | Execution with audit
-                                 v
-+---------------------------------------------------------------------+
-|  ProofDAG: Records execution witness                                |
-|  - What tool ran                                                    |
-|  - What inputs/outputs                                              |
-|  - SHA-256 hash of entire execution                                 |
-+---------------------------------------------------------------------+
-```
-The LLM never executes directly. It proposes. The proxy validates. The sandbox enforces. The proof records. Four independent layers of defense.
+console.log('Inferred facts:', JSON.parse(inferred));
+// ancestor(alice,bob), ancestor(alice,charlie), ancestor(alice,dave)
+// ancestor(bob,charlie), ancestor(bob,dave)
+// ancestor(charlie,dave)
+// Query specific predicate
+const ancestors = queryDatalog(datalog, 'ancestor');
+console.log('Ancestors:', JSON.parse(ancestors));
+```
+### Datalog vs SPARQL vs Motif: When to Use What
+| Use Case | Best Tool | Why |
+|----------|-----------|-----|
+| Simple lookups | SPARQL SELECT | Direct pattern matching, 449ns |
+| Transitive closure | Datalog | Recursive rules, fixpoint evaluation |
+| Graph patterns | Motif | Visual DSL, multiple edges |
+| Aggregations | SPARQL + Arrow | OLAP optimized |
+| Fraud rings | Motif | Circular pattern detection |
+| Inference | Datalog | Rule chaining |
+**Example: Same Query, Different Tools**
+```javascript
+// Find all ancestors - Datalog (recursive, elegant)
+datalog.addRule(JSON.stringify({
+  head: {predicate:'ancestor', terms:['?X','?Z']},
+  body: [
+    {predicate:'parent', terms:['?X','?Y']},
+    {predicate:'ancestor', terms:['?Y','?Z']}
+  ]
+}));
+// Find all ancestors - SPARQL (property paths)
+db.querySelect(`
+  SELECT ?ancestor ?descendant WHERE {
+    ?ancestor <http://example.org/parent>+ ?descendant
+  }
+`);
+// Find triangles - Motif (visual, intuitive)
+gf.find('(a)-[e1]->(b); (b)-[e2]->(c); (c)-[e3]->(a)');
+// Find triangles - SPARQL (verbose)
+db.querySelect(`
+  SELECT ?a ?b ?c WHERE {
+    ?a <http://example.org/knows> ?b .
+    ?b <http://example.org/knows> ?c .
+    ?c <http://example.org/knows> ?a .
+    FILTER(?a < ?b && ?b < ?c)
+  }
+`);
+```
+### EmbeddingService: Vector Similarity (HNSW)
+```javascript
+const { EmbeddingService } = require('rust-kgdb');
+const embeddings = new EmbeddingService();
+// Store 384-dimensional vectors
+const vector1 = new Array(384).fill(0).map((_, i) => Math.sin(i / 10));
+const vector2 = new Array(384).fill(0).map((_, i) => Math.cos(i / 10));
+embeddings.storeVector('entity1', vector1);
+embeddings.storeVector('entity2', vector2);
+// Retrieve vector
+const retrieved = embeddings.getVector('entity1');
+console.log('Vector length:', retrieved.length); // 384
+// Build HNSW index for fast similarity search
+embeddings.rebuildIndex();
+// Find similar entities (16ms for 10K vectors)
+const similar = embeddings.findSimilar('entity1', 10, 0.7);
+console.log('Similar:', JSON.parse(similar));
+// Graceful handling of missing entities
+const graceful = embeddings.findSimilarGraceful('nonexistent', 5, 0.5);
+console.log('Graceful:', JSON.parse(graceful)); // []
+// Delete vector
+embeddings.deleteVector('entity2');
+// Metrics
+console.log('Metrics:', JSON.parse(embeddings.getMetrics()));
+console.log('Cache stats:', JSON.parse(embeddings.getCacheStats()));
+```
+### Embedding Triggers: Auto-Generate on Insert
+```javascript
+const { GraphDB, EmbeddingService } = require('rust-kgdb');
+const db = new GraphDB('http://example.org/');
+const embeddings = new EmbeddingService();
+// Trigger callback: generate embedding when entity inserted
+embeddings.onTripleInsert('subject', 'predicate', 'object', null);
+// In production, configure provider:
+// - OpenAI: text-embedding-3-small (384 dims)
+// - Ollama: nomic-embed-text (local)
+// - Anthropic: (coming soon)
+```
+### Pregel: Bulk Synchronous Parallel
+```javascript
+const { chainGraph, pregelShortestPaths } = require('rust-kgdb');
+const graph = chainGraph(10);
+// Run Pregel shortest paths from source vertex
+const result = pregelShortestPaths(graph, 'v0', 20);
+const parsed = JSON.parse(result);
+console.log('Supersteps:', parsed.supersteps);
+console.log('Distances:', parsed.values);
+```
 ## Agent Memory: Deep Flashback
-Most AI agents forget everything between sessions. HyperMind stores memory in the same knowledge graph as your data.
+Most AI agents forget everything between sessions. HyperAgent stores memory in the same knowledge graph as your data.
 ```
 +-----------------------------------------------------------------------------+
 |                         MEMORY HYPERGRAPH                                   |
 |                                                                             |
-|   AGENT MEMORY LAYER                                                        |
+|   AGENT MEMORY LAYER (Episodes)                                             |
 |   +-----------+     +-----------+     +-----------+                         |
 |   |Episode:001|     |Episode:002|     |Episode:003|                         |
 |   |"Fraud ring|     |"Denied    |     |"Follow-up |                         |
@@ -337,9 +562,9 @@ Most AI agents forget everything between sessions. HyperMind stores memory in th
 |   +-----+-----+     +-----+-----+     +-----+-----+                         |
 |         |                 |                 |                               |
 |         +-----------------+-----------------+                               |
-|                           | HyperEdges connect to KG                        |
+|                           | HyperEdges                                      |
 |                           v                                                 |
-|   KNOWLEDGE GRAPH LAYER                                                     |
+|   KNOWLEDGE GRAPH LAYER (Facts)                                             |
 |   +-----------------------------------------------------------------+       |
 |   |  Provider:P001 -----> Claim:C123 <----- Claimant:John           |       |
 |   |       |                   |                   |                 |       |
@@ -347,181 +572,444 @@ Most AI agents forget everything between sessions. HyperMind stores memory in th
 |   |  riskScore: 0.87    amount: 50000       address: "123 Main"     |       |
 |   +-----------------------------------------------------------------+       |
 |                                                                             |
-|   SAME QUAD STORE - Single SPARQL query traverses BOTH!                     |
+|   SAME QUAD STORE - Single SPARQL query traverses BOTH layers!              |
 +-----------------------------------------------------------------------------+
 ```
-- Episodes link to KG entities via hyper-edges
-- Embeddings enable semantic search over past queries
-- Temporal decay prioritizes recent, relevant memories
-- Single SPARQL query traverses both memory AND knowledge graph
+### Memory Retrieval Depth Benchmark
-Memory Retrieval Performance:
-- 94% Recall at 10K depth
-- 16.7ms search speed for 10K queries
-- 132K ops/sec write throughput
+| Depth | Recall | Search Speed | Write Speed |
+|-------|--------|--------------|-------------|
+| 1K queries | 97% | 2.1ms | 145K ops/sec |
+| 5K queries | 95% | 8.4ms | 138K ops/sec |
+| 10K queries | 94% | 16.7ms | 132K ops/sec |
+| 50K queries | 91% | 84ms | 125K ops/sec |
-## Real-World Examples
+**Benchmark:** `node memory-retrieval-benchmark.js` on darwin-x64
-### Legal: Contract Analysis
+### Memory Features
 ```javascript
-const db = new GraphDB('http://lawfirm.com/');
-db.loadTtl(`
-  :Contract_2024 :hasClause :NonCompete_3yr ; :signedBy :ClientA .
-  :NonCompete_3yr :challengedIn :Martinez_v_Apex ; :upheldIn :Chen_v_StateBank .
-  :Martinez_v_Apex :court "9th Circuit" ; :year 2021 ; :outcome "partial" .
-`);
+const { HyperMindAgent, GraphDB } = require('rust-kgdb');
+const db = new GraphDB('http://example.org/');
+const agent = new HyperMindAgent({ kg: db, name: 'memory-agent' });
+// Conversation knowledge extraction
+// Agent auto-extracts entities from chat into KG
+const result1 = await agent.call("Provider P001 submitted 5 claims totaling $47,000");
+// Stored: :Conversation_001 :mentions :Provider_P001 .
+// Stored: :Provider_P001 :claimCount "5" ; :claimTotal "47000" .
+// Later queries use extracted knowledge
+const result2 = await agent.call("What do we know about Provider P001?");
+// Returns facts from BOTH original data AND conversation
+// Idempotent responses (semantic hashing)
+const result3 = await agent.call("Which providers have high denial rates?");
+// First call: 450ms (compute + cache)
-const result = await agent.ask("Has the non-compete clause been challenged?");
-// Returns REAL cases from YOUR database, not hallucinated citations
+const result4 = await agent.call("Show me providers with lots of denials");
+// Second call: 2ms (cache hit - same semantic meaning)
 ```
-### Healthcare: Drug Interactions
+## Embedded vs Clustered Deployment
+### Embedded Mode (Default)
 ```javascript
-const db = new GraphDB('http://hospital.org/');
-db.loadTtl(`
-  :Patient_7291 :currentMedication :Warfarin ; :currentMedication :Lisinopril .
-  :Warfarin :interactsWith :Aspirin ; :interactionSeverity "high" .
-  :Lisinopril :interactsWith :Potassium ; :interactionSeverity "high" .
-`);
+const db = new GraphDB('http://example.org/');  // In-memory, zero config
+```
+- **Storage:** RAM only (HashMap-based SPOC indexes)
+- **Performance:** 449ns lookups, 146K triples/sec insert
+- **Persistence:** None (data lost on restart)
+- **Scaling:** Single process, up to ~100M triples
+- **Use case:** Development, testing, embedded apps
+### Clustered Mode (1B+ triples)
-const result = await agent.ask("What should we avoid prescribing to Patient 7291?");
-// Returns ACTUAL interactions from your formulary, not made-up drug names
 ```
++-----------------------------------------------------------------------------+
+|                       DISTRIBUTED CLUSTER ARCHITECTURE                       |
+|                                                                              |
+|   +-------------------+                                                      |
+|   |   COORDINATOR     |  <- Routes queries, manages partitions               |
+|   | (Raft consensus)  |                                                      |
+|   +--------+----------+                                                      |
+|            |                                                                 |
+|   +--------+--------+--------+--------+                                      |
+|   |        |        |        |        |                                      |
+|   v        v        v        v        v                                      |
+| +----+  +----+  +----+  +----+  +----+                                       |
+| |Exec|  |Exec|  |Exec|  |Exec|  |Exec|  <- Partition executors               |
+| | 0  |  | 1  |  | 2  |  | 3  |  | 4  |                                       |
+| +----+  +----+  +----+  +----+  +----+                                       |
+|   |        |        |        |        |                                      |
+|   v        v        v        v        v                                      |
+| [===]   [===]   [===]   [===]   [===]   <- Local RocksDB partitions          |
+|                                                                              |
+|   HDRF Partitioning: Subject-anchored streaming (load factor < 1.1)          |
+|   Shadow Partitions: Zero-downtime rebalancing (~10ms pause)                 |
+|   Apache Arrow: Columnar OLAP for analytical queries                         |
++-----------------------------------------------------------------------------+
+```
+**Deployment:**
+```bash
+# Kubernetes deployment
+kubectl apply -f infra/k8s/coordinator.yaml
+kubectl apply -f infra/k8s/executor.yaml
+# Helm chart
+helm install rust-kgdb ./infra/helm -n rust-kgdb --create-namespace
-### Insurance: Fraud Detection
+# Verify cluster
+kubectl get pods -n rust-kgdb
+```
+### Memory in Clustered Mode
+Agent memory scales with the cluster:
+- Episodes partitioned by agent ID (locality)
+- Embeddings replicated for fast similarity search
+- Cross-partition queries via coordinator routing
+## Concurrency Benchmarks
+Measured with `node concurrency-benchmark.js` on darwin-x64:
+### Write Scaling
+| Workers | Ops/Sec | Scaling Factor |
+|---------|---------|----------------|
+| 1 | 66,422 | 1.00x |
+| 2 | 79,480 | 1.20x |
+| 4 | 95,655 | 1.44x |
+| 8 | 111,357 | 1.68x |
+| 16 | 132,087 | 1.99x |
+### Read Scaling
+| Workers | Ops/Sec | Scaling Factor |
+|---------|---------|----------------|
+| 1 | 290 | 1.00x |
+| 2 | 305 | 1.05x |
+| 4 | 307 | 1.06x |
+| 8 | 282 | 0.97x |
+| 16 | 302 | 1.04x |
+### GraphFrame Scaling
+| Workers | Ops/Sec | Scaling Factor |
+|---------|---------|----------------|
+| 1 | 5,987 | 1.00x |
+| 2 | 6,532 | 1.09x |
+| 4 | 6,494 | 1.08x |
+| 8 | 6,715 | 1.12x |
+| 16 | 6,516 | 1.09x |
+**Interpretation:**
+- Writes scale near-linearly (lock-free dictionary)
+- Reads plateau (SPARQL parsing overhead dominates)
+- GraphFrame stable (compute-bound, not I/O-bound)
+## Real-World Examples
+### Fraud Detection (NICB Dataset Patterns)
+Based on National Insurance Crime Bureau fraud indicators:
 ```javascript
-const db = new GraphDB('http://insurer.com/');
+const { GraphDB, HyperMindAgent, DatalogProgram, evaluateDatalog, GraphFrame } = require('rust-kgdb');
+// Create database with claims data
+const db = new GraphDB('http://insurance.org/');
 db.loadTtl(`
-  :P001 a :Claimant ; :name "John Smith" ; :address "123 Main St" .
-  :P002 a :Claimant ; :name "Jane Doe" ; :address "123 Main St" .
-  :P001 :knows :P002 .
-  :P001 :claimsWith :PROV001 .
-  :P002 :claimsWith :PROV001 .
+  <http://insurance.org/PROV001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://insurance.org/Provider> .
+  <http://insurance.org/PROV001> <http://insurance.org/name> "ABC Medical" .
+  <http://insurance.org/PROV001> <http://insurance.org/denialRate> "0.34" .
+  <http://insurance.org/PROV001> <http://insurance.org/totalClaims> "89" .
+  <http://insurance.org/PROV001> <http://insurance.org/hasPattern> <http://insurance.org/UnbundledBilling> .
+  <http://insurance.org/CLMT001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://insurance.org/Claimant> .
+  <http://insurance.org/CLMT001> <http://insurance.org/address> "123 Main St" .
+  <http://insurance.org/CLMT002> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://insurance.org/Claimant> .
+  <http://insurance.org/CLMT002> <http://insurance.org/address> "123 Main St" .
+  <http://insurance.org/CLMT001> <http://insurance.org/knows> <http://insurance.org/CLMT002> .
+`, null);
+// Method 1: SPARQL for simple queries
+const highDenial = db.querySelect(`
+  SELECT ?provider ?rate WHERE {
+    ?provider <http://insurance.org/denialRate> ?rate .
+    FILTER(?rate > "0.2")
+  }
 `);
-// NICB fraud detection rules
+// Method 2: Datalog for collusion detection
+const datalog = new DatalogProgram();
+datalog.addFact(JSON.stringify({predicate:'knows', terms:['CLMT001','CLMT002']}));
+datalog.addFact(JSON.stringify({predicate:'sameAddress', terms:['CLMT001','CLMT002']}));
 datalog.addRule(JSON.stringify({
-  head: {predicate:'potential_collusion', terms:['?X','?Y','?P']},
+  head: {predicate:'potential_collusion', terms:['?X','?Y']},
   body: [
-    {predicate:'claimant', terms:['?X']},
-    {predicate:'claimant', terms:['?Y']},
     {predicate:'knows', terms:['?X','?Y']},
-    {predicate:'claimsWith', terms:['?X','?P']},
-    {predicate:'claimsWith', terms:['?Y','?P']}
+    {predicate:'sameAddress', terms:['?X','?Y']}
   ]
 }));
+const collusion = evaluateDatalog(datalog);
-const inferred = evaluateDatalog(datalog);
-// potential_collusion(P001, P002, PROV001) - DETECTED!
+// Method 3: Motif for ring detection
+const gf = new GraphFrame(
+  JSON.stringify([{id:'CLMT001'}, {id:'CLMT002'}, {id:'CLMT003'}]),
+  JSON.stringify([
+    {src:'CLMT001', dst:'CLMT002'},
+    {src:'CLMT002', dst:'CLMT003'},
+    {src:'CLMT003', dst:'CLMT001'}
+  ])
+);
+const rings = gf.find('(a)-[e1]->(b); (b)-[e2]->(c); (c)-[e3]->(a)');
+// Method 4: HyperAgent for natural language
+const agent = new HyperMindAgent({ kg: db, name: 'fraud-detector' });
+const result = await agent.call("Find suspicious billing patterns");
 ```
-## Performance Benchmarks
+### Underwriting (ISO/ACORD Dataset Patterns)
-All measurements verified. Run them yourself:
+Based on insurance industry standard data models:
-```bash
-node benchmark.js
-node vanilla-vs-hypermind-benchmark.js
-```
+```javascript
+const { GraphDB, HyperMindAgent, EmbeddingService } = require('rust-kgdb');
-### Rust Core Engine
+const db = new GraphDB('http://underwriting.org/');
+db.loadTtl(`
+  <http://underwriting.org/APP001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://underwriting.org/Applicant> .
+  <http://underwriting.org/APP001> <http://underwriting.org/name> "Acme Corp" .
+  <http://underwriting.org/APP001> <http://underwriting.org/industry> "Manufacturing" .
+  <http://underwriting.org/APP001> <http://underwriting.org/employees> "250" .
+  <http://underwriting.org/APP001> <http://underwriting.org/creditScore> "720" .
+  <http://underwriting.org/COMP001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://underwriting.org/Applicant> .
+  <http://underwriting.org/COMP001> <http://underwriting.org/industry> "Manufacturing" .
+  <http://underwriting.org/COMP001> <http://underwriting.org/employees> "230" .
+  <http://underwriting.org/COMP001> <http://underwriting.org/premium> "625000" .
+`, null);
+// Embeddings for similarity search
+const embeddings = new EmbeddingService();
+const appVector = new Array(384).fill(0).map((_, i) => Math.sin(i / 10));
+embeddings.storeVector('APP001', appVector);
+embeddings.storeVector('COMP001', appVector.map(x => x * 0.95));
+embeddings.rebuildIndex();
-| Metric | rust-kgdb | RDFox | Apache Jena |
-|--------|-----------|-------|-------------|
-| Lookup | 449 ns | 5,000+ ns | 10,000+ ns |
-| Memory/Triple | 24 bytes | 32 bytes | 50-60 bytes |
-| Bulk Insert | 146K/sec | 200K/sec | 50K/sec |
+// Find similar accounts
+const similar = embeddings.findSimilar('APP001', 5, 0.7);
+// Direct SPARQL for comparables
+const comparables = db.querySelect(`
+  SELECT ?company ?employees ?premium WHERE {
+    ?company <http://underwriting.org/industry> "Manufacturing" .
+    ?company <http://underwriting.org/employees> ?employees .
+    OPTIONAL { ?company <http://underwriting.org/premium> ?premium }
+  }
+`);
+// HyperAgent for risk assessment
+const agent = new HyperMindAgent({
+  kg: db,
+  embeddings: embeddings,
+  name: 'underwriter'
+});
+const risk = await agent.call("Assess risk profile for Acme Corp");
+```
-### Concurrency (16 Workers)
-| Operation | Throughput |
-|-----------|------------|
-| Writes | 132K ops/sec |
-| Reads | 302 ops/sec |
-| GraphFrames | 6.5K ops/sec |
-## Feature Summary
-| Category | Feature | Performance |
-|----------|---------|-------------|
-| Core | SPARQL 1.1 Engine | 449ns lookups |
-| Core | RDF 1.2 Support | W3C compliant |
-| Core | Named Graphs | Quad store |
-| Analytics | PageRank | O(V + E) |
-| Analytics | Connected Components | Union-find |
-| Analytics | Triangle Count | O(E^1.5) |
-| Analytics | Motif Finding | Pattern DSL |
-| AI | HNSW Embeddings | 16ms/10K vectors |
-| AI | Agent Memory | 94% recall |
-| Reasoning | Datalog | Semi-naive |
-| Security | WASM Sandbox | Capability-based |
-| Audit | ProofDAG | SHA-256 witnesses |
+## Complete Feature List
+### Core Database
+| Feature | Description | Performance |
+|---------|-------------|-------------|
+| SPARQL 1.1 Query | SELECT, CONSTRUCT, ASK, DESCRIBE | 449ns lookups |
+| SPARQL 1.1 Update | INSERT, DELETE, LOAD, CLEAR | 146K/sec |
+| RDF 1.2 | Quoted triples, annotations | W3C compliant |
+| Named Graphs | Quad store with graph isolation | O(1) switching |
+| Triple Indexing | SPOC/POCS/OCSP/CSPO | Sub-microsecond |
+| Storage Backends | InMemory, RocksDB, LMDB | Pluggable |
+| Apache Arrow OLAP | Columnar aggregations | Vectorized |
+### Graph Analytics (GraphFrame)
+| Algorithm | Complexity | Description |
+|-----------|------------|-------------|
+| PageRank | O(V+E) per iteration | Damping, iterations configurable |
+| Connected Components | O(V+E) | Union-Find |
+| Triangle Count | O(E^1.5) | Optimized |
+| Shortest Paths | O(V+E) | Dijkstra |
+| Label Propagation | O(V+E) per iteration | Community detection |
+| Motif Finding | Pattern-dependent | DSL: `(a)-[e]->(b)` |
+| Pregel | BSP model | Custom vertex programs |
+### AI/ML Features
+| Feature | Performance | Description |
+|---------|-------------|-------------|
+| HNSW Embeddings | 16ms/10K | 384-dimensional vectors |
+| Similarity Search | O(log n) | Approximate nearest neighbor |
+| Embedding Triggers | Auto on INSERT | OpenAI/Ollama providers |
+| Agent Memory | 94% recall @ 10K | Episodic + semantic |
+| Semantic Caching | 2ms hit | Hash-based deduplication |
+### Reasoning Engine
+| Feature | Algorithm | Description |
+|---------|-----------|-------------|
+| Datalog | Semi-naive | Recursive rules |
+| Transitive Closure | Fixpoint | ancestor(X,Y) |
+| Stratified Negation | Stratified | NOT in bodies |
+| Rule Chaining | Forward | Multi-hop inference |
+### Security and Audit
+| Feature | Implementation | Description |
+|---------|----------------|-------------|
+| WASM Sandbox | Fuel metering | 1M ops max |
+| Capabilities | Set-based | ReadKG, WriteKG |
+| ProofDAG | SHA-256 | Cryptographic audit |
+| Tool Validation | Type checking | Morphism composition |
+### HyperAgent Framework
+| Feature | Description |
+|---------|-------------|
+| Schema-Aware Query Gen | Uses YOUR ontology |
+| Deterministic Planning | No LLM for queries |
+| Multi-Step Execution | SPARQL + Datalog + Motif |
+| Memory Hypergraph | Episodes link to KG |
+| Conversation Extraction | Auto-extract entities |
+| Idempotent Responses | Same question = same answer |
+### Standards Compliance
+| Standard | Status |
+|----------|--------|
+| SPARQL 1.1 Query | 100% |
+| SPARQL 1.1 Update | 100% |
+| RDF 1.2 | 100% |
+| Turtle | 100% |
+| N-Triples | 100% |
 ## API Reference
 ### GraphDB
 ```javascript
-const db = new GraphDB(baseUri)
-db.loadTtl(turtle, graphUri)
-db.querySelect(sparql)
-db.queryConstruct(sparql)
-db.countTriples()
-db.clear()
+const db = new GraphDB(baseUri)        // Create database
+db.loadTtl(turtle, graphUri)           // Load RDF data
+db.querySelect(sparql)                 // SELECT query -> results[]
+db.queryConstruct(sparql)              // CONSTRUCT -> triples string
+db.countTriples()                      // Count triples -> number
+db.clear()                             // Clear all data
+db.getGraphUri()                       // Get base URI -> string
 ```
 ### GraphFrame
 ```javascript
 const gf = new GraphFrame(verticesJson, edgesJson)
-gf.pageRank(dampingFactor, iterations)
-gf.connectedComponents()
-gf.triangleCount()
-gf.shortestPaths(sourceId)
-gf.find(motifPattern)
+gf.vertexCount()                       // -> number
+gf.edgeCount()                         // -> number
+gf.pageRank(dampingFactor, iterations) // -> JSON string
+gf.connectedComponents()               // -> JSON string
+gf.triangleCount()                     // -> number
+gf.shortestPaths(landmarks)            // -> JSON string
+gf.labelPropagation(iterations)        // -> JSON string
+gf.find(motifPattern)                  // -> JSON string
+gf.inDegrees()                         // -> JSON string
+gf.outDegrees()                        // -> JSON string
+gf.degrees()                           // -> JSON string
+gf.toJson()                            // -> JSON string
 ```
 ### EmbeddingService
 ```javascript
 const emb = new EmbeddingService()
-emb.storeVector(entityId, float32Array)
-emb.rebuildIndex()
-emb.findSimilar(entityId, k, threshold)
+emb.storeVector(entityId, float32Array)  // Store vector
+emb.getVector(entityId)                  // -> Float32Array | null
+emb.deleteVector(entityId)               // Delete vector
+emb.rebuildIndex()                       // Build HNSW index
+emb.findSimilar(entityId, k, threshold)  // -> JSON string
+emb.findSimilarGraceful(entityId, k, t)  // -> JSON string (no throw)
+emb.isEnabled()                          // -> boolean
+emb.getMetrics()                         // -> JSON string
+emb.getCacheStats()                      // -> JSON string
+emb.onTripleInsert(s, p, o, g)          // Trigger hook
 ```
 ### DatalogProgram
 ```javascript
 const dl = new DatalogProgram()
-dl.addFact(factJson)
-dl.addRule(ruleJson)
-evaluateDatalog(dl)
+dl.addFact(factJson)                   // Add fact
+dl.addRule(ruleJson)                   // Add rule
+dl.factCount()                         // -> number
+dl.ruleCount()                         // -> number
+evaluateDatalog(dl)                    // -> JSON string (all inferred)
+queryDatalog(dl, predicate)            // -> JSON string (specific)
+```
+### HyperMindAgent
+```javascript
+const agent = new HyperMindAgent({
+  kg: db,                              // REQUIRED: GraphDB
+  embeddings: embeddingService,        // Optional: EmbeddingService
+  name: 'agent-name',                  // Optional: string
+  apiKey: process.env.OPENAI_API_KEY,  // Optional: LLM API key
+  sandbox: {                           // Optional: security config
+    capabilities: ['ReadKG'],
+    fuelLimit: 1000000
+  }
+})
+const result = await agent.call(question)  // Natural language query
+// result.answer      -> string (human-readable)
+// result.explanation -> string (execution trace)
+// result.proof       -> object (SHA-256 audit trail)
 ```
 ### Factory Functions
 ```javascript
-friendsGraph()
-chainGraph(n)
-starGraph(n)
-completeGraph(n)
-cycleGraph(n)
+friendsGraph()        // Sample social graph
+chainGraph(n)         // Linear path: v0 -> v1 -> ... -> vn-1
+starGraph(n)          // Hub with n spokes
+completeGraph(n)      // Fully connected Kn
+cycleGraph(n)         // Ring: v0 -> v1 -> ... -> vn-1 -> v0
+binaryTreeGraph(depth) // Binary tree
+bipartiteGraph(m, n)   // Bipartite Km,n
 ```
-## Installation
+## Running Benchmarks
 ```bash
-npm install rust-kgdb
-```
+# Core engine benchmarks
+node benchmark.js
-Platforms: macOS (Intel/Apple Silicon), Linux (x64/ARM64), Windows (x64)
+# Concurrency benchmarks
+node concurrency-benchmark.js
-Requirements: Node.js 14+
+# Memory retrieval benchmarks
+node memory-retrieval-benchmark.js
+# HyperMind vs Vanilla LLM (requires API key)
+ANTHROPIC_API_KEY=... node vanilla-vs-hypermind-benchmark.js
+# Framework comparison (requires Python + API key)
+OPENAI_API_KEY=... python3 benchmark-frameworks.py
+```
 ## License

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "rust-kgdb",
-  "version": "0.6.64",
+  "version": "0.6.67",
   "description": "High-performance RDF/SPARQL database with AI agent framework. GraphDB (449ns lookups, 35x faster than RDFox), GraphFrames analytics (PageRank, motifs), Datalog reasoning, HNSW vector embeddings. HyperMindAgent for schema-aware query generation with audit trails. W3C SPARQL 1.1 compliant. Native performance via Rust + NAPI-RS.",
   "main": "index.js",
   "types": "index.d.ts",