npm - @199-bio/engram - Versions diffs - 0.11.1 → 0.13.0 - Mend

@199-bio/engram 0.11.1 → 0.13.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (30)

package/README.md +35 -31
package/dist/consolidation/plan.d.ts.map +1 -1
package/dist/index.js +7 -3
package/dist/retrieval/hybrid.d.ts.map +1 -1
package/dist/retrieval/index.d.ts.map +1 -1
package/dist/retrieval/jina.d.ts.map +1 -0
package/dist/web/server.d.ts.map +1 -1
package/logo.png +0 -0
package/package.json +2 -3
package/src/consolidation/consolidator.ts +9 -9
package/src/consolidation/plan.ts +9 -9
package/src/index.ts +7 -3
package/src/retrieval/hybrid.ts +11 -11
package/src/retrieval/index.ts +1 -1
package/src/retrieval/jina-bridge.py +297 -0
package/src/retrieval/{colbert.ts → jina.ts} +31 -16
package/src/web/chat-handler.ts +4 -4
package/src/web/server.ts +117 -6
package/src/web/static/app.js +12 -0
package/src/web/static/index.html +35 -0
package/tests/retrieval/hybrid.test.ts +158 -0
package/tests/settings.test.ts +68 -0
package/tests/storage/database.test.ts +315 -0
package/vitest.config.ts +7 -0
package/LIVING_PLAN.md +0 -180
package/PLAN.md +0 -514
package/boba-prompt.md +0 -107
package/src/retrieval/colbert-bridge.py +0 -222
package/tests/test-interactive.js +0 -218
package/tests/test-mcp.sh +0 -81

package/LIVING_PLAN.md DELETED

@@ -1,180 +0,0 @@
-# Engram Development - Living Plan
-**Last Updated**: 2024-12-22 03:50 UTC
-This file tracks development progress. If context is lost, read this file to continue.
----
-## Current Status: Phase 5 - Production Ready
-### Completed
-- [x] Project structure created
-- [x] package.json, tsconfig.json, .gitignore, LICENSE
-- [x] SQLite storage layer (`src/storage/database.ts`)
-  - Memories table with FTS5 for BM25
-  - Entities, Observations, Relations tables
-  - Graph traversal queries
-  - All CRUD operations
-- [x] Entity extractor (`src/graph/extractor.ts`)
-  - Heuristic-based name extraction
-  - Organization detection (Goldman Sachs, etc.)
-  - Known organizations database
-  - Relationship extraction
-  - No external dependencies
-- [x] Knowledge graph manager (`src/graph/knowledge-graph.ts`)
-  - High-level graph operations
-  - Auto-extraction from text
-  - Graph traversal
-- [x] ColBERT Python bridge (`src/retrieval/colbert-bridge.py`)
-  - RAGatouille integration
-  - JSON stdin/stdout protocol
-- [x] TypeScript ColBERT wrapper (`src/retrieval/colbert.ts`)
-  - Subprocess management
-  - Fallback SimpleRetriever when Python unavailable
-- [x] Hybrid search (`src/retrieval/hybrid.ts`)
-  - BM25 + Semantic + Graph
-  - Reciprocal Rank Fusion (RRF)
-- [x] MCP server with all tools (`src/index.ts`)
-  - remember, recall, forget
-  - create_entity, observe, relate, query_entity, list_entities
-  - stats
-- [x] Install dependencies and build
-- [x] Test end-to-end with fictive examples (11 tests pass)
-- [x] Entity extraction improvements
-  - Goldman Sachs correctly detected as organization
-  - Known organizations database
-  - Place filtering (California, etc.)
-  - Nationality/religion filtering
-### Verified Working
-- All 11 MCP test cases pass
-- BM25 search working (FTS5)
-- Graph-based entity linking working
-- ColBERT Python bridge working
-- Entity extraction correctly identifies orgs vs persons
----
-## File Structure
-```
-engram/
-├── src/
-│   ├── index.ts              # MCP server (DONE)
-│   ├── storage/
-│   │   ├── database.ts       # SQLite + FTS5 (DONE)
-│   │   └── index.ts          # Exports (DONE)
-│   ├── graph/
-│   │   ├── extractor.ts      # Entity extraction (DONE)
-│   │   ├── knowledge-graph.ts # Graph operations (DONE)
-│   │   └── index.ts          # Exports (DONE)
-│   ├── retrieval/
-│   │   ├── colbert.ts        # TypeScript wrapper (DONE)
-│   │   ├── colbert-bridge.py # Python RAGatouille (DONE)
-│   │   ├── hybrid.ts         # RRF fusion (DONE)
-│   │   └── index.ts          # Exports (DONE)
-├── tests/
-│   ├── test-interactive.js   # Full test suite (DONE)
-│   └── test-mcp.sh           # Shell test script (DONE)
-├── dist/                     # Compiled JS (auto-generated)
-├── package.json              # Dependencies (DONE)
-├── tsconfig.json             # TypeScript config (DONE)
-├── README.md                 # Documentation (DONE)
-└── LIVING_PLAN.md            # This file (DONE)
-```
----
-## MCP Tools Available
-1. **remember** - Store a new memory, auto-extracts entities
-2. **recall** - Hybrid search (BM25 + semantic + graph)
-3. **forget** - Remove a memory by ID
-4. **create_entity** - Manually create an entity
-5. **observe** - Add an observation about an entity
-6. **relate** - Create a relationship between entities
-7. **query_entity** - Get entity details and relationships
-8. **list_entities** - List all entities by type
-9. **stats** - Get memory/entity/relation counts
----
-## Key Decisions
-1. **ColBERT via Python**: RAGatouille is proven, well-maintained. Use subprocess.
-2. **BM25 via SQLite FTS5**: Already implemented, zero deps.
-3. **Local-first**: No API keys required.
-4. **Entity extraction**: Heuristics + known org database. Can add GLiNER later.
-5. **Hybrid Search**: RRF fusion with k=60 constant.
----
-## Testing Commands
-```bash
-# Build TypeScript
-cd /Users/biobook/Code/stuff/engram
-npm install
-npm run build
-# Run full test suite
-node tests/test-interactive.js
-# Test MCP server manually
-echo '{"jsonrpc":"2.0","id":1,"method":"tools/list"}' | node dist/index.js
-# Install as MCP for Claude Desktop
-# Add to ~/.claude/claude_desktop_config.json:
-# {
-#   "mcpServers": {
-#     "engram": {
-#       "command": "node",
-#       "args": ["/Users/biobook/Code/stuff/engram/dist/index.js"]
-#     }
-#   }
-# }
-```
----
-## Known Limitations
-- Windows not supported (RAGatouille limitation)
-- ColBERT models are ~500MB (downloaded on first use)
-- BM25 scores for named entities are low (graph search compensates)
-- Place extraction not implemented (California detected as person)
----
-## Future Enhancements
-- [ ] GLiNER for better NER
-- [ ] Gemini embeddings (optional cloud enhancement)
-- [ ] Cohere reranking (optional cloud enhancement)
-- [ ] Temporal memory decay
-- [ ] Memory consolidation (merge similar memories)
-- [ ] Export/import functionality
----
-## To Continue Development
-If starting fresh, run these commands:
-```bash
-cd /Users/biobook/Code/stuff/engram
-cat LIVING_PLAN.md  # Read this file
-npm run build       # Rebuild if needed
-node tests/test-interactive.js  # Run tests
-```
----
-## API Keys Needed
-**NONE** - This is a local-first implementation.
-Optional (for future cloud enhancement):
-- GEMINI_API_KEY - embeddings
-- COHERE_API_KEY - reranking
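
Reviewer note: the deleted LIVING_PLAN.md describes the Python bridge as a "JSON stdin/stdout protocol" driven from a TypeScript subprocess wrapper, but this hunk does not show the wire format. The sketch below illustrates one common way such a bridge is framed, newline-delimited JSON, and is not taken from the package. The `BridgeRequest` and `BridgeResponse` shapes, their field names, and the `LineDecoder` class are all hypothetical.

```typescript
// Hypothetical message shapes. The deleted colbert-bridge.py is not shown in
// this hunk, so these field names are assumptions for illustration only.
interface BridgeRequest {
  id: number;
  command: string;
  params: Record<string, unknown>;
}

interface BridgeResponse {
  id: number;
  ok: boolean;
  result?: unknown;
  error?: string;
}

// Serialize one request as a single line of JSON terminated by "\n".
// JSON.stringify escapes newlines inside strings, so a message never spans lines.
function encodeRequest(req: BridgeRequest): string {
  return JSON.stringify(req) + "\n";
}

// Accumulates stdout chunks and returns only complete responses, because a
// chunk may hold a partial line or several lines at once.
class LineDecoder {
  private buffer = "";

  push(chunk: string): BridgeResponse[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last element is an unfinished line, or "" if the chunk ended on "\n".
    this.buffer = lines.pop() ?? "";
    return lines
      .filter((line) => line.trim().length > 0)
      .map((line) => JSON.parse(line) as BridgeResponse);
  }
}
```

In a wrapper of this kind, `encodeRequest` output would be written to the child's stdin and every stdout `data` event would be fed through `LineDecoder.push`, with the `id` field matching responses to pending requests.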

package/PLAN.md DELETED

@@ -1,514 +0,0 @@
-# Engram Implementation Plan
-## Overview
-Build a local-first MCP memory server with SOTA retrieval quality using ColBERT + BM25 hybrid search and a lightweight knowledge graph.
-## Core Insight
-**The 80/20**: ColBERT (via RAGatouille) gives us embedding + reranking in one model, with better out-of-domain generalization than dense embeddings. Combined with BM25 for exact matches, this beats most API-based solutions while running entirely locally.
----
-## Phase 1: Foundation
-### 1.1 Project Setup
-- TypeScript + Node.js (MCP standard)
-- ESM modules
-- Directory structure:
-  ```
-  engram/
-  ├── src/
-  │   ├── index.ts          # MCP server entry
-  │   ├── mcp/              # Tool definitions
-  │   ├── retrieval/        # ColBERT + BM25
-  │   ├── graph/            # Knowledge graph
-  │   ├── storage/          # SQLite operations
-  │   └── utils/            # Helpers
-  ├── models/               # Downloaded models
-  ├── tests/
-  └── scripts/
-  ```
-### 1.2 Storage Layer (SQLite)
-Single SQLite database with tables:
-```sql
--- Memories: raw content
-CREATE TABLE memories (
-  id TEXT PRIMARY KEY,
-  content TEXT NOT NULL,
-  source TEXT,                    -- 'conversation', 'import', etc.
-  timestamp DATETIME DEFAULT CURRENT_TIMESTAMP,
-  importance REAL DEFAULT 0.5,    -- 0-1 score
-  access_count INTEGER DEFAULT 0,
-  last_accessed DATETIME
-);
--- FTS5 for BM25 search
-CREATE VIRTUAL TABLE memories_fts USING fts5(
-  content,
-  content='memories',
-  content_rowid='rowid'
-);
--- Entities: nodes in knowledge graph
-CREATE TABLE entities (
-  id TEXT PRIMARY KEY,
-  name TEXT NOT NULL,
-  type TEXT NOT NULL,             -- 'person', 'place', 'concept', 'event'
-  created_at DATETIME DEFAULT CURRENT_TIMESTAMP,
-  metadata JSON
-);
--- Observations: facts about entities
-CREATE TABLE observations (
-  id TEXT PRIMARY KEY,
-  entity_id TEXT NOT NULL REFERENCES entities(id),
-  content TEXT NOT NULL,
-  source_memory_id TEXT REFERENCES memories(id),
-  confidence REAL DEFAULT 1.0,
-  valid_from DATETIME DEFAULT CURRENT_TIMESTAMP,
-  valid_until DATETIME,           -- NULL = still valid
-  UNIQUE(entity_id, content)
-);
--- Relations: edges between entities
-CREATE TABLE relations (
-  id TEXT PRIMARY KEY,
-  from_entity TEXT NOT NULL REFERENCES entities(id),
-  to_entity TEXT NOT NULL REFERENCES entities(id),
-  type TEXT NOT NULL,             -- 'sibling', 'knows', 'works_at', etc.
-  properties JSON,
-  created_at DATETIME DEFAULT CURRENT_TIMESTAMP
-);
-```
-### 1.3 MCP Server Skeleton
-- Implement MCP protocol handlers
-- Tool registration
-- Error handling
-- Logging
----
-## Phase 2: Retrieval Engine
-### 2.1 ColBERT Integration
-**Option A: RAGatouille (Python)**
-- Proven, well-maintained
-- Need Python subprocess or microservice
-- ~500MB model size
-**Option B: FastEmbed + ColBERT (Node.js)**
-- Native Node.js via ONNX
-- fastembed-js has ColBERT support
-- May be less mature
-**Decision**: Start with Python subprocess calling RAGatouille. If latency is issue, migrate to ONNX later.
-```typescript
-// src/retrieval/colbert.ts
-class ColBERTRetriever {
-  private pythonProcess: ChildProcess;
-  async index(documents: Document[]): Promise<void>;
-  async search(query: string, k: number): Promise<SearchResult[]>;
-  async rerank(query: string, docs: Document[]): Promise<RankedResult[]>;
-}
-```
-### 2.2 BM25 via SQLite FTS5
-```typescript
-// src/retrieval/bm25.ts
-class BM25Retriever {
-  async search(query: string, k: number): Promise<SearchResult[]> {
-    return db.all(`
-      SELECT m.*, bm25(memories_fts) as score
-      FROM memories_fts
-      JOIN memories m ON memories_fts.rowid = m.rowid
-      WHERE memories_fts MATCH ?
-      ORDER BY score
-      LIMIT ?
-    `, [query, k]);
-  }
-}
-```
-### 2.3 Hybrid Search with RRF
-Reciprocal Rank Fusion combines rankings:
-```typescript
-// src/retrieval/hybrid.ts
-function reciprocalRankFusion(
-  rankings: SearchResult[][],
-  k: number = 60
-): SearchResult[] {
-  const scores = new Map<string, number>();
-  for (const ranking of rankings) {
-    for (let i = 0; i < ranking.length; i++) {
-      const docId = ranking[i].id;
-      const rrf = 1 / (k + i + 1);
-      scores.set(docId, (scores.get(docId) || 0) + rrf);
-    }
-  }
-  return Array.from(scores.entries())
-    .sort((a, b) => b[1] - a[1])
-    .map(([id, score]) => ({ id, score }));
-}
-```
----
-## Phase 3: Knowledge Graph
-### 3.1 Entity Extraction
-**Option A: Local NER model**
-- GLiNER or similar
-- Runs locally
-- Generic entities
-**Option B: LLM-based (using calling model)**
-- More accurate for personal context
-- Already in conversation
-- Prompt engineering needed
-**Decision**: Start with regex/heuristics for names (capitalized words), dates, etc. Add GLiNER later if needed.
-```typescript
-// src/graph/extractor.ts
-class EntityExtractor {
-  extractPersons(text: string): string[] {
-    // Heuristic: capitalized words not at sentence start
-    // + common name patterns
-  }
-  extractDates(text: string): Date[] {
-    // chrono-node for date parsing
-  }
-  extractAll(text: string): Entity[] {
-    return [
-      ...this.extractPersons(text).map(p => ({ name: p, type: 'person' })),
-      ...this.extractDates(text).map(d => ({ name: d.toISOString(), type: 'date' })),
-    ];
-  }
-}
-```
-### 3.2 Graph Operations
-```typescript
-// src/graph/knowledge-graph.ts
-class KnowledgeGraph {
-  async addEntity(name: string, type: EntityType): Promise<Entity>;
-  async addObservation(entityId: string, content: string, sourceMemoryId?: string): Promise<Observation>;
-  async addRelation(from: string, to: string, type: string): Promise<Relation>;
-  async getEntity(id: string): Promise<EntityWithObservations>;
-  async findEntities(query: string): Promise<Entity[]>;
-  async traverse(
-    startEntity: string,
-    depth: number,
-    relationTypes?: string[]
-  ): Promise<GraphTraversal>;
-}
-```
-### 3.3 Graph-Enhanced Retrieval
-When recalling, expand search with graph context:
-```typescript
-async function recallWithGraph(query: string, k: number): Promise<Memory[]> {
-  // 1. Hybrid search
-  const hybridResults = await hybridSearch(query, k * 2);
-  // 2. Extract entities from query
-  const queryEntities = entityExtractor.extractAll(query);
-  // 3. Get related observations
-  const relatedObs = [];
-  for (const entity of queryEntities) {
-    const e = await graph.findEntities(entity.name);
-    if (e.length > 0) {
-      const traversal = await graph.traverse(e[0].id, 2); // depth = 2
-      relatedObs.push(...traversal.observations);
-    }
-  }
-  // 4. Add source memories from observations to candidate pool
-  const candidateIds = new Set([
-    ...hybridResults.map(r => r.id),
-    ...relatedObs.map(o => o.source_memory_id).filter(Boolean)
-  ]);
-  // 5. ColBERT rerank all candidates
-  const candidates = await getMemoriesById([...candidateIds]);
-  return await colbert.rerank(query, candidates, k);
-}
-```
----
-## Phase 4: MCP Tools
-### 4.1 Core Tools
-```typescript
-const tools = [
-  {
-    name: 'remember',
-    description: 'Store a new memory',
-    inputSchema: {
-      type: 'object',
-      properties: {
-        content: { type: 'string', description: 'The memory content' },
-        source: { type: 'string', description: 'Source of the memory' },
-        importance: { type: 'number', description: '0-1 importance score' }
-      },
-      required: ['content']
-    }
-  },
-  {
-    name: 'recall',
-    description: 'Retrieve relevant memories',
-    inputSchema: {
-      type: 'object',
-      properties: {
-        query: { type: 'string', description: 'What to search for' },
-        limit: { type: 'number', default: 5 },
-        include_graph: { type: 'boolean', default: true }
-      },
-      required: ['query']
-    }
-  },
-  // ... other tools
-];
-```
-### 4.2 Tool Implementations
-```typescript
-// src/mcp/tools/remember.ts
-async function remember(params: RememberParams): Promise<RememberResult> {
-  // 1. Store memory
-  const memory = await storage.createMemory({
-    content: params.content,
-    source: params.source || 'conversation',
-    importance: params.importance ?? 0.5  // ?? keeps an explicit importance of 0
-  });
-  // 2. Index for ColBERT
-  await colbert.index([{ id: memory.id, text: memory.content }]);
-  // 3. Index for BM25 (automatic via FTS5 trigger)
-  // 4. Extract and store entities
-  const entities = entityExtractor.extractAll(params.content);
-  for (const entity of entities) {
-    const e = await graph.addEntity(entity.name, entity.type);
-    // Create observation linking entity to this memory
-    await graph.addObservation(e.id, params.content, memory.id);
-  }
-  return { id: memory.id, entities: entities.map(e => e.name) };
-}
-```
----
-## Phase 5: Optimizations
-### 5.1 Temporal Decay
-Recent memories weighted higher:
-```typescript
-function temporalScore(memory: Memory, now: Date): number {
-  const ageInDays = (now.getTime() - memory.timestamp.getTime()) / (1000 * 60 * 60 * 24);
-  // Exponential decay with half-life of 30 days
-  return Math.exp(-0.693 * ageInDays / 30);
-}
-function combinedScore(memory: Memory, retrievalScore: number): number {
-  const temporal = temporalScore(memory, new Date());
-  const importance = memory.importance;
-  const access = Math.log(1 + memory.access_count) / 10;
-  return retrievalScore * (0.6 + 0.2 * temporal + 0.1 * importance + 0.1 * access);
-}
-```
-### 5.2 Memory Consolidation
-Merge similar memories over time:
-```typescript
-async function consolidate(): Promise<void> {
-  // Find similar memories (high ColBERT similarity)
-  const clusters = await findSimilarClusters(0.9); // similarity threshold
-  for (const cluster of clusters) {
-    if (cluster.length > 1) {
-      // Merge into single consolidated memory
-      const merged = mergeMemories(cluster);
-      await storage.createMemory(merged);
-      await storage.archiveMemories(cluster.map(m => m.id));
-    }
-  }
-}
-```
-### 5.3 Lazy Loading
-Don't load ColBERT until first use:
-```typescript
-class LazyColBERT {
-  private instance: ColBERTRetriever | null = null;
-  async get(): Promise<ColBERTRetriever> {
-    if (!this.instance) {
-      this.instance = await ColBERTRetriever.initialize();
-    }
-    return this.instance;
-  }
-}
-```
----
-## Phase 6: Optional Cloud Enhancement
-### 6.1 Graceful Degradation
-```typescript
-class HybridEmbedder {
-  async embed(texts: string[]): Promise<number[][]> {
-    if (process.env.GEMINI_API_KEY) {
-      try {
-        return await geminiEmbed(texts);
-      } catch (e) {
-        console.warn('Gemini unavailable, falling back to local');
-      }
-    }
-    return await qwen3Embed(texts);
-  }
-}
-```
-### 6.2 Smart API Usage
-Only use APIs when it adds value:
-```typescript
-async function recall(query: string): Promise<Memory[]> {
-  // Local ColBERT for retrieval (always)
-  const candidates = await localRetrieval(query);
-  // Use Cohere rerank only for ambiguous queries
-  if (process.env.COHERE_API_KEY && candidates.length > 10) {
-    const topScores = candidates.slice(0, 5).map(c => c.score);
-    const variance = calculateVariance(topScores);
-    if (variance < 0.1) {
-      // Scores too close - worth API rerank
-      return await cohereRerank(query, candidates);
-    }
-  }
-  return candidates.slice(0, 5);
-}
-```
----
-## Implementation Order
-### Week 1: Core
-1. Project setup (TypeScript, deps, structure)
-2. SQLite schema + storage layer
-3. MCP server skeleton with remember/recall stubs
-### Week 2: Retrieval
-4. BM25 via FTS5
-5. RAGatouille Python bridge
-6. Hybrid search with RRF
-7. Basic remember/recall working
-### Week 3: Knowledge Graph
-8. Entity extraction (regex/heuristics)
-9. Graph schema + operations
-10. Graph-enhanced retrieval
-### Week 4: Polish
-11. Temporal decay
-12. Export/import
-13. Testing
-14. Documentation
----
-## Dependencies
-```json
-{
-  "dependencies": {
-    "@modelcontextprotocol/sdk": "latest",
-    "better-sqlite3": "^9.0.0",
-    "chrono-node": "^2.7.0",
-    "uuid": "^9.0.0",
-    "zod": "^3.22.0"
-  },
-  "devDependencies": {
-    "@types/better-sqlite3": "^7.6.0",
-    "@types/node": "^20.0.0",
-    "typescript": "^5.0.0",
-    "vitest": "^1.0.0"
-  }
-}
-```
-Python dependencies (for ColBERT):
-```
-ragatouille>=0.0.8
-torch>=2.0.0
-```
----
-## Success Metrics
-1. **Retrieval Quality**: Manually test with 50 queries, measure if correct memory is in top-3
-2. **Latency**: recall < 100ms, remember < 200ms
-3. **Storage**: < 100MB for 10,000 memories (excluding model)
-4. **Reliability**: Zero crashes in 1-week daily use
----
-## Risks & Mitigations
-| Risk | Mitigation |
-|------|------------|
-| RAGatouille Python bridge adds latency | Start Python process once, keep alive |
-| ColBERT model too large | Use quantized version, lazy load |
-| Entity extraction inaccurate | Start simple, add GLiNER if needed |
-| SQLite concurrent access | Use WAL mode, single writer |
----
-## Future Extensions
-- **Voice memos**: Whisper transcription → memory
-- **Image memories**: CLIP embeddings
-- **Calendar integration**: Auto-import events
-- **Journaling mode**: Daily summary generation
-- **Multi-user**: Shared knowledge graphs
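
Reviewer note: section 6.2 of the deleted PLAN.md calls `calculateVariance(topScores)` and reranks only when the result is below 0.1, but the helper is never defined in the file. A minimal sketch of what it could look like follows. Population variance is assumed here (the plan does not say which estimator was intended), and `shouldRerank` with its parameter names is a hypothetical wrapper around the same gate, not code from the package.

```typescript
// Population variance of a list of scores. Assumption: the plan does not say
// whether population or sample variance was intended.
function calculateVariance(values: number[]): number {
  if (values.length === 0) return 0;
  const mean = values.reduce((sum, v) => sum + v, 0) / values.length;
  return values.reduce((sum, v) => sum + (v - mean) ** 2, 0) / values.length;
}

// Hypothetical helper mirroring the gate in section 6.2: rerank only when
// there are more than `minCandidates` results and the top `topN` scores are
// tightly clustered (variance below `maxVariance`).
function shouldRerank(
  scores: number[],
  topN = 5,
  minCandidates = 10,
  maxVariance = 0.1
): boolean {
  if (scores.length <= minCandidates) return false;
  return calculateVariance(scores.slice(0, topN)) < maxVariance;
}
```

One thing to watch with a fixed threshold: if the candidate scores were RRF-fused with k = 60 over three rankings, each score would be at most 3/61 ≈ 0.049, so the variance would be far below 0.1 and the gate would always pass. Whether that applies depends on what scale `localRetrieval` returns, which the plan does not specify.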