npm - @cogitator-ai/memory - Versions diffs - 0.6.1 → 0.6.2 - Mend

@cogitator-ai/memory 0.6.1 → 0.6.2

This diff shows the contents of publicly released package versions as they appear in their public registries. It is provided for informational purposes only.

Files changed (2)

package/README.md +106 -0
package/package.json +3 -3

package/README.md CHANGED

@@ -20,7 +20,9 @@ pnpm add pg       # For PostgreSQL adapter
 - **Context Builder** - Build token-aware conversation context
 - **Embedding Services** - OpenAI and Ollama embedding integration
 - **Semantic Search** - Vector similarity search with pgvector
+- **Hybrid Search** - BM25 + Vector with Reciprocal Rank Fusion
 - **Facts Storage** - Store and retrieve agent knowledge
+- **Knowledge Graph** - Entity-relationship memory with multi-hop traversal
 - **Zod Schemas** - Type-safe configuration validation
 ---
@@ -356,6 +358,110 @@ const vector = await embeddings.embed('Hello, world!');
 ---
+## Hybrid Search
+Combine keyword search (BM25) with semantic search (vector embeddings) using Reciprocal Rank Fusion for best-of-both-worlds retrieval.
+### Configuration
+```typescript
+import {
+  HybridSearch,
+  InMemoryEmbeddingAdapter,
+  OpenAIEmbeddingService,
+} from '@cogitator-ai/memory';
+const embeddingService = new OpenAIEmbeddingService({
+  apiKey: process.env.OPENAI_API_KEY!,
+});
+const embeddingAdapter = new InMemoryEmbeddingAdapter();
+const search = new HybridSearch({
+  embeddingAdapter,
+  embeddingService,
+  keywordAdapter: embeddingAdapter, // PostgresAdapter also implements KeywordSearchAdapter
+  defaultWeights: { bm25: 0.4, vector: 0.6 },
+});
+```
+### Search Strategies
+```typescript
+// Pure vector search (semantic similarity)
+const vectorResults = await search.search({
+  query: 'machine learning algorithms',
+  strategy: 'vector',
+  limit: 10,
+});
+// Pure keyword search (BM25)
+const keywordResults = await search.search({
+  query: 'machine learning algorithms',
+  strategy: 'keyword',
+  limit: 10,
+});
+// Hybrid search (combines both with RRF)
+const hybridResults = await search.search({
+  query: 'machine learning algorithms',
+  strategy: 'hybrid',
+  weights: { bm25: 0.4, vector: 0.6 },
+  limit: 10,
+});
+// Results include both scores
+hybridResults.data.forEach((result) => {
+  console.log(`${result.content} — score: ${result.score}`);
+  console.log(`  vector: ${result.vectorScore}, keyword: ${result.keywordScore}`);
+});
+```
+### Document Indexing
+For keyword search, documents must be indexed:
+```typescript
+// Add documents to BM25 index
+search.indexDocument('doc-1', 'Machine learning is a subset of AI...');
+search.indexDocument('doc-2', 'Deep learning uses neural networks...');
+// Remove from index
+search.removeDocument('doc-1');
+// Clear entire index
+search.clearIndex();
+// Check index size
+console.log('Indexed documents:', search.indexSize);
+```
+### Why Hybrid Search?
+| Search Type        | Strengths                                            | Weaknesses                      |
+| ------------------ | ---------------------------------------------------- | ------------------------------- |
+| **Vector**         | Finds semantically similar content, handles synonyms | Misses exact keyword matches    |
+| **Keyword (BM25)** | Exact term matching, fast                            | Misses synonyms and paraphrases |
+| **Hybrid**         | Best of both worlds                                  | Slightly more computation       |
+**Example:** Query "ML algorithms"
+- Vector search finds "machine learning methods" (semantic match)
+- Keyword search finds "ML algorithms comparison" (exact match)
+- Hybrid returns both, ranked by combined relevance
+### Reciprocal Rank Fusion (RRF)
+Hybrid search uses RRF to combine rankings from both search methods:
+```
+RRF_score(d) = Σ 1 / (k + rank_i(d))
+```
+Where `k` is a constant (default 60) and `rank_i(d)` is the rank of document `d` in result set `i`. This produces stable rankings even when individual scores are on different scales.
+---
 ## Zod Schemas
 Type-safe configuration validation:
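
Reviewer note on the README hunk above: the Reciprocal Rank Fusion formula it adds can be illustrated with a small standalone sketch. This is not the package's implementation. `reciprocalRankFusion`, the `RankedList` type, and the sample rankings are hypothetical names introduced here, and the per-list `weights` parameter is only an assumption about how options like `{ bm25: 0.4, vector: 0.6 }` might scale each list's contribution; the diff does not say how the library applies them.

```typescript
// Hypothetical helper for illustration only; not exported by @cogitator-ai/memory.
type RankedList = string[]; // document ids, best match first

function reciprocalRankFusion(
  lists: RankedList[],
  weights: number[] = lists.map(() => 1), // assumption: one multiplier per list
  k = 60, // constant from the README formula
): Array<{ id: string; score: number }> {
  const scores = new Map<string, number>();

  lists.forEach((list, i) => {
    const weight = weights[i] ?? 1;
    list.forEach((id, index) => {
      const rank = index + 1; // ranks are 1-based
      // Each list contributes weight / (k + rank) for every document it contains.
      scores.set(id, (scores.get(id) ?? 0) + weight / (k + rank));
    });
  });

  // Highest fused score first.
  return [...scores.entries()]
    .map(([id, score]) => ({ id, score }))
    .sort((a, b) => b.score - a.score);
}

// Made-up rankings from a vector search and a keyword search.
const vectorRanking: RankedList = ['doc-2', 'doc-1', 'doc-3'];
const keywordRanking: RankedList = ['doc-1', 'doc-4', 'doc-2'];

const fused = reciprocalRankFusion([vectorRanking, keywordRanking]);
console.log(fused.map((r) => r.id));
```

Only ranks enter the sum, never the raw BM25 or cosine scores, which is why the fused ordering does not depend on the two score scales being comparable. A document that appears in both lists accumulates two terms and tends to rise above documents found by only one method.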

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@cogitator-ai/memory",
-  "version": "0.6.1",
+  "version": "0.6.2",
   "description": "Memory adapters for Cogitator AI agents",
   "type": "module",
   "main": "./dist/index.js",
@@ -19,8 +19,8 @@
     "@types/pg": "^8.10.9",
     "nanoid": "^5.0.4",
     "zod": "^3.22.4",
-    "@cogitator-ai/redis": "0.2.11",
-    "@cogitator-ai/types": "0.12.0"
+    "@cogitator-ai/types": "0.12.0",
+    "@cogitator-ai/redis": "0.2.11"
   },
   "optionalDependencies": {
     "ioredis": "^5.3.2",