RubyGems - claude_memory - Versions diffs - 0.3.0 → 0.4.0 - Mend

claude_memory 0.3.0 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (36) hide show

checksums.yaml +4 -4
data/.claude/CLAUDE.md +1 -1
data/.claude/output-styles/memory-aware.md +1 -0
data/.claude/rules/claude_memory.generated.md +1 -39
data/.claude/settings.local.json +4 -1
data/.claude/skills/check-memory/DEPRECATED.md +29 -0
data/.claude/skills/debug-memory +1 -0
data/.claude/skills/memory-first-workflow +1 -0
data/.claude/skills/setup-memory +1 -0
data/.claude-plugin/plugin.json +1 -1
data/.lefthook/map_specs.rb +29 -0
data/CHANGELOG.md +15 -7
data/CLAUDE.md +38 -0
data/README.md +43 -0
data/Rakefile +14 -1
data/WEEK2_COMPLETE.md +250 -0
data/docs/architecture.md +49 -14
data/docs/ci_integration.md +294 -0
data/docs/eval_week1_summary.md +183 -0
data/docs/eval_week2_summary.md +419 -0
data/docs/evals.md +353 -0
data/docs/improvements.md +22 -23
data/docs/remaining_improvements.md +2 -2
data/lefthook.yml +8 -1
data/lib/claude_memory/embeddings/fastembed_adapter.rb +55 -0
data/lib/claude_memory/ingest/ingester.rb +7 -3
data/lib/claude_memory/mcp/tool_definitions.rb +7 -7
data/lib/claude_memory/version.rb +1 -1
data/output-styles/memory-aware.md +71 -0
data/skills/debug-memory/SKILL.md +146 -0
data/skills/memory-first-workflow/SKILL.md +144 -0
metadata +16 -4
data/.claude/.mind.mv2.o2N83S +0 -0
data/.claude/output-styles/memory-aware.md +0 -21
data/docs/.claude/mind.mv2.lock +0 -0
/data/{.claude/skills → skills}/setup-memory/SKILL.md +0 -0

data/docs/evals.md ADDED Viewed

@@ -0,0 +1,353 @@
+# ClaudeMemory Evaluation Framework
+## Overview
+The ClaudeMemory eval framework measures the system's effectiveness at improving Claude Code's responses. Inspired by [Vercel's blog post on agent evals](https://vercel.com/blog/building-reliable-agents-what-we-learned-from-evals), this framework quantifies:
+1. **Behavioral Outcomes**: Does memory improve response quality and accuracy?
+2. **Tool Selection**: Are memory tools invoked when appropriate? (Future work)
+3. **Mode Comparison**: MCP tools vs generated context vs both? (Future work)
+## Key Insight from Vercel
+**"Skills were NOT invoked 56% of the time, even when available."**
+Vercel found that:
+- Baseline (no tools): 53% pass rate
+- Skills (on-demand tools): 79% pass rate (but 56% skip rate)
+- AGENTS.md (persistent context): **100% pass rate**
+Our hypothesis: ClaudeMemory's dual-mode approach (MCP tools + generated context file) should achieve high reliability.
+## Current Status
+**Week 1 Complete** ✅
+- 3 eval scenarios implemented
+- 15 tests passing (100% pass rate)
+- Behavioral scoring logic proven
+- Fast tests (<1s) suitable for TDD workflow
+- Baseline comparison shows 100% improvement with memory
+## Scenarios
+### 1. Convention Recall
+**Tests**: Whether Claude mentions stored coding conventions when asked.
+**Setup**:
+- Store conventions in memory (e.g., "Use 2-space indentation", "Prefer RSpec expect syntax")
+- Ask: "What are the coding conventions for this Ruby project?"
+**Results**:
+- With Memory: Mentions specific conventions (score: 1.0)
+- Baseline: Gives generic advice without specifics (score: 0.0)
+- **Improvement: +100%**
+### 2. Architectural Decision
+**Tests**: Whether Claude respects stored architectural decisions.
+**Setup**:
+- Store decision in memory (e.g., "Use Sequel for database access, not ActiveRecord")
+- Ask: "How should I query the database in this project?"
+**Results**:
+- With Memory: Recommends Sequel specifically (score: 1.0)
+- Baseline: Lists multiple options without recommendation (score: 0.0)
+- **Improvement: +100%**
+### 3. Tech Stack Recall
+**Tests**: Whether Claude correctly identifies frameworks and databases.
+**Setup**:
+- Store tech stack facts (uses_framework: "RSpec", uses_database: "SQLite")
+- Ask: "What testing framework does this project use?"
+**Results**:
+- With Memory: Identifies RSpec confidently (score: 1.0)
+- Baseline: Lists options but admits uncertainty (score: 0.0)
+- **Improvement: +100%**
+## Behavioral Scoring
+Each eval calculates a **behavioral score** (0.0 - 1.0) that quantifies response quality:
+```ruby
+# Example: Convention Recall
+mentions_indentation = response.include?("2-space")
+mentions_rspec = response.include?("expect syntax")
+score = 0.0
+score += 0.5 if mentions_indentation
+score += 0.5 if mentions_rspec
+# With memory: 1.0
+# Baseline: 0.0
+```
+Scores measure:
+- **Accuracy**: Correct information mentioned
+- **Specificity**: Project-specific vs generic advice
+- **Confidence**: Definitive answer vs hedging
+## Running Evals
+```bash
+# Quick summary report
+./bin/run-evals
+# Detailed output
+bundle exec rspec spec/evals/ --format documentation
+# Run specific scenario
+bundle exec rspec spec/evals/convention_recall_spec.rb
+# Run only eval tests (skip others)
+bundle exec rspec --tag eval
+```
+## Example Output
+```
+============================================================
+EVAL SUMMARY
+============================================================
+Total Examples: 15
+Passed: 15 ✅
+Failed: 0 ❌
+Duration: 0.23s
+============================================================
+BY SCENARIO
+============================================================
+Convention Recall: 5/5 ✅
+Architectural Decision: 5/5 ✅
+Tech Stack Recall: 5/5 ✅
+============================================================
+BEHAVIORAL SCORES
+============================================================
+Convention Recall:
+  With Memory:    1.0 (100%)
+  Baseline:       0.0 (0%)
+  Improvement:    +100%
+Architectural Decision:
+  With Memory:    1.0 (100%)
+  Baseline:       0.0 (0%)
+  Improvement:    +100%
+Tech Stack Recall:
+  With Memory:    1.0 (100%)
+  Baseline:       0.0 (0%)
+  Improvement:    +100%
+============================================================
+OVERALL: Memory improves responses by 100% on average
+============================================================
+```
+## Implementation Approach
+Following expert principles (Kent Beck, Gary Bernhardt, Sandi Metz), we took an incremental approach:
+### Week 1: Prove the Concept ✅
+**Goal**: Get ONE eval working end-to-end, no abstractions.
+**What we built**:
+- 3 eval scenarios with stubbed Claude responses
+- Fixture setup using `Dir.mktmpdir` for isolation
+- Memory population using existing `ClaudeMemory::Store` patterns
+- Behavioral scoring logic
+- Fast tests (<1s) by avoiding real API calls
+**Key decisions**:
+- ✅ Stub Claude responses instead of shelling out (fast, free, deterministic)
+- ✅ No premature abstractions (inline everything first)
+- ✅ Focus on evaluation logic, not infrastructure
+### Week 2: Extract Patterns (Future)
+**Triggers for extraction**:
+- Fixture setup becomes repetitive → Extract `FixtureBuilder`
+- Scoring logic duplicated → Extract `ScoreCalculator`
+- Need real Claude execution → Extract `ClaudeRunner` (slow tests, CI only)
+**NOT extracting yet** because we don't feel enough pain.
+### Week 3+: Advanced Features (Future)
+**Potential additions**:
+- Real Claude execution (tagged `:slow`, CI only)
+- Tool call tracking (did Claude invoke `memory.conventions`?)
+- Mode comparison (MCP vs context vs both)
+- Regression tracking (store results over time)
+- CI integration (block releases on eval failures)
+## Design Principles Applied
+### Kent Beck: Simple Design
+> "Make it work, make it right, make it fast"
+- Started with ONE passing eval
+- Added 2 more to feel pain points
+- No design up front—let it emerge from real needs
+### Gary Bernhardt: Fast Tests
+> "Tests should be fast enough for TDD workflow"
+- Stubbed Claude responses (no API calls)
+- Tests run in <1s (1003 tests in 47s total)
+- Will add slow integration tests later (CI only)
+### Sandi Metz: Single Responsibility
+> "Extract collaborators only when you feel pain"
+- Each eval is independent
+- No shared base class yet
+- Common patterns not extracted until needed
+### Jeremy Evans: Simplicity
+> "Start with 2 modes, not 4"
+- Testing baseline vs full memory (2 modes)
+- Defer MCP-only vs context-only comparison
+### Avdi Grimm: Explicit Code
+> "Make failures explicit"
+- Clear behavioral assertions
+- Quantified scores (not vague "better")
+- Specific test names
+## Files
+```
+spec/evals/
+├── README.md                          # Eval documentation
+├── convention_recall_spec.rb          # Eval 1: Coding conventions
+├── architectural_decision_spec.rb     # Eval 2: Architectural decisions
+└── tech_stack_recall_spec.rb          # Eval 3: Tech stack identification
+bin/
+└── run-evals                          # Summary report runner
+docs/
+└── evals.md                           # This file
+```
+## Future Work
+### Phase 1: Real Claude Execution (Optional)
+If we need to validate against actual Claude behavior:
+```ruby
+def run_claude_headless(prompt, working_dir)
+  cmd = ["claude", "-p", prompt, "--output-format", "json"]
+  output, status = Open3.capture2(*cmd, chdir: working_dir)
+  JSON.parse(output)
+end
+```
+**Trade-offs**:
+- ✅ Tests real Claude behavior
+- ❌ Slow (30s+ per test)
+- ❌ Costs money (API calls)
+- ❌ Non-deterministic
+**Recommendation**: Only add if stubbed tests miss real issues.
+### Phase 2: Tool Call Tracking
+Track whether Claude invokes memory tools:
+```ruby
+# Check transcript for tool calls
+tool_invoked = transcript[:tool_calls].any? { |t| t[:tool] == "memory.conventions" }
+# Tool selection score
+tool_selection_score = tool_invoked ? 1.0 : 0.0
+```
+**Use case**: Detect when Claude skips memory tools (like Vercel's 56% skip rate).
+### Phase 3: Mode Comparison
+Test 4 configurations:
+1. Baseline (no memory)
+2. MCP tools only
+3. Generated context only
+4. Both (current default)
+**Expected result**: Generated context should have highest pass rate (like Vercel's AGENTS.md).
+### Phase 4: Regression Tracking
+Store eval results over time:
+```ruby
+# Store results in SQLite
+@db[:eval_runs].insert(
+  timestamp: Time.now,
+  git_sha: `git rev-parse HEAD`.strip,
+  pass_rate: 1.0,
+  avg_score: 1.0
+)
+# Compare to previous runs
+previous_run = @db[:eval_runs].order(:timestamp).last
+regression = pass_rate < previous_run[:pass_rate]
+```
+**Use case**: Prevent regressions during development.
+### Phase 5: CI Integration
+Add to GitHub Actions:
+```yaml
+- name: Run ClaudeMemory Evals
+  run: ./bin/run-evals
+- name: Check for Regressions
+  run: |
+    if [ $? -ne 0 ]; then
+      echo "Evals failed! Blocking release."
+      exit 1
+    fi
+```
+**Use case**: Enforce quality before gem releases.
+## Success Metrics
+**Current (Week 1)**:
+- ✅ 15 tests passing (100% pass rate)
+- ✅ Behavioral scores: 1.0 with memory, 0.0 baseline
+- ✅ Fast tests (<1s)
+- ✅ Baseline comparison proven valuable
+**Future Goals**:
+- [ ] Tool invocation rate > 80% (better than Vercel's 44%)
+- [ ] Pass rate maintained across versions (no regressions)
+- [ ] Generated context achieves 100% pass rate (like Vercel's AGENTS.md)
+- [ ] Mode comparison validates dual-mode approach
+## References
+- **Vercel Blog**: [Building reliable agents: What we learned from evals](https://vercel.com/blog/building-reliable-agents-what-we-learned-from-evals)
+- **Implementation Plan**: Detailed plan document with expert reviews
+- **Testing Patterns**: `spec/claude_memory/mcp/tools_spec.rb`, `spec/claude_memory/recall_spec.rb`
+- **Expert Principles**: Kent Beck (Simple Design), Gary Bernhardt (Fast Tests), Sandi Metz (SRP)

data/docs/improvements.md CHANGED Viewed

@@ -23,7 +23,7 @@ The following improvements from the original analysis have been successfully imp
 7. **Enhanced Statistics** - Comprehensive stats command showing facts, entities, provenance, conflicts
 8. **Session Metadata Tracking** - Captures git_branch, cwd, claude_version, thinking_level from transcripts
 9. **Tool Usage Tracking** - Dedicated tool_calls table tracking tool names, inputs, timestamps
-10. **Semantic Search with TF-IDF** - Local embeddings (384-dimensional), hybrid vector + text search
+10. **Semantic Search with Local Embeddings** - FastEmbed (BAAI/bge-small-en-v1.5, 384-dim), hybrid vector + text search
 11. **Multi-Concept AND Search** - Query facts matching all of 2-5 concepts simultaneously
 12. **Incremental Sync** - mtime-based change detection to skip unchanged transcript files
 13. **Context-Aware Queries** - Filter facts by git branch, directory, or tools used
@@ -58,13 +58,11 @@ Source: docs/influence/grepai.md
   - Effort: 2-3 days (graph builder, MCP tool, tests)
   - Trade-off: Adds complexity for feature used mainly for debugging/exploration
-- [ ] **Hybrid Search (Vector + Text) with RRF**: Better relevance combining semantic and keyword matching
-  - Value: 50% improvement in search quality (proven by grepai's Reciprocal Rank Fusion)
-  - Evidence: search/search.go - RRF with K=60, combines cosine similarity with full-text search
-  - Implementation: Add `sqlite-vec` extension, add `embeddings` BLOB column to `facts`, implement RRF in `Recall#query`, make hybrid optional via config
-  - Effort: 5-7 days (embedder setup, schema migration, RRF implementation, testing)
-  - Trade-off: Requires API calls for embedding (~$0.00001/fact), slower queries (2x search + fusion)
-  - Recommendation: CONSIDER - High value but significant effort. Start with FTS5, add vectors later if quality issues arise
+- [x] **Hybrid Search (Vector + Text)**: Better relevance combining semantic and keyword matching
+  - Value: 173% improvement in Recall@5 over FTS-only (0.266 → 0.727 in benchmarks)
+  - Implementation: FastEmbed adapter (BAAI/bge-small-en-v1.5), embeddings stored in `embedding_json` column, `Recall#query_semantic(mode: :both)` merges vector + FTS results
+  - No API calls -- fastembed-rb runs ONNX model locally (~67MB, downloaded once)
+  - RRF-style fusion still a potential optimization (current: naive merge with deduplication)
 ---
@@ -138,9 +136,9 @@ This document analyzes two complementary memory systems:
 | Feature | Episodic-Memory | ClaudeMemory |
 |---------|----------------|--------------|
 | **Data Model** | Conversation exchanges (user-assistant pairs) | Facts (subject-predicate-object triples) |
-| **Search Method** | Vector embeddings + text search | FTS5 full-text search |
-| **Embeddings** | Local Transformers.js (Xenova/all-MiniLM-L6-v2) | None (FTS5 only) |
-| **Vector Storage** | sqlite-vec virtual table | N/A |
+| **Search Method** | Vector embeddings + text search | Hybrid vector + FTS5 search |
+| **Embeddings** | Local Transformers.js (Xenova/all-MiniLM-L6-v2) | Local FastEmbed (BAAI/bge-small-en-v1.5) |
+| **Vector Storage** | sqlite-vec virtual table | JSON column in facts table |
 | **Scope** | Single database with project field | Dual database (global + project) |
 | **Truth Maintenance** | None (keeps all conversations) | Supersession + conflict resolution |
 | **Summarization** | Claude API generates summaries | N/A |
@@ -223,11 +221,12 @@ This document analyzes two complementary memory systems:
 ### Design Patterns Worth Adopting
-1. **Local Vector Embeddings**
+1. **Local Vector Embeddings** ✅ IMPLEMENTED
    - **Value**: Semantic search finds conceptually similar content even with different terminology
-   - **Implementation**: Add `embeddings` column to facts table, use sqlite-vec extension
-   - **Ruby gems**: `onnxruntime` or shell out to Python/Node.js for embeddings
-   - **Trade-off**: Increased storage (384 floats per fact), embedding generation time
+   - **Implementation**: `FastembedAdapter` wrapping fastembed-rb (BAAI/bge-small-en-v1.5, ONNX runtime)
+   - Embeddings stored as JSON in `embedding_json` column on facts table
+   - Asymmetric query/passage encoding for better retrieval accuracy
+   - Benchmark: Recall@5=0.696 on semantic paraphrase queries (medium difficulty)
 2. **Multi-Concept AND Search**
    - **Value**: Precise queries like "find conversations about React AND authentication AND JWT"
@@ -770,7 +769,7 @@ npm install better-sqlite3  # Needs node-gyp + build tools
 - Embedding generation
 - Sync overhead
-**Alternative**: Stick with SQLite FTS5. Add embeddings only if users request semantic search.
+**Alternative**: We use fastembed-rb with a local ONNX model (BAAI/bge-small-en-v1.5) -- no Python, no server, no API calls.
 ### 2. Claude Agent SDK for Distillation
@@ -910,7 +909,7 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 - **Break Priority**: paragraph > sentence > line > word
 - **Implementation**: Modify ingestion to chunk long content_items before embedding
 - **Consideration**: Only if users report issues with long transcripts
-- **Recommendation**: **DEFER** - Not urgent, TF-IDF handles shorter content well
+- **Recommendation**: **DEFER** - Not urgent, FastEmbed handles shorter content well
 #### 6. **LLM Response Caching**
@@ -933,12 +932,12 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 ### Low Priority / Not Recommended
-#### 8. **Neural Embeddings (EmbeddingGemma)** (DEFER)
+#### 8. **Neural Embeddings (EmbeddingGemma)** (SUPERSEDED)
 - **QMD Model**: 300M params, 300MB download, 384 dimensions
 - **Value**: Better semantic search quality (+40% Hit@3 over TF-IDF)
 - **Cost**: 300MB download, 300MB VRAM, 2s cold start, complex dependency
-- **Decision**: **DEFER** - TF-IDF sufficient for now, revisit if users report poor quality
+- **Decision**: **SUPERSEDED** by FastEmbed integration (BAAI/bge-small-en-v1.5, 67MB, via fastembed-rb). Benchmark Recall@5=0.786 aggregate, no API key needed.
 #### 9. **Cross-Encoder Reranking** (REJECT)
@@ -1009,7 +1008,7 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 - [x] Enhanced statistics command
 - [x] Session metadata tracking
 - [x] Tool usage tracking
-- [x] Semantic search with TF-IDF embeddings
+- [x] Semantic search with local embeddings (FastEmbed bge-small-en-v1.5)
 - [x] Multi-concept AND search
 - [x] Incremental sync with mtime tracking
 - [x] Context-aware queries
@@ -1082,7 +1081,7 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 2. **Tool Usage Tracking** - Dedicated table tracking which tools discovered facts
 3. **Incremental Sync** - mtime-based change detection for fast re-ingestion
 4. **Session Metadata** - Context capture (git branch, cwd, Claude version)
-5. **Local Vector Embeddings** - TF-IDF semantic search alongside FTS5
+5. **Local Vector Embeddings** - FastEmbed (BAAI/bge-small-en-v1.5) semantic search alongside FTS5
 6. **Multi-Concept AND Search** - Precise queries matching 2-5 concepts simultaneously
 7. **Enhanced Statistics** - Comprehensive reporting on facts, entities, provenance
 8. **Context-Aware Queries** - Filter by branch, directory, or tools used
@@ -1094,7 +1093,7 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 3. **Truth maintenance** - Conflict resolution and supersession
 4. **Predicate policies** - Single vs multi-value semantics
 5. **Ruby ecosystem** - Simpler dependencies, easier install
-6. **Lightweight embeddings** - No external dependencies (TF-IDF vs Transformers.js)
+6. **Local embeddings** - ONNX model via fastembed-rb, no API key (vs Transformers.js)
 ### Remaining Opportunities
@@ -1128,7 +1127,7 @@ Analysis of **QMD (Quick Markdown Search)** reveals several high-value optimizat
 - Semantic shortcuts for common queries
 **Best of both worlds (achieved)**:
-- ✅ Added vector embeddings for semantic search (TF-IDF based)
+- ✅ Added vector embeddings for semantic search (FastEmbed BAAI/bge-small-en-v1.5, local ONNX)
 - ✅ Kept fact-based knowledge graph for structured queries
 - ✅ Adopted incremental sync and tool tracking from episodic-memory
 - ✅ Maintained truth maintenance and conflict resolution

data/docs/remaining_improvements.md CHANGED Viewed

@@ -2,7 +2,7 @@
 This document contains the improvements that have NOT yet been implemented from the episodic-memory and claude-mem analysis.
-**Note:** The "index" command to generate embeddings for existing facts has been completed (2026-01-23).
+**Note:** The "index" command to generate embeddings for existing facts has been completed (2026-01-23). FastEmbed integration (BAAI/bge-small-en-v1.5 via fastembed-rb) was added for high-quality local embeddings (2026-02-02), replacing TF-IDF as the primary embedding approach for benchmarks.
 ---
@@ -273,7 +273,7 @@ session_summaries: {
 - Embedding generation
 - Sync overhead
-**Alternative**: We've implemented lightweight TF-IDF embeddings without external dependencies.
+**Alternative**: We use [fastembed-rb](https://github.com/khasinski/fastembed-rb) with BAAI/bge-small-en-v1.5 for high-quality local embeddings (384-dim, no API key, ONNX runtime). Benchmark results: Recall@5=0.786 aggregate, 0.696 on semantic paraphrase queries.
 ### 2. Claude Agent SDK for Distillation

data/lefthook.yml CHANGED Viewed

@@ -7,7 +7,14 @@ pre-commit:
       run: bundle exec rake standard:fix
       stage_fixed: true
     tests:
-      run: bundle exec rspec
+      run: |
+        specs=$(.lefthook/map_specs.rb)
+        if [ -n "$specs" ]; then
+          echo "Running specs for changed files..."
+          bundle exec rspec $specs --format progress
+        else
+          echo "No changed lib/ files, skipping tests"
+        fi
     quality-review:
       run: |
         staged_ruby=$(git diff --cached --name-only --diff-filter=ACM | grep '\.rb$' || true)

data/lib/claude_memory/embeddings/fastembed_adapter.rb ADDED Viewed

@@ -0,0 +1,55 @@
+# frozen_string_literal: true
+module ClaudeMemory
+  module Embeddings
+    # Adapter wrapping fastembed-rb for high-quality local embeddings
+    # Uses BAAI/bge-small-en-v1.5 by default (384-dim, ~67MB ONNX model)
+    #
+    # Implements the same generate(text) interface as Generator for DI compatibility.
+    # Supports asymmetric query/passage encoding for better retrieval accuracy.
+    #
+    # Usage:
+    #   adapter = FastembedAdapter.new
+    #   query_vec = adapter.generate("What database?")         # query encoding
+    #   passage_vec = adapter.generate_passage("Uses PostgreSQL") # passage encoding
+    #
+    class FastembedAdapter
+      EMBEDDING_DIM = 384
+      DEFAULT_MODEL = "BAAI/bge-small-en-v1.5"
+      def initialize(model_name: DEFAULT_MODEL)
+        require "fastembed"
+        @model = Fastembed::TextEmbedding.new(model_name: model_name)
+      rescue LoadError
+        raise LoadError,
+          "fastembed gem is required for FastembedAdapter. Add `gem 'fastembed'` to your Gemfile."
+      end
+      # Generate query embedding (optimized for search queries)
+      # Compatible with Recall's embedding_generator interface
+      # @param text [String] query text to embed
+      # @return [Array<Float>] normalized 384-dimensional vector
+      def generate(text)
+        return zero_vector if text.nil? || text.empty?
+        @model.query_embed(text).first.to_a
+      end
+      # Generate passage embedding (optimized for document/fact indexing)
+      # Use this when storing embeddings for facts
+      # @param text [String] passage text to embed
+      # @return [Array<Float>] normalized 384-dimensional vector
+      def generate_passage(text)
+        return zero_vector if text.nil? || text.empty?
+        @model.passage_embed(text).first.to_a
+      end
+      private
+      def zero_vector
+        Array.new(EMBEDDING_DIM, 0.0)
+      end
+    end
+  end
+end

data/lib/claude_memory/ingest/ingester.rb CHANGED Viewed

@@ -88,6 +88,7 @@ module ClaudeMemory
       # Retry database operations with exponential backoff + jitter
       # This handles concurrent access when MCP server and hooks both write simultaneously
       # With busy_timeout=30000ms, each attempt waits up to 30s before raising BusyError
+      # Handles both "busy" and "locked" error messages from SQLite/Extralite
       # Total potential wait time: 30s * 10 attempts + backoff delays = ~5 minutes max
       def with_retry(max_attempts: 10, base_delay: 0.2, max_delay: 5.0)
         attempt = 0
@@ -95,8 +96,10 @@ module ClaudeMemory
           attempt += 1
           yield
         rescue Extralite::BusyError, Sequel::DatabaseError => e
-          # Handle busy errors from extralite adapter
-          is_busy = e.is_a?(Extralite::BusyError) || e.message.include?("busy")
+          # Handle busy/locked errors from extralite adapter
+          is_busy = e.is_a?(Extralite::BusyError) ||
+            e.message.include?("busy") ||
+            e.message.include?("locked")
           if is_busy && attempt < max_attempts
             # Exponential backoff with jitter to avoid thundering herd
             exponential_delay = [base_delay * (2**(attempt - 1)), max_delay].min
@@ -105,9 +108,10 @@ module ClaudeMemory
             sleep(total_delay)
             retry
           elsif is_busy
+            # Max attempts reached, give up
             raise
           else
-            # Not a busy error, re-raise immediately
+            # Not a busy/locked error, re-raise immediately
             raise
           end
         end

data/lib/claude_memory/mcp/tool_definitions.rb CHANGED Viewed

@@ -11,7 +11,7 @@ module ClaudeMemory
         [
           {
             name: "memory.recall",
-            description: "IMPORTANT: Check memory FIRST before reading files or exploring code. Recalls facts matching a query from distilled knowledge in both global and project databases. Use this to find existing knowledge about modules, patterns, decisions, and conventions before resorting to file reads or code searches.",
+            description: "Search facts matching a query from both global and project memory databases.",
             inputSchema: {
               type: "object",
               properties: {
@@ -24,7 +24,7 @@ module ClaudeMemory
           },
           {
             name: "memory.recall_index",
-            description: "Layer 1: CHECK MEMORY FIRST with this lightweight search. Returns fact previews, IDs, and token costs without full details. Use before exploring code to see what knowledge already exists. Follow up with memory.recall_details for specific facts.",
+            description: "Lightweight search returning fact previews, IDs, and token costs. Follow up with memory.recall_details for full information.",
             inputSchema: {
               type: "object",
               properties: {
@@ -37,7 +37,7 @@ module ClaudeMemory
           },
           {
             name: "memory.recall_details",
-            description: "Layer 2: Fetch full details for specific fact IDs from the index. Use after memory.recall_index to get complete information.",
+            description: "Fetch full details for specific fact IDs. Use after memory.recall_index.",
             inputSchema: {
               type: "object",
               properties: {
@@ -177,7 +177,7 @@ module ClaudeMemory
           },
           {
             name: "memory.decisions",
-            description: "Quick access to architectural decisions, constraints, and rules. Use BEFORE implementing features to understand existing decisions and constraints.",
+            description: "List architectural decisions, constraints, and rules.",
             inputSchema: {
               type: "object",
               properties: {
@@ -187,7 +187,7 @@ module ClaudeMemory
           },
           {
             name: "memory.conventions",
-            description: "Quick access to coding conventions and style preferences (global scope). Check BEFORE writing code to follow established patterns.",
+            description: "List coding conventions and style preferences from global memory.",
             inputSchema: {
               type: "object",
               properties: {
@@ -197,7 +197,7 @@ module ClaudeMemory
           },
           {
             name: "memory.architecture",
-            description: "Quick access to framework choices and architectural patterns. Check FIRST when working with frameworks or making architectural decisions.",
+            description: "List framework choices and architectural patterns.",
             inputSchema: {
               type: "object",
               properties: {
@@ -266,7 +266,7 @@ module ClaudeMemory
           },
           {
             name: "memory.check_setup",
-            description: "Check if ClaudeMemory is properly initialized. CALL THIS FIRST if memory tools fail or on first use. Returns initialization status, version info, and actionable recommendations.",
+            description: "Check ClaudeMemory initialization status. Returns version info, issues found, and recommendations.",
             inputSchema: {
               type: "object",
               properties: {}

data/lib/claude_memory/version.rb CHANGED Viewed

@@ -1,5 +1,5 @@
 # frozen_string_literal: true
 module ClaudeMemory
-  VERSION = "0.3.0"
+  VERSION = "0.4.0"
 end