RubyGems - claude_memory - Versions diffs - 0.7.1 → 0.8.0 - Mend

claude_memory 0.7.1 → 0.8.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (72) hide show

checksums.yaml +4 -4
data/.claude/memory.sqlite3 +0 -0
data/.claude/memory.sqlite3-shm +0 -0
data/.claude/memory.sqlite3-wal +0 -0
data/.claude/settings.json +78 -6
data/.claude/settings.local.json +2 -1
data/.claude/skills/improve/SKILL.md +113 -25
data/.claude-plugin/commands/distill-transcripts.md +98 -0
data/.claude-plugin/commands/memory-recall.md +67 -0
data/.claude-plugin/marketplace.json +1 -1
data/.claude-plugin/plugin.json +1 -1
data/CHANGELOG.md +49 -1
data/CLAUDE.md +29 -5
data/docs/improvements.md +18 -56
data/docs/quality_review.md +119 -224
data/hooks/hooks.json +39 -7
data/lib/claude_memory/commands/checks/distill_check.rb +61 -0
data/lib/claude_memory/commands/checks/hooks_check.rb +2 -2
data/lib/claude_memory/commands/checks/vec_check.rb +2 -1
data/lib/claude_memory/commands/completion_command.rb +179 -0
data/lib/claude_memory/commands/doctor_command.rb +2 -0
data/lib/claude_memory/commands/help_command.rb +4 -0
data/lib/claude_memory/commands/hook_command.rb +2 -1
data/lib/claude_memory/commands/index_command.rb +85 -78
data/lib/claude_memory/commands/initializers/database_ensurer.rb +16 -0
data/lib/claude_memory/commands/initializers/global_initializer.rb +2 -1
data/lib/claude_memory/commands/initializers/hooks_configurator.rb +55 -11
data/lib/claude_memory/commands/initializers/project_initializer.rb +2 -1
data/lib/claude_memory/commands/install_skill_command.rb +78 -0
data/lib/claude_memory/commands/registry.rb +3 -1
data/lib/claude_memory/commands/skills/distill-transcripts.md +98 -0
data/lib/claude_memory/commands/skills/memory-recall.md +67 -0
data/lib/claude_memory/core/fact_ranker.rb +2 -2
data/lib/claude_memory/core/rr_fusion.rb +23 -6
data/lib/claude_memory/core/snippet_extractor.rb +7 -3
data/lib/claude_memory/core/text_builder.rb +11 -0
data/lib/claude_memory/domain/provenance.rb +0 -1
data/lib/claude_memory/embeddings/api_adapter.rb +96 -0
data/lib/claude_memory/embeddings/dimension_check.rb +23 -0
data/lib/claude_memory/embeddings/fastembed_adapter.rb +4 -0
data/lib/claude_memory/embeddings/generator.rb +4 -0
data/lib/claude_memory/embeddings/resolver.rb +18 -0
data/lib/claude_memory/hook/context_injector.rb +58 -2
data/lib/claude_memory/hook/distillation_runner.rb +46 -0
data/lib/claude_memory/hook/handler.rb +11 -2
data/lib/claude_memory/index/vector_index.rb +15 -2
data/lib/claude_memory/infrastructure/schema_validator.rb +3 -3
data/lib/claude_memory/mcp/handlers/context_handlers.rb +38 -0
data/lib/claude_memory/mcp/handlers/management_handlers.rb +145 -0
data/lib/claude_memory/mcp/handlers/query_handlers.rb +115 -0
data/lib/claude_memory/mcp/handlers/setup_handlers.rb +211 -0
data/lib/claude_memory/mcp/handlers/shortcut_handlers.rb +37 -0
data/lib/claude_memory/mcp/handlers/stats_handlers.rb +202 -0
data/lib/claude_memory/mcp/instructions_builder.rb +2 -1
data/lib/claude_memory/mcp/query_guide.rb +10 -0
data/lib/claude_memory/mcp/response_formatter.rb +1 -0
data/lib/claude_memory/mcp/text_summary.rb +26 -0
data/lib/claude_memory/mcp/tool_definitions.rb +30 -1
data/lib/claude_memory/mcp/tool_helpers.rb +43 -0
data/lib/claude_memory/mcp/tools.rb +39 -678
data/lib/claude_memory/recall/dual_engine.rb +105 -0
data/lib/claude_memory/recall/legacy_engine.rb +138 -0
data/lib/claude_memory/recall/query_core.rb +371 -0
data/lib/claude_memory/recall.rb +29 -662
data/lib/claude_memory/shortcuts.rb +4 -4
data/lib/claude_memory/store/retry_handler.rb +61 -0
data/lib/claude_memory/store/schema_manager.rb +68 -0
data/lib/claude_memory/store/sqlite_store.rb +85 -201
data/lib/claude_memory/templates/hooks.example.json +26 -7
data/lib/claude_memory/version.rb +1 -1
data/lib/claude_memory.rb +11 -0
metadata +23 -1

checksums.yaml CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 00a7e67707543e2a5266200eab3a276a5b70d25a41ebbc8a5eab8767c44952a8
-  data.tar.gz: 261f86e807d5638739a7462872ae9f5472451afd7aea5496c1cf785b54a76dc6
+  metadata.gz: 1483a3663c9c589abad58f80c71d772c592f13727d3fd1339418c2ba1a3db875
+  data.tar.gz: 34beebdf5e3cc8a70d383757ffaf7e21fde0724f963532e2d2548ec2e63ba329
 SHA512:
-  metadata.gz: 79ffe3d19420ae16dd26e2083a45c3725be43fd955bf707e3a1e157e04d790821cd5d199b8c67ea713c354c82314a931af1883d260a430119f2eb72213d8cc16
-  data.tar.gz: 864fffbf37c36d63a3a72e19f56ce7f288c26b7d6b546e55338ea0fb61fc963fb99704be28124e33fbb49d5098c3997409ecdac13735fd9cd52253b545346f4b
+  metadata.gz: 514022df68751e4421942d914c003a1e104a5c2b5ab4a639619ca05bb4d6bafd722d87009dc898a5b578f1b7efdae50df68b5b8232e1cf42a8bb5e88a06e66c3
+  data.tar.gz: 68c5a36361d3048e4c92bba7127e7fa0bc9ff09f6de3779ae59f7799cd4152d63030386ea2b124a90b345063545028b0da0401c9144f284c7e34290b2d674ba7

data/.claude/memory.sqlite3 CHANGED Viewed

Binary file

data/.claude/memory.sqlite3-shm CHANGED Viewed

Binary file

data/.claude/memory.sqlite3-wal CHANGED Viewed

Binary file

data/.claude/settings.json CHANGED Viewed

@@ -6,7 +6,16 @@
           {
             "type": "command",
             "command": "claude-memory hook ingest",
-            "timeout": 10
+            "timeout": 5
+          }
+        ]
+      },
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 5
           }
         ]
       }
@@ -14,11 +23,6 @@
     "SessionStart": [
       {
         "hooks": [
-          {
-            "type": "command",
-            "command": "claude-memory hook ingest",
-            "timeout": 10
-          },
           {
             "type": "command",
             "command": "claude-memory hook context",
@@ -41,6 +45,20 @@
             "timeout": 30
           }
         ]
+      },
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 30
+          },
+          {
+            "type": "command",
+            "command": "claude-memory hook sweep --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 30
+          }
+        ]
       }
     ],
     "SessionEnd": [
@@ -57,6 +75,60 @@
             "timeout": 30
           }
         ]
+      },
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 30
+          },
+          {
+            "type": "command",
+            "command": "claude-memory hook sweep --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 30
+          }
+        ]
+      }
+    ],
+    "TaskCompleted": [
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest",
+            "timeout": 10
+          }
+        ]
+      },
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 10
+          }
+        ]
+      }
+    ],
+    "TeammateIdle": [
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest",
+            "timeout": 15
+          }
+        ]
+      },
+      {
+        "hooks": [
+          {
+            "type": "command",
+            "command": "claude-memory hook ingest --db /Users/valentinostoll/src/claude_memory/.claude/memory.sqlite3",
+            "timeout": 15
+          }
+        ]
       }
     ]
   }

data/.claude/settings.local.json CHANGED Viewed

@@ -51,5 +51,6 @@
       "Skill(improve:*)"
     ]
   },
-  "enableAllProjectMcpServers": true
+  "enableAllProjectMcpServers": true,
+  "outputStyle": "memory-aware"
 }

data/.claude/skills/improve/SKILL.md CHANGED Viewed

@@ -3,23 +3,88 @@ name: improve
 description: Incrementally implement feature improvements from docs/improvements.md with tests and atomic commits. Focuses on new functionality rather than refactoring.
 agent: general-purpose
 allowed-tools: Read, Grep, Edit, Write, Bash
+arguments:
+  - name: mode
+    description: "Execution mode: 'sub-agent' (default, sequential) or 'agent-team' (parallel via agent teams)"
+    required: false
+    default: "sub-agent"
 ---
 # Feature Improvements - Incremental Implementation
 Systematically implement feature improvements from `docs/improvements.md`, making tested, atomic commits for each feature addition.
+## Execution Modes
+This skill supports two modes, passed as the first argument:
+- **`sub-agent`** (default): A single agent works through improvements sequentially. Best for small batches (1-3 features) or features with dependencies.
+- **`agent-team`**: Spawns a coordinated team of agents that implement independent features in parallel. Best for larger batches (3+) of independent features.
+---
+## Mode: agent-team
+When invoked with `agent-team`, follow this process:
+### Step 1: Read and Assess Improvements
+Read `docs/improvements.md` and identify all implementable features using the same feasibility criteria as sub-agent mode (skip Categories D-F, "Features to Avoid", "If Requested" items).
+### Step 2: Group Independent Features
+Partition implementable features into independent groups:
+- Features touching **different files** can be parallelized
+- Features sharing files or with dependencies must be sequential
+- Aim for 3-5 teammates maximum
+### Step 3: Create the Agent Team
+Create an agent team. For each teammate:
+1. **Assign one or two related features** per teammate
+2. **Provide full context** — teammates don't share your conversation history
+3. **Include these instructions for each teammate**:
+   - Read relevant existing code before making changes
+   - Follow the project's code style (Standard Ruby, frozen_string_literal)
+   - Write tests for all new functionality
+   - Run `bundle exec rake standard:fix` before committing
+   - Run relevant spec file after each edit, full suite before committing
+   - Run `bundle exec rspec` to verify all tests pass
+   - Make atomic commits with `[Feature]` prefix format
+   - Update `docs/improvements.md` to mark features as implemented
+   - Reference `.claude/skills/improve/feature-patterns.md` for implementation recipes
+### Step 4: Monitor and Coordinate
+- Wait for all teammates to complete their tasks
+- If a teammate reports a blocker or conflict, help resolve it
+- Do NOT implement tasks yourself — let teammates do the work
+### Step 5: Validate and Report
+After all teammates finish:
+1. Run the full test suite: `bundle exec rspec`
+2. Run the linter: `bundle exec rake standard:fix`
+3. If any failures, fix them or coordinate with the relevant teammate
+4. Provide a consolidated progress report (same Final Report format as sub-agent mode)
+---
+## Mode: sub-agent (default)
 ## Process Overview
 1. **Check memory health** by calling `memory.check_setup` to verify the system is operational
 2. **Read the improvements document** from `docs/improvements.md`
-2. **Identify unimplemented features** from "Remaining Tasks" section
-3. **Prioritize by stated priority** (Medium → Low)
-4. **Assess feasibility** (skip if too complex or requires external services)
-5. **Implement features incrementally** (one logical feature at a time)
-6. **Run tests after each change** to ensure nothing breaks
-7. **Make atomic commits** that capture the feature and its purpose
-8. **Update improvements.md** to mark features as implemented
+3. **Identify unimplemented features** from "Remaining Tasks" section
+4. **Prioritize by stated priority** (Medium → Low)
+5. **Assess feasibility** (skip if too complex or requires external services)
+6. **Implement features incrementally** (one logical feature at a time)
+7. **Run tests after each change** to ensure nothing breaks
+8. **Make atomic commits** that capture the feature and its purpose
+9. **Update improvements.md** to mark features as implemented
 ## Detailed Steps
@@ -100,11 +165,15 @@ For each feature:
    ```bash
    bundle exec rake standard:fix
    ```
-5. **Run tests**:
+5. **Run targeted tests** after each edit:
+   ```bash
+   bundle exec rspec spec/claude_memory/<relevant_spec>.rb
+   ```
+6. **Run full suite** before committing:
    ```bash
    bundle exec rspec
    ```
-6. **Fix any test failures** before proceeding
+7. **Fix any test failures** before proceeding
 ### Step 5: Make Atomic Commit
@@ -300,9 +369,15 @@ Is it marked "Features to Avoid"?
                 ↓
                 Category D (Background)?
                 ├─ YES → Assess carefully, may skip
-                └─ NO → Implement (Categories A-C safe)
+                └─ NO → Continue
                     ↓
-                    Implement the feature
+                    Does it have dependencies on other features?
+                    ├─ YES → Are dependencies complete?
+                    │   ├─ NO → SKIP, note dependency
+                    │   └─ YES → Continue
+                    └─ NO → Implement (Categories A-C safe)
+                        ↓
+                        Implement the feature
                     ↓
                     Run tests
                     ↓
@@ -324,11 +399,15 @@ Is it marked "Features to Avoid"?
 ## Time Budgets
 **Per Feature:**
-- Category A (Schema): Max 20 minutes
-- Category B (Reporting): Max 30 minutes
-- Category C (CLI): Max 30 minutes
-- Category D (Background): Max 60 minutes (or skip)
-- Category E (External): Max 45 minutes (or skip)
+- Category A (Schema): Max 15 minutes — skip if stuck after 15
+- Category B (Reporting): Max 20 minutes — skip if stuck after 20
+- Category C (CLI): Max 30 minutes — skip if stuck after 30
+- Category D (Background): Max 45 minutes (or skip at first sign of daemon complexity)
+- Category E (External): Max 30 minutes (or skip at first sign of dependency issues)
+**Per Debug Cycle:**
+- Test failure fix: Max 15 minutes — if you can't fix it in 15 minutes, revert and skip
+- Understanding code: Max 10 minutes — if unclear after 10 minutes, skip and report
 **Session Total:** Max 2 hours
@@ -337,26 +416,35 @@ If time budget exceeded: SKIP remaining features and report.
 ## Testing Strategy
 ### Test Frequency
-- After schema changes: Run all specs
-- After new command: Run command specs + integration
-- After reporting changes: Run relevant specs
-- Before commit: Full test suite
+- After each file edit: Run the relevant spec file
+- After schema changes: Run `spec/claude_memory/store/`
+- After new command: Run `spec/claude_memory/commands/`
+- Before each commit: Full test suite
+- If >5 files changed: Full test suite immediately
 ### Test Commands
 ```bash
-# Specific command tests
-bundle exec rspec spec/claude_memory/commands/
+# Single relevant spec (fastest feedback)
+bundle exec rspec spec/claude_memory/commands/metrics_command_spec.rb
-# Schema tests
+# Module-level specs
+bundle exec rspec spec/claude_memory/commands/
 bundle exec rspec spec/claude_memory/store/
-# Full suite
+# Full suite (before commit)
 bundle exec rspec
-# With linting
+# With linting (final check)
 bundle exec rake
 ```
+### Test Failure Response
+1. Read error message carefully
+2. Check if your change caused it (vs pre-existing)
+3. If your change: fix within 15 minutes or revert and skip
+4. If pre-existing: note and continue
+5. If unsure: revert change and skip item
 ### New Feature Tests
 Always add tests for new features:

data/.claude-plugin/commands/distill-transcripts.md ADDED Viewed

@@ -0,0 +1,98 @@
+# Distill Transcripts
+Extract structured knowledge (facts, entities, decisions) from undistilled transcript content and persist it to long-term memory.
+## Usage
+```
+/distill-transcripts
+/distill-transcripts --limit 10
+```
+## Instructions
+You are a knowledge extraction specialist. Your job is to read raw transcript content and extract structured facts, entities, and decisions, then persist them via the memory.store_extraction MCP tool.
+### Step 1: Get Undistilled Content
+Call `memory.undistilled` with `limit: 10` to get transcript content that hasn't been processed yet.
+If no items are returned, report "No undistilled content found" and stop.
+### Step 2: Extract Knowledge (per item)
+For each content item, carefully read the raw_text and extract:
+**Entities** — Named things mentioned:
+- type: database, framework, language, platform, repo, module, person, service
+- name: Canonical name (e.g., "PostgreSQL" not "postgres")
+- confidence: 0.0-1.0
+**Facts** — Knowledge learned:
+- subject: Entity name or "repo" for project-level facts
+- predicate: uses_database, uses_framework, convention, decision, auth_method, deployment_platform, depends_on, testing_strategy
+- object: The value
+- confidence: 0.0-1.0
+- quote: Source excerpt (max 200 chars)
+- strength: "stated" (explicitly said) or "inferred" (implied)
+- scope_hint: "project" (this project only) or "global" (all projects)
+**Decisions** — Choices made:
+- title: Short summary (max 100 chars)
+- summary: Full description
+- status_hint: "accepted", "proposed", or "rejected"
+### What to Extract
+- Technology choices ("we use PostgreSQL", "switched to React")
+- Conventions ("always use frozen_string_literal", "test files go in spec/")
+- Architectural decisions ("API uses REST", "auth via JWT")
+- Preferences ("prefer 4-space indent", "use Standard Ruby")
+- Project structure ("migrations in db/migrations/", "commands in commands/")
+### What to Skip
+- Debugging steps and transient errors
+- Code output and tool observations
+- File contents that were just being read
+- Ephemeral task details ("fix this test", "run the linter")
+- Information already obvious from the codebase itself
+### Scope Detection
+Set scope_hint to "global" when the text contains signals like:
+- "I always...", "in all my projects...", "my preference is..."
+- "everywhere", "across all repos"
+Default to "project" for everything else.
+### Step 3: Persist Each Extraction
+For each content item with extracted knowledge:
+1. Call `memory.store_extraction` with the entities, facts, and decisions arrays
+2. Call `memory.mark_distilled` with the content_item_id and facts_extracted count
+3. If nothing was extracted, still call `memory.mark_distilled` with facts_extracted: 0
+### Step 4: Report
+Return a summary:
+```
+## Distillation Complete
+- Items processed: N
+- Facts extracted: N
+- Entities found: N
+- Decisions captured: N
+- Items skipped (nothing to extract): N
+```
+### Guidelines
+- Process items one at a time to keep extractions focused
+- Use `compact: true` on `memory.undistilled` for smaller responses
+- Be conservative — only extract facts you're confident about (>0.7)
+- Prefer "stated" strength over "inferred" unless clearly implied
+- Do NOT fabricate facts — only extract what's actually in the text
+- If text is mostly code/tool output with no conversational knowledge, mark as distilled with 0 facts

data/.claude-plugin/commands/memory-recall.md ADDED Viewed

@@ -0,0 +1,67 @@
+# Memory Recall Agent
+Search long-term memory for facts, decisions, conventions, and architectural knowledge. Chains multiple memory tools to build comprehensive answers while saving main-agent context.
+## Usage
+Provide a natural language query describing what you want to recall:
+```
+/memory-recall database migration strategy
+/memory-recall authentication decisions
+/memory-recall testing conventions
+```
+## Workflow
+1. **Fast lookup** — Start with `memory.recall` for keyword matches
+2. **Semantic search** — If recall returns few results, try `memory.recall_semantic` for conceptual matches
+3. **Shortcuts** — For known categories, use `memory.decisions`, `memory.conventions`, or `memory.architecture`
+4. **Deep dive** — For specific facts, use `memory.explain` to get provenance and `memory.fact_graph` to see relationships
+5. **Synthesize** — Combine findings into a concise, structured answer
+## Instructions
+You are a memory recall specialist. Given a query, search ClaudeMemory using the available MCP tools and return a synthesized answer.
+### Step 1: Initial Search
+Run `memory.recall` with the user's query. If the query mentions decisions, conventions, or architecture, also run the appropriate shortcut tool in parallel.
+### Step 2: Expand if Needed
+If Step 1 returns fewer than 3 results:
+- Try `memory.recall_semantic` with a rephrased version of the query
+- Try `memory.search_concepts` with 2-3 key concepts extracted from the query
+### Step 3: Enrich Key Facts
+For the top 2-3 most relevant facts:
+- Run `memory.explain` to get provenance (where the fact came from)
+- If relationships matter, run `memory.fact_graph` to see connected facts
+### Step 4: Synthesize
+Return a structured response:
+```
+## Memory Recall Results
+### Key Facts
+- [Fact 1 with provenance]
+- [Fact 2 with provenance]
+### Context
+[How these facts relate to the query]
+### Confidence
+[High/Medium/Low based on number and freshness of supporting facts]
+```
+### Guidelines
+- Prefer `memory.recall` (fast, token-efficient) before escalating to semantic search
+- Use `compact: true` on all tool calls to minimize token usage
+- Do NOT fabricate facts — only report what memory tools return
+- If no relevant facts found, say so clearly rather than guessing
+- Include fact IDs so the main agent can reference them

data/.claude-plugin/marketplace.json CHANGED Viewed

@@ -7,7 +7,7 @@
   "plugins": [
     {
       "name": "claude-memory",
-      "version": "0.7.1",
+      "version": "0.8.0",
       "source": "./",
       "description": "Long-term self-managed memory for Claude Code with fact extraction, truth maintenance, and provenance tracking",
       "repository": "https://github.com/codenamev/claude_memory"

data/.claude-plugin/plugin.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "claude-memory",
-  "version": "0.7.1",
+  "version": "0.8.0",
   "description": "Long-term self-managed memory for Claude Code with fact extraction, truth maintenance, and provenance tracking",
   "author": {
     "name": "Valentino Stoll",

data/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,54 @@ All notable changes to this project will be documented in this file.
 ## [Unreleased]
+## [0.8.0] - 2026-03-30
+### Added
+**Three-Layer Distillation Pipeline**
+- Automatic distillation via NullDistiller in ingest pipeline (Layer 1: regex-based, P95 < 5ms)
+- Context hook injection for LLM-based extraction at SessionStart (Layer 2: Claude Code as distiller, zero extra cost)
+- `/distill-transcripts` skill for manual deep extraction (Layer 3: on-demand, depth-aware prompts)
+- `memory.undistilled` and `memory.mark_distilled` MCP tools for distillation tracking
+- `Hook::DistillationRunner` extracted from Handler for context hook injection
+- `TaskCompleted` and `TeammateIdle` hook events for ingest triggers
+- Distillation metrics backfill on database initialization
+- Doctor check for undistilled content
+- Pending distillation count in `memory.status` output
+**Recall Enhancements**
+- Intent parameter for recall query disambiguation (#3)
+- Retrieval score traces for semantic search (#5)
+- Configurable embedding providers with dimension checking
+**Hook Enhancements**
+- `statusMessage` on all hooks for descriptive spinner text during hook execution
+- `StopFailure` hook to capture transcript data even on session errors (rate limits, server errors)
+- `Notification` hook with `idle_prompt` matcher for opportunistic sweep during idle
+**New Commands & Skills**
+- `install-skill` command and `memory-recall` agent (#8, #12)
+- Shell completion command for bash and zsh (#18)
+**Distillation Benchmark Results**
+- NullDistiller: Concept Recall 0.952, Fact Precision/Recall 1.000 (31 test cases)
+- Claude Code LLM: Concept Recall 0.902 (all 41 cases), 0.900 on semantic cases (vs 0.333 for regex)
+- Average 1.6 facts stored per case across LLM extraction
+- E2E distillation recall benchmark and extraction quality benchmarks
+- Concept-based matching for distiller-agnostic benchmark comparison
+### Fixed
+- `--allowedTools` added to `ClaudeCliRunner` for MCP tool permissions
+- Test isolation for context hook when global database has facts
+### Internal
+- Extracted `RetryHandler` and `SchemaManager` modules from `SQLiteStore`
+- Extracted `Recall` into engine strategy pattern with `DualEngine`, `LegacyEngine`, and shared `QueryCore`
+- Extracted `Tools` god object into 6 handler modules
+- Added 36 specs for 5 previously untested files
+- All 3 god objects eliminated, 0 files over 500 lines
 ## [0.7.1] - 2026-03-17
 ### Added
@@ -45,7 +93,7 @@ All notable changes to this project will be documented in this file.
 - Opt-out: set `CLAUDE_MEMORY_ISOLATE_WORKTREES=1` for per-worktree isolation
 **MCP Enhancements**
-- Tool annotations: `readOnlyHint`, `idempotentHint`, `destructiveHint` on all 21 tools
+- Tool annotations: `readOnlyHint`, `idempotentHint`, `destructiveHint` on all 23 tools
 - Stdout protection: MCP server redirects `$stdout` to `$stderr` to prevent protocol corruption from accidental `puts`/`print` calls
 - Self-excluding agent conversations via `SELF_CONTEXT_MARKER` to prevent meta-pollution

data/CLAUDE.md CHANGED Viewed

@@ -99,6 +99,18 @@ bin/run-evals --comparative        # Run benchmarks with available tools
 bin/run-evals --comparative --setup-competitors  # Install + run in one step
 ```
+### Distillation Extraction Accuracy
+NullDistiller (regex, Layer 1):
+  - Concept Recall: 0.952 (regex-detectable entities/facts)
+  - Fact Precision: 1.000, Fact Recall: 1.000 (on 31 test cases)
+  - Pipeline latency: P95 < 5ms (medium text)
+Claude Code (LLM, Layers 2+3):
+  - Concept Recall: 0.902 (all 41 cases)
+  - Concept Recall on semantic cases: 0.900 (vs NullDistiller's 0.333)
+  - Avg facts stored per case: 1.6
 ## Architecture
 ### Dual-Database System
@@ -123,6 +135,16 @@ Transcripts → Ingest → Index (FTS5)
              Publish → .claude/rules/claude_memory.generated.md
 ```
+### Three-Layer Distillation
+The distillation pipeline operates at three levels of depth:
+- **Layer 1: NullDistiller** (automatic, regex, free) — Runs in the ingest pipeline on every hook event. Extracts entities, facts, and scope hints using pattern matching. P95 latency < 5ms.
+- **Layer 2: Context Hook Injection** (automatic, LLM, zero extra cost) — At SessionStart, undistilled content is injected into the session via `hookSpecificOutput.additionalContext` with extraction instructions. Claude Code itself acts as the distiller, extracting structured facts at no additional API cost.
+- **Layer 3: `/distill-transcripts` Skill** (manual, on-demand) — Deep extraction triggered by the user. Processes undistilled content with depth-aware prompts (initial extraction, consolidation, contradiction resolution).
+New MCP tools `memory.undistilled` and `memory.mark_distilled` support the pipeline by tracking which content items have been deeply distilled.
 ### Module Structure
 #### Application Layer
@@ -135,7 +157,7 @@ Transcripts → Ingest → Index (FTS5)
   - Each command is a separate class (HelpCommand, DoctorCommand, etc.)
   - All commands inherit from BaseCommand
   - Dependency injection for I/O (stdout, stderr, stdin)
-  - 22 commands total, each focused on single responsibility
+  - 23 commands total, each focused on single responsibility
 - **`Configuration`**: Centralized ENV access (`configuration.rb`)
   - Single source of truth for paths and environment variables
@@ -198,12 +220,13 @@ Transcripts → Ingest → Index (FTS5)
   - Modes: shared (repo), local (uncommitted), home (user directory)
 - **`MCP`**: Model Context Protocol server and tools (`mcp/`)
-  - Exposes memory tools to Claude Code (21 tools total)
+  - Exposes memory tools to Claude Code (23 tools total)
   - Dual content/structuredContent responses with compact mode
 - **`Hook`**: Hook entrypoint handlers (`hook/`)
   - Reads stdin JSON from Claude Code hooks
   - Routes to ingest/sweep/publish commands
+  - `DistillationRunner`: Manages context hook injection with undistilled content for LLM extraction
 ### Database Schema
@@ -288,7 +311,7 @@ Single-value predicates (like "uses_database") supersede old values. Multi-value
 - `lib/claude_memory.rb`: Main module, requires, database path helpers
 - `lib/claude_memory/cli.rb`: Thin command router (41 lines)
-- `lib/claude_memory/commands/`: Individual command classes (22 commands)
+- `lib/claude_memory/commands/`: Individual command classes (23 commands)
 - `lib/claude_memory/configuration.rb`: Centralized configuration and ENV access
 - `lib/claude_memory/domain/`: Domain models (Fact, Entity, Provenance, Conflict)
 - `lib/claude_memory/core/`: Value objects and null objects
@@ -303,12 +326,13 @@ Single-value predicates (like "uses_database") supersede old values. Multi-value
 The gem includes an MCP server (`claude-memory serve-mcp`) that exposes memory operations as tools. Configuration should be in `.mcp.json` at project root.
-Available MCP tools (21 total):
+Available MCP tools (23 total):
 - **Query & Recall**: `memory.recall`, `memory.recall_index`, `memory.recall_details`, `memory.recall_semantic`, `memory.search_concepts`
 - **Provenance**: `memory.explain`, `memory.fact_graph`
 - **Shortcuts**: `memory.decisions`, `memory.conventions`, `memory.architecture`
 - **Context**: `memory.facts_by_tool`, `memory.facts_by_context`
 - **Management**: `memory.promote`, `memory.store_extraction`
+- **Distillation**: `memory.undistilled`, `memory.mark_distilled`
 - **Monitoring**: `memory.status`, `memory.stats`, `memory.changes`, `memory.conflicts`
 - **Maintenance**: `memory.sweep_now`
 - **Discovery**: `memory.check_setup`, `memory.list_projects`
@@ -317,7 +341,7 @@ Available MCP tools (21 total):
 ClaudeMemory integrates with Claude Code via hooks in `.claude/settings.json`:
-- **Ingest hook**: Triggers on Stop/SessionStart/PreCompact/SessionEnd events
+- **Ingest hook**: Triggers on Stop/SessionStart/PreCompact/SessionEnd/TaskCompleted/TeammateIdle events
   - Calls `claude-memory hook ingest` with stdin JSON
   - Reads transcript delta and updates both global and project databases