RubyGems - claude_memory - Versions diffs - 0.9.1 → 0.11.0 - Mend

claude_memory 0.9.1 → 0.11.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (77) hide show

checksums.yaml +4 -4
data/.claude/memory.sqlite3 +0 -0
data/.claude/skills/dashboard/SKILL.md +42 -0
data/.claude-plugin/marketplace.json +1 -1
data/.claude-plugin/plugin.json +1 -1
data/CHANGELOG.md +130 -0
data/CLAUDE.md +30 -6
data/README.md +66 -2
data/db/migrations/015_add_activity_events.rb +26 -0
data/db/migrations/016_add_moment_feedback.rb +22 -0
data/db/migrations/017_add_last_recalled_at.rb +15 -0
data/docs/1_0_punchlist.md +371 -0
data/docs/EXAMPLES.md +41 -2
data/docs/GETTING_STARTED.md +33 -4
data/docs/architecture.md +22 -7
data/docs/audit-queries.md +131 -0
data/docs/dashboard.md +192 -0
data/docs/improvements.md +650 -9
data/docs/influence/cq.md +187 -0
data/docs/plugin.md +13 -6
data/docs/quality_review.md +524 -172
data/docs/reflection_memory_as_accumulating_judgment.md +67 -0
data/lib/claude_memory/activity_log.rb +86 -0
data/lib/claude_memory/commands/census_command.rb +210 -0
data/lib/claude_memory/commands/completion_command.rb +3 -0
data/lib/claude_memory/commands/dashboard_command.rb +54 -0
data/lib/claude_memory/commands/dedupe_conflicts_command.rb +55 -0
data/lib/claude_memory/commands/digest_command.rb +273 -0
data/lib/claude_memory/commands/hook_command.rb +61 -2
data/lib/claude_memory/commands/initializers/hooks_configurator.rb +7 -4
data/lib/claude_memory/commands/reclassify_references_command.rb +56 -0
data/lib/claude_memory/commands/registry.rb +7 -1
data/lib/claude_memory/commands/show_command.rb +90 -0
data/lib/claude_memory/commands/skills/distill-transcripts.md +13 -1
data/lib/claude_memory/commands/stats_command.rb +131 -2
data/lib/claude_memory/commands/sweep_command.rb +2 -0
data/lib/claude_memory/configuration.rb +16 -0
data/lib/claude_memory/core/relative_time.rb +9 -0
data/lib/claude_memory/dashboard/api.rb +610 -0
data/lib/claude_memory/dashboard/conflicts.rb +279 -0
data/lib/claude_memory/dashboard/efficacy.rb +127 -0
data/lib/claude_memory/dashboard/fact_presenter.rb +109 -0
data/lib/claude_memory/dashboard/health.rb +175 -0
data/lib/claude_memory/dashboard/index.html +2707 -0
data/lib/claude_memory/dashboard/knowledge.rb +136 -0
data/lib/claude_memory/dashboard/moments.rb +244 -0
data/lib/claude_memory/dashboard/reuse.rb +97 -0
data/lib/claude_memory/dashboard/scoped_fact_resolver.rb +95 -0
data/lib/claude_memory/dashboard/server.rb +211 -0
data/lib/claude_memory/dashboard/timeline.rb +68 -0
data/lib/claude_memory/dashboard/trust.rb +454 -0
data/lib/claude_memory/distill/bare_conclusion_detector.rb +71 -0
data/lib/claude_memory/distill/reference_material_detector.rb +78 -0
data/lib/claude_memory/hook/auto_memory_mirror.rb +112 -0
data/lib/claude_memory/hook/context_injector.rb +97 -3
data/lib/claude_memory/hook/handler.rb +191 -3
data/lib/claude_memory/mcp/handlers/management_handlers.rb +8 -0
data/lib/claude_memory/mcp/query_guide.rb +11 -0
data/lib/claude_memory/mcp/text_summary.rb +29 -0
data/lib/claude_memory/mcp/tool_definitions.rb +13 -0
data/lib/claude_memory/mcp/tools.rb +148 -0
data/lib/claude_memory/publish.rb +13 -21
data/lib/claude_memory/recall/stale_detector.rb +67 -0
data/lib/claude_memory/resolve/predicate_policy.rb +2 -0
data/lib/claude_memory/resolve/resolver.rb +41 -11
data/lib/claude_memory/store/llm_cache.rb +68 -0
data/lib/claude_memory/store/metrics_aggregator.rb +96 -0
data/lib/claude_memory/store/schema_manager.rb +1 -1
data/lib/claude_memory/store/sqlite_store.rb +47 -143
data/lib/claude_memory/store/store_manager.rb +29 -0
data/lib/claude_memory/sweep/maintenance.rb +216 -0
data/lib/claude_memory/sweep/recall_timestamp_refresher.rb +83 -0
data/lib/claude_memory/sweep/sweeper.rb +2 -0
data/lib/claude_memory/templates/hooks.example.json +5 -0
data/lib/claude_memory/version.rb +1 -1
data/lib/claude_memory.rb +24 -0
metadata +51 -1

data/docs/influence/cq.md ADDED Viewed

@@ -0,0 +1,187 @@
+# cq Analysis
+*Analysis Date: 2026-04-28*
+*Repository: https://github.com/technicalpickles/cq*
+*Focus: Tool usefulness (not internals)*
+---
+## Executive Summary
+**cq** is a Rust CLI that indexes Claude Code's JSONL session transcripts into a local DuckDB cache (`~/.cache/cq/index.duckdb`) and exposes four SQL views (`sessions`, `messages`, `tool_calls`, `tool_results`) for querying with raw SQL or canned subcommands.
+It is positioned squarely as **observability for your own Claude Code usage** — not a memory system, not a curation tool, not an in-session helper. You run it from a separate terminal to ask meta-questions like "which skills are firing?" or "where did context go in that bad session?"
+**Verdict for ClaudeMemory**: complementary, not competing. cq is a *read* tool over raw transcripts; ClaudeMemory is a *write/curate* tool that distills transcripts into facts. Same data source, different jobs. **Recommendation: install cq as a developer-side audit tool**, especially for validating that the ClaudeMemory plugin itself is being used correctly. Do not adopt internals — the architectures don't overlap meaningfully.
+**Tech stack**: Rust, DuckDB (with JSON extension), clap, comfy-table, fs2 file locking. ~14 source modules, MIT licensed.
+## What cq Actually Gives You
+The four views are the product. Everything else (subcommands, `--grep`, `-A/-B/-C` context flags, `--since 7d`) is convenience over those views.
+| View | What it is | Why it matters |
+|------|------------|----------------|
+| `sessions` | One row per session, with timestamps, message counts, tool counts | Fastest way to find "the session where X happened" |
+| `messages` | One row per user/assistant turn | Full-text grep across your entire history |
+| `tool_calls` | One row per tool_use block with input as queryable JSON | The killer view — `json_extract_string(input, '$.command')` etc. |
+| `tool_results` | One row per tool_result with `is_error` flag | Pairs with `tool_calls` to find silent failures |
+The `tool_calls` view is where the value is. SQL + JSON path extraction over every Bash command, every Read path, every Skill invocation, every MCP tool call, across all your sessions, scoped automatically to the current project.
+## Concrete Use Cases (lifted from their docs/use-cases.md)
+These three patterns are the strongest argument for installing cq today:
+### 1. Skill activation gaps
+> *"Out of 166 sessions that ran `git commit` in a 7-day window, only 16 activated any commit skill. The rest went straight through Bash."*
+A self-join on `tool_calls` between `Bash WHERE command LIKE '%git commit%'` and `Skill WHERE skill LIKE '%commit%'`, grouped by session_id, tells you which sessions ran `git commit` *without* invoking a commit skill. This is the cleanest "is my skill triggering?" signal that exists.
+**Direct relevance to you**: ClaudeMemory ships several skills (`/distill-transcripts`, `/release`, `/review-for-quality`, `/review-commit`, etc.) and an MCP plugin. You currently have no way to answer "is the memory plugin actually firing on questions where it should?" Same query shape works:
+```sql
+-- Sessions that asked architecture/convention questions but never called memory.*
+WITH memory_sessions AS (
+  SELECT DISTINCT session_id FROM tool_calls
+  WHERE name LIKE 'mcp__memory__%'
+)
+SELECT m.session_id, m.text
+FROM messages m
+LEFT JOIN memory_sessions ms ON m.session_id = ms.session_id
+WHERE m.type = 'user'
+  AND (m.text ILIKE '%convention%' OR m.text ILIKE '%architecture%' OR m.text ILIKE '%why did we%')
+  AND ms.session_id IS NULL
+```
+### 2. Silent failures (the wrong-path pattern)
+> *"The skill instructions referenced the wrong path... Claude recovered every time by Glob-searching for the file, so from the outside everything looked fine. Across 23 sessions over 30 days, the same silent failure repeated."*
+Detects the `Read fails → Glob → Read succeeds at different path` sequence. For ClaudeMemory's skills (which reference dozens of file paths in `.claude/skills/`), this is a maintenance multiplier — broken paths self-heal at the cost of a few wasted tool calls per invocation, and you'd never notice without this query.
+### 3. Context-budget forensics
+> *"Three calls ate the context budget. Thirty more burned it retrying queries that would never work."*
+`SELECT name, length(content) AS result_chars FROM tool_calls JOIN tool_results ... ORDER BY result_chars DESC` for a single session. Surfaces the few large tool results that dominate context. Useful when a session "felt slow" but no individual step looked wrong.
+## How cq Compares to ClaudeMemory's Existing Data
+ClaudeMemory already captures some of this in its own SQLite databases:
+| Capability | ClaudeMemory | cq |
+|------------|--------------|-----|
+| Per-project tool calls | ✅ `tool_calls` table (v3, content_item_id-scoped) | ✅ `tool_calls` view |
+| Cross-project SQL | ❌ Project DB is project-scoped by design | ✅ Default cross-project, opt out with `--project` |
+| MCP tool telemetry | ✅ `mcp_tool_calls` table (v13) | ❌ Doesn't see MCP tools as a distinct category |
+| Tool inputs as queryable JSON | ⚠️ Stored as `tool_input` text, not indexed for JSON path | ✅ DuckDB `json_extract_string` over JSON |
+| Tool results with `is_error` | ✅ `is_error` column | ✅ `is_error` column |
+| Raw SQL access for ad-hoc analysis | ⚠️ `sqlite3 .claude/memory.sqlite3` works but no view layer | ✅ `cq sql "..."` first-class |
+| Session-level rollups | Partial | ✅ `sessions` view |
+| Distills facts, resolves conflicts | ✅ Core purpose | ❌ Not a goal |
+| Cross-session message grep | ❌ FTS5 is fact-scoped | ✅ `cq messages --grep` |
+**Conclusion**: ClaudeMemory has the *write* path (ingest → distill → resolve → publish). cq has the *read* path (incremental sync → views → SQL). They share input data (Claude Code JSONLs) and stop there.
+## Adoption Opportunities
+### High Priority ⭐
+#### 1. Install cq as a developer audit tool for the ClaudeMemory project itself
+- **Value**: Answer two questions you currently can't answer cheaply:
+  1. "Is the memory plugin being invoked when it should?" (skill activation)
+  2. "Are there silent failures in `mcp__memory__*` calls?" (error rate, retry loops)
+- **Evidence**: cq's three documented use cases (use-cases.md:1–200) translate directly to ClaudeMemory's situation; you ship a plugin with similar trigger ambiguity
+- **Implementation**: `cargo install --git https://github.com/technicalpickles/cq` — no integration needed, runs out-of-band
+- **Effort**: 5 minutes
+- **Trade-off**: Adds a Rust toolchain dependency on the dev machine; DuckDB cache grows over time (rebuild via `--reindex`)
+- **Recommendation**: **ADOPT** as a personal tool, not a project dependency
+#### 2. Borrow cq's three reference queries for a `docs/audit-queries.md`
+- **Value**: Pre-written SQL the user (or a future maintainer) can run against ClaudeMemory's own databases or via cq to validate the plugin is doing its job. Useful for releases ("did v0.10 actually move the memory.recall hit rate?") and for reproducing skill-activation regressions.
+- **Evidence**: use-cases.md provides exact query templates; only the predicate names change
+- **Implementation**: New doc file, three queries, ~30 minutes
+- **Effort**: Low
+- **Trade-off**: Maintenance — queries rot when schemas change. Mitigate by pinning to ClaudeMemory's own `tool_calls` schema where possible (stable since v3) rather than cq's view schema (younger).
+- **Recommendation**: **CONSIDER** — only worth it if you're going to actually run the audits
+### Medium Priority
+#### 3. Expose ClaudeMemory's `tool_calls` data via a similar SQL view layer
+- **Value**: ClaudeMemory's `tool_calls` table already has the data, but `sqlite3 .claude/memory.sqlite3 "SELECT ..."` requires knowing column names. A `claude-memory sql` subcommand mirroring `cq sql` would lower the barrier.
+- **Evidence**: cq's `sql.rs` (intentionally unparameterized passthrough) shows the minimal viable shape
+- **Implementation**: New `SqlCommand` in `lib/claude_memory/commands/`, ~50 lines using existing Sequel connection
+- **Effort**: Half a day including tests
+- **Trade-off**: Power-user feature. Risks footgun (drop tables) — would need read-only enforcement. Adds surface area to maintain.
+- **Recommendation**: **DEFER** — only if users start asking. Right now `memory.recall_semantic` and the shortcut tools cover the curated path, and `sqlite3` covers the power-user path. The middle ground is thinly populated.
+### Low Priority
+#### 4. Adopt cq's `--since 7d` duration parser pattern
+- **Value**: Unified relative-time parsing across `claude-memory` subcommands; ClaudeMemory has `Core::RelativeTime` for *output*, less consistency on *input*
+- **Evidence**: cq's `scope.rs` parses `7d|24h|30m` uniformly across all commands
+- **Implementation**: A `Core::DurationParser` value object
+- **Effort**: A couple hours
+- **Trade-off**: Real but minor UX win
+- **Recommendation**: **DEFER** — pick up if/when adding more time-filtered commands
+### Features to Avoid
+- **DuckDB as a primary store**. ClaudeMemory's SQLite + extralite + Sequel choice is right for the curation/write workload (FTS5, vec0, transactional resolve). DuckDB is right for cq's analytical scan-everything workload. Don't conflate them.
+- **Cross-project default scoping**. cq defaults to "all projects" with auto-narrowing to current project. ClaudeMemory's project/global split is a feature for memory recall (you don't want one project's conventions leaking into another). Keep what you have.
+- **Re-indexing transcripts on every command**. cq's incremental sync exists because it has no other ingest path. ClaudeMemory's hook-driven ingest is already incremental in a different way and shouldn't be replaced.
+## Trade-offs of Using cq Long-Term
+- **Cache freshness**: cq syncs on every run via mtime/size fast-path. Cost: a few hundred ms on a large transcript history.
+- **Lock contention**: `fs2` file lock means concurrent runs may show stale data (the design choice is "stale-but-available beats error" — fine for a query tool).
+- **No curation**: cq surfaces patterns; you still have to interpret them. The "152 sessions skipped the skill" finding only matters if you act on it.
+- **Schema is Claude Code's JSONL format**: if Anthropic changes the transcript shape, cq breaks until updated. Same risk ClaudeMemory has, just exposed differently.
+## Implementation Recommendations
+**Phase 1 — Just install it (today, 5 minutes)**:
+- `cargo install --git https://github.com/technicalpickles/cq`
+- Run `cq tools` and `cq sessions` to see your own usage
+- Run the skill-activation query against your `mcp__memory__*` tool calls
+**Phase 2 — If Phase 1 surfaces something useful (~half-day)**:
+- Five concrete queries already pre-written in `docs/audit-queries.md` (activation rate, missed-memory-shaped prompts, tool ranking, error rate, result-size distribution)
+- Decide if any belong as a recurring `/schedule` agent ("audit memory plugin activation weekly")
+**Phase 3 — Speculative (defer indefinitely)**:
+- A `claude-memory sql` subcommand if users ask for one
+- A `Core::DurationParser` value object if you add another time-filtered command
+## Architecture Decisions
+**Preserve**:
+- ClaudeMemory's two-DB scope split (project vs global)
+- SQLite + extralite + Sequel as the storage stack
+- Hook-driven ingest, not on-demand re-parse
+- Distill → Resolve → Publish curation pipeline
+**Adopt** (out-of-band, not into the codebase):
+- cq itself, as a developer-side audit tool
+**Reject**:
+- DuckDB / cross-project default / replacing curation with raw SQL views
+## Key Takeaways
+1. **cq solves a different problem than ClaudeMemory**: observability vs curation. The right answer is "use both," not "absorb one into the other."
+2. **The most valuable thing in the cq repo is `docs/use-cases.md`**, not the code. The three query patterns (skill activation, silent failures, context budget) are immediately runnable against your own usage.
+3. **ClaudeMemory has data parity for the per-project case** (the `tool_calls` table covers the same ground), but lacks cq's cross-project SQL ergonomics. That gap is small — a `sqlite3` shell closes it for power users.
+4. **Highest-leverage next step**: install cq, run the skill-activation query against `mcp__memory__*`, see whether the memory plugin is firing as expected. That's a 10-minute experiment with a real chance of surfacing a fixable issue.
+## Next Steps
+- [ ] Install cq locally
+- [ ] Run `cq sql` audit on `mcp__memory__*` activation rate over the last 30d
+- [ ] If the audit surfaces a real gap, file it and decide whether the fix lives in skill descriptions, MCP server instructions, or elsewhere

data/docs/plugin.md CHANGED Viewed

@@ -133,18 +133,25 @@ Unlike traditional approaches that require a separate API key, ClaudeMemory uses
 ### MCP Server
-The plugin exposes these tools to Claude:
+The plugin exposes 25 tools to Claude. Highlights:
 | Tool | Description |
 |------|-------------|
-| `memory.recall` | Search facts by query |
+| `memory.recall` | Search facts by query (lexical + semantic + hybrid) |
+| `memory.recall_semantic` | Vector-search facts with optional `explain:` score traces |
+| `memory.search_concepts` | Multi-concept intersection search |
+| `memory.decisions` / `memory.conventions` / `memory.architecture` | Predicate-shortcut readers |
 | `memory.explain` | Get fact details with provenance |
-| `memory.store_extraction` | Store extracted facts |
-| `memory.promote` | Promote project fact to global |
-| `memory.status` | Check database health |
-| `memory.changes` | Recent fact updates |
+| `memory.fact_graph` | Walk supersession + conflict edges around a fact |
+| `memory.store_extraction` | Store extracted facts (used by /distill-transcripts) |
+| `memory.undistilled` / `memory.mark_distilled` | Distillation pipeline tracking |
+| `memory.promote` / `memory.reject_fact` | Manage fact lifecycle |
+| `memory.status` / `memory.stats` / `memory.activity` / `memory.changes` | Observability surfaces |
 | `memory.conflicts` | Open contradictions |
 | `memory.sweep_now` | Run maintenance |
+| `memory.check_setup` / `memory.list_projects` | Discovery |
+See `lib/claude_memory/mcp/tool_definitions.rb` for the full schema of every tool, including arguments, return shapes, and tool annotations (`readOnlyHint`, `idempotentHint`, `destructiveHint`).
 ### Hooks