npm - smart-context-mcp - Versions diffs - 1.0.4 → 1.2.0 - Mend

smart-context-mcp 1.0.4 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (24) hide show

package/README.md +196 -586
package/package.json +11 -7
package/scripts/init-clients.js +56 -27
package/scripts/report-metrics.js +5 -0
package/scripts/report-workflow-metrics.js +255 -0
package/src/analytics/adoption.js +197 -0
package/src/cache-warming.js +131 -0
package/src/context-patterns.js +192 -0
package/src/cross-project.js +343 -0
package/src/diff-analysis.js +291 -0
package/src/git-blame.js +324 -0
package/src/index.js +54 -5
package/src/metrics.js +6 -1
package/src/server.js +199 -13
package/src/storage/sqlite.js +50 -1
package/src/streaming.js +152 -0
package/src/tools/smart-context.js +115 -6
package/src/tools/smart-metrics.js +7 -0
package/src/tools/smart-read-batch.js +9 -0
package/src/tools/smart-read.js +21 -1
package/src/tools/smart-shell.js +33 -9
package/src/tools/smart-turn.js +1 -0
package/src/workflow-tracker-stub.js +53 -0
package/src/workflow-tracker.js +410 -0

package/README.md CHANGED Viewed

@@ -1,710 +1,320 @@
 # smart-context-mcp
+MCP server that reduces AI agent token usage by 90% with intelligent context compression.
 [![npm version](https://badge.fury.io/js/smart-context-mcp.svg)](https://www.npmjs.com/package/smart-context-mcp)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
-**MCP server that reduces AI agent token usage by 90% and improves response quality.**
-Instead of reading entire files and repeating context, this MCP provides 8 focused tools that compress, rank, and maintain context efficiently.
-## Why use this?
-**Problem:** AI agents waste tokens reading full files, repeating context, and searching inefficiently.
-**Solution:** This MCP reduces token usage by **~90%** in real projects while improving response quality.
-**Real metrics from production use:**
-- 14.5M tokens → 1.6M tokens (89.87% reduction)
-- 3,666 successful calls across the original 7 core tools
-- Compression ratios: 3x to 46x depending on tool
-## Quick Start (2 commands)
-```bash
-npm install smart-context-mcp
-npx smart-context-init --target .
-```
-That's it. Restart your AI client (Cursor, Codex, Claude Desktop) and the tools are available.
-**Important:** The init command automatically sets the correct project-root env var in the generated configs, so the MCP server runs from your project root. This works for standalone projects, monorepos, and nested workspaces.
-## What you get
-Eight focused tools that work automatically:
-- `smart_read`: compact file summaries instead of full file dumps (3x compression)
-- `smart_read_batch`: read multiple files in one call — reduces round-trip latency
-- `smart_search`: ripgrep-first code search with intent-aware ranking (21x compression)
-- `smart_context`: one-call context planner — search + read + graph expansion
-- `smart_summary`: maintain compressed conversation state across sessions (46x compression)
-- `smart_turn`: one-call turn orchestration for start/end context recovery and checkpointing
-- `smart_metrics`: inspect saved token metrics and recent usage through MCP
-- `smart_shell`: safe diagnostic shell execution with restricted commands (18x compression)
-- `build_index`: lightweight symbol index for faster lookups and smarter ranking
-**Strongest in:** Modern web/backend codebases (JS/TS, React, Next.js, Node.js, Python, Go, Rust), infra repos (Terraform, Docker, YAML)
-## Example: Before vs After
-### Without this MCP
-```
-Agent: Let me read auth.js...
-[Reads 4,000 tokens of full file]
-Agent: Let me search for "jwt validation"...
-[Returns 10,000 tokens of grep results]
-Agent: [Next turn] What were we doing?
-[Repeats 5,000 tokens of context]
-Total: ~19,000 tokens
-```
-### With this MCP
-```
-Agent: Let me use smart_read on auth.js...
-[Returns 500 tokens of signatures]
-Agent: Let me use smart_search for "jwt validation"...
-[Returns 400 tokens of ranked snippets]
-Agent: [Next turn] Let me get the context...
-[smart_summary returns 100 tokens]
-Total: ~1,000 tokens (95% reduction)
-```
-## Quick start
-```bash
-npm install smart-context-mcp
-npx smart-context-init --target .
-```
-This installs the MCP server and generates client configs for Cursor, Codex, Qwen, and Claude Code. Open the project with your IDE/agent and the server starts automatically.
-If the target is a git repository, `smart-context-init` also installs an idempotent `pre-commit` hook that blocks commits when `.devctx/state.sqlite` is staged, tracked, or not properly ignored.
-For Claude Code, `smart-context-init` also generates `.claude/settings.json` with native hooks so devctx context recovery and turn-end enforcement run automatically.
-## Binaries
-The package exposes five binaries:
-- `smart-context-headless`
-- `smart-context-server`
-- `smart-context-init`
-- `smart-context-report`
-- `smart-context-protect`
-Start the MCP server against the current project:
+## Installation
+### Cursor
 ```bash
-smart-context-server
+npm install -g smart-context-mcp
+npx smart-context-init --target . --clients cursor
 ```
+Restart Cursor. Done.
-Start it against another repository:
+### Codex CLI
 ```bash
-smart-context-server --project-root /path/to/target-repo
+npm install -g smart-context-mcp
+npx smart-context-init --target . --clients codex
 ```
+Restart Codex. Done.
-## Generate client configs
-Generate MCP config files for a target project:
+### Claude Desktop
 ```bash
-smart-context-init --target /path/to/project
+npm install -g smart-context-mcp
+npx smart-context-init --target . --clients claude
 ```
+Restart Claude Desktop. Done.
-Limit the generated clients if needed:
+### Qwen Code
 ```bash
-smart-context-init --target /path/to/project --clients cursor,codex,qwen,claude
+npm install -g smart-context-mcp
+npx smart-context-init --target . --clients qwen
 ```
+Restart Qwen Code. Done.
-Override the command used in generated configs:
+### All Clients
 ```bash
-smart-context-init --target /path/to/project --command node --args '["./tools/devctx/src/mcp-server.js"]'
-```
-## Metrics
-Each tool call persists token metrics to the target repo by default in:
-```bash
-.devctx/state.sqlite
+npm install -g smart-context-mcp
+npx smart-context-init --target .
 ```
+Restart your AI client. Done.
-SQLite is now the primary project-local store for persisted context and metrics. Running `smart-context-init` also adds `.devctx/` to the target repo's `.gitignore` idempotently.
+## How it Works in Practice
-When an active session exists, metrics entries automatically inherit its `sessionId`, so you can inspect savings per task with `smart_metrics`.
+**The reality:** This MCP does not intercept prompts automatically. Here's the actual flow:
-Show a quick report:
+1. **You:** "Fix the login bug"
+2. **Agent reads rules:** Sees debugging workflow
+3. **Agent decides:** Uses `smart_search(intent=debug)`
+4. **MCP returns:** Ranked results (errors prioritized)
+5. **Agent continues:** Calls `smart_read(symbol)` for function
+6. **Agent fixes:** Makes changes
+7. **Agent verifies:** Calls `smart_shell('npm test')`
+8. **Agent checkpoints:** Calls `smart_turn(end)`
-```bash
-smart-context-report
-```
+**Key points:**
+- ✅ Agent **chooses** to use devctx tools (not forced)
+- ✅ Rules **guide** the agent (not enforce)
+- ✅ Agent can use built-in tools when appropriate
+- ✅ Token savings: 85-90% on complex tasks
-Show JSON output or inspect a legacy/custom JSONL file explicitly:
+Check actual usage:
+- `npm run report:metrics` - Tool-level savings
+- `npm run report:workflows` - Workflow-level savings (requires `DEVCTX_WORKFLOW_TRACKING=true`)
-```bash
-smart-context-report --json
-smart-context-report --file ./.devctx/metrics.jsonl
-```
+## What it does
-Example output:
+Provides **two key components**:
-```text
-devctx metrics report
+### 1. Specialized Tools (12 tools)
-File:         /path/to/repo/.devctx/state.sqlite
-Source:       sqlite
-Entries:      148
-Raw tokens:   182,340
-Final tokens: 41,920
-Saved tokens: 140,420 (77.01%)
+| Tool | Purpose | Savings |
+|------|---------|---------|
+| `smart_read` | Read files in outline/signatures mode | 90% |
+| `smart_read_batch` | Read multiple files in one call | 90% |
+| `smart_search` | Intent-aware code search with ranking | 95% |
+| `smart_context` | One-call context builder | 85% |
+| `smart_summary` | Task checkpoint management | 98% |
+| `smart_turn` | Task recovery orchestration | - |
+| `smart_metrics` | Token usage inspection | - |
+| `smart_shell` | Safe command execution | 94% |
+| `build_index` | Symbol index builder | - |
+| `warm_cache` | File preloading (5x faster cold start) | - |
+| `git_blame` | Function-level code attribution | - |
+| `cross_project` | Multi-project context | - |
-By tool:
-  smart_context  count=42 raw=96,200 final=24,180 saved=72,020 (74.86%)
-  smart_read     count=71 raw=52,810 final=9,940 saved=42,870 (81.18%)
-  smart_search   count=35 raw=33,330 final=7,800 saved=25,530 (76.59%)
-```
+### 2. Agent Rules (Task-Specific Guidance)
-If you need JSONL compatibility for external tooling, set `DEVCTX_METRICS_FILE` or pass `--file`.
+Installation generates rules that teach agents optimal workflows:
-## Usage per client
+**Debugging:** `smart_search(intent=debug)` → `smart_read(symbol)` → fix (90% savings)
+**Code Review:** `smart_context(diff=true)` → `smart_read(signatures)` → review (87% savings)
+**Refactoring:** `smart_context(entryFile)` → `smart_read(signatures)` → refactor (89% savings)
+**Testing:** `smart_search(intent=tests)` → `smart_read(symbol)` → write test (90% savings)
+**Architecture:** `smart_context(detail=minimal)` → `smart_read(signatures)` → analyze (90% savings)
-After installing and running `smart-context-init`, each client picks up the server automatically:
+**Key insight:** The value isn't just in the tools—it's in teaching agents **when** and **how** to use them.
-### Cursor
+## Real Metrics
-Open the project in Cursor. The MCP server starts automatically. Enable it in **Cursor Settings > MCP** if needed. All eight tools are available in Agent mode.
+Production usage: **14.5M tokens → 1.6M tokens** (89.87% reduction)
-### Codex CLI
+## Core Tools
-```bash
-cd /path/to/your-project
-codex
-```
+### smart_read
-Codex reads `.codex/config.toml` and starts the MCP server on launch.
+Read files without full content:
-### Claude Code
+```javascript
+// Outline mode: structure only (~400 tokens vs 4000)
+{ filePath: 'src/server.js', mode: 'outline' }
-```bash
-cd /path/to/your-project
-claude
+// Extract specific function
+{ filePath: 'src/auth.js', mode: 'symbol', symbol: 'validateToken' }
 ```
-Claude Code reads `.mcp.json` from the project root and `.claude/settings.json` for native hook automation.
-### Codex/Qwen headless fallback
+**Modes**: `outline`, `signatures`, `symbol`, `range`, `full`
-When a client does not expose native per-turn hooks, use `smart-context-headless` to wrap a headless CLI run and force `smart_turn(start)` plus a closing checkpoint around that invocation.
+### smart_search
-Examples:
+Intent-aware search with ranking:
-```bash
-smart-context-headless --client codex --prompt "Finish the runtime repo-safety docs" -- codex exec
-smart-context-headless --client qwen --prompt "Review the persisted session and propose the next step" -- qwen -p
+```javascript
+{ query: 'authentication', intent: 'debug' }  // Prioritizes errors, logs
+{ query: 'UserModel', intent: 'implementation' }  // Prioritizes source
 ```
-This is the current automation path for non-Claude CLI agents. GUI clients without hook support still rely on generated rules plus `smart_turn`.
+**Intents**: `implementation`, `debug`, `tests`, `config`, `docs`, `explore`
-### Qwen Code
+### smart_context
-Open the project in Qwen Code. The MCP server starts from `.qwen/settings.json`.
+Get everything for a task in one call:
-## Agent rules
+```javascript
+{
+  task: 'Fix authentication bug',
+  detail: 'balanced',  // minimal | balanced | deep
+  maxTokens: 8000
+}
+```
-`smart-context-init` generates agent rules that instruct AI agents to prefer devctx tools over their built-in equivalents. This is what makes agents use `smart_read` in outline/signatures mode instead of reading full files.
+Returns: relevant files + compressed content + symbol details + graph relationships
-### Intent-based workflows
+### smart_summary
-The `intent` parameter in `smart_search` and `smart_context` adjusts ranking and suggests optimal workflows:
+Maintain task checkpoint:
-| Intent | Ranking priority | Suggested workflow |
-|--------|-----------------|-------------------|
-| `debug` | Error messages, stack traces, logs | Search error → read signatures → inspect symbol → smart_shell |
-| `implementation` | Source files, changed files | Read outline/signatures → focus on changed symbols |
-| `tests` | Test files, spec files | Find tests → read symbol of function under test |
-| `config` | Config files, env vars, YAML/JSON | Find settings → read full config files |
-| `explore` | Entry points, main modules | Directory structure → outlines of key modules |
+```javascript
+// Save checkpoint
+{ action: 'update', update: { goal: 'Implement OAuth', status: 'in_progress', nextStep: '...' }}
-### Generated files per client
+// Resume task
+{ action: 'get' }
+```
-- **Cursor**: `.cursor/rules/devctx.mdc` (always-apply rule)
-- **Codex**: `AGENTS.md` (devctx section with sentinel markers)
-- **Claude Code**: `CLAUDE.md` (devctx section with sentinel markers) and `.claude/settings.json` (native hooks)
+Stores compressed task state (~100 tokens: goal, status, decisions, blockers), not full conversation.
-The generated files are idempotent — running `smart-context-init` again updates the devctx sections and Claude hook entries without duplicating them. Existing content in `AGENTS.md`, `CLAUDE.md`, and `.claude/settings.json` is preserved.
+## New Features
-## Use against another repo
+### Diff-Aware Context
-By default, `devctx` works against the repo where it is installed. You can point it at another repo without modifying that target project:
+Analyze git changes intelligently:
-```bash
-node ./src/mcp-server.js --project-root /path/to/target-repo
+```javascript
+{ task: 'Review changes', diff: 'main' }
 ```
-or:
+Returns changed files prioritized by impact + related files (importers, tests).
-```bash
-DEVCTX_PROJECT_ROOT=/path/to/target-repo node ./src/mcp-server.js
-```
+### Context Prediction
-or (recommended for MCP clients and generated configs):
+Learn from usage and predict needed files:
-```bash
-DEVCTX_PROJECT_ROOT=/path/to/target-repo node ./src/mcp-server.js
+```javascript
+{ task: 'Implement auth', prefetch: true }
 ```
-Legacy configs that still set `MCP_PROJECT_ROOT` remain supported for backward compatibility.
-`smart-context-init` automatically sets `DEVCTX_PROJECT_ROOT` in the generated client configs (`.cursor/mcp.json`, `.codex/config.toml`, `.mcp.json`, `.qwen/settings.json`), so the MCP server always launches from the correct project context, even in monorepos or when installed globally.
-## What it is good at
-| Level | Languages / Stack | Use cases |
-|-------|------------------|-----------|
-| **Strong** | JS/TS, React, Next.js, Node.js, Python | Modern web apps, monorepos, backend services, scripts |
-| **Strong** | Terraform, Docker, YAML, shell, SQL | Infra/platform repos, config-heavy codebases |
-| **Good** | Go, Rust, Java, C#/.NET, Kotlin, PHP, Swift | Services, libraries, Android/iOS, Laravel/Symfony |
-| **Partial** | Enterprise Java/C# with heavy frameworks | Generated code, polyglot monorepos needing semantic ranking |
-| **Limited** | Ruby, Elixir, Scala | Deep semantic understanding required, general shell needs |
-## Tool behavior
-### `smart_read`
+After 3+ similar tasks: 40-60% fewer round-trips, 15-20% additional savings.
-Modes:
+### Cache Warming
-- `outline` — compact structural summary (~90% token savings)
-- `signatures` — exported API surface only
-- `range` — specific line range with line numbers (`startLine`, `endLine`)
-- `symbol` — extract function/class/method by name; accepts a string or an array for batch extraction
-- `full` — file content capped at 12k chars, with truncation marker when needed
+Eliminate cold-start latency:
-The `symbol` mode supports nested methods (class methods, object methods), interface signatures, and multiline function signatures across all supported languages.
-Cross-file symbol context:
-- Pass `context: true` with `symbol` mode to include callers, tests, and referenced types from the dependency graph
-- Callers: files that import the current file and reference the symbol (via graph + ripgrep)
-- Tests: test files related to the current file that mention the symbol
-- Types: type/interface names referenced in the symbol definition that exist in the index
-- Requires `build_index` for graph data; without it, the definition is returned with an empty context and a hint
-- Response includes `context: { callers, tests, types }` with counts, `graphCoverage: { imports, tests }` (`full|partial|none`), and `contextHints` if applicable
-- `graphCoverage` indicates how reliable cross-file context is: `full` for JS/TS/Python/Go (imports resolved), `partial` for C#/Kotlin/PHP/Swift (imports extracted but namespace-based), `none` for other languages
-Token budget mode:
-- Pass `maxTokens` to let the tool auto-select the most detailed mode that fits the budget
-- Cascade order: `full` -> `outline` -> `signatures` -> truncated
-- If the requested mode (or default `outline`) exceeds the budget, the tool falls back to a more compact mode automatically
-- `range` and `symbol` modes do not cascade but will truncate by tokens if needed
-- When the mode changes, the response includes `chosenMode` (the mode actually used) and `budgetApplied: true`
-Responses are cached in memory per session. If the same file+mode is requested again and the file's `mtime` has not changed, the cached result is returned without re-parsing. The response includes `cached: true` when served from cache.
-Every response includes a `confidence` block:
-```json
-{ "parser": "ast|heuristic|fallback|raw", "truncated": false, "cached": false }
-```
-Additional metadata: `indexHint` (symbol mode), `chosenMode`/`budgetApplied` (token budget), `graphCoverage` (symbol+context mode).
-**Example response (outline mode):**
-```json
-{
-  "mode": "outline",
-  "parser": "ast",
-  "truncated": false,
-  "cached": false,
-  "tokens": 245,
-  "confidence": { "parser": "ast", "truncated": false, "cached": false },
-  "content": "import express from 'express';\nexport class AuthMiddleware { ... }\nexport function requireRole(role: string) { ... }"
-}
+```javascript
+{ incremental: true, warmCache: true }
 ```
-Current support:
-- First-class (AST): JS, JSX, TS, TSX
-- Heuristic: Python, Go, Rust, Java, C#, Kotlin, PHP, Swift, shell, Terraform, HCL, Dockerfile, SQL, JSON, TOML, YAML
-- Fallback: plain-text structural extraction for unsupported formats
-### `smart_read_batch`
-Read multiple files in one MCP call. Reduces round-trip latency for common patterns like "read the outline of these 5 files".
+First query: 250ms → 50ms (5x faster).
-Parameters:
+### Git Blame
-- `files` (required, max 20) — array of items, each with:
-  - `path` (required) — file path
-  - `mode` (optional) — `outline`, `signatures`, `full`, `range`, `symbol`
-  - `symbol`, `startLine`, `endLine` (optional) — as in `smart_read`
-  - `maxTokens` (optional) — per-file token budget with automatic mode cascade
-- `maxTokens` (optional) — global token budget; stops reading more files once exceeded (at least 1 file is always read)
+Function-level attribution:
-Response:
-```json
-{
-  "results": [
-    { "filePath": "...", "mode": "outline", "parser": "ast", "truncated": false, "content": "..." },
-    { "filePath": "...", "mode": "signatures", "parser": "heuristic", "truncated": false, "content": "..." }
-  ],
-  "metrics": { "totalTokens": 450, "filesRead": 2, "filesSkipped": 0, "totalSavingsPct": 88 }
-}
-```
-### `smart_search`
-- Uses embedded ripgrep via `@vscode/ripgrep`
-- Falls back to filesystem walking if rg is unavailable or fails
-- Groups matches by file, ranks results to reduce noise
-- Optional `intent` parameter adjusts ranking: `implementation`, `debug`, `tests`, `config`, `docs`, `explore`
-- When a symbol index exists (via `build_index`), files with matching definitions get +50 ranking bonus, and related files (importers, tests, neighbors) get +25 graph boost
-- Index is loaded from `projectRoot`, so subdirectory searches still benefit from the project-level index
-- Returns `confidence` block: `{ "level": "high", "indexFreshness": "fresh" }`
+```javascript
+// Who wrote each function?
+{ mode: 'symbol', filePath: 'src/server.js' }
-**Example response:**
+// Find code by author
+{ mode: 'author', authorQuery: 'alice@example.com' }
-```json
-{
-  "engine": "rg",
-  "retrievalConfidence": "high",
-  "indexFreshness": "fresh",
-  "confidence": { "level": "high", "indexFreshness": "fresh" },
-  "sourceBreakdown": { "textMatch": 7, "indexBoost": 2, "graphBoost": 1 },
-  "results": [
-    { "file": "src/auth/middleware.js", "matches": 3, "rank": 150, "preview": "export class AuthMiddleware { ..." }
-  ]
-}
+// Recent changes
+{ mode: 'recent', daysBack: 7 }
 ```
-### `smart_context`
+### Cross-Project Context
-One-call context planner. Instead of the manual cycle of `smart_search` → `smart_read` → `smart_read` → ..., `smart_context` receives a task description and returns curated context in a single response.
+Work across monorepos:
-**Pipeline:**
+```javascript
+// Search all projects
+{ mode: 'search', query: 'AuthService' }
-```
-task input → intent detection → search/diff → graph expansion → smart_read_batch → symbol extraction → response
+// Find symbol across projects
+{ mode: 'symbol', symbolName: 'validateToken' }
 ```
-**Parameters:**
-- `task` (required) — natural language description (e.g., `"debug the auth flow in AuthMiddleware"`)
-- `intent` (optional) — override auto-detected intent
-- `detail` (optional) — `minimal` | `balanced` (default) | `deep`
-- `maxTokens` (optional, default 8000) — token budget
-- `entryFile` (optional) — guarantee specific file inclusion
-- `diff` (optional) — `true` (vs HEAD) or git ref (`"main"`) to scope to changed files only
-- `include` (optional) — `["content","graph","hints","symbolDetail"]` to control response fields
+Requires `.devctx-projects.json` config.
-**Detail modes:**
+## Supported Languages
-| Mode | Behavior | Use when |
-|------|----------|----------|
-| `minimal` | Index-first: paths, roles, evidence, signatures, symbol previews (no file reads) | Fastest exploration, budget-constrained |
-| `balanced` | Batch read with smart compression (outline/signatures) | Default, most tasks |
-| `deep` | Full content reads | Deep investigation, debugging |
+**AST parsing**: JavaScript, TypeScript, JSX, TSX
-**How it works:**
+**Heuristic**: Python, Go, Rust, Java, C#, Kotlin, PHP, Swift
-1. **Search or diff**: Extracts queries from task and runs `smart_search`, OR runs `git diff` when `diff` parameter provided
-2. **Graph expansion**: Expands top results via relational graph (imports, importedBy, tests, neighbors)
-3. **Read strategy**: Index-first mode (no file reads) OR batch read mode using `smart_read_batch` with role-based compression
-4. **Symbol extraction**: Detects identifiers in task and extracts focused symbol details
-5. **Deduplication**: In `minimal` mode, omits redundant outline when `symbolDetail` covers same file
-6. **Assembly**: Returns curated context with `reasonIncluded` / `evidence` per item, graph summary, hints, and confidence block
+**Structural**: Shell, Terraform, HCL, Dockerfile, SQL, JSON, YAML, TOML
-Diff mode is ideal for PR review and debugging recent changes — reads only changed files plus their tests and dependencies.
+## Client Support
-Example response:
+- Cursor (`.cursor/mcp.json`)
+- Codex CLI (`.codex/config.toml`)
+- Claude Code (`.mcp.json` + `.claude/settings.json`)
+- Qwen Code (`.qwen/settings.json`)
-```json
-{
-  "task": "debug AuthMiddleware",
-  "intent": "debug",
-  "indexFreshness": "fresh",
-  "confidence": { "indexFreshness": "fresh", "graphCoverage": { "imports": "full", "tests": "full" } },
-  "context": [
-    { "file": "src/auth/middleware.js", "role": "primary", "readMode": "outline", "reasonIncluded": "Matched task search: AuthMiddleware", "evidence": [{ "type": "searchHit", "query": "AuthMiddleware", "rank": 1 }, { "type": "symbolMatch", "symbols": ["AuthMiddleware"] }], "symbols": ["AuthMiddleware", "requireRole"], "symbolPreviews": [{ "name": "AuthMiddleware", "kind": "class", "signature": "export class AuthMiddleware", "snippet": "export class AuthMiddleware { ..." }], "content": "..." },
-    { "file": "tests/auth.test.js", "role": "test", "readMode": "signatures", "reasonIncluded": "Test for src/auth/middleware.js", "evidence": [{ "type": "testOf", "via": "src/auth/middleware.js" }], "content": "..." },
-    { "file": "src/utils/jwt.js", "role": "dependency", "readMode": "signatures", "reasonIncluded": "Imported by src/auth/middleware.js", "evidence": [{ "type": "dependencyOf", "via": "src/auth/middleware.js" }], "content": "..." },
-    { "file": "src/auth/middleware.js", "role": "symbolDetail", "readMode": "symbol", "reasonIncluded": "Focused symbol detail: AuthMiddleware", "evidence": [{ "type": "symbolDetail", "symbols": ["AuthMiddleware"] }], "content": "..." }
-  ],
-  "graph": {
-    "primaryImports": ["src/utils/jwt.js"],
-    "tests": ["tests/auth.test.js"],
-    "dependents": [],
-    "neighbors": ["src/utils/logger.js"]
-  },
-  "graphCoverage": { "imports": "full", "tests": "full" },
-  "metrics": { "totalTokens": 1200, "filesIncluded": 4, "filesEvaluated": 8, "savingsPct": 82 },
-  "hints": ["Inspect symbols with smart_read: verifyJwt, createJwt"]
-}
-```
+## Commands
-`graphCoverage` indicates how complete the relational context is: `full` for JS/TS/Python/Go (imports resolved to local files), `partial` for C#/Kotlin/PHP/Swift (imports extracted but namespace-based), `none` for other languages. When files from multiple languages are included, the level reflects the weakest coverage.
-File roles: `primary` (search hits or changed files), `test` (related test files), `dependency` (imports), `dependent` (importedBy), `symbolDetail` (extracted symbol bodies). Each item also includes `reasonIncluded` and structured `evidence` so the agent knows why it was selected.
+```bash
+# Start server
+smart-context-server
-When using diff mode, the response includes a `diffSummary`:
+# Against another repo
+smart-context-server --project-root /path/to/repo
-```json
-{
-  "diffSummary": { "ref": "main", "totalChanged": 5, "included": 3, "skippedDeleted": 1 }
-}
-```
+# Generate configs
+smart-context-init --target /path/to/project
-### `smart_summary`
-Maintain compressed conversation state across sessions. Solves the context-loss problem when resuming work after hours or days.
-**Actions:**
-| Action | Purpose | Returns |
-|--------|---------|---------|
-| `get` | Retrieve current, explicit, or auto-resolved session | Resume summary (≤500 tokens) + compression metadata |
-| `update` | Create or replace session | New session with compressed state |
-| `append` | Add to existing session | Merged session state |
-| `auto_append` | Add only when something meaningful changed | Merged session state or skipped no-op result |
-| `checkpoint` | Event-driven orchestration for persistence decisions | Persisted update or skipped event with decision metadata |
-| `reset` | Clear session | Confirmation |
-| `list_sessions` | Show all available sessions | Array of sessions with metadata |
-| `compact` | Apply retention/compaction to SQLite state | Counts for pruned sessions, events, and metrics |
-| `cleanup_legacy` | Inspect or remove imported JSON/JSONL artifacts | Dry-run or deletion report |
-**Parameters:**
-- `action` (required) — one of the actions above
-- `sessionId` (optional) — session identifier; auto-generated from `goal` if omitted. Pass `"auto"` to accept the recommended recent session when multiple candidates exist.
-- `update` (required for update/append/auto_append/checkpoint) — object with:
-  - `goal`: primary objective
-  - `status`: current state (`planning` | `in_progress` | `blocked` | `completed`)
-  - `pinnedContext`: critical context that should survive compression when possible
-  - `unresolvedQuestions`: open questions that matter for the next turn
-  - `currentFocus`: current work area in one short phrase
-  - `whyBlocked`: blocker summary when status is `blocked`
-  - `completed`: array of completed steps
-  - `decisions`: array of key decisions with rationale
-  - `blockers`: array of current blockers
-  - `nextStep`: immediate next action
-  - `touchedFiles`: array of modified files
-- `maxTokens` (optional, default 500) — hard cap on summary size
-- `event` (optional for `checkpoint`) — one of `manual`, `milestone`, `decision`, `blocker`, `status_change`, `file_change`, `task_switch`, `task_complete`, `session_end`, `read_only`, `heartbeat`
-- `force` (optional, default false) — override a suppressed checkpoint event
-- `retentionDays` (optional, default 30) — used by `compact`
-- `keepLatestEventsPerSession` (optional, default 20) — used by `compact`
-- `keepLatestMetrics` (optional, default 1000) — used by `compact`
-- `vacuum` (optional, default false) — run SQLite `VACUUM` after deletions during `compact`
-- `apply` (optional, default false) — required to actually delete files during `cleanup_legacy`
-`update` replaces the stored session state for that `sessionId`, so omitted fields are cleared. Use `append` when you want to keep existing state and add progress incrementally. Use `auto_append` when the caller may fire checkpoint saves often and you want the tool to skip no-op updates automatically. Use `checkpoint` when the caller has a meaningful event and wants the tool to decide whether that event deserves persistence.
-**Storage:**
-- Session state, session events, summary cache, and metrics persist in `.devctx/state.sqlite`
-- Legacy `.devctx/sessions/*.json`, `.devctx/sessions/active.json`, and `.devctx/metrics.jsonl` are imported idempotently when present
-- `compact` enforces retention without deleting the active session
-- `cleanup_legacy` is dry-run by default and only deletes imported legacy artifacts when `apply: true`
-**Auto-resume behavior:**
-- `get` returns the active session immediately when `active.json` exists
-- If there is no active session, `get` auto-resumes the best saved session when there is a single clear candidate
-- If multiple recent sessions are plausible, `get` returns ordered `candidates` plus `recommendedSessionId`
-- Passing `sessionId: "auto"` accepts that recommendation and restores it as the active session
-**Resume summary fields:**
-- `status` and `nextStep` are preserved with highest priority
-- `pinnedContext` and `unresolvedQuestions` preserve critical context and open questions
-- `currentFocus` and `whyBlocked` are included when relevant
-- `recentCompleted`, `keyDecisions`, and `hotFiles` are derived from the persisted state
-- `completedCount`, `decisionsCount`, and `touchedFilesCount` preserve activity scale cheaply
-- Empty fields are omitted to save tokens
-**Response metadata:**
-- `schemaVersion`: persisted session schema version
-- `truncated`: whether the resume summary had to be compressed
-- `compressionLevel`: `none` | `trimmed` | `reduced` | `status_only`
-- `omitted`: fields dropped from the resume summary to fit the token budget
-- `repoSafety`: git hygiene signal for `.devctx/state.sqlite` (`isIgnored`, `isTracked`, `isStaged`, warnings, recommended actions)
-- mutating actions (`update`, `append`, `auto_append`, `checkpoint`, `reset`, `compact`) are blocked at runtime when `.devctx/state.sqlite` is tracked or staged
-**Compression strategy:**
-- Keeps the persisted session state intact and compresses only the resume summary
-- Prioritizes `nextStep`, `status`, and active blockers over history
-- Deduplicates repeated completed steps, decisions, and touched files
-- Uses token-aware reduction until the summary fits `maxTokens`
-**Example workflow:**
+# View metrics
+smart-context-report
-```javascript
-// Start of work session
-smart_summary({ action: "get" })
-// → retrieves last active session or auto-resumes the best saved session
-// After implementing auth middleware
-smart_summary({
-  action: "checkpoint",
-  event: "milestone",
-  update: {
-    completed: ["auth middleware"],
-    decisions: ["JWT with 1h expiry, refresh tokens in Redis"],
-    touchedFiles: ["src/middleware/auth.js"],
-    nextStep: "add role-based access control"
-  }
-})
-// Monday after weekend - resume work
-smart_summary({ action: "get" })
-// → full context restored, continue from nextStep
-// List all sessions
-smart_summary({ action: "list_sessions" })
-// → see all available sessions, pick one to resume
-// Inspect git safety for project-local state from any smart_summary response
-smart_summary({ action: "get" })
-// → repoSafety warns if .devctx/state.sqlite is tracked or not ignored
-// Suppress noisy read-only exploration checkpoints
-smart_summary({
-  action: "checkpoint",
-  event: "read_only",
-  update: { currentFocus: "inspect auth flow" }
-})
-// → skipped=true, no event persisted
-// Compact old SQLite events while keeping recent history
-smart_summary({ action: "compact", retentionDays: 30, keepLatestEventsPerSession: 20, keepLatestMetrics: 1000 })
-// Inspect what legacy files are safe to remove
-smart_summary({ action: "cleanup_legacy" })
-// Remove imported legacy JSON/JSONL artifacts explicitly
-smart_summary({ action: "cleanup_legacy", apply: true })
+# Verify features
+npm run verify
 ```
-### `smart_metrics`
+## Storage
-Inspect token metrics recorded in project-local SQLite storage without leaving MCP.
+Data stored in `.devctx/`:
+- `index.json` - Symbol index
+- `state.sqlite` - Task checkpoints, metrics, patterns (Node 22+)
+- `metrics.jsonl` - Legacy fallback (Node 18-20)
-- Returns aggregated totals, savings percentage, and per-tool breakdowns
-- Supports `window`: `24h` | `7d` | `30d` | `all`
-- Supports filtering by `tool`
-- Supports filtering by `sessionId`, including `sessionId: "active"`
-- Includes `latestEntries` so an agent can explain recent savings without parsing storage manually
-- Includes `overheadTokens` and `overheadTools` so hook/wrapper context cost stays measurable against the savings
-- When `.devctx/state.sqlite` is tracked or staged, metric writes are skipped and reads fall back to a temporary read-only snapshot with `sideEffectsSuppressed: true`
-**Example workflow:**
-```javascript
-smart_metrics({ window: "7d", sessionId: "active" })
-// → totals and recent entries for the current task/session
+Add to `.gitignore`:
 ```
-### `smart_turn`
-Orchestrate the start or end of a meaningful agent turn with one MCP call.
-- `phase: "start"` rehydrates context, classifies whether the current prompt aligns with persisted work, and can auto-create a planning session for a substantial new task
-- `phase: "end"` writes a checkpoint through `smart_summary` and can optionally include compact metrics
-- Designed to make context usage almost mandatory without forcing the agent to chain `smart_summary(get)` and `smart_summary(checkpoint)` manually on every turn
-- Claude Code can invoke this automatically through generated native hooks on `SessionStart`, `UserPromptSubmit`, `PostToolUse`, and `Stop`
-- Non-Claude CLI clients can approximate the same flow with `smart-context-headless`, which wraps one headless agent invocation around `smart_turn(start)` and `smart_turn(end)`
-**Example workflow:**
-```javascript
-smart_turn({
-  phase: "start",
-  prompt: "Finish runtime repo-safety enforcement for smart metrics",
-  ensureSession: true
-})
-// → summary + continuity classification + repoSafety
-smart_turn({
-  phase: "end",
-  event: "milestone",
-  update: {
-    completed: ["Finished smart metrics repo-safety enforcement"],
-    nextStep: "Update docs and run the full suite"
-  }
-})
-// → checkpoint result + optional compact metrics
+.devctx/
 ```
-### `build_index`
+## Requirements
-- Builds a lightweight symbol index for the project (functions, classes, methods, types, etc.)
-- Supports JS/TS (via TypeScript AST), Python, Go, Rust, Java, C#, Kotlin, PHP, Swift
-- Extracts imports/exports and builds a dependency graph with `import` and `testOf` edges
-- Test files are linked to source files via import analysis and naming conventions
-- Index stored per-project in `.devctx/index.json`, invalidated by file mtime
-- Each symbol includes a condensed `signature` (one line, max 200 chars) and a short `snippet` preview so agents can inspect likely definitions without opening files
-- Accelerates `smart_search` (symbol + graph ranking) and `smart_read` symbol mode (line hints)
-- Pass `incremental=true` to only reindex files with changed mtime — much faster for large repos (10k+ files). Falls back to full rebuild if no prior index exists.
-- Incremental response includes `reindexed`, `removed`, `unchanged` counts
-- Run once after checkout or when many files changed; not required but recommended for large projects
+- Node.js 18+ (22+ for SQLite features)
+- Git (for diff and blame features)
-### `smart_shell`
+## Security
-- Runs only allowlisted diagnostic commands
-- Executes from the effective project root
-- Blocks shell operators and unsafe commands by design
+This MCP is **secure by default**:
-## Evaluations (repo development only)
+- ✅ **Allowlist-only commands** - Only safe diagnostic commands (`ls`, `git status`, `npm test`, etc.)
+- ✅ **No shell operators** - Blocks `|`, `&`, `;`, `>`, `<`, `` ` ``, `$()`
+- ✅ **Path validation** - Cannot escape project root
+- ✅ **No write access** - Cannot modify your code
+- ✅ **Repository safety** - Prevents accidental commit of local state
+- ✅ **Resource limits** - 15s timeout, 10MB buffer
-The eval harness and corpora are available in the [source repository](https://github.com/Arrayo/devctx-mcp-mvp) but are **not included in the npm package**. Clone the repo to run evaluations.
+**Configuration:**
 ```bash
-cd tools/devctx
-npm run eval
-npm run eval -- --baseline
-npm run eval:self
-npm run eval:context
-npm run eval:both
-npm run eval:report
-```
+# Disable shell execution entirely
+export DEVCTX_SHELL_DISABLED=true
-Commands:
-- `eval` — synthetic corpus with index + intent
-- `eval -- --baseline` — baseline without index/intent
-- `eval:self` — self-eval against the real devctx repo
-- `eval:context` — evaluate smart_context alongside search
-- `eval:both` — search + context evaluation
-- `eval:report` — scorecard with delta vs baseline
-The harness supports `--root=` and `--corpus=` for evaluating against any repo with custom task corpora. Use `--tool=search|context|both` to control which tools are evaluated. When `--tool=context`, pass/fail is determined by `smart_context` precision; when `--tool=both`, both search and context must pass.
+# Disable cache warming
+export DEVCTX_CACHE_WARMING=false
+```
-Metrics include: P@5, P@10, Recall, wrong-file rate, retrieval honesty, follow-up reads, tokens-to-success, latency p50/p95, confidence calibration (accuracy, over-confident rate, under-confident rate), and smart_context metrics when applicable. smart_context reporting now includes precision, explanation coverage (`reasonIncluded` + `evidence`), preview coverage (`symbolPreviews`), and preview symbol recall. Token metrics (`totalTokens`) reflect the full JSON payload, not just content blocks.
+See [SECURITY.md](https://github.com/Arrayo/smart-context-mcp/blob/main/SECURITY.md) for complete security documentation.
-## Notes
+## Documentation
-- `@vscode/ripgrep` provides a bundled `rg` binary, so a system install is not required.
-- Persistent context and metrics live in `<projectRoot>/.devctx/state.sqlite`.
-- `DEVCTX_METRICS_FILE` is now an explicit compatibility override for JSONL-based workflows and reports.
-- Symbol index stored in `<projectRoot>/.devctx/index.json` when `build_index` is used.
-- Legacy session JSON files in `<projectRoot>/.devctx/sessions/` are imported idempotently when present.
-- This package is a navigation and diagnostics layer, not a full semantic code intelligence system.
+Full documentation in [GitHub repository](https://github.com/Arrayo/smart-context-mcp):
-## Repository
+- [Streaming Progress](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/streaming.md) - Progress notifications
+- [Context Prediction](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/context-prediction.md) - File prediction
+- [Diff-Aware Context](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/diff-aware.md) - Change analysis
+- [Cache Warming](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/cache-warming.md) - Cold-start optimization
+- [Git Blame](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/git-blame.md) - Code attribution
+- [Cross-Project Context](https://github.com/Arrayo/smart-context-mcp/blob/main/docs/features/cross-project.md) - Multi-project support
-Source repository and full project documentation:
+## Links
-- https://github.com/Arrayo/devctx-mcp-mvp
+- [GitHub](https://github.com/Arrayo/smart-context-mcp)
+- [npm](https://www.npmjs.com/package/smart-context-mcp)
+- [Issues](https://github.com/Arrayo/smart-context-mcp/issues)
 ## Author
 **Francisco Caballero Portero**
-Email: fcp1978@hotmail.com
-GitHub: [@Arrayo](https://github.com/Arrayo)
+fcp1978@hotmail.com
+[@Arrayo](https://github.com/Arrayo)
 ## License
-MIT License - see [LICENSE](LICENSE) file for details.
+MIT