context-mode 0.5.0 → 0.5.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8)

package/.claude-plugin/marketplace.json +2 -2
package/.claude-plugin/plugin.json +2 -2
package/README.md +70 -72
package/build/executor.js +9 -9
package/build/server.js +82 -27
package/build/store.d.ts +1 -0
package/build/store.js +68 -5
package/package.json +2 -2

package/.claude-plugin/marketplace.json CHANGED

@@ -12,8 +12,8 @@
     {
       "name": "context-mode",
       "source": "./",
-      "description": "Claude Code MCP plugin that saves 94% of your context window. Sandboxed code execution in 10 languages, FTS5 knowledge base with BM25 ranking, and smart truncation.",
-      "version": "0.5.0",
+      "description": "Claude Code MCP plugin that saves 98% of your context window. Sandboxed code execution in 10 languages, FTS5 knowledge base with BM25 ranking, and intent-driven search.",
+      "version": "0.5.3",
       "author": {
         "name": "Mert Koseoğlu"
       },

package/.claude-plugin/plugin.json CHANGED

@@ -1,7 +1,7 @@
 {
   "name": "context-mode",
-  "version": "0.5.0",
-  "description": "Claude Code MCP plugin that saves 94% of your context window. Sandboxed code execution in 10 languages, FTS5 knowledge base with BM25 ranking, and smart truncation.",
+  "version": "0.5.3",
+  "description": "Claude Code MCP plugin that saves 98% of your context window. Sandboxed code execution in 10 languages, FTS5 knowledge base with BM25 ranking, and intent-driven search.",
   "author": {
     "name": "Mert Koseoğlu",
     "url": "https://github.com/mksglu"

package/README.md CHANGED

@@ -1,6 +1,6 @@
 # Context Mode
-**Claude Code MCP plugin that saves 94% of your context window.**
+**Claude Code MCP plugin that saves 98% of your context window.**
 Every tool call in Claude Code consumes context tokens. A single Playwright snapshot burns 10K-135K tokens. A Context7 docs lookup dumps 4K-10K tokens. GitHub's `list_commits` with 30 results costs 29K-64K tokens. With 5+ MCP servers active, you lose ~55K tokens before your first message — and after 30 minutes of real debugging, responses slow to a crawl.
@@ -12,14 +12,14 @@ Claude Code has a 200K token context window. Here's how fast popular MCP servers
 | MCP Server | Tool | Without Context Mode | With Context Mode | Savings | Source |
 |---|---|---|---|---|---|
-| **Playwright** | `browser_snapshot` | 10K-135K tokens | ~20 tokens | **99%** | [playwright-mcp#1233](https://github.com/microsoft/playwright-mcp/issues/1233) |
-| **Context7** | `query-docs` | 4K-10K tokens | ~70 tokens | **98%** | [upstash/context7](https://github.com/upstash/context7) |
-| **GitHub** | `list_commits` (30) | 29K-64K tokens | ~10 tokens | **99%** | [github-mcp-server#142](https://github.com/github/github-mcp-server/issues/142) |
-| **Sentry** | issue analysis | 5K-30K tokens | ~25 tokens | **99%** | [getsentry/sentry-mcp](https://github.com/getsentry/sentry-mcp) |
-| **Supabase** | schema queries | 2K-30K tokens | ~30 tokens | **99%** | [supabase-community/supabase-mcp](https://github.com/supabase-community/supabase-mcp) |
-| **Firecrawl** | `scrape` / `crawl` | 5K-50K+ tokens | ~70 tokens | **99%** | [firecrawl](https://github.com/mendableai/firecrawl) |
-| **Chrome DevTools** | DOM / network | 5K-50K+ tokens | ~25 tokens | **99%** | Community benchmark |
-| **Fetch** | `fetch` | 5K-50K tokens | ~70 tokens | **99%** | Official reference server |
+| **Playwright** | `browser_snapshot` | 10K-135K tokens | ~75 tokens | **99%** | [playwright-mcp#1233](https://github.com/microsoft/playwright-mcp/issues/1233) |
+| **Context7** | `query-docs` | 4K-10K tokens | ~65 tokens | **98%** | [upstash/context7](https://github.com/upstash/context7) |
+| **GitHub** | `list_commits` (30) | 29K-64K tokens | ~180 tokens | **99%** | [github-mcp-server#142](https://github.com/github/github-mcp-server/issues/142) |
+| **Sentry** | issue analysis | 5K-30K tokens | ~85 tokens | **99%** | [getsentry/sentry-mcp](https://github.com/getsentry/sentry-mcp) |
+| **Supabase** | schema queries | 2K-30K tokens | ~80 tokens | **99%** | [supabase-community/supabase-mcp](https://github.com/supabase-community/supabase-mcp) |
+| **Firecrawl** | `scrape` / `crawl` | 5K-50K+ tokens | ~65 tokens | **99%** | [firecrawl](https://github.com/mendableai/firecrawl) |
+| **Chrome DevTools** | DOM / network | 5K-50K+ tokens | ~75 tokens | **99%** | Community benchmark |
+| **Fetch** | `fetch` | 5K-50K tokens | ~65 tokens | **99%** | Official reference server |
 **Real measurement** ([Scott Spence, 2025](https://scottspence.com/posts/optimising-mcp-server-context-usage-in-claude-code)): With 81+ MCP tools enabled across multiple servers, **143K of 200K tokens (72%) consumed** — 82K tokens just for MCP tool definitions. Only 28% left for actual work.
@@ -29,15 +29,15 @@ Claude Code has a 200K token context window. Here's how fast popular MCP servers
 | What you're doing | Without Context Mode | With Context Mode | Savings |
 |---|---|---|---|
-| Playwright `browser_snapshot` | 12 KB into context | 50 B summary | **99%** |
-| Context7 `query-docs` (React) | 60 KB raw docs | 285 B search result | **99%** |
-| `gh pr list` / `gh api` | 8 KB JSON response | 40 B summary | **99%** |
-| Read `access.log` (500 req) | 45 KB raw log | 71 B status breakdown | **99%** |
-| `npm test` (30 suites) | 6 KB raw output | 37 B pass/fail | **99%** |
-| Git log (153 commits) | 12 KB raw log | 18 B summary | **99%** |
-| Supabase Edge Functions docs | 4 KB raw docs | 123 B code example | **97%** |
+| Playwright `browser_snapshot` | 56 KB into context | 299 B summary | **99%** |
+| Context7 `query-docs` (React) | 5.9 KB raw docs | 261 B summary | **96%** |
+| GitHub issues (20) | 59 KB JSON response | 1.1 KB summary | **98%** |
+| Read `access.log` (500 req) | 45 KB raw log | 155 B status breakdown | **100%** |
+| `vitest` (30 suites) | 6 KB raw output | 337 B pass/fail | **95%** |
+| Git log (153 commits) | 12 KB raw log | 107 B summary | **99%** |
+| Analytics CSV (500 rows) | 86 KB raw data | 222 B summary | **100%** |
-**Real aggregate across 13 scenarios: 194 KB raw → 12.6 KB context (94% savings)**
+**Real aggregate across 14 scenarios: 315 KB raw → 5.4 KB context (98% savings)**
 ## Quick Start
@@ -75,7 +75,7 @@ Claude calls: execute({ language: "shell", code: "gh pr list --json title,state
 Returns: "3"                  ← 2 bytes instead of 8KB JSON
 ```
-**Intent-driven search** (v0.5.0): When you provide an `intent` parameter and output exceeds 5KB, Context Mode uses BM25 search to return only the relevant sections — instead of blind head/tail truncation.
+**Intent-driven search** (v0.5.2): When you provide an `intent` parameter and output exceeds 5KB, Context Mode uses score-based BM25 search to return only the relevant sections matching your intent.
 ```
 Claude calls: execute({
@@ -83,9 +83,12 @@ Claude calls: execute({
   code: "cat /var/log/app.log",
   intent: "connection refused database error"
 })
-Returns: only the 3 matching log sections (1.5KB) ← instead of 100KB truncated log
+Returns: section titles + searchable terms (500B) ← instead of 100KB raw log
 ```
+When intent search runs, the response includes `Searchable terms` — distinctive vocabulary
+extracted from the output via IDF scoring. Use these terms for targeted follow-up `search()` calls.
 Authenticated CLIs work out of the box — `gh`, `aws`, `gcloud`, `kubectl`, `docker` credentials are passed through securely. Bun auto-detected for 3-5x faster JS/TS.
 ### `execute_file` — Process Files Without Loading
@@ -132,12 +135,12 @@ Use instead of WebFetch or Context7 when you need documentation — index once,
 ┌──────────────────────────────────────────────────────────────────┐
 │ Without Context Mode                                             │
 │                                                                  │
-│ Claude Code → Playwright snapshot → 12KB into context            │
-│ Claude Code → Context7 docs      → 60KB into context             │
-│ Claude Code → gh pr list         →  8KB into context             │
+│ Claude Code → Playwright snapshot → 56KB into context            │
+│ Claude Code → Context7 docs      →  6KB into context             │
+│ Claude Code → gh pr list         →  6KB into context             │
 │ Claude Code → cat access.log     → 45KB into context             │
 │                                                                  │
-│ Total: 125KB consumed = ~32,000 tokens = 16% of context gone     │
+│ Total: 113KB consumed = ~29,000 tokens = 14% of context gone     │
 └──────────────────────────────────────────────────────────────────┘
 ┌──────────────────────────────────────────────────────────────────┐
@@ -145,10 +148,10 @@ Use instead of WebFetch or Context7 when you need documentation — index once,
 │                                                                  │
 │ Claude Code → fetch_and_index(url)  → "Indexed 8 sections" (50B)│
 │ Claude Code → search("snapshot")    → exact element       (500B) │
-│ Claude Code → execute("gh pr list") → "3 open PRs"         (40B)│
-│ Claude Code → execute_file(log)     → "500:14, 404:89"     (30B)│
+│ Claude Code → execute("gh pr list") → "5 PRs, +59 -0"    (719B)│
+│ Claude Code → execute_file(log)     → "500:13, 404:13"    (155B)│
 │                                                                  │
-│ Total: 620B consumed = ~160 tokens = 0.08% of context            │
+│ Total: 1.4KB consumed = ~350 tokens = 0.18% of context           │
 └──────────────────────────────────────────────────────────────────┘
 ```
@@ -163,7 +166,7 @@ Use instead of WebFetch or Context7 when you need documentation — index once,
                                          │  │ • 10 language runtimes │  │
                                          │  │ • Sandboxed subprocess │  │
                                          │  │ • Auth passthrough     │  │
-                                         │  │ • Smart truncation     │  │
+                                         │  │ • Intent-driven search │  │
                                          │  └────────────────────────┘  │
                                          │                              │
                                          │  ┌────────────────────────┐  │
@@ -172,6 +175,7 @@ Use instead of WebFetch or Context7 when you need documentation — index once,
                                          │  │ • BM25 ranking         │  │
                                          │  │ • Porter stemming      │  │
                                          │  │ • Heading-aware chunks │  │
+                                         │  │ • Vocabulary hints     │  │
                                          │  └────────────────────────┘  │
                                          └──────────────────────────────┘
 ```
@@ -213,42 +217,36 @@ ORDER BY rank LIMIT 3;
 **Lazy singleton:** Database created only when `index` or `search` is first called — zero overhead for sessions that don't use it.
-### Smart Truncation
-When subprocess output exceeds the 100KB buffer, Context Mode preserves both head and tail:
-```
-Head (60%): Initial output with context
-... [47 lines / 3.2KB truncated — showing first 12 + last 8 lines] ...
-Tail (40%): Final output with errors/results
-```
+### Intent-Driven Search (v0.5.2)
-Line-boundary snapping — never cuts mid-line. Error messages at the bottom are always preserved.
+When `execute` or `execute_file` is called with an `intent` parameter and output exceeds 5KB, Context Mode uses score-based BM25 search to return only the relevant sections:
-### Intent-Driven Search (v0.5.0)
-When `execute` or `execute_file` is called with an `intent` parameter and output exceeds 5KB, Context Mode replaces blind truncation with intelligent BM25 search:
+- **Score-based search**: Searches ALL intent words independently, ranks chunks by match count
+- **Searchable terms**: Distinctive vocabulary hints extracted via IDF scoring, helping you craft precise follow-up `search()` calls
+- **Smarter chunk titles**: Uses the first content line of each chunk instead of generic "Section N" labels
 ```
-Traditional truncation:
-  stdout (100KB) → head(60%) + tail(40%) → ~100KB in context
-  Problem: relevant info in the middle is lost
+Without intent:
+  stdout (100KB) → full output enters context
-Intent-driven search:
-  stdout (100KB) → chunk by lines → in-memory FTS5 → search(intent) → 2-5KB relevant sections
-  Result: only what you need enters context
+With intent:
+  stdout (100KB) → chunk by lines → in-memory FTS5 → score all intent words → top chunks + searchable terms
+  Result: only what you need enters context, plus vocabulary for targeted follow-ups
 ```
-Tested across 4 real-world scenarios:
+**31% to 100% recall on real-world CHANGELOG test** — the score-based approach finds every relevant section, not just those matching a single query string.
-| Scenario | Smart Truncation | Intent Search | Intent Size | Truncation Size |
-|---|---|---|---|---|
-| Server log error (line 347/500) | **missed** | **found** | 1.5 KB | 5.0 KB |
-| 3 test failures among 200 tests | found 2/3 | **found 3/3** | 2.4 KB | 5.0 KB |
-| 2 build warnings among 300 lines | **missed both** | **found both** | 2.1 KB | 5.0 KB |
-| API auth error (line 743/1000) | **missed** | **found** | 1.2 KB | 4.9 KB |
+Tested across 5 real-world scenarios:
+| Scenario | Without Intent | With Intent | Size Reduction |
+|---|---|---|---|
+| Server log error (line 347/500) | error lost in output | **found** | 1.5 KB vs 5.0 KB |
+| 3 test failures among 200 tests | only 2/3 visible | **all 3 found** | 2.4 KB vs 5.0 KB |
+| 2 build warnings among 300 lines | both lost in output | **both found** | 2.1 KB vs 5.0 KB |
+| API auth error (line 743/1000) | error lost in output | **found** | 1.2 KB vs 4.9 KB |
+| Semantic gap (CHANGELOG search) | 31% recall | **100% recall** | Full coverage |
-Smart truncation fails on 3 of 4 scenarios because relevant content is in the dropped middle section. Intent search finds the target every time while using 50-75% fewer bytes.
+Intent search finds the target every time while using 50-75% fewer bytes.
 ### HTML to Markdown Conversion
@@ -269,17 +267,17 @@ Tested with tools from popular MCP servers and Claude Code workflows:
 | Scenario | Tool | Raw | Context | Savings |
 |---|---|---|---|---|
-| Playwright page snapshot | `execute_file` | 50+ KB | 78 B | **99%** |
-| Context7 React docs | `index + search` | 5.9 KB | 285 B | **95%** |
-| Context7 Supabase docs | `index + search` | 3.9 KB | 123 B | **97%** |
-| Context7 Next.js docs | `index + search` | 6.5 KB | 273 B | **96%** |
-| httpbin.org API docs | `fetch_and_index` | 9.4 KB | 50 B | **99%** |
-| GitHub API response | `execute` | 8+ KB | 40 B | **99%** |
-| Access log (500 req) | `execute_file` | 45.1 KB | 71 B | **100%** |
-| Analytics CSV (500 rows) | `execute_file` | 85.5 KB | 11.5 KB | **87%** |
-| MCP tools manifest (40 tools) | `execute_file` | 17.0 KB | 78 B | **100%** |
-| npm test (30 suites) | `execute_file` | 6.0 KB | 37 B | **99%** |
-| Git log (153 commits) | `execute` | 11.6 KB | 18 B | **100%** |
+| Playwright page snapshot | `execute` | 56.2 KB | 299 B | **99%** |
+| Context7 React docs | `execute` | 5.9 KB | 261 B | **96%** |
+| Context7 Next.js docs | `execute` | 6.5 KB | 249 B | **96%** |
+| Context7 Tailwind docs | `execute` | 4.0 KB | 186 B | **95%** |
+| GitHub Issues (20) | `execute` | 58.9 KB | 1.1 KB | **98%** |
+| GitHub PR list (5) | `execute` | 6.4 KB | 719 B | **89%** |
+| Access log (500 req) | `execute_file` | 45.1 KB | 155 B | **100%** |
+| Analytics CSV (500 rows) | `execute_file` | 85.5 KB | 222 B | **100%** |
+| MCP tools manifest (40 tools) | `execute_file` | 17.0 KB | 742 B | **96%** |
+| Test output (30 suites) | `execute` | 6.0 KB | 337 B | **95%** |
+| Git log (153 commits) | `execute` | 11.6 KB | 107 B | **99%** |
 ### Session Impact
@@ -287,9 +285,9 @@ Typical 45-minute debugging session:
 | Metric | Without | With | Delta |
 |---|---|---|---|
-| Context consumed | 177 KB | 10 KB | **-94%** |
-| Tokens used | ~45,300 | ~2,600 | **-94%** |
-| Context remaining | 77% | 95% | **+18pp** |
+| Context consumed | 315 KB | 5.4 KB | **-98%** |
+| Tokens used | ~80,600 | ~1,400 | **-98%** |
+| Context remaining | 60% | 99% | **+39pp** |
 | Time before slowdown | ~30 min | ~3 hours | **+6x** |
 ## Tool Decision Matrix
@@ -388,13 +386,13 @@ Just ask naturally — Claude automatically routes through Context Mode when it
 ## Test Suite
-99+ tests across 4 suites:
+100+ tests across 4 suites:
 | Suite | Tests | Coverage |
 |---|---|---|
-| Executor | 55 | 10 languages, sandbox, truncation, concurrency, timeouts |
+| Executor | 55 | 10 languages, sandbox, output handling, concurrency, timeouts |
 | ContentStore | 40 | FTS5 schema, BM25 ranking, chunking, stemming, plain text indexing |
-| Intent Search | 4 | Smart truncation vs intent-driven search across 4 real-world scenarios |
+| Intent Search | 5 | Intent-driven search across 5 real-world scenarios (incl. semantic gap) |
 | MCP Integration | 24 | JSON-RPC protocol, all 5 tools, fetch_and_index, errors |
 ## Development
@@ -406,7 +404,7 @@ npm install
 npm run build
 npm test              # executor (55 tests)
 npm run test:store    # FTS5/BM25 (40 tests)
-npm run test:all      # all suites (99+ tests)
+npm run test:all      # all suites (100+ tests)
 ```
 ## License

package/build/executor.js CHANGED

@@ -232,23 +232,23 @@ export class PolyglotExecutor {
         switch (language) {
             case "javascript":
             case "typescript":
-                return `const FILE_CONTENT = require("fs").readFileSync(${escaped}, "utf-8");\n${code}`;
+                return `const FILE_CONTENT_PATH = ${escaped};\nconst FILE_CONTENT = require("fs").readFileSync(FILE_CONTENT_PATH, "utf-8");\n${code}`;
             case "python":
-                return `with open(${escaped}, "r") as _f:\n    FILE_CONTENT = _f.read()\n${code}`;
+                return `FILE_CONTENT_PATH = ${escaped}\nwith open(FILE_CONTENT_PATH, "r") as _f:\n    FILE_CONTENT = _f.read()\n${code}`;
             case "shell":
-                return `FILE_CONTENT=$(cat ${escaped})\n${code}`;
+                return `FILE_CONTENT_PATH=${escaped}\nFILE_CONTENT=$(cat ${escaped})\n${code}`;
             case "ruby":
-                return `FILE_CONTENT = File.read(${escaped})\n${code}`;
+                return `FILE_CONTENT_PATH = ${escaped}\nFILE_CONTENT = File.read(FILE_CONTENT_PATH)\n${code}`;
             case "go":
-                return `package main\n\nimport (\n\t"fmt"\n\t"os"\n)\n\nfunc main() {\n\tb, _ := os.ReadFile(${escaped})\n\tFILE_CONTENT := string(b)\n\t_ = FILE_CONTENT\n${code}\n}\n`;
+                return `package main\n\nimport (\n\t"fmt"\n\t"os"\n)\n\nvar FILE_CONTENT_PATH = ${escaped}\n\nfunc main() {\n\tb, _ := os.ReadFile(FILE_CONTENT_PATH)\n\tFILE_CONTENT := string(b)\n\t_ = FILE_CONTENT\n\t_ = fmt.Sprint()\n${code}\n}\n`;
             case "rust":
-                return `use std::fs;\n\nfn main() {\n    let file_content = fs::read_to_string(${escaped}).unwrap();\n${code}\n}\n`;
+                return `use std::fs;\n\nfn main() {\n    let file_content_path = ${escaped};\n    let file_content = fs::read_to_string(file_content_path).unwrap();\n${code}\n}\n`;
             case "php":
-                return `<?php\n$FILE_CONTENT = file_get_contents(${escaped});\n${code}`;
+                return `<?php\n$FILE_CONTENT_PATH = ${escaped};\n$FILE_CONTENT = file_get_contents($FILE_CONTENT_PATH);\n${code}`;
             case "perl":
-                return `open(my $fh, '<', ${escaped}) or die "Cannot open: $!";\nmy $FILE_CONTENT = do { local $/; <$fh> };\nclose($fh);\n${code}`;
+                return `my $FILE_CONTENT_PATH = ${escaped};\nopen(my $fh, '<', $FILE_CONTENT_PATH) or die "Cannot open: $!";\nmy $FILE_CONTENT = do { local $/; <$fh> };\nclose($fh);\n${code}`;
             case "r":
-                return `FILE_CONTENT <- readLines(${escaped}, warn=FALSE)\nFILE_CONTENT <- paste(FILE_CONTENT, collapse="\\n")\n${code}`;
+                return `FILE_CONTENT_PATH <- ${escaped}\nFILE_CONTENT <- readLines(FILE_CONTENT_PATH, warn=FALSE)\nFILE_CONTENT <- paste(FILE_CONTENT, collapse="\\n")\n${code}`;
         }
     }
 }
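
Note on the hunk above: each language wrapper now defines a `FILE_CONTENT_PATH` variable ahead of `FILE_CONTENT`, so user code can refer to the file's path as well as its contents. The sketch below reproduces only the JavaScript case. The helper name `wrapJavaScript` is hypothetical, and `JSON.stringify` is an assumption about how `escaped` is produced, since the diff does not show that line.

```javascript
// Hypothetical helper mirroring the "javascript" branch of the diff above.
function wrapJavaScript(filePath, code) {
  // Assumption: `escaped` is a quoted string literal; the diff does not show how it is built.
  const escaped = JSON.stringify(filePath);
  return `const FILE_CONTENT_PATH = ${escaped};\nconst FILE_CONTENT = require("fs").readFileSync(FILE_CONTENT_PATH, "utf-8");\n${code}`;
}

// User code can now reference both the path and the content.
const script = wrapJavaScript(
  "/tmp/access.log",
  "console.log(FILE_CONTENT_PATH, FILE_CONTENT.length);",
);
console.log(script);
```

The generated script is only printed here, not executed, so the example does not need the file to exist.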

package/build/server.js CHANGED

@@ -5,11 +5,12 @@ import { z } from "zod";
 import { PolyglotExecutor } from "./executor.js";
 import { ContentStore } from "./store.js";
 import { detectRuntimes, getRuntimeSummary, getAvailableLanguages, hasBunRuntime, } from "./runtime.js";
+const VERSION = "0.5.3";
 const runtimes = detectRuntimes();
 const available = getAvailableLanguages(runtimes);
 const server = new McpServer({
     name: "context-mode",
-    version: "0.5.0",
+    version: VERSION,
 });
 const executor = new PolyglotExecutor({ runtimes });
 // Lazy singleton — no DB overhead unless index/search is used
@@ -57,8 +58,9 @@ server.registerTool("execute", {
             .string()
             .optional()
             .describe("What you're looking for in the output. When provided and output is large (>5KB), " +
-            "returns only matching sections via BM25 search instead of truncated output. " +
-            "Example: 'find failing tests', 'HTTP 500 errors', 'memory usage statistics'."),
+            "indexes output into knowledge base and returns section titles + previews — not full content. " +
+            "Use search() to retrieve specific sections. Example: 'failing tests', 'HTTP 500 errors'." +
+            "\n\nTIP: Use specific technical terms, not just concepts. Check 'Searchable terms' in the response for available vocabulary."),
     }),
 }, async ({ language, code, timeout, intent }) => {
     try {
@@ -79,7 +81,7 @@ server.registerTool("execute", {
             if (intent && intent.trim().length > 0 && Buffer.byteLength(output) > INTENT_SEARCH_THRESHOLD) {
                 return {
                     content: [
-                        { type: "text", text: intentSearch(output, intent) },
+                        { type: "text", text: intentSearch(output, intent, `execute:${language}:error`) },
                     ],
                     isError: true,
                 };
@@ -96,7 +98,7 @@ server.registerTool("execute", {
         if (intent && intent.trim().length > 0 && Buffer.byteLength(stdout) > INTENT_SEARCH_THRESHOLD) {
             return {
                 content: [
-                    { type: "text", text: intentSearch(stdout, intent) },
+                    { type: "text", text: intentSearch(stdout, intent, `execute:${language}`) },
                 ],
             };
         }
@@ -135,30 +137,79 @@ function indexStdout(stdout, source) {
 // Helper: intent-driven search on execution output
 // ─────────────────────────────────────────────────────────
 const INTENT_SEARCH_THRESHOLD = 5_000; // bytes — ~80-100 lines
-function intentSearch(stdout, intent, maxResults = 5) {
-    const store = new ContentStore(":memory:");
+function intentSearch(stdout, intent, source, maxResults = 5) {
+    const totalLines = stdout.split("\n").length;
+    const totalBytes = Buffer.byteLength(stdout);
+    // Index into the PERSISTENT store so user can search() later
+    const persistent = getStore();
+    const indexed = persistent.indexPlainText(stdout, source);
+    // Search with an ephemeral store to find matching section titles
+    const ephemeral = new ContentStore(":memory:");
     try {
-        const totalLines = stdout.split("\n").length;
-        const totalBytes = Buffer.byteLength(stdout);
-        store.indexPlainText(stdout, "exec-output");
-        const results = store.search(intent, maxResults);
+        ephemeral.indexPlainText(stdout, source);
+        let results = ephemeral.search(intent, maxResults);
+        // Score-based relaxed search: search ALL words, rank by match count
         if (results.length === 0) {
-            return (`[Intent search: no matches for "${intent}" in ${totalLines}-line output. Returning full output.]\n\n` +
-                stdout);
+            const words = intent.trim().split(/\s+/).filter(w => w.length > 2).slice(0, 20);
+            if (words.length > 0) {
+                const sectionScores = new Map();
+                for (const word of words) {
+                    const wordResults = ephemeral.search(word, 10);
+                    for (const r of wordResults) {
+                        const existing = sectionScores.get(r.title);
+                        if (existing) {
+                            existing.score += 1;
+                            if (r.rank < existing.bestRank) {
+                                existing.bestRank = r.rank;
+                                existing.result = r;
+                            }
+                        }
+                        else {
+                            sectionScores.set(r.title, { result: r, score: 1, bestRank: r.rank });
+                        }
+                    }
+                }
+                results = Array.from(sectionScores.values())
+                    .sort((a, b) => b.score - a.score || a.bestRank - b.bestRank)
+                    .slice(0, maxResults)
+                    .map(s => s.result);
+            }
         }
-        const totalChunks = store.getStats().chunks;
-        const header = `[Intent search: ${results.length} of ${totalChunks} sections matched "${intent}" from ${totalLines}-line output (${(totalBytes / 1024).toFixed(1)}KB)]`;
-        const formatted = results
-            .map((r, i) => {
-            const matchLabel = i === 0 ? " (best match)" : "";
-            return `--- ${r.title}${matchLabel} ---\n${r.content}`;
-        })
-            .join("\n\n");
-        const footer = `[Full output: ${totalLines} lines / ${(totalBytes / 1024).toFixed(1)}KB. Re-run without intent to see raw output.]`;
-        return `${header}\n\n${formatted}\n\n${footer}`;
+        // Extract distinctive terms as vocabulary hints for the LLM
+        const distinctiveTerms = persistent.getDistinctiveTerms(indexed.sourceId);
+        if (results.length === 0) {
+            const lines = [
+                `Indexed ${indexed.totalChunks} sections from "${source}" into knowledge base.`,
+                `No sections matched intent "${intent}" in ${totalLines}-line output (${(totalBytes / 1024).toFixed(1)}KB).`,
+            ];
+            if (distinctiveTerms.length > 0) {
+                lines.push("");
+                lines.push(`Searchable terms: ${distinctiveTerms.join(", ")}`);
+            }
+            lines.push("");
+            lines.push("Use search() to explore the indexed content.");
+            return lines.join("\n");
+        }
+        // Return ONLY titles + first-line previews — not full content
+        const lines = [
+            `Indexed ${indexed.totalChunks} sections from "${source}" into knowledge base.`,
+            `${results.length} sections matched "${intent}" (${totalLines} lines, ${(totalBytes / 1024).toFixed(1)}KB):`,
+            "",
+        ];
+        for (const r of results) {
+            const preview = r.content.split("\n")[0].slice(0, 120);
+            lines.push(`  - ${r.title}: ${preview}`);
+        }
+        if (distinctiveTerms.length > 0) {
+            lines.push("");
+            lines.push(`Searchable terms: ${distinctiveTerms.join(", ")}`);
+        }
+        lines.push("");
+        lines.push("Use search() to retrieve full content of any section.");
+        return lines.join("\n");
     }
     finally {
-        store.close();
+        ephemeral.close();
     }
 }
 // ─────────────────────────────────────────────────────────
@@ -223,7 +274,7 @@ server.registerTool("execute_file", {
             if (intent && intent.trim().length > 0 && Buffer.byteLength(output) > INTENT_SEARCH_THRESHOLD) {
                 return {
                     content: [
-                        { type: "text", text: intentSearch(output, intent) },
+                        { type: "text", text: intentSearch(output, intent, `file:${path}:error`) },
                     ],
                     isError: true,
                 };
@@ -239,7 +290,7 @@ server.registerTool("execute_file", {
         if (intent && intent.trim().length > 0 && Buffer.byteLength(stdout) > INTENT_SEARCH_THRESHOLD) {
             return {
                 content: [
-                    { type: "text", text: intentSearch(stdout, intent) },
+                    { type: "text", text: intentSearch(stdout, intent, `file:${path}`) },
                 ],
             };
         }
@@ -337,6 +388,10 @@ server.registerTool("search", {
         "- Look up API signatures ('Supabase RLS policy syntax')\n" +
         "- Get configuration details ('Tailwind responsive breakpoints')\n" +
         "- Find migration steps ('App Router data fetching')\n\n" +
+        "SEARCH TIPS:\n" +
+        "- Use specific technical terms, not concepts ('__proto__' not 'security')\n" +
+        "- Check 'Searchable terms' from execute/execute_file results for available vocabulary\n" +
+        "- Combine multiple specific terms for better results\n\n" +
         "Returns exact content — not summaries. Each result includes heading hierarchy and full section text.",
     inputSchema: z.object({
         query: z.string().describe("Natural language search query"),
@@ -514,7 +569,7 @@ server.registerTool("fetch_and_index", {
 async function main() {
     const transport = new StdioServerTransport();
     await server.connect(transport);
-    console.error("Context Mode MCP server v0.4.0 running on stdio");
+    console.error(`Context Mode MCP server v${VERSION} running on stdio`);
     console.error(`Detected runtimes:\n${getRuntimeSummary(runtimes)}`);
     if (!hasBunRuntime()) {
         console.error("\nPerformance tip: Install Bun for 3-5x faster JS/TS execution");
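
Note on the `intentSearch` hunk above: when the full intent string matches nothing, the new code searches each intent word separately, counts how many words hit each section title, and breaks ties by the best (lowest) rank. A minimal standalone sketch of that merge step follows. The `searchWord` callback and the toy index are hypothetical stand-ins for `ContentStore.search()`, and the rank values are invented for illustration.

```javascript
// Merge per-word hits by section title: more matching words wins,
// then the lower (better) rank breaks ties, as in the diff above.
function mergeByMatchCount(words, searchWord, maxResults = 5) {
  const sectionScores = new Map();
  for (const word of words) {
    for (const hit of searchWord(word)) {
      const existing = sectionScores.get(hit.title);
      if (existing) {
        existing.score += 1;
        if (hit.rank < existing.bestRank) {
          existing.bestRank = hit.rank;
          existing.result = hit;
        }
      } else {
        sectionScores.set(hit.title, { result: hit, score: 1, bestRank: hit.rank });
      }
    }
  }
  return Array.from(sectionScores.values())
    .sort((a, b) => b.score - a.score || a.bestRank - b.bestRank)
    .slice(0, maxResults)
    .map((s) => s.result);
}

// Toy index (hypothetical data): each word maps to the sections containing it.
const toyIndex = {
  connection: [{ title: "db.log", rank: -2.0 }, { title: "net.log", rank: -1.0 }],
  refused: [{ title: "db.log", rank: -1.5 }],
  database: [{ title: "db.log", rank: -3.0 }, { title: "schema.sql", rank: -0.5 }],
};

const ranked = mergeByMatchCount(
  ["connection", "refused", "database"],
  (word) => toyIndex[word] ?? [],
);
console.log(ranked.map((r) => r.title)); // [ 'db.log', 'net.log', 'schema.sql' ]
```

Keying the map by title means two chunks with identical first lines would be merged into one entry, which is worth keeping in mind now that chunk titles come from content rather than line ranges.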

package/build/store.d.ts CHANGED

@@ -40,6 +40,7 @@ export declare class ContentStore {
      */
     indexPlainText(content: string, source: string, linesPerChunk?: number): IndexResult;
     search(query: string, limit?: number): SearchResult[];
+    getDistinctiveTerms(sourceId: number, maxTerms?: number): string[];
     getStats(): StoreStats;
     close(): void;
 }

package/build/store.js CHANGED

@@ -12,6 +12,24 @@ import { readFileSync } from "node:fs";
 import { tmpdir } from "node:os";
 import { join } from "node:path";
 // ─────────────────────────────────────────────────────────
+// Constants
+// ─────────────────────────────────────────────────────────
+const STOPWORDS = new Set([
+    "the", "and", "for", "are", "but", "not", "you", "all", "can", "had",
+    "her", "was", "one", "our", "out", "has", "his", "how", "its", "may",
+    "new", "now", "old", "see", "way", "who", "did", "get", "got", "let",
+    "say", "she", "too", "use", "will", "with", "this", "that", "from",
+    "they", "been", "have", "many", "some", "them", "than", "each", "make",
+    "like", "just", "over", "such", "take", "into", "year", "your", "good",
+    "could", "would", "about", "which", "their", "there", "other", "after",
+    "should", "through", "also", "more", "most", "only", "very", "when",
+    "what", "then", "these", "those", "being", "does", "done", "both",
+    "same", "still", "while", "where", "here", "were", "much",
+    // Common in code/changelogs
+    "update", "updates", "updated", "deps", "dev", "tests", "test",
+    "add", "added", "fix", "fixed", "run", "running", "using",
+]);
+// ─────────────────────────────────────────────────────────
 // Helpers
 // ─────────────────────────────────────────────────────────
 function sanitizeQuery(query) {
@@ -155,6 +173,46 @@ export class ContentStore {
             contentType: r.content_type,
         }));
     }
+    // ── Vocabulary ──
+    getDistinctiveTerms(sourceId, maxTerms = 40) {
+        const stats = this.#db
+            .prepare("SELECT chunk_count FROM sources WHERE id = ?")
+            .get(sourceId);
+        if (!stats || stats.chunk_count < 3)
+            return [];
+        const totalChunks = stats.chunk_count;
+        const minAppearances = 2;
+        const maxAppearances = Math.max(3, Math.ceil(totalChunks * 0.4));
+        const rows = this.#db
+            .prepare("SELECT content FROM chunks WHERE source_id = ?")
+            .all(sourceId);
+        // Count document frequency (how many sections contain each word)
+        const docFreq = new Map();
+        for (const row of rows) {
+            const words = new Set(row.content
+                .toLowerCase()
+                .split(/[^\p{L}\p{N}_-]+/u)
+                .filter((w) => w.length >= 3 && !STOPWORDS.has(w)));
+            for (const word of words) {
+                docFreq.set(word, (docFreq.get(word) ?? 0) + 1);
+            }
+        }
+        const filtered = Array.from(docFreq.entries())
+            .filter(([, count]) => count >= minAppearances && count <= maxAppearances);
+        // Score: IDF (rarity) + length bonus + identifier bonus (underscore/camelCase)
+        const scored = filtered.map(([word, count]) => {
+            const idf = Math.log(totalChunks / count);
+            const lenBonus = Math.min(word.length / 20, 0.5);
+            const hasSpecialChars = /[_]/.test(word);
+            const isCamelOrLong = word.length >= 12;
+            const identifierBonus = hasSpecialChars ? 1.5 : isCamelOrLong ? 0.8 : 0;
+            return { word, score: idf + lenBonus + identifierBonus };
+        });
+        return scored
+            .sort((a, b) => b.score - a.score)
+            .slice(0, maxTerms)
+            .map((s) => s.word);
+    }
     // ── Stats ──
     getStats() {
         const sources = this.#db.prepare("SELECT COUNT(*) as c FROM sources").get()?.c ?? 0;
@@ -246,10 +304,14 @@ export class ContentStore {
             sections.length <= 200 &&
             sections.every((s) => Buffer.byteLength(s) < 5000)) {
             return sections
-                .map((section, i) => ({
-                title: `Section ${i + 1}`,
-                content: section.trim(),
-            }))
+                .map((section, i) => {
+                const trimmed = section.trim();
+                const firstLine = trimmed.split("\n")[0].slice(0, 80);
+                return {
+                    title: firstLine || `Section ${i + 1}`,
+                    content: trimmed,
+                };
+            })
                 .filter((s) => s.content.length > 0);
         }
         const lines = text.split("\n");
@@ -267,8 +329,9 @@ export class ContentStore {
                 break;
             const startLine = i + 1;
             const endLine = Math.min(i + slice.length, lines.length);
+            const firstLine = slice[0]?.trim().slice(0, 80);
             chunks.push({
-                title: `Lines ${startLine}-${endLine}`,
+                title: firstLine || `Lines ${startLine}-${endLine}`,
                 content: slice.join("\n"),
             });
         }
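
Note on `getDistinctiveTerms` above: the method scores each word by IDF over the chunks of one source, adds a small length bonus, and adds an identifier bonus for words containing an underscore or having 12 or more characters. The sketch below applies the same scoring to a plain array of strings instead of SQLite rows. The function name `distinctiveTerms`, the shortened stopword list, and the sample chunks are illustrative assumptions.

```javascript
// Shortened stopword list for illustration; the package ships a longer one.
const STOPWORDS = new Set(["the", "and", "for", "with", "from", "fix", "fixed", "update"]);

function distinctiveTerms(chunks, maxTerms = 40) {
  const totalChunks = chunks.length;
  if (totalChunks < 3) return [];
  const minAppearances = 2;
  const maxAppearances = Math.max(3, Math.ceil(totalChunks * 0.4));

  // Document frequency: number of chunks that contain each word at least once.
  const docFreq = new Map();
  for (const content of chunks) {
    const words = new Set(
      content
        .toLowerCase()
        .split(/[^\p{L}\p{N}_-]+/u)
        .filter((w) => w.length >= 3 && !STOPWORDS.has(w)),
    );
    for (const word of words) docFreq.set(word, (docFreq.get(word) ?? 0) + 1);
  }

  return Array.from(docFreq.entries())
    .filter(([, count]) => count >= minAppearances && count <= maxAppearances)
    .map(([word, count]) => {
      const idf = Math.log(totalChunks / count); // rarer words score higher
      const lenBonus = Math.min(word.length / 20, 0.5);
      const identifierBonus = /_/.test(word) ? 1.5 : word.length >= 12 ? 0.8 : 0;
      return { word, score: idf + lenBonus + identifierBonus };
    })
    .sort((a, b) => b.score - a.score)
    .slice(0, maxTerms)
    .map((s) => s.word);
}

// Hypothetical changelog-style chunks.
const chunks = [
  "fix the retry_count overflow in scheduler",
  "scheduler now reads retry_count from config",
  "update docs for the scheduler",
  "docs: describe timeout handling",
  "timeout applies to each request",
];
console.log(distinctiveTerms(chunks)); // [ 'retry_count', 'timeout', 'docs', 'scheduler' ]
```

Because words are lower-cased before counting, camelCase cannot be detected directly; the diff uses a length of 12 or more as a proxy for long identifiers.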

package/package.json CHANGED

@@ -1,8 +1,8 @@
 {
   "name": "context-mode",
-  "version": "0.5.0",
+  "version": "0.5.3",
   "type": "module",
-  "description": "Claude Code MCP plugin that saves 94% of your context window. Sandboxed code execution, FTS5 knowledge base, and smart truncation.",
+  "description": "Claude Code MCP plugin that saves 98% of your context window. Sandboxed code execution, FTS5 knowledge base, and intent-driven search.",
   "author": "Mert Koseoğlu",
   "license": "MIT",
   "keywords": [