grepmax 0.4.0 → 0.4.1

package/README.md CHANGED
@@ -24,7 +24,8 @@ Natural-language search that works like `grep`. Fast, local, and built for codin
  - **Role Detection:** Distinguishes `ORCHESTRATION` (high-level logic) from `DEFINITION` (types/classes).
  - **Local & Private:** 100% local embeddings via ONNX (CPU) or MLX (Apple Silicon GPU).
  - **Centralized Index:** One database at `~/.gmax/` — index once, search from anywhere.
- - **Agent-Ready:** Native output with symbols, roles, and call graphs.
+ - **LLM Summaries:** Optional Qwen3-Coder generates one-line descriptions per code chunk at index time.
+ - **Agent-Ready:** Pointer mode returns metadata (symbol, role, calls, summary) — no code snippets, ~80% fewer tokens.
 
  ## Quick Start
 
@@ -99,8 +100,8 @@ In our public benchmarks, `grepmax` can save about 20% of your LLM tokens and de
 
  | Tool | Description |
  | --- | --- |
- | `semantic_search` | Natural language code search. Use `root` to search a parent or sibling directory. |
- | `search_all` | Search ALL indexed code across every directory. |
+ | `semantic_search` | Code search by meaning. Returns pointers (symbol, file:line, role, calls, summary) by default. Use `root` for cross-directory search, `detail: "code"` for snippets. |
+ | `search_all` | Search ALL indexed code across every directory. Same pointer format. |
  | `code_skeleton` | Collapsed file structure (~4x fewer tokens than reading the full file) |
  | `trace_calls` | Call graph — who calls a symbol and what it calls (unscoped, crosses project boundaries) |
  | `list_symbols` | List indexed functions, classes, and types with definition locations |
@@ -228,6 +229,19 @@ On Macs with Apple Silicon, gmax defaults to MLX for GPU-accelerated embeddings.
 
  To force CPU mode: `GMAX_EMBED_MODE=cpu gmax index`
 
+ ### LLM Summaries
+
+ gmax can generate one-line natural language descriptions for every code chunk using a local LLM (Qwen3-Coder-30B-A3B via MLX). Summaries are pre-computed at index time and stored in LanceDB — zero latency at search time.
+
+ The summarizer server runs on port `8101` and auto-starts alongside the embed server. If unavailable, indexing proceeds without summaries.
+
+ Example search output with summaries:
+ ```
+ handleAuth [exported ORCH C:8] src/auth/handler.ts:45-90
+ Validates JWT from Authorization header, checks RBAC permissions, returns 401 on failure
+ parent:AuthController calls:validateToken,checkRole,respond
+ ```
+
  ## Configuration
 
  ### Ignoring Files
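
Reviewer note: the pointer output added to the README above is compact enough that a consumer could turn it into a structured record. The sketch below is one possible way to do that, written only against the single example shown in the diff. The `PointerResult` shape, the `parsePointer` name, and the assumption that every result has a header line, a summary line, and a `parent:`/`calls:` line are all hypothetical and are not part of grepmax's API. The bracketed tags (`exported`, `ORCH`, `C:8`) are kept as raw strings because the diff does not define them.

```typescript
// Hypothetical record shape; field names are illustrative, not grepmax's API.
interface PointerResult {
  symbol: string;
  tags: string[]; // raw bracket contents, e.g. ["exported", "ORCH", "C:8"]
  file: string;
  startLine: number;
  endLine: number;
  summary: string;
  parent?: string;
  calls: string[];
}

// Parse one pointer block, assuming the layout of the README example:
//   <symbol> [<tags>] <file>:<start>-<end>
//   <one-line summary>
//   parent:<name> calls:<a>,<b>,...
function parsePointer(block: string): PointerResult {
  const lines = block
    .split("\n")
    .map((l) => l.trim())
    .filter((l) => l.length > 0);
  if (lines.length === 0) throw new Error("empty pointer block");

  const header = /^(\S+)\s+\[([^\]]*)\]\s+(.+):(\d+)-(\d+)$/.exec(lines[0]);
  if (!header) throw new Error(`unrecognized header: ${lines[0]}`);

  let summary = "";
  let parent: string | undefined;
  let calls: string[] = [];

  for (const line of lines.slice(1)) {
    if (/^(parent|calls):/.test(line)) {
      // Metadata line: whitespace-separated key:value tokens.
      for (const token of line.split(/\s+/)) {
        const idx = token.indexOf(":");
        if (idx < 0) continue;
        const key = token.slice(0, idx);
        const value = token.slice(idx + 1);
        if (key === "parent") parent = value;
        if (key === "calls") calls = value.split(",").filter((c) => c.length > 0);
      }
    } else {
      // Anything else is treated as summary text.
      summary = summary ? `${summary} ${line}` : line;
    }
  }

  return {
    symbol: header[1],
    tags: header[2].split(/\s+/).filter((t) => t.length > 0),
    file: header[3],
    startLine: Number(header[4]),
    endLine: Number(header[5]),
    summary,
    parent,
    calls,
  };
}

// Usage with the example from the README diff.
const sample = [
  "handleAuth [exported ORCH C:8] src/auth/handler.ts:45-90",
  "  Validates JWT from Authorization header, checks RBAC permissions, returns 401 on failure",
  "  parent:AuthController calls:validateToken,checkRole,respond",
].join("\n");

const parsed = parsePointer(sample);
```

If the real output omits the summary (for example when the summarizer server is unavailable), this sketch simply leaves `summary` empty rather than failing.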
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "grepmax",
- "version": "0.4.0",
+ "version": "0.4.1",
  "author": "Robert Owens <robowens@me.com>",
  "homepage": "https://github.com/reowens/grepmax",
  "bugs": {
@@ -29,7 +29,7 @@
  "NOTICE"
  ],
  "license": "Apache-2.0",
- "description": "Local grep-like search tool for your codebase.",
+ "description": "Semantic code search for coding agents. Local embeddings, LLM summaries, call graph tracing.",
  "dependencies": {
  "@clack/prompts": "^1.1.0",
  "@huggingface/transformers": "^3.8.0",
@@ -1,6 +1,6 @@
  {
  "name": "grepmax",
- "version": "0.4.0",
+ "version": "0.4.1",
  "description": "Semantic code search for Claude Code. Automatically indexes your project and provides intelligent search capabilities.",
  "author": {
  "name": "Robert Owens",