grepmax 0.4.0 → 0.4.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md
CHANGED
|
@@ -24,7 +24,8 @@ Natural-language search that works like `grep`. Fast, local, and built for codin
|
|
|
24
24
|
- **Role Detection:** Distinguishes `ORCHESTRATION` (high-level logic) from `DEFINITION` (types/classes).
|
|
25
25
|
- **Local & Private:** 100% local embeddings via ONNX (CPU) or MLX (Apple Silicon GPU).
|
|
26
26
|
- **Centralized Index:** One database at `~/.gmax/` — index once, search from anywhere.
|
|
27
|
-
- **
|
|
27
|
+
- **LLM Summaries:** Optional Qwen3-Coder generates one-line descriptions per code chunk at index time.
|
|
28
|
+
- **Agent-Ready:** Pointer mode returns metadata (symbol, role, calls, summary) — no code snippets, ~80% fewer tokens.
|
|
28
29
|
|
|
29
30
|
## Quick Start
|
|
30
31
|
|
|
@@ -99,8 +100,8 @@ In our public benchmarks, `grepmax` can save about 20% of your LLM tokens and de
|
|
|
99
100
|
|
|
100
101
|
| Tool | Description |
|
|
101
102
|
| --- | --- |
|
|
102
|
-
| `semantic_search` |
|
|
103
|
-
| `search_all` | Search ALL indexed code across every directory. |
|
|
103
|
+
| `semantic_search` | Code search by meaning. Returns pointers (symbol, file:line, role, calls, summary) by default. Use `root` for cross-directory search, `detail: "code"` for snippets. |
|
|
104
|
+
| `search_all` | Search ALL indexed code across every directory. Same pointer format. |
|
|
104
105
|
| `code_skeleton` | Collapsed file structure (~4x fewer tokens than reading the full file) |
|
|
105
106
|
| `trace_calls` | Call graph — who calls a symbol and what it calls (unscoped, crosses project boundaries) |
|
|
106
107
|
| `list_symbols` | List indexed functions, classes, and types with definition locations |
|
|
@@ -228,6 +229,19 @@ On Macs with Apple Silicon, gmax defaults to MLX for GPU-accelerated embeddings.
|
|
|
228
229
|
|
|
229
230
|
To force CPU mode: `GMAX_EMBED_MODE=cpu gmax index`
|
|
230
231
|
|
|
232
|
+
### LLM Summaries
|
|
233
|
+
|
|
234
|
+
gmax can generate one-line natural language descriptions for every code chunk using a local LLM (Qwen3-Coder-30B-A3B via MLX). Summaries are pre-computed at index time and stored in LanceDB — zero latency at search time.
|
|
235
|
+
|
|
236
|
+
The summarizer server runs on port `8101` and auto-starts alongside the embed server. If unavailable, indexing proceeds without summaries.
|
|
237
|
+
|
|
238
|
+
Example search output with summaries:
|
|
239
|
+
```
|
|
240
|
+
handleAuth [exported ORCH C:8] src/auth/handler.ts:45-90
|
|
241
|
+
Validates JWT from Authorization header, checks RBAC permissions, returns 401 on failure
|
|
242
|
+
parent:AuthController calls:validateToken,checkRole,respond
|
|
243
|
+
```
|
|
244
|
+
|
|
231
245
|
## Configuration
|
|
232
246
|
|
|
233
247
|
### Ignoring Files
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "grepmax",
|
|
3
|
-
"version": "0.4.
|
|
3
|
+
"version": "0.4.1",
|
|
4
4
|
"author": "Robert Owens <robowens@me.com>",
|
|
5
5
|
"homepage": "https://github.com/reowens/grepmax",
|
|
6
6
|
"bugs": {
|
|
@@ -29,7 +29,7 @@
|
|
|
29
29
|
"NOTICE"
|
|
30
30
|
],
|
|
31
31
|
"license": "Apache-2.0",
|
|
32
|
-
"description": "
|
|
32
|
+
"description": "Semantic code search for coding agents. Local embeddings, LLM summaries, call graph tracing.",
|
|
33
33
|
"dependencies": {
|
|
34
34
|
"@clack/prompts": "^1.1.0",
|
|
35
35
|
"@huggingface/transformers": "^3.8.0",
|