npm - @possumtech/rummy - Versions diffs - 0.2.7 → 0.3.0 - Mend

@possumtech/rummy 0.2.7 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (119) hide show

package/.env.example +12 -3
package/EXCEPTIONS.md +46 -0
package/PLUGINS.md +454 -197
package/SPEC.md +284 -93
package/migrations/001_initial_schema.sql +57 -70
package/package.json +16 -10
package/service.js +1 -1
package/src/agent/AgentLoop.js +254 -70
package/src/agent/ContextAssembler.js +18 -4
package/src/agent/KnownStore.js +156 -23
package/src/agent/ProjectAgent.js +5 -4
package/src/agent/ResponseHealer.js +21 -1
package/src/agent/TurnExecutor.js +393 -115
package/src/agent/XmlParser.js +92 -39
package/src/agent/known_checks.sql +5 -4
package/src/agent/known_queries.sql +4 -3
package/src/agent/known_store.sql +45 -15
package/src/agent/loops.sql +63 -0
package/src/agent/runs.sql +7 -7
package/src/agent/schemes.sql +5 -2
package/src/agent/tokens.js +6 -21
package/src/agent/turns.sql +13 -4
package/src/hooks/Hooks.js +18 -0
package/src/hooks/PluginContext.js +14 -10
package/src/hooks/RummyContext.js +30 -10
package/src/hooks/ToolRegistry.js +83 -19
package/src/llm/LlmProvider.js +27 -8
package/src/llm/OpenAiClient.js +20 -0
package/src/llm/OpenRouterClient.js +24 -2
package/src/llm/XaiClient.js +47 -2
package/src/plugins/ask_user/README.md +4 -4
package/src/plugins/ask_user/ask_user.js +8 -7
package/src/plugins/ask_user/ask_userDoc.js +29 -0
package/src/plugins/budget/BudgetGuard.js +74 -0
package/src/plugins/budget/README.md +43 -0
package/src/plugins/budget/budget.js +79 -0
package/src/plugins/cp/README.md +5 -4
package/src/plugins/cp/cp.js +16 -12
package/src/plugins/cp/cpDoc.js +29 -0
package/src/plugins/current/README.md +4 -4
package/src/plugins/current/current.js +12 -10
package/src/plugins/engine/engine.sql +5 -10
package/src/plugins/engine/turn_context.sql +13 -13
package/src/plugins/env/README.md +3 -4
package/src/plugins/env/env.js +8 -7
package/src/plugins/env/envDoc.js +29 -0
package/src/plugins/file/README.md +9 -12
package/src/plugins/file/file.js +34 -45
package/src/plugins/get/README.md +2 -2
package/src/plugins/get/get.js +28 -11
package/src/plugins/get/getDoc.js +41 -0
package/src/plugins/hedberg/docs.md +0 -9
package/src/plugins/hedberg/hedberg.js +4 -6
package/src/plugins/hedberg/matcher.js +1 -1
package/src/plugins/hedberg/normalize.js +28 -0
package/src/plugins/hedberg/patterns.js +31 -33
package/src/plugins/hedberg/sed.js +17 -10
package/src/plugins/helpers.js +2 -2
package/src/plugins/index.js +93 -28
package/src/plugins/instructions/README.md +6 -2
package/src/plugins/instructions/instructions.js +21 -5
package/src/plugins/instructions/preamble.md +9 -5
package/src/plugins/known/README.md +10 -7
package/src/plugins/known/known.js +33 -23
package/src/plugins/known/knownDoc.js +33 -0
package/src/plugins/mv/README.md +5 -4
package/src/plugins/mv/mv.js +16 -12
package/src/plugins/mv/mvDoc.js +31 -0
package/src/plugins/persona/persona.js +78 -0
package/src/plugins/previous/README.md +2 -2
package/src/plugins/previous/previous.js +12 -8
package/src/plugins/progress/progress.js +44 -12
package/src/plugins/prompt/README.md +5 -5
package/src/plugins/prompt/prompt.js +23 -19
package/src/plugins/rm/README.md +4 -4
package/src/plugins/rm/rm.js +29 -12
package/src/plugins/rm/rmDoc.js +30 -0
package/src/plugins/rpc/README.md +15 -28
package/src/plugins/rpc/rpc.js +63 -107
package/src/plugins/set/README.md +13 -12
package/src/plugins/set/set.js +82 -21
package/src/plugins/set/setDoc.js +45 -0
package/src/plugins/sh/README.md +4 -4
package/src/plugins/sh/sh.js +8 -7
package/src/plugins/sh/shDoc.js +29 -0
package/src/plugins/{skills/skills.js → skill/skill.js} +12 -54
package/src/plugins/summarize/README.md +6 -5
package/src/plugins/summarize/summarize.js +7 -6
package/src/plugins/summarize/summarizeDoc.js +33 -0
package/src/plugins/telemetry/telemetry.js +20 -8
package/src/plugins/think/README.md +20 -0
package/src/plugins/think/think.js +5 -0
package/src/plugins/unknown/README.md +5 -5
package/src/plugins/unknown/unknown.js +11 -8
package/src/plugins/unknown/unknownDoc.js +31 -0
package/src/plugins/update/README.md +3 -8
package/src/plugins/update/update.js +7 -6
package/src/plugins/update/updateDoc.js +33 -0
package/src/server/ClientConnection.js +3 -5
package/src/server/RpcRegistry.js +52 -4
package/src/sql/v_model_context.sql +31 -39
package/src/sql/v_run_log.sql +3 -3
package/src/agent/prompt_queue.sql +0 -39
package/src/plugins/ask_user/docs.md +0 -2
package/src/plugins/cp/docs.md +0 -2
package/src/plugins/env/docs.md +0 -2
package/src/plugins/get/docs.md +0 -6
package/src/plugins/known/docs.md +0 -3
package/src/plugins/mv/docs.md +0 -2
package/src/plugins/rm/docs.md +0 -4
package/src/plugins/set/docs.md +0 -4
package/src/plugins/sh/docs.md +0 -2
package/src/plugins/skills/README.md +0 -25
package/src/plugins/store/README.md +0 -20
package/src/plugins/store/docs.md +0 -5
package/src/plugins/store/store.js +0 -52
package/src/plugins/summarize/docs.md +0 -4
package/src/plugins/unknown/docs.md +0 -5
package/src/plugins/update/docs.md +0 -4

package/SPEC.md CHANGED Viewed

@@ -15,8 +15,8 @@ that thread a value through subscribers in priority order).
 **Every `<tag>` the model sees is a plugin.** The `<known>` section
 of the system message is rendered by the known plugin. The `<progress>`
-section is rendered by the progress plugin. The `<ask>` tag is rendered
-by the prompt plugin. No monolithic assembler decides what goes where.
+section is rendered by the progress plugin. The `<prompt>` tag is
+rendered by the prompt plugin. No monolithic assembler decides what goes where.
 Each plugin filters for its own data from the shared row set, renders
 its section, and returns.
@@ -42,7 +42,8 @@ body, attributes, and state.
 ```sql
 known_entries (
-    id, run_id, turn, path, body, scheme, state, hash,
+    id, run_id, loop_id, turn, path, body, scheme,
+    status INTEGER, fidelity TEXT, hash,
     attributes, tokens, tokens_full, refs, write_count,
     created_at, updated_at
 )
@@ -50,58 +51,63 @@ known_entries (
 | Column | Purpose |
 |--------|---------|
-| `path` | Entry identity. Bare paths (`src/app.js`) or URIs (`known://auth`) |
+| `path` | Entry identity. Bare paths (`src/app.js`) or URIs (`known://auth`). Max 2048 chars. |
 | `body` | Tag body text. File content, tool output, skill docs. |
 | `attributes` | Tag attributes as JSON. Handler-private workspace. `CHECK (json_valid)` |
 | `scheme` | Generated from path via `schemeOf()`. Drives dispatch and view routing |
-| `state` | Lifecycle stage. Determines model visibility |
+| `status` | HTTP status code (200, 202, 400, 413, etc.) |
+| `fidelity` | Visibility level: full, summary, index, archive |
 | `hash` | SHA-256 for file change detection |
-| `tokens` | Context cost at current state |
+| `tokens` | Display-only token count at current fidelity. NEVER used for budget. |
 | `tokens_full` | Cost of raw body at full fidelity |
 | `turn` | Freshness — when was this entry last touched |
-### 1.2 Schemes & States
+### 1.2 Schemes, Status & Fidelity
-Paths use URI scheme syntax. Bare paths (no `://`) are files.
-**Files** (`scheme IS NULL`):
-| State | Model sees |
-|-------|-----------|
-| `full` | File content in code fence |
-| `index` | Path listed in File Index |
-| `stored` | Invisible, retrievable via `<get>` |
-**Knowledge** (`known://`, `unknown://`):
-| State | Model sees |
-|-------|-----------|
-| `full` | Key — value in bullet list |
-| `stored` | Key listed, no value |
-**Tool results** (`set://`, `sh://`, `env://`, `rm://`, `ask_user://`,
-`mv://`, `cp://`, `search://`, `get://`, `store://`):
-All start at `full` state when recorded. Handlers set the final state:
-`proposed`, `pass`, `rejected`, `error`, `pattern`, `read`, `stored`, `info`.
+Every entry has two independent dimensions: **status** (HTTP integer)
+and **fidelity** (visibility level). These are separate concerns.
-**Skills** (`skill://`): `full` or `stored`. Rendered in system message.
+**Status** (lifecycle): 200 (OK), 202 (proposed), 400 (bad request),
+404 (not found), 409 (conflict), 413 (too large), 499 (aborted),
+500 (error).
-**Tools** (`tool://`): `full`, `model_visible = 0`. Internal plugin metadata.
+**Fidelity** (visibility): `full` (body visible), `summary`
+(model-authored summary), `index` (path only), `archive` (invisible,
+retrievable via `<get>`).
-**URLs** (`http://`, `https://`): `full`, `summary`, `stored`.
-**Structural** (`summarize://`, `update://`): Status signals.
-**Audit** (`system://`, `prompt://`, `ask://`, `act://`, `progress://`,
-`reasoning://`, `model://`, `error://`, `user://`, `assistant://`,
-`content://`): `info` state, `model_visible = 0` (hidden from model).
-### 1.3 State Validation
+Paths use URI scheme syntax. Bare paths (no `://`) are files.
-The `schemes` table is a bootstrap registry — 30 rows of static config.
-INSERT/UPDATE triggers validate state against `schemes.valid_states`.
-Plugins cannot bypass this (circular dependency prevents schemes as entries).
+Every entry plays one of four roles:
+| Role | Category | Section | Description |
+|------|----------|---------|-------------|
+| **Data** | `data` | `<knowns>` | Entries the model works with — persistent state |
+| **Logging** | `logging` | `<current>`/`<previous>` | Records of what happened — tool results, lifecycle signals |
+| **Unknowns** | `unknown` | `<unknowns>` | Open questions the model is tracking |
+| **Prompt** | `prompt` | `<prompt>` | The task driving the loop |
+`logging` is the default category. Plugins opt into `data` explicitly.
+| Scheme | Category | Description |
+|--------|----------|-------------|
+| `NULL` (bare path) | data | File content. JOINs via `COALESCE(scheme, 'file')`. `file://` prefix stripped by hedberg. |
+| `known://` | data | Model-registered knowledge. One fact per entry. |
+| `skill://` | data | Skill docs. Rendered in system message. |
+| `http://`, `https://` | data | Web content. |
+| `unknown://` | unknown | Unresolved questions. |
+| `prompt://` | prompt | User prompt with `mode` attribute (`ask`/`act`). |
+| `progress://` | prompt | Continuation prompt. |
+| `set://`, `get://`, `sh://`, `env://`, `rm://`, `mv://`, `cp://`, `ask_user://`, `search://` | logging | Tool result entries. |
+| `summarize://`, `update://` | logging | Lifecycle signals. |
+| `tool://` | audit | Internal plugin metadata. `model_visible = 0`. |
+| `system://`, `reasoning://`, `model://`, `error://`, `user://`, `assistant://`, `content://` | audit | Audit entries. `model_visible = 0`. |
+### 1.3 Scheme Registry
+The `schemes` table is a bootstrap registry — static rows of
+`(name, model_visible, category)`. Plugins register their scheme
+via `core.registerScheme()` in the constructor. The `model_visible`
+flag controls whether entries appear in `v_model_context`.
 ### 1.4 UPSERT Semantics
@@ -117,13 +123,21 @@ The K/V store is the memory. Relational tables are the skeleton.
 ```sql
 projects (id, name UNIQUE, project_root, config_path, created_at)
 models   (id, alias UNIQUE, actual, context_length, created_at)
-runs     (id, project_id, parent_run_id, model, alias UNIQUE, status,
-          temperature, persona, context_limit, next_turn, created_at)
-turns    (id, run_id, sequence, prompt_tokens, completion_tokens,
-          total_tokens, cost, created_at)
+runs     (id, project_id, parent_run_id, model, alias UNIQUE,
+          status INTEGER, temperature, persona, context_limit,
+          next_turn, next_loop, created_at)
+loops    (id, run_id, sequence, mode, model, prompt, status INTEGER,
+          config JSON, result JSON, created_at)
+turns    (id, run_id, loop_id, sequence, context_tokens,
+          reasoning_content, prompt_tokens, cached_tokens,
+          completion_tokens, reasoning_tokens, total_tokens, cost,
+          created_at)
 file_constraints (id, project_id, pattern, visibility, created_at)
-prompt_queue     (id, run_id, mode, model, prompt, config, status, result)
+  -- Project-level config. NOT tool dispatch. See §2.3.
+turn_context     (id, run_id, loop_id, turn, ordinal, path, scheme,
+                  status, fidelity, body, tokens, attributes,
+                  category, source_turn)
 rpc_log          (id, project_id, method, rpc_id, params, result, error)
 ```
@@ -136,19 +150,39 @@ client picks for every run.
 ### 2.1 Run State Machine
+All status fields are HTTP integer codes:
 ```
-queued → running → proposed → running → completed
-                → completed
-                → failed → running
-                → aborted → running
+100 (queued) → 200 (running) → 202 (proposed) → 200 (running) → 200 (completed)
+                              → 200 (completed)
+                              → 500 (failed) → 200 (running)
+                              → 499 (aborted) → 200 (running)
 ```
 All terminal states allow transition back to `running`. Runs are long-lived.
-### 2.2 Prompt Queue
+### 2.2 Loops Table
+The loops table IS the prompt queue. Each `ask`/`act` creates a loop.
+FIFO per run (ordered by sequence). One active at a time. Abort stops
+the current loop; pending loops survive. Projects > runs > loops > turns.
-All prompts flow through `prompt_queue`. FIFO per run. One active at a time.
-Abort stops the current prompt; pending prompts survive.
+### 2.3 File Constraints
+The `file_constraints` table is project-level configuration — it
+defines which files a project cares about. This is backbone, not tool
+dispatch. Constraints have three visibilities: `active` (promoted to
+full), `readonly` (promoted but not editable), `ignore` (demoted).
+**Boundary:** Setting a constraint (`File.setConstraint`) is a
+project-config write. Promoting/demoting the matching entries is tool
+dispatch that goes through the handler chain with budget enforcement.
+These are separate operations: constraint persists across runs, entry
+promotion is scoped to a run and subject to the same budget rules as
+a model `<get>`.
+`store` RPC manages constraints directly — it is not a model tool.
+`get` RPC with `persist` sets the constraint AND dispatches promotion.
 ---
@@ -169,13 +203,17 @@ object is the same shape at every tier.
 | Method | Model | Client | Plugin |
 |--------|-------|--------|--------|
-| `get`, `set`, `rm`, `mv`, `cp`, `sh`, `env`, `store` | ✓ | ✓ | ✓ |
+| `get`, `set`, `rm`, `mv`, `cp`, `sh`, `env`, `search` | ✓ | ✓ | ✓ |
 | `known`, `unknown`, `ask_user`, `summarize`, `update` | ✓ | ✓ | ✓ |
 | `ask`, `act`, `resolve`, `abort`, `startRun` | — | ✓ | ✓ |
 | `getRuns`, `getModels`, `getEntries` | — | ✓ | ✓ |
 | `on()`, `filter()`, db/store access | — | — | ✓ |
-Model tier restrictions enforced by mode (ask removes act-only tools).
+Model tier restrictions enforced by unified `resolveForLoop(mode, flags)`.
+Ask mode excludes `sh`. Flags: `noInteraction` excludes `ask_user`,
+`noWeb` excludes `search`, `noBench` excludes `ask_user`/`env`/`sh`.
+13 model tools: get, set, known, unknown, env, sh, rm, cp, mv, search,
+summarize, update, ask_user.
 Client tier requires project init. Plugin tier has no restrictions.
 ### 3.2 Dispatch Path
@@ -188,6 +226,14 @@ Client: JSON-RPC  → { method, params }   → #record() → dispatch(scheme, en
 Plugin: rummy.rm({ path })               → #record() → dispatch(scheme, entry, rummy)
 ```
+**Lifecycle/action split:** Commands are classified as lifecycle signals
+(`summarize`, `update`, `unknown`, `known`) or action commands (everything
+else). Lifecycle signals always dispatch — they are state declarations that
+cannot be 409'd by sequential dispatch. Action commands dispatch sequentially;
+a 202 proposal or error aborts subsequent actions. If the model sends
+`<summarize>` but actions in the same turn failed, the summarize is
+overridden to an update (the model's assertion that it's done is false).
 ### 3.3 Plugin Convention
 A plugin is an instantiated class. The class name matches the file name.
@@ -247,9 +293,9 @@ Two messages per turn. System = stable truth. User = active task.
         [persona/]
         [skills/]
     [/instructions]
-    <knowledge>
+    <knowns>
         ...entries sorted by fidelity (index, summary, full), then by scheme
-    </knowledge>
+    </knowns>
     <previous>
         (pre-loop user prompt, model responses, agent warnings, and tools used, in order)
     </previous>
@@ -260,16 +306,14 @@ Two messages per turn. System = stable truth. User = active task.
         (current loop model responses, agent warnings, and tools used, in order)
     </current>
     <progress>the above actions have been performed on this user prompt:</progress>
-    <ask tools="..." warn="...">user prompt</ask>
-    — OR —
-    <act tools="...">user prompt</act>
+    <prompt mode="ask|act" tools="...">user prompt</prompt>
 [/user]
 ```
 **System** contains everything the model needs to know.
 **User** contains everything the model needs to do.
-The `<ask>`/`<act>` tag is present on every turn — first turn and
+The `<prompt>` tag is present on every turn — first turn and
 continuations alike. The model always sees its task. The active prompt
 is extracted from its chronological position and placed last for maximum
 recency. `<progress>` bridges the gap, narrating the causal relationship
@@ -287,7 +331,7 @@ first turn of the first loop.
 **Current** = the active loop's work so far. Model responses, tool
 results, agent warnings — in order. Does NOT include the user prompt
-(one per loop, extracted to `<ask>`/`<act>`). Lives in the user
+(one per loop, extracted to `<prompt>`). Lives in the user
 message as immediate context. Empty on the first turn of a loop.
 When a new prompt arrives on an existing run, the prior loop's
@@ -313,7 +357,7 @@ text from body + attributes.
 Each turn:
 1. Write `instructions://system` (empty body, attributes = { persona })
-2. Run plugin hooks (`onTurn`) — plugins modify entries before the model sees them
+2. Emit `turn.started` — plugins write prompt/progress/instructions entries
 3. Project `instructions://system` → instructions text
 4. Query `v_model_context` VIEW → visible entries
 5. Project each entry through its tool's `full`/`summary` projection
@@ -325,21 +369,76 @@ Each turn:
 8. Invoke `assembly.user` filter chain (empty string as base):
    - Current plugin (priority 100) → `<current>` section
    - Progress plugin (priority 200) → `<progress>` section
-   - Prompt plugin (priority 300) → `<ask>`/`<act>` section
+   - Prompt plugin (priority 300) → `<prompt>` section
 9. Store as `system://N` and `user://N` audit entries
-The VIEW determines visibility. State IS fidelity:
+The VIEW determines visibility from `fidelity` and `status`:
 - `full` → body visible
-- `summary` → body visible
+- `summary` → summary visible (model-authored `summary` attribute if set)
 - `index` → path listed, no content
-- `stored` → invisible
-- `proposed` → invisible (pending client)
+- `archive` → invisible (retrievable via `<get>`)
+- `status = 202` → invisible (proposed, pending client)
 - `model_visible = 0` → invisible (audit, tool, instructions)
-### 4.5 progress:// as Entry
+Model controls fidelity via `<set>` attributes: `archive`, `summary`,
+`index`, `full`. The `summary="..."` attribute attaches a description
+(<= 80 chars) that persists across fidelity changes.
+### 4.5 Budget Enforcement
+The model owns its context. The system enforces a hard ceiling and
+provides advisory warnings — it does not automatically manage entries.
+**Pre-LLM check:** The budget plugin measures `countTokens()` on the
+assembled messages. If assembled tokens exceed `contextSize`, the turn
+returns 413 without calling the LLM. This triggers panic mode (see
+§4.6).
+**Write-layer gate:** BudgetGuard on KnownStore gates every write
+during dispatch. `upsert()`, `promoteByPattern()`, and
+`updateBodyByPattern()` check token delta against remaining headroom.
+Exceeding the budget throws `BudgetExceeded` — the tool 413s, the
+guard trips, and all subsequent tools in the turn fail.
+**Exemptions:** `status >= 400` entries (error results), `model_visible
+= 0` entries (audit), `fidelity = "archive"` entries (not in context).
+**Size gate:** Known entries exceeding 500 tokens are rejected with
+413, forcing atomic entries.
+**Advisory warnings** (progress plugin):
+- 50%: "You may free space by lowering the fidelity of entries"
+- 75%: "YOU MUST free space... or the run will fail"
+**Token math:** `Math.ceil(text.length / RUMMY_TOKEN_DIVISOR)`. One
+formula, one file (`src/agent/tokens.js`), env-configurable. No
+external dependencies. `contextSize` is the ceiling. Over = 413.
+Under = 200. No margins.
-The continuation prompt is a `progress://N` entry. Plugins can modify its
-body before materialization.
+### 4.6 Panic Mode
+When a new prompt arrives and the assembled context exceeds
+`contextSize`, the system enters panic mode instead of failing to
+the client.
+1. The failed loop is completed with 413 (audit trail)
+2. A panic loop is enqueued (`mode = "panic"`, `noRepo = true`)
+3. The original loop is re-enqueued to retry after panic
+4. The model receives a prompt with the exact shortfall in tokens
+5. Tools: get, set, known, unknown, rm, mv, cp, summarize, update
+6. Excluded: sh, env, search, ask_user
+**Strike system:** Each turn without context reduction = 1 strike.
+Any reduction resets the counter. 3 consecutive strikes = hard 413
+to client. Unlimited turns as long as the model makes progress.
+One panic attempt per drain cycle. If the retried original loop also
+413s, hard-fail to the client.
+**`ToolRegistry.view()`** prepends `attributes.summary` above the
+plugin's summary view output at summary fidelity. The model authors
+summaries (<= 80 chars) via `<set summary="...">`. Summaries persist
+across fidelity changes.
 ---
@@ -369,22 +468,25 @@ JSON-RPC 2.0 over WebSocket. `discover` returns the live catalog.
 | Method | Params |
 |--------|--------|
-| `read` | `{ path, run?, persist?, readonly? }` |
+| `get` | `{ path, run, persist?, readonly? }` |
+| `set` | `{ run, path, body?, attributes? }` |
+| `rm` | `{ run, path }` |
+| `mv` | `{ run, path, to }` |
+| `cp` | `{ run, path, to }` |
 | `store` | `{ path, run?, persist?, ignore?, clear? }` |
-| `write` | `{ run, path, body?, state?, attributes? }` |
-| `delete` | `{ run, path }` |
 | `getEntries` | `{ pattern?, body?, run?, limit?, offset? }` |
-`persist` creates a project-level file constraint (operator privilege).
-Without `persist`, operations dispatch through the handler chain.
+All entry operations dispatch through the handler chain. `persist`
+on `get` also sets a project-level file constraint (operator privilege).
+`store` manages file constraints — not a model tool.
 #### Runs
 | Method | Params |
 |--------|--------|
 | `startRun` | `{ model, temperature?, persona?, contextLimit? }` |
-| `ask` | `{ prompt, model, run?, temperature?, persona?, contextLimit?, noContext?, fork? }` |
-| `act` | `{ prompt, model, run?, temperature?, persona?, contextLimit?, noContext?, fork? }` |
+| `ask` | `{ prompt, model, run?, temperature?, persona?, contextLimit?, noRepo?, noInteraction?, noWeb?, fork? }` |
+| `act` | `{ prompt, model, run?, temperature?, persona?, contextLimit?, noRepo?, noInteraction?, noWeb?, fork? }` |
 | `run/resolve` | `{ run, resolution: { path, action, output? } }` |
 | `run/abort` | `{ run }` |
 | `run/rename` | `{ run, name }` |
@@ -392,6 +494,10 @@ Without `persist`, operations dispatch through the handler chain.
 | `run/config` | `{ run, temperature?, persona?, contextLimit?, model? }` |
 `model` is required on `ask`, `act`, and `startRun`. No default.
+`noRepo` disables default project/repo file scanning (files can still
+be added explicitly by the client).
+`noInteraction` removes `ask_user` from the tool list.
+`noWeb` removes `search` from the tool list.
 #### Queries
@@ -445,7 +551,80 @@ Each plugin has its own README at `src/plugins/{name}/README.md`.
 ---
-## 7. Hedberg Editing Syntax
+## 7. Tool Documentation Design
+Tool docs are the most carefully designed text in rummy. Every line
+simultaneously teaches syntax, implies workflow priority, demonstrates
+pattern capabilities, and constrains misuse. Each letter earns its place.
+### Principles
+**Show, don't tell.** Examples ARE the documentation. A model learns
+`<get path="known://*">auth</get>` from seeing it, not from being told
+"you can filter known entries by keyword." Examples are ordered from
+simple to powerful — weak models learn from examples 1-2, strong models
+pick up the pattern from example 3.
+**Lifecycle continuity.** Examples weave stories across tools. The get
+docs end with `<set path="..." fidelity="index"/>`. The known docs
+reference `<get path="known://*">keyword</get>` for recall and
+`<set path="known://..." archive/>` for archiving. The unknown docs
+reference `<get/>` for investigation and `<rm/>` for cleanup. A model
+reading the full tool docs encounters a coherent workflow:
+discover → load → reason → edit → archive → recall.
+**RFC 2119 semantics.** Constraint bullets use YOU MUST, YOU MUST NOT,
+YOU SHOULD, YOU MAY from RFC 2119. Every LLM has extensive pretraining
+on RFC documents where these keywords carry precise semantic weight.
+MUST is absolute. SHOULD is strong advisory. MAY is permissive. This
+is not decorative — it's leveraging the model's existing understanding
+of requirement levels.
+**Consistent structure.** Every tool doc follows: header (syntax), 2+
+examples, 2+ constraint bullets. Inconsistent formatting reads as
+inconsistent importance. A tool with 5 examples and dense bullets feels
+complex; a tool with 1 line feels disposable. Both are wrong — every
+tool is equally real, each doc is proportional to the tool's surface area.
+### Format
+Tool docs live in `*Doc.js` files as annotated line arrays:
+```js
+const LINES = [
+    ["* Body text filters results by content match",
+        "Generalizes examples 2-3. Body = filter, not just path."],
+];
+export default LINES.map(([text]) => text).join("\n");
+```
+The first element is the model-facing text. The second is the rationale —
+visible only in source. Changing any line requires reading all rationales
+first. This prevents well-intentioned edits from breaking subtle behavioral
+guarantees that adjacent lines depend on.
+### Tool Display Order
+Tools are presented gather → reason → act → communicate. Position in
+the list implies priority. `get` is first. `ask_user` is last. The
+order is defined in `ToolRegistry.TOOL_ORDER` and applied by
+`resolveForLoop()`. The same method handles all tool exclusions —
+mode restrictions, `noInteraction`, `noWeb`, `noBench` — through
+one unified mechanism.
+### Pattern Distribution
+Hedbergian pattern matching (globs, body filters, preview) is taught
+across multiple tools, not concentrated in one. `get` shows content
+filtering. `cp` shows glob batch operations. `rm` shows preview safety.
+Each tool reinforces the pattern vocabulary from a different angle.
+A model that sees `path="known://*"` in get, `path="known://plan_*"` in
+cp, and `path="known://temp_*" preview` in rm learns that patterns
+are universal — not a feature of any single tool.
+---
+## 8. Hedberg Editing Syntax
 The model picks its preferred edit format. The parser understands all of them:
@@ -460,26 +639,36 @@ The model picks its preferred edit format. The parser understands all of them:
 ---
-## 8. Response Healing
+## 9. Response Healing
-The server never throws on model output. Recovery order:
+The server never throws on model output. "Model behavior" is never an
+acceptable explanation. Recovery order:
 1. Can we recover? Extract the data and continue.
 2. Can we warn? Log structured warnings.
 3. Did our structure cause this? Check formatting, prompts.
-4. Model drift is the LAST answer.
 Termination protocol:
 - `<summarize>` → run terminates
+- `<summarize>` + failed actions → overridden to `<update>` (continue)
 - `<update>` → run continues
-- Both → summarize wins
-- Neither + tools → stall counter
+- Both → update wins (if the model can't decide, it's not done)
+- Neither + investigation tools → stall counter (RUMMY_MAX_STALLS)
+- Neither + action-only tools → healed to summarize
 - Neither + plain text → healed to summarize
-- Repeated commands → loop detection
+- Repeated commands → loop detection (RUMMY_MAX_REPETITIONS)
+- Repeated update text → stall (RUMMY_MAX_UPDATE_REPEATS)
+Format normalization:
+- Gemma `\`\`\`tool_code` fences → stripped before parsing
+- Qwen `<|tool_call>` format → normalized to XML
+- OpenAI function_call JSON → normalized to XML
+- Mistral `[TOOL_CALLS]` → normalized to XML
+- Sed alternate delimiters (`s|old|new|`) → parsed like `s/old/new/`
 ---
-## 9. Testing
+## 10. Testing
 | Tier | Location | LLM? |
 |------|----------|------|
@@ -493,12 +682,12 @@ E2E tests must NEVER mock the LLM. Environment cascade:
 ---
-## 10. SQL Functions
+## 11. SQL Functions
 | Function | Purpose |
 |----------|---------|
 | `schemeOf(path)` | Extract URI scheme |
-| `countTokens(text)` | Token count (tiktoken o200k_base, `ceil(len/4)` fallback) |
+| `countTokens(text)` | Token count (`ceil(len / RUMMY_TOKEN_DIVISOR)`) |
 | `hedmatch(pattern, string)` | Full-string pattern match (paths, equality) |
 | `hedsearch(pattern, string)` | Substring pattern search (content filtering) |
 | `hedreplace(pattern, replacement, string)` | Pattern-based replacement |
@@ -508,15 +697,17 @@ See [PLUGINS.md](PLUGINS.md) for the hedberg pattern type reference.
 ---
-## 11. Configuration
+## 12. Configuration
 ```env
 RUMMY_HOME=~/.rummy
-RUMMY_MAX_TURNS=15
+RUMMY_TOKEN_DIVISOR=2
+RUMMY_MAX_TURNS=99
 RUMMY_MAX_STALLS=3
 RUMMY_MAX_REPETITIONS=3
+RUMMY_MAX_UPDATE_REPEATS=3
 RUMMY_RETENTION_DAYS=31
-RUMMY_TEMPERATURE=0.7
+RUMMY_TEMPERATURE=0.5
 RUMMY_DEBUG=false
 ```