npm - @miller-tech/uap - Versions diffs - 1.39.0 → 1.40.1 - Mend

@miller-tech/uap 1.39.0 → 1.40.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (99)

package/README.md +109 -642
package/dist/.tsbuildinfo +1 -1
package/dist/bin/cli.js +2 -2
package/dist/bin/cli.js.map +1 -1
package/dist/cli/deliver.d.ts +3 -2
package/dist/cli/deliver.d.ts.map +1 -1
package/dist/cli/deliver.js +10 -5
package/dist/cli/deliver.js.map +1 -1
package/docs/INDEX.md +48 -286
package/docs/architecture/OVERVIEW.md +328 -0
package/docs/architecture/PROTOCOL.md +204 -0
package/docs/benchmarks/README.md +17 -192
package/docs/getting-started/CONFIGURATION.md +237 -0
package/docs/getting-started/INSTALLATION.md +125 -0
package/docs/getting-started/QUICKSTART.md +115 -0
package/docs/guides/COORDINATION.md +162 -0
package/docs/guides/DELIVER.md +115 -0
package/docs/guides/DEPLOY_BATCHING.md +212 -0
package/docs/guides/DROIDS_AND_SKILLS.md +202 -0
package/docs/guides/LOCAL_MODELS.md +148 -0
package/docs/guides/MCP_ROUTER.md +195 -0
package/docs/guides/MEMORY.md +235 -0
package/docs/guides/MULTI_MODEL.md +223 -0
package/docs/guides/POLICIES.md +190 -0
package/docs/guides/WORKTREE_WORKFLOW.md +185 -0
package/docs/integrations/MCP_ROUTER.md +147 -0
package/docs/integrations/RTK.md +102 -0
package/docs/reference/API.md +485 -0
package/docs/reference/CLI.md +719 -0
package/docs/reference/CONFIGURATION.md +90 -193
package/docs/reference/DATABASE_SCHEMA.md +110 -344
package/docs/reference/FEATURES.md +176 -472
package/docs/reference/PATTERNS.md +102 -0
package/docs/reference/PLATFORMS.md +83 -0
package/package.json +1 -1
package/docs/AGENTS.md +0 -423
package/docs/DOCUMENTATION_AUDIT_REPORT.md +0 -131
package/docs/GETTING_STARTED.md +0 -288
package/docs/PROJECT_ANALYSIS_REPORT.md +0 -510
package/docs/architecture/COMPLETE_ARCHITECTURE.md +0 -748
package/docs/architecture/EXPERT_STACK.md +0 -137
package/docs/architecture/MULTI_MODEL.md +0 -224
package/docs/architecture/PLATFORM_GATING.md +0 -68
package/docs/architecture/SYSTEM_ANALYSIS.md +0 -334
package/docs/architecture/UAP_COMPLIANCE.md +0 -217
package/docs/architecture/UAP_PROTOCOL.md +0 -339
package/docs/architecture/UAP_STRICT_DROIDS.md +0 -172
package/docs/archive/BALLS_MODE_SELF_ANALYSIS.md +0 -260
package/docs/archive/BENCHMARK_GAPS_AND_PLAN.md +0 -146
package/docs/archive/FAILING_TASKS_SOLUTION_PLAN.md +0 -668
package/docs/archive/JINJA2-SYSTEM-MESSAGE-FIX.md +0 -209
package/docs/archive/MODEL_ROUTING_IMPLEMENTATION_SUMMARY.md +0 -281
package/docs/archive/MODEL_ROUTING_OPTIMIZATION_PLAN.md +0 -320
package/docs/archive/NPM-PUBLISH-V0.9.1.md +0 -240
package/docs/archive/OPTIMIZATION_OPTIONS.md +0 -334
package/docs/archive/PARALLELISM_GAPS_AND_OPTIONS.md +0 -422
package/docs/archive/POLICY_GATE_IMPLEMENTATION.md +0 -245
package/docs/archive/SETUP_IMPROVEMENTS.md +0 -213
package/docs/archive/UAP_GENERIC_OPTIMIZATION_PLAN.md +0 -270
package/docs/archive/UAP_OPTIMIZATION_PLAN.md +0 -701
package/docs/archive/UAP_V103_PATTERN_DESIGN.md +0 -315
package/docs/archive/UAP_V104_COMPLIANCE_DESIGN.md +0 -223
package/docs/archive/changelog/2026-03-10_uap-100-compliance.md +0 -77
package/docs/archive/changelog/2026-03-10_uap-full-system-verification.md +0 -109
package/docs/archive/opencode-integration-guide.md +0 -740
package/docs/archive/opencode-integration-quickref.md +0 -180
package/docs/benchmarks/OVERNIGHT_RUNNER.md +0 -341
package/docs/benchmarks/SPECULATIVE_DECODING_JOURNEY_2026-03.md +0 -221
package/docs/benchmarks/VALIDATION_PLAN.md +0 -568
package/docs/blog/SPECULATIVE_DECODING_PRODUCTION_PLAYBOOK.md +0 -139
package/docs/blog/local-coding-agents.md +0 -266
package/docs/blog/x-thread.md +0 -254
package/docs/deployment/DEPLOYMENT.md +0 -895
package/docs/deployment/DEPLOYMENT_STRATEGIES.md +0 -518
package/docs/deployment/DEPLOY_BATCHER_ANALYSIS.md +0 -224
package/docs/deployment/DEPLOY_BATCHING.md +0 -273
package/docs/deployment/DEPLOY_BUCKETING_ANALYSIS.md +0 -420
package/docs/deployment/QWEN35_LLAMA_CPP.md +0 -426
package/docs/deployment/UAP_LLAMA_ANTHROPIC_PROXY_BOOTSTRAP.md +0 -279
package/docs/getting-started/INTEGRATION.md +0 -628
package/docs/getting-started/OVERVIEW.md +0 -324
package/docs/getting-started/SETUP.md +0 -377
package/docs/integrations/MCP_ROUTER_SETUP.md +0 -445
package/docs/integrations/RTK_INTEGRATION.md +0 -468
package/docs/operations/TROUBLESHOOTING.md +0 -660
package/docs/pr/PR_SPECULATIVE_DOCS_TEMPLATE.md +0 -146
package/docs/pr/UPSTREAM_PRS.md +0 -424
package/docs/reference/API_REFERENCE.md +0 -903
package/docs/reference/EXPERT_DROIDS.md +0 -219
package/docs/reference/HARNESS-MATRIX.md +0 -318
package/docs/reference/PATTERN_LIBRARY.md +0 -636
package/docs/reference/UAP_CLI_REFERENCE.md +0 -620
package/docs/research/BEHAVIORAL_PATTERNS.md +0 -228
package/docs/research/DOMAIN_STRATEGIES.md +0 -316
package/docs/research/MEMORY_SYSTEMS_COMPARISON.md +0 -812
package/docs/research/PATTERN_ANALYSIS_2026-01-18.md +0 -436
package/docs/research/PERFORMANCE_ANALYSIS_2026-01-18.md +0 -209
package/docs/research/PERFORMANCE_TEST_PLAN.md +0 -383
package/docs/research/TERMINAL_BENCH_LEARNINGS.md +0 -217

package/docs/integrations/MCP_ROUTER.md ADDED

@@ -0,0 +1,147 @@
+# MCP Router
+
+`v1.40.0` · `src/mcp-router/`
+
+The MCP Router is a hierarchical Model Context Protocol server that sits in
+front of all of your downstream MCP servers and dramatically reduces the tokens
+the model spends on tool definitions and tool output. It is the mechanism behind
+UAP's "up to 98% savings on large tool calls."
+
+For where it fits in the wider system, see
+[../architecture/OVERVIEW.md](../architecture/OVERVIEW.md#mcp-router-srcmcp-router).
+
+---
+
+## Why it exists
+
+A normal MCP setup exposes every tool from every server directly to the model.
+With a dozen servers that is easily 150+ tool schemas at roughly 500 tokens
+each, so tens of thousands of tokens of context are burned before the agent does
+any work. On top of that, tools like file readers and shell wrappers return large
+outputs that flood the context window.
+
+The router fixes both:
+
+1. **Tool hiding.** It exposes just three meta-tools instead of every
+   downstream tool. The documented design target is ~75,000 tokens of tool
+   definitions collapsed to ~700 (`src/mcp-router/index.ts`,
+   `src/mcp-router/server.ts`).
+2. **Output compression.** Large tool results are indexed into an in-memory
+   SQLite **FTS5** table and only the most relevant snippets are returned
+   (`src/mcp-router/output-compressor.ts`).
+
+---
+
+## How it works
+
+### The three meta-tools
+
+Instead of N downstream tools, the model sees:
+
+| Meta-tool | What it does |
+|-----------|--------------|
+| `discover_tools` | Natural-language query → matching downstream tool paths |
+| `execute_tool`   | Run a tool by `path` with `args` (+ optional `intent`) |
+| `deliver`        | Run the `uap deliver` convergence loop |
+
+Downstream tools are loaded into an in-memory fuzzy search index at startup and
+are never surfaced as definitions. The agent's flow is:
+
+```
+discover_tools("read the auth config")
+        │  → [ "filesystem.read_file", ... ]
+        ▼
+execute_tool({ path: "filesystem.read_file",
+               args: { path: "src/auth.ts" },
+               intent: "csrf token validation" })
+        │
+        ▼
+  ┌──────────── output compressor ─────────────┐
+  │ small result → passthrough                 │
+  │ large result → FTS5 index + BM25(intent)   │
+  │   → top snippets + searchable-vocab footer │
+  │ huge / no intent → head+tail truncation    │
+  └────────────────────────────────────────────┘
+```
+
+The `intent` string on `execute_tool` is what drives the BM25 query, so provide a
+focused intent to get focused snippets. The model can then issue a follow-up
+`execute_tool` with a refined intent using the vocabulary footer.
+
+---
+
+## Setup
+
+### One command, all harnesses
+
+```bash
+uap mcp-setup
+```
+
+`uap mcp-setup` (`src/cli/setup-mcp-router.ts`) configures the MCP Router as the
+single MCP server across your AI harnesses. It writes a `mcpServers.router`
+entry pointing at the router and migrates/backs up any existing servers
+(prompts unless `--force`), then validates the result with `uap mcp-router list`.
+
+Harnesses configured (global `~/` config paths):
+
+| Harness | Config file |
+|---------|-------------|
+| Claude Code | `~/.claude/settings.json` |
+| Factory.AI | `~/.factory/mcp.json` |
+| VSCode | `~/.vscode/mcp.json` (skipped if absent) |
+| Cursor | `~/.cursor/settings.json` |
+
+The router entry it writes looks like:
+
+```json
+{
+  "mcpServers": {
+    "router": {
+      "command": "npx",
+      "args": ["uap", "mcp-router", "start"]
+    }
+  }
+}
+```
+
+### Running and inspecting the router
+
+`uap mcp-router <action>` (`src/cli/mcp-router.ts`) drives the router directly:
+
+```bash
+uap mcp-router start      # run the stdio MCP server (what harnesses launch)
+uap mcp-router list       # list discovered downstream tools
+uap mcp-router discover   # try a natural-language tool discovery query
+uap mcp-router stats      # token-savings stats
+```
+
+`uap mcp-router start` is the command harnesses invoke via the generated config;
+you normally don't run it by hand.
+
+---
+
+## Verifying it works
+
+```bash
+uap mcp-router list       # should enumerate tools from your downstream servers
+uap mcp-router stats      # shows tool-hiding savings + per-output compression
+uap hooks doctor          # confirms gate/router wiring across harnesses
+```
+
+If `list` is empty, the router found no downstream MCP configs. Confirm that your
+harness still has its original MCP servers defined (they are migrated into the
+router's view, not deleted) and re-run `uap mcp-setup`.
+
+---
+
+## Notes
+
+- The router reads downstream MCP configs from Claude Desktop, Cursor, VSCode,
+  Claude Code CLI, Factory.AI, and a local `mcp.json`, expands `~`/env vars,
+  skips disabled servers, and refuses to reference itself.
+- The 98% / 75k→700 figures are the documented design target for tool hiding;
+  per-output FTS5 savings are computed live for each call and reported by
+  `uap mcp-router stats`.
+- Pair the router with **RTK** for CLI-output savings; see
+  [RTK.md](RTK.md). The two are complementary (tool definitions + CLI output).
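
Reviewer sketch (not part of the package diff). The compression tiers in the MCP_ROUTER.md diagram above (passthrough, FTS5 index plus BM25 ranking on the intent, head+tail truncation) could look roughly like the Python below. This is only an illustration of the technique, not the contents of `src/mcp-router/output-compressor.ts`: the thresholds, the line-based chunking, and the names `compress` and `head_tail` are all assumptions, the searchable-vocab footer is omitted, and it presumes a Python `sqlite3` build with FTS5 enabled.

```python
import re
import sqlite3

# All three limits are hypothetical; the diff does not state the real thresholds.
SMALL_LIMIT = 2_000    # results up to this many characters pass through untouched
HUGE_LIMIT = 200_000   # results beyond this are truncated without indexing
CHUNK_LINES = 20       # lines per indexed chunk


def head_tail(text: str, keep: int = 500) -> str:
    """Keep the first and last `keep` characters of an oversized result."""
    if len(text) <= 2 * keep:
        return text
    omitted = len(text) - 2 * keep
    return f"{text[:keep]}\n... [{omitted} chars omitted] ...\n{text[-keep:]}"


def compress(output: str, intent: str = "", top_k: int = 3) -> str:
    """Return the output unchanged, its best-matching chunks, or a head+tail cut."""
    if len(output) <= SMALL_LIMIT:
        return output  # small result: passthrough

    terms = re.findall(r"\w+", intent)
    if not terms or len(output) > HUGE_LIMIT:
        return head_tail(output)  # huge result or no intent: truncate

    # Large result with an intent: index fixed-size chunks in in-memory FTS5.
    lines = output.splitlines()
    chunks = [
        "\n".join(lines[i:i + CHUNK_LINES])
        for i in range(0, len(lines), CHUNK_LINES)
    ]
    db = sqlite3.connect(":memory:")
    try:
        db.execute("CREATE VIRTUAL TABLE chunks USING fts5(body)")
        db.executemany("INSERT INTO chunks(body) VALUES (?)", [(c,) for c in chunks])
        # Quote each term so it is matched literally; OR keeps recall high.
        query = " OR ".join(f'"{t}"' for t in terms)
        # bm25() yields lower values for better matches, so ascending order
        # returns the most relevant chunks first.
        rows = db.execute(
            "SELECT body FROM chunks WHERE chunks MATCH ? "
            "ORDER BY bm25(chunks) LIMIT ?",
            (query, top_k),
        ).fetchall()
    finally:
        db.close()

    if not rows:
        return head_tail(output)  # nothing matched the intent
    return "\n---\n".join(row[0] for row in rows)
```

Calling `compress(big_output, "csrf token validation")` returns only the chunks that mention those terms, which mirrors why the doc recommends a focused `intent`.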

package/docs/integrations/RTK.md ADDED

@@ -0,0 +1,102 @@
+# RTK (Rust Token Killer)
+
+`v1.40.0` · `src/cli/rtk.ts`
+
+RTK (Rust Token Killer) is a fast CLI proxy that compresses and filters the
+output of command-line tools (`git status`, test runs, file reads, and similar
+heavy commands) to cut the tokens your agent spends echoing terminal output.
+The source describes it as giving **60–90% token savings on CLI command output**.
+
+RTK is a separate, open-source tool (`https://github.com/rtk-ai/rtk`,
+docs at `https://www.rtk-ai.app`). UAP integrates with it but does not bundle
+it; `uap rtk` manages installation and wiring.
+
+---
+
+## Why RTK + the MCP Router
+
+The two integrations target different sources of token waste and stack:
+
+| Layer | Tool | Saves on |
+|-------|------|----------|
+| MCP tool definitions + tool output | [MCP Router](MCP_ROUTER.md) | ~98% of tool-definition tokens |
+| Raw CLI command output | **RTK** | 60–90% of CLI-output tokens |
+
+The source describes the combination as **95%+ total token reduction**
+(`src/cli/rtk.ts`).
+
+---
+
+## How UAP integrates RTK
+
+`uap rtk <command>` (`src/cli/rtk.ts`):
+
+```bash
+uap rtk install     # install RTK, auto-detecting the best method
+uap rtk status      # check install + hook wiring + recent savings
+uap rtk help        # usage
+```
+
+### `uap rtk install`
+
+Auto-detects the best install method (Homebrew → Cargo → pre-built binary via
+curl) and runs it. Override with flags:
+
+```bash
+uap rtk install --method homebrew     # force a method: homebrew | cargo | curl
+uap rtk install --force               # reinstall
+```
+
+Equivalent manual installs:
+
+```bash
+brew install rtk                                          # Homebrew
+cargo install --git https://github.com/rtk-ai/rtk        # Cargo
+# or download a release binary from:
+#   https://github.com/rtk-ai/rtk/releases
+```
+
+After install, initialize and verify:
+
+```bash
+rtk init --global    # set up the global rewrite hook
+rtk gain             # show token savings analytics
+```
+
+### `uap rtk status`
+
+Reports whether the `rtk` binary is installed, whether the rewrite hook
+(`~/.claude/hooks/rtk-rewrite.sh`) is wired in, and recent savings from
+`rtk gain`.
+
+---
+
+## How it works in practice
+
+Once the rewrite hook is installed, heavy CLI commands are transparently routed
+through RTK (e.g. `git status` is rewritten to `rtk git status`) with zero
+extra tokens of overhead: the agent issues normal commands and RTK compresses
+the output before it reaches the model.
+
+UAP can nudge agents to route heavy CLIs through RTK via the `rtk_wrap.py`
+policy enforcer (`src/policies/enforcers/rtk_wrap.py`).
+
+Useful RTK meta-commands:
+
+```bash
+rtk gain              # token savings analytics
+rtk gain --history    # command usage history with savings
+rtk discover          # find missed savings opportunities
+rtk --version         # verify the install
+```
+
+---
+
+## Combined analytics
+
+`uap rtk` can surface unified analytics combining MCP Router and RTK savings
+(`showUnifiedAnalytics` in `src/cli/rtk.ts`), so you can see total context
+reduction from both layers at once.
+
+See also: [MCP_ROUTER.md](MCP_ROUTER.md) ·
+[../architecture/OVERVIEW.md](../architecture/OVERVIEW.md)
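
Reviewer sketch (not part of the package diff). The rewrite behavior described in RTK.md above, where `git status` becomes `rtk git status`, can be illustrated with a few lines of Python. This is an assumption-laden sketch of the idea only: the real hook is the shell script `~/.claude/hooks/rtk-rewrite.sh`, whose logic is not shown in this diff, and both the `HEAVY_COMMANDS` set and the `rewrite` function are hypothetical.

```python
import shlex

# Hypothetical allow-list; the commands RTK actually rewrites are not listed in the diff.
HEAVY_COMMANDS = {"git", "cargo", "pytest"}


def rewrite(command: str) -> str:
    """Prefix a heavy command with `rtk`; leave everything else untouched."""
    try:
        tokens = shlex.split(command)
    except ValueError:
        return command  # unbalanced quotes: do not guess, pass through
    if not tokens:
        return command  # empty or whitespace-only input
    program = tokens[0]
    if program == "rtk" or program not in HEAVY_COMMANDS:
        return command  # already wrapped, or not worth compressing
    return f"rtk {command}"
```

Only the first token is inspected, so compound commands such as `cd repo && git status` pass through unchanged in this sketch; a real hook would need to decide how to handle them.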