@miller-tech/uap 1.40.0 → 1.41.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +109 -642
- package/dist/.tsbuildinfo +1 -1
- package/dist/cli/deliver-defaults.d.ts +23 -0
- package/dist/cli/deliver-defaults.d.ts.map +1 -0
- package/dist/cli/deliver-defaults.js +121 -0
- package/dist/cli/deliver-defaults.js.map +1 -0
- package/dist/cli/init.d.ts.map +1 -1
- package/dist/cli/init.js +29 -0
- package/dist/cli/init.js.map +1 -1
- package/dist/cli/setup.d.ts.map +1 -1
- package/dist/cli/setup.js +19 -0
- package/dist/cli/setup.js.map +1 -1
- package/dist/policies/policy-tools.d.ts +7 -0
- package/dist/policies/policy-tools.d.ts.map +1 -1
- package/dist/policies/policy-tools.js +24 -2
- package/dist/policies/policy-tools.js.map +1 -1
- package/docs/INDEX.md +48 -286
- package/docs/architecture/OVERVIEW.md +328 -0
- package/docs/architecture/PROTOCOL.md +204 -0
- package/docs/benchmarks/README.md +17 -192
- package/docs/getting-started/CONFIGURATION.md +237 -0
- package/docs/getting-started/INSTALLATION.md +125 -0
- package/docs/getting-started/QUICKSTART.md +115 -0
- package/docs/guides/COORDINATION.md +162 -0
- package/docs/guides/DELIVER.md +115 -0
- package/docs/guides/DEPLOY_BATCHING.md +212 -0
- package/docs/guides/DROIDS_AND_SKILLS.md +202 -0
- package/docs/guides/LOCAL_MODELS.md +148 -0
- package/docs/guides/MCP_ROUTER.md +195 -0
- package/docs/guides/MEMORY.md +235 -0
- package/docs/guides/MULTI_MODEL.md +223 -0
- package/docs/guides/POLICIES.md +190 -0
- package/docs/guides/WORKTREE_WORKFLOW.md +185 -0
- package/docs/integrations/MCP_ROUTER.md +147 -0
- package/docs/integrations/RTK.md +102 -0
- package/docs/reference/API.md +485 -0
- package/docs/reference/CLI.md +719 -0
- package/docs/reference/CONFIGURATION.md +90 -193
- package/docs/reference/DATABASE_SCHEMA.md +110 -344
- package/docs/reference/FEATURES.md +176 -472
- package/docs/reference/PATTERNS.md +102 -0
- package/docs/reference/PLATFORMS.md +83 -0
- package/package.json +3 -1
- package/src/policies/enforcers/7ebbc721-7540-4e9f-879a-770e0213a09b_architecture_review.py +101 -0
- package/src/policies/enforcers/__pycache__/_common.cpython-312.pyc +0 -0
- package/src/policies/enforcers/_common.py +100 -0
- package/src/policies/enforcers/artifact_hygiene.py +52 -0
- package/src/policies/enforcers/cluster_routing.py +63 -0
- package/src/policies/enforcers/codebase_read_before_plan.py +52 -0
- package/src/policies/enforcers/coord_overlap.py +81 -0
- package/src/policies/enforcers/delivery_enforcement.py +97 -0
- package/src/policies/enforcers/doc_live_over_report.py +50 -0
- package/src/policies/enforcers/expert_review_required.py +135 -0
- package/src/policies/enforcers/iac_parity.py +53 -0
- package/src/policies/enforcers/mcp_router_first.py +37 -0
- package/src/policies/enforcers/memory_before_plan.py +61 -0
- package/src/policies/enforcers/parallel_reads.py +50 -0
- package/src/policies/enforcers/rtk_wrap.py +44 -0
- package/src/policies/enforcers/schema_diff_gate.py +80 -0
- package/src/policies/enforcers/session_memory_write.py +52 -0
- package/src/policies/enforcers/task_required.py +131 -0
- package/src/policies/enforcers/test_gate.py +58 -0
- package/src/policies/enforcers/validate_plan_before_build.py +75 -0
- package/src/policies/enforcers/worktree_required.py +57 -0
- package/src/policies/schemas/policies/architecture-review.md +51 -0
- package/src/policies/schemas/policies/artifact-hygiene.md +29 -0
- package/src/policies/schemas/policies/cluster-routing.md +31 -0
- package/src/policies/schemas/policies/codebase-read-before-plan.md +30 -0
- package/src/policies/schemas/policies/coord-overlap.md +24 -0
- package/src/policies/schemas/policies/delivery-enforcement.md +45 -0
- package/src/policies/schemas/policies/doc-live-over-report.md +32 -0
- package/src/policies/schemas/policies/expert-review-required.md +60 -0
- package/src/policies/schemas/policies/iac-parity.md +31 -0
- package/src/policies/schemas/policies/mandatory-testing-deployment.md +147 -0
- package/src/policies/schemas/policies/mcp-router-first.md +24 -0
- package/src/policies/schemas/policies/memory-before-plan.md +24 -0
- package/src/policies/schemas/policies/merge-deploy-monitor-verify.md +145 -0
- package/src/policies/schemas/policies/parallel-reads.md +24 -0
- package/src/policies/schemas/policies/rtk-wrap.md +26 -0
- package/src/policies/schemas/policies/schema-diff-gate.md +30 -0
- package/src/policies/schemas/policies/session-memory-write.md +24 -0
- package/src/policies/schemas/policies/task-required.md +49 -0
- package/src/policies/schemas/policies/test-gate.md +24 -0
- package/src/policies/schemas/policies/validate-plan-before-build.md +28 -0
- package/src/policies/schemas/policies/worktree-required.md +28 -0
- package/templates/hooks/uap-policy-gate.sh +5 -0
- package/docs/AGENTS.md +0 -423
- package/docs/DOCUMENTATION_AUDIT_REPORT.md +0 -131
- package/docs/GETTING_STARTED.md +0 -288
- package/docs/PROJECT_ANALYSIS_REPORT.md +0 -510
- package/docs/architecture/COMPLETE_ARCHITECTURE.md +0 -748
- package/docs/architecture/EXPERT_STACK.md +0 -137
- package/docs/architecture/MULTI_MODEL.md +0 -224
- package/docs/architecture/PLATFORM_GATING.md +0 -68
- package/docs/architecture/SYSTEM_ANALYSIS.md +0 -334
- package/docs/architecture/UAP_COMPLIANCE.md +0 -217
- package/docs/architecture/UAP_PROTOCOL.md +0 -339
- package/docs/architecture/UAP_STRICT_DROIDS.md +0 -172
- package/docs/archive/BALLS_MODE_SELF_ANALYSIS.md +0 -260
- package/docs/archive/BENCHMARK_GAPS_AND_PLAN.md +0 -146
- package/docs/archive/FAILING_TASKS_SOLUTION_PLAN.md +0 -668
- package/docs/archive/JINJA2-SYSTEM-MESSAGE-FIX.md +0 -209
- package/docs/archive/MODEL_ROUTING_IMPLEMENTATION_SUMMARY.md +0 -281
- package/docs/archive/MODEL_ROUTING_OPTIMIZATION_PLAN.md +0 -320
- package/docs/archive/NPM-PUBLISH-V0.9.1.md +0 -240
- package/docs/archive/OPTIMIZATION_OPTIONS.md +0 -334
- package/docs/archive/PARALLELISM_GAPS_AND_OPTIONS.md +0 -422
- package/docs/archive/POLICY_GATE_IMPLEMENTATION.md +0 -245
- package/docs/archive/SETUP_IMPROVEMENTS.md +0 -213
- package/docs/archive/UAP_GENERIC_OPTIMIZATION_PLAN.md +0 -270
- package/docs/archive/UAP_OPTIMIZATION_PLAN.md +0 -701
- package/docs/archive/UAP_V103_PATTERN_DESIGN.md +0 -315
- package/docs/archive/UAP_V104_COMPLIANCE_DESIGN.md +0 -223
- package/docs/archive/changelog/2026-03-10_uap-100-compliance.md +0 -77
- package/docs/archive/changelog/2026-03-10_uap-full-system-verification.md +0 -109
- package/docs/archive/opencode-integration-guide.md +0 -740
- package/docs/archive/opencode-integration-quickref.md +0 -180
- package/docs/benchmarks/OVERNIGHT_RUNNER.md +0 -341
- package/docs/benchmarks/SPECULATIVE_DECODING_JOURNEY_2026-03.md +0 -221
- package/docs/benchmarks/VALIDATION_PLAN.md +0 -568
- package/docs/blog/SPECULATIVE_DECODING_PRODUCTION_PLAYBOOK.md +0 -139
- package/docs/blog/local-coding-agents.md +0 -266
- package/docs/blog/x-thread.md +0 -254
- package/docs/deployment/DEPLOYMENT.md +0 -895
- package/docs/deployment/DEPLOYMENT_STRATEGIES.md +0 -518
- package/docs/deployment/DEPLOY_BATCHER_ANALYSIS.md +0 -224
- package/docs/deployment/DEPLOY_BATCHING.md +0 -273
- package/docs/deployment/DEPLOY_BUCKETING_ANALYSIS.md +0 -420
- package/docs/deployment/QWEN35_LLAMA_CPP.md +0 -426
- package/docs/deployment/UAP_LLAMA_ANTHROPIC_PROXY_BOOTSTRAP.md +0 -279
- package/docs/getting-started/INTEGRATION.md +0 -628
- package/docs/getting-started/OVERVIEW.md +0 -324
- package/docs/getting-started/SETUP.md +0 -377
- package/docs/integrations/MCP_ROUTER_SETUP.md +0 -445
- package/docs/integrations/RTK_INTEGRATION.md +0 -468
- package/docs/operations/TROUBLESHOOTING.md +0 -660
- package/docs/pr/PR_SPECULATIVE_DOCS_TEMPLATE.md +0 -146
- package/docs/pr/UPSTREAM_PRS.md +0 -424
- package/docs/reference/API_REFERENCE.md +0 -903
- package/docs/reference/EXPERT_DROIDS.md +0 -219
- package/docs/reference/HARNESS-MATRIX.md +0 -318
- package/docs/reference/PATTERN_LIBRARY.md +0 -636
- package/docs/reference/UAP_CLI_REFERENCE.md +0 -620
- package/docs/research/BEHAVIORAL_PATTERNS.md +0 -228
- package/docs/research/DOMAIN_STRATEGIES.md +0 -316
- package/docs/research/MEMORY_SYSTEMS_COMPARISON.md +0 -812
- package/docs/research/PATTERN_ANALYSIS_2026-01-18.md +0 -436
- package/docs/research/PERFORMANCE_ANALYSIS_2026-01-18.md +0 -209
- package/docs/research/PERFORMANCE_TEST_PLAN.md +0 -383
- package/docs/research/TERMINAL_BENCH_LEARNINGS.md +0 -217
package/docs/INDEX.md
CHANGED
|
@@ -1,306 +1,68 @@
|
|
|
1
1
|
# UAP Documentation
|
|
2
2
|
|
|
3
|
-
|
|
4
|
-
> **Last Updated:** 2026-03-28
|
|
5
|
-
> **License:** MIT
|
|
3
|
+
The complete documentation for the **Universal Agent Protocol** (`@miller-tech/uap` v1.40.0) — a layer that gives AI coding agents memory, judgment, and the discipline to finish the job.
|
|
6
4
|
|
|
7
|
-
|
|
8
|
-
|
|
9
|
-
## Quick Links
|
|
10
|
-
|
|
11
|
-
| Section | Description |
|
|
12
|
-
|---------|-------------|
|
|
13
|
-
| [Getting Started](#getting-started) | Installation and quick start |
|
|
14
|
-
| [Architecture](#architecture) | System design and components |
|
|
15
|
-
| [Reference](#reference) | API documentation and CLI commands |
|
|
16
|
-
| [Deployment](#deployment) | Production deployment guides |
|
|
17
|
-
| [Benchmarks](#benchmarks) | Performance results and validation |
|
|
18
|
-
| [Operations](#operations) | Troubleshooting and maintenance |
|
|
5
|
+
New here? Start with the [project README](../README.md), then [Getting Started](getting-started/).
|
|
19
6
|
|
|
20
7
|
---
|
|
21
8
|
|
|
22
|
-
## Getting
|
|
23
|
-
|
|
24
|
-
|
|
25
|
-
|
|
26
|
-
|
|
27
|
-
|
|
28
|
-
|
|
29
|
-
|
|
30
|
-
|
|
31
|
-
|
|
32
|
-
|
|
33
|
-
|
|
34
|
-
|
|
35
|
-
|
|
36
|
-
|
|
37
|
-
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
|
|
41
|
-
|
|
42
|
-
|
|
43
|
-
|
|
44
|
-
|
|
45
|
-
# Create a tracked task
|
|
46
|
-
uap task create "Implement feature X"
|
|
47
|
-
|
|
48
|
-
# Use worktree for safe development
|
|
49
|
-
uap worktree create feature-name
|
|
50
|
-
cd .worktrees/NNN-feature-name/
|
|
51
|
-
# Make changes...
|
|
52
|
-
uap worktree pr <id>
|
|
53
|
-
```
|
|
54
|
-
|
|
55
|
-
### Prerequisites
|
|
56
|
-
|
|
57
|
-
- Node.js >= 18.0.0
|
|
58
|
-
- npm or yarn
|
|
59
|
-
- Optional: Docker (for Qdrant semantic search)
|
|
60
|
-
|
|
61
|
-
---
|
|
9
|
+
## Getting started
|
|
10
|
+
|
|
11
|
+
| Doc | What it covers |
|
|
12
|
+
|---|---|
|
|
13
|
+
| [Installation](getting-started/INSTALLATION.md) | Prerequisites, `npm install -g @miller-tech/uap`, what `uap setup` does, per-harness hook install |
|
|
14
|
+
| [Quickstart](getting-started/QUICKSTART.md) | 5-minute path: setup → memory → `uap deliver` → dashboard |
|
|
15
|
+
| [Configuration](getting-started/CONFIGURATION.md) | `.uap.json`, environment variables, Qdrant, model profiles |
|
|
16
|
+
|
|
17
|
+
## Guides
|
|
18
|
+
|
|
19
|
+
| Doc | What it covers |
|
|
20
|
+
|---|---|
|
|
21
|
+
| [**`uap deliver`**](guides/DELIVER.md) | The delivery harness — convergence loop to verified completion ⭐ |
|
|
22
|
+
| [Memory](guides/MEMORY.md) | The 4-tier memory system, write-gates, semantic recall |
|
|
23
|
+
| [MCP Router](guides/MCP_ROUTER.md) | Token-optimizing tool proxy + FTS5 output compression |
|
|
24
|
+
| [Worktree Workflow](guides/WORKTREE_WORKFLOW.md) | Branch-per-feature isolation, auto-PR, enforcement |
|
|
25
|
+
| [Policies](guides/POLICIES.md) | Executable policy gates that block non-compliant tool calls |
|
|
26
|
+
| [Multi-Model Routing](guides/MULTI_MODEL.md) | Plan → route → execute across 7 model profiles |
|
|
27
|
+
| [Droids & Skills](guides/DROIDS_AND_SKILLS.md) | 38 expert droids, 32 skills, the expert router |
|
|
28
|
+
| [Deploy Batching](guides/DEPLOY_BATCHING.md) | Conflict-free batched git/deploy actions |
|
|
29
|
+
| [Coordination](guides/COORDINATION.md) | Multi-agent overlap detection |
|
|
30
|
+
| [Local Models](guides/LOCAL_MODELS.md) | Running agents against local llama.cpp / Qwen models |
|
|
62
31
|
|
|
63
32
|
## Architecture
|
|
64
33
|
|
|
65
|
-
|
|
66
|
-
|
|
67
|
-
|
|
68
|
-
|
|
69
|
-
```
|
|
70
|
-
┌─────────────────────────────────────────────────────────────┐
|
|
71
|
-
│ AI Agent Layer │
|
|
72
|
-
│ (Claude, Factory.AI, OpenCode, etc.) │
|
|
73
|
-
└──────────────────────┬──────────────────────────────────────┘
|
|
74
|
-
│
|
|
75
|
-
┌──────────────────────▼──────────────────────────────────────┐
|
|
76
|
-
│ UAP Core Layer │
|
|
77
|
-
│ CLI │ Memory │ Coordination │ Task │ Deploy Batcher │
|
|
78
|
-
└──────────────────────┬──────────────────────────────────────┘
|
|
79
|
-
│
|
|
80
|
-
┌──────────────────────▼──────────────────────────────────────┐
|
|
81
|
-
│ │ CLI │ Memory │ Coordination │ Task │ Deploy Batcher │
|
|
82
|
-
└──────────────────────┬──────────────────────────────────────┘
|
|
83
|
-
│
|
|
84
|
-
┌──────────────────────▼──────────────────────────────────────┐
|
|
85
|
-
│ │ CLI │ Memory │ Coordination │ Task │ Deploy Batcher │
|
|
86
|
-
└──────────────────────┬──────────────────────────────────────┘
|
|
87
|
-
│
|
|
88
|
-
┌──────────────────────▼──────────────────────────────────────┐
|
|
89
|
-
│ Storage Layer │
|
|
90
|
-
│ SQLite (Working) │ Qdrant (Semantic) │ Git (History) │
|
|
91
|
-
└─────────────────────────────────────────────────────────────┘
|
|
92
|
-
```
|
|
93
|
-
|
|
94
|
-
### Key Components
|
|
95
|
-
|
|
96
|
-
| Component | Purpose | Impact |
|
|
97
|
-
|-----------|---------|--------|
|
|
98
|
-
| **Memory System** | 4-layer persistent context | 10x retention |
|
|
99
|
-
| **Coordination** | Multi-agent sync via SQLite | Zero conflicts |
|
|
100
|
-
| **Task Management** | Structured workflow tracking | Full lifecycle |
|
|
101
|
-
| **Deploy Batcher** | CI/CD optimization | 50-80% savings |
|
|
102
|
-
| **MCP Router** | Meta-tool routing | 98% token reduction |
|
|
103
|
-
| **Worktrees** | Safe isolated development | No main branch pollution |
|
|
104
|
-
|
|
105
|
-
---
|
|
34
|
+
| Doc | What it covers |
|
|
35
|
+
|---|---|
|
|
36
|
+
| [Overview](architecture/OVERVIEW.md) | System architecture, subsystems, tool-call flow |
|
|
37
|
+
| [Protocol](architecture/PROTOCOL.md) | The harness↔UAP contract, hook lifecycle, decision loop |
|
|
106
38
|
|
|
107
39
|
## Reference
|
|
108
40
|
|
|
109
|
-
|
|
110
|
-
|
|
111
|
-
|
|
112
|
-
|
|
113
|
-
|
|
114
|
-
|
|
115
|
-
|
|
116
|
-
|
|
117
|
-
|
|
118
|
-
```
|
|
119
|
-
|
|
120
|
-
#### Memory Operations
|
|
121
|
-
|
|
122
|
-
```bash
|
|
123
|
-
# Store and query memories
|
|
124
|
-
uap memory store "Best practice: validate inputs"
|
|
125
|
-
uap memory query "authentication" --top-k 5
|
|
126
|
-
uap memory status
|
|
127
|
-
```
|
|
128
|
-
|
|
129
|
-
#### Worktree Workflow
|
|
130
|
-
|
|
131
|
-
```bash
|
|
132
|
-
# Safe isolated development
|
|
133
|
-
uap worktree create feature-name
|
|
134
|
-
cd .worktrees/NNN-feature-name/
|
|
135
|
-
g changes...
|
|
136
|
-
uap worktree pr <id>
|
|
137
|
-
uap worktree cleanup <id> # After merge
|
|
138
|
-
```
|
|
139
|
-
|
|
140
|
-
#### Compliance
|
|
141
|
-
|
|
142
|
-
```bash
|
|
143
|
-
# Verify protocol compliance
|
|
144
|
-
uap compliance check
|
|
145
|
-
uap hooks install all
|
|
146
|
-
```
|
|
147
|
-
|
|
148
|
-
### Database Schema
|
|
149
|
-
|
|
150
|
-
UAP uses SQLite for structured data and Qdrant for semantic search:
|
|
151
|
-
|
|
152
|
-
**SQLite Tables:**
|
|
153
|
-
- `memories` - Short-term working memory (50 entries)
|
|
154
|
-
- `session_memories` - Current session state
|
|
155
|
-
- `entities/relationships` - Knowledge graph
|
|
156
|
-
- `agent_registry` - Multi-agent coordination
|
|
157
|
-
- `deploy_queue` - CI/CD batching queue
|
|
158
|
-
|
|
159
|
-
**Qdrant Collections:**
|
|
160
|
-
- `agent_memory` - Semantic embeddings (384-dim vectors)
|
|
161
|
-
- `agent_patterns` - Pattern library indexing
|
|
162
|
-
|
|
163
|
-
---
|
|
164
|
-
|
|
165
|
-
## Deployment
|
|
166
|
-
|
|
167
|
-
### Production Setup
|
|
168
|
-
|
|
169
|
-
```bash
|
|
170
|
-
# Start Qdrant for semantic search
|
|
171
|
-
cd agents && docker-compose up -d
|
|
172
|
-
|
|
173
|
-
# Install hooks
|
|
174
|
-
uap hooks install all
|
|
41
|
+
| Doc | What it covers |
|
|
42
|
+
|---|---|
|
|
43
|
+
| [CLI](reference/CLI.md) | Every `uap` command and flag |
|
|
44
|
+
| [API](reference/API.md) | Programmatic API surface |
|
|
45
|
+
| [Features](reference/FEATURES.md) | Full feature catalog by subsystem |
|
|
46
|
+
| [Patterns](reference/PATTERNS.md) | The 23 Terminal-Bench patterns |
|
|
47
|
+
| [Platforms](reference/PLATFORMS.md) | The 9 supported harnesses + support matrix |
|
|
48
|
+
| [Configuration](reference/CONFIGURATION.md) | All config files and env vars |
|
|
49
|
+
| [Database Schema](reference/DATABASE_SCHEMA.md) | SQLite databases + Qdrant collections |
|
|
175
50
|
|
|
176
|
-
|
|
177
|
-
uap task ready
|
|
178
|
-
```
|
|
51
|
+
## Integrations
|
|
179
52
|
|
|
180
|
-
|
|
181
|
-
|
|
182
|
-
|
|
183
|
-
|
|
184
|
-
```yaml
|
|
185
|
-
name: UAP CI
|
|
186
|
-
on: [push, pull_request]
|
|
187
|
-
jobs:
|
|
188
|
-
build:
|
|
189
|
-
runs-on: ubuntu-latest
|
|
190
|
-
steps:
|
|
191
|
-
- uses: actions/checkout@v4
|
|
192
|
-
- name: Run UAP benchmarks
|
|
193
|
-
run: npm run benchmark:full
|
|
194
|
-
```
|
|
195
|
-
|
|
196
|
-
### Model Providers
|
|
197
|
-
|
|
198
|
-
| Provider | Use Case | Configuration |
|
|
199
|
-
|----------|----------|---------------|
|
|
200
|
-
| **Qwen3.5** | General tasks | Default, cost-effective |
|
|
201
|
-
| **Claude Opus** | Complex reasoning | High-accuracy scenarios |
|
|
202
|
-
| **GPT-4** | Analysis tasks | Code generation |
|
|
203
|
-
| **Local (llama.cpp)** | Privacy-focused | Self-hosted deployments |
|
|
204
|
-
|
|
205
|
-
---
|
|
53
|
+
| Doc | What it covers |
|
|
54
|
+
|---|---|
|
|
55
|
+
| [MCP Router](integrations/MCP_ROUTER.md) | Setting up the MCP router across harnesses |
|
|
56
|
+
| [RTK](integrations/RTK.md) | Rust Token Killer integration |
|
|
206
57
|
|
|
207
58
|
## Benchmarks
|
|
208
59
|
|
|
209
|
-
|
|
210
|
-
|
|
211
|
-
|
|
|
212
|
-
|
|
213
|
-
|
|
|
214
|
-
| **Tokens/Task** | 52K | **23.4K** | -55% |
|
|
215
|
-
| **Time/Task** | 45s | **32s** | -29% |
|
|
216
|
-
| **Error Rate** | 12% | **0%** | -100% |
|
|
217
|
-
|
|
218
|
-
### Full Benchmark Suite
|
|
219
|
-
|
|
220
|
-
Run the complete benchmark suite:
|
|
221
|
-
|
|
222
|
-
```bash
|
|
223
|
-
# Quick test (10 tasks)
|
|
224
|
-
npm run benchmark:short
|
|
225
|
-
|
|
226
|
-
# Full validation (14 tasks)
|
|
227
|
-
npm run benchmark:full
|
|
228
|
-
|
|
229
|
-
# Overnight extended run
|
|
230
|
-
npm run benchmark:overnight
|
|
231
|
-
```
|
|
232
|
-
|
|
233
|
-
Results are documented in [COMPREHENSIVE_BENCHMARKS.md](benchmarks/COMPREHENSIVE_BENCHMARKS.md).
|
|
234
|
-
|
|
235
|
-
---
|
|
236
|
-
|
|
237
|
-
## Operations
|
|
238
|
-
|
|
239
|
-
### Troubleshooting
|
|
240
|
-
|
|
241
|
-
| Issue | Solution |
|
|
242
|
-
|-------|----------|
|
|
243
|
-
| `Qdrant connection failed` | `cd agents && docker-compose up -d` |
|
|
244
|
-
| `Worktree already exists` | `uap worktree cleanup <id>` |
|
|
245
|
-
| `Memory DB locked` | Close other processes using the DB |
|
|
246
|
-
| `Compliance check failed` | Review specific gate failure in output |
|
|
247
|
-
|
|
248
|
-
### Debug Mode
|
|
249
|
-
|
|
250
|
-
```bash
|
|
251
|
-
# Enable verbose logging
|
|
252
|
-
export UAP_VERBOSE=true
|
|
253
|
-
|
|
254
|
-
# Check memory queries
|
|
255
|
-
uap task ready --verbose
|
|
256
|
-
|
|
257
|
-
# Inspect database directly
|
|
258
|
-
sqlite3 ./agents/data/memory/short_term.db ".tables"
|
|
259
|
-
```
|
|
260
|
-
|
|
261
|
-
---
|
|
60
|
+
| Doc | What it covers |
|
|
61
|
+
|---|---|
|
|
62
|
+
| [Validation Results](benchmarks/VALIDATION_RESULTS.md) | Terminal-Bench 2.0 results (−49.7% tokens, +33pp success) |
|
|
63
|
+
| [Token Optimization](benchmarks/TOKEN_OPTIMIZATION.md) | Where the token savings come from |
|
|
64
|
+
| [Accuracy Analysis](benchmarks/ACCURACY_ANALYSIS.md) | Success-rate and error analysis |
|
|
262
65
|
|
|
263
66
|
## Contributing
|
|
264
67
|
|
|
265
|
-
|
|
266
|
-
|
|
267
|
-
```bash
|
|
268
|
-
git clone https://github.com/DammianMiller/universal-agent-protocol.git
|
|
269
|
-
cd universal-agent-protocol
|
|
270
|
-
npm install
|
|
271
|
-
npm run build
|
|
272
|
-
npm test
|
|
273
|
-
```
|
|
274
|
-
|
|
275
|
-
### Worktree Workflow
|
|
276
|
-
|
|
277
|
-
All changes must be made in a worktree:
|
|
278
|
-
|
|
279
|
-
```bash
|
|
280
|
-
# Create worktree for your feature
|
|
281
|
-
uap worktree create feature-description
|
|
282
|
-
|
|
283
|
-
# Make changes, commit, create PR
|
|
284
|
-
cd .worktrees/NNN-feature-description/
|
|
285
|
-
git add -A && git commit -m "feat: description"
|
|
286
|
-
uap worktree pr <id>
|
|
287
|
-
|
|
288
|
-
# After merge, cleanup is mandatory
|
|
289
|
-
uap worktree cleanup <id>
|
|
290
|
-
```
|
|
291
|
-
|
|
292
|
-
---
|
|
293
|
-
|
|
294
|
-
## License
|
|
295
|
-
|
|
296
|
-
MIT License - See [LICENSE](../LICENSE) file
|
|
297
|
-
|
|
298
|
-
---
|
|
299
|
-
|
|
300
|
-
<div align="center">
|
|
301
|
-
|
|
302
|
-
**Maintained by:** UAP Team
|
|
303
|
-
**Repository:** https://github.com/DammianMiller/universal-agent-protocol
|
|
304
|
-
**Issues:** https://github.com/DammianMiller/universal-agent-protocol/issues
|
|
305
|
-
|
|
306
|
-
</div>
|
|
68
|
+
See [CONTRIBUTING.md](../CONTRIBUTING.md) for dev setup, the worktree workflow, completion gates, and PR conventions.
|