npm - @miller-tech/uap - Versions diffs - 1.39.0 → 1.40.1 - Mend

@miller-tech/uap 1.39.0 → 1.40.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (99) hide show

package/README.md +109 -642
package/dist/.tsbuildinfo +1 -1
package/dist/bin/cli.js +2 -2
package/dist/bin/cli.js.map +1 -1
package/dist/cli/deliver.d.ts +3 -2
package/dist/cli/deliver.d.ts.map +1 -1
package/dist/cli/deliver.js +10 -5
package/dist/cli/deliver.js.map +1 -1
package/docs/INDEX.md +48 -286
package/docs/architecture/OVERVIEW.md +328 -0
package/docs/architecture/PROTOCOL.md +204 -0
package/docs/benchmarks/README.md +17 -192
package/docs/getting-started/CONFIGURATION.md +237 -0
package/docs/getting-started/INSTALLATION.md +125 -0
package/docs/getting-started/QUICKSTART.md +115 -0
package/docs/guides/COORDINATION.md +162 -0
package/docs/guides/DELIVER.md +115 -0
package/docs/guides/DEPLOY_BATCHING.md +212 -0
package/docs/guides/DROIDS_AND_SKILLS.md +202 -0
package/docs/guides/LOCAL_MODELS.md +148 -0
package/docs/guides/MCP_ROUTER.md +195 -0
package/docs/guides/MEMORY.md +235 -0
package/docs/guides/MULTI_MODEL.md +223 -0
package/docs/guides/POLICIES.md +190 -0
package/docs/guides/WORKTREE_WORKFLOW.md +185 -0
package/docs/integrations/MCP_ROUTER.md +147 -0
package/docs/integrations/RTK.md +102 -0
package/docs/reference/API.md +485 -0
package/docs/reference/CLI.md +719 -0
package/docs/reference/CONFIGURATION.md +90 -193
package/docs/reference/DATABASE_SCHEMA.md +110 -344
package/docs/reference/FEATURES.md +176 -472
package/docs/reference/PATTERNS.md +102 -0
package/docs/reference/PLATFORMS.md +83 -0
package/package.json +1 -1
package/docs/AGENTS.md +0 -423
package/docs/DOCUMENTATION_AUDIT_REPORT.md +0 -131
package/docs/GETTING_STARTED.md +0 -288
package/docs/PROJECT_ANALYSIS_REPORT.md +0 -510
package/docs/architecture/COMPLETE_ARCHITECTURE.md +0 -748
package/docs/architecture/EXPERT_STACK.md +0 -137
package/docs/architecture/MULTI_MODEL.md +0 -224
package/docs/architecture/PLATFORM_GATING.md +0 -68
package/docs/architecture/SYSTEM_ANALYSIS.md +0 -334
package/docs/architecture/UAP_COMPLIANCE.md +0 -217
package/docs/architecture/UAP_PROTOCOL.md +0 -339
package/docs/architecture/UAP_STRICT_DROIDS.md +0 -172
package/docs/archive/BALLS_MODE_SELF_ANALYSIS.md +0 -260
package/docs/archive/BENCHMARK_GAPS_AND_PLAN.md +0 -146
package/docs/archive/FAILING_TASKS_SOLUTION_PLAN.md +0 -668
package/docs/archive/JINJA2-SYSTEM-MESSAGE-FIX.md +0 -209
package/docs/archive/MODEL_ROUTING_IMPLEMENTATION_SUMMARY.md +0 -281
package/docs/archive/MODEL_ROUTING_OPTIMIZATION_PLAN.md +0 -320
package/docs/archive/NPM-PUBLISH-V0.9.1.md +0 -240
package/docs/archive/OPTIMIZATION_OPTIONS.md +0 -334
package/docs/archive/PARALLELISM_GAPS_AND_OPTIONS.md +0 -422
package/docs/archive/POLICY_GATE_IMPLEMENTATION.md +0 -245
package/docs/archive/SETUP_IMPROVEMENTS.md +0 -213
package/docs/archive/UAP_GENERIC_OPTIMIZATION_PLAN.md +0 -270
package/docs/archive/UAP_OPTIMIZATION_PLAN.md +0 -701
package/docs/archive/UAP_V103_PATTERN_DESIGN.md +0 -315
package/docs/archive/UAP_V104_COMPLIANCE_DESIGN.md +0 -223
package/docs/archive/changelog/2026-03-10_uap-100-compliance.md +0 -77
package/docs/archive/changelog/2026-03-10_uap-full-system-verification.md +0 -109
package/docs/archive/opencode-integration-guide.md +0 -740
package/docs/archive/opencode-integration-quickref.md +0 -180
package/docs/benchmarks/OVERNIGHT_RUNNER.md +0 -341
package/docs/benchmarks/SPECULATIVE_DECODING_JOURNEY_2026-03.md +0 -221
package/docs/benchmarks/VALIDATION_PLAN.md +0 -568
package/docs/blog/SPECULATIVE_DECODING_PRODUCTION_PLAYBOOK.md +0 -139
package/docs/blog/local-coding-agents.md +0 -266
package/docs/blog/x-thread.md +0 -254
package/docs/deployment/DEPLOYMENT.md +0 -895
package/docs/deployment/DEPLOYMENT_STRATEGIES.md +0 -518
package/docs/deployment/DEPLOY_BATCHER_ANALYSIS.md +0 -224
package/docs/deployment/DEPLOY_BATCHING.md +0 -273
package/docs/deployment/DEPLOY_BUCKETING_ANALYSIS.md +0 -420
package/docs/deployment/QWEN35_LLAMA_CPP.md +0 -426
package/docs/deployment/UAP_LLAMA_ANTHROPIC_PROXY_BOOTSTRAP.md +0 -279
package/docs/getting-started/INTEGRATION.md +0 -628
package/docs/getting-started/OVERVIEW.md +0 -324
package/docs/getting-started/SETUP.md +0 -377
package/docs/integrations/MCP_ROUTER_SETUP.md +0 -445
package/docs/integrations/RTK_INTEGRATION.md +0 -468
package/docs/operations/TROUBLESHOOTING.md +0 -660
package/docs/pr/PR_SPECULATIVE_DOCS_TEMPLATE.md +0 -146
package/docs/pr/UPSTREAM_PRS.md +0 -424
package/docs/reference/API_REFERENCE.md +0 -903
package/docs/reference/EXPERT_DROIDS.md +0 -219
package/docs/reference/HARNESS-MATRIX.md +0 -318
package/docs/reference/PATTERN_LIBRARY.md +0 -636
package/docs/reference/UAP_CLI_REFERENCE.md +0 -620
package/docs/research/BEHAVIORAL_PATTERNS.md +0 -228
package/docs/research/DOMAIN_STRATEGIES.md +0 -316
package/docs/research/MEMORY_SYSTEMS_COMPARISON.md +0 -812
package/docs/research/PATTERN_ANALYSIS_2026-01-18.md +0 -436
package/docs/research/PERFORMANCE_ANALYSIS_2026-01-18.md +0 -209
package/docs/research/PERFORMANCE_TEST_PLAN.md +0 -383
package/docs/research/TERMINAL_BENCH_LEARNINGS.md +0 -217

package/docs/archive/opencode-integration-quickref.md DELETED Viewed

@@ -1,180 +0,0 @@
-# OpenCode Integration Quick Reference
-## File Structure
-```
-.project/
-├── .opencode/
-│   ├── plugin/
-│   │   ├── your-plugin.ts          # Your custom plugin
-│   │   └── index.ts                # Optional: aggregate exports
-│   └── package.json                # Dependencies (add @opencode-ai/plugin)
-└── opencode.json                   # OpenCode configuration
-```
-## Plugin Template
-```typescript
-import type { Plugin } from '@opencode-ai/plugin';
-import { tool } from '@opencode-ai/plugin';
-export const MyPlugin: Plugin = async ({ $, directory }) => {
-  return {
-    // Define tools
-    tool: {
-      my_tool: tool({
-        description: 'What this tool does',
-        args: {
-          param: tool.schema.string().describe('Parameter'),
-        },
-        async execute({ param }) {
-          const result = await $`command ${param}`;
-          return result.stdout.toString();
-        },
-      }),
-    },
-    // Optional: Event hooks
-    event: async ({ event }) => {
-      if (event.type === 'session.created') {
-        console.log('Session started');
-      }
-    },
-  };
-};
-```
-## Available Hooks
-| Hook                                 | Purpose                    | Example                        |
-| ------------------------------------ | -------------------------- | ------------------------------ |
-| `tool`                               | Define new tools           | Custom commands for LLM        |
-| `event.session.created`              | Session initialization     | Load context, initialize state |
-| `event.session.compacting`           | Before context compression | Preserve important data        |
-| `tool.execute.before`                | Before tool runs           | Validate args, log activity    |
-| `tool.execute.after`                 | After tool completes       | Record results, update state   |
-| `tool.definition`                    | Modify tool descriptions   | Add policy constraints         |
-| `experimental.chat.system.transform` | Inject system context      | RAG retrieval, dynamic context |
-| `middleware`                         | Transform messages         | Pre/post processing            |
-## Tool Schema Types
-```typescript
-// String
-tool.schema.string().describe('Text parameter');
-// Number with constraints
-tool.schema.number().min(0).max(100).default(50);
-// Enum
-tool.schema.enum(['read', 'write', 'execute']).default('read');
-// Array
-tool.schema.array().of(tool.schema.string());
-// Optional
-tool.schema.string().optional();
-```
-## Common Patterns
-### 1. CLI Wrapper
-```typescript
-tool({
-  description: 'Run external command',
-  args: { cmd: tool.schema.string() },
-  async execute({ cmd }) {
-    return (await $`${cmd}`.quiet()).stdout.toString();
-  },
-});
-```
-### 2. File Operations
-```typescript
-import { readFile, writeFile } from 'fs/promises';
-tool({
-  description: 'Read project file',
-  args: { path: tool.schema.string() },
-  async execute({ path }) {
-    return await readFile(join(projectDir, path), 'utf-8');
-  },
-});
-```
-### 3. Memory Query
-```typescript
-tool({
-  description: 'Query persistent memory',
-  args: { query: tool.schema.string() },
-  async execute({ query }) {
-    const result = await $`python3 ./scripts/query.py "${query}"`;
-    return result.stdout.toString().trim();
-  },
-});
-```
-### 4. Context Injection (RAG)
-```typescript
-middleware: async (input, next) => {
-  const lastMsg = input.messages?.[input.messages.length - 1];
-  if (lastMsg?.role === 'user') {
-    const context = await queryRAG(lastMsg.content);
-    input.messages.push({ role: 'system', content: `<context>${context}</context>` });
-  }
-  return next(input);
-};
-```
-## Plugin Examples in This Repo
-| Plugin          | File                                      | Purpose                       |
-| --------------- | ----------------------------------------- | ----------------------------- |
-| Commands        | `.opencode/plugin/uap-commands.ts`        | CLI commands as tools         |
-| Skills          | `.opencode/plugin/uap-skills.ts`          | Skill loading system          |
-| Droids          | `.opencode/plugin/uap-droids.ts`          | Specialized agent droids      |
-| Pattern RAG     | `.opencode/plugin/uap-pattern-rag.ts`     | On-demand pattern retrieval   |
-| Task Completion | `.opencode/plugin/uap-task-completion.ts` | Track task outcomes           |
-| Session Hooks   | `.opencode/plugin/uap-session-hooks.ts`   | Session lifecycle events      |
-| Enforcement     | `tools/agents/plugins/uap-enforce.ts`     | Loop detection, budget limits |
-## Dependencies
-```json
-{
-  "dependencies": {
-    "@opencode-ai/plugin": "1.2.16"
-  }
-}
-```
-## Debugging
-```bash
-# Check plugin loads
-opencode run "What tools are available?"
-# View logs
-tail -f ~/.opencode/logs/*.log
-# Test TypeScript syntax
-npx tsc --noEmit .opencode/plugin/your-plugin.ts
-```
-## Best Practices
-1. **Error Handling**: Always use `.nothrow()` and check exit codes
-2. **Security**: Validate inputs, prevent command injection
-3. **Caching**: Cache expensive operations between tool calls
-4. **Descriptions**: Write clear, comprehensive tool descriptions
-5. **Naming**: Use snake_case, prefix with domain (`mydomain_tool`)
-6. **Context**: Preserve important state across compaction
-7. **Performance**: Use `--quiet` to reduce output noise
-## Full Example
-See: `.opencode/plugin/uap-commands.ts` for a complete implementation example.

package/docs/benchmarks/OVERNIGHT_RUNNER.md DELETED Viewed

@@ -1,341 +0,0 @@
-# Overnight Benchmark Runner Guide
-> **Version:** 1.18.0
-> **Last Updated:** 2026-03-28
-> **Purpose:** Automated overnight benchmark execution
----
-## Overview
-This guide explains how to set up and run the overnight benchmark suite for comprehensive UAP validation.
-### What Gets Run
-The overnight suite executes:
-- **10 representative tasks** (short benchmark)
-- **Token tracking** per task
-- **Time measurement** per task
-- **Success/failure** tracking
-- **Error count** per task
-- **Quality scoring** (if enabled)
-### Expected Duration
-| Suite | Tasks | Duration |
-|-------|-------|----------|
-| Short | 10 | ~15-20 minutes |
-| Full | 14 | ~25-30 minutes |
-| Overnight | 10 + extended | ~4 hours |
----
-## Quick Start
-### Manual Run
-```bash
-# Run short benchmark suite
-npm run benchmark:short
-# Run full benchmark suite
-npm run benchmark:full
-# Run overnight suite
-npm run benchmark:overnight
-```
-### Automated Nightly Run
-```bash
-# Edit crontab
-crontab -e
-# Add nightly run at 2:00 AM
-0 2 * * * cd /path/to/uap && npm run benchmark:overnight >> /var/log/uap-benchmark.log 2>&1
-```
----
-## Configuration
-### Environment Variables
-```bash
-# Benchmark configuration
-UAP_BENCHMARK_TASKS=T01,T02,T03,T04,T05,T06,T07,T08,T09,T10
-UAP_BENCHMARK_UAP_ENABLED=true
-UAP_BENCHMARK_OPENCODE_ENABLED=true
-UAP_BENCHMARK_TOKEN_TRACKING=true
-UAP_BENCHMARK_QUALITY_SCORING=false
-# Results location
-UAP_BENCHMARK_RESULTS_DIR=./benchmark-results
-```
-### Task Selection
-```typescript
-// scripts/benchmark-quick-suite.ts
-const TASKS = [
-  { id: 'T01', name: 'Git Repository Recovery', category: 'system-admin' },
-  { id: 'T02', name: 'Password Hash Recovery', category: 'security' },
-  { id: 'T03', name: 'mTLS Certificate Setup', category: 'security' },
-  { id: 'T04', name: 'Docker Compose Config', category: 'containers' },
-  { id: 'T05', name: 'ML Model Training', category: 'ml' },
-  { id: 'T06', name: 'Data Compression', category: 'data-processing' },
-  { id: 'T07', name: 'Chess FEN Parser', category: 'games' },
-  { id: 'T08', name: 'SQLite WAL Recovery', category: 'database' },
-  { id: 'T09', name: 'HTTP Server Config', category: 'networking' },
-  { id: 'T10', name: 'Code Compression', category: 'development' },
-];
-```
----
-## Output Format
-### Results JSON
-```json
-[
-  {
-    "taskId": "T01",
-    "taskName": "Git Repository Recovery",
-    "category": "system-admin",
-    "tokens": 19800,
-    "time": 12.34,
-    "success": true,
-    "errors": 0
-  }
-]
-```
-### Markdown Report
-```markdown
-# UAP Benchmark Report
-**Generated:** 2026-03-28
-**Version:** 1.18.0
-## Summary
-| Metric | Value |
-|--------|-------|
-| Total Tasks | 10 |
-| Successful | 10 |
-| Avg Tokens/Task | 20,000 |
-| Avg Time/Task | 15.5s |
-| Success Rate | 100% |
-```
----
-## Results Location
-```
-benchmark-results/
-├── overnight-2026-03-28-020000/
-│   ├── benchmark.log
-│   ├── results-2026-03-28.json
-│   └── report-2026-03-28.md
-├── overnight-2026-03-27-020000/
-│   └── ...
-└── ...
-```
----
-## Monitoring
-### Check Status
-```bash
-# Check latest results
-ls -lt benchmark-results/overnight-*/ | head -5
-# View latest report
-cat benchmark-results/overnight-*/report-*.md | tail -50
-# Check benchmark log
-tail -f benchmark-results/overnight-*/benchmark.log
-```
-### Alerting
-```bash
-# Check for failures
-grep -r "Failed\|Error" benchmark-results/overnight-*/benchmark.log
-# Check success rate
-jq -s 'map(select(.success | not)) | length' benchmark-results/overnight-*/results-*.json
-```
----
-## Troubleshooting
-### Benchmark Fails
-```bash
-# Check logs
-cat benchmark-results/overnight-*/benchmark.log
-# Check Node.js version
-node --version  # Should be >= 18.0.0
-# Check dependencies
-npm install
-# Rebuild project
-npm run build
-```
-### Results Not Generated
-```bash
-# Check results directory permissions
-ls -la benchmark-results/
-# Create results directory manually
-mkdir -p benchmark-results
-# Run with verbose output
-npm run benchmark:short -- --verbose
-```
-### Performance Issues
-```bash
-# Check system resources
-free -h          # Memory
-df -h            # Disk space
-top              # CPU usage
-# Reduce concurrent tasks if needed
-export UAP_BENCHMARK_CONCURRENCY=1
-```
----
-## Advanced Usage
-### Custom Task List
-```bash
-# Create custom tasks file
-cat > custom-tasks.json << EOF
-[
-  {"id": "T01", "name": "Task 1", "category": "test"},
-  {"id": "T02", "name": "Task 2", "category": "test"}
-]
-EOF
-# Run with custom tasks
-node scripts/benchmark-quick-suite.ts --tasks custom-tasks.json
-```
-### Quality Scoring
-```bash
-# Enable quality scoring
-export UAP_BENCHMARK_QUALITY_SCORING=true
-# Quality score is calculated by:
-correctness * 0.3 +
-completeness * 0.25 +
-efficiency * 0.2 +
-security * 0.15 +
-maintainability * 0.1
-```
-### Compare Results
-```bash
-# Compare two benchmark runs
-npm run benchmark:compare \
-  -- --before benchmark-results/overnight-2026-03-27/results.json \
-     --after benchmark-results/overnight-2026-03-28/results.json
-# Generate comparison report
-npm run benchmark:report \
-  -- --input benchmark-results/overnight-2026-03-28/results.json \
-     --output benchmark-results/overnight-2026-03-28/comparison.md
-```
----
-## Expected Results
-### Based on Historical Data
-| Metric | Target | Status |
-|--------|--------|--------|
-| Success Rate | 100% | ✅ |
-| Avg Tokens/Task | <25,000 | ✅ |
-| Avg Time/Task | <20s | ✅ |
-| Error Rate | 0% | ✅ |
-### Performance Comparison
-| Version | Tokens/Task | Time/Task | Success Rate |
-|---------|-------------|-----------|--------------|
-| Baseline | 52,000 | 45s | 75% |
-| UAP v1.17 | 28,500 | 38s | 92% |
-| UAP v1.18 + OpenCode | 23,400 | 32s | 100% |
----
-## Best Practices
-### 1. Run During Off-Peak Hours
-- Avoid running during business hours
-- Schedule for 2:00 AM local time
-- Ensure no other heavy workloads
-### 2. Monitor Resources
-- Check disk space before run
-- Ensure sufficient memory
-- Monitor network connectivity
-### 3. Review Results Daily
-- Check for failures
-- Review token usage trends
-- Monitor success rate
-### 4. Archive Old Results
-```bash
-# Archive results older than 30 days
-find benchmark-results -minmtime 30 -exec mv {} benchmark-results/archive/ \;
-```
-### 5. Set Up Alerts
-```bash
-# Alert on failures
-grep -q "Failed" benchmark-results/overnight-*/benchmark.log && \
-  echo "Benchmark failures detected!" | mail -s "UAP Benchmark Alert" admin@example.com
-```
----
-## Next Steps
-After overnight run completes:
-1. **Review Report**: Check `benchmark-results/overnight-*/report-*.md`
-2. **Verify Success**: Ensure 100% success rate
-3. **Check Tokens**: Confirm token usage is within targets
-4. **Monitor Trends**: Compare with previous runs
-5. **Update Documentation**: If significant changes detected
----
-<div align="center">
-**Related Documentation:**
-- [Benchmark Results](COMPREHENSIVE_BENCHMARKS.md)
-- [Validation Plan](VALIDATION_PLAN.md)
-- [CLI Reference](../reference/UAP_CLI_REFERENCE.md)
-</div>