open-agents-ai 0.187.259 → 0.187.262
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +160 -1
- package/dist/index.js +755 -269
- package/package.json +2 -2
package/README.md
CHANGED
|
@@ -35,32 +35,162 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
|
|
|
35
35
|
- [How It Works](#how-it-works)
|
|
36
36
|
- [Features](#features)
|
|
37
37
|
- [Enterprise & Headless Mode](#enterprise--headless-mode)
|
|
38
|
+
- [Non-Interactive Mode](#non-interactive-mode)
|
|
39
|
+
- [Background Jobs](#background-jobs)
|
|
40
|
+
- [JSON Output Mode](#json-output-mode)
|
|
41
|
+
- [Process Management](#process-management)
|
|
42
|
+
- [REST API Service (Port 11435)](#rest-api-service-port-11435)
|
|
43
|
+
- [Access Policy & Binding](#access-policy--binding)
|
|
44
|
+
- [Working Directory](#working-directory)
|
|
45
|
+
- [Health & Observability](#health--observability)
|
|
46
|
+
- [OpenAI-Compatible Inference](#openai-compatible-inference)
|
|
47
|
+
- [Agentic Task Execution](#agentic-task-execution)
|
|
48
|
+
- [Configuration](#configuration-1)
|
|
49
|
+
- [Slash Commands via REST](#slash-commands-via-rest)
|
|
50
|
+
- [Auth Scopes](#auth-scopes)
|
|
51
|
+
- [Tool-Use Profiles](#tool-use-profiles)
|
|
52
|
+
- [Parallelism & Concurrency](#parallelism--concurrency)
|
|
53
|
+
- [Endpoint Reference](#endpoint-reference)
|
|
54
|
+
- [Stateful Chat — `/v1/chat` + `/api/chat` (OpenAI drop-in with full agent under the hood)](#stateful-chat--v1chat--apichat-openai-drop-in-with-full-agent-under-the-hood)
|
|
55
|
+
- [Live Comparison: Ollama vs OA Full Agent](#live-comparison-ollama-vs-oa-full-agent)
|
|
56
|
+
- [One-Off Completions — `/api/generate` + `/v1/generate`](#one-off-completions--apigenerate--v1generate)
|
|
57
|
+
- [Embeddings — `/v1/embeddings` + `/api/embed`](#embeddings--v1embeddings--apiembed)
|
|
58
|
+
- [Memory Recall + Knowledge Graph — `/v1/memory/*`](#memory-recall--knowledge-graph--v1memory)
|
|
59
|
+
- [Generate/Embed/Memory Test Harness](#generateembedmemory-test-harness)
|
|
60
|
+
- [AIWG Cascade — `/v1/aiwg/*`](#aiwg-cascade--v1aiwg)
|
|
61
|
+
- [ISO/IEC 42001:2023 AIMS — `/v1/aims/*`](#isoiec-420012023-aims--v1aims)
|
|
62
|
+
- [Event Bus — `/v1/events` (SSE fanout)](#event-bus--v1events-sse-fanout)
|
|
63
|
+
- [Memory + Skills + MCP + Tools + Engines (parity surface)](#memory--skills--mcp--tools--engines-parity-surface)
|
|
64
|
+
- [Sessions, Context, Cost, Sponsors, Nexus](#sessions-context-cost-sponsors-nexus)
|
|
65
|
+
- [RFC 7807 Problem Details (error envelope)](#rfc-7807-problem-details-error-envelope)
|
|
66
|
+
- [Pagination envelope](#pagination-envelope)
|
|
67
|
+
- [ETag + Conditional GET](#etag--conditional-get)
|
|
68
|
+
- [Web Interface](#web-interface)
|
|
38
69
|
- [Architecture](#architecture)
|
|
39
70
|
- [Context Engineering](#context-engineering)
|
|
40
71
|
- [Model-Tier Awareness](#model-tier-awareness)
|
|
72
|
+
- [Small Model Optimization (Research-Backed)](#small-model-optimization-research-backed)
|
|
73
|
+
- [Tool Nesting for Small Models](#tool-nesting-for-small-models)
|
|
74
|
+
- [Dynamic Context Limits](#dynamic-context-limits)
|
|
41
75
|
- [Live Code Knowledge Graph](#live-code-knowledge-graph)
|
|
76
|
+
- [How It Works](#how-it-works-1)
|
|
77
|
+
- [What the Agent Sees](#what-the-agent-sees)
|
|
78
|
+
- [Graph Tools](#graph-tools)
|
|
79
|
+
- [Storage](#storage)
|
|
80
|
+
- [Research Basis](#research-basis)
|
|
42
81
|
- [Auto-Expanding Context Window](#auto-expanding-context-window)
|
|
43
82
|
- [Tools (85+)](#tools-85)
|
|
44
83
|
- [Model Context Protocol (MCP)](#model-context-protocol-mcp)
|
|
84
|
+
- [What MCP gives you](#what-mcp-gives-you)
|
|
85
|
+
- [Spec compliance — what we implement](#spec-compliance--what-we-implement)
|
|
86
|
+
- [Three ways to add a server](#three-ways-to-add-a-server)
|
|
87
|
+
- [Verified compatibility — 12 servers connect end-to-end](#verified-compatibility---12-servers-connect-end-to-end)
|
|
88
|
+
- [Streaming, progress, and binary content](#streaming-progress-and-binary-content)
|
|
89
|
+
- [Live agent eval](#live-agent-eval)
|
|
90
|
+
- [Programmatic API](#programmatic-api)
|
|
91
|
+
- [Further reading](#further-reading)
|
|
45
92
|
- [Associative Memory & Cross-Modal Binding](#associative-memory--cross-modal-binding)
|
|
93
|
+
- [Architecture](#architecture-1)
|
|
94
|
+
- [Episode Store (SQLite)](#episode-store-sqlite)
|
|
95
|
+
- [Temporal Knowledge Graph](#temporal-knowledge-graph)
|
|
96
|
+
- [Zettelkasten Linking (A-MEM)](#zettelkasten-linking-a-mem)
|
|
97
|
+
- [PPR Retrieval (HippoRAG)](#ppr-retrieval-hipporag)
|
|
98
|
+
- [Cross-Modal Binding](#cross-modal-binding)
|
|
99
|
+
- [Gist Compression](#gist-compression)
|
|
100
|
+
- [Near-Critical Cognitive Architecture](#near-critical-cognitive-architecture)
|
|
101
|
+
- [Cross‑Modality Identity & Association (CLIP + Voice)](#crossmodality-identity--association-clip--voice)
|
|
46
102
|
- [Ralph Loop — Iteration-First Design](#ralph-loop--iteration-first-design)
|
|
47
103
|
- [Task Control](#task-control)
|
|
104
|
+
- [Pause, Stop, Resume, Destroy](#pause-stop-resume-destroy)
|
|
105
|
+
- [Session Context Persistence](#session-context-persistence)
|
|
106
|
+
- [Auto-Restore on Startup](#auto-restore-on-startup)
|
|
48
107
|
- [COHERE Cognitive Framework](#cohere-cognitive-framework)
|
|
108
|
+
- [Distributed Inference (`/cohere`)](#distributed-inference-cohere)
|
|
109
|
+
- [How It Works](#how-it-works-2)
|
|
110
|
+
- [Research Provenance](#research-provenance)
|
|
49
111
|
- [Context Compaction — Research-Backed Memory Management](#context-compaction--research-backed-memory-management)
|
|
112
|
+
- [How It Works](#how-it-works-3)
|
|
113
|
+
- [Compaction Strategies](#compaction-strategies)
|
|
114
|
+
- [Automatic Compaction](#automatic-compaction)
|
|
115
|
+
- [Deep Context Mode (`/deep`)](#deep-context-mode-deep)
|
|
116
|
+
- [Status Bar Context Tracking (`Ctx:` + `SNR:`)](#status-bar-context-tracking-ctx--snr)
|
|
117
|
+
- [Memex Experience Archive](#memex-experience-archive)
|
|
118
|
+
- [Design Rationale](#design-rationale)
|
|
119
|
+
- [Domain-Aware Preservation](#domain-aware-preservation)
|
|
50
120
|
- [Personality Core — SAC Framework Style Control](#personality-core--sac-framework-style-control)
|
|
121
|
+
- [How It Works](#how-it-works-4)
|
|
122
|
+
- [What Changes Per Style](#what-changes-per-style)
|
|
123
|
+
- [Persistence](#persistence)
|
|
124
|
+
- [Research Provenance](#research-provenance-1)
|
|
51
125
|
- [Emotion Engine — Affective State Modulation](#emotion-engine--affective-state-modulation)
|
|
126
|
+
- [Emotion Center (LLM-Generated Labels)](#emotion-center-llm-generated-labels)
|
|
127
|
+
- [TUI Status Bar](#tui-status-bar)
|
|
128
|
+
- [Proactive Admin Outreach](#proactive-admin-outreach)
|
|
129
|
+
- [Momentum Effects](#momentum-effects)
|
|
130
|
+
- [Research Foundations](#research-foundations)
|
|
52
131
|
- [Voice Feedback (TTS)](#voice-feedback-tts)
|
|
132
|
+
- [LuxTTS Voice Cloning](#luxtts-voice-cloning)
|
|
133
|
+
- [Narration Engine Architecture](#narration-engine-architecture)
|
|
134
|
+
- [Emotion-Driven Prosody (SEST)](#emotion-driven-prosody-sest)
|
|
135
|
+
- [Personality-Aware Voice](#personality-aware-voice)
|
|
136
|
+
- [Voice Narration Research Foundations](#voice-narration-research-foundations)
|
|
137
|
+
- [Live Voice Session](#live-voice-session)
|
|
138
|
+
- [Telegram Voice Messages](#telegram-voice-messages)
|
|
139
|
+
- [Auto-Install Dependencies](#auto-install-dependencies)
|
|
140
|
+
- [Call Sub-Agent Architecture](#call-sub-agent-architecture)
|
|
141
|
+
- [Content-Aware Voice Narration](#content-aware-voice-narration)
|
|
53
142
|
- [Listen Mode — Live Bidirectional Audio](#listen-mode--live-bidirectional-audio)
|
|
54
143
|
- [Vision & Desktop Automation (Moondream)](#vision--desktop-automation-moondream)
|
|
144
|
+
- [Desktop Awareness](#desktop-awareness)
|
|
145
|
+
- [Vision Analysis](#vision-analysis)
|
|
146
|
+
- [Point-and-Click](#point-and-click)
|
|
147
|
+
- [Browser Automation](#browser-automation)
|
|
55
148
|
- [Interactive TUI](#interactive-tui)
|
|
149
|
+
- [Slash Commands](#slash-commands)
|
|
150
|
+
- [Mid-Task Steering (Sub-Agent Architecture)](#mid-task-steering-sub-agent-architecture)
|
|
56
151
|
- [Telegram Bridge — Sub-Agent Per Chat](#telegram-bridge--sub-agent-per-chat)
|
|
152
|
+
- [Admin Slash Command Passthrough](#admin-slash-command-passthrough)
|
|
153
|
+
- [Sub-Agent Architecture](#sub-agent-architecture)
|
|
154
|
+
- [Access Levels](#access-levels)
|
|
155
|
+
- [Streaming Responses](#streaming-responses)
|
|
156
|
+
- [Public User Isolation](#public-user-isolation)
|
|
157
|
+
- [Context-Aware Tool Policy](#context-aware-tool-policy)
|
|
158
|
+
- [Group Chat Distinction](#group-chat-distinction)
|
|
159
|
+
- [Media Handling](#media-handling)
|
|
160
|
+
- [Rate Limit Handling](#rate-limit-handling)
|
|
57
161
|
- [x402 Payment Rails & Nexus P2P](#x402-payment-rails--nexus-p2p)
|
|
162
|
+
- [Wallet & Identity](#wallet--identity)
|
|
163
|
+
- [Expose Inference with Pricing](#expose-inference-with-pricing)
|
|
164
|
+
- [Spend — Gasless USDC Transfers (EIP-3009)](#spend--gasless-usdc-transfers-eip-3009)
|
|
165
|
+
- [Remote Inference — Tap Into the Mesh](#remote-inference--tap-into-the-mesh)
|
|
166
|
+
- [Ledger & Budget](#ledger--budget)
|
|
167
|
+
- [How x402 Works (End to End)](#how-x402-works-end-to-end)
|
|
168
|
+
- [Security Model](#security-model)
|
|
58
169
|
- [Sponsored Inference — Share Your GPU With the World](#sponsored-inference--share-your-gpu-with-the-world)
|
|
170
|
+
- [For Sponsors: `/sponsor`](#for-sponsors-sponsor)
|
|
171
|
+
- [For Consumers: `/endpoint sponsor`](#for-consumers-endpoint-sponsor)
|
|
172
|
+
- [Architecture](#architecture-2)
|
|
173
|
+
- [Ollama Endpoint Security](#ollama-endpoint-security)
|
|
59
174
|
- [COHERE Distributed Mind](#cohere-distributed-mind)
|
|
175
|
+
- [How COHERE Works](#how-cohere-works)
|
|
176
|
+
- [NATS Channels](#nats-channels)
|
|
177
|
+
- [Model Selection (Family-Based Scoring)](#model-selection-family-based-scoring)
|
|
178
|
+
- [Pressure Gate (CM-04)](#pressure-gate-cm-04)
|
|
60
179
|
- [Self-Improvement & Learning](#self-improvement--learning)
|
|
180
|
+
- [Trajectory Logging](#trajectory-logging)
|
|
181
|
+
- [Rejection Fine-Tuning Pipeline](#rejection-fine-tuning-pipeline)
|
|
182
|
+
- [Inference-Time Self-Improvement](#inference-time-self-improvement)
|
|
61
183
|
- [Dream Mode — Creative Idle Exploration](#dream-mode--creative-idle-exploration)
|
|
62
184
|
- [Blessed Mode — Infinite Warm Loop](#blessed-mode--infinite-warm-loop)
|
|
185
|
+
- [Default Mode Network (DMN) — Autonomous Task Chaining](#default-mode-network-dmn--autonomous-task-chaining)
|
|
63
186
|
- [Docker Sandbox & Collective Intelligence](#docker-sandbox--collective-intelligence)
|
|
187
|
+
- [Container Sandbox](#container-sandbox)
|
|
188
|
+
- [Multi-Agent Collective Testbed](#multi-agent-collective-testbed)
|
|
189
|
+
- [Self-Play Idle Loop (D1)](#self-play-idle-loop-d1)
|
|
190
|
+
- [Heuristic Extraction (D2)](#heuristic-extraction-d2)
|
|
191
|
+
- [Identity Kernel Evolution (D3)](#identity-kernel-evolution-d3)
|
|
192
|
+
- [Peer Delta Merge (D4)](#peer-delta-merge-d4)
|
|
193
|
+
- [6-Agent Evaluation Results](#6-agent-evaluation-results)
|
|
64
194
|
- [Code Sandbox](#code-sandbox)
|
|
65
195
|
- [Structured Data Tools](#structured-data-tools)
|
|
66
196
|
- [On-Device Web Search](#on-device-web-search)
|
|
@@ -69,9 +199,19 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
|
|
|
69
199
|
- [Cost Tracking & Session Metrics](#cost-tracking--session-metrics)
|
|
70
200
|
- [Configuration](#configuration)
|
|
71
201
|
- [Network Access & Binding](#network-access--binding)
|
|
202
|
+
- [Project Context](#project-context)
|
|
203
|
+
- [`.oa/` Project Directory](#oa-project-directory)
|
|
72
204
|
- [Model Support](#model-support)
|
|
73
205
|
- [Supported Inference Providers](#supported-inference-providers)
|
|
206
|
+
- [Connecting to a Provider](#connecting-to-a-provider)
|
|
207
|
+
- [P2P Inference via libp2p](#p2p-inference-via-libp2p)
|
|
208
|
+
- [Endpoint Cascade Failover](#endpoint-cascade-failover)
|
|
74
209
|
- [Evaluation Suite](#evaluation-suite)
|
|
210
|
+
- [Benchmark Results](#benchmark-results)
|
|
211
|
+
- [Collective Intelligence Evaluation (v0.186.57)](#collective-intelligence-evaluation-v018657)
|
|
212
|
+
- [Web Navigation Evaluation (v0.186.61)](#web-navigation-evaluation-v018661)
|
|
213
|
+
- [Multi-Agent Architecture Evaluation (v0.187.4)](#multi-agent-architecture-evaluation-v01874)
|
|
214
|
+
- [REST API Enterprise Evaluation (v0.185.68)](#rest-api-enterprise-evaluation-v018568)
|
|
75
215
|
- [AIWG Integration](#aiwg-integration)
|
|
76
216
|
- [Research Citations](#research-citations)
|
|
77
217
|
- [License](#license)
|
|
@@ -310,6 +450,14 @@ OA_API_KEYS="key1:admin:alice:30:50000:5,key2:run:ci:60::3,key3:read:grafana" oa
|
|
|
310
450
|
|
|
311
451
|
> **Every example below is verified against `open-agents-ai@0.187.189` on a live daemon.** Examples from earlier versions are deprecated.
|
|
312
452
|
|
|
453
|
+
#### Access Policy & Binding
|
|
454
|
+
|
|
455
|
+
Control who can reach the daemon and where it binds:
|
|
456
|
+
|
|
457
|
+
- TUI commands: `/access loopback|lan|any`, `/host <host[:port]>`, `/network config` (interactive), `--local` to save per‑project.
|
|
458
|
+
- Environment: `OA_ACCESS=loopback|lan|any`, `OA_HOST=host[:port]`.
|
|
459
|
+
- See Configuration → [Network Access & Binding](#network-access--binding) for full details and security guidance.
|
|
460
|
+
|
|
313
461
|
#### Working Directory
|
|
314
462
|
|
|
315
463
|
Pass `X-Working-Directory` header to run commands in your current terminal directory:
|
|
@@ -3523,6 +3671,18 @@ Post-task, the [ReadAgent](https://arxiv.org/abs/2402.09727) gist compressor cre
|
|
|
3523
3671
|
|
|
3524
3672
|
### Near-Critical Cognitive Architecture
|
|
3525
3673
|
|
|
3674
|
+
### Cross‑Modality Identity & Association (CLIP + Voice)
|
|
3675
|
+
|
|
3676
|
+
Open Agents binds entities across image, audio, and text using joint‑embedding models:
|
|
3677
|
+
|
|
3678
|
+
- CLIP‑based visual ID: person/object embeddings extracted from frames are matched to persistent entity nodes; cosine similarity > τ promotes to identity with temporal smoothing. Supports multi‑appearance tracking and re‑identification across sessions.
|
|
3679
|
+
- Voiceprint linkage: speaker embeddings (x‑vector/ECAPA) are associated with entities when co‑occurring in time with a visual track and a transcribed utterance; robust to background noise via median pooling across windows.
|
|
3680
|
+
- Text label fusion: natural‑language labels (names, roles, tags) are bound to the same entity when co‑referents appear in proximate context windows (heuristics + clustering).
|
|
3681
|
+
- Association graph: cross‑modal edges (image↔voice↔text) consolidate into a unified entity node with provenance (model, score, timestamp) and decay‑based confidence.
|
|
3682
|
+
- Privacy & safety: raw media never leaves the machine; embeddings are stored locally under `.oa/memory/`. Redaction controls can drop embeddings by label or recency.
|
|
3683
|
+
|
|
3684
|
+
This enables queries like: “Find where Alex spoke about deployment,” “Show files edited after the person in the red sweater approved the PR,” or “Summarize conversations where Speaker‑B and Alice appear together.”
|
|
3685
|
+
|
|
3526
3686
|
The associative memory integrates with a near-critical cognitive framework inspired by [Beggs & Plenz (2003)](https://doi.org/10.1523/JNEUROSCI.23-35-11167.2003) neuronal avalanche dynamics:
|
|
3527
3687
|
|
|
3528
3688
|
- **Auto-consolidation**: At task boundaries, the system writes consolidation snapshots to `.oa/consolidations/` with lessons learned and key patterns
|
|
@@ -4507,4 +4667,3 @@ Security tips:
|
|
|
4507
4667
|
- Prefer `loopback` on single-machine setups.
|
|
4508
4668
|
- Use `lan` for same-network devices.
|
|
4509
4669
|
- If you set `any`, put the daemon behind TLS and an authenticating reverse proxy.
|
|
4510
|
-
|