open-agents-ai 0.185.70 → 0.185.71
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +75 -0
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -1,3 +1,4 @@
|
|
|
1
|
+
<a name="top"></a>
|
|
1
2
|
<h1 align="center">Open Agents</h1>
|
|
2
3
|
|
|
3
4
|
<p align="center">
|
|
@@ -24,6 +25,8 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
|
|
|
24
25
|
|
|
25
26
|
## Table of Contents
|
|
26
27
|
|
|
28
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
29
|
+
|
|
27
30
|
- [The Organism, Not the Cortex](#the-organism-not-the-cortex)
|
|
28
31
|
- [How It Works](#how-it-works)
|
|
29
32
|
- [Features](#features)
|
|
@@ -63,6 +66,8 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
|
|
|
63
66
|
|
|
64
67
|
## The Organism, Not the Cortex
|
|
65
68
|
|
|
69
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
70
|
+
|
|
66
71
|
An LLM is a high-bandwidth associative generative core — closer to a cortex-like prior than to a complete agent. Its weights contain broad latent structure, but they do not by themselves give you situated continuity, durable task state, calibrated action policies, or grounded memory management. Open Agents treats the model as one organ inside a larger organism. The framework provides the rest: sensors, effectors, memory stores, routing, gating, evaluation, and persistence.
|
|
67
72
|
|
|
68
73
|
**What the framework provides:**
|
|
@@ -83,6 +88,8 @@ Don't chase larger models. Build the organism around whatever model you have.
|
|
|
83
88
|
|
|
84
89
|
## How It Works
|
|
85
90
|
|
|
91
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
92
|
+
|
|
86
93
|
```
|
|
87
94
|
You: oa "fix the null check in auth.ts"
|
|
88
95
|
|
|
@@ -97,6 +104,8 @@ The agent uses tools autonomously in a loop — reading errors, fixing code, and
|
|
|
97
104
|
|
|
98
105
|
## Features
|
|
99
106
|
|
|
107
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
108
|
+
|
|
100
109
|
- **61 autonomous tools** — file I/O, shell, grep, web search/fetch/crawl, memory (read/write/search), sub-agents, background tasks, image/OCR/PDF, git, diagnostics, vision, desktop automation, browser automation, temporal agency (scheduler/reminders/agenda), structured files, code sandbox, transcription, skills, opencode delegation, cron agents, nexus P2P networking + x402 micropayments, **COHERE cognitive stack** (persistent REPL, recursive LLM calls, memory metabolism, identity kernel, reflection, exploration)
|
|
101
110
|
- **Moondream vision** — see and interact with the desktop via Moondream VLM (caption, query, detect, point-and-click)
|
|
102
111
|
- **Desktop automation** — vision-guided clicking: describe a UI element in natural language, the agent finds and clicks it
|
|
@@ -199,6 +208,8 @@ D8AgCTrxpDKD5meJ2bpAfVwcST3NF3EPuy9xczYycnXn
|
|
|
199
208
|
|
|
200
209
|
## Enterprise & Headless Mode
|
|
201
210
|
|
|
211
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
212
|
+
|
|
202
213
|
Run Open Agents as a headless service for CI/CD pipelines, automation, and enterprise deployments.
|
|
203
214
|
|
|
204
215
|
### Non-Interactive Mode
|
|
@@ -622,6 +633,8 @@ Free for non-commercial use under CC-BY-NC-4.0. For enterprise/commercial licens
|
|
|
622
633
|
|
|
623
634
|
## Architecture
|
|
624
635
|
|
|
636
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
637
|
+
|
|
625
638
|
The core is `AgenticRunner` — a multi-turn tool-calling loop with structured context assembly:
|
|
626
639
|
|
|
627
640
|
```
|
|
@@ -642,6 +655,8 @@ User task → assembleContext(c_instr, c_state, c_know) → LLM → tool_calls
|
|
|
642
655
|
|
|
643
656
|
## Context Engineering
|
|
644
657
|
|
|
658
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
659
|
+
|
|
645
660
|
The agent implements structured context assembly based on current research in context engineering, modular prompt optimization, and instruction hierarchy:
|
|
646
661
|
|
|
647
662
|
```
|
|
@@ -666,6 +681,8 @@ Research provenance: grounded in "A Survey of Context Engineering for LLMs" (con
|
|
|
666
681
|
|
|
667
682
|
## Model-Tier Awareness
|
|
668
683
|
|
|
684
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
685
|
+
|
|
669
686
|
Open Agents classifies models into three tiers and adapts its behavior accordingly:
|
|
670
687
|
|
|
671
688
|
| Tier | Parameters | Base Tools | System Prompt | Compaction |
|
|
@@ -701,6 +718,8 @@ All context-dependent values scale automatically with the actual context window
|
|
|
701
718
|
|
|
702
719
|
## Auto-Expanding Context Window
|
|
703
720
|
|
|
721
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
722
|
+
|
|
704
723
|
On startup and `/model` switch, Open Agents detects your RAM/VRAM and creates an optimized model variant:
|
|
705
724
|
|
|
706
725
|
| Available Memory | Context Window |
|
|
@@ -714,6 +733,8 @@ On startup and `/model` switch, Open Agents detects your RAM/VRAM and creates an
|
|
|
714
733
|
|
|
715
734
|
## Tools (61)
|
|
716
735
|
|
|
736
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
737
|
+
|
|
717
738
|
| Tool | Description |
|
|
718
739
|
|------|-------------|
|
|
719
740
|
| **File Operations** | |
|
|
@@ -820,6 +841,8 @@ The agent has 4 web tools. Pick the right one:
|
|
|
820
841
|
|
|
821
842
|
## Ralph Loop — Iteration-First Design
|
|
822
843
|
|
|
844
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
845
|
+
|
|
823
846
|
The Ralph Loop is the core execution philosophy: **iteration beats perfection**. Instead of trying to get everything right on the first attempt, the agent executes in a retry loop where errors become learning data rather than session-ending failures.
|
|
824
847
|
|
|
825
848
|
```
|
|
@@ -844,6 +867,8 @@ The loop tracks iteration history, generates completion reports saved to `.aiwg/
|
|
|
844
867
|
|
|
845
868
|
## Task Control
|
|
846
869
|
|
|
870
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
871
|
+
|
|
847
872
|
### Pause, Stop, Resume, Destroy
|
|
848
873
|
|
|
849
874
|
| Command | Behavior |
|
|
@@ -883,6 +908,8 @@ Type `y` to restore — the previous session context will be prepended to your n
|
|
|
883
908
|
|
|
884
909
|
## COHERE Cognitive Framework
|
|
885
910
|
|
|
911
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
912
|
+
|
|
886
913
|
Open Agents implements the **COHERE layered cognitive stack** — a provenance-grounded architecture for persistent, reflective agentic systems. Each layer adds a distinct cognitive capability, grounded in specific research papers:
|
|
887
914
|
|
|
888
915
|
```
|
|
@@ -961,6 +988,8 @@ The identity kernel maintains a persistent self-model across sessions, the refle
|
|
|
961
988
|
|
|
962
989
|
## Context Compaction — Research-Backed Memory Management
|
|
963
990
|
|
|
991
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
992
|
+
|
|
964
993
|
Long conversations consume context window tokens. Open Agents uses progressive context compaction to compress older messages while preserving critical information — decisions, errors, file states, and task progress.
|
|
965
994
|
|
|
966
995
|
### How It Works
|
|
@@ -1087,6 +1116,8 @@ This ensures the agent can resume coherently after compaction without re-reading
|
|
|
1087
1116
|
|
|
1088
1117
|
## Personality Core — SAC Framework Style Control
|
|
1089
1118
|
|
|
1119
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1120
|
+
|
|
1090
1121
|
The personality system controls how the agent communicates — from silent operator to teacher mode. It's based on the **SAC framework** ([arXiv:2506.20993](https://arxiv.org/abs/2506.20993)) which models personality along five behavioral intensity dimensions rather than binary trait toggles.
|
|
1091
1122
|
|
|
1092
1123
|
```bash
|
|
@@ -1135,6 +1166,8 @@ The personality system draws on:
|
|
|
1135
1166
|
|
|
1136
1167
|
## Emotion Engine — Affective State Modulation
|
|
1137
1168
|
|
|
1169
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1170
|
+
|
|
1138
1171
|
The agent stack includes a real-time emotion system that modulates behavior based on an appraisal-based affective model. Built on Russell's circumplex model of affect extended with the dominance axis from UDDETTS ADV space ([arXiv:2505.10599](https://arxiv.org/abs/2505.10599)), the engine maintains a continuous emotional state defined by three axes:
|
|
1139
1172
|
|
|
1140
1173
|
- **Valence** (-1 to +1): displeasure ↔ pleasure
|
|
@@ -1197,6 +1230,8 @@ The emotion system is informed by peer-reviewed and preprint research:
|
|
|
1197
1230
|
|
|
1198
1231
|
## Voice Feedback (TTS)
|
|
1199
1232
|
|
|
1233
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1234
|
+
|
|
1200
1235
|
```bash
|
|
1201
1236
|
/voice # Toggle on/off (default: GLaDOS)
|
|
1202
1237
|
/voice glados # GLaDOS voice (ONNX, ~50MB)
|
|
@@ -1388,6 +1423,8 @@ The stochastic narration engine generates spoken descriptions of what the agent
|
|
|
1388
1423
|
|
|
1389
1424
|
## Listen Mode — Live Bidirectional Audio
|
|
1390
1425
|
|
|
1426
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1427
|
+
|
|
1391
1428
|
Listen mode enables real-time voice communication with the agent. Your microphone audio is captured, streamed through Whisper, and the transcription is injected directly into the input line — creating a hands-free coding workflow.
|
|
1392
1429
|
|
|
1393
1430
|
Two transcription backends ensure broad platform support:
|
|
@@ -1424,6 +1461,8 @@ The `transcribe-cli` dependency auto-installs in the background on first use. On
|
|
|
1424
1461
|
|
|
1425
1462
|
## Vision & Desktop Automation (Moondream)
|
|
1426
1463
|
|
|
1464
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1465
|
+
|
|
1427
1466
|
Open Agents can see your screen, understand UI elements, and interact with desktop applications through natural language — powered by the Moondream vision language model running entirely locally.
|
|
1428
1467
|
|
|
1429
1468
|
### Desktop Awareness
|
|
@@ -1610,6 +1649,8 @@ Supports `apt` (Debian/Ubuntu), `dnf` (Fedora), `pacman` (Arch), and `brew` (mac
|
|
|
1610
1649
|
|
|
1611
1650
|
## Interactive TUI
|
|
1612
1651
|
|
|
1652
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1653
|
+
|
|
1613
1654
|
Launch without arguments to enter the interactive REPL:
|
|
1614
1655
|
|
|
1615
1656
|
```bash
|
|
@@ -1713,6 +1754,8 @@ The steering sub-agent uses the same model and backend as the main agent with `m
|
|
|
1713
1754
|
|
|
1714
1755
|
## Telegram Bridge — Sub-Agent Per Chat
|
|
1715
1756
|
|
|
1757
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1758
|
+
|
|
1716
1759
|
Connect the agent to a Telegram bot. Each incoming message spawns a dedicated sub-agent that handles the conversation independently — visible in the terminal waterfall alongside other agent activity.
|
|
1717
1760
|
|
|
1718
1761
|
```bash
|
|
@@ -1844,6 +1887,8 @@ The bridge automatically handles Telegram's rate limits (HTTP 429) with exponent
|
|
|
1844
1887
|
|
|
1845
1888
|
## x402 Payment Rails & Nexus P2P
|
|
1846
1889
|
|
|
1890
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1891
|
+
|
|
1847
1892
|
Agents can earn and spend USDC on Base mainnet through the native x402 protocol built into [open-agents-nexus@1.5.6](https://www.npmjs.com/package/open-agents-nexus).
|
|
1848
1893
|
|
|
1849
1894
|
### Wallet & Identity
|
|
@@ -1901,6 +1946,8 @@ nexus(action='budget_set', auto_approve_below='0.01') # Auto-approve micropayme
|
|
|
1901
1946
|
|
|
1902
1947
|
## Sponsored Inference — Share Your GPU With the World
|
|
1903
1948
|
|
|
1949
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
1950
|
+
|
|
1904
1951
|
Anyone running Open Agents can become an inference sponsor — sharing their local models (or forwarded cloud endpoints) with users worldwide through a secure, branded relay.
|
|
1905
1952
|
|
|
1906
1953
|
### For Sponsors: `/sponsor`
|
|
@@ -1965,6 +2012,8 @@ The tunnel fix uses debounced restarts with exponential cooldown (10s → 20s
|
|
|
1965
2012
|
|
|
1966
2013
|
## Dream Mode — Creative Idle Exploration
|
|
1967
2014
|
|
|
2015
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2016
|
+
|
|
1968
2017
|
When you're not actively tasking the agent, Dream Mode lets it creatively explore your codebase and generate improvement proposals autonomously. The system models real human sleep architecture with four stages per cycle:
|
|
1969
2018
|
|
|
1970
2019
|
| Stage | Name | What Happens |
|
|
@@ -2039,6 +2088,8 @@ If no GPU is detected, the REM stage falls back to the standard multi-agent crea
|
|
|
2039
2088
|
|
|
2040
2089
|
## Blessed Mode — Infinite Warm Loop
|
|
2041
2090
|
|
|
2091
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2092
|
+
|
|
2042
2093
|
`/full-send-bless` activates an infinite warm loop that keeps model weights loaded in VRAM and the agent ready for instant response. The engine sends periodic keep-alive pings to the inference backend (every 2 minutes) to prevent Ollama's automatic model unloading.
|
|
2043
2094
|
|
|
2044
2095
|
```bash
|
|
@@ -2076,6 +2127,8 @@ Each DMN cycle runs a lightweight LLM agent (15 max turns, temperature 0.4) with
|
|
|
2076
2127
|
|
|
2077
2128
|
## Code Sandbox
|
|
2078
2129
|
|
|
2130
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2131
|
+
|
|
2079
2132
|
Execute code snippets in isolated environments without affecting your project:
|
|
2080
2133
|
|
|
2081
2134
|
```
|
|
@@ -2092,6 +2145,8 @@ Supports JavaScript, TypeScript, Python, and Bash. Two execution modes:
|
|
|
2092
2145
|
|
|
2093
2146
|
## Structured Data Tools
|
|
2094
2147
|
|
|
2148
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2149
|
+
|
|
2095
2150
|
### Generate structured files
|
|
2096
2151
|
|
|
2097
2152
|
Create CSV, TSV, JSON, Markdown tables, and Excel-compatible files from data:
|
|
@@ -2118,6 +2173,8 @@ Detects binary formats (XLSX, PDF, DOCX) and suggests conversion tools.
|
|
|
2118
2173
|
|
|
2119
2174
|
## Multi-Provider Web Search
|
|
2120
2175
|
|
|
2176
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2177
|
+
|
|
2121
2178
|
Web search automatically selects the best available provider:
|
|
2122
2179
|
|
|
2123
2180
|
| Provider | Trigger | Features |
|
|
@@ -2133,6 +2190,8 @@ export JINA_API_KEY=jina_... # Enable Jina AI (optional)
|
|
|
2133
2190
|
|
|
2134
2191
|
## Task Templates
|
|
2135
2192
|
|
|
2193
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2194
|
+
|
|
2136
2195
|
Set a task type to get specialized system prompts, recommended tools, and output guidance:
|
|
2137
2196
|
|
|
2138
2197
|
```
|
|
@@ -2144,6 +2203,8 @@ Set a task type to get specialized system prompts, recommended tools, and output
|
|
|
2144
2203
|
|
|
2145
2204
|
## Human Expert Speed Ratio
|
|
2146
2205
|
|
|
2206
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2207
|
+
|
|
2147
2208
|
The status bar displays a real-time `Exp: Nx` gauge estimating how fast the agent is working relative to a leading human expert performing equivalent tasks.
|
|
2148
2209
|
|
|
2149
2210
|
```
|
|
@@ -2183,6 +2244,8 @@ All 47 tools have calibrated baselines ranging from 3s (`task_stop`) to 180s (`c
|
|
|
2183
2244
|
|
|
2184
2245
|
## Cost Tracking & Session Metrics
|
|
2185
2246
|
|
|
2247
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2248
|
+
|
|
2186
2249
|
Real-time token cost estimation for cloud providers. The status bar shows running cost when using a paid endpoint.
|
|
2187
2250
|
|
|
2188
2251
|
```
|
|
@@ -2197,6 +2260,8 @@ Work evaluation uses five task-type-specific rubrics (code, document, analysis,
|
|
|
2197
2260
|
|
|
2198
2261
|
## Configuration
|
|
2199
2262
|
|
|
2263
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2264
|
+
|
|
2200
2265
|
Config priority: CLI flags > env vars > `~/.open-agents/config.json` > defaults.
|
|
2201
2266
|
|
|
2202
2267
|
```bash
|
|
@@ -2227,6 +2292,8 @@ Create `AGENTS.md`, `OA.md`, or `.open-agents.md` in your project root for agent
|
|
|
2227
2292
|
|
|
2228
2293
|
## Model Support
|
|
2229
2294
|
|
|
2295
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2296
|
+
|
|
2230
2297
|
**Primary target**: Qwen3.5-122B-A10B via Ollama (MoE, 48GB+ VRAM)
|
|
2231
2298
|
|
|
2232
2299
|
Any Ollama or OpenAI-compatible API model with tool calling works:
|
|
@@ -2239,6 +2306,8 @@ oa --backend-url http://10.0.0.5:11434 "refactor auth"
|
|
|
2239
2306
|
|
|
2240
2307
|
## Supported Inference Providers
|
|
2241
2308
|
|
|
2309
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2310
|
+
|
|
2242
2311
|
Open Agents auto-detects your provider from the endpoint URL and configures auth + health checks accordingly. All providers use standard `Authorization: Bearer <key>` authentication.
|
|
2243
2312
|
|
|
2244
2313
|
| Provider | Endpoint URL | API Key | Notes |
|
|
@@ -2395,6 +2464,8 @@ No configuration needed — the cascade is built from your endpoint usage histor
|
|
|
2395
2464
|
|
|
2396
2465
|
## Evaluation Suite
|
|
2397
2466
|
|
|
2467
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2468
|
+
|
|
2398
2469
|
46 evaluation tasks test the agent's autonomous capabilities across coding, web research, SDLC analysis, tool creation, multi-file reasoning, memory systems, and context engineering:
|
|
2399
2470
|
|
|
2400
2471
|
```bash
|
|
@@ -2510,6 +2581,8 @@ The PoT (Program-of-Thought) guidance achieves **100% code generation rate** —
|
|
|
2510
2581
|
|
|
2511
2582
|
## AIWG Integration
|
|
2512
2583
|
|
|
2584
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2585
|
+
|
|
2513
2586
|
Open Agents integrates with [AIWG](https://aiwg.io) ([npm](https://www.npmjs.com/package/aiwg)) for AI-augmented software development:
|
|
2514
2587
|
|
|
2515
2588
|
```bash
|
|
@@ -2527,6 +2600,8 @@ oa "analyze this project's SDLC health and set up documentation"
|
|
|
2527
2600
|
|
|
2528
2601
|
## License
|
|
2529
2602
|
|
|
2603
|
+
<div align="right"><a href="#top">back to top</a></div>
|
|
2604
|
+
|
|
2530
2605
|
[Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/)
|
|
2531
2606
|
|
|
2532
2607
|
Free for non-commercial use. For enterprise/commercial licensing, contact [zoomerconsulting.com](https://zoomerconsulting.com).
|
package/package.json
CHANGED