open-agents-ai 0.185.70 → 0.185.71

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +75 -0
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -1,3 +1,4 @@
1
+ <a name="top"></a>
1
2
  <h1 align="center">Open Agents</h1>
2
3
 
3
4
  <p align="center">
@@ -24,6 +25,8 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
24
25
 
25
26
  ## Table of Contents
26
27
 
28
+ <div align="right"><a href="#top">back to top</a></div>
29
+
27
30
  - [The Organism, Not the Cortex](#the-organism-not-the-cortex)
28
31
  - [How It Works](#how-it-works)
29
32
  - [Features](#features)
@@ -63,6 +66,8 @@ An autonomous multi-turn tool-calling agent that reads your code, makes changes,
63
66
 
64
67
  ## The Organism, Not the Cortex
65
68
 
69
+ <div align="right"><a href="#top">back to top</a></div>
70
+
66
71
  An LLM is a high-bandwidth associative generative core — closer to a cortex-like prior than to a complete agent. Its weights contain broad latent structure, but they do not by themselves give you situated continuity, durable task state, calibrated action policies, or grounded memory management. Open Agents treats the model as one organ inside a larger organism. The framework provides the rest: sensors, effectors, memory stores, routing, gating, evaluation, and persistence.
67
72
 
68
73
  **What the framework provides:**
@@ -83,6 +88,8 @@ Don't chase larger models. Build the organism around whatever model you have.
83
88
 
84
89
  ## How It Works
85
90
 
91
+ <div align="right"><a href="#top">back to top</a></div>
92
+
86
93
  ```
87
94
  You: oa "fix the null check in auth.ts"
88
95
 
@@ -97,6 +104,8 @@ The agent uses tools autonomously in a loop — reading errors, fixing code, and
97
104
 
98
105
  ## Features
99
106
 
107
+ <div align="right"><a href="#top">back to top</a></div>
108
+
100
109
  - **61 autonomous tools** — file I/O, shell, grep, web search/fetch/crawl, memory (read/write/search), sub-agents, background tasks, image/OCR/PDF, git, diagnostics, vision, desktop automation, browser automation, temporal agency (scheduler/reminders/agenda), structured files, code sandbox, transcription, skills, opencode delegation, cron agents, nexus P2P networking + x402 micropayments, **COHERE cognitive stack** (persistent REPL, recursive LLM calls, memory metabolism, identity kernel, reflection, exploration)
101
110
  - **Moondream vision** — see and interact with the desktop via Moondream VLM (caption, query, detect, point-and-click)
102
111
  - **Desktop automation** — vision-guided clicking: describe a UI element in natural language, the agent finds and clicks it
@@ -199,6 +208,8 @@ D8AgCTrxpDKD5meJ2bpAfVwcST3NF3EPuy9xczYycnXn
199
208
 
200
209
  ## Enterprise & Headless Mode
201
210
 
211
+ <div align="right"><a href="#top">back to top</a></div>
212
+
202
213
  Run Open Agents as a headless service for CI/CD pipelines, automation, and enterprise deployments.
203
214
 
204
215
  ### Non-Interactive Mode
@@ -622,6 +633,8 @@ Free for non-commercial use under CC-BY-NC-4.0. For enterprise/commercial licens
622
633
 
623
634
  ## Architecture
624
635
 
636
+ <div align="right"><a href="#top">back to top</a></div>
637
+
625
638
  The core is `AgenticRunner` — a multi-turn tool-calling loop with structured context assembly:
626
639
 
627
640
  ```
@@ -642,6 +655,8 @@ User task → assembleContext(c_instr, c_state, c_know) → LLM → tool_calls
642
655
 
643
656
  ## Context Engineering
644
657
 
658
+ <div align="right"><a href="#top">back to top</a></div>
659
+
645
660
  The agent implements structured context assembly based on current research in context engineering, modular prompt optimization, and instruction hierarchy:
646
661
 
647
662
  ```
@@ -666,6 +681,8 @@ Research provenance: grounded in "A Survey of Context Engineering for LLMs" (con
666
681
 
667
682
  ## Model-Tier Awareness
668
683
 
684
+ <div align="right"><a href="#top">back to top</a></div>
685
+
669
686
  Open Agents classifies models into three tiers and adapts its behavior accordingly:
670
687
 
671
688
  | Tier | Parameters | Base Tools | System Prompt | Compaction |
@@ -701,6 +718,8 @@ All context-dependent values scale automatically with the actual context window
701
718
 
702
719
  ## Auto-Expanding Context Window
703
720
 
721
+ <div align="right"><a href="#top">back to top</a></div>
722
+
704
723
  On startup and `/model` switch, Open Agents detects your RAM/VRAM and creates an optimized model variant:
705
724
 
706
725
  | Available Memory | Context Window |
@@ -714,6 +733,8 @@ On startup and `/model` switch, Open Agents detects your RAM/VRAM and creates an
714
733
 
715
734
  ## Tools (61)
716
735
 
736
+ <div align="right"><a href="#top">back to top</a></div>
737
+
717
738
  | Tool | Description |
718
739
  |------|-------------|
719
740
  | **File Operations** | |
@@ -820,6 +841,8 @@ The agent has 4 web tools. Pick the right one:
820
841
 
821
842
  ## Ralph Loop — Iteration-First Design
822
843
 
844
+ <div align="right"><a href="#top">back to top</a></div>
845
+
823
846
  The Ralph Loop is the core execution philosophy: **iteration beats perfection**. Instead of trying to get everything right on the first attempt, the agent executes in a retry loop where errors become learning data rather than session-ending failures.
824
847
 
825
848
  ```
@@ -844,6 +867,8 @@ The loop tracks iteration history, generates completion reports saved to `.aiwg/
844
867
 
845
868
  ## Task Control
846
869
 
870
+ <div align="right"><a href="#top">back to top</a></div>
871
+
847
872
  ### Pause, Stop, Resume, Destroy
848
873
 
849
874
  | Command | Behavior |
@@ -883,6 +908,8 @@ Type `y` to restore — the previous session context will be prepended to your n
883
908
 
884
909
  ## COHERE Cognitive Framework
885
910
 
911
+ <div align="right"><a href="#top">back to top</a></div>
912
+
886
913
  Open Agents implements the **COHERE layered cognitive stack** — a provenance-grounded architecture for persistent, reflective agentic systems. Each layer adds a distinct cognitive capability, grounded in specific research papers:
887
914
 
888
915
  ```
@@ -961,6 +988,8 @@ The identity kernel maintains a persistent self-model across sessions, the refle
961
988
 
962
989
  ## Context Compaction — Research-Backed Memory Management
963
990
 
991
+ <div align="right"><a href="#top">back to top</a></div>
992
+
964
993
  Long conversations consume context window tokens. Open Agents uses progressive context compaction to compress older messages while preserving critical information — decisions, errors, file states, and task progress.
965
994
 
966
995
  ### How It Works
@@ -1087,6 +1116,8 @@ This ensures the agent can resume coherently after compaction without re-reading
1087
1116
 
1088
1117
  ## Personality Core — SAC Framework Style Control
1089
1118
 
1119
+ <div align="right"><a href="#top">back to top</a></div>
1120
+
1090
1121
  The personality system controls how the agent communicates — from silent operator to teacher mode. It's based on the **SAC framework** ([arXiv:2506.20993](https://arxiv.org/abs/2506.20993)) which models personality along five behavioral intensity dimensions rather than binary trait toggles.
1091
1122
 
1092
1123
  ```bash
@@ -1135,6 +1166,8 @@ The personality system draws on:
1135
1166
 
1136
1167
  ## Emotion Engine — Affective State Modulation
1137
1168
 
1169
+ <div align="right"><a href="#top">back to top</a></div>
1170
+
1138
1171
  The agent stack includes a real-time emotion system that modulates behavior based on an appraisal-based affective model. Built on Russell's circumplex model of affect extended with the dominance axis from UDDETTS ADV space ([arXiv:2505.10599](https://arxiv.org/abs/2505.10599)), the engine maintains a continuous emotional state defined by three axes:
1139
1172
 
1140
1173
  - **Valence** (-1 to +1): displeasure ↔ pleasure
@@ -1197,6 +1230,8 @@ The emotion system is informed by peer-reviewed and preprint research:
1197
1230
 
1198
1231
  ## Voice Feedback (TTS)
1199
1232
 
1233
+ <div align="right"><a href="#top">back to top</a></div>
1234
+
1200
1235
  ```bash
1201
1236
  /voice # Toggle on/off (default: GLaDOS)
1202
1237
  /voice glados # GLaDOS voice (ONNX, ~50MB)
@@ -1388,6 +1423,8 @@ The stochastic narration engine generates spoken descriptions of what the agent
1388
1423
 
1389
1424
  ## Listen Mode — Live Bidirectional Audio
1390
1425
 
1426
+ <div align="right"><a href="#top">back to top</a></div>
1427
+
1391
1428
  Listen mode enables real-time voice communication with the agent. Your microphone audio is captured, streamed through Whisper, and the transcription is injected directly into the input line — creating a hands-free coding workflow.
1392
1429
 
1393
1430
  Two transcription backends ensure broad platform support:
@@ -1424,6 +1461,8 @@ The `transcribe-cli` dependency auto-installs in the background on first use. On
1424
1461
 
1425
1462
  ## Vision & Desktop Automation (Moondream)
1426
1463
 
1464
+ <div align="right"><a href="#top">back to top</a></div>
1465
+
1427
1466
  Open Agents can see your screen, understand UI elements, and interact with desktop applications through natural language — powered by the Moondream vision language model running entirely locally.
1428
1467
 
1429
1468
  ### Desktop Awareness
@@ -1610,6 +1649,8 @@ Supports `apt` (Debian/Ubuntu), `dnf` (Fedora), `pacman` (Arch), and `brew` (mac
1610
1649
 
1611
1650
  ## Interactive TUI
1612
1651
 
1652
+ <div align="right"><a href="#top">back to top</a></div>
1653
+
1613
1654
  Launch without arguments to enter the interactive REPL:
1614
1655
 
1615
1656
  ```bash
@@ -1713,6 +1754,8 @@ The steering sub-agent uses the same model and backend as the main agent with `m
1713
1754
 
1714
1755
  ## Telegram Bridge — Sub-Agent Per Chat
1715
1756
 
1757
+ <div align="right"><a href="#top">back to top</a></div>
1758
+
1716
1759
  Connect the agent to a Telegram bot. Each incoming message spawns a dedicated sub-agent that handles the conversation independently — visible in the terminal waterfall alongside other agent activity.
1717
1760
 
1718
1761
  ```bash
@@ -1844,6 +1887,8 @@ The bridge automatically handles Telegram's rate limits (HTTP 429) with exponent
1844
1887
 
1845
1888
  ## x402 Payment Rails & Nexus P2P
1846
1889
 
1890
+ <div align="right"><a href="#top">back to top</a></div>
1891
+
1847
1892
  Agents can earn and spend USDC on Base mainnet through the native x402 protocol built into [open-agents-nexus@1.5.6](https://www.npmjs.com/package/open-agents-nexus).
1848
1893
 
1849
1894
  ### Wallet & Identity
@@ -1901,6 +1946,8 @@ nexus(action='budget_set', auto_approve_below='0.01') # Auto-approve micropayme
1901
1946
 
1902
1947
  ## Sponsored Inference — Share Your GPU With the World
1903
1948
 
1949
+ <div align="right"><a href="#top">back to top</a></div>
1950
+
1904
1951
  Anyone running Open Agents can become an inference sponsor — sharing their local models (or forwarded cloud endpoints) with users worldwide through a secure, branded relay.
1905
1952
 
1906
1953
  ### For Sponsors: `/sponsor`
@@ -1965,6 +2012,8 @@ The tunnel fix uses debounced restarts with exponential cooldown (10s → 20s
1965
2012
 
1966
2013
  ## Dream Mode — Creative Idle Exploration
1967
2014
 
2015
+ <div align="right"><a href="#top">back to top</a></div>
2016
+
1968
2017
  When you're not actively tasking the agent, Dream Mode lets it creatively explore your codebase and generate improvement proposals autonomously. The system models real human sleep architecture with four stages per cycle:
1969
2018
 
1970
2019
  | Stage | Name | What Happens |
@@ -2039,6 +2088,8 @@ If no GPU is detected, the REM stage falls back to the standard multi-agent crea
2039
2088
 
2040
2089
  ## Blessed Mode — Infinite Warm Loop
2041
2090
 
2091
+ <div align="right"><a href="#top">back to top</a></div>
2092
+
2042
2093
  `/full-send-bless` activates an infinite warm loop that keeps model weights loaded in VRAM and the agent ready for instant response. The engine sends periodic keep-alive pings to the inference backend (every 2 minutes) to prevent Ollama's automatic model unloading.
2043
2094
 
2044
2095
  ```bash
@@ -2076,6 +2127,8 @@ Each DMN cycle runs a lightweight LLM agent (15 max turns, temperature 0.4) with
2076
2127
 
2077
2128
  ## Code Sandbox
2078
2129
 
2130
+ <div align="right"><a href="#top">back to top</a></div>
2131
+
2079
2132
  Execute code snippets in isolated environments without affecting your project:
2080
2133
 
2081
2134
  ```
@@ -2092,6 +2145,8 @@ Supports JavaScript, TypeScript, Python, and Bash. Two execution modes:
2092
2145
 
2093
2146
  ## Structured Data Tools
2094
2147
 
2148
+ <div align="right"><a href="#top">back to top</a></div>
2149
+
2095
2150
  ### Generate structured files
2096
2151
 
2097
2152
  Create CSV, TSV, JSON, Markdown tables, and Excel-compatible files from data:
@@ -2118,6 +2173,8 @@ Detects binary formats (XLSX, PDF, DOCX) and suggests conversion tools.
2118
2173
 
2119
2174
  ## Multi-Provider Web Search
2120
2175
 
2176
+ <div align="right"><a href="#top">back to top</a></div>
2177
+
2121
2178
  Web search automatically selects the best available provider:
2122
2179
 
2123
2180
  | Provider | Trigger | Features |
@@ -2133,6 +2190,8 @@ export JINA_API_KEY=jina_... # Enable Jina AI (optional)
2133
2190
 
2134
2191
  ## Task Templates
2135
2192
 
2193
+ <div align="right"><a href="#top">back to top</a></div>
2194
+
2136
2195
  Set a task type to get specialized system prompts, recommended tools, and output guidance:
2137
2196
 
2138
2197
  ```
@@ -2144,6 +2203,8 @@ Set a task type to get specialized system prompts, recommended tools, and output
2144
2203
 
2145
2204
  ## Human Expert Speed Ratio
2146
2205
 
2206
+ <div align="right"><a href="#top">back to top</a></div>
2207
+
2147
2208
  The status bar displays a real-time `Exp: Nx` gauge estimating how fast the agent is working relative to a leading human expert performing equivalent tasks.
2148
2209
 
2149
2210
  ```
@@ -2183,6 +2244,8 @@ All 47 tools have calibrated baselines ranging from 3s (`task_stop`) to 180s (`c
2183
2244
 
2184
2245
  ## Cost Tracking & Session Metrics
2185
2246
 
2247
+ <div align="right"><a href="#top">back to top</a></div>
2248
+
2186
2249
  Real-time token cost estimation for cloud providers. The status bar shows running cost when using a paid endpoint.
2187
2250
 
2188
2251
  ```
@@ -2197,6 +2260,8 @@ Work evaluation uses five task-type-specific rubrics (code, document, analysis,
2197
2260
 
2198
2261
  ## Configuration
2199
2262
 
2263
+ <div align="right"><a href="#top">back to top</a></div>
2264
+
2200
2265
  Config priority: CLI flags > env vars > `~/.open-agents/config.json` > defaults.
2201
2266
 
2202
2267
  ```bash
@@ -2227,6 +2292,8 @@ Create `AGENTS.md`, `OA.md`, or `.open-agents.md` in your project root for agent
2227
2292
 
2228
2293
  ## Model Support
2229
2294
 
2295
+ <div align="right"><a href="#top">back to top</a></div>
2296
+
2230
2297
  **Primary target**: Qwen3.5-122B-A10B via Ollama (MoE, 48GB+ VRAM)
2231
2298
 
2232
2299
  Any Ollama or OpenAI-compatible API model with tool calling works:
@@ -2239,6 +2306,8 @@ oa --backend-url http://10.0.0.5:11434 "refactor auth"
2239
2306
 
2240
2307
  ## Supported Inference Providers
2241
2308
 
2309
+ <div align="right"><a href="#top">back to top</a></div>
2310
+
2242
2311
  Open Agents auto-detects your provider from the endpoint URL and configures auth + health checks accordingly. All providers use standard `Authorization: Bearer <key>` authentication.
2243
2312
 
2244
2313
  | Provider | Endpoint URL | API Key | Notes |
@@ -2395,6 +2464,8 @@ No configuration needed — the cascade is built from your endpoint usage histor
2395
2464
 
2396
2465
  ## Evaluation Suite
2397
2466
 
2467
+ <div align="right"><a href="#top">back to top</a></div>
2468
+
2398
2469
  46 evaluation tasks test the agent's autonomous capabilities across coding, web research, SDLC analysis, tool creation, multi-file reasoning, memory systems, and context engineering:
2399
2470
 
2400
2471
  ```bash
@@ -2510,6 +2581,8 @@ The PoT (Program-of-Thought) guidance achieves **100% code generation rate** —
2510
2581
 
2511
2582
  ## AIWG Integration
2512
2583
 
2584
+ <div align="right"><a href="#top">back to top</a></div>
2585
+
2513
2586
  Open Agents integrates with [AIWG](https://aiwg.io) ([npm](https://www.npmjs.com/package/aiwg)) for AI-augmented software development:
2514
2587
 
2515
2588
  ```bash
@@ -2527,6 +2600,8 @@ oa "analyze this project's SDLC health and set up documentation"
2527
2600
 
2528
2601
  ## License
2529
2602
 
2603
+ <div align="right"><a href="#top">back to top</a></div>
2604
+
2530
2605
  [Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/)
2531
2606
 
2532
2607
  Free for non-commercial use. For enterprise/commercial licensing, contact [zoomerconsulting.com](https://zoomerconsulting.com).
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "open-agents-ai",
3
- "version": "0.185.70",
3
+ "version": "0.185.71",
4
4
  "description": "AI coding agent powered by open-source models (Ollama/vLLM) — interactive TUI with agentic tool-calling loop",
5
5
  "type": "module",
6
6
  "main": "./dist/index.js",