open-agents-ai 0.5.3 → 0.7.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (3) hide show
  1. package/README.md +65 -1
  2. package/dist/index.js +1084 -115
  3. package/package.json +1 -1
package/README.md CHANGED
@@ -91,7 +91,7 @@ pnpm -r test # 911 tests across 77 files
91
91
 
92
92
  ## Tools
93
93
 
94
- The agent has access to 18 tools that it calls autonomously:
94
+ The agent has access to 26 tools that it calls autonomously:
95
95
 
96
96
  | Tool | Description |
97
97
  |------|-------------|
@@ -113,6 +113,70 @@ The agent has access to 18 tools that it calls autonomously:
113
113
  | `codebase_map` | High-level project structure overview |
114
114
  | `diagnostic` | Run lint/typecheck/test/build validation pipeline |
115
115
  | `git_info` | Structured git status, log, diff, and branch info |
116
+ | `background_run` | Run a shell command in the background (returns task ID) |
117
+ | `task_status` | Check status of background tasks |
118
+ | `task_output` | Read output from a background task |
119
+ | `task_stop` | Stop a running background task |
120
+ | `sub_agent` | Delegate a sub-task to an independent agent |
121
+ | `image_read` | Read image files (base64 + dimensions + OCR text) |
122
+ | `screenshot` | Capture screen or window to file |
123
+ | `ocr` | Extract text from images (supports region cropping/zoom) |
124
+
125
+ ### Parallel Execution & Sub-Agents
126
+
127
+ The agent can run multiple operations in parallel:
128
+
129
+ ```
130
+ You: oa "run the test suite and lint checks in parallel, then fix any issues"
131
+
132
+ Agent: [Turn 1] background_run(command="npm test") → task-1
133
+ [Turn 2] background_run(command="npm run lint") → task-2
134
+ [Turn 3] task_status() → task-1: running, task-2: completed
135
+ [Turn 4] task_output(task_id="task-2") → 3 lint errors
136
+ [Turn 5] file_edit(...) → fix lint errors
137
+ [Turn 6] task_output(task_id="task-1") → all tests pass
138
+ [Turn 7] task_complete(summary="Fixed lint, tests pass")
139
+ ```
140
+
141
+ Sub-agents can be delegated independent tasks:
142
+
143
+ ```
144
+ Agent: [Turn 1] sub_agent(task="refactor auth module", background=true) → task-3
145
+ [Turn 2] sub_agent(task="add pagination to users API") → completed
146
+ [Turn 3] task_output(task_id="task-3") → auth refactored
147
+ ```
148
+
149
+ ### Image & Visual Context
150
+
151
+ Drag-and-drop image files onto the terminal to provide visual context:
152
+
153
+ ```bash
154
+ # Drop an image file path while agent is working → injected as context
155
+ # Drop an image file path at idle prompt → agent describes and analyzes it
156
+ ```
157
+
158
+ The agent can also take screenshots and extract text via OCR:
159
+
160
+ ```
161
+ Agent: [Turn 1] screenshot(region="active") → captured window
162
+ [Turn 2] ocr(path="/tmp/screenshot.png") → extracted text
163
+ [Turn 3] image_read(path="mockup.png") → base64 + OCR text
164
+ ```
165
+
166
+ ### Mid-Task Steering
167
+
168
+ While the agent is working (shown by the `+` prompt), you can type to add context:
169
+
170
+ ```
171
+ > fix the auth bug
172
+ ⎿ 📄 Read: src/auth.ts
173
+ + also check the session handling ← typed while agent works
174
+ ↪ Context added: also check the session handling
175
+ ⎿ 🔍 Search: session
176
+ ⎿ ✏️ Edit: src/auth.ts
177
+ ```
178
+
179
+ Press `Ctrl+C` to abort the current task. Slash commands (`/model`, `/help`) work during active tasks.
116
180
 
117
181
  ### Self-Learning
118
182