@botbotgo/agent-harness 0.0.184 → 0.0.185

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -47,6 +47,17 @@
47
47
  - **Extend:** drop `tool({...})` modules and SKILL trees under `resources/`, wire shared tools and MCP in catalogs, and let agents whitelist what they use.
48
48
  - **Built into the runtime:** persisted `requests`, `sessions`, `approvals`, and `events`; recovery and queueing; streaming listeners; MCP in/out; LangChain v1 and DeepAgents adapters—so you do not rebuild that layer per app.
49
49
 
50
+ ## Current Public Surface
51
+
52
+ The repository now ships more than a thin runtime bootstrap. The public surface is already broad enough that the homepage and docs need to describe it as a product runtime with external protocol boundaries, not only as YAML plus tools.
53
+
54
+ - **Core runtime API:** `createAgentHarness`, `request`, `subscribe`, `resolveApproval`, inspection helpers, and stable persisted runtime records for `requests`, `sessions`, `approvals`, `events`, and artifacts.
55
+ - **Runtime memory and evidence:** `memorize`, `recall`, `listMemories`, memory policy hooks, `listArtifacts`, `getArtifact`, `exportEvaluationBundle`, `replayEvaluationBundle`, and request/session evidence export helpers.
56
+ - **Protocol and transport surfaces:** `createAcpServer`, `serveAcpStdio`, `serveAcpHttp`, `serveA2aHttp`, `serveAgUiHttp`, and `createRuntimeMcpServer` / `serveRuntimeMcpOverStdio`.
57
+ - **Governed workspace runtime:** YAML-owned routing, concurrency, maintenance, MCP policy, runtime governance bundles, and approval defaults for sensitive memory or write-like MCP side effects.
58
+
59
+ `deepagents-acp` is part of the current public story, not only a direction hidden in planning notes: `agent-harness` is moving toward a standard runtime boundary that external clients can talk to directly while the harness keeps persistence, recovery, approvals, and operator control runtime-owned.
60
+
50
61
  ## What Problem We Solve
51
62
 
52
63
  In one line: `agent-harness` productizes the runtime work that usually appears after the demo.
@@ -127,13 +138,13 @@ The runtime provides:
127
138
  - YAML-defined workspace assembly for routing, models, tools, stores, backends, MCP, recovery, and maintenance
128
139
  - backend-adapted execution with current LangChain v1 and DeepAgents adapters
129
140
  - local `resources/tools/` `tool({...})` modules and `resources/skills/` discovery
130
- - persisted threads, runs, approvals, events, queue state, and recovery metadata
141
+ - persisted sessions, requests, approvals, events, queue state, and recovery metadata
131
142
 
132
143
  In practice, the harness exists for the parts that are expensive and repetitive to rebuild inside every agent app:
133
144
 
134
145
  - approval inboxes and human decision flow
135
- - persisted runs, threads, and inspectable event history
136
- - run correlation, continuity, and recovery inspection that still works after a stream fallback or restart
146
+ - persisted requests, sessions, and inspectable event history
147
+ - request correlation, continuity, and recovery inspection that still works after a stream fallback or restart
137
148
  - runtime-managed recovery after interrupts, failures, or process restart
138
149
  - queueing, concurrency, maintenance, and operational policy
139
150
  - stable runtime records that stay usable even if the backend changes
@@ -189,17 +200,17 @@ Most agent tooling stops at execution. Production software does not.
189
200
 
190
201
  Real products need a runtime that can answer harder questions:
191
202
 
192
- - Where do runs live?
203
+ - Where do requests live?
193
204
  - How are approvals resolved?
194
205
  - What survives process restart?
195
- - How do you inspect threads and events without exposing raw backend state?
206
+ - How do you inspect sessions and events without exposing raw backend state?
196
207
  - How do you swap backend implementations without rewriting the product model?
197
208
 
198
209
  `agent-harness` is the layer that answers those questions without turning your application API into a mirror of LangChain v1 or DeepAgents.
199
210
 
200
211
  ## What Makes It Different
201
212
 
202
- - It treats `runs`, `threads`, `approvals`, `events`, and recovery as first-class product records
213
+ - It treats `requests`, `sessions`, `approvals`, `events`, and recovery as first-class product records
203
214
  - It gives operators a runtime control surface instead of exposing raw backend internals
204
215
  - It keeps observability and governance runtime-owned with trace correlation, continuity metadata, and approval defaults for sensitive side effects
205
216
  - It keeps checkpoint resume system-managed instead of promoting checkpoint internals into the primary API
@@ -211,7 +222,7 @@ Real products need a runtime that can answer harder questions:
211
222
  Use `agent-harness` when:
212
223
 
213
224
  - you already know your product needs agents, tools, prompts, or MCP access, but the missing layer is runtime operations
214
- - you need approvals, restart recovery, queueing, or inspectable run records as part of the shipped product
225
+ - you need approvals, restart recovery, queueing, or inspectable request records as part of the shipped product
215
226
  - you want one workspace-shaped assembly model instead of hand-written runtime bootstrapping in every app
216
227
  - you want to keep backend execution semantics upstream while holding the product contract stable
217
228
 
@@ -300,9 +311,9 @@ If you want the shortest possible mental model:
300
311
  - LangChain v1 and DeepAgents backend adaptation
301
312
  - Auto-discovered local `tool({...})` tools and SKILL packages
302
313
  - provider-native tools, MCP tools, and workspace-local tool modules
303
- - persisted threads, runs, approvals, lifecycle events, and queued runs
314
+ - persisted sessions, requests, approvals, lifecycle events, and queued requests
304
315
  - runtime-managed recovery and checkpoint maintenance
305
- - structured output and multimodal content preservation in run results
316
+ - structured output and multimodal content preservation in request results
306
317
  - MCP bridge support for agent-declared MCP servers
307
318
  - MCP server support for exposing harness tools outward
308
319
  - optional `mem0` semantic recall augmentation over canonical SQLite durable memory
@@ -658,7 +669,7 @@ Important fields:
658
669
  `maintenance.checkpoints` and `maintenance.records` are separate retention layers:
659
670
 
660
671
  - `maintenance.checkpoints` trims backend checkpoint state used for resume/recovery
661
- - `maintenance.records` trims harness-owned terminal thread/run records stored in `runtime.sqlite`
672
+ - `maintenance.records` trims harness-owned terminal session/request records stored in `runtime.sqlite`
662
673
 
663
674
  Example:
664
675
 
@@ -845,7 +856,7 @@ spec:
845
856
  systemPrompt: Answer simple requests directly.
846
857
  ```
847
858
 
848
- When `config.filesystem.sessionStorage.enabled: true` is set for a LangChain binding, the runtime keeps one filesystem root per persisted session/thread and reuses the same runnable cache entry for repeated work on that session instead of collapsing every run onto one shared workspace directory.
859
+ When `config.filesystem.sessionStorage.enabled: true` is set for a LangChain binding, the runtime keeps one filesystem root per persisted session and reuses the same runnable cache entry for repeated work on that session instead of collapsing every request onto one shared workspace directory.
849
860
 
850
861
  Example orchestra host:
851
862
 
package/README.zh.md CHANGED
@@ -47,6 +47,17 @@
47
47
  - **扩展:** 在 `resources/` 下放置 `tool({...})` 模块与 SKILL 目录,在目录里声明共享工具与 MCP,再由各 agent 按名字白名单启用。
48
48
  - **内建运行时:** 持久化 `requests`、`sessions`、`approvals` 与 `events`;恢复与排队;流式监听;MCP 接入与对外暴露;LangChain v1 与 DeepAgents 适配——避免每个应用重复造这一层。
49
49
 
50
+ ## 当前公开能力面
51
+
52
+ 这个仓库现在公开交付的已经不只是一个很薄的 runtime bootstrap。对外能力已经足够大,首页和文档也应该把它描述成一个带外部协议边界的产品级 runtime,而不只是 “YAML + tools”。
53
+
54
+ - **核心 runtime API:** `createAgentHarness`、`request`、`subscribe`、`resolveApproval`、各类 inspection helper,以及稳定持久化的 `requests`、`sessions`、`approvals`、`events` 和 artifacts 记录。
55
+ - **运行时 memory 与证据能力:** `memorize`、`recall`、`listMemories`、memory policy hooks、`listArtifacts`、`getArtifact`、`exportEvaluationBundle`、`replayEvaluationBundle`,以及 request / session 级证据导出 helper。
56
+ - **协议与传输接面:** `createAcpServer`、`serveAcpStdio`、`serveAcpHttp`、`serveA2aHttp`、`serveAgUiHttp`、以及 `createRuntimeMcpServer` / `serveRuntimeMcpOverStdio`。
57
+ - **受治理的工作区运行时:** 由 YAML 持有的路由、并发、维护、MCP 策略、runtime governance bundles,以及针对敏感 memory 或写类 MCP 副作用的默认审批门槛。
58
+
59
+ `deepagents-acp` 已不应该只藏在规划文档里。它已经是当前公开产品叙事的一部分:`agent-harness` 正在朝一个可被外部客户端直接接入的标准 runtime boundary 演进,同时继续把持久化、恢复、审批和 operator control 保持为 harness 自有职责。
60
+
50
61
  ## 我们解决什么问题
51
62
 
52
63
  一句话概括:`agent-harness` 把通常在 demo 之后才暴露出来的运行时工作,提前做成产品 runtime 的一部分。
@@ -127,13 +138,13 @@ AI 让 agent 逻辑、工具调用和工作流代码更容易生成,真正更
127
138
  - 以 YAML 描述的工作区装配:路由、模型、工具、存储、后端、MCP、恢复与维护等
128
139
  - 通过适配器对接当前的 LangChain v1 与 DeepAgents 执行
129
140
  - 本地 `resources/tools/` 中 `tool({...})` 工具模块与 `resources/skills/` 的发现
130
- - 持久化的线程、运行、审批、事件、队列状态与恢复元数据
141
+ - 持久化的会话、请求、审批、事件、队列状态与恢复元数据
131
142
 
132
143
  落到实际系统里,harness 主要解决那些每个 agent 应用都不应该重复造一遍的运行时难题:
133
144
 
134
145
  - 审批收件箱与人工决策流
135
- - 持久化的 runsthreads 与可查询事件历史
136
- - 即使发生 stream fallback 或进程重启也还能成立的 run 关联、连续性与恢复观测
146
+ - 持久化的 requestssessions 与可查询事件历史
147
+ - 即使发生 stream fallback 或进程重启也还能成立的 request 关联、连续性与恢复观测
137
148
  - 中断、失败或进程重启后的运行时托管恢复
138
149
  - 队列、并发、维护与运维策略
139
150
  - 即使后端变更也保持稳定的运行时记录模型
@@ -186,17 +197,17 @@ AI 让 agent 逻辑、工具调用和工作流代码更容易生成,真正更
186
197
 
187
198
  真实产品需要能回答更难问题的运行时:
188
199
 
189
- - 运行(runs)存在哪里?
200
+ - 请求(requests)存在哪里?
190
201
  - 审批如何闭环?
191
202
  - 进程重启后什么还在?
192
- - 如何在不暴露原始后端状态的前提下检查线程与事件?
203
+ - 如何在不暴露原始后端状态的前提下检查会话与事件?
193
204
  - 如何在不大改产品模型的前提下替换后端实现?
194
205
 
195
206
  `agent-harness` 在不把应用 API 做成 LangChain v1 / DeepAgents 翻版的前提下,回答这些问题。
196
207
 
197
208
  ## 有何不同
198
209
 
199
- - 将 `runs`、`threads`、`approvals`、`events` 与恢复视为一等产品记录
210
+ - 将 `requests`、`sessions`、`approvals`、`events` 与恢复视为一等产品记录
200
211
  - 给运维侧提供运行时控制面,而不是暴露原始后端内部结构
201
212
  - 将可观测性与治理留在运行时:包括 trace correlation、continuity metadata,以及高风险副作用的默认审批
202
213
  - 将 checkpoint 恢复作为系统管理的行为,而不是把 checkpoint 细节抬成主 API
@@ -208,7 +219,7 @@ AI 让 agent 逻辑、工具调用和工作流代码更容易生成,真正更
208
219
  下面这些场景适合用 `agent-harness`:
209
220
 
210
221
  - 你已经确定产品需要 agents、tools、prompts 或 MCP,但真正缺的是运行时运维层
211
- - 你需要把审批、重启恢复、排队调度或可查询运行记录一起作为产品能力交付出去
222
+ - 你需要把审批、重启恢复、排队调度或可查询请求记录一起作为产品能力交付出去
212
223
  - 你希望用一个 workspace 形态的装配模型取代每个应用各写一套启动和运行时胶水
213
224
  - 你想把 backend 的执行语义留在上游,同时把产品契约稳定下来
214
225
 
@@ -297,7 +308,7 @@ try {
297
308
  - LangChain v1 与 DeepAgents 后端适配
298
309
  - 自动发现本地工具与 SKILL 包
299
310
  - 原生 provider 工具、MCP 工具与工作区本地工具模块
300
- - 持久化线程、运行、审批、生命周期事件与排队运行
311
+ - 持久化会话、请求、审批、生命周期事件与排队请求
301
312
  - 运行时管理的恢复与 checkpoint 维护
302
313
  - 运行结果中的结构化输出与多模态内容保留
303
314
  - 将 agent 声明的 MCP 服务桥接进来
@@ -618,7 +629,7 @@ await stop(runtime);
618
629
  `maintenance.checkpoints` 与 `maintenance.records` 是两层独立的保留策略:
619
630
 
620
631
  - `maintenance.checkpoints` 清理后端用于 resume/recovery 的 checkpoint 状态
621
- - `maintenance.records` 清理 harness 自己保存在 `runtime.sqlite` 中、已结束的 thread/run 记录
632
+ - `maintenance.records` 清理 harness 自己保存在 `runtime.sqlite` 中、已结束的 session/request 记录
622
633
 
623
634
  示例:
624
635
 
@@ -804,7 +815,7 @@ spec:
804
815
  systemPrompt: Answer simple requests directly.
805
816
  ```
806
817
 
807
- 当 LangChain 绑定启用 `config.filesystem.sessionStorage.enabled: true` 时,runtime 会为每个持久化 session/thread 维护独立的 filesystem 根目录,并按 session 复用 runnable cache,而不是把所有运行都压到同一个共享工作目录里。
818
+ 当 LangChain 绑定启用 `config.filesystem.sessionStorage.enabled: true` 时,runtime 会为每个持久化 session 维护独立的 filesystem 根目录,并按 session 复用 runnable cache,而不是把所有请求都压到同一个共享工作目录里。
808
819
 
809
820
  orchestra 主机示例:
810
821
 
@@ -1 +1 @@
1
- export declare const AGENT_HARNESS_VERSION = "0.0.183";
1
+ export declare const AGENT_HARNESS_VERSION = "0.0.184";
@@ -1 +1 @@
1
- export const AGENT_HARNESS_VERSION = "0.0.183";
1
+ export const AGENT_HARNESS_VERSION = "0.0.184";
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@botbotgo/agent-harness",
3
- "version": "0.0.184",
3
+ "version": "0.0.185",
4
4
  "description": "Workspace runtime for multi-agent applications",
5
5
  "type": "module",
6
6
  "packageManager": "npm@10.9.2",