@matheuskrumenauer/tanya 0.9.0-beta.0 → 0.11.0-beta.0

package/README.md CHANGED
@@ -174,6 +174,7 @@ tanya runs # show recent run logs with cost/status
  tanya video presets # list available video presets
  tanya video one-terminal-simctl # generate the exact transparent terminal asset
  tanya providers test # provider smoke test
+ tanya mcp serve # expose Tanya verifier/run/skills over MCP stdio
  tanya doctor # local environment check
  ```
 
@@ -188,6 +189,7 @@ model:
  /verify # print the deterministic verifier report for the cwd
  /cost # show persisted token usage and estimated cost
  /memory --limit 5 # list recent golden-task memory
+ /mcp # list connected MCP servers and tools
  ```
 
  Project-local commands live in `.tania/commands/*.{js,ts,sh}` and appear in
@@ -227,6 +229,44 @@ children.
  See [docs/sub-agents.md](./docs/sub-agents.md) for permission inheritance,
  budget-ledger semantics, cancellation, verifier composition, and memory rollup.
 
+ ## MCP integration
+
+ Tanya can consume external Model Context Protocol servers and expose Tanya's own
+ verifier and memory primitives to MCP-speaking clients.
+
+ Client configuration is allowlist-only. User-global servers are read from
+ `~/.tanya/mcp.json` with a fallback read of `~/.tania/mcp.json`; project servers
+ live in `.tania/mcp.json` and override same-named user servers. Connected tools
+ are registered as normal Tanya tools named `mcp:<server>:<tool>`, so permission
+ rules, audit logging, truncation, and verifier visibility apply exactly as they
+ do for native tools.
+
+ ```json
+ {
+   "version": 1,
+   "servers": [
+     {
+       "name": "filesystem",
+       "transport": "stdio",
+       "command": "npx",
+       "args": ["-y", "@modelcontextprotocol/server-filesystem", "."]
+     }
+   ]
+ }
+ ```
+
+ Use `/mcp` in the REPL to inspect connected servers. Use `tanya mcp serve` to
+ start Tanya's MCP server over stdio; it exposes `tanya.verify`,
+ `tanya.golden_task_search`, `tanya.run`, and `tanya.skills_list`.
+
+ MCP servers are untrusted code. Tanya refuses undeclared servers, gates every
+ MCP tool call through the permission engine, captures stdio server stderr under
+ `.tania/mcp/logs/`, restarts crashed servers up to three times, and rejects
+ schema-invalid tool responses before they reach model history.
+
+ See [docs/mcp.md](./docs/mcp.md) for the full schema, transports, server tools,
+ and security model.
+
  ## Multi-model routing
 
  Tanya can route each agent step to a different provider/model. Planning and
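
Reviewer note on the MCP hunk above: the README states that project servers in `.tania/mcp.json` override same-named user servers and that connected tools are registered as `mcp:<server>:<tool>`. The sketch below illustrates one way those two rules could behave. It is not Tanya's implementation: the `McpServer` and `McpConfig` types, the `mergeServers` and `toolName` helpers, the `notes` server, and the `read_file` tool name are all hypothetical, and only the field names from the JSON example, the override rule, and the naming pattern come from the diff.

```typescript
// Hypothetical types modelled on the JSON example in the README diff.
interface McpServer {
  name: string;
  transport: string;
  command: string;
  args?: string[];
}

interface McpConfig {
  version: number;
  servers: McpServer[];
}

// Assumed merge rule: a project entry replaces a user entry with the same
// name; entries that exist in only one file are kept unchanged.
function mergeServers(user: McpConfig, project: McpConfig): McpServer[] {
  const byName = new Map<string, McpServer>();
  for (const server of user.servers) byName.set(server.name, server);
  for (const server of project.servers) byName.set(server.name, server);
  return Array.from(byName.values());
}

// Builds a tool name following the `mcp:<server>:<tool>` pattern.
function toolName(server: string, tool: string): string {
  return `mcp:${server}:${tool}`;
}

// Stand-in for the user-global file; the "notes" server is made up.
const userConfig: McpConfig = {
  version: 1,
  servers: [
    {
      name: "filesystem",
      transport: "stdio",
      command: "npx",
      args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
    },
    { name: "notes", transport: "stdio", command: "notes-server" },
  ],
};

// Stand-in for the project file, matching the README example.
const projectConfig: McpConfig = {
  version: 1,
  servers: [
    {
      name: "filesystem",
      transport: "stdio",
      command: "npx",
      args: ["-y", "@modelcontextprotocol/server-filesystem", "."],
    },
  ],
};

const mergedServers = mergeServers(userConfig, projectConfig);
console.log(mergedServers.map((server) => server.name));
console.log(toolName("filesystem", "read_file"));
```

Under these assumptions the project definition of `filesystem` (rooted at `.`) wins over the user-global one, while the user-only `notes` server is still allowed through.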
@@ -256,6 +296,26 @@ up to `TANYA_ESCALATION_CAP` per session.
  See [docs/routing.md](./docs/routing.md) for schema, examples, context-window
  guards, per-tool model overrides, and sub-agent model pins.
 
+ ## Reasoning models
+
+ Reasoning routes such as `deepseek-reasoner`, `qwen3-thinking-*`, and
+ `grok-3-reasoning` are handled as a separate stream. Tanya archives reasoning to
+ `.tania/runs/<runId>/reasoning.jsonl`, emits `reasoning_chunk` events, and keeps
+ assistant history reasoning-free so replay and verifier inputs stay stable.
+
+ Reasoning tokens appear separately in `/cost` and `/budget`. Route rules can set
+ `reasoningCap.maxTokens`; built-in defaults are 2k for planning-like turns and
+ 8k for synthesis/verification/reasoning turns. If the cap is exceeded, Tanya
+ emits `reasoning_truncated` and asks the model to finish.
+
+ Use `/memory --reasoning <runId>` to inspect archived reasoning. Use
+ `TANYA_HIDE_REASONING=1` to hide reasoning from the human UI while preserving
+ JSONL/Cosmo events. Verifier reasoning annotations are off by default; enable
+ them with `--verbose-verifier` or `TANYA_VERIFIER_INCLUDE_REASONING=1`.
+
+ See [docs/reasoning.md](./docs/reasoning.md) for provider notes, billing math,
+ budget defaults, and UX modes.
+
  `--verify` adds required verification commands to the run context. Tanya must run and report each exact command before finishing the coding task.
 