@matheuskrumenauer/tanya 0.9.0-beta.0 → 0.11.0-beta.0
- package/README.md +60 -0
- package/dist/cli.js +1305 -259
- package/dist/cli.js.map +1 -1
- package/package.json +4 -2
package/README.md
CHANGED
@@ -174,6 +174,7 @@ tanya runs # show recent run logs with cost/status
 tanya video presets # list available video presets
 tanya video one-terminal-simctl # generate the exact transparent terminal asset
 tanya providers test # provider smoke test
+tanya mcp serve # expose Tanya verifier/run/skills over MCP stdio
 tanya doctor # local environment check
 ```
 
@@ -188,6 +189,7 @@ model:
 /verify # print the deterministic verifier report for the cwd
 /cost # show persisted token usage and estimated cost
 /memory --limit 5 # list recent golden-task memory
+/mcp # list connected MCP servers and tools
 ```
 
 Project-local commands live in `.tania/commands/*.{js,ts,sh}` and appear in
@@ -227,6 +229,44 @@ children.
 See [docs/sub-agents.md](./docs/sub-agents.md) for permission inheritance,
 budget-ledger semantics, cancellation, verifier composition, and memory rollup.
 
+## MCP integration
+
+Tanya can consume external Model Context Protocol servers and expose Tanya's own
+verifier and memory primitives to MCP-speaking clients.
+
+Client configuration is allowlist-only. User-global servers are read from
+`~/.tanya/mcp.json` with a fallback read of `~/.tania/mcp.json`; project servers
+live in `.tania/mcp.json` and override same-named user servers. Connected tools
+are registered as normal Tanya tools named `mcp:<server>:<tool>`, so permission
+rules, audit logging, truncation, and verifier visibility apply exactly as they
+do for native tools.
+
+```json
+{
+  "version": 1,
+  "servers": [
+    {
+      "name": "filesystem",
+      "transport": "stdio",
+      "command": "npx",
+      "args": ["-y", "@modelcontextprotocol/server-filesystem", "."]
+    }
+  ]
+}
+```
+
+Use `/mcp` in the REPL to inspect connected servers. Use `tanya mcp serve` to
+start Tanya's MCP server over stdio; it exposes `tanya.verify`,
+`tanya.golden_task_search`, `tanya.run`, and `tanya.skills_list`.
+
+MCP servers are untrusted code. Tanya refuses undeclared servers, gates every
+MCP tool call through the permission engine, captures stdio server stderr under
+`.tania/mcp/logs/`, restarts crashed servers up to three times, and rejects
+schema-invalid tool responses before they reach model history.
+
+See [docs/mcp.md](./docs/mcp.md) for the full schema, transports, server tools,
+and security model.
+
 ## Multi-model routing
 
 Tanya can route each agent step to a different provider/model. Planning and
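Note on the hunk above: the added MCP section says project servers in `.tania/mcp.json` override same-named user servers, and that connected tools are registered as `mcp:<server>:<tool>`. A minimal sketch of those two rules follows. It is not taken from Tanya's source: `McpServer`, `McpConfig`, `mergeServers`, and `toolName` are hypothetical names, the config shape is inferred only from the README's JSON example, and `read_file` is just an illustrative tool name.

```typescript
// Hypothetical sketch of the precedence and naming rules described in the
// README; none of these names come from Tanya's actual code.
interface McpServer {
  name: string;
  transport: string;
  command?: string;
  args?: string[];
}

interface McpConfig {
  version: number;
  servers: McpServer[];
}

// Merge user-global and project configs. A project server replaces a
// user server that has the same name; all other servers are kept.
function mergeServers(user: McpConfig, project: McpConfig): McpServer[] {
  const byName = new Map<string, McpServer>();
  for (const server of user.servers) byName.set(server.name, server);
  for (const server of project.servers) byName.set(server.name, server);
  return Array.from(byName.values());
}

// Build a registered tool name in the `mcp:<server>:<tool>` form.
function toolName(server: string, tool: string): string {
  return `mcp:${server}:${tool}`;
}

// Example: both files declare "filesystem"; the project entry wins.
const userConfig: McpConfig = {
  version: 1,
  servers: [
    { name: "filesystem", transport: "stdio", command: "npx",
      args: ["-y", "@modelcontextprotocol/server-filesystem", "/home"] },
  ],
};
const projectConfig: McpConfig = {
  version: 1,
  servers: [
    { name: "filesystem", transport: "stdio", command: "npx",
      args: ["-y", "@modelcontextprotocol/server-filesystem", "."] },
  ],
};

for (const server of mergeServers(userConfig, projectConfig)) {
  console.log(server.name, server.args);
}
console.log(toolName("filesystem", "read_file"));
```

Keying the merge on `name` keeps the override rule in one place; the fallback read of `~/.tania/mcp.json` would happen before this step and is not modelled here.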
@@ -256,6 +296,26 @@ up to `TANYA_ESCALATION_CAP` per session.
 See [docs/routing.md](./docs/routing.md) for schema, examples, context-window
 guards, per-tool model overrides, and sub-agent model pins.
 
+## Reasoning models
+
+Reasoning routes such as `deepseek-reasoner`, `qwen3-thinking-*`, and
+`grok-3-reasoning` are handled as a separate stream. Tanya archives reasoning to
+`.tania/runs/<runId>/reasoning.jsonl`, emits `reasoning_chunk` events, and keeps
+assistant history reasoning-free so replay and verifier inputs stay stable.
+
+Reasoning tokens appear separately in `/cost` and `/budget`. Route rules can set
+`reasoningCap.maxTokens`; built-in defaults are 2k for planning-like turns and
+8k for synthesis/verification/reasoning turns. If the cap is exceeded, Tanya
+emits `reasoning_truncated` and asks the model to finish.
+
+Use `/memory --reasoning <runId>` to inspect archived reasoning. Use
+`TANYA_HIDE_REASONING=1` to hide reasoning from the human UI while preserving
+JSONL/Cosmo events. Verifier reasoning annotations are off by default; enable
+them with `--verbose-verifier` or `TANYA_VERIFIER_INCLUDE_REASONING=1`.
+
+See [docs/reasoning.md](./docs/reasoning.md) for provider notes, billing math,
+budget defaults, and UX modes.
+
 `--verify` adds required verification commands to the run context. Tanya must run and report each exact command before finishing the coding task.
 
 `tanya benchmark run --all` currently exercises 27 executable low-to-medium regression fixtures: targeted edits, new files, dependency/lockfile updates, framework-style migrations, failing-test repair, frontend smoke checks, artifact/context reuse, streaming long-tool execution, compaction-boundary recovery, run-history logging, dirty worktrees, report repair, and the CosmoHQ mobile/backend smoke profiles.
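Note on the hunk above: the added "Reasoning models" section describes a per-route `reasoningCap.maxTokens`, built-in defaults of 2k and 8k, and a `reasoning_truncated` event when the cap is exceeded. The sketch below shows one way that logic could look. It is an illustration under assumptions, not Tanya's implementation: `resolveCap`, `capReasoning`, the `TurnKind` values, and the event payload fields are hypothetical, and "2k"/"8k" are assumed to mean 2000 and 8000 tokens.

```typescript
// Hypothetical sketch; only `reasoningCap.maxTokens`, `reasoning_chunk`,
// and `reasoning_truncated` are names that appear in the README.
type TurnKind = "planning" | "synthesis" | "verification" | "reasoning";

interface RouteRule {
  reasoningCap?: { maxTokens: number };
}

// Assumption: "2k" and "8k" in the README mean 2000 and 8000 tokens.
const DEFAULT_CAPS: Record<TurnKind, number> = {
  planning: 2000,
  synthesis: 8000,
  verification: 8000,
  reasoning: 8000,
};

// A route rule's explicit cap wins; otherwise use the built-in default.
function resolveCap(kind: TurnKind, rule?: RouteRule): number {
  return rule?.reasoningCap?.maxTokens ?? DEFAULT_CAPS[kind];
}

interface ReasoningChunk {
  text: string;
  tokens: number;
}

type ReasoningEvent =
  | { type: "reasoning_chunk"; text: string; tokens: number }
  | { type: "reasoning_truncated"; usedTokens: number; cap: number };

// Forward chunks as events until the running token total exceeds the cap,
// then emit a single truncation event and stop.
function capReasoning(chunks: ReasoningChunk[], cap: number): ReasoningEvent[] {
  const events: ReasoningEvent[] = [];
  let used = 0;
  for (const chunk of chunks) {
    used += chunk.tokens;
    if (used > cap) {
      events.push({ type: "reasoning_truncated", usedTokens: used, cap });
      break;
    }
    events.push({ type: "reasoning_chunk", text: chunk.text, tokens: chunk.tokens });
  }
  return events;
}

// Example: a planning turn with no route override uses the default cap.
const cap = resolveCap("planning");
const events = capReasoning(
  [
    { text: "outline the change", tokens: 1500 },
    { text: "consider edge cases", tokens: 900 },
  ],
  cap,
);
console.log(events.map((event) => event.type));
```

How Tanya then "asks the model to finish" is not specified in the README, so that step is left out of the sketch.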