@zhixuan92/multi-model-agent 3.10.5 → 3.10.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -287,7 +287,7 @@ Full design rationale: [DIRECTION.md](https://github.com/zhixuan312/multi-model-
287
287
 
288
288
  ## What's new
289
289
 
290
- Latest: **3.10.4** — review stages were recording the implementer's model (V3 R3 violation root cause). Now record the actual reviewer's resolved tier + model. Plus: telemetry validation is fully warn-only events never drop, and cross-field warnings now include actual offending values (model, tokens, totals) so config issues vs lifecycle bugs are distinguishable at a glance. Full history: [CHANGELOG](https://github.com/zhixuan312/multi-model-agent/blob/master/CHANGELOG.md).
290
+ Latest: **3.10.6** — Skill docs now match executor code. Only `mma-delegate` accepts `agentType` per task every other route hardcodes its tier and rejects `agentType` with HTTP 400 (`/execute-plan` → standard; `/audit` / `/review` / `/debug` / `/verify` / `/investigate` → complex). The router skill previously claimed `mma-execute-plan` accepts `agentType` (false) and that `mma-verify` defaults to standard (also false; defaults to complex). Also added a new "Reasoning effort: auto-inferred" section documenting the `inferEffort()` heuristics. Full history: [CHANGELOG](https://github.com/zhixuan312/multi-model-agent/blob/master/CHANGELOG.md).
291
291
 
292
292
  ## Full documentation
293
293
 
@@ -8,7 +8,7 @@ when_to_use: >-
8
8
  User asks for a doc/spec/config audit OR a methodology skill
9
9
  (superpowers:dispatching-parallel-agents, /security-review) points at one AND
10
10
  mmagent is running. Audit on PROSE/SPEC docs — use mma-review for source code.
11
- version: 3.10.5
11
+ version: 3.10.6
12
12
  ---
13
13
 
14
14
  # mma-audit
@@ -57,6 +57,8 @@ Send a document or set of files to workers for structured auditing. Each file is
57
57
 
58
58
  Either `document` or `filePaths` (or both) must be provided.
59
59
 
60
+ > Worker tier for `mma-audit` is hardcoded to `complex` and is not caller-configurable. Sending `agentType` is rejected with HTTP 400.
61
+
60
62
  ## Full example
61
63
 
62
64
  ```bash
@@ -12,7 +12,7 @@ when_to_use: >-
12
12
  `proposedInterpretation` is a hard gate — the batch is paused, not
13
13
  informational. The batch will not complete until the caller responds. Treating
14
14
  it as advisory is the clarification-as-info anti-pattern (AP5).
15
- version: 3.10.5
15
+ version: 3.10.6
16
16
  ---
17
17
 
18
18
  # mma-clarifications
@@ -12,7 +12,7 @@ when_to_use: >-
12
12
  Register once here, then pass the ID via `contextBlockIds` on mma-delegate /
13
13
  mma-execute-plan / mma-audit / mma-review / mma-verify / mma-debug /
14
14
  mma-investigate. Cheaper and faster than inlining the same content N times.
15
- version: 3.10.5
15
+ version: 3.10.6
16
16
  ---
17
17
 
18
18
  # mma-context-blocks
@@ -10,7 +10,7 @@ when_to_use: >-
10
10
  read files, reproduce, trace — OR a methodology skill
11
11
  (superpowers:systematic-debugging) points at the investigation step. Delegate
12
12
  the read/reproduce/trace; the main agent stays on the hypothesis and the fix.
13
- version: 3.10.5
13
+ version: 3.10.6
14
14
  ---
15
15
 
16
16
  # mma-debug
@@ -63,6 +63,8 @@ Submit a problem, context, and hypothesis to a worker for focused debugging. Unl
63
63
  | `filePaths` | string[] | no | All files investigated together (cross-file reasoning) |
64
64
  | `contextBlockIds` | string[] | no | IDs from `mma-context-blocks` (e.g. error logs, traces) |
65
65
 
66
+ > Worker tier for `mma-debug` is hardcoded to `complex` and is not caller-configurable. Sending `agentType` is rejected with HTTP 400.
67
+
66
68
  ## Full example
67
69
 
68
70
  ```bash
@@ -11,7 +11,7 @@ when_to_use: >-
11
11
  and keep main context free. If a plan file exists → use mma-execute-plan. If
12
12
  the task is audit / review / verify / debug / investigate → use the matching
13
13
  specialized skill.
14
- version: 3.10.5
14
+ version: 3.10.6
15
15
  ---
16
16
 
17
17
  # mma-delegate
@@ -10,7 +10,7 @@ when_to_use: >-
10
10
  superpowers:subagent-driven-development / superpowers:executing-plans —
11
11
  workers are cheaper and don't pollute main context. Task descriptors must
12
12
  match plan headings verbatim.
13
- version: 3.10.5
13
+ version: 3.10.6
14
14
  ---
15
15
 
16
16
  # mma-execute-plan
@@ -68,7 +68,7 @@ Dispatch named tasks from a plan file to workers. Each `tasks` string must match
68
68
 
69
69
  @include _shared/verify-and-review.md
70
70
 
71
- > **No `agentType` here.** Worker tier is set by the plan and per-route defaults. For ad-hoc work where you need direct tier control, use `mma-delegate`.
71
+ > **No `agentType` here.** Worker tier is hardcoded to `standard` for every plan task; sending `agentType` (top-level or per-task) is rejected with HTTP 400. For tasks that need `complex` tier, dispatch via `mma-delegate` with the plan task as the prompt and `agentType: "complex"`.
72
72
 
73
73
  If the batch reaches `awaiting_clarification`, use `mma-clarifications` to confirm or correct the proposed interpretation.
74
74
 
@@ -12,7 +12,7 @@ when_to_use: >-
12
12
  git-history queries. OR you are about to read 3+ files / run any grep in main
13
13
  context — that's the inline-labor-leakage anti-pattern (AP2); delegate to this
14
14
  skill instead.
15
- version: 3.10.5
15
+ version: 3.10.6
16
16
  ---
17
17
 
18
18
  # mma-investigate
@@ -10,7 +10,7 @@ when_to_use: >-
10
10
  you want to re-try the failed indices only. Prefer this over re-dispatching
11
11
  the whole batch or inline-retrying — it's idempotent and preserves the
12
12
  original batch's diagnostics.
13
- version: 3.10.5
13
+ version: 3.10.6
14
14
  ---
15
15
 
16
16
  # mma-retry
@@ -10,7 +10,7 @@ when_to_use: >-
10
10
  AND mmagent is running. Delegate so each file reviews on its own worker; the
11
11
  main agent only decides what to merge. Review on SOURCE CODE — use mma-audit
12
12
  for prose specs / configs.
13
- version: 3.10.5
13
+ version: 3.10.6
14
14
  ---
15
15
 
16
16
  # mma-review
@@ -60,6 +60,8 @@ Send code files to workers for structured review. Each file is reviewed independ
60
60
 
61
61
  Either `code` or `filePaths` (or both) must be provided.
62
62
 
63
+ > Worker tier for `mma-review` is hardcoded to `complex` and is not caller-configurable. Sending `agentType` is rejected with HTTP 400.
64
+
63
65
  ## Full example
64
66
 
65
67
  ```bash
@@ -10,7 +10,7 @@ when_to_use: >-
10
10
  against implemented work BEFORE claiming success. Delegate so each checklist
11
11
  item gets independent evidence-gathering on a worker. Use this BEFORE saying
12
12
  "done" — never after.
13
- version: 3.10.5
13
+ version: 3.10.6
14
14
  ---
15
15
 
16
16
  # mma-verify
@@ -61,6 +61,8 @@ Submit work product and a checklist to workers for independent verification. Eac
61
61
  | `filePaths` | string[] | no | Files to verify against (workers can read them) |
62
62
  | `contextBlockIds` | string[] | no | IDs from `mma-context-blocks` (e.g. the spec the work was supposed to satisfy) |
63
63
 
64
+ > Worker tier for `mma-verify` is hardcoded to `complex` and is not caller-configurable. Sending `agentType` is rejected with HTTP 400.
65
+
64
66
  ## Full example
65
67
 
66
68
  ```bash
@@ -11,7 +11,7 @@ when_to_use: >-
11
11
  tasks — AND mmagent is running. Read this once, pick the matching mma-* skill,
12
12
  and delegate there. Applies equally whether the user invoked a superpowers
13
13
  methodology skill or asked directly.
14
- version: 3.10.5
14
+ version: 3.10.6
15
15
  ---
16
16
 
17
17
  # multi-model-agent (router)
@@ -156,13 +156,34 @@ Every request requires `Authorization: Bearer $MMAGENT_AUTH_TOKEN`. The token ro
156
156
 
157
157
  ## Worker tier: `agentType`
158
158
 
159
- `mma-delegate` and `mma-execute-plan` accept `agentType: "standard" | "complex"`. Default is `"standard"` (cheaper, faster). Pick `"complex"` when:
159
+ Only `mma-delegate` accepts `agentType: "standard" | "complex"` per task — default `"standard"` (cheaper, faster). Pick `"complex"` when:
160
160
 
161
161
  - The task touches many files or requires multi-step reasoning a standard-tier model cannot hold in context.
162
162
  - A prior standard run came back with `filesWritten: 0` or `incompleteReason: "turn_cap"` / `"cost_cap"` / `"timeout"`.
163
163
  - The task is security-sensitive or ambiguous enough that being wrong is costly.
164
164
 
165
- `mma-audit`, `mma-review`, `mma-debug`, `mma-investigate` already default to complex; `mma-verify` already defaults to standard. These are not caller-configurable.
165
+ Every other route hardcodes its tier and rejects `agentType` with HTTP 400:
166
+
167
+ | Route | Hardcoded tier |
168
+ |---|---|
169
+ | `mma-execute-plan` | `standard` |
170
+ | `mma-audit` | `complex` |
171
+ | `mma-review` | `complex` |
172
+ | `mma-debug` | `complex` |
173
+ | `mma-verify` | `complex` |
174
+ | `mma-investigate` | `complex` |
175
+
176
+ If you need `complex` tier on plan-style work, dispatch via `mma-delegate` with the plan task as the prompt and `agentType: "complex"`.
177
+
178
+ ## Reasoning effort: auto-inferred
179
+
180
+ Independent of tier, every task runs through `inferEffort(prompt)` (`run-tasks/index.ts`) when `effort` is undefined:
181
+
182
+ - Code block > 20 lines in the prompt → `low` (treated as exact-write).
183
+ - File path + action verb (`edit`/`modify`/`update`/`fix`/`refactor`/`replace`) → `medium`.
184
+ - Otherwise → falls through to provider config default.
185
+
186
+ This is automatic and not caller-overridable from any `mma-*` skill — it shapes how hard the worker thinks within its tier.
166
187
 
167
188
  ## General flow
168
189
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@zhixuan92/multi-model-agent",
3
- "version": "3.10.5",
3
+ "version": "3.10.6",
4
4
  "type": "module",
5
5
  "license": "MIT",
6
6
  "description": "Standalone HTTP server for multi-model-agent. Routes tool-invocation work to Claude, Codex, or OpenAI-compatible sub-agents with async-polling REST dispatch and installable skills for Claude Code, Gemini CLI, Codex CLI, and Cursor.",
@@ -52,7 +52,7 @@
52
52
  },
53
53
  "dependencies": {
54
54
  "@asteasolutions/zod-to-openapi": "^8.5.0",
55
- "@zhixuan92/multi-model-agent-core": "^3.10.5",
55
+ "@zhixuan92/multi-model-agent-core": "^3.10.6",
56
56
  "gray-matter": "^4.0.3",
57
57
  "minimist": "^1.2.8",
58
58
  "proper-lockfile": "^4.1.2",