@zhixuan92/multi-model-agent 4.5.4 → 4.6.0

Files changed (59)

package/README.md +6 -3
package/dist/http/async-dispatch.d.ts.map +1 -1
package/dist/http/async-dispatch.js +21 -16
package/dist/http/async-dispatch.js.map +1 -1
package/dist/http/execution-context.d.ts.map +1 -1
package/dist/http/execution-context.js +12 -9
package/dist/http/execution-context.js.map +1 -1
package/dist/http/handler-deps.d.ts +0 -6
package/dist/http/handler-deps.d.ts.map +1 -1
package/dist/http/handlers/control/batch.d.ts.map +1 -1
package/dist/http/handlers/control/batch.js +50 -0
package/dist/http/handlers/control/batch.js.map +1 -1
package/dist/http/handlers/control/context-blocks.d.ts +0 -2
package/dist/http/handlers/control/context-blocks.d.ts.map +1 -1
package/dist/http/handlers/control/context-blocks.js +3 -1
package/dist/http/handlers/control/context-blocks.js.map +1 -1
package/dist/http/handlers/control/retry.js.map +1 -1
package/dist/http/handlers/tools/audit.d.ts.map +1 -1
package/dist/http/handlers/tools/audit.js +1 -11
package/dist/http/handlers/tools/audit.js.map +1 -1
package/dist/http/handlers/tools/debug.d.ts.map +1 -1
package/dist/http/handlers/tools/debug.js +1 -11
package/dist/http/handlers/tools/debug.js.map +1 -1
package/dist/http/handlers/tools/delegate.d.ts.map +1 -1
package/dist/http/handlers/tools/delegate.js +1 -11
package/dist/http/handlers/tools/delegate.js.map +1 -1
package/dist/http/handlers/tools/execute-plan.d.ts.map +1 -1
package/dist/http/handlers/tools/execute-plan.js +1 -11
package/dist/http/handlers/tools/execute-plan.js.map +1 -1
package/dist/http/handlers/tools/investigate.d.ts.map +1 -1
package/dist/http/handlers/tools/investigate.js +1 -11
package/dist/http/handlers/tools/investigate.js.map +1 -1
package/dist/http/handlers/tools/research.d.ts.map +1 -1
package/dist/http/handlers/tools/research.js +1 -11
package/dist/http/handlers/tools/research.js.map +1 -1
package/dist/http/handlers/tools/retry.d.ts.map +1 -1
package/dist/http/handlers/tools/retry.js +6 -16
package/dist/http/handlers/tools/retry.js.map +1 -1
package/dist/http/handlers/tools/review.d.ts.map +1 -1
package/dist/http/handlers/tools/review.js +1 -11
package/dist/http/handlers/tools/review.js.map +1 -1
package/dist/http/request-observability.d.ts.map +1 -1
package/dist/http/request-observability.js +6 -8
package/dist/http/request-observability.js.map +1 -1
package/dist/http/server.d.ts.map +1 -1
package/dist/http/server.js +20 -42
package/dist/http/server.js.map +1 -1
package/dist/skills/mma-audit/SKILL.md +38 -25
package/dist/skills/mma-context-blocks/SKILL.md +22 -1
package/dist/skills/mma-debug/SKILL.md +38 -25
package/dist/skills/mma-delegate/SKILL.md +103 -11
package/dist/skills/mma-execute-plan/SKILL.md +101 -2
package/dist/skills/mma-explore/SKILL.md +21 -5
package/dist/skills/mma-investigate/SKILL.md +62 -38
package/dist/skills/mma-research/SKILL.md +52 -3
package/dist/skills/mma-retry/SKILL.md +102 -3
package/dist/skills/mma-review/SKILL.md +38 -25
package/dist/skills/multi-model-agent/SKILL.md +1 -1
package/package.json +2 -2

package/dist/skills/mma-review/SKILL.md CHANGED

@@ -10,7 +10,7 @@ when_to_use: >-
   AND mmagent is running. Delegate so each file reviews on its own worker; the
   main agent only decides what to merge. Review on SOURCE CODE — use mma-audit
   for prose specs / configs.
-version: 4.5.4
+version: 4.6.0
 ---
 # mma-review
@@ -90,41 +90,54 @@ BATCH_ID=$(echo "$BATCH" | jq -r '.batchId')
 @include _shared/response-shape.md
-## Reading the findings (3.10.5+)
+## Reading the findings
-The terminal envelope's `results[N].annotatedFindings` is a list of structured
-findings the reviewer extracted and scored from the implementer's narrative.
-Every finding has the same shape:
+The main agent reads `completed` + `message` + `findings` — the findings are the answer. For
+read-only routes, `filesChanged` is always `[]` and `commitSha` is always `null`.
+```json
+{
+  "completed": true,
+  "message": "Review complete; 3 findings.",
+  "findings": [
+    { "id": "F1", "severity": "critical", "category": "test-gap",
+      "claim": "login.ts has no test for null username edge case.",
+      "evidence": "Worker read login.ts and grepped for test files — no null-case test found.",
+      "suggestion": "Add test case: `login(null) throws ValidationError`.",
+      "source": "reviewer" }
+  ],
+  "filesChanged": [],
+  "commitSha": null,
+  "summary": "...",
+  "telemetry": { ... }
+}
+```
+### Finding shape
+Every finding has this shape:
 | Field | Type | Notes |
 |---|---|---|
-| `id` | string | Reviewer-assigned, e.g. `F1`, `F2`. |
+| `id` | string | Worker-assigned, e.g. `F1`, `F2`. Stable across chain. |
 | `severity` | `'critical' \| 'high' \| 'medium' \| 'low'` | 4-tier. |
+| `category` | string | Topical bucket, e.g. `test-gap`, `cross-file-ripple`. |
 | `claim` | string | One-sentence summary. |
-| `evidence` | string ≥20 chars | Quoted from worker output when grounded. |
+| `evidence` | string ≥20 chars | Verbatim from source when grounded. |
 | `suggestion?` | string | Optional fix recommendation. |
-| `annotatorConfidence` | `number \| null` | 0–100 from the reviewer; `null` when emitted via deterministic fallback. |
-| `evidenceGrounded` | boolean | True when `evidence` is a verbatim substring of worker output. |
-### Verdict states (`qualityReviewVerdict`)
+| `source` | `'implementer' \| 'reviewer'` | Who produced the finding. |
-- `'annotated'` — every finding is structured. May be reviewer-emitted (with
-  numeric `annotatorConfidence`) or deterministic-fallback (with
-  `annotatorConfidence: null`). The route ALWAYS reaches `'annotated'` unless
-  the reviewer call itself fails transport.
-- `'error'` — only when the reviewer call fails transport (network / 5xx).
+`annotatorConfidence` and `evidenceGrounded` are retired — they were v4 fields with no producers.
 ### Recommended rendering by the main agent
-1. Show ALL findings — never silently drop. Confidence and grounding are
-   soft signals, not gates.
-2. Default sort: severity (critical → low) then `annotatorConfidence` desc
-   (nulls last).
-3. `severity` is the reviewer's authoritative final value — use it directly.
-4. Mark findings with `evidenceGrounded: false` or
-   `annotatorConfidence < 70` as "lower-trust" (collapsed section, lighter
-   color, or `(low confidence)` annotation). User decides what to do.
-5. Severity-tier counts feed the dashboard via V3 `findingsBySeverity`.
+1. Show ALL findings — never silently drop. Severity and grounding are soft
+   signals, not gates.
+2. Default sort: severity (critical → low), then `id` ascending.
+3. `severity` is the authoritative value — use it directly.
+4. Mark findings with `evidence` shorter than 30 chars as "low-evidence"
+   (lighter color or `(low evidence)` annotation). User decides what to do.
+5. Severity-tier counts feed the dashboard.
 @include _shared/budget-defaults.md
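
The rendering rules added in this hunk (sort by severity then `id`, flag short `evidence`, count by severity tier) can be sketched as follows. This is a minimal illustration, not the package's implementation: the field names come from the "Finding shape" table, while `sortFindings`, `renderFinding`, `countBySeverity`, and the numeric-aware `id` comparison (so that `F2` sorts before `F10`) are assumptions introduced here.

```typescript
// Field names follow the "Finding shape" table in the 4.6.0 SKILL.md.
// All function and constant names below are hypothetical.
type Severity = "critical" | "high" | "medium" | "low";

interface Finding {
  id: string;
  severity: Severity;
  category: string;
  claim: string;
  evidence: string;
  suggestion?: string;
  source: "implementer" | "reviewer";
}

// Lower rank sorts first: critical -> low.
const SEVERITY_RANK: Record<Severity, number> = {
  critical: 0,
  high: 1,
  medium: 2,
  low: 3,
};

// Threshold from rendering rule 4; measured in UTF-16 code units (assumption).
const LOW_EVIDENCE_CHARS = 30;

// Rule 2: severity first, then id ascending. Returns a new array.
// Numeric collation is an assumption so that "F2" precedes "F10".
function sortFindings(findings: Finding[]): Finding[] {
  return [...findings].sort(
    (a, b) =>
      SEVERITY_RANK[a.severity] - SEVERITY_RANK[b.severity] ||
      a.id.localeCompare(b.id, undefined, { numeric: true }),
  );
}

// Rules 1 and 4: every finding is rendered; short evidence is only annotated.
function renderFinding(f: Finding): string {
  const tag = f.evidence.length < LOW_EVIDENCE_CHARS ? " (low evidence)" : "";
  return `[${f.severity}] ${f.id} ${f.claim}${tag}`;
}

// Rule 5: per-tier counts for a dashboard.
function countBySeverity(findings: Finding[]): Record<Severity, number> {
  const counts: Record<Severity, number> = { critical: 0, high: 0, medium: 0, low: 0 };
  for (const f of findings) counts[f.severity] += 1;
  return counts;
}
```

A caller would typically pass the envelope's `findings` array through `sortFindings` and map the result through `renderFinding`. Nothing is filtered out, which matches the "never silently drop" rule.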

package/dist/skills/multi-model-agent/SKILL.md CHANGED

@@ -11,7 +11,7 @@ when_to_use: >-
   tasks — AND mmagent is running. Read this once, pick the matching mma-* skill,
   and delegate there. Applies equally whether the user invoked a superpowers
   methodology skill or asked directly.
-version: 4.5.4
+version: 4.6.0
 ---
 # multi-model-agent (router)

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@zhixuan92/multi-model-agent",
-  "version": "4.5.4",
+  "version": "4.6.0",
   "type": "module",
   "license": "MIT",
   "description": "Standalone HTTP server for multi-model-agent. Routes tool-invocation work to Claude, Codex, or OpenAI-compatible sub-agents with async-polling REST dispatch and installable skills for Claude Code, Gemini CLI, Codex CLI, and Cursor.",
@@ -53,7 +53,7 @@
   },
   "dependencies": {
     "@asteasolutions/zod-to-openapi": "^8.5.0",
-    "@zhixuan92/multi-model-agent-core": "^4.5.4",
+    "@zhixuan92/multi-model-agent-core": "^4.6.0",
     "gray-matter": "^4.0.3",
     "minimist": "^1.2.8",
     "proper-lockfile": "^4.1.2",