gossipcat 0.4.28 → 0.4.31
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +13 -8
- package/dist-dashboard/assets/{index-LJgwsQh9.js → index-CLlV_738.js} +17 -17
- package/dist-dashboard/index.html +1 -1
- package/dist-mcp/mcp-server.js +4157 -2870
- package/docs/HANDBOOK.md +12 -0
- package/package.json +2 -2
package/README.md
CHANGED
|
@@ -7,7 +7,7 @@
|
|
|
7
7
|
</p>
|
|
8
8
|
|
|
9
9
|
<p align="center">
|
|
10
|
-
<a href="https://www.npmjs.com/package/gossipcat"><img src="https://img.shields.io/npm/v/gossipcat?color=0ea5e9" alt="npm version" /></a>
|
|
10
|
+
<a href="https://www.npmjs.com/package/gossipcat"><img src="https://img.shields.io/npm/v/gossipcat?color=0ea5e9&v=0430" alt="npm version" /></a>
|
|
11
11
|
<a href="https://www.npmjs.com/package/gossipcat"><img src="https://img.shields.io/npm/dw/gossipcat?color=0ea5e9" alt="npm weekly downloads" /></a>
|
|
12
12
|
<a href="https://github.com/gossipcat-ai/gossipcat-ai/blob/master/LICENSE"><img src="https://img.shields.io/badge/license-MIT-blue" alt="MIT License" /></a>
|
|
13
13
|
<a href="#quickstart"><img src="https://img.shields.io/badge/node-22%2B-green" alt="Node 22+" /></a>
|
|
@@ -85,15 +85,14 @@ The core difference: gossipcat verifies findings against actual `file:line` cita
|
|
|
85
85
|
|
|
86
86
|
## Real-world session
|
|
87
87
|
|
|
88
|
-
What a typical gossipcat session looks like in practice (2026-
|
|
88
|
+
What a typical gossipcat session looks like in practice (2026-05-22, v0.4.30 ship):
|
|
89
89
|
|
|
90
|
-
- **
|
|
91
|
-
- **
|
|
92
|
-
- **1
|
|
93
|
-
- **
|
|
94
|
-
- **1 hallucination caught** on haiku-researcher — extrapolated from a 50-row sample when the full dataset was needed; `hallucination_caught` signal recorded, accuracy score updated, no fix shipped on bad data
|
|
90
|
+
- **1 feature shipped end-to-end** — consensus auto-verify (PR #448, master commit `4b28a1c`, 1255+ LOC, 50/50 new tests, zero regressions). Full design ↔ ship arc through gossipcat itself: 6 consensus rounds on the spec before any code was written.
|
|
91
|
+
- **6 consensus rounds on the spec** caught **21+ HIGH-severity defects** across rev-1 → rev-6 — including a double-dispatch bug on the `run()` path (rev-3, opus-implementer), a phantom `AgentTeam` type the implementer would have invented if shipped (rev-5, sonnet-reviewer grep-grounded against live `AgentConfig`), and a `metadata` field that didn't exist on `ConsensusSignal` at all (rev-4, sonnet — `metadata` lives only on `MetaSignal` / `PipelineSignal`). Each round produced a measurable rev: rev-1 had 4 HIGHs, rev-6 had 0.
|
|
92
|
+
- **1 post-merge bug caught by a pre-existing drift test** — `signal-allowlist-drift.test.ts:108` (which exists precisely to catch this — same failure mode as PR #329's silent-drop of `transport_failure`) flagged that the implementer added the 2 new signals to `KNOWN_SIGNALS` + the type union + `OPERATIONAL_SIGNAL_NAMES` but missed `VALID_CONSENSUS_SIGNALS` in `performance-writer.ts`. 7-line fix landed in the same PR before merge.
|
|
93
|
+
- **Methodology lesson recorded** — sonnet-reviewer's habit of grepping cited file:line against live code (rather than trusting prose claims) is what ultimately closed the spec. Recorded as a `citation_grounding` agreement signal so the agent's pattern compounds across sessions.
|
|
95
94
|
|
|
96
|
-
|
|
95
|
+
Nothing landed without cross-review. Two agents got `+/-` score adjustments based on what they caught vs. what they missed. The spec is now usable as a worked example of what 6 rounds of multi-agent design review looks like — `docs/superpowers/specs/2026-05-21-consensus-auto-verify-design.md`.
|
|
97
96
|
|
|
98
97
|
<br/>
|
|
99
98
|
|
|
@@ -149,6 +148,12 @@ Real-time view of tasks, consensus reports, agent scores, and activity feed. Ter
|
|
|
149
148
|
Per-agent cognitive memory persists across sessions. Agents remember past findings, patterns, and project context.
|
|
150
149
|
</td>
|
|
151
150
|
</tr>
|
|
151
|
+
<tr>
|
|
152
|
+
<td align="center" colspan="3">
|
|
153
|
+
<h3>Auto-Verify (v0.4.30)</h3>
|
|
154
|
+
Opt-in. Every UNVERIFIED finding gets <code>file_read</code>-checked by a verifier agent before the report is returned. <code>tag</code> stays <code>'unverified'</code> — auto-verify is metadata, not state transition. Flag: <code>GOSSIP_CONSENSUS_AUTO_VERIFY_UNVERIFIED=1</code>.
|
|
155
|
+
</td>
|
|
156
|
+
</tr>
|
|
152
157
|
</table>
|
|
153
158
|
|
|
154
159
|
<br/>
|