@hegemonart/get-design-done 1.57.1 → 1.57.2
- package/.claude-plugin/marketplace.json +26 -41
- package/.claude-plugin/plugin.json +23 -48
- package/CHANGELOG.md +91 -0
- package/README.md +166 -511
- package/SKILL.md +2 -0
- package/agents/README.md +33 -36
- package/agents/a11y-mapper.md +3 -3
- package/agents/component-benchmark-harvester.md +6 -6
- package/agents/component-benchmark-synthesizer.md +3 -3
- package/agents/compose-executor.md +3 -3
- package/agents/cost-forecaster.md +2 -2
- package/agents/design-auditor.md +7 -7
- package/agents/design-authority-watcher.md +15 -15
- package/agents/design-context-builder.md +4 -4
- package/agents/design-context-checker-gate.md +1 -1
- package/agents/design-discussant.md +2 -2
- package/agents/design-doc-writer.md +1 -1
- package/agents/design-executor.md +2 -2
- package/agents/design-figma-writer.md +2 -2
- package/agents/design-fixer.md +7 -7
- package/agents/design-integration-checker-gate.md +1 -1
- package/agents/design-integration-checker.md +1 -1
- package/agents/design-paper-writer.md +3 -3
- package/agents/design-pencil-writer.md +1 -1
- package/agents/design-planner.md +21 -0
- package/agents/design-reflector.md +39 -39
- package/agents/design-research-synthesizer.md +1 -0
- package/agents/design-start-writer.md +1 -1
- package/agents/design-update-checker.md +5 -5
- package/agents/design-verifier-gate.md +1 -1
- package/agents/design-verifier.md +52 -48
- package/agents/ds-generator.md +2 -2
- package/agents/ds-migration-planner.md +4 -4
- package/agents/email-executor.md +9 -9
- package/agents/experiment-result-ingester.md +3 -3
- package/agents/flutter-executor.md +5 -5
- package/agents/gdd-graph-refresh.md +3 -3
- package/agents/gdd-intel-updater.md +2 -2
- package/agents/motion-mapper.md +2 -2
- package/agents/motion-verifier.md +4 -4
- package/agents/pdf-executor.md +8 -8
- package/agents/perf-analyzer.md +17 -17
- package/agents/pr-commenter.md +9 -9
- package/agents/prototype-gate.md +2 -2
- package/agents/quality-gate-runner.md +1 -1
- package/agents/rollout-coordinator.md +3 -3
- package/agents/swift-executor.md +4 -4
- package/agents/ticket-sync-agent.md +6 -6
- package/agents/user-research-synthesizer.md +2 -2
- package/connections/connections.md +44 -45
- package/connections/cursor.md +73 -0
- package/connections/preview.md +3 -3
- package/dist/claude-code/.claude/skills/cache-manager/SKILL.md +3 -3
- package/dist/claude-code/.claude/skills/cache-manager/cache-policy.md +1 -1
- package/dist/claude-code/.claude/skills/design/SKILL.md +19 -0
- package/dist/claude-code/.claude/skills/explore/SKILL.md +11 -0
- package/dist/claude-code/.claude/skills/figma-write/SKILL.md +13 -2
- package/dist/claude-code/.claude/skills/paper-write/SKILL.md +54 -0
- package/dist/claude-code/.claude/skills/pencil-write/SKILL.md +54 -0
- package/dist/claude-code/.claude/skills/report-issue/SKILL.md +2 -2
- package/dist/claude-code/.claude/skills/router/SKILL.md +2 -2
- package/dist/claude-code/.claude/skills/verify/verify-procedure.md +10 -11
- package/dist/claude-code/.claude/skills/warm-cache/SKILL.md +1 -1
- package/hooks/first-run-nudge.cjs +171 -0
- package/hooks/gdd-intel-trigger.js +243 -0
- package/hooks/gdd-mcp-circuit-breaker.js +62 -7
- package/hooks/gdd-precompact-snapshot.js +50 -29
- package/hooks/gdd-protected-paths.js +150 -18
- package/hooks/gdd-risk-gate.js +93 -1
- package/hooks/gdd-sessionstart-recap.js +59 -24
- package/hooks/hooks.json +13 -4
- package/hooks/inject-using-gdd.cjs +188 -0
- package/hooks/update-check.cjs +511 -0
- package/package.json +9 -2
- package/reference/STATE-TEMPLATE.md +10 -13
- package/reference/audit-scoring.md +1 -1
- package/reference/cache-tier-doctrine.md +46 -0
- package/reference/config-schema.md +9 -9
- package/reference/i18n.md +1 -1
- package/reference/intel-schema.md +37 -2
- package/reference/meta-rules.md +4 -4
- package/reference/model-tiers.md +2 -2
- package/reference/registry.json +101 -94
- package/reference/runtime-models.md +11 -1
- package/reference/shared-preamble.md +13 -14
- package/reference/skill-graph.md +24 -1
- package/scripts/bootstrap.cjs +373 -0
- package/scripts/injection-patterns.cjs +58 -0
- package/scripts/lib/apply-reflections/incubator-proposals.cjs +57 -26
- package/scripts/lib/install/converters/codex-plugin.cjs +5 -2
- package/scripts/lib/install/converters/cursor.cjs +20 -0
- package/scripts/lib/issue-reporter/report-flow.cjs +1 -1
- package/scripts/lib/manifest/skills.json +80 -13
- package/scripts/lib/state/query-surface.cjs +67 -9
- package/scripts/lib/state/state-store.cjs +68 -26
- package/sdk/cli/commands/stage.ts +17 -0
- package/sdk/cli/index.js +14 -0
- package/skills/cache-manager/SKILL.md +3 -3
- package/skills/cache-manager/cache-policy.md +1 -1
- package/skills/design/SKILL.md +19 -0
- package/skills/explore/SKILL.md +11 -0
- package/skills/figma-write/SKILL.md +13 -2
- package/skills/paper-write/SKILL.md +54 -0
- package/skills/pencil-write/SKILL.md +54 -0
- package/skills/report-issue/SKILL.md +2 -2
- package/skills/router/SKILL.md +2 -2
- package/skills/verify/verify-procedure.md +10 -11
- package/skills/warm-cache/SKILL.md +1 -1
- package/hooks/first-run-nudge.sh +0 -82
- package/hooks/inject-using-gdd.sh +0 -72
- package/hooks/update-check.sh +0 -251
- package/scripts/lib/audit-aggregator/index.cjs +0 -219
- package/scripts/lib/hedge-ensemble.cjs +0 -217
package/.claude-plugin/marketplace.json
CHANGED
@@ -4,15 +4,15 @@
 "name": "hegemonart"
 },
 "metadata": {
-"description": "Get Design Done — 5-stage agent-orchestrated design pipeline
-"version": "1.57.
+"description": "Get Design Done — 5-stage agent-orchestrated design pipeline (Brief → Explore → Plan → Design → Verify) for AI coding agents. 61 agents, 96 skills, 42 connection integrations, two MCP servers, opt-in SQLite state backbone, bidirectional Figma write-back, and a reflector-driven self-improvement loop. Cross-runtime install for Claude Code, Codex, Cursor, OpenCode, Gemini, and more.",
+"version": "1.57.2"
 },
 "plugins": [
 {
 "name": "get-design-done",
 "source": "./",
-"description": "Agent-orchestrated 5-stage design pipeline
-"version": "1.57.
+"description": "Agent-orchestrated 5-stage design pipeline (Brief → Explore → Plan → Design → Verify) for AI coding agents. 61 specialized agents, 96 skills, 42 connection integrations (Figma, Refero, Preview, Storybook, Chromatic, Graphify, Linear, Jira, Notion, …), bidirectional Figma write-back, queryable intel store, opt-in SQLite state backbone, and a reflector-driven self-improvement loop. Two MCP servers (gdd-state for typed STATE mutators, gdd-mcp for 13 read-only project-priming tools), tier-aware routing with cost telemetry, and defense-in-depth hooks (protected paths, MCP circuit breaker, injection scanner, budget enforcer). Cross-runtime install for Claude Code, Codex, Cursor, OpenCode, Gemini, Copilot, and more.",
+"version": "1.57.2",
 "author": {
 "name": "hegemonart"
 },
@@ -21,63 +21,48 @@
 "license": "MIT",
 "category": "design",
 "keywords": [
+"claude",
+"claude-code",
+"claude-code-plugin",
+"plugin",
+"mcp",
+"mcp-server",
+"agent-sdk",
+"parallel-agents",
 "design",
+"design-system",
+"design-system-sync",
 "ui",
 "ux",
 "frontend",
 "pipeline",
 "orchestration",
-"
-"
+"figma",
+"figma-write",
 "accessibility",
 "wcag",
-"nng",
 "typography",
 "motion",
 "anti-patterns",
 "audit",
-"visual-regression",
 "storybook",
 "graphify",
-"playwright",
-"figma-write",
-"handoff",
-"claude-design",
-"pinterest",
-"bidirectional",
 "knowledge-graph",
 "intel-store",
-"
-"
-"
-"
-"
-"
-"
-"protected-paths",
-"decision-injector",
-"reference-registry",
-"mcp-circuit-breaker",
-"schema-validation",
-"cost-optimization",
-"cache-aware",
-"budget",
+"self-improvement",
+"reflection",
+"anti-slop",
+"confidence-gate",
+"skill-composition",
+"skill-graph",
+"worktree-safe",
 "headless",
 "cli",
 "codex",
+"cursor",
 "gemini",
-"
-"
-"agent-sdk",
-"figma",
-"extractor",
-"design-system-sync",
-"swift",
-"compose",
-"flutter",
-"email",
-"print",
-"pdf"
+"tested",
+"ci"
 ]
 }
 ]
package/.claude-plugin/plugin.json
CHANGED
@@ -1,8 +1,8 @@
 {
 "name": "get-design-done",
 "short_name": "gdd",
-"version": "1.57.
-"description": "Agent-orchestrated 5-stage design pipeline
+"version": "1.57.2",
+"description": "Agent-orchestrated 5-stage design pipeline (Brief → Explore → Plan → Design → Verify) for AI coding agents. 61 specialized agents, 96 skills, 42 connection integrations (Figma, Refero, Preview, Storybook, Chromatic, Graphify, Linear, Jira, Notion, …), bidirectional Figma write-back, queryable intel store for O(1) design-surface lookups, opt-in SQLite state backbone, and a reflector-driven self-improvement loop. Two MCP servers (`gdd-state` for typed STATE mutators, `gdd-mcp` for 13 read-only project-priming tools), tier-aware agent routing with cost telemetry, defense-in-depth hooks (protected paths, MCP circuit breaker, injection scanner, budget enforcer), and a cross-runtime install layer for Claude Code, Codex, Cursor, OpenCode, Gemini, Copilot, and more.",
 "author": {
 "name": "hegemonart",
 "url": "https://github.com/hegemonart"
@@ -11,73 +11,48 @@
 "repository": "https://github.com/hegemonart/get-design-done",
 "license": "MIT",
 "keywords": [
+"claude",
+"claude-code",
+"claude-code-plugin",
+"plugin",
+"mcp",
+"mcp-server",
+"agent-sdk",
+"parallel-agents",
 "design",
+"design-system",
+"design-system-sync",
 "ui",
 "ux",
 "frontend",
 "pipeline",
-"
-"
+"orchestration",
+"figma",
+"figma-write",
 "accessibility",
 "wcag",
-"nng",
 "typography",
 "motion",
 "anti-patterns",
 "audit",
-"visual-regression",
 "storybook",
 "graphify",
-"playwright",
-"figma-write",
-"handoff",
-"claude-design",
-"pinterest",
-"bidirectional",
 "knowledge-graph",
 "intel-store",
 "self-improvement",
 "reflection",
-"
-"
-"
-"
-"
-"tested",
-"ci",
-"cicd",
-"release-automation",
-"gitleaks",
-"shellcheck",
-"schema-validation",
-"cost-optimization",
-"cache-aware",
-"budget",
+"anti-slop",
+"confidence-gate",
+"skill-composition",
+"skill-graph",
+"worktree-safe",
 "headless",
 "cli",
 "codex",
+"cursor",
 "gemini",
-"
-"
-"agent-sdk",
-"mcp-server",
-"context-loading",
-"cross-session",
-"figma",
-"extractor",
-"design-system-sync",
-"swift",
-"compose",
-"flutter",
-"email",
-"print",
-"pdf",
-"worktree-safe",
-"anti-slop",
-"confidence-gate",
-"anti-slop-rubric",
-"skill-composition",
-"skill-graph"
+"tested",
+"ci"
 ],
 "skills": [
 "./skills/"
package/CHANGELOG.md
CHANGED
@@ -4,6 +4,89 @@ All notable changes to get-design-done are documented here. Versions follow [sem
 
 ---
 
+## [1.57.2] - 2026-06-04
+
+### Polish wave - 24 commits closing the v1.57 audit + 5 new CI gates
+
+A single grooming wave: closes ~80% of the gap between what the plugin claims and what it does, plus structural CI gates that prevent the same drift from coming back. 5,007/5,007 tests pass; all 16 CI checks green. No breaking changes - additive only.
+
+### Fixed (8 ship-blockers)
+
+- **`scripts/injection-patterns.cjs` was excluded from the npm tarball** via `.npmignore`. `hooks/gdd-read-injection-scanner.ts` calls `createRequire()` to load it at runtime, so the scanner was a silent no-op for every shipped install (PostToolUse:Read security check disabled). Now in `package.json#files`; verified via `npm pack --dry-run`.
+- **`gdd-sdk stage audit` errored without hinting at the real top-level subcommand.** The error now suggests "did you mean `gdd-sdk audit`?" - `audit` is a top-level subcommand, not a pipeline stage.
+- **Codex Plugin Generator shipped `longDescription: "<div align=\"center\">"`** (the README's HTML wrapper). The README-first-paragraph scan now skips HTML wrappers, GitHub callouts, nav rows, and badge clusters.
+- **2 hooks (`gdd-sessionstart-recap.js`, `gdd-precompact-snapshot.js`) resolved paths at module load** via `process.cwd()` instead of `payload.cwd` - broken in worktrees. Now thread `payload.cwd` through a `computePaths()` factory.
+- **`hooks/gdd-mcp-circuit-breaker.js` substring-matched `'timeout'` / `'failed'` against the entire stringified MCP response.** False-positives on legitimate successful Figma payloads (e.g. a node literally named "TimeoutBanner" or a summary line "2 of 5 nodes failed to update"). Now uses the structured `isError` / `is_error` envelope as the primary signal and only inspects dedicated error-message fields for timeout-vs-error classification. +4 regression tests.
+- **Windows SessionStart hooks (4 `.sh` scripts) failed without Git Bash on PATH.** Ported `scripts/bootstrap.sh`, `hooks/update-check.sh`, `hooks/first-run-nudge.sh`, `hooks/inject-using-gdd.sh` to Node `.cjs` modules. `hooks/hooks.json` rewired to `node …cjs`. One source of truth, no drift surface.
+- **`hooks/gdd-protected-paths.js` had 4 bypass vectors** in `extractBashTargets()`: chained commands (`rm safe.txt && rm reference/protected.md` extracted only `safe.txt`), multi-arg destructive verbs, `$(…)` subshell substitution, `` `…` `` backtick substitution. Rewrote as a 3-pass walker that recurses into subshells, splits on `&&`/`||`/`;`/`|`, and collects ALL non-flag args per segment. +12 regression tests.
+- **`skills/figma-write/SKILL.md` used invented `<agent>design-figma-writer</agent>` dispatch syntax** that Claude Code does not parse. The entire Figma write-back skill was a silent no-op since it was first authored. Replaced with the canonical `Task("design-figma-writer", "...")` block. Added `test/suite/skill-dispatch-syntax.test.cjs` regression gate that sweeps every SKILL.md.
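
The circuit-breaker fix in the list above can be illustrated with a minimal sketch. Only the `isError` / `is_error` flags come from the changelog entry; the function name `classifyMcpResponse` and the `error.message` / `message` fields are hypothetical names chosen for illustration and may not match the hook's real code.

```javascript
// Hypothetical sketch: classify an MCP tool response from its structured
// error envelope instead of substring-matching the stringified payload.
function classifyMcpResponse(response) {
  const flagged =
    response != null && (response.isError === true || response.is_error === true);
  if (!flagged) return 'ok'; // payload text is never inspected on success

  // Only dedicated error-message fields are read (field names are assumed).
  const parts = [response.error && response.error.message, response.message];
  const message = parts.filter((p) => typeof p === 'string').join(' ');

  // Matches "timeout" and "timed out"; anything else is a generic error.
  return /\btime(?:d\s+)?out\b/i.test(message) ? 'timeout' : 'error';
}

// A successful Figma-style payload that merely mentions "Timeout" and "failed".
const success = {
  isError: false,
  content: [{ type: 'text', text: 'TimeoutBanner: 2 of 5 nodes failed to update' }],
};
console.log(classifyMcpResponse(success));
console.log(classifyMcpResponse({ is_error: true, error: { message: 'request timed out' } }));
```

Because the success path returns before any text is examined, node names or summary lines containing "timeout" or "failed" cannot trip the breaker in this sketch.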
+
+### Added (5 new CI gates - structural drift prevention)
+
+- **`npm run validate:feature-counts`** (`scripts/check-feature-counts.cjs`): walks the filesystem and asserts every shipped surface's count claims (`plugin.json` / `marketplace.json` / `README.md` / `SKILL.md`) match reality (61 agents / 96 skills / 42 connections / 13 MCP tools).
+- **`npm run validate:registry-tiers`** (`scripts/validate-registry-tiers.cjs`): asserts every `registry.json` entry's `tier` field is one of `L0|L1|L2|L3`. Detects model-tier paste-errors (haiku/sonnet/opus accidentally placed in the cache-tier slot).
+- **`npm run validate:no-internal-refs`** (`scripts/validate-no-internal-refs.cjs`): ratcheted baseline at 1,749 hits across 286 files. Fails when any file regresses beyond its baseline count of Phase NN / Plan NN-MM / .planning/ / D-NN references. `--rebaseline` ratchets after legitimate cleanup.
+- **`npm run validate:cache-tiers`** (`scripts/check-cache-tiers.cjs`): SHA-256 of `reference/meta-rules.md` + `reference/shared-preamble.md` (the L0 cache prefix imported by 58 of 62 agents). Drift fails the build. See `reference/cache-tier-doctrine.md`.
+- **`npm run validate:skill-surface`** (`test/suite/skill-surface-sync.test.cjs`): asserts every `skills/` dir on disk is documented in at least one of SKILL.md's three surfaces (argument-hint / Command Reference table / Jump Mode).
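
The ratcheted-baseline idea behind `validate:no-internal-refs` can be sketched as follows. This is an illustration under assumptions, not the script's actual logic: `findRegressions` is a hypothetical name, the baseline is assumed to be a plain file-to-count map, and the counts below are made-up sample values.

```javascript
// Hypothetical sketch of a ratcheted baseline check: a file fails only when
// its current hit count exceeds the count recorded for it in the baseline.
function findRegressions(baseline, current) {
  const regressions = [];
  for (const [file, count] of Object.entries(current)) {
    const allowed = baseline[file] || 0; // files absent from the baseline allow 0 hits
    if (count > allowed) regressions.push({ file, allowed, count });
  }
  return regressions;
}

// Made-up sample counts, for illustration only.
const baseline = { 'agents/README.md': 4, 'reference/registry.json': 90 };
const current = {
  'agents/README.md': 5, // regressed: one new internal reference
  'reference/registry.json': 12, // improved: still passes
  'skills/new/SKILL.md': 1, // new file with a hit: fails
};
console.log(findRegressions(baseline, current));
```

A per-file comparison, rather than a single global total, means cleanup in one file cannot mask a new internal reference introduced in another.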
+
+### Added (Batch D wirings - 8 aspirational features made real)
+
+- **`skills/paper-write/`** (new): paper.design canvas write-back (modes: annotate / tokenize / roundtrip). Modeled on `skills/figma-write/`.
+- **`skills/pencil-write/`** (new): pencil.dev `.pen` file write-back (modes: annotate / roundtrip). File-based - no MCP.
+- **`hooks/gdd-intel-trigger.js`** (new): PostToolUse hook that fires `scripts/build-intel.cjs --incremental` in the background when Edit/Write touches `(skills|agents|reference|source/skills)/.*\.(md|json)`. 5-minute lock dedups rapid sequential edits. Opt-out via `GDD_DISABLE_INTEL_TRIGGER=1`.
+- **`design-component-generator` dispatch** wired into `skills/design/SKILL.md`. Fires opt-in when STATE.md `<connections>` shows a generator connection available (21st-dev / magic-patterns / plasmic / builder-io / v0-dev) and the plan has a `task_type: component` task without matching `src/components/*.tsx`.
+- **`design-context-reviewer` + gate** wired into `skills/explore/` Step 2.6. Cheap Haiku gate runs first; full 9-check graph review fires only when gate says "review needed".
+- **`design-research-synthesizer` opts into peer-CLI delegation** (`delegate_to: gemini-research`) - first agent in the 61-agent fleet. Bandit posterior now collects real arm data on `(design-research-synthesizer, *, gemini)` tuples.
+- **`touches-pattern-miner` wired into `apply-reflections`**: archived task-file `Touches:` signatures now surface as auto-crystallization proposals via `discoverTouchesPatternProposals()`.
+- **`design-planner` + `design-verifier` emit JSON output contracts** per `reference/output-contracts/{planner,verifier}-decision.schema.json` (fenced ```json block before the prose body). Typed envelope consumption by `scripts/lib/parse-contract.cjs#parsePlannerDecision` / `parseVerifierDecision`. DESIGN-PLAN.md and DESIGN-VERIFICATION.md continue to include both formats.
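
The trigger decision described for `hooks/gdd-intel-trigger.js` can be sketched as a pure function. The path pattern, the 5-minute window, and the `GDD_DISABLE_INTEL_TRIGGER` variable are taken from the entry above; the function name `shouldTriggerIntelBuild`, its parameters, and the timestamp-based lock are assumptions made for illustration.

```javascript
// Path pattern quoted from the changelog entry; everything else is assumed.
const WATCHED = /(skills|agents|reference|source\/skills)\/.*\.(md|json)/;
const LOCK_MS = 5 * 60 * 1000; // 5-minute dedup window

// Decide whether an Edit/Write on filePath should start an incremental build.
// lastRunMs is the timestamp of the previous trigger, or null if there was none.
function shouldTriggerIntelBuild(filePath, lastRunMs, nowMs, env = process.env) {
  if (env.GDD_DISABLE_INTEL_TRIGGER === '1') return false; // explicit opt-out
  const normalized = String(filePath).replace(/\\/g, '/'); // tolerate Windows separators
  if (!WATCHED.test(normalized)) return false; // not a watched file
  return lastRunMs === null || nowMs - lastRunMs >= LOCK_MS;
}

// First edit fires; a second edit one minute later is deduplicated.
console.log(shouldTriggerIntelBuild('skills/design/SKILL.md', null, 0, {}));
console.log(shouldTriggerIntelBuild('skills/design/SKILL.md', 0, 60 * 1000, {}));
```

Passing the clock and environment in as arguments keeps the decision testable without touching the filesystem or the real lock file.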
+
+### Added (Batch H - Phase 57 wire-ups)
+
+- **Phase 56 calibration loop wired end-to-end.** `hooks/gdd-risk-gate.js` now calls `updateCalibration` after every scored decision (allow/block/review/require_confirmation → accepted true/false). `detectDrift` now fires from production traffic, not synthetic data. +9 regression tests.
+- **`scripts/lib/state/query-surface.cjs` backup-guard hardening**: `_safeBackup(srcPath, bakPath)` returns true only when the backup file exists AND is non-empty after `copyFileSync`. `backupCycle()`, `demigrate()`, `recover()` now gate `fs.unlinkSync` behind it. +7 regression tests covering happy path / missing source / zero-byte copy / mocked failing copy.
+- **`scripts/lib/state/state-store.cjs` `migrate()` async + JSDoc.** The underlying `migrateToSqlite` was already async; the wrapper now awaits it. Comprehensive JSDoc covers degraded paths, error semantics, idempotency, opt-in via `force`, dual-channel result shapes. +4 sqlite-path regression tests.
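
The backup guard described for `query-surface.cjs` can be approximated with the sketch below. It follows the stated contract (true only when the backup exists and is non-empty after `copyFileSync`), but `safeBackup` and `removeAfterBackup` are hypothetical re-creations, not the module's actual code.

```javascript
const fs = require('fs');

// Hypothetical re-creation of the guard: report success only when the backup
// file exists and is non-empty after the copy.
function safeBackup(srcPath, bakPath) {
  try {
    fs.copyFileSync(srcPath, bakPath);
    return fs.statSync(bakPath).size > 0;
  } catch (err) {
    return false; // missing source, failed copy, or missing backup
  }
}

// Delete the original only once the backup has been verified.
function removeAfterBackup(srcPath, bakPath) {
  if (!safeBackup(srcPath, bakPath)) return false; // keep the source untouched
  fs.unlinkSync(srcPath);
  return true;
}
```

Gating `fs.unlinkSync` this way means a zero-byte or failed copy leaves the original file in place instead of destroying the only good copy.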
+
+### Removed
+
+- **`scripts/lib/audit-aggregator/index.cjs`** (219 LOC + 15 unit tests). Shipped in Plan 23-04; zero production callers across `scripts/ sdk/ hooks/ agents/ skills/`. With Phase 56 risk-gate owning post-action consolidation and design-verifier owning audit aggregation natively, the module was dead-on-arrival.
+- **`scripts/lib/hedge-ensemble.cjs`** (AdaNormalHedge implementation). Shipped in Plan 23.5-02; never wired into `adaptive_mode='hedge'` or any production path. `isHedgeEnabled()` now always returns false.
+
+### Changed (manifests + docs)
+
+- **Manifest count claims synchronized across all 11 surfaces** to filesystem truth: 61 agents / 96 skills / 42 connections / 13 MCP tools. plugin.json + marketplace.json descriptions trimmed from 2000+ chars to ~700 (dropped sprawling per-version history - see CHANGELOG for that).
+- **README rewritten from 968 lines to 623 lines.** Removed 30 per-version "Highlights" chronicle sections (all verified present in CHANGELOG before deletion). Product surface kept verbatim; release chronicle moves to CHANGELOG.
+- **agents/README.md authoring contract cleaned**: 13 Phase NN refs + 11 Plan NN-MM refs + 3 dangling `.planning/phases/` cross-refs + 4 CONTEXT D-NN shorthand refs removed.
+- **L0 cache prefix sanitized**: `reference/shared-preamble.md` "GSD Agent" → "GDD Agent" identity (every agent's first byte block). `reference/meta-rules.md` Phase NN refs stripped from the commit-scope guidance that propagates into every user repo.
+- **`reference/STATE-TEMPLATE.md` HTML comments scrubbed** - three `<!-- Phase 25 -->` comments inside the `==== BEGIN/END TEMPLATE ====` block were being copied verbatim into every user's `.design/STATE.md` at scan entry.
+- **`reference/registry.json` description-field sweep**: ~90 "Phase NN" prefixes stripped from description fields that load into router prompts.
+- **7 schema files cleaned of Phase / .planning refs** in description fields (they ship via `generated.d.ts` to user IDE autocomplete).
+- **9 skill `description:` frontmatter fields cleaned** (router LLM-signal surface): bandit-status, openrouter-status, state, override, peers, debug, report-issue, scan, discover.
+- **`agents/design-verifier.md` H2 headings renamed** `## Phase 1..5` → `## Stage 1..5` (those headings bleed into user `.design/DESIGN-VERIFICATION.md` output and collided with internal GDD-roadmap nomenclature).
+- **`gsd-health` references** in `skills/report-issue/SKILL.md`, `source/skills/report-issue/SKILL.md`, `scripts/lib/issue-reporter/report-flow.cjs` → `/gdd:health` (the actual command).
+- **`benchmark` agents output paths** moved from `.planning/benchmarks/` to `.design/benchmarks/` (`.planning/` is the plugin's own development workspace and must never appear in a user repo's writes).
+- **`connections/connections.md` matrix stage columns** `scan|discover` → `brief|explore` (the actual current 5-stage pipeline).
+- **`runtime-models.md` added "Verification status" banner** flagging 10 of 14 entries as unverified placeholder fills (BYOK / multi-provider; Anthropic-default mapping).
+- **`reference/cache-tier-doctrine.md`** (new): codifies the L0/L1/L2/L3 cache-tier contract that the new gates enforce.
+- **`reference/intel-schema.md`** documented the `agent-tiers.json` slice and softened the misleading "kept current" claim.
+- **`agents/design-research-synthesizer.md`** + **`agents/design-planner.md`** + **`agents/design-verifier.md`** gained delegate_to / Output Contract sections (see "Added" above).
+- **42nd connection added**: `connections/cursor.md` documents the cursor install path + the documented sibling-drop limitation (Cursor `installMultiArtifact` doesn't enumerate `<skill>/<sibling>.md` files; the fix requires extending the StagedArtifact contract and is deferred with explicit doc trail).
+- **`hooks/budget-enforcer.js` → `.ts` references updated** across 11 docs files (Plan 20-13 conversion landed; docs lagged).
+- **Pipeline phrasing corrected**: `brief → plan → implement → verify` (4 surfaces) → `brief → explore → plan → design → verify` (the actual 5-stage pipeline).
+- **2 stale "Executor B pending" / "Executor A hasn't run" strings** updated to present tense (the SQLite migration shipped in v1.57.0).
+- **Composition graph edges added** for 5 high-traffic skills (compare, complete-cycle, darkmode, new-cycle, new-project) via `scripts/lib/manifest/skills.json`. `reference/skill-graph.md` regenerated.
+
+### Test infrastructure
+
+- **`startsWith('design-')` filter widened to all agents** in 5 test files. Coverage jumped from 30 design-* agents to 61 (all). `agent-frontmatter` alone went from 60 tests to 122.
+- **7 Phase 28.8 baseline skips closed.** Disambiguated "runtime" (14 Tier-1 install targets) vs "registry entry" (16 entries with 2 Tier-2 channels codex-plugin + cursor-marketplace).
+- **`insight-line.schema tier:"haiku"` paste-error fixed** to L2 + new CI gate prevents recurrence.
+
+### Security
+
+- 3 CodeQL alerts closed: `scripts/bootstrap.cjs` git arg-injection (added `--` separator + leading-`-` validation), `scripts/build-distribution-bundles.cjs` multi-character HTML strip (loop iteration + `/s` flag).
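
The two mitigations named for the git arg-injection alert (a `--` separator plus rejection of values with a leading `-`) can be sketched as below. The changelog does not say which git command `scripts/bootstrap.cjs` runs, so `clone` and the helper name `buildCloneArgs` are assumptions used only for illustration.

```javascript
// Hypothetical helper: build an argv array for git so that user-supplied
// values can never be parsed as options.
function buildCloneArgs(repoUrl, destDir) {
  for (const value of [repoUrl, destDir]) {
    // Reject anything that git could interpret as a flag.
    if (typeof value !== 'string' || value.length === 0 || value.startsWith('-')) {
      throw new Error(`refusing suspicious git argument: ${JSON.stringify(value)}`);
    }
  }
  // Everything after "--" is treated by git as a positional argument.
  return ['clone', '--', repoUrl, destDir];
}

console.log(buildCloneArgs('https://github.com/hegemonart/get-design-done', 'dest'));
```

An argv array like this would typically be passed to `child_process.execFileSync('git', args)`, which does not go through a shell, so the values are also not subject to shell interpolation.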
+
+---
+
 ## [1.57.1] - 2026-06-03
 
 ### Fixed
@@ -28,6 +111,14 @@ degrade real users who have the module). No new dependency; the markdown floor i
 no `event_id`, extra fields); `hooks/gdd-risk-gate.js` now emits the schema-correct shape, and an Ajv validation test
 guards it. The **dashboard risk column** read the wrong fields and case-mismatched the action vocabulary, so it was
 permanently blank; it is now wired and case-correct.
+- **The Phase 56 calibration loop was unwired.** `scripts/lib/risk/calibration.cjs` was a complete library - rolling-50
+window, `updateCalibration`, `detectDrift`, `recordRiskOutcome` - but no caller invoked it outside its own tests, so
+drift detection could only fire from synthetic data. `hooks/gdd-risk-gate.js` now calls `updateCalibration` on every
+scored call when the writer agent is known (`payload.agent` or `GDD_AGENT`): `block` records `accepted:false`; `allow`,
+`review`, and `require_confirmation` record `accepted:true`. The store accrues from real traffic, so `detectDrift` flags
+`under_scoring` / `over_scoring` from production behaviour. Best-effort (a calibration write never breaks a tool call);
+no-op when the agent is unknown so an "unknown" bucket cannot pool the signal. `user_undo` / `post_apply_correct` are
+left unresolved at the PreToolUse boundary by design; a later PostToolUse pass can resolve them.
 - **`budget-enforcer` PreToolUse blocks** used `message` instead of `stopReason`, so the block reason was invisible to
 the user; they now use `stopReason`. The **read-injection scanner** loads its pattern file fail-open (a missing file
 no longer crashes the hook). Three package-root walk-ups now match the scoped package name.
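
The decision-to-outcome mapping described in the 1.57.1 calibration entry above can be sketched as a small pure function. The decision names, the `accepted` values, and the `payload.agent` / `GDD_AGENT` lookup come from that entry; the function name `calibrationRecord`, the returned object shape, and the handling of unrecognized decisions are assumptions.

```javascript
// Hypothetical sketch of the mapping from a risk-gate decision to the
// calibration outcome that would be recorded for the writer agent.
function calibrationRecord(decision, payload = {}, env = process.env) {
  const agent = payload.agent || env.GDD_AGENT;
  if (!agent) return null; // unknown agent: record nothing, so no "unknown" bucket forms
  if (decision === 'block') return { agent, accepted: false };
  if (['allow', 'review', 'require_confirmation'].includes(decision)) {
    return { agent, accepted: true };
  }
  return null; // unrecognized decision: record nothing (assumption)
}

console.log(calibrationRecord('block', { agent: 'design-fixer' }, {}));
console.log(calibrationRecord('review', {}, { GDD_AGENT: 'design-executor' }));
console.log(calibrationRecord('allow', {}, {}));
```

In a real hook the resulting record would be handed to `updateCalibration` inside a try/catch, matching the entry's note that a calibration write must never break a tool call.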