npm - agentic-sdlc-wizard - Versions diffs - 1.65.0 → 1.67.0 - Mend

agentic-sdlc-wizard 1.65.0 → 1.67.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/.claude-plugin/marketplace.json +1 -1
package/.claude-plugin/plugin.json +1 -1
package/CHANGELOG.md +38 -0
package/CLAUDE_CODE_SDLC_WIZARD.md +4 -2
package/README.md +2 -0
package/package.json +1 -1
package/skills/update/SKILL.md +5 -15

package/.claude-plugin/marketplace.json CHANGED Viewed

@@ -13,7 +13,7 @@
       "name": "sdlc-wizard",
       "source": ".",
       "description": "SDLC enforcement for AI agents — TDD, planning, self-review, CI shepherd",
-      "version": "1.65.0",
+      "version": "1.67.0",
       "author": {
         "name": "Stefan Ayala"
       },

package/.claude-plugin/plugin.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "sdlc-wizard",
-  "version": "1.65.0",
+  "version": "1.67.0",
   "description": "SDLC enforcement for AI agents — TDD, planning, self-review, CI shepherd",
   "author": {
     "name": "Stefan Ayala",

package/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,44 @@ All notable changes to the SDLC Wizard.
 > **Note:** This changelog is for humans to read. Don't manually apply these changes - just run the wizard ("Check for SDLC wizard updates") and it handles everything automatically.
+## [1.67.0] - 2026-05-04
+### Closed (paperwork-stale roadmap rows)
+- **ROADMAP #99 — AutoGPT integration audit** ✅ DONE 2026-05-04 with NO-GO verdict. Research write-up at `.reviews/research-99-autogpt.md`. Three blockers: (a) AutoGPT is now an agent platform/framework — same layer as Claude Code / Codex / OpenCode, not a target for SDLC enforcement; (b) no hook primitive — AutoGPT's "blocks" system is workflow composition, not pre-tool-call enforcement, so the wizard's TDD/seam/prompt hooks have no place to live; (c) audience mismatch — AutoGPT users build continuous-service agents, not interactive SWE workflows. If a real demand signal ever surfaces, the right layering is "AutoGPT agent invokes Claude Code as a sub-tool" — that inherits the wizard for free without an AutoGPT port. **5/5 external-product audits NO-GO** (continues #76 Promptfoo, #77 constrain-to-playbook, #235 Thoughtworks AI Evals, #95 Nous Research).
+### Files
+- `.reviews/research-99-autogpt.md` (new — research write-up, force-added past `.reviews/` gitignore)
+- `ROADMAP.md` (#99 marked DONE with verdict reference)
+- `CHANGELOG.md`, `SDLC.md`, `skills/update/SKILL.md`, `package.json`, `.claude-plugin/plugin.json` + `marketplace.json`, `CLAUDE_CODE_SDLC_WIZARD.md` (1.66.0 → 1.67.0)
+### Notes
+Zero code changes. Same pattern as v1.65.0 + v1.66.0 paperwork closes.
+## [1.66.0] - 2026-05-04
+### Closed (paperwork-stale roadmap rows)
+- **ROADMAP #95 — Nous Research competitive audit** ✅ DONE 2026-05-04 with NO-GO verdict. Research write-up at `.reviews/research-95-nous.md`. Nous Research builds open-weights LLMs (Hermes), agent frameworks (Hermes Agent), RL environments (atropos), and distributed-training infra (Psyche) — different layer of the stack from SDLC enforcement. Hermes Agent is the same layer as Claude Code / Codex CLI / OpenCode (a *target* for the wizard, not a competitor); the OpenCode port (#9) is the right vehicle if anyone ever wants the wizard to run against a self-hosted Nous endpoint. Pattern continues with #76 (Promptfoo NO-GO), #77 (constrain-to-playbook NO-GO), #235 (Thoughtworks AI Evals NO-GO): external-product audits keep validating the wizard's niche.
+- **`docs/codex-near-top` cross-reference** ✅ DONE 2026-05-04 in PR #309. Surfaced `codex-sdlc-wizard` sibling at the top of README.md (after the tagline, before Install) and `CLAUDE_CODE_SDLC_WIZARD.md` (after the "What This Is" intro), so users on OpenAI's Codex CLI find the alternative without scrolling 250+/500+ lines to the Ecosystem section. Two new doc-consistency tests (head -30 / head -50 grep) keep the callout from drifting out of the top fold. OpenCode sibling intentionally not mentioned yet (per maintainer — bootstrap shipping in a different session).
+- **GitHub issue #308 (API features review)** ✅ CLOSED 2026-05-04. 4/4 entries audited (`gh issue view 308#issuecomment-4375055751`): zero wizard changes needed. Sonnet 1M-beta retirement (2026-04-30) only affects Sonnet 4.5 and Sonnet 4 — wizard's `sonnet[1m]` mixed-mode tier resolves to Sonnet 4.6 which keeps 1M GA per [API release notes](https://platform.claude.com/docs/en/release-notes/api.md). Rate Limits API + Memory for Managed Agents are different products. Haiku 3 retirement is irrelevant — `grep -ri "haiku.3\|claude-3-haiku"` returned zero hits across the wizard.
+### Files
+- `.reviews/research-95-nous.md` (new — research write-up, force-added past `.reviews/` gitignore matching #206 + #235 precedent)
+- `ROADMAP.md` (#95 marked DONE with verdict reference)
+- `README.md` + `CLAUDE_CODE_SDLC_WIZARD.md` (codex sibling callout near top, shipped in PR #309)
+- `tests/test-doc-consistency.sh` (2 new top-of-doc grep tests for codex callout, shipped in PR #309)
+- `CHANGELOG.md`, `SDLC.md`, `skills/update/SKILL.md`, `package.json`, `.claude-plugin/plugin.json` + `marketplace.json`, `CLAUDE_CODE_SDLC_WIZARD.md` (1.65.0 → 1.66.0)
+### Notes
+Zero code changes (research + cross-reference docs only). Backlog after this release: 1 open issue (#302 user-level setup skill — design-blocked) + ROADMAP top items #212 (multi-day), #9 OpenCode (separate session per maintainer).
 ## [1.65.0] - 2026-05-04
 ### Closed (paperwork-stale roadmap rows)

package/CLAUDE_CODE_SDLC_WIZARD.md CHANGED Viewed

@@ -22,6 +22,8 @@ You set it up, Claude follows it. The magic is that structured human engineering
 **The result:** Claude follows a disciplined engineering process automatically. You just review and approve.
+> **Built for Claude Code.** Using OpenAI's Codex CLI instead? See [`codex-sdlc-wizard`](https://github.com/BaseInfinity/codex-sdlc-wizard) — same SDLC enforcement, ported.
 ---
 ## The Vision
@@ -2974,7 +2976,7 @@ If deployment fails or post-deploy verification catches issues:
 **SDLC.md:**
 ```markdown
-<!-- SDLC Wizard Version: 1.65.0 -->
+<!-- SDLC Wizard Version: 1.67.0 -->
 <!-- Setup Date: [DATE] -->
 <!-- Completed Steps: step-0.1, step-0.2, step-0.4, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
 <!-- Git Workflow: [PRs or Solo] -->
@@ -4053,7 +4055,7 @@ Walk through updates? (y/n)
 Store wizard state in `SDLC.md` as metadata comments (invisible to readers, parseable by Claude):
 ```markdown
-<!-- SDLC Wizard Version: 1.65.0 -->
+<!-- SDLC Wizard Version: 1.67.0 -->
 <!-- Setup Date: 2026-01-24 -->
 <!-- Completed Steps: step-0.1, step-0.2, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
 <!-- Git Workflow: PRs -->

package/README.md CHANGED Viewed

@@ -4,6 +4,8 @@ A **self-evolving Software Development Life Cycle (SDLC) enforcement system for
 **Built on 15+ years of software engineering and founding engineering experience** — battle-tested patterns from real production systems, baked into an AI agent that follows tried-and-true software quality practices so you don't have to enforce them manually.
+> **Built for Claude Code.** Using OpenAI's Codex CLI instead? Check out [`codex-sdlc-wizard`](https://github.com/BaseInfinity/codex-sdlc-wizard) — same SDLC enforcement, ported. ([Full ecosystem](#xdlc-ecosystem-sibling-projects).)
 ## Install
 **Requires [Claude Code](https://docs.anthropic.com/en/docs/claude-code/overview)** (Anthropic's CLI for Claude).

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "agentic-sdlc-wizard",
-  "version": "1.65.0",
+  "version": "1.67.0",
   "description": "SDLC enforcement for Claude Code — hooks, skills, and wizard setup in one command",
   "bin": {
     "sdlc-wizard": "cli/bin/sdlc-wizard.js"

package/skills/update/SKILL.md CHANGED Viewed

@@ -93,9 +93,11 @@ Parse CHANGELOG entries between the user's installed version and latest. Present
 ```
 Installed: 1.42.0
-Latest:    1.65.0
+Latest:    1.67.0
 What changed:
+- [1.67.0] roadmap hygiene — closed #99 (AutoGPT integration audit — NO-GO; AutoGPT is an agent host like Claude Code / Codex / OpenCode, no hook primitive maps, audience is agent-builders not SWE workflow). Research write-up at `.reviews/research-99-autogpt.md`. 5/5 external-product audits NO-GO (continues #76 + #77 + #235 + #95).
+- [1.66.0] roadmap hygiene — closed #95 (Nous Research competitive audit — NO-GO; different layer of the stack, Nous builds open-weights LLMs + agent frameworks, we enforce SDLC process; pattern continues with #76 + #77 + #235 NO-GOs). Research write-up at `.reviews/research-95-nous.md`. Also rolls in PR #309 (codex sibling callout near top of README + wizard doc) and `gh issue close 308` (4/4 API entries audited, zero wizard changes — Sonnet 1M retirement only affects 4.5/4, our mixed-mode tier pins to 4.6).
 - [1.65.0] roadmap hygiene — closed paperwork-stale rows #210 (Node 24 false-green test, already shipped in PR #217) and #235 (Thoughtworks AI Evals research — NO-GO verdict, methodology already implemented under different naming; pattern continues with #76 + #77 NO-GOs). Research write-up at `.reviews/research-235-ai-evals.md`. No code changes.
 - [1.64.0] XDLC ecosystem cross-references — README, wizard doc, and ROADMAP now cross-reference all three sibling packages (`agentic-sdlc-wizard`, `codex-sdlc-wizard`, `claude-gdlc-wizard`). New "Ecosystem (Sibling Projects)" section in README. 3 new doc-consistency tests prevent drift.
 - [1.63.0] cache-cost observability closeout (#204 absorbed by #220) — `tests/test-token-spike.sh` gains explicit cache-miss regression test + negative-control test. SDLC skill + wizard doc gain "Cache-Cost Surprises" sections covering 10-20× silent cost blowups (mid-session CLAUDE.md edits, idle pruning, upstream cache bugs) and detection via `hooks/token-spike-check.sh`'s `costly_tokens` metric.
@@ -106,20 +108,8 @@ What changed:
 - [1.58.0] ground-truth gate for E2E benchmark (#96 Phase 2) — `tests/e2e/ground-truth.sh` runs `npm test` post-sim; final score capped at 5 if tests fail. Catches "agent followed protocol but produced broken code"
 - [1.57.0] de-coach E2E benchmark prompt (#96 Phase 1) — remove answer-key leakage that saturated benchmark scores at 10/10; new neutral task framing measures organic SDLC behavior
 - [1.56.0] community feature-discovery fetcher (#207) — `tests/e2e/fetch-community.sh` pulls Reddit + HN; pipe to `scan-community.sh` to surface candidate /slash-commands
-- [1.55.0] shrink weekly-update.yml dead code (#231 Phase 4) — delete env.VERSION_SCENARIO + has_overlap/overlap_paths outputs + placeholder/parse steps; 289 → 161 lines (-44%)
-- [1.54.0] delete check-updates Claude-ranker (#231 Phase 3d) — weekly-update.yml is now zero-API-spend; auto-update PR body links the manual analyze-release.md command
-- [1.53.0] delete scan-community cron (#231 Phase 3c) — manual `claude --print` invocation of analyze-community.md
-- [1.52.0] delete community-e2e-test cron (#231 Phase 3b) — manual local-shepherd review of scan-community digest
-- [1.51.0] delete version-test cron (#231 Phase 3a) — manual local-Max replacement via npm i + local-shepherd
-- [1.50.0] local-shepherd.sh --strip-paths flag (#231 Phase 2 — replaces deleted prove-it-test cron)
-- [1.49.0] local-shepherd.sh --compare-baseline flag (#230)
-- [1.48.0] SKILL.md trim — token bloat audit phase 2 follow-up
-- [1.47.0] Codex review progress wrapper (#259)
-- [1.46.1] npx check surfaces dangling+enabled plugin state (#266)
-- [1.46.0] PreCompact dry-run env vars (#240)
-- [1.45.0] PreCompact path (c) — SHA-ancestry self-heal (#257)
-- [1.44.1] Autocompact compound-misconfig detection (#207)
-... (full entries from fetched CHANGELOG)
+- [1.55.0–1.51.0] ROADMAP #231 weekly-update.yml dismantle (Phase 2 → Phase 4): five releases, each deleting a cron-API job in favor of manual local-Max replacements. End state: weekly-update.yml shrunk from ~1670 → 161 lines (-90%), zero API spend, only `check-updates` release detection survives.
+... (older entries omitted — read the full CHANGELOG.md for anything pre-1.51.0)
 ```
 Read the actual entries from the fetched CHANGELOG; don't paraphrase. The user wants to see exactly what shipped.