agentic-sdlc-wizard 1.18.0 → 1.20.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md
CHANGED
|
@@ -4,6 +4,25 @@ All notable changes to the SDLC Wizard.
|
|
|
4
4
|
|
|
5
5
|
> **Note:** This changelog is for humans to read. Don't manually apply these changes - just run the wizard ("Check for SDLC wizard updates") and it handles everything automatically.
|
|
6
6
|
|
|
7
|
+
## [1.20.0] - 2026-03-31
|
|
8
|
+
|
|
9
|
+
### Added
|
|
10
|
+
- CC version-pinned update gate — E2E tests run actual new CC version, not bundled binary (#46)
|
|
11
|
+
- Tier 1 E2E flakiness fix — regression threshold 1.5→3.0, absorbs ±2-3 point LLM variance (#47)
|
|
12
|
+
- Flaky test prevention guidance with external reference in wizard, SKILL.md
|
|
13
|
+
- 2 release consistency tests (package.json ↔ CHANGELOG ↔ SDLC.md version parity)
|
|
14
|
+
|
|
15
|
+
## [1.19.0] - 2026-03-31
|
|
16
|
+
|
|
17
|
+
### Added
|
|
18
|
+
- CI Local Shepherd Model — two-tier CI fix model (shepherd primary, bot fallback), SHA-based suppression (#36)
|
|
19
|
+
- Gap Analysis vs `/claude-automation-recommender` — complementary tools positioning (#35)
|
|
20
|
+
- `/clear` vs `/compact` context management guidance (#38)
|
|
21
|
+
- Token efficiency auditing — `/cost`, `--max-budget-usd`, OpenTelemetry (#42)
|
|
22
|
+
- Blank repo support — verified clean install, 10 new E2E tests (#31)
|
|
23
|
+
- Feature documentation enforcement — ADR guidance, `claude-md-improver`, doc sync in SDLC (#43)
|
|
24
|
+
- Setup skill description trimmed to 199 chars (v2.1.86 caps at 250)
|
|
25
|
+
|
|
7
26
|
## [1.18.0] - 2026-03-30
|
|
8
27
|
|
|
9
28
|
### Added
|
|
@@ -93,8 +93,10 @@ This prevents both false positives (crying wolf) and false negatives (missing re
|
|
|
93
93
|
|
|
94
94
|
**How We Apply This:**
|
|
95
95
|
- Weekly workflow tests new Claude Code versions before recommending upgrade
|
|
96
|
+
- Version-pinned gate: installs the specific CC version and passes it via `path_to_claude_code_executable` so E2E actually runs the new binary
|
|
96
97
|
- Phase A: Does new CC version break SDLC enforcement?
|
|
97
98
|
- Phase B: Do changelog-suggested improvements actually help?
|
|
99
|
+
- Green CI = safe to upgrade. Red = stay on current version until fixed
|
|
98
100
|
- Results shown in PR with statistical confidence
|
|
99
101
|
|
|
100
102
|
---
|
|
@@ -209,6 +211,8 @@ When Anthropic provides official plugins or tools that handle something:
|
|
|
209
211
|
| **Claude Code v2.1.69+** | Required for InstructionsLoaded hook, skill directory variable, and Tasks system |
|
|
210
212
|
| **Git repository** | Files should be committed for team sharing |
|
|
211
213
|
|
|
214
|
+
**Blank repos (no CLAUDE.md, no code):** The wizard works on empty repos. Run `npx agentic-sdlc-wizard init` — it installs hooks, skills, and the wizard doc. On first session, the hooks detect missing SDLC files and redirect to `/setup-wizard`, which generates CLAUDE.md, SDLC.md, TESTING.md, and ARCHITECTURE.md interactively. You do NOT need to run Claude's built-in `/init` first — the setup wizard handles everything.
|
|
215
|
+
|
|
212
216
|
---
|
|
213
217
|
|
|
214
218
|
## Recommended Effort Level
|
|
@@ -644,6 +648,33 @@ Security review depth should match your project's risk profile. During wizard se
|
|
|
644
648
|
|
|
645
649
|
---
|
|
646
650
|
|
|
651
|
+
## Context Management: `/clear` vs `/compact`
|
|
652
|
+
|
|
653
|
+
Two tools for managing context — use the right one:
|
|
654
|
+
|
|
655
|
+
| | `/compact` | `/clear` |
|
|
656
|
+
|---|---|---|
|
|
657
|
+
| **What it does** | Summarizes conversation, frees space | Resets conversation entirely |
|
|
658
|
+
| **When to use** | Continuing same task, need more room | Switching to an unrelated task |
|
|
659
|
+
| **Preserves** | Summary of decisions + progress | Nothing (fresh start) |
|
|
660
|
+
| **CLAUDE.md** | Re-loaded from disk | Re-loaded from disk |
|
|
661
|
+
| **Hooks/skills/settings** | Unaffected | Unaffected |
|
|
662
|
+
| **Task list** | Persists | Cleared |
|
|
663
|
+
|
|
664
|
+
**Rules of thumb:**
|
|
665
|
+
- `/compact` between planning and implementation (plan preserved in summary)
|
|
666
|
+
- `/clear` between unrelated tasks (stale context wastes tokens and misleads Claude)
|
|
667
|
+
- `/clear` after 2+ failed corrections on the same issue (context is polluted with bad approaches — start fresh with a better prompt)
|
|
668
|
+
- After committing a PR, `/clear` before starting the next feature
|
|
669
|
+
|
|
670
|
+
**Auto-compact** fires automatically at ~95% context capacity. You don't need to manage this manually — Claude Code handles it. The SDLC skill suggests `/compact` during CI idle time as a "context GC" opportunity.
|
|
671
|
+
|
|
672
|
+
**What survives `/compact`:** Key decisions, code changes, task state (as a summary). What can be lost: detailed early-conversation instructions not in CLAUDE.md, specific file contents read long ago.
|
|
673
|
+
|
|
674
|
+
**Best practice:** Put persistent instructions in CLAUDE.md (survives both `/compact` and `/clear`), not in conversation.
|
|
675
|
+
|
|
676
|
+
---
|
|
677
|
+
|
|
647
678
|
## Example Workflow (End-to-End)
|
|
648
679
|
|
|
649
680
|
Here's what a typical task looks like with this system:
|
|
@@ -912,21 +943,25 @@ The wizard creates TDD-specific automations that official plugins don't provide:
|
|
|
912
943
|
|
|
913
944
|
### Step 0.3: Additional Recommendations (Optional)
|
|
914
945
|
|
|
915
|
-
After SDLC setup is complete, run
|
|
946
|
+
After SDLC setup is complete, run `/claude-automation-recommender` for stack-specific tooling:
|
|
916
947
|
|
|
917
948
|
```
|
|
918
|
-
|
|
949
|
+
/claude-automation-recommender
|
|
919
950
|
```
|
|
920
951
|
|
|
921
|
-
|
|
922
|
-
- MCP Servers (context7 for docs, Playwright for frontend)
|
|
923
|
-
- Additional hooks (auto-format if Prettier configured)
|
|
924
|
-
- Subagents (security-reviewer if auth code detected)
|
|
952
|
+
**The wizard is an enforcement engine** — it installs working hooks, skills, and process guardrails that run automatically. **The recommender is a suggestion engine** — it analyzes your codebase and suggests additional automations you might want. They're complementary:
|
|
925
953
|
|
|
926
|
-
|
|
927
|
-
|
|
954
|
+
| Category | Wizard Ships | Recommender Suggests |
|
|
955
|
+
|----------|-------------|---------------------|
|
|
956
|
+
| SDLC process (TDD, planning, review) | Enforced via hooks + skills | Not covered |
|
|
957
|
+
| CI workflows (self-heal, PR review) | Templates + docs | Not covered |
|
|
958
|
+
| MCP servers (context7, Playwright, DB) | Not covered | Per-stack suggestions |
|
|
959
|
+
| Auto-formatting hooks (Prettier, ESLint) | Not covered | Per-stack suggestions |
|
|
960
|
+
| Type-checking hooks (tsc, mypy) | Not covered | Per-stack suggestions |
|
|
961
|
+
| Subagent templates (code-reviewer, etc.) | Cross-model review only | 8 templates |
|
|
962
|
+
| Plugin recommendations (LSPs, etc.) | Not covered | Per-stack suggestions |
|
|
928
963
|
|
|
929
|
-
|
|
964
|
+
The recommender's suggestions are additive — they don't replace the wizard's TDD hooks or SDLC enforcement.
|
|
930
965
|
|
|
931
966
|
### Git Workflow Preference
|
|
932
967
|
|
|
@@ -1660,7 +1695,7 @@ TodoWrite([
|
|
|
1660
1695
|
{ content: "Present approach + STATE CONFIDENCE LEVEL", status: "pending", activeForm: "Presenting approach" },
|
|
1661
1696
|
{ content: "Signal ready - user exits plan mode", status: "pending", activeForm: "Awaiting plan approval" },
|
|
1662
1697
|
// TRANSITION PHASE (After plan mode, before compact)
|
|
1663
|
-
{ content: "
|
|
1698
|
+
{ content: "Doc sync: update feature docs if code change contradicts or extends documented behavior", status: "pending", activeForm: "Syncing feature docs" },
|
|
1664
1699
|
{ content: "Request /compact before TDD", status: "pending", activeForm: "Requesting compact" },
|
|
1665
1700
|
// IMPLEMENTATION PHASE (After compact)
|
|
1666
1701
|
{ content: "TDD RED: Write failing test FIRST", status: "pending", activeForm: "Writing failing test" },
|
|
@@ -1701,7 +1736,7 @@ TodoWrite([
|
|
|
1701
1736
|
|
|
1702
1737
|
**Workflow:**
|
|
1703
1738
|
1. **Plan Mode** (editing blocked): Research → Write plan file → Present approach + confidence
|
|
1704
|
-
2. **Transition** (after approval):
|
|
1739
|
+
2. **Transition** (after approval): Doc sync (update feature docs if code contradicts/extends them) → Request /compact
|
|
1705
1740
|
3. **Implementation** (after compact): TDD RED → GREEN → PASS
|
|
1706
1741
|
|
|
1707
1742
|
**Before TDD, MUST ask:** "Docs updated. Run `/compact` before implementation?"
|
|
@@ -1898,7 +1933,7 @@ Debug it. Find root cause. Fix it properly. Tests ARE code.
|
|
|
1898
1933
|
|
|
1899
1934
|
## Flaky Test Prevention
|
|
1900
1935
|
|
|
1901
|
-
**Flaky tests are bugs. Period.** They erode trust in the test suite, slow down teams, and mask real regressions.
|
|
1936
|
+
**Flaky tests are bugs. Period.** They erode trust in the test suite, slow down teams, and mask real regressions. For a deep dive, see: [How do you Address and Prevent Flaky Tests?](https://softwareautomation.notion.site/How-do-you-Address-and-Prevent-Flaky-Tests-23c539e19b3c46eeb655642b95237dc0)
|
|
1902
1937
|
|
|
1903
1938
|
### Principles
|
|
1904
1939
|
|
|
@@ -1926,7 +1961,9 @@ Sometimes the flakiness is genuinely in CI infrastructure (runner environment, G
|
|
|
1926
1961
|
- **Keep quality gates strict** — the actual pass/fail decision must NOT have `continue-on-error`
|
|
1927
1962
|
- **Separate "fail the build" from "nice to have"** — a missing PR comment is not a regression
|
|
1928
1963
|
|
|
1929
|
-
## CI Feedback Loop (After Commit)
|
|
1964
|
+
## CI Feedback Loop — Local Shepherd (After Commit)
|
|
1965
|
+
|
|
1966
|
+
**This is the "local shepherd" — the primary CI fix mechanism.** It runs in your active session with full context. The optional CI Auto-Fix bot (`.github/workflows/ci-autofix.yml`) is a fallback for unattended PRs only. When both are active, the bot detects your local pushes via SHA comparison and skips automatically.
|
|
1930
1967
|
|
|
1931
1968
|
**The SDLC doesn't end at local tests.** CI must pass too.
|
|
1932
1969
|
|
|
@@ -1972,7 +2009,7 @@ Local tests pass -> Commit -> Push -> Watch CI
|
|
|
1972
2009
|
- Flaky? Investigate - flakiness is a bug
|
|
1973
2010
|
- Stuck? ASK USER
|
|
1974
2011
|
|
|
1975
|
-
## CI Review Feedback Loop (After CI Passes)
|
|
2012
|
+
## CI Review Feedback Loop — Local Shepherd (After CI Passes)
|
|
1976
2013
|
|
|
1977
2014
|
**CI passing isn't the end.** If CI includes a code reviewer, read and address its suggestions.
|
|
1978
2015
|
|
|
@@ -2004,6 +2041,25 @@ CI passes -> Read review suggestions
|
|
|
2004
2041
|
- **Ask first**: Present suggestions to user, let them decide which to implement
|
|
2005
2042
|
- **Skip review feedback**: Ignore CI review suggestions, only fix CI failures
|
|
2006
2043
|
|
|
2044
|
+
## Shepherd vs. Bot: Two-Tier CI Fix Model
|
|
2045
|
+
|
|
2046
|
+
| Aspect | Local Shepherd | CI Auto-Fix Bot |
|
|
2047
|
+
|--------|---------------|-----------------|
|
|
2048
|
+
| **When** | Active session (you're working) | Unattended (pushed and walked away) |
|
|
2049
|
+
| **Context** | Full: codebase, conversation, intent | Minimal: `--bare`, 200-line truncated logs |
|
|
2050
|
+
| **Cost** | Session tokens (marginal cost ~$0) | Separate API calls ($0.50-$2.00 per fix) |
|
|
2051
|
+
| **Noise** | 0 extra commits | 1+ `[autofix N/M]` commits per attempt |
|
|
2052
|
+
| **Quality** | High: full diagnosis, targeted fix | Lower: stateless, may repeat same approach |
|
|
2053
|
+
| **Speed** | Immediate: fix locally, push once | Delayed: workflow_run trigger + runner queue |
|
|
2054
|
+
| **Deconfliction** | N/A (is the primary) | SHA check: skips if branch advanced since failure |
|
|
2055
|
+
|
|
2056
|
+
**The shepherd is the default.** It runs as part of the SDLC checklist above whenever you push from an active session. The bot is optional and only adds value for:
|
|
2057
|
+
- Dependabot/Renovate PRs (no human session)
|
|
2058
|
+
- PRs where you push and walk away
|
|
2059
|
+
- Overnight CI runs
|
|
2060
|
+
|
|
2061
|
+
If you set up the bot, the SHA-based suppression ensures they never conflict.
|
|
2062
|
+
|
|
2007
2063
|
## DRY Principle
|
|
2008
2064
|
|
|
2009
2065
|
**Before coding:** "What patterns exist I can reuse?"
|
|
@@ -2266,7 +2322,7 @@ If deployment fails or post-deploy verification catches issues:
|
|
|
2266
2322
|
|
|
2267
2323
|
**SDLC.md:**
|
|
2268
2324
|
```markdown
|
|
2269
|
-
<!-- SDLC Wizard Version: 1.
|
|
2325
|
+
<!-- SDLC Wizard Version: 1.20.0 -->
|
|
2270
2326
|
<!-- Setup Date: [DATE] -->
|
|
2271
2327
|
<!-- Completed Steps: step-0.1, step-0.2, step-0.4, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
|
|
2272
2328
|
<!-- Git Workflow: [PRs or Solo] -->
|
|
@@ -2577,9 +2633,17 @@ Want me to file these? (yes/no/not now)
|
|
|
2577
2633
|
|
|
2578
2634
|
## Going Further
|
|
2579
2635
|
|
|
2580
|
-
###
|
|
2636
|
+
### Feature Documentation
|
|
2581
2637
|
|
|
2582
|
-
|
|
2638
|
+
Keep feature docs alongside code. Three patterns, use what fits:
|
|
2639
|
+
|
|
2640
|
+
| Pattern | When to Use | Example |
|
|
2641
|
+
|---------|-------------|---------|
|
|
2642
|
+
| `*_PLAN.md` / `*_DOCS.md` | Per-feature living docs | `AUTH_DOCS.md`, `PAYMENTS_PLAN.md` |
|
|
2643
|
+
| `docs/decisions/NNN-title.md` (ADR) | Architecture decisions that need rationale | `docs/decisions/001-use-postgres.md` |
|
|
2644
|
+
| `docs/features/name.md` | Feature docs in a `docs/` directory | `docs/features/auth.md` |
|
|
2645
|
+
|
|
2646
|
+
**Feature doc template:**
|
|
2583
2647
|
|
|
2584
2648
|
```markdown
|
|
2585
2649
|
# Feature Name
|
|
@@ -2597,7 +2661,36 @@ Things that can trip you up.
|
|
|
2597
2661
|
What's planned but not done.
|
|
2598
2662
|
```
|
|
2599
2663
|
|
|
2600
|
-
|
|
2664
|
+
**ADR (Architecture Decision Record) template** — for decisions that need context:
|
|
2665
|
+
|
|
2666
|
+
```markdown
|
|
2667
|
+
# ADR-NNN: Decision Title
|
|
2668
|
+
|
|
2669
|
+
## Status
|
|
2670
|
+
Accepted | Superseded by ADR-NNN | Deprecated
|
|
2671
|
+
|
|
2672
|
+
## Context
|
|
2673
|
+
What is the problem? What forces are at play?
|
|
2674
|
+
|
|
2675
|
+
## Decision
|
|
2676
|
+
What did we decide and why?
|
|
2677
|
+
|
|
2678
|
+
## Consequences
|
|
2679
|
+
What are the trade-offs? What becomes easier/harder?
|
|
2680
|
+
```
|
|
2681
|
+
|
|
2682
|
+
Store ADRs in `docs/decisions/`. Number sequentially. Claude reads these during planning to understand why things are built the way they are.
|
|
2683
|
+
|
|
2684
|
+
**Keeping docs in sync with code:**
|
|
2685
|
+
|
|
2686
|
+
Docs drift when code changes but docs don't. The SDLC skill's planning phase detects this:
|
|
2687
|
+
|
|
2688
|
+
- During planning, Claude reads feature docs for the area being changed
|
|
2689
|
+
- If the code change contradicts what the doc says, Claude updates the doc
|
|
2690
|
+
- The "After Session" step routes learnings to the right doc
|
|
2691
|
+
- Stale docs cause low confidence — if Claude struggles, the doc may need updating
|
|
2692
|
+
|
|
2693
|
+
**CLAUDE.md health:** Run `/claude-md-improver` periodically (quarterly or after major changes). It audits CLAUDE.md specifically — structure, clarity, completeness (6 criteria, 100-point rubric). It does NOT cover feature docs, TESTING.md, or ADRs — the SDLC workflow handles those.
|
|
2601
2694
|
|
|
2602
2695
|
### Expand TESTING.md
|
|
2603
2696
|
|
|
@@ -2621,6 +2714,10 @@ Add project-specific guidance to skills:
|
|
|
2621
2714
|
- Preferred patterns
|
|
2622
2715
|
- Architecture decisions
|
|
2623
2716
|
|
|
2717
|
+
### Complementary Tools
|
|
2718
|
+
|
|
2719
|
+
The wizard handles SDLC process enforcement. For stack-specific tooling, run `/claude-automation-recommender` — it suggests MCP servers, formatting hooks, type-checking hooks, subagent templates, and plugins based on your detected tech stack. See [Step 0.3](#step-03-additional-recommendations-optional) for the full comparison.
|
|
2720
|
+
|
|
2624
2721
|
---
|
|
2625
2722
|
|
|
2626
2723
|
## Testing AI Apps: What's Different
|
|
@@ -2688,6 +2785,49 @@ _Sources: [Confident AI](https://www.confident-ai.com/blog/llm-testing-in-2024-t
|
|
|
2688
2785
|
|
|
2689
2786
|
---
|
|
2690
2787
|
|
|
2788
|
+
## Token Efficiency
|
|
2789
|
+
|
|
2790
|
+
Practical techniques to reduce token consumption without sacrificing quality.
|
|
2791
|
+
|
|
2792
|
+
### Monitor Costs
|
|
2793
|
+
|
|
2794
|
+
| Tool | What It Shows | When to Use |
|
|
2795
|
+
|------|---------------|-------------|
|
|
2796
|
+
| `/cost` | Session total: USD, API time, code changes | After a session to review spend |
|
|
2797
|
+
| `/context` | What's consuming context window space | When hitting context limits |
|
|
2798
|
+
| Status line | Real-time `cost.total_cost_usd` + token counts | Continuous monitoring |
|
|
2799
|
+
|
|
2800
|
+
### Reduce Consumption
|
|
2801
|
+
|
|
2802
|
+
| Technique | Savings | How |
|
|
2803
|
+
|-----------|---------|-----|
|
|
2804
|
+
| `/compact` between phases | ~40-60% context | Plan → compact → implement (plan preserved) |
|
|
2805
|
+
| `/clear` between tasks | 100% context reset | No stale context from prior work |
|
|
2806
|
+
| Delegate verbose ops to subagents | Separate context | `Agent` tool returns summary, not full output |
|
|
2807
|
+
| Use skills for on-demand knowledge | Smaller base context | Skills load only when invoked |
|
|
2808
|
+
| Scope investigations narrowly | Fewer tokens read | "investigate auth module" > "investigate codebase" |
|
|
2809
|
+
| `--effort low` for simple tasks | ~50% thinking tokens | Simple renames, config changes |
|
|
2810
|
+
|
|
2811
|
+
### CI Cost Control
|
|
2812
|
+
|
|
2813
|
+
Add `--max-budget-usd` to CI workflows as a safety net:
|
|
2814
|
+
|
|
2815
|
+
```yaml
|
|
2816
|
+
claude_args: "--max-budget-usd 5.00 --max-turns 30"
|
|
2817
|
+
```
|
|
2818
|
+
|
|
2819
|
+
| Flag | Purpose |
|
|
2820
|
+
|------|---------|
|
|
2821
|
+
| `--max-budget-usd` | Hard dollar cap per CI invocation |
|
|
2822
|
+
| `--max-turns` | Limit agentic turns (prevents infinite loops) |
|
|
2823
|
+
| `--effort` | `low`/`medium`/`high` controls thinking depth |
|
|
2824
|
+
|
|
2825
|
+
### Advanced: OpenTelemetry
|
|
2826
|
+
|
|
2827
|
+
For organization-wide cost tracking, enable `CLAUDE_CODE_ENABLE_TELEMETRY=1`. This exports per-request `cost_usd`, `input_tokens`, `output_tokens` to any OTLP-compatible backend (Datadog, Honeycomb, Prometheus).
|
|
2828
|
+
|
|
2829
|
+
---
|
|
2830
|
+
|
|
2691
2831
|
## CI/CD Gotchas
|
|
2692
2832
|
|
|
2693
2833
|
Common pitfalls when automating AI-assisted development workflows.
|
|
@@ -2749,7 +2889,9 @@ Claude: [fetches via gh api, discusses with you interactively]
|
|
|
2749
2889
|
|
|
2750
2890
|
This is optional - skip if you prefer fresh reviews only.
|
|
2751
2891
|
|
|
2752
|
-
### CI Auto-Fix Loop (Optional)
|
|
2892
|
+
### CI Auto-Fix Loop (Optional — Bot Fallback)
|
|
2893
|
+
|
|
2894
|
+
> **Two-tier model:** The SDLC skill's CI loops (above) are the "local shepherd" — they handle CI fixes during active sessions. This bot is the second tier: an unattended fallback for when no one is watching. The bot includes SHA-based suppression — if you push a fix locally before the bot runs, it skips automatically.
|
|
2753
2895
|
|
|
2754
2896
|
Automatically fix CI failures and PR review findings. Claude reads the error context, fixes the code, commits, and re-triggers CI. Loops until CI passes AND review has no findings at your chosen level, or max retries hit.
|
|
2755
2897
|
|
|
@@ -3079,21 +3221,19 @@ Claude reads the CHANGELOG to show you what's new **before** applying anything.
|
|
|
3079
3221
|
```
|
|
3080
3222
|
Claude: "Fetching CHANGELOG to check for updates..."
|
|
3081
3223
|
|
|
3082
|
-
Your version:
|
|
3083
|
-
Latest version:
|
|
3224
|
+
Your version: X.Y.0
|
|
3225
|
+
Latest version: X.Z.0
|
|
3084
3226
|
|
|
3085
|
-
What's new since
|
|
3086
|
-
-
|
|
3087
|
-
-
|
|
3088
|
-
|
|
3089
|
-
- v1.10.0: "Prove It's Better" CI automation
|
|
3090
|
-
- v1.9.0: Workflow consolidation (6 → 5 workflows)
|
|
3227
|
+
What's new since X.Y.0:
|
|
3228
|
+
- vX.Z.0: Latest features and improvements
|
|
3229
|
+
- vX.Y+1.0: Previous version changes
|
|
3230
|
+
(... entries from CHANGELOG between your version and latest ...)
|
|
3091
3231
|
|
|
3092
3232
|
Now checking your setup against latest wizard...
|
|
3093
3233
|
|
|
3094
3234
|
✓ Hooks - up to date
|
|
3095
3235
|
✓ Skills - content differs (update available)
|
|
3096
|
-
✗ step-update-notify - NOT DONE (new in
|
|
3236
|
+
✗ step-update-notify - NOT DONE (new in vX.Z.0, optional)
|
|
3097
3237
|
|
|
3098
3238
|
Summary:
|
|
3099
3239
|
- 1 file update available (SDLC skill)
|
|
@@ -3109,7 +3249,7 @@ Walk through updates? (y/n)
|
|
|
3109
3249
|
Store wizard state in `SDLC.md` as metadata comments (invisible to readers, parseable by Claude):
|
|
3110
3250
|
|
|
3111
3251
|
```markdown
|
|
3112
|
-
<!-- SDLC Wizard Version: 1.
|
|
3252
|
+
<!-- SDLC Wizard Version: 1.20.0 -->
|
|
3113
3253
|
<!-- Setup Date: 2026-01-24 -->
|
|
3114
3254
|
<!-- Completed Steps: step-0.1, step-0.2, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
|
|
3115
3255
|
<!-- Git Workflow: PRs -->
|
|
@@ -26,7 +26,7 @@ TodoWrite([
|
|
|
26
26
|
{ content: "Present approach + STATE CONFIDENCE LEVEL", status: "pending", activeForm: "Presenting approach" },
|
|
27
27
|
{ content: "Signal ready - user exits plan mode", status: "pending", activeForm: "Awaiting plan approval" },
|
|
28
28
|
// TRANSITION PHASE (After plan mode)
|
|
29
|
-
{ content: "
|
|
29
|
+
{ content: "Doc sync: update feature docs if code change contradicts or extends documented behavior", status: "pending", activeForm: "Syncing feature docs" },
|
|
30
30
|
// IMPLEMENTATION PHASE
|
|
31
31
|
{ content: "TDD RED: Write failing test FIRST", status: "pending", activeForm: "Writing failing test" },
|
|
32
32
|
{ content: "TDD GREEN: Implement, verify test passes", status: "pending", activeForm: "Implementing feature" },
|
|
@@ -280,6 +280,8 @@ Everything else needs integration tests.
|
|
|
280
280
|
|
|
281
281
|
## Flaky Test Recovery
|
|
282
282
|
|
|
283
|
+
**Flaky tests are bugs. Period.** See: [How do you Address and Prevent Flaky Tests?](https://softwareautomation.notion.site/How-do-you-Address-and-Prevent-Flaky-Tests-23c539e19b3c46eeb655642b95237dc0)
|
|
284
|
+
|
|
283
285
|
When a test fails intermittently:
|
|
284
286
|
1. **Don't dismiss it** — "flaky" means "bug we haven't found yet"
|
|
285
287
|
2. **Identify the layer** — test code? app code? environment?
|
|
@@ -333,7 +335,9 @@ If tests fail:
|
|
|
333
335
|
|
|
334
336
|
Debug it. Find root cause. Fix it properly. Tests ARE code.
|
|
335
337
|
|
|
336
|
-
## CI Feedback Loop (After Commit)
|
|
338
|
+
## CI Feedback Loop — Local Shepherd (After Commit)
|
|
339
|
+
|
|
340
|
+
**This is the "local shepherd" — the primary CI fix mechanism.** It runs in your active session with full context. The optional CI Auto-Fix bot (`.github/workflows/ci-autofix.yml`) is a fallback for unattended PRs only. When both are active, the bot detects your local pushes via SHA comparison and skips automatically.
|
|
337
341
|
|
|
338
342
|
**The SDLC doesn't end at local tests.** CI must pass too.
|
|
339
343
|
|
|
@@ -379,7 +383,7 @@ Local tests pass -> Commit -> Push -> Watch CI
|
|
|
379
383
|
- Flaky? Investigate - flakiness is a bug
|
|
380
384
|
- Stuck? ASK USER
|
|
381
385
|
|
|
382
|
-
## CI Review Feedback Loop (After CI Passes)
|
|
386
|
+
## CI Review Feedback Loop — Local Shepherd (After CI Passes)
|
|
383
387
|
|
|
384
388
|
**CI passing isn't the end.** If CI includes a code reviewer, read and address its suggestions.
|
|
385
389
|
|
|
@@ -411,6 +415,14 @@ CI passes -> Read review suggestions
|
|
|
411
415
|
- **Ask first**: Present suggestions to user, let them decide which to implement
|
|
412
416
|
- **Skip review feedback**: Ignore CI review suggestions, only fix CI failures
|
|
413
417
|
|
|
418
|
+
## Context Management
|
|
419
|
+
|
|
420
|
+
- `/compact` between planning and implementation (plan preserved in summary)
|
|
421
|
+
- `/clear` between unrelated tasks (stale context wastes tokens and misleads)
|
|
422
|
+
- `/clear` after 2+ failed corrections (context polluted — start fresh with better prompt)
|
|
423
|
+
- Auto-compact fires at ~95% capacity — no manual management needed
|
|
424
|
+
- After committing a PR, `/clear` before starting the next feature
|
|
425
|
+
|
|
414
426
|
## DRY Principle
|
|
415
427
|
|
|
416
428
|
**Before coding:** "What patterns exist I can reuse?"
|
|
@@ -480,11 +492,25 @@ CI passes -> Read review suggestions
|
|
|
480
492
|
|
|
481
493
|
**THE RULE:** Delete old code first. If it breaks, fix it properly.
|
|
482
494
|
|
|
495
|
+
## Documentation Sync (During Planning)
|
|
496
|
+
|
|
497
|
+
When a code change affects a documented feature, update the doc in the same PR:
|
|
498
|
+
|
|
499
|
+
1. **During planning**, read feature docs for the area being changed (`*_PLAN.md`, `*_DOCS.md`, `docs/features/`, `docs/decisions/`)
|
|
500
|
+
2. If your code change contradicts what the doc says → update the doc
|
|
501
|
+
3. If your code change extends behavior the doc describes → add to the doc
|
|
502
|
+
4. If no feature doc exists and the change is substantial → note it in the summary (don't create one unprompted)
|
|
503
|
+
|
|
504
|
+
**Doc staleness signals:** Low confidence in an area often means the docs are stale, missing, or misleading. If you struggle during planning, check whether the docs match the actual code.
|
|
505
|
+
|
|
506
|
+
**CLAUDE.md health:** `/claude-md-improver` audits CLAUDE.md structure and completeness. Run it periodically. It does NOT cover feature docs — the SDLC workflow handles those.
|
|
507
|
+
|
|
483
508
|
## After Session (Capture Learnings)
|
|
484
509
|
|
|
485
510
|
If this session revealed insights, update the right place:
|
|
486
511
|
- **Testing patterns, gotchas** → `TESTING.md`
|
|
487
|
-
- **Feature-specific quirks** → Feature docs (`*_PLAN.md`)
|
|
512
|
+
- **Feature-specific quirks** → Feature docs (`*_PLAN.md`, `*_DOCS.md`)
|
|
513
|
+
- **Architecture decisions** → `docs/decisions/` (ADR format) or `ARCHITECTURE.md`
|
|
488
514
|
- **General project context** → `CLAUDE.md` (or `/revise-claude-md`)
|
|
489
515
|
|
|
490
516
|
---
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: setup-wizard
|
|
3
|
-
description:
|
|
3
|
+
description: Setup wizard — scans codebase, asks 16 config questions, generates SDLC files (CLAUDE.md, SDLC.md, TESTING.md, ARCHITECTURE.md), verifies installation. Use for first-time setup or re-running setup.
|
|
4
4
|
argument-hint: [optional: regenerate | verify-only]
|
|
5
5
|
effort: high
|
|
6
6
|
---
|
|
@@ -159,12 +159,14 @@ Run verification checks:
|
|
|
159
159
|
|
|
160
160
|
Report any issues found.
|
|
161
161
|
|
|
162
|
-
### Step 11: Instruct Restart
|
|
162
|
+
### Step 11: Instruct Restart and Next Steps
|
|
163
163
|
|
|
164
164
|
Tell the user:
|
|
165
165
|
> Setup complete. Hooks and settings load at session start.
|
|
166
166
|
> **Exit Claude Code and restart it** for the new configuration to take effect.
|
|
167
167
|
> On restart, the SDLC hook will fire and you'll see the checklist in every response.
|
|
168
|
+
>
|
|
169
|
+
> **Optional next step:** Run `/claude-automation-recommender` for stack-specific tooling suggestions (MCP servers, formatting hooks, type-checking hooks, plugins). These are complementary to the SDLC wizard — they add per-stack tooling, not process enforcement.
|
|
168
170
|
|
|
169
171
|
## Rules
|
|
170
172
|
|
|
@@ -46,12 +46,12 @@ Parse all CHANGELOG entries between the user's installed version and the latest.
|
|
|
46
46
|
|
|
47
47
|
```
|
|
48
48
|
Installed: 1.15.0
|
|
49
|
-
Latest: 1.
|
|
49
|
+
Latest: 1.20.0
|
|
50
50
|
|
|
51
51
|
What changed:
|
|
52
|
+
- [1.20.0] Version-pinned CC update gate, Tier 1 flakiness fix, flaky test guidance, ...
|
|
53
|
+
- [1.19.0] CI shepherd model, token efficiency, feature doc enforcement, ...
|
|
52
54
|
- [1.18.0] Added /update-wizard skill, ...
|
|
53
|
-
- [1.17.0] Consolidated /testing into /sdlc, ...
|
|
54
|
-
- [1.16.0] Cross-model review protocol, ...
|
|
55
55
|
```
|
|
56
56
|
|
|
57
57
|
**If versions match:** Say "You're up to date! (version X.X.X)" and stop.
|