agentic-sdlc-wizard 1.18.0 → 1.20.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -4,6 +4,25 @@ All notable changes to the SDLC Wizard.
4
4
 
5
5
  > **Note:** This changelog is for humans to read. Don't manually apply these changes - just run the wizard ("Check for SDLC wizard updates") and it handles everything automatically.
6
6
 
7
+ ## [1.20.0] - 2026-03-31
8
+
9
+ ### Added
10
+ - CC version-pinned update gate — E2E tests run actual new CC version, not bundled binary (#46)
11
+ - Tier 1 E2E flakiness fix — regression threshold 1.5→3.0, absorbs ±2-3 point LLM variance (#47)
12
+ - Flaky test prevention guidance with external reference in wizard, SKILL.md
13
+ - 2 release consistency tests (package.json ↔ CHANGELOG ↔ SDLC.md version parity)
14
+
15
+ ## [1.19.0] - 2026-03-31
16
+
17
+ ### Added
18
+ - CI Local Shepherd Model — two-tier CI fix model (shepherd primary, bot fallback), SHA-based suppression (#36)
19
+ - Gap Analysis vs `/claude-automation-recommender` — complementary tools positioning (#35)
20
+ - `/clear` vs `/compact` context management guidance (#38)
21
+ - Token efficiency auditing — `/cost`, `--max-budget-usd`, OpenTelemetry (#42)
22
+ - Blank repo support — verified clean install, 10 new E2E tests (#31)
23
+ - Feature documentation enforcement — ADR guidance, `claude-md-improver`, doc sync in SDLC (#43)
24
+ - Setup skill description trimmed to 199 chars (v2.1.86 caps at 250)
25
+
7
26
  ## [1.18.0] - 2026-03-30
8
27
 
9
28
  ### Added
@@ -93,8 +93,10 @@ This prevents both false positives (crying wolf) and false negatives (missing re
93
93
 
94
94
  **How We Apply This:**
95
95
  - Weekly workflow tests new Claude Code versions before recommending upgrade
96
+ - Version-pinned gate: installs the specific CC version and passes it via `path_to_claude_code_executable` so E2E actually runs the new binary
96
97
  - Phase A: Does new CC version break SDLC enforcement?
97
98
  - Phase B: Do changelog-suggested improvements actually help?
99
+ - Green CI = safe to upgrade. Red = stay on current version until fixed
98
100
  - Results shown in PR with statistical confidence
99
101
 
100
102
  ---
@@ -209,6 +211,8 @@ When Anthropic provides official plugins or tools that handle something:
209
211
  | **Claude Code v2.1.69+** | Required for InstructionsLoaded hook, skill directory variable, and Tasks system |
210
212
  | **Git repository** | Files should be committed for team sharing |
211
213
 
214
+ **Blank repos (no CLAUDE.md, no code):** The wizard works on empty repos. Run `npx agentic-sdlc-wizard init` — it installs hooks, skills, and the wizard doc. On first session, the hooks detect missing SDLC files and redirect to `/setup-wizard`, which generates CLAUDE.md, SDLC.md, TESTING.md, and ARCHITECTURE.md interactively. You do NOT need to run Claude's built-in `/init` first — the setup wizard handles everything.
215
+
212
216
  ---
213
217
 
214
218
  ## Recommended Effort Level
@@ -644,6 +648,33 @@ Security review depth should match your project's risk profile. During wizard se
644
648
 
645
649
  ---
646
650
 
651
+ ## Context Management: `/clear` vs `/compact`
652
+
653
+ Two tools for managing context — use the right one:
654
+
655
+ | | `/compact` | `/clear` |
656
+ |---|---|---|
657
+ | **What it does** | Summarizes conversation, frees space | Resets conversation entirely |
658
+ | **When to use** | Continuing same task, need more room | Switching to an unrelated task |
659
+ | **Preserves** | Summary of decisions + progress | Nothing (fresh start) |
660
+ | **CLAUDE.md** | Re-loaded from disk | Re-loaded from disk |
661
+ | **Hooks/skills/settings** | Unaffected | Unaffected |
662
+ | **Task list** | Persists | Cleared |
663
+
664
+ **Rules of thumb:**
665
+ - `/compact` between planning and implementation (plan preserved in summary)
666
+ - `/clear` between unrelated tasks (stale context wastes tokens and misleads Claude)
667
+ - `/clear` after 2+ failed corrections on the same issue (context is polluted with bad approaches — start fresh with a better prompt)
668
+ - After committing a PR, `/clear` before starting the next feature
669
+
670
+ **Auto-compact** fires automatically at ~95% context capacity. You don't need to manage this manually — Claude Code handles it. The SDLC skill suggests `/compact` during CI idle time as a "context GC" opportunity.
671
+
672
+ **What survives `/compact`:** Key decisions, code changes, task state (as a summary). What can be lost: detailed early-conversation instructions not in CLAUDE.md, specific file contents read long ago.
673
+
674
+ **Best practice:** Put persistent instructions in CLAUDE.md (survives both `/compact` and `/clear`), not in conversation.
675
+
676
+ ---
677
+
647
678
  ## Example Workflow (End-to-End)
648
679
 
649
680
  Here's what a typical task looks like with this system:
@@ -912,21 +943,25 @@ The wizard creates TDD-specific automations that official plugins don't provide:
912
943
 
913
944
  ### Step 0.3: Additional Recommendations (Optional)
914
945
 
915
- After SDLC setup is complete, run `claude-code-setup` for additional recommendations:
946
+ After SDLC setup is complete, run `/claude-automation-recommender` for stack-specific tooling:
916
947
 
917
948
  ```
918
- "Based on your codebase, recommend additional automations"
949
+ /claude-automation-recommender
919
950
  ```
920
951
 
921
- This may suggest:
922
- - MCP Servers (context7 for docs, Playwright for frontend)
923
- - Additional hooks (auto-format if Prettier configured)
924
- - Subagents (security-reviewer if auth code detected)
952
+ **The wizard is an enforcement engine** — it installs working hooks, skills, and process guardrails that run automatically. **The recommender is a suggestion engine** — it analyzes your codebase and suggests additional automations you might want. They're complementary:
925
953
 
926
- **Claude prompts for each:**
927
- > "[Detected: Prettier config] Want to add auto-format hook? (y/n)"
954
+ | Category | Wizard Ships | Recommender Suggests |
955
+ |----------|-------------|---------------------|
956
+ | SDLC process (TDD, planning, review) | Enforced via hooks + skills | Not covered |
957
+ | CI workflows (self-heal, PR review) | Templates + docs | Not covered |
958
+ | MCP servers (context7, Playwright, DB) | Not covered | Per-stack suggestions |
959
+ | Auto-formatting hooks (Prettier, ESLint) | Not covered | Per-stack suggestions |
960
+ | Type-checking hooks (tsc, mypy) | Not covered | Per-stack suggestions |
961
+ | Subagent templates (code-reviewer, etc.) | Cross-model review only | 8 templates |
962
+ | Plugin recommendations (LSPs, etc.) | Not covered | Per-stack suggestions |
928
963
 
929
- These are additive—they don't replace our TDD hooks.
964
+ The recommender's suggestions are additive they don't replace the wizard's TDD hooks or SDLC enforcement.
930
965
 
931
966
  ### Git Workflow Preference
932
967
 
@@ -1660,7 +1695,7 @@ TodoWrite([
1660
1695
  { content: "Present approach + STATE CONFIDENCE LEVEL", status: "pending", activeForm: "Presenting approach" },
1661
1696
  { content: "Signal ready - user exits plan mode", status: "pending", activeForm: "Awaiting plan approval" },
1662
1697
  // TRANSITION PHASE (After plan mode, before compact)
1663
- { content: "Update feature docs with discovered gotchas", status: "pending", activeForm: "Updating feature docs" },
1698
+ { content: "Doc sync: update feature docs if code change contradicts or extends documented behavior", status: "pending", activeForm: "Syncing feature docs" },
1664
1699
  { content: "Request /compact before TDD", status: "pending", activeForm: "Requesting compact" },
1665
1700
  // IMPLEMENTATION PHASE (After compact)
1666
1701
  { content: "TDD RED: Write failing test FIRST", status: "pending", activeForm: "Writing failing test" },
@@ -1701,7 +1736,7 @@ TodoWrite([
1701
1736
 
1702
1737
  **Workflow:**
1703
1738
  1. **Plan Mode** (editing blocked): Research → Write plan file → Present approach + confidence
1704
- 2. **Transition** (after approval): Update feature docs → Request /compact
1739
+ 2. **Transition** (after approval): Doc sync (update feature docs if code contradicts/extends them) → Request /compact
1705
1740
  3. **Implementation** (after compact): TDD RED → GREEN → PASS
1706
1741
 
1707
1742
  **Before TDD, MUST ask:** "Docs updated. Run `/compact` before implementation?"
@@ -1898,7 +1933,7 @@ Debug it. Find root cause. Fix it properly. Tests ARE code.
1898
1933
 
1899
1934
  ## Flaky Test Prevention
1900
1935
 
1901
- **Flaky tests are bugs. Period.** They erode trust in the test suite, slow down teams, and mask real regressions.
1936
+ **Flaky tests are bugs. Period.** They erode trust in the test suite, slow down teams, and mask real regressions. For a deep dive, see: [How do you Address and Prevent Flaky Tests?](https://softwareautomation.notion.site/How-do-you-Address-and-Prevent-Flaky-Tests-23c539e19b3c46eeb655642b95237dc0)
1902
1937
 
1903
1938
  ### Principles
1904
1939
 
@@ -1926,7 +1961,9 @@ Sometimes the flakiness is genuinely in CI infrastructure (runner environment, G
1926
1961
  - **Keep quality gates strict** — the actual pass/fail decision must NOT have `continue-on-error`
1927
1962
  - **Separate "fail the build" from "nice to have"** — a missing PR comment is not a regression
1928
1963
 
1929
- ## CI Feedback Loop (After Commit)
1964
+ ## CI Feedback Loop — Local Shepherd (After Commit)
1965
+
1966
+ **This is the "local shepherd" — the primary CI fix mechanism.** It runs in your active session with full context. The optional CI Auto-Fix bot (`.github/workflows/ci-autofix.yml`) is a fallback for unattended PRs only. When both are active, the bot detects your local pushes via SHA comparison and skips automatically.
1930
1967
 
1931
1968
  **The SDLC doesn't end at local tests.** CI must pass too.
1932
1969
 
@@ -1972,7 +2009,7 @@ Local tests pass -> Commit -> Push -> Watch CI
1972
2009
  - Flaky? Investigate - flakiness is a bug
1973
2010
  - Stuck? ASK USER
1974
2011
 
1975
- ## CI Review Feedback Loop (After CI Passes)
2012
+ ## CI Review Feedback Loop — Local Shepherd (After CI Passes)
1976
2013
 
1977
2014
  **CI passing isn't the end.** If CI includes a code reviewer, read and address its suggestions.
1978
2015
 
@@ -2004,6 +2041,25 @@ CI passes -> Read review suggestions
2004
2041
  - **Ask first**: Present suggestions to user, let them decide which to implement
2005
2042
  - **Skip review feedback**: Ignore CI review suggestions, only fix CI failures
2006
2043
 
2044
+ ## Shepherd vs. Bot: Two-Tier CI Fix Model
2045
+
2046
+ | Aspect | Local Shepherd | CI Auto-Fix Bot |
2047
+ |--------|---------------|-----------------|
2048
+ | **When** | Active session (you're working) | Unattended (pushed and walked away) |
2049
+ | **Context** | Full: codebase, conversation, intent | Minimal: `--bare`, 200-line truncated logs |
2050
+ | **Cost** | Session tokens (marginal cost ~$0) | Separate API calls ($0.50-$2.00 per fix) |
2051
+ | **Noise** | 0 extra commits | 1+ `[autofix N/M]` commits per attempt |
2052
+ | **Quality** | High: full diagnosis, targeted fix | Lower: stateless, may repeat same approach |
2053
+ | **Speed** | Immediate: fix locally, push once | Delayed: workflow_run trigger + runner queue |
2054
+ | **Deconfliction** | N/A (is the primary) | SHA check: skips if branch advanced since failure |
2055
+
2056
+ **The shepherd is the default.** It runs as part of the SDLC checklist above whenever you push from an active session. The bot is optional and only adds value for:
2057
+ - Dependabot/Renovate PRs (no human session)
2058
+ - PRs where you push and walk away
2059
+ - Overnight CI runs
2060
+
2061
+ If you set up the bot, the SHA-based suppression ensures they never conflict.
2062
+
2007
2063
  ## DRY Principle
2008
2064
 
2009
2065
  **Before coding:** "What patterns exist I can reuse?"
@@ -2266,7 +2322,7 @@ If deployment fails or post-deploy verification catches issues:
2266
2322
 
2267
2323
  **SDLC.md:**
2268
2324
  ```markdown
2269
- <!-- SDLC Wizard Version: 1.18.0 -->
2325
+ <!-- SDLC Wizard Version: 1.20.0 -->
2270
2326
  <!-- Setup Date: [DATE] -->
2271
2327
  <!-- Completed Steps: step-0.1, step-0.2, step-0.4, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
2272
2328
  <!-- Git Workflow: [PRs or Solo] -->
@@ -2577,9 +2633,17 @@ Want me to file these? (yes/no/not now)
2577
2633
 
2578
2634
  ## Going Further
2579
2635
 
2580
- ### Create Feature Plan Docs
2636
+ ### Feature Documentation
2581
2637
 
2582
- For each major feature, create `FEATURE_NAME_PLAN.md`:
2638
+ Keep feature docs alongside code. Three patterns, use what fits:
2639
+
2640
+ | Pattern | When to Use | Example |
2641
+ |---------|-------------|---------|
2642
+ | `*_PLAN.md` / `*_DOCS.md` | Per-feature living docs | `AUTH_DOCS.md`, `PAYMENTS_PLAN.md` |
2643
+ | `docs/decisions/NNN-title.md` (ADR) | Architecture decisions that need rationale | `docs/decisions/001-use-postgres.md` |
2644
+ | `docs/features/name.md` | Feature docs in a `docs/` directory | `docs/features/auth.md` |
2645
+
2646
+ **Feature doc template:**
2583
2647
 
2584
2648
  ```markdown
2585
2649
  # Feature Name
@@ -2597,7 +2661,36 @@ Things that can trip you up.
2597
2661
  What's planned but not done.
2598
2662
  ```
2599
2663
 
2600
- Claude will read these during planning and update them with discoveries.
2664
+ **ADR (Architecture Decision Record) template** for decisions that need context:
2665
+
2666
+ ```markdown
2667
+ # ADR-NNN: Decision Title
2668
+
2669
+ ## Status
2670
+ Accepted | Superseded by ADR-NNN | Deprecated
2671
+
2672
+ ## Context
2673
+ What is the problem? What forces are at play?
2674
+
2675
+ ## Decision
2676
+ What did we decide and why?
2677
+
2678
+ ## Consequences
2679
+ What are the trade-offs? What becomes easier/harder?
2680
+ ```
2681
+
2682
+ Store ADRs in `docs/decisions/`. Number sequentially. Claude reads these during planning to understand why things are built the way they are.
2683
+
2684
+ **Keeping docs in sync with code:**
2685
+
2686
+ Docs drift when code changes but docs don't. The SDLC skill's planning phase detects this:
2687
+
2688
+ - During planning, Claude reads feature docs for the area being changed
2689
+ - If the code change contradicts what the doc says, Claude updates the doc
2690
+ - The "After Session" step routes learnings to the right doc
2691
+ - Stale docs cause low confidence — if Claude struggles, the doc may need updating
2692
+
2693
+ **CLAUDE.md health:** Run `/claude-md-improver` periodically (quarterly or after major changes). It audits CLAUDE.md specifically — structure, clarity, completeness (6 criteria, 100-point rubric). It does NOT cover feature docs, TESTING.md, or ADRs — the SDLC workflow handles those.
2601
2694
 
2602
2695
  ### Expand TESTING.md
2603
2696
 
@@ -2621,6 +2714,10 @@ Add project-specific guidance to skills:
2621
2714
  - Preferred patterns
2622
2715
  - Architecture decisions
2623
2716
 
2717
+ ### Complementary Tools
2718
+
2719
+ The wizard handles SDLC process enforcement. For stack-specific tooling, run `/claude-automation-recommender` — it suggests MCP servers, formatting hooks, type-checking hooks, subagent templates, and plugins based on your detected tech stack. See [Step 0.3](#step-03-additional-recommendations-optional) for the full comparison.
2720
+
2624
2721
  ---
2625
2722
 
2626
2723
  ## Testing AI Apps: What's Different
@@ -2688,6 +2785,49 @@ _Sources: [Confident AI](https://www.confident-ai.com/blog/llm-testing-in-2024-t
2688
2785
 
2689
2786
  ---
2690
2787
 
2788
+ ## Token Efficiency
2789
+
2790
+ Practical techniques to reduce token consumption without sacrificing quality.
2791
+
2792
+ ### Monitor Costs
2793
+
2794
+ | Tool | What It Shows | When to Use |
2795
+ |------|---------------|-------------|
2796
+ | `/cost` | Session total: USD, API time, code changes | After a session to review spend |
2797
+ | `/context` | What's consuming context window space | When hitting context limits |
2798
+ | Status line | Real-time `cost.total_cost_usd` + token counts | Continuous monitoring |
2799
+
2800
+ ### Reduce Consumption
2801
+
2802
+ | Technique | Savings | How |
2803
+ |-----------|---------|-----|
2804
+ | `/compact` between phases | ~40-60% context | Plan → compact → implement (plan preserved) |
2805
+ | `/clear` between tasks | 100% context reset | No stale context from prior work |
2806
+ | Delegate verbose ops to subagents | Separate context | `Agent` tool returns summary, not full output |
2807
+ | Use skills for on-demand knowledge | Smaller base context | Skills load only when invoked |
2808
+ | Scope investigations narrowly | Fewer tokens read | "investigate auth module" > "investigate codebase" |
2809
+ | `--effort low` for simple tasks | ~50% thinking tokens | Simple renames, config changes |
2810
+
2811
+ ### CI Cost Control
2812
+
2813
+ Add `--max-budget-usd` to CI workflows as a safety net:
2814
+
2815
+ ```yaml
2816
+ claude_args: "--max-budget-usd 5.00 --max-turns 30"
2817
+ ```
2818
+
2819
+ | Flag | Purpose |
2820
+ |------|---------|
2821
+ | `--max-budget-usd` | Hard dollar cap per CI invocation |
2822
+ | `--max-turns` | Limit agentic turns (prevents infinite loops) |
2823
+ | `--effort` | `low`/`medium`/`high` controls thinking depth |
2824
+
2825
+ ### Advanced: OpenTelemetry
2826
+
2827
+ For organization-wide cost tracking, enable `CLAUDE_CODE_ENABLE_TELEMETRY=1`. This exports per-request `cost_usd`, `input_tokens`, `output_tokens` to any OTLP-compatible backend (Datadog, Honeycomb, Prometheus).
2828
+
2829
+ ---
2830
+
2691
2831
  ## CI/CD Gotchas
2692
2832
 
2693
2833
  Common pitfalls when automating AI-assisted development workflows.
@@ -2749,7 +2889,9 @@ Claude: [fetches via gh api, discusses with you interactively]
2749
2889
 
2750
2890
  This is optional - skip if you prefer fresh reviews only.
2751
2891
 
2752
- ### CI Auto-Fix Loop (Optional)
2892
+ ### CI Auto-Fix Loop (Optional — Bot Fallback)
2893
+
2894
+ > **Two-tier model:** The SDLC skill's CI loops (above) are the "local shepherd" — they handle CI fixes during active sessions. This bot is the second tier: an unattended fallback for when no one is watching. The bot includes SHA-based suppression — if you push a fix locally before the bot runs, it skips automatically.
2753
2895
 
2754
2896
  Automatically fix CI failures and PR review findings. Claude reads the error context, fixes the code, commits, and re-triggers CI. Loops until CI passes AND review has no findings at your chosen level, or max retries hit.
2755
2897
 
@@ -3079,21 +3221,19 @@ Claude reads the CHANGELOG to show you what's new **before** applying anything.
3079
3221
  ```
3080
3222
  Claude: "Fetching CHANGELOG to check for updates..."
3081
3223
 
3082
- Your version: 1.8.0
3083
- Latest version: 1.13.0
3224
+ Your version: X.Y.0
3225
+ Latest version: X.Z.0
3084
3226
 
3085
- What's new since 1.8.0:
3086
- - v1.13.0: Self-update improvements, optional CI notification
3087
- - v1.12.0: Full system audit, apply step fixes
3088
- - v1.11.0: Stale output cleanup, error handling
3089
- - v1.10.0: "Prove It's Better" CI automation
3090
- - v1.9.0: Workflow consolidation (6 → 5 workflows)
3227
+ What's new since X.Y.0:
3228
+ - vX.Z.0: Latest features and improvements
3229
+ - vX.Y+1.0: Previous version changes
3230
+ (... entries from CHANGELOG between your version and latest ...)
3091
3231
 
3092
3232
  Now checking your setup against latest wizard...
3093
3233
 
3094
3234
  ✓ Hooks - up to date
3095
3235
  ✓ Skills - content differs (update available)
3096
- ✗ step-update-notify - NOT DONE (new in v1.13.0, optional)
3236
+ ✗ step-update-notify - NOT DONE (new in vX.Z.0, optional)
3097
3237
 
3098
3238
  Summary:
3099
3239
  - 1 file update available (SDLC skill)
@@ -3109,7 +3249,7 @@ Walk through updates? (y/n)
3109
3249
  Store wizard state in `SDLC.md` as metadata comments (invisible to readers, parseable by Claude):
3110
3250
 
3111
3251
  ```markdown
3112
- <!-- SDLC Wizard Version: 1.18.0 -->
3252
+ <!-- SDLC Wizard Version: 1.20.0 -->
3113
3253
  <!-- Setup Date: 2026-01-24 -->
3114
3254
  <!-- Completed Steps: step-0.1, step-0.2, step-1, step-2, step-3, step-4, step-5, step-6, step-7, step-8, step-9 -->
3115
3255
  <!-- Git Workflow: PRs -->
@@ -26,7 +26,7 @@ TodoWrite([
26
26
  { content: "Present approach + STATE CONFIDENCE LEVEL", status: "pending", activeForm: "Presenting approach" },
27
27
  { content: "Signal ready - user exits plan mode", status: "pending", activeForm: "Awaiting plan approval" },
28
28
  // TRANSITION PHASE (After plan mode)
29
- { content: "Update feature docs with discovered gotchas", status: "pending", activeForm: "Updating feature docs" },
29
+ { content: "Doc sync: update feature docs if code change contradicts or extends documented behavior", status: "pending", activeForm: "Syncing feature docs" },
30
30
  // IMPLEMENTATION PHASE
31
31
  { content: "TDD RED: Write failing test FIRST", status: "pending", activeForm: "Writing failing test" },
32
32
  { content: "TDD GREEN: Implement, verify test passes", status: "pending", activeForm: "Implementing feature" },
@@ -280,6 +280,8 @@ Everything else needs integration tests.
280
280
 
281
281
  ## Flaky Test Recovery
282
282
 
283
+ **Flaky tests are bugs. Period.** See: [How do you Address and Prevent Flaky Tests?](https://softwareautomation.notion.site/How-do-you-Address-and-Prevent-Flaky-Tests-23c539e19b3c46eeb655642b95237dc0)
284
+
283
285
  When a test fails intermittently:
284
286
  1. **Don't dismiss it** — "flaky" means "bug we haven't found yet"
285
287
  2. **Identify the layer** — test code? app code? environment?
@@ -333,7 +335,9 @@ If tests fail:
333
335
 
334
336
  Debug it. Find root cause. Fix it properly. Tests ARE code.
335
337
 
336
- ## CI Feedback Loop (After Commit)
338
+ ## CI Feedback Loop — Local Shepherd (After Commit)
339
+
340
+ **This is the "local shepherd" — the primary CI fix mechanism.** It runs in your active session with full context. The optional CI Auto-Fix bot (`.github/workflows/ci-autofix.yml`) is a fallback for unattended PRs only. When both are active, the bot detects your local pushes via SHA comparison and skips automatically.
337
341
 
338
342
  **The SDLC doesn't end at local tests.** CI must pass too.
339
343
 
@@ -379,7 +383,7 @@ Local tests pass -> Commit -> Push -> Watch CI
379
383
  - Flaky? Investigate - flakiness is a bug
380
384
  - Stuck? ASK USER
381
385
 
382
- ## CI Review Feedback Loop (After CI Passes)
386
+ ## CI Review Feedback Loop — Local Shepherd (After CI Passes)
383
387
 
384
388
  **CI passing isn't the end.** If CI includes a code reviewer, read and address its suggestions.
385
389
 
@@ -411,6 +415,14 @@ CI passes -> Read review suggestions
411
415
  - **Ask first**: Present suggestions to user, let them decide which to implement
412
416
  - **Skip review feedback**: Ignore CI review suggestions, only fix CI failures
413
417
 
418
+ ## Context Management
419
+
420
+ - `/compact` between planning and implementation (plan preserved in summary)
421
+ - `/clear` between unrelated tasks (stale context wastes tokens and misleads)
422
+ - `/clear` after 2+ failed corrections (context polluted — start fresh with better prompt)
423
+ - Auto-compact fires at ~95% capacity — no manual management needed
424
+ - After committing a PR, `/clear` before starting the next feature
425
+
414
426
  ## DRY Principle
415
427
 
416
428
  **Before coding:** "What patterns exist I can reuse?"
@@ -480,11 +492,25 @@ CI passes -> Read review suggestions
480
492
 
481
493
  **THE RULE:** Delete old code first. If it breaks, fix it properly.
482
494
 
495
+ ## Documentation Sync (During Planning)
496
+
497
+ When a code change affects a documented feature, update the doc in the same PR:
498
+
499
+ 1. **During planning**, read feature docs for the area being changed (`*_PLAN.md`, `*_DOCS.md`, `docs/features/`, `docs/decisions/`)
500
+ 2. If your code change contradicts what the doc says → update the doc
501
+ 3. If your code change extends behavior the doc describes → add to the doc
502
+ 4. If no feature doc exists and the change is substantial → note it in the summary (don't create one unprompted)
503
+
504
+ **Doc staleness signals:** Low confidence in an area often means the docs are stale, missing, or misleading. If you struggle during planning, check whether the docs match the actual code.
505
+
506
+ **CLAUDE.md health:** `/claude-md-improver` audits CLAUDE.md structure and completeness. Run it periodically. It does NOT cover feature docs — the SDLC workflow handles those.
507
+
483
508
  ## After Session (Capture Learnings)
484
509
 
485
510
  If this session revealed insights, update the right place:
486
511
  - **Testing patterns, gotchas** → `TESTING.md`
487
- - **Feature-specific quirks** → Feature docs (`*_PLAN.md`)
512
+ - **Feature-specific quirks** → Feature docs (`*_PLAN.md`, `*_DOCS.md`)
513
+ - **Architecture decisions** → `docs/decisions/` (ADR format) or `ARCHITECTURE.md`
488
514
  - **General project context** → `CLAUDE.md` (or `/revise-claude-md`)
489
515
 
490
516
  ---
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: setup-wizard
3
- description: Interactive project setup wizard that scans the codebase, asks all 16 configuration questions, generates SDLC files (CLAUDE.md, SDLC.md, TESTING.md, ARCHITECTURE.md), and verifies the installation. Use this skill when setting up the SDLC wizard for the first time or re-running setup.
3
+ description: Setup wizard scans codebase, asks 16 config questions, generates SDLC files (CLAUDE.md, SDLC.md, TESTING.md, ARCHITECTURE.md), verifies installation. Use for first-time setup or re-running setup.
4
4
  argument-hint: [optional: regenerate | verify-only]
5
5
  effort: high
6
6
  ---
@@ -159,12 +159,14 @@ Run verification checks:
159
159
 
160
160
  Report any issues found.
161
161
 
162
- ### Step 11: Instruct Restart
162
+ ### Step 11: Instruct Restart and Next Steps
163
163
 
164
164
  Tell the user:
165
165
  > Setup complete. Hooks and settings load at session start.
166
166
  > **Exit Claude Code and restart it** for the new configuration to take effect.
167
167
  > On restart, the SDLC hook will fire and you'll see the checklist in every response.
168
+ >
169
+ > **Optional next step:** Run `/claude-automation-recommender` for stack-specific tooling suggestions (MCP servers, formatting hooks, type-checking hooks, plugins). These are complementary to the SDLC wizard — they add per-stack tooling, not process enforcement.
168
170
 
169
171
  ## Rules
170
172
 
@@ -46,12 +46,12 @@ Parse all CHANGELOG entries between the user's installed version and the latest.
46
46
 
47
47
  ```
48
48
  Installed: 1.15.0
49
- Latest: 1.18.0
49
+ Latest: 1.20.0
50
50
 
51
51
  What changed:
52
+ - [1.20.0] Version-pinned CC update gate, Tier 1 flakiness fix, flaky test guidance, ...
53
+ - [1.19.0] CI shepherd model, token efficiency, feature doc enforcement, ...
52
54
  - [1.18.0] Added /update-wizard skill, ...
53
- - [1.17.0] Consolidated /testing into /sdlc, ...
54
- - [1.16.0] Cross-model review protocol, ...
55
55
  ```
56
56
 
57
57
  **If versions match:** Say "You're up to date! (version X.X.X)" and stop.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agentic-sdlc-wizard",
3
- "version": "1.18.0",
3
+ "version": "1.20.0",
4
4
  "description": "SDLC enforcement for Claude Code — hooks, skills, and wizard setup in one command",
5
5
  "bin": {
6
6
  "sdlc-wizard": "./cli/bin/sdlc-wizard.js"