npm - @ai-agent-lead/skills - Versions diffs - 1.1.0 → 1.2.0 - Mend

@ai-agent-lead/skills 1.1.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (11) hide show

package/README.md +3 -1
package/package.json +1 -1
package/skills/README.md +18 -15
package/skills/TRIGGERS.md +6 -0
package/skills/WORKFLOWS.md +18 -9
package/skills/caveman/SKILL.md +60 -0
package/skills/code-hygiene/SKILL.md +17 -2
package/skills/formats/STYLE-comments.md +113 -0
package/skills/pr-review/SKILL.md +1 -0
package/skills/simplify/SKILL.md +7 -1
package/skills/tdd-rounds/SKILL.md +14 -1

package/README.md CHANGED Viewed

@@ -2,6 +2,8 @@
 A library of Claude Code skills that encode engineering discipline so Claude follows it by default.
+**Website:** [AgentLead.Dev](https://AgentLead.Dev)
 ## Why
 Out of the box Claude writes code competently, but shipping well needs more than that: investigate before building, design the interface before writing, test-first, review at PR boundaries, and verify before merge.
@@ -19,7 +21,7 @@ The goal: turn engineering discipline from something I have to enforce into some
 You can install all the skills in this repository directly using `npx`:
 ```bash
-npx github:ai-agent-lead/skills
+npx @ai-agent-lead/skills
 ```
 ### Options and Customization

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@ai-agent-lead/skills",
-  "version": "1.1.0",
+  "version": "1.2.0",
   "description": "Install AI agent skills for Claude Code, Codex, Antigravity, and OpenCode",
   "main": "bin/install.js",
   "bin": {

package/skills/README.md CHANGED Viewed

@@ -41,6 +41,7 @@ Grouped by role. Trigger phrases are in each skill's `description` frontmatter.
 | Skill | Trigger | Produces | Location |
 | --- | --- | --- | --- |
+| `sync-check` | Before pr-review or after refactor | Terminology/ADR drift report | [sync-check/](./sync-check/) |
 | `prod-ready` | After tdd green, before merge | Verified pre-merge checklist (incl. doc drift) | [prod-ready/](./prod-ready/) |
 | `security-review` | Surface-changing work — new entry points, identity flows, authz, sensitive data, external deps | Threat model + verified controls; appended to feature doc, or `docs/security/<feature>.md` for high-stakes | [security-review/](./security-review/) |
 | `pr-review` | Reviewing someone else's PR (or self-reviewing before opening) | Structured review with severity-classified findings (blocker / suggestion / nit / question) | [pr-review/](./pr-review/) |
@@ -52,16 +53,16 @@ Orthogonal axis. The trigger-phrase index above tells you *when* a skill fires;
 | Role | Skills | What they have in common |
 | --- | --- | --- |
-| **Doc-producing** (writes a durable artifact under `docs/`) | `feature-doc`, `investigate`, `system-design`, `grill-plan`, `debug` (optional), `security-review` (optional), `verify-real-deps` | Output survives the conversation. The discipline of writing it is the value. |
+| **Doc-producing** (writes a durable artifact under `docs/`) | `feature-doc`, `investigate`, `system-design`, `grill-plan`, `debug` (optional), `security-review` (optional), `verify-real-deps`, `bench`, `bootstrap` | Output survives the conversation. The discipline of writing it is the value. |
 | **Build** (writes code) | `tdd`, `tdd-rounds`, `simplify` | Diff-producing. Always behind a contract (feature doc + ACs). |
-| **Gate** (verifies before merge / tag) | `prod-ready`, `security-review`, `pr-review`, `verify-real-deps` | Pre-merge or pre-tag — refuse to advance until the checklist passes. |
+| **Gate** (verifies before merge / tag) | `prod-ready`, `security-review`, `pr-review`, `verify-real-deps`, `sync-check` | Pre-merge or pre-tag — refuse to advance until the checklist passes. |
 | **Diagnose** (no code, no doc — just analysis) | `debug`, `zoom-out`, `sync-check` | Run *before* a build skill when the input isn't yet clear. |
 | **Shape** (decides module / topology) | `design`, `system-design`, `improve-codebase-architecture` | Greenfield-module / greenfield-system / brownfield. Same vocabulary ([`LANGUAGE.md`](./LANGUAGE.md)). |
-| **Lens** (applied during other skills, not invoked alone) | `code-hygiene` | Five principles you carry into `simplify`, `pr-review`, and any code-reading session. |
+| **Lens** (applied during other skills, not invoked alone) | `code-hygiene`, `caveman` | Principles you carry into any turn to maintain quality or efficiency. |
 ## Skill relationship map
-The 16 skills + their dependencies. Lateral edges are vocabulary / lens; vertical edges are workflow flow.
+The 20 skills + their dependencies. Lateral edges are vocabulary / lens; vertical edges are workflow flow.
 ```
                      ┌─────────────────────────────────────────────┐
@@ -80,16 +81,16 @@ The 16 skills + their dependencies. Lateral edges are vocabulary / lens; vertica
    │       │              ▲                │          │     prod-ready│
    │       │              │                │          │          │   │
    │       └──────────────┘                │          │          ▼   │
-   │                                       │          │     pr-review │
+   │                                       │          │      sync-check │
    │   system-design ──► design (per mod) ─┘          │          │   │
    │       │                                          │          ▼   │
-   │       ▼                                          │  verify-real-deps │
+   │       ▼                                          │      pr-review │
    │  docs/architecture.md                            │          │   │
    │                                                  │          ▼   │
-   │   improve-codebase-architecture ─────────────────┘     [merge]  │
-   │       ▲                                                          │
-   │       │                                                          │
-   │   tdd-rounds (orchestrator) ── dispatches Builders ──────────────│
+   │   improve-codebase-architecture ─────────────────┘  verify-real-deps│
+   │       ▲                                                     │   │
+   │       │                                                     ▼   │
+   │   tdd-rounds (orchestrator) ── dispatches Builders ────── [merge]│
    │       │      ▲   ▲     ▲                                         │
    │       │      │   │     │                                         │
    │       │   debug security-review                                  │
@@ -97,7 +98,7 @@ The 16 skills + their dependencies. Lateral edges are vocabulary / lens; vertica
    │                                                                  │
    │   ┌────────────── LENSES & UTILITIES ──────────────┐             │
    │   │  code-hygiene ──► applied during simplify,    │             │
-   │   │                   pr-review, any code reading  │             │
+   │   │  caveman     ──► pr-review, any code reading  │             │
    │   │                                                │             │
    │   │  zoom-out ──► interrupts any workflow,        │             │
    │   │              maps unfamiliar areas             │             │
@@ -105,14 +106,16 @@ The 16 skills + their dependencies. Lateral edges are vocabulary / lens; vertica
    └──────────────────────────────────────────────────────────────────┘
 ```
-**Six things the map shows:**
+**Seven things the map shows:**
-1. `code-hygiene` and `zoom-out` are not nodes in any flow — they're lenses / utilities applied across.
+1. `code-hygiene`, `caveman`, and `zoom-out` are not nodes in any flow — they're lenses / utilities applied across.
 2. `grill-plan` is the only skill invoked from three upstreams (`investigate`, `feature-doc`, `improve-codebase-architecture`) — it's the shared pressure-test step.
 3. `tdd-rounds` is the only multi-callee orchestrator — dispatches `design`, `tdd`, `simplify`, `prod-ready`, `verify-real-deps` to Builders.
 4. `pr-review` is the reviewer-side mirror of `prod-ready` — Section 7 doc-drift covered in both.
-5. `system-design` and `improve-codebase-architecture` are duals — same [LANGUAGE.md](./LANGUAGE.md), greenfield vs brownfield.
-6. The shared substrate ([LANGUAGE.md](./LANGUAGE.md), [formats/](./formats/), [TRIGGERS.md](./TRIGGERS.md)) is referenced everywhere — never copy-paste the content into a skill.
+5. `sync-check` sits before `pr-review` as a final terminology and ADR audit.
+6. `system-design` and `improve-codebase-architecture` are duals — same [LANGUAGE.md](./LANGUAGE.md), greenfield vs brownfield.
+7. The shared substrate ([LANGUAGE.md](./LANGUAGE.md), [formats/](./formats/), [TRIGGERS.md](./TRIGGERS.md)) is referenced everywhere — never copy-paste the content into a skill.
 ## Workflows

package/skills/TRIGGERS.md CHANGED Viewed

@@ -25,6 +25,12 @@ If a phrase appears in multiple rows, the disambiguation column names how to cho
 | "simpler", "boring", "naming", "YAGNI", "premature abstraction", "over-engineered", "clean this up at line level" | [`code-hygiene`](code-hygiene/SKILL.md) | Lens, not phase — applies during simplify, pr-review, or whenever you re-read code. |
 | "I'm lost", "give me higher-level context", "zoom out", "I don't know this area" | [`zoom-out`](zoom-out/SKILL.md) | User-invoked utility (`disable-model-invocation`). |
+## Utilities & efficiency
+| Phrase / situation | Routes to | Disambiguator |
+| --- | --- | --- |
+| "caveman mode", "be concise", "save tokens", "ooga booga" | [`caveman`](caveman/SKILL.md) | Token optimization lens. |
 ## Implementation
 | Phrase / situation | Routes to | Disambiguator |

package/skills/WORKFLOWS.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Workflows
-How the 16 skills compose into the common paths users actually walk.
+How the 20 skills compose into the common paths users actually walk.
 The skills aren't a flat menu — they form a workflow with branching. This file maps the canonical paths so you can see where to enter, what to expect, and what produces what.
@@ -32,6 +32,8 @@ Got a task? Pick by what you have in front of you:
 │ Surface-changing work (auth, public │ /security-review (gate, runs     │
 │   API, sensitive data, new entry pt)│  alongside Workflow 1/2/6)       │
 ├─────────────────────────────────────┼──────────────────────────────────┤
+│ Audit terminology or ADR compliance │ /sync-check (gate)               │
+├─────────────────────────────────────┼──────────────────────────────────┤
 │ Lost in unfamiliar area, mid-task   │ /zoom-out  (utility, anytime)    │
 └─────────────────────────────────────┴──────────────────────────────────┘
 ```
@@ -73,6 +75,9 @@ For a single-package feature with a manageable acceptance-criteria count.
        prod-ready  ──── 7-section checklist
             │
             ▼
+       sync-check  ──── terminology and ADR audit
+            │
+            ▼
        (optional) pr-review  ── self-check against the diff before opening
             │
             ▼
@@ -331,19 +336,21 @@ A few things that happen across all workflows:
 4. **`code-hygiene` is a lens, not a phase.** Apply it during the simplify sweep that follows TDD green, during `pr-review`, or whenever you re-read code and pause to understand it. Especially relevant in Workflows 1, 2, 4, and 5.
-5. **`debug` runs *before* `tdd` for non-trivial bugs.** Workflow 5b makes this explicit. The reproduction from `debug` becomes the failing test for `tdd`. Skip for bugs whose root cause is obvious from the trace (Workflow 5a).
+5. **`caveman` is an efficiency lens.** Apply it whenever token optimization is needed, particularly in Builder reports or long sessions. It doesn't change the workflow but compresses the communication.
+6. **`debug` runs *before* `tdd` for non-trivial bugs.** Workflow 5b makes this explicit. The reproduction from `debug` becomes the failing test for `tdd`. Skip for bugs whose root cause is obvious from the trace (Workflow 5a).
-6. **`security-review` is a gate, not a workflow.** Fires when a change is **surface-changing** — new entry point, identity / session / token flow, authorization logic, sensitive-data path, new external dependency, secrets handling. Runs alongside `design` and `tdd` in Workflows 1, 2, 4, 5a, 5b, 6 whenever those criteria hit. Not a substitute for `prod-ready` Section 3 — both run when the surface changes.
+7. **`security-review` is a gate, not a workflow.** Fires when a change is **surface-changing** — new entry point, identity / session / token flow, authorization logic, sensitive-data path, new external dependency, secrets handling. Runs alongside `design` and `tdd` in Workflows 1, 2, 4, 5a, 5b, 6 whenever those criteria hit. Not a substitute for `prod-ready` Section 3 — both run when the surface changes.
-7. **`pr-review` is a utility workflow.** Runs when reviewing someone else's PR. Also runs (lighter form) as a self-check before opening the PR. The `tdd-rounds` parent's per-round verification borrows from it.
+8. **`pr-review` is a utility workflow.** Runs when reviewing someone else's PR. Also runs (lighter form) as a self-check before opening the PR. The `tdd-rounds` parent's per-round verification borrows from it.
-8. **`prod-ready` is the universal pre-merge gate** for Workflows 1, 2, 4, and sometimes 5. Single exit ramp before opening a PR.
+9. **`prod-ready` is the universal pre-merge gate** for Workflows 1, 2, 4, and sometimes 5. Single exit ramp before opening a PR.
-9. **`verify-real-deps` fires whenever a workflow ends in a tagged release that touches a third-party API.** Most commonly that's Workflow 2 (large feature → tag), but it also applies when Workflow 1, 4, or 6 culminates in a release whose code path talks to an upstream you don't control. It does **not** fire for pure-internal services with database-only state, or for continuous-deploy environments that don't tag.
+10. **`verify-real-deps` fires whenever a workflow ends in a tagged release that touches a third-party API.** Most commonly that's Workflow 2 (large feature → tag), but it also applies when Workflow 1, 4, or 6 culminates in a release whose code path talks to an upstream you don't control. It does **not** fire for pure-internal services with database-only state, or for continuous-deploy environments that don't tag.
-10. **`system-design` runs once per system, not per feature.** It's the greenfield precursor to Workflows 1 and 2. Once the topology is set, individual features run their own Workflow 1 or 2 inside it.
+11. **`system-design` runs once per system, not per feature.** It's the greenfield precursor to Workflows 1 and 2. Once the topology is set, individual features run their own Workflow 1 or 2 inside it.
-11. **Vocabulary is shared.** All architecture-talking skills (`design`, `system-design`, `improve-codebase-architecture`, `pr-review`, `grill-plan`) read from [`skills/LANGUAGE.md`](./LANGUAGE.md). Format references (ADRs, CONTEXT.md) live in [`skills/formats/`](./formats/). Domain vocabulary (Customer, Order, etc.) lives in `docs/CONTEXT.md`. Keep them distinct.
+12. **Vocabulary is shared.** All architecture-talking skills (`design`, `system-design`, `improve-codebase-architecture`, `pr-review`, `grill-plan`) read from [`skills/LANGUAGE.md`](./LANGUAGE.md). Format references (ADRs, CONTEXT.md) live in [`skills/formats/`](./formats/). Domain vocabulary (Customer, Order, etc.) lives in `docs/CONTEXT.md`. Keep them distinct.
 13. **Bootstrap mode for greenfield repos** is documented in one place — [`grill-plan/BOOTSTRAP.md`](./grill-plan/BOOTSTRAP.md). `feature-doc` and `system-design` defer there rather than re-explaining the rules.
@@ -359,7 +366,9 @@ A few things that happen across all workflows:
     | `docs/adr/<n>-<topic>.md` | `grill-plan`, `improve-codebase-architecture` | One per architectural decision |
     | `docs/CONTEXT.md` | `grill-plan`, `improve-codebase-architecture` (inline updates) | One per repo / context |
     | `docs/architecture.md` | `system-design` | One per system (the system map) |
-    | `docs/STATE.md` | `tdd-rounds` parent (append-only) | One per multi-round project |
+    | `docs/features/<name>/state/snapshot.md` | `tdd-rounds` parent | Living snapshot per feature |
+    | `docs/features/<name>/state/rounds/*.md` | `tdd-rounds` Builder | Immutable round logs |
+    | `docs/STATE.md` | `tdd-rounds` parent | Global manifest (index only) |
     | `docs/security/<feature>.md` | `security-review` (high-stakes only) | One per surface-changing feature where a feature-doc section isn't enough |
     | `docs/benchmarks/<feature>.md` | `bench` | One per performance-critical feature |
     | `docs/known-issues.md` | `verify-real-deps` | One per repo (post-mortem record) |

package/skills/caveman/SKILL.md ADDED Viewed

@@ -0,0 +1,60 @@
+---
+name: caveman
+description: Ultra-concise communication utility for token optimization. Use when the user mentions "caveman mode", "be concise", "save tokens", or "ooga booga". Strips articles, pleasantries, and filler words to maximize efficiency. Skip when the user explicitly requests professional or detailed explanations.
+complexity: low
+expected_duration: 5 minutes
+---
+# Caveman Mode (Token Optimizer)
+The discipline of "speaking in tokens" to minimize cost and maximize context window longevity.
+## Why this skill exists
+Standard AI responses are often padded with 30-50% filler words ("Certainly," "I have completed," "As a senior engineer"). In long-running sessions or multi-round orchestration, this "politeness tax" adds up to significant cost and premature context exhaustion.
+## When to use
+- The user explicitly invokes "caveman mode" or "ooga booga".
+- The session is approaching context limits (warning from system).
+- You are operating as a Builder sub-agent where brevity is the primary mandate.
+## When to skip
+- The user asks for a detailed explanation or architectural rationale.
+- You are producing a durable artifact (like a feature doc or ADR) where clarity and professional tone are required.
+## Process
+### 1. Strip the "Fluff"
+Remove all conversational filler:
+- "Sure, I can help with that."
+- "I have successfully updated the file."
+- "Let me know if you need anything else."
+### 2. Compress the Prose
+Remove articles (a, an, the) and simplify verbs:
+- **Normal**: "I am updating the user controller to handle the new session timeout logic."
+- **Caveman**: "Update user controller. Handle session timeout."
+### 3. Bullet-Only Output
+Prefer short, punchy bullet points over paragraphs. One line per action.
+## Anti-patterns
+- **Losing Technical Accuracy.** Conciseness must not compromise path names, variable names, or command flags. `Update src/main.go` is good; `Fix file` is bad.
+- **Being Rude.** Caveman mode is about *token density*, not hostility. It is a technical optimization, not a personality shift.
+## Pairing with other skills
+- **[`tdd-rounds`](../tdd-rounds/SKILL.md)** — Excellent for Builder reports.
+- **[`simplify`](../simplify/SKILL.md)** — Use to report the "before/after" of a cleanup pass.
+## Done when
+- The response contains zero conversational filler.
+- All technical details remain accurate and actionable.
+- Token usage for the turn is significantly lower than a standard response.

package/skills/code-hygiene/SKILL.md CHANGED Viewed

@@ -66,9 +66,11 @@ Duplicate twice; extract on the third occurrence — not the second.
 The first occurrence is unique. The second might be coincidence. The third is a pattern. Extracting at two reveals only one axis of variation; extracting at three reveals the *real* axis.
 **Why**: premature abstractions calcify. Once a wrong abstraction exists, callers shape themselves to it, and rewriting becomes expensive. Three concrete copies are cheap; one wrong abstraction is not.
+5. **Locality of behavior** — related code lives together; don't split by category.
+6. **Comments earn their keep** — default to none. Keep only WHY-comments: a constraint, an invariant, a trade-off, or a provenance link to an ADR / round / snapshot.
-**Smell**: a helper function with one caller, or a base class with one subclass. That's an abstraction in search of a use.
+## When to use
+...
 ## Principle 5: Locality of behavior
 Related code lives close together. Don't split a system by *type of code* (`controllers/`, `services/`, `repositories/`) — split by *responsibility* (`orders/`, `billing/`, `auth/`).
@@ -77,6 +79,18 @@ Related code lives close together. Don't split a system by *type of code* (`cont
 **Smell**: changing one feature requires editing 5 files in 5 directories. That's a sign the structure separates *type* of code, not *responsibility*. (This is a `improve-codebase-architecture` issue at scale, but at smaller scale you can fix it inline by colocating files.)
+## Principle 6: Comments earn their keep
+Default to none. Keep only WHY-comments: a constraint, an invariant, a trade-off, or a provenance link to an ADR / round / snapshot. Delete WHAT-comments, "used by X" / "added for Y" caller references, banner dividers, and commented-out code on sight.
+**Smells**:
+- A comment that restates the next line of code.
+- A comment naming a caller — that reference rots the moment someone renames or removes the caller. Use `grep`.
+- A TODO / FIXME / HACK with no owner, ticket, ADR, or date.
+- Commented-out code "in case we need it again". Git has it.
+**Project-specific conventions** (package docstrings, provenance grammar) live in the repo's comment style doc — usually [`skills/formats/STYLE-comments.md`](../formats/STYLE-comments.md).
 ## Done when
 - Names communicate intent — a stranger reads them and forms the right mental model.
@@ -84,6 +98,7 @@ Related code lives close together. Don't split a system by *type of code* (`cont
 - No "in case we need it" parameters, classes, or interfaces remain.
 - Duplications either survived the 2-occurrence test (left as-is) or proved themselves at the 3rd occurrence (extracted).
 - Related code lives near related code.
+- Every surviving comment names a why, an invariant, a trade-off, or a provenance link. None restates the code.
 ## Pairing with other skills

package/skills/formats/STYLE-comments.md ADDED Viewed

@@ -0,0 +1,113 @@
+# Comment Style
+Codifies the comment discipline for this codebase. New contributors (human or agent) should match what's here, and what's already in the code, rather than re-deriving conventions per package.
+> Default: write **no** comment. Add one only when the WHY is non-obvious.
+The code's identifiers, types, and structure already explain the *what*. A comment earns its line only when it captures something the reader cannot recover by reading the next ten lines: a constraint, an invariant, a trade-off, a workaround, or a pointer to the decision record.
+---
+## 1. Five kinds of comment we keep
+### 1.1 Package docstring (required, one per package)
+Every package starts with a multi-paragraph docstring block in its primary file.
+Shape:
+- One-line purpose.
+- One paragraph on the design choice and its rationale.
+- Round / ADR provenance for major decisions.
+- The failure mode or invariant the next reader needs to know.
+### 1.2 Docstrings on every exported identifier (required)
+- Starts with the identifier name (`// Cache stores …`, not `// Stores …`).
+- Interface methods state the **contract**, not just the action: is the call cheap, idempotent, safe under concurrency, called once per request or many times?
+### 1.3 Invariant / constraint comments (encouraged)
+A comment that catches a bug the next maintainer would otherwise write.
+- `// Zero means 'unknown'` (and re-stated where the bug would bite).
+- `// Selector must be cheap and side-effect-free; ServeHTTP may invoke Pick multiple times on 429 retry.`
+### 1.4 Trade-off comments (encouraged)
+Name the alternative considered and why it lost. Cite the failure mode if the choice has one.
+### 1.5 Provenance comments (use the grammar in §3)
+Anchor non-obvious behavior to its decision record (ADR, round, AC, snapshot).
+---
+## 2. Five kinds of comment we delete on sight
+1. **Restates the code.** `// increment i`, `// return the user`, `// loop over items`. The code already says it.
+2. **Commented-out code.** Git has it. Delete.
+3. **"Added for X" / "Used by Y" caller references.** These rot the day someone renames or removes the caller. Use `grep` instead.
+4. **Banner / ASCII-art dividers.** `// =====` and `// --- section ---`. Use a function or a file boundary, not a banner.
+5. **Stale TODO / FIXME / HACK with no owner, ticket, ADR, or date.** Either link it to a decision record or fix it now.
+---
+## 3. Provenance grammar (canonical forms)
+When a comment cites the project record, use **exactly one** of these forms. Comments that follow the canonical grammar are greppable and stable.
+| Kind                                   | Canonical form          | Examples                                |
+| -------------------------------------- | ----------------------- | --------------------------------------- |
+| Architectural decision record          | `ADR-NNN §SECTION`      | `ADR-007 §7`, `ADR-002 §"Routing"`      |
+| TDD round                              | `RN`                    | `R6`, `R3`                              |
+| TDD round + acceptance criterion       | `RN AC-M`               | `R6 AC-3`, `R1 AC-12`                   |
+| Tagged round + batch within a release  | `vX.Y RNbM`             | `v0.3 R1b2`, `v0.8 R2b1`                |
+| Dated verification                     | `<kind> YYYY-MM-DD`     | `smoke-test 2026-05-09`                 |
+| Feature state snapshot                 | `snapshot §SECTION`     | `snapshot §"Invariants"`                |
+**Avoid** (normalize to the table above):
+- `Round 6`, `round-six`, `R 6` → `R6`
+- `ADR 007`, `ADR7`, `ADR-7`, `adr-007` → `ADR-007`
+- `AC 3`, `AC3`, `ac-3` → `AC-3`
+- Section anchors: prefer `§7` or `§"Quoted Heading"`; avoid `section 7`, `sec. 7`, `(7)`.
+---
+## 4. When the comment grows
+If the explanation needs more than **~6 lines**, the right home is an ADR or a `docs/research/<topic>.md` note. Leave a one-line pointer in the code:
+```go
+// Selection rationale: see docs/research/tier-spill.md and ADR-002 §"Routing".
+```
+---
+## 5. "Documented for the next reader" framing
+When a comment exists because a future reader is **likely to make the opposite assumption**, say so explicitly:
+```go
+// Cold-cache choice (documented for the next reader): an account whose
+// (pk, model) is not yet in the cache is treated as INELIGIBLE …
+```
+---
+## 6. Test files
+Non-trivial tests — those with fakes, harnesses, time control, channel-driven ceremonies, or unusual lifecycle — get a **5–20 line block comment at the top** of the file (or above the harness type) stating:
+- What is **faked** versus real (and why each choice).
+- Why the design (deterministic? avoid `time.Sleep`? exercise a specific retry path?).
+- Which knobs are overridden for the test and to **what value**.
+---
+## 7. Done when
+Apply this checklist during the [`simplify`](../simplify/SKILL.md) sweep or in code review:
+- [ ] Every package has a docstring matching §1.1.
+- [ ] Every exported identifier has a docstring starting with its name.
+- [ ] Every comment that survived the sweep is a why, an invariant, a trade-off, or a provenance link — none restates the code.
+- [ ] No commented-out code, no `// used by X`, no banner dividers, no orphan TODOs.
+- [ ] Every provenance citation matches the canonical grammar in §3.
+- [ ] Test files with non-trivial harnesses have a strategy preamble; trivial ones do not.

package/skills/pr-review/SKILL.md CHANGED Viewed

@@ -104,6 +104,7 @@ The doc-map is small enough to walk in 2–3 minutes. Skip it and you're trading
 Apply `code-hygiene` as a lens here, not as a primary phase:
+- **Comment churn**: new WHAT-comments, stale "used by X" references, citation grammar that doesn't match the repo's comment style doc ([`skills/formats/STYLE-comments.md`](../formats/STYLE-comments.md)). These are nits individually; in aggregate they're a signal the author didn't run `simplify`. Flag as suggestions, not blockers.
 - Names that mislead (boolean returning non-bool, `getX` that mutates, `Manager`/`Helper` suffixes hiding what the thing is).
 - Cleverness that earns its cost? Or could be boring?
 - YAGNI — "in case we need it" parameters / interfaces / classes? Strip.

package/skills/simplify/SKILL.md CHANGED Viewed

@@ -49,7 +49,13 @@ Walk every changed file. Apply each lens in order. Fix what you find inline.
 - Names that read clearly out of context — would a stranger guess what `result`, `data`, `value` referred to? If not, rename.
 - Error messages that name the failing input — `"could not parse: <value>"` beats `"parse error"`.
 - Abstractions that haven't earned their keep — a base class with one subclass, an interface with one implementation. Inline.
-- Comments that explain *what* the code does — delete; the code already says it. Keep comments that explain *why* (a constraint, an invariant, a workaround).
+- Comments:
+  - Delete WHAT-comments (the code already says it).
+  - Delete "used by X" / "added for Y" caller references — these rot; use grep.
+  - Delete commented-out code (git has it).
+  - Delete banner / ASCII-art dividers.
+  - KEEP why-comments: constraint, invariant, trade-off, provenance.
+  - If a kept comment cites a round / ADR / AC / snapshot, normalize to the project grammar in [`skills/formats/STYLE-comments.md`](../formats/STYLE-comments.md) (§3 — `ADR-007 §7`, `R6 AC-3`, `v0.3 R1b2`, `snapshot §"Invariants"`).
 ### 3. Efficiency — dead code, redundant work, premature defensive checks

package/skills/tdd-rounds/SKILL.md CHANGED Viewed

@@ -69,7 +69,7 @@ Known follow-ups: <one line>
 - **Builder stuck (test won't go green / design feels wrong).** Builder reports a *blocking* open question instead of thrashing. Hard rule: **Builder must not silently descope an AC.**
   - **Concrete escalation signals** (any one fires → stop and surface):
     - 3 consecutive failed attempts at making the same test green with the same approach.
-    - 2 design-level questions that the brief + STATE.md + cited ADRs don't answer.
+    - 2 design-level questions that the brief + feature state + cited ADRs don't answer.
     - The Builder finds itself wanting to modify a file in the "must NOT touch" allowlist to make progress.
     - A new test would require mocking >3 internal collaborators (smell: design is wrong, not the test).
   - **Parent's response**: shrink scope (split the round), or invoke `design` explicitly with the friction described, or run `grill-plan` if a load-bearing decision is wobbling. Don't push the Builder to keep trying.
@@ -94,3 +94,16 @@ When the final round completes:
 1. Run `verify-real-deps`. Capture surfaced bugs into `docs/known-issues.md`.
 2. Iterate fix-rounds until clean, or document deferrals to vN.1 with rationale.
 3. Tag and publish via whatever release / distribution channel applies.
+hape the Builder returns.
+## Supporting docs
+- [`COMMITS.md`](COMMITS.md) — commit cadence and message style (per-AC slicing, `R<N>:` prefix, when single-commit is OK, honesty rule). Builders read this before the first commit.
+## Handoff
+When the final round completes:
+1. Run `verify-real-deps`. Capture surfaced bugs into `docs/known-issues.md`.
+2. Iterate fix-rounds until clean, or document deferrals to vN.1 with rationale.
+3. Tag and publish via whatever release / distribution channel applies.