npm - @event4u/agent-config - Versions diffs - 1.14.0 → 1.15.0 - Mend

@event4u/agent-config 1.14.0 → 1.15.0

Files changed (106)

package/.agent-src/commands/agent-handoff.md +1 -1
package/.agent-src/commands/bug-fix.md +2 -2
package/.agent-src/commands/chat-history-checkpoint.md +2 -2
package/.agent-src/commands/chat-history-clear.md +1 -1
package/.agent-src/commands/chat-history-resume.md +2 -2
package/.agent-src/commands/chat-history.md +2 -2
package/.agent-src/commands/check-current-md.md +43 -32
package/.agent-src/commands/commit-in-chunks.md +43 -23
package/.agent-src/commands/compress.md +34 -2
package/.agent-src/commands/feature-roadmap.md +2 -2
package/.agent-src/commands/fix-portability.md +2 -2
package/.agent-src/commands/onboard.md +14 -5
package/.agent-src/commands/optimize-augmentignore.md +9 -0
package/.agent-src/commands/refine-ticket.md +9 -7
package/.agent-src/commands/review-changes.md +35 -8
package/.agent-src/commands/roadmap-create.md +13 -2
package/.agent-src/commands/roadmap-execute.md +9 -7
package/.agent-src/commands/set-cost-profile.md +8 -0
package/.agent-src/commands/sync-agent-settings.md +9 -0
package/.agent-src/commands/tests-execute.md +2 -3
package/.agent-src/rules/artifact-engagement-recording.md +1 -1
package/.agent-src/rules/augment-portability.md +56 -37
package/.agent-src/rules/chat-history-cadence.md +109 -0
package/.agent-src/rules/chat-history-ownership.md +123 -0
package/.agent-src/rules/chat-history-visibility.md +96 -0
package/.agent-src/rules/cli-output-handling.md +1 -1
package/.agent-src/rules/command-suggestion.md +3 -2
package/.agent-src/rules/commit-policy.md +44 -34
package/.agent-src/rules/direct-answers.md +1 -1
package/.agent-src/rules/language-and-tone.md +19 -15
package/.agent-src/rules/non-destructive-by-default.md +18 -18
package/.agent-src/rules/roadmap-progress-sync.md +133 -74
package/.agent-src/rules/role-mode-adherence.md +1 -1
package/.agent-src/rules/size-enforcement.md +2 -1
package/.agent-src/rules/user-interaction.md +28 -4
package/.agent-src/scripts/update_roadmap_progress.py +56 -4
package/.agent-src/skills/blade-ui/SKILL.md +29 -10
package/.agent-src/skills/command-writing/SKILL.md +15 -4
package/.agent-src/skills/existing-ui-audit/SKILL.md +24 -9
package/.agent-src/skills/fe-design/SKILL.md +20 -15
package/.agent-src/skills/file-editor/SKILL.md +9 -0
package/.agent-src/skills/livewire/SKILL.md +26 -7
package/.agent-src/skills/refine-ticket/SKILL.md +30 -24
package/.agent-src/skills/roadmap-management/SKILL.md +22 -16
package/.agent-src/skills/skill-writing/SKILL.md +3 -3
package/.agent-src/skills/upstream-contribute/SKILL.md +2 -2
package/.agent-src/templates/agent-settings.md +1 -1
package/.agent-src/templates/roadmaps.md +9 -8
package/.agent-src/templates/scripts/memory_lookup.py +1 -1
package/.agent-src/templates/scripts/work_engine/__init__.py +2 -2
package/.agent-src/templates/scripts/work_engine/cli.py +64 -461
package/.agent-src/templates/scripts/work_engine/cli_args.py +116 -0
package/.agent-src/templates/scripts/work_engine/delivery_state.py +3 -3
package/.agent-src/templates/scripts/work_engine/directives/backend/__init__.py +1 -1
package/.agent-src/templates/scripts/work_engine/directives/backend/implement.py +1 -1
package/.agent-src/templates/scripts/work_engine/directives/backend/memory.py +1 -1
package/.agent-src/templates/scripts/work_engine/directives/backend/plan.py +1 -1
package/.agent-src/templates/scripts/work_engine/directives/backend/report.py +1 -1
package/.agent-src/templates/scripts/work_engine/dispatcher.py +1 -1
package/.agent-src/templates/scripts/work_engine/emitters.py +43 -0
package/.agent-src/templates/scripts/work_engine/errors.py +19 -0
package/.agent-src/templates/scripts/work_engine/hook_bootstrap.py +76 -0
package/.agent-src/templates/scripts/work_engine/input_builders.py +163 -0
package/.agent-src/templates/scripts/work_engine/migration/v0_to_v1.py +34 -2
package/.agent-src/templates/scripts/work_engine/persona_policy.py +1 -1
package/.agent-src/templates/scripts/work_engine/resolvers/prompt.py +1 -1
package/.agent-src/templates/scripts/work_engine/state_io.py +202 -0
package/.claude-plugin/marketplace.json +1 -1
package/AGENTS.md +6 -4
package/CHANGELOG.md +83 -8
package/README.md +24 -23
package/docs/MIGRATION.md +122 -0
package/docs/architecture.md +83 -34
package/docs/contracts/STABILITY.md +95 -0
package/docs/contracts/adr-chat-history-split.md +132 -0
package/docs/contracts/adr-command-suggestion.md +146 -0
package/docs/contracts/adr-implement-ticket-runtime.md +122 -0
package/docs/contracts/adr-product-ui-track.md +384 -0
package/docs/contracts/adr-prompt-driven-execution.md +187 -0
package/docs/contracts/agent-memory-contract.md +149 -0
package/docs/contracts/artifact-engagement-flow.md +262 -0
package/docs/contracts/command-clusters.md +126 -0
package/docs/contracts/command-suggestion-flow.md +148 -0
package/docs/contracts/implement-ticket-flow.md +628 -0
package/docs/contracts/linear-ai-rules-inclusion.md +143 -0
package/docs/contracts/linear-ai-three-layers.md +131 -0
package/docs/contracts/rule-interactions.md +107 -0
package/docs/contracts/rule-interactions.yml +142 -0
package/docs/contracts/ui-stack-extension.md +236 -0
package/docs/contracts/ui-track-flow.md +338 -0
package/docs/getting-started.md +2 -2
package/docs/installation.md +42 -6
package/docs/migrations/commands-1.15.0.md +112 -0
package/docs/ui-track-mental-model.md +121 -0
package/package.json +1 -1
package/scripts/build_linear_digest.py +4 -4
package/scripts/check_portability.py +2 -0
package/scripts/check_public_links.py +185 -0
package/scripts/check_references.py +1 -0
package/scripts/lint_no_new_atomic_commands.py +179 -0
package/scripts/lint_rule_interactions.py +149 -0
package/scripts/memory_lookup.py +1 -1
package/scripts/release.py +297 -64
package/scripts/skill_linter.py +14 -0
package/scripts/update_counts.py +10 -0
package/.agent-src/rules/chat-history.md +0 -200

package/docs/contracts/ui-stack-extension.md ADDED

@@ -0,0 +1,236 @@
+---
+stability: beta
+---
+# UI Stack Extension — adding a new frontend stack to the UI track
+> **Audience:** maintainers adding a new stack (Svelte, SolidJS, Astro,
+> Qwik, …) to the UI directive set. Consumers of the package never run
+> this; they get the stacks shipped here.
+> **Source of truth:** [`ui-track-flow.md`](ui-track-flow.md) for the
+> contract; [`adr-product-ui-track.md`](adr-product-ui-track.md) for
+> the rationale.
+> **Status:** recipe only — R3 ships four stacks
+> (`blade-livewire-flux`, `react-shadcn`, `vue`, `plain`); no
+> additional stacks are scheduled. Add one only when a real consumer
+> project asks for it.
+## What "adding a stack" actually means
+Three kinds of artefact (stack label, skill bundles, dispatch rows)
+plus a Golden fixture. The engine's directive set is fixed; only the
+dispatch tables and the implementation skill bundles change.
+| Artefact | File | Change |
+|---|---|---|
+| Stack label | [`scripts/work_engine/stack/detect.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/stack/detect.py) | New entry in `KNOWN_STACKS` + a heuristic in `detect_stack` |
+| Apply skill | `.agent-src.uncompressed/skills/ui-apply-<stack>/SKILL.md` | New skill bundle |
+| Review skill | `.agent-src.uncompressed/skills/ui-design-review-<stack>/SKILL.md` | New skill bundle |
+| Polish skill | `.agent-src.uncompressed/skills/ui-polish-<stack>/SKILL.md` | New skill bundle |
+| Dispatch tables | `directives/ui/{apply,review,polish}.py` | New row in each `STACK_DIRECTIVES` map |
+| Golden fixture | `tests/golden/sandbox/recipes/gt_u<NN>_<stack>_*.py` | One happy-path baseline at minimum |
+Skill names are not free — they are read by the dispatcher as
+`ui-apply-<stack>`, `ui-design-review-<stack>`, `ui-polish-<stack>`.
+A typo is silently handled by the `DEFAULT_DIRECTIVE` fallback
+(`ui-apply-plain` / `ui-design-review-plain` / `ui-polish-plain`),
+which is recoverable but probably not what the maintainer intended.
+## Step 1 — pick the label
+Convention: lowercase, hyphenated, framework-only (no version pin).
+| Good | Bad | Why |
+|---|---|---|
+| `svelte` | `svelte5` | Major-version drift handled by the audit's version-anchor warning, not by splitting the stack. |
+| `solid` | `solid-js` | Match the package name developers say out loud. |
+| `astro` | `astro-content` | Sub-modes (Astro components vs MDX) live inside the apply skill, not in the label. |
+Add the label to `KNOWN_STACKS` (the frozenset is exported and used by
+state validation; omitting it makes detection fall back silently to
+`plain`).
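Part of the label convention can be checked mechanically. The sketch below is a minimal illustration, not code from the package: `is_valid_stack_label` is a hypothetical helper, and treating any digit as a version pin is an assumption.

```python
import re

# Hypothetical helper, not part of the package: lowercase words joined by
# single hyphens. Digits are rejected on the assumption that they signal a
# version pin such as `svelte5`.
_LABEL_RE = re.compile(r"[a-z]+(?:-[a-z]+)*")


def is_valid_stack_label(label: str) -> bool:
    """Return True when `label` is lowercase, hyphenated, and digit-free."""
    return _LABEL_RE.fullmatch(label) is not None


print(is_valid_stack_label("svelte"))               # True
print(is_valid_stack_label("blade-livewire-flux"))  # True
print(is_valid_stack_label("svelte5"))              # False
```

A pattern like this cannot enforce the "framework-only" or "say out loud" rows of the table: `solid-js` and `astro-content` both pass it, so those still need human review.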
+## Step 2 — heuristic in `detect_stack`
+The detector reads `composer.json` and `package.json` once and applies
+heuristics in **priority order** — first match wins. The order matters:
+a Laravel + React project must hit `blade-livewire-flux` only when
+Livewire and Flux are present, otherwise `react-shadcn`.
+Insert the new heuristic **before** the `plain` fallback and **after**
+any stack that could legitimately co-exist with yours. For Svelte:
+```python
+def _has_svelte(package: dict[str, object]) -> bool:
+    deps = _all_dependencies(
+        package,
+        "dependencies",
+        "devDependencies",
+        "peerDependencies",
+        "optionalDependencies",
+    )
+    return "svelte" in deps
+```
+```python
+# inside detect_stack(), before the `plain` fallback
+if _has_svelte(package):
+    return StackResult(frontend="svelte", mtime=mtime)
+```
+**Failure mode to watch.** A heuristic that overlaps with an existing
+stack (Astro lists `react` as a peer dep when used with React adapters)
+must be ordered carefully. Test against real fixtures, not hand-edited
+JSON.
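The first-match-wins ordering can be sketched as an ordered list of predicates. This is a self-contained illustration under stated assumptions, not the real `detect_stack`: `detect_frontend`, `HEURISTICS`, and `_deps` are hypothetical names, and the Composer package names used for Livewire and Flux as well as the `react` check are assumed heuristics.

```python
from typing import Callable

Manifest = dict[str, object]


def _deps(manifest: Manifest, *sections: str) -> set[str]:
    """Collect dependency names from the named manifest sections."""
    names: set[str] = set()
    for section in sections:
        block = manifest.get(section)
        if isinstance(block, dict):
            names.update(block)
    return names


def _has_livewire_flux(composer: Manifest, package: Manifest) -> bool:
    # Assumed Composer package names; the shipped heuristic may differ.
    return {"livewire/livewire", "livewire/flux"} <= _deps(composer, "require")


def _has_react(composer: Manifest, package: Manifest) -> bool:
    # Assumed heuristic for the `react-shadcn` stack.
    return "react" in _deps(package, "dependencies", "devDependencies")


def _has_svelte(composer: Manifest, package: Manifest) -> bool:
    return "svelte" in _deps(package, "dependencies", "devDependencies")


# Priority order: the first matching predicate wins; `plain` is the fallback.
HEURISTICS: list[tuple[str, Callable[[Manifest, Manifest], bool]]] = [
    ("blade-livewire-flux", _has_livewire_flux),
    ("react-shadcn", _has_react),
    ("svelte", _has_svelte),
]


def detect_frontend(composer: Manifest, package: Manifest) -> str:
    """Return the label of the first heuristic that matches."""
    for label, matches in HEURISTICS:
        if matches(composer, package):
            return label
    return "plain"


laravel_react = detect_frontend(
    {"require": {"laravel/framework": "^11.0"}},
    {"dependencies": {"react": "^18.0.0"}},
)
print(laravel_react)  # react-shadcn
```

Keeping the order in one list makes the priority chain reviewable in a single place, which is where an over-matching heuristic would be re-ordered.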
+## Step 3 — three skills, one shape
+Each skill is a SKILL.md bundle with frontmatter and prose. The
+contract for each:
+### `ui-apply-<stack>`
+Reads `state.ui_design.brief` and `state.ui_audit.components_found`.
+Writes a populated `state.ticket["ui_apply"]` envelope:
+```yaml
+ui_apply:
+  files: ["resources/components/UserCard.svelte"]
+  rendered:                 # full text per file, microcopy-locked
+    "resources/components/UserCard.svelte": |
+      <script>...</script>
+      ...
+  components_added: ["UserCard"]
+  components_reused: ["Button", "Card"]   # from audit
+  microcopy_lock: true      # affirms strings come from brief verbatim
+```
+**Hard rule:** strings in `rendered` must not contain
+`PLACEHOLDER_PATTERNS` (`<placeholder>`, `lorem`, `todo:`, `tbd`,
+`xxx`). The dispatcher rejects on producer side; the skill must reject
+on consumer side too.
+### `ui-design-review-<stack>`
+Reads `state.ui_apply` + `state.ui_design.brief`. Writes
+`state.ui_review` with `findings: list` and `review_clean: bool`.
+Findings are tagged `kind` ∈ `{token_violation, microcopy_drift,
+a11y_gap, layout_break, prop_misuse, …}` — extend the kind set
+sparingly. Token violations carry `category` and `value` so polish
+can route them through token extraction.
+### `ui-polish-<stack>`
+Reads `state.ui_review.findings` + `state.ui_audit.design_tokens`.
+Writes a fix envelope and increments `state.ui_polish.rounds`. Hard
+ceiling at `POLISH_CEILING = 2` is enforced by the dispatcher;
+the skill does **not** check the ceiling itself but must respect
+`token_repeat_threshold = 2` for unmatched-token extraction.
+## Step 4 — wire dispatch tables
+Three identical edits in
+[`directives/ui/apply.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/apply.py),
+[`directives/ui/review.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/review.py),
+and [`directives/ui/polish.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/polish.py):
+```python
+STACK_DIRECTIVES: dict[str, str] = {
+    "blade-livewire-flux": "ui-apply-blade-livewire-flux",
+    "react-shadcn": "ui-apply-react-shadcn",
+    "vue": "ui-apply-vue",
+    "plain": "ui-apply-plain",
+    "svelte": "ui-apply-svelte",          # ← new row
+}
+```
+Same shape in `review.py` (`ui-design-review-svelte`) and `polish.py`
+(`ui-polish-svelte`). The dispatcher does not validate that the skill
+actually exists — it emits the `@agent-directive` and trusts the agent
+loader. A typo or missing skill manifests as a halt loop the maintainer
+notices on first replay.
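Because the dispatcher does not validate skill names, a small self-check can catch a mistyped row before the first replay. The sketch below assumes the naming rule `<prefix><stack>` described earlier; `misnamed_rows` and `resolve` are hypothetical helpers, and the `dict.get` fallback is an assumed reading of how `DEFAULT_DIRECTIVE` is applied.

```python
APPLY_PREFIX = "ui-apply-"
DEFAULT_DIRECTIVE = "ui-apply-plain"

STACK_DIRECTIVES: dict[str, str] = {
    "blade-livewire-flux": "ui-apply-blade-livewire-flux",
    "react-shadcn": "ui-apply-react-shadcn",
    "vue": "ui-apply-vue",
    "plain": "ui-apply-plain",
    "svelte": "ui-apply-svelt",  # deliberate typo for the demonstration
}


def resolve(table: dict[str, str], stack: str) -> str:
    """Look up the directive for `stack`, falling back to the default."""
    return table.get(stack, DEFAULT_DIRECTIVE)


def misnamed_rows(table: dict[str, str], prefix: str) -> list[str]:
    """Return stack labels whose directive is not exactly prefix + label."""
    return sorted(
        stack for stack, directive in table.items()
        if directive != f"{prefix}{stack}"
    )


print(misnamed_rows(STACK_DIRECTIVES, APPLY_PREFIX))  # ['svelte']
print(resolve(STACK_DIRECTIVES, "solid"))             # ui-apply-plain
```

The same check would apply to `review.py` and `polish.py` with the prefixes `ui-design-review-` and `ui-polish-`.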
+## Step 5 — version anchor in skill frontmatter
+Every stack-specific skill declares the upstream library version it
+was tested against, in frontmatter:
+```yaml
+tested_against:
+  svelte: "5.x"
+  svelte_kit: "2.x"      # if applicable
+```
+The audit step reads installed versions from `package.json` and
+compares against the anchor. A mismatch warns but does not block;
+the user picks whether to proceed or pin. Forgetting the anchor
+fails CI (`task lint-skills` enforces presence on `ui-apply-*`,
+`ui-design-review-*`, `ui-polish-*`).
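The warn-but-do-not-block comparison might look like the following. It is a rough sketch, not the audit step's code: `major_of` and `anchor_warning` are hypothetical names, only the major version is compared, and a specifier that cannot be parsed produces no warning, which is an assumption.

```python
import re
from typing import Optional


def major_of(spec: str) -> Optional[int]:
    """Return the first integer in a specifier such as '^5.1.0' or '5.x'."""
    match = re.search(r"\d+", spec)
    return int(match.group()) if match else None


def anchor_warning(name: str, anchor: str, installed: str) -> Optional[str]:
    """Return a warning string on a major-version mismatch, else None."""
    anchor_major = major_of(anchor)
    installed_major = major_of(installed)
    if anchor_major is None or installed_major is None:
        return None  # assumption: unparseable specifiers are not flagged
    if anchor_major == installed_major:
        return None
    return f"{name}: tested against {anchor}, installed {installed}"


print(anchor_warning("svelte", "5.x", "^5.1.0"))   # None
print(anchor_warning("svelte", "5.x", "^4.2.19"))
```

A range such as `>=4 <6` would be read as major 4 by this sketch, so a real comparison would need a proper semver range parser.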
+## Step 6 — Golden fixture
+Add at least one happy-path Golden that exercises the full UI flow
+on the new stack. Recipe lives at
+`tests/golden/sandbox/recipes/gt_u<NN>_<stack>_happy.py` and follows
+the pattern of `gt_u11_high_confidence.py`:
+1. Seed state with a clear `ui-build` prompt and an audit that finds
+   exactly one strong match (so `audit_path = "high_confidence"`).
+2. Pin the halt budget — high-confidence happy path is **1 halt**
+   (design-brief sign-off only).
+3. Capture under `tests/golden/baseline/GT-U<NN>/` via `task golden-capture`.
+4. The replay harness (`tests/golden/test_replay.py`) auto-discovers
+   the new baseline; no Taskfile change needed.
+For a complete pair, also add an `ambiguous` fixture — `2 halts max`,
+matching `gt_u12_ambiguous.py`'s shape. Skip the greenfield branch
+unless the stack has a meaningful difference there.
+See [`tests/golden/CAPTURING.md`](../../tests/golden/CAPTURING.md)
+for capture mechanics.
+## Step 7 — verify end-to-end
+```bash
+task sync                  # propagate detect.py + skill changes
+task generate-tools        # refresh .claude/, .cursor/, etc.
+task lint-skills           # checks frontmatter + version anchor
+task golden-replay         # runs all R1+R2+R3 baselines
+task ci                    # full pipeline
+```
+If `golden-replay` regresses on a non-`<stack>` baseline, your
+detector heuristic is over-matching — re-order the priority chain.
+## What you do **not** add
+- **A new directive set.** UI / UI-trivial / mixed are exhaustive for
+  R3. A stack-specific directive set means you've reached for a hammer
+  the engine doesn't have.
+- **A new entrypoint command.** `/work` and `/implement-ticket` route
+  through `directive_set` and `state.stack.frontend`; a `/build-svelte`
+  command would create a UX divergence the engine's intent classifier
+  is supposed to prevent.
+- **`fe-design` content.** That skill is the framework-agnostic
+  reference; new stack-specific heuristics live in the apply skill,
+  not in `fe-design`.
+- **Visual review.** Roadmap 4's headless-browser pipeline is the
+  destination for screenshot capture and a11y tooling. Stack
+  extensions don't need to wait on it.
+## When NOT to add a stack
+Defer the work and stay on `plain` if any apply:
+- Only one consumer project asks; the cost of maintaining the apply
+  skill exceeds the value.
+- The framework's idiom is close enough to an existing stack that
+  the existing apply skill produces acceptable output (Preact ≈
+  `react-shadcn` for most components).
+- The framework is in beta or pre-1.0 — the version anchor will drift
+  faster than you can re-capture goldens.
+The audit gate is the safety net: even on `plain`, the audit finds
+existing components and the design step uses them. The cost of
+**not** adding a stack is generally lower than the cost of adding
+one prematurely.

package/docs/contracts/ui-track-flow.md ADDED

@@ -0,0 +1,338 @@
+---
+stability: beta
+---
+# UI Track — Flow Contract
+> Technical contracts for the UI directive sets shipped under
+> [`road-to-product-ui-track.md`](../../agents/roadmaps/road-to-product-ui-track.md).
+> Sibling of [`implement-ticket-flow.md`](implement-ticket-flow.md) — that
+> doc covers `backend`; this one covers `ui`, `ui-trivial`, and the
+> `mixed` set that stitches both.
+>
+> - **Created:** 2026-05-01
+> - **Status:** Phase 1–6 shipped — audit / design / apply / review /
+>   polish handlers live under
+>   [`.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/).
+>   Mixed (Phase 4) under `directives/mixed/`. `ui-trivial` (Phase 2 Step 6)
+>   under `directives/ui_trivial/`. R4 (Visual Review Loop) added the
+>   a11y gate, the preview envelope, and a polish-termination rewrite
+>   that splits subjective ceilings from objective a11y blocks — see
+>   [`road-to-visual-review-loop.md`](../../agents/roadmaps/road-to-visual-review-loop.md).
+>   Golden Transcripts GT-U1..U4, U7, U8, U9..U12 plus GT-U5 (mixed
+>   flow), GT-U6A/B (stack dispatch), and R4's GT-U13..U15 (a11y polish,
+>   a11y ceiling, preview render failure) pin happy-path, ambiguity,
+>   mixed orchestration, stack dispatch, trivial happy path, and the
+>   visual-review gates.
+> - **Runtime:** Python 3.10+. Same `DeliveryState` envelope, same
+>   eight-slot dispatcher as the backend track — only the slot handlers
+>   differ.
+## What this doc is
+The **shape** of the UI-related directive sets (`ui`, `ui-trivial`, `mixed`): which slot runs
+which handler, what each handler reads / writes on `DeliveryState`, the
+hard thresholds (similarity, polish ceiling, trivial limits), and the
+sentinels that release each gate.
+## What this doc is *not*
+- A skill spec — `existing-ui-audit`, `ui-design-brief`, the stack
+  apply / review / polish skills are documented in their own
+  `SKILL.md` files.
+- A migration guide for the schema — see
+  [`implement-ticket-flow.md`](implement-ticket-flow.md#state-schema-v1).
+- A roadmap — phased delivery lives in
+  [`road-to-product-ui-track.md`](../../agents/roadmaps/road-to-product-ui-track.md).
+## The four directive sets
+| Set | When picked | Slot 1–8 |
+|---|---|---|
+| `backend` | Default — no UI keywords, no UI envelope | `refine → memory → analyze → plan → implement → test → verify → report` (see sibling doc) |
+| `ui` | UI keywords, `improve` envelope, or refine routing to UI | `audit → ⊘ → design → ⊘ → apply → review → polish → report` |
+| `ui-trivial` | Phase-1 classifier hits "single-line tweak" pattern | `refine → ⊘ → ⊘ → ⊘ → apply → test → ⊘ → report` |
+| `mixed` | Backend + UI in one input | `refine → memory → analyze → contract → ui → stitch → verify → report` |
+`⊘` = `_passthrough.run` (or `_skipped.run` in `ui-trivial`). The slot
+exists so the dispatcher's completeness check is satisfied; no logic
+runs and no state is touched.
+Source of truth for slot wiring:
+[`directives/ui/__init__.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/__init__.py),
+[`directives/mixed/__init__.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/mixed/__init__.py),
+[`directives/ui_trivial/__init__.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui_trivial/__init__.py).
+## The `ui` set — slot-by-slot
+### `refine` → audit
+Mandatory pre-step. Routes on `state.ui_audit` shape:
+| State | Outcome | Handler |
+|---|---|---|
+| `None` / empty / non-dict | `BLOCKED` + `@agent-directive: existing-ui-audit` | First-pass delegation |
+| `greenfield=True`, no `greenfield_decision` | `BLOCKED` numbered options | User picks `scaffold` / `bare` / `external_reference` |
+| `shadcn_inventory.version` major ≠ `TESTED_AGAINST_SHADCN_MAJOR` (`2`) and no `version_mismatch_decision` | `BLOCKED` soft halt | "Cautious composition / abort" |
+| Confidence `high` + ≥1 match with similarity ≥ `STRONG_SIMILARITY` (`0.7`) and no runner-up within `TIE_GAP` (`0.05`) | `SUCCESS`, `audit_path = "high_confidence"` | Design folds findings into brief |
+| Anything else populated | `BLOCKED` numbered options | User picks candidate to extend (or "build new"); records `audit_path = "ambiguous"` + `candidate_pick` |
+Constants live in
+[`directives/ui/audit.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui/audit.py):
+`STRONG_SIMILARITY = 0.7`, `TIE_GAP = 0.05`,
+`TESTED_AGAINST_SHADCN_MAJOR = 2`. Idempotent re-entry: once
+`audit_path` is set the step round-trips through `SUCCESS` without
+re-emitting.
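The last two rows of the routing table can be illustrated with the documented constants. This sketch models only the high-confidence versus ambiguous split for a populated audit; `classify_audit` is a hypothetical name, the flat list of similarity scores is an assumed input shape, and treating a gap exactly equal to `TIE_GAP` as a tie is an assumption.

```python
STRONG_SIMILARITY = 0.7
TIE_GAP = 0.05


def classify_audit(confidence: str, similarities: list[float]) -> str:
    """Return 'high_confidence' or 'ambiguous' for a populated audit."""
    ranked = sorted(similarities, reverse=True)
    if confidence != "high" or not ranked:
        return "ambiguous"
    if ranked[0] < STRONG_SIMILARITY:
        return "ambiguous"
    # A runner-up within TIE_GAP of the best match makes the pick unclear.
    if len(ranked) > 1 and ranked[0] - ranked[1] <= TIE_GAP:
        return "ambiguous"
    return "high_confidence"


print(classify_audit("high", [0.9, 0.5]))    # high_confidence
print(classify_audit("high", [0.72, 0.70]))  # ambiguous
```

The greenfield, version-mismatch, and empty-audit rows are deliberately left out; they route on other fields of `state.ui_audit`.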
+### `analyze` → design
+Produces the **locked design brief**. `apply` consumes microcopy
+verbatim — that's the lock.
+Required brief keys (`REQUIRED_BRIEF_KEYS`): `layout`, `components`,
+`states`, `microcopy`, `a11y`. Required state coverage
+(`REQUIRED_STATE_KEYS`): `empty`, `loading`, `error`, `success`,
+`disabled`.
+Microcopy is rejected when any string matches `PLACEHOLDER_PATTERNS`:
+`<placeholder>`, `lorem`, `todo:`, `tbd`, `xxx` (case-insensitive
+substring). The same tuple is re-imported by `apply` so the rejection
+fires at the producer first, at the consumer as defense-in-depth.
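A case-insensitive substring check over the documented patterns could be as small as this. `find_placeholders` is a hypothetical name; only the pattern tuple and the matching rule come from the text above.

```python
PLACEHOLDER_PATTERNS = ("<placeholder>", "lorem", "todo:", "tbd", "xxx")


def find_placeholders(strings: list[str]) -> list[str]:
    """Return the strings that contain any pattern, ignoring case."""
    hits = []
    for text in strings:
        lowered = text.lower()
        if any(pattern in lowered for pattern in PLACEHOLDER_PATTERNS):
            hits.append(text)
    return hits


microcopy = ["Save changes", "TODO: write copy", "Lorem ipsum dolor"]
print(find_placeholders(microcopy))  # ['TODO: write copy', 'Lorem ipsum dolor']
```

Because the match is a plain substring test, legitimate copy that happens to contain one of the patterns would also be rejected.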
+Sentinel: `state.ui_design.design_confirmed`. Without it the brief
+halt fires every pass; with it the step round-trips through `SUCCESS`.
+### `implement` → apply
+Stack-dispatched. Routes on `state.stack.frontend`:
+| `state.stack.frontend` | Directive | Skill bundle |
+|---|---|---|
+| `blade-livewire-flux` | `ui-apply-blade-livewire-flux` | `flux` + `livewire` + `blade-ui` |
+| `react-shadcn` | `ui-apply-react-shadcn` | `react-shadcn-ui` |
+| `vue` | `ui-apply-vue` | `ui-apply-vue` |
+| `plain` (or unknown — `DEFAULT_DIRECTIVE`) | `ui-apply-plain` | `blade-ui` + Tailwind base |
+Apply does **not** re-validate the brief — it validates *output* against
+`PLACEHOLDER_PATTERNS`. A hallucinated `<placeholder>` string in the
+rendered envelope triggers
+`apply_placeholders_in_output` and forces re-render with the locked
+microcopy. Once `state.ticket["ui_apply"]` is well-formed, apply records
+changes and returns `SUCCESS`.
+### `test` → review
+Stack-dispatched design-review pass. Same dispatch table shape as
+apply, prefix `ui-design-review-`. Writes
+`state.ui_review.findings` (list) + `state.ui_review.review_clean`
+(bool). The step does **not** enforce
+`review_clean == (len(findings) == 0)` — that would block the
+legitimate "ship as-is with open findings" replay path. Honesty of the
+flag is the producer's contract; review only validates shape.
+**R4 — a11y gate** (after the basic clean/findings gates pass).
+`_apply_a11y_gate` reads `state.ui_review.a11y.violations`, filters
+out entries already in `state.ui_audit.a11y_baseline` (pre-existing
+violations stay informational, never block), drops anything below
+`severity_floor` (default `moderate`; unknown severities default to
+`moderate` so a malformed envelope cannot weaken the gate), and
+filters entries listed in `state.ui_review.a11y.accepted_violations`
+(idempotent re-entry after the polish-ceiling Accept choice).
+Surviving violations are synthesised as
+`{kind: "a11y_violation", rule, selector, severity}` findings (deduped
+by `(rule, selector)`) and `review_clean` is forced to `False`
+engine-side. Opt-in: when `state.ui_audit.a11y_baseline` exists but
+`state.ui_review.a11y` is missing, the step halts with
+`review_a11y_pending` so the skill writes the envelope on the next
+pass; pre-R4 envelopes without a baseline bypass the gate entirely.
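The filtering order described for the a11y gate can be sketched as one pass over the violations. This is an illustration under assumptions, not `_apply_a11y_gate` itself: the function name `a11y_findings`, the four-level severity scale, and matching baseline entries by `(rule, selector)` are all assumed.

```python
# Assumed severity scale; the text above only names `moderate`.
SEVERITY_RANK = {"minor": 0, "moderate": 1, "serious": 2, "critical": 3}


def a11y_findings(violations, baseline, accepted_rules, severity_floor="moderate"):
    """Return synthesised a11y_violation findings for surviving violations."""
    floor = SEVERITY_RANK.get(severity_floor, SEVERITY_RANK["moderate"])
    known = {(entry["rule"], entry["selector"]) for entry in baseline}
    seen = set()
    findings = []
    for violation in violations:
        key = (violation["rule"], violation["selector"])
        severity = violation.get("severity")
        # Unknown severities are treated as `moderate`, never as weaker.
        if not isinstance(severity, str) or severity not in SEVERITY_RANK:
            severity = "moderate"
        if key in known:                          # pre-existing: informational
            continue
        if SEVERITY_RANK[severity] < floor:       # below the severity floor
            continue
        if violation["rule"] in accepted_rules:   # accepted by the user
            continue
        if key in seen:                           # dedupe by (rule, selector)
            continue
        seen.add(key)
        findings.append({
            "kind": "a11y_violation",
            "rule": violation["rule"],
            "selector": violation["selector"],
            "severity": severity,
        })
    return findings


violations = [
    {"rule": "color-contrast", "selector": "#a", "severity": "serious"},
    {"rule": "label", "selector": "#b", "severity": "minor"},
    {"rule": "image-alt", "selector": "img.hero", "severity": "weird"},
    {"rule": "image-alt", "selector": "img.hero", "severity": "critical"},
    {"rule": "aria-role", "selector": "#c", "severity": "critical"},
]
baseline = [{"rule": "color-contrast", "selector": "#a"}]
result = a11y_findings(violations, baseline, accepted_rules={"aria-role"})
print(result)
```

In this example only the first `image-alt` entry survives, with its unknown severity normalised to `moderate`; a non-empty result is what would force `review_clean` to `False`.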
+**R4 — preview envelope** (the engine never renders).
+`_apply_preview_gate` reads `state.ui_review.preview`. Shape:
+`render_ok: bool`, optional `screenshot_path`, `dom_dump_path`,
+`error`, `skipped`. `render_ok: False` halts with
+`preview_render_failed` so the user picks retry / skip / abort; Skip
+flips `state.ui_review.preview.skipped = true` and the gate becomes a
+no-op on re-entry. `render_ok: True` with `screenshot_path` set
+threads the path into the delivery report's `artifacts` list. The
+gate is independent of the a11y gate; both can fire on the same pass.
+### `verify` → polish
+Bounded fix loop. Base ceiling: `POLISH_CEILING = 2` rounds. R4
+splits termination into **subjective** and **objective** branches:
+the subjective `polish_ceiling_reached` halt only fires when the
+remaining findings are non-a11y; objective a11y violations take the
+explicit `polish_a11y_blocking` branch with its own option set.
+| `review_clean` | `rounds` | Remaining findings | Behaviour |
+|---|---|---|---|
+| `True` | any | — | `SUCCESS` — advance to report |
+| `False` | `< effective_ceiling` | any | `BLOCKED` + `@agent-directive: ui-polish-<stack>`; skill applies fixes, re-runs review, increments `rounds` |
+| `False` | `== effective_ceiling` | contains `a11y_violation` | `BLOCKED` numbered options: extend (one extra round, sets `extension_used = True`; option disappears once spent) / accept (appends rule ids to `state.ui_review.a11y.accepted_violations`, then continues) / abort |
+| `False` | `== effective_ceiling` | non-a11y only | `BLOCKED` numbered options: ship as-is / abort / hand off |
+`effective_ceiling = POLISH_CEILING + 1` once
+`state.ui_polish.extension_used` is set; the schema validator widens
+the upper bound from `[0, 2]` to `[0, 3]` only when the flag is
+`True`, so the ceiling holds across in-memory state, on-disk state,
+and the dispatcher. `rounds > 3` is rejected unconditionally, even
+with the extension flag.
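The termination table and the effective ceiling can be condensed into a single decision function. This is a sketch, not the dispatcher's code: `polish_outcome` is a hypothetical name, and mapping each table row to one of the declared ambiguity codes is an assumption.

```python
POLISH_CEILING = 2


def effective_ceiling(extension_used: bool) -> int:
    """Base ceiling, widened by one round once the extension is spent."""
    return POLISH_CEILING + 1 if extension_used else POLISH_CEILING


def polish_outcome(review_clean, rounds, findings, extension_used=False):
    """Return 'success' or the halt code for the current polish state."""
    ceiling = effective_ceiling(extension_used)
    if not 0 <= rounds <= ceiling:
        raise ValueError(f"rounds={rounds} outside [0, {ceiling}]")
    if review_clean:
        return "success"
    if rounds < ceiling:
        return "polish_round_pending"
    if any(f.get("kind") == "a11y_violation" for f in findings):
        return "polish_a11y_blocking"
    return "polish_ceiling_reached"


a11y = [{"kind": "a11y_violation", "rule": "image-alt"}]
print(polish_outcome(False, 2, a11y))                       # polish_a11y_blocking
print(polish_outcome(False, 2, a11y, extension_used=True))  # polish_round_pending
```

With the extension flag set, the same state that was blocking at round 2 gets one more round, and `rounds = 3` becomes valid only in that case.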
+**Idempotent re-entry on Accept.** A `state.ui_review.a11y.accepted_violations`
+list with rule ids matching the remaining a11y findings round-trips
+through `SUCCESS` because the review gate's `_apply_a11y_gate`
+filters accepted entries before synthesising `a11y_violation`
+findings. The Accept branch and the Ship-as-is branch are therefore
+asymmetric: Ship-as-is flips `review_clean` directly; Accept records
+explicit rule ids so replay reproduces the same gate decision.
+**Token-violation extraction.** Findings with
+`kind == "token_violation"` carry `category` and `value`. Polish
+classifies them against `state.ui_audit.design_tokens`:
+- Matched value → fix uses the named token; counted as a regular round.
+- Unmatched value repeated `> TOKEN_REPEAT_THRESHOLD` (`2`) times →
+  emits `polish_token_extraction_pending`: extract the value to a new
+  token before the next round runs. One-off unmatched values stay
+  inline.
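The matched / extract / inline split could be computed along these lines. The sketch assumes `design_tokens` has the shape `{category: {token_name: value}}`, which the text does not specify, and `classify_token_violations` is a hypothetical name; values repeated up to the threshold are treated as staying inline.

```python
from collections import Counter

TOKEN_REPEAT_THRESHOLD = 2


def classify_token_violations(findings, design_tokens):
    """Split token violations into matched, to-extract, and inline groups."""
    matched = []
    unmatched = Counter()
    for finding in findings:
        if finding.get("kind") != "token_violation":
            continue
        category, value = finding["category"], finding["value"]
        tokens = design_tokens.get(category, {})
        name = next((n for n, v in tokens.items() if v == value), None)
        if name is not None:
            matched.append((category, value, name))
        else:
            unmatched[(category, value)] += 1
    # Only values repeated more than the threshold trigger extraction.
    extract = sorted(k for k, n in unmatched.items() if n > TOKEN_REPEAT_THRESHOLD)
    inline = sorted(k for k, n in unmatched.items() if n <= TOKEN_REPEAT_THRESHOLD)
    return {"matched": matched, "extract": extract, "inline": inline}


red = {"kind": "token_violation", "category": "color", "value": "#ff0000"}
findings = [
    red, red, red,
    {"kind": "token_violation", "category": "spacing", "value": "13px"},
    {"kind": "token_violation", "category": "color", "value": "#111827"},
    {"kind": "layout_break"},
]
tokens = {"color": {"gray-900": "#111827"}}
summary = classify_token_violations(findings, tokens)
print(summary["extract"])  # [('color', '#ff0000')]
```

A non-empty `extract` list corresponds to the `polish_token_extraction_pending` case described above.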
+Stack-directive table mirrors apply / review with prefix
+`ui-polish-`. `DEFAULT_DIRECTIVE = "ui-polish-plain"`.
+### `report` → backend renderer
+Re-export of
+[`directives.backend.report.run`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/backend/report.py).
+The renderer is pure and state-driven; the same Markdown contract
+serves both tracks.
+## Halt budget — happy path
+`ui` set, fresh state, audit + design pass cleanly:
+1. **Audit pick** — first-pass `existing-ui-audit` directive halt.
+2. **Design sign-off** — `design_confirmed` numbered-options halt.
+Two user halts. Apply / review / polish all run silently when their
+producers write clean envelopes on the first attempt. GT-U1..U4 pin
+this budget.
+Additional halts surface only on real ambiguity:
+greenfield-undecided (+1), shadcn-version-mismatch (+1), audit-ambiguous
+(+1), placeholder rejection (+N until microcopy is fixed),
+polish round (+1 per dirty review, capped at the effective ceiling),
+polish ceiling — subjective (+1 when both rounds fail and remaining
+findings are non-a11y) **or** a11y-blocking (+1 when remaining
+findings include `a11y_violation` entries; the Extend option grants
+one extra round, then disappears),
+preview render failure (+1 when `state.ui_review.preview.render_ok`
+is `False`; user picks retry / skip / abort),
+review a11y pending (+1 when an `a11y_baseline` exists but the review
+skill has not yet written `state.ui_review.a11y`).
+## The `ui-trivial` set — short-circuit path
+For provably bounded edits (single class swap, copy tweak, one-prop
+adjustment). Phase-1 intent classifier writes
+`directive_set = "ui-trivial"`.
+Hard preconditions in
+[`directives/ui_trivial/apply.py`](../../.agent-src.uncompressed/templates/scripts/work_engine/directives/ui_trivial/apply.py):
+- `MAX_FILES = 1` — exactly one file touched.
+- `MAX_LINES_CHANGED = 5` — diff stays under five changed lines.
+Violation flips `state.directive_set` to `ui` (the full audit gate)
+and the dispatcher restarts. The trivial path **never** silently
+swallows scope creep.
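The reclassification rule can be sketched as a single guard. `next_directive_set` is a hypothetical name, and treating both constants as inclusive upper bounds is an assumption; the real check in `apply.py` may use different comparisons.

```python
MAX_FILES = 1
MAX_LINES_CHANGED = 5


def next_directive_set(files_touched: list[str], lines_changed: int) -> str:
    """Stay on `ui-trivial` within the limits, otherwise escalate to `ui`."""
    # Assumption: both limits are inclusive upper bounds.
    within_limits = (
        len(files_touched) <= MAX_FILES and lines_changed <= MAX_LINES_CHANGED
    )
    return "ui-trivial" if within_limits else "ui"


print(next_directive_set(["button.blade.php"], 3))               # ui-trivial
print(next_directive_set(["button.blade.php", "card.vue"], 2))   # ui
```

Returning the full `ui` set rather than raising keeps the escalation explicit: the dispatcher restarts on the audit gate instead of failing the run.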
+Skipped slots (`memory`, `analyze`, `plan`, `verify`) share
+`_skipped.run` — they record `success` without work so the dispatcher
+completeness check is satisfied. `report` renders a one-line summary
+instead of the full delivery report.
+## The `mixed` set — contract + UI + stitch
+Used when a single input touches both layers. Slot mapping:
+| Slot | Handler | Purpose |
+|---|---|---|
+| `refine`, `memory`, `analyze`, `verify`, `report` | reused from `backend` | Same handlers, by reference |
+| `plan` | `mixed.contract` | Lock `data_model` + `api_surface` |
+| `implement` | `mixed.ui` | Delegate to UI sub-flow |
+| `test` | `mixed.stitch` | End-to-end smoke scenarios |
+**Sentinels** that release each mixed gate:
+- `state.contract.contract_confirmed = True` — UI sub-flow refuses to
+  start without it (defense-in-depth even if `outcomes["plan"] ==
+  "success"`). Required keys: `data_model`, `api_surface`
+  (`REQUIRED_CONTRACT_KEYS`).
+- `state.ui_review.review_clean = True` — mixed `ui` step's success
+  condition. Polish-ceiling semantics live in the UI track; if the
+  user reaches mixed.ui's "ship as-is / hand off / abort" halt, the
+  UI track has already given up.
+- `state.stitch.verdict = "success"` — stitch's success condition.
+  `blocked` / `partial` halts with three numbered options unless
+  `state.stitch.integration_confirmed = True` (explicit user override).
+`stitch` emits `@agent-directive: integration-test` so an
+agent-side handler runs the end-to-end smokes; `mixed.ui` emits
+`@agent-directive: ui-track` to delegate the visible-surface work
+back into the full UI directive set.
+## Idempotency and replay
+Every UI step is idempotent on its sentinel:
+| Step | Sentinel | Effect on replay |
+|---|---|---|
+| audit | `audit_path` ∈ `{"high_confidence", "ambiguous", "greenfield"}` | `SUCCESS` without halt |
+| design | `design_confirmed = True` | `SUCCESS` without halt |
+| apply | `state.ticket["ui_apply"]` well-formed, no placeholders | `SUCCESS`, changes recorded once |
+| review | well-formed envelope (`findings` list + `review_clean` bool) | `SUCCESS` |
+| polish | `review_clean = True` (any round count) | `SUCCESS` |
+| contract | `contract_confirmed = True` | `SUCCESS` |
+| stitch | `verdict = "success"` OR `integration_confirmed = True` | `SUCCESS` |
+The dispatcher walks the same eight slots on every replay; sentinels
+are the only thing keeping a re-run from re-asking a question the
+user already answered. Replay coverage is locked by the
+Golden-Transcript suite under `tests/golden/baseline/GT-U*/`.
+## Declared ambiguity surfaces
+Each step re-exports an `AMBIGUITIES: tuple[dict[str, str], ...]`
+constant. The
+[`test_ambiguity_coverage.py`](../../tests/implement_ticket/test_ambiguity_coverage.py)
+suite asserts every `BLOCKED` path has a matching declaration.
+| Step | Codes |
+|---|---|
+| `audit` | `audit_missing`, `greenfield_undecided`, `shadcn_version_mismatch`, `audit_ambiguous` |
+| `design` | `design_missing`, `design_placeholders`, `design_unconfirmed` |
+| `apply` | `apply_envelope_missing`, `apply_placeholders_in_output` |
+| `review` | `review_envelope_missing`, `review_findings_missing`, `review_clean_missing`, `review_a11y_pending`, `preview_render_failed` |
+| `polish` | `polish_round_pending`, `polish_ceiling_reached`, `polish_a11y_blocking`, `polish_token_extraction_pending` |
+| `contract` (mixed) | `upstream_analyze_failed`, `contract_missing`, `contract_incomplete`, `contract_unconfirmed` |
+| `mixed.ui` | `contract_sentinel_missing`, `ui_subflow_missing`, `ui_subflow_dirty` |
+| `stitch` | `upstream_ui_failed`, `stitch_missing`, `stitch_malformed`, `stitch_verdict_unsuccessful` |
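Reviewer sketch (not part of the package diff): the coverage check described above amounts to a set difference between the codes a step can return on a `BLOCKED` path and the codes it declares. The snippet below illustrates that idea for the `design` row. The dict keys `code` and `description`, the description strings, and the helper `undeclared_codes` are assumptions made for illustration; the diff only states the constant's type, `tuple[dict[str, str], ...]`.

```python
from typing import Iterable

# Assumed entry shape: the source gives only the type, not the keys.
AMBIGUITIES: tuple[dict[str, str], ...] = (
    {"code": "design_missing", "description": "no design brief in state"},
    {"code": "design_placeholders", "description": "brief still has placeholders"},
    {"code": "design_unconfirmed", "description": "user has not confirmed the brief"},
)


def undeclared_codes(
    declared: tuple[dict[str, str], ...],
    blocked_codes: Iterable[str],
) -> set[str]:
    """Return BLOCKED codes that have no matching declaration."""
    declared_codes = {entry["code"] for entry in declared}
    return set(blocked_codes) - declared_codes
```

A test in this style would collect the codes each step can emit and assert that `undeclared_codes(...)` is empty, so a new `BLOCKED` path cannot ship without a declaration.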
+## See also
+- [`implement-ticket-flow.md`](implement-ticket-flow.md) — sibling
+  contract for the `backend` set; covers `DeliveryState`, schema v1,
+  hooks, persona policies, replay protocol.
+- [`road-to-product-ui-track.md`](../../agents/roadmaps/road-to-product-ui-track.md)
+  — phased delivery and Golden-Transcript matrix.
+- [`road-to-product-ui-track-followup.md`](../../agents/roadmaps/archive/road-to-product-ui-track-followup.md)
+  — pinned GT-U5 (mixed flow), GT-U6A/B (stack dispatch), GT-U7
+  (trivial happy path), GT-U8 (trivial reclassification).
+- [`road-to-visual-review-loop.md`](../../agents/roadmaps/road-to-visual-review-loop.md)
+  — R4 contract: a11y gate, preview envelope, polish-termination
+  rewrite. Pinned by GT-U13 (a11y polish), GT-U14 (a11y ceiling),
+  GT-U15 (preview render failure).
+- [`existing-ui-audit` SKILL](../../.agent-src.uncompressed/skills/existing-ui-audit/SKILL.md)
+  — producer of `state.ui_audit`.
+- [`ui-audit-before-build` rule](../../.agent-src.uncompressed/rules/ui-audit-before-build.md)
+  — the always-on rule that mirrors the audit gate at the agent layer.

package/docs/getting-started.md CHANGED

@@ -99,7 +99,7 @@ Your agent is now:
 - **Respecting your codebase** — no conflicting patterns
 - **Following standards** — consistent code quality
-This is enforced automatically by 53 rules. No configuration needed.
+This is enforced automatically by 55 rules. No configuration needed.
 ---
@@ -165,7 +165,7 @@ Run `/chat-history-resume` to walk through the prompts explicitly, or
 let the agent ask on the first turn of a new chat. All merge/replace/
 resume paths read the on-disk entries into context before any write.
-See the [`chat-history` rule](../.agent-src/rules/chat-history.md) and
+See the [`chat-history` rule](../.agent-src/rules/chat-history-ownership.md) and
 [`scripts/chat_history.py`](../scripts/chat_history.py) for the mechanics.
 ---

package/docs/installation.md CHANGED

@@ -23,6 +23,23 @@ No Task, no Make, no build tools required for installation.
 | **Project-installed** (recommended) | Teams, shared standards | Repository-wide |
 | **Plugin-installed** | Individual users, global use | User-wide |
+> **All paths on this page are still supported.** The labels
+> (`advanced` / `experimental` / `staged`) describe how prominent the
+> path is in our recommendation order, not its support status.
+> Composer + npm are the default; everything else stays shipped and
+> tested. Nothing on this page is being removed in 1.15.0 — the
+> reorder simply marks which paths get the most maintenance attention
+> and which we keep as fallbacks. See R9 in
+> [`agents/roadmaps/archive/road-to-post-pr29-optimize.md`](../agents/roadmaps/archive/road-to-post-pr29-optimize.md)
+> for the rationale.
+| Label | Meaning | Examples |
+|---|---|---|
+| (no label) | Primary path — first-class, fully supported | Composer, npm, Augment / Claude Code / Copilot CLI plugins |
+| `advanced` | Supported fallback — works, expects familiarity with the toolchain | Git submodule, manual clone, VS Code Git URL |
+| `experimental` | Shipped but evolving — interface may shift between minor releases | Claude.ai Web Skills UI |
+| `staged` | Shipped, narrow surface area — kept for users who already use the platform | Linear AI workspace guidance |
 ---
 ## Project-installed mode (recommended for teams)
@@ -210,7 +227,16 @@ These channels are **additional** to project- and plugin-installed
 modes; use them when the agent loop runs on the platform's servers,
 not on your machine.
-### Claude.ai Web (Skills UI)
+> Both cloud channels remain shipped and tested. The labels reflect
+> recommendation prominence, not support status — see the label table
+> at the top of this page.
+### Claude.ai Web (Skills UI) — `experimental`
+> `experimental` — shipped, still tested, but the upload surface and
+> bundle format may shift between minor releases as Claude.ai's Skills
+> UI evolves. Pin to a release tag if you depend on a specific bundle
+> shape.
 Claude.ai Web supports Skills via manual ZIP upload through the Skills
 UI. The package builds one ZIP per cloud-eligible skill.
@@ -238,7 +264,12 @@ UI. The package builds one ZIP per cloud-eligible skill.
 3. **Verify** — open a fresh Claude.ai conversation and confirm the
    skill appears in the Skills picker.
-### Linear AI (Codegen, Charlie, …)
+### Linear AI (Codegen, Charlie, …) — `staged`
+> `staged` — shipped, narrow surface area, kept primarily for users
+> already operating inside Linear. Iteration cadence is slower than
+> the project- and plugin-installed paths; major changes land first
+> on Composer + npm and propagate to the Linear digest in a follow-up.
 Linear AI agents read free-form guidance from Linear's workspace
 settings; there is no plugin or upload mechanism. The package ships
@@ -264,16 +295,21 @@ the matching Linear field.
    - Leave `personal.md` empty unless you have personal overrides
 3. **Per-layer rationale** — see
-   [`agents/contexts/linear-ai-three-layers.md`](../agents/contexts/linear-ai-three-layers.md)
+   [`docs/contracts/linear-ai-three-layers.md`](contracts/linear-ai-three-layers.md)
    for the split rationale and
-   [`agents/contexts/linear-ai-rules-inclusion.md`](../agents/contexts/linear-ai-rules-inclusion.md)
+   [`docs/contracts/linear-ai-rules-inclusion.md`](contracts/linear-ai-rules-inclusion.md)
    for which rules go where.
 ---
-## Alternative install methods
+## Alternative install methods — `advanced`
-These are fallbacks when the recommended paths above don't work.
+> `advanced` — supported fallbacks for users comfortable driving the
+> orchestrator directly. They share the same `scripts/install` entry
+> point as Composer and npm; the only difference is how the package
+> source ends up on disk. Pick these when you cannot use Composer or
+> npm (e.g. a polyglot repo without either, or a CI runner that
+> already vendors the package via submodule).
 ### Git Submodule