aw-ecc 1.4.21 → 1.4.25

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (83) hide show
  1. package/.claude-plugin/plugin.json +1 -1
  2. package/.codex/hooks/aw-post-tool-use.sh +1 -0
  3. package/.codex/hooks/aw-pre-tool-use.sh +1 -0
  4. package/.codex/hooks/aw-session-start.sh +4 -0
  5. package/.codex/hooks/aw-stop.sh +1 -0
  6. package/.codex/hooks/aw-user-prompt-submit.sh +1 -0
  7. package/.cursor/INSTALL.md +9 -0
  8. package/.cursor/hooks/adapter.js +34 -3
  9. package/.cursor/hooks/aw-phase-definitions.js +11 -0
  10. package/.cursor/hooks/before-submit-prompt.js +1 -1
  11. package/.cursor/hooks/before-submit-prompt.sh +17 -0
  12. package/.cursor/hooks/session-start.sh +36 -0
  13. package/.cursor/hooks/shared/aw-phase-definitions.js +1 -19
  14. package/.cursor/hooks/shared/aw-phase-runner.js +38 -2
  15. package/.cursor/hooks/shared/session-start.sh +4 -0
  16. package/.cursor/hooks/shared/user-prompt-submit.sh +33 -140
  17. package/.cursor/hooks.json +15 -15
  18. package/.cursor/rules/common-aw-routing.md +43 -0
  19. package/.cursor/skills/api-and-interface-design/SKILL.md +75 -0
  20. package/.cursor/skills/aw-brainstorm/SKILL.md +115 -0
  21. package/.cursor/skills/aw-build/SKILL.md +152 -0
  22. package/.cursor/skills/aw-build/evals/build-stage-cases.json +28 -0
  23. package/.cursor/skills/aw-debug/SKILL.md +49 -0
  24. package/.cursor/skills/aw-deploy/SKILL.md +101 -0
  25. package/.cursor/skills/aw-deploy/evals/deploy-stage-cases.json +32 -0
  26. package/.cursor/skills/aw-execute/SKILL.md +47 -0
  27. package/.cursor/skills/aw-execute/references/mode-code.md +47 -0
  28. package/.cursor/skills/aw-execute/references/mode-docs.md +28 -0
  29. package/.cursor/skills/aw-execute/references/mode-infra.md +44 -0
  30. package/.cursor/skills/aw-execute/references/mode-migration.md +58 -0
  31. package/.cursor/skills/aw-execute/references/worker-implementer.md +26 -0
  32. package/.cursor/skills/aw-execute/references/worker-parallel-worker.md +23 -0
  33. package/.cursor/skills/aw-execute/references/worker-quality-reviewer.md +23 -0
  34. package/.cursor/skills/aw-execute/references/worker-spec-reviewer.md +23 -0
  35. package/.cursor/skills/aw-execute/scripts/build-worker-bundle.js +229 -0
  36. package/.cursor/skills/aw-finish/SKILL.md +111 -0
  37. package/.cursor/skills/aw-investigate/SKILL.md +109 -0
  38. package/.cursor/skills/aw-plan/SKILL.md +368 -0
  39. package/.cursor/skills/aw-prepare/SKILL.md +118 -0
  40. package/.cursor/skills/aw-review/SKILL.md +118 -0
  41. package/.cursor/skills/aw-ship/SKILL.md +115 -0
  42. package/.cursor/skills/aw-spec/SKILL.md +104 -0
  43. package/.cursor/skills/aw-tasks/SKILL.md +138 -0
  44. package/.cursor/skills/aw-test/SKILL.md +118 -0
  45. package/.cursor/skills/aw-verify/SKILL.md +51 -0
  46. package/.cursor/skills/aw-yolo/SKILL.md +111 -0
  47. package/.cursor/skills/browser-testing-with-devtools/SKILL.md +81 -0
  48. package/.cursor/skills/ci-cd-and-automation/SKILL.md +71 -0
  49. package/.cursor/skills/code-simplification/SKILL.md +74 -0
  50. package/.cursor/skills/context-engineering/SKILL.md +74 -0
  51. package/.cursor/skills/deprecation-and-migration/SKILL.md +75 -0
  52. package/.cursor/skills/documentation-and-adrs/SKILL.md +75 -0
  53. package/.cursor/skills/frontend-ui-engineering/SKILL.md +68 -0
  54. package/.cursor/skills/git-workflow-and-versioning/SKILL.md +75 -0
  55. package/.cursor/skills/idea-refine/SKILL.md +84 -0
  56. package/.cursor/skills/incremental-implementation/SKILL.md +75 -0
  57. package/.cursor/skills/performance-optimization/SKILL.md +77 -0
  58. package/.cursor/skills/security-and-hardening/SKILL.md +70 -0
  59. package/.cursor/skills/using-aw-skills/SKILL.md +290 -0
  60. package/.cursor/skills/using-aw-skills/evals/skill-trigger-cases.tsv +25 -0
  61. package/.cursor/skills/using-aw-skills/evals/test-skill-triggers.sh +171 -0
  62. package/.cursor/skills/using-aw-skills/hooks/hooks.json +9 -0
  63. package/.cursor/skills/using-aw-skills/hooks/session-start.sh +67 -0
  64. package/.cursor/skills/using-platform-skills/SKILL.md +163 -0
  65. package/.cursor/skills/using-platform-skills/evals/platform-selection-cases.json +52 -0
  66. package/.opencode/package.json +1 -1
  67. package/package.json +3 -1
  68. package/scripts/cursor-aw-home/hooks.json +15 -15
  69. package/scripts/cursor-aw-hooks/adapter.js +34 -3
  70. package/scripts/cursor-aw-hooks/aw-phase-definitions.js +11 -0
  71. package/scripts/cursor-aw-hooks/before-submit-prompt.js +1 -1
  72. package/scripts/cursor-aw-hooks/before-submit-prompt.sh +17 -0
  73. package/scripts/cursor-aw-hooks/session-start.sh +36 -0
  74. package/scripts/hooks/session-start-rules-context.sh +8 -1
  75. package/scripts/hooks/shared/aw-phase-definitions.js +1 -19
  76. package/scripts/hooks/shared/aw-phase-runner.js +38 -2
  77. package/scripts/hooks/shared/session-start.sh +4 -0
  78. package/scripts/hooks/shared/user-prompt-submit.sh +33 -140
  79. package/scripts/lib/cursor-aw-hook-files.js +2 -0
  80. package/scripts/lib/cursor-hook-config.js +47 -15
  81. package/scripts/lib/install-executor.js +7 -0
  82. package/scripts/lib/install-targets/cursor-project.js +7 -0
  83. package/skills/using-aw-skills/hooks/session-start.sh +4 -0
@@ -0,0 +1,118 @@
1
+ ---
2
+ name: aw-review
3
+ description: Review work for findings, governance, and readiness using explicit severity, org-standard playbooks, and fresh evidence.
4
+ trigger: User requests review, readiness, or PR governance, or a legacy verify request resolves to the review stage.
5
+ ---
6
+
7
+ # AW Review
8
+
9
+ ## Overview
10
+
11
+ `aw-review` is now a public stage skill.
12
+ It turns evidence into findings, release decisions, and clear next actions.
13
+
14
+ ## When to Use
15
+
16
+ - the request is findings-oriented review
17
+ - PR governance or readiness matters
18
+ - the work needs a release recommendation
19
+ - a legacy `/aw:verify` request is mostly about review
20
+
21
+ Do not use as a hidden helper that replaces explicit review intent.
22
+
23
+ ## Workflow
24
+
25
+ 1. Review the evidence first.
26
+ Start with tests, local validation, E2E, runtime proof, and prior investigation artifacts.
27
+ 2. Load the right org-standard playbooks.
28
+ Pull in platform review, design, accessibility, and quality-gate playbooks from the resolved baseline.
29
+ 3. Review across the five core axes.
30
+ Correctness, readability and simplicity, architecture, security, and performance.
31
+ Use `../../references/review-findings-severity.md`.
32
+ For readability and maintainability concerns, load `code-simplification`.
33
+ For public contract or boundary changes, load `api-and-interface-design`.
34
+ For security-sensitive work, load `security-and-hardening` and `../../references/security-checklist.md`.
35
+ For performance-sensitive work, load `performance-optimization` and `../../references/performance-checklist.md`.
36
+ For branch hygiene, save-point quality, or reviewability concerns, load `git-workflow-and-versioning`.
37
+ 4. Classify findings explicitly.
38
+ Separate blocking findings from advisory notes.
39
+ Name evidence, scope, and required fix.
40
+ 5. Continue until the requested review scope is covered.
41
+ Do not stop after the first finding or the first review axis if correctness, governance, architecture, security, performance, or readiness checks still remain.
42
+ 6. Check governance and readiness.
43
+ Confirm PR checklist, approvals, status checks, rollback notes, and release recommendation.
44
+ For architecture or public-behavior changes that need durable rationale, load `documentation-and-adrs`.
45
+ 7. Request fresh testing when needed.
46
+ If evidence is stale, missing, or too broad, route back to `aw-test` for the smallest targeted rerun.
47
+ 8. Persist the result.
48
+ Write `verification.md` and update `state.json`.
49
+
50
+ ## Completion Contract
51
+
52
+ Review is complete only when one of these is true:
53
+
54
+ - the requested findings, governance, and readiness scope is covered with current evidence
55
+ - the work fails review and the repair path is explicit
56
+ - a blocker prevents a trustworthy decision and that blocker is named
57
+
58
+ Every review handoff must make these things obvious:
59
+
60
+ - which evidence was reviewed
61
+ - which findings are blocking versus advisory
62
+ - which governance checks were completed
63
+ - what the readiness outcome is
64
+ - which exact next command should run next
65
+
66
+ ## Common Rationalizations
67
+
68
+ | Rationalization | Reality |
69
+ |---|---|
70
+ | "The tests passed, so review is basically done." | Tests are evidence, not the whole decision. |
71
+ | "This looks fine overall." | Findings need explicit severity, scope, and evidence. |
72
+ | "I can clear the finding from memory." | Resolutions require fresh evidence against the original issue. |
73
+
74
+ ## Red Flags
75
+
76
+ - no explicit severity language
77
+ - PR governance is implied instead of checked
78
+ - platform review or design playbooks are skipped for applicable work
79
+ - stale test evidence is reused after repairs
80
+
81
+ ## State File
82
+
83
+ `state.json` should record at least:
84
+
85
+ - `feature_slug`
86
+ - `stage: "review"`
87
+ - `mode`
88
+ - `status`
89
+ - written artifacts
90
+ - evidence reviewed
91
+ - blocking findings
92
+ - advisory notes
93
+ - governance status
94
+ - readiness outcome
95
+ - blockers
96
+ - recommended next commands
97
+
98
+ ## Verification
99
+
100
+ Before leaving review, confirm:
101
+
102
+ - [ ] test and runtime evidence were reviewed first
103
+ - [ ] findings are explicit, evidence-backed, and severity-tagged
104
+ - [ ] governance and readiness checks match the resolved baseline
105
+ - [ ] repairs point back to build or test with clear scope
106
+ - [ ] `verification.md` and `state.json` are updated
107
+
108
+ ## Final Output Shape
109
+
110
+ Always end with:
111
+
112
+ - `Mode`
113
+ - `Evidence`
114
+ - `Findings`
115
+ - `Governance`
116
+ - `Readiness`
117
+ - `Outcome`
118
+ - `Next`
@@ -0,0 +1,115 @@
1
+ ---
2
+ name: aw-ship
3
+ description: Launch, rollout, rollback readiness, and release closeout skill for work that already has the required build, test, review, and deploy context.
4
+ trigger: User requests launch readiness, rollout management, or release closeout after deploy or during final release preparation.
5
+ ---
6
+
7
+ # AW Ship
8
+
9
+ ## Overview
10
+
11
+ `aw-ship` is now the shipping stage, not the composite "do everything" workflow.
12
+ It owns launch safety, rollback readiness, monitoring posture, and release closeout.
13
+
14
+ ## When to Use
15
+
16
+ - launch readiness is the goal
17
+ - rollout checkpoints and monitoring need to be named
18
+ - the release action already happened and needs closeout
19
+ - rollback confidence must be documented before claiming readiness
20
+
21
+ Do not use for end-to-end orchestration.
22
+ That belongs to `aw-yolo`.
23
+
24
+ ## Preparation Gate
25
+
26
+ Before risky shipping work, call `aw-prepare` as the internal preparation gate when any of these are unclear:
27
+
28
+ - release context
29
+ - artifact readiness
30
+ - workspace isolation or repo cleanliness
31
+ - whether the current request is really shipping versus deploy or review
32
+
33
+ Do not guess through missing release inputs when `aw-prepare` should be used first.
34
+
35
+ ## Workflow
36
+
37
+ 1. Load the release context.
38
+ Start from deploy evidence, review status, known risks, and rollout intent.
39
+ If release inputs or workspace readiness are unclear, run the internal `aw-prepare` gate first instead of guessing.
40
+ 2. Apply the launch checklist.
41
+ Use `../../references/ship-launch-checklist.md`.
42
+ Load `ci-cd-and-automation` when release automation, staged rollout, or rollback machinery is part of the ship decision.
43
+ 3. Continue until the selected shipping scope is covered.
44
+ Do not stop after one checklist item, one smoke result, or one monitoring note if the requested launch-readiness, rollout, or closeout scope still has open gaps.
45
+ 4. Confirm rollback posture.
46
+ If rollback is unclear, do not claim launch readiness.
47
+ 5. Capture monitoring and follow-through.
48
+ Name health checks, monitoring links, smoke results, and ownership.
49
+ For release notes, runbooks, or durable decision capture, load `documentation-and-adrs`.
50
+ 6. Close out the release.
51
+ Update `release.md` and `state.json` with launch or blocker notes.
52
+
53
+ ## Completion Contract
54
+
55
+ Shipping is complete only when one of these is true:
56
+
57
+ - launch readiness, rollout, or closeout scope is covered clearly enough for operations
58
+ - a launch blocker prevents safe shipping and that blocker is named clearly
59
+
60
+ Every shipping handoff must make these things obvious:
61
+
62
+ - the current launch readiness decision
63
+ - the rollout or monitoring plan
64
+ - the rollback path or rollback blocker
65
+ - the operational evidence captured
66
+ - which exact next command should run next, if any
67
+
68
+ ## Common Rationalizations
69
+
70
+ | Rationalization | Reality |
71
+ |---|---|
72
+ | "Deploy already proved we're safe to ship." | Execution success is not the same as rollout safety. |
73
+ | "Rollback can be figured out later." | No rollback story means no real launch confidence. |
74
+ | "Ship should still mean the whole pipeline." | Overloading ship is what made the old model confusing. |
75
+
76
+ ## Red Flags
77
+
78
+ - no rollback plan or rollback blocker is named
79
+ - no monitoring or smoke evidence exists for a risky release
80
+ - ship is used as a synonym for end-to-end automation
81
+
82
+ ## State File
83
+
84
+ `state.json` should record at least:
85
+
86
+ - `feature_slug`
87
+ - `stage: "ship"`
88
+ - `mode`
89
+ - `status`
90
+ - written artifacts
91
+ - launch readiness
92
+ - rollout plan
93
+ - rollback path
94
+ - evidence captured
95
+ - blockers
96
+ - recommended next commands
97
+
98
+ ## Verification
99
+
100
+ - [ ] launch checklist or blocker is explicit
101
+ - [ ] rollback readiness is documented
102
+ - [ ] monitoring and smoke expectations are named
103
+ - [ ] `release.md` and `state.json` are updated with closeout evidence
104
+
105
+ ## Final Output Shape
106
+
107
+ Always end with:
108
+
109
+ - `Mode`
110
+ - `Launch Readiness`
111
+ - `Rollout Plan`
112
+ - `Rollback Path`
113
+ - `Evidence`
114
+ - `Outcome`
115
+ - `Next`
@@ -0,0 +1,104 @@
1
+ ---
2
+ name: aw-spec
3
+ description: Internal spec helper that turns an approved direction into a deterministic technical spec without writing implementation tasks or code.
4
+ trigger: Internal only. Invoked by aw-plan after discovery is approved or when a technical contract must be written before task planning.
5
+ ---
6
+
7
+ # AW Spec
8
+
9
+ ## Overview
10
+
11
+ `aw-spec` owns the technical contract layer inside AW planning.
12
+ It turns an approved direction into `.aw_docs/features/<feature_slug>/spec.md` without collapsing directly into implementation tasks or code.
13
+
14
+ The public planning route remains `/aw:plan`.
15
+
16
+ ## When to Use
17
+
18
+ - discovery or planning has already approved the direction
19
+ - the request needs a technical contract before task breakdown
20
+ - an existing spec is vague, incomplete, or inconsistent
21
+ - the work spans interfaces, file boundaries, rollout concerns, or migration risk
22
+
23
+ Do not use when execution-ready tasks already exist or when the request is still fundamentally a product or design question.
24
+
25
+ ## Workflow
26
+
27
+ This legacy heading maps to the gated workflow below.
28
+
29
+ ## The Gated Workflow
30
+
31
+ 1. Enter technical planning mode.
32
+ Read the approved direction, relevant code paths, and the smallest set of architecture context needed.
33
+ Do not write code while authoring the spec.
34
+ 2. Check the scope shape.
35
+ If the request spans multiple independent subsystems, decompose it before writing one giant spec.
36
+ 3. Define the stable contract.
37
+ Name interfaces, boundaries, file responsibilities, rollout constraints, and failure modes.
38
+ Use `../../references/interface-stability.md` when an API or contract is changing.
39
+ 4. Write `spec.md` for a fresh planner or builder.
40
+ Make it specific enough that `aw-tasks` can proceed without rediscovering the architecture.
41
+ 5. Run a fast review pass.
42
+ Fix placeholders, contradictions, scope drift, and ambiguous names before handoff.
43
+ 6. Update state and hand off.
44
+ Update `.aw_docs/features/<feature_slug>/state.json` and hand the approved spec to `aw-tasks`.
45
+
46
+ ## Required `spec.md` Content
47
+
48
+ Capture at least:
49
+
50
+ - implementation goal
51
+ - scope and non-goals
52
+ - architecture or technical approach
53
+ - interfaces, contracts, or integration points
54
+ - expected file or module map when it can be inferred safely
55
+ - failure modes and rollback constraints
56
+ - acceptance criteria
57
+ - verification targets
58
+ - rollout, migration, or environment constraints when relevant
59
+
60
+ ## Common Rationalizations
61
+
62
+ | Rationalization | Reality |
63
+ |---|---|
64
+ | "The builder can figure the interfaces out." | Unstable interfaces are where rework and regressions start. |
65
+ | "This is only one feature, so one large spec is fine." | Mixed subsystems still need decomposition if the boundaries differ. |
66
+ | "I’ll leave the risky rollout details for later." | Migration and rollback constraints belong in the spec if they affect implementation. |
67
+
68
+ ## Red Flags
69
+
70
+ - the spec says "update as needed" instead of naming boundaries
71
+ - file or module responsibility is still vague
72
+ - rollout or migration constraints are implied instead of stated
73
+ - contradictory helper names, interface names, or paths appear across sections
74
+
75
+ ## Verification
76
+
77
+ Before handoff, run this inline review:
78
+
79
+ 1. placeholder scan
80
+ 2. internal consistency check
81
+ 3. scope check
82
+ 4. ambiguity check
83
+
84
+ Fix issues inline instead of carrying them into task planning.
85
+
86
+ ## Hard Gates
87
+
88
+ - do not write implementation code
89
+ - do not jump directly to `/aw:build`
90
+ - do not create `tasks.md`
91
+ - do not leave contradictory interfaces, names, or file boundaries unresolved
92
+ - do not treat a multi-subsystem request as one spec when it should be decomposed
93
+
94
+ ## Final Output Shape
95
+
96
+ Always end with:
97
+
98
+ - `Feature Slug`
99
+ - `Spec Path`
100
+ - `Scope Decision`
101
+ - `Architecture`
102
+ - `Acceptance Criteria`
103
+ - `Open Approval Needs`
104
+ - `Recommended Next`
@@ -0,0 +1,138 @@
1
+ ---
2
+ name: aw-tasks
3
+ description: Internal task helper that turns an approved spec into a recipe-level tasks.md artifact for aw-build.
4
+ trigger: Internal only. Invoked by aw-plan after spec authoring or when an existing spec must be expanded into execution-ready tasks.
5
+ ---
6
+
7
+ # AW Tasks
8
+
9
+ ## Overview
10
+
11
+ `aw-tasks` owns `tasks.md`.
12
+ It turns an approved technical contract into a fresh-worker execution recipe without adding a new public planning command.
13
+
14
+ The canonical public route remains `/aw:plan`.
15
+
16
+ ## When to Use
17
+
18
+ - an approved `spec.md` exists and work needs an execution recipe
19
+ - execution scope is still too large or vague for a fresh builder
20
+ - task ordering, checkpoints, or parallel boundaries are unclear
21
+
22
+ Do not use when the request is still missing a stable technical contract.
23
+
24
+ ## Workflow
25
+
26
+ This legacy heading maps to the detailed planning process below.
27
+
28
+ ## The Planning Process
29
+
30
+ 1. Enter task-planning mode.
31
+ Load `.aw_docs/features/<feature_slug>/spec.md` and only the supporting design or product artifacts that matter.
32
+ Do not write code during task authoring.
33
+ 2. Map the file structure first.
34
+ Name the exact files to create, modify, or verify before defining tasks.
35
+ 3. Break the work into vertical slices.
36
+ Prefer end-to-end slices and checkpointed phases over horizontal batches.
37
+ Use `../../references/task-sizing-and-checkpoints.md` when sizing gets fuzzy.
38
+ 4. Write tasks as fresh-worker instructions.
39
+ Include explicit file paths, commands, expected outcomes, commit boundaries, save-point commit expectations, phase ordering, and any bounded parallel execution metadata.
40
+ 5. Review the task list before handoff.
41
+ Remove placeholders, fix dependency drift, and confirm the steps are build-ready.
42
+ 6. Update state and hand off.
43
+ Update `.aw_docs/features/<feature_slug>/state.json` and recommend `/aw:build`.
44
+
45
+ ## Task Granularity
46
+
47
+ Each implementation step should usually be one action that takes about 2-5 minutes, for example:
48
+
49
+ - write the failing test
50
+ - run it to verify it fails for the expected reason
51
+ - write the minimal implementation
52
+ - rerun the test to verify it passes
53
+ - commit the focused change
54
+
55
+ ## Required `tasks.md` Structure
56
+
57
+ `tasks.md` should include:
58
+
59
+ - an explicit `## Spec Brief` section near the top with the feature goal, approved scope, and architecture summary
60
+ - a file structure map before the tasks
61
+ - explicit phase sections such as `## Phase 1`, `## Phase 2`, and so on
62
+ - a short phase outcome or exit check for each phase
63
+ - task sections with exact file paths
64
+ - checkbox steps for tracking
65
+ - validation commands with expected outcomes
66
+ - commit steps for meaningful slices
67
+ - save-point commit expectation for each meaningful slice
68
+ - dependency or ordering notes
69
+ - bounded `parallel_candidate` markers only for disjoint write scopes
70
+ - `parallel_group`, `parallel_ready_when`, and `parallel_write_scope` details for any parallel slice
71
+ - `max_parallel_subagents: 3` by default when parallel fan-out is planned, unless another cap is explicitly justified
72
+
73
+ If a slice cannot end in a clean save-point commit, it should usually be merged into the next dependent slice before handoff to build.
74
+ If no safe disjoint work exists, say so explicitly instead of forcing fake parallelism.
75
+ Even a small plan should still start with `## Spec Brief` and label the execution order with at least `## Phase 1` so the next worker can see the sequence immediately.
76
+
77
+ ## Code and Command Detail
78
+
79
+ When a step changes code or tests, include the smallest concrete snippet, interface shape, or patch intent needed for a fresh worker to execute the step correctly.
80
+
81
+ Every validation step should include:
82
+
83
+ - exact command
84
+ - why that command is being run
85
+ - expected signal such as `FAIL`, `PASS`, or the named artifact/evidence result
86
+
87
+ ## Common Rationalizations
88
+
89
+ | Rationalization | Reality |
90
+ |---|---|
91
+ | "The builder can improvise the missing steps." | If the worker must guess, the task plan is not ready. |
92
+ | "I’ll keep tasks broad so the plan stays short." | Broad tasks hide dependencies and break rollback-friendly execution. |
93
+ | "I don’t need checkpoints for a small project." | Checkpoints are how you keep multi-slice work verifiable. |
94
+
95
+ ## Red Flags
96
+
97
+ - a task title includes multiple independent changes joined by "and"
98
+ - file scope is missing or generic
99
+ - verification steps do not name exact commands or evidence targets
100
+ - parallel work is marked without disjoint write boundaries
101
+ - parallel fan-out is proposed without a cap or without naming the owned write scope
102
+
103
+ ## No Placeholders
104
+
105
+ Never write:
106
+
107
+ - `TODO`
108
+ - `TBD`
109
+ - `implement later`
110
+ - `write tests`
111
+ - `handle edge cases`
112
+ - `same as previous task`
113
+
114
+ If a worker would have to guess, the task is not ready.
115
+
116
+ ## Verification
117
+
118
+ Before handoff:
119
+
120
+ 1. confirm every spec requirement maps to at least one task
121
+ 2. confirm file paths and interface names stay consistent across tasks
122
+ 3. confirm no task relies on an undefined helper, type, or command
123
+ 4. confirm `## Spec Brief` and the phase order are obvious from the top of `tasks.md`
124
+ 5. confirm execution can route straight to `/aw:build`
125
+
126
+ ## Final Output Shape
127
+
128
+ Always end with:
129
+
130
+ - `Feature Slug`
131
+ - `Tasks Path`
132
+ - `Spec Brief`
133
+ - `File Map`
134
+ - `Phases`
135
+ - `Execution Route`
136
+ - `Parallel Candidates`
137
+ - `Review Result`
138
+ - `Recommended Next`
@@ -0,0 +1,118 @@
1
+ ---
2
+ name: aw-test
3
+ description: Produce fresh QA evidence for feature, bugfix, frontend runtime, and release scopes using repo and org quality gates.
4
+ trigger: User requests QA, validation, regression proof, runtime proof, or a legacy verify request resolves to the testing stage.
5
+ ---
6
+
7
+ # AW Test
8
+
9
+ ## Overview
10
+
11
+ `aw-test` owns QA proof.
12
+ It turns implementation into current, scoped evidence that review and deploy can trust, and it should continue until the requested QA scope is actually covered.
13
+
14
+ ## When to Use
15
+
16
+ - the request is to test a feature or bugfix
17
+ - the work needs fresh validation after build
18
+ - frontend behavior needs runtime proof
19
+ - release preparation needs broader QA evidence
20
+ - a legacy `/aw:verify` request is primarily about testing
21
+
22
+ Do not use for findings-oriented review or launch closeout.
23
+
24
+ ## Workflow
25
+
26
+ 1. Select the smallest correct QA scope.
27
+ Use `../../references/test-scope-and-evidence.md`.
28
+ Distinguish feature QA, bugfix regression proof, UI runtime proof, and release QA.
29
+ 2. Resolve org standards.
30
+ Load the resolved GHL baseline profile.
31
+ Pull in the required local validation, E2E, external validation, sandbox, and quality-gate expectations.
32
+ 3. Run the right checks.
33
+ Use `../../references/testing-patterns.md` for test structure.
34
+ For frontend work, load `../../references/frontend-quality-checklist.md`.
35
+ For accessibility-sensitive UI, load `../../references/accessibility-checklist.md`.
36
+ For substantial UI validation, load `frontend-ui-engineering`.
37
+ For live browser, DOM, console, network, or runtime-proof work, load `browser-testing-with-devtools`.
38
+ For performance-sensitive paths, load `performance-optimization` and `../../references/performance-checklist.md`.
39
+ 4. Continue through the requested QA scope.
40
+ Do not stop after the first passing check if the requested feature, bugfix, UI-runtime, or release scope still has missing evidence.
41
+ Only stop early when the remaining work clearly belongs to another stage or a blocker prevents additional testing.
42
+ 5. Record evidence, not vibes.
43
+ Name which commands ran, what passed, what failed, what was unavailable, and what still remains unproven.
44
+ 6. Decide the handoff.
45
+ Route to `aw-review` when findings, governance, or readiness decisions remain.
46
+ Route to `aw-build` when a failure needs repair.
47
+ If the stage is blocked, name the blocker and the smallest safe next action explicitly.
48
+ 7. Persist the result.
49
+ Write `verification.md` and update `state.json`.
50
+
51
+ ## Completion Contract
52
+
53
+ Testing is complete only when one of these is true:
54
+
55
+ - the requested QA scope has fresh enough evidence for the next stage
56
+ - a failure sends the work back to `aw-build`
57
+ - a blocker prevents additional proof and that blocker is explicit
58
+
59
+ Every testing handoff must make these things obvious:
60
+
61
+ - which scope was tested
62
+ - which checks ran
63
+ - which evidence passed or failed
64
+ - which gaps are still unavailable or unresolved
65
+ - which exact next command should run next
66
+
67
+ ## Common Rationalizations
68
+
69
+ | Rationalization | Reality |
70
+ |---|---|
71
+ | "Review can cover testing later." | Review is stronger when QA evidence is already concrete. |
72
+ | "The unit tests are enough for this frontend change." | UI work may also need responsive, accessibility, and runtime proof. |
73
+ | "The check doesn't exist, so I can treat it as passed." | Unavailable is not the same as passed. Record the gap explicitly. |
74
+
75
+ ## Red Flags
76
+
77
+ - stale evidence after code changed
78
+ - no regression proof for a bugfix
79
+ - frontend changes have no runtime or responsive evidence
80
+ - release QA is described vaguely instead of by concrete checks
81
+
82
+ ## State File
83
+
84
+ `state.json` should record at least:
85
+
86
+ - `feature_slug`
87
+ - `stage: "test"`
88
+ - `mode`
89
+ - `status`
90
+ - written artifacts
91
+ - commands run
92
+ - evidence artifacts
93
+ - failures
94
+ - unavailable checks
95
+ - blockers
96
+ - recommended next commands
97
+
98
+ ## Verification
99
+
100
+ Before leaving test, confirm:
101
+
102
+ - [ ] the QA scope is explicit and proportional
103
+ - [ ] org-standard baseline checks were applied where available
104
+ - [ ] unavailable checks are marked unavailable, not silently passed
105
+ - [ ] fresh evidence is written to `verification.md`
106
+ - [ ] `state.json` is updated with checks, failures, and next action
107
+
108
+ ## Final Output Shape
109
+
110
+ Always end with:
111
+
112
+ - `Mode`
113
+ - `Scope`
114
+ - `Checks Run`
115
+ - `Evidence`
116
+ - `Failures`
117
+ - `Unavailable`
118
+ - `Next`
@@ -0,0 +1,51 @@
1
+ ---
2
+ name: aw-verify
3
+ description: Compatibility skill for the older verify stage. Resolve the request into aw-test, aw-review, or the smallest correct combined verification flow.
4
+ trigger: Internal compatibility only. Invoked when older docs, habits, or commands still reference aw-verify.
5
+ ---
6
+
7
+ # AW Verify
8
+
9
+ ## Overview
10
+
11
+ `aw-verify` is a compatibility layer.
12
+ The canonical public verification model is now:
13
+
14
+ - `aw-test` for QA evidence
15
+ - `aw-review` for findings, governance, and readiness
16
+
17
+ ## When to Use
18
+
19
+ - a legacy `/aw:verify` request arrives
20
+ - an older doc still frames testing and review as one overloaded stage
21
+
22
+ ## Workflow
23
+
24
+ 1. Identify the real need.
25
+ If the request is mostly QA, route to `aw-test`.
26
+ If it is mostly findings, governance, or readiness, route to `aw-review`.
27
+ If both are clearly needed, use `aw-test -> aw-review`.
28
+ 2. Preserve the compatibility artifact contract.
29
+ Keep writing `verification.md` and `state.json`.
30
+ 3. Preserve stage continuation as well as stage boundaries.
31
+ Do not stop after partial evidence if the selected test or review scope still remains.
32
+ 4. Do not preserve old ambiguity.
33
+ Prefer the narrowest modern stage once the intent is clear.
34
+
35
+ ## Common Rationalizations
36
+
37
+ | Rationalization | Reality |
38
+ |---|---|
39
+ | "Verify can keep meaning everything forever." | Overloaded stages make routing and ownership unclear. |
40
+
41
+ ## Red Flags
42
+
43
+ - verification requests are not decomposed into test or review intent
44
+ - old verify language is used to hide missing evidence or findings
45
+
46
+ ## Verification
47
+
48
+ - [ ] the request resolved to `aw-test`, `aw-review`, or both
49
+ - [ ] the selected test or review scope was completed or blocked explicitly
50
+ - [ ] compatibility did not create a second verification workflow
51
+ - [ ] `verification.md` and `state.json` stayed consistent with the resolved canonical flow