project-tiny-context-harness 0.2.57 → 0.2.59

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (33) hide show
  1. package/LICENSE +21 -21
  2. package/README.md +496 -496
  3. package/assets/README.md +550 -550
  4. package/assets/README.zh-CN.md +133 -133
  5. package/assets/agents/.gitkeep +1 -1
  6. package/assets/agents/AGENTS_CORE.md +59 -59
  7. package/assets/context_templates/architecture.md +31 -31
  8. package/assets/context_templates/area.md +38 -38
  9. package/assets/context_templates/context.toml +27 -27
  10. package/assets/context_templates/deployment.md +35 -35
  11. package/assets/context_templates/global.md +53 -53
  12. package/assets/context_templates/product-surface-contract.md +60 -60
  13. package/assets/context_templates/verification.md +32 -32
  14. package/assets/github/.gitkeep +1 -1
  15. package/assets/github/harness.yml +41 -41
  16. package/assets/make/.gitkeep +1 -1
  17. package/assets/make/ty-context.mk +48 -48
  18. package/assets/skills/context_development_engineer/SKILL.md +108 -108
  19. package/assets/skills/context_full_project_export/SKILL.md +55 -55
  20. package/assets/skills/context_harness_upgrade/SKILL.md +60 -60
  21. package/assets/skills/context_product_plan/SKILL.md +89 -89
  22. package/assets/skills/context_surface_contract/SKILL.md +168 -168
  23. package/assets/skills/context_uiux_design/SKILL.md +113 -113
  24. package/assets/skills/plan_acceptance_checklist_compiler/SKILL.md +435 -393
  25. package/assets/tools/validate_context.py +276 -276
  26. package/dist/commands/check-modularity.js +8 -8
  27. package/dist/commands/export-context.js +9 -9
  28. package/dist/commands/index.js +16 -16
  29. package/dist/commands/package-source.js +2 -2
  30. package/dist/commands/upgrade.js +9 -9
  31. package/migrations/README.md +3 -3
  32. package/package.json +68 -68
  33. package/source-mappings.yaml +25 -25
@@ -1,279 +1,307 @@
1
- ---
2
- name: plan_acceptance_checklist_compiler
3
- description: Use when the user provides or references a plan, Markdown file, RFC, implementation proposal, milestone plan, or long-task goal/target-mode task and explicitly asks for an acceptance checklist, completion definition, or goal/target-mode prompt. Triggers include acceptance checklist for this plan, goal-mode prompt for this implementation plan, target-mode prompt for this plan, 为这份方案生成验收清单, 整理这份方案输出目标模式文本. Do not trigger on standalone acceptance/目标模式/完成定义 terms without a plan-like source. This Skill writes temporary artifacts under tmp/ty-context/plan-acceptance and never executes the plan or marks completion.
4
- ---
5
-
6
- # Plan Acceptance Checklist Compiler
7
-
8
- ## Package-Managed Boundary
9
-
10
- This Skill is generated by `ty-context sync` and owned by the Harness package. Do not edit the generated `plan_acceptance_checklist_compiler` Skill directly in a consumer project.
11
-
12
- Project-specific acceptance rules belong in `project_context/**`, `DESIGN.md` when visual design facts are durable, or a separate project-local Skill. Keep this package-managed Skill business-agnostic.
13
-
14
- ## Purpose
15
-
16
- Convert a given long-task plan into a strict, project-context-aware acceptance checklist and a short goal/target-mode execution prompt.
17
-
18
- The output answers:
19
-
20
- ```text
21
- For this exact plan, what must be true before a future executor can honestly say the task is complete?
22
- ```
23
-
24
- This Skill is generic. Do not embed business-specific rules, provider-specific rules, product-domain assumptions, artifact schemas, API names, UI names, status names, or one-off project logic in this Skill. If the user's plan contains a specific domain, derive acceptance items from that plan and the project Context during the current invocation only.
25
-
26
- ## Required Outputs
27
-
28
- Every completed invocation must produce:
29
-
30
- 1. A preserved copy of the plan under `tmp/ty-context/plan-acceptance/`.
31
- 2. A rigorous acceptance checklist derived from the plan and relevant project Context.
32
- 3. A goal/target-mode prompt the user can paste directly into Codex.
33
-
34
- Exception: if the Context confirmation gate below triggers, stop after materializing the plan and reading enough Context to explain the uncertainty. Ask the user for confirmation before producing the checklist or goal/target-mode prompt.
35
-
36
- The goal/target-mode prompt must be no longer than 4000 characters. Use the user's language when practical. For Chinese requests, use this shape:
37
-
38
- ```text
39
- 实施计划: tmp/ty-context/plan-acceptance/<plan>.md
40
- 可多开agent,agent名额不够了就关掉不用的。
41
- <验收清单>
42
- ```
43
-
44
- For English requests, use this shape:
45
-
46
- ```text
47
- Plan: tmp/ty-context/plan-acceptance/<plan>.md
48
- You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
49
- <acceptance checklist>
50
- ```
51
-
52
- This resource lifecycle line is part of the generated execution prompt. It authorizes parallel decomposition for the future executor while requiring unused agents to be closed when slots run low.
53
-
54
- ## Non-Scope
55
-
56
- Do not execute the plan unless the user separately asks for execution.
57
-
58
- Do not modify product code, migrations, UI, API, tests, runtime artifacts, or durable Context while using this Skill.
59
-
60
- Do not run long validation, external probes, browser flows, cloud smoke tests, database writes, deployment commands, or other side-effect commands unless the user explicitly asks for validation execution.
61
-
62
- Do not mark any goal complete. This Skill defines acceptance; it does not complete acceptance.
63
-
64
- Do not lower the plan target to match current implementation.
65
-
66
- ## Trigger Conditions
67
-
68
- Use this Skill when the user asks for any of the following:
69
-
70
- - Give a plan, Markdown file, RFC, milestone, or implementation proposal an acceptance checklist.
71
- - Combine project durable Context with a long-task plan to produce acceptance criteria.
72
- - Convert a long-range task plan into a checklist for Codex goal/target mode.
73
- - Generate a completion definition for "execute this plan until done."
74
- - Check whether a plan is missing acceptance conditions.
75
- - Combine user objective, plan steps, project Context, code entry points, and verification paths into an executable acceptance matrix.
76
-
77
- Typical trigger phrases:
78
-
79
- ```text
80
- acceptance checklist for this plan
81
- goal-mode prompt for this implementation plan
82
- target-mode prompt for this plan
83
- completion definition for this plan
84
- long-task plan acceptance
85
- audit this plan for acceptance criteria
86
- 整理这份方案,输出一份目标模式文本给我
87
- 整理方案输出目标模式文本
88
- 为这份方案生成验收清单
89
- 方案验收清单
90
- 长程任务方案验收
91
- 实施计划验收清单
92
- 结合项目 context 梳理方案验收
93
- 为这份 md 生成目标模式验收文本
94
- 把这份 md 整理成验收清单
95
- ```
96
-
97
- Do not trigger from standalone broad phrases such as `goal mode`, `target mode`, `acceptance criteria`, `completion definition`, `目标模式文本`, `验收标准`, or `完成定义` unless the same user request also identifies a plan-like source.
98
-
99
- ## Input Priority
100
-
101
- Build the checklist from these sources, in order:
102
-
103
- 1. User's current instruction.
104
- 2. The referenced or pasted plan.
105
- 3. Relevant durable project Context under `project_context/**`.
106
- 4. Repository guidance such as `AGENTS.md`, `README.md`, `DESIGN.md`, and relevant local Skills.
107
- 5. Current code and tests, only to identify real surfaces, commands, routes, schemas, artifacts, entry points, and likely verification paths.
108
- 6. Existing reports or artifacts, only to identify required evidence and invalidation risks. Existing artifacts are not proof unless the user explicitly asks to audit current completion.
109
-
110
- If plan and Context conflict, preserve the conflict in the checklist. Do not silently choose the easier side.
111
-
112
- ## Step 1: Materialize The Plan Under `tmp/ty-context/plan-acceptance/`
113
-
114
- Before writing the checklist, write the user-specified plan into the repository `tmp/ty-context/plan-acceptance/` directory.
115
-
116
- Rules:
117
-
118
- - If `tmp/ty-context/plan-acceptance/` does not exist, create it.
119
- - If the user references a file outside `tmp/ty-context/plan-acceptance/`, copy its current content into `tmp/ty-context/plan-acceptance/<safe-plan-name>.md`.
120
- - If the user references a file already under `tmp/ty-context/plan-acceptance/`, use that path directly unless the user asks for a new copy.
121
- - If the user pasted the plan text, write the pasted plan exactly into `tmp/ty-context/plan-acceptance/<safe-plan-name>.md`.
122
- - Preserve the plan content. Do not summarize, normalize, reorder, translate, or edit it while materializing it.
123
- - Use a stable readable filename derived from the plan title, source filename, or user topic. Use lowercase letters, digits, hyphens, and `.md`.
124
- - If the source plan cannot be found or read, stop and report the missing source. Do not invent a plan.
125
- - The materialized plan is temporary execution input. It is not durable Context and not proof that any acceptance item passed.
126
-
1
+ ---
2
+ name: plan_acceptance_checklist_compiler
3
+ description: Use when the user provides or references a plan, Markdown file, RFC, implementation proposal, milestone plan, or long-task goal/target-mode task and explicitly asks for an acceptance checklist, completion definition, or goal/target-mode prompt. Triggers include acceptance checklist for this plan, goal-mode prompt for this implementation plan, target-mode prompt for this plan, 为这份方案生成验收清单, 整理这份方案输出目标模式文本. Do not trigger on standalone acceptance/目标模式/完成定义 terms without a plan-like source. This Skill writes temporary artifacts under tmp/ty-context/plan-acceptance and never executes the plan or marks completion.
4
+ ---
5
+
6
+ # Plan Acceptance Checklist Compiler
7
+
8
+ ## Package-Managed Boundary
9
+
10
+ This Skill is generated by `ty-context sync` and owned by the Harness package. Do not edit the generated `plan_acceptance_checklist_compiler` Skill directly in a consumer project.
11
+
12
+ Project-specific acceptance rules belong in `project_context/**`, `DESIGN.md` when visual design facts are durable, or a separate project-local Skill. Keep this package-managed Skill business-agnostic.
13
+
14
+ ## Purpose
15
+
16
+ Convert a given long-task plan into a strict, project-context-aware acceptance checklist and a short goal/target-mode execution prompt.
17
+
18
+ The output answers:
19
+
20
+ ```text
21
+ For this exact plan, what must be true before a future executor can honestly say the task is complete?
22
+ ```
23
+
24
+ This Skill is generic. Do not embed business-specific rules, provider-specific rules, product-domain assumptions, artifact schemas, API names, UI names, status names, or one-off project logic in this Skill. If the user's plan contains a specific domain, derive acceptance items from that plan and the project Context during the current invocation only.
25
+
26
+ ## Required Outputs
27
+
28
+ Every completed invocation must produce:
29
+
30
+ 1. A preserved copy of the plan under `tmp/ty-context/plan-acceptance/`.
31
+ 2. A rigorous acceptance checklist derived from the plan and relevant project Context.
32
+ 3. A goal/target-mode prompt the user can paste directly into Codex.
33
+
34
+ Exception: if the Context confirmation gate below triggers, stop after materializing the plan and reading enough Context to explain the uncertainty. Ask the user for confirmation before producing the checklist or goal/target-mode prompt.
35
+
36
+ The goal/target-mode prompt must be no longer than 3980 characters, including line breaks. This conservative budget keeps the prompt below Codex's 4000-character practical paste boundary even when character counting differs slightly. Keep the prompt information-dense: preserve required paths, core acceptance criteria, evidence rules, blockers and false-completion traps while using compact wording. Use the user's language when practical. For Chinese requests, use this shape:
37
+
38
+ ```text
39
+ 实施计划: tmp/ty-context/plan-acceptance/<plan>.md
40
+ 可多开agent,agent名额不够了就关掉不用的。
41
+ <验收清单>
42
+ ```
43
+
44
+ For English requests, use this shape:
45
+
46
+ ```text
47
+ Plan: tmp/ty-context/plan-acceptance/<plan>.md
48
+ You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
49
+ <acceptance checklist>
50
+ ```
51
+
52
+ This resource lifecycle line is part of the generated execution prompt. It authorizes parallel decomposition for the future executor while requiring unused agents to be closed when slots run low.
53
+
54
+ ## Non-Scope
55
+
56
+ Do not execute the plan unless the user separately asks for execution.
57
+
58
+ Do not modify product code, migrations, UI, API, tests, runtime artifacts, or durable Context while using this Skill.
59
+
60
+ Do not run long validation, external probes, browser flows, cloud smoke tests, database writes, deployment commands, or other side-effect commands unless the user explicitly asks for validation execution.
61
+
62
+ Do not mark any goal complete. This Skill defines acceptance; it does not complete acceptance.
63
+
64
+ Do not lower the plan target to match current implementation.
65
+
66
+ ## Trigger Conditions
67
+
68
+ Use this Skill when the user asks for any of the following:
69
+
70
+ - Give a plan, Markdown file, RFC, milestone, or implementation proposal an acceptance checklist.
71
+ - Combine project durable Context with a long-task plan to produce acceptance criteria.
72
+ - Convert a long-range task plan into a checklist for Codex goal/target mode.
73
+ - Generate a completion definition for "execute this plan until done."
74
+ - Check whether a plan is missing acceptance conditions.
75
+ - Combine user objective, plan steps, project Context, code entry points, and verification paths into an executable acceptance matrix.
76
+
77
+ Typical trigger phrases:
78
+
79
+ ```text
80
+ acceptance checklist for this plan
81
+ goal-mode prompt for this implementation plan
82
+ target-mode prompt for this plan
83
+ completion definition for this plan
84
+ long-task plan acceptance
85
+ audit this plan for acceptance criteria
86
+ 整理这份方案,输出一份目标模式文本给我
87
+ 整理方案输出目标模式文本
88
+ 为这份方案生成验收清单
89
+ 方案验收清单
90
+ 长程任务方案验收
91
+ 实施计划验收清单
92
+ 结合项目 context 梳理方案验收
93
+ 为这份 md 生成目标模式验收文本
94
+ 把这份 md 整理成验收清单
95
+ ```
96
+
97
+ Do not trigger from standalone broad phrases such as `goal mode`, `target mode`, `acceptance criteria`, `completion definition`, `目标模式文本`, `验收标准`, or `完成定义` unless the same user request also identifies a plan-like source.
98
+
99
+ ## Input Priority
100
+
101
+ Build the checklist from these sources, in order:
102
+
103
+ 1. User's current instruction.
104
+ 2. The referenced or pasted plan.
105
+ 3. Relevant durable project Context under `project_context/**`.
106
+ 4. Repository guidance such as `AGENTS.md`, `README.md`, `DESIGN.md`, and relevant local Skills.
107
+ 5. Current code and tests, only to identify real surfaces, commands, routes, schemas, artifacts, entry points, and likely verification paths.
108
+ 6. Existing reports or artifacts, only to identify required evidence and invalidation risks. Existing artifacts are not proof unless the user explicitly asks to audit current completion.
109
+
110
+ If plan and Context conflict, preserve the conflict in the checklist. Do not silently choose the easier side.
111
+
112
+ ## Step 1: Materialize The Plan Under `tmp/ty-context/plan-acceptance/`
113
+
114
+ Before writing the checklist, write the user-specified plan into the repository `tmp/ty-context/plan-acceptance/` directory.
115
+
116
+ Rules:
117
+
118
+ - If `tmp/ty-context/plan-acceptance/` does not exist, create it.
119
+ - If the user references a file outside `tmp/ty-context/plan-acceptance/`, copy its current content into `tmp/ty-context/plan-acceptance/<safe-plan-name>.md`.
120
+ - If the user references a file already under `tmp/ty-context/plan-acceptance/`, use that path directly unless the user asks for a new copy.
121
+ - If the user pasted the plan text, write the pasted plan exactly into `tmp/ty-context/plan-acceptance/<safe-plan-name>.md`.
122
+ - Preserve the plan content. Do not summarize, normalize, reorder, translate, or edit it while materializing it.
123
+ - Use a stable readable filename derived from the plan title, source filename, or user topic. Use lowercase letters, digits, hyphens, and `.md`.
124
+ - If the source plan cannot be found or read, stop and report the missing source. Do not invent a plan.
125
+ - The materialized plan is temporary execution input. It is not durable Context and not proof that any acceptance item passed.
126
+
127
127
  Recommended paths:
128
128
 
129
129
  ```text
130
130
  tmp/ty-context/plan-acceptance/<plan-slug>.md
131
131
  tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md
132
+ tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md
132
133
  ```
133
134
 
134
- ## Step 2: Read Relevant Project Context
135
-
136
- Read only the Context needed for the plan's impacted surfaces. Use Context to identify what the project says the system should mean.
137
-
138
- Use code and tests to identify current implementation surfaces and verification commands. Code describes current state; it must not silently redefine the plan's completion definition.
139
-
140
- Extract:
141
-
142
- - Product or system contract.
143
- - Module boundaries.
144
- - API, data, artifact, runtime, UI, and test contracts.
145
- - Validation and deployment paths.
146
- - Security, privacy, redaction, and environment rules.
147
- - Known non-goals and forbidden dependencies.
148
- - Existing implementation entry points.
149
-
150
- ## Step 3: Run The Context Confirmation Gate
151
-
152
- After reading the plan and relevant Context, decide whether generating an acceptance checklist would require guessing or silently accepting many durable fact changes. If yes, stop and ask the user to confirm details before continuing.
153
-
154
- Trigger this gate when any of these apply:
155
-
156
- - The plan appears to add or change many long-term facts, such as product ownership, module boundaries, API/schema semantics, data contracts, state semantics, verification/deployment entry points, UI information architecture, security/privacy rules, or cross-domain dependency rules.
157
- - The plan would likely require updates across multiple Context roles or areas, or more than a small number of durable Context files.
158
- - The plan conflicts with current Context and the conflict changes the completion definition rather than only implementation details.
159
- - The plan uses broad or strategic language whose meaning cannot be made falsifiable from the plan and Context alone, such as "rebuild the whole architecture", "unify every module", "production-grade", "fully integrated", "maximize capability", `重构整体架构`, `统一所有模块`, `生产级`, `长期方案`, `完全接入`, or similar.
160
- - The acceptance outcome depends on choosing between product/architecture alternatives that the user has not explicitly chosen.
161
- - External approvals, credentials, runtime environments, legal/compliance constraints, paid/provider access, or irreversible operational effects materially change what can be accepted.
162
- - You judge that proceeding would encode a durable assumption the user probably expects to review.
163
-
164
- When the gate triggers:
165
-
166
- - Do not compile the final checklist, write the checklist file, or generate the goal/target-mode prompt yet.
167
- - Ask 1-3 concise questions focused on the blocking durable assumptions.
168
- - Include the materialized plan path if already written.
169
- - State the specific Context files or Context roles that made confirmation necessary.
170
- - Offer a concrete default only when it is safe and reversible; otherwise ask for an explicit decision.
171
- - Resume from Step 2 after the user answers.
172
-
173
- Do not trigger this gate merely because the plan is large. If the Context impact is clear, bounded, and the checklist can represent the durable Context updates as acceptance items without guessing, continue.
174
-
175
- ## Step 4: Compile The Acceptance Contract
176
-
177
- Before the detailed checklist, write a compact contract:
178
-
179
- ```markdown
180
- ## Acceptance Contract
181
-
182
- - Plan source: `tmp/ty-context/plan-acceptance/<plan>.md`
183
- - User objective: <objective>
184
- - Completion definition: <what must be true>
185
- - Core surfaces: <code/API/UI/data/runtime/artifact/test/context/etc.>
186
- - Explicit non-goals: <items>
187
- - External blockers: <accounts/session/credentials/network/cloud/manual approval/etc.>
188
- - Evidence standard: <accepted proof types>
189
- ```
190
-
191
- If the plan uses broad words such as `all`, `complete`, `maximum`, `maximized`, `accepted`, `formal`, `production`, `real`, `current`, or `verified`, convert them into explicit inventory and evidence requirements.
192
-
193
- ## Step 5: Build The Full Acceptance Checklist
194
-
195
- Treat the checklist as the set of falsifiable acceptance items a future executor must satisfy. It is not a progress log.
196
-
197
- Create a checklist table with these columns:
198
-
199
- ```markdown
200
- | ID | Acceptance item | Source | Scope | Required evidence | Verification method | Fail / not accepted if |
201
- |---|---|---|---|---|---|---|
202
- ```
203
-
204
- Column rules:
205
-
206
- - `ID`: stable ID such as `AC-01`; group by workstream if useful.
207
- - `Acceptance item`: one atomic condition that must be true.
208
- - `Source`: user instruction, plan section, Context file, code contract, schema, API, UI, test, or risk note.
209
- - `Scope`: one of `core`, `conditional`, `blocked-external`, `out-of-scope`, `nice-to-have`.
210
- - `Required evidence`: exact proof required.
211
- - `Verification method`: concrete command, check, inspection path, API call, UI check, artifact validation, or manual proof.
212
- - `Fail / not accepted if`: concrete invalidation case.
213
-
214
- Each item must be evidence-bound and falsifiable. Split combined requirements into separate rows.
215
-
216
- ## Hard Blocker Handling
217
-
218
- Treat any unresolved required blocker as non-completion. A checklist may describe blocked acceptance work, but blocked work is still not accepted until the required evidence exists.
219
-
220
- Use `blocked-external` only when the item is required for full completion but depends on unavailable user input, credentials, account/session access, manual approval, external environment state, provider availability, legal/compliance decision, or another dependency the future executor cannot satisfy locally.
221
-
222
- For every `blocked-external` item, state:
223
-
224
- - The exact missing evidence.
225
- - Why the future executor cannot produce it locally.
226
- - The required user, owner, external system, or approval action.
227
- - The fallback or degraded path, if the plan or Context allows one.
228
- - The acceptance impact if the blocker remains unresolved.
229
-
230
- Do not mark blocked work as `out-of-scope`, `conditional`, or `nice-to-have` merely because it is hard or currently unavailable. Do not treat partial local work, preparation, mocks, old evidence, or an unexercised fallback as completion for a blocked required item.
231
-
135
+ The local audit path is for the future goal/target-mode executor. This compiler may require the prompt to maintain it, but the file is a temporary recovery cache and not proof that any acceptance item passed.
136
+
137
+ ## Step 2: Read Relevant Project Context
138
+
139
+ Read only the Context needed for the plan's impacted surfaces. Use Context to identify what the project says the system should mean.
140
+
141
+ Use code and tests to identify current implementation surfaces and verification commands. Code describes current state; it must not silently redefine the plan's completion definition.
142
+
143
+ Extract:
144
+
145
+ - Product or system contract.
146
+ - Module boundaries.
147
+ - API, data, artifact, runtime, UI, and test contracts.
148
+ - Validation and deployment paths.
149
+ - Security, privacy, redaction, and environment rules.
150
+ - Known non-goals and forbidden dependencies.
151
+ - Existing implementation entry points.
152
+
153
+ ## Step 3: Run The Context Confirmation Gate
154
+
155
+ After reading the plan and relevant Context, decide whether generating an acceptance checklist would require guessing or silently accepting many durable fact changes. If yes, stop and ask the user to confirm details before continuing.
156
+
157
+ Trigger this gate when any of these apply:
158
+
159
+ - The plan appears to add or change many long-term facts, such as product ownership, module boundaries, API/schema semantics, data contracts, state semantics, verification/deployment entry points, UI information architecture, security/privacy rules, or cross-domain dependency rules.
160
+ - The plan would likely require updates across multiple Context roles or areas, or more than a small number of durable Context files.
161
+ - The plan conflicts with current Context and the conflict changes the completion definition rather than only implementation details.
162
+ - The plan uses broad or strategic language whose meaning cannot be made falsifiable from the plan and Context alone, such as "rebuild the whole architecture", "unify every module", "production-grade", "fully integrated", "maximize capability", `重构整体架构`, `统一所有模块`, `生产级`, `长期方案`, `完全接入`, or similar.
163
+ - The acceptance outcome depends on choosing between product/architecture alternatives that the user has not explicitly chosen.
164
+ - External approvals, credentials, runtime environments, legal/compliance constraints, paid/provider access, or irreversible operational effects materially change what can be accepted.
165
+ - You judge that proceeding would encode a durable assumption the user probably expects to review.
166
+
167
+ When the gate triggers:
168
+
169
+ - Do not compile the final checklist, write the checklist file, or generate the goal/target-mode prompt yet.
170
+ - Ask 1-3 concise questions focused on the blocking durable assumptions.
171
+ - Include the materialized plan path if already written.
172
+ - State the specific Context files or Context roles that made confirmation necessary.
173
+ - Offer a concrete default only when it is safe and reversible; otherwise ask for an explicit decision.
174
+ - Resume from Step 2 after the user answers.
175
+
176
+ Do not trigger this gate merely because the plan is large. If the Context impact is clear, bounded, and the checklist can represent the durable Context updates as acceptance items without guessing, continue.
177
+
178
+ ## Step 4: Compile The Acceptance Contract
179
+
180
+ Before the detailed checklist, write a compact contract:
181
+
182
+ ```markdown
183
+ ## Acceptance Contract
184
+
185
+ - Plan source: `tmp/ty-context/plan-acceptance/<plan>.md`
186
+ - User objective: <objective>
187
+ - Completion definition: <what must be true>
188
+ - Core surfaces: <code/API/UI/data/runtime/artifact/test/context/etc.>
189
+ - Explicit non-goals: <items>
190
+ - External blockers: <accounts/session/credentials/network/cloud/manual approval/etc.>
191
+ - Evidence standard: <accepted proof types>
192
+ ```
193
+
194
+ If the plan uses broad words such as `all`, `complete`, `maximum`, `maximized`, `accepted`, `formal`, `production`, `real`, `current`, or `verified`, convert them into explicit inventory and evidence requirements.
195
+
196
+ ## Step 5: Build The Full Acceptance Checklist
197
+
198
+ Treat the checklist as the set of falsifiable acceptance items a future executor must satisfy. It is not a progress log.
199
+
200
+ Create a checklist table with these columns:
201
+
202
+ ```markdown
203
+ | ID | Acceptance item | Source | Scope | Required evidence | Verification method | Fail / not accepted if |
204
+ |---|---|---|---|---|---|---|
205
+ ```
206
+
207
+ Column rules:
208
+
209
+ - `ID`: stable ID such as `AC-01`; group by workstream if useful.
210
+ - `Acceptance item`: one atomic condition that must be true.
211
+ - `Source`: user instruction, plan section, Context file, code contract, schema, API, UI, test, or risk note.
212
+ - `Scope`: one of `core`, `conditional`, `blocked-external`, `out-of-scope`, `nice-to-have`.
213
+ - `Required evidence`: exact proof required.
214
+ - `Verification method`: concrete command, check, inspection path, API call, UI check, artifact validation, or manual proof.
215
+ - `Fail / not accepted if`: concrete invalidation case.
216
+
217
+ Each item must be evidence-bound and falsifiable. Split combined requirements into separate rows.
218
+
219
+ ## Hard Blocker Handling
220
+
221
+ Treat any unresolved required blocker as non-completion. A checklist may describe blocked acceptance work, but blocked work is still not accepted until the required evidence exists.
222
+
223
+ Use `blocked-external` only when the item is required for full completion but depends on unavailable user input, credentials, account/session access, manual approval, external environment state, provider availability, legal/compliance decision, or another dependency the future executor cannot satisfy locally.
224
+
225
+ For every `blocked-external` item, state:
226
+
227
+ - The exact missing evidence.
228
+ - Why the future executor cannot produce it locally.
229
+ - The required user, owner, external system, or approval action.
230
+ - The fallback or degraded path, if the plan or Context allows one.
231
+ - The acceptance impact if the blocker remains unresolved.
232
+
233
+ Do not mark blocked work as `out-of-scope`, `conditional`, or `nice-to-have` merely because it is hard or currently unavailable. Do not treat partial local work, preparation, mocks, old evidence, or an unexercised fallback as completion for a blocked required item.
234
+
232
235
  The generated goal/target-mode prompt must tell the future executor:
233
236
 
234
237
  - If any executable `core` item remains, continue working it before asking the user to resolve blockers.
235
238
  - If the only remaining incomplete required items are hard blockers that cannot be satisfied locally, pause and wait for the user or external owner instead of marking the goal complete.
236
239
  - The handoff must list blocker cause, missing evidence, acceptance impact, and the exact user/external action needed to resume.
237
240
 
238
- ## Generic Acceptance Dimensions
239
-
240
- Include only dimensions relevant to the plan. Do not mechanically add irrelevant categories.
241
-
242
- Consider these generic dimensions:
243
-
244
- - Objective and completion definition.
245
- - Scope inventory and coverage.
246
- - Domain / project Context conformance.
247
- - Code implementation behavior.
248
- - API or interface contract.
249
- - Data model, schema, migration, or persistence.
250
- - Runtime state, configuration, session, credential, environment, or degraded behavior.
251
- - Artifact generation, schema, freshness, provenance, and acceptance.
252
- - UI or user-visible projection.
253
- - Async job, worker, scheduler, queue, or background process.
254
- - Security, privacy, redaction, secrets, and access control.
255
- - Observability, logs, diagnostics, and operator visibility.
256
- - Performance, timeout, boundedness, pagination, and resource budget.
257
- - Backward compatibility and non-regression.
258
- - Failure states, blocked states, retries, recovery, and next actions.
259
- - Unit, integration, contract, smoke, browser, cloud, or production-like validation.
260
- - Documentation or durable Context update, only when the plan requires a durable fact change.
261
-
262
- ## Evidence Layer Separation
263
-
241
+ ## Minimal User Blocker Protocol
242
+
243
+ The generated goal/target-mode prompt must make user questions minimal and action-oriented when the plan says to do as much locally as possible.
244
+
245
+ Before asking the user, the future executor must complete safe self-service discovery that is available in the repository or current environment:
246
+
247
+ - Read relevant project Context, code, docs, tests, runtime examples, local artifacts and official documentation when current external rules matter.
248
+ - Use local CLIs, APIs, Browser/Chrome/Computer Use or other tools only when appropriate, safe and allowed by the current tool and security policy.
249
+ - Do not bypass user approval, account boundaries or sensitive-operation rules.
250
+
251
+ When user input is still required, the blocker request must be the smallest concrete action list, not a research assignment. It must include:
252
+
253
+ - What was already tried.
254
+ - What is still missing.
255
+ - The exact page, menu, path, field, button, command input or setting when it can be identified.
256
+ - The minimum value or action the user needs to provide.
257
+ - Sensitive material the user should not send, such as cookies, full pages, HAR files, password-store data, raw payloads, unrelated account data, screenshots when a value-only request is enough, or secrets in chat when a safer runtime-only setup path exists.
258
+ - Acceptance impact if the blocker remains unresolved.
259
+ - Allowed fallback, deferred or narrowed-scope option, if the plan or Context permits one.
260
+
261
+ Do not replace this with broad requests such as "provide the token", "send the AppSecret" or "give me the configuration credentials" when a narrower page, field, runtime setup step or delegated user action would satisfy the blocker.
262
+
263
+ ## Generic Acceptance Dimensions
264
+
265
+ Include only dimensions relevant to the plan. Do not mechanically add irrelevant categories.
266
+
267
+ Consider these generic dimensions:
268
+
269
+ - Objective and completion definition.
270
+ - Scope inventory and coverage.
271
+ - Domain / project Context conformance.
272
+ - Code implementation behavior.
273
+ - API or interface contract.
274
+ - Data model, schema, migration, or persistence.
275
+ - Runtime state, configuration, session, credential, environment, or degraded behavior.
276
+ - Artifact generation, schema, freshness, provenance, and acceptance.
277
+ - UI or user-visible projection.
278
+ - Async job, worker, scheduler, queue, or background process.
279
+ - Security, privacy, redaction, secrets, and access control.
280
+ - Observability, logs, diagnostics, and operator visibility.
281
+ - Performance, timeout, boundedness, pagination, and resource budget.
282
+ - Backward compatibility and non-regression.
283
+ - Failure states, blocked states, retries, recovery, and next actions.
284
+ - Unit, integration, contract, smoke, browser, cloud, or production-like validation.
285
+ - Documentation or durable Context update, only when the plan requires a durable fact change.
286
+
287
+ ## Evidence Layer Separation
288
+
264
289
  For evidence-heavy or validation-heavy plans, keep these layers separate:
265
290
 
266
291
  ```text
267
292
  infrastructure implemented
293
+ runtime configured
268
294
  runtime exercised
269
- evidence collected
270
- evidence accepted
271
- system surfaces reflect evidence
272
- final validation passed
295
+ artifact generated
296
+ artifact accepted by validator
297
+ API/UI reflects accepted evidence
298
+ final gate/check command passed
273
299
  ```
274
300
 
275
301
  Do not merge these layers unless the plan explicitly treats them as one.
276
302
 
303
+ For runtime, provider, artifact or external-evidence acceptance, distinguish adapter or infrastructure implementation, formal artifact production, scope alignment, redaction, replayability, validator acceptance, registry/API/UI projection, full gate/check success and any explicitly narrowed or deferred scope. A fallback path must be exercised before it can prove full completion.
304
+
277
305
  Examples of generic false completion:
278
306
 
279
307
  - Code exists, but runtime behavior was not exercised.
@@ -281,119 +309,129 @@ Examples of generic false completion:
281
309
  - Tests pass, but the required API/UI/runtime proof was not checked.
282
310
  - A partial sample ran, but the plan required full inventory coverage.
283
311
  - A document was updated, but the system state did not change.
284
-
285
- ## Scope Labels
286
-
287
- Use these exact labels:
288
-
289
- - `core`: required to call the plan complete.
290
- - `conditional`: required only when a stated condition applies.
291
- - `blocked-external`: required for full completion but depends on unavailable external input.
292
- - `out-of-scope`: explicitly excluded by the user or plan.
293
- - `nice-to-have`: useful but not required; use sparingly.
294
-
295
- Do not mark an item out of scope only because it is difficult.
296
- Do not mark a `blocked-external` item complete until its required evidence exists.
297
-
298
- ## Conflict Handling
299
-
300
- If the plan, Context, and code disagree, add a conflict table:
301
-
302
- ```markdown
303
- | Conflict | Plan says | Context says | Code currently does | Acceptance impact | Required resolution |
304
- |---|---|---|---|---|---|
305
- ```
306
-
307
- Do not resolve conflict by silently following current code.
308
-
309
- ## False-Completion Traps
310
-
311
- After the checklist, include traps that a future executor must not mistake for completion.
312
-
313
- Use generic wording. Do not add business-specific traps to the Skill itself.
314
-
315
- Template:
316
-
317
- ```markdown
318
- ## False-Completion Traps
319
-
320
- - <activity> is not enough unless <acceptance proof> also exists.
321
- - <partial proof> cannot prove <full acceptance item>.
322
- - <old/stale/mismatched evidence> cannot prove current completion.
323
- - <surface A> passing does not prove <surface B> unless the checklist links them.
324
- ```
325
-
326
- ## Suggested Execution Order
327
-
328
- Suggest an execution order that prioritizes the highest-risk proof first:
329
-
330
- 1. Confirm scope inventory and completion definition.
331
- 2. Confirm schemas/contracts and invalid evidence rules.
332
- 3. Implement or repair the core path.
333
- 4. Produce or inspect required runtime/artifact/API/UI evidence.
334
- 5. Run contract/integration/regression checks.
335
- 6. Run build/typecheck/lint/smoke checks.
336
- 7. Update temporary checklist status only if the user asked for status tracking.
337
-
338
- Do not put low-signal checks before the core acceptance proof when the plan is evidence-driven.
339
-
340
- ## Checklist Self-Test
341
-
342
- Every generated checklist must pass this self-test:
343
-
344
- ```markdown
345
- ## Checklist Self-Test
346
-
347
- - [ ] Every original plan section is mapped to at least one acceptance item or explicitly marked non-goal.
348
- - [ ] Every broad word such as all / maximum / complete / formal / production / verified is converted into inventory + evidence.
349
- - [ ] Every core item has a verification method.
350
- - [ ] Every evidence item defines invalid evidence.
351
- - [ ] Code, API, UI, data, runtime, artifacts, and tests are separated when they prove different things.
352
- - [ ] External blockers are separated from local remaining work.
353
- - [ ] Hard blockers are treated as non-completion, with required user/external action and pause conditions.
354
- - [ ] Current implementation was not used to reduce the user's completion definition.
355
- - [ ] The checklist can be handed to another Codex session without hidden assumptions.
356
- ```
357
-
358
- ## Goal/Target-Mode Prompt Generation
359
-
360
- After compiling the checklist, produce a paste-ready goal/target-mode prompt.
361
-
362
- Hard requirements:
363
-
364
- - The prompt must be no longer than 4000 characters including line breaks.
365
- - The first line must identify the plan path.
366
- - Use `实施计划: <path>` for Chinese prompts and `Plan: <path>` for English prompts.
367
- - The second line must be a resource lifecycle instruction: `可多开agent,agent名额不够了就关掉不用的。` for Chinese prompts or `You may use multiple agents; if agent slots run low, close idle or unnecessary agents.` for English prompts.
368
- - The remaining content must be the acceptance checklist or a compact version of it.
312
+ - A fallback exists in code, but the fallback was not configured or exercised.
313
+ - Old, partial, smoke, dry-run or research evidence exists, but the current acceptance contract requires full current proof.
314
+
315
+ ## Scope Labels
316
+
317
+ Use these exact labels:
318
+
319
+ - `core`: required to call the plan complete.
320
+ - `conditional`: required only when a stated condition applies.
321
+ - `blocked-external`: required for full completion but depends on unavailable external input.
322
+ - `out-of-scope`: explicitly excluded by the user or plan.
323
+ - `nice-to-have`: useful but not required; use sparingly.
324
+
325
+ Do not mark an item out of scope only because it is difficult.
326
+ Do not mark a `blocked-external` item complete until its required evidence exists.
327
+
328
+ ## Conflict Handling
329
+
330
+ If the plan, Context, and code disagree, add a conflict table:
331
+
332
+ ```markdown
333
+ | Conflict | Plan says | Context says | Code currently does | Acceptance impact | Required resolution |
334
+ |---|---|---|---|---|---|
335
+ ```
336
+
337
+ Do not resolve conflict by silently following current code.
338
+
339
+ ## False-Completion Traps
340
+
341
+ After the checklist, include traps that a future executor must not mistake for completion.
342
+
343
+ Use generic wording. Do not add business-specific traps to the Skill itself.
344
+
345
+ Template:
346
+
347
+ ```markdown
348
+ ## False-Completion Traps
349
+
350
+ - <activity> is not enough unless <acceptance proof> also exists.
351
+ - <partial proof> cannot prove <full acceptance item>.
352
+ - <old/stale/mismatched evidence> cannot prove current completion.
353
+ - <surface A> passing does not prove <surface B> unless the checklist links them.
354
+ ```
355
+
356
+ ## Suggested Execution Order
357
+
358
+ Suggest an execution order that prioritizes the highest-risk proof first:
359
+
360
+ 1. Confirm scope inventory and completion definition.
361
+ 2. Confirm schemas/contracts and invalid evidence rules.
362
+ 3. Implement or repair the core path.
363
+ 4. Produce or inspect required runtime/artifact/API/UI evidence.
364
+ 5. Run contract/integration/regression checks.
365
+ 6. Run build/typecheck/lint/smoke checks.
366
+ 7. Update temporary checklist status only if the user asked for status tracking.
367
+
368
+ Do not put low-signal checks before the core acceptance proof when the plan is evidence-driven.
369
+
370
+ ## Checklist Self-Test
371
+
372
+ Every generated checklist must pass this self-test:
373
+
374
+ ```markdown
375
+ ## Checklist Self-Test
376
+
377
+ - [ ] Every original plan section is mapped to at least one acceptance item or explicitly marked non-goal.
378
+ - [ ] Every broad word such as all / maximum / complete / formal / production / verified is converted into inventory + evidence.
379
+ - [ ] Every core item has a verification method.
380
+ - [ ] Every evidence item defines invalid evidence.
381
+ - [ ] Code, API, UI, data, runtime, artifacts, and tests are separated when they prove different things.
382
+ - [ ] External blockers are separated from local remaining work.
383
+ - [ ] Hard blockers are treated as non-completion, with required user/external action and pause conditions.
384
+ - [ ] Current implementation was not used to reduce the user's completion definition.
385
+ - [ ] The checklist can be handed to another Codex session without hidden assumptions.
386
+ ```
387
+
388
+ ## Goal/Target-Mode Prompt Generation
389
+
390
+ After compiling the checklist, produce a paste-ready goal/target-mode prompt.
391
+
392
+ Hard requirements:
393
+
394
+ - The prompt must be no longer than 3980 characters including line breaks. Treat 3980 as the effective hard budget and preserve information density; do not drop required paths, core acceptance categories, blocker rules, evidence rules or false-completion traps merely to be short.
395
+ - The first line must identify the plan path.
396
+ - Use `实施计划: <path>` for Chinese prompts and `Plan: <path>` for English prompts.
397
+ - The second line must be a resource lifecycle instruction: `可多开agent,agent名额不够了就关掉不用的。` for Chinese prompts or `You may use multiple agents; if agent slots run low, close idle or unnecessary agents.` for English prompts.
398
+ - The remaining content must be the acceptance checklist or a compact version of it.
369
399
  - The prompt must be self-contained enough for goal/target-mode execution.
400
+ - The prompt must identify a local audit path, normally `tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md`, and require the future executor to read it before resuming, keep it current during execution, and use it only as target-mode acceptance progress state.
401
+ - The prompt must require the local audit to record overall status (`complete`, `incomplete`, `blocked` or `narrowed-scope-complete`), each core AC status and current evidence, commands with result/time/failure reason, artifact or evidence paths, blockers and missing evidence, acceptance impact, explicit deferred or narrowed scope, and stale/partial/smoke/dry-run/research evidence that cannot prove full completion.
402
+ - The prompt must say that local audit is not Context, not product-quality proof, not a global task manager, and not a replacement for project tests, CI, review, human acceptance, Task Contract or workflow-contract `plan.md`.
403
+ - The prompt must say that when a Task Contract or workflow-contract `plan.md` exists, each acceptance item execution still follows it and the repository's Tiny Context workflow contract.
370
404
  - Do not include explanatory preface inside the prompt.
371
405
  - Do not include Markdown tables inside the prompt if they make it too long.
372
406
  - Prefer short numbered items over verbose prose.
373
- - If the full checklist is too large, write the full checklist to `tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md`, then compress the goal/target-mode prompt while preserving all core acceptance categories.
407
+ - If the full checklist is too large, write the full checklist to `tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md`, then compress the goal/target-mode prompt by increasing information density while preserving all core acceptance categories.
374
408
  - The compact prompt may reference the full checklist path, but it must still include the core completion criteria directly.
375
-
376
- Recommended compact Chinese prompt shape:
377
-
409
+
410
+ Recommended compact Chinese prompt shape:
411
+
378
412
  ```text
379
413
  实施计划: tmp/ty-context/plan-acceptance/<plan-slug>.md
380
414
  可多开agent,agent名额不够了就关掉不用的。
415
+ 完整验收清单: tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md
416
+ 执行审计: tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md
381
417
 
382
418
  验收清单:
383
419
  AC1 <核心完成定义,包含验收证据>
384
- AC2 <范围/清单/覆盖要求>
385
- AC3 <Context/架构/边界要求>
386
- AC4 <核心实现行为要求>
387
- AC5 <数据/API/接口/契约要求>
388
- AC6 <运行态/配置/外部依赖/阻塞分类要求>
389
- AC7 <artifact/evidence/schema/freshness/provenance 要求>
390
- AC8 <UI/用户可见/API 投影一致性要求>
391
- AC9 <安全/隐私/脱敏/secret 要求>
420
+ AC2 <范围/清单/覆盖要求>
421
+ AC3 <Context/架构/边界要求>
422
+ AC4 <核心实现行为要求>
423
+ AC5 <数据/API/接口/契约要求>
424
+ AC6 <运行态/配置/外部依赖/阻塞分类要求>
425
+ AC7 <artifact/evidence/schema/freshness/provenance 要求>
426
+ AC8 <UI/用户可见/API 投影一致性要求>
427
+ AC9 <安全/隐私/脱敏/secret 要求>
392
428
  AC10 <测试/构建/集成/smoke/回归要求>
393
429
  AC11 <文档/Context 更新要求,仅在计划要求时执行>
394
- AC12 完成前审计:逐条对照实施计划;每个 core 项必须有当前证据;未跑验证必须明示;有可继续执行的 core 项不得标记完成;外部/强卡点必须写明原因、缺失证据、验收影响和下一步;若剩余未完成项只有无法本地解决的强卡点,暂停并等待用户/外部 owner,不能标记目标完成。
430
+ AC12 维护执行审计:恢复执行先读 audit;记录总体状态、每个 AC 当前证据、命令/结果/时间、artifact/evidence 路径、blocker、deferred/narrowed scope、不能证明 full completion 的旧/部分/smoke/dry-run/research 证据;audit 不是 Context、完成证明、全局任务管理器,也不替代 Task Contract 或流程契约 plan.md。
431
+ AC13 最小用户卡点:问用户前先完成安全自助发现;需要用户介入时只给最小动作清单,写明已尝试、缺失项、具体页面/菜单/字段/按钮、最小值/动作、不要发送的敏感信息、验收影响、fallback/deferred。
432
+ AC14 完成前审计:逐条对照实施计划和完整 checklist;每个 core 项必须有当前证据;未跑验证必须明示;有可继续执行的 core 项不得标记完成;外部/强卡点必须写明原因、缺失证据、验收影响和下一步;若剩余未完成项只有无法本地解决的强卡点,暂停并等待用户/外部 owner,不能标记目标完成。
395
433
 
396
- 禁止把以下内容当完成:只改代码、只更新计划、只跑部分测试、只生成旧/部分/不被当前契约接受的证据、只完成基础设施但未完成验收证据、强卡点未解除、API/UI/数据/测试之间仍矛盾。
434
+ 禁止把以下内容当完成:只改代码、只更新计划、只跑部分测试、只生成旧/部分/不被当前契约接受的证据、只完成基础设施但未完成验收证据、runtime 未配置/未演练、artifact 未被 validator 接受、API/UI 未反映验收证据、fallback 未演练、强卡点未解除、API/UI/数据/测试之间仍矛盾。
397
435
  ```
398
436
 
399
437
  Recommended compact English prompt shape:
@@ -401,51 +439,55 @@ Recommended compact English prompt shape:
401
439
  ```text
402
440
  Plan: tmp/ty-context/plan-acceptance/<plan-slug>.md
403
441
  You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
442
+ Full checklist: tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md
443
+ Local audit: tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md
404
444
 
405
445
  Acceptance checklist:
406
446
  AC1 <core completion definition with required evidence>
407
- AC2 <scope inventory and coverage>
408
- AC3 <Context / architecture / boundary conformance>
409
- AC4 <core implementation behavior>
410
- AC5 <data / API / interface / contract requirements>
411
- AC6 <runtime / configuration / external dependency / blocker classification>
412
- AC7 <artifact / evidence / schema / freshness / provenance requirements>
413
- AC8 <UI / user-visible / API projection consistency>
447
+ AC2 <scope inventory and coverage>
448
+ AC3 <Context / architecture / boundary conformance>
449
+ AC4 <core implementation behavior>
450
+ AC5 <data / API / interface / contract requirements>
451
+ AC6 <runtime / configuration / external dependency / blocker classification>
452
+ AC7 <artifact / evidence / schema / freshness / provenance requirements>
453
+ AC8 <UI / user-visible / API projection consistency>
414
454
  AC9 <security / privacy / redaction / secret handling>
415
455
  AC10 <test / build / integration / smoke / regression requirements>
416
456
  AC11 <documentation / Context updates only when required by the plan>
417
- AC12 Final audit: compare every item against the plan; every core item needs current evidence; missing validation must be stated; any executable core item left open means the task is not complete; external or hard blockers need cause, missing evidence, acceptance impact, and next action; if only locally unsatisfiable hard blockers remain, pause for the user or external owner instead of marking the goal complete.
457
+ AC12 Maintain local audit: read it before resuming; record overall status, every AC's current evidence, commands/results/time, artifact/evidence paths, blockers, deferred/narrowed scope, and stale/partial/smoke/dry-run/research evidence that cannot prove full completion; audit is not Context, completion proof, a global task manager, or a replacement for Task Contract or workflow-contract plan.md.
458
+ AC13 Minimal user blocker protocol: before asking the user, complete safe self-service discovery; when user action is needed, provide only the smallest action list with what was tried, missing item, exact page/menu/path/field/button, minimum value/action, sensitive material not to send, acceptance impact, and fallback/deferred option.
459
+ AC14 Final audit: compare every item against the plan and full checklist; every core item needs current evidence; missing validation must be stated; any executable core item left open means the task is not complete; external or hard blockers need cause, missing evidence, acceptance impact, and next action; if only locally unsatisfiable hard blockers remain, pause for the user or external owner instead of marking the goal complete.
418
460
 
419
- Do not count these as completion: code-only changes, plan-only updates, partial tests, stale or partial evidence, infrastructure without acceptance proof, unresolved hard blockers, or contradictions between API/UI/data/tests.
461
+ Do not count these as completion: code-only changes, plan-only updates, partial tests, stale or partial evidence, infrastructure without acceptance proof, runtime not configured/exercised, artifact not accepted by validator, API/UI not reflecting accepted evidence, unexercised fallback, unresolved hard blockers, or contradictions between API/UI/data/tests.
420
462
  ```
421
-
422
- Before final response, check the prompt length. If it exceeds 4000 characters, compress it and check again.
423
-
424
- ## Final Response Requirements
425
-
426
- For completed checklist runs, the final response must include:
427
-
428
- - `tmp/ty-context/plan-acceptance/` plan path.
429
- - Full checklist path if a full checklist file was written.
430
- - The paste-ready goal/target-mode prompt in a code block.
431
- - Any unresolved ambiguity that affects checklist accuracy.
432
-
433
- If the Context confirmation gate triggers, the response must ask for the needed confirmation instead of including the checklist or goal/target-mode prompt.
434
-
435
- Do not claim that any acceptance item has passed unless the user explicitly asked for a current completion audit and current evidence was inspected.
436
-
437
- ## Forbidden Behaviors
438
-
439
- Do not include concrete business-domain logic in this Skill.
440
-
441
- Do not hardcode project-specific provider names, API names, UI names, artifact paths, schemas, statuses, or commands in this Skill.
442
-
443
- Do not execute the generated goal/target-mode prompt.
444
-
445
- Do not use the checklist to mark the task complete.
446
-
447
- Do not hide missing source files, ambiguous plan scope, or external blockers.
448
-
449
- Do not rewrite the user's plan while copying it into `tmp/ty-context/plan-acceptance/`.
450
-
451
- Do not use a short sample, stale artifact, old result, or current implementation convenience to reduce a plan that asked for full completion.
463
+
464
+ Before final response, check the prompt length. If it exceeds 3980 characters, compress by tightening wording and referring to the full checklist path while preserving required paths, core AC categories, blocker/evidence rules and false-completion traps; then check again.
465
+
466
+ ## Final Response Requirements
467
+
468
+ For completed checklist runs, the final response must include:
469
+
470
+ - `tmp/ty-context/plan-acceptance/` plan path.
471
+ - Full checklist path if a full checklist file was written.
472
+ - The paste-ready goal/target-mode prompt in a code block.
473
+ - Any unresolved ambiguity that affects checklist accuracy.
474
+
475
+ If the Context confirmation gate triggers, the response must ask for the needed confirmation instead of including the checklist or goal/target-mode prompt.
476
+
477
+ Do not claim that any acceptance item has passed unless the user explicitly asked for a current completion audit and current evidence was inspected.
478
+
479
+ ## Forbidden Behaviors
480
+
481
+ Do not include concrete business-domain logic in this Skill.
482
+
483
+ Do not hardcode project-specific provider names, API names, UI names, artifact paths, schemas, statuses, or commands in this Skill.
484
+
485
+ Do not execute the generated goal/target-mode prompt.
486
+
487
+ Do not use the checklist to mark the task complete.
488
+
489
+ Do not hide missing source files, ambiguous plan scope, or external blockers.
490
+
491
+ Do not rewrite the user's plan while copying it into `tmp/ty-context/plan-acceptance/`.
492
+
493
+ Do not use a short sample, stale artifact, old result, or current implementation convenience to reduce a plan that asked for full completion.