project-tiny-context-harness 0.2.62 → 0.2.63
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +42 -18
- package/assets/README.md +44 -18
- package/assets/README.zh-CN.md +41 -14
- package/assets/skills/plan_acceptance_checklist_compiler/SKILL.md +45 -9
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -8,15 +8,15 @@
|
|
|
8
8
|
|
|
9
9
|
Translations: [Chinese (Simplified)](https://github.com/Seven128/project-tiny-context-harness/blob/main/README.zh-CN.md)
|
|
10
10
|
|
|
11
|
-
`project-tiny-context-harness` ships the `ty-context` CLI for Project Tiny Context Harness: repo-native project memory for AI coding agents.
|
|
12
|
-
|
|
13
|
-
The default is **Minimal Context Harness**. It maintains a compact `project_context/**` fact source, a short `AGENTS.md` startup router, role Skills and a `validate-context` gate so fresh agents can recover project intent, constraints, verification entry points and next safe actions quickly.
|
|
11
|
+
`project-tiny-context-harness` ships the `ty-context` CLI for Project Tiny Context Harness: repo-native project memory for AI coding agents and a repo-native context contract.
|
|
12
|
+
|
|
13
|
+
The default is **Minimal Context Harness**. It maintains a compact `project_context/**` fact source, a short `AGENTS.md` startup router, role Skills, priority guidance for Context/code/evidence, and a `validate-context` gate so fresh agents can recover project intent, constraints, verification entry points and next safe actions quickly.
|
|
14
14
|
|
|
15
15
|
It does not default to lifecycle phases, plan tasks, stage skills, stage documents or phase gates. Harness maintains context quality; your project tests, CI, review process and human acceptance remain responsible for product quality.
|
|
16
16
|
|
|
17
17
|
Use it when coding agents repeatedly lose project intent across new chats, handoffs, RFC/debug turns or tool changes. The intended tradeoff is: keep durable intent and recovery paths; leave execution evidence to code, tests and review.
|
|
18
18
|
|
|
19
|
-
Think of it as durable project memory behind `AGENTS.md`, not another agent, process framework or task manager.
|
|
19
|
+
Think of it as durable project memory behind `AGENTS.md`, plus priority rules for Context/code/evidence, not another agent, process framework or task manager.
|
|
20
20
|
|
|
21
21
|
Best for:
|
|
22
22
|
|
|
@@ -63,19 +63,43 @@ No-install preview:
|
|
|
63
63
|
|
|
64
64
|
Coding agents can move quickly inside one thread and still drift when a new chat, model, tool, reviewer or debugging session loses the project-specific facts that were never encoded anywhere stable.
|
|
65
65
|
|
|
66
|
-
Minimal Context Harness creates a small, explicit recovery path: project goal, boundaries, architecture context, validation entry points and durable task conclusions. It is designed to sit beside specs, tests, issues, docs and code intelligence tools instead of replacing them.
|
|
67
|
-
|
|
68
|
-
The
|
|
69
|
-
|
|
70
|
-
|
|
71
|
-
|
|
72
|
-
|
|
73
|
-
|
|
74
|
-
|
|
75
|
-
|
|
76
|
-
|
|
77
|
-
|
|
78
|
-
|
|
66
|
+
Minimal Context Harness creates a small, explicit recovery path: project goal, boundaries, architecture context, validation entry points and durable task conclusions. It is designed to sit beside specs, tests, issues, docs and code intelligence tools instead of replacing them.
|
|
67
|
+
|
|
68
|
+
The concrete failure mode is not only missing file search. In an ABCD module chain where A/B/C are upstream of downstream D, a D feature can expose a missing capability. Without Context, an agent may change upstream A/B to make D pass because current code permits it. Minimal Context adds a repo-owned intent layer: it records whether downstream D may change upstream A/B, whether the gap belongs in C's contract, or whether the task needs a `Context Delta` before implementation continues. Code shows what is possible; it cannot decide whether that is allowed project intent.
|
|
69
|
+
|
|
70
|
+
The core bet is: **keep the memory, drop the ceremony**. Earlier stage-based workflows pushed ordinary software work through explicit phase artifacts and gates. Modern coding agents already internalize much of the understand, design, implement, test and repair loop, so Project Tiny Context Harness keeps the high-density repo context that survives fresh chats without making every task follow Tiny Context-stage choreography.
|
|
71
|
+
|
|
72
|
+
## Current Best Practice
|
|
73
|
+
|
|
74
|
+
For short tasks, use the workflow contract and Context layer directly:
|
|
75
|
+
|
|
76
|
+
```text
|
|
77
|
+
workflow contract + project_context/** -> implementation -> verification -> drift check
|
|
78
|
+
```
|
|
79
|
+
|
|
80
|
+
For long-running tasks, externalize the target first:
|
|
81
|
+
|
|
82
|
+
```text
|
|
83
|
+
Web GPT or another external planning model produces a plan
|
|
84
|
+
-> plan acceptance checklist Skill produces a goal/target-mode prompt
|
|
85
|
+
-> Superpowers derives concrete implementation slices
|
|
86
|
+
-> each slice follows the workflow contract + project_context/**
|
|
87
|
+
```
|
|
88
|
+
|
|
89
|
+
The recommended Superpowers layer is the specific [obra/Superpowers](https://github.com/obra/superpowers) plugin/workflow, not a generic planning substitute. Use `superpowers:writing-plans` when the target-mode prompt or source plan still needs bite-sized implementation tasks, then prefer `superpowers:subagent-driven-development` when subagents are available and `superpowers:executing-plans` otherwise. Behavior changes should use `superpowers:test-driven-development`.
|
|
90
|
+
|
|
91
|
+
The reason is drift control. The workflow contract plus Context layer is intentionally a soft constraint. It works well for short tasks, and Context can still capture the expected facts for long tasks, but long execution makes the Context-to-code step drift as the context window grows, work is handed off, subagents split scope or validation loops multiply. A Web GPT plan, target-mode prompt, full acceptance checklist and Superpowers execution layer make the completion target recoverable without restoring a phase-gated workflow.
|
|
92
|
+
|
|
93
|
+
## Positioning
|
|
94
|
+
|
|
95
|
+
| Adjacent tool type | Use it for | Harness stance |
|
|
96
|
+
|---|---|---|
|
|
97
|
+
| Spec-first kits | Turning feature ideas into structured specs and plans. | Complementary; Harness keeps durable repo facts and module boundary intent beyond one feature spec. |
|
|
98
|
+
| BMAD-style workflows and full Tiny Context processes | Coordinated role/process ceremonies on high-risk work. | Lighter default; no phase gates or work-product trees. |
|
|
99
|
+
| Superpowers-style execution | Turning approved requirements into plans, subagent execution, TDD, review and finish discipline. | Complementary; use it to execute while Tiny Context owns durable repo intent and acceptance priority. |
|
|
100
|
+
| Task Master-style planners | Backlog decomposition and task execution state. | Complementary; Harness does not own task state. |
|
|
101
|
+
| Context7/Serena-style retrieval or code-intelligence tools | Pulling external docs, symbols or repository facts on demand. | Complementary; they do not answer whether downstream D may change upstream A/B. Harness stores that local repo truth. |
|
|
102
|
+
| IDE or agent memory | Tool-specific continuity inside one product surface. | Portable fallback; plain files any agent can read. |
|
|
79
103
|
|
|
80
104
|
## Try It In 60 Seconds
|
|
81
105
|
|
|
@@ -115,7 +139,7 @@ npm ci
|
|
|
115
139
|
npm run smoke:quickstart
|
|
116
140
|
npm run preview:pack
|
|
117
141
|
cd /path/to/your/test-repo
|
|
118
|
-
npm install -D /path/to/project-tiny-context-harness/tmp/ty-context/source-preview/package/project-tiny-context-harness-0.2.
|
|
142
|
+
npm install -D /path/to/project-tiny-context-harness/tmp/ty-context/source-preview/package/project-tiny-context-harness-0.2.63.tgz
|
|
119
143
|
npx --no-install ty-context init --adopt
|
|
120
144
|
make validate-context
|
|
121
145
|
```
|
package/assets/README.md
CHANGED
|
@@ -8,13 +8,13 @@
|
|
|
8
8
|
|
|
9
9
|
Translations: [Chinese (Simplified)](README.zh-CN.md)
|
|
10
10
|
|
|
11
|
-
Project Tiny Context Harness is repo-native project memory for AI coding agents.
|
|
12
|
-
|
|
13
|
-
`project-tiny-context-harness` ships Project Tiny Context Harness through the `ty-context` CLI. It installs **Minimal Context Harness**: a compact `project_context/**` fact source, a short `AGENTS.md` startup router, role Skills and a `validate-context` gate so fresh agents can recover project intent, boundaries, verification entry points and next safe actions quickly.
|
|
11
|
+
Project Tiny Context Harness is repo-native project memory for AI coding agents and a repo-native context contract.
|
|
12
|
+
|
|
13
|
+
`project-tiny-context-harness` ships Project Tiny Context Harness through the `ty-context` CLI. It installs **Minimal Context Harness**: a compact `project_context/**` fact source, a short `AGENTS.md` startup router, role Skills, priority guidance for Context/code/evidence, and a `validate-context` gate so fresh agents can recover project intent, boundaries, verification entry points and next safe actions quickly.
|
|
14
14
|
|
|
15
15
|
It is not another full Tiny Context ceremony. The Harness maintains context quality; project tests, reviews, CI and human acceptance still own product quality.
|
|
16
16
|
|
|
17
|
-
Think of it as durable project memory behind `AGENTS.md`, not another agent, process framework or task manager.
|
|
17
|
+
Think of it as durable project memory behind `AGENTS.md`, plus priority rules for Context/code/evidence, not another agent, process framework or task manager.
|
|
18
18
|
|
|
19
19
|
Best for:
|
|
20
20
|
|
|
@@ -94,7 +94,7 @@ That smoke packs the local workspace, installs it into a disposable repo, runs `
|
|
|
94
94
|
```sh
|
|
95
95
|
npm run preview:pack
|
|
96
96
|
cd /path/to/your/test-repo
|
|
97
|
-
npm install -D /path/to/project-tiny-context-harness/tmp/ty-context/source-preview/package/project-tiny-context-harness-0.2.
|
|
97
|
+
npm install -D /path/to/project-tiny-context-harness/tmp/ty-context/source-preview/package/project-tiny-context-harness-0.2.63.tgz
|
|
98
98
|
npx --no-install ty-context init --adopt
|
|
99
99
|
make validate-context
|
|
100
100
|
```
|
|
@@ -107,19 +107,43 @@ Use it when coding agents repeatedly lose project intent across new chats, hando
|
|
|
107
107
|
|
|
108
108
|
Coding agents can move quickly inside one thread and still drift when a new chat, model, tool, reviewer or debugging session loses the project-specific facts that were never encoded anywhere stable.
|
|
109
109
|
|
|
110
|
-
Most repositories already have README files, specs, tests and issue history, but fresh agents need a small, explicit recovery path: what the project is trying to do, what it must not do, where architecture boundaries live, how to validate changes and what durable facts changed after the last task. Minimal Context Harness makes that recovery path a first-class repo surface without adding a full planning ceremony.
|
|
111
|
-
|
|
112
|
-
The
|
|
113
|
-
|
|
114
|
-
|
|
115
|
-
|
|
116
|
-
|
|
117
|
-
|
|
118
|
-
|
|
119
|
-
|
|
120
|
-
|
|
121
|
-
|
|
122
|
-
|
|
110
|
+
Most repositories already have README files, specs, tests and issue history, but fresh agents need a small, explicit recovery path: what the project is trying to do, what it must not do, where architecture boundaries live, how to validate changes and what durable facts changed after the last task. Minimal Context Harness makes that recovery path a first-class repo surface without adding a full planning ceremony.
|
|
111
|
+
|
|
112
|
+
The concrete failure mode is not only "the agent did not read enough files." In an ABCD module chain where A/B/C are upstream of downstream D, a D feature may reveal a missing capability. Without Context, the agent can satisfy D by changing upstream A/B because the current code makes that path available. What is missing is a repo-owned intent layer that says whether D may change upstream A/B, whether the gap belongs in C's contract, or whether the task must stop for a `Context Delta` before implementation continues. Current code can show what is possible; it cannot decide whether that is allowed project intent.
|
|
113
|
+
|
|
114
|
+
The product lesson is: **keep the memory, drop the ceremony**. Earlier stage-based workflows externalized requirements, design, implementation, review, test and release into explicit phase artifacts. Modern coding agents already internalize much of that ordinary software loop. Project Tiny Context Harness keeps the useful part: the smallest high-density repo context that survives fresh chats without forcing every task through phase transitions, work-product validation or Tiny Context-stage context splits.
|
|
115
|
+
|
|
116
|
+
## Current Best Practice
|
|
117
|
+
|
|
118
|
+
For short tasks, use the workflow contract and Context layer directly:
|
|
119
|
+
|
|
120
|
+
```text
|
|
121
|
+
workflow contract + project_context/** -> implementation -> verification -> drift check
|
|
122
|
+
```
|
|
123
|
+
|
|
124
|
+
For long-running tasks, externalize the target first:
|
|
125
|
+
|
|
126
|
+
```text
|
|
127
|
+
Web GPT or another external planning model produces a plan
|
|
128
|
+
-> plan acceptance checklist Skill produces a goal/target-mode prompt
|
|
129
|
+
-> Superpowers derives concrete implementation slices
|
|
130
|
+
-> each slice follows the workflow contract + project_context/**
|
|
131
|
+
```
|
|
132
|
+
|
|
133
|
+
The recommended Superpowers layer is the specific [obra/Superpowers](https://github.com/obra/superpowers) plugin/workflow, not a generic planning substitute. Use `superpowers:writing-plans` when the target-mode prompt or source plan still needs bite-sized implementation tasks, then prefer `superpowers:subagent-driven-development` when subagents are available and `superpowers:executing-plans` otherwise. Behavior changes should use `superpowers:test-driven-development`.
|
|
134
|
+
|
|
135
|
+
The reason is drift control. The workflow contract plus Context layer is intentionally a soft constraint. It works well for short tasks, and Context can still capture the expected facts for long tasks, but long execution makes the Context-to-code step drift as the context window grows, work is handed off, subagents split scope or validation loops multiply. A Web GPT plan, target-mode prompt, full acceptance checklist and Superpowers execution layer make the completion target recoverable without restoring a phase-gated workflow.
|
|
136
|
+
|
|
137
|
+
## Positioning
|
|
138
|
+
|
|
139
|
+
| Adjacent tool type | Use it for | Harness stance |
|
|
140
|
+
|---|---|---|
|
|
141
|
+
| Spec-first kits | Turning a feature idea into structured specs and implementation plans. | Complementary. Specs can define a feature, but they do not automatically maintain repo-wide module boundary intent across every later task. |
|
|
142
|
+
| BMAD-style workflows and full Tiny Context processes | Coordinated role/process ceremonies on high-risk work. | Lighter default. Preserve context quality without shipping phase gates or work-product trees. |
|
|
143
|
+
| Superpowers-style execution | Turning approved requirements into plans, subagent execution, TDD, review and finish discipline. | Complementary. Use it to execute; keep Tiny Context as the durable repo intent and acceptance-priority layer. |
|
|
144
|
+
| Task Master-style planners | Backlog decomposition and task execution state. | Complementary. Harness does not own task state; it owns durable project memory and module boundary facts. |
|
|
145
|
+
| Context7/Serena-style retrieval or code-intelligence tools | Pulling external docs, symbols or repository facts on demand. | Complementary. They improve retrieval and editing, but do not answer whether downstream D may change upstream A/B; Harness keeps that local project truth in repo. |
|
|
146
|
+
| IDE or agent memory | Tool-specific continuity inside one product surface. | Portable fallback. Harness files are plain repo assets that any agent can read. |
|
|
123
147
|
|
|
124
148
|
## Try It In 60 Seconds
|
|
125
149
|
|
|
@@ -275,6 +299,8 @@ For high-risk product, UI/UX and engineering tasks that affect durable architect
|
|
|
275
299
|
Technical architecture support is a Minimal Context capability: use restrained `architecture.md`, area Module Design Capsules and existing `contract` / `decision-rationale` roles when durable architecture or rationale matters. Do not invent rationale; store stable reasons, rejected alternatives or tradeoffs only in the smallest durable Context surface when they will affect future implementation or verification choices.
|
|
276
300
|
|
|
277
301
|
For long-running plans, RFCs or implementation proposals, the plan acceptance checklist compiler can turn a plan plus relevant Context into a falsifiable acceptance checklist and a paste-ready goal/target-mode prompt. If the plan already contains an explicit concrete acceptance checklist, the Skill copies that checklist verbatim into a separate full-checklist file instead of generating a competing checklist. This is one pre-execution acceptance pass, not a task planner or workflow engine: it stores temporary inputs under `tmp/ty-context/plan-acceptance/**`, asks for confirmation when durable assumptions are unclear, and leaves execution evidence to the future executor, tests, CI, review or human acceptance. The generated prompt may require a local audit under the same temporary directory so future sessions can recover acceptance progress; that audit is not Context, not a quality proof and not a replacement for the project's Tiny Context workflow contract. The full checklist is the acceptance authority, while any compact prompt summary exists for navigation, priority and recovery after context compaction.
|
|
302
|
+
|
|
303
|
+
Important usage note: Minimal Context intentionally keeps Context read order, Context/code priority and drift checks as agent-level soft constraints rather than machine-enforced gates. That tradeoff works well for short tasks, but long tasks with large context windows, multiple handoffs or many verification loops are expected to drift unless the completion target is externalized. Use the plan acceptance checklist compiler before long-running execution when there is a plan-like source; treat the full checklist as the acceptance target, and treat the local audit only as temporary progress/recovery state.
|
|
278
304
|
|
|
279
305
|
For Product Surface work, `context_surface_contract` turns broad product/page/UI principles into project-owned surface responsibilities. A Product Surface can be a Web page, mobile screen, desktop window, game UI/HUD/menu, CLI/TUI output, extension UI or embedded/device interface. Cross-surface contracts use the existing `contract` role; area-owned screen facts stay in `area` or `subdomain`; repeatable validation paths use `verification`. The Harness does not add a new surface-specific role, does not create business surface contracts during `init` or `upgrade`, and does not turn surface conformance into a validator gate. Projects that want mandatory task blocks should add a separate project-local Skill, while `product-surface-contract.md` is only a compact managed template for optional Context authoring.
|
|
280
306
|
|
package/assets/README.zh-CN.md
CHANGED
|
@@ -2,15 +2,16 @@
|
|
|
2
2
|
|
|
3
3
|
[English README](README.md)
|
|
4
4
|
|
|
5
|
-
Project Tiny Context Harness 是给 AI coding agents
|
|
6
|
-
|
|
7
|
-
它不是新的全流程 Tiny Context 框架,也不是任务管理器。它做一件小事:把新会话 agent
|
|
5
|
+
Project Tiny Context Harness 是给 AI coding agents 用的轻量项目记忆层,也是 repo-native context contract。
|
|
6
|
+
|
|
7
|
+
它不是新的全流程 Tiny Context 框架,也不是任务管理器。它做一件小事:把新会话 agent 最容易丢掉、但又必须长期稳定保留的项目事实,以及 Context / 代码 / 验证证据之间的读取和变更优先级放进仓库里,让下一次聊天、交接、调试或换工具时不用从头重新发现。
|
|
8
8
|
|
|
9
9
|
一句话:
|
|
10
10
|
|
|
11
11
|
```text
|
|
12
|
-
Keep the memory. Drop the ceremony.
|
|
13
|
-
保留项目记忆,丢掉流程仪式感。
|
|
12
|
+
Keep the memory. Drop the ceremony.
|
|
13
|
+
保留项目记忆,丢掉流程仪式感。
|
|
14
|
+
同时保留 Context / 代码 / 验证证据之间的优先级契约。
|
|
14
15
|
```
|
|
15
16
|
|
|
16
17
|
## 它解决什么问题
|
|
@@ -22,9 +23,10 @@ Keep the memory. Drop the ceremony.
|
|
|
22
23
|
- 架构边界在哪里
|
|
23
24
|
- 哪些文件是事实源
|
|
24
25
|
- 改完以后应该跑什么验证
|
|
25
|
-
- 上一次任务留下了哪些长期约束
|
|
26
|
-
|
|
27
|
-
|
|
26
|
+
- 上一次任务留下了哪些长期约束
|
|
27
|
+
- Context、实现和验证证据冲突时谁优先
|
|
28
|
+
|
|
29
|
+
Project Tiny Context Harness 把这些内容压缩到几个 repo-native 文件里,并通过简单工作流约束 agent 先读 Context、判断是否 context-first、实现后做 drift check:
|
|
28
30
|
|
|
29
31
|
- `AGENTS.md`
|
|
30
32
|
- `project_context/context.toml`
|
|
@@ -44,13 +46,38 @@ Fresh agent 先读这些文件,再开始改代码。
|
|
|
44
46
|
- 现代 coding agents 已经内化了很多普通软件工程循环:理解、设计、实现、测试、修复。
|
|
45
47
|
- 真正值得保留下来的不是“每次任务都走完整流程”,而是“新 agent 能快速恢复项目长期事实”。
|
|
46
48
|
|
|
47
|
-
所以当前默认方向是 Minimal Context Harness:只维护高密度、长期有效、能帮助恢复上下文的项目事实。
|
|
48
|
-
|
|
49
|
+
所以当前默认方向是 Minimal Context Harness:只维护高密度、长期有效、能帮助恢复上下文的项目事实。
|
|
50
|
+
|
|
51
|
+
一个典型失败场景是 ABCD 模块链:A/B/C 是上游,D 是下游。现在做 D 的需求时发现能力缺口;如果没有 Context 和优先级约束,agent 很容易为了让 D 完成而去改上游 A/B,因为当前代码让这条路可行。但真正需要判断的是:D 是否有权改 A/B?缺口是不是属于 C 的契约?是否必须先声明 `Context Delta`,让项目意图变化被确认后再实现?代码能说明“现在怎么改得动”,不能说明“项目意图是否允许这样改”。Tiny Context 要补的就是这一层 repo 内长期事实和优先级契约。
|
|
52
|
+
|
|
49
53
|
对于长程任务,Harness 也提供一个轻量的计划验收清单 Skill:当用户明确给出或引用某份方案 / 计划 / RFC / implementation plan,并要求生成验收清单、完成定义或 goal/target 模式提示词时,它会把计划和验收清单临时放到 `tmp/ty-context/plan-acceptance/**`。如果方案里已经有明确、具体的“验收清单”,Skill 会直接复用那份清单并单独写入完整验收清单文件,不再另行生成一份竞争清单。这只是执行前的一次验收标准梳理,不执行计划、不证明完成,也不会把临时清单注册成 `project_context/**`。
|
|
50
|
-
|
|
51
|
-
|
|
52
|
-
|
|
53
|
-
|
|
54
|
+
|
|
55
|
+
重要使用提示:Minimal Context 有意把 Context 读取顺序、Context / 代码优先级和漂移检查保持为 agent 级软约束,而不是机器强制 gate。这个取舍适合短任务,但长任务、大上下文、多次交接或多轮验证时预期会漂移。遇到这类任务且已有方案/计划来源时,应先用计划验收清单 Skill 外化一个可证伪完成目标;完整验收清单才是验收标准,local audit 只是临时进度/恢复状态。
|
|
56
|
+
|
|
57
|
+
## 当前最佳实践
|
|
58
|
+
|
|
59
|
+
短程任务直接使用流程契约和 Context 层:
|
|
60
|
+
|
|
61
|
+
```text
|
|
62
|
+
流程契约 + project_context/** -> 实现 -> 验证 -> drift check
|
|
63
|
+
```
|
|
64
|
+
|
|
65
|
+
长程任务先外化目标,再进入实现:
|
|
66
|
+
|
|
67
|
+
```text
|
|
68
|
+
Web GPT 或其他外部规划模型产出方案
|
|
69
|
+
-> 计划验收清单 Skill 生成目标模式文本
|
|
70
|
+
-> Superpowers 得出具体落地执行片段
|
|
71
|
+
-> 每个执行片段都回到流程契约 + project_context/**
|
|
72
|
+
```
|
|
73
|
+
|
|
74
|
+
这里的 Superpowers 指具体的 [obra/Superpowers](https://github.com/obra/superpowers) 插件/开源工作流,不是泛化的执行规划替代品。如果目标模式文本或原方案还不够可执行,用 `superpowers:writing-plans` 转成 bite-sized implementation plan;有 subagent 支持时优先用 `superpowers:subagent-driven-development`,否则用 `superpowers:executing-plans`;涉及行为变更时用 `superpowers:test-driven-development`。
|
|
75
|
+
|
|
76
|
+
原因是漂移控制。流程契约 + Context 层是软约束,短任务里通常能让 agent 按预期执行;长程任务里,Context 仍然能记录符合预期的事实,但 Context 到代码 的实现步骤会随着上下文窗口变大、多次交接、subagent 拆分和多轮验证而漂移。Web GPT 方案、目标模式文本、完整验收清单和 Superpowers 执行层,把完成目标外化成可恢复、可审计的临时执行标准,同时不恢复阶段式 gate。
|
|
77
|
+
|
|
78
|
+
## 适合谁
|
|
79
|
+
|
|
80
|
+
适合:
|
|
54
81
|
|
|
55
82
|
- 经常用 Codex、Claude Code、Cursor、Gemini CLI、OpenCode 等 agent 改代码的项目。
|
|
56
83
|
- 经常开新 chat,agent 反复重新理解项目的项目。
|
|
@@ -38,9 +38,17 @@ The goal/target-mode prompt must be no longer than 3850 characters, including li
|
|
|
38
38
|
|
|
39
39
|
```text
|
|
40
40
|
实施计划: tmp/ty-context/plan-acceptance/<plan>.md(source/implementation plan,非验收证明)
|
|
41
|
-
可多开agent,agent名额不够了就关掉不用的。
|
|
42
41
|
完整验收清单: tmp/ty-context/plan-acceptance/<plan>-acceptance-checklist.md(该文件是完整验收标准,验收以这个为准。完成前必须逐项检查,不满足则继续实现。)
|
|
43
42
|
执行审计: tmp/ty-context/plan-acceptance/<plan>-local-audit.md(临时 progress state,非 Context/proof)
|
|
43
|
+
可多开agent,agent名额不够了就关掉不用的。
|
|
44
|
+
如果 Superpowers 未安装,先按当前平台官方 Superpowers 安装路径安装;若安装被权限/网络/平台限制阻塞,写入执行审计,不得把阻塞当完成。
|
|
45
|
+
如果 Superpowers 已安装,使用 Superpowers 执行本任务:
|
|
46
|
+
- 先读完整验收清单,验收以它为准;compact prompt 只负责 direction/priority/recovery navigation
|
|
47
|
+
- 若实施计划不够可执行,用 superpowers:writing-plans 转成 bite-sized implementation plan
|
|
48
|
+
- 有 subagent 支持时优先用 superpowers:subagent-driven-development;否则用 superpowers:executing-plans
|
|
49
|
+
- 行为变更使用 superpowers:test-driven-development;先写失败测试并观察失败,再写最小实现
|
|
50
|
+
- review / finish 不能覆盖完整验收清单;完整验收清单不满足则继续实现
|
|
51
|
+
- 每轮执行后更新 local audit,记录 AC 状态、当前证据、命令结果、blocker、deferred/narrowed scope、无效证据
|
|
44
52
|
<验收清单>
|
|
45
53
|
```
|
|
46
54
|
|
|
@@ -48,9 +56,17 @@ For English requests, use this shape:
|
|
|
48
56
|
|
|
49
57
|
```text
|
|
50
58
|
Plan: tmp/ty-context/plan-acceptance/<plan>.md (implementation/source plan, not acceptance proof)
|
|
51
|
-
You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
|
|
52
59
|
Full checklist: tmp/ty-context/plan-acceptance/<plan>-acceptance-checklist.md (complete acceptance standard; acceptance is judged against it; every item must be checked before completion)
|
|
53
60
|
Local audit: tmp/ty-context/plan-acceptance/<plan>-local-audit.md (temporary execution/progress state, not Context or proof)
|
|
61
|
+
You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
|
|
62
|
+
If Superpowers is not installed, install it through the current platform's official Superpowers installation path first; if installation is blocked by permissions, network or platform limits, record it in local audit and do not treat the blocker as completion.
|
|
63
|
+
If Superpowers is installed, Use Superpowers for this task:
|
|
64
|
+
- Read the full checklist first; acceptance is judged against it, while the compact prompt only provides direction, priority and recovery navigation.
|
|
65
|
+
- If the implementation plan is not executable enough, use superpowers:writing-plans to convert it into a bite-sized implementation plan.
|
|
66
|
+
- Prefer superpowers:subagent-driven-development when subagents are available; otherwise use superpowers:executing-plans.
|
|
67
|
+
- Use superpowers:test-driven-development for behavior changes; write a failing test, observe it fail, then write the minimal implementation.
|
|
68
|
+
- review / finish cannot override the full checklist; if the full checklist is not satisfied, continue implementation.
|
|
69
|
+
- update local audit after each execution round with AC status, current evidence, command results, blockers, deferred/narrowed scope and invalid evidence.
|
|
54
70
|
<acceptance checklist>
|
|
55
71
|
```
|
|
56
72
|
|
|
@@ -453,12 +469,14 @@ Hard requirements:
|
|
|
453
469
|
- The prompt must be no longer than 3850 characters including line breaks. Treat 3850 as the effective hard budget and preserve information density; do not drop required paths, core acceptance categories, blocker rules, evidence rules or false-completion traps merely to be short.
|
|
454
470
|
- The first line must identify the plan path.
|
|
455
471
|
- Use `实施计划: <path>` for Chinese prompts and `Plan: <path>` for English prompts. The line must say the plan is the implementation/source plan and not acceptance proof.
|
|
456
|
-
- The
|
|
472
|
+
- The prompt must identify the full checklist path immediately after the plan path and say it is the complete acceptance standard. Chinese prompts must include this exact sentence: `该文件是完整验收标准,验收以这个为准。完成前必须逐项检查,不满足则继续实现。` English prompts must say the full checklist is the complete acceptance standard, acceptance is judged against it, and every item must be checked before completion.
|
|
473
|
+
- The prompt must identify a local audit path, normally `tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md`, and require the future executor to read it before resuming, keep it current during execution, and use it only as target-mode acceptance progress state.
|
|
474
|
+
- After the plan/checklist/audit paths, include a resource lifecycle instruction: `可多开agent,agent名额不够了就关掉不用的。` for Chinese prompts or `You may use multiple agents; if agent slots run low, close idle or unnecessary agents.` for English prompts.
|
|
475
|
+
- The prompt must include a Superpowers execution block. If Superpowers is not installed, tell the executor to install it through the current platform's official Superpowers installation path; if installation is blocked by permissions, network or platform limits, record it in local audit and do not treat the blocker as completion. If Superpowers is installed, Use Superpowers for this task.
|
|
476
|
+
- The Superpowers block must require: read the full checklist first and make it the acceptance authority; use `superpowers:writing-plans` when the plan is not executable enough; prefer `superpowers:subagent-driven-development` when subagents are available; otherwise use `superpowers:executing-plans`; use `superpowers:test-driven-development` for behavior changes; review / finish cannot override the full checklist; update local audit after each execution round.
|
|
457
477
|
- The remaining content must be the acceptance checklist or a compact version of it.
|
|
458
478
|
- The prompt must be self-contained enough for goal/target-mode execution.
|
|
459
|
-
- The prompt must identify the full checklist path and say it is the complete acceptance standard. Chinese prompts must include this exact sentence: `该文件是完整验收标准,验收以这个为准。完成前必须逐项检查,不满足则继续实现。` English prompts must say the full checklist is the complete acceptance standard, acceptance is judged against it, and every item must be checked before completion.
|
|
460
479
|
- If the prompt uses a compact checklist summary, say the full checklist owns details and acceptance authority; the compact summary owns direction, priority and recovery navigation; overlap is allowed; conflicts are resolved in favor of the full checklist.
|
|
461
|
-
- The prompt must identify a local audit path, normally `tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md`, and require the future executor to read it before resuming, keep it current during execution, and use it only as target-mode acceptance progress state.
|
|
462
480
|
- The prompt must require the local audit to record overall status (`complete`, `incomplete`, `blocked` or `narrowed-scope-complete`), each core AC status and current evidence, commands with result/time/failure reason, artifact or evidence paths, blockers and missing evidence, acceptance impact, explicit deferred or narrowed scope, and stale/partial/smoke/dry-run/research evidence that cannot prove full completion.
|
|
463
481
|
- The prompt must say that local audit is not Context, not product-quality proof, not a global task manager, and not a replacement for project tests, CI, review, human acceptance, Task Contract or workflow-contract `plan.md`.
|
|
464
482
|
- The prompt must say that when a Task Contract or workflow-contract `plan.md` exists, each acceptance item execution still follows it and the repository's Tiny Context workflow contract.
|
|
@@ -473,14 +491,23 @@ Recommended compact Chinese prompt shape:
|
|
|
473
491
|
|
|
474
492
|
```text
|
|
475
493
|
实施计划: tmp/ty-context/plan-acceptance/<plan-slug>.md(source/implementation plan,非验收证明)
|
|
476
|
-
可多开agent,agent名额不够了就关掉不用的。
|
|
477
494
|
完整验收清单: tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md(该文件是完整验收标准,验收以这个为准。完成前必须逐项检查,不满足则继续实现。)
|
|
478
495
|
执行审计: tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md(临时 progress state,非 Context/proof)
|
|
496
|
+
可多开agent,agent名额不够了就关掉不用的。
|
|
479
497
|
本摘要只负责 direction/priority/recovery navigation;允许与完整 checklist 重叠,冲突时以完整 checklist 为准。
|
|
480
498
|
|
|
499
|
+
如果 Superpowers 未安装,先按当前平台官方 Superpowers 安装路径安装;若安装被权限/网络/平台限制阻塞,写入执行审计,不得把阻塞当完成。
|
|
500
|
+
如果 Superpowers 已安装,使用 Superpowers 执行本任务:
|
|
501
|
+
- 先读完整验收清单,验收以它为准
|
|
502
|
+
- 若实施计划不够可执行,用 superpowers:writing-plans 转成 bite-sized implementation plan
|
|
503
|
+
- 有 subagent 支持时优先用 superpowers:subagent-driven-development;否则用 superpowers:executing-plans
|
|
504
|
+
- 行为变更用 superpowers:test-driven-development;先写失败测试并观察失败,再写最小实现
|
|
505
|
+
- review / finish 不能覆盖完整验收清单;不满足则继续实现
|
|
506
|
+
- 每轮执行后更新 local audit,记录 AC 状态、证据、命令结果、blocker、deferred/narrowed scope、无效证据
|
|
507
|
+
|
|
481
508
|
验收清单:
|
|
482
509
|
AC1 <核心完成定义,包含验收证据>
|
|
483
|
-
AC2 <范围/清单/覆盖要求>
|
|
510
|
+
AC2 <范围/清单/覆盖要求>
|
|
484
511
|
AC3 <Context/架构/边界要求>
|
|
485
512
|
AC4 <核心实现行为要求>
|
|
486
513
|
AC5 <数据/API/接口/契约要求>
|
|
@@ -501,14 +528,23 @@ Recommended compact English prompt shape:
|
|
|
501
528
|
|
|
502
529
|
```text
|
|
503
530
|
Plan: tmp/ty-context/plan-acceptance/<plan-slug>.md (implementation/source plan, not acceptance proof)
|
|
504
|
-
You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
|
|
505
531
|
Full checklist: tmp/ty-context/plan-acceptance/<plan-slug>-acceptance-checklist.md (complete acceptance standard; acceptance is judged against it; every item must be checked before completion)
|
|
506
532
|
Local audit: tmp/ty-context/plan-acceptance/<plan-slug>-local-audit.md (temporary progress state, not Context or proof)
|
|
533
|
+
You may use multiple agents; if agent slots run low, close idle or unnecessary agents.
|
|
507
534
|
This summary is only direction, priority and recovery navigation; overlap with the full checklist is allowed, and the full checklist wins conflicts.
|
|
508
535
|
|
|
536
|
+
If Superpowers is not installed, install it through the current platform's official Superpowers installation path first; if installation is blocked by permissions, network or platform limits, record it in local audit and do not treat the blocker as completion.
|
|
537
|
+
If Superpowers is installed, Use Superpowers for this task:
|
|
538
|
+
- Read the full checklist first; acceptance is judged against it.
|
|
539
|
+
- If the plan is not executable enough, use superpowers:writing-plans for a bite-sized implementation plan.
|
|
540
|
+
- Prefer superpowers:subagent-driven-development when subagents are available; otherwise use superpowers:executing-plans.
|
|
541
|
+
- Use superpowers:test-driven-development for behavior changes; write a failing test, observe failure, then implement minimally.
|
|
542
|
+
- review / finish cannot override the full checklist; if unsatisfied, continue implementation.
|
|
543
|
+
- update local audit after each execution round with AC status, evidence, command results, blockers, deferred/narrowed scope and invalid evidence.
|
|
544
|
+
|
|
509
545
|
Acceptance checklist:
|
|
510
546
|
AC1 <core completion definition with required evidence>
|
|
511
|
-
AC2 <scope inventory and coverage>
|
|
547
|
+
AC2 <scope inventory and coverage>
|
|
512
548
|
AC3 <Context / architecture / boundary conformance>
|
|
513
549
|
AC4 <core implementation behavior>
|
|
514
550
|
AC5 <data / API / interface / contract requirements>
|