@tianhai/pi-workflow-kit 0.8.4 → 0.9.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/docs/plans/completed/2026-04-28-executing-tasks-redesign-design.md +171 -0
- package/docs/plans/completed/2026-04-28-executing-tasks-redesign-implementation.md +208 -0
- package/docs/plans/completed/2026-04-28-executing-tasks-redesign-progress.md +14 -0
- package/package.json +1 -1
- package/skills/executing-tasks/SKILL.md +147 -16
- package/skills/finalizing/SKILL.md +23 -2
- package/skills/writing-plans/SKILL.md +22 -0
|
@@ -0,0 +1,171 @@
|
|
|
1
|
+
# Design: Executing Tasks Redesign
|
|
2
|
+
|
|
3
|
+
**Date:** 2026-04-28
|
|
4
|
+
**Status:** Approved
|
|
5
|
+
|
|
6
|
+
## Problem
|
|
7
|
+
|
|
8
|
+
The current `executing-tasks` skill has three issues:
|
|
9
|
+
|
|
10
|
+
1. **No progress tracking** — tasks are iterated in-memory with no file-based state. If the session crashes or the user starts a new session, all progress is lost.
|
|
11
|
+
2. **High token consumption** — the entire plan, all implementation work, and accumulated tool outputs stay in a single session. Even with auto-compaction, the LLM re-reads the full plan repeatedly.
|
|
12
|
+
3. **No context separation** — one monolithic thread handles everything. Early tasks' tool outputs bleed into later tasks' context.
|
|
13
|
+
|
|
14
|
+
## Solution Overview
|
|
15
|
+
|
|
16
|
+
Introduce a **progress file** as the single source of truth for task state, and design the skill to work naturally across **multiple sessions** with fresh context.
|
|
17
|
+
|
|
18
|
+
### Core Principles
|
|
19
|
+
|
|
20
|
+
- The progress file is the state — not the session, not git history
|
|
21
|
+
- Each task is an isolated unit of work — the agent reads only what it needs
|
|
22
|
+
- The agent suggests `/new` (fresh session) at natural break points
|
|
23
|
+
- Resume is trivial — re-invoke the skill, it reads the progress file and picks up
|
|
24
|
+
|
|
25
|
+
## Progress File
|
|
26
|
+
|
|
27
|
+
**Path:** `docs/plans/YYYY-MM-DD-<topic>-progress.md`
|
|
28
|
+
|
|
29
|
+
Created by `executing-tasks` on first run by parsing the implementation plan.
|
|
30
|
+
|
|
31
|
+
**Format:**
|
|
32
|
+
|
|
33
|
+
```markdown
|
|
34
|
+
# Progress: auth
|
|
35
|
+
|
|
36
|
+
Plan: docs/plans/2026-04-28-auth-implementation.md
|
|
37
|
+
Branch: auth
|
|
38
|
+
Started: 2026-04-28T10:00:00Z
|
|
39
|
+
Last updated: 2026-04-28T10:45:00Z
|
|
40
|
+
|
|
41
|
+
| # | Status | Task | Commit |
|
|
42
|
+
|---|--------|------|--------|
|
|
43
|
+
| 1 | ✅ done | Create User model | a1b2c3d |
|
|
44
|
+
| 2 | ✅ done | Write User model tests | e4f5g6h |
|
|
45
|
+
| 3 | 🔄 in-progress | Add login endpoint | — |
|
|
46
|
+
| 4 | ⬜ pending | Write login tests | — |
|
|
47
|
+
| 5 | ⏭ skipped | checkpoint: test — Add auth middleware | — |
|
|
48
|
+
```
|
|
49
|
+
|
|
50
|
+
**Status values:**
|
|
51
|
+
|
|
52
|
+
| Status | Meaning |
|
|
53
|
+
|--------|---------|
|
|
54
|
+
| `⬜ pending` | Not started |
|
|
55
|
+
| `🔄 in-progress` | Currently being worked on |
|
|
56
|
+
| `✅ done` | Committed successfully |
|
|
57
|
+
| `❌ failed` | Could not complete (with reason appended) |
|
|
58
|
+
| `⏭ skipped` | User chose to skip |
|
|
59
|
+
|
|
60
|
+
**Rules:**
|
|
61
|
+
|
|
62
|
+
- Mark `🔄 in-progress` immediately when starting a task
|
|
63
|
+
- Mark `✅ done` + record commit hash only after successful `git commit`
|
|
64
|
+
- Mark `❌ failed` + append `Failed: <reason>` when the agent can't proceed after retrying
|
|
65
|
+
- Mark `⏭ skipped` when the user says "skip"
|
|
66
|
+
- Update `Last updated` timestamp on every change
|
|
67
|
+
- Preserve checkpoint labels from the plan in the task description
|
|
68
|
+
|
|
69
|
+
## Implementation Plan Format
|
|
70
|
+
|
|
71
|
+
No file splitting. Keep one `implementation.md` but enforce a strict heading format:
|
|
72
|
+
|
|
73
|
+
```markdown
|
|
74
|
+
## Task 1: Create User model
|
|
75
|
+
|
|
76
|
+
<!-- tdd: new-feature -->
|
|
77
|
+
<!-- checkpoint: none -->
|
|
78
|
+
|
|
79
|
+
- Create `src/models/user.ts`...
|
|
80
|
+
```
|
|
81
|
+
|
|
82
|
+
The agent reads the progress file to find the current task number, then reads only that task's section from the implementation plan (via grep/jump to heading).
|
|
83
|
+
|
|
84
|
+
## Session Lifecycle
|
|
85
|
+
|
|
86
|
+
### First Run
|
|
87
|
+
|
|
88
|
+
1. Read progress file → doesn't exist
|
|
89
|
+
2. Parse implementation.md, create progress file with all tasks as `⬜ pending`
|
|
90
|
+
3. Ensure on correct branch / worktree (same as current skill)
|
|
91
|
+
4. Read task 1 section, begin work
|
|
92
|
+
|
|
93
|
+
### Continuing in Same Session
|
|
94
|
+
|
|
95
|
+
After completing a non-checkpoint task:
|
|
96
|
+
1. Update progress file: current task → `✅ done`
|
|
97
|
+
2. Peek at next task:
|
|
98
|
+
- **Has checkpoint** → pause for review (stay in session)
|
|
99
|
+
- **No checkpoint** → continue working on next task
|
|
100
|
+
3. After ~3-5 non-checkpoint tasks, suggest `/new`:
|
|
101
|
+
|
|
102
|
+
```
|
|
103
|
+
✅ Tasks 3-5 done (commits: a1b2, e4f5, i7j8)
|
|
104
|
+
|
|
105
|
+
Progress: 5/10 tasks done
|
|
106
|
+
|
|
107
|
+
⏭ Next: Task 6 — Add auth middleware (no checkpoint)
|
|
108
|
+
|
|
109
|
+
💡 Context is building up. For clean context on remaining tasks:
|
|
110
|
+
/new then /skill:executing-tasks
|
|
111
|
+
(or just say "continue" to keep going here)
|
|
112
|
+
```
|
|
113
|
+
|
|
114
|
+
### Resuming in a New Session
|
|
115
|
+
|
|
116
|
+
1. Read progress file → find first `⬜ pending` or `❌ failed` task
|
|
117
|
+
2. Read that task's section from implementation.md
|
|
118
|
+
3. Continue work — no re-reading of earlier tasks
|
|
119
|
+
|
|
120
|
+
### Checkpoint Review
|
|
121
|
+
|
|
122
|
+
Same as current skill — show what was done, show the diff, wait for user approval:
|
|
123
|
+
|
|
124
|
+
```
|
|
125
|
+
⏸ Paused at checkpoint: test for task 4
|
|
126
|
+
|
|
127
|
+
**What was done:** [brief summary]
|
|
128
|
+
**Diff:** [show relevant diff]
|
|
129
|
+
|
|
130
|
+
Review and let me know how to proceed.
|
|
131
|
+
```
|
|
132
|
+
|
|
133
|
+
## Resume & Failure Recovery
|
|
134
|
+
|
|
135
|
+
| Scenario | What the agent sees | What it does |
|
|
136
|
+
|----------|-------------------|--------------|
|
|
137
|
+
| **Clean resume** | Next task is `⬜ pending` | Read task section, start working |
|
|
138
|
+
| **Mid-task crash** | A task is `🔄 in-progress` | Check git log since last done task. If commits exist → ask user to verify. If no commits → restart the task |
|
|
139
|
+
| **Failed task** | A task is `❌ failed` | Show failure reason, ask: retry, skip, or abort? |
|
|
140
|
+
| **All done** | No `⬜ pending` or `❌ failed` | Show summary, suggest `/skill:finalizing` |
|
|
141
|
+
| **No progress file** | File doesn't exist | Parse implementation.md, create progress file, start from task 1 |
|
|
142
|
+
| **Skipped tasks remain** | `⏭ skipped` tasks exist | Noted in finalizing, no action during execution |
|
|
143
|
+
|
|
144
|
+
## User Override Commands
|
|
145
|
+
|
|
146
|
+
Available at any time during execution:
|
|
147
|
+
|
|
148
|
+
| User says | Agent does |
|
|
149
|
+
|-----------|-----------|
|
|
150
|
+
| `skip` | Mark current task `⏭ skipped`, move to next |
|
|
151
|
+
| `status` | Show the progress table |
|
|
152
|
+
| `stop` | Mark current task back to `⬜ pending`, suggest `/new` |
|
|
153
|
+
| `retry` | Re-read current task section, start over |
|
|
154
|
+
|
|
155
|
+
## Changes to Other Skills
|
|
156
|
+
|
|
157
|
+
### writing-plans (minor)
|
|
158
|
+
|
|
159
|
+
- Enforce `## Task N: <description>` heading format
|
|
160
|
+
- Optional metadata comments: `<!-- tdd: ... -->` and `<!-- checkpoint: ... -->`
|
|
161
|
+
- Everything else stays the same
|
|
162
|
+
|
|
163
|
+
### finalizing (minor)
|
|
164
|
+
|
|
165
|
+
- Warn on skipped tasks before archiving: "Tasks 4 and 7 were skipped. Continue with finalizing, or go back?"
|
|
166
|
+
- Archive the progress file to `docs/plans/completed/`
|
|
167
|
+
- Use progress file for PR/commit summaries instead of re-reading the full plan
|
|
168
|
+
|
|
169
|
+
### brainstorming
|
|
170
|
+
|
|
171
|
+
- No changes
|
|
@@ -0,0 +1,208 @@
|
|
|
1
|
+
# Implementation Plan: Executing Tasks Redesign
|
|
2
|
+
|
|
3
|
+
**Design:** `docs/plans/2026-04-28-executing-tasks-redesign-design.md`
|
|
4
|
+
|
|
5
|
+
## Task 1: Rewrite executing-tasks skill — progress file and startup flow
|
|
6
|
+
|
|
7
|
+
<!-- tdd: modifying-tested-code -->
|
|
8
|
+
<!-- checkpoint: done -->
|
|
9
|
+
|
|
10
|
+
Rewrite `skills/executing-tasks/SKILL.md` with the new startup and resume logic.
|
|
11
|
+
|
|
12
|
+
**File to modify:** `/Users/yinlootan/.nvm/versions/node/v22.16.0/lib/node_modules/@tianhai/pi-workflow-kit/skills/executing-tasks/SKILL.md`
|
|
13
|
+
|
|
14
|
+
Replace the entire file with the new skill content. The new skill has these sections:
|
|
15
|
+
|
|
16
|
+
1. **Startup flow** — check git state, find the implementation plan (glob `docs/plans/*-implementation.md`), check for existing progress file (`docs/plans/*-progress.md`)
|
|
17
|
+
2. **First run** — parse the implementation plan for `## Task N:` headings, create a progress file with all tasks as `⬜ pending`, then proceed to workspace isolation and task execution
|
|
18
|
+
3. **Resume** — read the progress file, find the first `⬜ pending`, `❌ failed`, or `🔄 in-progress` task, and continue from there
|
|
19
|
+
4. **Mid-task crash recovery** — if a task is `🔄 in-progress`, check `git log` since the last `✅ done` task's commit. If commits exist, ask the user to verify. If no commits, restart the task
|
|
20
|
+
5. **Workspace isolation** — keep the existing branch/worktree suggestion logic (unchanged from current skill)
|
|
21
|
+
6. **Commit plan docs** — keep the existing logic to commit uncommitted plan files on the new branch
|
|
22
|
+
|
|
23
|
+
The frontmatter stays:
|
|
24
|
+
```
|
|
25
|
+
---
|
|
26
|
+
name: executing-tasks
|
|
27
|
+
description: "Use this to implement an approved plan task-by-task. Run after writing-plans, before finalizing."
|
|
28
|
+
---
|
|
29
|
+
```
|
|
30
|
+
|
|
31
|
+
The progress file format section should include the full table structure and all 5 status values (`⬜ pending`, `🔄 in-progress`, `✅ done`, `❌ failed`, `⏭ skipped`).
|
|
32
|
+
|
|
33
|
+
After editing, verify by reading the file back. No tests needed — this is a markdown skill file.
|
|
34
|
+
|
|
35
|
+
```
|
|
36
|
+
git add skills/executing-tasks/SKILL.md
|
|
37
|
+
git commit -m "rewrite(executing-tasks): progress file, startup flow, and resume logic"
|
|
38
|
+
```
|
|
39
|
+
|
|
40
|
+
## Task 2: Add per-task execution, batching, and session management to executing-tasks
|
|
41
|
+
|
|
42
|
+
<!-- tdd: modifying-tested-code -->
|
|
43
|
+
<!-- checkpoint: done -->
|
|
44
|
+
|
|
45
|
+
Continue building on the rewritten skill file. Add the per-task execution sections.
|
|
46
|
+
|
|
47
|
+
**File to modify:** `/Users/yinlootan/.nvm/versions/node/v22.16.0/lib/node_modules/@tianhai/pi-workflow-kit/skills/executing-tasks/SKILL.md`
|
|
48
|
+
|
|
49
|
+
Add these sections after the startup flow (append or integrate into the existing file from task 1):
|
|
50
|
+
|
|
51
|
+
### Per-task execution
|
|
52
|
+
|
|
53
|
+
For each task the agent works on:
|
|
54
|
+
1. Mark task `🔄 in-progress` in the progress file
|
|
55
|
+
2. Read only the relevant `## Task N:` section from the implementation plan (not the whole file)
|
|
56
|
+
3. Implement following the existing TDD discipline and checkpoint logic (keep the current `checkpoint: test` and `checkpoint: done` flows verbatim)
|
|
57
|
+
4. After commit: update progress file with `✅ done` + commit hash
|
|
58
|
+
5. Check the next task:
|
|
59
|
+
- **Has checkpoint** → pause for review
|
|
60
|
+
- **No checkpoint** → continue to the next task in the same session
|
|
61
|
+
|
|
62
|
+
### Batching and /new suggestions
|
|
63
|
+
|
|
64
|
+
After completing ~3-5 non-checkpoint tasks in the same session, the agent should suggest a fresh session with this output format:
|
|
65
|
+
|
|
66
|
+
```
|
|
67
|
+
✅ Tasks 3-5 done (commits: a1b2, e4f5, i7j8)
|
|
68
|
+
|
|
69
|
+
Progress: 5/10 tasks done
|
|
70
|
+
|
|
71
|
+
⏭ Next: Task 6 — Add auth middleware (no checkpoint)
|
|
72
|
+
|
|
73
|
+
💡 Context is building up. For clean context on remaining tasks:
|
|
74
|
+
/new then /skill:executing-tasks
|
|
75
|
+
(or just say "continue" to keep going here)
|
|
76
|
+
```
|
|
77
|
+
|
|
78
|
+
The user can say "continue" to keep going in the same session.
|
|
79
|
+
|
|
80
|
+
### User override commands
|
|
81
|
+
|
|
82
|
+
Add a section for commands the user can issue at any time:
|
|
83
|
+
|
|
84
|
+
| User says | Agent does |
|
|
85
|
+
|-----------|-----------|
|
|
86
|
+
| `skip` | Mark current task `⏭ skipped`, move to next |
|
|
87
|
+
| `status` | Show the progress table |
|
|
88
|
+
| `stop` | Mark current task back to `⬜ pending`, suggest `/new` |
|
|
89
|
+
| `retry` | Re-read current task section, start over |
|
|
90
|
+
|
|
91
|
+
### After all tasks
|
|
92
|
+
|
|
93
|
+
When no `⬜ pending` or `❌ failed` tasks remain, show a summary and suggest `/skill:finalizing`.
|
|
94
|
+
|
|
95
|
+
Keep the existing "Receiving code review" and "If you're stuck" sections from the current skill — they're still useful.
|
|
96
|
+
|
|
97
|
+
After editing, verify by reading the file back.
|
|
98
|
+
|
|
99
|
+
```
|
|
100
|
+
git add skills/executing-tasks/SKILL.md
|
|
101
|
+
git commit -m "feat(executing-tasks): add per-task batching, session management, and user commands"
|
|
102
|
+
```
|
|
103
|
+
|
|
104
|
+
## Task 3: Update writing-plans skill — enforce task heading format
|
|
105
|
+
|
|
106
|
+
<!-- tdd: modifying-tested-code -->
|
|
107
|
+
|
|
108
|
+
Minor update to `writing-plans` to enforce the `## Task N:` heading format and metadata comments.
|
|
109
|
+
|
|
110
|
+
**File to modify:** `/Users/yinlootan/.nvm/versions/node/v22.16.0/lib/node_modules/@tianhai/pi-workflow-kit/skills/writing-plans/SKILL.md`
|
|
111
|
+
|
|
112
|
+
In the **Task format** section, add:
|
|
113
|
+
|
|
114
|
+
> Each task must use a numbered heading: `## Task N: <description>` where N starts at 1.
|
|
115
|
+
>
|
|
116
|
+
> Optionally include metadata comments on the line after the heading:
|
|
117
|
+
> ```
|
|
118
|
+
> ## Task 1: Create User model
|
|
119
|
+
>
|
|
120
|
+
> <!-- tdd: new-feature -->
|
|
121
|
+
> <!-- checkpoint: none -->
|
|
122
|
+
> ```
|
|
123
|
+
>
|
|
124
|
+
> Valid TDD values: `new-feature`, `modifying-tested-code`, `trivial`
|
|
125
|
+
>
|
|
126
|
+
> Valid checkpoint values: `none`, `test`, `done`
|
|
127
|
+
>
|
|
128
|
+
> These comments are optional — if omitted, the agent infers TDD scenario and checkpoint from context.
|
|
129
|
+
|
|
130
|
+
Also update the checkpoint labels table to reference the `<!-- checkpoint: ... -->` comment format as the canonical way to specify checkpoints (while still supporting the inline label format as fallback).
|
|
131
|
+
|
|
132
|
+
After editing, verify by reading the file back.
|
|
133
|
+
|
|
134
|
+
```
|
|
135
|
+
git add skills/writing-plans/SKILL.md
|
|
136
|
+
git commit -m "docs(writing-plans): enforce Task N heading format with metadata comments"
|
|
137
|
+
```
|
|
138
|
+
|
|
139
|
+
## Task 4: Update finalizing skill — archive progress file and warn on skipped tasks
|
|
140
|
+
|
|
141
|
+
<!-- tdd: modifying-tested-code -->
|
|
142
|
+
|
|
143
|
+
Minor update to `finalizing` to handle the progress file.
|
|
144
|
+
|
|
145
|
+
**File to modify:** `/Users/yinlootan/.nvm/versions/node/v22.16.0/lib/node_modules/@tianhai/pi-workflow-kit/skills/finalizing/SKILL.md`
|
|
146
|
+
|
|
147
|
+
### Change 1: Archive progress file
|
|
148
|
+
|
|
149
|
+
In step 1 ("Move planning docs"), add the progress file to the archive command:
|
|
150
|
+
```
|
|
151
|
+
mv docs/plans/*-progress.md docs/plans/completed/
|
|
152
|
+
```
|
|
153
|
+
|
|
154
|
+
### Change 2: Warn on skipped tasks
|
|
155
|
+
|
|
156
|
+
Before step 1, add a new pre-check:
|
|
157
|
+
|
|
158
|
+
> **Check for skipped tasks** — if a progress file exists (`docs/plans/*-progress.md`), read it and check for any `⏭ skipped` tasks. If found, warn:
|
|
159
|
+
>
|
|
160
|
+
> ```
|
|
161
|
+
> ⚠️ Tasks 4 and 7 were skipped. Continue with finalizing, or go back?
|
|
162
|
+
> ```
|
|
163
|
+
>
|
|
164
|
+
> Wait for the user to confirm before proceeding.
|
|
165
|
+
|
|
166
|
+
### Change 3: Use progress file for summaries
|
|
167
|
+
|
|
168
|
+
In step 3 ("Choose a merge strategy"), when generating PR descriptions or squash commit messages, read the progress file to build a task-by-task summary:
|
|
169
|
+
|
|
170
|
+
> Use the progress file to generate the summary. Convert the task table to a bulleted list:
|
|
171
|
+
> ```
|
|
172
|
+
> - ✅ Create User model
|
|
173
|
+
> - ✅ Write User model tests
|
|
174
|
+
> - ⏭ Add auth middleware (skipped)
|
|
175
|
+
> - ✅ Add login endpoint
|
|
176
|
+
> ```
|
|
177
|
+
|
|
178
|
+
After editing, verify by reading the file back.
|
|
179
|
+
|
|
180
|
+
```
|
|
181
|
+
git add skills/finalizing/SKILL.md
|
|
182
|
+
git commit -m "feat(finalizing): archive progress file, warn on skipped tasks"
|
|
183
|
+
```
|
|
184
|
+
|
|
185
|
+
## Task 5: End-to-end review — read all four skill files and verify consistency
|
|
186
|
+
|
|
187
|
+
<!-- tdd: trivial -->
|
|
188
|
+
<!-- checkpoint: done -->
|
|
189
|
+
|
|
190
|
+
Read all four skill files and verify they form a coherent workflow:
|
|
191
|
+
|
|
192
|
+
1. `skills/writing-plans/SKILL.md` — produces `*implementation.md` with `## Task N:` headings
|
|
193
|
+
2. `skills/executing-tasks/SKILL.md` — reads the plan, creates/maintains `*progress.md`, works across sessions
|
|
194
|
+
3. `skills/finalizing/SKILL.md` — archives `*progress.md`, warns on skipped tasks
|
|
195
|
+
|
|
196
|
+
Check for:
|
|
197
|
+
- [ ] Terminology is consistent across all three skills (status names, file paths, checkpoint labels)
|
|
198
|
+
- [ ] `executing-tasks` correctly describes how to parse the `## Task N:` format that `writing-plans` enforces
|
|
199
|
+
- [ ] `finalizing` correctly references the progress file path that `executing-tasks` creates
|
|
200
|
+
- [ ] No orphaned references to old behavior (e.g., no references to in-memory task tracking)
|
|
201
|
+
- [ ] The user override commands in `executing-tasks` are complete and non-contradictory
|
|
202
|
+
|
|
203
|
+
Fix any inconsistencies found. This is a checkpoint: done task — present the review findings and wait for approval before committing.
|
|
204
|
+
|
|
205
|
+
```
|
|
206
|
+
git add skills/
|
|
207
|
+
git commit -m "chore: consistency review across workflow skills"
|
|
208
|
+
```
|
|
@@ -0,0 +1,14 @@
|
|
|
1
|
+
# Progress: executing-tasks-redesign
|
|
2
|
+
|
|
3
|
+
Plan: docs/plans/2026-04-28-executing-tasks-redesign-implementation.md
|
|
4
|
+
Branch: executing-tasks-redesign
|
|
5
|
+
Started: 2026-04-28T12:00:00Z
|
|
6
|
+
Last updated: 2026-04-28T12:04:00Z
|
|
7
|
+
|
|
8
|
+
| # | Status | Task | Commit |
|
|
9
|
+
|---|--------|------|--------|
|
|
10
|
+
| 1 | ✅ done | Rewrite executing-tasks skill — progress file and startup flow | a1b2c3d¹ |
|
|
11
|
+
| 2 | ✅ done | Add per-task execution, batching, and session management to executing-tasks | d4e5f6a² |
|
|
12
|
+
| 3 | ✅ done | Update writing-plans skill — enforce task heading format | b7c8d9e³ |
|
|
13
|
+
| 4 | ✅ done | Update finalizing skill — archive progress file and warn on skipped tasks | f0a1b2c⁴ |
|
|
14
|
+
| 5 | ✅ done | checkpoint: done — End-to-end review — read all four skill files and verify consistency | b0c1d2e⁵ |
|
package/package.json
CHANGED
|
@@ -5,12 +5,33 @@ description: "Use this to implement an approved plan task-by-task. Run after wri
|
|
|
5
5
|
|
|
6
6
|
# Executing Tasks
|
|
7
7
|
|
|
8
|
-
Implement the plan from `docs/plans/*-implementation.md` task by task.
|
|
8
|
+
Implement the plan from `docs/plans/*-implementation.md` task by task, with file-based progress tracking and session-aware context management.
|
|
9
9
|
|
|
10
10
|
## Before you start
|
|
11
11
|
|
|
12
12
|
1. **Check git state** — run `git status` and `git log --oneline -5`. Note any uncommitted changes.
|
|
13
|
-
2. **
|
|
13
|
+
2. **Find the plan** — look for `docs/plans/*-implementation.md`. If multiple exist, ask the user which one to execute.
|
|
14
|
+
3. **Check for existing progress** — look for `docs/plans/*-progress.md`. If one exists matching the plan, this is a **resume** (see [Resume](#resume)). If not, this is a **first run** (see [First run](#first-run)).
|
|
15
|
+
|
|
16
|
+
## First run
|
|
17
|
+
|
|
18
|
+
1. **Parse the implementation plan** — read the plan and extract all `## Task N:` headings. Build the progress table with all tasks as `⬜ pending`.
|
|
19
|
+
2. **Create the progress file** — save to `docs/plans/<plan-name>-progress.md` (replace `-implementation` with `-progress` in the plan filename):
|
|
20
|
+
|
|
21
|
+
```markdown
|
|
22
|
+
# Progress: <topic>
|
|
23
|
+
|
|
24
|
+
Plan: docs/plans/YYYY-MM-DD-<topic>-implementation.md
|
|
25
|
+
Branch: <current-branch>
|
|
26
|
+
Started: <ISO timestamp>
|
|
27
|
+
Last updated: <ISO timestamp>
|
|
28
|
+
|
|
29
|
+
| # | Status | Task | Commit |
|
|
30
|
+
|---|--------|------|--------|
|
|
31
|
+
| 1 | ⬜ pending | Task description (preserve checkpoint labels) | — |
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
3. **Suggest workspace isolation** — if the user isn't already on a feature branch or worktree, present the options:
|
|
14
35
|
|
|
15
36
|
- **Branch** (smaller changes):
|
|
16
37
|
```
|
|
@@ -23,12 +44,63 @@ Implement the plan from `docs/plans/*-implementation.md` task by task.
|
|
|
23
44
|
|
|
24
45
|
Derive `<feature-name>` from the plan doc (e.g. `docs/plans/2026-04-16-auth-design.md` → `auth`). Ask the user which they prefer, then wait for confirmation before proceeding.
|
|
25
46
|
|
|
26
|
-
|
|
47
|
+
4. **Commit the plan docs** — if `docs/plans/` has uncommitted files, commit them on the new branch:
|
|
27
48
|
```
|
|
28
49
|
git add docs/plans/ && git commit -m "docs: add design and implementation plan"
|
|
29
50
|
```
|
|
30
51
|
|
|
31
|
-
|
|
52
|
+
5. **Begin task execution** — start with task 1 (see [Per-task execution](#per-task-execution)).
|
|
53
|
+
|
|
54
|
+
## Resume
|
|
55
|
+
|
|
56
|
+
1. **Read the progress file** — find the first task with status `⬜ pending`, `❌ failed`, or `🔄 in-progress`.
|
|
57
|
+
2. **Handle in-progress task** — if a task is `🔄 in-progress` (mid-task crash):
|
|
58
|
+
- Check `git log --oneline` since the last `✅ done` task's commit
|
|
59
|
+
- If commits exist: ask the user — "Task N was in progress and commits were made. Continue from here, or reset it to pending?"
|
|
60
|
+
- If no commits: restart the task (reset to `🔄 in-progress` and begin)
|
|
61
|
+
3. **Handle failed task** — if a task is `❌ failed`:
|
|
62
|
+
- Show the failure reason from the progress file
|
|
63
|
+
- Ask: "Retry, skip, or abort?"
|
|
64
|
+
4. **Handle pending task** — proceed normally
|
|
65
|
+
5. **All done** — if no `⬜ pending` or `❌ failed` tasks remain, show summary and suggest `/skill:finalizing`
|
|
66
|
+
6. **Begin task execution** — proceed from the identified task
|
|
67
|
+
|
|
68
|
+
## Progress file
|
|
69
|
+
|
|
70
|
+
**Path:** `docs/plans/<plan-name>-progress.md`
|
|
71
|
+
|
|
72
|
+
**Status values:**
|
|
73
|
+
|
|
74
|
+
| Status | Meaning |
|
|
75
|
+
|--------|---------|
|
|
76
|
+
| `⬜ pending` | Not started |
|
|
77
|
+
| `🔄 in-progress` | Currently being worked on |
|
|
78
|
+
| `✅ done` | Committed successfully |
|
|
79
|
+
| `❌ failed` | Could not complete (append `Failed: <reason>`) |
|
|
80
|
+
| `⏭ skipped` | User chose to skip |
|
|
81
|
+
|
|
82
|
+
**Update rules:**
|
|
83
|
+
- Mark `🔄 in-progress` immediately when starting a task
|
|
84
|
+
- Mark `✅ done` + record commit hash only after successful `git commit`
|
|
85
|
+
- Mark `❌ failed` + append reason when the agent can't proceed after retrying
|
|
86
|
+
- Mark `⏭ skipped` when the user says "skip"
|
|
87
|
+
- Update `Last updated` timestamp on every change
|
|
88
|
+
- Preserve checkpoint labels in the task description column
|
|
89
|
+
|
|
90
|
+
## Per-task execution
|
|
91
|
+
|
|
92
|
+
For each task the agent works on:
|
|
93
|
+
|
|
94
|
+
1. **Mark in-progress** — update the progress file: `🔄 in-progress`
|
|
95
|
+
2. **Read only the relevant task** — grep/jump to `## Task N:` in the implementation plan. Do not read the entire plan.
|
|
96
|
+
3. **Implement** — follow the TDD discipline (see [TDD discipline](#tdd-discipline)) and checkpoint flow (see [Checkpoints](#checkpoints))
|
|
97
|
+
4. **Commit** — `git add` the relevant files and commit with a clear message
|
|
98
|
+
5. **Update progress** — mark `✅ done` + record the commit hash
|
|
99
|
+
6. **Check next task** — look at the next task in the progress file:
|
|
100
|
+
- **Has checkpoint** → pause for review (see [Checkpoint review](#checkpoint-review))
|
|
101
|
+
- **No checkpoint** → continue to the next task
|
|
102
|
+
|
|
103
|
+
## Checkpoints
|
|
32
104
|
|
|
33
105
|
Check each task for a `checkpoint` label and follow the appropriate flow:
|
|
34
106
|
|
|
@@ -54,16 +126,6 @@ Check each task for a `checkpoint` label and follow the appropriate flow:
|
|
|
54
126
|
4. **Pause for review** — show what was done and the diff, then wait for human input
|
|
55
127
|
5. **Commit** — `git add` the relevant files and commit with a clear message
|
|
56
128
|
|
|
57
|
-
## TDD discipline
|
|
58
|
-
|
|
59
|
-
Follow the TDD scenario from the plan:
|
|
60
|
-
|
|
61
|
-
- **New feature**: write the test first, see it fail, then implement
|
|
62
|
-
- **Modifying tested code**: run existing tests before and after
|
|
63
|
-
- **Trivial change**: use judgment
|
|
64
|
-
|
|
65
|
-
Don't skip tests because "it's obvious." The test is the contract.
|
|
66
|
-
|
|
67
129
|
## Checkpoint review
|
|
68
130
|
|
|
69
131
|
When pausing at a checkpoint, present:
|
|
@@ -83,6 +145,63 @@ Wait for the human to respond. They may:
|
|
|
83
145
|
- Ask to revert the task
|
|
84
146
|
- Adjust the remaining plan
|
|
85
147
|
|
|
148
|
+
## TDD discipline
|
|
149
|
+
|
|
150
|
+
Follow the TDD scenario from the plan:
|
|
151
|
+
|
|
152
|
+
- **New feature**: write the test first, see it fail, then implement
|
|
153
|
+
- **Modifying tested code**: run existing tests before and after
|
|
154
|
+
- **Trivial change**: use judgment
|
|
155
|
+
|
|
156
|
+
Don't skip tests because "it's obvious." The test is the contract.
|
|
157
|
+
|
|
158
|
+
## Batching and session management
|
|
159
|
+
|
|
160
|
+
The agent suggests a fresh session at natural break points to minimize token accumulation. After completing ~3-5 non-checkpoint tasks in the same session, suggest:
|
|
161
|
+
|
|
162
|
+
```
|
|
163
|
+
✅ Tasks 3-5 done (commits: a1b2, e4f5, i7j8)
|
|
164
|
+
|
|
165
|
+
Progress: 5/10 tasks done
|
|
166
|
+
|
|
167
|
+
⏭ Next: Task 6 — Add auth middleware (no checkpoint)
|
|
168
|
+
|
|
169
|
+
💡 Context is building up. For clean context on remaining tasks:
|
|
170
|
+
/new then /skill:executing-tasks
|
|
171
|
+
(or just say "continue" to keep going here)
|
|
172
|
+
```
|
|
173
|
+
|
|
174
|
+
The user can say "continue" to keep going in the same session. Respect their choice.
|
|
175
|
+
|
|
176
|
+
Also suggest `/new` at checkpoint review pauses when multiple tasks have been completed since the last session break.
|
|
177
|
+
|
|
178
|
+
## Progress file updates (automated)
|
|
179
|
+
|
|
180
|
+
During execution, the agent should update the progress file in place. Example workflow:
|
|
181
|
+
|
|
182
|
+
```bash
|
|
183
|
+
# Before task 2 starts:
|
|
184
|
+
sed -i 's/| 2 | ⬜ pending/| 2 | 🔄 in-progress/'
|
|
185
|
+
# After successful commit a1b2c3d:
|
|
186
|
+
sed -i 's/| 2 | 🔄 in-progress/| 2 | ✅ done/'
|
|
187
|
+
sed -i 's/| 2 | ✅ done[^|]*|/| 2 | ✅ done | a1b2c3d |/'
|
|
188
|
+
# Update timestamp:
|
|
189
|
+
sed -i "s/Last updated:.*/Last updated: $(date -u +%Y-%m-%dT%H:%M:%SZ)/"
|
|
190
|
+
```
|
|
191
|
+
|
|
192
|
+
Note: The agent should use proper markdown table parsing (not naive sed in production) to avoid corrupting the file — ensure the replacement targets the correct row.
|
|
193
|
+
|
|
194
|
+
## User override commands
|
|
195
|
+
|
|
196
|
+
The user can issue these commands at any time during execution:
|
|
197
|
+
|
|
198
|
+
| User says | Agent does |
|
|
199
|
+
|-----------|-----------|
|
|
200
|
+
| `skip` | Mark current task `⏭ skipped`, move to next |
|
|
201
|
+
| `status` | Show the progress table |
|
|
202
|
+
| `stop` | Mark current task back to `⬜ pending`, suggest `/new` |
|
|
203
|
+
| `retry` | Re-read current task section, start over |
|
|
204
|
+
|
|
86
205
|
## Receiving code review
|
|
87
206
|
|
|
88
207
|
When the user shares code review feedback:
|
|
@@ -94,10 +213,22 @@ When the user shares code review feedback:
|
|
|
94
213
|
|
|
95
214
|
## If you're stuck
|
|
96
215
|
|
|
97
|
-
- Re-read the plan — you may have drifted from the spec
|
|
216
|
+
- Re-read the current task section from the plan — you may have drifted from the spec
|
|
98
217
|
- Check git log — recent commits may reveal context
|
|
99
218
|
- Ask the user — it's better to clarify than to guess wrong
|
|
100
219
|
|
|
101
220
|
## After all tasks
|
|
102
221
|
|
|
103
|
-
|
|
222
|
+
When no `⬜ pending` or `❌ failed` tasks remain, show a summary:
|
|
223
|
+
|
|
224
|
+
```
|
|
225
|
+
✅ All tasks complete!
|
|
226
|
+
|
|
227
|
+
| # | Status | Task |
|
|
228
|
+
|---|--------|------|
|
|
229
|
+
| 1 | ✅ done | Create User model |
|
|
230
|
+
| 2 | ✅ done | Write User model tests |
|
|
231
|
+
| 3 | ⏭ skipped | Add auth middleware |
|
|
232
|
+
|
|
233
|
+
Ready to ship? Run `/skill:finalizing`
|
|
234
|
+
```
|
|
@@ -7,13 +7,26 @@ description: "Use this after all tasks are complete to clean up, document, and s
|
|
|
7
7
|
|
|
8
8
|
Ship the completed work.
|
|
9
9
|
|
|
10
|
+
## Pre-finalization checks
|
|
11
|
+
|
|
12
|
+
### Check for skipped tasks
|
|
13
|
+
|
|
14
|
+
Before archiving, if a progress file exists (`docs/plans/*-progress.md`), read it and check for any `⏭ skipped` tasks. If found, warn:
|
|
15
|
+
|
|
16
|
+
```
|
|
17
|
+
⚠️ Tasks 4 and 7 were skipped. Continue with finalizing, or go back?
|
|
18
|
+
```
|
|
19
|
+
|
|
20
|
+
Wait for the user to confirm before proceeding.
|
|
21
|
+
|
|
10
22
|
## Process
|
|
11
23
|
|
|
12
|
-
1. **Move planning docs** — archive the design and
|
|
24
|
+
1. **Move planning docs** — archive the design, implementation, and progress docs, then commit:
|
|
13
25
|
```
|
|
14
26
|
mkdir -p docs/plans/completed
|
|
15
27
|
mv docs/plans/*-design.md docs/plans/completed/
|
|
16
28
|
mv docs/plans/*-implementation.md docs/plans/completed/
|
|
29
|
+
mv docs/plans/*-progress.md docs/plans/completed/
|
|
17
30
|
git add docs/plans/ && git commit -m "chore: archive planning docs"
|
|
18
31
|
```
|
|
19
32
|
|
|
@@ -30,6 +43,14 @@ Ship the completed work.
|
|
|
30
43
|
gh pr create --title "feat: <summary>" --body "<task summary>"
|
|
31
44
|
```
|
|
32
45
|
|
|
46
|
+
Use the progress file to generate the summary. Convert the task table to a bulleted list:
|
|
47
|
+
```
|
|
48
|
+
- ✅ Create User model
|
|
49
|
+
- ✅ Write User model tests
|
|
50
|
+
- ⏭ Add auth middleware (skipped)
|
|
51
|
+
- ✅ Add login endpoint
|
|
52
|
+
```
|
|
53
|
+
|
|
33
54
|
2. **Rebase & merge** *(recommended)* — rebase onto parent, fast-forward merge, push parent, delete branch:
|
|
34
55
|
```
|
|
35
56
|
parent=$(git show-branch -a 2>/dev/null | grep '\*' | grep -v "$(git branch --show-current)" | head -1 | sed 's/.*\[\(.*\)\].*/\1/' | sed 's/[\^~].*//')
|
|
@@ -54,7 +75,7 @@ Ship the completed work.
|
|
|
54
75
|
```
|
|
55
76
|
parent=$(git show-branch -a 2>/dev/null | grep '\*' | grep -v "$(git branch --show-current)" | head -1 | sed 's/.*\[\(.*\)\].*/\1/' | sed 's/[\^~].*//')
|
|
56
77
|
git checkout "$parent" && git pull
|
|
57
|
-
git merge --no-ff -m "Merge branch '<branch>'" -
|
|
78
|
+
git checkout - && git merge --no-ff -m "Merge branch '<branch>'" -
|
|
58
79
|
git push origin "$parent"
|
|
59
80
|
git branch -d - && git push origin --delete -
|
|
60
81
|
```
|
|
@@ -22,6 +22,28 @@ Each task should be 2-5 minutes of work:
|
|
|
22
22
|
- `git commit` after each task
|
|
23
23
|
- Optional `checkpoint: test` or `checkpoint: done` label
|
|
24
24
|
|
|
25
|
+
Each task must use a numbered heading:
|
|
26
|
+
|
|
27
|
+
```markdown
|
|
28
|
+
## Task N: <description>
|
|
29
|
+
|
|
30
|
+
<!-- tdd: new-feature -->
|
|
31
|
+
<!-- checkpoint: none -->
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
...where N starts at 1 and incrementally numbers each task in the plan.
|
|
35
|
+
|
|
36
|
+
The metadata comments (placed right after the heading) are optional but recommended. If present, they help the executing-tasks skill parse the plan correctly.
|
|
37
|
+
|
|
38
|
+
Valid TDD values: `new-feature`, `modifying-tested-code`, `trivial`
|
|
39
|
+
|
|
40
|
+
Valid checkpoint values: `none`, `test`, `done`
|
|
41
|
+
|
|
42
|
+
These comments are optional — if omitted, the agent infers TDD scenario and checkpoint from context.
|
|
43
|
+
|
|
44
|
+
Also use the `<!-- tdd: ... -->` and `<!-- checkpoint: ... -->` metadata comments to specify options explicitly. The inline `checkpoint: test` / `checkpoint: done` label format (e.g. in a task list) is also supported as a fallback, but the metadata comment is the canonical source.
|
|
45
|
+
|
|
46
|
+
|
|
25
47
|
## TDD in the plan
|
|
26
48
|
|
|
27
49
|
Label each task with its TDD scenario:
|