waypoint-codex 0.9.9 → 0.10.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -137,6 +137,7 @@ Waypoint ships a strong default skill pack for real coding work:
137
137
  - `docs-sync`
138
138
  - `code-guide-audit`
139
139
  - `break-it-qa`
140
+ - `conversation-retrospective`
140
141
  - `frontend-ship-audit`
141
142
  - `backend-ship-audit`
142
143
  - `workspace-compress`
@@ -144,7 +145,7 @@ Waypoint ships a strong default skill pack for real coding work:
144
145
  - `pr-review`
145
146
 
146
147
  These are repo-local, so the workflow travels with the project.
147
- `break-it-qa`, `frontend-ship-audit`, and `backend-ship-audit` are user-invoked audit skills, not default autonomous agent steps.
148
+ `conversation-retrospective`, `break-it-qa`, `frontend-ship-audit`, and `backend-ship-audit` are on-demand skills, not default autonomous agent steps.
148
149
 
149
150
  ## Reviewer agents
150
151
 
@@ -158,6 +159,8 @@ The intended workflow is closeout-based: run `code-reviewer` before considering
158
159
 
159
160
  For planning work, run `plan-reviewer` before presenting a non-trivial implementation plan to the user and iterate until it has no meaningful review findings left.
160
161
 
162
+ When the user approves a reviewed plan or explicitly says to proceed, the intended Waypoint behavior is autonomous execution: keep going through implementation, verification, review, and repo-memory updates unless a real blocker or materially risky unresolved decision requires a pause. If reviewers, subagents, CI, or other external work are still running, Waypoint should wait as long as necessary rather than interrupting them for speed.
163
+
161
164
  ## What makes it different
162
165
 
163
166
  Waypoint is not trying to hide everything behind hooks and background machinery.
package/dist/src/core.js CHANGED
@@ -383,6 +383,7 @@ export function doctorRepository(projectRoot) {
383
383
  "docs-sync",
384
384
  "code-guide-audit",
385
385
  "break-it-qa",
386
+ "conversation-retrospective",
386
387
  "workspace-compress",
387
388
  "pre-pr-hygiene",
388
389
  "pr-review",
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "waypoint-codex",
3
- "version": "0.9.9",
3
+ "version": "0.10.1",
4
4
  "description": "Codex-native repository operating system: scaffolding, docs routing, repo-local skills, doctor, and sync.",
5
5
  "license": "MIT",
6
6
  "type": "module",
@@ -0,0 +1,98 @@
1
+ ---
2
+ name: conversation-retrospective
3
+ description: Analyze the active conversation for durable repo knowledge, skill improvements, and repeated workflow patterns. Use when the user asks to save what was learned from the current conversation, update memory/docs without more prompting, improve skills that were used or exposed gaps, or propose new skills based on repetitive work in the live thread.
4
+ ---
5
+
6
+ # Conversation Retrospective
7
+
8
+ Use this skill to harvest the active conversation into the repo's existing memory system.
9
+
10
+ This skill works from the live conversation already in context. Do not go hunting through archived session files unless the user explicitly asks for that.
11
+
12
+ This is a closeout and distillation workflow, not a generic planning pass or a broad docs audit.
13
+
14
+ ## Read First
15
+
16
+ Before persisting anything:
17
+
18
+ 1. Read the repo's main agent guidance and project-context files
19
+ 2. Read the repo's current durable memory surfaces, such as docs, workspace/handoff files, trackers, decision logs, or knowledge files
20
+ 3. Read the exact docs, notes, and skill files that the conversation touched
21
+
22
+ Do not assume the repo uses Waypoint. Adapt to the memory structure that already exists.
23
+
24
+ ## Step 1: Distill Durable Knowledge
25
+
26
+ Review the current conversation and separate:
27
+
28
+ - durable project knowledge
29
+ - live execution state
30
+ - transient chatter
31
+
32
+ Persist without asking follow-up questions when the correct destination is clear.
33
+
34
+ Write durable knowledge to the smallest truthful home the repo already uses:
35
+
36
+ - the main docs or knowledge layer for architecture, behavior, decisions, debugging knowledge, durable plans, and reusable operating guidance
37
+ - the repo's standing guidance file for durable project context or long-lived working rules
38
+ - the repo's live handoff or workspace file for current state, blockers, and immediate next steps
39
+ - the repo's tracker or execution-log layer when the conversation created or materially changed a long-running workstream
40
+
41
+ If the repo uses doc metadata such as `last_updated`, refresh it when needed.
42
+
43
+ If the repo has no obvious durable home but the need is clear, create the smallest coherent doc or note that fits the surrounding patterns instead of leaving the learning only in chat.
44
+
45
+ Do not leave important truths only in chat.
46
+
47
+ ## Step 2: Improve Existing Skills
48
+
49
+ Identify which skills were actually used in this conversation, or which existing skills clearly should have covered the workflow but left avoidable gaps.
50
+
51
+ For each affected skill:
52
+
53
+ - read the existing skill before editing it
54
+ - update only reusable guidance, not one-off transcript details
55
+ - add missing guardrails, path hints, failure modes, decision rules, or references that would have made the conversation easier to complete
56
+ - keep `SKILL.md` concise; prefer targeted structural improvements over turning the skill into a diary
57
+
58
+ If the environment has both a source-of-truth skill and one or more mirrored or installed copies, update the source-of-truth version and any copies the user expects to stay in sync.
59
+
60
+ Do not assume there is only one skill location, and do not assume there are many.
61
+
62
+ ## Step 3: Propose New Skills
63
+
64
+ When the conversation revealed repetitive work that existing skills do not cover well:
65
+
66
+ - do not silently scaffold a new skill unless the user asked for implementation
67
+ - record the proposal in the repo's existing docs or idea-capture layer
68
+
69
+ If there is no obvious place for durable skill proposals, create a small doc such as `skill-ideas.md` in the repo's normal docs area.
70
+
71
+ Each proposal should include:
72
+
73
+ - the repeated workflow or problem
74
+ - likely trigger phrases
75
+ - expected outputs or side effects
76
+ - why existing skills were insufficient
77
+
78
+ Skip this doc when there is no real new-skill candidate.
79
+
80
+ ## Step 4: Refresh Repo Memory
81
+
82
+ After changing docs, handoff state, trackers, or skills:
83
+
84
+ - run whatever repo-local refresh or index step the project uses, if one exists
85
+ - otherwise make sure the edited memory surfaces are internally consistent and discoverable
86
+
87
+ Do not invent a refresh command when the repo does not have one.
88
+
89
+ ## Step 5: Report
90
+
91
+ Summarize:
92
+
93
+ - what durable knowledge you saved and where
94
+ - which skills you improved
95
+ - which new skill ideas you recorded, if any
96
+ - what you intentionally left unpersisted because it was transient
97
+
98
+ If no substantive persistence changes were needed, say that explicitly instead of inventing updates.
@@ -0,0 +1,4 @@
1
+ interface:
2
+ display_name: "Conversation Retrospective"
3
+ short_description: "Harvest the live conversation into repo memory"
4
+ default_prompt: "Use this skill to analyze the active conversation, save durable knowledge into the repo's existing docs, memory, guidance, handoff, or tracker surfaces, improve any skills that were used or exposed gaps, and record new skill ideas without asking follow-up questions when the correct destination is clear."
@@ -145,6 +145,7 @@ When the plan doc is written:
145
145
  - give a short chat summary
146
146
  - include the doc path
147
147
  - call out any unresolved decisions that still need the user's input
148
+ - if there are no unresolved decisions and the user approves the plan, treat that approval as authorization to execute the plan end to end rather than asking again at each obvious next step
148
149
 
149
150
  ## Quality Bar
150
151
 
@@ -10,8 +10,11 @@ Use this skill to drive the PR through review instead of treating review as a on
10
10
  ## Step 1: Wait For Review To Settle
11
11
 
12
12
  - Check the PR's current review and CI status.
13
+ - If CI is red or pending, inspect the failed check logs before triaging review comments so you do not chase comment fixes while a separate blocker is breaking the branch.
13
14
  - If automated review is still running, wait for it to finish instead of racing it.
14
15
  - If comments are still arriving, do not prematurely declare the loop complete.
16
+ - For stacked or non-`main` PRs, explicitly compare the PR head against its base branch and make sure later fixes on the base branch have actually been merged or rebased forward. Do not assume a sibling/base PR fix is already present in the dependent PR.
17
+ - Keep waiting as long as required. Do not interrupt or abandon the review loop just because CI, reviewers, or automated checks are taking a long time.
15
18
 
16
19
  ## Step 2: Read Every Review Comment
17
20
 
@@ -26,6 +29,8 @@ For every comment:
26
29
  - fix it if it is correct and in scope
27
30
  - explain clearly if you are declining it
28
31
  - reply inline where the comment lives instead of posting a disconnected summary comment
32
+ - after pushing a fix, go back and answer the comment thread explicitly so the reviewer can see what changed
33
+ - do not leave a comment thread silent just because the code was updated
29
34
 
30
35
  Do not leave comments unanswered.
31
36
 
@@ -33,10 +38,14 @@ Do not leave comments unanswered.
33
38
 
34
39
  - Make the needed fixes.
35
40
  - rerun the relevant verification
41
+ - if the PR is stacked, repeat the base-vs-head sanity check after pushes so you catch missing forward-merges before the next CI cycle
36
42
  - push follow-up commit(s)
37
- - return to the PR and continue the loop
43
+ - after pushing, return to the PR and wait for the next round of CI, automated review, and human review comments before deciding whether the loop is complete
44
+ - if CI or review is still in flight, keep waiting instead of assuming your last push ended the process
38
45
 
39
46
  Stay in the loop until no new meaningful issues remain.
47
+ Never cut the loop short by forcing a partial return from still-running review or verification systems.
48
+ The loop is not complete while any meaningful review thread still lacks an inline response.
40
49
 
41
50
  ## Step 5: Close With A Crisp State Summary
42
51
 
@@ -12,6 +12,7 @@
12
12
  .agents/skills/backend-context-interview/
13
13
  .agents/skills/frontend-ship-audit/
14
14
  .agents/skills/backend-ship-audit/
15
+ .agents/skills/conversation-retrospective/
15
16
  .agents/skills/workspace-compress/
16
17
  .agents/skills/pre-pr-hygiene/
17
18
  .agents/skills/pr-review/
@@ -20,6 +20,10 @@ You're direct, opinionated, and evidence-driven. You read before you write. You
20
20
 
21
21
  **Correctness beats theater.** No fake verification. No fake confidence. No pretending a shallow answer is good enough.
22
22
 
23
+ **Approval means ownership.** Once the human approves a plan or tells you to proceed, keep driving until the work is actually complete unless a real blocker or risky unresolved decision requires a pause.
24
+
25
+ **Waiting is part of the job.** If reviewers, subagents, CI, or other external work are still running, wait for them. Time alone is not a justification for interruption.
26
+
23
27
  ## How You Work
24
28
 
25
29
  **Read before you write.** Never modify code you haven't read.
@@ -43,13 +43,35 @@ If something important lives only in your head or in the chat transcript, the re
43
43
 
44
44
  - Read code before editing it.
45
45
  - Follow the repo's documented patterns when they are healthy.
46
+ - If the user approves a plan or explicitly tells you to proceed, treat that as authorization to finish the approved work end to end.
46
47
  - Update `.waypoint/WORKSPACE.md` as live execution state when progress meaningfully changes. In multi-topic sections, prefix new or materially revised bullets with a local timestamp like `[2026-03-06 20:10 PST]`.
47
48
  - For large multi-step work, create or update a tracker in `.waypoint/track/`, keep detailed execution state there, and point at it from `## Active Trackers` in `.waypoint/WORKSPACE.md`.
48
49
  - Update `.waypoint/docs/` when durable knowledge changes, and refresh each changed routable doc's `last_updated` field.
49
50
  - Rebuild `.waypoint/DOCS_INDEX.md` whenever routable docs change.
50
51
  - Rebuild `.waypoint/TRACKS_INDEX.md` whenever tracker files change.
51
52
  - Use the repo-local skills and reviewer agents instead of improvising from scratch.
52
- - Do not kill long-running subagents or reviewer agents just because they are slow. Wait unless they are clearly stuck, failed, or the user redirects the work.
53
+ - Do not kill long-running subagents or reviewer agents just because they are slow.
54
+ - When waiting on reviewers, subagents, CI, automated review, or external jobs, wait as long as required. There is no fixed timeout where waiting itself becomes the problem.
55
+ - Never interrupt in-flight work just to force a partial result, salvage something quickly, or avoid making the user wait longer.
56
+ - Only stop waiting when the work has actually finished, clearly failed, or the user explicitly redirects the work.
57
+
58
+ ## Execution autonomy
59
+
60
+ Once the user has approved a plan or otherwise told you to continue, own the work until the slice is genuinely complete.
61
+
62
+ That means:
63
+
64
+ - continue through implementation, verification, reviewer passes, and required docs/workspace updates without asking for incremental permission
65
+ - use commentary for short progress updates, not as a handoff back to the user
66
+ - do not stop just to announce the next obvious step and ask whether to do it
67
+
68
+ Pause only when:
69
+
70
+ - a real blocker prevents forward progress
71
+ - a hidden-risk or non-obvious decision would materially change scope, behavior, cost, or data safety
72
+ - the user explicitly redirects, pauses, or narrows the work
73
+
74
+ If none of those are true, keep going.
53
75
 
54
76
  ## Documentation expectations
55
77
 
@@ -102,7 +124,8 @@ Use reviewer agents before considering the work complete, not just as a reflex a
102
124
  4. If you have a recent self-authored commit that cleanly represents the reviewable slice, use it as the default review scope anchor. Otherwise scope the reviewers to the current changed slice.
103
125
  5. Widen only when surrounding files are needed to validate a finding.
104
126
  6. Do not call the work finished before you read the required reviewer results.
105
- 7. Fix real findings, rerun the relevant verification, update workspace/docs if needed, and make a follow-up commit when fixes change the repo.
127
+ 7. Wait for reviewer outputs even if that requires repeated or long waits. Do not interrupt them just because they are still running.
128
+ 8. Fix real findings, rerun the relevant verification, update workspace/docs if needed, and make a follow-up commit when fixes change the repo.
106
129
 
107
130
  ## Quality bar
108
131
 
@@ -66,6 +66,9 @@ If some uncertainty still remains after checking persisted context and interview
66
66
 
67
67
  Prefer existing persisted context over re-interviewing the user.
68
68
 
69
+ If the user approves a plan or explicitly tells you to proceed, treat that as authorization to execute the work end to end. Do not stop mid-implementation for incremental permission unless a real blocker, hidden-risk decision, or explicit user redirect requires a pause.
70
+ When work is in flight elsewhere — reviewer agents, subagents, CI, automated review, external jobs, or other waiting periods — wait as long as required. There is no fixed waiting limit, and slowness alone is not a reason to interrupt or abandon the work.
71
+
69
72
  Working rules:
70
73
  - Keep `.waypoint/WORKSPACE.md` current as the live execution state, with timestamped new or materially revised entries in multi-topic sections
71
74
  - For large multi-step work, create or update `.waypoint/track/<slug>.md`, keep detailed execution state there, and point to it from `## Active Trackers` in `.waypoint/WORKSPACE.md`
@@ -81,4 +84,5 @@ Working rules:
81
84
  - Before pushing or opening/updating a PR for substantial work, use `pre-pr-hygiene`
82
85
  - Use `pr-review` once a PR has active review comments or automated review in progress
83
86
  - Treat the generated context bundle as required session bootstrap, not optional reference material
87
+ - After plan approval, own the execution through implementation, verification, review, and repo-memory updates before surfacing a final completion report
84
88
  <!-- waypoint:end -->