waypoint-codex 0.19.0 → 0.19.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -133,14 +133,14 @@ This helps the agent keep moving until the work is actually ready, not just
133
133
  Waypoint treats user corrections as product input, not just conversation noise.
134
134
 
135
135
  When the user corrects behavior, rules, or workflow, the agent is pushed to
136
- update the right durable files so the same issue is less likely to happen
136
+ update the right durable layer so the same issue is less likely to happen
137
137
  again.
138
138
 
139
139
  That includes:
140
140
 
141
- - user-scoped guidance
142
- - project-scoped guidance
143
- - repo-local skills
141
+ - user-scoped guidance for true cross-project standing rules
142
+ - project-scoped guidance for durable repo-wide context and always-on rules
143
+ - repo-local skills for workflow-specific or method-specific guidance
144
144
  - retrospectives that turn friction from the current conversation into lasting
145
145
  improvements
146
146
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "waypoint-codex",
3
- "version": "0.19.0",
3
+ "version": "0.19.2",
4
4
  "description": "Make Codex better by default with stronger planning, code quality, reviews, tracking, and repo guidance.",
5
5
  "license": "MIT",
6
6
  "type": "module",
@@ -103,11 +103,13 @@ Plans document your understanding. Include what matters for this task:
103
103
 
104
104
  - **Current State**: What exists today — relevant files, data flows, constraints, existing patterns
105
105
  - **Changes**: Every file to create/modify/delete, how changes connect
106
+ - **Removals**: What obsolete code, compatibility logic, unused files, debug logs, dead props, or stale branches will be deleted as part of the change
106
107
  - **Decisions**: Why this approach, tradeoffs, assumptions
107
108
  - **Phase breakdown**: Distinct execution phases in the order they should happen
108
109
  - **Scope checklist**: Concrete implementation items that can be marked done or not done
109
110
  - **Acceptance criteria**: What must be true when each phase is "done"
110
111
  - **Phase checkpoints**: What verification, reviewer passes, tests, typechecks, builds, or manual QA must pass before moving to the next phase
112
+ - **Cleanup expectations**: What legacy or replaced paths must be removed before the work can be called complete
111
113
  - **Test cases**: For behavioral changes, include input -> expected output examples
112
114
  - **Non-Goals**: Explicitly out of scope to prevent implementation drift
113
115
 
@@ -132,6 +134,7 @@ Before presenting the plan, verify against real code:
132
134
  - No pretending you verified something you didn't
133
135
  - Approved scope must be explicit enough to act as an execution contract after user approval
134
136
  - The plan must be explicit enough to support phase-by-phase execution and checkpoints without rediscovering the intended order in chat
137
+ - Refactor and replacement plans should explicitly call out what legacy or obsolete code will be removed instead of preserving it by default
135
138
  - If the user approves the plan, do not silently defer or drop checklist items later; discuss any proposed scope change first
136
139
 
137
140
  If the change touches durable project behavior, include docs/workspace updates in the plan.
@@ -175,6 +178,7 @@ If the plan would make the implementer ask "where does this hook in?" or "what e
175
178
  - Do not spend interview turns on implementation facts that are already in the code or routed docs.
176
179
  - Do not stop exploring just because you have a plausible plan. The usual failure mode is shallow repo understanding.
177
180
  - Do not leave unresolved architecture or product decisions hidden behind "we can figure that out during implementation."
181
+ - Do not plan a refactor as a rewrite-plus-compatibility-preservation bundle unless compatibility is explicitly required. Call out what should be deleted.
178
182
  - Do not dump a transcript into the plan doc. Distill the decisions and requirements into a clean implementation handoff.
179
183
  - Do not treat a reviewed plan as a stopping point. Once the user approves it, the workflow expects execution to continue.
180
184
 
@@ -2,6 +2,8 @@
2
2
 
3
3
  This repository uses Waypoint as its operating system for Codex.
4
4
 
5
+ These instructions are mandatory for work in this repo. Treat them as overriding weaker generic guidance outside these files unless the user explicitly tells you otherwise.
6
+
5
7
  If the repo needs custom AGENTS guidance, write it outside the managed `waypoint:start/end` block in `AGENTS.md`. Treat the managed block as Waypoint-owned and replaceable.
6
8
  Treat repo guidance as the authoritative repo-local contract, but not as something that can outrank higher-priority system or developer instructions injected by the environment.
7
9
  If a higher-priority instruction conflicts with Waypoint workflow, do not silently ignore the repo rule or pretend it happened anyway. State the conflict plainly, explain the practical consequence, and ask the user for the missing authorization or use a compliant fallback path.
@@ -54,14 +56,19 @@ If something important lives only in your head or in the chat transcript, the re
54
56
  - Once a plan is approved, treat its scope and acceptance criteria as the execution contract. Do not silently narrow, defer, or drop approved work because the system feels good enough, the remaining work looks less valuable, or you would prefer a smaller PR. If the scope should change, discuss that with the user first unless a real blocker, hidden-risk decision, or explicit user redirect forces the change.
55
57
  - When the user shows a bug, screenshot, or broken behavior, investigate first. Lead with what is happening, why it is likely happening, the important options or tradeoffs if they matter, what you checked, and what you are doing next.
56
58
  - After investigation, explain the diagnosis before jumping into implementation whenever the cause, tradeoffs, or solution shape are not already obvious.
57
- - Fix underlying causes instead of papering over symptoms. If the real fix requires changing a shaky abstraction, deleting stale compatibility logic, or cleaning up debt that is directly causing the bug, do that work instead of shipping a hot patch around it.
59
+ - Fix underlying causes instead of papering over symptoms. If the real fix requires changing a shaky abstraction, deleting stale compatibility logic, cleaning up debt that is directly causing the bug, or deleting obsolete code that the new path replaces, do that work instead of shipping a hot patch around it.
60
+ - When replacing a brittle path, aggressively remove obsolete files, debug logs, dead props, dead branches, and compatibility scaffolding instead of preserving them by default.
61
+ - Do not preserve backward compatibility or legacy code paths unless the user or documented project constraints explicitly require it.
58
62
  - Do not stop at the first local patch that makes the symptom disappear if the root cause is still obviously in place.
59
63
  - Do not lead with readiness disclaimers such as "I can't call this done yet" unless the user explicitly asked whether the work is ready, shippable, or complete.
60
64
  - Keep communication concise by default. Lead with the answer, diagnosis, decision, or next step, and include only the most important supporting detail unless the user asks for more.
65
+ - Do not hide behind generic heuristics like "try the simplest approach first" or "avoid refactoring beyond the ask" when the approved work or root-cause fix clearly requires deeper cleanup. Do the level of work a strong senior engineer would choose for the real codebase.
61
66
  - Honesty means accurate diagnosis, explicit uncertainty, and clear verification limits. It does not mean substituting process language for investigation.
62
67
  - Before making meaningful frontend or backend decisions, inspect the available user-scoped and project-scoped `AGENTS.md` guidance. If the task depends on frontend or backend context that is missing from the project-scoped guidance and routed docs, use the corresponding `*-context-interview` skill to fill that gap instead of guessing.
63
- - Update the user-scoped `AGENTS.md` when you learn a durable preference, workflow rule, or default that should apply across projects and your environment allows you to edit it.
64
- - Update the project-scoped repo `AGENTS.md` when you learn durable repo truth, project constraints, or stable project-specific collaboration rules.
68
+ - Persist corrections and newly learned context in the right durable layer instead of defaulting to `AGENTS.md`.
69
+ - Update the user-scoped `AGENTS.md` only for true cross-project standing preferences or global operating rules.
70
+ - Update the project-scoped repo `AGENTS.md` only for durable repo truth, project constraints, or stable project-wide rules that should always apply in this repo.
71
+ - If the correction is workflow-specific or method-specific, update the relevant skill instead. If no existing skill owns it well, propose creating one instead of stuffing that guidance into `AGENTS.md`.
65
72
  - Treat `.waypoint/WORKSPACE.md` as mandatory live execution state, not end-of-task paperwork.
66
73
  - Treat `.waypoint/ACTIVE_PLANS.md` as mandatory live plan state for approved work, not a forgotten side file.
67
74
  - Update `.waypoint/WORKSPACE.md` during the work whenever the active goal, phase, next step, blocker, verification state, or review state materially changes. In multi-topic sections, prefix new or materially revised bullets with a local timestamp like `[2026-03-06 20:10 PST]`.
@@ -99,6 +106,8 @@ Execute approved work phase by phase. Complete the current phase, run the releva
99
106
  That means:
100
107
 
101
108
  - continue through implementation, verification, and required repo-memory updates without asking for incremental permission
109
+ - before reporting completion, reread `.waypoint/ACTIVE_PLANS.md`, the active tracker if one exists, `.waypoint/WORKSPACE.md`, and the relevant routed docs, then compare reality against the approved scope, current phase checkpoint, and acceptance criteria
110
+ - if that reread shows the work is not actually complete, keep going instead of reporting partial progress as completion
102
111
  - use commentary for short progress updates, not as a handoff back to the user
103
112
  - do not stop just to announce the next obvious step and ask whether to do it
104
113
 
@@ -147,7 +156,7 @@ Deliberate closeout review is available when you want a second pass for ship-rea
147
156
  - If you use it, follow the skill's loop fully: define the reviewable slice, run the needed reviewers, wait for the outputs, fix meaningful findings, and rerun fresh passes when warranted.
148
157
  - Treat reviewer agents as one-shot workers. Once a reviewer returns its findings, read the result and close it.
149
158
  - Do not widen the scope casually; keep the second pass anchored to the slice you are actually trying to validate.
150
- - For non-trivial work, the healthy default is to use reviewer passes at meaningful milestones instead of saving all second-pass scrutiny for the very end.
159
+ - For non-trivial work, the healthy default is to use reviewer passes at phase checkpoints instead of saving all second-pass scrutiny for the very end.
151
160
 
152
161
  ## Quality bar
153
162
 
@@ -21,6 +21,7 @@ Write code that keeps behavior explicit, failure visible, and the next change ea
21
21
  Do not preserve old behavior unless a user-facing requirement explicitly asks for it.
22
22
 
23
23
  - Remove replaced paths instead of leaving shims, aliases, or silent compatibility branches.
24
+ - When a refactor or replacement makes code obsolete, delete the obsolete files, dead props, debug logs, and stale branches as part of the change instead of keeping them around.
24
25
  - Do not keep dead fields, dual formats, or migration-only logic "just in case."
25
26
  - If compatibility must stay, document the exact contract being preserved and the removal condition.
26
27
 
@@ -3,6 +3,8 @@
3
3
 
4
4
  This repository uses Waypoint as its Codex operating system.
5
5
 
6
+ These instructions are mandatory for work in this repo. Treat them as overriding any weaker generic guidance outside these files unless the user explicitly tells you otherwise.
7
+
6
8
  Waypoint owns only the text inside these `waypoint:start/end` markers.
7
9
  If you need repo-specific AGENTS instructions, write them outside this managed block.
8
10
  Do not put durable repo guidance inside the managed block, because `waypoint init` may replace it during upgrades.
@@ -85,11 +87,15 @@ Delivery expectations:
85
87
  - When the user shows a bug, broken behavior, or a screenshot of something wrong, investigate before discussing readiness.
86
88
  - After investigation, explain the problem to the user before jumping into implementation whenever the diagnosis, tradeoffs, or solution shape are not already obvious.
87
89
  - Lead with the useful truth: what is happening, the likely cause, the important options or tradeoffs if they matter, what you checked, and what you are doing next.
88
- - Fix the underlying problem, not only the visible symptom. If the real fix requires removing a bad old decision, paying down local technical debt, or simplifying shaky architecture, do that instead of hot-patching around it.
90
+ - Fix the underlying problem, not only the visible symptom. If the real fix requires removing a bad old decision, paying down local technical debt, simplifying shaky architecture, or deleting obsolete code, do that instead of hot-patching around it.
91
+ - When replacing a brittle path, aggressively delete obsolete code, stale compatibility branches, dead props, unused files, and debug logs instead of preserving them by default.
92
+ - Do not preserve backward compatibility, old branches, or legacy code paths unless the user or documented project constraints explicitly require that compatibility.
89
93
  - Do not ship a bug fix that knowingly leaves the real cause in place behind a cosmetic patch unless the user explicitly asked for a temporary workaround.
90
94
  - Do not lead with refusal or readiness-disclaimer language like "I can't call this done yet" unless the user explicitly asked for a ship/readiness judgment.
91
95
  - Honesty means accurate diagnosis, explicit uncertainty, and clear verification limits. It does not mean hiding behind procedural disclaimers when you could be investigating.
92
96
  - Before you say the work is complete, verify it yourself whenever you reasonably can with the tools available in the environment.
97
+ - Before you report completion, reread `.waypoint/ACTIVE_PLANS.md`, the active tracker if one exists, `WORKSPACE.md`, and any relevant routed docs, then compare the actual result against the approved scope, current phase checkpoint, and acceptance criteria.
98
+ - If that reread shows the task is not actually complete, continue working. Do not stop just to report partial progress as if it were completion.
93
99
  - Match the verification to the task. Run code and inspect real output for scripts and backend changes. Click through flows, inspect rendered states, and check behavior in the browser for visual or interactive work.
94
100
  - Use representative or real inputs when practical instead of toy examples, so the check tells you something meaningful about the actual request.
95
101
  - If there are realistic edge cases, failure modes, or recovery paths you can exercise without turning the task into a science project, do that too.
@@ -105,11 +111,14 @@ Working rules:
105
111
  - Update `.waypoint/ACTIVE_PLANS.md` whenever the active approved plan, current phase, phase checklist, checkpoint, or approved scope changes.
106
112
  - For multi-step work, keep the workspace and active plan file moving as you move: do not wait until the end of the task to reconstruct what happened.
107
113
  - If a tracker exists for the active workstream, update the tracker during the work as well and keep `WORKSPACE.md` pointing at the current tracker state.
108
- - Update user-scoped `AGENTS.md` when you learn a durable preference, standing rule, or default that should apply across projects and your environment allows you to edit that file
109
- - Update the project-scoped repo `AGENTS.md` when you learn durable repo truth, project constraints, or stable project-specific collaboration rules
114
+ - Persist corrections and newly learned context in the right durable layer instead of defaulting to `AGENTS.md`.
115
+ - Update user-scoped `AGENTS.md` only for true cross-project standing preferences or global operating rules.
116
+ - Update the project-scoped repo `AGENTS.md` only for durable repo context or project-wide rules that should always apply in this repo.
117
+ - If the correction is workflow-specific or method-specific, update the relevant repo skill instead. If no existing skill owns it well, propose creating one instead of stuffing that guidance into `AGENTS.md`.
110
118
  - Update `.waypoint/docs/` when durable project knowledge changes, update `.waypoint/plans/` when a durable plan changes, update `.waypoint/ACTIVE_PLANS.md` when the active approved plan or current phase changes, and refresh `last_updated` on touched routable docs
111
119
  - Keep most work in the main agent. Use repo-local skills, trackers, and reviewer agents when they create clear leverage, not as default ceremony.
112
120
  - Let repo-local skills describe their own triggers. The managed block should keep only the high-level rule: use those tools deliberately when they clearly help the task.
121
+ - Do not hide behind generic heuristics like "try the simplest approach first" or "avoid refactoring beyond the ask" when the approved work or root-cause fix clearly requires deeper cleanup. Do the level of work a strong senior engineer would choose for the real codebase.
113
122
  - Use reviewer agents proactively at phase checkpoints when the work is non-trivial, risky, user-facing, merge-bound, or otherwise expensive to get wrong.
114
123
  - Strong default moments for reviewer-agent passes are: after completing a plan phase, before opening or materially updating a PR, after fixing substantial review findings, and before finally calling the work clear.
115
124
  - Do not interrupt implementation for heavyweight checks after every tiny edit. Batch related work into the current plan phase, then run the checkpoint.