waypoint-codex 0.6.1 → 0.7.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -135,7 +135,6 @@ Waypoint ships a strong default skill pack for real coding work:
135
135
  - `workspace-compress`
136
136
  - `pre-pr-hygiene`
137
137
  - `pr-review`
138
- - `e2e-verify`
139
138
 
140
139
  These are repo-local, so the workflow travels with the project.
141
140
 
@@ -147,7 +146,7 @@ If you initialize with `--with-roles`, Waypoint scaffolds:
147
146
  - `code-reviewer`
148
147
  - `plan-reviewer`
149
148
 
150
- The intended workflow is post-commit: make your change, commit it, run the reviewers in parallel, fix real findings, then close out.
149
+ The intended workflow is chunk-based: once there is a meaningful reviewable slice, run the reviewers in parallel, fix real findings, then close out. A recent self-authored commit is the preferred scope anchor when one cleanly represents the slice, but it is not the only valid trigger.
151
150
 
152
151
  ## What makes it different
153
152
 
package/dist/src/core.js CHANGED
@@ -459,7 +459,6 @@ export function doctorRepository(projectRoot) {
459
459
  "workspace-compress",
460
460
  "pre-pr-hygiene",
461
461
  "pr-review",
462
- "e2e-verify",
463
462
  ]) {
464
463
  const skillPath = path.join(projectRoot, ".agents/skills", skillName, "SKILL.md");
465
464
  if (!existsSync(skillPath)) {
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "waypoint-codex",
3
- "version": "0.6.1",
3
+ "version": "0.7.0",
4
4
  "description": "Codex-native repository operating system: scaffolding, docs routing, repo-local skills, doctor, and sync.",
5
5
  "license": "MIT",
6
6
  "type": "module",
@@ -7,10 +7,7 @@ description: Verify a user-facing feature by trying to break it on purpose inste
7
7
 
8
8
  Use this skill to attack the feature like an impatient, confused, or careless user.
9
9
 
10
- This is not the same as `e2e-verify`.
11
-
12
- - `e2e-verify` proves the intended flow works end to end.
13
- - `break-it-qa` tries to make the feature fail through invalid, interrupted, stale, repeated, or out-of-order interactions.
10
+ This skill is for adversarial manual QA. It tries to make the feature fail through invalid, interrupted, stale, repeated, or out-of-order interactions instead of only proving the happy path works.
14
11
 
15
12
  ## Step 1: Ask The Three Setup Questions
16
13
 
@@ -11,7 +11,9 @@ This skill owns one job: inspect the specific code the user points at, map it ag
11
11
 
12
12
  ## Step 1: Load The Right Scope
13
13
 
14
- - Read `.waypoint/docs/code-guide.md`.
14
+ - Read the repo's routed code guide.
15
+ - In standard Waypoint repos, use `.waypoint/docs/code-guide.md`.
16
+ - If the repo routes the code guide somewhere else, follow the repo's own docs and routing instead of assuming another fixed path.
15
17
  - Read only the files, routes, tests, contracts, and nearby docs needed to understand the specific feature or slice under review.
16
18
  - If the scope is ambiguous, resolve it to a concrete file set, feature path, or commit-sized change surface before auditing.
17
19
 
@@ -16,7 +16,9 @@ Before the hygiene pass:
16
16
  3. Read `.waypoint/WORKSPACE.md`
17
17
  4. Read `.waypoint/context/MANIFEST.md`
18
18
  5. Read every file listed in that manifest
19
- 6. Read `.waypoint/docs/code-guide.md` and the routed docs relevant to the area being shipped
19
+ 6. Read the repo's routed code guide and the routed docs relevant to the area being shipped
20
+
21
+ In standard Waypoint repos, the code guide lives at `.waypoint/docs/code-guide.md`. If the repo routes it somewhere else, follow the repo's own docs and routing instead of assuming another fixed path.
20
22
 
21
23
  ## Step 1: Audit The Whole Change Surface
22
24
 
@@ -73,7 +73,6 @@ Do not document every trivial implementation detail. Document the non-obvious, d
73
73
  - `workspace-compress` after meaningful chunks, before stopping, and before review when the live handoff needs compression
74
74
  - `pre-pr-hygiene` before pushing or opening/updating a PR for substantial work
75
75
  - `pr-review` once a PR has active review comments or automated review in progress
76
- - `e2e-verify` for major user-facing or cross-system changes that need manual end-to-end verification
77
76
 
78
77
  ## When to use the optional reviewer agents
79
78
 
@@ -83,14 +82,15 @@ If the repo was initialized with Waypoint roles enabled, use them as focused sec
83
82
  - `code-health-reviewer` for maintainability drift
84
83
  - `plan-reviewer` to challenge weak implementation plans before execution
85
84
 
86
- ## Post-Commit Review Loop
85
+ ## Review Loop
87
86
 
88
- If Waypoint's optional roles are enabled and you authored a commit, immediately after that commit:
87
+ If Waypoint's optional roles are enabled, run the reviewer pair after a meaningful reviewable implementation chunk, not just as a reflex after every tiny commit.
89
88
 
90
- 1. Launch `code-reviewer` and `code-health-reviewer` in parallel as background, read-only reviewers.
91
- 2. Scope them to the commit you just made, then widen only when surrounding files are needed to validate a finding.
92
- 3. Do not call the work finished before you read both reviewer results.
93
- 4. Fix real findings, rerun the relevant verification, update workspace/docs if needed, and make a follow-up commit when fixes change the repo.
89
+ 1. Launch `code-reviewer` and `code-health-reviewer` in parallel as background, read-only reviewers once there is a coherent slice of work worth reviewing.
90
+ 2. If you have a recent self-authored commit that cleanly represents that slice, use it as the default review scope anchor. Otherwise scope the reviewers to the current changed slice.
91
+ 3. Widen only when surrounding files are needed to validate a finding.
92
+ 4. Do not call the work finished before you read both reviewer results.
93
+ 5. Fix real findings, rerun the relevant verification, update workspace/docs if needed, and make a follow-up commit when fixes change the repo.
94
94
 
95
95
  ## Quality bar
96
96
 
@@ -59,7 +59,13 @@ Do not create findings for:
59
59
 
60
60
  ## Scope
61
61
 
62
- In Waypoint's default post-commit review loop, start with the latest self-authored commit, then widen only when related files are needed to validate a maintainability issue. Focus on:
62
+ In Waypoint's default review loop, start with the reviewable slice the main agent hands you.
63
+
64
+ - If there is a recent self-authored commit that cleanly represents the slice, use that commit as the default scope anchor.
65
+ - Otherwise, start from the current changed files or diff under review.
66
+ - Widen only when related files are needed to validate a maintainability issue.
67
+
68
+ Focus on:
63
69
 
64
70
  - recently changed files
65
71
  - their importers
@@ -41,7 +41,11 @@ Not:
41
41
 
42
42
  ### 1. Get the Changes
43
43
 
44
- In Waypoint's default post-commit review loop, start with the latest self-authored commit. Review the actual diff or recent changed files first, then widen only as needed.
44
+ In Waypoint's default review loop, start with the reviewable slice the main agent hands you.
45
+
46
+ - If there is a recent self-authored commit that cleanly represents the slice, use that commit as the default scope anchor.
47
+ - Otherwise, start from the current changed files or diff the main agent is asking you to review.
48
+ - Widen only as needed.
45
49
 
46
50
  ### 2. Deep Research
47
51
 
@@ -37,9 +37,8 @@ Working rules:
37
37
  - Use `docs-sync` when the docs may be stale or a change altered shipped behavior, contracts, routes, or commands
38
38
  - Use `code-guide-audit` for a targeted coding-guide compliance pass on a specific feature, file set, or change slice
39
39
  - Use `break-it-qa` for browser-facing features that should be tested against invalid inputs, refreshes, repeated clicks, wrong navigation, and other adversarial user behavior
40
- - If optional reviewer roles are present and you make a commit, run `code-reviewer` and `code-health-reviewer` in parallel before calling the work done
40
+ - If optional reviewer roles are present and there is a meaningful reviewable implementation chunk, run `code-reviewer` and `code-health-reviewer` in parallel before calling the work done; use a recent self-authored commit as the default scope anchor when one cleanly represents that slice
41
41
  - Before pushing or opening/updating a PR for substantial work, use `pre-pr-hygiene`
42
42
  - Use `pr-review` once a PR has active review comments or automated review in progress
43
- - Use `e2e-verify` for major user-facing or cross-system changes that need manual end-to-end verification
44
43
  - Treat the generated context bundle as required session bootstrap, not optional reference material
45
44
  <!-- waypoint:end -->
@@ -1,63 +0,0 @@
1
- ---
2
- name: e2e-verify
3
- description: Perform manual end-to-end verification for a shipped feature or major change. Use when frontend and backend behavior must be verified together, when a feature needs a realistic walkthrough, or when the agent should manually exercise the flow, inspect logs and persisted state, document issues, fix them, and repeat until no meaningful end-to-end issues remain.
4
- ---
5
-
6
- # E2E Verify
7
-
8
- Use this skill when "it should work" is not enough and the flow needs to be proven end to end.
9
-
10
- ## Read First
11
-
12
- Before verification:
13
-
14
- 1. Read `.waypoint/SOUL.md`
15
- 2. Read `.waypoint/agent-operating-manual.md`
16
- 3. Read `.waypoint/WORKSPACE.md`
17
- 4. Read `.waypoint/context/MANIFEST.md`
18
- 5. Read every file listed in that manifest
19
- 6. Read the routed docs that define the feature, flow, or contract being verified
20
-
21
- ## Step 1: Exercise The Real Flow
22
-
23
- - For browser-facing paths, manually exercise the feature through the real UI.
24
- - For backend-only or service flows, drive the real API or runtime path directly.
25
- - Follow the feature from entry point to persistence to user-visible outcome.
26
-
27
- ## Step 2: Inspect End-To-End State
28
-
29
- Check the surfaces that prove the system actually behaved correctly:
30
-
31
- - UI state
32
- - server responses
33
- - logs
34
- - background-job state if relevant
35
- - database or persisted records when relevant
36
-
37
- Do not stop at "the page looked okay."
38
-
39
- ## Step 3: Record And Fix Issues
40
-
41
- - Document each meaningful issue you find.
42
- - Fix the issue when the remediation is clear.
43
- - Update docs or contracts if verification exposes stale assumptions.
44
-
45
- ## Step 4: Repeat Until Clean
46
-
47
- Re-run the end-to-end flow after fixes.
48
-
49
- The skill is complete only when:
50
-
51
- - the intended flow works
52
- - the persisted state is correct
53
- - the logs tell a truthful story
54
- - no meaningful issues remain
55
-
56
- ## Step 5: Report Verification Truthfully
57
-
58
- Summarize:
59
-
60
- - the flows exercised
61
- - the state surfaces inspected
62
- - the issues found and fixed
63
- - any residual risks or unverified edges
@@ -1,4 +0,0 @@
1
- interface:
2
- display_name: "E2E Verify"
3
- short_description: "Manually verify a feature end to end"
4
- default_prompt: "Use this skill for manual end-to-end verification of a feature or major change. Exercise the real flow, inspect UI plus logs and persisted state, document issues, fix them, and repeat until no meaningful end-to-end issues remain."