waypoint-codex 0.5.0 → 0.6.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -61,6 +61,7 @@ repo/
61
61
  - `ux-states-audit`
62
62
  - `docs-sync`
63
63
  - `code-guide-audit`
64
+ - `break-it-qa`
64
65
  - `workspace-compress`
65
66
  - `pre-pr-hygiene`
66
67
  - `pr-review`
package/dist/src/core.js CHANGED
@@ -455,6 +455,7 @@ export function doctorRepository(projectRoot) {
455
455
  "ux-states-audit",
456
456
  "docs-sync",
457
457
  "code-guide-audit",
458
+ "break-it-qa",
458
459
  "workspace-compress",
459
460
  "pre-pr-hygiene",
460
461
  "pr-review",
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "waypoint-codex",
3
- "version": "0.5.0",
3
+ "version": "0.6.0",
4
4
  "description": "Codex-native repository operating system: scaffolding, docs routing, repo-local skills, doctor, and sync.",
5
5
  "license": "MIT",
6
6
  "type": "module",
@@ -0,0 +1,87 @@
1
+ ---
2
+ name: break-it-qa
3
+ description: Verify a user-facing feature by trying to break it on purpose instead of only following the happy path. Use after building forms, multistep flows, settings pages, onboarding, stateful UI, destructive actions, or any browser-facing feature where invalid inputs, refreshes, back navigation, repeated clicks, wrong action order, or recovery paths might expose real bugs.
4
+ ---
5
+
6
+ # Break-It QA
7
+
8
+ Use this skill to attack the feature like an impatient, confused, or careless user.
9
+
10
+ This is not the same as `e2e-verify`.
11
+
12
+ - `e2e-verify` proves the intended flow works end to end.
13
+ - `break-it-qa` tries to make the feature fail through invalid, interrupted, stale, repeated, or out-of-order interactions.
14
+
15
+ ## Read First
16
+
17
+ Before verification:
18
+
19
+ 1. Read `.waypoint/SOUL.md`
20
+ 2. Read `.waypoint/agent-operating-manual.md`
21
+ 3. Read `.waypoint/WORKSPACE.md`
22
+ 4. Read `.waypoint/context/MANIFEST.md`
23
+ 5. Read every file listed in that manifest
24
+ 6. Read the routed docs or nearby code that define the feature being tested
25
+
26
+ ## Step 1: Identify Break Surfaces
27
+
28
+ - Identify the happy path first so you know what "broken" means.
29
+ - Find the fragile surfaces: forms, wizards, pending states, destructive actions, async transitions, navigation changes, and persisted state.
30
+
31
+ Do not test blindly.
32
+
33
+ ## Step 2: Use The Real UI
34
+
35
+ - Use `playwright-interactive`.
36
+ - Exercise the actual UI instead of mocking the flow in code.
37
+ - Keep the scope focused on the feature the user asked you to verify.
38
+
39
+ ## Step 3: Try To Break It On Purpose
40
+
41
+ Do more than a happy-path walkthrough.
42
+
43
+ Actively try:
44
+
45
+ - invalid inputs
46
+ - empty required fields
47
+ - boundary-length or malformed inputs
48
+ - repeated or double clicks
49
+ - submitting twice
50
+ - wrong action order
51
+ - back and forward navigation
52
+ - page refresh during the flow
53
+ - closing and reopening modals or screens
54
+ - canceling mid-flow and re-entering
55
+ - stale UI state after edits
56
+ - conflicting selections or toggles
57
+ - error recovery after a failed action
58
+
59
+ If the feature is stateful, also check whether the UI, network result, and persisted state stay coherent after those interactions.
60
+
61
+ ## Step 4: Record And Fix Real Bugs
62
+
63
+ - Document each meaningful issue you find.
64
+ - Fix the issue when the remediation is clear.
65
+ - If the behavior is ambiguous, call out the product decision instead of bluffing a fix.
66
+ - Update docs when the verification exposes stale assumptions about how the feature works.
67
+
68
+ Do not stop at the first bug.
69
+
70
+ ## Step 5: Repeat Until The Feature Resists Abuse
71
+
72
+ After fixes:
73
+
74
+ - rerun the relevant happy path
75
+ - rerun the break attempts that previously failed
76
+ - verify the fix did not create a new inconsistent state
77
+
78
+ The skill is not done when the feature only works once. It is done when the feature behaves predictably under sloppy real-world use.
79
+
80
+ ## Step 6: Report Truthfully
81
+
82
+ Summarize:
83
+
84
+ - what break attempts you tried
85
+ - which issues you found
86
+ - what you fixed
87
+ - what still looks risky or was not exercised
@@ -0,0 +1,4 @@
1
+ interface:
2
+ display_name: "Break-It QA"
3
+ short_description: "Try to break a feature through the UI"
4
+ default_prompt: "Use this skill to verify a user-facing feature by trying to break it through the browser with invalid inputs, wrong action order, refreshes, back navigation, repeated clicks, and other adversarial interactions, then fix clear issues and repeat."
@@ -69,6 +69,7 @@ Do not document every trivial implementation detail. Document the non-obvious, d
69
69
  - `ux-states-audit` when async/data-driven UI likely lacks loading, empty, or error states
70
70
  - `docs-sync` when routed docs may be stale, missing, or inconsistent with the codebase
71
71
  - `code-guide-audit` when a specific feature or file set needs a targeted coding-guide compliance check
72
+ - `break-it-qa` when a browser-facing feature should be attacked with invalid inputs, refreshes, repeated clicks, wrong action order, or other adversarial manual QA
72
73
  - `workspace-compress` after meaningful chunks, before stopping, and before review when the live handoff needs compression
73
74
  - `pre-pr-hygiene` before pushing or opening/updating a PR for substantial work
74
75
  - `pr-review` once a PR has active review comments or automated review in progress
@@ -36,6 +36,7 @@ Working rules:
36
36
  - Use the repo-local skills Waypoint ships for structured workflows when relevant
37
37
  - Use `docs-sync` when the docs may be stale or a change altered shipped behavior, contracts, routes, or commands
38
38
  - Use `code-guide-audit` for a targeted coding-guide compliance pass on a specific feature, file set, or change slice
39
+ - Use `break-it-qa` for browser-facing features that should be tested against invalid inputs, refreshes, repeated clicks, wrong navigation, and other adversarial user behavior
39
40
  - If optional reviewer roles are present and you make a commit, run `code-reviewer` and `code-health-reviewer` in parallel before calling the work done
40
41
  - Before pushing or opening/updating a PR for substantial work, use `pre-pr-hygiene`
41
42
  - Use `pr-review` once a PR has active review comments or automated review in progress