waypoint-codex 0.5.0 → 0.6.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +1 -0
- package/dist/src/core.js +1 -0
- package/package.json +1 -1
- package/templates/.agents/skills/break-it-qa/SKILL.md +87 -0
- package/templates/.agents/skills/break-it-qa/agents/openai.yaml +4 -0
- package/templates/.waypoint/agent-operating-manual.md +1 -0
- package/templates/managed-agents-block.md +1 -0
package/README.md
CHANGED
package/dist/src/core.js
CHANGED
package/package.json
CHANGED
|
@@ -0,0 +1,87 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: break-it-qa
|
|
3
|
+
description: Verify a user-facing feature by trying to break it on purpose instead of only following the happy path. Use after building forms, multistep flows, settings pages, onboarding, stateful UI, destructive actions, or any browser-facing feature where invalid inputs, refreshes, back navigation, repeated clicks, wrong action order, or recovery paths might expose real bugs.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Break-It QA
|
|
7
|
+
|
|
8
|
+
Use this skill to attack the feature like an impatient, confused, or careless user.
|
|
9
|
+
|
|
10
|
+
This is not the same as `e2e-verify`.
|
|
11
|
+
|
|
12
|
+
- `e2e-verify` proves the intended flow works end to end.
|
|
13
|
+
- `break-it-qa` tries to make the feature fail through invalid, interrupted, stale, repeated, or out-of-order interactions.
|
|
14
|
+
|
|
15
|
+
## Read First
|
|
16
|
+
|
|
17
|
+
Before verification:
|
|
18
|
+
|
|
19
|
+
1. Read `.waypoint/SOUL.md`
|
|
20
|
+
2. Read `.waypoint/agent-operating-manual.md`
|
|
21
|
+
3. Read `.waypoint/WORKSPACE.md`
|
|
22
|
+
4. Read `.waypoint/context/MANIFEST.md`
|
|
23
|
+
5. Read every file listed in that manifest
|
|
24
|
+
6. Read the routed docs or nearby code that define the feature being tested
|
|
25
|
+
|
|
26
|
+
## Step 1: Identify Break Surfaces
|
|
27
|
+
|
|
28
|
+
- Identify the happy path first so you know what "broken" means.
|
|
29
|
+
- Find the fragile surfaces: forms, wizards, pending states, destructive actions, async transitions, navigation changes, and persisted state.
|
|
30
|
+
|
|
31
|
+
Do not test blindly.
|
|
32
|
+
|
|
33
|
+
## Step 2: Use The Real UI
|
|
34
|
+
|
|
35
|
+
- Use `playwright-interactive`.
|
|
36
|
+
- Exercise the actual UI instead of mocking the flow in code.
|
|
37
|
+
- Keep the scope focused on the feature the user asked you to verify.
|
|
38
|
+
|
|
39
|
+
## Step 3: Try To Break It On Purpose
|
|
40
|
+
|
|
41
|
+
Do more than a happy-path walkthrough.
|
|
42
|
+
|
|
43
|
+
Actively try:
|
|
44
|
+
|
|
45
|
+
- invalid inputs
|
|
46
|
+
- empty required fields
|
|
47
|
+
- boundary-length or malformed inputs
|
|
48
|
+
- repeated or double clicks
|
|
49
|
+
- submitting twice
|
|
50
|
+
- wrong action order
|
|
51
|
+
- back and forward navigation
|
|
52
|
+
- page refresh during the flow
|
|
53
|
+
- closing and reopening modals or screens
|
|
54
|
+
- canceling mid-flow and re-entering
|
|
55
|
+
- stale UI state after edits
|
|
56
|
+
- conflicting selections or toggles
|
|
57
|
+
- error recovery after a failed action
|
|
58
|
+
|
|
59
|
+
If the feature is stateful, also check whether the UI, network result, and persisted state stay coherent after those interactions.
|
|
60
|
+
|
|
61
|
+
## Step 4: Record And Fix Real Bugs
|
|
62
|
+
|
|
63
|
+
- Document each meaningful issue you find.
|
|
64
|
+
- Fix the issue when the remediation is clear.
|
|
65
|
+
- If the behavior is ambiguous, call out the product decision instead of bluffing a fix.
|
|
66
|
+
- Update docs when the verification exposes stale assumptions about how the feature works.
|
|
67
|
+
|
|
68
|
+
Do not stop at the first bug.
|
|
69
|
+
|
|
70
|
+
## Step 5: Repeat Until The Feature Resists Abuse
|
|
71
|
+
|
|
72
|
+
After fixes:
|
|
73
|
+
|
|
74
|
+
- rerun the relevant happy path
|
|
75
|
+
- rerun the break attempts that previously failed
|
|
76
|
+
- verify the fix did not create a new inconsistent state
|
|
77
|
+
|
|
78
|
+
The skill is not done when the feature only works once. It is done when the feature behaves predictably under sloppy real-world use.
|
|
79
|
+
|
|
80
|
+
## Step 6: Report Truthfully
|
|
81
|
+
|
|
82
|
+
Summarize:
|
|
83
|
+
|
|
84
|
+
- what break attempts you tried
|
|
85
|
+
- which issues you found
|
|
86
|
+
- what you fixed
|
|
87
|
+
- what still looks risky or was not exercised
|
|
@@ -0,0 +1,4 @@
|
|
|
1
|
+
interface:
|
|
2
|
+
display_name: "Break-It QA"
|
|
3
|
+
short_description: "Try to break a feature through the UI"
|
|
4
|
+
default_prompt: "Use this skill to verify a user-facing feature by trying to break it through the browser with invalid inputs, wrong action order, refreshes, back navigation, repeated clicks, and other adversarial interactions, then fix clear issues and repeat."
|
|
@@ -69,6 +69,7 @@ Do not document every trivial implementation detail. Document the non-obvious, d
|
|
|
69
69
|
- `ux-states-audit` when async/data-driven UI likely lacks loading, empty, or error states
|
|
70
70
|
- `docs-sync` when routed docs may be stale, missing, or inconsistent with the codebase
|
|
71
71
|
- `code-guide-audit` when a specific feature or file set needs a targeted coding-guide compliance check
|
|
72
|
+
- `break-it-qa` when a browser-facing feature should be attacked with invalid inputs, refreshes, repeated clicks, wrong action order, or other adversarial manual QA
|
|
72
73
|
- `workspace-compress` after meaningful chunks, before stopping, and before review when the live handoff needs compression
|
|
73
74
|
- `pre-pr-hygiene` before pushing or opening/updating a PR for substantial work
|
|
74
75
|
- `pr-review` once a PR has active review comments or automated review in progress
|
|
@@ -36,6 +36,7 @@ Working rules:
|
|
|
36
36
|
- Use the repo-local skills Waypoint ships for structured workflows when relevant
|
|
37
37
|
- Use `docs-sync` when the docs may be stale or a change altered shipped behavior, contracts, routes, or commands
|
|
38
38
|
- Use `code-guide-audit` for a targeted coding-guide compliance pass on a specific feature, file set, or change slice
|
|
39
|
+
- Use `break-it-qa` for browser-facing features that should be tested against invalid inputs, refreshes, repeated clicks, wrong navigation, and other adversarial user behavior
|
|
39
40
|
- If optional reviewer roles are present and you make a commit, run `code-reviewer` and `code-health-reviewer` in parallel before calling the work done
|
|
40
41
|
- Before pushing or opening/updating a PR for substantial work, use `pre-pr-hygiene`
|
|
41
42
|
- Use `pr-review` once a PR has active review comments or automated review in progress
|