orchestrix-skills 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,71 @@
1
+ ---
2
+ name: review-code
3
+ description: Use after implementing a task or feature and before merging, to check a diff against its spec and for quality.
4
+ license: MIT
5
+ allowed-tools: [Read, Bash, Grep, Glob]
6
+ metadata:
7
+ contract:
8
+ inputs: [diff, spec]
9
+ reads: [taste/coding-standards]
10
+ outputs: [review_report]
11
+ authority: "Read-only on code. Produces a report. No edits, no commits, no production."
12
+ verify: "Report contains both verdicts: spec compliance AND code quality."
13
+ accept:
14
+ when: "never — findings route back to implement; the human reviews at the end batch."
15
+ timing: deferred
16
+ ---
17
+
18
+ # Review Code
19
+
20
+ Review a diff against its spec with fresh eyes. Two verdicts, in order: does it
21
+ do what was asked, then is it well built.
22
+
23
+ **Core principle:** Review the work product against the spec — not the author's
24
+ reasoning. You get the diff and the spec, not the session that produced them.
25
+
26
+ ## Inputs
27
+
28
+ - `diff` — the change to review (a base..head range or a diff file).
29
+ - `spec` — the story / acceptance criteria the change must satisfy.
30
+
31
+ Read the spec first. It is your attention lens. Then read the full diff.
32
+
33
+ ## Verdict 1 — Spec compliance
34
+
35
+ For each acceptance criterion: met, missing, or extra (built but not asked for).
36
+
37
+ - **Missing** a criterion → spec verdict is FAIL.
38
+ - **Extra** scope not in the spec → flag it; over-building is a defect.
39
+
40
+ ## Verdict 2 — Code quality
41
+
42
+ Check, against `taste/coding-standards`:
43
+
44
+ - Correctness and edge cases the tests miss
45
+ - Duplication, unclear names, dead code
46
+ - Tests that assert nothing or test mocks instead of behavior
47
+ - Security / data-loss / trust-boundary issues (always Critical)
48
+
49
+ Rate each finding: **Critical** (must fix) · **Important** (fix before merge) ·
50
+ **Minor** (note it).
51
+
52
+ ## Output: `review_report`
53
+
54
+ ```yaml
55
+ spec_compliance: pass # pass | fail
56
+ missing: [<AC not met>]
57
+ extra: [<scope not requested>]
58
+ findings:
59
+ - { severity: Important, where: src/x.ts:40, issue: "...", fix: "..." }
60
+ verdict: changes_requested # approved | changes_requested
61
+ ```
62
+
63
+ ## Rules
64
+
65
+ - **Do not pre-judge.** Never decide a finding is fine because "the spec said
66
+ so" — surface it; conflicts with the spec are the human's call, not yours.
67
+ - **Be specific.** Every finding has a location and a concrete fix.
68
+ - **Findings route back, not to the human.** Critical/Important findings go to
69
+ `implement` (re-run with this report as `qa_feedback`). The human sees the
70
+ result at the end-of-run acceptance, not each review.
71
+ - If the diff is clean, say so plainly and approve.
@@ -0,0 +1,73 @@
1
+ ---
2
+ name: run-tests
3
+ description: Use before claiming any work is complete, fixed, or passing, and before committing or handing off.
4
+ license: MIT
5
+ allowed-tools: [Read, Bash]
6
+ metadata:
7
+ contract:
8
+ inputs: [target, expected_outcome?]
9
+ reads: []
10
+ outputs: [verification_report]
11
+ authority: "Run test, lint, and build commands. Read-only on source. No edits, no production."
12
+ verify: "self — this skill IS the objective verification primitive."
13
+ accept:
14
+ when: "never"
15
+ timing: deferred
16
+ ---
17
+
18
+ # Run Tests (Verify Before Completion)
19
+
20
+ Claiming work is done without running the check is a false claim, not a shortcut.
21
+
22
+ **Core principle:** Evidence before claims, always. If you did not run the
23
+ command in this step, you cannot say it passes.
24
+
25
+ ## The Iron Law
26
+
27
+ ```
28
+ NO COMPLETION CLAIM WITHOUT FRESH VERIFICATION EVIDENCE
29
+ ```
30
+
31
+ ## The gate
32
+
33
+ ```
34
+ 1. IDENTIFY the command that proves the claim.
35
+ 2. RUN it fully and fresh (not a remembered run).
36
+ 3. READ the full output: exit code, pass/fail counts, warnings.
37
+ 4. DECIDE: output confirms the claim, or it does not.
38
+ 5. REPORT the claim WITH the evidence.
39
+ ```
40
+
41
+ ## Verify independently — do not trust self-reports
42
+
43
+ | Claim | Proof required | Not sufficient |
44
+ | ----------------------- | ----------------------------------- | ----------------------------- |
45
+ | Tests pass | Test command output: 0 failures | "should pass", a previous run |
46
+ | Build succeeds | Build command: exit 0 | linter passed |
47
+ | Bug fixed | Test of the original symptom passes | code changed, assumed fixed |
48
+ | Another skill succeeded | Its diff and tests checked here | its success report |
49
+
50
+ A regression test counts only after a red→green check: revert the fix, watch
51
+ the test fail, restore the fix, watch it pass.
52
+
53
+ ## Output: `verification_report`
54
+
55
+ ```yaml
56
+ target: <what was checked>
57
+ command: <exact command run>
58
+ exit_code: 0
59
+ result: { passed: 34, failed: 0, warnings: 0 }
60
+ verdict: pass # pass | fail
61
+ evidence: |
62
+ <relevant lines of real output>
63
+ ```
64
+
65
+ `verdict: fail` is a valid, honest result. Report it with evidence; never round
66
+ a failure up to a pass.
67
+
68
+ ## Red flags — stop
69
+
70
+ - "should", "probably", "looks correct", "Done!" before running the command
71
+ - Reusing an earlier run's result
72
+ - Partial check used to claim full success
73
+ - Trusting "the agent said it worked"