npm - @xera-ai/skills - Versions diffs - 0.13.0 → 0.14.0 - Mend

@xera-ai/skills 0.13.0 → 0.14.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,13 @@
 # @xera-ai/skills
+## 0.14.0
+### Minor Changes
+- [#121](https://github.com/xera-ai/xera/pull/121) [`f1baccd`](https://github.com/xera-ai/xera/commit/f1baccd268379b22c366ea1a2563e4d4d67ce293) Thanks [@thanhtrinity](https://github.com/thanhtrinity)! - trustworthy coverage — SKIPPED bucket, additive AC backfill, normalize emits run.completed (auto-generated from [#121](https://github.com/xera-ai/xera/issues/121))
+## 0.13.1
 ## 0.13.0
 ### Minor Changes

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@xera-ai/skills",
-  "version": "0.13.0",
+  "version": "0.14.0",
   "files": [
     "*.md",
     "version.json"

package/xera-coverage.md CHANGED Viewed

@@ -48,7 +48,7 @@ Read `.xera/coverage/report.json`. If `acBackfillNeeded === true`:
 bun run xera:ac-coverage-backfill-prepare
 ```
-This writes `.xera/coverage/ac-backfill-input.json` listing tickets that have ACs + scenarios but no `satisfies` edges yet.
+This writes `.xera/coverage/ac-backfill-input.json` listing tickets with at least one **unmapped** scenario (a scenario that has no `satisfies` edge to any of its ticket's ACs). Tickets with partially mapped scenarios surface only their unmapped scenarios — finalize is additive per scenarioId (#119), so generating decisions for just the unmapped set will not clobber prior mappings.
 If the input file is `{ "tickets": [] }`, skip to Step 4 — there's nothing to backfill (the `acBackfillNeeded` flag in report.json may be a leftover stale state; re-running `coverage-prepare` will refresh it).

package/xera-exec.md CHANGED Viewed

@@ -18,12 +18,14 @@ The user invoked `/xera-exec <TICKET>`. If no key, ask.
 4. Suggest: "Diagnose this run with `/xera-report {{TICKET}}`."
-## Step 5 — Record graph events (v0.6)
+## Step 5 — Record graph events
-After Playwright reporter writes `runs/<RUN_ID>/reporter.json`:
+`/xera-report` calls `bun run xera:normalize {{TICKET}}` as its first step, which now emits the `run.completed` events for this run automatically (see #118). No explicit `graph-record exec` call is needed here.
+If you skip `/xera-report` (e.g. running `/xera-exec` standalone for a smoke check), trigger the same emission with:
 ```bash
-bun run xera:graph-record exec <TICKET> --run-id <RUN_ID>
+bun run xera:normalize <TICKET>
 ```
-Non-fatal.
+Non-fatal. (The lower-level `bun run xera:graph-record exec <TICKET> --run-id <RUN_ID>` still works for manual replay, but produces duplicate events if `xera:normalize` already ran for the same run.)

package/xera-report.md CHANGED Viewed

@@ -26,7 +26,7 @@ Step 4 below is *cognitive work that YOU, the session, must do*. It is not a she
    - `node_modules/@xera-ai/prompts/diagnose-failure.md` (the prompt template — read it in full; the rest of step 4 follows ITS rules)
 4. **Classify (YOUR job, no CLI shortcut here).** Follow `diagnose-failure.md`'s decision algorithm scenario-by-scenario. For each scenario in `normalized.json`, decide:
-   - `class`: one of `PASS`, `REAL_BUG`, `SELECTOR_DRIFT`, `FLAKY`, `TEST_BUG`
+   - `class`: one of `PASS`, `SKIPPED`, `REAL_BUG`, `SELECTOR_DRIFT`, `FLAKY`, `TEST_BUG`. If `outcome === "SKIPPED"`, set `class: "SKIPPED"` — never `PASS`, because skipped scenarios do not verify their AC and coverage will over-report.
    - `confidence`: `low`, `medium`, or `high`
    - `rationale`: 1–3 sentences in English citing concrete evidence (URL, HTTP status, element name, prior run timestamps, hash drift, etc.)
@@ -39,7 +39,7 @@ Step 4 below is *cognitive work that YOU, the session, must do*. It is not a she
        {
          "name": "<scenario name>",
          "outcome": "PASS" | "FAIL" | "SKIPPED",
-         "class": "PASS" | "REAL_BUG" | "SELECTOR_DRIFT" | "FLAKY" | "TEST_BUG",
+         "class": "PASS" | "SKIPPED" | "REAL_BUG" | "SELECTOR_DRIFT" | "FLAKY" | "TEST_BUG",
          "confidence": "low" | "medium" | "high",
          "rationale": "..."
        }

package/xera-run.md CHANGED Viewed

@@ -65,7 +65,7 @@ Run `bun run xera:exec {{TICKET}}`.
 ## Step 5 — Normalize
-Run `bun run xera:normalize {{TICKET}}`.
+Run `bun run xera:normalize {{TICKET}}`. This writes `normalized.json` AND emits `run.completed` events to the graph for every PASS/FAIL scenario in the run, so `latest_failures` and risk scoring stay in sync with reality (see #118 — earlier versions silently rendered failed scenarios green on `graph.html`).
 ## Step 6 — Diagnose + report + post