npm - @windyroad/risk-scorer - Versions diffs - 0.3.5-preview.185 → 0.3.5-preview.188 - Mend

@windyroad/risk-scorer 0.3.5-preview.185 → 0.3.5-preview.188

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8) hide show

package/agents/pipeline.md +3 -2
package/agents/plan.md +3 -2
package/agents/test/risk-scorer-structured-remediations.bats +23 -0
package/agents/wip.md +3 -2
package/hooks/git-push-gate.sh +1 -1
package/hooks/lib/risk-gate.sh +1 -1
package/package.json +1 -1
package/skills/create-risk/SKILL.md +172 -0

package/agents/pipeline.md CHANGED Viewed

@@ -149,9 +149,9 @@ exceeds appetite. The only sanctioned above-appetite output is the Risk Report
 structure, `RISK_SCORES: ...`, and the structured `RISK_REMEDIATIONS:` block
 defined below.
-Emit a structured `RISK_REMEDIATIONS:` block after the `RISK_SCORES:` line. This gives the calling skill machine-readable input for structured decision prompts.
+Emit a structured `RISK_REMEDIATIONS:` block after the `RISK_SCORES:` line. This gives the calling skill machine-readable input.
-Format (5 columns — machine-readable for structured AskUserQuestion prompts in calling skills):
+Format (5 columns):
 ```
 RISK_REMEDIATIONS:
 - R1 | <description of remediation> | <effort S/M/L> | <risk_delta -N> | <files affected>
@@ -161,6 +161,7 @@ RISK_REMEDIATIONS:
 Column definitions:
 - **effort**: estimated size of the remediation — S (< 1h, single file), M (1-4h, few files), L (> 4h, multiple files)
 - **risk_delta**: estimated reduction in residual risk if this remediation is applied (e.g., `-3` means risk drops by 3 points)
+- **description**: free-form prose. The agent reads this and decides what to do. No structured action_class column.
 Include downstream back-pressure in the remediation list:
 - **Commit**: If adding this commit would push the push queue risk >= 5, include a remediation to split the commit.

package/agents/plan.md CHANGED Viewed

@@ -55,7 +55,7 @@ not policy-authorised — the only sanctioned FAIL output is the Plan Risk Repor
 the `RISK_VERDICT: FAIL` marker, and the structured `RISK_REMEDIATIONS:` block
 defined below.
-Emit a structured `RISK_REMEDIATIONS:` block after the verdict (5 columns — machine-readable for structured AskUserQuestion prompts in calling skills):
+Emit a structured `RISK_REMEDIATIONS:` block after the verdict (5 columns):
 ```
 RISK_REMEDIATIONS:
 - R1 | <description of what the plan must add/change> | <effort S/M/L> | <risk_delta -N> | <affected area>
@@ -64,8 +64,9 @@ RISK_REMEDIATIONS:
 Column definitions:
 - **effort**: estimated size of the remediation — S (< 1h, single file), M (1-4h, few files), L (> 4h, multiple files)
 - **risk_delta**: estimated reduction in residual risk if this remediation is applied
+- **description**: free-form prose. The agent reads this and decides what to do. No structured action_class column.
-Do NOT emit free-text "consider" or "you should" prose. The structured block is the only output for above-appetite guidance.
+Do NOT emit free-text "consider" or "you should" prose outside the structured block. The `RISK_REMEDIATIONS:` block is the only output for above-appetite guidance.
 ## Control Discovery

package/agents/test/risk-scorer-structured-remediations.bats CHANGED Viewed

@@ -98,3 +98,26 @@ setup() {
   run grep -q "risk_delta" "$PLAN"
   [ "$status" -eq 0 ]
 }
+# ──────────────────────────────────────────────────────────────────────────────
+# P108: scorer writes prose descriptions; agent decides (ADR-042 Rule 2a)
+# ──────────────────────────────────────────────────────────────────────────────
+@test "pipeline.md RISK_REMEDIATIONS format has no action_class column" {
+  # ADR-042 Rule 2a: no structured action_class column. The agent reads
+  # the description and decides. Match only markdown-table column-header
+  # rows so prose mentions of "action_class" (e.g. "No structured
+  # action_class column.") do not trip the assertion (P114).
+  run grep -qE '^\| *action_class\b' "$PIPELINE"
+  [ "$status" -ne 0 ]
+}
+@test "wip.md RISK_REMEDIATIONS format has no action_class column" {
+  run grep -qE '^\| *action_class\b' "$WIP"
+  [ "$status" -ne 0 ]
+}
+@test "plan.md RISK_REMEDIATIONS format has no action_class column" {
+  run grep -qE '^\| *action_class\b' "$PLAN"
+  [ "$status" -ne 0 ]
+}

package/agents/wip.md CHANGED Viewed

@@ -61,7 +61,7 @@ structured `RISK_REMEDIATIONS:` block defined below.
 Provide the assessment table, then emit a structured `RISK_REMEDIATIONS:` block with specific risk-reducing actions:
-Format (5 columns — machine-readable for structured AskUserQuestion prompts in calling skills):
+Format (5 columns):
 ```
 RISK_REMEDIATIONS:
 - R1 | Commit current changes to move WIP forward | S | -2 | <uncommitted files>
@@ -73,8 +73,9 @@ RISK_REMEDIATIONS:
 Column definitions:
 - **effort**: estimated size of the remediation — S (< 1h, single file), M (1-4h, few files), L (> 4h, multiple files)
 - **risk_delta**: estimated reduction in residual risk if this remediation is applied (e.g., `-3` means risk drops by 3 points)
+- **description**: free-form prose. The agent reads this and decides what to do. No structured action_class column.
-Do NOT emit free-text suggestions as prose. The structured block is the only output for above-appetite guidance.
+Do NOT emit free-text suggestions outside the structured block. The `RISK_REMEDIATIONS:` block is the only output for above-appetite guidance.
 The verdict is `RISK_VERDICT: PAUSE`. This blocks the next edit until the risk is addressed.

package/hooks/git-push-gate.sh CHANGED Viewed

@@ -53,7 +53,7 @@ if echo "$COMMAND" | grep -qE '(^|;|&&|\|\|)\s*npm run push:watch(\s|$)'; then
         PUSH_NOW=$(date +%s)
         PUSH_SCORE_TIME=$(_mtime "$PUSH_SCORE_FILE")
         PUSH_AGE=$(( PUSH_NOW - PUSH_SCORE_TIME ))
-        PUSH_TTL="${RISK_TTL:-1800}"
+        PUSH_TTL="${RISK_TTL:-3600}"
         if [ "$PUSH_AGE" -ge "$PUSH_TTL" ]; then
             risk_gate_deny "Push blocked: Push risk score expired (${PUSH_AGE}s old, TTL ${PUSH_TTL}s). Delegate to risk-scorer to rescore."
             exit 0

package/hooks/lib/risk-gate.sh CHANGED Viewed

@@ -17,7 +17,7 @@ check_risk_gate() {
   RDIR=$(_risk_dir "$SESSION_ID")
   local SCORE_FILE="${RDIR}/${ACTION}"
   local HASH_FILE="${RDIR}/state-hash"
-  local TTL_SECONDS="${RISK_TTL:-1800}"
+  local TTL_SECONDS="${RISK_TTL:-3600}"
   # 1. Score file must exist (fail-closed)
   if [ ! -f "$SCORE_FILE" ]; then

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@windyroad/risk-scorer",
-  "version": "0.3.5-preview.185",
+  "version": "0.3.5-preview.188",
   "description": "Pipeline risk scoring, commit/push gates, and secret leak detection",
   "bin": {
     "windyroad-risk-scorer": "./bin/install.mjs"

package/skills/create-risk/SKILL.md ADDED Viewed

@@ -0,0 +1,172 @@
+---
+name: wr-risk-scorer:create-risk
+description: Create a new standing-risk entry in docs/risks/. Examines existing risks, gathers impact/likelihood/controls from the user, writes a file matching docs/risks/TEMPLATE.md, and updates the register index.
+allowed-tools: Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
+---
+# Risk Register Entry Generator
+Create a new standing-risk file in `docs/risks/` following the format defined by `docs/risks/TEMPLATE.md`. The register captures persistent risks (distinct from the ephemeral per-change reports in `.risk-reports/`), and its criteria come from `RISK-POLICY.md`.
+This skill is the invocation surface for populating the register (scaffolded by P033; populated per P102). Per ADR-015, it is a plugin-namespaced on-demand skill. Per ADR-014, the skill commits its own work.
+## Steps
+### 1. Discover existing risks
+Scan for existing risk files:
+- Glob `docs/risks/R*.md` (skip `README.md`, `TEMPLATE.md`)
+- Note the highest numbered risk to determine the next sequence number
+- Read any risks related to the topic being discussed (if the user has mentioned a topic)
+- If `docs/risks/` does not exist, explain that `/wr-risk-scorer:update-policy` must be run first (it ships the scaffolding) and stop
+### 2. Gather context from the user
+You MUST use the AskUserQuestion tool to collect context that cannot be derived. Do not proceed to step 3 until you have answers. Apply ADR-013 Rule 6 non-interactive defaults if the tool is unavailable (AFK mode): choose the most conservative option for each question and note auto-selection in the output.
+Auto-derive where possible (do not ask):
+- **ID number** — next free slot per step 3 (do not ask per `feedback_dont_ask_trivial_id_choices.md`).
+- **Today's date** — use the current date for `Identified` and `Last reviewed`.
+- **Category** — infer from description keywords where unambiguous: "token", "secret", "leak" → `infosec`; "install", "hook", "pipeline" → `operational`. Confirm only if ambiguous.
+- **Next review** — default to 6 months from today.
+Ask the user (one AskUserQuestion call with grouped questions):
+1. **What is the risk?** A short title and 1-2 paragraph description — what could go wrong, for whom, and why it matters. This is the condition, not the control.
+2. **Impact level (from `RISK-POLICY.md`)?** 1 Negligible · 2 Minor · 3 Moderate · 4 Significant · 5 Severe. Read the policy's Impact table to the user if they need the descriptions.
+3. **Likelihood level?** 1 Rare · 2 Unlikely · 3 Possible · 4 Likely · 5 Almost certain.
+4. **Existing controls?** Each control names what it does and where it is implemented (file path or `ADR-NNN`). If none, leave empty.
+5. **Residual impact and likelihood** (after controls). If controls are minimal, residual = inherent — do not fabricate reductions. Per ADR-026, quantitative reduction claims must cite evidence (test, hook gate, pipeline report). If no evidence, state "Residual same as inherent pending control evidence" in the Treatment section and set residual = inherent.
+6. **Treatment choice?** Accept · Mitigate · Transfer · Avoid. Include brief justification.
+7. **Owner?** Persona or role (e.g. `solo-developer`, `plugin-maintainer`, `tech-lead`).
+If the user has already provided this context in the conversation (e.g. as arguments, or as part of a pipeline-finding hand-off), use what they have given and only ask about what is missing.
+### 3. Determine sequence number and filename
+- Next number = **max of the local and origin highest risk numbers**, plus 1 (or 001 if none exist).
+- Filename: `R<NNN>-<kebab-case-title>.active.md`
+- Pad the number to 3 digits (001, 002, ... 010, 011, etc.)
+**Why compare against origin?** Per ADR-019 confirmation criterion 2, ticket-creator skills MUST re-check next-number assignment against `git ls-tree origin/<base>` before assigning. Without it, parallel sessions can mint the same ID for different risks, causing a destructive surgical rebase on push.
+```bash
+# Local-max number
+local_max=$(ls docs/risks/R*.md 2>/dev/null | sed 's/.*\///' | grep -oE '^R[0-9]+' | sed 's/^R//' | sort -n | tail -1)
+# Origin-max number — reads remote-tracking ref. `--name-only` required per P056
+# to avoid false-matches on blob SHAs.
+origin_max=$(git ls-tree --name-only origin/main docs/risks/ 2>/dev/null | sed 's|^docs/risks/||' | grep -oE '^R[0-9]+' | sed 's/^R//' | sort -n | tail -1)
+# Take the max of the two and increment.
+next=$(printf '%03d' $(( $(echo -e "${local_max:-0}\n${origin_max:-0}" | sort -n | tail -1) + 1 )))
+```
+If the local choice would have collided with an origin risk file created since the last fetch, the `git ls-tree` lookup catches it here and the renumber is automatic. Log the renumber in the user-facing report (e.g. "Bumped next risk number from R012 → R013 to avoid collision with origin").
+### 4. Compute scores and bands
+Use the Risk Matrix from `RISK-POLICY.md`:
+- **Inherent Score** = Impact × Likelihood
+- **Residual Score** = Impact × Likelihood (after controls)
+- **Band** (for each) per the Label Bands table: 1-2 Very Low · 3-4 Low · 5-9 Medium · 10-16 High · 17-25 Very High
+- **Within appetite?** = residual score ≤ `RISK-POLICY.md`'s appetite threshold (read the threshold at runtime; do not hardcode)
+### 5. Write the risk file
+Write the file to `docs/risks/` using the structure from `TEMPLATE.md`:
+```markdown
+# Risk R<NNN>: <Title>
+**Status**: Active
+**Category**: <infosec | operational | brand | delivery>
+**Identified**: <YYYY-MM-DD>
+**Owner**: <persona or role>
+**Last reviewed**: <YYYY-MM-DD>
+**Next review**: <YYYY-MM-DD + 6 months>
+## Description
+<1-2 paragraph description from step 2.>
+## Inherent Risk
+Impact × Likelihood *before* controls.
+- **Impact**: <level> (<label>)
+- **Likelihood**: <level> (<label>)
+- **Inherent Score**: <product>
+- **Inherent Band**: <band>
+## Controls
+- **<control-name>** — <what it does>. Implemented in <file path or ADR-NNN>.
+## Residual Risk
+Impact × Likelihood *after* controls.
+- **Impact**: <level> (<label>)
+- **Likelihood**: <level> (<label>)
+- **Residual Score**: <product>
+- **Residual Band**: <band>
+- **Within appetite?**: <Yes | No>
+## Treatment
+<Accept | Mitigate | Transfer | Avoid>. <Justification.>
+## Monitoring
+- **Trigger to re-assess**: <event or threshold>
+- **Metrics**: <if any>
+## Related
+- Criteria: `RISK-POLICY.md`
+- Realised-as: <links to `docs/problems/P<NNN>` if any>
+- Treatment ADRs: <links if any>
+- Personas affected: <links to `docs/jtbd/<persona>/persona.md`>
+## Change Log
+- <YYYY-MM-DD>: Initial identification.
+```
+### 6. Update the register index
+`docs/risks/README.md` has a **Register** table that MUST reflect the new risk. Append a row with the following columns:
+```
+| R<NNN> | <Title> | <Category> | <Inherent Score> | <Residual Score> | <Treatment verb> | <Owner> | <Next review date> |
+```
+This step is not optional: the README drifts from the register without it, and the ISO 27001 audit signal depends on the index being accurate.
+### 7. Confirm with the user
+Present the written file path, inherent/residual bands, and any `Within appetite?: No` flag. Ask via AskUserQuestion:
+1. Does the description accurately capture the risk?
+2. Are the inherent and residual scores defensible?
+3. Is the treatment choice appropriate for the residual band?
+4. Should the owner or next review date be adjusted?
+Apply any feedback by editing the file and re-updating the README row if scores/treatment change.
+### 8. Commit the risk (ADR-014)
+Per ADR-014, this skill commits its own work. Stage both files and commit:
+```bash
+git add docs/risks/R<NNN>-<title>.active.md docs/risks/README.md
+git commit -m "docs(risks): open R<NNN> <title>"
+```
+The commit message convention `docs(risks): open R<NNN> <title>` matches `docs/risks/README.md` step 6 and mirrors `docs(problems): open P<NNN>` used by `/wr-itil:manage-problem`.
+If the commit-gate pattern-matches `git commit` text and blocks, run `/wr-risk-scorer:assess-release` first to produce a fresh pipeline marker, then retry the commit.
+$ARGUMENTS