@doidor/agentrig 0.11.1 → 0.11.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +12 -5
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -23,10 +23,11 @@ You keep **one** source of truth (`AGENTS.md` + rules + skills); AgentRig compil
23
23
  evolve, and ships a way to **evaluate the harness itself**.
24
24
 
25
25
  ```bash
26
- npx @doidor/agentrig init # investigate this repo, install a tailored harness, compile all surfaces
27
- npx @doidor/agentrig compile # re-project AGENTS.md + rules into every agent surface (local + remote)
28
- npx @doidor/agentrig eval # score the harness (deterministic, no model needed)
29
- npx @doidor/agentrig update # pull in the latest best practices
26
+ npx @doidor/agentrig init # investigate this repo, install a tailored harness, compile all surfaces
27
+ npx @doidor/agentrig compile # re-project AGENTS.md + rules into every agent surface (local + remote)
28
+ npx @doidor/agentrig eval --scaffold # auto-generate eval scenarios tailored to YOUR repo's stack
29
+ npx @doidor/agentrig eval # score the harness (add --static for the no-model audit)
30
+ npx @doidor/agentrig update # pull in the latest best practices
30
31
  ```
31
32
 
32
33
  > Or install it globally — `npm i -g @doidor/agentrig` — and run `agentrig <command>`.
@@ -105,6 +106,12 @@ the generated files. Commit them so remote agents (and teammates' tools) pick th
105
106
 
106
107
  ## Evaluating the harness itself
107
108
 
109
+ > **Generate eval scenarios tailored to your repo — one command.** `agentrig eval --scaffold` reads
110
+ > the investigation `init` wrote about your repo (`.agentrig/context.md`) and writes fixture-based
111
+ > scenarios that exercise *your* stack's real test runner, package manager, and defect patterns —
112
+ > then validates them against the axis registry. The bundled scenarios are only generic templates;
113
+ > this is how the eval kit becomes specific to *your* repo.
114
+
108
115
  This is a first-class feature, not an afterthought — and it's **repo-specific and runnable by you**.
109
116
  The eval kit installs into your repo (`.agentrig/eval/`) and is tailored to it during `init`, so you
110
117
  can measure whether AgentRig actually helps *here*. `agentrig eval` **defaults to the full agentic,
@@ -230,7 +237,7 @@ preserved-file decision in `.agentrig/state.json` so the next update doesn't re-
230
237
  | `agentrig compile [path]` | Project AGENTS.md + rules into every agent surface (local + remote); re-populates auto-maintained marker blocks (e.g. `AGENTRIG:skills-inventory`) |
231
238
  | `agentrig update [path] [--diff] [--auto-fix]` | Re-sync the latest best practices (re-compiles surfaces). `--diff` classifies preserved-file drift (🔴 broken / 🟡 stale / 🟢 enhancement / ⚪ mixed); `--auto-fix` runs `fix` after the refresh |
232
239
  | `agentrig fix [path] [--dry-run]` | Deterministically repair the install: restore broken YAML from canonical, replace unknown model ids with `auto`. No agent needed; `.bak` files are written before each change |
233
- | `agentrig eval [path] [--static\|--rubric] [--scenario id] [--variant name] [--n trials] [--producer-model id] [--judge-model id]` | Evaluate the harness (default: agentic; `--static` for the cheap CI-safe audit; `--rubric` shows what's measured) |
240
+ | `agentrig eval [path] [--scaffold] [--static\|--rubric] [--scenario id] [--variant name] [--n trials] [--producer-model id] [--judge-model id]` | Evaluate the harness (default: agentic; `--scaffold` generates repo-tailored scenarios from your stack; `--static` for the cheap CI-safe audit; `--rubric` shows what's measured) |
234
241
  | `agentrig dashboard [path] [--html [file]] [--no-tasks] [--json]` | Roster, live GitHub tasks, score, evals |
235
242
  | `agentrig doctor [path] [--json]` | Health check (installed? agent reachable? score? **validation findings**? **linked-checkout vs npm latest**?) |
236
243
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@doidor/agentrig",
3
- "version": "0.11.1",
3
+ "version": "0.11.2",
4
4
  "description": "AgentRig — an agentic meta-harness. A CLI that investigates a repository and installs (and evaluates) a best-practice agent harness.",
5
5
  "type": "module",
6
6
  "bin": {