@doidor/agentrig 0.11.1 → 0.11.2
- package/README.md +12 -5
- package/package.json +1 -1
package/README.md
CHANGED
@@ -23,10 +23,11 @@ You keep **one** source of truth (`AGENTS.md` + rules + skills); AgentRig compil
 evolve, and ships a way to **evaluate the harness itself**.
 
 ```bash
-npx @doidor/agentrig init
-npx @doidor/agentrig compile
-npx @doidor/agentrig eval
-npx @doidor/agentrig
+npx @doidor/agentrig init # investigate this repo, install a tailored harness, compile all surfaces
+npx @doidor/agentrig compile # re-project AGENTS.md + rules into every agent surface (local + remote)
+npx @doidor/agentrig eval --scaffold # auto-generate eval scenarios tailored to YOUR repo's stack
+npx @doidor/agentrig eval # score the harness (add --static for the no-model audit)
+npx @doidor/agentrig update # pull in the latest best practices
 ```
 
 > Or install it globally — `npm i -g @doidor/agentrig` — and run `agentrig <command>`.
@@ -105,6 +106,12 @@ the generated files. Commit them so remote agents (and teammates' tools) pick th
 
 ## Evaluating the harness itself
 
+> **Generate eval scenarios tailored to your repo — one command.** `agentrig eval --scaffold` reads
+> the investigation `init` wrote about your repo (`.agentrig/context.md`) and writes fixture-based
+> scenarios that exercise *your* stack's real test runner, package manager, and defect patterns —
+> then validates them against the axis registry. The bundled scenarios are only generic templates;
+> this is how the eval kit becomes specific to *your* repo.
+
 This is a first-class feature, not an afterthought — and it's **repo-specific and runnable by you**.
 The eval kit installs into your repo (`.agentrig/eval/`) and is tailored to it during `init`, so you
 can measure whether AgentRig actually helps *here*. `agentrig eval` **defaults to the full agentic,
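Reviewer sketch for the hunk above: the added README text says `eval --scaffold` reads the investigation that `init` wrote to `.agentrig/context.md`, so a wrapper script could check for that file before choosing which command to run. The helper below is hypothetical and not part of AgentRig. The name `next_eval_step` and the assumption that a missing `context.md` means `init` has not run yet are illustrative only. It prints the suggested command instead of executing it.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with AgentRig): print the command to run next.
# Assumption: a missing .agentrig/context.md means `init` has not been run yet.
next_eval_step() {
  repo="${1:-.}"   # repository root; defaults to the current directory
  if [ -f "$repo/.agentrig/context.md" ]; then
    # The investigation exists, so repo-tailored scenarios can be generated.
    echo "npx @doidor/agentrig eval --scaffold"
  else
    # No investigation yet: init is what writes .agentrig/context.md.
    echo "npx @doidor/agentrig init"
  fi
}
```

Calling `next_eval_step /path/to/repo` prints one of the two commands, both of which appear in the README hunk above. Printing rather than executing keeps the sketch free of side effects.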
@@ -230,7 +237,7 @@ preserved-file decision in `.agentrig/state.json` so the next update doesn't re-
 | `agentrig compile [path]` | Project AGENTS.md + rules into every agent surface (local + remote); re-populates auto-maintained marker blocks (e.g. `AGENTRIG:skills-inventory`) |
 | `agentrig update [path] [--diff] [--auto-fix]` | Re-sync the latest best practices (re-compiles surfaces). `--diff` classifies preserved-file drift (🔴 broken / 🟡 stale / 🟢 enhancement / ⚪ mixed); `--auto-fix` runs `fix` after the refresh |
 | `agentrig fix [path] [--dry-run]` | Deterministically repair the install: restore broken YAML from canonical, replace unknown model ids with `auto`. No agent needed; `.bak` files are written before each change |
-| `agentrig eval [path] [--static\|--rubric] [--scenario id] [--variant name] [--n trials] [--producer-model id] [--judge-model id]` | Evaluate the harness (default: agentic; `--static` for the cheap CI-safe audit; `--rubric` shows what's measured) |
+| `agentrig eval [path] [--scaffold] [--static\|--rubric] [--scenario id] [--variant name] [--n trials] [--producer-model id] [--judge-model id]` | Evaluate the harness (default: agentic; `--scaffold` generates repo-tailored scenarios from your stack; `--static` for the cheap CI-safe audit; `--rubric` shows what's measured) |
 | `agentrig dashboard [path] [--html [file]] [--no-tasks] [--json]` | Roster, live GitHub tasks, score, evals |
 | `agentrig doctor [path] [--json]` | Health check (installed? agent reachable? score? **validation findings**? **linked-checkout vs npm latest**?) |
 
package/package.json
CHANGED