@plaited/agent-eval-harness 0.11.0 → 0.12.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +5 -5
- package/package.json +2 -2
package/README.md
CHANGED
|
@@ -25,7 +25,7 @@ export ANTHROPIC_API_KEY=sk-... # For Claude
|
|
|
25
25
|
export GEMINI_API_KEY=... # For Gemini
|
|
26
26
|
```
|
|
27
27
|
|
|
28
|
-
Pre-built schemas are available in `.
|
|
28
|
+
Pre-built schemas are available in `.agents/skills/headless-adapters/schemas/` for Claude and Gemini.
|
|
29
29
|
|
|
30
30
|
### Core Commands
|
|
31
31
|
|
|
@@ -98,11 +98,11 @@ bunx @plaited/agent-eval-harness compare trials1.jsonl trials2.jsonl -o comparis
|
|
|
98
98
|
**Install skills** for use with AI coding agents:
|
|
99
99
|
|
|
100
100
|
```bash
|
|
101
|
-
|
|
101
|
+
npx skills add plaited/agent-eval-harness
|
|
102
|
+
# or
|
|
103
|
+
bunx skills add plaited/agent-eval-harness
|
|
102
104
|
```
|
|
103
105
|
|
|
104
|
-
Replace `<agent-name>` with your agent: `claude`, `cursor`, `copilot`, `opencode`, `amp`, `goose`, `factory`
|
|
105
|
-
|
|
106
106
|
### Available Skills
|
|
107
107
|
|
|
108
108
|
#### Agent Eval Harness
|
|
@@ -416,7 +416,7 @@ ANTHROPIC_API_KEY=sk-... GEMINI_API_KEY=... \
|
|
|
416
416
|
## Requirements
|
|
417
417
|
|
|
418
418
|
- **Runtime:** Bun >= 1.2.9
|
|
419
|
-
- **Schema:** JSON schema describing CLI agent interaction (see `.
|
|
419
|
+
- **Schema:** JSON schema describing CLI agent interaction (see `.agents/skills/headless-adapters/schemas/`)
|
|
420
420
|
- **API Key:** `ANTHROPIC_API_KEY` for Claude, `GEMINI_API_KEY` for Gemini
|
|
421
421
|
|
|
422
422
|
## License
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@plaited/agent-eval-harness",
|
|
3
|
-
"version": "0.
|
|
3
|
+
"version": "0.12.0",
|
|
4
4
|
"description": "CLI tool for capturing agent trajectories from headless CLI agents",
|
|
5
5
|
"license": "ISC",
|
|
6
6
|
"engines": {
|
|
@@ -56,7 +56,7 @@
|
|
|
56
56
|
]
|
|
57
57
|
},
|
|
58
58
|
"dependencies": {
|
|
59
|
-
"@plaited/development-skills": "0.
|
|
59
|
+
"@plaited/development-skills": "0.7.0",
|
|
60
60
|
"zod": "^4.3.6"
|
|
61
61
|
},
|
|
62
62
|
"devDependencies": {
|