@plaited/acp 0.1.0-next.1 → 0.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +7 -14
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -42,9 +42,9 @@ console.log(summary.text, summary.completedToolCalls)
|
|
|
42
42
|
await client.disconnect()
|
|
43
43
|
```
|
|
44
44
|
|
|
45
|
-
## Recommended: Use the Bundled
|
|
45
|
+
## Recommended: Use the Bundled Skill
|
|
46
46
|
|
|
47
|
-
This package includes a comprehensive **
|
|
47
|
+
This package includes a comprehensive **acp-harness skill** designed for AI-assisted evaluation development. The skill provides:
|
|
48
48
|
|
|
49
49
|
- Complete API reference for `createACPClient` and helpers
|
|
50
50
|
- Harness CLI usage with all options and examples
|
|
@@ -53,28 +53,21 @@ This package includes a comprehensive **eval-harness plugin** designed for AI-as
|
|
|
53
53
|
- Downstream integration patterns (Braintrust, jq, custom scorers)
|
|
54
54
|
- Docker execution guidance
|
|
55
55
|
|
|
56
|
-
### Install the
|
|
57
|
-
|
|
58
|
-
**Claude Code:**
|
|
59
|
-
|
|
60
|
-
```
|
|
61
|
-
/plugin marketplace add plaited/acp-harness
|
|
62
|
-
```
|
|
63
|
-
|
|
64
|
-
**Other agents (Gemini CLI, GitHub Copilot, Cursor, OpenCode, Amp, Goose, Factory):**
|
|
56
|
+
### Install the Skill
|
|
65
57
|
|
|
66
58
|
```bash
|
|
67
|
-
|
|
59
|
+
# For Claude Code, Cursor, OpenCode, Amp, Goose, or Factory
|
|
60
|
+
curl -sSL https://raw.githubusercontent.com/plaited/acp-harness/main/scripts/install-acp.sh | bash
|
|
68
61
|
```
|
|
69
62
|
|
|
70
|
-
Once installed, the
|
|
63
|
+
Once installed, the skill auto-activates when working on evaluation tasks. Ask your AI agent to help you:
|
|
71
64
|
|
|
72
65
|
- Set up evaluation prompts
|
|
73
66
|
- Configure the harness CLI
|
|
74
67
|
- Design scoring pipelines
|
|
75
68
|
- Integrate with Braintrust or custom analysis tools
|
|
76
69
|
|
|
77
|
-
The
|
|
70
|
+
The skill contains everything needed to build agent evaluations - use it as your primary reference.
|
|
78
71
|
|
|
79
72
|
## Development
|
|
80
73
|
|