ancoder-skill-cli 0.9.1 → 0.10.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md
CHANGED
|
@@ -252,6 +252,67 @@ If you already have OMC installed via the Claude Code plugin marketplace, the sk
|
|
|
252
252
|
|
|
253
253
|
The embedded OMC version is pinned to the release tagged in `embedded/omc/VERSION`. To bump it, re-run the sync workflow that downloads a fresh GitHub release tarball into `embedded/omc/` and rebuild.
|
|
254
254
|
|
|
255
|
+
## Meta-Harness (experimental)
|
|
256
|
+
|
|
257
|
+
`meta-harness/` is a Python sub-project that implements the outer-loop harness
|
|
258
|
+
optimizer from [arXiv:2603.28052](https://arxiv.org/abs/2603.28052) (Stanford, 2026).
|
|
259
|
+
|
|
260
|
+
### Architecture
|
|
261
|
+
|
|
262
|
+
```
|
|
263
|
+
meta-harness search ← outer loop (Python, Claude Code proposer)
|
|
264
|
+
│
|
|
265
|
+
└─ skill-cli eval validate / run / ls / diff ← evaluator backend (Go)
|
|
266
|
+
│
|
|
267
|
+
└─ harness.py (user-supplied Python) ← inner execution layer
|
|
268
|
+
```
|
|
269
|
+
|
|
270
|
+
**Two independent binaries — intentionally decoupled:**
|
|
271
|
+
- `skill-cli` knows nothing about `meta-harness`; it only runs harness candidates and emits scores/traces.
|
|
272
|
+
- `meta-harness` knows nothing about OMC internals; it calls `skill-cli` via CLI contract only.
|
|
273
|
+
|
|
274
|
+
### Quick start
|
|
275
|
+
|
|
276
|
+
```bash
|
|
277
|
+
# Build skill-cli
|
|
278
|
+
go build -o bin/skill-cli .
|
|
279
|
+
|
|
280
|
+
# Install meta-harness
|
|
281
|
+
cd meta-harness
|
|
282
|
+
python3 -m venv .venv && source .venv/bin/activate
|
|
283
|
+
pip install -e ".[dev]"
|
|
284
|
+
|
|
285
|
+
# Run smoke test (no API key needed)
|
|
286
|
+
cd ..
|
|
287
|
+
bash scripts/meta-harness-smoke.sh
|
|
288
|
+
|
|
289
|
+
# Real search (requires ANTHROPIC_API_KEY + claude CLI)
|
|
290
|
+
meta-harness search \
|
|
291
|
+
--suite meta-harness/domains/text_classification/suite.yaml \
|
|
292
|
+
--out search-runs/run-01 \
|
|
293
|
+
--max-iter 5 \
|
|
294
|
+
--k 2 \
|
|
295
|
+
--seed meta-harness/domains/text_classification/seeds/zero_shot.py \
|
|
296
|
+
--seed meta-harness/domains/text_classification/seeds/few_shot.py \
|
|
297
|
+
--skill-cli bin/skill-cli \
|
|
298
|
+
--samples 20
|
|
299
|
+
```
|
|
300
|
+
|
|
301
|
+
### CLI contract (skill-cli eval)
|
|
302
|
+
|
|
303
|
+
| Command | Description |
|
|
304
|
+
|---|---|
|
|
305
|
+
| `skill-cli eval validate <dir>` | Cheap structural check (exit 0 = valid) |
|
|
306
|
+
| `skill-cli eval run <dir> --suite <f> --out <d>` | Full eval → scores.json + traces/ |
|
|
307
|
+
| `skill-cli eval ls --store <d> [--pareto]` | List / filter candidates |
|
|
308
|
+
| `skill-cli eval diff <a> <b> --store <d>` | Code + score diff |
|
|
309
|
+
|
|
310
|
+
### Tuning
|
|
311
|
+
|
|
312
|
+
The `meta-harness/src/meta_harness/skill.md` file is the most important lever on search quality.
|
|
313
|
+
Per Appendix D of the paper: run 3–5 short iterations (`--max-iter 3`) specifically to
|
|
314
|
+
debug and refine it before committing to a full run.
|
|
315
|
+
|
|
255
316
|
## License
|
|
256
317
|
|
|
257
318
|
MIT
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "ancoder-skill-cli",
|
|
3
|
-
"version": "0.
|
|
3
|
+
"version": "0.10.0",
|
|
4
4
|
"description": "CLI for managing everything-claude-code (ECC) components — agents, skills, commands, rules, hooks, MCP configs. Single binary, all assets embedded.",
|
|
5
5
|
"bin": {
|
|
6
6
|
"skill-cli": "bin/skill-cli.js"
|