ancoder-skill-cli 0.9.1 → 0.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -252,6 +252,67 @@ If you already have OMC installed via the Claude Code plugin marketplace, the sk
252
252
 
253
253
  The embedded OMC version is pinned to the release tagged in `embedded/omc/VERSION`. To bump it, re-run the sync workflow that downloads a fresh GitHub release tarball into `embedded/omc/` and rebuild.
254
254
 
255
+ ## Meta-Harness (experimental)
256
+
257
+ `meta-harness/` is a Python sub-project that implements the outer-loop harness
258
+ optimizer from [arXiv:2603.28052](https://arxiv.org/abs/2603.28052) (Stanford, 2026).
259
+
260
+ ### Architecture
261
+
262
+ ```
263
+ meta-harness search ← outer loop (Python, Claude Code proposer)
264
+
265
+ └─ skill-cli eval validate / run / ls / diff ← evaluator backend (Go)
266
+
267
+ └─ harness.py (user-supplied Python) ← inner execution layer
268
+ ```
269
+
270
+ **Two independent binaries — intentionally decoupled:**
271
+ - `skill-cli` knows nothing about `meta-harness`; it only runs harness candidates and emits scores/traces.
272
+ - `meta-harness` knows nothing about OMC internals; it calls `skill-cli` via CLI contract only.
273
+
274
+ ### Quick start
275
+
276
+ ```bash
277
+ # Build skill-cli
278
+ go build -o bin/skill-cli .
279
+
280
+ # Install meta-harness
281
+ cd meta-harness
282
+ python3 -m venv .venv && source .venv/bin/activate
283
+ pip install -e ".[dev]"
284
+
285
+ # Run smoke test (no API key needed)
286
+ cd ..
287
+ bash scripts/meta-harness-smoke.sh
288
+
289
+ # Real search (requires ANTHROPIC_API_KEY + claude CLI)
290
+ meta-harness search \
291
+ --suite meta-harness/domains/text_classification/suite.yaml \
292
+ --out search-runs/run-01 \
293
+ --max-iter 5 \
294
+ --k 2 \
295
+ --seed meta-harness/domains/text_classification/seeds/zero_shot.py \
296
+ --seed meta-harness/domains/text_classification/seeds/few_shot.py \
297
+ --skill-cli bin/skill-cli \
298
+ --samples 20
299
+ ```
300
+
301
+ ### CLI contract (skill-cli eval)
302
+
303
+ | Command | Description |
304
+ |---|---|
305
+ | `skill-cli eval validate <dir>` | Cheap structural check (exit 0 = valid) |
306
+ | `skill-cli eval run <dir> --suite <f> --out <d>` | Full eval → scores.json + traces/ |
307
+ | `skill-cli eval ls --store <d> [--pareto]` | List / filter candidates |
308
+ | `skill-cli eval diff <a> <b> --store <d>` | Code + score diff |
309
+
310
+ ### Tuning
311
+
312
+ The `meta-harness/src/meta_harness/skill.md` file is the most important lever on search quality.
313
+ Per Appendix D of the paper: run 3–5 short iterations (`--max-iter 3`) specifically to
314
+ debug and refine it before committing to a full run.
315
+
255
316
  ## License
256
317
 
257
318
  MIT
Binary file
Binary file
Binary file
Binary file
Binary file
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "ancoder-skill-cli",
3
- "version": "0.9.1",
3
+ "version": "0.10.0",
4
4
  "description": "CLI for managing everything-claude-code (ECC) components — agents, skills, commands, rules, hooks, MCP configs. Single binary, all assets embedded.",
5
5
  "bin": {
6
6
  "skill-cli": "bin/skill-cli.js"