npm - @verica-app/cli - Versions diffs - 0.1.0 - Mend

@verica-app/cli 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md ADDED Viewed

@@ -0,0 +1,77 @@
+# @verica-app/cli
+Run a [Verica](https://verica.app) eval from CI and **gate the merge on the result**.
+The eval — criteria, golden set, graders, and the pass condition — lives in Verica.
+The CLI just triggers a run, waits for the verdict, writes a JUnit report, and exits
+`0`/`1` so your pipeline can block the merge. It's a thin HTTP client: the only secret
+it needs is a workspace token (provider keys stay in Verica via BYOK).
+## Install
+No install needed — run it with `npx`:
+```bash
+npx @verica-app/cli run --eval <eval-id> --prompt prompts/agent.txt --wait --junit report.xml
+```
+Or add it as a dev dependency:
+```bash
+npm i -D @verica-app/cli
+```
+## Usage
+```bash
+verica run \
+  --eval eval_8x2k9d \
+  --prompt prompts/support-agent.txt \
+  --model gpt-4.1-mini \
+  --wait \
+  --junit verica-results.xml \
+  --json
+```
+Multi-prompt via a manifest:
+```bash
+verica run --manifest .verica.yml --wait --junit report.xml
+```
+```yaml
+# .verica.yml
+evals:
+  - id: eval_8x2k9d
+    prompt: prompts/support-agent.txt
+    sampling: { temperature: 0.2, maxTokens: 512 }
+    model: gpt-4.1-mini
+  - id: eval_3p1m7q
+    prompt: prompts/triage.txt
+    model: claude-sonnet-4-6
+```
+## Environment
+| Var               | Required | Notes                                        |
+| ----------------- | -------- | -------------------------------------------- |
+| `VERICA_TOKEN`    | yes      | Workspace API token (Settings → API tokens). |
+| `VERICA_BASE_URL` | yes\*    | Your Verica base URL (or pass `--base-url`). |
+## Key flags
+- `--eval <id>` / `--manifest <file>` — what to run.
+- `--prompt <file>` / `--system-prompt <file>` — prompt content to push (versioned by content).
+- `--model <model>` · `--sampling <file.json>` — execution config.
+- `--wait` — poll to completion; the exit code reflects the gate.
+- `--junit <file>` · `--junit-mode rows|gate` — JUnit report (default `rows`).
+- `--json` — machine-readable results on stdout.
+- `--threshold <0..1>` · `--baseline-ref <ref>` · `--baseline-run <id>` — override the gate per branch.
+- `--git-sha` / `--git-ref` — provenance (auto-detected from CI env otherwise).
+## Exit codes
+`0` passed · `1` gate failed · `2` validation/transport error.
+MIT licensed. There's no IP in the client — the engine, graders, gate, and crypto all
+run server-side behind the token API.