npm - agentv - Versions diffs - 0.5.3 → 0.7.0 - Mend

agentv 0.5.3 → 0.7.0

This diff shows the content of publicly available package versions released to one of the supported registries. It is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8)

package/README.md +7 -3
package/dist/{chunk-5WBKOCCW.js → chunk-X6NSDSD2.js} +1608 -1102
package/dist/chunk-X6NSDSD2.js.map +1 -0
package/dist/cli.js +1 -1
package/dist/index.js +1 -1
package/dist/templates/agentv/targets.yaml +29 -29
package/package.json +3 -2
package/dist/chunk-5WBKOCCW.js.map +0 -1

package/README.md CHANGED

@@ -120,6 +120,9 @@ agentv eval "path/to/eval.yaml"
 # Override the eval file's target with CLI flag
 agentv eval --target vscode_projectx "path/to/eval.yaml"
+# Run multiple evals via glob
+agentv eval "path/to/evals/**/*.yaml"
 ```
 Run a specific eval case with custom targets path:
@@ -130,17 +133,18 @@ agentv eval --target vscode_projectx --targets "path/to/targets.yaml" --eval-id
 ### Command Line Options
-- `eval_file`: Path to eval YAML file (required, positional argument)
+- `eval_paths...`: Path(s) or glob(s) to eval YAML files (required; e.g., `evals/**/*.yaml`)
 - `--target TARGET`: Execution target name from targets.yaml (overrides target specified in eval file)
 - `--targets TARGETS`: Path to targets.yaml file (default: ./.agentv/targets.yaml)
 - `--eval-id EVAL_ID`: Run only the eval case with this specific ID
-- `--out OUTPUT_FILE`: Output file path (default: results/{evalname}_{timestamp}.jsonl)
+- `--out OUTPUT_FILE`: Output file path (default: .agentv/results/eval_<timestamp>.jsonl)
 - `--output-format FORMAT`: Output format: 'jsonl' or 'yaml' (default: jsonl)
 - `--dry-run`: Run with mock model for testing
 - `--agent-timeout SECONDS`: Timeout in seconds for agent response polling (default: 120)
 - `--max-retries COUNT`: Maximum number of retries for timeout cases (default: 2)
 - `--cache`: Enable caching of LLM responses (default: disabled)
 - `--dump-prompts`: Save all prompts to `.agentv/prompts/` directory
+- `--workers COUNT`: Parallel workers for eval cases (default: 3; target `workers` setting used when provided)
 - `--verbose`: Verbose output
 ### Target Selection Priority
@@ -153,7 +157,7 @@ The CLI determines which execution target to use with the following precedence:
 This allows eval files to specify their preferred target while still allowing command-line overrides for flexibility, and maintains backward compatibility with existing workflows.
-Output goes to `.agentv/results/{evalname}_{timestamp}.jsonl` (or `.yaml`) unless `--out` is provided.
+Output goes to `.agentv/results/eval_<timestamp>.jsonl` (or `.yaml`) unless `--out` is provided.
 ### Tips for VS Code Copilot Evals
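
Note on the quoted glob added in this README diff: the new example passes `"path/to/evals/**/*.yaml"` in quotes, and the `eval_paths...` option is described as accepting "Path(s) or glob(s)". That wording suggests the CLI expands the pattern itself, although this diff does not confirm it. The sketch below only illustrates the shell side of that difference. It does not call `agentv`, and the directory and file names in it are made up for the demonstration.

```shell
# Sketch: how the shell treats a quoted vs. an unquoted glob.
# Uses a throwaway directory with hypothetical file names; agentv is not invoked.
tmp="$(mktemp -d)"
mkdir -p "$tmp/evals/a" "$tmp/evals/b"
touch "$tmp/evals/a/one.yaml" "$tmp/evals/b/two.yaml"
cd "$tmp"

# Quoted: the pattern reaches the program unchanged, as a single argument.
set -- "evals/**/*.yaml"
quoted_count=$#
quoted_arg=$1

# Unquoted: the shell expands the pattern into one argument per matching file.
set -- evals/**/*.yaml
unquoted_count=$#

printf 'quoted: %s argument(s), first is %s\n' "$quoted_count" "$quoted_arg"
printf 'unquoted: %s argument(s)\n' "$unquoted_count"
```

With the quoted form, matching behavior does not depend on the user's shell settings (for example, whether bash's `globstar` option is enabled), which is one plausible reason the README quotes the pattern.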