agent-gauntlet 0.10.0 → 0.11.0
- package/README.md +25 -23
- package/dist/index.js +9226 -0
- package/dist/index.js.map +65 -0
- package/dist/scripts/status.js +280 -0
- package/dist/scripts/status.js.map +10 -0
- package/package.json +22 -8
- package/src/built-in-reviews/code-quality.md +0 -25
- package/src/built-in-reviews/index.ts +0 -28
- package/src/bun-plugins.d.ts +0 -4
- package/src/cli-adapters/claude.ts +0 -327
- package/src/cli-adapters/codex.ts +0 -290
- package/src/cli-adapters/cursor.ts +0 -128
- package/src/cli-adapters/gemini.ts +0 -510
- package/src/cli-adapters/github-copilot.ts +0 -141
- package/src/cli-adapters/index.ts +0 -250
- package/src/cli-adapters/thinking-budget.ts +0 -23
- package/src/commands/check.ts +0 -311
- package/src/commands/ci/index.ts +0 -15
- package/src/commands/ci/init.ts +0 -96
- package/src/commands/ci/list-jobs.ts +0 -90
- package/src/commands/clean.ts +0 -54
- package/src/commands/detect.ts +0 -173
- package/src/commands/health.ts +0 -169
- package/src/commands/help.ts +0 -34
- package/src/commands/index.ts +0 -13
- package/src/commands/init.ts +0 -1878
- package/src/commands/list.ts +0 -33
- package/src/commands/review.ts +0 -311
- package/src/commands/run.ts +0 -29
- package/src/commands/shared.ts +0 -267
- package/src/commands/stop-hook.ts +0 -567
- package/src/commands/validate.ts +0 -20
- package/src/commands/wait-ci.ts +0 -518
- package/src/config/ci-loader.ts +0 -33
- package/src/config/ci-schema.ts +0 -28
- package/src/config/global.ts +0 -87
- package/src/config/loader.ts +0 -301
- package/src/config/schema.ts +0 -165
- package/src/config/stop-hook-config.ts +0 -130
- package/src/config/types.ts +0 -65
- package/src/config/validator.ts +0 -592
- package/src/core/change-detector.ts +0 -137
- package/src/core/diff-stats.ts +0 -442
- package/src/core/entry-point.ts +0 -190
- package/src/core/job.ts +0 -96
- package/src/core/run-executor.ts +0 -621
- package/src/core/runner.ts +0 -290
- package/src/gates/check.ts +0 -118
- package/src/gates/resolve-check-command.ts +0 -21
- package/src/gates/result.ts +0 -54
- package/src/gates/review.ts +0 -1333
- package/src/hooks/adapters/claude-stop-hook.ts +0 -99
- package/src/hooks/adapters/cursor-stop-hook.ts +0 -122
- package/src/hooks/adapters/types.ts +0 -94
- package/src/hooks/stop-hook-handler.ts +0 -748
- package/src/index.ts +0 -47
- package/src/output/app-logger.ts +0 -214
- package/src/output/console-log.ts +0 -168
- package/src/output/console.ts +0 -359
- package/src/output/logger.ts +0 -126
- package/src/output/sinks/console-sink.ts +0 -59
- package/src/output/sinks/file-sink.ts +0 -110
- package/src/scripts/status.ts +0 -433
- package/src/templates/workflow.yml +0 -79
- package/src/types/gauntlet-status.ts +0 -79
- package/src/utils/debug-log.ts +0 -392
- package/src/utils/diff-parser.ts +0 -103
- package/src/utils/execution-state.ts +0 -472
- package/src/utils/log-parser.ts +0 -696
- package/src/utils/sanitizer.ts +0 -3
- package/src/utils/session-ref.ts +0 -91
package/README.md
CHANGED
@@ -11,30 +11,17 @@
 
 Agent Gauntlet is a configurable “feedback loop” runner for AI-assisted development workflows.
 
-You configure which paths in your repo should trigger which validations — shell commands like tests and linters, plus AI-powered code reviews. When files change, Gauntlet automatically runs the relevant validations and reports results.
+You configure which paths in your repo should trigger which validations — shell commands like tests and linters, plus AI-powered local code reviews. When files change, Gauntlet automatically runs the relevant validations and reports results.
 
 For AI reviews, it uses the CLI tool of your choice: Gemini, Codex, Claude Code, GitHub Copilot, or Cursor.
 
 ## Features
 
 - **Agent validation loop**: Keep your coding agent on track with automated feedback loops. Detect problems — deterministically and/or non-deterministically — and let your agent fix and Gauntlet verify.
-- **
+- **Local cross-agent code reviews**: Enable one AI agent to automatically request code reviews from another. For example, if Claude made changes, Gauntlet can request a review from Codex — spreading token usage across your subscriptions instead of burning through one.
+- Multiple AI review adapters have been evaluated for quality and efficiency. Claude and Codex deliver optimal review quality with superior token efficiency. For detailed metrics, see [Eval Results](docs/eval-results.md).
 - **Leverage existing subscriptions**: Agent Gauntlet is *free* and tool-agnostic, leveraging the AI CLI tools you already have installed.
-- **Easy CI setup**: Define your
-
-### vs AI Code Review Tools
-
-Unlike traditional code review tools designed for PR workflows, Agent Gauntlet provides real-time feedback loops for autonomous coding agents.
-
-| Use Case | Recommended |
-| :--- | :--- |
-| Autonomous agent development | **Agent Gauntlet** |
-| Traditional PR review with human reviewers | Other tools |
-| IDE-integrated review while coding | Other tools |
-| Enterprise with strict compliance requirements | Other tools |
-| Budget-conscious teams with existing AI CLI tools | **Agent Gauntlet** |
-
-[Full comparison →](docs/feature_comparison.md)
+- **Easy CI setup**: Define your CI gates once, run them locally and in GitHub.
 
 ## Common Workflows
 
@@ -42,15 +29,30 @@ Agent Gauntlet supports three workflows, ranging from simple CLI execution to fu
 
 - **CLI Mode** — Run checks via command line; ideal for CI pipelines and scripts.
 - **Assistant Mode** — AI assistant runs validation loop, fixing issues iteratively.
-- **Agentic Mode** — Autonomous agent validates and fixes in real-time via stop hook.
+- **Agentic Mode** — Autonomous agent validates and fixes in real-time via stop hook (experimental).
 
 
 
-
+### Example Workflow
+
+1. Claude implements a feature
+2. Agent Gauntlet reports quality issues detected by static code analysis and Codex reviewer agent
+3. Claude fixes issues
+4. Agent Gauntlet verifies
+
+### Comparison vs Other Tools
 
-
-
-
+### AI Code Review Tools
+
+Agent Gauntlet is not a replacement for tools that provide AI pull request code reviews. It provides real-time feedback loops for autonomous coding agents, combining deterministic static checks (build, lint, test) with multi-agent AI reviews in a single pipeline. This enables agents to iterate and self-correct until all checks and reviews pass, without human intervention.
+
+[Full comparison →](docs/feature_comparison.md)
+
+### Spec-Driven Workflow Tools
+
+It is recommended to use Agent Gauntlet in conjunction with other spec-driven development tools. We believe is the ideal implementation of the validation step in any Spec → Implement → Validate workflow.
+
+## Quick Start
 
 For basic usage and configuration guide, see the [Quick Start Guide](docs/quick-start.md).
 
@@ -59,7 +61,7 @@ For basic usage and configuration guide, see the [Quick Start Guide](docs/quick-
 - [Quick Start Guide](docs/quick-start.md) — installation, basic usage, and config layout
 - [User Guide](docs/user-guide.md) — full usage details
 - [Configuration Reference](docs/config-reference.md) — all configuration fields + defaults
-- [Stop Hook Guide](docs/stop-hook-guide.md) — integrate with Claude Code's stop hook
+- [Stop Hook Guide](docs/stop-hook-guide.md) — integrate with Claude Code's stop hook (experimental).
 - [CLI Invocation Details](docs/cli-invocation-details.md) — how we securely invoke AI CLIs
 - [Feature Comparison](docs/feature_comparison.md) — how Agent Gauntlet compares to other tools
 - [Development Guide](docs/development.md) — how to build and develop this project