npm - agent-gauntlet - Versions diffs - 0.15.3 → 0.15.4 - Mend

agent-gauntlet 0.15.3 → 0.15.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (4) hide show

package/dist/index.js +130 -84
package/dist/index.js.map +9 -8
package/package.json +1 -1
package/skills/gauntlet-run/SKILL.md +2 -2

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "agent-gauntlet",
-  "version": "0.15.3",
+  "version": "0.15.4",
   "description": "A CLI tool for testing AI coding agents",
   "license": "MIT",
   "author": "Paul Caplan",

package/skills/gauntlet-run/SKILL.md CHANGED Viewed

@@ -35,7 +35,7 @@ Execute the autonomous verification suite.
 ## Procedure
 1. Run `agent-gauntlet clean` to archive any previous log files.
-2. Run `agent-gauntlet run` using `Bash` with `timeout: 300000`. Wait for the complete output. **Verify you can see a `Status:` line in the output before continuing.**
+2. Run `agent-gauntlet run 2>&1` using `Bash` with `timeout: 300000`. Wait for the complete output. **Verify you can see a `Status:` line in the output before continuing.**
 3. **Check the status line:**
    - `Status: Passed` → Go to step 8.
    - `Status: Passed with warnings` → Go to step 8.
@@ -59,7 +59,7 @@ Execute the autonomous verification suite.
      a. **Task tool** (Claude Code): `Task` with `subagent_type="general-purpose"`, `model="haiku"`, `prompt=` update-prompt content + log directory + decisions list. NEVER use `run_in_background: true`.
      b. **Subagent delegation**: Delegate the update-prompt instructions with the log directory and decisions to a subagent.
      c. **Inline fallback**: Follow the update-prompt instructions yourself to update the review JSON files.
-7. **Re-run verification:** Run `agent-gauntlet run` again with `Bash` and `timeout: 300000`. Do NOT run `agent-gauntlet clean` between retries. The tool detects existing logs and automatically switches to verification mode. **Go back to step 3** to check the status line and repeat.
+7. **Re-run verification:** Run `agent-gauntlet run 2>&1` again with `Bash` and `timeout: 300000`. Do NOT run `agent-gauntlet clean` between retries. The tool detects existing logs and automatically switches to verification mode. **Go back to step 3** to check the status line and repeat.
 8. **Provide a summary** of the session:
    - Final Status: (Passed / Passed with warnings / Retry limit exceeded)
    - Issues Fixed: (list key fixes)