npm - @covibes/zeroshot - Versions diffs - 5.2.0 → 5.3.0 - Mend

@covibes/zeroshot 5.2.0 → 5.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (62) hide show

package/CHANGELOG.md +178 -186
package/README.md +199 -248
package/cli/commands/providers.js +150 -0
package/cli/index.js +214 -58
package/cli/lib/first-run.js +40 -3
package/cluster-templates/base-templates/debug-workflow.json +24 -78
package/cluster-templates/base-templates/full-workflow.json +44 -145
package/cluster-templates/base-templates/single-worker.json +23 -15
package/cluster-templates/base-templates/worker-validator.json +47 -34
package/cluster-templates/conductor-bootstrap.json +7 -5
package/lib/docker-config.js +6 -1
package/lib/provider-detection.js +59 -0
package/lib/provider-names.js +56 -0
package/lib/settings.js +191 -6
package/lib/stream-json-parser.js +4 -238
package/package.json +21 -5
package/scripts/validate-templates.js +100 -0
package/src/agent/agent-config.js +37 -13
package/src/agent/agent-context-builder.js +64 -2
package/src/agent/agent-hook-executor.js +82 -9
package/src/agent/agent-lifecycle.js +53 -14
package/src/agent/agent-task-executor.js +196 -194
package/src/agent/output-extraction.js +200 -0
package/src/agent/output-reformatter.js +175 -0
package/src/agent/schema-utils.js +111 -0
package/src/agent-wrapper.js +102 -30
package/src/agents/git-pusher-agent.json +2 -2
package/src/claude-task-runner.js +80 -30
package/src/config-router.js +13 -13
package/src/config-validator.js +231 -10
package/src/github.js +36 -0
package/src/isolation-manager.js +243 -154
package/src/ledger.js +28 -6
package/src/orchestrator.js +391 -96
package/src/preflight.js +85 -82
package/src/providers/anthropic/cli-builder.js +45 -0
package/src/providers/anthropic/index.js +134 -0
package/src/providers/anthropic/models.js +23 -0
package/src/providers/anthropic/output-parser.js +159 -0
package/src/providers/base-provider.js +181 -0
package/src/providers/capabilities.js +51 -0
package/src/providers/google/cli-builder.js +55 -0
package/src/providers/google/index.js +116 -0
package/src/providers/google/models.js +24 -0
package/src/providers/google/output-parser.js +92 -0
package/src/providers/index.js +75 -0
package/src/providers/openai/cli-builder.js +122 -0
package/src/providers/openai/index.js +135 -0
package/src/providers/openai/models.js +21 -0
package/src/providers/openai/output-parser.js +129 -0
package/src/sub-cluster-wrapper.js +18 -3
package/src/task-runner.js +8 -6
package/src/tui/layout.js +20 -3
package/task-lib/attachable-watcher.js +80 -78
package/task-lib/claude-recovery.js +119 -0
package/task-lib/commands/list.js +1 -1
package/task-lib/commands/resume.js +3 -2
package/task-lib/commands/run.js +12 -3
package/task-lib/runner.js +59 -38
package/task-lib/scheduler.js +2 -2
package/task-lib/store.js +43 -30
package/task-lib/watcher.js +81 -62

package/README.md CHANGED Viewed

@@ -1,265 +1,229 @@
 # zeroshot CLI
+> **🎉 New Release:** Now supports **Codex** and **Gemini** CLI in addition to Claude! Use any provider or mix them in multi-agent workflows. See [Providers](#providers) for details.
+<!-- install-placeholder -->
+<p align="center">
+  <code>npm install -g @covibes/zeroshot</code>
+</p>
+<p align="center">
+  <img src="./docs/assets/zeroshot-demo.gif" alt="Demo" width="700">
+  <br>
+  <em>Demo (100x speed, 90-minute run, 5 iterations to approval)</em>
+</p>
 [![CI](https://github.com/covibes/zeroshot/actions/workflows/ci.yml/badge.svg)](https://github.com/covibes/zeroshot/actions/workflows/ci.yml)
 [![npm version](https://img.shields.io/npm/v/@covibes/zeroshot.svg)](https://www.npmjs.com/package/@covibes/zeroshot)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
 [![Node 18+](https://img.shields.io/badge/node-18%2B-brightgreen.svg)](https://nodejs.org/)
-[![Platform: Linux | macOS](https://img.shields.io/badge/platform-Linux%20%7C%20macOS-blue.svg)]()
-> **2024** was the year of LLMs. **2025** was the year of agents. **2026** is the year of agent clusters.
-**Autonomous engineering teams for Claude Code.**
+![Platform: Linux | macOS](https://img.shields.io/badge/platform-Linux%20%7C%20macOS-blue.svg)
-## Install
+<!-- discord-placeholder -->
-**Platforms**: Linux, macOS
+[![Discord](https://img.shields.io/badge/Discord-Join-5865F2?logo=discord&logoColor=white)](https://discord.gg/PdZ3UEXB)
-```bash
-npm install -g @covibes/zeroshot
-```
+Zeroshot is an open-source AI coding agent orchestration CLI that runs multi-agent workflows to autonomously implement, review, test, and verify code changes.
-**Requires**: Node 18+, [Claude Code CLI](https://claude.com/product/claude-code), [GitHub CLI](https://cli.github.com/)
+It runs a **planner**, an **implementer**, and independent **validators** in isolated environments, looping until changes are **verified** or **rejected** with actionable, reproducible failures.
-```bash
-npm i -g @anthropic-ai/claude-code && claude auth login
-gh auth login
-```
+Built for tasks where correctness matters more than speed.
----
-You know the problem. Your AI agent:
-- Says "tests pass" (never ran them)
-- Says "done!" (nothing works)
-- Implements 60% of what you asked
-- Ignores your coding guidelines
-- Introduces antipatterns like a junior dev
-- Gets sloppy on long tasks
-**AI is extremely capable. But not when one agent does everything in one session.**
+## How It Works
-Context degrades. Attention drifts. Shortcuts get taken.
+- Plan: translate a task into concrete acceptance criteria
+- Implement: make changes in an isolated workspace (local, worktree, or Docker)
+- Validate: run automated checks with independent validators
+- Iterate: repeat until verified, or return actionable failures
+- Resume: crash-safe state persisted for recovery
-Zeroshot fixes this with **multiple isolated agents** that check each other's work. The validator didn't write the code, so it can't lie about tests. Fail? Fix and retry until it works.
+## Quick Start
 ```bash
-zeroshot 123
+zeroshot run 123                    # GitHub issue number
 ```
-Point at a GitHub issue, walk away, come back to working code.
-### Demo
+Or describe the task inline:
 ```bash
-zeroshot "Add optimistic locking with automatic retry: when updating a user,
-detect if another request modified it first using version numbers,
-automatically retry with exponential backoff up to 3 times,
-merge non-conflicting field changes, surface true conflicts to the caller
-with details of what conflicted. Handle the ABA problem where version goes A->B->A."
+zeroshot run "Add optimistic locking with automatic retry: when updating a user,
+retry with exponential backoff up to 3 times, merge non-conflicting field changes,
+and surface conflicts with details. Handle the ABA problem where version goes A->B->A."
 ```
-<p align="center">
-  <img src="./docs/assets/zeroshot-demo.gif" alt="Demo" width="700">
-  <br>
-  <em>Sped up 100x — 90 minutes, 5 iterations until validators approved</em>
-</p>
-**The full fix cycle.** Initial implementation passed basic tests but validators caught edge cases: race conditions in concurrent updates, ABA problem not fully handled, retry backoff timing issues. Each rejection triggered fixes until all 48 tests passed with 91%+ coverage.
+## Why Not Just Use a Single AI Agent?
-A single agent would say "done!" after the first implementation. Here, the adversarial tester actually *runs* concurrent requests, times the retry backoff, and verifies conflict detection works under load.
-**This is what production-grade looks like.** Not "tests pass" — validators reject until it actually works. 5 iterations, each one fixing real bugs the previous attempt missed.
----
-## When to Use Zeroshot
+| Approach                   | Writes Code | Runs Tests | Blind Validation | Iterates Until Verified |
+| -------------------------- | ----------- | ---------- | ---------------- | ----------------------- |
+| Chat-based assistant       | ✅          | ⚠️         | ❌               | ❌                      |
+| Single coding agent        | ✅          | ⚠️         | ❌               | ⚠️                      |
+| **Zeroshot (multi-agent)** | ✅          | ✅         | ✅               | ✅                      |
-**Zeroshot requires well-defined tasks with clear acceptance criteria.**
+## Use Cases
-| Scenario | Use? | Why |
-|----------|:----:|-----|
-| Add rate limiting (sliding window, per-IP, 429) | ✅ | Clear requirements |
-| Refactor auth to JWT | ✅ | Defined end state |
-| Fix login bug | ✅ | Success is measurable |
-| Fix 2410 lint violations | ✅ | Clear completion criteria |
-| Make the app faster | ❌ | Needs exploration first |
-| Improve the codebase | ❌ | No acceptance criteria |
-| Figure out flaky tests | ❌ | Exploratory |
+- Autonomous AI code refactoring
+- AI-powered pull request automation
+- Automated bug fixing with validation
+- Multi-agent code generation for software engineering
+- Agentic coding workflows with blind validation
-**Known unknowns** (implementation details unclear) → Zeroshot handles this. The planner figures it out.
+## Who Is This For?
-**Unknown unknowns** (don't know what you'll discover) → Use single-agent Claude Code for exploration first, then come back with a well-defined task.
+- Senior engineers who care about correctness and reproducibility
+- Teams automating PR workflows and code review gates
+- Infra/platform teams standardizing agentic workflows
+- Open-source maintainers working through issue backlogs
+- AI power users who want verification, not vibes
-**Long-running batch tasks** → Zeroshot excels here. Run overnight with `-d` (daemon mode):
-- "Fix all 2410 semantic linting violations"
-- "Add TypeScript types to all 47 untyped files"
-- "Migrate all API calls from v1 to v2"
+## Install and Requirements
-Crash recovery (`zeroshot resume`) means multi-hour tasks survive interruptions.
+**Platforms**: Linux, macOS (Windows WSL not yet supported)
-**Rule of thumb:** If you can't describe what "done" looks like, zeroshot's validators can't verify it.
----
+```bash
+npm install -g @covibes/zeroshot
+```
-## Commands
+**Requires**: Node 18+, at least one provider CLI (Claude Code, Codex, Gemini). [GitHub CLI](https://cli.github.com/) is required when running by issue number.
 ```bash
-zeroshot run 123               # Run on GitHub issue
-zeroshot run "Add dark mode"   # Run from description
+# Install one or more providers
+npm i -g @anthropic-ai/claude-code
+npm i -g @openai/codex
+npm i -g @google/gemini-cli
-# Automation levels (cascading: --ship → --pr → --worktree)
-zeroshot run 123 --docker      # Docker isolation (full container)
-zeroshot run 123 --worktree    # Git worktree isolation (lightweight)
-zeroshot run 123 --pr          # Worktree + PR (human reviews)
-zeroshot run 123 --ship        # Worktree + PR + auto-merge (full automation)
+# Authenticate with the provider CLI
+claude login        # Claude
+codex login         # Codex
+gemini auth login   # Gemini
-# Background mode
-zeroshot run 123 -d            # Detached/daemon
-zeroshot run 123 --ship -d     # Full automation, background
+# GitHub auth (for issue numbers)
+gh auth login
+```
-# Control
-zeroshot list                  # See all running (--json for scripting)
-zeroshot status <id>           # Cluster status (--json for scripting)
-zeroshot logs <id> -f          # Follow output
-zeroshot resume <id>           # Continue after crash
-zeroshot kill <id>             # Stop
-zeroshot watch                 # TUI dashboard
+## Providers
-# Agent library
-zeroshot agents list           # View available agents
-zeroshot agents show <name>    # Agent details
+Zeroshot shells out to provider CLIs. Pick a default and override per run:
-# Maintenance
-zeroshot clean                 # Remove old records
-zeroshot purge                 # NUCLEAR: kill all + delete all
+```bash
+zeroshot providers
+zeroshot providers set-default codex
+zeroshot run 123 --provider gemini
 ```
----
-<details>
-<summary><strong>FAQ</strong></summary>
+See `docs/providers.md` for setup, model levels, and Docker mounts.
-**Q: Why Claude-only (for now)?**
+## Why Multiple Agents?
-Claude Code is the most capable agentic coding tool available. We wrap it directly - same tools, same reliability, no custom implementations to break.
+Single-agent sessions degrade. Context gets buried under thousands of tokens. The model optimizes for "done" over "correct."
-Multi-model support (Codex CLI, Gemini CLI) is planned - see [#19](https://github.com/covibes/zeroshot/issues/19).
+Zeroshot fixes this with isolated agents that check each other's work. Validators can't lie about code they didn't write. Fail the check? Fix and retry until it actually works.
-**Q: Why do single-agent coding sessions get sloppy?**
+## What Makes It Different
-Three failure modes compound when one agent does everything in one session:
+- **Blind validation** - Validators never see the worker's context or code history
+- **Repeatable workflows** - Task complexity determines agent count and model selection
+- **Accept/reject loop** - Rejections include actionable findings, not vague complaints
+- **Crash recovery** - All state persisted to SQLite; resume anytime
+- **Isolation modes** - None, git worktree, or Docker container
+- **Cost control** - Model ceilings prevent runaway API spend
-- **Context Dilution**: Your initial guidelines compete with thousands of tokens of code, errors, and edits. Instructions from 50 messages ago get buried.
-- **Success Bias**: LLMs optimize for "Task Complete" - even if that means skipping steps to get there.
-- **Error Snowball**: When fixing mistakes repeatedly, the context fills with broken code. The model starts copying its own bad patterns.
+## When to Use Zeroshot
-Zeroshot fixes this with **isolated agents** where validators check work they didn't write - no self-grading, no shortcuts.
+Zeroshot performs best when tasks have clear acceptance criteria.
-**Q: Can I customize the team?**
+| Scenario                                        | Use | Why                       |
+| ----------------------------------------------- | --- | ------------------------- |
+| Add rate limiting (sliding window, per-IP, 429) | Yes | Clear requirements        |
+| Refactor auth to JWT                            | Yes | Defined end state         |
+| Fix login bug                                   | Yes | Success is measurable     |
+| Fix 2410 lint violations                        | Yes | Clear completion criteria |
+| Make the app faster                             | No  | Needs exploration first   |
+| Improve the codebase                            | No  | No acceptance criteria    |
+| Figure out flaky tests                          | No  | Exploratory               |
-Yes, see CLAUDE.md. But most people never need to.
+Rule of thumb: if you cannot describe what "done" means, validators cannot verify it.
-**Q: Why is it called "zeroshot"?**
+## Command Overview
-In machine learning, "zero-shot" means solving tasks the model has never seen before - using only the task description, no prior examples needed.
+```bash
+# Run
+zeroshot run 123
+zeroshot run "Add dark mode"
-Same idea here: give zeroshot a well-defined task, get back a result. No examples. No iterative feedback. No hand-holding.
+# Isolation
+zeroshot run 123 --worktree       # git worktree
+zeroshot run 123 --docker         # container
-The multi-agent architecture handles planning, implementation, and validation internally. You provide a clear problem statement. Zeroshot handles the rest.
+# Automation (--ship implies --pr implies --worktree)
+zeroshot run 123 --pr             # worktree + create PR
+zeroshot run 123 --ship           # PR + auto-merge on approval
-</details>
+# Background mode
+zeroshot run 123 -d
+zeroshot run 123 --ship -d
----
+# Control
+zeroshot list
+zeroshot status <id>
+zeroshot logs <id> -f
+zeroshot resume <id>
+zeroshot stop <id>
+zeroshot kill <id>
+zeroshot watch
+# Providers
+zeroshot providers
+zeroshot providers set-default codex
-## How It Works
+# Agent library
+zeroshot agents list
+zeroshot agents show <name>
-Zeroshot is a **multi-agent coordination framework** with smart defaults.
+# Maintenance
+zeroshot clean
+zeroshot purge
+```
-### Zero Config
+## Architecture
-```bash
-zeroshot 123  # Analyzes task → picks team → done
-```
+Zeroshot is a message-driven coordination layer with smart defaults.
-The conductor classifies your task (complexity × type) and picks the right workflow:
+- The conductor classifies tasks by complexity and type.
+- A workflow template selects agents and validators.
+- Agents publish results to a SQLite ledger.
+- Validators approve or reject with specific findings.
+- Rejections route back to the worker for fixes.
-```
-                                ┌─────────────────┐
-                                │      TASK       │
-                                └────────┬────────┘
-                                         │
-                                         ▼
-                ┌────────────────────────────────────────────┐
-                │                 CONDUCTOR                  │
-                │     Complexity × TaskType → Workflow       │
-                └────────────────────────┬───────────────────┘
-                                         │
-           ┌─────────────────────────────┼─────────────────────────────┐
-           │                             │                             │
-           ▼                             ▼                             ▼
-     ┌───────────┐                ┌───────────┐                ┌───────────┐
-     │  TRIVIAL  │                │  SIMPLE   │                │ STANDARD+ │
-     │  1 agent  │──────────▶     │  worker   │                │ planner   │
-     │  (haiku)  │  COMPLETE      │ + 1 valid.│                │ + worker  │
-     │ no valid. │                └─────┬─────┘                │ + 3-5 val.│
-     └───────────┘                      │                      └─────┬─────┘
-                                        ▼                            │
-                                 ┌─────────────┐                     ▼
-                             ┌──▶│   WORKER    │             ┌─────────────┐
-                             │   └──────┬──────┘             │   PLANNER   │
-                             │          │                    └──────┬──────┘
-                             │          ▼                           │
-                             │   ┌─────────────────────┐            ▼
-                             │   │ ✓ validator         │     ┌─────────────┐
-                             │   │   (generic check)   │ ┌──▶│   WORKER    │
-                             │   └──────────┬──────────┘ │   └──────┬──────┘
-                             │       REJECT │ ALL OK     │          │
-                             └──────────────┘     │      │          ▼
-                                                  │      │   ┌──────────────────────┐
-                                                  │      │   │ ✓ requirements       │
-                                                  │      │   │ ✓ code (STANDARD+)   │
-                                                  │      │   │ ✓ security (CRIT)    │
-                                                  │      │   │ ✓ tester (CRIT)      │
-                                                  │      │   │ ✓ adversarial        │
-                                                  │      │   │   (real execution)   │
-                                                  │      │   └──────────┬───────────┘
-                                                  │      │       REJECT │ ALL OK
-                                                  │      └──────────────┘     │
-                                                  ▼                           ▼
-     ┌─────────────────────────────────────────────────────────────────────────────┐
-     │                                COMPLETE                                     │
-     └─────────────────────────────────────────────────────────────────────────────┘
-```
+### Complexity Model
 | Task                   | Complexity | Agents | Validators                                        |
 | ---------------------- | ---------- | ------ | ------------------------------------------------- |
 | Fix typo in README     | TRIVIAL    | 1      | None                                              |
-| Add dark mode toggle   | SIMPLE     | 2      | generic validator                                 |
-| Refactor auth system   | STANDARD   | 4      | requirements, code                                |
-| Implement payment flow | CRITICAL   | 7      | requirements, code, security, tester, adversarial |
+| Add dark mode toggle   | SIMPLE     | 2      | Generic validator                                 |
+| Refactor auth system   | STANDARD   | 4      | Requirements, code                                |
+| Implement payment flow | CRITICAL   | 7      | Requirements, code, security, tester, adversarial |
 ### Model Selection by Complexity
 | Complexity | Planner | Worker | Validators |
 | ---------- | ------- | ------ | ---------- |
-| TRIVIAL    | -       | haiku  | 0          |
-| SIMPLE     | -       | sonnet | 1 (sonnet) |
-| STANDARD   | sonnet  | sonnet | 2 (sonnet) |
-| CRITICAL   | opus    | sonnet | 5 (sonnet) |
+| TRIVIAL    | -       | level1 | -          |
+| SIMPLE     | -       | level2 | 1 (level2) |
+| STANDARD   | level2  | level2 | 2 (level2) |
+| CRITICAL   | level3  | level2 | 5 (level2) |
-Set model ceiling: `zeroshot settings set maxModel sonnet` (prevents opus)
----
+Levels map to provider-specific models. Configure with `zeroshot providers setup <provider>` or
+`settings.providerSettings`. (Legacy `maxModel` applies to Claude only.)
 <details>
 <summary><strong>Custom Workflows (Framework Mode)</strong></summary>
-Zeroshot is **message-driven** - define any agent topology:
+Zeroshot is message-driven, so you can define any agent topology.
-- **Expert panels**: Parallel specialists → aggregator → decision
-- **Staged gates**: Sequential validators, each with veto power
-- **Hierarchical**: Supervisor dynamically spawns workers
-- **Dynamic**: Conductor adds agents mid-execution
+- Expert panels: parallel specialists -> aggregator -> decision
+- Staged gates: sequential validators, each with veto power
+- Hierarchical: supervisor dynamically spawns workers
+- Dynamic: conductor adds agents mid-execution
 **Coordination primitives:**
@@ -268,20 +232,14 @@ Zeroshot is **message-driven** - define any agent topology:
 - Ledger (SQLite, crash recovery)
 - Dynamic spawning (CLUSTER_OPERATIONS)
-#### Creating Custom Clusters with Claude Code
+#### Creating Custom Clusters with a Provider CLI
-**The easiest way to create a custom cluster: just ask Claude Code.**
+Start your provider CLI and describe your cluster:
-```bash
-# In your zeroshot repo
-claude
-```
-**Example prompt:**
 ```
 Create a zeroshot cluster config for security-critical features:
-1. Implementation agent (sonnet) implements the feature
+1. Implementation agent (level2) implements the feature
 2. FOUR parallel validators:
    - Security validator: OWASP checks, SQL injection, XSS, CSRF
    - Performance validator: No N+1 queries, proper indexing
@@ -290,76 +248,64 @@ Create a zeroshot cluster config for security-critical features:
 3. ALL validators must approve before merge
 4. If ANY validator rejects, implementation agent fixes and resubmits
-5. Use opus for security validator (highest stakes)
+5. Use level3 for security validator (highest stakes)
 Look at cluster-templates/base-templates/full-workflow.json
 and create a similar cluster. Save to cluster-templates/security-review.json
 ```
-Claude Code will read existing templates, create valid JSON config, and iterate until it works.
+Built-in validation checks for missing triggers, deadlocks, and invalid type wiring before running.
-**Built-in validation catches failures before running:**
-- Never start (no bootstrap trigger)
-- Never complete (no path to completion)
-- Loop infinitely (circular dependencies)
-- Deadlock (impossible consensus)
-- Type mismatches (boolean → string in JSON)
-See [CLAUDE.md](./CLAUDE.md) for cluster config schema and examples.
+See [CLAUDE.md](./CLAUDE.md) for the cluster schema and examples.
 </details>
----
 ## Crash Recovery
-Everything saves to SQLite. If your 2-hour run crashes at 1:59:
+All state is persisted in the SQLite ledger. You can resume at any time:
 ```bash
 zeroshot resume cluster-bold-panther
-# Continues from exact point
 ```
----
 ## Isolation Modes
 ### Git Worktree (Default for --pr/--ship)
 ```bash
-zeroshot 123 --worktree
+zeroshot run 123 --worktree
 ```
-Lightweight isolation using git worktree. Creates a separate working directory with its own branch. Fast (<1s setup), no Docker required. Auto-enabled with `--pr` and `--ship`.
+Lightweight isolation using git worktree. Creates a separate working directory with its own branch. Auto-enabled with `--pr` and `--ship`.
 ### Docker Container
 ```bash
-zeroshot 123 --docker
+zeroshot run 123 --docker
 ```
-Full isolation in a fresh container. Your workspace stays untouched. Good for risky experiments or parallel agents.
+Full isolation in a fresh container. Your workspace stays untouched. Useful for risky experiments or parallel runs.
 ### When to Use Which
-| Scenario | Recommended |
-| -------- | ----------- |
-| Quick task, review changes yourself | No isolation (default) |
-| PR workflow, code review | `--worktree` or `--pr` |
-| Risky experiment, might break things | `--docker` |
-| Running multiple tasks in parallel | `--docker` |
-| Full automation, no review needed | `--ship` |
+| Scenario                             | Recommended            |
+| ------------------------------------ | ---------------------- |
+| Quick task, review changes yourself  | No isolation (default) |
+| PR workflow, code review             | `--worktree` or `--pr` |
+| Risky experiment, might break things | `--docker`             |
+| Running multiple tasks in parallel   | `--docker`             |
+| Full automation, no review needed    | `--ship`               |
-**Default mode:** Agents are instructed to only modify files (no git commit/push). You review and commit yourself.
+**Default behavior:** Agents modify files only; they do not commit or push unless using an isolation mode that explicitly allows it.
 <details>
 <summary><strong>Docker Credential Mounts</strong></summary>
-When using `--docker`, zeroshot mounts credential directories so Claude can access tools like AWS, Azure, kubectl.
+When using `--docker`, zeroshot mounts credential directories so agents can access provider CLIs and tools like AWS, Azure, and kubectl.
 **Default mounts**: `gh`, `git`, `ssh` (GitHub CLI, git config, SSH keys)
-**Available presets**: `gh`, `git`, `ssh`, `aws`, `azure`, `kube`, `terraform`, `gcloud`
+**Available presets**: `gh`, `git`, `ssh`, `aws`, `azure`, `kube`, `terraform`, `gcloud`, `claude`, `codex`, `gemini`
 ```bash
 # Configure via settings (persistent)
@@ -371,6 +317,10 @@ zeroshot settings get dockerMounts
 # Per-run override
 zeroshot run 123 --docker --mount ~/.aws:/root/.aws:ro
+# Provider credentials
+zeroshot run 123 --docker --mount ~/.config/codex:/home/node/.config/codex:ro
+zeroshot run 123 --docker --mount ~/.config/gemini:/home/node/.config/gemini:ro
 # Disable all mounts
 zeroshot run 123 --docker --no-mounts
@@ -378,7 +328,10 @@ zeroshot run 123 --docker --no-mounts
 ZEROSHOT_DOCKER_MOUNTS='["aws","azure"]' zeroshot run 123 --docker
 ```
+See `docs/providers.md` for provider CLI setup and mount details.
 **Custom mounts** (mix presets with explicit paths):
 ```bash
 zeroshot settings set dockerMounts '[
   "gh",
@@ -388,57 +341,55 @@ zeroshot settings set dockerMounts '[
 ```
 **Container home**: Presets use `$HOME` placeholder. Default: `/root`. Override with:
 ```bash
 zeroshot settings set dockerContainerHome '/home/node'
 # Or per-run:
 zeroshot run 123 --docker --container-home /home/node
 ```
-**Env var passthrough**: Presets auto-pass related env vars (e.g., `aws` → `AWS_REGION`, `AWS_PROFILE`). Add custom:
+**Env var passthrough**: Presets auto-pass related env vars (for example, `aws` -> `AWS_REGION`, `AWS_PROFILE`). Add custom:
 ```bash
 zeroshot settings set dockerEnvPassthrough '["MY_API_KEY", "TF_VAR_*"]'
 ```
 </details>
----
-## More
-- **Debug**: `sqlite3 ~/.zeroshot/cluster-abc.db "SELECT * FROM messages;"`
-- **Export**: `zeroshot export <id> --format markdown`
-- **Architecture**: See [CLAUDE.md](./CLAUDE.md)
+## Resources
----
+- [CLAUDE.md](./CLAUDE.md) - Architecture, cluster config schema, agent primitives
+- `docs/providers.md` - Provider setup, model levels, and Docker mounts
+- [Discord](https://discord.gg/PdZ3UEXB) - Support and community
+- `zeroshot export <id>` - Export conversation to markdown
+- `sqlite3 ~/.zeroshot/*.db` - Direct ledger access for debugging
 <details>
 <summary><strong>Troubleshooting</strong></summary>
-| Issue                         | Fix                                                                  |
-| ----------------------------- | -------------------------------------------------------------------- |
-| `claude: command not found`   | `npm i -g @anthropic-ai/claude-code && claude auth login`            |
-| `gh: command not found`       | [Install GitHub CLI](https://cli.github.com/)                        |
-| `--docker` fails              | Docker must be running: `docker ps` to verify                        |
-| Cluster stuck                 | `zeroshot resume <id>` to continue with guidance                     |
-| Agent keeps failing           | Check `zeroshot logs <id>` for actual error                          |
-| `zeroshot: command not found` | `npm install -g @covibes/zeroshot`                                   |
+| Issue                         | Fix                                                       |
+| ----------------------------- | --------------------------------------------------------- |
+| `claude: command not found`   | `npm i -g @anthropic-ai/claude-code && claude auth login` |
+| `codex: command not found`    | `npm i -g @openai/codex && codex login`                   |
+| `gemini: command not found`   | `npm i -g @google/gemini-cli && gemini auth login`        |
+| `gh: command not found`       | [Install GitHub CLI](https://cli.github.com/)             |
+| `--docker` fails              | Docker must be running: `docker ps` to verify             |
+| Cluster stuck                 | `zeroshot resume <id>` to continue                        |
+| Agent keeps failing           | Check `zeroshot logs <id>` for actual error               |
+| `zeroshot: command not found` | `npm install -g @covibes/zeroshot`                        |
 </details>
----
 ## Contributing
 See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup and guidelines.
-Please read our [Code of Conduct](CODE_OF_CONDUCT.md) before participating.
+Please read [CODE_OF_CONDUCT.md](CODE_OF_CONDUCT.md) before participating.
 For security issues, see [SECURITY.md](SECURITY.md).
 ---
-MIT — [Covibes](https://github.com/covibes)
+MIT - [Covibes](https://github.com/covibes)
 Built on [Claude Code](https://claude.com/product/claude-code) by Anthropic.