npm - @defai.digital/automatosx - Versions diffs - 5.2.2 → 5.3.3 - Mend

@defai.digital/automatosx 5.2.2 → 5.3.3

This diff shows the contents of publicly released package versions as they appear in their public registries. It is provided for informational purposes only.

Files changed (30)

package/CHANGELOG.md +280 -0
package/README.md +185 -35
package/dist/index.js +5171 -2087
package/dist/index.js.map +1 -1
package/dist/version.json +2 -2
package/examples/AGENTS_INFO.md +35 -1
package/examples/agents/backend.yaml +4 -3
package/examples/agents/ceo.yaml +4 -3
package/examples/agents/creative-marketer.yaml +121 -0
package/examples/agents/cto.yaml +4 -1
package/examples/agents/data-scientist.yaml +4 -3
package/examples/agents/data.yaml +4 -3
package/examples/agents/design.yaml +4 -3
package/examples/agents/devops.yaml +4 -3
package/examples/agents/frontend.yaml +4 -3
package/examples/agents/fullstack.yaml +4 -3
package/examples/agents/mobile.yaml +4 -3
package/examples/agents/product.yaml +4 -3
package/examples/agents/quality.yaml +3 -1
package/examples/agents/security.yaml +3 -1
package/examples/agents/writer.yaml +4 -3
package/package.json +1 -1
package/version.json +2 -2
package/examples/claude/commands/{ax:agent.md → ax-agent.md} +0 -0
package/examples/claude/commands/{ax:clear.md → ax-clear.md} +0 -0
package/examples/claude/commands/{ax:init.md → ax-init.md} +0 -0
package/examples/claude/commands/{ax:list.md → ax-list.md} +0 -0
package/examples/claude/commands/{ax:memory.md → ax-memory.md} +0 -0
package/examples/claude/commands/{ax:status.md → ax-status.md} +0 -0
package/examples/claude/commands/{ax:update.md → ax-update.md} +0 -0

package/CHANGELOG.md CHANGED

@@ -5,6 +5,286 @@ All notable changes to AutomatosX will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [5.3.3] - 2025-10-14
+### 🏗️ Foundation for Agent Optimization
+**This release establishes the infrastructure and comprehensive analysis for intelligent ability loading, setting the stage for 50-90% token savings in v5.4.0.**
+#### Added
+- **Ability Metadata Infrastructure**:
+  - Created `schema/ability-metadata.json` with tier framework (core/advanced/specialized)
+  - Established foundation for intelligent ability loading system
+  - Defined tier constraints: core ≤250 words, advanced ≤600 words, specialized unlimited
+  - Infrastructure ready for task complexity-based loading (v5.4.0)
+- **Comprehensive Optimization Analysis**:
+  - Complete analysis of all 16 agents and 63 abilities (`automatosx/PRD/v5.3-agent-optimization.md`)
+  - Token waste analysis identifying 50-92% savings potential
+  - Ability classification matrix (core/advanced/specialized)
+  - Per-agent optimization recommendations
+  - 4-phase implementation roadmap
+- **User Feedback Integration** (`automatosx/PRD/v5.4.0_Recommendations_and_Roadmap.md`):
+  - Agent role expansion strategy (community-driven framework for 50+ roles)
+  - Delegation Guard architecture (cycle detection, deadlock prevention)
+  - Configurable timeout system (25→35-45 minutes)
+  - Delegation depth increase plan (2→3 levels with safety guards)
+  - Prioritized implementation roadmap (P0-P3)
+- **Technical Implementation Specifications** (`automatosx/PRD/v5.4.0_Implementation_Guide.md`):
+  - Detailed architecture diagrams for Delegation Guard
+  - Graph-based cycle detection algorithm
+  - Role similarity scoring mechanism
+  - Context preservation for deep delegation chains
+  - Complete code examples and integration points
+- **Feature Roadmap** (`automatosx/PRD/FEATURE-ROADMAP-v5.4.md`):
+  - Agent interaction visualizer
+  - Public agent/ability registry
+  - Interactive debugger
+  - Provider response caching
+  - Git and CI/CD integrations
+  - Enterprise features (RBAC, audit logging, secrets management)
+#### Documentation
+- **v5.3-agent-optimization.md**: Complete optimization analysis with ability classification matrix, intelligent loading strategy, and success metrics
+- **v5.3.3-implementation-plan.md**: Foundation release plan and roadmap for v5.4.0
+- **v5.4.0_Recommendations_and_Roadmap.md**: User feedback analysis with prioritized P0-P3 recommendations
+- **v5.4.0_Implementation_Guide.md**: Technical specifications with pseudocode and implementation details
+- **FEATURE-ROADMAP-v5.4.md**: General feature roadmap for future releases
+#### Performance Impact (Foundation for v5.4.0)
+No immediate performance changes in v5.3.3. This release establishes infrastructure for v5.4.0 optimizations:
+| Agent | Current Avg Tokens | v5.4.0 Target | Savings | Use Case |
+|-------|-------------------|---------------|---------|----------|
+| Creative marketer | 5,242 | 400-800 | **85-92%** | Simple social media |
+| Design | 1,468 | 250-600 | **59-83%** | Quick wireframes |
+| Mobile | 1,732 | 250-500 | **71-86%** | Basic UI questions |
+| Data scientist | 992 | 250-500 | **50-75%** | Simple data queries |
+| Backend | 1,185 | 350-600 | **41-70%** | Simple CRUD |
+| Frontend | 846 | 250-450 | **47-70%** | Component questions |
+**Overall**: 50-90% token reduction for simple tasks while maintaining full power for complex workflows (v5.4.0).
+#### Changed
+None (infrastructure-only release)
+#### Fixed
+None (infrastructure-only release)
+#### Notes
+- **Zero breaking changes** - This is a pure infrastructure and documentation release
+- **All tests passing** - 1,702 tests (99.59% pass rate)
+- **TypeScript strict mode** - 0 errors
+- **Foundation complete** - Ready for v5.4.0 implementation
+- **Estimated v5.4.0 timeline** - 8-10 weeks for full optimization
+---
+## [5.3.1] - 2025-10-14
+### 🪟 Windows CLI Provider Detection & Enhanced Robustness
+**This patch release fixes critical Windows compatibility issues and adds robust provider detection with security enhancements.**
+#### Added
+- **Windows CLI Provider Detection** (Phase 1):
+  - Cross-platform CLI provider detector (`src/core/cli-provider-detector.ts`)
+  - Windows-specific detection using `where.exe` + PATH×PATHEXT fallback
+  - Unix detection using `which` command
+  - Detection caching for performance (< 1ms cached lookups)
+  - Support for `.CMD`, `.BAT`, `.EXE`, `.COM` extensions on Windows
+- **ENV Variable Override** (Phase 2):
+  - `CLAUDE_CLI` - Override Claude CLI path
+  - `GEMINI_CLI` - Override Gemini CLI path
+  - `CODEX_CLI` - Override Codex CLI path
+  - Three-layer detection: ENV → Config → PATH
+  - `ax status` shows ENV variable status with validation
+- **Provider Configuration** (Phase 2):
+  - `customPath` - Custom CLI path in provider config
+  - `versionArg` - Custom version check argument
+  - `minVersion` - Minimum required version (semantic versioning)
+- **Version Validation**:
+  - Semantic version parsing and comparison
+  - Automatic CLI version detection via `--version`
+  - Warning logs when version requirement not met
+  - Permissive behavior (allows if version check fails)
+- **Cross-Platform CI** (Phase 3):
+  - GitHub Actions workflows for Ubuntu, macOS, Windows
+  - Automatic testing on all platforms
+  - Coverage report artifacts
+  - 30-minute timeout for Windows tests
+#### Fixed
+- **Critical**: Windows CLI provider detection failures (GitHub Issue #1)
+  - Providers now detected correctly on Windows using `where.exe`
+  - Fallback to PATH×PATHEXT scanning if `where.exe` fails
+  - Standard PATH detection works on all platforms
+- **Provider Detection**:
+  - Enhanced `BaseProvider.checkCLIAvailabilityEnhanced()` with version validation
+  - Proper fallback chain: ENV → customPath → PATH
+  - Graceful degradation on detection failures
+- **CI Configuration**:
+  - Artifact upload paths corrected (coverage/ only)
+  - Removed non-existent test-results/ path
+  - Added `if-no-files-found: warn` for graceful handling
+- **Documentation**:
+  - Async error handling documentation for `detectAll()`
+  - Clear usage examples with proper error handling
+  - JSDoc annotations updated with `@throws` tags
+#### Security
+- **Path Traversal Protection**:
+  - Added validation to reject `..` (parent directory) patterns
+  - Added validation to reject `~` (home directory) shortcuts
+  - Security warnings logged for suspicious paths
+  - Read-only validation (no writes, minimal risk)
+#### Changed
+- **Provider Detection Priority**:
+  1. ENV variables (highest priority)
+  2. Config `customPath` (second priority)
+  3. Standard PATH detection (fallback)
+  4. Version validation (if `minVersion` set)
+#### Performance
+- **Detection Caching**: First call ~100-500ms, cached calls < 1ms
+- **Version Check Overhead**: +100-200ms when `minVersion` configured
+- **Path Validation**: +0.1-0.5ms per path (negligible)
+- **Overall Impact**: < 1% overhead
+#### Documentation
+- Added comprehensive JSDoc for all new APIs
+- Added usage examples for ENV variables
+- Added Windows-specific troubleshooting
+- Added version validation configuration guide
+- **README.md**: Enhanced with tested platforms (macOS 15, Ubuntu 24.04, Windows 10/11)
+- **README.md**: Simplified Windows Support section with clearer quick-start instructions
+- **Windows Setup Guide** (NEW): Complete installation walkthrough for Windows users
+- **Windows Troubleshooting**: Updated to v5.3.1 with provider detection solutions
+#### Testing
+- **1,670 tests passing** (100% pass rate)
+- **0 TypeScript errors** (strict mode)
+- **Cross-platform CI**: Ubuntu, macOS, Windows
+- **2 Windows-specific tests** (PATH×PATHEXT detection)
+#### Related
+- Fixes: GitHub Issue #1 (Windows CLI provider detection)
+- PRD: `tmp/PRD-WINDOWS-CLI-DETECTION.md`
+- Reports: `tmp/PHASE{1,2,3}-COMPLETION-REPORT.md`
+- Code Review: `tmp/CODE-REVIEW-REPORT.md`
+- Bug Fixes: `tmp/BUG-FIX-COMPLETION-REPORT.md`
+---
+## [5.3.0] - 2025-10-14
+### 🚀 Stage Execution & Checkpoint System
+**This release introduces a checkpoint-based stage execution system for fault-tolerant, long-running workflows with interactive, streaming, and hybrid execution modes.**
+#### Added
+- **Stage Execution System**:
+  - `StageExecutionController` - Orchestrates multi-stage execution with checkpoint support
+  - `CheckpointManager` - JSON-based checkpoint persistence with automatic cleanup
+  - `ProgressChannel` - Event-based real-time progress tracking
+  - `PromptManager` - User interaction prompts with timeout handling
+  - **Commands**:
+    - `ax resume <run-id>` - Resume execution from saved checkpoint
+    - `ax runs list` - List all checkpoint runs with filtering
+    - `ax runs show <run-id>` - Show detailed checkpoint information
+    - `ax runs delete <run-id>` - Delete checkpoint with confirmation
+- **Execution Modes**:
+  - `--interactive` - Pause between stages for user decisions
+  - `--streaming` - Real-time progress updates during execution
+  - `--hybrid` - Both interactive and streaming (shortcut for `--interactive --streaming`)
+  - `--resumable` - Enable checkpoint save for resume capability
+  - `--auto-continue` - Auto-confirm all checkpoints (CI-friendly mode)
+- **Configuration** (`automatosx.config.json`):
+  - `execution.stages.enabled` - Enable stage-based execution (opt-in)
+  - `execution.stages.autoSaveCheckpoint` - Auto-save checkpoints after each stage
+  - `execution.stages.checkpointPath` - Checkpoint storage directory
+  - `execution.stages.cleanupAfterDays` - Automatic checkpoint cleanup
+  - `execution.stages.prompts.autoConfirm` - Default auto-confirm behavior
+  - `execution.stages.progress.updateInterval` - Progress update frequency
+  - `execution.stages.progress.syntheticProgress` - Enable synthetic progress
+#### Fixed
+- **Critical**: Removed `argv.interactive || true` forcing all executions into interactive mode (src/cli/commands/run.ts:458)
+  - Now respects CLI flags: `--interactive`, `--streaming`, `--hybrid`, `--resumable` work correctly
+  - Fixes regression where flagship v5.3.0 features were broken
+- **Major**:
+  - Resume command now passes `memoryManager` to `StageExecutionController` for memory persistence (src/cli/commands/resume.ts:243-244)
+  - Config-driven automation settings now properly honored instead of being overridden by CLI defaults
+    - `autoSaveCheckpoint` uses config value when CLI flag not specified
+    - `autoConfirm` uses config value when CLI flag not specified
+  - Resume flow now preserves original `autoConfirm` choice from checkpoint instead of defaulting to `false`
+  - CLI options (`--interactive`, `--resumable`, `--auto-continue`, `--streaming`, `--hybrid`) no longer have hardcoded `default: false`, allowing proper config fallback
+- **Minor**:
+  - Removed duplicate spinner in streaming mode - `ProgressRenderer` now handles all visual feedback (src/core/stage-execution-controller.ts:1286)
+#### Technical Details
+- **New Core Modules**:
+  - `src/core/stage-execution-controller.ts` - Stage lifecycle, checkpoint integration, progress tracking
+  - `src/core/checkpoint-manager.ts` - Checkpoint CRUD, JSON persistence, automatic cleanup
+  - `src/core/progress-channel.ts` - Event-based progress updates with percentage tracking
+  - `src/core/prompt-manager.ts` - CLI user prompts with timeout and validation
+  - `src/cli/commands/resume.ts` - Resume from checkpoint with mode override support
+  - `src/cli/commands/runs.ts` - Checkpoint management (list, show, delete)
+  - `src/cli/renderers/progress-renderer.ts` - Real-time progress visualization
+  - `src/types/stage-execution.ts` - Complete type definitions for stage system
+- **Checkpoint Structure**:
+  ```
+  .automatosx/checkpoints/
+  └── <run-id>/
+      ├── checkpoint.json    # Checkpoint metadata and stage states
+      └── artifacts/         # Stage outputs and files
+  ```
+- **Benefits**:
+  - ✅ Fault tolerance: Resume from failure points
+  - ✅ Long-running workflows: Execute multi-hour tasks safely
+  - ✅ User control: Pause and review between stages
+  - ✅ Real-time feedback: Monitor progress during execution
+  - ✅ Audit trail: Complete execution history with artifacts
+#### Known Limitations
+- Test coverage for new features (StageExecutionController, CheckpointManager, resume, runs) is minimal
+  - Recommendation: Add comprehensive tests before production use
 ## [5.2.2] - 2025-10-14
 ### 🧪 Quality & Maintenance Release
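
The 5.3.1 entry in the CHANGELOG diff above describes a detection priority of ENV → config `customPath` → PATH, together with path validation that rejects `..` and `~`. The following is a minimal sketch of how such a chain might be written. It is not taken from the package source: `resolveCliPath`, `isSuspiciousPath`, and the `Resolution` type are hypothetical names, and falling through to the next layer when a path is rejected is an assumption (the changelog only says that suspicious paths are rejected and a warning is logged).

```typescript
// Illustrative sketch only; none of these names come from AutomatosX itself.
type Source = "env" | "config" | "path";

interface Resolution {
  source: Source;
  cliPath: string;
}

// Assumption: a path is "suspicious" if any segment is ".." or it starts with "~".
function isSuspiciousPath(p: string): boolean {
  const segments = p.split(/[\\/]/);
  return segments.includes("..") || p.startsWith("~");
}

// Try each layer in priority order: ENV value, then config customPath, then PATH lookup.
function resolveCliPath(
  envValue: string | undefined,
  customPath: string | undefined,
  lookupOnPath: () => string | undefined,
): Resolution | undefined {
  const candidates: Array<[Source, string | undefined]> = [
    ["env", envValue],
    ["config", customPath],
  ];
  for (const [source, value] of candidates) {
    // Assumption: a rejected or missing value falls through to the next layer.
    if (value && !isSuspiciousPath(value)) {
      return { source, cliPath: value };
    }
  }
  const found = lookupOnPath();
  return found ? { source: "path", cliPath: found } : undefined;
}
```

In this sketch the caller would pass something like the value of `CLAUDE_CLI`, the provider's `customPath`, and a callback wrapping `where.exe` or `which`; keeping the PATH lookup behind a callback means it only runs when the two explicit layers yield nothing.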

package/README.md CHANGED

@@ -7,9 +7,9 @@
 [![npm version](https://img.shields.io/npm/v/@defai.digital/automatosx.svg)](https://www.npmjs.com/package/@defai.digital/automatosx)
 [![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](LICENSE)
 [![TypeScript](https://img.shields.io/badge/TypeScript-100%25-blue.svg)](https://www.typescriptlang.org/)
-[![Tests](https://img.shields.io/badge/tests-1,259%20passing-brightgreen.svg)](#)
+[![Tests](https://img.shields.io/badge/tests-1,657%20passing-brightgreen.svg)](#)
-**Status**: ✅ Production Ready · v5.2.2 · October 2025
+**Status**: ✅ Production Ready · v5.3.3 · October 2025
 Looking for answers? See the [FAQ](FAQ.md).
@@ -19,13 +19,42 @@ Looking for answers? See the [FAQ](FAQ.md).
 **AutomatosX extends Claude Code with specialized AI agents that remember context, delegate tasks, and collaborate autonomously.**
-```bash
-# In Claude Code, simply use /ax:agent
-/ax:agent Paris, design authentication system with JWT
-/ax:agent Bob, implement the auth design  # Bob auto-receives Paris's design from memory
+### 💡 Best Way to Use: Natural Language Collaboration (Recommended)
+Instead of directly commanding agents, **let Claude Code think and coordinate**:
+```
+✅ RECOMMENDED: Natural language collaboration
+"please work with ax agent to implement user authentication with JWT"
+What happens:
+1. Claude Code analyzes your project structure
+2. AutomatosX selects the most suitable agent automatically
+3. Provides full context to the agent
+4. Validates the results
+5. Helps you understand and iterate
+```
+**vs.**
 ```
+⚡ EXPRESS: Direct slash command (for simple tasks)
+/ax:agent backend, implement JWT auth
+What happens:
+1. Backend agent executes directly
+2. Limited project context
+3. No validation or planning
+```
+### 🎓 Think of it This Way
-**The result**: Claude Code becomes a **learning, coordinated team** instead of a stateless assistant.
+- **Natural Collaboration** = Having a conversation with an intelligent coordinator who summons experts
+- **Slash Commands** = Directly commanding the experts without coordination
+**Recommendation**: Use natural language for 80% of tasks, slash commands for quick 20%.
+📖 **[Complete Best Practices Guide](docs/BEST-PRACTICES.md)**
 ---
@@ -203,22 +232,45 @@ AutomatosX offers **two powerful modes** to fit your workflow:
 ### 1️⃣ Claude Code Integration (Recommended)
-**Use AutomatosX agents directly inside Claude Code conversations** with the `/ax:agent` slash command.
+**The best way**: Use **natural language collaboration** to let Claude Code coordinate agents intelligently.
+#### Natural Language Collaboration (Primary Method - 80% of tasks)
+```
+# Let Claude Code think, plan, and coordinate
+"please work with ax agent to implement user authentication"
+"please work with ax agent to design a secure API for our application"
+"please work with ax agent to refactor this module with best practices"
+```
+**Why this is better**:
+- 🧠 Claude Code analyzes your project first
+- 🎯 Automatically selects the best agents
+- 📚 Provides full context from your codebase
+- ✅ Validates results and handles errors
+- 🔄 Easy to iterate and refine
+#### Slash Commands (Express Method - 20% of tasks)
 ```bash
-# In Claude Code, use the slash command
+# Direct execution for simple, well-defined tasks
 /ax:agent Paris, design a REST API for user authentication
-/ax:agent Bob, implement the auth API from Paris's design
-/ax:agent Steve, security audit the authentication code
+/ax:agent Bob, write a function to validate emails
+/ax:agent Steve, review this code snippet
 ```
+**Use slash commands when**:
+- ⚡ Task is simple and well-defined
+- 🎯 You know exactly which agent to use
+- 🚀 Speed matters more than planning
 **Perfect for**:
-- 💬 Interactive development workflows
-- 🔄 Seamless context switching within Claude Code
-- 🤝 Collaborative coding sessions
-- 🎯 Quick agent delegation while coding
+- 💬 All types of development workflows
+- 🔄 Both simple and complex tasks
+- 🤝 Single and multi-agent coordination
+- 🎯 Interactive and automated workflows
-**How it works**: Claude Code executes AutomatosX commands behind the scenes, brings results back into your conversation, and maintains full context.
+**How it works**: Claude Code acts as an intelligent coordinator, analyzing context, selecting agents, and orchestrating their work seamlessly.
 ### 2️⃣ Terminal/CLI Mode (Power Users)
@@ -278,7 +330,7 @@ npm install -g @defai.digital/automatosx
 ```bash
 ax --version
-# Should show: 5.1.0 (or later)
+# Should show: 5.3.3 (or later)
 ```
 > **Windows Users**: If `ax` command not found, see [Windows Troubleshooting](docs/troubleshooting/windows-troubleshooting.md)
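
The "(or later)" check in the hunk above, like the `minVersion` setting described in the 5.3.1 changelog entry, comes down to a semantic version comparison. Here is a minimal sketch of one way to do it. `parseVersion` and `meetsMinVersion` are hypothetical helpers, not the package's API. Treating an unparseable version as acceptable is one reading of the changelog's "permissive behavior" note, and pre-release tags are ignored for brevity.

```typescript
// Illustrative sketch only; these helpers are not part of AutomatosX.
type Version = [number, number, number];

// Extract the first "major.minor.patch" triple from arbitrary CLI output.
function parseVersion(text: string): Version | undefined {
  const m = /(\d+)\.(\d+)\.(\d+)/.exec(text);
  return m ? [Number(m[1]), Number(m[2]), Number(m[3])] : undefined;
}

// Returns true when the detected version is at least minVersion.
// Assumption: if either side cannot be parsed, the check passes (permissive).
function meetsMinVersion(detectedOutput: string | undefined, minVersion: string): boolean {
  const detected = detectedOutput ? parseVersion(detectedOutput) : undefined;
  const required = parseVersion(minVersion);
  if (!detected || !required) {
    return true;
  }
  const [dMajor, dMinor, dPatch] = detected;
  const [rMajor, rMinor, rPatch] = required;
  if (dMajor !== rMajor) return dMajor > rMajor;
  if (dMinor !== rMinor) return dMinor > rMinor;
  return dPatch >= rPatch;
}
```

For example, `meetsMinVersion("5.3.3", "5.1.0")` evaluates to `true`, while `meetsMinVersion("5.0.9", "5.1.0")` evaluates to `false`.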
@@ -332,9 +384,94 @@ ax list agents
 ---
+### 🪟 Windows Support (Fully Tested)
+**AutomatosX v5.3.1+ fully supports Windows 10 & 11** with automatic CLI provider detection.
+#### Quick Start for Windows Users
+**Most users don't need any configuration** - AutomatosX automatically detects provider CLIs installed via npm:
+```bash
+# 1. Install providers (if not already installed)
+npm install -g @anthropic-ai/claude-cli
+npm install -g @google/generative-ai-cli
+npm install -g openai
+# 2. Verify detection
+ax status
+```
+**If providers are not detected**, you can manually specify paths:
+**Windows (Command Prompt)**:
+```cmd
+set CLAUDE_CLI=C:\Users\YourName\AppData\Roaming\npm\claude.cmd
+set GEMINI_CLI=C:\Users\YourName\AppData\Roaming\npm\gemini.cmd
+ax status
+```
+**Windows (PowerShell)**:
+```powershell
+$env:CLAUDE_CLI="C:\Users\YourName\AppData\Roaming\npm\claude.cmd"
+$env:GEMINI_CLI="C:\Users\YourName\AppData\Roaming\npm\gemini.cmd"
+ax status
+```
+#### How Provider Detection Works
+AutomatosX uses a **three-layer detection system**:
+1. **ENV Variables** (highest priority) - `CLAUDE_CLI`, `GEMINI_CLI`, `CODEX_CLI`
+2. **Config File** - Custom paths in `automatosx.config.json`
+3. **PATH Detection** (automatic) - Standard system PATH
+   - **Windows**: Uses `where.exe` + PATH×PATHEXT scanning
+   - **Unix/macOS**: Uses `which` command
+#### Windows-Specific Help
+Having issues on Windows? See our comprehensive guides:
+- 📖 **[Windows Setup Guide](docs/troubleshooting/windows-setup.md)** - Complete Windows installation walkthrough
+- 🔧 **[Windows Troubleshooting](docs/troubleshooting/windows-troubleshooting.md)** - Common Windows issues and solutions
+- ⚙️ **[Advanced Configuration](docs/guide/configuration.md)** - Custom paths, version requirements, and more
+> **💡 Quick Tip**: Run `ax status --verbose` to see exactly which paths are being detected and used.
+---
 ### Step 3: Run Your First Agent
-**Terminal Mode** (any platform):
+#### Option A: Claude Code Integration (Recommended)
+**Best Practice: Natural Language Collaboration**
+Open Claude Code and try these prompts:
+```
+✅ "please work with ax agent to create a simple calculator function"
+✅ "please work with ax agent to design a REST API for user management"
+✅ "please work with ax agent to implement secure authentication"
+```
+**What happens**:
+1. Claude Code analyzes your project context
+2. Selects and coordinates the best agents
+3. Agents execute with full context
+4. Results are validated and explained
+5. Easy to iterate: "please improve the error handling"
+**Express Option: Slash Commands** (for simple tasks)
+```bash
+# Quick, direct execution
+/ax:agent backend, write a function to validate email
+/ax:agent quality, review this code snippet
+```
+📖 **Learn more**: [Best Practices Guide](docs/BEST-PRACTICES.md)
+#### Option B: Terminal Mode (Power Users)
 ```bash
 # Test with backend agent
@@ -346,22 +483,6 @@ ax run Bob "Implement the API"           # Auto-receives Paris's design
 ax run Queenie "Write tests for the API" # Auto-receives design + implementation
 ```
-**Claude Code Integration**:
-```bash
-# In Claude Code, use the slash command
-/ax:agent Paris, design REST API for users
-/ax:agent Bob, implement the API
-/ax:agent Queenie, write tests for the API
-```
-**What happens**:
-1. Claude Code executes AutomatosX behind the scenes
-2. Paris designs the API → Saved to memory
-3. Bob reads Paris's design from memory → Implements code
-4. Queenie reads everything → Writes comprehensive tests
-5. Results flow back into your Claude Code conversation
 ---
 ### Common Issues
@@ -426,6 +547,26 @@ ax mcp
 ---
+## ⏱️ Stage Checkpoints & Run History
+AutomatosX 5.3 introduces stage-aware checkpoints so you can pause long-running agent workflows, inspect intermediate outputs, and resume exactly where the run stopped.
+### Enable resumable runs
+- CLI: `ax run <agent> "<task>" --resumable` (add `--interactive` or `--hybrid` for live approval)
+- Config: set `execution.stages.enabled` to `true` in `automatosx.config.json` to make stage checkpoints the default. Combine with `execution.stages.autoSaveCheckpoint` to persist after every stage.
+When a stage finishes, AutomatosX stores a checkpoint under `.automatosx/checkpoints/<run-id>/` (artifacts, logs, and metadata). The CLI prints the UUID so you can resume or inspect it later.
+### Manage checkpoints
+- `ax resume <run-id>` — Resume a saved run. Flags such as `--interactive`, `--streaming`, `--hybrid`, or `--auto-continue` override the saved execution mode.
+- `ax runs list [--status running|paused|completed|failed|aborted] [--agent <name>] [--limit <n>]` — Review recent checkpoints with progress and status.
+- `ax runs show <run-id> [--artifacts]` — Inspect stage history, retry counts, and generated artifacts before resuming.
+- `ax runs delete <run-id> [--force]` — Remove stale checkpoints or clear sensitive artifacts once a run is finalized.
+Set `execution.stages.cleanupAfterDays` to control automatic pruning (default 7 days). For an end-to-end guide, see [Checkpoints & Run History](docs/guide/checkpoints-and-resume.md).
+---
 ## 📚 Documentation
 ### Getting Started
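
The checkpoint hunk above states that `execution.stages.cleanupAfterDays` controls automatic pruning, with a default of 7 days. The selection logic behind that kind of pruning could look roughly like the sketch below. This is a guess at the shape, not the package's `CheckpointManager`: `CheckpointInfo`, its `updatedAt` field, and `findStaleCheckpoints` are hypothetical, and measuring age from the last update rather than creation is an assumption.

```typescript
// Illustrative sketch only; the real checkpoint.json layout is not shown in this diff.
interface CheckpointInfo {
  runId: string;
  updatedAt: Date; // assumption: age is measured from the last update
}

const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Return the run IDs whose checkpoints are older than cleanupAfterDays.
function findStaleCheckpoints(
  checkpoints: CheckpointInfo[],
  now: Date,
  cleanupAfterDays: number = 7, // default taken from the README text above
): string[] {
  const cutoff = now.getTime() - cleanupAfterDays * MS_PER_DAY;
  return checkpoints
    .filter((c) => c.updatedAt.getTime() < cutoff)
    .map((c) => c.runId);
}
```

Passing `now` in explicitly keeps the function pure, so the cutoff can be tested without touching the clock or the `.automatosx/checkpoints/` directory.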
@@ -441,6 +582,7 @@ ax mcp
 - **[Multi-Agent Orchestration](docs/guide/multi-agent-orchestration.md)** - Natural language delegation
 - **[Team Configuration](docs/guide/team-configuration.md)** - Team-based agent organization
 - **[Agent Templates](docs/guide/agent-templates.md)** - Quick agent creation
+- **[Checkpoints & Run History](docs/guide/checkpoints-and-resume.md)** - Manage resumable runs and checkpoint storage
 ### Tutorials
 - **[Memory Management](docs/tutorials/memory-management.md)** - Hands-on memory system guide
@@ -578,12 +720,20 @@ I need Daisy to analyze the data    # Need expression
 ## 🛠️ Production-Ready
-✅ **1,259 tests passing** (100% pass rate)
+✅ **1,670 tests passing** (100% pass rate)
 ✅ **TypeScript strict mode** (zero errors)
 ✅ **~56% test coverage** (comprehensive testing)
 ✅ **458KB bundle** (99.9% smaller than v3.x)
 ✅ **< 1ms memory search** (62x faster than v3.x)
+### Tested Platforms
+AutomatosX has been thoroughly tested across multiple operating systems:
+- ✅ **macOS**: macOS 15+ (tested on macOS 15)
+- ✅ **Ubuntu**: Ubuntu 24.04 LTS
+- ✅ **Windows**: Windows 10 & Windows 11
 ### Performance Metrics
 ```