@defai.digital/automatosx 5.3.1 → 5.3.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (30)

package/CHANGELOG.md +179 -0
package/README.md +112 -37
package/dist/index.js +541 -73
package/dist/index.js.map +1 -1
package/dist/version.json +1 -1
package/examples/AGENTS_INFO.md +35 -1
package/examples/agents/backend.yaml +4 -3
package/examples/agents/ceo.yaml +4 -3
package/examples/agents/creative-marketer.yaml +121 -0
package/examples/agents/cto.yaml +12 -3
package/examples/agents/data-scientist.yaml +11 -6
package/examples/agents/data.yaml +4 -3
package/examples/agents/design.yaml +4 -3
package/examples/agents/devops.yaml +12 -5
package/examples/agents/frontend.yaml +4 -3
package/examples/agents/fullstack.yaml +4 -3
package/examples/agents/mobile.yaml +4 -3
package/examples/agents/product.yaml +4 -3
package/examples/agents/quality.yaml +3 -1
package/examples/agents/security.yaml +3 -1
package/examples/agents/writer.yaml +4 -3
package/package.json +1 -1
package/version.json +1 -1
package/examples/claude/commands/{ax:agent.md → ax-agent.md} +0 -0
package/examples/claude/commands/{ax:clear.md → ax-clear.md} +0 -0
package/examples/claude/commands/{ax:init.md → ax-init.md} +0 -0
package/examples/claude/commands/{ax:list.md → ax-list.md} +0 -0
package/examples/claude/commands/{ax:memory.md → ax-memory.md} +0 -0
package/examples/claude/commands/{ax:status.md → ax-status.md} +0 -0
package/examples/claude/commands/{ax:update.md → ax-update.md} +0 -0

package/CHANGELOG.md CHANGED

@@ -5,6 +5,185 @@ All notable changes to AutomatosX will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [5.3.4] - 2025-10-14
+### 🚀 Enhanced Delegation Depth for Coordinators (Phase 2 Pilot)
+**This release implements Phase 2 of the user-requested delegation enhancements, increasing delegation depth from 1-2 to 3 layers for coordinator agents while maintaining robust safety mechanisms.**
+#### Added
+- **3-Layer Delegation Support**:
+  - **CTO (Tony)**: Strategic coordinator (`maxDelegationDepth: 3`) - orchestrate multi-phase technical initiatives
+    - Layer 1: Direct delegation to implementation teams
+    - Layer 2: Coordinated cross-team initiatives
+    - Layer 3: Strategic multi-phase projects with sub-coordination
+  - **DevOps (Oliver)**: Infrastructure coordinator (`maxDelegationDepth: 3`) - manage complex deployment pipelines
+    - Layer 1: Direct delegation to development teams
+    - Layer 2: Cross-team infrastructure initiatives
+    - Layer 3: Complex deployment pipelines with multiple coordination points
+  - **Data Scientist (Dana)**: Data science coordinator (`maxDelegationDepth: 3`) - orchestrate end-to-end ML workflows
+    - Layer 1: Direct delegation to data engineer, backend, quality
+    - Layer 2: Cross-functional analytics initiatives
+    - Layer 3: End-to-end ML pipelines with multiple coordination points
+- **Improved Depth Enforcement Logic** (`src/agents/executor.ts:755-757`):
+  - Changed depth checking from `fromAgent` to delegation chain `initiator`
+  - Allows coordinators to delegate through implementers without hitting depth limits
+  - Example: CTO (depth 3) → Backend (depth 1) → Frontend (depth 1) → Done ✅
+  - Previously would fail at 2nd delegation due to Backend's depth 1 limit ❌
+- **Comprehensive Test Coverage**:
+  - Created `tests/unit/executor-delegation-depth-3.test.ts` with 15 new tests
+  - 5 tests for 3-layer success scenarios
+  - 3 tests for 4-layer rejection (exceeds limit)
+  - 3 tests for backward compatibility
+  - 2 tests for cycle detection at 3 layers
+  - 2 tests for delegation chain tracking
+  - **All 1,717 tests passing** (100% pass rate)
+#### Changed
+- **Agent Configuration Updates**:
+  - `.automatosx/agents/cto.yaml`: `maxDelegationDepth: 1 → 3` (strategic coordinator)
+  - `.automatosx/agents/devops.yaml`: `maxDelegationDepth: 0 → 3` (infrastructure coordinator)
+  - `.automatosx/agents/data-scientist.yaml`: `maxDelegationDepth: 1 → 3` (data science coordinator)
+  - Updated system prompts to reflect new coordinator roles
+- **Delegation Safety**:
+  - Existing cycle detection continues to work at all depth levels
+  - 4-layer delegation attempts are rejected with clear error messages
+  - Implementer agents (Backend, Frontend, etc.) remain at `maxDelegationDepth: 1`
+#### Fixed
+- **Windows Provider Detection** (`src/providers/base-provider.ts`):
+  - Fixed provider CLI detection on Windows by using cross-platform `findOnPath()` from `cli-provider-detector`
+  - Previously, `spawn('claude', ['--version'])` would fail on Windows because Node.js doesn't auto-append `.cmd` extension
+  - Now uses `where.exe` + PATH×PATHEXT fallback for proper Windows detection
+  - **Impact**: Providers installed via npm on Windows (e.g., `claude.cmd`) are now correctly detected
+  - **Issue**: Users could run `claude` in terminal but AutomatosX showed "provider unavailable"
+#### Documentation
+- **CLAUDE.md**: Updated Agent Directory & Governance section with v5.3.4 enhancements
+- **CHANGELOG.md**: This entry documenting all Phase 2 changes
+#### Technical Details
+**Depth Enforcement Change** (Breaking for test implementations, not user-facing):
+```typescript
+// Before (v5.3.3 and earlier):
+const maxDepth = fromAgentProfile.orchestration?.maxDelegationDepth ?? 2;
+// After (v5.3.4):
+const initiatorName = delegationChain.length > 0 ? delegationChain[0] : request.fromAgent;
+const initiatorProfile = await this.profileLoader.loadProfile(initiatorName);
+const maxDepth = initiatorProfile.orchestration?.maxDelegationDepth ?? 2;
+```
+**Impact**: Allows coordinators to orchestrate deep delegation chains through implementers without hitting depth limits. Implementers can still only delegate once, but coordinator's depth limit applies to the entire chain.
+#### Performance
+- No performance impact: Logic change is O(1) (single profile lookup)
+- All existing tests passing (1,717 tests, 100% pass rate)
+- Test execution time: ~50s (no regression)
+#### Migration
+**100% Backward Compatible** - No action required for existing deployments:
+- Default `maxDelegationDepth` remains 2 for agents without orchestration config
+- Implementer agents (Backend, Frontend, etc.) remain at depth 1
+- Only 3 coordinator agents updated to depth 3 (CTO, DevOps, Data Scientist)
+- Existing delegation logic fully preserved
+## [5.3.3] - 2025-10-14
+### 🏗️ Foundation for Agent Optimization
+**This release establishes the infrastructure and comprehensive analysis for intelligent ability loading, setting the stage for 50-90% token savings in v5.4.0.**
+#### Added
+- **Ability Metadata Infrastructure**:
+  - Created `schema/ability-metadata.json` with tier framework (core/advanced/specialized)
+  - Established foundation for intelligent ability loading system
+  - Defined tier constraints: core ≤250 words, advanced ≤600 words, specialized unlimited
+  - Infrastructure ready for task complexity-based loading (v5.4.0)
+- **Comprehensive Optimization Analysis**:
+  - Complete analysis of all 16 agents and 63 abilities (`automatosx/PRD/v5.3-agent-optimization.md`)
+  - Token waste analysis identifying 50-92% savings potential
+  - Ability classification matrix (core/advanced/specialized)
+  - Per-agent optimization recommendations
+  - 4-phase implementation roadmap
+- **User Feedback Integration** (`automatosx/PRD/v5.4.0_Recommendations_and_Roadmap.md`):
+  - Agent role expansion strategy (community-driven framework for 50+ roles)
+  - Delegation Guard architecture (cycle detection, deadlock prevention)
+  - Configurable timeout system (25→35-45 minutes)
+  - Delegation depth increase plan (2→3 levels with safety guards)
+  - Prioritized implementation roadmap (P0-P3)
+- **Technical Implementation Specifications** (`automatosx/PRD/v5.4.0_Implementation_Guide.md`):
+  - Detailed architecture diagrams for Delegation Guard
+  - Graph-based cycle detection algorithm
+  - Role similarity scoring mechanism
+  - Context preservation for deep delegation chains
+  - Complete code examples and integration points
+- **Feature Roadmap** (`automatosx/PRD/FEATURE-ROADMAP-v5.4.md`):
+  - Agent interaction visualizer
+  - Public agent/ability registry
+  - Interactive debugger
+  - Provider response caching
+  - Git and CI/CD integrations
+  - Enterprise features (RBAC, audit logging, secrets management)
+#### Documentation
+- **v5.3-agent-optimization.md**: Complete optimization analysis with ability classification matrix, intelligent loading strategy, and success metrics
+- **v5.3.3-implementation-plan.md**: Foundation release plan and roadmap for v5.4.0
+- **v5.4.0_Recommendations_and_Roadmap.md**: User feedback analysis with prioritized P0-P3 recommendations
+- **v5.4.0_Implementation_Guide.md**: Technical specifications with pseudocode and implementation details
+- **FEATURE-ROADMAP-v5.4.md**: General feature roadmap for future releases
+#### Performance Impact (Foundation for v5.4.0)
+No immediate performance changes in v5.3.3. This release establishes infrastructure for v5.4.0 optimizations:
+| Agent | Current Avg Tokens | v5.4.0 Target | Savings | Use Case |
+|-------|-------------------|---------------|---------|----------|
+| Creative marketer | 5,242 | 400-800 | **85-92%** | Simple social media |
+| Design | 1,468 | 250-600 | **59-83%** | Quick wireframes |
+| Mobile | 1,732 | 250-500 | **71-86%** | Basic UI questions |
+| Data scientist | 992 | 250-500 | **50-75%** | Simple data queries |
+| Backend | 1,185 | 350-600 | **41-70%** | Simple CRUD |
+| Frontend | 846 | 250-450 | **47-70%** | Component questions |
+**Overall**: 50-90% token reduction for simple tasks while maintaining full power for complex workflows (v5.4.0).
+#### Changed
+None (infrastructure-only release)
+#### Fixed
+None (infrastructure-only release)
+#### Notes
+- **Zero breaking changes** - This is a pure infrastructure and documentation release
+- **All tests passing** - 1,702 tests (99.59% pass rate)
+- **TypeScript strict mode** - 0 errors
+- **Foundation complete** - Ready for v5.4.0 implementation
+- **Estimated v5.4.0 timeline** - 8-10 weeks for full optimization
+---
 ## [5.3.1] - 2025-10-14
 ### 🪟 Windows CLI Provider Detection & Enhanced Robustness
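
Reviewer note on the 5.3.4 "Windows Provider Detection" fix in the CHANGELOG hunk above: the entry mentions a PATH×PATHEXT fallback, which can be pictured as the cross product of every `PATH` directory with every `PATHEXT` extension. The sketch below only enumerates the candidate file names such a fallback would probe. `windowsCandidates` is a hypothetical helper written for this note, not the package's `findOnPath()`, and it performs no filesystem checks.

```typescript
// Hypothetical helper (assumption): lists the paths a PATH x PATHEXT lookup
// would try for a bare command name on Windows. It does not touch the disk.
function windowsCandidates(
  command: string,
  pathVar: string,
  pathExt: string,
): string[] {
  // Windows separates entries in both PATH and PATHEXT with ';'.
  const dirs = pathVar.split(";").filter((d) => d.length > 0);
  const exts = pathExt.split(";").filter((e) => e.length > 0);

  const candidates: string[] = [];
  for (const dir of dirs) {
    // Avoid a doubled backslash when the directory already ends with one.
    const base = dir.endsWith("\\") ? dir : dir + "\\";
    for (const ext of exts) {
      candidates.push(base + command + ext);
    }
  }
  return candidates;
}
```

With `PATHEXT` containing `.CMD`, a directory holding `claude.cmd` yields a matching candidate, which is the case the changelog says a plain `spawn('claude', ...)` missed.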

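Reviewer note on the 5.3.4 "Depth Enforcement Change" in the CHANGELOG hunk above: a minimal TypeScript sketch of initiator-based depth checking, so the CTO → Backend → Frontend example can be traced by hand. Only `maxDelegationDepth`, the default of 2, and the rule "first chain entry, else `request.fromAgent`" come from the changelog. The `AgentProfile` and `DelegationRequest` shapes, the in-memory `profiles` map (standing in for `profileLoader.loadProfile`), the `canDelegate` helper, the `delegationChain.length < maxDepth` comparison, and the cycle rule are assumptions made for illustration, not the package's code.

```typescript
// Assumed shapes; the real profile type is not shown in the diff.
interface AgentProfile {
  orchestration?: { maxDelegationDepth?: number };
}

interface DelegationRequest {
  fromAgent: string;
  toAgent: string;
}

const DEFAULT_MAX_DEPTH = 2; // default stated in the changelog

// Depth limit of the chain's initiator (v5.3.4 behaviour per the changelog).
// Assumption: delegationChain lists agents that have already delegated,
// oldest first, and does not yet include request.fromAgent.
function resolveMaxDepth(
  profiles: Record<string, AgentProfile>,
  delegationChain: string[],
  request: DelegationRequest,
): number {
  const initiatorName = delegationChain[0] ?? request.fromAgent;
  return (
    profiles[initiatorName]?.orchestration?.maxDelegationDepth ??
    DEFAULT_MAX_DEPTH
  );
}

// Hypothetical guard: rejects cycles and chains that exceed the initiator's limit.
function canDelegate(
  profiles: Record<string, AgentProfile>,
  delegationChain: string[],
  request: DelegationRequest,
): boolean {
  const visited = [...delegationChain, request.fromAgent];
  if (visited.includes(request.toAgent)) {
    return false; // assumed cycle rule: never delegate back into the chain
  }
  return delegationChain.length < resolveMaxDepth(profiles, delegationChain, request);
}

// Depth values taken from the changelog: coordinator 3, implementers 1.
const profiles: Record<string, AgentProfile> = {
  cto: { orchestration: { maxDelegationDepth: 3 } },
  backend: { orchestration: { maxDelegationDepth: 1 } },
  frontend: { orchestration: { maxDelegationDepth: 1 } },
  quality: { orchestration: { maxDelegationDepth: 1 } },
};

// Second hop of the changelog example: Backend re-delegates inside a CTO chain.
console.log(canDelegate(profiles, ["cto"], { fromAgent: "backend", toAgent: "frontend" })); // true
// Fourth hop in the same chain exceeds the CTO's limit of 3.
console.log(
  canDelegate(profiles, ["cto", "backend", "frontend"], { fromAgent: "quality", toAgent: "security" }),
); // false
```

Under these assumptions, a chain started by Backend is still cut off after one hop, which matches the changelog's statement that implementers remain at depth 1.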
package/README.md CHANGED

@@ -7,9 +7,9 @@
 [![npm version](https://img.shields.io/npm/v/@defai.digital/automatosx.svg)](https://www.npmjs.com/package/@defai.digital/automatosx)
 [![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](LICENSE)
 [![TypeScript](https://img.shields.io/badge/TypeScript-100%25-blue.svg)](https://www.typescriptlang.org/)
-[![Tests](https://img.shields.io/badge/tests-1,657%20passing-brightgreen.svg)](#)
+[![Tests](https://img.shields.io/badge/tests-1,717%20passing-brightgreen.svg)](#)
-**Status**: ✅ Production Ready · v5.3.0 · October 2025
+**Status**: ✅ Production Ready · v5.3.4 · October 2025
 Looking for answers? See the [FAQ](FAQ.md).
@@ -19,13 +19,42 @@ Looking for answers? See the [FAQ](FAQ.md).
 **AutomatosX extends Claude Code with specialized AI agents that remember context, delegate tasks, and collaborate autonomously.**
-```bash
-# In Claude Code, simply use /ax:agent
-/ax:agent Paris, design authentication system with JWT
-/ax:agent Bob, implement the auth design  # Bob auto-receives Paris's design from memory
+### 💡 Best Way to Use: Natural Language Collaboration (Recommended)
+Instead of directly commanding agents, **let Claude Code think and coordinate**:
+```
+✅ RECOMMENDED: Natural language collaboration
+"please work with ax agent to implement user authentication with JWT"
+What happens:
+1. Claude Code analyzes your project structure
+2. AutomatosX selects the most suitable agent automatically
+3. Provides full context to the agent
+4. Validates the results
+5. Helps you understand and iterate
+```
+**vs.**
 ```
+⚡ EXPRESS: Direct slash command (for simple tasks)
+/ax:agent backend, implement JWT auth
+What happens:
+1. Backend agent executes directly
+2. Limited project context
+3. No validation or planning
+```
+### 🎓 Think of it This Way
-**The result**: Claude Code becomes a **learning, coordinated team** instead of a stateless assistant.
+- **Natural Collaboration** = Having a conversation with an intelligent coordinator who summons experts
+- **Slash Commands** = Directly commanding the experts without coordination
+**Recommendation**: Use natural language for 80% of tasks, slash commands for quick 20%.
+📖 **[Complete Best Practices Guide](docs/BEST-PRACTICES.md)**
 ---
@@ -150,6 +179,7 @@ Product response:
 ## 🎭 12 Specialized Agents with Clear Governance
 **v5.0.12 introduces strict role ownership and delegation controls to eliminate cycles**:
+**v5.3.4 Phase 2 Pilot**: 3 coordinator agents now support 3-layer delegation for complex multi-phase workflows
 ### 💻 Engineering Team (Implementers)
 **maxDelegationDepth: 1** - Can delegate once for cross-domain needs, no re-delegation
@@ -157,12 +187,16 @@ Product response:
   - Can delegate to: frontend, data, security, quality, devops
 - **Frank** (frontend) - Component architecture, state management, accessibility
   - Can delegate to: backend, design, security, quality, devops
-- **Oliver** (devops) - Infrastructure as code, CI/CD pipelines, observability
+- **Oliver** (devops) - **🆕 v5.3.4: Infrastructure Coordinator (depth 3)** - Orchestrate complex deployment pipelines
   - Can delegate to: backend, frontend, security, quality
+  - 3-layer capability for multi-phase infrastructure workflows
 - **Daisy** (data) - Data modeling, ETL pipelines, SQL optimization
   - Can delegate to: backend, security, quality
 - **Steve** (security) - **Sole owner** of security-audit, threat modeling, secure coding review
   - Can delegate to: backend, frontend, devops, quality
+- **Dana** (data-scientist) - **🆕 v5.3.4: Data Science Coordinator (depth 3)** - End-to-end ML pipelines
+  - Can delegate to: data, backend, quality
+  - 3-layer capability for complex data science workflows
 ### 🎯 Quality Team (Coordinator Role)
 **maxDelegationDepth: 1** - Can delegate fixes back to implementers, no re-delegation
@@ -177,13 +211,16 @@ Product response:
   - Can delegate to: backend, frontend, design, quality
 ### 📊 Leadership Team (Coordinators)
-**maxDelegationDepth: 1** - Delegate to implementers, focus on strategy, no re-delegation
+**maxDelegationDepth: 1-3** - Delegate to implementers, focus on strategy
 - **Paris** (product) - Product strategy, feature planning, roadmap
   - Can delegate to: backend, frontend, design, writer, quality
+  - maxDelegationDepth: 1
 - **Eric** (ceo) - Business strategy, organizational leadership
   - Can delegate to: paris, tony, all agents
-- **Tony** (cto) - Technology strategy, technical leadership
+  - maxDelegationDepth: 1
+- **Tony** (cto) - **🆕 v5.3.4: Strategic Coordinator (depth 3)** - Multi-phase technical initiatives
   - Can delegate to: backend, frontend, devops, security, quality
+  - 3-layer capability for strategic technology projects with sub-coordination
 ### 🔬 Research Team (Specialist)
 **maxDelegationDepth: 0** - Execute research work directly, no delegation
@@ -193,6 +230,8 @@ Product response:
 **New in v5.0.12**: Each agent has role-specific workflow stages, smart ability loading (abilitySelection), and explicit delegation scopes. Most agents have `maxDelegationDepth: 1` to allow cross-domain collaboration while preventing delegation cycles.
+**New in v5.3.4 (Phase 2 Pilot)**: 3 coordinator agents (Tony/CTO, Oliver/DevOps, Dana/Data Scientist) now support `maxDelegationDepth: 3` for orchestrating complex multi-layer workflows. This enables strategic coordination of multi-phase projects while maintaining safety through depth limits and cycle detection.
 [📖 Complete Agent Directory](examples/AGENTS_INFO.md)
 ---
@@ -203,22 +242,45 @@ AutomatosX offers **two powerful modes** to fit your workflow:
 ### 1️⃣ Claude Code Integration (Recommended)
-**Use AutomatosX agents directly inside Claude Code conversations** with the `/ax:agent` slash command.
+**The best way**: Use **natural language collaboration** to let Claude Code coordinate agents intelligently.
+#### Natural Language Collaboration (Primary Method - 80% of tasks)
+```
+# Let Claude Code think, plan, and coordinate
+"please work with ax agent to implement user authentication"
+"please work with ax agent to design a secure API for our application"
+"please work with ax agent to refactor this module with best practices"
+```
+**Why this is better**:
+- 🧠 Claude Code analyzes your project first
+- 🎯 Automatically selects the best agents
+- 📚 Provides full context from your codebase
+- ✅ Validates results and handles errors
+- 🔄 Easy to iterate and refine
+#### Slash Commands (Express Method - 20% of tasks)
 ```bash
-# In Claude Code, use the slash command
+# Direct execution for simple, well-defined tasks
 /ax:agent Paris, design a REST API for user authentication
-/ax:agent Bob, implement the auth API from Paris's design
-/ax:agent Steve, security audit the authentication code
+/ax:agent Bob, write a function to validate emails
+/ax:agent Steve, review this code snippet
 ```
+**Use slash commands when**:
+- ⚡ Task is simple and well-defined
+- 🎯 You know exactly which agent to use
+- 🚀 Speed matters more than planning
 **Perfect for**:
-- 💬 Interactive development workflows
-- 🔄 Seamless context switching within Claude Code
-- 🤝 Collaborative coding sessions
-- 🎯 Quick agent delegation while coding
+- 💬 All types of development workflows
+- 🔄 Both simple and complex tasks
+- 🤝 Single and multi-agent coordination
+- 🎯 Interactive and automated workflows
-**How it works**: Claude Code executes AutomatosX commands behind the scenes, brings results back into your conversation, and maintains full context.
+**How it works**: Claude Code acts as an intelligent coordinator, analyzing context, selecting agents, and orchestrating their work seamlessly.
 ### 2️⃣ Terminal/CLI Mode (Power Users)
@@ -278,7 +340,7 @@ npm install -g @defai.digital/automatosx
 ```bash
 ax --version
-# Should show: 5.3.0 (or later)
+# Should show: 5.3.3 (or later)
 ```
 > **Windows Users**: If `ax` command not found, see [Windows Troubleshooting](docs/troubleshooting/windows-troubleshooting.md)
@@ -390,7 +452,36 @@ Having issues on Windows? See our comprehensive guides:
 ### Step 3: Run Your First Agent
-**Terminal Mode** (any platform):
+#### Option A: Claude Code Integration (Recommended)
+**Best Practice: Natural Language Collaboration**
+Open Claude Code and try these prompts:
+```
+✅ "please work with ax agent to create a simple calculator function"
+✅ "please work with ax agent to design a REST API for user management"
+✅ "please work with ax agent to implement secure authentication"
+```
+**What happens**:
+1. Claude Code analyzes your project context
+2. Selects and coordinates the best agents
+3. Agents execute with full context
+4. Results are validated and explained
+5. Easy to iterate: "please improve the error handling"
+**Express Option: Slash Commands** (for simple tasks)
+```bash
+# Quick, direct execution
+/ax:agent backend, write a function to validate email
+/ax:agent quality, review this code snippet
+```
+📖 **Learn more**: [Best Practices Guide](docs/BEST-PRACTICES.md)
+#### Option B: Terminal Mode (Power Users)
 ```bash
 # Test with backend agent
@@ -402,22 +493,6 @@ ax run Bob "Implement the API"           # Auto-receives Paris's design
 ax run Queenie "Write tests for the API" # Auto-receives design + implementation
 ```
-**Claude Code Integration**:
-```bash
-# In Claude Code, use the slash command
-/ax:agent Paris, design REST API for users
-/ax:agent Bob, implement the API
-/ax:agent Queenie, write tests for the API
-```
-**What happens**:
-1. Claude Code executes AutomatosX behind the scenes
-2. Paris designs the API → Saved to memory
-3. Bob reads Paris's design from memory → Implements code
-4. Queenie reads everything → Writes comprehensive tests
-5. Results flow back into your Claude Code conversation
 ---
 ### Common Issues