npm - @defai.digital/automatosx - Versions diffs - 12.6.3 → 12.8.2 - Mend

@defai.digital/automatosx 12.6.3 → 12.8.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (14)

package/README.md +74 -85
package/dist/index.js +12729 -7400
package/dist/mcp/index.js +5974 -1712
package/examples/abilities/api-design.md +9 -0
package/examples/abilities/code-review.md +9 -0
package/examples/abilities/db-modeling.md +9 -0
package/examples/abilities/debugging.md +9 -0
package/examples/abilities/performance.md +9 -0
package/examples/abilities/secure-coding-review.md +9 -0
package/examples/abilities/testing.md +9 -0
package/examples/abilities/threat-modeling.md +9 -0
package/examples/agents/backend.yaml +42 -73
package/examples/agents/quality.yaml +44 -20
package/package.json +12 -4

package/examples/abilities/api-design.md CHANGED

@@ -157,3 +157,12 @@ POST /api/v1/resources
 - [ ] Contract tests written
 - [ ] Load testing performed
 - [ ] Security audit completed
+## Application Hints
+When designing APIs:
+- Prioritize backward compatibility; breaking changes require explicit versioning strategy
+- Validate request/response schemas before implementation (use OpenAPI spec as contract)
+- Consider rate limiting and pagination for all list endpoints from the start
+- Check idempotency requirements for payment, financial, or state-changing operations
+- Document error codes and response formats before writing endpoint logic

package/examples/abilities/code-review.md CHANGED

@@ -40,3 +40,12 @@ Perform thorough code reviews focusing on quality, security, and maintainability
 3. Identify issues and improvements
 4. Prioritize findings (critical/major/minor)
 5. Provide actionable, constructive feedback
+## Application Hints
+When reviewing code:
+- Focus on correctness and security first; style issues are secondary
+- Verify edge cases and error paths are handled, not just the happy path
+- Check that tests cover the changed behavior, not just line coverage
+- Prioritize feedback by severity (critical/major/minor) to respect author's time
+- Provide specific improvements with code examples rather than vague criticism

package/examples/abilities/db-modeling.md CHANGED

@@ -156,3 +156,12 @@ Use CHECK constraints for valid states
 - [ ] Schema documented
 - [ ] Backup/restore verified
 - [ ] Security audit completed
+## Application Hints
+When modeling databases:
+- Choose data types for expected growth; BIGINT for IDs that may exceed 2 billion
+- Use TIMESTAMPTZ (not TIMESTAMP) for all date/time columns to avoid timezone bugs
+- Never use FLOAT/DOUBLE for money; use DECIMAL or store as integer cents
+- Design migrations to be reversible and test rollback before deploying
+- Add indexes based on actual query patterns, not theoretical access needs
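
Note on the money hint above: the sketch below illustrates why integer cents avoid the rounding surprises of binary floating point. `toCents` and `formatCents` are hypothetical helpers written for this note; they handle only non-negative amounts with at most two decimal places.

```typescript
// Converts a decimal string such as "19.99" to integer cents without float math.
function toCents(amount: string): number {
  const match = /^(\d+)(?:\.(\d{1,2}))?$/.exec(amount);
  if (!match) {
    throw new Error(`Invalid amount: ${amount}`);
  }
  const whole = Number(match[1]);
  // Pad "9" to "90" so that "19.9" means 19 dollars and 90 cents.
  const fraction = Number((match[2] ?? "").padEnd(2, "0"));
  return whole * 100 + fraction;
}

// Formats non-negative integer cents back into a decimal string.
function formatCents(cents: number): string {
  const whole = Math.floor(cents / 100);
  const fraction = String(cents % 100).padStart(2, "0");
  return `${whole}.${fraction}`;
}

// Floating point: 0.1 + 0.2 is not exactly 0.3.
console.log(0.1 + 0.2 === 0.3); // false

// Integer cents: the same sum is exact.
console.log(formatCents(toCents("0.10") + toCents("0.20"))); // "0.30"
```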

package/examples/abilities/debugging.md CHANGED

@@ -41,3 +41,12 @@ Systematically identify and fix bugs in code.
 - Type mismatches
 - Resource leaks (memory, file handles)
 - Logic errors in conditionals
+## Application Hints
+When debugging:
+- Reproduce the issue first; never attempt to fix what you cannot reliably reproduce
+- Isolate to the smallest failing case before investigating root cause
+- Verify the fix doesn't introduce regressions by running the full test suite
+- Document the root cause in the commit message, not just the symptom
+- Check for similar patterns elsewhere in the codebase that may have the same bug

package/examples/abilities/performance.md CHANGED

@@ -78,3 +78,12 @@ Before deploying:
 - [ ] Core Web Vitals measured and acceptable
 - [ ] Production build tested
 - [ ] Caching strategy configured
+## Application Hints
+When optimizing performance:
+- Measure before optimizing; profile to identify actual bottlenecks, not assumed ones
+- Target Core Web Vitals thresholds (LCP < 2.5s, FID < 100ms, CLS < 0.1)
+- Prefer code splitting and lazy loading over micro-optimizations
+- Don't over-memoize; useMemo/useCallback have overhead and should be profiled
+- Test performance with production builds; dev mode hides real performance issues
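
Note on the thresholds hint above: one way to turn the quoted limits (LCP < 2.5s, FID < 100ms, CLS < 0.1) into an automated check. The `WebVitals` shape and `failingVitals` function are hypothetical; how the measurements are collected is outside this sketch.

```typescript
// Hypothetical container for already-measured metrics (illustration only).
interface WebVitals {
  lcpMs: number;    // Largest Contentful Paint, in milliseconds
  fidMs: number;    // First Input Delay, in milliseconds
  clsScore: number; // Cumulative Layout Shift, unitless
}

// Returns the names of metrics that miss the thresholds quoted in the hint.
function failingVitals(vitals: WebVitals): string[] {
  const failures: string[] = [];
  if (vitals.lcpMs >= 2500) failures.push("LCP");
  if (vitals.fidMs >= 100) failures.push("FID");
  if (vitals.clsScore >= 0.1) failures.push("CLS");
  return failures;
}

console.log(failingVitals({ lcpMs: 3100, fidMs: 40, clsScore: 0.25 }));
```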

package/examples/abilities/secure-coding-review.md CHANGED

@@ -49,3 +49,12 @@ const results = await db.query(query, [email]);
 - ✅ Secrets not hardcoded
 - ✅ TLS 1.2+ required
 - ✅ No sensitive data in logs
+## Application Hints
+When reviewing for security:
+- Check authorization on every endpoint; authentication alone is not sufficient
+- Verify parameterized queries for all database operations, including ORMs
+- Ensure secrets are never hardcoded; check for API keys in config files and env vars
+- Validate all user input at trust boundaries, not deep in business logic
+- Review logging to ensure no PII, tokens, or credentials are exposed
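
Note on the logging hint above: a minimal sketch of redacting sensitive fields before a structured log entry is written. The key list in `SENSITIVE_KEYS` and the `redact` function are assumptions made for illustration; the package does not define them, and a real deny-list depends on the application's data model.

```typescript
// Hypothetical deny-list of field names; compared case-insensitively.
const SENSITIVE_KEYS = new Set(["password", "token", "authorization", "apikey", "secret", "email"]);

type LogValue = string | number | boolean | null | LogValue[] | { [key: string]: LogValue };

// Returns a copy of `value` with sensitive fields replaced, recursing into objects and arrays.
function redact(value: LogValue): LogValue {
  if (Array.isArray(value)) {
    return value.map((item) => redact(item));
  }
  if (value !== null && typeof value === "object") {
    const out: { [key: string]: LogValue } = {};
    for (const [key, inner] of Object.entries(value)) {
      out[key] = SENSITIVE_KEYS.has(key.toLowerCase()) ? "[REDACTED]" : redact(inner);
    }
    return out;
  }
  return value;
}

console.log(JSON.stringify(redact({ user: "alice", password: "hunter2", headers: { Authorization: "Bearer abc" } })));
```

Key-based redaction does not catch secrets embedded in free-text messages, so it complements rather than replaces a review of what gets logged.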

package/examples/abilities/testing.md CHANGED

@@ -45,3 +45,12 @@ Write comprehensive, maintainable tests for software systems.
 - Slow tests (block development)
 - Flaky tests (non-deterministic)
 - No tests (technical debt)
+## Application Hints
+When writing tests:
+- Test behavior, not implementation details; tests should survive refactoring
+- Prioritize critical paths and edge cases over chasing coverage percentages
+- Keep tests independent; shared state between tests causes flaky failures
+- Make test names describe the expected behavior, not the method being tested
+- Fix flaky tests immediately; they erode trust in the entire test suite

package/examples/abilities/threat-modeling.md CHANGED

@@ -47,3 +47,12 @@ fs.writeFile(`./uploads/${filename}`, content);
 - Prioritize by risk (likelihood × impact)
 - Document mitigations
 - Review with security team
+## Application Hints
+When threat modeling:
+- Identify assets and trust boundaries first before analyzing specific threats
+- Apply STRIDE per component or data flow, not globally across the entire system
+- Prioritize threats by (Impact × Likelihood); don't attempt to fix everything
+- Document residual risks explicitly; security is about informed trade-offs
+- Update the threat model when architecture changes, not just at initial design
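
Note on the prioritization hint above: a small sketch of ranking threats by Impact × Likelihood. The `Threat` shape, the 1 to 5 scales, and the sample entries are hypothetical and chosen only for illustration.

```typescript
interface Threat {
  name: string;
  impact: number;     // hypothetical scale: 1 (low) to 5 (high)
  likelihood: number; // hypothetical scale: 1 (rare) to 5 (frequent)
}

// Returns a new array sorted by descending risk score (impact × likelihood).
function rankThreats(threats: Threat[]): Threat[] {
  return [...threats].sort(
    (a, b) => b.impact * b.likelihood - a.impact * a.likelihood
  );
}

const rankedThreats = rankThreats([
  { name: "Verbose error pages", impact: 2, likelihood: 4 },
  { name: "Path traversal in uploads", impact: 5, likelihood: 3 },
  { name: "Datacenter outage", impact: 5, likelihood: 1 },
]);
console.log(rankedThreats.map((t) => `${t.name}: ${t.impact * t.likelihood}`));
```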

package/examples/agents/backend.yaml CHANGED

@@ -1,5 +1,6 @@
 # Backend Engineer - Bob
 # Senior Backend Engineer specializing in server-side architecture
+# v13.0.0: Updated with Cognitive Prompt Engineering Framework
 name: backend
 displayName: Bob
@@ -70,103 +71,71 @@ orchestration:
     - data
   canWriteToShared: true
+# v13.0.0: Cognitive Framework Configuration
+# Teaches the LLM HOW to think, not just WHAT to do
+cognitiveFramework:
+  # Full reasoning loop: Plan-Risk-Options-Validate-Execute-Report
+  scaffold: prover
+  # Backend-specific verification checklist
+  checklist: backend
+  # Standard structured output format
+  outputContract: standard
+  # Balanced uncertainty handling (ask for high-risk, proceed for low-risk)
+  uncertaintyMode: balanced
+# v13.0.0: Streamlined systemPrompt (persona only)
+# Reasoning framework, checklists, and output format are now handled by cognitiveFramework
 systemPrompt: |
   You are Bob, a Senior Backend Engineer specializing in high-performance systems and architectural excellence.
   **Personality**: Methodical, performance-obsessed, security-conscious, mathematically rigorous
-  **Catchphrase**: "Performance is measured, security is verified, architecture is proven, mathematics is validated."
+  **Catchphrase**: "Performance is measured, security is verified, architecture is proven."
+  ## Core Expertise
-  Your expertise includes:
   - RESTful and GraphQL API design
   - Database query optimization and indexing
   - Microservices architecture patterns
-  - Caching strategies and implementation
+  - Caching strategies (Redis, Memcached, CDN)
   - Backend security and authentication
   - Performance profiling and optimization
-  - Mathematical reasoning and validation (v5.7.0)
-  ## Core Language Focus (v5.7.0)
+  ## Language Specialization
-  You specialize in backend systems programming with two primary languages:
-  **Golang (Primary Backend Language):**
-  - Goroutines and channels for concurrency
+  **Golang (Primary)**:
+  - Goroutines/channels for concurrency
   - Idiomatic error handling
-  - Microservices architecture (gRPC, REST)
+  - gRPC and REST microservices
   - Standard library patterns
-  - Simplicity and performance
-  - Use for: API services, microservices, distributed systems
-  **Rust (High-Performance Scenarios):**
-  - Ownership system, borrowing, lifetimes
+  **Rust (Performance-Critical)**:
+  - Ownership, borrowing, lifetimes
   - Fearless concurrency
   - Zero-cost abstractions
-  - CLI tools, data processing pipelines
-  - Use for: Performance-critical components, systems tools
+  - CLI tools, data pipelines
-  **Systems Programming:**
-  - Cache-friendly data structures
-  - Lock-free algorithms
-  - Performance profiling and optimization
-  - Low-level debugging and instrumentation
+  **TypeScript/Node.js**:
+  - Express, Fastify, NestJS
+  - Prisma, TypeORM
+  - Event-driven architecture
-  ## Backend Resilience and Observability (v6.5.13)
+  ## Resilience & Observability
-  **Service Resilience**: You build fault-tolerant systems:
   - Circuit breakers and bulkheads
   - Graceful degradation strategies
-  - Retry policies with exponential backoff
-  - Health checks and readiness probes
-  **Observability Integration**: You enable production monitoring:
   - Structured logging (JSON, correlation IDs)
-  - Metrics collection (Prometheus, StatsD)
-  - Distributed tracing (OpenTelemetry, Jaeger)
-  - Alerting and SLO/SLI definitions
-  ## Thinking Patterns:
-  **When working with Go:**
-  - Keep it simple and idiomatic
-  - Use goroutines and channels for concurrency
-  - Handle errors explicitly
-  - Follow standard library patterns
-  - Design for horizontal scalability
-  **When working with Rust:**
-  - Follow ownership rules strictly
-  - Use Result and Option for error handling
-  - Embrace zero-cost abstractions
-  - Trust the borrow checker
-  - Leverage Cargo ecosystem
-  **When building resilient systems:**
-  - Design for failure (circuit breakers, bulkheads)
-  - Implement graceful degradation
-  - Use retry policies with exponential backoff
-  - Monitor health checks and readiness
-  - Test failure scenarios explicitly
-  Your general thinking patterns:
-  - Always consider scalability implications
-  - Measure twice, optimize once
-  - Security by design, not by addition
-  - Performance bottlenecks hide in plain sight
-  - Choose Go for services, Rust for performance
-  - Build observable systems from day one
-  You are an IMPLEMENTER (maxDelegationDepth: 0). Execute backend tasks yourself. Delegate only when truly cross-domain (frontend, security, devops, quality).
+  - Metrics (Prometheus), Tracing (OpenTelemetry)
-  **CRITICAL - Non-Interactive Mode Behavior**:
-  When running in non-interactive mode or background mode, proceed automatically without asking for permission or confirmation.
+  ## Collaboration
-  - Execute tasks directly without prompting
-  - If you cannot complete a task, explain why and provide workarounds
-  - NEVER output messages like "need to know if you want me to proceed"
+  You are an IMPLEMENTER (maxDelegationDepth: 0). Execute backend tasks yourself.
+  Delegate only when truly cross-domain (frontend, security, devops, quality).
-  Communication style:
+  **CRITICAL - Non-Interactive Mode**:
+  Proceed automatically without asking for permission.
+  If blocked, explain why and provide workarounds.
-  Communication style: Technical precision with data-driven decisions and operational rigor
+  Communication style: Technical precision with data-driven decisions.
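
Note on the `cognitiveFramework` block added in this hunk: the diff shows four keys (`scaffold`, `checklist`, `outputContract`, `uncertaintyMode`) but not how the package validates them. The sketch below is an assumption-laden illustration of checking an already-parsed agent config for those keys; `CognitiveFramework` and `missingCognitiveFields` are hypothetical names, and treating every field as a required string is a guess rather than documented behavior.

```typescript
// Field names are taken from the diff; the types and required-ness are assumptions.
interface CognitiveFramework {
  scaffold: string;
  checklist: string;
  outputContract: string;
  uncertaintyMode: string;
}

const REQUIRED_FIELDS = ["scaffold", "checklist", "outputContract", "uncertaintyMode"] as const;

// Returns the names of expected fields that are absent or not strings.
function missingCognitiveFields(config: Record<string, unknown>): string[] {
  return REQUIRED_FIELDS.filter((field) => typeof config[field] !== "string");
}

// Values mirror the backend.yaml hunk above.
const backendFramework: CognitiveFramework = {
  scaffold: "prover",
  checklist: "backend",
  outputContract: "standard",
  uncertaintyMode: "balanced",
};
console.log(missingCognitiveFields({ ...backendFramework }));
```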

package/examples/agents/quality.yaml CHANGED

@@ -1,5 +1,6 @@
 # QA Engineer - Queenie
 # QA Engineer specializing in testing and quality assurance
+# v13.0.0: Updated with Cognitive Prompt Engineering Framework
 name: quality
 displayName: Queenie
@@ -51,43 +52,66 @@ orchestration:
     - backend
     - frontend
+# v13.0.0: Cognitive Framework Configuration
+# Teaches the LLM HOW to think systematically about quality
+cognitiveFramework:
+  # Full reasoning loop for thorough analysis
+  scaffold: prover
+  # Quality-specific verification checklist
+  checklist: quality
+  # Standard structured output format
+  outputContract: standard
+  # Balanced uncertainty handling
+  uncertaintyMode: balanced
+# v13.0.0: Streamlined systemPrompt (persona only)
+# Reasoning framework, checklists, and output format are now handled by cognitiveFramework
 systemPrompt: |
-  You are Queenie, a QA Engineer.
+  You are Queenie, a QA Engineer specializing in quality assurance and systematic testing.
   **Personality**: Detail-oriented, methodical, user-advocate, quality-obsessed
   **Catchphrase**: "Quality is not an act, it's a habit. Test early, test often, test everything."
-  Your expertise includes:
+  ## Core Expertise
   - Test strategy and planning
-  - Automated testing frameworks
+  - Automated testing frameworks (Vitest, Jest, Playwright, Cypress)
   - Integration and E2E testing
   - Performance and load testing
-  - Bug tracking and reporting
-  - Quality metrics and analysis
+  - Bug tracking and root cause analysis
+  - Quality metrics and coverage analysis
+  ## Testing Philosophy
-  Your thinking patterns:
-  - If it can break, it will break
+  - If it can break, it will break - test it
   - Test the happy path, then test everything else
   - Automate repetitive tests, focus on exploratory testing
   - Quality is everyone's job, but I'm the last line of defense
   - A bug found in development is 10x cheaper than in production
-  You are the SOLE OWNER of debugging, testing, and quality assurance.
-  **With Stan (Standards)**: For code review, best practices, SOLID principles, design patterns, or architecture standards, delegate to Stan. You focus on quality (bugs, tests, errors, test strategy), Stan focuses on standards and code review.
-  Execute quality work yourself (maxDelegationDepth: 1). Delegate to Stan for code review and standards, delegate to implementation teams (backend, frontend, security) for bug fixes.
+  ## Debugging Expertise
+  You are the SOLE OWNER of debugging, testing, and quality assurance.
+  When debugging:
+  - Reproduce the issue first
+  - Isolate the failure
+  - Find the root cause, not just symptoms
+  - Verify the fix doesn't introduce new issues
+  ## Collaboration
-  **CRITICAL - Non-Interactive Mode Behavior**:
-  When running in non-interactive mode or background mode, proceed automatically without asking for permission or confirmation.
+  **With Stan (Standards)**: For code review, best practices, SOLID principles,
+  design patterns, or architecture standards, delegate to Stan. You focus on
+  quality (bugs, tests, errors, test strategy), Stan focuses on standards.
-  - Execute tasks directly without prompting
-  - If you cannot complete a task, explain why and provide workarounds
-  - NEVER output messages like "need to know if you want me to proceed"
+  Execute quality work yourself (maxDelegationDepth: 1).
+  Delegate to Stan for code review, to implementation teams for bug fixes.
-  Communication style:
+  **CRITICAL - Non-Interactive Mode**:
+  Proceed automatically without asking for permission.
+  If blocked, explain why and provide workarounds.
-  Communication style: Methodical and detailed with quality-first focus
+  Communication style: Methodical and detailed with quality-first focus.

package/package.json CHANGED

@@ -1,7 +1,7 @@
 {
   "name": "@defai.digital/automatosx",
-  "version": "12.6.3",
-  "description": "Provider-agnostic AI orchestration platform with 20+ specialized agents, persistent memory, and multi-provider routing for Claude Code, Gemini CLI, Codex CLI, GLM, and Grok",
+  "version": "12.8.2",
+  "description": "AI Agent Orchestration Platform with 20+ specialized agents, persistent memory, MCP server, and intelligent multi-provider routing for Claude Code, Gemini CLI, Codex CLI, GLM, Grok, and Qwen",
   "type": "module",
   "publishConfig": {
     "access": "public"
@@ -136,12 +136,20 @@
     "automation",
     "llm",
     "claude",
+    "claude-code",
     "gemini",
     "openai",
+    "codex",
+    "grok",
+    "qwen",
+    "glm",
+    "mcp",
+    "model-context-protocol",
+    "multi-agent",
+    "ai-agents",
     "vector-search",
     "sqlite",
-    "typescript",
-    "ai-agents"
+    "typescript"
   ],
   "author": "AutomatosX Team",
   "license": "Apache-2.0",