npm - cortex-agents - Versions diffs - 3.4.0 → 4.0.1 - Mend

cortex-agents 3.4.0 → 4.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (49) hide show

package/.opencode/agents/architect.md +82 -89
package/.opencode/agents/audit.md +57 -188
package/.opencode/agents/{crosslayer.md → coder.md} +9 -52
package/.opencode/agents/debug.md +151 -0
package/.opencode/agents/devops.md +142 -0
package/.opencode/agents/docs-writer.md +195 -0
package/.opencode/agents/fix.md +119 -189
package/.opencode/agents/implement.md +115 -74
package/.opencode/agents/perf.md +151 -0
package/.opencode/agents/refactor.md +163 -0
package/.opencode/agents/{guard.md → security.md} +20 -85
package/.opencode/agents/testing.md +115 -0
package/.opencode/skills/data-engineering/SKILL.md +221 -0
package/.opencode/skills/monitoring-observability/SKILL.md +251 -0
package/.opencode/skills/ui-design/SKILL.md +402 -0
package/README.md +303 -287
package/dist/cli.js +6 -9
package/dist/index.d.ts.map +1 -1
package/dist/index.js +26 -28
package/dist/registry.d.ts +4 -4
package/dist/registry.d.ts.map +1 -1
package/dist/registry.js +6 -6
package/dist/tools/branch.d.ts +2 -2
package/dist/tools/docs.d.ts +2 -2
package/dist/tools/github.d.ts +3 -3
package/dist/tools/plan.d.ts +28 -4
package/dist/tools/plan.d.ts.map +1 -1
package/dist/tools/plan.js +232 -4
package/dist/tools/quality-gate.d.ts +28 -0
package/dist/tools/quality-gate.d.ts.map +1 -0
package/dist/tools/quality-gate.js +233 -0
package/dist/tools/repl.d.ts +5 -0
package/dist/tools/repl.d.ts.map +1 -1
package/dist/tools/repl.js +58 -7
package/dist/tools/worktree.d.ts +5 -32
package/dist/tools/worktree.d.ts.map +1 -1
package/dist/tools/worktree.js +75 -458
package/dist/utils/change-scope.d.ts +33 -0
package/dist/utils/change-scope.d.ts.map +1 -0
package/dist/utils/change-scope.js +198 -0
package/dist/utils/plan-extract.d.ts +21 -0
package/dist/utils/plan-extract.d.ts.map +1 -1
package/dist/utils/plan-extract.js +65 -0
package/dist/utils/repl.d.ts +31 -0
package/dist/utils/repl.d.ts.map +1 -1
package/dist/utils/repl.js +126 -13
package/package.json +1 -1
package/.opencode/agents/qa.md +0 -265
package/.opencode/agents/ship.md +0 -249

package/README.md CHANGED Viewed

@@ -2,7 +2,7 @@
   <img src="https://img.shields.io/badge/cortex-agents-111?style=for-the-badge&labelColor=111&color=4d96ff" alt="cortex-agents" height="40">
 </p>
-<h3 align="center">Supercharge OpenCode with structured workflows, intelligent agents, and automated development practices.</h3>
+<h3 align="center">Structured AI development workflows for <a href="https://opencode.ai">OpenCode</a>.<br>Plan. Build. Ship. With discipline.</h3>
 <p align="center">
   <a href="https://www.npmjs.com/package/cortex-agents"><img src="https://img.shields.io/npm/v/cortex-agents.svg?style=flat-square&color=4d96ff" alt="npm version"></a>
@@ -12,238 +12,243 @@
 </p>
 <p align="center">
-  <a href="#-quick-start">Quick Start</a>&nbsp;&nbsp;|&nbsp;&nbsp;
-  <a href="#-what-it-does">What It Does</a>&nbsp;&nbsp;|&nbsp;&nbsp;
-  <a href="#-agents">Agents</a>&nbsp;&nbsp;|&nbsp;&nbsp;
-  <a href="#-tools">Tools</a>&nbsp;&nbsp;|&nbsp;&nbsp;
-  <a href="#-skills">Skills</a>&nbsp;&nbsp;|&nbsp;&nbsp;
+  <a href="#-quick-start">Quick Start</a>&nbsp;&nbsp;&bull;&nbsp;&nbsp;
+  <a href="#-architecture">Architecture</a>&nbsp;&nbsp;&bull;&nbsp;&nbsp;
+  <a href="#-agents">Agents</a>&nbsp;&nbsp;&bull;&nbsp;&nbsp;
+  <a href="#-tools">Tools</a>&nbsp;&nbsp;&bull;&nbsp;&nbsp;
+  <a href="#-skills">Skills</a>&nbsp;&nbsp;&bull;&nbsp;&nbsp;
   <a href="#-contributing">Contributing</a>
 </p>
-<br>
 ---
-<br>
 ## Why Cortex Agents?
 AI coding assistants are powerful, but without structure they produce inconsistent results. **Cortex Agents** adds the missing layer: a complete development workflow that turns OpenCode into a disciplined engineering partner.
-- **Before**: AI writes code wherever, no branching discipline, no documentation, no plan.
-- **After**: AI checks git status, asks about branching strategy, loads implementation plans, creates docs with architecture diagrams, commits cleanly, and opens PRs.
+```
+ Before                                    After
+ ──────                                    ─────
+ AI writes code wherever                   AI checks git status first
+ No branching discipline                   Creates worktrees/branches automatically
+ No documentation                          Generates docs with mermaid diagrams
+ No quality checks                         Runs parallel quality gates (6 sub-agents)
+ No plan, no traceability                  Plans with acceptance criteria, ships PRs
+```
-<br>
+---
 ## Quick Start
 ```bash
 npx cortex-agents install       # Add plugin + agents + skills
 npx cortex-agents configure     # Pick your models interactively
-# Restart OpenCode - done.
+# Restart OpenCode — done.
 ```
-That's it. Your OpenCode session now has 8 specialized agents, 32 tools, and 14 domain skills.
+Your OpenCode session now has **12 specialized agents**, **33 tools**, and **17 domain skills**.
-> **Built-in Agent Replacement** — When installed, cortex-agents automatically disables OpenCode's native `build` and `plan` agents (replaced by `implement` and `architect`). The `architect` agent becomes the default, promoting a planning-first workflow. Native agents are fully restored on `uninstall`.
+> **Built-in Agent Replacement** — Cortex automatically disables OpenCode's native `build` and `plan` agents (replaced by `implement` and `architect`). The `architect` agent becomes the default, promoting a planning-first workflow. Native agents are fully restored on `uninstall`.
-<br>
+---
-## What It Does
+## Architecture
-### Plan, Build, Ship
+### Agent Hierarchy
-Cortex agents follow a structured workflow from planning through to PR:
+```
+User Request
+    |
+    v
+ Architect (read-only planning)
+    |
+    |-- read-only analysis -----> @security  @coder  @perf
+    |
+    v
+ Implement / Fix (execution)
+    |
+    |-- REPL Loop (task-by-task) --> build + test per task
+    |
+    v
+ Quality Gate (two-phase)
+    |
+    |-- Phase 1 (parallel, scope-based):
+    |     @testing  @security  @audit  @docs-writer  @devops  @perf
+    |
+    |-- Phase 2 (cross-agent):
+    |     @testing reacts to @security findings
+    |
+    v
+ quality_gate_summary --> GO / NO-GO / GO-WITH-WARNINGS
+    |
+    v
+ Fix blockers --> task_finalize --> PR
+```
+### The Workflow
 ```
 You: "Add user authentication"
 Architect Agent                         reads codebase, creates plan with mermaid diagrams
-   saves to .cortex/plans/             "Plan saved. Switch to Implement?"
+   saves to .cortex/plans/             commits plan to feature branch
+   "Plan committed. Switch to          offers worktree or branch
+    Implement?"
 Implement Agent                         loads plan, checks git status
-   "You're on main. Create a branch     two-step prompt: strategy -> execution
-    or worktree?"
-   creates feature/user-auth            implements following the plan
-   "Ready to finalize?"                 stages, commits, pushes, opens PR
-```
-### Worktree Launcher
-Create isolated development environments and launch them instantly:
-| Mode | What Happens |
-|------|-------------|
-| **IDE Terminal** | Opens in your detected IDE (VS Code, Cursor, Windsurf, Zed) with integrated terminal |
-| **New Terminal** | Opens a new terminal tab with OpenCode pre-configured in the worktree |
-| **In-App PTY** | Spawns an embedded terminal inside your current OpenCode session |
-| **Background** | AI implements headlessly while you keep working - toast notifications on completion |
-Plans are automatically propagated into the worktree's `.cortex/plans/` so the new session has full context.
-**IDE-Aware Launch Options** — The launcher detects your development environment and offers contextual options:
-- **VS Code / Cursor / Windsurf / Zed**: "Open in [IDE] (Recommended)" as the first option
-- **JetBrains IDEs**: Terminal tab with manual IDE opening instructions
-- **Terminal only**: Standard terminal tab options
-**Cross-platform terminal support** via the terminal driver system — automatically detects and integrates with VS Code, Cursor, Windsurf, Zed, JetBrains IDEs, tmux, iTerm2, Terminal.app, kitty, wezterm, Konsole, and GNOME Terminal. Tabs opened by the launcher are tracked and automatically closed when the worktree is removed.
-### Task Finalizer
-One tool to close the loop:
-```
-task_finalize
-   git add -A
-   git commit -m "feat: add user auth"
-   git push -u origin feature/user-auth
-   gh pr create --base main               auto-detected if in worktree
-       PR body auto-populated from .cortex/plans/
-   "PR created! Clean up worktree?"
+   repl_init → parses tasks + ACs      iterates task-by-task with build+test
+   Quality Gate → 6 agents in parallel  testing + security + audit + docs + devops + perf
+   quality_gate_summary → GO            aggregates findings, recommends go/no-go
+   task_finalize                        stages, commits, pushes, opens PR
 ```
-### Auto-Prompted Documentation
+### Scope-Based Quality Gate
-After every task, agents prompt you to document what you built:
+Not every change needs a full audit. The quality gate scales with risk:
-| Type | What's Generated | Includes |
-|------|-----------------|----------|
-| **Decision** | Architecture Decision Record | Mermaid graph comparing options |
-| **Feature** | Feature documentation | Mermaid component diagram |
-| **Flow** | Process/data flow doc | Mermaid sequence diagram |
+| Scope | Criteria | Sub-Agents Launched |
+|-------|----------|-------------------|
+| **Trivial** | Docs, comments, formatting | `@docs-writer` only (or skip) |
+| **Low** | Tests, config files | `@testing` |
+| **Standard** | Normal code changes | `@testing` + `@security` + `@audit` + `@docs-writer` |
+| **High** | Auth, payments, crypto, infra, DB migrations | All 6: `@testing` + `@security` + `@audit` + `@docs-writer` + `@devops` + `@perf` |
-All docs are saved to `docs/` with an auto-generated `INDEX.md`.
-<br>
+---
 ## Agents
-### Primary Agents
+### Primary Agents (3)
 Handle complex, multi-step work. Use your best model.
-| Agent | Role | Superpower |
-|-------|------|-----------|
-| **implement** | Full-access development | Skill-aware implementation, worktree launcher, quality gates, task finalizer |
-| **architect** | Read-only analysis | Architectural plans with mermaid diagrams, NFR analysis, hands off to implement |
-| **fix** | Deep troubleshooting | Performance debugging, distributed tracing, hotfix workflow |
-| **audit** | Code quality assessment | Tech debt scoring, pattern review, refactoring advisor (read-only) |
+| Agent | Role | Key Capabilities |
+|-------|------|-----------------|
+| **architect** | Read-only analysis & planning | Plans with mermaid diagrams, acceptance criteria, NFR analysis. Commits plans to branches. Delegates read-only analysis to `@security`, `@coder`, `@perf`. |
+| **implement** | Full-access development | Skill-aware implementation, REPL loop with ACs, two-phase quality gate, parallel sub-agent orchestration, task finalizer. |
+| **fix** | Quick turnaround bug fixes | Rapid diagnosis, scope-based quality gate, optional REPL loop. Delegates deep debugging to `@debug`. |
-### Subagents
+### Sub-Agents (9)
-Focused specialists launched **automatically** as parallel quality gates. Each auto-loads its core domain skill for deeper analysis. Use a fast/cheap model.
+Focused specialists launched **automatically** by primary agents. Each auto-loads domain skills for deeper analysis. Use a fast/cheap model.
 | Agent | Role | Auto-Loads Skill | Triggered By |
 |-------|------|-----------------|-------------|
-| **@qa** | Writes tests, runs suite, reports coverage | `testing-strategies` | Implement (always), Fix (always) |
-| **@guard** | OWASP audit, secrets scan, code-level fix patches | `security-hardening` | Implement (always), Fix (if security-relevant) |
-| **@crosslayer** | Cross-layer implementation + feasibility analysis | Per-layer skills | Implement (multi-layer features), Architect (analysis) |
-| **@ship** | CI/CD validation, IaC review, deployment strategy | `deployment-automation` | Implement (when CI/Docker/infra files change) |
-Subagents return **structured reports** with severity levels (`BLOCKING`, `CRITICAL`, `HIGH`, `MEDIUM`, `LOW`) that the orchestrating agent uses to decide whether to proceed or fix issues first.
+| **@testing** | Test writing, suite execution, coverage | `testing-strategies` | Implement (standard+high), Fix (low+standard+high) |
+| **@security** | OWASP audit, secrets scan, threat modeling | `security-hardening` | Implement (standard+high), Fix (standard+high), Architect (read-only) |
+| **@audit** | Code quality, tech debt, pattern review | `code-quality` | Implement (standard+high) |
+| **@docs-writer** | Auto-documentation generation | — | Implement (standard+high) |
+| **@perf** | Complexity analysis, N+1 detection, bundle impact | `performance-optimization` | Implement (high), Fix (high), Architect (read-only) |
+| **@devops** | CI/CD validation, IaC review | `deployment-automation` | Implement (high, or infra files changed) |
+| **@coder** | Cross-layer implementation, feasibility | Per-layer skills | Implement (3+ layers), Architect (feasibility analysis) |
+| **@refactor** | Behavior-preserving restructuring | `design-patterns` + `code-quality` | Implement (refactor plans) |
+| **@debug** | Root cause analysis, troubleshooting | `testing-strategies` | Fix (complex issues) |
+Sub-agents return **structured reports** with severity levels (`BLOCKING`, `CRITICAL`, `HIGH`, `MEDIUM`, `LOW`) that the orchestrating agent uses to decide whether to proceed or fix issues first.
 ### Skill Routing
-All agents detect the project's technology stack and **automatically load relevant skills** before working. This turns the 14 domain skills from passive knowledge into active intelligence:
+All agents detect the project's technology stack and **automatically load relevant skills** before working:
 ```
 Implement Agent detects: package.json has React + Express + Prisma
-  → auto-loads: frontend-development, backend-development, database-design, api-design
-  → implements with deep framework-specific knowledge
+  -> auto-loads: frontend-development, backend-development, database-design, api-design
+  -> implements with deep framework-specific knowledge
 ```
-<br>
+---
 ## Tools
-32 tools bundled and auto-registered. No configuration needed.
+33 tools bundled and auto-registered. No configuration needed.
 <table>
 <tr><td width="50%">
 **Git Workflow**
-- `branch_status` - Current branch + change detection
-- `branch_create` - Convention-named branches (with toast notifications)
-- `branch_switch` - Safe branch switching
-- `worktree_create` - Isolated worktree in `.worktrees/` (with toast notifications)
-- `worktree_launch` - Launch worktree (terminal/PTY/background)
+- `branch_status` — Current branch + change detection
+- `branch_create` — Convention-named branches (with toast)
+- `branch_switch` — Safe branch switching
+- `worktree_create` — Isolated worktree in `.worktrees/`
 - `worktree_list` / `worktree_remove` / `worktree_open`
 </td><td width="50%">
 **Planning & Sessions**
 - `plan_save` / `plan_load` / `plan_list` / `plan_delete`
+- `plan_commit` — Commit plan to feature branch
 - `session_save` / `session_list` / `session_load`
 - `cortex_init` / `cortex_status` / `cortex_configure`
-- `detect_environment` - Detect IDE/terminal for contextual launch options
 </td></tr>
 <tr><td width="50%">
 **Documentation**
-- `docs_init` - Set up `docs/` structure
-- `docs_save` - Save doc with mermaid diagrams
-- `docs_list` - Browse all docs
-- `docs_index` - Rebuild `docs/INDEX.md`
+- `docs_init` — Set up `docs/` structure
+- `docs_save` — Save doc with mermaid diagrams
+- `docs_list` — Browse all docs
+- `docs_index` — Rebuild `docs/INDEX.md`
 </td><td width="50%">
-**Finalization & Config**
-- `task_finalize` - Stage, commit, push, create PR
+**Finalization**
+- `task_finalize` — Stage, commit, push, create PR
   - Auto-detects worktree (targets main)
   - Auto-populates PR from `.cortex/plans/`
-  - Auto-links issues via `Closes #N` from plan metadata
-  - Warns if docs are missing
-- `cortex_configure` - Set models from within an agent session
+  - Auto-links issues via `Closes #N`
+- `quality_gate_summary` — Aggregate sub-agent findings, GO/NO-GO recommendation
 </td></tr>
 <tr><td colspan="2">
 **GitHub Integration**
-- `github_status` - Check `gh` CLI availability, authentication, and detect GitHub Projects
-- `github_issues` - List/filter repo issues by state, labels, milestone, assignee
-- `github_projects` - List GitHub Project boards and their work items
+- `github_status` — Check `gh` CLI availability, auth, and detect projects
+- `github_issues` — List/filter repo issues by state, labels, milestone, assignee
+- `github_projects` — List GitHub Project boards and their work items
-The architect agent uses these tools to browse your backlog and seed plans from real GitHub issues. Issue numbers are stored in plan frontmatter (`issues: [42, 51]`) and automatically appended as `Closes #N` to the PR body when `task_finalize` runs — GitHub auto-closes the issues when the PR merges. Supports both github.com and GitHub Enterprise Server URLs.
+The architect uses these to browse your backlog and seed plans from real issues. Issue numbers stored in plan frontmatter (`issues: [42, 51]`) are automatically appended as `Closes #N` to the PR body.
 </td></tr>
 <tr><td colspan="2">
-**REPL Loop** (Iterative Task-by-Task Implementation)
-- `repl_init` - Initialize a loop from a plan (parses tasks, auto-detects build/test commands)
-- `repl_status` - Get current progress, active task, retry counts (auto-advances to next task)
-- `repl_report` - Report task outcome (`pass`/`fail`/`skip`) with auto-retry and escalation
-- `repl_summary` - Generate markdown summary table for PR body
+**REPL Loop** — Iterative task-by-task implementation
+- `repl_init` — Initialize loop from plan (parses tasks + ACs, auto-detects build/test)
+- `repl_status` — Current progress, active task with ACs, retry counts
+- `repl_report` — Report outcome (`pass`/`fail`/`skip`) with auto-retry and escalation
+- `repl_resume` — Detect and resume interrupted loop from previous session
+- `repl_summary` — Markdown results table with AC satisfaction for PR body
-The implement agent uses these tools to work through plan tasks one at a time, running build+test verification after each task. Failed tasks are automatically retried (up to a configurable limit) before escalating to the user. State is persisted to `.cortex/repl-state.json` so progress survives context compaction and session restarts.
+State persists to `.cortex/repl-state.json` — survives context compaction, session restarts, and agent switches.
 </td></tr>
 </table>
-<br>
+---
 ## Skills
-14 domain-specific skill packs loaded on demand via the `skill` tool:
+17 domain-specific skill packs loaded on demand:
 | Skill | Covers |
 |-------|--------|
-| **frontend-development** | React, Vue, Svelte, CSS architecture, accessibility |
-| **backend-development** | API design, middleware, auth, caching, queue systems |
-| **mobile-development** | React Native, Flutter, native iOS/Android patterns |
-| **desktop-development** | Electron, Tauri, native desktop application patterns |
-| **database-design** | Schema design, migrations, indexing, query optimization |
-| **api-design** | REST, GraphQL, gRPC, versioning, documentation |
-| **testing-strategies** | Unit, integration, E2E, TDD, coverage strategies |
-| **security-hardening** | OWASP, auth/authz, input validation, secure coding |
-| **deployment-automation** | CI/CD, Docker, Kubernetes, cloud deployment |
-| **architecture-patterns** | Microservices, monorepo, event-driven, CQRS |
-| **design-patterns** | GoF patterns, SOLID principles, DDD |
-| **performance-optimization** | Profiling, caching, lazy loading, bundle optimization |
-| **code-quality** | Refactoring, linting, code review, maintainability |
-| **git-workflow** | Branching strategies, worktrees, rebase vs merge |
-<br>
+| `frontend-development` | React, Vue, Svelte, CSS architecture, accessibility |
+| `ui-design` | Visual hierarchy, typography, color systems, spacing, motion, professional polish |
+| `backend-development` | API design, middleware, auth, caching, queue systems |
+| `mobile-development` | React Native, Flutter, native iOS/Android patterns |
+| `desktop-development` | Electron, Tauri, native desktop application patterns |
+| `database-design` | Schema design, migrations, indexing, query optimization |
+| `api-design` | REST, GraphQL, gRPC, versioning, documentation |
+| `testing-strategies` | Unit, integration, E2E, TDD, coverage strategies |
+| `security-hardening` | OWASP, auth/authz, input validation, secure coding |
+| `deployment-automation` | CI/CD, Docker, Kubernetes, cloud deployment |
+| `architecture-patterns` | Microservices, monorepo, event-driven, CQRS |
+| `design-patterns` | GoF patterns, SOLID principles, DDD |
+| `performance-optimization` | Profiling, caching, lazy loading, bundle optimization |
+| `code-quality` | Refactoring, linting, code review, maintainability |
+| `git-workflow` | Branching strategies, worktrees, rebase vs merge |
+| `monitoring-observability` | Structured logging, metrics, distributed tracing, health checks |
+| `data-engineering` | ETL pipelines, data validation, streaming, message queues, partitioning |
+---
 ## Model Configuration
@@ -255,7 +260,7 @@ npx cortex-agents configure --project  # Per-project (saves to .opencode/models.
 ```
 ```
-? Select model for PRIMARY agents:
+? Select model for PRIMARY agents (architect, implement, fix):
   Claude Sonnet 4    (anthropic)     Best balance of intelligence and speed
   Claude Opus 4      (anthropic)     Most capable, best for complex architecture
   GPT-4.1            (openai)        Fast multimodal model
@@ -263,40 +268,97 @@ npx cortex-agents configure --project  # Per-project (saves to .opencode/models.
   Kimi K2P5          (kimi)          Optimized for code generation
   Enter custom model ID
-? Select model for SUBAGENTS:
+? Select model for SUBAGENTS (debug, coder, testing, security, devops, audit, ...):
   Claude 3.5 Haiku   (anthropic)     Fast and cost-effective
   o4 Mini            (openai)        Fast reasoning, cost-effective
   Gemini 2.5 Flash   (google)        Fast and efficient
   Same as primary
 ```
-### In-Agent Configuration
+Agents can also configure models mid-session via `cortex_configure` — no need to leave OpenCode.
+> Don't see your provider in the picker? Select **"Enter custom model ID"** and type any `provider/model` string.
+---
+## How It Works
+### Implement Agent — Step by Step
-Agents can also configure models during a session via the `cortex_configure` tool — no need to leave OpenCode. The agent will prompt you to select models when `.cortex/` is first initialized.
+```
+Step 1   branch_status           Am I on a protected branch?
+Step 2   cortex_status           Is .cortex initialized?
+Step 3   plan_list / plan_load   Is there a plan for this work?
+Step 4   Ask: strategy           Worktree (recommended) or branch?
+Step 5   Execute                 Create worktree/branch
+Step 6   REPL Loop               repl_init -> iterate tasks one-by-one
+  6a     repl_init               Parse tasks + ACs, detect build/test commands
+  6b     repl_status             Get current task with ACs, auto-advance
+  6c     Implement task          Write code to satisfy acceptance criteria
+  6d     Build + test            Run detected build/test commands
+  6e     repl_report             Report pass/fail/skip -> auto-advance or retry
+  6f     Loop                    Repeat 6b-6e until all tasks complete
+Step 7   Quality Gate            Two-phase sub-agent review (scope-based)
+  7a     repl_summary            Generate loop results
+  7b     Assess scope            Classify changed files by risk tier
+  7c     Phase 1                 Launch sub-agents in parallel
+  7d     Phase 2                 Cross-agent reactions (@testing <- @security)
+  7e     quality_gate_summary    Aggregate findings -> GO / NO-GO
+Step 8   Documentation           Review @docs-writer output or prompt user
+Step 9   session_save            Record what was done and why
+Step 10  task_finalize           Commit, push, create PR (with quality gate in body)
+Step 11  Cleanup                 Remove worktree if applicable
+```
-### Per-Project vs Global
+### REPL Loop Example
-| Scope | Where | Use Case |
-|-------|-------|----------|
-| **Global** | `~/.config/opencode/opencode.json` | Default for all projects |
-| **Per-project** | `.opencode/models.json` + `opencode.json` | Different models for different repos |
+```
+repl_init("my-plan.md")
+  -> Parses tasks (- [ ] checkboxes) with ACs (- AC: lines)
+  -> Auto-detects: npm run build, npx vitest run
+  -> Creates .cortex/repl-state.json
-Per-project config takes priority. Team members get the same model settings when they clone the repo (`.opencode/models.json` is git-tracked).
+Loop:
+  repl_status              -> "Task #1: Implement user model"
+                               AC: User model has name, email, password
+                               AC: Email validation rejects malformed addresses
+  [implement task]
+  [run build + tests]
+  repl_report(pass, "42 tests pass")
+                           -> "Task #1 PASSED (1st attempt)"
+                           -> "Next: Task #2"
+  repl_status              -> "Task #2: Add API endpoints"
+  [implement task]
+  [run build + tests]
+  repl_report(fail, "POST /users 500")
+                           -> "Task #2 FAILED (attempt 1/3)"
+                           -> "Fix and retry. 2 retries remaining."
+  [fix issue, re-run tests]
+  repl_report(pass, "All green")
+                           -> "Task #2 PASSED (2nd attempt)"
+  ...
-### Supported Providers
+repl_summary               -> Markdown table for PR body
+```
-| Provider | Premium | Standard | Fast |
-|----------|---------|----------|------|
-| **Anthropic** | Claude Opus 4 | Claude Sonnet 4 | Claude 3.5 Haiku |
-| **OpenAI** | o3 | GPT-4.1 | o4 Mini |
-| **Google** | Gemini 2.5 Pro | - | Gemini 2.5 Flash |
-| **xAI** | Grok 3 | - | Grok 3 Mini |
-| **DeepSeek** | DeepSeek R1 | - | DeepSeek Chat |
-| **Kimi** | - | Kimi K2P5 | - |
+### Quality Gate Example
-> Don't see your provider? Select **"Enter custom model ID"** and type any `provider/model` string.
+```
+quality_gate_summary receives reports from 6 agents:
+  @testing:     PASS — 12 tests written, all passing
+  @security:    PASS WITH WARNINGS — 1 medium finding (XSS in tooltip)
+  @audit:       PASS — score A, no critical issues
+  @docs-writer: 1 feature doc created
+  @devops:      N/A
+  @perf:        PASS — no regressions, all O(n) or better
+  -> Recommendation: GO-WITH-WARNINGS
+  -> Blocker: none
+  -> PR body section auto-generated
+```
-<br>
+---
 ## Project Structure
@@ -304,12 +366,13 @@ Per-project config takes priority. Team members get the same model settings when
 your-project/
   .cortex/                     Project context (auto-initialized)
      config.json              Configuration
-     plans/                   Implementation plans (gitignored)
-     sessions/                Session summaries (gitignored)
-     repl-state.json          REPL loop progress (gitignored, auto-managed)
+     plans/                   Implementation plans
+     sessions/                Session summaries
+     repl-state.json          REPL loop progress (auto-managed)
+     quality-gate.json        Last quality gate results
   .opencode/
      models.json              Per-project model config (git tracked)
-  .worktrees/                  Git worktrees (gitignored)
+  .worktrees/                  Git worktrees
      feature-auth/            Isolated development copy
      bugfix-login/
   docs/                        Documentation (git tracked)
@@ -319,7 +382,7 @@ your-project/
      flows/                   Process/data flow docs
 ```
-<br>
+---
 ## CLI Reference
@@ -333,131 +396,20 @@ npx cortex-agents uninstall                    # Clean removal of everything
 npx cortex-agents status                       # Show installation and model status
 ```
-<br>
-## How It Works
-### The Implement Agent Workflow
-Every time the implement agent starts, it follows a structured pre-implementation checklist:
-```
-Step 1   branch_status           Am I on a protected branch?
-Step 2   cortex_status           Is .cortex initialized? Offer model config if new project.
-Step 3   plan_list / plan_load   Is there a plan for this work?
-Step 4   Ask: strategy           Worktree (recommended) or branch?
-Step 4b  Ask: launch mode        Terminal tab (recommended) / stay / PTY / background?
-Step 5   Execute                 Create worktree/branch, auto-detect terminal
-Step 6   REPL Loop               If plan loaded: repl_init → iterate tasks one-by-one
-  6a     repl_init               Parse plan tasks, auto-detect build/test commands
-  6b     repl_status             Get current task, auto-advance from pending
-  6c     Implement task          Write code for the current task only
-  6d     Build + test            Run detected build/test commands
-  6e     repl_report             Report pass/fail/skip → auto-advance or retry
-  6f     Repeat 6b-6e            Until all tasks done or user intervenes
-Step 7   Quality Gate            Launch @qa + @guard in parallel (includes repl_summary)
-Step 8   Ask: documentation      Decision doc / feature doc / flow doc?
-Step 9   session_save            Record what was done and why
-Step 10  task_finalize           Commit, push, create PR
-Step 11  Ask: cleanup            Remove worktree + close terminal tab? (if applicable)
-```
-This isn't just documentation - it's enforced by the agent's instructions. The AI follows this workflow every time.
-### Sub-Agent Quality Gates
-After implementation (Step 7), the implement agent **automatically** launches sub-agents in parallel as quality gates:
-```
-Implement Agent completes implementation
-   |
-   +-- launches in parallel (single message) --+
-   |                                            |
-   v                                            v
-@qa                                        @guard
-  Writes unit tests                          OWASP audit
-  Runs test suite                            Secrets scan
-  Reports coverage                           Severity ratings
-  Returns: PASS/FAIL                         Returns: PASS/FAIL
-   |                                            |
-   +------ results reviewed by Implement ------+
-   |
-   v
-Quality Gate Summary included in PR body
-```
-The fix agent uses the same pattern: `@qa` for regression tests (always) and `@guard` when the fix touches sensitive code.
-Sub-agents use **structured return contracts** so results are actionable:
-- `BLOCKING` / `CRITICAL` / `HIGH` findings block finalization
-- `MEDIUM` findings are noted in the PR body
-- `LOW` findings are deferred
-### REPL Loop (Iterative Implementation)
-When a plan is loaded, the implement agent activates a **Read-Eval-Print Loop** that works through tasks one at a time with build+test verification after each:
-```
-repl_init("my-plan.md")
-  → Parses plan tasks (- [ ] checkboxes)
-  → Auto-detects: npm run build, npx vitest run (vitest)
-  → Creates .cortex/repl-state.json
-Loop:
-  repl_status                    → "Task #1: Implement user model"
-  [agent implements task]
-  [agent runs build + tests]
-  repl_report(pass, "42 tests pass")  → "✓ Task #1 PASSED (1st attempt)"
-                                       → "→ Next: Task #2"
-  repl_status                    → "Task #2: Add API endpoints"
-  [agent implements task]
-  [agent runs build + tests]
-  repl_report(fail, "POST /users 500") → "⚠ Task #2 FAILED (attempt 1/3)"
-                                        → "Fix and retry. 2 retries remaining."
-  [agent fixes the issue]
-  [agent runs build + tests]
-  repl_report(pass, "All green")  → "✓ Task #2 PASSED (2nd attempt)"
-                                   → "→ Next: Task #3"
-  ...
-repl_summary                     → Markdown table for PR body
-```
-**Key behaviors:**
-- **Opt-in**: Only activates when a plan is loaded. No-plan sessions use the standard linear workflow.
-- **Auto-detection**: Scans `package.json`, `Cargo.toml`, `go.mod`, `pyproject.toml`, `Makefile`, `mix.exs` for build/test/lint commands.
-- **Retry with escalation**: Failed tasks retry up to `maxRetries` (default: 3) before asking the user how to proceed.
-- **Persistent state**: Progress saved to `.cortex/repl-state.json` — survives context compaction, session restarts, and agent switches.
-- **Skip support**: Tasks can be skipped with a reason, which is tracked in the summary.
-### Agent Handover
-When agents switch, a toast notification tells you what mode you're in:
-```
-Agent: implement              Development mode - ready to implement
-Agent: architect             Planning mode - read-only analysis
-Agent: fix                   Debug mode - troubleshooting and fixes
-Agent: audit                 Review mode - code quality assessment
-```
-The Architect agent creates plans with mermaid diagrams and hands off to Implement. Implement loads the plan, detects the tech stack, loads relevant skills, and implements. If something breaks, Fix takes over with performance debugging tools. Audit provides code quality assessment and tech debt analysis on demand.
-<br>
+---
 ## Requirements
 - [OpenCode](https://opencode.ai) >= 1.0.0
 - Node.js >= 18.0.0
 - Git (for branch/worktree features)
-- [GitHub CLI](https://cli.github.com/) (optional, for `task_finalize` PR creation and `github_*` tools)
+- [GitHub CLI](https://cli.github.com/) (optional — for PR creation and issue integration)
-<br>
+---
 ## Contributing
-Contributions are welcome! This is an Apache-2.0 licensed project and we'd love your help.
+We welcome contributions of all sizes. Whether it's a typo fix, a new skill pack, or a whole new agent — we appreciate it.
 ### Getting Started
@@ -466,45 +418,109 @@ git clone https://github.com/ps-carvalho/cortex-agents.git
 cd cortex-agents
 npm install
 npm run build
+npm test                    # 447 tests, all should pass
 ```
-### Development Workflow
+### Local Development
 ```bash
-# Link for local testing
+# Link for local testing with OpenCode
 npm link
 cd ~/.config/opencode && npm link cortex-agents
-# Make changes, rebuild, restart OpenCode
+# Edit, rebuild, restart OpenCode to test
 npm run build
 # Unlink when done
 cd ~/.config/opencode && npm unlink cortex-agents && npm install
 ```
+### Project Layout
+```
+src/
+  index.ts                   Plugin entry point, tool registration, event hooks
+  registry.ts                Agent/model registry constants
+  cli.ts                     CLI (install, configure, uninstall, status)
+  tools/
+    repl.ts                  REPL loop tools (init, status, report, resume, summary)
+    quality-gate.ts          Quality gate aggregation tool
+    cortex.ts                Project initialization tools
+    worktree.ts              Git worktree tools
+    branch.ts                Git branch tools
+    plan.ts                  Plan persistence tools
+    session.ts               Session summary tools
+    docs.ts                  Documentation tools
+    task.ts                  Task finalization tool
+    github.ts                GitHub integration tools
+  utils/
+    repl.ts                  REPL state management, command detection, formatting
+    change-scope.ts          Risk-based file classification
+    plan-extract.ts          Plan parsing utilities
+    shell.ts                 Shell command helpers
+    github.ts                GitHub API helpers
+    worktree-detect.ts       Worktree detection
+  __tests__/                 Test files mirror src/ structure
+.opencode/
+  agents/                    12 agent definition files (.md frontmatter)
+  skills/                    17 skill pack directories (SKILL.md each)
+```
 ### What We're Looking For
-- **New skills** - Domain-specific knowledge packs (e.g., Rust, Go, DevOps for AWS)
-- **New agents** - Specialized agents (e.g., reviewer, migration, docs-writer)
-- **Terminal drivers** - Improve detection/support for additional terminal emulators
-- **Tool improvements** - Better PR templates, test runners, linter integration
-- **Bug fixes** - Anything that doesn't work as expected
+| Type | Examples | Difficulty |
+|------|----------|-----------|
+| **New skills** | Rust, Go, AWS, Terraform, GraphQL | Easy — add a `SKILL.md` file |
+| **New agents** | Reviewer, migration specialist, API designer | Medium — agent `.md` + registry update |
+| **Tool improvements** | Better PR templates, test runners, linter integration | Medium — TypeScript + tests |
+| **Quality gate enhancements** | New parsers for agent report formats, smarter severity mapping | Medium |
+| **Bug fixes** | Anything that doesn't work as expected | Varies |
+| **Documentation** | Guides, examples, tutorials | Easy |
+### Adding a New Skill
+1. Create `.opencode/skills/your-skill/SKILL.md` with frontmatter (`name`, `description`, `license`, `compatibility`)
+2. Write the skill content — patterns, checklists, examples
+3. Update the skill count in tests if applicable
+4. Submit a PR
+### Adding a New Agent
+1. Create `.opencode/agents/your-agent.md` with frontmatter (`description`, `mode`, `temperature`, `tools`, `permission`)
+2. Add the agent name to `SUBAGENTS` or `PRIMARY_AGENTS` in `src/registry.ts`
+3. Add an agent description in `AGENT_DESCRIPTIONS` in `src/index.ts`
+4. Update test expectations in `src/__tests__/registry.test.ts`
+5. Submit a PR
 ### Submitting Changes
 1. Fork the repository
 2. Create your branch (`git checkout -b feature/amazing-feature`)
-3. Commit with conventional format (`git commit -m 'feat: add amazing feature'`)
-4. Push and open a Pull Request
+3. Write tests for new functionality
+4. Ensure all tests pass (`npm test`) and the build is clean (`npm run build`)
+5. Commit with conventional format (`feat:`, `fix:`, `docs:`, `chore:`)
+6. Push and open a Pull Request
-<br>
+### Commit Convention
-## License
+We use [Conventional Commits](https://www.conventionalcommits.org/):
-[Apache-2.0](LICENSE)
+```
+feat: add new capability
+fix: correct a bug
+docs: update documentation
+chore: maintenance, dependencies
+refactor: code restructuring without behavior change
+test: add or update tests
+```
+---
+## License
-<br>
+[Apache-2.0](LICENSE) — use it, modify it, ship it.
 <p align="center">
+  <br>
   <sub>Built for the <a href="https://opencode.ai">OpenCode</a> community</sub>
 </p>