npm - tribunal-kit - Versions diffs - 1.0.0 → 2.4.0 - Mend

tribunal-kit 1.0.0 → 2.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (125) hide show

package/.agent/.shared/ui-ux-pro-max/README.md +3 -3
package/.agent/ARCHITECTURE.md +205 -10
package/.agent/GEMINI.md +37 -7
package/.agent/agents/accessibility-reviewer.md +134 -0
package/.agent/agents/ai-code-reviewer.md +129 -0
package/.agent/agents/frontend-specialist.md +3 -0
package/.agent/agents/game-developer.md +21 -21
package/.agent/agents/logic-reviewer.md +12 -0
package/.agent/agents/mobile-reviewer.md +79 -0
package/.agent/agents/orchestrator.md +56 -26
package/.agent/agents/performance-reviewer.md +36 -0
package/.agent/agents/supervisor-agent.md +156 -0
package/.agent/agents/swarm-worker-contracts.md +166 -0
package/.agent/agents/swarm-worker-registry.md +92 -0
package/.agent/rules/GEMINI.md +134 -5
package/.agent/scripts/bundle_analyzer.py +259 -0
package/.agent/scripts/dependency_analyzer.py +247 -0
package/.agent/scripts/lint_runner.py +188 -0
package/.agent/scripts/patch_skills_meta.py +177 -0
package/.agent/scripts/patch_skills_output.py +285 -0
package/.agent/scripts/schema_validator.py +279 -0
package/.agent/scripts/security_scan.py +224 -0
package/.agent/scripts/session_manager.py +144 -3
package/.agent/scripts/skill_integrator.py +234 -0
package/.agent/scripts/strengthen_skills.py +220 -0
package/.agent/scripts/swarm_dispatcher.py +317 -0
package/.agent/scripts/test_runner.py +192 -0
package/.agent/scripts/test_swarm_dispatcher.py +163 -0
package/.agent/skills/agent-organizer/SKILL.md +132 -0
package/.agent/skills/agentic-patterns/SKILL.md +335 -0
package/.agent/skills/api-patterns/SKILL.md +226 -50
package/.agent/skills/app-builder/SKILL.md +215 -52
package/.agent/skills/architecture/SKILL.md +176 -31
package/.agent/skills/bash-linux/SKILL.md +150 -134
package/.agent/skills/behavioral-modes/SKILL.md +152 -160
package/.agent/skills/brainstorming/SKILL.md +148 -101
package/.agent/skills/brainstorming/dynamic-questioning.md +10 -0
package/.agent/skills/clean-code/SKILL.md +139 -134
package/.agent/skills/code-review-checklist/SKILL.md +177 -80
package/.agent/skills/config-validator/SKILL.md +165 -0
package/.agent/skills/csharp-developer/SKILL.md +107 -0
package/.agent/skills/database-design/SKILL.md +252 -29
package/.agent/skills/deployment-procedures/SKILL.md +122 -175
package/.agent/skills/devops-engineer/SKILL.md +134 -0
package/.agent/skills/devops-incident-responder/SKILL.md +98 -0
package/.agent/skills/documentation-templates/SKILL.md +175 -121
package/.agent/skills/dotnet-core-expert/SKILL.md +103 -0
package/.agent/skills/edge-computing/SKILL.md +213 -0
package/.agent/skills/frontend-design/SKILL.md +76 -0
package/.agent/skills/frontend-design/color-system.md +18 -0
package/.agent/skills/frontend-design/typography-system.md +18 -0
package/.agent/skills/game-development/SKILL.md +69 -0
package/.agent/skills/geo-fundamentals/SKILL.md +158 -99
package/.agent/skills/i18n-localization/SKILL.md +158 -96
package/.agent/skills/intelligent-routing/SKILL.md +89 -285
package/.agent/skills/intelligent-routing/router-manifest.md +65 -0
package/.agent/skills/lint-and-validate/SKILL.md +229 -27
package/.agent/skills/llm-engineering/SKILL.md +258 -0
package/.agent/skills/local-first/SKILL.md +203 -0
package/.agent/skills/mcp-builder/SKILL.md +159 -111
package/.agent/skills/mobile-design/SKILL.md +102 -282
package/.agent/skills/nextjs-react-expert/SKILL.md +143 -227
package/.agent/skills/nodejs-best-practices/SKILL.md +201 -254
package/.agent/skills/observability/SKILL.md +285 -0
package/.agent/skills/parallel-agents/SKILL.md +124 -118
package/.agent/skills/performance-profiling/SKILL.md +143 -89
package/.agent/skills/plan-writing/SKILL.md +133 -97
package/.agent/skills/platform-engineer/SKILL.md +135 -0
package/.agent/skills/powershell-windows/SKILL.md +167 -104
package/.agent/skills/python-patterns/SKILL.md +149 -361
package/.agent/skills/python-pro/SKILL.md +114 -0
package/.agent/skills/react-specialist/SKILL.md +107 -0
package/.agent/skills/realtime-patterns/SKILL.md +296 -0
package/.agent/skills/red-team-tactics/SKILL.md +136 -134
package/.agent/skills/rust-pro/SKILL.md +237 -173
package/.agent/skills/seo-fundamentals/SKILL.md +134 -82
package/.agent/skills/server-management/SKILL.md +155 -104
package/.agent/skills/sql-pro/SKILL.md +104 -0
package/.agent/skills/systematic-debugging/SKILL.md +156 -79
package/.agent/skills/tailwind-patterns/SKILL.md +163 -205
package/.agent/skills/tdd-workflow/SKILL.md +148 -88
package/.agent/skills/test-result-analyzer/SKILL.md +299 -0
package/.agent/skills/testing-patterns/SKILL.md +141 -114
package/.agent/skills/trend-researcher/SKILL.md +228 -0
package/.agent/skills/ui-ux-pro-max/SKILL.md +107 -0
package/.agent/skills/ui-ux-researcher/SKILL.md +234 -0
package/.agent/skills/vue-expert/SKILL.md +118 -0
package/.agent/skills/vulnerability-scanner/SKILL.md +228 -188
package/.agent/skills/web-design-guidelines/SKILL.md +148 -33
package/.agent/skills/webapp-testing/SKILL.md +171 -122
package/.agent/skills/whimsy-injector/SKILL.md +349 -0
package/.agent/skills/workflow-optimizer/SKILL.md +219 -0
package/.agent/workflows/api-tester.md +279 -0
package/.agent/workflows/audit.md +168 -0
package/.agent/workflows/brainstorm.md +65 -19
package/.agent/workflows/changelog.md +144 -0
package/.agent/workflows/create.md +67 -14
package/.agent/workflows/debug.md +122 -30
package/.agent/workflows/deploy.md +82 -31
package/.agent/workflows/enhance.md +59 -27
package/.agent/workflows/fix.md +143 -0
package/.agent/workflows/generate.md +84 -20
package/.agent/workflows/migrate.md +163 -0
package/.agent/workflows/orchestrate.md +66 -17
package/.agent/workflows/performance-benchmarker.md +305 -0
package/.agent/workflows/plan.md +76 -33
package/.agent/workflows/preview.md +73 -17
package/.agent/workflows/refactor.md +153 -0
package/.agent/workflows/review-ai.md +140 -0
package/.agent/workflows/review.md +83 -16
package/.agent/workflows/session.md +154 -0
package/.agent/workflows/status.md +74 -18
package/.agent/workflows/strengthen-skills.md +99 -0
package/.agent/workflows/swarm.md +194 -0
package/.agent/workflows/test.md +80 -31
package/.agent/workflows/tribunal-backend.md +55 -13
package/.agent/workflows/tribunal-database.md +62 -18
package/.agent/workflows/tribunal-frontend.md +58 -12
package/.agent/workflows/tribunal-full.md +70 -11
package/.agent/workflows/tribunal-mobile.md +123 -0
package/.agent/workflows/tribunal-performance.md +152 -0
package/.agent/workflows/ui-ux-pro-max.md +100 -82
package/README.md +117 -62
package/bin/tribunal-kit.js +329 -75
package/package.json +10 -6

package/.agent/.shared/ui-ux-pro-max/README.md CHANGED Viewed

@@ -1,4 +1,4 @@
-# .shared/ui-ux-pro-max
-Shared assets for the /ui-ux-pro-max workflow.
+# .shared/ui-ux-pro-max
+Shared assets for the /ui-ux-pro-max workflow.
 Place reusable design tokens, color palettes, and reference snippets here.

package/.agent/ARCHITECTURE.md CHANGED Viewed

@@ -4,22 +4,74 @@
 ---
+## System Flow
+```mermaid
+flowchart TD
+    A["User Prompt"] --> B{"Classify Request"}
+    B -->|Question| C["Text Answer — No Agents"]
+    B -->|Survey| D["Read + Report — No Code"]
+    B -->|Simple Edit| E["Direct Edit"]
+    B -->|Complex Build| F["Socratic Gate"]
+    B -->|Slash Command| G["Route to Workflow"]
+    F --> H{"Questions Answered?"}
+    H -->|No| I["Ask Clarifying Questions"]
+    I --> H
+    H -->|Yes| J["Auto-Route to Agent"]
+    J --> K["Agent Generates Code"]
+    K --> L{"Tribunal Review"}
+    L -->|Approved| M{"Human Gate"}
+    L -->|Rejected| N["Feedback to Agent"]
+    N --> O{"Retry Count < 3?"}
+    O -->|Yes| K
+    O -->|No| P["HALT — Escalate to Human"]
+    M -->|Approved| Q["Write to Disk"]
+    M -->|Rejected| R["Revise or Abandon"]
+```
+---
 ## Slash Commands (Workflows)
 Type any of these in your AI IDE chat:
-| Command | Purpose |
-|---|---|
-| `/generate` | Full Tribunal: Maker → Parallel Review → Human Gate |
-| `/review` | Audit existing code (no generation) |
-| `/tribunal-full` | ALL 8 agents at once — maximum coverage |
-| `/tribunal-backend` | Logic + Security + Deps + Types |
-| `/tribunal-frontend` | Logic + Security + Frontend + Types |
-| `/tribunal-database` | Logic + Security + SQL |
+| Command | Purpose | File |
+|---|---|---|
+| `/generate` | Full Tribunal: Maker → Parallel Review → Human Gate | `workflows/generate.md` |
+| `/review` | Audit existing code (no generation) | `workflows/review.md` |
+| `/tribunal-full` | ALL 8 agents at once — maximum coverage | `workflows/tribunal-full.md` |
+| `/tribunal-backend` | Logic + Security + Deps + Types | `workflows/tribunal-backend.md` |
+| `/tribunal-frontend` | Logic + Security + Frontend + Types | `workflows/tribunal-frontend.md` |
+| `/tribunal-database` | Logic + Security + SQL | `workflows/tribunal-database.md` |
+| `/tribunal-mobile` | Logic + Security + Mobile | `workflows/tribunal-mobile.md` |
+| `/tribunal-performance` | Logic + Performance | `workflows/tribunal-performance.md` |
+| `/brainstorm` | Exploration mode — no code, just options | `workflows/brainstorm.md` |
+| `/create` | Structured app creation (4-stage) | `workflows/create.md` |
+| `/enhance` | Add/update features in existing apps | `workflows/enhance.md` |
+| `/debug` | Systematic debugging with root cause analysis | `workflows/debug.md` |
+| `/plan` | Project planning only — no code | `workflows/plan.md` |
+| `/deploy` | Pre-flight checks + deployment | `workflows/deploy.md` |
+| `/test` | Test generation and execution | `workflows/test.md` |
+| `/preview` | Start/stop local dev server | `workflows/preview.md` |
+| `/status` | Agent and project status board | `workflows/status.md` |
+| `/session` | Multi-session state tracking | `workflows/session.md` |
+| `/orchestrate` | Multi-agent coordination | `workflows/orchestrate.md` |
+| `/swarm` | Supervisor → specialist Workers → unified synthesis | `workflows/swarm.md` |
+| `/strengthen-skills` | Audit and harden all skills — appends Tribunal guardrails to any SKILL.md missing them | `workflows/strengthen-skills.md` |
+| `/ui-ux-pro-max` | Plan and implement cutting-edge UI/UX | `workflows/ui-ux-pro-max.md` |
+| `/refactor` | Dependency-safe code refactoring | `workflows/refactor.md` |
+| `/migrate` | Framework upgrades, DB migrations | `workflows/migrate.md` |
+| `/audit` | Full project health audit | `workflows/audit.md` |
+| `/fix` | Auto-fix lint, formatting, imports | `workflows/fix.md` |
+| `/changelog` | Generate changelog from git history | `workflows/changelog.md` |
+| `/review-ai` | AI/LLM integration audit | `workflows/review-ai.md` |
 ---
-## The 8 Tribunal Agents
+## The 11 Tribunal Agents
 | Agent | File | Activates When |
 |---|---|---|
@@ -31,6 +83,88 @@ Type any of these in your AI IDE chat:
 | `sql-reviewer` | `agents/sql-reviewer.md` | "query", "database", `/tribunal-full` |
 | `frontend-reviewer` | `agents/frontend-reviewer.md` | "react", "hook", "component", `/tribunal-full` |
 | `test-coverage-reviewer` | `agents/test-coverage-reviewer.md` | "test", "spec", "coverage", `/tribunal-full` |
+| `mobile-reviewer` | `agents/mobile-reviewer.md` | "mobile", "react native", "flutter", `/tribunal-full` |
+| `ai-code-reviewer` | `agents/ai-code-reviewer.md` | "llm", "openai", "anthropic", "ai", `/tribunal-full`, `/review-ai` |
+| `accessibility-reviewer` | `agents/accessibility-reviewer.md` | "a11y", "wcag", "aria", `/tribunal-frontend`, `/tribunal-full` |
+---
+## Swarm / Supervisor Architecture
+The Swarm system decomposes complex multi-domain goals into independent sub-tasks dispatched to specialist Workers.
+```
+/swarm [complex multi-domain goal]
+        │
+        ▽
+  supervisor-agent (triage)
+  └─ reads: swarm-worker-registry.md
+  └─ emits: WorkerRequest JSON per sub-task
+        │
+        ├───── WorkerRequest ───→ Worker A (e.g. backend-specialist)
+        ├───── WorkerRequest ───→ Worker B (e.g. database-architect)
+        └───── WorkerRequest ───→ Worker C (e.g. documentation-writer)
+                                    │
+                          WorkerResult (success/failure/escalate)
+                                    │
+                         supervisor-agent (synthesize)
+                                    │
+                         ━━━ Swarm Complete ━━━
+                         Human Gate → Y / N / R
+```
+**Key files:**
+| File | Role |
+|---|---|
+| `agents/supervisor-agent.md` | Triage, dispatch, retry, synthesis logic |
+| `agents/swarm-worker-contracts.md` | WorkerRequest + WorkerResult JSON schemas |
+| `agents/swarm-worker-registry.md` | Maps task types and keywords to specialist agents |
+| `workflows/swarm.md` | `/swarm` slash command procedure |
+| `scripts/swarm_dispatcher.py` | Validates WorkerRequest/WorkerResult JSON (use `--mode swarm`) |
+**Constraints:**
+- Maximum 5 Workers per swarm invocation
+- Workers are independent — no Worker depends on another's pending result
+- Failed workers are retried up to 3 times with targeted feedback
+- Workers that fail after 3 retries are escalated, not silently dropped
+- Human Gate is never skipped
+---
+## Specialist Agents
+| Agent / Expert | Domain |
+|---|---|
+| `supervisor-agent` | Swarm triage, Worker dispatch, result synthesis |
+| `orchestrator` | Multi-agent coordination |
+| `agent-organizer` | Specialist agent operations |
+| `project-planner` | 4-phase structured planning |
+| `backend-specialist` | API, server, auth |
+| `dotnet-core-expert` | C# / .NET architecture |
+| `python-pro` | Python backend development |
+| `frontend-specialist` | Web UI / Components |
+| `react-specialist` | React / Next.js architecture |
+| `vue-expert` | Vue / Nuxt applications |
+| `database-architect` | Schema, migrations |
+| `sql-pro` | Complex queries, optimization |
+| `mobile-developer` | React Native, Flutter |
+| `devops-engineer` | CI/CD, Docker, deployment |
+| `platform-engineer` | Infrastructure, cloud native |
+| `devops-incident-responder` | Production issues |
+| `debugger` | Systematic debugging |
+| `game-developer` | Game development |
+| `security-auditor` | Penetration testing, OWASP |
+| `penetration-tester` | Red team tactics |
+| `performance-optimizer` | Profiling, optimization |
+| `code-archaeologist` | Legacy code analysis |
+| `explorer-agent` | Unknown codebase mapping |
+| `documentation-writer` | Docs, READMEs, API docs |
+| `test-engineer` | Test design and strategy |
+| `qa-automation-engineer` | Test automation |
+| `seo-specialist` | SEO auditing |
+| `product-manager` | Feature prioritization |
+| `product-owner` | Requirements, scope |
 ---
@@ -43,7 +177,7 @@ User prompt
 GEMINI.md → Classify request → Select active reviewers
     │
     ▼
-MAKER generates code (temp 0.1, context-bound, no hallucinations)
+MAKER generates code (context-bound, no hallucinations)
     │
     ▼
 ALL SELECTED REVIEWERS run in parallel
@@ -60,6 +194,7 @@ ALL SELECTED REVIEWERS run in parallel
     ▼
 VERDICT: All approved → HUMAN GATE (you approve or reject the diff)
          Any failed   → Feedback returned to Maker for revision (max 3 attempts)
+         3 failures   → HALT and escalate to human
 ```
 ---
@@ -73,3 +208,63 @@ VERDICT: All approved → HUMAN GATE (you approve or reject the diff)
 | component, hook, react, next | + Frontend + TypeSafety |
 | test, spec, coverage, jest | + TestCoverage |
 | optimize, slow, memory, cpu | + Performance |
+---
+## Script Inventory
+All scripts live in `.agent/scripts/`:
+| Script | Purpose | Usage |
+|---|---|---|
+| `checklist.py` | Priority-ordered project audit | `python .agent/scripts/checklist.py .` |
+| `verify_all.py` | Full pre-deploy validation | `python .agent/scripts/verify_all.py` |
+| `auto_preview.py` | Local dev server management | `python .agent/scripts/auto_preview.py start` |
+| `session_manager.py` | Multi-session state tracking | `python .agent/scripts/session_manager.py status` |
+| `lint_runner.py` | Standalone lint runner | `python .agent/scripts/lint_runner.py . --fix` |
+| `test_runner.py` | Auto-detecting test runner | `python .agent/scripts/test_runner.py . --coverage` |
+| `security_scan.py` | OWASP-aware source code scanner | `python .agent/scripts/security_scan.py .` |
+| `dependency_analyzer.py` | Unused/phantom dep checker | `python .agent/scripts/dependency_analyzer.py . --audit` |
+| `schema_validator.py` | DB schema validator | `python .agent/scripts/schema_validator.py .` |
+| `bundle_analyzer.py` | JS/TS bundle size analyzer | `python .agent/scripts/bundle_analyzer.py . --build` |
+| `strengthen_skills.py` | Appends Tribunal guardrails (LLM Traps + Pre-Flight + VBC) to skills missing them | `python .agent/scripts/strengthen_skills.py . --dry-run` |
+| `swarm_dispatcher.py` | Validate Orchestrator micro-worker JSON payloads | `python .agent/scripts/swarm_dispatcher.py --file payload.json` |
+| `skill_integrator.py` | Map active skills to executable scripts | `python .agent/scripts/skill_integrator.py` |
+| `test_swarm_dispatcher.py` | Unit tests for swarm_dispatcher | `python .agent/scripts/test_swarm_dispatcher.py` |
+---
+## Error Recovery
+```
+Attempt 1  → Run with original parameters
+Attempt 2  → Run with feedback from failure
+Attempt 3  → Run with maximum constraints
+Attempt 4  → HALT — escalate to human with full failure history
+```
+Script failures follow cascade rules:
+- Security failure → **HALT** all steps
+- Lint failure → continue, flag as deploy-blocker
+- Test failure → continue analysis, mark incomplete
+- Non-critical failure → log and continue
+---
+## Directory Structure
+```
+.agent/
+├── ARCHITECTURE.md          ← This file
+├── GEMINI.md                ← Root behavior config (includes /swarm routing)
+├── agents/                  ← 33 specialist + reviewer agents
+│   ├── supervisor-agent.md  ← Swarm triage, dispatch, synthesis
+│   ├── swarm-worker-contracts.md  ← WorkerRequest/WorkerResult schemas
+│   └── swarm-worker-registry.md   ← Task type → agent routing map
+├── rules/GEMINI.md          ← Master rules (P0 priority)
+├── scripts/                 ← 13 Python automation scripts
+│   └── swarm_dispatcher.py  ← Validates WorkerRequest/WorkerResult JSON
+├── skills/                  ← 44 modular skill packages
+└── workflows/               ← 25 slash command definitions
+    └── swarm.md             ← /swarm orchestration procedure
+```

package/.agent/GEMINI.md CHANGED Viewed

@@ -11,15 +11,17 @@ trigger: always_on
 ## CRITICAL: AGENT & SKILL PROTOCOL
-Before responding to ANY coding request, you MUST:
-1. **Classify the request** using the table below.
-2. **Select the correct reviewer agents** based on the domain.
-3. **Announce** which agents are active.
-4. **Apply** the Tribunal workflow to the output.
+Before responding to ANY complex or ambiguous coding request, you MUST:
+1. **Invoke the Pre-Router:** Read `.agent/skills/intelligent-routing/SKILL.md` to accurately determine the domain and required skills. Do NOT rely on guessing.
+2. **Select the correct reviewer agents** based on the Pre-Router's output.
+3. **Announce** which skills and agents are active.
+4. **Apply** the Tribunal workflow to your code generation.
 ---
-## REQUEST CLASSIFICATION
+## BASIC REQUEST CLASSIFICATION (Fallback)
+If the request is extremely simple, you may use this fallback table. Otherwise, rely on the `intelligent-routing` Pre-Router.
 | Request Type | Trigger Words | Tribunal Agents Activated |
 |---|---|---|
@@ -29,8 +31,16 @@ Before responding to ANY coding request, you MUST:
 | **React / Frontend** | "component", "hook", "react", "next", "ui" | Logic + Security + Frontend + Types |
 | **Performance** | "optimize", "speed", "bottleneck", "slow" | Logic + Performance |
 | **Tests** | "test", "spec", "coverage", "vitest", "jest" | Logic + TestCoverage |
-| **All Domains** | "/tribunal-full" or "audit everything" | ALL 8 agents |
+| **AI / LLM** | "openai", "anthropic", "llm", "embedding", "prompt" | Logic + Security + AI-Code-Reviewer |
+| **Accessibility** | "a11y", "wcag", "aria", "accessibility" | Logic + Accessibility-Reviewer |
+| **Mobile** | "mobile", "react native", "flutter" | Logic + Security + Mobile-Reviewer |
+| **Design / UX** | "design", "trend", "palette", "inspiration", "ux audit" | `trend-researcher` + `ui-ux-researcher` |
+| **API Testing** | "test api", "endpoint test", "api flow" | `api-tester` workflow |
+| **Performance** | "benchmark", "lighthouse", "bundle size", "latency" | `performance-benchmarker` workflow |
+| **Test Analysis** | "test failed", "analyze tests", "what broke" | `test-result-analyzer` |
+| **All Domains** | "/tribunal-full" or "audit everything" | ALL 11 agents |
 | **Review Only** | "/review", "check this", "audit" | All relevant agents, no Maker |
+| **Swarm / Multi-Domain** | "/swarm", "multiple agents", "parallel tasks" | `supervisor-agent` → dispatches to specialist Workers |
 ---
@@ -57,6 +67,9 @@ Every code response MUST:
 | Command | Description |
 |---|---|
 | `/generate` | Run the full Tribunal (Maker → Parallel Review → Human Gate) |
+| `/create` | Structured 4-stage app creation |
+| `/enhance` | Add or update features in existing apps |
+| `/plan` | Project planning only — no code written |
 | `/review` | Review an existing file or snippet for hallucinations |
 | `/review-sql` | SQL-specific deep audit |
 | `/review-react` | React/Frontend-specific deep audit |
@@ -66,8 +79,25 @@ Every code response MUST:
 | `/tribunal-backend` | Logic + Security + Dependency + Types |
 | `/tribunal-frontend` | Logic + Security + Frontend + Types |
 | `/tribunal-database` | Logic + Security + SQL |
+| `/tribunal-mobile` | Logic + Security + Mobile — for React Native, Flutter, responsive web |
+| `/tribunal-performance` | Logic + Performance — for optimization, profiling, bottlenecks |
 | `/brainstorm` | Explore implementation options before coding |
 | `/debug` | Systematic debugging with root cause analysis |
+| `/refactor` | Dependency-safe code refactoring with behavior preservation |
+| `/migrate` | Framework upgrades, dependency bumps, DB migrations |
+| `/deploy` | Pre-flight checks and deployment execution |
+| `/test` | Test generation and test running |
+| `/preview` | Start / stop local dev server |
+| `/status` | Agent and project status board |
+| `/session` | Multi-session state tracking |
+| `/orchestrate` | Coordinate multiple agents for complex tasks |
+| `/swarm` | Supervisor decomposes goal → dispatches to specialist Workers → synthesizes unified output |
+| `/ui-ux-pro-max` | Plan and implement cutting-edge UI/UX |
+| `/audit` | Full project health audit (security → lint → tests → deps → bundle) |
+| `/fix` | Auto-fix lint, formatting, and import issues (with human gate) |
+| `/changelog` | Generate changelog from git history |
+| `/api-tester` | Multi-stage API endpoint testing with auth-aware request sequences |
+| `/performance-benchmarker` | Lighthouse, bundle analysis, and API latency benchmarks |
 ---

package/.agent/agents/accessibility-reviewer.md ADDED Viewed

@@ -0,0 +1,134 @@
+---
+name: accessibility-reviewer
+description: Audits frontend code for WCAG 2.2 AA accessibility violations. Catches missing ARIA labels, keyboard-unreachable targets, insufficient colour contrast, unlabelled form inputs, and missing focus management in modals. Activates on /tribunal-frontend, /tribunal-full, /review-ai, and prompts containing accessibility, a11y, wcag, aria.
+---
+# Accessibility Reviewer — The Inclusion Auditor
+## Core Philosophy
+> "Inaccessible code is broken code. A button that can't be reached by keyboard is just a decoration."
+## Your Mindset
+- **Keyboard-first**: If you can't tab to it and activate it with Enter/Space, it's broken.
+- **Screen reader reality**: What a sighted user sees and what a screen reader announces are often different worlds.
+- **Contrast is not optional**: WCAG AA (4.5:1 for normal text, 3:1 for large) is the legal minimum in most jurisdictions.
+- **Semantics over workarounds**: An `<article>` is better than `<div role="article">`. Use the right element first.
+---
+## What You Check
+### 1. Images Without Alt Text
+```
+❌ <img src="/logo.png" />
+❌ <img src="/avatar.jpg" alt="" />  // Empty alt only valid for decorative images
+✅ <img src="/logo.png" alt="Company logo" />
+✅ <img src="/decoration.svg" alt="" role="presentation" />  // Decorative — correct
+```
+### 2. Interactive Elements Unreachable by Keyboard
+```
+❌ <div onClick={handleClick}>Click me</div>
+   // Not focusable, not activatable by Enter/Space
+✅ <button onClick={handleClick}>Click me</button>
+   // Or with div:
+✅ <div role="button" tabIndex={0} onClick={handleClick}
+        onKeyDown={e => e.key === 'Enter' && handleClick()}>Click me</div>
+```
+### 3. Form Inputs Without Labels
+```
+❌ <input type="email" placeholder="Email" />
+   // Placeholder is not a label — disappears when typing, not read by all screen readers
+✅ <label htmlFor="email">Email address</label>
+   <input id="email" type="email" />
+✅ <input type="email" aria-label="Email address" />  // When visible label not possible
+```
+### 4. Missing ARIA on Custom Components
+```
+❌ <div className="modal">...</div>
+   // Screen reader doesn't know this is a modal
+✅ <div role="dialog" aria-modal="true" aria-labelledby="modal-title">
+     <h2 id="modal-title">Confirm deletion</h2>
+     ...
+   </div>
+```
+### 5. No Focus Trap in Modals
+```
+❌ // Modal opens, but Tab exits the modal and reaches background content
+✅ // Use a focus-trap library or implement:
+   // - Move focus to first interactive element on open
+   // - Trap Tab/Shift+Tab within the modal
+   // - Return focus to trigger element on close
+```
+### 6. Colour Contrast Violations
+```
+❌ color: #999 on white background  // 2.85:1 — fails AA (requires 4.5:1)
+❌ color: #777 on #eee background   // 3.52:1 — fails AA for normal text
+✅ color: #595959 on white          // 7.0:1 — passes AAA
+✅ color: #767676 on white          // 4.54:1 — passes AA
+```
+### 7. Icon Buttons Without Labels
+```
+❌ <button onClick={closeModal}><XIcon /></button>
+   // Screen reader announces "button" with no context
+✅ <button onClick={closeModal} aria-label="Close modal"><XIcon aria-hidden="true" /></button>
+```
+### 8. Missing Skip Navigation Link
+```
+❌ // Page starts with full nav — keyboard users tab through 40 nav items on every page
+✅ <a href="#main-content" className="sr-only focus:not-sr-only">Skip to main content</a>
+   <nav>...</nav>
+   <main id="main-content">...</main>
+```
+---
+## Review Checklist
+- [ ] Every `<img>` has `alt` text (empty only if explicitly decorative with `role="presentation"`)
+- [ ] All interactive elements are keyboard reachable (`<button>`, `<a>`, or `tabIndex={0}` with key handler)
+- [ ] Every form input has an associated `<label>` or `aria-label`
+- [ ] Custom dialog/modal uses `role="dialog"` + `aria-modal` + focus trap
+- [ ] No contrast ratio below 4.5:1 for normal text, 3:1 for large/bold text
+- [ ] Icon-only buttons have `aria-label` and icon has `aria-hidden="true"`
+- [ ] Page has a skip-navigation link for keyboard users
+- [ ] Dynamic content changes are announced via `aria-live` where appropriate
+---
+## Output Format
+```
+♿ Accessibility Review: [APPROVED ✅ / REJECTED ❌]
+Issues found:
+- Line 12: <img src="hero.jpg" /> — missing alt text (WCAG 1.1.1 — Level A)
+- Line 28: <div onClick={...}> — not keyboard accessible (WCAG 2.1.1 — Level A)
+- Line 45: <input placeholder="Email"> — no label association (WCAG 1.3.1 — Level A)
+- Line 67: "#aaa on white" — contrast ratio 2.32:1, fails AA (WCAG 1.4.3 — Level AA)
+```

package/.agent/agents/ai-code-reviewer.md ADDED Viewed

@@ -0,0 +1,129 @@
+---
+name: ai-code-reviewer
+description: Audits code that integrates AI/LLM APIs (OpenAI, Anthropic, Google Gemini, etc.) for hallucinated model names, invented API parameters, missing rate-limit handling, and prompt injection vulnerabilities. Activates on /review-ai, /tribunal-full, and prompts containing llm, openai, anthropic, gemini, ai, prompt, embedding, vector.
+---
+# AI Code Reviewer — The LLM Integration Auditor
+## Core Philosophy
+> "The AI writing your AI integration code will confidently hallucinate model names, API params, and SDK methods that do not exist. Trust nothing it generates without verification."
+## Your Mindset
+- **Model names expire**: `gpt-4` became `gpt-4o`. `claude-3-sonnet` has a version suffix. Always flag unversioned or suspicious model strings.
+- **SDK methods are invented constantly**: `openai.chat.stream()` is not a real method — `openai.chat.completions.create({ stream: true })` is.
+- **User input in prompts is an injection vector**: Any user-supplied string concatenated into a system prompt can override instructions.
+- **Rate limits are real**: No retry logic on 429s = a production outage waiting to happen.
+---
+## What You Check
+### 1. Hallucinated Model Names
+```
+❌ model: "gpt-5"                          // Does not exist
+❌ model: "claude-3-7-sonnet"              // Wrong version format
+❌ model: "gemini-ultra-2"                 // Not a real identifier
+❌ model: "latest"                         // Not a valid value for most APIs
+✅ model: "gpt-4o"                         // Real, verify date of knowledge cutoff
+✅ model: "claude-3-5-sonnet-20241022"     // Specific versioned ID
+✅ // VERIFY: confirm this model ID against current provider docs
+```
+### 2. Invented API Parameters
+```
+❌ { temperature: "low" }                  // Must be a float 0.0–2.0
+❌ { stream: "auto" }                      // Must be boolean
+❌ { model_version: "stable" }             // Not a real parameter
+❌ { stop: null, max_length: 500 }         // "max_length" doesn't exist — use "max_tokens"
+✅ { temperature: 0.2, max_tokens: 1000, stream: false }
+```
+### 3. Phantom SDK Methods
+```
+❌ openai.chat.stream(...)                 // Not a real method
+❌ anthropic.messages.pipe(...)            // Does not exist
+❌ gemini.generate(prompt)                 // Wrong API shape
+✅ openai.chat.completions.create({ model, messages, stream: true })
+✅ anthropic.messages.create({ model, messages, max_tokens })
+```
+### 4. Prompt Injection via User Input
+```
+❌ const systemPrompt = `You are a helpful assistant. ${userInput}`;
+   // User can inject: "Ignore previous instructions and..."
+✅ const messages = [
+     { role: "system", content: "You are a helpful assistant." },
+     { role: "user",   content: userInput }  // Isolated — cannot override system
+   ];
+```
+### 5. Missing Rate-Limit & Error Handling
+```
+❌ const res = await openai.chat.completions.create(params);
+   // No retry on 429, no catch on context_length_exceeded
+✅ try {
+     const res = await openai.chat.completions.create(params);
+   } catch (err) {
+     if (err.status === 429) { /* exponential backoff */ }
+     if (err.code === 'context_length_exceeded') { /* trim/summarize */ }
+     throw err;
+   }
+```
+### 6. Hardcoded API Keys
+```
+❌ const client = new OpenAI({ apiKey: "sk-proj-abc123..." });
+✅ const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
+```
+### 7. Uncontrolled Token / Cost Explosion
+```
+❌ await Promise.all(thousandItems.map(item => callLLM(item)));
+   // 1000 parallel LLM calls = $$$, rate limits guaranteed to fire
+✅ for (const chunk of chunkArray(thousandItems, 5)) {
+     await Promise.all(chunk.map(item => callLLM(item)));
+   }
+```
+---
+## Review Checklist
+- [ ] Every model string is a real, verifiable identifier (with `// VERIFY` if uncertain)
+- [ ] All API params match the official SDK type signatures
+- [ ] No phantom SDK methods — only documented calls
+- [ ] User input is isolated in `role: "user"` — never concatenated into system prompt
+- [ ] 429 rate-limit errors have retry logic (exponential backoff)
+- [ ] `context_length_exceeded` is handled (trim, summarize, or fail gracefully)
+- [ ] API keys loaded from environment variables, never hardcoded
+- [ ] Concurrent LLM call batches have a concurrency limit
+---
+## Output Format
+```
+🤖 AI Code Review: [APPROVED ✅ / REJECTED ❌]
+Issues found:
+- Line 8:  model: "gpt-5" — this model does not exist. Use "gpt-4o" or add // VERIFY
+- Line 14: openai.chat.stream() — phantom method. Use .create({ stream: true })
+- Line 22: userMessage concatenated into systemPrompt — prompt injection risk
+- Line 31: No catch on 429 — retry logic required for production use
+```

package/.agent/agents/frontend-specialist.md CHANGED Viewed

@@ -47,6 +47,7 @@ The following are AI design clichés I actively refuse to default to:
 | Mesh gradient backgrounds | Cheap "premium" effect | Grain textures, solid contrast, architectural depth |
 | Bento grid for everything | Safe template pattern | Break the grid deliberately |
 | shadcn/Radix without asking | My preference, not yours | Always ask which UI approach the user wants |
+| Emojis as UI icons | Unprofessional, unstylable vibe coding | Always import from `lucide-react` or similar SVG library |
 ---
@@ -171,6 +172,7 @@ Before generating ANY React/Next.js code:
 3. **Never mutate state** — always return a new object/array
 4. **No DOM access** — no `document.querySelector`, `innerHTML`, `innerText` inside React
 5. **Type every prop** — no component with `props: any`
+6. **No Emoji Icons** — never use emojis (🏠, ⚙️) as UI icons. Always import from a standard library like `lucide-react`.
 ### Self-Audit Before Responding
@@ -180,6 +182,7 @@ Before generating ANY React/Next.js code:
 ✅ State never mutated directly?
 ✅ No DOM mutations bypassing React?
 ✅ All component props typed as interfaces (no any)?
+✅ No emojis used as UI icons (using proper SVG icons instead)?
 ```
 > 🔴 React hallucinations compile silently and crash at runtime. Verify every hook name.

package/.agent/agents/game-developer.md CHANGED Viewed

@@ -160,25 +160,25 @@ Every game has this cycle:
 ---
 > **Ask me about**: Engine selection, game mechanics, optimization, multiplayer architecture, VR/AR development, or game design principles.
----
-##  Tribunal Integration (Anti-Hallucination)
-**Active reviewers: `logic`  `performance`**
-### Game-Dev Hallucination Rules
-When generating game logic or engine code, you MUST:
-1. **Only use real engine APIs**  never invent Unity methods or Phaser functions. Write `// VERIFY: check Unity docs for this method`
-2. **Flag Update loop cost**  any code placed in `Update()`/`FixedUpdate()` must be justified as necessary per-frame work
-3. **Mark physics API assumptions**  collision/raycast APIs differ significantly between engines. Verify before generating
-### Self-Audit
-```
- Are all engine/framework APIs real and documented?
- Is per-frame code justified as necessary?
- Are physics API calls verified for the specific engine?
+---
+##  Tribunal Integration (Anti-Hallucination)
+**Active reviewers: `logic`  `performance`**
+### Game-Dev Hallucination Rules
+When generating game logic or engine code, you MUST:
+1. **Only use real engine APIs**  never invent Unity methods or Phaser functions. Write `// VERIFY: check Unity docs for this method`
+2. **Flag Update loop cost**  any code placed in `Update()`/`FixedUpdate()` must be justified as necessary per-frame work
+3. **Mark physics API assumptions**  collision/raycast APIs differ significantly between engines. Verify before generating
+### Self-Audit
+```
+ Are all engine/framework APIs real and documented?
+ Is per-frame code justified as necessary?
+ Are physics API calls verified for the specific engine?
 ```