npm - tribunal-kit - Versions diffs - 2.4.6 → 3.1.0 - Mend

tribunal-kit 2.4.6 → 3.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (250) hide show

package/.agent/ARCHITECTURE.md +99 -99
package/.agent/GEMINI.md +52 -52
package/.agent/agents/accessibility-reviewer.md +139 -86
package/.agent/agents/ai-code-reviewer.md +160 -90
package/.agent/agents/backend-specialist.md +164 -127
package/.agent/agents/code-archaeologist.md +115 -73
package/.agent/agents/database-architect.md +130 -110
package/.agent/agents/debugger.md +137 -97
package/.agent/agents/dependency-reviewer.md +78 -30
package/.agent/agents/devops-engineer.md +161 -118
package/.agent/agents/documentation-writer.md +151 -87
package/.agent/agents/explorer-agent.md +117 -99
package/.agent/agents/frontend-reviewer.md +127 -47
package/.agent/agents/frontend-specialist.md +169 -109
package/.agent/agents/game-developer.md +28 -164
package/.agent/agents/logic-reviewer.md +87 -49
package/.agent/agents/mobile-developer.md +151 -103
package/.agent/agents/mobile-reviewer.md +133 -50
package/.agent/agents/orchestrator.md +121 -110
package/.agent/agents/penetration-tester.md +103 -77
package/.agent/agents/performance-optimizer.md +136 -92
package/.agent/agents/performance-reviewer.md +139 -69
package/.agent/agents/product-manager.md +104 -70
package/.agent/agents/product-owner.md +6 -25
package/.agent/agents/project-planner.md +95 -95
package/.agent/agents/qa-automation-engineer.md +174 -87
package/.agent/agents/security-auditor.md +133 -129
package/.agent/agents/seo-specialist.md +160 -99
package/.agent/agents/sql-reviewer.md +132 -44
package/.agent/agents/supervisor-agent.md +137 -109
package/.agent/agents/swarm-worker-contracts.md +17 -17
package/.agent/agents/swarm-worker-registry.md +46 -46
package/.agent/agents/test-coverage-reviewer.md +132 -53
package/.agent/agents/test-engineer.md +0 -21
package/.agent/agents/type-safety-reviewer.md +143 -33
package/.agent/patterns/generator.md +9 -9
package/.agent/patterns/inversion.md +12 -12
package/.agent/patterns/pipeline.md +9 -9
package/.agent/patterns/reviewer.md +13 -13
package/.agent/patterns/tool-wrapper.md +9 -9
package/.agent/rules/GEMINI.md +63 -63
package/.agent/scripts/__pycache__/auto_preview.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/bundle_analyzer.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/checklist.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/dependency_analyzer.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/security_scan.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/session_manager.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/skill_integrator.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/swarm_dispatcher.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/test_runner.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/verify_all.cpython-311.pyc +0 -0
package/.agent/scripts/compress_skills.py +167 -0
package/.agent/scripts/consolidate_skills.py +173 -0
package/.agent/scripts/deep_compress.py +202 -0
package/.agent/scripts/minify_context.py +80 -0
package/.agent/scripts/security_scan.py +1 -1
package/.agent/scripts/strip_tribunal.py +41 -0
package/.agent/skills/agent-organizer/SKILL.md +60 -100
package/.agent/skills/agentic-patterns/SKILL.md +0 -70
package/.agent/skills/ai-prompt-injection-defense/SKILL.md +108 -53
package/.agent/skills/api-patterns/SKILL.md +197 -257
package/.agent/skills/api-security-auditor/SKILL.md +125 -57
package/.agent/skills/app-builder/SKILL.md +326 -50
package/.agent/skills/app-builder/templates/SKILL.md +13 -15
package/.agent/skills/app-builder/templates/astro-static/TEMPLATE.md +16 -16
package/.agent/skills/app-builder/templates/chrome-extension/TEMPLATE.md +22 -22
package/.agent/skills/app-builder/templates/cli-tool/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/electron-desktop/TEMPLATE.md +20 -20
package/.agent/skills/app-builder/templates/express-api/TEMPLATE.md +17 -17
package/.agent/skills/app-builder/templates/flutter-app/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/monorepo-turborepo/TEMPLATE.md +21 -21
package/.agent/skills/app-builder/templates/nextjs-fullstack/TEMPLATE.md +19 -19
package/.agent/skills/app-builder/templates/nextjs-saas/TEMPLATE.md +26 -26
package/.agent/skills/app-builder/templates/nextjs-static/TEMPLATE.md +26 -26
package/.agent/skills/app-builder/templates/nuxt-app/TEMPLATE.md +19 -19
package/.agent/skills/app-builder/templates/python-fastapi/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/react-native-app/TEMPLATE.md +20 -20
package/.agent/skills/appflow-wireframe/SKILL.md +71 -98
package/.agent/skills/architecture/SKILL.md +161 -200
package/.agent/skills/authentication-best-practices/SKILL.md +121 -54
package/.agent/skills/bash-linux/SKILL.md +71 -166
package/.agent/skills/behavioral-modes/SKILL.md +8 -69
package/.agent/skills/brainstorming/SKILL.md +345 -127
package/.agent/skills/building-native-ui/SKILL.md +125 -57
package/.agent/skills/clean-code/SKILL.md +266 -149
package/.agent/skills/code-review-checklist/SKILL.md +0 -62
package/.agent/skills/config-validator/SKILL.md +73 -131
package/.agent/skills/csharp-developer/SKILL.md +434 -73
package/.agent/skills/database-design/SKILL.md +190 -275
package/.agent/skills/deployment-procedures/SKILL.md +81 -158
package/.agent/skills/devops-engineer/SKILL.md +255 -94
package/.agent/skills/devops-incident-responder/SKILL.md +50 -69
package/.agent/skills/doc.md +5 -5
package/.agent/skills/documentation-templates/SKILL.md +19 -63
package/.agent/skills/edge-computing/SKILL.md +75 -165
package/.agent/skills/extract-design-system/SKILL.md +84 -58
package/.agent/skills/framer-motion-expert/SKILL.md +195 -0
package/.agent/skills/frontend-design/SKILL.md +151 -499
package/.agent/skills/game-design-expert/SKILL.md +71 -0
package/.agent/skills/game-engineering-expert/SKILL.md +88 -0
package/.agent/skills/geo-fundamentals/SKILL.md +52 -178
package/.agent/skills/github-operations/SKILL.md +197 -272
package/.agent/skills/gsap-expert/SKILL.md +194 -0
package/.agent/skills/i18n-localization/SKILL.md +60 -172
package/.agent/skills/intelligent-routing/SKILL.md +123 -103
package/.agent/skills/lint-and-validate/SKILL.md +8 -52
package/.agent/skills/llm-engineering/SKILL.md +281 -195
package/.agent/skills/local-first/SKILL.md +76 -159
package/.agent/skills/mcp-builder/SKILL.md +48 -188
package/.agent/skills/mobile-design/SKILL.md +213 -219
package/.agent/skills/motion-engineering/SKILL.md +184 -0
package/.agent/skills/nextjs-react-expert/SKILL.md +184 -203
package/.agent/skills/nodejs-best-practices/SKILL.md +403 -185
package/.agent/skills/observability/SKILL.md +211 -203
package/.agent/skills/parallel-agents/SKILL.md +53 -146
package/.agent/skills/performance-profiling/SKILL.md +171 -151
package/.agent/skills/plan-writing/SKILL.md +49 -153
package/.agent/skills/platform-engineer/SKILL.md +57 -103
package/.agent/skills/playwright-best-practices/SKILL.md +110 -63
package/.agent/skills/powershell-windows/SKILL.md +61 -179
package/.agent/skills/python-patterns/SKILL.md +7 -35
package/.agent/skills/python-pro/SKILL.md +273 -114
package/.agent/skills/react-specialist/SKILL.md +227 -108
package/.agent/skills/readme-builder/SKILL.md +15 -85
package/.agent/skills/realtime-patterns/SKILL.md +216 -243
package/.agent/skills/red-team-tactics/SKILL.md +10 -51
package/.agent/skills/rust-pro/SKILL.md +525 -142
package/.agent/skills/seo-fundamentals/SKILL.md +92 -153
package/.agent/skills/server-management/SKILL.md +110 -166
package/.agent/skills/shadcn-ui-expert/SKILL.md +154 -55
package/.agent/skills/skill-creator/SKILL.md +18 -58
package/.agent/skills/sql-pro/SKILL.md +543 -68
package/.agent/skills/supabase-postgres-best-practices/SKILL.md +28 -68
package/.agent/skills/swiftui-expert/SKILL.md +124 -57
package/.agent/skills/systematic-debugging/SKILL.md +49 -151
package/.agent/skills/tailwind-patterns/SKILL.md +433 -149
package/.agent/skills/tdd-workflow/SKILL.md +63 -169
package/.agent/skills/test-result-analyzer/SKILL.md +33 -73
package/.agent/skills/testing-patterns/SKILL.md +437 -130
package/.agent/skills/trend-researcher/SKILL.md +30 -71
package/.agent/skills/ui-ux-pro-max/SKILL.md +0 -41
package/.agent/skills/ui-ux-researcher/SKILL.md +51 -91
package/.agent/skills/vue-expert/SKILL.md +225 -119
package/.agent/skills/vulnerability-scanner/SKILL.md +264 -226
package/.agent/skills/web-accessibility-auditor/SKILL.md +141 -58
package/.agent/skills/web-design-guidelines/SKILL.md +17 -61
package/.agent/skills/webapp-testing/SKILL.md +71 -196
package/.agent/skills/whimsy-injector/SKILL.md +58 -132
package/.agent/skills/workflow-optimizer/SKILL.md +28 -68
package/.agent/workflows/api-tester.md +96 -224
package/.agent/workflows/audit.md +81 -122
package/.agent/workflows/brainstorm.md +69 -105
package/.agent/workflows/changelog.md +65 -97
package/.agent/workflows/create.md +73 -88
package/.agent/workflows/debug.md +80 -111
package/.agent/workflows/deploy.md +119 -92
package/.agent/workflows/enhance.md +80 -91
package/.agent/workflows/fix.md +68 -97
package/.agent/workflows/generate.md +165 -164
package/.agent/workflows/migrate.md +106 -109
package/.agent/workflows/orchestrate.md +103 -86
package/.agent/workflows/performance-benchmarker.md +77 -268
package/.agent/workflows/plan.md +120 -98
package/.agent/workflows/preview.md +39 -96
package/.agent/workflows/refactor.md +105 -97
package/.agent/workflows/review-ai.md +63 -102
package/.agent/workflows/review.md +71 -110
package/.agent/workflows/session.md +53 -113
package/.agent/workflows/status.md +42 -88
package/.agent/workflows/strengthen-skills.md +90 -51
package/.agent/workflows/swarm.md +114 -129
package/.agent/workflows/test.md +125 -102
package/.agent/workflows/tribunal-backend.md +60 -78
package/.agent/workflows/tribunal-database.md +62 -100
package/.agent/workflows/tribunal-frontend.md +62 -82
package/.agent/workflows/tribunal-full.md +56 -100
package/.agent/workflows/tribunal-mobile.md +65 -94
package/.agent/workflows/tribunal-performance.md +62 -105
package/.agent/workflows/ui-ux-pro-max.md +72 -121
package/README.md +11 -15
package/package.json +1 -1
package/.agent/skills/api-patterns/api-style.md +0 -42
package/.agent/skills/api-patterns/auth.md +0 -24
package/.agent/skills/api-patterns/documentation.md +0 -26
package/.agent/skills/api-patterns/graphql.md +0 -41
package/.agent/skills/api-patterns/rate-limiting.md +0 -31
package/.agent/skills/api-patterns/response.md +0 -37
package/.agent/skills/api-patterns/rest.md +0 -40
package/.agent/skills/api-patterns/security-testing.md +0 -122
package/.agent/skills/api-patterns/trpc.md +0 -41
package/.agent/skills/api-patterns/versioning.md +0 -22
package/.agent/skills/app-builder/agent-coordination.md +0 -71
package/.agent/skills/app-builder/feature-building.md +0 -53
package/.agent/skills/app-builder/project-detection.md +0 -34
package/.agent/skills/app-builder/scaffolding.md +0 -118
package/.agent/skills/app-builder/tech-stack.md +0 -40
package/.agent/skills/architecture/context-discovery.md +0 -43
package/.agent/skills/architecture/examples.md +0 -94
package/.agent/skills/architecture/pattern-selection.md +0 -68
package/.agent/skills/architecture/patterns-reference.md +0 -50
package/.agent/skills/architecture/trade-off-analysis.md +0 -77
package/.agent/skills/brainstorming/dynamic-questioning.md +0 -360
package/.agent/skills/database-design/database-selection.md +0 -43
package/.agent/skills/database-design/indexing.md +0 -39
package/.agent/skills/database-design/migrations.md +0 -48
package/.agent/skills/database-design/optimization.md +0 -36
package/.agent/skills/database-design/orm-selection.md +0 -30
package/.agent/skills/database-design/schema-design.md +0 -56
package/.agent/skills/dotnet-core-expert/SKILL.md +0 -103
package/.agent/skills/framer-motion-animations/SKILL.md +0 -74
package/.agent/skills/frontend-design/animation-guide.md +0 -331
package/.agent/skills/frontend-design/color-system.md +0 -329
package/.agent/skills/frontend-design/decision-trees.md +0 -418
package/.agent/skills/frontend-design/motion-graphics.md +0 -306
package/.agent/skills/frontend-design/typography-system.md +0 -363
package/.agent/skills/frontend-design/ux-psychology.md +0 -1116
package/.agent/skills/frontend-design/visual-effects.md +0 -383
package/.agent/skills/game-development/2d-games/SKILL.md +0 -119
package/.agent/skills/game-development/3d-games/SKILL.md +0 -135
package/.agent/skills/game-development/SKILL.md +0 -236
package/.agent/skills/game-development/game-art/SKILL.md +0 -185
package/.agent/skills/game-development/game-audio/SKILL.md +0 -190
package/.agent/skills/game-development/game-design/SKILL.md +0 -129
package/.agent/skills/game-development/mobile-games/SKILL.md +0 -108
package/.agent/skills/game-development/multiplayer/SKILL.md +0 -132
package/.agent/skills/game-development/pc-games/SKILL.md +0 -144
package/.agent/skills/game-development/vr-ar/SKILL.md +0 -123
package/.agent/skills/game-development/web-games/SKILL.md +0 -150
package/.agent/skills/intelligent-routing/router-manifest.md +0 -65
package/.agent/skills/mobile-design/decision-trees.md +0 -516
package/.agent/skills/mobile-design/mobile-backend.md +0 -491
package/.agent/skills/mobile-design/mobile-color-system.md +0 -420
package/.agent/skills/mobile-design/mobile-debugging.md +0 -122
package/.agent/skills/mobile-design/mobile-design-thinking.md +0 -357
package/.agent/skills/mobile-design/mobile-navigation.md +0 -458
package/.agent/skills/mobile-design/mobile-performance.md +0 -767
package/.agent/skills/mobile-design/mobile-testing.md +0 -356
package/.agent/skills/mobile-design/mobile-typography.md +0 -433
package/.agent/skills/mobile-design/platform-android.md +0 -666
package/.agent/skills/mobile-design/platform-ios.md +0 -561
package/.agent/skills/mobile-design/touch-psychology.md +0 -537
package/.agent/skills/nextjs-react-expert/1-async-eliminating-waterfalls.md +0 -312
package/.agent/skills/nextjs-react-expert/2-bundle-bundle-size-optimization.md +0 -240
package/.agent/skills/nextjs-react-expert/3-server-server-side-performance.md +0 -490
package/.agent/skills/nextjs-react-expert/4-client-client-side-data-fetching.md +0 -264
package/.agent/skills/nextjs-react-expert/5-rerender-re-render-optimization.md +0 -581
package/.agent/skills/nextjs-react-expert/6-rendering-rendering-performance.md +0 -432
package/.agent/skills/nextjs-react-expert/7-js-javascript-performance.md +0 -684
package/.agent/skills/nextjs-react-expert/8-advanced-advanced-patterns.md +0 -150
package/.agent/skills/vulnerability-scanner/checklists.md +0 -121

package/.agent/workflows/orchestrate.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-description: Coordinate multiple agents for complex tasks. Use for multi-perspective analysis, comprehensive reviews, or tasks requiring different domain expertise.
+description: Coordinate multiple agents for complex tasks. Use for multi-perspective analysis, comprehensive reviews requiring different domain expertise, or tasks where a single agent would miss domain-specific failures. Fan-Out dispatch → parallel execution → Fan-In synthesis → Human Gate.
 ---
 # /orchestrate — Multi-Agent Coordination
@@ -8,144 +8,161 @@ $ARGUMENTS
 ---
-This command coordinates multiple specialists to solve a problem that requires more than one domain. **One agent is not orchestration.**
+## When to Use /orchestrate
+|Use `/orchestrate` when...|Use something else when...|
+|:---|:---|
+|Task spans 2+ technical domains|Single domain → use specialist directly|
+|Multi-perspective review is needed|Simple code generation → `/generate`|
+|Fan-out parallelism would save time|Debugging → `/debug` (sequential by nature)|
+|One agent would miss domain failures|Planning only → `/plan`|
 ---
-## When to Use /orchestrate vs Other Commands
+## Phase 1 — Scope Classification
-| Use `/orchestrate` when... | Use something else when... |
-|---|---|
-| Task requires 3+ domain specialists | Single domain → use the right `/tribunal-*` |
-| Sequential work with review gates between waves | Parallel, independent tasks → `/swarm` |
-| Existing codebase with complex dependencies | Greenfield project → `/create` |
-| Human gates required between every wave | Maximum parallel output → `/swarm` |
+Before dispatching workers:
+```
+1. Is this actually multi-domain? (2+ distinct technical areas)
+   → YES → proceed to Phase 2
+   → NO  → route to the single correct specialist agent
+2. Can tasks be parallelized (no dependencies between them)?
+   → YES → Fan-Out dispatch (all workers simultaneous)
+   → NO  → Sequential wave dispatch
+3. Context budget check:
+   □ How many files does each worker need?
+   □ Total context across all workers manageable?
+   □ Can I pass context_summary instead of full file dumps?
+```
 ---
-## The Minimum Rule
+## Phase 2 — Worker Decomposition
-> **Fewer than 3 agents = not orchestration.**
->
-> Before marking any orchestration session as complete, count the agents invoked. If the count is less than 3, activate more. A single agent delegated to is just a delegation.
+Break the goal into atomic, non-overlapping worker tasks:
----
+```
+Goal: Review the full checkout feature before launch
-## Agent Selection by Task Type
+Decomposed Workers:
+├── Worker A [backend-specialist]: Review API routes for auth and validation
+├── Worker B [database-architect]: Review DB queries for N+1 and transactions
+├── Worker C [frontend-specialist]: Review UI components for RSC compliance
+└── Worker D [security-auditor]: Review the full checkout flow for OWASP issues
+```
-| Task | Required Specialists |
-|---|---|
-| Full-stack feature | `frontend-specialist` + `backend-specialist` + `test-engineer` |
-| API build | `backend-specialist` + `security-auditor` + `test-engineer` |
-| Database-heavy work | `database-architect` + `backend-specialist` + `security-auditor` |
-| Complete product | `project-planner` + `frontend-specialist` + `backend-specialist` + `devops-engineer` |
-| Security investigation | `security-auditor` + `penetration-tester` + `devops-engineer` |
-| Complex bug | `debugger` + `explorer-agent` + `test-engineer` |
-| New codebase or unknown repo | `explorer-agent` + relevant specialists |
+**Worker files cannot overlap.** If two workers both need to modify the same file → one worker owns it.
 ---
-## Two-Phase Protocol (Strict)
-### Phase A — Planning Only
+## Fan-Out Pattern (Parallel Dispatch)
-Only two agents are allowed during planning:
+When tasks are independent, dispatch all simultaneously:
 ```
-project-planner   → writes docs/PLAN-{slug}.md
-explorer-agent    → (if working in existing code) maps the codebase structure
+━━━ Wave 1: Fan-Out ━━━━━━━━━━━━━━━━━━━━━━━
+Worker A (backend)    → RUNNING  (reading: src/app/api/checkout/)
+Worker B (database)   → RUNNING  (reading: prisma/schema.prisma, checkout queries)
+Worker C (frontend)   → RUNNING  (reading: src/app/checkout/page.tsx)
+Worker D (security)   → RUNNING  (reading: all of the above)
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+Wait for ALL workers (allSettled — single worker failure doesn't cancel siblings)
+━━━ Wave 1: Results ━━━━━━━━━━━━━━━━━━━━━━━
+Worker A: ✅ COMPLETE
+Worker B: ✅ COMPLETE
+Worker C: ⚠️ BLOCKED (missing: what state management pattern to assume)
+Worker D: ✅ COMPLETE
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+Supervisor provides missing info to Worker C → redispatch
 ```
-No other agent runs. No code is produced.
+---
-After planning, the plan is shown to the user:
+## BLOCKED Worker Protocol
-```
-✅ Plan ready: docs/PLAN-{slug}.md
+When a worker cannot proceed:
-Approve to start implementation? (Y / N)
+```
+Status: BLOCKED
+Reason: Missing context — the auth middleware file is not in provided scope
+Unblocked by: Read src/middleware.ts first and pass auth pattern to worker
+Supervisor action:
+1. Provide the missing context if available
+2. Escalate to human if decision is needed
+3. Never guess — BLOCKED beats hallucinating
 ```
-**Phase B does NOT start without a Y.**
+---
-### Phase B — Implementation (Manager & Micro-Workers)
+## Sequential Wave Execution
-After approval, the Orchestrator acts as Manager and dispatches Micro-Workers using **isolated JSON payloads**.
+When tasks depend on each other:
 ```
-Wave 1:  database-architect + security-auditor (JSON dispatch #1)
-         ↓
-[Wait for completion & Tribunal review]
-         ↓
-Wave 2:  backend-specialist + frontend-specialist (JSON dispatch #2)
-         ↓
-[Wait for completion & Tribunal review]
-         ↓
-Wave 3:  test-engineer (JSON dispatch #3)
-         ↓
-[Wait for completion & Human Gate]
+Wave 1 → Foundation (must complete first)
+Wave 2 → Depends on Wave 1 output (receives context_summary, not full output)
+Wave 3 → Synthesis (combines all wave outputs)
 ```
-Workers execute in parallel **within** their wave, but waves execute **sequentially**. Each wave waits for the previous wave's Tribunal gate before proceeding.
+**Context discipline between waves:** Summarize Wave N output in 3-5 bullets before passing to Wave N+1.
 ---
-## Hierarchical Context Pruning
-When dispatching workers, the Orchestrator MUST use the `dispatch_micro_workers` JSON format.
+## Fan-In — Synthesis
-**Context discipline is strictly enforced:**
+After all workers complete:
 ```
-❌ Never pass full chat histories to workers
-❌ Never attach every file — attach only files the worker will actually read
-✅ The context_summary injected by the Orchestrator is the ONLY shared context
-✅ Files attached are strictly limited to what's needed for that specific task
+1. Merge findings by severity
+2. Identify conflicts (Worker A says X, Worker B says Y)
+3. Resolve conflicts with evidence (which worker has specific file evidence?)
+4. Produce unified output sorted by priority
 ```
-**Per-worker context limit:** Excerpt only the relevant function or schema section — never the entire file.
 ---
-## Retry Protocol
+## Human Gate
 ```
-Attempt 1  → Worker runs with original parameters
-Attempt 2  → Worker runs with stricter constraints + failure feedback
-Attempt 3  → Worker runs with max constraints + full context dump
-Attempt 4  → HALT. Report to human with full failure history.
-```
+━━━ Orchestration Complete ━━━━━━━━━━━━━━━━━
-Hard limit: **3 retries per worker**. After 3 failures, escalate — do not silently proceed.
+Workers: 4 dispatched / 4 complete / 0 blocked
----
+━━━ Synthesized Findings ━━━━━━━━━━━━━━━━━━
+[Critical issues first, then high, then medium]
-## Hallucination Guard
+━━━ Required Changes ━━━━━━━━━━━━━━━━━━━━━
+Files to modify: [list]
+Files to create: [list]
-- Every agent's output goes through Tribunal before it reaches the user
-- The Human Gate fires before any file is written — user sees the diff and approves
-- Per-agent scope is enforced — `frontend-specialist` **never** writes DB migrations
-- Retry limit: 3 Maker revisions per agent; after 3 failures → stop and report
-- `context_summary` is the only mechanism for sharing context across agents — no full dumps
+━━━ Human Gate ━━━━━━━━━━━━━━━━━━━━━━━━━━━
+Approve?  Y = proceed | N = discard | R = revise
+```
 ---
-## Cross-Workflow Navigation
+## Error Recovery
-| When /orchestrate reveals... | Go to |
-|---|---|
-| Worker keeps failing after 3 retries | `/debug` the isolated worker task |
-| Plan needed before orchestrating | `/plan` first, then run `/orchestrate` against it |
-| Fully parallel independent sub-tasks | `/swarm` is more efficient |
-| Single domain needs specialist audit | Use the domain-specific `/tribunal-*` |
+```
+Worker failure (after 3 retries):
+  Report: agent=[name], task=[what], attempts=3, last_error=[error], suggestion=[what to check]
+  Action: continue remaining workers, include failure in final synthesis
+```
 ---
-## Usage
+## Usage Examples
 ```
-/orchestrate build a complete auth system with JWT and refresh tokens
-/orchestrate review the entire API layer for security issues
-/orchestrate build a multi-tenant SaaS onboarding flow
-/orchestrate analyze this repo and implement all security findings
+/orchestrate review the entire authentication system for security and correctness
+/orchestrate analyze the payment feature: backend logic + DB queries + frontend UX
+/orchestrate comprehensive code review before launch: security + tests + performance
+/orchestrate compare three different caching strategies and recommend the best fit
 ```

package/.agent/workflows/performance-benchmarker.md CHANGED Viewed

@@ -1,305 +1,114 @@
 ---
-description: Run standardized performance benchmarks including Lighthouse, bundle analysis, and latency checks.
+description: Run standardized performance benchmarks including Lighthouse CI, bundle analysis, and API latency checks. Records before/after metrics. No optimization claims without measured evidence.
 ---
-# /performance-benchmarker — Automated Performance Audit
+# /performance-benchmarker — Evidence-Based Performance Measurement
 $ARGUMENTS
 ---
-This command runs a comprehensive suite of performance benchmarks against your project and generates a structured report with numerical scores, regression detection, and prioritized actionable fixes.
+## When to Use /performance-benchmarker
----
-## When to Use
-- Before any `/deploy` to catch performance regressions.
-- After adding new dependencies or large features.
-- When user reports "it feels slow" or asks to "check performance".
-- When triggered by `benchmark`, `lighthouse`, `bundle size`, or `latency` keywords.
----
-## Pipeline Flow
-```
-Request (scope: full / web-vitals / bundle / api)
-    │
-    ▼
-Environment detection — framework, build tool, package manager
-    │
-    ▼
-Tool availability check — lighthouse? build script? dev server?
-    │
-    ▼
-Benchmark execution — run selected checks
-    │
-    ▼
-Score calculation — weighted composite
-    │
-    ▼
-Regression detection — compare against previous baselines (if available)
-    │
-    ▼
-Report — scores, pass/fail, recommendations, fix priority
-```
+|Use `/performance-benchmarker` when...|Use something else when...|
+|:---|:---|
+|Establishing performance baseline|Code optimization decisions → `/tribunal-performance`|
+|After optimization — verify improvement|Memory leaks investigation → `/debug`|
+|Pre-release performance gate|Bundle analysis only → run ANALYZE=true npm run build|
+|Regular weekly benchmark|API review only → `/tribunal-backend`|
 ---
-## Benchmark Suite
+## Benchmark Suite (Run in Order)
-### 1. Web Vitals (Frontend Performance)
-| Metric | Good | Needs Work | Poor | Measurement |
-|---|---|---|---|---|
-| LCP (Largest Contentful Paint) | < 2.5s | 2.5-4.0s | > 4.0s | Lighthouse or `web-vitals` library |
-| INP (Interaction to Next Paint) | < 200ms | 200-500ms | > 500ms | Lab approximation via TBT |
-| CLS (Cumulative Layout Shift) | < 0.1 | 0.1-0.25 | > 0.25 | Layout shift detection |
-| TTFB (Time to First Byte) | < 800ms | 800-1800ms | > 1800ms | Server response timing |
-| FCP (First Contentful Paint) | < 1.8s | 1.8-3.0s | > 3.0s | Lighthouse |
-| Speed Index | < 3.4s | 3.4-5.8s | > 5.8s | Lighthouse |
-**How to Run:**
 ```bash
-# If lighthouse is available
-npx lighthouse http://localhost:3000 --output json --chrome-flags="--headless"
-# If web-vitals is installed, inject into page and measure
-# VERIFY: check if lighthouse-cli is available before running
+# 1. Lighthouse CI — Core Web Vitals
+npx lighthouse http://localhost:3000 \
+  --output=json \
+  --output-path=./reports/lighthouse-$(date +%Y%m%d).json \
+  --only-categories=performance,accessibility,best-practices,seo
+# 2. Bundle Analysis
+ANALYZE=true npm run build
+# 3. API latency (using autocannon for load test)
+npx autocannon -c 10 -d 20 http://localhost:3000/api/products
+# -c: 10 concurrent connections
+# -d: 20 second duration
+# 4. Database query analysis
+# (Prisma): Add to your test route temporarily
+const plan = await prisma.$queryRaw`EXPLAIN ANALYZE SELECT * FROM orders WHERE user_id = ${userId}`;
+console.log(plan);
 ```
-**Common Fixes by Metric:**
-| Metric | Fix | Impact |
-|---|---|---|
-| LCP slow | Preload hero image, use `fetchpriority="high"` | High |
-| LCP slow | Eliminate render-blocking CSS/JS | High |
-| INP slow | Break long tasks > 50ms into smaller chunks | High |
-| INP slow | Use `requestIdleCallback` for non-critical work | Medium |
-| CLS high | Set explicit `width`/`height` on images/embeds | High |
-| CLS high | Use `font-display: swap` + font preload | Medium |
-| TTFB slow | Add caching headers, use CDN | High |
-| TTFB slow | Optimize database queries, add indexes | High |
-| FCP slow | Inline critical CSS, defer non-critical | High |
-### 2. Bundle Analysis (JavaScript/CSS)
-| Check | Target | Warning | Fail | Tool |
-|---|---|---|---|---|
-| Total JS (gzipped) | < 100KB | 100-200KB | > 200KB | Build output |
-| Largest chunk (gzipped) | < 50KB | 50-100KB | > 100KB | Build output |
-| CSS total | < 50KB | 50-100KB | > 100KB | Build output |
-| Unused CSS | < 5% | 5-15% | > 15% | PurgeCSS |
-| Duplicate packages | 0 | 1-2 | > 2 | Bundle analyzer |
-| Tree-shaking | No side-effect barrel exports | — | Side-effect imports found | Manual analysis |
-**How to Run:**
-```bash
-# Build and analyze
-npm run build -- --stats
-# VERIFY: check if the build script supports --stats flag
-# Alternative: analyze existing build output
-npx source-map-explorer dist/**/*.js
-# VERIFY: check if source-map-explorer is available
-```
-**Common Fixes:**
-| Issue | Fix | Savings |
-|---|---|---|
-| Large lodash import | `import debounce from 'lodash/debounce'` not `import { debounce } from 'lodash'` | 50-80KB |
-| Moment.js | Replace with `dayjs` or `date-fns` | 60-70KB |
-| Full icon library | Use tree-shakeable imports or individual icon files | 20-100KB |
-| Uncompressed images | Use WebP/AVIF, add `loading="lazy"` | 50-500KB |
-| CSS framework unused | PurgeCSS or `content` config in Tailwind | 30-90KB |
-### 3. API Latency (Backend Performance)
-| Check | Target | Warning | Fail | Method |
-|---|---|---|---|---|
-| Avg response (simple GET) | < 100ms | 100-300ms | > 300ms | 10 sequential requests |
-| Avg response (complex query) | < 300ms | 300-800ms | > 800ms | 10 sequential requests |
-| P95 response | < 500ms | 500-1000ms | > 1000ms | Sort, pick 95th percentile |
-| P99 response | < 1000ms | 1-3s | > 3s | Sort, pick 99th percentile |
-| Cold start | < 1s | 1-3s | > 3s | First request after 30s idle |
-| Concurrent handling | Linear scaling up to 10 req | — | Exponential degradation | 10 parallel requests |
-**How to Run:**
-```bash
-# Using curl timing
-curl -o /dev/null -s -w "time_total: %{time_total}s\n" http://localhost:3000/api/health
-# Loop for average
-for i in $(seq 1 10); do
-  curl -o /dev/null -s -w "%{time_total}\n" http://localhost:3000/api/endpoint
-done
-```
-**Common Fixes:**
-| Symptom | Likely Cause | Fix |
-|---|---|---|
-| Slow first request | Cold start, no connection pool | Pre-warm, use connection pooling |
-| Slow list endpoints | N+1 queries | Add eager loading / `include` |
-| Slow under load | No caching | Add Redis/in-memory cache for hot paths |
-| Inconsistent P95 | GC pauses | Optimize memory allocation, reduce object churn |
-### 4. Build Performance (DX)
-| Check | Target | Warning | Fail |
-|---|---|---|---|
-| Dev server cold start | < 3s | 3-8s | > 8s |
-| Hot reload (HMR) | < 200ms | 200-500ms | > 500ms |
-| Full production build | < 30s | 30-60s | > 60s |
-| TypeScript type-check | < 15s | 15-30s | > 30s |
 ---
-## Composite Score
+## Benchmark Report Format
 ```
-Performance Score = (
-  Web_Vitals_Score × 0.35 +
-  Bundle_Score     × 0.25 +
-  API_Score        × 0.25 +
-  Build_Score      × 0.15
-) × 100
-Grade:
-  90-100  →  A  (Ship with confidence)
-  75-89   →  B  (Minor optimizations available)
-  60-74   →  C  (Notable performance issues)
-  40-59   →  D  (Significant problems — fix before deploy)
-  < 40    →  F  (Critical — likely impacts user retention)
+━━━ Performance Benchmark — [date] ━━━━━━━━━
+━━━ Core Web Vitals (Lighthouse) ━━━━━━━━━━
+LCP:  [time]   [✅ Good | ⚠️ Needs Work | ❌ Poor]
+INP:  [time]   [✅ Good | ⚠️ Needs Work | ❌ Poor]
+CLS:  [score]  [✅ Good | ⚠️ Needs Work | ❌ Poor]
+FCP:  [time]
+TTFB: [time]
+Performance Score: [N]/100
+━━━ Bundle Sizes ━━━━━━━━━━━━━━━━━━━━━━━━━
+First Load JS (shared): [size]
+Largest page:           [size] ([route])
+Largest 3 bundles:
+  [bundle]: [size]
+  [bundle]: [size]
+  [bundle]: [size]
+━━━ API Latency (10 concurrent, 20s) ━━━━━━
+GET /api/products: avg [ms] | p99 [ms] | [req/s] req/s
+POST /api/orders:  avg [ms] | p99 [ms]
+━━━ Comparison (vs last run) ━━━━━━━━━━━━━━
+LCP:    4.2s → 1.9s  ▼ IMPROVED ✅
+INP:    480ms → 140ms ▼ IMPROVED ✅
+Bundle: 890kb → 310kb ▼ IMPROVED ✅
+p99 latency: 230ms → 89ms ▼ IMPROVED ✅
 ```
-Each sub-score is calculated as: `(checks_passed / total_checks)` weighted by target (1.0), warning (0.6), fail (0.0).
 ---
-## Output Format
+## Performance Gates (Fail Criteria)
 ```
-━━━ Performance Benchmark Report ━━━━━━━━━
-Project:  [name]
-Date:     [timestamp]
-Score:    [0-100] / 100 → Grade [A-F]
-━━━ Web Vitals ━━━━━━━━━━━━━━━━━━━━━━━━━
-LCP:    1.8s  ✅ Good    (target: < 2.5s)
-INP:    95ms  ✅ Good    (target: < 200ms)
-CLS:    0.05  ✅ Good    (target: < 0.1)
-TTFB:   420ms ✅ Good    (target: < 800ms)
-FCP:    1.2s  ✅ Good    (target: < 1.8s)
-Score:  92/100
-━━━ Bundle ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
-Total JS:      156KB gzipped  🟡 Warning  (target: < 100KB)
-Largest chunk:  82KB gzipped  🟡 Warning  (target: < 50KB)
-CSS total:      28KB gzipped  ✅ Good
-Unused CSS:    4.2%           ✅ Good
-Duplicates:    0              ✅ Good
-Score: 72/100
-━━━ API Latency ━━━━━━━━━━━━━━━━━━━━━━━━
-GET /api/users:     avg 89ms  ✅  |  p95 142ms  ✅
-POST /api/auth:     avg 210ms 🟡  |  p95 480ms  🟡
-GET /api/dashboard: avg 340ms ❌  |  p95 820ms  ❌
-Cold start:         680ms     ✅
-Score: 58/100
-━━━ Build ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
-Dev cold start:   2.1s  ✅
-HMR:              89ms  ✅
-Production build: 18s   ✅
-Type-check:       12s   ✅
-Score: 100/100
-━━━ Fix Priority (by impact) ━━━━━━━━━━━
-1. 🔴 GET /api/dashboard avg 340ms
-   → Add database index on dashboard query joins
-   → Expected: < 100ms (70% improvement)
-2. 🟡 Total JS 156KB
-   → Lazy-load chart library (80KB)
-   → Expected: < 80KB initial (50% reduction)
-3. 🟡 POST /api/auth avg 210ms
-   → Cache user lookup in auth flow
-   → Expected: < 100ms (50% improvement)
-━━━ Trend (if baseline available) ━━━━━━
-LCP:    1.8s → 1.8s  → (no change)
-Bundle: 140KB → 156KB ↑ (+11%) ⚠️ Regression
-API p95: 400ms → 480ms ↑ (+20%) ⚠️ Regression
+Failing these means optimization is blocking — not optional:
+LCP   > 4.0s     → ❌ Must fix — users see blank page
+INP   > 500ms    → ❌ Must fix — UI feels unresponsive
+CLS   > 0.25     → ❌ Must fix — layout jumps are jarring
+Bundle > 1mb     → ❌ Must fix — 3G users abandon
+p99 API > 2000ms → ❌ Must fix — timeout risk on slow connections
+Warning range (fix before major release):
+LCP   2.5–4.0s  → ⚠️
+INP   200–500ms → ⚠️
+Bundle 500kb–1mb → ⚠️
 ```
 ---
-## Regression Detection
-If a previous benchmark baseline exists (stored in `perf-baseline.json` or similar):
+## Historical Tracking
-| Metric | Change | Status |
-|---|---|---|
-| < 5% increase | No change | ✅ Stable |
-| 5-15% increase | Minor regression | 🟡 Flag |
-| > 15% increase | Significant regression | 🔴 Block deploy |
-| Any decrease | Improvement | 🎉 Celebrate |
----
-## Baseline Management
-After a successful benchmark, save a baseline to detect future regressions:
+Save every benchmark run:
 ```bash
-# Save current benchmark as baseline
-python .agent/scripts/bundle_analyzer.py . --save-baseline
+# Benchmarks should be saved with date stamps
+./reports/lighthouse-2026-04-02.json
+./reports/bundle-2026-04-02.txt
+./reports/latency-2026-04-02.txt
 ```
-The baseline file is `perf-baseline.json` in the project root. Check it into version control so regressions are caught in CI.
----
-## Cross-Workflow Navigation
-| After /performance-benchmarker shows... | Go to |
-|---|---|
-| Grade D or F | `/tribunal-performance` on the slowest code paths |
-| Bundle regression (+15%) | `/audit` for dependency analysis, then `/fix` |
-| API latency P95 > 500ms | `/debug` to identify the slow query or operation |
-| Web vitals LCP > 4s | `/enhance` to add image preloading and critical CSS |
-| Grade A or B, ready for deploy | `/deploy` following pre-flight checklist |
+This enables trend analysis: is performance improving or degrading over time?
 ---
-## Hallucination Guard
-- **Only run benchmarks with installed tools** — check with `which` or `npx --dry-run` first.
-- **Never fabricate benchmark numbers** — report "SKIPPED: [tool] not installed" if unavailable.
-- **Flag anomalies**: `// NOTE: unusually fast — may be cached` or `// NOTE: first run, cold start included`.
-- **Mark tool availability**: `// VERIFY: lighthouse-cli not detected, using fallback estimation`.
-- **Don't guess fixes** — only recommend fixes for issues that have measured evidence.
----
-## Usage
-```
-/performance-benchmarker full audit
-/performance-benchmarker web vitals only
-/performance-benchmarker bundle analysis
-/performance-benchmarker api latency for /api/users /api/posts
-/performance-benchmarker build performance
-/performance-benchmarker compare with baseline
-```