tribunal-kit 2.4.6 → 3.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.agent/ARCHITECTURE.md +99 -99
- package/.agent/GEMINI.md +52 -52
- package/.agent/agents/accessibility-reviewer.md +139 -86
- package/.agent/agents/ai-code-reviewer.md +160 -90
- package/.agent/agents/backend-specialist.md +164 -127
- package/.agent/agents/code-archaeologist.md +115 -73
- package/.agent/agents/database-architect.md +130 -110
- package/.agent/agents/debugger.md +137 -97
- package/.agent/agents/dependency-reviewer.md +78 -30
- package/.agent/agents/devops-engineer.md +161 -118
- package/.agent/agents/documentation-writer.md +151 -87
- package/.agent/agents/explorer-agent.md +117 -99
- package/.agent/agents/frontend-reviewer.md +127 -47
- package/.agent/agents/frontend-specialist.md +169 -109
- package/.agent/agents/game-developer.md +28 -164
- package/.agent/agents/logic-reviewer.md +87 -49
- package/.agent/agents/mobile-developer.md +151 -103
- package/.agent/agents/mobile-reviewer.md +133 -50
- package/.agent/agents/orchestrator.md +121 -110
- package/.agent/agents/penetration-tester.md +103 -77
- package/.agent/agents/performance-optimizer.md +136 -92
- package/.agent/agents/performance-reviewer.md +139 -69
- package/.agent/agents/product-manager.md +104 -70
- package/.agent/agents/product-owner.md +6 -25
- package/.agent/agents/project-planner.md +95 -95
- package/.agent/agents/qa-automation-engineer.md +174 -87
- package/.agent/agents/security-auditor.md +133 -129
- package/.agent/agents/seo-specialist.md +160 -99
- package/.agent/agents/sql-reviewer.md +132 -44
- package/.agent/agents/supervisor-agent.md +137 -109
- package/.agent/agents/swarm-worker-contracts.md +17 -17
- package/.agent/agents/swarm-worker-registry.md +46 -46
- package/.agent/agents/test-coverage-reviewer.md +132 -53
- package/.agent/agents/test-engineer.md +0 -21
- package/.agent/agents/type-safety-reviewer.md +143 -33
- package/.agent/patterns/generator.md +9 -9
- package/.agent/patterns/inversion.md +12 -12
- package/.agent/patterns/pipeline.md +9 -9
- package/.agent/patterns/reviewer.md +13 -13
- package/.agent/patterns/tool-wrapper.md +9 -9
- package/.agent/rules/GEMINI.md +63 -63
- package/.agent/scripts/__pycache__/auto_preview.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/bundle_analyzer.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/checklist.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/dependency_analyzer.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/security_scan.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/session_manager.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/skill_integrator.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/swarm_dispatcher.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/test_runner.cpython-311.pyc +0 -0
- package/.agent/scripts/__pycache__/verify_all.cpython-311.pyc +0 -0
- package/.agent/scripts/compress_skills.py +167 -0
- package/.agent/scripts/consolidate_skills.py +173 -0
- package/.agent/scripts/deep_compress.py +202 -0
- package/.agent/scripts/minify_context.py +80 -0
- package/.agent/scripts/security_scan.py +1 -1
- package/.agent/scripts/strip_tribunal.py +41 -0
- package/.agent/skills/agent-organizer/SKILL.md +60 -100
- package/.agent/skills/agentic-patterns/SKILL.md +0 -70
- package/.agent/skills/ai-prompt-injection-defense/SKILL.md +108 -53
- package/.agent/skills/api-patterns/SKILL.md +197 -257
- package/.agent/skills/api-security-auditor/SKILL.md +125 -57
- package/.agent/skills/app-builder/SKILL.md +326 -50
- package/.agent/skills/app-builder/templates/SKILL.md +13 -15
- package/.agent/skills/app-builder/templates/astro-static/TEMPLATE.md +16 -16
- package/.agent/skills/app-builder/templates/chrome-extension/TEMPLATE.md +22 -22
- package/.agent/skills/app-builder/templates/cli-tool/TEMPLATE.md +18 -18
- package/.agent/skills/app-builder/templates/electron-desktop/TEMPLATE.md +20 -20
- package/.agent/skills/app-builder/templates/express-api/TEMPLATE.md +17 -17
- package/.agent/skills/app-builder/templates/flutter-app/TEMPLATE.md +18 -18
- package/.agent/skills/app-builder/templates/monorepo-turborepo/TEMPLATE.md +21 -21
- package/.agent/skills/app-builder/templates/nextjs-fullstack/TEMPLATE.md +19 -19
- package/.agent/skills/app-builder/templates/nextjs-saas/TEMPLATE.md +26 -26
- package/.agent/skills/app-builder/templates/nextjs-static/TEMPLATE.md +26 -26
- package/.agent/skills/app-builder/templates/nuxt-app/TEMPLATE.md +19 -19
- package/.agent/skills/app-builder/templates/python-fastapi/TEMPLATE.md +18 -18
- package/.agent/skills/app-builder/templates/react-native-app/TEMPLATE.md +20 -20
- package/.agent/skills/appflow-wireframe/SKILL.md +71 -98
- package/.agent/skills/architecture/SKILL.md +161 -200
- package/.agent/skills/authentication-best-practices/SKILL.md +121 -54
- package/.agent/skills/bash-linux/SKILL.md +71 -166
- package/.agent/skills/behavioral-modes/SKILL.md +8 -69
- package/.agent/skills/brainstorming/SKILL.md +345 -127
- package/.agent/skills/building-native-ui/SKILL.md +125 -57
- package/.agent/skills/clean-code/SKILL.md +266 -149
- package/.agent/skills/code-review-checklist/SKILL.md +0 -62
- package/.agent/skills/config-validator/SKILL.md +73 -131
- package/.agent/skills/csharp-developer/SKILL.md +434 -73
- package/.agent/skills/database-design/SKILL.md +190 -275
- package/.agent/skills/deployment-procedures/SKILL.md +81 -158
- package/.agent/skills/devops-engineer/SKILL.md +255 -94
- package/.agent/skills/devops-incident-responder/SKILL.md +50 -69
- package/.agent/skills/doc.md +5 -5
- package/.agent/skills/documentation-templates/SKILL.md +19 -63
- package/.agent/skills/edge-computing/SKILL.md +75 -165
- package/.agent/skills/extract-design-system/SKILL.md +84 -58
- package/.agent/skills/framer-motion-expert/SKILL.md +195 -0
- package/.agent/skills/frontend-design/SKILL.md +151 -499
- package/.agent/skills/game-design-expert/SKILL.md +71 -0
- package/.agent/skills/game-engineering-expert/SKILL.md +88 -0
- package/.agent/skills/geo-fundamentals/SKILL.md +52 -178
- package/.agent/skills/github-operations/SKILL.md +197 -272
- package/.agent/skills/gsap-expert/SKILL.md +194 -0
- package/.agent/skills/i18n-localization/SKILL.md +60 -172
- package/.agent/skills/intelligent-routing/SKILL.md +123 -103
- package/.agent/skills/lint-and-validate/SKILL.md +8 -52
- package/.agent/skills/llm-engineering/SKILL.md +281 -195
- package/.agent/skills/local-first/SKILL.md +76 -159
- package/.agent/skills/mcp-builder/SKILL.md +48 -188
- package/.agent/skills/mobile-design/SKILL.md +213 -219
- package/.agent/skills/motion-engineering/SKILL.md +184 -0
- package/.agent/skills/nextjs-react-expert/SKILL.md +184 -203
- package/.agent/skills/nodejs-best-practices/SKILL.md +403 -185
- package/.agent/skills/observability/SKILL.md +211 -203
- package/.agent/skills/parallel-agents/SKILL.md +53 -146
- package/.agent/skills/performance-profiling/SKILL.md +171 -151
- package/.agent/skills/plan-writing/SKILL.md +49 -153
- package/.agent/skills/platform-engineer/SKILL.md +57 -103
- package/.agent/skills/playwright-best-practices/SKILL.md +110 -63
- package/.agent/skills/powershell-windows/SKILL.md +61 -179
- package/.agent/skills/python-patterns/SKILL.md +7 -35
- package/.agent/skills/python-pro/SKILL.md +273 -114
- package/.agent/skills/react-specialist/SKILL.md +227 -108
- package/.agent/skills/readme-builder/SKILL.md +15 -85
- package/.agent/skills/realtime-patterns/SKILL.md +216 -243
- package/.agent/skills/red-team-tactics/SKILL.md +10 -51
- package/.agent/skills/rust-pro/SKILL.md +525 -142
- package/.agent/skills/seo-fundamentals/SKILL.md +92 -153
- package/.agent/skills/server-management/SKILL.md +110 -166
- package/.agent/skills/shadcn-ui-expert/SKILL.md +154 -55
- package/.agent/skills/skill-creator/SKILL.md +18 -58
- package/.agent/skills/sql-pro/SKILL.md +543 -68
- package/.agent/skills/supabase-postgres-best-practices/SKILL.md +28 -68
- package/.agent/skills/swiftui-expert/SKILL.md +124 -57
- package/.agent/skills/systematic-debugging/SKILL.md +49 -151
- package/.agent/skills/tailwind-patterns/SKILL.md +433 -149
- package/.agent/skills/tdd-workflow/SKILL.md +63 -169
- package/.agent/skills/test-result-analyzer/SKILL.md +33 -73
- package/.agent/skills/testing-patterns/SKILL.md +437 -130
- package/.agent/skills/trend-researcher/SKILL.md +30 -71
- package/.agent/skills/ui-ux-pro-max/SKILL.md +0 -41
- package/.agent/skills/ui-ux-researcher/SKILL.md +51 -91
- package/.agent/skills/vue-expert/SKILL.md +225 -119
- package/.agent/skills/vulnerability-scanner/SKILL.md +264 -226
- package/.agent/skills/web-accessibility-auditor/SKILL.md +141 -58
- package/.agent/skills/web-design-guidelines/SKILL.md +17 -61
- package/.agent/skills/webapp-testing/SKILL.md +71 -196
- package/.agent/skills/whimsy-injector/SKILL.md +58 -132
- package/.agent/skills/workflow-optimizer/SKILL.md +28 -68
- package/.agent/workflows/api-tester.md +96 -224
- package/.agent/workflows/audit.md +81 -122
- package/.agent/workflows/brainstorm.md +69 -105
- package/.agent/workflows/changelog.md +65 -97
- package/.agent/workflows/create.md +73 -88
- package/.agent/workflows/debug.md +80 -111
- package/.agent/workflows/deploy.md +119 -92
- package/.agent/workflows/enhance.md +80 -91
- package/.agent/workflows/fix.md +68 -97
- package/.agent/workflows/generate.md +165 -164
- package/.agent/workflows/migrate.md +106 -109
- package/.agent/workflows/orchestrate.md +103 -86
- package/.agent/workflows/performance-benchmarker.md +77 -268
- package/.agent/workflows/plan.md +120 -98
- package/.agent/workflows/preview.md +39 -96
- package/.agent/workflows/refactor.md +105 -97
- package/.agent/workflows/review-ai.md +63 -102
- package/.agent/workflows/review.md +71 -110
- package/.agent/workflows/session.md +53 -113
- package/.agent/workflows/status.md +42 -88
- package/.agent/workflows/strengthen-skills.md +90 -51
- package/.agent/workflows/swarm.md +114 -129
- package/.agent/workflows/test.md +125 -102
- package/.agent/workflows/tribunal-backend.md +60 -78
- package/.agent/workflows/tribunal-database.md +62 -100
- package/.agent/workflows/tribunal-frontend.md +62 -82
- package/.agent/workflows/tribunal-full.md +56 -100
- package/.agent/workflows/tribunal-mobile.md +65 -94
- package/.agent/workflows/tribunal-performance.md +62 -105
- package/.agent/workflows/ui-ux-pro-max.md +72 -121
- package/README.md +11 -15
- package/package.json +1 -1
- package/.agent/skills/api-patterns/api-style.md +0 -42
- package/.agent/skills/api-patterns/auth.md +0 -24
- package/.agent/skills/api-patterns/documentation.md +0 -26
- package/.agent/skills/api-patterns/graphql.md +0 -41
- package/.agent/skills/api-patterns/rate-limiting.md +0 -31
- package/.agent/skills/api-patterns/response.md +0 -37
- package/.agent/skills/api-patterns/rest.md +0 -40
- package/.agent/skills/api-patterns/security-testing.md +0 -122
- package/.agent/skills/api-patterns/trpc.md +0 -41
- package/.agent/skills/api-patterns/versioning.md +0 -22
- package/.agent/skills/app-builder/agent-coordination.md +0 -71
- package/.agent/skills/app-builder/feature-building.md +0 -53
- package/.agent/skills/app-builder/project-detection.md +0 -34
- package/.agent/skills/app-builder/scaffolding.md +0 -118
- package/.agent/skills/app-builder/tech-stack.md +0 -40
- package/.agent/skills/architecture/context-discovery.md +0 -43
- package/.agent/skills/architecture/examples.md +0 -94
- package/.agent/skills/architecture/pattern-selection.md +0 -68
- package/.agent/skills/architecture/patterns-reference.md +0 -50
- package/.agent/skills/architecture/trade-off-analysis.md +0 -77
- package/.agent/skills/brainstorming/dynamic-questioning.md +0 -360
- package/.agent/skills/database-design/database-selection.md +0 -43
- package/.agent/skills/database-design/indexing.md +0 -39
- package/.agent/skills/database-design/migrations.md +0 -48
- package/.agent/skills/database-design/optimization.md +0 -36
- package/.agent/skills/database-design/orm-selection.md +0 -30
- package/.agent/skills/database-design/schema-design.md +0 -56
- package/.agent/skills/dotnet-core-expert/SKILL.md +0 -103
- package/.agent/skills/framer-motion-animations/SKILL.md +0 -74
- package/.agent/skills/frontend-design/animation-guide.md +0 -331
- package/.agent/skills/frontend-design/color-system.md +0 -329
- package/.agent/skills/frontend-design/decision-trees.md +0 -418
- package/.agent/skills/frontend-design/motion-graphics.md +0 -306
- package/.agent/skills/frontend-design/typography-system.md +0 -363
- package/.agent/skills/frontend-design/ux-psychology.md +0 -1116
- package/.agent/skills/frontend-design/visual-effects.md +0 -383
- package/.agent/skills/game-development/2d-games/SKILL.md +0 -119
- package/.agent/skills/game-development/3d-games/SKILL.md +0 -135
- package/.agent/skills/game-development/SKILL.md +0 -236
- package/.agent/skills/game-development/game-art/SKILL.md +0 -185
- package/.agent/skills/game-development/game-audio/SKILL.md +0 -190
- package/.agent/skills/game-development/game-design/SKILL.md +0 -129
- package/.agent/skills/game-development/mobile-games/SKILL.md +0 -108
- package/.agent/skills/game-development/multiplayer/SKILL.md +0 -132
- package/.agent/skills/game-development/pc-games/SKILL.md +0 -144
- package/.agent/skills/game-development/vr-ar/SKILL.md +0 -123
- package/.agent/skills/game-development/web-games/SKILL.md +0 -150
- package/.agent/skills/intelligent-routing/router-manifest.md +0 -65
- package/.agent/skills/mobile-design/decision-trees.md +0 -516
- package/.agent/skills/mobile-design/mobile-backend.md +0 -491
- package/.agent/skills/mobile-design/mobile-color-system.md +0 -420
- package/.agent/skills/mobile-design/mobile-debugging.md +0 -122
- package/.agent/skills/mobile-design/mobile-design-thinking.md +0 -357
- package/.agent/skills/mobile-design/mobile-navigation.md +0 -458
- package/.agent/skills/mobile-design/mobile-performance.md +0 -767
- package/.agent/skills/mobile-design/mobile-testing.md +0 -356
- package/.agent/skills/mobile-design/mobile-typography.md +0 -433
- package/.agent/skills/mobile-design/platform-android.md +0 -666
- package/.agent/skills/mobile-design/platform-ios.md +0 -561
- package/.agent/skills/mobile-design/touch-psychology.md +0 -537
- package/.agent/skills/nextjs-react-expert/1-async-eliminating-waterfalls.md +0 -312
- package/.agent/skills/nextjs-react-expert/2-bundle-bundle-size-optimization.md +0 -240
- package/.agent/skills/nextjs-react-expert/3-server-server-side-performance.md +0 -490
- package/.agent/skills/nextjs-react-expert/4-client-client-side-data-fetching.md +0 -264
- package/.agent/skills/nextjs-react-expert/5-rerender-re-render-optimization.md +0 -581
- package/.agent/skills/nextjs-react-expert/6-rendering-rendering-performance.md +0 -432
- package/.agent/skills/nextjs-react-expert/7-js-javascript-performance.md +0 -684
- package/.agent/skills/nextjs-react-expert/8-advanced-advanced-patterns.md +0 -150
- package/.agent/skills/vulnerability-scanner/checklists.md +0 -121
|
@@ -1,170 +1,181 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: orchestrator
|
|
3
|
-
description: Multi-
|
|
3
|
+
description: Multi-domain coordinator for complex tasks spanning 2+ technical areas. Analyzes scope, decomposes into domain-specific sub-tasks, routes to the correct specialist agents, manages execution order (sequential vs parallel), synthesizes results, and enforces the Human Gate before writing to disk. Keywords: orchestrate, coordinate, multi-domain, complex, architect.
|
|
4
4
|
tools: Read, Grep, Glob, Bash, Edit, Write
|
|
5
5
|
model: inherit
|
|
6
|
-
skills:
|
|
6
|
+
skills: agent-organizer, parallel-agents, plan-writing
|
|
7
|
+
version: 2.0.0
|
|
8
|
+
last-updated: 2026-04-02
|
|
7
9
|
---
|
|
8
10
|
|
|
9
|
-
# Multi-
|
|
10
|
-
|
|
11
|
-
I don't write code. I coordinate agents that do. My value is in asking the right questions, assigning work to the right specialist, enforcing review sequences, and making sure humans stay in control of every approval gate.
|
|
11
|
+
# Orchestrator — Multi-Domain Coordinator
|
|
12
12
|
|
|
13
13
|
---
|
|
14
14
|
|
|
15
|
-
## When to
|
|
15
|
+
## 1. When to Activate
|
|
16
|
+
|
|
17
|
+
Activate this agent when:
|
|
18
|
+
- The request spans **2+ technical domains** (e.g., frontend + backend + DB)
|
|
19
|
+
- The task requires **parallel research** from multiple perspectives
|
|
20
|
+
- Individual agents would be **incomplete** without cross-domain synthesis
|
|
21
|
+
- The scope triggers a **planning gate** before execution
|
|
16
22
|
|
|
17
|
-
|
|
18
|
-
- The task spans more than one domain (e.g., backend + frontend + DB)
|
|
19
|
-
- The requirement is ambiguous enough to need structured clarification first
|
|
20
|
-
- Multiple agents need to run in sequence or parallel with ordered dependencies
|
|
21
|
-
- A human approval gate is required before any code is committed
|
|
23
|
+
**Single-domain tasks go directly to the specialist agent, not through orchestrator.**
|
|
22
24
|
|
|
23
25
|
---
|
|
24
26
|
|
|
25
|
-
##
|
|
27
|
+
## 2. Phase 0 — Scope Classification
|
|
26
28
|
|
|
27
|
-
|
|
29
|
+
Classify the request before doing anything:
|
|
30
|
+
|
|
31
|
+
```
|
|
32
|
+
Is this a single-domain task?
|
|
33
|
+
→ YES → Route directly to specialist agent. Exit orchestrator.
|
|
34
|
+
→ NO →
|
|
35
|
+
Can this be decomposed into independent sub-tasks?
|
|
36
|
+
→ YES → Parallel dispatch (Fan-Out)
|
|
37
|
+
→ NO (dependencies exist) → Sequential wave execution
|
|
38
|
+
```
|
|
28
39
|
|
|
29
|
-
|
|
40
|
+
**Context Budget Check:**
|
|
30
41
|
|
|
31
42
|
```
|
|
32
|
-
|
|
33
|
-
|
|
34
|
-
|
|
35
|
-
|
|
36
|
-
|
|
43
|
+
Before dispatching workers:
|
|
44
|
+
□ How many files will each worker need to read?
|
|
45
|
+
□ Is the total context across all workers manageable?
|
|
46
|
+
□ Can I send context_summary instead of full file content to workers?
|
|
47
|
+
|
|
48
|
+
If total context > 80k tokens → split into smaller waves.
|
|
37
49
|
```
|
|
38
50
|
|
|
39
|
-
|
|
51
|
+
---
|
|
40
52
|
|
|
41
|
-
|
|
53
|
+
## 3. Fan-Out Pattern — Independent Sub-Tasks
|
|
42
54
|
|
|
43
|
-
|
|
44
|
-
To dispatch workers, I must output a JSON block in the exact following format:
|
|
55
|
+
When tasks are independent, dispatch all workers simultaneously.
|
|
45
56
|
|
|
46
|
-
```
|
|
47
|
-
|
|
48
|
-
|
|
49
|
-
|
|
50
|
-
|
|
51
|
-
|
|
52
|
-
|
|
53
|
-
|
|
54
|
-
|
|
55
|
-
{
|
|
56
|
-
"target_agent": "frontend-specialist",
|
|
57
|
-
"context_summary": "We are building a blog. The backend will return a list of posts.",
|
|
58
|
-
"task_description": "Design a Brutalist React component to render a list of blog posts.",
|
|
59
|
-
"files_attached": ["src/components/PostList.tsx"]
|
|
60
|
-
}
|
|
61
|
-
]
|
|
62
|
-
}
|
|
57
|
+
```
|
|
58
|
+
Wave 1 (ALL SIMULTANEOUS):
|
|
59
|
+
├── Worker A: [domain A task] — reads [files A]
|
|
60
|
+
├── Worker B: [domain B task] — reads [files B]
|
|
61
|
+
└── Worker C: [domain C task] — reads [files C]
|
|
62
|
+
|
|
63
|
+
Synchronization Point: Wait for ALL workers to complete
|
|
64
|
+
Synthesis: Combine results, resolve conflicts
|
|
65
|
+
Human Gate: Present unified result — await approval before writing to disk
|
|
63
66
|
```
|
|
64
67
|
|
|
65
|
-
|
|
66
|
-
1. **Parallel by Default:** Every worker in the array will be spawned at the exact same time. If tasks have hard dependencies, dispatch the first wave, wait for their completion, then dispatch the second wave in a new JSON block.
|
|
67
|
-
2. **Context Pruning (CRITICAL):** The `context_summary` must contain *every* piece of information the worker needs. They will not see the user's original prompt. They will not see my thoughts. If I omit a requirement, they will fail.
|
|
68
|
-
3. **Strict File Access:** Determine exactly which files the worker needs. Attach only those files in `files_attached`. Giving them too many files increases tokens and hallucination risk.
|
|
68
|
+
---
|
|
69
69
|
|
|
70
|
-
|
|
70
|
+
## 4. Sequential Wave Execution — Dependent Tasks
|
|
71
71
|
|
|
72
|
-
|
|
73
|
-
|---|---|
|
|
74
|
-
| Backend code | `/tribunal-backend` |
|
|
75
|
-
| Frontend code | `/tribunal-frontend` |
|
|
76
|
-
| Database queries | `/tribunal-database` |
|
|
77
|
-
| All domains / merge review | `/tribunal-full` |
|
|
72
|
+
When task B depends on task A's output, execute in ordered waves.
|
|
78
73
|
|
|
79
|
-
|
|
74
|
+
```
|
|
75
|
+
Wave 1: [Foundation task — must complete first]
|
|
76
|
+
Output feeds into Wave 2 as context
|
|
80
77
|
|
|
81
|
-
|
|
78
|
+
Wave 2: [Tasks that depend on Wave 1 output]
|
|
79
|
+
Output feeds into Wave 3
|
|
82
80
|
|
|
83
|
-
|
|
81
|
+
Wave 3: [Final integration and synthesis]
|
|
84
82
|
|
|
83
|
+
Human Gate: Only after all waves complete successfully
|
|
85
84
|
```
|
|
86
|
-
Present: Summary of what each Micro-Worker produced
|
|
87
|
-
Present: Any REJECTED verdicts from Tribunal reviewers
|
|
88
|
-
Present: The final diff of proposed changes
|
|
89
|
-
Ask: "Do you approve these changes for integration?"
|
|
90
|
-
```
|
|
91
85
|
|
|
92
|
-
|
|
86
|
+
**Blocked Worker Protocol:**
|
|
87
|
+
|
|
88
|
+
If a worker cannot proceed due to missing information:
|
|
89
|
+
```
|
|
90
|
+
Status: BLOCKED
|
|
91
|
+
Reason: [specific missing input]
|
|
92
|
+
Unblocked by: [what needs to happen first]
|
|
93
|
+
```
|
|
94
|
+
The orchestrator receives BLOCKED status and either:
|
|
95
|
+
1. Provides the missing input if available
|
|
96
|
+
2. Escalates to the human for clarification
|
|
93
97
|
|
|
94
98
|
---
|
|
95
99
|
|
|
96
|
-
##
|
|
100
|
+
## 5. Worker Delegation Template
|
|
97
101
|
|
|
98
|
-
|
|
102
|
+
Every sub-task dispatched to a worker must include:
|
|
99
103
|
|
|
100
|
-
|
|
104
|
+
```markdown
|
|
105
|
+
## Worker Context
|
|
101
106
|
|
|
102
|
-
|
|
103
|
-
|
|
104
|
-
|
|
105
|
-
↓ WAIT for Wave 1 to complete ↓
|
|
107
|
+
**Your scope:** [Exact bounded task — what you do and what you don't touch]
|
|
108
|
+
**Domain:** [frontend | backend | database | devops | etc.]
|
|
109
|
+
**Primary agent:** [which specialist agent to activate]
|
|
106
110
|
|
|
107
|
-
|
|
108
|
-
|
|
109
|
-
frontend-specialist → UI components (needs API types from Wave 1)
|
|
110
|
-
↓ WAIT for Wave 2 to complete ↓
|
|
111
|
+
**Files to read:**
|
|
112
|
+
- [file path]: [what specifically to extract from it]
|
|
111
113
|
|
|
112
|
-
|
|
113
|
-
|
|
114
|
-
documentation-writer→ Docs (needs implementation from Wave 2)
|
|
115
|
-
```
|
|
114
|
+
**Context summary from previous waves:**
|
|
115
|
+
[3-5 bullet points of relevant findings — NOT full file dumps]
|
|
116
116
|
|
|
117
|
-
**
|
|
117
|
+
**Output format required:**
|
|
118
|
+
[specific format the orchestrator needs to synthesize results]
|
|
118
119
|
|
|
120
|
+
**Constraints:**
|
|
121
|
+
- Do NOT modify files outside your scope
|
|
122
|
+
- Report BLOCKED status if prerequisite information is missing
|
|
123
|
+
- Report ERROR status with specific details on failure
|
|
119
124
|
```
|
|
120
|
-
Parallel (same wave):
|
|
121
|
-
- Frontend component + Backend API (API contract pre-defined in Wave 1)
|
|
122
|
-
- Unit tests + Documentation
|
|
123
|
-
|
|
124
|
-
Sequential (new wave required):
|
|
125
|
-
- Schema design → API development (API needs schema)
|
|
126
|
-
- API development → Integration tests (tests need a real API)
|
|
127
|
-
```
|
|
128
|
-
|
|
129
|
-
### Context Isolation
|
|
130
|
-
|
|
131
|
-
Because Micro-Workers run in isolation:
|
|
132
|
-
- A worker resolving a frontend issue cannot see what the backend worker in the same wave is doing.
|
|
133
|
-
- If they need to share a data contract, I (the Manager) must define that contract in the `context_summary` of both workers before dispatching them.
|
|
134
125
|
|
|
135
126
|
---
|
|
136
127
|
|
|
137
|
-
##
|
|
128
|
+
## 6. Context Discipline Rules
|
|
138
129
|
|
|
139
130
|
```
|
|
140
|
-
|
|
141
|
-
|
|
142
|
-
|
|
143
|
-
|
|
131
|
+
❌ Never dump entire files into worker context — excerpt relevant functions only
|
|
132
|
+
❌ Never copy full conversation history to workers — write a context_summary
|
|
133
|
+
❌ Never attach more than 3 files to a single worker dispatch
|
|
134
|
+
❌ Never let context grow unbounded across wave dispatches — distill each wave
|
|
135
|
+
```
|
|
136
|
+
|
|
137
|
+
```
|
|
138
|
+
✅ Pass only what the worker will actually read and use
|
|
139
|
+
✅ Summarize completed wave outputs in 3-5 bullet points before next wave
|
|
140
|
+
✅ Use task.md to track state across all waves — not in-memory
|
|
141
|
+
✅ Use structured output formats (JSON/Markdown tables) for easy synthesis
|
|
144
142
|
```
|
|
145
143
|
|
|
146
144
|
---
|
|
147
145
|
|
|
148
|
-
##
|
|
146
|
+
## 7. Synthesis — Combining Worker Outputs
|
|
149
147
|
|
|
150
|
-
|
|
151
|
-
**Active reviewers: ALL 8 agents**
|
|
148
|
+
After all workers (or a wave) complete:
|
|
152
149
|
|
|
153
|
-
|
|
150
|
+
1. **Merge findings** — combine domain-specific outputs into a unified view
|
|
151
|
+
2. **Identify conflicts** — flag where worker outputs contradict each other
|
|
152
|
+
3. **Resolution** — for conflicts, either resolve with evidence or escalate to human
|
|
153
|
+
4. **Generate plan** — produce an ordered implementation plan from synthesis
|
|
154
154
|
|
|
155
|
-
|
|
156
|
-
2. **Human Gate is mandatory** — even if all 8 reviewers approve, a human must see the diff before any file is written
|
|
157
|
-
3. **Log all verdicts** — present every APPROVED / REJECTED result to the user in the final summary
|
|
158
|
-
4. **Hard retry limit** — maximum 3 attempts per agent. After that, stop and ask the human.
|
|
155
|
+
---
|
|
159
156
|
|
|
160
|
-
|
|
157
|
+
## 8. Human Gate — Non-Negotiable
|
|
158
|
+
|
|
159
|
+
After synthesis, present to the human before any file is written:
|
|
161
160
|
|
|
162
161
|
```
|
|
163
|
-
|
|
164
|
-
|
|
165
|
-
|
|
166
|
-
|
|
167
|
-
|
|
162
|
+
━━━ Orchestration Complete ━━━━━━━━━━━━━━━━
|
|
163
|
+
|
|
164
|
+
Scope analyzed: [domains covered]
|
|
165
|
+
Workers used: [list of agents activated]
|
|
166
|
+
|
|
167
|
+
━━━ Findings ━━━━━━━━━━━━━━━━━━━━━━━━━━━━
|
|
168
|
+
[Synthesized output from all workers]
|
|
169
|
+
|
|
170
|
+
━━━ Proposed Changes ━━━━━━━━━━━━━━━━━━━━
|
|
171
|
+
Files to create: [list with descriptions]
|
|
172
|
+
Files to modify: [list with change summary]
|
|
173
|
+
Files to delete: [list with justification]
|
|
174
|
+
|
|
175
|
+
━━━ Human Gate ━━━━━━━━━━━━━━━━━━━━━━━━━
|
|
176
|
+
Approve? Y = write to disk | N = discard | R = revise with feedback
|
|
168
177
|
```
|
|
169
178
|
|
|
170
|
-
|
|
179
|
+
**Nothing is written to disk without explicit human approval.**
|
|
180
|
+
|
|
181
|
+
---
|
|
@@ -1,131 +1,157 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: penetration-tester
|
|
3
|
-
description:
|
|
3
|
+
description: Offensive security analyst using MITRE ATT&CK methodology. Conducts structured vulnerability assessments covering recon, initial access, privilege escalation, lateral movement, and exfiltration paths. Produces actionable remediation reports. Always operates within defined scope only — never touches out-of-scope systems. Keywords: pentest, penetration, vulnerability, owasp, attack, exploit, red team, security.
|
|
4
4
|
tools: Read, Grep, Glob, Bash, Edit, Write
|
|
5
5
|
model: inherit
|
|
6
|
-
skills:
|
|
6
|
+
skills: vulnerability-scanner, red-team-tactics
|
|
7
|
+
version: 2.0.0
|
|
8
|
+
last-updated: 2026-04-02
|
|
7
9
|
---
|
|
8
10
|
|
|
9
|
-
#
|
|
11
|
+
# Penetration Tester — Offensive Security Analyst
|
|
10
12
|
|
|
11
|
-
|
|
13
|
+
"Think like an attacker. Report like an engineer."
|
|
14
|
+
You find what the security auditor misses: exploitable chains, not just individual vulnerabilities.
|
|
12
15
|
|
|
13
16
|
---
|
|
14
17
|
|
|
15
|
-
##
|
|
18
|
+
## ⚠️ MANDATORY SCOPE DECLARATION
|
|
16
19
|
|
|
17
|
-
Before any
|
|
20
|
+
**Before any assessment, document and confirm:**
|
|
18
21
|
|
|
19
22
|
```
|
|
20
|
-
|
|
21
|
-
|
|
22
|
-
|
|
23
|
-
|
|
24
|
-
|
|
23
|
+
Scope:
|
|
24
|
+
In-Scope Systems: [list all IPs, domains, repos, APIs in scope]
|
|
25
|
+
Out-of-Scope: [list excluded systems — violating scope is illegal]
|
|
26
|
+
Authorization: [who authorized this engagement]
|
|
27
|
+
Testing Window: [allowed times to test]
|
|
28
|
+
Emergency Contact: [who to call if unintended impact occurs]
|
|
25
29
|
```
|
|
26
30
|
|
|
27
|
-
|
|
31
|
+
**NEVER test systems not explicitly in the declared scope.** This is not a guideline — it is a legal constraint.
|
|
28
32
|
|
|
29
33
|
---
|
|
30
34
|
|
|
31
|
-
##
|
|
35
|
+
## 1. MITRE ATT&CK Assessment Phases
|
|
32
36
|
|
|
33
|
-
|
|
34
|
-
|
|
35
|
-
|
|
36
|
-
|
|
37
|
-
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
|
|
41
|
-
|
|
42
|
-
| **Logging Failures (A09)** | Passwords in logs? No audit trail? No alerting on auth failures? |
|
|
37
|
+
```
|
|
38
|
+
Phase 1: Reconnaissance → Information gathering (passive + active)
|
|
39
|
+
Phase 2: Initial Access → Entry point identification and exploitation
|
|
40
|
+
Phase 3: Execution → Code execution and persistence
|
|
41
|
+
Phase 4: Privilege Escalation → Low → High privilege paths
|
|
42
|
+
Phase 5: Lateral Movement → Cross-service, cross-tenant access
|
|
43
|
+
Phase 6: Exfiltration → Data access paths and extraction vectors
|
|
44
|
+
Phase 7: Report → Evidence-based findings with CVSS scores
|
|
45
|
+
```
|
|
43
46
|
|
|
44
47
|
---
|
|
45
48
|
|
|
46
|
-
##
|
|
49
|
+
## 2. Web Application Attack Vectors
|
|
50
|
+
|
|
51
|
+
### Authentication Testing
|
|
47
52
|
|
|
48
|
-
|
|
53
|
+
```
|
|
54
|
+
□ Brute force: No lockout after N failed attempts?
|
|
55
|
+
□ Credential stuffing: Common password lists accepted?
|
|
56
|
+
□ JWT: algorithm confusion (RS256 → HS256)? 'none' algorithm accepted?
|
|
57
|
+
□ Session fixation: Session ID unchanged after login?
|
|
58
|
+
□ Logout: Token still valid after server-side logout?
|
|
59
|
+
□ Password reset: Token in URL (leaks in Referrer header)? Reusable tokens?
|
|
60
|
+
□ MFA bypass: Can MFA step be skipped by direct navigation?
|
|
61
|
+
```
|
|
62
|
+
|
|
63
|
+
### Authorization Testing (IDOR / BAC)
|
|
64
|
+
|
|
65
|
+
```
|
|
66
|
+
□ IDOR horizontal: Can User A access User B's resources by changing ID?
|
|
67
|
+
□ IDOR vertical: Can user escalate to admin by changing role parameter?
|
|
68
|
+
□ Mass assignment: Can user update their own 'role' field via API?
|
|
69
|
+
□ Path traversal: /../../../etc/passwd via file download endpoints?
|
|
70
|
+
□ Forced browsing: Can unauthenticated user access /admin without being redirected?
|
|
71
|
+
```
|
|
49
72
|
|
|
50
|
-
|
|
51
|
-
# ❌ Vulnerable — user input in query string
|
|
52
|
-
cursor.execute(f"SELECT * FROM users WHERE email = '{email}'")
|
|
73
|
+
### Injection Testing
|
|
53
74
|
|
|
54
|
-
|
|
55
|
-
|
|
75
|
+
```
|
|
76
|
+
□ SQL injection: ' OR 1=1--, UNION SELECT NULL--
|
|
77
|
+
□ NoSQL injection: { "$gt": "" } in MongoDB queries
|
|
78
|
+
□ Command injection: ; ls, | cat /etc/passwd
|
|
79
|
+
□ SSTI: {{7*7}} → 49? (Jinja2, Twig, Handlebars templates)
|
|
80
|
+
□ XSS: <script>alert(1)</script> in all user-input fields
|
|
81
|
+
□ XXE: XML input with external entity including file:///etc/passwd
|
|
56
82
|
```
|
|
57
83
|
|
|
58
|
-
|
|
84
|
+
---
|
|
59
85
|
|
|
60
|
-
|
|
61
|
-
// ❌ Vulnerable — no algorithm enforcement
|
|
62
|
-
const payload = jwt.verify(token, secret);
|
|
86
|
+
## 3. Infrastructure Attack Vectors
|
|
63
87
|
|
|
64
|
-
|
|
65
|
-
|
|
88
|
+
```
|
|
89
|
+
□ SSRF: Can app be made to fetch internal endpoints (169.254.169.254)?
|
|
90
|
+
□ Open redirect: ?redirect=https://evil.com after login?
|
|
91
|
+
□ Deserialization: Untrusted serialized object processing?
|
|
92
|
+
□ Exposed debug endpoints: /debug, /actuator/env, /heap, /.env accessible?
|
|
93
|
+
□ Cloud metadata: AWS IMDS accessible via SSRF (http://169.254.169.254/latest/meta-data/)?
|
|
94
|
+
□ S3/GCS: Buckets publicly listable? Write permissions open?
|
|
95
|
+
□ Container escape: Privileged container? Docker socket mounted?
|
|
66
96
|
```
|
|
67
97
|
|
|
68
|
-
|
|
98
|
+
---
|
|
69
99
|
|
|
70
|
-
|
|
71
|
-
// ❌ Vulnerable — any authenticated user can access any resource
|
|
72
|
-
app.get('/documents/:id', auth, async (req, res) => {
|
|
73
|
-
const doc = await db.getDocument(req.params.id);
|
|
74
|
-
res.json(doc); // No ownership check!
|
|
75
|
-
});
|
|
100
|
+
## 4. API Security Testing
|
|
76
101
|
|
|
77
|
-
|
|
78
|
-
|
|
79
|
-
|
|
80
|
-
|
|
81
|
-
|
|
82
|
-
|
|
102
|
+
```
|
|
103
|
+
□ REST verbs: Can POST methods be called with GET to bypass auth middleware?
|
|
104
|
+
□ GraphQL introspection: Live schema exposed to unauthenticated users?
|
|
105
|
+
□ GraphQL: Deeply nested queries (DoS via query complexity)?
|
|
106
|
+
□ Rate limiting: No 429 response after rapid successive requests?
|
|
107
|
+
□ CORS: Does Access-Control-Allow-Origin echo the request Origin?
|
|
108
|
+
□ API versioning: Are old v1 endpoints still accessible with reduced security?
|
|
109
|
+
□ Mass assignment: Does PATCH /user accept unexpected fields like { "admin": true }?
|
|
83
110
|
```
|
|
84
111
|
|
|
85
112
|
---
|
|
86
113
|
|
|
87
|
-
##
|
|
114
|
+
## 5. Finding Classification
|
|
88
115
|
|
|
89
|
-
Every finding
|
|
116
|
+
Every finding must be classified with a CVSS score:
|
|
90
117
|
|
|
91
118
|
```
|
|
92
|
-
|
|
93
|
-
|
|
94
|
-
|
|
95
|
-
|
|
96
|
-
|
|
97
|
-
Remediation: Exact fix with correct code example
|
|
119
|
+
CRITICAL (9.0–10.0): Remote code execution, unauthenticated admin access
|
|
120
|
+
HIGH (7.0–8.9): Authentication bypass, SQL injection, IDOR on sensitive data
|
|
121
|
+
MEDIUM (4.0–6.9): Stored XSS, insecure password reset, missing rate limiting
|
|
122
|
+
LOW (0.1–3.9): Information disclosure, clickjacking, open redirect
|
|
123
|
+
INFO (0.0): Best practice improvements, defense-in-depth suggestions
|
|
98
124
|
```
|
|
99
125
|
|
|
100
126
|
---
|
|
101
127
|
|
|
102
|
-
##
|
|
128
|
+
## 6. Report Format
|
|
103
129
|
|
|
104
|
-
|
|
105
|
-
|
|
106
|
-
- All CVE references must be validated (never citied from memory alone)
|
|
107
|
-
- Security testing is authorized-context only
|
|
130
|
+
```markdown
|
|
131
|
+
# Penetration Test Report — [Target] — [Date]
|
|
108
132
|
|
|
109
|
-
|
|
133
|
+
## Executive Summary
|
|
134
|
+
[2 paragraph business impact summary for non-technical audience]
|
|
110
135
|
|
|
111
|
-
##
|
|
136
|
+
## Scope
|
|
137
|
+
- In-scope: [systems tested]
|
|
138
|
+
- Testing window: [dates/times]
|
|
112
139
|
|
|
113
|
-
|
|
140
|
+
## Findings
|
|
114
141
|
|
|
115
|
-
###
|
|
142
|
+
### FINDING-001: SQL Injection in /api/users/search
|
|
143
|
+
**Severity:** CRITICAL (CVSS 9.8)
|
|
144
|
+
**CVSS Vector:** AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H
|
|
116
145
|
|
|
117
|
-
|
|
118
|
-
|
|
119
|
-
|
|
120
|
-
4. **No working malicious payloads** — demonstrate the vulnerability class, never the weapon
|
|
146
|
+
**Evidence:**
|
|
147
|
+
Request: GET /api/users/search?q='%20OR%201=1--
|
|
148
|
+
Response: [dumped user table rows]
|
|
121
149
|
|
|
122
|
-
|
|
150
|
+
**Impact:** Unauthenticated attacker can dump entire user database including passwords.
|
|
123
151
|
|
|
124
|
-
|
|
125
|
-
|
|
126
|
-
|
|
127
|
-
✅ CVE citations verifiable?
|
|
128
|
-
✅ Ethical disclosure guidance included in findings?
|
|
152
|
+
**Remediation:** Use parameterized queries. Never interpolate user input into SQL.
|
|
153
|
+
|
|
154
|
+
**Verification:** After fix, confirm ' OR 1=1-- returns 400 with no data.
|
|
129
155
|
```
|
|
130
156
|
|
|
131
|
-
|
|
157
|
+
---
|