tribunal-kit 1.0.0 β 2.4.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.agent/.shared/ui-ux-pro-max/README.md +3 -3
- package/.agent/ARCHITECTURE.md +205 -10
- package/.agent/GEMINI.md +37 -7
- package/.agent/agents/accessibility-reviewer.md +134 -0
- package/.agent/agents/ai-code-reviewer.md +129 -0
- package/.agent/agents/frontend-specialist.md +3 -0
- package/.agent/agents/game-developer.md +21 -21
- package/.agent/agents/logic-reviewer.md +12 -0
- package/.agent/agents/mobile-reviewer.md +79 -0
- package/.agent/agents/orchestrator.md +56 -26
- package/.agent/agents/performance-reviewer.md +36 -0
- package/.agent/agents/supervisor-agent.md +156 -0
- package/.agent/agents/swarm-worker-contracts.md +166 -0
- package/.agent/agents/swarm-worker-registry.md +92 -0
- package/.agent/rules/GEMINI.md +134 -5
- package/.agent/scripts/bundle_analyzer.py +259 -0
- package/.agent/scripts/dependency_analyzer.py +247 -0
- package/.agent/scripts/lint_runner.py +188 -0
- package/.agent/scripts/patch_skills_meta.py +177 -0
- package/.agent/scripts/patch_skills_output.py +285 -0
- package/.agent/scripts/schema_validator.py +279 -0
- package/.agent/scripts/security_scan.py +224 -0
- package/.agent/scripts/session_manager.py +144 -3
- package/.agent/scripts/skill_integrator.py +234 -0
- package/.agent/scripts/strengthen_skills.py +220 -0
- package/.agent/scripts/swarm_dispatcher.py +317 -0
- package/.agent/scripts/test_runner.py +192 -0
- package/.agent/scripts/test_swarm_dispatcher.py +163 -0
- package/.agent/skills/agent-organizer/SKILL.md +132 -0
- package/.agent/skills/agentic-patterns/SKILL.md +335 -0
- package/.agent/skills/api-patterns/SKILL.md +226 -50
- package/.agent/skills/app-builder/SKILL.md +215 -52
- package/.agent/skills/architecture/SKILL.md +176 -31
- package/.agent/skills/bash-linux/SKILL.md +150 -134
- package/.agent/skills/behavioral-modes/SKILL.md +152 -160
- package/.agent/skills/brainstorming/SKILL.md +148 -101
- package/.agent/skills/brainstorming/dynamic-questioning.md +10 -0
- package/.agent/skills/clean-code/SKILL.md +139 -134
- package/.agent/skills/code-review-checklist/SKILL.md +177 -80
- package/.agent/skills/config-validator/SKILL.md +165 -0
- package/.agent/skills/csharp-developer/SKILL.md +107 -0
- package/.agent/skills/database-design/SKILL.md +252 -29
- package/.agent/skills/deployment-procedures/SKILL.md +122 -175
- package/.agent/skills/devops-engineer/SKILL.md +134 -0
- package/.agent/skills/devops-incident-responder/SKILL.md +98 -0
- package/.agent/skills/documentation-templates/SKILL.md +175 -121
- package/.agent/skills/dotnet-core-expert/SKILL.md +103 -0
- package/.agent/skills/edge-computing/SKILL.md +213 -0
- package/.agent/skills/frontend-design/SKILL.md +76 -0
- package/.agent/skills/frontend-design/color-system.md +18 -0
- package/.agent/skills/frontend-design/typography-system.md +18 -0
- package/.agent/skills/game-development/SKILL.md +69 -0
- package/.agent/skills/geo-fundamentals/SKILL.md +158 -99
- package/.agent/skills/github-operations/SKILL.md +354 -0
- package/.agent/skills/i18n-localization/SKILL.md +158 -96
- package/.agent/skills/intelligent-routing/SKILL.md +89 -285
- package/.agent/skills/intelligent-routing/router-manifest.md +65 -0
- package/.agent/skills/lint-and-validate/SKILL.md +229 -27
- package/.agent/skills/llm-engineering/SKILL.md +258 -0
- package/.agent/skills/local-first/SKILL.md +203 -0
- package/.agent/skills/mcp-builder/SKILL.md +159 -111
- package/.agent/skills/mobile-design/SKILL.md +102 -282
- package/.agent/skills/nextjs-react-expert/SKILL.md +143 -227
- package/.agent/skills/nodejs-best-practices/SKILL.md +201 -254
- package/.agent/skills/observability/SKILL.md +285 -0
- package/.agent/skills/parallel-agents/SKILL.md +124 -118
- package/.agent/skills/performance-profiling/SKILL.md +143 -89
- package/.agent/skills/plan-writing/SKILL.md +133 -97
- package/.agent/skills/platform-engineer/SKILL.md +135 -0
- package/.agent/skills/powershell-windows/SKILL.md +167 -104
- package/.agent/skills/python-patterns/SKILL.md +149 -361
- package/.agent/skills/python-pro/SKILL.md +114 -0
- package/.agent/skills/react-specialist/SKILL.md +107 -0
- package/.agent/skills/readme-builder/SKILL.md +270 -0
- package/.agent/skills/realtime-patterns/SKILL.md +296 -0
- package/.agent/skills/red-team-tactics/SKILL.md +136 -134
- package/.agent/skills/rust-pro/SKILL.md +237 -173
- package/.agent/skills/seo-fundamentals/SKILL.md +134 -82
- package/.agent/skills/server-management/SKILL.md +155 -104
- package/.agent/skills/sql-pro/SKILL.md +104 -0
- package/.agent/skills/systematic-debugging/SKILL.md +156 -79
- package/.agent/skills/tailwind-patterns/SKILL.md +163 -205
- package/.agent/skills/tdd-workflow/SKILL.md +148 -88
- package/.agent/skills/test-result-analyzer/SKILL.md +299 -0
- package/.agent/skills/testing-patterns/SKILL.md +141 -114
- package/.agent/skills/trend-researcher/SKILL.md +228 -0
- package/.agent/skills/ui-ux-pro-max/SKILL.md +107 -0
- package/.agent/skills/ui-ux-researcher/SKILL.md +234 -0
- package/.agent/skills/vue-expert/SKILL.md +118 -0
- package/.agent/skills/vulnerability-scanner/SKILL.md +228 -188
- package/.agent/skills/web-design-guidelines/SKILL.md +148 -33
- package/.agent/skills/webapp-testing/SKILL.md +171 -122
- package/.agent/skills/whimsy-injector/SKILL.md +349 -0
- package/.agent/skills/workflow-optimizer/SKILL.md +219 -0
- package/.agent/workflows/api-tester.md +279 -0
- package/.agent/workflows/audit.md +168 -0
- package/.agent/workflows/brainstorm.md +65 -19
- package/.agent/workflows/changelog.md +144 -0
- package/.agent/workflows/create.md +67 -14
- package/.agent/workflows/debug.md +122 -30
- package/.agent/workflows/deploy.md +82 -31
- package/.agent/workflows/enhance.md +59 -27
- package/.agent/workflows/fix.md +143 -0
- package/.agent/workflows/generate.md +84 -20
- package/.agent/workflows/migrate.md +163 -0
- package/.agent/workflows/orchestrate.md +66 -17
- package/.agent/workflows/performance-benchmarker.md +305 -0
- package/.agent/workflows/plan.md +76 -33
- package/.agent/workflows/preview.md +73 -17
- package/.agent/workflows/refactor.md +153 -0
- package/.agent/workflows/review-ai.md +140 -0
- package/.agent/workflows/review.md +83 -16
- package/.agent/workflows/session.md +154 -0
- package/.agent/workflows/status.md +74 -18
- package/.agent/workflows/strengthen-skills.md +99 -0
- package/.agent/workflows/swarm.md +194 -0
- package/.agent/workflows/test.md +80 -31
- package/.agent/workflows/tribunal-backend.md +55 -13
- package/.agent/workflows/tribunal-database.md +62 -18
- package/.agent/workflows/tribunal-frontend.md +58 -12
- package/.agent/workflows/tribunal-full.md +70 -11
- package/.agent/workflows/tribunal-mobile.md +123 -0
- package/.agent/workflows/tribunal-performance.md +152 -0
- package/.agent/workflows/ui-ux-pro-max.md +100 -82
- package/README.md +117 -62
- package/bin/tribunal-kit.js +542 -288
- package/package.json +10 -6
|
@@ -1,242 +1,234 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: behavioral-modes
|
|
3
3
|
description: AI operational modes (brainstorm, implement, debug, review, teach, ship, orchestrate). Use to adapt behavior based on task type.
|
|
4
|
-
allowed-tools: Read, Glob, Grep
|
|
4
|
+
allowed-tools: Read, Write, Edit, Glob, Grep
|
|
5
|
+
version: 1.0.0
|
|
6
|
+
last-updated: 2026-03-12
|
|
7
|
+
applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
|
|
5
8
|
---
|
|
6
9
|
|
|
7
|
-
# Behavioral Modes
|
|
10
|
+
# Behavioral Modes
|
|
8
11
|
|
|
9
|
-
|
|
10
|
-
|
|
12
|
+
> The same task handled carelessly vs. carefully looks identical until it fails.
|
|
13
|
+
> Mode selection is about matching rigor to risk.
|
|
11
14
|
|
|
12
15
|
---
|
|
13
16
|
|
|
14
|
-
##
|
|
17
|
+
## Overview
|
|
15
18
|
|
|
16
|
-
|
|
19
|
+
Different work contexts require different operating behaviors. A debugging session requires patience and hypothesis testing. A code review requires skepticism. A teaching response requires no implementation at all.
|
|
17
20
|
|
|
18
|
-
|
|
21
|
+
This skill defines how to behave in each context β not just what to produce.
|
|
19
22
|
|
|
20
|
-
|
|
21
|
-
- Ask clarifying questions before assumptions
|
|
22
|
-
- Offer multiple alternatives (at least 3)
|
|
23
|
-
- Think divergently - explore unconventional solutions
|
|
24
|
-
- No code yet - focus on ideas and options
|
|
25
|
-
- Use visual diagrams (mermaid) to explain concepts
|
|
23
|
+
---
|
|
26
24
|
|
|
27
|
-
|
|
28
|
-
```
|
|
29
|
-
"Let's explore this together. Here are some approaches:
|
|
25
|
+
## Mode Definitions
|
|
30
26
|
|
|
31
|
-
|
|
32
|
-
|
|
33
|
-
β Cons: ...
|
|
27
|
+
### DISCOVER Mode
|
|
28
|
+
*When:* Request is vague, requirements are unclear, multiple valid interpretations exist
|
|
34
29
|
|
|
35
|
-
|
|
36
|
-
|
|
37
|
-
|
|
30
|
+
**Behavior:**
|
|
31
|
+
- Ask the minimum questions needed to reduce ambiguity
|
|
32
|
+
- Don't propose solutions until the problem is understood
|
|
33
|
+
- Surface hidden assumptions explicitly
|
|
34
|
+
- Validate understanding before proceeding
|
|
38
35
|
|
|
39
|
-
|
|
40
|
-
```
|
|
36
|
+
**Output:** Questions, restated problem, confirmed scope β not code
|
|
41
37
|
|
|
42
38
|
---
|
|
43
39
|
|
|
44
|
-
###
|
|
45
|
-
|
|
46
|
-
**When to use:** Writing code, building features, executing plans
|
|
40
|
+
### PLAN Mode
|
|
41
|
+
*When:* Feature is complex enough to touch multiple files or systems
|
|
47
42
|
|
|
48
43
|
**Behavior:**
|
|
49
|
-
-
|
|
50
|
-
-
|
|
51
|
-
-
|
|
52
|
-
- Write
|
|
53
|
-
- Include error handling and edge cases
|
|
54
|
-
- **NO tutorial-style explanations** - just code
|
|
55
|
-
- **NO unnecessary comments** - let code self-document
|
|
56
|
-
- **NO over-engineering** - solve the problem directly
|
|
57
|
-
- **NO RUSHING** - Quality > Speed. Read ALL references before coding.
|
|
58
|
-
|
|
59
|
-
**Output style:**
|
|
60
|
-
```
|
|
61
|
-
[Code block]
|
|
44
|
+
- Break work into ordered, dependency-aware tasks
|
|
45
|
+
- Identify risks before implementation begins
|
|
46
|
+
- Document assumptions that need validation
|
|
47
|
+
- Write the plan β don't write the code yet
|
|
62
48
|
|
|
63
|
-
|
|
64
|
-
```
|
|
49
|
+
**Output:** Structured task breakdown with dependencies and verification steps
|
|
65
50
|
|
|
66
|
-
|
|
67
|
-
```
|
|
68
|
-
"Building [feature]...
|
|
51
|
+
---
|
|
69
52
|
|
|
70
|
-
|
|
71
|
-
|
|
72
|
-
β Updated [file3]
|
|
53
|
+
### BUILD Mode
|
|
54
|
+
*When:* Plan is approved, scope is clear, implementation begins
|
|
73
55
|
|
|
74
|
-
|
|
56
|
+
**Behavior:**
|
|
57
|
+
- One module at a time β not the entire system in one shot
|
|
58
|
+
- Write `// VERIFY:` on anything uncertain about external APIs or methods
|
|
59
|
+
- Run linting and type checks after each significant change
|
|
60
|
+
- Stop if an assumption proves wrong β don't continue building on a broken foundation
|
|
75
61
|
|
|
76
|
-
|
|
77
|
-
```
|
|
62
|
+
**Output:** Working code, one piece at a time
|
|
78
63
|
|
|
79
64
|
---
|
|
80
65
|
|
|
81
|
-
###
|
|
82
|
-
|
|
83
|
-
**When to use:** Fixing bugs, troubleshooting errors, investigating issues
|
|
66
|
+
### REVIEW Mode
|
|
67
|
+
*When:* Auditing existing code for hallucinations, bugs, or quality issues
|
|
84
68
|
|
|
85
69
|
**Behavior:**
|
|
86
|
-
-
|
|
87
|
-
-
|
|
88
|
-
-
|
|
89
|
-
-
|
|
90
|
-
- Prevent future occurrences
|
|
91
|
-
|
|
92
|
-
**Output style:**
|
|
93
|
-
```
|
|
94
|
-
"Investigating...
|
|
70
|
+
- Read before commenting
|
|
71
|
+
- Label each finding: CRITICAL / WARNING / SUGGESTION
|
|
72
|
+
- Explain the impact β not just "this is wrong"
|
|
73
|
+
- Propose the fix, not just the problem
|
|
95
74
|
|
|
96
|
-
|
|
97
|
-
π― Root cause: [why it's happening]
|
|
98
|
-
β
Fix: [the solution]
|
|
99
|
-
π‘οΈ Prevention: [how to avoid in future]
|
|
100
|
-
```
|
|
75
|
+
**Output:** Labeled findings with impact descriptions and concrete fixes
|
|
101
76
|
|
|
102
77
|
---
|
|
103
78
|
|
|
104
|
-
###
|
|
105
|
-
|
|
106
|
-
**When to use:** Code review, architecture review, security audit
|
|
79
|
+
### DEBUG Mode
|
|
80
|
+
*When:* Something is broken and the root cause is unknown
|
|
107
81
|
|
|
108
82
|
**Behavior:**
|
|
109
|
-
-
|
|
110
|
-
-
|
|
111
|
-
-
|
|
112
|
-
-
|
|
113
|
-
- Acknowledge what's done well
|
|
83
|
+
- Form a hypothesis before changing anything
|
|
84
|
+
- Test one variable at a time
|
|
85
|
+
- Document what was tried and what the outcome was
|
|
86
|
+
- Root cause first β workaround only if root cause can't be addressed
|
|
114
87
|
|
|
115
|
-
**Output
|
|
116
|
-
```
|
|
117
|
-
## Code Review: [file/feature]
|
|
88
|
+
**Output:** Root cause statement, minimal fix, regression prevention note
|
|
118
89
|
|
|
119
|
-
|
|
120
|
-
- [issue with explanation]
|
|
90
|
+
---
|
|
121
91
|
|
|
122
|
-
###
|
|
123
|
-
|
|
92
|
+
### TEACH Mode
|
|
93
|
+
*When:* User asks "how does X work" or "explain Y"
|
|
124
94
|
|
|
125
|
-
|
|
126
|
-
-
|
|
127
|
-
|
|
95
|
+
**Behavior:**
|
|
96
|
+
- Answer the question directly before elaborating
|
|
97
|
+
- Use concrete examples, not abstract descriptions
|
|
98
|
+
- No implementation unless explicitly requested
|
|
99
|
+
- Check for follow-up understanding
|
|
128
100
|
|
|
129
|
-
|
|
101
|
+
**Output:** Explanation, examples, no code unless asked
|
|
130
102
|
|
|
131
|
-
|
|
103
|
+
---
|
|
132
104
|
|
|
133
|
-
|
|
105
|
+
### ORCHESTRATE Mode
|
|
106
|
+
*When:* Task spans multiple domains or requires multiple specialist perspectives
|
|
134
107
|
|
|
135
108
|
**Behavior:**
|
|
136
|
-
-
|
|
137
|
-
-
|
|
138
|
-
-
|
|
139
|
-
-
|
|
140
|
-
- Check understanding
|
|
109
|
+
- Identify which domains are involved
|
|
110
|
+
- Activate the appropriate specialists in sequence
|
|
111
|
+
- Synthesize their outputs into a coherent result
|
|
112
|
+
- Ensure consistency across domain boundaries (e.g., API contract matches frontend expectations)
|
|
141
113
|
|
|
142
|
-
**Output
|
|
143
|
-
```
|
|
144
|
-
## Understanding [Concept]
|
|
114
|
+
**Output:** Coordinated multi-domain response
|
|
145
115
|
|
|
146
|
-
|
|
147
|
-
[Simple explanation with analogy]
|
|
116
|
+
---
|
|
148
117
|
|
|
149
|
-
###
|
|
150
|
-
|
|
118
|
+
### SHIP Mode
|
|
119
|
+
*When:* Everything is ready, user confirms deployment
|
|
151
120
|
|
|
152
|
-
|
|
153
|
-
|
|
121
|
+
**Behavior:**
|
|
122
|
+
- Run the full verification suite before touching production
|
|
123
|
+
- Follow the 5-phase deployment sequence
|
|
124
|
+
- Verify each phase before proceeding to the next
|
|
125
|
+
- Have a rollback plan confirmed before starting
|
|
154
126
|
|
|
155
|
-
|
|
156
|
-
[Exercise or task]
|
|
157
|
-
```
|
|
127
|
+
**Output:** Pre-flight checklist results, deployment execution, post-deploy verification
|
|
158
128
|
|
|
159
129
|
---
|
|
160
130
|
|
|
161
|
-
|
|
131
|
+
## Mode Selection Rules
|
|
162
132
|
|
|
163
|
-
|
|
133
|
+
| Signal in Request | Activate |
|
|
134
|
+
|---|---|
|
|
135
|
+
| "how does", "explain", "what is" | TEACH |
|
|
136
|
+
| "why is X broken", "error:", traceback | DEBUG |
|
|
137
|
+
| "review this", "audit", "check" | REVIEW |
|
|
138
|
+
| "build", "create", "implement" | PLAN β BUILD |
|
|
139
|
+
| "I'm not sure what I need" | DISCOVER |
|
|
140
|
+
| "deploy", "release", "publish" | SHIP |
|
|
141
|
+
| Multiple domains in one request | ORCHESTRATE |
|
|
164
142
|
|
|
165
|
-
|
|
166
|
-
- Focus on stability over features
|
|
167
|
-
- Check for missing error handling
|
|
168
|
-
- Verify environment configs
|
|
169
|
-
- Run all tests
|
|
170
|
-
- Create deployment checklist
|
|
143
|
+
---
|
|
171
144
|
|
|
172
|
-
|
|
173
|
-
|
|
174
|
-
|
|
145
|
+
## π€ Mode Leakage Mitigation (Anti-Hallucination)
|
|
146
|
+
|
|
147
|
+
LLMs naturally want to "help" by writing code immediately. **Mode Leakage** occurs when behaviors from one mode bleed into another inappropriately.
|
|
175
148
|
|
|
176
|
-
|
|
177
|
-
|
|
178
|
-
|
|
179
|
-
|
|
149
|
+
1. **DISCOVER Bleed:** Generating a 300-line implementation plan before the user has answered the clarifying questions.
|
|
150
|
+
* *β AI Trait:* "Here are my questions. Also, here is how we will build it..."
|
|
151
|
+
* *β
Correction:* "I cannot propose an architecture until these 3 questions are answered."
|
|
152
|
+
2. **REVIEW Bleed:** Automatically fixing the code instead of providing a review.
|
|
153
|
+
* *β AI Trait:* "I reviewed your code. Here is the completely rewritten file."
|
|
154
|
+
* *β
Correction:* State the findings. Let the user ask for the fix.
|
|
155
|
+
3. **DEBUG Bleed:** Guessing a fix without proving the root cause.
|
|
156
|
+
* *β AI Trait:* "It looks like a configuration error. Try adding this line."
|
|
157
|
+
* *β
Correction:* "To verify if this is a configuration error, run this diagnostic command first."
|
|
158
|
+
|
|
159
|
+
---
|
|
180
160
|
|
|
181
|
-
|
|
182
|
-
- [ ] No exposed secrets
|
|
183
|
-
- [ ] Input validation complete
|
|
161
|
+
## Output Format
|
|
184
162
|
|
|
185
|
-
|
|
186
|
-
- [ ] Bundle size acceptable
|
|
187
|
-
- [ ] No console.logs
|
|
163
|
+
When this skill produces a recommendation or design decision, structure your output as:
|
|
188
164
|
|
|
189
|
-
### π Ready to deploy
|
|
190
165
|
```
|
|
166
|
+
βββ Behavioral Modes Recommendation ββββββββββββββββ
|
|
167
|
+
Decision: [what was chosen / proposed]
|
|
168
|
+
Rationale: [why β one concise line]
|
|
169
|
+
Trade-offs: [what is consciously accepted]
|
|
170
|
+
Next action: [concrete next step for the user]
|
|
171
|
+
βββββββββββββββββββββββββββββββββββββββββββββββββ
|
|
172
|
+
Pre-Flight: β
All checks passed
|
|
173
|
+
or β [blocking item that must be resolved first]
|
|
174
|
+
```
|
|
175
|
+
|
|
191
176
|
|
|
192
177
|
---
|
|
193
178
|
|
|
194
|
-
##
|
|
179
|
+
## ποΈ Tribunal Integration
|
|
195
180
|
|
|
196
|
-
|
|
181
|
+
**Slash command: Contextually applied based on user intent**
|
|
182
|
+
**Active reviewers: `logic-reviewer` (monitors for mode adherence)**
|
|
197
183
|
|
|
198
|
-
|
|
199
|
-
|---------|------|
|
|
200
|
-
| "what if", "ideas", "options" | BRAINSTORM |
|
|
201
|
-
| "build", "create", "add" | IMPLEMENT |
|
|
202
|
-
| "not working", "error", "bug" | DEBUG |
|
|
203
|
-
| "review", "check", "audit" | REVIEW |
|
|
204
|
-
| "explain", "how does", "learn" | TEACH |
|
|
205
|
-
| "deploy", "release", "production" | SHIP |
|
|
184
|
+
### β
Pre-Flight Self-Audit
|
|
206
185
|
|
|
207
|
-
|
|
186
|
+
Review these questions before sending any response to ensure you are in the correct mode:
|
|
187
|
+
```
|
|
188
|
+
β
Have I explicitly announced which mode I am operating in?
|
|
189
|
+
β
If in DISCOVER or TEACH, have I successfully suppressed the urge to write implementation code?
|
|
190
|
+
β
If in DEBUG, am I proving a hypothesis or just guessing a fix?
|
|
191
|
+
β
If in REVIEW, am I commenting on the existing code rather than silently rewriting it?
|
|
192
|
+
```
|
|
208
193
|
|
|
209
|
-
## Multi-Agent Collaboration Patterns (2025)
|
|
210
194
|
|
|
211
|
-
|
|
195
|
+
---
|
|
212
196
|
|
|
213
|
-
|
|
214
|
-
**Role:** Discovery and Analysis (Explorer Agent)
|
|
215
|
-
**Behavior:** Socratic questioning, deep-dive code reading, dependency mapping.
|
|
216
|
-
**Output:** `discovery-report.json`, architectural visualization.
|
|
197
|
+
## π€ LLM-Specific Traps
|
|
217
198
|
|
|
218
|
-
|
|
219
|
-
Cyclic mode transitions for high-complexity tasks:
|
|
220
|
-
1. **Planner:** Decomposes the task into atomic steps (`task.md`).
|
|
221
|
-
2. **Executor:** Performs the actual coding (`IMPLEMENT`).
|
|
222
|
-
3. **Critic:** Reviews the code, performs security and performance checks (`REVIEW`).
|
|
199
|
+
AI coding assistants often fall into specific bad habits when dealing with this domain. These are strictly forbidden:
|
|
223
200
|
|
|
224
|
-
|
|
225
|
-
|
|
201
|
+
1. **Over-engineering:** Proposing complex abstractions or distributed systems when a simpler approach suffices.
|
|
202
|
+
2. **Hallucinated Libraries/Methods:** Using non-existent methods or packages. Always `// VERIFY` or check `package.json` / `requirements.txt`.
|
|
203
|
+
3. **Skipping Edge Cases:** Writing the "happy path" and ignoring error handling, timeouts, or data validation.
|
|
204
|
+
4. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
|
|
205
|
+
5. **Silent Degradation:** Catching and suppressing errors without logging or re-raising.
|
|
226
206
|
|
|
227
207
|
---
|
|
228
208
|
|
|
229
|
-
##
|
|
209
|
+
## ποΈ Tribunal Integration (Anti-Hallucination)
|
|
230
210
|
|
|
231
|
-
|
|
211
|
+
**Slash command: `/review` or `/tribunal-full`**
|
|
212
|
+
**Active reviewers: `logic-reviewer` Β· `security-auditor`**
|
|
213
|
+
|
|
214
|
+
### β Forbidden AI Tropes
|
|
232
215
|
|
|
233
|
-
|
|
216
|
+
1. **Blind Assumptions:** Never make an assumption without documenting it clearly with `// VERIFY: [reason]`.
|
|
217
|
+
2. **Silent Degradation:** Catching and suppressing errors without logging or handling.
|
|
218
|
+
3. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
|
|
234
219
|
|
|
235
|
-
|
|
220
|
+
### β
Pre-Flight Self-Audit
|
|
236
221
|
|
|
222
|
+
Review these questions before confirming output:
|
|
237
223
|
```
|
|
238
|
-
|
|
239
|
-
|
|
240
|
-
|
|
241
|
-
|
|
224
|
+
β
Did I rely ONLY on real, verified tools and methods?
|
|
225
|
+
β
Is this solution appropriately scoped to the user's constraints?
|
|
226
|
+
β
Did I handle potential failure modes and edge cases?
|
|
227
|
+
β
Have I avoided generic boilerplate that doesn't add value?
|
|
242
228
|
```
|
|
229
|
+
|
|
230
|
+
### π Verification-Before-Completion (VBC) Protocol
|
|
231
|
+
|
|
232
|
+
**CRITICAL:** You must follow a strict "evidence-based closeout" state machine.
|
|
233
|
+
- β **Forbidden:** Declaring a task complete because the output "looks correct."
|
|
234
|
+
- β
**Required:** You are explicitly forbidden from finalizing any task without providing **concrete evidence** (terminal output, passing tests, compile success, or equivalent proof) that your output works as intended.
|