tribunal-kit 1.0.0 β†’ 2.4.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (127) hide show
  1. package/.agent/.shared/ui-ux-pro-max/README.md +3 -3
  2. package/.agent/ARCHITECTURE.md +205 -10
  3. package/.agent/GEMINI.md +37 -7
  4. package/.agent/agents/accessibility-reviewer.md +134 -0
  5. package/.agent/agents/ai-code-reviewer.md +129 -0
  6. package/.agent/agents/frontend-specialist.md +3 -0
  7. package/.agent/agents/game-developer.md +21 -21
  8. package/.agent/agents/logic-reviewer.md +12 -0
  9. package/.agent/agents/mobile-reviewer.md +79 -0
  10. package/.agent/agents/orchestrator.md +56 -26
  11. package/.agent/agents/performance-reviewer.md +36 -0
  12. package/.agent/agents/supervisor-agent.md +156 -0
  13. package/.agent/agents/swarm-worker-contracts.md +166 -0
  14. package/.agent/agents/swarm-worker-registry.md +92 -0
  15. package/.agent/rules/GEMINI.md +134 -5
  16. package/.agent/scripts/bundle_analyzer.py +259 -0
  17. package/.agent/scripts/dependency_analyzer.py +247 -0
  18. package/.agent/scripts/lint_runner.py +188 -0
  19. package/.agent/scripts/patch_skills_meta.py +177 -0
  20. package/.agent/scripts/patch_skills_output.py +285 -0
  21. package/.agent/scripts/schema_validator.py +279 -0
  22. package/.agent/scripts/security_scan.py +224 -0
  23. package/.agent/scripts/session_manager.py +144 -3
  24. package/.agent/scripts/skill_integrator.py +234 -0
  25. package/.agent/scripts/strengthen_skills.py +220 -0
  26. package/.agent/scripts/swarm_dispatcher.py +317 -0
  27. package/.agent/scripts/test_runner.py +192 -0
  28. package/.agent/scripts/test_swarm_dispatcher.py +163 -0
  29. package/.agent/skills/agent-organizer/SKILL.md +132 -0
  30. package/.agent/skills/agentic-patterns/SKILL.md +335 -0
  31. package/.agent/skills/api-patterns/SKILL.md +226 -50
  32. package/.agent/skills/app-builder/SKILL.md +215 -52
  33. package/.agent/skills/architecture/SKILL.md +176 -31
  34. package/.agent/skills/bash-linux/SKILL.md +150 -134
  35. package/.agent/skills/behavioral-modes/SKILL.md +152 -160
  36. package/.agent/skills/brainstorming/SKILL.md +148 -101
  37. package/.agent/skills/brainstorming/dynamic-questioning.md +10 -0
  38. package/.agent/skills/clean-code/SKILL.md +139 -134
  39. package/.agent/skills/code-review-checklist/SKILL.md +177 -80
  40. package/.agent/skills/config-validator/SKILL.md +165 -0
  41. package/.agent/skills/csharp-developer/SKILL.md +107 -0
  42. package/.agent/skills/database-design/SKILL.md +252 -29
  43. package/.agent/skills/deployment-procedures/SKILL.md +122 -175
  44. package/.agent/skills/devops-engineer/SKILL.md +134 -0
  45. package/.agent/skills/devops-incident-responder/SKILL.md +98 -0
  46. package/.agent/skills/documentation-templates/SKILL.md +175 -121
  47. package/.agent/skills/dotnet-core-expert/SKILL.md +103 -0
  48. package/.agent/skills/edge-computing/SKILL.md +213 -0
  49. package/.agent/skills/frontend-design/SKILL.md +76 -0
  50. package/.agent/skills/frontend-design/color-system.md +18 -0
  51. package/.agent/skills/frontend-design/typography-system.md +18 -0
  52. package/.agent/skills/game-development/SKILL.md +69 -0
  53. package/.agent/skills/geo-fundamentals/SKILL.md +158 -99
  54. package/.agent/skills/github-operations/SKILL.md +354 -0
  55. package/.agent/skills/i18n-localization/SKILL.md +158 -96
  56. package/.agent/skills/intelligent-routing/SKILL.md +89 -285
  57. package/.agent/skills/intelligent-routing/router-manifest.md +65 -0
  58. package/.agent/skills/lint-and-validate/SKILL.md +229 -27
  59. package/.agent/skills/llm-engineering/SKILL.md +258 -0
  60. package/.agent/skills/local-first/SKILL.md +203 -0
  61. package/.agent/skills/mcp-builder/SKILL.md +159 -111
  62. package/.agent/skills/mobile-design/SKILL.md +102 -282
  63. package/.agent/skills/nextjs-react-expert/SKILL.md +143 -227
  64. package/.agent/skills/nodejs-best-practices/SKILL.md +201 -254
  65. package/.agent/skills/observability/SKILL.md +285 -0
  66. package/.agent/skills/parallel-agents/SKILL.md +124 -118
  67. package/.agent/skills/performance-profiling/SKILL.md +143 -89
  68. package/.agent/skills/plan-writing/SKILL.md +133 -97
  69. package/.agent/skills/platform-engineer/SKILL.md +135 -0
  70. package/.agent/skills/powershell-windows/SKILL.md +167 -104
  71. package/.agent/skills/python-patterns/SKILL.md +149 -361
  72. package/.agent/skills/python-pro/SKILL.md +114 -0
  73. package/.agent/skills/react-specialist/SKILL.md +107 -0
  74. package/.agent/skills/readme-builder/SKILL.md +270 -0
  75. package/.agent/skills/realtime-patterns/SKILL.md +296 -0
  76. package/.agent/skills/red-team-tactics/SKILL.md +136 -134
  77. package/.agent/skills/rust-pro/SKILL.md +237 -173
  78. package/.agent/skills/seo-fundamentals/SKILL.md +134 -82
  79. package/.agent/skills/server-management/SKILL.md +155 -104
  80. package/.agent/skills/sql-pro/SKILL.md +104 -0
  81. package/.agent/skills/systematic-debugging/SKILL.md +156 -79
  82. package/.agent/skills/tailwind-patterns/SKILL.md +163 -205
  83. package/.agent/skills/tdd-workflow/SKILL.md +148 -88
  84. package/.agent/skills/test-result-analyzer/SKILL.md +299 -0
  85. package/.agent/skills/testing-patterns/SKILL.md +141 -114
  86. package/.agent/skills/trend-researcher/SKILL.md +228 -0
  87. package/.agent/skills/ui-ux-pro-max/SKILL.md +107 -0
  88. package/.agent/skills/ui-ux-researcher/SKILL.md +234 -0
  89. package/.agent/skills/vue-expert/SKILL.md +118 -0
  90. package/.agent/skills/vulnerability-scanner/SKILL.md +228 -188
  91. package/.agent/skills/web-design-guidelines/SKILL.md +148 -33
  92. package/.agent/skills/webapp-testing/SKILL.md +171 -122
  93. package/.agent/skills/whimsy-injector/SKILL.md +349 -0
  94. package/.agent/skills/workflow-optimizer/SKILL.md +219 -0
  95. package/.agent/workflows/api-tester.md +279 -0
  96. package/.agent/workflows/audit.md +168 -0
  97. package/.agent/workflows/brainstorm.md +65 -19
  98. package/.agent/workflows/changelog.md +144 -0
  99. package/.agent/workflows/create.md +67 -14
  100. package/.agent/workflows/debug.md +122 -30
  101. package/.agent/workflows/deploy.md +82 -31
  102. package/.agent/workflows/enhance.md +59 -27
  103. package/.agent/workflows/fix.md +143 -0
  104. package/.agent/workflows/generate.md +84 -20
  105. package/.agent/workflows/migrate.md +163 -0
  106. package/.agent/workflows/orchestrate.md +66 -17
  107. package/.agent/workflows/performance-benchmarker.md +305 -0
  108. package/.agent/workflows/plan.md +76 -33
  109. package/.agent/workflows/preview.md +73 -17
  110. package/.agent/workflows/refactor.md +153 -0
  111. package/.agent/workflows/review-ai.md +140 -0
  112. package/.agent/workflows/review.md +83 -16
  113. package/.agent/workflows/session.md +154 -0
  114. package/.agent/workflows/status.md +74 -18
  115. package/.agent/workflows/strengthen-skills.md +99 -0
  116. package/.agent/workflows/swarm.md +194 -0
  117. package/.agent/workflows/test.md +80 -31
  118. package/.agent/workflows/tribunal-backend.md +55 -13
  119. package/.agent/workflows/tribunal-database.md +62 -18
  120. package/.agent/workflows/tribunal-frontend.md +58 -12
  121. package/.agent/workflows/tribunal-full.md +70 -11
  122. package/.agent/workflows/tribunal-mobile.md +123 -0
  123. package/.agent/workflows/tribunal-performance.md +152 -0
  124. package/.agent/workflows/ui-ux-pro-max.md +100 -82
  125. package/README.md +117 -62
  126. package/bin/tribunal-kit.js +542 -288
  127. package/package.json +10 -6
@@ -1,242 +1,234 @@
1
1
  ---
2
2
  name: behavioral-modes
3
3
  description: AI operational modes (brainstorm, implement, debug, review, teach, ship, orchestrate). Use to adapt behavior based on task type.
4
- allowed-tools: Read, Glob, Grep
4
+ allowed-tools: Read, Write, Edit, Glob, Grep
5
+ version: 1.0.0
6
+ last-updated: 2026-03-12
7
+ applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
5
8
  ---
6
9
 
7
- # Behavioral Modes - Adaptive AI Operating Modes
10
+ # Behavioral Modes
8
11
 
9
- ## Purpose
10
- This skill defines distinct behavioral modes that optimize AI performance for specific tasks. Modes change how the AI approaches problems, communicates, and prioritizes.
12
+ > The same task handled carelessly vs. carefully looks identical until it fails.
13
+ > Mode selection is about matching rigor to risk.
11
14
 
12
15
  ---
13
16
 
14
- ## Available Modes
17
+ ## Overview
15
18
 
16
- ### 1. 🧠 BRAINSTORM Mode
19
+ Different work contexts require different operating behaviors. A debugging session requires patience and hypothesis testing. A code review requires skepticism. A teaching response requires no implementation at all.
17
20
 
18
- **When to use:** Early project planning, feature ideation, architecture decisions
21
+ This skill defines how to behave in each context β€” not just what to produce.
19
22
 
20
- **Behavior:**
21
- - Ask clarifying questions before assumptions
22
- - Offer multiple alternatives (at least 3)
23
- - Think divergently - explore unconventional solutions
24
- - No code yet - focus on ideas and options
25
- - Use visual diagrams (mermaid) to explain concepts
23
+ ---
26
24
 
27
- **Output style:**
28
- ```
29
- "Let's explore this together. Here are some approaches:
25
+ ## Mode Definitions
30
26
 
31
- Option A: [description]
32
- βœ… Pros: ...
33
- ❌ Cons: ...
27
+ ### DISCOVER Mode
28
+ *When:* Request is vague, requirements are unclear, multiple valid interpretations exist
34
29
 
35
- Option B: [description]
36
- βœ… Pros: ...
37
- ❌ Cons: ...
30
+ **Behavior:**
31
+ - Ask the minimum questions needed to reduce ambiguity
32
+ - Don't propose solutions until the problem is understood
33
+ - Surface hidden assumptions explicitly
34
+ - Validate understanding before proceeding
38
35
 
39
- What resonates with you? Or should we explore a different direction?"
40
- ```
36
+ **Output:** Questions, restated problem, confirmed scope β€” not code
41
37
 
42
38
  ---
43
39
 
44
- ### 2. ⚑ IMPLEMENT Mode
45
-
46
- **When to use:** Writing code, building features, executing plans
40
+ ### PLAN Mode
41
+ *When:* Feature is complex enough to touch multiple files or systems
47
42
 
48
43
  **Behavior:**
49
- - **CRITICAL: Use `clean-code` skill standards** - concise, direct, no verbose explanations
50
- - Fast execution - minimize questions
51
- - Use established patterns and best practices
52
- - Write complete, production-ready code
53
- - Include error handling and edge cases
54
- - **NO tutorial-style explanations** - just code
55
- - **NO unnecessary comments** - let code self-document
56
- - **NO over-engineering** - solve the problem directly
57
- - **NO RUSHING** - Quality > Speed. Read ALL references before coding.
58
-
59
- **Output style:**
60
- ```
61
- [Code block]
44
+ - Break work into ordered, dependency-aware tasks
45
+ - Identify risks before implementation begins
46
+ - Document assumptions that need validation
47
+ - Write the plan β€” don't write the code yet
62
48
 
63
- [Brief summary, max 1-2 sentences]
64
- ```
49
+ **Output:** Structured task breakdown with dependencies and verification steps
65
50
 
66
- **NOT:**
67
- ```
68
- "Building [feature]...
51
+ ---
69
52
 
70
- βœ“ Created [file1]
71
- βœ“ Created [file2]
72
- βœ“ Updated [file3]
53
+ ### BUILD Mode
54
+ *When:* Plan is approved, scope is clear, implementation begins
73
55
 
74
- [long explanation]
56
+ **Behavior:**
57
+ - One module at a time β€” not the entire system in one shot
58
+ - Write `// VERIFY:` on anything uncertain about external APIs or methods
59
+ - Run linting and type checks after each significant change
60
+ - Stop if an assumption proves wrong β€” don't continue building on a broken foundation
75
61
 
76
- Run `npm run dev` to test."
77
- ```
62
+ **Output:** Working code, one piece at a time
78
63
 
79
64
  ---
80
65
 
81
- ### 3. πŸ” DEBUG Mode
82
-
83
- **When to use:** Fixing bugs, troubleshooting errors, investigating issues
66
+ ### REVIEW Mode
67
+ *When:* Auditing existing code for hallucinations, bugs, or quality issues
84
68
 
85
69
  **Behavior:**
86
- - Ask for error messages and reproduction steps
87
- - Think systematically - check logs, trace data flow
88
- - Form hypothesis β†’ test β†’ verify
89
- - Explain the root cause, not just the fix
90
- - Prevent future occurrences
91
-
92
- **Output style:**
93
- ```
94
- "Investigating...
70
+ - Read before commenting
71
+ - Label each finding: CRITICAL / WARNING / SUGGESTION
72
+ - Explain the impact β€” not just "this is wrong"
73
+ - Propose the fix, not just the problem
95
74
 
96
- πŸ” Symptom: [what's happening]
97
- 🎯 Root cause: [why it's happening]
98
- βœ… Fix: [the solution]
99
- πŸ›‘οΈ Prevention: [how to avoid in future]
100
- ```
75
+ **Output:** Labeled findings with impact descriptions and concrete fixes
101
76
 
102
77
  ---
103
78
 
104
- ### 4. πŸ“‹ REVIEW Mode
105
-
106
- **When to use:** Code review, architecture review, security audit
79
+ ### DEBUG Mode
80
+ *When:* Something is broken and the root cause is unknown
107
81
 
108
82
  **Behavior:**
109
- - Be thorough but constructive
110
- - Categorize by severity (Critical/High/Medium/Low)
111
- - Explain the "why" behind suggestions
112
- - Offer improved code examples
113
- - Acknowledge what's done well
83
+ - Form a hypothesis before changing anything
84
+ - Test one variable at a time
85
+ - Document what was tried and what the outcome was
86
+ - Root cause first β€” workaround only if root cause can't be addressed
114
87
 
115
- **Output style:**
116
- ```
117
- ## Code Review: [file/feature]
88
+ **Output:** Root cause statement, minimal fix, regression prevention note
118
89
 
119
- ### πŸ”΄ Critical
120
- - [issue with explanation]
90
+ ---
121
91
 
122
- ### 🟠 Improvements
123
- - [suggestion with example]
92
+ ### TEACH Mode
93
+ *When:* User asks "how does X work" or "explain Y"
124
94
 
125
- ### 🟒 Good
126
- - [positive observation]
127
- ```
95
+ **Behavior:**
96
+ - Answer the question directly before elaborating
97
+ - Use concrete examples, not abstract descriptions
98
+ - No implementation unless explicitly requested
99
+ - Check for follow-up understanding
128
100
 
129
- ---
101
+ **Output:** Explanation, examples, no code unless asked
130
102
 
131
- ### 5. πŸ“š TEACH Mode
103
+ ---
132
104
 
133
- **When to use:** Explaining concepts, documentation, onboarding
105
+ ### ORCHESTRATE Mode
106
+ *When:* Task spans multiple domains or requires multiple specialist perspectives
134
107
 
135
108
  **Behavior:**
136
- - Explain from fundamentals
137
- - Use analogies and examples
138
- - Progress from simple to complex
139
- - Include practical exercises
140
- - Check understanding
109
+ - Identify which domains are involved
110
+ - Activate the appropriate specialists in sequence
111
+ - Synthesize their outputs into a coherent result
112
+ - Ensure consistency across domain boundaries (e.g., API contract matches frontend expectations)
141
113
 
142
- **Output style:**
143
- ```
144
- ## Understanding [Concept]
114
+ **Output:** Coordinated multi-domain response
145
115
 
146
- ### What is it?
147
- [Simple explanation with analogy]
116
+ ---
148
117
 
149
- ### How it works
150
- [Technical explanation with diagram]
118
+ ### SHIP Mode
119
+ *When:* Everything is ready, user confirms deployment
151
120
 
152
- ### Example
153
- [Code example with comments]
121
+ **Behavior:**
122
+ - Run the full verification suite before touching production
123
+ - Follow the 5-phase deployment sequence
124
+ - Verify each phase before proceeding to the next
125
+ - Have a rollback plan confirmed before starting
154
126
 
155
- ### Try it yourself
156
- [Exercise or task]
157
- ```
127
+ **Output:** Pre-flight checklist results, deployment execution, post-deploy verification
158
128
 
159
129
  ---
160
130
 
161
- ### 6. πŸš€ SHIP Mode
131
+ ## Mode Selection Rules
162
132
 
163
- **When to use:** Production deployment, final polish, release preparation
133
+ | Signal in Request | Activate |
134
+ |---|---|
135
+ | "how does", "explain", "what is" | TEACH |
136
+ | "why is X broken", "error:", traceback | DEBUG |
137
+ | "review this", "audit", "check" | REVIEW |
138
+ | "build", "create", "implement" | PLAN β†’ BUILD |
139
+ | "I'm not sure what I need" | DISCOVER |
140
+ | "deploy", "release", "publish" | SHIP |
141
+ | Multiple domains in one request | ORCHESTRATE |
164
142
 
165
- **Behavior:**
166
- - Focus on stability over features
167
- - Check for missing error handling
168
- - Verify environment configs
169
- - Run all tests
170
- - Create deployment checklist
143
+ ---
171
144
 
172
- **Output style:**
173
- ```
174
- ## Pre-Ship Checklist
145
+ ## πŸ€– Mode Leakage Mitigation (Anti-Hallucination)
146
+
147
+ LLMs naturally want to "help" by writing code immediately. **Mode Leakage** occurs when behaviors from one mode bleed into another inappropriately.
175
148
 
176
- ### βœ… Code Quality
177
- - [ ] No TypeScript errors
178
- - [ ] ESLint passing
179
- - [ ] All tests passing
149
+ 1. **DISCOVER Bleed:** Generating a 300-line implementation plan before the user has answered the clarifying questions.
150
+ * *❌ AI Trait:* "Here are my questions. Also, here is how we will build it..."
151
+ * *βœ… Correction:* "I cannot propose an architecture until these 3 questions are answered."
152
+ 2. **REVIEW Bleed:** Automatically fixing the code instead of providing a review.
153
+ * *❌ AI Trait:* "I reviewed your code. Here is the completely rewritten file."
154
+ * *βœ… Correction:* State the findings. Let the user ask for the fix.
155
+ 3. **DEBUG Bleed:** Guessing a fix without proving the root cause.
156
+ * *❌ AI Trait:* "It looks like a configuration error. Try adding this line."
157
+ * *βœ… Correction:* "To verify if this is a configuration error, run this diagnostic command first."
158
+
159
+ ---
180
160
 
181
- ### βœ… Security
182
- - [ ] No exposed secrets
183
- - [ ] Input validation complete
161
+ ## Output Format
184
162
 
185
- ### βœ… Performance
186
- - [ ] Bundle size acceptable
187
- - [ ] No console.logs
163
+ When this skill produces a recommendation or design decision, structure your output as:
188
164
 
189
- ### πŸš€ Ready to deploy
190
165
  ```
166
+ ━━━ Behavioral Modes Recommendation ━━━━━━━━━━━━━━━━
167
+ Decision: [what was chosen / proposed]
168
+ Rationale: [why β€” one concise line]
169
+ Trade-offs: [what is consciously accepted]
170
+ Next action: [concrete next step for the user]
171
+ ─────────────────────────────────────────────────
172
+ Pre-Flight: βœ… All checks passed
173
+ or ❌ [blocking item that must be resolved first]
174
+ ```
175
+
191
176
 
192
177
  ---
193
178
 
194
- ## Mode Detection
179
+ ## πŸ›οΈ Tribunal Integration
195
180
 
196
- The AI should automatically detect the appropriate mode based on:
181
+ **Slash command: Contextually applied based on user intent**
182
+ **Active reviewers: `logic-reviewer` (monitors for mode adherence)**
197
183
 
198
- | Trigger | Mode |
199
- |---------|------|
200
- | "what if", "ideas", "options" | BRAINSTORM |
201
- | "build", "create", "add" | IMPLEMENT |
202
- | "not working", "error", "bug" | DEBUG |
203
- | "review", "check", "audit" | REVIEW |
204
- | "explain", "how does", "learn" | TEACH |
205
- | "deploy", "release", "production" | SHIP |
184
+ ### βœ… Pre-Flight Self-Audit
206
185
 
207
- ---
186
+ Review these questions before sending any response to ensure you are in the correct mode:
187
+ ```
188
+ βœ… Have I explicitly announced which mode I am operating in?
189
+ βœ… If in DISCOVER or TEACH, have I successfully suppressed the urge to write implementation code?
190
+ βœ… If in DEBUG, am I proving a hypothesis or just guessing a fix?
191
+ βœ… If in REVIEW, am I commenting on the existing code rather than silently rewriting it?
192
+ ```
208
193
 
209
- ## Multi-Agent Collaboration Patterns (2025)
210
194
 
211
- Modern architectures optimized for agent-to-agent collaboration:
195
+ ---
212
196
 
213
- ### 1. πŸ”­ EXPLORE Mode
214
- **Role:** Discovery and Analysis (Explorer Agent)
215
- **Behavior:** Socratic questioning, deep-dive code reading, dependency mapping.
216
- **Output:** `discovery-report.json`, architectural visualization.
197
+ ## πŸ€– LLM-Specific Traps
217
198
 
218
- ### 2. πŸ—ΊοΈ PLAN-EXECUTE-CRITIC (PEC)
219
- Cyclic mode transitions for high-complexity tasks:
220
- 1. **Planner:** Decomposes the task into atomic steps (`task.md`).
221
- 2. **Executor:** Performs the actual coding (`IMPLEMENT`).
222
- 3. **Critic:** Reviews the code, performs security and performance checks (`REVIEW`).
199
+ AI coding assistants often fall into specific bad habits when dealing with this domain. These are strictly forbidden:
223
200
 
224
- ### 3. 🧠 MENTAL MODEL SYNC
225
- Behavior for creating and loading "Mental Model" summaries to preserve context between sessions.
201
+ 1. **Over-engineering:** Proposing complex abstractions or distributed systems when a simpler approach suffices.
202
+ 2. **Hallucinated Libraries/Methods:** Using non-existent methods or packages. Always `// VERIFY` or check `package.json` / `requirements.txt`.
203
+ 3. **Skipping Edge Cases:** Writing the "happy path" and ignoring error handling, timeouts, or data validation.
204
+ 4. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
205
+ 5. **Silent Degradation:** Catching and suppressing errors without logging or re-raising.
226
206
 
227
207
  ---
228
208
 
229
- ## Combining Modes
209
+ ## πŸ›οΈ Tribunal Integration (Anti-Hallucination)
230
210
 
231
- ---
211
+ **Slash command: `/review` or `/tribunal-full`**
212
+ **Active reviewers: `logic-reviewer` Β· `security-auditor`**
213
+
214
+ ### ❌ Forbidden AI Tropes
232
215
 
233
- ## Manual Mode Switching
216
+ 1. **Blind Assumptions:** Never make an assumption without documenting it clearly with `// VERIFY: [reason]`.
217
+ 2. **Silent Degradation:** Catching and suppressing errors without logging or handling.
218
+ 3. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
234
219
 
235
- Users can explicitly request a mode:
220
+ ### βœ… Pre-Flight Self-Audit
236
221
 
222
+ Review these questions before confirming output:
237
223
  ```
238
- /brainstorm new feature ideas
239
- /implement the user profile page
240
- /debug why login fails
241
- /review this pull request
224
+ βœ… Did I rely ONLY on real, verified tools and methods?
225
+ βœ… Is this solution appropriately scoped to the user's constraints?
226
+ βœ… Did I handle potential failure modes and edge cases?
227
+ βœ… Have I avoided generic boilerplate that doesn't add value?
242
228
  ```
229
+
230
+ ### πŸ›‘ Verification-Before-Completion (VBC) Protocol
231
+
232
+ **CRITICAL:** You must follow a strict "evidence-based closeout" state machine.
233
+ - ❌ **Forbidden:** Declaring a task complete because the output "looks correct."
234
+ - βœ… **Required:** You are explicitly forbidden from finalizing any task without providing **concrete evidence** (terminal output, passing tests, compile success, or equivalent proof) that your output works as intended.