tribunal-kit 3.0.0 → 4.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (233) hide show
  1. package/.agent/ARCHITECTURE.md +99 -99
  2. package/.agent/GEMINI.md +52 -52
  3. package/.agent/agents/accessibility-reviewer.md +187 -220
  4. package/.agent/agents/ai-code-reviewer.md +199 -233
  5. package/.agent/agents/backend-specialist.md +215 -238
  6. package/.agent/agents/code-archaeologist.md +161 -181
  7. package/.agent/agents/database-architect.md +184 -207
  8. package/.agent/agents/debugger.md +191 -218
  9. package/.agent/agents/dependency-reviewer.md +103 -136
  10. package/.agent/agents/devops-engineer.md +218 -238
  11. package/.agent/agents/documentation-writer.md +201 -221
  12. package/.agent/agents/explorer-agent.md +160 -180
  13. package/.agent/agents/frontend-reviewer.md +160 -194
  14. package/.agent/agents/frontend-specialist.md +248 -237
  15. package/.agent/agents/game-developer.md +48 -52
  16. package/.agent/agents/logic-reviewer.md +116 -149
  17. package/.agent/agents/mobile-developer.md +200 -223
  18. package/.agent/agents/mobile-reviewer.md +162 -195
  19. package/.agent/agents/orchestrator.md +181 -211
  20. package/.agent/agents/penetration-tester.md +157 -174
  21. package/.agent/agents/performance-optimizer.md +183 -203
  22. package/.agent/agents/performance-reviewer.md +178 -211
  23. package/.agent/agents/precedence-reviewer.md +213 -0
  24. package/.agent/agents/product-manager.md +142 -162
  25. package/.agent/agents/product-owner.md +6 -25
  26. package/.agent/agents/project-planner.md +142 -162
  27. package/.agent/agents/qa-automation-engineer.md +225 -242
  28. package/.agent/agents/security-auditor.md +174 -194
  29. package/.agent/agents/seo-specialist.md +193 -213
  30. package/.agent/agents/sql-reviewer.md +161 -194
  31. package/.agent/agents/supervisor-agent.md +184 -203
  32. package/.agent/agents/swarm-worker-contracts.md +17 -17
  33. package/.agent/agents/swarm-worker-registry.md +46 -46
  34. package/.agent/agents/test-coverage-reviewer.md +160 -193
  35. package/.agent/agents/test-engineer.md +0 -21
  36. package/.agent/agents/type-safety-reviewer.md +175 -208
  37. package/.agent/patterns/generator.md +9 -9
  38. package/.agent/patterns/inversion.md +12 -12
  39. package/.agent/patterns/pipeline.md +9 -9
  40. package/.agent/patterns/reviewer.md +13 -13
  41. package/.agent/patterns/tool-wrapper.md +9 -9
  42. package/.agent/rules/GEMINI.md +63 -63
  43. package/.agent/scripts/append_flow.js +72 -0
  44. package/.agent/scripts/case_law_manager.py +525 -0
  45. package/.agent/scripts/compress_skills.py +167 -0
  46. package/.agent/scripts/consolidate_skills.py +173 -0
  47. package/.agent/scripts/deep_compress.py +202 -0
  48. package/.agent/scripts/minify_context.py +80 -0
  49. package/.agent/scripts/security_scan.py +1 -1
  50. package/.agent/scripts/skill_evolution.py +563 -0
  51. package/.agent/scripts/strip_tribunal.py +41 -0
  52. package/.agent/skills/agent-organizer/SKILL.md +100 -126
  53. package/.agent/skills/agentic-patterns/SKILL.md +0 -70
  54. package/.agent/skills/ai-prompt-injection-defense/SKILL.md +134 -160
  55. package/.agent/skills/api-patterns/SKILL.md +123 -215
  56. package/.agent/skills/api-security-auditor/SKILL.md +143 -177
  57. package/.agent/skills/app-builder/SKILL.md +334 -50
  58. package/.agent/skills/app-builder/templates/SKILL.md +13 -15
  59. package/.agent/skills/app-builder/templates/astro-static/TEMPLATE.md +16 -16
  60. package/.agent/skills/app-builder/templates/chrome-extension/TEMPLATE.md +22 -22
  61. package/.agent/skills/app-builder/templates/cli-tool/TEMPLATE.md +18 -18
  62. package/.agent/skills/app-builder/templates/electron-desktop/TEMPLATE.md +20 -20
  63. package/.agent/skills/app-builder/templates/express-api/TEMPLATE.md +17 -17
  64. package/.agent/skills/app-builder/templates/flutter-app/TEMPLATE.md +18 -18
  65. package/.agent/skills/app-builder/templates/monorepo-turborepo/TEMPLATE.md +21 -21
  66. package/.agent/skills/app-builder/templates/nextjs-fullstack/TEMPLATE.md +19 -19
  67. package/.agent/skills/app-builder/templates/nextjs-saas/TEMPLATE.md +26 -26
  68. package/.agent/skills/app-builder/templates/nextjs-static/TEMPLATE.md +26 -26
  69. package/.agent/skills/app-builder/templates/nuxt-app/TEMPLATE.md +19 -19
  70. package/.agent/skills/app-builder/templates/python-fastapi/TEMPLATE.md +18 -18
  71. package/.agent/skills/app-builder/templates/react-native-app/TEMPLATE.md +20 -20
  72. package/.agent/skills/appflow-wireframe/SKILL.md +95 -121
  73. package/.agent/skills/architecture/SKILL.md +169 -331
  74. package/.agent/skills/authentication-best-practices/SKILL.md +139 -173
  75. package/.agent/skills/bash-linux/SKILL.md +129 -154
  76. package/.agent/skills/behavioral-modes/SKILL.md +8 -69
  77. package/.agent/skills/brainstorming/SKILL.md +436 -104
  78. package/.agent/skills/building-native-ui/SKILL.md +152 -174
  79. package/.agent/skills/clean-code/SKILL.md +331 -360
  80. package/.agent/skills/code-review-checklist/SKILL.md +0 -62
  81. package/.agent/skills/config-validator/SKILL.md +115 -141
  82. package/.agent/skills/csharp-developer/SKILL.md +468 -528
  83. package/.agent/skills/database-design/SKILL.md +104 -369
  84. package/.agent/skills/deployment-procedures/SKILL.md +119 -145
  85. package/.agent/skills/devops-engineer/SKILL.md +295 -332
  86. package/.agent/skills/devops-incident-responder/SKILL.md +87 -113
  87. package/.agent/skills/doc.md +5 -5
  88. package/.agent/skills/documentation-templates/SKILL.md +27 -63
  89. package/.agent/skills/edge-computing/SKILL.md +131 -157
  90. package/.agent/skills/extract-design-system/SKILL.md +108 -134
  91. package/.agent/skills/framer-motion-expert/SKILL.md +111 -855
  92. package/.agent/skills/frontend-design/SKILL.md +151 -499
  93. package/.agent/skills/game-design-expert/SKILL.md +79 -105
  94. package/.agent/skills/game-engineering-expert/SKILL.md +96 -122
  95. package/.agent/skills/geo-fundamentals/SKILL.md +97 -124
  96. package/.agent/skills/github-operations/SKILL.md +279 -314
  97. package/.agent/skills/gsap-expert/SKILL.md +119 -826
  98. package/.agent/skills/i18n-localization/SKILL.md +113 -138
  99. package/.agent/skills/intelligent-routing/SKILL.md +167 -127
  100. package/.agent/skills/lint-and-validate/SKILL.md +16 -52
  101. package/.agent/skills/llm-engineering/SKILL.md +344 -357
  102. package/.agent/skills/local-first/SKILL.md +128 -154
  103. package/.agent/skills/mcp-builder/SKILL.md +92 -118
  104. package/.agent/skills/mobile-design/SKILL.md +213 -219
  105. package/.agent/skills/motion-engineering/SKILL.md +184 -0
  106. package/.agent/skills/nextjs-react-expert/SKILL.md +99 -698
  107. package/.agent/skills/nodejs-best-practices/SKILL.md +498 -559
  108. package/.agent/skills/observability/SKILL.md +293 -330
  109. package/.agent/skills/parallel-agents/SKILL.md +96 -122
  110. package/.agent/skills/performance-profiling/SKILL.md +217 -254
  111. package/.agent/skills/plan-writing/SKILL.md +92 -118
  112. package/.agent/skills/platform-engineer/SKILL.md +97 -123
  113. package/.agent/skills/playwright-best-practices/SKILL.md +137 -162
  114. package/.agent/skills/powershell-windows/SKILL.md +112 -146
  115. package/.agent/skills/project-idioms/SKILL.md +87 -0
  116. package/.agent/skills/python-patterns/SKILL.md +15 -35
  117. package/.agent/skills/python-pro/SKILL.md +148 -754
  118. package/.agent/skills/react-specialist/SKILL.md +123 -827
  119. package/.agent/skills/readme-builder/SKILL.md +23 -85
  120. package/.agent/skills/realtime-patterns/SKILL.md +269 -304
  121. package/.agent/skills/red-team-tactics/SKILL.md +18 -51
  122. package/.agent/skills/rust-pro/SKILL.md +623 -701
  123. package/.agent/skills/seo-fundamentals/SKILL.md +129 -154
  124. package/.agent/skills/server-management/SKILL.md +164 -190
  125. package/.agent/skills/shadcn-ui-expert/SKILL.md +181 -206
  126. package/.agent/skills/skill-creator/SKILL.md +24 -56
  127. package/.agent/skills/sql-pro/SKILL.md +579 -633
  128. package/.agent/skills/supabase-postgres-best-practices/SKILL.md +35 -66
  129. package/.agent/skills/swiftui-expert/SKILL.md +151 -176
  130. package/.agent/skills/systematic-debugging/SKILL.md +92 -118
  131. package/.agent/skills/tailwind-patterns/SKILL.md +516 -576
  132. package/.agent/skills/tdd-workflow/SKILL.md +111 -137
  133. package/.agent/skills/test-result-analyzer/SKILL.md +33 -73
  134. package/.agent/skills/testing-patterns/SKILL.md +512 -573
  135. package/.agent/skills/trend-researcher/SKILL.md +30 -71
  136. package/.agent/skills/ui-ux-pro-max/SKILL.md +8 -41
  137. package/.agent/skills/ui-ux-researcher/SKILL.md +51 -91
  138. package/.agent/skills/vue-expert/SKILL.md +127 -866
  139. package/.agent/skills/vulnerability-scanner/SKILL.md +354 -269
  140. package/.agent/skills/web-accessibility-auditor/SKILL.md +168 -193
  141. package/.agent/skills/web-design-guidelines/SKILL.md +25 -61
  142. package/.agent/skills/webapp-testing/SKILL.md +119 -145
  143. package/.agent/skills/whimsy-injector/SKILL.md +58 -132
  144. package/.agent/skills/workflow-optimizer/SKILL.md +28 -68
  145. package/.agent/workflows/api-tester.md +151 -151
  146. package/.agent/workflows/audit.md +127 -138
  147. package/.agent/workflows/brainstorm.md +110 -110
  148. package/.agent/workflows/changelog.md +112 -112
  149. package/.agent/workflows/create.md +124 -124
  150. package/.agent/workflows/debug.md +165 -189
  151. package/.agent/workflows/deploy.md +180 -189
  152. package/.agent/workflows/enhance.md +128 -151
  153. package/.agent/workflows/fix.md +114 -135
  154. package/.agent/workflows/generate.md +13 -4
  155. package/.agent/workflows/migrate.md +160 -160
  156. package/.agent/workflows/orchestrate.md +168 -168
  157. package/.agent/workflows/performance-benchmarker.md +114 -123
  158. package/.agent/workflows/plan.md +173 -173
  159. package/.agent/workflows/preview.md +80 -80
  160. package/.agent/workflows/refactor.md +161 -183
  161. package/.agent/workflows/review-ai.md +101 -129
  162. package/.agent/workflows/review.md +116 -116
  163. package/.agent/workflows/session.md +94 -94
  164. package/.agent/workflows/status.md +79 -79
  165. package/.agent/workflows/strengthen-skills.md +138 -139
  166. package/.agent/workflows/swarm.md +179 -179
  167. package/.agent/workflows/test.md +189 -211
  168. package/.agent/workflows/tribunal-backend.md +94 -113
  169. package/.agent/workflows/tribunal-database.md +95 -115
  170. package/.agent/workflows/tribunal-frontend.md +96 -118
  171. package/.agent/workflows/tribunal-full.md +93 -133
  172. package/.agent/workflows/tribunal-mobile.md +95 -119
  173. package/.agent/workflows/tribunal-performance.md +110 -133
  174. package/.agent/workflows/ui-ux-pro-max.md +122 -143
  175. package/README.md +30 -1
  176. package/bin/tribunal-kit.js +175 -12
  177. package/package.json +25 -4
  178. package/.agent/skills/api-patterns/api-style.md +0 -42
  179. package/.agent/skills/api-patterns/auth.md +0 -24
  180. package/.agent/skills/api-patterns/documentation.md +0 -26
  181. package/.agent/skills/api-patterns/graphql.md +0 -41
  182. package/.agent/skills/api-patterns/rate-limiting.md +0 -31
  183. package/.agent/skills/api-patterns/response.md +0 -37
  184. package/.agent/skills/api-patterns/rest.md +0 -40
  185. package/.agent/skills/api-patterns/security-testing.md +0 -122
  186. package/.agent/skills/api-patterns/trpc.md +0 -41
  187. package/.agent/skills/api-patterns/versioning.md +0 -22
  188. package/.agent/skills/app-builder/agent-coordination.md +0 -71
  189. package/.agent/skills/app-builder/feature-building.md +0 -53
  190. package/.agent/skills/app-builder/project-detection.md +0 -34
  191. package/.agent/skills/app-builder/scaffolding.md +0 -118
  192. package/.agent/skills/app-builder/tech-stack.md +0 -40
  193. package/.agent/skills/architecture/context-discovery.md +0 -43
  194. package/.agent/skills/architecture/examples.md +0 -94
  195. package/.agent/skills/architecture/pattern-selection.md +0 -68
  196. package/.agent/skills/architecture/patterns-reference.md +0 -50
  197. package/.agent/skills/architecture/trade-off-analysis.md +0 -77
  198. package/.agent/skills/brainstorming/dynamic-questioning.md +0 -360
  199. package/.agent/skills/database-design/database-selection.md +0 -43
  200. package/.agent/skills/database-design/indexing.md +0 -39
  201. package/.agent/skills/database-design/migrations.md +0 -48
  202. package/.agent/skills/database-design/optimization.md +0 -36
  203. package/.agent/skills/database-design/orm-selection.md +0 -30
  204. package/.agent/skills/database-design/schema-design.md +0 -56
  205. package/.agent/skills/frontend-design/animation-guide.md +0 -331
  206. package/.agent/skills/frontend-design/color-system.md +0 -329
  207. package/.agent/skills/frontend-design/decision-trees.md +0 -418
  208. package/.agent/skills/frontend-design/motion-graphics.md +0 -306
  209. package/.agent/skills/frontend-design/typography-system.md +0 -363
  210. package/.agent/skills/frontend-design/ux-psychology.md +0 -1116
  211. package/.agent/skills/frontend-design/visual-effects.md +0 -383
  212. package/.agent/skills/intelligent-routing/router-manifest.md +0 -65
  213. package/.agent/skills/mobile-design/decision-trees.md +0 -516
  214. package/.agent/skills/mobile-design/mobile-backend.md +0 -491
  215. package/.agent/skills/mobile-design/mobile-color-system.md +0 -420
  216. package/.agent/skills/mobile-design/mobile-debugging.md +0 -122
  217. package/.agent/skills/mobile-design/mobile-design-thinking.md +0 -357
  218. package/.agent/skills/mobile-design/mobile-navigation.md +0 -458
  219. package/.agent/skills/mobile-design/mobile-performance.md +0 -767
  220. package/.agent/skills/mobile-design/mobile-testing.md +0 -356
  221. package/.agent/skills/mobile-design/mobile-typography.md +0 -433
  222. package/.agent/skills/mobile-design/platform-android.md +0 -666
  223. package/.agent/skills/mobile-design/platform-ios.md +0 -561
  224. package/.agent/skills/mobile-design/touch-psychology.md +0 -537
  225. package/.agent/skills/nextjs-react-expert/1-async-eliminating-waterfalls.md +0 -312
  226. package/.agent/skills/nextjs-react-expert/2-bundle-bundle-size-optimization.md +0 -240
  227. package/.agent/skills/nextjs-react-expert/3-server-server-side-performance.md +0 -490
  228. package/.agent/skills/nextjs-react-expert/4-client-client-side-data-fetching.md +0 -264
  229. package/.agent/skills/nextjs-react-expert/5-rerender-re-render-optimization.md +0 -581
  230. package/.agent/skills/nextjs-react-expert/6-rendering-rendering-performance.md +0 -432
  231. package/.agent/skills/nextjs-react-expert/7-js-javascript-performance.md +0 -684
  232. package/.agent/skills/nextjs-react-expert/8-advanced-advanced-patterns.md +0 -150
  233. package/.agent/skills/vulnerability-scanner/checklists.md +0 -121
@@ -1,137 +1,111 @@
1
- ---
2
- name: tdd-workflow
3
- description: Test-Driven Development (TDD) mastery. Red-Green-Refactor cycles, behavior-driven design (BDD), strict mutation coverage, test doubles (mocks/stubs/spies), and avoiding test-induced design damage. Use when building complex algorithms, deep business logic, or strictly regulated systems.
4
- allowed-tools: Read, Write, Edit, Glob, Grep
5
- version: 2.0.0
6
- last-updated: 2026-04-02
7
- applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
8
- ---
9
-
10
- # Test-Driven Development (TDD) — Defect-Free Execution Mastery
11
-
12
- > You do not write tests to verify your code. You write tests to design your code.
13
- > Unverified code is a liability. TDD is the professional hygiene of software engineering.
14
-
15
- ---
16
-
17
- ## 1. The Red-Green-Refactor Cycle
18
-
19
- TDD is a strict, irrevocable discipline. Do not write the implementation first.
20
-
21
- ### Step 1: RED (Write the failing test)
22
- Write the test as if the API already exists exactly how you *wish* it were designed.
23
- Run the test. It MUST fail (because the function doesn't exist, or returns the wrong value). If it passes, the test is useless.
24
-
25
- ```typescript
26
- // 1. The failing test
27
- import { calculateDiscount } from './pricing';
28
-
29
- test('Should apply 10% discount for orders over $100', () => {
30
- expect(calculateDiscount(150)).toBe(135);
31
- });
32
- // FAILS: calculateDiscount is not defined
33
- ```
34
-
35
- ### Step 2: GREEN (Make it pass exactly)
36
- Write the absolute minimum, dumbest code required to make the test pass. Do not over-engineer.
37
-
38
- ```typescript
39
- // 2. The minimum implementation
40
- export function calculateDiscount(subtotal: number): number {
41
- if (subtotal >= 100) return subtotal * 0.90;
42
- return subtotal;
43
- }
44
- // PASSES.
45
- ```
46
-
47
- ### Step 3: REFACTOR
48
- Now wrap the implementation in clean architectural principles. The tests guarantee you haven't broken the behavior while you optimize.
49
-
50
- ```typescript
51
- // 3. The Refactor
52
- const DISCOUNT_THRESHOLD = 100;
53
- const DISCOUNT_RATE = 0.90;
54
-
55
- export function calculateDiscount(subtotal: number): number {
56
- return subtotal >= DISCOUNT_THRESHOLD ? subtotal * DISCOUNT_RATE : subtotal;
57
- }
58
- // STILL PASSES. Safe to commit.
59
- ```
60
-
61
- ---
62
-
63
- ## 2. Test Doubles (Mocks, Stubs, Spies)
64
-
65
- Knowing *how* to mock separates amateurs from professionals. Over-mocking destroys architectural integrity.
66
-
67
- | Type | When to use | Example |
68
- |:---|:---|:---|
69
- | **Dummy** | Filler objects passed but never used | `processOrder(new UserDummy(), payload)` |
70
- | **Stub** | Hardcodes a specific response | `db.getUser.mockResolvedValue({ id: 1 })` |
71
- | **Spy** | Records how many times a function was called | `expect(emailService.send).toHaveBeenCalledTimes(1)` |
72
- | **Mock** | A spy with predefined expectations of exact payloads | `expect(logger.info).toHaveBeenCalledWith('Authorized')` |
73
-
74
- ### The Mocking Rule
75
- **Only mock at the architectural boundaries (Database, Network, External FileSystem).**
76
- NEVER mock internal business logic or child pure-functions. If function A calls function B, test A by allowing it to genuinely call B.
77
-
78
- ---
79
-
80
- ## 3. Anti-Pattern: Testing Implementation Details
81
-
82
- Tests should verify the *behavior* output, not the underlying code structure.
83
-
84
- ```typescript
85
- class Account {
86
- private balance = 0;
87
- deposit(amount: number) { this.balance += amount; }
88
- getBalance() { return this.balance; }
89
- }
90
-
91
- // BAD: Testing internal state (Fragile)
92
- test('Deposit updates the internal balance variable', () => {
93
- const acc = new Account();
94
- acc.deposit(50);
95
- expect(acc['balance']).toBe(50); // Intrusive test breaks if variable is renamed
96
- });
97
-
98
- // GOOD: Testing external behavior contract
99
- test('Deposit makes the funds available via getBalance', () => {
100
- const acc = new Account();
101
- acc.deposit(50);
102
- expect(acc.getBalance()).toBe(50); // Tests the public API only
103
- });
104
- ```
105
-
106
- ---
107
-
108
- ## 🤖 LLM-Specific Traps (TDD)
109
-
110
- 1. **Executing Green-First:** Writing the implementation *before* the test. This completely bypasses the design guidance inherent to TDD.
111
- 2. **Test-Induced Design Damage:** Making private methods public just so they can be individually unit tested. Test the private methods exclusively through the public interface.
112
- 3. **Mocks as Reality:** AI deeply mocking internal functions (`vi.mock('./utils')`) to the point where the test simply verifies the mock configuration, providing zero real-world confidence.
113
- 4. **Fragile "Any" Mocks:** AI writing `expect(mock).toHaveBeenCalledWith(expect.anything())`, neutralizing the actual verification value of the spy.
114
- 5. **No Edge Cases:** Generating tests exclusively for the "Happy Path" (valid inputs). TDD requires boundary testing (nulls, negatives, MAX_INT, empty arrays).
115
- 6. **Massive Arrange Blocks:** Constructing 100-line object setups before the action occurs. Strongly indicates the code under test requires too many dependencies.
116
- 7. **Random Execution Dependency:** Writing tests relying on `Math.random()`, `new Date()`, or real database connections. Tests must be deterministic. Inject interfaces for time and randomizers.
117
- 8. **Catch-All Error Checks:** AI writes `expect(fn).toThrowError()`. Assert against specific error messages so regressions in the exact failure reason are detected.
118
- 9. **Test Name Obscurity:** `test('Works properly', () => ...)`. The test name should read as explicit documentation of system constraints (`test('Throws InsufficientFundsError when withdrawal exceeds balance')`).
119
- 10. **Refactor Skip:** Completing the "Green" phase and stopping. The Refactor phase is where the technical debt is permanently cleared.
120
-
121
- ---
122
-
123
- ## 🏛️ Tribunal Integration
124
-
125
- ### ✅ Pre-Flight Self-Audit
126
- ```
127
- ✅ Did I write the failing test requirements BEFORE creating the implementation?
128
- ✅ Are internal private methods accessed solely via verifying the public API layer?
129
- ✅ Were mocks restricted entirely to architectural boundaries (Network/DB/Disk)?
130
- ✅ Are date/time instances mocked via FakeTimers to ensure strict determinism?
131
- ✅ Do assertions verify precise error messages instead of generic catch-all throws?
132
- ✅ Are test case titles descriptive enough to serve as living documentation?
133
- ✅ Have all negative edge cases (boundaries, empty states) been accounted for?
134
- ✅ Upon achieving 'Green', was a deliberate refactor pass initiated for clean code?
135
- ✅ Has `expect.anything()` been avoided to enforce rigid verification of call payloads?
136
- ✅ Does the payload setup rely on minimal dummy data instead of colossal stubs?
137
- ```
1
+ ---
2
+ name: tdd-workflow
3
+ description: Test-Driven Development (TDD) mastery. Red-Green-Refactor cycles, behavior-driven design (BDD), strict mutation coverage, test doubles (mocks/stubs/spies), and avoiding test-induced design damage. Use when building complex algorithms, deep business logic, or strictly regulated systems.
4
+ allowed-tools: Read, Write, Edit, Glob, Grep
5
+ version: 2.0.0
6
+ last-updated: 2026-04-02
7
+ applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
8
+ ---
9
+
10
+ ## Hallucination Traps (Read First)
11
+ - ❌ Writing tests that test implementation details instead of behavior -> ✅ Test WHAT it does (inputs/outputs), not HOW (internal methods)
12
+ - Skipping the Red phase (writing a failing test first) -> ✅ If the test passes before you write the code, it tests nothing
13
+ - Refactoring during the Red or Green phase -> ✅ Red: write failing test. Green: make it pass minimally. THEN refactor. Never mix phases
14
+
15
+ ---
16
+
17
+
18
+ # Test-Driven Development (TDD) — Defect-Free Execution Mastery
19
+
20
+ ---
21
+
22
+ ## 1. The Red-Green-Refactor Cycle
23
+
24
+ TDD is a strict, irrevocable discipline. Do not write the implementation first.
25
+
26
+ ### Step 1: RED (Write the failing test)
27
+ Write the test as if the API already exists exactly how you *wish* it were designed.
28
+ Run the test. It MUST fail (because the function doesn't exist, or returns the wrong value). If it passes, the test is useless.
29
+
30
+ ```typescript
31
+ // 1. The failing test
32
+ import { calculateDiscount } from './pricing';
33
+
34
+ test('Should apply 10% discount for orders over $100', () => {
35
+ expect(calculateDiscount(150)).toBe(135);
36
+ });
37
+ // ❌ FAILS: calculateDiscount is not defined
38
+ ```
39
+
40
+ ### Step 2: GREEN (Make it pass exactly)
41
+ Write the absolute minimum, dumbest code required to make the test pass. Do not over-engineer.
42
+
43
+ ```typescript
44
+ // 2. The minimum implementation
45
+ export function calculateDiscount(subtotal: number): number {
46
+ if (subtotal >= 100) return subtotal * 0.90;
47
+ return subtotal;
48
+ }
49
+ // ✅ PASSES.
50
+ ```
51
+
52
+ ### Step 3: REFACTOR
53
+ Now wrap the implementation in clean architectural principles. The tests guarantee you haven't broken the behavior while you optimize.
54
+
55
+ ```typescript
56
+ // 3. The Refactor
57
+ const DISCOUNT_THRESHOLD = 100;
58
+ const DISCOUNT_RATE = 0.90;
59
+
60
+ export function calculateDiscount(subtotal: number): number {
61
+ return subtotal >= DISCOUNT_THRESHOLD ? subtotal * DISCOUNT_RATE : subtotal;
62
+ }
63
+ // STILL PASSES. Safe to commit.
64
+ ```
65
+
66
+ ---
67
+
68
+ ## 2. Test Doubles (Mocks, Stubs, Spies)
69
+
70
+ Knowing *how* to mock separates amateurs from professionals. Over-mocking destroys architectural integrity.
71
+
72
+ |Type|When to use|Example|
73
+ |:---|:---|:---|
74
+ |**Dummy**|Filler objects passed but never used|`processOrder(new UserDummy(), payload)`|
75
+ |**Stub**|Hardcodes a specific response|`db.getUser.mockResolvedValue({ id: 1 })`|
76
+ |**Spy**|Records how many times a function was called|`expect(emailService.send).toHaveBeenCalledTimes(1)`|
77
+ |**Mock**|A spy with predefined expectations of exact payloads|`expect(logger.info).toHaveBeenCalledWith('Authorized')`|
78
+
79
+ ### The Mocking Rule
80
+ **Only mock at the architectural boundaries (Database, Network, External FileSystem).**
81
+ NEVER mock internal business logic or child pure-functions. If function A calls function B, test A by allowing it to genuinely call B.
82
+
83
+ ---
84
+
85
+ ## 3. Anti-Pattern: Testing Implementation Details
86
+
87
+ Tests should verify the *behavior* output, not the underlying code structure.
88
+
89
+ ```typescript
90
+ class Account {
91
+ private balance = 0;
92
+ deposit(amount: number) { this.balance += amount; }
93
+ getBalance() { return this.balance; }
94
+ }
95
+
96
+ // ❌ BAD: Testing internal state (Fragile)
97
+ test('Deposit updates the internal balance variable', () => {
98
+ const acc = new Account();
99
+ acc.deposit(50);
100
+ expect(acc['balance']).toBe(50); // Intrusive test breaks if variable is renamed
101
+ });
102
+
103
+ // ✅ GOOD: Testing external behavior contract
104
+ test('Deposit makes the funds available via getBalance', () => {
105
+ const acc = new Account();
106
+ acc.deposit(50);
107
+ expect(acc.getBalance()).toBe(50); // Tests the public API only
108
+ });
109
+ ```
110
+
111
+ ---
@@ -52,16 +52,16 @@ Report — structured output with confidence levels
52
52
 
53
53
  Auto-detect the test framework from output patterns:
54
54
 
55
- | Framework | Detection Pattern | Failure Marker |
55
+ |Framework|Detection Pattern|Failure Marker|
56
56
  |---|---|---|
57
- | Jest | `PASS`/`FAIL` with file paths, `●` for test names | `FAIL src/...` |
58
- | Vitest | `✓`/`×` markers, `FAIL` blocks | `❯ FAIL` or `× test name` |
59
- | pytest | `PASSED`/`FAILED` with `::` separator | `FAILED tests/...::test_name` |
60
- | Go test | `ok`/`FAIL` with package paths | `--- FAIL: TestName` |
61
- | Mocha | `passing`/`failing` counts, indented suites | `N failing` section |
62
- | JUnit (XML) | `<testsuite>` XML structure | `<failure>` elements |
63
- | RSpec | `.F` markers, `Failures:` section | `Failure/Error:` |
64
- | Cargo test | `test result: FAILED` | `---- test_name stdout ----` |
57
+ |Jest|`PASS`/`FAIL` with file paths, `●` for test names|`FAIL src/...`|
58
+ |Vitest|`✓`/`×` markers, `FAIL` blocks|`❯ FAIL` or `× test name`|
59
+ |pytest|`PASSED`/`FAILED` with `::` separator|`FAILED tests/...::test_name`|
60
+ |Go test|`ok`/`FAIL` with package paths|`--- FAIL: TestName`|
61
+ |Mocha|`passing`/`failing` counts, indented suites|`N failing` section|
62
+ |JUnit (XML)|`<testsuite>` XML structure|`<failure>` elements|
63
+ |RSpec|`.F` markers, `Failures:` section|`Failure/Error:`|
64
+ |Cargo test|`test result: FAILED`|`---- test_name stdout ----`|
65
65
 
66
66
  ## Step 2: Failure Extraction
67
67
 
@@ -86,15 +86,15 @@ Group failures into clusters based on shared characteristics:
86
86
 
87
87
  ### Cluster Types
88
88
 
89
- | Cluster Type | How to Detect | Typical Root Cause |
89
+ |Cluster Type|How to Detect|Typical Root Cause|
90
90
  |---|---|---|
91
- | **Shared Module** | Multiple tests import from the same file that changed | Missing export, type change, API change |
92
- | **Same Error Type** | All failures throw `TypeError` or `ConnectionError` | Broken dependency, env issue |
93
- | **Shared Fixture** | Tests using same `beforeEach`/setup fail together | Fixture setup failure cascading |
94
- | **Import Chain** | Failures follow the import graph | Dependency that fails to resolve |
95
- | **Environment** | All tests fail with connection/config errors | Missing env var, DB not running |
96
- | **Timing** | Tests pass individually, fail together | Race condition, shared state |
97
- | **Snapshot** | Multiple `toMatchSnapshot` failures | Intentional UI change (update snapshots) |
91
+ |**Shared Module**|Multiple tests import from the same file that changed|Missing export, type change, API change|
92
+ |**Same Error Type**|All failures throw `TypeError` or `ConnectionError`|Broken dependency, env issue|
93
+ |**Shared Fixture**|Tests using same `beforeEach`/setup fail together|Fixture setup failure cascading|
94
+ |**Import Chain**|Failures follow the import graph|Dependency that fails to resolve|
95
+ |**Environment**|All tests fail with connection/config errors|Missing env var, DB not running|
96
+ |**Timing**|Tests pass individually, fail together|Race condition, shared state|
97
+ |**Snapshot**|Multiple `toMatchSnapshot` failures|Intentional UI change (update snapshots)|
98
98
 
99
99
  ### Cascade Detection Algorithm
100
100
 
@@ -125,25 +125,25 @@ Example:
125
125
 
126
126
  **FPF Confidence Levels:**
127
127
 
128
- | Confidence | Criteria |
128
+ |Confidence|Criteria|
129
129
  |---|---|
130
- | **HIGH** | Same source file in >50% of failure stack traces |
131
- | **MEDIUM** | Same error type across multiple test files |
132
- | **LOW** | Failures appear independent, multiple root causes likely |
130
+ |**HIGH**|Same source file in >50% of failure stack traces|
131
+ |**MEDIUM**|Same error type across multiple test files|
132
+ |**LOW**|Failures appear independent, multiple root causes likely|
133
133
 
134
134
  ## Step 5: Fix Recommendations
135
135
 
136
136
  For each cluster, provide actionable fixes:
137
137
 
138
- | Fix Type | Example | How to Verify |
138
+ |Fix Type|Example|How to Verify|
139
139
  |---|---|---|
140
- | **Missing Export** | `export { fn }` added to module | Re-run failing tests |
141
- | **Type Mismatch** | Function signature changed, callers need update | Check callers with `grep_search` |
142
- | **Stale Mock** | Mock doesn't match new interface | Compare mock to actual implementation |
143
- | **Env Variable** | `.env.test` missing `DATABASE_URL` | Check `.env.example` vs `.env.test` |
144
- | **Snapshot Update** | Intentional UI change | Run with `--updateSnapshot` flag |
145
- | **Race Condition** | Tests share global state | Add isolation or `beforeEach` reset |
146
- | **Dependency Update** | Package API changed after upgrade | Check changelog of updated package |
140
+ |**Missing Export**|`export { fn }` added to module|Re-run failing tests|
141
+ |**Type Mismatch**|Function signature changed, callers need update|Check callers with `grep_search`|
142
+ |**Stale Mock**|Mock doesn't match new interface|Compare mock to actual implementation|
143
+ |**Env Variable**|`.env.test` missing `DATABASE_URL`|Check `.env.example` vs `.env.test`|
144
+ |**Snapshot Update**|Intentional UI change|Run with `--updateSnapshot` flag|
145
+ |**Race Condition**|Tests share global state|Add isolation or `beforeEach` reset|
146
+ |**Dependency Update**|Package API changed after upgrade|Check changelog of updated package|
147
147
 
148
148
  ### Fix Priority Formula
149
149
  ```
@@ -241,11 +241,11 @@ If only snapshot tests fail → likely intentional UI change:
241
241
 
242
242
  ## Cross-Skill Integration
243
243
 
244
- | Paired Skill | Integration Point |
244
+ |Paired Skill|Integration Point|
245
245
  |---|---|
246
- | `systematic-debugging` | Escalate when FPF is unclear → 4-phase debug methodology |
247
- | `testing-patterns` | Reference when recommending test structure improvements |
248
- | `workflow-optimizer` | Flag inefficient test-debug-retest loops |
246
+ |`systematic-debugging`|Escalate when FPF is unclear → 4-phase debug methodology|
247
+ |`testing-patterns`|Reference when recommending test structure improvements|
248
+ |`workflow-optimizer`|Flag inefficient test-debug-retest loops|
249
249
 
250
250
  ## Anti-Hallucination Guard
251
251
 
@@ -256,44 +256,4 @@ If only snapshot tests fail → likely intentional UI change:
256
256
  - **Never guess at assertion values** — quote exactly what "Expected" and "Received" say in the output.
257
257
  - **Don't assume test runner** — auto-detect from output format, don't assume Jest.
258
258
 
259
-
260
- ---
261
-
262
- ## 🤖 LLM-Specific Traps
263
-
264
- AI coding assistants often fall into specific bad habits when dealing with this domain. These are strictly forbidden:
265
-
266
- 1. **Over-engineering:** Proposing complex abstractions or distributed systems when a simpler approach suffices.
267
- 2. **Hallucinated Libraries/Methods:** Using non-existent methods or packages. Always `// VERIFY` or check `package.json` / `requirements.txt`.
268
- 3. **Skipping Edge Cases:** Writing the "happy path" and ignoring error handling, timeouts, or data validation.
269
- 4. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
270
- 5. **Silent Degradation:** Catching and suppressing errors without logging or re-raising.
271
-
272
259
  ---
273
-
274
- ## 🏛️ Tribunal Integration (Anti-Hallucination)
275
-
276
- **Slash command: `/review` or `/tribunal-full`**
277
- **Active reviewers: `logic-reviewer` · `security-auditor`**
278
-
279
- ### ❌ Forbidden AI Tropes
280
-
281
- 1. **Blind Assumptions:** Never make an assumption without documenting it clearly with `// VERIFY: [reason]`.
282
- 2. **Silent Degradation:** Catching and suppressing errors without logging or handling.
283
- 3. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
284
-
285
- ### ✅ Pre-Flight Self-Audit
286
-
287
- Review these questions before confirming output:
288
- ```
289
- ✅ Did I rely ONLY on real, verified tools and methods?
290
- ✅ Is this solution appropriately scoped to the user's constraints?
291
- ✅ Did I handle potential failure modes and edge cases?
292
- ✅ Have I avoided generic boilerplate that doesn't add value?
293
- ```
294
-
295
- ### 🛑 Verification-Before-Completion (VBC) Protocol
296
-
297
- **CRITICAL:** You must follow a strict "evidence-based closeout" state machine.
298
- - ❌ **Forbidden:** Declaring a task complete because the output "looks correct."
299
- - ✅ **Required:** You are explicitly forbidden from finalizing any task without providing **concrete evidence** (terminal output, passing tests, compile success, or equivalent proof) that your output works as intended.