tribunal-kit 3.0.0 → 4.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (233) hide show
  1. package/.agent/ARCHITECTURE.md +99 -99
  2. package/.agent/GEMINI.md +52 -52
  3. package/.agent/agents/accessibility-reviewer.md +187 -220
  4. package/.agent/agents/ai-code-reviewer.md +199 -233
  5. package/.agent/agents/backend-specialist.md +215 -238
  6. package/.agent/agents/code-archaeologist.md +161 -181
  7. package/.agent/agents/database-architect.md +184 -207
  8. package/.agent/agents/debugger.md +191 -218
  9. package/.agent/agents/dependency-reviewer.md +103 -136
  10. package/.agent/agents/devops-engineer.md +218 -238
  11. package/.agent/agents/documentation-writer.md +201 -221
  12. package/.agent/agents/explorer-agent.md +160 -180
  13. package/.agent/agents/frontend-reviewer.md +160 -194
  14. package/.agent/agents/frontend-specialist.md +248 -237
  15. package/.agent/agents/game-developer.md +48 -52
  16. package/.agent/agents/logic-reviewer.md +116 -149
  17. package/.agent/agents/mobile-developer.md +200 -223
  18. package/.agent/agents/mobile-reviewer.md +162 -195
  19. package/.agent/agents/orchestrator.md +181 -211
  20. package/.agent/agents/penetration-tester.md +157 -174
  21. package/.agent/agents/performance-optimizer.md +183 -203
  22. package/.agent/agents/performance-reviewer.md +178 -211
  23. package/.agent/agents/precedence-reviewer.md +213 -0
  24. package/.agent/agents/product-manager.md +142 -162
  25. package/.agent/agents/product-owner.md +6 -25
  26. package/.agent/agents/project-planner.md +142 -162
  27. package/.agent/agents/qa-automation-engineer.md +225 -242
  28. package/.agent/agents/security-auditor.md +174 -194
  29. package/.agent/agents/seo-specialist.md +193 -213
  30. package/.agent/agents/sql-reviewer.md +161 -194
  31. package/.agent/agents/supervisor-agent.md +184 -203
  32. package/.agent/agents/swarm-worker-contracts.md +17 -17
  33. package/.agent/agents/swarm-worker-registry.md +46 -46
  34. package/.agent/agents/test-coverage-reviewer.md +160 -193
  35. package/.agent/agents/test-engineer.md +0 -21
  36. package/.agent/agents/type-safety-reviewer.md +175 -208
  37. package/.agent/patterns/generator.md +9 -9
  38. package/.agent/patterns/inversion.md +12 -12
  39. package/.agent/patterns/pipeline.md +9 -9
  40. package/.agent/patterns/reviewer.md +13 -13
  41. package/.agent/patterns/tool-wrapper.md +9 -9
  42. package/.agent/rules/GEMINI.md +63 -63
  43. package/.agent/scripts/append_flow.js +72 -0
  44. package/.agent/scripts/case_law_manager.py +525 -0
  45. package/.agent/scripts/compress_skills.py +167 -0
  46. package/.agent/scripts/consolidate_skills.py +173 -0
  47. package/.agent/scripts/deep_compress.py +202 -0
  48. package/.agent/scripts/minify_context.py +80 -0
  49. package/.agent/scripts/security_scan.py +1 -1
  50. package/.agent/scripts/skill_evolution.py +563 -0
  51. package/.agent/scripts/strip_tribunal.py +41 -0
  52. package/.agent/skills/agent-organizer/SKILL.md +100 -126
  53. package/.agent/skills/agentic-patterns/SKILL.md +0 -70
  54. package/.agent/skills/ai-prompt-injection-defense/SKILL.md +134 -160
  55. package/.agent/skills/api-patterns/SKILL.md +123 -215
  56. package/.agent/skills/api-security-auditor/SKILL.md +143 -177
  57. package/.agent/skills/app-builder/SKILL.md +334 -50
  58. package/.agent/skills/app-builder/templates/SKILL.md +13 -15
  59. package/.agent/skills/app-builder/templates/astro-static/TEMPLATE.md +16 -16
  60. package/.agent/skills/app-builder/templates/chrome-extension/TEMPLATE.md +22 -22
  61. package/.agent/skills/app-builder/templates/cli-tool/TEMPLATE.md +18 -18
  62. package/.agent/skills/app-builder/templates/electron-desktop/TEMPLATE.md +20 -20
  63. package/.agent/skills/app-builder/templates/express-api/TEMPLATE.md +17 -17
  64. package/.agent/skills/app-builder/templates/flutter-app/TEMPLATE.md +18 -18
  65. package/.agent/skills/app-builder/templates/monorepo-turborepo/TEMPLATE.md +21 -21
  66. package/.agent/skills/app-builder/templates/nextjs-fullstack/TEMPLATE.md +19 -19
  67. package/.agent/skills/app-builder/templates/nextjs-saas/TEMPLATE.md +26 -26
  68. package/.agent/skills/app-builder/templates/nextjs-static/TEMPLATE.md +26 -26
  69. package/.agent/skills/app-builder/templates/nuxt-app/TEMPLATE.md +19 -19
  70. package/.agent/skills/app-builder/templates/python-fastapi/TEMPLATE.md +18 -18
  71. package/.agent/skills/app-builder/templates/react-native-app/TEMPLATE.md +20 -20
  72. package/.agent/skills/appflow-wireframe/SKILL.md +95 -121
  73. package/.agent/skills/architecture/SKILL.md +169 -331
  74. package/.agent/skills/authentication-best-practices/SKILL.md +139 -173
  75. package/.agent/skills/bash-linux/SKILL.md +129 -154
  76. package/.agent/skills/behavioral-modes/SKILL.md +8 -69
  77. package/.agent/skills/brainstorming/SKILL.md +436 -104
  78. package/.agent/skills/building-native-ui/SKILL.md +152 -174
  79. package/.agent/skills/clean-code/SKILL.md +331 -360
  80. package/.agent/skills/code-review-checklist/SKILL.md +0 -62
  81. package/.agent/skills/config-validator/SKILL.md +115 -141
  82. package/.agent/skills/csharp-developer/SKILL.md +468 -528
  83. package/.agent/skills/database-design/SKILL.md +104 -369
  84. package/.agent/skills/deployment-procedures/SKILL.md +119 -145
  85. package/.agent/skills/devops-engineer/SKILL.md +295 -332
  86. package/.agent/skills/devops-incident-responder/SKILL.md +87 -113
  87. package/.agent/skills/doc.md +5 -5
  88. package/.agent/skills/documentation-templates/SKILL.md +27 -63
  89. package/.agent/skills/edge-computing/SKILL.md +131 -157
  90. package/.agent/skills/extract-design-system/SKILL.md +108 -134
  91. package/.agent/skills/framer-motion-expert/SKILL.md +111 -855
  92. package/.agent/skills/frontend-design/SKILL.md +151 -499
  93. package/.agent/skills/game-design-expert/SKILL.md +79 -105
  94. package/.agent/skills/game-engineering-expert/SKILL.md +96 -122
  95. package/.agent/skills/geo-fundamentals/SKILL.md +97 -124
  96. package/.agent/skills/github-operations/SKILL.md +279 -314
  97. package/.agent/skills/gsap-expert/SKILL.md +119 -826
  98. package/.agent/skills/i18n-localization/SKILL.md +113 -138
  99. package/.agent/skills/intelligent-routing/SKILL.md +167 -127
  100. package/.agent/skills/lint-and-validate/SKILL.md +16 -52
  101. package/.agent/skills/llm-engineering/SKILL.md +344 -357
  102. package/.agent/skills/local-first/SKILL.md +128 -154
  103. package/.agent/skills/mcp-builder/SKILL.md +92 -118
  104. package/.agent/skills/mobile-design/SKILL.md +213 -219
  105. package/.agent/skills/motion-engineering/SKILL.md +184 -0
  106. package/.agent/skills/nextjs-react-expert/SKILL.md +99 -698
  107. package/.agent/skills/nodejs-best-practices/SKILL.md +498 -559
  108. package/.agent/skills/observability/SKILL.md +293 -330
  109. package/.agent/skills/parallel-agents/SKILL.md +96 -122
  110. package/.agent/skills/performance-profiling/SKILL.md +217 -254
  111. package/.agent/skills/plan-writing/SKILL.md +92 -118
  112. package/.agent/skills/platform-engineer/SKILL.md +97 -123
  113. package/.agent/skills/playwright-best-practices/SKILL.md +137 -162
  114. package/.agent/skills/powershell-windows/SKILL.md +112 -146
  115. package/.agent/skills/project-idioms/SKILL.md +87 -0
  116. package/.agent/skills/python-patterns/SKILL.md +15 -35
  117. package/.agent/skills/python-pro/SKILL.md +148 -754
  118. package/.agent/skills/react-specialist/SKILL.md +123 -827
  119. package/.agent/skills/readme-builder/SKILL.md +23 -85
  120. package/.agent/skills/realtime-patterns/SKILL.md +269 -304
  121. package/.agent/skills/red-team-tactics/SKILL.md +18 -51
  122. package/.agent/skills/rust-pro/SKILL.md +623 -701
  123. package/.agent/skills/seo-fundamentals/SKILL.md +129 -154
  124. package/.agent/skills/server-management/SKILL.md +164 -190
  125. package/.agent/skills/shadcn-ui-expert/SKILL.md +181 -206
  126. package/.agent/skills/skill-creator/SKILL.md +24 -56
  127. package/.agent/skills/sql-pro/SKILL.md +579 -633
  128. package/.agent/skills/supabase-postgres-best-practices/SKILL.md +35 -66
  129. package/.agent/skills/swiftui-expert/SKILL.md +151 -176
  130. package/.agent/skills/systematic-debugging/SKILL.md +92 -118
  131. package/.agent/skills/tailwind-patterns/SKILL.md +516 -576
  132. package/.agent/skills/tdd-workflow/SKILL.md +111 -137
  133. package/.agent/skills/test-result-analyzer/SKILL.md +33 -73
  134. package/.agent/skills/testing-patterns/SKILL.md +512 -573
  135. package/.agent/skills/trend-researcher/SKILL.md +30 -71
  136. package/.agent/skills/ui-ux-pro-max/SKILL.md +8 -41
  137. package/.agent/skills/ui-ux-researcher/SKILL.md +51 -91
  138. package/.agent/skills/vue-expert/SKILL.md +127 -866
  139. package/.agent/skills/vulnerability-scanner/SKILL.md +354 -269
  140. package/.agent/skills/web-accessibility-auditor/SKILL.md +168 -193
  141. package/.agent/skills/web-design-guidelines/SKILL.md +25 -61
  142. package/.agent/skills/webapp-testing/SKILL.md +119 -145
  143. package/.agent/skills/whimsy-injector/SKILL.md +58 -132
  144. package/.agent/skills/workflow-optimizer/SKILL.md +28 -68
  145. package/.agent/workflows/api-tester.md +151 -151
  146. package/.agent/workflows/audit.md +127 -138
  147. package/.agent/workflows/brainstorm.md +110 -110
  148. package/.agent/workflows/changelog.md +112 -112
  149. package/.agent/workflows/create.md +124 -124
  150. package/.agent/workflows/debug.md +165 -189
  151. package/.agent/workflows/deploy.md +180 -189
  152. package/.agent/workflows/enhance.md +128 -151
  153. package/.agent/workflows/fix.md +114 -135
  154. package/.agent/workflows/generate.md +13 -4
  155. package/.agent/workflows/migrate.md +160 -160
  156. package/.agent/workflows/orchestrate.md +168 -168
  157. package/.agent/workflows/performance-benchmarker.md +114 -123
  158. package/.agent/workflows/plan.md +173 -173
  159. package/.agent/workflows/preview.md +80 -80
  160. package/.agent/workflows/refactor.md +161 -183
  161. package/.agent/workflows/review-ai.md +101 -129
  162. package/.agent/workflows/review.md +116 -116
  163. package/.agent/workflows/session.md +94 -94
  164. package/.agent/workflows/status.md +79 -79
  165. package/.agent/workflows/strengthen-skills.md +138 -139
  166. package/.agent/workflows/swarm.md +179 -179
  167. package/.agent/workflows/test.md +189 -211
  168. package/.agent/workflows/tribunal-backend.md +94 -113
  169. package/.agent/workflows/tribunal-database.md +95 -115
  170. package/.agent/workflows/tribunal-frontend.md +96 -118
  171. package/.agent/workflows/tribunal-full.md +93 -133
  172. package/.agent/workflows/tribunal-mobile.md +95 -119
  173. package/.agent/workflows/tribunal-performance.md +110 -133
  174. package/.agent/workflows/ui-ux-pro-max.md +122 -143
  175. package/README.md +30 -1
  176. package/bin/tribunal-kit.js +175 -12
  177. package/package.json +25 -4
  178. package/.agent/skills/api-patterns/api-style.md +0 -42
  179. package/.agent/skills/api-patterns/auth.md +0 -24
  180. package/.agent/skills/api-patterns/documentation.md +0 -26
  181. package/.agent/skills/api-patterns/graphql.md +0 -41
  182. package/.agent/skills/api-patterns/rate-limiting.md +0 -31
  183. package/.agent/skills/api-patterns/response.md +0 -37
  184. package/.agent/skills/api-patterns/rest.md +0 -40
  185. package/.agent/skills/api-patterns/security-testing.md +0 -122
  186. package/.agent/skills/api-patterns/trpc.md +0 -41
  187. package/.agent/skills/api-patterns/versioning.md +0 -22
  188. package/.agent/skills/app-builder/agent-coordination.md +0 -71
  189. package/.agent/skills/app-builder/feature-building.md +0 -53
  190. package/.agent/skills/app-builder/project-detection.md +0 -34
  191. package/.agent/skills/app-builder/scaffolding.md +0 -118
  192. package/.agent/skills/app-builder/tech-stack.md +0 -40
  193. package/.agent/skills/architecture/context-discovery.md +0 -43
  194. package/.agent/skills/architecture/examples.md +0 -94
  195. package/.agent/skills/architecture/pattern-selection.md +0 -68
  196. package/.agent/skills/architecture/patterns-reference.md +0 -50
  197. package/.agent/skills/architecture/trade-off-analysis.md +0 -77
  198. package/.agent/skills/brainstorming/dynamic-questioning.md +0 -360
  199. package/.agent/skills/database-design/database-selection.md +0 -43
  200. package/.agent/skills/database-design/indexing.md +0 -39
  201. package/.agent/skills/database-design/migrations.md +0 -48
  202. package/.agent/skills/database-design/optimization.md +0 -36
  203. package/.agent/skills/database-design/orm-selection.md +0 -30
  204. package/.agent/skills/database-design/schema-design.md +0 -56
  205. package/.agent/skills/frontend-design/animation-guide.md +0 -331
  206. package/.agent/skills/frontend-design/color-system.md +0 -329
  207. package/.agent/skills/frontend-design/decision-trees.md +0 -418
  208. package/.agent/skills/frontend-design/motion-graphics.md +0 -306
  209. package/.agent/skills/frontend-design/typography-system.md +0 -363
  210. package/.agent/skills/frontend-design/ux-psychology.md +0 -1116
  211. package/.agent/skills/frontend-design/visual-effects.md +0 -383
  212. package/.agent/skills/intelligent-routing/router-manifest.md +0 -65
  213. package/.agent/skills/mobile-design/decision-trees.md +0 -516
  214. package/.agent/skills/mobile-design/mobile-backend.md +0 -491
  215. package/.agent/skills/mobile-design/mobile-color-system.md +0 -420
  216. package/.agent/skills/mobile-design/mobile-debugging.md +0 -122
  217. package/.agent/skills/mobile-design/mobile-design-thinking.md +0 -357
  218. package/.agent/skills/mobile-design/mobile-navigation.md +0 -458
  219. package/.agent/skills/mobile-design/mobile-performance.md +0 -767
  220. package/.agent/skills/mobile-design/mobile-testing.md +0 -356
  221. package/.agent/skills/mobile-design/mobile-typography.md +0 -433
  222. package/.agent/skills/mobile-design/platform-android.md +0 -666
  223. package/.agent/skills/mobile-design/platform-ios.md +0 -561
  224. package/.agent/skills/mobile-design/touch-psychology.md +0 -537
  225. package/.agent/skills/nextjs-react-expert/1-async-eliminating-waterfalls.md +0 -312
  226. package/.agent/skills/nextjs-react-expert/2-bundle-bundle-size-optimization.md +0 -240
  227. package/.agent/skills/nextjs-react-expert/3-server-server-side-performance.md +0 -490
  228. package/.agent/skills/nextjs-react-expert/4-client-client-side-data-fetching.md +0 -264
  229. package/.agent/skills/nextjs-react-expert/5-rerender-re-render-optimization.md +0 -581
  230. package/.agent/skills/nextjs-react-expert/6-rendering-rendering-performance.md +0 -432
  231. package/.agent/skills/nextjs-react-expert/7-js-javascript-performance.md +0 -684
  232. package/.agent/skills/nextjs-react-expert/8-advanced-advanced-patterns.md +0 -150
  233. package/.agent/skills/vulnerability-scanner/checklists.md +0 -121
@@ -1,113 +1,87 @@
1
- ---
2
- name: devops-incident-responder
3
- description: Production incident response mastery. MTTR (Mean Time to Recovery) reduction, blameless post-mortems, rapid triaging, halting systemic cascading failures, isolating problematic deployments, and evidence-based forensic analysis. Use when stabilizing broken systems, fighting active production fires, or conducting root-cause post-mortems.
4
- allowed-tools: Read, Write, Edit, Glob, Grep
5
- version: 2.0.0
6
- last-updated: 2026-04-02
7
- applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
8
- ---
9
-
10
- # Incident Responder Production Stabilization Mastery
11
-
12
- > Time is blood. The goal of an incident response is Mitigation first, Resolution second.
13
- > DO NOT investigate the root cause while the building is burning. Put out the fire (Rollback), then investigate the ashes.
14
-
15
- ---
16
-
17
- ## 1. The Prime Directive (Stop the Bleeding)
18
-
19
- When an outage is declared (e.g., 502 Bad Gateway across the entire primary cluster), do not ask the developer to check the database logs to figure out why the code crashed.
20
-
21
- **Immediate Action Pipeline:**
22
- 1. **Identify the Trigger:** What changed in the last 15 minutes? (90% of outages are caused by deployments).
23
- 2. **Revert the Change:** Execute the emergency rollback pipeline instantly. Revert the Git commit, swap the Docker tag, or disable the Feature Flag.
24
- 3. **Verify Stabilization:** Ensure metrics return to healthy thresholds.
25
- 4. **Communicate:** "Mitigation complete. Services restored. Root cause investigation underway."
26
-
27
- ---
28
-
29
- ## 2. Isolating Cascading Failures
30
-
31
- A cascading failure occurs when Service A dies, causing Service B to overload with retries, which kills Service B, which kills the database.
32
-
33
- **The Circuit Breaker Protocol:**
34
- If a downstream dependency is dead, sever it immediately to save the rest of the ecosystem.
35
-
36
- ```javascript
37
- // ❌ VULNERABLE: Infinite Retry Death Spiral
38
- async function fetchUser(id) {
39
- while(true) {
40
- try { return await api.get(`/user/${id}`); }
41
- catch { await sleep(100); } // Hundreds of containers doing this will execute a DDoSing attack on the API
42
- }
43
- }
44
-
45
- // RESILIENT: Circuit Breaking / Fallbacks
46
- const breaker = new CircuitBreaker(fetchUser, {
47
- errorThresholdPercentage: 50, // If 50% of requests fail...
48
- resetTimeout: 30000 // Open the circuit (stop sending requests) for 30s
49
- });
50
-
51
- breaker.fallback(() => ({ id: "cached-user", status: "degraded" }));
52
- ```
53
-
54
- **Heavy Mitigation Tactics:**
55
- - **Shed Load:** Aggressively drop non-critical traffic (e.g., disable background syncs, temporarily ban aggressive scraping IPs).
56
- - **Scale Out (Band-Aid):** If the memory leak is crashing nodes every 10 minutes, scale the nodes up 3x to buy yourself 30 minutes of runway to find the actual bug.
57
-
58
- ---
59
-
60
- ## 3. The Investigative Triage Routine
61
-
62
- Once the bleeding is stopped (or if you are investigating a non-fatal anomaly), follow the data strictly:
63
-
64
- 1. **Metrics (The "What"):** Look at the Dashboards. Did latency spike? Did CPU pin at 100%? Did Database active connections max out?
65
- 2. **Traces (The "Where"):** Look at OpenTelemetry/Datadog traces. Which specific microservice is the bottleneck?
66
- 3. **Logs (The "Why"):** Query the centralized logs (Splunk/Elastic/CloudWatch) exactly around the timestamp the trace spiked.
67
-
68
- ---
69
-
70
- ## 4. The Blameless Post-Mortem
71
-
72
- Incident response does not end when the system recovers. It ends when the system is architected to survive the same failure tomorrow automatically.
73
-
74
- **A Professional Post-Mortem Must Include:**
75
- 1. **The Timeline:** Chronological factual representation of the event to the minute.
76
- 2. **Root Cause Analysis (The 5 Whys):**
77
- - *Why did the site go down?* DB exhausted connections.
78
- - *Why did it exhaust?* The new background worker didn't pool connections.
79
- - *Why did the worker deploy?* It bypassed CI tests for speed.
80
- 3. **Action Items:** Tangible Jira tickets preventing recurrence (e.g., "Implement PgBouncer connection limits", "Enforce CI checks block on all branches").
81
-
82
- ---
83
-
84
- ## 🤖 LLM-Specific Traps (Incident Response)
85
-
86
- 1. **Investigating the Fire:** Identifying a crash and immediately demanding the user rewrite the deeply nested API logic while the site remains completely offline to customers. Always prescribe an instant Rollback first.
87
- 2. **Shotgun Reboots:** Telling the user to just restart all the servers blindly. This destroys all volatile memory evidence (Heap Dumps, core dumps) required to actually solve the root cause.
88
- 3. **Restart Loops:** The AI identifies an OOM (Out Of Memory) crash and writes a bash loop to infinitely restart the system every time it crashes, actively masking the fatal memory leak from architectural review.
89
- 4. **Ignoring Network Geometries:** Trying to debug an API failure for 20 minutes by analyzing Node.js code, while totally forgetting to check if the AWS Security Group / Firewall simply blocked the port.
90
- 5. **Assuming Code is Flawless:** Recommending complex database re-indexing strategies because queries got slow, without recognizing that the recent Git Commit deployed an N+1 query loop. Assume recent deployments are guilty until proven innocent.
91
- 6. **No Circuit Breakers:** Fixing a timeout bug by suggesting the user increase the global HTTP timeout from 10s to 60s, guaranteeing the entire thread pool becomes exhausted instantly during the next network fluctuation.
92
- 7. **The Blame Game:** Writing analysis reports that focus heavily on the individual developer who pushed the bad code, rather than identifying the systemic CI/CD pipeline failure that *allowed* the bad code to merge.
93
- 8. **Logging in Production:** Advising the user to `console.log` massive payloads to track an error in production environments, massively polluting logs and potentially leaking PII compliance data.
94
- 9. **Tunnel Vision:** Debugging Application A intensely, failing to realize Application A failed because the underlying global Redis cache failed completely, affecting all services simultaneously. Focus on shared infrastructure metrics first.
95
- 10. **The Unverifiable Fix:** Proposing an intricate solution and skipping the final critical step: "How will we prove this actually fixed the problem under load?". Deploy without telemetry monitoring is flying blind.
96
-
97
- ---
98
-
99
- ## 🏛️ Tribunal Integration
100
-
101
- ### ✅ Pre-Flight Self-Audit
102
- ```
103
- ✅ Was immediate Mitigation/Rollback explicitly ordered prior to initiating Root Cause Analysis?
104
- ✅ Did I enforce strategies to break cascading failures (e.g., Circuit Breakers, load shedding)?
105
- ✅ Are diagnostic analyses tracing specific metrics to explicit log anomalies?
106
- ✅ Ensure that system reboots did not intentionally destroy critical volatile diagnostic evidence.
107
- ✅ Have recent deployments/git-commits been flagged as the primary initial vector of suspicion?
108
- ✅ Did I avoid masking problems by extending timeout thresholds, opting instead to fix blocking execution?
109
- ✅ Is the incident Post-Mortem methodology completely focused on systemic flaws rather than human error?
110
- ✅ Were cross-service infrastructure layers (DBs, Redis, Load Balancers) cleared before deep-diving into local code?
111
- ✅ Has an action item been established verifying how future iterations of this failure will be programmatically blocked?
112
- ✅ Are there explicit mechanisms checking for PII leakage before increasing diagnostic logging verbosity in production?
113
- ```
1
+ ---
2
+ name: devops-incident-responder
3
+ description: Production incident response mastery. MTTR (Mean Time to Recovery) reduction, blameless post-mortems, rapid triaging, halting systemic cascading failures, isolating problematic deployments, and evidence-based forensic analysis. Use when stabilizing broken systems, fighting active production fires, or conducting root-cause post-mortems.
4
+ allowed-tools: Read, Write, Edit, Glob, Grep
5
+ version: 2.0.0
6
+ last-updated: 2026-04-02
7
+ applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
8
+ ---
9
+
10
+ ## Hallucination Traps (Read First)
11
+ - ❌ Changing code during an active incident -> ✅ STABILIZE first (rollback, feature flag, traffic shift), investigate AFTER
12
+ - Assigning blame in post-mortems -> Blameless post-mortems focus on systemic causes, not individual errors
13
+ - Skipping the 'what went well' section -> Understanding what prevented worse outcomes is as valuable as the root cause
14
+
15
+ ---
16
+
17
+
18
+ # Incident Responder — Production Stabilization Mastery
19
+
20
+ ---
21
+
22
+ ## 1. The Prime Directive (Stop the Bleeding)
23
+
24
+ When an outage is declared (e.g., 502 Bad Gateway across the entire primary cluster), do not ask the developer to check the database logs to figure out why the code crashed.
25
+
26
+ **Immediate Action Pipeline:**
27
+ 1. **Identify the Trigger:** What changed in the last 15 minutes? (90% of outages are caused by deployments).
28
+ 2. **Revert the Change:** Execute the emergency rollback pipeline instantly. Revert the Git commit, swap the Docker tag, or disable the Feature Flag.
29
+ 3. **Verify Stabilization:** Ensure metrics return to healthy thresholds.
30
+ 4. **Communicate:** "Mitigation complete. Services restored. Root cause investigation underway."
31
+
32
+ ---
33
+
34
+ ## 2. Isolating Cascading Failures
35
+
36
+ A cascading failure occurs when Service A dies, causing Service B to overload with retries, which kills Service B, which kills the database.
37
+
38
+ **The Circuit Breaker Protocol:**
39
+ If a downstream dependency is dead, sever it immediately to save the rest of the ecosystem.
40
+
41
+ ```javascript
42
+ // ❌ VULNERABLE: Infinite Retry Death Spiral
43
+ async function fetchUser(id) {
44
+ while(true) {
45
+ try { return await api.get(`/user/${id}`); }
46
+ catch { await sleep(100); } // Hundreds of containers doing this will execute a DDoSing attack on the API
47
+ }
48
+ }
49
+
50
+ // ✅ RESILIENT: Circuit Breaking / Fallbacks
51
+ const breaker = new CircuitBreaker(fetchUser, {
52
+ errorThresholdPercentage: 50, // If 50% of requests fail...
53
+ resetTimeout: 30000 // Open the circuit (stop sending requests) for 30s
54
+ });
55
+
56
+ breaker.fallback(() => ({ id: "cached-user", status: "degraded" }));
57
+ ```
58
+
59
+ **Heavy Mitigation Tactics:**
60
+ - **Shed Load:** Aggressively drop non-critical traffic (e.g., disable background syncs, temporarily ban aggressive scraping IPs).
61
+ - **Scale Out (Band-Aid):** If the memory leak is crashing nodes every 10 minutes, scale the nodes up 3x to buy yourself 30 minutes of runway to find the actual bug.
62
+
63
+ ---
64
+
65
+ ## 3. The Investigative Triage Routine
66
+
67
+ Once the bleeding is stopped (or if you are investigating a non-fatal anomaly), follow the data strictly:
68
+
69
+ 1. **Metrics (The "What"):** Look at the Dashboards. Did latency spike? Did CPU pin at 100%? Did Database active connections max out?
70
+ 2. **Traces (The "Where"):** Look at OpenTelemetry/Datadog traces. Which specific microservice is the bottleneck?
71
+ 3. **Logs (The "Why"):** Query the centralized logs (Splunk/Elastic/CloudWatch) exactly around the timestamp the trace spiked.
72
+
73
+ ---
74
+
75
+ ## 4. The Blameless Post-Mortem
76
+
77
+ Incident response does not end when the system recovers. It ends when the system is architected to survive the same failure tomorrow automatically.
78
+
79
+ **A Professional Post-Mortem Must Include:**
80
+ 1. **The Timeline:** Chronological factual representation of the event to the minute.
81
+ 2. **Root Cause Analysis (The 5 Whys):**
82
+ - *Why did the site go down?* DB exhausted connections.
83
+ - *Why did it exhaust?* The new background worker didn't pool connections.
84
+ - *Why did the worker deploy?* It bypassed CI tests for speed.
85
+ 3. **Action Items:** Tangible Jira tickets preventing recurrence (e.g., "Implement PgBouncer connection limits", "Enforce CI checks block on all branches").
86
+
87
+ ---
@@ -1,6 +1,6 @@
1
1
  # Antigravity Skills
2
2
 
3
- > **Guide to creating and using Skills in the Antigravity Kit**
3
+ **Guide to creating and using Skills in the Antigravity Kit**
4
4
 
5
5
  ---
6
6
 
@@ -16,9 +16,9 @@ While Antigravity's base models (like Gemini) are powerful generalists, they don
16
16
 
17
17
  Skills are folder-based packages. You can define these scopes based on your needs:
18
18
 
19
- | Scope | Path | Description |
20
- | ------------- | --------------------------------- | ------------------------------------ |
21
- | **Workspace** | `<workspace-root>/.agent/skills/` | Available only in a specific project |
19
+ |Scope|Path|Description|
20
+ |-------------|---------------------------------|------------------------------------|
21
+ |**Workspace**|`<workspace-root>/.agent/skills/`|Available only in a specific project|
22
22
 
23
23
  ### Skill Directory Structure
24
24
 
@@ -68,7 +68,7 @@ When reviewing code, follow these steps:
68
68
  - Suggest alternatives when possible
69
69
  ```
70
70
 
71
- > **Note**: The `SKILL.md` file contains metadata (name, description) at the top, followed by the instructions. The agent will only read the metadata and load the full instructions only when needed.
71
+ **Note**: The `SKILL.md` file contains metadata (name, description) at the top, followed by the instructions. The agent will only read the metadata and load the full instructions only when needed.
72
72
 
73
73
  ### Try it out
74
74
 
@@ -7,22 +7,27 @@ last-updated: 2026-03-12
7
7
  applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
8
8
  ---
9
9
 
10
- # Documentation Standards
10
+ ## Hallucination Traps (Read First)
11
+ - ❌ Writing documentation that only AI-generated code can understand -> ✅ Docs are for HUMANS; use clear language and real examples
12
+ - ❌ Documenting implementation details instead of behavior -> ✅ Document WHAT it does and WHY, not HOW (code shows how)
13
+ - ❌ Skipping the 'Quick Start' section -> ✅ The first 30 seconds of a README determine if someone uses your project
14
+
15
+ ---
11
16
 
12
- > Documentation is a product. It has users. Those users are often future-you,
13
- > three months from now, having completely forgotten how this works.
17
+
18
+ # Documentation Standards
14
19
 
15
20
  ---
16
21
 
17
22
  ## Documentation Types and Their Audiences
18
23
 
19
- | Type | Audience | Goal |
24
+ |Type|Audience|Goal|
20
25
  |---|---|---|
21
- | README | New developer joining the project | "Get me running in 10 minutes" |
22
- | API docs | External integrator or frontend dev | "Tell me exactly what I can call and what I'll get back" |
23
- | Architecture decision (ADR) | Future engineer inheriting the codebase | "Tell me why it works this way, not just how" |
24
- | Code comment | Reviewer, maintainer | "Explain the non-obvious; skip the obvious" |
25
- | Runbook | On-call engineer at 2am | "Tell me what to do, not what to think about" |
26
+ |README|New developer joining the project|"Get me running in 10 minutes"|
27
+ |API docs|External integrator or frontend dev|"Tell me exactly what I can call and what I'll get back"|
28
+ |Architecture decision (ADR)|Future engineer inheriting the codebase|"Tell me why it works this way, not just how"|
29
+ |Code comment|Reviewer, maintainer|"Explain the non-obvious; skip the obvious"|
30
+ |Runbook|On-call engineer at 2am|"Tell me what to do, not what to think about"|
26
31
 
27
32
  ---
28
33
 
@@ -31,13 +36,13 @@ applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
31
36
  The Tribunal Agent Kit supports 5 standard Agent Design Kit (ADK) base patterns.
32
37
  To build a skill using a robust, tested agent behavior model, add `pattern: [pattern-name]` to the YAML frontmatter of your `SKILL.md`.
33
38
 
34
- | Pattern | Value | When to use |
39
+ |Pattern|Value|When to use|
35
40
  |---|---|---|
36
- | **Inversion** | `pattern: inversion` | Forces the agent to interview the user (Socratic Gate) before acting. |
37
- | **Reviewer** | `pattern: reviewer` | Evaluates artifacts against a checklist and severity levels. |
38
- | **Tool Wrapper** | `pattern: tool-wrapper` | Strictly executes external CLI tools via provided documentation without guessing. |
39
- | **Generator** | `pattern: generator` | Produces structured output (docs, boilerplate) by filling a rigid template. |
40
- | **Pipeline** | `pattern: pipeline` | Executes sequential tasks with strict halting gates between steps. |
41
+ |**Inversion**|`pattern: inversion`|Forces the agent to interview the user (Socratic Gate) before acting.|
42
+ |**Reviewer**|`pattern: reviewer`|Evaluates artifacts against a checklist and severity levels.|
43
+ |**Tool Wrapper**|`pattern: tool-wrapper`|Strictly executes external CLI tools via provided documentation without guessing.|
44
+ |**Generator**|`pattern: generator`|Produces structured output (docs, boilerplate) by filling a rigid template.|
45
+ |**Pipeline**|`pattern: pipeline`|Executes sequential tasks with strict halting gates between steps.|
41
46
 
42
47
  *Templates defining the specific rules for these patterns live in `.agent/patterns/`.*
43
48
 
@@ -79,10 +84,10 @@ src/
79
84
 
80
85
  ## Environment Variables
81
86
 
82
- | Variable | Required | Description |
87
+ |Variable|Required|Description|
83
88
  |---|---|---|
84
- | DATABASE_URL | Yes | PostgreSQL connection string |
85
- | JWT_SECRET | Yes | Secret for signing JWTs |
89
+ |DATABASE_URL|Yes|PostgreSQL connection string|
90
+ |JWT_SECRET|Yes|Secret for signing JWTs|
86
91
 
87
92
  ## Running Tests
88
93
 
@@ -118,11 +123,11 @@ Creates a new user account.
118
123
 
119
124
  **Responses**
120
125
 
121
- | Status | Meaning | Body |
126
+ |Status|Meaning|Body|
122
127
  |---|---|---|
123
- | 201 | User created | `{ data: User }` |
124
- | 400 | Validation failed | `{ error: string, details: string[] }` |
125
- | 409 | Email already exists | `{ error: string }` |
128
+ |201|User created|`{ data: User }`|
129
+ |400|Validation failed|`{ error: string, details: string[] }`|
130
+ |409|Email already exists|`{ error: string }`|
126
131
 
127
132
  **Example**
128
133
  \`\`\`bash
@@ -221,45 +226,4 @@ VBC status: PENDING → VERIFIED
221
226
  Evidence: [link to terminal output, test result, or file diff]
222
227
  ```
223
228
 
224
-
225
-
226
- ---
227
-
228
- ## 🤖 LLM-Specific Traps
229
-
230
- AI coding assistants often fall into specific bad habits when dealing with this domain. These are strictly forbidden:
231
-
232
- 1. **Over-engineering:** Proposing complex abstractions or distributed systems when a simpler approach suffices.
233
- 2. **Hallucinated Libraries/Methods:** Using non-existent methods or packages. Always `// VERIFY` or check `package.json` / `requirements.txt`.
234
- 3. **Skipping Edge Cases:** Writing the "happy path" and ignoring error handling, timeouts, or data validation.
235
- 4. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
236
- 5. **Silent Degradation:** Catching and suppressing errors without logging or re-raising.
237
-
238
229
  ---
239
-
240
- ## 🏛️ Tribunal Integration (Anti-Hallucination)
241
-
242
- **Slash command: `/review` or `/tribunal-full`**
243
- **Active reviewers: `logic-reviewer` · `security-auditor`**
244
-
245
- ### ❌ Forbidden AI Tropes
246
-
247
- 1. **Blind Assumptions:** Never make an assumption without documenting it clearly with `// VERIFY: [reason]`.
248
- 2. **Silent Degradation:** Catching and suppressing errors without logging or handling.
249
- 3. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
250
-
251
- ### ✅ Pre-Flight Self-Audit
252
-
253
- Review these questions before confirming output:
254
- ```
255
- ✅ Did I rely ONLY on real, verified tools and methods?
256
- ✅ Is this solution appropriately scoped to the user's constraints?
257
- ✅ Did I handle potential failure modes and edge cases?
258
- ✅ Have I avoided generic boilerplate that doesn't add value?
259
- ```
260
-
261
- ### 🛑 Verification-Before-Completion (VBC) Protocol
262
-
263
- **CRITICAL:** You must follow a strict "evidence-based closeout" state machine.
264
- - ❌ **Forbidden:** Declaring a task complete because the output "looks correct."
265
- - ✅ **Required:** You are explicitly forbidden from finalizing any task without providing **concrete evidence** (terminal output, passing tests, compile success, or equivalent proof) that your output works as intended.