opencode-metis 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (156) hide show
  1. package/README.md +140 -0
  2. package/dist/cli.cjs +63 -0
  3. package/dist/mcp-server.cjs +51 -0
  4. package/dist/plugin.cjs +4 -0
  5. package/dist/worker.cjs +224 -0
  6. package/opencode/agent/the-analyst/feature-prioritization.md +66 -0
  7. package/opencode/agent/the-analyst/market-research.md +77 -0
  8. package/opencode/agent/the-analyst/project-coordination.md +81 -0
  9. package/opencode/agent/the-analyst/requirements-analysis.md +77 -0
  10. package/opencode/agent/the-architect/compatibility-review.md +138 -0
  11. package/opencode/agent/the-architect/complexity-review.md +137 -0
  12. package/opencode/agent/the-architect/quality-review.md +67 -0
  13. package/opencode/agent/the-architect/security-review.md +127 -0
  14. package/opencode/agent/the-architect/system-architecture.md +119 -0
  15. package/opencode/agent/the-architect/system-documentation.md +83 -0
  16. package/opencode/agent/the-architect/technology-research.md +85 -0
  17. package/opencode/agent/the-chief.md +79 -0
  18. package/opencode/agent/the-designer/accessibility-implementation.md +101 -0
  19. package/opencode/agent/the-designer/design-foundation.md +74 -0
  20. package/opencode/agent/the-designer/interaction-architecture.md +75 -0
  21. package/opencode/agent/the-designer/user-research.md +70 -0
  22. package/opencode/agent/the-meta-agent.md +155 -0
  23. package/opencode/agent/the-platform-engineer/ci-cd-pipelines.md +109 -0
  24. package/opencode/agent/the-platform-engineer/containerization.md +106 -0
  25. package/opencode/agent/the-platform-engineer/data-architecture.md +81 -0
  26. package/opencode/agent/the-platform-engineer/dependency-review.md +144 -0
  27. package/opencode/agent/the-platform-engineer/deployment-automation.md +81 -0
  28. package/opencode/agent/the-platform-engineer/infrastructure-as-code.md +107 -0
  29. package/opencode/agent/the-platform-engineer/performance-tuning.md +82 -0
  30. package/opencode/agent/the-platform-engineer/pipeline-engineering.md +81 -0
  31. package/opencode/agent/the-platform-engineer/production-monitoring.md +105 -0
  32. package/opencode/agent/the-qa-engineer/exploratory-testing.md +66 -0
  33. package/opencode/agent/the-qa-engineer/performance-testing.md +81 -0
  34. package/opencode/agent/the-qa-engineer/quality-assurance.md +77 -0
  35. package/opencode/agent/the-qa-engineer/test-execution.md +66 -0
  36. package/opencode/agent/the-software-engineer/api-development.md +78 -0
  37. package/opencode/agent/the-software-engineer/component-development.md +79 -0
  38. package/opencode/agent/the-software-engineer/concurrency-review.md +141 -0
  39. package/opencode/agent/the-software-engineer/domain-modeling.md +66 -0
  40. package/opencode/agent/the-software-engineer/performance-optimization.md +113 -0
  41. package/opencode/command/analyze.md +149 -0
  42. package/opencode/command/constitution.md +178 -0
  43. package/opencode/command/debug.md +194 -0
  44. package/opencode/command/document.md +178 -0
  45. package/opencode/command/implement.md +225 -0
  46. package/opencode/command/refactor.md +207 -0
  47. package/opencode/command/review.md +229 -0
  48. package/opencode/command/simplify.md +267 -0
  49. package/opencode/command/specify.md +191 -0
  50. package/opencode/command/validate.md +224 -0
  51. package/opencode/skill/accessibility-design/SKILL.md +566 -0
  52. package/opencode/skill/accessibility-design/checklists/wcag-checklist.md +435 -0
  53. package/opencode/skill/agent-coordination/SKILL.md +224 -0
  54. package/opencode/skill/api-contract-design/SKILL.md +550 -0
  55. package/opencode/skill/api-contract-design/templates/graphql-schema-template.md +818 -0
  56. package/opencode/skill/api-contract-design/templates/rest-api-template.md +417 -0
  57. package/opencode/skill/architecture-design/SKILL.md +160 -0
  58. package/opencode/skill/architecture-design/examples/architecture-examples.md +170 -0
  59. package/opencode/skill/architecture-design/template.md +749 -0
  60. package/opencode/skill/architecture-design/validation.md +99 -0
  61. package/opencode/skill/architecture-selection/SKILL.md +522 -0
  62. package/opencode/skill/architecture-selection/examples/adrs/001-example-adr.md +71 -0
  63. package/opencode/skill/architecture-selection/examples/architecture-patterns.md +239 -0
  64. package/opencode/skill/bug-diagnosis/SKILL.md +235 -0
  65. package/opencode/skill/code-quality-review/SKILL.md +337 -0
  66. package/opencode/skill/code-quality-review/examples/anti-patterns.md +629 -0
  67. package/opencode/skill/code-quality-review/reference.md +322 -0
  68. package/opencode/skill/code-review/SKILL.md +363 -0
  69. package/opencode/skill/code-review/reference.md +450 -0
  70. package/opencode/skill/codebase-analysis/SKILL.md +139 -0
  71. package/opencode/skill/codebase-navigation/SKILL.md +227 -0
  72. package/opencode/skill/codebase-navigation/examples/exploration-patterns.md +263 -0
  73. package/opencode/skill/coding-conventions/SKILL.md +178 -0
  74. package/opencode/skill/coding-conventions/checklists/accessibility-checklist.md +176 -0
  75. package/opencode/skill/coding-conventions/checklists/performance-checklist.md +154 -0
  76. package/opencode/skill/coding-conventions/checklists/security-checklist.md +127 -0
  77. package/opencode/skill/constitution-validation/SKILL.md +315 -0
  78. package/opencode/skill/constitution-validation/examples/CONSTITUTION.md +202 -0
  79. package/opencode/skill/constitution-validation/reference/rule-patterns.md +328 -0
  80. package/opencode/skill/constitution-validation/template.md +115 -0
  81. package/opencode/skill/context-preservation/SKILL.md +445 -0
  82. package/opencode/skill/data-modeling/SKILL.md +385 -0
  83. package/opencode/skill/data-modeling/templates/schema-design-template.md +268 -0
  84. package/opencode/skill/deployment-pipeline-design/SKILL.md +579 -0
  85. package/opencode/skill/deployment-pipeline-design/templates/pipeline-template.md +633 -0
  86. package/opencode/skill/documentation-extraction/SKILL.md +259 -0
  87. package/opencode/skill/documentation-sync/SKILL.md +431 -0
  88. package/opencode/skill/domain-driven-design/SKILL.md +509 -0
  89. package/opencode/skill/domain-driven-design/examples/ddd-patterns.md +688 -0
  90. package/opencode/skill/domain-driven-design/reference.md +465 -0
  91. package/opencode/skill/drift-detection/SKILL.md +383 -0
  92. package/opencode/skill/drift-detection/reference.md +340 -0
  93. package/opencode/skill/error-recovery/SKILL.md +162 -0
  94. package/opencode/skill/error-recovery/examples/error-patterns.md +484 -0
  95. package/opencode/skill/feature-prioritization/SKILL.md +419 -0
  96. package/opencode/skill/feature-prioritization/examples/rice-template.md +139 -0
  97. package/opencode/skill/feature-prioritization/reference.md +256 -0
  98. package/opencode/skill/git-workflow/SKILL.md +453 -0
  99. package/opencode/skill/implementation-planning/SKILL.md +215 -0
  100. package/opencode/skill/implementation-planning/examples/phase-examples.md +217 -0
  101. package/opencode/skill/implementation-planning/template.md +220 -0
  102. package/opencode/skill/implementation-planning/validation.md +88 -0
  103. package/opencode/skill/implementation-verification/SKILL.md +272 -0
  104. package/opencode/skill/knowledge-capture/SKILL.md +265 -0
  105. package/opencode/skill/knowledge-capture/reference/knowledge-capture.md +402 -0
  106. package/opencode/skill/knowledge-capture/reference.md +444 -0
  107. package/opencode/skill/knowledge-capture/templates/domain-template.md +325 -0
  108. package/opencode/skill/knowledge-capture/templates/interface-template.md +255 -0
  109. package/opencode/skill/knowledge-capture/templates/pattern-template.md +144 -0
  110. package/opencode/skill/observability-design/SKILL.md +291 -0
  111. package/opencode/skill/observability-design/references/monitoring-patterns.md +461 -0
  112. package/opencode/skill/pattern-detection/SKILL.md +171 -0
  113. package/opencode/skill/pattern-detection/examples/common-patterns.md +359 -0
  114. package/opencode/skill/performance-analysis/SKILL.md +266 -0
  115. package/opencode/skill/performance-analysis/references/profiling-tools.md +499 -0
  116. package/opencode/skill/requirements-analysis/SKILL.md +139 -0
  117. package/opencode/skill/requirements-analysis/examples/good-prd.md +66 -0
  118. package/opencode/skill/requirements-analysis/template.md +177 -0
  119. package/opencode/skill/requirements-analysis/validation.md +69 -0
  120. package/opencode/skill/requirements-elicitation/SKILL.md +518 -0
  121. package/opencode/skill/requirements-elicitation/examples/interview-questions.md +226 -0
  122. package/opencode/skill/requirements-elicitation/examples/user-stories.md +414 -0
  123. package/opencode/skill/safe-refactoring/SKILL.md +312 -0
  124. package/opencode/skill/safe-refactoring/reference/code-smells.md +347 -0
  125. package/opencode/skill/security-assessment/SKILL.md +421 -0
  126. package/opencode/skill/security-assessment/checklists/security-review-checklist.md +285 -0
  127. package/opencode/skill/specification-management/SKILL.md +143 -0
  128. package/opencode/skill/specification-management/readme-template.md +32 -0
  129. package/opencode/skill/specification-management/reference.md +115 -0
  130. package/opencode/skill/specification-management/spec.py +229 -0
  131. package/opencode/skill/specification-validation/SKILL.md +397 -0
  132. package/opencode/skill/specification-validation/reference/3cs-framework.md +306 -0
  133. package/opencode/skill/specification-validation/reference/ambiguity-detection.md +132 -0
  134. package/opencode/skill/specification-validation/reference/constitution-validation.md +301 -0
  135. package/opencode/skill/specification-validation/reference/drift-detection.md +383 -0
  136. package/opencode/skill/task-delegation/SKILL.md +607 -0
  137. package/opencode/skill/task-delegation/examples/file-coordination.md +495 -0
  138. package/opencode/skill/task-delegation/examples/parallel-research.md +337 -0
  139. package/opencode/skill/task-delegation/examples/sequential-build.md +504 -0
  140. package/opencode/skill/task-delegation/reference.md +825 -0
  141. package/opencode/skill/tech-stack-detection/SKILL.md +89 -0
  142. package/opencode/skill/tech-stack-detection/references/framework-signatures.md +598 -0
  143. package/opencode/skill/technical-writing/SKILL.md +190 -0
  144. package/opencode/skill/technical-writing/templates/adr-template.md +205 -0
  145. package/opencode/skill/technical-writing/templates/system-doc-template.md +380 -0
  146. package/opencode/skill/test-design/SKILL.md +464 -0
  147. package/opencode/skill/test-design/examples/test-pyramid.md +724 -0
  148. package/opencode/skill/testing/SKILL.md +213 -0
  149. package/opencode/skill/testing/examples/test-pyramid.md +724 -0
  150. package/opencode/skill/user-insight-synthesis/SKILL.md +576 -0
  151. package/opencode/skill/user-insight-synthesis/templates/research-plan-template.md +217 -0
  152. package/opencode/skill/user-research/SKILL.md +508 -0
  153. package/opencode/skill/user-research/examples/interview-questions.md +265 -0
  154. package/opencode/skill/user-research/examples/personas.md +267 -0
  155. package/opencode/skill/vibe-security/SKILL.md +654 -0
  156. package/package.json +45 -0
@@ -0,0 +1,291 @@
1
+ ---
2
+ name: observability-design
3
+ description: "Monitoring strategies, distributed tracing, SLI/SLO design, and alerting patterns. Use when designing monitoring infrastructure, defining service level objectives, implementing distributed tracing, creating alert rules, building dashboards, or establishing incident response procedures."
4
+ license: MIT
5
+ compatibility: opencode
6
+ metadata:
7
+ category: infrastructure
8
+ version: "1.0"
9
+ ---
10
+
11
+ # Observability Design
12
+
13
+ Roleplay as an observability design specialist for building monitoring, alerting, and diagnostic systems that turn telemetry into actionable insight.
14
+
15
+ ObservabilityDesign {
16
+ Activation {
17
+ Designing monitoring infrastructure for new services
18
+ Defining SLIs, SLOs, and error budgets for reliability
19
+ Implementing distributed tracing across microservices
20
+ Creating alert rules that minimize noise and maximize signal
21
+ Building dashboards for operations and business stakeholders
22
+ Establishing incident response and postmortem processes
23
+ Diagnosing production issues through telemetry correlation
24
+ }
25
+
26
+ Constraints {
27
+ 1. You cannot fix what you cannot see
28
+ 2. Observability is about answering questions you have not thought to ask yet
29
+ 3. Turn every incident into a learning opportunity
30
+ 4. Turn every metric into actionable insight
31
+ 5. Correlate metrics, logs, and traces with shared identifiers
32
+ }
33
+
34
+ ThreePillars {
35
+ Metrics {
36
+ Description: "Numeric measurements aggregated over time. Best for understanding system behavior at scale."
37
+
38
+ Characteristics {
39
+ Highly efficient storage (aggregated values)
40
+ Support mathematical operations (rates, percentiles)
41
+ Enable alerting on thresholds
42
+ Limited cardinality (avoid high-cardinality labels)
43
+ }
44
+
45
+ Types {
46
+ | Type | Use Case | Example |
47
+ |------|----------|---------|
48
+ | Counter | Cumulative values that only increase | Total requests, errors, bytes sent |
49
+ | Gauge | Values that go up and down | Current memory, active connections |
50
+ | Histogram | Distribution of values in buckets | Request latency, payload sizes |
51
+ | Summary | Similar to histogram, calculated client-side | Pre-computed percentiles |
52
+ }
53
+ }
54
+
55
+ Logs {
56
+ Description: "Immutable records of discrete events. Best for understanding specific occurrences."
57
+
58
+ Characteristics {
59
+ Rich context and arbitrary data
60
+ Expensive to store and query at scale
61
+ Essential for debugging specific issues
62
+ Should be structured (JSON) for parseability
63
+ }
64
+
65
+ Structure {
66
+ ```
67
+ Required fields:
68
+ - timestamp: ISO 8601 format with timezone
69
+ - level: ERROR, WARN, INFO, DEBUG
70
+ - message: Human-readable description
71
+ - service: Service identifier
72
+ - trace_id: Correlation identifier
73
+
74
+ Context fields:
75
+ - user_id: Sanitized user identifier
76
+ - request_id: Request correlation
77
+ - duration_ms: Operation timing
78
+ - error_type: Classification for errors
79
+ ```
80
+ }
81
+ }
82
+
83
+ Traces {
84
+ Description: "Records of request flow across distributed systems. Best for understanding causality and latency."
85
+
86
+ Characteristics {
87
+ Show request path through services
88
+ Identify latency bottlenecks
89
+ Reveal dependencies and failure points
90
+ Higher overhead than metrics
91
+ }
92
+
93
+ Components {
94
+ Trace: "Complete request journey"
95
+ Span: "Single operation within a trace"
96
+ Context: "Metadata propagated across services"
97
+ }
98
+ }
99
+ }
100
+
101
+ SLISLOFramework {
102
+ ServiceLevelIndicators {
103
+ Description: "Quantitative measures of service behavior from the user perspective"
104
+
105
+ Categories {
106
+ | Category | Measures | Example SLI |
107
+ |----------|----------|-------------|
108
+ | Availability | Service is responding | % of successful requests |
109
+ | Latency | Response speed | % of requests < 200ms |
110
+ | Throughput | Capacity | Requests processed per second |
111
+ | Error Rate | Correctness | % of requests without errors |
112
+ | Freshness | Data currency | % of data < 1 minute old |
113
+ }
114
+
115
+ SpecificationExample {
116
+ ```
117
+ SLI: Request Latency
118
+ Definition: Time from request received to response sent
119
+ Measurement: Server-side histogram at p50, p95, p99
120
+ Exclusions: Health checks, internal tooling
121
+ Data source: Application metrics
122
+ ```
123
+ }
124
+ }
125
+
126
+ ServiceLevelObjectives {
127
+ Description: "Target reliability levels for SLIs over a time window"
128
+
129
+ Formula {
130
+ ```
131
+ SLO = (Good events / Total events) >= Target over Window
132
+
133
+ Example:
134
+ 99.9% of requests complete successfully in < 200ms
135
+ measured over a 30-day rolling window
136
+ ```
137
+ }
138
+
139
+ SettingTargets {
140
+ Start with current baseline performance
141
+ Consider user expectations and business impact
142
+ Balance reliability investment against feature velocity
143
+ Document error budget policy
144
+ }
145
+ }
146
+
147
+ ErrorBudgets {
148
+ Description: "The allowed amount of unreliability within an SLO"
149
+
150
+ Calculation {
151
+ ```
152
+ Error Budget = 1 - SLO Target
153
+
154
+ 99.9% SLO = 0.1% error budget
155
+ = 43.2 minutes downtime per 30 days
156
+ = 8.64 seconds per day
157
+ ```
158
+ }
159
+
160
+ Policies {
161
+ BudgetRemaining: "Continue feature development"
162
+ BudgetDepleted: "Focus on reliability work"
163
+ BudgetBurningFast: "Freeze deploys, investigate"
164
+ }
165
+ }
166
+ }
167
+
168
+ AlertingStrategies {
169
+ SymptomBasedAlerts {
170
+ Description: "Alert on user-visible symptoms, not internal causes"
171
+
172
+ GoodAlerts {
173
+ Error rate exceeds threshold (users experiencing failures)
174
+ Latency SLO at risk (users experiencing slowness)
175
+ Queue depth growing (backlog affecting users)
176
+ }
177
+
178
+ PoorAlerts {
179
+ CPU at 80% (may not affect users)
180
+ Pod restarted (self-healing, may not affect users)
181
+ Disk at 70% (not yet impacting service)
182
+ }
183
+ }
184
+
185
+ MultiWindowMultiBurnRate {
186
+ Description: "Detect fast burns quickly, slow burns before budget depletion"
187
+
188
+ Configuration {
189
+ ```
190
+ Fast burn: 14.4x burn rate over 1 hour
191
+ - Fires in 1 hour if issue persists
192
+ - Catches severe incidents quickly
193
+
194
+ Slow burn: 3x burn rate over 3 days
195
+ - Fires before 30-day budget depletes
196
+ - Catches gradual degradation
197
+ ```
198
+ }
199
+ }
200
+
201
+ AlertFatiguePrevention {
202
+ Strategies {
203
+ Alert only on actionable issues
204
+ Consolidate related alerts
205
+ Set meaningful thresholds (not arbitrary)
206
+ Require sustained condition before firing
207
+ Include runbook links in every alert
208
+ Review and prune alerts quarterly
209
+ }
210
+
211
+ QualityChecklist {
212
+ Can someone take action right now?
213
+ Is the severity appropriate?
214
+ Does it include enough context?
215
+ Is there a runbook linked?
216
+ Has it fired false positives recently?
217
+ }
218
+ }
219
+ }
220
+
221
+ DashboardDesign {
222
+ Hierarchy {
223
+ ServiceHealthOverview {
224
+ High-level SLO status
225
+ Error budget consumption
226
+ Key business metrics
227
+ Designed for quick triage
228
+ }
229
+
230
+ DeepDiveDiagnostic {
231
+ Detailed metrics breakdown
232
+ Resource utilization
233
+ Dependency health
234
+ Designed for investigation
235
+ }
236
+
237
+ BusinessMetrics {
238
+ User-facing KPIs
239
+ Conversion and engagement
240
+ Revenue impact
241
+ Designed for stakeholders
242
+ }
243
+ }
244
+
245
+ Principles {
246
+ Answer specific questions, not show all data
247
+ Use consistent color coding (green=good, red=bad)
248
+ Show time ranges appropriate to the metric
249
+ Include context (deployments, incidents) on graphs
250
+ Mobile-responsive for on-call use
251
+ Provide drill-down paths to detailed views
252
+ }
253
+
254
+ EssentialPanels {
255
+ | Panel | Purpose | Audience |
256
+ |-------|---------|----------|
257
+ | SLO Status | Current reliability vs target | Everyone |
258
+ | Error Budget | Remaining budget and burn rate | Engineering |
259
+ | Request Rate | Traffic patterns and anomalies | Operations |
260
+ | Latency Distribution | p50, p95, p99 over time | Engineering |
261
+ | Error Breakdown | Errors by type and endpoint | Engineering |
262
+ | Dependency Health | Status of upstream services | Operations |
263
+ }
264
+ }
265
+
266
+ BestPractices {
267
+ Correlate metrics, logs, and traces with shared identifiers
268
+ Instrument code at service boundaries, not everywhere
269
+ Use structured logging with consistent field names
270
+ Set retention policies appropriate to data value
271
+ Test alerts in staging before production
272
+ Document SLOs and share with stakeholders
273
+ Conduct regular game days to validate observability
274
+ Automate common diagnostic procedures in runbooks
275
+ }
276
+
277
+ AntiPatterns {
278
+ Alert on every possible metric (alert fatigue)
279
+ Create dashboards without specific questions in mind
280
+ Log without structure or correlation IDs
281
+ Set SLOs without measuring current baseline
282
+ Ignore error budget policies when convenient
283
+ Treat all alerts with equal severity
284
+ Store high-cardinality data in metrics (use logs/traces)
285
+ Skip postmortems when issues resolve themselves
286
+ }
287
+ }
288
+
289
+ ## References
290
+
291
+ - [references/monitoring-patterns.md](references/monitoring-patterns.md) - Detailed implementation patterns and examples