npm - @luanpdd/kit-mcp - Versions diffs - 1.35.0 → 1.36.0 - Mend

@luanpdd/kit-mcp 1.35.0 → 1.36.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (117) hide show

package/bin/cli.js +2 -2
package/bin/mcp.js +6 -6
package/bin/ui.js +74 -74
package/gates/ai-prompt-stability.md +120 -120
package/gates/budget-description.md +68 -68
package/gates/confidence.md +29 -29
package/gates/dependency-check.md +33 -33
package/gates/dept-cycle-prevention.md +179 -179
package/gates/golden-signals-coverage.md +133 -133
package/gates/legacy-refactor-safety.md +178 -178
package/gates/multi-tenant-rls-coverage.md +102 -102
package/gates/no-personal-uuid.md +72 -72
package/gates/obs-agents-mcp-supabase.md +86 -86
package/gates/obs-skills-frontmatter.md +76 -76
package/gates/observability-coverage.md +151 -151
package/gates/omm-no-regression.md +83 -83
package/gates/postmortem-template-required.md +127 -127
package/gates/prr-checklist-coverage.md +128 -128
package/gates/regression.md +32 -32
package/gates/release-pipeline-policy.md +132 -132
package/gates/secrets-scan.md +33 -33
package/gates/service-role-not-in-user-facing.md +113 -113
package/gates/skill-must-include.md +71 -71
package/gates/sync-idempotent.md +62 -62
package/gates/verify-phase-goal.md +34 -34
package/kit/agents/designer-ui.md +216 -216
package/kit/agents/workflow-generator.md +537 -167
package/kit/commands/adicionar-backlog.md +1 -1
package/kit/commands/adicionar-fase.md +1 -1
package/kit/commands/adicionar-tarefa.md +1 -1
package/kit/commands/auditar-observabilidade.md +103 -103
package/kit/commands/auditar-toil.md +129 -129
package/kit/commands/caracterizar-prompt.md +195 -195
package/kit/commands/criar-workflow.md +158 -158
package/kit/commands/definir-perfil.md +1 -1
package/kit/commands/definir-slo.md +108 -108
package/kit/commands/fio.md +1 -1
package/kit/commands/golden-signals.md +142 -142
package/kit/commands/instrumentar-fase.md +200 -200
package/kit/commands/investigar-producao.md +162 -162
package/kit/commands/observabilidade.md +118 -118
package/kit/commands/postmortem.md +179 -179
package/kit/commands/prr.md +205 -205
package/kit/commands/publicar-rapido.md +207 -207
package/kit/commands/risk-budget.md +220 -220
package/kit/commands/sre.md +230 -230
package/kit/file-manifest.json +424 -424
package/kit/framework/references/output-style.md +22 -22
package/kit/hooks/post-apply-migration.js +199 -199
package/kit/hooks/sidecar-tool-publisher.js +210 -210
package/kit/skills/_shared-dados-distribuidos/glossary.md +224 -224
package/kit/skills/_shared-legacy/glossary.md +389 -389
package/kit/skills/_shared-multi-tenant/glossary.md +186 -186
package/kit/skills/_shared-observability/glossary.md +396 -396
package/kit/skills/_shared-sre/glossary.md +712 -712
package/kit/skills/_shared-supabase/glossary.md +234 -234
package/kit/skills/blameless-postmortems/SKILL.md +340 -340
package/kit/skills/burn-rate-alerting/SKILL.md +258 -258
package/kit/skills/cascading-failures/SKILL.md +311 -311
package/kit/skills/core-analysis-loop/SKILL.md +352 -352
package/kit/skills/distributed-tracing/SKILL.md +362 -362
package/kit/skills/dynamic-workflow-authoring/SKILL.md +327 -223
package/kit/skills/eliminating-toil/SKILL.md +243 -243
package/kit/skills/event-based-slos/SKILL.md +296 -296
package/kit/skills/four-golden-signals/SKILL.md +314 -314
package/kit/skills/hermetic-builds/SKILL.md +323 -323
package/kit/skills/legacy-monster-methods/SKILL.md +444 -444
package/kit/skills/llm-as-dependency/SKILL.md +436 -436
package/kit/skills/load-shedding-graceful-degradation/SKILL.md +396 -396
package/kit/skills/observability-driven-development/SKILL.md +315 -315
package/kit/skills/observability-maturity-model/SKILL.md +222 -222
package/kit/skills/opentelemetry-standard/SKILL.md +351 -351
package/kit/skills/production-readiness-review/SKILL.md +305 -305
package/kit/skills/release-engineering/SKILL.md +367 -367
package/kit/skills/retry-strategies/SKILL.md +372 -372
package/kit/skills/sre-risk-management/SKILL.md +221 -221
package/kit/skills/structured-events/SKILL.md +265 -265
package/kit/skills/supabase-cron-queues/SKILL.md +275 -275
package/kit/skills/supabase-database-functions/SKILL.md +332 -332
package/kit/skills/supabase-declarative-schema/SKILL.md +183 -183
package/kit/skills/supabase-pgvector-rag/SKILL.md +253 -253
package/kit/skills/supabase-postgres-style/SKILL.md +138 -138
package/kit/skills/supabase-storage/SKILL.md +234 -234
package/kit/skills/telemetry-pipelines/SKILL.md +259 -259
package/kit/skills/telemetry-sampling/SKILL.md +256 -256
package/kit/skills/ui-anti-padroes-ia/SKILL.md +261 -261
package/kit/skills/ui-contexto-produto/SKILL.md +248 -248
package/kit/skills/ui-cor-estrategia/SKILL.md +213 -213
package/kit/skills/ui-critica-auditoria/SKILL.md +260 -260
package/kit/skills/ui-motion-funcional/SKILL.md +264 -264
package/kit/skills/ui-ritmo-espacial/SKILL.md +259 -259
package/kit/skills/ui-tipografia/SKILL.md +211 -211
package/package.json +1 -1
package/src/cli/index.js +1114 -1114
package/src/cli/render.js +194 -194
package/src/cli/upgrade-check.js +135 -135
package/src/core/error-redaction.js +76 -76
package/src/core/failures.js +153 -153
package/src/core/gate-runner.js +205 -205
package/src/core/gates.js +82 -82
package/src/core/logger.js +170 -170
package/src/core/manifest-verify.js +174 -174
package/src/core/metrics.js +268 -268
package/src/core/notify.js +60 -60
package/src/core/path-safety.js +141 -141
package/src/core/replays.js +120 -120
package/src/core/ui.js +185 -185
package/src/mcp-server/install.js +149 -149
package/src/mcp-server/roots.js +124 -124
package/src/ui/auto-spawn.js +113 -113
package/src/ui/browser.js +78 -78
package/src/ui/client.js +130 -130
package/src/ui/events.js +65 -65
package/src/ui/lockfile.js +191 -191
package/src/ui/port.js +67 -67
package/src/ui/server.js +547 -547
package/src/ui/wrapper.js +129 -129

package/kit/skills/structured-events/SKILL.md CHANGED Viewed

@@ -1,265 +1,265 @@
----
-name: structured-events
-description: Use ao instrumentar — wide events de alta cardinalidade (1/request), campos canônicos com dot notation, evite logs unstructured e métricas pre-aggregated.
----
-# Observabilidade — Structured Events (Wide Events)
-## Quando usar
-LLM carrega esta skill quando instrumentar código para emitir telemetria. Trigger phrases:
-- "structured logging", "wide events", "observability events"
-- "instrumentar handler", "emitir telemetria", "log estruturado"
-- "como salvar evento de request"
-- "campos canônicos", "atributos de span"
-- "alta cardinalidade", "debug por user_id"
-## Regras absolutas
-- **1 evento por request** — não múltiplos. Acumule contexto durante o request, emita 1 wide event no final (ou em erros).
-- **Wide é melhor que narrow** — adicione campos liberalmente. Custo de 100 campos/evento ≈ 10 campos. Disco é barato; falta de campo no incidente é caro.
-- **Alta cardinalidade é OBRIGATÓRIA** — `user.id`, `tenant_id`, `request.id`, `customer.email`. Sem isso, observabilidade não funciona (Cap 1).
-- **Dot notation OTel** — `user.id` (não `userId` nem `user_id`). `error.type`, `http.status_code`, `db.query`. Snake_case apenas em colunas de DB.
-- **NUNCA pre-aggregate** — não emita "p99 latency = 247ms"; emita o `duration_ms` cru de cada request. Aggregation no read time.
-- **Estruturado, não texto livre** — JSON, OTel attributes, ou colunas tipadas. **Nunca** `console.log("user 123 did X at 12:34")`.
-- **Errors são especiais** — sample 100% de eventos com `result.success = false`. Sucesso pode ser samplado (skill `telemetry-sampling`).
-- **Capture context, não code** — emita atributos de business logic (`customer.tier`, `feature_flag.x`), não estado interno de código (`var_x_value_at_line_42`).
-## Patterns canônicos
-### Pattern: handler instrumentado (Node/TypeScript)
-```ts
-// PT-BR: 1 evento por request, alta cardinalidade, atributos canônicos
-import { trace, SpanStatusCode } from '@opentelemetry/api'
-const tracer = trace.getTracer('orders-service')
-export async function handlePlaceOrder(req: Request) {
-  return tracer.startActiveSpan('place_order', async (span) => {
-    // PT-BR: campos canônicos sempre — alta cardinalidade
-    span.setAttribute('user.id', req.user.id)
-    span.setAttribute('tenant_id', req.user.tenant)
-    span.setAttribute('customer.tier', req.user.tier)
-    span.setAttribute('request.id', req.headers['x-request-id'])
-    span.setAttribute('endpoint', '/api/v1/orders')
-    span.setAttribute('http.method', 'POST')
-    span.setAttribute('build_id', process.env.BUILD_ID ?? 'dev')
-    // PT-BR: feature flags como dimensões
-    span.setAttribute('feature_flag.new_pricing', req.flags.newPricing)
-    try {
-      const order = await createOrder(req.body)
-      // PT-BR: result e atributos de domínio
-      span.setAttribute('result.success', true)
-      span.setAttribute('order.id', order.id)
-      span.setAttribute('order.amount_cents', order.amount)
-      span.setAttribute('order.items_count', order.items.length)
-      span.setAttribute('http.status_code', 200)
-      span.setStatus({ code: SpanStatusCode.OK })
-      return order
-    } catch (e) {
-      // PT-BR: erros — sample 100%, classificar por tipo
-      span.setAttribute('result.success', false)
-      span.setAttribute('error.type', classifyError(e))
-      span.setAttribute('error.message', e.message)
-      span.setAttribute('http.status_code', e.statusCode ?? 500)
-      span.setStatus({ code: SpanStatusCode.ERROR, message: e.message })
-      throw e
-    } finally {
-      span.end()  // PT-BR: SEMPRE — duration_ms é calculado aqui
-    }
-  })
-}
-function classifyError(e: any): string {
-  if (e.code === 'P2002') return 'db_conflict'
-  if (e.statusCode === 401) return 'auth'
-  if (e.statusCode === 403) return 'authz'
-  if (e.statusCode === 422) return 'validation'
-  if (e.statusCode === 429) return 'rate_limit'
-  if (e.code === 'ETIMEDOUT') return 'timeout'
-  return 'unknown'
-}
-```
-### Pattern: Edge Function (Deno) com structured event
-```ts
-// PT-BR: Supabase Edge Function — 1 evento estruturado por invocação
-import { trace } from 'npm:@opentelemetry/api@1.9.0'
-const tracer = trace.getTracer('edge-process-emails')
-Deno.serve(async (req) => {
-  return tracer.startActiveSpan('process_emails', async (span) => {
-    const requestId = crypto.randomUUID()
-    span.setAttribute('request.id', requestId)
-    span.setAttribute('build_id', Deno.env.get('SUPABASE_GIT_SHA') ?? 'local')
-    try {
-      const body = await req.json()
-      span.setAttribute('user.id', body.user_id)
-      span.setAttribute('tenant_id', body.tenant_id)
-      span.setAttribute('email.batch_size', body.emails?.length ?? 0)
-      const result = await processBatch(body.emails)
-      span.setAttribute('result.success', true)
-      span.setAttribute('email.sent_count', result.sent)
-      span.setAttribute('email.failed_count', result.failed)
-      span.setAttribute('duration_ms', result.duration)
-      return new Response(JSON.stringify(result), { status: 200 })
-    } catch (e) {
-      span.setAttribute('result.success', false)
-      span.setAttribute('error.type', classify(e))
-      span.setAttribute('error.message', String(e))
-      return new Response(JSON.stringify({ error: 'failed' }), { status: 500 })
-    } finally {
-      span.end()
-    }
-  })
-})
-```
-### Pattern: campos canônicos por categoria
-| Categoria | Campos | Exemplo |
-|---|---|---|
-| **Identidade** | `user.id`, `tenant_id`, `session.id` | `"550e8400-e29b-41d4-..."` |
-| **Request** | `request.id`, `endpoint`, `http.method`, `http.status_code` | `"req_abc123"`, `"/api/v1/orders"`, `"POST"`, `200` |
-| **Resultado** | `result.success`, `error.type`, `error.message` | `true`, `"validation"`, `"email already exists"` |
-| **Performance** | `duration_ms`, `db.query_count`, `cache.hit` | `127`, `3`, `true` |
-| **Build/Deploy** | `build_id`, `service.version`, `region` | `"abc123f"`, `"v1.9.0"`, `"us-east-1"` |
-| **Business** | `customer.tier`, `order.amount_cents`, `feature_flag.<name>` | `"pro"`, `4990`, `true` |
-| **Tracing** | `trace.id`, `span.id`, `span.parent_id` | (auto via OTel) |
-### Pattern: query observability — encontrar pattern em wide events
-```sql
--- PT-BR: alta cardinalidade permite group by ad hoc — sem schema rigido
--- Exemplo: qual tenant + endpoint + error_type domina os erros da última hora?
-select
-  tenant_id,
-  endpoint,
-  error_type,
-  count(*) as error_count,
-  avg(duration_ms) as avg_duration
-from observability.events
-where
-  result_success = false
-  and timestamp > now() - interval '1 hour'
-group by tenant_id, endpoint, error_type
-order by error_count desc
-limit 20;
-```
-## Anti-patterns
-### ANTI: log unstructured
-```ts
-// PT-BR: BAD — não estruturado, não queryable, sem alta cardinalidade
-console.log(`User ${userId} placed order ${orderId} for $${amount}`)
-// PT-BR: GOOD — structured wide event
-span.setAttribute('user.id', userId)
-span.setAttribute('order.id', orderId)
-span.setAttribute('order.amount_cents', amount * 100)
-```
-### ANTI: pre-aggregate em métricas
-```ts
-// PT-BR: BAD — pre-aggregation perde alta cardinalidade
-metrics.histogram('order_latency_ms').record(duration, { service: 'orders' })
-// PT-BR: GOOD — emit raw event, agregue no read
-span.setAttribute('duration_ms', duration)
-// PT-BR: ao queryar: SELECT percentile_cont(0.99) WITHIN GROUP (ORDER BY duration_ms)
-//        FROM events GROUP BY tenant_id, endpoint  -- alta cardinalidade preservada!
-```
-### ANTI: múltiplos eventos por request
-```ts
-// PT-BR: BAD — 5 eventos para 1 request, sem trace context
-log('user_action_started', { user_id })
-log('user_action_db_query', { user_id, query })
-log('user_action_email_sent', { user_id, to })
-log('user_action_completed', { user_id })
-log('user_action_response_sent', { user_id, status })
-// PT-BR: GOOD — 1 wide event acumulando contexto
-const span = tracer.startSpan('user_action')
-span.setAttribute('user.id', user_id)
-// ... ao longo do handler, span.setAttribute('email.recipient', ...) etc.
-span.end()  // 1 evento emitido com todos os atributos
-```
-### ANTI: cardinalidade baixa
-```ts
-// PT-BR: BAD — apenas service e endpoint, sem identidade
-span.setAttribute('service', 'orders')
-span.setAttribute('endpoint', '/place')
-// PT-BR: durante incident você não consegue responder "afeta quem?"
-// PT-BR: GOOD — adicione identidades de alta cardinalidade
-span.setAttribute('user.id', '550e8400-...')
-span.setAttribute('tenant_id', 'acme-corp')
-span.setAttribute('customer.tier', 'pro')
-// PT-BR: durante incident: "afeta quem?" → group by customer.tier, tenant_id
-```
-### ANTI: capturar valores internos de código
-```ts
-// PT-BR: BAD — atributos sobre estado de variáveis, não sobre business
-span.setAttribute('var_temp_array_length', tempArr.length)
-span.setAttribute('loop_iteration', i)
-// PT-BR: GOOD — atributos sobre business + identidade
-span.setAttribute('order.items_count', items.length)
-span.setAttribute('user.id', userId)
-```
-### ANTI: nomes inconsistentes de atributos
-```ts
-// PT-BR: BAD — mesmo conceito com nomes diferentes em handlers diferentes
-span.setAttribute('userId', user.id)        // handler A
-span.setAttribute('user_id', user.id)       // handler B
-span.setAttribute('user', user.id)          // handler C
-// PT-BR: query `WHERE user_id = X` falha em handler A; agg cross-handler quebra
-// PT-BR: GOOD — convenção única em todo o projeto
-span.setAttribute('user.id', user.id)       // sempre dot notation OTel
-```
-## Verificação
-Antes de marcar instrumentação completa:
-1. **1 evento por request** — em request de exemplo, contar eventos emitidos. Deve ser 1 (ou 2 se houver retry interno).
-2. **Atributos canônicos presentes** — checar `user.id`, `tenant_id`, `request.id`, `result.success`, `endpoint`, `duration_ms` no evento emitido.
-3. **Alta cardinalidade verificada** — `select count(distinct user_id)` deve crescer com tráfego real (não estagnar em N pequeno).
-4. **`result.success` define SLI** — boolean confiável para alimentar SLO downstream (ver skill `event-based-slos`).
-5. **Erros têm `error.type` enum** — não `error.message` cru. Permite group by por categoria.
-6. **Build_id presente** — permite comparar versão antes vs depois de deploy.
-7. **Smoke local** — emitir 100 eventos sintéticos, queryar via `select * from events where user_id = X` deve retornar todos.
----
-## Ver também
-- `kit/skills/_shared-observability/glossary.md` — termos canônicos, campos canônicos, anti-patterns
-- `kit/skills/distributed-tracing/SKILL.md` — como spans se conectam em traces
-- `kit/skills/opentelemetry-standard/SKILL.md` — SDK e exporters
-- `kit/skills/core-analysis-loop/SKILL.md` — como queryar wide events para debug
-*Material-fonte: Observability Engineering (O'Reilly, 2022) — Cap 5: "Structured Events Are the Building Blocks of Observability".*
+---
+name: structured-events
+description: Use ao instrumentar — wide events de alta cardinalidade (1/request), campos canônicos com dot notation, evite logs unstructured e métricas pre-aggregated.
+---
+# Observabilidade — Structured Events (Wide Events)
+## Quando usar
+LLM carrega esta skill quando instrumentar código para emitir telemetria. Trigger phrases:
+- "structured logging", "wide events", "observability events"
+- "instrumentar handler", "emitir telemetria", "log estruturado"
+- "como salvar evento de request"
+- "campos canônicos", "atributos de span"
+- "alta cardinalidade", "debug por user_id"
+## Regras absolutas
+- **1 evento por request** — não múltiplos. Acumule contexto durante o request, emita 1 wide event no final (ou em erros).
+- **Wide é melhor que narrow** — adicione campos liberalmente. Custo de 100 campos/evento ≈ 10 campos. Disco é barato; falta de campo no incidente é caro.
+- **Alta cardinalidade é OBRIGATÓRIA** — `user.id`, `tenant_id`, `request.id`, `customer.email`. Sem isso, observabilidade não funciona (Cap 1).
+- **Dot notation OTel** — `user.id` (não `userId` nem `user_id`). `error.type`, `http.status_code`, `db.query`. Snake_case apenas em colunas de DB.
+- **NUNCA pre-aggregate** — não emita "p99 latency = 247ms"; emita o `duration_ms` cru de cada request. Aggregation no read time.
+- **Estruturado, não texto livre** — JSON, OTel attributes, ou colunas tipadas. **Nunca** `console.log("user 123 did X at 12:34")`.
+- **Errors são especiais** — sample 100% de eventos com `result.success = false`. Sucesso pode ser samplado (skill `telemetry-sampling`).
+- **Capture context, não code** — emita atributos de business logic (`customer.tier`, `feature_flag.x`), não estado interno de código (`var_x_value_at_line_42`).
+## Patterns canônicos
+### Pattern: handler instrumentado (Node/TypeScript)
+```ts
+// PT-BR: 1 evento por request, alta cardinalidade, atributos canônicos
+import { trace, SpanStatusCode } from '@opentelemetry/api'
+const tracer = trace.getTracer('orders-service')
+export async function handlePlaceOrder(req: Request) {
+  return tracer.startActiveSpan('place_order', async (span) => {
+    // PT-BR: campos canônicos sempre — alta cardinalidade
+    span.setAttribute('user.id', req.user.id)
+    span.setAttribute('tenant_id', req.user.tenant)
+    span.setAttribute('customer.tier', req.user.tier)
+    span.setAttribute('request.id', req.headers['x-request-id'])
+    span.setAttribute('endpoint', '/api/v1/orders')
+    span.setAttribute('http.method', 'POST')
+    span.setAttribute('build_id', process.env.BUILD_ID ?? 'dev')
+    // PT-BR: feature flags como dimensões
+    span.setAttribute('feature_flag.new_pricing', req.flags.newPricing)
+    try {
+      const order = await createOrder(req.body)
+      // PT-BR: result e atributos de domínio
+      span.setAttribute('result.success', true)
+      span.setAttribute('order.id', order.id)
+      span.setAttribute('order.amount_cents', order.amount)
+      span.setAttribute('order.items_count', order.items.length)
+      span.setAttribute('http.status_code', 200)
+      span.setStatus({ code: SpanStatusCode.OK })
+      return order
+    } catch (e) {
+      // PT-BR: erros — sample 100%, classificar por tipo
+      span.setAttribute('result.success', false)
+      span.setAttribute('error.type', classifyError(e))
+      span.setAttribute('error.message', e.message)
+      span.setAttribute('http.status_code', e.statusCode ?? 500)
+      span.setStatus({ code: SpanStatusCode.ERROR, message: e.message })
+      throw e
+    } finally {
+      span.end()  // PT-BR: SEMPRE — duration_ms é calculado aqui
+    }
+  })
+}
+function classifyError(e: any): string {
+  if (e.code === 'P2002') return 'db_conflict'
+  if (e.statusCode === 401) return 'auth'
+  if (e.statusCode === 403) return 'authz'
+  if (e.statusCode === 422) return 'validation'
+  if (e.statusCode === 429) return 'rate_limit'
+  if (e.code === 'ETIMEDOUT') return 'timeout'
+  return 'unknown'
+}
+```
+### Pattern: Edge Function (Deno) com structured event
+```ts
+// PT-BR: Supabase Edge Function — 1 evento estruturado por invocação
+import { trace } from 'npm:@opentelemetry/api@1.9.0'
+const tracer = trace.getTracer('edge-process-emails')
+Deno.serve(async (req) => {
+  return tracer.startActiveSpan('process_emails', async (span) => {
+    const requestId = crypto.randomUUID()
+    span.setAttribute('request.id', requestId)
+    span.setAttribute('build_id', Deno.env.get('SUPABASE_GIT_SHA') ?? 'local')
+    try {
+      const body = await req.json()
+      span.setAttribute('user.id', body.user_id)
+      span.setAttribute('tenant_id', body.tenant_id)
+      span.setAttribute('email.batch_size', body.emails?.length ?? 0)
+      const result = await processBatch(body.emails)
+      span.setAttribute('result.success', true)
+      span.setAttribute('email.sent_count', result.sent)
+      span.setAttribute('email.failed_count', result.failed)
+      span.setAttribute('duration_ms', result.duration)
+      return new Response(JSON.stringify(result), { status: 200 })
+    } catch (e) {
+      span.setAttribute('result.success', false)
+      span.setAttribute('error.type', classify(e))
+      span.setAttribute('error.message', String(e))
+      return new Response(JSON.stringify({ error: 'failed' }), { status: 500 })
+    } finally {
+      span.end()
+    }
+  })
+})
+```
+### Pattern: campos canônicos por categoria
+| Categoria | Campos | Exemplo |
+|---|---|---|
+| **Identidade** | `user.id`, `tenant_id`, `session.id` | `"550e8400-e29b-41d4-..."` |
+| **Request** | `request.id`, `endpoint`, `http.method`, `http.status_code` | `"req_abc123"`, `"/api/v1/orders"`, `"POST"`, `200` |
+| **Resultado** | `result.success`, `error.type`, `error.message` | `true`, `"validation"`, `"email already exists"` |
+| **Performance** | `duration_ms`, `db.query_count`, `cache.hit` | `127`, `3`, `true` |
+| **Build/Deploy** | `build_id`, `service.version`, `region` | `"abc123f"`, `"v1.9.0"`, `"us-east-1"` |
+| **Business** | `customer.tier`, `order.amount_cents`, `feature_flag.<name>` | `"pro"`, `4990`, `true` |
+| **Tracing** | `trace.id`, `span.id`, `span.parent_id` | (auto via OTel) |
+### Pattern: query observability — encontrar pattern em wide events
+```sql
+-- PT-BR: alta cardinalidade permite group by ad hoc — sem schema rigido
+-- Exemplo: qual tenant + endpoint + error_type domina os erros da última hora?
+select
+  tenant_id,
+  endpoint,
+  error_type,
+  count(*) as error_count,
+  avg(duration_ms) as avg_duration
+from observability.events
+where
+  result_success = false
+  and timestamp > now() - interval '1 hour'
+group by tenant_id, endpoint, error_type
+order by error_count desc
+limit 20;
+```
+## Anti-patterns
+### ANTI: log unstructured
+```ts
+// PT-BR: BAD — não estruturado, não queryable, sem alta cardinalidade
+console.log(`User ${userId} placed order ${orderId} for $${amount}`)
+// PT-BR: GOOD — structured wide event
+span.setAttribute('user.id', userId)
+span.setAttribute('order.id', orderId)
+span.setAttribute('order.amount_cents', amount * 100)
+```
+### ANTI: pre-aggregate em métricas
+```ts
+// PT-BR: BAD — pre-aggregation perde alta cardinalidade
+metrics.histogram('order_latency_ms').record(duration, { service: 'orders' })
+// PT-BR: GOOD — emit raw event, agregue no read
+span.setAttribute('duration_ms', duration)
+// PT-BR: ao queryar: SELECT percentile_cont(0.99) WITHIN GROUP (ORDER BY duration_ms)
+//        FROM events GROUP BY tenant_id, endpoint  -- alta cardinalidade preservada!
+```
+### ANTI: múltiplos eventos por request
+```ts
+// PT-BR: BAD — 5 eventos para 1 request, sem trace context
+log('user_action_started', { user_id })
+log('user_action_db_query', { user_id, query })
+log('user_action_email_sent', { user_id, to })
+log('user_action_completed', { user_id })
+log('user_action_response_sent', { user_id, status })
+// PT-BR: GOOD — 1 wide event acumulando contexto
+const span = tracer.startSpan('user_action')
+span.setAttribute('user.id', user_id)
+// ... ao longo do handler, span.setAttribute('email.recipient', ...) etc.
+span.end()  // 1 evento emitido com todos os atributos
+```
+### ANTI: cardinalidade baixa
+```ts
+// PT-BR: BAD — apenas service e endpoint, sem identidade
+span.setAttribute('service', 'orders')
+span.setAttribute('endpoint', '/place')
+// PT-BR: durante incident você não consegue responder "afeta quem?"
+// PT-BR: GOOD — adicione identidades de alta cardinalidade
+span.setAttribute('user.id', '550e8400-...')
+span.setAttribute('tenant_id', 'acme-corp')
+span.setAttribute('customer.tier', 'pro')
+// PT-BR: durante incident: "afeta quem?" → group by customer.tier, tenant_id
+```
+### ANTI: capturar valores internos de código
+```ts
+// PT-BR: BAD — atributos sobre estado de variáveis, não sobre business
+span.setAttribute('var_temp_array_length', tempArr.length)
+span.setAttribute('loop_iteration', i)
+// PT-BR: GOOD — atributos sobre business + identidade
+span.setAttribute('order.items_count', items.length)
+span.setAttribute('user.id', userId)
+```
+### ANTI: nomes inconsistentes de atributos
+```ts
+// PT-BR: BAD — mesmo conceito com nomes diferentes em handlers diferentes
+span.setAttribute('userId', user.id)        // handler A
+span.setAttribute('user_id', user.id)       // handler B
+span.setAttribute('user', user.id)          // handler C
+// PT-BR: query `WHERE user_id = X` falha em handler A; agg cross-handler quebra
+// PT-BR: GOOD — convenção única em todo o projeto
+span.setAttribute('user.id', user.id)       // sempre dot notation OTel
+```
+## Verificação
+Antes de marcar instrumentação completa:
+1. **1 evento por request** — em request de exemplo, contar eventos emitidos. Deve ser 1 (ou 2 se houver retry interno).
+2. **Atributos canônicos presentes** — checar `user.id`, `tenant_id`, `request.id`, `result.success`, `endpoint`, `duration_ms` no evento emitido.
+3. **Alta cardinalidade verificada** — `select count(distinct user_id)` deve crescer com tráfego real (não estagnar em N pequeno).
+4. **`result.success` define SLI** — boolean confiável para alimentar SLO downstream (ver skill `event-based-slos`).
+5. **Erros têm `error.type` enum** — não `error.message` cru. Permite group by por categoria.
+6. **Build_id presente** — permite comparar versão antes vs depois de deploy.
+7. **Smoke local** — emitir 100 eventos sintéticos, queryar via `select * from events where user_id = X` deve retornar todos.
+---
+## Ver também
+- `kit/skills/_shared-observability/glossary.md` — termos canônicos, campos canônicos, anti-patterns
+- `kit/skills/distributed-tracing/SKILL.md` — como spans se conectam em traces
+- `kit/skills/opentelemetry-standard/SKILL.md` — SDK e exporters
+- `kit/skills/core-analysis-loop/SKILL.md` — como queryar wide events para debug
+*Material-fonte: Observability Engineering (O'Reilly, 2022) — Cap 5: "Structured Events Are the Building Blocks of Observability".*