tribunal-kit 2.4.6 → 3.1.0 (npm)

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (250)

package/.agent/ARCHITECTURE.md +99 -99
package/.agent/GEMINI.md +52 -52
package/.agent/agents/accessibility-reviewer.md +139 -86
package/.agent/agents/ai-code-reviewer.md +160 -90
package/.agent/agents/backend-specialist.md +164 -127
package/.agent/agents/code-archaeologist.md +115 -73
package/.agent/agents/database-architect.md +130 -110
package/.agent/agents/debugger.md +137 -97
package/.agent/agents/dependency-reviewer.md +78 -30
package/.agent/agents/devops-engineer.md +161 -118
package/.agent/agents/documentation-writer.md +151 -87
package/.agent/agents/explorer-agent.md +117 -99
package/.agent/agents/frontend-reviewer.md +127 -47
package/.agent/agents/frontend-specialist.md +169 -109
package/.agent/agents/game-developer.md +28 -164
package/.agent/agents/logic-reviewer.md +87 -49
package/.agent/agents/mobile-developer.md +151 -103
package/.agent/agents/mobile-reviewer.md +133 -50
package/.agent/agents/orchestrator.md +121 -110
package/.agent/agents/penetration-tester.md +103 -77
package/.agent/agents/performance-optimizer.md +136 -92
package/.agent/agents/performance-reviewer.md +139 -69
package/.agent/agents/product-manager.md +104 -70
package/.agent/agents/product-owner.md +6 -25
package/.agent/agents/project-planner.md +95 -95
package/.agent/agents/qa-automation-engineer.md +174 -87
package/.agent/agents/security-auditor.md +133 -129
package/.agent/agents/seo-specialist.md +160 -99
package/.agent/agents/sql-reviewer.md +132 -44
package/.agent/agents/supervisor-agent.md +137 -109
package/.agent/agents/swarm-worker-contracts.md +17 -17
package/.agent/agents/swarm-worker-registry.md +46 -46
package/.agent/agents/test-coverage-reviewer.md +132 -53
package/.agent/agents/test-engineer.md +0 -21
package/.agent/agents/type-safety-reviewer.md +143 -33
package/.agent/patterns/generator.md +9 -9
package/.agent/patterns/inversion.md +12 -12
package/.agent/patterns/pipeline.md +9 -9
package/.agent/patterns/reviewer.md +13 -13
package/.agent/patterns/tool-wrapper.md +9 -9
package/.agent/rules/GEMINI.md +63 -63
package/.agent/scripts/__pycache__/auto_preview.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/bundle_analyzer.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/checklist.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/dependency_analyzer.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/security_scan.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/session_manager.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/skill_integrator.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/swarm_dispatcher.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/test_runner.cpython-311.pyc +0 -0
package/.agent/scripts/__pycache__/verify_all.cpython-311.pyc +0 -0
package/.agent/scripts/compress_skills.py +167 -0
package/.agent/scripts/consolidate_skills.py +173 -0
package/.agent/scripts/deep_compress.py +202 -0
package/.agent/scripts/minify_context.py +80 -0
package/.agent/scripts/security_scan.py +1 -1
package/.agent/scripts/strip_tribunal.py +41 -0
package/.agent/skills/agent-organizer/SKILL.md +60 -100
package/.agent/skills/agentic-patterns/SKILL.md +0 -70
package/.agent/skills/ai-prompt-injection-defense/SKILL.md +108 -53
package/.agent/skills/api-patterns/SKILL.md +197 -257
package/.agent/skills/api-security-auditor/SKILL.md +125 -57
package/.agent/skills/app-builder/SKILL.md +326 -50
package/.agent/skills/app-builder/templates/SKILL.md +13 -15
package/.agent/skills/app-builder/templates/astro-static/TEMPLATE.md +16 -16
package/.agent/skills/app-builder/templates/chrome-extension/TEMPLATE.md +22 -22
package/.agent/skills/app-builder/templates/cli-tool/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/electron-desktop/TEMPLATE.md +20 -20
package/.agent/skills/app-builder/templates/express-api/TEMPLATE.md +17 -17
package/.agent/skills/app-builder/templates/flutter-app/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/monorepo-turborepo/TEMPLATE.md +21 -21
package/.agent/skills/app-builder/templates/nextjs-fullstack/TEMPLATE.md +19 -19
package/.agent/skills/app-builder/templates/nextjs-saas/TEMPLATE.md +26 -26
package/.agent/skills/app-builder/templates/nextjs-static/TEMPLATE.md +26 -26
package/.agent/skills/app-builder/templates/nuxt-app/TEMPLATE.md +19 -19
package/.agent/skills/app-builder/templates/python-fastapi/TEMPLATE.md +18 -18
package/.agent/skills/app-builder/templates/react-native-app/TEMPLATE.md +20 -20
package/.agent/skills/appflow-wireframe/SKILL.md +71 -98
package/.agent/skills/architecture/SKILL.md +161 -200
package/.agent/skills/authentication-best-practices/SKILL.md +121 -54
package/.agent/skills/bash-linux/SKILL.md +71 -166
package/.agent/skills/behavioral-modes/SKILL.md +8 -69
package/.agent/skills/brainstorming/SKILL.md +345 -127
package/.agent/skills/building-native-ui/SKILL.md +125 -57
package/.agent/skills/clean-code/SKILL.md +266 -149
package/.agent/skills/code-review-checklist/SKILL.md +0 -62
package/.agent/skills/config-validator/SKILL.md +73 -131
package/.agent/skills/csharp-developer/SKILL.md +434 -73
package/.agent/skills/database-design/SKILL.md +190 -275
package/.agent/skills/deployment-procedures/SKILL.md +81 -158
package/.agent/skills/devops-engineer/SKILL.md +255 -94
package/.agent/skills/devops-incident-responder/SKILL.md +50 -69
package/.agent/skills/doc.md +5 -5
package/.agent/skills/documentation-templates/SKILL.md +19 -63
package/.agent/skills/edge-computing/SKILL.md +75 -165
package/.agent/skills/extract-design-system/SKILL.md +84 -58
package/.agent/skills/framer-motion-expert/SKILL.md +195 -0
package/.agent/skills/frontend-design/SKILL.md +151 -499
package/.agent/skills/game-design-expert/SKILL.md +71 -0
package/.agent/skills/game-engineering-expert/SKILL.md +88 -0
package/.agent/skills/geo-fundamentals/SKILL.md +52 -178
package/.agent/skills/github-operations/SKILL.md +197 -272
package/.agent/skills/gsap-expert/SKILL.md +194 -0
package/.agent/skills/i18n-localization/SKILL.md +60 -172
package/.agent/skills/intelligent-routing/SKILL.md +123 -103
package/.agent/skills/lint-and-validate/SKILL.md +8 -52
package/.agent/skills/llm-engineering/SKILL.md +281 -195
package/.agent/skills/local-first/SKILL.md +76 -159
package/.agent/skills/mcp-builder/SKILL.md +48 -188
package/.agent/skills/mobile-design/SKILL.md +213 -219
package/.agent/skills/motion-engineering/SKILL.md +184 -0
package/.agent/skills/nextjs-react-expert/SKILL.md +184 -203
package/.agent/skills/nodejs-best-practices/SKILL.md +403 -185
package/.agent/skills/observability/SKILL.md +211 -203
package/.agent/skills/parallel-agents/SKILL.md +53 -146
package/.agent/skills/performance-profiling/SKILL.md +171 -151
package/.agent/skills/plan-writing/SKILL.md +49 -153
package/.agent/skills/platform-engineer/SKILL.md +57 -103
package/.agent/skills/playwright-best-practices/SKILL.md +110 -63
package/.agent/skills/powershell-windows/SKILL.md +61 -179
package/.agent/skills/python-patterns/SKILL.md +7 -35
package/.agent/skills/python-pro/SKILL.md +273 -114
package/.agent/skills/react-specialist/SKILL.md +227 -108
package/.agent/skills/readme-builder/SKILL.md +15 -85
package/.agent/skills/realtime-patterns/SKILL.md +216 -243
package/.agent/skills/red-team-tactics/SKILL.md +10 -51
package/.agent/skills/rust-pro/SKILL.md +525 -142
package/.agent/skills/seo-fundamentals/SKILL.md +92 -153
package/.agent/skills/server-management/SKILL.md +110 -166
package/.agent/skills/shadcn-ui-expert/SKILL.md +154 -55
package/.agent/skills/skill-creator/SKILL.md +18 -58
package/.agent/skills/sql-pro/SKILL.md +543 -68
package/.agent/skills/supabase-postgres-best-practices/SKILL.md +28 -68
package/.agent/skills/swiftui-expert/SKILL.md +124 -57
package/.agent/skills/systematic-debugging/SKILL.md +49 -151
package/.agent/skills/tailwind-patterns/SKILL.md +433 -149
package/.agent/skills/tdd-workflow/SKILL.md +63 -169
package/.agent/skills/test-result-analyzer/SKILL.md +33 -73
package/.agent/skills/testing-patterns/SKILL.md +437 -130
package/.agent/skills/trend-researcher/SKILL.md +30 -71
package/.agent/skills/ui-ux-pro-max/SKILL.md +0 -41
package/.agent/skills/ui-ux-researcher/SKILL.md +51 -91
package/.agent/skills/vue-expert/SKILL.md +225 -119
package/.agent/skills/vulnerability-scanner/SKILL.md +264 -226
package/.agent/skills/web-accessibility-auditor/SKILL.md +141 -58
package/.agent/skills/web-design-guidelines/SKILL.md +17 -61
package/.agent/skills/webapp-testing/SKILL.md +71 -196
package/.agent/skills/whimsy-injector/SKILL.md +58 -132
package/.agent/skills/workflow-optimizer/SKILL.md +28 -68
package/.agent/workflows/api-tester.md +96 -224
package/.agent/workflows/audit.md +81 -122
package/.agent/workflows/brainstorm.md +69 -105
package/.agent/workflows/changelog.md +65 -97
package/.agent/workflows/create.md +73 -88
package/.agent/workflows/debug.md +80 -111
package/.agent/workflows/deploy.md +119 -92
package/.agent/workflows/enhance.md +80 -91
package/.agent/workflows/fix.md +68 -97
package/.agent/workflows/generate.md +165 -164
package/.agent/workflows/migrate.md +106 -109
package/.agent/workflows/orchestrate.md +103 -86
package/.agent/workflows/performance-benchmarker.md +77 -268
package/.agent/workflows/plan.md +120 -98
package/.agent/workflows/preview.md +39 -96
package/.agent/workflows/refactor.md +105 -97
package/.agent/workflows/review-ai.md +63 -102
package/.agent/workflows/review.md +71 -110
package/.agent/workflows/session.md +53 -113
package/.agent/workflows/status.md +42 -88
package/.agent/workflows/strengthen-skills.md +90 -51
package/.agent/workflows/swarm.md +114 -129
package/.agent/workflows/test.md +125 -102
package/.agent/workflows/tribunal-backend.md +60 -78
package/.agent/workflows/tribunal-database.md +62 -100
package/.agent/workflows/tribunal-frontend.md +62 -82
package/.agent/workflows/tribunal-full.md +56 -100
package/.agent/workflows/tribunal-mobile.md +65 -94
package/.agent/workflows/tribunal-performance.md +62 -105
package/.agent/workflows/ui-ux-pro-max.md +72 -121
package/README.md +11 -15
package/package.json +1 -1
package/.agent/skills/api-patterns/api-style.md +0 -42
package/.agent/skills/api-patterns/auth.md +0 -24
package/.agent/skills/api-patterns/documentation.md +0 -26
package/.agent/skills/api-patterns/graphql.md +0 -41
package/.agent/skills/api-patterns/rate-limiting.md +0 -31
package/.agent/skills/api-patterns/response.md +0 -37
package/.agent/skills/api-patterns/rest.md +0 -40
package/.agent/skills/api-patterns/security-testing.md +0 -122
package/.agent/skills/api-patterns/trpc.md +0 -41
package/.agent/skills/api-patterns/versioning.md +0 -22
package/.agent/skills/app-builder/agent-coordination.md +0 -71
package/.agent/skills/app-builder/feature-building.md +0 -53
package/.agent/skills/app-builder/project-detection.md +0 -34
package/.agent/skills/app-builder/scaffolding.md +0 -118
package/.agent/skills/app-builder/tech-stack.md +0 -40
package/.agent/skills/architecture/context-discovery.md +0 -43
package/.agent/skills/architecture/examples.md +0 -94
package/.agent/skills/architecture/pattern-selection.md +0 -68
package/.agent/skills/architecture/patterns-reference.md +0 -50
package/.agent/skills/architecture/trade-off-analysis.md +0 -77
package/.agent/skills/brainstorming/dynamic-questioning.md +0 -360
package/.agent/skills/database-design/database-selection.md +0 -43
package/.agent/skills/database-design/indexing.md +0 -39
package/.agent/skills/database-design/migrations.md +0 -48
package/.agent/skills/database-design/optimization.md +0 -36
package/.agent/skills/database-design/orm-selection.md +0 -30
package/.agent/skills/database-design/schema-design.md +0 -56
package/.agent/skills/dotnet-core-expert/SKILL.md +0 -103
package/.agent/skills/framer-motion-animations/SKILL.md +0 -74
package/.agent/skills/frontend-design/animation-guide.md +0 -331
package/.agent/skills/frontend-design/color-system.md +0 -329
package/.agent/skills/frontend-design/decision-trees.md +0 -418
package/.agent/skills/frontend-design/motion-graphics.md +0 -306
package/.agent/skills/frontend-design/typography-system.md +0 -363
package/.agent/skills/frontend-design/ux-psychology.md +0 -1116
package/.agent/skills/frontend-design/visual-effects.md +0 -383
package/.agent/skills/game-development/2d-games/SKILL.md +0 -119
package/.agent/skills/game-development/3d-games/SKILL.md +0 -135
package/.agent/skills/game-development/SKILL.md +0 -236
package/.agent/skills/game-development/game-art/SKILL.md +0 -185
package/.agent/skills/game-development/game-audio/SKILL.md +0 -190
package/.agent/skills/game-development/game-design/SKILL.md +0 -129
package/.agent/skills/game-development/mobile-games/SKILL.md +0 -108
package/.agent/skills/game-development/multiplayer/SKILL.md +0 -132
package/.agent/skills/game-development/pc-games/SKILL.md +0 -144
package/.agent/skills/game-development/vr-ar/SKILL.md +0 -123
package/.agent/skills/game-development/web-games/SKILL.md +0 -150
package/.agent/skills/intelligent-routing/router-manifest.md +0 -65
package/.agent/skills/mobile-design/decision-trees.md +0 -516
package/.agent/skills/mobile-design/mobile-backend.md +0 -491
package/.agent/skills/mobile-design/mobile-color-system.md +0 -420
package/.agent/skills/mobile-design/mobile-debugging.md +0 -122
package/.agent/skills/mobile-design/mobile-design-thinking.md +0 -357
package/.agent/skills/mobile-design/mobile-navigation.md +0 -458
package/.agent/skills/mobile-design/mobile-performance.md +0 -767
package/.agent/skills/mobile-design/mobile-testing.md +0 -356
package/.agent/skills/mobile-design/mobile-typography.md +0 -433
package/.agent/skills/mobile-design/platform-android.md +0 -666
package/.agent/skills/mobile-design/platform-ios.md +0 -561
package/.agent/skills/mobile-design/touch-psychology.md +0 -537
package/.agent/skills/nextjs-react-expert/1-async-eliminating-waterfalls.md +0 -312
package/.agent/skills/nextjs-react-expert/2-bundle-bundle-size-optimization.md +0 -240
package/.agent/skills/nextjs-react-expert/3-server-server-side-performance.md +0 -490
package/.agent/skills/nextjs-react-expert/4-client-client-side-data-fetching.md +0 -264
package/.agent/skills/nextjs-react-expert/5-rerender-re-render-optimization.md +0 -581
package/.agent/skills/nextjs-react-expert/6-rendering-rendering-performance.md +0 -432
package/.agent/skills/nextjs-react-expert/7-js-javascript-performance.md +0 -684
package/.agent/skills/nextjs-react-expert/8-advanced-advanced-patterns.md +0 -150
package/.agent/skills/vulnerability-scanner/checklists.md +0 -121

package/.agent/skills/observability/SKILL.md CHANGED

@@ -1,285 +1,293 @@
 ---
 name: observability
-description: Production observability principles. OpenTelemetry traces, structured logs, metrics, SLOs/SLIs/error budgets, and AI observability. Use when setting up monitoring, debugging production issues, or designing observable distributed systems.
+description: Production observability mastery. Structured logging (Pino/Winston), OpenTelemetry tracing, metrics (Prometheus/Grafana), SLIs/SLOs/error budgets, distributed tracing, alerting design, health checks, and AI observability. Use when setting up monitoring, debugging production issues, or designing observable distributed systems.
 allowed-tools: Read, Write, Edit, Glob, Grep
-version: 1.0.0
-last-updated: 2026-03-12
+version: 2.0.0
+last-updated: 2026-04-01
 applies-to-model: gemini-2.5-pro, claude-3-7-sonnet
 ---
-# Observability Principles
-> Monitoring tells you when something is broken.
-> Observability tells you why.
+# Observability — Production Monitoring Mastery
 ---
 ## The Three Pillars
 ```
-TRACES    → The journey of a single request across services
-            "Why was THIS request slow?"
-LOGS      → Discrete events with context
-            "What exactly happened at 14:23:07?"
+Logs    → WHAT happened (structured events)
+Traces  → WHERE it happened (request flow across services)
+Metrics → HOW MUCH is happening (counters, histograms, gauges)
-METRICS   → Aggregated measurements over time
-            "What is our error rate over the last hour?"
+All three are needed. Logs alone are not observability.
 ```
-Use all three. They answer different questions. None replaces the others.
 ---
-## OpenTelemetry: The Standard
-OpenTelemetry (OTel) is the vendor-neutral standard for instrumentation. Use it and you can swap backends (Jaeger, Grafana Tempo, Honeycomb, Datadog) without changing application code.
+## Structured Logging
-```ts
-// src/instrumentation.ts — initialize OTel once, before app code
-import { NodeSDK } from '@opentelemetry/sdk-node';
-import { OTLPTraceExporter } from '@opentelemetry/exporter-trace-otlp-http';
-import { Resource } from '@opentelemetry/resources';
-import { SemanticResourceAttributes } from '@opentelemetry/semantic-conventions';
+```typescript
+import pino from "pino";
-const sdk = new NodeSDK({
-  resource: new Resource({
-    [SemanticResourceAttributes.SERVICE_NAME]: 'my-api',
-    [SemanticResourceAttributes.SERVICE_VERSION]: '1.0.0',
-  }),
-  traceExporter: new OTLPTraceExporter({
-    url: process.env.OTEL_EXPORTER_OTLP_ENDPOINT,
+// ✅ Structured JSON logging
+const logger = pino({
+  level: process.env.LOG_LEVEL ?? "info",
+  timestamp: pino.stdTimeFunctions.isoTime,
+  ...(process.env.NODE_ENV === "development" && {
+    transport: { target: "pino-pretty" },
   }),
 });
-sdk.start();
-process.on('SIGTERM', () => sdk.shutdown());
+// ✅ GOOD: Structured with context
+logger.info({ userId: user.id, action: "login", ip: req.ip }, "User logged in");
+logger.error({ err, orderId: order.id, paymentGateway: "stripe" }, "Payment failed");
+logger.warn({ queueDepth: 1500, threshold: 1000 }, "Queue depth exceeding threshold");
+// ❌ BAD: Unstructured string logging
+console.log("User " + user.id + " logged in from " + req.ip);
+console.log("Error: " + error.message);
+// ❌ HALLUCINATION TRAP: console.log is NOT production logging
+// - No severity levels (info/warn/error)
+// - No structured fields (can't search/filter)
+// - No timestamps in ISO format
+// - Can't be collected by log aggregators
+// ✅ Use Pino (Node.js) or structlog (Python) for production
 ```
----
-## Distributed Tracing
+### Log Levels
-Traces connect the dots across microservice boundaries:
+```
+fatal → App is crashing, immediate attention required
+error → Operation failed, needs investigation
+warn  → Something unexpected, but app continues
+info  → Business events (user login, order placed, deploy)
+debug → Technical details (query timing, cache hit/miss)
+trace → Verbose debugging (only in development)
+Rules:
+- Production default: info
+- Never log PII (names, emails, SSNs) at any level
+- Never log secrets (tokens, passwords, API keys)
+- Log request IDs for correlation
+- Log durations for performance tracking
+```
-```ts
-import { trace, context, SpanStatusCode } from '@opentelemetry/api';
+### Request Context / Correlation
-const tracer = trace.getTracer('payment-service');
+```typescript
+import { AsyncLocalStorage } from "node:async_hooks";
-async function processPayment(orderId: string, amount: number) {
-  return tracer.startActiveSpan('payment.process', async (span) => {
-    try {
-      // Add business context to the span
-      span.setAttributes({
-        'order.id': orderId,
-        'payment.amount': amount,
-        'payment.currency': 'USD',
-      });
+const requestContext = new AsyncLocalStorage<{ requestId: string; userId?: string }>();
-      const result = await chargeCard(orderId, amount);
+// Middleware: set context per request
+app.use((req, res, next) => {
+  const requestId = req.headers["x-request-id"]?.toString() ?? crypto.randomUUID();
+  res.setHeader("x-request-id", requestId);
+  requestContext.run({ requestId, userId: req.user?.id }, next);
+});
-      span.setStatus({ code: SpanStatusCode.OK });
-      return result;
-    } catch (err) {
-      // Record the error with full context
-      span.recordException(err as Error);
-      span.setStatus({ code: SpanStatusCode.ERROR, message: (err as Error).message });
-      throw err;
-    } finally {
-      span.end();
-    }
+// Child logger with context
+function getLogger() {
+  const ctx = requestContext.getStore();
+  return logger.child({
+    requestId: ctx?.requestId,
+    userId: ctx?.userId,
   });
 }
+// Every log from this request includes requestId and userId
+const log = getLogger();
+log.info("Processing order");  // { requestId: "abc-123", userId: "42", msg: "Processing order" }
 ```
 ---
-## Structured Logging
-Logs must be machine-parseable:
+## Distributed Tracing (OpenTelemetry)
-```ts
-// ❌ Unstructured — impossible to query, filter, or alert on
-console.log(`User ${userId} failed to login at ${new Date()}`);
+```typescript
+import { NodeSDK } from "@opentelemetry/sdk-node";
+import { getNodeAutoInstrumentations } from "@opentelemetry/auto-instrumentations-node";
+import { OTLPTraceExporter } from "@opentelemetry/exporter-trace-otlp-http";
-// ✅ Structured — every field is queryable
-logger.warn({
-  event: 'auth.login_failed',
-  userId,
-  reason: 'invalid_password',
-  attemptCount: 3,
-  ip: req.ip,
-  timestamp: new Date().toISOString(),
+// Initialize OpenTelemetry
+const sdk = new NodeSDK({
+  traceExporter: new OTLPTraceExporter({
+    url: process.env.OTEL_EXPORTER_OTLP_ENDPOINT ?? "http://localhost:4318/v1/traces",
+  }),
+  instrumentations: [
+    getNodeAutoInstrumentations({
+      "@opentelemetry/instrumentation-http": { enabled: true },
+      "@opentelemetry/instrumentation-express": { enabled: true },
+      "@opentelemetry/instrumentation-pg": { enabled: true },
+      "@opentelemetry/instrumentation-redis": { enabled: true },
+    }),
+  ],
 });
-```
-### What to Always Log
-| Always | Never |
-|---|---|
-| Request ID / trace ID | Passwords or password hashes |
-| User ID (not PII) | Credit card numbers |
-| Error type + message | API keys or tokens |
-| Duration (ms) | Full request bodies (may contain PII) |
-| HTTP status code | |
----
-## Metrics: What to Measure
-The four golden signals (Google SRE):
+sdk.start();
-```
-1. LATENCY       — How long does serving a request take?
-                   Track p50, p95, p99 — not just average
-                   Average hides the worst-case user experience
+// Manual span for custom business logic
+import { trace, SpanStatusCode } from "@opentelemetry/api";
-2. TRAFFIC       — How much demand is there?
-                   requests/sec, messages/sec, bytes/sec
+const tracer = trace.getTracer("order-service");
-3. ERRORS        — What fraction of requests are failing?
-                   HTTP 5xx rate, exception rate, timeout rate
+async function processOrder(order: Order) {
+  return tracer.startActiveSpan("processOrder", async (span) => {
+    try {
+      span.setAttribute("order.id", order.id);
+      span.setAttribute("order.total", order.total);
+      span.setAttribute("order.items.count", order.items.length);
-4. SATURATION    — How "full" is your service?
-                   CPU %, memory %, queue depth
+      const result = await executeOrder(order);
+      span.setStatus({ code: SpanStatusCode.OK });
+      return result;
+    } catch (error) {
+      span.setStatus({ code: SpanStatusCode.ERROR, message: (error as Error).message });
+      span.recordException(error as Error);
+      throw error;
+    } finally {
+      span.end();
+    }
+  });
+}
 ```
 ---
-## SLOs / SLIs / Error Budgets
+## Metrics
-The framework that connects technical work to business reliability:
-```
-SLI (Service Level Indicator) — a specific, measurable signal:
-  "HTTP 200 responses as % of all responses to /api/checkout"
+```typescript
+import { metrics } from "@opentelemetry/api";
-SLO (Service Level Objective) — your reliability promise:
-  "99.9% of checkout requests succeed over a 30-day window"
+const meter = metrics.getMeter("api-server");
-Error Budget — how much unreliability you can afford:
-  "30 days × 0.1% error tolerance = 43.2 minutes of downtime allowed"
+// Counter — things that only go up
+const requestCounter = meter.createCounter("http.requests.total", {
+  description: "Total HTTP requests",
+});
-Error Budget Policy:
-  Budget healthy  → ship new features freely
-  Budget depleted → freeze releases, focus only on reliability
-```
+// Histogram — request durations
+const requestDuration = meter.createHistogram("http.request.duration_ms", {
+  description: "HTTP request duration in milliseconds",
+  unit: "ms",
+});
----
+// UpDownCounter: a current value that can rise and fall (gauge-like)
+const activeConnections = meter.createUpDownCounter("db.connections.active", {
+  description: "Active database connections",
+});
-## AI Observability
-Standard metrics don't cover AI systems. Add these:
-```ts
-// Track every AI call with these dimensions
-logger.info({
-  event: 'ai.completion',
-  model: 'gpt-4o',
-  prompt_tokens: response.usage.prompt_tokens,
-  completion_tokens: response.usage.completion_tokens,
-  total_tokens: response.usage.total_tokens,
-  latency_ms: duration,
-  cost_usd: calculateCost(model, usage),
-  trace_id: currentTraceId(),
-  // Eval scores (from async evaluation pipeline)
-  eval_faithfulness: 0.92,    // Did output match sources?
-  eval_relevance: 0.88,       // Did output answer the question?
+// Middleware to record metrics
+app.use((req, res, next) => {
+  const start = performance.now();
+  res.on("finish", () => {
+    const duration = performance.now() - start;
+    requestCounter.add(1, {
+      method: req.method,
+      path: req.route?.path ?? req.path,
+      status: res.statusCode.toString(),
+    });
+    requestDuration.record(duration, {
+      method: req.method,
+      status: res.statusCode.toString(),
+    });
+  });
+  next();
 });
 ```
-### AI-Specific Alerts
+### Key Metrics to Track
 ```
-🚨 TOKEN COST SPIKE     → cost per request > 2x trailing average → alert
-🚨 LATENCY DEGRADATION  → p95 LLM latency > 5s → alert
-🚨 EVAL SCORE DECLINE   → faithfulness drops below 0.8 (model drift?) → alert
-🚨 ERROR RATE SPIKE     → 429s or context_length errors > 5% → alert
+RED method (for services):
+  Rate     → requests per second
+  Errors   → error rate (4xx, 5xx)
+  Duration → latency percentiles (P50, P95, P99)
+USE method (for resources):
+  Utilization → CPU %, memory %, disk %
+  Saturation  → queue depth, thread pool saturation
+  Errors      → disk failures, OOM kills
+Business metrics:
+  - Sign-ups per hour
+  - Orders processed per minute
+  - Revenue per day
+  - API calls per customer
 ```
 ---
-## Output Format
+## SLIs, SLOs & Error Budgets
-When this skill produces a recommendation or design decision, structure your output as:
-```
-━━━ Observability Recommendation ━━━━━━━━━━━━━━━━
-Decision:    [what was chosen / proposed]
-Rationale:   [why — one concise line]
-Trade-offs:  [what is consciously accepted]
-Next action: [concrete next step for the user]
-─────────────────────────────────────────────────
-Pre-Flight:  ✅ All checks passed
-             or ❌ [blocking item that must be resolved first]
 ```
+SLI (Service Level Indicator) → What you measure
+  "99.2% of requests complete in <500ms"
+SLO (Service Level Objective) → Your target
+  "99.9% of requests should complete in <500ms"
----
-## 🏛️ Tribunal Integration (Anti-Hallucination)
-**Slash command: `/tribunal-backend`**
-**Active reviewers: `logic` · `security` · `performance`**
-### ❌ Forbidden AI Tropes in Observability
-1. **Logging sensitive data** — never log request bodies wholesale — they contain passwords, tokens, PII. Log only specific, safe fields.
-2. **Tracking averages only** — `avg(latency)` hides the 1% of users who get 10x worse experience. Always use percentiles (p95, p99).
-3. **100% SLO targets** — `99.999%` SLOs are wrong for most services. They consume all error budget instantly and paralyze product velocity.
-4. **Inventing OTel packages** — only use `@opentelemetry/{sdk-node,api,exporter-*}` from the official `@opentelemetry` npm org.
+SLA (Service Level Agreement) → Your contract (with penalties)
+  "99.95% uptime or we refund 10%"
-### ✅ Pre-Flight Self-Audit
+Error Budget = 100% - SLO
+  SLO: 99.9% → Error budget: 0.1% → 43 min downtime/month
+  SLO: 99.5% → Error budget: 0.5% → 3.6 hours downtime/month
+Rules:
+- Burn error budget too fast → freeze deployments
+- Error budget remaining → ship features faster
+- Don't set SLOs you can't measure
+- SLOs should be slightly below actual performance
 ```
-✅ Are logs structured JSON (not string-interpolated messages)?
-✅ Is no PII or credential data being logged?
-✅ Are latency measurements tracking percentiles (p95/p99), not just averages?
-✅ Does every async operation have a trace span with error recording?
-✅ Are AI calls instrumented with token count + cost + latency tracking?
-✅ Is there an SLO defined with an explicit error budget policy?
-```
 ---
-## 🤖 LLM-Specific Traps
-AI coding assistants often fall into specific bad habits when dealing with this domain. These are strictly forbidden:
-1. **Over-engineering:** Proposing complex abstractions or distributed systems when a simpler approach suffices.
-2. **Hallucinated Libraries/Methods:** Using non-existent methods or packages. Always `// VERIFY` or check `package.json` / `requirements.txt`.
-3. **Skipping Edge Cases:** Writing the "happy path" and ignoring error handling, timeouts, or data validation.
-4. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
-5. **Silent Degradation:** Catching and suppressing errors without logging or re-raising.
+## Health Checks
----
-## 🏛️ Tribunal Integration (Anti-Hallucination)
+```typescript
+// Liveness: Is the process running?
+app.get("/health/live", (req, res) => {
+  res.status(200).json({ status: "ok" });
+});
-**Slash command: `/review` or `/tribunal-full`**
-**Active reviewers: `logic-reviewer` · `security-auditor`**
+// Readiness: Can it accept traffic?
+app.get("/health/ready", async (req, res) => {
+  try {
+    await db.raw("SELECT 1");           // database check
+    await redis.ping();                  // cache check
+    res.status(200).json({
+      status: "ready",
+      checks: { database: "ok", cache: "ok" },
+    });
+  } catch (error) {
+    res.status(503).json({
+      status: "not ready",
+      checks: { error: (error as Error).message },  // the failing dependency may be the database or the cache
+    });
+  }
+});
-### ❌ Forbidden AI Tropes
+// ❌ HALLUCINATION TRAP: Liveness ≠ Readiness
+// Liveness fails → container restarts (only for unrecoverable states)
+// Readiness fails → stop sending traffic (temporary — DB down, etc.)
+// Making liveness check the DB → DB outage restarts all containers → cascade failure
+```
-1. **Blind Assumptions:** Never make an assumption without documenting it clearly with `// VERIFY: [reason]`.
-2. **Silent Degradation:** Catching and suppressing errors without logging or handling.
-3. **Context Amnesia:** Forgetting the user's constraints and offering generic advice instead of tailored solutions.
+---
-### ✅ Pre-Flight Self-Audit
+## Alerting
-Review these questions before confirming output:
 ```
-✅ Did I rely ONLY on real, verified tools and methods?
-✅ Is this solution appropriately scoped to the user's constraints?
-✅ Did I handle potential failure modes and edge cases?
-✅ Have I avoided generic boilerplate that doesn't add value?
+Alert design rules:
+1. Alert on SYMPTOMS, not causes (high latency, not "CPU is 80%")
+2. Every alert must have a runbook link
+3. Every alert must be ACTIONABLE — if you can't do anything, it's a notification
+4. Use severity levels:
+   - Critical → page on-call (customer-facing outage)
+   - Warning  → Slack notification (degraded, not broken)
+   - Info     → dashboard only (awareness)
+5. Avoid alert fatigue — fewer, meaningful alerts beat many noisy ones
 ```
-### 🛑 Verification-Before-Completion (VBC) Protocol
-**CRITICAL:** You must follow a strict "evidence-based closeout" state machine.
-- ❌ **Forbidden:** Declaring a task complete because the output "looks correct."
-- ✅ **Required:** You are explicitly forbidden from finalizing any task without providing **concrete evidence** (terminal output, passing tests, compile success, or equivalent proof) that your output works as intended.
+---
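
Reviewer note on the error budget figures in the new "SLIs, SLOs & Error Budgets" section: the arithmetic behind "43 min downtime/month" and "3.6 hours downtime/month" can be checked with a short sketch. This is not part of tribunal-kit; `errorBudgetMinutes` and the 30-day window are assumptions introduced here purely for illustration.

```typescript
// Hypothetical helper (not from the package): turns an SLO target into
// the amount of downtime the error budget allows over a rolling window.
const MINUTES_PER_DAY = 24 * 60;

function errorBudgetMinutes(sloPercent: number, windowDays: number = 30): number {
  if (sloPercent <= 0 || sloPercent > 100) {
    throw new RangeError("sloPercent must be in the range (0, 100]");
  }
  // Error Budget = 100% - SLO, expressed as a fraction of the window.
  const budgetFraction = (100 - sloPercent) / 100;
  return budgetFraction * windowDays * MINUTES_PER_DAY;
}

// Assumed 30-day month, matching the figures quoted in the skill file.
console.log(errorBudgetMinutes(99.9).toFixed(1));        // prints "43.2" (minutes)
console.log((errorBudgetMinutes(99.5) / 60).toFixed(1)); // prints "3.6" (hours)
```

Both results agree with the values in the diff, assuming a 30-day month. A different window (for example 28 or 31 days) shifts the numbers slightly, so a team adopting these rules would want to state its window explicitly.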