npm - @probelabs/visor - Versions diffs - 0.1.124 → 0.1.126 - Mend

@probelabs/visor 0.1.124 → 0.1.126

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (195) hide show

package/dist/config.d.ts.map +1 -1
package/dist/docs/DEPLOYMENT.md +117 -11
package/dist/docs/GITHUB_CHECKS.md +18 -4
package/dist/docs/NPM_USAGE.md +112 -39
package/dist/docs/action-reference.md +63 -9
package/dist/docs/advanced-ai.md +58 -51
package/dist/docs/ai-configuration.md +99 -11
package/dist/docs/ai-custom-tools-usage.md +70 -33
package/dist/docs/ai-custom-tools.md +50 -27
package/dist/docs/architecture.md +1232 -0
package/dist/docs/bot-transports-rfc.md +13 -3
package/dist/docs/ci-cli-mode.md +116 -8
package/dist/docs/claude-code.md +111 -41
package/dist/docs/command-provider.md +37 -15
package/dist/docs/commands.md +252 -6
package/dist/docs/configuration.md +138 -4
package/dist/docs/contributing.md +737 -0
package/dist/docs/custom-tools.md +39 -8
package/dist/docs/dashboards/README.md +33 -19
package/dist/docs/debug-visualizer-progress.md +14 -13
package/dist/docs/debug-visualizer-rfc.md +14 -13
package/dist/docs/debug-visualizer.md +30 -5
package/dist/docs/debugging.md +73 -8
package/dist/docs/default-output-schema.md +24 -20
package/dist/docs/dependencies.md +75 -21
package/dist/docs/dev-playbook.md +85 -9
package/dist/docs/engine-pause-resume-rfc.md +11 -11
package/dist/docs/engine-state-machine-plan.md +10 -3
package/dist/docs/event-driven-github-integration-rfc.md +20 -11
package/dist/docs/event-triggers.md +95 -6
package/dist/docs/execution-statistics-rfc.md +16 -4
package/dist/docs/fact-validator-gap-analysis.md +12 -1
package/dist/docs/fact-validator-implementation-plan.md +19 -11
package/dist/docs/fail-if.md +116 -11
package/dist/docs/failure-conditions-implementation.md +40 -6
package/dist/docs/failure-conditions-schema.md +243 -87
package/dist/docs/failure-routing-rfc.md +43 -18
package/dist/docs/failure-routing.md +80 -23
package/dist/docs/faq.md +836 -0
package/dist/docs/foreach-dependency-propagation.md +32 -15
package/dist/docs/github-ops.md +6 -5
package/dist/docs/glossary.md +322 -0
package/dist/docs/goto-forward-run-plan.md +23 -10
package/dist/docs/guides/criticality-modes.md +15 -13
package/dist/docs/guides/fault-management-and-contracts.md +8 -5
package/dist/docs/guides/workflow-style-guide.md +17 -8
package/dist/docs/http.md +102 -3
package/dist/docs/human-input-provider.md +20 -36
package/dist/docs/index.md +206 -0
package/dist/docs/lifecycle-hooks.md +322 -2
package/dist/docs/limits.md +20 -5
package/dist/docs/liquid-templates.md +86 -14
package/dist/docs/loop-routing-refactor.md +4 -2
package/dist/docs/mcp-provider.md +53 -19
package/dist/docs/mcp.md +27 -1
package/dist/docs/memory.md +7 -2
package/dist/docs/migration.md +596 -0
package/dist/docs/observability.md +227 -6
package/dist/docs/output-formats.md +388 -9
package/dist/docs/output-history.md +36 -6
package/dist/docs/performance.md +510 -4
package/dist/docs/pluggable.md +95 -4
package/dist/docs/proposals/snapshot-scope-execution.md +6 -5
package/dist/docs/providers/git-checkout.md +16 -14
package/dist/docs/providers/noop.md +696 -0
package/dist/docs/recipes.md +8 -9
package/dist/docs/rfc/git-checkout-step.md +3 -1
package/dist/docs/rfc/on_init-hook.md +18 -5
package/dist/docs/rfc/workspace-isolation.md +16 -0
package/dist/docs/roadmap/criticality-implementation-tasks.md +27 -27
package/dist/docs/router-patterns.md +155 -43
package/dist/docs/schema-templates.md +51 -15
package/dist/docs/script.md +162 -13
package/dist/docs/sdk.md +46 -12
package/dist/docs/security.md +464 -5
package/dist/docs/slack-integration.md +481 -0
package/dist/docs/tag-filtering.md +60 -20
package/dist/docs/telemetry-setup.md +157 -46
package/dist/docs/test-framework-rfc.md +37 -36
package/dist/docs/testing/assertions.md +92 -4
package/dist/docs/testing/ci.md +56 -7
package/dist/docs/testing/cli.md +57 -15
package/dist/docs/testing/cookbook.md +53 -20
package/dist/docs/testing/dsl-reference.md +110 -9
package/dist/docs/testing/fixtures-and-mocks.md +28 -3
package/dist/docs/testing/flows.md +59 -4
package/dist/docs/testing/getting-started.md +14 -13
package/dist/docs/testing/troubleshooting.md +39 -2
package/dist/docs/timeouts.md +174 -18
package/dist/docs/troubleshooting.md +176 -6
package/dist/docs/workflow-creation-guide.md +101 -3
package/dist/docs/workflows.md +138 -41
package/dist/examples/README.md +169 -4
package/dist/examples/ai-custom-tools-simple.yaml +2 -3
package/dist/examples/cron-webhook-config.yaml +15 -0
package/dist/examples/forEach-example.yaml +6 -0
package/dist/examples/git-checkout-basic.yaml +4 -0
package/dist/examples/git-checkout-compare.yaml +6 -0
package/dist/examples/git-checkout-cross-repo.yaml +7 -0
package/dist/examples/http-integration-config.yaml +30 -0
package/dist/examples/https-server-config.yaml +15 -0
package/dist/examples/mcp-provider-example.yaml +10 -10
package/dist/examples/transform-example.yaml +3 -0
package/dist/examples/webhook-pipeline-config.yaml +18 -0
package/dist/examples/workflows/workflow-composition-example.yaml +4 -0
package/dist/frontends/slack-frontend.d.ts +2 -0
package/dist/frontends/slack-frontend.d.ts.map +1 -1
package/dist/generated/config-schema.d.ts +11 -7
package/dist/generated/config-schema.d.ts.map +1 -1
package/dist/generated/config-schema.json +11 -7
package/dist/index.js +3127 -974
package/dist/output/traces/{run-2026-01-28T16-15-24-569Z.ndjson → run-2026-01-31T16-37-22-321Z.ndjson} +84 -84
package/dist/output/traces/{run-2026-01-28T16-16-09-757Z.ndjson → run-2026-01-31T16-38-06-031Z.ndjson} +1013 -1013
package/dist/providers/ai-check-provider.d.ts +9 -2
package/dist/providers/ai-check-provider.d.ts.map +1 -1
package/dist/providers/command-check-provider.d.ts.map +1 -1
package/dist/providers/mcp-custom-sse-server.d.ts +17 -1
package/dist/providers/mcp-custom-sse-server.d.ts.map +1 -1
package/dist/providers/workflow-check-provider.d.ts.map +1 -1
package/dist/providers/workflow-tool-executor.d.ts +68 -0
package/dist/providers/workflow-tool-executor.d.ts.map +1 -0
package/dist/sdk/{check-provider-registry-AQ3JETBG.mjs → check-provider-registry-3KI5RKXT.mjs} +6 -5
package/dist/sdk/check-provider-registry-IYILYY35.mjs +28 -0
package/dist/sdk/chunk-2CPMMNIX.mjs +1459 -0
package/dist/sdk/chunk-2CPMMNIX.mjs.map +1 -0
package/dist/sdk/chunk-5LI6T4O3.mjs +3600 -0
package/dist/sdk/chunk-5LI6T4O3.mjs.map +1 -0
package/dist/sdk/{chunk-YLQ4UN62.mjs → chunk-A4PGHURG.mjs} +6838 -6257
package/dist/sdk/chunk-A4PGHURG.mjs.map +1 -0
package/dist/sdk/chunk-EXFGO4FX.mjs +147 -0
package/dist/sdk/chunk-EXFGO4FX.mjs.map +1 -0
package/dist/sdk/chunk-PJ7K5UFC.mjs +17732 -0
package/dist/sdk/chunk-PJ7K5UFC.mjs.map +1 -0
package/dist/sdk/{chunk-BHZ4CKUS.mjs → chunk-PXFIALUH.mjs} +77 -8
package/dist/sdk/chunk-PXFIALUH.mjs.map +1 -0
package/dist/sdk/{chunk-PVITVJ6J.mjs → chunk-RTKJXNZS.mjs} +32 -9
package/dist/sdk/chunk-RTKJXNZS.mjs.map +1 -0
package/dist/sdk/chunk-VW2GBXQT.mjs +606 -0
package/dist/sdk/chunk-VW2GBXQT.mjs.map +1 -0
package/dist/sdk/{config-RQQPMLRD.mjs → config-5AUYQFHE.mjs} +2 -2
package/dist/sdk/config-6CUVEH7H.mjs +16 -0
package/dist/sdk/config-6CUVEH7H.mjs.map +1 -0
package/dist/sdk/{github-frontend-6Q4BISZX.mjs → github-frontend-BZ4N3BFZ.mjs} +7 -3
package/dist/sdk/github-frontend-BZ4N3BFZ.mjs.map +1 -0
package/dist/sdk/host-4MT3EW2I.mjs +52 -0
package/dist/sdk/{host-P5NQICP7.mjs → host-NYWXLIFC.mjs} +2 -2
package/dist/sdk/host-NYWXLIFC.mjs.map +1 -0
package/dist/sdk/{routing-DEY2AIXM.mjs → routing-6R42GXUO.mjs} +2 -2
package/dist/sdk/routing-6R42GXUO.mjs.map +1 -0
package/dist/sdk/routing-7FXPULTO.mjs +24 -0
package/dist/sdk/routing-7FXPULTO.mjs.map +1 -0
package/dist/sdk/sdk.d.mts +3 -1
package/dist/sdk/sdk.d.ts +3 -1
package/dist/sdk/sdk.js +12163 -11204
package/dist/sdk/sdk.js.map +1 -1
package/dist/sdk/sdk.mjs +14 -10
package/dist/sdk/sdk.mjs.map +1 -1
package/dist/sdk/slack-frontend-JUT3TYVC.mjs +821 -0
package/dist/sdk/slack-frontend-JUT3TYVC.mjs.map +1 -0
package/dist/sdk/workflow-check-provider-H3CUOLUD.mjs +28 -0
package/dist/sdk/workflow-check-provider-H3CUOLUD.mjs.map +1 -0
package/dist/sdk/workflow-check-provider-YUNNF4KC.mjs +28 -0
package/dist/sdk/workflow-check-provider-YUNNF4KC.mjs.map +1 -0
package/dist/sdk/workflow-registry-KFWSDSLM.mjs +12 -0
package/dist/sdk/workflow-registry-KFWSDSLM.mjs.map +1 -0
package/dist/slack/socket-runner.d.ts +2 -0
package/dist/slack/socket-runner.d.ts.map +1 -1
package/dist/state-machine/context/workflow-inputs.d.ts +20 -0
package/dist/state-machine/context/workflow-inputs.d.ts.map +1 -0
package/dist/state-machine/dispatch/execution-invoker.d.ts.map +1 -1
package/dist/state-machine/dispatch/foreach-processor.d.ts.map +1 -1
package/dist/state-machine/dispatch/stats-manager.d.ts.map +1 -1
package/dist/state-machine/states/level-dispatch.d.ts.map +1 -1
package/dist/state-machine/states/routing.d.ts +2 -1
package/dist/state-machine/states/routing.d.ts.map +1 -1
package/dist/traces/{run-2026-01-28T16-15-24-569Z.ndjson → run-2026-01-31T16-37-22-321Z.ndjson} +84 -84
package/dist/traces/{run-2026-01-28T16-16-09-757Z.ndjson → run-2026-01-31T16-38-06-031Z.ndjson} +1013 -1013
package/dist/types/config.d.ts +3 -1
package/dist/types/config.d.ts.map +1 -1
package/dist/utils/human-id.d.ts +12 -0
package/dist/utils/human-id.d.ts.map +1 -0
package/dist/utils/worktree-manager.d.ts +3 -0
package/dist/utils/worktree-manager.d.ts.map +1 -1
package/dist/workflow-executor.d.ts.map +1 -1
package/dist/workflow-registry.d.ts +1 -0
package/dist/workflow-registry.d.ts.map +1 -1
package/package.json +2 -2
package/dist/sdk/chunk-BHZ4CKUS.mjs.map +0 -1
package/dist/sdk/chunk-PVITVJ6J.mjs.map +0 -1
package/dist/sdk/chunk-YLQ4UN62.mjs.map +0 -1
package/dist/sdk/github-frontend-6Q4BISZX.mjs.map +0 -1
/package/dist/sdk/{check-provider-registry-AQ3JETBG.mjs.map → check-provider-registry-3KI5RKXT.mjs.map} +0 -0
/package/dist/sdk/{config-RQQPMLRD.mjs.map → check-provider-registry-IYILYY35.mjs.map} +0 -0
/package/dist/sdk/{routing-DEY2AIXM.mjs.map → config-5AUYQFHE.mjs.map} +0 -0
/package/dist/sdk/{host-P5NQICP7.mjs.map → host-4MT3EW2I.mjs.map} +0 -0

package/dist/docs/telemetry-setup.md CHANGED Viewed

@@ -4,34 +4,66 @@ This guide shows how to enable Visor telemetry and tracing with OpenTelemetry, e
 ## Quick Start (CLI)
-- Enable telemetry to serverless NDJSON traces:
-  - `VISOR_TELEMETRY_ENABLED=true`
-  - `VISOR_TELEMETRY_SINK=file`
-  - (optional) `VISOR_TRACE_DIR=output/traces`
-- Run:
-  - `visor --config ./.visor.yaml --output json`
-- Inspect traces:
-  - `ls output/traces/*.ndjson`
-## CLI Flags
-- `--telemetry` — enable telemetry (overrides config)
-- `--telemetry-sink <otlp|file|console>` — sink selection
-- `--telemetry-endpoint <url>` — OTLP endpoint (HTTP) for traces/metrics
-- `--trace-report` — write a static HTML trace report to output/traces
-- `--auto-instrumentations` — enable OpenTelemetry auto‑instrumentations
+Enable telemetry to serverless NDJSON traces:
+```bash
+export VISOR_TELEMETRY_ENABLED=true
+export VISOR_TELEMETRY_SINK=file
+export VISOR_TRACE_DIR=output/traces  # optional, defaults to output/traces
+visor --config ./.visor.yaml --output json
+# Inspect traces
+ls output/traces/*.ndjson
+```
+## Environment Variables
+Telemetry is configured via environment variables (highest precedence):
+| Variable | Description | Default |
+|----------|-------------|---------|
+| `VISOR_TELEMETRY_ENABLED` | Enable telemetry (`true`/`false`) | `false` |
+| `VISOR_TELEMETRY_SINK` | Sink type: `otlp`, `file`, or `console` | `file` |
+| `VISOR_TRACE_DIR` | Directory for trace files | `output/traces` |
+| `VISOR_TRACE_REPORT` | Generate static HTML trace report (`true`/`false`) | `false` |
+| `VISOR_TELEMETRY_AUTO_INSTRUMENTATIONS` | Enable auto‑instrumentations (`true`/`false`) | `false` |
+| `VISOR_TELEMETRY_FULL_CAPTURE` | Capture full AI prompts/responses in spans | `false` |
+| `VISOR_FALLBACK_TRACE_FILE` | Explicit path for NDJSON trace file | auto-generated |
+| `OTEL_EXPORTER_OTLP_ENDPOINT` | OTLP endpoint URL (for both traces and metrics) | - |
+| `OTEL_EXPORTER_OTLP_TRACES_ENDPOINT` | OTLP endpoint for traces (overrides above) | - |
+| `OTEL_EXPORTER_OTLP_METRICS_ENDPOINT` | OTLP endpoint for metrics (overrides above) | - |
+| `OTEL_EXPORTER_OTLP_HEADERS` | Headers for OTLP requests (e.g., auth tokens) | - |
 Examples:
-- `visor --config ./.visor.yaml --telemetry --telemetry-sink otlp --telemetry-endpoint https://otel.example.com`
-- `visor --config ./.visor.yaml --telemetry --trace-report --auto-instrumentations`
+```bash
+# File sink (serverless mode)
+VISOR_TELEMETRY_ENABLED=true \
+VISOR_TELEMETRY_SINK=file \
+visor --config ./.visor.yaml
+# OTLP sink with Jaeger
+VISOR_TELEMETRY_ENABLED=true \
+VISOR_TELEMETRY_SINK=otlp \
+OTEL_EXPORTER_OTLP_TRACES_ENDPOINT=http://localhost:4318/v1/traces \
+visor --config ./.visor.yaml
+# With static HTML trace report
+VISOR_TELEMETRY_ENABLED=true \
+VISOR_TRACE_REPORT=true \
+visor --config ./.visor.yaml
+```
 ## Config (visor.yaml)
+Telemetry can also be configured via the `telemetry` section in your config file:
 ```yaml
 version: "1.0"
 telemetry:
   enabled: true
-  sink: file       # otlp|file|console
+  sink: file       # otlp | file | console
   otlp:
     protocol: http
     endpoint: ${OTEL_EXPORTER_OTLP_ENDPOINT}
@@ -45,18 +77,16 @@ telemetry:
       enabled: true
 ```
-ENV overrides (highest precedence):
-- `VISOR_TELEMETRY_ENABLED`, `VISOR_TELEMETRY_SINK`, `OTEL_EXPORTER_OTLP_ENDPOINT`, `OTEL_EXPORTER_OTLP_HEADERS`, `VISOR_TRACE_DIR`
-- `VISOR_TELEMETRY_AUTO_INSTRUMENTATIONS=true`
-- `VISOR_TRACE_REPORT=true`
+> **Note:** Environment variables take precedence over config file settings.
 ## Serverless Mode (NDJSON)
-- Visor writes NDJSON simplified spans to `output/traces/run-<id>-<ts>.ndjson`.
-- Ingest with OTel Collector `filelog` receiver + transform to OTLP.
+When using `VISOR_TELEMETRY_SINK=file` (the default), Visor writes NDJSON simplified spans to `output/traces/run-<timestamp>.ndjson`. This is ideal for serverless/CI environments where you can't run a persistent collector.
+You can then ingest these files using the OTel Collector `filelog` receiver:
-OTel Collector (example):
 ```yaml
+# otel-collector-config.yaml
 receivers:
   filelog:
     include: [ "/work/output/traces/*.ndjson" ]
@@ -75,45 +105,126 @@ service:
 ## Connected Mode (OTLP HTTP)
-- Set `VISOR_TELEMETRY_SINK=otlp` and `OTEL_EXPORTER_OTLP_ENDPOINT=https://collector.example.com`.
-- Metrics exporter is enabled automatically (optional dependency) — histograms/counters for checks, providers, foreach items, fail_if triggers, and diagram blocks.
+For real-time trace streaming to a collector:
+```bash
+export VISOR_TELEMETRY_ENABLED=true
+export VISOR_TELEMETRY_SINK=otlp
+export OTEL_EXPORTER_OTLP_TRACES_ENDPOINT=https://collector.example.com/v1/traces
+# Optional: authentication headers
+export OTEL_EXPORTER_OTLP_HEADERS="Authorization=Bearer your-token"
+```
+When using OTLP sink, the metrics exporter is automatically enabled if the required dependencies are installed (`@opentelemetry/exporter-metrics-otlp-http`, `@opentelemetry/sdk-metrics`). Metrics include histograms and counters for checks, providers, forEach items, and fail_if triggers.
 ## Auto‑Instrumentations
-- Enable with `--auto-instrumentations` or `telemetry.tracing.auto_instrumentations: true`.
-- Adds `@opentelemetry/auto-instrumentations-node` (http/undici/child_process/etc.) and correlates with Visor spans via context.
-- Optional dependency; if not installed, Visor skips auto‑instrumentation gracefully.
+Enable with `VISOR_TELEMETRY_AUTO_INSTRUMENTATIONS=true` or in config:
+```yaml
+telemetry:
+  tracing:
+    auto_instrumentations: true
+```
+This activates `@opentelemetry/auto-instrumentations-node` (http/undici/child_process/etc.) and correlates external calls with Visor spans via context propagation.
+> **Note:** Auto-instrumentations require `@opentelemetry/auto-instrumentations-node` as an optional dependency. If not installed, Visor skips auto‑instrumentation gracefully.
 ## Static Trace Report
-- Enable `--trace-report` or `telemetry.tracing.trace_report.enabled: true`.
-- Outputs two files per run:
-  - `*.trace.json` — simplified span JSON
-  - `*.report.html` — self‑contained HTML timeline (open locally)
+Enable with `VISOR_TRACE_REPORT=true` or in config:
-## Mermaid Telemetry
+```yaml
+telemetry:
+  tracing:
+    trace_report:
+      enabled: true
+```
+This outputs two files per run to your trace directory:
+- `*.trace.json` — simplified span JSON
+- `*.report.html` — self‑contained HTML timeline (open locally in your browser)
+## Span Attributes and Events
-- Visor emits full `diagram.block` events with Mermaid code from outputs and issue messages.
-- Metric: `visor.diagram.blocks{origin}` increments per diagram block.
+Visor emits spans with detailed attributes for debugging:
+### Check Spans (`visor.check.<checkId>`)
+- `visor.check.id` — Check identifier
+- `visor.check.type` — Provider type (ai, command, etc.)
+- `visor.check.input.context` — Liquid template context (sanitized)
+- `visor.check.output` — Check result (truncated if large)
+- `visor.foreach.index` — Index for forEach iterations
+### State Spans (`engine.state.*`)
+- `wave` — Current execution wave number
+- `wave_kind` — Wave type
+- `session_id` — Session identifier
+- `level_size` — Number of checks in wave
+- `level_checks_preview` — Preview of checks in wave
+### Routing Events (`visor.routing`)
+- `trigger` — What triggered the routing decision
+- `action` — Routing action (retry, goto, run)
+- `source` — Source check
+- `target` — Target check(s)
+- `scope` — Execution scope
+- `goto_event` — Event override for goto
 ## Security & Redaction
-- Diagram events are sent verbatim by default (as requested). You can later opt‑in to redaction via `telemetry.redaction` (not enforced by default).
+By default, sensitive environment variables (containing `api_key`, `secret`, `token`, `password`, `auth`, `credential`, `private_key`) are automatically redacted in span attributes.
+To capture full AI prompts and responses (for debugging), set:
+```bash
+export VISOR_TELEMETRY_FULL_CAPTURE=true
+```
+> **Warning:** Full capture may include sensitive data. Use only in secure debugging environments.
 ## GitHub Actions
-- Visor wraps the Action run in a single root span (`visor.run`). Publish the `trace_id` in logs/checks for linking.
-- Example step:
+Visor wraps each execution in a root span (`visor.run`). You can correlate traces with GitHub workflow runs by publishing the `trace_id` in logs or checks.
+Example workflow step:
 ```yaml
-- name: Visor
+- name: Run Visor with tracing
   run: |
     export VISOR_TELEMETRY_ENABLED=true
     export VISOR_TELEMETRY_SINK=otlp
-    export OTEL_EXPORTER_OTLP_ENDPOINT=${{ secrets.OTEL_ENDPOINT }}
+    export OTEL_EXPORTER_OTLP_TRACES_ENDPOINT=${{ secrets.OTEL_ENDPOINT }}
     export OTEL_EXPORTER_OTLP_HEADERS="Authorization=Bearer ${{ secrets.OTEL_TOKEN }}"
     npx -y @probelabs/visor@latest --config ./.visor.yaml --output json
 ```
-Troubleshooting:
-- No spans? Check `VISOR_TELEMETRY_ENABLED`, `VISOR_TELEMETRY_SINK`, and that optional deps resolved in the environment.
-- Huge mermaid outputs? Consider adding a soft length cap in Visor or pre-truncating in templates.
+For file-based tracing in CI (useful for artifact upload):
+```yaml
+- name: Run Visor with file traces
+  run: |
+    export VISOR_TELEMETRY_ENABLED=true
+    export VISOR_TELEMETRY_SINK=file
+    export VISOR_TRACE_DIR=./traces
+    npx -y @probelabs/visor@latest --config ./.visor.yaml
+- name: Upload traces
+  uses: actions/upload-artifact@v4
+  with:
+    name: visor-traces
+    path: ./traces/*.ndjson
+```
+## Troubleshooting
+- **No spans?** Verify `VISOR_TELEMETRY_ENABLED=true` and check that OpenTelemetry packages are installed.
+- **Missing metrics?** Install `@opentelemetry/exporter-metrics-otlp-http` and `@opentelemetry/sdk-metrics`.
+- **Auto-instrumentations not working?** Install `@opentelemetry/auto-instrumentations-node`.
+- **Large span attributes?** Visor truncates attributes at 10,000 characters. For full capture, use `VISOR_TELEMETRY_FULL_CAPTURE=true`.
+## Related Documentation
+- [Debugging Guide](./debugging.md) — Comprehensive debugging techniques
+- [Debug Visualizer](./debug-visualizer.md) — Live execution visualization with `--debug-server`
+- [Telemetry RFC](./telemetry-tracing-rfc.md) — Design rationale and architecture

package/dist/docs/test-framework-rfc.md CHANGED Viewed

@@ -1,7 +1,8 @@
 # Visor Integration Test Framework (RFC)
-Status: In Progress
+Status: Implemented
 Date: 2025-10-27
+Last Updated: 2026-01-28
 Owners: @probelabs/visor
 ## Summary
@@ -17,15 +18,15 @@ Key ideas:
   (Docs use a “fact validation” workflow as an example pattern only; it is not a built‑in feature.)
 Developer Guides
-- Getting started: docs/testing/getting-started.md
-- DSL reference: docs/testing/dsl-reference.md
-- Flows: docs/testing/flows.md
-- Fixtures & mocks: docs/testing/fixtures-and-mocks.md
-- Assertions: docs/testing/assertions.md
-- Cookbook: docs/testing/cookbook.md
-- CLI & reporters: docs/testing/cli.md
-- CI integration: docs/testing/ci.md
-- Troubleshooting: docs/testing/troubleshooting.md
+- Getting started: [testing/getting-started.md](testing/getting-started.md)
+- DSL reference: [testing/dsl-reference.md](testing/dsl-reference.md)
+- Flows: [testing/flows.md](testing/flows.md)
+- Fixtures & mocks: [testing/fixtures-and-mocks.md](testing/fixtures-and-mocks.md)
+- Assertions: [testing/assertions.md](testing/assertions.md)
+- Cookbook: [testing/cookbook.md](testing/cookbook.md)
+- CLI & reporters: [testing/cli.md](testing/cli.md)
+- CI integration: [testing/ci.md](testing/ci.md)
+- Troubleshooting: [testing/troubleshooting.md](testing/troubleshooting.md)
 ## Progress Update (Oct 29, 2025)
@@ -64,7 +65,7 @@ Next steps (milestones excerpt)
 ## File Layout
 - Base config (unchanged): `defaults/.visor.yaml` (regular steps live here).
-- Test suite (new): `defaults/.visor.tests.yaml`
+- Test suite (new): `defaults/visor.tests.yaml`
   - `extends: ".visor.yaml"` to inherit the base checks.
   - Contains `tests.defaults`, `tests.fixtures`, `tests.cases`.
@@ -226,16 +227,16 @@ tests:
 ## CLI Usage
 - Discover tests:
-  - `node dist/index.js test --config defaults/.visor.tests.yaml --list`
+  - `node dist/index.js test --config defaults/visor.tests.yaml --list`
 - Validate test file shape (schema):
-  - `node dist/index.js test --config defaults/.visor.tests.yaml --validate`
+  - `node dist/index.js test --config defaults/visor.tests.yaml --validate`
 - Run all tests with compact progress (default):
-  - `node dist/index.js test --config defaults/.visor.tests.yaml`
+  - `node dist/index.js test --config defaults/visor.tests.yaml`
 - Run a single case:
-  - `node dist/index.js test --config defaults/.visor.tests.yaml --only label-flow`
+  - `node dist/index.js test --config defaults/visor.tests.yaml --only label-flow`
 - Run a single stage in a flow (by name or 1‑based index):
-  - `node dist/index.js test --config defaults/.visor.tests.yaml --only pr-review-e2e-flow#facts-invalid`
-  - `node dist/index.js test --config defaults/.visor.tests.yaml --only pr-review-e2e-flow#3`
+  - `node dist/index.js test --config defaults/visor.tests.yaml --only pr-review-e2e-flow#facts-invalid`
+  - `node dist/index.js test --config defaults/visor.tests.yaml --only pr-review-e2e-flow#3`
 - Emit artifacts:
   - JSON: `--json output/visor-tests.json`
   - JUnit: `--report junit:output/visor-tests.xml`
@@ -433,9 +434,9 @@ expect:
 ## CLI
 ```
-visor test --config defaults/.visor.tests.yaml         # run all cases
-visor test --config defaults/.visor.tests.yaml --only pr-review-e2e-flow
-visor test --config defaults/.visor.tests.yaml --list  # list case names
+visor test --config defaults/visor.tests.yaml         # run all cases
+visor test --config defaults/visor.tests.yaml --only pr-review-e2e-flow
+visor test --config defaults/visor.tests.yaml --list  # list case names
 ```
 Exit codes:
@@ -459,7 +460,7 @@ The runner prints a concise, human‑friendly summary optimized for scanning:
   - First mismatch shows an inline diff (expected vs actual substring/regex or value), with a clear hint to fix.
 - Flow cases show each stage nested under the parent with roll‑up status.
 - Summary footer with pass/fail counts, slowest cases, and a hint to rerun focused:
-  - e.g., visor test --config defaults/.visor.tests.yaml --only security-fail-if
+  - e.g., visor test --config defaults/visor.tests.yaml --only security-fail-if
 Color, symbols, and truncation rules mirror our main CLI:
 - Green checks for passes, red crosses for failures, yellow for skipped.
@@ -496,7 +497,7 @@ Progress Tracker
 - Milestone 7 — CLI reporters/UX polish — DONE (2025-10-27)
 - Milestone 8 — Validation and helpful errors — DONE (2025-10-27)
 - Milestone 9 — Coverage and perf — DONE (2025-10-27)
-- Milestone 10 — Docs, examples, migration — PENDING
+- Milestone 10 — Docs, examples, migration — DONE (2026-01-28)
 Progress Update — 2025-10-29
 - FlowStage refactor: each stage now recomputes prompts/output‑history deltas and execution statistics after any fallback run that executes “missing” expected steps. Coverage tables reflect the final state of the stage.
@@ -513,7 +514,7 @@ Progress Update — 2025-10-28
 Milestone 0 — DSL freeze and scaffolding (0.5 week) — DONE 2025-10-27
 - Finalize DSL keys: tests.defaults, fixtures, cases, flow, fixture, mocks, expect.{calls,prompts,outputs,fail,strict_violation}. ✅
-- Rename use_fixture → fixture across examples (done in this RFC and defaults/.visor.tests.yaml). ✅
+- Rename use_fixture → fixture across examples (done in this RFC and defaults/visor.tests.yaml). ✅
 - Create module skeletons: ✅
   - src/test-runner/index.ts (entry + orchestration)
   - src/test-runner/fixture-loader.ts (builtin + overrides)
@@ -526,7 +527,7 @@ Milestone 0 — DSL freeze and scaffolding (0.5 week) — DONE 2025-10-27
 Progress Notes
 - Discovery works against any .visor.tests.yaml (general-purpose, not tied to defaults).
 - Recording Octokit records arbitrary rest ops without hardcoding method lists.
-- defaults/.visor.tests.yaml updated to consistent count grammar and fixed indentation issues.
+- defaults/visor.tests.yaml updated to consistent count grammar and fixed indentation issues.
 Milestone 1 — MVP runner and single‑event cases (1 week) — DONE 2025-10-27 (non‑flow)
 - CLI: add visor test [--config path] [--only name] [--bail] [--list]. ✅
@@ -543,12 +544,12 @@ Notes
 Verification
 - Build CLI + SDK: npm run build — success.
-- Discovery: visor test --config defaults/.visor.tests.yaml --list — lists suite and cases.
+- Discovery: visor test --config defaults/visor.tests.yaml --list — lists suite and cases.
 - Run single cases:
-  - visor test --config defaults/.visor.tests.yaml --only label-flow — PASS
-  - visor test --config defaults/.visor.tests.yaml --only issue-triage — PASS
-  - visor test --config defaults/.visor.tests.yaml --only security-fail-if — PASS
-  - visor test --config defaults/.visor.tests.yaml --only strict-mode-example — PASS
+  - visor test --config defaults/visor.tests.yaml --only label-flow — PASS
+  - visor test --config defaults/visor.tests.yaml --only issue-triage — PASS
+  - visor test --config defaults/visor.tests.yaml --only security-fail-if — PASS
+  - visor test --config defaults/visor.tests.yaml --only strict-mode-example — PASS
 - Behavior observed:
   - Strict mode enforced (steps executed but not asserted would fail).
   - GitHub ops recorded by default with dynamic recorder, no network calls.
@@ -606,7 +607,7 @@ Milestone 8 — Validation and helpful errors (0.5 week) — DONE 2025-10-27
 Usage:
 ```
-visor test --validate --config defaults/.visor.tests.yaml
+visor test --validate --config defaults/visor.tests.yaml
 ```
 Example error output:
@@ -626,12 +627,12 @@ Milestone 9 — Coverage and perf (0.5 week) — DONE 2025-10-27
 Usage examples:
 ```
-visor test --config defaults/.visor.tests.yaml --max-parallel 4
-visor test --config defaults/.visor.tests.yaml --prompt-max-chars 16000
+visor test --config defaults/visor.tests.yaml --max-parallel 4
+visor test --config defaults/visor.tests.yaml --prompt-max-chars 16000
 ```
-Milestone 10 — Docs, examples, and migration (0.5 week) — IN PROGRESS 2025-10-31
-- Update README to link the RFC and defaults/.visor.tests.yaml.
+Milestone 10 — Docs, examples, and migration (0.5 week) — DONE 2026-01-28
+- Update README to link the RFC and defaults/visor.tests.yaml.
 - Document built-in fixtures catalog and examples.
 - Migration note: how to move from embedded tests and from `returns` to new mocks.
 - Document `depends_on` ANY‑OF (pipe) groups with examples (done).
@@ -650,7 +651,7 @@ Success Metrics
 ## Compatibility & Migration
-- Tests moved from `defaults/.visor.yaml` into `defaults/.visor.tests.yaml` with `extends: ".visor.yaml"`.
+- Tests moved from `defaults/.visor.yaml` into `defaults/visor.tests.yaml` with `extends: ".visor.yaml"`.
 - Old `mocks.*.returns` is replaced by direct values (object/array/string).
 - You no longer need `run: steps` in tests; cases are integration‑driven by `event + fixture`.
 - `no_other_calls` is unnecessary with strict mode; it’s implied and enforced.
@@ -671,7 +672,7 @@ Success Metrics
 ## Appendix: Example Suite
-See `defaults/.visor.tests.yaml` in the repo for a complete, multi‑event example covering:
+See `defaults/visor.tests.yaml` in the repo for a complete, multi‑event example covering:
 - PR opened → overview + labels
 - Standard PR comment → no action
 - `/visor` comment → reply

package/dist/docs/testing/assertions.md CHANGED Viewed

@@ -1,10 +1,15 @@
 # Writing Assertions
-Assertions live under `expect:` and cover three surfaces:
+Assertions live under `expect:` and cover several surfaces:
-- `calls`: step counts and provider effects (GitHub ops)
+- `calls`: step counts and provider effects (GitHub/Slack ops)
 - `prompts`: final AI prompts (post templating/context)
 - `outputs`: step outputs with history and selectors
+- `workflow_output`: workflow-level outputs (for workflow testing)
+- `no_calls`: assert that specific steps or provider ops were NOT called
+- `fail`: assert that the case failed with a specific message
+- `strict_violation`: assert strict mode failure for a missing expect on a step
+- `use`: reference reusable macros defined in `tests.defaults.macros`
 ## Calls
@@ -18,10 +23,22 @@ expect:
       at_least: 1
       args:
         contains: [feature, "review/effort:2"]
+    - provider: slack
+      op: chat.postMessage
+      at_least: 1
+      args:
+        contains: ["Review complete"]
 ```
 Counts are consistent everywhere: `exactly`, `at_least`, `at_most`.
+Supported providers:
+- `github`: GitHub API operations (e.g., `labels.add`, `issues.createComment`, `pulls.createReview`, `checks.create`)
+- `slack`: Slack API operations (e.g., `chat.postMessage`)
+The `args` field supports:
+- `contains`: array of values that must be present (for labels) or substrings (for Slack text)
 ## Prompts
 ```yaml
@@ -44,7 +61,7 @@ expect:
 - `index`: `first` | `last` | N (default: last)
 - `where`: selector to choose a prompt from history using `contains`/`not_contains`/`matches` before applying the assertion
-Tip: enable `--prompt-max-chars` or `tests.defaults.prompt_max_chars` to cap stored prompt size for large diffs.
+Tip: Enable `--prompt-max-chars` CLI flag or `tests.defaults.prompt_max_chars` config setting to cap stored prompt size for large diffs.
 ## Outputs
@@ -72,7 +89,31 @@ Supported comparators:
 - `matches` (regex)
 - `contains_unordered` (array membership ignoring order)
-## Strict mode and “no calls”
+## Workflow Outputs
+For workflow testing, use `workflow_output` to assert on workflow-level outputs (defined in the workflow's `outputs:` section):
+```yaml
+expect:
+  workflow_output:
+    - path: summary
+      contains: "Review completed"
+    - path: issues_found
+      equals: 3
+    - path: categories
+      contains_unordered: ["security", "performance"]
+```
+Supported comparators for workflow outputs:
+- `equals` (primitive)
+- `equalsDeep` (structural)
+- `matches` (regex)
+- `contains` (substring check, can be string or array)
+- `not_contains` (forbidden substrings)
+- `contains_unordered` (array membership ignoring order)
+- `where` (selector with `path` + `equals`/`matches`)
+## Strict mode and "no calls"
 Strict mode (default) fails any executed step without a corresponding `expect.calls` entry. You can also assert absence explicitly:
@@ -81,5 +122,52 @@ expect:
   no_calls:
     - provider: github
       op: issues.createComment
+    - provider: slack
+      op: chat.postMessage
     - step: extract-facts
 ```
+## Failure Assertions
+Assert that a test case fails with a specific error message:
+```yaml
+expect:
+  fail:
+    message_contains: "validation failed"
+```
+Assert that strict mode caught an unexpected step execution:
+```yaml
+expect:
+  strict_violation:
+    for_step: unexpected-step
+    message_contains: "Step executed without expect"
+```
+## Reusable Macros
+Define reusable assertion blocks in `tests.defaults.macros` and reference them with `use`:
+```yaml
+tests:
+  defaults:
+    macros:
+      basic-github-check:
+        calls:
+          - provider: github
+            op: checks.create
+            at_least: 1
+  cases:
+    - name: my-test
+      event: pr_opened
+      expect:
+        use: [basic-github-check]
+        calls:
+          - step: overview
+            exactly: 1
+```
+Macros are merged with inline expectations, allowing you to compose reusable assertion patterns.

package/dist/docs/testing/ci.md CHANGED Viewed

@@ -1,6 +1,8 @@
 # CI Integration for Tests
-Run your in‑YAML integration tests in CI using the Visor CLI. Below is a GitHub Actions example. Adapt for other CIs similarly.
+Run your in-YAML integration tests in CI using the Visor CLI. Below is a GitHub Actions example. Adapt for other CIs similarly.
+## Basic GitHub Actions Example
 ```yaml
 name: Visor Tests
@@ -14,13 +16,13 @@ jobs:
       - uses: actions/setup-node@v4
         with: { node-version: '20' }
       - run: npm ci
-      - run: npm run build --ignore-scripts
+      - run: npm run build
-      - name: Run integration tests (defaults)
+      - name: Run integration tests
         run: |
           mkdir -p output
-          node ./dist/index.js test \
-            --config defaults/.visor.tests.yaml \
+          npx visor test \
+            --config defaults/visor.tests.yaml \
             --json output/visor-tests.json \
             --report junit:output/visor-tests.xml \
             --summary md:output/visor-tests.md
@@ -36,9 +38,56 @@ jobs:
             output/visor-tests.md
 ```
-Tips
+## Multi-Suite Discovery
+When you have multiple test files, Visor can discover and run them all:
+```yaml
+- name: Run all test suites
+  run: |
+    mkdir -p output
+    npx visor test tests/ \
+      --max-suites 4 \
+      --max-parallel 2 \
+      --json output/visor-tests.json \
+      --report junit:output/visor-tests.xml
+```
+The test runner automatically discovers:
+- Files ending with `.tests.yaml` or `.tests.yml`
+- YAML files containing a top-level `tests:` key with a `cases` array
+## Validation-Only Step
+Add a fast validation step before running tests to catch YAML syntax errors early:
+```yaml
+- name: Validate test files
+  run: npx visor test --validate --config defaults/visor.tests.yaml
+```
+## Environment Variables
+| Variable | Default | Description |
+|----------|---------|-------------|
+| `VISOR_DEBUG` | `false` | Enable debug logging |
+| `VISOR_TEST_PROMPT_MAX_CHARS` | `4000` (CI) / `8000` | Truncate captured prompts |
+| `VISOR_TEST_HISTORY_LIMIT` | `200` (CI) / `500` | Limit output history entries |
+| `CI` | - | Automatically detected; adjusts defaults |
+## Tips
 - Keep `ai_provider: mock` in `tests.defaults` for fast, deterministic runs.
-- Set `--max-parallel` to speed up large suites (flows still run sequentially per case).
+- Set `--max-parallel` to speed up case execution within a suite.
+- Set `--max-suites` to run multiple test files in parallel.
 - Use `--bail` for faster feedback on PRs; run full suite on main.
 - Collect artifacts so you can inspect failures without re-running.
+- Use `--validate` in a separate step for faster feedback on syntax errors.
+## See Also
+- [CLI Reference](cli.md) - Full list of test command flags
+- [Getting Started](getting-started.md) - Writing your first tests
+- [Fixtures and Mocks](fixtures-and-mocks.md) - Mock AI providers for CI
+- [Troubleshooting](troubleshooting.md) - Common CI issues