npm - @kylebegeman/pulse - Versions diffs - 0.3.0 → 0.3.1 - Mend

@kylebegeman/pulse 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (16) hide show

package/README.md +1006 -571
package/dist/executor.d.ts +0 -4
package/dist/executor.d.ts.map +1 -1
package/dist/executor.js +11 -23
package/dist/executor.js.map +1 -1
package/dist/queue.js +6 -6
package/dist/runs.d.ts +1 -12
package/dist/runs.d.ts.map +1 -1
package/dist/runs.js +3 -28
package/dist/runs.js.map +1 -1
package/dist/scheduler.js +4 -4
package/dist/scheduler.js.map +1 -1
package/dist/schema/migrate.d.ts.map +1 -1
package/dist/schema/migrate.js +16 -51
package/dist/schema/migrate.js.map +1 -1
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -1,42 +1,74 @@
-<div align="center">
-# Pulse
-**Signal-driven workflow engine for multi-tenant applications**
-[![TypeScript](https://img.shields.io/badge/TypeScript-5.0+-3178c6?logo=typescript&logoColor=white)](https://www.typescriptlang.org)
-[![Node.js](https://img.shields.io/badge/Node.js-18+-339933?logo=node.js&logoColor=white)](https://nodejs.org)
-[![PostgreSQL](https://img.shields.io/badge/PostgreSQL-12+-4169e1?logo=postgresql&logoColor=white)](https://www.postgresql.org)
-[![Redis](https://img.shields.io/badge/Redis-6+-dc382d?logo=redis&logoColor=white)](https://redis.io)
-[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
-[Quick Start](#quick-start) &#8226; [Core Concepts](#core-concepts) &#8226; [API Reference](#api-reference) &#8226; [Examples](#examples) &#8226; [Database Schema](#database-schema)
-</div>
+<p align="center">
+  <strong>pulse</strong>
+</p>
+<p align="center">
+  Universal workflow engine — signal-driven automation for multi-tenant applications
+</p>
+<p align="center">
+  <a href="#quick-start">Quick Start</a> &middot;
+  <a href="#core-concepts">Core Concepts</a> &middot;
+  <a href="#api-reference">API Reference</a> &middot;
+  <a href="#database-schema">Database Schema</a> &middot;
+  <a href="#examples">Examples</a>
+</p>
 ---
 ## Overview
-Pulse is a TypeScript library for building event-driven workflow automation. Your application emits **signals**, and the engine matches them to registered **workflows** — scheduling and executing **steps** in sequence with full persistence, observability, and replay support.
+Pulse is a TypeScript library that powers event-driven workflow automation. Your application emits **signals** (triggers), and the engine matches them to registered **workflows**, scheduling and executing **steps** in sequence with full persistence, observability, and replay support.
 ```
-Signal  ->  Match  ->  Schedule Steps  ->  Evaluate Conditions  ->  Execute Actions  ->  Record Results
+Signal → Match → Schedule Steps → Evaluate Conditions → Execute Actions → Record Results
 ```
-Built for multi-tenant SaaS. Every signal, workflow, and run is scoped to a tenant. The engine runs in-process alongside your application using your existing PostgreSQL database and Redis instance.
+Built for multi-tenant SaaS applications. Each signal, workflow, and run is scoped to a tenant. The engine runs in-process alongside your application and uses your existing PostgreSQL database and Redis instance.
-### Highlights
+---
-- **Signal-driven** — Emit structured events, let the engine match and execute workflows
-- **Multi-tenant** — Every resource is scoped to a tenant out of the box
-- **Parallel execution** — Fan out into concurrent branches with automatic convergence
-- **Cron scheduling** — Trigger workflows on recurring schedules via BullMQ
-- **Replay** — Re-process historical signals with replay-safety controls
-- **Cancellation & retry** — Cancel in-progress runs or retry from the point of failure
-- **Timeline** — Chronological audit trail for every run
-- **Lifecycle hooks** — Observe step and run completion for integrations
-- **Production-hardened** — CAS step claiming, state guards, idempotent migrations
+## Table of Contents
+- [Quick Start](#quick-start)
+- [Installation](#installation)
+- [Core Concepts](#core-concepts)
+  - [Signals (Triggers)](#signals-triggers)
+  - [Workflows](#workflows)
+  - [Steps](#steps)
+  - [Parallel Steps](#parallel-steps)
+  - [Cron-Triggered Workflows](#cron-triggered-workflows)
+  - [Handler Context](#handler-context)
+  - [Lifecycle Hooks](#lifecycle-hooks)
+  - [Replay](#replay)
+  - [Run Timeline](#run-timeline)
+  - [Cancel & Retry](#cancel--retry)
+- [API Reference](#api-reference)
+  - [createEngine(config)](#createengineconfig)
+  - [Engine Methods](#engine-methods)
+    - [Registration](#registration)
+    - [Signals](#signals)
+    - [Lifecycle](#lifecycle)
+    - [Queries](#queries)
+    - [Timeline](#timeline)
+    - [Cancel & Retry](#cancel--retry-1)
+    - [Replay](#replay-1)
+    - [Workflow Management](#workflow-management)
+    - [Schema](#schema)
+- [Type Reference](#type-reference)
+- [Database Schema](#database-schema)
+- [Runtime Guarantees](#runtime-guarantees)
+- [Architecture](#architecture)
+- [Examples](#examples)
+  - [Monitoring: Incident on Missed Heartbeat](#monitoring-incident-on-missed-heartbeat)
+  - [Billing: Trial Expiration Workflow](#billing-trial-expiration-workflow)
+  - [Chaining: Workflow That Triggers Another Workflow](#chaining-workflow-that-triggers-another-workflow)
+  - [Parallel: Multi-Channel Notification](#parallel-multi-channel-notification)
+  - [Cron: Daily Cleanup Job](#cron-daily-cleanup-job)
+  - [Observability: Run Timeline Dashboard](#observability-run-timeline-dashboard)
+  - [Error Recovery: Retry Failed Runs](#error-recovery-retry-failed-runs)
+- [Requirements](#requirements)
+- [License](#license)
 ---
@@ -47,34 +79,39 @@ import { createEngine } from '@kylebegeman/pulse'
 import { Pool } from 'pg'
 import Redis from 'ioredis'
+// 1. Create the engine
 const engine = createEngine({
   db: new Pool({ connectionString: process.env.DATABASE_URL }),
   redis: new Redis(process.env.REDIS_URL),
 })
-// Register a trigger, action, and condition
+// 2. Register a trigger type with optional schema validation
 engine.registerTrigger('heartbeat.missed', {
   source: 'heartbeat',
   resourceType: 'service',
 })
+// 3. Register an action handler
 engine.registerAction('create_incident', async (ctx) => {
   const incident = await createIncident({
     service: ctx.trigger.resourceId,
     tenant: ctx.tenantId,
+    reason: 'Heartbeat missed',
   })
   ctx.log('Incident created', { incidentId: incident.id })
   return { success: true, data: { incidentId: incident.id } }
 }, { replaySafe: true })
+// 4. Register a condition
 engine.registerCondition('is_still_failing', async (ctx) => {
   const status = await checkServiceHealth(ctx.trigger.resourceId)
   return status === 'unhealthy'
 })
-// Run migrations and create a workflow
+// 5. Run migrations (creates pulse_* tables)
 await engine.migrate()
+// 6. Create a workflow
 await engine.createWorkflow({
   tenantId: 'workspace_1',
   name: 'Incident on missed heartbeat',
@@ -88,9 +125,10 @@ await engine.createWorkflow({
   isEnabled: true,
 })
-// Start processing and emit signals
+// 7. Start the engine (BullMQ workers begin processing)
 await engine.start()
+// 8. Emit signals from anywhere in your application
 await engine.emit({
   tenantId: 'workspace_1',
   source: 'heartbeat',
@@ -100,7 +138,7 @@ await engine.emit({
   payload: { lastSeenAt: new Date().toISOString() },
 })
-// Graceful shutdown
+// 9. Graceful shutdown
 await engine.stop()
 ```
@@ -121,7 +159,7 @@ npm install git+ssh://git@github.com:mrbagels/pulse.git
 **Peer requirements:**
 | Dependency | Purpose |
-|:--|:--|
+|---|---|
 | `pg` | PostgreSQL client — engine uses your connection pool |
 | `ioredis` | Redis client — used by BullMQ for job queuing |
@@ -135,17 +173,17 @@ A **signal** is a structured event emitted by your application. Every signal inc
 ```ts
 await engine.emit({
-  tenantId: 'workspace_1',
-  source: 'heartbeat',
-  type: 'heartbeat.missed',
-  resourceType: 'service',
-  resourceId: 'api-server',
-  environment: 'production',
-  payload: { lastSeenAt: '...' },
+  tenantId: 'workspace_1',       // Required — tenant scope
+  source: 'heartbeat',           // Required — originating system
+  type: 'heartbeat.missed',      // Required — event type
+  resourceType: 'service',       // Optional — what kind of resource
+  resourceId: 'api-server',      // Optional — which specific resource
+  environment: 'production',     // Optional — environment scope
+  payload: { lastSeenAt: '...' } // Optional — arbitrary data
 })
 ```
-Triggers are **registered** to declare their source, expected resource type, and optional Zod payload schema:
+Triggers are **registered** to declare their source, expected resource type, and optional payload schema (validated with Zod):
 ```ts
 import { z } from 'zod'
@@ -159,7 +197,9 @@ engine.registerTrigger('heartbeat.missed', {
 })
 ```
-All emitted signals are persisted to the database for auditing and [replay](#replay).
+All emitted signals are **persisted** to the database for auditing and [replay](#replay).
+---
 ### Workflows
@@ -169,58 +209,64 @@ A **workflow** is a named sequence of steps that runs when a matching signal is
 await engine.createWorkflow({
   tenantId: 'workspace_1',
   name: 'Incident on missed heartbeat',
-  triggerType: 'heartbeat.missed',
-  environmentFilter: 'production',
-  resourceTypeFilter: 'service',
+  triggerType: 'heartbeat.missed',        // Matches signals of this type
+  environmentFilter: 'production',         // Optional — only match this environment
+  resourceTypeFilter: 'service',           // Optional — only match this resource type
   steps: [
     { type: 'delay', name: 'wait_5m', delayMs: 300_000 },
     { type: 'condition', name: 'is_still_failing' },
     { type: 'action', name: 'create_incident' },
   ],
-  config: { severity: 'high' },
+  config: { severity: 'high' },            // Passed to handlers via ctx.config
   isEnabled: true,
 })
 ```
-When a signal is emitted, the engine finds all enabled workflows matching the signal type (and optional filters), creates a **run** for each, and begins scheduling steps. Workflows can be enabled or disabled at runtime:
+When a signal is emitted, the engine finds **all enabled workflows** matching the signal type (and optional environment/resource filters), creates a **run** for each, and begins scheduling steps.
+Workflows can be enabled/disabled at runtime:
 ```ts
 await engine.disableWorkflow('wfd_abc123')
 await engine.enableWorkflow('wfd_abc123')
 ```
+---
 ### Steps
-Each workflow contains an ordered list of steps. Steps execute sequentially — each must complete before the next begins.
+Each workflow contains an ordered list of steps. Steps execute sequentially — each step must complete before the next begins.
-| Type | Purpose |
-|:--|:--|
-| `action` | Execute a registered handler function |
-| `condition` | Evaluate a boolean to branch or complete early |
-| `delay` | Pause execution for a duration (via BullMQ delayed jobs) |
-| `parallel` | Execute multiple branches concurrently |
+#### `action` — Execute logic
-#### Action steps
+Calls a registered handler function. The handler receives a [`WorkflowContext`](#handler-context) and returns an `ActionResult`.
 ```ts
 engine.registerAction('send_alert', async (ctx) => {
   await sendSlackMessage(ctx.config.channel, `Alert for ${ctx.trigger.resourceId}`)
   return { success: true, data: { sent: true } }
-}, { replaySafe: false })
+}, { replaySafe: false }) // Will be skipped during replay
 ```
-Actions support retry policies and timeouts:
+```ts
+// In workflow steps:
+{ type: 'action', name: 'send_alert' }
+```
+**Retry policy:** Action steps can define a retry policy for automatic retries on failure:
 ```ts
 {
   type: 'action',
   name: 'send_alert',
   retryPolicy: { maxAttempts: 3, backoffMs: 5000 },
-  timeoutMs: 30_000,
+  timeoutMs: 30_000,  // Optional step timeout
 }
 ```
-#### Condition steps
+#### `condition` — Branch on logic
+Evaluates a boolean. If the condition returns `false`, the workflow **completes early** by default (status: `completed`, not `failed`).
 ```ts
 engine.registerCondition('monitor_still_failing', async (ctx) => {
@@ -229,90 +275,159 @@ engine.registerCondition('monitor_still_failing', async (ctx) => {
 })
 ```
-Control what happens when a condition returns `false`:
+```ts
+// In workflow steps:
+{ type: 'condition', name: 'monitor_still_failing' }
+```
+**`onFalse` behavior:** Control what happens when a condition returns `false`:
 ```ts
-{ type: 'condition', name: 'check', onFalse: 'complete' }  // Default: complete early
-{ type: 'condition', name: 'check', onFalse: 'skip' }      // Skip next step
-{ type: 'condition', name: 'check', onFalse: 3 }            // Skip next N steps
+// Default: complete the workflow early
+{ type: 'condition', name: 'check', onFalse: 'complete' }
+// Skip the next step and continue
+{ type: 'condition', name: 'check', onFalse: 'skip' }
+// Skip the next N steps
+{ type: 'condition', name: 'check', onFalse: 3 }
 ```
-#### Delay steps
+#### `delay` — Wait before continuing
+Pauses execution for a duration in milliseconds. Uses BullMQ delayed jobs for reliable scheduling. Maximum delay: 30 days.
 ```ts
+// In workflow steps:
 { type: 'delay', name: 'wait_5_minutes', delayMs: 5 * 60 * 1000 }
 ```
-Maximum delay: 30 days. Uses BullMQ delayed jobs for reliability.
+#### `parallel` — Execute branches concurrently
+Runs multiple branches of steps simultaneously. See [Parallel Steps](#parallel-steps) below.
+---
-#### Parallel steps
+### Parallel Steps
-Run multiple branches concurrently. All branches must complete before the workflow continues.
+The `parallel` step type executes multiple branches concurrently. Each branch is an independent sequence of steps. All branches must complete before the workflow continues to the next top-level step.
 ```ts
-{
-  type: 'parallel',
-  name: 'notify_all_channels',
-  branches: [
-    [{ type: 'action', name: 'send_email' }, { type: 'action', name: 'log_email_sent' }],
-    [{ type: 'action', name: 'send_slack' }],
-    [{ type: 'action', name: 'send_sms' }],
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Multi-channel notification',
+  triggerType: 'alert.triggered',
+  steps: [
+    {
+      type: 'parallel',
+      name: 'notify_all_channels',
+      branches: [
+        // Branch 0: Email notification
+        [
+          { type: 'action', name: 'send_email' },
+          { type: 'action', name: 'log_email_sent' },
+        ],
+        // Branch 1: Slack notification
+        [
+          { type: 'action', name: 'send_slack' },
+        ],
+        // Branch 2: SMS notification
+        [
+          { type: 'action', name: 'send_sms' },
+        ],
+      ],
+    },
+    // This step runs after ALL branches complete
+    { type: 'action', name: 'mark_notified' },
   ],
-}
+  config: {},
+  isEnabled: true,
+})
 ```
-- Branches execute independently via BullMQ jobs
-- Branches can contain any step types
-- If any branch fails, the parallel step fails and the run fails
-- Minimum 2 branches required
+**Behavior:**
+- Each branch executes independently and concurrently via BullMQ jobs
+- Branches can contain any step types (action, condition, delay)
+- All branches share the run's `context` object
+- The parallel step completes when **all** branches complete
+- If **any** branch fails, the parallel step fails and the run fails
+- A parallel step must have at least 2 branches
-### Cron Scheduling
+**Branch step tracking:** Each step within a branch is tracked with a `branchIndex` and `parentStepRunId`, linking it back to the parallel step that spawned it. This is visible in the step runs returned by `getRunSteps()`.
-Workflows can run on recurring schedules. When a cron fires, the engine emits a trigger of the workflow's type automatically.
+---
+### Cron-Triggered Workflows
+Workflows can be scheduled to run on a recurring basis using cron expressions. When a cron fires, the engine automatically emits a trigger of the workflow's `triggerType`, which matches and creates a new run.
 ```ts
 await engine.createWorkflow({
   tenantId: 'workspace_1',
   name: 'Daily cleanup',
   triggerType: 'maintenance.cleanup',
-  steps: [{ type: 'action', name: 'cleanup_old_records' }],
+  steps: [
+    { type: 'action', name: 'cleanup_old_records' },
+    { type: 'action', name: 'send_cleanup_report' },
+  ],
   config: {},
   isEnabled: true,
-  cronExpression: '0 2 * * *', // Daily at 2:00 AM
+  cronExpression: '0 2 * * *',  // Run daily at 2:00 AM
 })
 ```
-Cron jobs use BullMQ repeatable jobs. Enabling/disabling a workflow starts/stops its cron. Standard 5-field format: `minute hour dayOfMonth month dayOfWeek`.
+**Behavior:**
+- Cron jobs are managed via BullMQ repeatable jobs on a dedicated cron queue
+- When `engine.start()` is called, all enabled cron definitions are restored
+- Enabling/disabling a workflow also starts/stops its cron job
+- The cron fires a trigger with `source: 'cron'` and the workflow's `triggerType`
+- Standard cron expression format: `minute hour dayOfMonth month dayOfWeek`
+**Examples of cron expressions:**
+- `* * * * *` — every minute
+- `0 */6 * * *` — every 6 hours
+- `0 9 * * 1-5` — weekdays at 9:00 AM
+- `0 0 1 * *` — first day of every month at midnight
+---
 ### Handler Context
-Every action and condition handler receives a `WorkflowContext`:
+Every action and condition handler receives a `WorkflowContext` object:
 ```ts
 engine.registerAction('my_action', async (ctx) => {
-  ctx.tenantId   // Current tenant
-  ctx.trigger    // The signal that started this run
-  ctx.run        // Current run state
-  ctx.step       // Current step state
-  ctx.config     // Workflow-level config
-  ctx.isReplay   // True during replay execution
-  ctx.emit(...)  // Emit another signal (workflow chaining)
-  ctx.log(...)   // Write to execution log
+  ctx.tenantId     // string — current tenant
+  ctx.trigger      // TriggerEnvelope — the signal that started this run
+  ctx.run          // WorkflowRun — current run state
+  ctx.step         // WorkflowStepRun — current step state
+  ctx.config       // Record<string, unknown> — workflow-level config
+  ctx.isReplay     // boolean — true if this is a replay execution
+  ctx.emit(...)    // Emit another signal (enables workflow chaining)
+  ctx.log(...)     // Write to execution log
   return { success: true }
 })
 ```
+The `emit` function on the context lets actions trigger other workflows, enabling powerful **workflow chaining** patterns. See [the chaining example](#chaining-workflow-that-triggers-another-workflow).
+---
 ### Lifecycle Hooks
+The engine supports lifecycle hooks for observability and integration:
 ```ts
 const engine = createEngine({
   db: pool,
   redis: redisClient,
   onStepComplete: (event) => {
+    // Called after each step completes, fails, or is skipped
     console.log(`Step ${event.step.stepName}: ${event.step.status}`)
   },
   onRunComplete: (event) => {
+    // Called when a run reaches completed, failed, or canceled status
     console.log(`Run ${event.run.id}: ${event.status}`)
   },
 })
@@ -320,384 +435,500 @@ const engine = createEngine({
 Hooks are fire-and-forget — errors in hooks do not affect workflow execution.
+---
 ### Replay
-Re-process a historical signal through the matching pipeline:
+All signals are persisted, enabling **replay** — re-processing a historical signal through the matching pipeline as if it were emitted again.
 ```ts
-await engine.replay('trg_abc123')                     // Full replay
-await engine.replay('trg_abc123', { dryRun: true })   // Dry run — skip actions
+// Full replay — creates new runs and executes all steps
+await engine.replay('trg_abc123')
+// Dry run — creates runs but skips all actions (logs only)
+await engine.replay('trg_abc123', { dryRun: true })
 ```
-Actions declare replay safety. Actions with `replaySafe: false` are **skipped** during replay to prevent duplicate side effects.
+**Replay safety:** Each action declares whether it is safe to re-execute via the `replaySafe` option:
 ```ts
+// Safe to replay — idempotent or read-only
 engine.registerAction('create_incident', handler, { replaySafe: true })
+// Not safe to replay — side effects like emails, charges
 engine.registerAction('send_email', handler, { replaySafe: false })
 ```
-### Cancel & Retry
+During replay, actions with `replaySafe: false` are **skipped** (status: `skipped`). This prevents duplicate emails, charges, or other non-idempotent side effects.
+Runs created by replay are marked with `isReplay: true` and handlers can check `ctx.isReplay` to adjust behavior.
+---
+### Run Timeline
-**Cancel** an in-progress run:
+The run timeline provides a chronological view of everything that happened during a workflow run — useful for debugging dashboards and audit trails.
 ```ts
-const canceledRun = await engine.cancelRun('run_abc123', 'No longer needed')
+const timeline = await engine.getRunTimeline('run_abc123')
+for (const entry of timeline) {
+  console.log(`[${entry.timestamp}] ${entry.type}`, {
+    stepName: entry.stepName,
+    detail: entry.detail,
+  })
+}
 ```
-Sets status to `canceled`, marks pending steps as `skipped`, removes pending BullMQ jobs.
+**Entry types:**
+- `run_created` — run was created
+- `step_scheduled` — step was queued for execution
+- `step_started` — step began executing
+- `step_completed` — step finished successfully
+- `step_failed` — step failed with an error
+- `step_skipped` — step was skipped (condition false, replay, etc.)
+- `run_completed` — run finished successfully
+- `run_failed` — run ended due to a step failure
+- `run_canceled` — run was canceled (includes cancel reason in `detail`)
+- `log` — execution log entry from `ctx.log()` (includes level, message, data in `detail`)
+The timeline is assembled from a single SQL query across runs, steps, and execution logs tables, ordered chronologically.
+---
+### Cancel & Retry
+#### Canceling a Running Workflow
-**Retry** a failed run from the point of failure:
+Cancel an in-progress run to stop further step execution:
 ```ts
-const retriedRun = await engine.retryRun('run_abc123')
+const canceledRun = await engine.cancelRun('run_abc123', 'No longer needed')
 ```
-Resets the failed step to `pending`, sets the run back to `running`, and re-enqueues via BullMQ.
+**Behavior:**
+- Sets the run status to `canceled` with a timestamp and optional reason
+- Marks all pending/scheduled step runs as `skipped`
+- Removes pending BullMQ jobs for that run
+- The executor checks for canceled status before executing each step
+- Fires the `onRunComplete` hook with `status: 'canceled'`
+- Cannot cancel runs that are already `completed`, `failed`, or `canceled`
-**List** failed runs:
+#### Retrying Failed Runs
+Resume a failed run from the step that failed:
 ```ts
-const allFailed = await engine.getFailedRuns()
-const tenantFailed = await engine.getFailedRuns('workspace_1')
+const retriedRun = await engine.retryRun('run_abc123')
 ```
-### Run Timeline
+**Behavior:**
+- Finds the first failed step in the run
+- Resets that step to `pending` status (clears error, timestamps)
+- Sets the run back to `running` status
+- Re-enqueues the step in BullMQ with its original retry policy
+- Cannot retry runs that aren't in `failed` status
+#### Listing Failed Runs
-Chronological audit trail for debugging dashboards:
+Query failed runs for monitoring dashboards:
 ```ts
-const timeline = await engine.getRunTimeline('run_abc123')
+// All failed runs
+const allFailed = await engine.getFailedRuns()
-for (const entry of timeline) {
-  console.log(`[${entry.timestamp}] ${entry.type}`, entry.detail)
-}
+// Failed runs for a specific tenant
+const tenantFailed = await engine.getFailedRuns('workspace_1')
 ```
-Entry types: `run_created`, `step_scheduled`, `step_started`, `step_completed`, `step_failed`, `step_skipped`, `run_completed`, `run_failed`, `run_canceled`, `log`
 ---
 ## API Reference
 ### `createEngine(config)`
+Creates and returns an engine instance.
 ```ts
 import { createEngine } from '@kylebegeman/pulse'
 const engine = createEngine({
-  db: pool,                       // Required — pg Pool instance
-  redis: redisClient,             // Required — ioredis instance
-  tablePrefix: 'pulse_',          // Default: 'pulse_'
-  queuePrefix: 'pulse',           // Default: 'pulse'
-  concurrency: 5,                 // Default: 5
-  onStepComplete: (e) => {},      // Optional lifecycle hook
-  onRunComplete: (e) => {},       // Optional lifecycle hook
+  db: pool,              // Required — pg Pool instance
+  redis: redisClient,    // Required — ioredis instance
+  tablePrefix: 'pulse_',    // Optional — table name prefix (default: 'pulse_')
+  queuePrefix: 'pulse',     // Optional — BullMQ queue prefix (default: 'pulse')
+  concurrency: 5,        // Optional — step worker concurrency (default: 5)
+  onStepComplete: (e) => {},  // Optional — lifecycle hook
+  onRunComplete: (e) => {},   // Optional — lifecycle hook
 })
 ```
+---
 ### Engine Methods
 #### Registration
 | Method | Description |
-|:--|:--|
+|---|---|
 | `registerTrigger(type, registration)` | Register a signal type with source, optional resource type, and optional Zod payload schema |
-| `registerAction(name, handler, options?)` | Register an action handler. Set `replaySafe` in options |
-| `registerCondition(name, handler)` | Register a condition handler returning a boolean |
+| `registerAction(name, handler, options?)` | Register an action handler. Set `replaySafe: true/false` in options |
+| `registerCondition(name, handler)` | Register a condition handler that returns a boolean |
 #### Signals
 | Method | Returns | Description |
-|:--|:--|:--|
-| `emit(trigger)` | `TriggerEnvelope` | Emit a signal — persists, matches workflows, creates runs |
+|---|---|---|
+| `emit(trigger)` | `Promise<TriggerEnvelope>` | Emit a signal. Persists the trigger, matches workflows, and creates runs |
 #### Lifecycle
-| Method | Description |
-|:--|:--|
-| `start()` | Start BullMQ workers and restore cron jobs |
-| `stop()` | Graceful shutdown — waits for active jobs, stops cron |
+| Method | Returns | Description |
+|---|---|---|
+| `start()` | `Promise<void>` | Start BullMQ workers and restore cron jobs. Call after registering handlers |
+| `stop()` | `Promise<void>` | Graceful shutdown — waits for active jobs to complete, stops cron workers |
 #### Queries
 | Method | Returns | Description |
-|:--|:--|:--|
-| `getRun(runId)` | `WorkflowRun \| null` | Get a run by ID |
-| `getRunSteps(runId)` | `WorkflowStepRun[]` | Get all step runs for a run |
-| `getRunsByTrigger(triggerId)` | `WorkflowRun[]` | Get runs spawned by a trigger |
-| `getTrigger(triggerId)` | `TriggerEnvelope \| null` | Get a persisted trigger |
+|---|---|---|
+| `getRun(runId)` | `Promise<WorkflowRun \| null>` | Get a workflow run by ID |
+| `getRunSteps(runId)` | `Promise<WorkflowStepRun[]>` | Get all step runs for a workflow run (includes branch steps) |
+| `getRunsByTrigger(triggerId)` | `Promise<WorkflowRun[]>` | Get all runs spawned by a trigger |
+| `getTrigger(triggerId)` | `Promise<TriggerEnvelope \| null>` | Get a persisted trigger by ID |
 #### Timeline
 | Method | Returns | Description |
-|:--|:--|:--|
-| `getRunTimeline(runId)` | `RunTimelineEntry[]` | Chronological lifecycle view |
+|---|---|---|
+| `getRunTimeline(runId)` | `Promise<RunTimelineEntry[]>` | Chronological lifecycle view of a run — steps, logs, status changes |
 #### Cancel & Retry
 | Method | Returns | Description |
-|:--|:--|:--|
-| `cancelRun(runId, reason?)` | `WorkflowRun` | Cancel an in-progress run |
-| `retryRun(runId)` | `WorkflowRun` | Retry a failed run from the failed step |
-| `getFailedRuns(tenantId?)` | `WorkflowRun[]` | List failed runs |
+|---|---|---|
+| `cancelRun(runId, reason?)` | `Promise<WorkflowRun>` | Cancel an in-progress run. Removes pending jobs |
+| `retryRun(runId)` | `Promise<WorkflowRun>` | Retry a failed run from the failed step |
+| `getFailedRuns(tenantId?)` | `Promise<WorkflowRun[]>` | List failed runs, optionally filtered by tenant |
 #### Replay
 | Method | Returns | Description |
-|:--|:--|:--|
-| `replay(triggerId, options?)` | `WorkflowRun[]` | Re-emit a historical trigger |
+|---|---|---|
+| `replay(triggerId, options?)` | `Promise<WorkflowRun[]>` | Re-emit a historical trigger. Pass `{ dryRun: true }` to skip actions |
 #### Workflow Management
 | Method | Returns | Description |
-|:--|:--|:--|
-| `createWorkflow(definition)` | `WorkflowDefinition` | Create a workflow, schedules cron if applicable |
-| `enableWorkflow(definitionId)` | `void` | Enable a workflow |
-| `disableWorkflow(definitionId)` | `void` | Disable a workflow |
+|---|---|---|
+| `createWorkflow(definition)` | `Promise<WorkflowDefinition>` | Create a new workflow definition. Schedules cron if applicable |
+| `enableWorkflow(definitionId)` | `Promise<void>` | Enable a workflow definition. Starts cron if defined |
+| `disableWorkflow(definitionId)` | `Promise<void>` | Disable a workflow definition. Stops cron if defined |
 #### Schema
-| Method | Description |
-|:--|:--|
-| `migrate()` | Create `pulse_*` tables (idempotent) |
+| Method | Returns | Description |
+|---|---|---|
+| `migrate()` | `Promise<void>` | Create `pulse_*` tables in your database (idempotent) |
 ---
-## Examples
-### Monitoring: Incident on Missed Heartbeat
+## Type Reference
-Wait for a missed heartbeat, verify the service is still down, then create an incident and page on-call.
+<details>
+<summary><code>TriggerInput</code> — Signal data you pass to <code>emit()</code></summary>
 ```ts
-engine.registerTrigger('heartbeat.missed', {
-  source: 'heartbeat',
-  resourceType: 'service',
-})
-engine.registerCondition('service_still_down', async (ctx) => {
-  const status = await healthCheck(ctx.trigger.resourceId)
-  return status !== 'healthy'
-})
+interface TriggerInput {
+  tenantId: string
+  source: string
+  type: string
+  resourceType?: string
+  resourceId?: string
+  environment?: string
+  payload?: Record<string, unknown>
+}
+```
-engine.registerAction('create_incident', async (ctx) => {
-  const incident = await db.incidents.create({
-    tenantId: ctx.tenantId,
-    serviceId: ctx.trigger.resourceId,
-    severity: ctx.config.severity || 'medium',
-  })
-  ctx.log('Incident created', { incidentId: incident.id })
-  return { success: true, data: { incidentId: incident.id } }
-}, { replaySafe: true })
+</details>
-engine.registerAction('notify_oncall', async (ctx) => {
-  await pagerduty.trigger({ service: ctx.trigger.resourceId })
-  return { success: true }
-}, { replaySafe: false })
+<details>
+<summary><code>TriggerEnvelope</code> — Persisted signal with ID and timestamp</summary>
-await engine.createWorkflow({
-  tenantId: 'workspace_1',
-  name: 'Incident on missed heartbeat',
-  triggerType: 'heartbeat.missed',
-  environmentFilter: 'production',
-  steps: [
-    { type: 'delay', name: 'wait_5m', delayMs: 5 * 60 * 1000 },
-    { type: 'condition', name: 'service_still_down' },
-    { type: 'action', name: 'create_incident' },
-    { type: 'action', name: 'notify_oncall' },
-  ],
-  config: { severity: 'high' },
-  isEnabled: true,
-})
+```ts
+interface TriggerEnvelope extends TriggerInput {
+  id: string          // Auto-generated (e.g., 'trg_...')
+  createdAt: Date
+}
 ```
-### Billing: Trial Expiration
+</details>
-Send a warning email, wait 3 days, then downgrade if the user hasn't upgraded.
+<details>
+<summary><code>TriggerRegistration</code> — Trigger type declaration</summary>
 ```ts
-engine.registerAction('send_trial_warning', async (ctx) => {
-  await emails.send({
-    to: ctx.trigger.payload.email,
-    template: 'trial-expiring',
-    data: { daysLeft: ctx.trigger.payload.daysLeft },
-  })
-  return { success: true }
-}, { replaySafe: false })
+interface TriggerRegistration {
+  source: string
+  resourceType?: string
+  payloadSchema?: ZodSchema    // Optional Zod schema for payload validation
+}
+```
-engine.registerCondition('has_not_upgraded', async (ctx) => {
-  const sub = await billing.getSubscription(ctx.tenantId)
-  return sub.plan === 'trial'
-})
+</details>
-engine.registerAction('downgrade_to_free', async (ctx) => {
-  await billing.changePlan(ctx.tenantId, 'free')
-  ctx.log('Downgraded to free plan')
-  return { success: true }
-}, { replaySafe: true })
+<details>
+<summary><code>WorkflowDefinition</code> — Stored workflow template</summary>
-await engine.createWorkflow({
-  tenantId: 'workspace_1',
-  name: 'Trial expiration',
-  triggerType: 'trial.expiring',
-  steps: [
-    { type: 'action', name: 'send_trial_warning' },
-    { type: 'delay', name: 'wait_3_days', delayMs: 3 * 24 * 60 * 60 * 1000 },
-    { type: 'condition', name: 'has_not_upgraded' },
-    { type: 'action', name: 'downgrade_to_free' },
-  ],
-  config: {},
-  isEnabled: true,
-})
+```ts
+interface WorkflowDefinition {
+  id: string
+  tenantId: string
+  name: string
+  description?: string
+  triggerType: string
+  environmentFilter?: string
+  resourceTypeFilter?: string
+  steps: WorkflowStep[]
+  config: Record<string, unknown>
+  isEnabled: boolean
+  cronExpression?: string       // Cron schedule (e.g., '0 2 * * *')
+  createdAt: Date
+  updatedAt: Date
+}
 ```
-### Workflow Chaining
+</details>
-Actions can emit signals, allowing workflows to trigger other workflows.
+<details>
+<summary><code>WorkflowStep</code> — Step definition within a workflow</summary>
 ```ts
-engine.registerAction('attempt_auto_fix', async (ctx) => {
-  const result = await autoRemediate(ctx.trigger.resourceId)
-  if (result.fixed) {
-    await ctx.emit({
-      tenantId: ctx.tenantId,
-      source: 'auto-remediation',
-      type: 'service.recovered',
-      resourceType: 'service',
-      resourceId: ctx.trigger.resourceId,
-      payload: { fix: result.action },
-    })
-  }
-  return { success: true, data: result }
-}, { replaySafe: true })
+type StepType = 'action' | 'condition' | 'delay' | 'parallel'
-// A separate workflow reacts to the recovery signal
-await engine.createWorkflow({
-  tenantId: 'workspace_1',
-  name: 'Auto-close on recovery',
-  triggerType: 'service.recovered',
-  steps: [{ type: 'action', name: 'close_incident' }],
-  config: {},
-  isEnabled: true,
-})
+interface WorkflowStep {
+  type: StepType
+  name: string
+  config?: Record<string, unknown>
+  delayMs?: number                      // Only for 'delay' steps
+  timeoutMs?: number                    // Step execution timeout
+  retryPolicy?: RetryPolicy             // Retry on failure
+  onFalse?: 'complete' | 'skip' | number  // Condition false behavior
+  branches?: WorkflowStep[][]           // Only for 'parallel' steps
+}
 ```
-### Parallel: Multi-Channel Notification
+</details>
-Send notifications across multiple channels simultaneously.
+<details>
+<summary><code>WorkflowRun</code> — Execution record for a workflow</summary>
 ```ts
-await engine.createWorkflow({
-  tenantId: 'workspace_1',
-  name: 'Multi-channel alert',
-  triggerType: 'alert.triggered',
-  steps: [
-    {
-      type: 'parallel',
-      name: 'notify_all',
-      branches: [
-        [{ type: 'action', name: 'send_email' }],
-        [{ type: 'action', name: 'send_slack' }],
-        [{ type: 'action', name: 'send_sms' }],
-      ],
-    },
-    { type: 'action', name: 'mark_notified' },
-  ],
-  config: {
-    alertEmail: 'oncall@company.com',
-    slackChannel: '#alerts',
-    phoneNumber: '+1234567890',
-  },
-  isEnabled: true,
-})
+type WorkflowStatus = 'pending' | 'running' | 'waiting' | 'completed' | 'failed' | 'canceled'
+interface WorkflowRun {
+  id: string
+  definitionId: string
+  tenantId: string
+  triggerId: string
+  status: WorkflowStatus
+  context: Record<string, unknown>
+  currentStepIndex: number
+  isReplay: boolean
+  definitionSnapshot: DefinitionSnapshot
+  startedAt?: Date
+  completedAt?: Date
+  failedAt?: Date
+  canceledAt?: Date               // When the run was canceled
+  cancelReason?: string           // Why the run was canceled
+  createdAt: Date
+  updatedAt: Date
+}
 ```
-### Cron: Daily Cleanup Job
+</details>
+<details>
+<summary><code>WorkflowStepRun</code> — Execution record for a single step</summary>
 ```ts
-await engine.createWorkflow({
-  tenantId: 'workspace_1',
-  name: 'Daily cleanup',
-  triggerType: 'maintenance.cleanup',
-  steps: [
-    { type: 'action', name: 'cleanup_old_records' },
-    { type: 'action', name: 'send_cleanup_report' },
-  ],
-  config: {},
-  isEnabled: true,
-  cronExpression: '0 2 * * *',
-})
+type StepStatus = 'pending' | 'scheduled' | 'running' | 'completed' | 'failed' | 'skipped'
+interface WorkflowStepRun {
+  id: string
+  runId: string
+  tenantId: string
+  stepIndex: number
+  stepType: StepType
+  stepName: string
+  status: StepStatus
+  scheduledFor?: Date
+  startedAt?: Date
+  completedAt?: Date
+  result?: Record<string, unknown>
+  errorMessage?: string
+  branchIndex?: number            // Branch index for parallel step branches
+  parentStepRunId?: string        // Parent step run ID for branch steps
+  createdAt: Date
+}
 ```
-### Error Recovery: Retry Failed Runs
+</details>
+<details>
+<summary><code>RunTimelineEntry</code> — Chronological run event</summary>
 ```ts
-const failed = await engine.getFailedRuns('workspace_1')
+interface RunTimelineEntry {
+  timestamp: Date
+  type:
+    | 'run_created' | 'run_completed' | 'run_failed' | 'run_canceled'
+    | 'step_scheduled' | 'step_started' | 'step_completed'
+    | 'step_failed' | 'step_skipped' | 'log'
+  stepIndex?: number
+  stepName?: string
+  stepType?: string
+  detail?: Record<string, unknown>
+}
+```
-for (const run of failed) {
-  const retried = await engine.retryRun(run.id)
-  console.log(`Retried run ${retried.id}, now ${retried.status}`)
+</details>
+<details>
+<summary><code>DefinitionSnapshot</code> — Frozen copy of workflow at run creation</summary>
+```ts
+interface DefinitionSnapshot {
+  steps: WorkflowStep[]
+  config: Record<string, unknown>
+  triggerType: string
 }
+```
-await engine.cancelRun('run_xyz789', 'Superseded by newer deployment')
+</details>
+<details>
+<summary><code>RetryPolicy</code> — Retry configuration for action steps</summary>
+```ts
+interface RetryPolicy {
+  maxAttempts: number    // 1-10
+  backoffMs: number      // 100-300000 ms
+}
 ```
----
+</details>
-## Runtime Guarantees
+<details>
+<summary><code>WorkflowContext</code> — Passed to action and condition handlers</summary>
-### Execution Semantics
+```ts
+interface WorkflowContext {
+  tenantId: string
+  trigger: TriggerEnvelope
+  run: WorkflowRun
+  step: WorkflowStepRun
+  config: Record<string, unknown>
+  isReplay: boolean
+  emit: (trigger: TriggerInput) => Promise<TriggerEnvelope>
+  log: (message: string, data?: Record<string, unknown>) => void
+}
+```
+</details>
-Pulse provides **at-least-once** execution. If a worker crashes after an action completes but before the engine records success, the step may execute again on retry. Design actions to be **idempotent** where possible:
+<details>
+<summary><code>ActionResult</code> — Return value from action handlers</summary>
 ```ts
-engine.registerAction('send_email', async (ctx) => {
-  const idempotencyKey = `${ctx.run.id}:${ctx.step.name}`
-  // Use this key with your provider to prevent duplicates
-})
+interface ActionResult {
+  success: boolean
+  data?: Record<string, unknown>
+  error?: string
+}
 ```
-### Step Claiming
+</details>
-Every step is claimed atomically before execution using compare-and-set:
+<details>
+<summary><code>ActionOptions</code> — Options for action registration</summary>
-```sql
-UPDATE step_runs SET status = 'running', started_at = NOW()
-WHERE id = $1 AND status IN ('pending', 'scheduled')
-RETURNING *
+```ts
+interface ActionOptions {
+  replaySafe: boolean    // Whether this action can be re-executed during replay
+}
+```
+</details>
+<details>
+<summary><code>EngineConfig</code> — Configuration for <code>createEngine()</code></summary>
+```ts
+interface EngineConfig {
+  db: Pool                    // pg Pool instance
+  redis: unknown              // ioredis instance
+  tablePrefix?: string        // Default: 'pulse_'
+  queuePrefix?: string        // Default: 'pulse'
+  concurrency?: number        // Step worker concurrency (default: 5)
+  onStepComplete?: LifecycleHook<StepCompleteEvent>
+  onRunComplete?: LifecycleHook<RunCompleteEvent>
+}
 ```
-If two workers pick up the same job, only one successfully claims it. The other silently no-ops.
+</details>
-### State Transitions
+<details>
+<summary><code>StepCompleteEvent</code> / <code>RunCompleteEvent</code> — Lifecycle hook payloads</summary>
-Run status changes are guarded by the current status. Invalid transitions are rejected:
+```ts
+interface StepCompleteEvent {
+  run: WorkflowRun
+  step: WorkflowStepRun
+}
+interface RunCompleteEvent {
+  run: WorkflowRun
+  status: 'completed' | 'failed' | 'canceled'
+}
 ```
-pending  ->  running, canceled
-running  ->  waiting, completed, failed, canceled
-waiting  ->  running, canceled
-failed   ->  running (retry)
+</details>
+<details>
+<summary><code>ReplayOptions</code> — Options for <code>replay()</code></summary>
+```ts
+interface ReplayOptions {
+  dryRun?: boolean    // If true, skip all actions (log only)
+}
 ```
-### Context Accumulation
+</details>
-Each action's return value is stored in the run context keyed by action name:
+<details>
+<summary><code>ExecutionLog</code> — Structured log entry</summary>
 ```ts
-// After "check_status" returns { healthy: true }
-// ctx.run.context.check_status === { healthy: true }
+interface ExecutionLog {
+  id: string
+  runId: string
+  stepRunId?: string
+  tenantId: string
+  level: 'info' | 'warn' | 'error'
+  message: string
+  data?: Record<string, unknown>
+  createdAt: Date
+}
 ```
-Step names within a workflow must be unique to prevent key collisions.
+</details>
 ---
 ## Database Schema
-The engine creates tables with the prefix `pulse_` (configurable). All tables use `TEXT` primary keys with auto-generated IDs. Migrations are idempotent — safe to call on every startup.
+The engine creates tables in your PostgreSQL database with the prefix `pulse_` (configurable). All tables use `TEXT` primary keys with auto-generated IDs. Migrations are idempotent — safe to call on every startup.
 ```ts
 await engine.migrate()
@@ -705,9 +936,11 @@ await engine.migrate()
 ### `pulse_triggers`
+Persisted signal records for auditing and replay.
 | Column | Type | Description |
-|:--|:--|:--|
-| `id` | `TEXT PK` | Trigger ID (`trg_...`) |
+|---|---|---|
+| `id` | `TEXT PK` | Trigger ID (e.g., `trg_...`) |
 | `tenant_id` | `TEXT` | Tenant scope |
 | `source` | `TEXT` | Originating system |
 | `type` | `TEXT` | Signal type |
@@ -715,12 +948,16 @@ await engine.migrate()
 | `resource_id` | `TEXT` | Specific resource |
 | `environment` | `TEXT` | Environment scope |
 | `payload` | `JSONB` | Arbitrary event data |
-| `created_at` | `TIMESTAMPTZ` | When emitted |
+| `created_at` | `TIMESTAMPTZ` | When the signal was emitted |
+**Indexes:** `(tenant_id, type)`, `(created_at)`
 ### `pulse_workflow_definitions`
+Workflow templates that define step sequences and matching rules.
 | Column | Type | Description |
-|:--|:--|:--|
+|---|---|---|
 | `id` | `TEXT PK` | Definition ID |
 | `tenant_id` | `TEXT` | Tenant scope |
 | `name` | `TEXT` | Human-readable name |
@@ -730,15 +967,19 @@ await engine.migrate()
 | `resource_type_filter` | `TEXT` | Optional resource type filter |
 | `steps` | `JSONB` | Ordered step definitions |
 | `config` | `JSONB` | Config passed to handlers |
-| `is_enabled` | `BOOLEAN` | Whether active |
-| `cron_expression` | `TEXT` | Optional cron schedule |
+| `is_enabled` | `BOOLEAN` | Whether the workflow is active |
+| `cron_expression` | `TEXT` | Optional cron schedule for automatic trigger emission |
 | `created_at` | `TIMESTAMPTZ` | Creation time |
-| `updated_at` | `TIMESTAMPTZ` | Last update |
+| `updated_at` | `TIMESTAMPTZ` | Last update time |
+**Indexes:** `(tenant_id, trigger_type, is_enabled)`
 ### `pulse_workflow_runs`
+Execution records — one per workflow triggered by a signal.
 | Column | Type | Description |
-|:--|:--|:--|
+|---|---|---|
 | `id` | `TEXT PK` | Run ID |
 | `definition_id` | `TEXT FK` | Workflow definition |
 | `tenant_id` | `TEXT` | Tenant scope |
@@ -746,330 +987,524 @@ await engine.migrate()
 | `status` | `TEXT` | `pending` / `running` / `waiting` / `completed` / `failed` / `canceled` |
 | `context` | `JSONB` | Shared run context |
 | `current_step_index` | `INTEGER` | Active step position |
-| `is_replay` | `BOOLEAN` | Whether this is a replay |
-| `definition_snapshot` | `JSONB` | Frozen copy of workflow at run creation |
+| `is_replay` | `BOOLEAN` | Whether this is a replay run |
+| `definition_snapshot` | `JSONB` | Frozen copy of workflow definition at run creation |
 | `started_at` | `TIMESTAMPTZ` | When execution began |
-| `completed_at` | `TIMESTAMPTZ` | When completed |
-| `failed_at` | `TIMESTAMPTZ` | When failed |
-| `canceled_at` | `TIMESTAMPTZ` | When canceled |
-| `cancel_reason` | `TEXT` | Cancel reason |
+| `completed_at` | `TIMESTAMPTZ` | When execution completed |
+| `failed_at` | `TIMESTAMPTZ` | When execution failed |
+| `canceled_at` | `TIMESTAMPTZ` | When the run was canceled |
+| `cancel_reason` | `TEXT` | Why the run was canceled |
 | `created_at` | `TIMESTAMPTZ` | Creation time |
-| `updated_at` | `TIMESTAMPTZ` | Last update |
+| `updated_at` | `TIMESTAMPTZ` | Last update time |
+**Indexes:** `(tenant_id, status)`, `(trigger_id)`
 ### `pulse_workflow_step_runs`
+Individual step execution records within a run.
 | Column | Type | Description |
-|:--|:--|:--|
+|---|---|---|
 | `id` | `TEXT PK` | Step run ID |
 | `run_id` | `TEXT FK` | Parent workflow run |
 | `tenant_id` | `TEXT` | Tenant scope |
-| `step_index` | `INTEGER` | Position in sequence |
+| `step_index` | `INTEGER` | Position in step sequence |
 | `step_type` | `TEXT` | `action` / `condition` / `delay` / `parallel` |
 | `step_name` | `TEXT` | Registered handler name |
 | `status` | `TEXT` | `pending` / `scheduled` / `running` / `completed` / `failed` / `skipped` |
-| `scheduled_for` | `TIMESTAMPTZ` | When to execute (delays) |
+| `scheduled_for` | `TIMESTAMPTZ` | When the step should execute (for delays) |
 | `started_at` | `TIMESTAMPTZ` | When execution began |
-| `completed_at` | `TIMESTAMPTZ` | When completed |
+| `completed_at` | `TIMESTAMPTZ` | When execution completed |
 | `result` | `JSONB` | Action result data |
-| `error_message` | `TEXT` | Error details |
-| `branch_index` | `INTEGER` | Branch index (parallel) |
-| `parent_step_run_id` | `TEXT` | Parent parallel step |
+| `error_message` | `TEXT` | Error details on failure |
+| `branch_index` | `INTEGER` | Branch index within a parallel step (null for top-level) |
+| `parent_step_run_id` | `TEXT` | Parent parallel step run ID (null for top-level) |
 | `created_at` | `TIMESTAMPTZ` | Creation time |
+**Indexes:** `(run_id, step_index)`, `(status, scheduled_for) WHERE status = 'scheduled'`
 ### `pulse_execution_logs`
+Structured execution logs written by `ctx.log()` calls.
 | Column | Type | Description |
-|:--|:--|:--|
+|---|---|---|
 | `id` | `TEXT PK` | Log entry ID |
 | `run_id` | `TEXT FK` | Parent workflow run |
-| `step_run_id` | `TEXT FK` | Associated step |
+| `step_run_id` | `TEXT FK` | Associated step (optional) |
 | `tenant_id` | `TEXT` | Tenant scope |
 | `level` | `TEXT` | `info` / `warn` / `error` |
 | `message` | `TEXT` | Log message |
 | `data` | `JSONB` | Structured log data |
-| `created_at` | `TIMESTAMPTZ` | When written |
+| `created_at` | `TIMESTAMPTZ` | When the log was written |
+**Indexes:** `(run_id, created_at)`
 ---
-## Type Reference
+## Runtime Guarantees
-<details>
-<summary><code>TriggerInput</code></summary>
+### Execution Semantics
-```ts
-interface TriggerInput {
-  tenantId: string
-  source: string
-  type: string
-  resourceType?: string
-  resourceId?: string
-  environment?: string
-  payload?: Record<string, unknown>
-}
-```
-</details>
+Pulse provides **at-least-once** execution semantics. If a worker crashes after an action completes but before the engine records success, the step may execute again on retry. Design your actions to be **idempotent** where possible — especially for side effects like sending emails, charging customers, or creating external resources.
-<details>
-<summary><code>TriggerEnvelope</code></summary>
+You can derive idempotency keys from the workflow context passed to every action:
 ```ts
-interface TriggerEnvelope extends TriggerInput {
-  id: string
-  createdAt: Date
-}
+engine.registerAction('send_email', async (ctx) => {
+  const idempotencyKey = `${ctx.run.id}:${ctx.step.name}`
+  // Use this key with your email provider to prevent duplicates
+})
 ```
-</details>
-<details>
-<summary><code>TriggerRegistration</code></summary>
-```ts
-interface TriggerRegistration {
-  source: string
-  resourceType?: string
-  payloadSchema?: ZodSchema
-}
-```
-</details>
+### Step Claiming
-<details>
-<summary><code>WorkflowDefinition</code></summary>
+Every step is claimed atomically before execution using a compare-and-set query:
-```ts
-interface WorkflowDefinition {
-  id: string
-  tenantId: string
-  name: string
-  description?: string
-  triggerType: string
-  environmentFilter?: string
-  resourceTypeFilter?: string
-  steps: WorkflowStep[]
-  config: Record<string, unknown>
-  isEnabled: boolean
-  cronExpression?: string
-  createdAt: Date
-  updatedAt: Date
-}
+```sql
+UPDATE step_runs SET status = 'running', started_at = NOW()
+WHERE id = $1 AND status IN ('pending', 'scheduled')
+RETURNING *
 ```
-</details>
-<details>
-<summary><code>WorkflowStep</code></summary>
+If two workers pick up the same job from the queue (e.g., due to retry or crash recovery), only one will successfully claim the step. The other receives a null result and silently no-ops.
-```ts
-type StepType = 'action' | 'condition' | 'delay' | 'parallel'
+### Run State Transitions
+Run status changes are guarded by the current status. Invalid transitions (e.g., completing an already-canceled run) are rejected:
-interface WorkflowStep {
-  type: StepType
-  name: string
-  config?: Record<string, unknown>
-  delayMs?: number
-  timeoutMs?: number
-  retryPolicy?: RetryPolicy
-  onFalse?: 'complete' | 'skip' | number
-  branches?: WorkflowStep[][]
-}
 ```
-</details>
+pending  → running, canceled
+running  → waiting, completed, failed, canceled
+waiting  → running, canceled
+failed   → running (retry)
+```
-<details>
-<summary><code>WorkflowRun</code></summary>
+Attempts to transition from an unexpected state throw an error rather than silently corrupting run state.
+### Cancellation
+When a run is canceled:
+1. The run status is set to `canceled` with a timestamp and optional reason.
+2. All pending and scheduled step runs are marked as `skipped`.
+3. A step that is already running will complete its current execution — the engine checks run status before scheduling the next step.
+### Retry
+Retrying a failed run:
+1. Finds the first failed step run and resets it to `pending`.
+2. Sets the run status back to `running`.
+3. The step is re-enqueued and executes with the original retry policy.
+4. Execution continues from the failed step — previously completed steps are not re-run.
+### Replay
+Replay re-emits a historical trigger to create a new run:
+- A new run is created with `isReplay: true`.
+- Actions flagged with `replaySafe: false` are **skipped** during replay execution.
+- All other steps execute normally against the replayed trigger data.
+- Replay is useful for debugging, backfilling, and testing new workflows against historical signals.
+### Parallel Execution
+Parallel steps fan out into concurrent branches:
+- Each branch executes independently with its own step runs.
+- **All branches must complete** for the parallel step to succeed.
+- If **any branch fails**, pending sibling branch steps are canceled and the run is marked as failed.
+- Context from each branch is merged — each step writes to `context[actionName]`, so branch steps should have unique action names.
+### Context Accumulation
+Each action's return value is stored in the run context keyed by action name:
 ```ts
-type WorkflowStatus = 'pending' | 'running' | 'waiting' | 'completed' | 'failed' | 'canceled'
+// After "check_status" action returns { healthy: true }
+// context.check_status === { healthy: true }
-interface WorkflowRun {
-  id: string
-  definitionId: string
-  tenantId: string
-  triggerId: string
-  status: WorkflowStatus
-  context: Record<string, unknown>
-  currentStepIndex: number
-  isReplay: boolean
-  definitionSnapshot: DefinitionSnapshot
-  startedAt?: Date
-  completedAt?: Date
-  failedAt?: Date
-  canceledAt?: Date
-  cancelReason?: string
-  createdAt: Date
-  updatedAt: Date
-}
+// Next action can access it:
+engine.registerAction('notify', async (ctx) => {
+  if (ctx.context.check_status.healthy) { /* ... */ }
+})
 ```
-</details>
-<details>
-<summary><code>WorkflowStepRun</code></summary>
+Step names within a workflow must be unique, preventing key collisions.
-```ts
-type StepStatus = 'pending' | 'scheduled' | 'running' | 'completed' | 'failed' | 'skipped'
+---
+## Architecture
-interface WorkflowStepRun {
-  id: string
-  runId: string
-  tenantId: string
-  stepIndex: number
-  stepType: StepType
-  stepName: string
-  status: StepStatus
-  scheduledFor?: Date
-  startedAt?: Date
-  completedAt?: Date
-  result?: Record<string, unknown>
-  errorMessage?: string
-  branchIndex?: number
-  parentStepRunId?: string
-  createdAt: Date
-}
 ```
-</details>
+┌─────────────────────────────────────────────────────────────────┐
+│                        Your Application                         │
+│                                                                 │
+│   engine.emit({ type: 'heartbeat.missed', ... })                │
+│       │                                                         │
+│       ▼                                                         │
+│   ┌──────────┐     ┌───────────┐     ┌───────────────────────┐  │
+│   │ Registry │     │  Matcher  │────▶│   Run Manager         │  │
+│   │          │     │           │     │   (state machine)     │  │
+│   │ triggers │     │ finds     │     │                       │  │
+│   │ actions  │     │ matching  │     │ pending → running     │  │
+│   │ conditions│    │ workflows │     │ → waiting → completed │  │
+│   └──────────┘     └───────────┘     └───────┬───────────────┘  │
+│                                              │                  │
+│                                              ▼                  │
+│   ┌──────────────────────┐     ┌─────────────────────────────┐  │
+│   │   Step Scheduler     │     │     Step Executor           │  │
+│   │                      │────▶│                             │  │
+│   │ BullMQ delayed jobs  │     │ delay → condition → action  │  │
+│   │ + parallel branches  │     │ + parallel branch dispatch  │  │
+│   └──────────────────────┘     └─────────────────────────────┘  │
+│                                                                 │
+│   ┌────────────────────┐  ┌─────────────────────────────────┐   │
+│   │  Replay Manager    │  │       Execution Logs            │   │
+│   │                    │  │                                 │   │
+│   │ re-emit historical │  │  structured logs per run/step   │   │
+│   │ signals with safety│  │  ctx.log() → pulse_execution_logs│   │
+│   └────────────────────┘  └─────────────────────────────────┘   │
+│                                                                 │
+│   ┌────────────────────┐  ┌─────────────────────────────────┐   │
+│   │   Cron Manager     │  │       Run Timeline              │   │
+│   │                    │  │                                 │   │
+│   │ BullMQ repeatable  │  │  chronological event view       │   │
+│   │ jobs → auto-emit   │  │  for debugging & dashboards     │   │
+│   └────────────────────┘  └─────────────────────────────────┘   │
+│                                                                 │
+│   ┌──────────────┐        ┌────────────────┐                    │
+│   │  PostgreSQL  │        │     Redis      │                    │
+│   │  (your DB)   │        │  (BullMQ jobs) │                    │
+│   └──────────────┘        └────────────────┘                    │
+└─────────────────────────────────────────────────────────────────┘
+```
-<details>
-<summary><code>RunTimelineEntry</code></summary>
+### Module Breakdown
+| Module | File | Purpose |
+|---|---|---|
+| **Engine Factory** | `src/index.ts` | `createEngine()` — wires everything together |
+| **Types** | `src/types.ts` | All TypeScript interfaces and type definitions |
+| **Registry** | `src/registry.ts` | Trigger, action, and condition registration |
+| **Matcher** | `src/matcher.ts` | Signal-to-workflow matching, run creation, and validation |
+| **Run Manager** | `src/runs.ts` | Workflow run lifecycle, state machine, timeline, cancel |
+| **Scheduler** | `src/scheduler.ts` | Step scheduling with BullMQ delayed jobs + parallel branches |
+| **Executor** | `src/executor.ts` | Step dispatch — delay, condition, action, parallel |
+| **Cron Manager** | `src/cron.ts` | Cron-triggered workflow scheduling via BullMQ repeatable jobs |
+| **Replay** | `src/replay.ts` | Trigger persistence and replay support |
+| **Queue** | `src/queue.ts` | BullMQ queue and worker setup (match, step, cron queues) |
+| **Logs** | `src/logs.ts` | Execution logging and query helpers |
+| **Schema** | `src/schema/` | Table definitions and migration runner |
+---
+## Examples
+### Monitoring: Incident on Missed Heartbeat
+A service stops sending heartbeats. Wait 5 minutes, check if it's still down, then create an incident.
 ```ts
-interface RunTimelineEntry {
-  timestamp: Date
-  type:
-    | 'run_created' | 'run_completed' | 'run_failed' | 'run_canceled'
-    | 'step_scheduled' | 'step_started' | 'step_completed'
-    | 'step_failed' | 'step_skipped' | 'log'
-  stepIndex?: number
-  stepName?: string
-  stepType?: string
-  detail?: Record<string, unknown>
-}
+// Register handlers
+engine.registerTrigger('heartbeat.missed', {
+  source: 'heartbeat',
+  resourceType: 'service',
+})
+engine.registerCondition('service_still_down', async (ctx) => {
+  const status = await healthCheck(ctx.trigger.resourceId)
+  return status !== 'healthy'
+})
+engine.registerAction('create_incident', async (ctx) => {
+  const incident = await db.incidents.create({
+    tenantId: ctx.tenantId,
+    serviceId: ctx.trigger.resourceId,
+    severity: ctx.config.severity || 'medium',
+    title: `Service ${ctx.trigger.resourceId} is unresponsive`,
+  })
+  ctx.log('Incident created', { incidentId: incident.id })
+  return { success: true, data: { incidentId: incident.id } }
+}, { replaySafe: true })
+engine.registerAction('notify_oncall', async (ctx) => {
+  await pagerduty.trigger({
+    service: ctx.trigger.resourceId,
+    severity: ctx.config.severity,
+  })
+  return { success: true }
+}, { replaySafe: false }) // Don't re-page during replay
+// Create the workflow
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Incident on missed heartbeat',
+  triggerType: 'heartbeat.missed',
+  environmentFilter: 'production',
+  steps: [
+    { type: 'delay', name: 'wait_5m', delayMs: 5 * 60 * 1000 },
+    { type: 'condition', name: 'service_still_down' },
+    { type: 'action', name: 'create_incident' },
+    { type: 'action', name: 'notify_oncall' },
+  ],
+  config: { severity: 'high' },
+  isEnabled: true,
+})
 ```
-</details>
-<details>
-<summary><code>WorkflowContext</code></summary>
+### Billing: Trial Expiration Workflow
+A trial is about to expire. Send a warning email, wait, then downgrade if they haven't upgraded.
 ```ts
-interface WorkflowContext {
-  tenantId: string
-  trigger: TriggerEnvelope
-  run: WorkflowRun
-  step: WorkflowStepRun
-  config: Record<string, unknown>
-  isReplay: boolean
-  emit: (trigger: TriggerInput) => Promise<TriggerEnvelope>
-  log: (message: string, data?: Record<string, unknown>) => void
-}
+engine.registerTrigger('trial.expiring', {
+  source: 'billing',
+  resourceType: 'subscription',
+})
+engine.registerAction('send_trial_warning', async (ctx) => {
+  await emails.send({
+    to: ctx.trigger.payload.email,
+    template: 'trial-expiring',
+    data: { daysLeft: ctx.trigger.payload.daysLeft },
+  })
+  return { success: true }
+}, { replaySafe: false })
+engine.registerCondition('has_not_upgraded', async (ctx) => {
+  const sub = await billing.getSubscription(ctx.tenantId)
+  return sub.plan === 'trial'
+})
+engine.registerAction('downgrade_to_free', async (ctx) => {
+  await billing.changePlan(ctx.tenantId, 'free')
+  ctx.log('Downgraded to free plan')
+  return { success: true }
+}, { replaySafe: true })
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Trial expiration',
+  triggerType: 'trial.expiring',
+  steps: [
+    { type: 'action', name: 'send_trial_warning' },
+    { type: 'delay', name: 'wait_3_days', delayMs: 3 * 24 * 60 * 60 * 1000 },
+    { type: 'condition', name: 'has_not_upgraded' },
+    { type: 'action', name: 'downgrade_to_free' },
+  ],
+  config: {},
+  isEnabled: true,
+})
 ```
-</details>
-<details>
-<summary><code>EngineConfig</code></summary>
+### Chaining: Workflow That Triggers Another Workflow
+Actions can emit signals, allowing workflows to trigger other workflows.
 ```ts
-interface EngineConfig {
-  db: Pool
-  redis: unknown
-  tablePrefix?: string     // Default: 'pulse_'
-  queuePrefix?: string     // Default: 'pulse'
-  concurrency?: number     // Default: 5
-  onStepComplete?: LifecycleHook<StepCompleteEvent>
-  onRunComplete?: LifecycleHook<RunCompleteEvent>
-}
+// First workflow: detect issue → emit resolution signal
+engine.registerAction('attempt_auto_fix', async (ctx) => {
+  const result = await autoRemediate(ctx.trigger.resourceId)
+  if (result.fixed) {
+    // Emit a new signal — this will match other workflows
+    await ctx.emit({
+      tenantId: ctx.tenantId,
+      source: 'auto-remediation',
+      type: 'service.recovered',
+      resourceType: 'service',
+      resourceId: ctx.trigger.resourceId,
+      payload: { fix: result.action },
+    })
+  }
+  return { success: true, data: result }
+}, { replaySafe: true })
+// Second workflow: triggered by the recovery signal
+engine.registerAction('close_incident', async (ctx) => {
+  await db.incidents.close({
+    serviceId: ctx.trigger.resourceId,
+    resolution: ctx.trigger.payload.fix,
+  })
+  return { success: true }
+}, { replaySafe: true })
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Auto-close on recovery',
+  triggerType: 'service.recovered',
+  steps: [
+    { type: 'action', name: 'close_incident' },
+  ],
+  config: {},
+  isEnabled: true,
+})
 ```
-</details>
-<details>
-<summary><code>ActionResult</code> / <code>ActionOptions</code> / <code>RetryPolicy</code> / <code>ReplayOptions</code></summary>
+### Parallel: Multi-Channel Notification
+Send notifications across multiple channels simultaneously, then mark the alert as notified.
 ```ts
-interface ActionResult {
-  success: boolean
-  data?: Record<string, unknown>
-  error?: string
-}
+engine.registerAction('send_email', async (ctx) => {
+  await emails.send({ to: ctx.config.alertEmail, subject: 'Alert' })
+  return { success: true }
+}, { replaySafe: false })
-interface ActionOptions {
-  replaySafe: boolean
-}
+engine.registerAction('send_slack', async (ctx) => {
+  await slack.post({ channel: ctx.config.slackChannel, text: 'Alert triggered' })
+  return { success: true }
+}, { replaySafe: false })
-interface RetryPolicy {
-  maxAttempts: number   // 1-10
-  backoffMs: number     // 100-300000 ms
-}
+engine.registerAction('send_sms', async (ctx) => {
+  await sms.send({ to: ctx.config.phoneNumber, body: 'Alert triggered' })
+  return { success: true }
+}, { replaySafe: false })
-interface ReplayOptions {
-  dryRun?: boolean
-}
+engine.registerAction('mark_notified', async (ctx) => {
+  await db.alerts.update(ctx.trigger.resourceId, { notified: true })
+  return { success: true }
+}, { replaySafe: true })
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Multi-channel alert',
+  triggerType: 'alert.triggered',
+  steps: [
+    {
+      type: 'parallel',
+      name: 'notify_all',
+      branches: [
+        [{ type: 'action', name: 'send_email' }],
+        [{ type: 'action', name: 'send_slack' }],
+        [{ type: 'action', name: 'send_sms' }],
+      ],
+    },
+    { type: 'action', name: 'mark_notified' },
+  ],
+  config: {
+    alertEmail: 'oncall@company.com',
+    slackChannel: '#alerts',
+    phoneNumber: '+1234567890',
+  },
+  isEnabled: true,
+})
 ```
-</details>
----
+### Cron: Daily Cleanup Job
-## Architecture
+Run a cleanup workflow on a schedule without needing an external signal.
+```ts
+engine.registerTrigger('maintenance.cleanup', {
+  source: 'cron',
+})
+engine.registerAction('cleanup_old_records', async (ctx) => {
+  const deleted = await db.query(
+    'DELETE FROM temp_data WHERE created_at < NOW() - INTERVAL \'30 days\' RETURNING id'
+  )
+  ctx.log('Cleaned up old records', { count: deleted.rowCount })
+  return { success: true, data: { deletedCount: deleted.rowCount } }
+}, { replaySafe: true })
+engine.registerAction('send_cleanup_report', async (ctx) => {
+  await emails.send({
+    to: 'admin@company.com',
+    subject: 'Daily cleanup report',
+    body: `Deleted ${ctx.run.context.deletedCount ?? 0} old records`,
+  })
+  return { success: true }
+}, { replaySafe: false })
+await engine.createWorkflow({
+  tenantId: 'workspace_1',
+  name: 'Daily cleanup',
+  triggerType: 'maintenance.cleanup',
+  steps: [
+    { type: 'action', name: 'cleanup_old_records' },
+    { type: 'action', name: 'send_cleanup_report' },
+  ],
+  config: {},
+  isEnabled: true,
+  cronExpression: '0 2 * * *',  // Every day at 2 AM
+})
 ```
-┌──────────────────────────────────────────────────────────────┐
-│                      Your Application                        │
-│                                                              │
-│   engine.emit({ type: 'heartbeat.missed', ... })             │
-│       │                                                      │
-│       v                                                      │
-│   ┌──────────┐    ┌──────────┐    ┌────────────────────┐     │
-│   │ Registry │    │ Matcher  │--->│    Run Manager      │     │
-│   │          │    │          │    │   (state machine)   │     │
-│   │ triggers │    │ matches  │    │                     │     │
-│   │ actions  │    │ signal   │    │ pending -> running   │     │
-│   │ conds    │    │ to wfs   │    │ -> waiting -> done   │     │
-│   └──────────┘    └──────────┘    └─────────┬──────────┘     │
-│                                             │                │
-│                                             v                │
-│   ┌──────────────────────┐    ┌────────────────────────┐     │
-│   │   Step Scheduler     │    │    Step Executor        │     │
-│   │                      │--->│                         │     │
-│   │ BullMQ delayed jobs  │    │ delay -> cond -> action │     │
-│   │ + parallel branches  │    │ + parallel dispatch     │     │
-│   └──────────────────────┘    └────────────────────────┘     │
-│                                                              │
-│   ┌───────────────┐  ┌──────────────┐  ┌────────────────┐   │
-│   │ Replay Mgr    │  │  Cron Mgr    │  │  Timeline &    │   │
-│   │               │  │              │  │  Exec Logs     │   │
-│   │ re-emit with  │  │ BullMQ       │  │                │   │
-│   │ safety checks │  │ repeatables  │  │ ctx.log()      │   │
-│   └───────────────┘  └──────────────┘  └────────────────┘   │
-│                                                              │
-│   ┌──────────────┐           ┌────────────────┐              │
-│   │  PostgreSQL  │           │     Redis      │              │
-│   │  (your DB)   │           │  (BullMQ jobs) │              │
-│   └──────────────┘           └────────────────┘              │
-└──────────────────────────────────────────────────────────────┘
+### Observability: Run Timeline Dashboard
+Use the timeline API to build a debugging view of what happened during a run.
+```ts
+const timeline = await engine.getRunTimeline('run_abc123')
+// Display in a dashboard
+for (const entry of timeline) {
+  const time = entry.timestamp.toISOString()
+  switch (entry.type) {
+    case 'run_created':
+      console.log(`${time} Run started`)
+      break
+    case 'step_started':
+      console.log(`${time} Step "${entry.stepName}" (${entry.stepType}) started`)
+      break
+    case 'step_completed':
+      console.log(`${time} Step "${entry.stepName}" completed`, entry.detail)
+      break
+    case 'step_failed':
+      console.log(`${time} Step "${entry.stepName}" FAILED:`, entry.detail?.error)
+      break
+    case 'log':
+      console.log(`${time} [${entry.detail?.level}] ${entry.detail?.message}`)
+      break
+    case 'run_completed':
+      console.log(`${time} Run completed successfully`)
+      break
+    case 'run_canceled':
+      console.log(`${time} Run canceled: ${entry.detail?.reason}`)
+      break
+  }
+}
 ```
-### Module Breakdown
+### Error Recovery: Retry Failed Runs
-| Module | File | Purpose |
-|:--|:--|:--|
-| Engine Factory | `src/index.ts` | `createEngine()` — wires everything together |
-| Types | `src/types.ts` | All TypeScript interfaces and type definitions |
-| Registry | `src/registry.ts` | Trigger, action, and condition registration |
-| Matcher | `src/matcher.ts` | Signal-to-workflow matching, run creation, validation |
-| Run Manager | `src/runs.ts` | Run lifecycle, state machine, timeline, cancel |
-| Scheduler | `src/scheduler.ts` | Step scheduling with BullMQ delayed jobs + parallel |
-| Executor | `src/executor.ts` | Step dispatch — delay, condition, action, parallel |
-| Cron Manager | `src/cron.ts` | Cron scheduling via BullMQ repeatable jobs |
-| Replay | `src/replay.ts` | Trigger persistence and replay support |
-| Queue | `src/queue.ts` | BullMQ queue and worker setup |
-| Logs | `src/logs.ts` | Execution logging and query helpers |
-| Schema | `src/schema/` | Table definitions and migration runner |
+Monitor for failures and automatically or manually retry.
+```ts
+// List all failed runs for a tenant
+const failed = await engine.getFailedRuns('workspace_1')
+for (const run of failed) {
+  console.log(`Run ${run.id} failed at ${run.failedAt}`)
+  // Retry the run — resumes from the failed step
+  try {
+    const retried = await engine.retryRun(run.id)
+    console.log(`Retried run ${retried.id}, now ${retried.status}`)
+  } catch (err) {
+    console.error(`Cannot retry: ${err.message}`)
+  }
+}
+// Cancel a run that's stuck or no longer needed
+await engine.cancelRun('run_xyz789', 'Superseded by newer deployment')
+```
 ---
 ## Requirements
-| Requirement | Version |
-|:--|:--|
-| Node.js | 18+ |
-| PostgreSQL | 12+ |
-| Redis | 6+ |
-| TypeScript | 5.0+ |
+| Requirement | Version | Notes |
+|---|---|---|
+| **Node.js** | 18+ | ESM module |
+| **PostgreSQL** | 12+ | Engine creates `pulse_*` tables |
+| **Redis** | 6+ | Used by BullMQ for job queuing |
+| **TypeScript** | 5.0+ | Full type definitions included |
 ---