npm - coalesce-transform-mcp - Versions diffs - 0.4.8 → 0.5.0-alpha.1 - Mend

coalesce-transform-mcp 0.4.8 → 0.5.0-alpha.1

Files changed (151)

package/README.md +480 -211
package/agents/lineage-analyst.agent.md +21 -0
package/agents/pipeline-builder.agent.md +33 -0
package/agents/run-operator.agent.md +28 -0
package/agents/workspace-auditor.agent.md +23 -0
package/dist/cache-dir.d.ts +1 -0
package/dist/cache-dir.d.ts.map +1 -1
package/dist/cache-dir.js +13 -1
package/dist/cache-dir.js.map +1 -1
package/dist/client.d.ts.map +1 -1
package/dist/client.js +6 -13
package/dist/client.js.map +1 -1
package/dist/coalesce/api/jobs.d.ts +4 -0
package/dist/coalesce/api/jobs.d.ts.map +1 -1
package/dist/coalesce/api/jobs.js +4 -0
package/dist/coalesce/api/jobs.js.map +1 -1
package/dist/coalesce/api/runs.d.ts.map +1 -1
package/dist/coalesce/api/runs.js +5 -2
package/dist/coalesce/api/runs.js.map +1 -1
package/dist/coalesce/run-schemas.d.ts +28 -31
package/dist/coalesce/run-schemas.d.ts.map +1 -1
package/dist/coalesce/run-schemas.js +5 -77
package/dist/coalesce/run-schemas.js.map +1 -1
package/dist/coalesce/tool-response.d.ts.map +1 -1
package/dist/coalesce/tool-response.js +2 -2
package/dist/coalesce/tool-response.js.map +1 -1
package/dist/mcp/coa.d.ts +323 -0
package/dist/mcp/coa.d.ts.map +1 -0
package/dist/mcp/coa.js +529 -0
package/dist/mcp/coa.js.map +1 -0
package/dist/mcp/environments.d.ts.map +1 -1
package/dist/mcp/environments.js +17 -2
package/dist/mcp/environments.js.map +1 -1
package/dist/mcp/git-accounts.d.ts.map +1 -1
package/dist/mcp/git-accounts.js +24 -2
package/dist/mcp/git-accounts.js.map +1 -1
package/dist/mcp/jobs.d.ts.map +1 -1
package/dist/mcp/jobs.js +22 -3
package/dist/mcp/jobs.js.map +1 -1
package/dist/mcp/lineage.d.ts.map +1 -1
package/dist/mcp/lineage.js +53 -3
package/dist/mcp/lineage.js.map +1 -1
package/dist/mcp/nodes.d.ts.map +1 -1
package/dist/mcp/nodes.js +22 -3
package/dist/mcp/nodes.js.map +1 -1
package/dist/mcp/pipelines.js +6 -6
package/dist/mcp/pipelines.js.map +1 -1
package/dist/mcp/projects.d.ts.map +1 -1
package/dist/mcp/projects.js +34 -2
package/dist/mcp/projects.js.map +1 -1
package/dist/mcp/repo-node-types.js +4 -4
package/dist/mcp/repo-node-types.js.map +1 -1
package/dist/mcp/runs.d.ts.map +1 -1
package/dist/mcp/runs.js +27 -4
package/dist/mcp/runs.js.map +1 -1
package/dist/mcp/setup.d.ts +4 -0
package/dist/mcp/setup.d.ts.map +1 -0
package/dist/mcp/setup.js +15 -0
package/dist/mcp/setup.js.map +1 -0
package/dist/mcp/subgraphs.d.ts.map +1 -1
package/dist/mcp/subgraphs.js +28 -2
package/dist/mcp/subgraphs.js.map +1 -1
package/dist/mcp/tool-helpers.d.ts +43 -1
package/dist/mcp/tool-helpers.d.ts.map +1 -1
package/dist/mcp/tool-helpers.js +84 -2
package/dist/mcp/tool-helpers.js.map +1 -1
package/dist/mcp/users.d.ts.map +1 -1
package/dist/mcp/users.js +54 -2
package/dist/mcp/users.js.map +1 -1
package/dist/prompts/index.d.ts.map +1 -1
package/dist/prompts/index.js +82 -0
package/dist/prompts/index.js.map +1 -1
package/dist/resources/coa-describe.d.ts +12 -0
package/dist/resources/coa-describe.d.ts.map +1 -0
package/dist/resources/coa-describe.js +106 -0
package/dist/resources/coa-describe.js.map +1 -0
package/dist/resources/context/aggregation-patterns.md +2 -2
package/dist/resources/context/ecosystem-boundaries.md +130 -0
package/dist/resources/context/tool-usage.md +2 -2
package/dist/resources/index.d.ts.map +1 -1
package/dist/resources/index.js +9 -0
package/dist/resources/index.js.map +1 -1
package/dist/server.d.ts +1 -1
package/dist/server.d.ts.map +1 -1
package/dist/server.js +17 -0
package/dist/server.js.map +1 -1
package/dist/services/cache/snapshots.d.ts.map +1 -1
package/dist/services/cache/snapshots.js +11 -10
package/dist/services/cache/snapshots.js.map +1 -1
package/dist/services/coa/describe.d.ts +49 -0
package/dist/services/coa/describe.d.ts.map +1 -0
package/dist/services/coa/describe.js +182 -0
package/dist/services/coa/describe.js.map +1 -0
package/dist/services/coa/preflight.d.ts +40 -0
package/dist/services/coa/preflight.d.ts.map +1 -0
package/dist/services/coa/preflight.js +461 -0
package/dist/services/coa/preflight.js.map +1 -0
package/dist/services/coa/project.d.ts +11 -0
package/dist/services/coa/project.d.ts.map +1 -0
package/dist/services/coa/project.js +35 -0
package/dist/services/coa/project.js.map +1 -0
package/dist/services/coa/redact.d.ts +22 -0
package/dist/services/coa/redact.d.ts.map +1 -0
package/dist/services/coa/redact.js +84 -0
package/dist/services/coa/redact.js.map +1 -0
package/dist/services/coa/resolver.d.ts +21 -0
package/dist/services/coa/resolver.d.ts.map +1 -0
package/dist/services/coa/resolver.js +90 -0
package/dist/services/coa/resolver.js.map +1 -0
package/dist/services/coa/runner.d.ts +30 -0
package/dist/services/coa/runner.d.ts.map +1 -0
package/dist/services/coa/runner.js +117 -0
package/dist/services/coa/runner.js.map +1 -0
package/dist/services/config/coa-config.d.ts +52 -0
package/dist/services/config/coa-config.d.ts.map +1 -0
package/dist/services/config/coa-config.js +159 -0
package/dist/services/config/coa-config.js.map +1 -0
package/dist/services/config/credentials.d.ts +37 -0
package/dist/services/config/credentials.d.ts.map +1 -0
package/dist/services/config/credentials.js +124 -0
package/dist/services/config/credentials.js.map +1 -0
package/dist/services/lineage/lineage-cache.d.ts.map +1 -1
package/dist/services/lineage/lineage-cache.js +26 -7
package/dist/services/lineage/lineage-cache.js.map +1 -1
package/dist/services/lineage/lineage-propagation.d.ts +1 -0
package/dist/services/lineage/lineage-propagation.d.ts.map +1 -1
package/dist/services/lineage/lineage-propagation.js +139 -56
package/dist/services/lineage/lineage-propagation.js.map +1 -1
package/dist/services/pipelines/intent-resolution.d.ts.map +1 -1
package/dist/services/pipelines/intent-resolution.js +7 -1
package/dist/services/pipelines/intent-resolution.js.map +1 -1
package/dist/services/pipelines/planning-types.d.ts +4 -4
package/dist/services/repo/path.d.ts.map +1 -1
package/dist/services/repo/path.js +6 -1
package/dist/services/repo/path.js.map +1 -1
package/dist/services/setup/diagnose.d.ts +131 -0
package/dist/services/setup/diagnose.d.ts.map +1 -0
package/dist/services/setup/diagnose.js +435 -0
package/dist/services/setup/diagnose.js.map +1 -0
package/dist/services/setup/hint.d.ts +15 -0
package/dist/services/setup/hint.d.ts.map +1 -0
package/dist/services/setup/hint.js +21 -0
package/dist/services/setup/hint.js.map +1 -0
package/dist/services/workspace/join-helpers.d.ts +9 -0
package/dist/services/workspace/join-helpers.d.ts.map +1 -1
package/dist/services/workspace/join-helpers.js +20 -2
package/dist/services/workspace/join-helpers.js.map +1 -1
package/dist/services/workspace/join-operations.d.ts.map +1 -1
package/dist/services/workspace/join-operations.js +14 -2
package/dist/services/workspace/join-operations.js.map +1 -1
package/package.json +5 -3

package/README.md CHANGED

@@ -1,41 +1,153 @@
 # coalesce-transform-mcp
-MCP server for the [Coalesce](https://coalesce.io/) Transform API. Connect AI assistants like Claude, Cursor, or Windsurf to your Coalesce workspace to manage nodes, pipelines, environments, jobs, runs, and more.
+MCP server for [Coalesce](https://coalesce.io/). Connect AI assistants like Claude, Cursor, or Windsurf to Coalesce to manage nodes, pipelines, environments, jobs, and runs, and drive the local-first [`coa`](https://www.npmjs.com/package/@coalescesoftware/coa) CLI from the same server: validate a project, preview DDL/DML, plan a deployment, and apply it to a cloud environment. One install, two execution surfaces.
-## Quick Start
+- **Cloud REST tools** — build pipelines declaratively, edit node YAML, review lineage, run deployed jobs, audit documentation.
+- **Local COA CLI tools** — validate projects before check-in, preview generated DDL/DML (`--dry-run`), iterate on V2 `.sql` node files, run `plan → deploy → refresh` cycles. COA is bundled — no separate install.
-**1. Set your access token** in `~/.zshrc` or `~/.bashrc`:
+The two surfaces are orthogonal. Use both, one, or neither. Every destructive tool — on either surface — requires explicit confirmation before running. New? Run the `/coalesce-setup` prompt after install — it walks you through anything missing.
+## I want to…
+| Task | Jump to |
+| ---- | ------- |
+| Get running in 2 minutes | [Quick start](#quick-start) |
+| Authenticate (env var or `~/.coa/config`) | [Credentials](#credentials) |
+| Run against multiple Coalesce environments | [Multiple environments](#multiple-environments) |
+| Lock prod down to read-only | [Safety model](#safety-model) |
+| Use the `coa` CLI tools | [Using the COA CLI tools](#using-the-coa-cli-tools) |
+| Try a prerelease build | [Prerelease channel](#prerelease-channel) |
+| Debug "why isn't auth working?" | [Diagnosing setup](#diagnosing-setup) |
+| Customize agent behavior | [Context skills](#context-skills) |
+| Find a specific tool | [Tool reference](#tool-reference) |
+| Query warehouse data (add companion MCP) | [Companion MCPs](#companion-mcps) |
+## Quick start
+**Requirements:**
+- [Node.js](https://nodejs.org/) 22+
+- A [Coalesce](https://coalesce.io/) account with a workspace
+- An MCP-compatible AI client (Claude Code, Claude Desktop, Cursor, Windsurf)
+- Snowflake credentials — only if you plan to use run tools or `coa_create`/`coa_run` (see [Credentials](#credentials))
+- Install footprint is ~76 MB unpacked (the bundled `@coalescesoftware/coa` CLI ships its own runtime; the MCP tarball itself is under 1 MB)
+**1. Clone your project**
+If your team already has a Coalesce project in Git, clone it locally — the bundled `coa` CLI operates on a project directory, so most local create/run tools require one on disk:
 ```bash
-export COALESCE_ACCESS_TOKEN="your-token-here"
+git clone <your-coalesce-project-repo-url>
+cd my-project
+```
+**Don't have a Git-linked project yet?** In the Coalesce UI, open your workspace → **Settings → Git** and connect a repo (or create one via your Git provider and paste the URL). Coalesce will commit the project skeleton on first push; clone that repo locally once it's populated.
+A Coalesce project has this shape:
+```text
+my-project/
+├── data.yml                 # Root metadata (fileVersion, platformKind)
+├── locations.yml            # Storage location manifest
+├── nodes/                   # Pipeline nodes (.yml for V1, .sql for V2)
+├── nodeTypes/               # Node type definitions with templates
+├── environments/            # Environment configs with storage mappings
+├── macros/                  # Reusable SQL macros
+├── jobs/                    # Job definitions
+└── subgraphs/               # Subgraph definitions
 ```
-Generate a token from the Deploy tab in your Coalesce workspace ([docs](https://docs.coalesce.io/docs/api/authentication)).
+> **V1 vs V2** — the format is pinned by `fileVersion` in `data.yml`. **V1** (`fileVersion: 1` or `2`) stores each node as a single YAML file with columns, transforms, and config inline. **V2** (`fileVersion: 3`) is SQL-first: the node body lives in a `.sql` file using `@id` / `@nodeType` annotations and `{{ ref() }}` references, with YAML retained for config. New projects default to V2; existing V1 projects keep working unchanged.
+Point the MCP at this directory by setting `repoPath` in `~/.coa/config` or `COALESCE_REPO_PATH` in your env block.
+### Create `workspaces.yml`
+This file is **required** for `coa_create` / `coa_run` and their dry-run variants. It maps each storage location declared in `locations.yml` to a physical database + schema for local development. It's typically gitignored (per-developer), so cloning the project does not give it to you — you have to create it.
+The `/coalesce-setup` prompt detects a missing `workspaces.yml` and walks you through it. If you'd rather do it directly, pick one of:
+- **Let COA bootstrap it** (easiest): from the project root, run
+  ```bash
+  npx @coalescesoftware/coa doctor --fix
+  ```
+  Or from your MCP client, call the `coa_bootstrap_workspaces` tool (requires `confirmed: true`) which runs the same command.
-**2. Add to your MCP client config:**
+  > **⚠️ The generated file contains placeholder values.** `coa doctor --fix` seeds `database`/`schema` with defaults that won't match your real warehouse. Open the file and replace every placeholder before running `coa_create` / `coa_run` — otherwise the generated DDL/DML will target the wrong (or non-existent) database.
+- **Hand-write it.** Authoritative schema (from `coa describe schema workspaces` — no top-level wrapper, no `fileVersion`):
-| Client | Config file |
-| ------ | ----------- |
-| Claude Code | `.mcp.json` in project root (or `~/.claude.json` for global) |
-| Claude Desktop (macOS) | `~/Library/Application Support/Claude/claude_desktop_config.json` |
-| Cursor | `.cursor/mcp.json` in project root |
-| Windsurf | `~/.codeium/windsurf/mcp_config.json` |
+  ```yaml
+  # workspaces.yml — keys are workspace names; `dev` is the default if --workspace is omitted
+  dev:
+    connection: snowflake          # required — name of the connection block COA should use
+    locations:                     # optional — one entry per storage location name from locations.yml
+      SRC_INGEST_TASTY_BITES:
+        database: JESSE_DEV        # required
+        schema: INGEST_TASTY_BITES # required
+      ETL_STAGE:
+        database: JESSE_DEV
+        schema: ETL_STAGE
+      ANALYTICS:
+        database: JESSE_DEV
+        schema: ANALYTICS
+  ```
-**Claude Code** (`.mcp.json`):
+Verify with `coa_doctor` (or `npx @coalescesoftware/coa doctor`) — it checks `data.yml`, `workspaces.yml`, credentials, and warehouse connectivity end to end.
+**2. Pick an auth path:**
+- **Option A — env var** (simplest for first-time MCP users). Generate a `COALESCE_ACCESS_TOKEN` from Coalesce → Deploy → User Settings.
+- **Option B — reuse `~/.coa/config`** (best if you already use the `coa` CLI). The server reads the same file — nothing to duplicate. Skip to step 3 and drop the `env` block below. See [Credentials](#credentials) for the schema.
+When both sources set a field, the env var wins.
+**3. Add the server to your MCP client config.** Pick your client below and paste the block into the indicated file. Replace `<YOUR_TOKEN>` with a real token only if your client does not support env var substitution (noted per client).
+#### Claude Code
+File: `.mcp.json` in project root (or `~/.claude.json` for global).
 ```json
 {
-  "coalesce-transform": {
-    "command": "npx",
-    "args": ["coalesce-transform-mcp"],
-    "env": {
-      "COALESCE_ACCESS_TOKEN": "${COALESCE_ACCESS_TOKEN}"
+  "mcpServers": {
+    "coalesce-transform": {
+      "command": "npx",
+      "args": ["coalesce-transform-mcp"],
+      "env": {
+        "COALESCE_ACCESS_TOKEN": "${COALESCE_ACCESS_TOKEN}"
+      }
+    }
+  }
+}
+```
+Claude Code expands `${VAR}` from your shell env at load time. Omit the `env` block entirely if you're using `~/.coa/config` (Option B).
+#### Claude Desktop
+File: `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS) or `%APPDATA%\Claude\claude_desktop_config.json` (Windows).
+```json
+{
+  "mcpServers": {
+    "coalesce-transform": {
+      "command": "npx",
+      "args": ["coalesce-transform-mcp"],
+      "env": {
+        "COALESCE_ACCESS_TOKEN": "<YOUR_TOKEN>"
+      }
     }
   }
 }
 ```
-**Claude Desktop, Cursor, Windsurf** — same thing, wrapped in `"mcpServers"`:
+Claude Desktop does **not** expand `${VAR}` — paste the literal token, or drop the `env` block and use `~/.coa/config` (Option B) so nothing sensitive lives in this file.
+#### Cursor
+File: `.cursor/mcp.json` in project root (or `~/.cursor/mcp.json` for global).
 ```json
 {
@@ -44,36 +156,86 @@ Generate a token from the Deploy tab in your Coalesce workspace ([docs](https://
       "command": "npx",
       "args": ["coalesce-transform-mcp"],
       "env": {
-        "COALESCE_ACCESS_TOKEN": "${COALESCE_ACCESS_TOKEN}"
+        "COALESCE_ACCESS_TOKEN": "<YOUR_TOKEN>"
       }
     }
   }
 }
 ```
-The server defaults to the US region. See [Environment Variables](#environment-variables) if you need to change the region, enable run tools, or configure repo-backed features.
+Cursor does **not** expand `${VAR}` — paste the literal token, or drop the `env` block and use `~/.coa/config` (Option B).
+#### Windsurf
+File: `~/.codeium/windsurf/mcp_config.json`.
+```json
+{
+  "mcpServers": {
+    "coalesce-transform": {
+      "command": "npx",
+      "args": ["coalesce-transform-mcp"],
+      "env": {
+        "COALESCE_ACCESS_TOKEN": "<YOUR_TOKEN>"
+      }
+    }
+  }
+}
+```
+Windsurf does **not** expand `${VAR}` — paste the literal token, or drop the `env` block and use `~/.coa/config` (Option B).
+**4. Restart your client**, then run the `/coalesce-setup` prompt to verify everything is wired up.
+> **Never hardcode credentials in git-tracked config files.** Only Claude Code's `.mcp.json` expands `${VAR}` from your shell env. For any other client, keep secrets in `~/.coa/config` (Option B) or a secrets manager your client integrates with — don't commit literals into these JSON files.
-> **Never hardcode credentials in config files tracked by git.** The `${VAR}` syntax pulls values from your shell environment.
+If you have more than one Coalesce environment to manage, see [Multiple environments](#multiple-environments).
-## Requirements
+## Configuration
-- [Node.js](https://nodejs.org/) >= 22.0.0
-- A [Coalesce](https://coalesce.io/) account with a workspace and access token
-- An MCP-compatible AI client
-- **For run tools only:** Snowflake key pair authentication (see below)
+### Credentials
-## Environment Variables
+The server reads credentials from two sources and merges them with **env-wins precedence** — a matching env var always overrides the profile value, so you can pin a single field per session without editing the config file. Call `diagnose_setup` to see which source supplied each value.
-Only `COALESCE_ACCESS_TOKEN` is required. Everything else is optional.
+#### Source 1: `~/.coa/config` (shared with the `coa` CLI)
+COA stores credentials in a standard INI file. You create it by hand, or let `coa` write it as you use the CLI. The MCP reads the profile selected by `COALESCE_PROFILE` (default `[default]`) and maps the keys below onto their matching env vars.
+```ini
+[default]
+token=<your-coalesce-refresh-token>
+domain=https://your-org.app.coalescesoftware.io
+snowflakeAccount=<your-snowflake-account>   # e.g., abc12345.us-east-1 — required by coa CLI
+snowflakeUsername=YOUR_USER
+snowflakeRole=YOUR_ROLE
+snowflakeWarehouse=YOUR_WAREHOUSE
+snowflakeKeyPairKey=/Users/you/.coa/rsa_key.p8   # see deprecation note below
+snowflakeAuthType=KeyPair
+orgID=<your-org-id>              # optional; fallback for cancel-run
+repoPath=/Users/you/path/to/repo # optional; for repo-backed tools
+cacheDir=/Users/you/.coa/cache   # optional; per-profile cache isolation
+[staging]
+# …additional profiles; select with COALESCE_PROFILE
+```
+> **`snowflakeKeyPairKey` deprecation loop (known quirk).** The `coa` CLI currently emits a deprecation warning on `snowflakeKeyPairKey` and points you at `snowflakeKeyPairPath`, but `snowflakeKeyPairPath` does not yet accept a file path value. Until the upstream fix ships, keep using `snowflakeKeyPairKey=` (the name shown in `coa describe config`) — the deprecation warning is harmless.
+Key mapping: `token` ↔ `COALESCE_ACCESS_TOKEN`, `domain` ↔ `COALESCE_BASE_URL`, each `snowflake*` key ↔ its corresponding `SNOWFLAKE_*` env var, `orgID` ↔ `COALESCE_ORG_ID`, `repoPath` ↔ `COALESCE_REPO_PATH`, `cacheDir` ↔ `COALESCE_CACHE_DIR`. `snowflakeAuthType` is read by COA itself (not mapped to an env var) — include it when you're using key-pair auth. `orgID`, `repoPath`, and `cacheDir` are MCP-specific (the COA CLI ignores them). Only the fields the MCP needs are shown above — COA's config supports many more (run `npx @coalescesoftware/coa describe config` for the authoritative reference). Unknown keys are ignored.
+If `~/.coa/config` doesn't exist the server runs env-only — startup never fails on a missing or malformed profile file; it just logs a stderr warning.
+#### Source 2: env vars in your MCP config
 <!-- ENV_METADATA_CORE_TABLE_START -->
 | Variable | Description | Default |
 | -------- | -------- | -------- |
-| `COALESCE_ACCESS_TOKEN` | **Required.** Bearer token from the Coalesce Deploy tab. | — |
+| `COALESCE_ACCESS_TOKEN` | Bearer token from the Coalesce Deploy tab. Optional when `~/.coa/config` provides a `token`. | — |
+| `COALESCE_PROFILE` | Selects which `~/.coa/config` profile to load. | `default` |
 | `COALESCE_BASE_URL` | Region-specific base URL. | `https://app.coalescesoftware.io (US)` |
-| `COALESCE_ORG_ID` | Fallback org ID for cancel-run. | — |
-| `COALESCE_REPO_PATH` | Local repo root for repo-backed tools and pipeline planning. | — |
-| `COALESCE_CACHE_DIR` | Base directory for the local data cache. When set, cache files are written here instead of the working directory. | — |
+| `COALESCE_ORG_ID` | Fallback org ID for cancel-run. Also readable from `orgID` in the active ~/.coa/config profile. | — |
+| `COALESCE_REPO_PATH` | Local repo root for repo-backed tools and pipeline planning. Also readable from `repoPath` in the active ~/.coa/config profile. | — |
+| `COALESCE_CACHE_DIR` | Base directory for the local data cache. When set, cache files are written here instead of the working directory. Also readable from `cacheDir` in the active ~/.coa/config profile. | — |
 | `COALESCE_MCP_AUTO_CACHE_MAX_BYTES` | JSON size threshold before auto-caching to disk. | `32768` |
 | `COALESCE_MCP_LINEAGE_TTL_MS` | In-memory lineage cache TTL in milliseconds. | `1800000` |
 | `COALESCE_MCP_MAX_REQUEST_BODY_BYTES` | Max outbound API request body size. | `524288` |
@@ -81,13 +243,14 @@ Only `COALESCE_ACCESS_TOKEN` is required. Everything else is optional.
 | `COALESCE_MCP_SKILLS_DIR` | Directory for customizable AI skill resources. When set, reads context resources from this directory and seeds defaults on first run. Users can augment or override any skill. | — |
 <!-- ENV_METADATA_CORE_TABLE_END -->
-### Snowflake (for run tools only)
+#### Snowflake credentials (run tools only)
-Required for `start_run`, `retry_run`, `run_and_wait`, and `retry_and_wait`. The server starts without them — they're validated when you first use a run tool.
+`start_run`, `retry_run`, `run_and_wait`, `retry_and_wait`, and the warehouse-touching COA tools (`coa_create`, `coa_run`) need Snowflake credentials. These normally come from `~/.coa/config`. Override any field via env var:
 <!-- ENV_METADATA_SNOWFLAKE_TABLE_START -->
 | Variable | Required | Description |
 | -------- | -------- | -------- |
+| `SNOWFLAKE_ACCOUNT` | Yes | Snowflake account identifier (e.g., `abc12345.us-east-1`). Required by the local `coa` CLI and `coa doctor`; not used by the MCP's REST run path. |
 | `SNOWFLAKE_USERNAME` | Yes | Snowflake account username |
 | `SNOWFLAKE_KEY_PAIR_KEY` | No | Path to PEM-encoded private key (required if SNOWFLAKE_PAT not set) |
 | `SNOWFLAKE_PAT` | No | Snowflake Programmatic Access Token (alternative to key pair) |
@@ -96,23 +259,11 @@ Required for `start_run`, `retry_run`, `run_and_wait`, and `retry_and_wait`. The
 | `SNOWFLAKE_ROLE` | Yes | Snowflake user role |
 <!-- ENV_METADATA_SNOWFLAKE_TABLE_END -->
-To use optional variables, add them to your shell profile and pass them through in your MCP config. Here's a full example with everything enabled:
-**`~/.zshrc`:**
+"Required" means one of env OR the matching `~/.coa/config` field must supply the value. **`SNOWFLAKE_PAT` is env-only** — COA's config uses `snowflakePassword` for Basic auth (a different concept), which this server deliberately doesn't read.
-```bash
-export COALESCE_ACCESS_TOKEN="your-token-here"
-export COALESCE_BASE_URL="https://app.eu.coalescesoftware.io"
-export COALESCE_REPO_PATH="/path/to/local/coalesce-repo"
-export SNOWFLAKE_USERNAME="your-username"
-export SNOWFLAKE_KEY_PAIR_KEY="/path/to/snowflake_key.pem"   # Option A: Key Pair
-export SNOWFLAKE_KEY_PAIR_PASS="your-passphrase"              # (only if key is encrypted)
-export SNOWFLAKE_PAT="your-programmatic-access-token"         # Option B: PAT (if both set, Key Pair wins)
-export SNOWFLAKE_WAREHOUSE="your-warehouse"
-export SNOWFLAKE_ROLE="your-role"
-```
+#### Field-level overrides
-**`.mcp.json`:**
+To pin a profile but override one field without editing the config file:
 ```json
 {
@@ -120,49 +271,144 @@ export SNOWFLAKE_ROLE="your-role"
     "command": "npx",
     "args": ["coalesce-transform-mcp"],
     "env": {
-      "COALESCE_ACCESS_TOKEN": "${COALESCE_ACCESS_TOKEN}",
-      "COALESCE_BASE_URL": "${COALESCE_BASE_URL}",
-      "COALESCE_REPO_PATH": "${COALESCE_REPO_PATH}",
-      "SNOWFLAKE_USERNAME": "${SNOWFLAKE_USERNAME}",
-      "SNOWFLAKE_KEY_PAIR_KEY": "${SNOWFLAKE_KEY_PAIR_KEY}",
-      "SNOWFLAKE_KEY_PAIR_PASS": "${SNOWFLAKE_KEY_PAIR_PASS}",
-      "SNOWFLAKE_PAT": "${SNOWFLAKE_PAT}",
-      "SNOWFLAKE_WAREHOUSE": "${SNOWFLAKE_WAREHOUSE}",
-      "SNOWFLAKE_ROLE": "${SNOWFLAKE_ROLE}"
+      "COALESCE_PROFILE": "staging",
+      "SNOWFLAKE_ROLE": "TRANSFORMER_ADMIN"
     }
   }
 }
 ```
-Only include the variables you need — the Quick Start config with just `COALESCE_ACCESS_TOKEN` is enough to get started.
+Reads: "use the `[staging]` profile, but override its `snowflakeRole`."
-## Skills Directory
+### Multiple environments
-The server ships 23 AI skill resources that guide how agents interact with Coalesce. By default these are bundled inside the package, but you can make them visible and editable by setting `COALESCE_MCP_SKILLS_DIR`.
+If you work across several Coalesce environments (dev/staging/prod, or multiple orgs), register the package once per profile under distinct server names:
-### Setup
+```json
+{
+  "mcpServers": {
+    "coalesce-prod": {
+      "command": "npx",
+      "args": ["coalesce-transform-mcp"],
+      "env": {
+        "COALESCE_PROFILE": "prod",
+        "COALESCE_MCP_READ_ONLY": "true"
+      }
+    },
+    "coalesce-dev": {
+      "command": "npx",
+      "args": ["coalesce-transform-mcp"],
+      "env": { "COALESCE_PROFILE": "dev" }
+    }
+  }
+}
+```
+Why this pattern:
+- **Namespaced tools.** The client surfaces `coalesce-prod__*` vs `coalesce-dev__*`, so an agent can't accidentally mutate the wrong environment.
+- **Per-environment safety.** Pair prod with `COALESCE_MCP_READ_ONLY=true` to hide every write tool on that server while leaving dev fully writable.
+- **No per-call profile juggling.** Each server is pinned at startup.
+Skip this pattern if you only use one environment — a single registration is simpler. For 2–3 environments it's worth the extra config; beyond that, each server is a separate Node process, so consider whether you actually need them all loaded at once.
+### Safety model
+Three layers prevent destructive surprises:
+1. **Tool annotations.** Every tool carries MCP annotations (`readOnlyHint`, `destructiveHint`, `idempotentHint`). Clients that respect them can filter proactively. The ⚠️ marker in [Tool reference](#tool-reference) marks `destructiveHint: true` tools.
+2. **`COALESCE_MCP_READ_ONLY=true`** hides all write/mutation tools at server startup. Only read, list, search, cache, analyze, review, diagnose, and plan tools are registered. Use it for audits, agent sandboxes, or pairing with a prod profile (see [Multiple environments](#multiple-environments)).
+3. **Explicit confirmation for destructive ops.** Tools marked destructive require `confirmed: true`. When the MCP client supports elicitation, the server prompts interactively; otherwise it returns a `STOP_AND_CONFIRM` response the agent must surface before retrying with `confirmed: true`. Applies to: `delete_*`, `propagate_column_change`, `cancel_run`, `clear_data_cache`, `coa_create`, `coa_run`, `coa_deploy`, `coa_refresh`.
+**COA preflight.** Local COA write tools run preflight validation before shelling out. Errors block execution; warnings pass through in the tool response as `preflightWarnings` so agents can surface them.
+| Code | Level | What it catches |
+| ---- | ----- | --------------- |
+| `SQL_DOUBLE_QUOTED_REF` | error | `.sql` nodes using `ref("…")` — silently returns `UNKNOWN` columns; must be single-quoted |
+| `WORKSPACES_YML_MISSING` | error | `workspaces.yml` not in project root — required for local create/run |
+| `SELECTOR_COMBINED_OR` | error | `{ A \|\| B }` selector form — matches zero nodes; must be `{ A } \|\| { B }` |
+| `SQL_LITERAL_UNION_ALL` | warning | Literal `UNION ALL` in a V2 `.sql` node — silently dropped by the V2 parser; use `insertStrategy: UNION ALL` instead |
+| `DATA_YML_UNEXPECTED_FILEVERSION` | warning | `data.yml` missing or not `fileVersion: 3` |
+| `DATA_YML_NO_FILEVERSION` | warning | `data.yml` has no `fileVersion` field |
+### Using the COA CLI tools
+COA is bundled — no extra install. Usage notes:
+- **Local commands** (`coa_doctor`, `coa_validate`, `coa_dry_run_create`, `coa_dry_run_run`, `coa_create`, `coa_run`, `coa_plan`) need a COA project directory (one that contains `data.yml`). Pass the path via the `projectPath` tool argument.
+- **Cloud commands** (`coa_list_environments`, `coa_list_environment_nodes`, `coa_list_runs`, `coa_deploy`, `coa_refresh`) read credentials from `~/.coa/config` — the same file the MCP uses. Populate it once and both surfaces agree.
+- **Profile resolution.** Cloud tools accept an optional `profile` arg. When omitted, they fall back to `COALESCE_PROFILE`, then to COA's own `[default]` — so you don't have to pass it on every call.
+- **Warehouse-touching commands** (`coa_create`, `coa_run`) need a valid `workspaces.yml` in the project root with storage-location mappings. Preflight catches a missing file before execution.
+### Prerelease channel
+Prerelease builds publish to `@alpha` while `@latest` stays on stable. Point `npx` at the alpha channel:
+```json
+{
+  "coalesce-transform": {
+    "command": "npx",
+    "args": ["coalesce-transform-mcp@alpha"]
+  }
+}
+```
+Restart your MCP client after changing the config so `npx` re-resolves. To pin an exact prerelease rather than whatever `@alpha` resolves to today, replace `@alpha` with the full version, e.g. `coalesce-transform-mcp@0.5.0-alpha.2`. If `npx` serves a stale cached copy when `@alpha` advances, force a fresh fetch with `npx -y coalesce-transform-mcp@alpha`.
+To run alpha and stable side-by-side, register both under different server names (e.g. `coalesce-transform` for stable and `coalesce-transform-alpha` for the prerelease).
+### Diagnosing setup
+`diagnose_setup` is a stateless probe that reports which first-time-setup pieces are configured: access token, Snowflake credentials, `~/.coa/config` profile, local repo path, and a best-effort `coa doctor` check. It returns a structured report plus ordered `nextSteps` and per-field `source` markers (`env`, `profile:<name>`, or `missing`).
+It pairs with the `/coalesce-setup` MCP prompt, which walks a user through any remaining gaps. Run it any time something isn't working the way you expect.
+## Companion MCPs
+This server manages Coalesce node definitions — **not** live warehouse data. For Snowflake data questions (tables, schemas, row counts, sample data, permissions), add [Cortex Code](https://ai.snowflake.com) as a companion MCP server. The agent will route Snowflake questions to cortex and node/pipeline questions to Coalesce.
+```bash
+curl -LsS https://ai.snowflake.com/static/cc-scripts/install.sh | sh
+cortex connections  # interactive connection setup
+```
+```json
+{
+  "cortex": {
+    "command": "cortex",
+    "args": ["--mcp-server"]
+  }
+}
+```
+## Resources
+Resources are read-only context documents exposed via MCP that clients can pull into their prompts on demand. They come in two families: context skills and COA describe topics.
+### Context skills
+24 curated markdown resources under `coalesce://context/*` guide how agents interact with the server — SQL conventions per warehouse, node-type selection, pipeline workflows, lineage/impact guidance. Set `COALESCE_MCP_SKILLS_DIR` to make them editable on disk:
 ```bash
 export COALESCE_MCP_SKILLS_DIR="/path/to/my-skills"
 ```
-On first run, the server seeds the directory with 46 files:
+On first run the server seeds the directory with two files per skill:
 - `coalesce_skills.<name>.md` — the default skill content (editable)
 - `user_skills.<name>.md` — your customization file (starts as an inactive stub with instructions)
-### How customization works
 Each resource resolves using this priority:
 1. **Override** — `user_skills.<name>.md` starts with `<!-- OVERRIDE -->` → only the user file is served
 2. **Augment** — `user_skills.<name>.md` has custom content (remove the `<!-- STUB -->` line first) → default + user content are concatenated
 3. **Default** — `user_skills.<name>.md` is missing, empty, or still has the seeded stub → default skill content is served
-4. **Disabled** — both files deleted → empty content is served (effectively disabled)
+4. **Disabled** — both files deleted → empty content is served
 Seeding is idempotent — it never overwrites files you've already modified.
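The four-step resolution priority can be sketched as follows. This is an illustration under stated assumptions, not the server's implementation: `resolveSkill` is a hypothetical name, `null` stands for a deleted file, the blank-line separator used when concatenating is assumed, and the case where only the default file is deleted is not covered by the list above (this sketch serves the user file alone).

```typescript
const OVERRIDE_MARKER = "<!-- OVERRIDE -->";
const STUB_MARKER = "<!-- STUB -->";

// `null` means the file does not exist on disk.
function resolveSkill(defaultContent: string | null, userContent: string | null): string {
  // 4. Disabled: both files deleted, so empty content is served.
  if (defaultContent === null && userContent === null) return "";
  const user = userContent ?? "";
  // 1. Override: the user file starts with the override marker.
  if (user.startsWith(OVERRIDE_MARKER)) return user;
  // 2. Augment: the user file has real content and the stub line was removed.
  const hasCustom = user.trim() !== "" && !user.includes(STUB_MARKER);
  if (hasCustom) {
    // Assumption: default and user content are joined with a blank line.
    return defaultContent === null ? user : `${defaultContent}\n\n${user}`;
  }
  // 3. Default: user file missing, empty, or still the seeded stub.
  return defaultContent ?? "";
}
```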
-### Available Skills
+<details>
+<summary><strong>All context skills (24)</strong></summary>
 | Skill | File | Description |
 | ----- | ---- | ----------- |
@@ -189,204 +435,227 @@ Seeding is idempotent — it never overwrites files you've already modified.
 | Run Diagnostics Guide | `run-diagnostics-guide` | Using `diagnose_run_failure` to analyze failed runs and determine fixes |
 | Pipeline Review Guide | `pipeline-review-guide` | Using `review_pipeline` for pipeline analysis and optimization |
 | Pipeline Workshop Guide | `pipeline-workshop-guide` | Using pipeline workshop tools for iterative, conversational pipeline building |
+| Ecosystem Boundaries | `ecosystem-boundaries` | Scope of this MCP vs adjacent data engineering MCPs (Snowflake, Fivetran, dbt, Catalog) |
-## Tool Reference
-⚠️ = Destructive operation
+</details>
-### API Tools
+### COA describe topics
-Coalesce Platform Tools: manage workspaces, environments, projects, runs, and other platform resources.
+10 resources under `coalesce://coa/describe/*` surface the bundled COA CLI's self-describing documentation. Content is fetched from `coa describe <topic>` on first access and cached to disk, keyed by the pinned COA version — agents always see docs that match the CLI they're driving. Topics: `overview`, `commands`, `selectors`, `schemas`, `workflow`, `structure`, `concepts`, `sql-format`, `node-types`, `config`.
-#### Environments
+For parameterized topics (`command <name>`, `schema <type>`), use the `coa_describe` tool with a `subtopic` argument.
-- `list_environments` - List all available environments
-- `get_environment` - Get details of a specific environment
-- `create_environment` - Create a new environment within a project
-- `delete_environment` - Delete an environment ⚠️
+## Tool reference
-#### Workspaces
+⚠️ = Destructive (requires `confirmed: true`). 🧰 = Runs bundled `coa` CLI.
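As a rough sketch of what the `confirmed: true` requirement implies for callers, a guard along these lines would reject an unconfirmed destructive call. `assertConfirmed` and `ToolArgs` are hypothetical names, and whether the real server throws or returns a confirmation request instead is not stated here.

```typescript
// Hypothetical argument shape: any tool arguments plus an optional flag.
interface ToolArgs {
  confirmed?: boolean;
  [key: string]: unknown;
}

// Throws unless the caller explicitly passed `confirmed: true`.
function assertConfirmed(toolName: string, args: ToolArgs): void {
  if (args.confirmed !== true) {
    throw new Error(`${toolName} is destructive; pass confirmed: true to proceed`);
  }
}
```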
-- `list_workspaces` - List all workspaces
-- `get_workspace` - Get details of a specific workspace
+<details>
+<summary><strong>Cloud REST tools (49)</strong> — Coalesce platform resources via the Deploy API</summary>
-#### Nodes
+### Environments
-- `list_environment_nodes` - List nodes in an environment
-- `list_workspace_nodes` - List nodes in a workspace
-- `get_environment_node` - Get a specific environment node
-- `get_workspace_node` - Get a specific workspace node
-- `set_workspace_node` - Replace a workspace node with a full body
-- `update_workspace_node` - Safely update selected fields of a workspace node
-- `delete_workspace_node` - Delete a node from a workspace ⚠️
+- `list_environments` — List all available environments
+- `get_environment` — Get details of a specific environment
+- `create_environment` — Create a new environment within a project
+- `delete_environment` — Delete an environment ⚠️
-#### Jobs
+### Workspaces
-- `list_environment_jobs` - List all jobs for an environment
-- `create_workspace_job` - Create a job in a workspace with node include/exclude selectors
-- `get_environment_job` - Get details of a specific job (via environment)
-- `update_workspace_job` - Update a job's name and node selectors
-- `delete_workspace_job` - Delete a job ⚠️
+- `list_workspaces` — List all workspaces
+- `get_workspace` — Get details of a specific workspace
-#### Subgraphs
+### Nodes
-- `list_workspace_subgraphs` - List subgraphs in a workspace
-- `get_workspace_subgraph` - Get details of a specific subgraph
-- `create_workspace_subgraph` - Create a subgraph to group nodes visually
-- `update_workspace_subgraph` - Update a subgraph's name and node membership
-- `delete_workspace_subgraph` - Delete a subgraph (nodes are NOT deleted) ⚠️
+- `list_environment_nodes` — List nodes in an environment
+- `list_workspace_nodes` — List nodes in a workspace
+- `get_environment_node` — Get a specific environment node
+- `get_workspace_node` — Get a specific workspace node
+- `set_workspace_node` — Replace a workspace node with a full body
+- `update_workspace_node` — Safely update selected fields of a workspace node
+- `delete_workspace_node` — Delete a node from a workspace ⚠️
-#### Runs
+### Jobs
-- `diagnose_run_failure` - Diagnose a failed run with error classification, root-cause analysis, and actionable fix suggestions
-- `list_runs` - List runs with optional filters
-- `get_run` - Get details of a specific run
-- `get_run_results` - Get results of a completed run
-- `start_run` - Start a new run; requires Snowflake auth (Key Pair or PAT, credentials from env vars)
-- `run_status` - Check status of a running job
-- `retry_run` - Retry a failed run; requires Snowflake auth (Key Pair or PAT, credentials from env vars)
-- `cancel_run` - Cancel a running job (requires `runID` and `environmentID`; `orgID` may come from `COALESCE_ORG_ID`) ⚠️
+- `list_environment_jobs` — List all jobs for an environment
+- `create_workspace_job` — Create a job in a workspace with node include/exclude selectors
+- `get_environment_job` — Get details of a specific job (via environment)
+- `update_workspace_job` — Update a job's name and node selectors
+- `delete_workspace_job` — Delete a job ⚠️
-#### Projects
+### Subgraphs
-- `list_projects` - List all projects
-- `get_project` - Get project details
-- `create_project` - Create a new project
-- `update_project` - Update a project
-- `delete_project` - Delete a project ⚠️
+- `list_workspace_subgraphs` — List subgraphs in a workspace
+- `get_workspace_subgraph` — Get details of a specific subgraph
+- `create_workspace_subgraph` — Create a subgraph to group nodes visually
+- `update_workspace_subgraph` — Update a subgraph's name and node membership
+- `delete_workspace_subgraph` — Delete a subgraph (nodes are NOT deleted) ⚠️
-#### Git Accounts
+### Runs
-- `list_git_accounts` - List all git accounts
-- `get_git_account` - Get git account details
-- `create_git_account` - Create a new git account
-- `update_git_account` - Update a git account
-- `delete_git_account` - Delete a git account ⚠️
+- `diagnose_run_failure` — Diagnose a failed run with error classification, root-cause analysis, and actionable fix suggestions
+- `list_runs` — List runs with optional filters
+- `get_run` — Get details of a specific run
+- `get_run_results` — Get results of a completed run
+- `start_run` — Start a new run; requires Snowflake auth (Key Pair or PAT, credentials from env vars)
+- `run_status` — Check status of a running job
+- `retry_run` — Retry a failed run; requires Snowflake auth (Key Pair or PAT, credentials from env vars)
+- `cancel_run` — Cancel a running job (requires `runID` and `environmentID`; `orgID` may come from `COALESCE_ORG_ID` or the `orgID` field in your `~/.coa/config` profile) ⚠️
-#### Users
+### Projects
-- `list_org_users` - List all organization users
-- `get_user_roles` - Get roles for a specific user
-- `list_user_roles` - List all user roles
-- `set_org_role` - Set organization role for a user
-- `set_project_role` - Set project role for a user
-- `delete_project_role` - Remove project role from a user ⚠️
-- `set_env_role` - Set environment role for a user
-- `delete_env_role` - Remove environment role from a user ⚠️
+- `list_projects` — List all projects
+- `get_project` — Get project details
+- `create_project` — Create a new project
+- `update_project` — Update a project
+- `delete_project` — Delete a project ⚠️
-### Intelligent Tools
+### Git Accounts
-Custom logic built on top of the API: pipeline planning, config completion, join analysis, workspace analysis, and more.
+- `list_git_accounts` — List all git accounts
+- `get_git_account` — Get git account details
+- `create_git_account` — Create a new git account
+- `update_git_account` — Update a git account
+- `delete_git_account` — Delete a git account ⚠️
-#### Node Creation and Configuration
+### Users and roles
-- `create_workspace_node_from_scratch` - Create a workspace node with no predecessors, apply fields to the requested completion level, and run automatic config completion
-- `create_workspace_node_from_predecessor` - Create a node from predecessor nodes, verify column coverage, suggest join columns, and run automatic config completion
-- `replace_workspace_node_columns` - Replace `metadata.columns` wholesale and optionally apply additional changes for complex column rewrites
-- `convert_join_to_aggregation` - Convert a join-style node into an aggregated fact-style node with generated JOIN/GROUP BY analysis
-- `apply_join_condition` - Auto-generate and write a FROM/JOIN/ON clause for a multi-predecessor node
-- `create_node_from_external_schema` - Create a workspace node whose columns match an existing warehouse table or external schema
-- `complete_node_configuration` - Intelligently complete a node's configuration by analyzing context and applying best-practice rules
-- `list_workspace_node_types` - List distinct node types observed in current workspace nodes
-- `analyze_workspace_patterns` - Analyze workspace nodes to detect package adoption, pipeline layers, methodology, and generate recommendations
+- `list_org_users` — List all organization users
+- `get_user_roles` — Get roles for a specific user
+- `list_user_roles` — List all user roles
+- `set_org_role` — Set organization role for a user
+- `set_project_role` — Set project role for a user
+- `delete_project_role` — Remove project role from a user ⚠️
+- `set_env_role` — Set environment role for a user
+- `delete_env_role` — Remove environment role from a user ⚠️
-#### Pipeline Planning and Execution
+</details>
-- `plan_pipeline` - Plan a pipeline from SQL or a natural-language goal without mutating the workspace; ranks best-fit node types from the local repo
-- `create_pipeline_from_plan` - Execute an approved pipeline plan using predecessor-based creation
-- `create_pipeline_from_sql` - Plan and create a pipeline directly from SQL
-- `build_pipeline_from_intent` - Build a pipeline from a natural language goal with automatic entity resolution and node type selection
-- `review_pipeline` - Analyze an existing pipeline for redundant nodes, missing joins, layer violations, naming issues, and optimization opportunities
-- `parse_sql_structure` - Parse a SQL statement into structural components (CTEs, source tables, projected columns) without touching the workspace
-- `select_pipeline_node_type` - Rank and select the best Coalesce node type for a pipeline step using the deliberative selection loop against repo or workspace-observed types
+<details>
+<summary><strong>Intelligent tools (46)</strong> — pipeline planning, config completion, join analysis, lineage</summary>
-#### Pipeline Workshop
+### Node creation and configuration
-- `pipeline_workshop_open` - Open an iterative pipeline builder session with workspace context pre-loaded
-- `pipeline_workshop_instruct` - Send a natural language instruction to modify the current workshop plan
-- `get_pipeline_workshop_status` - Get the current state of a workshop session
-- `pipeline_workshop_close` - Close a workshop session and release resources
+- `create_workspace_node_from_scratch` — Create a workspace node with no predecessors, apply fields to the requested completion level, and run automatic config completion
+- `create_workspace_node_from_predecessor` — Create a node from predecessor nodes, verify column coverage, suggest join columns, and run automatic config completion
+- `replace_workspace_node_columns` — Replace `metadata.columns` wholesale and optionally apply additional changes for complex column rewrites
+- `convert_join_to_aggregation` — Convert a join-style node into an aggregated fact-style node with generated JOIN/GROUP BY analysis
+- `apply_join_condition` — Auto-generate and write a FROM/JOIN/ON clause for a multi-predecessor node
+- `create_node_from_external_schema` — Create a workspace node whose columns match an existing warehouse table or external schema
+- `complete_node_configuration` — Intelligently complete a node's configuration by analyzing context and applying best-practice rules
+- `list_workspace_node_types` — List distinct node types observed in current workspace nodes
+- `analyze_workspace_patterns` — Analyze workspace nodes to detect package adoption, pipeline layers, methodology, and generate recommendations
-#### Repo-Backed Node Types and Templates
+### Pipeline planning and execution
-- `list_repo_packages` - Inspect a committed local Coalesce repo and list package aliases plus enabled node-type coverage from `packages/*.yml`
-- `list_repo_node_types` - List exact resolvable committed node-type identifiers from `nodeTypes/`, optionally scoped to one package alias or currently in-use types
-- `get_repo_node_type_definition` - Resolve one exact committed node type from a local repo and return its outer definition plus raw and parsed `metadata.nodeMetadataSpec`
-- `generate_set_workspace_node_template` - Generate a YAML-friendly `set_workspace_node` body template from either a raw definition object or an exact committed repo definition resolved by `repoPath` or `COALESCE_REPO_PATH`
+- `plan_pipeline` — Plan a pipeline from SQL or a natural-language goal without mutating the workspace; ranks best-fit node types from the local repo
+- `create_pipeline_from_plan` — Execute an approved pipeline plan using predecessor-based creation
+- `create_pipeline_from_sql` — Plan and create a pipeline directly from SQL
+- `build_pipeline_from_intent` — Build a pipeline from a natural language goal with automatic entity resolution and node type selection
+- `review_pipeline` — Analyze an existing pipeline for redundant nodes, missing joins, layer violations, naming issues, and optimization opportunities
+- `parse_sql_structure` — Parse a SQL statement into structural components (CTEs, source tables, projected columns) without touching the workspace
+- `select_pipeline_node_type` — Rank and select the best Coalesce node type for a pipeline step using the deliberative selection loop against repo or workspace-observed types
+### Pipeline workshop
+- `pipeline_workshop_open` — Open an iterative pipeline builder session with workspace context pre-loaded
+- `pipeline_workshop_instruct` — Send a natural language instruction to modify the current workshop plan
+- `get_pipeline_workshop_status` — Get the current state of a workshop session
+- `pipeline_workshop_close` — Close a workshop session and release resources
+### Repo-backed node types and templates
+- `list_repo_packages` — Inspect a committed local Coalesce repo and list package aliases plus enabled node-type coverage
+- `list_repo_node_types` — List exact resolvable committed node-type identifiers from `nodeTypes/`
+- `get_repo_node_type_definition` — Resolve one exact committed node type and return its outer definition plus parsed `nodeMetadataSpec`
+- `generate_set_workspace_node_template` — Generate a YAML-friendly `set_workspace_node` body template from a definition object or committed repo definition
+- `search_node_type_variants` — Search the committed node-type corpus by normalized family, package, primitive, or support status
+- `get_node_type_variant` — Load one exact node-type corpus variant by variant key
+- `generate_set_workspace_node_template_from_variant` — Generate a `set_workspace_node` body template from a committed corpus variant
+### Lineage and impact
+- `get_upstream_nodes` — Walk the full upstream dependency graph for a node
+- `get_downstream_nodes` — Walk the full downstream dependency graph for a node
+- `get_column_lineage` — Trace a column through the pipeline upstream and downstream via column-level references
+- `analyze_impact` — Analyze downstream impact of changing a node or specific column — returns impacted counts, grouped by depth, and critical path
+- `propagate_column_change` — Update all downstream columns after a column rename or data type change ⚠️
+- `search_workspace_content` — Search across node SQL, column names, descriptions, and config values using the lineage cache as a searchable index
+- `audit_documentation_coverage` — Scan all workspace nodes and columns for missing descriptions and report coverage statistics
+### Cache and snapshots
+- `cache_workspace_nodes` — Fetch every page of workspace nodes, write a full snapshot, and return cache metadata
+- `cache_environment_nodes` — Fetch every page of environment nodes, write a full snapshot, and return cache metadata
+- `cache_runs` — Fetch every page of run results, write a full snapshot, and return cache metadata
+- `cache_org_users` — Fetch every page of organization users, write a full snapshot, and return cache metadata
+- `clear_data_cache` — Delete all cached snapshots, auto-cached responses, and plan summaries ⚠️
+### Run workflows
+- `run_and_wait` — Start a run and poll until completion; requires Snowflake auth (Key Pair or PAT)
+- `retry_and_wait` — Retry a failed run and poll until completion; requires Snowflake auth (Key Pair or PAT)
+- `get_run_details` — Get run metadata and results in one call
+- `get_environment_overview` — Get environment details with full node list
+- `get_environment_health` — Comprehensive health dashboard: node counts by type, run statuses, failed runs in last 24h, stale nodes, dependency health, and overall health score (walks all paginated environment runs before scoring — slower on busy environments)
+### Skills
-#### Node Type Corpus
+- `personalize_skills` — Export bundled skill files to a local directory for customization; creates editable `coalesce_skills.{name}.md` and `user_skills.{name}.md` pairs (idempotent — never overwrites existing files)
-- `search_node_type_variants` - Search the committed node-type corpus snapshot by normalized family, package, primitive, or support status
-- `get_node_type_variant` - Load one exact node-type corpus variant by variant key
-- `generate_set_workspace_node_template_from_variant` - Generate a `set_workspace_node` body template from a committed corpus variant without needing the original external source repo at runtime; partial variants are rejected unless `allowPartial=true`
-#### Cache and Snapshots
-- `cache_workspace_nodes` - Fetch every page of workspace nodes, write the full snapshot to `coalesce_transform_mcp_data_cache/nodes/`, and return only cache metadata
-- `cache_environment_nodes` - Fetch every page of environment nodes, write the full snapshot to `coalesce_transform_mcp_data_cache/nodes/`, and return only cache metadata
-- `cache_runs` - Fetch every page of run results, write the full snapshot to `coalesce_transform_mcp_data_cache/runs/`, and return only cache metadata
-- `cache_org_users` - Fetch every page of organization users, write the full snapshot to `coalesce_transform_mcp_data_cache/users/`, and return only cache metadata
-- `clear_data_cache` - Delete all cached snapshots, auto-cached responses, and plan summaries under `coalesce_transform_mcp_data_cache/` ⚠️
-#### Workflows
+### Setup
-- `run_and_wait` - Start a run and poll until completion; requires Snowflake auth (Key Pair or PAT)
-- `retry_and_wait` - Retry a failed run and poll until completion; requires Snowflake auth (Key Pair or PAT)
-- `get_run_details` - Get run metadata and results in one call
-- `get_environment_overview` - Get environment details with full node list
-- `get_environment_health` - Get a comprehensive health dashboard: node counts by type, run statuses, failed runs in last 24h, stale nodes, dependency health, and overall health score. This walks all paginated environment runs before scoring, so it can take longer on busy environments.
+- `diagnose_setup` — Stateless probe reporting which first-time-setup pieces are configured: access token, Snowflake credentials, `~/.coa/config` profile, local repo path, and a best-effort `coa doctor` check. Returns a structured report plus ordered `nextSteps` and per-field `source` markers (`env`, `profile:<name>`, or `missing`). Pairs with the `/coalesce-setup` MCP prompt.
-#### Skills
+</details>
-- `personalize_skills` - Export bundled skill files to a local directory for customization; creates editable `coalesce_skills.{name}.md` and `user_skills.{name}.md` pairs (idempotent — never overwrites existing files)
+<details>
+<summary><strong>COA CLI tools (14)</strong> — bundled <code>@coalescesoftware/coa</code> CLI</summary>
-#### Lineage & Impact
+All local tools accept a `projectPath` argument and validate that it contains `data.yml` before shelling out. Destructive tools run preflight validation; see [Safety model](#safety-model).
-- `get_upstream_nodes` - Walk the full upstream dependency graph for a node and return every ancestor with depth level (no depth limit)
-- `get_downstream_nodes` - Walk the full downstream dependency graph for a node and return every dependent with depth level (no depth limit)
-- `get_column_lineage` - Trace a column through the entire pipeline upstream and downstream via column-level references
-- `analyze_impact` - Analyze downstream impact of changing a node or specific column — returns impacted counts, grouped by depth, and critical path
-- `propagate_column_change` - Update all downstream columns after a column rename or data type change ⚠️
-- `search_workspace_content` - Search across node SQL, column names, descriptions, and config values using the lineage cache as a searchable index
-- `audit_documentation_coverage` - Scan all workspace nodes and columns for missing descriptions and report coverage statistics
+### Read-only, local
-## Snowflake Exploration via Cortex Code
+- 🧰 `coa_doctor` — Check config, credentials, and warehouse connectivity for a project. Wraps `coa doctor --json`
+- 🧰 `coa_validate` — Validate YAML schemas and scan a project for configuration problems. Wraps `coa validate --json`
+- 🧰 `coa_list_project_nodes` — List all nodes defined in a local project (pre-deploy). Wraps `coa create --list-nodes`
+- 🧰 `coa_dry_run_create` — Preview DDL without executing against the warehouse. Forces `--dry-run --verbose`. Does **not** validate that referenced columns/types exist in the warehouse — catches SQL generation bugs, not schema-drift bugs
+- 🧰 `coa_dry_run_run` — Preview DML without executing against the warehouse. Forces `--dry-run --verbose`. Same caveat as `coa_dry_run_create`: SQL that looks valid here can still fail at run-time on missing columns
-This server manages node definitions, not live warehouse data. For Snowflake data questions (tables, schemas, row counts, sample data, permissions), add [Cortex Code](https://ai.snowflake.com) as a companion MCP server. The agent will automatically route Snowflake questions to cortex tools.
+### Read-only, cloud (require `~/.coa/config`)
-**Setup:**
+- 🧰 `coa_list_environments` — List deployment environments. Wraps `coa environments list --format json`
+- 🧰 `coa_list_environment_nodes` — List deployed nodes in an environment. Wraps `coa nodes list --environmentID ...`
+- 🧰 `coa_list_runs` — List pipeline runs in a cloud environment (or across all environments). Wraps `coa runs list`
-1. Install Cortex Code and configure a Snowflake connection:
+### Describe
-   ```bash
-   curl -LsS https://ai.snowflake.com/static/cc-scripts/install.sh | sh
-   cortex connections  # interactive connection setup
-   ```
+- 🧰 `coa_describe` — Fetch a section of COA's self-describing documentation by topic + optional subtopic. Also exposed as `coalesce://coa/describe/*` [resources](#coa-describe-topics)
-2. Add cortex as an MCP server in your `.mcp.json`:
+### Write and deploy
-   ```json
-   {
-     "cortex": {
-       "command": "cortex",
-       "args": ["--mcp-server"]
-     }
-   }
-   ```
+- 🧰 `coa_plan` — Generate a deployment plan JSON by diffing the local project against a cloud environment. Writes `coa-plan.json` (configurable via `out`). Non-destructive
+- 🧰 `coa_create` — Run DDL (CREATE/REPLACE) against the warehouse for selected nodes. Preflight-gated. ⚠️
+- 🧰 `coa_run` — Run DML (INSERT/MERGE) to populate selected nodes. Preflight-gated. ⚠️
+- 🧰 `coa_deploy` — Apply a plan JSON to a cloud environment. Verifies the plan file exists before running. ⚠️
+- 🧰 `coa_refresh` — Run DML for selected nodes in an already-deployed environment (no local project required). ⚠️
-The agent will see both servers' tools and route Snowflake data questions to cortex and node/pipeline questions to Coalesce tools.
+</details>
-## Notes
+## Design notes
-- **Caching:** Large responses are auto-cached to disk. Use `cache_workspace_nodes` and similar tools when you want a reusable snapshot. Configure the threshold with `COALESCE_MCP_AUTO_CACHE_MAX_BYTES`.
-- **Repo-backed tools:** Set `COALESCE_REPO_PATH` to your local Coalesce repo root (containing `nodeTypes/`, `nodes/`, `packages/`) or pass `repoPath` on individual tool calls. The server does not clone repos or install packages.
 - **SQL override is disallowed.** Nodes are built via YAML/config (columns, transforms, join conditions), not raw SQL. Template generation strips `overrideSQLToggle`, and write helpers reject `overrideSQL` fields.
+- **Caching.** Large responses are auto-cached to disk. Use `cache_workspace_nodes` and the other `cache_*` tools when you want a reusable snapshot. Configure the threshold with `COALESCE_MCP_AUTO_CACHE_MAX_BYTES`.
+- **Repo-backed tools.** Set `COALESCE_REPO_PATH` (or add `repoPath=` to your `~/.coa/config` profile) to your local Coalesce repo root (containing `nodeTypes/`, `nodes/`, `packages/`), or pass `repoPath` on individual tool calls. The server does not clone repos or install packages.
+- **COA CLI versioning.** The bundled COA CLI is pinned to an exact alpha version — *not* a floating `@next` tag. Every release of this MCP ships with a known-good COA build. Changelog and bump policy: [docs/RELEASES.md](docs/RELEASES.md).
+- **COA describe cache.** COA describe output is cached under `~/.cache/coalesce-transform-mcp/coa-describe/<coa-version>/` after first access. The cache is keyed by COA version, so an MCP upgrade that bumps the pinned COA build reads from a fresh directory rather than serving stale content.
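The version-keyed cache layout can be illustrated with a small path builder. This is a sketch only: the function names are hypothetical, the `<topic>.md` filename is an assumption (only the directory layout is documented above), and POSIX path separators are assumed.

```typescript
// Directory layout from the note above:
// ~/.cache/coalesce-transform-mcp/coa-describe/<coa-version>/
function describeCacheDir(homeDir: string, coaVersion: string): string {
  return `${homeDir}/.cache/coalesce-transform-mcp/coa-describe/${coaVersion}`;
}

// Assumption: one file per topic, named `<topic>.md`.
function describeCacheFile(homeDir: string, coaVersion: string, topic: string): string {
  return `${describeCacheDir(homeDir, coaVersion)}/${topic}.md`;
}
```

Because the COA version is part of the path, content cached for one pinned version is never read back under another; a new version simply starts with an empty directory.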
 ## Links
 - [Coalesce Docs](https://docs.coalesce.io/docs)
 - [Coalesce API Docs](https://docs.coalesce.io/docs/api/authentication)
+- [Coalesce CLI (`coa`)](https://docs.coalesce.io/docs/cli)
 - [Coalesce Marketplace Docs](https://docs.coalesce.io/docs/marketplace)
+- [Model Context Protocol](https://modelcontextprotocol.io/)
 ## License