npm - unbound-cli - Versions diffs - 1.4.0 → 1.5.0 - Mend

unbound-cli 1.4.0 → 1.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/PLAN-web-4887.md +38 -0
package/PLAN.md +79 -90
package/README.md +4 -2
package/package.json +1 -1
package/src/commands/policy.js +14 -7
package/src/lib/no-ai-guard.js +77 -0
package/test/no-ai-guard.test.js +363 -0

package/PLAN-web-4887.md CHANGED Viewed

@@ -320,6 +320,8 @@ forward-compatible: if the backend later adds `explanation`, the existing
 ## 8. Claude Code skill — distribution and content
+> ~~DEPRECATED — replaced by CLI-side guards in §14~~. The external Claude Code skill described below was never shipped; setup-repo PR #163 was closed unmerged. CLI-side deterministic enforcement (layers 1+2+3) supersedes this approach. See §14 for the shipped design.
 The CLI cannot edit Claude's system prompt, but it can drop a skill file. Skills are the closest analog to a CLAUDE.md hook that we can install programmatically.
 ### 8.1 Where it lives
@@ -475,3 +477,39 @@ These are intentionally out of scope for WEB-4887 to keep the surface narrow. Sp
 Each `unbound setup claude-code` and `unbound onboard` run **overwrites** `~/.claude/skills/unbound-tool-policy/SKILL.md` with the version fetched from the `setup` repo's main branch at install time. No checksum gating, no diff-and-prompt. Rationale: the skill is a derived artifact of whatever's in the setup repo's main; treating it as an install-time output (not user state) keeps the mental model simple and ensures Claude is always steered with the current rules.
 (The skill source-of-truth was originally bundled in unbound-cli during initial implementation; relocated to the setup repo to follow the existing per-tool-artifact convention. See companion PR https://github.com/websentry-ai/setup/pull/163.)
+---
+## 14. Steering enforcement (post-skill-kill)
+§8 (external Claude Code skill) is deprecated. Setup-repo PR #163 was closed unmerged, and the install-side artifact path was abandoned in favor of CLI-side deterministic enforcement that ships in the same binary as the commands it protects. Three layers, evaluated in order, implemented in `src/lib/no-ai-guard.js` and wired into the `.action()` handlers of `create-terminal` (line 1430) and `create-mcp` (line 1602) immediately after `requireLogin()`.
+### 14.1 Layer 1 — hard guard at the action boundary
+`unbound policy tool create-terminal` and `create-mcp` reject invocations that pass neither `--prompt` nor an explicit `--no-ai` opt-out. The error message names the AI-assist form, instructs the caller to retry with `--prompt "<text>"`, and surfaces `--no-ai` as the documented opt-out for raw classification flags. Exit code is `2` (invalid usage), routed through `err.exitCode = 2` and propagated by the `process.exitCode = err.exitCode || 1` catch block.
+`--prompt` and `--no-ai` are also declared mutex: passing both exits 2 with a "not both" message. This avoids ambiguity about which path the caller intended.
+Why an action-time guard and not a commander.js parse-time validator: commander has no first-class mutex API for "exactly one of A, B" that also accommodates a default-to-A steering nudge, and threading the AI-assist routing through commander's option parsing would complicate the existing `--prompt` branch in `policy.js`. Throwing from `.action()` keeps the guard local to the two subcommands it protects.
+### 14.2 Layer 2 — Claude Code detection + env-gated escape hatch
+When `process.env.CLAUDECODE === '1'`, `--no-ai` is additionally rejected unless `process.env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE === '1'` is also set. The rejection message explicitly says "intended for interactive humans, not for agents." This is steering, not security: the env var is in plain sight in the error message, and an adversarial agent that reads the message could set it itself. Accepted residual risk per the original principal-architect review. Server-side detection (deferred ticket) if pick-rate telemetry later shows leakage.
+`CLAUDECODE=1` is set by current Claude Code variants. If future variants do not set it, layer 1 still steers to `--prompt`, and the layer-2 nudge silently no-ops — an acceptable degradation. Future Claude Code variants can match `CLAUDECODE` to remain steered by this layer.
+### 14.3 Layer 3 — banner-first `--help`
+`unbound policy tool create-terminal --help` (and `create-mcp --help`) opens with a short banner that names the AI-assisted form before commander's auto-generated `Usage:` line. This is the first text a reader (human or agent) sees on `--help`, so it leads the eye to `--prompt`. Implementation: `.addHelpText('before', helpBannerFor(subcommandName))`. The existing `addHelpText('after', ...)` examples are updated so every manual-flag example prepends `--no-ai`, making the documented opt-out form unambiguous.
+### 14.4 Why the external skill was killed
+The original §8 plan installed `~/.claude/skills/unbound-tool-policy/SKILL.md` via `unbound setup claude-code` and `unbound onboard`, with per-user MDM rollout in setup-repo PR #163. Three problems killed it:
+1. **Install-time artifact, runtime enforcement.** A skill file in `~/.claude/` is a soft nudge — Claude reads it, but nothing in the CLI enforces what the skill recommends. CLI-side guards close the loop: the binary that creates the policy is also the binary that decides which path to route through.
+2. **Distribution sprawl.** Skill content lived in the setup repo to follow the per-tool-artifact convention. Updating steering rules meant a setup-repo PR, an MDM rollout, and a CLI release on different cadences. Folding the rules into the CLI collapses that to one release surface.
+3. **MDM rollout complexity.** Per-user install across `get_all_user_homes()` under `_run_as_user(username, ...)` was correct but added scope to a security-critical primitive for a feature that the CLI can enforce on its own.
+### 14.5 Versioning
+Minor bump: `1.4.0` → `1.5.0`. The breaking surface is two subcommands and the migration is a one-flag (`--no-ai`) prepend. Major bump would over-signal. README ships a `BREAKING CHANGE in 1.5.0` callout at the top of the Tool policy section; PR description names both subcommands and shows the migration.

package/PLAN.md CHANGED Viewed

@@ -1,117 +1,106 @@
-# Implementation Plan: WEB-4887 Phase 2 — AI-assisted MCP tool policy creation in `unbound` CLI
+# Implementation Plan: WEB-4887 CLI `--no-ai` steering guard (layers 1+2+3)
-> Generated by /implementation-plan on 2026-06-18. Source: principal-architect.
+> Generated by /implementation-plan on 2026-06-19. Source: principal-architect.
 >
-> Phase 1 (terminal command AI-assist) is **DELIVERED** on the same branch
-> (`web-4887-cli-ai-assist-policy`, commits `a90713e`..`f4508dc`, PR #54).
-> Phase 1 acceptance criteria still hold — see git history. This document
-> contracts Phase 2 (MCP tool AI-assist) only; downstream engineers should
-> read Phase 1's exports in `src/lib/policy-ai-assist.js` to understand the
-> infrastructure being extended (preflight, admin probe, error matrix,
-> confirmation, BLOCK/WARN guard, flag-precedence merge).
+> Successor to PR #54 (`web-4887-cli-ai-assist-policy`) which shipped `--prompt`-based AI-assist for `create-terminal` / `create-mcp`. This plan replaces the killed external Claude Code skill (`unbound-tool-policy`, setup repo PR #163 closed unmerged) with CLI-side deterministic enforcement.
 ## Goal
-Add `--prompt <text>` to `unbound policy tool create-mcp` so an admin can describe an MCP tool policy in natural language; the CLI routes the prompt through `POST /api/v1/command-policies/assist-mcp/`, merges the AI response with user-supplied override flags, previews the resolved policy, confirms, and creates it via the existing create endpoint.
+`unbound policy tool create-terminal` and `unbound policy tool create-mcp` deterministically route Claude Code agents through the AI-assist `--prompt` path: invocations without `--prompt` and without an explicit `--no-ai` opt-out exit with code 2 and a copy-paste-ready remediation message; under `CLAUDECODE=1` even `--no-ai` is rejected unless an env-gated escape hatch is set; `--help` leads with the AI-assisted form.
 ## Established Context (do not override)
-- Single PR — Phase 2 ships on existing branch `web-4887-cli-ai-assist-policy`, same PR #54 as Phase 1. Phase 1 infrastructure is shared.
-- Same `--prompt` flag pattern — add to existing `unbound policy tool create-mcp` subcommand (NOT a parallel `-ai` subcommand). Mutex with MCP classification flags (`--mcp-server`, `--mcp-tool` — singular, what the CLI actually exposes today; `--mcp-action-type`; `--config`). Compatible with `--name`, `--description`, `--action`, `--custom-message`, `--group`, `--disabled`, `--yes`, `--json` as overrides.
-- Endpoint: `POST {API_BASE}/api/v1/command-policies/assist-mcp/`. Admin-only. Request body `{user_input: string, current_form_state: {mcp_canonical_group_id, mcp_tools, name, description, action}}`. Success response: `{success:true, canonical_group_id:int, mcp_tools:string[], name:string, description:string, action:enum, custom_message:string}`. Failure response: `{success:false, error:string}`. No `explanation` field in the response (asymmetric with the terminal endpoint).
-- Field whitelist (spec §4.2). AI-fills: `mcp_canonical_group_id`, `mcp_tools[]`, `name` (≤80 chars), `description`, `action`, `custom_message`. User-via-flags overrides: `--name`, `--description`, `--action`, `--custom-message`, `--group`, `--disabled`. Out-of-scope (mutex with `--prompt`): `--mcp-server`, `--mcp-tool`, `--mcp-action-type`, `--config`.
-- No backend fallback (Haiku endpoint). Two extra failure modes vs Phase 1: `success:true` + `mcp_tools:[]` → soft fail; `success:true` + missing `canonical_group_id` → soft fail. Surface manual escape-hatch and exit 2; do NOT create the policy.
-- Reuse Phase 1 helpers from `src/lib/policy-ai-assist.js`: `validatePromptPreflight`, `loadPrivileges` + module-level `_privilegesCache`, `confirmCreate`, `confirmContinue`, `promptYesNo`, `routeBackendError`, `validateActionOverride`, `OUT_OF_SCOPE_KEYWORDS`, `MAX_PROMPT_LEN`, `TOOL_ACTIONS`, `colorAction`.
-- New helpers added to the SAME file (`src/lib/policy-ai-assist.js`): `runMcpPromptCreate`, `mergeAiAndFlagsMcp`, `renderMcpPreview`, `routeMcpSuccessFalse`. Module grows from ~340 LOC to ~600 LOC — within "wieldy".
-- BLOCK/WARN custom-message guard same shape as Phase 1: post-merge, if action ∈ {BLOCK, WARN} and no `custom_message`, exit 2 with user-vs-AI attributed wording.
-- Out of Scope this ticket: MCP catalog cache pre-flight (spec §7.2), backend `source:"cli"|"web"` telemetry (spec §12.2), AI-assist for Cost/Model/Security (spec §3 / §12.3), MDM-side MCP catalog awareness.
-- Skill update needed in setup PR #163 — separate cross-repo deliverable (see "Sequencing" step 7).
+- Repo `/Users/dinesh/Code/unbound-cli` — Node.js CLI on commander.js `^12.1.0`, tests via `node --test`.
+- Active branch `web-4887-cli-noai-guard`, stacked on `web-4887-cli-ai-assist-policy` (PR #54). The eventual PR base = `web-4887-cli-ai-assist-policy`; GitHub auto-retargets to `main` after #54 merges.
+- `src/commands/policy.js` — `create-terminal` at line 1430, `create-mcp` at line 1602. Both have an existing `opts.prompt`-presence branch (lines 1473 / 1646) with an empty-`--prompt` guard that exits 2 (lines 1475 / 1648). Flag-path required-field errors at lines 1539–1547 (terminal) / 1708–1715 (MCP), thrown into the handler `catch` which sets `process.exitCode = 1` (lines 1595–1598 / 1767–1770).
+- `src/lib/policy-ai-assist.js` owns the AI-assist `--prompt` path and **MUST NOT be modified** by this work — the new layers route BEFORE that module is reached.
+- Test convention: integration tests at the `commander.parseAsync` layer. Fresh `new Command()` per test, `src/api.js` `.get`/`.post` and `src/config.js` `isLoggedIn`/`getApiKey`/`getBaseUrl` stubbed, `require.cache` invalidated to reset module-level caches. Pattern established in `test/policy-ai-assist.test.js` (`loadFreshModules`, `buildHarness`, `runCreate`).
+- Exit-code convention: `output.error(msg)` + `process.exitCode = N` (no `process.exit`). Exit 2 is already used for invalid-flag failures (line 1476). Exit 1 is reserved for runtime/operational failures.
+- This is a **breaking change** for any caller of `create-terminal` / `create-mcp` with raw flags (no `--prompt`). Clean break recommended over deprecation window; breaking surface is exactly two subcommands.
+- Version bump: `package.json` `1.4.0` → `1.5.0` (minor; team versions liberally; breaking surface is narrow).
+- Do NOT bundle a Claude Code skill or anything that writes to `~/.claude/` — the whole point of this work is to eliminate that external artifact.
+- The original spec doc `PLAN-web-4887.md` is the source of truth for §1–§13 (Phase 1 and Phase 2 AI-assist scope) and must not be overwritten — only extended with a new §14 and a deprecation marker on §8.
 ## Out of Scope
-- MCP catalog cache pre-flight (spec §7.2). The "warn before sending if service not in org's catalog" optimization is deferred.
-- Backend `source: "cli" | "web"` telemetry field on the assist-mcp endpoint (spec §12.2).
-- AI-assist for Cost / Model / Security policy types (spec §3 / §12.3).
-- MDM-side MCP catalog awareness (catalogs are per-org).
-- Reverse-resolving `canonical_group_id` → service display name in the preview "Service" row. v1 shows the integer id; post-create `displayToolPolicy` shows the resolved name from the backend's full serialization. Polish follow-up.
-- Migrating the existing flag-based `create-mcp` body shape (`mcp_server` string + `mcp_tool` singular + `mcp_tool_action_type`) to the AI-assist shape (`mcp_canonical_group_id` int + `mcp_tools[]` array). The flag path stays exactly as today; only the AI-assist branch sends the new shape. Backend accepts both.
-- Renaming `--mcp-tool` (singular, current) to `--mcp-tools` (plural, as referenced in spec §7). Separate flag-rename ticket if wanted.
-- Updating `claude-code/skills/unbound-tool-policy/SKILL.md` in the `setup` repo (PR #163). Cross-repo follow-up commit, NOT bundled with the unbound-cli PR. Sequencing in step 7.
-- Running the eval harness in CI. The terminal eval set at `test/eval/policy-prompts.json` is manual-only; the MCP extension (10 single-service + 3 multi-service prompts per spec §7.5) is also manual-only and noted in `test/eval/README.md` as a follow-up entry — not a Phase 2 file change.
+- Modifying `src/lib/policy-ai-assist.js`. Routing decisions in this work happen BEFORE the AI-assist module is reached.
+- Any Claude Code skill, hook, or `~/.claude/` artifact. The point of this work is to remove that external steering layer.
+- Changes to `create-terminal`/`create-mcp` AI-assist behavior, prompt handling, preview rendering, merge precedence, BLOCK/WARN custom-message guard, or response routing.
+- Changes to `update`, `delete`, `get`, `list` tool-policy subcommands. The guard does NOT extend to those (no AI-assist exists for them).
+- Changes to non-tool policy subcommands (cost, model, security).
+- Eval harness updates (`test/eval/`).
+- Telemetry on guard fires (deferred until adoption data demands it).
+- Deprecation-window plumbing (one-release-with-warning option (b)). Clean break (option (a)) is committed.
+- Server-side detection of agents bypassing the guard (deferred — separate ticket if pick-rate telemetry shows leakage).
+- Major-version bump to `2.0.0` (architect endorses minor `1.5.0` per team versioning practice).
 ## Files to Change
 | Path | Change | Acceptance criterion |
 |------|--------|----------------------|
-| `src/commands/policy.js` | (a) Downgrade three `.requiredOption(...)` calls on `create-mcp` (lines ~1586, ~1587, ~1590 — `--name`, `--mcp-server`, `--action`) to `.option(...)`. (b) Add `.option('--prompt <text>', ...)`, `.option('--yes', ...)`. (Confirm `--json` is present; add if absent.) (c) Update the descriptions of `--name`, `--description`, `--action`, `--custom-message` to note "Override AI suggestion when used with --prompt". (d) In the `create-mcp` action handler, branch on `opts.prompt` BEFORE the existing flag validation. Branch body: empty-prompt check (exit 1, message "--prompt cannot be empty."); mutex against `--mcp-server`, `--mcp-tool`, `--mcp-action-type`, `--config` with verbatim wording "Pass --prompt for AI-assist or the field flags for explicit creation, not both." (exit 1); resolve `--group` via `loadFormData` and `resolveByName` to populate `opts.scopeUserGroupIds`; call `runMcpPromptCreate(opts)`; handle `err.exitCode === 0` as user-abort (warn + return); on success, `api.post('/api/v1/command-policies/', { body: result.body })`; success message `MCP policy${body.name ? ` "${body.name}"` : ''} created.`; `displayToolPolicy(unwrapToolPolicy(data))`. (e) Preserve the flag-based path verbatim; add `(or use --prompt for AI assist)` suffix to the three required-flag in-handler errors so non-`--prompt` users see equivalent guidance. (f) Update `addHelpText('after', ...)` so the first example uses `--prompt`. | M2, M3, M5, M6, M7, M9, M10, M11 |
-| `src/lib/policy-ai-assist.js` | Add four new symbols. (a) `routeMcpSuccessFalse(error)` — same shape as the existing `routeSuccessFalse`, but MCP-specific manual-escape-hatch wording (mentions `--mcp-server`/`--mcp-action-type`, not `--command-family`/`unbound policy tool families`). (b) `mergeAiAndFlagsMcp(aiResponse, opts)` — accepts the full AI response (`canonical_group_id`, `mcp_tools`, `name`, `description`, `action`, `custom_message`), returns body `{policy_type:'MCP_TOOL', mcp_canonical_group_id, mcp_tools, name, description, action, enabled, custom_message?, scope_user_group_ids?}`. Flag-precedence rules: trimmed `--name`/`--description`/`--action` win over AI; whitespace-only falls back to AI silently (mirrors Phase 1 regression fix); `--custom-message` always wins; `--disabled` always wins; default action `AUDIT` if neither AI nor flag sets it. (c) `renderMcpPreview(body, explanation)` — same `console.log` + `output.keyValue` shape as `renderTerminalPreview`. Rows in order: Name, Description (skip if empty), Type (`MCP_TOOL`), Service (`canonical_group_id: <int>` — see Out of Scope for why no reverse-resolve), Tools (`body.mcp_tools.join(', ')`), Action (colored via existing `colorAction`), Custom message (skip if empty), Scope (groups, `(org-wide)` if none), Enabled. Header "ℹ Resolved MCP policy (from AI assist):" in cyan. Explanation block guarded by `if (explanation)` — Phase 2 callers pass `undefined` because assist-mcp returns no explanation; forward-compatible if backend later adds it. (d) `runMcpPromptCreate(opts)` — orchestrator mirroring `runTerminalPromptCreate`. Sequence: `config.isLoggedIn()` check (exit 3); `validateActionOverride(opts.action)` early; `validatePromptPreflight(opts)` and warn loop over `warnings`; if warnings present and not `--yes`/`--json`, `confirmContinue()` gate (abort → throw with `exitCode:0`); `loadPrivileges()` + non-admin gate (exit 3 with role-naming message); `api.post('/api/v1/command-policies/assist-mcp/', {body: {user_input: sanitizedPrompt, current_form_state: {mcp_canonical_group_id:'', mcp_tools:[], name:'', description:'', action:''}}})` wrapped in try/catch that routes through `routeBackendError`; on `success === false`, route through `routeMcpSuccessFalse` and throw; **soft-fail check**: if `!response.canonical_group_id` OR `!Array.isArray(response.mcp_tools)` OR `response.mcp_tools.length === 0`, throw `Error` with `exitCode:2` and message `"AI assist could not match any tools for your description. Try naming the service and tools more directly, or use \`unbound policy tool create-mcp --name "..." --mcp-server <server> --mcp-action-type <read\|write\|destructive> --action <action>\`."`; `body = mergeAiAndFlagsMcp(response, opts)`; BLOCK/WARN guard reusing Phase 1's user-vs-AI attribution pattern (check `typeof opts.action === 'string' && opts.action.trim().length > 0` to decide attribution); `renderMcpPreview(body, undefined)` unless `opts.json`; `confirmed = opts.yes || opts.json ? true : await confirmCreate()`; return `{body, confirmed, explanation: undefined}`. (e) Append `runMcpPromptCreate`, `mergeAiAndFlagsMcp`, `renderMcpPreview` to `module.exports` (keep all existing exports verbatim). | M1, M2, M3, M4, M5, M6, M7, M8, M11 |
-| `README.md` | Under "Tool policy examples", in the MCP section, lead the first example with `unbound policy tool create-mcp --prompt "audit all Linear writes"`. Keep the flag-based example below as the documented fallback. Symmetric to Phase 1's terminal section update. | M12 |
-| `PLAN-web-4887.md` | §7: replace "Phase 2 spec, not yet implemented" language with "Phase 2 DELIVERED" markers. At the end of §7.4, add a note that the assist-mcp response has no `explanation` field (preview's explanation block is unconditionally skipped). §11.1 (catalog cache), §12.2 (`source:cli` telemetry), §12.3 (Cost/Model/Security AI-assist) remain in the deferred list — Phase 2 does NOT mark them delivered. | M13 |
+| `src/commands/policy.js` | (a) On both `create-terminal` (line 1430) and `create-mcp` (line 1602), add `.option('--no-ai', 'Opt out of the AI-assist guard and use raw classification flags. Mutually exclusive with --prompt.')`. (b) Insert `.addHelpText('before', helpBannerFor('create-terminal'))` (resp. `'create-mcp'`) above the existing `.action(...)`. (c) Update the existing `.addHelpText('after', ...)` examples (lines 1444–1468 and 1617–1641) so every manual-flag example prepends `--no-ai`; AI-assisted (`--prompt`) examples unchanged. (d) At the top of each `.action()` handler — inside the existing `try`, immediately after `requireLogin()` (lines 1471 / 1644) — call `assertSteering(opts, { subcommandName: 'create-terminal' })` (resp. `'create-mcp'`). (e) Change the outer handler `catch` (lines 1596–1598 and 1768–1770) from `process.exitCode = 1` to `process.exitCode = err.exitCode || 1` so the guard's `err.exitCode = 2` propagates (mirrors the already-existing pattern on the AI-assist branch at line 1512). | #1, #2, #3, #4, #5, #6 |
+| `package.json` | Bump `"version"` from `"1.4.0"` to `"1.5.0"`. | #7 |
+| `README.md` | In the "Tool policy" examples section, update every manual-flag example for `create-terminal` and `create-mcp` to prepend `--no-ai`. AI-assisted (`--prompt`) examples unchanged. Add a one-paragraph note under the section header documenting (i) the guard, (ii) the `--no-ai` opt-out, (iii) the `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1` env-var escape hatch for Claude Code interactive humans. Add a top-of-section `BREAKING CHANGE in 1.5.0:` callout. | #9 |
+| `PLAN-web-4887.md` | Append §14 "Steering enforcement (post-skill-kill)" documenting (i) layer 1 hard guard, (ii) layer 2 `CLAUDECODE=1` detection + `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1` escape hatch, (iii) layer 3 banner-first `--help`, (iv) why the external skill (PR #163) was killed. Mark §8 "Claude Code skill — distribution and content" as `~~DEPRECATED — replaced by CLI-side guards in §14~~` with a forward pointer to §14. Do NOT alter §1–§7 or §9–§13. | #10 |
 ## Files to Create
 | Path | Purpose | Acceptance criterion |
 |------|---------|----------------------|
-| `test/policy-ai-assist-mcp.test.js` | NEW integration test file at the CLI command layer. Uses `node:test` + `commander.parseAsync` exactly like Phase 1's `test/policy-ai-assist.test.js`. Extends the harness pattern: `buildHarness` in this file accepts `assistMcpResponse` / `assistMcpError` options, and the `api.post` stub routes on `path === '/api/v1/command-policies/assist-mcp/'` to either return `assistMcpResponse` or throw `assistMcpError` (otherwise routes to `/api/v1/command-policies/` for the create POST). Defines a local `runCreateMcp(argv)` helper wrapping `program.parseAsync(['node','unbound','policy','tool','create-mcp', ...argv])`. ~50 LOC of harness duplication accepted vs shared-helper plumbing. Defines 12 distinct tests (mutex tests parametrized over a `MUTEX_FLAGS` array; routing tests parametrized over a `ROUTING_CASES` array). Total ~13–15 `test(...)` calls. | M2, M3, M4, M5, M6, M7, M8, M9, M11 |
+| `src/lib/no-ai-guard.js` | New module, ~80 LOC, no external dependencies. Exports two pure functions: `assertSteering(opts, { subcommandName, env = process.env })` (runs layers 1 + 2; throws an `Error` with `err.exitCode = 2` and the appropriate stderr message — see AD-8 wording in the source plan) and `helpBannerFor(subcommandName)` (returns the multi-line banner string for `.addHelpText('before', ...)`). The `env` parameter is dependency-injected (defaults to `process.env`) so tests can exercise the production codepath without touching global env. | #1, #3, #4 |
+| `test/no-ai-guard.test.js` | New integration test file at `commander.parseAsync` layer. Uses `node:test` + `node:assert/strict`. Reuses the harness pattern from `test/policy-ai-assist.test.js` (`loadFreshModules`, fresh `new Command()`, stubbed `api.get`/`api.post`, captured `output.error`/`output.success`/`output.warn`, `program.exitOverride()`). Adds a local `withEnv(overrides, fn)` helper that saves+restores `process.env.CLAUDECODE` and `process.env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE` per test. Defines 11 `test(...)` cases (T1–T11) covering every acceptance criterion below; `beforeEach`/`after` hooks reset `process.exitCode = 0` (same convention as `test/policy-ai-assist.test.js` lines 97–106). | #1, #2, #3, #4, #5, #6, #8 |
 ## Test Surface
-- **Layer:** CLI command. Tests invoke `unbound policy tool create-mcp` via `commander.parseAsync(['node','unbound','policy','tool','create-mcp', ...argv])` against a fresh `new Command()` with the policy module re-registered per test (`loadFreshModules()` invalidates `require.cache` so module-level `_privilegesCache` and `_formDataCache` are clean). `src/api.js` `.post`/`.get` and `src/config.js` `isLoggedIn`/`getApiKey`/`getBaseUrl` are monkey-patched. NO internal helper unit tests — preflight is already covered by the policy-type-agnostic `test/policy-ai-assist-preflight.test.js` from Phase 1 and is NOT re-covered.
+- **Layer:** CLI command — `commander.parseAsync(['node','unbound','policy','tool','create-terminal'|'create-mcp', ...argv])` against a fresh `new Command()` with `src/commands/policy.js`'s `register` invoked per test. `src/api.js` `.get`/`.post` and `src/config.js` `isLoggedIn`/`getApiKey`/`getBaseUrl` are stubbed. `process.env.CLAUDECODE` and `process.env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE` are set per-test via a save+restore helper. NO unit tests on `assertSteering` or `helpBannerFor` directly — coverage is end-to-end through `parseAsync`.
 - **Tests to add or extend:**
-  - `test/policy-ai-assist-mcp.test.js` — covers acceptance criteria M2, M3, M4, M5, M6, M7, M8, M9, M11. Test cases:
-    1. Mutex `--prompt + --mcp-server` → M2.
-    2. Mutex `--prompt + --mcp-tool` → M2.
-    3. Mutex `--prompt + --mcp-action-type` → M2.
-    4. Mutex `--prompt + --config` → M2.
-    5. Empty `--prompt` → M2.
-    6. Happy-path body shape (assist-mcp returns full success response) → M3.
-    7. Merge precedence with all override flags + AI fields preserved where flags absent → M4.
-    8. Whitespace-only `--name` falls back to AI name → M4.
-    9. Soft-fail `mcp_tools:[]` → M8.
-    10. Soft-fail missing `canonical_group_id` → M8.
-    11. AI-attributed BLOCK without `--custom-message` → M11.
-    12. User-attributed `--action BLOCK` without `--custom-message` → M11.
-    13. 7 HTTP routing cases parametrized (200 success, 200 success_false length, 200 success_false generic, 401, 422, 5xx, network/timeout) → exit codes 0/2/2/3/2/4/4 → M5.
-    14. `--yes` prints preview + skips confirm + issues create POST; assert "Resolved MCP policy" header on stdout, Name/Tools/Action rows present, create POST fires → M6, M7.
-    15. `--json` suppresses preview but still POSTs create → M6.
-    16. No `--yes`, readline-stubbed `'n'` → preview prints, no create POST → M6.
-    17. Out-of-scope keyword `staging` + `--yes` → warning logged AND assist-mcp call fires → M9.
-    18. Non-admin probe → exit 3 before assist-mcp call (defensive smoke test against `_privilegesCache` bleed across tests).
-- M1, M10, M12, M13 are verified by inspection / build smoke (M1: `require('../src/lib/policy-ai-assist')` exposes new symbols AND all Phase 1 symbols; M10: existing `test/policy-conditions.test.js` + flag-path tests if any continue to pass — running `npm test` validates this; M12: `README.md` diff; M13: `PLAN-web-4887.md` diff).
+  - `test/no-ai-guard.test.js` — covers acceptance criteria #1, #2, #3, #4, #5, #6, #8. Test cases:
+    1. **T1** — `create-terminal` with no `--prompt` and no `--no-ai`, no `CLAUDECODE` → `process.exitCode === 2` AND captured `output.error` contains `"AI-assisted"`, `"Retry with"`, AND `"--prompt"`. No `api.post` calls observed. (#1)
+    2. **T2** — `create-mcp` with no `--prompt` and no `--no-ai`, no `CLAUDECODE` → same shape as T1, scoped to `create-mcp`. (#1)
+    3. **T3** — `create-terminal --no-ai --name X --command-family git --field command='git push*' --action AUDIT` with no `CLAUDECODE` → `api.post('/api/v1/command-policies/', ...)` fires (observed in stub). `process.exitCode === 0`. (#2)
+    4. **T4** — `create-mcp --no-ai --name X --mcp-server linear --mcp-action-type read --action AUDIT` with no `CLAUDECODE` → `api.post('/api/v1/command-policies/', ...)` fires. `process.exitCode === 0`. (#2)
+    5. **T5** — `create-terminal --prompt "block rm -rf" --no-ai` → `process.exitCode === 2` AND captured `output.error` contains `"not both"`. No `api.post` calls. (#3)
+    6. **T6** — `create-terminal --no-ai` under `CLAUDECODE=1`, `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE` unset → `process.exitCode === 2` AND captured `output.error` contains `"CLAUDECODE=1"` AND `"intended for interactive humans"`. No `api.post` calls. (#4)
+    7. **T7** — `create-terminal --no-ai` under `CLAUDECODE=1` AND `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1` → layer 2 bypassed, falls into existing flag-path, which then exits 1 with `"--name is required"` error (asserting the env-escape hatch worked end-to-end). (#4)
+    8. **T8** — `create-terminal --prompt "block rm -rf"` under `CLAUDECODE=1` → assist endpoint stub observes `api.post('/api/v1/command-policies/assist/', ...)`. Layer 2 does not interfere. (#5)
+    9. **T9** — `create-terminal --help` (collected via `program.outputHelp()` redirected stdout or `--help` triggering commander's help output capture) → stdout contains the AI-assisted banner block AND `idx(banner) < idx("Usage:")` (banner appears before commander's auto-generated Usage line). (#6)
+    10. **T10** — same as T9 for `create-mcp --help`. (#6)
+    11. **T11** (regression smoke) — `create-terminal --prompt "audit npm installs"` with no `CLAUDECODE`, no `--no-ai` → assist endpoint stub observes the assist POST. Confirms the existing Phase-1 happy-path is unaffected by the new guard. (#8)
 ## Acceptance Criteria
-1. **M1.** Given `require('../src/lib/policy-ai-assist')`, when destructured, then the module exports `runMcpPromptCreate`, `mergeAiAndFlagsMcp`, `renderMcpPreview` AND retains every Phase 1 export (`MAX_PROMPT_LEN`, `OUT_OF_SCOPE_KEYWORDS`, `validatePromptPreflight`, `mergeAiAndFlags`, `renderTerminalPreview`, `routeBackendError`, `runTerminalPromptCreate`) verbatim.
-2. **M2.** Given `unbound policy tool create-mcp --prompt "X" --mcp-server linear` (and the three other mutex pairings `--mcp-tool`, `--mcp-action-type`, `--config`), when the command runs, then `process.exitCode === 1` AND `output.error` is called with exactly `Pass --prompt for AI-assist or the field flags for explicit creation, not both.` AND no network calls fire. Given `unbound policy tool create-mcp --prompt ""`, then `process.exitCode === 1` AND `output.error` is called with a message containing `--prompt cannot be empty.` AND no network calls fire.
-3. **M3.** Given `unbound policy tool create-mcp --prompt "audit all linear writes" --yes` and a stubbed `api.post('/api/v1/command-policies/assist-mcp/')` returning `{success:true, canonical_group_id:12, mcp_tools:['create_issue','update_issue'], name:'Audit Linear writes', description:'…', action:'AUDIT'}`, when the command runs, then the body POSTed to `/api/v1/command-policies/` equals `{policy_type:'MCP_TOOL', mcp_canonical_group_id:12, mcp_tools:['create_issue','update_issue'], name:'Audit Linear writes', description:'…', action:'AUDIT', enabled:true}` (custom_message and scope_user_group_ids absent because not supplied).
-4. **M4.** Given a stubbed assist-mcp response `{success:true, canonical_group_id:12, mcp_tools:['t1'], name:'AI Name', description:'AI desc', action:'BLOCK', custom_message:'AI msg'}` and the CLI invoked with `--prompt "..." --name "Custom" --custom-message "Override" --action AUDIT --group GroupX --disabled --yes` and `loadFormData` stubbed to return `{user_groups:[{id:7,name:'GroupX'}]}`, when the command runs, then the create-POST body has `name:'Custom'`, `action:'AUDIT'`, `custom_message:'Override'`, `enabled:false`, `scope_user_group_ids:[7]`, AND retains `mcp_canonical_group_id:12` and `mcp_tools:['t1']` from the AI response. Given the same setup but `--name "   "` (whitespace-only), then the body has `name:'AI Name'` (AI value preserved, whitespace silently ignored).
-5. **M5.** Given each of 7 backend response shapes returned by the assist-mcp stub — (a) 200 + `{success:true, canonical_group_id, mcp_tools, ...}`, (b) 200 + `{success:false, error:"Input is too long (max 2000 characters)."}`, (c) 200 + `{success:false, error:"<generic>"}`, (d) `throw new FakeApiError(401, ...)`, (e) `throw new FakeApiError(422, ...)`, (f) `throw new FakeApiError(503, ...)`, (g) `throw new Error("connect ECONNREFUSED ...")` — when the command runs with `--yes`, then `process.exitCode` is `0 / 2 / 2 / 3 / 2 / 4 / 4` respectively AND `output.error` contains the verbatim wording from Phase 1's spec §6.4 matrix for that row.
-6. **M6.** Given `unbound policy tool create-mcp --prompt "audit linear writes"` without `--yes` and a stubbed successful assist-mcp response, when the command runs against a stubbed `readline.createInterface().question(_, cb)` that calls `cb('n')`, then `console.log` captures emit the header `Resolved MCP policy (from AI assist):` AND key rows for Name, Type (`MCP_TOOL`), Service, Tools (comma-joined), Action, Enabled BEFORE the prompt; AND no POST to `/api/v1/command-policies/` fires (user declined). Description and Custom message rows are omitted when their values are empty. Given `--json`, then NO "Resolved MCP policy" string appears on stdout or in `output.info`, but the create POST still fires.
-7. **M7.** Given `unbound policy tool create-mcp --prompt "audit linear writes" --yes` and a stubbed successful assist-mcp response, when the command runs, then the preview header AND key rows still print on stdout AND no readline prompt is invoked AND a POST to `/api/v1/command-policies/` fires with the merged body.
-8. **M8.** Given a stubbed assist-mcp response `{success:true, canonical_group_id:12, mcp_tools:[], name:'X', action:'AUDIT'}`, when the command runs with `--yes`, then `process.exitCode === 2` AND `output.error` contains the substring `could not match any tools` AND `unbound policy tool create-mcp --name` AND no POST to `/api/v1/command-policies/` fires. Given a stubbed assist-mcp response `{success:true, mcp_tools:['t'], name:'X', action:'AUDIT'}` (no `canonical_group_id`), the same exit code and message wording apply.
-9. **M9.** Given `unbound policy tool create-mcp --prompt "audit staging linear writes" --yes`, when the command runs, then `output.warn` is called with a message containing the substring `` `staging` `` AND a POST to `/api/v1/command-policies/assist-mcp/` is fired (does NOT exit early).
-10. **M10.** Given `unbound policy tool create-mcp --name X --mcp-server linear --mcp-tool create_issue --action AUDIT` (no `--prompt`), when the command runs against a stubbed `api.post('/api/v1/command-policies/')`, then the body POSTed equals `{name:'X', description:'', policy_type:'MCP_TOOL', mcp_server:'linear', mcp_tool:'create_issue', action:'AUDIT', enabled:true, config:{}}` — identical to today's flag-path body shape. Given `unbound policy tool create-mcp` with `--name` omitted, then the in-handler error `--name is required (or use --prompt for AI assist).` fires (replacing commander's auto-emitted `required option not specified` message).
-11. **M11.** Given the merged body has `action ∈ {BLOCK, WARN}` and no `custom_message`, when the command runs with `--yes`, then `process.exitCode === 2` AND no create POST fires AND `output.error` wording is user-attributed (`--action BLOCK requires --custom-message. ...`) when `opts.action` was passed by the user, or AI-attributed (`AI assist set --action to BLOCK ...`) when only the AI returned the action.
-12. **M12.** Given `README.md`, when read, the "Tool policy examples" MCP section's first example uses `unbound policy tool create-mcp --prompt "..."` AND a flag-based example appears below it as a documented fallback.
-13. **M13.** Given `PLAN-web-4887.md`, when read, §7 is marked "Phase 2 DELIVERED" with a cross-link to the Phase 2 commit AND §7.4 includes a note that assist-mcp has no `explanation` field AND §11.1, §12.2, §12.3 remain in the deferred list.
+1. Given `unbound policy tool create-terminal` (resp. `create-mcp`) invoked with neither `--prompt` nor `--no-ai`, when the command runs, then `process.exitCode === 2`, `output.error` is called with a message containing the substrings `"AI-assisted"`, `"Retry with"`, and `"--prompt"`, and no network call to `api.post` is made.
+2. Given `unbound policy tool create-terminal --no-ai --name X --command-family git --field command='git push*' --action AUDIT` (resp. `create-mcp --no-ai --name X --mcp-server linear --mcp-action-type read --action AUDIT`) with `process.env.CLAUDECODE` unset, when the command runs, then `api.post` is called with path `/api/v1/command-policies/` and `process.exitCode === 0`.
+3. Given `unbound policy tool create-terminal --prompt "X" --no-ai`, when the command runs, then `process.exitCode === 2`, `output.error` is called with a message containing the substring `"not both"`, and no network call is made.
+4. Given `unbound policy tool create-terminal --no-ai` under `process.env.CLAUDECODE === '1'` and `process.env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE` unset, when the command runs, then `process.exitCode === 2` AND `output.error` is called with a message containing `"CLAUDECODE=1"` AND `"intended for interactive humans"`. Given the same invocation but with `process.env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE === '1'`, layer 2 is bypassed and the existing flag-path errors out on `"--name is required"` (with `process.exitCode === 1`).
+5. Given `unbound policy tool create-terminal --prompt "block rm -rf"` under `process.env.CLAUDECODE === '1'`, when the command runs, then `api.post('/api/v1/command-policies/assist/', ...)` is invoked (assist endpoint is reached; layer 2 does not interfere).
+6. Given `unbound policy tool create-terminal --help` (resp. `create-mcp --help`), when the command runs, then the help output (captured stdout) contains the AI-assisted banner block (substring `"AI-ASSISTED (preferred):"`) AND the index of that banner is strictly less than the index of `"Usage:"`.
+7. `package.json` `version` is `"1.5.0"`.
+8. `npm test` exits 0 with all existing tests in `test/*.test.js` (including `test/policy-ai-assist.test.js` and `test/policy-ai-assist-mcp.test.js`) passing unmodified.
+9. `README.md` Tool policy section contains: (i) every manual-flag example using `--no-ai`, (ii) a paragraph documenting the guard + `--no-ai` + `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE` env var, (iii) a `BREAKING CHANGE in 1.5.0:` callout at the top of the section.
+10. `PLAN-web-4887.md` has a new §14 "Steering enforcement (post-skill-kill)" documenting the three layers and the env var, AND §8 begins with `~~DEPRECATED — replaced by CLI-side guards in §14~~` and a forward pointer to §14.
 ## Risks & Mitigations
-- **Risk:** Downgrading three `.requiredOption(...)` calls on `create-mcp` to `.option(...)` changes commander's auto-emitted error wording for users who omit `--name` / `--mcp-server` / `--action` on the flag-based path. Existing scripts that grep stderr will see different text.
-  - **Mitigation:** In-handler validation throws functionally equivalent messages (`"--name is required (or use --prompt for AI assist)."`, etc.) — same flag name, same imperative, additional `--prompt` hint. Document the wording change in the PR description. Phase 1 made the exact same change to `create-terminal`.
-- **Risk:** The assist-mcp response has no `explanation` field (asymmetric with the terminal endpoint). A naive `renderMcpPreview` that always prints "AI assist explanation:" would emit an empty trailing line.
-  - **Mitigation:** Reuse Phase 1's `if (explanation) { ... }` guard pattern verbatim. Phase 2 callers pass `explanation: undefined`; the block is skipped. Forward-compatible: if backend later adds `explanation`, the same branch picks it up with zero CLI change.
-- **Risk:** Soft-fail on `mcp_tools:[]` could mask a legitimate "match all tools on this service" intent. The Haiku endpoint's post-validation drops invented names; a legitimate "all writes on Linear" might survive validation as `mcp_tools:[]` if Linear has no write tools.
-  - **Mitigation:** Established Context #5 makes this an explicit product decision: do NOT auto-create with no tools. If backend later wants wildcard semantics, it can return `mcp_tools:['*']` or a sentinel; the CLI passes through unchanged. Surface message points users at the manual flag path for the unambiguous "all destructive tools on Linear" intent.
-- **Risk:** Preview shows `Service: canonical_group_id: 12` (integer) instead of "Linear". Users seeing only the preview won't know what service they're authorizing.
-  - **Mitigation:** v1 limitation, explicit in Out of Scope. Post-create `displayToolPolicy(unwrapToolPolicy(data))` shows the backend's resolved `mcp_server` name in the success output, so the integer-only window is just the pre-confirmation preview. Polish follow-up: have `renderMcpPreview` accept an optional name-resolution map fetched from `/api/v1/command-policies/metadata/mcp-servers/` (one extra round trip; defer unless user feedback demands it).
-- **Risk:** `runMcpPromptCreate` and `runTerminalPromptCreate` diverge subtly over time. The BLOCK/WARN guard wording is duplicated text, not a shared helper.
-  - **Mitigation:** Both functions explicitly call shared `validatePromptPreflight`, `loadPrivileges`, `confirmCreate`, `confirmContinue`, `routeBackendError`, `validateActionOverride`. The duplicated guard wording is ~10 lines — extract into a `assertCustomMessageForBlockWarn(body, opts)` helper if a third policy type ships with AI-assist (per spec §3, deferred).
-- **Risk:** `buildHarness` in `test/policy-ai-assist-mcp.test.js` duplicates ~50 LOC of the Phase 1 harness scaffolding (module-cache invalidation, output-capture, ApiError shape).
-  - **Mitigation:** Duplication is intentional. A shared `test/_helpers/harness.js` would couple Phase 1 and Phase 2 test files; any harness change risks breaking Phase 1's stable test suite. Accepted trade-off: per-file harnesses, ~50 LOC duplication, full Phase 1 / Phase 2 test isolation.
-- **Risk:** Cross-repo SKILL.md update on setup PR #163 lands BEFORE unbound-cli PR #54 — the skill would advertise an MCP `--prompt` path that doesn't exist yet on installed CLIs.
-  - **Mitigation:** Sequencing in step 7 explicitly orders PR #54 to merge first, THEN push the skill update commit to PR #163. CLI versioning ensures users who upgrade get the binary before the skill steers them to the new flag.
+- **Risk R1: BREAKING CHANGE breaks existing scripts.** Any CI / shell script today running `unbound policy tool create-terminal --name X --command-family git --field K=V --action AUDIT` starts exiting 2 immediately on upgrade.
+  - **Mitigation:** PR description includes a `BREAKING CHANGE:` callout naming both subcommands and showing the one-flag (`--no-ai`) migration. README breaking-change banner. Team automation audited in PR review. Breaking surface is narrow (two subcommands).
+- **Risk R2: Claude reads layer-2 stderr and sets `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1` itself.** The env var is in plain sight in the error message; an adversarial reading would lead the agent to set the var and bypass.
+  - **Mitigation:** Message wording explicitly says `"This is intended for interactive humans, not for agents."` Accepted residual risk — this is steering, not security. Server-side detection (deferred ticket) if pick-rate telemetry later shows leakage.
+- **Risk R3: `addHelpText('before', ...)` placement varies across commander.js versions.** Could cause the banner to render in the wrong slot.
+  - **Mitigation:** Project pins `commander ^12.1.0`. T9/T10 assert `idx(banner) < idx("Usage:")` and fail loudly on drift. Smoke-verify locally before commit.
+- **Risk R4: `process.env.CLAUDECODE` may not be `'1'` in every Claude Code variant.** Worst case: layer 2 silently no-ops.
+  - **Mitigation:** Layer 1 still steers in the absence of layer 2. Acceptable degradation; document the env var the CLI checks for in `PLAN-web-4887.md` §14 so future Claude Code variants can match.
+- **Risk R5: Test brittleness around `process.exitCode`.** The runner conflates a leftover non-zero `process.exitCode` with a test-file failure.
+  - **Mitigation:** Reuse `beforeEach(() => { process.exitCode = 0; })` and `after(() => { process.exitCode = 0; })` pattern verbatim from `test/policy-ai-assist.test.js` lines 97–106. Do not invent a new convention.
+- **Risk R6: `catch`-block change (`process.exitCode = err.exitCode || 1`) subtly alters exit codes for existing thrown `Error`s that lack `.exitCode`.**
+  - **Mitigation:** Today's behavior for such errors is `process.exitCode = 1`; the `|| 1` fallback preserves that exactly. Change is purely additive for errors that DO carry `.exitCode`. Existing `test/policy-ai-assist.test.js` and `test/policy-ai-assist-mcp.test.js` confirm regression-free behavior.
+- **Risk R7: Test harness module-cache bleed between `test/no-ai-guard.test.js` and the existing assist tests.** Module-level caches (e.g. `_privilegesCache` in `src/lib/policy-ai-assist.js`) could leak state.
+  - **Mitigation:** Copy the `loadFreshModules` pattern from `test/policy-ai-assist.test.js` lines 9–25 verbatim — it already invalidates `require.cache` for the relevant modules.
 ## Sequencing
-1. Extend `src/lib/policy-ai-assist.js`: add `routeMcpSuccessFalse`, `mergeAiAndFlagsMcp`, `renderMcpPreview`, `runMcpPromptCreate`. Append to `module.exports`. ~250 LOC added; module reaches ~600 LOC.
-2. Modify `src/commands/policy.js` `create-mcp` registration: downgrade three `.requiredOption(...)` to `.option(...)`, add `--prompt`/`--yes` (and `--json` if absent), update flag descriptions, branch in the action handler on `opts.prompt`, update `addHelpText` to lead with `--prompt`. ~80 LOC added/modified.
-3. Write `test/policy-ai-assist-mcp.test.js` with the ~13–15 tests enumerated in Test Surface. Run `npm test` — both the new file and the existing Phase 1 suite (`test/policy-ai-assist.test.js`, `test/policy-ai-assist-preflight.test.js`, all other existing tests) must pass.
-4. Update `README.md` "Tool policy examples" MCP section to lead with `--prompt`.
-5. Update `PLAN-web-4887.md` §7 to mark Phase 2 DELIVERED with commit cross-links; append the no-`explanation`-field note at the end of §7.4. Leave §11.1 / §12.2 / §12.3 in the deferred list.
-6. Commit on `web-4887-cli-ai-assist-policy`; PR #54 updates with the Phase 2 changes.
-7. (Cross-repo, AFTER step 6 merges) Push a follow-up commit to setup PR #163: update `claude-code/skills/unbound-tool-policy/SKILL.md` — drop the "MCP not available yet" footer, update the frontmatter description to cover both terminal AND MCP policy creation via `--prompt`, add a "Creating an Unbound MCP tool policy" section mirroring the terminal one (rules: one policy per service per invocation, tool names belong in the prompt, `--group` for scoping, `--custom-message` for BLOCK/WARN, fall back to flag-based on failure). Merge PR #163 after PR #54 is in main so the skill never advertises a flag the installed CLI doesn't have.
+1. Create `src/lib/no-ai-guard.js` with `assertSteering` and `helpBannerFor`. No call sites yet. Verify the module loads cleanly: `node -e "require('./src/lib/no-ai-guard')"`.
+2. In `src/commands/policy.js`, add `.option('--no-ai', ...)` to both `create-terminal` (line 1430 block) and `create-mcp` (line 1602 block).
+3. In `src/commands/policy.js`, wire `assertSteering(opts, { subcommandName: ... })` into both `.action()` handlers immediately after `requireLogin()` (lines 1471 / 1644).
+4. In `src/commands/policy.js`, change the outer handler `catch` blocks (lines 1596–1598 and 1768–1770) to use `process.exitCode = err.exitCode || 1`.
+5. In `src/commands/policy.js`, add `.addHelpText('before', helpBannerFor('create-terminal'))` / `'create-mcp'` and update the manual examples in the existing `.addHelpText('after', ...)` blocks (lines 1444–1468 and 1617–1641) to prepend `--no-ai`.
+6. Bump `package.json` `version` to `1.5.0`.
+7. Write `test/no-ai-guard.test.js` (T1–T11). Run `npm test`. Iterate stderr wording in `src/lib/no-ai-guard.js` until tests pass and copy remains human-readable.
+8. Update `README.md` Tool policy section per the table above.
+9. Update `PLAN-web-4887.md` — append §14, mark §8 deprecated.
+10. Run `npm test` once more for full regression confirmation.
+11. Self-review the diff. Commit. Open PR with base `web-4887-cli-ai-assist-policy`; PR description includes the `BREAKING CHANGE:` callout and the `--no-ai` migration example.
 ## Open Questions
-None — all decisions are locked in Established Context. The setup PR #163 SKILL.md update is sequencing, not a question.
+- None.

package/README.md CHANGED Viewed

@@ -195,6 +195,8 @@ unbound policy security create --name "429 Fallback" --sub-type error-code-routi
 Tool policy examples:
+> **BREAKING CHANGE in 1.5.0:** `create-terminal` and `create-mcp` now require either `--prompt` (AI-assisted creation) or an explicit `--no-ai` opt-out. Invocations with raw classification flags but no `--no-ai` exit with code 2 and a remediation message. Under Claude Code (`CLAUDECODE=1`), even `--no-ai` is rejected unless you also set `UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1` — this is intended for interactive humans, not agents.
 ```bash
 # AI-assisted (preferred): describe the policy in natural language.
 unbound policy tool create-terminal --prompt "block rm -rf"
@@ -207,11 +209,11 @@ unbound policy tool families
 unbound policy tool mcp-servers
 # Flag-based fallback: block destructive shell commands explicitly
-unbound policy tool create-terminal --name "Block rm -rf" --command-family filesystem \
+unbound policy tool create-terminal --no-ai --name "Block rm -rf" --command-family filesystem \
     --field command='rm -rf*' --action BLOCK --custom-message "Destructive command blocked."
 # Flag-based MCP fallback: audit Linear write operations
-unbound policy tool create-mcp --name "Audit Linear writes" --mcp-server Linear \
+unbound policy tool create-mcp --no-ai --name "Audit Linear writes" --mcp-server Linear \
     --mcp-action-type write --action AUDIT
 # List, get, delete

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "unbound-cli",
-  "version": "1.4.0",
+  "version": "1.5.0",
   "description": "CLI tool for Unbound - AI Gateway management",
   "main": "src/index.js",
   "bin": {

package/src/commands/policy.js CHANGED Viewed

@@ -2,6 +2,7 @@ const config = require('../config');
 const api = require('../api');
 const output = require('../output');
 const { formatDate, confirm, parseCommaSeparated } = require('../utils');
+const { assertSteering, helpBannerFor } = require('../lib/no-ai-guard');
 // ============================================================================
 // Constants and docs URLs
@@ -1430,6 +1431,7 @@ Subcommands:
     .command('create-terminal')
     .description('Create a TERMINAL_COMMAND policy: monitor or block shell commands run by AI coding tools.')
     .option('--prompt <text>', 'Natural-language description; routes through Unbound AI assist. Mutually exclusive with --command-family/--field/--config. Compatible with --name/--description/--action overrides.')
+    .option('--no-ai', 'Opt out of the AI-assist guard and use raw classification flags. Mutually exclusive with --prompt.')
     .option('--name <name>', 'Policy name (required unless --prompt is used)')
     .option('--command-family <family>', 'Command family (e.g. git, filesystem, network). See `policy tool families`.')
     .option('--field <key=pattern>', 'Match field: <key>=<pattern>. Repeatable — multiple --field are ANDed (all must match), one per field, for a more specific policy. At least one required unless --config is used.', (val, prev = []) => [...prev, val])
@@ -1441,6 +1443,7 @@ Subcommands:
     .option('--config <json>', 'Advanced: raw config JSON (skips --field builder)')
     .option('--yes', 'Skip the confirmation prompt for --prompt flows (preview is still printed)')
     .option('--json', 'Output raw JSON of the created policy')
+    .addHelpText('before', helpBannerFor('create-terminal'))
     .addHelpText('after', `
 Examples:
   # AI-assisted (preferred): describe the policy in natural language.
@@ -1450,15 +1453,15 @@ Examples:
   $ unbound policy tool create-terminal --prompt "block rm -rf" \\
       --name "Block recursive deletes" --action WARN
-  $ unbound policy tool create-terminal --name "Block rm -rf" \\
+  $ unbound policy tool create-terminal --no-ai --name "Block rm -rf" \\
       --command-family filesystem --field command='rm -rf*' --action BLOCK \\
       --custom-message "Destructive commands are blocked."
-  $ unbound policy tool create-terminal --name "Audit git push" \\
+  $ unbound policy tool create-terminal --no-ai --name "Audit git push" \\
       --command-family git --field command='git push*' --action AUDIT
   # Multiple --field are ANDed — this fires only on a push to main on origin:
-  $ unbound policy tool create-terminal --name "Block push to main on origin" \\
+  $ unbound policy tool create-terminal --no-ai --name "Block push to main on origin" \\
       --command-family git_action \\
       --field operation=push --field branch=main --field remote='*origin*' \\
       --action BLOCK --custom-message "Pushing to main on origin is blocked."
@@ -1469,6 +1472,7 @@ Learn more: ${DOCS_TOOL}
     .action(async (opts) => {
       try {
         requireLogin();
+        assertSteering(opts, { subcommandName: 'create-terminal' });
         if (opts.prompt !== undefined) {
           if (typeof opts.prompt !== 'string' || opts.prompt.trim() === '') {
@@ -1594,7 +1598,7 @@ Learn more: ${DOCS_TOOL}
         displayToolPolicy(unwrapToolPolicy(data));
       } catch (err) {
         output.error(err.message);
-        process.exitCode = 1;
+        process.exitCode = err.exitCode || 1;
       }
     });
@@ -1602,6 +1606,7 @@ Learn more: ${DOCS_TOOL}
     .command('create-mcp')
     .description('Create an MCP_TOOL policy: monitor or block MCP server tool calls.')
     .option('--prompt <text>', 'Natural-language description; routes through Unbound AI assist. Mutually exclusive with --mcp-server/--mcp-tool/--mcp-action-type/--config. Compatible with --name/--description/--action overrides.')
+    .option('--no-ai', 'Opt out of the AI-assist guard and use raw classification flags. Mutually exclusive with --prompt.')
     .option('--name <name>', 'Policy name (required unless --prompt is used). Override AI suggestion when used with --prompt.')
     .option('--mcp-server <server>', 'MCP server name as shown by `policy tool mcp-servers` (e.g. linear, github). Resolved to its canonical group automatically.')
     .option('--mcp-tool <tool>', 'Specific tool name on the MCP server (e.g. create_issue). Mutually exclusive with --mcp-action-type.')
@@ -1614,6 +1619,7 @@ Learn more: ${DOCS_TOOL}
     .option('--config <json>', 'Advanced: raw config JSON')
     .option('--yes', 'Skip the confirmation prompt for --prompt flows (preview is still printed)')
     .option('--json', 'Output raw JSON of the created policy')
+    .addHelpText('before', helpBannerFor('create-mcp'))
     .addHelpText('after', `
 Examples:
   # AI-assisted (preferred): describe the policy in natural language.
@@ -1625,12 +1631,12 @@ Examples:
       --custom-message "Linear writes are monitored."
   # Match a specific tool on a server (flag-based fallback):
-  $ unbound policy tool create-mcp --name "Block Linear writes" \\
+  $ unbound policy tool create-mcp --no-ai --name "Block Linear writes" \\
       --mcp-server linear --mcp-tool create_issue --action BLOCK \\
       --custom-message "Issue creation is blocked. Contact admin."
   # Match all destructive tools on a server:
-  $ unbound policy tool create-mcp --name "Audit all destructive Slack" \\
+  $ unbound policy tool create-mcp --no-ai --name "Audit all destructive Slack" \\
       --mcp-server slack --mcp-action-type destructive --action AUDIT
 The server name is resolved to its canonical group automatically, so pass the
@@ -1642,6 +1648,7 @@ Learn more: ${DOCS_TOOL}
     .action(async (opts) => {
       try {
         requireLogin();
+        assertSteering(opts, { subcommandName: 'create-mcp' });
         if (opts.prompt !== undefined) {
           if (typeof opts.prompt !== 'string' || opts.prompt.trim() === '') {
@@ -1766,7 +1773,7 @@ Learn more: ${DOCS_TOOL}
         displayToolPolicy(unwrapToolPolicy(data));
       } catch (err) {
         output.error(err.message);
-        process.exitCode = 1;
+        process.exitCode = err.exitCode || 1;
       }
     });

package/src/lib/no-ai-guard.js ADDED Viewed

@@ -0,0 +1,77 @@
+// Steering guard for `policy tool create-terminal` / `create-mcp`.
+//
+// Three layers, evaluated in order. Routing happens BEFORE src/lib/policy-ai-assist.js
+// is reached — this module never touches the network and has no external deps.
+//
+//   Layer 1 (assertSteering): require either --prompt or an explicit --no-ai opt-out.
+//                             Reject --prompt + --no-ai (mutex).
+//   Layer 2 (assertSteering): under CLAUDECODE=1, reject --no-ai unless the env-gated
+//                             escape hatch UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1 is set.
+//   Layer 3 (helpBannerFor):  banner-first --help so the AI-assisted form is the
+//                             one a reader sees first.
+//
+// Errors carry `err.exitCode = 2` so the .action() catch block in src/commands/policy.js
+// can propagate via `process.exitCode = err.exitCode || 1`.
+function makeGuardError(message) {
+  const err = new Error(message);
+  err.exitCode = 2;
+  return err;
+}
+// Layer 1 + Layer 2. Throws a guard error (exitCode 2) on rejection.
+// `opts` is the parsed commander options object; `--no-ai` arrives as `opts.ai === false`.
+// "prompt provided" means `--prompt` appeared on the command line at all (even
+// empty-string) — the empty-prompt case is validated by the existing
+// `opts.prompt.trim() === ''` check in the prompt branch so that callers see the
+// dedicated "--prompt cannot be empty." error rather than the steering message.
+function assertSteering(opts, { subcommandName, env = process.env } = {}) {
+  const hasPrompt = opts.prompt !== undefined;
+  const noAi = opts.ai === false;
+  // Mutex (layer 1): --prompt + --no-ai is incoherent — pick one.
+  if (hasPrompt && noAi) {
+    throw makeGuardError(
+      'Pass --prompt for AI-assisted creation or --no-ai for raw flags, not both.'
+    );
+  }
+  // Layer 1: neither --prompt nor --no-ai → steer to the AI-assisted form.
+  if (!hasPrompt && !noAi) {
+    throw makeGuardError(
+      `${subcommandName} requires AI-assisted creation. Retry with --prompt "<natural language description>". To use raw classification flags instead, opt out explicitly with --no-ai.`
+    );
+  }
+  // Layer 2: under Claude Code, even --no-ai is rejected unless the human-only
+  // escape hatch is set. This is steering, not security — the env var is in
+  // plain sight in the error message. See PLAN risk R2.
+  if (noAi && env.CLAUDECODE === '1' && env.UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE !== '1') {
+    throw makeGuardError(
+      `--no-ai is disabled under CLAUDECODE=1. This is intended for interactive humans, not for agents. Retry with --prompt "<natural language description>", or set UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1 if you are an interactive human.`
+    );
+  }
+}
+// Layer 3: banner rendered above commander's auto-generated `Usage:` line by
+// `.addHelpText('before', helpBannerFor(...))`. Keep it short so the help screen
+// is still scannable.
+function helpBannerFor(subcommandName) {
+  const examples = {
+    'create-terminal': 'unbound policy tool create-terminal --prompt "block rm -rf"',
+    'create-mcp': 'unbound policy tool create-mcp --prompt "audit all Linear writes"',
+  };
+  if (!Object.prototype.hasOwnProperty.call(examples, subcommandName)) {
+    throw new Error(`helpBannerFor: unknown subcommand "${subcommandName}"`);
+  }
+  const promptExample = examples[subcommandName];
+  return [
+    'AI-ASSISTED (preferred):',
+    `  $ ${promptExample}`,
+    '',
+    'To use raw classification flags instead, opt out explicitly with --no-ai.',
+    '',
+  ].join('\n');
+}
+module.exports = { assertSteering, helpBannerFor };

package/test/no-ai-guard.test.js ADDED Viewed

@@ -0,0 +1,363 @@
+const { test, beforeEach, after } = require('node:test');
+const assert = require('node:assert/strict');
+const { Command } = require('commander');
+// Integration tests for the --no-ai steering guard (WEB-4887, layers 1+2+3).
+// Invokes `unbound policy tool create-terminal` / `create-mcp` against a fresh
+// Command instance with src/api.js and src/config.js stubbed. We never hit the
+// network and never read/write user config.
+//
+// Harness mirrors test/policy-ai-assist.test.js (loadFreshModules, buildHarness,
+// per-test process.exitCode reset). See PLAN risk R7.
+function loadFreshModules() {
+  // Drop the module cache so each test gets a clean module-level
+  // `_privilegesCache` in policy-ai-assist and a clean stub surface.
+  for (const m of [
+    '../src/commands/policy',
+    '../src/lib/no-ai-guard',
+    '../src/lib/policy-ai-assist',
+    '../src/api',
+    '../src/config',
+    '../src/output',
+  ]) {
+    delete require.cache[require.resolve(m)];
+  }
+  const api = require('../src/api');
+  const config = require('../src/config');
+  const output = require('../src/output');
+  return { api, config, output };
+}
+function buildHarness({
+  privilegesResponse = { is_admin: true, is_manager: false, is_member: false },
+  assistResponse = null,
+  assistMcpResponse = null,
+  createResponse = { id: 1, name: 'ok' },
+} = {}) {
+  const { api, config, output } = loadFreshModules();
+  config.isLoggedIn = () => true;
+  config.getApiKey = () => 'fake-key';
+  config.getBaseUrl = () => 'https://b.acme';
+  const calls = { posts: [], gets: [] };
+  api.get = async (path, opts) => {
+    calls.gets.push({ path, opts });
+    if (path === '/api/v1/users/privileges/') return privilegesResponse;
+    throw new Error(`unexpected GET ${path}`);
+  };
+  api.post = async (path, opts) => {
+    calls.posts.push({ path, body: opts && opts.body });
+    if (path === '/api/v1/command-policies/assist/') return assistResponse;
+    if (path === '/api/v1/command-policies/assist-mcp/') return assistMcpResponse;
+    if (path === '/api/v1/command-policies/') return createResponse;
+    throw new Error(`unexpected POST ${path}`);
+  };
+  const captured = { success: [], error: [], warn: [], info: [], stdout: [] };
+  output.success = (m) => captured.success.push(m);
+  output.error = (m) => captured.error.push(m);
+  output.warn = (m) => captured.warn.push(m);
+  output.info = (m) => captured.info.push(m);
+  const origLog = console.log;
+  console.log = (m) => captured.stdout.push(String(m || ''));
+  const restoreLog = () => { console.log = origLog; };
+  return { api, config, output, calls, captured, restoreLog };
+}
+async function runArgv(argv) {
+  const { register } = require('../src/commands/policy');
+  const program = new Command();
+  program.exitOverride();
+  register(program);
+  await program.parseAsync(['node', 'unbound', ...argv]);
+}
+// Save+restore the two env vars the guard reads. Always set them to a known
+// state on entry (undefined unless explicitly overridden) so a stale value from
+// a previous test or the ambient shell cannot leak into this test.
+function withEnv(overrides, fn) {
+  const keys = ['CLAUDECODE', 'UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE'];
+  const saved = {};
+  for (const k of keys) {
+    saved[k] = process.env[k];
+    if (Object.prototype.hasOwnProperty.call(overrides, k)) {
+      if (overrides[k] === undefined) delete process.env[k];
+      else process.env[k] = overrides[k];
+    } else {
+      delete process.env[k];
+    }
+  }
+  return Promise.resolve()
+    .then(fn)
+    .finally(() => {
+      for (const k of keys) {
+        if (saved[k] === undefined) delete process.env[k];
+        else process.env[k] = saved[k];
+      }
+    });
+}
+beforeEach(() => {
+  process.exitCode = 0;
+});
+// Several tests intentionally set process.exitCode to non-zero values to
+// observe the CLI's exit-code routing. Reset it at the end so node:test does
+// not interpret the file itself as having failed.
+after(() => {
+  process.exitCode = 0;
+});
+// T1 — Layer 1, create-terminal: no --prompt + no --no-ai → exit 2, AI steering message.
+test('T1: create-terminal without --prompt and without --no-ai exits 2 with AI-assist steering message', async () => {
+  const h = buildHarness();
+  await withEnv({}, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-terminal']);
+      assert.equal(process.exitCode, 2);
+      const errs = h.captured.error.join('\n');
+      assert.ok(errs.includes('AI-assisted'), `expected "AI-assisted" in error: ${errs}`);
+      assert.ok(errs.includes('Retry with'), `expected "Retry with" in error: ${errs}`);
+      assert.ok(errs.includes('--prompt'), `expected "--prompt" in error: ${errs}`);
+      assert.equal(h.calls.posts.length, 0, 'no network call should have fired');
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T2 — Layer 1, create-mcp: same shape as T1, scoped to create-mcp.
+test('T2: create-mcp without --prompt and without --no-ai exits 2 with AI-assist steering message', async () => {
+  const h = buildHarness();
+  await withEnv({}, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-mcp']);
+      assert.equal(process.exitCode, 2);
+      const errs = h.captured.error.join('\n');
+      assert.ok(errs.includes('AI-assisted'), `expected "AI-assisted" in error: ${errs}`);
+      assert.ok(errs.includes('Retry with'), `expected "Retry with" in error: ${errs}`);
+      assert.ok(errs.includes('--prompt'), `expected "--prompt" in error: ${errs}`);
+      assert.equal(h.calls.posts.length, 0, 'no network call should have fired');
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T3 — Layer 1 happy path, create-terminal: --no-ai + raw flags → create POST fires.
+test('T3: create-terminal --no-ai with required raw flags POSTs to /command-policies/ with exit 0', async () => {
+  const h = buildHarness();
+  await withEnv({}, async () => {
+    try {
+      await runArgv([
+        'policy', 'tool', 'create-terminal',
+        '--no-ai',
+        '--name', 'X',
+        '--command-family', 'git',
+        '--field', 'command=git push*',
+        '--action', 'AUDIT',
+      ]);
+      assert.equal(process.exitCode || 0, 0);
+      assert.ok(
+        h.calls.posts.some((c) => c.path === '/api/v1/command-policies/'),
+        `expected create POST, got: ${JSON.stringify(h.calls.posts)}`
+      );
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T4 — Layer 1 happy path, create-mcp.
+test('T4: create-mcp --no-ai with required raw flags POSTs to /command-policies/ with exit 0', async () => {
+  const h = buildHarness();
+  await withEnv({}, async () => {
+    try {
+      await runArgv([
+        'policy', 'tool', 'create-mcp',
+        '--no-ai',
+        '--name', 'X',
+        '--mcp-server', 'linear',
+        '--mcp-action-type', 'read',
+        '--action', 'AUDIT',
+      ]);
+      assert.equal(process.exitCode || 0, 0);
+      assert.ok(
+        h.calls.posts.some((c) => c.path === '/api/v1/command-policies/'),
+        `expected create POST, got: ${JSON.stringify(h.calls.posts)}`
+      );
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T5 — Mutex (layer 1): --prompt + --no-ai exits 2 with "not both" wording.
+test('T5: create-terminal --prompt + --no-ai exits 2 with "not both" wording and no network call', async () => {
+  const h = buildHarness();
+  await withEnv({}, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-terminal', '--prompt', 'block rm -rf', '--no-ai']);
+      assert.equal(process.exitCode, 2);
+      const errs = h.captured.error.join('\n');
+      assert.ok(errs.includes('not both'), `expected "not both" in error: ${errs}`);
+      assert.equal(h.calls.posts.length, 0, 'no network call should have fired');
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T6 — Layer 2: --no-ai under CLAUDECODE=1 without escape hatch → exit 2.
+test('T6: create-terminal --no-ai under CLAUDECODE=1 exits 2 with human-only wording and no network call', async () => {
+  const h = buildHarness();
+  await withEnv({ CLAUDECODE: '1' }, async () => {
+    try {
+      await runArgv([
+        'policy', 'tool', 'create-terminal',
+        '--no-ai',
+        '--name', 'X',
+        '--command-family', 'git',
+        '--field', 'command=git push*',
+        '--action', 'AUDIT',
+      ]);
+      assert.equal(process.exitCode, 2);
+      const errs = h.captured.error.join('\n');
+      assert.ok(errs.includes('CLAUDECODE=1'), `expected "CLAUDECODE=1" in error: ${errs}`);
+      assert.ok(
+        errs.includes('intended for interactive humans'),
+        `expected "intended for interactive humans" in error: ${errs}`
+      );
+      assert.equal(h.calls.posts.length, 0, 'no network call should have fired');
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T7 — Layer 2 escape hatch: with UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1, layer 2 is
+// bypassed and the existing flag-path then errors out on missing --name.
+test('T7: --no-ai under CLAUDECODE=1 + UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE=1 bypasses layer 2 and falls through to flag-path required-field error', async () => {
+  const h = buildHarness();
+  await withEnv({ CLAUDECODE: '1', UNBOUND_ALLOW_NO_AI_UNDER_CLAUDE: '1' }, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-terminal', '--no-ai']);
+      assert.equal(process.exitCode, 1, 'flag-path required-field error must use exit 1, not the guard exit 2');
+      const errs = h.captured.error.join('\n');
+      assert.ok(errs.includes('--name is required'), `expected "--name is required" in error: ${errs}`);
+      assert.equal(h.calls.posts.length, 0, 'no network call should have fired');
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T8 — Layer 2 does not interfere with --prompt under CLAUDECODE=1.
+test('T8: create-terminal --prompt under CLAUDECODE=1 routes to the assist endpoint', async () => {
+  const h = buildHarness({
+    assistResponse: {
+      success: true,
+      form_updates: {
+        command_family: 'filesystem',
+        selected_field: 'command',
+        field_value: 'rm -rf*',
+        action: 'AUDIT',
+        name: 'X',
+      },
+      explanation: 'ok',
+    },
+  });
+  await withEnv({ CLAUDECODE: '1' }, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-terminal', '--prompt', 'block rm -rf', '--yes']);
+      assert.ok(
+        h.calls.posts.some((c) => c.path === '/api/v1/command-policies/assist/'),
+        `expected assist POST, got: ${JSON.stringify(h.calls.posts.map((c) => c.path))}`
+      );
+    } finally {
+      h.restoreLog();
+    }
+  });
+});
+// T9 — Layer 3: create-terminal --help banner appears before "Usage:".
+test('T9: create-terminal --help renders AI-ASSISTED banner before "Usage:"', async () => {
+  const h = buildHarness();
+  let captured = '';
+  const origWrite = process.stdout.write.bind(process.stdout);
+  process.stdout.write = (s) => { captured += s; return true; };
+  try {
+    await withEnv({}, async () => {
+      try {
+        await runArgv(['policy', 'tool', 'create-terminal', '--help']);
+      } catch (err) {
+        // commander.exitOverride() throws on --help with code 'commander.helpDisplayed'.
+        if (err && err.code && err.code !== 'commander.helpDisplayed') throw err;
+      }
+    });
+  } finally {
+    process.stdout.write = origWrite;
+    h.restoreLog();
+  }
+  const bannerIdx = captured.indexOf('AI-ASSISTED (preferred):');
+  const usageIdx = captured.indexOf('Usage:');
+  assert.notEqual(bannerIdx, -1, `expected "AI-ASSISTED (preferred):" banner in help output: ${captured}`);
+  assert.notEqual(usageIdx, -1, `expected "Usage:" in help output: ${captured}`);
+  assert.ok(bannerIdx < usageIdx, `banner (idx ${bannerIdx}) should precede Usage (idx ${usageIdx})`);
+});
+// T10 — Layer 3: create-mcp --help banner appears before "Usage:".
+test('T10: create-mcp --help renders AI-ASSISTED banner before "Usage:"', async () => {
+  const h = buildHarness();
+  let captured = '';
+  const origWrite = process.stdout.write.bind(process.stdout);
+  process.stdout.write = (s) => { captured += s; return true; };
+  try {
+    await withEnv({}, async () => {
+      try {
+        await runArgv(['policy', 'tool', 'create-mcp', '--help']);
+      } catch (err) {
+        if (err && err.code && err.code !== 'commander.helpDisplayed') throw err;
+      }
+    });
+  } finally {
+    process.stdout.write = origWrite;
+    h.restoreLog();
+  }
+  const bannerIdx = captured.indexOf('AI-ASSISTED (preferred):');
+  const usageIdx = captured.indexOf('Usage:');
+  assert.notEqual(bannerIdx, -1, `expected "AI-ASSISTED (preferred):" banner in help output: ${captured}`);
+  assert.notEqual(usageIdx, -1, `expected "Usage:" in help output: ${captured}`);
+  assert.ok(bannerIdx < usageIdx, `banner (idx ${bannerIdx}) should precede Usage (idx ${usageIdx})`);
+});
+// T11 — Regression smoke: existing --prompt happy path still reaches assist endpoint.
+test('T11: create-terminal --prompt without CLAUDECODE still routes to the assist endpoint (Phase-1 regression)', async () => {
+  const h = buildHarness({
+    assistResponse: {
+      success: true,
+      form_updates: {
+        command_family: 'git',
+        selected_field: 'command',
+        field_value: 'npm install*',
+        action: 'AUDIT',
+        name: 'X',
+      },
+      explanation: 'ok',
+    },
+  });
+  await withEnv({}, async () => {
+    try {
+      await runArgv(['policy', 'tool', 'create-terminal', '--prompt', 'audit npm installs', '--yes']);
+      assert.ok(
+        h.calls.posts.some((c) => c.path === '/api/v1/command-policies/assist/'),
+        `expected assist POST, got: ${JSON.stringify(h.calls.posts.map((c) => c.path))}`
+      );
+    } finally {
+      h.restoreLog();
+    }
+  });
+});