npm - llm-cli-gateway - Versions diffs - 1.15.0 → 1.15.2 - Mend

llm-cli-gateway 1.15.0 → 1.15.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (9)

package/CHANGELOG.md +55 -5
package/README.md +145 -34
package/dist/async-job-manager.js +12 -6
package/dist/executor.js +65 -8
package/dist/index.js +2 -2
package/dist/job-store.js +1 -1
package/dist/request-helpers.js +12 -0
package/package.json +1 -1
package/socket.yml +6 -18

package/CHANGELOG.md CHANGED

@@ -2,6 +2,56 @@
 All notable changes to the llm-cli-gateway project.
+## Unreleased
+## [1.15.2] - 2026-05-29 — security quality follow-up
+Patch release for GitHub Security & quality follow-up findings and Scorecard
+documentation.
+### Fixed
+- Preserve the leading content when truncating async job stdout/stderr in
+  `llm_job_result`, matching bounded-result consumer expectations instead of
+  returning only the tail.
+- Handle installer gateway log file close errors explicitly so failed flushes
+  from writable stdout/stderr log handles are surfaced to callers.
+### Changed
+- Moved non-canonical root Markdown into `docs/guides/` and `docs/archive/`
+  so the repository root stays focused on public entry points.
+- Renamed async-defer result guidance from the old retrieval field to `collectWith`,
+  avoiding Socket substring false positives in generated package code.
+- Recorded OpenSSF Scorecard `FuzzingID` as a valid roadmap/process item:
+  adding `fast-check` style property tests for parser, argv, and worktree
+  surfaces would improve the Scorecard signal, but the absence of fuzzing does
+  not block this patch release.
+## [1.15.1] - 2026-05-29 — quality badges + Sigstore release signing
+Release-infrastructure follow-up to v1.15.0.
+### Added
+- README quality badges for CI, security, OpenSSF Scorecard, npm, license, and
+  Sigstore-signed release artifacts.
+- Sigstore keyless signing for GitHub release installer artifacts, including
+  `.sigstore.json` bundles and pre-upload verification in the release workflow.
+- End-user verification guidance for `SHA256SUMS.sigstore.json` before trusting
+  release checksums.
+- Sanitized Windows Claude Desktop MCP config example using 1Password
+  environment injection placeholders.
+- Security workflow attribution guard that rejects new Claude/Anthropic
+  author/co-author metadata in future commits.
+### Changed
+- Manual release-installer rebuilds now fail fast unless launched from the
+  matching release tag ref, keeping Sigstore certificate identities stable.
+- Windows installer snippets and generated release manifest commands now verify
+  the Sigstore checksum bundle before executing the downloaded bootstrapper.
 ## [1.15.0] - 2026-05-28 — Phase 4 slice λ (gateway-owned worktree lifecycle)
 Ships the tenth Phase 4 slice: a new top-level `worktree` field on every
@@ -1097,11 +1147,11 @@ Technical corrections from the multi-LLM voice + technical review:
 ### Fixed — `socket.yml` networkAccess false-positive documentation
-- Documented that the `globalThis["fetch"]` flag on `dist/index.js` /
-  `dist/job-store.js` is a substring-match false positive. Neither file
-  contains any actual fetch call; the matches are English-prose
-  occurrences in an error message, the `fetchWith` JSON field name, and
-  a code comment. Verified by sub-agent investigation, no code change
+- Documented that Socket's network-access flag on `dist/index.js` /
+  `dist/job-store.js` was a substring-match false positive. Neither file
+  contained a production network call; the matches were English-prose
+  retrieval wording in an error message, a structured result-tool field name,
+  and a code comment. Verified by sub-agent investigation, no code change
   required, no attack-surface delta vs 1.5.35.
 ### Fixed — `lychee.toml` exclusions
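
Note on the 1.15.2 `### Fixed` entry in the CHANGELOG hunk above: it states that `llm_job_result` now keeps the leading content when truncating async job stdout/stderr, instead of returning only the tail. The corresponding `dist/async-job-manager.js` change is not shown in this excerpt, so the following is only a minimal sketch of the difference between the two strategies. The function names `truncateKeepingHead` and `truncateKeepingTail` and the `{ text, truncated }` return shape are hypothetical and are not taken from the package.

```javascript
// Hypothetical helpers illustrating head- vs tail-preserving truncation.
// These are NOT the gateway's actual functions; names and return shape are assumptions.

// Keep the first maxChars characters and drop the rest (the behaviour the
// changelog describes for 1.15.2).
function truncateKeepingHead(text, maxChars) {
  if (!Number.isInteger(maxChars) || maxChars < 0) {
    throw new RangeError("maxChars must be a non-negative integer");
  }
  if (text.length <= maxChars) {
    return { text, truncated: false };
  }
  return { text: text.slice(0, maxChars), truncated: true };
}

// Keep only the last maxChars characters (the behaviour the changelog says
// was replaced). Shown for contrast only.
function truncateKeepingTail(text, maxChars) {
  if (!Number.isInteger(maxChars) || maxChars < 0) {
    throw new RangeError("maxChars must be a non-negative integer");
  }
  if (text.length <= maxChars) {
    return { text, truncated: false };
  }
  // slice(-0) would return the whole string, so handle zero explicitly.
  const kept = maxChars === 0 ? "" : text.slice(-maxChars);
  return { text: kept, truncated: true };
}
```

A consumer that reads a bounded result from the start (for example, to parse a header or the first lines of a report) gets usable output from the head-preserving variant, whereas the tail-preserving variant would hand it only the end of the stream.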

package/README.md CHANGED

@@ -1,25 +1,44 @@
 # llm-cli-gateway
-> *"Without consultation, plans are frustrated, but with many counselors they succeed."*
+[![CI](https://github.com/verivus-oss/llm-cli-gateway/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/verivus-oss/llm-cli-gateway/actions/workflows/ci.yml)
+[![Security](https://github.com/verivus-oss/llm-cli-gateway/actions/workflows/security.yml/badge.svg?branch=main)](https://github.com/verivus-oss/llm-cli-gateway/actions/workflows/security.yml)
+[![OpenSSF Scorecard](https://api.scorecard.dev/projects/github.com/verivus-oss/llm-cli-gateway/badge)](https://scorecard.dev/viewer/?uri=github.com/verivus-oss/llm-cli-gateway)
+[![OpenSSF Best Practices](https://www.bestpractices.dev/projects/13025/badge)](https://www.bestpractices.dev/projects/13025)
+[![npm](https://img.shields.io/npm/v/llm-cli-gateway.svg)](https://www.npmjs.com/package/llm-cli-gateway)
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](LICENSE)
+[![Releases: Sigstore signed](https://img.shields.io/badge/releases-Sigstore%20signed-2e7d32.svg)](SECURITY.md#release-signing)
+> _"Without consultation, plans are frustrated, but with many counselors they succeed."_
 > — Proverbs 15:22 (LSB)
-A Model Context Protocol (MCP) server providing unified access to Claude Code, Codex, Gemini, Grok, and Mistral (Vibe) CLIs with session management, retry logic, and async job orchestration.
+A Model Context Protocol (MCP) gateway for running Claude Code, Codex, Gemini, Grok, and Mistral (Vibe) CLIs from one MCP endpoint, with durable async jobs, session continuity, cache-aware prompting, observability, and personal-appliance setup tooling.
-## Personal MCP Appliance MVP
+## What It Provides Today
-`llm-cli-gateway` is being packaged as a single-user personal MCP appliance for cross-LLM validation. The intended workflow is: connect one MCP endpoint, ask any client for cross-LLM validation.
+`llm-cli-gateway` is a single-user MCP gateway for cross-LLM validation and multi-agent coding workflows. It is more than a thin CLI wrapper:
+- Runs five provider CLIs through consistent sync and async MCP tools.
+- Persists long-running jobs, supports restart-safe result collection, deduplication, cancellation, and sync-to-async deferral.
+- Tracks sessions, real CLI resume paths, structured response metadata, and cache telemetry.
+- Supports cache-aware `promptParts`, including explicit Claude `cache_control` when opted in.
+- Can run requests inside gateway-managed git worktrees for isolated multi-agent review and implementation loops.
+- Ships personal-appliance setup surfaces: HTTP transport with bearer-token auth, `doctor --json`, setup UI artifacts, provider setup snippets, Docker fallback, and checked release bundles.
+## Personal MCP Appliance
+The personal-appliance contract keeps that surface intentionally narrow: one trusted user runs the gateway on a machine or volume they own, connects one MCP endpoint, and asks any connected client for cross-LLM validation.
 The product contract is documented in [docs/personal-mcp/PRODUCT_CONTRACT.md](docs/personal-mcp/PRODUCT_CONTRACT.md). It defines the single-user scope, security posture, target support matrix, and provider-support verification gates. Public setup guides must not claim ChatGPT, Claude web, Claude Desktop, Codex, Gemini CLI, Gemini web, or Grok inbound support until the corresponding provider/client path has been verified.
 This project does not provide hosted multi-tenant credential custody. Provider credentials stay on the user's machine or user-owned deployment volume.
-MVP release readiness is tracked in [docs/personal-mcp/RELEASE_READINESS.md](docs/personal-mcp/RELEASE_READINESS.md). Dogfooding evidence (which target LLMs guided setup, what unsafe suggestions were captured, which findings are deferred to post-MVP work) is in [docs/personal-mcp/DOGFOODING_RESULTS.md](docs/personal-mcp/DOGFOODING_RESULTS.md).
+Release-readiness history is tracked in [docs/personal-mcp/RELEASE_READINESS.md](docs/personal-mcp/RELEASE_READINESS.md). Dogfooding evidence (which target LLMs guided setup, what unsafe suggestions were captured, and which findings were deferred from the initial personal-appliance rollout) is in [docs/personal-mcp/DOGFOODING_RESULTS.md](docs/personal-mcp/DOGFOODING_RESULTS.md).
 Current personal-appliance artifacts include:
 - Streamable HTTP startup: `LLM_GATEWAY_AUTH_TOKEN=<token> npm run start:http`
 - Machine-readable diagnostics: `npm run doctor`
-- Go bootstrapper scaffold: `installer/` with `setup`, `doctor --json`, `start`, `stop`, `status`, `repair`, `upgrade`, `uninstall`, `print-client-config`, and verified bundle download commands.
+- Go bootstrapper: `installer/` with `setup`, `doctor --json`, `start`, `stop`, `status`, `repair`, `upgrade`, `uninstall`, `print-client-config`, and verified bundle download commands.
 - Release packaging: the release workflow builds Linux binaries on the local self-hosted runner, builds Windows/macOS binaries on GitHub-hosted runners, then publishes checksummed platform bundles with the gateway, production dependencies, and a managed Node runtime; see [installer/packaging/README.md](installer/packaging/README.md).
 - Docker Compose fallback: [docker-compose.personal.yml](docker-compose.personal.yml) + [Dockerfile.personal](Dockerfile.personal) for users who already manage containers.
 - Local setup UI artifact: [setup/ui/index.html](setup/ui/index.html)
@@ -34,11 +53,25 @@ Windows PowerShell:
 $Version = '<version>'
 $Base = "https://github.com/verivus-oss/llm-cli-gateway/releases/download/v$Version"
 $InstallDir = Join-Path (Join-Path $env:LOCALAPPDATA 'Programs') 'llm-cli-gateway'
+$ExeName = "llm-cli-gateway-$Version-windows-amd64.exe"
+$BundleName = "llm-cli-gateway-bundle-$Version-windows-amd64.tar.gz"
 $Exe = Join-Path $InstallDir 'llm-cli-gateway.exe'
+$Checksums = Join-Path $InstallDir 'SHA256SUMS'
+$ChecksumBundle = Join-Path $InstallDir 'SHA256SUMS.sigstore.json'
 New-Item -ItemType Directory -Force $InstallDir | Out-Null
-Invoke-WebRequest -UseBasicParsing "$Base/llm-cli-gateway-$Version-windows-amd64.exe" -OutFile $Exe
-$env:RVWR_GATEWAY_BUNDLE_URL = "$Base/llm-cli-gateway-bundle-$Version-windows-amd64.tar.gz"
-$env:RVWR_GATEWAY_BUNDLE_SHA256 = '<bundle-sha256-from-SHA256SUMS>'
+Invoke-WebRequest -UseBasicParsing "$Base/$ExeName" -OutFile $Exe
+Invoke-WebRequest -UseBasicParsing "$Base/SHA256SUMS" -OutFile $Checksums
+Invoke-WebRequest -UseBasicParsing "$Base/SHA256SUMS.sigstore.json" -OutFile $ChecksumBundle
+cosign verify-blob $Checksums --bundle $ChecksumBundle --certificate-identity "https://github.com/verivus-oss/llm-cli-gateway/.github/workflows/release-installer.yml@refs/tags/v$Version" --certificate-oidc-issuer "https://token.actions.githubusercontent.com"
+if ($LASTEXITCODE -ne 0) { throw "Sigstore verification failed for SHA256SUMS" }
+function Get-ReleaseSha256($Name) {
+  $line = Select-String -Path $Checksums -Pattern "^[a-fA-F0-9]{64}\s+$([regex]::Escape($Name))$" | Select-Object -First 1
+  if (-not $line) { throw "No SHA256SUMS entry found for $Name" }
+  return (($line.Line -split "\s+")[0]).ToLowerInvariant()
+}
+if ((Get-FileHash $Exe -Algorithm SHA256).Hash.ToLowerInvariant() -ne (Get-ReleaseSha256 $ExeName)) { throw "Checksum mismatch for $ExeName" }
+$env:RVWR_GATEWAY_BUNDLE_URL = "$Base/$BundleName"
+$env:RVWR_GATEWAY_BUNDLE_SHA256 = Get-ReleaseSha256 $BundleName
 & $Exe setup
 & $Exe stop
 & $Exe install-bundle
@@ -53,6 +86,9 @@ PATH. Do not script against release-versioned exe names after install.
 ```bash
 # After downloading the binary that matches your OS/arch from a release:
+cosign verify-blob SHA256SUMS --bundle SHA256SUMS.sigstore.json \
+  --certificate-identity "https://github.com/verivus-oss/llm-cli-gateway/.github/workflows/release-installer.yml@refs/tags/v<version>" \
+  --certificate-oidc-issuer "https://token.actions.githubusercontent.com"
 sha256sum --check SHA256SUMS            # verify before run (or `shasum -a 256 --check` on macOS)
 chmod +x llm-cli-gateway-<ver>-<os>-<arch>
 ./llm-cli-gateway-<ver>-<os>-<arch> setup
@@ -79,13 +115,16 @@ docker compose -f docker-compose.personal.yml run --rm doctor
 ## Features
 ### Core Capabilities
 - **Multi-LLM Orchestration**: Unified interface for Claude Code, Codex, Gemini, Grok, and Mistral (Vibe) CLIs
 - **Session Management**: Track and resume conversations across all CLIs with persistent storage
+- **Gateway-owned worktrees**: Run any sync or async provider request inside a managed git worktree, with per-session reuse and cleanup
 - **Token Optimization**: Automatic 44% reduction on prompts, 37% on responses (opt-in)
 - **Correlation ID Tracking**: Full request tracing across all LLM interactions
 - **Cross-Tool Collaboration**: LLMs can use each other via MCP (validated through dogfooding)
 ### Observability
 - **SQLite Flight Recorder**: Every request/response logged to `~/.llm-cli-gateway/logs.db` with correlation IDs, token usage, duration, retry counts, and circuit breaker state. Browse with [Datasette](https://datasette.io/): `datasette ~/.llm-cli-gateway/logs.db`
 - **Structured Metadata**: Tool responses include machine-readable `structuredContent` (model, cli, correlationId, sessionId, durationMs, token counts)
 - **Cache observability resources**: `cache_state://global`, `cache_state://session/{id}`, and `cache_state://prefix/{hash}` MCP resources return aggregate cache hit/miss/savings — tokens and hashes only, no prompt text. `session_get` includes a `cacheState` block when the session has prior requests.
@@ -109,17 +148,18 @@ Every `*_request` and `*_request_async` tool accepts an optional `promptParts` f
 Per-CLI capability matrix:
-| CLI     | Prefix discipline (auto via `promptParts`) | Explicit `cache_control` emission |
-|---------|--------------------------------------------|------------------------------------|
-| claude  | yes                                        | not yet (Branch B; gated on `[cache_awareness].emit_anthropic_cache_control`) |
-| codex   | yes                                        | n/a (OpenAI implicit cache, no CLI lever) |
-| gemini  | yes                                        | n/a (implicit prefix cache server-side)  |
-| grok    | yes                                        | n/a (no surfaced cache lever)            |
-| mistral | yes                                        | n/a (no surfaced cache lever)            |
+| CLI     | Prefix discipline (auto via `promptParts`) | Explicit `cache_control` emission                                            |
+| ------- | ------------------------------------------ | ---------------------------------------------------------------------------- |
+| claude  | yes                                        | yes, opt-in via `promptParts.cacheControl` and `outputFormat: "stream-json"` |
+| codex   | yes                                        | n/a (OpenAI implicit cache, no CLI lever)                                    |
+| gemini  | yes                                        | n/a (implicit prefix cache server-side)                                      |
+| grok    | yes                                        | n/a (no surfaced cache lever)                                                |
+| mistral | yes                                        | n/a (no surfaced cache lever)                                                |
 Opt-in flags (all default off) live under `[cache_awareness]` in `~/.llm-cli-gateway/config.toml`. See `docs/personal-mcp/PROVIDER_CACHE_SURFACES.md` for the per-model minimum cacheable token thresholds and field-name divergences.
 ### Reliability & Performance
 - **Retry Logic**: Exponential backoff with circuit breaker for transient failures
 - **Atomic File Writes**: Process-specific temp files with fsync for data integrity
 - **Memory Limits**: 50MB cap on CLI output prevents DoS attacks
@@ -127,7 +167,8 @@ Opt-in flags (all default off) live under `[cache_awareness]` in `~/.llm-cli-gat
 - **Long-Running Jobs**: Non-time-bound async execution via `*_request_async` + polling tools
 ### Security & Quality
-- **Comprehensive Testing**: 681 tests covering unit, integration, and regression scenarios with real CLI execution
+- **Comprehensive Testing**: 900+ tests covering unit, integration, and regression scenarios with real CLI execution
 - **Input Validation**: Zod schemas prevent injection attacks
 - **No Secret Leakage**: Generic session descriptions only (file permissions 0o600)
 - **No ReDoS**: Bounded regex patterns prevent catastrophic backtracking
@@ -139,6 +180,7 @@ Opt-in flags (all default off) live under `[cache_awareness]` in `~/.llm-cli-gat
 Before using this gateway, you need to install the CLI tools you want to use:
 ### Claude Code CLI
 ```bash
 # Installation instructions for Claude Code
 # Visit: https://docs.anthropic.com/claude-code
@@ -146,18 +188,21 @@ npm install -g @anthropic-ai/claude-code
 ```
 ### Codex CLI
 ```bash
 npm install -g @openai/codex
 codex login
 ```
 ### Gemini CLI
 ```bash
 npm install -g @google/gemini-cli
 # Or: https://github.com/google-gemini/gemini-cli
 ```
 ### Grok CLI (xAI)
 ```bash
 npm install -g grok-build
 grok login   # OAuth flow, or set GROK_CODE_XAI_API_KEY
@@ -165,6 +210,7 @@ grok login   # OAuth flow, or set GROK_CODE_XAI_API_KEY
 ```
 ### Mistral Vibe CLI
 ```bash
 # Pick one — the gateway's cli_upgrade auto-detects which one you used.
 pip install vibe-cli
@@ -184,7 +230,7 @@ Vibe-specific notes:
   requested or Vibe config needs recovery, and retries once after a
   model-not-found failure with refreshed discovery.
 - **`permissionMode` accepts** `default | plan | accept-edits | auto-approve |
-  chat | explore | lean` and emits `--agent <mode>`. The gateway's
+chat | explore | lean` and emits `--agent <mode>`. The gateway's
   programmatic-mode default is `auto-approve`; pick a stricter mode
   explicitly if you need approval gates.
 - **`allowedTools` is allow-list only** — the gateway emits one
@@ -198,11 +244,13 @@ Vibe-specific notes:
 ## Installation
 ### As an MCP server (npm)
 ```bash
 npm install -g llm-cli-gateway
 ```
 Or use directly with `npx`:
 ```json
 {
   "mcpServers": {
@@ -215,6 +263,7 @@ Or use directly with `npx`:
 ```
 ### From source
 ```bash
 git clone https://github.com/verivus-oss/llm-cli-gateway.git
 cd llm-cli-gateway
@@ -239,7 +288,7 @@ For clients that already support local stdio MCP servers, add a configuration li
 }
 ```
-This generic stdio example is not provider-support verification for the Personal MCP Appliance MVP. Client-specific setup guides for ChatGPT, Claude web, Claude Desktop, Codex, Gemini CLI, Gemini web, and Grok remain gated by the provider-support matrix in [docs/personal-mcp/PRODUCT_CONTRACT.md](docs/personal-mcp/PRODUCT_CONTRACT.md).
+This generic stdio example is not provider-support verification for the Personal MCP Appliance. Client-specific setup guides for ChatGPT, Claude web, Claude Desktop, Codex, Gemini CLI, Gemini web, and Grok remain gated by the provider-support matrix in [docs/personal-mcp/PRODUCT_CONTRACT.md](docs/personal-mcp/PRODUCT_CONTRACT.md).
 ### Available Tools
@@ -260,9 +309,11 @@ The validation report preserves per-provider disagreement. Optional judge synthe
 #### LLM Request Tools
 ##### `claude_request`
 Execute a Claude Code request with optional session management.
 **Parameters:**
 - `prompt` (string, required): The prompt to send (1-100,000 chars)
 - `model` (string, optional): Model name or alias (use `list_models` for available values; supports `latest`)
 - `outputFormat` (string, optional): Output format ("text" or "json"), default: "text"
@@ -281,10 +332,12 @@ Execute a Claude Code request with optional session management.
 - `correlationId` (string, optional): Request trace ID (auto-generated if omitted)
 **Response extras:**
 - `approval`: Approval decision record when `approvalStrategy="mcp_managed"`
 - `mcpServers`: Requested/enabled/missing MCP servers for this call
 **Example:**
 ```json
 {
   "prompt": "Write a Python function to calculate fibonacci numbers",
@@ -296,9 +349,11 @@ Execute a Claude Code request with optional session management.
 ```
 ##### `codex_request`
 Execute a Codex request with optional session tracking.
 **Parameters:**
 - `prompt` (string, required): The prompt to send (1-100,000 chars)
 - `model` (string, optional): Model name or alias (use `list_models` for available values; supports `latest`, recommended: `gpt-5.4`)
 - `fullAuto` (boolean, optional): Enable full-auto mode, default: false
@@ -314,10 +369,12 @@ Execute a Codex request with optional session tracking.
 - `idleTimeoutMs` (number, optional): Kill a stuck Codex process after output inactivity; 30,000 to 3,600,000 ms
 **Response extras:**
 - `approval`: Approval decision record when `approvalStrategy="mcp_managed"`
 - `mcpServers`: Requested MCP servers for this call
 **Example:**
 ```json
 {
   "prompt": "Create a REST API endpoint",
@@ -328,9 +385,11 @@ Execute a Codex request with optional session tracking.
 ```
 ##### `gemini_request`
 Execute a Gemini CLI request with session support.
 **Parameters:**
 - `prompt` (string, required): The prompt to send (1-100,000 chars)
 - `model` (string, optional): Model name or alias (use `list_models` for available values; supports `latest`, `pro`, `flash`)
 - `sessionId` (string, optional): Session ID to resume
@@ -347,10 +406,12 @@ Execute a Gemini CLI request with session support.
 - `correlationId` (string, optional): Request trace ID (auto-generated if omitted)
 **Response extras:**
 - `approval`: Approval decision record when `approvalStrategy="mcp_managed"`
 - `mcpServers`: Requested MCP servers for this call
 **Example:**
 ```json
 {
   "prompt": "Explain quantum computing",
@@ -361,9 +422,11 @@ Execute a Gemini CLI request with session support.
 ```
 ##### `grok_request`
 Execute a Grok CLI (xAI) request with session support.
 **Parameters:**
 - `prompt` (string, required): The prompt to send (1-100,000 chars)
 - `model` (string, optional): Model name or alias (e.g. `grok-build`, `latest`)
 - `outputFormat` (string, optional): `"plain"` (default), `"json"`, or `"streaming-json"`
@@ -384,6 +447,7 @@ Execute a Grok CLI (xAI) request with session support.
 - `correlationId` (string, optional): Request trace ID (auto-generated if omitted)
 **Example:**
 ```json
 {
   "prompt": "Summarize the latest commit message in 1 sentence",
@@ -397,7 +461,7 @@ Execute a Grok CLI (xAI) request with session support.
 Every async job is persisted to a job store as it transitions through running → completed/failed/canceled. This makes the gateway a durable collection layer:
 - **Re-issuing a request is safe.** Identical `*_request` / `*_request_async` calls within the dedup window (default 1 hour) short-circuit onto the existing running or completed job — the caller gets back the same job ID instead of starting a duplicate run. This directly fixes the "agent times out polling, re-issues, and the whole job starts over" failure mode.
-- **`llm_job_status` and `llm_job_result` work across gateway restarts.** Job rows live for 30 days by default; callers can fetch results long after the in-memory cache has evicted them.
+- **`llm_job_status` and `llm_job_result` work across gateway restarts.** Job rows live for 30 days by default; callers can collect results long after the in-memory cache has evicted them.
 - **Jobs running at shutdown are marked `orphaned`** on the next gateway boot (the detached child can't be reattached to). Their captured partial output remains readable.
 - **Pass `forceRefresh: true`** on any request tool to bypass dedup and force a fresh CLI run.
@@ -416,12 +480,14 @@ acknowledgeEphemeral = false                # required to enable async tools wit
 ```
 Backends:
 - **`sqlite`** (default) — durable, file-backed. Safe for single-instance deployments.
 - **`memory`** — in-process Map. Lost on gateway exit. Requires `acknowledgeEphemeral = true` to be loaded. Suitable for tests and ephemeral CI gateways.
 - **`postgres`** — interface only, implementation not yet shipped. Selecting this backend throws at startup.
 - **`none`** — no store. **`*_request_async`, `llm_job_status`, `llm_job_result`, and `llm_job_cancel` are NOT registered on the gateway.** This is a structural invariant: agents that try to call async tools against a gateway with `backend = "none"` get a clean "tool not found" at connect time instead of silent in-memory loss after the 1-hour TTL. Use `llm_process_health` to inspect the resolved persistence state programmatically.
 Legacy environment variables (deprecated; emit a warning at startup):
 - `LLM_GATEWAY_LOGS_DB` / `LLM_GATEWAY_JOBS_DB` — `none` selects `backend = "none"`; any other value selects `backend = "sqlite"` with that path.
 - `LLM_GATEWAY_JOB_RETENTION_DAYS` — overrides `retentionDays`.
 - `LLM_GATEWAY_DEDUP_WINDOW_MS` — overrides `dedupWindowMs`.
@@ -459,7 +525,7 @@ backend = "sqlite"
 path = "/srv/repos/.../my-repo/.gateway/logs.db"
 ```
-Now every gateway subprocess spawned for *this* repo's Claude Code window reads its own config and writes to its own SQLite file; sessions, jobs, and dedup state are scoped to the repo. Other repos keep using the global default. `llm_process_health.persistence.sources.configFile` lets an agent confirm which config it's actually running under.
+Now every gateway subprocess spawned for _this_ repo's Claude Code window reads its own config and writes to its own SQLite file; sessions, jobs, and dedup state are scoped to the repo. Other repos keep using the global default. `llm_process_health.persistence.sources.configFile` lets an agent confirm which config it's actually running under.
 ###### Agent-executable spec (DAG-TOML)
@@ -472,7 +538,7 @@ template_kind     = "implementation-dag"
 docs              = "https://github.com/verivus-oss/agent-assurance/blob/main/SPEC.md"
 confidentiality   = "public"
 title             = "Per-project llm-cli-gateway persistence isolation"
-spec              = "https://github.com/verivusai-labs/llm-cli-gateway#per-project-isolation"
+spec              = "https://github.com/verivus-oss/llm-cli-gateway#per-project-isolation"
 created           = "YYYY-MM-DD"
 total_units       = 5
 tier1_units       = ["U01","U02","U03","U04","U05"]
@@ -623,6 +689,7 @@ consumes       = ["OUT:mcp-reconnected"]
 **Why this matters for agents:** the gateway has multiple configuration surfaces (TOML file, env-var overrides, two different MCP settings files) and one easy mistake — editing the committed `.mcp.json` instead of the local-only `.claude/settings.local.json` — will silently break the per-project scope for every other developer on the repo. The DAG above encodes the correct sequence, the verification gate, and the failure modes explicitly so an agent can execute it without inference.
 ##### `mistral_request`
 Run a Mistral Vibe agentic coding request. Like `grok_request` in shape, but with Vibe's specific surface:
 - `model` (string, optional): Vibe model alias (for example `mistral-medium-3.5` or `latest`). The resolved value is injected via the `VIBE_ACTIVE_MODEL` environment variable; omit it to let the gateway discover Vibe config and avoid stale hardcoded defaults.
@@ -632,33 +699,41 @@ Run a Mistral Vibe agentic coding request. Like `grok_request` in shape, but wit
 - `sessionId` / `resumeLatest` / `createNewSession`: standard session controls. Continuity requires `[session_logging] enabled = true` in `~/.vibe/config.toml` — `doctor --json` surfaces an actionable next-action when the toggle is missing.
 ##### `claude_request_async` / `codex_request_async` / `gemini_request_async` / `grok_request_async` / `mistral_request_async`
 Start a long-running Claude, Codex, Gemini, Grok, or Mistral request without waiting for completion in the same MCP call.
 Use this flow when analysis/runtime can exceed client tool-call limits:
 1. Start job with `*_request_async`
 2. Poll with `llm_job_status`
 3. Fetch output with `llm_job_result`
 4. Optionally stop with `llm_job_cancel`
 Async request tools accept the same approval strategy fields as their sync variants:
 - `approvalStrategy`: `"legacy"` (default) or `"mcp_managed"`
 - `approvalPolicy`: `"strict"|"balanced"|"permissive"` override
 - `mcpServers`: Requested MCP servers (`sqry`, `exa`, `ref_tools`, `trstr`)
 - `claude_request_async` also supports `strictMcpConfig` and fails fast when requested servers are unavailable
 ##### `llm_job_status`
 Return lifecycle status (`running`, `completed`, `failed`, `canceled`) and metadata for an async job.
 ##### `llm_job_result`
 Return captured stdout/stderr for an async job (with configurable max chars per stream).
 ##### `llm_job_cancel`
 Cancel a running async job.
 ##### `approval_list`
 List recent MCP-managed approval decisions recorded by the gateway.
 **Parameters:**
 - `limit` (number, optional): Max records (1-500), default: 50
 - `cli` (string, optional): Filter by `"claude"`, `"codex"`, or `"gemini"`
@@ -667,14 +742,17 @@ Approval records are persisted to `~/.llm-cli-gateway/approvals.jsonl`.
 #### Session Management Tools
 ##### `session_create`
 Create a new session for a specific CLI.
 **Parameters:**
 - `cli` (string, required): CLI to create session for ("claude", "codex", "gemini", "grok", "mistral")
 - `description` (string, optional): Description for the session
 - `setAsActive` (boolean, optional): Set as active session, default: true
 **Example:**
 ```json
 {
   "cli": "claude",
@@ -684,50 +762,64 @@ Create a new session for a specific CLI.
 ```
 ##### `session_list`
 List all sessions, optionally filtered by CLI.
 **Parameters:**
 - `cli` (string, optional): Filter by CLI ("claude", "codex", "gemini", "grok", "mistral")
 **Response includes:**
 - Total session count
 - Session details (ID, CLI, description, timestamps, active status)
 - Active session IDs for each CLI
 ##### `session_set_active`
 Set the active session for a specific CLI.
 **Parameters:**
 - `cli` (string, required): CLI to set active session for
 - `sessionId` (string, required): Session ID to activate (or null to clear)
 ##### `session_get`
 Retrieve details for a specific session.
 **Parameters:**
 - `sessionId` (string, required): Session ID to retrieve
 ##### `session_delete`
 Delete a specific session.
 **Parameters:**
 - `sessionId` (string, required): Session ID to delete
 ##### `session_clear_all`
 Clear all sessions, optionally for a specific CLI.
 **Parameters:**
 - `cli` (string, optional): Clear sessions for specific CLI only
 #### Utility Tools
 ##### `list_models`
 List available models for each CLI.
 **Parameters:**
 - `cli` (string, optional): Specific CLI to list models for ("claude", "codex", "gemini", "grok", "mistral")
 **Response includes:**
 - Model names and descriptions
 - Best use cases for each model
 - CLI-specific information
@@ -764,21 +856,26 @@ LLM_GATEWAY_DISABLE_MODEL_DISCOVERY=1
 ```
 ##### `cli_versions`
 Report installed CLI versions.
 **Parameters:**
 - `cli` (string, optional): Specific CLI to inspect ("claude", "codex", "gemini", "grok", "mistral")
 ##### `cli_upgrade`
 Plan or run an upgrade for one CLI.
 **Parameters:**
 - `cli` (string, required): CLI to upgrade ("claude", "codex", "gemini", "grok", "mistral")
 - `target` (string, optional): Package tag/version/target, default: `latest`
 - `dryRun` (boolean, optional): Return the upgrade plan without running it, default: `true`
 - `timeoutMs` (number, optional): Upgrade timeout when `dryRun=false`
 **Upgrade strategies:**
 - Claude latest: `claude update`
 - Claude explicit target: `claude install <target>`
 - Codex latest: `codex update`
@@ -786,6 +883,7 @@ Plan or run an upgrade for one CLI.
 - Gemini: `npm install -g @google/gemini-cli@<target>`
 **Example dry run:**
 ```json
 {
   "cli": "gemini",
@@ -810,7 +908,7 @@ Plan or run an upgrade for one CLI.
 await callTool("session_create", {
   cli: "claude",
   description: "Debugging session",
-  setAsActive: true
+  setAsActive: true,
 });
 // 2. Make requests (automatically uses active session)
@@ -822,7 +920,7 @@ await callTool("claude_request", {
 // 3. Continue the conversation
 await callTool("claude_request", {
   prompt: "Can you explain that fix in more detail?",
-  continueSession: true
+  continueSession: true,
 });
 // 4. List all sessions
@@ -831,12 +929,12 @@ await callTool("session_list", { cli: "claude" });
 // 5. Switch to a different session
 await callTool("session_set_active", {
   cli: "claude",
-  sessionId: "some-other-session-id"
+  sessionId: "some-other-session-id",
 });
 // 6. Delete when done
 await callTool("session_delete", {
-  sessionId: "session-id-to-delete"
+  sessionId: "session-id-to-delete",
 });
 ```
@@ -864,6 +962,7 @@ await callTool("session_delete", {
 ### CLI-Specific Settings
 Each CLI can be configured through its own configuration files:
 - Claude Code: `~/.claude/config.json`
 - Codex: `~/.codex/config.toml`
 - Gemini: `~/.gemini/config.json`
@@ -939,6 +1038,7 @@ npm start
 The gateway provides detailed error messages for common issues:
 ### CLI Not Found
 ```
 Error executing claude CLI:
 spawn claude ENOENT
@@ -947,12 +1047,14 @@ The 'claude' command was not found. Please ensure claude CLI is installed and in
 ```
 ### External Timeout / Legacy Timeout Option
 ```
 Error executing codex CLI: Command timed out
 Process timed out after 120000ms
 ```
 ### Invalid Parameters
 ```
 Prompt cannot be empty
 Prompt too long (max 100k chars)
@@ -970,6 +1072,7 @@ Logs are written to stderr (stdout is reserved for MCP protocol):
 ```
 Enable debug logging:
 ```bash
 DEBUG=1 node dist/index.js
 ```
@@ -979,6 +1082,7 @@ DEBUG=1 node dist/index.js
 ### CLIs Not Found
 Make sure the CLIs are installed and in your PATH:
 ```bash
 which claude
 which codex
@@ -986,6 +1090,7 @@ which gemini
 ```
 The gateway extends PATH to include common locations:
 - `~/.local/bin`
 - `/usr/local/bin`
 - `/usr/bin`
@@ -994,6 +1099,7 @@ The gateway extends PATH to include common locations:
 ### Permission Errors
 If you encounter permission errors, make sure the CLI binaries are executable:
 ```bash
 chmod +x $(which claude)
 chmod +x $(which codex)
@@ -1005,16 +1111,19 @@ chmod +x $(which gemini)
 Sessions are stored in `~/.llm-cli-gateway/sessions.json`. If you encounter issues:
 1. Check file permissions:
 ```bash
 ls -la ~/.llm-cli-gateway/
 ```
 2. Reset sessions:
 ```bash
 rm ~/.llm-cli-gateway/sessions.json
 ```
 3. Or inspect the session file before editing it by hand:
 ```bash
 cat ~/.llm-cli-gateway/sessions.json
 ```
@@ -1038,19 +1147,20 @@ The gateway supports concurrent requests across different CLIs. Each request spa
 - **No Eval**: No dynamic code evaluation in our source (see "Socket alerts" below for the transitive `ajv` codegen case)
 - **Sandboxing**: Consider running in containers for production use
 - **Provenance**: Releases are published with [npm provenance](https://docs.npmjs.com/generating-provenance-statements) via OIDC trusted publishing from GitHub Actions
+- **Release signing**: GitHub release installer artifacts are signed with Sigstore keyless signing; verify `SHA256SUMS.sigstore.json` before trusting the checksum file
 ### Socket alerts — context for reviewers
 If you're vetting `llm-cli-gateway` through [Socket](https://socket.dev/npm/package/llm-cli-gateway) or a similar supply-chain scanner, you'll see three behavioural alerts and some dependency-ownership alerts. They are accurate descriptions of what the package does and what it depends on; we've left them visible (not silenced in `socket.yml`) so you don't have to take our word for it. Here's the context for each:
-| Alert | Where | Why it's bounded |
-|---|---|---|
-| **Network access** | `src/http-transport.ts` opens an HTTP MCP transport when started via `npm run start:http`. `src/endpoint-exposure.ts` issues a HEAD probe to verify configured public/tunnel URLs. | The transport binds to `127.0.0.1` by default and requires `LLM_GATEWAY_AUTH_TOKEN` to be set. The default stdio MCP entry point (`npm start`) opens no sockets. |
-| **Shell access** | `src/executor.ts` uses `child_process.spawn(cmd, args, …)` to invoke the underlying LLM CLIs. | `spawn` is called with an argument array and **never** `shell: true`, so there is no shell interpolation path for caller input. The command name is restricted to an allow-list of known CLI binaries (`claude`, `codex`, `gemini`, `grok`, `vibe`). |
-| **Uses eval** | None in our source. Transitive: `@modelcontextprotocol/sdk` → `ajv@8` uses `new Function(...)` in `ajv/dist/compile/index.js` to compile JSON Schema validators. | This is ajv's standard codegen path. Only known schemas (defined in our source and the MCP SDK) flow into it; no caller-supplied data ever reaches the compiled function body. |
-| **better-sqlite3 PRAGMA helper** | Transitive: `better-sqlite3/lib/methods/pragma.js` interpolates its caller-provided `source` into a `PRAGMA ${source}` statement. | We do not call `db.pragma()` from production source. Internal SQLite setup uses fixed literal `db.exec("PRAGMA ...")` statements, and `npm run security:audit` fails the release if production code reintroduces `.pragma()` calls. |
-| **ioredis obfuscated code** | Optional peer/dev dependency: `ioredis@5.10.1` may be flagged at `built/constants/TLSProfiles.js` for base64-looking strings. | Reviewed as a false positive. The file is a Redis Cloud TLS CA certificate bundle in PEM format, which is base64 by design. It contains no decoder loop, dynamic evaluation, network call, or hidden execution path. The same file is byte-for-byte identical in `ioredis@5.9.2`; our default production install does not install `ioredis`, and our code does not pass ioredis TLS profile options. |
-| **Dependency ownership** | A handful of small transitive packages (e.g. `bindings` via `better-sqlite3`, `media-typer` via `@modelcontextprotocol/sdk`) trip Socket's "unstable ownership" or "obfuscated code" heuristics. | These are pinned, well-known micro-deps in the Node ecosystem with no known issues. We pin direct override versions of `content-type` and `type-is` in `package.json#overrides`. Our previous direct dependency on `toml@3.0.0` (also single-maintainer, last released 2020) was replaced with the actively-maintained `smol-toml` to reduce inherited risk. |
+| Alert                            | Where                                                                                                                                                                                            | Why it's bounded                                                                                                                                                                                                                                                                                                                                                                                     |
+| -------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| **Network access**               | `src/http-transport.ts` opens an HTTP MCP transport when started via `npm run start:http`. `src/endpoint-exposure.ts` issues a HEAD probe to verify configured public/tunnel URLs.               | The transport binds to `127.0.0.1` by default and requires `LLM_GATEWAY_AUTH_TOKEN` to be set. The default stdio MCP entry point (`npm start`) opens no sockets.                                                                                                                                                                                                                                     |
+| **Shell access**                 | `src/executor.ts` uses `child_process.spawn(cmd, args, …)` to invoke the underlying LLM CLIs.                                                                                                    | `spawn` is called with an argument array and **never** `shell: true`, so there is no shell interpolation path for caller input. The command name is restricted to an allow-list of known CLI binaries (`claude`, `codex`, `gemini`, `grok`, `vibe`).                                                                                                                                                 |
+| **Uses eval**                    | None in our source. Transitive: `@modelcontextprotocol/sdk` → `ajv@8` uses `new Function(...)` in `ajv/dist/compile/index.js` to compile JSON Schema validators.                                 | This is ajv's standard codegen path. Only known schemas (defined in our source and the MCP SDK) flow into it; no caller-supplied data ever reaches the compiled function body.                                                                                                                                                                                                                       |
+| **better-sqlite3 PRAGMA helper** | Transitive: `better-sqlite3/lib/methods/pragma.js` interpolates its caller-provided `source` into a `PRAGMA ${source}` statement.                                                                | We do not call `db.pragma()` from production source. Internal SQLite setup uses fixed literal `db.exec("PRAGMA ...")` statements, and `npm run security:audit` fails the release if production code reintroduces `.pragma()` calls.                                                                                                                                                                  |
+| **ioredis obfuscated code**      | Optional peer/dev dependency: `ioredis@5.10.1` may be flagged at `built/constants/TLSProfiles.js` for base64-looking strings.                                                                    | Reviewed as a false positive. The file is a Redis Cloud TLS CA certificate bundle in PEM format, which is base64 by design. It contains no decoder loop, dynamic evaluation, network call, or hidden execution path. The same file is byte-for-byte identical in `ioredis@5.9.2`; our default production install does not install `ioredis`, and our code does not pass ioredis TLS profile options. |
+| **Dependency ownership**         | A handful of small transitive packages (e.g. `bindings` via `better-sqlite3`, `media-typer` via `@modelcontextprotocol/sdk`) trip Socket's "unstable ownership" or "obfuscated code" heuristics. | These are pinned, well-known micro-deps in the Node ecosystem with no known issues. We pin direct override versions of `content-type` and `type-is` in `package.json#overrides`. Our previous direct dependency on `toml@3.0.0` (also single-maintainer, last released 2020) was replaced with the actively-maintained `smol-toml` to reduce inherited risk.                                         |
 See [`socket.yml`](./socket.yml) for the same context in machine-readable form.
@@ -1070,6 +1180,7 @@ MIT. See [LICENSE](LICENSE) for details.
 ## Support
 For issues and questions:
 - Open an issue on GitHub
 - Check existing issues and documentation
 - Review CLI-specific documentation for CLI-related problems

package/dist/async-job-manager.js CHANGED

@@ -51,7 +51,7 @@ function truncateText(value, maxChars) {
         return { text: value, truncated: false };
     }
     return {
-        text: value.slice(value.length - maxChars),
+        text: value.slice(0, maxChars),
         truncated: true,
     };
 }
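
The effect of this one-line change is easiest to see side by side. The following is a minimal sketch, not the packaged function: the early-return guard is assumed (the hunk does not show the condition), and `truncateTextTail` is a hypothetical name used only to reproduce the 1.15.0 behaviour for comparison.

```javascript
// Sketch of the 1.15.2 behaviour: keep the leading maxChars characters.
// The length guard is an assumption; the hunk only shows its return value.
function truncateText(value, maxChars) {
    if (value.length <= maxChars) {
        return { text: value, truncated: false };
    }
    return { text: value.slice(0, maxChars), truncated: true };
}

// Hypothetical helper reproducing the 1.15.0 behaviour (tail only).
function truncateTextTail(value, maxChars) {
    if (value.length <= maxChars) {
        return { text: value, truncated: false };
    }
    return { text: value.slice(value.length - maxChars), truncated: true };
}

const output = "line-1\nline-2\nline-3";
console.log(truncateText(output, 6));     // { text: 'line-1', truncated: true }
console.log(truncateTextTail(output, 6)); // { text: 'line-3', truncated: true }
```

Callers that relied on seeing the end of a long job log would now need to request a larger bound or read the full output by other means.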
@@ -816,8 +816,9 @@ export class AsyncJobManager {
                 job.error = "Output exceeded maximum size (50MB)";
                 job.finishedAt = new Date().toISOString();
                 job.clearIdleTimer?.();
-                if (job.process)
+                if (job.process) {
                     killProcessGroup(job.process, "SIGTERM");
+                }
                 this.logger.info(`Job ${job.id} killed due to output overflow`, {
                     correlationId: job.correlationId,
                 });
@@ -825,11 +826,16 @@ export class AsyncJobManager {
                 this.persistComplete(job);
                 this.writeFlightComplete(job, "failed", "Output exceeded maximum size (50MB)");
                 this.fireOnComplete(job);
-                setTimeout(() => {
-                    if (!job.exited && job.process)
-                        killProcessGroup(job.process, "SIGKILL");
+                if (job.process) {
+                    setTimeout(() => {
+                        if (!job.exited && job.process)
+                            killProcessGroup(job.process, "SIGKILL");
+                        job.cleanupGroup?.();
+                    }, 5000);
+                }
+                else {
                     job.cleanupGroup?.();
-                }, 5000);
+                }
             }
             return;
         }
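
The control flow of the two hunks above can be followed without a real child process. This is a hedged sketch: `escalateAfterOverflow`, `kill`, and `schedule` are hypothetical names standing in for the overflow branch, `killProcessGroup`, and `setTimeout`, and the sketch merges the SIGTERM step and the delayed SIGKILL step into one function for readability.

```javascript
// Hypothetical stand-in for the overflow branch. `kill` and `schedule` are
// injected so the ordering can be observed without spawning anything.
function escalateAfterOverflow(job, kill, schedule, graceMs = 5000) {
    if (job.process) {
        kill(job.process, "SIGTERM");
        schedule(() => {
            // Escalate only if the process ignored SIGTERM.
            if (!job.exited && job.process) {
                kill(job.process, "SIGKILL");
            }
            job.cleanupGroup?.();
        }, graceMs);
    } else {
        // No process to signal: clean up now instead of waiting graceMs.
        job.cleanupGroup?.();
    }
}

const events = [];
const pending = [];
const kill = (proc, signal) => events.push(`${signal}:${proc.pid}`);
const schedule = (fn) => pending.push(fn);

escalateAfterOverflow(
    { process: { pid: 1 }, exited: false, cleanupGroup: () => events.push("cleanup:1") },
    kill,
    schedule,
);
escalateAfterOverflow(
    { process: null, exited: false, cleanupGroup: () => events.push("cleanup:2") },
    kill,
    schedule,
);
pending.forEach((fn) => fn()); // simulate the grace period elapsing
console.log(events);
```

In this sketch the job without a process is cleaned up before any timer fires, which appears to be the point of the new `else` branch.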

package/dist/executor.js CHANGED

@@ -139,18 +139,48 @@ export function resolveCommandForSpawn(command, args, options = {}) {
     if ([".cmd", ".bat"].includes(extname(resolved).toLowerCase())) {
         return {
             command: "cmd.exe",
-            args: ["/d", "/s", "/c", `"${buildWindowsCmdCommand(resolved, args)}"`],
+            args: [
+                "/d",
+                "/s",
+                "/c",
+                // Windows .cmd/.bat shims require cmd.exe. `buildWindowsCmdCommand`
+                // applies CommandLineToArgvW quoting and cmd metacharacter escaping
+                // to every dynamic segment before it reaches this shell boundary.
+                //
+                // codeql[js/shell-command-constructed-from-input]
+                `"${buildWindowsCmdCommand(resolved, args)}"`,
+            ],
             windowsVerbatimArguments: true,
         };
     }
     return { command: resolved, args };
 }
 function buildWindowsCmdCommand(command, args) {
+    // codeql[js/shell-command-constructed-from-input]
     return [escapeWindowsCmdCommand(command), ...args.map(escapeWindowsCmdArgument)].join(" ");
 }
-const WINDOWS_CMD_META_CHARS = /([()\][%!^"`<>&|;, *?])/g;
+const WINDOWS_CMD_META_CHARS = new Set([
+    "(",
+    ")",
+    "]",
+    "[",
+    "%",
+    "!",
+    "^",
+    '"',
+    "`",
+    "<",
+    ">",
+    "&",
+    "|",
+    ";",
+    ",",
+    " ",
+    "*",
+    "?",
+]);
 function escapeWindowsCmdCommand(value) {
-    return win32.normalize(value).replace(WINDOWS_CMD_META_CHARS, "^$1");
+    return escapeWindowsCmdMetaChars(win32.normalize(value));
 }
 // CommandLineToArgvW rules: a run of N backslashes before a literal " must be
 // doubled and followed by \" (yielding 2N+1 backslashes total, so the parser
@@ -158,11 +188,38 @@ function escapeWindowsCmdCommand(value) {
 // before the closing " must be doubled (2N) so the quote still terminates the
 // arg. Then wrap in quotes and caret-escape cmd.exe metacharacters.
 function escapeWindowsCmdArgument(value) {
-    let arg = `${value}`;
-    arg = arg.replace(/(\\*)"/g, '$1$1\\"');
-    arg = arg.replace(/(\\*)$/, "$1$1");
-    arg = `"${arg}"`;
-    return arg.replace(WINDOWS_CMD_META_CHARS, "^$1");
+    return escapeWindowsCmdMetaChars(quoteWindowsArgForCommandLineToArgv(`${value}`));
+}
+function quoteWindowsArgForCommandLineToArgv(value) {
+    let encoded = "";
+    let backslashes = 0;
+    for (const ch of value) {
+        if (ch === "\\") {
+            backslashes += 1;
+            continue;
+        }
+        if (ch === '"') {
+            encoded += "\\".repeat(backslashes * 2 + 1);
+            encoded += '"';
+            backslashes = 0;
+            continue;
+        }
+        encoded += "\\".repeat(backslashes);
+        backslashes = 0;
+        encoded += ch;
+    }
+    encoded += "\\".repeat(backslashes * 2);
+    return `"${encoded}"`;
+}
+function escapeWindowsCmdMetaChars(value) {
+    let escaped = "";
+    for (const ch of value) {
+        if (WINDOWS_CMD_META_CHARS.has(ch)) {
+            escaped += "^";
+        }
+        escaped += ch;
+    }
+    return escaped;
 }
 function resolveWindowsCommandPath(command, envPath) {
     if (/[\\/]/.test(command)) {
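
A few worked inputs make the two escaping layers easier to review. The functions below are copied from the added lines of this hunk; the only liberty taken is building the metacharacter set from a string literal instead of listing each entry, which should yield the same 18 members.

```javascript
// Same 18 characters as the Set in the hunk, written as one string.
const WINDOWS_CMD_META_CHARS = new Set([..."()][%!^\"`<>&|;, *?"]);

// Layer 1: CommandLineToArgvW quoting (backslash runs before a quote are doubled).
function quoteWindowsArgForCommandLineToArgv(value) {
    let encoded = "";
    let backslashes = 0;
    for (const ch of value) {
        if (ch === "\\") {
            backslashes += 1;
            continue;
        }
        if (ch === '"') {
            encoded += "\\".repeat(backslashes * 2 + 1);
            encoded += '"';
            backslashes = 0;
            continue;
        }
        encoded += "\\".repeat(backslashes);
        backslashes = 0;
        encoded += ch;
    }
    encoded += "\\".repeat(backslashes * 2);
    return `"${encoded}"`;
}

// Layer 2: caret-escape every cmd.exe metacharacter, including the quotes.
function escapeWindowsCmdMetaChars(value) {
    let escaped = "";
    for (const ch of value) {
        if (WINDOWS_CMD_META_CHARS.has(ch)) {
            escaped += "^";
        }
        escaped += ch;
    }
    return escaped;
}

function escapeWindowsCmdArgument(value) {
    return escapeWindowsCmdMetaChars(quoteWindowsArgForCommandLineToArgv(`${value}`));
}

console.log(quoteWindowsArgForCommandLineToArgv('say "hi"')); // "say \"hi\""
console.log(quoteWindowsArgForCommandLineToArgv("C:\\dir\\")); // "C:\dir\\"
console.log(escapeWindowsCmdArgument("a&b"));                  // ^"a^&b^"
```

The character-by-character loops replace the earlier regular-expression passes while producing the same quoting for these inputs.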

package/dist/index.js CHANGED

@@ -486,7 +486,7 @@ cwd) {
         jobId: job.id,
         cli,
         correlationId: corrId,
-        message: `Execution exceeded sync deadline (${SYNC_DEADLINE_MS}ms). Poll with llm_job_status, fetch with llm_job_result.`,
+        message: `Execution exceeded sync deadline (${SYNC_DEADLINE_MS}ms). Poll with llm_job_status, collect with llm_job_result.`,
     };
 }
 function isDeferredResponse(result) {
@@ -505,7 +505,7 @@ function buildDeferredToolResponse(deferred, sessionId) {
                     message: deferred.message,
                     sessionId: sessionId || null,
                     pollWith: "llm_job_status",
-                    fetchWith: "llm_job_result",
+                    collectWith: "llm_job_result",
                     cancelWith: "llm_job_cancel",
                 }, null, 2),
             },
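
Because the field name changes between 1.15.0 and 1.15.2, a client that talks to both versions may want to read either key. The sketch below is an assumption about how such a client could do that; `resultToolName` is a hypothetical helper and is not part of the gateway, and the payload values are illustrative.

```javascript
// Hypothetical client-side helper: prefer the 1.15.2 field, fall back to the
// 1.15.0 field, and return null if neither is present.
function resultToolName(deferred) {
    return deferred.collectWith ?? deferred.fetchWith ?? null;
}

// The gateway serialises the deferred payload as JSON text, so a client would
// parse it first. These payloads are made-up examples in that shape.
const fromNewGateway = JSON.parse(JSON.stringify({
    jobId: "job-123",
    pollWith: "llm_job_status",
    collectWith: "llm_job_result",
    cancelWith: "llm_job_cancel",
}));
const fromOldGateway = JSON.parse(JSON.stringify({
    jobId: "job-456",
    pollWith: "llm_job_status",
    fetchWith: "llm_job_result",
    cancelWith: "llm_job_cancel",
}));

console.log(resultToolName(fromNewGateway)); // llm_job_result
console.log(resultToolName(fromOldGateway)); // llm_job_result
```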

package/dist/job-store.js CHANGED

@@ -245,7 +245,7 @@ export class SqliteJobStore {
      */
     markOrphanedOnStartup() {
         const now = new Date().toISOString();
-        // Orphaned jobs retain a short window so callers can fetch the partial output,
+        // Orphaned jobs retain a short window so callers can collect the partial output,
         // then evict. Reuse the standard retention.
         const expiresAt = new Date(Date.now() + this.retentionMs).toISOString();
         // SELECT before UPDATE — gateway boot is single-threaded so no row can

package/dist/request-helpers.js CHANGED

@@ -626,10 +626,22 @@ export function prependGeminiAttachments(prompt, attachments) {
         if (!existsSync(p)) {
             throw new Error(`attachments: path does not exist: ${p}`);
         }
+        validateGeminiAttachmentTokenPath(p);
     }
     const tokens = attachments.map(p => `@${p}`).join(" ");
+    // Gemini attachments are prompt-level @path tokens rather than shell
+    // commands. Paths are absolute, existing, and token-safe before this join.
+    //
+    // codeql[js/shell-command-constructed-from-input]
     return `${tokens} ${prompt}`;
 }
+function validateGeminiAttachmentTokenPath(path) {
+    for (const ch of path) {
+        if (ch === "@" || ch <= " ") {
+            throw new Error(`attachments: path cannot be represented as a Gemini @path token without escaping: ${path}`);
+        }
+    }
+}
 /**
  * Zod schema for the U27 Gemini high-impact feature subset. Used by the
  * `gemini_request` / `gemini_request_async` tool schemas to validate the new
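
To see which paths the new validator rejects, here is a short sketch. `validateGeminiAttachmentTokenPath` is copied from the added lines above; `isTokenSafe` is a hypothetical wrapper added only to turn the thrown error into a boolean for the demo, and the paths are made up.

```javascript
// Copied from the hunk: reject "@" and any character at or below U+0020
// (space and ASCII control characters).
function validateGeminiAttachmentTokenPath(path) {
    for (const ch of path) {
        if (ch === "@" || ch <= " ") {
            throw new Error(`attachments: path cannot be represented as a Gemini @path token without escaping: ${path}`);
        }
    }
}

// Hypothetical wrapper for the demo only.
function isTokenSafe(path) {
    try {
        validateGeminiAttachmentTokenPath(path);
        return true;
    } catch {
        return false;
    }
}

console.log(isTokenSafe("/tmp/report.txt"));    // true
console.log(isTokenSafe("/tmp/my report.txt")); // false (space)
console.log(isTokenSafe("/tmp/a@b.txt"));       // false ("@")
console.log(isTokenSafe("/tmp/a\tb.txt"));      // false (tab)
```

Rejecting these characters keeps each `@path` token unambiguous once the tokens are joined with spaces and prepended to the prompt.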

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "llm-cli-gateway",
-  "version": "1.15.0",
+  "version": "1.15.2",
   "mcpName": "io.github.verivus-oss/llm-cli-gateway",
   "description": "MCP server providing unified access to Claude Code, Codex, Gemini, Grok, and Mistral Vibe CLIs with session management, retry logic, async job orchestration, durable job results, and cross-LLM validation.",
   "license": "MIT",

package/socket.yml CHANGED

@@ -14,24 +14,12 @@ version: 2
 #     src/endpoint-exposure.ts also issues a HEAD probe when verifying
 #     tunnel reachability — opt-in via the start:http entry point only.
 #
-#     Additionally, Socket may flag `dist/index.js` and `dist/job-store.js`
-#     against the `globalThis["fetch"]` rule. This is a substring-match
-#     false positive (verified for v1.6.0 by sub-agent investigation on
-#     2026-05-26; same matches exist in v1.5.35). Neither file contains
-#     any `fetch(`, `globalThis.fetch`, polyfill import, or any other
-#     network-call construct. The matches are:
-#       - dist/index.js — the English word "fetch" inside an async-defer
-#         error message ("Poll with llm_job_status, fetch with
-#         llm_job_result.") AND the JSON field name `fetchWith:
-#         "llm_job_result"` (part of the deferred-job response contract).
-#       - dist/job-store.js — the word "fetch" inside a code comment on
-#         markOrphanedOnStartup() describing how callers retrieve partial
-#         output from SQLite.
-#     Verify with: `grep -rEn "\bfetch\(|globalThis\.fetch|globalThis\[" dist/`
-#     — returns empty. Production code does not import undici / node-fetch
-#     / axios / got. The cache-awareness slice (v1.6.0) introduced zero
-#     new network surfaces; all I/O is filesystem (SQLite, sessions.json)
-#     or in-process.
+#     Historical note: Socket previously flagged `dist/index.js` and
+#     `dist/job-store.js` because async-job prose used retrieval wording that
+#     resembled a browser-network primitive. The package now uses "collect" /
+#     `collectWith` wording for deferred job results. Production code does not
+#     import bundled HTTP client libraries; all default I/O is filesystem
+#     (SQLite, sessions.json) or explicit local CLI process I/O.
 #
 #   shellAccess
 #     src/executor.ts uses child_process.spawn(cmd, args, { ... }) with a