PyPI - mcp2cli - Versions diffs - 2.1.0__tar.gz → 2.2.0__tar.gz - Mend

mcp2cli 2.1.0tar.gz → 2.2.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/PKG-INFO +18 -183
{mcp2cli-2.1.0 → mcp2cli-2.2.0}/README.md +17 -182
{mcp2cli-2.1.0 → mcp2cli-2.2.0}/pyproject.toml +1 -1
{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/__init__.py +31 -10
{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/__main__.py +0 -0
{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/py.typed +0 -0

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: mcp2cli
-Version: 2.1.0
+Version: 2.2.0
 Summary: Turn any MCP server or OpenAPI spec into a CLI
 Author: Stephan Fitzpatrick
 Author-email: Stephan Fitzpatrick <stephan@knowsuchagency.com>
@@ -68,25 +68,35 @@ mcp2cli --mcp https://mcp.example.com/sse --auth-header "x-api-key:sk-..." \
 # Force a specific transport (skip streamable HTTP fallback dance)
 mcp2cli --mcp https://mcp.example.com/sse --transport sse --list
+# Search tools by name or description (case-insensitive substring match)
+mcp2cli --mcp https://mcp.example.com/sse --search "task"
 ```
+`--search` implies `--list` and works across all modes (`--mcp`, `--spec`, `--graphql`, `--mcp-stdio`).
 ### OAuth authentication
-MCP servers that require OAuth are supported out of the box. mcp2cli handles token acquisition,
-caching, and refresh automatically.
+APIs that require OAuth are supported out of the box — across MCP, OpenAPI, and GraphQL modes.
+mcp2cli handles token acquisition, caching, and refresh automatically.
 ```bash
 # Authorization code + PKCE flow (opens browser for login)
 mcp2cli --mcp https://mcp.example.com/sse --oauth --list
+mcp2cli --spec https://api.example.com/openapi.json --oauth --list
+mcp2cli --graphql https://api.example.com/graphql --oauth --list
 # Client credentials flow (machine-to-machine, no browser)
-mcp2cli --mcp https://mcp.example.com/sse \
+mcp2cli --spec https://api.example.com/openapi.json \
   --oauth-client-id "my-client-id" \
   --oauth-client-secret "my-secret" \
-  search --query "test"
+  list-pets
 # With specific scopes
-mcp2cli --mcp https://mcp.example.com/sse --oauth --oauth-scope "read write" --list
+mcp2cli --graphql https://api.example.com/graphql --oauth --oauth-scope "read write" users
+# Local spec file — use --base-url for OAuth discovery
+mcp2cli --spec ./openapi.json --base-url https://api.example.com --oauth --list
 ```
 Tokens are persisted in `~/.cache/mcp2cli/oauth/` so subsequent calls reuse existing tokens
@@ -170,68 +180,7 @@ mcp2cli --graphql https://api.example.com/graphql users --fields "id name email"
 mcp2cli --graphql https://api.example.com/graphql --auth-header "Authorization:Bearer tok_..." users
 ```
-mcp2cli introspects the endpoint, discovers queries and mutations, auto-generates selection sets, and constructs parameterized queries with proper variable declarations. Here's what that looks like in practice:
-**GraphQL schema:**
-```graphql
-type Query {
-  users: [User!]!
-  user(id: ID!): User
-}
-type Mutation {
-  createUser(name: String!, email: String!, age: Int): User
-  deleteUser(id: ID!): Boolean
-}
-type User {
-  id: ID!
-  name: String!
-  email: String
-  age: Int
-  status: Status
-}
-enum Status { ACTIVE INACTIVE BANNED }
-```
-**What mcp2cli generates:**
-```
-$ mcp2cli --graphql https://api.example.com/graphql --list
-query:
-  users                                          List all users
-  user                                           Get a user by ID
-mutation:
-  create-user                                    Create a new user
-  delete-user                                    Delete a user by ID
-$ mcp2cli --graphql https://api.example.com/graphql create-user --help
-usage: mcp2cli create-user [--name NAME] [--email EMAIL] [--age AGE]
-  --name    User name (String!, required)
-  --email   User email (String!, required)
-  --age     User age (Int)
-$ mcp2cli --graphql https://api.example.com/graphql create-user --name "Alice" --email "alice@co.org"
-{"id": "4", "name": "Alice", "email": "alice@co.org", "age": null, "status": null}
-```
-No SDL parsing, no code generation — just point and run.
-### Tool search
-```bash
-# Search tools by name or description (case-insensitive substring match)
-mcp2cli --mcp https://mcp.example.com/sse --search "task"
-mcp2cli --spec ./openapi.json --search "create"
-mcp2cli --mcp-stdio "npx @mcp/server" --search "deploy"
-```
-`--search` implies `--list` — it filters the tool listing to matching results.
+mcp2cli introspects the endpoint, discovers queries and mutations, auto-generates selection sets, and constructs parameterized queries with proper variable declarations. No SDL parsing, no code generation — just point and run.
 ### Bake mode — save connection settings
@@ -347,121 +296,7 @@ Bake mode:
 Subcommands and their flags are generated dynamically from the spec or MCP server tool definitions. Run `<subcommand> --help` for details.
-## The problem: tool sprawl is eating your tokens
-If you've connected an LLM to more than a handful of tools, you've felt the pain. Every MCP server, every OpenAPI endpoint — their full schemas get injected into the system prompt on *every single turn*. Your 50-endpoint API costs 3,579 tokens of context *before the conversation even starts*, and that bill is paid again on every message, whether the model touches those tools or not.
-This isn't a theoretical concern. 6 MCP servers with 84 tools consume ~15,540 tokens at session start. Converting those servers to CLIs and letting the LLM discover tools on-demand can slash that cost by 92-98%.
-Even Anthropic recognized the problem, building [Tool Search](https://www.anthropic.com/engineering/advanced-tool-use) directly into their API — a deferred-loading pattern where tools are marked `defer_loading: true` and Claude discovers them via a search index (~500 tokens) instead of loading all schemas upfront. It typically cuts token usage by 85%. But when Tool Search fetches a tool, the full JSON Schema still enters context (~121 tokens/tool).
-mcp2cli takes the CLI approach further.
-## What mcp2cli adds
-The idea is simple: give the LLM a CLI instead of raw tool schemas, and let it `--list` and `--help` its way to what it needs. mcp2cli builds on this with a few key differences:
-- **No codegen, no recompilation.** Point mcp2cli at a spec URL or MCP server and the CLI exists immediately. When the server adds new endpoints, they appear on the next invocation — no rebuild step, no generated code to commit.
-- **Provider-agnostic.** Tool Search is an Anthropic API feature. mcp2cli works with any LLM — Claude, GPT, Gemini, local models — because it's just a CLI tool the model can shell out to.
-- **Compact discovery.** Tool Search defers loading but still injects full JSON schemas when a tool is fetched (~121 tokens/tool). mcp2cli's `--help` returns human-readable text that's typically cheaper than the raw schema, and `--list` summaries cost ~16 tokens/tool vs ~121 for native schemas.
-- **OpenAPI and GraphQL support.** MCP isn't the only schema-rich protocol. mcp2cli handles OpenAPI specs (JSON or YAML, local or remote) and GraphQL endpoints (via introspection) with the same CLI interface, the same caching, and the same on-demand discovery. One tool for all three worlds.
-- **Spec caching with TTL control.** Fetched specs and MCP tool lists are cached locally with configurable TTL, so repeated invocations don't hit the network. `--refresh` bypasses the cache when you need it.
-## The numbers: how much context do you actually save?
-We measured this. Not estimates — actual token counts using the cl100k_base tokenizer against real schemas, verified by [an automated test suite](tests/test_token_savings.py).
-### What mcp2cli actually costs
-Let's be upfront about what mcp2cli adds to context. It's not zero — it's just dramatically less than injecting full schemas.
-| Component | Cost | When |
-|---|--:|---|
-| System prompt | 67 tokens | Every turn (fixed) |
-| `--list` output | ~16 tokens/tool | Once per conversation |
-| `--help` output | ~80-200 tokens/tool | Once per unique tool used |
-| Tool call output | same as native | Per call |
-The `--list` cost scales linearly with the number of tools — 30 tools costs ~464 tokens, 120 tools costs ~1,850 tokens. This is still 7-8x cheaper than the full schemas, and you only pay it once.
-Compare that to native MCP injection: **~121 tokens per tool, every single turn**, whether the model uses those tools or not. For OpenAPI endpoints, it's ~72 tokens per endpoint per turn.
-### Over a full conversation
-Here's the total token cost across a realistic multi-turn conversation. The mcp2cli column includes all overhead: the system prompt on every turn, one `--list` discovery, `--help` for each unique tool the LLM actually uses, and tool call outputs.
-**MCP servers:**
-| Scenario | Turns | Unique tools used | Native total | mcp2cli total | Saved |
-|---|--:|--:|--:|--:|--:|
-| Task manager (30 tools) | 15 | 5 | 54,525 | 2,309 | **96%** |
-| Multi-server (80 tools) | 20 | 8 | 193,360 | 3,897 | **98%** |
-| Full platform (120 tools) | 25 | 10 | 362,350 | 5,181 | **99%** |
-**OpenAPI specs:**
-| Scenario | Turns | Unique endpoints used | Native total | mcp2cli total | Saved |
-|---|--:|--:|--:|--:|--:|
-| Petstore (5 endpoints) | 10 | 3 | 3,730 | 1,199 | **68%** |
-| Medium API (20 endpoints) | 15 | 5 | 21,720 | 1,905 | **91%** |
-| Large API (50 endpoints) | 20 | 8 | 71,940 | 2,810 | **96%** |
-| Enterprise API (200 endpoints) | 25 | 10 | 358,425 | 3,925 | **99%** |
-A 120-tool MCP platform over 25 turns: **357,169 tokens saved**.
-### Turn-by-turn: watching the gap widen
-Here's a 30-tool MCP server over 10 turns. The mcp2cli column includes the real costs: `--list` discovery on turn 1, `--help` + tool output when each new tool is first used.
-```
-Turn   Native       mcp2cli      Savings
-──────────────────────────────────────────────────────────
-1      3,619        531          3,088       ← --list (464 tokens)
-2      7,238        598          6,640
-3      10,887       815          10,072      ← --help (120) + tool call
-4      14,506       882          13,624
-5      18,155       1,099        17,056      ← --help (120) + tool call
-6      21,774       1,166        20,608
-7      25,423       1,383        24,040      ← --help (120) + tool call
-8      29,042       1,450        27,592
-9      32,691       1,667        31,024      ← --help (120) + tool call
-10     36,310       1,734        34,576
-Total: 34,576 tokens saved (95.2%)
-```
-### Why the gap is so large
-**Native MCP approach** — pay the full schema tax on every turn:
-```
-System prompt: "You have these 30 tools: [3,619 tokens of JSON schemas]"
-  → 3,619 tokens consumed per turn, whether used or not
-  → 10 turns = 36,310 tokens
-```
-**mcp2cli approach** — pay only for what you use:
-```
-System prompt: "Use mcp2cli --mcp <url> <command> [--flags]"   (67 tokens/turn)
-  → mcp2cli --mcp <url> --list                                (464 tokens, once)
-  → mcp2cli --mcp <url> create-task --help                    (120 tokens, once per tool)
-  → mcp2cli --mcp <url> create-task --title "Fix bug"         (0 extra tokens)
-  → 10 turns, 4 unique tools = 1,734 tokens
-```
-The LLM discovers what it needs, when it needs it. Everything else stays out of context.
-### The multi-server problem
-This is where it really hurts. Connect 3 MCP servers (a task manager, a filesystem server, and a database server — 60 tools total) and you're paying 7,238 tokens per turn. Over a 20-turn conversation, that's **145,060 tokens** just for tool schemas. mcp2cli reduces that to **3,288 tokens** — a **97.7% reduction** — even after accounting for `--list` discovery (928 tokens) and `--help` for 6 unique tools (720 tokens).
-## How it works
-1. **Load** -- Fetch the OpenAPI spec or connect to the MCP server. Resolve `$ref`s. Cache for reuse.
-2. **Extract** -- Walk the spec paths/tools and produce a uniform list of command definitions with typed parameters.
-3. **Build** -- Generate an argparse parser with subcommands, flags, types, choices, and help text.
-4. **Execute** -- Dispatch the parsed args as an HTTP request (OpenAPI) or tool call (MCP).
-Both adapters produce the same internal `CommandDef` structure, so the CLI builder and output handling are shared.
+> For token savings analysis, architecture details, and comparison to Anthropic's Tool Search, see the **[full writeup on the OCAI blog](https://www.orangecountyai.com/blog/mcp2cli-one-cli-for-every-api-zero-wasted-tokens)**.
 ## Development

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/README.md RENAMED Viewed

@@ -49,25 +49,35 @@ mcp2cli --mcp https://mcp.example.com/sse --auth-header "x-api-key:sk-..." \
 # Force a specific transport (skip streamable HTTP fallback dance)
 mcp2cli --mcp https://mcp.example.com/sse --transport sse --list
+# Search tools by name or description (case-insensitive substring match)
+mcp2cli --mcp https://mcp.example.com/sse --search "task"
 ```
+`--search` implies `--list` and works across all modes (`--mcp`, `--spec`, `--graphql`, `--mcp-stdio`).
 ### OAuth authentication
-MCP servers that require OAuth are supported out of the box. mcp2cli handles token acquisition,
-caching, and refresh automatically.
+APIs that require OAuth are supported out of the box — across MCP, OpenAPI, and GraphQL modes.
+mcp2cli handles token acquisition, caching, and refresh automatically.
 ```bash
 # Authorization code + PKCE flow (opens browser for login)
 mcp2cli --mcp https://mcp.example.com/sse --oauth --list
+mcp2cli --spec https://api.example.com/openapi.json --oauth --list
+mcp2cli --graphql https://api.example.com/graphql --oauth --list
 # Client credentials flow (machine-to-machine, no browser)
-mcp2cli --mcp https://mcp.example.com/sse \
+mcp2cli --spec https://api.example.com/openapi.json \
   --oauth-client-id "my-client-id" \
   --oauth-client-secret "my-secret" \
-  search --query "test"
+  list-pets
 # With specific scopes
-mcp2cli --mcp https://mcp.example.com/sse --oauth --oauth-scope "read write" --list
+mcp2cli --graphql https://api.example.com/graphql --oauth --oauth-scope "read write" users
+# Local spec file — use --base-url for OAuth discovery
+mcp2cli --spec ./openapi.json --base-url https://api.example.com --oauth --list
 ```
 Tokens are persisted in `~/.cache/mcp2cli/oauth/` so subsequent calls reuse existing tokens
@@ -151,68 +161,7 @@ mcp2cli --graphql https://api.example.com/graphql users --fields "id name email"
 mcp2cli --graphql https://api.example.com/graphql --auth-header "Authorization:Bearer tok_..." users
 ```
-mcp2cli introspects the endpoint, discovers queries and mutations, auto-generates selection sets, and constructs parameterized queries with proper variable declarations. Here's what that looks like in practice:
-**GraphQL schema:**
-```graphql
-type Query {
-  users: [User!]!
-  user(id: ID!): User
-}
-type Mutation {
-  createUser(name: String!, email: String!, age: Int): User
-  deleteUser(id: ID!): Boolean
-}
-type User {
-  id: ID!
-  name: String!
-  email: String
-  age: Int
-  status: Status
-}
-enum Status { ACTIVE INACTIVE BANNED }
-```
-**What mcp2cli generates:**
-```
-$ mcp2cli --graphql https://api.example.com/graphql --list
-query:
-  users                                          List all users
-  user                                           Get a user by ID
-mutation:
-  create-user                                    Create a new user
-  delete-user                                    Delete a user by ID
-$ mcp2cli --graphql https://api.example.com/graphql create-user --help
-usage: mcp2cli create-user [--name NAME] [--email EMAIL] [--age AGE]
-  --name    User name (String!, required)
-  --email   User email (String!, required)
-  --age     User age (Int)
-$ mcp2cli --graphql https://api.example.com/graphql create-user --name "Alice" --email "alice@co.org"
-{"id": "4", "name": "Alice", "email": "alice@co.org", "age": null, "status": null}
-```
-No SDL parsing, no code generation — just point and run.
-### Tool search
-```bash
-# Search tools by name or description (case-insensitive substring match)
-mcp2cli --mcp https://mcp.example.com/sse --search "task"
-mcp2cli --spec ./openapi.json --search "create"
-mcp2cli --mcp-stdio "npx @mcp/server" --search "deploy"
-```
-`--search` implies `--list` — it filters the tool listing to matching results.
+mcp2cli introspects the endpoint, discovers queries and mutations, auto-generates selection sets, and constructs parameterized queries with proper variable declarations. No SDL parsing, no code generation — just point and run.
 ### Bake mode — save connection settings
@@ -328,121 +277,7 @@ Bake mode:
 Subcommands and their flags are generated dynamically from the spec or MCP server tool definitions. Run `<subcommand> --help` for details.
-## The problem: tool sprawl is eating your tokens
-If you've connected an LLM to more than a handful of tools, you've felt the pain. Every MCP server, every OpenAPI endpoint — their full schemas get injected into the system prompt on *every single turn*. Your 50-endpoint API costs 3,579 tokens of context *before the conversation even starts*, and that bill is paid again on every message, whether the model touches those tools or not.
-This isn't a theoretical concern. 6 MCP servers with 84 tools consume ~15,540 tokens at session start. Converting those servers to CLIs and letting the LLM discover tools on-demand can slash that cost by 92-98%.
-Even Anthropic recognized the problem, building [Tool Search](https://www.anthropic.com/engineering/advanced-tool-use) directly into their API — a deferred-loading pattern where tools are marked `defer_loading: true` and Claude discovers them via a search index (~500 tokens) instead of loading all schemas upfront. It typically cuts token usage by 85%. But when Tool Search fetches a tool, the full JSON Schema still enters context (~121 tokens/tool).
-mcp2cli takes the CLI approach further.
-## What mcp2cli adds
-The idea is simple: give the LLM a CLI instead of raw tool schemas, and let it `--list` and `--help` its way to what it needs. mcp2cli builds on this with a few key differences:
-- **No codegen, no recompilation.** Point mcp2cli at a spec URL or MCP server and the CLI exists immediately. When the server adds new endpoints, they appear on the next invocation — no rebuild step, no generated code to commit.
-- **Provider-agnostic.** Tool Search is an Anthropic API feature. mcp2cli works with any LLM — Claude, GPT, Gemini, local models — because it's just a CLI tool the model can shell out to.
-- **Compact discovery.** Tool Search defers loading but still injects full JSON schemas when a tool is fetched (~121 tokens/tool). mcp2cli's `--help` returns human-readable text that's typically cheaper than the raw schema, and `--list` summaries cost ~16 tokens/tool vs ~121 for native schemas.
-- **OpenAPI and GraphQL support.** MCP isn't the only schema-rich protocol. mcp2cli handles OpenAPI specs (JSON or YAML, local or remote) and GraphQL endpoints (via introspection) with the same CLI interface, the same caching, and the same on-demand discovery. One tool for all three worlds.
-- **Spec caching with TTL control.** Fetched specs and MCP tool lists are cached locally with configurable TTL, so repeated invocations don't hit the network. `--refresh` bypasses the cache when you need it.
-## The numbers: how much context do you actually save?
-We measured this. Not estimates — actual token counts using the cl100k_base tokenizer against real schemas, verified by [an automated test suite](tests/test_token_savings.py).
-### What mcp2cli actually costs
-Let's be upfront about what mcp2cli adds to context. It's not zero — it's just dramatically less than injecting full schemas.
-| Component | Cost | When |
-|---|--:|---|
-| System prompt | 67 tokens | Every turn (fixed) |
-| `--list` output | ~16 tokens/tool | Once per conversation |
-| `--help` output | ~80-200 tokens/tool | Once per unique tool used |
-| Tool call output | same as native | Per call |
-The `--list` cost scales linearly with the number of tools — 30 tools costs ~464 tokens, 120 tools costs ~1,850 tokens. This is still 7-8x cheaper than the full schemas, and you only pay it once.
-Compare that to native MCP injection: **~121 tokens per tool, every single turn**, whether the model uses those tools or not. For OpenAPI endpoints, it's ~72 tokens per endpoint per turn.
-### Over a full conversation
-Here's the total token cost across a realistic multi-turn conversation. The mcp2cli column includes all overhead: the system prompt on every turn, one `--list` discovery, `--help` for each unique tool the LLM actually uses, and tool call outputs.
-**MCP servers:**
-| Scenario | Turns | Unique tools used | Native total | mcp2cli total | Saved |
-|---|--:|--:|--:|--:|--:|
-| Task manager (30 tools) | 15 | 5 | 54,525 | 2,309 | **96%** |
-| Multi-server (80 tools) | 20 | 8 | 193,360 | 3,897 | **98%** |
-| Full platform (120 tools) | 25 | 10 | 362,350 | 5,181 | **99%** |
-**OpenAPI specs:**
-| Scenario | Turns | Unique endpoints used | Native total | mcp2cli total | Saved |
-|---|--:|--:|--:|--:|--:|
-| Petstore (5 endpoints) | 10 | 3 | 3,730 | 1,199 | **68%** |
-| Medium API (20 endpoints) | 15 | 5 | 21,720 | 1,905 | **91%** |
-| Large API (50 endpoints) | 20 | 8 | 71,940 | 2,810 | **96%** |
-| Enterprise API (200 endpoints) | 25 | 10 | 358,425 | 3,925 | **99%** |
-A 120-tool MCP platform over 25 turns: **357,169 tokens saved**.
-### Turn-by-turn: watching the gap widen
-Here's a 30-tool MCP server over 10 turns. The mcp2cli column includes the real costs: `--list` discovery on turn 1, `--help` + tool output when each new tool is first used.
-```
-Turn   Native       mcp2cli      Savings
-──────────────────────────────────────────────────────────
-1      3,619        531          3,088       ← --list (464 tokens)
-2      7,238        598          6,640
-3      10,887       815          10,072      ← --help (120) + tool call
-4      14,506       882          13,624
-5      18,155       1,099        17,056      ← --help (120) + tool call
-6      21,774       1,166        20,608
-7      25,423       1,383        24,040      ← --help (120) + tool call
-8      29,042       1,450        27,592
-9      32,691       1,667        31,024      ← --help (120) + tool call
-10     36,310       1,734        34,576
-Total: 34,576 tokens saved (95.2%)
-```
-### Why the gap is so large
-**Native MCP approach** — pay the full schema tax on every turn:
-```
-System prompt: "You have these 30 tools: [3,619 tokens of JSON schemas]"
-  → 3,619 tokens consumed per turn, whether used or not
-  → 10 turns = 36,310 tokens
-```
-**mcp2cli approach** — pay only for what you use:
-```
-System prompt: "Use mcp2cli --mcp <url> <command> [--flags]"   (67 tokens/turn)
-  → mcp2cli --mcp <url> --list                                (464 tokens, once)
-  → mcp2cli --mcp <url> create-task --help                    (120 tokens, once per tool)
-  → mcp2cli --mcp <url> create-task --title "Fix bug"         (0 extra tokens)
-  → 10 turns, 4 unique tools = 1,734 tokens
-```
-The LLM discovers what it needs, when it needs it. Everything else stays out of context.
-### The multi-server problem
-This is where it really hurts. Connect 3 MCP servers (a task manager, a filesystem server, and a database server — 60 tools total) and you're paying 7,238 tokens per turn. Over a 20-turn conversation, that's **145,060 tokens** just for tool schemas. mcp2cli reduces that to **3,288 tokens** — a **97.7% reduction** — even after accounting for `--list` discovery (928 tokens) and `--help` for 6 unique tools (720 tokens).
-## How it works
-1. **Load** -- Fetch the OpenAPI spec or connect to the MCP server. Resolve `$ref`s. Cache for reuse.
-2. **Extract** -- Walk the spec paths/tools and produce a uniform list of command definitions with typed parameters.
-3. **Build** -- Generate an argparse parser with subcommands, flags, types, choices, and help text.
-4. **Execute** -- Dispatch the parsed args as an HTTP request (OpenAPI) or tool call (MCP).
-Both adapters produce the same internal `CommandDef` structure, so the CLI builder and output handling are shared.
+> For token savings analysis, architecture details, and comparison to Anthropic's Tool Search, see the **[full writeup on the OCAI blog](https://www.orangecountyai.com/blog/mcp2cli-one-cli-for-every-api-zero-wasted-tokens)**.
 ## Development

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "mcp2cli"
-version = "2.1.0"
+version = "2.2.0"
 description = "Turn any MCP server or OpenAPI spec into a CLI"
 readme = "README.md"
 license = "MIT"

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/__init__.py RENAMED Viewed

@@ -371,7 +371,7 @@ def build_oauth_provider(
     client_secret: str | None = None,
     scope: str | None = None,
 ) -> "httpx.Auth":
-    """Build an OAuth provider for MCP HTTP connections.
+    """Build an OAuth provider for HTTP connections.
     If client_id and client_secret are provided, uses client credentials flow.
     Otherwise, uses authorization code + PKCE with a local callback server.
@@ -482,6 +482,7 @@ def load_openapi_spec(
     cache_key: str | None,
     ttl: int,
     refresh: bool,
+    oauth_provider: "httpx.Auth | None" = None,
 ) -> dict:
     is_url = source.startswith("http://") or source.startswith("https://")
@@ -493,7 +494,7 @@ def load_openapi_spec(
                 return cached
         headers = dict(auth_headers)
-        with httpx.Client(timeout=30) as client:
+        with httpx.Client(timeout=30, auth=oauth_provider) as client:
             resp = client.get(source, headers=headers)
             resp.raise_for_status()
             raw = resp.text
@@ -840,6 +841,7 @@ def load_graphql_schema(
     cache_key: str | None,
     ttl: int,
     refresh: bool,
+    oauth_provider: "httpx.Auth | None" = None,
 ) -> dict:
     """POST introspection query to a GraphQL endpoint, with caching."""
     key = cache_key or cache_key_for(f"graphql:{url}")
@@ -850,7 +852,7 @@ def load_graphql_schema(
     headers = dict(auth_headers)
     headers.setdefault("Content-Type", "application/json")
-    with httpx.Client(timeout=30) as client:
+    with httpx.Client(timeout=30, auth=oauth_provider) as client:
         resp = client.post(
             url,
             headers=headers,
@@ -999,6 +1001,7 @@ def execute_graphql(
     raw: bool,
     toon: bool = False,
     fields_override: str | None = None,
+    oauth_provider: "httpx.Auth | None" = None,
 ):
     """Build and execute a GraphQL query/mutation."""
     types_by_name = {t["name"]: t for t in schema.get("types", []) if t.get("name")}
@@ -1044,7 +1047,7 @@ def execute_graphql(
     headers = dict(auth_headers)
     headers.setdefault("Content-Type", "application/json")
-    with httpx.Client(timeout=60) as client:
+    with httpx.Client(timeout=60, auth=oauth_provider) as client:
         resp = client.post(
             url,
             headers=headers,
@@ -1084,9 +1087,10 @@ def handle_graphql(
     refresh: bool,
     toon: bool = False,
     fields_override: str | None = None,
+    oauth_provider: "httpx.Auth | None" = None,
 ):
     """Top-level handler for --graphql mode."""
-    schema = load_graphql_schema(url, auth_headers, cache_key, ttl, refresh)
+    schema = load_graphql_schema(url, auth_headers, cache_key, ttl, refresh, oauth_provider=oauth_provider)
     commands = extract_graphql_commands(schema)
     if list_mode:
@@ -1110,7 +1114,7 @@ def handle_graphql(
     cmd: CommandDef = args._cmd
     execute_graphql(
         args, cmd, url, schema, auth_headers, pretty, raw, toon=toon,
-        fields_override=fields_override,
+        fields_override=fields_override, oauth_provider=oauth_provider,
     )
@@ -1565,6 +1569,7 @@ def execute_openapi(
     pretty: bool,
     raw: bool,
     toon: bool = False,
+    oauth_provider: "httpx.Auth | None" = None,
 ):
     path = cmd.path or ""
     # Substitute path parameters
@@ -1616,7 +1621,7 @@ def execute_openapi(
             if not body:
                 body = None
-    with httpx.Client(timeout=60) as client:
+    with httpx.Client(timeout=60, auth=oauth_provider) as client:
         resp = client.request(
             (cmd.method or "get").upper(),
             url,
@@ -2930,9 +2935,22 @@ def _main_impl(argv: list[str], bake_config: BakeConfig | None = None):
                 file=sys.stderr,
             )
             sys.exit(1)
-        if not pre_args.mcp:
+        if pre_args.mcp_stdio:
             print(
-                "Error: OAuth is only supported with --mcp (HTTP/SSE)", file=sys.stderr
+                "Error: OAuth is not supported with --mcp-stdio", file=sys.stderr
+            )
+            sys.exit(1)
+        # Determine OAuth server URL for discovery
+        server_url = pre_args.mcp or pre_args.graphql
+        if not server_url and pre_args.spec:
+            if pre_args.spec.startswith("http"):
+                server_url = pre_args.spec
+            else:
+                server_url = pre_args.base_url
+        if not server_url:
+            print(
+                "Error: OAuth requires an HTTP URL (use --base-url with local spec files)",
+                file=sys.stderr,
             )
             sys.exit(1)
         client_id = (
@@ -2946,7 +2964,7 @@ def _main_impl(argv: list[str], bake_config: BakeConfig | None = None):
             else None
         )
         oauth_provider = build_oauth_provider(
-            pre_args.mcp,
+            server_url,
             client_id=client_id,
             client_secret=client_secret,
             scope=pre_args.oauth_scope,
@@ -3129,6 +3147,7 @@ def _main_impl(argv: list[str], bake_config: BakeConfig | None = None):
             pre_args.refresh,
             toon=pre_args.toon,
             fields_override=pre_args.fields,
+            oauth_provider=oauth_provider,
         )
         return
@@ -3168,6 +3187,7 @@ def _main_impl(argv: list[str], bake_config: BakeConfig | None = None):
         pre_args.cache_key,
         pre_args.cache_ttl,
         pre_args.refresh,
+        oauth_provider=oauth_provider,
     )
     commands = extract_openapi_commands(spec)
     if bake_config:
@@ -3229,6 +3249,7 @@ def _main_impl(argv: list[str], bake_config: BakeConfig | None = None):
         pre_args.pretty,
         pre_args.raw,
         toon=pre_args.toon,
+        oauth_provider=oauth_provider,
     )

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/__main__.py RENAMED Viewed

File without changes

{mcp2cli-2.1.0 → mcp2cli-2.2.0}/src/mcp2cli/py.typed RENAMED Viewed

File without changes

mcp2cli 2.1.0__tar.gz → 2.2.0__tar.gz

mcp2cli 2.1.0tar.gz → 2.2.0tar.gz