minitap-mcp 0.4.2.tar.gz → 0.4.3.tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/PKG-INFO +112 -24
- minitap_mcp-0.4.3/PYPI_README.md +269 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/pyproject.toml +1 -1
- minitap_mcp-0.4.2/PYPI_README.md +0 -181
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/__init__.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/agents/compare_screenshots.md +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/agents/compare_screenshots.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/agents/extract_figma_assets.md +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/agents/extract_figma_assets.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/config.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/decorators.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/device.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/llm.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/logging_config.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/models.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/sdk_agent.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/core/utils.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/main.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/server/middleware.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/server/poller.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/tools/analyze_screen.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/tools/compare_screenshot_with_figma.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/tools/execute_mobile_command.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/tools/save_figma_assets.py +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/minitap/mcp/tools/screen_analyzer.md +0 -0
- {minitap_mcp-0.4.2 → minitap_mcp-0.4.3}/PKG-INFO

````diff
@@ -1,6 +1,6 @@
 Metadata-Version: 2.3
 Name: minitap-mcp
-Version: 0.4.2
+Version: 0.4.3
 Summary: Model Context Protocol server for controlling Android & iOS devices with natural language
 Author: Pierre-Louis Favreau, Jean-Pierre Lo, Clément Guiguet
 Requires-Dist: fastmcp>=2.12.4
@@ -84,11 +84,18 @@ minitap-mcp --server
 
 CLI flags always override environment variables when both are present.
 
-By default, the server will bind to `0.0.0.0:8000`.
+By default, the server will bind to `0.0.0.0:8000`. You can customize the port:
 
 ```bash
+# Using CLI argument
+minitap-mcp --server --port 9000
+
+# Or using environment variable
+export MCP_SERVER_PORT="9000"
+minitap-mcp --server
+
+# You can also customize the host
 export MCP_SERVER_HOST="0.0.0.0"
-export MCP_SERVER_PORT="8000"
 ```
 
 ## IDE Integration
@@ -120,41 +127,122 @@ export MCP_SERVER_PORT="8000"
 ```
 
 
-## Available Tools
+## Available Resources & Tools
+
+Once connected, your AI assistant can use these resources and tools:
+
+### Resource: `data://devices`
+
+Lists all connected mobile devices (Android and iOS).
 
-Once connected, your AI assistant can use these tools:
+**Returns:** Array of device information objects with:
 
-### `execute_mobile_command`
-Execute natural language commands on your mobile device using the Minitap SDK. This tool allows you to control your Android or iOS device using natural language.
+- `device_id`: Device serial (Android) or UDID (iOS)
+- `platform`: `"android"` or `"ios"`
+- `name`: Device name
+- `state`: Device state (`"connected"` or `"Booted"`)
+
+### Tool: `analyze_screen`
+
+Captures a screenshot from a mobile device and analyzes it using a vision language model.
 
 **Parameters:**
-- `goal` (required): High-level goal describing the action to perform
-- `output_description` (optional): Natural language description of the desired output format. Results are returned as structured JSON (e.g., "An array with sender and subject for each email")
-- `profile` (optional): Profile name to use (defaults to "default")
 
-**Examples:**
+- `prompt` (required): Analysis prompt describing what information to extract
+- `device_id` (optional): Specific device ID to target. If not provided, uses the first available device.
+
+**Returns:** AI-generated analysis of the screenshot based on the prompt.
+
+**Example:**
+
 ```
-"Open the settings app and tell me the battery level"
-"Find the first 3 unread emails in Gmail"
-"Open Google Maps and search for the nearest coffee shop"
-"Take a screenshot and save it"
+Prompt: "What app is currently open? List all visible UI elements."
 ```
 
-### `analyze_screen`
-Capture and analyze what's currently shown on the mobile device screen using a vision-capable LLM. Useful for understanding UI elements, extracting text, or identifying specific features.
+The tool will:
+
+1. Find the specified device (or first available)
+2. Capture a screenshot
+3. Analyze it with the vision model
+4. Return the analysis
+
+### Tool: `execute_mobile_command`
+
+Execute natural language commands on your mobile device using the mobile-use SDK. This tool allows you to control your Android or iOS device with simple instructions.
 
 **Parameters:**
-- `prompt` (required): Analysis prompt describing what information to extract
-- `device_id` (optional): Specific device ID to target
+
+- `goal` (required): Natural language command to execute on the device
+- `output_description` (optional): Description of the expected output format (e.g., "A JSON list of objects with sender and subject keys")
+- `profile` (optional): Name of the profile to use for this task. Defaults to 'default'
+
+**Returns:** Execution result with status, output, and any extracted data.
 
 **Examples:**
-```js
-"What app is currently open?"
-"Read the text messages visible on screen"
-"List all buttons and their labels on the current screen"
-"Extract the phone number displayed"
+
+```python
+# Simple command
+goal: "Go to settings and tell me my current battery level"
+
+# Data extraction with structured output
+goal: "Open Gmail, find first 3 unread emails, and list their sender and subject line"
+output_description: "A JSON list of objects, each with 'sender' and 'subject' keys"
+
+# App navigation
+goal: "Open Twitter and scroll to the latest tweet"
 ```
 
+The tool will:
+
+1. Find the specified device (or first available)
+2. Execute the command using the mobile-use AI agent
+3. Return the result or extracted data
+
+### Tool: `save_figma_assets`
+
+Fetch Figma design assets and React implementation code, then save them locally in the workspace.
+
+**Parameters:**
+
+- `node_id` (required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs like `https://figma.com/design/:fileKey/:fileName?node-id=1-2`
+- `file_key` (required): The file key from the Figma URL (e.g., "abc123" from `https://figma.com/design/abc123/MyFile`)
+- `workspace_path` (optional): The workspace path where assets should be saved. Defaults to current directory.
+
+**Returns:** Download summary with list of successfully downloaded assets and any failures.
+
+**Example:**
+
+```python
+node_id: "1:2"
+file_key: "abc123xyz"
+workspace_path: "."
+```
+
+The tool will:
+
+1. Call `get_design_context` from Figma MCP to get React/TypeScript code
+2. Extract all asset URLs from the code implementation
+3. Download each asset to `.mobile-use/figma_assets/<node-id>/` folder
+4. Save the code implementation to `.mobile-use/figma_assets/<node-id>/code_implementation.ts`
+5. Return a list of downloaded files with success/failure status
+
+### Tool: `compare_screenshot_with_figma`
+
+Compare a screenshot of the current mobile device state with a Figma design to identify visual differences.
+
+**Parameters:**
+
+- `node_id` (required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs like `https://figma.com/design/:fileKey/:fileName?node-id=1-2`
+
+**Returns:** Detailed comparison report with both the Figma design and current device screenshots for visual context.
+
+The tool will:
+
+1. Capture a screenshot of the current device state
+2. Fetch the Figma design screenshot
+3. Compare both screenshots using vision AI
+4. Return a detailed analysis highlighting differences
+
 ## Advanced Configuration
 
 ### Custom ADB Server
````
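The PKG-INFO hunk above documents the new precedence for the listening port: the `--port` CLI flag wins over `MCP_SERVER_PORT`, which in turn falls back to the default of 8000 ("CLI flags always override environment variables when both are present"). A minimal sketch of that documented precedence — the function name `resolve_port` and the use of `argparse` are illustrative assumptions, not the package's actual implementation:

```python
import argparse


def resolve_port(argv: list[str], environ: dict[str, str]) -> int:
    """Illustrate the documented precedence: CLI flag > env var > default 8000."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--port", type=int, default=None)
    args, _ = parser.parse_known_args(argv)
    if args.port is not None:
        # CLI flag always wins when both are present
        return args.port
    # Fall back to the environment variable, then the documented default
    return int(environ.get("MCP_SERVER_PORT", "8000"))
```

Passing the argv and environment explicitly (instead of reading `sys.argv`/`os.environ`) keeps the precedence rule easy to test in isolation.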
- minitap_mcp-0.4.3/PYPI_README.md ADDED

````diff
@@ -0,0 +1,269 @@
+# Minitap MCP Server
+
+A Model Context Protocol (MCP) server that enables AI assistants to control and interact with real mobile devices (Android & iOS) through natural language commands.
+
+## Quick Start
+
+### Installation
+
+```bash
+pip install minitap-mcp
+```
+
+### Prerequisites
+
+Before running the MCP server, ensure you have the required mobile automation tools installed:
+
+- **For Android devices:**
+  - [ADB (Android Debug Bridge)](https://developer.android.com/tools/adb) - For device communication
+  - [Maestro](https://maestro.mobile.dev/) - For mobile automation
+
+- **For iOS devices (macOS only):**
+  - Xcode Command Line Tools with `xcrun`
+  - [Maestro](https://maestro.mobile.dev/) - For mobile automation
+
+For detailed setup instructions, see the [mobile-use repository](https://github.com/minitap-ai/mobile-use).
+
+### Running the Server
+
+The simplest way to start:
+
+```bash
+minitap-mcp --server --api-key your_minitap_api_key
+```
+
+This starts the server on `localhost:8000` with your API key. Get your free API key at [platform.minitap.ai/api-keys](https://platform.minitap.ai/api-keys).
+
+**Available CLI options:**
+
+```bash
+minitap-mcp --server --api-key YOUR_KEY --llm-profile PROFILE_NAME
+```
+
+- `--api-key`: Your Minitap API key (overrides `MINITAP_API_KEY` env var). Get yours at [platform.minitap.ai/api-keys](https://platform.minitap.ai/api-keys).
+- `--llm-profile`: LLM profile name to use (overrides `MINITAP_LLM_PROFILE_NAME` env var). If unset, uses the default profile. Configure profiles at [platform.minitap.ai/llm-profiles](https://platform.minitap.ai/llm-profiles).
+
+### Configuration (Optional)
+
+Alternatively, you can set environment variables instead of using CLI flags:
+
+```bash
+export MINITAP_API_KEY="your_minitap_api_key"
+export MINITAP_API_BASE_URL="https://platform.minitap.ai/api/v1"
+export MINITAP_LLM_PROFILE_NAME="default"
+```
+
+You can set these in your `.bashrc` or equivalent, then simply run:
+
+```bash
+minitap-mcp --server
+```
+
+CLI flags always override environment variables when both are present.
+
+By default, the server will bind to `0.0.0.0:8000`. You can customize the port:
+
+```bash
+# Using CLI argument
+minitap-mcp --server --port 9000
+
+# Or using environment variable
+export MCP_SERVER_PORT="9000"
+minitap-mcp --server
+
+# You can also customize the host
+export MCP_SERVER_HOST="0.0.0.0"
+```
+
+## IDE Integration
+
+1. Start the server: `minitap-mcp --server --api-key your_minitap_api_key`
+2. Add to your IDE MCP settings file:
+
+```jsonc
+# For Windsurf
+{
+  "mcpServers": {
+    "minitap-mcp": {
+      "serverUrl": "http://localhost:8000/mcp"
+    }
+  }
+}
+```
+
+```jsonc
+# For Cursor
+{
+  "mcpServers": {
+    "minitap-mcp": {
+      "transport": "http",
+      "url": "http://localhost:8000/mcp"
+    }
+  }
+}
+```
+
+
+## Available Resources & Tools
+
+Once connected, your AI assistant can use these resources and tools:
+
+### Resource: `data://devices`
+
+Lists all connected mobile devices (Android and iOS).
+
+**Returns:** Array of device information objects with:
+
+- `device_id`: Device serial (Android) or UDID (iOS)
+- `platform`: `"android"` or `"ios"`
+- `name`: Device name
+- `state`: Device state (`"connected"` or `"Booted"`)
+
+### Tool: `analyze_screen`
+
+Captures a screenshot from a mobile device and analyzes it using a vision language model.
+
+**Parameters:**
+
+- `prompt` (required): Analysis prompt describing what information to extract
+- `device_id` (optional): Specific device ID to target. If not provided, uses the first available device.
+
+**Returns:** AI-generated analysis of the screenshot based on the prompt.
+
+**Example:**
+
+```
+Prompt: "What app is currently open? List all visible UI elements."
+```
+
+The tool will:
+
+1. Find the specified device (or first available)
+2. Capture a screenshot
+3. Analyze it with the vision model
+4. Return the analysis
+
+### Tool: `execute_mobile_command`
+
+Execute natural language commands on your mobile device using the mobile-use SDK. This tool allows you to control your Android or iOS device with simple instructions.
+
+**Parameters:**
+
+- `goal` (required): Natural language command to execute on the device
+- `output_description` (optional): Description of the expected output format (e.g., "A JSON list of objects with sender and subject keys")
+- `profile` (optional): Name of the profile to use for this task. Defaults to 'default'
+
+**Returns:** Execution result with status, output, and any extracted data.
+
+**Examples:**
+
+```python
+# Simple command
+goal: "Go to settings and tell me my current battery level"
+
+# Data extraction with structured output
+goal: "Open Gmail, find first 3 unread emails, and list their sender and subject line"
+output_description: "A JSON list of objects, each with 'sender' and 'subject' keys"
+
+# App navigation
+goal: "Open Twitter and scroll to the latest tweet"
+```
+
+The tool will:
+
+1. Find the specified device (or first available)
+2. Execute the command using the mobile-use AI agent
+3. Return the result or extracted data
+
+### Tool: `save_figma_assets`
+
+Fetch Figma design assets and React implementation code, then save them locally in the workspace.
+
+**Parameters:**
+
+- `node_id` (required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs like `https://figma.com/design/:fileKey/:fileName?node-id=1-2`
+- `file_key` (required): The file key from the Figma URL (e.g., "abc123" from `https://figma.com/design/abc123/MyFile`)
+- `workspace_path` (optional): The workspace path where assets should be saved. Defaults to current directory.
+
+**Returns:** Download summary with list of successfully downloaded assets and any failures.
+
+**Example:**
+
+```python
+node_id: "1:2"
+file_key: "abc123xyz"
+workspace_path: "."
+```
+
+The tool will:
+
+1. Call `get_design_context` from Figma MCP to get React/TypeScript code
+2. Extract all asset URLs from the code implementation
+3. Download each asset to `.mobile-use/figma_assets/<node-id>/` folder
+4. Save the code implementation to `.mobile-use/figma_assets/<node-id>/code_implementation.ts`
+5. Return a list of downloaded files with success/failure status
+
+### Tool: `compare_screenshot_with_figma`
+
+Compare a screenshot of the current mobile device state with a Figma design to identify visual differences.
+
+**Parameters:**
+
+- `node_id` (required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs like `https://figma.com/design/:fileKey/:fileName?node-id=1-2`
+
+**Returns:** Detailed comparison report with both the Figma design and current device screenshots for visual context.
+
+The tool will:
+
+1. Capture a screenshot of the current device state
+2. Fetch the Figma design screenshot
+3. Compare both screenshots using vision AI
+4. Return a detailed analysis highlighting differences
+
+## Advanced Configuration
+
+### Custom ADB Server
+
+If using a remote or custom ADB server (like on WSL):
+
+```bash
+export ADB_SERVER_SOCKET="tcp:192.168.1.100:5037"
+```
+
+### Vision Model
+
+Customize the vision model used for screen analysis:
+
+```bash
+export VISION_MODEL="qwen/qwen-2.5-vl-7b-instruct"
+```
+
+## Device Setup
+
+### Android
+1. Enable USB debugging on your device
+2. Connect via USB or network ADB
+3. Verify connection: `adb devices`
+
+### iOS (macOS only)
+1. Install Xcode Command Line Tools
+2. Start a simulator or connect a physical device
+3. Verify: `xcrun simctl list devices booted`
+
+## Troubleshooting
+
+**No devices found:**
+- Verify ADB/xcrun connection
+- Check USB debugging is enabled (Android)
+- Ensure device is unlocked
+
+**Connection refused errors:**
+- Check ADB/xcrun connection
+
+**API authentication errors:**
+- Verify `MINITAP_API_KEY` is set correctly
+
+## Links
+
+- **Mobile-Use SDK:** [github.com/minitap-ai/mobile-use](https://github.com/minitap-ai/mobile-use)
+- **Mobile-Use Documentation:** [docs.minitap.ai](https://docs.minitap.ai)
````
- minitap_mcp-0.4.2/PYPI_README.md DELETED

````diff
@@ -1,181 +0,0 @@
-# Minitap MCP Server
-
-A Model Context Protocol (MCP) server that enables AI assistants to control and interact with real mobile devices (Android & iOS) through natural language commands.
-
-## Quick Start
-
-### Installation
-
-```bash
-pip install minitap-mcp
-```
-
-### Prerequisites
-
-Before running the MCP server, ensure you have the required mobile automation tools installed:
-
-- **For Android devices:**
-  - [ADB (Android Debug Bridge)](https://developer.android.com/tools/adb) - For device communication
-  - [Maestro](https://maestro.mobile.dev/) - For mobile automation
-
-- **For iOS devices (macOS only):**
-  - Xcode Command Line Tools with `xcrun`
-  - [Maestro](https://maestro.mobile.dev/) - For mobile automation
-
-For detailed setup instructions, see the [mobile-use repository](https://github.com/minitap-ai/mobile-use).
-
-### Running the Server
-
-The simplest way to start:
-
-```bash
-minitap-mcp --server --api-key your_minitap_api_key
-```
-
-This starts the server on `localhost:8000` with your API key. Get your free API key at [platform.minitap.ai/api-keys](https://platform.minitap.ai/api-keys).
-
-**Available CLI options:**
-
-```bash
-minitap-mcp --server --api-key YOUR_KEY --llm-profile PROFILE_NAME
-```
-
-- `--api-key`: Your Minitap API key (overrides `MINITAP_API_KEY` env var). Get yours at [platform.minitap.ai/api-keys](https://platform.minitap.ai/api-keys).
-- `--llm-profile`: LLM profile name to use (overrides `MINITAP_LLM_PROFILE_NAME` env var). If unset, uses the default profile. Configure profiles at [platform.minitap.ai/llm-profiles](https://platform.minitap.ai/llm-profiles).
-
-### Configuration (Optional)
-
-Alternatively, you can set environment variables instead of using CLI flags:
-
-```bash
-export MINITAP_API_KEY="your_minitap_api_key"
-export MINITAP_API_BASE_URL="https://platform.minitap.ai/api/v1"
-export MINITAP_LLM_PROFILE_NAME="default"
-```
-
-You can set these in your `.bashrc` or equivalent, then simply run:
-
-```bash
-minitap-mcp --server
-```
-
-CLI flags always override environment variables when both are present.
-
-By default, the server will bind to `0.0.0.0:8000`. Configure via environment variables:
-
-```bash
-export MCP_SERVER_HOST="0.0.0.0"
-export MCP_SERVER_PORT="8000"
-```
-
-## IDE Integration
-
-1. Start the server: `minitap-mcp --server --api-key your_minitap_api_key`
-2. Add to your IDE MCP settings file:
-
-```jsonc
-# For Windsurf
-{
-  "mcpServers": {
-    "minitap-mcp": {
-      "serverUrl": "http://localhost:8000/mcp"
-    }
-  }
-}
-```
-
-```jsonc
-# For Cursor
-{
-  "mcpServers": {
-    "minitap-mcp": {
-      "transport": "http",
-      "url": "http://localhost:8000/mcp"
-    }
-  }
-}
-```
-
-
-## Available Tools
-
-Once connected, your AI assistant can use these tools:
-
-### `execute_mobile_command`
-Execute natural language commands on your mobile device using the Minitap SDK. This tool allows you to control your Android or iOS device using natural language.
-
-**Parameters:**
-- `goal` (required): High-level goal describing the action to perform
-- `output_description` (optional): Natural language description of the desired output format. Results are returned as structured JSON (e.g., "An array with sender and subject for each email")
-- `profile` (optional): Profile name to use (defaults to "default")
-
-**Examples:**
-```
-"Open the settings app and tell me the battery level"
-"Find the first 3 unread emails in Gmail"
-"Open Google Maps and search for the nearest coffee shop"
-"Take a screenshot and save it"
-```
-
-### `analyze_screen`
-Capture and analyze what's currently shown on the mobile device screen using a vision-capable LLM. Useful for understanding UI elements, extracting text, or identifying specific features.
-
-**Parameters:**
-- `prompt` (required): Analysis prompt describing what information to extract
-- `device_id` (optional): Specific device ID to target
-
-**Examples:**
-```js
-"What app is currently open?"
-"Read the text messages visible on screen"
-"List all buttons and their labels on the current screen"
-"Extract the phone number displayed"
-```
-
-## Advanced Configuration
-
-### Custom ADB Server
-
-If using a remote or custom ADB server (like on WSL):
-
-```bash
-export ADB_SERVER_SOCKET="tcp:192.168.1.100:5037"
-```
-
-### Vision Model
-
-Customize the vision model used for screen analysis:
-
-```bash
-export VISION_MODEL="qwen/qwen-2.5-vl-7b-instruct"
-```
-
-## Device Setup
-
-### Android
-1. Enable USB debugging on your device
-2. Connect via USB or network ADB
-3. Verify connection: `adb devices`
-
-### iOS (macOS only)
-1. Install Xcode Command Line Tools
-2. Start a simulator or connect a physical device
-3. Verify: `xcrun simctl list devices booted`
-
-## Troubleshooting
-
-**No devices found:**
-- Verify ADB/xcrun connection
-- Check USB debugging is enabled (Android)
-- Ensure device is unlocked
-
-**Connection refused errors:**
-- Check ADB/xcrun connection
-
-**API authentication errors:**
-- Verify `MINITAP_API_KEY` is set correctly
-
-## Links
-
-- **Mobile-Use SDK:** [github.com/minitap-ai/mobile-use](https://github.com/minitap-ai/mobile-use)
-- **Mobile-Use Documentation:** [docs.minitap.ai](https://docs.minitap.ai)
````