native-devtools-mcp 0.3.3 → 0.3.5
- package/README.md +14 -2
- package/package.json +3 -3
package/README.md
CHANGED
@@ -15,7 +15,16 @@ A Model Context Protocol (MCP) server that provides **Computer Use** capabilitie
 
 [Features](#-features) • [Installation](#-installation) • [For AI Agents](#-for-ai-agents-llms) • [Permissions](#-required-permissions-macos)
 
-
+<table>
+<tr>
+<td align="center"><strong>macOS</strong></td>
+<td align="center"><strong>Windows</strong></td>
+</tr>
+<tr>
+<td><img src="demo.gif" width="450" alt="macOS Demo"></td>
+<td><img src="windows-demo-1.gif" width="450" alt="Windows Demo"></td>
+</tr>
+</table>
 
 </div>
 
@@ -45,7 +54,7 @@ This MCP server is designed to be **highly discoverable and usable** by AI model
 **Core Capabilities for System Prompts:**
 1. `take_screenshot`: The "eyes". Returns images + layout metadata + text locations (OCR).
 2. `click` / `type_text`: The "hands". Interacts with the system based on visual feedback.
-3. `find_text`: A shortcut to find text on screen and get its coordinates immediately.
+3. `find_text`: A shortcut to find text on screen and get its coordinates immediately. On Windows, uses **UI Automation** for precise element-level matching, with OCR fallback.
 4. `load_image` / `find_image`: Template matching for non-text UI elements (icons, shapes), returning screen coordinates for clicking.
 
 ## 📦 Installation (macOS + Windows)
@@ -181,6 +190,7 @@ graph TD
 subgraph "Your Machine"
 Sys -->|Screen/OCR| macOS[CoreGraphics / Vision]
 Sys -->|Input| Win[Win32 / SendInput]
+Sys -->|Text Search| UIA[UI Automation]
 Debug -.->|Inspect| App[Target App]
 end
 ```
@@ -196,6 +206,7 @@ graph TD
 | **Windows** | Screenshots | `BitBlt` (GDI) |
 | | Input | `SendInput` (Win32) |
 | | OCR | `Windows.Media.Ocr` (WinRT) |
+| | Text Search (`find_text`) | `UI Automation` (primary), OCR (fallback) |
 
 ### Screenshot Coordinate Precision
 
@@ -245,6 +256,7 @@ On macOS, you must grant permissions to the **host application** (e.g., Terminal
 
 Works out of the box on **Windows 10/11**.
 * Uses standard Win32 APIs (GDI, SendInput).
+* `find_text` uses **UI Automation (UIA)** as the primary search mechanism, querying the accessibility tree for element names. This is faster and more precise than OCR for standard UI elements (buttons, labels, menus). Falls back to OCR automatically when UIA finds no matches.
 * OCR uses the built-in Windows Media OCR engine (offline).
 * **Note:** Cannot interact with "Run as Administrator" windows unless the MCP server itself is also running as Administrator.
 
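Note on the `find_text` change: the README hunks above describe a primary/fallback search order on Windows (UI Automation first, OCR only when UIA finds no matches). The following is a minimal sketch of that control flow, not the package's actual implementation. `TextMatch`, `fake_uia_search`, and `fake_ocr_search` are hypothetical stand-ins invented for illustration; the real server queries the Windows accessibility tree and the Windows Media OCR engine.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class TextMatch:
    """Hypothetical result record: matched text, screen coordinates, and origin."""
    text: str
    x: int
    y: int
    source: str  # "uia" or "ocr"


# A searcher takes a query string and returns zero or more matches.
Searcher = Callable[[str], List[TextMatch]]


def find_text(query: str, uia_search: Searcher, ocr_search: Searcher) -> List[TextMatch]:
    """Try the accessibility-tree search first; use OCR only if it finds nothing."""
    matches = uia_search(query)
    if matches:
        return matches
    return ocr_search(query)


def fake_uia_search(query: str) -> List[TextMatch]:
    """Stand-in for a UI Automation query over element names (assumed data)."""
    elements = [
        TextMatch("Save", 120, 40, "uia"),
        TextMatch("Cancel", 200, 40, "uia"),
    ]
    return [m for m in elements if query.lower() in m.text.lower()]


def fake_ocr_search(query: str) -> List[TextMatch]:
    """Stand-in for OCR over a screenshot (assumed data)."""
    words = [
        TextMatch("Invoice", 300, 220, "ocr"),
        TextMatch("Save", 121, 41, "ocr"),
    ]
    return [m for m in words if query.lower() in m.text.lower()]


if __name__ == "__main__":
    # "Save" is a standard control, so the UIA result is used and OCR is skipped.
    print(find_text("save", fake_uia_search, fake_ocr_search))
    # "Invoice" is only visible as rendered pixels, so the OCR fallback answers.
    print(find_text("invoice", fake_uia_search, fake_ocr_search))
```

In this sketch the fallback is all-or-nothing: OCR results are never merged with UIA results, which matches the README wording ("Falls back to OCR automatically when UIA finds no matches") but is otherwise an assumption about the behaviour.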
package/package.json
CHANGED
@@ -1,6 +1,6 @@
 {
 "name": "native-devtools-mcp",
-"version": "0.3.
+"version": "0.3.5",
 "description": "MCP server for computer-use / desktop automation of native apps (screenshots, OCR, input)",
 "license": "MIT",
 "repository": {
@@ -39,8 +39,8 @@
 "bin"
 ],
 "optionalDependencies": {
-"@sh3ll3x3c/native-devtools-mcp-darwin-arm64": "0.3.
-"@sh3ll3x3c/native-devtools-mcp-win32-x64": "0.3.
+"@sh3ll3x3c/native-devtools-mcp-darwin-arm64": "0.3.5",
+"@sh3ll3x3c/native-devtools-mcp-win32-x64": "0.3.5"
 },
 "engines": {
 "node": ">=18"