windows-mcp 0.3.0__tar.gz

@@ -0,0 +1,21 @@
1
+ MIT License
2
+
3
+ Copyright (c) 2025 JEOMON GEORGE
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
@@ -0,0 +1,361 @@
1
+ Metadata-Version: 2.4
2
+ Name: windows-mcp
3
+ Version: 0.3.0
4
+ Summary: Lightweight MCP Server for interacting with Windows Operating System.
5
+ Author-email: Jeomon George <jeogeoalukka@gmail.com>
6
+ License: MIT License
7
+
8
+ Copyright (c) 2025 JEOMON GEORGE
9
+
10
+ Permission is hereby granted, free of charge, to any person obtaining a copy
11
+ of this software and associated documentation files (the "Software"), to deal
12
+ in the Software without restriction, including without limitation the rights
13
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
14
+ copies of the Software, and to permit persons to whom the Software is
15
+ furnished to do so, subject to the following conditions:
16
+
17
+ The above copyright notice and this permission notice shall be included in all
18
+ copies or substantial portions of the Software.
19
+
20
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
21
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
22
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
23
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
24
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
25
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
26
+ SOFTWARE.
27
+ Project-URL: homepage, https://github.com/CursorTouch
28
+ Keywords: windows,mcp,ai,desktop,ai agent
29
+ Requires-Python: >=3.13
30
+ Description-Content-Type: text/markdown
31
+ License-File: LICENSE.md
32
+ Requires-Dist: click>=8.2.1
33
+ Requires-Dist: fastmcp>=2.8.1
34
+ Requires-Dist: fuzzywuzzy>=0.18.0
35
+ Requires-Dist: humancursor>=1.1.5
36
+ Requires-Dist: ipykernel>=6.30.0
37
+ Requires-Dist: live-inspect>=0.1.1
38
+ Requires-Dist: markdownify>=1.1.0
39
+ Requires-Dist: pillow>=11.2.1
40
+ Requires-Dist: psutil>=7.0.0
41
+ Requires-Dist: pyautogui>=0.9.54
42
+ Requires-Dist: pygetwindow>=0.0.9
43
+ Requires-Dist: python-levenshtein>=0.27.1
44
+ Requires-Dist: requests>=2.32.3
45
+ Requires-Dist: uiautomation>=2.0.24
46
+ Dynamic: license-file
47
+
48
+ <div align="center">
49
+ <h1>🪟 Windows-MCP</h1>
50
+
51
+ <a href="https://github.com/CursorTouch/Windows-MCP/blob/main/LICENSE">
52
+ <img src="https://img.shields.io/badge/license-MIT-green" alt="License">
53
+ </a>
54
+ <img src="https://img.shields.io/badge/python-3.13%2B-blue" alt="Python">
55
+ <img src="https://img.shields.io/badge/platform-Windows%207–11-blue" alt="Platform: Windows 7 to 11">
56
+ <img src="https://img.shields.io/github/last-commit/CursorTouch/Windows-MCP" alt="Last Commit">
57
+ <br>
58
+ <a href="https://x.com/CursorTouch">
59
+ <img src="https://img.shields.io/badge/follow-%40CursorTouch-1DA1F2?logo=twitter&style=flat" alt="Follow on Twitter">
60
+ </a>
61
+ <a href="https://discord.com/invite/Aue9Yj2VzS">
62
+ <img src="https://img.shields.io/badge/Join%20on-Discord-5865F2?logo=discord&logoColor=white&style=flat" alt="Join us on Discord">
63
+ </a>
64
+
65
+ </div>
66
+
67
+ <br>
68
+
69
+ **Windows MCP** is a lightweight, open-source project that enables seamless integration between AI agents and the Windows operating system. Acting as an MCP server, it bridges the gap between LLMs and the Windows operating system, allowing agents to perform tasks such as **file navigation, application control, UI interaction, QA testing,** and more.
70
+
71
+ ## Updates
72
+
73
+ - Try out 🪟[Windows-Use](https://github.com/CursorTouch/Windows-Use), an agent built using Windows-MCP.
74
+ - Windows-MCP is now featured as a Desktop Extension in `Claude Desktop`.
75
+
76
+ ### Supported Operating Systems
77
+
78
+ - Windows 7
79
+ - Windows 8, 8.1
80
+ - Windows 10
81
+ - Windows 11
82
+
83
+ ## 🎥 Demos
84
+
85
+ <https://github.com/user-attachments/assets/d0e7ed1d-6189-4de6-838a-5ef8e1cad54e>
86
+
87
+ <https://github.com/user-attachments/assets/d2b372dc-8d00-4d71-9677-4c64f5987485>
88
+
89
+ ## ✨ Key Features
90
+
91
+ - **Seamless Windows Integration**
92
+ Interacts natively with Windows UI elements, opens apps, controls windows, simulates user input, and more.
93
+
94
+ - **Use Any LLM (Vision Optional)**
95
+ Unlike many automation tools, Windows MCP doesn't rely on traditional computer vision techniques or specific fine-tuned models; it works with any LLM, reducing complexity and setup time.
96
+
97
+ - **Rich Toolset for UI Automation**
98
+ Includes tools for basic keyboard and mouse operations and for capturing window/UI state.
99
+
100
+ - **Lightweight & Open-Source**
101
+ Minimal dependencies and easy setup, with full source code available under the MIT license.
102
+
103
+ - **Customizable & Extendable**
104
+ Easily adapt or extend tools to suit your unique automation or AI integration needs.
105
+
106
+ - **Real-Time Interaction**
107
+ Typical latency between actions (e.g., from one mouse click to the next) ranges from **0.7 to 2.5 seconds** and may vary slightly with the number of active applications, the system load, and the inference speed of the LLM.
108
+
109
+ ## 🛠️Installation
110
+
111
+ ### Prerequisites
112
+
113
+ - Python 3.13+
114
+ - UV (Package Manager) from Astral, install with `pip install uv` or `curl -LsSf https://astral.sh/uv/install.sh | sh`
115
+ - `English` as the default Windows language is highly preferred. On Windows installations that use another language, disable the `Launch-Tool` and `Switch-Tool` in the MCP server.
116
+
117
+ <details>
118
+ <summary>Install in Claude Desktop</summary>
119
+
120
+ 1. Install [Claude Desktop](https://claude.ai/download) and the `dxt` CLI:
121
+
122
+ ```shell
123
+ npm install -g @anthropic-ai/dxt
124
+ ```
125
+
126
+ 2. Clone the repository.
127
+
128
+ ```shell
129
+ git clone https://github.com/CursorTouch/Windows-MCP.git
130
+
131
+ cd Windows-MCP
132
+ ```
133
+
134
+ 3. Build Desktop Extension `DXT`:
135
+
136
+ ```shell
137
+ npx @anthropic-ai/dxt pack
138
+ ```
139
+
140
+ 4. Open Claude Desktop:
141
+
142
+ Go to `Settings->Extensions->Advanced Settings->Install Extension`, locate the `.dxt` file, and click Install.
143
+
144
+ 5. Enjoy 🥳.
145
+
146
+ For additional Claude Desktop integration troubleshooting, see the [MCP documentation](https://modelcontextprotocol.io/quickstart/server#claude-for-desktop-integration-issues). The documentation includes helpful tips for checking logs and resolving common issues.
147
+ </details>
148
+
149
+ <details>
150
+ <summary>Install in Perplexity Desktop</summary>
151
+
152
+ 1. Install [Perplexity Desktop](https://apps.microsoft.com/detail/xp8jnqfbqh6pvf):
153
+
154
+ 2. Clone the repository.
155
+
156
+ ```shell
157
+ git clone https://github.com/CursorTouch/Windows-MCP.git
158
+
159
+ cd Windows-MCP
160
+ ```
161
+
162
+ 3. Open Perplexity Desktop:
163
+
164
+ Go to `Settings->Connectors->Add Connector->Advanced`
165
+
166
+ 4. Enter the name as `Windows-MCP`, then paste the following JSON in the text area.
167
+
168
+ ```json
169
+ {
170
+ "command": "uv",
171
+ "args": [
172
+ "--directory",
173
+ "<path to the windows-mcp directory>",
174
+ "run",
175
+ "main.py"
176
+ ]
177
+ }
178
+ ```
179
+
180
+ 5. Click `Save` and Enjoy 🥳.
181
+
182
+ For additional Perplexity Desktop integration troubleshooting, see the [Perplexity MCP Support](https://www.perplexity.ai/help-center/en/articles/11502712-local-and-remote-mcps-for-perplexity). The documentation includes helpful tips for checking logs and resolving common issues.
183
+ </details>
184
+
185
+ <details>
186
+ <summary>Install in Gemini CLI</summary>
187
+
188
+ 1. Install Gemini CLI:
189
+
190
+ ```shell
191
+ npm install -g @google/gemini-cli
192
+ ```
193
+
194
+ 2. Clone the repository.
195
+
196
+ ```shell
197
+ git clone https://github.com/CursorTouch/Windows-MCP.git
198
+
199
+ cd Windows-MCP
200
+ ```
201
+
202
+ 3. Navigate to `%USERPROFILE%/.gemini` in File Explorer and open `settings.json`.
203
+
204
+ 4. Add the `windows-mcp` config in the `settings.json` and save it.
205
+
206
+ ```json
207
+ {
208
+ "theme": "Default",
209
+ ...
210
+ //MCP Server Config
211
+ "mcpServers": {
212
+ "windows-mcp": {
213
+ "command": "uv",
214
+ "args": [
215
+ "--directory",
216
+ "<path to the windows-mcp directory>",
217
+ "run",
218
+ "main.py"
219
+ ]
220
+ }
221
+ }
222
+ }
223
+ ```
224
+
225
+ 5. Rerun Gemini CLI in terminal. Enjoy 🥳
226
+ </details>
227
+
228
+ <details>
229
+ <summary>Install in Qwen Code</summary>
230
+ 1. Install Qwen Code:
231
+
232
+ ```shell
233
+ npm install -g @qwen-code/qwen-code@latest
234
+ ```
235
+ 2. Clone the repository.
236
+
237
+ ```shell
238
+ git clone https://github.com/CursorTouch/Windows-MCP.git
239
+
240
+ cd Windows-MCP
241
+ ```
242
+
243
+ 3. Navigate to `%USERPROFILE%/.qwen/settings.json`.
244
+
245
+ 4. Add the `windows-mcp` config in the `settings.json` and save it.
246
+
247
+ ```json
248
+ {
249
+ //MCP Server Config
250
+ "mcpServers": {
251
+ "windows-mcp": {
252
+ "command": "uv",
253
+ "args": [
254
+ "--directory",
255
+ "<path to the windows-mcp directory>",
256
+ "run",
257
+ "main.py"
258
+ ]
259
+ }
260
+ }
261
+ }
262
+ ```
263
+
264
+ 5. Rerun Qwen Code in terminal. Enjoy 🥳
265
+ </details>
266
+
267
+ <details>
268
+ <summary>Install in Codex CLI</summary>
269
+ 1. Install Codex CLI:
270
+
271
+ ```shell
272
+ npm install -g @openai/codex
273
+ ```
274
+ 2. Clone the repository.
275
+
276
+ ```shell
277
+ git clone https://github.com/CursorTouch/Windows-MCP.git
278
+
279
+ cd Windows-MCP
280
+ ```
281
+ 3. Navigate to `%USERPROFILE%/.codex/config.toml`.
282
+
283
+ 4. Add the `windows-mcp` config in the `config.toml` and save it.
284
+
285
+ ```toml
286
+ [mcp_servers.windows-mcp]
287
+ command="uv"
288
+ args=[
289
+ "--directory",
290
+ "<path to the windows-mcp directory>",
291
+ "run",
292
+ "main.py"
293
+ ]
294
+ ```
295
+
296
+ 5. Rerun Codex CLI in terminal. Enjoy 🥳
297
+ </details>
298
+
299
+ ---
300
+
301
+ ## 🔨MCP Tools
302
+
303
+ An MCP client can access the following tools to interact with Windows:
304
+
305
+ - `Click-Tool`: Click on the screen at the given coordinates.
306
+ - `Type-Tool`: Type text on an element (optionally clears existing text).
307
+ - `Clipboard-Tool`: Copy or paste using the system clipboard.
308
+ - `Scroll-Tool`: Scroll vertically or horizontally on the window or specific regions.
309
+ - `Drag-Tool`: Drag from one point to another.
310
+ - `Move-Tool`: Move mouse pointer.
311
+ - `Shortcut-Tool`: Press keyboard shortcuts (`Ctrl+c`, `Alt+Tab`, etc).
312
+ - `Key-Tool`: Press a single key.
313
+ - `Wait-Tool`: Pause for a defined duration.
314
+ - `State-Tool`: Capture a combined snapshot of the default language, browser, active apps, and interactive, textual, and scrollable elements, along with a screenshot of the desktop.
315
+ - `Resize-Tool`: Change the size or position of an app window.
316
+ - `Launch-Tool`: Launch an application from the Start menu.
317
+ - `Shell-Tool`: Execute PowerShell commands.
318
+ - `Scrape-Tool`: Scrape an entire webpage for information.
319
+
320
+ ## 🤝 Connect with Us
321
+ Stay updated and join our community:
322
+
323
+ - 📢 Follow us on [X](https://x.com/CursorTouch) for the latest news and updates
324
+
325
+ - 💬 Join our [Discord Community](https://discord.com/invite/Aue9Yj2VzS)
326
+
327
+ ## Star History
328
+
329
+ [![Star History Chart](https://api.star-history.com/svg?repos=CursorTouch/Windows-MCP&type=Date)](https://www.star-history.com/#CursorTouch/Windows-MCP&Date)
330
+
331
+ ## ⚠️Caution
332
+
333
+ This MCP server interacts directly with your Windows operating system and performs real actions on it, including running PowerShell commands. Use it with caution and avoid deploying it in environments where the risk of unintended actions cannot be tolerated.
334
+
335
+ ## 📝 Limitations
336
+
337
+ - Selecting specific sections of text within a paragraph is not supported, because the MCP relies on the accessibility (a11y) tree. (⌛ Working on it.)
338
+ - `Type-Tool` is meant for typing text, not for programming in an IDE, because it types the whole program into a file at once. (⌛ Working on it.)
339
+ - This MCP server can't be used to play video games.
340
+
341
+ ## 🪪License
342
+
343
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
344
+
345
+ ## 🤝Contributing
346
+
347
+ Contributions are welcome! Please see [CONTRIBUTING](CONTRIBUTING) for setup instructions and development guidelines.
348
+
349
+ Made with ❤️ by [CursorTouch](https://github.com/CursorTouch)
350
+
351
+ ## Citation
352
+
353
+ ```bibtex
354
+ @software{windows_mcp,
355
+ author = {CursorTouch},
356
+ title = {Windows-MCP: Lightweight open-source project for integrating LLM agents with Windows},
357
+ year = {2024},
358
+ publisher = {GitHub},
359
+ url={https://github.com/CursorTouch/Windows-MCP}
360
+ }
361
+ ```
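Note on the `PKG-INFO` hunk above: its header block (`Name`, `Version`, `Requires-Python`, repeated `Requires-Dist`) follows the RFC 822-style layout used by Python core metadata, so it can be read with the standard-library `email` parser. The following is a minimal sketch, assuming a trimmed excerpt of the headers shown in the hunk. `SAMPLE` and `read_metadata` are illustrative names and are not part of the package.

```python
from email import message_from_string

# Trimmed excerpt of the PKG-INFO headers shown in the hunk above.
# The full file also carries the license text and the README body.
SAMPLE = """\
Metadata-Version: 2.4
Name: windows-mcp
Version: 0.3.0
Requires-Python: >=3.13
Requires-Dist: click>=8.2.1
Requires-Dist: fastmcp>=2.8.1
Requires-Dist: uiautomation>=2.0.24
"""


def read_metadata(text: str) -> dict:
    """Return the name, version, Python requirement, and dependency list."""
    msg = message_from_string(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist appears once per dependency, so collect every value.
        "requires_dist": msg.get_all("Requires-Dist", []),
    }


meta = read_metadata(SAMPLE)
print(meta["name"], meta["version"], meta["requires_python"])
for requirement in meta["requires_dist"]:
    print("  depends on:", requirement)
```

A check like this makes it easy to confirm that the `Requires-Python: >=3.13` header matches the `Python 3.13+` prerequisite stated in the README.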
@@ -0,0 +1,314 @@
1
+ <div align="center">
2
+ <h1>🪟 Windows-MCP</h1>
3
+
4
+ <a href="https://github.com/CursorTouch/Windows-MCP/blob/main/LICENSE">
5
+ <img src="https://img.shields.io/badge/license-MIT-green" alt="License">
6
+ </a>
7
+ <img src="https://img.shields.io/badge/python-3.13%2B-blue" alt="Python">
8
+ <img src="https://img.shields.io/badge/platform-Windows%207–11-blue" alt="Platform: Windows 7 to 11">
9
+ <img src="https://img.shields.io/github/last-commit/CursorTouch/Windows-MCP" alt="Last Commit">
10
+ <br>
11
+ <a href="https://x.com/CursorTouch">
12
+ <img src="https://img.shields.io/badge/follow-%40CursorTouch-1DA1F2?logo=twitter&style=flat" alt="Follow on Twitter">
13
+ </a>
14
+ <a href="https://discord.com/invite/Aue9Yj2VzS">
15
+ <img src="https://img.shields.io/badge/Join%20on-Discord-5865F2?logo=discord&logoColor=white&style=flat" alt="Join us on Discord">
16
+ </a>
17
+
18
+ </div>
19
+
20
+ <br>
21
+
22
+ **Windows MCP** is a lightweight, open-source project that enables seamless integration between AI agents and the Windows operating system. Acting as an MCP server, it bridges the gap between LLMs and the Windows operating system, allowing agents to perform tasks such as **file navigation, application control, UI interaction, QA testing,** and more.
23
+
24
+ ## Updates
25
+
26
+ - Try out 🪟[Windows-Use](https://github.com/CursorTouch/Windows-Use), an agent built using Windows-MCP.
27
+ - Windows-MCP is now featured as a Desktop Extension in `Claude Desktop`.
28
+
29
+ ### Supported Operating Systems
30
+
31
+ - Windows 7
32
+ - Windows 8, 8.1
33
+ - Windows 10
34
+ - Windows 11
35
+
36
+ ## 🎥 Demos
37
+
38
+ <https://github.com/user-attachments/assets/d0e7ed1d-6189-4de6-838a-5ef8e1cad54e>
39
+
40
+ <https://github.com/user-attachments/assets/d2b372dc-8d00-4d71-9677-4c64f5987485>
41
+
42
+ ## ✨ Key Features
43
+
44
+ - **Seamless Windows Integration**
45
+ Interacts natively with Windows UI elements, opens apps, controls windows, simulates user input, and more.
46
+
47
+ - **Use Any LLM (Vision Optional)**
48
+ Unlike many automation tools, Windows MCP doesn't rely on traditional computer vision techniques or specific fine-tuned models; it works with any LLM, reducing complexity and setup time.
49
+
50
+ - **Rich Toolset for UI Automation**
51
+ Includes tools for basic keyboard and mouse operations and for capturing window/UI state.
52
+
53
+ - **Lightweight & Open-Source**
54
+ Minimal dependencies and easy setup, with full source code available under the MIT license.
55
+
56
+ - **Customizable & Extendable**
57
+ Easily adapt or extend tools to suit your unique automation or AI integration needs.
58
+
59
+ - **Real-Time Interaction**
60
+ Typical latency between actions (e.g., from one mouse click to the next) ranges from **0.7 to 2.5 seconds** and may vary slightly with the number of active applications, the system load, and the inference speed of the LLM.
61
+
62
+ ## 🛠️Installation
63
+
64
+ ### Prerequisites
65
+
66
+ - Python 3.13+
67
+ - UV (Package Manager) from Astral, install with `pip install uv` or `curl -LsSf https://astral.sh/uv/install.sh | sh`
68
+ - `English` as the default Windows language is highly preferred. On Windows installations that use another language, disable the `Launch-Tool` and `Switch-Tool` in the MCP server.
69
+
70
+ <details>
71
+ <summary>Install in Claude Desktop</summary>
72
+
73
+ 1. Install [Claude Desktop](https://claude.ai/download) and the `dxt` CLI:
74
+
75
+ ```shell
76
+ npm install -g @anthropic-ai/dxt
77
+ ```
78
+
79
+ 2. Clone the repository.
80
+
81
+ ```shell
82
+ git clone https://github.com/CursorTouch/Windows-MCP.git
83
+
84
+ cd Windows-MCP
85
+ ```
86
+
87
+ 3. Build Desktop Extension `DXT`:
88
+
89
+ ```shell
90
+ npx @anthropic-ai/dxt pack
91
+ ```
92
+
93
+ 4. Open Claude Desktop:
94
+
95
+ Go to `Settings->Extensions->Advanced Settings->Install Extension`, locate the `.dxt` file, and click Install.
96
+
97
+ 5. Enjoy 🥳.
98
+
99
+ For additional Claude Desktop integration troubleshooting, see the [MCP documentation](https://modelcontextprotocol.io/quickstart/server#claude-for-desktop-integration-issues). The documentation includes helpful tips for checking logs and resolving common issues.
100
+ </details>
101
+
102
+ <details>
103
+ <summary>Install in Perplexity Desktop</summary>
104
+
105
+ 1. Install [Perplexity Desktop](https://apps.microsoft.com/detail/xp8jnqfbqh6pvf):
106
+
107
+ 2. Clone the repository.
108
+
109
+ ```shell
110
+ git clone https://github.com/CursorTouch/Windows-MCP.git
111
+
112
+ cd Windows-MCP
113
+ ```
114
+
115
+ 3. Open Perplexity Desktop:
116
+
117
+ Go to `Settings->Connectors->Add Connector->Advanced`
118
+
119
+ 4. Enter the name as `Windows-MCP`, then paste the following JSON in the text area.
120
+
121
+ ```json
122
+ {
123
+ "command": "uv",
124
+ "args": [
125
+ "--directory",
126
+ "<path to the windows-mcp directory>",
127
+ "run",
128
+ "main.py"
129
+ ]
130
+ }
131
+ ```
132
+
133
+ 5. Click `Save` and Enjoy 🥳.
134
+
135
+ For additional Perplexity Desktop integration troubleshooting, see the [Perplexity MCP Support](https://www.perplexity.ai/help-center/en/articles/11502712-local-and-remote-mcps-for-perplexity). The documentation includes helpful tips for checking logs and resolving common issues.
136
+ </details>
137
+
138
+ <details>
139
+ <summary>Install in Gemini CLI</summary>
140
+
141
+ 1. Install Gemini CLI:
142
+
143
+ ```shell
144
+ npm install -g @google/gemini-cli
145
+ ```
146
+
147
+ 2. Clone the repository.
148
+
149
+ ```shell
150
+ git clone https://github.com/CursorTouch/Windows-MCP.git
151
+
152
+ cd Windows-MCP
153
+ ```
154
+
155
+ 3. Navigate to `%USERPROFILE%/.gemini` in File Explorer and open `settings.json`.
156
+
157
+ 4. Add the `windows-mcp` config in the `settings.json` and save it.
158
+
159
+ ```json
160
+ {
161
+ "theme": "Default",
162
+ ...
163
+ //MCP Server Config
164
+ "mcpServers": {
165
+ "windows-mcp": {
166
+ "command": "uv",
167
+ "args": [
168
+ "--directory",
169
+ "<path to the windows-mcp directory>",
170
+ "run",
171
+ "main.py"
172
+ ]
173
+ }
174
+ }
175
+ }
176
+ ```
177
+
178
+ 5. Rerun Gemini CLI in terminal. Enjoy 🥳
179
+ </details>
180
+
181
+ <details>
182
+ <summary>Install in Qwen Code</summary>
183
+ 1. Install Qwen Code:
184
+
185
+ ```shell
186
+ npm install -g @qwen-code/qwen-code@latest
187
+ ```
188
+ 2. Clone the repository.
189
+
190
+ ```shell
191
+ git clone https://github.com/CursorTouch/Windows-MCP.git
192
+
193
+ cd Windows-MCP
194
+ ```
195
+
196
+ 3. Navigate to `%USERPROFILE%/.qwen/settings.json`.
197
+
198
+ 4. Add the `windows-mcp` config in the `settings.json` and save it.
199
+
200
+ ```json
201
+ {
202
+ //MCP Server Config
203
+ "mcpServers": {
204
+ "windows-mcp": {
205
+ "command": "uv",
206
+ "args": [
207
+ "--directory",
208
+ "<path to the windows-mcp directory>",
209
+ "run",
210
+ "main.py"
211
+ ]
212
+ }
213
+ }
214
+ }
215
+ ```
216
+
217
+ 5. Rerun Qwen Code in terminal. Enjoy 🥳
218
+ </details>
219
+
220
+ <details>
221
+ <summary>Install in Codex CLI</summary>
222
+ 1. Install Codex CLI:
223
+
224
+ ```shell
225
+ npm install -g @openai/codex
226
+ ```
227
+ 2. Clone the repository.
228
+
229
+ ```shell
230
+ git clone https://github.com/CursorTouch/Windows-MCP.git
231
+
232
+ cd Windows-MCP
233
+ ```
234
+ 3. Navigate to `%USERPROFILE%/.codex/config.toml`.
235
+
236
+ 4. Add the `windows-mcp` config in the `config.toml` and save it.
237
+
238
+ ```toml
239
+ [mcp_servers.windows-mcp]
240
+ command="uv"
241
+ args=[
242
+ "--directory",
243
+ "<path to the windows-mcp directory>",
244
+ "run",
245
+ "main.py"
246
+ ]
247
+ ```
248
+
249
+ 5. Rerun Codex CLI in terminal. Enjoy 🥳
250
+ </details>
251
+
252
+ ---
253
+
254
+ ## 🔨MCP Tools
255
+
256
+ An MCP client can access the following tools to interact with Windows:
257
+
258
+ - `Click-Tool`: Click on the screen at the given coordinates.
259
+ - `Type-Tool`: Type text on an element (optionally clears existing text).
260
+ - `Clipboard-Tool`: Copy or paste using the system clipboard.
261
+ - `Scroll-Tool`: Scroll vertically or horizontally on the window or specific regions.
262
+ - `Drag-Tool`: Drag from one point to another.
263
+ - `Move-Tool`: Move mouse pointer.
264
+ - `Shortcut-Tool`: Press keyboard shortcuts (`Ctrl+c`, `Alt+Tab`, etc).
265
+ - `Key-Tool`: Press a single key.
266
+ - `Wait-Tool`: Pause for a defined duration.
267
+ - `State-Tool`: Capture a combined snapshot of the default language, browser, active apps, and interactive, textual, and scrollable elements, along with a screenshot of the desktop.
268
+ - `Resize-Tool`: Change the size or position of an app window.
269
+ - `Launch-Tool`: Launch an application from the Start menu.
270
+ - `Shell-Tool`: Execute PowerShell commands.
271
+ - `Scrape-Tool`: Scrape an entire webpage for information.
272
+
273
+ ## 🤝 Connect with Us
274
+ Stay updated and join our community:
275
+
276
+ - 📢 Follow us on [X](https://x.com/CursorTouch) for the latest news and updates
277
+
278
+ - 💬 Join our [Discord Community](https://discord.com/invite/Aue9Yj2VzS)
279
+
280
+ ## Star History
281
+
282
+ [![Star History Chart](https://api.star-history.com/svg?repos=CursorTouch/Windows-MCP&type=Date)](https://www.star-history.com/#CursorTouch/Windows-MCP&Date)
283
+
284
+ ## ⚠️Caution
285
+
286
+ This MCP server interacts directly with your Windows operating system and performs real actions on it, including running PowerShell commands. Use it with caution and avoid deploying it in environments where the risk of unintended actions cannot be tolerated.
287
+
288
+ ## 📝 Limitations
289
+
290
+ - Selecting specific sections of text within a paragraph is not supported, because the MCP relies on the accessibility (a11y) tree. (⌛ Working on it.)
291
+ - `Type-Tool` is meant for typing text, not for programming in an IDE, because it types the whole program into a file at once. (⌛ Working on it.)
292
+ - This MCP server can't be used to play video games.
293
+
294
+ ## 🪪License
295
+
296
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
297
+
298
+ ## 🤝Contributing
299
+
300
+ Contributions are welcome! Please see [CONTRIBUTING](CONTRIBUTING) for setup instructions and development guidelines.
301
+
302
+ Made with ❤️ by [CursorTouch](https://github.com/CursorTouch)
303
+
304
+ ## Citation
305
+
306
+ ```bibtex
307
+ @software{windows_mcp,
308
+ author = {CursorTouch},
309
+ title = {Windows-MCP: Lightweight open-source project for integrating LLM agents with Windows},
310
+ year = {2024},
311
+ publisher = {GitHub},
312
+ url={https://github.com/CursorTouch/Windows-MCP}
313
+ }
314
+ ```
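Note on the client configuration snippets in the README hunk above: the Gemini CLI and Qwen Code examples contain a `...` placeholder and a `//` comment, which Python's strict `json` module would reject if the snippet were pasted verbatim. One way to avoid hand-editing is to build the `windows-mcp` entry programmatically and merge it into the existing settings. This is only a sketch under stated assumptions: `windows_mcp_entry`, `add_server`, and the install path are hypothetical and are not part of the package, and writing the result back to `settings.json` is left out.

```python
import json


def windows_mcp_entry(repo_dir: str) -> dict:
    """Build the server entry from the README: uv --directory <dir> run main.py."""
    return {"command": "uv", "args": ["--directory", repo_dir, "run", "main.py"]}


def add_server(settings: dict, repo_dir: str) -> dict:
    """Return a copy of `settings` with `windows-mcp` added under `mcpServers`."""
    merged = dict(settings)
    # Copy the existing server map so other configured servers are preserved
    # and the caller's dictionary is not modified.
    servers = dict(merged.get("mcpServers", {}))
    servers["windows-mcp"] = windows_mcp_entry(repo_dir)
    merged["mcpServers"] = servers
    return merged


# Hypothetical existing settings and clone location, for illustration only.
existing = {"theme": "Default"}
updated = add_server(existing, r"C:\tools\Windows-MCP")
print(json.dumps(updated, indent=2))
```

The same `command` and `args` values apply to the Perplexity Desktop connector and, in TOML form, to the Codex CLI `[mcp_servers.windows-mcp]` table shown above.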