@goonnguyen/human-mcp 2.13.0 → 2.14.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (4) hide show
  1. package/README.md +129 -220
  2. package/dist/index.js +33648 -7351
  3. package/dist/index.js.map +197 -14
  4. package/package.json +3 -1
package/README.md CHANGED
@@ -6,6 +6,9 @@
6
6
 
7
7
  Human MCP v2.10.0 is a comprehensive Model Context Protocol server that provides AI coding agents with human-like capabilities including visual analysis, document processing, speech generation, content creation, image editing, browser automation, and advanced reasoning for debugging, understanding, and enhancing multimodal content.
8
8
 
9
+ ## "Human MCP" is a part of [ClaudeKit](https://claudekit.cc)
10
+ ![ClaudeKit.cc](claudekit.png)
11
+
9
12
  ## Features
10
13
 
11
14
  🎯 **Visual Analysis (Eyes) - ✅ Complete (4 tools)**
@@ -123,6 +126,104 @@ curl -H "Content-Type: application/json" \
123
126
  -X POST "https://generativelanguage.googleapis.com/v1beta/models/gemini-pro:generateContent?key=YOUR_API_KEY"
124
127
  ```
125
128
 
129
+ ### Choosing Your Gemini Provider
130
+
131
+ Human MCP supports two ways to access Google's Gemini models:
132
+
133
+ #### Option 1: Google AI Studio (Default - Recommended for Getting Started)
134
+
135
+ **Best for:** Quick start, development, prototyping
136
+
137
+ **Setup:**
138
+ 1. Get your API key from [Google AI Studio](https://aistudio.google.com/)
139
+ 2. Set environment variable: `export GOOGLE_GEMINI_API_KEY=your_api_key`
140
+
141
+ **Pros:**
142
+ - Simple setup (just one API key)
143
+ - No GCP account required
144
+ - Free tier available
145
+ - Perfect for development and testing
146
+
147
+ #### Option 2: Vertex AI (Recommended for Production)
148
+
149
+ **Best for:** Production deployments, enterprise use, GCP integration
150
+
151
+ **Setup:**
152
+ 1. Create a GCP project and enable Vertex AI API
153
+ 2. Set up authentication:
154
+ ```bash
155
+ # Option A: Application Default Credentials (for local dev)
156
+ gcloud auth application-default login
157
+
158
+ # Option B: Service Account (for production)
159
+ export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json
160
+ ```
161
+ 3. Set environment variables:
162
+ ```bash
163
+ export USE_VERTEX=1
164
+ export VERTEX_PROJECT_ID=your-gcp-project-id
165
+ export VERTEX_LOCATION=us-central1 # optional, defaults to us-central1
166
+ ```
167
+
168
+ **Pros:**
169
+ - Enterprise-grade quotas and SLAs
170
+ - Better integration with GCP services
171
+ - Advanced IAM and security controls
172
+ - Usage tracking via Cloud Console
173
+ - Better for production workloads
174
+
175
+ **Configuration Example (Claude Desktop):**
176
+
177
+ ```json
178
+ {
179
+ "mcpServers": {
180
+ "human-mcp-vertex": {
181
+ "command": "npx",
182
+ "args": ["@goonnguyen/human-mcp"],
183
+ "env": {
184
+ "USE_VERTEX": "1",
185
+ "VERTEX_PROJECT_ID": "your-gcp-project-id",
186
+ "VERTEX_LOCATION": "us-central1"
187
+ }
188
+ }
189
+ }
190
+ }
191
+ ```
192
+
193
+ **Note:** With Vertex AI, you don't need `GOOGLE_GEMINI_API_KEY` - authentication is handled by GCP credentials.
194
+
195
+ #### Vertex AI Authentication Methods
196
+
197
+ **1. Application Default Credentials (ADC)** - Best for local development
198
+ ```bash
199
+ gcloud auth application-default login
200
+ ```
201
+
202
+ **2. Service Account** - Best for production
203
+ ```bash
204
+ # Download service account JSON from GCP Console
205
+ export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json
206
+ ```
207
+
208
+ **3. Workload Identity** - Best for GKE deployments
209
+ - Automatically configured when running on GKE
210
+ - No credentials file needed
211
+ - Recommended for Kubernetes deployments
212
+
213
+ **Troubleshooting Vertex AI:**
214
+
215
+ If you encounter authentication errors:
216
+ 1. Verify your GCP project ID: `gcloud config get-value project`
217
+ 2. Check ADC status: `gcloud auth application-default print-access-token`
218
+ 3. Ensure Vertex AI API is enabled: Visit [Vertex AI Console](https://console.cloud.google.com/vertex-ai)
219
+ 4. Verify IAM permissions: Your account needs `Vertex AI User` role
220
+
221
+ **Cost Considerations:**
222
+
223
+ Both Google AI Studio and Vertex AI use the same Gemini models and pricing, but:
224
+ - Google AI Studio: Generous free tier for testing
225
+ - Vertex AI: Production-grade quotas, better for high-volume usage
226
+
126
227
  #### Alternative Methods for API Key
127
228
 
128
229
  **Using Google Cloud Console:**
@@ -156,36 +257,8 @@ echo $GOOGLE_GEMINI_API_KEY
156
257
 
157
258
  ### Prerequisites
158
259
 
159
- - Node.js v18+ or [Bun](https://bun.sh) v1.2+
160
- - Google Gemini API key (configured as shown above)
161
-
162
- ### Installation
163
-
164
- Install Human MCP as an npm package:
165
-
166
- ```bash
167
- # Using npm
168
- npm install -g @goonnguyen/human-mcp
169
-
170
- # Using bun (recommended)
171
- bun install -g @goonnguyen/human-mcp
172
-
173
- # Using pnpm
174
- pnpm install -g @goonnguyen/human-mcp
175
- ```
176
-
177
- ### Environment Setup
178
-
179
- Configure your Google Gemini API key:
180
-
181
- ```bash
182
- # Option 1: Environment variable (recommended)
183
- export GOOGLE_GEMINI_API_KEY="your_api_key_here"
184
-
185
- # Option 2: Add to your shell profile
186
- echo 'export GOOGLE_GEMINI_API_KEY="your_api_key_here"' >> ~/.zshrc
187
- source ~/.zshrc
188
- ```
260
+ - Node.js v22+ or [Bun](https://bun.sh) v1.2+
261
+ - Google [Gemini API key](http://aistudio.google.com/) (configured as shown above)
189
262
 
190
263
  ### Development (For Contributors)
191
264
 
@@ -247,55 +320,15 @@ Claude Desktop is a desktop application that provides a user-friendly interface
247
320
  }
248
321
  ```
249
322
 
250
- **Alternative Configuration (if globally installed):**
251
-
252
- ```json
253
- {
254
- "mcpServers": {
255
- "human-mcp": {
256
- "command": "human-mcp",
257
- "env": {
258
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here"
259
- }
260
- }
261
- }
262
- }
263
- ```
264
-
265
- **Setup Steps:**
266
- 1. Install Human MCP globally: `npm install -g @goonnguyen/human-mcp`
267
- 2. Create or edit the Claude Desktop configuration file
268
- 3. Add the Human MCP server configuration (use the first example with `npx` for reliability)
269
- 4. Set your Google Gemini API key in environment variables or the config
270
- 5. Restart Claude Desktop
271
-
272
- **Verification:**
273
- - Look for the connection indicator in Claude Desktop
274
- - Try using the `eyes_analyze` tool with a test image
275
-
276
323
  #### Claude Code (CLI)
277
324
 
278
325
  Claude Code is the official CLI for Claude that supports MCP servers for enhanced coding workflows.
279
326
 
280
327
  **Prerequisites:**
281
- - Node.js v18+ or Bun v1.2+
328
+ - Node.js v22+ or Bun v1.2+
282
329
  - Google Gemini API key
283
330
  - Claude Code CLI installed
284
331
 
285
- **Installation:**
286
-
287
- ```bash
288
- # Install Claude Code CLI
289
- npm install -g @anthropic-ai/claude-code
290
-
291
- # Install Human MCP server
292
- npm install -g @goonnguyen/human-mcp
293
-
294
- # Verify installations
295
- claude --version
296
- human-mcp --version # or: npx @goonnguyen/human-mcp --version
297
- ```
298
-
299
332
  **Configuration Methods:**
300
333
 
301
334
  Claude Code offers multiple ways to configure MCP servers. Choose the method that best fits your workflow:
@@ -306,8 +339,8 @@ Claude Code offers multiple ways to configure MCP servers. Choose the method tha
306
339
  # Add Human MCP server with automatic configuration
307
340
  claude mcp add --scope user human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_here
308
341
 
309
- # Alternative: Add globally installed version
310
- claude mcp add --scope user human-mcp human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_here
342
+ # Alternative: Add locally installed version
343
+ claude mcp add --scope project human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_here
311
344
 
312
345
  # List configured MCP servers
313
346
  claude mcp list
@@ -339,23 +372,6 @@ claude mcp remove human-mcp
339
372
  }
340
373
  ```
341
374
 
342
- **Alternative Configuration (if globally installed):**
343
-
344
- ```json
345
- {
346
- "mcpServers": {
347
- "human-mcp": {
348
- "command": "human-mcp",
349
- "env": {
350
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here",
351
- "LOG_LEVEL": "info",
352
- "MCP_TIMEOUT": "30000"
353
- }
354
- }
355
- }
356
- }
357
- ```
358
-
359
375
  **Configuration Scopes:**
360
376
 
361
377
  Claude Code supports different configuration scopes:
@@ -374,10 +390,9 @@ claude mcp add --scope local human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GE
374
390
 
375
391
  **Setup Steps:**
376
392
  1. Install Claude Code CLI: `npm install -g @anthropic-ai/claude-code`
377
- 2. Install Human MCP: `npm install -g @goonnguyen/human-mcp`
378
- 3. Configure your Google Gemini API key (see Environment Setup section)
379
- 4. Add Human MCP server using CLI or manual configuration
380
- 5. Verify configuration: `claude mcp list`
393
+ 2. Configure your Google Gemini API key (see Environment Setup section)
394
+ 3. Add Human MCP server using CLI or manual configuration
395
+ 4. Verify configuration: `claude mcp list`
381
396
 
382
397
  **Verification:**
383
398
  ```bash
@@ -462,31 +477,10 @@ OpenCode is a powerful AI coding agent that supports MCP servers for enhanced ca
462
477
  }
463
478
  ```
464
479
 
465
- **Alternative Configuration (if globally installed):**
466
-
467
- ```json
468
- {
469
- "$schema": "https://opencode.ai/config.json",
470
- "mcp": {
471
- "human": {
472
- "type": "local",
473
- "command": ["human-mcp"],
474
- "enabled": true,
475
- "environment": {
476
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here",
477
- "TRANSPORT_TYPE": "stdio"
478
- }
479
- }
480
- }
481
- }
482
- ```
483
-
484
480
  **Setup Steps:**
485
- 1. Install Human MCP: `npm install -g @goonnguyen/human-mcp`
486
- 2. Create or edit your OpenCode configuration file
487
- 3. Add the Human MCP server configuration (use `npx` version for reliability)
488
- 4. Set your Google Gemini API key in environment variables or the config
489
- 5. Restart OpenCode
481
+ 1. Configure your Google Gemini API key (see Environment Setup section)
482
+ 2. Add Human MCP server using CLI or manual configuration
483
+ 3. Verify configuration: `claude mcp list`
490
484
 
491
485
  **Important Notes:**
492
486
  - **STDIO Mode**: Human MCP uses stdio transport by default, which provides the best compatibility with OpenCode
@@ -502,16 +496,6 @@ OpenCode is a powerful AI coding agent that supports MCP servers for enhanced ca
502
496
 
503
497
  While Gemini CLI doesn't directly support MCP, you can use Human MCP as a bridge to access visual analysis capabilities.
504
498
 
505
- **Direct Usage:**
506
-
507
- ```bash
508
- # Run Human MCP server directly (if globally installed)
509
- human-mcp
510
-
511
- # Or using npx (no global installation needed)
512
- npx @goonnguyen/human-mcp
513
- ```
514
-
515
499
  **Integration Example:**
516
500
  ```bash
517
501
  # Create a wrapper script for Gemini CLI integration
@@ -552,28 +536,14 @@ These IDE extensions support MCP servers for enhanced AI-assisted coding with vi
552
536
  }
553
537
  ```
554
538
 
555
- **Alternative Configuration (if globally installed):**
556
-
557
- ```json
558
- {
559
- "cline.mcpServers": {
560
- "human-mcp": {
561
- "command": "human-mcp",
562
- "env": {
563
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here"
564
- }
565
- }
566
- }
567
- }
568
- ```
569
-
570
539
  **Setup Steps:**
571
- 1. Install Cline extension from VS Code Marketplace
572
- 2. Install Human MCP: `npm install -g @goonnguyen/human-mcp`
573
- 3. Open VS Code in your project directory
574
- 4. Add Human MCP configuration to workspace settings (use `npx` version for reliability)
575
- 5. Restart VS Code or reload the window
576
- 6. Open Cline panel and verify MCP connection
540
+ 1. Configure your Google Gemini API key (see Environment Setup section)
541
+ 2. Add Human MCP server using CLI or manual configuration
542
+ 3. Verify configuration: `claude mcp list`
543
+
544
+ **Verification:**
545
+ - Check Cline logs for successful MCP connection
546
+ - Try using `eyes_analyze` tool: "Analyze this screenshot for UI issues"
577
547
 
578
548
  ##### Cursor
579
549
 
@@ -597,25 +567,10 @@ These IDE extensions support MCP servers for enhanced AI-assisted coding with vi
597
567
  }
598
568
  ```
599
569
 
600
- **Alternative Configuration (if globally installed):**
601
-
602
- ```json
603
- {
604
- "mcp.servers": {
605
- "human-mcp": {
606
- "command": "human-mcp",
607
- "env": {
608
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here"
609
- }
610
- }
611
- }
612
- }
613
- ```
614
-
615
570
  **Setup Steps:**
616
571
  1. Install latest version of Cursor
617
- 2. Install Human MCP: `npm install -g @goonnguyen/human-mcp`
618
- 3. Open your project in Cursor
572
+ 2. Configure your Google Gemini API key (see Environment Setup section)
573
+ 3. Add Human MCP server using CLI or manual configuration
619
574
  4. Configure MCP servers in settings (use `npx` version for reliability)
620
575
  5. Enable MCP integration in Cursor preferences
621
576
  6. Test visual analysis features
@@ -643,29 +598,12 @@ These IDE extensions support MCP servers for enhanced AI-assisted coding with vi
643
598
  }
644
599
  ```
645
600
 
646
- **Alternative Configuration (if globally installed):**
647
-
648
- ```json
649
- {
650
- "servers": {
651
- "human-mcp": {
652
- "command": "human-mcp",
653
- "env": {
654
- "GOOGLE_GEMINI_API_KEY": "your_gemini_api_key_here"
655
- },
656
- "timeout": 30000
657
- }
658
- }
659
- }
660
- ```
661
-
662
601
  **Setup Steps:**
663
602
  1. Install Windsurf IDE
664
- 2. Install Human MCP: `npm install -g @goonnguyen/human-mcp`
665
- 3. Create MCP server configuration file
666
- 4. Add Human MCP server configuration (use `npx` version for reliability)
667
- 5. Restart Windsurf
668
- 6. Verify connection in MCP panel
603
+ 2. Configure your Google Gemini API key (see Environment Setup section)
604
+ 3. Add Human MCP server using CLI or manual configuration
605
+ 4. Restart Windsurf
606
+ 5. Verify connection in MCP panel
669
607
 
670
608
  ### Environment Variable Setup
671
609
 
@@ -696,12 +634,6 @@ echo "source ~/.env" >> ~/.zshrc
696
634
 
697
635
  **Test Human MCP Server:**
698
636
  ```bash
699
- # Test the server directly (if globally installed)
700
- human-mcp
701
-
702
- # Or using npx (no installation needed)
703
- npx @goonnguyen/human-mcp
704
-
705
637
  # For development/testing, use the MCP inspector from source
706
638
  # (only if you have cloned the repository for development)
707
639
  cd /path/to/human-mcp && bun run inspector
@@ -719,47 +651,24 @@ cd /path/to/human-mcp && bun run inspector
719
651
 
720
652
  1. **Connection Failed**
721
653
  - Verify Node.js/npm or Bun is installed and accessible
722
- - Ensure `@goonnguyen/human-mcp` package is installed
654
+ - Ensure `@goonnguyen/human-mcp` package is installed in the client configuration
723
655
  - Check Google Gemini API key is valid and properly configured
724
656
 
725
- 2. **Package Not Found**
726
- - Install Human MCP globally: `npm install -g @goonnguyen/human-mcp`
727
- - Or use `npx @goonnguyen/human-mcp` without global installation
728
- - Verify package installation: `npm list -g @goonnguyen/human-mcp`
729
-
730
- 3. **Tool Not Found**
657
+ 2. **Tool Not Found**
731
658
  - Restart the MCP client after configuration changes
732
659
  - Check Human MCP server logs for errors
733
660
  - Verify the server starts: `npx @goonnguyen/human-mcp`
734
661
 
735
- 4. **API Errors**
662
+ 3. **API Errors**
736
663
  - Validate Google Gemini API key
737
664
  - Check API quota and usage limits
738
665
  - Review network connectivity and firewall settings
739
666
 
740
- 5. **Permission Errors**
667
+ 4. **Permission Errors**
741
668
  - Check npm global installation permissions
742
669
  - Use `npx` instead of global installation if needed
743
670
  - Verify API key has necessary permissions
744
671
 
745
- **Debug Steps:**
746
- ```bash
747
- # Enable debug logging
748
- export LOG_LEVEL=debug
749
-
750
- # Run Human MCP with verbose output
751
- npx @goonnguyen/human-mcp --verbose
752
-
753
- # Check package installation
754
- npm list -g @goonnguyen/human-mcp
755
-
756
- # Test direct execution
757
- human-mcp --version # if globally installed
758
-
759
- # Check MCP client logs
760
- # (Location varies by client - check client documentation)
761
- ```
762
-
763
672
  **Getting Help:**
764
673
  - Check [Human MCP Issues](https://github.com/human-mcp/human-mcp/issues)
765
674
  - Review client-specific MCP documentation