npm - mcp-hydrocoder-vision - Versions diffs - 0.2.0 → 0.3.0 - Mend

mcp-hydrocoder-vision 0.2.0 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/{INSTALL_EN.md → INSTALL_.md} +2 -0
package/INSTALL_CN.md +2 -0
package/README.md +2 -0
package/README_CN.md +2 -0
package/package.json +2 -2
package/src/index.ts +5 -4

package/{INSTALL_EN.md → INSTALL_.md} RENAMED Viewed

@@ -1,5 +1,7 @@
 # Installation Guide
+[中文版](./INSTALL_CN.md) | [English README](./README.md) | [中文 README](./README_CN.md)
 ## Prerequisites
 - Node.js 18+ installed

package/INSTALL_CN.md CHANGED Viewed

@@ -1,5 +1,7 @@
 # 安装说明
+[English Version](./INSTALL_.md) | [英文 README](./README.md) | [中文 README](./README_CN.md)
 ## 前置要求
 - Node.js 18+ 已安装

package/README.md CHANGED Viewed

@@ -1,5 +1,7 @@
 # MCP HydroCoder Vision
+[English Installation](./INSTALL_.md) | [中文安装](./INSTALL_CN.md) | [中文 README](./README_CN.md)
 A vision-language MCP server that enables Claude Code to analyze images using **Qwen3 VL 4B** model running locally via LM Studio.
 ## Features

package/README_CN.md CHANGED Viewed

@@ -1,5 +1,7 @@
 # MCP HydroCoder Vision
+[English Installation](./INSTALL_.md) | [中文安装](./INSTALL_CN.md) | [English README](./README.md)
 基于 **Qwen3 VL 4B** 模型的本地视觉语言 MCP 服务器，让 Claude Code 能够识别和分析图像。
 ## 功能特性

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "mcp-hydrocoder-vision",
-  "version": "0.2.0",
+  "version": "0.3.0",
   "description": "Vision MCP Server for Claude Code - Qwen3 VL 4B integration",
   "type": "module",
   "main": "src/index.ts",
@@ -11,7 +11,7 @@
     "src/",
     "README.md",
     "README_CN.md",
-    "INSTALL_EN.md",
+    "INSTALL_.md",
     "INSTALL_CN.md",
     "LICENSE"
   ],

package/src/index.ts CHANGED Viewed

@@ -160,7 +160,8 @@ server.setRequestHandler(CallToolRequestSchema, async (request) => {
         const validated = AnalyzeImageInputSchema.parse(args);
         const mimeType = getMimeType(validated.imagePath);
         const imageData = await fileToBase64(validated.imagePath);
-        const prompt = validated.prompt || 'Describe this image in detail.';
+        const prompt = validated.prompt ||
+          'Describe all visible elements concisely. Use factual, structured language. Include colors, sizes, positions, and relationships between elements. No fluff. Assume the image is a static screenshot or photograph.';
         const result = await analyzeImageWithLMStudio(imageData, mimeType, prompt);
         return {
@@ -173,8 +174,8 @@ server.setRequestHandler(CallToolRequestSchema, async (request) => {
         const mimeType = getMimeType(validated.imagePath);
         const imageData = await fileToBase64(validated.imagePath);
         const prompt = validated.language
-          ? `Extract all text from this image. The text is in ${validated.language}.`
-          : 'Extract all text from this image (OCR).';
+          ? `Extract ALL visible text in ${validated.language}. Output only the text, no commentary. Return text in the order it appears.`
+          : `Extract ALL visible text. Output only the text, no commentary. Return text in the order it appears.`;
         const result = await analyzeImageWithLMStudio(imageData, mimeType, prompt);
         return {
@@ -187,7 +188,7 @@ server.setRequestHandler(CallToolRequestSchema, async (request) => {
         const mimeType = getMimeType(validated.imagePath);
         const imageData = await fileToBase64(validated.imagePath);
         const framework = validated.framework || 'HTML/CSS/JavaScript';
-        const prompt = `Analyze this UI/design image and generate ${framework} code that replicates it. Focus on structure, styling, and layout.`;
+        const prompt = `Generate ${framework} code matching this design. Include all clickable elements, form submissions, state changes, and dynamic interactions. Use standard practices. Keep code readable and maintainable. Omit unnecessary comments. Do not output code for unsupported frameworks. Output only the code.`;
         const result = await analyzeImageWithLMStudio(imageData, mimeType, prompt);
         return {