npm - image-to-code - Versions diffs - 0.1.0 - Mend

image-to-code 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (4) hide show

package/README.md ADDED Viewed

@@ -0,0 +1,243 @@
+# image-to-code
+> Extract structured data (colors, layout, OCR text) from images. **No AI vision required.**
+> Cross-platform: macOS · Linux · Windows
+> Uses Tesseract OCR + Pillow for fully programmatic analysis.
+## Quick Install
+```bash
+# NPX (easiest — auto-installs Python deps)
+npx image-to-code screenshot.png
+# Pip
+pip install image-to-code
+image-to-code screenshot.png
+```
+## Features
+- **Color Extraction** — dominant colors, semantic role detection (background, text, button, border, surface), WCAG contrast ratio, color harmony classification, gradient detection
+- **Layout Detection** — horizontal section segmentation, vertical column detection, component labeling with hero-padding awareness
+- **OCR Text Extraction** — multi-PSM scanning, histogram stretch + adaptive threshold preprocessing, footer/branding crop scans, Thai grapheme cluster merging, intelligent dedup
+- **Button Detection** — heuristic-based UI button identification from bounding box sizes
+- **Photo/UI Classification** — classifies images as photo (organic background) vs UI (flat/schematic) using luminance variance + edge ratio heuristics
+- **CSS Output** — generates CSS custom properties and media query recommendations
+- **Clipboard Support** — read directly from clipboard (`--clipboard` flag)
+## Requirements
+| Dependency | Version | Notes |
+|---|---|---|
+| Python | 3.10+ | Core runtime |
+| [Tesseract OCR](https://github.com/tesseract-ocr/tesseract) | 5.x | OCR engine (must be on PATH) |
+| Node.js (optional) | 18+ | Only needed for `npx image-to-code` |
+| [Pillow](https://python-pillow.org/) | 10.0+ | Image processing |
+| [pytesseract](https://github.com/madmaze/pytesseract) | 0.3.10+ | Python Tesseract wrapper |
+### Install Tesseract
+```bash
+# macOS
+brew install tesseract tesseract-lang
+# Linux (Ubuntu/Debian)
+sudo apt install tesseract-ocr tesseract-ocr-tha tesseract-ocr-osd
+# Linux (Arch)
+sudo pacman -S tesseract tesseract-data-tha
+# Windows
+winget install -e --id UB-Mannheim.TesseractOCR
+# Or download from https://github.com/UB-Mannheim/tesseract/wiki
+```
+## Installation
+### Option 1: NPX (easiest)
+```bash
+npx image-to-code screenshot.png
+```
+> First run auto-installs the Python package. Requires Python 3.10+.
+### Option 2: pip
+```bash
+pip install image-to-code
+image-to-code screenshot.png
+```
+### Option 3: From source
+```bash
+git clone https://github.com/phumitchreal/image-to-code.git
+cd image-to-code
+pip install -r requirements.txt
+python -m image_to_code.analyze screenshot.png
+```
+## Usage
+### CLI
+```bash
+# Basic analysis
+python -m image_to_code.analyze screenshot.png
+# JSON output only
+python -m image_to_code.analyze screenshot.png --json
+# Full report + JSON
+python -m image_to_code.analyze screenshot.png --full
+# Read from clipboard
+python -m image_to_code.analyze --clipboard
+# Specify language and confidence threshold
+python -m image_to_code.analyze screenshot.png --lang eng --min-confidence 80
+# Custom color sampling
+python -m image_to_code.analyze screenshot.png --sample-count 3000 --quantize-tolerance 20
+```
+### Python Library
+```python
+from image_to_code import analyze
+# Full analysis pipeline
+result = analyze.analyze_image("screenshot.png")
+# Or use individual modules
+from image_to_code.colors import extract_colors
+from image_to_code.layout import detect_layout
+from image_to_code.ocr import extract_text
+colors = extract_colors("image.png")
+layout = detect_layout("image.png")
+text = extract_text("image.png", language="tha+eng", min_confidence=70)
+print(f"Background: {colors['background']}")
+print(f"Text: {colors['text']} (contrast: {colors['contrastRatio']}:1)")
+print(f"Layout: {layout['layoutType']}")
+print(f"OCR: {text['rawText']}")
+```
+## Output Example
+```
+=======================================================================
+  IMAGE ANALYSIS REPORT
+=======================================================================
+Image: 1913x995 (landscape/desktop, photo)
+--- Colors ---
+  Background: #0F0F0F
+  Surfaces:   #1E1E1E, #000000, #2D2D1E
+  Text:       #FFFFFF  (contrast: 16.9:1)
+  Button:     #5A69F0
+  Border:     #2D1E1E
+  Harmony:    neutral
+  Palette:    20 unique colors
+--- Layout Components ---
+  hero-padding     y= 0% h=45%  color=#282828
+  bottom-segment   y=45% h=53%  color=#141414
+  bottom-segment   y=98% h= 2%  color=#000000
+--- OCR Text (35 words >=70%) ---
+DISCORD COMMUNITY HUB
+ติดตามสมาชิกแก๊งแบบเรียลไทม์และดูพาร์ทเนอร์ที่ร่วมงานกับเรา
+Gang Partners
+--- UI Buttons (2) ---
+  [button] Gang  (z=middle, y=570, c=94.4%)
+  [button] Partners  (z=middle, y=580, c=96.6%)
+--- CSS Recommendations ---
+  --bg: #0F0F0F
+  --surface: #1E1E1E
+  --text: #FFFFFF
+  --primary: #5A69F0
+  --border: #2D1E1E
+  --radius: 6px
+=======================================================================
+```
+## JSON Output Structure
+```json
+{
+  "imageType": "photo",
+  "image": { "width": 1913, "height": 995 },
+  "colors": {
+    "background": "#0F0F0F",
+    "text": "#FFFFFF",
+    "button": "#5A69F0",
+    "border": "#2D1E1E",
+    "contrastRatio": 16.9,
+    "harmony": "neutral",
+    "palette": [ ... ],
+    "gradient": null
+  },
+  "layout": {
+    "type": "landscape/desktop",
+    "components": [ ... ]
+  },
+  "text": {
+    "words": 35,
+    "boxes": [ ... ],
+    "buttons": [ ... ],
+    "fullText": "...",
+    "byZone": { "top": "...", "middle": "...", "bottom": "..." }
+  },
+  "css": {
+    "customProperties": {
+      "--bg": "#0F0F0F",
+      "--text": "#FFFFFF",
+      "--primary": "#5A69F0"
+    }
+  }
+}
+```
+## PowerShell Version (Windows)
+The `powershell/` directory contains the original Windows PowerShell scripts. These work on Windows only (require `System.Drawing`). Usage:
+```powershell
+# Full analysis
+.\powershell\analyze-image.ps1 -ImagePath screenshot.png -Full
+# From clipboard
+.\powershell\analyze-image.ps1 -Clipboard
+# JSON output
+.\powershell\analyze-image.ps1 -ImagePath screenshot.png -Json
+```
+## How It Works
+### Photo vs UI Classification
+Uses three heuristics on a coarse pixel sample:
+1. **Distinct color count** — photos have >50 distinct colors (after 4-bit quantization)
+2. **Luminance IQR** — photos have narrow interquartile range (<80) with moderate color count
+3. **Edge ratio** — photos have low edge ratio (<0.3) on adjacent spatial samples with wide luminance range
+### Thai Text Handling
+Tesseract splits Thai characters into individual grapheme components. The `merge_thai_text()` post-processor removes spaces between Thai Unicode characters (U+0E00–U+0E7F) to reconstruct correct words.
+### Adaptive Thresholding
+For photo backgrounds, two preprocessing passes run:
+1. Histogram stretch (full contrast enhancement)
+2. Adaptive threshold (hard clip at 100/160 luminance)
+OCR runs on all versions (original + preprocessed) with multiple PSM modes and deduplicates results.
+## License
+MIT

package/README.th.md ADDED Viewed

@@ -0,0 +1,206 @@
+# image-to-code
+> แยกข้อมูลโครงสร้าง (สี, เลย์เอาต์, ข้อความ) จากรูปภาพ **โดยไม่ต้องใช้ AI ภาพ**
+> รองรับทุกแพลตฟอร์ม: macOS · Linux · Windows
+> ใช้ Tesseract OCR + Pillow วิเคราะห์ภาพแบบ programmatic 100%
+## ติดตั้ง
+### ตัวเลือก 1: NPX (ง่ายที่สุด)
+```bash
+npx image-to-code รูป.png
+```
+> ครั้งแรกจะโหลด Python package โดยอัตโนมัติ ต้องการ Python 3.10+
+### ตัวเลือก 2: pip
+```bash
+pip install image-to-code
+image-to-code รูป.png
+```
+### ตัวเลือก 3: จาก source
+```bash
+git clone https://github.com/phumitchreal/image-to-code.git
+cd image-to-code
+pip install -r requirements.txt
+python -m image_to_code.analyze รูป.png
+```
+### ติดตั้ง Tesseract OCR
+```bash
+# macOS
+brew install tesseract tesseract-lang
+# Linux (Ubuntu/Debian)
+sudo apt install tesseract-ocr tesseract-ocr-tha tesseract-ocr-osd
+# Linux (Arch)
+sudo pacman -S tesseract tesseract-data-tha
+# Windows
+winget install -e --id UB-Mannheim.TesseractOCR
+# หรือโหลดจาก https://github.com/UB-Mannheim/tesseract/wiki
+```
+> ภาษาไทย (`tha.traineddata`) จะโหลดอัตโนมัติครั้งแรกที่ใช้งาน OCR
+## ความสามารถ
+| ฟีเจอร์ | รายละเอียด |
+|---|---|
+| **แยกสี** | สีหลัก, สีพื้นหลัง, สีข้อความ, สีปุ่ม, สีขอบ, WCAG contrast ratio, ประเภทสี harmony, gradient |
+| **วิเคราะห์เลย์เอาต์** | หาส่วนแนวนอน, คอลัมน์แนวตั้ง, component labeling (hero-padding, bottom-segment) |
+| **OCR ข้อความ** | หลาย PSM mode, histogram stretch + adaptive threshold, สแกน footer/branding เพิ่มเติม, จับกลุ่มตัวอักษรไทย |
+| **ปุ่ม UI** | จำแนกปุ่มจากขนาด bounding box |
+| **แยกประเภทภาพ** | photo (พื้นหลังออร์แกนิก) vs UI (แบน/ schematic) |
+| **CSS Output** | สร้าง CSS custom properties และ media query |
+| **Clipboard** | อ่านรูปจากคลิปบอร์ด (`--clipboard`) |
+## การใช้งาน
+### CLI
+```bash
+# วิเคราะห์พื้นฐาน
+image-to-code screenshot.png
+# แสดงเป็น JSON อย่างเดียว
+image-to-code screenshot.png --json
+# รายงานเต็ม + JSON
+image-to-code screenshot.png --full
+# อ่านจากคลิปบอร์ด
+image-to-code --clipboard
+# เปลี่ยนภาษา OCR และความมั่นใจขั้นต่ำ
+image-to-code screenshot.png --lang eng --min-confidence 80
+# กำหนดจำนวนตัวอย่างสี
+image-to-code screenshot.png --sample-count 3000 --quantize-tolerance 20
+```
+### ใช้เป็น Python Library
+```python
+from image_to_code.colors import extract_colors
+from image_to_code.layout import detect_layout
+from image_to_code.ocr import extract_text
+colors = extract_colors("image.png")
+layout = detect_layout("image.png")
+text = extract_text("image.png", language="tha+eng", min_confidence=70)
+print(f"พื้นหลัง: {colors['background']}")
+print(f"ข้อความ: {colors['text']} (contrast: {colors['contrastRatio']}:1)")
+print(f"เลย์เอาต์: {layout['layoutType']}")
+print(f"OCR: {text['rawText']}")
+```
+## ตัวอย่าง Output
+```
+=======================================================================
+   รายงานวิเคราะห์ภาพ
+=======================================================================
+Image: 1913x995 (landscape/desktop, photo)
+--- สี ---
+  พื้นหลัง:    #0F0F0F
+  พื้นผิว:     #1E1E1E, #000000, #2D2D1E
+  ข้อความ:     #FFFFFF  (contrast: 16.9:1)
+  ปุ่ม:        #5A69F0
+  ขอบ:        #2D1E1E
+--- ส่วนประกอบเลย์เอาต์ ---
+  hero-padding     y= 0% h=45%  color=#282828
+  bottom-segment   y=45% h=53%  color=#141414
+  bottom-segment   y=98% h= 2%  color=#000000
+--- OCR (35 คำ >=70%) ---
+DISCORD COMMUNITY HUB
+ติดตามสมาชิกแก๊งแบบเรียลไทม์และดูพาร์ทเนอร์ที่ร่วมงานกับเรา
+Gang Partners
+--- CSS ---
+  --bg: #0F0F0F
+  --surface: #1E1E1E
+  --text: #FFFFFF
+  --primary: #5A69F0
+  --border: #2D1E1E
+```
+## โครงสร้าง JSON Output
+```json
+{
+  "imageType": "photo",
+  "image": { "width": 1913, "height": 995 },
+  "colors": {
+    "background": "#0F0F0F",
+    "text": "#FFFFFF",
+    "button": "#5A69F0",
+    "border": "#2D1E1E",
+    "contrastRatio": 16.9,
+    "harmony": "neutral",
+    "palette": [ ... ],
+    "gradient": null
+  },
+  "layout": {
+    "type": "landscape/desktop",
+    "components": [ ... ]
+  },
+  "text": {
+    "words": 35,
+    "boxes": [ ... ],
+    "buttons": [ { "text": "Gang", "x": 0, "y": 570, "w": 100, "h": 40 } ],
+    "fullText": "...",
+    "byZone": { "top": "...", "middle": "...", "bottom": "..." }
+  },
+  "css": {
+    "customProperties": {
+      "--bg": "#0F0F0F",
+      "--text": "#FFFFFF",
+      "--primary": "#5A69F0"
+    }
+  }
+}
+```
+## การทำงานภายใน
+### แยกประเภท Photo vs UI
+ใช้ 3 heuristic กับ pixel sample:
+1. **จำนวนสี distinct** — ภาพถ่ายมี >50 สี (หลัง 4-bit quantization)
+2. **Luminance IQR** — ภาพถ่ายมีช่วง interquartile แคบ (<80) + จำนวนสีปานกลาง
+3. **Edge ratio** — ภาพถ่ายมี edge ratio ต่ำ (<0.3) บน spatial sample ที่อยู่ติดกัน
+### การจัดการภาษาไทย
+Tesseract มักแยกตัวอักษรไทยออกเป็น grapheme ย่อยๆ ฟังก์ชัน `merge_thai_text()` จะลบช่องว่างระหว่างอักขระไทย (U+0E00–U+0E7F) เพื่อรวมเป็นคำที่ถูกต้อง
+### Adaptive Thresholding
+สำหรับภาพพื้นหลังที่เป็นรูปถ่าย จะมีการประมวลผลล่วงหน้า 2 แบบ:
+1. Histogram stretch (เพิ่ม contrast เต็มที่)
+2. Adaptive threshold (ตัดที่ 100/160 luminance)
+OCR จะรันบนทุกเวอร์ชัน (ต้นฉบับ + processed) ด้วยหลาย PSM mode และ deduplicate ผลลัพธ์
+## PowerShell Version (Windows)
+โฟลเดอร์ `powershell/` มี PowerShell scripts ต้นฉบับ สำหรับ Windows เท่านั้น:
+```powershell
+.\powershell\analyze-image.ps1 -ImagePath screenshot.png -Full
+.\powershell\analyze-image.ps1 -Clipboard
+.\powershell\analyze-image.ps1 -ImagePath screenshot.png -Json
+```
+## License
+MIT

package/bin/cli.js ADDED Viewed

@@ -0,0 +1,66 @@
+#!/usr/bin/env node
+/**
+ * image-to-code — npm wrapper.
+ * Auto-installs the Python package via pip on first run, then delegates.
+ */
+const { execSync, spawn } = require("child_process");
+const path = require("path");
+const PYTHON_MODULE = "image_to_code";
+const REQUIRED_DEPS = ["Pillow>=10.0.0", "pytesseract>=0.3.10"];
+function checkPython() {
+  try {
+    execSync("python --version", { stdio: "pipe", timeout: 10000 });
+    return "python";
+  } catch {
+    try {
+      execSync("python3 --version", { stdio: "pipe", timeout: 10000 });
+      return "python3";
+    } catch {
+      return null;
+    }
+  }
+}
+function checkPackage(python) {
+  try {
+    execSync(`${python} -c "import ${PYTHON_MODULE}"`, {
+      stdio: "pipe",
+      timeout: 10000,
+    });
+    return true;
+  } catch {
+    return false;
+  }
+}
+function installPackage(python) {
+  console.log("→ Installing image-to-code Python package...");
+  execSync(`${python} -m pip install ${PYTHON_MODULE} --upgrade`, {
+    stdio: "inherit",
+    timeout: 120000,
+  });
+}
+function main() {
+  const python = checkPython();
+  if (!python) {
+    console.error(
+      "✖ Python not found. Install Python 3.10+ from https://python.org"
+    );
+    process.exit(1);
+  }
+  if (!checkPackage(python)) {
+    installPackage(python);
+  }
+  const args = process.argv.slice(2);
+  const child = spawn(python, ["-m", PYTHON_MODULE + ".analyze", ...args], {
+    stdio: "inherit",
+  });
+  child.on("exit", (code) => process.exit(code));
+}
+main();

package/package.json ADDED Viewed

@@ -0,0 +1,30 @@
+{
+  "name": "image-to-code",
+  "version": "0.1.0",
+  "description": "Extract structured data (colors, layout, OCR text) from images. No AI vision required.",
+  "bin": {
+    "image-to-code": "bin/cli.js"
+  },
+  "repository": {
+    "type": "git",
+    "url": "git+https://github.com/phumitchreal/image-to-code.git"
+  },
+  "keywords": [
+    "ocr",
+    "image-analysis",
+    "color-extraction",
+    "layout-detection",
+    "tesseract",
+    "thai"
+  ],
+  "license": "MIT",
+  "bugs": {
+    "url": "https://github.com/phumitchreal/image-to-code/issues"
+  },
+  "homepage": "https://github.com/phumitchreal/image-to-code#readme",
+  "files": [
+    "bin/",
+    "package.json",
+    "README.md"
+  ]
+}