PyPI - ollamadiffuser - Versions diffs - 1.2.2__py3-none-any.whl → 2.0.0__py3-none-any.whl - Mend

ollamadiffuser 1.2.2py3-none-any.whl → 2.0.0py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (36) hide show

ollamadiffuser/__init__.py +1 -1
ollamadiffuser/api/server.py +312 -312
ollamadiffuser/cli/config_commands.py +119 -0
ollamadiffuser/cli/lora_commands.py +169 -0
ollamadiffuser/cli/main.py +85 -1233
ollamadiffuser/cli/model_commands.py +664 -0
ollamadiffuser/cli/recommend_command.py +205 -0
ollamadiffuser/cli/registry_commands.py +197 -0
ollamadiffuser/core/config/model_registry.py +562 -11
ollamadiffuser/core/config/settings.py +24 -2
ollamadiffuser/core/inference/__init__.py +5 -0
ollamadiffuser/core/inference/base.py +182 -0
ollamadiffuser/core/inference/engine.py +204 -1405
ollamadiffuser/core/inference/strategies/__init__.py +1 -0
ollamadiffuser/core/inference/strategies/controlnet_strategy.py +170 -0
ollamadiffuser/core/inference/strategies/flux_strategy.py +136 -0
ollamadiffuser/core/inference/strategies/generic_strategy.py +164 -0
ollamadiffuser/core/inference/strategies/gguf_strategy.py +113 -0
ollamadiffuser/core/inference/strategies/hidream_strategy.py +104 -0
ollamadiffuser/core/inference/strategies/sd15_strategy.py +134 -0
ollamadiffuser/core/inference/strategies/sd3_strategy.py +80 -0
ollamadiffuser/core/inference/strategies/sdxl_strategy.py +131 -0
ollamadiffuser/core/inference/strategies/video_strategy.py +108 -0
ollamadiffuser/mcp/__init__.py +0 -0
ollamadiffuser/mcp/server.py +184 -0
ollamadiffuser/ui/templates/index.html +62 -1
ollamadiffuser/ui/web.py +116 -54
{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/METADATA +337 -108
ollamadiffuser-2.0.0.dist-info/RECORD +61 -0
{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/WHEEL +1 -1
{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/entry_points.txt +1 -0
ollamadiffuser/core/models/registry.py +0 -384
ollamadiffuser/ui/samples/.DS_Store +0 -0
ollamadiffuser-1.2.2.dist-info/RECORD +0 -45
{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/licenses/LICENSE +0 -0
{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/top_level.txt +0 -0

{ollamadiffuser-1.2.2.dist-info → ollamadiffuser-2.0.0.dist-info}/METADATA RENAMED Viewed

@@ -1,7 +1,7 @@
 Metadata-Version: 2.4
 Name: ollamadiffuser
-Version: 1.2.2
-Summary: 🎨 Local AI Image Generation with Ollama-style CLI for Stable Diffusion, FLUX.1, and LoRA support
+Version: 2.0.0
+Summary: Local AI Image Generation with Ollama-style CLI for Stable Diffusion, FLUX, and LoRA support
 Home-page: https://github.com/ollamadiffuser/ollamadiffuser
 Author: OllamaDiffuser Team
 Author-email: OllamaDiffuser Team <ollamadiffuser@gmail.com>
@@ -14,7 +14,7 @@ Project-URL: Documentation, https://www.ollamadiffuser.com/
 Project-URL: Bug Reports, https://github.com/ollamadiffuser/ollamadiffuser/issues
 Project-URL: Feature Requests, https://github.com/ollamadiffuser/ollamadiffuser/issues
 Project-URL: Source Code, https://github.com/ollamadiffuser/ollamadiffuser
-Keywords: diffusion,image-generation,ai,machine-learning,lora,ollama,stable-diffusion,flux,local-ai,controlnet,web-ui,cli
+Keywords: diffusion,image-generation,ai,machine-learning,lora,ollama,stable-diffusion,flux,local-ai,controlnet,web-ui,cli,img2img,inpainting,mcp,openclaw
 Classifier: Development Status :: 4 - Beta
 Classifier: Intended Audience :: Developers
 Classifier: Intended Audience :: End Users/Desktop
@@ -33,40 +33,64 @@ Classifier: Environment :: Web Environment
 Requires-Python: >=3.10
 Description-Content-Type: text/markdown
 License-File: LICENSE
-Requires-Dist: torch>=2.1.0
-Requires-Dist: diffusers>=0.26.0
-Requires-Dist: transformers>=4.35.0
-Requires-Dist: accelerate>=0.25.0
-Requires-Dist: fastapi>=0.104.0
+Requires-Dist: torch>=2.4.0
+Requires-Dist: diffusers>=0.34.0
+Requires-Dist: transformers>=4.40.0
+Requires-Dist: accelerate>=1.0.0
+Requires-Dist: fastapi>=0.110.0
 Requires-Dist: uvicorn>=0.23.0
-Requires-Dist: huggingface-hub>=0.16.0
-Requires-Dist: Pillow>=9.0.0
+Requires-Dist: huggingface-hub>=0.25.0
+Requires-Dist: Pillow>=10.0.0
 Requires-Dist: click>=8.0.0
 Requires-Dist: rich>=13.0.0
 Requires-Dist: pydantic>=2.0.0
 Requires-Dist: protobuf>=3.20.0
 Requires-Dist: sentencepiece>=0.1.99
-Requires-Dist: safetensors>=0.3.0
+Requires-Dist: safetensors>=0.4.0
 Requires-Dist: python-multipart>=0.0.0
 Requires-Dist: psutil>=5.9.0
 Requires-Dist: jinja2>=3.0.0
-Requires-Dist: peft>=0.10.0
-Requires-Dist: numpy>=1.21.0
+Requires-Dist: peft>=0.13.0
+Requires-Dist: numpy>=1.26.0
 Requires-Dist: controlnet-aux>=0.0.7
 Requires-Dist: opencv-python>=4.8.0
-Requires-Dist: stable-diffusion-cpp-python>=0.1.0
-Requires-Dist: gguf>=0.1.0
+Requires-Dist: requests>=2.28.0
+Requires-Dist: PyYAML>=6.0
+Provides-Extra: gguf
+Requires-Dist: stable-diffusion-cpp-python>=0.1.0; extra == "gguf"
+Requires-Dist: gguf>=0.1.0; extra == "gguf"
+Provides-Extra: full
+Requires-Dist: stable-diffusion-cpp-python>=0.1.0; extra == "full"
+Requires-Dist: gguf>=0.1.0; extra == "full"
+Requires-Dist: mcp[cli]>=1.0.0; extra == "full"
+Provides-Extra: mcp
+Requires-Dist: mcp[cli]>=1.0.0; extra == "mcp"
+Provides-Extra: openclaw
+Requires-Dist: mcp[cli]>=1.0.0; extra == "openclaw"
 Provides-Extra: dev
 Requires-Dist: pytest>=7.0.0; extra == "dev"
 Requires-Dist: pytest-asyncio>=0.21.0; extra == "dev"
+Requires-Dist: pytest-cov>=4.0.0; extra == "dev"
+Requires-Dist: httpx>=0.24.0; extra == "dev"
 Requires-Dist: black>=23.0.0; extra == "dev"
 Requires-Dist: isort>=5.12.0; extra == "dev"
 Requires-Dist: flake8>=6.0.0; extra == "dev"
+Requires-Dist: mypy>=1.0.0; extra == "dev"
 Dynamic: author
 Dynamic: home-page
 Dynamic: license-file
 Dynamic: requires-python
+### ⚠️ Project Status: Maintenance Mode
+**Thank you for the incredible support and over 5,000 downloads!**
+Please be aware that `ollamadiffuser` is currently in **maintenance mode**. Due to the creator's other professional commitments, active feature development has been paused.
+The project in its current state is stable and will remain available for use. However, new features will not be added, and non-critical issues may not be addressed in the near future.
+This project laid the foundation for a more ambitious vision: **[LocalKinAI](https://github.com/LocalKinAI)**. Thank you for being part of the journey.
 # OllamaDiffuser 🎨
 [![PyPI version](https://badge.fury.io/py/ollamadiffuser.svg)](https://badge.fury.io/py/ollamadiffuser)
@@ -76,102 +100,93 @@ Dynamic: requires-python
 ## Local AI Image Generation with OllamaDiffuser
-**OllamaDiffuser** simplifies local deployment of **Stable Diffusion**, **FLUX.1**, and other AI image generation models. An intuitive **local SD** tool inspired by **Ollama's** simplicity - perfect for **local diffuser** workflows with CLI, web UI, and LoRA support.
+**OllamaDiffuser** simplifies local deployment of **Stable Diffusion**, **FLUX**, **CogView4**, **Kolors**, **SANA**, **PixArt-Sigma**, and 40+ other AI image generation models. An intuitive **local SD** tool inspired by **Ollama's** simplicity - perfect for **local diffuser** workflows with CLI, web UI, and LoRA support.
 🌐 **Website**: [ollamadiffuser.com](https://www.ollamadiffuser.com/) | 📦 **PyPI**: [pypi.org/project/ollamadiffuser](https://pypi.org/project/ollamadiffuser/)
----
-## 🔑 Hugging Face Authentication
-**Do you need a Hugging Face token?** It depends on which models you want to use!
-### 🟢 Models that DON'T require a token:
-- **FLUX.1-schnell** - Apache 2.0 license, ready to use ✅
-- **Stable Diffusion 1.5** - Basic model, no authentication needed ✅
-- **Most ControlNet models** - Generally public access ✅
+> **Upgrading from v1.x?** v2.0 is a major rewrite requiring **Python 3.10+**. Run `pip install --upgrade "ollamadiffuser[full]"` and see the [Migration Guide](#-migration-guide) below.
-### 🟡 Models that DO require a token:
-- **FLUX.1-dev** - Requires HF token and license agreement ⚠️
-- **Stable Diffusion 3.5** - Requires HF token and license agreement ⚠️
-- **Some premium LoRAs** - Gated models from Hugging Face ⚠️
+---
-### 🚀 Quick Setup
+## 🚀 Quick Start (v2.0)
-**For basic usage** (no token needed):
+**For Mac/PC Users:**
 ```bash
-# These work immediately without any setup:
-ollamadiffuser pull flux.1-schnell
-ollamadiffuser pull stable-diffusion-1.5
+pip install "ollamadiffuser[full]"
+ollamadiffuser recommend  # Find which models fit your GPU
 ```
-**For advanced models** (token required):
+**For OpenClaw/Agent Users:**
 ```bash
-# 1. Set your token
-export HF_TOKEN=your_token_here
+pip install "ollamadiffuser[mcp]"
+ollamadiffuser mcp        # Starts the MCP server
+```
-# 2. Now you can access gated models
-ollamadiffuser pull flux.1-dev
-ollamadiffuser pull stable-diffusion-3.5-medium
+**For Low-VRAM / Budget GPU Users:**
+```bash
+pip install "ollamadiffuser[gguf]"
+ollamadiffuser pull flux.1-dev-gguf-q4ks  # Only 6GB VRAM needed
+ollamadiffuser run flux.1-dev-gguf-q4ks
 ```
-### 🔧 How to get a Hugging Face token:
-1. **Create account**: Visit [huggingface.co](https://huggingface.co) and sign up
-2. **Generate token**: Go to Settings → Access Tokens → Create new token
-3. **Accept licenses**: Visit the model pages and accept license agreements:
-   - [FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev)
-   - [Stable Diffusion 3.5](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium)
-4. **Set environment variable**:
-   ```bash
-   # Temporary (current session)
-   export HF_TOKEN=your_token_here
-   # Permanent (add to ~/.bashrc or ~/.zshrc)
-   echo 'export HF_TOKEN=your_token_here' >> ~/.bashrc
-   ```
-### 💡 Pro Tips:
-- **Start simple**: Begin with FLUX.1-schnell (no token required, commercial use OK)
-- **Token scope**: Use "read" permissions for downloading models
-- **Privacy**: Your token stays local - never shared with OllamaDiffuser servers
-- **Troubleshooting**: If downloads fail, verify your token and model access permissions
+Most models work **without any token** -- just install and go. See [Hugging Face Authentication](#-hugging-face-authentication) when you want gated models like FLUX.1-dev or SD 3.5.
 ---
 ## ✨ Features
-- **🚀 Fast Startup**: Instant application launch with lazy loading architecture
+- **🏗️ Strategy Architecture**: Clean per-model strategy pattern (SD1.5, SDXL, FLUX, SD3, ControlNet, Video, HiDream, GGUF, Generic)
+- **🌐 40+ Models**: FLUX.2, SD 3.5, SDXL Lightning, CogView4, Kolors, SANA, PixArt-Sigma, and more
+- **🔌 Generic Pipeline**: Add new diffusers models via registry config alone -- no code changes needed
+- **🖼️ img2img & Inpainting**: Image-to-image and inpainting support across SD1.5, SDXL, and the API/Web UI
+- **⚡ Async API**: Non-blocking FastAPI server using `asyncio.to_thread` for GPU operations
+- **🎲 Random Seeds**: Reproducible generation with explicit seeds, random by default
 - **🎛️ ControlNet Support**: Precise image generation control with 10+ control types
 - **🔄 LoRA Integration**: Dynamic LoRA loading and management
-- **📦 GGUF Support**: Memory-efficient quantized models (3GB VRAM minimum!)
+- **🔌 MCP & OpenClaw**: Model Context Protocol server for AI assistant integration (OpenClaw, Claude Code, Cursor)
+- **🍎 Apple Silicon**: MPS dtype safety, GGUF Metal acceleration, `ollamadiffuser recommend` for hardware-aware model suggestions
+- **📦 GGUF Support**: Memory-efficient quantized models (3GB VRAM minimum!) with CUDA and Metal acceleration
 - **🌐 Multiple Interfaces**: CLI, Python API, Web UI, and REST API
 - **📦 Model Management**: Easy installation and switching between models
 - **⚡ Performance Optimized**: Memory-efficient with GPU acceleration
-- **🎨 Professional Results**: High-quality image generation with fine-tuned control
-## 🚀 Quick Start
+- **🧪 Test Suite**: 82 tests across settings, registry, engine, API, MPS, and MCP
 ### Option 1: Install from PyPI (Recommended)
 ```bash
 # Install from PyPI
 pip install ollamadiffuser
-# Pull and run a model (4-command setup)
+# Pull and run a model
 ollamadiffuser pull flux.1-schnell
 ollamadiffuser run flux.1-schnell
-# Generate via API
+# Generate via API (seed is optional for reproducibility)
 curl -X POST http://localhost:8000/api/generate \
   -H "Content-Type: application/json" \
-  -d '{"prompt": "A beautiful sunset"}' \
+  -d '{"prompt": "A beautiful sunset", "seed": 12345}' \
   --output image.png
 ```
+### 🔄 Update to Latest Version
+**Always use the latest version** for the newest features and bug fixes:
+```bash
+# Update to latest version
+pip uninstall ollamadiffuser
+pip install --no-cache-dir ollamadiffuser
+```
+This ensures you get:
+- 🐛 **Latest bug fixes**
+- ✨ **New features and improvements**
+- 🚀 **Performance optimizations**
+- 🔒 **Security updates**
 ### GGUF Quick Start (Low VRAM)
 ```bash
 # For systems with limited VRAM (3GB+)
-pip install ollamadiffuser stable-diffusion-cpp-python gguf
+pip install "ollamadiffuser[gguf]"
 # Download memory-efficient GGUF model
 ollamadiffuser pull flux.1-dev-gguf-q4ks
@@ -180,6 +195,22 @@ ollamadiffuser pull flux.1-dev-gguf-q4ks
 ollamadiffuser run flux.1-dev-gguf-q4ks
 ```
+### Apple Silicon Quick Start (Mac Mini / MacBook)
+```bash
+# See which models fit your Mac
+ollamadiffuser recommend
+# Best lightweight model (0.6B, <6GB)
+ollamadiffuser pull pixart-sigma
+ollamadiffuser run pixart-sigma
+# GGUF with Metal acceleration (6GB, great quality)
+pip install "ollamadiffuser[gguf]"
+CMAKE_ARGS="-DSD_METAL=ON" pip install stable-diffusion-cpp-python
+ollamadiffuser pull flux.1-dev-gguf-q4ks
+ollamadiffuser run flux.1-dev-gguf-q4ks
+```
 ### Option 2: Development Installation
 ```bash
 # Clone the repository
@@ -229,21 +260,92 @@ curl -X POST http://localhost:8000/api/generate/controlnet \
 ---
-## 🎯 Supported Models
+## 🔑 Hugging Face Authentication
+**Do you need a Hugging Face token?** It depends on which models you want to use!
-Choose from a variety of state-of-the-art image generation models:
+**Models that DON'T require a token** -- ready to use right away:
+- FLUX.1-schnell, Stable Diffusion 1.5, DreamShaper, PixArt-Sigma, SANA 1.5, most ControlNet models
-| Model | License | Quality | Speed | Commercial Use | VRAM |
-|-------|---------|---------|-------|----------------|------|
-| **FLUX.1-schnell** | Apache 2.0 | High | **4 steps** (12x faster) | ✅ Commercial OK | 20GB+ |
-| **FLUX.1-dev** | Non-commercial | High | 50 steps | ❌ Non-commercial | 20GB+ |
-| **FLUX.1-dev-gguf** | Non-commercial | High | 4 steps | ❌ Non-commercial | **3-16GB** |
-| **Stable Diffusion 3.5** | CreativeML | Medium | 28 steps | ⚠️ Check License | 12GB+ |
-| **Stable Diffusion 1.5** | CreativeML | Fast | Lightweight | ⚠️ Check License | 6GB+ |
+**Models that DO require a token:**
+- FLUX.1-dev, Stable Diffusion 3.5, some premium LoRAs
+**Setup** (only needed for gated models):
+```bash
+# 1. Create account at https://huggingface.co and generate an access token
+# 2. Accept license on the model page (e.g. FLUX.1-dev, SD 3.5)
+# 3. Set your token
+export HF_TOKEN=your_token_here
+# 4. Now you can access gated models
+ollamadiffuser pull flux.1-dev
+ollamadiffuser pull stable-diffusion-3.5-medium
+```
+> **Tips:** Use "read" permissions for the token. Your token stays local -- never shared with OllamaDiffuser servers. Add `export HF_TOKEN=...` to `~/.bashrc` or `~/.zshrc` to make it permanent.
+---
+## 🎯 Supported Models
+Choose from 40+ models spanning every major architecture:
+### Core Models
+| Model | Type | Steps | VRAM | Commercial | License |
+|-------|------|-------|------|------------|---------|
+| `flux.1-schnell` | flux | 4 | 16GB+ | ✅ | Apache 2.0 |
+| `flux.1-dev` | flux | 20 | 20GB+ | ❌ | Non-commercial |
+| `stable-diffusion-3.5-medium` | sd3 | 28 | 8GB+ | ⚠️ | Stability AI |
+| `stable-diffusion-3.5-large` | sd3 | 28 | 12GB+ | ⚠️ | Stability AI |
+| `stable-diffusion-3.5-large-turbo` | sd3 | 4 | 12GB+ | ⚠️ | Stability AI |
+| `stable-diffusion-xl-base` | sdxl | 50 | 6GB+ | ⚠️ | CreativeML |
+| `stable-diffusion-1.5` | sd15 | 50 | 4GB+ | ⚠️ | CreativeML |
+### Next-Generation Models
+| Model | Origin | Params | Steps | VRAM | Commercial | License |
+|-------|--------|--------|-------|------|------------|---------|
+| `flux.2-dev` | Black Forest Labs | 32B | 28 | 14GB+ | ❌ | Non-commercial |
+| `flux.2-klein-4b` | Black Forest Labs | 4B | 28 | 10GB+ | ✅ | Apache 2.0 |
+| `z-image-turbo` | Alibaba (Tongyi) | 6B | 8 | 10GB+ | ✅ | Apache 2.0 |
+| `sana-1.5` | NVIDIA | 1.6B | 20 | 8GB+ | ✅ | Apache 2.0 |
+| `cogview4` | Zhipu AI | 6B | 50 | 12GB+ | ✅ | Apache 2.0 |
+| `kolors` | Kuaishou | 8.6B | 50 | 8GB+ | ✅ | Kolors License |
+| `hunyuan-dit` | Tencent | 1.5B | 50 | 6GB+ | ✅ | Tencent Community |
+| `lumina-2` | Alpha-VLLM | 2B | 30 | 8GB+ | ✅ | Apache 2.0 |
+| `pixart-sigma` | PixArt | 0.6B | 20 | 6GB+ | ✅ | Open |
+| `auraflow` | Fal | 6.8B | 50 | 12GB+ | ✅ | Apache 2.0 |
+| `omnigen` | BAAI | 3.8B | 50 | 12GB+ | ✅ | MIT |
+### Fast / Turbo Models
+| Model | Steps | VRAM | Notes |
+|-------|-------|------|-------|
+| `sdxl-turbo` | 1 | 6GB+ | Single-step distilled SDXL |
+| `sdxl-lightning-4step` | 4 | 6GB+ | ByteDance, custom scheduler |
+| `stable-diffusion-3.5-large-turbo` | 4 | 12GB+ | Distilled SD 3.5 Large |
+| `z-image-turbo` | 8 | 10GB+ | Alibaba 6B turbo |
+### Community Fine-Tunes
+| Model | Base | Notes |
+|-------|------|-------|
+| `realvisxl-v4` | SDXL | Photorealistic, very popular |
+| `dreamshaper` | SD 1.5 | Versatile artistic model |
+| `realistic-vision-v6` | SD 1.5 | Portrait specialist |
+### FLUX Pipeline Variants
+| Model | Pipeline | Use Case |
+|-------|----------|----------|
+| `flux.1-fill-dev` | FluxFillPipeline | Inpainting / outpainting |
+| `flux.1-canny-dev` | FluxControlPipeline | Canny edge control |
+| `flux.1-depth-dev` | FluxControlPipeline | Depth map control |
 ### 💾 GGUF Models - Reduced Memory Requirements
-**NEW**: GGUF quantized models enable running FLUX.1-dev on budget hardware!
+GGUF quantized models enable running FLUX.1-dev on budget hardware:
 | GGUF Variant | VRAM | Quality | Best For |
 |--------------|------|---------|----------|
@@ -254,11 +356,6 @@ Choose from a variety of state-of-the-art image generation models:
 📖 **[Complete GGUF Guide](GGUF_GUIDE.md)** - Hardware recommendations, installation, and optimization tips
-### Why Choose FLUX.1-schnell?
-- **Apache 2.0 license** - Perfect for commercial use
-- **4-step generation** - Lightning fast results
-- **Commercial OK** - Use in your business
 ---
 ## 🎛️ ControlNet Features
@@ -319,6 +416,16 @@ ollamadiffuser lora unload
 ollamadiffuser pull stable-diffusion-1.5
 ollamadiffuser run stable-diffusion-1.5
+# Model registry management
+ollamadiffuser registry list
+ollamadiffuser registry list --installed-only
+ollamadiffuser registry check-gguf
+# Configuration management
+ollamadiffuser config                                    # show all config
+ollamadiffuser config set models_dir /mnt/ssd/models     # custom model path
+ollamadiffuser config set server.port 9000               # change server port
 # In another terminal, generate images via API
 curl -X POST http://localhost:8000/api/generate \
   -H "Content-Type: application/json" \
@@ -350,18 +457,75 @@ Features:
 ```bash
 # Start API server
 ollamadiffuser --mode api
 ollamadiffuser load stable-diffusion-1.5
-# Generate image
+# Text-to-image
 curl -X POST http://localhost:8000/api/generate \
   -H "Content-Type: application/json" \
-  -d '{"prompt": "a beautiful landscape", "width": 1024, "height": 1024}'
+  -d '{"prompt": "a beautiful landscape", "width": 1024, "height": 1024, "seed": 42}'
+# Image-to-image
+curl -X POST http://localhost:8000/api/generate/img2img \
+  -F "prompt=oil painting style" \
+  -F "strength=0.75" \
+  -F "image=@input.png" \
+  --output result.png
+# Inpainting
+curl -X POST http://localhost:8000/api/generate/inpaint \
+  -F "prompt=a red car" \
+  -F "image=@photo.png" \
+  -F "mask=@mask.png" \
+  --output inpainted.png
+# API docs: http://localhost:8000/docs
+```
-# API document
-http://localhost:8000/docs
+### MCP Server (AI Assistant Integration)
+OllamaDiffuser includes a [Model Context Protocol](https://modelcontextprotocol.io/) server for integration with AI assistants like OpenClaw, Claude Code, and Cursor.
+```bash
+# Install MCP support
+pip install "ollamadiffuser[mcp]"
+# Start MCP server (stdio transport)
+ollamadiffuser mcp
 ```
+**MCP client configuration** (e.g. `claude_desktop_config.json`):
+```json
+{
+  "mcpServers": {
+    "ollamadiffuser": {
+      "command": "ollamadiffuser-mcp"
+    }
+  }
+}
+```
+**Available MCP tools:**
+- `generate_image` -- Generate images from text prompts (auto-loads model)
+- `list_models` -- List available and installed models
+- `load_model` -- Load a model into memory
+- `get_status` -- Check device, loaded model, and system status
+### OpenClaw AgentSkill
+An [OpenClaw](https://github.com/openclaw/openclaw) skill is included at `integrations/openclaw/SKILL.md`. It uses the REST API with `response_format=b64_json` for agent-friendly base64 image responses. Copy the skill directory to your OpenClaw skills folder or publish to ClawHub.
+### Base64 JSON API Response
+For AI agents and messaging platforms, use `response_format=b64_json` to get images as JSON:
+```bash
+curl -X POST http://localhost:8000/api/generate \
+  -H "Content-Type: application/json" \
+  -d '{"prompt": "a sunset over mountains", "response_format": "b64_json"}'
+```
+Response: `{"image": "<base64 PNG>", "format": "png", "width": 1024, "height": 1024}`
 ### Python API
 ```python
 from ollamadiffuser.core.models.manager import model_manager
@@ -370,30 +534,59 @@ from ollamadiffuser.core.models.manager import model_manager
 success = model_manager.load_model("stable-diffusion-1.5")
 if success:
     engine = model_manager.loaded_model
-    # Generate image
+    # Text-to-image (seed is optional; omit for random)
     image = engine.generate_image(
         prompt="a beautiful sunset",
         width=1024,
-        height=1024
+        height=1024,
+        seed=42,
     )
     image.save("output.jpg")
+    # Image-to-image
+    from PIL import Image
+    input_img = Image.open("photo.jpg")
+    result = engine.generate_image(
+        prompt="watercolor painting",
+        image=input_img,
+        strength=0.7,
+    )
+    result.save("img2img_output.jpg")
 else:
     print("Failed to load model")
 ```
-## 📦 Supported Models
+## 📦 Model Ecosystem
 ### Base Models
-- **Stable Diffusion 1.5**: Classic, reliable, fast
-- **Stable Diffusion XL**: High-resolution, detailed
-- **Stable Diffusion 3**: Latest architecture
-- **FLUX.1**: State-of-the-art quality
+- **Stable Diffusion 1.5**: Classic, reliable, fast (img2img + inpainting)
+- **Stable Diffusion XL**: High-resolution, detailed (img2img + inpainting, scheduler overrides)
+- **Stable Diffusion 3.5**: Medium, Large, and Large Turbo variants
+- **FLUX.1**: schnell, dev, Fill, Canny, Depth pipeline variants
+- **HiDream**: Multi-prompt generation with bfloat16
+- **AnimateDiff**: Video/animation generation
+### Next-Generation Models
+- **FLUX.2**: 32B dev and 4B Klein variants from Black Forest Labs
+- **Chinese Models**: CogView4 (Zhipu), Kolors (Kuaishou), Hunyuan-DiT (Tencent), Z-Image (Alibaba)
+- **Efficient Models**: SANA 1.5 (1.6B), PixArt-Sigma (0.6B) -- high quality at low VRAM
+- **Open Models**: AuraFlow (6.8B, Apache 2.0), OmniGen (3.8B, MIT), Lumina 2.0 (2B, Apache 2.0)
+### Fast / Turbo Models
+- **SDXL Turbo**: Single-step inference from Stability AI
+- **SDXL Lightning**: 4-step with custom scheduler from ByteDance
+- **Z-Image Turbo**: 8-step turbo from Alibaba
+### Community Fine-Tunes
+- **RealVisXL V4**: Photorealistic SDXL, very popular
+- **DreamShaper**: Versatile artistic SD 1.5 model
+- **Realistic Vision V6**: Portrait specialist
 ### GGUF Quantized Models
 - **FLUX.1-dev GGUF**: 7 quantization levels (3GB-16GB VRAM)
 - **Memory Efficient**: Run high-quality models on budget hardware
-- **Same API**: Works seamlessly with existing commands
+- **Optional Install**: `pip install "ollamadiffuser[gguf]"`
 ### ControlNet Models
 - **SD 1.5 ControlNet**: 4 control types (canny, depth, openpose, scribble)
@@ -405,14 +598,32 @@ else:
 - **Dynamic Loading**: Load/unload without model restart
 - **Strength Control**: Adjustable influence (0.1-2.0)
-## ⚙️ Configuration
+## ⚙️ Architecture
+### Strategy Pattern Engine
+Each model type has a dedicated strategy class handling loading and generation:
-### Model Configuration
+```
+InferenceEngine (facade)
+  -> SD15Strategy            (512x512, float32 on MPS, img2img, inpainting)
+  -> SDXLStrategy            (1024x1024, img2img, inpainting, scheduler overrides)
+  -> FluxStrategy            (schnell/dev/Fill/Canny/Depth, dynamic pipeline class)
+  -> SD3Strategy             (1024x1024, 28 steps, guidance=3.5)
+  -> ControlNetStrategy      (SD15 + SDXL base models)
+  -> VideoStrategy           (AnimateDiff, 16 frames)
+  -> HiDreamStrategy         (bfloat16, multi-prompt)
+  -> GGUFStrategy            (quantized via stable-diffusion-cpp)
+  -> GenericPipelineStrategy (any diffusers pipeline via config)
+```
+The `GenericPipelineStrategy` dynamically loads any `diffusers` pipeline class specified in the model registry, so new models can be added with zero code changes.
+### Configuration
 Models are automatically configured with optimal settings:
 - **Memory Optimization**: Attention slicing, CPU offloading
 - **Device Detection**: Automatic CUDA/MPS/CPU selection
-- **Precision Handling**: FP16/BF16 support for efficiency
-- **Safety Features**: NSFW filter bypass for creative freedom
+- **Precision Handling**: FP16/BF16 per model type
+- **Safety Disabled**: Unified `SAFETY_DISABLED_KWARGS` (no monkey-patching)
 ## 🔧 Advanced Usage
@@ -487,7 +698,7 @@ with open("control.jpg", "rb") as f:
 ### Minimum Requirements
 - **RAM**: 8GB system RAM
 - **Storage**: 10GB free space
-- **Python**: 3.8+
+- **Python**: 3.10+
 ### Recommended Hardware
@@ -496,6 +707,12 @@ with open("control.jpg", "rb") as f:
 - **RAM**: 16GB+ system RAM
 - **Storage**: SSD with 50GB+ free space
+#### For Apple Silicon (Mac Mini / MacBook)
+- **16GB unified memory**: PixArt-Sigma, SANA 1.5, DreamShaper, SD 1.5/XL, GGUF q2k-q5ks
+- **24GB+ unified memory**: CogView4, Kolors, Lumina 2.0, GGUF q6k-q8
+- **GGUF with Metal**: Install with `CMAKE_ARGS="-DSD_METAL=ON"` for GPU acceleration
+- Run `ollamadiffuser recommend` to see what fits your hardware
 #### For GGUF Models (Memory Efficient)
 - **GPU**: 3GB+ VRAM (or CPU only)
 - **RAM**: 8GB+ system RAM (16GB+ for CPU inference)
@@ -503,7 +720,7 @@ with open("control.jpg", "rb") as f:
 ### Supported Platforms
 - **CUDA**: NVIDIA GPUs (recommended)
-- **MPS**: Apple Silicon (M1/M2/M3)
+- **MPS**: Apple Silicon (M1/M2/M3/M4) -- native support for 30+ models including GGUF
 - **CPU**: All platforms (slower but functional)
 ## 🔧 Troubleshooting
@@ -534,7 +751,7 @@ pip install 'ollamadiffuser[full]'
 #### GGUF Support Issues
 ```bash
 # Install GGUF dependencies
-pip install stable-diffusion-cpp-python gguf
+pip install "ollamadiffuser[gguf]"
 # Check GGUF support
 ollamadiffuser registry check-gguf
@@ -673,9 +890,21 @@ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file
 ## 🙏 Acknowledgments
 - **Stability AI**: For Stable Diffusion models
-- **Black Forest Labs**: For FLUX.1 models
+- **Black Forest Labs**: For FLUX.1 and FLUX.2 models
+- **Alibaba (Tongyi-MAI)**: For Z-Image Turbo
+- **NVIDIA (Efficient-Large-Model)**: For SANA 1.5
+- **Zhipu AI (THUDM)**: For CogView4
+- **Kuaishou (Kwai-Kolors)**: For Kolors
+- **Tencent (Hunyuan)**: For Hunyuan-DiT
+- **Alpha-VLLM**: For Lumina 2.0
+- **PixArt-alpha**: For PixArt-Sigma
+- **Fal**: For AuraFlow
+- **BAAI (Shitao)**: For OmniGen
+- **ByteDance**: For SDXL Lightning
 - **city96**: For FLUX.1-dev GGUF quantizations
 - **Hugging Face**: For model hosting and diffusers library
+- **Anthropic**: For Model Context Protocol (MCP)
+- **OpenClaw**: For AI agent ecosystem integration
 - **ControlNet Team**: For ControlNet architecture
 - **Community**: For feedback and contributions

ollamadiffuser 1.2.2__py3-none-any.whl → 2.0.0__py3-none-any.whl

ollamadiffuser 1.2.2py3-none-any.whl → 2.0.0py3-none-any.whl