@agentutility/mcp-synthforge 0.1.8 → 0.7.4

package/README.md CHANGED
@@ -2,7 +2,7 @@
2
2
 
3
3
  > Generative media for agents that ship products.
4
4
 
5
- Image, video, music, voice generation across three price tiers. One API surface, USDC-settled, no SaaS account.
5
+ Image, video, music, and voice generation with fast Venice models, pro creative models, and flagship Google/OpenAI image tiers. One API surface, USDC-settled, no SaaS account.
6
6
 
7
7
  **Pricing:** pay-per-call in USDC on Base. No subscriptions, no API keys. See per-tool prices below.
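
As a rough illustration of pay-per-call budgeting, the sketch below totals a planned batch of calls. The three prices are copied from the tool table in this README; the `PRICES` mapping, the `estimate_cost` and `to_atomic_units` helpers, and the batch itself are hypothetical and not part of the package. USDC uses 6 decimal places, so the total is also shown in atomic units.

```python
from decimal import Decimal

# Per-call prices in USDC, copied from the tool table (a small subset).
PRICES = {
    "image-expand": Decimal("0.15"),
    "background-remove": Decimal("0.08"),
    "music-generate": Decimal("0.05"),
}

USDC_DECIMALS = 6  # USDC uses 6 decimal places on-chain


def estimate_cost(calls: dict[str, int]) -> Decimal:
    """Return the total USDC cost for a mapping of tool name -> call count."""
    return sum((PRICES[tool] * count for tool, count in calls.items()), Decimal("0"))


def to_atomic_units(amount: Decimal) -> int:
    """Convert a USDC amount to integer atomic units (1 USDC = 1_000_000 units)."""
    return int(amount * 10**USDC_DECIMALS)


if __name__ == "__main__":
    # Hypothetical batch: 4 expands, 10 background removals, 2 music tracks.
    batch = {"image-expand": 4, "background-remove": 10, "music-generate": 2}
    total = estimate_cost(batch)
    print(f"{total} USDC = {to_atomic_units(total)} atomic units")
```

`Decimal` is used instead of `float` so that small per-call prices add up without rounding drift.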
8
8
 
@@ -22,7 +22,7 @@ Edit `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS) o
22
22
  }
23
23
  ```
24
24
 
25
- Restart Claude Desktop. 14 tools appear in the tool palette.
25
+ Restart Claude Desktop. 52 tools appear in the tool palette.
26
26
 
27
27
  ## Install — Cursor
28
28
 
@@ -46,29 +46,67 @@ Send any amount of **USDC on Base mainnet** to the address derived from your `X4
46
46
 
47
47
  USDC on Base contract: `0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913`
48
48
 
49
- ## Tools (14)
49
+ ## Tools (52)
50
50
 
51
51
  | Tool | Description |
52
52
  |---|---|
53
- | `image-edit` | (0.02 USDC/call) Image edit / instruction-based image edit / text-driven photo edit / nano-banana image editor / GPT-image-2 edit. Mask-free, instruction-driven image editing: describe the change in text and the model applies it to the whole image. Default model: nano-banana-pro. Returns a permanent fal-hosted PNG URL. |
53
+ | `ad-banner-image-expand` | (0.15 USDC/call) Expands existing images into wide, square, or portrait ad banner placements. Same Bria Expand backend as image-expand. Use it as an ad banner image expand API, paid social canvas outpaint, or campaign creative extender. |
54
+ | `ad-creative-image-generate` | (0.01 USDC/call) Generates ad creative images: SFW marketing visuals, concept images, and campaign thumbnails from prompts. Same backend as image-generate. Use it as a campaign visual generator or paid social creative API. |
55
+ | `app-icon-image-generate` | (0.02 USDC/call) Generates SFW square app icon concepts, product marks, and launch visuals. Same image-generate backend. Use it as an app icon generator API for mobile icon concepts or SaaS icon image generation. |
56
+ | `app-store-screenshot-expand` | (0.15 USDC/call) Expands mobile app screenshots into store, ad, and product-page layouts. Same Bria Expand backend as image-expand. Use it as an app store screenshot expand API, screenshot backdrop extender, or launch asset outpaint. |
57
+ | `background-remove` | (0.08 USDC/call) Removes the background from a public image URL and returns the subject with alpha transparency. Optional crop_to_bbox. fal.ai imageutils/rembg. Use it as a background remove API, image background remover, or transparent PNG foreground cutout. |
58
+ | `background-remover` | (0.08 USDC/call) Removes the background from product, portrait, and object images, returning alpha PNG foreground cutouts with optional crop-to-bounding-box behavior. Same remove-bg backend. Use it as a background remover API, to remove image background, or for transparent PNG cutouts. |
59
+ | `banner-image-expand` | (0.15 USDC/call) Turns existing photos into wide website, ad, and email-header canvases. Same backend as image-expand. Use it as a banner image expand API, hero image outpainting tool, or website header extender. |
60
+ | `blog-header-image-generate` | (0.01 USDC/call) Generates blog header images: SFW header images, newsletter visuals, and documentation illustrations from prompts. Same backend as image-generate. Use it as an article hero image or editorial illustration generator. |
61
+ | `book-cover-image-generate` | (0.02 USDC/call) Generates SFW book cover art concepts, genre mood boards, and launch visuals from a prompt. Same image-generate backend. Use it as a book cover image generator API, ebook cover concept tool, or publishing creative source. |
62
+ | `brand-logo-concept-image` | (0.02 USDC/call) Generates SFW logo concept boards, abstract marks, and brand direction visuals from prompts. Same fast Venice-backed backend as image-generate. Use it as a brand logo concept image API, for startup logo visual ideation, or as a mark concept generator. |
63
+ | `course-thumbnail-expand` | (0.15 USDC/call) Expands existing screenshots, instructor photos, or diagrams into 16:9 course thumbnails. Same image-expand backend. Use it as a course thumbnail expand API, tutorial cover outpaint, or education hero image extender. |
64
+ | `course-thumbnail-image-generate` | (0.02 USDC/call) Generates SFW course thumbnails, lesson covers, learning-path artwork, and academy launch visuals, without text rendering. Same fast image-generate backend. Use it as a course thumbnail image generator API, education cover art tool, or tutorial hero image source. |
65
+ | `ecommerce-lifestyle-image-generate` | (0.02 USDC/call) Generates SFW ecommerce lifestyle scenes and ad visuals around product prompts. Same image-generate backend. Use it as an ecommerce lifestyle image generator API, product lifestyle creative tool, or marketplace campaign image source. |
66
+ | `email-header-image-expand` | (0.15 USDC/call) Expands existing product, event, or editorial images into wide email-header canvases. Same image-expand backend. Use it as an email header image expand API, newsletter banner outpaint, or lifecycle email creative extender. |
67
+ | `event-poster-image-generate` | (0.02 USDC/call) Generates SFW event artwork, launch posters, and campaign backgrounds without relying on stock art. Same fast image-generate backend. Use it as an event poster image generator API, conference visual tool, or meetup campaign image source. |
68
+ | `fashion-lookbook-image-generate` | (0.02 USDC/call) Generates SFW fashion visuals: clothing concepts, seasonal lookbook scenes, and ecommerce fashion creative from prompts. Same image-generate backend. Use it as a fashion lookbook image generator API, apparel campaign visual tool, or style mood board source. |
69
+ | `game-asset-concept-image` | (0.02 USDC/call) Generates SFW game art concepts: character, prop, and environment concepts, item thumbnails, environment mood boards, and prototype visual assets. Same image-generate backend. Use it as a game asset concept image API or a character, prop, and environment concept generator. |
70
+ | `healthcare-campaign-image-generate` | (0.02 USDC/call) Generates SFW healthcare, wellness, clinic, and patient-education campaign visuals, without medical claims. Same image-generate backend. Use it as a healthcare campaign image generator API, clinic marketing visual tool, or wellness program creative source. |
71
+ | `image-describe-api` | (0.02 USDC/call) Describes images from public URLs, producing captions, accessibility alt text, short descriptions, and visual summaries. Same image-description backend. Use it as an image describe API, image captioning service, or alt text generator. |
72
+ | `image-edit` | (0.02 USDC/call) Edits an image from a plain-text instruction, mask-free: describe the change and the model applies it to the whole image. Default model: nano-banana-pro. Returns a permanent fal-hosted PNG URL. Use it for instruction-based image editing, text-driven photo edits, nano-banana image editing, or GPT-image-2 edits. |
54
73
  | `image-expand` | (0.15 USDC/call) AI image outpainting / image expansion. Bria Expand model. Generates realistic content beyond the original borders. Set the canvas size and the placement of the original image. Commercial-license model. |
55
- | `image-generate` | (0.01 USDC/call) Image generate (fast/cheap) / text-to-image / AI art. SFW, sub-5s turnaround. Four tiers backed by curated $0.01 Venice models: 'fast' (z-image-turbo, default), 'creative' (chroma), 'anime' (wai-Illustrious), 'sd35' (venice-sd35). For top-tier quality use image-generate-pro ($0.10, Flux 2 Pro / Recraft / Seedream / Qwen Image 2 Pro) or image-generate-ultra ($0.30, Google nano-banana-pro / OpenAI gpt-image-2). Returns a permanent fal-hosted PNG URL. |
56
- | `image-generate-pro` | (0.10 USDC/call) Image generate (pro) / premium text-to-image / Flux 2 Pro / Recraft / Seedream / Qwen Image 2 Pro / xAI Grok Imagine. Premium multi-model lineup for photoreal, design/illustration, text-in-image, and stylized art. Tiers: 'balanced' (flux-2-pro, default), 'max' (flux-2-max), 'text' (qwen-image-2-pro, best at rendering text in images), 'recraft' (recraft-v4), 'seedream' (seedream-v4), 'grok' (grok-imagine-image), 'art' (imagineart-1.5-pro), 'hunyuan' (hunyuan-image-v3). For flagship Google/OpenAI models use image-generate-ultra. Returns a permanent fal-hosted PNG URL. |
57
- | `image-generate-ultra` | (0.30 USDC/call) Image generate (ultra) / flagship text-to-image / Google nano-banana-pro (Gemini Image 3) / OpenAI gpt-image-2 / Recraft V4 Pro / xAI Grok SOTA. Top proprietary models for the highest quality output. Tiers: 'nano-banana' (nano-banana-pro, default, Google Gemini Image 3), 'nano-banana-2' (cheaper Google variant), 'gpt' (gpt-image-2, OpenAI flagship), 'gpt-1-5' (gpt-image-1-5), 'recraft-pro' (recraft-v4-pro), 'grok-sota' (grok-imagine-image-quality). Output is capped to 1024x1024 to keep wholesale within retail; for higher resolution, chain image-upscale. Returns a permanent fal-hosted PNG URL. |
58
- | `image-inpaint` | (0.02 USDC/call) Image inpainting / mask-based image edit / fill in masked region / object replacement / face swap (mask-driven) / generative fill. Replaces the masked region of an image with content matching a text prompt. White pixels in the mask = region to inpaint. Default model: gpt-image-2. Returns a permanent fal-hosted PNG URL. |
59
- | `image-to-video` | (0.20 USDC/call) Image-to-video / animate still image / Seedance image-to-video / motion-from-photo / camera-movement on photo. Animates a still image into video via Venice's seedance-2-0-fast-image-to-video. Optional prompt steers the motion (camera moves, subject motion). Same async-vs-sync handling as text-to-video. |
60
- | `music-generate` | (0.05 USDC/call) Music generation / text-to-music / AI music / generative song / instrumental and vocal music. Text-to-music via Venice with the minimax-music-v26 model. Optional lyrics input. Duration 5-120 seconds. Returns a permanent fal-hosted audio URL (or a Venice-hosted URL when Venice already provides one). |
61
- | `remove-bg` | (0.08 USDC/call) AI background remover / background eraser / cutout tool. Returns transparent PNG. Optional crop_to_bbox. fal.ai imageutils/rembg. |
62
- | `seedance-video` | (0.20 USDC/call) Seedance 2.0 / Seedance 2.0 video generation / Seedance video AI / generative AI video / text-to-video AI / cinematic AI clips on AI Gateway. Powered by Venice's seedance-2-0-fast-text-to-video model. Duration / aspect-ratio / resolution configurable. Same backend as text-to-video under a model-named slug for direct discovery by agents searching for 'Seedance'. |
63
- | `sound-effect-generate` | (0.01 USDC/call) Sound effect generation / text-to-SFX / Foley generator / ElevenLabs sound effects / ambient audio synth. Text-to-SFX via Venice with elevenlabs-sound-effects-v2. Duration 0.5-22 seconds. Returns a permanent fal-hosted audio URL (or Venice-hosted when applicable). |
64
- | `text-to-speech` | (0.05 USDC/call) Text to speech / TTS / voice generator. Venice TTS (Kokoro / xAI / ElevenLabs / Orpheus / MiniMax / Gemini). 30+ voices, 6 audio formats. Returns hosted MP3 URL. |
65
- | `text-to-video` | (0.20 USDC/call) Text-to-video / AI video / Seedance / generative video / cinematic clip from prompt. Text-to-video via Venice's seedance-2-0-fast-text-to-video. Duration / aspect-ratio / resolution configurable. The synchronous path has a 22s budget; if Venice can't return inline within that window, the response surfaces a job_id + poll_url for the caller to resolve later. |
66
- | `voice` | (0.05 USDC/call) Text-to-speech / TTS / voice synthesis. Venice TTS (Kokoro/xAI/ElevenLabs/Orpheus/MiniMax). 30+ voices, MP3/WAV/OPUS/AAC/FLAC. |
74
+ | `image-generate` | (0.02 USDC/call) Generate an image from text in under 5 seconds. Four style tiers backed by curated $0.01 Venice models: 'fast' (z-image-turbo, default), 'creative' (chroma), 'anime' (wai-Illustrious), 'sd35' (venice-sd35). Returns a permanent fal-hosted PNG URL. SFW text-to-image / AI art at the cheapest tier; for top-tier quality use image-generate-pro ($0.10, Flux 2 Pro / Recraft / Seedream / Qwen Image 2 Pro) or image-generate-ultra ($0.30, Google nano-banana-pro / OpenAI gpt-image-2). |
75
+ | `image-generate-pro` | (0.10 USDC/call) Premium text-to-image generation across a multi-model lineup for photoreal, design/illustration, text-in-image, and stylized art. Tiers: 'balanced' (flux-2-pro, default), 'max' (flux-2-max), 'text' (qwen-image-2-pro, best at rendering text in images), 'recraft' (recraft-v4), 'seedream' (seedream-v4), 'grok' (grok-imagine-image), 'art' (imagineart-1.5-pro), 'hunyuan' (hunyuan-image-v3). For flagship Google/OpenAI models use image-generate-ultra. Returns a permanent fal-hosted PNG URL. Use it for Flux 2 Pro, Recraft, Seedream, Qwen Image 2 Pro, or xAI Grok Imagine image generation. |
76
+ | `image-generate-ultra` | (0.30 USDC/call) Flagship text-to-image generation using top proprietary models for the highest quality output. Tiers: 'nano-banana' (nano-banana-pro, default, Google Gemini Image 3), 'nano-banana-2' (cheaper Google variant), 'gpt' (gpt-image-2, OpenAI flagship), 'gpt-1-5' (gpt-image-1-5), 'recraft-pro' (recraft-v4-pro), 'grok-sota' (grok-imagine-image-quality). Output is capped to 1024x1024 to keep wholesale within retail; for higher resolution, chain image-upscale. Returns a permanent fal-hosted PNG URL. Use it for Google nano-banana-pro, OpenAI gpt-image-2, Recraft V4 Pro, or xAI Grok SOTA image generation. |
77
+ | `image-inpaint` | (0.02 USDC/call) Inpaints an image by replacing the masked region with content matching a text prompt. White pixels in the mask = region to inpaint. Default model: gpt-image-2. Returns a permanent fal-hosted PNG URL. Use it for mask-based image edits, object replacement, mask-driven face swap, or generative fill. |
78
+ | `image-to-video` | (5.00 USDC/call) Animates a still image into video via Venice's seedance-2-0-fast-image-to-video. Optional prompt steers the motion (camera moves, subject motion). Same async-vs-sync handling as text-to-video. Use it for image-to-video, Seedance image-to-video, motion-from-photo, or camera movement on a photo. |
79
+ | `linkedin-banner-expand` | (0.15 USDC/call) Expands existing photos into wide LinkedIn banner and social-profile header canvases. Same Bria Expand backend as image-expand. Use it as a LinkedIn banner expand API, profile header outpainting tool, or brand banner image extender. |
80
+ | `marketplace-product-expand` | (0.15 USDC/call) Expands marketplace product images, adapting product shots to marketplace, catalog, and ad aspect ratios. Same Bria Expand backend as image-expand. Use it as a product image expand API, ecommerce photo canvas extender, or product listing outpaint. |
81
+ | `mobile-story-image-expand` | (0.15 USDC/call) Expands existing images into tall story, reel-cover, and short-form promotional canvases. Same image-expand backend. Use it as a mobile story image expand API, vertical social outpaint, or Instagram and TikTok story extender. |
82
+ | `music-generate` | (0.05 USDC/call) Generates music from a text prompt via Venice using the minimax-music-v26 model. Optional lyrics input. Duration 5-120 seconds. Returns a permanent fal-hosted audio URL (or a Venice-hosted URL when Venice already provides one). Use it for AI music generation, text-to-music, generative songs, and instrumental or vocal music. |
83
+ | `newsletter-image-generate` | (0.02 USDC/call) Generates SFW newsletter artwork, digest headers, and publication graphics. Same image-generate backend. Use it as a newsletter image generator API, email header illustration tool, or editorial campaign visual source. |
84
+ | `podcast-cover-expand` | (0.15 USDC/call) Expands guest photos, product images, or show art into podcast-cover dimensions. Same image-expand backend. Use it as a podcast cover expand API, episode art outpaint, or square artwork canvas extender. |
85
+ | `podcast-cover-image-generate` | (0.02 USDC/call) Generates SFW podcast cover art: square cover concepts, guest episode art, channel thumbnails, and audio-show campaign visuals. Same Venice-backed image-generate backend. Use it as a podcast cover image generator API, episode art tool, or show artwork concept source. |
86
+ | `presentation-hero-image-generate` | (0.02 USDC/call) Generates SFW presentation visuals: deck openers, product narrative visuals, and board-slide hero images. Same image-generate backend. Use it as a presentation hero image generator API, pitch deck visual tool, or slide cover image source. |
87
+ | `product-image-generate` | (0.01 USDC/call) Generates product images for ecommerce: SFW product concepts, studio-style mockups, and catalog creative from prompts. Same fast Venice-backed backend as image-generate. Use it as an ecommerce product creative or marketplace listing image generation API. |
88
+ | `product-photo-background-expand` | (0.15 USDC/call) Expands tight product crops into marketplace, catalog, and lifestyle ad canvases by extending the background. Same image-expand backend. Use it as a product photo background expand API, ecommerce product outpaint, or studio backdrop extender. |
89
+ | `product-photo-expand` | (0.15 USDC/call) Expands the canvas around an existing product shot for product-detail pages, marketplaces, and catalog creative. Same Bria Expand backend as image-expand. Use it as a product photo outpainting API, ecommerce image expand, or product hero image extender. |
90
+ | `real-estate-ad-image-generate` | (0.02 USDC/call) Generates SFW real estate ad visuals: property marketing imagery, neighborhood concept art, open-house ads, and listing campaign backgrounds. Same image-generate backend. Use it as a real estate ad image generator API, property campaign visual tool, or listing promotion creative source. |
91
+ | `real-estate-hero-expand` | (0.15 USDC/call) Expands room, exterior, and neighborhood photos into wide property-page and ad hero canvases. Same image-expand backend. Use it as a real estate hero image expand API, listing banner outpaint, or property website image extender. |
92
+ | `real-estate-photo-expand` | (0.15 USDC/call) Extends an existing room or exterior photo into a wider hero crop or vertical social version for listing creatives. Same backend as image-expand. Use it as a real estate photo outpainting API, property listing image expand, or room photo canvas extender. |
93
+ | `real-estate-render-generate` | (0.02 USDC/call) Generates SFW real estate visuals: listing concepts, room mood boards, exterior concepts, and property campaign imagery. Same image-generate backend. Use it as a real estate render generator API, property marketing image tool, or interior concept visual source. |
94
+ | `remove-background` | (0.08 USDC/call) Removes the background from an image URL and returns the subject as a PNG with alpha channel. Optional crop_to_bbox. fal.ai imageutils/rembg. Use it as a background remove API or transparent PNG cutout. |
95
+ | `remove-bg` | (0.08 USDC/call) Removes the background from an image and returns a foreground cutout with alpha channel. Optional crop_to_bbox. fal.ai imageutils/rembg. Use it as an AI background remover, background eraser, or transparent PNG cutout tool. |
96
+ | `restaurant-menu-image-generate` | (0.02 USDC/call) Generates SFW restaurant imagery: dish concepts, menu hero visuals, and local restaurant ad creative. Same image-generate backend. Use it as a restaurant menu image generator API, food concept visual tool, or hospitality creative source. |
97
+ | `seedance-video` | (5.00 USDC/call) Generates AI video with Seedance 2.0, powered by Venice's seedance-2-0-fast-text-to-video model. Duration, aspect ratio, and resolution are configurable. Same backend as text-to-video under a model-named slug for direct discovery by agents searching for 'Seedance'. Use it for Seedance 2.0 video generation, text-to-video AI, and cinematic AI clips on AI Gateway. |
98
+ | `social-crop-expand` | (0.15 USDC/call) Adapts one image into square, portrait, landscape, and banner canvases. Same backend as image-expand, for agents reformatting creative per platform. Use it as a social crop expand API, an Instagram, TikTok, and LinkedIn crop extender, or an image aspect-ratio outpainting tool. |
99
+ | `social-image-generate` | (0.01 USDC/call) Generates SFW social images in square, portrait, and landscape sizes from prompts. Same backend as image-generate. Use it as a post visual generator or LinkedIn, X, and Instagram creative API. |
100
+ | `sound-effect-generate` | (0.01 USDC/call) Generates sound effects from a text prompt via Venice using elevenlabs-sound-effects-v2. Duration 0.5-22 seconds. Returns a permanent fal-hosted audio URL (or Venice-hosted when applicable). Use it as a text-to-SFX tool, Foley generator, ElevenLabs sound effects endpoint, or ambient audio synth. |
101
+ | `text-to-speech` | (0.05 USDC/call) Converts text to speech with 30+ voices and 5 audio formats. Morpheus is the primary provider for Kokoro; Venice serves as the fallback and provides the alternate TTS models (xAI / ElevenLabs / Orpheus / MiniMax / Gemini). Hosted audio URLs are stored on fal.ai. Use it as a TTS API or voice generator. |
102
+ | `text-to-video` | (5.00 USDC/call) Generates video from a text prompt via Venice's seedance-2-0-fast-text-to-video. Duration, aspect ratio, and resolution are configurable. The synchronous path has a 22s budget; if Venice can't return inline within that window, the response surfaces a job_id + poll_url for the caller to resolve later. Use it for AI video, Seedance generation, generative video, and cinematic clips from a prompt. |
103
+ | `voice` | (0.05 USDC/call) Converts text to speech with 30+ voices and MP3/WAV/OPUS/AAC/FLAC output. Powered by Venice TTS (Kokoro/xAI/ElevenLabs/Orpheus/MiniMax). Use it as a TTS or voice synthesis API. |
104
+ | `youtube-thumbnail-expand` | (0.15 USDC/call) Expands source images into YouTube thumbnail and video-cover canvases. Same Bria Expand backend as image-expand. Use it as a YouTube thumbnail expand API, video thumbnail outpaint, or 16:9 image extender. |
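
The `text-to-video` row describes a 22s synchronous budget with a `job_id` + `poll_url` fallback. A caller might handle both paths roughly as sketched below. Only `job_id` and `poll_url` come from that description; the `url` and `status` fields, the `resolve_video` helper, and the injected `fetch_json` callable are assumptions for illustration, not the documented response schema.

```python
import time
from typing import Any, Callable

Json = dict[str, Any]


def resolve_video(
    response: Json,
    fetch_json: Callable[[str], Json],
    interval_s: float = 5.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Return the final video URL from either a sync or an async response."""
    # Synchronous path: the result came back inline (hypothetical "url" field).
    if "url" in response:
        return response["url"]

    # Asynchronous path: the description says job_id + poll_url are surfaced.
    poll_url = response["poll_url"]
    job_id = response.get("job_id")
    for _ in range(max_polls):
        status = fetch_json(poll_url)
        if "url" in status:
            return status["url"]
        if status.get("status") == "failed":  # assumed failure marker
            raise RuntimeError(f"job {job_id} failed")
        sleep(interval_s)
    raise TimeoutError(f"job {job_id} not ready after {max_polls} polls")
```

Passing `fetch_json` and `sleep` in as parameters keeps the sketch independent of any HTTP library and makes the polling loop easy to exercise without a network.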
67
105
 
68
106
  ## How it works
69
107
 
70
- 1. Agent calls a tool (e.g. `image-edit`).
71
- 2. MCP server POSTs to `https://x402.agentutility.ai/image-edit`.
108
+ 1. Agent calls a tool (e.g. `ad-banner-image-expand`).
109
+ 2. MCP server POSTs to `https://x402.agentutility.ai/ad-banner-image-expand`.
72
110
  3. The endpoint responds **HTTP 402** with payment instructions.
73
111
  4. The MCP server signs an EIP-3009 USDC transfer authorization with `X402_PRIVATE_KEY` and retries.
74
112
  5. CDP facilitator settles on Base.
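
The steps above can be sketched as a request, pay, retry loop. This is a minimal illustration, not the package's implementation: `call_paid_tool`, the injected `post` and `sign_payment` callables, the `Response` type, and the `X-PAYMENT` header name are assumptions, and the actual signing (an EIP-3009 authorization made with `X402_PRIVATE_KEY`) is left to `sign_payment`.

```python
from dataclasses import dataclass
from typing import Any, Callable

BASE_URL = "https://x402.agentutility.ai"  # endpoint host from the steps above


@dataclass
class Response:
    status: int
    body: dict[str, Any]


# post(url, payload, headers) -> Response; injected so the sketch stays offline.
PostFn = Callable[[str, dict[str, Any], dict[str, str]], Response]
# sign_payment(payment_instructions) -> encoded payment string (hypothetical).
SignFn = Callable[[dict[str, Any]], str]


def call_paid_tool(
    tool: str,
    payload: dict[str, Any],
    post: PostFn,
    sign_payment: SignFn,
) -> dict[str, Any]:
    """POST to a tool endpoint, paying once if the server answers HTTP 402."""
    url = f"{BASE_URL}/{tool}"
    first = post(url, payload, {})
    if first.status == 200:
        return first.body  # no payment was requested
    if first.status != 402:
        raise RuntimeError(f"unexpected HTTP {first.status}")

    # The 402 body carries the payment instructions; sign them and retry once.
    payment = sign_payment(first.body)
    second = post(url, payload, {"X-PAYMENT": payment})  # header name is an assumption
    if second.status != 200:
        raise RuntimeError(f"payment retry failed with HTTP {second.status}")
    return second.body
```

Settlement by the facilitator happens server-side, so the sketch only covers the client half of the exchange.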
@@ -84,4 +122,4 @@ The agent never sees the payment flow — it just gets the result.
84
122
 
85
123
  ---
86
124
 
87
- **Version:** 0.1.8 · **License:** MIT
125
+ **Version:** 0.7.4 · **License:** MIT