npm - @metagptx/web-sdk - Versions diffs - 0.0.59-beta.2 → 0.0.59-beta.3 - Mend

@metagptx/web-sdk 0.0.59-beta.2 → 0.0.59-beta.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2)

package/README.md +80 -2
package/package.json +1 -1

package/README.md CHANGED

@@ -54,7 +54,7 @@ The SDK provides eight main modules and a Vite plugin:
 - **integrations**: Integration function invocations
 - **frame**: Frame communication operations for iframe/parent window messaging
 - **utils**: Utility functions for URL opening and window management
-- **ai**: AI-powered text and image generation
+- **ai**: AI-powered text, image, video, and audio generation
 - **storage**: Object storage operations (buckets, files, upload/download)
 - **vitePlugin404**: Vite plugin for automatically adding a 404 page to React Router applications
@@ -591,7 +591,7 @@ client.utils.openUrl('https://stripe.com/checkout'); // Navigates to URL when no
 ### AI Module
-Provides AI-powered text and image generation capabilities with support for streaming responses, multimodal inputs, and image editing.
+Provides AI-powered text, image, video, and audio generation capabilities with support for streaming responses, multimodal inputs, image editing, and text-to-speech.
 #### `ai.gentxt(params)`
@@ -720,6 +720,76 @@ const response = await client.ai.genimg({
 }, { timeout: 600_000 });
 ```
+#### `ai.genvideo(params, options?)`
+Generate videos using AI models. Supports text-to-video and image-to-video (using an image as the first frame). Video generation is asynchronous: the API polls internally until generation completes.
+**HTTP Details:**
+- **Method:** `POST`
+- **Path:** `/api/v1/aihub/genvideo`
+- **Parameters:**
+  - `prompt` (required): Text prompt describing the desired video
+  - `model` (required): Model identifier (e.g., `'wan2.6-t2v'` for text-to-video, `'wan2.6-i2v'` for image-to-video)
+  - `size` (optional): Video size (default: `"1280x720"`)
+  - `seconds` (optional): Video duration in seconds (default: `"8"`)
+  - `image` (optional): Base64 Data URI image as the first frame reference (for image-to-video)
+- **Options:**
+  - `timeout` (optional): Request timeout in milliseconds (default: 600000 ms / 10 minutes). Video generation is slow; if requests time out, consider a value above the default
+**Response:** `response.data.url` is the CDN URL of the generated video.
+**Example - Text-to-Video:**
+```typescript
+const video = await client.ai.genvideo(
+  { prompt: 'Ocean waves at sunset', model: 'wan2.6-t2v' },
+  { timeout: 600_000 }
+);
+const videoUrl = video.data.url;
+```
+**Example - Image-to-Video (use image as first frame):**
+```typescript
+const videoFromImage = await client.ai.genvideo(
+  { prompt: 'Animate the scene', model: 'wan2.6-i2v', image: 'data:image/png;base64,...' },
+  { timeout: 600_000 }
+);
+const videoUrl = videoFromImage.data.url;
+```
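The parameter defaults listed above can be made explicit before calling `ai.genvideo`. The following is a minimal sketch, not part of the SDK: `VideoParamsSketch` and `withVideoDefaults` are hypothetical names, and the field types are assumed from the README text, since the SDK's own `GenVideoParams` definition is not shown in this diff. Whether the server applies the same defaults when the fields are omitted is likewise taken from the documentation, not verified.

```typescript
// Hypothetical shape mirroring the parameters documented for ai.genvideo.
// This is NOT the SDK's GenVideoParams type, which this diff does not show.
interface VideoParamsSketch {
  prompt: string;
  model: string;
  size?: string;
  seconds?: string;
  image?: string; // Base64 Data URI used as the first frame (image-to-video)
}

// Defaults as stated in the README text.
const DOCUMENTED_VIDEO_DEFAULTS = { size: '1280x720', seconds: '8' };

// Returns a copy of `params` with the documented defaults filled in.
// Rejects an `image` that is not a Data URI, since the README asks for one.
function withVideoDefaults(params: VideoParamsSketch): VideoParamsSketch {
  if (params.image !== undefined && !params.image.startsWith('data:')) {
    throw new Error('image must be a Base64 Data URI');
  }
  return {
    ...params,
    size: params.size ?? DOCUMENTED_VIDEO_DEFAULTS.size,
    seconds: params.seconds ?? DOCUMENTED_VIDEO_DEFAULTS.seconds,
  };
}

const filled = withVideoDefaults({ prompt: 'Ocean waves at sunset', model: 'wan2.6-t2v' });
console.log(filled.size, filled.seconds); // prints "1280x720 8"
```

The result of `withVideoDefaults` could then be passed as the first argument to `client.ai.genvideo`, which keeps the effective size and duration visible at the call site.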
+#### `ai.genaudio(params, options?)`
+Generate audio (text-to-speech) using AI models. The voice is selected automatically from the model and gender, so no manual voice selection is needed.
+**HTTP Details:**
+- **Method:** `POST`
+- **Path:** `/api/v1/aihub/genaudio`
+- **Parameters:**
+  - `text` (required): Text content to convert to speech
+  - `model` (required): Model identifier (e.g., `'qwen3-tts-flash'`, `'eleven-v3-alpha'`)
+  - `gender` (optional): Voice gender — `"male"` or `"female"` (default: `"female"`)
+- **Options:**
+  - `timeout` (optional): Request timeout in milliseconds (default: 60000ms / 1 minute)
+**Response:** `response.data.url` is the CDN URL of the generated audio (mp3).
+**Example - Female voice (default):**
+```typescript
+const audio = await client.ai.genaudio(
+  { text: 'Welcome to our website', model: 'qwen3-tts-flash', gender: 'female' },
+  { timeout: 60_000 }
+);
+const audioUrl = audio.data.url;
+```
+**Example - Male voice:**
+```typescript
+const maleAudio = await client.ai.genaudio(
+  { text: 'Product introduction', model: 'eleven-v3-alpha', gender: 'male' },
+  { timeout: 60_000 }
+);
+const audioUrl = maleAudio.data.url;
+```
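Because `gender` accepts only two values and defaults to `"female"`, untyped input (for example from a form field) can be checked before it reaches `ai.genaudio`. This is a hedged sketch under stated assumptions: `Gender`, `AudioParamsSketch`, `isGender`, and `buildAudioParams` are hypothetical helper names, not SDK exports, and the SDK's real `GenAudioParams` type may differ from the shape assumed here.

```typescript
// Hypothetical types mirroring the parameters documented for ai.genaudio.
type Gender = 'male' | 'female';

interface AudioParamsSketch {
  text: string;
  model: string;
  gender: Gender;
}

// Type guard: narrows an arbitrary string to the two documented values.
function isGender(value: string): value is Gender {
  return value === 'male' || value === 'female';
}

// Builds a params object, applying the documented default ("female")
// and rejecting empty text or an unsupported gender value.
function buildAudioParams(text: string, model: string, gender?: string): AudioParamsSketch {
  if (text.trim() === '') {
    throw new Error('text is required');
  }
  const resolved = gender ?? 'female';
  if (!isGender(resolved)) {
    throw new Error('gender must be "male" or "female"');
  }
  return { text, model, gender: resolved };
}

const params = buildAudioParams('Welcome to our website', 'qwen3-tts-flash');
console.log(params.gender); // prints "female"
```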
 ---
 ### Storage Module
@@ -1150,12 +1220,16 @@ import type {
   CreateBucketParams,
   CreateBucketResponse,
   DownloadParams,
+  GenAudioParams,
+  GenAudioResponse,
   GenImgParams,
   GenImgResponse,
   GenTxtNonStreamParams,
   GenTxtParams,
   GenTxtResponse,
   GenTxtStreamParams,
+  GenVideoParams,
+  GenVideoResponse,
   GetDownloadUrlParams,
   GetDownloadUrlResponse,
   GetObjectInfoParams,
@@ -1196,6 +1270,10 @@ import type {
 - **`GenTxtResponse`**: Text generation response
 - **`GenImgParams`**: Image generation parameters
 - **`GenImgResponse`**: Image generation response
+- **`GenVideoParams`**: Video generation parameters
+- **`GenVideoResponse`**: Video generation response (CDN URL)
+- **`GenAudioParams`**: Audio generation (TTS) parameters
+- **`GenAudioResponse`**: Audio generation response (CDN URL)
 - **`StreamChunk`**: Chunk received during streaming
 - **`StreamResult`**: Complete streaming result
 - **`ImageContent`**: Image content for multimodal messages
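The README describes both `GenVideoResponse` and `GenAudioResponse` as carrying a CDN URL at `response.data.url`, but this diff does not include the type definitions themselves. As an illustration only, a caller could check that shape at runtime before using the URL. `UrlResponseSketch`, `hasCdnUrl`, and `extractUrl` are hypothetical names, and the example URL is a placeholder.

```typescript
// Assumed minimal shape shared by the video and audio responses,
// based solely on the README statement that the URL is at response.data.url.
interface UrlResponseSketch {
  data: { url: string };
}

// Runtime check that an unknown value has a non-empty string at data.url.
function hasCdnUrl(value: unknown): value is UrlResponseSketch {
  if (typeof value !== 'object' || value === null) return false;
  const data = (value as { data?: unknown }).data;
  if (typeof data !== 'object' || data === null) return false;
  const url = (data as { url?: unknown }).url;
  return typeof url === 'string' && url.length > 0;
}

// Returns the URL or throws if the response lacks the documented field.
function extractUrl(response: unknown): string {
  if (!hasCdnUrl(response)) {
    throw new Error('response does not contain data.url');
  }
  return response.data.url;
}

console.log(extractUrl({ data: { url: 'https://example.com/video.mp4' } }));
// prints "https://example.com/video.mp4"
```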

package/package.json CHANGED

@@ -1,7 +1,7 @@
 {
   "name": "@metagptx/web-sdk",
   "type": "module",
-  "version": "0.0.59-beta.2",
+  "version": "0.0.59-beta.3",
   "packageManager": "pnpm@10.15.0+sha512.486ebc259d3e999a4e8691ce03b5cac4a71cbeca39372a9b762cb500cfdf0873e2cb16abe3d951b1ee2cf012503f027b98b6584e4df22524e0c7450d9ec7aa7b",
   "description": "TypeScript SDK for interacting with FuncSea API",
   "author": "MetaGPTX",