@genspark/cli 1.0.7 → 1.0.9

Files changed (35)
  1. package/README.md +114 -5
  2. package/dist/index.js +180 -3
  3. package/dist/index.js.map +1 -1
  4. package/docs/skills.md +38 -0
  5. package/package.json +4 -3
  6. package/skills/gsk-aidrive/SKILL.md +53 -0
  7. package/skills/gsk-analyze-media/SKILL.md +42 -0
  8. package/skills/gsk-audio-generation/SKILL.md +52 -0
  9. package/skills/gsk-audio-transcribe/SKILL.md +42 -0
  10. package/skills/gsk-calendar-create/SKILL.md +42 -0
  11. package/skills/gsk-calendar-list/SKILL.md +36 -0
  12. package/skills/gsk-crawler/SKILL.md +39 -0
  13. package/skills/gsk-create-task/SKILL.md +42 -0
  14. package/skills/gsk-email-list/SKILL.md +38 -0
  15. package/skills/gsk-email-read/SKILL.md +34 -0
  16. package/skills/gsk-email-search/SKILL.md +39 -0
  17. package/skills/gsk-email-send/SKILL.md +41 -0
  18. package/skills/gsk-get-service-url/SKILL.md +39 -0
  19. package/skills/gsk-image-generation/SKILL.md +48 -0
  20. package/skills/gsk-image-search/SKILL.md +36 -0
  21. package/skills/gsk-meeting-get/SKILL.md +34 -0
  22. package/skills/gsk-meeting-list/SKILL.md +34 -0
  23. package/skills/gsk-meeting-search/SKILL.md +39 -0
  24. package/skills/gsk-phone-call/SKILL.md +41 -0
  25. package/skills/gsk-shared/SKILL.md +177 -0
  26. package/skills/gsk-social-instagram/SKILL.md +42 -0
  27. package/skills/gsk-social-reddit/SKILL.md +41 -0
  28. package/skills/gsk-social-twitter/SKILL.md +43 -0
  29. package/skills/gsk-stock-price/SKILL.md +35 -0
  30. package/skills/gsk-summarize-large-document/SKILL.md +41 -0
  31. package/skills/gsk-understand-images/SKILL.md +41 -0
  32. package/skills/gsk-video-generation/SKILL.md +53 -0
  33. package/skills/gsk-vm-email-send/SKILL.md +39 -0
  34. package/skills/gsk-web-search/SKILL.md +35 -0
  35. package/AVAILABLE_MODELS.md +0 -96
@@ -1,96 +0,0 @@
- # Available Models
-
- This document lists all models supported by the `gsk` CLI for image, video, and audio generation.
-
- ## Table of Contents
-
- - [Image Generation Models](#image-generation-models)
- - [Video Generation Models](#video-generation-models)
- - [Audio Generation Models](#audio-generation-models)
-
- ---
-
- ## Image Generation Models
-
- Use with `gsk img -m <model>`.
-
- | Model | Description |
- |-------|-------------|
- | `nano-banana-2` | Gemini 3.1 Flash Image - Fast and efficient with advanced reasoning. Multi-image fusion with up to 14 references. Supports 0.5K-4K resolution |
- | `fal-ai/gpt-image-1.5` | GPT Image 1.5 - Supports text-to-image and image editing with multi-image input |
- | `imagen4` | Latest high quality image generation model, upgrade from Imagen 3 |
- | `recraft-v3` | Realistic image generation model |
- | `fal-ai/bytedance/seedream/v5/lite` | Bytedance Seedream v5 Lite - Text-to-image and image editing with native 2K resolution and excellent text layout |
- | `fal-ai/flux-2` | Flux 2 - Text-to-image and image editing with enhanced realism and crisp text generation. Supports up to 3 images for edit mode |
- | `fal-ai/flux-2-pro` | Flux 2 Pro - Higher quality version of Flux 2 with professional-grade output |
- | `fal-ai/z-image/turbo` | Z-Image Turbo - Optimized for speed. Good for quick iterations, bulk generation, and style transfer |
- | `ideogram/V_3` | Ideogram V3 - Character reference specialist with superior facial feature preservation and character consistency |
- | `qwen-image` | Chinese poster specialist with outstanding Chinese text rendering and cultural context mastery |
- | `bbox-segment` | Extract subjects from images based on bounding box region |
- | `fal-bria-rmbg` | Remove background from image |
- | `fal-ai/recraft-clarity-upscale` | Upscale image |
- | `fal-ai/image-editing/text-removal` | Remove text and watermarks from images while preserving background |
- | `flux-pro/outpaint` | Expand image to a specific aspect ratio |
-
- ---
-
- ## Video Generation Models
-
- Use with `gsk video -m <model>`.
-
- | Model | Capabilities | Aspect Ratios | Duration | Notes |
- |-------|-------------|---------------|----------|-------|
- | `kling/v3` | Text/Image-to-video | 16:9, 9:16, 1:1 | 3-15s | Latest Kling V3 with audio. Pro/Standard quality modes |
- | `gemini/veo3.1` | Text/Image-to-video | 16:9, 9:16 | 8s | Latest Veo with enhanced quality. Supports fast_mode and hd_mode (1080p) |
- | `gemini/veo3.1/reference-to-video` | Reference-to-video | 16:9, 9:16 | 8s | Generate video using 1+ reference images. Supports fast_mode and hd_mode |
- | `gemini/veo3.1/first-last-frame-to-video` | Frame transition | 16:9, 9:16 | 8s | Precise transitions from first to last frame. Requires exactly 2 images |
- | `minimax/hailuo-2.3/standard` | Text/Image-to-video | 16:9, 9:16 | 6s, 10s | Fast (~4min), cost-effective. Supports first & last frame control |
- | `wan/v2.6` | Text/Image/Video-to-video | 16:9, 9:16, 1:1, 4:3, 3:4 | 5s, 10s, 15s | 1080p with audio. Supports reference-to-video with 1-3 reference videos |
- | `vidu/q3` | Text/Image-to-video | 16:9, 9:16, 4:3, 3:4, 1:1 | 1-16s | Enhanced quality with audio generation. Resolution: 720p, 1080p |
- | `runway/gen4_turbo` | Image-to-video | 5:3, 3:5 | 5s, 10s | Fast, high quality. Requires reference image |
- | `pixverse/v5` | Text/Image-to-video | 16:9, 9:16, 4:3, 1:1, 3:4 | 5s | Fast (~30s). Supports start/end frame transitions |
- | `fal-ai/bytedance/seedance/v1.5/pro` | Text/Image-to-video | 21:9, 16:9, 4:3, 1:1, 3:4, 9:16 | 4-12s | Seedance v1.5 Pro with native audio support. Supports first & last frame control |
- | `sora-2` | Text/Image/Video-to-video | 16:9, 9:16 | 4s, 8s, 12s | OpenAI Sora 2 for fast, creative videos. Supports video remixing |
- | `sora-2-pro` | Text/Image-to-video | 16:9, 9:16 | 4s, 8s | Sora 2 Pro - Higher fidelity, cinematic quality. 720p and 1080p |
- | `fal-ai/bytedance-upscaler/upscale/video` | Video upscaling | — | — | Upscale existing videos to 2K. Requires video_url parameter |
- | `xai/grok-imagine-video` | Text/Image-to-video | 16:9, 4:3, 1:1, 3:4, 9:16, 21:9, 9:21 | 1-15s | xAI Grok Imagine Video. 720p HD output |
-
- ---
-
- ## Audio Generation Models
-
- Use with `gsk audio -m <model>`.
-
- ### Text-to-Speech (TTS)
-
- | Model | Description |
- |-------|-------------|
- | `google/gemini-2.5-pro-preview-tts` | Best, high-quality, realistic TTS. Supports one or multiple speakers with speaker prefixes (e.g., `Speaker1: text, Speaker2: text`) |
- | `elevenlabs/v3-tts` | Advanced multilingual TTS with multi-speaker dialogue support. Supports emotional tags like `[excited]`, `[whispers]`, `[laughs]` |
- | `fal-ai/elevenlabs/tts/multilingual-v2` | High-quality multilingual TTS. Preferred for English |
- | `fal-ai/minimax/speech-2.8-hd` | High-quality multilingual TTS. Preferred for Chinese, Cantonese, Japanese, Korean. One speaker per generation |
-
- ### Sound Effects
-
- | Model | Description |
- |-------|-------------|
- | `elevenlabs/sound-effects` | Sound effect generation. Duration: 0.1-22 seconds |
-
- ### Music Generation
-
- | Model | Description |
- |-------|-------------|
- | `elevenlabs/music` | ElevenLabs music generation with vocals/singing. Lyrics auto-generated (no custom lyrics). Duration: 10s-5min |
- | `CassetteAI/music-generator` | Background music generation. Duration: 10-180 seconds |
- | `mureka/song-generator` | Professional song generation with lyrics. Supports style prompts, reference tracks, vocal and melody inputs. Max: 180s |
- | `mureka/instrumental-generator` | Instrumental music generation without vocals. Supports style prompts and reference tracks. Max: 180s |
- | `fal-ai/lyria2` | Google Lyria 2 text-to-music. Good for sound effects and lyrics-free music. Max: 30 seconds |
- | `fal-ai/minimax-music/v2.5` | Song generation with lyrics using MiniMax Music 2.5. Supports markers (Verse), (Chorus), (Bridge), etc. Requires style prompt and lyrics |
-
- ### Voice Cloning & Transformation
-
- | Model | Description |
- |-------|-------------|
- | `elevenlabs/voice-clone` | Clone a voice from audio samples. Returns voice ID for use in TTS generation |
- | `elevenlabs/voice-changer` | Transform audio from one voice to another. Requires source audio and target voice ID |
- | `fal-ai/minimax/voice-clone` | Clone a voice from a sample audio and generate speech from text prompts (gated feature) |