npm - @optima-chat/comfy-cli - Versions diffs - 0.9.7 → 0.9.8 - Mend

@optima-chat/comfy-cli 0.9.7 → 0.9.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/.claude/skills/comfy-cli/SKILL.md +39 -1
package/package.json +1 -1

package/.claude/skills/comfy-cli/SKILL.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 name: comfy-cli
-description: "ComfyUI CLI tool for AI agents. ALWAYS use when user wants to: generate images (生成图片/画图/图像/生成画), edit images (编辑图片/修改图片/图生图/改图), create videos (生成视频/图生视频/制作视频), manage ComfyUI workflows (工作流/ComfyUI). Uses 'comfy image', 'comfy generate', 'comfy edit', 'comfy video' commands."
+description: "ComfyUI CLI tool for AI agents. ALWAYS use when user wants to: generate images (生成图片/画图/图像/生成画), edit images (编辑图片/修改图片/图生图/改图), create videos (生成视频/图生视频/制作视频), text-to-speech (TTS/语音合成/朗读/文字转语音), speech recognition (ASR/语音识别/语音转文字/转录), manage ComfyUI workflows (工作流/ComfyUI). Uses 'comfy image', 'comfy generate', 'comfy edit', 'comfy video', 'comfy tts', 'comfy asr' commands."
 ---
 # ComfyUI CLI
@@ -112,6 +112,23 @@ comfy edit <图像路径> "提示词" [--no-wait] [--pretty]
 - 等同于 `comfy image "提示词" -i <图像路径>`
 - 支持风格转换、细节增强等
+**文本转语音 (TTS)：**
+```bash
+comfy tts "文本内容" [-o 输出路径] [--voice Cherry] [--play] [--pretty]
+```
+- 使用 DashScope qwen3-tts-flash 模型
+- 支持 50+ 种声音（女声：Cherry, Serena, Chelsie 等；男声：Ethan, Aiden, Brandon 等）
+- 自动检测语言（中、英、日、韩、法、德等）
+- `--voices` 列出所有可用声音
+**语音识别 (ASR)：**
+```bash
+comfy asr <音频文件> [--language zh|en|ja|ko] [--pretty]
+```
+- 使用 Groq Whisper (whisper-large-v3-turbo)
+- 支持 mp3, wav, m4a, ogg, webm 格式
+- 文件大小限制 25MB
 **生成视频：**
 ```bash
 comfy video <图像路径> [-p "运动描述"] [-b auto|dashscope|comfyui] [-r 720P|1080P] [-d 5|10|15]
@@ -244,6 +261,27 @@ comfy workflow get abc123
 comfy download abc123
 ```
+### 示例 10：文本转语音
+```bash
+# 中文语音合成
+comfy tts "你好，欢迎使用语音合成功能" -o greeting.wav
+# 使用男声
+comfy tts "Hello, welcome!" --voice Ethan -o hello.wav
+# 生成后自动播放
+comfy tts "测试语音" --play
+```
+### 示例 11：语音识别
+```bash
+# 自动检测语言
+comfy asr recording.mp3
+# 指定语言提示
+comfy asr meeting.wav --language zh
+```
 ## 重要提示
 - **推荐使用 `comfy image`**：统一入口，自动选择最佳后端

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@optima-chat/comfy-cli",
-  "version": "0.9.7",
+  "version": "0.9.8",
   "description": "A CLI tool for ComfyUI designed for LLM interactions",
   "type": "module",
   "main": "dist/index.js",