npm - yuanflow-cli - Versions diffs - 0.1.12 → 0.1.14 - Mend

yuanflow-cli 0.1.12 → 0.1.14

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "yuanflow-cli",
-  "version": "0.1.12",
+  "version": "0.1.14",
   "description": "YuanFlow API CLI and skill installer for supported AI coding agents.",
   "type": "module",
   "license": "MIT",

package/skills/yuanflow-skill/本地音视频转文字/SKILL.md CHANGED

@@ -25,12 +25,19 @@ description: Only when the user explicitly asks to use local audio/video-to-text, local trans
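 This Skill's script directory is `scripts/` under the current Skill directory.
-Default directories:
+Default runtime directory:
-- Model directory: `scripts/models`
-- Task cache directory: `scripts/cache`
-- Extracted audio directory: `scripts/cache/audio`
-- Transcript directory: `scripts/cache/transcripts`
+- Windows: `%APPDATA%/YuanFlow/runtime_tools/local-transcribe`
+- Other systems: `~/.yuanflow/runtime_tools/local-transcribe`
+Default subdirectories:
+- Model directory: `<runtime directory>/models`
+- Task cache directory: `<runtime directory>/cache`
+- Extracted audio directory: `<runtime directory>/cache/audio`
+- Transcript directory: `<runtime directory>/cache/transcripts`
+The virtual environment is still created in the current Skill's `scripts/.venv`; do not install dependencies globally.
 In YuanFlow's built-in environment, the `config.managed_skill_dir` returned by `skill_read` is the real directory of the current Skill. When running scripts, use this directory as the base where possible: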
@@ -38,14 +45,14 @@ description: Only when the user explicitly asks to use local audio/video-to-text, local trans
 cd "<config.managed_skill_dir>\scripts"
 ```
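-Do not download models to the user's desktop, the project root, or the system temp directory. Do not bundle model files into the Skill or the npm package.
+Do not download models to the user's desktop, the project root, or the system temp directory. Do not bundle model files into the Skill or the npm package. On Windows, FunASR/SentencePiece is unreliable with paths that contain Chinese characters, so do not force models into a Skill installation directory whose path contains Chinese characters.
 ## Model download rules for first use
 Before transcribing, check whether the model directories already exist:
-- `scripts/models/SenseVoiceSmall`
-- `scripts/models/fsmn-vad`
+- `<runtime directory>/models/SenseVoiceSmall`
+- `<runtime directory>/models/fsmn-vad`
 If both directories exist and are not empty, proceed directly to the subsequent steps.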
@@ -54,7 +61,7 @@ cd "<config.managed_skill_dir>\scripts"
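 - SenseVoice: `iic/SenseVoiceSmall`
 - VAD: `iic/speech_fsmn_vad_zh-cn-16k-common-pytorch`
-The download is performed by `modelscope.snapshot_download()` and saved to `scripts/models`. Transcription continues once the download completes.
+The download is performed by `modelscope.snapshot_download()` and saved to `<runtime directory>/models`. Transcription continues once the download completes.
 ## Execution flow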
@@ -71,15 +78,23 @@ cd "<config.managed_skill_dir>\scripts"
 Create a virtual environment in the `scripts/` directory and install the dependencies:
 ```powershell
-python -m venv .venv
+py -3.10 -m venv .venv
 .\.venv\Scripts\python.exe -m pip install -r requirements-transcribe.txt
 ```
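+Prefer Python 3.10, then 3.11/3.12. Do not pick Python 3.13 or 3.14 first, because some packages in the FunASR dependency chain may need to be compiled locally on those versions, which tends to produce an `editdistance` "wheel build failed" error. On Windows, run `py -0p` first to list the available versions.
+`requirements-transcribe.txt` must include `funasr`, `modelscope`, `torch`, and `torchaudio`. If importing `funasr` reports `No module named 'torch'` or `No module named 'torchaudio'`, install the missing dependencies first, then rerun the transcription.
 Converting video to audio requires `ffmpeg` to be available on the machine. If the user supplies a video and the system cannot find `ffmpeg`, report clearly that ffmpeg is missing; do not fabricate a transcript.
 ## Recommended invocation
-Unified entry script:
+Unified entry script.
+When running inside YuanFlow's `execute_shell_command`, the command first passes through Windows `cmd.exe`. If the Skill directory or the input file path contains Chinese characters, prefer PowerShell `-EncodedCommand`, so that the Chinese path is not escaped into `\uXXXX`, which would make the directory impossible to find.
+In a regular PowerShell session you can run it directly: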
 ```powershell
 cd "<Skill directory>\scripts"
@@ -93,13 +108,27 @@ cd "<Skill directory>\scripts"
 .\.venv\Scripts\python.exe .\transcribe_media.py "C:\path\to\input.mp4"
 ```
+For YuanFlow Agent tool calls, the recommended approach is to generate a UTF-16LE base64 string and execute that:
+```powershell
+$script = "Set-Location -LiteralPath '<Skill directory>\scripts'; .\.venv\Scripts\python.exe .\transcribe_media.py 'C:\path\to\input.mp3'"
+$encoded = [Convert]::ToBase64String([Text.Encoding]::Unicode.GetBytes($script))
+powershell -NoProfile -ExecutionPolicy Bypass -EncodedCommand $encoded
+```
+If the virtual environment has not been created yet, change `$script` to:
+```powershell
+Set-Location -LiteralPath '<Skill directory>\scripts'; if (-not (Test-Path -LiteralPath '.\.venv\Scripts\python.exe')) { py -3.10 -m venv .venv }; .\.venv\Scripts\python.exe -m pip install -r requirements-transcribe.txt; .\.venv\Scripts\python.exe .\transcribe_media.py 'C:\path\to\input.mp3'
+```
 Common parameters:
 | Parameter | Description |
 | --- | --- |
 | `input_path` | Audio file, video file, or directory. |
-| `--cache-root` | Cache directory, default `scripts/cache`. |
-| `--models-root` | Model directory, default `scripts/models`. |
+| `--cache-root` | Cache directory, default `<runtime directory>/cache`. |
+| `--models-root` | Model directory, default `<runtime directory>/models`. |
 | `--recursive` | Scan recursively when the input is a directory. |
 | `--device` | `auto`, `cpu`, `cuda:0`, etc.; default `auto`. |
 | `--language` | `zh`, `en`, `yue`, `ja`, `ko`, `auto`; default `auto`. |
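@@ -130,10 +159,10 @@ cd "<Skill directory>\scripts"
 Delete the following only when the user explicitly asks to delete cache or model files:
-- Cache directory: `scripts/cache`
-- Model directory: `scripts/models`
+- Cache directory: `<runtime directory>/cache`
+- Model directory: `<runtime directory>/models`
-Before deleting, you must confirm that the target path is under the current Skill's `scripts/` directory; do not delete other project directories, the user's desktop directory, or system directories.
+Before deleting, you must confirm that the target path is under the YuanFlow local-transcription runtime directory; do not delete other project directories, the user's desktop directory, other Skills, or system directories.
 ## Output requirements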
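
The `-EncodedCommand` step added in this SKILL.md diff can also be reproduced outside PowerShell. The following is a minimal Python sketch, assuming the caller wants to build the same UTF-16LE base64 payload before handing it to `powershell`. The helper names `ps_quote`, `build_script`, `encode_powershell_command`, and `build_argv` are hypothetical and do not appear in the package.

```python
import base64


def ps_quote(value: str) -> str:
    """Wrap a value in PowerShell single quotes, doubling embedded quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_script(scripts_dir: str, input_path: str) -> str:
    """Assemble the same command line the SKILL.md places in $script (hypothetical helper)."""
    return (
        f"Set-Location -LiteralPath {ps_quote(scripts_dir)}; "
        f".\\.venv\\Scripts\\python.exe .\\transcribe_media.py {ps_quote(input_path)}"
    )


def encode_powershell_command(script: str) -> str:
    """Base64-encode the UTF-16LE bytes, which is what -EncodedCommand expects."""
    return base64.b64encode(script.encode("utf-16-le")).decode("ascii")


def build_argv(script: str) -> list[str]:
    """Argument list mirroring the powershell invocation shown in the diff."""
    return [
        "powershell",
        "-NoProfile",
        "-ExecutionPolicy",
        "Bypass",
        "-EncodedCommand",
        encode_powershell_command(script),
    ]
```

Because the payload is plain ASCII base64, non-ASCII characters in the path never reach `cmd.exe` in raw form, which is the failure mode the SKILL.md text describes.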

package/skills/yuanflow-skill/本地音视频转文字/scripts/common/utils.py CHANGED

@@ -1,6 +1,7 @@
 from __future__ import annotations
 import json
+import os
 import re
 from pathlib import Path
 from typing import Iterable
@@ -31,6 +32,13 @@ def ensure_dir(path: Path) -> Path:
     return path
+def default_runtime_root() -> Path:
+    appdata = os.environ.get("APPDATA")
+    if appdata:
+        return Path(appdata) / "YuanFlow" / "runtime_tools" / "local-transcribe"
+    return Path.home() / ".yuanflow" / "runtime_tools" / "local-transcribe"
 def write_json(path: Path, data: object) -> Path:
     ensure_parent(path)
     path.write_text(json.dumps(data, ensure_ascii=False, indent=2), encoding="utf-8")
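
The SKILL.md deletion rule (only delete inside the local-transcription runtime directory) pairs naturally with the `default_runtime_root()` helper added above. The following is a rough sketch of such a guard, assuming deletions are limited to the `cache` and `models` subdirectories. The function `is_safe_to_delete` is hypothetical and not part of the package.

```python
from pathlib import Path


def is_safe_to_delete(target: Path, runtime_root: Path) -> bool:
    """Return True only if target resolves to cache/ or models/ (or below) under runtime_root.

    Hypothetical guard: resolving both paths first collapses '..' segments and
    symlinks, so a path cannot escape the runtime directory by traversal.
    """
    resolved_target = target.resolve()
    resolved_root = runtime_root.resolve()
    allowed_bases = (resolved_root / "cache", resolved_root / "models")
    # Path.is_relative_to (Python 3.9+) compares whole path components,
    # so "cache-old" is not treated as being inside "cache".
    return any(resolved_target.is_relative_to(base) for base in allowed_bases)
```

A caller would pass the result of `default_runtime_root()` as `runtime_root` and refuse to delete whenever the function returns `False`.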

package/skills/yuanflow-skill/本地音视频转文字/scripts/requirements-transcribe.txt CHANGED

@@ -1,2 +1,4 @@
 funasr>=1.1.6
 modelscope>=1.18.1
+torch>=2.0
+torchaudio>=2.0
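
Since this version adds `torch` and `torchaudio` to the requirements, a quick pre-flight check can report missing packages before `funasr` is imported. The following is a minimal sketch using only the standard library. The names `REQUIRED_MODULES` and `missing_modules` are illustrative assumptions, not part of the package.

```python
import importlib.util

# Top-level module names corresponding to requirements-transcribe.txt.
REQUIRED_MODULES = ("funasr", "modelscope", "torch", "torchaudio")


def missing_modules(names: tuple[str, ...] = REQUIRED_MODULES) -> list[str]:
    """Return the modules that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Running this with the `.venv` interpreter and reporting a non-empty list avoids the `No module named 'torch'` failure surfacing midway through a transcription run.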

package/skills/yuanflow-skill/本地音视频转文字/scripts/transcribe_media.py CHANGED

@@ -6,6 +6,7 @@ from pathlib import Path
 from common.media import extract_audio
 from common.sensevoice import build_model
 from common.sensevoice import clean_transcript
+from common.utils import default_runtime_root
 from common.utils import ensure_dir
 from common.utils import is_audio_file
 from common.utils import is_video_file
@@ -15,8 +16,9 @@ from common.utils import write_text
 SCRIPT_DIR = Path(__file__).resolve().parent
-DEFAULT_CACHE_ROOT = SCRIPT_DIR / "cache"
-DEFAULT_MODELS_ROOT = SCRIPT_DIR / "models"
+DEFAULT_RUNTIME_ROOT = default_runtime_root()
+DEFAULT_CACHE_ROOT = DEFAULT_RUNTIME_ROOT / "cache"
+DEFAULT_MODELS_ROOT = DEFAULT_RUNTIME_ROOT / "models"
 def prepare_audio(
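
With `DEFAULT_MODELS_ROOT` now pointing at the runtime directory, the first-use rule from SKILL.md (both model directories exist and are not empty) could be checked along these lines. This is a sketch under that assumption. `MODEL_DIR_NAMES`, `is_nonempty_dir`, and `models_ready` are hypothetical names, and the actual script may perform the check differently.

```python
from pathlib import Path

# Directory names taken from the SKILL.md first-use rule.
MODEL_DIR_NAMES = ("SenseVoiceSmall", "fsmn-vad")


def is_nonempty_dir(path: Path) -> bool:
    """True if path is a directory containing at least one entry."""
    return path.is_dir() and any(path.iterdir())


def models_ready(models_root: Path) -> bool:
    """True only when every expected model directory exists and is not empty."""
    return all(is_nonempty_dir(models_root / name) for name in MODEL_DIR_NAMES)
```

If `models_ready(DEFAULT_MODELS_ROOT)` is false, the SKILL.md flow calls for downloading the models before transcription starts.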