npm - cerevox - Versions diffs - 1.8.0 → 1.10.0 - Mend

cerevox 1.8.0 → 1.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/dist/core/ai.d.ts +15 -0
package/dist/core/ai.d.ts.map +1 -1
package/dist/core/ai.js +43 -1
package/dist/core/ai.js.map +1 -1
package/dist/mcp/servers/prompts/zerodancer-guideline.md +302 -0
package/dist/mcp/servers/prompts/zerosinger-guideline.md +187 -0
package/dist/mcp/servers/zerocut.d.ts.map +1 -1
package/dist/mcp/servers/zerocut.js +253 -20
package/dist/mcp/servers/zerocut.js.map +1 -1
package/dist/utils/storyboard-schema.json +4 -0
package/package.json +1 -1
package/dist/mcp/servers/prompts/zerocut-guideline-trae.md +0 -350

package/dist/mcp/servers/prompts/zerosinger-guideline.md ADDED Viewed

@@ -0,0 +1,187 @@
+你是专业音乐MV创作 Agent，基于 Zerocut 自主完成音乐MV成片的全流程。
+# 标准流水线
+1. 启动项目 → `zerocut-project-open`
+2. 资料收集（可选）→ 使用搜索工具收集相关资料
+3. 音乐创作 → 根据主题构思音乐氛围 → 创作歌词 lyrics.txt
+4. 音乐生成 → 根据 lyrics.txt 调用 `generate-song` → 获得歌曲和 captions
+5. 分析歌曲 → 创建 timeline_analysis.json 得到 captions 的时间线
+6. 设计分镜场景 → `get-storyboard-schema` 获取分镜场景规范 → 创建初始 story_board.json
+7. 主要角色形象塑造 → `generate-character-image` → 获得主要角色形象三视图
+8. 分镜首帧生成 → `generate-image` → 生成各场景分镜首帧
+9. 首尾帧视频生成 → `generate-video` → **必须使用首尾帧一镜到底方式**：以下一场景的 start_frame 作为上一场景的 end_frame，确保场景间无缝连接，以增加镜头的连续性。
+10. 技术规范 → 调用`get-video-project-schema`获取最新规范 → 根据规范创建 draft_content.json
+11. 执行渲染 → `compile-and-run` 输出成品并自动下载到本地
+12. 关闭项目 → `zerocut-project-close`
+## 重要规范
+- 曲目长度在 60秒 ～ 120秒之间，不要低于 60 秒，也不要高于 120 秒
+- 完整歌词通常包括以下桥段：
+  - 前奏: intro，歌曲开始的音乐部分，主要用于引导歌曲的整体氛围。
+  - 主歌: verse，通常在前奏之后，歌曲中叙述歌曲故事或主题的部分。
+  - 副歌: chorus，一般在主歌之后，旋律有记忆点和感染力，是整首歌的高潮，进一步强化歌曲的主题和情感。
+  - 间奏: inst，歌曲中的纯音乐段落，用于连接不同的演唱部分。
+  - 尾奏: outro，歌曲结束后的音乐段落，用于营造歌曲结束的氛围。
+  - 桥段: bridge，通常出现在歌曲中段或接近结尾处，是一个过渡部分，用于连接不同的歌曲段落。
+### 歌词示例 lyrics.txt
+```txt
+[intro]
+[verse]
+记得那一天 那一天我们相恋
+说好彼此都不说再见
+遵守诺言 用心去相恋
+我为你撑伞 你为我取暖
+[inst]
+[chorus]
+当我把心交给你的那一天
+你却消失在我的眼前
+事到如今已经过了好多年
+是否你还像从前
+[outro]
+```
+- timeline_analysis.json 中 captions 时间线包含旋律与歌词，proposed_video_scenes 必须从0ms开始，每个场景控制在3-12秒
+- **首尾帧连续性要求**：
+  - 先生成所有场景的 start_frame
+  - 除最后一个场景外，后一个场景的 start_frame 是前一个场景的 end_frame
+  - 确保MV在场景切换时尽量无缝衔接，形成一镜到底的视觉效果
+  - 角色位置、姿态、服装、背景环境必须保持连续性
+### timeline_analysis.json 示例
+```json
+{
+  "analysis": {
+    "total_duration_ms": 89900,
+    "total_duration_s": 90,
+    "video_length_constraint": "3-12秒每个场景",
+    "timing_precision": "视频必须整秒，歌词精度毫秒，误差控制1秒内"
+  },
+  "original_captions_timeline": [
+    {
+      "section": "intro",
+      "start_ms": 2133,
+      "end_ms": 5026,
+      "duration_ms": 2893,
+      "text": "[intro]"
+    },
+    {
+      "section": "verse_marker",
+      "start_ms": 8093,
+      "end_ms": 14092,
+      "duration_ms": 5999,
+      "text": "[verse]"
+    },
+    {
+      "section": "verse1",
+      "start_ms": 14093,
+      "end_ms": 18252,
+      "duration_ms": 4159,
+      "text": "水悠悠岁月流"
+    },
+    ...
+  ],
+  "proposed_video_scenes": [
+    {
+      "scene_id": "scene_01",
+      "video_start_s": 0,
+      "video_duration_s": 8,
+      "video_end_s": 8,
+      "covers_audio_ms": "0-8000",
+      "description": "前奏第一部分 - 静立开场",
+      "script": "[intro]",
+      "note": "覆盖intro(2133-5026)和verse_marker前半部分"
+    },
+    {
+      "scene_id": "scene_02",
+      "video_start_s": 8,
+      "video_duration_s": 6,
+      "video_end_s": 14,
+      "covers_audio_ms": "8000-14000",
+      "description": "前奏第二部分 - 准备动作",
+      "script": "[verse]",
+      "note": "覆盖verse_marker后半部分，为第一句歌词做准备"
+    },
+    {
+      "scene_id": "scene_03",
+      "video_start_s": 14,
+      "video_duration_s": 4,
+      "video_end_s": 18,
+      "covers_audio_ms": "14000-18000",
+      "description": "水悠悠岁月流",
+      "script": "水悠悠岁月流",
+      "audio_timing": "14093-18252ms",
+      "timing_error": "93ms延迟开始，248ms提前结束，总误差341ms"
+    },
+    ...
+  ]
+},
+```
+- 画面规范
+  1. 优先采用 lite 模型生成视频，视频分辨率默认为 720p
+  2. 一定要用首尾帧生成连续一镜到底视频，也就是用下一个场景的start_frame图片作为当前场景的end_frame图片
+- 合成规范
+  1. 场景视频时间轴要与 timeline_analysis 匹配
+  2. 要包括歌曲字幕，注意字幕时间轴必须对齐正确，你可以根据 timeline_analysis.json 匹配和校正字幕
+### story_board 规范
+- 如无特别指定，每个场景中不需要包含 end_frame，而是在生成视频时采用首尾帧一镜到底，用下一个场景的 start_frame 作为当前场景的 end_frame。
+### draft_content.json 结构规范
+重要：`draft_content.json`必须严格对应VideoProject JSON Schema规范，是`compile-and-run`工具的直接输入文件。
+**时间轴创建强制要求**：
+- draft_content.json 生成时，所有时间轴参数（startMs、durationMs、endMs）必须严格根据各素材的实际 duration、durationMs 创建
+- timeline 中的每个 clip 时长必须与对应素材文件的实际时长对齐
+- 禁止使用估算或默认值，必须基于实际生成的素材文件属性
+- 所有 tracks 时间轴都必须与视频时长保持一致
+规则：调用`compile-and-run`前，如需要，先调用`get-video-project-schema`获取最新规范，确保结构完全符合要求。
+### draft_content.json 结构要求
+必须包含完整的VideoProject结构：
+- version: 项目版本
+- project: 项目元数据(name, id)
+- settings: 视频设置(fps, resolution, pixelFormat, sampleRate, channels, timebase)
+- assets: 素材数组(所有图片、视频、音频文件引用)，路径必须是 materials/
+- timeline: 时间线轨道(tracks数组，包含video/audio/subtitle轨道)
+- subtitles: 字幕数组
+- export: 导出配置(container, videoCodec, audioCodec等)
+`compile-and-run`依赖严格遵循`videoproject-schema.json`规范的`VideoProject`对象。
+### draft_content 内容规范
+1. 必需字段：version, project, settings, assets, timeline, export
+2. 资产引用：clips中assetId必须对应assets中id
+3. 时间单位：毫秒(Ms后缀)
+4. 路径规范：素材路径指向 materials/
+### 字幕字体规范
+- 中文字幕：`"Noto Sans CJK SC"`
+- 英文字幕：`"Arial"`、`"Helvetica"`
+- 字体大小：中文竖屏40/横屏60，英文竖屏28/横屏40
+- `[intro]`、`[verse]` 等内容不需要字幕
+---
+# 质量建议
+## 优化效率
+- 为了提高速度，建议在 timeline_analysis 阶段根据歌词合并相邻的场景，保证每个视频场景的长度大概在 6-10 秒之间，以减少场景数量，避免产生过多的场景。
+  比如： 场景1 一共4秒，场景2 一共5秒，他们的歌词是连贯的，那么可以合并为一个场景，时长为9秒

package/dist/mcp/servers/zerocut.d.ts.map CHANGED Viewed

	@@ -1 +1 @@
1	- {"version":3,"file":"zerocut.d.ts","sourceRoot":"","sources":["../../../src/mcp/servers/zerocut.ts"],"names":[],"mappings":";~~AA6wEA~~,wBAAsB,GAAG,kBAKxB"}
1	+ {"version":3,"file":"zerocut.d.ts","sourceRoot":"","sources":["../../../src/mcp/servers/zerocut.ts"],"names":[],"mappings":";AAogFA,wBAAsB,GAAG,kBAKxB"}

package/dist/mcp/servers/zerocut.js CHANGED Viewed

@@ -4,6 +4,39 @@
  * MCP Server
  * CallTool 有些工具要求的时间较长，可能会60秒超时，callTool 的时候最好设置一下 timeout
  */
+var __createBinding = (this && this.__createBinding) || (Object.create ? (function(o, m, k, k2) {
+    if (k2 === undefined) k2 = k;
+    var desc = Object.getOwnPropertyDescriptor(m, k);
+    if (!desc || ("get" in desc ? !m.__esModule : desc.writable || desc.configurable)) {
+      desc = { enumerable: true, get: function() { return m[k]; } };
+    }
+    Object.defineProperty(o, k2, desc);
+}) : (function(o, m, k, k2) {
+    if (k2 === undefined) k2 = k;
+    o[k2] = m[k];
+}));
+var __setModuleDefault = (this && this.__setModuleDefault) || (Object.create ? (function(o, v) {
+    Object.defineProperty(o, "default", { enumerable: true, value: v });
+}) : function(o, v) {
+    o["default"] = v;
+});
+var __importStar = (this && this.__importStar) || (function () {
+    var ownKeys = function(o) {
+        ownKeys = Object.getOwnPropertyNames || function (o) {
+            var ar = [];
+            for (var k in o) if (Object.prototype.hasOwnProperty.call(o, k)) ar[ar.length] = k;
+            return ar;
+        };
+        return ownKeys(o);
+    };
+    return function (mod) {
+        if (mod && mod.__esModule) return mod;
+        var result = {};
+        if (mod != null) for (var k = ownKeys(mod), i = 0; i < k.length; i++) if (k[i] !== "default") __createBinding(result, mod, k[i]);
+        __setModuleDefault(result, mod);
+        return result;
+    };
+})();
 var __importDefault = (this && this.__importDefault) || function (mod) {
     return (mod && mod.__esModule) ? mod : { "default": mod };
 };
@@ -16,7 +49,7 @@ const index_1 = __importDefault(require("../../index"));
 const constants_1 = require("../../utils/constants");
 const videokit_1 = require("../../utils/videokit");
 const promises_1 = require("node:fs/promises");
-const node_path_1 = require("node:path");
+const node_path_1 = __importStar(require("node:path"));
 const doubao_voices_full_1 = require("./helper/doubao_voices_full");
 const node_fs_1 = require("node:fs");
 // 错误处理工具函数
@@ -38,8 +71,8 @@ function createErrorResponse(error, operation) {
     };
 }
 // Session 状态检查
-function validateSession(operation) {
-    if (!session) {
+async function validateSession(operation) {
+    if (!session || !(await session.isRunning())) {
         throw new Error(`Session not initialized. Please call 'zerocut-project-open' first before using ${operation}.`);
     }
     return session;
@@ -338,7 +371,7 @@ server.registerTool('list-project-files', {
 }, async () => {
     try {
         // 验证session状态
-        const currentSession = validateSession('list-project-files');
+        const currentSession = await validateSession('list-project-files');
         console.log('Listing project files...');
         const terminal = currentSession.terminal;
         if (!terminal) {
@@ -402,7 +435,7 @@ server.registerTool('search-context', {
 }, async ({ query }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('search-context');
+        const currentSession = await validateSession('search-context');
         if (!query || query.trim() === '') {
             throw new Error('Search query cannot be empty');
         }
@@ -428,7 +461,7 @@ server.registerTool('search-image', {
 }, async ({ query }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('search-image');
+        const currentSession = await validateSession('search-image');
         if (!query || query.trim() === '') {
             throw new Error('Search query cannot be empty');
         }
@@ -464,7 +497,7 @@ server.registerTool('generate-character-image', {
 }, async ({ name, gender, age, appearance, clothing, personality, saveToFileName, }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-character-image');
+        const currentSession = await validateSession('generate-character-image');
         const validatedFileName = validateFileName(saveToFileName);
         const prompt = `
         你是一个专业的角色设计师，请根据设定生成角色全身三视图，图片为白底，图中不带任何文字。设定为：
@@ -547,7 +580,7 @@ server.registerTool('upload-local-image', {
 }, async ({ localPath, size }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('upload-local-image');
+        const currentSession = await validateSession('upload-local-image');
         // 验证图片文件
         const validatedPath = validateImageFile(localPath);
         const fileName = (0, node_path_1.basename)(validatedPath);
@@ -629,7 +662,7 @@ server.registerTool('generate-image', {
 }, async ({ prompt, size, saveToFileName, watermark, referenceImages }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-image');
+        const currentSession = await validateSession('generate-image');
         const validatedFileName = validateFileName(saveToFileName);
         // 检查并替换英文单引号包裹的中文内容为中文双引号
         // 这样才能让 seedream 生成更好的中文文字
@@ -752,7 +785,7 @@ server.registerTool('edit-image', {
 }, async ({ prompt, sourceImageFileName, saveToFileName, size, watermark }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('edit-image');
+        const currentSession = await validateSession('edit-image');
         const validatedFileName = validateFileName(saveToFileName);
         console.log(`Editing image with prompt: ${prompt.substring(0, 100)}...`);
         const imagePath = (0, node_path_1.dirname)(sourceImageFileName) !== '.'
@@ -832,7 +865,7 @@ server.registerTool('generate-video', {
     inputSchema: {
         prompt: zod_1.z.string().describe('The prompt to generate.'),
         type: zod_1.z
-            .enum(['lite', 'pro'])
+            .enum(['lite', 'pro', 'hailuo'])
             .optional()
             .default('lite')
             .describe('Use pro model when you need higher quality.'),
@@ -849,16 +882,21 @@ server.registerTool('generate-video', {
             .string()
             .optional()
             .describe('The image file name of the end frame.'),
+        resolution: zod_1.z
+            .enum(['720p', '1080p'])
+            .optional()
+            .default('720p')
+            .describe('The resolution of the video.'),
         watermark: zod_1.z
             .boolean()
             .optional()
             .default(false)
             .describe('Whether to add watermark to the video.'),
     },
-}, async ({ prompt, saveToFileName, start_frame, end_frame, duration, watermark }, context) => {
+}, async ({ prompt, saveToFileName, start_frame, end_frame, duration, watermark, resolution, type, }, context) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-video');
+        const currentSession = await validateSession('generate-video');
         const validatedFileName = validateFileName(saveToFileName);
         console.log(`Generating video with prompt: ${prompt.substring(0, 100)}...`);
         const ai = currentSession.ai;
@@ -872,8 +910,9 @@ server.registerTool('generate-video', {
             start_frame: startFrameUri,
             end_frame: endFrameUri,
             duration,
-            resolution: '720p',
+            resolution,
             watermark,
+            type,
             onProgress: async (metaData) => {
                 try {
                     await sendProgress(context, ++progress, undefined, JSON.stringify(metaData));
@@ -978,7 +1017,7 @@ server.registerTool('generate-video-kenburns', {
 }, async ({ image_path, duration, camera_motion = 'zoom_in', size, saveToFileName, }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-video-kenburns');
+        const currentSession = await validateSession('generate-video-kenburns');
         const validatedFileName = validateFileName(saveToFileName);
         const files = currentSession.files;
         const terminal = currentSession.terminal;
@@ -1098,7 +1137,7 @@ server.registerTool('generate-sound-effect', {
     },
 }, async ({ prompt_in_english, loop, saveToFileName, duration_seconds }) => {
     try {
-        const currentSession = validateSession('generate-sound-effect');
+        const currentSession = await validateSession('generate-sound-effect');
         const ai = currentSession.ai;
         const res = await ai.generateSoundEffect({
             prompt: prompt_in_english,
@@ -1167,7 +1206,7 @@ server.registerTool('generate-sound-effect', {
 //   async ({ userPrompt, duration, size, saveToFileName }, context) => {
 //     try {
 //       // 验证session状态
-//       const currentSession = validateSession('generate-principle-video');
+//       const currentSession = await validateSession('generate-principle-video');
 //       const [width, height] = size.split('x').map(Number);
 //       const ai = currentSession.ai;
 //       // 使用 generatePrincipleVideo 方法
@@ -1230,6 +1269,200 @@ server.registerTool('generate-sound-effect', {
 //     }
 //   }
 // );
+server.registerTool('generate-song', {
+    title: 'Generate Song',
+    description: 'Generate a song with vocals and customizable parameters.',
+    inputSchema: {
+        lyrics: zod_1.z.string().describe(`The lyrics to generate the song.
+- 完整歌词通常包括以下桥段：
+  - 前奏: intro，歌曲开始的音乐部分，主要用于引导歌曲的整体氛围。
+  - 主歌: verse，通常在前奏之后，歌曲中叙述歌曲故事或主题的部分。
+  - 副歌: chorus，一般在主歌之后，旋律有记忆点和感染力，是整首歌的高潮，进一步强化歌曲的主题和情感。
+  - 间奏: inst，歌曲中的纯音乐段落，用于连接不同的演唱部分。
+  - 尾奏: outro，歌曲结束后的音乐段落，用于营造歌曲结束的氛围。
+  - 桥段: bridge，通常出现在歌曲中段或接近结尾处，是一个过渡部分，用于连接不同的歌曲段落。
+### 歌词示例 lyrics.txt
+\`\`\`txt
+[intro]
+[verse]
+记得那一天 那一天我们相恋
+说好彼此都不说再见
+遵守诺言 用心去相恋
+我为你撑伞 你为我取暖
+[inst]
+[chorus]
+当我把心交给你的那一天
+你却消失在我的眼前
+事到如今已经过了好多年
+是否你还像从前
+[outro]
+\`\`\`
+`),
+        duration: zod_1.z
+            .number()
+            .min(30)
+            .max(240)
+            .describe('The duration of the song in seconds (30-240).'),
+        genre: zod_1.z
+            .enum([
+            'Folk',
+            'Pop',
+            'Rock',
+            'Chinese Style',
+            'Hip Hop/Rap',
+            'R&B/Soul',
+            'Punk',
+            'Electronic',
+            'Jazz',
+            'Reggae',
+            'DJ',
+            'Pop Punk',
+            'Disco',
+            'Future Bass',
+            'Pop Rap',
+            'Trap Rap',
+            'R&B Rap',
+            'Chinoiserie Electronic',
+            'GuFeng Music',
+            'Pop Rock',
+            'Jazz Pop',
+            'Bossa Nova',
+            'Contemporary R&B',
+        ])
+            .optional()
+            .describe('The genre of the song.'),
+        mood: zod_1.z
+            .enum([
+            'Happy',
+            'Dynamic/Energetic',
+            'Sentimental/Melancholic/Lonely',
+            'Inspirational/Hopeful',
+            'Nostalgic/Memory',
+            'Excited',
+            'Sorrow/Sad',
+            'Chill',
+            'Relaxing',
+            'Romantic',
+            'Miss',
+            'Groovy/Funky',
+            'Dreamy/Ethereal',
+            'Calm/Relaxing',
+        ])
+            .optional()
+            .describe('The mood of the song.'),
+        gender: zod_1.z
+            .enum(['Female', 'Male'])
+            .optional()
+            .describe('The gender of the vocalist.'),
+        timbre: zod_1.z
+            .enum([
+            'Warm',
+            'Bright',
+            'Husky',
+            'Electrified voice',
+            'Sweet_AUDIO_TIMBRE',
+            'Cute_AUDIO_TIMBRE',
+            'Loud and sonorous',
+            'Powerful',
+            'Sexy/Lazy',
+        ])
+            .optional()
+            .describe('The timbre/voice quality of the vocalist.'),
+        skipCopyCheck: zod_1.z
+            .boolean()
+            .optional()
+            .default(false)
+            .describe('Whether to skip copyright check.'),
+        saveToFileName: zod_1.z.string().describe('The filename to save.'),
+    },
+}, async ({ lyrics, duration, genre, mood, gender, timbre, skipCopyCheck, saveToFileName, }, context) => {
+    try {
+        // 验证session状态
+        const currentSession = await validateSession('generate-song');
+        const validatedFileName = validateFileName(saveToFileName);
+        console.log(`Generating Song with lyrics: ${lyrics.substring(0, 100)}... (${duration}s, genre: ${genre || 'auto'}, mood: ${mood || 'auto'})`);
+        const ai = currentSession.ai;
+        let progress = 0;
+        const res = await ai.generateSong({
+            lyrics: lyrics.trim(),
+            duration,
+            genre,
+            mood,
+            gender,
+            timbre,
+            skipCopyCheck,
+            onProgress: async (metaData) => {
+                try {
+                    await sendProgress(context, metaData.Result?.Progress ?? ++progress, metaData.Result?.Progress ? 100 : undefined, JSON.stringify(metaData));
+                }
+                catch (progressError) {
+                    console.warn('Failed to send progress update:', progressError);
+                }
+            },
+        });
+        if (!res) {
+            throw new Error('Failed to generate Song: no response from AI service');
+        }
+        if (res.url) {
+            console.log('Song generated successfully, saving to materials...');
+            const uri = await saveMaterial(currentSession, res.url, validatedFileName);
+            const { url, duration: songDuration, captions, ...opts } = res;
+            // 保存captions到本地
+            if (captions) {
+                const captionsText = JSON.stringify(captions, null, 2);
+                // 本地路径
+                const localPath = node_path_1.default.resolve(projectLocalDir, 'materials', `${validatedFileName}.captions.json`);
+                // 保存到本地
+                await (0, promises_1.writeFile)(localPath, captionsText);
+            }
+            const result = {
+                success: true,
+                // source: url,
+                uri,
+                durationMs: Math.floor((songDuration || duration) * 1000),
+                lyrics: lyrics.substring(0, 100),
+                requestedDuration: duration,
+                genre,
+                mood,
+                gender,
+                timbre,
+                captions,
+                timestamp: new Date().toISOString(),
+                ...opts,
+            };
+            return {
+                content: [
+                    {
+                        type: 'text',
+                        text: JSON.stringify(result),
+                    },
+                ],
+            };
+        }
+        else {
+            console.warn('Song generation completed but no URL returned');
+            return {
+                content: [
+                    {
+                        type: 'text',
+                        text: JSON.stringify({
+                            success: false,
+                            error: 'No Song URL returned from AI service',
+                            response: res,
+                            timestamp: new Date().toISOString(),
+                        }),
+                    },
+                ],
+            };
+        }
+    }
+    catch (error) {
+        return createErrorResponse(error, 'generate-song');
+    }
+});
 server.registerTool('generate-bgm', {
     title: 'Generate BGM',
     description: 'Generate the bgm.',
@@ -1245,7 +1478,7 @@ server.registerTool('generate-bgm', {
 }, async ({ prompt, duration, saveToFileName }, context) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-bgm');
+        const currentSession = await validateSession('generate-bgm');
         const validatedFileName = validateFileName(saveToFileName);
         console.log(`Generating BGM with prompt: ${prompt.substring(0, 100)}... (${duration}s)`);
         const ai = currentSession.ai;
@@ -1364,7 +1597,7 @@ server.registerTool('generate-scene-tts', {
 }, async ({ text, voiceID, saveToFileName, speed, pitch, volume, emotion, explicit_language, }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('generate-scene-tts');
+        const currentSession = await validateSession('generate-scene-tts');
         const validatedFileName = validateFileName(saveToFileName);
         const finalSpeed = speed ?? 1;
         volume = volume ?? 1;
@@ -1478,7 +1711,7 @@ server.registerTool('compile-and-run', {
 }, async ({ projectFileName, outputFileName }) => {
     try {
         // 验证session状态
-        const currentSession = validateSession('compile-and-run');
+        const currentSession = await validateSession('compile-and-run');
         console.log('Starting video compilation and rendering...');
         // 验证terminal可用性
         const terminal = currentSession.terminal;