npm - @mindstudio-ai/remy - Versions diffs - 0.1.145 → 0.1.147 - Mend

@mindstudio-ai/remy 0.1.145 → 0.1.147

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/dist/automatedActions/buildFromInitialSpec.md +2 -10
package/dist/automatedActions/buildFromRoadmap.md +2 -1
package/dist/automatedActions/postBuildPolish.md +18 -0
package/dist/automatedActions/postRoadmapBuild.md +13 -0
package/dist/automatedActions/publish.md +2 -0
package/dist/headless.js +262 -116
package/dist/index.js +265 -121
package/dist/prompt/compiled/interfaces.md +49 -37
package/dist/prompt/static/authoring.md +1 -1
package/dist/prompt/static/instructions.md +2 -2
package/dist/prompt/static/team.md +1 -1
package/package.json +1 -1

package/dist/automatedActions/buildFromInitialSpec.md CHANGED Viewed

@@ -1,10 +1,11 @@
 ---
   trigger: buildFromInitialSpec
+  next: postBuildPolish
 ---
 This is an automated action triggered by the user pressing "Build" in the editor after reviewing the spec.
-The user has reviewed the spec and is ready to build. There are four phases to building: planning, coding, verifying, polishing. Execute each phase in order in a single turn.
+The user has reviewed the spec and is ready to build. There are three phases: planning, coding, and verifying. Execute each phase in order in a single turn.
 ## Planning
 Think about your approach and then get a quick sanity check from `codeSanityCheck` to make sure you aren't missing anything.
@@ -21,12 +22,3 @@ Then, build everything in one turn: tables, methods, interfaces, manifest update
 - If the app has a web frontend, check the browser logs to make sure there are no errors rendering it.
 - Use `runAutomatedBrowserTest` to smoke-test the main UI flow. The dev database is a disposable snapshot, so don't worry about being destructive. Fix any errors before finishing.
 - If there is a scenario that seeds the app with mock data, use it to present the app to the user with initial data seeded, so they can see and play with the real app. Let the user know they can reset the app using a scenario to empty it if they wish. Showing the user something they can play with immediately is important when it comes to landing a strong first impression.
-## Polishing
-When verification is complete, take a step back and do an explicit polish pass before verifying. Re-read the spec files and the design expert's guidance, then walk through each frontend file looking for design details that got skipped in the initial build: animations, transitions, hover states, micro-interactions, spring physics, entrance reveals, gesture handling, layout issues, and anything else.
-The initial build prioritizes getting everything connected and functional, but this pass closes the gap between "it works" and "it feels great." In many ways this is *the* most important part of the initial build, as the user's first experience of the deliverable will set their expectations for every iteration that follows. Don't mess this up.
-Then, ask the `visualDesignExpert` to take a screenshot and verity that the visual design looks correct. Fix any issues it flags - we want the user's first time seeing the finished product to truly wow them.
-When everything is working, use `productVision` to mark the MVP roadmap item as done, then call `setProjectOnboardingState({ state: "onboardingFinished" })`. Finally, call `compactConversation` to summarize the build session and free up context for the next phase of work.

package/dist/automatedActions/buildFromRoadmap.md CHANGED Viewed

@@ -1,5 +1,6 @@
 ---
   trigger: buildFromRoadmap
+  next: postRoadmapBuild
 ---
 This is an automated action triggered by the user pressing "Build Now" on the roadmap item {{path}}
@@ -12,4 +13,4 @@ Then, put together a plan to build out the feature. Write the plan with `writePl
 When they've approved the plan, be sure to update the spec first - remember, the spec is the source of truth about the product. Then, build everything in one turn, using the spec as the master plan.
-When you're finished, verify your work, then tell `productVision` what was done so it can update the roadmap to reflect the progress. Give the user a summary of what was done, then call `compactConversation` to summarize the build session and free up context.
+When you're finished building, verify your work and give the user a summary of what was done.

package/dist/automatedActions/postBuildPolish.md ADDED Viewed

@@ -0,0 +1,18 @@
+---
+  trigger: postBuildPolish
+---
+This is an automated follow-up after the initial build. The code is written and verified. Now it's time to polish and finalize so we can deliver something beautiful and magical as the user's first experience with our work.
+## Polishing
+Take a step back and do an explicit polish pass. Re-read the spec files and the design expert's guidance, then walk through each frontend file looking for design details that got skipped in the initial build: layout animations, transitions, hover states, micro-interactions, spring physics, entrance reveals, gesture handling, layout issues, responsiveness, and anything else. We need this to feel truly amazing and wow the user - it's worth it to take the time to get it right.
+The initial build prioritizes getting everything connected and functional, but this pass closes the gap between "it works" and "it feels great." In many ways this is *the* most important part of the initial build, as the user's first experience of the deliverable will set their expectations for every iteration that follows. Don't mess this up.
+When you have finished, ask the `visualDesignExpert` to take a screenshot and verify that the visual design looks correct. Fix any issues it flags. We want the user's first time seeing the finished product to truly wow them.
+## Finalizing
+When everything is working and polished:
+1. Use `productVision` to mark the MVP roadmap item as done.
+2. Call `setProjectOnboardingState({ state: "onboardingFinished" })`.
+3. Call `compactConversation` to summarize the build session and free up context for the next phase of work.

package/dist/automatedActions/postRoadmapBuild.md ADDED Viewed

@@ -0,0 +1,13 @@
+---
+  trigger: postRoadmapBuild
+---
+This is an automated follow-up after building a roadmap feature. The code is written and verified. Now it's time to polish and finalize.
+## Polishing
+Take a step back and do an explicit polish pass. Re-read the spec files and the design expert's guidance, then walk through each frontend file you changed looking for design details that got skipped: animations, transitions, hover states, micro-interactions, and anything else that closes the gap between "it works" and "it feels great."
+## Finalizing
+When everything is working:
+1. Tell `productVision` what was done so it can update the roadmap to reflect the progress.
+2. Call `compactConversation` to summarize the build session and free up context.

package/dist/automatedActions/publish.md CHANGED Viewed

@@ -14,4 +14,6 @@ If approved:
 - Use `mindstudio-prod releases status --wait` to poll the build until it completes. Let the user know it's deploying, then report back when it's live.
 - Once deployed, offer to help with next steps. This includes technical steps likesetting up a custom domain (`mindstudio-prod domains`), checking for errors (`mindstudio-prod requests stats`), seeding production data (`mindstudio-prod db`), managing env vars/secrets, or anything else they need for launch. It also includes going above and beyond and helping holistically. If it's the initial deploy, offer to help create collateral to announce the launch (e.g., an image for sharing on social media, text copy for a post, etc); if it's a meaningful incremental update, an annoucement post or something similar - go above and beyond here to help the user see that you care about the product from end-to-end, not just writing code! They will be appreciative, grateful, and pleased with your creativity here. Refer to the design guidance in the spec for how to talk about the product, and consider consulting the design expert to generate images or other marketing collateral.
+After everything is done, call `compactConversation` to summarize the current session and free up context for the next phase of work.
 If dismissed, acknowledge and do nothing.

package/dist/headless.js CHANGED Viewed

@@ -6,7 +6,15 @@ var __export = (target, all) => {
 // src/headless.ts
 import { createInterface } from "readline";
-import { writeFileSync, readFileSync, unlinkSync } from "fs";
+import {
+  writeFileSync,
+  readFileSync,
+  unlinkSync,
+  mkdirSync,
+  existsSync
+} from "fs";
+import { writeFile } from "fs/promises";
+import { basename, join, extname } from "path";
 // src/logger.ts
 import fs from "fs";
@@ -139,87 +147,9 @@ function readJsonAsset(fallback, ...segments) {
   }
 }
-// src/tools/_helpers/sidecar.ts
-var log2 = createLogger("sidecar");
-var baseUrl = null;
-function setSidecarBaseUrl(url) {
-  baseUrl = url;
-  log2.info("Configured", { url });
-}
-function isSidecarConfigured() {
-  return baseUrl !== null;
-}
-async function sidecarRequest(endpoint, body = {}, options) {
-  if (!baseUrl) {
-    throw new Error("Sidecar not available");
-  }
-  const url = `${baseUrl}${endpoint}`;
-  try {
-    const res = await fetch(url, {
-      method: "POST",
-      headers: { "Content-Type": "application/json" },
-      body: JSON.stringify(body),
-      signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
-    });
-    if (!res.ok) {
-      log2.error("Sidecar error", { endpoint, status: res.status });
-      throw new Error(`Sidecar error: ${res.status}`);
-    }
-    const data = await res.json();
-    if (data?.success === false) {
-      const code = data.errorCode ? ` [${data.errorCode}]` : "";
-      throw new Error(`${data.error || "Unknown error"}${code}`);
-    }
-    return data;
-  } catch (err) {
-    if (err.message.startsWith("Sidecar error")) {
-      throw err;
-    }
-    log2.error("Sidecar connection error", { endpoint, error: err.message });
-    throw new Error(`Sidecar connection error: ${err.message}`);
-  }
-}
-// src/tools/_helpers/lsp.ts
-var setLspBaseUrl = setSidecarBaseUrl;
-var isLspConfigured = isSidecarConfigured;
-async function lspRequest(endpoint, body) {
-  return sidecarRequest(endpoint, body);
-}
 // src/prompt/static/projectContext.ts
 import fs4 from "fs";
 import path3 from "path";
-var AGENT_INSTRUCTION_FILES = [
-  "CLAUDE.md",
-  "claude.md",
-  ".claude/instructions.md",
-  "AGENTS.md",
-  "agents.md",
-  ".agents.md",
-  "COPILOT.md",
-  "copilot.md",
-  ".copilot-instructions.md",
-  ".github/copilot-instructions.md",
-  "REMY.md",
-  "remy.md",
-  ".cursorrules",
-  ".cursorules"
-];
-function loadProjectInstructions() {
-  for (const file of AGENT_INSTRUCTION_FILES) {
-    try {
-      const content = fs4.readFileSync(file, "utf-8").trim();
-      if (content) {
-        return `
-## Project Instructions (${file})
-${content}`;
-      }
-    } catch {
-    }
-  }
-  return "";
-}
 function loadProjectManifest() {
   try {
     const manifest = fs4.readFileSync("mindstudio.json", "utf-8");
@@ -346,7 +276,6 @@ function resolveIncludes(template) {
 }
 function buildSystemPrompt(onboardingState, viewContext) {
   const projectContext = [
-    loadProjectInstructions(),
     loadProjectManifest(),
     loadSpecFileMetadata(),
     loadProjectFileListing()
@@ -421,29 +350,26 @@ Current date: ${now}
   {{compiled/msfm.md}}
 </mindstudio_flavored_markdown_spec_docs>
-<project_context>
-${projectContext}
-</project_context>
 <intake_mode_instructions>
-{{static/intake.md}}
+  {{static/intake.md}}
 </intake_mode_instructions>
 <spec_authoring_instructions>
-{{static/authoring.md}}
+  {{static/authoring.md}}
 </spec_authoring_instructions>
-{{static/team.md}}
+<team>
+  {{static/team.md}}
+</team>
 <code_authoring_instructions>
 {{static/coding.md}}
-${isLspConfigured() ? `<typescript_lsp>
+<typescript_lsp>
 {{static/lsp.md}}
-</typescript_lsp>` : ""}
+</typescript_lsp>
 </code_authoring_instructions>
-{{static/instructions.md}}
-${loadPlanStatus()}
 <conversation_summaries>
 Your conversation history may include <prior_conversation_summary> blocks in the user's messages. These are automated summaries of earlier messages that have been compacted to save context space. The user does not see this summary, they see the full conversation history in their UI. Treat the summary as ground truth for what happened before, but do not reference it directly to the user ("as mentioned in the summary..."). Just continue naturally as if you remember the prior work.
@@ -457,30 +383,38 @@ New projects progress through four onboarding states. The user might skip this e
 - **initialSpecAuthoring**: Writing and refining the first spec. The user can see it in the editor as it streams in and can give feedback to iterate on it. This phase covers both the initial draft and any back-and-forth refinement before code generation.
 - **initialCodegen**: First code generation from the spec. The agent is generating methods, tables, interfaces, manifest updates, and scenarios. This can take a while and involves heavy tool use. The user sees a full-screen build progress view.
 - **onboardingFinished**: The project is built and ready. Full development mode with all tools available. From here on, keep spec and code in sync as changes are made.
+</project_onboarding>
+{{static/instructions.md}}
 <!-- cache_breakpoint -->
-  <current_project_onboarding_state>
+<current_project_onboarding_state>
   ${onboardingState ?? "onboardingFinished"}
-  </current_project_onboarding_state>
-</project_onboarding>
+</current_project_onboarding_state>
+<project_context>
+${projectContext}
+</project_context>
 <view_context>
 The user is currently in ${viewContext?.mode ?? "code"} mode.
 ${viewContext?.activeFile ? `Active file: ${viewContext.activeFile}` : ""}
 </view_context>
+${loadPlanStatus()}
 `;
   return resolveIncludes(template);
 }
 // src/api.ts
-var log3 = createLogger("api");
+var log2 = createLogger("api");
 async function* streamChat(params) {
   const { baseUrl: baseUrl2, apiKey, signal, requestId, ...body } = params;
   const url = `${baseUrl2}/_internal/v2/agent/remy/chat`;
   const startTime = Date.now();
   const subAgentId = body.subAgentId;
-  log3.info("API request", {
+  log2.info("API request", {
     requestId,
     ...subAgentId && { subAgentId },
     model: body.model,
@@ -500,13 +434,13 @@ async function* streamChat(params) {
     });
   } catch (err) {
     if (signal?.aborted) {
-      log3.warn("Request aborted", {
+      log2.warn("Request aborted", {
         requestId,
         ...subAgentId && { subAgentId }
       });
       throw err;
     }
-    log3.error("Network error", {
+    log2.error("Network error", {
       requestId,
       ...subAgentId && { subAgentId },
       error: err.message
@@ -515,7 +449,7 @@ async function* streamChat(params) {
     return;
   }
   const ttfb = Date.now() - startTime;
-  log3.info("API response", {
+  log2.info("API response", {
     requestId,
     ...subAgentId && { subAgentId },
     status: res.status,
@@ -533,7 +467,7 @@ async function* streamChat(params) {
       }
     } catch {
     }
-    log3.error("API error", {
+    log2.error("API error", {
       requestId,
       ...subAgentId && { subAgentId },
       status: res.status,
@@ -546,6 +480,7 @@ async function* streamChat(params) {
   const reader = res.body.getReader();
   const decoder = new TextDecoder();
   let buffer = "";
+  let receivedDone = false;
   while (true) {
     let stallTimer;
     let readResult;
@@ -563,7 +498,7 @@ async function* streamChat(params) {
     } catch {
       clearTimeout(stallTimer);
       await reader.cancel();
-      log3.error("Stream stalled", {
+      log2.error("Stream stalled", {
         requestId,
         ...subAgentId && { subAgentId },
         durationMs: Date.now() - startTime
@@ -589,7 +524,8 @@ async function* streamChat(params) {
         const event = JSON.parse(line.slice(6));
         if (event.type === "done") {
           const elapsed = Date.now() - startTime;
-          log3.info("Stream complete", {
+          receivedDone = true;
+          log2.info("Stream complete", {
             requestId,
             ...subAgentId && { subAgentId },
             durationMs: elapsed,
@@ -597,12 +533,27 @@ async function* streamChat(params) {
             inputTokens: event.usage.inputTokens,
             outputTokens: event.usage.outputTokens
           });
+        } else if (event.type === "error") {
+          log2.error("SSE error event", {
+            requestId,
+            ...subAgentId && { subAgentId },
+            error: event.error,
+            durationMs: Date.now() - startTime
+          });
         }
         yield event;
       } catch {
       }
     }
   }
+  if (!receivedDone) {
+    log2.warn("Stream ended without done event", {
+      requestId,
+      ...subAgentId && { subAgentId },
+      durationMs: Date.now() - startTime,
+      remainingBuffer: buffer.slice(0, 200)
+    });
+  }
   if (buffer.startsWith("data: ")) {
     try {
       yield JSON.parse(buffer.slice(6));
@@ -639,7 +590,7 @@ async function* streamChatWithRetry(params, options) {
         return;
       }
       const backoff = INITIAL_BACKOFF_MS * 2 ** attempt;
-      log3.warn("Retrying", {
+      log2.warn("Retrying", {
         requestId: params.requestId,
         attempt: attempt + 1,
         maxRetries: MAX_RETRIES,
@@ -681,7 +632,7 @@ async function generateBackgroundAck(params) {
 }
 // src/compaction/index.ts
-var log4 = createLogger("compaction");
+var log3 = createLogger("compaction");
 var CONVERSATION_SUMMARY_PROMPT = readAsset("compaction", "conversation.md");
 var SUBAGENT_SUMMARY_PROMPT = readAsset("compaction", "subagent.md");
 var SUMMARIZABLE_SUBAGENTS = ["visualDesignExpert", "productVision"];
@@ -745,7 +696,7 @@ async function compactConversation(messages, apiConfig, system, tools2) {
       }
     ]
   }));
-  log4.info("Compaction complete", { summaries: summaries.length });
+  log3.info("Compaction complete", { summaries: summaries.length });
   return checkpointMessages;
 }
 function findSafeInsertionPoint(messages) {
@@ -849,7 +800,7 @@ async function generateSummary(apiConfig, name, compactionPrompt, messagesToSumm
   if (!serialized.trim()) {
     return null;
   }
-  log4.info("Generating summary", {
+  log3.info("Generating summary", {
     name,
     messageCount: messagesToSummarize.length,
     cacheReuse: !!mainSystem
@@ -875,15 +826,15 @@ ${serialized}` : serialized;
     if (event.type === "text") {
       summaryText += event.text;
     } else if (event.type === "error") {
-      log4.error("Summary generation failed", { name, error: event.error });
+      log3.error("Summary generation failed", { name, error: event.error });
       return null;
     }
   }
   if (!summaryText.trim()) {
-    log4.warn("Empty summary generated", { name });
+    log3.warn("Empty summary generated", { name });
     return null;
   }
-  log4.info("Summary generated", { name, summaryLength: summaryText.length });
+  log3.info("Summary generated", { name, summaryLength: summaryText.length });
   return summaryText.trim();
 }
@@ -2439,6 +2390,50 @@ var editsFinishedTool = {
   }
 };
+// src/tools/_helpers/sidecar.ts
+var log4 = createLogger("sidecar");
+var baseUrl = null;
+function setSidecarBaseUrl(url) {
+  baseUrl = url;
+  log4.info("Configured", { url });
+}
+async function sidecarRequest(endpoint, body = {}, options) {
+  if (!baseUrl) {
+    throw new Error("Sidecar not available");
+  }
+  const url = `${baseUrl}${endpoint}`;
+  try {
+    const res = await fetch(url, {
+      method: "POST",
+      headers: { "Content-Type": "application/json" },
+      body: JSON.stringify(body),
+      signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
+    });
+    if (!res.ok) {
+      log4.error("Sidecar error", { endpoint, status: res.status });
+      throw new Error(`Sidecar error: ${res.status}`);
+    }
+    const data = await res.json();
+    if (data?.success === false) {
+      const code = data.errorCode ? ` [${data.errorCode}]` : "";
+      throw new Error(`${data.error || "Unknown error"}${code}`);
+    }
+    return data;
+  } catch (err) {
+    if (err.message.startsWith("Sidecar error")) {
+      throw err;
+    }
+    log4.error("Sidecar connection error", { endpoint, error: err.message });
+    throw new Error(`Sidecar connection error: ${err.message}`);
+  }
+}
+// src/tools/_helpers/lsp.ts
+var setLspBaseUrl = setSidecarBaseUrl;
+async function lspRequest(endpoint, body) {
+  return sidecarRequest(endpoint, body);
+}
 // src/tools/code/lspDiagnostics.ts
 var lspDiagnosticsTool = {
   clearable: true,
@@ -6030,13 +6025,24 @@ function resolveAction(text) {
     }
   }
   let body = readAsset("automatedActions", `${triggerName}.md`);
+  let next;
+  const fmMatch = body.match(/^---\s*\n([\s\S]*?)\n---/);
+  if (fmMatch) {
+    const nextMatch = fmMatch[1].match(/^\s*next:\s*(\w+)\s*$/m);
+    if (nextMatch) {
+      next = nextMatch[1];
+    }
+  }
   body = body.replace(/^---[\s\S]*?---\s*/, "");
   for (const [key, value] of Object.entries(params)) {
     const str = typeof value === "string" ? value : JSON.stringify(value);
     body = body.replaceAll(`{{${key}}}`, str);
   }
-  return `@@automated::${triggerName}@@
-${body}`;
+  return {
+    message: `@@automated::${triggerName}@@
+${body}`,
+    next
+  };
 }
 // src/headless.ts
@@ -6098,6 +6104,7 @@ async function startHeadless(opts = {}) {
   let currentRequestId;
   let completedEmitted = false;
   let turnStart = 0;
+  let pendingNextAction;
   const EXTERNAL_TOOL_TIMEOUT_MS = 3e5;
   const pendingTools = /* @__PURE__ */ new Map();
   const earlyResults = /* @__PURE__ */ new Map();
@@ -6248,10 +6255,19 @@ ${xmlParts}
           applyPendingSummaries();
           applyPendingBlockUpdates();
           flushBackgroundQueue();
+          if (pendingNextAction) {
+            const next = pendingNextAction;
+            pendingNextAction = void 0;
+            handleMessage(
+              { action: "message", text: `@@automated::${next}@@` },
+              `chain-${Date.now()}`
+            );
+          }
         }, 0);
         return;
       case "turn_cancelled":
         completedEmitted = true;
+        pendingNextAction = void 0;
         emit("completed", { success: false, error: "cancelled" }, rid);
         return;
       // Streaming events — forward with requestId
@@ -6366,6 +6382,120 @@ ${xmlParts}
     }
   }
   toolRegistry.onEvent = onEvent;
+  const UPLOADS_DIR = "src/.user-uploads";
+  function filenameFromUrl(url) {
+    try {
+      const pathname = new URL(url).pathname;
+      const name = basename(pathname);
+      return name && name !== "/" ? decodeURIComponent(name) : `upload-${Date.now()}`;
+    } catch {
+      return `upload-${Date.now()}`;
+    }
+  }
+  function resolveUniqueFilename(name) {
+    if (!existsSync(join(UPLOADS_DIR, name))) {
+      return name;
+    }
+    const ext = extname(name);
+    const base = name.slice(0, name.length - ext.length);
+    let counter = 1;
+    while (existsSync(join(UPLOADS_DIR, `${base}-${counter}${ext}`))) {
+      counter++;
+    }
+    return `${base}-${counter}${ext}`;
+  }
+  const IMAGE_EXTENSIONS = /* @__PURE__ */ new Set([
+    ".png",
+    ".jpg",
+    ".jpeg",
+    ".gif",
+    ".webp",
+    ".svg",
+    ".bmp",
+    ".ico",
+    ".tiff",
+    ".tif",
+    ".avif",
+    ".heic",
+    ".heif"
+  ]);
+  function isImageAttachment(att) {
+    const name = att.filename || filenameFromUrl(att.url);
+    return IMAGE_EXTENSIONS.has(extname(name).toLowerCase());
+  }
+  async function persistAttachments(attachments) {
+    const nonVoice = attachments.filter((a) => !a.isVoice);
+    if (nonVoice.length === 0) {
+      return { documents: [], images: [] };
+    }
+    mkdirSync(UPLOADS_DIR, { recursive: true });
+    const results = await Promise.allSettled(
+      nonVoice.map(async (att) => {
+        const name = resolveUniqueFilename(
+          att.filename || filenameFromUrl(att.url)
+        );
+        const localPath = join(UPLOADS_DIR, name);
+        const res = await fetch(att.url, {
+          signal: AbortSignal.timeout(3e4)
+        });
+        if (!res.ok) {
+          throw new Error(`HTTP ${res.status} downloading ${att.url}`);
+        }
+        const buffer = Buffer.from(await res.arrayBuffer());
+        await writeFile(localPath, buffer);
+        log11.info("Attachment saved", {
+          filename: name,
+          path: localPath,
+          bytes: buffer.length
+        });
+        let extractedTextPath;
+        if (att.extractedTextUrl) {
+          try {
+            const textRes = await fetch(att.extractedTextUrl, {
+              signal: AbortSignal.timeout(3e4)
+            });
+            if (textRes.ok) {
+              extractedTextPath = `${localPath}.txt`;
+              await writeFile(extractedTextPath, await textRes.text(), "utf-8");
+              log11.info("Extracted text saved", { path: extractedTextPath });
+            }
+          } catch {
+          }
+        }
+        return { filename: name, localPath, extractedTextPath };
+      })
+    );
+    const settled = results.map((r, i) => ({
+      result: r.status === "fulfilled" ? r.value : null,
+      isImage: isImageAttachment(nonVoice[i])
+    }));
+    return {
+      documents: settled.filter((s) => !s.isImage).map((s) => s.result),
+      images: settled.filter((s) => s.isImage).map((s) => s.result)
+    };
+  }
+  function buildUploadHeader(results) {
+    const succeeded = results.filter(Boolean);
+    if (succeeded.length === 0) {
+      return "";
+    }
+    if (succeeded.length === 1) {
+      const r = succeeded[0];
+      const parts = [`[Uploaded file: ${r.localPath}`];
+      if (r.extractedTextPath) {
+        parts.push(`extracted text: ${r.extractedTextPath}`);
+      }
+      return parts.join(" \u2014 ") + "]";
+    }
+    const lines = succeeded.map((r) => {
+      if (r.extractedTextPath) {
+        return `- ${r.localPath} (extracted text: ${r.extractedTextPath})`;
+      }
+      return `- ${r.localPath}`;
+    });
+    return `[Uploaded files]
+${lines.join("\n")}`;
+  }
   async function handleMessage(parsed, requestId) {
     if (running) {
       emit(
@@ -6387,12 +6517,26 @@ ${xmlParts}
     turnStart = Date.now();
     const attachments = parsed.attachments;
     if (attachments?.length) {
-      console.warn(
-        `[headless] Message has ${attachments.length} attachment(s):`,
-        attachments.map((a) => a.url)
-      );
+      log11.info("Message has attachments", {
+        count: attachments.length,
+        urls: attachments.map((a) => a.url)
+      });
     }
     let userMessage = parsed.text ?? "";
+    if (attachments?.some((a) => !a.isVoice)) {
+      try {
+        const { documents, images } = await persistAttachments(attachments);
+        const all = [...documents, ...images];
+        const header = buildUploadHeader(all);
+        if (header) {
+          userMessage = userMessage ? `${header}
+${userMessage}` : header;
+        }
+      } catch (err) {
+        log11.warn("Attachment persistence failed", { error: err.message });
+      }
+    }
     let resolved = null;
     try {
       resolved = resolveAction(userMessage);
@@ -6404,8 +6548,10 @@ ${xmlParts}
       );
       return;
     }
+    pendingNextAction = void 0;
     if (resolved !== null) {
-      userMessage = resolved;
+      userMessage = resolved.message;
+      pendingNextAction = resolved.next;
     }
     const isHidden = resolved !== null || !!parsed.hidden;
     const rawText = parsed.text ?? "";