@mindstudio-ai/remy 0.1.146 → 0.1.148

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,10 +1,11 @@
  ---
  trigger: buildFromInitialSpec
+ next: postBuildPolish
  ---
 
  This is an automated action triggered by the user pressing "Build" in the editor after reviewing the spec.
 
- The user has reviewed the spec and is ready to build. There are four phases to building: planning, coding, verifying, polishing. Execute each phase in order in a single turn.
+ The user has reviewed the spec and is ready to build. There are three phases: planning, coding, and verifying. Execute each phase in order in a single turn.
 
  ## Planning
  Think about your approach and then get a quick sanity check from `codeSanityCheck` to make sure you aren't missing anything.
@@ -21,12 +22,3 @@ Then, build everything in one turn: tables, methods, interfaces, manifest update
  - If the app has a web frontend, check the browser logs to make sure there are no errors rendering it.
  - Use `runAutomatedBrowserTest` to smoke-test the main UI flow. The dev database is a disposable snapshot, so don't worry about being destructive. Fix any errors before finishing.
  - If there is a scenario that seeds the app with mock data, use it to present the app to the user with initial data seeded, so they can see and play with the real app. Let the user know they can reset the app using a scenario to empty it if they wish. Showing the user something they can play with immediately is important when it comes to landing a strong first impression.
-
- ## Polishing
- When verification is complete, take a step back and do an explicit polish pass before verifying. Re-read the spec files and the design expert's guidance, then walk through each frontend file looking for design details that got skipped in the initial build: animations, transitions, hover states, micro-interactions, spring physics, entrance reveals, gesture handling, layout issues, and anything else.
-
- The initial build prioritizes getting everything connected and functional, but this pass closes the gap between "it works" and "it feels great." In many ways this is *the* most important part of the initial build, as the user's first experience of the deliverable will set their expectations for every iteration that follows. Don't mess this up.
-
- Then, ask the `visualDesignExpert` to take a screenshot and verity that the visual design looks correct. Fix any issues it flags - we want the user's first time seeing the finished product to truly wow them.
-
- When everything is working, use `productVision` to mark the MVP roadmap item as done, then call `setProjectOnboardingState({ state: "onboardingFinished" })`. Finally, call `compactConversation` to summarize the build session and free up context for the next phase of work.
@@ -1,5 +1,6 @@
  ---
  trigger: buildFromRoadmap
+ next: postRoadmapBuild
  ---
 
  This is an automated action triggered by the user pressing "Build Now" on the roadmap item {{path}}
@@ -12,4 +13,4 @@ Then, put together a plan to build out the feature. Write the plan with `writePl
 
  When they've approved the plan, be sure to update the spec first - remember, the spec is the source of truth about the product. Then, build everything in one turn, using the spec as the master plan.
 
- When you're finished, verify your work, then tell `productVision` what was done so it can update the roadmap to reflect the progress. Give the user a summary of what was done, then call `compactConversation` to summarize the build session and free up context.
+ When you're finished building, verify your work and give the user a summary of what was done.
@@ -0,0 +1,18 @@
+ ---
+ trigger: postBuildPolish
+ ---
+
+ This is an automated follow-up after the initial build. The code is written and verified. Now it's time to polish and finalize so we can deliver something beautiful and magical as the user's first experience with our work.
+
+ ## Polishing
+ Take a step back and do an explicit polish pass. Re-read the spec files and the design expert's guidance, then walk through each frontend file looking for design details that got skipped in the initial build: layout animations, transitions, hover states, micro-interactions, spring physics, entrance reveals, gesture handling, layout issues, responsiveness, and anything else. We need this to feel truly amazing and wow the user - it's worth it to take the time to get it right.
+
+ The initial build prioritizes getting everything connected and functional, but this pass closes the gap between "it works" and "it feels great." In many ways this is *the* most important part of the initial build, as the user's first experience of the deliverable will set their expectations for every iteration that follows. Don't mess this up.
+
+ When you have finished, ask the `visualDesignExpert` to take a screenshot and verify that the visual design looks correct. Fix any issues it flags. We want the user's first time seeing the finished product to truly wow them.
+
+ ## Finalizing
+ When everything is working and polished:
+ 1. Use `productVision` to mark the MVP roadmap item as done.
+ 2. Call `setProjectOnboardingState({ state: "onboardingFinished" })`.
+ 3. Call `compactConversation` to summarize the build session and free up context for the next phase of work.
@@ -0,0 +1,13 @@
+ ---
+ trigger: postRoadmapBuild
+ ---
+
+ This is an automated follow-up after building a roadmap feature. The code is written and verified. Now it's time to polish and finalize.
+
+ ## Polishing
+ Take a step back and do an explicit polish pass. Re-read the spec files and the design expert's guidance, then walk through each frontend file you changed looking for design details that got skipped: animations, transitions, hover states, micro-interactions, and anything else that closes the gap between "it works" and "it feels great."
+
+ ## Finalizing
+ When everything is working:
+ 1. Tell `productVision` what was done so it can update the roadmap to reflect the progress.
+ 2. Call `compactConversation` to summarize the build session and free up context.
@@ -14,4 +14,6 @@ If approved:
  - Use `mindstudio-prod releases status --wait` to poll the build until it completes. Let the user know it's deploying, then report back when it's live.
  - Once deployed, offer to help with next steps. This includes technical steps likesetting up a custom domain (`mindstudio-prod domains`), checking for errors (`mindstudio-prod requests stats`), seeding production data (`mindstudio-prod db`), managing env vars/secrets, or anything else they need for launch. It also includes going above and beyond and helping holistically. If it's the initial deploy, offer to help create collateral to announce the launch (e.g., an image for sharing on social media, text copy for a post, etc); if it's a meaningful incremental update, an annoucement post or something similar - go above and beyond here to help the user see that you care about the product from end-to-end, not just writing code! They will be appreciative, grateful, and pleased with your creativity here. Refer to the design guidance in the spec for how to talk about the product, and consider consulting the design expert to generate images or other marketing collateral.
 
+ After everything is done, call `compactConversation` to summarize the current session and free up context for the next phase of work.
+
  If dismissed, acknowledge and do nothing.
package/dist/headless.js CHANGED
@@ -6,7 +6,15 @@ var __export = (target, all) => {
 
  // src/headless.ts
  import { createInterface } from "readline";
- import { writeFileSync, readFileSync, unlinkSync } from "fs";
+ import {
+ writeFileSync,
+ readFileSync,
+ unlinkSync,
+ mkdirSync,
+ existsSync
+ } from "fs";
+ import { writeFile } from "fs/promises";
+ import { basename, join, extname } from "path";
 
  // src/logger.ts
  import fs from "fs";
@@ -139,87 +147,9 @@ function readJsonAsset(fallback, ...segments) {
  }
  }
 
- // src/tools/_helpers/sidecar.ts
- var log2 = createLogger("sidecar");
- var baseUrl = null;
- function setSidecarBaseUrl(url) {
- baseUrl = url;
- log2.info("Configured", { url });
- }
- function isSidecarConfigured() {
- return baseUrl !== null;
- }
- async function sidecarRequest(endpoint, body = {}, options) {
- if (!baseUrl) {
- throw new Error("Sidecar not available");
- }
- const url = `${baseUrl}${endpoint}`;
- try {
- const res = await fetch(url, {
- method: "POST",
- headers: { "Content-Type": "application/json" },
- body: JSON.stringify(body),
- signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
- });
- if (!res.ok) {
- log2.error("Sidecar error", { endpoint, status: res.status });
- throw new Error(`Sidecar error: ${res.status}`);
- }
- const data = await res.json();
- if (data?.success === false) {
- const code = data.errorCode ? ` [${data.errorCode}]` : "";
- throw new Error(`${data.error || "Unknown error"}${code}`);
- }
- return data;
- } catch (err) {
- if (err.message.startsWith("Sidecar error")) {
- throw err;
- }
- log2.error("Sidecar connection error", { endpoint, error: err.message });
- throw new Error(`Sidecar connection error: ${err.message}`);
- }
- }
-
- // src/tools/_helpers/lsp.ts
- var setLspBaseUrl = setSidecarBaseUrl;
- var isLspConfigured = isSidecarConfigured;
- async function lspRequest(endpoint, body) {
- return sidecarRequest(endpoint, body);
- }
-
  // src/prompt/static/projectContext.ts
  import fs4 from "fs";
  import path3 from "path";
- var AGENT_INSTRUCTION_FILES = [
- "CLAUDE.md",
- "claude.md",
- ".claude/instructions.md",
- "AGENTS.md",
- "agents.md",
- ".agents.md",
- "COPILOT.md",
- "copilot.md",
- ".copilot-instructions.md",
- ".github/copilot-instructions.md",
- "REMY.md",
- "remy.md",
- ".cursorrules",
- ".cursorules"
- ];
- function loadProjectInstructions() {
- for (const file of AGENT_INSTRUCTION_FILES) {
- try {
- const content = fs4.readFileSync(file, "utf-8").trim();
- if (content) {
- return `
- ## Project Instructions (${file})
- ${content}`;
- }
- } catch {
- }
- }
- return "";
- }
  function loadProjectManifest() {
  try {
  const manifest = fs4.readFileSync("mindstudio.json", "utf-8");
@@ -346,7 +276,6 @@ function resolveIncludes(template) {
  }
  }
  function buildSystemPrompt(onboardingState, viewContext) {
  const projectContext = [
- loadProjectInstructions(),
  loadProjectManifest(),
  loadSpecFileMetadata(),
  loadProjectFileListing()
@@ -421,29 +350,26 @@ Current date: ${now}
  {{compiled/msfm.md}}
  </mindstudio_flavored_markdown_spec_docs>
 
- <project_context>
- ${projectContext}
- </project_context>
-
  <intake_mode_instructions>
- {{static/intake.md}}
+ {{static/intake.md}}
  </intake_mode_instructions>
 
  <spec_authoring_instructions>
- {{static/authoring.md}}
+ {{static/authoring.md}}
  </spec_authoring_instructions>
 
- {{static/team.md}}
+ <team>
+ {{static/team.md}}
+ </team>
 
  <code_authoring_instructions>
  {{static/coding.md}}
- ${isLspConfigured() ? `<typescript_lsp>
+
+ <typescript_lsp>
  {{static/lsp.md}}
- </typescript_lsp>` : ""}
+ </typescript_lsp>
  </code_authoring_instructions>
 
- {{static/instructions.md}}
- ${loadPlanStatus()}
  <conversation_summaries>
  Your conversation history may include <prior_conversation_summary> blocks in the user's messages. These are automated summaries of earlier messages that have been compacted to save context space. The user does not see this summary, they see the full conversation history in their UI. Treat the summary as ground truth for what happened before, but do not reference it directly to the user ("as mentioned in the summary..."). Just continue naturally as if you remember the prior work.
 
@@ -457,30 +383,38 @@ New projects progress through four onboarding states. The user might skip this e
  - **initialSpecAuthoring**: Writing and refining the first spec. The user can see it in the editor as it streams in and can give feedback to iterate on it. This phase covers both the initial draft and any back-and-forth refinement before code generation.
  - **initialCodegen**: First code generation from the spec. The agent is generating methods, tables, interfaces, manifest updates, and scenarios. This can take a while and involves heavy tool use. The user sees a full-screen build progress view.
  - **onboardingFinished**: The project is built and ready. Full development mode with all tools available. From here on, keep spec and code in sync as changes are made.
+ </project_onboarding>
+
+ {{static/instructions.md}}
 
  <!-- cache_breakpoint -->
 
- <current_project_onboarding_state>
+ <current_project_onboarding_state>
  ${onboardingState ?? "onboardingFinished"}
- </current_project_onboarding_state>
- </project_onboarding>
+ </current_project_onboarding_state>
+
+ <project_context>
+ ${projectContext}
+ </project_context>
 
  <view_context>
  The user is currently in ${viewContext?.mode ?? "code"} mode.
  ${viewContext?.activeFile ? `Active file: ${viewContext.activeFile}` : ""}
  </view_context>
+
+ ${loadPlanStatus()}
  `;
  return resolveIncludes(template);
  }
 
  // src/api.ts
- var log3 = createLogger("api");
+ var log2 = createLogger("api");
  async function* streamChat(params) {
  const { baseUrl: baseUrl2, apiKey, signal, requestId, ...body } = params;
  const url = `${baseUrl2}/_internal/v2/agent/remy/chat`;
  const startTime = Date.now();
  const subAgentId = body.subAgentId;
- log3.info("API request", {
+ log2.info("API request", {
  requestId,
  ...subAgentId && { subAgentId },
  model: body.model,
@@ -500,13 +434,13 @@ async function* streamChat(params) {
  });
  } catch (err) {
  if (signal?.aborted) {
- log3.warn("Request aborted", {
+ log2.warn("Request aborted", {
  requestId,
  ...subAgentId && { subAgentId }
  });
  throw err;
  }
- log3.error("Network error", {
+ log2.error("Network error", {
  requestId,
  ...subAgentId && { subAgentId },
  error: err.message
@@ -515,7 +449,7 @@ async function* streamChat(params) {
  return;
  }
  const ttfb = Date.now() - startTime;
- log3.info("API response", {
+ log2.info("API response", {
  requestId,
  ...subAgentId && { subAgentId },
  status: res.status,
@@ -533,7 +467,7 @@ async function* streamChat(params) {
  }
  } catch {
  }
- log3.error("API error", {
+ log2.error("API error", {
  requestId,
  ...subAgentId && { subAgentId },
  status: res.status,
@@ -546,6 +480,7 @@ async function* streamChat(params) {
  const reader = res.body.getReader();
  const decoder = new TextDecoder();
  let buffer = "";
+ let receivedDone = false;
  while (true) {
  let stallTimer;
  let readResult;
@@ -563,7 +498,7 @@ async function* streamChat(params) {
  } catch {
  clearTimeout(stallTimer);
  await reader.cancel();
- log3.error("Stream stalled", {
+ log2.error("Stream stalled", {
  requestId,
  ...subAgentId && { subAgentId },
  durationMs: Date.now() - startTime
@@ -589,7 +524,8 @@ async function* streamChat(params) {
  const event = JSON.parse(line.slice(6));
  if (event.type === "done") {
  const elapsed = Date.now() - startTime;
- log3.info("Stream complete", {
+ receivedDone = true;
+ log2.info("Stream complete", {
  requestId,
  ...subAgentId && { subAgentId },
  durationMs: elapsed,
@@ -597,12 +533,27 @@ async function* streamChat(params) {
  inputTokens: event.usage.inputTokens,
  outputTokens: event.usage.outputTokens
  });
+ } else if (event.type === "error") {
+ log2.error("SSE error event", {
+ requestId,
+ ...subAgentId && { subAgentId },
+ error: event.error,
+ durationMs: Date.now() - startTime
+ });
  }
  yield event;
  } catch {
  }
  }
  }
+ if (!receivedDone) {
+ log2.warn("Stream ended without done event", {
+ requestId,
+ ...subAgentId && { subAgentId },
+ durationMs: Date.now() - startTime,
+ remainingBuffer: buffer.slice(0, 200)
+ });
+ }
  if (buffer.startsWith("data: ")) {
  try {
  yield JSON.parse(buffer.slice(6));
@@ -639,7 +590,7 @@ async function* streamChatWithRetry(params, options) {
  return;
  }
  const backoff = INITIAL_BACKOFF_MS * 2 ** attempt;
- log3.warn("Retrying", {
+ log2.warn("Retrying", {
  requestId: params.requestId,
  attempt: attempt + 1,
  maxRetries: MAX_RETRIES,
@@ -681,7 +632,7 @@ async function generateBackgroundAck(params) {
  }
 
  // src/compaction/index.ts
- var log4 = createLogger("compaction");
+ var log3 = createLogger("compaction");
  var CONVERSATION_SUMMARY_PROMPT = readAsset("compaction", "conversation.md");
  var SUBAGENT_SUMMARY_PROMPT = readAsset("compaction", "subagent.md");
  var SUMMARIZABLE_SUBAGENTS = ["visualDesignExpert", "productVision"];
@@ -745,7 +696,7 @@ async function compactConversation(messages, apiConfig, system, tools2) {
  }
  ]
  }));
- log4.info("Compaction complete", { summaries: summaries.length });
+ log3.info("Compaction complete", { summaries: summaries.length });
  return checkpointMessages;
  }
  function findSafeInsertionPoint(messages) {
@@ -849,7 +800,7 @@ async function generateSummary(apiConfig, name, compactionPrompt, messagesToSumm
  if (!serialized.trim()) {
  return null;
  }
- log4.info("Generating summary", {
+ log3.info("Generating summary", {
  name,
  messageCount: messagesToSummarize.length,
  cacheReuse: !!mainSystem
@@ -875,15 +826,15 @@ ${serialized}` : serialized;
  if (event.type === "text") {
  summaryText += event.text;
  } else if (event.type === "error") {
- log4.error("Summary generation failed", { name, error: event.error });
+ log3.error("Summary generation failed", { name, error: event.error });
  return null;
  }
  }
  if (!summaryText.trim()) {
- log4.warn("Empty summary generated", { name });
+ log3.warn("Empty summary generated", { name });
  return null;
  }
- log4.info("Summary generated", { name, summaryLength: summaryText.length });
+ log3.info("Summary generated", { name, summaryLength: summaryText.length });
  return summaryText.trim();
  }
 
@@ -1770,7 +1721,7 @@ var compactConversationTool = {
  clearable: false,
  definition: {
  name: "compactConversation",
- description: "Compact the conversation history by summarizing older messages into a checkpoint. The summary preserves key decisions, what was built, and the current state of the project, but drops the verbose tool results, diffs, and intermediate steps that are no longer useful. Use this when you have just finished a large block of mechanical work (building, refactoring, debugging) and are about to shift back into conversational mode with the user. Runs in the background. Do not use after small changes like fixing a bug or editing copy.",
+ description: "Compact the conversation history by summarizing older messages into a checkpoint. The summary preserves key decisions, what was built, and the current state of the project, but drops the verbose tool results, diffs, and intermediate steps that are no longer useful. Runs in the background.",
  inputSchema: {
  type: "object",
  properties: {}
@@ -2439,6 +2390,50 @@ var editsFinishedTool = {
  }
  };
 
+ // src/tools/_helpers/sidecar.ts
+ var log4 = createLogger("sidecar");
+ var baseUrl = null;
+ function setSidecarBaseUrl(url) {
+ baseUrl = url;
+ log4.info("Configured", { url });
+ }
+ async function sidecarRequest(endpoint, body = {}, options) {
+ if (!baseUrl) {
+ throw new Error("Sidecar not available");
+ }
+ const url = `${baseUrl}${endpoint}`;
+ try {
+ const res = await fetch(url, {
+ method: "POST",
+ headers: { "Content-Type": "application/json" },
+ body: JSON.stringify(body),
+ signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
+ });
+ if (!res.ok) {
+ log4.error("Sidecar error", { endpoint, status: res.status });
+ throw new Error(`Sidecar error: ${res.status}`);
+ }
+ const data = await res.json();
+ if (data?.success === false) {
+ const code = data.errorCode ? ` [${data.errorCode}]` : "";
+ throw new Error(`${data.error || "Unknown error"}${code}`);
+ }
+ return data;
+ } catch (err) {
+ if (err.message.startsWith("Sidecar error")) {
+ throw err;
+ }
+ log4.error("Sidecar connection error", { endpoint, error: err.message });
+ throw new Error(`Sidecar connection error: ${err.message}`);
+ }
+ }
+
+ // src/tools/_helpers/lsp.ts
+ var setLspBaseUrl = setSidecarBaseUrl;
+ async function lspRequest(endpoint, body) {
+ return sidecarRequest(endpoint, body);
+ }
+
  // src/tools/code/lspDiagnostics.ts
  var lspDiagnosticsTool = {
  clearable: true,
@@ -6030,13 +6025,24 @@ function resolveAction(text) {
  }
  }
  let body = readAsset("automatedActions", `${triggerName}.md`);
+ let next;
+ const fmMatch = body.match(/^---\s*\n([\s\S]*?)\n---/);
+ if (fmMatch) {
+ const nextMatch = fmMatch[1].match(/^\s*next:\s*(\w+)\s*$/m);
+ if (nextMatch) {
+ next = nextMatch[1];
+ }
+ }
  body = body.replace(/^---[\s\S]*?---\s*/, "");
  for (const [key, value] of Object.entries(params)) {
  const str = typeof value === "string" ? value : JSON.stringify(value);
  body = body.replaceAll(`{{${key}}}`, str);
  }
- return `@@automated::${triggerName}@@
- ${body}`;
+ return {
+ message: `@@automated::${triggerName}@@
+ ${body}`,
+ next
+ };
  }
 
  // src/headless.ts
@@ -6098,6 +6104,7 @@ async function startHeadless(opts = {}) {
  let currentRequestId;
  let completedEmitted = false;
  let turnStart = 0;
+ let pendingNextAction;
  const EXTERNAL_TOOL_TIMEOUT_MS = 3e5;
  const pendingTools = /* @__PURE__ */ new Map();
  const earlyResults = /* @__PURE__ */ new Map();
@@ -6248,10 +6255,19 @@ ${xmlParts}
  applyPendingSummaries();
  applyPendingBlockUpdates();
  flushBackgroundQueue();
+ if (pendingNextAction) {
+ const next = pendingNextAction;
+ pendingNextAction = void 0;
+ handleMessage(
+ { action: "message", text: `@@automated::${next}@@` },
+ `chain-${Date.now()}`
+ );
+ }
  }, 0);
  return;
  case "turn_cancelled":
  completedEmitted = true;
+ pendingNextAction = void 0;
  emit("completed", { success: false, error: "cancelled" }, rid);
  return;
  // Streaming events — forward with requestId
@@ -6366,6 +6382,120 @@ ${xmlParts}
  }
  }
  toolRegistry.onEvent = onEvent;
+ const UPLOADS_DIR = "src/.user-uploads";
+ function filenameFromUrl(url) {
+ try {
+ const pathname = new URL(url).pathname;
+ const name = basename(pathname);
+ return name && name !== "/" ? decodeURIComponent(name) : `upload-${Date.now()}`;
+ } catch {
+ return `upload-${Date.now()}`;
+ }
+ }
+ function resolveUniqueFilename(name) {
+ if (!existsSync(join(UPLOADS_DIR, name))) {
+ return name;
+ }
+ const ext = extname(name);
+ const base = name.slice(0, name.length - ext.length);
+ let counter = 1;
+ while (existsSync(join(UPLOADS_DIR, `${base}-${counter}${ext}`))) {
+ counter++;
+ }
+ return `${base}-${counter}${ext}`;
+ }
+ const IMAGE_EXTENSIONS = /* @__PURE__ */ new Set([
+ ".png",
+ ".jpg",
+ ".jpeg",
+ ".gif",
+ ".webp",
+ ".svg",
+ ".bmp",
+ ".ico",
+ ".tiff",
+ ".tif",
+ ".avif",
+ ".heic",
+ ".heif"
+ ]);
+ function isImageAttachment(att) {
+ const name = att.filename || filenameFromUrl(att.url);
+ return IMAGE_EXTENSIONS.has(extname(name).toLowerCase());
+ }
+ async function persistAttachments(attachments) {
+ const nonVoice = attachments.filter((a) => !a.isVoice);
+ if (nonVoice.length === 0) {
+ return { documents: [], images: [] };
+ }
+ mkdirSync(UPLOADS_DIR, { recursive: true });
+ const results = await Promise.allSettled(
+ nonVoice.map(async (att) => {
+ const name = resolveUniqueFilename(
+ att.filename || filenameFromUrl(att.url)
+ );
+ const localPath = join(UPLOADS_DIR, name);
+ const res = await fetch(att.url, {
+ signal: AbortSignal.timeout(3e4)
+ });
+ if (!res.ok) {
+ throw new Error(`HTTP ${res.status} downloading ${att.url}`);
+ }
+ const buffer = Buffer.from(await res.arrayBuffer());
+ await writeFile(localPath, buffer);
+ log11.info("Attachment saved", {
+ filename: name,
+ path: localPath,
+ bytes: buffer.length
+ });
+ let extractedTextPath;
+ if (att.extractedTextUrl) {
+ try {
+ const textRes = await fetch(att.extractedTextUrl, {
+ signal: AbortSignal.timeout(3e4)
+ });
+ if (textRes.ok) {
+ extractedTextPath = `${localPath}.txt`;
+ await writeFile(extractedTextPath, await textRes.text(), "utf-8");
+ log11.info("Extracted text saved", { path: extractedTextPath });
+ }
+ } catch {
+ }
+ }
+ return { filename: name, localPath, extractedTextPath };
+ })
+ );
+ const settled = results.map((r, i) => ({
+ result: r.status === "fulfilled" ? r.value : null,
+ isImage: isImageAttachment(nonVoice[i])
+ }));
+ return {
+ documents: settled.filter((s) => !s.isImage).map((s) => s.result),
+ images: settled.filter((s) => s.isImage).map((s) => s.result)
+ };
+ }
+ function buildUploadHeader(results) {
+ const succeeded = results.filter(Boolean);
+ if (succeeded.length === 0) {
+ return "";
+ }
+ if (succeeded.length === 1) {
+ const r = succeeded[0];
+ const parts = [`[Uploaded file: ${r.localPath}`];
+ if (r.extractedTextPath) {
+ parts.push(`extracted text: ${r.extractedTextPath}`);
+ }
+ return parts.join(" \u2014 ") + "]";
+ }
+ const lines = succeeded.map((r) => {
+ if (r.extractedTextPath) {
+ return `- ${r.localPath} (extracted text: ${r.extractedTextPath})`;
+ }
+ return `- ${r.localPath}`;
+ });
+ return `[Uploaded files]
+ ${lines.join("\n")}`;
+ }
  async function handleMessage(parsed, requestId) {
  if (running) {
  emit(
@@ -6387,12 +6517,26 @@ ${xmlParts}
  turnStart = Date.now();
  const attachments = parsed.attachments;
  if (attachments?.length) {
- console.warn(
- `[headless] Message has ${attachments.length} attachment(s):`,
- attachments.map((a) => a.url)
- );
+ log11.info("Message has attachments", {
+ count: attachments.length,
+ urls: attachments.map((a) => a.url)
+ });
  }
  let userMessage = parsed.text ?? "";
+ if (attachments?.some((a) => !a.isVoice)) {
+ try {
+ const { documents, images } = await persistAttachments(attachments);
+ const all = [...documents, ...images];
+ const header = buildUploadHeader(all);
+ if (header) {
+ userMessage = userMessage ? `${header}
+
+ ${userMessage}` : header;
+ }
+ } catch (err) {
+ log11.warn("Attachment persistence failed", { error: err.message });
+ }
+ }
  let resolved = null;
  try {
  resolved = resolveAction(userMessage);
@@ -6404,8 +6548,10 @@ ${xmlParts}
  );
  return;
  }
+ pendingNextAction = void 0;
  if (resolved !== null) {
- userMessage = resolved;
+ userMessage = resolved.message;
+ pendingNextAction = resolved.next;
  }
  const isHidden = resolved !== null || !!parsed.hidden;
  const rawText = parsed.text ?? "";
@@ -6570,6 +6716,7 @@ ${xmlParts}
  },
  onFinally: () => {
  sessionStats.compactionInProgress = false;
+ sessionStats.lastContextSize = 0;
  sessionStats.messageCount = state.messages.length;
  sessionStats.updatedAt = Date.now();
  try {
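The attachment-persistence code added above dedupes saved filenames by appending a numeric suffix before the extension. A rough standalone sketch of that logic, with the real `existsSync` checks against `src/.user-uploads` replaced by a plain `Set` of already-used names so it runs without touching the filesystem (`uniqueName` is a hypothetical name for this sketch):

```typescript
import { extname } from "path";

// Sketch of resolveUniqueFilename from the diff above, filesystem-free:
// return the name unchanged if free, otherwise append -1, -2, ... before
// the extension until an unused name is found.
function uniqueName(name: string, taken: Set<string>): string {
  if (!taken.has(name)) return name;
  const ext = extname(name); // e.g. ".png"
  const base = name.slice(0, name.length - ext.length); // e.g. "logo"
  let counter = 1;
  while (taken.has(`${base}-${counter}${ext}`)) counter++;
  return `${base}-${counter}${ext}`;
}

const taken = new Set(["logo.png", "logo-1.png"]);
console.log(uniqueName("logo.png", taken)); // "logo-2.png"
console.log(uniqueName("spec.md", taken)); // "spec.md"
```

The same pattern explains why a second upload of `logo.png` lands at `src/.user-uploads/logo-1.png` rather than overwriting the first.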
package/dist/index.js CHANGED
@@ -156,6 +156,7 @@ async function* streamChat(params) {
156
156
  const reader = res.body.getReader();
157
157
  const decoder = new TextDecoder();
158
158
  let buffer = "";
159
+ let receivedDone = false;
159
160
  while (true) {
160
161
  let stallTimer;
161
162
  let readResult;
@@ -199,6 +200,7 @@ async function* streamChat(params) {
199
200
  const event = JSON.parse(line.slice(6));
200
201
  if (event.type === "done") {
201
202
  const elapsed = Date.now() - startTime;
203
+ receivedDone = true;
202
204
  log.info("Stream complete", {
203
205
  requestId,
204
206
  ...subAgentId && { subAgentId },
@@ -207,12 +209,27 @@ async function* streamChat(params) {
207
209
  inputTokens: event.usage.inputTokens,
208
210
  outputTokens: event.usage.outputTokens
209
211
  });
212
+ } else if (event.type === "error") {
213
+ log.error("SSE error event", {
214
+ requestId,
215
+ ...subAgentId && { subAgentId },
216
+ error: event.error,
217
+ durationMs: Date.now() - startTime
218
+ });
210
219
  }
211
220
  yield event;
212
221
  } catch {
213
222
  }
214
223
  }
215
224
  }
225
+ if (!receivedDone) {
226
+ log.warn("Stream ended without done event", {
227
+ requestId,
228
+ ...subAgentId && { subAgentId },
229
+ durationMs: Date.now() - startTime,
230
+ remainingBuffer: buffer.slice(0, 200)
231
+ });
232
+ }
216
233
  if (buffer.startsWith("data: ")) {
217
234
  try {
218
235
  yield JSON.parse(buffer.slice(6));
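The hunk above adds a `receivedDone` flag so a stream that closes without a terminal `done` event gets logged instead of silently ignored. A minimal sketch of the same pattern over a captured SSE buffer — the event shape (`{ type: "done" | "error" | … }`) is taken from the diff; the function name and everything else is illustrative:

```javascript
// Sketch: parse SSE "data:" lines and track whether a terminal "done"
// event arrived, mirroring the receivedDone bookkeeping in the diff.
function parseSseChunk(chunk) {
  const events = [];
  let receivedDone = false;
  for (const line of chunk.split("\n")) {
    if (!line.startsWith("data: ")) continue;
    try {
      const event = JSON.parse(line.slice(6));
      if (event.type === "done") receivedDone = true;
      events.push(event);
    } catch {
      // Malformed frames are skipped, as the bundled code does.
    }
  }
  return { events, receivedDone };
}
```

A caller that sees `receivedDone === false` after the reader closes would emit the "Stream ended without done event" warning shown above.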
@@ -1541,85 +1558,9 @@ var init_compaction = __esm({
1541
1558
  }
1542
1559
  });
1543
1560
 
1544
- // src/tools/_helpers/sidecar.ts
1545
- function setSidecarBaseUrl(url) {
1546
- baseUrl = url;
1547
- log3.info("Configured", { url });
1548
- }
1549
- function isSidecarConfigured() {
1550
- return baseUrl !== null;
1551
- }
1552
- async function sidecarRequest(endpoint, body = {}, options) {
1553
- if (!baseUrl) {
1554
- throw new Error("Sidecar not available");
1555
- }
1556
- const url = `${baseUrl}${endpoint}`;
1557
- try {
1558
- const res = await fetch(url, {
1559
- method: "POST",
1560
- headers: { "Content-Type": "application/json" },
1561
- body: JSON.stringify(body),
1562
- signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
1563
- });
1564
- if (!res.ok) {
1565
- log3.error("Sidecar error", { endpoint, status: res.status });
1566
- throw new Error(`Sidecar error: ${res.status}`);
1567
- }
1568
- const data = await res.json();
1569
- if (data?.success === false) {
1570
- const code = data.errorCode ? ` [${data.errorCode}]` : "";
1571
- throw new Error(`${data.error || "Unknown error"}${code}`);
1572
- }
1573
- return data;
1574
- } catch (err) {
1575
- if (err.message.startsWith("Sidecar error")) {
1576
- throw err;
1577
- }
1578
- log3.error("Sidecar connection error", { endpoint, error: err.message });
1579
- throw new Error(`Sidecar connection error: ${err.message}`);
1580
- }
1581
- }
1582
- var log3, baseUrl;
1583
- var init_sidecar = __esm({
1584
- "src/tools/_helpers/sidecar.ts"() {
1585
- "use strict";
1586
- init_logger();
1587
- log3 = createLogger("sidecar");
1588
- baseUrl = null;
1589
- }
1590
- });
1591
-
1592
- // src/tools/_helpers/lsp.ts
1593
- async function lspRequest(endpoint, body) {
1594
- return sidecarRequest(endpoint, body);
1595
- }
1596
- var setLspBaseUrl, isLspConfigured;
1597
- var init_lsp = __esm({
1598
- "src/tools/_helpers/lsp.ts"() {
1599
- "use strict";
1600
- init_sidecar();
1601
- setLspBaseUrl = setSidecarBaseUrl;
1602
- isLspConfigured = isSidecarConfigured;
1603
- }
1604
- });
1605
-
1606
1561
  // src/prompt/static/projectContext.ts
1607
1562
  import fs9 from "fs";
1608
1563
  import path4 from "path";
1609
- function loadProjectInstructions() {
1610
- for (const file of AGENT_INSTRUCTION_FILES) {
1611
- try {
1612
- const content = fs9.readFileSync(file, "utf-8").trim();
1613
- if (content) {
1614
- return `
1615
- ## Project Instructions (${file})
1616
- ${content}`;
1617
- }
1618
- } catch {
1619
- }
1620
- }
1621
- return "";
1622
- }
1623
1564
  function loadProjectManifest() {
1624
1565
  try {
1625
1566
  const manifest = fs9.readFileSync("mindstudio.json", "utf-8");
@@ -1735,26 +1676,9 @@ ${listing}
1735
1676
  return "";
1736
1677
  }
1737
1678
  }
1738
- var AGENT_INSTRUCTION_FILES;
1739
1679
  var init_projectContext = __esm({
1740
1680
  "src/prompt/static/projectContext.ts"() {
1741
1681
  "use strict";
1742
- AGENT_INSTRUCTION_FILES = [
1743
- "CLAUDE.md",
1744
- "claude.md",
1745
- ".claude/instructions.md",
1746
- "AGENTS.md",
1747
- "agents.md",
1748
- ".agents.md",
1749
- "COPILOT.md",
1750
- "copilot.md",
1751
- ".copilot-instructions.md",
1752
- ".github/copilot-instructions.md",
1753
- "REMY.md",
1754
- "remy.md",
1755
- ".cursorrules",
1756
- ".cursorules"
1757
- ];
1758
1682
  }
1759
1683
  });
1760
1684
 
@@ -1768,7 +1692,6 @@ function resolveIncludes(template) {
1768
1692
  }
1769
1693
  function buildSystemPrompt(onboardingState, viewContext) {
1770
1694
  const projectContext = [
1771
- loadProjectInstructions(),
1772
1695
  loadProjectManifest(),
1773
1696
  loadSpecFileMetadata(),
1774
1697
  loadProjectFileListing()
@@ -1843,29 +1766,26 @@ Current date: ${now}
1843
1766
  {{compiled/msfm.md}}
1844
1767
  </mindstudio_flavored_markdown_spec_docs>
1845
1768
 
1846
- <project_context>
1847
- ${projectContext}
1848
- </project_context>
1849
-
1850
1769
  <intake_mode_instructions>
1851
- {{static/intake.md}}
1770
+ {{static/intake.md}}
1852
1771
  </intake_mode_instructions>
1853
1772
 
1854
1773
  <spec_authoring_instructions>
1855
- {{static/authoring.md}}
1774
+ {{static/authoring.md}}
1856
1775
  </spec_authoring_instructions>
1857
1776
 
1858
- {{static/team.md}}
1777
+ <team>
1778
+ {{static/team.md}}
1779
+ </team>
1859
1780
 
1860
1781
  <code_authoring_instructions>
1861
1782
  {{static/coding.md}}
1862
- ${isLspConfigured() ? `<typescript_lsp>
1783
+
1784
+ <typescript_lsp>
1863
1785
  {{static/lsp.md}}
1864
- </typescript_lsp>` : ""}
1786
+ </typescript_lsp>
1865
1787
  </code_authoring_instructions>
1866
1788
 
1867
- {{static/instructions.md}}
1868
- ${loadPlanStatus()}
1869
1789
  <conversation_summaries>
1870
1790
  Your conversation history may include <prior_conversation_summary> blocks in the user's messages. These are automated summaries of earlier messages that have been compacted to save context space. The user does not see this summary, they see the full conversation history in their UI. Treat the summary as ground truth for what happened before, but do not reference it directly to the user ("as mentioned in the summary..."). Just continue naturally as if you remember the prior work.
1871
1791
 
@@ -1879,18 +1799,26 @@ New projects progress through four onboarding states. The user might skip this e
1879
1799
  - **initialSpecAuthoring**: Writing and refining the first spec. The user can see it in the editor as it streams in and can give feedback to iterate on it. This phase covers both the initial draft and any back-and-forth refinement before code generation.
1880
1800
  - **initialCodegen**: First code generation from the spec. The agent is generating methods, tables, interfaces, manifest updates, and scenarios. This can take a while and involves heavy tool use. The user sees a full-screen build progress view.
1881
1801
  - **onboardingFinished**: The project is built and ready. Full development mode with all tools available. From here on, keep spec and code in sync as changes are made.
1802
+ </project_onboarding>
1803
+
1804
+ {{static/instructions.md}}
1882
1805
 
1883
1806
  <!-- cache_breakpoint -->
1884
1807
 
1885
- <current_project_onboarding_state>
1808
+ <current_project_onboarding_state>
1886
1809
  ${onboardingState ?? "onboardingFinished"}
1887
- </current_project_onboarding_state>
1888
- </project_onboarding>
1810
+ </current_project_onboarding_state>
1811
+
1812
+ <project_context>
1813
+ ${projectContext}
1814
+ </project_context>
1889
1815
 
1890
1816
  <view_context>
1891
1817
  The user is currently in ${viewContext?.mode ?? "code"} mode.
1892
1818
  ${viewContext?.activeFile ? `Active file: ${viewContext.activeFile}` : ""}
1893
1819
  </view_context>
1820
+
1821
+ ${loadPlanStatus()}
1894
1822
  `;
1895
1823
  return resolveIncludes(template);
1896
1824
  }
@@ -1898,7 +1826,6 @@ var init_prompt = __esm({
1898
1826
  "src/prompt/index.ts"() {
1899
1827
  "use strict";
1900
1828
  init_assets();
1901
- init_lsp();
1902
1829
  init_projectContext();
1903
1830
  }
1904
1831
  });
@@ -1914,15 +1841,15 @@ function triggerCompaction(state, apiConfig, callbacks) {
1914
1841
  compactConversation(state.messages, apiConfig, system, tools2).then((summaries) => {
1915
1842
  pendingSummaries.push(...summaries);
1916
1843
  callbacks?.onSummariesReady?.();
1917
- log4.info("Compaction complete");
1844
+ log3.info("Compaction complete");
1918
1845
  }).catch((err) => {
1919
1846
  callbacks?.onError?.(err.message || "Compaction failed");
1920
- log4.error("Compaction failed", { error: err.message });
1847
+ log3.error("Compaction failed", { error: err.message });
1921
1848
  }).finally(() => {
1922
1849
  callbacks?.onFinally?.();
1923
1850
  });
1924
1851
  }
1925
- var log4, pendingSummaries;
1852
+ var log3, pendingSummaries;
1926
1853
  var init_trigger = __esm({
1927
1854
  "src/compaction/trigger.ts"() {
1928
1855
  "use strict";
@@ -1930,7 +1857,7 @@ var init_trigger = __esm({
1930
1857
  init_prompt();
1931
1858
  init_tools6();
1932
1859
  init_logger();
1933
- log4 = createLogger("compaction:trigger");
1860
+ log3 = createLogger("compaction:trigger");
1934
1861
  pendingSummaries = [];
1935
1862
  }
1936
1863
  });
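`triggerCompaction` above fires the summarization promise in the background and routes outcomes through optional callbacks rather than awaiting it. A generic sketch of that fire-and-forget shape, with illustrative names (`runInBackground`, `onReady`) standing in for the package's internals:

```javascript
// Sketch: run a task in the background and surface results via optional
// callbacks, mirroring the onSummariesReady / onError / onFinally shape above.
function runInBackground(task, callbacks) {
  task()
    .then((result) => callbacks?.onReady?.(result))
    .catch((err) => callbacks?.onError?.(err.message || "task failed"))
    .finally(() => callbacks?.onFinally?.());
  // No return value: the caller continues immediately while work proceeds.
}
```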
@@ -1945,7 +1872,7 @@ var init_compactConversation = __esm({
1945
1872
  clearable: false,
1946
1873
  definition: {
1947
1874
  name: "compactConversation",
1948
- description: "Compact the conversation history by summarizing older messages into a checkpoint. The summary preserves key decisions, what was built, and the current state of the project, but drops the verbose tool results, diffs, and intermediate steps that are no longer useful. Use this when you have just finished a large block of mechanical work (building, refactoring, debugging) and are about to shift back into conversational mode with the user. Runs in the background. Do not use after small changes like fixing a bug or editing copy.",
1875
+ description: "Compact the conversation history by summarizing older messages into a checkpoint. The summary preserves key decisions, what was built, and the current state of the project, but drops the verbose tool results, diffs, and intermediate steps that are no longer useful. Runs in the background.",
1949
1876
  inputSchema: {
1950
1877
  type: "object",
1951
1878
  properties: {}
@@ -2672,6 +2599,64 @@ var init_editsFinished = __esm({
2672
2599
  }
2673
2600
  });
2674
2601
 
2602
+ // src/tools/_helpers/sidecar.ts
2603
+ function setSidecarBaseUrl(url) {
2604
+ baseUrl = url;
2605
+ log4.info("Configured", { url });
2606
+ }
2607
+ async function sidecarRequest(endpoint, body = {}, options) {
2608
+ if (!baseUrl) {
2609
+ throw new Error("Sidecar not available");
2610
+ }
2611
+ const url = `${baseUrl}${endpoint}`;
2612
+ try {
2613
+ const res = await fetch(url, {
2614
+ method: "POST",
2615
+ headers: { "Content-Type": "application/json" },
2616
+ body: JSON.stringify(body),
2617
+ signal: options?.timeout ? AbortSignal.timeout(options.timeout) : void 0
2618
+ });
2619
+ if (!res.ok) {
2620
+ log4.error("Sidecar error", { endpoint, status: res.status });
2621
+ throw new Error(`Sidecar error: ${res.status}`);
2622
+ }
2623
+ const data = await res.json();
2624
+ if (data?.success === false) {
2625
+ const code = data.errorCode ? ` [${data.errorCode}]` : "";
2626
+ throw new Error(`${data.error || "Unknown error"}${code}`);
2627
+ }
2628
+ return data;
2629
+ } catch (err) {
2630
+ if (err.message.startsWith("Sidecar error")) {
2631
+ throw err;
2632
+ }
2633
+ log4.error("Sidecar connection error", { endpoint, error: err.message });
2634
+ throw new Error(`Sidecar connection error: ${err.message}`);
2635
+ }
2636
+ }
2637
+ var log4, baseUrl;
2638
+ var init_sidecar = __esm({
2639
+ "src/tools/_helpers/sidecar.ts"() {
2640
+ "use strict";
2641
+ init_logger();
2642
+ log4 = createLogger("sidecar");
2643
+ baseUrl = null;
2644
+ }
2645
+ });
2646
+
2647
+ // src/tools/_helpers/lsp.ts
2648
+ async function lspRequest(endpoint, body) {
2649
+ return sidecarRequest(endpoint, body);
2650
+ }
2651
+ var setLspBaseUrl;
2652
+ var init_lsp = __esm({
2653
+ "src/tools/_helpers/lsp.ts"() {
2654
+ "use strict";
2655
+ init_sidecar();
2656
+ setLspBaseUrl = setSidecarBaseUrl;
2657
+ }
2658
+ });
2659
+
2675
2660
  // src/tools/code/lspDiagnostics.ts
2676
2661
  var lspDiagnosticsTool;
2677
2662
  var init_lspDiagnostics = __esm({
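The relocated sidecar helper above posts JSON with an optional `AbortSignal.timeout` and converts `{ success: false }` payloads into thrown errors. The response-shaping half sketched on its own, so it can run without a live sidecar (the standalone function name is illustrative):

```javascript
// Sketch: normalize a sidecar-style JSON payload the way sidecarRequest does:
// { success: false, error, errorCode } becomes a thrown Error with " [CODE]".
function unwrapSidecarPayload(data) {
  if (data?.success === false) {
    const code = data.errorCode ? ` [${data.errorCode}]` : "";
    throw new Error(`${data.error || "Unknown error"}${code}`);
  }
  return data;
}
```

On the network side, the diff passes `signal: AbortSignal.timeout(options.timeout)` to `fetch` so a slow sidecar request aborts rather than hanging.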
@@ -6701,13 +6686,24 @@ function resolveAction(text) {
6701
6686
  }
6702
6687
  }
6703
6688
  let body = readAsset("automatedActions", `${triggerName}.md`);
6689
+ let next;
6690
+ const fmMatch = body.match(/^---\s*\n([\s\S]*?)\n---/);
6691
+ if (fmMatch) {
6692
+ const nextMatch = fmMatch[1].match(/^\s*next:\s*(\w+)\s*$/m);
6693
+ if (nextMatch) {
6694
+ next = nextMatch[1];
6695
+ }
6696
+ }
6704
6697
  body = body.replace(/^---[\s\S]*?---\s*/, "");
6705
6698
  for (const [key, value] of Object.entries(params)) {
6706
6699
  const str = typeof value === "string" ? value : JSON.stringify(value);
6707
6700
  body = body.replaceAll(`{{${key}}}`, str);
6708
6701
  }
6709
- return `@@automated::${triggerName}@@
6710
- ${body}`;
6702
+ return {
6703
+ message: `@@automated::${triggerName}@@
6704
+ ${body}`,
6705
+ next
6706
+ };
6711
6707
  }
6712
6708
  var NON_ACTION_SENTINELS;
6713
6709
  var init_resolve = __esm({
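`resolveAction` now reads an optional `next:` key out of the action file's frontmatter before stripping it — this is how the `next: postBuildPolish` line added to the trigger file's frontmatter chains the build action into the polish action. A standalone sketch of that extraction, using the same regexes as the diff (the wrapper function name is illustrative):

```javascript
// Sketch: pull `next:` out of "---"-delimited frontmatter and return the
// stripped body, with the same regexes the bundled resolveAction uses.
function extractNextAction(raw) {
  let next;
  const fmMatch = raw.match(/^---\s*\n([\s\S]*?)\n---/);
  if (fmMatch) {
    const nextMatch = fmMatch[1].match(/^\s*next:\s*(\w+)\s*$/m);
    if (nextMatch) next = nextMatch[1];
  }
  return { next, body: raw.replace(/^---[\s\S]*?---\s*/, "") };
}
```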
@@ -6724,7 +6720,15 @@ __export(headless_exports, {
6724
6720
  startHeadless: () => startHeadless
6725
6721
  });
6726
6722
  import { createInterface } from "readline";
6727
- import { writeFileSync, readFileSync, unlinkSync } from "fs";
6723
+ import {
6724
+ writeFileSync,
6725
+ readFileSync,
6726
+ unlinkSync,
6727
+ mkdirSync,
6728
+ existsSync
6729
+ } from "fs";
6730
+ import { writeFile } from "fs/promises";
6731
+ import { basename, join, extname } from "path";
6728
6732
  function emit(event, data, requestId) {
6729
6733
  const payload = { event, ...data };
6730
6734
  if (requestId) {
@@ -6782,6 +6786,7 @@ async function startHeadless(opts = {}) {
6782
6786
  let currentRequestId;
6783
6787
  let completedEmitted = false;
6784
6788
  let turnStart = 0;
6789
+ let pendingNextAction;
6785
6790
  const EXTERNAL_TOOL_TIMEOUT_MS = 3e5;
6786
6791
  const pendingTools = /* @__PURE__ */ new Map();
6787
6792
  const earlyResults = /* @__PURE__ */ new Map();
@@ -6932,10 +6937,19 @@ ${xmlParts}
6932
6937
  applyPendingSummaries();
6933
6938
  applyPendingBlockUpdates();
6934
6939
  flushBackgroundQueue();
6940
+ if (pendingNextAction) {
6941
+ const next = pendingNextAction;
6942
+ pendingNextAction = void 0;
6943
+ handleMessage(
6944
+ { action: "message", text: `@@automated::${next}@@` },
6945
+ `chain-${Date.now()}`
6946
+ );
6947
+ }
6935
6948
  }, 0);
6936
6949
  return;
6937
6950
  case "turn_cancelled":
6938
6951
  completedEmitted = true;
6952
+ pendingNextAction = void 0;
6939
6953
  emit("completed", { success: false, error: "cancelled" }, rid);
6940
6954
  return;
6941
6955
  // Streaming events — forward with requestId
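When a turn completes, the handler above consumes `pendingNextAction` and re-dispatches a synthetic `@@automated::<name>@@` message, clearing the field before dispatch so a cancelled or re-entered turn cannot replay it. A tiny sketch of that consume-then-dispatch step (`dispatch` stands in for `handleMessage`; the state object is illustrative):

```javascript
// Sketch: consume a pending chained action exactly once, mirroring the
// turn_complete branch in the diff. `dispatch` stands in for handleMessage.
function drainPendingAction(state, dispatch) {
  if (!state.pendingNextAction) return false;
  const next = state.pendingNextAction;
  state.pendingNextAction = undefined; // clear first: no replay on re-entry
  dispatch(
    { action: "message", text: `@@automated::${next}@@` },
    `chain-${Date.now()}`
  );
  return true;
}
```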
@@ -7050,6 +7064,120 @@ ${xmlParts}
7050
7064
  }
7051
7065
  }
7052
7066
  toolRegistry.onEvent = onEvent;
7067
+ const UPLOADS_DIR = "src/.user-uploads";
7068
+ function filenameFromUrl(url) {
7069
+ try {
7070
+ const pathname = new URL(url).pathname;
7071
+ const name = basename(pathname);
7072
+ return name && name !== "/" ? decodeURIComponent(name) : `upload-${Date.now()}`;
7073
+ } catch {
7074
+ return `upload-${Date.now()}`;
7075
+ }
7076
+ }
7077
+ function resolveUniqueFilename(name) {
7078
+ if (!existsSync(join(UPLOADS_DIR, name))) {
7079
+ return name;
7080
+ }
7081
+ const ext = extname(name);
7082
+ const base = name.slice(0, name.length - ext.length);
7083
+ let counter = 1;
7084
+ while (existsSync(join(UPLOADS_DIR, `${base}-${counter}${ext}`))) {
7085
+ counter++;
7086
+ }
7087
+ return `${base}-${counter}${ext}`;
7088
+ }
7089
+ const IMAGE_EXTENSIONS = /* @__PURE__ */ new Set([
7090
+ ".png",
7091
+ ".jpg",
7092
+ ".jpeg",
7093
+ ".gif",
7094
+ ".webp",
7095
+ ".svg",
7096
+ ".bmp",
7097
+ ".ico",
7098
+ ".tiff",
7099
+ ".tif",
7100
+ ".avif",
7101
+ ".heic",
7102
+ ".heif"
7103
+ ]);
7104
+ function isImageAttachment(att) {
7105
+ const name = att.filename || filenameFromUrl(att.url);
7106
+ return IMAGE_EXTENSIONS.has(extname(name).toLowerCase());
7107
+ }
7108
+ async function persistAttachments(attachments) {
7109
+ const nonVoice = attachments.filter((a) => !a.isVoice);
7110
+ if (nonVoice.length === 0) {
7111
+ return { documents: [], images: [] };
7112
+ }
7113
+ mkdirSync(UPLOADS_DIR, { recursive: true });
7114
+ const results = await Promise.allSettled(
7115
+ nonVoice.map(async (att) => {
7116
+ const name = resolveUniqueFilename(
7117
+ att.filename || filenameFromUrl(att.url)
7118
+ );
7119
+ const localPath = join(UPLOADS_DIR, name);
7120
+ const res = await fetch(att.url, {
7121
+ signal: AbortSignal.timeout(3e4)
7122
+ });
7123
+ if (!res.ok) {
7124
+ throw new Error(`HTTP ${res.status} downloading ${att.url}`);
7125
+ }
7126
+ const buffer = Buffer.from(await res.arrayBuffer());
7127
+ await writeFile(localPath, buffer);
7128
+ log11.info("Attachment saved", {
7129
+ filename: name,
7130
+ path: localPath,
7131
+ bytes: buffer.length
7132
+ });
7133
+ let extractedTextPath;
7134
+ if (att.extractedTextUrl) {
7135
+ try {
7136
+ const textRes = await fetch(att.extractedTextUrl, {
7137
+ signal: AbortSignal.timeout(3e4)
7138
+ });
7139
+ if (textRes.ok) {
7140
+ extractedTextPath = `${localPath}.txt`;
7141
+ await writeFile(extractedTextPath, await textRes.text(), "utf-8");
7142
+ log11.info("Extracted text saved", { path: extractedTextPath });
7143
+ }
7144
+ } catch {
7145
+ }
7146
+ }
7147
+ return { filename: name, localPath, extractedTextPath };
7148
+ })
7149
+ );
7150
+ const settled = results.map((r, i) => ({
7151
+ result: r.status === "fulfilled" ? r.value : null,
7152
+ isImage: isImageAttachment(nonVoice[i])
7153
+ }));
7154
+ return {
7155
+ documents: settled.filter((s) => !s.isImage).map((s) => s.result),
7156
+ images: settled.filter((s) => s.isImage).map((s) => s.result)
7157
+ };
7158
+ }
7159
+ function buildUploadHeader(results) {
7160
+ const succeeded = results.filter(Boolean);
7161
+ if (succeeded.length === 0) {
7162
+ return "";
7163
+ }
7164
+ if (succeeded.length === 1) {
7165
+ const r = succeeded[0];
7166
+ const parts = [`[Uploaded file: ${r.localPath}`];
7167
+ if (r.extractedTextPath) {
7168
+ parts.push(`extracted text: ${r.extractedTextPath}`);
7169
+ }
7170
+ return parts.join(" \u2014 ") + "]";
7171
+ }
7172
+ const lines = succeeded.map((r) => {
7173
+ if (r.extractedTextPath) {
7174
+ return `- ${r.localPath} (extracted text: ${r.extractedTextPath})`;
7175
+ }
7176
+ return `- ${r.localPath}`;
7177
+ });
7178
+ return `[Uploaded files]
7179
+ ${lines.join("\n")}`;
7180
+ }
7053
7181
  async function handleMessage(parsed, requestId) {
7054
7182
  if (running) {
7055
7183
  emit(
@@ -7071,12 +7199,26 @@ ${xmlParts}
7071
7199
  turnStart = Date.now();
7072
7200
  const attachments = parsed.attachments;
7073
7201
  if (attachments?.length) {
7074
- console.warn(
7075
- `[headless] Message has ${attachments.length} attachment(s):`,
7076
- attachments.map((a) => a.url)
7077
- );
7202
+ log11.info("Message has attachments", {
7203
+ count: attachments.length,
7204
+ urls: attachments.map((a) => a.url)
7205
+ });
7078
7206
  }
7079
7207
  let userMessage = parsed.text ?? "";
7208
+ if (attachments?.some((a) => !a.isVoice)) {
7209
+ try {
7210
+ const { documents, images } = await persistAttachments(attachments);
7211
+ const all = [...documents, ...images];
7212
+ const header = buildUploadHeader(all);
7213
+ if (header) {
7214
+ userMessage = userMessage ? `${header}
7215
+
7216
+ ${userMessage}` : header;
7217
+ }
7218
+ } catch (err) {
7219
+ log11.warn("Attachment persistence failed", { error: err.message });
7220
+ }
7221
+ }
7080
7222
  let resolved = null;
7081
7223
  try {
7082
7224
  resolved = resolveAction(userMessage);
@@ -7088,8 +7230,10 @@ ${xmlParts}
7088
7230
  );
7089
7231
  return;
7090
7232
  }
7233
+ pendingNextAction = void 0;
7091
7234
  if (resolved !== null) {
7092
- userMessage = resolved;
7235
+ userMessage = resolved.message;
7236
+ pendingNextAction = resolved.next;
7093
7237
  }
7094
7238
  const isHidden = resolved !== null || !!parsed.hidden;
7095
7239
  const rawText = parsed.text ?? "";
@@ -7254,6 +7398,7 @@ ${xmlParts}
7254
7398
  },
7255
7399
  onFinally: () => {
7256
7400
  sessionStats.compactionInProgress = false;
7401
+ sessionStats.lastContextSize = 0;
7257
7402
  sessionStats.messageCount = state.messages.length;
7258
7403
  sessionStats.updatedAt = Date.now();
7259
7404
  try {
@@ -35,7 +35,7 @@ box-shadow: 0 8px 32px rgba(0,0,0,0.3) for floating depth
35
35
  ~~~
36
36
  ```
37
37
 
38
- When you have image URLs (from the design expert), embed them directly in the spec using markdown image syntax. Write descriptive alt text that captures what the image actually depicts (this helps accessibility and helps the coding agent understand the image without loading it). Use the surrounding prose to explain the design intent — what the image is for, how it should be used in the layout, and why it was chosen.
38
+ When you have image URLs (from the design expert), embed them directly in the spec using markdown image syntax. Write descriptive alt text that captures what the image actually depicts (this helps accessibility and helps the coding agent understand the image without loading it). Use the surrounding prose to explain the design intent — what the image is for, how it should be used in the layout, and why it was chosen. User-uploaded files (images, documents, reference materials) are saved to `src/.user-uploads/` and can be referenced from specs using their disk path.
39
39
 
40
40
  When the design expert provides wireframes, include them directly in the spec for future reference.
41
41
 
@@ -28,7 +28,7 @@ The user can already see your tool calls, so most of your work is visible withou
28
28
  Skip the rest: narrating what you're about to do, restating what the user asked, explaining tool calls they can already see.
29
29
 
30
30
  ### User attachments
31
- User messages may include uploaded documents (PDFs, Word docs, etc.) as XML blocks prepended to the message content (e.g., `<user_uploaded_document_1>`). These are inline in the conversation history, not files in the project directory. When a user says "here is the document" or "use this document," the document content is in that same message. Do not ask the user to re-share a document that is already in the conversation.
31
+ When a user uploads a file (PDF, Word doc, image, etc.), it is automatically saved to `src/.user-uploads/` in the project directory. The message includes the file path and, for documents with extractable text, a `.txt` sidecar with the extracted content. Use `readFile` on the sidecar to access document contents. The raw binary is also on disk at the indicated path. Uploaded images can be referenced in specs and code by their disk path (e.g., `![logo](src/.user-uploads/logo.png)`). These files persist across the conversation — they survive compaction and session restarts. Do not ask the user to re-upload a document that has already been saved. Voice messages are not saved to disk — their transcripts appear inline in the message.
32
32
 
33
33
  ### Automated messages
34
34
  You will occasionally receive automated messages prefixed with `@@automated_message@@` - these are triggered by things like background agents returning their work, or by the user clicking a button in the UI (e.g., the user might click a "Build Feature" button in the product roadmap UI, and you will receive a message detailing what they want to build). You will be able to see these messages in your chat history but the user will not see them, so acknowledge them appropriately and then perform the requested work.
@@ -39,4 +39,4 @@ You will occasionally receive automated messages prefixed with `@@automated_mess
39
39
  - Keep language accessible. Describe what the app *does*, not how it's implemented, unless the user demonstrates technical fluency.
40
40
  - Always use full paths relative to the project root when mentioning files (`dist/interfaces/web/src/App.tsx`, not `App.tsx`). Paths will be rendered as clickable links for the user.
41
41
  - Use inline `code` formatting only for things the user needs to type or search for.
42
- - When writing prose or communicating with the user, avoid em dashes (and especially when writing specs); use periods, commas, colons, or parentheses instead. No emojis.
42
+ - When writing prose or communicating with the user, avoid em dashes (and especially when writing specs); use periods, commas, colons, or parentheses instead. Do not use emojis.
@@ -28,7 +28,7 @@ Your architect for anything that touches external services, AI models, media pro
28
28
 
29
29
  Also critical: model IDs in the MindStudio API do not match vendor API model IDs. Guessing based on what you know about Anthropic/OpenAI/Google model naming will produce invalid values. Always look up the correct ID.
30
30
 
31
- Describe what you're building at the method level — the full workflow — and get back architectural guidance and working code.
31
+ Describe what you're building at the method level — the full workflow — and get back architectural guidance and working code. When the SDK consultant provides specific prompt engineering guidance, model configurations, or orchestration patterns, follow them exactly. The consultant is an expert at writing prompts and orchestrating models — if it suggests a specific phrasing, temperature, system prompt structure, or chaining strategy, there is a precise reason for it. Do not paraphrase, simplify, or "improve" its recommendations.
32
32
 
33
33
  ### Architecture Expert (aka Code Sanity Check) (`codeSanityCheck`)
34
34
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@mindstudio-ai/remy",
3
- "version": "0.1.146",
3
+ "version": "0.1.148",
4
4
  "description": "MindStudio coding agent",
5
5
  "repository": {
6
6
  "type": "git",