npm - oh-my-opencode - Versions diffs - 2.4.5 → 2.4.7 - Mend

oh-my-opencode 2.4.5 → 2.4.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8) hide show

package/README.ja.md +4 -4
package/README.ko.md +4 -4
package/README.md +4 -4
package/README.zh-cn.md +4 -4
package/dist/hooks/non-interactive-env/constants.d.ts +32 -0
package/dist/hooks/non-interactive-env/index.d.ts +1 -0
package/dist/index.js +119 -15
package/package.json +1 -1

package/README.ja.md CHANGED Viewed

@@ -317,12 +317,12 @@ opencode auth login
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```
-**利用可能なモデル名**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-2.5-flash`, `google/gemini-2.5-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
+**利用可能なモデル名**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-3-flash`, `google/gemini-3-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
 その後、認証を行います：
@@ -432,7 +432,7 @@ gh repo star code-yeongyu/oh-my-opencode
 - **explore** (`opencode/grok-code`): 高速なコードベース探索、ファイルパターンマッチング。Claude Code は Haiku を使用しますが、私たちは Grok を使います。現在無料であり、極めて高速で、ファイル探索タスクには十分な知能を備えているからです。Claude Code からインスピレーションを得ました。
 - **frontend-ui-ux-engineer** (`google/gemini-3-pro-preview`): 開発者に転身したデザイナーという設定です。素晴らしい UI を作ります。美しく独創的な UI コードを生成することに長けた Gemini を使用します。
 - **document-writer** (`google/gemini-3-pro-preview`): テクニカルライティングの専門家という設定です。Gemini は文筆家であり、流れるような文章を書きます。
-- **multimodal-looker** (`google/gemini-2.5-flash`): 視覚コンテンツ解釈のための専門エージェント。PDF、画像、図表を分析して情報を抽出します。
+- **multimodal-looker** (`google/gemini-3-flash`): 視覚コンテンツ解釈のための専門エージェント。PDF、画像、図表を分析して情報を抽出します。
 メインエージェントはこれらを自動的に呼び出しますが、明示的に呼び出すことも可能です：
@@ -675,7 +675,7 @@ Oh My OpenCode は以下の場所からフックを読み込んで実行しま
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```

package/README.ko.md CHANGED Viewed

@@ -314,12 +314,12 @@ opencode auth login
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```
-**사용 가능한 모델 이름**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-2.5-flash`, `google/gemini-2.5-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
+**사용 가능한 모델 이름**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-3-flash`, `google/gemini-3-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
 그 후 인증:
@@ -429,7 +429,7 @@ gh repo star code-yeongyu/oh-my-opencode
 - **explore** (`opencode/grok-code`): 빠른 코드베이스 탐색, 파일 패턴 매칭. Claude Code는 Haiku를 쓰지만, 우리는 Grok을 씁니다. 현재 무료이고, 극도로 빠르며, 파일 탐색 작업에 충분한 지능을 갖췄기 때문입니다. Claude Code 에서 영감을 받았습니다.
 - **frontend-ui-ux-engineer** (`google/gemini-3-pro-preview`): 개발자로 전향한 디자이너라는 설정을 갖고 있습니다. 멋진 UI를 만듭니다. 아름답고 창의적인 UI 코드를 생성하는 데 탁월한 Gemini를 사용합니다.
 - **document-writer** (`google/gemini-3-pro-preview`): 기술 문서 전문가라는 설정을 갖고 있습니다. Gemini 는 문학가입니다. 글을 기가막히게 씁니다.
-- **multimodal-looker** (`google/gemini-2.5-flash`): 시각적 콘텐츠 해석을 위한 전문 에이전트. PDF, 이미지, 다이어그램을 분석하여 정보를 추출합니다.
+- **multimodal-looker** (`google/gemini-3-flash`): 시각적 콘텐츠 해석을 위한 전문 에이전트. PDF, 이미지, 다이어그램을 분석하여 정보를 추출합니다.
 각 에이전트는 메인 에이전트가 알아서 호출하지만, 명시적으로 요청할 수도 있습니다:
@@ -669,7 +669,7 @@ Schema 자동 완성이 지원됩니다:
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```

package/README.md CHANGED Viewed

@@ -346,12 +346,12 @@ The `opencode-antigravity-auth` plugin uses different model names than the built
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```
-**Available model names**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-2.5-flash`, `google/gemini-2.5-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
+**Available model names**: `google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-3-flash`, `google/gemini-3-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
 Then authenticate:
@@ -493,7 +493,7 @@ To remove oh-my-opencode:
 - **explore** (`opencode/grok-code`): Fast codebase exploration and pattern matching. Claude Code uses Haiku; we use Grok—it's free, blazing fast, and plenty smart for file traversal. Inspired by Claude Code.
 - **frontend-ui-ux-engineer** (`google/gemini-3-pro-preview`): A designer turned developer. Builds gorgeous UIs. Gemini excels at creative, beautiful UI code.
 - **document-writer** (`google/gemini-3-pro-preview`): Technical writing expert. Gemini is a wordsmith—writes prose that flows.
-- **multimodal-looker** (`google/gemini-2.5-flash`): Visual content specialist. Analyzes PDFs, images, diagrams to extract information.
+- **multimodal-looker** (`google/gemini-3-flash`): Visual content specialist. Analyzes PDFs, images, diagrams to extract information.
 The main agent invokes these automatically, but you can call them explicitly:
@@ -733,7 +733,7 @@ When using `opencode-antigravity-auth`, disable the built-in auth and override a
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```

package/README.zh-cn.md CHANGED Viewed

@@ -325,12 +325,12 @@ opencode auth login
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```
-**可用模型名**：`google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-2.5-flash`, `google/gemini-2.5-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
+**可用模型名**：`google/gemini-3-pro-high`, `google/gemini-3-pro-medium`, `google/gemini-3-pro-low`, `google/gemini-3-flash`, `google/gemini-3-flash`, `google/gemini-3-flash-lite`, `google/claude-sonnet-4-5`, `google/claude-sonnet-4-5-thinking`, `google/claude-opus-4-5-thinking`, `google/gpt-oss-120b-medium`
 然后认证：
@@ -440,7 +440,7 @@ gh repo star code-yeongyu/oh-my-opencode
 - **explore** (`opencode/grok-code`)：极速代码库扫描、模式匹配。Claude Code 用 Haiku，我们用 Grok——免费、飞快、扫文件够用了。致敬 Claude Code。
 - **frontend-ui-ux-engineer** (`google/gemini-3-pro-preview`)：设计师出身的程序员。UI 做得那是真漂亮。Gemini 写这种创意美观的代码是一绝。
 - **document-writer** (`google/gemini-3-pro-preview`)：技术写作专家。Gemini 文笔好，写出来的东西读着顺畅。
-- **multimodal-looker** (`google/gemini-2.5-flash`)：视觉内容专家。PDF、图片、图表，看一眼就知道里头有啥。
+- **multimodal-looker** (`google/gemini-3-flash`)：视觉内容专家。PDF、图片、图表，看一眼就知道里头有啥。
 主 Agent 会自动调遣它们，你也可以亲自点名：
@@ -675,7 +675,7 @@ Agent 爽了，你自然也爽。但我还想直接让你爽。
   "agents": {
     "frontend-ui-ux-engineer": { "model": "google/gemini-3-pro-high" },
     "document-writer": { "model": "google/gemini-3-flash" },
-    "multimodal-looker": { "model": "google/gemini-2.5-flash" }
+    "multimodal-looker": { "model": "google/gemini-3-flash" }
   }
 }
 ```

package/dist/hooks/non-interactive-env/constants.d.ts CHANGED Viewed

@@ -1,2 +1,34 @@
 export declare const HOOK_NAME = "non-interactive-env";
 export declare const NON_INTERACTIVE_ENV: Record<string, string>;
+/**
+ * Shell command guidance for non-interactive environments.
+ * These patterns should be followed to avoid hanging on user input.
+ */
+export declare const SHELL_COMMAND_PATTERNS: {
+    readonly npm: {
+        readonly bad: readonly ["npm init", "npm install (prompts)"];
+        readonly good: readonly ["npm init -y", "npm install --yes"];
+    };
+    readonly apt: {
+        readonly bad: readonly ["apt-get install pkg"];
+        readonly good: readonly ["apt-get install -y pkg", "DEBIAN_FRONTEND=noninteractive apt-get install pkg"];
+    };
+    readonly pip: {
+        readonly bad: readonly ["pip install pkg (with prompts)"];
+        readonly good: readonly ["pip install --no-input pkg", "PIP_NO_INPUT=1 pip install pkg"];
+    };
+    readonly git: {
+        readonly bad: readonly ["git commit", "git merge branch", "git add -p", "git rebase -i"];
+        readonly good: readonly ["git commit -m 'msg'", "git merge --no-edit branch", "git add .", "git rebase --no-edit"];
+    };
+    readonly system: {
+        readonly bad: readonly ["rm file (prompts)", "cp a b (prompts)", "ssh host"];
+        readonly good: readonly ["rm -f file", "cp -f a b", "ssh -o BatchMode=yes host", "unzip -o file.zip"];
+    };
+    readonly banned: readonly ["vim", "nano", "vi", "emacs", "less", "more", "man", "python (REPL)", "node (REPL)", "git add -p", "git rebase -i"];
+    readonly workarounds: {
+        readonly yesPipe: "yes | ./script.sh";
+        readonly heredoc: "./script.sh <<EOF\noption1\noption2\nEOF";
+        readonly expectAlternative: "Use environment variables or config files instead of expect";
+    };
+};

package/dist/hooks/non-interactive-env/index.d.ts CHANGED Viewed

@@ -8,5 +8,6 @@ export declare function createNonInteractiveEnvHook(_ctx: PluginInput): {
         callID: string;
     }, output: {
         args: Record<string, unknown>;
+        message?: string;
     }) => Promise<void>;
 };

package/dist/index.js CHANGED Viewed

@@ -2674,7 +2674,7 @@ You are a technical writer who creates documentation that developers actually wa
 var multimodalLookerAgent = {
   description: "Analyze media files (PDFs, images, diagrams) that require interpretation beyond raw text. Extracts specific information or summaries from documents, describes visual content. Use when you need analyzed/extracted data rather than literal file contents.",
   mode: "subagent",
-  model: "google/gemini-2.5-flash",
+  model: "google/gemini-3-flash",
   temperature: 0.1,
   tools: { write: false, edit: false, bash: false, background_task: false },
   prompt: `You interpret media files that cannot be read as plain text.
@@ -6178,9 +6178,14 @@ function createPreemptiveCompactionHook(ctx, options) {
       state2.compactionInProgress.delete(sessionID);
       setTimeout(async () => {
         try {
+          const messageDir = getMessageDir4(sessionID);
+          const storedMessage = messageDir ? findNearestMessageWithFields(messageDir) : null;
           await ctx.client.session.promptAsync({
             path: { id: sessionID },
-            body: { parts: [{ type: "text", text: "Continue" }] },
+            body: {
+              agent: storedMessage?.agent,
+              parts: [{ type: "text", text: "Continue" }]
+            },
             query: { directory: ctx.directory }
           });
         } catch {}
@@ -7355,7 +7360,10 @@ ${result.inputLines ?? ""}`,
       const claudeConfig = await loadClaudeHooksConfig();
       const extendedConfig = await loadPluginExtendedConfig();
       const cachedInput = getToolInput(input.sessionID, input.tool, input.callID) || {};
-      recordToolResult(input.sessionID, input.tool, cachedInput, output.metadata || {});
+      const metadata = output.metadata;
+      const hasMetadata = metadata && typeof metadata === "object" && Object.keys(metadata).length > 0;
+      const toolOutput = hasMetadata ? metadata : { output: output.output };
+      recordToolResult(input.sessionID, input.tool, cachedInput, toolOutput);
       if (!isHookDisabled(config, "PostToolUse")) {
         const postClient = {
           session: {
@@ -8634,19 +8642,27 @@ function createKeywordDetectorHook() {
     "chat.message": async (input, output) => {
       const isFirstMessage = !sessionFirstMessageProcessed2.has(input.sessionID);
       sessionFirstMessageProcessed2.add(input.sessionID);
-      if (isFirstMessage) {
-        log("Skipping keyword detection on first message for title generation", { sessionID: input.sessionID });
-        return;
-      }
       const promptText = extractPromptText2(output.parts);
       const messages = detectKeywords(promptText);
       if (messages.length === 0) {
         return;
       }
-      log(`Keywords detected: ${messages.length}`, { sessionID: input.sessionID });
-      const message = output.message;
       const context = messages.join(`
 `);
+      if (isFirstMessage) {
+        log(`Keywords detected on first message, transforming parts directly`, { sessionID: input.sessionID, keywordCount: messages.length });
+        const idx = output.parts.findIndex((p) => p.type === "text" && p.text);
+        if (idx >= 0) {
+          output.parts[idx].text = `${context}
+---
+${output.parts[idx].text ?? ""}`;
+        }
+        return;
+      }
+      log(`Keywords detected: ${messages.length}`, { sessionID: input.sessionID });
+      const message = output.message;
       log(`[keyword-detector] Injecting context for ${messages.length} keywords`, { sessionID: input.sessionID, contextLength: context.length });
       const success = injectHookMessage(input.sessionID, context, {
         agent: message.agent,
@@ -8673,10 +8689,65 @@ var NON_INTERACTIVE_ENV = {
   VISUAL: "true",
   GIT_SEQUENCE_EDITOR: "true",
   GIT_PAGER: "cat",
-  PAGER: "cat"
+  PAGER: "cat",
+  npm_config_yes: "true",
+  PIP_NO_INPUT: "1",
+  YARN_ENABLE_IMMUTABLE_INSTALLS: "false"
+};
+var SHELL_COMMAND_PATTERNS = {
+  npm: {
+    bad: ["npm init", "npm install (prompts)"],
+    good: ["npm init -y", "npm install --yes"]
+  },
+  apt: {
+    bad: ["apt-get install pkg"],
+    good: ["apt-get install -y pkg", "DEBIAN_FRONTEND=noninteractive apt-get install pkg"]
+  },
+  pip: {
+    bad: ["pip install pkg (with prompts)"],
+    good: ["pip install --no-input pkg", "PIP_NO_INPUT=1 pip install pkg"]
+  },
+  git: {
+    bad: ["git commit", "git merge branch", "git add -p", "git rebase -i"],
+    good: ["git commit -m 'msg'", "git merge --no-edit branch", "git add .", "git rebase --no-edit"]
+  },
+  system: {
+    bad: ["rm file (prompts)", "cp a b (prompts)", "ssh host"],
+    good: ["rm -f file", "cp -f a b", "ssh -o BatchMode=yes host", "unzip -o file.zip"]
+  },
+  banned: [
+    "vim",
+    "nano",
+    "vi",
+    "emacs",
+    "less",
+    "more",
+    "man",
+    "python (REPL)",
+    "node (REPL)",
+    "git add -p",
+    "git rebase -i"
+  ],
+  workarounds: {
+    yesPipe: "yes | ./script.sh",
+    heredoc: `./script.sh <<EOF
+option1
+option2
+EOF`,
+    expectAlternative: "Use environment variables or config files instead of expect"
+  }
 };
 // src/hooks/non-interactive-env/index.ts
+var BANNED_COMMAND_PATTERNS = SHELL_COMMAND_PATTERNS.banned.filter((cmd) => !cmd.includes("(")).map((cmd) => new RegExp(`\\b${cmd}\\b`));
+function detectBannedCommand(command) {
+  for (let i = 0;i < BANNED_COMMAND_PATTERNS.length; i++) {
+    if (BANNED_COMMAND_PATTERNS[i].test(command)) {
+      return SHELL_COMMAND_PATTERNS.banned[i];
+    }
+  }
+  return;
+}
 function createNonInteractiveEnvHook(_ctx) {
   return {
     "tool.execute.before": async (input, output) => {
@@ -8691,6 +8762,10 @@ function createNonInteractiveEnvHook(_ctx) {
         ...output.args.env,
         ...NON_INTERACTIVE_ENV
       };
+      const bannedCmd = detectBannedCommand(command);
+      if (bannedCmd) {
+        output.message = `\u26A0\uFE0F Warning: '${bannedCmd}' is an interactive command that may hang in non-interactive environments.`;
+      }
       log(`[${HOOK_NAME2}] Set non-interactive environment variables`, {
         sessionID: input.sessionID,
         env: NON_INTERACTIVE_ENV
@@ -26710,6 +26785,30 @@ session_id: ${sessionID}
 var MULTIMODAL_LOOKER_AGENT = "multimodal-looker";
 var LOOK_AT_DESCRIPTION = `Analyze media files (PDFs, images, diagrams) via Gemini 2.5 Flash in separate context. Saves main context tokens.`;
 // src/tools/look-at/tools.ts
+import { extname as extname3, basename as basename4 } from "path";
+function inferMimeType(filePath) {
+  const ext = extname3(filePath).toLowerCase();
+  const mimeTypes = {
+    ".jpg": "image/jpeg",
+    ".jpeg": "image/jpeg",
+    ".png": "image/png",
+    ".gif": "image/gif",
+    ".webp": "image/webp",
+    ".svg": "image/svg+xml",
+    ".bmp": "image/bmp",
+    ".ico": "image/x-icon",
+    ".pdf": "application/pdf",
+    ".txt": "text/plain",
+    ".md": "text/markdown",
+    ".json": "application/json",
+    ".xml": "application/xml",
+    ".html": "text/html",
+    ".css": "text/css",
+    ".js": "text/javascript",
+    ".ts": "text/typescript"
+  };
+  return mimeTypes[ext] || "application/octet-stream";
+}
 function createLookAt(ctx) {
   return tool({
     description: LOOK_AT_DESCRIPTION,
@@ -26719,12 +26818,13 @@ function createLookAt(ctx) {
     },
     async execute(args, toolContext) {
       log(`[look_at] Analyzing file: ${args.file_path}, goal: ${args.goal}`);
+      const mimeType = inferMimeType(args.file_path);
+      const filename = basename4(args.file_path);
       const prompt = `Analyze this file and extract the requested information.
-File path: ${args.file_path}
 Goal: ${args.goal}
-Read the file using the Read tool, then provide ONLY the extracted information that matches the goal.
+Provide ONLY the extracted information that matches the goal.
 Be thorough on what was requested, concise on everything else.
 If the requested information is not found, clearly state what is missing.`;
       log(`[look_at] Creating session with parent: ${toolContext.sessionID}`);
@@ -26740,7 +26840,7 @@ If the requested information is not found, clearly state what is missing.`;
       }
       const sessionID = createResult.data.id;
       log(`[look_at] Created session: ${sessionID}`);
-      log(`[look_at] Sending prompt to session ${sessionID}`);
+      log(`[look_at] Sending prompt with file passthrough to session ${sessionID}`);
       await ctx.client.session.prompt({
         path: { id: sessionID },
         body: {
@@ -26748,9 +26848,13 @@ If the requested information is not found, clearly state what is missing.`;
           tools: {
             task: false,
             call_omo_agent: false,
-            look_at: false
+            look_at: false,
+            read: false
           },
-          parts: [{ type: "text", text: prompt }]
+          parts: [
+            { type: "text", text: prompt },
+            { type: "file", mime: mimeType, url: `file://${args.file_path}`, filename }
+          ]
         }
       });
       log(`[look_at] Prompt sent, fetching messages...`);

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "oh-my-opencode",
-  "version": "2.4.5",
+  "version": "2.4.7",
   "description": "OpenCode plugin - custom agents (oracle, librarian) and enhanced features",
   "main": "dist/index.js",
   "types": "dist/index.d.ts",