npm - fluxflow-cli - Versions diffs - 1.21.1 → 1.21.3 - Mend

fluxflow-cli 1.21.1 → 1.21.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/ARCHITECTURE.md CHANGED Viewed

@@ -1,65 +1,65 @@
-# 🏛️ Architecture & Design
-Flux Flow is built on a modern, reactive stack that brings web-like development paradigms to the terminal. It utilizes a custom agentic loop for reasoning and a unique dual-model system for background processing.
-## UI Layer: React & Ink
-The entire terminal interface is built using **React** via the [Ink](https://github.com/vadimdemedes/ink) renderer.
-- **Component-Based**: The UI is composed of isolated, reusable React components (`ChatLayout`, `StatusBar`, `CommandMenu`, `TerminalBox`, `ProfileForm`).
-- **Reactive State**: The application uses React hooks (`useState`, `useEffect`) to manage user input, application mode, model selection, and the terminal's resizing events.
-- **Zero-Render Overheads**: Critical performance trackers, like the session start time, are kept outside the React render cycle to maintain terminal responsiveness during high-speed AI text streaming.
-## The Agentic Loop
-The core intelligence of Flux Flow resides in `src/utils/ai.js`. It does not rely on opaque third-party agent frameworks; instead, it uses a custom, highly transparent string-based protocol powered by an asynchronous generator (`async function*`). This approach allows for real-time UI updates while managing complex multi-step reasoning.
-The execution flow of a single user prompt follows this loop:
-1. **Context Assembly**: The user's prompt is combined with the system instructions, temporary session context, persistent user memories, and the current chat history. If the history gets too large (e.g., >254k tokens) and compression is disabled, it is gracefully truncated.
-2. **Stream Processing**: The main loop initiates a streaming request to the Gemini API (`client.models.generateContentStream`). It yields chunks of text and status updates directly back to the React UI as they arrive.
-3. **Detection & Tool Execution**: Once the stream completes for a given turn, the entire response is scanned for tool calls using a custom regex and bracket-balancing parser (looking for `tool:functions.tool_name(args...)`).
-   - If tools are found, the loop pauses.
-   - Each tool is dispatched to its respective handler in `src/tools/`.
-   - Tool outputs are collected and appended to the context as `[TOOL RESULT]: ...`.
-4. **Security Governance**: During tool execution, the loop enforces security checks (e.g., blocking `exec_command` from accessing system root drives if "External Workspace Access" is off) and pauses for Human-in-the-Loop (HITL) approval if necessary.
-5. **Turn Management & Continuation**: The model is instructed to append `[turn: finish]` if its goal is complete, or `[turn: continue]` if it expects tool results.
-   - If tools were called or `[turn: continue]` is present, the loop increments and re-prompts the model with the newly gathered `[TOOL RESULT]` data.
-   - If `[turn: finish]` is detected and no further tools were called, the main loop terminates, passing the final synthesized context to the background Janitor process.
-6. **Loop Limits & Resilience**: To prevent infinite loops or excessive API usage, **Flux mode** is capped at 70 iterations per user prompt, while **Flow mode** is capped at 7.
-   - **Multi-Stage Failover**: The loop features a sophisticated 8-attempt retry engine with random backoff (800ms - 2s).
-   - **Critical Fallback Pivot**: If the primary model fails 13 consecutive times, the agent surgically pivots to a lighter, high-concurrency fallback model (`gemini-3.1-flash-lite`) for the final 3 attempts to ensure session navigation through API congestion.
-## Multimodal Pipeline
-Flux Flow implements a native multimodal processing engine in `src/tools/view_file.js`. This allows the agent to move beyond text-based reasoning and analyze visual assets directly.
-- **Binary Detection**: The pipeline uses `is-binary-path` to distinguish between text and binary files.
-- **Visual Encoding**: If an image or PDF is detected, the engine reads the raw bytes and converts them into base64-encoded `InlineData` objects.
-- **PDF Extraction**: For PDF documents, the engine extracts visual representation of pages to provide the model with high-fidelity spatial and textual context simultaneously.
-- **Context Injection**: These multimodal assets are injected directly into the Gemini model's multimodal part array, allowing the model to "see" the file as if it were looking at a screenshot.
-## The Dual-Model System
-To maintain a fast, snappy UI while still performing complex data management, Flux Flow employs two separate AI models for every interaction:
-### 1. The Main Agent
-- **Responsibility**: Direct user interaction, reasoning, and tool execution.
-- **Behavior**: Streams text directly to the UI. It focuses entirely on solving the user's immediate problem or answering their question.
-### 2. The Janitor (Background Process)
-- **Responsibility**: System maintenance, long-term memory extraction, and chat summarization.
-- **Behavior**: After the Main Agent finishes its loop, the entire context (User Prompt + Agent Raws) is sent to the Janitor model.
-- **Headless Operation**: The Janitor is explicitly instructed to be a "silent background system process" with "no mouth." It *only* outputs valid tool calls (e.g., updating the chat title or saving a new user preference to the persistent memory vault).
-## Data Persistence & Safety
-- **High-Fidelity Lock**: Because both the UI and the Janitor model may attempt to write to the `history.json` file simultaneously, a Promise-based `WRITE_LOCK` (`src/utils/history.js`) is utilized. This prevents race conditions and ensures data integrity.
-- **Encryption**: User secrets and persistent memories (`secret/memories.json`) are handled by `src/utils/crypto.js` to ensure local privacy.
-## Redirection & The Anchor Strategy
-To support data portability (e.g., storing all app data on an external encrypted drive), Flux Flow utilizes a synchronous "Anchor" strategy in `src/utils/paths.js`.
-- **Synchronous Pivot**: Because many core modules (History, Secrets, Usage) initialize their file paths as constants during module loading, the application must determine the "Actual" data root before anything else.
-- **Boot-Sequence Priority**: On every launch, `paths.js` performs a synchronous file system check for `~/.fluxflow/settings.json`. If a redirection path is found (`useExternalData: true`), it immediately overrides the global `DATA_DIR` constant for the entire process.
-- **Sub-Coordinate Resolution**: All secondary directories (`LOGS_DIR`, `SECRET_DIR`) are derived dynamically from the redirected `DATA_DIR`, ensuring that all session data flows to the external sanctuary without requiring individual configuration updates across the codebase.
+# 🏛️ Architecture & Design
+Flux Flow is built on a modern, reactive stack that brings web-like development paradigms to the terminal. It utilizes a custom agentic loop for reasoning and a unique dual-model system for background processing.
+## UI Layer: React & Ink
+The entire terminal interface is built using **React** via the [Ink](https://github.com/vadimdemedes/ink) renderer.
+- **Component-Based**: The UI is composed of isolated, reusable React components (`ChatLayout`, `StatusBar`, `CommandMenu`, `TerminalBox`, `ProfileForm`).
+- **Reactive State**: The application uses React hooks (`useState`, `useEffect`) to manage user input, application mode, model selection, and the terminal's resizing events.
+- **Zero-Render Overheads**: Critical performance trackers, like the session start time, are kept outside the React render cycle to maintain terminal responsiveness during high-speed AI text streaming.
+## The Agentic Loop
+The core intelligence of Flux Flow resides in `src/utils/ai.js`. It does not rely on opaque third-party agent frameworks; instead, it uses a custom, highly transparent string-based protocol powered by an asynchronous generator (`async function*`). This approach allows for real-time UI updates while managing complex multi-step reasoning.
+The execution flow of a single user prompt follows this loop:
+1. **Context Assembly**: The user's prompt is combined with the system instructions, temporary session context, persistent user memories, and the current chat history. If the history gets too large (e.g., >256k tokens) and compression is disabled, it is gracefully truncated. Some models/modes can go upto 400k context.
+2. **Stream Processing**: The main loop initiates a streaming request to the Gemini API (`client.models.generateContentStream`). It yields chunks of text and status updates directly back to the React UI as they arrive.
+3. **Detection & Tool Execution**: Once the stream completes for a given turn, the entire response is scanned for tool calls using a custom regex and bracket-balancing parser (looking for `tool:functions.tool_name(args...)`).
+   - If tools are found, the loop pauses.
+   - Each tool is dispatched to its respective handler in `src/tools/`.
+   - Tool outputs are collected and appended to the context as `[TOOL RESULT]: ...`.
+4. **Security Governance**: During tool execution, the loop enforces security checks (e.g., blocking `exec_command` from accessing system root drives if "External Workspace Access" is off) and pauses for Human-in-the-Loop (HITL) approval if necessary.
+5. **Turn Management & Continuation**: The model is instructed to append `[turn: finish]` if its goal is complete, or `[turn: continue]` if it expects tool results.
+   - If tools were called or `[turn: continue]` is present, the loop increments and re-prompts the model with the newly gathered `[TOOL RESULT]` data.
+   - If `[turn: finish]` is detected and no further tools were called, the main loop terminates, passing the final synthesized context to the background Janitor process.
+6. **Loop Limits & Resilience**: To prevent infinite loops or excessive API usage, **Flux mode** is capped at 70 iterations per user prompt, while **Flow mode** is capped at 7.
+   - **Multi-Stage Failover**: The loop features a sophisticated 16-attempt retry engine with exponential backoff (1s - 32s).
+   - **Critical Fallback Pivot**: If the primary model fails 14 consecutive times, the agent surgically pivots to a lighter, high-concurrency fallback model (`gemini-3.1-flash-lite`) for the final 3 attempts to ensure session navigation through API congestion.
+## Multimodal Pipeline
+Flux Flow implements a native multimodal processing engine in `src/tools/view_file.js`. This allows the agent to move beyond text-based reasoning and analyze visual assets directly (Only on supported models).
+- **Binary Detection**: The pipeline uses `is-binary-path` to distinguish between text and binary files.
+- **Visual Encoding**: If an image or PDF is detected, the engine reads the raw bytes and converts them into base64-encoded `InlineData` objects.
+- **PDF Extraction**: For PDF documents, the engine extracts visual representation of pages to provide the model with high-fidelity spatial and textual context simultaneously.
+- **Context Injection**: These multimodal assets are injected directly into the Gemini model's multimodal part array, allowing the model to "see" the file as if it were looking at a screenshot.
+## The Dual-Model System
+To maintain a fast, snappy UI while still performing complex data management, Flux Flow employs two separate AI models for every interaction:
+### 1. The Main Agent
+- **Responsibility**: Direct user interaction, reasoning, and tool execution.
+- **Behavior**: Streams text directly to the UI. It focuses entirely on solving the user's immediate problem or answering their question.
+### 2. The Janitor (Background Process)
+- **Responsibility**: System maintenance, long-term memory extraction, and chat summarization.
+- **Behavior**: After the Main Agent finishes its loop, the entire context (User Prompt + Agent Raws) is sent to the Janitor model.
+- **Headless Operation**: The Janitor is explicitly instructed to be a "silent background system process" with "no mouth." It *only* outputs valid tool calls (e.g., updating the chat title or saving a new user preference to the persistent memory vault).
+## Data Persistence & Safety
+- **High-Fidelity Lock**: Because both the UI and the Janitor model may attempt to write to the `history.json` file simultaneously, a Promise-based `WRITE_LOCK` (`src/utils/history.js`) is utilized. This prevents race conditions and ensures data integrity.
+- **Encryption**: User secrets and persistent memories (`secret/memories.json`) are handled by `src/utils/crypto.js` to ensure local privacy.
+## Redirection & The Anchor Strategy
+To support data portability (e.g., storing all app data on an external encrypted drive), Flux Flow utilizes a synchronous "Anchor" strategy in `src/utils/paths.js`.
+- **Synchronous Pivot**: Because many core modules (History, Secrets, Usage) initialize their file paths as constants during module loading, the application must determine the "Actual" data root before anything else.
+- **Boot-Sequence Priority**: On every launch, `paths.js` performs a synchronous file system check for `~/.fluxflow/settings.json`. If a redirection path is found (`useExternalData: true`), it immediately overrides the global `DATA_DIR` constant for the entire process.
+- **Sub-Coordinate Resolution**: All secondary directories (`LOGS_DIR`, `SECRET_DIR`) are derived dynamically from the redirected `DATA_DIR`, ensuring that all session data flows to the external sanctuary without requiring individual configuration updates across the codebase.

package/README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 ![Flux Flow Logo](https://github.com/KushalRoyChowdhury/fluxflow-cli/blob/main/fluxflow.png)
 <p align="left">
-  <a href="https://github.com/KushalRoyChowdhury/fluxflow-cli"><img src="https://img.shields.io/badge/FluxFlow-v1.19.0-blue?style=plastic" alt="FluxFlow Version"></a>
+  <a href="https://github.com/KushalRoyChowdhury/fluxflow-cli"><img src="https://img.shields.io/badge/FluxFlow-v1.21-blue?style=plastic" alt="FluxFlow Version"></a>
   <a href="https://deepmind.google"><img src="https://img.shields.io/badge/Engine-Gemma%204-red?style=plastic" alt="Engine Gemma 4"></a>
   <a href="https://pollinations.ai"><img src="https://img.shields.io/badge/Built%20With-pollinations.ai-cyan?style=plastic" alt="Built With pollinations.ai"></a>
   <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg?style=plastic" alt="License MIT"></a>
@@ -53,7 +53,6 @@ Flux Flow can now see! Use the `view_file` tool to analyze images (JPG, PNG) or
 Need a report or a presentation? Just ask. Flux Flow features a high-fidelity "Printing Press" that generates professional, branded documents natively:
 - **PDF**: Branded documents from HTML/CSS with automatic watermarking.
 - **DOCX**: Native Word documents with multi-page support and automatic numbering.
-- **PPTX**: High-fidelity PowerPoint presentations using native elements (selectable text, shapes) translated directly from HTML.
 ### ⏱️ **Codebase Time Travel (Git-less Reversion)**
 Refactor and build with absolute fearlessness. Flux Flow maintains transaction-based secure snapshots of files before they are generated or edited:
@@ -68,8 +67,8 @@ Zero setup means zero setup. On first run, Flux Flow performs an integrity check
 - **Rich Aesthetics**: High-contrast, sleek design with smooth transitions and micro-animations.
 ### 🧠 **The Dual-Intelligence System**
-- **Flux Mode (Dev)**: High-speed, agentic problem solving with a 50-turn persistent loop for massive coding tasks.
-- **Flow Mode (Chat)**: Optimized for deep research, high-quality conversation, and web-assisted reasoning.
+- **Flux Mode (Dev)**: High-speed, agentic problem solving with a 70-turn persistent loop for massive coding tasks.
+- **Flow Mode (Chat)**: Optimized for high-quality conversation and web-assisted reasoning.
 ### 🛡️ **Digital Fortress Governance**
 Security isn't an afterthought; it's a boundary.

package/TOOLS.md CHANGED Viewed

@@ -11,7 +11,6 @@ Flux Flow provides a robust set of tools that allow the AI to interact with the
 | **Generate Image** | ✅ | ❌ |
 | **Write PDF** | ✅ | ❌ |
 | **Write DOCX** | ✅ | ❌ |
-| **Write PPTX** | ✅ | ❌ |
 | **View/Read Files** | ✅ | ❌ |
 | **Write/Update Files** | ✅ | ❌ |
 | **Execute Commands** | ✅ | ❌ |
@@ -22,37 +21,33 @@ Flux Flow provides a robust set of tools that allow the AI to interact with the
 ## Core Tools
 ### 🌐 Web & Research
-- **`web_search`**: Uses DuckDuckGo to find up-to-date information on the internet. Crucial for answering questions about recent events or unlearned documentation.
-- **`web_scrape`**: Extracts the detailed text content from a specific URL, allowing the agent to read documentation or articles.
+- **`WebSearch`**: Uses DuckDuckGo to find up-to-date information on the internet. Crucial for answering questions about recent events or unlearned documentation.
+- **`WebScrape`**: Extracts the detailed text content from a specific URL, allowing the agent to read documentation or articles.
 ### 📄 Document Engineering (The Office Suite)
-- **`write_pdf`**: Generates high-fidelity, branded PDF documents from HTML/CSS. Features automatic watermarking and page-aware layout management.
-- **`write_docx`**: Generates professional Word documents (.docx) from HTML. Supports multi-page layouts, automatic page numbering, and native styling.
-- **`write_pptx`**: Generates high-fidelity, native PowerPoint presentations (.pptx) from HTML.
-  - **Native Translation**: Uses `html2pptxgenjs` to translate HTML tags (`<h1>`, `<ul>`, `<b>`) directly into selectable PowerPoint text objects.
-  - **Rich Styling**: Supports CSS-like properties (color, font-size, text-align) for professional slide design.
-  - **Flat Protocol**: Optimized for AI stability using a flat HTML string interface.
+- **`WritePDF`**: Generates high-fidelity, branded PDF documents from HTML/CSS. Features automatic watermarking and page-aware layout management.
+- **`WriteDOCX`**: Generates professional Word documents (.docx) from HTML. Supports multi-page layouts, automatic page numbering, and native styling.
 ### 🎨 Creative & Visual
-- **`generate_image`**: Generates high-fidelity images using Pollinations AI.
+- **`GenerateImage`**: Generates high-fidelity images using Pollinations AI.
   - **Customization**: Supports customizable models (Flux, ZImage, Qwen, Nanobanana-Pro, etc.), aspect ratios, custom prompt generation, and random seeds.
   - **Telemetry**: Tracks hourly credit usage (Low, Medium, Ultra, Premium tiers) with built-in daily limit checks and interactive dashboard displays.
 ### 📁 File System Operations
-- **`list_files`**: Lists the contents of a directory to help the agent understand the project structure.
-- **`read_folder`**: Provides detailed statistics and metadata about a directory's contents.
-- **`view_file`**: Reads the content of a file.
+- **`ListFiles`**: Lists the contents of a directory to help the agent understand the project structure.
+- **`ReadFolder`**: Provides detailed statistics and metadata about a directory's contents.
+- **`ViewFile`**: Reads the content of a file.
   - **Native Multimodality**: Supports analyzing images (JPG, PNG, WEBP) and PDF documents. The tool automatically detects binary formats and encodes them for AI analysis.
   - **Text Reading**: Supports specific line ranges (`start_line`, `end_line`) to manage context size efficiently.
-- **`search_keyword`**: Performs a global project search for a specific string or keyword. Returns file paths and line numbers where matches are found, making it essential for navigation and impact analysis.
+- **`SearchKeyword`**: Performs a global project search for a specific string or keyword. Returns file paths and line numbers where matches are found, making it essential for navigation and impact analysis.
 ### ✍️ Code Editing
-- **`write_file`**: Creates a new file or completely overwrites an existing one with new content.
-- **`update_file` (Smart Patching)**: Surgically replaces a specific block of text within a file.
+- **`WriteFile`**: Creates a new file or completely overwrites an existing one with new content.
+- **`UpdateFile` (Smart Patching)**: Surgically replaces a specific block of text within a file.
   - *Diff Generation*: It returns a high-fidelity visual diff (Red/Green changes with context lines) to the UI, allowing the user to see exactly what the agent modified.
 ### 💻 Terminal Execution
-- **`exec_command`**: Runs a shell command directly in the terminal using Node's `child_process.spawn`.
+- **`Run`**: Runs a shell command directly in the terminal using Node's `child_process.spawn` or `node-pty` when available.
   - *Context Aware*: Runs in the current working directory.
   - *Cross-Platform*: Uses `shell: true` to handle Windows `.cmd`/`.bat` files natively.

package/UI_FEATURES.md CHANGED Viewed

@@ -6,8 +6,8 @@ Flux Flow is designed to be a high-performance terminal application. Beyond basi
 You can control the application using `/` commands directly in the chat input:
-- **/mode [flux|flow]**: Quickly switch between **Flux** (Dev) and **Flow** (Chat) modes. Using it without arguments opens the selection menu.
-- **/thinking [low|medium|high|max|show|hide]**: Adjust reasoning depth or toggle visibility of the thinking process. Using it without arguments opens the selection menu.
+- **/mode [flux|flow]**: Quickly switch between **Flux** (Dev) and **Flow** (Chat) modes.
+- **/thinking [fast|low|medium|high|max]**: Adjust reasoning depth.
 - **/model [name]**: Choose which AI model to use for the main interaction.
 - **/key**: Open the API Key management view to update or remove your credentials.
 - **/settings**: Access the system configuration menu.
@@ -26,23 +26,11 @@ Flux Flow allows you to "Anchor" your data outside of the default user directory
 - **Portability**: Once set, the application will synchronously "pivot" all data operations to your specified `externalDataPath` upon startup.
 - **Privacy**: Keeps sensitive data off your primary system drive.
-### Command Shortcuts
-For power users, several commands support direct arguments to skip the menus:
-- ` /mode flux ` or ` /mode flow `
-- ` /thinking low ` / ` /thinking medium ` / ` /thinking high ` / ` /thinking max `
-- ` /thinking show ` / ` /thinking hide ` (Toggles thinking process visibility)
-- ` /model gemini-3.1-pro-preview ` (Switches model directly)
-- ` /update check ` (Checks for updates)
 ## 🧠 Thinking Levels & Visualization
 Flux Flow separates the model's "internal monologue" (reasoning) from its final response using `<think>` tags.
 - **Thinking Levels**: Depending on the mode, you can choose from **Fast**, **Low**, **Medium**, **High**, or **xHigh**. Higher levels allow the agent more "space" to reason through complex architecture or debugging problems.
-- **Show/Hide Thinking**: You can toggle the visibility of the thinking process using `/thinking show/hide`.
-  - When **Hidden**, the agent doesn't just disappear; it provides a "minimalist" view showing only the core **Headings** and **Action Steps** (bolded lines) from its reasoning. This keeps you informed of its current "step" without cluttering the screen with detailed internal monologue.
 ## ⚡ Interactive Sub-Terminal
@@ -73,6 +61,8 @@ By default, the agent **cannot** execute dangerous actions without your consent.
 For power users, **Auto-Exec** can be enabled in `/settings`.
 - **⚠️ Warning**: This allows the agent to run any tool and execute any command autonomously.
 - **External Access**: You can also toggle whether the agent is allowed to access files outside of its current working directory.
+- **Manual Commad Control**: Specify which commands you want the agent to auto-execute, manually approve or auto-deny.
+- **Network Sandboxing**: Turn off network access in integrated terminal.
 ## 🔄 Steering & Resolution

package/dist/fluxflow.js CHANGED Viewed

@@ -940,9 +940,13 @@ var StatusBar, StatusBar_default;
 var init_StatusBar = __esm({
   "src/components/StatusBar.jsx"() {
     init_text();
-    StatusBar = React4.memo(({ mode, thinkingLevel, tokens = "0.0k", tokensTotal = "0.0k", chatId = "NEW-SESSION", isMemoryEnabled = true, apiTier = "Free" }) => {
+    StatusBar = React4.memo(({ mode, thinkingLevel, tokens = "0.0k", tokensTotal = "0.0k", chatId = "NEW-SESSION", isMemoryEnabled = true, apiTier = "Free", aiProvider = "Google" }) => {
       const modeColor = mode === "Flux" ? "yellow" : "cyan";
       const modeIcon = mode === "Flux" ? "\u26A1" : "\u{1F30A}";
+      let maxLimit = 256e3;
+      if (aiProvider === "DeepSeek" || aiProvider === "Google" && apiTier === "Paid") {
+        maxLimit = 4e5;
+      }
       return /* @__PURE__ */ React4.createElement(
         Box4,
         {
@@ -955,7 +959,7 @@ var init_StatusBar = __esm({
         },
         /* @__PURE__ */ React4.createElement(Box4, null, /* @__PURE__ */ React4.createElement(Box4, { marginRight: 1 }, /* @__PURE__ */ React4.createElement(Text4, { color: modeColor, bold: true }, modeIcon, " ", mode.toUpperCase())), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginX: 1 }, /* @__PURE__ */ React4.createElement(Text4, { color: "magenta" }, "\u{1F9E0} ", thinkingLevel)), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginX: 1 }, /* @__PURE__ */ React4.createElement(Text4, { color: "gray" }, "MEM: "), /* @__PURE__ */ React4.createElement(Text4, { color: isMemoryEnabled ? "green" : "red", bold: true }, isMemoryEnabled ? "ON" : "OFF"))),
         /* @__PURE__ */ React4.createElement(Box4, { flexGrow: 1, justifyContent: "center", paddingX: 2 }, /* @__PURE__ */ React4.createElement(Text4, null, "\u{1F4C1}"), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", italic: true }, " ", truncatePath(process.cwd(), 35))),
-        /* @__PURE__ */ React4.createElement(Box4, null, /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginX: 1 }, /* @__PURE__ */ React4.createElement(Text4, null, "\u2728"), /* @__PURE__ */ React4.createElement(Text4, { color: "blue" }, " ", formatTokens(tokensTotal), " ", /* @__PURE__ */ React4.createElement(Text4, { dimColor: true }, "(", (tokens / 254e3 * 100).toFixed(0), "%)"))), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginLeft: 1 }, /* @__PURE__ */ React4.createElement(Text4, null, "\u{1F194}"), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true, italic: true }, " ", chatId), (apiTier === "Custom" || apiTier === "Paid") && /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, " | ", /* @__PURE__ */ React4.createElement(Text4, { color: "green", bold: true }, "PAID"))))
+        /* @__PURE__ */ React4.createElement(Box4, null, /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginX: 1 }, /* @__PURE__ */ React4.createElement(Text4, null, "\u2728"), /* @__PURE__ */ React4.createElement(Text4, { color: "blue" }, " ", formatTokens(tokensTotal), " ", /* @__PURE__ */ React4.createElement(Text4, { dimColor: true }, "(", (tokens / maxLimit * 100).toFixed(0), "%)"))), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, "\u2503 "), /* @__PURE__ */ React4.createElement(Box4, { marginLeft: 1 }, /* @__PURE__ */ React4.createElement(Text4, null, "\u{1F194}"), /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true, italic: true }, " ", chatId), (apiTier === "Custom" || apiTier === "Paid") && /* @__PURE__ */ React4.createElement(Text4, { color: "gray", dimColor: true }, " | ", /* @__PURE__ */ React4.createElement(Text4, { color: "green", bold: true }, "PAID"))))
       );
     });
     StatusBar_default = StatusBar;
@@ -3648,7 +3652,7 @@ var view_file;
 var init_view_file = __esm({
   "src/tools/view_file.js"() {
     init_arg_parser();
-    view_file = async (args) => {
+    view_file = async (args, context = {}) => {
       let { path: targetPath, StartLine, EndLine, start_line, end_line, startLine, endLine } = parseArgs(args);
       const sLine = parseInt(StartLine || start_line || startLine);
       const eLine = parseInt(EndLine || end_line || endLine);
@@ -3680,11 +3684,15 @@ var init_view_file = __esm({
           ".doc": "application/msword"
         };
         if (mimeMap[ext]) {
+          const isMultiModal = context.isMultiModal !== false;
+          if (!isMultiModal) {
+            return `ERROR: Multimodality is not supported for the current model. Unable to load [${targetPath}].`;
+          }
           const buffer = fs8.readFileSync(absolutePath);
           const base64 = buffer.toString("base64");
           const mimeType = mimeMap[ext];
           return {
-            text: `[BINARY_FILE]: ${targetPath} (${mimeType}) - Loaded as multimodal part.`,
+            text: `[BINARY FILE]: ${targetPath} (${mimeType}) - Loaded as multimodal part.`,
             binaryPart: {
               inlineData: {
                 data: base64,
@@ -4637,7 +4645,7 @@ var init_generate_image = __esm({
         return buffer;
       }
     };
-    generate_image = async (args) => {
+    generate_image = async (args, context = {}) => {
       const parsed = parseArgs(args);
       const prompt = parsed.prompt || parsed.text;
       const outputPath = parsed.path || parsed.outputPath || parsed.output || "generated_image.png";
@@ -4765,6 +4773,10 @@ var init_generate_image = __esm({
           ".webp": "image/webp"
         };
         const mimeType = mimeMap[ext] || "image/png";
+        const isMultiModal = context.isMultiModal !== false;
+        if (!isMultiModal) {
+          return `SUCCESS: Image successfully generated from prompt [${prompt}] and saved to [${outputPath}].`;
+        }
         return {
           text: `SUCCESS: Image successfully generated from prompt [${prompt}] and saved to [${outputPath}]. Output attached to multimodal part`,
           binaryPart: {
@@ -4949,7 +4961,7 @@ var init_tools = __esm({
 import { GoogleGenAI, ThinkingLevel, HarmBlockThreshold, HarmCategory } from "@google/genai";
 import path16 from "path";
 import fs17 from "fs";
-var client, TERMINATION_SIGNAL, stripAnsi2, fetchWithBackoff, getDeepSeekStream, getOpenRouterStream, signalTermination, TOOL_LABELS2, getToolDetail, runJanitorTask, getActiveToolContext, getContextSafeText, contextSafeReplace, getSanitizedText, detectToolCalls, initAI, generateSimpleContent, consolidatePastMemories, getAIStream;
+var client, TERMINATION_SIGNAL, MULTIMODAL_MODELS, isModelMultimodal, stripAnsi2, fetchWithBackoff, getDeepSeekStream, getOpenRouterStream, signalTermination, TOOL_LABELS2, getToolDetail, runJanitorTask, getActiveToolContext, getContextSafeText, contextSafeReplace, getSanitizedText, detectToolCalls, initAI, generateSimpleContent, consolidatePastMemories, getAIStream;
 var init_ai = __esm({
   async "src/utils/ai.js"() {
     await init_prompts();
@@ -4963,6 +4975,34 @@ var init_ai = __esm({
     init_revert();
     client = null;
     TERMINATION_SIGNAL = false;
+    MULTIMODAL_MODELS = [
+      // OpenRouter models
+      "google/gemma-4-31b-it:free",
+      "moonshotai/kimi-k2.6:free",
+      "google/gemini-3.5-flash",
+      "qwen/qwen3.7-plus",
+      "minimax/minimax-m3",
+      "anthropic/claude-sonnet-4.5",
+      "anthropic/claude-opus-4.6",
+      "anthropic/claude-opus-4.8",
+      "openai/gpt-5.2-codex",
+      "openai/gpt-5.2-pro",
+      "openai/gpt-5.5-pro",
+      "moonshotai/kimi-k2.6",
+      // Google models
+      "gemma-4-31b-it",
+      "gemini-2.5-flash",
+      "gemini-3-flash-preview",
+      "gemini-3.5-flash",
+      "gemini-3.1-flash-lite",
+      "gemini-3.1-pro-preview"
+    ];
+    isModelMultimodal = (model) => {
+      if (!model) return false;
+      const lower = model.toLowerCase();
+      if (lower.startsWith("gemini-") || lower.startsWith("gemma-")) return true;
+      return MULTIMODAL_MODELS.some((m) => m.toLowerCase() === lower);
+    };
     stripAnsi2 = (str) => {
       if (typeof str !== "string") return str;
       return str.replace(/[\u001b\u009b][[()#;?]*(?:[0-9]{1,4}(?:;[0-9]{0,4})*)?[0-9A-ORZcf-nqry=><]/g, "");
@@ -5099,7 +5139,7 @@ var init_ai = __esm({
           } catch (e) {
           }
         }
-        if (Date.now() - lastFlushTime >= 100 && hasNewData) {
+        if (Date.now() - lastFlushTime >= 150 && hasNewData) {
           yield {
             candidates: pendingParts.length > 0 ? [{ content: { parts: [...pendingParts] } }] : [],
             usageMetadata: latestUsageMetadata
@@ -5236,7 +5276,7 @@ var init_ai = __esm({
           } catch (e) {
           }
         }
-        if (Date.now() - lastFlushTime >= 100 && hasNewData) {
+        if (Date.now() - lastFlushTime >= 150 && hasNewData) {
           yield {
             candidates: pendingParts.length > 0 ? [{ content: { parts: [...pendingParts] } }] : [],
             usageMetadata: latestUsageMetadata
@@ -5789,12 +5829,12 @@ ${newMemoryListStr}
           }
         }
         let attempts = 0;
-        const maxAttempts = 3;
+        const maxAttempts = 5;
         let success = false;
-        let targetModel = "gemini-3.1-flash-lite";
+        let targetModel = "gemma-4-26b-a4b-it";
         if (aiProvider === "OpenRouter") targetModel = "google/gemma-4-26b-a4b-it:free";
         if (aiProvider === "DeepSeek") targetModel = "deepseek-v4-flash";
-        while (attempts < maxAttempts && !success) {
+        while (attempts <= maxAttempts && !success) {
           attempts++;
           try {
             const response = await generateSimpleContent(settings, targetModel, prompt, null, "Fast");
@@ -5830,7 +5870,8 @@ ${newMemoryListStr}
       }
     };
     getAIStream = async function* (modelName, history, settings, steeringCallback, versionFluxflow2) {
-      const { profile, thinkingLevel, mode, janitorModel, chatId, systemSettings, sessionStats, aiProvider = "Google", isMultiModal } = settings;
+      const { profile, thinkingLevel, mode, janitorModel, chatId, systemSettings, sessionStats, aiProvider = "Google", apiTier } = settings;
+      const isMultiModal = isModelMultimodal(modelName);
       if (!client && aiProvider === "Google") throw new Error("AI not initialized");
       const isMemoryEnabled = systemSettings?.memory !== false;
       const originalText = history[history.length - 1].text;
@@ -5844,8 +5885,14 @@ ${newMemoryListStr}
       await RevertManager.startTransaction(chatId, agentText);
       try {
         let modifiedHistory = [...history.slice(0, -1)];
-        if (systemSettings?.compression === 0 && (sessionStats?.tokens || 0) > 244e3) {
-          yield { type: "status", content: "Condensing session context..." };
+        let contextCompressionCount = 252e3;
+        let contextTruncationCount = 254e3;
+        if (aiProvider === "DeepSeek" || aiProvider === "Google" && apiTier === "Paid") {
+          contextCompressionCount = 396e3;
+          contextTruncationCount = 4e5;
+        }
+        if (systemSettings?.compression === 0 && (sessionStats?.tokens || 0) > contextCompressionCount) {
+          yield { type: "status_history", content: "Context Limit Reached. Condensing session history..." };
           const flattenContext = (hist) => {
             return hist.filter((m) => (m.role === "user" || m.role === "agent" || m.role === "system") && !String(m.id).startsWith("welcome") && !m.isMeta).map((m) => {
               const role = m.text?.startsWith("[TOOL RESULT]") ? "TOOL" : m.role === "agent" ? "AGENT" : "USER";
@@ -5864,23 +5911,32 @@ Provide a new consolidated summary of the entire session.` : `Here is the conver
 ${flattenedText2}
 Provide a consolidated summary of the entire session.`;
-            let targetModel = "gemini-3.1-flash-lite";
+            let targetModel = "gemma-4-26b-a4b-it";
             if (aiProvider === "OpenRouter") targetModel = "google/gemma-4-26b-a4b-it:free";
             if (aiProvider === "DeepSeek") targetModel = "deepseek-v4-flash";
-            try {
-              const response = await generateSimpleContent(settings, targetModel, prompt, systemInstruction, "Fast");
-              return response.text || "";
-            } catch (err) {
-              if (aiProvider === "Google") {
-                try {
-                  const fallback = await generateSimpleContent(settings, "gemini-2.5-flash", prompt, systemInstruction, "Fast");
-                  return fallback.text || "";
-                } catch (e) {
+            let attempts = 0;
+            let success = false;
+            let response = null;
+            while (attempts <= 3 && !success) {
+              attempts++;
+              try {
+                response = await generateSimpleContent(settings, targetModel, prompt, systemInstruction, "Fast");
+                success = true;
+              } catch (err) {
+                if (attempts > 3) {
+                  if (aiProvider === "Google") {
+                    try {
+                      const fallback = await generateSimpleContent(settings, "gemini-3.1-flash-lite", prompt, systemInstruction, "Fast");
+                      return fallback.text || "";
+                    } catch (e) {
+                      return "";
+                    }
+                  }
                   return "";
                 }
               }
-              return "";
             }
+            return response ? response.text || "" : "";
           };
           const flattenedText = flattenContext(modifiedHistory);
           const summaries2 = readEncryptedJson(summariesFile, {});
@@ -5905,9 +5961,6 @@ Provide a consolidated summary of the entire session.`;
             wasCompressedInStream = true;
           }
         }
-        if (systemSettings?.compression === 0 && (sessionStats?.tokens || 0) > 254e3) {
-          modifiedHistory = getTruncatedHistory(modifiedHistory, 6);
-        }
         if (isFirstPrompt && isMemoryEnabled) {
           yield { type: "status", content: "Condensing past chat memories..." };
           await consolidatePastMemories(chatId, settings);
@@ -6206,6 +6259,9 @@ ${thinkingLevel != "Fast" && aiProvider === "Google" ? `${modelName.toLowerCase(
           }
         });
         for (let loop = 0; loop <= MAX_LOOPS; loop++) {
+          if (systemSettings?.compression === 0 && (sessionStats?.tokens || 0) > contextTruncationCount) {
+            modifiedHistory = getTruncatedHistory(modifiedHistory, 6);
+          }
           if (loop > 0) {
             yield { type: "status", content: "Processed. Reconnecting..." };
           }
@@ -6265,7 +6321,7 @@ ${thinkingLevel != "Fast" && aiProvider === "Google" ? `${modelName.toLowerCase(
               }
               const contents = modifiedHistory.filter((msg) => (msg.role === "user" || msg.role === "agent" || msg.role === "system") && !String(msg.id).startsWith("welcome") && !msg.isMeta && !msg.isTerminalRecord && !(msg.text && msg.text.startsWith("[TERMINAL_RECORD]"))).map((msg, idx, arr) => {
                 const parts = [{ text: msg.text }];
-                if (msg.binaryPart) {
+                if (msg.binaryPart && isModelMultimodal(targetModel)) {
                   const physicalUserTurnsAfter = arr.slice(idx + 1).filter((m) => m.role === "user" && !m.text?.startsWith("[TOOL RESULT]")).length;
                   if (physicalUserTurnsAfter <= 2) {
                     parts.push(msg.binaryPart);
@@ -6628,7 +6684,7 @@ ${thinkingLevel != "Fast" && aiProvider === "Google" ? `${modelName.toLowerCase(
                     await new Promise((resolve) => setTimeout(resolve, 3e3));
                     break;
                   }
-                  const toolActionableText = turnText.replace(/(?:<think>|\[think\])[\s\S]*?(?:<\/think>|\[\/think\]|$)/gi, "");
+                  const toolActionableText = turnText.replace(/(?:<(think|thought|thoughts)>|\[(think|thought|thoughts)\])[\s\S]*?(?:<\/(think|thought|thoughts)>|\[\/(think|thought|thoughts)\]|$)/gi, "");
                   const allToolsFound = detectToolCalls(toolActionableText);
                   while (allToolsFound.length > toolCallPointer) {
                     const toolCall = allToolsFound[toolCallPointer];
@@ -6890,7 +6946,8 @@ ${boxBottom}` };
                       onChunk: (chunk2) => settings.onExecChunk ? settings.onExecChunk(chunk2) : null,
                       onAskUser: settings.onAskUser,
                       systemSettings: settings.systemSettings,
-                      mode
+                      mode,
+                      isMultiModal: isModelMultimodal(targetModel)
                     });
                     yield { type: "spinner", content: true };
                     if (process.stdout.isTTY) {
@@ -6948,7 +7005,7 @@ ${boxBottom}` };
                     if (normToolName === "memory" && result.includes("SUCCESS")) yield { type: "memory_updated" };
                     toolCallPointer++;
                   }
-                  if (aiProvider === "Google" && pendingGoogleText && Date.now() - lastGoogleFlushTime >= 100) {
+                  if (aiProvider === "Google" && pendingGoogleText && Date.now() - lastGoogleFlushTime >= 150) {
                     yield { type: "text", content: pendingGoogleText };
                     pendingGoogleText = "";
                     lastGoogleFlushTime = Date.now();
@@ -9185,7 +9242,7 @@ ${timestamp}` };
         let isFirstPacket = true;
         try {
           const rawHistory = [...messages, userMessage].filter(
-            (m) => m.role !== "think" && !m.isVisualFeedback && !String(m.id).startsWith("welcome")
+            (m) => m.role !== "think" && !m.isVisualFeedback && !m.isMeta && !String(m.id).startsWith("welcome")
           );
           const cleanHistoryForAI = [];
           rawHistory.forEach((m, idx) => {
@@ -9208,9 +9265,6 @@ ${timestamp}` };
               text
             });
           });
-          const modelCmd = COMMANDS.find((c) => c.cmd === "/model");
-          const currentModelObj = modelCmd?.subs?.find((s) => s.cmd === activeModel);
-          const isMultiModal = currentModelObj?.desc?.toLowerCase().includes("multimodal");
           const stream = getAIStream(
             activeModel,
             cleanHistoryForAI,
@@ -9222,9 +9276,9 @@ ${timestamp}` };
               janitorModel,
               sessionStats,
               chatId,
-              isMultiModal,
               aiProvider,
               apiKey,
+              apiTier,
               cols: terminalSize.columns - 6,
               rows: 30,
               onExecStart: (cmd) => {
@@ -9372,6 +9426,11 @@ Selection: ${val}`,
               setStatusText(packet.content);
               continue;
             }
+            if (packet.type === "status_history") {
+              setStatusText(packet.content);
+              setMessages((prev) => [...prev, { id: "condense-" + Date.now(), role: "system", text: `\u2699\uFE0F [SYSTEM] ${packet.content}`, isMeta: true }]);
+              continue;
+            }
             if (packet.type === "spinner") {
               setIsSpinnerActive(packet.content);
               continue;
@@ -10298,7 +10357,8 @@ Selection: ${val}`,
       tokensTotal: sessionStats.tokens,
       chatId,
       isMemoryEnabled: systemSettings.memory,
-      apiTier
+      apiTier,
+      aiProvider
     }
   )), activeView === "exit" && (() => {
     const wallTimeMs = Date.now() - SESSION_START_TIME;

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
 	"name": "fluxflow-cli",
-	"version": "1.21.1",
+	"version": "1.21.3",
 	"date": "2026-06-06",
 	"description": "A high-fidelity agentic terminal assistant for the Flux Era.",
 	"keywords": [