openclacky 1.0.0.beta.3 → 1.0.0.beta.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (35)
  1. checksums.yaml +4 -4
  2. data/CHANGELOG.md +36 -4
  3. data/lib/clacky/agent/message_compressor.rb +46 -8
  4. data/lib/clacky/agent/message_compressor_helper.rb +18 -2
  5. data/lib/clacky/agent/session_serializer.rb +23 -1
  6. data/lib/clacky/agent/tool_executor.rb +14 -4
  7. data/lib/clacky/agent.rb +31 -0
  8. data/lib/clacky/agent_config.rb +16 -1
  9. data/lib/clacky/brand_config.rb +16 -8
  10. data/lib/clacky/client.rb +10 -1
  11. data/lib/clacky/default_skills/new/SKILL.md +13 -5
  12. data/lib/clacky/default_skills/recall-memory/SKILL.md +0 -1
  13. data/lib/clacky/message_format/open_ai.rb +80 -3
  14. data/lib/clacky/providers.rb +7 -18
  15. data/lib/clacky/server/browser_manager.rb +25 -2
  16. data/lib/clacky/server/channel/adapters/feishu/bot.rb +43 -3
  17. data/lib/clacky/server/channel/channel_ui_controller.rb +2 -2
  18. data/lib/clacky/server/web_ui_controller.rb +1 -1
  19. data/lib/clacky/tools/browser.rb +0 -57
  20. data/lib/clacky/tools/file_reader.rb +26 -10
  21. data/lib/clacky/tools/security.rb +67 -38
  22. data/lib/clacky/tools/terminal/persistent_session.rb +16 -6
  23. data/lib/clacky/tools/terminal.rb +117 -12
  24. data/lib/clacky/tools/todo_manager.rb +117 -30
  25. data/lib/clacky/utils/login_shell.rb +72 -0
  26. data/lib/clacky/utils/model_pricing.rb +44 -0
  27. data/lib/clacky/version.rb +1 -1
  28. data/lib/clacky/web/app.css +7 -0
  29. data/lib/clacky/web/index.html +7 -1
  30. data/lib/clacky/web/onboard.js +40 -4
  31. data/lib/clacky/web/sessions.js +2 -2
  32. data/lib/clacky.rb +1 -1
  33. data/scripts/install.ps1 +76 -68
  34. metadata +2 -2
  35. data/lib/clacky/tools/run_project.rb +0 -295
checksums.yaml CHANGED
@@ -1,7 +1,7 @@
1
1
  ---
2
2
  SHA256:
3
- metadata.gz: b145934170a510f46e3263c3fbce94acf618cdd416c93e17a7758984361c02b7
4
- data.tar.gz: 618fdf4917ce68514e0332ad44c522afa061a21884d6e95511f564da985f8433
3
+ metadata.gz: 39e25cd04a3d01fdacbb0382c2c367a1e72e8d2be88408e7fb29f804b3af1ba6
4
+ data.tar.gz: 492ca66bcfb55a6cfc3f2cf38f171ce983f142a7a4b0f8655e5aafa317b79a69
5
5
  SHA512:
6
- metadata.gz: 50a63fc4087f97c9a3c3242e2e379da95de6e73cdac3bb75ab11ad67bc03eb151afaf47e3a229bd769a4790f57e3b116309c0908807f35618ae64222feb30575
7
- data.tar.gz: f492569e101a0b6af312c65cffa244b29a5c47d79201129214e66ac62db7181c29678d64c9ddec88e6dc61525cf2442b0f22647e8dcdb430e0bbc87ca9f1a370
6
+ metadata.gz: 014eeb8227bcc4cd94104a1da3bb2877083a1c70c4baaaf408233eec57ef684cbc2bcbac632ca52a771e2f1a8f436f2a09d89b697a165f1147891cabfe3708a0
7
+ data.tar.gz: cc54f77d960bfd2db73906b713a84d0da6465fc18c65d9ec3ceb75d250bf426adaf4d9ba42c71900beab889bb6acf6a6472fa3843420fec8bbd3460a13f00088
data/CHANGELOG.md CHANGED
@@ -7,10 +7,42 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
7
7
 
8
8
  ## [Unreleased]
9
9
 
10
+ ## [1.0.0.beta.5] - 2026-04-29
11
+
12
+ ### Added
13
+ - **WSL2 mirrored networking mode for localhost access.** Windows users running under WSL2 can now configure mirrored networking, allowing the Clacky server to be reached at `localhost` from the Windows host instead of needing to look up the WSL IP address.
14
+ - **Message compressor preserves chunk order.** Compression chunks are now consistently ordered with `chunk-nn` naming, making it easier to browse and understand compressed conversation history.
15
+ - **Session model is now saved.** The currently active model selection is persisted in session data, so it survives page refreshes and server restarts.
16
+ - **Feedback button styling in Web UI.** The feedback interface now has improved CSS styling for a better user experience.
17
+
18
+ ### Improved
19
+ - **Fewer LLM turns for common tool operations.** The file reader, security tool, and todo manager have been optimized to require fewer round-trips with the AI model, making tasks faster and cheaper.
20
+ - **Terminal now supports mise-based Node.js.** The terminal tool correctly resolves Node.js when it is installed through the `mise` version manager, not just `nvm` or system paths.
21
+
22
+ ### Fixed
23
+ - **Browser MCP connection recovers from crashes.** The browser tool's MCP daemon handles process restarts more gracefully, and stale Node.js detection code has been cleaned up.
24
+ - **Brand configuration no longer crashes on empty data.** When brand config data is empty or missing, the system now handles it gracefully instead of raising an error.
25
+ - **Kimi K2.5 and K2.6 models now show correct pricing.** These models are now in the pricing table, so cost tracking reflects actual usage costs.
26
+ - **Feishu messages with images no longer silently dropped.** Image markdown syntax in Feishu messages is now sanitized before sending, preventing the Feishu API from silently rejecting them.
27
+ - **Onboarding model selector and provider presets fixed.** The model combobox in the onboarding flow now works correctly, and provider presets are properly updated.
28
+ - **File reader now works correctly with OpenAI provider.** Files attached to sessions are now properly read and processed when using the OpenAI API format.
29
+ - **Image URLs with special tokens no longer mis-handled.** The message formatter no longer mis-handles image URLs containing special tokens (e.g., `bong`).
30
+
31
+ ### Changed
32
+ - **`run_project` tool removed.** This deprecated tool has been removed. Use the terminal tool to run commands in projects instead.
33
+
34
+ ### More
35
+ - Improved WSL2 detection in the Windows PowerShell installer
36
+ - Minor test and documentation fixes
37
+
38
+ ## [1.0.0.beta.4] - 2026-04-28
39
+
40
+ ### Fixed
41
+ - **Fix**: onboard.js was calling the defunct `POST /api/config` endpoint → now calls `POST /api/config/models`
42
+
10
43
  ## [1.0.0.beta.3] - 2026-04-28
11
44
 
12
45
  ### Added
13
- - **Gemini 2.5 Pro support.** The new `gemini2.5-pro` model is now available as a selectable option, giving you access to Google's latest flagship model.
14
46
  - **File attachments now support Markdown, plain text, and `.tar.gz` archives.** When you attach `.md`, `.txt`, or `.tar.gz` files to a session, the agent can read and reason over their contents directly.
15
47
  - **Image type auto-detection.** Image files are now correctly identified by their binary content (magic bytes), not just their file extension — preventing misclassified images from causing upload or vision errors.
16
48
 
@@ -28,7 +60,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
28
60
  - **New session creation supports model & working-directory options.** The Web UI "new session" dialog now lets you pick the model and starting directory up front, instead of having to adjust them after the session opens.
29
61
 
30
62
  ### Fixed
31
- - **System prompt now refreshes when you switch models.** Previously the system prompt captured at session start stuck around even after `/model` or `/provider` switches, which could leave model-specific instructions out of sync. The agent now re-injects the correct system prompt on every model change.
63
+ - **System prompt now refreshes when you switch models.** Previously the system prompt captured at session start stuck around even after model switches, which could leave model-specific instructions out of sync. The agent now re-injects the correct system prompt on every model change.
32
64
  - **Port 7070 properly released when the terminal tool exits.** A lingering listener on port 7070 could block subsequent runs; the terminal tool now cleans it up on shutdown.
33
65
  - **Windows installer uses `[IO.Path]::GetTempPath()` for the temp directory** (#58) — more reliable than `$env:TEMP` on systems where the env var is unset or points to a non-ASCII path.
34
66
 
@@ -36,7 +68,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
36
68
 
37
69
  ### Added
38
70
  - **Vision support — agents can now "see" images.** When you attach image files (PNG, JPG, GIF, WebP), the agent can analyze them visually with vision-capable models. Non-vision models automatically fall back to disk references instead of breaking.
39
- - **DeepSeek V4 (Clacky-DS) provider.** New `deepseekv4` provider preset with native DeepSeek API endpoint, supporting `deepseek-v4-pro` and `deepseek-v4-flash` models with accurate pricing.
71
+ - **DeepSeek V4 (Clacky-DS) provider.** New `deepseekv4` provider preset with native DeepSeek API endpoint, supporting `dsk-deepseek-v4-pro` and `dsk-deepseek-v4-flash` models with accurate pricing.
40
72
  - **Memory subagent.** Long-term memory management now runs as a dedicated background subagent — writes memories when the task reaches meaningful completion, instead of on every turn.
41
73
  - **Usage telemetry.** Anonymous usage data collection helps us understand how the product is used and prioritize improvements. No personal or conversation data is collected.
42
74
  - **Brand configuration auto-refresh.** White-label brand settings now refresh automatically when the WebUI starts up, no manual restart needed.
@@ -44,7 +76,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
44
76
  ### Improved
45
77
  - **Progress handles revamped.** Nested progress handles now hide/show automatically, ticker threads keep animations smooth, and fast-completing tasks no longer flash a pointless "done" message.
46
78
  - **Todo manager tool upgrades.** Batch add/remove multiple todos at once, and completed todos auto-clear when you add new ones.
47
- - **Model switching more robust.** CLI slash commands (`/model`, `/provider`) now work seamlessly, server-side routing handles dynamic endpoints correctly, and switching between all provider types is more reliable.
79
+ - **Model switching more robust.** CLI slash commands (/config) now work seamlessly, server-side routing handles dynamic endpoints correctly, and switching between all provider types is more reliable.
48
80
 
49
81
  ### Fixed
50
82
  - **Access key now persists via cookies.** The WebUI login key was stored only in `localStorage`, causing WebSocket connections to lose authentication. Now also written to a `clacky_access_key` cookie for consistent auth across all connection types.
@@ -94,12 +94,18 @@ module Clacky
94
94
  # @param recent_messages [Array<Hash>] Recent messages to preserve
95
95
  # @param chunk_path [String, nil] Path to the archived chunk MD file (if saved)
96
96
  # @return [Array<Hash>] Rebuilt message list: system + compressed + recent
97
- def rebuild_with_compression(compressed_content, original_messages:, recent_messages:, chunk_path: nil)
97
+ def rebuild_with_compression(compressed_content, original_messages:, recent_messages:, chunk_path: nil, topics: nil, previous_chunks: [])
98
98
  # Find and preserve system message
99
99
  system_msg = original_messages.find { |m| m[:role] == "system" }
100
100
 
101
- # Parse the compressed result
102
- parsed_messages = parse_compressed_result(compressed_content, chunk_path: chunk_path)
101
+ # Parse the compressed result, embedding previous chunk references so the
102
+ # new summary carries a complete index of all older archives. This avoids
103
+ # keeping all prior compressed_summary messages in active history while
104
+ # still giving the AI a path to find old conversations via file_reader.
105
+ parsed_messages = parse_compressed_result(compressed_content,
106
+ chunk_path: chunk_path,
107
+ topics: topics,
108
+ previous_chunks: previous_chunks)
103
109
 
104
110
  # If parsing fails or returns empty, raise error
105
111
  if parsed_messages.nil? || parsed_messages.empty?
@@ -124,7 +130,7 @@ module Clacky
124
130
  m ? m[1].strip : nil
125
131
  end
126
132
 
127
- def parse_compressed_result(result, chunk_path: nil)
133
+ def parse_compressed_result(result, chunk_path: nil, topics: nil, previous_chunks: [])
128
134
  # Return the compressed result as a single user message (role: "user").
129
135
  #
130
136
  # Why role:"user" instead of "assistant":
@@ -144,6 +150,10 @@ module Clacky
144
150
  # The `compressed_summary: true` flag is preserved so that replay_history still
145
151
  # routes this message through the chunk-expansion path (which keys off that flag,
146
152
  # not the role).
153
+ #
154
+ # @param topics [String, nil] Short topic description extracted from <topics> tag
155
+ # @param previous_chunks [Array<Hash>] Info about older chunk files
156
+ # Each hash: { basename:, path:, topics: }
147
157
  content = result.to_s.strip
148
158
 
149
159
  if content.empty?
@@ -152,22 +162,50 @@ module Clacky
152
162
  # Strip out the <topics> block — it's metadata for the chunk file, not for AI context
153
163
  content_without_topics = content.gsub(/<topics>.*?<\/topics>\n*/m, "").strip
154
164
 
155
- # Inject chunk anchor so AI knows where to find original conversation
165
+ # Build a previous-chunks index section that links to older chunk files so the AI
166
+ # can find earlier conversations without keeping all prior compressed_summary
167
+ # messages in the active history. Shows newest chunks first (reverse order),
168
+ # capped at 10 to keep the message size bounded.
169
+ previous_chunks_section = ""
170
+ if previous_chunks.any?
171
+ max_visible = 10
172
+ visible = previous_chunks.last(max_visible).reverse
173
+ older_count = previous_chunks.size - visible.size
174
+
175
+ previous_chunks_section = "\n\n---\n📁 **Previous chunks (newest first):**\n"
176
+ visible.each do |pc|
177
+ topic_str = pc[:topics] ? " — #{pc[:topics]}" : ""
178
+ previous_chunks_section += "- `#{pc[:basename]}`#{topic_str}\n"
179
+ end
180
+
181
+ if older_count > 0
182
+ oldest = previous_chunks.first
183
+ previous_chunks_section += "- ... and #{older_count} older chunks back to `#{oldest[:basename]}`\n"
184
+ end
185
+
186
+ previous_chunks_section += "_Use `file_reader` to recall details from these chunks._"
187
+ end
188
+
189
+ # Inject chunk anchor so AI knows where to find original conversation for THIS chunk
190
+ anchor = ""
156
191
  if chunk_path
157
- anchor = "\n\n---\n📁 **Original conversation archived at:** `#{chunk_path}`\n" \
192
+ anchor = "\n\n---\n📁 **Current chunk archived at:** `#{chunk_path}`\n" \
158
193
  "_Use `file_reader` tool to recall details from this chunk._"
159
- content_without_topics = content_without_topics + anchor
160
194
  end
161
195
 
162
196
  # Prefix lets the model recognise this is injected context, not a user utterance.
197
+ # Order: summary → previous chunks → current anchor (chronological)
163
198
  framed_content = "[Compressed conversation summary — previous turns archived]\n\n" \
164
- "#{content_without_topics}"
199
+ "#{content_without_topics}" \
200
+ "#{previous_chunks_section}" \
201
+ "#{anchor}"
165
202
 
166
203
  [{
167
204
  role: "user",
168
205
  content: framed_content,
169
206
  compressed_summary: true,
170
207
  chunk_path: chunk_path,
208
+ topics: topics,
171
209
  system_injected: true
172
210
  }]
173
211
  end
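For orientation, here is a minimal sketch of the single message `parse_compressed_result` now returns, with the previous-chunks index and current-chunk anchor appended. The chunk names, paths, and topics are illustrative; the field names and framing text come from the hunk above.

```ruby
# Illustrative only: mirrors the message shape built in parse_compressed_result.
summary_message = {
  role: "user",
  content: <<~TEXT,
    [Compressed conversation summary — previous turns archived]

    ...compressed summary text from the LLM, <topics> block stripped...

    ---
    📁 **Previous chunks (newest first):**
    - `chunk-02.md` — CI pipeline fixes
    - `chunk-01.md` — auth refactor
    _Use `file_reader` to recall details from these chunks._

    ---
    📁 **Current chunk archived at:** `/tmp/session/chunk-03.md`
    _Use `file_reader` tool to recall details from this chunk._
  TEXT
  compressed_summary: true,
  chunk_path: "/tmp/session/chunk-03.md",
  topics: "payment flow rework",
  system_injected: true
}
```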
@@ -160,19 +160,35 @@ module Clacky
160
160
  # chunk files, creating circular chunk references. Counting from history is always accurate.
161
161
  existing_chunk_count = original_messages.count { |m| m[:compressed_summary] }
162
162
  chunk_index = existing_chunk_count + 1
163
+
164
+ # Extract topics from the LLM response to store in both the chunk MD front
165
+ # matter and the compressed_summary message hash (for future chunk indexing).
166
+ topics = @message_compressor.parse_topics(compressed_content)
167
+
163
168
  chunk_path = save_compressed_chunk(
164
169
  original_messages,
165
170
  compression_context[:recent_messages],
166
171
  chunk_index: chunk_index,
167
172
  compression_level: compression_context[:compression_level],
168
- topics: @message_compressor.parse_topics(compressed_content)
173
+ topics: topics
169
174
  )
170
175
 
176
+ # Collect previous chunk references so the new summary carries a complete
177
+ # index of all older archives. Without this, each new compression would
178
+ # lose all prior chunk references — leaving only the newest chunk reachable
179
+ # via replay_history. The AI can still access older chunks via file_reader
180
+ # using the embedded basenames and topics.
181
+ previous_chunks = original_messages
182
+ .select { |m| m[:compressed_summary] && m[:chunk_path] }
183
+ .map { |m| { basename: File.basename(m[:chunk_path]), path: m[:chunk_path], topics: m[:topics] } }
184
+
171
185
  @history.replace_all(@message_compressor.rebuild_with_compression(
172
186
  compressed_content,
173
187
  original_messages: original_messages,
174
188
  recent_messages: compression_context[:recent_messages],
175
- chunk_path: chunk_path
189
+ chunk_path: chunk_path,
190
+ topics: topics,
191
+ previous_chunks: previous_chunks
176
192
  ))
177
193
 
178
194
  # Reset to the estimated size of the rebuilt (small) history.
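As a concrete example of the index collected here, prior `compressed_summary` messages in history map to `previous_chunks` like this (message contents and paths are made up; the select/map mirrors the hunk above):

```ruby
original_messages = [
  { role: "system", content: "..." },
  { role: "user", content: "...", compressed_summary: true,
    chunk_path: "/tmp/session/chunk-01.md", topics: "auth refactor" },
  { role: "assistant", content: "..." }
]

previous_chunks = original_messages
  .select { |m| m[:compressed_summary] && m[:chunk_path] }
  .map { |m| { basename: File.basename(m[:chunk_path]), path: m[:chunk_path], topics: m[:topics] } }

# => [{ basename: "chunk-01.md", path: "/tmp/session/chunk-01.md", topics: "auth refactor" }]
```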
@@ -54,6 +54,20 @@ module Clacky
54
54
  @pending_error_rollback = true
55
55
  end
56
56
 
57
+ # Restore the session's original model if it still exists in the current
58
+ # config. This prevents all sessions from silently switching to the new
59
+ # default model when the user changes it and restarts. Falls back to the
60
+ # current default if the model was deleted/renamed since the session was
61
+ # last saved.
62
+ saved_model_name = session_data.dig(:config, :model_name)
63
+ if saved_model_name
64
+ saved_base_url = session_data.dig(:config, :model_base_url)
65
+ model_entry = @config.find_model_by_name_and_url(saved_model_name, saved_base_url)
66
+ if model_entry && model_entry["id"]
67
+ switch_model_by_id(model_entry["id"])
68
+ end
69
+ end
70
+
57
71
  # Rebuild and refresh the system prompt so any newly installed skills
58
72
  # (or other configuration changes since the session was saved) are
59
73
  # reflected immediately — without requiring the user to create a new session.
@@ -98,11 +112,19 @@ module Clacky
98
112
  config: {
99
113
  # NOTE: api_key and other sensitive credentials are intentionally excluded
100
114
  # to prevent leaking secrets into session files on disk.
115
+ # model_name is saved so the session can restore its original model on restart
116
+ # (falling back to the current default if the model no longer exists).
101
117
  permission_mode: @config.permission_mode.to_s,
102
118
  enable_compression: @config.enable_compression,
103
119
  enable_prompt_caching: @config.enable_prompt_caching,
104
120
  max_tokens: @config.max_tokens,
105
- verbose: @config.verbose
121
+ verbose: @config.verbose,
122
+ # Persist the current model identity so the session can restore its
123
+ # original model on restart. model_name + model_base_url form a
124
+ # composite key to avoid matching a different provider's model of
125
+ # the same name. Falls back to default if the model no longer exists.
126
+ model_name: @config.current_model&.dig("model"),
127
+ model_base_url: @config.current_model&.dig("base_url")
106
128
  },
107
129
  stats: stats_data,
108
130
  messages: @history.to_a
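For reference, a sketch of the non-sensitive config block this hunk now writes into the session file. The values are illustrative; the key names are the ones added above.

```ruby
# Illustrative session config payload; api_key is intentionally absent.
config_payload = {
  permission_mode: "default",
  enable_compression: true,
  enable_prompt_caching: true,
  max_tokens: 16384,
  verbose: false,
  # Composite key used to restore the session's original model after a restart
  # (see find_model_by_name_and_url in agent_config.rb):
  model_name: "dsk-deepseek-v4-pro",
  model_base_url: "https://api.example.com/v1"
}
puts config_payload.inspect
```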
@@ -169,6 +169,17 @@ module Clacky
169
169
  # Inject TODO reminder for non-todo_manager tools
170
170
  formatted_result = inject_todo_reminder(call[:name], formatted_result)
171
171
 
172
+ # Extract image_inject sidecar before building the tool content string.
173
+ # image_inject carries the base64 payload that must be delivered as a
174
+ # follow-up `role:"user"` message (OpenAI/OpenRouter/Gemini only accept
175
+ # image_url blocks in user messages, not in tool messages).
176
+ # Strip it from the content sent to the API so it isn't tokenised as text.
177
+ image_inject = nil
178
+ if formatted_result.is_a?(Hash) && formatted_result[:image_inject]
179
+ image_inject = formatted_result[:image_inject]
180
+ formatted_result = formatted_result.reject { |k, _| k == :image_inject }
181
+ end
182
+
172
183
  # If the tool returned a plain string, use it directly (avoids double-escaping).
173
184
  # If it returned an Array (e.g. multipart vision blocks with image + text),
174
185
  # pass it through as-is so format_tool_results can send it to the API.
@@ -182,10 +193,9 @@ module Clacky
182
193
  JSON.generate(formatted_result)
183
194
  end
184
195
 
185
- {
186
- id: call[:id],
187
- content: content
188
- }
196
+ result = { id: call[:id], content: content }
197
+ result[:image_inject] = image_inject if image_inject
198
+ result
189
199
  end
190
200
 
191
201
  # Build error result for tool execution
data/lib/clacky/agent.rb CHANGED
@@ -883,6 +883,36 @@ module Clacky
883
883
 
884
884
  formatted_messages = @client.format_tool_results(response, tool_results, model: current_model)
885
885
  formatted_messages.each { |msg| @history.append(msg.merge(task_id: @current_task_id)) }
886
+
887
+ # Append a follow-up `role:"user"` message for any image payloads that
888
+ # could not be delivered inside the tool message.
889
+ #
890
+ # Background: OpenAI-compatible APIs (OpenRouter, Gemini, GPT-4o, etc.)
891
+ # only accept image_url content blocks in `role:"user"` messages. Putting
892
+ # base64 data in a `role:"tool"` message causes it to be JSON-encoded as
893
+ # plain text, inflating token counts by 20-40x. The tool result carries a
894
+ # plain-text description for the LLM; the actual image is delivered here.
895
+ tool_results.each do |tr|
896
+ inject = tr[:image_inject]
897
+ next unless inject
898
+
899
+ mime_type = inject[:mime_type]
900
+ base64_data = inject[:base64_data]
901
+ path = inject[:path]
902
+ next unless mime_type && base64_data
903
+
904
+ data_url = "data:#{mime_type};base64,#{base64_data}"
905
+ image_content = [
906
+ { type: "text", text: "[Image from file_reader: #{File.basename(path.to_s)}]" },
907
+ { type: "image_url", image_url: { url: data_url } }
908
+ ]
909
+ @history.append({
910
+ role: "user",
911
+ content: image_content,
912
+ system_injected: true,
913
+ task_id: @current_task_id
914
+ })
915
+ end
886
916
  end
887
917
 
888
918
  # Interrupt the agent's current run
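To make the split concrete, here is a sketch of a tool result carrying the `image_inject` sidecar and the follow-up `role:"user"` message built from it. The file name, call id, and base64 payload are made up; the block types match the hunk above.

```ruby
# Illustrative tool result from file_reader: the text content describes the
# image, while image_inject carries the actual payload out-of-band.
tool_result = {
  id: "call_123",
  content: "Read screenshot.png (image/png, 42 KB); image delivered separately.",
  image_inject: {
    mime_type: "image/png",
    base64_data: "iVBORw0KGgoAAA", # truncated placeholder
    path: "screenshot.png"
  }
}

inject = tool_result[:image_inject]
data_url = "data:#{inject[:mime_type]};base64,#{inject[:base64_data]}"

# The image goes in a separate role:"user" message, because OpenAI-compatible
# APIs only accept image_url blocks in user messages.
follow_up = {
  role: "user",
  content: [
    { type: "text", text: "[Image from file_reader: #{File.basename(inject[:path])}]" },
    { type: "image_url", image_url: { url: data_url } }
  ],
  system_injected: true
}
```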
@@ -1397,6 +1427,7 @@ module Clacky
1397
1427
  ].compact.join(". ")
1398
1428
 
1399
1429
  content = "[Session context: #{parts}]"
1430
+
1400
1431
  @history.append({
1401
1432
  role: "user",
1402
1433
  content: content,
@@ -158,7 +158,7 @@ module Clacky
158
158
 
159
159
  def initialize(options = {})
160
160
  @permission_mode = validate_permission_mode(options[:permission_mode])
161
- @max_tokens = options[:max_tokens] || 8192
161
+ @max_tokens = options[:max_tokens] || 16384
162
162
  @verbose = options[:verbose] || false
163
163
  @enable_compression = options[:enable_compression].nil? ? true : options[:enable_compression]
164
164
  # Enable prompt caching by default for cost savings
@@ -549,6 +549,21 @@ module Clacky
549
549
  @models.find { |m| m["type"] == type }
550
550
  end
551
551
 
552
+ # Find model by composite key (model name + base_url).
553
+ # Used when restoring a session to match its original model without relying
554
+ # on the runtime-only id (which changes on every process restart).
555
+ # base_url is optional for backward compatibility with sessions saved
556
+ # before base_url was persisted.
557
+ # @param model_name [String] the model's "model" field (e.g. "dsk-deepseek-v4-pro")
558
+ # @param base_url [String, nil] the model's "base_url" field
559
+ # @return [Hash, nil] the matching model entry or nil
560
+ def find_model_by_name_and_url(model_name, base_url = nil)
561
+ @models.find do |m|
562
+ m["model"] == model_name &&
563
+ (base_url.nil? || m["base_url"] == base_url)
564
+ end
565
+ end
566
+
552
567
  # Get the default model (type: default)
553
568
  # Falls back to current_model for backward compatibility
554
569
  def default_model
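A self-contained sketch of the composite-key match this method performs. The model entries are made up; the second entry shows why `base_url` is needed to disambiguate same-named models from different providers.

```ruby
models = [
  { "id" => "m1", "model" => "kimi-k2.5", "base_url" => "https://api.moonshot.cn/v1" },
  { "id" => "m2", "model" => "kimi-k2.5", "base_url" => "https://proxy.example.com/v1" }
]

# Mirrors find_model_by_name_and_url: base_url is optional for sessions saved
# before it was persisted.
find = lambda do |model_name, base_url = nil|
  models.find do |m|
    m["model"] == model_name &&
      (base_url.nil? || m["base_url"] == base_url)
  end
end

p find.call("kimi-k2.5", "https://proxy.example.com/v1") # => the "m2" entry
p find.call("kimi-k2.5")                                 # => the "m1" entry (first match)
```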
@@ -964,16 +964,24 @@ module Clacky
964
964
  key = fetch_decryption_key(skill_id: skill_id, skill_version_id: skill_version_id)
965
965
 
966
966
  ciphertext = File.binread(enc_path)
967
- pt = aes_gcm_decrypt(key, ciphertext, file_meta["iv"], file_meta["tag"])
968
967
 
969
- # Integrity check
970
- actual = Digest::SHA256.hexdigest(pt)
971
- expected = file_meta["original_checksum"]
972
- if expected && actual != expected
973
- raise "Checksum mismatch for #{rel_plain}: expected #{expected}, got #{actual}"
974
- end
968
+ if ciphertext.nil? || ciphertext.empty?
969
+ # AES-GCM of empty data still produces 16+ bytes (auth tag + IV).
970
+ # A 0-byte file means the skill package is corrupted; skip
971
+ # decryption and produce an empty output so the skill can still run.
972
+ ""
973
+ else
974
+ pt = aes_gcm_decrypt(key, ciphertext, file_meta["iv"], file_meta["tag"])
975
+
976
+ # Integrity check
977
+ actual = Digest::SHA256.hexdigest(pt)
978
+ expected = file_meta["original_checksum"]
979
+ if expected && actual != expected
980
+ raise "Checksum mismatch for #{rel_plain}: expected #{expected}, got #{actual}"
981
+ end
975
982
 
976
- pt
983
+ pt
984
+ end
977
985
  else
978
986
  # Mock/plain skill: raw bytes
979
987
  File.binread(enc_path).force_encoding("UTF-8")
data/lib/clacky/client.rb CHANGED
@@ -15,6 +15,12 @@ module Clacky
15
15
  @use_anthropic_format = anthropic_format
16
16
  # Detect Bedrock: ABSK key prefix (native AWS) or abs- model prefix (Clacky AI proxy)
17
17
  @use_bedrock = MessageFormat::Bedrock.bedrock_api_key?(api_key, model)
18
+
19
+ # Determine vision support once at construction time.
20
+ # Non-vision models (DeepSeek, Kimi, MiniMax, etc.) reject image_url
21
+ # content blocks; the conversion layer strips them when this is false.
22
+ provider_id = Providers.resolve_provider(base_url: @base_url, api_key: @api_key)
23
+ @vision_supported = Providers.supports?(provider_id, :vision, model_name: @model)
18
24
  end
19
25
 
20
26
  # Returns true when the client is using the AWS Bedrock Converse API.
@@ -185,7 +191,10 @@ module Clacky
185
191
  # OpenRouter proxies Claude with the same cache_control field convention as Anthropic direct.
186
192
  messages = apply_message_caching(messages) if caching_enabled
187
193
 
188
- body = MessageFormat::OpenAI.build_request_body(messages, model, tools, max_tokens, caching_enabled)
194
+ body = MessageFormat::OpenAI.build_request_body(
195
+ messages, model, tools, max_tokens, caching_enabled,
196
+ vision_supported: @vision_supported
197
+ )
189
198
  response = openai_connection.post("chat/completions") { |r| r.body = body.to_json }
190
199
 
191
200
  raise_error(response) unless response.status == 200
@@ -183,12 +183,20 @@ Print a success summary:
183
183
  ```
184
184
 
185
185
  ### 4. Start Development Server
186
- After the script completes, use the run_project tool to start the server:
186
+ After the script completes, read the `.1024` config file in the project root
187
+ to find the `run_command`, then start it in the background via the terminal tool:
188
+
187
189
  ```
188
- run_project(action: "start")
190
+ # First, read .1024 to get the run_command (usually `bin/dev` for Rails):
191
+ file_reader(path: ".1024")
192
+
193
+ # Then start the server in the background:
194
+ terminal(command: "<run_command from .1024>", background: true)
189
195
  ```
190
196
 
191
- **Important**: If run_project executes without errors, the server has started successfully.
197
+ **Important**: If the terminal call returns a session_id (and no error), the
198
+ server has started successfully. You can inspect logs later by polling the
199
+ same session_id with an empty input.
192
200
 
193
201
  Then inform the user and ask what to develop next:
194
202
  ```
@@ -210,7 +218,7 @@ What would you like to develop next?
210
218
  - bin/setup fails → Show error, suggest running `./bin/setup` manually
211
219
  - Cloud project creation fails → Soft-fail with warning, continue to start server
212
220
  - workspace_key missing → Ask user interactively; skip cloud init if user declines
213
- - run_project fails → Check logs with `run_project(action: "output")` and verify database status
221
+ - Dev server fails to start → Poll the terminal session (empty input) to check logs and verify database status
214
222
 
215
223
  ## Example Interaction
216
224
  User: "/new"
@@ -224,5 +232,5 @@ Response:
224
232
  6. Project setup complete!
225
233
  7. Initializing cloud project binding...
226
234
  8. ✅ Cloud project created and config injected into config/application.yml!
227
- 9. Starting development server with run_project...
235
+ 9. Starting development server via terminal (background)...
228
236
  10. ✨ Server running! Visit http://localhost:3000
@@ -7,7 +7,6 @@ auto_summarize: true
7
7
  forbidden_tools:
8
8
  - write
9
9
  - edit
10
- - run_project
11
10
  - web_search
12
11
  - web_fetch
13
12
  - browser
@@ -27,15 +27,27 @@ module Clacky
27
27
  # ── Request building ──────────────────────────────────────────────────────
28
28
 
29
29
  # Build an OpenAI-compatible request body.
30
- # Canonical messages are already in OpenAI format — no conversion needed.
30
+ #
31
+ # Messages go through the canonical→OpenAI conversion layer
32
+ # (normalize_message_content). For most models this is a no-op because
33
+ # the internal canonical format IS OpenAI format. The conversion
34
+ # handles one edge case: image_url content blocks are stripped
35
+ # when vision_supported is false (e.g. DeepSeek, Kimi, MiniMax),
36
+ # replacing them with a text placeholder so the API doesn't reject
37
+ # the request with "unknown variant 'image_url'".
38
+ #
31
39
  # @param messages [Array<Hash>] canonical messages
32
40
  # @param model [String]
33
41
  # @param tools [Array<Hash>] OpenAI-style tool definitions
34
42
  # @param max_tokens [Integer]
35
43
  # @param caching_enabled [Boolean] (only effective for Claude via OpenRouter)
44
+ # @param vision_supported [Boolean] whether the target model accepts
45
+ # image_url content blocks (default true, conservative)
36
46
  # @return [Hash]
37
- def build_request_body(messages, model, tools, max_tokens, caching_enabled)
38
- body = { model: model, max_tokens: max_tokens, messages: messages }
47
+ def build_request_body(messages, model, tools, max_tokens, caching_enabled, vision_supported: true)
48
+ api_messages = messages.map { |msg| normalize_message_content(msg, vision_supported: vision_supported) }
49
+
50
+ body = { model: model, max_tokens: max_tokens, messages: api_messages }
39
51
 
40
52
  if tools&.any?
41
53
  if caching_enabled
@@ -50,6 +62,71 @@ module Clacky
50
62
  body
51
63
  end
52
64
 
65
+ # ── Canonical → OpenAI conversion ─────────────────────────────────────────
66
+
67
+ # Process a single message's content through the canonical→OpenAI
68
+ # conversion layer. For String content this is a no-op; for Array
69
+ # content each block goes through normalize_block.
70
+ #
71
+ # @param msg [Hash] canonical message
72
+ # @param vision_supported [Boolean]
73
+ # @return [Hash] message with content normalised for OpenAI API
74
+ def normalize_message_content(msg, vision_supported:)
75
+ content = msg[:content]
76
+ return msg unless content.is_a?(Array)
77
+
78
+ blocks = content_to_blocks(content, vision_supported: vision_supported)
79
+ # Most APIs reject empty content arrays — use a placeholder text block.
80
+ blocks = [{ type: "text", text: "..." }] if blocks.empty?
81
+ msg.merge(content: blocks)
82
+ end
83
+
84
+ # Convert canonical content array to OpenAI-compatible block array.
85
+ # Each block goes through normalize_block; nil results are compacted.
86
+ #
87
+ # @param content [Array<Hash>] canonical content blocks
88
+ # @param vision_supported [Boolean]
89
+ # @return [Array<Hash>]
90
+ def content_to_blocks(content, vision_supported:)
91
+ content.map { |b| normalize_block(b, vision_supported: vision_supported) }.compact
92
+ end
93
+
94
+ # Normalize a single canonical content block to OpenAI API format.
95
+ #
96
+ # Canonical text blocks pass through (with cache_control preserved).
97
+ # image_url blocks are kept for vision-capable models and replaced
98
+ # with a text placeholder for non-vision models (DeepSeek, Kimi, etc.).
99
+ #
100
+ # @param block [Hash] canonical content block
101
+ # @param vision_supported [Boolean]
102
+ # @return [Hash, nil] nil for empty-text blocks (dropped)
103
+ def normalize_block(block, vision_supported:)
104
+ return block unless block.is_a?(Hash)
105
+
106
+ case block[:type]
107
+ when "text"
108
+ # Drop empty text blocks — most APIs (Anthropic, DeepSeek, etc.)
109
+ # reject { type: "text", text: "" }.
110
+ text = block[:text]
111
+ return nil if text.nil? || text.empty?
112
+
113
+ result = { type: "text", text: text }
114
+ result[:cache_control] = block[:cache_control] if block[:cache_control]
115
+ result
116
+ when "image_url"
117
+ if vision_supported
118
+ block # Pass through — GPT-4V, Gemini, etc. accept image_url
119
+ else
120
+ # Replace with text placeholder so the API doesn't reject the
121
+ # request. The model will still see the context that an image
122
+ # was present (from file_prompt / system_injected metadata).
123
+ { type: "text", text: "[Image content removed — current model does not support vision input]" }
124
+ end
125
+ else
126
+ block # Pass through unknown block types (tool_use, tool_result, etc.)
127
+ end
128
+ end
129
+
53
130
  # ── Response parsing ──────────────────────────────────────────────────────
54
131
 
55
132
  # Parse OpenAI-compatible API response into canonical internal format.
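A self-contained sketch of the observable effect of the image handling above. The message content is made up; the placeholder text is the one from the hunk.

```ruby
content = [
  { type: "text", text: "What does this screenshot show?" },
  { type: "image_url", image_url: { url: "data:image/png;base64,iVBOR" } }
]

# Simplified stand-in for the non-vision path of normalize_block: when
# vision_supported is false, image_url blocks become text placeholders.
def strip_images(blocks)
  blocks.map do |b|
    next b unless b.is_a?(Hash) && b[:type] == "image_url"
    { type: "text", text: "[Image content removed — current model does not support vision input]" }
  end
end

pp strip_images(content)
# The text block passes through unchanged; the image block is replaced by the
# placeholder. Vision-capable models skip this path and receive image_url as-is.
```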
@@ -114,10 +114,10 @@ module Clacky
114
114
  "name" => "Kimi (Moonshot)",
115
115
  "base_url" => "https://api.moonshot.cn/v1",
116
116
  "api" => "openai-completions",
117
- "default_model" => "kimi-k2.5",
118
- "models" => ["kimi-k2.5"],
119
- # Kimi k2.5 (text family) does not accept image inputs.
120
- "capabilities" => { "vision" => false }.freeze,
117
+ "default_model" => "kimi-k2.6",
118
+ "models" => ["kimi-k2.6", "kimi-k2.5"],
119
+ # k2.5 / k2.6 are multimodal; legacy text-only k2 models would need a model_capabilities override if added.
120
+ "capabilities" => { "vision" => true }.freeze,
121
121
  "website_url" => "https://platform.moonshot.cn/console/api-keys"
122
122
  }.freeze,
123
123
 
@@ -136,29 +136,18 @@ module Clacky
136
136
  "api" => "bedrock",
137
137
  "default_model" => "abs-claude-sonnet-4-5",
138
138
  "models" => [
139
- "abs-claude-opus-4-7",
140
139
  "abs-claude-opus-4-6",
141
140
  "abs-claude-sonnet-4-6",
142
141
  "abs-claude-sonnet-4-5",
143
- "abs-claude-haiku-4-5",
144
- "dsk-deepseek-v4-pro",
145
- "dsk-deepseek-v4-flash",
146
- "or-gemini-3-1-pro"
142
+ "abs-claude-haiku-4-5"
147
143
  ],
148
- # Same lineup as openclacky: Claude is vision, DeepSeek is text-only,
149
- # Gemini inherits the provider-default vision=true.
144
+ # Claude familyall vision-capable.
150
145
  "capabilities" => { "vision" => true }.freeze,
151
- "model_capabilities" => {
152
- "dsk-deepseek-v4-pro" => { "vision" => false }.freeze,
153
- "dsk-deepseek-v4-flash" => { "vision" => false }.freeze
154
- }.freeze,
155
146
  # Per-primary lite pairing — see openclacky preset for rationale.
156
147
  "lite_models" => {
157
- "abs-claude-opus-4-7" => "abs-claude-haiku-4-5",
158
148
  "abs-claude-opus-4-6" => "abs-claude-haiku-4-5",
159
149
  "abs-claude-sonnet-4-6" => "abs-claude-haiku-4-5",
160
- "abs-claude-sonnet-4-5" => "abs-claude-haiku-4-5",
161
- "dsk-deepseek-v4-pro" => "dsk-deepseek-v4-flash"
150
+ "abs-claude-sonnet-4-5" => "abs-claude-haiku-4-5"
162
151
  },
163
152
  # Fallback chain: if a model is unavailable, try the next one in order.
164
153
  # Keys are primary model names; values are the fallback model to use instead.