RubyGems - llm.rb - Versions diffs - 11.2.0 → 11.3.0 - Mend

llm.rb 11.2.0 → 11.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13)

checksums.yaml +4 -4
data/CHANGELOG.md +57 -11
data/README.md +78 -66
data/lib/llm/agent.rb +25 -4
data/lib/llm/error.rb +4 -0
data/lib/llm/providers/anthropic/error_handler.rb +2 -0
data/lib/llm/providers/bedrock/error_handler.rb +1 -1
data/lib/llm/providers/google/error_handler.rb +2 -0
data/lib/llm/providers/ollama/error_handler.rb +2 -0
data/lib/llm/providers/openai/error_handler.rb +2 -0
data/lib/llm/version.rb +1 -1
data/llm.gemspec +3 -2
metadata +18 -5

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: bb1ffd1e0ecb17422014ec8f75c8b729f74d0a7cef4fd4e12681ef254411b24a
-  data.tar.gz: e5a7815d52c6fa99a38dec111c6d71aef9782272b97bfbad81e8f5bee913f918
+  metadata.gz: 314712380b36e57b1492cef3850f5c4c2397522b74d3cc913fc0d09a796d8973
+  data.tar.gz: aefda31d90067a0a49ada778c6658243595b6698cc11ecf342a11e26f69ad93b
 SHA512:
-  metadata.gz: d7c2d1dac8ef97a5be2540896828b523ce1491e43f2fd8c78e53b8fbab34432bf6dedaee066afd43d261fb8f4e2fb9f7c3d8c112de83e60ee809d6ef77f41feb
-  data.tar.gz: d5073e8c5c156739cff4c68c65c4efaf88403229841ada3cdd1adb4920dfb58c00017ae7dbd73d12ef17b6c191b0b71ab4f15fd0c0ae29991dcffb1b840381d0
+  metadata.gz: 3a998015696027d232e0865c60ff840d11155206b705443035f6af7dcbb18f52d0e82b019cc82379a7ca919b60e3e50bf4156c8c4388beb8ba47a5d57775354a
+  data.tar.gz: 83faf786980a3307a760aec9698e29129dd34f8a838fa8f596caa60498f029bf3b5b4ac9400dd662c70a67a8c1eb748266d89b777773c8185025c8b8c86754bd
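
The digests above cover the `metadata.gz` and `data.tar.gz` members of the packaged gem. As a minimal sketch (not part of the gem or of this diff), a digest like these could be checked locally with Ruby's standard `digest` library. The helper name `sha256_matches?` and the file path in the usage note are assumptions for illustration only.

```ruby
require "digest"

# Hypothetical helper: returns true when the SHA-256 hex digest of `bytes`
# equals `expected_hex` (compared case-insensitively, ignoring whitespace).
def sha256_matches?(bytes, expected_hex)
  Digest::SHA256.hexdigest(bytes) == expected_hex.strip.downcase
end
```

Assuming the gem archive has been unpacked so that `data.tar.gz` is in the current directory, `sha256_matches?(File.binread("data.tar.gz"), "aefda31d90067a0a49ada778c6658243595b6698cc11ecf342a11e26f69ad93b")` would be expected to hold for 11.3.0.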

data/CHANGELOG.md CHANGED

@@ -2,6 +2,52 @@
 ## Unreleased
+## v11.3.0
+Changes since `v11.2.0`.
+This release promotes `LLM::Agent` as the default high-level runtime,
+raises `LLM::NotFoundError` for provider 404 responses, and adds
+Symbol resolution to `LLM::Agent.confirm` and `LLM::Agent.skills` for
+dynamic tool confirmation and skill lists.
+### Add
+* **Raise `LLM::NotFoundError` for provider 404 responses** <br>
+  Raise `LLM::NotFoundError` when a provider returns HTTP 404. One
+  example is calling the embeddings API on DeepSeek
+  (`LLM.deepseek(...).embed(["foobar"])`), which returns 404 because
+  DeepSeek does not implement that endpoint.
+* **Add Symbol resolution to `LLM::Agent.confirm`** <br>
+  When `confirm` receives a single Symbol argument, it stores it
+  as-is instead of converting it to a string array. At initialization
+  time, `resolve_option` resolves the Symbol by calling the method
+  with that name on the agent instance, and the result is converted
+  to strings. This allows dynamic tool confirmation lists:
+      class MyAgent < LLM::Agent
+        confirm :tools_that_need_confirmation
+        def tools_that_need_confirmation
+          some_condition ? %w[delete destroy] : %w[delete]
+        end
+      end
+  Ported from llmrb/mruby-llm@89a232e3 and @2dd04e2d.
+  Extend the same pattern to `LLM::Agent.skills` so the skills DSL
+  accepts a Symbol that resolves through the agent instance at
+  initialization time.
+### Change
+* **Clarify `LLM::Agent` as the default high-level runtime** <br>
+  Document that `LLM::Context` remains at the heart of llm.rb, but
+  `LLM::Agent` is the better default unless an application needs advanced
+  manual tool loops. `LLM::Agent` manages the tool loop for callers and
+  enables guards against runaway or repeated tool-call loops.
 ## v11.2.0
 Changes since `v11.1.0`.
@@ -222,7 +268,7 @@ requests outside `#session`, `LLM::Function#def` as a short alias for
   Fix block-form `model { ... }`, `tools { ... }`, and
   `schema { ... }` declarations in the ActiveRecord and Sequel agent
   wrappers so persisted agent models configure the internal agent class
-  the same way as `LLM::Agent`.
+  the same way `LLM::Agent` does.
 * **Fix missing `skills` in ORM agent wrappers** <br>
   Fix the ActiveRecord and Sequel agent wrappers to expose `skills`, so
@@ -465,7 +511,7 @@ DSML tool-marker filtering in streamed text.
   blocks that Bedrock rejects.
 * **Suppress Bedrock DSML tool markers in streamed text** <br>
-  Filter `\"<｜DSML｜function_calls\"` markers out of streamed Bedrock
+  Filter `"\u003c\u003cDSML\u003efunction_calls\u003e\u003e"` markers out of streamed Bedrock
   assistant text so tool-call sentinels do not leak into user-visible
   output.
@@ -475,7 +521,7 @@ Changes since `v7.0.0`.
 This release adds Unix-fork concurrency for process-isolated tool
 execution, extends `LLM::Object` with `#merge` and `#delete`, and drops
-Ruby 3.2 support due to segfaults observed with the `:fork` path. It
+Ruby 3.2 support due to a segfault observed with the `:fork` path. It
 promotes `LLM::Pipe` to the top-level namespace and adds
 `persistent: true` on `LLM::MCP.http` for direct persistent transport
 configuration. `LLM::Function#runner` is exposed as public API, agent
@@ -616,7 +662,7 @@ provider usage has been recorded yet.
   buffer API.
 * **Support percentage compaction token thresholds** <br>
-  Let `LLM::Compactor` accept `token_threshold:` values like `\"90%\"` so
+  Let `LLM::Compactor` accept `token_threshold:` values like `"90%"` so
   compaction can trigger at a percentage of the active model context
   window.
@@ -775,7 +821,7 @@ interruption use the active per-call stream correctly.
 * **Refresh provider model metadata** <br>
   Add current DeepSeek and OpenAI model metadata to `data/` and update the
-  Google Gemma model entry to match the current provider naming.
+  Google Gemini model entry to match the current provider naming.
 ### Fix
@@ -1216,12 +1262,12 @@ Changes since `v4.14.0`.
   storage when Sequel JSON typecasting is enabled.
 * **Improve streaming parser performance** <br>
-  In the local replay-based `stream_parser` benchmark versus
-  `v4.14.0` (median of 20 samples, 5000 iterations), plain Ruby is a
+  In the local replay-based `stream_parser` benchmark versus `v4.14.0`
+  (median of 20 samples, 5000 iterations), plain Ruby is a
   small overall win: the generic eventstream path is about 0.4%
   faster, the OpenAI stream parser is about 0.5% faster, and the
   OpenAI Responses parser is about 1.6% faster, with unchanged
-  allocations. Under YJIT on the same benchmark, the generic
+  allocations. Under YJIT on the same benchmark harness, the generic
   eventstream path is about 0.9% faster and the OpenAI stream parser
   is about 0.4% faster, while the OpenAI Responses parser is about
   0.7% slower, also with unchanged allocations.
@@ -1263,7 +1309,7 @@ parallel tool calls can safely share one connection.
 * **Reduce provider streaming allocations** <br>
   Decode streamed provider payloads directly in
   `LLM::Provider::Transport::HTTP` before handing them to provider
-  parsers, which cuts allocation churn and gives a smaller streaming
+  parsers, which cuts allocation churn and gives a small streaming
   speed bump.
 * **Reduce generic SSE parser allocations** <br>
@@ -1399,7 +1445,7 @@ Changes since `v4.9.0`.
 - Add HTTP transport for MCP with `LLM::MCP::Transport::HTTP` for remote servers
 - Add JSON Schema union types (`any_of`, `all_of`, `one_of`) with parser integration
-- Add JSON Schema type array union support (e.g., `\"type\": [\"object\", \"null\"]`)
+- Add JSON Schema type array union support (e.g., `"type": ["object", "null"]`)
 - Add JSON Schema type inference from `const`, `enum`, or `default` fields
 ### Change
@@ -1500,7 +1546,7 @@ Notable merged work in this range includes:
 - `Add rack + websocket example (#130)`
 - `feat(gemspec): add changelog URI (#136)`
 - `feat(function): alias ThreadGroup#wait as ThreadGroup#value (#62)`
-- README and screencast refresh across `#66`, `#68`, `#71`, and
+- `README and screencast refresh across `#66`, `#68`, `#71`, and
   `#72`
 - `chore(bot): update deprecation warning from v5.0 to v6.0`
 - `fix(deepseek): tolerate malformed tool arguments`

data/README.md CHANGED

@@ -11,7 +11,7 @@
     <img src="https://img.shields.io/badge/License-0BSD-orange.svg?" alt="License">
   </a>
   <a href="https://github.com/llmrb/llm.rb/tags">
-    <img src="https://img.shields.io/badge/version-11.2.0-green.svg?" alt="Version">
+    <img src="https://img.shields.io/badge/version-11.3.0-green.svg?" alt="Version">
   </a>
 </p>
@@ -30,10 +30,27 @@ also includes built-in ActiveRecord and Sequel support, plus concurrent
 tool execution through threads, tasks (via async gem), fibers, ractors,
 and fork (via xchan.rb gem).
-As a bonus, llm.rb is also available to embedded systems [via mruby](https://github.com/llmrb/mruby-llm#readme),
-to the browser and edge devices [via WebAssembly](https://github.com/llmrb/wasm-llm#readme),
-and has first-class [Rails support](https://github.com/llmrb/rails-llm#readme)
-via a separate gem.
+## Services
+The llm.rb runtime and its forks
+([mruby-llm](https://github.com/llmrb/mruby-llm),
+[wasm-llm](https://github.com/llmrb/wasm-llm))
+power a growing family of AI applications, and
+services. The following applications are publicly
+accessible over SSH and are free to try. No account
+required. Nothing to install.
+#### matz - the mruby expert
+> ssh matz@r.uby.dev
+See [https://r.uby.dev/matz](https://r.uby.dev/matz) for more information.
+#### robert - the freebsd expert
+> ssh robert@4.4bsd.dev
+See [https://4.4bsd.dev/robert](https://4.4bsd.dev/robert) for more information.
 ## Quick start
@@ -138,10 +155,10 @@ to either
 or
 [LLM::Agent](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html).
 In this example, the MCP server runs over stdio and
-[LLM::Context](https://0x1eef.github.io/x/llm.rb/LLM/Context.html)
-uses the same tool loop as local tools. For **stdio**, `mcp.session`
-is the preferred pattern because it keeps one MCP session alive across
-discovery and tool calls:
+[LLM::Agent](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html)
+manages the tool loop. For **stdio**, `mcp.session` is the preferred
+pattern because it keeps one MCP session alive across discovery and
+tool calls:
 ```ruby
 require "llm"
@@ -150,9 +167,8 @@ llm = LLM.openai(key: ENV["KEY"])
 mcp = LLM::MCP.stdio(argv: ["ruby", "server.rb"])
 mcp.session do
-  ctx = LLM::Context.new(llm, stream: $stdout, tools: mcp.tools)
-  ctx.talk "Use the available tools to inspect the environment."
-  ctx.talk(ctx.wait(:call)) while ctx.functions?
+  agent = LLM::Agent.new(llm, stream: $stdout, tools: mcp.tools)
+  agent.talk "Use the available tools to inspect the environment."
 end
 ```
@@ -167,9 +183,8 @@ require "llm"
 llm = LLM.openai(key: ENV["KEY"])
 mcp = LLM::MCP.stdio(argv: ["ruby", "server.rb"])
-ctx = LLM::Context.new(llm, tools: mcp.tools)
-ctx.talk("Use the available tools to inspect the environment.")
-ctx.talk(ctx.wait(:call)) while ctx.functions?
+agent = LLM::Agent.new(llm, tools: mcp.tools)
+agent.talk("Use the available tools to inspect the environment.")
 ```
 The HTTP transport can be used with or without the `session` method,
@@ -188,9 +203,8 @@ mcp = LLM::MCP.http(
   transport: :net_http_persistent
 )
-ctx = LLM::Context.new(llm, tools: mcp.tools)
-ctx.talk("Use the available tools to inspect the environment.")
-ctx.talk(ctx.wait(:call)) while ctx.functions?
+agent = LLM::Agent.new(llm, tools: mcp.tools)
+agent.talk("Use the available tools to inspect the environment.")
 ```
 #### A2A (Agent 2 Agent)
@@ -214,9 +228,8 @@ a2a = LLM::A2A.rest(
   headers: {"Authorization" => "Bearer token"}
 )
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm, tools: a2a.skills)
-ctx.talk "Analyze this CSV and summarize the trends."
-ctx.talk(ctx.wait(:call)) while ctx.functions?
+agent = LLM::Agent.new(llm, tools: a2a.skills)
+agent.talk "Analyze this CSV and summarize the trends."
 ```
 Use persistent HTTP connections:
@@ -317,8 +330,8 @@ class Stream < LLM::Stream
 end
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm, stream: Stream.new)
-ctx.talk "Write a haiku about Ruby."
+agent = LLM::Agent.new(llm, stream: Stream.new)
+agent.talk "Write a haiku about Ruby."
 ```
 #### LLM::Stream (advanced)
@@ -375,30 +388,31 @@ agent.talk "Read README.md and CHANGELOG.md and compare them."
 #### Serialization
-The [`LLM::Context`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html)
+The [`LLM::Agent`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html)
 object can be serialized to JSON, which makes it suitable for storing
 in a file, a database column, or a Redis queue. The built-in
-ActiveRecord and Sequel plugins are built on top of this feature:
+ActiveRecord and Sequel plugins are built on top of the same underlying
+serialization feature:
 ```ruby
 require "llm"
 llm = LLM.openai(key: ENV["KEY"])
-# Serialize a context
-ctx1 = LLM::Context.new(llm)
-ctx1.talk "Remember that my favorite language is Ruby"
-string = ctx1.to_json
+# Serialize an agent
+agent1 = LLM::Agent.new(llm)
+agent1.talk "Remember that my favorite language is Ruby"
+string = agent1.to_json
-# Restore a context (from JSON)
-ctx2 = LLM::Context.new(llm, stream: $stdout)
-ctx2.restore(string:)
-ctx2.talk "What is my favorite language?"
+# Restore an agent (from JSON)
+agent2 = LLM::Agent.new(llm, stream: $stdout)
+agent2.restore(string:)
+agent2.talk "What is my favorite language?"
 ```
 #### ask
-[`LLM::Context`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html)
+[`LLM::Agent`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html)
 also provides `ask`, a convenience interface that is compatible with
 RubyLLM's `ask` method. It accepts a prompt, an optional `with:`
 attachment path or paths, an optional `stream:` target, and an optional
@@ -410,11 +424,11 @@ so use `.content` when you want the text directly:
 require "llm"
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm)
+agent = LLM::Agent.new(llm)
-puts ctx.ask("Hello world").content
-puts ctx.ask("Summarize this document.", with: "README.md").content
-ctx.ask("Stream this reply.") { $stdout << _1 }
+puts agent.ask("Hello world").content
+puts agent.ask("Summarize this document.", with: "README.md").content
+agent.ask("Stream this reply.") { $stdout << _1 }
 ```
 ## Installation
@@ -427,8 +441,8 @@ gem install llm.rb
 #### REPL
-This example uses [`LLM::Context`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html)
-directly for an interactive REPL. <br> See the
+This example uses [`LLM::Agent`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html)
+for an interactive REPL. <br> See the
 [deepdive (web)](https://llmrb.github.io/llm.rb/) or
 [deepdive (markdown)](resources/deepdive.md) for more examples.
@@ -436,11 +450,11 @@ directly for an interactive REPL. <br> See the
 require "llm"
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm, stream: $stdout)
+agent = LLM::Agent.new(llm, stream: $stdout)
 loop do
   print "> "
-  ctx.talk(STDIN.gets || break)
+  agent.talk(STDIN.gets || break)
   puts
 end
 ```
@@ -449,36 +463,36 @@ end
 In llm.rb, a prompt can be a string, an [`LLM::Prompt`](https://0x1eef.github.io/x/llm.rb/LLM/Prompt.html), or an array.
 When you use an array, each element can be plain text or a tagged object such as
-[`ctx.image_url(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html#image_url-instance_method),
-[`ctx.local_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html#local_file-instance_method),
-or [`ctx.remote_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html#remote_file-instance_method).
+[`agent.image_url(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html#image_url-instance_method),
+[`agent.local_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html#local_file-instance_method),
+or [`agent.remote_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html#remote_file-instance_method).
 Those tagged objects carry the metadata the provider adapter needs to turn one
 Ruby prompt into the provider-specific multimodal request schema.
 If the model understands that file type, you can attach a local file directly
-with `ctx.ask(..., with: path)` instead of uploading it first through a
+with `agent.ask(..., with: path)` instead of uploading it first through a
 provider Files API. Under the hood, llm.rb tags the path as a
-[`ctx.local_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html#local_file-instance_method)
+[`agent.local_file(...)`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html#local_file-instance_method)
 object:
 ```ruby
 require "llm"
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm)
-puts ctx.ask("Summarize this document.", with: "README.md").content
+agent = LLM::Agent.new(llm)
+puts agent.ask("Summarize this document.", with: "README.md").content
 ```
 #### Context Compaction
-This example uses [`LLM::Context`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html),
+This example uses [`LLM::Agent`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html),
 [`LLM::Compactor`](https://0x1eef.github.io/x/llm.rb/LLM/Compactor.html), and
 [`LLM::Stream`](https://0x1eef.github.io/x/llm.rb/LLM/Stream.html) together so
-long-lived contexts can summarize older history and expose the lifecycle
+long-lived conversations can summarize older history and expose the lifecycle
 through stream hooks. This approach is inspired by General Intelligence
 Systems. The
 compactor can also use its own `model:` if you want summarization to run on a
-different model from the main context. `token_threshold:` accepts either a
+different model from the main conversation. `token_threshold:` accepts either a
 fixed token count or a percentage string like `"90%"`, which resolves
 against the active model context window and triggers compaction once total
 token usage goes over that percentage. See the
@@ -499,7 +513,7 @@ class Stream < LLM::Stream
 end
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(
+agent = LLM::Agent.new(
   llm,
   stream: Stream.new,
   compactor: {
@@ -518,9 +532,8 @@ visible assistant output. See the
 [deepdive (web)](https://llmrb.github.io/llm.rb/) or
 [deepdive (markdown)](resources/deepdive.md) for more examples.
-To use the Responses API (OpenAI-specific), initialize a
-context or agent with `mode: :responses` and keep using
-`talk` for turns.
+To use the Responses API (OpenAI-specific), initialize an agent with
+`mode: :responses` and keep using `talk` for turns.
 ```ruby
 require "llm"
@@ -536,20 +549,20 @@ class Stream < LLM::Stream
 end
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(
+agent = LLM::Agent.new(
   llm,
   model: "gpt-5.4-mini",
   mode: :responses,
   reasoning: {effort: "medium"},
   stream: Stream.new
 )
-ctx.talk("Solve 17 * 19 and show your work.")
+agent.talk("Solve 17 * 19 and show your work.")
 ```
 #### Request Cancellation
 Need to cancel a stream? llm.rb has you covered through
-[`LLM::Context#interrupt!`](https://0x1eef.github.io/x/llm.rb/LLM/Context.html#interrupt-21-instance_method).
+[`LLM::Agent#interrupt!`](https://0x1eef.github.io/x/llm.rb/LLM/Agent.html#interrupt-21-instance_method).
 <br> See the [deepdive (web)](https://llmrb.github.io/llm.rb/)
 or [deepdive (markdown)](resources/deepdive.md) for more examples.
@@ -558,15 +571,15 @@ require "llm"
 require "io/console"
 llm = LLM.openai(key: ENV["KEY"])
-ctx = LLM::Context.new(llm, stream: $stdout)
+agent = LLM::Agent.new(llm, stream: $stdout)
 worker = Thread.new do
-  ctx.talk("Write a very long essay about network protocols.")
+  agent.talk("Write a very long essay about network protocols.")
 rescue LLM::Interrupt
   puts "Request was interrupted!"
 end
 STDIN.getch
-ctx.interrupt!
+agent.interrupt!
 worker.join
 ```
@@ -727,7 +740,7 @@ end
 This example uses [`LLM::MCP`](https://0x1eef.github.io/x/llm.rb/LLM/MCP.html)
 over HTTP so remote GitHub MCP tools run through the same
-`LLM::Context` tool path as local tools. It expects a GitHub token in
+`LLM::Agent` tool path as local tools. It expects a GitHub token in
 `ENV["GITHUB_PAT"]`. See the
 [deepdive (web)](https://llmrb.github.io/llm.rb/) or
 [deepdive (markdown)](resources/deepdive.md) for more examples.
@@ -743,9 +756,8 @@ mcp = LLM::MCP.http(
   persistent: true
 )
-ctx = LLM::Context.new(llm, stream: $stdout, tools: mcp.tools)
-ctx.talk("Pull information about my GitHub account.")
-ctx.talk(ctx.wait(:call)) while ctx.functions?
+agent = LLM::Agent.new(llm, stream: $stdout, tools: mcp.tools)
+agent.talk("Pull information about my GitHub account.")
 ```
 ## Resources

data/lib/llm/agent.rb CHANGED

@@ -72,7 +72,11 @@ module LLM
     #  Returns the current skills when no argument is provided
     def self.skills(*skills, &block)
       return @skills if skills.empty? && !block
-      @skills = block || skills.flatten
+      if skills.size == 1 and skills.grep(Symbol).any?
+        @skills = skills.first
+      else
+        @skills = block || skills.flatten
+      end
     end
     ##
@@ -160,14 +164,31 @@ module LLM
     ##
     # Set or get the tool names that require confirmation before they can run.
     #
+    # When a single Symbol is given, it is stored as-is and resolved at
+    # initialization time by calling the method with that name on the agent
+    # instance. This allows dynamic tool confirmation lists.
+    #
+    # @example
+    #   class MyAgent < LLM::Agent
+    #     confirm :tools_that_need_confirmation
+    #
+    #     def tools_that_need_confirmation
+    #       some_condition ? %w[delete destroy] : %w[delete]
+    #     end
+    #   end
+    #
     # @param [String, Symbol, Array<String, Symbol>, Proc] tool_names
     #  One or more tool names.
     # @param [Proc] block
     #  An optional, lazy-evaluated Proc
-    # @return [Array<String>, Proc, nil]
+    # @return [Array<String>, Proc, Symbol, nil]
     def self.confirm(*tool_names, &block)
       return @confirm if tool_names.empty? && !block
-      @confirm = block || tool_names.flatten.map(&:to_s)
+      if tool_names.size == 1 && tool_names.grep(Symbol).any?
+        @confirm = tool_names.first
+      else
+        @confirm = block || tool_names.flatten.map(&:to_s)
+      end
     end
     ##
@@ -190,7 +211,7 @@ module LLM
       fields_ivar = %i[tracer concurrency instructions confirm]
       fields.each do |field|
         resolvable = params.key?(field) ? params.delete(field) : self.class.public_send(field)
-        resolve_symbol = !%i[concurrency confirm].include?(field)
+        resolve_symbol = !%i[concurrency].include?(field)
         resolved = resolvable != nil ? resolve_option(self, resolvable, resolve_symbol:) : resolvable
         resolved = [*resolved].map(&:to_s) if field == :confirm && resolved
         if field == :model
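
The `confirm` and `skills` changes above store a lone Symbol as-is and defer resolution to initialization. The following is a rough, self-contained sketch of that pattern, not llm.rb's implementation: `SketchAgent`, `CautiousAgent`, `StaticAgent`, and the private `resolve` helper (standing in for `resolve_option`, whose body is not shown in this diff) are all hypothetical names.

```ruby
# Stand-in base class illustrating the DSL pattern (not LLM::Agent itself).
class SketchAgent
  # Class-level DSL: with no arguments, return the stored value;
  # with a single Symbol, store it untouched for later resolution.
  def self.confirm(*tool_names, &block)
    return @confirm if tool_names.empty? && !block
    if tool_names.size == 1 && tool_names.first.is_a?(Symbol)
      @confirm = tool_names.first
    else
      @confirm = block || tool_names.flatten.map(&:to_s)
    end
  end

  attr_reader :confirm

  def initialize
    @confirm = resolve(self.class.confirm)
  end

  private

  # Hypothetical resolver: a Symbol names an instance method, a Proc is
  # evaluated against the instance, anything else is used directly.
  def resolve(option)
    value = case option
            when Symbol then send(option)
            when Proc then instance_exec(&option)
            else option
            end
    value && [*value].map(&:to_s)
  end
end

class CautiousAgent < SketchAgent
  confirm :tools_that_need_confirmation

  def tools_that_need_confirmation
    %i[delete destroy]
  end
end

class StaticAgent < SketchAgent
  confirm :delete, "destroy"
end
```

In this sketch a single Symbol is always read as a method name, so a single literal tool name would need to be passed as a String (`confirm "delete"`). Whether llm.rb behaves the same way when no such method exists is not visible in this diff.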

data/lib/llm/error.rb CHANGED

@@ -35,6 +35,10 @@ module LLM
   # HTTPServerError
   ServerError = Class.new(Error)
+  ##
+  # HTTPNotFound
+  NotFoundError = Class.new(Error)
   ##
   # When an given an input object that is not understood
   FormatError = Class.new(Error)

data/lib/llm/providers/anthropic/error_handler.rb CHANGED

@@ -49,6 +49,8 @@ class LLM::Anthropic
         LLM::UnauthorizedError.new("Authentication error").tap { _1.response = res }
       elsif res.rate_limited?
         LLM::RateLimitError.new("Too many requests").tap { _1.response = res }
+      elsif res.not_found?
+        LLM::NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
       else
         LLM::Error.new("Unexpected response").tap { _1.response = res }
       end

data/lib/llm/providers/bedrock/error_handler.rb CHANGED

@@ -53,7 +53,7 @@ class LLM::Bedrock
       elsif res.rate_limited?
         LLM::RateLimitError.new(message).tap { _1.response = res }
       elsif res.not_found?
-        LLM::Error.new("Bedrock model not found: #{message}").tap { _1.response = res }
+        LLM::NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
       else
         LLM::Error.new(message).tap { _1.response = res }
       end

data/lib/llm/providers/google/error_handler.rb CHANGED

@@ -60,6 +60,8 @@ class LLM::Google
         end
       elsif res.rate_limited?
         LLM::RateLimitError.new("Too many requests").tap { _1.response = res }
+      elsif res.not_found?
+        LLM::NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
       else
         LLM::Error.new("Unexpected response").tap { _1.response = res }
       end

data/lib/llm/providers/ollama/error_handler.rb CHANGED

@@ -49,6 +49,8 @@ class LLM::Ollama
         LLM::UnauthorizedError.new("Authentication error").tap { _1.response = res }
       elsif res.rate_limited?
         LLM::RateLimitError.new("Too many requests").tap { _1.response = res }
+      elsif res.not_found?
+        LLM::NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
       else
         LLM::Error.new("Unexpected response").tap { _1.response = res }
       end

data/lib/llm/providers/openai/error_handler.rb CHANGED

@@ -55,6 +55,8 @@ class LLM::OpenAI
         LLM::UnauthorizedError.new("Authentication error").tap { _1.response = res }
       elsif res.rate_limited?
         LLM::RateLimitError.new("Too many requests").tap { _1.response = res }
+      elsif res.not_found?
+        LLM::NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
       else
         error = body["error"] || {}
         case error["type"]
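
Each error handler above gains the same `not_found?` branch. The sketch below mirrors that branch with stand-in classes so it runs without the gem. The `Sketch` module, its `Response` struct, and `error_for` are hypothetical; only the error class hierarchy and the message string are taken from the diff.

```ruby
module Sketch
  # Base error carrying the provider response, as the handlers above do via `tap`.
  Error = Class.new(RuntimeError) do
    attr_accessor :response
  end
  RateLimitError = Class.new(Error)
  NotFoundError = Class.new(Error)

  # Hypothetical response wrapper exposing the predicates used by the handlers.
  Response = Struct.new(:status) do
    def rate_limited?
      status == 429
    end

    def not_found?
      status == 404
    end
  end

  # Map a response to an error object, checking 404 before the generic fallback.
  def self.error_for(res)
    if res.rate_limited?
      RateLimitError.new("Too many requests").tap { _1.response = res }
    elsif res.not_found?
      NotFoundError.new("Server response: not found (404)").tap { _1.response = res }
    else
      Error.new("Unexpected response").tap { _1.response = res }
    end
  end
end

begin
  raise Sketch.error_for(Sketch::Response.new(404))
rescue Sketch::NotFoundError => e
  puts "#{e.class}: #{e.message}"
end
```

Because `NotFoundError` subclasses `Error` in the diff, callers that already rescue `LLM::Error` should keep catching 404s. The Bedrock handler is the one visible behaviour change: its 404 message no longer includes the provider-supplied text.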

data/lib/llm/version.rb CHANGED

@@ -1,5 +1,5 @@
 # frozen_string_literal: true
 module LLM
-  VERSION = "11.2.0"
+  VERSION = "11.3.0"
 end

data/llm.gemspec CHANGED

@@ -5,8 +5,8 @@ require_relative "lib/llm/version"
 Gem::Specification.new do |spec|
   spec.name = "llm.rb"
   spec.version = LLM::VERSION
-  spec.authors = ["Antar Azri", "0x1eef", "Christos Maris", "Rodrigo Serrano"]
-  spec.email = ["azantar@proton.me", "0x1eef@hardenedbsd.org"]
+  spec.authors = ["0x1eef (Robert)", "Antar Azri", "Rodrigo Serrano", "Christos Maris"]
+  spec.email = ["robert@4.4bsd.dev"]
   spec.summary = "Ruby's most capable AI runtime"
   spec.description = <<~DESC
@@ -60,4 +60,5 @@ Gem::Specification.new do |spec|
   spec.add_development_dependency "sqlite3", "~> 2.0"
   spec.add_development_dependency "xchan.rb", "~> 0.20"
   spec.add_development_dependency "pg", "~> 1.5"
+  spec.add_development_dependency "irb", "~> 1.18"
 end

metadata CHANGED

@@ -1,13 +1,13 @@
 --- !ruby/object:Gem::Specification
 name: llm.rb
 version: !ruby/object:Gem::Version
-  version: 11.2.0
+  version: 11.3.0
 platform: ruby
 authors:
+- 0x1eef (Robert)
 - Antar Azri
-- '0x1eef'
-- Christos Maris
 - Rodrigo Serrano
+- Christos Maris
 bindir: bin
 cert_chain: []
 date: 1980-01-02 00:00:00.000000000 Z
@@ -264,6 +264,20 @@ dependencies:
     - - "~>"
       - !ruby/object:Gem::Version
         version: '1.5'
+- !ruby/object:Gem::Dependency
+  name: irb
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.18'
+  type: :development
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.18'
 description: |
   llm.rb is Ruby's most capable AI runtime.
@@ -279,8 +293,7 @@ description: |
   tool execution through threads, tasks (via async gem), fibers, ractors,
   and fork (via xchan.rb gem).
 email:
-- azantar@proton.me
-- 0x1eef@hardenedbsd.org
+- robert@4.4bsd.dev
 executables: []
 extensions: []
 extra_rdoc_files: []