RubyGems - smith-agents - Versions diffs - 0.4.0 - Mend

smith-agents 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (115) hide show

checksums.yaml +7 -0
data/CHANGELOG.md +139 -0
data/CODE_OF_CONDUCT.md +128 -0
data/LICENSE +21 -0
data/README.md +226 -0
data/Rakefile +14 -0
data/UPSTREAM_PROPOSAL.md +141 -0
data/docs/CONFIGURATION.md +123 -0
data/docs/PATTERNS.md +492 -0
data/docs/PERSISTENCE.md +169 -0
data/docs/TOOLS_AND_GUARDRAILS.md +140 -0
data/docs/workflow_claim.md +58 -0
data/exe/smith +7 -0
data/lib/generators/smith/install/install_generator.rb +22 -0
data/lib/generators/smith/install/templates/smith.rb.tt +44 -0
data/lib/smith/agent/lifecycle.rb +264 -0
data/lib/smith/agent/registry.rb +128 -0
data/lib/smith/agent.rb +259 -0
data/lib/smith/artifacts/file.rb +59 -0
data/lib/smith/artifacts/memory.rb +75 -0
data/lib/smith/artifacts/scoped_store.rb +29 -0
data/lib/smith/artifacts.rb +5 -0
data/lib/smith/budget/ledger.rb +42 -0
data/lib/smith/budget.rb +5 -0
data/lib/smith/cli.rb +82 -0
data/lib/smith/context/observation_masking.rb +19 -0
data/lib/smith/context/session.rb +42 -0
data/lib/smith/context/state_injection.rb +24 -0
data/lib/smith/context.rb +61 -0
data/lib/smith/doctor/check.rb +12 -0
data/lib/smith/doctor/checks/baseline.rb +84 -0
data/lib/smith/doctor/checks/configuration.rb +56 -0
data/lib/smith/doctor/checks/durability.rb +103 -0
data/lib/smith/doctor/checks/live.rb +55 -0
data/lib/smith/doctor/checks/models_registry.rb +66 -0
data/lib/smith/doctor/checks/openai_api_mode.rb +51 -0
data/lib/smith/doctor/checks/persistence.rb +99 -0
data/lib/smith/doctor/checks/persistence_capabilities.rb +60 -0
data/lib/smith/doctor/checks/persistence_registry.rb +82 -0
data/lib/smith/doctor/checks/rails.rb +39 -0
data/lib/smith/doctor/checks/serialization.rb +78 -0
data/lib/smith/doctor/installer.rb +103 -0
data/lib/smith/doctor/printer.rb +62 -0
data/lib/smith/doctor/report.rb +39 -0
data/lib/smith/doctor.rb +53 -0
data/lib/smith/errors.rb +191 -0
data/lib/smith/event.rb +11 -0
data/lib/smith/events/.keep +0 -0
data/lib/smith/events/bus.rb +60 -0
data/lib/smith/events/step_completed.rb +11 -0
data/lib/smith/events/subscription.rb +24 -0
data/lib/smith/events.rb +5 -0
data/lib/smith/guardrails/runner.rb +44 -0
data/lib/smith/guardrails/url_verifier.rb +7 -0
data/lib/smith/guardrails.rb +35 -0
data/lib/smith/models/inference.rb +199 -0
data/lib/smith/models/normalizer.rb +186 -0
data/lib/smith/models/profile.rb +39 -0
data/lib/smith/models.rb +132 -0
data/lib/smith/persistence_adapters/active_record_store.rb +99 -0
data/lib/smith/persistence_adapters/cache_store.rb +79 -0
data/lib/smith/persistence_adapters/memory.rb +105 -0
data/lib/smith/persistence_adapters/rails_cache.rb +20 -0
data/lib/smith/persistence_adapters/redis_store.rb +136 -0
data/lib/smith/persistence_adapters/retry.rb +42 -0
data/lib/smith/persistence_adapters.rb +112 -0
data/lib/smith/pricing.rb +65 -0
data/lib/smith/providers/openai/responses.rb +315 -0
data/lib/smith/providers/openai/routing.rb +67 -0
data/lib/smith/providers/openai/tools_extensions.rb +106 -0
data/lib/smith/railtie.rb +9 -0
data/lib/smith/tasks/doctor.rake +38 -0
data/lib/smith/tool/budget_enforcement.rb +33 -0
data/lib/smith/tool/capability_builder.rb +18 -0
data/lib/smith/tool/capture.rb +22 -0
data/lib/smith/tool/compatibility.rb +72 -0
data/lib/smith/tool/policy.rb +40 -0
data/lib/smith/tool.rb +171 -0
data/lib/smith/tools/think.rb +25 -0
data/lib/smith/tools/url_fetcher.rb +16 -0
data/lib/smith/tools/web_search.rb +17 -0
data/lib/smith/tools.rb +5 -0
data/lib/smith/trace/logger.rb +46 -0
data/lib/smith/trace/memory.rb +53 -0
data/lib/smith/trace/open_telemetry.rb +57 -0
data/lib/smith/trace.rb +89 -0
data/lib/smith/types.rb +16 -0
data/lib/smith/version.rb +5 -0
data/lib/smith/workflow/artifact_integration.rb +41 -0
data/lib/smith/workflow/budget_integration.rb +105 -0
data/lib/smith/workflow/claim.rb +118 -0
data/lib/smith/workflow/data_volume_policy.rb +36 -0
data/lib/smith/workflow/deadline_enforcement.rb +100 -0
data/lib/smith/workflow/deterministic_execution.rb +53 -0
data/lib/smith/workflow/deterministic_step.rb +57 -0
data/lib/smith/workflow/dsl.rb +223 -0
data/lib/smith/workflow/durability.rb +369 -0
data/lib/smith/workflow/evaluator_optimizer.rb +220 -0
data/lib/smith/workflow/event_integration.rb +24 -0
data/lib/smith/workflow/execution.rb +127 -0
data/lib/smith/workflow/execution_frame.rb +166 -0
data/lib/smith/workflow/guardrail_integration.rb +40 -0
data/lib/smith/workflow/nested_execution.rb +69 -0
data/lib/smith/workflow/orchestrator_worker.rb +145 -0
data/lib/smith/workflow/parallel.rb +50 -0
data/lib/smith/workflow/parallel_execution.rb +75 -0
data/lib/smith/workflow/persistence.rb +358 -0
data/lib/smith/workflow/pipeline.rb +117 -0
data/lib/smith/workflow/router.rb +53 -0
data/lib/smith/workflow/transition.rb +208 -0
data/lib/smith/workflow.rb +555 -0
data/lib/smith.rb +254 -0
data/script/profile_tool_results.rb +94 -0
data/sig/smith.rbs +4 -0
metadata +258 -0

data/UPSTREAM_PROPOSAL.md ADDED Viewed

@@ -0,0 +1,141 @@
+# Upstream proposal: Capability profiles + before_complete hook for RubyLLM
+## Motivation
+Smith (a workflow-first multi-agent orchestration library built on RubyLLM) currently maintains a "shadow registry" of model capabilities — pattern-based provider rules describing how each provider shapes its API payload (Anthropic Opus 4.7+ uses adaptive thinking; OpenAI gpt-5 family needs `/v1/responses` for tools+thinking; Gemini 2.5+ accepts `thinking_budget`; etc.). When the underlying RubyLLM client doesn't know these distinctions, Smith's normalizer rewrites the chat object's `@temperature`, `@thinking`, and `@params` ivars before the request leaves.
+This works but requires Smith to:
+1. Maintain a parallel capability registry (`Smith::Models::Inference`)
+2. Mutate RubyLLM-owned chat ivars directly (`@temperature`, `@thinking`) because RubyLLM 1.15 has no `Chat#without_thinking` / `#without_temperature` public API
+3. Vendor PR #770's `/v1/responses` adapter ahead of upstream merge so gpt-5 family can use tools + reasoning_effort together
+A cleaner design lives upstream in RubyLLM. This proposal describes three additive changes that would let Smith retire ~400 lines of vendored code and ~6 monkey-patches.
+## Proposed RubyLLM API
+### 1. `RubyLLM::Chat#without_thinking` and `#without_temperature`
+Smith currently does `chat.instance_variable_set(:@thinking, nil)` and `chat.instance_variable_set(:@temperature, nil)` because there's no public way to clear these. `with_thinking` requires at least one of `effort:` or `budget:`. Add the no-arg clearers:
+```ruby
+module RubyLLM
+  class Chat
+    def without_thinking
+      @thinking = nil
+      self
+    end
+    def without_temperature
+      @temperature = nil
+      self
+    end
+  end
+end
+```
+Small additive change. Smith retires its only remaining `instance_variable_set` calls.
+### 2. `RubyLLM::Capabilities::Profile` + `Model::Info#capabilities`
+Add a structured capability profile to model info:
+```ruby
+module RubyLLM
+  module Capabilities
+    Profile = Data.define(
+      :thinking_shape,            # :budget_tokens | :reasoning_effort | :adaptive | nil
+      :accepts_temperature,
+      :tools_with_thinking_native,
+      :tools_with_thinking_route  # :responses | nil for OpenAI
+    )
+    # Public registration API. Idempotent.
+    def self.register(model_id, profile)
+      registry[model_id.to_s] = profile
+    end
+    def self.find(model_id)
+      registry[model_id.to_s]
+    end
+    def self.registry
+      @registry ||= {}
+    end
+  end
+  class Model::Info
+    attr_reader :capabilities  # RubyLLM::Capabilities::Profile?
+  end
+end
+```
+RubyLLM's `models.json` could ship default capability profiles for the models it bundles. Smith would migrate its `Smith::Models::Inference` defaults upstream as a `Capabilities.default_rules` table.
+### 3. `RubyLLM::Provider.before_complete` hook
+Add an extension point for per-request shaping:
+```ruby
+module RubyLLM
+  class Provider
+    # Hosts register normalizers that run AFTER chat construction but
+    # BEFORE render_payload. The hook receives the chat and the
+    # capabilities profile of the resolved model.
+    def self.before_complete(&block)
+      normalizers << block
+    end
+    def self.normalizers
+      @normalizers ||= []
+    end
+    # Existing complete(...) signature unchanged. Internally invokes
+    # the registered normalizers before render_payload.
+  end
+end
+```
+## What Smith looks like after upstream lands
+```ruby
+# lib/smith.rb (post-upstream)
+# Register Smith's library-shipped pattern rules into RubyLLM's catalog
+Smith::Models::Inference.rules.each do |rule|
+  RubyLLM::Capabilities.register_rule(
+    provider: rule.provider,
+    matcher:  rule.matcher,
+    profile:  rule.to_profile("anyone").to_h.except(:model_id)
+  )
+end
+# Register Smith's normalizer as a public RubyLLM hook
+RubyLLM::Provider.before_complete do |chat, profile|
+  Smith::Models::Normalizer.apply!(chat, profile: profile) if profile
+end
+```
+Smith's `Models` registry, the `Smith::Agent.chat()` override, and the `lib/smith/providers/openai/routing.rb` vendor patch all retire. What remains in Smith: the capability defaults table and the normalizer's translation logic — still Smith-owned (orchestration concerns), just consumed through a public RubyLLM hook.
+## Retirement checklist
+Once the upstream API ships and Smith adopts it, the following files retire:
+- `lib/smith/models.rb` — becomes a thin wrapper around `RubyLLM::Capabilities`
+- `lib/smith/models/profile.rb` — replaced by `RubyLLM::Capabilities::Profile`
+- `lib/smith/agent.rb#chat` override — replaced by `RubyLLM::Provider.before_complete` hook
+- `lib/smith/providers/openai/routing.rb` — replaced when PR #770 merges (independent track)
+- `lib/smith/providers/openai/responses.rb` — same
+- `lib/smith/providers/openai/tools_extensions.rb` — same
+What stays Smith-owned (orchestration concerns, not provider-API concerns):
+- `lib/smith/models/inference.rb` — pattern rules table; registers itself into RubyLLM via `Capabilities.register_rule`
+- `lib/smith/models/normalizer.rb` — translation logic; registered via `Provider.before_complete`
+- `lib/smith/tool/compatibility.rb` — tool-side compatibility checks
+- The agent / workflow / tool DSLs
+## Tracking
+- RubyLLM PR #770 (OpenAI `/v1/responses` support) is the related upstream track. Smith's vendored `Smith::Providers::OpenAI::Responses` retires when #770 merges.
+- This proposal (capability profiles + before_complete) is a separate, additive RubyLLM RFC. Once accepted, Smith files a migration PR to consume it.

data/docs/CONFIGURATION.md ADDED Viewed

@@ -0,0 +1,123 @@
+# Configuration
+There are three different configuration scopes.
+### 1. Global runtime configuration: `Smith.configure`
+Use this for shared runtime services:
+- artifact backend
+- tracing
+- pricing catalog
+- logger
+### 2. Agent configuration: `Smith::Agent`
+Use agent classes for invocation behavior:
+- `model`
+- `tools`
+- `instructions`
+- `temperature`
+- `thinking`
+- `budget`
+- `guardrails`
+- `output_schema`
+- `data_volume`
+- `fallback_models`
+- `register_as`
+### 3. Workflow configuration: `Smith::Workflow`
+Use workflow classes for orchestration behavior:
+- `initial_state`
+- `state`
+- `transition`
+- `pipeline`
+- `budget`
+- `max_transitions`
+- `guardrails`
+- `context_manager`
+### If You Are Unsure Where Something Goes
+- "Which model should this agent use?" -> agent class
+- "How do I store artifacts or emit traces?" -> `Smith.configure`
+- "What happens after this step succeeds or fails?" -> workflow class
+- "How many tokens/cost/tool calls can this one invocation use?" -> agent budget
+- "How much total budget can the whole workflow consume?" -> workflow budget
+- "Which provider credentials should the app use?" -> RubyLLM, not Smith
+### Full `Smith.configure` Example
+```ruby
+Smith.configure do |config|
+  config.artifact_store = Smith::Artifacts::Memory.new
+  config.artifact_retention = 3600
+  config.artifact_encryption = :none
+  config.artifact_tenant_isolation = false
+  config.trace_adapter = Smith::Trace::Logger
+  config.trace_transitions = true
+  config.trace_tool_calls = true
+  config.trace_token_usage = true
+  config.trace_cost = true
+  config.trace_fields = {
+    transition: %i[transition from to],
+    tool_call: %i[tool duration]
+  }
+  config.trace_content = false
+  config.trace_retention = 86_400
+  config.trace_tenant_isolation = false
+  config.pricing = {
+    "gpt-4.1-nano" => {
+      input_cost_per_token: 0.0000001,
+      output_cost_per_token: 0.0000004
+    }
+  }
+  config.logger = Logger.new($stdout)
+end
+```
+### What Each `Smith.configure` Setting Is For
+| Setting | What it controls | Typical first use |
+| --- | --- | --- |
+| `artifact_store` | Where large handoff payloads are stored | Start with `Smith::Artifacts::Memory.new` |
+| `artifact_retention` | Default retention window for artifact expiry checks | Set once you have a cleanup policy |
+| `artifact_encryption` | Metadata-level encryption policy flag | Leave at default until you wire a real backend |
+| `artifact_tenant_isolation` | Require namespaced artifact writes | Enable in multi-tenant systems |
+| `trace_adapter` | Where structural traces go | Use `Smith::Trace::Memory` or `Smith::Trace::Logger` first |
+| `trace_transitions` | Emit transition traces | Usually leave on |
+| `trace_tool_calls` | Emit tool call traces | Usually leave on |
+| `trace_token_usage` | Emit usage traces | Useful for budget visibility |
+| `trace_cost` | Emit cost traces | Useful once pricing is configured |
+| `trace_fields` | Allowlist structural trace fields | Use when you want tighter trace output |
+| `trace_content` | Whether content appears in traces | Leave `false` first |
+| `trace_retention` | Trace retention policy hook | Useful when traces leave memory |
+| `trace_tenant_isolation` | Trace multi-tenant isolation flag | Enable in multi-tenant systems |
+| `pricing` | Best-known model-call cost catalog | Add once you care about `total_cost` |
+| `logger` | Smith's runtime logger | Usually the first setting to add |
+| `persistence_adapter` | Adapter for durable workflow state | `:redis`, `:rails_cache`, `:active_record`, `:memory`, or a custom object |
+| `persistence_options` | Per-adapter options (client, namespace, model, columns) | See "Built-In Persistence Adapters" |
+| `persistence_ttl` | Global TTL for persisted state (Integer/Float seconds; nil = no expiry) | Set when long-tail abandoned workflows accumulate in storage |
+| `persistence_retry_policy` | Exponential-backoff policy for transient adapter I/O failures | Defaults to `{ attempts: 3, base_delay: 0.1, max_delay: 1.0 }` |
+| `test_mode` | Auto-select `:memory` adapter when `persistence_adapter` is nil | Enable in `spec_helper.rb` to skip Redis/cache wiring in tests |
+| `openai_api_mode` | `:auto` routes (gpt-5 family + tools + thinking) via `/v1/responses` using Smith's vendored Responses adapter (sync only; streaming over `/v1/responses` is not yet supported); `:off` drops incompatible tools instead | Leave `:auto` (default) unless you need streaming with the (gpt-5 + tools + thinking) combo, in which case set `:off` for graceful tool-dropping |
+| `trace_normalizer` | Emit `:normalizer_decision` trace events from `Smith::Models::Normalizer` | Useful when debugging cross-provider request shaping |
+| `ruby_llm_model_registry` | `:database` to require an AR-backed RubyLLM model registry; `:bundled` for the JSON fallback | Leave at default unless you've migrated to DB-backed |
+### Recommended First Additions
+Add settings in this order:
+1. `config.logger`
+2. `config.trace_adapter`
+3. `config.artifact_store`
+4. `config.pricing`
+Do not start by configuring every advanced switch at once.