llm_conductor 1.4.1 → 1.5.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/README.md +158 -540
- data/docs/README.md +42 -0
- data/docs/custom-parameters.md +352 -0
- data/examples/ollama_params_usage.rb +99 -0
- data/lib/llm_conductor/client_factory.rb +2 -2
- data/lib/llm_conductor/clients/base_client.rb +3 -2
- data/lib/llm_conductor/clients/ollama_client.rb +2 -1
- data/lib/llm_conductor/version.rb +1 -1
- data/lib/llm_conductor.rb +11 -9
- metadata +6 -3
- /data/{VISION_USAGE.md → docs/vision-support.md} +0 -0
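
The headline change in 1.5.0 is custom parameter support: a new `docs/custom-parameters.md` guide, a new `examples/ollama_params_usage.rb` example, and small changes to the client factory, base client, and Ollama client. A minimal sketch of the new usage, based on the README additions in the diff below; the combined `top_p`/`seed` values here are illustrative, and per the README's provider table only Ollama accepts `params:` in this release (other providers are marked 🔜):

```ruby
require 'llm_conductor'

# Pass provider options through the new params: keyword (Ollama-only in 1.5.0).
response = LlmConductor.generate(
  model: 'llama2',
  prompt: 'Write a creative story',
  vendor: :ollama,
  params: { temperature: 0.9, top_p: 0.95, seed: 42 } # illustrative values
)
puts response.output
```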
data/README.md
CHANGED
````diff
@@ -1,461 +1,219 @@
 # LLM Conductor

-A
+A unified Ruby interface for multiple Language Model providers from [Ekohe](https://ekohe.com). Seamlessly integrate OpenAI GPT, Anthropic Claude, Google Gemini, Groq, Ollama, OpenRouter, and Z.ai (Zhipu AI) with a single, consistent API.

 ## Features

-🚀 **Multi-Provider Support** -
-🎯 **Unified
-🖼️ **Vision
-
-
-⚡ **Smart Configuration** -
-💰 **Cost Tracking** - Automatic token counting and cost estimation
-🔧 **Extensible Architecture** - Easy to add new providers and prompt types
-🛡️ **Robust Error Handling** - Comprehensive error handling with detailed metadata
+- 🚀 **Multi-Provider Support** - 7+ LLM providers with automatic vendor detection
+- 🎯 **Unified API** - Same interface across all providers
+- 🖼️ **Vision Support** - Send images alongside text (OpenAI, Anthropic, OpenRouter, Z.ai, Gemini)
+- 🔧 **Custom Parameters** - Fine-tune with temperature, top_p, and more
+- 💰 **Cost Tracking** - Automatic token counting and cost estimation
+- ⚡ **Smart Configuration** - Environment variables or code-based setup

 ## Installation

-Add this line to your application's Gemfile:
-
 ```ruby
 gem 'llm_conductor'
 ```

-And then execute:
-
 ```bash
-
-```
-
-Or install it yourself as:
-
-```bash
-$ gem install llm_conductor
+bundle install
 ```

 ## Quick Start

-### 1. Simple
+### 1. Simple Generation

 ```ruby
-
+require 'llm_conductor'
+
+# Set up your API key (or use ENV variables)
+LlmConductor.configure do |config|
+  config.openai(api_key: 'your-api-key')
+end
+
+# Generate text
 response = LlmConductor.generate(
-  model: 'gpt-
+  model: 'gpt-4o-mini',
   prompt: 'Explain quantum computing in simple terms'
 )

-puts response.output #
-puts response.total_tokens # Token
+puts response.output         # Generated text
+puts response.total_tokens   # Token count
 puts response.estimated_cost # Cost in USD
 ```

-### 2.
+### 2. With Custom Parameters

 ```ruby
-#
+# Control creativity with temperature
 response = LlmConductor.generate(
-  model: '
-
-
-
-  max_length: '20 words',
-  style: 'professional and engaging',
-  focus_areas: ['core business', 'expertise', 'target market'],
-  audience: 'potential investors',
-  include_key_points: true,
-  output_format: 'paragraph'
-  }
+  model: 'llama2',
+  prompt: 'Write a creative story',
+  vendor: :ollama,
+  params: { temperature: 0.9 }
 )
+```

-
-
-
-
-
-
-
-
+### 3. Vision/Multimodal
+
+```ruby
+# Send images with your prompt
+response = LlmConductor.generate(
+  model: 'gpt-4o',
+  prompt: {
+    text: 'What is in this image?',
+    images: ['https://example.com/image.jpg']
+  }
+)
 ```

 ## Configuration

-###
+### Environment Variables (Easiest)

-
+Set these environment variables and the gem auto-configures:

-```
-
-
-
-
-
-
-
-
-# Provider configurations
-config.openai(
-  api_key: ENV['OPENAI_API_KEY'],
-  organization: ENV['OPENAI_ORG_ID'] # Optional
-)
-
-config.anthropic(
-  api_key: ENV['ANTHROPIC_API_KEY']
-)
-
-config.gemini(
-  api_key: ENV['GEMINI_API_KEY']
-)
-
-config.groq(
-  api_key: ENV['GROQ_API_KEY']
-)
-
-config.ollama(
-  base_url: ENV['OLLAMA_ADDRESS'] || 'http://localhost:11434'
-)
-
-config.openrouter(
-  api_key: ENV['OPENROUTER_API_KEY'],
-  uri_base: 'https://openrouter.ai/api/v1' # Optional, this is the default
-)
-
-config.zai(
-  api_key: ENV['ZAI_API_KEY'],
-  uri_base: 'https://api.z.ai/api/paas/v4' # Optional, this is the default
-)
-
-# Optional: Configure custom logger
-config.logger = Logger.new($stdout) # Log to stdout
-config.logger = Logger.new('log/llm_conductor.log') # Log to file
-config.logger = Rails.logger # Use Rails logger (in Rails apps)
-end
+```bash
+export OPENAI_API_KEY=your-key-here
+export ANTHROPIC_API_KEY=your-key-here
+export GEMINI_API_KEY=your-key-here
+export GROQ_API_KEY=your-key-here
+export OLLAMA_ADDRESS=http://localhost:11434 # Optional
+export OPENROUTER_API_KEY=your-key-here
+export ZAI_API_KEY=your-key-here
 ```

-###
-
-LLM Conductor supports flexible logging using Ruby's built-in Logger class. By default, when a logger is configured, it uses the DEBUG log level to provide detailed information during development.
+### Code Configuration

 ```ruby
 LlmConductor.configure do |config|
-
-
-
-
-  config.
-
-
-  config.
-
-  # Option 4: Custom logger with formatting
-  config.logger = Logger.new($stderr).tap do |logger|
-    logger.formatter = proc { |severity, datetime, progname, msg| "#{msg}\n" }
-  end
+  config.default_model = 'gpt-4o-mini'
+
+  config.openai(api_key: ENV['OPENAI_API_KEY'])
+  config.anthropic(api_key: ENV['ANTHROPIC_API_KEY'])
+  config.gemini(api_key: ENV['GEMINI_API_KEY'])
+  config.groq(api_key: ENV['GROQ_API_KEY'])
+  config.ollama(base_url: 'http://localhost:11434')
+  config.openrouter(api_key: ENV['OPENROUTER_API_KEY'])
+  config.zai(api_key: ENV['ZAI_API_KEY'])
 end
 ```

-
+## Supported Providers

-
+| Provider | Auto-Detect | Vision | Custom Params |
+|----------|-------------|--------|---------------|
+| OpenAI (GPT) | ✅ `gpt-*` | ✅ | 🔜 |
+| Anthropic (Claude) | ✅ `claude-*` | ✅ | 🔜 |
+| Google (Gemini) | ✅ `gemini-*` | ✅ | 🔜 |
+| Groq | ✅ `llama/mixtral` | ❌ | 🔜 |
+| Ollama | ✅ (default) | ❌ | ✅ |
+| OpenRouter | 🔧 Manual | ✅ | 🔜 |
+| Z.ai (Zhipu) | ✅ `glm-*` | ✅ | 🔜 |

-
-- `OPENAI_ORG_ID` - OpenAI organization ID (optional)
-- `ANTHROPIC_API_KEY` - Anthropic API key
-- `GEMINI_API_KEY` - Google Gemini API key
-- `GROQ_API_KEY` - Groq API key
-- `OLLAMA_ADDRESS` - Ollama server address
-- `OPENROUTER_API_KEY` - OpenRouter API key
-- `ZAI_API_KEY` - Z.ai (Zhipu AI) API key
+## Common Use Cases

-
+### Simple Q&A

-### OpenAI (Automatic for GPT models)
 ```ruby
 response = LlmConductor.generate(
-  model: 'gpt-
-  prompt: '
+  model: 'gpt-4o-mini',
+  prompt: 'What is Ruby programming language?'
 )
 ```

-###
-```ruby
-response = LlmConductor.generate(
-  model: 'claude-3-5-sonnet-20241022', # Auto-detects Anthropic
-  prompt: 'Your prompt here'
-)
+### Content Summarization

-
+```ruby
 response = LlmConductor.generate(
   model: 'claude-3-5-sonnet-20241022',
-
-
+  type: :summarize_text,
+  data: {
+    text: 'Long article content here...',
+    max_length: '100 words',
+    style: 'professional'
+  }
 )
 ```

-###
-```ruby
-response = LlmConductor.generate(
-  model: 'gemini-2.5-flash', # Auto-detects Gemini
-  prompt: 'Your prompt here'
-)
+### Deterministic Output (Testing)

-# Or explicitly specify vendor
-response = LlmConductor.generate(
-  model: 'gemini-2.5-flash',
-  vendor: :gemini,
-  prompt: 'Your prompt here'
-)
-```
-
-### Groq (Automatic for Llama, Mixtral, Gemma, Qwen models)
 ```ruby
 response = LlmConductor.generate(
-  model: '
-  prompt: '
-
-
-# Supported Groq models
-response = LlmConductor.generate(
-  model: 'mixtral-8x7b-32768', # Auto-detects Groq
-  prompt: 'Your prompt here'
-)
-
-# Or explicitly specify vendor
-response = LlmConductor.generate(
-  model: 'qwen-2.5-72b-instruct',
-  vendor: :groq,
-  prompt: 'Your prompt here'
-)
-```
-
-### Ollama (Default for other models)
-```ruby
-response = LlmConductor.generate(
-  model: 'deepseek-r1',
-  prompt: 'Your prompt here'
+  model: 'llama2',
+  prompt: 'Extract email addresses from: contact@example.com',
+  vendor: :ollama,
+  params: { temperature: 0.0, seed: 42 }
 )
 ```

-###
-OpenRouter provides unified access to various LLM providers with automatic routing. It also supports vision/multimodal models with automatic retry logic for handling intermittent availability issues.
-
-**Vision-capable models:**
-- `nvidia/nemotron-nano-12b-v2-vl:free` - **FREE** 12B vision model (may need retries)
-- `openai/gpt-4o-mini` - Fast and reliable
-- `google/gemini-flash-1.5` - Fast vision processing
-- `anthropic/claude-3.5-sonnet` - High quality analysis
-- `openai/gpt-4o` - Best quality (higher cost)
-
-**Note:** Free-tier models may experience intermittent 502 errors. The client includes automatic retry logic with exponential backoff (up to 5 retries) to handle these transient failures.
+### Vision Analysis

 ```ruby
-# Text-only request
-response = LlmConductor.generate(
-  model: 'nvidia/nemotron-nano-12b-v2-vl:free',
-  vendor: :openrouter,
-  prompt: 'Your prompt here'
-)
-
-# Vision/multimodal request with single image
-response = LlmConductor.generate(
-  model: 'nvidia/nemotron-nano-12b-v2-vl:free',
-  vendor: :openrouter,
-  prompt: {
-    text: 'What is in this image?',
-    images: 'https://example.com/image.jpg'
-  }
-)
-
-# Vision request with multiple images
 response = LlmConductor.generate(
-  model: '
-  vendor: :openrouter,
-  prompt: {
-    text: 'Compare these images',
-    images: [
-      'https://example.com/image1.jpg',
-      'https://example.com/image2.jpg'
-    ]
-  }
-)
-
-# Vision request with detail level
-response = LlmConductor.generate(
-  model: 'nvidia/nemotron-nano-12b-v2-vl:free',
-  vendor: :openrouter,
+  model: 'gpt-4o',
   prompt: {
     text: 'Describe this image in detail',
     images: [
-
+      'https://example.com/photo.jpg',
+      'https://example.com/diagram.png'
     ]
   }
 )
-
-# Advanced: Raw array format (OpenAI-compatible)
-response = LlmConductor.generate(
-  model: 'nvidia/nemotron-nano-12b-v2-vl:free',
-  vendor: :openrouter,
-  prompt: [
-    { type: 'text', text: 'What is in this image?' },
-    { type: 'image_url', image_url: { url: 'https://example.com/image.jpg' } }
-  ]
-)
 ```

-
-- Automatically retries on 502 errors (up to 5 attempts)
-- Exponential backoff: 2s, 4s, 8s, 16s, 32s
-- Transparent to your code - works seamlessly
-- Enable logging to see retry attempts:
+## Response Object

 ```ruby
-LlmConductor.
-  config.logger = Logger.new($stdout)
-  config.logger.level = Logger::INFO
-end
-```
-
-### Z.ai (Zhipu AI) - GLM Models with Vision Support
-Z.ai provides access to GLM (General Language Model) series including the powerful GLM-4.5V multimodal model with 64K context window and vision capabilities.
-
-**Text models:**
-- `glm-4-plus` - Enhanced text-only model
-- `glm-4` - Standard GLM-4 model
-
-**Vision-capable models:**
-- `glm-4.5v` - Latest multimodal model with 64K context ✅ **RECOMMENDED**
-- `glm-4v` - Previous generation vision model
-
-```ruby
-# Text-only request with GLM-4-plus
-response = LlmConductor.generate(
-  model: 'glm-4-plus',
-  vendor: :zai,
-  prompt: 'Explain quantum computing in simple terms'
-)
-
-# Vision request with GLM-4.5V - single image
-response = LlmConductor.generate(
-  model: 'glm-4.5v',
-  vendor: :zai,
-  prompt: {
-    text: 'What is in this image?',
-    images: 'https://example.com/image.jpg'
-  }
-)
-
-# Vision request with multiple images
-response = LlmConductor.generate(
-  model: 'glm-4.5v',
-  vendor: :zai,
-  prompt: {
-    text: 'Compare these images and identify differences',
-    images: [
-      'https://example.com/image1.jpg',
-      'https://example.com/image2.jpg'
-    ]
-  }
-)
-
-# Vision request with detail level
-response = LlmConductor.generate(
-  model: 'glm-4.5v',
-  vendor: :zai,
-  prompt: {
-    text: 'Analyze this document in detail',
-    images: [
-      { url: 'https://example.com/document.jpg', detail: 'high' }
-    ]
-  }
-)
-
-# Base64 encoded local images
-require 'base64'
-image_data = Base64.strict_encode64(File.read('path/to/image.jpg'))
-response = LlmConductor.generate(
-  model: 'glm-4.5v',
-  vendor: :zai,
-  prompt: {
-    text: 'What is in this image?',
-    images: "data:image/jpeg;base64,#{image_data}"
-  }
-)
-```
-
-**GLM-4.5V Features:**
-- 64K token context window
-- Multimodal understanding (text + images)
-- Document understanding and OCR
-- Image reasoning and analysis
-- Base64 image support for local files
-- OpenAI-compatible API format
-
-### Vendor Detection
-
-The gem automatically detects the appropriate provider based on model names:
+response = LlmConductor.generate(...)

-
-
-
-
-
-
+response.output          # String - Generated text
+response.success?        # Boolean - Success status
+response.model           # String - Model used
+response.input_tokens    # Integer - Input token count
+response.output_tokens   # Integer - Output token count
+response.total_tokens    # Integer - Total tokens
+response.estimated_cost  # Float - Cost in USD (if available)
+response.metadata        # Hash - Additional info

-
+# Parse JSON responses
+response.parse_json # Hash - Parsed JSON output

-
-response
-  model: 'llama-3.1-70b-versatile',
-  vendor: :groq, # Explicitly use Groq
-  prompt: 'Your prompt here'
-)
+# Extract code blocks
+response.extract_code_block('ruby') # String - Code content
 ```

 ## Advanced Features

-###
+### Custom Prompt Classes

-Create reusable, testable prompt
+Create reusable, testable prompt templates:

 ```ruby
-class
+class AnalysisPrompt < LlmConductor::Prompts::BasePrompt
   def render
     <<~PROMPT
-
-
-
-
-      Please analyze this company and provide:
-      1. Core business model
-      2. Target market
-      3. Competitive advantages
-      4. Growth potential
-
-      Format as JSON.
+      Analyze: #{title}
+      Content: #{truncate_text(content, max_length: 500)}
+
+      Provide insights in JSON format.
     PROMPT
   end
 end

-# Register
-LlmConductor::PromptManager.register(:
+# Register and use
+LlmConductor::PromptManager.register(:analyze, AnalysisPrompt)

-# Use the registered prompt
 response = LlmConductor.generate(
-  model: 'gpt-
-  type: :
-  data: {
-    name: 'Ekohe',
-    domain_name: 'ekohe.com',
-    description: 'A leading AI company...'
-  }
+  model: 'gpt-4o-mini',
+  type: :analyze,
+  data: { title: 'Article', content: '...' }
 )
-
-# Parse structured responses
-analysis = response.parse_json
-puts analysis
 ```

-###
+### Data Builder Pattern

 Structure complex data for LLM consumption:

@@ -463,236 +221,96 @@ Structure complex data for LLM consumption:
 class CompanyDataBuilder < LlmConductor::DataBuilder
   def build
     {
-      id: source_object.id,
       name: source_object.name,
       description: format_for_llm(source_object.description, max_length: 500),
-      industry: extract_nested_data(:data, 'categories', 'primary'),
       metrics: build_metrics,
-      summary: build_company_summary
-      domain_name: source_object.domain_name
-
+      summary: build_company_summary
     }
   end
-
+
   private
-
+
   def build_metrics
     {
       employees: format_number(source_object.employee_count),
-      revenue: format_number(source_object.annual_revenue)
-      growth_rate: "#{source_object.growth_rate}%"
+      revenue: format_number(source_object.annual_revenue, format: :currency)
     }
   end
-
-  def build_company_summary
-    name = safe_extract(:name, default: 'Company')
-    industry = extract_nested_data(:data, 'categories', 'primary')
-    "#{name} is a #{industry} company..."
-  end
 end
-
-# Usage
-company = Company.find(123)
-data = CompanyDataBuilder.new(company).build
-
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  type: :detailed_analysis,
-  data: data
-)
-```
-
-### 3. Built-in Prompt Templates
-
-#### Featured Links Extraction
-```ruby
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  type: :featured_links,
-  data: {
-    htmls: '<html>...</html>',
-    current_url: 'https://example.com'
-  }
-)
-```
-
-#### HTML Summarization
-```ruby
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  type: :summarize_htmls,
-  data: { htmls: '<html>...</html>' }
-)
-```
-
-#### Description Summarization
-```ruby
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  type: :summarize_description,
-  data: {
-    name: 'Company Name',
-    description: 'Long description...',
-    industries: ['Tech', 'AI']
-  }
-)
 ```

-
-```ruby
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  type: :custom,
-  data: {
-    template: "Analyze this data: %{data}",
-    data: "Your data here"
-  }
-)
-```
-
-### 4. Response Object
-
-All methods return a rich `LlmConductor::Response` object:
+### Error Handling

 ```ruby
 response = LlmConductor.generate(...)

-# Main content
-response.output # Generated text
-response.success? # Boolean success status
-
-# Token information
-response.input_tokens # Input tokens used
-response.output_tokens # Output tokens generated
-response.total_tokens # Total tokens
-
-# Cost tracking (for supported models)
-response.estimated_cost # Estimated cost in USD
-
-# Metadata
-response.model # Model used
-response.metadata # Hash with vendor, timestamp, etc.
-
-# Structured data parsing
-response.parse_json # Parse as JSON
-response.extract_code_block('json') # Extract code blocks
-```
-
-### 5. Error Handling
-
-The gem provides comprehensive error handling:
-
-```ruby
-response = LlmConductor.generate(
-  model: 'gpt-5-mini',
-  prompt: 'Your prompt'
-)
-
 if response.success?
   puts response.output
 else
   puts "Error: #{response.metadata[:error]}"
-  puts "
-end
-
-# Exception handling for critical errors
-begin
-  response = LlmConductor.generate(...)
-rescue LlmConductor::Error => e
-  puts "LLM Conductor error: #{e.message}"
-rescue StandardError => e
-  puts "General error: #{e.message}"
+  puts "Error class: #{response.metadata[:error_class]}"
 end
 ```

-##
-
-### Adding Custom Clients
-
-```ruby
-module LlmConductor
-  module Clients
-    class CustomClient < BaseClient
-      private
-
-      def generate_content(prompt)
-        # Implement your provider's API call
-        your_custom_api.generate(prompt)
-      end
-    end
-  end
-end
-```
+## Documentation

-
-
-
-module LlmConductor
-  module Prompts
-    def prompt_custom_analysis(data)
-      <<~PROMPT
-        Custom analysis for: #{data[:subject]}
-        Context: #{data[:context]}
-
-        Please provide detailed analysis.
-      PROMPT
-    end
-  end
-end
-```
+- **[Custom Parameters Guide](docs/custom-parameters.md)** - Temperature, top_p, and more
+- **[Vision Support Guide](docs/vision-support.md)** - Using images with LLMs
+- **[Examples](examples/)** - Working code examples for all providers

 ## Examples

-Check the
+Check the [examples/](examples/) directory for comprehensive examples:

 - `simple_usage.rb` - Basic text generation
+- `ollama_params_usage.rb` - Custom parameters with Ollama
+- `gpt_vision_usage.rb` - Vision with OpenAI
+- `claude_vision_usage.rb` - Vision with Anthropic
+- `gemini_vision_usage.rb` - Vision with Gemini
+- `openrouter_vision_usage.rb` - Vision with OpenRouter
+- `zai_usage.rb` - Using Z.ai GLM models
+- `data_builder_usage.rb` - Data builder patterns
 - `prompt_registration.rb` - Custom prompt classes
-- `
-- `rag_usage.rb` - RAG implementation examples
-- `gemini_usage.rb` - Google Gemini integration
-- `groq_usage.rb` - Groq integration with various models
-- `openrouter_vision_usage.rb` - OpenRouter vision/multimodal examples
-- `zai_usage.rb` - Z.ai GLM-4.5V vision and text examples
+- `rag_usage.rb` - Retrieval-Augmented Generation

-
+Run any example:

-
+```bash
+ruby examples/simple_usage.rb
+```
+
+## Development

 ```bash
-#
-
+# Clone and setup
+git clone https://github.com/ekohe/llm-conductor.git
+cd llm-conductor
+bundle install

-# Run tests
-
+# Run tests
+bundle exec rspec

-# Run
-rubocop
+# Run linter
+bundle exec rubocop

 # Interactive console
 bin/console
 ```

-## Testing
-
-The gem includes comprehensive test coverage with unit, integration, and performance tests.
-
-## Performance
-
-- **Token Efficiency**: Automatic prompt optimization and token counting
-- **Cost Tracking**: Real-time cost estimation for all supported models
-- **Response Caching**: Built-in mechanisms to avoid redundant API calls
-- **Async Support**: Ready for async/background processing
-
 ## Contributing

-
-
-1. Fork the repository
+1. Fork it
 2. Create your feature branch (`git checkout -b my-new-feature`)
 3. Commit your changes (`git commit -am 'Add some feature'`)
 4. Push to the branch (`git push origin my-new-feature`)
-5. Create
+5. Create new Pull Request
+
+Ensure tests pass and RuboCop is clean before submitting.

 ## License

-The gem is available as open source under the terms of the [MIT License](
+The gem is available as open source under the terms of the [MIT License](LICENSE).
+
+## Credits
+
+Developed with ❤️ by [Ekohe](https://ekohe.com) - Making AI practical, achievable, and useful.
````