llm_conductor 0.1.0

@@ -0,0 +1,127 @@
+# frozen_string_literal: true
+
+module LlmConductor
+  # Collection of pre-built prompt templates for common LLM tasks including
+  # content analysis, link extraction, and data summarization.
+  module Prompts
+    def prompt_featured_links(data)
+      <<~PROMPT
+        You are an AI assistant tasked with analyzing a webpage's HTML content to extract the most valuable links. Your goal is to identify links related to features, products, solutions, pricing, and social media profiles, prioritizing those from the same domain as the current page. Here are your instructions:
+
+        - You will be provided with the HTML content of the current page in the following format:
+        <page_html>
+        #{data[:htmls]}
+        </page_html>
+
+        - Parse the HTML content and extract all hyperlinks (`a href` attributes). Pay special attention to links in the navigation menu, footer, and main content areas.
+
+        - Filter and prioritize the extracted links based on the following criteria:
+          a. The link must be from the same domain as the current URL.
+          b. Prioritize links containing keywords such as "features", "products", "solutions", "pricing", "about", "contact", or similar variations.
+          c. Include social media profile links (e.g., LinkedIn, Instagram, Twitter, Facebook) if available.
+          d. Exclude links to login pages, search pages, or other utility pages.
+
+        - Select the top 3 most valuable links based on the above criteria.
+
+        - Format your output as a JSON array of strings, where each string is a full URL. Use the following format:
+        <output_format>
+        ["https://example.com/about-us", "https://example.com/products", "https://example.com/services"]
+        </output_format>
+
+        - The links must be from the same domain as the following URL:
+        <domain>
+        #{data[:current_url]}
+        </domain>
+
+        If fewer than 3 relevant links are found, include only the available links in the output array.
+
+        Remember to use the full URL for each link, including the domain name. If you encounter relative URLs, combine them with the domain from the current URL to create absolute URLs.
+
+        Provide your final output without any additional explanation or commentary.
+      PROMPT
+    end
+
+    def prompt_summarize_htmls(data)
+      <<~PROMPT
+        Extract useful information from the webpage, including the domain, a detailed description of what the company does, founding year, country, business model, product description and features, customers and partners, development stage, and social media links. Output the result as JSON.
+
+        You are tasked with extracting useful information about a company from a given webpage content. Your goal is to analyze the content and extract specific details about the company, its products, and its operations.
+
+        You will be provided with raw HTML content in the following format:
+
+        <html_content>
+        #{data[:htmls]}
+        </html_content>
+
+        Carefully read through the webpage content and extract the following information about the company:
+
+        - Name (field name): The company's name
+        - Domain name (field domain_name): The company's domain
+        - Description (field description): A comprehensive explanation of what the company does
+        - Country (field country): The company's country
+        - Region (field region): The company's region
+        - Location (field location): The company's location
+        - Founded on (field founded_on): The year the company was established
+        - Business model (field business_model): How the company generates revenue
+        - Product description (field product_description): A brief overview of the company's main product(s) or service(s)
+        - Product features (field product_features): Key features or capabilities of the product(s) or service(s)
+        - Customers and partners (field customers_and_partners): Notable clients or business partners
+        - Development stage (field development_stage): The current phase of the company (e.g., startup, growth, established)
+        - Social media links (field social_media_links): URLs to the company's social media profiles
+          - instagram_url
+          - linkedin_url
+          - twitter_url
+
+        If any of the above information is not available in the webpage content, use "Not available" as the value for that field.
+
+        Present your findings in a JSON format. Here's an example of the expected structure:
+
+        <output_format>
+        {
+          "name": "XYZ Company",
+          "domain_name": "example.com",
+          "description": "XYZ Company develops AI chatbots that help businesses automate customer support...",
+          "founded_on": 2018,
+          "country": "United States",
+          "region": "SA",
+          "location": "SFO",
+          "business_model": "SaaS subscription",
+          "product_description": "AI-powered chatbot platform for customer service automation",
+          "product_features": ["Natural language processing", "Multi-language support", "Integration with CRM systems"],
+          "customers_and_partners": ["ABC Corp", "123 Industries", "Big Tech Co."],
+          "development_stage": "Growth",
+          "social_media_links": {
+            "linkedin_url": "https://www.linkedin.com/company/xyzcompany",
+            "twitter_url": "https://twitter.com/xyzcompany",
+            "instagram_url": "https://www.instagram.com/xyzcompany"
+          }
+        }
+        </output_format>
+
+        Remember to use only the information provided in the webpage content. Do not include any external information or make assumptions beyond what is explicitly stated or strongly implied in the given content.
+
+        Present your final output in JSON format, enclosed within <json_output> tags.
+      PROMPT
+    end
+
+    def prompt_summarize_description(data)
+      <<~PROMPT
+        Given the company's name, domain, description, and a list of industry-related keywords,
+        please summarize the company's core business and identify the three most relevant industries.
+        Highlight the company's unique value proposition, its primary market focus,
+        and any distinguishing features that set it apart within the identified industries.
+        Be as objective as possible.
+
+        Name: #{data[:name]}
+        Domain Name: #{data[:domain_name]}
+        Industry: #{data[:industries]}
+        Description: #{data[:description]}
+      PROMPT
+    end
+
+    def prompt_custom(data)
+      template = data.fetch(:template)
+      template % data
+    end
+  end
+end
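The `prompt_custom` template above leans on Ruby's `String#%` hash interpolation: `%{key}` placeholders in the template are filled from the same `data` hash that carries the template itself, and unreferenced keys are simply ignored. A minimal standalone sketch (the template text and keys below are illustrative, not from the gem):

```ruby
# Standalone stand-in mirroring LlmConductor::Prompts#prompt_custom.
module PromptsSketch
  def self.prompt_custom(data)
    template = data.fetch(:template) # raises KeyError if :template is missing
    template % data                  # %{key} placeholders pulled from data; extra keys ignored
  end
end

prompt = PromptsSketch.prompt_custom(
  template: 'Summarize %{name} (%{domain_name}) in one sentence.',
  name: 'Acme Corp',
  domain_name: 'acme.example'
)
puts prompt # => "Summarize Acme Corp (acme.example) in one sentence."
```

Note that `String#%` raises `KeyError` when a `%{key}` placeholder has no matching hash key, which surfaces template/data mismatches early.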
@@ -0,0 +1,86 @@
+# frozen_string_literal: true
+
+module LlmConductor
+  # Response object that encapsulates the result of LLM generation
+  # with metadata like token usage and cost information
+  class Response
+    attr_reader :output, :input_tokens, :output_tokens, :metadata, :model
+
+    def initialize(output:, model:, input_tokens: nil, output_tokens: nil, metadata: {})
+      @output = output
+      @model = model
+      @input_tokens = input_tokens
+      @output_tokens = output_tokens
+      @metadata = metadata || {}
+    end
+
+    def total_tokens
+      (@input_tokens || 0) + (@output_tokens || 0)
+    end
+
+    # Calculate estimated cost based on model and token usage
+    def estimated_cost
+      return nil unless valid_for_cost_calculation?
+
+      pricing = model_pricing
+      return nil unless pricing
+
+      calculate_cost(pricing[:input_rate], pricing[:output_rate])
+    end
+
+    # Check if the response was successful
+    def success?
+      !@output.nil? && !@output.empty? && @metadata[:error].nil?
+    end
+
+    # Get metadata with cost included if available
+    def metadata_with_cost
+      cost = estimated_cost
+      cost ? @metadata.merge(cost:) : @metadata
+    end
+
+    # Parse JSON from the output
+    def parse_json
+      return nil unless success? && @output
+
+      JSON.parse(@output.strip)
+    rescue JSON::ParserError => e
+      raise JSON::ParserError, "Failed to parse JSON response: #{e.message}"
+    end
+
+    # Extract text between code blocks
+    def extract_code_block(language = nil)
+      return nil unless @output
+
+      pattern = if language
+                  /```#{Regexp.escape(language)}\s*(.*?)```/m
+                else
+                  /```(?:\w*)\s*(.*?)```/m
+                end
+
+      match = @output.match(pattern)
+      match ? match[1].strip : nil
+    end
+
+    private
+
+    def valid_for_cost_calculation?
+      @model && total_tokens.positive?
+    end
+
+    def model_pricing
+      case @model
+      when /gpt-3\.5-turbo/
+        { input_rate: 0.0000015, output_rate: 0.000002 }
+      when /gpt-4o-mini/
+        { input_rate: 0.000000150, output_rate: 0.0000006 }
+      when /gpt-4/
+        { input_rate: 0.00003, output_rate: 0.00006 }
+      end
+    end
+
+    def calculate_cost(input_rate, output_rate)
+      (@input_tokens || 0) * input_rate + (@output_tokens || 0) * output_rate
+    end
+  end
+end
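Two details of `Response` worth noting: `parse_json` expects the raw output to be bare JSON, so a fenced reply must first go through `extract_code_block`; and `estimated_cost` is plain per-token arithmetic against the pricing table. A trimmed, self-contained sketch of just those pieces (not the full class, and the sample output string is invented for illustration):

```ruby
require 'json'

# Trimmed copy of the Response helpers exercised below; the per-token rates
# mirror the gpt-4o-mini row of the pricing table in the class above.
class ResponseSketch
  def initialize(output:, input_tokens:, output_tokens:)
    @output = output
    @input_tokens = input_tokens
    @output_tokens = output_tokens
  end

  # Pull the body out of a ``` fenced block, optionally filtered by language tag.
  def extract_code_block(language = nil)
    pattern = if language
                /```#{Regexp.escape(language)}\s*(.*?)```/m
              else
                /```(?:\w*)\s*(.*?)```/m
              end
    match = @output.match(pattern)
    match ? match[1].strip : nil
  end

  # tokens * per-token USD rate, as in Response#estimated_cost.
  def estimated_cost
    @input_tokens * 0.000000150 + @output_tokens * 0.0000006
  end
end

resp = ResponseSketch.new(
  output: "```json\n{\"name\": \"Acme\"}\n```",
  input_tokens: 1000,
  output_tokens: 500
)

json_text = resp.extract_code_block('json') # => '{"name": "Acme"}'
data = JSON.parse(json_text)
puts data['name']                           # => Acme
puts format('%.6f', resp.estimated_cost)    # 1000 in + 500 out ~= $0.000450
```

Calling `JSON.parse` directly on the fenced string would raise `JSON::ParserError`, which is why the extract-then-parse order matters.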
@@ -0,0 +1,5 @@
+# frozen_string_literal: true
+
+module LlmConductor
+  VERSION = '0.1.0'
+end
@@ -0,0 +1,76 @@
+# frozen_string_literal: true
+
+require_relative 'llm_conductor/version'
+require_relative 'llm_conductor/configuration'
+require_relative 'llm_conductor/response'
+require_relative 'llm_conductor/data_builder'
+require_relative 'llm_conductor/prompts'
+require_relative 'llm_conductor/prompts/base_prompt'
+require_relative 'llm_conductor/prompt_manager'
+require_relative 'llm_conductor/clients/base_client'
+require_relative 'llm_conductor/clients/gpt_client'
+require_relative 'llm_conductor/clients/ollama_client'
+require_relative 'llm_conductor/clients/openrouter_client'
+require_relative 'llm_conductor/client_factory'
+
+# LLM Conductor provides a unified interface for multiple Language Model providers
+# including OpenAI GPT, OpenRouter, and Ollama with built-in prompt templates,
+# token counting, and extensible client architecture.
+module LlmConductor
+  class Error < StandardError; end
+
+  # Main entry point for creating LLM clients
+  def self.build_client(model:, type:, vendor: nil)
+    ClientFactory.build(model:, type:, vendor:)
+  end
+
+  # Unified generate method supporting both simple prompts and legacy template-based generation
+  def self.generate(model: nil, prompt: nil, type: nil, data: nil, vendor: nil)
+    if prompt && !type && !data
+      generate_simple_prompt(model:, prompt:, vendor:)
+    elsif type && data && !prompt
+      generate_with_template(model:, type:, data:, vendor:)
+    else
+      raise ArgumentError,
+            "Invalid arguments. Use either: generate(prompt: 'text') or generate(type: :custom, data: {...})"
+    end
+  end
+
+  class << self
+    private
+
+    def generate_simple_prompt(model:, prompt:, vendor:)
+      model ||= configuration.default_model
+      vendor ||= ClientFactory.determine_vendor(model)
+      client_class = client_class_for_vendor(vendor)
+      client = client_class.new(model:, type: :direct)
+      client.generate_simple(prompt:)
+    end
+
+    def generate_with_template(model:, type:, data:, vendor:)
+      client = build_client(model:, type:, vendor:)
+      client.generate(data:)
+    end
+
+    def client_class_for_vendor(vendor)
+      case vendor
+      when :openai, :gpt then Clients::GptClient
+      when :openrouter then Clients::OpenrouterClient
+      when :ollama then Clients::OllamaClient
+      else
+        raise ArgumentError, "Unsupported vendor: #{vendor}. Supported vendors: openai, openrouter, ollama"
+      end
+    end
+  end
+
+  # List of supported vendors
+  SUPPORTED_VENDORS = %i[openai openrouter ollama].freeze
+
+  # List of supported prompt types
+  SUPPORTED_PROMPT_TYPES = %i[
+    featured_links
+    summarize_htmls
+    summarize_description
+    custom
+  ].freeze
+end
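The `LlmConductor.generate` entry point above accepts two mutually exclusive calling styles: a bare `prompt:` string, or a `type:` plus `data:` pair for template-based generation; anything else raises `ArgumentError`. A standalone sketch of just that dispatch rule (the return values are placeholders standing in for the real client calls, which need network access and configuration):

```ruby
# Sketch of LlmConductor.generate's argument dispatch: either a bare prompt,
# or a template type plus data hash -- never both, never neither.
def generate_dispatch(prompt: nil, type: nil, data: nil)
  if prompt && !type && !data
    [:simple, prompt]   # would call client.generate_simple(prompt:)
  elsif type && data && !prompt
    [:template, type]   # would call client.generate(data:)
  else
    raise ArgumentError,
          "Invalid arguments. Use either: generate(prompt: 'text') or generate(type: :custom, data: {...})"
  end
end

p generate_dispatch(prompt: 'Say hi')                       # => [:simple, "Say hi"]
p generate_dispatch(type: :custom, data: { template: 'x' }) # => [:template, :custom]
```

One subtlety: an empty hash is truthy in Ruby, so `generate(type: :custom, data: {})` takes the template branch; only `nil` arguments steer the dispatch.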
@@ -0,0 +1,4 @@
+module LlmConductor
+  VERSION: String
+  # See the writing guide of rbs: https://github.com/ruby/rbs#guides
+end
metadata ADDED
@@ -0,0 +1,157 @@
+--- !ruby/object:Gem::Specification
+name: llm_conductor
+version: !ruby/object:Gem::Version
+  version: 0.1.0
+platform: ruby
+authors:
+- Ben Zheng
+bindir: exe
+cert_chain: []
+date: 2025-09-19 00:00:00.000000000 Z
+dependencies:
+- !ruby/object:Gem::Dependency
+  name: activesupport
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - ">="
+      - !ruby/object:Gem::Version
+        version: '6.0'
+  type: :runtime
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - ">="
+      - !ruby/object:Gem::Version
+        version: '6.0'
+- !ruby/object:Gem::Dependency
+  name: ollama-ai
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.3'
+  type: :runtime
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.3'
+- !ruby/object:Gem::Dependency
+  name: ruby-openai
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '7.0'
+  type: :runtime
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '7.0'
+- !ruby/object:Gem::Dependency
+  name: tiktoken_ruby
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: 0.0.7
+  type: :runtime
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: 0.0.7
+- !ruby/object:Gem::Dependency
+  name: rubocop-performance
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.19'
+  type: :development
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '1.19'
+- !ruby/object:Gem::Dependency
+  name: rubocop-rspec
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '3.0'
+  type: :development
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '3.0'
+description: LLM Conductor provides a clean, unified interface for working with multiple
+  Language Model providers including OpenAI GPT, OpenRouter, and Ollama. Features
+  include prompt templating, token counting, and extensible client architecture.
+email:
+- ben@ekohe.com
+executables: []
+extensions: []
+extra_rdoc_files: []
+files:
+- ".DS_Store"
+- ".rspec"
+- ".rubocop.yml"
+- ".rubocop_todo.yml"
+- ".ruby-version"
+- README.md
+- Rakefile
+- config/initializers/llm_conductor.rb
+- examples/data_builder_usage.rb
+- examples/prompt_registration.rb
+- examples/rag_usage.rb
+- examples/simple_usage.rb
+- lib/llm_conductor.rb
+- lib/llm_conductor/client_factory.rb
+- lib/llm_conductor/clients/base_client.rb
+- lib/llm_conductor/clients/gpt_client.rb
+- lib/llm_conductor/clients/ollama_client.rb
+- lib/llm_conductor/clients/openrouter_client.rb
+- lib/llm_conductor/configuration.rb
+- lib/llm_conductor/data_builder.rb
+- lib/llm_conductor/prompt_manager.rb
+- lib/llm_conductor/prompts.rb
+- lib/llm_conductor/prompts/base_prompt.rb
+- lib/llm_conductor/response.rb
+- lib/llm_conductor/version.rb
+- sig/llm_conductor.rbs
+homepage: https://github.com/ekohe/llm_conductor
+licenses: []
+metadata:
+  allowed_push_host: https://rubygems.org
+  homepage_uri: https://github.com/ekohe/llm_conductor
+  source_code_uri: https://github.com/ekohe/llm_conductor
+  changelog_uri: https://github.com/ekohe/llm_conductor/blob/main/CHANGELOG.md
+  rubygems_mfa_required: 'true'
+rdoc_options: []
+require_paths:
+- lib
+required_ruby_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: 3.1.0
+required_rubygems_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: '0'
+requirements: []
+rubygems_version: 3.6.2
+specification_version: 4
+summary: A flexible Ruby gem for orchestrating multiple LLM providers with unified
+  interface
+test_files: []