groq 0.2.0 → 0.3.0

checksums.yaml CHANGED
@@ -1,7 +1,7 @@
  ---
  SHA256:
- metadata.gz: 8d1461971dedb839a98ceba16edeec695a3fbc48216295314e6c319e5976f621
- data.tar.gz: ac0437a0a14d79c9faab3c88054100928970606e90997187c6e908e67a67dc8c
+ metadata.gz: af8963c428e4a7760f76a17a48d6c92cdae50363c96d6e97eef21293a3321beb
+ data.tar.gz: 92d619e893e9fa727c76f42c85f095cc1e874ba78474840f7c6b1d139d691077
  SHA512:
- metadata.gz: 422b5c160196127928397e568aa15e76dc4f63d1388391bce9cae4ad4d6d0b0fb4063fb52126f04a5b32667179532ea62af96f64acb785ffffed875d4c0646cb
- data.tar.gz: a537f489dedaa533e9fdb444c6e6d3007dab7164c8306260b0409cd5d2b8bf8802373320d7a79016f04a8e71295845f2504162aad8d7f4997db30c2cad7e32f5
+ metadata.gz: bc25feb5cfdaf37955932e59f079568ce4f135dcd61eade7d13bf1d62777fec08642f246bbe2b90d4b0c0ffaaa28e9bc18092814cd19b2da3f9c06486d713417
+ data.tar.gz: 2c436ded0ab2152c5625a7b0690942b7d24b71cb36aef56a95984a5faa31ee8b959230f7f5389c4941ed613f19608971a00c415c22b7b6199d04f6a8674f2a14
data/README.md CHANGED
@@ -60,16 +60,12 @@ JSON.parse(response["content"])

  Install the gem and add to the application's Gemfile by executing:

- ```plain
- bundle add groq
+ > bundle add groq
  ```
-
  If bundler is not being used to manage dependencies, install the gem by executing:

- ```plain
- gem install groq
+ > gem install groq
  ```
-
  ## Usage

  - Get your API key from [console.groq.com/keys](https://console.groq.com/keys)
@@ -105,10 +101,8 @@ client.chat([

  ### Interactive console (IRb)

- ```plain
- bin/console
+ > bin/console
  ```
-
  This repository has a `bin/console` script to start an interactive console to play with the Groq API. The `@client` variable is set up using the `$GROQ_API_KEY` environment variable, and the `U`, `A`, `T` helpers are already included.

  ```ruby
@@ -190,9 +184,8 @@ end
  The output might look similar to:

  ```plain
- User message: Hello, world!
+ > User message: Hello, world!
  Assistant reply with model llama3-8b-8192:
- {"role"=>"assistant", "content"=>"Hello, world! It's great to meet you! Is there something I can help you with, or would you like to chat?"}
  Assistant reply with model llama3-70b-8192:
  {"role"=>"assistant", "content"=>"The classic \"Hello, world!\" It's great to see you here! Is there something I can help you with, or would you like to just chat?"}
  Assistant reply with model llama2-70b-4096:
@@ -227,6 +220,33 @@ JSON.parse(response["content"])
  # => {"number"=>7}
  ```

+ ### Using dry-schema with JSON mode
+
+ As a bonus, the `S` or `System` helper can take a `json_schema:` argument, and the system message will include the `JSON` keyword and the formatted schema in its content.
+
+ For example, if you're using [dry-schema](https://dry-rb.org/gems/dry-schema/1.13/extensions/json_schema/) with its `:json_schema` extension, you can use Ruby to describe a JSON schema.
+
+ ```ruby
+ require "dry-schema"
+ Dry::Schema.load_extensions(:json_schema)
+
+ person_schema_defn = Dry::Schema.JSON do
+   required(:name).filled(:string)
+   optional(:age).filled(:integer)
+   optional(:email).filled(:string)
+ end
+ person_schema = person_schema_defn.json_schema
+
+ response = @client.chat([
+   S("You're excellent at extracting personal information", json_schema: person_schema),
+   U("I'm Dr Nic and I'm almost 50.")
+ ], json: true)
+ JSON.parse(response["content"])
+ # => {"name"=>"Dr Nic", "age"=>49}
+ ```
+
+ NOTE: `bin/console` already loads the `dry-schema` library and the `json_schema` extension because it's handy.
+
  ### Tools/Functions

  LLMs increasingly support deferring to tools or functions to fetch data, perform calculations, or store structured data. Groq Cloud in turn supports their tool implementations through its API.
@@ -309,6 +329,273 @@ end
  @client.chat("Hello, world!", max_tokens: 512, temperature: 0.5)
  ```

+ ### Debugging API calls
+
+ The underlying HTTP library being used is faraday, and you can enable debugging or configure other faraday internals by passing a block to the `Groq::Client.new` constructor.
+
+ ```ruby
+ require 'logger'
+
+ # Create a logger instance
+ logger = Logger.new(STDOUT)
+ logger.level = Logger::DEBUG
+
+ @client = Groq::Client.new do |faraday|
+   # Log request and response bodies
+   faraday.response :logger, logger, bodies: true
+ end
+ ```
+
+ If you pass `--debug` to `bin/console`, this logger will be set up for you.
+
+ ```plain
+ bin/console --debug
+ ```
+
+ ### Streaming
+
+ If your AI assistant's responses are being telecast live to a human, then that human might want some progressive responses. The Groq API supports streaming responses.
+
+ Pass a block to `chat()` with either one or two arguments.
+
+ 1. The first argument is the string content chunk of the response.
+ 2. The optional second argument is the full response object from the API, containing extra metadata.
+
+ The final block call will be the last chunk of the response:
+
+ 1. The first argument will be `nil`.
+ 2. The optional second argument, the full response object, contains a summary of the Groq API usage, such as prompt tokens, prompt time, etc.
+
+ ```ruby
+ puts "🍕 "
+ messages = [
+   S("You are a pizza sales person."),
+   U("What do you sell?")
+ ]
+ @client.chat(messages) do |content|
+   print content
+ end
+ puts
+ ```
+
+ Each chunk of the response will be printed to the console as it is received. It will look pretty.
+
+ The default `llama3-8b-8192` model is very, very fast and you might not see any streaming. Try a slower model like `llama3-70b-8192` or `mixtral-8x7b-32768`.
+
+ ```ruby
+ @client = Groq::Client.new(model_id: "llama3-70b-8192")
+ @client.chat("Write a long poem about patience") do |content|
+   print content
+ end
+ puts
+ ```
+
+ Your block can take a second argument to receive the full response JSON object:
+
+ ```ruby
+ @client.chat("Write a long poem about patience") do |content, response|
+   pp content
+   pp response
+ end
+ ```
+
+ Alternatively, you can pass a `Proc`, or any object that responds to `call`, via a `stream:` keyword argument:
+
+ ```ruby
+ @client.chat("Write a long poem about patience", stream: ->(content) { print content })
+ ```
+
+ You can also use a class with a `call` method that takes either one or two arguments, as with the `Proc` discussion above.
+
+ ```ruby
+ class MessageBits
+   def initialize(emoji)
+     print "#{emoji} "
+     @bits = []
+   end
+
+   def call(content)
+     if content.nil?
+       puts
+     else
+       print(content)
+       @bits << content
+     end
+   end
+
+   def to_s
+     @bits.join("")
+   end
+
+   def to_assistant_message
+     Assistant(to_s)
+   end
+ end
+
+ bits = MessageBits.new("🍕")
+ @client.chat("Write a long poem about pizza", stream: bits)
+ ```
+
439
+ ## Examples
440
+
441
+ Here are some example uses of Groq, of the `groq` gem and its syntax.
442
+
443
+ Also, see the [`examples/`](examples/) folder for more example apps.
444
+
445
+ ### Pizzeria agent
446
+
447
+ Talking with a pizzeria.
448
+
449
+ Our pizzeria agent can be as simple as a function that combines a system message and the current messages array:
450
+
451
+ ```ruby
452
+ @agent_message = <<~EOS
453
+ You are an employee at a pizza store.
454
+
455
+ You sell hawaiian, and pepperoni pizzas; in small and large sizes for $10, and $20 respectively.
456
+
457
+ Pick up only in. Ready in 10 mins. Cash on pickup.
458
+ EOS
459
+
460
+ def chat_pizza_agent(messages)
461
+ @client.chat([
462
+ System(@agent_message),
463
+ *messages
464
+ ])
465
+ end
466
+ ```
467
+
468
+ Now for our first interaction:
469
+
470
+ ```ruby
471
+ messages = [U("Is this the pizza shop? Do you sell hawaiian?")]
472
+
473
+ response = chat_pizza_agent(messages)
474
+ puts response["content"]
475
+ ```
476
+
477
+ The output might be:
478
+
479
+ > Yeah! This is the place! Yes, we sell Hawaiian pizzas here! We've got both small and large sizes available for you. The small Hawaiian pizza is $10, and the large one is $20. Plus, because we're all about getting you your pizza fast, our pick-up time is only 10 minutes! So, what can I get for you today? Would you like to order a small or large Hawaiian pizza?
480
+
481
+ Continue with user's reply.
482
+
483
+ Note, we build the `messages` array with the previous user and assistant messages and the new user message:
484
+
485
+ ```ruby
486
+ messages << response << U("Yep, give me a large.")
487
+ response = chat_pizza_agent(messages)
488
+ puts response["content"]
489
+ ```
490
+
491
+ Response:
492
+
493
+ > I'll get that ready for you. So, to confirm, you'd like to order a large Hawaiian pizza for $20, and I'll have it ready for you in 10 minutes. When you come to pick it up, please have the cash ready as we're a cash-only transaction. See you in 10!
494
+
495
+ Making a change:
496
+
497
+ ```ruby
498
+ messages << response << U("Actually, make it two smalls.")
499
+ response = chat_pizza_agent(messages)
500
+ puts response["content"]
501
+ ```
502
+
503
+ Response:
504
+
505
+ > I've got it! Two small Hawaiian pizzas on the way! That'll be $20 for two small pizzas. Same deal, come back in 10 minutes to pick them up, and bring cash for the payment. See you soon!
506
+
507
+ ### Pizza customer agent
508
+
509
+ Oh my. Let's also have an agent that represents the customer.
510
+
511
+ ```ruby
512
+ @customer_message = <<~EOS
513
+ You are a customer at a pizza store.
514
+
515
+ You want to order a pizza. You can ask about the menu, prices, sizes, and pickup times.
516
+
517
+ You'll agree with the price and terms of the pizza order.
518
+
519
+ You'll make a choice of the available options.
520
+
521
+ If you're first in the conversation, you'll say hello and ask about the menu.
522
+ EOS
523
+
524
+ def chat_pizza_customer(messages)
525
+ @client.chat([
526
+ System(@customer_message),
527
+ *messages
528
+ ])
529
+ end
530
+ ```
531
+
532
+ First interaction starts with no user or assistant messages. We're generating the customer's first message:
533
+
534
+ ```ruby
535
+ customer_messages = []
536
+ response = chat_pizza_customer(customer_messages)
537
+ puts response["content"]
538
+ ```
539
+
540
+ Customer's first message:
541
+
542
+ > Hello! I'd like to order a pizza. Could you tell me more about the menu and prices? What kind of pizzas do you have available?
543
+
544
+ Now we need to pass this to the pizzeria agent:
545
+
546
+ ```ruby
547
+ customer_message = response["content"]
548
+ pizzeria_messages = [U(customer_message)]
549
+ response = chat_pizza_agent(pizzeria_messages)
550
+ puts response["content"]
551
+ ```
552
+
553
+ Pizzeria agent response:
554
+
555
+ > Hi there! Yeah, sure thing! We've got two delicious options to choose from: Hawaiian and Pepperoni. Both come in small and large sizes. The small pizzas are $10 and the large pizzas are $20.
556
+ >
557
+ > Our Hawaiian pizza features fresh ham and pineapple on a bed of melted mozzarella. And if you're in the mood for something classic, our Pepperoni pizza is loaded with plenty of sliced pepperoni and melted mozzarella cheese.
558
+
559
+ Now let's add this response to the customer agent's message array, and generate the customer's next response to the pizzera:
560
+
561
+ ```ruby
562
+ customer_messages << U(response["content"])
563
+ response = chat_pizza_customer(customer_messages)
564
+ puts response["content"]
565
+ ```
566
+
567
+ Customer agent response:
568
+
569
+ > Wow, those both sound delicious! I'm intrigued by the Hawaiian combo, I never thought of putting ham and pineapple on a pizza before. How would you recommend I customize it? Can I add any extra toppings or keep it as is? And do you have any recommendations for the size? Small or large?
570
+
571
+ Add this to the pizzeria agent's message array, and generate the pizzeria's response:
572
+
573
+ ```ruby
574
+ pizzeria_messages << U(response["content"])
575
+ response = chat_pizza_agent(pizzeria_messages)
576
+ puts response["content"]
577
+ ```
578
+
579
+ Pizzeria agent response:
580
+
581
+ > The Hawaiian pizza is definitely a unique twist on traditional toppings! You can definitely customize it to your liking. We allow two extra toppings of your choice for an additional $1 each. If you want to add any other toppings beyond that, it's $2 per topping.
582
+ >
583
+ > As for recommends, I'd say the small size is a great starting point, especially if you're trying something new like the Hawaiian pizza. The small size is $10 and it's a great bite-sized portion. But if you're looking for a bigger pie, the large size is $20 and would be a great option if you're feeding a crowd or want leftovers.
584
+ >
585
+ > Keep in mind that our pizzas are cooked fresh in 10 minutes, so it's ready when it's ready! Would you like to place an order now?
586
+
587
+ Will the customer actually buy anything now?
588
+
589
+ > I think I'd like to go with the Hawaiian pizza in the small size, so the total would be $10. And I'll take advantage of the extra topping option. I think I'll add some mushrooms to it. So, that's an extra $1 for the mushroom topping. Would that be $11 total? And do you have a pickup time available soon?
590
+
591
+ OMG, the customer bought something.
592
+
593
+ Pizzeria agent response:
594
+
595
+ > That sounds like a great choice! Yeah, the total would be $11, the small Hawaiian pizza with mushrooms. And yes, we do have pickup available shortly. It'll be ready in about 10 minutes. Cash on pickup, okay? Would you like to pay when you pick up your pizza?
596
+
597
+ Maybe these two do not know how to stop talking. The Halting Problem exists in pizza shops too.
598
+
  ## Development

  After checking out the repo, run `bin/setup` to install dependencies. Then, run `rake test` to run the tests. You can also run `bin/console` for an interactive prompt that will allow you to experiment.
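
As an aside on the pizzeria example in the README diff above: the walkthrough alternates the two agents by hand, and the same exchange can be wired into a loop. A minimal sketch, assuming the `chat_pizza_customer`/`chat_pizza_agent` functions and the `U`/`A` helpers shown there; the four-turn cap is an arbitrary stand-in for a real stopping condition:

```ruby
customer_messages = []
pizzeria_messages = []

4.times do
  # The customer speaks: an assistant message in their own history,
  # and a user message in the pizzeria's history.
  line = chat_pizza_customer(customer_messages)["content"]
  customer_messages << A(line)
  pizzeria_messages << U(line)

  # The pizzeria replies: the roles are mirrored the other way around.
  line = chat_pizza_agent(pizzeria_messages)["content"]
  pizzeria_messages << A(line)
  customer_messages << U(line)
end
```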
data/examples/README.md ADDED
@@ -0,0 +1,61 @@
+ # Examples
+
+ ## User Chat
+
+ Chat with a pre-defined agent using the following command:
+
+ ```bash
+ bundle exec examples/groq-user-chat.rb
+ # or
+ bundle exec examples/groq-user-chat.rb --agent-prompt examples/agent-prompts/helloworld.yml
+ ```
+
+ There are two example agent prompts available:
+
+ - `examples/agent-prompts/helloworld.yml` (the default)
+ - `examples/agent-prompts/pizzeria-sales.yml`
+
+ At the prompt, either talk to the AI agent, or use one of these special commands:
+
+ - `exit` to exit the conversation
+ - `summary` to get a summary of the conversation so far
+
+ ### Streaming
+
+ There is also an example of streaming the conversation to the terminal as it is received from the Groq API.
+
+ It defaults to the slower `llama3-70b-8192` model so that the streaming is more noticeable.
+
+ ```bash
+ bundle exec examples/groq-user-chat-streaming.rb --agent-prompt examples/agent-prompts/pizzeria-sales.yml
+ ```
+
+ ### Pizzeria
+
+ Run the pizzeria example with the following command:
+
+ ```bash
+ bundle exec examples/groq-user-chat.rb --agent-prompt examples/agent-prompts/pizzeria-sales.yml
+ ```
+
+ > 🍕 Hello! Thank you for calling our pizzeria. I'm happy to help you with your inquiry. Do you have a question about our menu or would you like to place an order?
+ >
+ > 😋 What's your cheapest?
+ >
+ > 🍕 Our cheapest menu item is the Garlic Knots (6 pieces), which is served with marinara sauce and priced at $5. Would you like to add it to your order or is there something else that catches your attention?
+ >
+ > 😋 Yes. And a cheap pizza.
+ >
+ > 🍕 Our cheapest pizza option is the Veggie Pizza. It comes with tomato sauce, mozzarella, and a mix of bell peppers, onions, and mushrooms, all for $13. Would you like to order the Veggie Pizza and the Garlic Knots together?
+ >
+ > 😋 Yep
+ >
+ > 🍕 I've taken note of your order. You're ordering a Veggie Pizza and 6 Garlic Knots. Your total comes out to be $18. Would you like to pay by credit card, cash, or take any other payment method?
+ >
+ > 😋 summary
+ >
+ > Here is a summary of the current conversation:
+ >
+ > The conversation started with a customer calling the pizzeria and speaking with an AI assistant. The assistant offered to help with menu inquiries or taking an order. The customer was considering ordering from the menu. The assistant presented the cheapest menu item, Garlic Knots, and asked if the customer wanted to add it to their order. The customer may have been interested in other options as well. The assistant then presented the cheapest pizza option, the Veggie Pizza, and asked if the customer wanted to order it along with the Garlic Knots. The customer agreed, and the assistant took note of the order, which consisted of a Veggie Pizza and 6 Garlic Knots for a total of $18. The assistant asked how the customer would like to pay for their order.
+ >
+ > 😋 exit
data/examples/agent-prompts/helloworld.yml ADDED
@@ -0,0 +1,6 @@
+ ---
+ system: |-
+   I am a friendly agent who always replies to any prompt
+   with a pleasant "Hello" and wishing them well.
+ agent_emoji: "🤖"
+ user_emoji: "👤"
data/examples/agent-prompts/pizzeria-sales.yml ADDED
@@ -0,0 +1,19 @@
+ ---
+ system: |-
+   You are a phone operator at a busy pizzeria. Your responsibilities include answering calls and online chats from customers who may ask about the menu, wish to place or change orders, or inquire about opening hours.
+
+   Here are some of our popular menu items:
+
+   <menu>
+   Margherita Pizza: Classic with tomato sauce, mozzarella, and basil - $12
+   Pepperoni Pizza: Tomato sauce, mozzarella, and a generous layer of pepperoni - $14
+   Veggie Pizza: Tomato sauce, mozzarella, and a mix of bell peppers, onions, and mushrooms - $13
+   BBQ Chicken Pizza: BBQ sauce, chicken, onions, and cilantro - $15
+   Garlic Knots (6 pieces): Served with marinara sauce - $5
+   Cannoli: Classic Sicilian dessert filled with sweet ricotta cream - $4 each
+   </menu>
+
+   Your goal is to provide accurate information, confirm order details, and ensure a pleasant customer experience. Please maintain a polite and professional tone, be prompt in your responses, and ensure accuracy in order transmission.
+ agent_emoji: "🍕"
+ user_emoji: "😋"
+ can_go_first: true
data/examples/groq-user-chat-streaming.rb ADDED
@@ -0,0 +1,132 @@
+ #!/usr/bin/env ruby
+
+ require "optparse"
+ require "groq"
+ require "yaml"
+
+ include Groq::Helpers
+
+ @options = {
+   model: "llama3-70b-8192",
+   agent_prompt_path: File.join(File.dirname(__FILE__), "agent-prompts/helloworld.yml"),
+   timeout: 20
+ }
+ OptionParser.new do |opts|
+   opts.banner = "Usage: ruby script.rb [options]"
+
+   opts.on("-m", "--model MODEL", "Model name") do |v|
+     @options[:model] = v
+   end
+
+   opts.on("-a", "--agent-prompt PATH", "Path to agent prompt file") do |v|
+     @options[:agent_prompt_path] = v
+   end
+
+   opts.on("-t", "--timeout TIMEOUT", "Timeout in seconds") do |v|
+     @options[:timeout] = v.to_i
+   end
+
+   opts.on("-d", "--debug", "Enable debug mode") do |v|
+     @options[:debug] = v
+   end
+ end.parse!
+
+ raise "Missing --model option" if @options[:model].nil?
+ raise "Missing --agent-prompt option" if @options[:agent_prompt_path].nil?
+
+ def debug?
+   @options[:debug]
+ end
+
+ # Read the agent prompt from the file
+ agent_prompt = YAML.load_file(@options[:agent_prompt_path])
+ user_emoji = agent_prompt["user_emoji"]
+ agent_emoji = agent_prompt["agent_emoji"]
+ system_prompt = agent_prompt["system_prompt"] || agent_prompt["system"]
+ can_go_first = agent_prompt["can_go_first"]
+
+ # Initialize the Groq client
+ @client = Groq::Client.new(model_id: @options[:model], request_timeout: @options[:timeout]) do |f|
+   if debug?
+     require "logger"
+
+     # Create a logger instance
+     logger = Logger.new($stdout)
+     logger.level = Logger::DEBUG
+
+     f.response :logger, logger, bodies: true # Log request and response bodies
+   end
+ end
+
+ puts "Welcome to the AI assistant! I'll respond to your queries."
+ puts "You can quit by typing 'exit'."
+
+ def produce_summary(messages)
+   combined = messages.map do |message|
+     if message["role"] == "user"
+       "User: #{message["content"]}"
+     else
+       "Assistant: #{message["content"]}"
+     end
+   end.join("\n")
+   response = @client.chat([
+     S("You are excellent at reading a discourse between a human and an AI assistant and summarising the current conversation."),
+     U("Here is the current conversation:\n\n------\n\n#{combined}")
+   ])
+   puts response["content"]
+ end
+
+ messages = [S(system_prompt)]
+
+ if can_go_first
+   print "#{agent_emoji} "
+   message_bits = []
+   response = @client.chat(messages) do |content|
+     # content == nil on last message; and "" on first message
+     next unless content
+     print(content)
+     message_bits << content
+   end
+   puts
+   messages << A(message_bits.join(""))
+ end
+
+ class MessageBits
+   def initialize(emoji)
+     print "#{emoji} "
+     @bits = []
+   end
+
+   def call(content)
+     if content.nil?
+       puts
+     else
+       print(content)
+       @bits << content
+     end
+   end
+
+   def to_assistant_message
+     Assistant(@bits.join(""))
+   end
+ end
+
+ loop do
+   print "#{user_emoji} "
+   user_input = gets.chomp
+
+   break if user_input.downcase == "exit"
+
+   # produce summary
+   if user_input.downcase == "summary"
+     produce_summary(messages)
+     next
+   end
+
+   messages << U(user_input)
+
+   # Use Groq to generate a response
+   message_bits = MessageBits.new(agent_emoji)
+   @client.chat(messages, stream: message_bits)
+   messages << message_bits.to_assistant_message
+ end
data/examples/groq-user-chat.rb ADDED
@@ -0,0 +1,109 @@
+ #!/usr/bin/env ruby
+
+ require "optparse"
+ require "groq"
+ require "yaml"
+
+ include Groq::Helpers
+
+ @options = {
+   model: "llama3-8b-8192",
+   # model: "llama3-70b-8192",
+   agent_prompt_path: File.join(File.dirname(__FILE__), "agent-prompts/helloworld.yml"),
+   timeout: 20
+ }
+ OptionParser.new do |opts|
+   opts.banner = "Usage: ruby script.rb [options]"
+
+   opts.on("-m", "--model MODEL", "Model name") do |v|
+     @options[:model] = v
+   end
+
+   opts.on("-a", "--agent-prompt PATH", "Path to agent prompt file") do |v|
+     @options[:agent_prompt_path] = v
+   end
+
+   opts.on("-t", "--timeout TIMEOUT", "Timeout in seconds") do |v|
+     @options[:timeout] = v.to_i
+   end
+
+   opts.on("-d", "--debug", "Enable debug mode") do |v|
+     @options[:debug] = v
+   end
+ end.parse!
+
+ raise "Missing --model option" if @options[:model].nil?
+ raise "Missing --agent-prompt option" if @options[:agent_prompt_path].nil?
+
+ def debug?
+   @options[:debug]
+ end
+
+ # Read the agent prompt from the file
+ agent_prompt = YAML.load_file(@options[:agent_prompt_path])
+ user_emoji = agent_prompt["user_emoji"]
+ agent_emoji = agent_prompt["agent_emoji"]
+ system_prompt = agent_prompt["system_prompt"] || agent_prompt["system"]
+ can_go_first = agent_prompt["can_go_first"]
+
+ # Initialize the Groq client
+ @client = Groq::Client.new(model_id: @options[:model], request_timeout: @options[:timeout]) do |f|
+   if debug?
+     require "logger"
+
+     # Create a logger instance
+     logger = Logger.new($stdout)
+     logger.level = Logger::DEBUG
+
+     f.response :logger, logger, bodies: true # Log request and response bodies
+   end
+ end
+
+ puts "Welcome to the AI assistant! I'll respond to your queries."
+ puts "You can quit by typing 'exit'."
+
+ def produce_summary(messages)
+   combined = messages.map do |message|
+     if message["role"] == "user"
+       "User: #{message["content"]}"
+     else
+       "Assistant: #{message["content"]}"
+     end
+   end.join("\n")
+   response = @client.chat([
+     S("You are excellent at reading a discourse between a human and an AI assistant and summarising the current conversation."),
+     U("Here is the current conversation:\n\n------\n\n#{combined}")
+   ])
+   puts response["content"]
+ end
+
+ messages = [S(system_prompt)]
+
+ if can_go_first
+   response = @client.chat(messages)
+   puts "#{agent_emoji} #{response["content"]}"
+   messages << response
+ end
+
+ loop do
+   print "#{user_emoji} "
+   user_input = gets.chomp
+
+   break if user_input.downcase == "exit"
+
+   # produce summary
+   if user_input.downcase == "summary"
+     produce_summary(messages)
+     next
+   end
+
+   messages << U(user_input)
+
+   # Use Groq to generate a response
+   response = @client.chat(messages)
+
+   message = response.dig("content")
+   puts "#{agent_emoji} #{message}"
+
+   messages << response
+ end
data/lib/groq/client.rb CHANGED
@@ -7,6 +7,7 @@ class Groq::Client
      model_id
      max_tokens
      temperature
+     request_timeout
    ].freeze
    attr_reader(*CONFIG_KEYS, :faraday_middleware)

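The new `request_timeout` config key feeds the Faraday `timeout` option set in `#client` below. A minimal sketch of passing it (in seconds, mirroring the bundled example scripts):

```ruby
@client = Groq::Client.new(model_id: "llama3-8b-8192", request_timeout: 30)
```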
@@ -21,8 +22,7 @@ class Groq::Client
      @faraday_middleware = faraday_middleware
    end

-   # TODO: support stream: true; or &stream block
-   def chat(messages, model_id: nil, tools: nil, max_tokens: nil, temperature: nil, json: false)
+   def chat(messages, model_id: nil, tools: nil, tool_choice: nil, max_tokens: nil, temperature: nil, json: false, stream: nil, &stream_chunk)
      unless messages.is_a?(Array) || messages.is_a?(String)
        raise ArgumentError, "require messages to be an Array or String"
      end
@@ -33,45 +33,128 @@ class Groq::Client

      model_id ||= @model_id

+     if stream_chunk ||= stream
+       require "event_stream_parser"
+     end
+
      body = {
        model: model_id,
        messages: messages,
        tools: tools,
+       tool_choice: tool_choice,
        max_tokens: max_tokens || @max_tokens,
        temperature: temperature || @temperature,
-       response_format: json ? {type: "json_object"} : nil
+       response_format: json ? {type: "json_object"} : nil,
+       stream_chunk: stream_chunk
      }.compact
      response = post(path: "/openai/v1/chat/completions", body: body)
-     if response.status == 200
-       response.body.dig("choices", 0, "message")
-     else
-       # TODO: send the response.body back in Error object
-       puts "Error: #{response.status}"
-       pp response.body
-       raise Error, "Request failed with status #{response.status}: #{response.body}"
+     # Configured to raise exceptions on 4xx/5xx responses
+     if response.body.is_a?(Hash)
+       return response.body.dig("choices", 0, "message")
      end
+     response.body
    end

    def get(path:)
-     client.get do |req|
-       req.url path
-       req.headers["Authorization"] = "Bearer #{@api_key}"
+     client.get(path) do |req|
+       req.headers = headers
      end
    end

    def post(path:, body:)
-     client.post do |req|
-       req.url path
-       req.headers["Authorization"] = "Bearer #{@api_key}"
-       req.body = body
+     client.post(path) do |req|
+       configure_json_post_request(req, body)
      end
    end

    def client
-     @client ||= Faraday.new(url: @api_url) do |f|
-       f.request :json # automatically encode the request body as JSON
-       f.response :json # automatically decode JSON responses
-       f.adapter Faraday.default_adapter
+     @client ||= begin
+       connection = Faraday.new(url: @api_url) do |f|
+         f.request :json # automatically encode the request body as JSON
+         f.response :json # automatically decode JSON responses
+         f.response :raise_error # raise exceptions on 4xx/5xx responses
+
+         f.adapter Faraday.default_adapter
+         f.options[:timeout] = request_timeout
+       end
+       @faraday_middleware&.call(connection)
+
+       connection
+     end
+   end
+
+   private
+
+   def headers
+     {
+       "Authorization" => "Bearer #{@api_key}",
+       "User-Agent" => "groq-ruby/#{Groq::VERSION}"
+     }
+   end
+
+   #
+   # Code/ideas borrowed from lib/openai/http.rb in https://github.com/alexrudall/ruby-openai/
+   #
+
+   def configure_json_post_request(req, body)
+     req_body = body.dup
+
+     if body[:stream_chunk].respond_to?(:call)
+       req.options.on_data = to_json_stream(user_proc: body[:stream_chunk])
+       req_body[:stream] = true # Tell Groq to stream
+       req_body.delete(:stream_chunk)
+     elsif body[:stream_chunk]
+       raise ArgumentError, "The stream_chunk parameter must be a Proc or have a #call method"
+     end
+
+     req.headers = headers
+     req.body = req_body
+   end
+
+   # Given a proc, returns an outer proc that can be used to iterate over a JSON stream of chunks.
+   # For each chunk, the inner user_proc is called giving it the JSON object. The JSON object could
+   # be a data object or an error object as described in the OpenAI API documentation.
+   #
+   # @param user_proc [Proc] The inner proc to call for each JSON object in the chunk.
+   # @return [Proc] An outer proc that iterates over a raw stream, converting it to JSON.
+   def to_json_stream(user_proc:)
+     parser = EventStreamParser::Parser.new
+
+     proc do |chunk, _bytes, env|
+       if env && env.status != 200
+         raise_error = Faraday::Response::RaiseError.new
+         raise_error.on_complete(env.merge(body: try_parse_json(chunk)))
+       end
+
+       parser.feed(chunk) do |_type, data|
+         next if data == "[DONE]"
+         chunk = JSON.parse(data)
+         delta = chunk.dig("choices", 0, "delta")
+         content = delta.dig("content")
+         if user_proc.is_a?(Proc)
+           # if user_proc takes one argument, pass the content
+           if user_proc.arity == 1
+             user_proc.call(content)
+           else
+             user_proc.call(content, chunk)
+           end
+         elsif user_proc.respond_to?(:call)
+           # if the call method takes one argument, pass the content
+           if user_proc.method(:call).arity == 1
+             user_proc.call(content)
+           else
+             user_proc.call(content, chunk)
+           end
+         else
+           raise ArgumentError, "The stream_chunk parameter must be a Proc or have a #call method"
+         end
+       end
      end
    end
+
+   def try_parse_json(maybe_json)
+     JSON.parse(maybe_json)
+   rescue JSON::ParserError
+     maybe_json
+   end
  end
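
The reworked `chat` also gains a `tool_choice:` keyword that is passed through in the request body alongside `tools:`. A sketch of forcing a specific function call; the `tools` array and the `get_weather_report` name are hypothetical, and the value shape assumes the OpenAI-compatible format the Groq API accepts:

```ruby
@client.chat(
  [U("What's the weather in Brisbane, QLD?")],
  tools: tools, # an OpenAI-style function list, as in the README's Tools/Functions section
  tool_choice: {type: "function", function: {name: "get_weather_report"}}
)
```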
data/lib/groq/helpers.rb CHANGED
@@ -13,7 +13,10 @@ module Groq::Helpers
    end
    alias_method :Assistant, :A

-   def S(content)
+   def S(content, json_schema: nil)
+     if json_schema
+       content += "\nJSON must use schema: #{json_schema}"
+     end
      {role: "system", content: content}
    end
    alias_method :System, :S
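
As the diff shows, the helper simply appends the interpolated schema to the system content. A quick illustration (the hash inspect format in the interpolation varies across Ruby versions):

```ruby
include Groq::Helpers

S("Extract the person's details", json_schema: {name: "string"})
# => {role: "system", content: "Extract the person's details\nJSON must use schema: {name: \"string\"}"}
```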
data/lib/groq/version.rb CHANGED
@@ -1,5 +1,5 @@
  # frozen_string_literal: true

  module Groq
-   VERSION = "0.2.0"
+   VERSION = "0.3.0"
  end
metadata CHANGED
@@ -1,14 +1,14 @@
  --- !ruby/object:Gem::Specification
  name: groq
  version: !ruby/object:Gem::Version
-   version: 0.2.0
+   version: 0.3.0
  platform: ruby
  authors:
  - Dr Nic Williams
  autorequire:
  bindir: exe
  cert_chain: []
- date: 2024-04-20 00:00:00.000000000 Z
+ date: 2024-04-25 00:00:00.000000000 Z
  dependencies:
  - !ruby/object:Gem::Dependency
    name: faraday
@@ -52,6 +52,20 @@ dependencies:
    - - ">"
      - !ruby/object:Gem::Version
        version: '5'
+ - !ruby/object:Gem::Dependency
+   name: event_stream_parser
+   requirement: !ruby/object:Gem::Requirement
+     requirements:
+     - - "~>"
+       - !ruby/object:Gem::Version
+         version: '1.0'
+   type: :runtime
+   prerelease: false
+   version_requirements: !ruby/object:Gem::Requirement
+     requirements:
+     - - "~>"
+       - !ruby/object:Gem::Version
+         version: '1.0'
  - !ruby/object:Gem::Dependency
    name: vcr
    requirement: !ruby/object:Gem::Requirement
@@ -80,6 +94,20 @@ dependencies:
    - - "~>"
      - !ruby/object:Gem::Version
        version: '3.0'
+ - !ruby/object:Gem::Dependency
+   name: dry-schema
+   requirement: !ruby/object:Gem::Requirement
+     requirements:
+     - - "~>"
+       - !ruby/object:Gem::Version
+         version: '1.13'
+   type: :development
+   prerelease: false
+   version_requirements: !ruby/object:Gem::Requirement
+     requirements:
+     - - "~>"
+       - !ruby/object:Gem::Version
+         version: '1.13'
  description: Client library for Groq API for fast LLM inference.
  email:
  - drnicwilliams@gmail.com
@@ -94,6 +122,11 @@ files:
  - README.md
  - Rakefile
  - docs/images/groq-speed-price-20240421.png
+ - examples/README.md
+ - examples/agent-prompts/helloworld.yml
+ - examples/agent-prompts/pizzeria-sales.yml
+ - examples/groq-user-chat-streaming.rb
+ - examples/groq-user-chat.rb
  - lib/groq-ruby.rb
  - lib/groq.rb
  - lib/groq/client.rb