gemini-ai 2.1.0 → 3.0.0
- checksums.yaml +4 -4
- data/Gemfile.lock +1 -1
- data/README.md +352 -24
- data/components/errors.rb +26 -0
- data/controllers/client.rb +32 -19
- data/ports/dsl/gemini-ai/errors.rb +5 -0
- data/static/gem.rb +1 -1
- data/tasks/generate-readme.clj +1 -1
- data/template.md +336 -20
- metadata +5 -3
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: 80621f6cd2de526141e994a339d645b53492f6ece960955bc56f2e7430be1d0c
+  data.tar.gz: b627386a0dacc899112e0b08806f63b571dd980f2fb749df7681d7c70656d707
 SHA512:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: f5425c3239dac6eca23cccdb2483f90bea23500b0830b865e8acc0fe235cf2ad3cb001d110e7a6ec0cbb59ed5c2f65e74f883edef961cb675b76e7d6117d88fc
+  data.tar.gz: 041122dc5a6d1d596411ea697503aa8d7846b4d2bc6c493b8b7d9102033c9ef3226957340d8caeb443751f9f4f465624a80ef1749e0f5893833e4c72d43ba907
data/Gemfile.lock
CHANGED
data/README.md
CHANGED
@@ -9,7 +9,7 @@ A Ruby Gem for interacting with [Gemini](https://deepmind.google/technologies/ge
 ## TL;DR and Quick Start
 
 ```ruby
-gem 'gemini-ai', '~>
+gem 'gemini-ai', '~> 3.0.0'
 ```
 
 ```ruby
@@ -21,7 +21,7 @@ client = Gemini.new(
     service: 'generative-language-api',
     api_key: ENV['GOOGLE_API_KEY']
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With a Service Account Credentials File
@@ -31,7 +31,7 @@ client = Gemini.new(
     file_path: 'google-credentials.json',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With Application Default Credentials
@@ -40,7 +40,7 @@ client = Gemini.new(
     service: 'vertex-ai-api',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 result = client.stream_generate_content({
@@ -81,13 +81,25 @@ Result:
     - [Required Data](#required-data)
 - [Usage](#usage)
   - [Client](#client)
-  - [
-  - [
-
-
+  - [Methods](#methods)
+    - [stream_generate_content](#stream_generate_content)
+      - [Receiving Stream Events](#receiving-stream-events)
+      - [Without Events](#without-events)
+    - [generate_content](#generate_content)
+  - [Modes](#modes)
+    - [Text](#text)
+    - [Image](#image)
+    - [Video](#video)
+  - [Streaming vs. Server-Sent Events (SSE)](#streaming-vs-server-sent-events-sse)
+    - [Server-Sent Events (SSE) Hang](#server-sent-events-sse-hang)
+    - [Non-Streaming](#non-streaming)
   - [Back-and-Forth Conversations](#back-and-forth-conversations)
   - [Tools (Functions) Calling](#tools-functions-calling)
   - [New Functionalities and APIs](#new-functionalities-and-apis)
+  - [Error Handling](#error-handling)
+    - [Rescuing](#rescuing)
+    - [For Short](#for-short)
+    - [Errors](#errors)
 - [Development](#development)
   - [Purpose](#purpose)
   - [Publish to RubyGems](#publish-to-rubygems)
@@ -100,15 +112,20 @@ Result:
 ### Installing
 
 ```sh
-gem install gemini-ai -v
+gem install gemini-ai -v 3.0.0
 ```
 
 ```sh
-gem 'gemini-ai', '~>
+gem 'gemini-ai', '~> 3.0.0'
 ```
 
 ### Credentials
 
+- [Option 1: API Key (Generative Language API)](#option-1-api-key-generative-language-api)
+- [Option 2: Service Account Credentials File (Vertex AI API)](#option-2-service-account-credentials-file-vertex-ai-api)
+- [Option 3: Application Default Credentials (Vertex AI API)](#option-3-application-default-credentials-vertex-ai-api)
+- [Required Data](#required-data)
+
 > ⚠️ DISCLAIMER: Be careful with what you are doing, and never trust others' code related to this. These commands and instructions alter the level of access to your Google Cloud Account, and running them naively can lead to security risks as well as financial risks. People with access to your account can use it to steal data or incur charges. Run these commands at your own responsibility and due diligence; expect no warranties from the contributors of this project.
 
 #### Option 1: API Key (Generative Language API)
@@ -266,7 +283,7 @@ client = Gemini.new(
     service: 'generative-language-api',
     api_key: ENV['GOOGLE_API_KEY']
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With a Service Account Credentials File
@@ -276,7 +293,7 @@ client = Gemini.new(
     file_path: 'google-credentials.json',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With Application Default Credentials
@@ -285,13 +302,118 @@ client = Gemini.new(
     service: 'vertex-ai-api',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
+)
+```
+
+### Methods
+
+#### stream_generate_content
+
+##### Receiving Stream Events
+
+Ensure that you have enabled [Server-Sent Events](#streaming-vs-server-sent-events-sse) before using blocks for streaming:
+
+```ruby
+client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+) do |event, parsed, raw|
+  puts event
+end
+```
+
+Event:
+```ruby
+{ 'candidates' =>
+  [{ 'content' => {
+       'role' => 'model',
+       'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+     },
+     'finishReason' => 'STOP',
+     'safetyRatings' =>
+     [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+  'usageMetadata' => {
+    'promptTokenCount' => 2,
+    'candidatesTokenCount' => 8,
+    'totalTokenCount' => 10
+  } }
+```
+
+##### Without Events
+
+You can use `stream_generate_content` without events:
+
+```ruby
+result = client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+)
+```
+
+In this case, the result will be an array with all the received events:
+
+```ruby
+[{ 'candidates' =>
+   [{ 'content' => {
+        'role' => 'model',
+        'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+      },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => {
+     'promptTokenCount' => 2,
+     'candidatesTokenCount' => 8,
+     'totalTokenCount' => 10
+   } }]
+```
+
+You can mix both as well:
+```ruby
+result = client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+) do |event, parsed, raw|
+  puts event
+end
+```
+
+#### generate_content
+
+```ruby
+result = client.generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
 )
 ```
 
-
+Result:
+```ruby
+{ 'candidates' =>
+  [{ 'content' => { 'parts' => [{ 'text' => 'Hello! How can I assist you today?' }], 'role' => 'model' },
+     'finishReason' => 'STOP',
+     'index' => 0,
+     'safetyRatings' =>
+     [{ 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+  'promptFeedback' =>
+  { 'safetyRatings' =>
+    [{ 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+     { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+     { 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+     { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] } }
+```
 
-
+As of the writing of this README, only the `generative-language-api` service supports the `generate_content` method; `vertex-ai-api` does not.
+
+### Modes
+
+#### Text
 
 ```ruby
 result = client.stream_generate_content({
@@ -319,13 +441,132 @@ Result:
 } }]
 ```
 
-####
+#### Image
+
+![A black and white image of an old piano. The piano is an upright model, with the keys on the right side of the image. The piano is sitting on a tiled floor. There is a small round object on the top of the piano.](https://raw.githubusercontent.com/gbaptista/assets/main/gemini-ai/piano.jpg)
+
+> _Courtesy of [Unsplash](https://unsplash.com/photos/greyscale-photo-of-grand-piano-czPs0z3-Ggg)_
+
+Switch to the `gemini-pro-vision` model:
+
+```ruby
+client = Gemini.new(
+  credentials: { service: 'vertex-ai-api', region: 'us-east4' },
+  options: { model: 'gemini-pro-vision', server_sent_events: true }
+)
+```
+
+Then, encode the image as [Base64](https://en.wikipedia.org/wiki/Base64) and add its [MIME type](https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types):
+
+```ruby
+require 'base64'
+
+result = client.stream_generate_content(
+  { contents: [
+    { role: 'user', parts: [
+      { text: 'Please describe this image.' },
+      { inline_data: {
+        mime_type: 'image/jpeg',
+        data: Base64.strict_encode64(File.read('piano.jpg'))
+      } }
+    ] }
+  ] }
+)
+```
+
+The result:
+```ruby
+[{ 'candidates' =>
+   [{ 'content' =>
+      { 'role' => 'model',
+        'parts' =>
+        [{ 'text' =>
+           ' A black and white image of an old piano. The piano is an upright model, with the keys on the right side of the image. The piano is' }] },
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }] },
+ { 'candidates' =>
+   [{ 'content' => { 'role' => 'model', 'parts' => [{ 'text' => ' sitting on a tiled floor. There is a small round object on the top of the piano.' }] },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => { 'promptTokenCount' => 263, 'candidatesTokenCount' => 50, 'totalTokenCount' => 313 } }]
+```
+
+#### Video
+
+https://gist.github.com/assets/29520/f82bccbf-02d2-4899-9c48-eb8a0a5ef741
+
+> ALT: A white and gold cup is being filled with coffee. The coffee is dark and rich. The cup is sitting on a black surface. The background is blurred.
+
+> _Courtesy of [Pexels](https://www.pexels.com/video/pouring-of-coffee-855391/)_
+
+Switch to the `gemini-pro-vision` model:
 
-
+```ruby
+client = Gemini.new(
+  credentials: { service: 'vertex-ai-api', region: 'us-east4' },
+  options: { model: 'gemini-pro-vision', server_sent_events: true }
+)
+```
+
+Then, encode the video as [Base64](https://en.wikipedia.org/wiki/Base64) and add its [MIME type](https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types):
+
+```ruby
+require 'base64'
+
+result = client.stream_generate_content(
+  { contents: [
+    { role: 'user', parts: [
+      { text: 'Please describe this video.' },
+      { inline_data: {
+        mime_type: 'video/mp4',
+        data: Base64.strict_encode64(File.read('coffee.mp4'))
+      } }
+    ] }
+  ] }
+)
+```
+
+The result:
+```ruby
+[{"candidates"=>
+   [{"content"=>
+      {"role"=>"model",
+       "parts"=>
+       [{"text"=>
+          " A white and gold cup is being filled with coffee. The coffee is dark and rich. The cup is sitting on a black surface. The background is blurred"}]},
+     "safetyRatings"=>
+      [{"category"=>"HARM_CATEGORY_HARASSMENT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_HATE_SPEECH", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_SEXUALLY_EXPLICIT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_DANGEROUS_CONTENT", "probability"=>"NEGLIGIBLE"}]}],
+  "usageMetadata"=>{"promptTokenCount"=>1037, "candidatesTokenCount"=>31, "totalTokenCount"=>1068}},
+ {"candidates"=>
+   [{"content"=>{"role"=>"model", "parts"=>[{"text"=>"."}]},
+     "finishReason"=>"STOP",
+     "safetyRatings"=>
+      [{"category"=>"HARM_CATEGORY_HARASSMENT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_HATE_SPEECH", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_SEXUALLY_EXPLICIT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_DANGEROUS_CONTENT", "probability"=>"NEGLIGIBLE"}]}],
+  "usageMetadata"=>{"promptTokenCount"=>1037, "candidatesTokenCount"=>32, "totalTokenCount"=>1069}}]
+```
+
+### Streaming vs. Server-Sent Events (SSE)
+
+[Server-Sent Events (SSE)](https://en.wikipedia.org/wiki/Server-sent_events) is a technology that allows certain endpoints to offer streaming capabilities, such as creating the impression that "the model is typing along with you," rather than delivering the entire answer all at once.
+
+You can set up the client to use Server-Sent Events (SSE) for all supported endpoints:
 ```ruby
 client = Gemini.new(
   credentials: { ... },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 ```
 
@@ -333,11 +574,11 @@ Or, you can decide on a request basis:
 ```ruby
 client.stream_generate_content(
   { contents: { role: 'user', parts: { text: 'hi!' } } },
-
+  server_sent_events: true
 )
 ```
 
-With
+With Server-Sent Events (SSE) enabled, you can use a block to receive partial results via events. This feature is particularly useful for methods that offer streaming capabilities, such as `stream_generate_content`:
 
 ```ruby
 client.stream_generate_content(
@@ -367,14 +608,16 @@ Event:
 } }
 ```
 
-
+Even though streaming methods utilize Server-Sent Events (SSE), using this feature doesn't necessarily mean streaming data. For example, when `generate_content` is called with SSE enabled, you will receive all the data at once in a single event, rather than through multiple partial events. This occurs because `generate_content` isn't designed for streaming, even though it is capable of utilizing Server-Sent Events.
+
+#### Server-Sent Events (SSE) Hang
 
-Method calls will _hang_ until the
+Method calls will _hang_ until the server-sent events finish, so even without providing a block, you can obtain the final results of the received events:
 
 ```ruby
 result = client.stream_generate_content(
   { contents: { role: 'user', parts: { text: 'hi!' } } },
-
+  server_sent_events: true
 )
 ```
 
@@ -398,6 +641,39 @@ Result:
 } }]
 ```
 
+#### Non-Streaming
+
+Depending on the service, you can use the [`generate_content`](#generate_content) method, which does not stream the answer.
+
+You can also use methods designed for streaming without necessarily processing partial events; instead, you can wait for the result of all received events:
+
+```ruby
+result = client.stream_generate_content({
+  contents: { role: 'user', parts: { text: 'hi!' } },
+  server_sent_events: false
+})
+```
+
+Result:
+```ruby
+[{ 'candidates' =>
+   [{ 'content' => {
+        'role' => 'model',
+        'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+      },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => {
+     'promptTokenCount' => 2,
+     'candidatesTokenCount' => 8,
+     'totalTokenCount' => 10
+   } }]
+```
+
 ### Back-and-Forth Conversations
 
 To maintain a back-and-forth conversation, you need to append the received responses and build a history for your requests:
@@ -596,6 +872,58 @@ result = client.request(
 )
 ```
 
+### Error Handling
+
+#### Rescuing
+
+```ruby
+require 'gemini-ai'
+
+begin
+  client.stream_generate_content({
+    contents: { role: 'user', parts: { text: 'hi!' } }
+  })
+rescue Gemini::Errors::GeminiError => error
+  puts error.class # Gemini::Errors::RequestError
+  puts error.message # 'the server responded with status 500'
+
+  puts error.payload
+  # { contents: [{ role: 'user', parts: { text: 'hi!' } }],
+  #   generationConfig: { candidateCount: 1 },
+  #   ...
+  # }
+
+  puts error.request
+  # #<Faraday::ServerError response={:status=>500, :headers...
+end
+```
+
+#### For Short
+
+```ruby
+require 'gemini-ai/errors'
+
+begin
+  client.stream_generate_content({
+    contents: { role: 'user', parts: { text: 'hi!' } }
+  })
+rescue GeminiError => error
+  puts error.class # Gemini::Errors::RequestError
+end
+```
+
+#### Errors
+
+```ruby
+GeminiError
+
+MissingProjectIdError
+UnsupportedServiceError
+BlockWithoutServerSentEventsError
+
+RequestError
+```
+
 ## Development
 
 ```bash
@@ -614,7 +942,7 @@ gem build gemini-ai.gemspec
 
 gem signin
 
-gem push gemini-ai-
+gem push gemini-ai-3.0.0.gem
 ```
 
 ### Updating the README
data/components/errors.rb
ADDED
@@ -0,0 +1,26 @@
+# frozen_string_literal: true
+
+module Gemini
+  module Errors
+    class GeminiError < StandardError
+      def initialize(message = nil)
+        super(message)
+      end
+    end
+
+    class MissingProjectIdError < GeminiError; end
+    class UnsupportedServiceError < GeminiError; end
+    class BlockWithoutServerSentEventsError < GeminiError; end
+
+    class RequestError < GeminiError
+      attr_reader :request, :payload
+
+      def initialize(message = nil, request: nil, payload: nil)
+        @request = request
+        @payload = payload
+
+        super(message)
+      end
+    end
+  end
+end
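A minimal sketch of how the error hierarchy added above behaves: every specific error inherits from `Gemini::Errors::GeminiError`, so a single rescue on the base class catches them all, while `RequestError` still carries the failing request and payload. The module below mirrors the new `data/components/errors.rb`; the raised message and payload are illustrative values only.

```ruby
# Mirrors the error classes introduced in 3.0.0 (see the diff above).
module Gemini
  module Errors
    class GeminiError < StandardError; end

    class MissingProjectIdError < GeminiError; end
    class UnsupportedServiceError < GeminiError; end
    class BlockWithoutServerSentEventsError < GeminiError; end

    class RequestError < GeminiError
      attr_reader :request, :payload

      def initialize(message = nil, request: nil, payload: nil)
        @request = request
        @payload = payload
        super(message)
      end
    end
  end
end

begin
  # Illustrative failure: a request error carrying its original payload.
  raise Gemini::Errors::RequestError.new(
    'the server responded with status 500',
    payload: { contents: [{ role: 'user', parts: { text: 'hi!' } }] }
  )
rescue Gemini::Errors::GeminiError => error
  # Rescuing the base class catches every subclass.
  puts error.class # Gemini::Errors::RequestError
  puts error.payload[:contents].first[:role] # user
end
```

Rescuing `GeminiError` is the coarse option; rescuing `RequestError` alone lets other configuration errors (such as `MissingProjectIdError`) propagate.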
data/controllers/client.rb
CHANGED
@@ -5,6 +5,8 @@ require 'faraday'
 require 'json'
 require 'googleauth'
 
+require_relative '../ports/dsl/gemini-ai/errors'
+
 module Gemini
   module Controllers
     class Client
@@ -24,48 +26,57 @@ module Gemini
         end
 
         if @authentication == :service_account || @authentication == :default_credentials
-          @project_id =
-            @authorizer.project_id || @authorizer.quota_project_id
-          else
-            config[:credentials][:project_id]
-          end
+          @project_id = config[:credentials][:project_id] || @authorizer.project_id || @authorizer.quota_project_id
 
-          raise
+          raise MissingProjectIdError, 'Could not determine project_id, which is required.' if @project_id.nil?
         end
 
-        @
+        @service = config[:credentials][:service]
+
+        @address = case @service
                    when 'vertex-ai-api'
                      "https://#{config[:credentials][:region]}-aiplatform.googleapis.com/v1/projects/#{@project_id}/locations/#{config[:credentials][:region]}/publishers/google/models/#{config[:options][:model]}"
                    when 'generative-language-api'
                      "https://generativelanguage.googleapis.com/v1/models/#{config[:options][:model]}"
                    else
-                     raise
+                     raise UnsupportedServiceError, "Unsupported service: #{@service}"
                    end
 
-        @
+        @server_sent_events = config[:options][:server_sent_events]
+      end
+
+      def stream_generate_content(payload, server_sent_events: nil, &callback)
+        request('streamGenerateContent', payload, server_sent_events:, &callback)
       end
 
-      def
-        request('
+      def generate_content(payload, server_sent_events: nil, &callback)
+        result = request('generateContent', payload, server_sent_events:, &callback)
+
+        return result.first if result.is_a?(Array) && result.size == 1
+
+        result
       end
 
-      def request(path, payload,
-
+      def request(path, payload, server_sent_events: nil, &callback)
+        server_sent_events_enabled = server_sent_events.nil? ? @server_sent_events : server_sent_events
         url = "#{@address}:#{path}"
         params = []
 
-        params << 'alt=sse' if
+        params << 'alt=sse' if server_sent_events_enabled
         params << "key=#{@api_key}" if @authentication == :api_key
 
        url += "?#{params.join('&')}" if params.size.positive?
 
-        if !callback.nil? && !
-          raise
+        if !callback.nil? && !server_sent_events_enabled
+          raise BlockWithoutServerSentEventsError,
+                'You are trying to use a block without Server Sent Events (SSE) enabled.'
         end
 
         results = []
 
-        response = Faraday.new
+        response = Faraday.new do |faraday|
+          faraday.response :raise_error
+        end.post do |request|
           request.url url
           request.headers['Content-Type'] = 'application/json'
           if @authentication == :service_account || @authentication == :default_credentials
@@ -74,7 +85,7 @@ module Gemini
 
           request.body = payload.to_json
 
-          if
+          if server_sent_events_enabled
             parser = EventStreamParser::Parser.new
 
             request.options.on_data = proc do |chunk, bytes, env|
@@ -103,9 +114,11 @@ module Gemini
           end
         end
 
-        return safe_parse_json(response.body) unless
+        return safe_parse_json(response.body) unless server_sent_events_enabled
 
         results.map { |result| result[:event] }
+      rescue Faraday::ServerError => e
+        raise RequestError.new(e.message, request: e, payload:)
       end
 
       def safe_parse_json(raw)
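The reworked `request` method above resolves Server-Sent Events with a simple precedence rule: an explicit per-request `server_sent_events:` value wins, and `nil` (not specified) falls back to the client-wide option. A standalone sketch of that rule (the helper name is illustrative, not part of the gem):

```ruby
# Precedence rule from `request` above: a per-request value, when given,
# overrides the client-wide default. `nil` means "not specified", so an
# explicit `false` is a real override, not a fallback.
def effective_server_sent_events(client_default, per_request)
  per_request.nil? ? client_default : per_request
end

puts effective_server_sent_events(true, nil)    # true: falls back to the client option
puts effective_server_sent_events(true, false)  # false: explicit per-request override wins
puts effective_server_sent_events(false, true)  # true: enabled for this request only
```

This is why `nil?` is used rather than `||`: with `||`, a per-request `server_sent_events: false` could never disable SSE on a client that enabled it.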
data/static/gem.rb
CHANGED
@@ -3,7 +3,7 @@
 module Gemini
   GEM = {
     name: 'gemini-ai',
-    version: '
+    version: '3.0.0',
     author: 'gbaptista',
     summary: "Interact with Google's Gemini AI.",
     description: "A Ruby Gem for interacting with Gemini through Vertex AI, Generative Language API, or AI Studio, Google's generative AI services.",
data/tasks/generate-readme.clj
CHANGED
data/template.md
CHANGED
@@ -9,7 +9,7 @@ A Ruby Gem for interacting with [Gemini](https://deepmind.google/technologies/ge
 ## TL;DR and Quick Start
 
 ```ruby
-gem 'gemini-ai', '~>
+gem 'gemini-ai', '~> 3.0.0'
 ```
 
 ```ruby
@@ -21,7 +21,7 @@ client = Gemini.new(
     service: 'generative-language-api',
     api_key: ENV['GOOGLE_API_KEY']
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With a Service Account Credentials File
@@ -31,7 +31,7 @@ client = Gemini.new(
     file_path: 'google-credentials.json',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With Application Default Credentials
@@ -40,7 +40,7 @@ client = Gemini.new(
     service: 'vertex-ai-api',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 result = client.stream_generate_content({
@@ -77,15 +77,20 @@ Result:
 ### Installing
 
 ```sh
-gem install gemini-ai -v
+gem install gemini-ai -v 3.0.0
 ```
 
 ```sh
-gem 'gemini-ai', '~>
+gem 'gemini-ai', '~> 3.0.0'
 ```
 
 ### Credentials
 
+- [Option 1: API Key (Generative Language API)](#option-1-api-key-generative-language-api)
+- [Option 2: Service Account Credentials File (Vertex AI API)](#option-2-service-account-credentials-file-vertex-ai-api)
+- [Option 3: Application Default Credentials (Vertex AI API)](#option-3-application-default-credentials-vertex-ai-api)
+- [Required Data](#required-data)
+
 > ⚠️ DISCLAIMER: Be careful with what you are doing, and never trust others' code related to this. These commands and instructions alter the level of access to your Google Cloud Account, and running them naively can lead to security risks as well as financial risks. People with access to your account can use it to steal data or incur charges. Run these commands at your own responsibility and due diligence; expect no warranties from the contributors of this project.
 
 #### Option 1: API Key (Generative Language API)
@@ -243,7 +248,7 @@ client = Gemini.new(
     service: 'generative-language-api',
     api_key: ENV['GOOGLE_API_KEY']
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With a Service Account Credentials File
@@ -253,7 +258,7 @@ client = Gemini.new(
     file_path: 'google-credentials.json',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 
 # With Application Default Credentials
@@ -262,13 +267,118 @@ client = Gemini.new(
     service: 'vertex-ai-api',
     region: 'us-east4'
   },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
+)
+```
+
+### Methods
+
+#### stream_generate_content
+
+##### Receiving Stream Events
+
+Ensure that you have enabled [Server-Sent Events](#streaming-vs-server-sent-events-sse) before using blocks for streaming:
+
+```ruby
+client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+) do |event, parsed, raw|
+  puts event
+end
+```
+
+Event:
+```ruby
+{ 'candidates' =>
+  [{ 'content' => {
+       'role' => 'model',
+       'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+     },
+     'finishReason' => 'STOP',
+     'safetyRatings' =>
+     [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+  'usageMetadata' => {
+    'promptTokenCount' => 2,
+    'candidatesTokenCount' => 8,
+    'totalTokenCount' => 10
+  } }
+```
+
+##### Without Events
+
+You can use `stream_generate_content` without events:
+
+```ruby
+result = client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+)
+```
+
+In this case, the result will be an array with all the received events:
+
+```ruby
+[{ 'candidates' =>
+   [{ 'content' => {
+        'role' => 'model',
+        'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+      },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => {
+     'promptTokenCount' => 2,
+     'candidatesTokenCount' => 8,
+     'totalTokenCount' => 10
+   } }]
+```
+
+You can mix both as well:
+```ruby
+result = client.stream_generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
+) do |event, parsed, raw|
+  puts event
+end
+```
+
+#### generate_content
+
+```ruby
+result = client.generate_content(
+  { contents: { role: 'user', parts: { text: 'hi!' } } }
 )
 ```
 
-
+Result:
+```ruby
+{ 'candidates' =>
+  [{ 'content' => { 'parts' => [{ 'text' => 'Hello! How can I assist you today?' }], 'role' => 'model' },
+     'finishReason' => 'STOP',
+     'index' => 0,
+     'safetyRatings' =>
+     [{ 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+      { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+  'promptFeedback' =>
+  { 'safetyRatings' =>
+    [{ 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
|
372
|
+
{ 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
|
373
|
+
{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
|
374
|
+
{ 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] } }
|
375
|
+
```
|
270
376
|
|
271
|
-
|
377
|
+
As of the writing of this README, only the `generative-language-api` service supports the `generate_content` method; `vertex-ai-api` does not.
|
378
|
+
|
379
|
+
### Modes
|
380
|
+
|
381
|
+
#### Text
|
272
382
|
|
273
383
|
```ruby
|
274
384
|
result = client.stream_generate_content({
|
@@ -296,13 +406,132 @@ Result:
   } }]
 ```
 
-####
+#### Image
+
+![A black and white image of an old piano. The piano is an upright model, with the keys on the right side of the image. The piano is sitting on a tiled floor. There is a small round object on the top of the piano.](https://raw.githubusercontent.com/gbaptista/assets/main/gemini-ai/piano.jpg)
+
+> _Courtesy of [Unsplash](https://unsplash.com/photos/greyscale-photo-of-grand-piano-czPs0z3-Ggg)_
+
+Switch to the `gemini-pro-vision` model:
+
+```ruby
+client = Gemini.new(
+  credentials: { service: 'vertex-ai-api', region: 'us-east4' },
+  options: { model: 'gemini-pro-vision', server_sent_events: true }
+)
+```
+
+Then, encode the image as [Base64](https://en.wikipedia.org/wiki/Base64) and add its [MIME type](https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types):
+
+```ruby
+require 'base64'
+
+result = client.stream_generate_content(
+  { contents: [
+    { role: 'user', parts: [
+      { text: 'Please describe this image.' },
+      { inline_data: {
+        mime_type: 'image/jpeg',
+        data: Base64.strict_encode64(File.read('piano.jpg'))
+      } }
+    ] }
+  ] }
+)
+```
+
+The result:
+```ruby
+[{ 'candidates' =>
+   [{ 'content' =>
+      { 'role' => 'model',
+        'parts' =>
+        [{ 'text' =>
+           ' A black and white image of an old piano. The piano is an upright model, with the keys on the right side of the image. The piano is' }] },
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }] },
+ { 'candidates' =>
+   [{ 'content' => { 'role' => 'model', 'parts' => [{ 'text' => ' sitting on a tiled floor. There is a small round object on the top of the piano.' }] },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => { 'promptTokenCount' => 263, 'candidatesTokenCount' => 50, 'totalTokenCount' => 313 } }]
+```
+
+#### Video
+
+https://gist.github.com/assets/29520/f82bccbf-02d2-4899-9c48-eb8a0a5ef741
+
+> ALT: A white and gold cup is being filled with coffee. The coffee is dark and rich. The cup is sitting on a black surface. The background is blurred.
+
+> _Courtesy of [Pexels](https://www.pexels.com/video/pouring-of-coffee-855391/)_
+
+Switch to the `gemini-pro-vision` model:
 
-
+```ruby
+client = Gemini.new(
+  credentials: { service: 'vertex-ai-api', region: 'us-east4' },
+  options: { model: 'gemini-pro-vision', server_sent_events: true }
+)
+```
+
+Then, encode the video as [Base64](https://en.wikipedia.org/wiki/Base64) and add its [MIME type](https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types):
+
+```ruby
+require 'base64'
+
+result = client.stream_generate_content(
+  { contents: [
+    { role: 'user', parts: [
+      { text: 'Please describe this video.' },
+      { inline_data: {
+        mime_type: 'video/mp4',
+        data: Base64.strict_encode64(File.read('coffee.mp4'))
+      } }
+    ] }
+  ] }
+)
+```
+
+The result:
+```ruby
+[{"candidates"=>
+   [{"content"=>
+      {"role"=>"model",
+       "parts"=>
+       [{"text"=>
+          " A white and gold cup is being filled with coffee. The coffee is dark and rich. The cup is sitting on a black surface. The background is blurred"}]},
+     "safetyRatings"=>
+      [{"category"=>"HARM_CATEGORY_HARASSMENT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_HATE_SPEECH", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_SEXUALLY_EXPLICIT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_DANGEROUS_CONTENT", "probability"=>"NEGLIGIBLE"}]}],
+  "usageMetadata"=>{"promptTokenCount"=>1037, "candidatesTokenCount"=>31, "totalTokenCount"=>1068}},
+ {"candidates"=>
+   [{"content"=>{"role"=>"model", "parts"=>[{"text"=>"."}]},
+     "finishReason"=>"STOP",
+     "safetyRatings"=>
+      [{"category"=>"HARM_CATEGORY_HARASSMENT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_HATE_SPEECH", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_SEXUALLY_EXPLICIT", "probability"=>"NEGLIGIBLE"},
+       {"category"=>"HARM_CATEGORY_DANGEROUS_CONTENT", "probability"=>"NEGLIGIBLE"}]}],
+  "usageMetadata"=>{"promptTokenCount"=>1037, "candidatesTokenCount"=>32, "totalTokenCount"=>1069}}]
+```
+
+### Streaming vs. Server-Sent Events (SSE)
+
+[Server-Sent Events (SSE)](https://en.wikipedia.org/wiki/Server-sent_events) is a technology that allows certain endpoints to offer streaming capabilities, such as creating the impression that "the model is typing along with you," rather than delivering the entire answer all at once.
+
+You can set up the client to use Server-Sent Events (SSE) for all supported endpoints:
 ```ruby
 client = Gemini.new(
   credentials: { ... },
-  options: { model: 'gemini-pro',
+  options: { model: 'gemini-pro', server_sent_events: true }
 )
 ```
 
@@ -310,11 +539,11 @@ Or, you can decide on a request basis:
 ```ruby
 client.stream_generate_content(
   { contents: { role: 'user', parts: { text: 'hi!' } } },
-
+  server_sent_events: true
 )
 ```
 
-With
+With Server-Sent Events (SSE) enabled, you can use a block to receive partial results via events. This feature is particularly useful for methods that offer streaming capabilities, such as `stream_generate_content`:
 
 ```ruby
 client.stream_generate_content(
@@ -344,14 +573,16 @@ Event:
   } }
 ```
 
-
+Even though streaming methods utilize Server-Sent Events (SSE), using this feature doesn't necessarily mean streaming data. For example, when `generate_content` is called with SSE enabled, you will receive all the data at once in a single event, rather than through multiple partial events. This occurs because `generate_content` isn't designed for streaming, even though it is capable of utilizing Server-Sent Events.
+
+#### Server-Sent Events (SSE) Hang
 
-Method calls will _hang_ until the
+Method calls will _hang_ until the server-sent events finish, so even without providing a block, you can obtain the final results of the received events:
 
 ```ruby
 result = client.stream_generate_content(
   { contents: { role: 'user', parts: { text: 'hi!' } } },
-
+  server_sent_events: true
 )
 ```
 
@@ -375,6 +606,39 @@ Result:
   } }]
 ```
 
+#### Non-Streaming
+
+Depending on the service, you can use the [`generate_content`](#generate_content) method, which does not stream the answer.
+
+You can also use methods designed for streaming without necessarily processing partial events; instead, you can wait for the result of all received events:
+
+```ruby
+result = client.stream_generate_content({
+  contents: { role: 'user', parts: { text: 'hi!' } },
+  server_sent_events: false
+})
+```
+
+Result:
+```ruby
+[{ 'candidates' =>
+   [{ 'content' => {
+        'role' => 'model',
+        'parts' => [{ 'text' => 'Hello! How may I assist you?' }]
+      },
+      'finishReason' => 'STOP',
+      'safetyRatings' =>
+      [{ 'category' => 'HARM_CATEGORY_HARASSMENT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_HATE_SPEECH', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability' => 'NEGLIGIBLE' },
+       { 'category' => 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability' => 'NEGLIGIBLE' }] }],
+   'usageMetadata' => {
+     'promptTokenCount' => 2,
+     'candidatesTokenCount' => 8,
+     'totalTokenCount' => 10
+   } }]
+```
+
 ### Back-and-Forth Conversations
 
 To maintain a back-and-forth conversation, you need to append the received responses and build a history for your requests:
@@ -573,6 +837,58 @@ result = client.request(
 )
 ```
 
+### Error Handling
+
+#### Rescuing
+
+```ruby
+require 'gemini-ai'
+
+begin
+  client.stream_generate_content({
+    contents: { role: 'user', parts: { text: 'hi!' } }
+  })
+rescue Gemini::Errors::GeminiError => error
+  puts error.class # Gemini::Errors::RequestError
+  puts error.message # 'the server responded with status 500'
+
+  puts error.payload
+  # { contents: [{ role: 'user', parts: { text: 'hi!' } }],
+  #   generationConfig: { candidateCount: 1 },
+  #   ...
+  # }
+
+  puts error.request
+  # #<Faraday::ServerError response={:status=>500, :headers...
+end
+```
+
+#### For Short
+
+```ruby
+require 'gemini-ai/errors'
+
+begin
+  client.stream_generate_content({
+    contents: { role: 'user', parts: { text: 'hi!' } }
+  })
+rescue GeminiError => error
+  puts error.class # Gemini::Errors::RequestError
+end
+```
+
+#### Errors
+
+```ruby
+GeminiError
+
+MissingProjectIdError
+UnsupportedServiceError
+BlockWithoutServerSentEventsError
+
+RequestError
+```
+
 ## Development
 
 ```bash
@@ -591,7 +907,7 @@ gem build gemini-ai.gemspec
 
 gem signin
 
-gem push gemini-ai-
+gem push gemini-ai-3.0.0.gem
 ```
 
 ### Updating the README
metadata
CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: gemini-ai
 version: !ruby/object:Gem::Version
-  version:
+  version: 3.0.0
 platform: ruby
 authors:
 - gbaptista
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2023-12-
+date: 2023-12-17 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: event_stream_parser
@@ -78,9 +78,11 @@ files:
 - Gemfile.lock
 - LICENSE
 - README.md
+- components/errors.rb
 - controllers/client.rb
 - gemini-ai.gemspec
 - ports/dsl/gemini-ai.rb
+- ports/dsl/gemini-ai/errors.rb
 - static/gem.rb
 - tasks/generate-readme.clj
 - template.md
@@ -107,7 +109,7 @@ required_rubygems_version: !ruby/object:Gem::Requirement
 - !ruby/object:Gem::Version
   version: '0'
 requirements: []
-rubygems_version: 3.
+rubygems_version: 3.4.22
 signing_key:
 specification_version: 4
 summary: Interact with Google's Gemini AI.