RubyGems - geminize - Versions diffs - 1.1.0 → 1.2.0 - Mend

geminize 1.1.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (22)

checksums.yaml +4 -4
data/.memory_bank/activeContext.md +25 -1
data/.memory_bank/progress.md +26 -13
data/.memory_bank/projectbrief.md +16 -0
data/.memory_bank/tasks.md +57 -13
data/CHANGELOG.md +22 -0
data/README.md +181 -0
data/examples/function_calling.rb +218 -0
data/examples/safety_settings.rb +82 -0
data/lib/geminize/models/content_request_extensions.rb +219 -0
data/lib/geminize/models/content_request_safety.rb +123 -0
data/lib/geminize/models/content_response_extensions.rb +120 -0
data/lib/geminize/models/function_declaration.rb +112 -0
data/lib/geminize/models/function_response.rb +70 -0
data/lib/geminize/models/safety_setting.rb +102 -0
data/lib/geminize/models/tool.rb +47 -0
data/lib/geminize/models/tool_config.rb +52 -0
data/lib/geminize/module_extensions.rb +228 -0
data/lib/geminize/module_safety.rb +135 -0
data/lib/geminize/version.rb +1 -1
data/lib/geminize.rb +12 -0
metadata +13 -1

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 1b3672d83fd6ff1d0a803808ee2c6d71681053229a84b3f33e0ea8b64b78dc74
-  data.tar.gz: 79eca877306a66edcf9dd6bfa3afc3dd1b1222c6127d4391cdc077944557a3bf
+  metadata.gz: 900c19e9cf075e32779e3ee2f4f4530fdced2c027e8bc393d05250c1b20e5dac
+  data.tar.gz: 7d1a9a2f365eff24d2625492916d578f661fd881263c1cd4101352d7b16780ce
 SHA512:
-  metadata.gz: 81c4a2324a026449c00e983a2ad0f7e4e89e5d77564a62b6e7d430303eeeb129f32002f0c9a41055b4001044ec6272386ebba920e3e0565b8ac8c61f7bbc2d5d
-  data.tar.gz: 7aeb4afe504891c0849f83b04d3afc7b50fa45c1b60d6ae5ebcaea681b403e13c1bc86613694713162fff8ab220b0f447fedf808f0611fb0dbfb7476a2243148
+  metadata.gz: b1e3a5235a330b9bba4690a2452992f331bf4b76294109e2d4a8beb3fbc357c3f06be8b9728d607aff8653367458827bf17b166a1c89c6d243062272e747b154
+  data.tar.gz: a9c73fc3264fccaa26ba6b525d89807a9a99052232c12387281844d1cca821609c795fe199ed222c601962e8f144fc73854b73eccc75994fc3ab6ac70e4f7b2d
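
The hunk above only swaps digests, so it may help to see how such a file can be checked. The following is a minimal sketch, assuming the two archives are already available as byte strings; `verify_checksums` is a hypothetical helper, not part of RubyGems or geminize, and the payloads are stand-ins rather than the real archive contents.

```ruby
require "digest"
require "yaml"

# Hypothetical helper: compares the SHA256 and SHA512 digests of in-memory
# archive bytes against the entries of a parsed checksums.yaml document.
def verify_checksums(checksums, files)
  files.all? do |name, bytes|
    checksums.fetch("SHA256").fetch(name) == Digest::SHA256.hexdigest(bytes) &&
      checksums.fetch("SHA512").fetch(name) == Digest::SHA512.hexdigest(bytes)
  end
end

# Stand-in payloads; in practice these would be the bytes of metadata.gz and
# data.tar.gz taken from the unpacked .gem archive.
files = { "metadata.gz" => "example metadata", "data.tar.gz" => "example data" }

# Build a checksums document with the same layout as the diff above.
checksums_yaml = {
  "SHA256" => files.transform_values { |bytes| Digest::SHA256.hexdigest(bytes) },
  "SHA512" => files.transform_values { |bytes| Digest::SHA512.hexdigest(bytes) }
}.to_yaml

checksums = YAML.safe_load(checksums_yaml)
puts verify_checksums(checksums, files)
```

A changed byte in either archive changes both digests, which is why every entry in this file differs between 1.1.0 and 1.2.0.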

data/.memory_bank/activeContext.md CHANGED

@@ -9,6 +9,9 @@
 - Multimodal content handling (text + images)
 - Embeddings generation
 - Streaming responses
+- Function calling capabilities
+- JSON mode for structured responses
+- Safety settings for content moderation
 ### Key Implementation Details
@@ -18,6 +21,23 @@
 - System instructions for guiding model behavior
 - Generation parameters (temperature, top_k, top_p)
 - Support for stop sequences
+- Safety settings for content moderation
+#### Function Calling
+- Tool integration with Gemini API
+- Function declaration and response models
+- Support for multiple function definitions
+- Tool execution modes (AUTO, MANUAL, NONE)
+- Function result processing
+- String-based function call detection
+#### JSON Mode
+- Structured data response handling
+- Automatic JSON parsing
+- JSON schema integration
+- Response validation
 #### Multimodal Support
@@ -69,10 +89,14 @@
 - Streaming response optimizations
 - Comprehensive documentation
 - Error handling refinements
+- Models API integration
+- Function calling support
 ### Upcoming Features
-- Support for newer Gemini models
+- Code execution support
+- Additional content types (audio, PDF)
+- Caching support
 - Advanced parameter tuning
 - Additional vector operations
 - Improved conversation persistence

data/.memory_bank/progress.md CHANGED

@@ -2,7 +2,7 @@
 ## Current Status
-Geminize is a functional Ruby gem providing a complete interface to the Google Gemini API. The current implementation includes all core features with a focus on stability, documentation, and ease of use.
+Geminize is a functional Ruby gem providing a complete interface to the Google Gemini API. The current implementation includes all core features with a focus on stability, documentation, and ease of use. The gem now also includes function calling, JSON mode, and safety settings features.
 ## Development Progress
@@ -15,6 +15,10 @@ Geminize is a functional Ruby gem providing a complete interface to the Google G
 - **Streaming Responses**: ✅ Complete
 - **Error Handling**: ✅ Complete
 - **Configuration System**: ✅ Complete
+- **Models API**: ✅ Complete
+- **Function Calling**: ✅ Complete
+- **JSON Mode**: ✅ Complete
+- **Safety Settings**: ✅ Complete
 ### Documentation
@@ -26,7 +30,7 @@ Geminize is a functional Ruby gem providing a complete interface to the Google G
 ### Testing
 - **Unit Tests**: ✅ Complete
-- **Integration Tests**: 🟡 Partially Complete (70%)
+- **Integration Tests**: 🟡 Partially Complete (85%)
 - **Performance Tests**: ❌ Not Started
 ### Deployment
@@ -50,27 +54,35 @@ Geminize is a functional Ruby gem providing a complete interface to the Google G
 - Additional embedding task types
 - Improved documentation
-### v1.0.0 (Planned for 2025-05-02)
+### v1.0.0 (2025-05-02)
 - Removed Rails-specific integration
 - Simplified usage
 - API stabilization
-## Upcoming Milestones
+### v1.1.0 (2025-05-02)
+- Support for Gemini Models API
+- Model discovery and filtering functionality
+- Pagination support
+- Improved documentation
-### v1.1.0 (Planned)
+### v1.2.0 (2025-05-02)
-- Support for latest Gemini models
-- Advanced vector operations
-- Improved streaming performance
-- Enhanced documentation
+- Added function calling capabilities
+- Added JSON mode for structured responses
+- Added safety settings for content moderation
+- Comprehensive test suite for new features
+- Updated documentation
+## Upcoming Milestones
-### v1.2.0 (Planned)
+### v1.3.0 (Planned)
-- Batch processing capabilities
+- Code execution support
+- Additional content types (audio, PDF)
+- Caching mechanisms
 - Improved conversation persistence
-- Additional configuration options
-- Memory optimizations
 ### v2.0.0 (Planned)
@@ -78,3 +90,4 @@ Geminize is a functional Ruby gem providing a complete interface to the Google G
 - Advanced function calling support
 - Performance improvements
 - Additional middleware options
+- Async API support

data/.memory_bank/projectbrief.md CHANGED

@@ -11,6 +11,10 @@ Geminize is a Ruby gem providing a convenient and robust interface to Google's G
 - **Conversation Management**: Maintain conversation context for chat applications
 - **Embeddings Generation**: Generate and manipulate vector representations for text
 - **Streaming Responses**: Support for efficient streaming output
+- **Models API**: Discover and filter available Gemini models
+- **Function Calling**: Enable AI models to call functions defined by developers
+- **JSON Mode**: Generate structured JSON responses for data-oriented applications
+- **Safety Settings**: Control content generation with configurable safety thresholds
 - **Error Handling**: Comprehensive error handling and validation
 ## Technical Foundation
@@ -19,6 +23,8 @@ Geminize is a Ruby gem providing a convenient and robust interface to Google's G
 - Faraday for HTTP client functionality
 - Support for environment-based or programmatic configuration
 - Comprehensive API for both simple and advanced use cases
+- VCR-based testing for API interactions
+- Modular design with extensible components
 ## Project Goals
@@ -27,3 +33,13 @@ Geminize is a Ruby gem providing a convenient and robust interface to Google's G
 - Maintain clean, well-documented, and idiomatic Ruby code
 - Support both simple use cases and advanced configurations
 - Enable seamless integration with various Ruby applications
+- Stay current with the latest Gemini API features and models
+## Current Version
+Version 1.2.0 implements all core features including the latest additions:
+- Function calling capabilities
+- JSON mode for structured responses
+- Safety settings for content moderation
+- Comprehensive Models API

data/.memory_bank/tasks.md CHANGED

@@ -6,32 +6,53 @@
 - [ ] Review and improve YARD documentation
 - [ ] Add more code examples
-- [ ] Update README with latest features
+- [x] Update README with latest features
 - [ ] Create diagrams for architecture overview
 ### Feature Development
 - [ ] Support for new Gemini models as they become available
-- [ ] Add support for function calling capabilities
+- [ ] Implement Gemini API missing features:
+  - [x] **Function Calling Support**
+    - [x] Create model classes for function calling structures
+    - [x] Update ContentRequest to support tools and functions
+    - [x] Update request builder and response handling
+    - [x] Add module-level convenience methods
+    - [x] Add comprehensive VCR tests for function calling
+  - [x] **JSON Mode Support**
+    - [x] Add MIME type support for JSON responses
+    - [x] Implement helper methods for JSON generation
+    - [x] Add validation for JSON response structures
+    - [x] Add tests for JSON mode functionality
+  - [x] **Safety Settings**
+    - [x] Create SafetySetting model
+    - [x] Add safety configuration to requests
+    - [x] Implement module-level safety methods
+    - [x] Add tests for safety settings
+  - [ ] **Code Execution Support**
+    - [ ] Create code execution model classes
+    - [ ] Implement code execution tools in requests
+    - [ ] Update response handling for code execution
+  - [ ] **Additional Content Types**
+    - [ ] Audio content support
+    - [ ] Document/PDF content support
+    - [ ] Video content support
+  - [ ] **Caching Support**
+    - [ ] Add caching to content requests
+    - [ ] Implement cached content handling
+    - [ ] Add module-level caching methods
+- [x] Add support for function calling capabilities
 - [ ] Implement batch embedding generation
 - [ ] Improve conversation persistence with adapter pattern for multiple storage options
-- [ ] **Models API Integration**:
-  - [x] Enhance `model_info.rb` to support full model metadata
-  - [x] Update/create `Models::Model` class to match API response structure
-  - [x] Implement `Models::ModelList` class for handling paginated results
-  - [x] Add methods to `RequestBuilder` for models endpoints
-  - [x] Add client methods for models endpoints
-  - [x] Add convenience methods to main Geminize module
-  - [x] Implement helper methods for model capability filtering
-  - [x] Add comprehensive tests for models functionality
-  - [x] Update documentation with models API examples
 ### Testing
 - [ ] Expand test coverage
 - [ ] Add integration tests for streaming
-- [ ] Update VCR cassettes with latest API responses
+- [x] Update VCR cassettes with latest API responses
 - [ ] Add benchmarks for performance testing
+- [x] Create test fixtures for new Gemini API features
+- [x] Add VCR cassettes for function calling responses
 ### Improvements
@@ -57,18 +78,41 @@
 - [x] Embeddings generation
 - [x] Streaming response handling
 - [x] Models API Integration
+  - [x] Enhance `model_info.rb` to support full model metadata
+  - [x] Update/create `Models::Model` class to match API response structure
+  - [x] Implement `Models::ModelList` class for handling paginated results
+  - [x] Add methods to `RequestBuilder` for models endpoints
+  - [x] Add client methods for models endpoints
+  - [x] Add convenience methods to main Geminize module
+  - [x] Implement helper methods for model capability filtering
+  - [x] Add comprehensive tests for models functionality
+  - [x] Update documentation with models API examples
+- [x] Function Calling Support
+  - [x] Create function declaration, tool, and response models
+  - [x] Implement request and response extensions
+  - [x] Add module-level methods for function calling
+  - [x] Add VCR tests for real API interactions
+- [x] JSON Mode Support
+  - [x] Add MIME type support and JSON response parsing
+  - [x] Add structured data generation features
+- [x] Safety Settings Support
+  - [x] Implement safety categories and thresholds
+  - [x] Add safety-focused generation methods
 ### Documentation
 - [x] Initial README with examples
 - [x] YARD documentation for public methods
 - [x] Example scripts
+- [x] Update README with function calling, JSON mode, and safety settings
 ### Testing
 - [x] Basic test suite with RSpec
 - [x] VCR setup for API mocking
 - [x] Unit tests for core functionality
+- [x] Integration tests for function calling
+- [x] Tests for JSON mode and safety settings
 ### Error Handling

data/CHANGELOG.md CHANGED

@@ -1,3 +1,25 @@
+## [1.2.0] - 2025-05-02
+### Added
+- Function calling capabilities for working with the Gemini API's tool features
+  - Added `generate_with_functions` method to create content with function definitions
+  - Added `process_function_call` method to handle function responses
+  - Added support for multiple function declarations in a single request
+  - Added configurable tool execution modes (AUTO, MANUAL, NONE)
+  - Implemented comprehensive test suite for function calling features
+  - Added tool-related model classes (Tool, ToolConfig, FunctionDeclaration, FunctionResponse)
+- JSON mode for structured data responses
+  - Added `generate_json` method for receiving JSON-formatted responses
+  - Implemented automatic parsing of JSON responses
+  - Added type-checking and validation for JSON mode configuration
+- Safety settings for controlled content generation
+  - Added `generate_with_safety_settings` method for customizing safety settings
+  - Added `generate_text_safe` for maximum content safety
+  - Added `generate_text_permissive` for minimum content filtering
+  - Added comprehensive safety categories and threshold levels
+  - Implemented validation for safety setting configurations
 ## [1.1.0] - 2025-05-02
 ### Added

data/README.md CHANGED

@@ -10,6 +10,9 @@ A convenient and robust Ruby interface for the Google Gemini API, enabling easy
 - Multimodal inputs (text + images)
 - Embeddings generation
 - Support for streaming responses
+- Function calling capabilities for tool integration
+- JSON mode for structured data responses
+- Safety settings for content moderation
 - Comprehensive error handling
 - Complete Models API for discovering and filtering available models
@@ -273,6 +276,184 @@ end
 See the `examples/embeddings.rb` file for more comprehensive examples of working with embeddings.
+## Function Calling
+Geminize provides support for Gemini's function calling capabilities, allowing the AI model to call functions defined by you:
+```ruby
+require 'geminize'
+# Assumes API key is configured via .env
+# Define functions that the model can call
+weather_functions = [
+  {
+    name: "get_weather",
+    description: "Get the current weather for a location",
+    parameters: {
+      type: "object",
+      properties: {
+        location: {
+          type: "string",
+          description: "The city and state, e.g. New York, NY"
+        },
+        unit: {
+          type: "string",
+          enum: ["celsius", "fahrenheit"],
+          description: "The unit of temperature"
+        }
+      },
+      required: ["location"]
+    }
+  }
+]
+# Generate a response that may include a function call
+response = Geminize.generate_with_functions(
+  "What's the weather in San Francisco?",
+  weather_functions,
+  "gemini-1.5-pro", # Make sure you use a model that supports function calling
+  {
+    temperature: 0.2,
+    system_instruction: "Use the provided function to get weather information."
+  }
+)
+# Check if the response contains a function call
+if response.has_function_call?
+  function_call = response.function_call
+  puts "Function called: #{function_call.name}"
+  puts "Arguments: #{function_call.response.inspect}"
+  # Process the function call with your implementation
+  final_response = Geminize.process_function_call(response) do |name, args|
+    if name == "get_weather"
+      location = args["location"]
+      # Call your actual weather API here
+      # For this example, we'll just return mock data
+      {
+        temperature: 72,
+        conditions: "partly cloudy",
+        humidity: 65,
+        location: location
+      }
+    end
+  end
+  # Display the final response
+  puts "Final response: #{final_response.text}"
+else
+  puts "No function call in response: #{response.text}"
+end
+```
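
The block in the README example above handles a single function name with an `if` and returns `nil` for anything else. When several functions are declared, a lookup table is one possible way to route calls. This is a plain-Ruby sketch that does not touch the gem; `HANDLERS` and `dispatch_function` are hypothetical names introduced here for illustration.

```ruby
# Hypothetical handler table: maps declared function names to callables.
HANDLERS = {
  "get_weather" => ->(args) {
    # Mock data, as in the README example; a real handler would call a weather API.
    { temperature: 72, conditions: "partly cloudy", humidity: 65, location: args["location"] }
  }
}.freeze

# Looks up the handler for a function name and calls it with the arguments.
# Unknown names produce an error hash instead of nil.
def dispatch_function(name, args)
  handler = HANDLERS[name]
  return { error: "unknown function: #{name}" } unless handler

  handler.call(args)
end

result = dispatch_function("get_weather", { "location" => "San Francisco, CA" })
puts result.inspect
```

Under these assumptions, the body of the block passed to `Geminize.process_function_call` could reduce to `dispatch_function(name, args)`.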
+### Function Call Options
+You can customize function calling behavior:
+```ruby
+# Set the tool execution mode:
+# - "AUTO": Model decides when to call functions
+# - "MANUAL": Functions are only used when explicitly requested
+# - "NONE": Functions are ignored
+response = Geminize.generate_with_functions(
+  prompt,
+  functions,
+  model_name,
+  { tool_execution_mode: "MANUAL" }
+)
+# Control retry behavior
+response = Geminize.generate_with_functions(
+  prompt,
+  functions,
+  model_name,
+  with_retries: false # Disable automatic retries on failure
+)
+```
+## JSON Mode
+Generate structured JSON responses from the model:
+```ruby
+require 'geminize'
+# Assumes API key is configured via .env
+# Request JSON-formatted data
+response = Geminize.generate_json(
+  "List the three largest planets in our solar system with their diameters in km",
+  "gemini-1.5-pro", # Use a model that supports JSON mode
+  { temperature: 0.2 }
+)
+# Access the parsed JSON data
+if response.has_json_response?
+  planets = response.json_response
+  puts "Received structured data:"
+  planets.each do |planet|
+    puts "#{planet['name']}: #{planet['diameter']} km"
+  end
+else
+  puts "No valid JSON in response: #{response.text}"
+end
+```
+The JSON mode is ideal for getting structured data that you can programmatically process in your application.
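
The README example iterates over the parsed data as an array of hashes with `name` and `diameter` keys, a shape the prompt does not guarantee. A caller may want to check that shape before using it. The sketch below uses only Ruby's standard `json` library and sample text; `parse_planets` is a hypothetical helper, and the diameters are illustrative values rather than model output.

```ruby
require "json"

# Hypothetical helper: returns the parsed list when the text is a JSON array
# of objects that each carry "name" and "diameter" keys, otherwise nil.
def parse_planets(text)
  data = JSON.parse(text)
  return nil unless data.is_a?(Array)
  return nil unless data.all? { |item| item.is_a?(Hash) && item.key?("name") && item.key?("diameter") }

  data
rescue JSON::ParserError
  # Text that is not valid JSON is treated the same as a wrong shape.
  nil
end

# Illustrative sample text standing in for a model response.
sample = '[{"name": "Jupiter", "diameter": 139820}, {"name": "Saturn", "diameter": 116460}]'

planets = parse_planets(sample)
planets.each { |planet| puts "#{planet['name']}: #{planet['diameter']} km" }
```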
+## Safety Settings
+Control content generation with safety settings:
+```ruby
+require 'geminize'
+# Assumes API key is configured via .env
+# Generate content with custom safety settings
+safety_settings = [
+  { category: "HARM_CATEGORY_DANGEROUS_CONTENT", threshold: "BLOCK_MEDIUM_AND_ABOVE" },
+  { category: "HARM_CATEGORY_HATE_SPEECH", threshold: "BLOCK_LOW_AND_ABOVE" }
+]
+response = Geminize.generate_with_safety_settings(
+  "Explain the concept of nuclear fission",
+  safety_settings,
+  "gemini-1.5-pro",
+  { temperature: 0.7 }
+)
+puts response.text
+# For maximum safety (blocks most potentially harmful content)
+safe_response = Geminize.generate_text_safe(
+  "Tell me about controversial political topics",
+  "gemini-1.5-pro"
+)
+puts "Safe response: #{safe_response.text}"
+# For minimum filtering (blocks only the most harmful content)
+permissive_response = Geminize.generate_text_permissive(
+  "Describe a controversial historical event",
+  "gemini-1.5-pro"
+)
+puts "Permissive response: #{permissive_response.text}"
+```
+Available safety categories:
+- `HARM_CATEGORY_HATE_SPEECH`
+- `HARM_CATEGORY_DANGEROUS_CONTENT`
+- `HARM_CATEGORY_HARASSMENT`
+- `HARM_CATEGORY_SEXUALLY_EXPLICIT`
+Available threshold levels (from most to least restrictive):
+- `BLOCK_LOW_AND_ABOVE`
+- `BLOCK_MEDIUM_AND_ABOVE`
+- `BLOCK_ONLY_HIGH`
+- `BLOCK_NONE`
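
The category and threshold lists above can be written down as data, which makes it possible to catch typos in a settings array before sending a request. This is a minimal sketch independent of the gem's own validation; `invalid_settings` and `stricter?` are hypothetical helpers, and the ordering follows the "most to least restrictive" list in the README.

```ruby
CATEGORIES = %w[
  HARM_CATEGORY_HATE_SPEECH
  HARM_CATEGORY_DANGEROUS_CONTENT
  HARM_CATEGORY_HARASSMENT
  HARM_CATEGORY_SEXUALLY_EXPLICIT
].freeze

# Ordered from most to least restrictive, as listed in the README.
THRESHOLDS = %w[
  BLOCK_LOW_AND_ABOVE
  BLOCK_MEDIUM_AND_ABOVE
  BLOCK_ONLY_HIGH
  BLOCK_NONE
].freeze

# Returns the entries whose category or threshold is not in the lists above.
def invalid_settings(settings)
  settings.reject do |setting|
    CATEGORIES.include?(setting[:category]) && THRESHOLDS.include?(setting[:threshold])
  end
end

# True when threshold a blocks more than threshold b.
# Raises ArgumentError for a threshold that is not in the list.
def stricter?(a, b)
  rank_a = THRESHOLDS.index(a)
  rank_b = THRESHOLDS.index(b)
  raise ArgumentError, "unknown threshold" if rank_a.nil? || rank_b.nil?

  rank_a < rank_b
end

settings = [
  { category: "HARM_CATEGORY_DANGEROUS_CONTENT", threshold: "BLOCK_MEDIUM_AND_ABOVE" },
  { category: "HARM_CATEGORY_HATE_SPEECH", threshold: "BLOCK_LOW_AND_ABOVE" }
]

puts invalid_settings(settings).empty?
puts stricter?("BLOCK_LOW_AND_ABOVE", "BLOCK_ONLY_HIGH")
```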
 ## Streaming Responses
 Get real-time, token-by-token responses: