RubyGems - active_genie - Versions diffs - 0.25.1 → 0.25.2 - Mend

active_genie 0.25.1 → 0.25.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (41) hide show

checksums.yaml +4 -4
data/README.md +5 -5
data/VERSION +1 -1
data/lib/active_genie/battle/README.md +7 -7
data/lib/active_genie/battle/generalist.json +36 -0
data/lib/active_genie/battle/generalist.md +16 -0
data/lib/active_genie/battle/generalist.rb +16 -69
data/lib/active_genie/clients/providers/anthropic_client.rb +62 -39
data/lib/active_genie/clients/providers/base_client.rb +38 -51
data/lib/active_genie/clients/providers/deepseek_client.rb +56 -49
data/lib/active_genie/clients/providers/google_client.rb +53 -53
data/lib/active_genie/clients/providers/openai_client.rb +53 -54
data/lib/active_genie/clients/unified_client.rb +4 -4
data/lib/active_genie/config/battle_config.rb +2 -0
data/lib/active_genie/config/llm_config.rb +3 -1
data/lib/active_genie/config/log_config.rb +33 -11
data/lib/active_genie/config/providers/anthropic_config.rb +2 -2
data/lib/active_genie/config/providers/deepseek_config.rb +2 -2
data/lib/active_genie/config/providers/google_config.rb +2 -2
data/lib/active_genie/config/providers/openai_config.rb +2 -2
data/lib/active_genie/config/providers_config.rb +4 -4
data/lib/active_genie/config/scoring_config.rb +2 -0
data/lib/active_genie/configuration.rb +14 -8
data/lib/active_genie/data_extractor/from_informal.json +11 -0
data/lib/active_genie/data_extractor/from_informal.rb +3 -11
data/lib/active_genie/data_extractor/generalist.json +9 -0
data/lib/active_genie/data_extractor/generalist.rb +12 -11
data/lib/active_genie/errors/invalid_log_output_error.rb +19 -0
data/lib/active_genie/logger.rb +10 -4
data/lib/active_genie/{concerns → ranking/concerns}/loggable.rb +2 -5
data/lib/active_genie/ranking/elo_round.rb +31 -27
data/lib/active_genie/ranking/free_for_all.rb +29 -21
data/lib/active_genie/ranking/player.rb +45 -17
data/lib/active_genie/ranking/players_collection.rb +16 -6
data/lib/active_genie/ranking/ranking.rb +21 -20
data/lib/active_genie/ranking/ranking_scoring.rb +2 -19
data/lib/active_genie/scoring/generalist.json +9 -0
data/lib/active_genie/scoring/generalist.md +46 -0
data/lib/active_genie/scoring/generalist.rb +13 -65
data/lib/active_genie/scoring/recommended_reviewers.rb +2 -2
metadata +11 -4

data/lib/active_genie/scoring/generalist.md ADDED Viewed

@@ -0,0 +1,46 @@
+Evaluate and score the provided text based on predefined criteria, using a scoring range of 0 to 100 with 100 representing the highest possible score.
+Follow the instructions below to ensure a comprehensive and objective assessment.
+# Evaluation Process
+1. **Analysis**:
+  - Thoroughly compare the text against each criterion for a comprehensive evaluation.
+2. **Document Deviations**:
+  - Identify and document areas where the content does not align with the specified criteria.
+3. **Highlight Strengths**:
+  - Note notable features or elements that enhance the quality or effectiveness of the content.
+4. **Identify Weaknesses**:
+  - Specify areas where the content fails to meet the criteria or where improvements could be made.
+# Scoring Fairness
+- Ensure the assigned score reflects both the alignment with the criteria and the content's effectiveness.
+- Consider if the fulfillment of other criteria compensates for areas lacking extreme details.
+# Scoring Range
+Segment scores into five parts before assigning a final score:
+- **Terrible**: 0-20 - Content does not meet the criteria.
+- **Bad**: 21-40 - Content is substandard but meets some criteria.
+- **Average**: 41-60 - Content meets criteria with room for improvement.
+- **Good**: 61-80 - Content exceeds criteria and is above average.
+- **Great**: 81-100 - Content exceeds all expectations.
+# Guidelines
+- Maintain objectivity and avoid biases.
+- Deconstruct each criterion into actionable components for systematic evaluation.
+- Apply reasonable judgment in assigning a score, justifying your rationale clearly.
+# Output Format
+- Provide a detailed review including:
+  - A final score (0-100)
+  - Specific reasoning for the assigned score, detailing all evaluated criteria
+  - Include both positive aspects and suggested improvements
+# Notes
+- Consider edge cases where the text may partially align with criteria.
+- If lacking information, reasonably judge and explain your scoring approach.

data/lib/active_genie/scoring/generalist.rb CHANGED Viewed

@@ -44,21 +44,9 @@ module ActiveGenie
           {  role: 'user', content: "Text to score: #{@text}" }
         ]
-        properties = build_properties
-        function = {
-          name: 'scoring',
-          description: 'Score the text based on the given criteria.',
-          parameters: {
-            type: 'object',
-            properties:,
-            required: properties.keys
-          }
-        }
         result = ::ActiveGenie::Clients::UnifiedClient.function_calling(
           messages,
-          function,
+          build_function,
           config: @config
         )
@@ -76,8 +64,20 @@ module ActiveGenie
         result
       end
+      PROMPT = File.read(File.join(__dir__, 'generalist.md'))
       private
+      def build_function
+        properties = build_properties
+        function = JSON.parse(File.read(File.join(__dir__, 'generalist.json')), symbolize_names: true)
+        function[:parameters][:properties] = properties
+        function[:parameters][:required] = properties.keys
+        function
+      end
       def build_properties
         properties = {}
         reviewers.each do |reviewer|
@@ -114,58 +114,6 @@ module ActiveGenie
                          [result['reviewer1'], result['reviewer2'], result['reviewer3']]
                        end
       end
-      PROMPT = <<~PROMPT
-        Evaluate and score the provided text based on predefined criteria, using a scoring range of 0 to 100 with 100 representing the highest possible score.
-        Follow the instructions below to ensure a comprehensive and objective assessment.
-        # Evaluation Process
-        1. **Analysis**:
-          - Thoroughly compare the text against each criterion for a comprehensive evaluation.
-        2. **Document Deviations**:
-          - Identify and document areas where the content does not align with the specified criteria.
-        3. **Highlight Strengths**:
-          - Note notable features or elements that enhance the quality or effectiveness of the content.
-        4. **Identify Weaknesses**:
-          - Specify areas where the content fails to meet the criteria or where improvements could be made.
-        # Scoring Fairness
-        - Ensure the assigned score reflects both the alignment with the criteria and the content's effectiveness.
-        - Consider if the fulfillment of other criteria compensates for areas lacking extreme details.
-        # Scoring Range
-        Segment scores into five parts before assigning a final score:
-        - **Terrible**: 0-20 - Content does not meet the criteria.
-        - **Bad**: 21-40 - Content is substandard but meets some criteria.
-        - **Average**: 41-60 - Content meets criteria with room for improvement.
-        - **Good**: 61-80 - Content exceeds criteria and is above average.
-        - **Great**: 81-100 - Content exceeds all expectations.
-        # Guidelines
-        - Maintain objectivity and avoid biases.
-        - Deconstruct each criterion into actionable components for systematic evaluation.
-        - Apply reasonable judgment in assigning a score, justifying your rationale clearly.
-        # Output Format
-        - Provide a detailed review including:
-          - A final score (0-100)
-          - Specific reasoning for the assigned score, detailing all evaluated criteria
-          - Include both positive aspects and suggested improvements
-        # Notes
-        - Consider edge cases where the text may partially align with criteria.
-        - If lacking information, reasonably judge and explain your scoring approach.
-      PROMPT
     end
   end
 end

data/lib/active_genie/scoring/recommended_reviewers.rb CHANGED Viewed

@@ -62,8 +62,6 @@ module ActiveGenie
         )
       end
-      private
       PROMPT = <<~PROMPT
         Identify the top 3 suitable reviewer titles or roles based on the provided text and criteria. Selected reviewers must possess subject matter expertise, offer valuable insights, and ensure diverse yet aligned perspectives on the content.
@@ -79,6 +77,8 @@ module ActiveGenie
         - Avoid redundant or overly similar titles/roles to maintain diversity.
       PROMPT
+      private
       def client
         ::ActiveGenie::Clients::UnifiedClient
       end

metadata CHANGED Viewed

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: active_genie
 version: !ruby/object:Gem::Version
-  version: 0.25.1
+  version: 0.25.2
 platform: ruby
 authors:
 - Radamés Roriz
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2025-05-19 00:00:00.000000000 Z
+date: 2025-06-01 00:00:00.000000000 Z
 dependencies: []
 description: |
   The lodash for GenAI, stop reinventing the wheel
@@ -26,6 +26,8 @@ files:
 - lib/active_genie.rb
 - lib/active_genie/battle.rb
 - lib/active_genie/battle/README.md
+- lib/active_genie/battle/generalist.json
+- lib/active_genie/battle/generalist.md
 - lib/active_genie/battle/generalist.rb
 - lib/active_genie/clients/providers/anthropic_client.rb
 - lib/active_genie/clients/providers/base_client.rb
@@ -33,7 +35,6 @@ files:
 - lib/active_genie/clients/providers/google_client.rb
 - lib/active_genie/clients/providers/openai_client.rb
 - lib/active_genie/clients/unified_client.rb
-- lib/active_genie/concerns/loggable.rb
 - lib/active_genie/config/battle_config.rb
 - lib/active_genie/config/data_extractor_config.rb
 - lib/active_genie/config/llm_config.rb
@@ -49,13 +50,17 @@ files:
 - lib/active_genie/configuration.rb
 - lib/active_genie/data_extractor.rb
 - lib/active_genie/data_extractor/README.md
+- lib/active_genie/data_extractor/from_informal.json
 - lib/active_genie/data_extractor/from_informal.rb
+- lib/active_genie/data_extractor/generalist.json
 - lib/active_genie/data_extractor/generalist.md
 - lib/active_genie/data_extractor/generalist.rb
+- lib/active_genie/errors/invalid_log_output_error.rb
 - lib/active_genie/errors/invalid_provider_error.rb
 - lib/active_genie/logger.rb
 - lib/active_genie/ranking.rb
 - lib/active_genie/ranking/README.md
+- lib/active_genie/ranking/concerns/loggable.rb
 - lib/active_genie/ranking/elo_round.rb
 - lib/active_genie/ranking/free_for_all.rb
 - lib/active_genie/ranking/player.rb
@@ -64,6 +69,8 @@ files:
 - lib/active_genie/ranking/ranking_scoring.rb
 - lib/active_genie/scoring.rb
 - lib/active_genie/scoring/README.md
+- lib/active_genie/scoring/generalist.json
+- lib/active_genie/scoring/generalist.md
 - lib/active_genie/scoring/generalist.rb
 - lib/active_genie/scoring/recommended_reviewers.rb
 - lib/tasks/benchmark.rake
@@ -86,7 +93,7 @@ required_ruby_version: !ruby/object:Gem::Requirement
   requirements:
   - - ">="
     - !ruby/object:Gem::Version
-      version: 3.3.0
+      version: 3.4.0
 required_rubygems_version: !ruby/object:Gem::Requirement
   requirements:
   - - ">="