dspy-datasets 0.29.1 → 1.0.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/README.md +120 -100
- data/lib/dspy/datasets/hotpot_qa.rb +182 -0
- data/lib/dspy/datasets/loaders/huggingface_parquet.rb +66 -33
- data/lib/dspy/datasets/manifest.rb +22 -0
- data/lib/dspy/datasets/version.rb +1 -1
- data/lib/dspy/datasets.rb +1 -0
- metadata +9 -5
checksums.yaml
CHANGED
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
---
|
|
2
2
|
SHA256:
|
|
3
|
-
metadata.gz:
|
|
4
|
-
data.tar.gz:
|
|
3
|
+
metadata.gz: f279306528c5bfcfaabda4c088e8d9f909c0da3ec1d259c2ef2ffcf0daf64476
|
|
4
|
+
data.tar.gz: b07a55b2949e177d5dd8756211831099abf5aa1fbd942dc3d99c635410bc4bee
|
|
5
5
|
SHA512:
|
|
6
|
-
metadata.gz:
|
|
7
|
-
data.tar.gz:
|
|
6
|
+
metadata.gz: 507c2dd52c37d081bd452529ad88973b3f4821bb42a50214eeb8cd99fd7c6a8aec5497354ccf2a379e284196bfdf27ad44332f02bb326c637a10c2c8203a4fdf
|
|
7
|
+
data.tar.gz: 100a758e3242f4dd0788f349323e688bc00240c9b4a6095f4066c2bb6f9955e09de9c0231e0656ef31a05dc1ec88e51b90e815037b3e5cf254da4d4d3c9642b4
|
data/README.md
CHANGED
|
@@ -5,26 +5,79 @@
|
|
|
5
5
|
[](https://github.com/vicentereig/dspy.rb/actions/workflows/ruby.yml)
|
|
6
6
|
[](https://vicentereig.github.io/dspy.rb/)
|
|
7
7
|
|
|
8
|
+
> [!NOTE]
|
|
9
|
+
> The core Prompt Engineering Framework is production-ready with
|
|
10
|
+
> comprehensive documentation. I am now focusing on educational content about systematic Prompt Optimization and Context Engineering.
|
|
11
|
+
> Your feedback is invaluable. If you encounter issues, please open an [issue](https://github.com/vicentereig/dspy.rb/issues). If you have suggestions, open a [new thread](https://github.com/vicentereig/dspy.rb/discussions).
|
|
12
|
+
>
|
|
13
|
+
> If you want to contribute, feel free to reach out to me to coordinate efforts: hey at vicente.services
|
|
14
|
+
>
|
|
15
|
+
> And, yes, this is 100% a legit project. :)
|
|
16
|
+
|
|
17
|
+
|
|
8
18
|
**Build reliable LLM applications in idiomatic Ruby using composable, type-safe modules.**
|
|
9
19
|
|
|
10
|
-
The Ruby framework for programming with large language models. DSPy.rb brings structured LLM programming to Ruby developers
|
|
20
|
+
The Ruby framework for programming with large language models. DSPy.rb brings structured LLM programming, programmatic Prompt Engineering, and Context Engineering to Ruby developers.
|
|
21
|
+
Instead of wrestling with prompt strings and parsing responses, you define typed signatures in idiomatic Ruby to compose and decompose AI Workflows and AI Agents.
|
|
11
22
|
|
|
12
23
|
**Prompts are just Functions.** Traditional prompting is like writing code with string concatenation: it works until it doesn't. DSPy.rb brings you
|
|
13
24
|
the programming approach pioneered by [dspy.ai](https://dspy.ai/): instead of crafting fragile prompts, you define modular
|
|
14
25
|
signatures and let the framework handle the messy details.
|
|
15
26
|
|
|
16
27
|
DSPy.rb is an idiomatic Ruby surgical port of Stanford's [DSPy framework](https://github.com/stanfordnlp/dspy). While implementing
|
|
17
|
-
the core concepts of signatures, predictors, and optimization from the original Python library, DSPy.rb embraces Ruby
|
|
18
|
-
conventions and adds Ruby-specific innovations like
|
|
28
|
+
the core concepts of signatures, predictors, and the main optimization algorithms from the original Python library, DSPy.rb embraces Ruby
|
|
29
|
+
conventions and adds Ruby-specific innovations like a Sorbet-based type system, ReAct loops, and production-ready integrations such as non-blocking OpenTelemetry instrumentation.
|
|
19
30
|
|
|
20
|
-
|
|
31
|
+
**What do you get?** Ruby LLM applications that actually scale and don't break when you sneeze.
|
|
32
|
+
|
|
33
|
+
Check the [examples](examples/) and take them for a spin!
|
|
21
34
|
|
|
22
35
|
## Your First DSPy Program
|
|
36
|
+
### Installation
|
|
37
|
+
|
|
38
|
+
Add to your Gemfile:
|
|
39
|
+
|
|
40
|
+
```ruby
|
|
41
|
+
gem 'dspy'
|
|
42
|
+
```
|
|
43
|
+
|
|
44
|
+
Then run:
|
|
45
|
+
|
|
46
|
+
```bash
|
|
47
|
+
bundle install
|
|
48
|
+
```
|
|
49
|
+
|
|
50
|
+
### Optional Sibling Gems
|
|
51
|
+
|
|
52
|
+
DSPy.rb ships multiple gems from this monorepo so you only install what you need. Add these alongside `dspy`:
|
|
53
|
+
|
|
54
|
+
| Gem | Description | Status |
|
|
55
|
+
| --- | --- | --- |
|
|
56
|
+
| `dspy-schema` | Exposes `DSPy::TypeSystem::SorbetJsonSchema` for downstream reuse. | **Stable** (v1.0.0) |
|
|
57
|
+
| `dspy-code_act` | Think-Code-Observe agents that synthesize and execute Ruby safely. | Preview (0.x) |
|
|
58
|
+
| `dspy-datasets` | Dataset helpers plus Parquet/Polars tooling for richer evaluation corpora. | Preview (0.x) |
|
|
59
|
+
| `dspy-evals` | High-throughput evaluation harness with metrics, callbacks, and regression fixtures. | Preview (0.x) |
|
|
60
|
+
| `dspy-miprov2` | Bayesian optimization + Gaussian Process backend for the MIPROv2 teleprompter. | Preview (0.x) |
|
|
61
|
+
| `dspy-gepa` | `DSPy::Teleprompt::GEPA`, reflection loops, experiment tracking, telemetry adapters. | Preview (mirrors `dspy` version) |
|
|
62
|
+
| `gepa` | GEPA optimizer core (Pareto engine, telemetry, reflective proposer). | Preview (mirrors `dspy` version) |
|
|
63
|
+
| `dspy-o11y` | Core observability APIs: `DSPy::Observability`, async span processor, observation types. | **Stable** (v1.0.0) |
|
|
64
|
+
| `dspy-o11y-langfuse` | Auto-configures DSPy observability to stream spans to Langfuse via OTLP. | **Stable** (v1.0.0) |
|
|
65
|
+
|
|
66
|
+
Set the matching `DSPY_WITH_*` environment variables (see `Gemfile`) to include or exclude each sibling gem when running Bundler locally (for example `DSPY_WITH_GEPA=1` or `DSPY_WITH_O11Y_LANGFUSE=1`). Refer to `docs/core-concepts/dependency-tree.md` for the full dependency map and roadmap.
|
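For example, a hypothetical local Bundler invocation opting in to two of the sibling gems listed above (the `DSPY_WITH_*` variable names come from the project's `Gemfile`; the exact combination here is illustrative):

```shell
# Opt in to the GEPA optimizer and Langfuse observability gems when bundling locally
DSPY_WITH_GEPA=1 DSPY_WITH_O11Y_LANGFUSE=1 bundle install
```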
|
67
|
+
### Your First Reliable Predictor
|
|
23
68
|
|
|
24
69
|
```ruby
|
|
25
|
-
|
|
70
|
+
|
|
71
|
+
# Configure DSPy globally to use your favorite LLM - you can override this at the instance level.
|
|
72
|
+
DSPy.configure do |c|
|
|
73
|
+
c.lm = DSPy::LM.new('openai/gpt-4o-mini',
|
|
74
|
+
api_key: ENV['OPENAI_API_KEY'],
|
|
75
|
+
structured_outputs: true) # Enable OpenAI's native JSON mode
|
|
76
|
+
end
|
|
77
|
+
|
|
78
|
+
# Define a signature for sentiment classification - instead of writing a full prompt!
|
|
26
79
|
class Classify < DSPy::Signature
|
|
27
|
-
description "Classify sentiment of a given sentence."
|
|
80
|
+
description "Classify sentiment of a given sentence." # sets the goal of the underlying prompt
|
|
28
81
|
|
|
29
82
|
class Sentiment < T::Enum
|
|
30
83
|
enums do
|
|
@@ -33,26 +86,22 @@ class Classify < DSPy::Signature
|
|
|
33
86
|
Neutral = new('neutral')
|
|
34
87
|
end
|
|
35
88
|
end
|
|
36
|
-
|
|
89
|
+
|
|
90
|
+
# Structured Inputs: makes sure you are sending only valid prompt inputs to your model
|
|
37
91
|
input do
|
|
38
|
-
const :sentence, String
|
|
92
|
+
const :sentence, String, description: 'The sentence to analyze'
|
|
39
93
|
end
|
|
40
94
|
|
|
95
|
+
# Structured Outputs: your predictor will validate the output of the model too.
|
|
41
96
|
output do
|
|
42
|
-
const :sentiment, Sentiment
|
|
43
|
-
const :confidence, Float
|
|
97
|
+
const :sentiment, Sentiment, description: 'The sentiment of the sentence'
|
|
98
|
+
const :confidence, Float, description: 'A number between 0.0 and 1.0'
|
|
44
99
|
end
|
|
45
100
|
end
|
|
46
101
|
|
|
47
|
-
#
|
|
48
|
-
DSPy.configure do |c|
|
|
49
|
-
c.lm = DSPy::LM.new('openai/gpt-4o-mini',
|
|
50
|
-
api_key: ENV['OPENAI_API_KEY'],
|
|
51
|
-
structured_outputs: true) # Enable OpenAI's native JSON mode
|
|
52
|
-
end
|
|
53
|
-
|
|
54
|
-
# Create the predictor and run inference
|
|
102
|
+
# Wire it to the simplest prompting technique - a Predict.
|
|
55
103
|
classify = DSPy::Predict.new(Classify)
|
|
104
|
+
# It may raise an error if you mess up the inputs or your LLM messes up the outputs.
|
|
56
105
|
result = classify.call(sentence: "This book was super fun to read!")
|
|
57
106
|
|
|
58
107
|
puts result.sentiment # => #<Sentiment::Positive>
|
|
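The structured-output guarantees described above can be sketched in plain Ruby. This is a hypothetical simplification of what a typed signature checks for you, not the DSPy.rb API (`validate_output!` and `VALID_SENTIMENTS` are illustrative names):

```ruby
# Plain-Ruby sketch of the validation a typed signature performs (hypothetical, not DSPy's API).
VALID_SENTIMENTS = %w[positive negative neutral].freeze

def validate_output!(sentiment:, confidence:)
  # Reject values outside the declared enum, like Sentiment does.
  raise ArgumentError, "unknown sentiment: #{sentiment}" unless VALID_SENTIMENTS.include?(sentiment)
  # Reject confidences outside the documented 0.0..1.0 range.
  raise ArgumentError, 'confidence must be within 0.0..1.0' unless (0.0..1.0).cover?(confidence)

  { sentiment: sentiment, confidence: confidence }
end
```

With a typed signature, this kind of checking happens on every call instead of being scattered through your parsing code.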
@@ -99,12 +148,22 @@ end
|
|
|
99
148
|
|
|
100
149
|
## What You Get
|
|
101
150
|
|
|
151
|
+
**Developer Experience:**
|
|
152
|
+
- LLM provider support using official Ruby clients:
|
|
153
|
+
- [OpenAI Ruby](https://github.com/openai/openai-ruby) with vision model support
|
|
154
|
+
- [Anthropic Ruby SDK](https://github.com/anthropics/anthropic-sdk-ruby) with multimodal capabilities
|
|
155
|
+
- [Google Gemini API](https://ai.google.dev/) with native structured outputs
|
|
156
|
+
- [Ollama](https://ollama.com/) via OpenAI compatibility layer for local models
|
|
157
|
+
- **Multimodal Support** - Complete image analysis with DSPy::Image, type-safe bounding boxes, vision-capable models
|
|
158
|
+
- Runtime type checking with [Sorbet](https://sorbet.org/) including T::Enum and union types
|
|
159
|
+
- Type-safe tool definitions for ReAct agents
|
|
160
|
+
- Comprehensive instrumentation and observability
|
|
161
|
+
|
|
102
162
|
**Core Building Blocks:**
|
|
103
163
|
- **Signatures** - Define input/output schemas using Sorbet types with T::Enum and union type support
|
|
104
164
|
- **Predict** - LLM completion with structured data extraction and multimodal support
|
|
105
165
|
- **Chain of Thought** - Step-by-step reasoning for complex problems with automatic prompt optimization
|
|
106
166
|
- **ReAct** - Tool-using agents with type-safe tool definitions and error recovery
|
|
107
|
-
- **CodeAct** - Dynamic code execution agents for programming tasks
|
|
108
167
|
- **Module Composition** - Combine multiple LLM calls into production-ready workflows
|
|
109
168
|
|
|
110
169
|
**Optimization & Evaluation:**
|
|
@@ -122,24 +181,40 @@ end
|
|
|
122
181
|
- **File-based Storage** - Optimization result persistence with versioning
|
|
123
182
|
- **Structured Logging** - JSON and key=value formats with span tracking
|
|
124
183
|
|
|
125
|
-
|
|
126
|
-
- LLM provider support using official Ruby clients:
|
|
127
|
-
- [OpenAI Ruby](https://github.com/openai/openai-ruby) with vision model support
|
|
128
|
-
- [Anthropic Ruby SDK](https://github.com/anthropics/anthropic-sdk-ruby) with multimodal capabilities
|
|
129
|
-
- [Google Gemini API](https://ai.google.dev/) with native structured outputs
|
|
130
|
-
- [Ollama](https://ollama.com/) via OpenAI compatibility layer for local models
|
|
131
|
-
- **Multimodal Support** - Complete image analysis with DSPy::Image, type-safe bounding boxes, vision-capable models
|
|
132
|
-
- Runtime type checking with [Sorbet](https://sorbet.org/) including T::Enum and union types
|
|
133
|
-
- Type-safe tool definitions for ReAct agents
|
|
134
|
-
- Comprehensive instrumentation and observability
|
|
184
|
+
## Recent Achievements
|
|
135
185
|
|
|
136
|
-
|
|
186
|
+
DSPy.rb has rapidly evolved from experimental to production-ready:
|
|
137
187
|
|
|
138
|
-
|
|
139
|
-
|
|
140
|
-
|
|
188
|
+
### Foundation
|
|
189
|
+
- ✅ **JSON Parsing Reliability** - Native OpenAI structured outputs with adaptive retry logic and schema-aware fallbacks
|
|
190
|
+
- ✅ **Type-Safe Strategy Configuration** - Provider-optimized strategy selection and enum-backed optimizer presets
|
|
191
|
+
- ✅ **Core Module System** - Predict, ChainOfThought, ReAct with type safety (add `dspy-code_act` for Think-Code-Observe agents)
|
|
192
|
+
- ✅ **Production Observability** - OpenTelemetry, New Relic, and Langfuse integration
|
|
193
|
+
- ✅ **Advanced Optimization** - MIPROv2 with Bayesian optimization, Gaussian Processes, and multi-mode search
|
|
194
|
+
|
|
195
|
+
### Recent Advances
|
|
196
|
+
- ✅ **MIPROv2 ADE Integrity (v0.29.1)** - Stratified train/val/test splits, honest precision accounting, and enum-driven `--auto` presets with integration coverage
|
|
197
|
+
- ✅ **Instruction Deduplication (v0.29.1)** - Candidate generation now filters repeated programs so optimization logs highlight unique strategies
|
|
198
|
+
- ✅ **GEPA Teleprompter (v0.29.0)** - Genetic-Pareto reflective prompt evolution with merge proposer scheduling, reflective mutation, and ADE demo parity
|
|
199
|
+
- ✅ **Optimizer Utilities Parity (v0.29.0)** - Bootstrap strategies, dataset summaries, and Layer 3 utilities unlock multi-predictor programs on Ruby
|
|
200
|
+
- ✅ **Observability Hardening (v0.29.0)** - OTLP exporter runs on a single-thread executor preventing frozen SSL contexts without blocking spans
|
|
201
|
+
- ✅ **Documentation Refresh (v0.29.x)** - New GEPA guide plus ADE optimization docs covering presets, stratified splits, and error-handling defaults
|
|
202
|
+
|
|
203
|
+
**Current Focus Areas:**
|
|
204
|
+
|
|
205
|
+
### Production Readiness
|
|
206
|
+
- 🚧 **Production Patterns** - Real-world usage validation and performance optimization
|
|
207
|
+
- 🚧 **Ruby Ecosystem Integration** - Rails integration, Sidekiq compatibility, deployment patterns
|
|
208
|
+
|
|
209
|
+
### Community & Adoption
|
|
210
|
+
- 🚧 **Community Examples** - Real-world applications and case studies
|
|
211
|
+
- 🚧 **Contributor Experience** - Making it easier to contribute and extend
|
|
212
|
+
- 🚧 **Performance Benchmarks** - Comparative analysis vs other frameworks
|
|
213
|
+
|
|
214
|
+
**v1.0 Philosophy:**
|
|
215
|
+
v1.0 will be released after extensive production battle-testing, not after checking off features.
|
|
216
|
+
The API is already stable - v1.0 represents confidence in production reliability backed by real-world validation.
|
|
141
217
|
|
|
142
|
-
Real-world usage feedback is invaluable - if you encounter issues or have suggestions, please open a GitHub issue!
|
|
143
218
|
|
|
144
219
|
## Documentation
|
|
145
220
|
|
|
@@ -156,92 +231,37 @@ For LLMs and AI assistants working with DSPy.rb:
|
|
|
156
231
|
- **[Quick Start Guide](docs/src/getting-started/quick-start.md)** - Your first DSPy programs
|
|
157
232
|
- **[Core Concepts](docs/src/getting-started/core-concepts.md)** - Understanding signatures, predictors, and modules
|
|
158
233
|
|
|
159
|
-
###
|
|
234
|
+
### Prompt Engineering
|
|
160
235
|
- **[Signatures & Types](docs/src/core-concepts/signatures.md)** - Define typed interfaces for LLM operations
|
|
161
236
|
- **[Predictors](docs/src/core-concepts/predictors.md)** - Predict, ChainOfThought, ReAct, and more
|
|
162
237
|
- **[Modules & Pipelines](docs/src/core-concepts/modules.md)** - Compose complex multi-stage workflows
|
|
163
238
|
- **[Multimodal Support](docs/src/core-concepts/multimodal.md)** - Image analysis with vision-capable models
|
|
164
239
|
- **[Examples & Validation](docs/src/core-concepts/examples.md)** - Type-safe training data
|
|
240
|
+
- **[Rich Types](docs/src/advanced/complex-types.md)** - Sorbet type integration with automatic coercion for structs, enums, and arrays
|
|
241
|
+
- **[Composable Pipelines](docs/src/advanced/pipelines.md)** - Manual module composition patterns
|
|
165
242
|
|
|
166
|
-
### Optimization
|
|
243
|
+
### Prompt Optimization
|
|
167
244
|
- **[Evaluation Framework](docs/src/optimization/evaluation.md)** - Advanced metrics beyond simple accuracy
|
|
168
245
|
- **[Prompt Optimization](docs/src/optimization/prompt-optimization.md)** - Manipulate prompts as objects
|
|
169
246
|
- **[MIPROv2 Optimizer](docs/src/optimization/miprov2.md)** - Advanced Bayesian optimization with Gaussian Processes
|
|
170
247
|
- **[GEPA Optimizer](docs/src/optimization/gepa.md)** *(beta)* - Reflective mutation with optional reflection LMs
|
|
171
248
|
|
|
172
|
-
###
|
|
173
|
-
- **[
|
|
174
|
-
- **[
|
|
175
|
-
|
|
176
|
-
### Advanced Usage
|
|
177
|
-
- **[Complex Types](docs/src/advanced/complex-types.md)** - Sorbet type integration with automatic coercion for structs, enums, and arrays
|
|
178
|
-
- **[Manual Pipelines](docs/src/advanced/pipelines.md)** - Manual module composition patterns
|
|
249
|
+
### Context Engineering
|
|
250
|
+
- **[Tools](docs/src/core-concepts/toolsets.md)** - Tool-wielding agents
|
|
251
|
+
- **[Agentic Memory](docs/src/core-concepts/memory.md)** - Memory Tools & Agentic Loops
|
|
179
252
|
- **[RAG Patterns](docs/src/advanced/rag.md)** - Manual RAG implementation with external services
|
|
180
|
-
- **[Custom Metrics](docs/src/advanced/custom-metrics.md)** - Proc-based evaluation logic
|
|
181
|
-
|
|
182
|
-
## Quick Start
|
|
183
|
-
|
|
184
|
-
### Installation
|
|
185
|
-
|
|
186
|
-
Add to your Gemfile:
|
|
187
|
-
|
|
188
|
-
```ruby
|
|
189
|
-
gem 'dspy'
|
|
190
|
-
```
|
|
191
|
-
|
|
192
|
-
Then run:
|
|
193
|
-
|
|
194
|
-
```bash
|
|
195
|
-
bundle install
|
|
196
|
-
```
|
|
197
|
-
|
|
198
|
-
## Recent Achievements
|
|
199
|
-
|
|
200
|
-
DSPy.rb has rapidly evolved from experimental to production-ready:
|
|
201
|
-
|
|
202
|
-
### Foundation
|
|
203
|
-
- ✅ **JSON Parsing Reliability** - Native OpenAI structured outputs, strategy selection, retry logic
|
|
204
|
-
- ✅ **Type-Safe Strategy Configuration** - Provider-optimized automatic strategy selection
|
|
205
|
-
- ✅ **Core Module System** - Predict, ChainOfThought, ReAct, CodeAct with type safety
|
|
206
|
-
- ✅ **Production Observability** - OpenTelemetry, New Relic, and Langfuse integration
|
|
207
|
-
- ✅ **Advanced Optimization** - MIPROv2 with Bayesian optimization, Gaussian Processes, and multiple strategies
|
|
208
253
|
|
|
209
|
-
###
|
|
210
|
-
-
|
|
211
|
-
-
|
|
212
|
-
-
|
|
213
|
-
- ✅ **Production-Ready Evaluation** - Multi-factor metrics beyond accuracy, error-resilient evaluation pipelines
|
|
214
|
-
- ✅ **Documentation Ecosystem** - `llms.txt` for AI assistants, ADRs, blog articles, comprehensive examples
|
|
215
|
-
- ✅ **API Maturation** - Simplified idiomatic patterns, better error handling, production-proven designs
|
|
254
|
+
### Production Features
|
|
255
|
+
- **[Observability](docs/src/production/observability.md)** - Zero-config Langfuse integration with a dedicated export worker that never blocks your LLMs
|
|
256
|
+
- **[Storage System](docs/src/production/storage.md)** - Persistence and optimization result storage
|
|
257
|
+
- **[Custom Metrics](docs/src/advanced/custom-metrics.md)** - Proc-based evaluation logic
|
|
216
258
|
|
|
217
|
-
## Roadmap - Production Battle-Testing Toward v1.0
|
|
218
259
|
|
|
219
|
-
DSPy.rb has transitioned from **feature building** to **production validation**. The core framework is
|
|
220
|
-
feature-complete and stable - now I'm focusing on real-world usage patterns, performance optimization,
|
|
221
|
-
and ecosystem integration.
|
|
222
260
|
|
|
223
|
-
**Current Focus Areas:**
|
|
224
261
|
|
|
225
|
-
### Production Readiness
|
|
226
|
-
- 🚧 **Production Patterns** - Real-world usage validation and performance optimization
|
|
227
|
-
- 🚧 **Ruby Ecosystem Integration** - Rails integration, Sidekiq compatibility, deployment patterns
|
|
228
|
-
- 🚧 **Scale Testing** - High-volume usage, memory management, connection pooling
|
|
229
|
-
- 🚧 **Error Recovery** - Robust failure handling patterns for production environments
|
|
230
262
|
|
|
231
|
-
### Ecosystem Expansion
|
|
232
|
-
- 🚧 **Model Context Protocol (MCP)** - Integration with MCP ecosystem
|
|
233
|
-
- 🚧 **Additional Provider Support** - Azure OpenAI, local models beyond Ollama
|
|
234
|
-
- 🚧 **Tool Ecosystem** - Expanded tool integrations for ReAct agents
|
|
235
263
|
|
|
236
|
-
### Community & Adoption
|
|
237
|
-
- 🚧 **Community Examples** - Real-world applications and case studies
|
|
238
|
-
- 🚧 **Contributor Experience** - Making it easier to contribute and extend
|
|
239
|
-
- 🚧 **Performance Benchmarks** - Comparative analysis vs other frameworks
|
|
240
264
|
|
|
241
|
-
**v1.0 Philosophy:**
|
|
242
|
-
v1.0 will be released after extensive production battle-testing, not after checking off features.
|
|
243
|
-
The API is already stable - v1.0 represents confidence in production reliability backed by real-world validation.
|
|
244
265
|
|
|
245
266
|
## License
|
|
246
|
-
|
|
247
267
|
This project is licensed under the MIT License.
|
|
@@ -0,0 +1,182 @@
|
|
|
1
|
+
# frozen_string_literal: true
|
|
2
|
+
|
|
3
|
+
require 'set'
|
|
4
|
+
require_relative 'info'
|
|
5
|
+
require_relative 'loaders'
|
|
6
|
+
|
|
7
|
+
module DSPy
|
|
8
|
+
module Datasets
|
|
9
|
+
# Ruby implementation of the HotPotQA dataset loader backed by Hugging Face parquet files.
|
|
10
|
+
# Provides convenience helpers to create train/dev/test splits matching the Python DSPy defaults.
|
|
11
|
+
class HotPotQA
|
|
12
|
+
DATASET_INFO = DatasetInfo.new(
|
|
13
|
+
id: 'hotpotqa/hotpot_qa/fullwiki',
|
|
14
|
+
name: 'HotPotQA (FullWiki)',
|
|
15
|
+
provider: 'huggingface',
|
|
16
|
+
splits: %w[train validation],
|
|
17
|
+
features: {
|
|
18
|
+
'id' => { 'type' => 'string' },
|
|
19
|
+
'question' => { 'type' => 'string' },
|
|
20
|
+
'answer' => { 'type' => 'string' },
|
|
21
|
+
'level' => { 'type' => 'string' },
|
|
22
|
+
'type' => { 'type' => 'string' },
|
|
23
|
+
'supporting_facts' => { 'type' => 'list' },
|
|
24
|
+
'context' => { 'type' => 'list' }
|
|
25
|
+
},
|
|
26
|
+
loader: :huggingface_parquet,
|
|
27
|
+
loader_options: {
|
|
28
|
+
dataset: ['hotpotqa/hotpot_qa', 'hotpot_qa'],
|
|
29
|
+
config: 'fullwiki'
|
|
30
|
+
},
|
|
31
|
+
metadata: {
|
|
32
|
+
description: 'HotPotQA FullWiki split filtered to hard examples. Train split is further divided into train/dev (75/25) matching Python DSPy defaults. Supports dataset rename on Hugging Face.',
|
|
33
|
+
homepage: 'https://huggingface.co/datasets/hotpot_qa',
|
|
34
|
+
approx_row_count: 112_000
|
|
35
|
+
}
|
|
36
|
+
).freeze
|
|
37
|
+
|
|
38
|
+
DEFAULT_KEEP_DETAILS = :dev_titles
|
|
39
|
+
|
|
40
|
+
attr_reader :train_size, :dev_size, :test_size
|
|
41
|
+
|
|
42
|
+
def initialize(
|
|
43
|
+
only_hard_examples: true,
|
|
44
|
+
keep_details: DEFAULT_KEEP_DETAILS,
|
|
45
|
+
unofficial_dev: true,
|
|
46
|
+
train_seed: 0,
|
|
47
|
+
train_size: nil,
|
|
48
|
+
dev_size: nil,
|
|
49
|
+
test_size: nil,
|
|
50
|
+
cache_dir: nil
|
|
51
|
+
)
|
|
52
|
+
raise ArgumentError, 'only_hard_examples must be true' unless only_hard_examples
|
|
53
|
+
|
|
54
|
+
@keep_details = keep_details
|
|
55
|
+
@unofficial_dev = unofficial_dev
|
|
56
|
+
@train_seed = train_seed
|
|
57
|
+
@train_size = train_size
|
|
58
|
+
@dev_size = dev_size
|
|
59
|
+
@test_size = test_size
|
|
60
|
+
@cache_dir = cache_dir
|
|
61
|
+
@loaded = false
|
|
62
|
+
end
|
|
63
|
+
|
|
64
|
+
def train
|
|
65
|
+
ensure_loaded
|
|
66
|
+
subset(@train_examples, train_size)
|
|
67
|
+
end
|
|
68
|
+
|
|
69
|
+
def dev
|
|
70
|
+
ensure_loaded
|
|
71
|
+
subset(@dev_examples, dev_size)
|
|
72
|
+
end
|
|
73
|
+
|
|
74
|
+
def test
|
|
75
|
+
ensure_loaded
|
|
76
|
+
subset(@test_examples, test_size)
|
|
77
|
+
end
|
|
78
|
+
|
|
79
|
+
def context_lookup
|
|
80
|
+
ensure_loaded
|
|
81
|
+
@context_lookup ||= begin
|
|
82
|
+
all_examples = @train_examples + @dev_examples + @test_examples
|
|
83
|
+
all_examples.each_with_object({}) do |example, memo|
|
|
84
|
+
memo[example[:question]] = example[:context] || []
|
|
85
|
+
end
|
|
86
|
+
end
|
|
87
|
+
end
|
|
88
|
+
|
|
89
|
+
private
|
|
90
|
+
|
|
91
|
+
attr_reader :keep_details, :unofficial_dev, :train_seed, :cache_dir
|
|
92
|
+
|
|
93
|
+
def ensure_loaded
|
|
94
|
+
return if @loaded
|
|
95
|
+
|
|
96
|
+
load_data
|
|
97
|
+
@loaded = true
|
|
98
|
+
end
|
|
99
|
+
|
|
100
|
+
def subset(examples, limit)
|
|
101
|
+
return examples unless limit
|
|
102
|
+
|
|
103
|
+
examples.first(limit)
|
|
104
|
+
end
|
|
105
|
+
|
|
106
|
+
def load_data
|
|
107
|
+
train_rows = collect_rows(split: 'train')
|
|
108
|
+
shuffled = train_rows.shuffle(random: Random.new(train_seed))
|
|
109
|
+
split_point = (shuffled.length * 0.75).floor
|
|
110
|
+
|
|
111
|
+
@train_examples = shuffled.first(split_point)
|
|
112
|
+
@dev_examples = unofficial_dev ? shuffled.drop(split_point) : []
|
|
113
|
+
|
|
114
|
+
if keep_details == DEFAULT_KEEP_DETAILS
|
|
115
|
+
@train_examples.each { |example| example.delete(:gold_titles) }
|
|
116
|
+
end
|
|
117
|
+
|
|
118
|
+
@test_examples = collect_rows(split: 'validation')
|
|
119
|
+
end
|
|
120
|
+
|
|
121
|
+
def collect_rows(split:)
|
|
122
|
+
loader = Loaders.build(DATASET_INFO, split: split, cache_dir: cache_dir)
|
|
123
|
+
examples = []
|
|
124
|
+
|
|
125
|
+
loader.each_row do |row|
|
|
126
|
+
next unless row['level'] == 'hard'
|
|
127
|
+
|
|
128
|
+
examples << transform_row(row)
|
|
129
|
+
end
|
|
130
|
+
|
|
131
|
+
examples
|
|
132
|
+
end
|
|
133
|
+
|
|
134
|
+
def transform_row(row)
|
|
135
|
+
example = {
|
|
136
|
+
id: row['id'],
|
|
137
|
+
question: row['question'],
|
|
138
|
+
answer: row['answer'],
|
|
139
|
+
type: row['type'],
|
|
140
|
+
context: normalize_context(row['context']),
|
|
141
|
+
gold_titles: extract_gold_titles(row['supporting_facts'])
|
|
142
|
+
}
|
|
143
|
+
|
|
144
|
+
example.delete(:context) unless example[:context]&.any?
|
|
145
|
+
example.delete(:gold_titles) if example[:gold_titles].empty?
|
|
146
|
+
example
|
|
147
|
+
end
|
|
148
|
+
|
|
149
|
+
def normalize_context(raw_context)
|
|
150
|
+
return [] unless raw_context.respond_to?(:map)
|
|
151
|
+
|
|
152
|
+
raw_context.map do |pair|
|
|
153
|
+
if pair.is_a?(Array) && pair.size == 2
|
|
154
|
+
title, sentences = pair
|
|
155
|
+
sentences_text = if sentences.is_a?(Array)
|
|
156
|
+
sentences.join(' ')
|
|
157
|
+
else
|
|
158
|
+
sentences.to_s
|
|
159
|
+
end
|
|
160
|
+
"#{title}: #{sentences_text}".strip
|
|
161
|
+
else
|
|
162
|
+
pair.to_s
|
|
163
|
+
end
|
|
164
|
+
end
|
|
165
|
+
end
|
|
166
|
+
|
|
167
|
+
def extract_gold_titles(supporting_facts)
|
|
168
|
+
case supporting_facts
|
|
169
|
+
when Hash
|
|
170
|
+
titles = supporting_facts['title'] || supporting_facts[:title]
|
|
171
|
+
Array(titles).to_set
|
|
172
|
+
when Array
|
|
173
|
+
supporting_facts.each_with_object(Set.new) do |fact, memo|
|
|
174
|
+
memo << (fact.is_a?(Array) ? fact[0] : fact)
|
|
175
|
+
end
|
|
176
|
+
else
|
|
177
|
+
Set.new
|
|
178
|
+
end
|
|
179
|
+
end
|
|
180
|
+
end
|
|
181
|
+
end
|
|
182
|
+
end
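`extract_gold_titles` above normalizes two Hugging Face layouts (a columnar hash of titles vs. an array of `[title, sentence_id]` pairs) into a `Set`. A standalone sketch of that normalization (the `gold_titles` helper name is illustrative):

```ruby
require 'set'

# Normalize supporting facts into a Set of titles, mirroring HotPotQA#extract_gold_titles.
def gold_titles(supporting_facts)
  case supporting_facts
  when Hash  then Array(supporting_facts['title']).to_set          # columnar layout
  when Array then supporting_facts.map { |fact| fact.is_a?(Array) ? fact[0] : fact }.to_set
  else Set.new                                                     # missing or unexpected shape
  end
end
```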
|
|
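The train/dev split in `load_data` above can be exercised in isolation. This sketch uses toy rows to show the deterministic, seeded 75/25 behavior (the `rows` data is made up for illustration):

```ruby
# Deterministic 75/25 split, mirroring HotPotQA#load_data's approach.
rows = (1..8).map { |i| { id: i } }

shuffled = rows.shuffle(random: Random.new(0)) # seeded shuffle => reproducible ordering
split_point = (shuffled.length * 0.75).floor   # first 75% becomes train

train = shuffled.first(split_point)
dev   = shuffled.drop(split_point)
```

Because the shuffle is seeded with `train_seed`, repeated runs produce the same train/dev membership, which keeps optimization experiments comparable.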
@@ -52,24 +52,20 @@ module DSPy
|
|
|
52
52
|
|
|
53
53
|
def parquet_files
|
|
54
54
|
@parquet_files ||= begin
|
|
55
|
-
|
|
56
|
-
|
|
57
|
-
|
|
58
|
-
|
|
59
|
-
|
|
60
|
-
|
|
61
|
-
|
|
62
|
-
|
|
63
|
-
|
|
64
|
-
|
|
65
|
-
|
|
55
|
+
datasets = Array(info.loader_options.fetch(:dataset))
|
|
56
|
+
last_error = nil
|
|
57
|
+
|
|
58
|
+
datasets.each do |dataset_name|
|
|
59
|
+
begin
|
|
60
|
+
files = fetch_parquet_files(dataset_name)
|
|
61
|
+
return files unless files.empty?
|
|
62
|
+
last_error = DatasetError.new("No parquet files available for #{dataset_name} (#{split})")
|
|
63
|
+
rescue DatasetError => e
|
|
64
|
+
last_error = e
|
|
65
|
+
end
|
|
66
66
|
end
|
|
67
67
|
|
|
68
|
-
|
|
69
|
-
files = body.fetch('parquet_files', [])
|
|
70
|
-
raise DatasetError, "No parquet files available for #{info.id} (#{split})" if files.empty?
|
|
71
|
-
|
|
72
|
-
files
|
|
68
|
+
raise(last_error || DatasetError.new("Failed to fetch parquet manifest for #{info.id} (#{split})"))
|
|
73
69
|
end
|
|
74
70
|
end
|
|
75
71
|
|
|
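The loop above tries each dataset alias in turn (the dataset was renamed on Hugging Face) and re-raises the last failure only when every alias is exhausted. A self-contained sketch of that pattern, with a stubbed `fetch_parquet_files` standing in for the real HTTP call:

```ruby
# "Try each alias, remember the last error" pattern from #parquet_files.
DatasetError = Class.new(StandardError)

def fetch_parquet_files(dataset_name)
  # Stub for illustration: pretend only the renamed dataset resolves.
  raise DatasetError, "404 for #{dataset_name}" unless dataset_name == 'hotpotqa/hotpot_qa'

  ["#{dataset_name}/train-00000.parquet"]
end

def parquet_files(aliases)
  last_error = nil
  aliases.each do |name|
    files = fetch_parquet_files(name)
    return files unless files.empty?

    last_error = DatasetError.new("No parquet files available for #{name}")
  rescue DatasetError => e
    last_error = e # keep trying the remaining aliases
  end
  raise(last_error || DatasetError.new('Failed to fetch parquet manifest'))
end
```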
@@ -82,6 +78,24 @@ module DSPy
|
|
|
82
78
|
path
|
|
83
79
|
end
|
|
84
80
|
|
|
81
|
+
def fetch_parquet_files(dataset_name)
|
|
82
|
+
uri = URI("#{BASE_URL}/parquet")
|
|
83
|
+
params = {
|
|
84
|
+
dataset: dataset_name,
|
|
85
|
+
config: info.loader_options.fetch(:config),
|
|
86
|
+
split: split
|
|
87
|
+
}
|
|
88
|
+
uri.query = URI.encode_www_form(params)
|
|
89
|
+
|
|
90
|
+
response = http_get(uri)
|
|
91
|
+
unless response.is_a?(Net::HTTPSuccess)
|
|
92
|
+
raise DatasetError, "Failed to fetch parquet manifest: #{response.code}"
|
|
93
|
+
end
|
|
94
|
+
|
|
95
|
+
body = JSON.parse(response.body)
|
|
96
|
+
body.fetch('parquet_files', [])
|
|
97
|
+
end
|
|
98
|
+
|
|
85
99
|
def cache_dir
|
|
86
100
|
@cache_dir ||= File.join(cache_root, split)
|
|
87
101
|
end
|
|
@@ -102,31 +116,50 @@ module DSPy
|
|
|
102
116
|
end
|
|
103
117
|
|
|
104
118
|
def http_get(uri)
|
|
105
|
-
|
|
106
|
-
request = Net::HTTP::Get.new(uri)
|
|
107
|
-
http.request(request)
|
|
108
|
-
end
|
|
119
|
+
perform_request_with_redirects(uri)
|
|
109
120
|
end
|
|
110
121
|
|
|
111
122
|
def download_file(url, destination)
|
|
112
|
-
|
|
123
|
+
fetch_with_redirects(URI(url)) do |response|
|
|
124
|
+
File.binwrite(destination, response.body)
|
|
125
|
+
end
|
|
126
|
+
rescue => e
|
|
127
|
+
File.delete(destination) if File.exist?(destination)
|
|
128
|
+
raise
|
|
129
|
+
end
|
|
130
|
+
|
|
131
|
+
MAX_REDIRECTS = 5
|
|
132
|
+
|
|
133
|
+
def perform_request_with_redirects(uri, limit = MAX_REDIRECTS)
|
|
134
|
+
raise DownloadError, 'Too many HTTP redirects' if limit <= 0
|
|
135
|
+
|
|
113
136
|
Net::HTTP.start(uri.host, uri.port, use_ssl: uri.scheme == 'https') do |http|
|
|
114
137
|
request = Net::HTTP::Get.new(uri)
|
|
115
|
-
http.request(request)
|
|
116
|
-
unless response.is_a?(Net::HTTPSuccess)
|
|
117
|
-
raise DownloadError, "Failed to download parquet file: #{response.code}"
|
|
118
|
-
end
|
|
138
|
+
response = http.request(request)
|
|
119
139
|
|
|
120
|
-
|
|
121
|
-
|
|
122
|
-
|
|
123
|
-
|
|
124
|
-
|
|
140
|
+
if response.is_a?(Net::HTTPRedirection)
|
|
141
|
+
location = response['location']
|
|
142
|
+
raise DownloadError, 'Redirect without location header' unless location
|
|
143
|
+
|
|
144
|
+
new_uri = URI(location)
|
|
145
|
+
new_uri = uri + location if new_uri.relative?
|
|
146
|
+
return perform_request_with_redirects(new_uri, limit - 1)
|
|
125
147
|
end
|
|
148
|
+
|
|
149
|
+
response
|
|
126
150
|
end
|
|
127
|
-
|
|
128
|
-
|
|
129
|
-
|
|
151
|
+
end
|
|
152
|
+
|
|
153
|
+
def fetch_with_redirects(uri, limit = MAX_REDIRECTS, &block)
|
|
154
|
+
response = perform_request_with_redirects(uri, limit)
|
|
155
|
+
|
|
156
|
+
unless response.is_a?(Net::HTTPSuccess)
|
|
157
|
+
message = response ? "Failed to download parquet file: #{response.code}" : 'Failed to download parquet file'
|
|
158
|
+
raise DownloadError, message
|
|
159
|
+
end
|
|
160
|
+
|
|
161
|
+
return yield response if block_given?
|
|
162
|
+
response
|
|
130
163
|
end
|
|
131
164
|
end
|
|
132
165
|
end
|
|
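One subtle point in `perform_request_with_redirects` above is resolving a relative `Location` header against the current URI via `uri + location`. A small sketch of that resolution (the URLs are hypothetical):

```ruby
require 'uri'

# Resolving redirect targets the way perform_request_with_redirects does.
base = URI('https://huggingface.co/api/datasets/hotpot_qa/parquet')

location = '/datasets/hotpot_qa/resolve/main/train.parquet' # path-only Location header
new_uri = URI(location)
new_uri = base + location if new_uri.relative? # merge against the current request URI
```

An absolute `Location` (one with a scheme and host) is used as-is; only relative ones are merged against the URI that issued the redirect.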
@@ -28,6 +28,28 @@ module DSPy
|
|
|
28
28
|
homepage: 'https://huggingface.co/datasets/ade-benchmark-corpus/ade_corpus_v2',
|
|
29
29
|
approx_row_count: 23516
|
|
30
30
|
}
|
|
31
|
+
),
|
|
32
|
+
DatasetInfo.new(
|
|
33
|
+
id: 'hotpot_qa/fullwiki',
|
|
34
|
+
name: 'HotPotQA (FullWiki)',
|
|
35
|
+
provider: 'huggingface',
|
|
36
|
+
splits: %w[train validation],
|
|
37
|
+
features: {
|
|
38
|
+
'id' => { 'type' => 'string' },
|
|
39
|
+
'question' => { 'type' => 'string' },
|
|
40
|
+
'answer' => { 'type' => 'string' },
|
|
41
|
+
'level' => { 'type' => 'string' }
|
|
42
|
+
},
|
|
43
|
+
loader: :huggingface_parquet,
|
|
44
|
+
loader_options: {
|
|
45
|
+
dataset: 'hotpot_qa',
|
|
46
|
+
config: 'fullwiki'
|
|
47
|
+
},
|
|
48
|
+
metadata: {
|
|
49
|
+
description: 'HotPotQA FullWiki configuration. The DSPy::Datasets::HotPotQA helper further filters to hard examples and produces train/dev/test splits.',
|
|
50
|
+
homepage: 'https://huggingface.co/datasets/hotpot_qa',
|
|
51
|
+
approx_row_count: 112_000
|
|
52
|
+
}
|
|
31
53
|
)
|
|
32
54
|
].freeze
|
|
33
55
|
end
|
data/lib/dspy/datasets.rb
CHANGED
metadata
CHANGED
|
@@ -1,13 +1,14 @@
|
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
|
2
2
|
name: dspy-datasets
|
|
3
3
|
version: !ruby/object:Gem::Version
|
|
4
|
-
version: 0.
|
|
4
|
+
version: 1.0.0
|
|
5
5
|
platform: ruby
|
|
6
6
|
authors:
|
|
7
7
|
- Vicente Reig Rincón de Arellano
|
|
8
|
+
autorequire:
|
|
8
9
|
bindir: bin
|
|
9
10
|
cert_chain: []
|
|
10
|
-
date: 2025-10-
|
|
11
|
+
date: 2025-10-25 00:00:00.000000000 Z
|
|
11
12
|
dependencies:
|
|
12
13
|
- !ruby/object:Gem::Dependency
|
|
13
14
|
name: dspy
|
|
@@ -15,14 +16,14 @@ dependencies:
|
|
|
15
16
|
requirements:
|
|
16
17
|
- - '='
|
|
17
18
|
- !ruby/object:Gem::Version
|
|
18
|
-
version: 0.
|
|
19
|
+
version: 0.30.0
|
|
19
20
|
type: :runtime
|
|
20
21
|
prerelease: false
|
|
21
22
|
version_requirements: !ruby/object:Gem::Requirement
|
|
22
23
|
requirements:
|
|
23
24
|
- - '='
|
|
24
25
|
- !ruby/object:Gem::Version
|
|
25
|
-
version: 0.
|
|
26
|
+
version: 0.30.0
|
|
26
27
|
- !ruby/object:Gem::Dependency
|
|
27
28
|
name: red-parquet
|
|
28
29
|
requirement: !ruby/object:Gem::Requirement
|
|
@@ -51,6 +52,7 @@ files:
|
|
|
51
52
|
- lib/dspy/datasets/ade.rb
|
|
52
53
|
- lib/dspy/datasets/dataset.rb
|
|
53
54
|
- lib/dspy/datasets/errors.rb
|
|
55
|
+
- lib/dspy/datasets/hotpot_qa.rb
|
|
54
56
|
- lib/dspy/datasets/hugging_face/api.rb
|
|
55
57
|
- lib/dspy/datasets/info.rb
|
|
56
58
|
- lib/dspy/datasets/loaders.rb
|
|
@@ -62,6 +64,7 @@ licenses:
|
|
|
62
64
|
- MIT
|
|
63
65
|
metadata:
|
|
64
66
|
github_repo: git@github.com:vicentereig/dspy.rb
|
|
67
|
+
post_install_message:
|
|
65
68
|
rdoc_options: []
|
|
66
69
|
require_paths:
|
|
67
70
|
- lib
|
|
@@ -76,7 +79,8 @@ required_rubygems_version: !ruby/object:Gem::Requirement
|
|
|
76
79
|
- !ruby/object:Gem::Version
|
|
77
80
|
version: '0'
|
|
78
81
|
requirements: []
|
|
79
|
-
rubygems_version: 3.
|
|
82
|
+
rubygems_version: 3.0.3.1
|
|
83
|
+
signing_key:
|
|
80
84
|
specification_version: 4
|
|
81
85
|
summary: Curated datasets and loaders for DSPy.rb.
|
|
82
86
|
test_files: []
|