llm_batch_helper 0.1.5__tar.gz → 0.2.0__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/PKG-INFO +47 -5
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/README.md +45 -2
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/__init__.py +1 -1
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/config.py +2 -0
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/providers.py +65 -4
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/pyproject.toml +3 -3
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/LICENSE +0 -0
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/cache.py +0 -0
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/exceptions.py +0 -0
- {llm_batch_helper-0.1.5 → llm_batch_helper-0.2.0}/llm_batch_helper/input_handlers.py +0 -0
````diff
--- llm_batch_helper-0.1.5/PKG-INFO
+++ llm_batch_helper-0.2.0/PKG-INFO
@@ -1,9 +1,9 @@
 Metadata-Version: 2.3
 Name: llm_batch_helper
-Version: 0.1.5
+Version: 0.2.0
 Summary: A Python package that enables batch submission of prompts to LLM APIs, with built-in async capabilities and response caching.
 License: MIT
-Keywords: llm,openai,together,batch,async,ai,nlp,api
+Keywords: llm,openai,together,openrouter,batch,async,ai,nlp,api
 Author: Tianyi Peng
 Author-email: tianyipeng95@gmail.com
 Requires-Python: >=3.11,<4.0
@@ -19,7 +19,6 @@ Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
 Classifier: Topic :: Software Development :: Libraries :: Python Modules
 Requires-Dist: httpx (>=0.24.0,<2.0.0)
 Requires-Dist: openai (>=1.0.0,<2.0.0)
-Requires-Dist: python-dotenv (>=1.0.0,<2.0.0)
 Requires-Dist: tenacity (>=8.0.0,<9.0.0)
 Requires-Dist: tqdm (>=4.65.0,<5.0.0)
 Project-URL: Homepage, https://github.com/TianyiPeng/LLM_batch_helper
@@ -28,7 +27,29 @@ Description-Content-Type: text/markdown
 
 # LLM Batch Helper
 
-
+[PyPI version](https://badge.fury.io/py/llm_batch_helper)
+[Downloads](https://pepy.tech/project/llm_batch_helper)
+[Downloads](https://pepy.tech/project/llm_batch_helper)
+[Documentation](https://llm-batch-helper.readthedocs.io/en/latest/?badge=latest)
+[License: MIT](https://opensource.org/licenses/MIT)
+
+A Python package that enables batch submission of prompts to LLM APIs, with built-in async capabilities, response caching, prompt verification, and more. This package is designed to streamline applications like LLM simulation, LLM-as-a-judge, and other batch processing scenarios.
+
+📖 **[Complete Documentation](https://llm-batch-helper.readthedocs.io/)** | 🚀 **[Quick Start Guide](https://llm-batch-helper.readthedocs.io/en/latest/quickstart.html)**
+
+## Why we designed this package
+
+Calling LLM APIs has become increasingly common, but several pain points exist in practice:
+
+1. **Efficient Batch Processing**: How do you run LLM calls in batches efficiently? Our async implementation is 3X-100X faster than multi-thread/multi-process approaches.
+
+2. **API Reliability**: LLM APIs can be unstable, so we need robust retry mechanisms when calls get interrupted.
+
+3. **Long-Running Simulations**: During long-running LLM simulations, computers can crash and APIs can fail. Can we cache LLM API calls to avoid repeating completed work?
+
+4. **Output Validation**: LLM outputs often have format requirements. If the output isn't right, we need to retry with validation.
+
+This package is designed to solve these exact pain points with async processing, intelligent caching, and comprehensive error handling. If there are some additional features you need, please post an issue.
 
 ## Features
 
@@ -67,6 +88,7 @@ poetry shell
 
 ### 1. Set up environment variables
 
+**Option A: Environment Variables**
 ```bash
 # For OpenAI
 export OPENAI_API_KEY="your-openai-api-key"
@@ -75,6 +97,22 @@ export OPENAI_API_KEY="your-openai-api-key"
 export TOGETHER_API_KEY="your-together-api-key"
 ```
 
+**Option B: .env File (Recommended for Development)**
+```python
+# In your script, before importing llm_batch_helper
+from dotenv import load_dotenv
+load_dotenv() # Load from .env file
+
+# Then use the package normally
+from llm_batch_helper import LLMConfig, process_prompts_batch
+```
+
+Create a `.env` file in your project:
+```
+OPENAI_API_KEY=your-openai-api-key
+TOGETHER_API_KEY=your-together-api-key
+```
+
 ### 2. Interactive Tutorial (Recommended)
 
 Check out the comprehensive Jupyter notebook [tutorial](https://github.com/TianyiPeng/LLM_batch_helper/blob/main/tutorials/llm_batch_helper_tutorial.ipynb).
@@ -85,8 +123,12 @@ The tutorial covers all features with interactive examples!
 
 ```python
 import asyncio
+from dotenv import load_dotenv # Optional: for .env file support
 from llm_batch_helper import LLMConfig, process_prompts_batch
 
+# Optional: Load environment variables from .env file
+load_dotenv()
+
 async def main():
     # Create configuration
     config = LLMConfig(
@@ -203,7 +245,7 @@ Main function for batch processing of prompts.
 ```python
 async def process_prompts_batch(
     config: LLMConfig,
-    provider: str, # "openai" or "
+    provider: str, # "openai", "together", or "openrouter"
     prompts: Optional[List[str]] = None,
     input_dir: Optional[str] = None,
     cache_dir: str = "llm_cache",
````
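The PKG-INFO hunk above updates the `process_prompts_batch` signature comment to list `"openrouter"` as a third accepted provider string. A minimal end-to-end sketch of what a call with the new provider could look like — the model identifier, prompts, and result handling below are illustrative assumptions, not taken from the package's documentation:

```python
# Sketch (illustrative): batch call using the new "openrouter" provider in 0.2.0.
# Assumes OPENROUTER_API_KEY is already set in the environment and that the
# model id below is one your OpenRouter account can access.
import asyncio

from llm_batch_helper import LLMConfig, process_prompts_batch


async def main():
    config = LLMConfig(
        model_name="openai/gpt-4o-mini",  # illustrative OpenRouter-style model id
        temperature=0.7,
        max_completion_tokens=256,
    )
    results = await process_prompts_batch(
        config=config,
        provider="openrouter",  # new in 0.2.0; "openai" and "together" still work
        prompts=["Say hello in one sentence.", "Name three prime numbers."],
        cache_dir="llm_cache",
    )
    print(results)  # per-prompt result structure is not shown in this diff


if __name__ == "__main__":
    asyncio.run(main())
```

Per the signature, `cache_dir` defaults to `"llm_cache"`, so repeated runs can reuse cached responses unless regeneration is forced.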
````diff
--- llm_batch_helper-0.1.5/README.md
+++ llm_batch_helper-0.2.0/README.md
@@ -1,6 +1,28 @@
 # LLM Batch Helper
 
-
+[PyPI version](https://badge.fury.io/py/llm_batch_helper)
+[Downloads](https://pepy.tech/project/llm_batch_helper)
+[Downloads](https://pepy.tech/project/llm_batch_helper)
+[Documentation](https://llm-batch-helper.readthedocs.io/en/latest/?badge=latest)
+[License: MIT](https://opensource.org/licenses/MIT)
+
+A Python package that enables batch submission of prompts to LLM APIs, with built-in async capabilities, response caching, prompt verification, and more. This package is designed to streamline applications like LLM simulation, LLM-as-a-judge, and other batch processing scenarios.
+
+📖 **[Complete Documentation](https://llm-batch-helper.readthedocs.io/)** | 🚀 **[Quick Start Guide](https://llm-batch-helper.readthedocs.io/en/latest/quickstart.html)**
+
+## Why we designed this package
+
+Calling LLM APIs has become increasingly common, but several pain points exist in practice:
+
+1. **Efficient Batch Processing**: How do you run LLM calls in batches efficiently? Our async implementation is 3X-100X faster than multi-thread/multi-process approaches.
+
+2. **API Reliability**: LLM APIs can be unstable, so we need robust retry mechanisms when calls get interrupted.
+
+3. **Long-Running Simulations**: During long-running LLM simulations, computers can crash and APIs can fail. Can we cache LLM API calls to avoid repeating completed work?
+
+4. **Output Validation**: LLM outputs often have format requirements. If the output isn't right, we need to retry with validation.
+
+This package is designed to solve these exact pain points with async processing, intelligent caching, and comprehensive error handling. If there are some additional features you need, please post an issue.
 
 ## Features
 
@@ -39,6 +61,7 @@ poetry shell
 
 ### 1. Set up environment variables
 
+**Option A: Environment Variables**
 ```bash
 # For OpenAI
 export OPENAI_API_KEY="your-openai-api-key"
@@ -47,6 +70,22 @@ export OPENAI_API_KEY="your-openai-api-key"
 export TOGETHER_API_KEY="your-together-api-key"
 ```
 
+**Option B: .env File (Recommended for Development)**
+```python
+# In your script, before importing llm_batch_helper
+from dotenv import load_dotenv
+load_dotenv() # Load from .env file
+
+# Then use the package normally
+from llm_batch_helper import LLMConfig, process_prompts_batch
+```
+
+Create a `.env` file in your project:
+```
+OPENAI_API_KEY=your-openai-api-key
+TOGETHER_API_KEY=your-together-api-key
+```
+
 ### 2. Interactive Tutorial (Recommended)
 
 Check out the comprehensive Jupyter notebook [tutorial](https://github.com/TianyiPeng/LLM_batch_helper/blob/main/tutorials/llm_batch_helper_tutorial.ipynb).
@@ -57,8 +96,12 @@ The tutorial covers all features with interactive examples!
 
 ```python
 import asyncio
+from dotenv import load_dotenv # Optional: for .env file support
 from llm_batch_helper import LLMConfig, process_prompts_batch
 
+# Optional: Load environment variables from .env file
+load_dotenv()
+
 async def main():
     # Create configuration
     config = LLMConfig(
@@ -175,7 +218,7 @@ Main function for batch processing of prompts.
 ```python
 async def process_prompts_batch(
     config: LLMConfig,
-    provider: str, # "openai" or "
+    provider: str, # "openai", "together", or "openrouter"
     prompts: Optional[List[str]] = None,
     input_dir: Optional[str] = None,
     cache_dir: str = "llm_cache",
````
````diff
--- llm_batch_helper-0.1.5/llm_batch_helper/config.py
+++ llm_batch_helper-0.2.0/llm_batch_helper/config.py
@@ -16,6 +16,7 @@ class LLMConfig:
         verification_callback: Optional[Callable[..., bool]] = None,
         verification_callback_args: Optional[Dict] = None,
         max_completion_tokens: Optional[int] = None,
+        **kwargs
     ):
         self.model_name = model_name
         self.temperature = temperature
@@ -30,3 +31,4 @@ class LLMConfig:
         self.verification_callback_args = (
            verification_callback_args if verification_callback_args is not None else {}
         )
+        self.kwargs = kwargs
````
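The `**kwargs` catch-all added to `LLMConfig.__init__` is stored on `config.kwargs` and, as the providers.py hunks below show, spread into each provider request via `**config.kwargs`. A minimal sketch of the pass-through — the extra parameter names (`top_p`, `seed`) are illustrative and must be ones the chosen provider actually accepts:

```python
# Sketch (illustrative): extra sampling parameters ride along via **kwargs.
from llm_batch_helper import LLMConfig

config = LLMConfig(
    model_name="gpt-4o-mini",   # illustrative model name
    temperature=0.2,
    max_completion_tokens=128,
    top_p=0.9,                  # not a named LLMConfig field; captured by **kwargs
    seed=42,                    # stored and later forwarded to the provider call
)

# Stored verbatim, then expanded into the provider payload as **config.kwargs
print(config.kwargs)  # {'top_p': 0.9, 'seed': 42}
```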
````diff
--- llm_batch_helper-0.1.5/llm_batch_helper/providers.py
+++ llm_batch_helper-0.2.0/llm_batch_helper/providers.py
@@ -4,7 +4,6 @@ from typing import Any, Dict, List, Optional, Tuple, Union
 
 import httpx
 import openai
-from dotenv import load_dotenv
 from tenacity import retry, retry_if_exception_type, stop_after_attempt, wait_exponential
 from tqdm.asyncio import tqdm_asyncio
 
@@ -12,8 +11,6 @@ from .cache import LLMCache
 from .config import LLMConfig
 from .input_handlers import get_prompts
 
-load_dotenv()
-
 
 @retry(
     stop=stop_after_attempt(5),
@@ -49,6 +46,7 @@ async def _get_openai_response_direct(
             messages=messages,
             temperature=config.temperature,
             max_completion_tokens=config.max_completion_tokens,
+            **config.kwargs,
         )
         usage_details = {
             "prompt_token_count": response.usage.prompt_tokens,
@@ -97,6 +95,7 @@ async def _get_together_response_direct(
             "messages": messages,
             "temperature": config.temperature,
             "max_tokens": config.max_completion_tokens,
+            **config.kwargs,
         }
 
         response = await client.post(
@@ -119,6 +118,66 @@ async def _get_together_response_direct(
             "usage_details": usage_details,
         }
 
+
+@retry(
+    stop=stop_after_attempt(5),
+    wait=wait_exponential(multiplier=1, min=4, max=60),
+    retry=retry_if_exception_type(
+        (
+            ConnectionError,
+            TimeoutError,
+            httpx.HTTPStatusError,
+            httpx.RequestError,
+        )
+    ),
+    reraise=True,
+)
+async def _get_openrouter_response_direct(
+    prompt: str, config: LLMConfig
+) -> Dict[str, Union[str, Dict]]:
+    api_key = os.environ.get("OPENROUTER_API_KEY")
+    if not api_key:
+        raise ValueError("OPENROUTER_API_KEY environment variable not set")
+
+    async with httpx.AsyncClient(timeout=1000.0) as client:
+        messages = [
+            {"role": "system", "content": config.system_instruction},
+            {"role": "user", "content": prompt},
+        ]
+
+        headers = {
+            "Authorization": f"Bearer {api_key}",
+            "Content-Type": "application/json",
+        }
+
+        payload = {
+            "model": config.model_name,
+            "messages": messages,
+            "temperature": config.temperature,
+            "max_tokens": config.max_completion_tokens,
+            **config.kwargs,
+        }
+
+        response = await client.post(
+            "https://openrouter.ai/api/v1/chat/completions",
+            json=payload,
+            headers=headers,
+        )
+        response.raise_for_status()
+
+        response_data = response.json()
+        usage = response_data.get("usage", {})
+        usage_details = {
+            "prompt_token_count": usage.get("prompt_tokens", 0),
+            "completion_token_count": usage.get("completion_tokens", 0),
+            "total_token_count": usage.get("total_tokens", 0),
+        }
+
+        return {
+            "response_text": response_data["choices"][0]["message"]["content"],
+            "usage_details": usage_details,
+        }
+
 async def get_llm_response_with_internal_retry(
     prompt_id: str,
     prompt: str,
@@ -138,6 +197,8 @@ async def get_llm_response_with_internal_retry(
             response = await _get_openai_response_direct(prompt, config)
         elif provider.lower() == "together":
            response = await _get_together_response_direct(prompt, config)
+        elif provider.lower() == "openrouter":
+            response = await _get_openrouter_response_direct(prompt, config)
         else:
             raise ValueError(f"Unsupported provider: {provider}")
 
@@ -168,7 +229,7 @@ async def process_prompts_batch(
         prompts: Optional list of prompts in any supported format (string, tuple, or dict)
         input_dir: Optional path to directory containing prompt files
         config: LLM configuration
-        provider: LLM provider to use ("openai", "together", or "
+        provider: LLM provider to use ("openai", "together", or "openrouter")
         desc: Description for progress bar
         cache_dir: Optional directory for caching responses
         force: If True, force regeneration even if cached response exists
````
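Note that `load_dotenv()` is no longer called inside providers.py, so loading keys from a `.env` file is now the caller's responsibility; the provider functions only read `os.environ`, and the new OpenRouter path raises `ValueError` when `OPENROUTER_API_KEY` is unset. A minimal caller-side sketch, assuming python-dotenv is installed separately since it is no longer a runtime dependency:

```python
# Sketch (illustrative): the application, not the package, loads .env as of 0.2.0.
import os

from dotenv import load_dotenv  # optional dependency in 0.2.0

load_dotenv()  # populate os.environ from a local .env file, if present

# The OpenRouter provider raises ValueError when this variable is missing.
if not os.environ.get("OPENROUTER_API_KEY"):
    raise SystemExit("Set OPENROUTER_API_KEY (or OPENAI_API_KEY / TOGETHER_API_KEY).")
```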
````diff
--- llm_batch_helper-0.1.5/pyproject.toml
+++ llm_batch_helper-0.2.0/pyproject.toml
@@ -1,13 +1,13 @@
 [tool.poetry]
 name = "llm_batch_helper"
-version = "0.1.5"
+version = "0.2.0"
 description = "A Python package that enables batch submission of prompts to LLM APIs, with built-in async capabilities and response caching."
 authors = ["Tianyi Peng <tianyipeng95@gmail.com>"]
 readme = "README.md"
 license = "MIT"
 homepage = "https://github.com/TianyiPeng/LLM_batch_helper"
 repository = "https://github.com/TianyiPeng/LLM_batch_helper"
-keywords = ["llm", "openai", "together", "batch", "async", "ai", "nlp", "api"]
+keywords = ["llm", "openai", "together", "openrouter", "batch", "async", "ai", "nlp", "api"]
 classifiers = [
     "Development Status :: 4 - Beta",
     "Intended Audience :: Developers",
@@ -25,11 +25,11 @@ packages = [{include = "llm_batch_helper"}]
 python = "^3.11"
 httpx = ">=0.24.0,<2.0.0"
 openai = "^1.0.0"
-python-dotenv = "^1.0.0"
 tenacity = "^8.0.0"
 tqdm = "^4.65.0"
 
 [tool.poetry.group.dev.dependencies]
+python-dotenv = "^1.0.0" # Optional for .env file support
 pytest = "^7.0.0"
 black = "^23.0.0"
 isort = "^5.12.0"
````
Files without changes: LICENSE, llm_batch_helper/cache.py, llm_batch_helper/exceptions.py, llm_batch_helper/input_handlers.py