PyPI - prompture - Versions diffs - 0.0.35.dev1__tar.gz → 0.0.36__tar.gz - Mend

prompture 0.0.35.dev1tar.gz → 0.0.36tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (125) hide show

prompture-0.0.36/PKG-INFO ADDED Viewed

@@ -0,0 +1,369 @@
+Metadata-Version: 2.4
+Name: prompture
+Version: 0.0.36
+Summary: Ask LLMs to return structured JSON and run cross-model tests. API-first.
+Author-email: Juan Denis <juan@vene.co>
+License-Expression: MIT
+Project-URL: Homepage, https://github.com/jhd3197/prompture
+Classifier: Programming Language :: Python :: 3
+Classifier: Operating System :: OS Independent
+Requires-Python: >=3.9
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: anthropic>=0.8.0
+Requires-Dist: click>=8.0
+Requires-Dist: google-generativeai>=0.3.0
+Requires-Dist: groq>=0.4.0
+Requires-Dist: httpx>=0.25.0
+Requires-Dist: jsonschema>=4.0
+Requires-Dist: openai>=1.0.0
+Requires-Dist: pandas>=1.3.0
+Requires-Dist: pydantic>=1.10
+Requires-Dist: pydantic-settings>=2.0
+Requires-Dist: python-dotenv>=0.19.0
+Requires-Dist: python-toon>=0.1.0
+Requires-Dist: requests>=2.28
+Requires-Dist: python-dateutil>=2.9.0
+Requires-Dist: tukuy>=0.0.6
+Requires-Dist: pyyaml>=6.0
+Provides-Extra: test
+Requires-Dist: pytest>=7.0; extra == "test"
+Requires-Dist: pytest-asyncio>=0.23.0; extra == "test"
+Provides-Extra: dev
+Requires-Dist: pytest>=7.0; extra == "dev"
+Requires-Dist: pytest-asyncio>=0.23.0; extra == "dev"
+Requires-Dist: ruff>=0.8.0; extra == "dev"
+Provides-Extra: airllm
+Requires-Dist: airllm>=2.8.0; extra == "airllm"
+Provides-Extra: redis
+Requires-Dist: redis>=4.0; extra == "redis"
+Provides-Extra: serve
+Requires-Dist: fastapi>=0.100; extra == "serve"
+Requires-Dist: uvicorn[standard]>=0.20; extra == "serve"
+Requires-Dist: sse-starlette>=1.6; extra == "serve"
+Provides-Extra: scaffold
+Requires-Dist: jinja2>=3.0; extra == "scaffold"
+Dynamic: license-file
+<p align="center">
+  <h1 align="center">Prompture</h1>
+  <p align="center">Structured JSON extraction from any LLM. Schema-enforced, Pydantic-native, multi-provider.</p>
+</p>
+<p align="center">
+  <a href="https://pypi.org/project/prompture/"><img src="https://badge.fury.io/py/prompture.svg" alt="PyPI version"></a>
+  <a href="https://pypi.org/project/prompture/"><img src="https://img.shields.io/pypi/pyversions/prompture.svg" alt="Python versions"></a>
+  <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License: MIT"></a>
+  <a href="https://pepy.tech/project/prompture"><img src="https://static.pepy.tech/badge/prompture" alt="Downloads"></a>
+  <a href="https://github.com/jhd3197/prompture"><img src="https://img.shields.io/github/stars/jhd3197/prompture?style=social" alt="GitHub stars"></a>
+</p>
+---
+**Prompture** is a Python library that turns LLM responses into validated, structured data. Define a schema or Pydantic model, point it at any provider, and get typed output back — with token tracking, cost calculation, and automatic JSON repair built in.
+```python
+from pydantic import BaseModel
+from prompture import extract_with_model
+class Person(BaseModel):
+    name: str
+    age: int
+    profession: str
+person = extract_with_model(Person, "Maria is 32, a developer in NYC.", model_name="openai/gpt-4")
+print(person.name)  # Maria
+```
+## Key Features
+- **Structured output** — JSON schema enforcement and direct Pydantic model population
+- **12 providers** — OpenAI, Claude, Google, Groq, Grok, Azure, Ollama, LM Studio, OpenRouter, HuggingFace, AirLLM, and generic HTTP
+- **TOON input conversion** — 45-60% token savings when sending structured data via [Token-Oriented Object Notation](https://github.com/jhd3197/python-toon)
+- **Stepwise extraction** — Per-field prompts with smart type coercion (shorthand numbers, multilingual booleans, dates)
+- **Field registry** — 50+ predefined extraction fields with template variables and Pydantic integration
+- **Conversations** — Stateful multi-turn sessions with sync and async support
+- **Tool use** — Function calling and streaming across supported providers
+- **Caching** — Built-in response cache with memory, SQLite, and Redis backends
+- **Plugin system** — Register custom drivers via entry points
+- **Usage tracking** — Token counts and cost calculation on every call
+- **Auto-repair** — Optional second LLM pass to fix malformed JSON
+- **Batch testing** — Spec-driven suites to compare models side by side
+## Installation
+```bash
+pip install prompture
+```
+Optional extras:
+```bash
+pip install prompture[redis]     # Redis cache backend
+pip install prompture[serve]     # FastAPI server mode
+pip install prompture[airllm]    # AirLLM local inference
+```
+## Configuration
+Set API keys for the providers you use. Prompture reads from environment variables or a `.env` file:
+```bash
+OPENAI_API_KEY=sk-...
+ANTHROPIC_API_KEY=sk-ant-...
+GOOGLE_API_KEY=...
+GROQ_API_KEY=...
+GROK_API_KEY=...
+OPENROUTER_API_KEY=...
+AZURE_OPENAI_ENDPOINT=...
+AZURE_OPENAI_API_KEY=...
+```
+Local providers (Ollama, LM Studio) work out of the box with no keys required.
+## Providers
+Model strings use `"provider/model"` format. The provider prefix routes to the correct driver automatically.
+| Provider | Example Model | Cost |
+|---|---|---|
+| `openai` | `openai/gpt-4` | Automatic |
+| `claude` | `claude/claude-3` | Automatic |
+| `google` | `google/gemini-1.5-pro` | Automatic |
+| `groq` | `groq/llama2-70b-4096` | Automatic |
+| `grok` | `grok/grok-4-fast-reasoning` | Automatic |
+| `azure` | `azure/deployed-name` | Automatic |
+| `openrouter` | `openrouter/anthropic/claude-2` | Automatic |
+| `ollama` | `ollama/llama3.1:8b` | Free (local) |
+| `lmstudio` | `lmstudio/local-model` | Free (local) |
+| `huggingface` | `hf/model-name` | Free (local) |
+| `http` | `http/self-hosted` | Free |
+## Usage
+### One-Shot Pydantic Extraction
+Single LLM call, returns a validated Pydantic instance:
+```python
+from typing import List, Optional
+from pydantic import BaseModel
+from prompture import extract_with_model
+class Person(BaseModel):
+    name: str
+    age: int
+    profession: str
+    city: str
+    hobbies: List[str]
+    education: Optional[str] = None
+person = extract_with_model(
+    Person,
+    "Maria is 32, a software developer in New York. She loves hiking and photography.",
+    model_name="openai/gpt-4"
+)
+print(person.model_dump())
+```
+### Stepwise Extraction
+One LLM call per field. Higher accuracy, per-field error recovery:
+```python
+from prompture import stepwise_extract_with_model
+result = stepwise_extract_with_model(
+    Person,
+    "Maria is 32, a software developer in New York. She loves hiking and photography.",
+    model_name="openai/gpt-4"
+)
+print(result["model"].model_dump())
+print(result["usage"])  # per-field and total token usage
+```
+| Aspect | `extract_with_model` | `stepwise_extract_with_model` |
+|---|---|---|
+| LLM calls | 1 | N (one per field) |
+| Speed / cost | Faster, cheaper | Slower, higher |
+| Accuracy | Good global coherence | Higher per-field accuracy |
+| Error handling | All-or-nothing | Per-field recovery |
+### JSON Schema Extraction
+For raw JSON output with full control:
+```python
+from prompture import ask_for_json
+schema = {
+    "type": "object",
+    "required": ["name", "age"],
+    "properties": {
+        "name": {"type": "string"},
+        "age": {"type": "integer"}
+    }
+}
+result = ask_for_json(
+    content_prompt="Extract the person's info from: John is 28 and lives in Miami.",
+    json_schema=schema,
+    model_name="openai/gpt-4"
+)
+print(result["json_object"])  # {"name": "John", "age": 28}
+print(result["usage"])        # token counts and cost
+```
+### TOON Input — Token Savings
+Analyze structured data with automatic TOON conversion for 45-60% fewer tokens:
+```python
+from prompture import extract_from_data
+products = [
+    {"id": 1, "name": "Laptop", "price": 999.99, "rating": 4.5},
+    {"id": 2, "name": "Book", "price": 19.99, "rating": 4.2},
+    {"id": 3, "name": "Headphones", "price": 149.99, "rating": 4.7},
+]
+result = extract_from_data(
+    data=products,
+    question="What is the average price and highest rated product?",
+    json_schema={
+        "type": "object",
+        "properties": {
+            "average_price": {"type": "number"},
+            "highest_rated": {"type": "string"}
+        }
+    },
+    model_name="openai/gpt-4"
+)
+print(result["json_object"])
+# {"average_price": 389.99, "highest_rated": "Headphones"}
+print(f"Token savings: {result['token_savings']['percentage_saved']}%")
+```
+Works with Pandas DataFrames via `extract_from_pandas()`.
+### Field Definitions
+Use the built-in field registry for consistent extraction across models:
+```python
+from pydantic import BaseModel
+from prompture import field_from_registry, stepwise_extract_with_model
+class Person(BaseModel):
+    name: str = field_from_registry("name")
+    age: int = field_from_registry("age")
+    email: str = field_from_registry("email")
+    occupation: str = field_from_registry("occupation")
+result = stepwise_extract_with_model(
+    Person,
+    "John Smith, 25, software engineer at TechCorp, john@example.com",
+    model_name="openai/gpt-4"
+)
+```
+Register custom fields with template variables:
+```python
+from prompture import register_field
+register_field("document_date", {
+    "type": "str",
+    "description": "Document creation date",
+    "instructions": "Use {{current_date}} if not specified",
+    "default": "{{current_date}}",
+    "nullable": False
+})
+```
+### Conversations
+Stateful multi-turn sessions:
+```python
+from prompture import Conversation
+conv = Conversation(model_name="openai/gpt-4")
+conv.add_message("system", "You are a helpful assistant.")
+response = conv.send("What is the capital of France?")
+follow_up = conv.send("What about Germany?")  # retains context
+```
+### Model Discovery
+Auto-detect available models from configured providers:
+```python
+from prompture import get_available_models
+models = get_available_models()
+for model in models:
+    print(model)  # "openai/gpt-4", "ollama/llama3:latest", ...
+```
+### Logging and Debugging
+```python
+import logging
+from prompture import configure_logging
+configure_logging(logging.DEBUG)
+```
+### Response Shape
+All extraction functions return a consistent structure:
+```python
+{
+    "json_string": str,       # raw JSON text
+    "json_object": dict,      # parsed result
+    "usage": {
+        "prompt_tokens": int,
+        "completion_tokens": int,
+        "total_tokens": int,
+        "cost": float,
+        "model_name": str
+    }
+}
+```
+## CLI
+```bash
+prompture run <spec-file>
+```
+Run spec-driven extraction suites for cross-model comparison.
+## Development
+```bash
+# Install with dev dependencies
+pip install -e ".[test,dev]"
+# Run tests
+pytest
+# Run integration tests (requires live LLM access)
+pytest --run-integration
+# Lint and format
+ruff check .
+ruff format .
+```
+## Contributing
+PRs welcome. Please add tests for new functionality and examples under `examples/` for new drivers or patterns.
+## License
+[MIT](https://opensource.org/licenses/MIT)

prompture-0.0.36/README.md ADDED Viewed

@@ -0,0 +1,322 @@
+<p align="center">
+  <h1 align="center">Prompture</h1>
+  <p align="center">Structured JSON extraction from any LLM. Schema-enforced, Pydantic-native, multi-provider.</p>
+</p>
+<p align="center">
+  <a href="https://pypi.org/project/prompture/"><img src="https://badge.fury.io/py/prompture.svg" alt="PyPI version"></a>
+  <a href="https://pypi.org/project/prompture/"><img src="https://img.shields.io/pypi/pyversions/prompture.svg" alt="Python versions"></a>
+  <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License: MIT"></a>
+  <a href="https://pepy.tech/project/prompture"><img src="https://static.pepy.tech/badge/prompture" alt="Downloads"></a>
+  <a href="https://github.com/jhd3197/prompture"><img src="https://img.shields.io/github/stars/jhd3197/prompture?style=social" alt="GitHub stars"></a>
+</p>
+---
+**Prompture** is a Python library that turns LLM responses into validated, structured data. Define a schema or Pydantic model, point it at any provider, and get typed output back — with token tracking, cost calculation, and automatic JSON repair built in.
+```python
+from pydantic import BaseModel
+from prompture import extract_with_model
+class Person(BaseModel):
+    name: str
+    age: int
+    profession: str
+person = extract_with_model(Person, "Maria is 32, a developer in NYC.", model_name="openai/gpt-4")
+print(person.name)  # Maria
+```
+## Key Features
+- **Structured output** — JSON schema enforcement and direct Pydantic model population
+- **12 providers** — OpenAI, Claude, Google, Groq, Grok, Azure, Ollama, LM Studio, OpenRouter, HuggingFace, AirLLM, and generic HTTP
+- **TOON input conversion** — 45-60% token savings when sending structured data via [Token-Oriented Object Notation](https://github.com/jhd3197/python-toon)
+- **Stepwise extraction** — Per-field prompts with smart type coercion (shorthand numbers, multilingual booleans, dates)
+- **Field registry** — 50+ predefined extraction fields with template variables and Pydantic integration
+- **Conversations** — Stateful multi-turn sessions with sync and async support
+- **Tool use** — Function calling and streaming across supported providers
+- **Caching** — Built-in response cache with memory, SQLite, and Redis backends
+- **Plugin system** — Register custom drivers via entry points
+- **Usage tracking** — Token counts and cost calculation on every call
+- **Auto-repair** — Optional second LLM pass to fix malformed JSON
+- **Batch testing** — Spec-driven suites to compare models side by side
+## Installation
+```bash
+pip install prompture
+```
+Optional extras:
+```bash
+pip install prompture[redis]     # Redis cache backend
+pip install prompture[serve]     # FastAPI server mode
+pip install prompture[airllm]    # AirLLM local inference
+```
+## Configuration
+Set API keys for the providers you use. Prompture reads from environment variables or a `.env` file:
+```bash
+OPENAI_API_KEY=sk-...
+ANTHROPIC_API_KEY=sk-ant-...
+GOOGLE_API_KEY=...
+GROQ_API_KEY=...
+GROK_API_KEY=...
+OPENROUTER_API_KEY=...
+AZURE_OPENAI_ENDPOINT=...
+AZURE_OPENAI_API_KEY=...
+```
+Local providers (Ollama, LM Studio) work out of the box with no keys required.
+## Providers
+Model strings use `"provider/model"` format. The provider prefix routes to the correct driver automatically.
+| Provider | Example Model | Cost |
+|---|---|---|
+| `openai` | `openai/gpt-4` | Automatic |
+| `claude` | `claude/claude-3` | Automatic |
+| `google` | `google/gemini-1.5-pro` | Automatic |
+| `groq` | `groq/llama2-70b-4096` | Automatic |
+| `grok` | `grok/grok-4-fast-reasoning` | Automatic |
+| `azure` | `azure/deployed-name` | Automatic |
+| `openrouter` | `openrouter/anthropic/claude-2` | Automatic |
+| `ollama` | `ollama/llama3.1:8b` | Free (local) |
+| `lmstudio` | `lmstudio/local-model` | Free (local) |
+| `huggingface` | `hf/model-name` | Free (local) |
+| `http` | `http/self-hosted` | Free |
+## Usage
+### One-Shot Pydantic Extraction
+Single LLM call, returns a validated Pydantic instance:
+```python
+from typing import List, Optional
+from pydantic import BaseModel
+from prompture import extract_with_model
+class Person(BaseModel):
+    name: str
+    age: int
+    profession: str
+    city: str
+    hobbies: List[str]
+    education: Optional[str] = None
+person = extract_with_model(
+    Person,
+    "Maria is 32, a software developer in New York. She loves hiking and photography.",
+    model_name="openai/gpt-4"
+)
+print(person.model_dump())
+```
+### Stepwise Extraction
+One LLM call per field. Higher accuracy, per-field error recovery:
+```python
+from prompture import stepwise_extract_with_model
+result = stepwise_extract_with_model(
+    Person,
+    "Maria is 32, a software developer in New York. She loves hiking and photography.",
+    model_name="openai/gpt-4"
+)
+print(result["model"].model_dump())
+print(result["usage"])  # per-field and total token usage
+```
+| Aspect | `extract_with_model` | `stepwise_extract_with_model` |
+|---|---|---|
+| LLM calls | 1 | N (one per field) |
+| Speed / cost | Faster, cheaper | Slower, higher |
+| Accuracy | Good global coherence | Higher per-field accuracy |
+| Error handling | All-or-nothing | Per-field recovery |
+### JSON Schema Extraction
+For raw JSON output with full control:
+```python
+from prompture import ask_for_json
+schema = {
+    "type": "object",
+    "required": ["name", "age"],
+    "properties": {
+        "name": {"type": "string"},
+        "age": {"type": "integer"}
+    }
+}
+result = ask_for_json(
+    content_prompt="Extract the person's info from: John is 28 and lives in Miami.",
+    json_schema=schema,
+    model_name="openai/gpt-4"
+)
+print(result["json_object"])  # {"name": "John", "age": 28}
+print(result["usage"])        # token counts and cost
+```
+### TOON Input — Token Savings
+Analyze structured data with automatic TOON conversion for 45-60% fewer tokens:
+```python
+from prompture import extract_from_data
+products = [
+    {"id": 1, "name": "Laptop", "price": 999.99, "rating": 4.5},
+    {"id": 2, "name": "Book", "price": 19.99, "rating": 4.2},
+    {"id": 3, "name": "Headphones", "price": 149.99, "rating": 4.7},
+]
+result = extract_from_data(
+    data=products,
+    question="What is the average price and highest rated product?",
+    json_schema={
+        "type": "object",
+        "properties": {
+            "average_price": {"type": "number"},
+            "highest_rated": {"type": "string"}
+        }
+    },
+    model_name="openai/gpt-4"
+)
+print(result["json_object"])
+# {"average_price": 389.99, "highest_rated": "Headphones"}
+print(f"Token savings: {result['token_savings']['percentage_saved']}%")
+```
+Works with Pandas DataFrames via `extract_from_pandas()`.
+### Field Definitions
+Use the built-in field registry for consistent extraction across models:
+```python
+from pydantic import BaseModel
+from prompture import field_from_registry, stepwise_extract_with_model
+class Person(BaseModel):
+    name: str = field_from_registry("name")
+    age: int = field_from_registry("age")
+    email: str = field_from_registry("email")
+    occupation: str = field_from_registry("occupation")
+result = stepwise_extract_with_model(
+    Person,
+    "John Smith, 25, software engineer at TechCorp, john@example.com",
+    model_name="openai/gpt-4"
+)
+```
+Register custom fields with template variables:
+```python
+from prompture import register_field
+register_field("document_date", {
+    "type": "str",
+    "description": "Document creation date",
+    "instructions": "Use {{current_date}} if not specified",
+    "default": "{{current_date}}",
+    "nullable": False
+})
+```
+### Conversations
+Stateful multi-turn sessions:
+```python
+from prompture import Conversation
+conv = Conversation(model_name="openai/gpt-4")
+conv.add_message("system", "You are a helpful assistant.")
+response = conv.send("What is the capital of France?")
+follow_up = conv.send("What about Germany?")  # retains context
+```
+### Model Discovery
+Auto-detect available models from configured providers:
+```python
+from prompture import get_available_models
+models = get_available_models()
+for model in models:
+    print(model)  # "openai/gpt-4", "ollama/llama3:latest", ...
+```
+### Logging and Debugging
+```python
+import logging
+from prompture import configure_logging
+configure_logging(logging.DEBUG)
+```
+### Response Shape
+All extraction functions return a consistent structure:
+```python
+{
+    "json_string": str,       # raw JSON text
+    "json_object": dict,      # parsed result
+    "usage": {
+        "prompt_tokens": int,
+        "completion_tokens": int,
+        "total_tokens": int,
+        "cost": float,
+        "model_name": str
+    }
+}
+```
+## CLI
+```bash
+prompture run <spec-file>
+```
+Run spec-driven extraction suites for cross-model comparison.
+## Development
+```bash
+# Install with dev dependencies
+pip install -e ".[test,dev]"
+# Run tests
+pytest
+# Run integration tests (requires live LLM access)
+pytest --run-integration
+# Lint and format
+ruff check .
+ruff format .
+```
+## Contributing
+PRs welcome. Please add tests for new functionality and examples under `examples/` for new drivers or patterns.
+## License
+[MIT](https://opensource.org/licenses/MIT)

prompture-0.0.36/VERSION ADDED Viewed

	@@ -0,0 +1 @@
1	+ 0.0.36

{prompture-0.0.35.dev1 → prompture-0.0.36}/prompture/_version.py RENAMED Viewed

@@ -28,7 +28,7 @@ version_tuple: VERSION_TUPLE
 commit_id: COMMIT_ID
 __commit_id__: COMMIT_ID
-__version__ = version = '0.0.35.dev1'
-__version_tuple__ = version_tuple = (0, 0, 35, 'dev1')
+__version__ = version = '0.0.36'
+__version_tuple__ = version_tuple = (0, 0, 36)
 __commit_id__ = commit_id = None

prompture 0.0.35.dev1__tar.gz → 0.0.36__tar.gz

prompture 0.0.35.dev1tar.gz → 0.0.36tar.gz