PyPI - predikit - Versions diffs - 0.4.1__tar.gz → 0.4.2__tar.gz - Mend

predikit 0.4.1tar.gz → 0.4.2tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (50) hide show

{predikit-0.4.1 → predikit-0.4.2}/.claude/settings.local.json RENAMED Viewed

@@ -3,7 +3,8 @@
     "allow": [
       "Bash(git add *)",
       "Bash(git commit -m ' *)",
-      "Bash(python -m pytest tests/ -v --tb=short --cov=src/predikit --cov-report=term-missing)"
+      "Bash(python -m pytest tests/ -v --tb=short --cov=src/predikit --cov-report=term-missing)",
+      "WebFetch(domain:github.com)"
     ]
   }
 }

{predikit-0.4.1 → predikit-0.4.2}/.gitignore RENAMED Viewed

@@ -10,3 +10,4 @@ htmlcov/
 .venv/
 venv/
 *.egg
+file.py

{predikit-0.4.1 → predikit-0.4.2}/CHANGELOG.md RENAMED Viewed

@@ -6,6 +6,20 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 ## [Unreleased]
+## [0.4.2] - 2026-06-13
+### Changed
+- Redesigned PyPI/README hero: logo, centered tagline, and badges in a unified `<p align="center">` block
+- Tagline moved from a `##` heading to a proper descriptive paragraph
+- Badges converted to centered HTML `<img>` links for consistent rendering on PyPI
+- Quick code teaser repositioned directly below badges (before Table of Contents)
+- "Field naming rule" added to Table of Contents
+- `ainvoke()` added to `ModelTool` Core API reference table
+- `ModelEnsemble` Core API subsection added with constructor signature and full strategy table
+- Project Traffic / download badge moved to bottom of README
+- Development Status classifier bumped from `3 - Alpha` to `4 - Beta` in `pyproject.toml`
+- Removed CI test status badge from README
 ## [0.4.1] - 2026-06-02
 ### Added

{predikit-0.4.1 → predikit-0.4.2}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: predikit
-Version: 0.4.1
+Version: 0.4.2
 Summary: Turn any trained sklearn/XGBoost model into an LLM-callable tool with auto-generated schemas and typed I/O.
 Project-URL: Homepage, https://github.com/Tejas-TA/predikit
 Project-URL: Repository, https://github.com/Tejas-TA/predikit
@@ -10,7 +10,7 @@ Author-email: Tejas Tumakuru Ashok <tejasta@gmail.com>
 License: MIT
 License-File: LICENSE
 Keywords: agents,function-calling,llm,ml-tools,sklearn,xgboost
-Classifier: Development Status :: 3 - Alpha
+Classifier: Development Status :: 4 - Beta
 Classifier: Intended Audience :: Developers
 Classifier: Intended Audience :: Science/Research
 Classifier: License :: OSI Approved :: MIT License
@@ -48,34 +48,46 @@ Provides-Extra: xgboost
 Requires-Dist: xgboost>=1.7; extra == 'xgboost'
 Description-Content-Type: text/markdown
-# predikit
-[![PyPI version](https://img.shields.io/pypi/v/predikit.svg)](https://pypi.org/project/predikit/)
-[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/)
-[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
-[![CI](https://github.com/Tejas-TA/predikit/actions/workflows/test.yml/badge.svg)](https://github.com/Tejas-TA/predikit/actions/workflows/test.yml)
-[![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)
+<p align="center">
+  <picture>
+    <source srcset="https://raw.githubusercontent.com/Tejas-TA/predikit/main/docs/logo.gif">
+    <img src="https://raw.githubusercontent.com/Tejas-TA/predikit/main/docs/logo.png" alt="predikit" width="500"/>
+  </picture>
+</p>
+<p align="center">
+  Turn any trained scikit-learn or XGBoost model into an LLM-callable tool —<br/>
+  auto-generated JSON schemas, typed I/O, zero boilerplate.
+</p>
+<p align="center">
+  <a href="https://pypi.org/project/predikit/"><img src="https://img.shields.io/pypi/v/predikit.svg" alt="PyPI version"/></a>
+  <a href="https://www.python.org/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"/></a>
+  <a href="LICENSE"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT"/></a>
+  <a href="https://github.com/astral-sh/ruff"><img src="https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json" alt="Ruff"/></a>
+</p>
+<p align="center">
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=week&units=international_system&left_color=grey&right_color=blue&left_text=weekly+downloads" alt="Weekly Downloads"/></a>
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=month&units=international_system&left_color=grey&right_color=blue&left_text=monthly+downloads" alt="Monthly Downloads"/></a>
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=total&units=international_system&left_color=grey&right_color=blue&left_text=total+downloads" alt="Total Downloads"/></a>
+</p>
-### 📈 Project Traffic
-Detailed breakdown of downloads by version, region, and platform:
-[![Downloads](https://pepy.tech/badge/predikit?style=for-the-badge)](https://pepy.tech/project/predikit)
+```python
+tool = ModelTool(model=clf, name="classify_iris", ...)
+tool.to_openai()              # OpenAI function schema, ready to pass to the API
+tool.invoke({"sqft": 2200})   # → {"price_usd": 370730}
+```
 ## Table of Contents
 - [Install](#install)
 - [30-second example](#30-second-example)
 - [Core API](#core-api)
+- [Field naming rule](#field-naming-rule)
 - [Cookbook](#cookbook)
 - [Contributing](#contributing)
 - [License](#license)
-## Turn any trained scikit-learn or XGBoost model into an LLM-callable tool — auto-generated JSON schemas, typed I/O, zero boilerplate.
-```python
-tool = ModelTool(model=clf, name="classify_iris", ...)
-tool.to_openai()              # OpenAI function schema, ready to pass to the API
-tool.invoke({"sqft": 2200})   # → {"price_usd": 370730}
-```
 ## Install
 ```bash
@@ -153,6 +165,7 @@ ModelTool(
 | Method | Returns | What it does |
 |--------|---------|--------------|
 | `.invoke(input_dict)` | `dict` | Validates → predicts → returns `{output_name: value}` |
+| `.ainvoke(input_dict)` | `dict` | Async version of `.invoke()` |
 | `.to_openai()` | `dict` | OpenAI function-calling schema |
 | `.to_langchain()` | `StructuredTool` | LangChain tool |
 | `.to_callable()` | `Callable` | Plain Python function |
@@ -168,6 +181,30 @@ registry.to_langchain()  # → list[StructuredTool]
 registry.get("name")     # → ModelTool
 ```
+### `ModelEnsemble`
+Call multiple models and reconcile their outputs in one step:
+```python
+ModelEnsemble(
+    tools: list[ModelTool],   # models to run in parallel
+    name: str,                # ensemble tool name the LLM sees
+    description: str,
+    strategy: str,            # "collect" | "mean" | "vote" | "weighted_mean" | "weighted_vote"
+    weights: list[float],     # optional, for weighted strategies
+)
+```
+| Strategy | Behaviour |
+|----------|-----------|
+| `"collect"` | Merges all outputs into one dict (tools can have different `output_name`) |
+| `"mean"` | Averages numeric outputs (all tools must share `output_name`) |
+| `"vote"` | Majority class vote (all tools must share `output_name`) |
+| `"weighted_mean"` | Weighted average — provide a `weights` list |
+| `"weighted_vote"` | Weighted majority vote — provide a `weights` list |
+`ModelEnsemble` exposes the same `.invoke()`, `.ainvoke()`, `.to_openai()`, and `.to_langchain()` interface as `ModelTool`.
 ## Field naming rule
 **Your Pydantic schema field names must exactly match the column names the model was trained on.**
@@ -287,8 +324,6 @@ Only applies to classifiers that implement `predict_proba`. Regressors are unaff
 ### Multi-model ensemble
-Call multiple models and reconcile their outputs in one step:
 ```python
 from predikit import ModelEnsemble, ToolRegistry
@@ -303,12 +338,6 @@ result  = ensemble.invoke(inputs)  # → {"price_usd": 370112}
 schema  = ensemble.to_openai()     # works exactly like ModelTool
 ```
-| strategy | behaviour |
-|----------|-----------|
-| `"collect"` | merges all outputs into one dict (tools can have different `output_name`) |
-| `"mean"` | averages numeric outputs (all tools must share `output_name`) |
-| `"vote"` | majority class vote (all tools must share `output_name`) |
 Register ensembles alongside individual tools:
 ```python
@@ -379,3 +408,4 @@ See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, code style, and PR
 ## License
 MIT © Tejas Tumakuru Ashok

{predikit-0.4.1 → predikit-0.4.2}/README.md RENAMED Viewed

@@ -1,331 +1,361 @@
-# predikit
-[![PyPI version](https://img.shields.io/pypi/v/predikit.svg)](https://pypi.org/project/predikit/)
-[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/)
-[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
-[![CI](https://github.com/Tejas-TA/predikit/actions/workflows/test.yml/badge.svg)](https://github.com/Tejas-TA/predikit/actions/workflows/test.yml)
-[![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)
-### 📈 Project Traffic
-Detailed breakdown of downloads by version, region, and platform:
-[![Downloads](https://pepy.tech/badge/predikit?style=for-the-badge)](https://pepy.tech/project/predikit)
-## Table of Contents
-- [Install](#install)
-- [30-second example](#30-second-example)
-- [Core API](#core-api)
-- [Cookbook](#cookbook)
-- [Contributing](#contributing)
-- [License](#license)
-## Turn any trained scikit-learn or XGBoost model into an LLM-callable tool — auto-generated JSON schemas, typed I/O, zero boilerplate.
-```python
-tool = ModelTool(model=clf, name="classify_iris", ...)
-tool.to_openai()              # OpenAI function schema, ready to pass to the API
-tool.invoke({"sqft": 2200})   # → {"price_usd": 370730}
-```
-## Install
-```bash
-pip install predikit
-# With XGBoost support
-pip install predikit[xgboost]
-# With LangChain support
-pip install predikit[langchain]
-# With MLflow Model Registry support
-pip install predikit[mlflow]
-# With Snowflake Model Registry support
-pip install predikit[snowflake]
-```
-## 30-second example
-```python
-from pydantic import BaseModel, Field
-from sklearn.datasets import load_iris
-from sklearn.linear_model import LogisticRegression
-from predikit import ModelTool
-# Train
-X, y = load_iris(return_X_y=True)
-clf = LogisticRegression(max_iter=200).fit(X, y)
-# Define what the LLM will pass in
-class IrisInput(BaseModel):
-    sepal_length: float = Field(description="Sepal length in cm")
-    sepal_width:  float = Field(description="Sepal width in cm")
-    petal_length: float = Field(description="Petal length in cm")
-    petal_width:  float = Field(description="Petal width in cm")
-# Wrap the model
-tool = ModelTool(
-    model=clf,
-    name="classify_iris",
-    description="Classify an iris flower: 0=setosa, 1=versicolor, 2=virginica.",
-    input_schema=IrisInput,
-    output_name="species",
-    output_description="Predicted species index",
-)
-# Get an OpenAI-ready schema
-import json
-print(json.dumps(tool.to_openai(), indent=2))
-# Call it directly
-tool.invoke({
-    "sepal_length": 5.1, "sepal_width": 3.5,
-    "petal_length": 1.4, "petal_width": 0.2,
-})
-# → {"species": 0}
-```
-## Core API
-### `ModelTool`
-```python
-ModelTool(
-    model,               # fitted sklearn-compatible estimator
-    name: str,           # tool name the LLM sees
-    description: str,    # tool description the LLM sees
-    input_schema,        # Pydantic BaseModel describing inputs
-    output_name: str,    # key for the prediction in the returned dict
-    output_description: str,
-)
-```
-| Method | Returns | What it does |
-|--------|---------|--------------|
-| `.invoke(input_dict)` | `dict` | Validates → predicts → returns `{output_name: value}` |
-| `.to_openai()` | `dict` | OpenAI function-calling schema |
-| `.to_langchain()` | `StructuredTool` | LangChain tool |
-| `.to_callable()` | `Callable` | Plain Python function |
-### `ToolRegistry`
-Group multiple tools for bulk export:
-```python
-registry = ToolRegistry([price_tool, risk_tool])
-registry.to_openai()     # → list[dict], pass directly to OpenAI
-registry.to_langchain()  # → list[StructuredTool]
-registry.get("name")     # → ModelTool
-```
-## Field naming rule
-**Your Pydantic schema field names must exactly match the column names the model was trained on.**
-predikit maps inputs to features by name, not position. If you trained on a DataFrame with columns `["sqft", "bedrooms"]`, your schema fields must be `sqft` and `bedrooms` — not `sq_ft`, not `Sqft`.
-```python
-# ✓ Columns match: sqft, bedrooms, bathrooms
-class GoodInput(BaseModel):
-    sqft:      float
-    bedrooms:  float
-    bathrooms: float
-# ✗ Name mismatch — raises ValueError at runtime
-class BadInput(BaseModel):
-    square_footage: float  # model expects "sqft"
-    beds:           float  # model expects "bedrooms"
-    baths:          float  # model expects "bathrooms"
-```
-When there's a mismatch, predikit tells you exactly which names are wrong:
-```
-ValueError: Input schema is missing model features: ['sqft', 'bedrooms'].
-Schema has: ['square_footage', 'beds', 'bathrooms'], model expects: ['sqft', 'bedrooms', 'bathrooms']
-```
-> **Tip:** If you trained with a numpy array (no DataFrame), predikit has no feature names to check — it uses your schema's field definition order instead.
-## Cookbook
-### XGBoost regression
-```python
-from xgboost import XGBRegressor
-from predikit import ModelTool
-reg = XGBRegressor().fit(X_train, y_train)
-class HouseInput(BaseModel):
-    sqft:       float
-    bedrooms:   float
-    year_built: float
-tool = ModelTool(
-    model=reg,
-    name="price_estimate",
-    description="Predict home price in USD.",
-    input_schema=HouseInput,
-    output_name="price_usd",
-    output_description="Predicted sale price in USD",
-)
-```
-### Multiple tools in one registry
-```python
-registry = ToolRegistry([price_tool, risk_tool, demand_tool])
-# OpenAI
-response = client.chat.completions.create(
-    model="gpt-4o",
-    tools=registry.to_openai(),
-    ...
-)
-# LangChain
-agent = initialize_agent(tools=registry.to_langchain(), ...)
-```
-### Bool inputs from an LLM
-LLMs sometimes return `"yes"`, `"true"`, or `"1"` for boolean fields. predikit coerces these automatically before Pydantic validation:
-```python
-class Input(BaseModel):
-    has_pool: bool
-tool.invoke({"has_pool": "yes"})   # → coerced to True
-tool.invoke({"has_pool": "false"}) # → coerced to False
-tool.invoke({"has_pool": "maybe"}) # → raises ValueError with clear message
-```
-Supported strings: `true/false`, `yes/no`, `1/0`, `on/off`.
-### Confidence-aware routing
-Route uncertain predictions to a fallback tool, or raise an error the agent can catch:
-```python
-from predikit import ModelTool, LowConfidenceError
-tool = ModelTool(
-    model=clf,
-    name="churn_risk",
-    description="Predict member churn risk.",
-    input_schema=MemberInput,
-    output_name="churn_probability",
-    output_description="Probability of churn (0–1)",
-    confidence_threshold=0.80,       # classifiers with predict_proba only
-    on_low_confidence="warn",        # "warn" | "raise" | "fallback"
-    fallback_tool=rule_based_tool,   # used when mode="fallback"
-)
-result = tool.invoke(inputs)
-if result.get("_low_confidence"):
-    print(f"Uncertain ({result['_confidence']:.2f}) — consider routing to a human")
-```
-| mode | behaviour |
-|------|-----------|
-| `"warn"` | returns prediction + `_confidence` + `_low_confidence: True` |
-| `"raise"` | raises `LowConfidenceError` |
-| `"fallback"` | invokes `fallback_tool` and returns its result |
-Only applies to classifiers that implement `predict_proba`. Regressors are unaffected.
-### Multi-model ensemble
-Call multiple models and reconcile their outputs in one step:
-```python
-from predikit import ModelEnsemble, ToolRegistry
-ensemble = ModelEnsemble(
-    tools=[price_tool_a, price_tool_b],
-    name="averaged_price",
-    description="Ensemble price: mean of two XGBoost models.",
-    strategy="mean",              # "collect" | "mean" | "vote"
-)
-result  = ensemble.invoke(inputs)  # → {"price_usd": 370112}
-schema  = ensemble.to_openai()     # works exactly like ModelTool
-```
-| strategy | behaviour |
-|----------|-----------|
-| `"collect"` | merges all outputs into one dict (tools can have different `output_name`) |
-| `"mean"` | averages numeric outputs (all tools must share `output_name`) |
-| `"vote"` | majority class vote (all tools must share `output_name`) |
-Register ensembles alongside individual tools:
-```python
-registry = ToolRegistry(tools=[price_tool], ensembles=[ensemble])
-registry.to_openai()  # includes both tools and ensembles
-```
-### MLflow Model Registry loader
-Load a registered MLflow model directly — no manual `.load_model()` call:
-```python
-from predikit.loaders import from_mlflow
-tool = from_mlflow(
-    model_uri="models:/churn-classifier/Production",
-    name="churn_risk",
-    description="Predict member churn probability.",
-    input_schema=MemberInput,
-    output_name="churn_probability",
-    output_description="Churn probability 0–1",
-)
-tool.invoke({"tenure_months": 24, "trips_last_year": 2, "avg_spend": 500})
-# → {"churn_probability": 0.73}
-```
-The loader auto-detects `classes_` and `feature_names_in_` from the underlying sklearn model, so confidence routing and ensemble work unchanged. Requires `pip install predikit[mlflow]`.
-### Snowflake Model Registry loader
-Load a model registered in the Snowflake Model Registry via the Snowpark ML Python library:
-```python
-from predikit.loaders import from_snowflake
-tool = from_snowflake(
-    session=snowpark_session,
-    model_name="VACATION_CHURN",
-    model_version="V3",
-    name="churn_risk",
-    description="Churn classifier.",
-    input_schema=MemberInput,
-    output_name="churn_probability",
-    output_description="Churn probability 0–1",
-    output_method="predict",   # method to call on the Snowflake model object
-)
-```
-Pass `output_method="predict_proba"` or any other method your Snowflake model exposes. The returned `ModelTool` is identical to one built directly — all exporters, confidence routing, and ensemble strategies work as-is. Requires `pip install predikit[snowflake]`.
-### Orlando real estate demo
-See [`examples/03_orlando_real_estate.py`](examples/03_orlando_real_estate.py) for a full end-to-end walkthrough: synthetic dataset → XGBoost training → `ModelTool` → registry → OpenAI schema → prediction.
-## Roadmap
-Planned for later releases:
-- HuggingFace / PyTorch / TensorFlow model support
-- Streaming inference support
-- OpenAI Assistants API integration
-## Contributing
-See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, code style, and PR guidelines. The [CHANGELOG](CHANGELOG.md) tracks notable changes per release.
-## License
-MIT © Tejas Tumakuru Ashok
+<p align="center">
+  <picture>
+    <source srcset="https://raw.githubusercontent.com/Tejas-TA/predikit/main/docs/logo.gif">
+    <img src="https://raw.githubusercontent.com/Tejas-TA/predikit/main/docs/logo.png" alt="predikit" width="500"/>
+  </picture>
+</p>
+<p align="center">
+  Turn any trained scikit-learn or XGBoost model into an LLM-callable tool —<br/>
+  auto-generated JSON schemas, typed I/O, zero boilerplate.
+</p>
+<p align="center">
+  <a href="https://pypi.org/project/predikit/"><img src="https://img.shields.io/pypi/v/predikit.svg" alt="PyPI version"/></a>
+  <a href="https://www.python.org/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"/></a>
+  <a href="LICENSE"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT"/></a>
+  <a href="https://github.com/astral-sh/ruff"><img src="https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json" alt="Ruff"/></a>
+</p>
+<p align="center">
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=week&units=international_system&left_color=grey&right_color=blue&left_text=weekly+downloads" alt="Weekly Downloads"/></a>
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=month&units=international_system&left_color=grey&right_color=blue&left_text=monthly+downloads" alt="Monthly Downloads"/></a>
+  <a href="https://pepy.tech/project/predikit"><img src="https://static.pepy.tech/personalized-badge/predikit?period=total&units=international_system&left_color=grey&right_color=blue&left_text=total+downloads" alt="Total Downloads"/></a>
+</p>
+```python
+tool = ModelTool(model=clf, name="classify_iris", ...)
+tool.to_openai()              # OpenAI function schema, ready to pass to the API
+tool.invoke({"sqft": 2200})   # → {"price_usd": 370730}
+```
+## Table of Contents
+- [Install](#install)
+- [30-second example](#30-second-example)
+- [Core API](#core-api)
+- [Field naming rule](#field-naming-rule)
+- [Cookbook](#cookbook)
+- [Contributing](#contributing)
+- [License](#license)
+## Install
+```bash
+pip install predikit
+# With XGBoost support
+pip install predikit[xgboost]
+# With LangChain support
+pip install predikit[langchain]
+# With MLflow Model Registry support
+pip install predikit[mlflow]
+# With Snowflake Model Registry support
+pip install predikit[snowflake]
+```
+## 30-second example
+```python
+from pydantic import BaseModel, Field
+from sklearn.datasets import load_iris
+from sklearn.linear_model import LogisticRegression
+from predikit import ModelTool
+# Train
+X, y = load_iris(return_X_y=True)
+clf = LogisticRegression(max_iter=200).fit(X, y)
+# Define what the LLM will pass in
+class IrisInput(BaseModel):
+    sepal_length: float = Field(description="Sepal length in cm")
+    sepal_width:  float = Field(description="Sepal width in cm")
+    petal_length: float = Field(description="Petal length in cm")
+    petal_width:  float = Field(description="Petal width in cm")
+# Wrap the model
+tool = ModelTool(
+    model=clf,
+    name="classify_iris",
+    description="Classify an iris flower: 0=setosa, 1=versicolor, 2=virginica.",
+    input_schema=IrisInput,
+    output_name="species",
+    output_description="Predicted species index",
+)
+# Get an OpenAI-ready schema
+import json
+print(json.dumps(tool.to_openai(), indent=2))
+# Call it directly
+tool.invoke({
+    "sepal_length": 5.1, "sepal_width": 3.5,
+    "petal_length": 1.4, "petal_width": 0.2,
+})
+# → {"species": 0}
+```
+## Core API
+### `ModelTool`
+```python
+ModelTool(
+    model,               # fitted sklearn-compatible estimator
+    name: str,           # tool name the LLM sees
+    description: str,    # tool description the LLM sees
+    input_schema,        # Pydantic BaseModel describing inputs
+    output_name: str,    # key for the prediction in the returned dict
+    output_description: str,
+)
+```
+| Method | Returns | What it does |
+|--------|---------|--------------|
+| `.invoke(input_dict)` | `dict` | Validates → predicts → returns `{output_name: value}` |
+| `.ainvoke(input_dict)` | `dict` | Async version of `.invoke()` |
+| `.to_openai()` | `dict` | OpenAI function-calling schema |
+| `.to_langchain()` | `StructuredTool` | LangChain tool |
+| `.to_callable()` | `Callable` | Plain Python function |
+### `ToolRegistry`
+Group multiple tools for bulk export:
+```python
+registry = ToolRegistry([price_tool, risk_tool])
+registry.to_openai()     # → list[dict], pass directly to OpenAI
+registry.to_langchain()  # → list[StructuredTool]
+registry.get("name")     # → ModelTool
+```
+### `ModelEnsemble`
+Call multiple models and reconcile their outputs in one step:
+```python
+ModelEnsemble(
+    tools: list[ModelTool],   # models to run in parallel
+    name: str,                # ensemble tool name the LLM sees
+    description: str,
+    strategy: str,            # "collect" | "mean" | "vote" | "weighted_mean" | "weighted_vote"
+    weights: list[float],     # optional, for weighted strategies
+)
+```
+| Strategy | Behaviour |
+|----------|-----------|
+| `"collect"` | Merges all outputs into one dict (tools can have different `output_name`) |
+| `"mean"` | Averages numeric outputs (all tools must share `output_name`) |
+| `"vote"` | Majority class vote (all tools must share `output_name`) |
+| `"weighted_mean"` | Weighted average — provide a `weights` list |
+| `"weighted_vote"` | Weighted majority vote — provide a `weights` list |
+`ModelEnsemble` exposes the same `.invoke()`, `.ainvoke()`, `.to_openai()`, and `.to_langchain()` interface as `ModelTool`.
+## Field naming rule
+**Your Pydantic schema field names must exactly match the column names the model was trained on.**
+predikit maps inputs to features by name, not position. If you trained on a DataFrame with columns `["sqft", "bedrooms"]`, your schema fields must be `sqft` and `bedrooms` — not `sq_ft`, not `Sqft`.
+```python
+# ✓ Columns match: sqft, bedrooms, bathrooms
+class GoodInput(BaseModel):
+    sqft:      float
+    bedrooms:  float
+    bathrooms: float
+# ✗ Name mismatch — raises ValueError at runtime
+class BadInput(BaseModel):
+    square_footage: float  # model expects "sqft"
+    beds:           float  # model expects "bedrooms"
+    baths:          float  # model expects "bathrooms"
+```
+When there's a mismatch, predikit tells you exactly which names are wrong:
+```
+ValueError: Input schema is missing model features: ['sqft', 'bedrooms'].
+Schema has: ['square_footage', 'beds', 'bathrooms'], model expects: ['sqft', 'bedrooms', 'bathrooms']
+```
+> **Tip:** If you trained with a numpy array (no DataFrame), predikit has no feature names to check — it uses your schema's field definition order instead.
+## Cookbook
+### XGBoost regression
+```python
+from xgboost import XGBRegressor
+from predikit import ModelTool
+reg = XGBRegressor().fit(X_train, y_train)
+class HouseInput(BaseModel):
+    sqft:       float
+    bedrooms:   float
+    year_built: float
+tool = ModelTool(
+    model=reg,
+    name="price_estimate",
+    description="Predict home price in USD.",
+    input_schema=HouseInput,
+    output_name="price_usd",
+    output_description="Predicted sale price in USD",
+)
+```
+### Multiple tools in one registry
+```python
+registry = ToolRegistry([price_tool, risk_tool, demand_tool])
+# OpenAI
+response = client.chat.completions.create(
+    model="gpt-4o",
+    tools=registry.to_openai(),
+    ...
+)
+# LangChain
+agent = initialize_agent(tools=registry.to_langchain(), ...)
+```
+### Bool inputs from an LLM
+LLMs sometimes return `"yes"`, `"true"`, or `"1"` for boolean fields. predikit coerces these automatically before Pydantic validation:
+```python
+class Input(BaseModel):
+    has_pool: bool
+tool.invoke({"has_pool": "yes"})   # → coerced to True
+tool.invoke({"has_pool": "false"}) # → coerced to False
+tool.invoke({"has_pool": "maybe"}) # → raises ValueError with clear message
+```
+Supported strings: `true/false`, `yes/no`, `1/0`, `on/off`.
+### Confidence-aware routing
+Route uncertain predictions to a fallback tool, or raise an error the agent can catch:
+```python
+from predikit import ModelTool, LowConfidenceError
+tool = ModelTool(
+    model=clf,
+    name="churn_risk",
+    description="Predict member churn risk.",
+    input_schema=MemberInput,
+    output_name="churn_probability",
+    output_description="Probability of churn (0–1)",
+    confidence_threshold=0.80,       # classifiers with predict_proba only
+    on_low_confidence="warn",        # "warn" | "raise" | "fallback"
+    fallback_tool=rule_based_tool,   # used when mode="fallback"
+)
+result = tool.invoke(inputs)
+if result.get("_low_confidence"):
+    print(f"Uncertain ({result['_confidence']:.2f}) — consider routing to a human")
+```
+| mode | behaviour |
+|------|-----------|
+| `"warn"` | returns prediction + `_confidence` + `_low_confidence: True` |
+| `"raise"` | raises `LowConfidenceError` |
+| `"fallback"` | invokes `fallback_tool` and returns its result |
+Only applies to classifiers that implement `predict_proba`. Regressors are unaffected.
+### Multi-model ensemble
+```python
+from predikit import ModelEnsemble, ToolRegistry
+ensemble = ModelEnsemble(
+    tools=[price_tool_a, price_tool_b],
+    name="averaged_price",
+    description="Ensemble price: mean of two XGBoost models.",
+    strategy="mean",              # "collect" | "mean" | "vote"
+)
+result  = ensemble.invoke(inputs)  # → {"price_usd": 370112}
+schema  = ensemble.to_openai()     # works exactly like ModelTool
+```
+Register ensembles alongside individual tools:
+```python
+registry = ToolRegistry(tools=[price_tool], ensembles=[ensemble])
+registry.to_openai()  # includes both tools and ensembles
+```
+### MLflow Model Registry loader
+Load a registered MLflow model directly — no manual `.load_model()` call:
+```python
+from predikit.loaders import from_mlflow
+tool = from_mlflow(
+    model_uri="models:/churn-classifier/Production",
+    name="churn_risk",
+    description="Predict member churn probability.",
+    input_schema=MemberInput,
+    output_name="churn_probability",
+    output_description="Churn probability 0–1",
+)
+tool.invoke({"tenure_months": 24, "trips_last_year": 2, "avg_spend": 500})
+# → {"churn_probability": 0.73}
+```
+The loader auto-detects `classes_` and `feature_names_in_` from the underlying sklearn model, so confidence routing and ensemble work unchanged. Requires `pip install predikit[mlflow]`.
+### Snowflake Model Registry loader
+Load a model registered in the Snowflake Model Registry via the Snowpark ML Python library:
+```python
+from predikit.loaders import from_snowflake
+tool = from_snowflake(
+    session=snowpark_session,
+    model_name="VACATION_CHURN",
+    model_version="V3",
+    name="churn_risk",
+    description="Churn classifier.",
+    input_schema=MemberInput,
+    output_name="churn_probability",
+    output_description="Churn probability 0–1",
+    output_method="predict",   # method to call on the Snowflake model object
+)
+```
+Pass `output_method="predict_proba"` or any other method your Snowflake model exposes. The returned `ModelTool` is identical to one built directly — all exporters, confidence routing, and ensemble strategies work as-is. Requires `pip install predikit[snowflake]`.
+### Orlando real estate demo
+See [`examples/03_orlando_real_estate.py`](examples/03_orlando_real_estate.py) for a full end-to-end walkthrough: synthetic dataset → XGBoost training → `ModelTool` → registry → OpenAI schema → prediction.
+## Roadmap
+Planned for later releases:
+- HuggingFace / PyTorch / TensorFlow model support
+- Streaming inference support
+- OpenAI Assistants API integration
+## Contributing
+See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, code style, and PR guidelines. The [CHANGELOG](CHANGELOG.md) tracks notable changes per release.
+## License
+MIT © Tejas Tumakuru Ashok

predikit-0.4.2/docs/logo.gif ADDED Viewed

Binary file

predikit-0.4.2/docs/logo.png ADDED Viewed

Binary file

{predikit-0.4.1 → predikit-0.4.2}/pyproject.toml RENAMED Viewed

@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 [project]
 name = "predikit"
-version = "0.4.1"
+version = "0.4.2"
 description = "Turn any trained sklearn/XGBoost model into an LLM-callable tool with auto-generated schemas and typed I/O."
 readme = "README.md"
 license = {text = "MIT"}
@@ -14,7 +14,7 @@ authors = [
 ]
 keywords = ["llm", "agents", "sklearn", "xgboost", "function-calling", "ml-tools"]
 classifiers = [
-    "Development Status :: 3 - Alpha",
+    "Development Status :: 4 - Beta",
     "License :: OSI Approved :: MIT License",
     "Programming Language :: Python :: 3",
     "Programming Language :: Python :: 3.10",

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/__init__.py RENAMED Viewed

@@ -4,4 +4,4 @@ from .registry import ToolRegistry
 from .tool import ModelTool
 __all__ = ["ModelTool", "ToolRegistry", "ModelEnsemble", "LowConfidenceError"]
-__version__ = "0.4.1"
+__version__ = "0.4.2"

{predikit-0.4.1 → predikit-0.4.2}/.github/ISSUE_TEMPLATE/bug_report.md RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/.github/ISSUE_TEMPLATE/feature_request.md RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/.github/workflows/publish.yml RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/.github/workflows/test.yml RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/.pre-commit-config.yaml RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/CLAUDE.md RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/CONTRIBUTING.md RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/LICENSE RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/01_basic_sklearn.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/02_xgboost_regression.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/03_orlando_real_estate.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/04_confidence_routing.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/05_multi_model_ensemble.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/06_mlflow_loader.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/examples/07_snowflake_loader.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/cli.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/coerce.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/ensemble.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/exceptions.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/exporters/__init__.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/exporters/langchain.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/exporters/openai.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/introspect.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/loaders/__init__.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/loaders/mlflow.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/loaders/snowflake.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/registry.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/src/predikit/tool.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/__init__.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_cli.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_coerce.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_confidence.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_ensemble.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_exporters_openai.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_introspect.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_loaders_mlflow.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_loaders_snowflake.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_logging.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_registry.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_tool.py RENAMED Viewed

File without changes

{predikit-0.4.1 → predikit-0.4.2}/tests/test_weighted_ensemble.py RENAMED Viewed

File without changes

predikit 0.4.1__tar.gz → 0.4.2__tar.gz

predikit 0.4.1tar.gz → 0.4.2tar.gz