ai-lib-python 0.5.0__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (84)
  1. ai_lib_python/__init__.py +43 -0
  2. ai_lib_python/batch/__init__.py +15 -0
  3. ai_lib_python/batch/collector.py +244 -0
  4. ai_lib_python/batch/executor.py +224 -0
  5. ai_lib_python/cache/__init__.py +26 -0
  6. ai_lib_python/cache/backends.py +380 -0
  7. ai_lib_python/cache/key.py +237 -0
  8. ai_lib_python/cache/manager.py +332 -0
  9. ai_lib_python/client/__init__.py +37 -0
  10. ai_lib_python/client/builder.py +528 -0
  11. ai_lib_python/client/cancel.py +368 -0
  12. ai_lib_python/client/core.py +433 -0
  13. ai_lib_python/client/response.py +134 -0
  14. ai_lib_python/embeddings/__init__.py +36 -0
  15. ai_lib_python/embeddings/client.py +339 -0
  16. ai_lib_python/embeddings/types.py +234 -0
  17. ai_lib_python/embeddings/vectors.py +246 -0
  18. ai_lib_python/errors/__init__.py +41 -0
  19. ai_lib_python/errors/base.py +316 -0
  20. ai_lib_python/errors/classification.py +210 -0
  21. ai_lib_python/guardrails/__init__.py +35 -0
  22. ai_lib_python/guardrails/base.py +336 -0
  23. ai_lib_python/guardrails/filters.py +583 -0
  24. ai_lib_python/guardrails/validators.py +475 -0
  25. ai_lib_python/pipeline/__init__.py +55 -0
  26. ai_lib_python/pipeline/accumulate.py +248 -0
  27. ai_lib_python/pipeline/base.py +240 -0
  28. ai_lib_python/pipeline/decode.py +281 -0
  29. ai_lib_python/pipeline/event_map.py +506 -0
  30. ai_lib_python/pipeline/fan_out.py +284 -0
  31. ai_lib_python/pipeline/select.py +297 -0
  32. ai_lib_python/plugins/__init__.py +32 -0
  33. ai_lib_python/plugins/base.py +294 -0
  34. ai_lib_python/plugins/hooks.py +296 -0
  35. ai_lib_python/plugins/middleware.py +285 -0
  36. ai_lib_python/plugins/registry.py +294 -0
  37. ai_lib_python/protocol/__init__.py +71 -0
  38. ai_lib_python/protocol/loader.py +317 -0
  39. ai_lib_python/protocol/manifest.py +385 -0
  40. ai_lib_python/protocol/validator.py +460 -0
  41. ai_lib_python/py.typed +1 -0
  42. ai_lib_python/resilience/__init__.py +102 -0
  43. ai_lib_python/resilience/backpressure.py +225 -0
  44. ai_lib_python/resilience/circuit_breaker.py +318 -0
  45. ai_lib_python/resilience/executor.py +343 -0
  46. ai_lib_python/resilience/fallback.py +341 -0
  47. ai_lib_python/resilience/preflight.py +413 -0
  48. ai_lib_python/resilience/rate_limiter.py +291 -0
  49. ai_lib_python/resilience/retry.py +299 -0
  50. ai_lib_python/resilience/signals.py +283 -0
  51. ai_lib_python/routing/__init__.py +118 -0
  52. ai_lib_python/routing/manager.py +593 -0
  53. ai_lib_python/routing/strategy.py +345 -0
  54. ai_lib_python/routing/types.py +397 -0
  55. ai_lib_python/structured/__init__.py +33 -0
  56. ai_lib_python/structured/json_mode.py +281 -0
  57. ai_lib_python/structured/schema.py +316 -0
  58. ai_lib_python/structured/validator.py +334 -0
  59. ai_lib_python/telemetry/__init__.py +127 -0
  60. ai_lib_python/telemetry/exporters/__init__.py +9 -0
  61. ai_lib_python/telemetry/exporters/prometheus.py +111 -0
  62. ai_lib_python/telemetry/feedback.py +446 -0
  63. ai_lib_python/telemetry/health.py +409 -0
  64. ai_lib_python/telemetry/logger.py +389 -0
  65. ai_lib_python/telemetry/metrics.py +496 -0
  66. ai_lib_python/telemetry/tracer.py +473 -0
  67. ai_lib_python/tokens/__init__.py +25 -0
  68. ai_lib_python/tokens/counter.py +282 -0
  69. ai_lib_python/tokens/estimator.py +286 -0
  70. ai_lib_python/transport/__init__.py +34 -0
  71. ai_lib_python/transport/auth.py +141 -0
  72. ai_lib_python/transport/http.py +364 -0
  73. ai_lib_python/transport/pool.py +425 -0
  74. ai_lib_python/types/__init__.py +41 -0
  75. ai_lib_python/types/events.py +343 -0
  76. ai_lib_python/types/message.py +332 -0
  77. ai_lib_python/types/tool.py +191 -0
  78. ai_lib_python/utils/__init__.py +21 -0
  79. ai_lib_python/utils/tool_call_assembler.py +317 -0
  80. ai_lib_python-0.5.0.dist-info/METADATA +837 -0
  81. ai_lib_python-0.5.0.dist-info/RECORD +84 -0
  82. ai_lib_python-0.5.0.dist-info/WHEEL +4 -0
  83. ai_lib_python-0.5.0.dist-info/licenses/LICENSE-APACHE +201 -0
  84. ai_lib_python-0.5.0.dist-info/licenses/LICENSE-MIT +21 -0
@@ -0,0 +1,837 @@
Metadata-Version: 2.4
Name: ai-lib-python
Version: 0.5.0
Summary: Official Python Runtime for AI-Protocol - The canonical Pythonic implementation for unified AI model interaction
Project-URL: Homepage, https://github.com/hiddenpath/ai-lib-python
Project-URL: Documentation, https://github.com/hiddenpath/ai-lib-python#readme
Project-URL: Repository, https://github.com/hiddenpath/ai-lib-python
Project-URL: Issues, https://github.com/hiddenpath/ai-lib-python/issues
Author: AI-Protocol Team
License-Expression: MIT OR Apache-2.0
License-File: LICENSE-APACHE
License-File: LICENSE-MIT
Keywords: ai,anthropic,claude,gpt,llm,openai,protocol,streaming
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Typing :: Typed
Requires-Python: >=3.10
Requires-Dist: fastjsonschema>=2.19
Requires-Dist: httpx>=0.25.0
Requires-Dist: jsonpath-ng>=1.6
Requires-Dist: pydantic-settings>=2.0
Requires-Dist: pydantic>=2.0
Requires-Dist: pyyaml>=6.0
Provides-Extra: dev
Requires-Dist: mypy>=1.8; extra == 'dev'
Requires-Dist: pytest-asyncio>=0.23; extra == 'dev'
Requires-Dist: pytest-cov>=4.0; extra == 'dev'
Requires-Dist: pytest-httpx>=0.30; extra == 'dev'
Requires-Dist: pytest>=8.0; extra == 'dev'
Requires-Dist: ruff>=0.2; extra == 'dev'
Requires-Dist: types-pyyaml>=6.0; extra == 'dev'
Provides-Extra: docs
Requires-Dist: mkdocs-material>=9.0; extra == 'docs'
Requires-Dist: mkdocs>=1.5; extra == 'docs'
Requires-Dist: mkdocstrings[python]>=0.24; extra == 'docs'
Provides-Extra: full
Requires-Dist: keyring>=24.0; extra == 'full'
Requires-Dist: opentelemetry-api>=1.20; extra == 'full'
Requires-Dist: opentelemetry-exporter-otlp>=1.20; extra == 'full'
Requires-Dist: opentelemetry-sdk>=1.20; extra == 'full'
Requires-Dist: tiktoken>=0.5; extra == 'full'
Requires-Dist: watchdog>=3.0; extra == 'full'
Provides-Extra: jupyter
Requires-Dist: ipywidgets>=8.0; extra == 'jupyter'
Provides-Extra: telemetry
Requires-Dist: opentelemetry-api>=1.20; extra == 'telemetry'
Requires-Dist: opentelemetry-exporter-otlp>=1.20; extra == 'telemetry'
Requires-Dist: opentelemetry-sdk>=1.20; extra == 'telemetry'
Provides-Extra: tokenizer
Requires-Dist: tiktoken>=0.5; extra == 'tokenizer'
Description-Content-Type: text/markdown

# ai-lib-python

**Official Python Runtime for AI-Protocol** - The canonical Pythonic implementation for unified AI model interaction.

[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
[![License](https://img.shields.io/badge/license-MIT%20OR%20Apache--2.0-green.svg)](LICENSE)
[![Tests](https://github.com/hiddenpath/ai-lib-python/actions/workflows/ci.yml/badge.svg)](https://github.com/hiddenpath/ai-lib-python/actions)
[![PyPI](https://img.shields.io/pypi/v/ai-lib-python.svg)](https://pypi.org/project/ai-lib-python/)

## Overview

`ai-lib-python` is the **official Python runtime** for the [AI-Protocol](https://github.com/hiddenpath/ai-protocol) specification. As the canonical Python implementation maintained by the AI-Protocol team, it embodies the core design principle:

> **All logic is operators, all configuration is protocol.**

Unlike traditional adapter libraries that hardcode provider-specific logic, `ai-lib-python` is a **protocol-driven runtime** that executes AI-Protocol specifications.

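To make "all configuration is protocol" concrete, a provider manifest might look roughly like the sketch below. The field names here are hypothetical illustrations, not the actual AI-Protocol schema; the SSE decoding and JSONPath selection they reference do, however, correspond to real modules in this package (`pipeline/decode.py`, `pipeline/select.py`).

```yaml
# Hypothetical manifest sketch -- field names are illustrative only.
provider: openai
base_url: https://api.openai.com/v1
endpoints:
  chat:
    path: /chat/completions
    method: POST
streaming:
  decoder: sse                                # server-sent events
  content_delta: $.choices[0].delta.content   # JSONPath selector
```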
## Features

- **Protocol-Driven**: All behavior is driven by YAML/JSON protocol files
- **Unified Interface**: Single API for all AI providers (OpenAI, Anthropic, Gemini, DeepSeek, etc.)
- **Streaming First**: Native async streaming with Python's `async for`
- **Type Safe**: Full type hints with Pydantic v2 models
- **Production Ready**: Built-in retry, rate limiting, circuit breaker, and fallback
- **Extensible**: Easy to add new providers via protocol configuration
- **Multimodal**: Support for text, images (base64/URL), and audio
- **Telemetry**: Structured logging, metrics, distributed tracing, and user feedback collection
- **Token Counting**: tiktoken integration and cost estimation
- **Connection Pooling**: Efficient HTTP connection management
- **Request Batching**: Parallel execution with concurrency control
- **Model Routing**: Smart model selection with load balancing strategies
- **Embeddings**: Embedding generation with vector operations
- **Structured Output**: JSON mode with schema validation
- **Response Caching**: Multi-backend caching with TTL support
- **Plugin System**: Extensible hooks and middleware architecture
- **Stream Cancellation**: Cooperative cancellation for streaming operations

## Installation

```bash
pip install ai-lib-python
```

With optional features:

```bash
# Full installation with all features
pip install ai-lib-python[full]

# For telemetry (OpenTelemetry integration)
pip install ai-lib-python[telemetry]

# For token counting (tiktoken)
pip install ai-lib-python[tokenizer]

# For Jupyter notebook integration
pip install ai-lib-python[jupyter]

# For development
pip install ai-lib-python[dev]
```

## Quick Start

### Basic Usage

```python
import asyncio
from ai_lib_python import AiClient, Message

async def main():
    # Create client with model
    client = await AiClient.create("openai/gpt-4o")

    # Simple chat completion
    response = await (
        client.chat()
        .user("Hello! What's 2+2?")
        .execute()
    )
    print(response.content)
    # Output: 2+2 equals 4.

    await client.close()

asyncio.run(main())
```

### Streaming

```python
async def stream_example():
    client = await AiClient.create("anthropic/claude-3-5-sonnet")

    async for event in (
        client.chat()
        .system("You are a helpful assistant.")
        .user("Tell me a short story.")
        .stream()
    ):
        if event.is_content_delta:
            print(event.as_content_delta.content, end="", flush=True)

    print()  # Newline at end
    await client.close()
```

### With Messages List

```python
from ai_lib_python import Message

messages = [
    Message.system("You are a Python expert."),
    Message.user("How do I read a file in Python?"),
]

response = await (
    client.chat()
    .messages(messages)
    .temperature(0.7)
    .max_tokens(1024)
    .execute()
)
```

### Tool Calling (Function Calling)

```python
from ai_lib_python import ToolDefinition

# Define a tool
weather_tool = ToolDefinition.from_function(
    name="get_weather",
    description="Get current weather for a location",
    parameters={
        "type": "object",
        "properties": {
            "location": {"type": "string", "description": "City name"},
            "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]}
        },
        "required": ["location"]
    }
)

# Use tool in request
response = await (
    client.chat()
    .user("What's the weather in Tokyo?")
    .tools([weather_tool])
    .execute()
)

# Check for tool calls
if response.tool_calls:
    for tool_call in response.tool_calls:
        print(f"Call {tool_call.function_name}: {tool_call.arguments}")
```

### Multimodal (Images)

```python
import base64

from ai_lib_python import Message, ContentBlock, MessageRole

# Image from URL
message = Message.user_with_image(
    "What's in this image?",
    image_url="https://example.com/image.jpg"
)

# Image from base64
with open("photo.jpg", "rb") as f:
    image_data = base64.b64encode(f.read()).decode()

message = Message(
    role=MessageRole.USER,
    content=[
        ContentBlock.text("Describe this image:"),
        ContentBlock.image_base64(image_data, "image/jpeg"),
    ]
)

response = await client.chat().messages([message]).execute()
```

### Production-Ready Configuration

```python
from ai_lib_python import AiClient

# Enable all resilience patterns
client = await (
    AiClient.builder()
    .model("openai/gpt-4o")
    .production_ready()  # Enables retry, rate limit, circuit breaker
    .with_fallbacks(["anthropic/claude-3-5-sonnet"])
    .build()
)

# Check resilience status
print(f"Circuit state: {client.circuit_state}")
print(f"In-flight requests: {client.current_inflight}")
print(client.get_resilience_stats())
```

### Custom Resilience Configuration

```python
from ai_lib_python import AiClient
from ai_lib_python.resilience import (
    RetryConfig,
    RateLimiterConfig,
    CircuitBreakerConfig,
)

client = await (
    AiClient.builder()
    .model("openai/gpt-4o")
    .with_retry(RetryConfig(
        max_retries=5,
        min_delay_ms=1000,
        max_delay_ms=30000,
    ))
    .with_rate_limit(RateLimiterConfig.from_rps(10))
    .with_circuit_breaker(CircuitBreakerConfig(
        failure_threshold=5,
        cooldown_seconds=30,
    ))
    .max_inflight(20)
    .build()
)
```

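The retry settings above drive exponential backoff with jitter. As a rough illustration of the delay schedule such a config implies (a sketch of the general pattern, not the library's actual implementation):

```python
import random

def backoff_delays(max_retries: int, min_delay_ms: int, max_delay_ms: int) -> list[float]:
    """Illustrative "full jitter" schedule: base delay doubles per attempt,
    capped at max_delay_ms, and the actual sleep is sampled in [0, base]."""
    delays = []
    for attempt in range(max_retries):
        base = min(min_delay_ms * (2 ** attempt), max_delay_ms)
        delays.append(random.uniform(0, base))
    return delays

# For max_retries=5, bases grow 1000, 2000, 4000, 8000, 16000 ms (capped at 30000).
print(backoff_delays(5, 1000, 30000))
```

Jitter spreads retries out so that many clients failing at once do not all retry at the same instant.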
### Context Manager

```python
async with await AiClient.create("openai/gpt-4o") as client:
    response = await client.chat().user("Hello!").execute()
    print(response.content)
# Client automatically closed
```

### Token Counting and Cost Estimation

```python
from ai_lib_python.tokens import TokenCounter, estimate_cost, get_model_pricing

# Count tokens
counter = TokenCounter.for_model("gpt-4o")
token_count = counter.count("Hello, how are you?")
print(f"Token count: {token_count}")

# Count message tokens
messages = [Message.user("Hello!"), Message.assistant("Hi there!")]
total_tokens = counter.count_messages(messages)

# Estimate cost
cost = estimate_cost(input_tokens=1000, output_tokens=500, model="gpt-4o")
print(f"Estimated cost: ${cost.total_cost:.4f}")

# Get model pricing info
pricing = get_model_pricing("gpt-4o")
print(f"Input: ${pricing.input_price_per_1k}/1K tokens")
print(f"Context window: {pricing.context_window} tokens")
```

### Metrics and Telemetry

```python
from ai_lib_python.telemetry import (
    get_logger,
    MetricsCollector,
    MetricLabels,
    Tracer,
)

# Structured logging
logger = get_logger("my_app")
logger.info("Request started", model="gpt-4o", tokens=100)

# Metrics collection
collector = MetricsCollector()
labels = MetricLabels(provider="openai", model="gpt-4o")
collector.record_request(labels, latency=0.5, status="success", tokens_in=100, tokens_out=50)

# Get metrics snapshot
snapshot = collector.get_snapshot()
print(f"Total requests: {snapshot.total_requests}")
print(f"P99 latency: {snapshot.latency_p99_ms:.2f}ms")

# Export to Prometheus format
prometheus_metrics = collector.to_prometheus()

# Distributed tracing
tracer = Tracer("my_service")
with tracer.span("api_call") as span:
    span.set_attribute("model", "gpt-4o")
    # ... do work
```

### Batch Processing

```python
from ai_lib_python.batch import BatchExecutor, BatchConfig

# Execute multiple requests concurrently
async def process_question(question: str) -> str:
    client = await AiClient.create("openai/gpt-4o")
    response = await client.chat().user(question).execute()
    await client.close()
    return response.content

questions = ["What is AI?", "What is Python?", "What is async?"]

executor = BatchExecutor(process_question, max_concurrent=5)
result = await executor.execute(questions)

print(f"Successful: {result.successful_count}")
print(f"Failed: {result.failed_count}")
for answer in result.get_successful_results():
    print(answer)
```

### Connection Pooling

```python
from ai_lib_python.transport import ConnectionPool, PoolConfig

# Create connection pool with custom config
pool = ConnectionPool(PoolConfig.high_throughput())

# Use pooled connections
async with pool:
    client = await pool.get_client("openai", "https://api.openai.com")
    response = await client.post("/v1/chat/completions", json=payload)

# Get pool statistics (keyed by provider)
stats = pool.get_stats()
print(f"Active connections: {stats['openai']['active_connections']}")
```

### Model Routing & Selection

```python
from ai_lib_python.routing import (
    ModelManager, ModelInfo, create_openai_models, create_anthropic_models,
    CostBasedSelector, QualityBasedSelector,
)

# Create a model manager with pre-configured models
manager = create_openai_models()
manager.merge(create_anthropic_models())

# Select model by capability
code_models = manager.filter_by_capability("code_generation")
print(f"Code models: {[m.name for m in code_models]}")

# Select cheapest model
selector = CostBasedSelector()
cheapest = selector.select(manager.list_models())
print(f"Cheapest: {cheapest.name} @ ${cheapest.pricing.input_cost_per_1k}/1K")

# Select highest quality model
quality_selector = QualityBasedSelector()
best = quality_selector.select(manager.list_models())
print(f"Best quality: {best.name}")

# Recommend model for use case
recommended = manager.recommend_for("chat")
```

### Stream Cancellation

```python
from ai_lib_python.client import create_cancel_pair, CancellableStream, CancelReason

async def cancellable_stream():
    client = await AiClient.create("openai/gpt-4o")

    # Create cancel token and handle
    token, handle = create_cancel_pair()

    # Start streaming with cancellation support
    stream = client.chat().user("Write a long story...").stream()
    cancellable = CancellableStream(stream, token)

    # In another task, you can cancel:
    # handle.cancel(CancelReason.USER_REQUEST)

    async for event in cancellable:
        if event.is_content_delta:
            print(event.as_content_delta.content, end="")

        # Check if cancelled
        if token.is_cancelled:
            print("\n[Cancelled]")
            break
```

### User Feedback Collection

```python
from ai_lib_python.telemetry import (
    RatingFeedback, ThumbsFeedback, ChoiceSelectionFeedback,
    InMemoryFeedbackSink, set_feedback_sink, report_feedback,
)

# Set up feedback collection
sink = InMemoryFeedbackSink(max_events=1000)
set_feedback_sink(sink)

# Report user feedback
await report_feedback(RatingFeedback(
    request_id="req-123",
    rating=5,
    category="helpfulness",
    comment="Great response!"
))

await report_feedback(ThumbsFeedback(
    request_id="req-456",
    is_positive=True
))

# Report multi-candidate selection (for A/B testing)
await report_feedback(ChoiceSelectionFeedback(
    request_id="req-789",
    chosen_index=0,
    rejected_indices=[1, 2],
    latency_to_select_ms=1500.0
))

# Retrieve feedback
all_feedback = sink.get_events()
request_feedback = sink.get_events_by_request("req-123")
```

### Embeddings

```python
from ai_lib_python.embeddings import (
    EmbeddingClient, cosine_similarity, find_most_similar
)

# Create embedding client
client = await EmbeddingClient.create("openai/text-embedding-3-small")

# Generate embeddings
response = await client.embed("Hello, world!")
embedding = response.first.vector
print(f"Dimensions: {len(embedding)}")

# Batch embeddings
texts = ["Hello", "World", "Python", "AI"]
response = await client.embed_batch(texts)

# Find most similar
query = response.embeddings[0].vector
candidates = [e.vector for e in response.embeddings[1:]]
results = find_most_similar(query, candidates, top_k=2)
for idx, score in results:
    print(f"Text '{texts[idx+1]}' similarity: {score:.4f}")

await client.close()
```

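`cosine_similarity` and `find_most_similar` compute standard vector math. A pure-Python sketch of what such helpers return (the library's actual implementations may differ in detail):

```python
import math

def cosine_sim(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot(a, b) / (|a| * |b|), in [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k_similar(query, candidates, top_k=2):
    """Rank candidate vectors by cosine similarity to the query."""
    scored = [(i, cosine_sim(query, c)) for i, c in enumerate(candidates)]
    return sorted(scored, key=lambda t: t[1], reverse=True)[:top_k]

# Identical direction scores 1.0; orthogonal vectors score 0.0.
print(top_k_similar([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]))
```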
### Response Caching

```python
from ai_lib_python.cache import CacheManager, CacheConfig, MemoryCache

# Create cache manager
cache = CacheManager(
    config=CacheConfig(default_ttl_seconds=3600),
    backend=MemoryCache(max_size=1000)
)

# Cache responses
key = cache.generate_key(model="gpt-4o", messages=messages)

# Check cache first
cached = await cache.get(key)
if cached:
    print("Cache hit!")
    response = cached
else:
    response = await client.chat().messages(messages).execute()
    await cache.set(key, response)

# Get cache statistics
stats = cache.stats()
print(f"Hit ratio: {stats.hit_ratio:.2%}")
```

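`generate_key` must be deterministic: identical requests have to hash to identical keys. One plausible way such a key can be derived — a sketch, not necessarily `CacheKeyGenerator`'s actual scheme:

```python
import hashlib
import json

def make_cache_key(model: str, messages: list[dict], **params) -> str:
    """Hash a canonical JSON encoding: sorted keys and fixed separators
    guarantee the same inputs always serialize to the same bytes."""
    payload = json.dumps(
        {"model": model, "messages": messages, "params": params},
        sort_keys=True, separators=(",", ":"),
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

k1 = make_cache_key("gpt-4o", [{"role": "user", "content": "hi"}], temperature=0.7)
k2 = make_cache_key("gpt-4o", [{"role": "user", "content": "hi"}], temperature=0.7)
assert k1 == k2  # same inputs -> same key
```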
### Plugin System

```python
from ai_lib_python.plugins import (
    Plugin, PluginContext, PluginRegistry, HookType, HookManager
)

# Create a custom plugin
class LoggingPlugin(Plugin):
    def name(self) -> str:
        return "logging"

    async def on_before_request(self, ctx: PluginContext) -> None:
        print(f"Request to {ctx.model}: {ctx.request}")

    async def on_after_response(self, ctx: PluginContext) -> None:
        print(f"Response received: {ctx.response}")

# Register plugin
registry = PluginRegistry()
await registry.register(LoggingPlugin())

# Use hooks for fine-grained control
hooks = HookManager()
hooks.register(HookType.BEFORE_REQUEST, "log", lambda ctx: print(f"Starting {ctx.model}"))

# Trigger hooks
ctx = PluginContext(model="gpt-4o", request={"messages": [...]})
await registry.trigger_before_request(ctx)
```

## Supported Providers

| Provider | Models | Streaming | Tools | Vision |
|----------|--------|-----------|-------|--------|
| OpenAI | GPT-4o, GPT-4, GPT-3.5 | βœ… | βœ… | βœ… |
| Anthropic | Claude 3.5, Claude 3 | βœ… | βœ… | βœ… |
| Google | Gemini Pro, Gemini Flash | βœ… | βœ… | βœ… |
| DeepSeek | DeepSeek Chat, Coder | βœ… | βœ… | ❌ |
| Qwen | Qwen2.5, Qwen-Max | βœ… | βœ… | βœ… |
| Groq | Llama, Mixtral | βœ… | βœ… | ❌ |
| Mistral | Mistral Large, Medium | βœ… | βœ… | ❌ |

## API Reference

### Core Classes

- **`AiClient`**: Main entry point for AI model interaction
- **`Message`**: Represents a chat message with role and content
- **`ContentBlock`**: Content blocks for multimodal messages
- **`ToolDefinition`**: Tool/function definition for function calling
- **`StreamingEvent`**: Events from streaming responses

### Resilience Classes

- **`RetryPolicy`**: Exponential backoff with jitter
- **`RateLimiter`**: Token bucket rate limiting
- **`CircuitBreaker`**: Circuit breaker pattern
- **`Backpressure`**: Concurrency limiting
- **`FallbackChain`**: Multi-target failover
- **`PreflightChecker`**: Unified request gating
- **`SignalsSnapshot`**: Runtime state aggregation

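`CircuitBreaker` follows the classic pattern: trip open after a run of failures, then admit a probe request once a cooldown elapses. A minimal stand-alone sketch of that state machine — illustrative only, not the library's implementation, though the `failure_threshold`/`cooldown_seconds` knobs map onto the same idea:

```python
import time

class SketchCircuitBreaker:
    """Toy circuit breaker: closed -> open after N failures -> half-open after cooldown."""

    def __init__(self, failure_threshold: int = 5, cooldown_seconds: float = 30.0):
        self.failure_threshold = failure_threshold
        self.cooldown_seconds = cooldown_seconds
        self.failures = 0
        self.opened_at: float | None = None

    @property
    def state(self) -> str:
        if self.opened_at is None:
            return "closed"
        if time.monotonic() - self.opened_at >= self.cooldown_seconds:
            return "half_open"  # allow one probe request through
        return "open"

    def record_failure(self) -> None:
        self.failures += 1
        if self.failures >= self.failure_threshold:
            self.opened_at = time.monotonic()

    def record_success(self) -> None:
        # A success resets the breaker to closed.
        self.failures = 0
        self.opened_at = None
```

While open, the breaker rejects calls immediately instead of letting them pile onto a failing provider; a half-open probe decides whether to close again.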
### Routing Classes

- **`ModelManager`**: Centralized model management
- **`ModelInfo`**: Model information with capabilities
- **`ModelArray`**: Load balancing across endpoints
- **`ModelSelectionStrategy`**: Selection strategies (Cost, Quality, Performance, etc.)

### Telemetry Classes

- **`AiLibLogger`**: Structured logging with masking
- **`MetricsCollector`**: Request metrics collection
- **`Tracer`**: Distributed tracing
- **`HealthChecker`**: Health monitoring
- **`FeedbackSink`**: User feedback collection

### Embedding Classes

- **`EmbeddingClient`**: Embedding generation client
- **`Embedding`**: Single embedding result
- **`EmbeddingResponse`**: Response with usage stats

### Token Classes

- **`TokenCounter`**: Token counting interface
- **`CostEstimate`**: Cost estimation result
- **`ModelPricing`**: Model pricing information

### Cache Classes

- **`CacheManager`**: High-level cache management
- **`CacheBackend`**: Cache backend interface (Memory, Disk, Null)
- **`CacheKeyGenerator`**: Deterministic key generation

### Batch Classes

- **`BatchCollector`**: Request grouping
- **`BatchExecutor`**: Parallel execution

### Plugin Classes

- **`Plugin`**: Base plugin class
- **`PluginRegistry`**: Plugin management
- **`HookManager`**: Event-driven hooks
- **`Middleware`**: Request/response chain

### Transport Classes

- **`ConnectionPool`**: HTTP connection pooling
- **`PoolConfig`**: Pool configuration

### Cancellation Classes

- **`CancelToken`**: Cooperative cancellation token
- **`CancelHandle`**: Public cancel interface
- **`CancellableStream`**: Cancellable async iterator

### Error Classes

- **`AiLibError`**: Base error class
- **`ProtocolError`**: Protocol loading/validation errors
- **`TransportError`**: HTTP transport errors
- **`RemoteError`**: API errors from providers

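Because the hierarchy is rooted at `AiLibError`, handlers can catch narrow cases first and fall back to the base class. The classes below are local stand-ins that mirror the documented names purely to illustrate catch order; the real classes live in `ai_lib_python.errors`:

```python
# Stand-in classes mirroring the documented hierarchy (illustrative only).
class AiLibError(Exception): ...
class TransportError(AiLibError): ...
class RemoteError(AiLibError): ...

def classify(exc: Exception) -> str:
    """Demonstrate catch order: subclasses first, base class last."""
    try:
        raise exc
    except TransportError:
        return "retry: network-level failure"
    except RemoteError:
        return "inspect: provider returned an error"
    except AiLibError:
        return "library error"

print(classify(TransportError("timeout")))  # caught by the narrowest matching handler
```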
## Architecture

```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                         AiClient                             β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚
β”‚  β”‚ ChatRequest β”‚  β”‚ Resilience  β”‚  β”‚      Protocol       β”‚ β”‚
β”‚  β”‚   Builder   β”‚  β”‚  Executor   β”‚  β”‚       Loader        β”‚ β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
          β”‚                    β”‚                   β”‚
          β–Ό                    β–Ό                   β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  HttpTransport  β”‚  β”‚   Pipeline   β”‚  β”‚  ProtocolManifest   β”‚
β”‚    (httpx)      β”‚  β”‚   (decodeβ†’   β”‚  β”‚    (YAML/JSON)      β”‚
β”‚                 β”‚  β”‚    selectβ†’   β”‚  β”‚                     β”‚
β”‚                 β”‚  β”‚    map)      β”‚  β”‚                     β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
```

## Development

```bash
# Clone the repository
git clone https://github.com/hiddenpath/ai-lib-python.git
cd ai-lib-python

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run tests with coverage
pytest --cov=src/ai_lib_python

# Type checking
mypy src

# Linting
ruff check src tests

# Format code
ruff format src tests
```

## Project Structure

```
ai-lib-python/
β”œβ”€β”€ src/ai_lib_python/
β”‚   β”œβ”€β”€ __init__.py            # Package exports
β”‚   β”œβ”€β”€ types/                 # Type definitions
β”‚   β”‚   β”œβ”€β”€ message.py         # Message, ContentBlock
β”‚   β”‚   β”œβ”€β”€ tool.py            # ToolDefinition, ToolCall
β”‚   β”‚   └── events.py          # StreamingEvent types
β”‚   β”œβ”€β”€ protocol/              # Protocol layer
β”‚   β”‚   β”œβ”€β”€ manifest.py        # ProtocolManifest models
β”‚   β”‚   β”œβ”€β”€ loader.py          # Protocol loading
β”‚   β”‚   └── validator.py       # Schema validation (+ version/streaming checks)
β”‚   β”œβ”€β”€ transport/             # HTTP transport
β”‚   β”‚   β”œβ”€β”€ http.py            # HttpTransport
β”‚   β”‚   β”œβ”€β”€ auth.py            # API key resolution
β”‚   β”‚   └── pool.py            # ConnectionPool
β”‚   β”œβ”€β”€ pipeline/              # Stream processing
β”‚   β”‚   β”œβ”€β”€ decode.py          # SSE/NDJSON decoders
β”‚   β”‚   β”œβ”€β”€ select.py          # JSONPath selectors
β”‚   β”‚   β”œβ”€β”€ accumulate.py      # Tool call accumulator
β”‚   β”‚   β”œβ”€β”€ event_map.py       # Event mappers
β”‚   β”‚   └── fan_out.py         # FanOut, Replicate, Split transforms
β”‚   β”œβ”€β”€ resilience/            # Resilience patterns
β”‚   β”‚   β”œβ”€β”€ retry.py           # RetryPolicy
β”‚   β”‚   β”œβ”€β”€ rate_limiter.py    # RateLimiter
β”‚   β”‚   β”œβ”€β”€ circuit_breaker.py
β”‚   β”‚   β”œβ”€β”€ backpressure.py
β”‚   β”‚   β”œβ”€β”€ fallback.py        # FallbackChain
β”‚   β”‚   β”œβ”€β”€ executor.py        # ResilientExecutor
β”‚   β”‚   β”œβ”€β”€ signals.py         # SignalsSnapshot
β”‚   β”‚   └── preflight.py       # PreflightChecker
β”‚   β”œβ”€β”€ routing/               # Model routing & load balancing
β”‚   β”‚   β”œβ”€β”€ types.py           # ModelInfo, ModelArray
β”‚   β”‚   β”œβ”€β”€ strategy.py        # Selection strategies
β”‚   β”‚   └── manager.py         # ModelManager
β”‚   β”œβ”€β”€ client/                # User API
β”‚   β”‚   β”œβ”€β”€ core.py            # AiClient
β”‚   β”‚   β”œβ”€β”€ builder.py         # Builders
β”‚   β”‚   β”œβ”€β”€ response.py        # ChatResponse
β”‚   β”‚   └── cancel.py          # CancelToken, CancellableStream
β”‚   β”œβ”€β”€ embeddings/            # Embedding support
β”‚   β”‚   β”œβ”€β”€ client.py          # EmbeddingClient
β”‚   β”‚   β”œβ”€β”€ types.py           # Embedding, EmbeddingRequest
β”‚   β”‚   └── vectors.py         # Vector operations
β”‚   β”œβ”€β”€ cache/                 # Response caching
β”‚   β”‚   β”œβ”€β”€ manager.py         # CacheManager
β”‚   β”‚   β”œβ”€β”€ backends.py        # MemoryCache, DiskCache
β”‚   β”‚   └── key.py             # CacheKeyGenerator
β”‚   β”œβ”€β”€ tokens/                # Token counting
β”‚   β”‚   β”œβ”€β”€ counter.py         # TokenCounter, TiktokenCounter
β”‚   β”‚   └── estimator.py       # ModelPricing, CostEstimate
β”‚   β”œβ”€β”€ telemetry/             # Observability
β”‚   β”‚   β”œβ”€β”€ logger.py          # AiLibLogger
β”‚   β”‚   β”œβ”€β”€ metrics.py         # MetricsCollector
β”‚   β”‚   β”œβ”€β”€ tracer.py          # Tracer
β”‚   β”‚   β”œβ”€β”€ health.py          # HealthChecker
β”‚   β”‚   └── feedback.py        # Feedback types and sinks
β”‚   β”œβ”€β”€ batch/                 # Request batching
β”‚   β”‚   β”œβ”€β”€ collector.py       # BatchCollector
β”‚   β”‚   └── executor.py        # BatchExecutor
β”‚   β”œβ”€β”€ guardrails/            # Input/output guardrails
β”‚   β”‚   β”œβ”€β”€ base.py
β”‚   β”‚   β”œβ”€β”€ filters.py
β”‚   β”‚   └── validators.py
β”‚   β”œβ”€β”€ plugins/               # Plugin system
β”‚   β”‚   β”œβ”€β”€ base.py            # Plugin base class
β”‚   β”‚   β”œβ”€β”€ registry.py        # PluginRegistry
β”‚   β”‚   β”œβ”€β”€ hooks.py           # HookManager
β”‚   β”‚   └── middleware.py      # Middleware chain
β”‚   β”œβ”€β”€ structured/            # Structured output
β”‚   β”‚   β”œβ”€β”€ json_mode.py       # JsonModeConfig
β”‚   β”‚   β”œβ”€β”€ schema.py          # SchemaGenerator
β”‚   β”‚   └── validator.py       # OutputValidator
β”‚   β”œβ”€β”€ utils/                 # Utilities
β”‚   β”‚   └── tool_call_assembler.py  # ToolCallAssembler
β”‚   └── errors/                # Error hierarchy
β”œβ”€β”€ tests/
β”‚   β”œβ”€β”€ unit/                  # Unit tests
β”‚   └── integration/           # Integration tests
β”œβ”€β”€ docs/                      # Documentation
β”œβ”€β”€ examples/                  # Example scripts
└── pyproject.toml
```

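`pipeline/decode.py` handles the SSE and NDJSON wire formats. The essence of an SSE decoder is small; a self-contained sketch of the idea (not the module's actual code):

```python
def decode_sse(raw: str):
    """Minimal SSE sketch: events are separated by blank lines, and each
    `data:` line carries a payload. `[DONE]` is a common end-of-stream marker."""
    for block in raw.split("\n\n"):
        for line in block.splitlines():
            if line.startswith("data:"):
                payload = line[len("data:"):].strip()
                if payload != "[DONE]":
                    yield payload

events = list(decode_sse("data: {\"a\":1}\n\ndata: [DONE]\n\n"))
print(events)  # ['{"a":1}']
```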
## Environment Variables

| Variable | Description | Default |
|----------|-------------|---------|
| `OPENAI_API_KEY` | OpenAI API key | - |
| `ANTHROPIC_API_KEY` | Anthropic API key | - |
| `GOOGLE_API_KEY` | Google AI API key | - |
| `AI_PROTOCOL_PATH` | Custom protocol directory | - |
| `AI_HTTP_TIMEOUT_SECS` | HTTP timeout | 60 |
| `AI_LIB_MAX_INFLIGHT` | Max concurrent requests | 10 |

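A typical setup exports keys and overrides the defaults before running your app. The key values below are placeholders, not real credentials:

```shell
# Placeholder values -- substitute your real keys.
export OPENAI_API_KEY="sk-your-key-here"
export ANTHROPIC_API_KEY="your-anthropic-key"
export AI_HTTP_TIMEOUT_SECS=120   # raise the 60s default for slow models
export AI_LIB_MAX_INFLIGHT=20     # allow more concurrent requests than the default 10
```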
## Related Projects

- [AI-Protocol](https://github.com/hiddenpath/ai-protocol) - Protocol specification
- [ai-lib-rust](https://github.com/hiddenpath/ai-lib-rust) - Rust runtime implementation

## Contributing

Contributions are welcome! Please read our [Contributing Guide](CONTRIBUTING.md) for details.

## License

This project is licensed under either of:

- Apache License, Version 2.0 ([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)
- MIT License ([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)

at your option.