cost-katana 1.0.2.tar.gz → 2.0.0.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (29)
  1. {cost_katana-1.0.2/cost_katana.egg-info → cost_katana-2.0.0}/PKG-INFO +328 -6
  2. {cost_katana-1.0.2 → cost_katana-2.0.0}/README.md +325 -3
  3. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/__init__.py +1 -1
  4. {cost_katana-1.0.2 → cost_katana-2.0.0/cost_katana.egg-info}/PKG-INFO +328 -6
  5. {cost_katana-1.0.2 → cost_katana-2.0.0}/setup.py +3 -3
  6. {cost_katana-1.0.2 → cost_katana-2.0.0}/LICENSE +0 -0
  7. {cost_katana-1.0.2 → cost_katana-2.0.0}/MANIFEST.in +0 -0
  8. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/cli.py +0 -0
  9. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/client.py +0 -0
  10. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/config.py +0 -0
  11. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/exceptions.py +0 -0
  12. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana/models.py +0 -0
  13. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana.egg-info/SOURCES.txt +0 -0
  14. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana.egg-info/dependency_links.txt +0 -0
  15. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana.egg-info/entry_points.txt +0 -0
  16. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana.egg-info/requires.txt +0 -0
  17. {cost_katana-1.0.2 → cost_katana-2.0.0}/cost_katana.egg-info/top_level.txt +0 -0
  18. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/advanced_features_demo.py +0 -0
  19. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/basic_usage.py +0 -0
  20. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/chat_session.py +0 -0
  21. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/comprehensive_demo.py +0 -0
  22. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/config.json +0 -0
  23. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/config_example.py +0 -0
  24. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/full_integration_demo.py +0 -0
  25. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/old_vs_new.py +0 -0
  26. {cost_katana-1.0.2 → cost_katana-2.0.0}/examples/provider_comparison.py +0 -0
  27. {cost_katana-1.0.2 → cost_katana-2.0.0}/requirements-dev.txt +0 -0
  28. {cost_katana-1.0.2 → cost_katana-2.0.0}/requirements.txt +0 -0
  29. {cost_katana-1.0.2 → cost_katana-2.0.0}/setup.cfg +0 -0
@@ -1,14 +1,14 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: cost-katana
3
- Version: 1.0.2
4
- Summary: Unified AI interface with cost optimization and failover
3
+ Version: 2.0.0
4
+ Summary: Revolutionary AI SDK with Cortex Meta-Language for 70-95% token reduction
5
5
  Home-page: https://github.com/Hypothesize-Tech/cost-katana-python
6
6
  Author: Cost Katana Team
7
7
  Author-email: abdul@hypothesize.tech
8
8
  Project-URL: Bug Reports, https://github.com/Hypothesize-Tech/cost-katana-python/issues
9
9
  Project-URL: Source, https://github.com/Hypothesize-Tech/cost-katana-python
10
10
  Project-URL: Documentation, https://docs.costkatana.com
11
- Keywords: ai,machine learning,cost optimization,openai,anthropic,aws bedrock,gemini
11
+ Keywords: ai,machine learning,cost optimization,cortex,lisp,token reduction,openai,anthropic,aws bedrock,gemini,claude opus
12
12
  Classifier: Development Status :: 4 - Beta
13
13
  Classifier: Intended Audience :: Developers
14
14
  Classifier: License :: OSI Approved :: MIT License
@@ -45,7 +45,7 @@ Dynamic: summary
45
45
 
46
46
  # Cost Katana Python SDK
47
47
 
48
- A simple, unified interface for AI models with built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API - no need to manage API keys or worry about provider-specific implementations!
48
+ A revolutionary AI SDK with **Cortex Meta-Language** for 70-95% token reduction. Features built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API with breakthrough LISP-based optimization!
49
49
 
50
50
  ## 🚀 Quick Start
51
51
 
@@ -100,13 +100,105 @@ total_cost = sum(msg.get('metadata', {}).get('cost', 0) for msg in chat.history)
100
100
  print(f"Total conversation cost: ${total_cost:.4f}")
101
101
  ```
102
102
 
103
+ ## 🧠 Cortex Meta-Language: Revolutionary AI Optimization
104
+
105
+ Cost Katana's **Cortex** system achieves **70-95% token reduction** through a breakthrough 3-stage pipeline that generates complete answers in optimized LISP format.
106
+
107
+ ### 🚀 Enable Cortex Optimization
108
+
109
+ ```python
110
+ import cost_katana as ck
111
+
112
+ ck.configure(api_key='dak_your_key_here')
113
+
114
+ # Enable Cortex for massive token savings
115
+ model = ck.GenerativeModel('claude-3-sonnet')
116
+ response = model.generate_content(
117
+ "Write a complete Python web scraper with error handling",
118
+ cortex={
119
+ 'enabled': True,
120
+ 'mode': 'answer_generation', # Generate complete answers in LISP
121
+ 'encoding_model': 'claude-3-5-sonnet',
122
+ 'core_model': 'claude-opus-4-1',
123
+ 'decoding_model': 'claude-3-5-sonnet',
124
+ 'dynamic_instructions': True, # AI-powered LISP instruction generation
125
+ 'analytics': True
126
+ }
127
+ )
128
+
129
+ print("Generated Answer:", response.text)
130
+ print(f"Token Reduction: {response.cortex_metadata.token_reduction}%")
131
+ print(f"Cost Savings: ${response.cortex_metadata.cost_savings:.4f}")
132
+ print(f"Confidence Score: {response.cortex_metadata.confidence}%")
133
+ print(f"Semantic Integrity: {response.cortex_metadata.semantic_integrity}%")
134
+ ```
135
+
136
+ ### 🔬 Advanced Cortex Features
137
+
138
+ ```python
139
+ # Bulk optimization with Cortex
140
+ queries = [
141
+ "Explain machine learning algorithms",
142
+ "Write a React authentication component",
143
+ "Create a database migration script"
144
+ ]
145
+
146
+ results = model.bulk_generate_content(
147
+ queries,
148
+ cortex={
149
+ 'enabled': True,
150
+ 'mode': 'answer_generation',
151
+ 'batch_processing': True,
152
+ 'dynamic_instructions': True
153
+ }
154
+ )
155
+
156
+ for i, result in enumerate(results):
157
+ print(f"Query {i+1}: {result.cortex_metadata.token_reduction}% reduction")
158
+
159
+ # Context-aware processing
160
+ technical_response = model.generate_content(
161
+ "Implement a distributed caching system",
162
+ cortex={
163
+ 'enabled': True,
164
+ 'context': 'technical',
165
+ 'complexity': 'high',
166
+ 'include_examples': True,
167
+ 'code_generation': True
168
+ }
169
+ )
170
+ ```
171
+
172
+ ### 📊 Traditional vs Cortex Comparison
173
+
174
+ ```python
175
+ # Compare traditional vs Cortex processing
176
+ comparison = model.compare_cortex(
177
+ query="Write a REST API with authentication in Flask",
178
+ max_tokens=2000
179
+ )
180
+
181
+ print("=== COMPARISON RESULTS ===")
182
+ print(f"Traditional: {comparison['traditional']['tokens_used']} tokens, ${comparison['traditional']['cost']:.4f}")
183
+ print(f"Cortex: {comparison['cortex']['tokens_used']} tokens, ${comparison['cortex']['cost']:.4f}")
184
+ print(f"Savings: {comparison['savings']['token_reduction']}% tokens, ${comparison['savings']['cost_savings']:.4f}")
185
+ print(f"Semantic Integrity: {comparison['quality']['semantic_integrity']}%")
186
+ ```
187
+
103
188
  ## 🎯 Why Cost Katana?
104
189
 
190
+ ### 🧠 Cortex-Powered Intelligence
191
+ - **70-95% Token Reduction**: Revolutionary LISP-based answer generation
192
+ - **3-Stage Optimization Pipeline**: Encoder → Core Processor → Decoder
193
+ - **Dynamic LISP Instructions**: AI-powered instruction generation for any context
194
+ - **Real-time Analytics**: Confidence, cost impact, and semantic integrity metrics
195
+ - **Universal Context Handling**: Technical, business, and industry-specific processing
196
+
105
197
  ### Simple Interface, Powerful Backend
106
198
  - **One API for all providers**: Use Google Gemini, Anthropic Claude, OpenAI GPT, AWS Bedrock models through one interface
107
199
  - **No API key juggling**: Store your provider keys securely in Cost Katana, use one key in your code
108
200
  - **Automatic failover**: If one provider is down, automatically switch to alternatives
109
- - **Cost optimization**: Intelligent routing to minimize costs while maintaining quality
201
+ - **Intelligent routing**: Cortex-powered optimization to minimize costs while maintaining quality
110
202
 
111
203
  ### Enterprise Features
112
204
  - **Cost tracking**: Real-time cost monitoring and budgets
@@ -264,7 +356,7 @@ balanced_response = model.generate_content(
264
356
 
265
357
  ## 🖥️ Command Line Interface
266
358
 
267
- Cost Katana includes a CLI for easy interaction:
359
+ Cost Katana includes a comprehensive CLI for easy interaction:
268
360
 
269
361
  ```bash
270
362
  # Initialize configuration
@@ -283,6 +375,198 @@ cost-katana chat --model gemini-2.0-flash
283
375
  cost-katana chat --config my-config.json
284
376
  ```
285
377
 
378
+ ## 🧬 SAST (Semantic Abstract Syntax Tree) Features
379
+
380
+ Cost Katana includes advanced SAST capabilities for semantic optimization and analysis:
381
+
382
+ ### SAST Optimization
383
+
384
+ ```bash
385
+ # Optimize a prompt using SAST
386
+ cost-katana sast optimize "Write a detailed analysis of market trends"
387
+
388
+ # Optimize from file
389
+ cost-katana sast optimize --file prompt.txt --output optimized.txt
390
+
391
+ # Cross-lingual optimization
392
+ cost-katana sast optimize "Analyze data" --cross-lingual --language en
393
+
394
+ # Preserve ambiguity for analysis
395
+ cost-katana sast optimize "Complex query" --preserve-ambiguity
396
+ ```
397
+
398
+ ### SAST Comparison
399
+
400
+ ```bash
401
+ # Compare traditional vs SAST optimization
402
+ cost-katana sast compare "Your prompt here"
403
+
404
+ # Compare with specific language
405
+ cost-katana sast compare --file prompt.txt --language en
406
+ ```
407
+
408
+ ### SAST Vocabulary & Analytics
409
+
410
+ ```bash
411
+ # Explore SAST vocabulary
412
+ cost-katana sast vocabulary
413
+
414
+ # Search semantic primitives
415
+ cost-katana sast vocabulary --search "analysis" --category "action"
416
+
417
+ # Get SAST performance statistics
418
+ cost-katana sast stats
419
+
420
+ # View SAST showcase with examples
421
+ cost-katana sast showcase
422
+
423
+ # Telescope ambiguity demonstration
424
+ cost-katana sast telescope
425
+
426
+ # Test universal semantics across languages
427
+ cost-katana sast universal "concept" --languages "en,es,fr"
428
+ ```
429
+
430
+ ### SAST Python API
431
+
432
+ ```python
433
+ import cost_katana as ck
434
+
435
+ ck.configure(api_key='dak_your_key_here')
436
+ client = ck.CostKatanaClient()
437
+
438
+ # Optimize with SAST
439
+ result = client.optimize_with_sast(
440
+ prompt="Your prompt here",
441
+ language="en",
442
+ cross_lingual=True,
443
+ preserve_ambiguity=False
444
+ )
445
+
446
+ # Compare SAST vs traditional
447
+ comparison = client.compare_sast_vs_traditional(
448
+ prompt="Your prompt here",
449
+ language="en"
450
+ )
451
+
452
+ # Get SAST vocabulary stats
453
+ stats = client.get_sast_vocabulary_stats()
454
+
455
+ # Search semantic primitives
456
+ primitives = client.search_semantic_primitives(
457
+ term="analysis",
458
+ category="action",
459
+ limit=10
460
+ )
461
+
462
+ # Test universal semantics
463
+ universal_test = client.test_universal_semantics(
464
+ concept="love",
465
+ languages=["en", "es", "fr"]
466
+ )
467
+ ```
468
+
469
+ ## 🧠 Cortex Engine Features
470
+
471
+ Cost Katana's Cortex engine provides intelligent processing capabilities:
472
+
473
+ ### Cortex Operations
474
+
475
+ ```python
476
+ import cost_katana as ck
477
+
478
+ ck.configure(api_key='dak_your_key_here')
479
+ client = ck.CostKatanaClient()
480
+
481
+ # Enable Cortex with SAST processing
482
+ result = client.optimize_with_sast(
483
+ prompt="Your prompt",
484
+ service="openai",
485
+ model="gpt-4o-mini",
486
+ # Cortex features
487
+ enableCortex=True,
488
+ cortexOperation="sast",
489
+ cortexStyle="conversational",
490
+ cortexFormat="plain",
491
+ cortexSemanticCache=True,
492
+ cortexPreserveSemantics=True,
493
+ cortexIntelligentRouting=True,
494
+ cortexSastProcessing=True,
495
+ cortexAmbiguityResolution=True,
496
+ cortexCrossLingualMode=False
497
+ )
498
+ ```
499
+
500
+ ### Cortex Capabilities
501
+
502
+ - **Semantic Caching**: Intelligent caching of semantic representations
503
+ - **Intelligent Routing**: Smart routing based on content analysis
504
+ - **Ambiguity Resolution**: Automatic resolution of ambiguous language
505
+ - **Cross-lingual Processing**: Multi-language semantic understanding
506
+ - **Semantic Preservation**: Maintains semantic meaning during optimization
507
+
508
+ ## 🌐 Gateway Features
509
+
510
+ Cost Katana acts as a unified gateway to multiple AI providers:
511
+
512
+ ### Provider Abstraction
513
+
514
+ ```python
515
+ import cost_katana as ck
516
+
517
+ ck.configure(api_key='dak_your_key_here')
518
+
519
+ # Same interface, different providers
520
+ models = [
521
+ 'nova-lite', # Amazon Nova
522
+ 'claude-3-sonnet', # Anthropic Claude
523
+ 'gemini-2.0-flash', # Google Gemini
524
+ 'gpt-4', # OpenAI GPT
525
+ 'llama-3.1-70b' # Meta Llama
526
+ ]
527
+
528
+ for model in models:
529
+ response = ck.GenerativeModel(model).generate_content("Hello!")
530
+ print(f"{model}: {response.text[:50]}...")
531
+ ```
532
+
533
+ ### Intelligent Routing
534
+
535
+ ```python
536
+ # Cost Katana automatically routes to the best provider
537
+ model = ck.GenerativeModel('balanced') # Uses intelligent routing
538
+
539
+ # Different optimization modes
540
+ fast_response = model.generate_content(
541
+ "Quick summary",
542
+ chat_mode='fastest' # Routes to fastest provider
543
+ )
544
+
545
+ cheap_response = model.generate_content(
546
+ "Detailed analysis",
547
+ chat_mode='cheapest' # Routes to most cost-effective provider
548
+ )
549
+
550
+ balanced_response = model.generate_content(
551
+ "Complex reasoning",
552
+ chat_mode='balanced' # Balances speed and cost
553
+ )
554
+ ```
555
+
556
+ ### Failover & Redundancy
557
+
558
+ ```python
559
+ # Automatic failover if primary provider is down
560
+ model = ck.GenerativeModel('claude-3-sonnet')
561
+
562
+ try:
563
+ response = model.generate_content("Your prompt")
564
+ except ck.ModelNotAvailableError:
565
+ # Cost Katana automatically tries alternative providers
566
+ print("Primary model unavailable, using fallback...")
567
+ response = model.generate_content("Your prompt")
568
+ ```
569
+
286
570
  ## 📊 Usage Analytics
287
571
 
288
572
  Track your AI usage and costs:
@@ -400,6 +684,30 @@ class ChatSession:
400
684
  def delete_conversation(self) -> None
401
685
  ```
402
686
 
687
+ ### CostKatanaClient
688
+
689
+ ```python
690
+ class CostKatanaClient:
691
+ def __init__(self, api_key: str = None, base_url: str = None, config_file: str = None)
692
+
693
+ # Core Methods
694
+ def send_message(self, message: str, model_id: str, **kwargs) -> Dict[str, Any]
695
+ def get_available_models(self) -> List[Dict[str, Any]]
696
+ def create_conversation(self, title: str = None, model_id: str = None) -> Dict[str, Any]
697
+ def get_conversation_history(self, conversation_id: str) -> Dict[str, Any]
698
+ def delete_conversation(self, conversation_id: str) -> Dict[str, Any]
699
+
700
+ # SAST Methods
701
+ def optimize_with_sast(self, prompt: str, **kwargs) -> Dict[str, Any]
702
+ def compare_sast_vs_traditional(self, prompt: str, **kwargs) -> Dict[str, Any]
703
+ def get_sast_vocabulary_stats(self) -> Dict[str, Any]
704
+ def search_semantic_primitives(self, term: str = None, **kwargs) -> Dict[str, Any]
705
+ def get_telescope_demo(self) -> Dict[str, Any]
706
+ def test_universal_semantics(self, concept: str, languages: List[str] = None) -> Dict[str, Any]
707
+ def get_sast_stats(self) -> Dict[str, Any]
708
+ def get_sast_showcase(self) -> Dict[str, Any]
709
+ ```
710
+
403
711
  ### GenerateContentResponse
404
712
 
405
713
  ```python
@@ -409,6 +717,20 @@ class GenerateContentResponse:
409
717
  thinking: Dict # AI reasoning (if available)
410
718
  ```
411
719
 
720
+ ### UsageMetadata
721
+
722
+ ```python
723
+ class UsageMetadata:
724
+ model: str # Model used
725
+ cost: float # Cost in USD
726
+ latency: float # Response time in seconds
727
+ total_tokens: int # Total tokens used
728
+ cache_hit: bool # Whether response was cached
729
+ risk_level: str # Risk assessment level
730
+ agent_path: List[str] # Multi-agent processing path
731
+ optimizations_applied: List[str] # Applied optimizations
732
+ ```
733
+
412
734
  ## 🤝 Support
413
735
 
414
736
  - **Documentation**: [docs.costkatana.com](https://docs.costkatana.com)
@@ -1,6 +1,6 @@
1
1
  # Cost Katana Python SDK
2
2
 
3
- A simple, unified interface for AI models with built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API - no need to manage API keys or worry about provider-specific implementations!
3
+ A revolutionary AI SDK with **Cortex Meta-Language** for 70-95% token reduction. Features built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API with breakthrough LISP-based optimization!
4
4
 
5
5
  ## 🚀 Quick Start
6
6
 
@@ -55,13 +55,105 @@ total_cost = sum(msg.get('metadata', {}).get('cost', 0) for msg in chat.history)
55
55
  print(f"Total conversation cost: ${total_cost:.4f}")
56
56
  ```
57
57
 
58
+ ## 🧠 Cortex Meta-Language: Revolutionary AI Optimization
59
+
60
+ Cost Katana's **Cortex** system achieves **70-95% token reduction** through a breakthrough 3-stage pipeline that generates complete answers in optimized LISP format.
61
+
62
+ ### 🚀 Enable Cortex Optimization
63
+
64
+ ```python
65
+ import cost_katana as ck
66
+
67
+ ck.configure(api_key='dak_your_key_here')
68
+
69
+ # Enable Cortex for massive token savings
70
+ model = ck.GenerativeModel('claude-3-sonnet')
71
+ response = model.generate_content(
72
+ "Write a complete Python web scraper with error handling",
73
+ cortex={
74
+ 'enabled': True,
75
+ 'mode': 'answer_generation', # Generate complete answers in LISP
76
+ 'encoding_model': 'claude-3-5-sonnet',
77
+ 'core_model': 'claude-opus-4-1',
78
+ 'decoding_model': 'claude-3-5-sonnet',
79
+ 'dynamic_instructions': True, # AI-powered LISP instruction generation
80
+ 'analytics': True
81
+ }
82
+ )
83
+
84
+ print("Generated Answer:", response.text)
85
+ print(f"Token Reduction: {response.cortex_metadata.token_reduction}%")
86
+ print(f"Cost Savings: ${response.cortex_metadata.cost_savings:.4f}")
87
+ print(f"Confidence Score: {response.cortex_metadata.confidence}%")
88
+ print(f"Semantic Integrity: {response.cortex_metadata.semantic_integrity}%")
89
+ ```
90
+
91
+ ### 🔬 Advanced Cortex Features
92
+
93
+ ```python
94
+ # Bulk optimization with Cortex
95
+ queries = [
96
+ "Explain machine learning algorithms",
97
+ "Write a React authentication component",
98
+ "Create a database migration script"
99
+ ]
100
+
101
+ results = model.bulk_generate_content(
102
+ queries,
103
+ cortex={
104
+ 'enabled': True,
105
+ 'mode': 'answer_generation',
106
+ 'batch_processing': True,
107
+ 'dynamic_instructions': True
108
+ }
109
+ )
110
+
111
+ for i, result in enumerate(results):
112
+ print(f"Query {i+1}: {result.cortex_metadata.token_reduction}% reduction")
113
+
114
+ # Context-aware processing
115
+ technical_response = model.generate_content(
116
+ "Implement a distributed caching system",
117
+ cortex={
118
+ 'enabled': True,
119
+ 'context': 'technical',
120
+ 'complexity': 'high',
121
+ 'include_examples': True,
122
+ 'code_generation': True
123
+ }
124
+ )
125
+ ```
126
+
127
+ ### 📊 Traditional vs Cortex Comparison
128
+
129
+ ```python
130
+ # Compare traditional vs Cortex processing
131
+ comparison = model.compare_cortex(
132
+ query="Write a REST API with authentication in Flask",
133
+ max_tokens=2000
134
+ )
135
+
136
+ print("=== COMPARISON RESULTS ===")
137
+ print(f"Traditional: {comparison['traditional']['tokens_used']} tokens, ${comparison['traditional']['cost']:.4f}")
138
+ print(f"Cortex: {comparison['cortex']['tokens_used']} tokens, ${comparison['cortex']['cost']:.4f}")
139
+ print(f"Savings: {comparison['savings']['token_reduction']}% tokens, ${comparison['savings']['cost_savings']:.4f}")
140
+ print(f"Semantic Integrity: {comparison['quality']['semantic_integrity']}%")
141
+ ```
142
+
58
143
  ## 🎯 Why Cost Katana?
59
144
 
145
+ ### 🧠 Cortex-Powered Intelligence
146
+ - **70-95% Token Reduction**: Revolutionary LISP-based answer generation
147
+ - **3-Stage Optimization Pipeline**: Encoder → Core Processor → Decoder
148
+ - **Dynamic LISP Instructions**: AI-powered instruction generation for any context
149
+ - **Real-time Analytics**: Confidence, cost impact, and semantic integrity metrics
150
+ - **Universal Context Handling**: Technical, business, and industry-specific processing
151
+
60
152
  ### Simple Interface, Powerful Backend
61
153
  - **One API for all providers**: Use Google Gemini, Anthropic Claude, OpenAI GPT, AWS Bedrock models through one interface
62
154
  - **No API key juggling**: Store your provider keys securely in Cost Katana, use one key in your code
63
155
  - **Automatic failover**: If one provider is down, automatically switch to alternatives
64
- - **Cost optimization**: Intelligent routing to minimize costs while maintaining quality
156
+ - **Intelligent routing**: Cortex-powered optimization to minimize costs while maintaining quality
65
157
 
66
158
  ### Enterprise Features
67
159
  - **Cost tracking**: Real-time cost monitoring and budgets
@@ -219,7 +311,7 @@ balanced_response = model.generate_content(
219
311
 
220
312
  ## 🖥️ Command Line Interface
221
313
 
222
- Cost Katana includes a CLI for easy interaction:
314
+ Cost Katana includes a comprehensive CLI for easy interaction:
223
315
 
224
316
  ```bash
225
317
  # Initialize configuration
@@ -238,6 +330,198 @@ cost-katana chat --model gemini-2.0-flash
238
330
  cost-katana chat --config my-config.json
239
331
  ```
240
332
 
333
+ ## 🧬 SAST (Semantic Abstract Syntax Tree) Features
334
+
335
+ Cost Katana includes advanced SAST capabilities for semantic optimization and analysis:
336
+
337
+ ### SAST Optimization
338
+
339
+ ```bash
340
+ # Optimize a prompt using SAST
341
+ cost-katana sast optimize "Write a detailed analysis of market trends"
342
+
343
+ # Optimize from file
344
+ cost-katana sast optimize --file prompt.txt --output optimized.txt
345
+
346
+ # Cross-lingual optimization
347
+ cost-katana sast optimize "Analyze data" --cross-lingual --language en
348
+
349
+ # Preserve ambiguity for analysis
350
+ cost-katana sast optimize "Complex query" --preserve-ambiguity
351
+ ```
352
+
353
+ ### SAST Comparison
354
+
355
+ ```bash
356
+ # Compare traditional vs SAST optimization
357
+ cost-katana sast compare "Your prompt here"
358
+
359
+ # Compare with specific language
360
+ cost-katana sast compare --file prompt.txt --language en
361
+ ```
362
+
363
+ ### SAST Vocabulary & Analytics
364
+
365
+ ```bash
366
+ # Explore SAST vocabulary
367
+ cost-katana sast vocabulary
368
+
369
+ # Search semantic primitives
370
+ cost-katana sast vocabulary --search "analysis" --category "action"
371
+
372
+ # Get SAST performance statistics
373
+ cost-katana sast stats
374
+
375
+ # View SAST showcase with examples
376
+ cost-katana sast showcase
377
+
378
+ # Telescope ambiguity demonstration
379
+ cost-katana sast telescope
380
+
381
+ # Test universal semantics across languages
382
+ cost-katana sast universal "concept" --languages "en,es,fr"
383
+ ```
384
+
385
+ ### SAST Python API
386
+
387
+ ```python
388
+ import cost_katana as ck
389
+
390
+ ck.configure(api_key='dak_your_key_here')
391
+ client = ck.CostKatanaClient()
392
+
393
+ # Optimize with SAST
394
+ result = client.optimize_with_sast(
395
+ prompt="Your prompt here",
396
+ language="en",
397
+ cross_lingual=True,
398
+ preserve_ambiguity=False
399
+ )
400
+
401
+ # Compare SAST vs traditional
402
+ comparison = client.compare_sast_vs_traditional(
403
+ prompt="Your prompt here",
404
+ language="en"
405
+ )
406
+
407
+ # Get SAST vocabulary stats
408
+ stats = client.get_sast_vocabulary_stats()
409
+
410
+ # Search semantic primitives
411
+ primitives = client.search_semantic_primitives(
412
+ term="analysis",
413
+ category="action",
414
+ limit=10
415
+ )
416
+
417
+ # Test universal semantics
418
+ universal_test = client.test_universal_semantics(
419
+ concept="love",
420
+ languages=["en", "es", "fr"]
421
+ )
422
+ ```
423
+
424
+ ## 🧠 Cortex Engine Features
425
+
426
+ Cost Katana's Cortex engine provides intelligent processing capabilities:
427
+
428
+ ### Cortex Operations
429
+
430
+ ```python
431
+ import cost_katana as ck
432
+
433
+ ck.configure(api_key='dak_your_key_here')
434
+ client = ck.CostKatanaClient()
435
+
436
+ # Enable Cortex with SAST processing
437
+ result = client.optimize_with_sast(
438
+ prompt="Your prompt",
439
+ service="openai",
440
+ model="gpt-4o-mini",
441
+ # Cortex features
442
+ enableCortex=True,
443
+ cortexOperation="sast",
444
+ cortexStyle="conversational",
445
+ cortexFormat="plain",
446
+ cortexSemanticCache=True,
447
+ cortexPreserveSemantics=True,
448
+ cortexIntelligentRouting=True,
449
+ cortexSastProcessing=True,
450
+ cortexAmbiguityResolution=True,
451
+ cortexCrossLingualMode=False
452
+ )
453
+ ```
454
+
455
+ ### Cortex Capabilities
456
+
457
+ - **Semantic Caching**: Intelligent caching of semantic representations
458
+ - **Intelligent Routing**: Smart routing based on content analysis
459
+ - **Ambiguity Resolution**: Automatic resolution of ambiguous language
460
+ - **Cross-lingual Processing**: Multi-language semantic understanding
461
+ - **Semantic Preservation**: Maintains semantic meaning during optimization
462
+
463
+ ## 🌐 Gateway Features
464
+
465
+ Cost Katana acts as a unified gateway to multiple AI providers:
466
+
467
+ ### Provider Abstraction
468
+
469
+ ```python
470
+ import cost_katana as ck
471
+
472
+ ck.configure(api_key='dak_your_key_here')
473
+
474
+ # Same interface, different providers
475
+ models = [
476
+ 'nova-lite', # Amazon Nova
477
+ 'claude-3-sonnet', # Anthropic Claude
478
+ 'gemini-2.0-flash', # Google Gemini
479
+ 'gpt-4', # OpenAI GPT
480
+ 'llama-3.1-70b' # Meta Llama
481
+ ]
482
+
483
+ for model in models:
484
+ response = ck.GenerativeModel(model).generate_content("Hello!")
485
+ print(f"{model}: {response.text[:50]}...")
486
+ ```
487
+
488
+ ### Intelligent Routing
489
+
490
+ ```python
491
+ # Cost Katana automatically routes to the best provider
492
+ model = ck.GenerativeModel('balanced') # Uses intelligent routing
493
+
494
+ # Different optimization modes
495
+ fast_response = model.generate_content(
496
+ "Quick summary",
497
+ chat_mode='fastest' # Routes to fastest provider
498
+ )
499
+
500
+ cheap_response = model.generate_content(
501
+ "Detailed analysis",
502
+ chat_mode='cheapest' # Routes to most cost-effective provider
503
+ )
504
+
505
+ balanced_response = model.generate_content(
506
+ "Complex reasoning",
507
+ chat_mode='balanced' # Balances speed and cost
508
+ )
509
+ ```
510
+
511
+ ### Failover & Redundancy
512
+
513
+ ```python
514
+ # Automatic failover if primary provider is down
515
+ model = ck.GenerativeModel('claude-3-sonnet')
516
+
517
+ try:
518
+ response = model.generate_content("Your prompt")
519
+ except ck.ModelNotAvailableError:
520
+ # Cost Katana automatically tries alternative providers
521
+ print("Primary model unavailable, using fallback...")
522
+ response = model.generate_content("Your prompt")
523
+ ```
524
+
241
525
  ## 📊 Usage Analytics
242
526
 
243
527
  Track your AI usage and costs:
@@ -355,6 +639,30 @@ class ChatSession:
355
639
  def delete_conversation(self) -> None
356
640
  ```
357
641
 
642
+ ### CostKatanaClient
643
+
644
+ ```python
645
+ class CostKatanaClient:
646
+ def __init__(self, api_key: str = None, base_url: str = None, config_file: str = None)
647
+
648
+ # Core Methods
649
+ def send_message(self, message: str, model_id: str, **kwargs) -> Dict[str, Any]
650
+ def get_available_models(self) -> List[Dict[str, Any]]
651
+ def create_conversation(self, title: str = None, model_id: str = None) -> Dict[str, Any]
652
+ def get_conversation_history(self, conversation_id: str) -> Dict[str, Any]
653
+ def delete_conversation(self, conversation_id: str) -> Dict[str, Any]
654
+
655
+ # SAST Methods
656
+ def optimize_with_sast(self, prompt: str, **kwargs) -> Dict[str, Any]
657
+ def compare_sast_vs_traditional(self, prompt: str, **kwargs) -> Dict[str, Any]
658
+ def get_sast_vocabulary_stats(self) -> Dict[str, Any]
659
+ def search_semantic_primitives(self, term: str = None, **kwargs) -> Dict[str, Any]
660
+ def get_telescope_demo(self) -> Dict[str, Any]
661
+ def test_universal_semantics(self, concept: str, languages: List[str] = None) -> Dict[str, Any]
662
+ def get_sast_stats(self) -> Dict[str, Any]
663
+ def get_sast_showcase(self) -> Dict[str, Any]
664
+ ```
665
+
358
666
  ### GenerateContentResponse
359
667
 
360
668
  ```python
@@ -364,6 +672,20 @@ class GenerateContentResponse:
364
672
  thinking: Dict # AI reasoning (if available)
365
673
  ```
366
674
 
675
+ ### UsageMetadata
676
+
677
+ ```python
678
+ class UsageMetadata:
679
+ model: str # Model used
680
+ cost: float # Cost in USD
681
+ latency: float # Response time in seconds
682
+ total_tokens: int # Total tokens used
683
+ cache_hit: bool # Whether response was cached
684
+ risk_level: str # Risk assessment level
685
+ agent_path: List[str] # Multi-agent processing path
686
+ optimizations_applied: List[str] # Applied optimizations
687
+ ```
688
+
367
689
  ## 🤝 Support
368
690
 
369
691
  - **Documentation**: [docs.costkatana.com](https://docs.costkatana.com)
@@ -31,7 +31,7 @@ from .exceptions import (
31
31
  )
32
32
  from .config import Config
33
33
 
34
- __version__ = "1.0.2"
34
+ __version__ = "1.0.3"
35
35
  __all__ = [
36
36
  "configure",
37
37
  "create_generative_model",
@@ -1,14 +1,14 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: cost-katana
3
- Version: 1.0.2
4
- Summary: Unified AI interface with cost optimization and failover
3
+ Version: 2.0.0
4
+ Summary: Revolutionary AI SDK with Cortex Meta-Language for 70-95% token reduction
5
5
  Home-page: https://github.com/Hypothesize-Tech/cost-katana-python
6
6
  Author: Cost Katana Team
7
7
  Author-email: abdul@hypothesize.tech
8
8
  Project-URL: Bug Reports, https://github.com/Hypothesize-Tech/cost-katana-python/issues
9
9
  Project-URL: Source, https://github.com/Hypothesize-Tech/cost-katana-python
10
10
  Project-URL: Documentation, https://docs.costkatana.com
11
- Keywords: ai,machine learning,cost optimization,openai,anthropic,aws bedrock,gemini
11
+ Keywords: ai,machine learning,cost optimization,cortex,lisp,token reduction,openai,anthropic,aws bedrock,gemini,claude opus
12
12
  Classifier: Development Status :: 4 - Beta
13
13
  Classifier: Intended Audience :: Developers
14
14
  Classifier: License :: OSI Approved :: MIT License
@@ -45,7 +45,7 @@ Dynamic: summary
45
45
 
46
46
  # Cost Katana Python SDK
47
47
 
48
- A simple, unified interface for AI models with built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API - no need to manage API keys or worry about provider-specific implementations!
48
+ A revolutionary AI SDK with **Cortex Meta-Language** for 70-95% token reduction. Features built-in cost optimization, failover, and analytics. Use any AI provider through one consistent API with breakthrough LISP-based optimization!
49
49
 
50
50
  ## 🚀 Quick Start
51
51
 
@@ -100,13 +100,105 @@ total_cost = sum(msg.get('metadata', {}).get('cost', 0) for msg in chat.history)
100
100
  print(f"Total conversation cost: ${total_cost:.4f}")
101
101
  ```
102
102
 
103
+ ## 🧠 Cortex Meta-Language: Revolutionary AI Optimization
104
+
105
+ Cost Katana's **Cortex** system achieves **70-95% token reduction** through a breakthrough 3-stage pipeline that generates complete answers in optimized LISP format.
106
+
107
+ ### 🚀 Enable Cortex Optimization
108
+
109
+ ```python
110
+ import cost_katana as ck
111
+
112
+ ck.configure(api_key='dak_your_key_here')
113
+
114
+ # Enable Cortex for massive token savings
115
+ model = ck.GenerativeModel('claude-3-sonnet')
116
+ response = model.generate_content(
117
+ "Write a complete Python web scraper with error handling",
118
+ cortex={
119
+ 'enabled': True,
120
+ 'mode': 'answer_generation', # Generate complete answers in LISP
121
+ 'encoding_model': 'claude-3-5-sonnet',
122
+ 'core_model': 'claude-opus-4-1',
123
+ 'decoding_model': 'claude-3-5-sonnet',
124
+ 'dynamic_instructions': True, # AI-powered LISP instruction generation
125
+ 'analytics': True
126
+ }
127
+ )
128
+
129
+ print("Generated Answer:", response.text)
130
+ print(f"Token Reduction: {response.cortex_metadata.token_reduction}%")
131
+ print(f"Cost Savings: ${response.cortex_metadata.cost_savings:.4f}")
132
+ print(f"Confidence Score: {response.cortex_metadata.confidence}%")
133
+ print(f"Semantic Integrity: {response.cortex_metadata.semantic_integrity}%")
134
+ ```
135
+
136
+ ### 🔬 Advanced Cortex Features
137
+
138
+ ```python
139
+ # Bulk optimization with Cortex
140
+ queries = [
141
+ "Explain machine learning algorithms",
142
+ "Write a React authentication component",
143
+ "Create a database migration script"
144
+ ]
145
+
146
+ results = model.bulk_generate_content(
147
+ queries,
148
+ cortex={
149
+ 'enabled': True,
150
+ 'mode': 'answer_generation',
151
+ 'batch_processing': True,
152
+ 'dynamic_instructions': True
153
+ }
154
+ )
155
+
156
+ for i, result in enumerate(results):
157
+ print(f"Query {i+1}: {result.cortex_metadata.token_reduction}% reduction")
158
+
159
+ # Context-aware processing
160
+ technical_response = model.generate_content(
161
+ "Implement a distributed caching system",
162
+ cortex={
163
+ 'enabled': True,
164
+ 'context': 'technical',
165
+ 'complexity': 'high',
166
+ 'include_examples': True,
167
+ 'code_generation': True
168
+ }
169
+ )
170
+ ```
171
+
172
+ ### 📊 Traditional vs Cortex Comparison
173
+
174
+ ```python
175
+ # Compare traditional vs Cortex processing
176
+ comparison = model.compare_cortex(
177
+ query="Write a REST API with authentication in Flask",
178
+ max_tokens=2000
179
+ )
180
+
181
+ print("=== COMPARISON RESULTS ===")
182
+ print(f"Traditional: {comparison['traditional']['tokens_used']} tokens, ${comparison['traditional']['cost']:.4f}")
183
+ print(f"Cortex: {comparison['cortex']['tokens_used']} tokens, ${comparison['cortex']['cost']:.4f}")
184
+ print(f"Savings: {comparison['savings']['token_reduction']}% tokens, ${comparison['savings']['cost_savings']:.4f}")
185
+ print(f"Semantic Integrity: {comparison['quality']['semantic_integrity']}%")
186
+ ```
187
+
103
188
  ## 🎯 Why Cost Katana?
104
189
 
190
+ ### 🧠 Cortex-Powered Intelligence
191
+ - **70-95% Token Reduction**: Revolutionary LISP-based answer generation
192
+ - **3-Stage Optimization Pipeline**: Encoder → Core Processor → Decoder
193
+ - **Dynamic LISP Instructions**: AI-powered instruction generation for any context
194
+ - **Real-time Analytics**: Confidence, cost impact, and semantic integrity metrics
195
+ - **Universal Context Handling**: Technical, business, and industry-specific processing
196
+
105
197
  ### Simple Interface, Powerful Backend
106
198
  - **One API for all providers**: Use Google Gemini, Anthropic Claude, OpenAI GPT, AWS Bedrock models through one interface
107
199
  - **No API key juggling**: Store your provider keys securely in Cost Katana, use one key in your code
108
200
  - **Automatic failover**: If one provider is down, automatically switch to alternatives
109
- - **Cost optimization**: Intelligent routing to minimize costs while maintaining quality
201
+ - **Intelligent routing**: Cortex-powered optimization to minimize costs while maintaining quality
110
202
 
111
203
  ### Enterprise Features
112
204
  - **Cost tracking**: Real-time cost monitoring and budgets
@@ -264,7 +356,7 @@ balanced_response = model.generate_content(
264
356
 
265
357
  ## 🖥️ Command Line Interface
266
358
 
267
- Cost Katana includes a CLI for easy interaction:
359
+ Cost Katana includes a comprehensive CLI for easy interaction:
268
360
 
269
361
  ```bash
270
362
  # Initialize configuration
@@ -283,6 +375,198 @@ cost-katana chat --model gemini-2.0-flash
283
375
  cost-katana chat --config my-config.json
284
376
  ```
285
377
 
378
+ ## 🧬 SAST (Semantic Abstract Syntax Tree) Features
379
+
380
+ Cost Katana includes advanced SAST capabilities for semantic optimization and analysis:
381
+
382
+ ### SAST Optimization
383
+
384
+ ```bash
385
+ # Optimize a prompt using SAST
386
+ cost-katana sast optimize "Write a detailed analysis of market trends"
387
+
388
+ # Optimize from file
389
+ cost-katana sast optimize --file prompt.txt --output optimized.txt
390
+
391
+ # Cross-lingual optimization
392
+ cost-katana sast optimize "Analyze data" --cross-lingual --language en
393
+
394
+ # Preserve ambiguity for analysis
395
+ cost-katana sast optimize "Complex query" --preserve-ambiguity
396
+ ```
397
+
398
+ ### SAST Comparison
399
+
400
+ ```bash
401
+ # Compare traditional vs SAST optimization
402
+ cost-katana sast compare "Your prompt here"
403
+
404
+ # Compare with specific language
405
+ cost-katana sast compare --file prompt.txt --language en
406
+ ```
407
+
408
+ ### SAST Vocabulary & Analytics
409
+
410
+ ```bash
411
+ # Explore SAST vocabulary
412
+ cost-katana sast vocabulary
413
+
414
+ # Search semantic primitives
415
+ cost-katana sast vocabulary --search "analysis" --category "action"
416
+
417
+ # Get SAST performance statistics
418
+ cost-katana sast stats
419
+
420
+ # View SAST showcase with examples
421
+ cost-katana sast showcase
422
+
423
+ # Telescope ambiguity demonstration
424
+ cost-katana sast telescope
425
+
426
+ # Test universal semantics across languages
427
+ cost-katana sast universal "concept" --languages "en,es,fr"
428
+ ```
429
+
430
+ ### SAST Python API
431
+
432
+ ```python
433
+ import cost_katana as ck
434
+
435
+ ck.configure(api_key='dak_your_key_here')
436
+ client = ck.CostKatanaClient()
437
+
438
+ # Optimize with SAST
439
+ result = client.optimize_with_sast(
440
+ prompt="Your prompt here",
441
+ language="en",
442
+ cross_lingual=True,
443
+ preserve_ambiguity=False
444
+ )
445
+
446
+ # Compare SAST vs traditional
447
+ comparison = client.compare_sast_vs_traditional(
448
+ prompt="Your prompt here",
449
+ language="en"
450
+ )
451
+
452
+ # Get SAST vocabulary stats
453
+ stats = client.get_sast_vocabulary_stats()
454
+
455
+ # Search semantic primitives
456
+ primitives = client.search_semantic_primitives(
457
+ term="analysis",
458
+ category="action",
459
+ limit=10
460
+ )
461
+
462
+ # Test universal semantics
463
+ universal_test = client.test_universal_semantics(
464
+ concept="love",
465
+ languages=["en", "es", "fr"]
466
+ )
467
+ ```
468
+
469
+ ## 🧠 Cortex Engine Features
470
+
471
+ Cost Katana's Cortex engine provides intelligent processing capabilities:
472
+
473
+ ### Cortex Operations
474
+
475
+ ```python
476
+ import cost_katana as ck
477
+
478
+ ck.configure(api_key='dak_your_key_here')
479
+ client = ck.CostKatanaClient()
480
+
481
+ # Enable Cortex with SAST processing
482
+ result = client.optimize_with_sast(
483
+ prompt="Your prompt",
484
+ service="openai",
485
+ model="gpt-4o-mini",
486
+ # Cortex features
487
+ enableCortex=True,
488
+ cortexOperation="sast",
489
+ cortexStyle="conversational",
490
+ cortexFormat="plain",
491
+ cortexSemanticCache=True,
492
+ cortexPreserveSemantics=True,
493
+ cortexIntelligentRouting=True,
494
+ cortexSastProcessing=True,
495
+ cortexAmbiguityResolution=True,
496
+ cortexCrossLingualMode=False
497
+ )
498
+ ```
499
+
500
+ ### Cortex Capabilities
501
+
502
+ - **Semantic Caching**: Intelligent caching of semantic representations
503
+ - **Intelligent Routing**: Smart routing based on content analysis
504
+ - **Ambiguity Resolution**: Automatic resolution of ambiguous language
505
+ - **Cross-lingual Processing**: Multi-language semantic understanding
506
+ - **Semantic Preservation**: Maintains semantic meaning during optimization
507
+
508
+ ## 🌐 Gateway Features
509
+
510
+ Cost Katana acts as a unified gateway to multiple AI providers:
511
+
512
+ ### Provider Abstraction
513
+
514
+ ```python
515
+ import cost_katana as ck
516
+
517
+ ck.configure(api_key='dak_your_key_here')
518
+
519
+ # Same interface, different providers
520
+ models = [
521
+ 'nova-lite', # Amazon Nova
522
+ 'claude-3-sonnet', # Anthropic Claude
523
+ 'gemini-2.0-flash', # Google Gemini
524
+ 'gpt-4', # OpenAI GPT
525
+ 'llama-3.1-70b' # Meta Llama
526
+ ]
527
+
528
+ for model in models:
529
+ response = ck.GenerativeModel(model).generate_content("Hello!")
530
+ print(f"{model}: {response.text[:50]}...")
531
+ ```
532
+
533
+ ### Intelligent Routing
534
+
535
+ ```python
536
+ # Cost Katana automatically routes to the best provider
537
+ model = ck.GenerativeModel('balanced') # Uses intelligent routing
538
+
539
+ # Different optimization modes
540
+ fast_response = model.generate_content(
541
+ "Quick summary",
542
+ chat_mode='fastest' # Routes to fastest provider
543
+ )
544
+
545
+ cheap_response = model.generate_content(
546
+ "Detailed analysis",
547
+ chat_mode='cheapest' # Routes to most cost-effective provider
548
+ )
549
+
550
+ balanced_response = model.generate_content(
551
+ "Complex reasoning",
552
+ chat_mode='balanced' # Balances speed and cost
553
+ )
554
+ ```
555
+
556
+ ### Failover & Redundancy
557
+
558
+ ```python
559
+ # Automatic failover if primary provider is down
560
+ model = ck.GenerativeModel('claude-3-sonnet')
561
+
562
+ try:
563
+ response = model.generate_content("Your prompt")
564
+ except ck.ModelNotAvailableError:
565
+ # Cost Katana automatically tries alternative providers
566
+ print("Primary model unavailable, using fallback...")
567
+ response = model.generate_content("Your prompt")
568
+ ```
569
+
286
570
  ## 📊 Usage Analytics
287
571
 
288
572
  Track your AI usage and costs:
@@ -400,6 +684,30 @@ class ChatSession:
400
684
  def delete_conversation(self) -> None
401
685
  ```
402
686
 
687
+ ### CostKatanaClient
688
+
689
+ ```python
690
+ class CostKatanaClient:
691
+ def __init__(self, api_key: str = None, base_url: str = None, config_file: str = None)
692
+
693
+ # Core Methods
694
+ def send_message(self, message: str, model_id: str, **kwargs) -> Dict[str, Any]
695
+ def get_available_models(self) -> List[Dict[str, Any]]
696
+ def create_conversation(self, title: str = None, model_id: str = None) -> Dict[str, Any]
697
+ def get_conversation_history(self, conversation_id: str) -> Dict[str, Any]
698
+ def delete_conversation(self, conversation_id: str) -> Dict[str, Any]
699
+
700
+ # SAST Methods
701
+ def optimize_with_sast(self, prompt: str, **kwargs) -> Dict[str, Any]
702
+ def compare_sast_vs_traditional(self, prompt: str, **kwargs) -> Dict[str, Any]
703
+ def get_sast_vocabulary_stats(self) -> Dict[str, Any]
704
+ def search_semantic_primitives(self, term: str = None, **kwargs) -> Dict[str, Any]
705
+ def get_telescope_demo(self) -> Dict[str, Any]
706
+ def test_universal_semantics(self, concept: str, languages: List[str] = None) -> Dict[str, Any]
707
+ def get_sast_stats(self) -> Dict[str, Any]
708
+ def get_sast_showcase(self) -> Dict[str, Any]
709
+ ```
710
+
403
711
  ### GenerateContentResponse
404
712
 
405
713
  ```python
@@ -409,6 +717,20 @@ class GenerateContentResponse:
409
717
  thinking: Dict # AI reasoning (if available)
410
718
  ```
411
719
 
720
+ ### UsageMetadata
721
+
722
+ ```python
723
+ class UsageMetadata:
724
+ model: str # Model used
725
+ cost: float # Cost in USD
726
+ latency: float # Response time in seconds
727
+ total_tokens: int # Total tokens used
728
+ cache_hit: bool # Whether response was cached
729
+ risk_level: str # Risk assessment level
730
+ agent_path: List[str] # Multi-agent processing path
731
+ optimizations_applied: List[str] # Applied optimizations
732
+ ```
733
+
412
734
  ## 🤝 Support
413
735
 
414
736
  - **Documentation**: [docs.costkatana.com](https://docs.costkatana.com)
@@ -14,10 +14,10 @@ with open("requirements.txt", "r", encoding="utf-8") as fh:
14
14
 
15
15
  setup(
16
16
  name="cost-katana",
17
- version="1.0.2",
17
+ version="2.0.0",
18
18
  author="Cost Katana Team",
19
19
  author_email="abdul@hypothesize.tech",
20
- description="Unified AI interface with cost optimization and failover",
20
+ description="Revolutionary AI SDK with Cortex Meta-Language for 70-95% token reduction",
21
21
  long_description=long_description,
22
22
  long_description_content_type="text/markdown",
23
23
  url="https://github.com/Hypothesize-Tech/cost-katana-python",
@@ -38,7 +38,7 @@ setup(
38
38
  ],
39
39
  python_requires=">=3.8",
40
40
  install_requires=requirements,
41
- keywords="ai, machine learning, cost optimization, openai, anthropic, aws bedrock, gemini",
41
+ keywords="ai, machine learning, cost optimization, cortex, lisp, token reduction, openai, anthropic, aws bedrock, gemini, claude opus",
42
42
  project_urls={
43
43
  "Bug Reports": "https://github.com/Hypothesize-Tech/cost-katana-python/issues",
44
44
  "Source": "https://github.com/Hypothesize-Tech/cost-katana-python",
File without changes
File without changes
File without changes