flock-core 0.5.0b54__py3-none-any.whl → 0.5.0b55__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.


Metadata-Version: 2.4
Name: flock-core
Version: 0.5.0b54
Summary: Add your description here
Author-email: Andre Ratzenberger <andre.ratzenberger@whiteduck.de>
License-File: LICENSE
Requires-Python: >=3.10
Requires-Dist: devtools>=0.12.2
Requires-Dist: dspy==3.0.0
Requires-Dist: duckdb>=1.1.0
Requires-Dist: fastapi>=0.117.1
Requires-Dist: httpx>=0.28.1
Requires-Dist: litellm==1.75.3
Requires-Dist: loguru>=0.7.3
Requires-Dist: mcp>=1.7.1
Requires-Dist: opentelemetry-api>=1.30.0
Requires-Dist: opentelemetry-exporter-jaeger-proto-grpc>=1.21.0
Requires-Dist: opentelemetry-exporter-jaeger>=1.21.0
Requires-Dist: opentelemetry-exporter-otlp>=1.30.0
Requires-Dist: opentelemetry-instrumentation-logging>=0.51b0
Requires-Dist: opentelemetry-sdk>=1.30.0
Requires-Dist: poethepoet>=0.30.0
Requires-Dist: pydantic[email]>=2.11.9
Requires-Dist: rich>=14.1.0
Requires-Dist: toml>=0.10.2
Requires-Dist: typer>=0.19.2
Requires-Dist: uvicorn>=0.37.0
Requires-Dist: websockets>=15.0.1
Description-Content-Type: text/markdown

<p align="center">
<img alt="Flock Banner" src="https://raw.githubusercontent.com/whiteducksoftware/flock/master/docs/assets/images/flock.png" width="800">
</p>
<p align="center">
<a href="https://pypi.org/project/flock-core/" target="_blank"><img alt="PyPI Version" src="https://img.shields.io/pypi/v/flock-core?style=for-the-badge&logo=pypi&label=pip%20version"></a>
<img alt="Python Version" src="https://img.shields.io/badge/python-3.10%2B-blue?style=for-the-badge&logo=python">
<a href="https://github.com/whiteducksoftware/flock/blob/master/LICENSE" target="_blank"><img alt="License" src="https://img.shields.io/pypi/l/flock-core?style=for-the-badge"></a>
<a href="https://whiteduck.de" target="_blank"><img alt="Built by white duck" src="https://img.shields.io/badge/Built%20by-white%20duck%20GmbH-white?style=for-the-badge&labelColor=black"></a>
<a href="https://www.linkedin.com/company/whiteduck" target="_blank"><img alt="LinkedIn" src="https://img.shields.io/badge/linkedin-%230077B5.svg?style=for-the-badge&logo=linkedin&logoColor=white&label=whiteduck"></a>
<a href="https://bsky.app/profile/whiteduck-gmbh.bsky.social" target="_blank"><img alt="Bluesky" src="https://img.shields.io/badge/bluesky-Follow-blue?style=for-the-badge&logo=bluesky&logoColor=%23fff&color=%23333&labelColor=%230285FF&label=whiteduck-gmbh"></a>
</p>

---

# 🚀 Flock 0.5: Agent Systems Without the Graphs

> **What if agents collaborated like experts at a whiteboard—not like nodes in a rigid workflow?**

---

## The Problem You Know Too Well

🤯 **Prompt Hell**: Brittle 500-line prompts that break with every model update
💥 **System Failures**: One bad LLM response crashes your entire workflow
🧪 **Testing Nightmares**: "How do I unit test a prompt?" (You don't.)
📏 **Measuring Quality**: "How do I know my prompts are optimal?" (You also don't.)
📄 **Output Chaos**: Parsing unstructured LLM responses into reliable data
⛓️ **Orchestration Limits**: Graph-based frameworks create rigid, tightly-coupled systems
🚀 **Production Gap**: Jupyter notebooks don't scale to enterprise systems
🔓 **No Security Model**: Every agent sees everything—no access controls

**The tooling is fundamentally broken. It's time for a better approach.**

Most of these issues are solvable, because decades of experience with microservices taught us hard lessons about decoupling, orchestration, and reliability.

**Let's bring those lessons to AI agents!**

---

## The Flock Solution: Declarative + Blackboard Architecture

**What if you could skip the 'prompt engineering' step AND avoid rigid workflow graphs?**

Flock 0.5 combines **declarative AI workflows** with **blackboard architecture**—the pattern that has powered groundbreaking AI systems since the 1970s (the Hearsay-II speech recognition system at CMU).

### ✅ Declarative at Heart

**No natural language prompts. No brittle instructions. Just type-safe contracts.**

```python
from flock_flow.orchestrator import Flock
from flock_flow.registry import flock_type
from pydantic import BaseModel

@flock_type
class MyDreamPizza(BaseModel):
    pizza_idea: str

@flock_type
class Pizza(BaseModel):
    ingredients: list[str]
    size: str
    crust_type: str
    step_by_step_instructions: list[str]

# Create orchestrator
flock = Flock("openai/gpt-4o")

# Define agent with ZERO natural language
pizza_master = (
    flock.agent("pizza_master")
    .consumes(MyDreamPizza)
    .publishes(Pizza)
)
```

**Hard-binding type contracts will even work with GPT-4729.**

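Kicking off this agent is just a matter of putting a typed artifact on the blackboard. A minimal sketch, using the `publish()` and `run_until_idle()` calls shown in the fuller examples further down (the pizza idea is a made-up placeholder):

```python
import asyncio

async def main():
    # Put a typed artifact on the blackboard; pizza_master reacts to it
    await flock.publish(MyDreamPizza(pizza_idea="a spicy four-cheese pizza"))
    await flock.run_until_idle()  # wait until no agent has work left

asyncio.run(main())
```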
<p align="center">
<img alt="Flock Blackboard" src="docs/img/pizza.png" width="1000">
</p>

### ✅ Key Advantages

✅ **Declarative Contracts**: Define inputs/outputs with Pydantic models. Flock handles the LLM complexity.
⚡ **Built-in Resilience**: Blackboard persists context—agents crash? They recover and resume.
🧪 **Actually Testable**: Clear contracts make agents unit-testable like any other code (see the test sketch below)
🔐 **Zero-Trust Security**: 5 built-in visibility types (Public, Private, Tenant, Label-based, Time-delayed)
🚀 **Dynamic Workflows**: Self-correcting loops, conditional routing, intelligent decision-making
🔧 **Production-Ready**: Real-time dashboard, WebSocket streaming, 743 passing tests
📊 **True Observability**: Agent View + Blackboard View with full data lineage

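What "testable" means in practice: the building blocks of an agent, Pydantic contracts and plain Python predicates, can be exercised with ordinary `pytest` and no LLM call at all. A minimal sketch; `is_high_quality` is a hypothetical routing predicate, not a framework API:

```python
import pytest
from pydantic import BaseModel, Field, ValidationError

class Review(BaseModel):
    text: str = Field(max_length=1000)
    score: int = Field(ge=1, le=10)

def is_high_quality(review: Review) -> bool:
    # The kind of predicate you would pass to .consumes(Review, where=...)
    return review.score > 8

def test_contract_rejects_out_of_range_scores():
    with pytest.raises(ValidationError):
        Review(text="ok", score=11)  # violates ge=1, le=10

def test_routing_predicate():
    assert is_high_quality(Review(text="great", score=9))
    assert not is_high_quality(Review(text="meh", score=5))
```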
---

## Why Graphs Fail (and Blackboards Win)

### The Problem with Graph-Based Frameworks

**LangGraph. CrewAI. AutoGen.** They all make the same fundamental mistake: **treating agent collaboration as a directed graph**.

```python
# ❌ The Graph-Based Way (LangGraph, CrewAI, etc.)
workflow.add_edge("agent_a", "agent_b")  # Tight coupling!
workflow.add_edge("agent_b", "agent_c")  # Predefined flow!

# What happens when you need to:
# - Add agent_d that consumes data from agent_a?
# - Run agent_b and agent_c in parallel?
# - Route conditionally based on agent_a's output quality?
# Answer: Rewrite the graph. Again. And again.
```

**Why graphs fail at scale:**

- 🔗 **Tight coupling**: Agents hardcode their successors
- 📐 **Rigid topology**: Adding an agent means rewiring the graph
- 🐌 **Sequential thinking**: Even independent agents wait in line
- 🧪 **Testing nightmare**: Can't test agents in isolation
- 🔓 **No security model**: Every agent sees everything
- 📈 **Doesn't scale**: 20+ agents = spaghetti graph
- 💀 **Single point of failure**: Orchestrator dies? Everything dies.
- 🧠 **God object anti-pattern**: One orchestrator needs domain knowledge of 20+ agents to route correctly
- 📦 **No context resilience**: Agent crashes? Context disappears. No recovery.

**This is workflow orchestration dressed up as "agent systems."**

---

### The Blackboard Alternative: How Experts Actually Collaborate

<p align="center">
<img alt="Flock Blackboard" src="docs/img/flock_ui_blackboard_view.png" width="1000">
</p>

Watch a team of specialists solve a complex problem:

1. **Radiologist** posts X-ray analysis on the whiteboard
2. **Lab tech** sees it, adds blood work results
3. **Diagnostician** waits for BOTH, then posts diagnosis
4. **Pharmacist** reacts to diagnosis, suggests treatment

**No one manages the workflow.** No directed graph. Just specialists reacting to relevant information appearing on a shared workspace.

**This is the blackboard pattern—proven since the 1970s (Hearsay-II speech recognition system at CMU).**

**Why this matters:**
- **Context IS the blackboard**: All state lives in one place, not scattered across agents
- **Crash resilience**: Agent dies? Blackboard persists. Restart agent, it picks up where it left off.
- **100% decoupled**: Agents don't know about each other. They only know data types.
- **Microservices lessons applied**: We learned in the 2000s that tight coupling kills scalability. Blackboards apply that wisdom to AI agents.

---

## 🎯 Flock 0.5: Blackboard-First Architecture

```python
from flock_flow.orchestrator import Flock
from flock_flow.registry import flock_type
from pydantic import BaseModel

# 1. Define typed artifacts (what goes on the blackboard)
@flock_type
class PatientScan(BaseModel):
    patient_id: str  # additional fields omitted for brevity

@flock_type
class XRayAnalysis(BaseModel):
    finding: str
    confidence: float

@flock_type
class LabResults(BaseModel):
    markers: dict[str, float]

@flock_type
class Diagnosis(BaseModel):
    condition: str
    reasoning: str

# 2. Create orchestrator (the blackboard)
orchestrator = Flock("openai/gpt-4o")

# 3. Agents subscribe to what they care about (NO explicit workflow!)
radiologist = (
    orchestrator.agent("radiologist")
    .consumes(PatientScan)
    .publishes(XRayAnalysis)
)

lab_tech = (
    orchestrator.agent("lab_tech")
    .consumes(PatientScan)
    .publishes(LabResults)
)

diagnostician = (
    orchestrator.agent("diagnostician")
    .consumes(XRayAnalysis, LabResults)  # Waits for BOTH!
    .publishes(Diagnosis)
)

# 4. Publish input, agents react opportunistically
await orchestrator.publish(PatientScan(patient_id="12345", ...))
await orchestrator.run_until_idle()
```

**What just happened:**
- Radiologist and lab_tech ran **in parallel** (both consume PatientScan)
- Diagnostician **automatically waited** for both to finish
- **No workflow graph.** No `.add_edge()`. Just subscriptions.
- Add new agents? Just subscribe them. No rewiring (see the sketch below).

**Resilience built-in:**
- Lab agent crashes? Blackboard still has XRayAnalysis. Restart the lab agent and it processes the scan again.
- No "orchestrator god object" deciding which agent runs when—agents decide themselves based on what's on the blackboard.
- Context lives on the blackboard, not in memory. Agents are stateless and recoverable.

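To make the "no rewiring" point concrete, here is a sketch of adding the pharmacist from the story above as a fourth specialist. `TreatmentPlan` is a hypothetical artifact type invented for this example; the existing agents are untouched:

```python
@flock_type
class TreatmentPlan(BaseModel):
    medication: str
    notes: str

# New specialist: reacts to any Diagnosis that appears on the blackboard.
# radiologist, lab_tech, and diagnostician need no changes at all.
pharmacist = (
    orchestrator.agent("pharmacist")
    .consumes(Diagnosis)
    .publishes(TreatmentPlan)
)
```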
---

## 🔥 Why Blackboard Beats Graphs

| Dimension | Graph-Based (LangGraph, CrewAI) | Blackboard (Flock 0.5) |
|-----------|--------------------------------|------------------------|
| **Add new agent** | Rewrite graph, update edges | Just subscribe to types |
| **Parallel execution** | Manual (split nodes, join nodes) | Automatic (multiple consumers) |
| **Conditional routing** | Complex graph branches | `where=lambda x: x.score > 8` |
| **Testing** | Need full graph setup | Test agents in isolation |
| **Security** | Add-on (if exists) | Built-in (5 visibility types) |
| **Coupling** | Tight (agents know successors) | Loose (agents know types) |
| **Scalability** | O(n²) edges at 20+ agents | O(n) subscriptions |
| **Mental model** | "Draw the workflow" | "What data triggers this?" |
| **Context management** | Scattered across agents | **Blackboard IS the context** |
| **Resilience** | Agent crash = data loss | **Blackboard persists, agents recover** |
| **Orchestrator pattern** | **God object with domain knowledge** | **Agents decide autonomously** |
| **Single point of failure** | Orchestrator dies = everything dies | **Agents independent, blackboard survives** |
| **Architecture wisdom** | Ignores 20 years of microservices | **Applies decoupling lessons learned** |

---

## 💡 Core Concepts: Rethinking Agent Coordination

### 1. Typed Artifacts (Not Unstructured Messages)

**Graph frameworks:** Agents pass dictionaries or unstructured text.

```python
# ❌ LangGraph/CrewAI style
agent_a.output = {"result": "some text", "score": 8}  # What's the schema?
```

**Flock 0.5:** Every artifact is a validated Pydantic model.

```python
# ✅ Flock 0.5 style
@flock_type
class Review(BaseModel):
    text: str = Field(max_length=1000)
    score: int = Field(ge=1, le=10)
    confidence: float = Field(ge=0.0, le=1.0)

# Constraint violations are caught at validation time, not deep inside your pipeline!
```

**Benefits:**
- ✅ **Debuggable**: Strong typing catches errors at development time
- ✅ **Measurable**: Validate outputs against explicit schemas (see the example below)
- ✅ **Migratable**: Type contracts survive model upgrades (GPT-4 → GPT-6)
- ✅ **Testable**: Mock inputs/outputs with concrete types

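As a concrete illustration of "measurable": the raw dictionary an LLM hands back can be checked against the `Review` contract above with plain Pydantic v2 calls (the payloads here are made up):

```python
from pydantic import ValidationError

good = {"text": "some text", "score": 8, "confidence": 0.9}
review = Review.model_validate(good)  # ✅ passes the declared schema

try:
    Review.model_validate({"text": "oops", "score": 42, "confidence": 2.0})
except ValidationError as err:
    print(err)  # ❌ score and confidence violate the declared bounds
```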
---

### 2. Subscriptions (Not Edges)

**Graph frameworks:** Explicit edges define flow.

```python
# ❌ LangGraph style
graph.add_edge("review_agent", "high_quality_handler")
graph.add_edge("review_agent", "low_quality_handler")  # How to route?
```

**Flock 0.5:** Declarative subscriptions define reactions.

```python
# ✅ Flock 0.5 style
high_quality = orchestrator.agent("high_quality").consumes(
    Review,
    where=lambda r: r.score > 8  # Conditional routing!
)

low_quality = orchestrator.agent("low_quality").consumes(
    Review,
    where=lambda r: r.score <= 8
)

# Both subscribe to Review, predicate determines who fires
```

---

### 3. Visibility Controls (Not Open Access)

**Graph frameworks:** Any agent can see any data.

**Flock 0.5:** Producer-controlled access to artifacts.

```python
# Multi-tenancy (customer data isolation)
agent.publishes(
    CustomerData,
    visibility=TenantVisibility(tenant_id="customer_123")
)

# Private (allowlist)
agent.publishes(
    SensitiveData,
    visibility=PrivateVisibility(agents={"compliance_agent"})
)

# Time-delayed (embargo periods)
artifact.visibility = AfterVisibility(
    ttl=timedelta(hours=24),
    then=PublicVisibility()
)

# Label-based RBAC
artifact.visibility = LabelledVisibility(
    required_labels={"clearance:secret"}
)
```

**Why this matters:** Financial services, healthcare, SaaS platforms NEED this for compliance.

---

### 4. Opportunistic Execution (Not Sequential Workflows)

**Graph frameworks:** Define start node, execute path.

```python
# ❌ LangGraph style
result = graph.invoke({"input": "..."}, config={"start": "node_a"})
# Executes: node_a → node_b → node_c (even if b and c are independent!)
```

**Flock 0.5:** Publish data, all matching agents fire (in parallel if independent).

```python
# ✅ Flock 0.5 style
await orchestrator.publish(Review(text="Great product!", score=9))

# Three agents all consume Review, run concurrently:
# - sentiment_analyzer
# - rating_validator
# - summary_generator

await orchestrator.run_until_idle()  # Waits for all agents
```

---

## 🔥 What You Get With Flock 0.5

<p align="center">
<img alt="Flock Banner" src="docs/img/flock_ui_agent_view.png" width="1000">
</p>

### ✅ Production Safety Built-In

```python
# Prevent infinite feedback loops
agent = (
    orchestrator.agent("processor")
    .consumes(Document)
    .publishes(Document)  # Could trigger itself!
    .prevent_self_trigger(True)  # But won't! ✅
)

# Circuit breaker for runaway agents
orchestrator = Flock(max_agent_iterations=1000)  # Automatic failsafe

# Configuration validation
agent.best_of(150, ...)  # ⚠️ Warns: "best_of(150) is very high"
```

**Graph frameworks:** No built-in loop prevention. No circuit breakers. Silent failures.

---

### ✅ Real-Time Observability

```python
# One line to activate dashboard
await orchestrator.serve(dashboard=True)
```

**What you get:**
- 🎯 **Agent View**: Live graph of agents and message flows
- 📋 **Blackboard View**: Transformation edges showing data lineage
- 🎛️ **Control Panel**: Publish artifacts and invoke agents from UI
- 📊 **EventLog Module**: Searchable, sortable event history
- ⌨️ **Keyboard Shortcuts**: Full accessibility (Ctrl+/ for help)
- 🔍 **Auto-Filter**: Correlation ID tracking

**Graph frameworks:** Basic logging at best. No real-time visualization.

---

### ✅ Advanced Execution Strategies

```python
# Best-of-N execution (run agent 5x, pick best)
agent.best_of(5, score=lambda r: r.metrics["confidence"])

# Exclusive delivery (lease-based, exactly-once)
agent.consumes(Task, delivery="exclusive")

# Batch processing (accumulate 10 items before triggering)
agent.consumes(Event, batch=BatchSpec(size=10, timeout=timedelta(seconds=30)))

# Join operations (wait for multiple artifact types)
agent.consumes(Review, Rating, join=JoinSpec(within=timedelta(minutes=5)))
```

**Graph frameworks:** None of these patterns exist.

---

## ⚡ Quick Start

```bash
# Install
pip install flock-flow

# Set API key
export OPENAI_API_KEY="sk-..."
export DEFAULT_MODEL="openai/gpt-4o"
```

**Your First Blackboard System (60 seconds):**

```python
import asyncio
from pydantic import BaseModel, Field
from flock_flow.orchestrator import Flock
from flock_flow.registry import flock_type

# 1. Define typed artifacts
@flock_type
class Idea(BaseModel):
    topic: str
    genre: str

@flock_type
class Movie(BaseModel):
    title: str = Field(description="Title in CAPS")
    runtime: int = Field(ge=60, le=400)
    synopsis: str

@flock_type
class Tagline(BaseModel):
    line: str

# 2. Create orchestrator (the blackboard)
orchestrator = Flock("openai/gpt-4o")

# 3. Agents subscribe to types (NO workflow graph!)
movie = (
    orchestrator.agent("movie")
    .description("Generate a compelling movie concept.")
    .consumes(Idea)
    .publishes(Movie)
)

tagline = (
    orchestrator.agent("tagline")
    .description("Write a one-sentence marketing tagline.")
    .consumes(Movie)  # Auto-chains after movie!
    .publishes(Tagline)
)

# 4. Run with real-time dashboard
async def main():
    await orchestrator.serve(dashboard=True)

asyncio.run(main())
```

**Publish an artifact:**
```bash
curl -X POST http://localhost:8000/api/control/publish \
  -H "Content-Type: application/json" \
  -d '{"type_name": "Idea", "payload": {"topic": "AI cats", "genre": "comedy"}}'
```

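Prefer Python over curl? The same request can be sent with `httpx` (already one of Flock's dependencies); this is just an illustrative translation of the call above:

```python
import httpx

resp = httpx.post(
    "http://localhost:8000/api/control/publish",
    json={"type_name": "Idea", "payload": {"topic": "AI cats", "genre": "comedy"}},
)
print(resp.status_code, resp.text)
```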
**Watch it execute:**
1. `movie` agent consumes `Idea`, publishes `Movie`
2. `tagline` agent automatically reacts (subscribed to `Movie`)
3. Dashboard shows live execution with full lineage
4. No graph wiring. Just subscriptions.

---

## 🚀 Enterprise Use Cases

### Financial Services: Real-Time Risk Monitoring

```python
# 20+ agents monitoring different market signals
volatility = orchestrator.agent("volatility").consumes(
    MarketData,
    where=lambda m: m.volatility > 0.5
).publishes(VolatilityAlert)

sentiment = orchestrator.agent("sentiment").consumes(
    NewsArticle,
    text="market crash",
    min_p=0.9
).publishes(SentimentAlert)

# Execution agent waits for BOTH signals
execute = orchestrator.agent("execute").consumes(
    VolatilityAlert,
    SentimentAlert,
    join=JoinSpec(within=timedelta(minutes=5))
).publishes(TradeOrder)

# Complete audit trail for regulators ✅
# Multi-agent decision making ✅
# Real-time risk correlation ✅
```

---

### Healthcare: Multi-Modal Clinical Decision Support

```python
# Different specialists contribute to diagnosis
radiology.publishes(
    XRayAnalysis,
    visibility=PrivateVisibility(agents=["diagnosis_agent"])  # HIPAA!
)

lab.publishes(
    LabResults,
    visibility=TenantVisibility(tenant_id="patient_123")  # Multi-tenancy!
)

# Diagnostician waits for both inputs
diagnosis.consumes(XRayAnalysis, LabResults).publishes(
    Diagnosis,
    visibility=PrivateVisibility(agents=["physician", "pharmacist"])
)

# Built-in access controls ✅
# Full data lineage ✅
# Compliance-ready ✅
```

---

### E-Commerce: 50-Agent Personalization Engine

```python
# Parallel signal analysis (all run concurrently!)
for signal in ["browsing", "purchase", "reviews", "social", "email", ...]:
    orchestrator.agent(f"{signal}_analyzer").consumes(UserEvent).publishes(Signal)

# Recommendation engine consumes ALL signals (batched)
recommender = orchestrator.agent("recommender").consumes(
    Signal,
    batch=BatchSpec(size=50, timeout=timedelta(seconds=1))
).publishes(Recommendation)

# Add new signal? Just create agent, no graph rewiring ✅
# Scale to 100+ agents? Linear complexity ✅
```

---

## 🗺️ Roadmap

**✅ Phase 1: Core Framework (DONE - v0.5.0)**
- [x] Blackboard orchestrator with typed artifacts
- [x] Sequential + parallel execution
- [x] Visibility controls (5 types)
- [x] Real-time dashboard with WebSocket streaming
- [x] Safety features (circuit breaker, feedback prevention)
- [x] 743 tests, 77.65% coverage

**🚧 Phase 2: Roadmap to 1.0 (Q1 2026)**
- [ ] **YAML/JSON Serialization** - Export/import full orchestrators
- [ ] **LLM-Powered Routing** - AI agent selection based on context
- [ ] **Batch API** - Process DataFrames/CSV files
- [ ] **Advanced Predicates** - Complex subscription logic
- [ ] **CLI Tool** - Management console
- [ ] Persistent blackboard (Redis/Postgres)
- [ ] Event log replay (Kafka)
- [ ] Distributed orchestration (multi-region)
- [ ] OAuth/SSO for dashboard
- [ ] Audit trail export (compliance)

**📅 Phase 3: Post 1.0 ideas**
- [ ] Migration tool (auto-convert from LangGraph/CrewAI)
- [ ] Template marketplace
- [ ] VS Code extension

---

## 📚 What's Built-In

✅ **LLM Provider Support** - LiteLLM (OpenAI, Anthropic, Azure, Google, etc.)
✅ **DSPy Integration** - Prompt optimization and structured outputs
✅ **MCP Protocol** - Model Context Protocol servers
✅ **Tool System** - Function calling with any LLM
✅ **Pydantic Models** - Type validation with Field constraints
✅ **Rich Output** - Beautiful console themes
✅ **FastAPI Service** - Production-grade HTTP API
✅ **Streaming** - Real-time LLM output
✅ **Async-First** - True concurrent execution

---

## 🔬 Production Quality

| Metric | Graph Frameworks | Flock 0.5 |
|--------|------------------|-----------|
| Test Coverage | Varies | **77.65%** (743 tests) |
| Critical Path Coverage | Unknown | **86-100%** |
| E2E Tests | Few | 6 comprehensive scenarios |
| Safety Features | None/Manual | Circuit breaker, feedback prevention |
| Real-time Monitoring | None/Basic | WebSocket streaming dashboard |
| Security | Add-on | 5 built-in visibility types |
| Documentation | Good | Excellent (AGENTS.md + examples) |

---

## 🔍 Observability & Debugging

### Built-in OpenTelemetry Tracing with DuckDB

Flock includes **production-ready distributed tracing** powered by OpenTelemetry and DuckDB—enabling AI-assisted debugging and performance analysis.

**Why DuckDB?** It's a columnar analytical database **10-100x faster than SQLite** for trace analytics. No external services, no Docker—just a single embedded database file.

> **📊 Production Status**: 85% Production-Ready | [View Assessment](docs/TRACING_PRODUCTION_READINESS.md)
>
> ✅ Complete architecture • ✅ Zero-config storage • ✅ Comprehensive UI • ⚠️ Add auth before production

### Enable Tracing

```bash
# Enable auto-tracing for all agents
export FLOCK_AUTO_TRACE=true
export FLOCK_TRACE_FILE=true

# Run your application
python your_app.py

# Traces stored in: .flock/traces.duckdb
```

### Filtering: Control What Gets Traced

Use whitelist/blacklist filtering to reduce overhead and avoid tracing noisy operations like streaming tokens:

```bash
# Trace only core services (recommended for production)
export FLOCK_TRACE_SERVICES='["flock", "agent", "dspyengine", "outpututilitycomponent"]'

# Exclude specific noisy operations
export FLOCK_TRACE_IGNORE='["DashboardEventCollector.set_websocket_manager"]'
```

**How filtering works:**
- **Whitelist** (`FLOCK_TRACE_SERVICES`): Only trace specified classes (case-insensitive)
- **Blacklist** (`FLOCK_TRACE_IGNORE`): Never trace specific operations (exact match)
- Filtering happens **before** span creation for near-zero overhead

📖 **Full documentation:** [docs/AUTO_TRACING.md](docs/AUTO_TRACING.md)

### Real-Time Trace Viewer

<p align="center">
<img alt="Trace Viewer" src="docs/img/trace_viewer.png" width="1000">
</p>

The dashboard includes a **production-ready trace viewer** with **7 powerful view modes**:

- 📅 **Timeline**: Waterfall visualization showing execution flow and span hierarchies
- 📊 **Statistics**: Sortable table view with durations, span counts, and error tracking
- 🔴 **RED Metrics**: Rate, Errors, Duration monitoring for service health
- 🔗 **Dependencies**: Service-to-service communication with operation-level drill-down
- 🗄️ **DuckDB SQL**: Interactive SQL query editor with CSV export for custom analytics
- ⚙️ **Configuration**: Real-time service/operation filtering without restarts
- 📚 **Guide**: Built-in documentation and query examples

**Additional Features:**
- **Smart Sorting**: Sort traces by date, span count, or duration with visual indicators
- **CSV Export**: Download query results for offline analysis and reporting
- **Maximize Mode**: Full-screen view for deep data exploration
- **Multi-Trace Support**: Open and compare multiple traces simultaneously
- **Full I/O Capture**: Complete input/output data with collapsible JSON viewer

### AI-Powered Debugging

**AI agents (including Claude Code) can query your traces directly:**

```python
import duckdb

conn = duckdb.connect('.flock/traces.duckdb', read_only=True)

# Find slow operations
slow_ops = conn.execute("""
    SELECT name, AVG(duration_ms) as avg_duration
    FROM spans
    WHERE duration_ms > 1000
    GROUP BY name
    ORDER BY avg_duration DESC
""").fetchall()

# Find errors with their inputs
errors = conn.execute("""
    SELECT name, status_description,
           json_extract(attributes, '$.input.message') as input
    FROM spans
    WHERE status_code = 'ERROR'
""").fetchall()

# Performance analysis by service
perf = conn.execute("""
    SELECT service,
           COUNT(*) as calls,
           AVG(duration_ms) as avg_ms,
           MAX(duration_ms) as max_ms,
           PERCENTILE_CONT(0.95) WITHIN GROUP (ORDER BY duration_ms) as p95_ms
    FROM spans
    GROUP BY service
""").fetchall()
```

**Example AI-assisted debugging:**
```
You: "My pizza agent is slow, help me find why"
AI: [queries DuckDB] "The DSPyEngine.evaluate span takes 23s on average.
     Checking input attributes... You're passing 50KB of conversation history.
     Recommendation: Limit context window to last 5 messages."
```

### What Gets Traced

**Every operation is automatically traced with:**

✅ Full input arguments (with JSON serialization)
✅ Complete output values
✅ Duration and timestamps
✅ Parent-child span relationships
✅ Service and operation names
✅ Error messages and stack traces
✅ Agent metadata (name, description)
✅ Correlation IDs for request tracking

**No manual instrumentation required—just enable `FLOCK_AUTO_TRACE=true`.**

### Performance Analytics

```sql
-- Find bottlenecks
SELECT name, service, duration_ms
FROM spans
WHERE duration_ms > 5000
ORDER BY start_time DESC;

-- Track P95 latency by operation
SELECT operation,
       PERCENTILE_CONT(0.95) WITHIN GROUP (ORDER BY duration_ms) as p95
FROM spans
WHERE service = 'DSPyEngine'
GROUP BY operation;

-- Error rate by service
SELECT service,
       COUNT(*) as total,
       SUM(CASE WHEN status_code = 'ERROR' THEN 1 ELSE 0 END) as errors,
       (errors * 100.0 / total) as error_rate
FROM spans
GROUP BY service;
```

### Production Monitoring

Deploy Flock with **OTEL exporters** to send traces to your observability platform:

```bash
# Send to Grafana Tempo/Loki
export OTEL_EXPORTER_OTLP_ENDPOINT="http://tempo:4317"
export FLOCK_AUTO_TRACE=true

# Or use local DuckDB + periodic exports
export FLOCK_TRACE_FILE=true
```

826
- ---
827
-
828
- ## 🤝 Contributing
829
-
830
- We're building Flock 0.5 in the open! See [`AGENTS.md`](AGENTS.md) for development setup and debugging guide.
831
-
832
- ```bash
833
- git clone https://github.com/whiteducksoftware/flock-flow.git
834
- cd flock-flow
835
- uv sync
836
- uv run pytest # 743 tests pass!
837
- ```
838
-
839
- ---
840
-
841
- ## 💬 Community & Support
842
-
843
- - **GitHub Issues:** [Report bugs or request features](https://github.com/whiteducksoftware/flock-flow/issues)
844
- - **Discussions:** [Ask questions or share ideas](https://github.com/whiteducksoftware/flock-flow/discussions)
845
- - **Documentation:** [Full docs and examples](https://whiteducksoftware.github.io/flock/)
846
- - **Email:** [support@whiteduck.de](mailto:support@whiteduck.de)
847
-
848
- ---
849
-
850
- ## 🌟 Why "0.5"?
851
-
852
- We're calling this 0.5 to signal:
853
-
854
- 1. **It's production-ready** - 743 tests, enterprise features, dashboard
855
- 2. **It's still evolving** - Some advanced features coming in Q1/Q2 2026
856
- 3. **It's the future** - Blackboard architecture scales better than graphs
857
-
858
- **1.0 will arrive** when we've added advanced routing, serialization, and enterprise persistence.
859
-
860
- ---
861
-
862
- ## 🔖 The Bottom Line
863
-
864
- **Graph-based frameworks** treat agents like nodes in a workflow. Rigid. Sequential. Hard to scale.
865
-
866
- **Flock 0.5** combines **declarative AI workflows** with **blackboard architecture**:
867
- - ✅ No brittle prompts (type-safe contracts)
868
- - ✅ No rigid graphs (opportunistic execution)
869
- - ✅ No testing nightmares (unit-testable agents)
870
- - ✅ No security gaps (5 visibility types)
871
- - ✅ No production fears (743 tests, real-time monitoring)
872
-
873
- **The future of AI agents isn't workflows—it's declarative blackboards.**
874
-
875
- **Try it. You'll never go back to graphs.**
876
-
877
- ---
878
-
879
- <div align="center">
880
-
881
- **Built with ❤️ by white duck GmbH**
882
-
883
- **"Agents are just microservices. Let's treat them that way."**
884
-
885
- [⭐ Star us on GitHub](https://github.com/whiteducksoftware/flock-flow) | [📖 Read the Docs](https://whiteducksoftware.github.io/flock/) | [🚀 Try Examples](examples/)
886
-
887
- </div>
888
-
889
- ---
890
-
891
- ## 📊 Framework Comparison
892
-
893
- | | LangGraph | CrewAI | AutoGen | Flock 0.5 |
894
- |-|-----------|---------|---------|-----------|
895
- | **Pattern** | Directed Graph | Sequential Tasks | Chat-Based | Blackboard |
896
- | **Coordination** | Explicit edges | Task context | Messages | Subscriptions |
897
- | **Parallelism** | Manual (split/join) | None | None | Automatic |
898
- | **Type Safety** | TypedDict | None | None | Pydantic |
899
- | **Security** | None | None | None | 5 visibility types |
900
- | **Conditional** | Route functions | Manual | Manual | `where=lambda` |
901
- | **Testing** | Full graph | Full crew | Full group | Isolated agents |
902
- | **Real-time UI** | None | None | None | WebSocket streaming |
903
- | **Feedback Prevention** | Manual | Manual | Manual | Automatic |
904
- | **Add Agent** | Rewrite graph | Rewrite tasks | Rewrite group | Just subscribe |
905
- | **Learning Curve** | Medium | Easy | Easy | Medium |
906
- | **Scalability** | 10-20 agents | 5-10 agents | 5-10 agents | 100+ agents |
907
-
908
- ---
909
-
910
- **Last Updated:** October 6, 2025
911
- **Version:** Flock 0.5.0 (Blackboard Edition) / flock-flow 0.1.20
912
- **Status:** Production-Ready, Active Development
913
-
914
- ---
915
-
916
- **"The blackboard pattern has been battle-tested for 50 years. Declarative contracts eliminate prompt hell. Together, they're the future of AI agents."**