haiku.rag 0.14.0__py3-none-any.whl → 0.19.3__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- README.md +28 -61
- {haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/METADATA +32 -63
- haiku_rag-0.19.3.dist-info/RECORD +6 -0
- {haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/WHEEL +1 -1
- haiku_rag-0.14.0.dist-info/RECORD +0 -6
- {haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/entry_points.txt +0 -0
- {haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/licenses/LICENSE +0 -0
README.md
CHANGED
|
@@ -4,22 +4,19 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
 
 `haiku.rag` is a Retrieval-Augmented Generation (RAG) library built to work with LanceDB as a local vector database. It uses LanceDB for storing embeddings and performs semantic (vector) search as well as full-text search combined through native hybrid search with Reciprocal Rank Fusion. Both open-source (Ollama) as well as commercial (OpenAI, VoyageAI) embedding providers are supported.
 
-> **Note**: Configuration now uses YAML files instead of environment variables. If you're upgrading from an older version, run `haiku-rag init-config --from-env` to migrate your `.env` file to `haiku.rag.yaml`. See [Configuration](https://ggozad.github.io/haiku.rag/configuration/) for details.
-
 ## Features
 
 - **Local LanceDB**: No external servers required, supports also LanceDB cloud storage, S3, Google Cloud & Azure
-- **Multiple embedding providers**: Ollama, VoyageAI, OpenAI, vLLM
-- **Multiple QA providers**: Any provider/model supported by Pydantic AI
-- **Research graph (multi‑agent)**: Plan → Search → Evaluate → Synthesize with agentic AI
+- **Multiple embedding providers**: Ollama, LM Studio, VoyageAI, OpenAI, vLLM
+- **Multiple QA providers**: Any provider/model supported by Pydantic AI (Ollama, LM Studio, OpenAI, Anthropic, etc.)
 - **Native hybrid search**: Vector + full-text search with native LanceDB RRF reranking
 - **Reranking**: Default search result reranking with MixedBread AI, Cohere, Zero Entropy, or vLLM
 - **Question answering**: Built-in QA agents on your documents
+- **Research graph (multi‑agent)**: Plan → Search → Evaluate → Synthesize with agentic AI
 - **File monitoring**: Auto-index files when run as server
-- **40+ file formats**: PDF, DOCX, HTML, Markdown, code files, URLs
-- **MCP server**: Expose as tools for AI assistants
-- **A2A agent**: Conversational agent with context and multi-turn dialogue
 - **CLI & Python API**: Use from command line or Python
+- **MCP server**: Expose as tools for AI assistants
+- **Flexible document processing**: Local (docling) or remote (docling-serve) processing
 
 ## Installation
 
@@ -31,7 +28,7 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
 uv pip install haiku.rag
 ```
 
-Includes all features: document processing, all embedding providers,
+Includes all features: document processing, all embedding providers, and rerankers.
 
 ### Slim Package (Minimal Dependencies)
 
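An aside on the install hunk: the wording change only clarifies what the full package bundles. As a post-install smoke test, the sketch below uses nothing beyond the `HaikuRAG` import path and the async context-manager usage that appear verbatim in the README code further down; the `print` is illustrative only.

```python
import asyncio

from haiku.rag.client import HaikuRAG  # import path shown in the README diff


async def main() -> None:
    # The README opens the client as an async context manager over a
    # LanceDB path; entering it is enough to confirm the install works.
    async with HaikuRAG("database.lancedb") as client:
        print("haiku.rag client ready:", client)


asyncio.run(main())
```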
@@ -88,13 +85,13 @@ To customize settings, create a `haiku.rag.yaml` config file (see [Configuration
 
 ```python
 from haiku.rag.client import HaikuRAG
-from haiku.rag.
-
+from haiku.rag.config import Config
+from haiku.rag.graph.agui import stream_graph
+from haiku.rag.graph.research import (
     ResearchContext,
     ResearchDeps,
     ResearchState,
     build_research_graph,
-    stream_research_graph,
 )
 
 async with HaikuRAG("database.lancedb") as client:
@@ -115,41 +112,31 @@ async with HaikuRAG("database.lancedb") as client:
     print(answer)
 
     # Multi‑agent research pipeline (Plan → Search → Evaluate → Synthesize)
-
+    # Graph settings (provider, model, max_iterations, etc.) come from config
+    graph = build_research_graph(config=Config)
     question = (
         "What are the main drivers and trends of global temperature "
         "anomalies since 1990?"
     )
-
-
-        max_iterations=2,
-        confidence_threshold=0.8,
-        max_concurrency=2,
-    )
+    context = ResearchContext(original_question=question)
+    state = ResearchState.from_config(context=context, config=Config)
     deps = ResearchDeps(client=client)
 
     # Blocking run (final result only)
-
-
-
-
-    )
-
-
-
-
-
-        PlanNode(provider="openai", model="gpt-4o-mini"),
-        state,
-        deps,
-    ):
-        if event.type == "log":
-            iteration = event.state.iterations if event.state else state.iterations
-            print(f"[{iteration}] {event.message}")
-        elif event.type == "report":
+    report = await graph.run(state=state, deps=deps)
+    print(report.title)
+
+    # Streaming progress (AG-UI events)
+    async for event in stream_graph(graph, state, deps):
+        if event["type"] == "STEP_STARTED":
+            print(f"Starting step: {event['stepName']}")
+        elif event["type"] == "ACTIVITY_SNAPSHOT":
+            print(f"  {event['content']}")
+        elif event["type"] == "RUN_FINISHED":
             print("\nResearch complete!\n")
-
-            print(
+            result = event["result"]
+            print(result["title"])
+            print(result["executive_summary"])
 ```
 
 ## MCP Server
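Read as a whole, the added lines in the two hunks above replace the old node-level streaming API (`stream_research_graph`, `PlanNode(provider=..., model=...)`) with a config-driven graph plus AG-UI event streaming. The following consolidated sketch reassembles only the `+` lines into one runnable program; event payloads beyond the keys shown (`type`, `stepName`, `content`, `result`) are not assumed.

```python
import asyncio

from haiku.rag.client import HaikuRAG
from haiku.rag.config import Config
from haiku.rag.graph.agui import stream_graph
from haiku.rag.graph.research import (
    ResearchContext,
    ResearchDeps,
    ResearchState,
    build_research_graph,
)


async def main() -> None:
    async with HaikuRAG("database.lancedb") as client:
        # Graph settings (provider, model, max_iterations, ...) come from config.
        graph = build_research_graph(config=Config)
        context = ResearchContext(original_question="What drives temperature anomalies?")
        state = ResearchState.from_config(context=context, config=Config)
        deps = ResearchDeps(client=client)

        # Blocking run: returns the final report.
        report = await graph.run(state=state, deps=deps)
        print(report.title)

        # Streaming run: AG-UI events are dicts keyed by "type".
        async for event in stream_graph(graph, state, deps):
            if event["type"] == "RUN_FINISHED":
                print(event["result"]["executive_summary"])


asyncio.run(main())
```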
@@ -162,32 +149,13 @@ haiku-rag serve --stdio
 
 Provides tools for document management and search directly in your AI assistant.
 
-## A2A Agent
-
-Run as a conversational agent with the Agent-to-Agent protocol:
-
-```bash
-# Start the A2A server
-haiku-rag serve --a2a
-
-# Connect with the interactive client (in another terminal)
-haiku-rag a2aclient
-```
-
-The A2A agent provides:
-
-- Multi-turn dialogue with context
-- Intelligent multi-search for complex questions
-- Source citations with titles and URIs
-- Full document retrieval on request
-
 ## Examples
 
 See the [examples directory](examples/) for working examples:
 
 - **[Interactive Research Assistant](examples/ag-ui-research/)** - Full-stack research assistant with Pydantic AI and AG-UI featuring human-in-the-loop approval and real-time state synchronization
-- **[Docker Setup](examples/docker/)** - Complete Docker deployment with file monitoring
-- **[A2A
+- **[Docker Setup](examples/docker/)** - Complete Docker deployment with file monitoring and MCP server
+- **[A2A Server](examples/a2a-server/)** - Self-contained A2A protocol server package with conversational agent interface
 
 ## Documentation
 
@@ -199,7 +167,6 @@ Full documentation at: https://ggozad.github.io/haiku.rag/
 - [Python API](https://ggozad.github.io/haiku.rag/python/) - Complete API docs
 - [Agents](https://ggozad.github.io/haiku.rag/agents/) - QA agent and multi-agent research
 - [MCP Server](https://ggozad.github.io/haiku.rag/mcp/) - Model Context Protocol integration
-- [A2A Agent](https://ggozad.github.io/haiku.rag/a2a/) - Agent-to-Agent protocol support
 - [Benchmarks](https://ggozad.github.io/haiku.rag/benchmarks/) - Performance Benchmarks
 
 mcp-name: io.github.ggozad/haiku-rag

{haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/METADATA
CHANGED
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: haiku.rag
-Version: 0.14.0
+Version: 0.19.3
 Summary: Agentic Retrieval Augmented Generation (RAG) with LanceDB
 Author-email: Yiorgis Gozadinos <ggozadinos@gmail.com>
 License: MIT
@@ -17,7 +17,9 @@ Classifier: Programming Language :: Python :: 3.12
 Classifier: Programming Language :: Python :: 3.13
 Classifier: Typing :: Typed
 Requires-Python: >=3.12
-Requires-Dist: haiku-rag-slim[
+Requires-Dist: haiku-rag-slim[cohere,docling,inspector,mxbai,voyageai,zeroentropy]==0.19.3
+Provides-Extra: inspector
+Requires-Dist: textual>=1.0.0; extra == 'inspector'
 Description-Content-Type: text/markdown
 
 # Haiku RAG
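The new pin means the `haiku.rag` wheel is now a thin shell over `haiku-rag-slim` with a fixed set of extras, and `inspector` (pulling in `textual`) is a new extra. These fields can be read back from an installed environment with the standard library alone; a small sketch, nothing haiku.rag-specific assumed:

```python
from importlib.metadata import metadata

# METADATA is exposed via importlib.metadata; the fields below are the
# same ones changed in this hunk.
meta = metadata("haiku.rag")
print(meta["Version"])                 # expect: 0.19.3
print(meta.get_all("Requires-Dist"))   # haiku-rag-slim[...]==0.19.3, ...
print(meta.get_all("Provides-Extra"))  # should include "inspector"
```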
@@ -26,22 +28,19 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
 
 `haiku.rag` is a Retrieval-Augmented Generation (RAG) library built to work with LanceDB as a local vector database. It uses LanceDB for storing embeddings and performs semantic (vector) search as well as full-text search combined through native hybrid search with Reciprocal Rank Fusion. Both open-source (Ollama) as well as commercial (OpenAI, VoyageAI) embedding providers are supported.
 
-> **Note**: Configuration now uses YAML files instead of environment variables. If you're upgrading from an older version, run `haiku-rag init-config --from-env` to migrate your `.env` file to `haiku.rag.yaml`. See [Configuration](https://ggozad.github.io/haiku.rag/configuration/) for details.
-
 ## Features
 
 - **Local LanceDB**: No external servers required, supports also LanceDB cloud storage, S3, Google Cloud & Azure
-- **Multiple embedding providers**: Ollama, VoyageAI, OpenAI, vLLM
-- **Multiple QA providers**: Any provider/model supported by Pydantic AI
-- **Research graph (multi‑agent)**: Plan → Search → Evaluate → Synthesize with agentic AI
+- **Multiple embedding providers**: Ollama, LM Studio, VoyageAI, OpenAI, vLLM
+- **Multiple QA providers**: Any provider/model supported by Pydantic AI (Ollama, LM Studio, OpenAI, Anthropic, etc.)
 - **Native hybrid search**: Vector + full-text search with native LanceDB RRF reranking
 - **Reranking**: Default search result reranking with MixedBread AI, Cohere, Zero Entropy, or vLLM
 - **Question answering**: Built-in QA agents on your documents
+- **Research graph (multi‑agent)**: Plan → Search → Evaluate → Synthesize with agentic AI
 - **File monitoring**: Auto-index files when run as server
-- **40+ file formats**: PDF, DOCX, HTML, Markdown, code files, URLs
-- **MCP server**: Expose as tools for AI assistants
-- **A2A agent**: Conversational agent with context and multi-turn dialogue
 - **CLI & Python API**: Use from command line or Python
+- **MCP server**: Expose as tools for AI assistants
+- **Flexible document processing**: Local (docling) or remote (docling-serve) processing
 
 ## Installation
 
@@ -53,7 +52,7 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
 uv pip install haiku.rag
 ```
 
-Includes all features: document processing, all embedding providers,
+Includes all features: document processing, all embedding providers, and rerankers.
 
 ### Slim Package (Minimal Dependencies)
 
@@ -110,13 +109,13 @@ To customize settings, create a `haiku.rag.yaml` config file (see [Configuration
 
 ```python
 from haiku.rag.client import HaikuRAG
-from haiku.rag.
-
+from haiku.rag.config import Config
+from haiku.rag.graph.agui import stream_graph
+from haiku.rag.graph.research import (
     ResearchContext,
     ResearchDeps,
     ResearchState,
     build_research_graph,
-    stream_research_graph,
 )
 
 async with HaikuRAG("database.lancedb") as client:
@@ -137,41 +136,31 @@ async with HaikuRAG("database.lancedb") as client:
     print(answer)
 
     # Multi‑agent research pipeline (Plan → Search → Evaluate → Synthesize)
-
+    # Graph settings (provider, model, max_iterations, etc.) come from config
+    graph = build_research_graph(config=Config)
     question = (
         "What are the main drivers and trends of global temperature "
         "anomalies since 1990?"
     )
-
-
-        max_iterations=2,
-        confidence_threshold=0.8,
-        max_concurrency=2,
-    )
+    context = ResearchContext(original_question=question)
+    state = ResearchState.from_config(context=context, config=Config)
     deps = ResearchDeps(client=client)
 
     # Blocking run (final result only)
-
-
-
-
-    )
-
-
-
-
-
-        PlanNode(provider="openai", model="gpt-4o-mini"),
-        state,
-        deps,
-    ):
-        if event.type == "log":
-            iteration = event.state.iterations if event.state else state.iterations
-            print(f"[{iteration}] {event.message}")
-        elif event.type == "report":
+    report = await graph.run(state=state, deps=deps)
+    print(report.title)
+
+    # Streaming progress (AG-UI events)
+    async for event in stream_graph(graph, state, deps):
+        if event["type"] == "STEP_STARTED":
+            print(f"Starting step: {event['stepName']}")
+        elif event["type"] == "ACTIVITY_SNAPSHOT":
+            print(f"  {event['content']}")
+        elif event["type"] == "RUN_FINISHED":
             print("\nResearch complete!\n")
-
-            print(
+            result = event["result"]
+            print(result["title"])
+            print(result["executive_summary"])
 ```
 
 ## MCP Server
@@ -184,32 +173,13 @@ haiku-rag serve --stdio
 
 Provides tools for document management and search directly in your AI assistant.
 
-## A2A Agent
-
-Run as a conversational agent with the Agent-to-Agent protocol:
-
-```bash
-# Start the A2A server
-haiku-rag serve --a2a
-
-# Connect with the interactive client (in another terminal)
-haiku-rag a2aclient
-```
-
-The A2A agent provides:
-
-- Multi-turn dialogue with context
-- Intelligent multi-search for complex questions
-- Source citations with titles and URIs
-- Full document retrieval on request
-
 ## Examples
 
 See the [examples directory](examples/) for working examples:
 
 - **[Interactive Research Assistant](examples/ag-ui-research/)** - Full-stack research assistant with Pydantic AI and AG-UI featuring human-in-the-loop approval and real-time state synchronization
-- **[Docker Setup](examples/docker/)** - Complete Docker deployment with file monitoring
-- **[A2A
+- **[Docker Setup](examples/docker/)** - Complete Docker deployment with file monitoring and MCP server
+- **[A2A Server](examples/a2a-server/)** - Self-contained A2A protocol server package with conversational agent interface
 
 ## Documentation
 
@@ -221,7 +191,6 @@ Full documentation at: https://ggozad.github.io/haiku.rag/
 - [Python API](https://ggozad.github.io/haiku.rag/python/) - Complete API docs
 - [Agents](https://ggozad.github.io/haiku.rag/agents/) - QA agent and multi-agent research
 - [MCP Server](https://ggozad.github.io/haiku.rag/mcp/) - Model Context Protocol integration
-- [A2A Agent](https://ggozad.github.io/haiku.rag/a2a/) - Agent-to-Agent protocol support
 - [Benchmarks](https://ggozad.github.io/haiku.rag/benchmarks/) - Performance Benchmarks
 
 mcp-name: io.github.ggozad/haiku-rag

haiku_rag-0.19.3.dist-info/RECORD
ADDED
@@ -0,0 +1,6 @@
+README.md,sha256=3Y4bUYJ-gnWPH_zeCdHK_NZcej9JvA2JdnKoSY0eu6o,6377
+haiku_rag-0.19.3.dist-info/METADATA,sha256=BUpAqkIkKsZTQ1mGopYaSKlvlUC6cRBQtpvLfz-5h5M,7338
+haiku_rag-0.19.3.dist-info/WHEEL,sha256=WLgqFyCfm_KASv4WHyYy0P3pM_m7J5L9k2skdKLirC8,87
+haiku_rag-0.19.3.dist-info/entry_points.txt,sha256=G1U3nAkNd5YDYd4v0tuYFbriz0i-JheCsFuT9kIoGCI,48
+haiku_rag-0.19.3.dist-info/licenses/LICENSE,sha256=eXZrWjSk9PwYFNK9yUczl3oPl95Z4V9UXH7bPN46iPo,1065
+haiku_rag-0.19.3.dist-info/RECORD,,
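Each RECORD row is `path,sha256=<digest>,size`, where the digest is URL-safe base64 without padding (per the wheel spec) and the RECORD file's own row carries empty hash and size fields. A sketch of verifying rows against files on disk (the `verify` helper and its paths are illustrative):

```python
import base64
import csv
import hashlib
from pathlib import Path


def verify(record_file: str, root: str = ".") -> None:
    """Check every RECORD row against the file it describes."""
    with open(record_file, newline="") as f:
        for path, digest, size in csv.reader(f):
            if not digest:  # the RECORD entry for itself is "path,,"
                continue
            algo, _, expected = digest.partition("=")
            data = (Path(root) / path).read_bytes()
            actual = (
                base64.urlsafe_b64encode(hashlib.new(algo, data).digest())
                .rstrip(b"=")
                .decode()
            )
            assert actual == expected and len(data) == int(size), path


verify("haiku_rag-0.19.3.dist-info/RECORD")
```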

haiku_rag-0.14.0.dist-info/RECORD
REMOVED
@@ -1,6 +0,0 @@
-README.md,sha256=N8nk6cs6JkWHAmVz3ci7lTiLr6Xq_UqifWGivZnuPJU,7216
-haiku_rag-0.14.0.dist-info/METADATA,sha256=I3H0hBrGIgDwtIvFe9tnu2-PeLESLlhEHVrgmrKCnr4,8085
-haiku_rag-0.14.0.dist-info/WHEEL,sha256=qtCwoSJWgHk21S1Kb4ihdzI2rlJ1ZKaIurTj_ngOhyQ,87
-haiku_rag-0.14.0.dist-info/entry_points.txt,sha256=G1U3nAkNd5YDYd4v0tuYFbriz0i-JheCsFuT9kIoGCI,48
-haiku_rag-0.14.0.dist-info/licenses/LICENSE,sha256=eXZrWjSk9PwYFNK9yUczl3oPl95Z4V9UXH7bPN46iPo,1065
-haiku_rag-0.14.0.dist-info/RECORD,,

{haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/entry_points.txt
File without changes

{haiku_rag-0.14.0.dist-info → haiku_rag-0.19.3.dist-info}/licenses/LICENSE
File without changes