haiku.rag 0.11.4__tar.gz → 0.12.1__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.


This version of haiku.rag might be problematic; see the registry's advisory page for details.

Files changed (91)
  1. haiku_rag-0.12.1/.dockerignore +55 -0
  2. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/.gitignore +4 -0
  3. haiku_rag-0.12.1/.python-version +1 -0
  4. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/PKG-INFO +33 -10
  5. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/README.md +21 -0
  6. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/mkdocs.yml +1 -0
  7. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/pyproject.toml +13 -12
  8. haiku_rag-0.12.1/server.json +253 -0
  9. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/benchmark.py +5 -15
  10. haiku_rag-0.12.1/src/haiku/rag/a2a/__init__.py +176 -0
  11. haiku_rag-0.12.1/src/haiku/rag/a2a/client.py +268 -0
  12. haiku_rag-0.12.1/src/haiku/rag/a2a/context.py +68 -0
  13. haiku_rag-0.12.1/src/haiku/rag/a2a/models.py +21 -0
  14. haiku_rag-0.12.1/src/haiku/rag/a2a/prompts.py +59 -0
  15. haiku_rag-0.12.1/src/haiku/rag/a2a/skills.py +75 -0
  16. haiku_rag-0.12.1/src/haiku/rag/a2a/storage.py +71 -0
  17. haiku_rag-0.12.1/src/haiku/rag/a2a/worker.py +320 -0
  18. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/app.py +87 -19
  19. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/cli.py +81 -71
  20. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/client.py +47 -4
  21. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/config.py +4 -0
  22. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/base.py +8 -0
  23. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/ollama.py +8 -0
  24. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/openai.py +8 -0
  25. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/vllm.py +8 -0
  26. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/voyageai.py +8 -0
  27. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/mcp.py +99 -0
  28. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/agent.py +0 -3
  29. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/uv.lock +350 -277
  30. haiku_rag-0.11.4/.python-version +0 -1
  31. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/.pre-commit-config.yaml +0 -0
  32. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/LICENSE +0 -0
  33. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/__init__.py +0 -0
  34. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/config.py +0 -0
  35. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/datasets/__init__.py +0 -0
  36. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/datasets/repliqa.py +0 -0
  37. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/datasets/wix.py +0 -0
  38. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/llm_judge.py +0 -0
  39. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/evaluations/prompts.py +0 -0
  40. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/__init__.py +0 -0
  41. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/chunker.py +0 -0
  42. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/embeddings/__init__.py +0 -0
  43. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/__init__.py +0 -0
  44. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/base.py +0 -0
  45. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/common.py +0 -0
  46. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/models.py +0 -0
  47. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/nodes/__init__.py +0 -0
  48. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/nodes/analysis.py +0 -0
  49. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/nodes/plan.py +0 -0
  50. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/nodes/search.py +0 -0
  51. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/nodes/synthesize.py +0 -0
  52. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/graph/prompts.py +0 -0
  53. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/logging.py +0 -0
  54. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/migration.py +0 -0
  55. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/monitor.py +0 -0
  56. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/__init__.py +0 -0
  57. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/__init__.py +0 -0
  58. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/dependencies.py +0 -0
  59. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/graph.py +0 -0
  60. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/models.py +0 -0
  61. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/nodes.py +0 -0
  62. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/prompts.py +0 -0
  63. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/deep/state.py +0 -0
  64. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/qa/prompts.py +0 -0
  65. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reader.py +0 -0
  66. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reranking/__init__.py +0 -0
  67. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reranking/base.py +0 -0
  68. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reranking/cohere.py +0 -0
  69. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reranking/mxbai.py +0 -0
  70. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/reranking/vllm.py +0 -0
  71. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/__init__.py +0 -0
  72. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/common.py +0 -0
  73. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/dependencies.py +0 -0
  74. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/graph.py +0 -0
  75. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/models.py +0 -0
  76. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/prompts.py +0 -0
  77. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/state.py +0 -0
  78. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/research/stream.py +0 -0
  79. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/__init__.py +0 -0
  80. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/engine.py +0 -0
  81. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/models/__init__.py +0 -0
  82. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/models/chunk.py +0 -0
  83. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/models/document.py +0 -0
  84. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/repositories/__init__.py +0 -0
  85. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/repositories/chunk.py +0 -0
  86. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/repositories/document.py +0 -0
  87. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/repositories/settings.py +0 -0
  88. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/upgrades/__init__.py +0 -0
  89. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/upgrades/v0_10_1.py +0 -0
  90. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/store/upgrades/v0_9_3.py +0 -0
  91. {haiku_rag-0.11.4 → haiku_rag-0.12.1}/src/haiku/rag/utils.py +0 -0
@@ -0,0 +1,55 @@
1
+ # Python
2
+ __pycache__/
3
+ *.py[cod]
4
+ *$py.class
5
+ *.so
6
+ .Python
7
+ build/
8
+ develop-eggs/
9
+ dist/
10
+ downloads/
11
+ eggs/
12
+ .eggs/
13
+ lib/
14
+ lib64/
15
+ parts/
16
+ sdist/
17
+ var/
18
+ wheels/
19
+ *.egg-info/
20
+ .installed.cfg
21
+ *.egg
22
+
23
+ # Virtual environments (uv best practice)
24
+ .venv/
25
+ venv/
26
+ env/
27
+
28
+ # Data
29
+ *.lancedb/
30
+ data/
31
+ docs/
32
+
33
+ # IDE
34
+ .vscode/
35
+ .idea/
36
+ *.swp
37
+ *.swo
38
+ *~
39
+
40
+ # OS
41
+ .DS_Store
42
+ Thumbs.db
43
+
44
+ # Git
45
+ .git/
46
+ .gitignore
47
+
48
+ # Development
49
+ tests/
50
+ .pytest_cache/
51
+ .coverage
52
+ htmlcov/
53
+
54
+ # Examples
55
+ examples/
@@ -21,3 +21,7 @@ tests/data/
21
21
  TODO.md
22
22
  PLAN.md
23
23
  DEVNOTES.md
24
+
25
+ # mcp registry
26
+ .mcpregistry_github_token
27
+ .mcpregistry_registry_token
@@ -0,0 +1 @@
1
+ 3.13
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: haiku.rag
3
- Version: 0.11.4
3
+ Version: 0.12.1
4
4
  Summary: Agentic Retrieval Augmented Generation (RAG) with LanceDB
5
5
  Author-email: Yiorgis Gozadinos <ggozadinos@gmail.com>
6
6
  License: MIT
@@ -18,18 +18,20 @@ Classifier: Programming Language :: Python :: 3.11
18
18
  Classifier: Programming Language :: Python :: 3.12
19
19
  Classifier: Typing :: Typed
20
20
  Requires-Python: >=3.12
21
- Requires-Dist: docling>=2.52.0
22
- Requires-Dist: fastmcp>=2.12.3
21
+ Requires-Dist: docling>=2.56.1
22
+ Requires-Dist: fastmcp>=2.12.4
23
23
  Requires-Dist: httpx>=0.28.1
24
- Requires-Dist: lancedb>=0.25.0
25
- Requires-Dist: pydantic-ai>=1.0.8
26
- Requires-Dist: pydantic-graph>=1.0.8
27
- Requires-Dist: pydantic>=2.11.9
24
+ Requires-Dist: lancedb>=0.25.2
25
+ Requires-Dist: pydantic-ai>=1.0.18
26
+ Requires-Dist: pydantic-graph>=1.0.18
27
+ Requires-Dist: pydantic>=2.12.2
28
28
  Requires-Dist: python-dotenv>=1.1.1
29
- Requires-Dist: rich>=14.1.0
30
- Requires-Dist: tiktoken>=0.11.0
31
- Requires-Dist: typer>=0.16.1
29
+ Requires-Dist: rich>=14.2.0
30
+ Requires-Dist: tiktoken>=0.12.0
31
+ Requires-Dist: typer>=0.19.2
32
32
  Requires-Dist: watchfiles>=1.1.0
33
+ Provides-Extra: a2a
34
+ Requires-Dist: fasta2a>=0.1.0; extra == 'a2a'
33
35
  Provides-Extra: mxbai
34
36
  Requires-Dist: mxbai-rerank>=0.1.6; extra == 'mxbai'
35
37
  Provides-Extra: voyageai
@@ -56,6 +58,7 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
56
58
  - **File monitoring**: Auto-index files when run as server
57
59
  - **40+ file formats**: PDF, DOCX, HTML, Markdown, code files, URLs
58
60
  - **MCP server**: Expose as tools for AI assistants
61
+ - **A2A agent**: Conversational agent with context and multi-turn dialogue
59
62
  - **CLI & Python API**: Use from command line or Python
60
63
 
61
64
  ## Quick Start
@@ -181,6 +184,24 @@ haiku-rag serve --stdio
181
184
 
182
185
  Provides tools for document management and search directly in your AI assistant.
183
186
 
187
+ ## A2A Agent
188
+
189
+ Run as a conversational agent with the Agent-to-Agent protocol:
190
+
191
+ ```bash
192
+ # Start the A2A server
193
+ haiku-rag serve --a2a
194
+
195
+ # Connect with the interactive client (in another terminal)
196
+ haiku-rag a2aclient
197
+ ```
198
+
199
+ The A2A agent provides:
200
+ - Multi-turn dialogue with context
201
+ - Intelligent multi-search for complex questions
202
+ - Source citations with titles and URIs
203
+ - Full document retrieval on request
204
+
184
205
  ## Documentation
185
206
 
186
207
  Full documentation at: https://ggozad.github.io/haiku.rag/
@@ -190,4 +211,6 @@ Full documentation at: https://ggozad.github.io/haiku.rag/
190
211
  - [CLI](https://ggozad.github.io/haiku.rag/cli/) - Command reference
191
212
  - [Python API](https://ggozad.github.io/haiku.rag/python/) - Complete API docs
192
213
  - [Agents](https://ggozad.github.io/haiku.rag/agents/) - QA agent and multi-agent research
214
+ - [MCP Server](https://ggozad.github.io/haiku.rag/mcp/) - Model Context Protocol integration
215
+ - [A2A Agent](https://ggozad.github.io/haiku.rag/a2a/) - Agent-to-Agent protocol support
193
216
  - [Benchmarks](https://ggozad.github.io/haiku.rag/benchmarks/) - Performance Benchmarks
@@ -18,6 +18,7 @@ Retrieval-Augmented Generation (RAG) library built on LanceDB.
18
18
  - **File monitoring**: Auto-index files when run as server
19
19
  - **40+ file formats**: PDF, DOCX, HTML, Markdown, code files, URLs
20
20
  - **MCP server**: Expose as tools for AI assistants
21
+ - **A2A agent**: Conversational agent with context and multi-turn dialogue
21
22
  - **CLI & Python API**: Use from command line or Python
22
23
 
23
24
  ## Quick Start
@@ -143,6 +144,24 @@ haiku-rag serve --stdio
143
144
 
144
145
  Provides tools for document management and search directly in your AI assistant.
145
146
 
147
+ ## A2A Agent
148
+
149
+ Run as a conversational agent with the Agent-to-Agent protocol:
150
+
151
+ ```bash
152
+ # Start the A2A server
153
+ haiku-rag serve --a2a
154
+
155
+ # Connect with the interactive client (in another terminal)
156
+ haiku-rag a2aclient
157
+ ```
158
+
159
+ The A2A agent provides:
160
+ - Multi-turn dialogue with context
161
+ - Intelligent multi-search for complex questions
162
+ - Source citations with titles and URIs
163
+ - Full document retrieval on request
164
+
146
165
  ## Documentation
147
166
 
148
167
  Full documentation at: https://ggozad.github.io/haiku.rag/
@@ -152,4 +171,6 @@ Full documentation at: https://ggozad.github.io/haiku.rag/
152
171
  - [CLI](https://ggozad.github.io/haiku.rag/cli/) - Command reference
153
172
  - [Python API](https://ggozad.github.io/haiku.rag/python/) - Complete API docs
154
173
  - [Agents](https://ggozad.github.io/haiku.rag/agents/) - QA agent and multi-agent research
174
+ - [MCP Server](https://ggozad.github.io/haiku.rag/mcp/) - Model Context Protocol integration
175
+ - [A2A Agent](https://ggozad.github.io/haiku.rag/a2a/) - Agent-to-Agent protocol support
155
176
  - [Benchmarks](https://ggozad.github.io/haiku.rag/benchmarks/) - Performance Benchmarks
@@ -64,6 +64,7 @@ nav:
64
64
  - Agents: agents.md
65
65
  - Python: python.md
66
66
  - MCP: mcp.md
67
+ - A2A: a2a.md
67
68
  - Benchmarks: benchmarks.md
68
69
  markdown_extensions:
69
70
  - admonition
@@ -2,7 +2,7 @@
2
2
 
3
3
  name = "haiku.rag"
4
4
  description = "Agentic Retrieval Augmented Generation (RAG) with LanceDB"
5
- version = "0.11.4"
5
+ version = "0.12.1"
6
6
  authors = [{ name = "Yiorgis Gozadinos", email = "ggozadinos@gmail.com" }]
7
7
  license = { text = "MIT" }
8
8
  readme = { file = "README.md", content-type = "text/markdown" }
@@ -23,23 +23,24 @@ classifiers = [
23
23
  ]
24
24
 
25
25
  dependencies = [
26
- "docling>=2.52.0",
27
- "fastmcp>=2.12.3",
26
+ "docling>=2.56.1",
27
+ "fastmcp>=2.12.4",
28
28
  "httpx>=0.28.1",
29
- "lancedb>=0.25.0",
30
- "pydantic>=2.11.9",
31
- "pydantic-ai>=1.0.8",
32
- "pydantic-graph>=1.0.8",
29
+ "lancedb>=0.25.2",
30
+ "pydantic>=2.12.2",
31
+ "pydantic-ai>=1.0.18",
32
+ "pydantic-graph>=1.0.18",
33
33
  "python-dotenv>=1.1.1",
34
- "rich>=14.1.0",
35
- "tiktoken>=0.11.0",
36
- "typer>=0.16.1",
34
+ "rich>=14.2.0",
35
+ "tiktoken>=0.12.0",
36
+ "typer>=0.19.2",
37
37
  "watchfiles>=1.1.0",
38
38
  ]
39
39
 
40
40
  [project.optional-dependencies]
41
41
  voyageai = ["voyageai>=0.3.5"]
42
42
  mxbai = ["mxbai-rerank>=0.1.6"]
43
+ a2a = ["fasta2a>=0.1.0"]
43
44
 
44
45
  [project.scripts]
45
46
  haiku-rag = "haiku.rag.cli:cli"
@@ -49,7 +50,7 @@ requires = ["hatchling"]
49
50
  build-backend = "hatchling.build"
50
51
 
51
52
  [tool.hatch.build]
52
- exclude = ["/docs", "/tests", "/.github"]
53
+ exclude = ["/docs", "/examples", "/tests", "/docker", "/.github"]
53
54
 
54
55
  [tool.hatch.build.targets.wheel]
55
56
  packages = ["src/haiku"]
@@ -62,7 +63,7 @@ dev = [
62
63
  "mkdocs-material>=9.6.14",
63
64
  "pydantic-evals>=1.0.8",
64
65
  "pre-commit>=4.2.0",
65
- "pyright>=1.1.405",
66
+ "pyright>=1.1.406",
66
67
  "pytest>=8.4.2",
67
68
  "pytest-asyncio>=1.2.0",
68
69
  "pytest-cov>=7.0.0",
@@ -0,0 +1,253 @@
1
+ {
2
+ "$schema": "https://static.modelcontextprotocol.io/schemas/2025-09-29/server.schema.json",
3
+ "name": "io.github.ggozad/haiku-rag",
4
+ "version": "{{VERSION}}",
5
+ "description": "Agentic Retrieval Augmented Generation (RAG) with LanceDB",
6
+ "repository": {
7
+ "url": "https://github.com/ggozad/haiku.rag",
8
+ "source": "github"
9
+ },
10
+ "homepage": "https://github.com/ggozad/haiku.rag",
11
+ "license": "MIT",
12
+ "keywords": ["rag", "lancedb", "vector-database", "embeddings", "search", "qa", "research"],
13
+ "vendor": {
14
+ "name": "Yiorgis Gozadinos",
15
+ "url": "https://github.com/ggozad"
16
+ },
17
+ "deployment": {
18
+ "packages": [
19
+ {
20
+ "type": "pypi",
21
+ "package": "haiku.rag",
22
+ "command": {
23
+ "linux-x86_64": {
24
+ "shell": "uvx",
25
+ "args": ["haiku.rag", "serve", "--stdio"]
26
+ },
27
+ "darwin-arm64": {
28
+ "shell": "uvx",
29
+ "args": ["haiku.rag", "serve", "--stdio"]
30
+ },
31
+ "darwin-x86_64": {
32
+ "shell": "uvx",
33
+ "args": ["haiku.rag", "serve", "--stdio"]
34
+ },
35
+ "win32-x86_64": {
36
+ "shell": "uvx.exe",
37
+ "args": ["haiku.rag", "serve", "--stdio"]
38
+ }
39
+ },
40
+ "environmentVariables": [
41
+ {
42
+ "name": "ENV",
43
+ "description": "Runtime environment (production or development)",
44
+ "format": "string",
45
+ "isRequired": false,
46
+ "isSecret": false
47
+ },
48
+ {
49
+ "name": "DEFAULT_DATA_DIR",
50
+ "description": "Default directory for LanceDB data and assets",
51
+ "format": "string",
52
+ "isRequired": false,
53
+ "isSecret": false
54
+ },
55
+ {
56
+ "name": "MONITOR_DIRECTORIES",
57
+ "description": "Comma-separated paths to watch for file changes in server mode",
58
+ "format": "string",
59
+ "isRequired": false,
60
+ "isSecret": false
61
+ },
62
+ {
63
+ "name": "LANCEDB_URI",
64
+ "description": "LanceDB connection URI (use db:// for cloud or a filesystem path)",
65
+ "format": "string",
66
+ "isRequired": false,
67
+ "isSecret": false
68
+ },
69
+ {
70
+ "name": "LANCEDB_REGION",
71
+ "description": "LanceDB cloud region (if using cloud)",
72
+ "format": "string",
73
+ "isRequired": false,
74
+ "isSecret": false
75
+ },
76
+ {
77
+ "name": "LANCEDB_API_KEY",
78
+ "description": "LanceDB API key (required for LanceDB Cloud)",
79
+ "format": "string",
80
+ "isRequired": false,
81
+ "isSecret": true
82
+ },
83
+ {
84
+ "name": "EMBEDDINGS_PROVIDER",
85
+ "description": "Embeddings provider (e.g. ollama, openai, voyageai)",
86
+ "format": "string",
87
+ "isRequired": false,
88
+ "isSecret": false
89
+ },
90
+ {
91
+ "name": "EMBEDDINGS_MODEL",
92
+ "description": "Embeddings model name (provider-specific)",
93
+ "format": "string",
94
+ "isRequired": false,
95
+ "isSecret": false
96
+ },
97
+ {
98
+ "name": "EMBEDDINGS_VECTOR_DIM",
99
+ "description": "Embedding vector dimension (must match model)",
100
+ "format": "number",
101
+ "isRequired": false,
102
+ "isSecret": false
103
+ },
104
+ {
105
+ "name": "QA_PROVIDER",
106
+ "description": "Question answering provider (e.g. ollama, openai, anthropic)",
107
+ "format": "string",
108
+ "isRequired": false,
109
+ "isSecret": false
110
+ },
111
+ {
112
+ "name": "QA_MODEL",
113
+ "description": "Question answering model name (provider-specific)",
114
+ "format": "string",
115
+ "isRequired": false,
116
+ "isSecret": false
117
+ },
118
+ {
119
+ "name": "RESEARCH_PROVIDER",
120
+ "description": "Research provider for multi-agent research (e.g. ollama, openai, anthropic)",
121
+ "format": "string",
122
+ "isRequired": false,
123
+ "isSecret": false
124
+ },
125
+ {
126
+ "name": "RESEARCH_MODEL",
127
+ "description": "Research model name for multi-agent research (provider-specific)",
128
+ "format": "string",
129
+ "isRequired": false,
130
+ "isSecret": false
131
+ },
132
+ {
133
+ "name": "RERANK_PROVIDER",
134
+ "description": "Rerank provider (e.g. mixedbread, cohere)",
135
+ "format": "string",
136
+ "isRequired": false,
137
+ "isSecret": false
138
+ },
139
+ {
140
+ "name": "RERANK_MODEL",
141
+ "description": "Rerank model name (provider-specific)",
142
+ "format": "string",
143
+ "isRequired": false,
144
+ "isSecret": false
145
+ },
146
+ {
147
+ "name": "CHUNK_SIZE",
148
+ "description": "Chunk size for splitting documents (characters)",
149
+ "format": "number",
150
+ "isRequired": false,
151
+ "isSecret": false
152
+ },
153
+ {
154
+ "name": "CONTEXT_CHUNK_RADIUS",
155
+ "description": "Number of adjacent chunks to include around search hits",
156
+ "format": "number",
157
+ "isRequired": false,
158
+ "isSecret": false
159
+ },
160
+ {
161
+ "name": "OLLAMA_BASE_URL",
162
+ "description": "Base URL for Ollama server",
163
+ "format": "string",
164
+ "isRequired": false,
165
+ "isSecret": false
166
+ },
167
+ {
168
+ "name": "VLLM_EMBEDDINGS_BASE_URL",
169
+ "description": "Base URL for vLLM embeddings endpoint",
170
+ "format": "string",
171
+ "isRequired": false,
172
+ "isSecret": false
173
+ },
174
+ {
175
+ "name": "VLLM_RERANK_BASE_URL",
176
+ "description": "Base URL for vLLM rerank endpoint",
177
+ "format": "string",
178
+ "isRequired": false,
179
+ "isSecret": false
180
+ },
181
+ {
182
+ "name": "VLLM_QA_BASE_URL",
183
+ "description": "Base URL for vLLM QA endpoint",
184
+ "format": "string",
185
+ "isRequired": false,
186
+ "isSecret": false
187
+ },
188
+ {
189
+ "name": "VLLM_RESEARCH_BASE_URL",
190
+ "description": "Base URL for vLLM research endpoint",
191
+ "format": "string",
192
+ "isRequired": false,
193
+ "isSecret": false
194
+ },
195
+ {
196
+ "name": "MARKDOWN_PREPROCESSOR",
197
+ "description": "Dotted path or file path to a callable that preprocesses markdown content before chunking",
198
+ "format": "string",
199
+ "isRequired": false,
200
+ "isSecret": false
201
+ },
202
+ {
203
+ "name": "DISABLE_DB_AUTOCREATE",
204
+ "description": "If true, refuse to auto-create a new LanceDB database or tables",
205
+ "format": "boolean",
206
+ "isRequired": false,
207
+ "isSecret": false
208
+ },
209
+ {
210
+ "name": "VACUUM_RETENTION_SECONDS",
211
+ "description": "Vacuum retention threshold in seconds (default: 60)",
212
+ "format": "number",
213
+ "isRequired": false,
214
+ "isSecret": false
215
+ },
216
+ {
217
+ "name": "OPENAI_API_KEY",
218
+ "description": "OpenAI API key (if using OpenAI for embeddings or QA)",
219
+ "format": "string",
220
+ "isRequired": false,
221
+ "isSecret": true
222
+ },
223
+ {
224
+ "name": "VOYAGE_API_KEY",
225
+ "description": "VoyageAI API key (if using VoyageAI for embeddings)",
226
+ "format": "string",
227
+ "isRequired": false,
228
+ "isSecret": true
229
+ },
230
+ {
231
+ "name": "ANTHROPIC_API_KEY",
232
+ "description": "Anthropic API key (if using Anthropic for QA)",
233
+ "format": "string",
234
+ "isRequired": false,
235
+ "isSecret": true
236
+ },
237
+ {
238
+ "name": "COHERE_API_KEY",
239
+ "description": "Cohere API key (if using Cohere for reranking)",
240
+ "format": "string",
241
+ "isRequired": false,
242
+ "isSecret": true
243
+ }
244
+ ]
245
+ }
246
+ ]
247
+ },
248
+ "transports": [
249
+ {
250
+ "type": "stdio"
251
+ }
252
+ ]
253
+ }
@@ -212,8 +212,6 @@ async def run_qa_benchmark(
212
212
  return await qa.answer(question)
213
213
 
214
214
  for case in evaluation_dataset.cases:
215
- progress.console.print(f"\n[bold]Evaluating case:[/bold] {case.name}")
216
-
217
215
  single_case_dataset = EvalDataset[str, str, dict[str, str]](
218
216
  cases=[case],
219
217
  evaluators=evaluation_dataset.evaluators,
@@ -232,32 +230,24 @@ async def run_qa_benchmark(
232
230
  result_case = report.cases[0]
233
231
 
234
232
  equivalence = result_case.assertions.get("answer_equivalent")
235
- progress.console.print(f"Question: {result_case.inputs}")
236
- progress.console.print(f"Expected: {result_case.expected_output}")
237
- progress.console.print(f"Generated: {result_case.output}")
238
233
  if equivalence is not None:
239
- progress.console.print(
240
- f"Equivalent: {equivalence.value}"
241
- + (f" — {equivalence.reason}" if equivalence.reason else "")
242
- )
243
234
  if equivalence.value:
244
235
  passing_cases += 1
245
236
 
246
- progress.console.print("")
247
-
248
237
  if report.failures:
249
238
  failures.extend(report.failures)
250
239
  failure = report.failures[0]
251
240
  progress.console.print(
252
241
  "[red]Failure encountered during case evaluation:[/red]"
253
242
  )
254
- progress.console.print(f"Question: {failure.inputs}")
255
243
  progress.console.print(f"Error: {failure.error_message}")
256
244
  progress.console.print("")
257
245
 
258
- progress.console.print(
259
- f"[green]Accuracy: {(passing_cases / total_processed):.4f} "
260
- f"{passing_cases}/{total_processed}[/green]"
246
+ progress.update(
247
+ qa_task,
248
+ description="[yellow]Evaluating QA cases...[/yellow] "
249
+ f"[green]Accuracy: {(passing_cases / total_processed):.2f} "
250
+ f"{passing_cases}/{total_processed}[/green]",
261
251
  )
262
252
  progress.advance(qa_task)
263
253