PyPI - agent-harnesses-mcp - Versions diffs - 0.1.0__tar.gz - Mend

agent-harnesses-mcp 0.1.0__tar.gz


Files changed (5)

agent_harnesses_mcp-0.1.0/.gitignore +103 -0
agent_harnesses_mcp-0.1.0/PKG-INFO +79 -0
agent_harnesses_mcp-0.1.0/README.md +66 -0
agent_harnesses_mcp-0.1.0/pyproject.toml +24 -0
agent_harnesses_mcp-0.1.0/server.py +227 -0

agent_harnesses_mcp-0.1.0/.gitignore ADDED

@@ -0,0 +1,103 @@
+# IntelliJ
+target/
+.idea/
+*.iml
+# Sublime
+*.sublime-workspace
+# Eclipse
+.settings
+# VS Code
+.project
+.classpath
+.vscode/*
+# Ignore all local history of files
+**/.history
+# Java
+*.class
+target/
+# C
+*.so
+# Python
+*.pyc
+*.egg-info
+__pycache__
+.ipynb_checkpoints
+.Python
+dist/
+.python-version
+.installed.cfg
+*.egg
+reqlib-metadata
+.mypy_cache/
+.venv
+venv/
+build/
+# Byte-compiled / optimized / DLL files
+*.pyc
+__pycache__/
+*.py[cod]
+*$py.class
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*,cover
+.hypothesis/
+.pytest_cache/
+# NPM / Node / JavaScript
+.npm
+node_modules/
+jspm_packages/
+# Runtime data
+pids
+*.pid
+*.seed
+*.pid.lock
+# Logs
+logs
+*.log
+npm-debug.log*
+yarn-debug.log*
+yarn-error.log*
+lerna-debug.log*
+# vim temporary files
+*~
+.*.sw?
+# Other Artifacts
+hs_err_pid*
+*.log
+*.swp
+*.swo
+temp/*
+.DS_Store
+# Local / editor (do not publish)
+.cursor/*
+!.cursor/rules/
+.cursor/rules/*
+!.cursor/rules/git-author.mdc
+claude.md
+PLAN.md
+history/
+.env
+.env.*
+!.env.example

agent_harnesses_mcp-0.1.0/PKG-INFO ADDED

@@ -0,0 +1,79 @@
+Metadata-Version: 2.4
+Name: agent-harnesses-mcp
+Version: 0.1.0
+Summary: MCP server for best-of-Agent-Harnesses: harness recommendations, search, and head-to-head decision guides over a hand-curated, weekly-rescored list of 110 agent harnesses
+Project-URL: Repository, https://github.com/RyanAlberts/best-of-Agent-Harnesses
+Project-URL: Documentation, https://github.com/RyanAlberts/best-of-Agent-Harnesses/tree/main/mcp
+Author: Ryan Alberts
+License-Expression: MIT
+Keywords: agent-harness,agents,llm,mcp,model-context-protocol
+Requires-Python: >=3.10
+Requires-Dist: mcp>=1.2
+Description-Content-Type: text/markdown
+# agent-harnesses MCP server
+The [best-of-Agent-Harnesses](https://github.com/RyanAlberts/best-of-Agent-Harnesses) list as an MCP server, so agents can recommend harnesses instead of you reading 101 table rows.
+Single file, stdio transport, no clone needed — it fetches [harnesses.json](../harnesses.json) from this repo at startup (or reads it locally from a checkout). Requires [uv](https://docs.astral.sh/uv/).
+## Install
+Claude Code:
+```sh
+claude mcp add agent-harnesses -- uv run https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/mcp/server.py
+```
+Any other MCP client (Cursor, Codex, Gemini CLI, ...):
+```json
+{
+  "mcpServers": {
+    "agent-harnesses": {
+      "command": "uv",
+      "args": ["run", "https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/mcp/server.py"]
+    }
+  }
+}
+```
+## Tools
+| Tool | What it does |
+|---|---|
+| `pick_harness(use_case, max_complexity?, min_autonomy?, min_recovery?, open_source_only?, limit?)` | Ranked recommendations for a use case, seeded by the list's hand-curated use-case index. `max_complexity` caps adoption surface (`super simple` → `complex`); `min_autonomy` requires a designed autonomy regime (`step-gated` → `headless`); `min_recovery` requires a failure-recovery tier (`none` → `durable`). |
+| `search_harnesses(query, limit?)` | Keyword search across names, descriptions, tags, and categories. |
+| `get_harness(github_id)` | Full record for one project. |
+| `list_comparisons()` | The head-to-head decision guides (OpenClaw vs Hermes, terminal coding agents, …) with summaries. |
+| `get_comparison(slug)` | Full markdown of one guide — architecture trade-offs, field reports, billing reality. Always current: served from the repo's `main`. |
+| `list_categories()` | The 10 categories, use-case intents, and the complexity/autonomy/recovery scales. |
+Example: *"pick_harness('sandboxed code execution for generated code', max_complexity='slightly complex', open_source_only=True)"* → E2B, smolagents, Daytona... each with stars, tier, license signal, and a one-line reason.
+Data is regenerated by [`scripts/generate.py`](../scripts/generate.py); star counts carry a `stars_captured` date, and the comparisons index is rebuilt from `comparisons/*.md` on every refresh — the server always serves current `main`.
+## Distribution
+The server is packaged as **`agent-harnesses-mcp`** (this directory's `pyproject.toml`) and registered in the official MCP registry as **`io.github.ryanalberts/agent-harnesses`** (`server.json` at the repo root). Once a release is published, package-manager installs work everywhere:
+```sh
+# any MCP client, via PyPI
+uvx agent-harnesses-mcp
+# Claude Code
+claude mcp add agent-harnesses -- uvx agent-harnesses-mcp
+```
+Until then (and forever, as the zero-install path), the raw-URL one-liner at the top of this README works from any machine with uv.
+## Publishing (maintainer runbook)
+Releases are automated by [`.github/workflows/publish-mcp.yml`](../.github/workflows/publish-mcp.yml) on a `mcp-v*` tag: it builds the wheel, publishes to PyPI via trusted publishing, and publishes `server.json` to the official MCP registry via GitHub OIDC.
+One-time setup, then never again:
+1. On pypi.org: create the project name `agent-harnesses-mcp` → Settings → Publishing → add a **trusted publisher**: owner `RyanAlberts`, repo `best-of-Agent-Harnesses`, workflow `publish-mcp.yml`. No API tokens.
+2. Nothing for the MCP registry — GitHub OIDC from this repo authorizes the `io.github.ryanalberts/*` namespace automatically.
+Per release: bump the version in `mcp/pyproject.toml` **and** `server.json` (the workflow fails loudly on mismatch), then `git tag mcp-v<version> && git push origin mcp-v<version>`.
+Also indexed via [`smithery.yaml`](../smithery.yaml) (submit the repo once at [smithery.ai](https://smithery.ai)); other directories (Glama, PulseMCP, mcpservers.org) crawl the official registry.
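
The header block at the top of PKG-INFO uses the email-style `Key: value` core metadata layout, so the standard library can read it. A minimal sketch, assuming the headers are separated from the README body by a blank line in the packaged file (blank lines are not visible in this diff view); the sample string is a shortened copy of the fields above, and `read_metadata` is a hypothetical helper name:

```python
from email.parser import Parser

# Shortened copy of the PKG-INFO headers shown above. The blank line before
# the body is an assumption about the on-disk file, not visible in the diff.
PKG_INFO = """\
Metadata-Version: 2.4
Name: agent-harnesses-mcp
Version: 0.1.0
Requires-Python: >=3.10
Requires-Dist: mcp>=1.2
Description-Content-Type: text/markdown

# agent-harnesses MCP server
"""


def read_metadata(text: str) -> dict:
    """Return selected core-metadata fields from a PKG-INFO string."""
    msg = Parser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist may repeat, so collect every occurrence.
        "requires_dist": msg.get_all("Requires-Dist", []),
        # Everything after the first blank line is the long description.
        "description": msg.get_payload(),
    }


meta = read_metadata(PKG_INFO)
print(meta["name"], meta["version"], meta["requires_dist"])
```

The same call works on an installed distribution's `METADATA` file, which shares this layout.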

agent_harnesses_mcp-0.1.0/README.md ADDED

@@ -0,0 +1,66 @@
+# agent-harnesses MCP server
+The [best-of-Agent-Harnesses](https://github.com/RyanAlberts/best-of-Agent-Harnesses) list as an MCP server, so agents can recommend harnesses instead of you reading 101 table rows.
+Single file, stdio transport, no clone needed — it fetches [harnesses.json](../harnesses.json) from this repo at startup (or reads it locally from a checkout). Requires [uv](https://docs.astral.sh/uv/).
+## Install
+Claude Code:
+```sh
+claude mcp add agent-harnesses -- uv run https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/mcp/server.py
+```
+Any other MCP client (Cursor, Codex, Gemini CLI, ...):
+```json
+{
+  "mcpServers": {
+    "agent-harnesses": {
+      "command": "uv",
+      "args": ["run", "https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/mcp/server.py"]
+    }
+  }
+}
+```
+## Tools
+| Tool | What it does |
+|---|---|
+| `pick_harness(use_case, max_complexity?, min_autonomy?, min_recovery?, open_source_only?, limit?)` | Ranked recommendations for a use case, seeded by the list's hand-curated use-case index. `max_complexity` caps adoption surface (`super simple` → `complex`); `min_autonomy` requires a designed autonomy regime (`step-gated` → `headless`); `min_recovery` requires a failure-recovery tier (`none` → `durable`). |
+| `search_harnesses(query, limit?)` | Keyword search across names, descriptions, tags, and categories. |
+| `get_harness(github_id)` | Full record for one project. |
+| `list_comparisons()` | The head-to-head decision guides (OpenClaw vs Hermes, terminal coding agents, …) with summaries. |
+| `get_comparison(slug)` | Full markdown of one guide — architecture trade-offs, field reports, billing reality. Always current: served from the repo's `main`. |
+| `list_categories()` | The 10 categories, use-case intents, and the complexity/autonomy/recovery scales. |
+Example: *"pick_harness('sandboxed code execution for generated code', max_complexity='slightly complex', open_source_only=True)"* → E2B, smolagents, Daytona... each with stars, tier, license signal, and a one-line reason.
+Data is regenerated by [`scripts/generate.py`](../scripts/generate.py); star counts carry a `stars_captured` date, and the comparisons index is rebuilt from `comparisons/*.md` on every refresh — the server always serves current `main`.
+## Distribution
+The server is packaged as **`agent-harnesses-mcp`** (this directory's `pyproject.toml`) and registered in the official MCP registry as **`io.github.ryanalberts/agent-harnesses`** (`server.json` at the repo root). Once a release is published, package-manager installs work everywhere:
+```sh
+# any MCP client, via PyPI
+uvx agent-harnesses-mcp
+# Claude Code
+claude mcp add agent-harnesses -- uvx agent-harnesses-mcp
+```
+Until then (and forever, as the zero-install path), the raw-URL one-liner at the top of this README works from any machine with uv.
+## Publishing (maintainer runbook)
+Releases are automated by [`.github/workflows/publish-mcp.yml`](../.github/workflows/publish-mcp.yml) on a `mcp-v*` tag: it builds the wheel, publishes to PyPI via trusted publishing, and publishes `server.json` to the official MCP registry via GitHub OIDC.
+One-time setup, then never again:
+1. On pypi.org: create the project name `agent-harnesses-mcp` → Settings → Publishing → add a **trusted publisher**: owner `RyanAlberts`, repo `best-of-Agent-Harnesses`, workflow `publish-mcp.yml`. No API tokens.
+2. Nothing for the MCP registry — GitHub OIDC from this repo authorizes the `io.github.ryanalberts/*` namespace automatically.
+Per release: bump the version in `mcp/pyproject.toml` **and** `server.json` (the workflow fails loudly on mismatch), then `git tag mcp-v<version> && git push origin mcp-v<version>`.
+Also indexed via [`smithery.yaml`](../smithery.yaml) (submit the repo once at [smithery.ai](https://smithery.ai)); other directories (Glama, PulseMCP, mcpservers.org) crawl the official registry.

agent_harnesses_mcp-0.1.0/pyproject.toml ADDED

@@ -0,0 +1,24 @@
+[build-system]
+requires = ["hatchling"]
+build-backend = "hatchling.build"
+[project]
+name = "agent-harnesses-mcp"
+version = "0.1.0"
+description = "MCP server for best-of-Agent-Harnesses: harness recommendations, search, and head-to-head decision guides over a hand-curated, weekly-rescored list of 110 agent harnesses"
+readme = "README.md"
+requires-python = ">=3.10"
+license = "MIT"
+authors = [{ name = "Ryan Alberts" }]
+keywords = ["mcp", "model-context-protocol", "agents", "agent-harness", "llm"]
+dependencies = ["mcp>=1.2"]
+[project.scripts]
+agent-harnesses-mcp = "agent_harnesses_mcp:main"
+[project.urls]
+Repository = "https://github.com/RyanAlberts/best-of-Agent-Harnesses"
+Documentation = "https://github.com/RyanAlberts/best-of-Agent-Harnesses/tree/main/mcp"
+[tool.hatch.build.targets.wheel.force-include]
+"server.py" = "agent_harnesses_mcp/__init__.py"
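
The `[project.scripts]` entry maps the `agent-harnesses-mcp` command to `agent_harnesses_mcp:main`, and the `force-include` table is what makes that module exist: `server.py` is copied into the wheel as `agent_harnesses_mcp/__init__.py`. Below is a rough sketch of how a `module:attribute` reference of that shape can be resolved. `resolve_entry_point` is a hypothetical helper for illustration, not the code installers actually run, and the demo uses a standard-library target so it works without the package installed.

```python
import importlib
from typing import Any


def resolve_entry_point(ref: str) -> Any:
    """Resolve a 'module:attribute' reference to the object it names."""
    module_name, _, attr_path = ref.partition(":")
    if not module_name or not attr_path:
        raise ValueError(f"expected 'module:attribute', got {ref!r}")
    obj: Any = importlib.import_module(module_name)
    # Dotted attribute paths (e.g. "Class.method") are walked one part at a time.
    for part in attr_path.split("."):
        obj = getattr(obj, part)
    return obj


# Stand-in target from the standard library; with the wheel installed,
# "agent_harnesses_mcp:main" would resolve the same way.
dumps = resolve_entry_point("json:dumps")
print(dumps({"ok": True}))
```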

agent_harnesses_mcp-0.1.0/server.py ADDED

@@ -0,0 +1,227 @@
+#!/usr/bin/env python3
+# /// script
+# requires-python = ">=3.10"
+# dependencies = ["mcp>=1.2"]
+# ///
+"""MCP server for best-of-Agent-Harnesses.
+Serves the curated list (harnesses.json) as tools so agents can recommend
+agent harnesses: pick_harness, search_harnesses, get_harness, list_comparisons,
+get_comparison, list_categories.
+Run directly from GitHub (no clone needed):
+    uv run https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/mcp/server.py
+"""
+import json
+import re
+import urllib.request
+from pathlib import Path
+from mcp.server.fastmcp import FastMCP
+DATA_URL = "https://raw.githubusercontent.com/RyanAlberts/best-of-Agent-Harnesses/main/harnesses.json"
+mcp = FastMCP("agent-harnesses")
+_data: dict | None = None
+def data() -> dict:
+    global _data
+    if _data is None:
+        local = Path(__file__).resolve().parent.parent / "harnesses.json"
+        if local.exists():
+            # Read as UTF-8 explicitly so the result does not depend on the platform locale.
+            _data = json.loads(local.read_text(encoding="utf-8"))
+        else:
+            with urllib.request.urlopen(DATA_URL, timeout=15) as r:
+                _data = json.loads(r.read().decode())
+    return _data
+_STOP = {"i", "a", "an", "the", "to", "for", "of", "in", "on", "with", "and",
+         "or", "my", "me", "want", "need", "agent", "agents", "ai", "llm"}
+def _tokens(text: str) -> set:
+    return {w for w in re.findall(r"[a-z0-9+#-]+", text.lower()) if w not in _STOP}
+def _overlap(q: set, hay: set) -> set:
+    """Query tokens with a match in hay, tolerating inflections: tokens of 4+
+    chars match if either is a prefix of the other (benchmark/benchmarks,
+    evaluate/evaluates)."""
+    hits = set()
+    for w in q:
+        for h in hay:
+            if w == h or (len(w) >= 4 and len(h) >= 4 and (w.startswith(h) or h.startswith(w))):
+                hits.add(w)
+                break
+    return hits
+def _brief(p: dict, reason: str = "") -> dict:
+    out = {
+        "name": p["name"],
+        "github_id": p["github_id"],
+        "url": p["url"],
+        "stars": p["stars"],
+        "tier": p["tier"],
+        "autonomy": p.get("autonomy", "n/a"),
+        "recovery": p.get("recovery", "n/a"),
+        "license_signal": p["license_signal"],
+        "category": p["category_title"],
+        "description": p["description"],
+        "tags": p["tags"],
+    }
+    if reason:
+        out["why"] = reason
+    return out
+@mcp.tool()
+def pick_harness(use_case: str, max_complexity: str = "complex",
+                 min_autonomy: str = "", min_recovery: str = "",
+                 open_source_only: bool = False, limit: int = 5) -> str:
+    """Recommend agent harnesses for a use case, ranked from a hand-curated list of 101.
+    use_case: what you want to do, e.g. "terminal coding agent", "drop-in memory
+    layer", "evaluate agents on coding benchmarks".
+    max_complexity: cap on adoption surface — one of "super simple",
+    "mostly simple", "slightly complex", "complex" (default: no cap).
+    min_autonomy: require at least this designed autonomy regime — one of
+    "step-gated", "checkpoint-gated", "bounded", "headless" (e.g. "bounded"
+    means "must be able to run a whole task unattended"; excludes n/a entries).
+    min_recovery: require at least this failure-recovery tier — one of "none",
+    "retry", "resumable", "durable" (excludes n/a entries).
+    open_source_only: drop projects with restricted or unknown licenses.
+    Returns JSON: ranked picks with a one-line reason each.
+    """
+    d = data()
+    tiers: list = d["meta"]["tiers"]
+    max_rank = tiers.index(max_complexity) + 1 if max_complexity in tiers else 4
+    a_tiers: list = d["meta"].get("autonomy_tiers", [])
+    r_tiers: list = d["meta"].get("recovery_tiers", [])
+    min_a = a_tiers.index(min_autonomy) + 1 if min_autonomy in a_tiers else 0
+    min_r = r_tiers.index(min_recovery) + 1 if min_recovery in r_tiers else 0
+    q = _tokens(use_case)
+    # Curated use-case intents are the strongest signal: best word-overlap intent
+    # seeds its hand-picked projects to the top, in curated order.
+    seeded: dict = {}
+    best = max(d["use_cases"], key=lambda u: len(_overlap(q, _tokens(u["intent"]))), default=None)
+    if best and len(_overlap(q, _tokens(best["intent"]))) >= 2:
+        for rank, gid in enumerate(best["picks"]):
+            seeded[gid] = (100 - rank, f"curated pick for \"{best['intent']}\"")
+    import math
+    scored = []
+    for p in d["projects"]:
+        if p["tier_rank"] > max_rank:
+            continue
+        if min_a and p.get("autonomy_rank", 0) < min_a:
+            continue
+        if min_r and p.get("recovery_rank", 0) < min_r:
+            continue
+        if open_source_only and p["license_signal"] != "open-source":
+            continue
+        if p["github_id"] in seeded:
+            score, reason = seeded[p["github_id"]]
+        else:
+            overlap = _overlap(q, _tokens(f"{p['description']} {' '.join(p['tags'])} {p['category_title']}"))
+            if not overlap:
+                continue
+            score = len(overlap) * 3 + math.log10(max(p["stars"], 2))
+            reason = "matches: " + ", ".join(sorted(overlap))
+        scored.append((score, p, reason))
+    scored.sort(key=lambda t: -t[0])
+    picks = [_brief(p, reason) for _, p, reason in scored[:limit]]
+    return json.dumps({
+        "use_case": use_case,
+        "picks": picks,
+        "source": d["meta"]["url"],
+        "stars_captured": d["meta"]["stars_captured"],
+    }, indent=2, ensure_ascii=False)
+@mcp.tool()
+def search_harnesses(query: str, limit: int = 10) -> str:
+    """Keyword search across all 101 projects (name, description, tags, category).
+    Returns JSON: matching projects sorted by relevance then stars.
+    """
+    d = data()
+    q = _tokens(query)
+    ql = query.lower()
+    import math
+    scored = []
+    for p in d["projects"]:
+        hay = f"{p['name']} {p['github_id']} {p['description']} {' '.join(p['tags'])} {p['category_title']}"
+        name_hit = 50 if ql in p["name"].lower() or ql in p["github_id"].lower() else 0
+        overlap = _overlap(q, _tokens(hay))
+        if not (name_hit or overlap):
+            continue
+        scored.append((name_hit + len(overlap) * 3 + math.log10(max(p["stars"], 2)), p))
+    scored.sort(key=lambda t: -t[0])
+    return json.dumps({"query": query, "results": [_brief(p) for _, p in scored[:limit]]},
+                      indent=2, ensure_ascii=False)
+@mcp.tool()
+def get_harness(github_id: str) -> str:
+    """Full record for one project by github_id (e.g. "anomalyco/opencode")."""
+    for p in data()["projects"]:
+        if p["github_id"].lower() == github_id.lower():
+            return json.dumps(p, indent=2, ensure_ascii=False)
+    return json.dumps({"error": f"unknown github_id: {github_id}",
+                       "hint": "use search_harnesses to find the right id"})
+@mcp.tool()
+def list_comparisons() -> str:
+    """The list's head-to-head decision guides (e.g. "OpenClaw vs Hermes",
+    "How to pick a harness") — slug, title, and summary for each. Fetch the
+    full text of one with get_comparison(slug)."""
+    return json.dumps({"comparisons": data().get("comparisons", [])},
+                      indent=2, ensure_ascii=False)
+@mcp.tool()
+def get_comparison(slug: str) -> str:
+    """Full markdown of one decision guide by slug (see list_comparisons).
+    Guides cover architecture trade-offs, field reports, and the post-June-2026
+    billing reality — use them when a user is choosing between specific
+    harnesses, not just browsing."""
+    for c in data().get("comparisons", []):
+        if c["slug"] == slug:
+            local = Path(__file__).resolve().parent.parent / "comparisons" / f"{slug}.md"
+            if local.exists():
+                # Read as UTF-8 explicitly so the result does not depend on the platform locale.
+                return local.read_text(encoding="utf-8")
+            with urllib.request.urlopen(c["raw_url"], timeout=15) as r:
+                return r.read().decode()
+    return json.dumps({"error": f"unknown slug: {slug}",
+                       "available": [c["slug"] for c in data().get("comparisons", [])]})
+@mcp.tool()
+def list_categories() -> str:
+    """The list's 9 categories and 13 curated use-case intents, with project counts."""
+    d = data()
+    counts: dict = {}
+    for p in d["projects"]:
+        counts[p["category"]] = counts.get(p["category"], 0) + 1
+    return json.dumps({
+        "categories": [dict(c, project_count=counts.get(c["id"], 0)) for c in d["categories"]],
+        "use_cases": [u["intent"] for u in d["use_cases"]],
+        "tiers": d["meta"]["tiers"],
+        "autonomy_tiers": d["meta"].get("autonomy_tiers", []),
+        "recovery_tiers": d["meta"].get("recovery_tiers", []),
+    }, indent=2, ensure_ascii=False)
+def main():
+    mcp.run()
+if __name__ == "__main__":
+    main()
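
To make the ranking in `pick_harness` concrete, here is a standalone sketch of its fallback path (the one used when a project is not a curated pick). The tokenizer, stop list, prefix-tolerant overlap, and score formula are copied from `server.py` above; the sample project record and the `fallback_score` wrapper are hypothetical and exist only for the demo.

```python
import math
import re

# Stop words and helpers mirror _STOP, _tokens and _overlap in server.py.
STOP = {"i", "a", "an", "the", "to", "for", "of", "in", "on", "with", "and",
        "or", "my", "me", "want", "need", "agent", "agents", "ai", "llm"}


def tokens(text: str) -> set:
    return {w for w in re.findall(r"[a-z0-9+#-]+", text.lower()) if w not in STOP}


def overlap(q: set, hay: set) -> set:
    """Query tokens that equal a hay token, or share a prefix relation with
    one when both tokens are at least 4 characters long."""
    hits = set()
    for w in q:
        for h in hay:
            if w == h or (len(w) >= 4 and len(h) >= 4
                          and (w.startswith(h) or h.startswith(w))):
                hits.add(w)
                break
    return hits


def fallback_score(use_case: str, project: dict):
    """Score a non-curated project: 3 points per matched token plus log10(stars)."""
    hay = f"{project['description']} {' '.join(project['tags'])} {project['category_title']}"
    hits = overlap(tokens(use_case), tokens(hay))
    if not hits:
        return None  # the server skips projects with no overlap
    return len(hits) * 3 + math.log10(max(project["stars"], 2)), hits


# Hypothetical record, shaped like an entry in harnesses.json.
project = {
    "description": "Runs coding benchmarks for evaluating tools",
    "tags": ["evals"],
    "category_title": "Evaluation",
    "stars": 1000,
}
score, hits = fallback_score("evaluate agents on coding benchmark", project)
print(score, sorted(hits))  # 9.0 ['benchmark', 'coding']
```

Note that `evaluate` does not match `evaluating`, `evals`, or `evaluation`, because none of those is a prefix of another; only `coding` (exact) and `benchmark` (prefix of `benchmarks`) count. Curated picks bypass this formula and receive `100 - rank`, which will usually place them above fallback matches.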