npm - auxiliar-mcp - Versions diffs - 0.9.0 → 0.9.1 - Mend

auxiliar-mcp 0.9.0 → 0.9.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md +64 -28
package/dist/data/event-sources.js +9 -10
package/package.json +26 -8

package/README.md CHANGED Viewed

@@ -1,10 +1,13 @@
 # auxiliar-mcp
-MCP server that keeps your AI agent's infrastructure knowledge current.
+The MCP server that tells your agent **which tool to install** for a task, and **which cloud service to pick** for a stack.
-Your agent still thinks SendGrid has a free tier. It doesn't (removed March 2025). It recommends Auth.js — which is in maintenance mode. It quotes Neon at $19/month — pricing is now usage-based from $0.
+Your agent is intelligent but stuck. It doesn't know Surya beats Tesseract by 1.5pp on word accuracy for Brazilian NFS-e invoices. It doesn't know SendGrid killed its free tier. It guesses, installs the wrong thing, and you burn 30 minutes.
-**auxiliar-mcp** gives your agent current, Chrome-verified data about 74 cloud services across 16 categories.
+**auxiliar-mcp** gives your agent reproducible eval-backed answers for the two questions it hits most:
+1. **"What installable tool should I use for task X?"** — skills, MCPs, vendor APIs, local binaries — ranked on real-world corpora, via `solve_task`.
+2. **"What cloud service should I pick for Y?"** — Chrome-verified pricing, risks, compatibility, setup commands for 77 services across 16 categories, via `recommend_service`.
 ## Install
@@ -16,22 +19,53 @@ claude mcp add auxiliar -- npx auxiliar-mcp
 npx auxiliar-mcp
 ```
-## Tools
+## Tools (8)
 | Tool | What it does |
 |------|-------------|
-| `recommend_service` | Picks the best service based on your constraints (framework, budget, region, GDPR, edge, lock-in) |
-| `get_pricing` | Chrome-verified pricing — including JS-rendered pages agents can't read via WebFetch |
-| `get_risks` | Risk flags, gotchas, and recent breaking changes |
-| `check_compatibility` | Warns about known conflicts between services (e.g., Turso + Prisma needs adapter) |
-| `setup_service` | CLI commands, signup URLs, env vars, estimated setup time |
+| `solve_task` | Get the ranked list of installable tools for a job-to-be-done (e.g., `pdf-text-extraction-mcp`, `nfs-e`, `boleto`, `receipt-parsing`, `bookkeeping-ocr`) with scorecards, install commands, FAQ, alternatives considered, and methodological caveats. |
+| `list_solve_tasks` | Discover every `/solve/` task ranking available — slugs, top picks, categories, agent compatibility. |
+| `recommend_service` | Picks the best cloud service for your constraints (framework, budget, region, GDPR, edge, lock-in). |
+| `get_pricing` | Chrome-verified pricing — including JS-rendered pages agents can't read via WebFetch. |
+| `get_risks` | Risk flags, gotchas, recent breaking changes. |
+| `check_compatibility` | Warns about known conflicts between services (e.g., Turso + Prisma needs adapter). |
+| `setup_service` | CLI commands, signup URLs, env vars, estimated setup time. |
+| `list_services` | Browse the full 77-service catalog, filtered by category. |
+## When to use `solve_task`
+Your agent needs an **installable tool** (skill, MCP, vendor API, or local binary) and you want a reproducible evaluation, not vibes.
+```
+Agent: "I need to extract text from Brazilian NFS-e invoices, boletos, and phone-photo receipts. What should I install?"
-## Example
+→ solve_task(task_slug="pdf-text-extraction-mcp")
+  # aliases work too: "pdf", "ocr", "nfs-e", "boleto", "receipt-parsing", "bookkeeping-ocr", "invoice-extraction"
+Returns (truncated):
+{
+  "answer": "Install Surya (pip install surya-ocr + pin transformers<5.0.0). It led our 10-document real-world corpus on word accuracy (76.9%) and layout preservation (7.0/10), free, local. Tesseract 5 runs 14× faster for throughput-critical workflows. Google Document AI wins on phone-photo receipts specifically...",
+  "candidates": [
+    { "slug": "surya", "rank": 1, "scorecard": {"word_accuracy": 0.769, "layout": 7, "p50_latency_sec": 22.1, "install_friction": 7, "cost_per_10_docs_usd": 0} },
+    { "slug": "tesseract", "rank": 2, "scorecard": {"word_accuracy": 0.754, "layout": 5, "p50_latency_sec": 1.6, "install_friction": 3, "cost_per_10_docs_usd": 0} },
+    { "slug": "google-document-ai", "rank": 3, "scorecard": {"word_accuracy": 0.697, "layout": 5.7, "p50_latency_sec": 3.8, "install_friction": 7, "cost_per_10_docs_usd": 0.069} }
+  ],
+  "corpus_summary": "10 real-world documents: native-text PDFs, legal docs, Brazilian corporate-registry scans, NFS-e invoices, boletos, phone-photo receipts.",
+  "alternatives_considered": [ /* yescan, Mistral OCR, pdf-reader-mcp — dropped with reasons */ ],
+  "faq": [ /* e.g., "Why does all score 0 on the boleto?" */ ]
+}
+```
+Full page with reproducible commands: https://auxiliar.ai/solve/pdf-text-extraction-mcp/
+## When to use `recommend_service`
+Your agent needs a **cloud service** (database, email provider, auth, payments, etc.).
 ```
 Agent: "I need a database for my Next.js app. Budget is free, deployed to Cloudflare Workers."
-→ recommend_service(need="database", framework="nextjs", constraints="edge, zero cold starts")
+→ recommend_service(need="database", framework="nextjs", budget="free", constraints="edge, zero cold starts")
 Returns:
 {
@@ -40,17 +74,12 @@ Returns:
   "pricing": { "free_tier": "5 GB storage, 100 databases" },
   "risks": ["Not PostgreSQL — limited ORM support"],
   "migration_difficulty": "high",
-  "key_features": ["SQLite/libSQL", "embedded replicas", "zero cold starts"],
-  "mcp_available": false,
-  "cli_available": true,
   "cli_install": "brew install tursodatabase/tap/turso",
-  "alternatives": [
-    { "provider": "neon", "trade_off": "Has cold starts on free tier" }
-  ]
+  "alternatives": [{"provider": "neon", "trade_off": "Has cold starts on free tier"}]
 }
 ```
-## Services Covered
+## Services Covered (77)
 **Database:** Neon, Supabase, Turso, PlanetScale, Render Postgres, AWS RDS, Railway Postgres, Cloudflare D1
 **Email:** Resend, Postmark, SendGrid, AWS SES, Mailgun, Listmonk
@@ -68,17 +97,23 @@ Returns:
 **SMS:** Twilio, Vonage, MessageBird
 **Feature Flags:** LaunchDarkly, Statsig, Flagsmith, Unleash
 **Cron:** Inngest, Trigger.dev, QStash, Vercel Cron, Cloudflare Cron
+**PDF / OCR (via solve_task):** Surya, Tesseract 5, Google Document AI
+## /solve/ Tasks Available
+| Slug | Top pick | Corpus | Categories |
+|------|----------|--------|-----------|
+| `pdf-text-extraction-mcp` | Surya | 10 Brazilian docs incl. NFS-e, boleto, phone-photo receipts | pdf-processing, ocr, agent-tools |
+More `/solve/` rankings added as walkthroughs run. Each page includes its reproducible command so you can re-run the eval yourself.
 ## Data Quality
-- Pricing verified by browsing actual service websites (Chrome DevTools, not WebFetch)
-- Updated March 2026
-- Tested with 50+ agent runs across 8 iterations
-- Scores 9/10 on recommendation accuracy
-- 47 category aliases (agents can query "llm-api", "file-storage", "vector-db", etc.)
-- 27 compatibility rules with cross-service conflict detection
+- **/solve/ evals:** reproducible corpus + harness + scoring per task. Ground truth is LLM-drafted, human-finalized. Published commands can be re-run locally.
+- **Cloud-service pricing:** Chrome-verified (actual service websites, not training data). Updated through 2026-04.
+- **Trust scores:** 50+ agent runs across 8 iterations; 47 category aliases; 27 compatibility rules.
-## Constraints You Can Use
+## Constraints You Can Use on `recommend_service`
 | Constraint | Example |
 |-----------|---------|
@@ -92,12 +127,13 @@ Returns:
 ## Privacy
-The MCP server pings `auxiliar.ai/api/` on each tool call for analytics. Only query parameters are sent (e.g., `?need=database&framework=nextjs`). No personal data, no API keys, no project info. Works offline with bundled data if the ping fails.
+The MCP server pings `auxiliar.ai/api/` on each tool call for analytics. Only query parameters are sent (e.g., `?need=database&framework=nextjs` or `?task_slug=pdf-text-extraction-mcp`). No personal data, no API keys, no project info. Works offline with bundled data if the ping fails.
 ## Links
-- [auxiliar.ai](https://auxiliar.ai) — comparison site with full service entries
-- [GitHub](https://github.com/Tlalvarez/Auxiliar-ai)
+- [auxiliar.ai](https://auxiliar.ai) — the comparison site with service entries and `/solve/` task rankings
+- [/solve/pdf-text-extraction-mcp](https://auxiliar.ai/solve/pdf-text-extraction-mcp/) — the OCR walkthrough
+- [GitHub](https://github.com/Tlalvarez/Auxiliar-ai) — source + reproducible eval harness under `scripts/ocr-walkthrough/`
 ## License

package/dist/data/event-sources.js CHANGED Viewed

@@ -36,7 +36,7 @@ export const eventSources = [
         name: "Vercel",
         sources: [
             { type: "status", url: "https://www.vercel-status.com/history.rss" },
-            { type: "changelog", url: "https://vercel.com/changelog/feed.xml" },
+            { type: "changelog", url: "https://vercel.com/atom" },
             {
                 type: "security",
                 url: "https://vercel.com/kb/bulletin",
@@ -48,7 +48,7 @@ export const eventSources = [
         slug: "stripe",
         name: "Stripe",
         sources: [
-            { type: "status", url: "https://status.stripe.com/history.rss" },
+            { type: "status", url: "https://www.stripestatus.com/history.rss" },
             { type: "changelog", url: "https://stripe.com/blog/feed.rss" },
         ],
     },
@@ -57,15 +57,15 @@ export const eventSources = [
         name: "Supabase",
         sources: [
             { type: "status", url: "https://status.supabase.com/history.rss" },
-            { type: "changelog", url: "https://supabase.com/changelog/feed.xml" },
+            { type: "changelog", url: "https://supabase.com/feed.xml" },
         ],
     },
     {
         slug: "neon",
         name: "Neon",
         sources: [
-            { type: "status", url: "https://neonstatus.com/history.rss" },
-            { type: "changelog", url: "https://neon.tech/changelog/feed.xml" },
+            { type: "status", url: "https://neonstatus.com/pages/6878fc85709daa75be6c7e3c/rss" },
+            { type: "changelog", url: "https://neon.com/blog/rss.xml" },
         ],
     },
     {
@@ -73,7 +73,7 @@ export const eventSources = [
         name: "Clerk",
         sources: [
             { type: "status", url: "https://status.clerk.com/history.rss" },
-            { type: "changelog", url: "https://clerk.com/changelog/feed.xml" },
+            { type: "changelog", url: "https://clerk.com/changelog/atom.xml" },
         ],
     },
     {
@@ -92,8 +92,7 @@ export const eventSources = [
         slug: "railway",
         name: "Railway",
         sources: [
-            { type: "status", url: "https://status.railway.com/history.rss" },
-            { type: "changelog", url: "https://blog.railway.com/feed.xml" },
+            { type: "status", url: "https://railway.instatus.com/history.rss" },
         ],
     },
     {
@@ -101,7 +100,7 @@ export const eventSources = [
         name: "Render",
         sources: [
             { type: "status", url: "https://status.render.com/history.rss" },
-            { type: "changelog", url: "https://render.com/changelog/rss.xml" },
+            { type: "changelog", url: "https://render.com/changelog/feed.xml" },
         ],
     },
     {
@@ -109,7 +108,7 @@ export const eventSources = [
         name: "Resend",
         sources: [
             { type: "status", url: "https://resend-status.com/history.rss" },
-            { type: "changelog", url: "https://resend.com/changelog/feed.xml" },
+            { type: "changelog", url: "https://resend.com/blog/index.xml" },
         ],
     },
     {

package/package.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "name": "auxiliar-mcp",
-  "version": "0.9.0",
-  "description": "MCP server that keeps your AI agent's infrastructure knowledge current. Chrome-verified pricing, risk flags, compatibility checks, setup guides for 77 cloud services and local agent tools, plus /solve/ task rankings with reproducible evals for agent-installable tooling (skills, MCPs, APIs, local binaries).",
+  "version": "0.9.1",
+  "description": "Agent-installable-tool rankings (OCR, PDF extraction, NFS-e, bookkeeping, and more) for Claude Code, Cursor, Claude Desktop, OpenClaw — evaluated on real-world corpora. Call solve_task to get the best skill/MCP/API/local binary for a task, ranked by word accuracy, layout, latency, cost, and install friction. Also Chrome-verified pricing, risks, and setup for 77 cloud services.",
   "type": "module",
   "main": "dist/server.js",
   "bin": {
@@ -24,17 +24,35 @@
     "model-context-protocol",
     "ai-agent",
     "claude-code",
+    "claude-desktop",
     "cursor",
     "windsurf",
+    "openclaw",
+    "clawhub",
+    "ocr",
+    "pdf",
+    "pdf-extraction",
+    "pdf-ocr",
+    "document-ai",
+    "invoice-extraction",
+    "nfs-e",
+    "boleto",
+    "bookkeeping",
+    "brazilian-invoice",
+    "receipt-parsing",
+    "solve-task",
+    "task-ranking",
+    "agent-tools",
+    "agent-skills",
+    "tool-selection",
+    "agent-upgrade",
     "cloud-services",
     "pricing",
     "developer-tools",
-    "neon",
-    "resend",
-    "supabase",
-    "vercel",
-    "stripe",
-    "infrastructure"
+    "infrastructure",
+    "surya",
+    "tesseract",
+    "google-document-ai"
   ],
   "license": "MIT",
   "repository": {