PyPI - agentx-kit - Versions diffs - 0.4.0__tar.gz → 0.5.0__tar.gz - Mend

agentx-kit 0.4.0tar.gz → 0.5.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (90) hide show

agentx_kit-0.5.0/.agentx/llm_cache.sqlite ADDED Viewed

Binary file

agentx_kit-0.5.0/.claude-plugin/marketplace.json ADDED Viewed

@@ -0,0 +1,15 @@
+{
+  "name": "agentx-kit",
+  "owner": { "name": "AgentX", "url": "https://github.com/muhammadyahiya/agentx-kit" },
+  "metadata": {
+    "description": "AgentX-Kit — scaffold agent projects from a prompt, in Claude Code.",
+    "version": "0.1.0"
+  },
+  "plugins": [
+    {
+      "name": "agentx-kit",
+      "source": "./integrations/claude-plugin",
+      "description": "Scaffold complete LangChain/CrewAI agent projects from a single problem statement via AgentX-Kit's MCP tools."
+    }
+  ]
+}

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: agentx-kit
-Version: 0.4.0
+Version: 0.5.0
 Summary: An open-source, provider-agnostic agentic framework + interactive project scaffolder for LangChain and CrewAI. Pick your LLM provider, agents, RAG, memory, MCP tools and skills — generate a ready-to-run uv project.
 Project-URL: Homepage, https://github.com/muhammadyahiya/agentx-kit
 Project-URL: Repository, https://github.com/muhammadyahiya/agentx-kit
@@ -291,6 +291,43 @@ The assistant calls AgentX-Kit's tools and you get a complete, runnable project:
 So from one sentence the assistant produces a pre-wired project (prompts already seeded from your use case), ready to `uv sync && uv run`.
+## 🧩 Editor & assistant integrations
+The same connector powers ready-made integrations (see [`integrations/`](integrations/)):
+- **VS Code extension** ([`integrations/vscode`](integrations/vscode)) — commands for
+  *New Agent Project*, *Open Prompt Dashboard*, *Add Prompt*, *Cache Stats*, and
+  *Register MCP Server for Copilot* (writes `.vscode/mcp.json`). Build with `vsce package`.
+- **GitHub Copilot** (agent mode) — add the MCP server via `.vscode/mcp.json`:
+  ```jsonc
+  { "servers": { "agentx-kit": { "command": "agentx", "args": ["mcp"] } } }
+  ```
+  (the VS Code command above writes this for you), then ask Copilot to build an agent.
+- **Claude Code plugin** ([`integrations/claude-plugin`](integrations/claude-plugin)):
+  ```text
+  /plugin marketplace add muhammadyahiya/agentx-kit
+  /plugin install agentx-kit@agentx-kit
+  /agentx-kit:new-agent a support agent that answers from our docs and serves an API
+  ```
+- **Claude Desktop / Codex** — add the connector config from `agentx mcp --print-config`.
+## 💾 Response caching (cost & latency saver)
+Caching is the top 2026 token-optimization lever. Turn on a **global LLM response
+cache** and every provider call is served from a local store on repeat — no code changes:
+```python
+from agentx import enable_caching, cache_stats
+enable_caching()                 # all get_chat_model(...) calls are cached
+...
+print(cache_stats())             # {'hit_rate': 0.6, 'tokens_saved': 12000, 'est_usd_saved': 0.024, ...}
+```
+```bash
+agentx cache stats               # hit rate + estimated tokens/$ saved
+agentx cache clear
+```
+Generated projects can enable it automatically (it's part of `--enterprise`), and the
+**dashboard's Trends tab shows live hit-rate and $ saved**. TTL-capable, SQLite-backed
+at `.agentx/llm_cache.sqlite`.
 ## 🏢 Enterprise pack
 Generate a production-shaped project with one flag — informed by a survey of
 CrewAI/LangGraph/create-llama/AgentStack/agno/pydantic-ai (see [RESEARCH.md](RESEARCH.md)):

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/README.md RENAMED Viewed

@@ -187,6 +187,43 @@ The assistant calls AgentX-Kit's tools and you get a complete, runnable project:
 So from one sentence the assistant produces a pre-wired project (prompts already seeded from your use case), ready to `uv sync && uv run`.
+## 🧩 Editor & assistant integrations
+The same connector powers ready-made integrations (see [`integrations/`](integrations/)):
+- **VS Code extension** ([`integrations/vscode`](integrations/vscode)) — commands for
+  *New Agent Project*, *Open Prompt Dashboard*, *Add Prompt*, *Cache Stats*, and
+  *Register MCP Server for Copilot* (writes `.vscode/mcp.json`). Build with `vsce package`.
+- **GitHub Copilot** (agent mode) — add the MCP server via `.vscode/mcp.json`:
+  ```jsonc
+  { "servers": { "agentx-kit": { "command": "agentx", "args": ["mcp"] } } }
+  ```
+  (the VS Code command above writes this for you), then ask Copilot to build an agent.
+- **Claude Code plugin** ([`integrations/claude-plugin`](integrations/claude-plugin)):
+  ```text
+  /plugin marketplace add muhammadyahiya/agentx-kit
+  /plugin install agentx-kit@agentx-kit
+  /agentx-kit:new-agent a support agent that answers from our docs and serves an API
+  ```
+- **Claude Desktop / Codex** — add the connector config from `agentx mcp --print-config`.
+## 💾 Response caching (cost & latency saver)
+Caching is the top 2026 token-optimization lever. Turn on a **global LLM response
+cache** and every provider call is served from a local store on repeat — no code changes:
+```python
+from agentx import enable_caching, cache_stats
+enable_caching()                 # all get_chat_model(...) calls are cached
+...
+print(cache_stats())             # {'hit_rate': 0.6, 'tokens_saved': 12000, 'est_usd_saved': 0.024, ...}
+```
+```bash
+agentx cache stats               # hit rate + estimated tokens/$ saved
+agentx cache clear
+```
+Generated projects can enable it automatically (it's part of `--enterprise`), and the
+**dashboard's Trends tab shows live hit-rate and $ saved**. TTL-capable, SQLite-backed
+at `.agentx/llm_cache.sqlite`.
 ## 🏢 Enterprise pack
 Generate a production-shaped project with one flag — informed by a survey of
 CrewAI/LangGraph/create-llama/AgentStack/agno/pydantic-ai (see [RESEARCH.md](RESEARCH.md)):

agentx_kit-0.5.0/integrations/claude-plugin/.claude-plugin/plugin.json ADDED Viewed

@@ -0,0 +1,9 @@
+{
+  "name": "agentx-kit",
+  "version": "0.1.0",
+  "description": "Scaffold complete provider-agnostic LangChain/CrewAI agent projects from a single problem statement, via AgentX-Kit's MCP tools.",
+  "author": { "name": "AgentX" },
+  "homepage": "https://github.com/muhammadyahiya/agentx-kit",
+  "license": "MIT",
+  "keywords": ["agents", "scaffold", "langchain", "crewai", "mcp"]
+}

agentx_kit-0.5.0/integrations/claude-plugin/.mcp.json ADDED Viewed

@@ -0,0 +1,8 @@
+{
+  "mcpServers": {
+    "agentx-kit": {
+      "command": "agentx",
+      "args": ["mcp"]
+    }
+  }
+}

agentx_kit-0.5.0/integrations/claude-plugin/README.md ADDED Viewed

@@ -0,0 +1,29 @@
+# AgentX-Kit — Claude Code plugin
+Bundles AgentX-Kit's MCP server + a `/agentx-kit:new-agent` slash command so you
+can scaffold a complete agent project from a single problem statement inside
+Claude Code.
+## Prerequisite
+```bash
+pip install "agentx-kit[connector]"   # provides `agentx mcp`
+```
+## Install (from this repo's marketplace)
+```text
+/plugin marketplace add muhammadyahiya/agentx-kit
+/plugin install agentx-kit@agentx-kit
+```
+Then use it:
+```text
+/agentx-kit:new-agent a customer-support agent that answers from our docs and serves an API
+```
+## Or just add the MCP server (no plugin)
+```bash
+claude mcp add agentx-kit -- agentx mcp
+```
+The plugin ships:
+- `.mcp.json` — registers the `agentx-kit` MCP server (`agentx mcp`).
+- `commands/new-agent.md` — the `/agentx-kit:new-agent` workflow.

agentx_kit-0.5.0/integrations/claude-plugin/commands/new-agent.md ADDED Viewed

@@ -0,0 +1,14 @@
+---
+description: Scaffold a complete AgentX-Kit agent project from a problem statement
+argument-hint: <describe the agent / use case you want to build>
+---
+Build a complete, runnable agent project for this request using the AgentX-Kit MCP tools:
+**$ARGUMENTS**
+Steps:
+1. Call `recommend_project` with the problem statement and briefly show the recommended stack (framework, provider, agents, features) + rationale.
+2. Ask the user to confirm or adjust (provider/framework/enterprise), then call `create_agent_project` with the problem statement and any overrides.
+3. Report the target directory, the generated file tree, and the exact run steps it returns. Offer to open key files (main.py, agents.py, prompts.json).
+Keep it concise; the tools do the heavy lifting.

agentx_kit-0.5.0/integrations/vscode/.vscodeignore ADDED Viewed

@@ -0,0 +1,4 @@
+.vscode/**
+**/*.map
+.gitignore
+node_modules/**

agentx_kit-0.5.0/integrations/vscode/README.md ADDED Viewed

@@ -0,0 +1,34 @@
+# AgentX-Kit — VS Code extension
+Scaffold agent projects, open the prompt dashboard, and wire AgentX-Kit into
+**GitHub Copilot** (agent mode) — without leaving VS Code.
+## Prerequisite
+```bash
+pip install "agentx-kit[all]"   # provides the `agentx` CLI the extension calls
+```
+## Commands (⇧⌘P)
+- **AgentX: New Agent Project** — name + use case → `agentx new`
+- **AgentX: Open Prompt Dashboard** — `agentx dashboard`
+- **AgentX: Add Agent Prompt** — `agentx prompt set … -d`
+- **AgentX: Show Response-Cache Stats** — `agentx cache stats`
+- **AgentX: Register MCP Server for Copilot** — writes `.vscode/mcp.json` so Copilot
+  agent mode can call AgentX-Kit's tools (e.g. *"build a support agent over our docs"*).
+Set a custom CLI path with the `agentx.command` setting.
+## Build / install locally
+```bash
+npm install -g @vscode/vsce
+cd integrations/vscode
+vsce package           # -> agentx-kit-0.1.0.vsix
+code --install-extension agentx-kit-0.1.0.vsix
+```
+## Publish (needs a Marketplace publisher + PAT)
+```bash
+vsce login <publisher>
+vsce publish
+```
+See https://code.visualstudio.com/api/working-with-extensions/publishing-extension.

agentx_kit-0.5.0/integrations/vscode/extension.js ADDED Viewed

@@ -0,0 +1,87 @@
+// AgentX-Kit VS Code extension.
+// Thin wrapper over the `agentx` CLI + MCP registration for Copilot/agent mode.
+// Pure JS (no build step). Requires `pip install agentx-kit` on PATH.
+const vscode = require("vscode");
+const fs = require("fs");
+const path = require("path");
+function cli() {
+  return vscode.workspace.getConfiguration("agentx").get("command", "agentx");
+}
+function runInTerminal(name, commandLine) {
+  const term = vscode.window.createTerminal({ name });
+  term.show();
+  term.sendText(commandLine);
+}
+function workspaceRoot() {
+  const folders = vscode.workspace.workspaceFolders;
+  return folders && folders.length ? folders[0].uri.fsPath : process.cwd();
+}
+async function newProject() {
+  const name = await vscode.window.showInputBox({
+    prompt: "Project name", value: "my-agent",
+  });
+  if (!name) return;
+  const problem = await vscode.window.showInputBox({
+    prompt: "Describe the use case (optional — seeds the agent's prompt)", value: "",
+  });
+  const enterprise = await vscode.window.showQuickPick(["No", "Yes (tracing, guardrails, FastAPI, Docker, CI, evals, cache)"], {
+    placeHolder: "Enterprise pack?",
+  });
+  let cmd = `${cli()} new --yes --name ${JSON.stringify(name)}`;
+  if (problem) cmd += ` --prompt ${JSON.stringify(problem)}`;
+  if (enterprise && enterprise.startsWith("Yes")) cmd += " --enterprise";
+  runInTerminal("AgentX: new", cmd);
+}
+function dashboard() {
+  runInTerminal("AgentX: dashboard", `${cli()} dashboard`);
+}
+async function addPrompt() {
+  const agent = await vscode.window.showInputBox({ prompt: "Agent name", value: "assistant" });
+  if (!agent) return;
+  const text = await vscode.window.showInputBox({ prompt: "System prompt" });
+  if (text === undefined) return;
+  runInTerminal("AgentX: prompt", `${cli()} prompt set ${JSON.stringify(agent)} --text ${JSON.stringify(text)} -d`);
+}
+function cacheStats() {
+  runInTerminal("AgentX: cache", `${cli()} cache stats`);
+}
+// Write .vscode/mcp.json so GitHub Copilot (agent mode) / VS Code can use AgentX-Kit's MCP server.
+async function registerMcp() {
+  const root = workspaceRoot();
+  const dir = path.join(root, ".vscode");
+  const file = path.join(dir, "mcp.json");
+  let config = { servers: {} };
+  try {
+    if (fs.existsSync(file)) config = JSON.parse(fs.readFileSync(file, "utf8"));
+  } catch (e) { /* start fresh on parse error */ }
+  config.servers = config.servers || {};
+  config.servers["agentx-kit"] = { command: cli(), args: ["mcp"] };
+  fs.mkdirSync(dir, { recursive: true });
+  fs.writeFileSync(file, JSON.stringify(config, null, 2));
+  vscode.window.showInformationMessage(
+    "AgentX-Kit MCP server registered in .vscode/mcp.json. Open Copilot Chat (Agent mode) and ask it to build an agent from a problem statement."
+  );
+  const doc = await vscode.workspace.openTextDocument(file);
+  vscode.window.showTextDocument(doc);
+}
+function activate(context) {
+  const reg = (id, fn) => context.subscriptions.push(vscode.commands.registerCommand(id, fn));
+  reg("agentx.newProject", newProject);
+  reg("agentx.dashboard", dashboard);
+  reg("agentx.addPrompt", addPrompt);
+  reg("agentx.cacheStats", cacheStats);
+  reg("agentx.registerMcp", registerMcp);
+}
+function deactivate() {}
+module.exports = { activate, deactivate };

agentx_kit-0.5.0/integrations/vscode/package.json ADDED Viewed

@@ -0,0 +1,34 @@
+{
+  "name": "agentx-kit",
+  "displayName": "AgentX-Kit",
+  "description": "Scaffold provider-agnostic LangChain/CrewAI agent projects, open the prompt dashboard, and wire AgentX-Kit into Copilot — from VS Code.",
+  "version": "0.1.0",
+  "publisher": "agentx",
+  "engines": { "vscode": "^1.85.0" },
+  "categories": ["Machine Learning", "Snippets", "Other"],
+  "keywords": ["ai", "agents", "llm", "langchain", "crewai", "mcp", "scaffold", "copilot"],
+  "icon": "icon.png",
+  "repository": { "type": "git", "url": "https://github.com/muhammadyahiya/agentx-kit" },
+  "license": "MIT",
+  "main": "./extension.js",
+  "activationEvents": [],
+  "contributes": {
+    "commands": [
+      { "command": "agentx.newProject", "title": "AgentX: New Agent Project" },
+      { "command": "agentx.dashboard", "title": "AgentX: Open Prompt Dashboard" },
+      { "command": "agentx.addPrompt", "title": "AgentX: Add Agent Prompt" },
+      { "command": "agentx.cacheStats", "title": "AgentX: Show Response-Cache Stats" },
+      { "command": "agentx.registerMcp", "title": "AgentX: Register MCP Server for Copilot" }
+    ],
+    "configuration": {
+      "title": "AgentX-Kit",
+      "properties": {
+        "agentx.command": {
+          "type": "string",
+          "default": "agentx",
+          "description": "Path to the agentx CLI (from `pip install agentx-kit`)."
+        }
+      }
+    }
+  }
+}

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/pyproject.toml RENAMED Viewed

@@ -1,7 +1,7 @@
 [project]
 # PyPI distribution name (import name + CLI stay `agentx`; `agentx` was taken).
 name = "agentx-kit"
-version = "0.4.0"
+version = "0.5.0"
 description = "An open-source, provider-agnostic agentic framework + interactive project scaffolder for LangChain and CrewAI. Pick your LLM provider, agents, RAG, memory, MCP tools and skills — generate a ready-to-run uv project."
 readme = "README.md"
 requires-python = ">=3.10,<3.14"

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/__init__.py RENAMED Viewed

@@ -16,7 +16,7 @@ is enough to get started.
 """
 from __future__ import annotations
-__version__ = "0.4.0"
+__version__ = "0.5.0"
 from .providers import (  # noqa: E402
     ProviderSpec,
@@ -39,6 +39,7 @@ from .insights import (  # noqa: E402
     estimate_cost,
     optimize_prompt,
 )
+from .cache import cache_stats, clear_cache, disable_caching, enable_caching  # noqa: E402
 __all__ = [
     "__version__",
@@ -63,4 +64,9 @@ __all__ = [
     "optimize_prompt",
     "count_tokens",
     "estimate_cost",
+    # response caching
+    "enable_caching",
+    "disable_caching",
+    "cache_stats",
+    "clear_cache",
 ]

agentx_kit-0.5.0/src/agentx/cache.py ADDED Viewed

@@ -0,0 +1,166 @@
+"""LLM response caching — cut cost & latency across all providers.
+Caching is the top token-optimization lever (2026): repeat/near-repeat calls are
+served from a local store instead of the model. This is an **exact** response
+cache implemented as a LangChain ``BaseCache`` and installed globally, so every
+``get_chat_model(...)`` call benefits automatically — no code changes.
+    from agentx.cache import enable_caching, cache_stats
+    enable_caching()                 # all subsequent LLM calls are cached
+    ...
+    print(cache_stats())             # hits, misses, est. tokens/$ saved
+Persistence is a small SQLite file (default ``.agentx/llm_cache.sqlite``) with an
+optional TTL. Stats track hit/miss and estimated tokens + USD saved (derived from
+cached completion sizes — see ``agentx.insights.tokens``).
+"""
+from __future__ import annotations
+import hashlib
+import sqlite3
+import threading
+import time
+from pathlib import Path
+from typing import Any
+_DEFAULT_PATH = ".agentx/llm_cache.sqlite"
+_lock = threading.Lock()
+def _key(prompt: str, llm_string: str) -> str:
+    return hashlib.sha256(f"{llm_string}\x00{prompt}".encode("utf-8")).hexdigest()
+def _model_from_llm_string(llm_string: str) -> str:
+    # llm_string is a serialized model descriptor; best-effort model name for costing.
+    for token in ("gpt-4o-mini", "gpt-4o", "gpt-4.1", "claude-3-5", "gemini-1.5", "llama"):
+        if token in llm_string:
+            return token
+    return "gpt-4o-mini"
+class AgentXCache:
+    """A LangChain ``BaseCache`` backed by SQLite, with TTL + savings stats.
+    Implements ``lookup``/``update`` (and ``aclear``) so it can be passed to
+    ``langchain_core.globals.set_llm_cache``.
+    """
+    def __init__(self, path: str | Path = _DEFAULT_PATH, ttl: int | None = None):
+        self.path = Path(path)
+        self.path.parent.mkdir(parents=True, exist_ok=True)
+        self.ttl = ttl
+        self._init_db()
+    def _conn(self) -> sqlite3.Connection:
+        return sqlite3.connect(str(self.path))
+    def _init_db(self) -> None:
+        with _lock, self._conn() as c:
+            c.execute(
+                "CREATE TABLE IF NOT EXISTS cache "
+                "(key TEXT PRIMARY KEY, value TEXT, ts REAL, model TEXT, out_tokens INT)"
+            )
+            c.execute("CREATE TABLE IF NOT EXISTS stats (name TEXT PRIMARY KEY, val REAL)")
+            for name in ("hits", "misses", "tokens_saved"):
+                c.execute("INSERT OR IGNORE INTO stats(name, val) VALUES (?, 0)", (name,))
+    def _bump(self, conn: sqlite3.Connection, name: str, by: float = 1) -> None:
+        conn.execute("UPDATE stats SET val = val + ? WHERE name = ?", (by, name))
+    # ----- BaseCache interface -----
+    def lookup(self, prompt: str, llm_string: str) -> Any | None:
+        import warnings
+        from langchain_core.load import loads
+        key = _key(prompt, llm_string)
+        with _lock, self._conn() as c:
+            row = c.execute("SELECT value, ts, out_tokens FROM cache WHERE key = ?", (key,)).fetchone()
+            if not row:
+                self._bump(c, "misses")
+                return None
+            value, ts, out_tokens = row
+            if self.ttl is not None and (time.time() - ts) > self.ttl:
+                c.execute("DELETE FROM cache WHERE key = ?", (key,))
+                self._bump(c, "misses")
+                return None
+            self._bump(c, "hits")
+            self._bump(c, "tokens_saved", out_tokens or 0)
+        try:
+            # We wrote these values ourselves, so deserialization is trusted.
+            with warnings.catch_warnings():
+                warnings.simplefilter("ignore")
+                return loads(value)
+        except Exception:  # noqa: BLE001 - corrupt entry
+            return None
+    def update(self, prompt: str, llm_string: str, return_val: Any) -> None:
+        from langchain_core.load import dumps
+        from .insights.tokens import count_tokens
+        key = _key(prompt, llm_string)
+        model = _model_from_llm_string(llm_string)
+        text = ""
+        try:
+            text = " ".join(getattr(g, "text", "") or "" for g in return_val)
+        except Exception:  # noqa: BLE001
+            text = ""
+        out_tokens = count_tokens(text, model)
+        try:
+            payload = dumps(return_val)
+        except Exception:  # noqa: BLE001 - non-serializable result; skip caching
+            return
+        with _lock, self._conn() as c:
+            c.execute(
+                "INSERT OR REPLACE INTO cache(key, value, ts, model, out_tokens) VALUES (?, ?, ?, ?, ?)",
+                (key, payload, time.time(), model, out_tokens),
+            )
+    def clear(self, **kwargs: Any) -> None:
+        with _lock, self._conn() as c:
+            c.execute("DELETE FROM cache")
+            c.execute("UPDATE stats SET val = 0")
+    def stats(self) -> dict:
+        with _lock, self._conn() as c:
+            rows = dict(c.execute("SELECT name, val FROM stats").fetchall())
+            entries = c.execute("SELECT COUNT(*) FROM cache").fetchone()[0]
+        hits, misses = int(rows.get("hits", 0)), int(rows.get("misses", 0))
+        total = hits + misses
+        tokens_saved = int(rows.get("tokens_saved", 0))
+        # Conservative blended estimate: $0.002 / 1K output tokens saved.
+        cost_saved = round(tokens_saved / 1000 * 0.002, 6)
+        return {
+            "entries": entries,
+            "hits": hits,
+            "misses": misses,
+            "hit_rate": round(hits / total, 3) if total else 0.0,
+            "tokens_saved": tokens_saved,
+            "est_usd_saved": cost_saved,
+            "path": str(self.path),
+        }
+def enable_caching(path: str | Path = _DEFAULT_PATH, ttl: int | None = None) -> AgentXCache:
+    """Install a global LLM response cache. All providers benefit automatically."""
+    from langchain_core.globals import set_llm_cache
+    cache = AgentXCache(path, ttl=ttl)
+    set_llm_cache(cache)
+    return cache
+def disable_caching() -> None:
+    from langchain_core.globals import set_llm_cache
+    set_llm_cache(None)
+def cache_stats(path: str | Path = _DEFAULT_PATH) -> dict:
+    return AgentXCache(path).stats()
+def clear_cache(path: str | Path = _DEFAULT_PATH) -> None:
+    AgentXCache(path).clear()

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/cli.py RENAMED Viewed

@@ -68,6 +68,38 @@ def dashboard(
         raise typer.Exit(1) from exc
+cache_app = typer.Typer(help="Inspect/clear the local LLM response cache.", no_args_is_help=True)
+app.add_typer(cache_app, name="cache")
+@cache_app.command("stats")
+def cache_stats_cmd(
+    path: Path = typer.Option(".agentx/llm_cache.sqlite", "--path", help="Cache DB path."),
+) -> None:
+    """Show cache hit rate and estimated tokens/$ saved."""
+    from .cache import cache_stats
+    s = cache_stats(path)
+    table = Table(title="LLM response cache")
+    table.add_column("metric", style="cyan")
+    table.add_column("value")
+    for k in ("entries", "hits", "misses", "hit_rate", "tokens_saved", "est_usd_saved"):
+        table.add_row(k, str(s[k]))
+    console.print(table)
+    console.print(f"[dim]{s['path']}[/]")
+@cache_app.command("clear")
+def cache_clear_cmd(
+    path: Path = typer.Option(".agentx/llm_cache.sqlite", "--path", help="Cache DB path."),
+) -> None:
+    """Clear all cached responses and reset stats."""
+    from .cache import clear_cache
+    clear_cache(path)
+    console.print("[green]✓[/] Cache cleared.")
 @app.command()
 def mcp(
     print_config: bool = typer.Option(False, "--print-config", help="Print MCP client config for Claude/Codex/Copilot and exit."),

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/connector/build.py RENAMED Viewed

@@ -10,7 +10,7 @@ from pathlib import Path
 from ..scaffold import AgentSpec, ProjectSpec, generate_project
 from .recommend import recommend_spec
-_ALL_FEATURES = ["rag", "memory", "mcp", "skills", "observability", "guardrails", "serve", "docker", "ci", "evals"]
+_ALL_FEATURES = ["rag", "memory", "mcp", "skills", "observability", "guardrails", "serve", "docker", "ci", "evals", "cache"]
 _KEY_FILES_MAX = 6000
@@ -26,6 +26,7 @@ def _apply_features(spec: ProjectSpec, features: list[str]) -> None:
     spec.docker = "docker" in fl
     spec.ci = "ci" in fl
     spec.evals = "evals" in fl
+    spec.use_cache = "cache" in fl
 def build_project_from_statement(

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/connector/recommend.py RENAMED Viewed

@@ -65,6 +65,8 @@ def recommend_spec(problem_statement: str) -> dict:
     production = _has(text, "production", "enterprise", "scalable", "observability",
                       "monitor", "trace", "secure", "reliable", "deploy", "high traffic")
     coding = _has(text, "coding", "write code", "code generation", "programming task")
+    cache = _has(text, "cache", "cost", "cheap", "latency", "high traffic", "high-traffic",
+                 "fast response", "reduce cost", "save money", "repeated")
     features: list[str] = []
     if rag:
@@ -77,6 +79,8 @@ def recommend_spec(problem_statement: str) -> dict:
         features.append("skills")
     if serve or production:
         features.append("serve")
+    if cache or production:
+        features.append("cache")
     if production:
         features += ["observability", "guardrails", "docker", "ci", "evals"]
     # de-dupe, stable order

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/dashboard/app.py RENAMED Viewed

@@ -218,6 +218,21 @@ def _trends_panel():
     c2.metric("Total tokens", f"{agg['total_tokens']:,}")
     c3.metric("Total cost", f"${agg['total_cost_usd']:.4f}")
     c4.metric("Avg latency", f"{agg['avg_latency_ms']} ms")
+    # Response-cache savings (if caching has been used in this project).
+    cache_path = _PROJECT / ".agentx" / "llm_cache.sqlite"
+    if cache_path.exists():
+        try:
+            from agentx.cache import cache_stats
+            cs = cache_stats(cache_path)
+            st.markdown("###### 💾 Response cache")
+            d1, d2, d3 = st.columns(3)
+            d1.metric("Hit rate", f"{cs['hit_rate']:.0%}", help=f"{cs['hits']} hits / {cs['misses']} misses")
+            d2.metric("Tokens saved", f"{cs['tokens_saved']:,}")
+            d3.metric("Est. $ saved", f"${cs['est_usd_saved']:.4f}")
+        except Exception:  # noqa: BLE001
+            pass
     rows = [r for r in log.events() if r.get("kind") == "run"]
     if not rows:
         st.info("No runs logged yet — use **Test run** to populate trends.")

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/generator.py RENAMED Viewed

@@ -134,6 +134,7 @@ def _write_manifest(target: Path, spec: ProjectSpec) -> Path:
             "docker": spec.docker,
             "ci": spec.ci,
             "evals": spec.evals,
+            "cache": spec.use_cache,
         },
         "extras": _extras(spec),
         "telemetry_opt_out": False,

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/spec.py RENAMED Viewed

@@ -52,6 +52,7 @@ class ProjectSpec(BaseModel):
     docker: bool = False          # Dockerfile + docker-compose.yml
     ci: bool = False              # GitHub Actions (lint + test [+ eval])
     evals: bool = False           # LLM-as-judge eval harness (+ CI gate)
+    use_cache: bool = False       # global LLM response cache (cost/latency saver)
     create_venv: bool = True
     run_sync: bool = False
     # When set, generated pyproject depends on agentx from this local path
@@ -61,7 +62,7 @@ class ProjectSpec(BaseModel):
     def enable_enterprise(self) -> "ProjectSpec":
         """Turn on the full enterprise feature set in one call."""
         self.observability = self.guardrails = self.serve = True
-        self.docker = self.ci = self.evals = True
+        self.docker = self.ci = self.evals = self.use_cache = True
         return self
     @property

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/main.py.j2 RENAMED Viewed

@@ -8,6 +8,8 @@ from dotenv import load_dotenv
 {% endif %}
 {% if spec.guardrails %}from .guardrails import guard_input, guard_output
 {% endif %}
+{% if spec.use_cache %}from agentx.cache import enable_caching
+{% endif %}
 {% if spec.framework == 'langgraph' %}
 from agentx.frameworks import run_agent
 from .agents import build_agents
@@ -17,6 +19,9 @@ def main() -> None:
     load_dotenv()
 {% if spec.observability %}
     init_observability()
+{% endif %}
+{% if spec.use_cache %}
+    enable_caching()   # cache LLM responses → lower cost & latency
 {% endif %}
     agents = build_agents()
     agent_name, agent = next(iter(agents.items()))
@@ -52,6 +57,9 @@ def main() -> None:
     load_dotenv()
 {% if spec.observability %}
     init_observability()
+{% endif %}
+{% if spec.use_cache %}
+    enable_caching()   # cache LLM responses → lower cost & latency
 {% endif %}
     print("🧬 {{ spec.slug }} (CrewAI). Type 'quit' to exit.\n")
     while True:

agentx_kit-0.5.0/tests/test_cache.py ADDED Viewed

@@ -0,0 +1,60 @@
+"""Tests for the LLM response cache (BaseCache-backed, SQLite). No live LLM."""
+import time
+from agentx.cache import AgentXCache, cache_stats, clear_cache, enable_caching
+def _gens(text: str):
+    from langchain_core.outputs import Generation
+    return [Generation(text=text)]
+def test_cache_lookup_miss_then_hit(tmp_path):
+    cache = AgentXCache(tmp_path / "c.sqlite")
+    assert cache.lookup("hello", "llm::gpt-4o-mini") is None  # miss
+    cache.update("hello", "llm::gpt-4o-mini", _gens("hi there"))
+    got = cache.lookup("hello", "llm::gpt-4o-mini")
+    assert got is not None
+    assert got[0].text == "hi there"
+def test_cache_stats_track_hits_and_savings(tmp_path):
+    cache = AgentXCache(tmp_path / "c.sqlite")
+    cache.update("q", "llm::gpt-4o-mini", _gens("a fairly long cached answer " * 5))
+    cache.lookup("q", "llm::gpt-4o-mini")  # hit
+    cache.lookup("nope", "llm::gpt-4o-mini")  # miss
+    s = cache.stats()
+    assert s["hits"] == 1 and s["misses"] == 1
+    assert s["hit_rate"] == 0.5
+    assert s["tokens_saved"] > 0
+    assert s["est_usd_saved"] >= 0.0
+    assert s["entries"] == 1
+def test_cache_ttl_expiry(tmp_path):
+    cache = AgentXCache(tmp_path / "c.sqlite", ttl=1)
+    cache.update("k", "llm::x", _gens("v"))
+    assert cache.lookup("k", "llm::x") is not None
+    time.sleep(1.2)
+    assert cache.lookup("k", "llm::x") is None  # expired
+def test_clear_cache(tmp_path):
+    p = tmp_path / "c.sqlite"
+    cache = AgentXCache(p)
+    cache.update("k", "llm::x", _gens("v"))
+    clear_cache(p)
+    assert cache_stats(p)["entries"] == 0
+    assert cache.lookup("k", "llm::x") is None
+def test_enable_caching_installs_global(tmp_path):
+    from langchain_core.globals import get_llm_cache
+    from agentx.cache import disable_caching
+    cache = enable_caching(tmp_path / "c.sqlite")
+    assert get_llm_cache() is cache
+    disable_caching()
+    assert get_llm_cache() is None

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/.github/workflows/publish.yml RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/.gitignore RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/DESIGN.md RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/LICENSE RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/RESEARCH.md RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/config.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/connector/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/connector/server.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/dashboard/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/frameworks/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/frameworks/crewai_agent.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/frameworks/langchain_agent.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/guardrails.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/insights/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/insights/analyze.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/insights/log.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/insights/optimize.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/insights/tokens.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/memory/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/memory/store.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/observability.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/prompts/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/prompts/templates.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/providers/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/providers/base.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/providers/factory.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/providers/registry.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/rag/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/rag/pipeline.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/reliability.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/prompts_store.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/Dockerfile.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/README.md.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/ci.yml.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/docker-compose.yml.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/dockerignore.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/env.example.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/evals/dataset.json.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/evals/run_evals.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/gitignore.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/mcp_servers.json.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/__init__.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/agents.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/config.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/guardrails.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/memory.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/observability.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/prompts.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/rag.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/server.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pkg/tools.py.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/pyproject.toml.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/templates/skills_seed.json.j2 RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/scaffold/wizard.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/skills/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/skills/registry.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/structured.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/tools/__init__.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/tools/builtin.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/src/agentx/tools/mcp.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_connector.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_enterprise.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_insights.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_prompts.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_providers.py RENAMED Viewed

File without changes

{agentx_kit-0.4.0 → agentx_kit-0.5.0}/tests/test_scaffold.py RENAMED Viewed

File without changes

agentx-kit 0.4.0__tar.gz → 0.5.0__tar.gz

agentx-kit 0.4.0tar.gz → 0.5.0tar.gz