PyPI - fred-runtime - Versions diffs - 2.0.4__tar.gz → 2.0.7__tar.gz - Mend

fred-runtime 2.0.4tar.gz → 2.0.7tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (106) hide show

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: fred-runtime
-Version: 2.0.4
+Version: 2.0.7
 Summary: Runtime adapters and infrastructure wiring for Fred v2 agents.
 Author-email: Thales <noreply@thalesgroup.com>
 License: Apache-2.0
@@ -58,8 +58,9 @@ fred-runtime       Platform adapters + pod factory (this package)
 ```
 **Rule of thumb:**
-- Write agent logic in `fred-sdk`.
-- Write infrastructure adapters (DB, MCP server, Keycloak, object store) in `fred-runtime`.
+- Write agent logic in `fred-sdk`.
+- Write infrastructure adapters (DB, MCP server, Keycloak, object store) in `fred-runtime`.
 - `fred-sdk` must stay importable on a bare laptop with no services running.
 ---
@@ -79,15 +80,15 @@ app    = create_agent_app(registry=REGISTRY, config=config)
 `create_agent_app` returns a FastAPI application that exposes:
-| Method | Path | Description |
-|--------|------|-------------|
-| `POST` | `{base_url}/agents/execute` | Single-turn execution — returns final JSON |
-| `POST` | `{base_url}/agents/execute/stream` | Streaming SSE execution — yields `RuntimeEvent` objects |
-| `GET`  | `{base_url}/agents` | List registered agent IDs |
-| `GET`  | `{base_url}/agents/sessions` | List session IDs for a user |
-| `GET`  | `{base_url}/agents/sessions/{id}/messages` | Full conversation history for a session |
-| `GET`  | `/v1/models` | OpenAI model list (agent IDs as model names) |
-| `POST` | `/v1/chat/completions` | OpenAI chat completions — works with Open WebUI, openai-python SDK, etc. |
+| Method | Path                                       | Description                                                              |
+| ------ | ------------------------------------------ | ------------------------------------------------------------------------ |
+| `POST` | `{base_url}/agents/execute`                | Single-turn execution — returns final JSON                               |
+| `POST` | `{base_url}/agents/execute/stream`         | Streaming SSE execution — yields `RuntimeEvent` objects                  |
+| `GET`  | `{base_url}/agents`                        | List registered agent IDs                                                |
+| `GET`  | `{base_url}/agents/sessions`               | List session IDs for a user                                              |
+| `GET`  | `{base_url}/agents/sessions/{id}/messages` | Full conversation history for a session                                  |
+| `GET`  | `/v1/models`                               | OpenAI model list (agent IDs as model names)                             |
+| `POST` | `/v1/chat/completions`                     | OpenAI chat completions — works with Open WebUI, openai-python SDK, etc. |
 The OpenAI-compatible `/v1` surface is **enabled by default**.
 Set `app.openai_compat: false` in `configuration.yaml` to disable it for internal pods.
@@ -99,11 +100,11 @@ the SQL checkpointer. The session ID is the LangGraph `thread_id`.
 ### `fred_runtime.runtime_support` — Infrastructure adapters
-| Module | What it provides |
-|--------|-----------------|
-| `sql_checkpointer` | Durable LangGraph checkpointer backed by SQLite (dev) or PostgreSQL (prod) |
-| `user_token_refresher` | Transparent Keycloak token refresh for long-lived agent sessions |
-| `request_context_helpers` | FastAPI dependency helpers for extracting user/session context |
+| Module                    | What it provides                                                           |
+| ------------------------- | -------------------------------------------------------------------------- |
+| `sql_checkpointer`        | Durable LangGraph checkpointer backed by SQLite (dev) or PostgreSQL (prod) |
+| `user_token_refresher`    | Transparent Keycloak token refresh for long-lived agent sessions           |
+| `request_context_helpers` | FastAPI dependency helpers for extracting user/session context             |
 ---
@@ -119,16 +120,16 @@ Providers: OpenAI, Azure OpenAI, Mistral, Ollama, and any LangChain-compatible b
 HTTP clients that connect agent tools to the Fred platform services:
-| Client | Connects to |
-|--------|------------|
-| `kf_http_client` | Knowledge Flow REST API (generic) |
-| `kf_vectorsearch_client` | Vector search / retrieval |
-| `kf_markdown_media_client` | Document content (Markdown + media) |
-| `kf_workspace_client` | Workspace and library management |
-| `kf_logs_client` | Audit log retrieval |
-| `kf_fast_text_client` | FastText classification |
-| `mcp_runtime` / `mcp_toolkit` | MCP server lifecycle and tool injection |
-| `context_aware_tool` | Tool base class that propagates the runtime context (user, team, token) |
+| Client                        | Connects to                                                             |
+| ----------------------------- | ----------------------------------------------------------------------- |
+| `kf_http_client`              | Knowledge Flow REST API (generic)                                       |
+| `kf_vectorsearch_client`      | Vector search / retrieval                                               |
+| `kf_markdown_media_client`    | Document content (Markdown + media)                                     |
+| `kf_workspace_client`         | Workspace and library management                                        |
+| `kf_logs_client`              | Audit log retrieval                                                     |
+| `kf_fast_text_client`         | FastText classification                                                 |
+| `mcp_runtime` / `mcp_toolkit` | MCP server lifecycle and tool injection                                 |
+| `context_aware_tool`          | Tool base class that propagates the runtime context (user, team, token) |
 ---
@@ -176,9 +177,9 @@ or overridden with `--base-url` / `FRED_AGENT_POD_URL`.
 Every Fred pod uses the same two-file convention:
-| File | Purpose |
-|------|---------|
-| `.env` (path from `ENV_FILE`) | Secrets: API keys, DB URLs, Keycloak credentials |
+| File                                           | Purpose                                                            |
+| ---------------------------------------------- | ------------------------------------------------------------------ |
+| `.env` (path from `ENV_FILE`)                  | Secrets: API keys, DB URLs, Keycloak credentials                   |
 | `configuration.yaml` (path from `CONFIG_FILE`) | App settings: port, base URL, LLM routing, observability, security |
 Minimal `configuration.yaml` for a local pod:
@@ -190,6 +191,7 @@ app:
   host: "0.0.0.0"
   port: 8010
   log_level: "info"
+  limit_concurrency: 200
   metrics_address: "127.0.0.1"
   metrics_port: 9115
   kpi_process_metrics_interval_sec: 10
@@ -208,6 +210,10 @@ When `observability.metrics: prometheus` is enabled, `create_agent_app(...)`
 starts a dedicated Prometheus exporter on `app.metrics_address:app.metrics_port`
 and restores the shared Fred KPI pipeline, including process and SQL pool KPIs.
+Set `app.limit_concurrency: null` to disable Uvicorn connection limiting, or a
+positive integer to reject excess concurrent HTTP and WebSocket connections
+with `503` before application code runs.
 ---
 ## Installation
@@ -229,6 +235,7 @@ Requires Python 3.12.
 A minimal pod is three files:
 **`main.py`**
 ```python
 from fred_runtime.app import create_agent_app, load_agent_pod_config
 from myapp.registry import REGISTRY
@@ -238,19 +245,27 @@ app = create_agent_app(registry=REGISTRY, config=config)
 ```
 **`__main__.py`**
 ```python
 import uvicorn
 from fred_runtime.app import load_agent_pod_config
 def main():
     config = load_agent_pod_config()
-    uvicorn.run("myapp.main:app", host=config.app.host, port=config.app.port, reload=True)
+    uvicorn.run(
+        "myapp.main:app",
+        host=config.app.host,
+        port=config.app.port,
+        limit_concurrency=config.app.limit_concurrency,
+        reload=True,
+    )
 if __name__ == "__main__":
     main()
 ```
 **`registry.py`**
 ```python
 from fred_sdk.contracts.models import ReActAgentDefinition
@@ -267,11 +282,11 @@ See [fred-samples](https://github.com/ThalesGroup/fred) for a working reference
 ## Related packages
-| Package | PyPI | Role |
-|---------|------|------|
-| `fred-core` | [pypi](https://pypi.org/project/fred-core/) | Pure utilities — logging, model factories, embeddings, portable observability |
-| `fred-sdk` | [pypi](https://pypi.org/project/fred-sdk/) | Agent authoring — ReAct, Graph, tool contracts |
-| `fred-runtime` | [pypi](https://pypi.org/project/fred-runtime/) | This package |
+| Package        | PyPI                                           | Role                                                                          |
+| -------------- | ---------------------------------------------- | ----------------------------------------------------------------------------- |
+| `fred-core`    | [pypi](https://pypi.org/project/fred-core/)    | Pure utilities — logging, model factories, embeddings, portable observability |
+| `fred-sdk`     | [pypi](https://pypi.org/project/fred-sdk/)     | Agent authoring — ReAct, Graph, tool contracts                                |
+| `fred-runtime` | [pypi](https://pypi.org/project/fred-runtime/) | This package                                                                  |
 ---

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/README.md RENAMED Viewed

@@ -23,8 +23,9 @@ fred-runtime       Platform adapters + pod factory (this package)
 ```
 **Rule of thumb:**
-- Write agent logic in `fred-sdk`.
-- Write infrastructure adapters (DB, MCP server, Keycloak, object store) in `fred-runtime`.
+- Write agent logic in `fred-sdk`.
+- Write infrastructure adapters (DB, MCP server, Keycloak, object store) in `fred-runtime`.
 - `fred-sdk` must stay importable on a bare laptop with no services running.
 ---
@@ -44,15 +45,15 @@ app    = create_agent_app(registry=REGISTRY, config=config)
 `create_agent_app` returns a FastAPI application that exposes:
-| Method | Path | Description |
-|--------|------|-------------|
-| `POST` | `{base_url}/agents/execute` | Single-turn execution — returns final JSON |
-| `POST` | `{base_url}/agents/execute/stream` | Streaming SSE execution — yields `RuntimeEvent` objects |
-| `GET`  | `{base_url}/agents` | List registered agent IDs |
-| `GET`  | `{base_url}/agents/sessions` | List session IDs for a user |
-| `GET`  | `{base_url}/agents/sessions/{id}/messages` | Full conversation history for a session |
-| `GET`  | `/v1/models` | OpenAI model list (agent IDs as model names) |
-| `POST` | `/v1/chat/completions` | OpenAI chat completions — works with Open WebUI, openai-python SDK, etc. |
+| Method | Path                                       | Description                                                              |
+| ------ | ------------------------------------------ | ------------------------------------------------------------------------ |
+| `POST` | `{base_url}/agents/execute`                | Single-turn execution — returns final JSON                               |
+| `POST` | `{base_url}/agents/execute/stream`         | Streaming SSE execution — yields `RuntimeEvent` objects                  |
+| `GET`  | `{base_url}/agents`                        | List registered agent IDs                                                |
+| `GET`  | `{base_url}/agents/sessions`               | List session IDs for a user                                              |
+| `GET`  | `{base_url}/agents/sessions/{id}/messages` | Full conversation history for a session                                  |
+| `GET`  | `/v1/models`                               | OpenAI model list (agent IDs as model names)                             |
+| `POST` | `/v1/chat/completions`                     | OpenAI chat completions — works with Open WebUI, openai-python SDK, etc. |
 The OpenAI-compatible `/v1` surface is **enabled by default**.
 Set `app.openai_compat: false` in `configuration.yaml` to disable it for internal pods.
@@ -64,11 +65,11 @@ the SQL checkpointer. The session ID is the LangGraph `thread_id`.
 ### `fred_runtime.runtime_support` — Infrastructure adapters
-| Module | What it provides |
-|--------|-----------------|
-| `sql_checkpointer` | Durable LangGraph checkpointer backed by SQLite (dev) or PostgreSQL (prod) |
-| `user_token_refresher` | Transparent Keycloak token refresh for long-lived agent sessions |
-| `request_context_helpers` | FastAPI dependency helpers for extracting user/session context |
+| Module                    | What it provides                                                           |
+| ------------------------- | -------------------------------------------------------------------------- |
+| `sql_checkpointer`        | Durable LangGraph checkpointer backed by SQLite (dev) or PostgreSQL (prod) |
+| `user_token_refresher`    | Transparent Keycloak token refresh for long-lived agent sessions           |
+| `request_context_helpers` | FastAPI dependency helpers for extracting user/session context             |
 ---
@@ -84,16 +85,16 @@ Providers: OpenAI, Azure OpenAI, Mistral, Ollama, and any LangChain-compatible b
 HTTP clients that connect agent tools to the Fred platform services:
-| Client | Connects to |
-|--------|------------|
-| `kf_http_client` | Knowledge Flow REST API (generic) |
-| `kf_vectorsearch_client` | Vector search / retrieval |
-| `kf_markdown_media_client` | Document content (Markdown + media) |
-| `kf_workspace_client` | Workspace and library management |
-| `kf_logs_client` | Audit log retrieval |
-| `kf_fast_text_client` | FastText classification |
-| `mcp_runtime` / `mcp_toolkit` | MCP server lifecycle and tool injection |
-| `context_aware_tool` | Tool base class that propagates the runtime context (user, team, token) |
+| Client                        | Connects to                                                             |
+| ----------------------------- | ----------------------------------------------------------------------- |
+| `kf_http_client`              | Knowledge Flow REST API (generic)                                       |
+| `kf_vectorsearch_client`      | Vector search / retrieval                                               |
+| `kf_markdown_media_client`    | Document content (Markdown + media)                                     |
+| `kf_workspace_client`         | Workspace and library management                                        |
+| `kf_logs_client`              | Audit log retrieval                                                     |
+| `kf_fast_text_client`         | FastText classification                                                 |
+| `mcp_runtime` / `mcp_toolkit` | MCP server lifecycle and tool injection                                 |
+| `context_aware_tool`          | Tool base class that propagates the runtime context (user, team, token) |
 ---
@@ -141,9 +142,9 @@ or overridden with `--base-url` / `FRED_AGENT_POD_URL`.
 Every Fred pod uses the same two-file convention:
-| File | Purpose |
-|------|---------|
-| `.env` (path from `ENV_FILE`) | Secrets: API keys, DB URLs, Keycloak credentials |
+| File                                           | Purpose                                                            |
+| ---------------------------------------------- | ------------------------------------------------------------------ |
+| `.env` (path from `ENV_FILE`)                  | Secrets: API keys, DB URLs, Keycloak credentials                   |
 | `configuration.yaml` (path from `CONFIG_FILE`) | App settings: port, base URL, LLM routing, observability, security |
 Minimal `configuration.yaml` for a local pod:
@@ -155,6 +156,7 @@ app:
   host: "0.0.0.0"
   port: 8010
   log_level: "info"
+  limit_concurrency: 200
   metrics_address: "127.0.0.1"
   metrics_port: 9115
   kpi_process_metrics_interval_sec: 10
@@ -173,6 +175,10 @@ When `observability.metrics: prometheus` is enabled, `create_agent_app(...)`
 starts a dedicated Prometheus exporter on `app.metrics_address:app.metrics_port`
 and restores the shared Fred KPI pipeline, including process and SQL pool KPIs.
+Set `app.limit_concurrency: null` to disable Uvicorn connection limiting, or a
+positive integer to reject excess concurrent HTTP and WebSocket connections
+with `503` before application code runs.
 ---
 ## Installation
@@ -194,6 +200,7 @@ Requires Python 3.12.
 A minimal pod is three files:
 **`main.py`**
 ```python
 from fred_runtime.app import create_agent_app, load_agent_pod_config
 from myapp.registry import REGISTRY
@@ -203,19 +210,27 @@ app = create_agent_app(registry=REGISTRY, config=config)
 ```
 **`__main__.py`**
 ```python
 import uvicorn
 from fred_runtime.app import load_agent_pod_config
 def main():
     config = load_agent_pod_config()
-    uvicorn.run("myapp.main:app", host=config.app.host, port=config.app.port, reload=True)
+    uvicorn.run(
+        "myapp.main:app",
+        host=config.app.host,
+        port=config.app.port,
+        limit_concurrency=config.app.limit_concurrency,
+        reload=True,
+    )
 if __name__ == "__main__":
     main()
 ```
 **`registry.py`**
 ```python
 from fred_sdk.contracts.models import ReActAgentDefinition
@@ -232,11 +247,11 @@ See [fred-samples](https://github.com/ThalesGroup/fred) for a working reference
 ## Related packages
-| Package | PyPI | Role |
-|---------|------|------|
-| `fred-core` | [pypi](https://pypi.org/project/fred-core/) | Pure utilities — logging, model factories, embeddings, portable observability |
-| `fred-sdk` | [pypi](https://pypi.org/project/fred-sdk/) | Agent authoring — ReAct, Graph, tool contracts |
-| `fred-runtime` | [pypi](https://pypi.org/project/fred-runtime/) | This package |
+| Package        | PyPI                                           | Role                                                                          |
+| -------------- | ---------------------------------------------- | ----------------------------------------------------------------------------- |
+| `fred-core`    | [pypi](https://pypi.org/project/fred-core/)    | Pure utilities — logging, model factories, embeddings, portable observability |
+| `fred-sdk`     | [pypi](https://pypi.org/project/fred-sdk/)     | Agent authoring — ReAct, Graph, tool contracts                                |
+| `fred-runtime` | [pypi](https://pypi.org/project/fred-runtime/) | This package                                                                  |
 ---

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/fred_runtime/app/_catalogs.py RENAMED Viewed

@@ -39,7 +39,7 @@ from typing import Any, Literal
 import yaml
 from fred_sdk.contracts.models import MCPServerConfiguration
-from pydantic import BaseModel, ConfigDict, Field
+from pydantic import BaseModel, ConfigDict, Field, model_validator
 from .config import AgentPodConfig
@@ -114,6 +114,36 @@ class _McpCatalog(_CatalogFile):
     version: Literal["v1"] = "v1"
     servers: list[MCPServerConfiguration] = Field(default_factory=list)
+    @model_validator(mode="after")
+    def _reject_duplicate_server_ids(self) -> "_McpCatalog":
+        """
+        Reject duplicate MCP server ids in one catalog.
+        Why this exists:
+        - the managed-agent contract now stores per-server config keyed by MCP
+          server id, so duplicates would make selection and config resolution
+          ambiguous and unsafe
+        How to use it:
+        - triggered automatically during `_McpCatalog.model_validate(...)`
+        Example:
+        - `load_mcp_catalog("./config/mcp_catalog.yaml")`
+        """
+        seen: set[str] = set()
+        duplicates: list[str] = []
+        for server in self.servers:
+            if server.id in seen and server.id not in duplicates:
+                duplicates.append(server.id)
+            seen.add(server.id)
+        if duplicates:
+            duplicates_text = ", ".join(repr(server_id) for server_id in duplicates)
+            raise ValueError(
+                f"Duplicate MCP server id(s) in catalog: {duplicates_text}"
+            )
+        return self
 def _load_yaml_mapping(path: Path) -> dict[str, Any]:
     """

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/fred_runtime/app/agent_app.py RENAMED Viewed

@@ -849,7 +849,9 @@ class _ResolvedExecutionTarget:
 def _apply_runtime_tuning(
-    definition: ReActAgentDefinition | GraphAgentDefinition, tuning: AgentTuning
+    definition: ReActAgentDefinition | GraphAgentDefinition,
+    tuning: AgentTuning,
+    available_mcp_servers: list[MCPServerConfiguration],
 ) -> ReActAgentDefinition | GraphAgentDefinition:
     """
     Overlay persisted business tuning onto one registered agent template.
@@ -863,11 +865,11 @@ def _apply_runtime_tuning(
     - call after resolving an `agent_instance_id` from control-plane
     Example:
-    - `definition = _apply_runtime_tuning(template_definition, resolution.tuning)`
+    - `definition = _apply_runtime_tuning(template_definition, resolution.tuning, catalog)`
     """
     mcp_servers = tuning.mcp_servers
-    if tuning.selected_mcp_server_ids:
+    if tuning.selected_mcp_server_ids is not None:
         selected = frozenset(tuning.selected_mcp_server_ids)
         mcp_servers = [s for s in mcp_servers if s.id in selected]
@@ -886,9 +888,25 @@ def _apply_runtime_tuning(
     }
     if isinstance(definition, ReActAgentDefinition):
         # Also overlay system_prompt_template directly for ReAct runtime compatibility.
+        base_system_prompt = str(getattr(definition, "system_prompt_template", ""))
+        effective_system_prompt = base_system_prompt
         system_prompt = tuning.values.get("prompts.system")
         if isinstance(system_prompt, str) and system_prompt.strip():
-            update["system_prompt_template"] = system_prompt
+            effective_system_prompt = system_prompt
+        available_by_id = {server.id: server for server in available_mcp_servers}
+        fragments = [
+            catalog_entry.agent_instructions.strip()
+            for server_ref in mcp_servers
+            if (catalog_entry := available_by_id.get(server_ref.id)) is not None
+            and isinstance(catalog_entry.agent_instructions, str)
+            and catalog_entry.agent_instructions.strip()
+        ]
+        if fragments:
+            effective_system_prompt = f"{effective_system_prompt}\n\n" + "\n\n".join(
+                fragments
+            )
+        if effective_system_prompt != base_system_prompt:
+            update["system_prompt_template"] = effective_system_prompt
     return definition.model_copy(update=update)
@@ -952,6 +970,7 @@ async def _resolve_agent_instance(
                 f"Known agents: {list(registry.keys())}",
             )
         if request.inline_tuning:
+            available_mcp_servers = _available_mcp_servers_for_definition(definition)
             definition = _apply_runtime_tuning(
                 definition,
                 AgentTuning(
@@ -962,6 +981,7 @@ async def _resolve_agent_instance(
                     mcp_servers=list(definition.default_mcp_servers),
                     values=request.inline_tuning,
                 ),
+                available_mcp_servers,
             )
         return _ResolvedExecutionTarget(
             definition=definition,
@@ -1007,8 +1027,11 @@ async def _resolve_agent_instance(
                 f"Resolved template_agent_id '{resolution.template_agent_id}' is not registered in this pod."
             ),
         )
+    available_mcp_servers = _available_mcp_servers_for_definition(definition)
     return _ResolvedExecutionTarget(
-        definition=_apply_runtime_tuning(definition, resolution.tuning),
+        definition=_apply_runtime_tuning(
+            definition, resolution.tuning, available_mcp_servers
+        ),
         effective_agent_id=resolution.agent_instance_id,
         team_id=resolution.owner_team_id,
     )
@@ -1473,6 +1496,7 @@ def _build_eval_trace(
     agent_id: str,
     session_id: str,
     turn_start: float,
+    agent_tags: tuple[str, ...] = (),
 ) -> EvalTrace:
     outcome = _parse_turn_outcome(payloads, turn_start)
     steps: list[EvalStep] = []
@@ -1531,6 +1555,7 @@ def _build_eval_trace(
     return EvalTrace(
         session_id=session_id,
         agent_id=agent_id,
+        agent_tags=agent_tags,
         input=input_text,
         output=outcome.final_content,
         error=error,
@@ -1825,18 +1850,6 @@ async def _iterate_runtime_event_payloads(
         registry=registry,
         access_token=access_token,
     )
-    if isinstance(definition, GraphAgentDefinition):
-        runtime: ReActRuntime | GraphRuntime = GraphRuntime(
-            definition=definition,
-            services=services,
-        )
-    else:
-        runtime = ReActRuntime(
-            definition=definition,
-            services=services,
-        )
-    runtime.bind(binding)
     # session_id drives LangGraph checkpointing: the agent resumes its graph
     # state on every turn. Falls back to request_id for one-shot calls so
     # LangGraph's checkpointer invariant (thread_id required internally) is met.
@@ -1847,10 +1860,16 @@ async def _iterate_runtime_event_payloads(
         invocation_turns=getattr(request, "invocation_turns", ()),
     )
+    runtime: ReActRuntime | GraphRuntime | None = None
     try:
-        await runtime.activate()
-        executor = await runtime.get_executor()
         if isinstance(definition, GraphAgentDefinition):
+            runtime = GraphRuntime(
+                definition=definition,
+                services=services,
+            )
+            runtime.bind(binding)
+            await runtime.activate()
+            executor = await runtime.get_executor()
             # Graph agents receive their typed input schema; the agent's
             # build_turn_state() maps it to graph state before the first node runs.
             # The standard contract is a single "message" field in the input schema.
@@ -1863,12 +1882,25 @@ async def _iterate_runtime_event_payloads(
                 graph_input = input_cls.model_validate(
                     {"message": request.message or ""}
                 )
-            executor_input: ReActInput | object = graph_input
+            async for event in executor.stream(graph_input, execution_config):
+                payload = event.model_dump(mode="json")
+                if not isinstance(payload, dict):
+                    raise RuntimeError(
+                        "RuntimeEvent payload must serialize to a JSON object."
+                    )
+                yield payload
         else:
+            runtime = ReActRuntime(
+                definition=definition,
+                services=services,
+            )
+            runtime.bind(binding)
+            await runtime.activate()
+            executor = await runtime.get_executor()
             # On HITL resume, messages are ignored by the codec — the graph
             # resumes from its checkpointed interrupt via Command(resume=...).
             # On a normal turn, the user message is the only input.
-            executor_input = ReActInput(
+            react_input = ReActInput(
                 messages=(
                     ()
                     if request.resume_payload is not None
@@ -1879,20 +1911,21 @@ async def _iterate_runtime_event_payloads(
                     )
                 ),
             )
-        async for event in executor.stream(executor_input, execution_config):
-            payload = event.model_dump(mode="json")
-            if not isinstance(payload, dict):
-                raise RuntimeError(
-                    "RuntimeEvent payload must serialize to a JSON object."
-                )
-            yield payload
+            async for event in executor.stream(react_input, execution_config):
+                payload = event.model_dump(mode="json")
+                if not isinstance(payload, dict):
+                    raise RuntimeError(
+                        "RuntimeEvent payload must serialize to a JSON object."
+                    )
+                yield payload
     except Exception as exc:
         logger.exception(
             "[fred-runtime] agent execution error agent_id=%s", definition.agent_id
         )
         yield RuntimeErrorEvent(message=str(exc)).model_dump(mode="json")
     finally:
-        await runtime.dispose()
+        if runtime is not None:
+            await runtime.dispose()
 def _terminal_execute_payload(
@@ -2632,6 +2665,7 @@ def _build_agent_router(
             payloads=payloads,
             input_text=request.input or "",
             agent_id=target.definition.agent_id,
+            agent_tags=target.definition.tags,
             session_id=eval_session_id,
             turn_start=turn_start,
         )

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/fred_runtime/app/config.py RENAMED Viewed

@@ -110,6 +110,15 @@ class PodAppConfig(BaseModel):
     host: str = "127.0.0.1"
     port: int = 8000
     log_level: str = "info"
+    limit_concurrency: int | None = Field(
+        default=None,
+        ge=1,
+        description=(
+            "Optional maximum number of concurrent HTTP or WebSocket "
+            "connections accepted by Uvicorn. Leave unset to disable the "
+            "limit."
+        ),
+    )
     gcu_version: str | None = None
     metrics_address: str = "127.0.0.1"
     metrics_port: int = 9000

{fred_runtime-2.0.4 → fred_runtime-2.0.7}/fred_runtime/cli/completion.py RENAMED Viewed

@@ -36,7 +36,7 @@ _COMMANDS: tuple[str, ...] = (
     "/whoami",
 )
-# Scenario keywords for fred.test.assistant — used for /run tab-completion.
+# Scenario keywords for fred.github.test_assistant — used for /run tab-completion.
 _TEST_ASSISTANT_SCENARIOS: tuple[str, ...] = (
     "echo",
     "error",

fred-runtime 2.0.4__tar.gz → 2.0.7__tar.gz

fred-runtime 2.0.4tar.gz → 2.0.7tar.gz