rlm-code: 0.1.5 (tar.gz) → 0.1.7 (tar.gz)

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (309)

rlm_code-0.1.7/CHANGELOG.md ADDED

@@ -0,0 +1,72 @@
+# Changelog
+All notable changes to this project are documented in this file.
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [0.1.7] - 2026-04-30
+### Added
+- HALO-style `trace_analysis` RLM environment for diagnosing agent harness failures from one-span-per-line JSONL traces.
+- Trace sidecar indexing with dataset rollups for trace counts, span counts, error traces, services, models, agents, token totals, and sample trace ids.
+- Bounded trace inspection actions: `get_dataset_overview`, `query_traces`, `count_traces`, `view_trace`, `search_trace`, and `view_spans`.
+- Large-trace safeguards: per-attribute truncation, oversized trace summaries, and higher-cap selected-span reads.
+- Tests for trace indexing, querying, searching, selected-span viewing, and trace environment actions.
+- Trace analysis documentation under the Core Engine docs.
+### Changed
+- `/rlm` command help now advertises `env=trace_analysis` for run, chat, and doctor workflows.
+## [0.1.6] - 2026-02-20
+### Added
+- Harness strategy selector with `tool_call` (default) and opt-in `codemode`.
+- CodeMode execution flow in harness: MCP tool discovery (`search_tools`), typed tool surface prompt, single-program generation, guardrail validation, and MCP chain execution (`call_tool_chain`).
+- Benchmark support for harness strategy comparison with CodeMode telemetry fields (`harness_strategy`, `codemode_chain_calls`, `codemode_search_calls`, `codemode_discovery_calls`, `codemode_guardrail_blocked`).
+- New top-level CodeMode docs section with dedicated pages for quickstart, architecture, guardrails, and evaluation.
+- Release documentation set for CodeMode:
+  - quickstart and operator workflow
+  - integration architecture and runtime controls
+  - provider/bridge separation model (Cloudflare-based, UTCP, custom)
+  - CodeMode sandbox responsibility and deployment matrix
+  - guardrail policy and safety runbook
+  - benchmark evaluation and promotion-gate criteria
+### Changed
+- `/harness run` supports `strategy=tool_call|codemode` and `mcp_server=<name>`.
+- `/rlm bench` in `mode=harness` supports `strategy=tool_call|codemode`.
+- Harness and benchmark command handling now auto-enables MCP when `strategy=codemode` is selected.
+### Security
+- Added explicit CodeMode guardrail policy documentation with blocked API classes and runtime limit defaults.
+- Codemode path remains opt-in; default harness behavior remains strict baseline `strategy=tool_call`.
+## [0.1.5] - 2026-02-15
+Initial public release of **RLM Code**.
+### Added
+- Unified Textual TUI with tabs for **RLM**, **Files**, **Details**, **Shell**, and **Research**.
+- Recursive execution engine with multiple patterns: **pure RLM**, **harness/code-agent**, and direct LLM flows.
+- Research workflows: run tracking, trajectory capture, replay, benchmark presets, compare/report flows.
+- Sandbox runtime layer (**Superbox**) with profile-driven runtime selection and fallback orchestration.
+- Secure runtime options including Docker and Monty, plus pluggable runtime adapters.
+- LLM integrations for cloud and local model routes, including BYOK workflows and ACP connectivity.
+- Coding harness with optional MCP tool integration for local/BYOK development workflows.
+- Framework adapter surface for RLM-style integrations (including DSPy-native and ADK-oriented paths).
+- Observability integrations (MLflow, LangFuse, Logfire, LangSmith, OpenTelemetry) via sink architecture.
+- Documentation site (MkDocs Material) with onboarding, CLI, TUI, sandbox, integrations, and benchmark guides.
+### Changed
+- Project identity standardized as **RLM Code** (legacy inherited naming removed from repository-facing surfaces).
+- Packaging and project metadata prepared for open-source release.
+- License updated to **Apache-2.0**.
+### Security
+- Safer sandbox-first runtime guidance in docs and configuration defaults.
+- Unsafe local `exec` usage preserved only as an explicit, opt-in path for advanced development scenarios.
+[0.1.5]: https://github.com/SuperagenticAI/rlm-code/releases/tag/v0.1.5
+[0.1.6]: https://github.com/SuperagenticAI/rlm-code/releases/tag/v0.1.6
+[0.1.7]: https://github.com/SuperagenticAI/rlm-code/releases/tag/v0.1.7
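
The 0.1.7 changelog entry describes a sidecar index that rolls up one-span-per-line JSONL traces into dataset-level counts. A minimal sketch of that idea follows, assuming hypothetical span fields (`trace_id`, `status`, `service`, `model`) and a hypothetical helper name `index_spans`; the package's actual span schema and indexing code are not visible in this part of the diff and may differ.

```python
import json
from collections import Counter


def index_spans(lines):
    """Roll up one-span-per-line JSONL into dataset-level counts.

    Field names (trace_id, status, service, model) are illustrative
    assumptions, not the package's actual span schema.
    """
    traces = {}
    services, models = Counter(), Counter()
    span_count = 0
    for raw in lines:
        raw = raw.strip()
        if not raw:
            continue  # tolerate blank lines between spans
        span = json.loads(raw)
        span_count += 1
        entry = traces.setdefault(span["trace_id"], {"spans": 0, "error": False})
        entry["spans"] += 1
        if span.get("status") == "error":
            entry["error"] = True  # one failing span marks the whole trace
        if span.get("service"):
            services[span["service"]] += 1
        if span.get("model"):
            models[span["model"]] += 1
    return {
        "trace_count": len(traces),
        "span_count": span_count,
        "error_traces": sum(1 for t in traces.values() if t["error"]),
        "services": dict(services),
        "models": dict(models),
        "sample_trace_ids": sorted(traces)[:3],
    }
```

Computing the rollup in a single streaming pass keeps memory proportional to the number of traces rather than the number of spans, which is one plausible reason to keep such an index beside the raw file.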

{rlm_code-0.1.5 → rlm_code-0.1.7}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: rlm-code
-Version: 0.1.5
+Version: 0.1.7
 Summary: RLM Code: Research Playground & Evaluation OS for Recursive Language Model Agentic Systems
 Project-URL: Homepage, https://github.com/SuperagenticAI/rlm-code
 Project-URL: Documentation, https://superagenticai.github.io/rlm-code/
@@ -99,20 +99,18 @@ Description-Content-Type: text/markdown
   </a>
 </p>
-<p align="center">
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Version" src="https://img.shields.io/pypi/v/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Python Versions" src="https://img.shields.io/pypi/pyversions/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Downloads" src="https://img.shields.io/pypi/dm/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Wheel" src="https://img.shields.io/pypi/wheel/rlm-code"></a>
-  <a href="LICENSE"><img alt="License" src="https://img.shields.io/pypi/l/rlm-code"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml"><img alt="CI" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml"><img alt="Pre-commit" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml"><img alt="Docs Deploy" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml"><img alt="Release" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/stargazers"><img alt="GitHub Stars" src="https://img.shields.io/github/stars/SuperagenticAI/rlm-code?style=social"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/issues"><img alt="GitHub Issues" src="https://img.shields.io/github/issues/SuperagenticAI/rlm-code"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/pulls"><img alt="GitHub Pull Requests" src="https://img.shields.io/github/issues-pr/SuperagenticAI/rlm-code"></a>
-</p>
+[![PyPI Version](https://img.shields.io/pypi/v/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![Python Versions](https://img.shields.io/pypi/pyversions/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![PyPI Wheel](https://img.shields.io/pypi/wheel/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![License](https://img.shields.io/pypi/l/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![CI](https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml)
+[![Pre-commit](https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml)
+[![Docs Deploy](https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml)
+[![Release](https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml)
+[![Docs](https://img.shields.io/badge/Docs-RLM%20Code-ff7a18.svg?logo=readthedocs&logoColor=white)](https://superagenticai.github.io/rlm-code/)
+[![GitHub Stars](https://img.shields.io/github/stars/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/stargazers)
+[![GitHub Issues](https://img.shields.io/github/issues/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/issues)
+[![GitHub Pull Requests](https://img.shields.io/github/issues-pr/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/pulls)
 **Run LLM-powered agents in a REPL loop, benchmark them, and compare results.**
@@ -120,6 +118,34 @@ RLM Code implements the [Recursive Language Models](https://arxiv.org/abs/2502.0
 RLM Code wraps this algorithm in an interactive terminal UI with built-in benchmarks, trajectory replay, and observability.
+## Release v0.1.7
+This release adds HALO-style trace analysis as a new RLM environment.
+- New `trace_analysis` environment for diagnosing agent harness failures from OTel-shaped JSONL traces
+- Sidecar trace indexing with dataset overview, query, count, search, full-trace view, and selected-span view actions
+- Bounded payload handling for large traces, including oversized summaries and higher-cap surgical span reads
+- `/rlm` help/docs updated for `env=trace_analysis`
+- Dedicated trace analysis docs under the Core Engine section
+Example:
+```text
+/rlm run "Find systemic harness failures trace=./traces.jsonl" env=trace_analysis steps=6
+```
+## Documentation
+<p align="center">
+  <a href="https://superagenticai.github.io/rlm-code/">
+    <img alt="Read the RLM Code Docs" src="https://img.shields.io/badge/Read%20the%20Docs-RLM%20Code-ff7a18?style=for-the-badge&logo=readthedocs&logoColor=white">
+  </a>
+</p>
+<p align="center">
+  <a href="https://superagenticai.github.io/rlm-code/"><strong>Open the full documentation</strong></a>
+</p>
 ## Install
 ```bash
@@ -261,6 +287,62 @@ Notes:
 - In Local/BYOK connection modes, likely coding prompts in chat can auto-route to harness.
 - In ACP mode, auto-routing is intentionally off; use `/harness run ...` explicitly.
+### 8. CodeMode with UTCP and Cloudflare MCP
+Use these server entries in your project `rlm_config.yaml`:
+```yaml
+mcp_servers:
+  utcp-codemode:
+    name: utcp-codemode
+    description: "Local CodeMode MCP bridge"
+    enabled: true
+    auto_connect: false
+    timeout_seconds: 30
+    retry_attempts: 3
+    transport:
+      type: stdio
+      command: npx
+      args:
+        - "@utcp/code-mode-mcp"
+  cloudflare-codemode:
+    name: cloudflare-codemode
+    description: "Cloudflare MCP via remote bridge"
+    enabled: true
+    auto_connect: false
+    timeout_seconds: 30
+    retry_attempts: 3
+    transport:
+      type: stdio
+      command: npx
+      args:
+        - "mcp-remote"
+        - "https://mcp.cloudflare.com/mcp"
+```
+UTCP path (native CodeMode in current release):
+```text
+/mcp-connect utcp-codemode
+/mcp-tools utcp-codemode
+/harness run "analyze this repo, find TODO/FIXME, and create report.json" steps=3 mcp=on strategy=codemode mcp_server=utcp-codemode
+```
+Cloudflare path (recommended strategy today):
+```text
+/mcp-connect cloudflare-codemode
+/mcp-tools cloudflare-codemode
+/harness run "list available tools and run one safe read-only action, then summarize in 3 bullets" steps=3 mcp=on strategy=tool_call mcp_server=cloudflare-codemode
+```
+Notes:
+- On first Cloudflare connect, `mcp-remote` may ask for interactive authentication.
+- In this release, `strategy=codemode` expects the `search_tools` + `call_tool_chain` bridge contract.
+- If a remote MCP server exposes a different tool contract, use `strategy=tool_call`.
 ## How the RLM Loop Works
 Traditional LLM usage: paste your document into the prompt, ask a question, hope the model doesn't lose details in the middle.
@@ -399,7 +481,7 @@ rlm_code/
   harness/          # Tool-using coding harness (/harness)
 ```
-## Documentation
+## Resources
 Full docs: https://superagenticai.github.io/rlm-code/
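
The CodeMode notes added in this file say that `strategy=codemode` expects the `search_tools` + `call_tool_chain` bridge contract, and that servers exposing a different contract should be used with `strategy=tool_call`. That rule can be sketched as a small check over the tool names a server reports. The helper name `pick_strategy` and the idea of passing in a list of tool names are assumptions for illustration; this is not a function from the package.

```python
# Tool names taken from the README notes; the helper itself is hypothetical.
CODEMODE_CONTRACT = frozenset({"search_tools", "call_tool_chain"})


def pick_strategy(exposed_tools):
    """Return "codemode" only when both bridge tools are exposed."""
    if CODEMODE_CONTRACT.issubset(exposed_tools):
        return "codemode"
    return "tool_call"  # safe default for any other tool contract
```

For example, a server that lists only general-purpose tools would fall back to `tool_call`, which matches the strategy the README recommends for the Cloudflare path.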

{rlm_code-0.1.5 → rlm_code-0.1.7}/README.md RENAMED

@@ -6,20 +6,18 @@
   </a>
 </p>
-<p align="center">
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Version" src="https://img.shields.io/pypi/v/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Python Versions" src="https://img.shields.io/pypi/pyversions/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Downloads" src="https://img.shields.io/pypi/dm/rlm-code"></a>
-  <a href="https://pypi.org/project/rlm-code/"><img alt="PyPI Wheel" src="https://img.shields.io/pypi/wheel/rlm-code"></a>
-  <a href="LICENSE"><img alt="License" src="https://img.shields.io/pypi/l/rlm-code"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml"><img alt="CI" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml"><img alt="Pre-commit" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml"><img alt="Docs Deploy" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml"><img alt="Release" src="https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml/badge.svg"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/stargazers"><img alt="GitHub Stars" src="https://img.shields.io/github/stars/SuperagenticAI/rlm-code?style=social"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/issues"><img alt="GitHub Issues" src="https://img.shields.io/github/issues/SuperagenticAI/rlm-code"></a>
-  <a href="https://github.com/SuperagenticAI/rlm-code/pulls"><img alt="GitHub Pull Requests" src="https://img.shields.io/github/issues-pr/SuperagenticAI/rlm-code"></a>
-</p>
+[![PyPI Version](https://img.shields.io/pypi/v/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![Python Versions](https://img.shields.io/pypi/pyversions/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![PyPI Wheel](https://img.shields.io/pypi/wheel/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![License](https://img.shields.io/pypi/l/rlm-code.svg)](https://pypi.org/project/rlm-code/)
+[![CI](https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/ci.yml)
+[![Pre-commit](https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/pre-commit.yml)
+[![Docs Deploy](https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/deploy-docs.yml)
+[![Release](https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml/badge.svg?branch=main)](https://github.com/SuperagenticAI/rlm-code/actions/workflows/release.yml)
+[![Docs](https://img.shields.io/badge/Docs-RLM%20Code-ff7a18.svg?logo=readthedocs&logoColor=white)](https://superagenticai.github.io/rlm-code/)
+[![GitHub Stars](https://img.shields.io/github/stars/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/stargazers)
+[![GitHub Issues](https://img.shields.io/github/issues/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/issues)
+[![GitHub Pull Requests](https://img.shields.io/github/issues-pr/SuperagenticAI/rlm-code.svg)](https://github.com/SuperagenticAI/rlm-code/pulls)
 **Run LLM-powered agents in a REPL loop, benchmark them, and compare results.**
@@ -27,6 +25,34 @@ RLM Code implements the [Recursive Language Models](https://arxiv.org/abs/2502.0
 RLM Code wraps this algorithm in an interactive terminal UI with built-in benchmarks, trajectory replay, and observability.
+## Release v0.1.7
+This release adds HALO-style trace analysis as a new RLM environment.
+- New `trace_analysis` environment for diagnosing agent harness failures from OTel-shaped JSONL traces
+- Sidecar trace indexing with dataset overview, query, count, search, full-trace view, and selected-span view actions
+- Bounded payload handling for large traces, including oversized summaries and higher-cap surgical span reads
+- `/rlm` help/docs updated for `env=trace_analysis`
+- Dedicated trace analysis docs under the Core Engine section
+Example:
+```text
+/rlm run "Find systemic harness failures trace=./traces.jsonl" env=trace_analysis steps=6
+```
+## Documentation
+<p align="center">
+  <a href="https://superagenticai.github.io/rlm-code/">
+    <img alt="Read the RLM Code Docs" src="https://img.shields.io/badge/Read%20the%20Docs-RLM%20Code-ff7a18?style=for-the-badge&logo=readthedocs&logoColor=white">
+  </a>
+</p>
+<p align="center">
+  <a href="https://superagenticai.github.io/rlm-code/"><strong>Open the full documentation</strong></a>
+</p>
 ## Install
 ```bash
@@ -168,6 +194,62 @@ Notes:
 - In Local/BYOK connection modes, likely coding prompts in chat can auto-route to harness.
 - In ACP mode, auto-routing is intentionally off; use `/harness run ...` explicitly.
+### 8. CodeMode with UTCP and Cloudflare MCP
+Use these server entries in your project `rlm_config.yaml`:
+```yaml
+mcp_servers:
+  utcp-codemode:
+    name: utcp-codemode
+    description: "Local CodeMode MCP bridge"
+    enabled: true
+    auto_connect: false
+    timeout_seconds: 30
+    retry_attempts: 3
+    transport:
+      type: stdio
+      command: npx
+      args:
+        - "@utcp/code-mode-mcp"
+  cloudflare-codemode:
+    name: cloudflare-codemode
+    description: "Cloudflare MCP via remote bridge"
+    enabled: true
+    auto_connect: false
+    timeout_seconds: 30
+    retry_attempts: 3
+    transport:
+      type: stdio
+      command: npx
+      args:
+        - "mcp-remote"
+        - "https://mcp.cloudflare.com/mcp"
+```
+UTCP path (native CodeMode in current release):
+```text
+/mcp-connect utcp-codemode
+/mcp-tools utcp-codemode
+/harness run "analyze this repo, find TODO/FIXME, and create report.json" steps=3 mcp=on strategy=codemode mcp_server=utcp-codemode
+```
+Cloudflare path (recommended strategy today):
+```text
+/mcp-connect cloudflare-codemode
+/mcp-tools cloudflare-codemode
+/harness run "list available tools and run one safe read-only action, then summarize in 3 bullets" steps=3 mcp=on strategy=tool_call mcp_server=cloudflare-codemode
+```
+Notes:
+- On first Cloudflare connect, `mcp-remote` may ask for interactive authentication.
+- In this release, `strategy=codemode` expects the `search_tools` + `call_tool_chain` bridge contract.
+- If a remote MCP server exposes a different tool contract, use `strategy=tool_call`.
 ## How the RLM Loop Works
 Traditional LLM usage: paste your document into the prompt, ask a question, hope the model doesn't lose details in the middle.
@@ -306,7 +388,7 @@ rlm_code/
   harness/          # Tool-using coding harness (/harness)
 ```
-## Documentation
+## Resources
 Full docs: https://superagenticai.github.io/rlm-code/

{rlm_code-0.1.5 → rlm_code-0.1.7}/pyproject.toml RENAMED

@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 [project]
 name = "rlm-code"
-version = "0.1.5"
+version = "0.1.7"
 description = "RLM Code: Research Playground & Evaluation OS for Recursive Language Model Agentic Systems"
 readme = "README.md"
 license = "Apache-2.0"

{rlm_code-0.1.5 → rlm_code-0.1.7}/rlm_code/__init__.py RENAMED

@@ -5,5 +5,5 @@ This package provides tools for creating, managing, and optimizing DSPy componen
 through natural language interactions.
 """
-__version__ = "0.1.5"
+__version__ = "0.1.7"
 __author__ = "Super Agentic AI"

{rlm_code-0.1.5 → rlm_code-0.1.7}/rlm_code/commands/slash_commands.py RENAMED

@@ -112,6 +112,7 @@ class SlashCommandHandler:
         self.rlm_runner = RLMRunner(
             llm_connector=self.llm_connector,
             execution_engine=self.execution_engine,
+            mcp_manager=self.mcp_manager,
             reward_profile=reward_profile,
             benchmark_pack_paths=benchmark_pack_paths,
         )
@@ -1442,7 +1443,7 @@ class SlashCommandHandler:
         Usage:
             /harness tools [mcp=on|off]
             /harness doctor
-            /harness run <task> [steps=N] [mcp=on|off] [tools=name[,name2]]
+            /harness run <task> [steps=N] [mcp=on|off] [mcp_server=name] [strategy=tool_call|codemode] [tools=name[,name2]]
         """
         if not args or args[0].lower() in {"help", "--help"}:
             console.print()
@@ -1450,7 +1451,8 @@ class SlashCommandHandler:
             console.print("  [yellow]/harness tools [mcp=on|off][/yellow]")
             console.print("  [yellow]/harness doctor[/yellow]")
             console.print(
-                "  [yellow]/harness run <task> [steps=N] [mcp=on|off] [tools=name[,name2]][/yellow]"
+                "  [yellow]/harness run <task> [steps=N] [mcp=on|off] [mcp_server=name] "
+                "[strategy=tool_call|codemode] [tools=name[,name2]][/yellow]"
             )
             console.print()
             return
@@ -1555,6 +1557,8 @@ class SlashCommandHandler:
             include_mcp = True
             max_steps = 10
             allowlist: list[str] | None = None
+            strategy = "tool_call"
+            mcp_server: str | None = None
             task_tokens: list[str] = []
             for token in args[1:]:
@@ -1568,6 +1572,16 @@ class SlashCommandHandler:
                 elif lowered.startswith("mcp="):
                     value = token.split("=", 1)[1].strip().lower()
                     include_mcp = value not in {"off", "false", "0", "no"}
+                elif lowered.startswith("mcp_server="):
+                    mcp_server = token.split("=", 1)[1].strip() or None
+                elif lowered.startswith("strategy="):
+                    raw_strategy = token.split("=", 1)[1].strip().lower().replace("-", "_")
+                    if raw_strategy not in {"tool_call", "codemode"}:
+                        show_error_message(
+                            "Invalid strategy value. Use strategy=tool_call|codemode."
+                        )
+                        return
+                    strategy = raw_strategy
                 elif lowered.startswith("tools="):
                     raw = token.split("=", 1)[1].strip()
                     parsed = [part.strip() for part in raw.split(",") if part.strip()]
@@ -1578,15 +1592,25 @@ class SlashCommandHandler:
             task = " ".join(task_tokens).strip()
             if not task:
                 show_error_message(
-                    "Usage: /harness run <task> [steps=N] [mcp=on|off] [tools=name[,name2]]"
+                    "Usage: /harness run <task> [steps=N] [mcp=on|off] [mcp_server=name] "
+                    "[strategy=tool_call|codemode] [tools=name[,name2]]"
                 )
                 return
+            if strategy == "codemode" and not include_mcp:
+                show_warning_message("strategy=codemode requires mcp=on. Enabling MCP.")
+                include_mcp = True
+            if strategy == "codemode" and allowlist:
+                show_warning_message("tools=... allowlist is ignored for strategy=codemode.")
+                allowlist = None
             console.print()
             console.print("[bold cyan]🛠 Running Harness[/bold cyan]")
             console.print(f"  Task: [cyan]{task}[/cyan]")
             console.print(f"  Max steps: [cyan]{max_steps}[/cyan]")
             console.print(f"  MCP tools: [cyan]{'on' if include_mcp else 'off'}[/cyan]")
+            console.print(f"  Strategy: [cyan]{strategy}[/cyan]")
+            if mcp_server:
+                console.print(f"  MCP server: [cyan]{mcp_server}[/cyan]")
             if allowlist:
                 console.print(f"  Tool allowlist: [cyan]{', '.join(allowlist)}[/cyan]")
             console.print()
@@ -1596,6 +1620,8 @@ class SlashCommandHandler:
                 max_steps=max_steps,
                 include_mcp=include_mcp,
                 tool_allowlist=allowlist,
+                strategy=strategy,
+                mcp_server=mcp_server,
             )
             self.current_context["harness_last_response"] = result.final_response
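
The `/harness run` hunks above add `strategy=` and `mcp_server=` tokens and two post-parse adjustments for CodeMode. The standalone sketch below restates that logic so it can be exercised outside the handler. It is an approximation under stated assumptions: the function name `parse_harness_options` is hypothetical, invalid strategies raise `ValueError` where the handler calls `show_error_message` and returns, warnings are collected in a list instead of being printed, the `tools=` assignment is assumed because the hunk elides it, and `steps=` handling is omitted because it is not shown in these hunks.

```python
def parse_harness_options(tokens):
    """Split /harness run tokens into (task, options, warnings)."""
    opts = {"include_mcp": True, "strategy": "tool_call",
            "mcp_server": None, "allowlist": None}
    warnings, task_tokens = [], []
    for token in tokens:
        lowered = token.lower()
        if lowered.startswith("mcp="):
            value = token.split("=", 1)[1].strip().lower()
            opts["include_mcp"] = value not in {"off", "false", "0", "no"}
        elif lowered.startswith("mcp_server="):
            opts["mcp_server"] = token.split("=", 1)[1].strip() or None
        elif lowered.startswith("strategy="):
            raw = token.split("=", 1)[1].strip().lower().replace("-", "_")
            if raw not in {"tool_call", "codemode"}:
                # Assumption: raise here; the handler shows an error and returns.
                raise ValueError("Invalid strategy value. Use strategy=tool_call|codemode.")
            opts["strategy"] = raw
        elif lowered.startswith("tools="):
            raw = token.split("=", 1)[1].strip()
            parsed = [part.strip() for part in raw.split(",") if part.strip()]
            opts["allowlist"] = parsed or None  # assumed; elided in the hunk
        else:
            task_tokens.append(token)
    # Post-parse adjustments mirrored from the diff.
    if opts["strategy"] == "codemode" and not opts["include_mcp"]:
        warnings.append("strategy=codemode requires mcp=on. Enabling MCP.")
        opts["include_mcp"] = True
    if opts["strategy"] == "codemode" and opts["allowlist"]:
        warnings.append("tools=... allowlist is ignored for strategy=codemode.")
        opts["allowlist"] = None
    return " ".join(task_tokens).strip(), opts, warnings
```

Note that `strategy=tool-call` is accepted because hyphens are normalized to underscores before validation, while a spelling such as `code-mode` normalizes to `code_mode` and is rejected.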
@@ -1658,8 +1684,8 @@ class SlashCommandHandler:
         Manage RLM runs.
         Usage:
-            /rlm run <task> [steps=N] [timeout=N] [branch=N] [depth=N] [children=N] [parallel=N] [budget=N] [framework=<see /rlm frameworks>] [env=generic|dspy|pure_rlm] [sub=provider/model]
-            /rlm bench [list|preset=name] [mode=native|harness|direct-llm] [pack=path[,path2]] [limit=N] [steps=N] [timeout=N] [branch=N] [framework=<see /rlm frameworks>] [env=generic|dspy|pure_rlm] [sub=provider/model]
+            /rlm run <task> [steps=N] [timeout=N] [branch=N] [depth=N] [children=N] [parallel=N] [budget=N] [framework=<see /rlm frameworks>] [env=generic|dspy|pure_rlm|trace_analysis] [sub=provider/model]
+            /rlm bench [list|preset=name] [mode=native|harness|direct-llm] [strategy=tool_call|codemode] [mcp=on|off] [mcp_server=name] [pack=path[,path2]] [limit=N] [steps=N] [timeout=N] [branch=N] [framework=<see /rlm frameworks>] [env=generic|dspy|pure_rlm] [sub=provider/model]
             /rlm bench compare [candidate=<id|path|latest>] [baseline=<id|path|previous>] [min_reward_delta=N] [min_completion_delta=N] [max_steps_increase=N]
             /rlm bench validate [candidate=<id|path|latest>] [baseline=<id|path|previous>] [min_reward_delta=N] [min_completion_delta=N] [max_steps_increase=N] [--json]
             /rlm bench report [candidate=<id|path|latest>] [baseline=<id|path|previous>] [format=markdown|csv|json] [output=path]
@@ -1670,8 +1696,8 @@ class SlashCommandHandler:
             /rlm status [run_id]
             /rlm abort [run_id|all]
             /rlm replay [run_id|latest]
-            /rlm doctor [env=generic|dspy|pure_rlm] [--json]
-            /rlm chat <message> [session=name] [env=generic|dspy|pure_rlm] [branch=N] [depth=N] [children=N] [parallel=N] [budget=N] [framework=<see /rlm frameworks>] [sub=provider/model]
+            /rlm doctor [env=generic|dspy|pure_rlm|trace_analysis] [--json]
+            /rlm chat <message> [session=name] [env=generic|dspy|pure_rlm|trace_analysis] [branch=N] [depth=N] [children=N] [parallel=N] [budget=N] [framework=<see /rlm frameworks>] [sub=provider/model]
             /rlm chat status [session=name]
             /rlm chat reset [session=name]
             /rlm observability
@@ -1682,13 +1708,14 @@ class SlashCommandHandler:
             console.print("[bold cyan]🧠 RLM Commands[/bold cyan]")
             console.print(
                 "  [yellow]/rlm run <task> [steps=N] [timeout=N] [branch=N] [depth=N] [children=N] "
-                f"[parallel=N] [budget=N] [framework={framework_opts}] [env=generic|dspy|pure_rlm] "
+                f"[parallel=N] [budget=N] [framework={framework_opts}] [env=generic|dspy|pure_rlm|trace_analysis] "
                 "[sub=provider/model][/yellow]"
             )
             console.print(
                 "  [yellow]/rlm bench [list|preset=name] [mode=native|harness|direct-llm] "
+                "[strategy=tool_call|codemode] [mcp=on|off] [mcp_server=name] "
                 "[pack=path[,path2]] [limit=N] [steps=N] "
-                f"[timeout=N] [branch=N] [framework={framework_opts}] [env=generic|dspy|pure_rlm] [sub=provider/model][/yellow]"
+                f"[timeout=N] [branch=N] [framework={framework_opts}] [env=generic|dspy|pure_rlm|trace_analysis] [sub=provider/model][/yellow]"
             )
             console.print(
                 "  [yellow]/rlm bench compare [candidate=<id|path|latest>] [baseline=<id|path|previous>] "
@@ -1714,9 +1741,9 @@ class SlashCommandHandler:
             console.print("  [yellow]/rlm status [run_id][/yellow]")
             console.print("  [yellow]/rlm abort [run_id|all][/yellow]")
             console.print("  [yellow]/rlm replay [run_id|latest][/yellow]")
-            console.print("  [yellow]/rlm doctor [env=generic|dspy|pure_rlm] [--json][/yellow]")
+            console.print("  [yellow]/rlm doctor [env=generic|dspy|pure_rlm|trace_analysis] [--json][/yellow]")
             console.print(
-                "  [yellow]/rlm chat <message> [session=name] [env=generic|dspy|pure_rlm] [branch=N] [depth=N] "
+                "  [yellow]/rlm chat <message> [session=name] [env=generic|dspy|pure_rlm|trace_analysis] [branch=N] [depth=N] "
                 f"[children=N] [parallel=N] [budget=N] [framework={framework_opts}] "
                 "[sub=provider/model][/yellow]"
             )
@@ -2108,7 +2135,7 @@ class SlashCommandHandler:
             task = " ".join(task_tokens).strip()
             if not task:
                 show_error_message(
-                    "Usage: /rlm run <task> [steps=N] [timeout=N] [env=generic|dspy|pure_rlm] "
+                    "Usage: /rlm run <task> [steps=N] [timeout=N] [env=generic|dspy|pure_rlm|trace_analysis] "
                     "[depth=N] [children=N] [parallel=N] [budget=N] "
                     f"[framework={framework_opts}] "
                     "[branch=N] [sub=provider/model]"
@@ -2521,6 +2548,9 @@ class SlashCommandHandler:
             environment: str | None = None
             sub_model: str | None = None
             sub_provider: str | None = None
+            include_mcp = False
+            mcp_server: str | None = None
+            harness_strategy = "tool_call"
             for token in args[1:]:
                 lowered = token.lower()
@@ -2537,6 +2567,19 @@ class SlashCommandHandler:
                         )
                         return
                     mode = resolved_mode
+                elif lowered.startswith("mcp="):
+                    value = token.split("=", 1)[1].strip().lower()
+                    include_mcp = value not in {"off", "false", "0", "no"}
+                elif lowered.startswith("strategy="):
+                    strategy_token = token.split("=", 1)[1].strip().lower().replace("-", "_")
+                    if strategy_token not in {"tool_call", "codemode"}:
+                        show_error_message(
+                            "Invalid strategy value. Use strategy=tool_call|codemode."
+                        )
+                        return
+                    harness_strategy = strategy_token
+                elif lowered.startswith("mcp_server="):
+                    mcp_server = token.split("=", 1)[1].strip() or None
                 elif lowered.startswith("pack="):
                     raw_paths = token.split("=", 1)[1].strip()
                     if not raw_paths:
@@ -2593,8 +2636,10 @@ class SlashCommandHandler:
                 else:
                     show_error_message(
                         "Usage: /rlm bench [list|preset=name] [mode=native|harness|direct-llm] "
+                        "[strategy=tool_call|codemode] [mcp=on|off] [mcp_server=name] "
                         "[pack=path[,path2]] [limit=N] "
-                        f"[steps=N] [timeout=N] [branch=N] [framework={framework_opts}] [env=generic|dspy|pure_rlm] [sub=provider/model]\n"
+                        f"[steps=N] [timeout=N] [branch=N] [framework={framework_opts}] "
+                        "[env=generic|dspy|pure_rlm] [sub=provider/model]\n"
                         "       /rlm bench compare [candidate=<id|path|latest>] [baseline=<id|path|previous>] ...\n"
                         "       /rlm bench validate [candidate=<id|path|latest>] [baseline=<id|path|previous>] ...\n"
                         "       /rlm bench report [candidate=<id|path|latest>] [baseline=<id|path|previous>] "
@@ -2602,6 +2647,30 @@ class SlashCommandHandler:
                     )
                     return
+            if mode == "harness" and harness_strategy == "codemode" and not include_mcp:
+                show_warning_message("strategy=codemode requires mcp=on. Enabling MCP.")
+                include_mcp = True
+            if mode != "harness" and include_mcp:
+                show_warning_message("mcp=on is only used for mode=harness. Ignoring MCP settings.")
+                include_mcp = False
+                mcp_server = None
+            elif mode != "harness" and mcp_server:
+                show_warning_message(
+                    "mcp_server is only used for mode=harness with mcp=on. Ignoring."
+                )
+                mcp_server = None
+            elif mode == "harness" and mcp_server and not include_mcp:
+                show_warning_message(
+                    "mcp_server provided but mcp=off. MCP server filter will be ignored."
+                )
+                mcp_server = None
+            if mode != "harness" and harness_strategy != "tool_call":
+                show_warning_message(
+                    "strategy is only used for mode=harness. Resetting to tool_call."
+                )
+                harness_strategy = "tool_call"
             if list_only:
                 try:
                     rows = self.rlm_runner.benchmark_presets(pack_paths=pack_paths_override)
@@ -2681,6 +2750,11 @@ class SlashCommandHandler:
             if timeout is not None:
                 console.print(f"  Override timeout: [cyan]{timeout}s[/cyan]")
             console.print(f"  Branch width: [cyan]{branch_width}[/cyan]")
+            if mode == "harness":
+                console.print(f"  Harness strategy: [cyan]{harness_strategy}[/cyan]")
+                console.print(f"  Harness MCP: [cyan]{'on' if include_mcp else 'off'}[/cyan]")
+                if include_mcp and mcp_server:
+                    console.print(f"  Harness MCP server: [cyan]{mcp_server}[/cyan]")
             if pack_paths_override:
                 console.print(f"  Benchmark packs: [cyan]{', '.join(pack_paths_override)}[/cyan]")
             if environment:
@@ -2704,6 +2778,9 @@ class SlashCommandHandler:
                     branch_width=branch_width,
                     sub_model=sub_model,
                     sub_provider=sub_provider,
+                    include_mcp=include_mcp,
+                    mcp_server=mcp_server,
+                    harness_strategy=harness_strategy,
                     pack_paths=pack_paths_override,
                 )
             except ValueError as exc:
@@ -4413,7 +4490,7 @@ class SlashCommandHandler:
 [bold magenta]RLM Workflows:[/bold magenta]
   [yellow]/rlm run[/yellow] <task> [steps=N] [timeout=N] [branch=N] [depth=N] [children=N] [parallel=N] [budget=N] [framework=native|dspy-rlm|adk-rlm|pydantic-ai|google-adk|deepagents] [env=generic|dspy|pure_rlm] [sub=provider/model] - Run an RLM coding episode
-  [yellow]/rlm bench[/yellow] [list|preset=name] [mode=native|harness|direct-llm] [pack=path[,path2]] [limit=N] [steps=N] [timeout=N] [branch=N] [framework=native|dspy-rlm|adk-rlm|pydantic-ai|google-adk|deepagents] [env=generic|dspy|pure_rlm] [sub=provider/model] - Run benchmark preset
+  [yellow]/rlm bench[/yellow] [list|preset=name] [mode=native|harness|direct-llm] [strategy=tool_call|codemode] [mcp=on|off] [mcp_server=name] [pack=path[,path2]] [limit=N] [steps=N] [timeout=N] [branch=N] [framework=native|dspy-rlm|adk-rlm|pydantic-ai|google-adk|deepagents] [env=generic|dspy|pure_rlm] [sub=provider/model] - Run benchmark preset
   [yellow]/rlm bench compare[/yellow] [candidate=<id|path|latest>] [baseline=<id|path|previous>] [min_reward_delta=N] [min_completion_delta=N] [max_steps_increase=N] - Gate regressions
   [yellow]/rlm bench validate[/yellow] [candidate=<id|path|latest>] [baseline=<id|path|previous>] [min_reward_delta=N] [min_completion_delta=N] [max_steps_increase=N] [--json] - CI-style gate output
   [yellow]/rlm bench report[/yellow] [candidate=<id|path|latest>] [baseline=<id|path|previous>] [format=markdown|csv|json] [output=path] - Export compare report
@@ -4431,7 +4508,7 @@ class SlashCommandHandler:
   [yellow]/rlm observability[/yellow]                   - Show local/MLflow observability sink status
   [yellow]/harness tools[/yellow] [mcp=on|off]         - List coding harness tools (local + MCP)
   [yellow]/harness doctor[/yellow]                     - Show harness tool coverage report
-  [yellow]/harness run[/yellow] <task> [steps=N] [mcp=on|off] [tools=name[,name2]] - Run tool-using coding harness
+  [yellow]/harness run[/yellow] <task> [steps=N] [mcp=on|off] [mcp_server=name] [strategy=tool_call|codemode] [tools=name[,name2]] - Run tool-using coding harness
 [bold magenta]Optimization (GEPA):[/bold magenta]
   [yellow]/optimize-start[/yellow] [budget]           - Start GEPA optimization workflow
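
The hunks above add three `/rlm bench` tokens (`strategy=`, `mcp=`, `mcp_server=`) and then reconcile them against `mode` before the benchmark runs. The sketch below restates that parsing and reconciliation as standalone functions so the precedence rules are easier to check. The names `parse_mcp`, `parse_strategy`, and `reconcile_bench_flags` are hypothetical and do not appear in the package. In the diff this logic is inline in `SlashCommandHandler` and reports through `show_warning_message` and `show_error_message` rather than returning a list.

```python
from __future__ import annotations

# Values that disable MCP, as listed in the diff; anything else enables it.
_MCP_OFF = {"off", "false", "0", "no"}
_STRATEGIES = {"tool_call", "codemode"}


def parse_mcp(token: str) -> bool:
    """Parse an `mcp=...` token (hypothetical helper)."""
    return token.split("=", 1)[1].strip().lower() not in _MCP_OFF


def parse_strategy(token: str) -> str | None:
    """Parse a `strategy=...` token; return None when the value is invalid.

    Hyphens are normalized to underscores, so `tool-call` is accepted.
    """
    value = token.split("=", 1)[1].strip().lower().replace("-", "_")
    return value if value in _STRATEGIES else None


def reconcile_bench_flags(
    mode: str,
    harness_strategy: str,
    include_mcp: bool,
    mcp_server: str | None,
) -> tuple[str, bool, str | None, list[str]]:
    """Apply the same checks, in the same order, as the diff (hypothetical helper)."""
    warnings: list[str] = []

    # codemode needs MCP, so it is switched on when mode=harness.
    if mode == "harness" and harness_strategy == "codemode" and not include_mcp:
        warnings.append("strategy=codemode requires mcp=on. Enabling MCP.")
        include_mcp = True

    # MCP settings only apply to mode=harness.
    if mode != "harness" and include_mcp:
        warnings.append("mcp=on is only used for mode=harness. Ignoring MCP settings.")
        include_mcp = False
        mcp_server = None
    elif mode != "harness" and mcp_server:
        warnings.append("mcp_server is only used for mode=harness with mcp=on. Ignoring.")
        mcp_server = None
    elif mode == "harness" and mcp_server and not include_mcp:
        warnings.append("mcp_server provided but mcp=off. MCP server filter will be ignored.")
        mcp_server = None

    # A non-default strategy outside harness mode is reset.
    if mode != "harness" and harness_strategy != "tool_call":
        warnings.append("strategy is only used for mode=harness. Resetting to tool_call.")
        harness_strategy = "tool_call"

    return harness_strategy, include_mcp, mcp_server, warnings
```

Under these assumptions, `mode=harness strategy=codemode` with MCP off ends up with MCP enabled and one warning, while `mode=native strategy=codemode mcp=on mcp_server=fs` falls back to `tool_call` with MCP disabled and two warnings.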
