PyPI - moa-cli - Versions diffs - 0.3.1__tar.gz → 0.3.3__tar.gz - Mend

moa-cli 0.3.1tar.gz → 0.3.3tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

{moa_cli-0.3.1 → moa_cli-0.3.3}/PKG-INFO +45 -16
{moa_cli-0.3.1 → moa_cli-0.3.3}/README.md +44 -15
{moa_cli-0.3.1 → moa_cli-0.3.3}/pyproject.toml +1 -1
{moa_cli-0.3.1 → moa_cli-0.3.3}/src/moa_cli/__init__.py +1 -1
{moa_cli-0.3.1 → moa_cli-0.3.3}/src/moa_cli/cli.py +213 -57

{moa_cli-0.3.1 → moa_cli-0.3.3}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.3
 Name: moa-cli
-Version: 0.3.1
+Version: 0.3.3
 Summary: Ask one question to multiple local AI coding CLIs in parallel and collect their answers.
 Keywords: llm,agents,cli,claude,codex,agy,opencode,peer-review
 Author: Paul-Louis Pröve
@@ -50,7 +50,7 @@ A single model gives you one perspective. Asking three frontier models the same
 ```text
 $ moa ask "Is Postgres or SQLite better for a desktop app?"
-Asking claude, codex, agy (timeout 600s, read-only)
+Asking claude, codex, agy (timeout 900s, read-only)
 ──────────────── claude (opus) · OK · 3.2s ────────────────
@@ -103,11 +103,16 @@ is enforced by spawning each CLI with its own read-only flags:
 | Provider   | Read-only (default)        | Reads files | Web research              |
 | ---------- | -------------------------- | ----------- | ------------------------- |
-| `claude`   | `--permission-mode plan`   | yes         | yes                       |
+| `claude`   | `--permission-mode default` | yes        | yes                       |
 | `codex`    | `-s read-only`             | yes         | **no** (sandbox blocks network) |
 | `opencode` | `--agent plan`             | yes         | yes                       |
 | `agy`      | `--sandbox` (partial: shell only - can still edit files) | yes | yes |
+`claude`'s `--permission-mode default` is read-only in moa's non-interactive use: it reads
+files and researches online with the full toolset, but any write or edit needs an interactive
+approval that never comes under `-p`, so all mutations are denied. (`plan` mode is **not**
+usable headless - it emits a plan and waits for approval instead of answering.)
 `codex`'s read-only mode is a kernel sandbox that also blocks network, so codex does no
 web research in the default mode (it still reads local files). `agy` has **no true
 read-only mode**: its `--sandbox` flag restricts agy's terminal/shell but does **not** stop
@@ -171,13 +176,14 @@ To avoid repeating the same flags on every call, persist your own defaults in a
 **Keys** (all shared across `ask`/`distill`/`debate`):
-| Key           | Type                    | Example                       |
-| ------------- | ----------------------- | ----------------------------- |
-| `num`         | int (>= 1)              | `num = 2`                     |
-| `timeout`     | seconds (> 0)           | `timeout = 120`               |
-| `exclude`     | list of provider names  | `exclude = ["claude"]`        |
-| `synthesizer` | `auto`/`random`/provider | `synthesizer = "codex"`      |
-| `[models]`    | provider -> model table | `claude = "sonnet"`           |
+| Key                | Type                     | Example                       |
+| ------------------ | ------------------------ | ----------------------------- |
+| `num`              | int (>= 1)               | `num = 2`                     |
+| `timeout`          | seconds (> 0)            | `timeout = 120`               |
+| `exclude`          | list of provider names   | `exclude = ["claude"]`        |
+| `synthesizer`      | `auto`/`random`/provider | `synthesizer = "codex"`       |
+| `[providers.<name>]` | per-provider `model` + `effort` | see below              |
+| `[models]`         | DEPRECATED provider -> model table | `claude = "sonnet"` |
 ```toml
 # ~/.moa/config.toml
@@ -186,11 +192,17 @@ timeout = 120
 exclude = ["claude"]
 synthesizer = "auto"
-[models]
-claude = "sonnet"
-agy = "Gemini 3.1 Pro (Low)"
+[providers.codex]
+model = "gpt-5.5"
+effort = "high"
+[providers.opencode]
+model = "zai-coding-plan/glm-5.2"
+effort = "high"
 ```
+Model and effort are grouped per provider under `[providers.<name>]`. The flat `[models]` table still works as a **deprecated alias** for `[providers.<name>].model`; when both set a model for the same provider, the `[providers.<name>]` block wins (MOA prints a one-line note, not an error).
 **`moa config`** inspects and edits the file (it creates the dir/file as needed and validates provider names):
 ```bash
@@ -198,12 +210,29 @@ moa config show                       # effective config (defaults + file) + pat
 moa config path                       # print the config file path
 moa config set num 2                  # set a scalar
 moa config set exclude claude,codex   # set the exclude list (comma-separated)
-moa config set model claude=sonnet    # set one entry in [models]
+moa config set model codex=gpt-5.5    # set a provider's model
+moa config set effort codex=high      # set a provider's reasoning effort
 moa config unset num                  # remove a key
-moa config unset model claude         # remove one [models] entry
+moa config unset model codex          # remove one provider's model
+moa config unset effort codex         # remove one provider's effort
 ```
-The role defaults are persistable too: the distill `synthesizer` and the debate `moderator` (e.g. `moa config set synthesizer codex`, `moa config set moderator agy`). `debate`'s `-r/--rounds` is not persisted. CLI `-m` overrides win per-provider over the config `[models]` table.
+The role defaults are persistable too: the distill `synthesizer` and the debate `moderator` (e.g. `moa config set synthesizer codex`, `moa config set moderator agy`). `debate`'s `-r/--rounds` is not persisted. CLI `-m` overrides win per-provider over the config model.
+#### Reasoning / effort
+Pin a per-provider **reasoning/effort** level in config so the council runs each tool at the depth you want without repeating flags. This is **config-only**: there is intentionally no `-e/--effort` CLI flag.
+MOA uses **raw pass-through with zero value mapping.** It does not normalize effort across providers or invent a canonical low/med/high scale. You write the **exact value the target tool expects**, and MOA pastes it verbatim into that provider's native flag. The only thing MOA maps is *where* the value lands in each provider's argv, never the value itself:
+| Provider   | `effort` lands in                    | Notes                                                       |
+| ---------- | ------------------------------------ | ----------------------------------------------------------- |
+| `codex`    | `-c model_reasoning_effort=<value>`  | generic config override                                     |
+| `opencode` | `--variant <value>`                  | opencode's "model variant (provider-specific reasoning effort)" |
+| `agy`      | (none)                               | reasoning is part of the model name, e.g. `Gemini 3.1 Pro (High)` |
+| `claude`   | (none)                               | no per-call effort flag                                     |
+Values are **tool-specific and not validated** by MOA (only "non-empty if present"): a value the target tool rejects fails at that tool, not in MOA. When no effort is configured for a provider, MOA passes **no effort flag at all**, so the tool's own default stands. Setting `effort` for `agy`/`claude` is stored but inert (they have no effort flag); MOA notes this when you set it.
 ### Output

{moa_cli-0.3.1 → moa_cli-0.3.3}/README.md RENAMED Viewed

@@ -39,7 +39,7 @@ A single model gives you one perspective. Asking three frontier models the same
 ```text
 $ moa ask "Is Postgres or SQLite better for a desktop app?"
-Asking claude, codex, agy (timeout 600s, read-only)
+Asking claude, codex, agy (timeout 900s, read-only)
 ──────────────── claude (opus) · OK · 3.2s ────────────────
@@ -92,11 +92,16 @@ is enforced by spawning each CLI with its own read-only flags:
 | Provider   | Read-only (default)        | Reads files | Web research              |
 | ---------- | -------------------------- | ----------- | ------------------------- |
-| `claude`   | `--permission-mode plan`   | yes         | yes                       |
+| `claude`   | `--permission-mode default` | yes        | yes                       |
 | `codex`    | `-s read-only`             | yes         | **no** (sandbox blocks network) |
 | `opencode` | `--agent plan`             | yes         | yes                       |
 | `agy`      | `--sandbox` (partial: shell only - can still edit files) | yes | yes |
+`claude`'s `--permission-mode default` is read-only in moa's non-interactive use: it reads
+files and researches online with the full toolset, but any write or edit needs an interactive
+approval that never comes under `-p`, so all mutations are denied. (`plan` mode is **not**
+usable headless - it emits a plan and waits for approval instead of answering.)
 `codex`'s read-only mode is a kernel sandbox that also blocks network, so codex does no
 web research in the default mode (it still reads local files). `agy` has **no true
 read-only mode**: its `--sandbox` flag restricts agy's terminal/shell but does **not** stop
@@ -160,13 +165,14 @@ To avoid repeating the same flags on every call, persist your own defaults in a
 **Keys** (all shared across `ask`/`distill`/`debate`):
-| Key           | Type                    | Example                       |
-| ------------- | ----------------------- | ----------------------------- |
-| `num`         | int (>= 1)              | `num = 2`                     |
-| `timeout`     | seconds (> 0)           | `timeout = 120`               |
-| `exclude`     | list of provider names  | `exclude = ["claude"]`        |
-| `synthesizer` | `auto`/`random`/provider | `synthesizer = "codex"`      |
-| `[models]`    | provider -> model table | `claude = "sonnet"`           |
+| Key                | Type                     | Example                       |
+| ------------------ | ------------------------ | ----------------------------- |
+| `num`              | int (>= 1)               | `num = 2`                     |
+| `timeout`          | seconds (> 0)            | `timeout = 120`               |
+| `exclude`          | list of provider names   | `exclude = ["claude"]`        |
+| `synthesizer`      | `auto`/`random`/provider | `synthesizer = "codex"`       |
+| `[providers.<name>]` | per-provider `model` + `effort` | see below              |
+| `[models]`         | DEPRECATED provider -> model table | `claude = "sonnet"` |
 ```toml
 # ~/.moa/config.toml
@@ -175,11 +181,17 @@ timeout = 120
 exclude = ["claude"]
 synthesizer = "auto"
-[models]
-claude = "sonnet"
-agy = "Gemini 3.1 Pro (Low)"
+[providers.codex]
+model = "gpt-5.5"
+effort = "high"
+[providers.opencode]
+model = "zai-coding-plan/glm-5.2"
+effort = "high"
 ```
+Model and effort are grouped per provider under `[providers.<name>]`. The flat `[models]` table still works as a **deprecated alias** for `[providers.<name>].model`; when both set a model for the same provider, the `[providers.<name>]` block wins (MOA prints a one-line note, not an error).
 **`moa config`** inspects and edits the file (it creates the dir/file as needed and validates provider names):
 ```bash
@@ -187,12 +199,29 @@ moa config show                       # effective config (defaults + file) + pat
 moa config path                       # print the config file path
 moa config set num 2                  # set a scalar
 moa config set exclude claude,codex   # set the exclude list (comma-separated)
-moa config set model claude=sonnet    # set one entry in [models]
+moa config set model codex=gpt-5.5    # set a provider's model
+moa config set effort codex=high      # set a provider's reasoning effort
 moa config unset num                  # remove a key
-moa config unset model claude         # remove one [models] entry
+moa config unset model codex          # remove one provider's model
+moa config unset effort codex         # remove one provider's effort
 ```
-The role defaults are persistable too: the distill `synthesizer` and the debate `moderator` (e.g. `moa config set synthesizer codex`, `moa config set moderator agy`). `debate`'s `-r/--rounds` is not persisted. CLI `-m` overrides win per-provider over the config `[models]` table.
+The role defaults are persistable too: the distill `synthesizer` and the debate `moderator` (e.g. `moa config set synthesizer codex`, `moa config set moderator agy`). `debate`'s `-r/--rounds` is not persisted. CLI `-m` overrides win per-provider over the config model.
+#### Reasoning / effort
+Pin a per-provider **reasoning/effort** level in config so the council runs each tool at the depth you want without repeating flags. This is **config-only**: there is intentionally no `-e/--effort` CLI flag.
+MOA uses **raw pass-through with zero value mapping.** It does not normalize effort across providers or invent a canonical low/med/high scale. You write the **exact value the target tool expects**, and MOA pastes it verbatim into that provider's native flag. The only thing MOA maps is *where* the value lands in each provider's argv, never the value itself:
+| Provider   | `effort` lands in                    | Notes                                                       |
+| ---------- | ------------------------------------ | ----------------------------------------------------------- |
+| `codex`    | `-c model_reasoning_effort=<value>`  | generic config override                                     |
+| `opencode` | `--variant <value>`                  | opencode's "model variant (provider-specific reasoning effort)" |
+| `agy`      | (none)                               | reasoning is part of the model name, e.g. `Gemini 3.1 Pro (High)` |
+| `claude`   | (none)                               | no per-call effort flag                                     |
+Values are **tool-specific and not validated** by MOA (only "non-empty if present"): a value the target tool rejects fails at that tool, not in MOA. When no effort is configured for a provider, MOA passes **no effort flag at all**, so the tool's own default stands. Setting `effort` for `agy`/`claude` is stored but inert (they have no effort flag); MOA notes this when you set it.
 ### Output

{moa_cli-0.3.1 → moa_cli-0.3.3}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "moa-cli"
-version = "0.3.1"
+version = "0.3.3"
 description = "Ask one question to multiple local AI coding CLIs in parallel and collect their answers."
 readme = "README.md"
 authors = [

{moa_cli-0.3.1 → moa_cli-0.3.3}/src/moa_cli/__init__.py RENAMED Viewed

@@ -1,3 +1,3 @@
 """MOA CLI package."""
-__version__ = "0.3.1"
+__version__ = "0.3.3"

{moa_cli-0.3.1 → moa_cli-0.3.3}/src/moa_cli/cli.py RENAMED Viewed

@@ -29,11 +29,14 @@ import typer
 # Providers: each agent CLI we know how to drive.
 # --------------------------------------------------------------------------- #
-# A command builder turns (prompt, model, output_file, perm) into an argv list.
-# output_file is a path the CLI may be told to write its final answer to; it is
-# None for providers that answer cleanly on stdout. Only codex uses it. `perm`
-# is the permission argv (read-only or yolo flags) spliced in before the prompt.
-CommandBuilder = Callable[[str, str, str | None, tuple[str, ...]], list[str]]
+# A command builder turns (prompt, model, output_file, perm, effort) into an argv
+# list. output_file is a path the CLI may be told to write its final answer to; it
+# is None for providers that answer cleanly on stdout. Only codex uses it. `perm`
+# is the permission argv (read-only or yolo flags) and `effort` is the reasoning
+# argv (the provider's native flag with the user's value pasted in verbatim), both
+# spliced in before the prompt. `effort` is an empty tuple when no effort is
+# configured or the provider has no effort flag.
+CommandBuilder = Callable[[str, str, str | None, tuple[str, ...], tuple[str, ...]], list[str]]
 @dataclass(frozen=True)
@@ -59,6 +62,25 @@ class Provider:
     unset_env: tuple[str, ...] = ()
     # codex's stdout is session chrome; its real answer goes to an output file.
     uses_output_file: bool = False
+    # Reasoning/effort flag mapping, declared as data (like perm_args), not
+    # branched per tool. This is the ONLY thing moa knows about effort: WHERE the
+    # user's value lands in argv. It never normalizes or validates the value. The
+    # callable takes the raw effort value and returns the argv to splice in; it is
+    # called only for a non-empty value. `None` means the provider has no per-call
+    # effort flag (agy carries effort in the model name; claude has none), so
+    # effort_args always returns () for it.
+    effort_flag: Callable[[str], tuple[str, ...]] | None = None
+    def effort_args(self, value: str | None) -> tuple[str, ...]:
+        """The reasoning/effort argv for this run, value pasted in verbatim.
+        Empty tuple when the provider has no effort flag (agy/claude) or no
+        effort is configured (value is None/empty). moa never interprets the
+        value: it is the exact wording the target tool expects.
+        """
+        if not value or self.effort_flag is None:
+            return ()
+        return self.effort_flag(value)
     def env(self) -> dict[str, str]:
         env = dict(os.environ)
@@ -77,31 +99,41 @@ class Provider:
         return self.readonly or ()
-def _claude(prompt: str, model: str, _out: str | None, perm: tuple[str, ...]) -> list[str]:
+def _claude(
+    prompt: str, model: str, _out: str | None, perm: tuple[str, ...], _effort: tuple[str, ...]
+) -> list[str]:
+    # claude has no clean per-call effort flag, so _effort is always () here.
     return ["claude", "--model", model, *perm, "-p", prompt]
-def _codex(prompt: str, model: str, out: str | None, perm: tuple[str, ...]) -> list[str]:
-    cmd = ["codex", "exec", "-m", model, "--skip-git-repo-check", "--color", "never", *perm]
+def _codex(
+    prompt: str, model: str, out: str | None, perm: tuple[str, ...], effort: tuple[str, ...]
+) -> list[str]:
+    cmd = ["codex", "exec", "-m", model, "--skip-git-repo-check", "--color", "never", *perm, *effort]
     if out:
         cmd += ["-o", out]
     cmd.append(prompt)
     return cmd
-def _agy(prompt: str, model: str, _out: str | None, perm: tuple[str, ...]) -> list[str]:
+def _agy(
+    prompt: str, model: str, _out: str | None, perm: tuple[str, ...], _effort: tuple[str, ...]
+) -> list[str]:
     # agy also hosts Claude/GPT-OSS models, so we pin a Gemini model explicitly
     # to keep the panel diverse. Without --model it defaults to Gemini Flash.
     # perm (e.g. --sandbox) goes first so the default reads `agy --sandbox
-    # --model ... -p ...`.
+    # --model ... -p ...`. agy's reasoning lives in the model name, so _effort is
+    # always () here.
     return ["agy", *perm, "--model", model, "-p", prompt]
-def _opencode(prompt: str, model: str, _out: str | None, perm: tuple[str, ...]) -> list[str]:
+def _opencode(
+    prompt: str, model: str, _out: str | None, perm: tuple[str, ...], effort: tuple[str, ...]
+) -> list[str]:
     # opencode has no universal default model (it depends on which provider the
     # user has authed), so we omit -m when no model is given and let opencode
     # pick its own default. The prompt is a positional arg.
-    cmd = ["opencode", "run", *perm]
+    cmd = ["opencode", "run", *perm, *effort]
     if model:
         cmd += ["-m", model]
     cmd.append(prompt)
@@ -111,7 +143,11 @@ def _opencode(prompt: str, model: str, _out: str | None, perm: tuple[str, ...])
 PROVIDERS: dict[str, Provider] = {
     "claude": Provider(
         "claude", "claude", "opus", _claude,
-        readonly=("--permission-mode", "plan"),
+        # In headless `-p`, `default` keeps the full toolset (Read, read-only Bash
+        # like git/grep, WebFetch) but has no way to approve a write/edit, so
+        # mutations are denied: effectively read-only with all tools. (`plan` mode
+        # instead emits a plan-approval stub and never answers under `-p`.)
+        readonly=("--permission-mode", "default"),
         yolo=("--permission-mode", "bypassPermissions"),
         unset_env=("CLAUDECODE",),
     ),
@@ -120,6 +156,10 @@ PROVIDERS: dict[str, Provider] = {
         readonly=("-s", "read-only"),
         yolo=("-s", "danger-full-access"),
         uses_output_file=True,
+        # Verified against installed codex: `-c key=value` is the generic config
+        # override (`codex exec --help`); reasoning effort lives at
+        # model_reasoning_effort. The value is pasted verbatim.
+        effort_flag=lambda v: ("-c", f"model_reasoning_effort={v}"),
     ),
     "agy": Provider(
         "agy", "agy", "Gemini 3.1 Pro (High)", _agy,
@@ -134,7 +174,14 @@ PROVIDERS: dict[str, Provider] = {
     "opencode": Provider(
         "opencode", "opencode", "", _opencode,
         readonly=("--agent", "plan"),
-        yolo=(),  # default = build agent (full access)
+        # Bare `build` (the default agent) still gates doom_loop/external_directory
+        # as `ask`, which auto-rejects headless - so true full access needs the
+        # explicit skip-permissions flag. (Verified on opencode 1.17.8.)
+        yolo=("--dangerously-skip-permissions",),
+        # Verified against installed opencode: `--variant <value>` is the
+        # "model variant (provider-specific reasoning effort, e.g., high)" flag
+        # (`opencode run --help`). The value is pasted verbatim.
+        effort_flag=lambda v: ("--variant", v),
     ),
 }
@@ -232,7 +279,12 @@ async def _terminate(process: asyncio.subprocess.Process) -> None:
 async def run_provider(
-    provider: Provider, prompt: str, timeout: float, model: str | None = None, yolo: bool = False
+    provider: Provider,
+    prompt: str,
+    timeout: float,
+    model: str | None = None,
+    yolo: bool = False,
+    effort: str | None = None,
 ) -> RunResult:
     model = model or provider.default_model
     out_file: str | None = None
@@ -244,7 +296,7 @@ async def run_provider(
     try:
         try:
             process = await asyncio.create_subprocess_exec(
-                *provider.build(prompt, model, out_file, provider.perm_args(yolo)),
+                *provider.build(prompt, model, out_file, provider.perm_args(yolo), provider.effort_args(effort)),
                 # DEVNULL is essential: codex and agy block forever on an
                 # inherited TTY stdin, burning the entire timeout otherwise.
                 stdin=asyncio.subprocess.DEVNULL,
@@ -286,11 +338,15 @@ async def stream(
     timeout: float,
     models: dict[str, str] | None = None,
     yolo: bool = False,
+    efforts: dict[str, str] | None = None,
 ) -> AsyncIterator[RunResult]:
     """Run every provider in parallel, yielding each result as it finishes."""
     models = models or {}
+    efforts = efforts or {}
     tasks = [
-        asyncio.create_task(run_provider(p, prompt, timeout, models.get(p.name), yolo))
+        asyncio.create_task(
+            run_provider(p, prompt, timeout, models.get(p.name), yolo, efforts.get(p.name))
+        )
         for p in providers
     ]
     for completed in asyncio.as_completed(tasks):
@@ -686,22 +742,31 @@ def verdict_record(result: RunResult, moderator: str) -> dict:
 # merge happens once, in resolve_run, so all verbs pick up defaults identically.
 # --------------------------------------------------------------------------- #
-# Scalar config keys and the type each maps to. `exclude` (list[str]) and the
-# `[models]` table are handled separately because they aren't plain scalars.
+# Scalar config keys and the type each maps to. `exclude` (list[str]), the
+# `[providers.<name>]` blocks, and the deprecated `[models]` table are handled
+# separately because they aren't plain scalars.
 _CONFIG_SCALARS: dict[str, type] = {"num": int, "timeout": float, "synthesizer": str, "moderator": str}
-_CONFIG_KEYS: tuple[str, ...] = (*_CONFIG_SCALARS, "exclude", "models")
+# Top-level config keys a file may contain. `providers` is the canonical
+# per-provider block (model + effort); `models` is the DEPRECATED flat alias for
+# `[providers.<name>].model`, kept for back-compat with 0.2.x/0.3.x configs.
+_CONFIG_KEYS: tuple[str, ...] = (*_CONFIG_SCALARS, "exclude", "providers", "models")
+# Per-provider keys allowed inside a `[providers.<name>]` block.
+_PROVIDER_KEYS: tuple[str, ...] = ("model", "effort")
 # Synthesizer accepts the special modes plus any known provider name.
 _SYNTHESIZER_MODES: tuple[str, ...] = ("auto", "first", "random")
 # Moderator accepts "auto" (the top-priority selected agent) or a provider name.
 _MODERATOR_MODES: tuple[str, ...] = ("auto",)
 # The built-in defaults, shown by `config show` when a key isn't in the file.
+# `models`/`efforts` are the normalized per-provider maps load_config produces;
+# serialize_config renders them back as `[providers.<name>]` blocks.
 _CONFIG_DEFAULTS: dict = {
     "num": 3,
-    "timeout": 600.0,
+    "timeout": 900.0,
     "synthesizer": "auto",
     "moderator": "auto",
     "exclude": [],
     "models": {},
+    "efforts": {},
 }
@@ -770,12 +835,57 @@ def load_config() -> dict:
             raise ValueError("Config key 'exclude' must be a list of provider names.")
         _validate_providers(value, "exclude")
         config["exclude"] = value
+    # Models come from two places: the canonical `[providers.<name>].model` and
+    # the DEPRECATED flat `[models]` table. Start from the deprecated table, then
+    # let the provider blocks win on conflict (with a one-line note, not an
+    # error). Efforts come only from `[providers.<name>].effort`.
+    models: dict[str, str] = {}
+    efforts: dict[str, str] = {}
     if "models" in raw:
-        models = raw["models"]
-        if not isinstance(models, dict) or not all(isinstance(v, str) for v in models.values()):
+        legacy = raw["models"]
+        if not isinstance(legacy, dict) or not all(isinstance(v, str) for v in legacy.values()):
             raise ValueError("Config table '[models]' must map provider names to model strings.")
-        _validate_providers(models, "[models]")
-        config["models"] = dict(models)
+        _validate_providers(legacy, "[models]")
+        models.update(legacy)
+    if "providers" in raw:
+        providers = raw["providers"]
+        if not isinstance(providers, dict):
+            raise ValueError("Config table '[providers]' must map provider names to {model, effort} blocks.")
+        _validate_providers(providers, "[providers]")
+        shadowed: list[str] = []
+        for name, block in providers.items():
+            if not isinstance(block, dict):
+                raise ValueError(f"Config table '[providers.{name}]' must be a {{model, effort}} block.")
+            unknown_pk = [k for k in block if k not in _PROVIDER_KEYS]
+            if unknown_pk:
+                raise ValueError(
+                    f"Unknown key(s) in [providers.{name}]: {', '.join(unknown_pk)}. "
+                    f"Known: {', '.join(_PROVIDER_KEYS)}."
+                )
+            if "model" in block:
+                model = block["model"]
+                if not isinstance(model, str):
+                    raise ValueError(f"[providers.{name}].model must be a string.")
+                if name in raw.get("models", {}) and raw["models"][name] != model:
+                    shadowed.append(name)
+                models[name] = model
+            if "effort" in block:
+                effort = block["effort"]
+                if not isinstance(effort, str) or not effort:
+                    raise ValueError(f"[providers.{name}].effort must be a non-empty string.")
+                efforts[name] = effort
+        if shadowed:
+            _note(
+                f"Note: [providers.{shadowed[0]}].model overrides the deprecated [models] entry "
+                f"for {', '.join(shadowed)}."
+            )
+    if models:
+        config["models"] = models
+    if efforts:
+        config["efforts"] = efforts
     return config
@@ -799,10 +909,14 @@ def _toml_str(value: str) -> str:
 def serialize_config(config: dict) -> str:
-    """Render our flat config schema back to TOML text.
-    Hand-rolled on purpose (no writer dependency): we only ever emit scalars,
-    the `exclude` string list, and the `[models]` string table, in that order.
+    """Render our config schema back to TOML text.
+    Hand-rolled on purpose (no writer dependency): we emit scalars, the `exclude`
+    string list, then one `[providers.<name>]` block per provider that has a
+    model and/or effort. We always write the canonical `[providers.<name>]` form
+    (never the deprecated flat `[models]` table), so a round-trip upgrades an old
+    file's models in place. The normalized `models`/`efforts` maps are folded
+    back together per provider so model and effort stay grouped.
     """
     lines: list[str] = []
     if "num" in config:
@@ -819,11 +933,19 @@ def serialize_config(config: dict) -> str:
     if "exclude" in config:
         items = ", ".join(_toml_str(v) for v in config["exclude"])
         lines.append(f"exclude = [{items}]")
-    if config.get("models"):
+    models = config.get("models") or {}
+    efforts = config.get("efforts") or {}
+    # Emit one block per provider, in PRIORITY order then any extras, grouping the
+    # provider's model and effort together.
+    names = [n for n in PRIORITY if n in models or n in efforts]
+    names += [n for n in (*models, *efforts) if n not in names]
+    for name in names:
         lines.append("")
-        lines.append("[models]")
-        for provider, model in config["models"].items():
-            lines.append(f"{provider} = {_toml_str(model)}")
+        lines.append(f"[providers.{name}]")
+        if name in models:
+            lines.append(f"model = {_toml_str(models[name])}")
+        if name in efforts:
+            lines.append(f"effort = {_toml_str(efforts[name])}")
     return "\n".join(lines) + "\n" if lines else ""
@@ -927,6 +1049,7 @@ async def _collect(
     models: dict[str, str] | None = None,
     yolo: bool = False,
     emit_blocks: bool = True,
+    efforts: dict[str, str] | None = None,
 ) -> list[RunResult]:
     """Gather every agent's result. With emit_blocks (ask), each complete answer
     is flushed to stdout the instant it arrives. Without it (distill), the
@@ -934,7 +1057,7 @@ async def _collect(
     distilled block is content - so we keep stdout clean and just heartbeat each
     arrival to stderr so a multi-agent run doesn't look frozen while it waits."""
     results: list[RunResult] = []
-    async for result in stream(providers, prompt, timeout, models, yolo):
+    async for result in stream(providers, prompt, timeout, models, yolo, efforts):
         results.append(result)
         if emit_blocks:
             _emit(json.dumps(result_record(result)) if json_output else render_block(result))
@@ -991,6 +1114,7 @@ class RunConfig:
     timeout: float
     json_output: bool
     yolo: bool
+    efforts: dict[str, str]
 def resolve_run(
@@ -1027,13 +1151,17 @@ def resolve_run(
         raise typer.BadParameter(f"{config_path()}: {exc}") from exc
     num = resolve_option(num, "num", config, default_num)
-    timeout = resolve_option(timeout, "timeout", config, 600.0)
+    timeout = resolve_option(timeout, "timeout", config, 900.0)
     # Repeatable flags are an empty list when omitted, not None, so treat empty
     # as "fall back to config" for exclude.
     exclude_names = tuple(exclude) if exclude else tuple(config.get("exclude", ()))
     # CLI -m overrides win per-provider over config [models]; unnamed providers
     # keep their config value, then their built-in default.
     models = {**config.get("models", {}), **parse_model_overrides(model)}
+    # Effort is config-only (no CLI flag, by design); it comes straight from the
+    # per-provider config blocks. agy/claude have no effort flag, so a value set
+    # for them is simply ignored by their builder (effort_args returns ()).
+    efforts = dict(config.get("efforts", {}))
     if num < 1:
         raise typer.BadParameter("--num must be at least 1.")
@@ -1064,7 +1192,7 @@ def resolve_run(
                 note += f"; note: {p.readonly_note}"
     _note(note)
-    return RunConfig(prompt_text, selected, models, timeout, json_output, yolo)
+    return RunConfig(prompt_text, selected, models, timeout, json_output, yolo, efforts)
 @app.command()
@@ -1083,7 +1211,10 @@ def ask(
     cfg = resolve_run(prompt, file, num, provider, exclude, model, timeout, json_output, yolo)
     results = asyncio.run(
-        _collect(cfg.selected, cfg.prompt, cfg.timeout, cfg.json_output, cfg.models, cfg.yolo)
+        _collect(
+            cfg.selected, cfg.prompt, cfg.timeout, cfg.json_output, cfg.models, cfg.yolo,
+            efforts=cfg.efforts,
+        )
     )
     if not any(r.status == "ok" for r in results):
         raise typer.Exit(code=1)
@@ -1117,7 +1248,7 @@ def distill(
     results = asyncio.run(
         _collect(
             cfg.selected, cfg.prompt, cfg.timeout, cfg.json_output, cfg.models, cfg.yolo,
-            emit_blocks=False,
+            emit_blocks=False, efforts=cfg.efforts,
         )
     )
     successes = [r for r in results if r.status == "ok"]
@@ -1152,7 +1283,10 @@ def _run_synthesis(
     _note(f"Distilling with {synth_name}...")
     synth_model = cfg.models.get(synth_name)
     synth_result = asyncio.run(
-        run_provider(PROVIDERS[synth_name], synth_prompt, cfg.timeout, synth_model, cfg.yolo)
+        run_provider(
+            PROVIDERS[synth_name], synth_prompt, cfg.timeout, synth_model, cfg.yolo,
+            cfg.efforts.get(synth_name),
+        )
     )
     if cfg.json_output:
@@ -1227,7 +1361,8 @@ async def _moderator_signals_done(
     prompt = build_convergence_prompt(cfg.prompt, latest_ok)
     _note(f"Round {round_num}: moderator {moderator.name} checking for convergence...")
     result = await run_provider(
-        moderator, prompt, cfg.timeout, cfg.models.get(moderator.name), cfg.yolo
+        moderator, prompt, cfg.timeout, cfg.models.get(moderator.name), cfg.yolo,
+        cfg.efforts.get(moderator.name),
     )
     done = result.status == "ok" and result.stdout.strip().upper().startswith(CONVERGENCE_DONE)
     if done:
@@ -1266,7 +1401,8 @@ async def _run_debate(
             turn_prompt = build_debate_turn_prompt(cfg.prompt, prior)
             _note(f"Round {round_num}: {debater.name} responding...")
             result = await run_provider(
-                debater, turn_prompt, cfg.timeout, cfg.models.get(debater.name), cfg.yolo
+                debater, turn_prompt, cfg.timeout, cfg.models.get(debater.name), cfg.yolo,
+                cfg.efforts.get(debater.name),
             )
             transcript.append(result)
             latest[debater.name] = result
@@ -1298,7 +1434,8 @@ async def _run_debate(
     verdict_prompt, _label_map = build_verdict_prompt(cfg.prompt, transcript)
     _note(f"Moderator {moderator.name} writing the final answer...")
     verdict = await run_provider(
-        moderator, verdict_prompt, cfg.timeout, cfg.models.get(moderator.name), cfg.yolo
+        moderator, verdict_prompt, cfg.timeout, cfg.models.get(moderator.name), cfg.yolo,
+        cfg.efforts.get(moderator.name),
     )
     transcript.append(verdict)
     _emit(
@@ -1371,8 +1508,8 @@ def config_show() -> None:
 @config_app.command("set")
 def config_set(
-    key: Annotated[str, typer.Argument(help="Config key: num | timeout | synthesizer | moderator | exclude | model.")],
-    value: Annotated[str, typer.Argument(help="Value. For models: PROVIDER=MODEL. For exclude: comma-separated names.")],
+    key: Annotated[str, typer.Argument(help="Config key: num | timeout | synthesizer | moderator | exclude | model | effort.")],
+    value: Annotated[str, typer.Argument(help="Value. For model/effort: PROVIDER=VALUE. For exclude: comma-separated names.")],
 ) -> None:
     """Write a value to the config file, creating the dir/file if missing."""
     config = _load_config_or_exit()
@@ -1385,6 +1522,22 @@ def config_set(
         if provider not in PROVIDERS:
             raise typer.BadParameter(f"Unknown provider: {provider!r}. Known: {', '.join(PROVIDERS)}.")
         config.setdefault("models", {})[provider] = model
+    elif key == "effort":
+        if "=" not in value:
+            raise typer.BadParameter("effort expects PROVIDER=VALUE, e.g. `moa config set effort codex=high`.")
+        provider, effort = value.split("=", 1)
+        provider = provider.strip()
+        if provider not in PROVIDERS:
+            raise typer.BadParameter(f"Unknown provider: {provider!r}. Known: {', '.join(PROVIDERS)}.")
+        # Raw pass-through: the value is whatever the target tool expects; moa
+        # only refuses an empty string (no enum/scale validation, by design).
+        if not effort:
+            raise typer.BadParameter("effort value cannot be empty.")
+        config.setdefault("efforts", {})[provider] = effort
+        if PROVIDERS[provider].effort_flag is None:
+            # Stored but inert: agy carries reasoning in the model name, claude
+            # has no per-call effort flag. A note, not an error.
+            _note(f"Note: {provider} has no effort flag; this value will be ignored at runtime.")
     elif key == "exclude":
         names = [name.strip() for name in value.split(",") if name.strip()]
         try:
@@ -1404,7 +1557,7 @@ def config_set(
             raise typer.BadParameter(str(exc)) from exc
         config[key] = coerced
     else:
-        known = "num, timeout, synthesizer, moderator, exclude, model"
+        known = "num, timeout, synthesizer, moderator, exclude, model, effort"
         raise typer.BadParameter(f"Unknown config key: {key!r}. Known: {known}.")
     write_config(config)
@@ -1413,24 +1566,27 @@ def config_set(
 @config_app.command("unset")
 def config_unset(
-    key: Annotated[str, typer.Argument(help="Config key to remove. Use `model PROVIDER` to drop one model.")],
-    provider: Annotated[str | None, typer.Argument(help="Provider name, only when key is 'model'.")] = None,
+    key: Annotated[str, typer.Argument(help="Config key to remove. Use `model PROVIDER` / `effort PROVIDER` to drop one.")],
+    provider: Annotated[str | None, typer.Argument(help="Provider name, only when key is 'model' or 'effort'.")] = None,
 ) -> None:
-    """Remove a key from the config file (or a single model with `unset model PROVIDER`)."""
+    """Remove a key from the config file (or a single model/effort with `unset model|effort PROVIDER`)."""
     config = _load_config_or_exit()
-    if key == "model":
+    if key in ("model", "effort"):
+        # model -> the normalized [providers.<name>].model map (a.k.a. [models]);
+        # effort -> the [providers.<name>].effort map. Same per-provider shape.
+        table_key = "models" if key == "model" else "efforts"
         if not provider:
-            raise typer.BadParameter("unset model expects a provider, e.g. `moa config unset model claude`.")
-        models = config.get("models", {})
-        if provider in models:
-            del models[provider]
-            if not models:
-                config.pop("models", None)
+            raise typer.BadParameter(f"unset {key} expects a provider, e.g. `moa config unset {key} codex`.")
+        table = config.get(table_key, {})
+        if provider in table:
+            del table[provider]
+            if not table:
+                config.pop(table_key, None)
             write_config(config)
-            typer.echo(f"Unset model {provider} in {config_path()}")
+            typer.echo(f"Unset {key} {provider} in {config_path()}")
         else:
-            typer.echo(f"model {provider} was not set.")
+            typer.echo(f"{key} {provider} was not set.")
         return
     if key not in _CONFIG_KEYS:

moa-cli 0.3.1__tar.gz → 0.3.3__tar.gz

moa-cli 0.3.1tar.gz → 0.3.3tar.gz