devlyn-cli 1.12.2 → 1.12.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CLAUDE.md +1 -1
- package/README.md +1 -1
- package/config/skills/devlyn:auto-resolve/SKILL.md +4 -4
- package/config/skills/devlyn:auto-resolve/references/engine-routing.md +1 -1
- package/config/skills/devlyn:ideate/SKILL.md +3 -3
- package/config/skills/devlyn:preflight/SKILL.md +3 -3
- package/config/skills/devlyn:team-resolve/SKILL.md +1 -1
- package/config/skills/devlyn:team-review/SKILL.md +1 -1
- package/package.json +1 -1
package/CLAUDE.md
CHANGED
|
@@ -72,7 +72,7 @@ Optional flags:
|
|
|
72
72
|
- `--skip-review` — skip team-review phase
|
|
73
73
|
- `--skip-clean` — skip clean phase
|
|
74
74
|
- `--skip-docs` — skip update-docs phase
|
|
75
|
-
- `--engine auto|codex|claude` — intelligent model routing. `auto` routes each phase and team role to the optimal model (Claude or Codex GPT-5.4) based on benchmark data. `codex` forces Codex for implementation, Claude for evaluation. `claude`
|
|
75
|
+
- `--engine auto|codex|claude` — intelligent model routing. `auto` (default) routes each phase and team role to the optimal model (Claude or Codex GPT-5.4) based on benchmark data. `codex` forces Codex for implementation, Claude for evaluation. `claude` uses Claude for everything. Requires codex-mcp-server for `auto` and `codex` modes.
|
|
76
76
|
- `--with-codex [evaluate|review|both]` — (legacy, superseded by `--engine`) use OpenAI Codex as cross-model evaluator/reviewer (requires codex-mcp-server)
|
|
77
77
|
|
|
78
78
|
## Preflight Check (Post-Roadmap Verification)
|
package/README.md
CHANGED
|
@@ -111,7 +111,7 @@ Install the Codex MCP server during setup, then:
|
|
|
111
111
|
|
|
112
112
|
**`--engine auto`** routes each pipeline phase and team role to the optimal model (Claude Opus 4.6 or GPT-5.4) — validated through A/B testing, not just benchmarks.
|
|
113
113
|
|
|
114
|
-
> `--engine auto` (recommended) · `--engine codex` (force Codex for build) · `--engine claude` (
|
|
114
|
+
> `--engine auto` (default, recommended) · `--engine codex` (force Codex for build) · `--engine claude` (Claude only)
|
|
115
115
|
|
|
116
116
|
Works across the full pipeline:
|
|
117
117
|
|
|
@@ -34,10 +34,10 @@ This pipeline runs hands-free. The user launches it to walk away and come back t
|
|
|
34
34
|
- `--skip-build-gate` (false) — skip the deterministic build gate (Phase 1.4). Not recommended — the build gate is the primary defense against "tests pass locally, breaks in CI/Docker/production" class of bugs.
|
|
35
35
|
- `--build-gate MODE` (auto) — controls build gate behavior. `auto`: detect project type and run appropriate build/typecheck/lint commands; if Dockerfile(s) are present, Docker builds are included automatically. `strict`: auto + treat warnings as errors. `no-docker`: auto but skip Docker builds even if Dockerfiles exist (for faster iteration). `skip`: same as --skip-build-gate.
|
|
36
36
|
- `--with-codex` (false) — use OpenAI Codex as a cross-model evaluator/reviewer via `mcp__codex-cli__*` MCP tools. Accepts: `evaluate`, `review`, or `both` (default when flag is present without value). When enabled, Codex provides an independent second opinion from a different model family, creating a GAN-like dynamic where Claude builds and Codex critiques. **Ignored if `--engine` is set** (engine routing subsumes this).
|
|
37
|
-
- `--engine MODE` (
|
|
38
|
-
- `
|
|
37
|
+
- `--engine MODE` (auto) — controls which model handles each pipeline phase and team role. Modes:
|
|
38
|
+
- `auto` (default): each phase and team role routes to the optimal model based on benchmark data. Requires Codex MCP server. Subsumes `--with-codex both`.
|
|
39
39
|
- `codex`: Codex handles implementation/analysis phases, Claude handles orchestration, evaluation, and Chrome MCP.
|
|
40
|
-
- `
|
|
40
|
+
- `claude`: all phases use Claude subagents. No Codex calls.
|
|
41
41
|
|
|
42
42
|
Flags can be passed naturally: `/devlyn:auto-resolve fix the auth bug --max-rounds 3 --skip-docs`
|
|
43
43
|
Engine examples: `--engine auto`, `--engine codex`, `--engine claude`
|
|
@@ -58,7 +58,7 @@ Cross-model evaluation (Codex): [evaluate / review / both / disabled / subsumed
|
|
|
58
58
|
|
|
59
59
|
## PHASE 1: BUILD
|
|
60
60
|
|
|
61
|
-
**Engine routing**: If `--engine` is `auto` or `codex`, read `references/engine-routing.md` "How to Spawn a Codex BUILD/FIX Agent" section. Call `mcp__codex-cli__codex` with `model: "gpt-5.4"`, `reasoningEffort: "xhigh"`, `sandbox: "workspace-write"`, `fullAuto: true`, and the full agent prompt below as the `prompt` parameter. If `--engine` is `claude
|
|
61
|
+
**Engine routing**: If `--engine` is `auto` or `codex`, read `references/engine-routing.md` "How to Spawn a Codex BUILD/FIX Agent" section. Call `mcp__codex-cli__codex` with `model: "gpt-5.4"`, `reasoningEffort: "xhigh"`, `sandbox: "workspace-write"`, `fullAuto: true`, and the full agent prompt below as the `prompt` parameter. If `--engine` is `claude`, spawn a Claude subagent as described below.
|
|
62
62
|
|
|
63
63
|
Spawn a subagent using the Agent tool with `mode: "bypassPermissions"` to investigate and implement the fix. The subagent does NOT have access to skills, so include all necessary instructions inline.
|
|
64
64
|
|
|
@@ -197,7 +197,7 @@ mcp__codex-cli__codex({
|
|
|
197
197
|
|
|
198
198
|
## Override Behavior
|
|
199
199
|
|
|
200
|
-
- `--engine claude` → all roles and phases use Claude (
|
|
200
|
+
- `--engine claude` → all roles and phases use Claude (no Codex calls)
|
|
201
201
|
- `--engine codex` → all phases use Codex for implementation/analysis, Claude only for orchestration and Chrome MCP
|
|
202
202
|
- `--engine auto` → each role and phase routes to the optimal model per this table
|
|
203
203
|
- `--engine auto` is the recommended default when Codex MCP server is available
|
|
@@ -25,10 +25,10 @@ Concretely:
|
|
|
25
25
|
Parse these from the user's invocation message:
|
|
26
26
|
|
|
27
27
|
- `--with-codex` (default: off) — bare flag. When set, OpenAI Codex runs an independent rubric pass during Phase 3.5 CHALLENGE via `mcp__codex-cli__*` MCP tools, using the same rubric as the solo pass. Codex always runs at `reasoningEffort: "xhigh"` — the entire reason for the flag is maximum reasoning from a second model family. **Ignored if `--engine` is set** (engine routing subsumes this).
|
|
28
|
-
- `--engine MODE` (
|
|
29
|
-
- `
|
|
28
|
+
- `--engine MODE` (auto) — controls which model handles each ideation phase. Modes:
|
|
29
|
+
- `auto` (default): Claude handles FRAME/EXPLORE/CONVERGE/DOCUMENT (ambiguous intent, writing quality), Codex runs the CHALLENGE rubric pass as critic (GAN dynamic). Subsumes `--with-codex`. Requires Codex MCP server.
|
|
30
30
|
- `codex`: Codex handles FRAME/EXPLORE/CONVERGE/DOCUMENT, Claude runs CHALLENGE (role reversal — builder and critic are always different models).
|
|
31
|
-
- `
|
|
31
|
+
- `claude`: all phases use Claude. No Codex calls.
|
|
32
32
|
|
|
33
33
|
**If `--engine` is `auto` or `codex`**: call `mcp__codex-cli__ping` to verify the Codex MCP server is available. If ping fails, warn the user and offer: [1] Continue with `--engine claude`, [2] Abort. Also read `references/challenge-rubric.md` up front. The engine routing table is defined in the auto-resolve skill's `references/engine-routing.md` under "Pipeline Phase Routing (ideate)".
|
|
34
34
|
|
|
@@ -44,10 +44,10 @@ Parse from `<preflight_config>`:
|
|
|
44
44
|
- `--autofix` — auto-promote all findings to roadmap items and run auto-resolve on each
|
|
45
45
|
- `--skip-browser` — skip browser validation
|
|
46
46
|
- `--skip-docs` — skip documentation audit
|
|
47
|
-
- `--engine MODE` (
|
|
48
|
-
- `
|
|
47
|
+
- `--engine MODE` (auto) — controls which model handles audit phases. Modes:
|
|
48
|
+
- `auto` (default): code-auditor uses Codex (SWE-bench Pro +11.7pp for code analysis), docs-auditor uses Claude (writing quality), browser-auditor uses Claude (Chrome MCP). Requires Codex MCP server.
|
|
49
49
|
- `codex`: code-auditor uses Codex, docs-auditor and browser-auditor use Claude.
|
|
50
|
-
- `
|
|
50
|
+
- `claude`: all auditors use Claude subagents. No Codex calls.
|
|
51
51
|
|
|
52
52
|
Example: `/devlyn:preflight --phase 2 --skip-browser`
|
|
53
53
|
Example with engine: `/devlyn:preflight --engine auto`
|
|
@@ -147,7 +147,7 @@ Codex roles cannot use TeamCreate/SendMessage — the Team Lead (you) relays the
|
|
|
147
147
|
|
|
148
148
|
**For Dual roles** (e.g., security-auditor): Run BOTH a Claude Agent teammate AND a `mcp__codex-cli__codex` call in parallel with the same prompt. Merge findings per `engine-routing.md` "How to Spawn a Dual Role" section.
|
|
149
149
|
|
|
150
|
-
If `--engine
|
|
150
|
+
If `--engine auto` or no `--engine` flag: routes each role to the optimal model based on benchmark data (see `engine-routing.md`). If `--engine claude`: all roles use Claude Agent teammates.
|
|
151
151
|
|
|
152
152
|
### Teammate Prompts
|
|
153
153
|
|
|
@@ -83,7 +83,7 @@ Codex reviewers cannot use TeamCreate/SendMessage — the Review Lead (you) coll
|
|
|
83
83
|
|
|
84
84
|
**For Dual roles** (e.g., security-reviewer): Run BOTH a Claude Agent reviewer AND a `mcp__codex-cli__codex` call in parallel with the same prompt. Merge findings per `engine-routing.md` "How to Spawn a Dual Role" section.
|
|
85
85
|
|
|
86
|
-
If `--engine
|
|
86
|
+
If `--engine auto` or no `--engine` flag: routes each reviewer role to the optimal model based on benchmark data (see `engine-routing.md`). If `--engine claude`: all roles use Claude Agent reviewers.
|
|
87
87
|
|
|
88
88
|
### Reviewer Prompts
|
|
89
89
|
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "devlyn-cli",
|
|
3
|
-
"version": "1.12.
|
|
3
|
+
"version": "1.12.3",
|
|
4
4
|
"description": "AI development toolkit for Claude Code — ideate, auto-resolve, and ship with context engineering and agent orchestration",
|
|
5
5
|
"homepage": "https://github.com/fysoul17/devlyn-cli#readme",
|
|
6
6
|
"bin": {
|