devlyn-cli 1.5.1 → 1.5.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CLAUDE.md CHANGED
@@ -61,7 +61,7 @@ This runs the full pipeline automatically: **Build → Browser Validate → Eval
61
61
  For web projects, the Browser Validate phase starts the dev server and tests the implemented feature in a real browser — clicking buttons, filling forms, verifying results. If the feature doesn't work, findings feed back into the fix loop.
62
62
 
63
63
  Optional flags:
64
- - `--max-rounds 3` — increase max evaluate-fix iterations (default: 2)
64
+ - `--max-rounds 6` — increase max evaluate-fix iterations (default: 4)
65
65
  - `--skip-browser` — skip browser validation phase (auto-skipped for non-web changes)
66
66
  - `--skip-review` — skip team-review phase
67
67
  - `--skip-clean` — skip clean phase
@@ -11,11 +11,21 @@ $ARGUMENTS
11
11
 
12
12
  <pipeline_workflow>
13
13
 
14
+ <autonomy_contract>
15
+ This pipeline runs hands-free. The user launches it to walk away and come back to finished work, so the quality of this run is measured by how far it gets without human intervention. Apply these behaviors throughout every phase:
16
+
17
+ 1. **Make decisions autonomously and log them in the final report.** When you would otherwise ask the user something ("Should I commit this?", "Ready to proceed?", "Which approach?"), pick the safe default, proceed, and record the decision in PHASE 8's report so the user can review it at the end.
18
+ 2. **Run only the phases defined below, in the order given.** Doc updates, roadmap edits, changelog entries, and planning-doc changes belong in PHASE 7 (Docs). Resist inserting them earlier as freelance pre-work.
19
+ 3. **Delegate all file changes to spawned subagents.** As the orchestrator, your actions are: parse input, spawn phase agents, read handoff files (`.devlyn/*.md`), run `git` commands, branch on verdicts, and emit the final report.
20
+ 4. **Continue through the pipeline by default.** Stop only for: (a) a subagent reporting an unrecoverable failure, (b) PHASE 1 producing zero code changes, (c) `max-rounds` reached — in which case continue to PHASE 3 with a warning rather than halting. Every other situation means move on to the next phase.
21
+ 5. **Treat questions as a signal to act instead.** If you notice yourself drafting a question to the user mid-pipeline, convert it into a decision + log entry and spawn the next phase.
22
+ </autonomy_contract>
23
+
14
24
  ## PHASE 0: PARSE INPUT
15
25
 
16
26
  1. Extract the task/issue description from `<pipeline_config>`.
17
27
  2. Determine optional flags from the input (defaults in parentheses):
18
- - `--max-rounds N` (2) — max evaluate-fix loops before stopping with a report
28
+ - `--max-rounds N` (4) — max evaluate-fix loops before stopping with a report
19
29
  - `--skip-review` (false) — skip team-review phase
20
30
  - `--security-review` (auto) — run dedicated security audit. Auto-detects: runs when changes touch auth, secrets, user data, API endpoints, env/config, or crypto. Force with `--security-review always` or skip with `--security-review skip`
21
31
  - `--skip-clean` (false) — skip clean phase
@@ -136,7 +146,9 @@ You are an independent evaluator. Your job is to grade work produced by another
136
146
  - pattern description
137
147
  ```
138
148
 
139
- Verdict rules: BLOCKED = any CRITICAL issues. NEEDS WORK = HIGH issues that should be fixed. PASS WITH ISSUES = only MEDIUM/LOW. PASS = clean.
149
+ Verdict rules: BLOCKED = any CRITICAL issues. NEEDS WORK = HIGH or MEDIUM issues that should be fixed. PASS WITH ISSUES = only LOW cosmetic notes. PASS = clean.
150
+
151
+ Important: Do NOT label findings as "pre-existing" or "out of scope" to avoid fixing them. If a problem exists in the current code and relates to the done criteria, it's a finding regardless of when it was introduced. The goal is working software, not blame attribution.
140
152
 
141
153
  Calibration examples to guide your judgment:
142
154
  - A catch block that logs but doesn't surface error to user = HIGH (not MEDIUM). Logging is not error handling.
@@ -151,10 +163,10 @@ Do NOT delete `.devlyn/done-criteria.md` or `.devlyn/EVAL-FINDINGS.md` — the o
151
163
  3. **If `--with-codex` includes `evaluate` or `both`**: Read `references/codex-integration.md` and follow the "PHASE 2-CODEX: CROSS-MODEL EVALUATE" section. This runs Codex as a second evaluator and merges findings into `EVAL-FINDINGS.md`.
152
164
  4. Branch on verdict (from the merged findings if Codex was used):
153
165
  - `PASS` → skip to PHASE 3
154
- - `PASS WITH ISSUES` → skip to PHASE 3 (issues are shippable)
166
+ - `PASS WITH ISSUES` → go to PHASE 2.5 (fix loop) — LOW-only issues are still issues; fix them
155
167
  - `NEEDS WORK` → go to PHASE 2.5 (fix loop)
156
168
  - `BLOCKED` → go to PHASE 2.5 (fix loop)
157
- 5. If `.devlyn/EVAL-FINDINGS.md` was not created, treat as PASS WITH ISSUES and log a warning
169
+ 5. If `.devlyn/EVAL-FINDINGS.md` was not created, treat as NEEDS WORK and log a warning — absence of evidence is not evidence of absence
158
170
 
159
171
  ## PHASE 2.5: FIX LOOP (conditional)
160
172
 
@@ -164,7 +176,7 @@ Spawn a subagent using the Agent tool with `mode: "bypassPermissions"` to fix th
164
176
 
165
177
  Agent prompt — pass this to the Agent tool:
166
178
 
167
- Read `.devlyn/EVAL-FINDINGS.md` — it contains specific issues found by an independent evaluator. Fix every CRITICAL and HIGH finding. Address MEDIUM findings if straightforward.
179
+ Read `.devlyn/EVAL-FINDINGS.md` — it contains specific issues found by an independent evaluator. Fix every finding regardless of severity (CRITICAL, HIGH, MEDIUM, and LOW). The pipeline loops until the evaluator returns PASS — there is no "shippable with issues" shortcut.
168
180
 
169
181
  The original done criteria are in `.devlyn/done-criteria.md` — your fixes must still satisfy those criteria. Do not delete or weaken criteria to make them pass.
170
182
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "devlyn-cli",
3
- "version": "1.5.1",
3
+ "version": "1.5.3",
4
4
  "description": "Claude Code configuration toolkit for teams",
5
5
  "bin": {
6
6
  "devlyn": "bin/devlyn.js"