@ai-dev-methodologies/rlp-desk 0.3.2 → 0.3.4
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +10 -0
- package/package.json +1 -1
- package/src/commands/rlp-desk.md +35 -13
package/README.md
CHANGED
|
@@ -183,6 +183,14 @@ RLP Desk supports two execution modes. Both honor the same governance protocol.
|
|
|
183
183
|
| Inside tmux session | **Works** | **Works** — panes split in current window |
|
|
184
184
|
| Outside tmux session | **Works** | **Rejected** — "start tmux first" |
|
|
185
185
|
|
|
186
|
+
### Choosing Your Mode
|
|
187
|
+
|
|
188
|
+
| Need | Use |
|
|
189
|
+
|------|-----|
|
|
190
|
+
| Reliable autonomous loop (no interruption) | `--mode tmux` |
|
|
191
|
+
| Interactive development, quick tasks | `--mode agent` (default) |
|
|
192
|
+
| Long campaigns, CI, overnight runs | `--mode tmux` |
|
|
193
|
+
|
|
186
194
|
### Agent Mode (default) — "Smart Mode"
|
|
187
195
|
|
|
188
196
|
```
|
|
@@ -193,6 +201,8 @@ The current Claude Code session acts as the Leader, dispatching Workers and Veri
|
|
|
193
201
|
|
|
194
202
|
- Works anywhere — no tmux required
|
|
195
203
|
- Dynamic model routing — Leader upgrades models on failure
|
|
204
|
+
|
|
205
|
+
**Known limitation:** Agent mode runs inside Claude Code's turn-based request-response model. If the LLM outputs text without a tool call, the turn terminates and the loop pauses until the user sends "continue." This is a platform constraint — the protocol mitigates it but cannot guarantee 100% uninterrupted execution. For guaranteed autonomous loops, use tmux mode.
|
|
196
206
|
- Fix Loop — extracts verifier issues and feeds them back to the next worker
|
|
197
207
|
- Best for interactive development
|
|
198
208
|
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@ai-dev-methodologies/rlp-desk",
|
|
3
|
-
"version": "0.3.
|
|
3
|
+
"version": "0.3.4",
|
|
4
4
|
"description": "Fresh-context iterative loops for Claude Code — autonomous task completion with independent verification",
|
|
5
5
|
"scripts": {
|
|
6
6
|
"postinstall": "node scripts/postinstall.js",
|
package/src/commands/rlp-desk.md
CHANGED
|
@@ -79,19 +79,35 @@ If brainstorm was done, auto-fill PRD and test-spec with the results.
|
|
|
79
79
|
Tell the user:
|
|
80
80
|
1. The scaffold has been created — list the generated files
|
|
81
81
|
2. Ask them to review/edit the PRD and test-spec if needed
|
|
82
|
-
3.
|
|
82
|
+
3. Present run options with explanations and ONE recommendation. The user MUST copy and paste the command themselves:
|
|
83
|
+
|
|
83
84
|
```
|
|
84
|
-
|
|
85
|
-
|
|
86
|
-
|
|
87
|
-
|
|
88
|
-
|
|
89
|
-
|
|
90
|
-
|
|
91
|
-
|
|
92
|
-
|
|
85
|
+
Available run commands (copy the one you want):
|
|
86
|
+
|
|
87
|
+
# Recommended for most cases — agent mode, per-US verification, debug logging:
|
|
88
|
+
/rlp-desk run <slug> --debug
|
|
89
|
+
|
|
90
|
+
# With self-verification campaign report (recommended for MEDIUM+ risk):
|
|
91
|
+
/rlp-desk run <slug> --debug --with-self-verification
|
|
92
|
+
|
|
93
|
+
# Tmux mode for long campaigns with real-time visibility:
|
|
94
|
+
/rlp-desk run <slug> --mode tmux --debug
|
|
95
|
+
|
|
96
|
+
# Cross-engine consensus (requires codex CLI installed):
|
|
97
|
+
/rlp-desk run <slug> --debug --verify-consensus
|
|
98
|
+
|
|
99
|
+
# Full options reference:
|
|
100
|
+
# --mode agent|tmux Agent mode (default) or tmux shell leader
|
|
101
|
+
# --debug Always-on detailed logging (recommended)
|
|
102
|
+
# --with-self-verification Post-campaign analysis report
|
|
103
|
+
# --verify-mode per-us|batch Per-US (default) or batch verification
|
|
104
|
+
# --verify-consensus Both claude+codex must pass
|
|
105
|
+
# --worker-model MODEL haiku/sonnet/opus (default: sonnet)
|
|
106
|
+
# --verifier-model MODEL haiku/sonnet/opus (default: opus)
|
|
107
|
+
# --max-iter N Max iterations (default: 100)
|
|
93
108
|
```
|
|
94
|
-
|
|
109
|
+
|
|
110
|
+
**CRITICAL: Do NOT offer to run for the user. Do NOT ask "실행할까요?" or "shall I run?". The user MUST type the run command themselves. Just present the options, recommend one, and STOP.**
|
|
95
111
|
|
|
96
112
|
---
|
|
97
113
|
|
|
@@ -170,7 +186,13 @@ DEBUG=<1 if --debug, else 0> \
|
|
|
170
186
|
|
|
171
187
|
### Leader Loop
|
|
172
188
|
|
|
173
|
-
**CRITICAL: DO NOT STOP between iterations.** You MUST continue the loop automatically until a sentinel is written (COMPLETE or BLOCKED) or max_iter is reached. Do NOT pause to ask the user. Do NOT wait for confirmation. The loop is fully autonomous
|
|
189
|
+
**CRITICAL: DO NOT STOP between iterations.** You MUST continue the loop automatically until a sentinel is written (COMPLETE or BLOCKED) or max_iter is reached. Do NOT pause to ask the user. Do NOT wait for confirmation. The loop is fully autonomous.
|
|
190
|
+
|
|
191
|
+
**PLATFORM CONSTRAINT (Agent mode):** In Agent mode, the Leader is an LLM in Claude Code's turn-based model. A turn ENDS when the response contains no tool calls. This means:
|
|
192
|
+
- **NEVER output plain text without an accompanying tool call.** Text-only output = turn ends = loop stops.
|
|
193
|
+
- **Use `Bash("echo '...'")` for all status reports** instead of plain text. This keeps the tool-call chain alive.
|
|
194
|
+
- **After every step result, IMMEDIATELY start the next step's tool call in the SAME response.** For example, after reading the verdict (⑦c), report via Bash("echo") AND start ⑧'s tool calls in one response.
|
|
195
|
+
- If you output "Iter 1 complete, moving to iter 2" as plain text without a tool call, the turn terminates and the loop breaks. This is a platform constraint, not a compliance issue — no amount of "DO NOT STOP" text can override it.
|
|
174
196
|
|
|
175
197
|
If `--debug`, at loop start debug_log: `[PLAN] slug=<slug> max_iter=<N> worker_engine=<engine> worker_model=<model> verifier_engine=<engine> verifier_model=<model> verify_mode=<mode> consensus=<0|1> consensus_scope=<scope>`
|
|
176
198
|
|
|
@@ -324,7 +346,7 @@ After the primary verifier runs, run a second verifier with the OTHER engine:
|
|
|
324
346
|
- Files changed via `git diff --stat HEAD~1 HEAD` `[git-measured]`
|
|
325
347
|
- Verifier verdict `[leader-measured]`
|
|
326
348
|
- Write `status.json`
|
|
327
|
-
- Report:
|
|
349
|
+
- Report via tool call: `Bash("echo 'Iter N | US-NNN | verdict | model | next_action'")` — NEVER plain text. This keeps the turn alive for the next iteration.
|
|
328
350
|
- **Always**: append to baseline.log: `[timestamp] iter=N verdict=<pass|fail|continue> us=<us_id> model=<worker_model>`
|
|
329
351
|
- If `--debug`: debug_log `[EXEC] iter=N phase=result status=<result> consecutive_failures=<N> verified_us=<list>`
|
|
330
352
|
|