npm - input-kanban - Versions diffs - 0.0.13 → 0.0.15 - Mend

input-kanban 0.0.13 → 0.0.15

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (9) hide show

package/ENVIRONMENT.md CHANGED Viewed

@@ -40,7 +40,7 @@ input-kanban \
 ## Notes
 - `KANBAN_DEFAULT_WORKSPACE` / `--workspace` should point to the local directory where work should run; `KANBAN_DEFAULT_REPO` / `--repo` remain compatibility aliases.
-- `input-kanban serve` starts a lightweight background scheduler that uses the same orchestrator auto-advance path as CLI `submit --auto` / `input-kanban auto <runId>`. It advances planned runs, serial batches, final judge startup, and bounded automatic retries without relying on an open browser tab.
+- `input-kanban serve` starts a lightweight background scheduler that uses the same orchestrator auto-advance path as CLI `submit --auto` / `input-kanban auto <runId>`. It advances planned runs, serial batches, final judge startup, and bounded automatic retries without relying on an open browser tab. If a run was created with `--plan-approval` / plan approval gate enabled, the scheduler stops after planning until the user confirms the plan.
 - `KANBAN_RUNNER` / `--runner tmux` runs Codex tasks inside tmux windows while keeping scheduling and status tracking in the Node.js orchestrator.
 - `KANBAN_RUNNER=tmux` is optional. Use it when you want live terminal visibility into planner, worker, and final judge sessions.
 - With `KANBAN_RUNNER=tmux`, stopping and restarting `input-kanban serve` does not interrupt already-running Codex sessions; tmux keeps them alive and the scheduler resumes after restart. Do not assume the same safety for `headless` runner child processes.

package/PROJECT_GUIDE.md CHANGED Viewed

@@ -121,6 +121,7 @@ Supported submit options:
 --task-file <path|->
 --max-parallel <n>
 --worker-sandbox <read-only|workspace-write|danger-full-access>
+--plan-approval
 --runner <headless|tmux>
 --runs-dir <path>
 --auto
@@ -130,7 +131,7 @@ Supported submit options:
 --poll-ms <ms>
 ```
-`input-kanban submit` creates a run and starts the planner. Task content can come from `--task <text>` or `--task-file <path|->`; omitting `--workspace` uses the current working directory as the target workspace, and `--repo` remains a compatibility alias. Omitting `--label` derives the run label from the first non-empty task line. Auto mode is the default for submit: it keeps polling the run through the shared orchestrator auto-advance path, dispatches batches when the plan is ready, and starts the final judge once all batches complete. `--no-auto` keeps submit to create + plan only. `-d` / `--detach` starts a background supervisor process for the same auto loop and lets the submitting terminal return immediately. The Web server also starts a lightweight scheduler that uses this shared path, so serial batch advancement does not depend on an open browser tab. The submit output includes `input-kanban status <runId> --watch` for terminal-side observation. Because it writes to the same runs directory as the Web server, CLI-created runs are visible in the 8787 dashboard when both processes use the same `--runs-dir`.
+`input-kanban submit` creates a run and starts the planner. Task content can come from `--task <text>` or `--task-file <path|->`; omitting `--workspace` uses the current working directory as the target workspace, and `--repo` remains a compatibility alias. Omitting `--label` derives the run label from the first non-empty task line. Auto mode is the default for submit: it keeps polling the run through the shared orchestrator auto-advance path, dispatches batches when the plan is ready, and starts the final judge once all batches complete. `--plan-approval` adds a durable Planner → Worker gate: auto advances through planning, then pauses at the completed plan until the user confirms it from the Web dashboard by clicking `开始执行`. `--no-auto` keeps submit to create + plan only for the current CLI process, but a running Web server scheduler can still advance the run unless a durable gate such as `--plan-approval` is configured. `-d` / `--detach` starts a background supervisor process for the same auto loop and lets the submitting terminal return immediately. The Web server also starts a lightweight scheduler that uses this shared path, so serial batch advancement does not depend on an open browser tab. The submit output includes `input-kanban status <runId> --watch` for terminal-side observation. Because it writes to the same runs directory as the Web server, CLI-created runs are visible in the 8787 dashboard when both processes use the same `--runs-dir`.
 Default behavior:
@@ -160,8 +161,8 @@ Key points:
 - `result` prefers `judge/verdict.json` and falls back to `judge/last_message.md`; `--copy` copies the result to the clipboard.
 - `stop` requires an explicit `runId` and uses the same stop path as the Web dashboard.
 - `retry` retries failed/unknown workers while preserving the failed attempt directory.
-- `submit` defaults to auto mode: planner -> dispatch -> final judge, with one automatic retry for `batch_blocked` by default. `--no-auto` keeps create + plan only, and `-d/--detach` moves the auto loop to a background supervisor.
-- The Web dashboard now follows the same default auto behavior while the page is open: after planning it auto-dispatches planned runs and auto-starts the final judge once all batches complete.
+- `submit` defaults to auto mode: planner -> dispatch -> final judge, with one automatic retry for `batch_blocked` by default. `--plan-approval` changes this to planner -> wait for plan confirmation -> dispatch -> final judge. `--no-auto` keeps create + plan only for the CLI process, and `-d/--detach` moves the auto loop to a background supervisor.
+- The Web server scheduler follows the same shared auto behavior: after planning it auto-dispatches planned runs and auto-starts the final judge once all batches complete, unless a plan approval gate is required and still unapproved.
 Example agent loop:

package/README.md CHANGED Viewed

@@ -56,7 +56,15 @@ input-kanban submit --task "修复登录问题，并补充回归测试" --label
 `submit` 默认会创建任务批次、发起拆分、自动派发所有批次，并在全部完成后自动发起最终验收。默认 workspace 是当前目录；如果不传 `--label`，任务批次名称会从任务内容自动生成。它使用同一个 runs 目录，所以只要 8787 Web 看板也使用相同的 `--runs-dir`，CLI 创建的任务会在 Web 界面里可见。
-`input-kanban serve` 会启动一个轻量后台 scheduler，持续刷新并推进未完成的 run：plan ready 后派发 batch、串行 batch 完成后启动下一批、全部 batch 完成后启动 final judge。CLI `submit --auto` / `input-kanban auto <runId>` 与 Web server 共用同一套 orchestrator 自动推进逻辑，因此任务推进不再依赖浏览器页面是否打开或刷新。
+如果希望 Planner 拆分完成后先看一眼计划，再放行执行，可以加计划确认 Gate：
+```bash
+input-kanban submit --task-file task.md --plan-approval
+```
+开启后，auto 会推进到 Planner 完成并停在“已拆分，待确认”；用户在 Web 上点一次“开始执行”即确认计划，随后 worker 批次和 final judge 继续自动推进。
+`input-kanban serve` 会启动一个轻量后台 scheduler，持续刷新并推进未完成的 run：plan ready 后派发 batch、串行 batch 完成后启动下一批、全部 batch 完成后启动 final judge。CLI `submit --auto` / `input-kanban auto <runId>` 与 Web server 共用同一套 orchestrator 自动推进逻辑，因此任务推进不再依赖浏览器页面是否打开或刷新。若 run 配置了计划确认 Gate，auto/scheduler 会停在该 Gate，不会越过未确认的计划。
 如果希望提交后立即返回，让任务在后台自动执行，可以加 `-d` / `--detach`：

package/RELEASE_NOTES.md CHANGED Viewed

@@ -1,5 +1,41 @@
 # Release Notes
+## v0.0.15
+### Highlights
+- Absorb PR #3 recovery hardening without directly merging its older `v0.0.13` branch, preserving the `v0.0.14` Plan Approval Gate and Web layout changes.
+- Route blocked-run dashboard execution actions to `/api/runs/:id/retry`, so `batch_blocked` runs retry failed/unknown tasks instead of hitting the dispatch endpoint.
+- Remove stale Web `workers_completed` / `workers_failed` UI state handling now that backend run status uses `batches_completed` and `batch_blocked`.
+- Harden final judge starts: reject archived/stopped runs, duplicate running judges, and completed judges; failed judges are archived to `judge_attempts/` before retrying.
+- Add short `/api/codex` detection caching to avoid repeatedly spawning Codex detection during frequent dashboard refreshes.
+- Keep task-table Codex session IDs and their copy buttons on one line by widening the session column and using a compact inline layout.
+### Verification
+- `npm run check` passed locally with 84 tests.
+- `npm pack --dry-run` passed for `input-kanban@0.0.15`.
+- Windows release-candidate validation on `zhangxing_win` passed with 84 tests.
+- Windows Web smoke confirmed `/api/health`, `/api/codex`, Plan Approval UI, compact header copy tools, and one-line Codex session copy layout.
+## v0.0.14
+### Highlights
+- Add a durable Plan Approval Gate: runs can now pause after planning until the generated plan is manually confirmed.
+- Add `input-kanban submit --plan-approval`, which lets auto advance through planning and then stop at the unapproved plan gate instead of dispatching workers immediately.
+- Make `dispatchRun()` confirm the plan gate before starting workers, so Web `开始执行` means “approve this plan and continue execution.”
+- Clarify auto semantics in docs: auto advances to completion, failure, or the first unapproved gate; `--no-auto` is not a durable scheduler gate.
+- Update the Web create form wording to `计划生成后手动确认后执行` and show planned gated runs as `已拆分，待确认`.
+- Rework the run detail header layout: keep title/status on the left, move `Run ID ⧉` and `tmux ⧉` copy tools to a lightweight right-side tool group, and keep long unreadable IDs out of metadata chips.
+- Record the Agent Profile / Candidate / Reviewer design direction in the private KB while keeping the current executor Codex-only.
+### Verification
+- `npm run check` passed locally with 78 tests.
+- `npm run check` passed on the remote Windows validation host `zhangxing_win` with 78 tests for the release-candidate working tree.
+- Windows Web smoke confirmed the plan approval input, compact title copy tools, `/api/health`, and `/api/codex`.
 ## v0.0.13
 ### Highlights

package/bin/input-kanban.js CHANGED Viewed

@@ -171,7 +171,7 @@ function parseSubmitArgs(argv) {
   const args = {
     host: '127.0.0.1', port: 8787, workspace: undefined, repo: undefined, runsDir: undefined, codexBin: undefined,
     runner: undefined, label: undefined, taskText: undefined, taskFile: undefined, maxParallel: 3,
-    workerSandbox: 'workspace-write', auto: true, detach: false, watch: true, json: false, pollMs: 3000, maxRetries: 1, help: false
+    workerSandbox: 'workspace-write', planApproval: false, auto: true, detach: false, watch: true, json: false, pollMs: 3000, maxRetries: 1, help: false
   };
   for (let i = 0; i < argv.length; i++) {
     const arg = argv[i];
@@ -190,6 +190,7 @@ function parseSubmitArgs(argv) {
     else if (arg === '--task-file') args.taskFile = next();
     else if (arg === '--max-parallel') args.maxParallel = Number(next());
     else if (arg === '--worker-sandbox') args.workerSandbox = validateSandbox(next(), '--worker-sandbox');
+    else if (arg === '--plan-approval') args.planApproval = true;
     else if (arg === '--auto') { args.auto = true; args.watch = true; }
     else if (arg === '--no-auto') { args.auto = false; args.watch = false; }
     else if (arg === '--detach' || arg === '-d') args.detach = true;
@@ -290,6 +291,7 @@ Submit options:
   --task-file <path>         Read task description from file, use - for stdin
   --max-parallel <n>         Default max parallel workers, default 3
   --worker-sandbox <mode>    read-only, workspace-write, or danger-full-access
+  --plan-approval            Pause after planning until the generated plan is confirmed
   --runner <mode>            Runner mode: headless or tmux
   --runs-dir <path>          Runtime runs directory shared with the Web UI
   --auto                     Plan, dispatch all batches, judge, and watch, default for submit
@@ -318,6 +320,7 @@ Options:
   --task-file <path>         Read task description from file, use - for stdin
   --max-parallel <n>         Default max parallel workers, default 3
   --worker-sandbox <mode>    read-only, workspace-write, or danger-full-access
+  --plan-approval            Pause after planning until the generated plan is confirmed
   --runner <mode>            Runner mode: headless or tmux
   --runs-dir <path>          Runtime runs directory shared with the Web UI
   --auto                     Plan, dispatch all batches, judge, and watch, default for submit
@@ -735,7 +738,8 @@ async function submit(args) {
     workspace: process.env.KANBAN_DEFAULT_WORKSPACE || process.env.KANBAN_DEFAULT_REPO,
     repo: process.env.KANBAN_DEFAULT_REPO,
     maxParallel: args.maxParallel,
-    workerSandbox: args.workerSandbox
+    workerSandbox: args.workerSandbox,
+    planApproval: args.planApproval
   });
   if (!args.json) {
     console.log(`已创建任务批次: ${state.runId}`);

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "input-kanban",
-  "version": "0.0.13",
+  "version": "0.0.15",
   "type": "module",
   "bin": {
     "input-kanban": "bin/input-kanban.js"

package/public/index.html CHANGED Viewed

@@ -41,20 +41,20 @@
     @media (prefers-reduced-motion: reduce) { button, button.state-active, button.state-pending::after, .refresh-pulse-chip.pulse .refresh-pulse-dot { animation: none !important; transition: none !important; } }
     table { width: 100%; border-collapse: collapse; font-size: 13px; table-layout: fixed; }
     th, td { border-bottom: 1px solid var(--line); padding: 9px 8px; text-align: left; vertical-align: top; }
-    th:nth-child(1), td:nth-child(1) { width: 34%; }
+    th:nth-child(1), td:nth-child(1) { width: 32%; }
     th:nth-child(2), td:nth-child(2) { width: 92px; white-space: nowrap; }
     th:nth-child(3), td:nth-child(3) { width: 132px; }
     th:nth-child(4), td:nth-child(4) { width: 64px; white-space: nowrap; }
     th:nth-child(5), td:nth-child(5) { width: 96px; }
-    th:nth-child(6), td:nth-child(6) { width: 92px; }
+    th:nth-child(6), td:nth-child(6) { width: 116px; }
     th:nth-child(7), td:nth-child(7) { width: 66px; }
     th:nth-child(8), td:nth-child(8) { width: 94px; }
     th { color: #cbd5e1; font-size: 12px; text-transform: uppercase; letter-spacing: .06em; }
     tr:hover { background: #162033; cursor: pointer; }
     .pill { display: inline-block; padding: 3px 8px; border-radius: 999px; font-size: 12px; font-weight: 800; background: var(--gray); line-height: 1.3; }
-    .completed, .judged, .workers_completed, .planned, .batches_completed { background: var(--green); }
+    .completed, .judged, .planned, .batches_completed { background: var(--green); }
     .running, .planning, .judging { background: var(--blue); }
-    .failed, .workers_failed, .plan_failed, .judge_failed, .unknown, .batch_blocked { background: var(--red); }
+    .failed, .plan_failed, .judge_failed, .unknown, .batch_blocked { background: var(--red); }
     .plan_empty { background: var(--orange); }
     .pending, .created { background: var(--gray); }
     .batch-row td { border-top: 3px solid var(--line-strong); background: #101827; font-weight: 800; font-size: 14px; padding-top: 13px; }
@@ -91,8 +91,11 @@
     .run-card-meta .meta-chip { padding: 4px 7px; font-size: 11px; }
     .run-card-meta .meta-chip.long .meta-value { max-width: 248px; }
     .build-header { border: 1px solid var(--line); border-radius: 12px; padding: 14px; background: var(--panel-2); margin-bottom: 14px; }
-    .build-title { display: flex; align-items: center; gap: 10px; font-size: 22px; font-weight: 900; }
-    .build-meta { margin-top: 10px; display: flex; flex-wrap: wrap; gap: 7px; align-items: center; }
+    .build-title { display: grid; grid-template-columns: minmax(0, 1fr) auto; gap: 10px; align-items: center; font-size: 22px; font-weight: 900; }
+    .build-title-main { min-width: 0; display: flex; align-items: center; gap: 10px; }
+    .build-title-text { min-width: 0; overflow: hidden; text-overflow: ellipsis; white-space: nowrap; }
+    .build-title-tools { display: inline-flex; align-items: center; gap: 6px; justify-self: end; }
+    .build-meta { margin-top: 12px; display: flex; flex-wrap: wrap; gap: 7px; align-items: center; }
     .meta-chip { display: inline-flex; align-items: center; gap: 6px; max-width: 100%; padding: 5px 8px; border: 1px solid var(--line); border-radius: 999px; background: #020617; color: #cbd5e1; font-size: 12px; line-height: 1.2; }
     .meta-chip.danger { border-color: #f97316; color: #fed7aa; background: rgba(180,83,9,.14); }
     .meta-label { color: var(--muted); font-weight: 800; white-space: nowrap; }
@@ -106,12 +109,17 @@
     .log-panel { margin-top: 16px; }
     .file-tabs button { font-size: 13px; }
     .copy-btn { padding: 2px 6px; margin: 0 0 0 6px; border-radius: 6px; font-size: 12px; line-height: 1.2; background: var(--gray); vertical-align: middle; }
+    .title-copy-btn { display: inline-flex; align-items: center; gap: 5px; opacity: .78; font-size: 12px; padding: 5px 8px; margin: 0; }
+    .title-copy-btn:hover, .title-copy-btn:focus { opacity: 1; }
+    .title-copy-btn .icon-svg { width: 12px; height: 12px; }
     .rename-btn { opacity: 0; pointer-events: none; transition: opacity .15s ease; }
     .run-card:hover .rename-btn, .run-card:focus-within .rename-btn, .build-title:hover .rename-btn, .build-title:focus-within .rename-btn, .rename-btn:focus { opacity: 1; pointer-events: auto; }
     .run-card-title-actions { display: inline-flex; align-items: center; gap: 3px; }
     .archive-confirm-btn { min-width: 46px; padding: 4px 10px; border-color: rgba(248,113,113,.85); background: var(--red) !important; color: white; font-weight: 900; }
     .icon-svg { width: 14px; height: 14px; display: block; }
-    .session-cell { word-break: break-all; }
+    .session-cell-wrap { display: inline-flex; align-items: center; gap: 5px; white-space: nowrap; }
+    .session-cell { min-width: 0; font-variant-numeric: tabular-nums; }
+    .session-cell-wrap .copy-btn { flex: 0 0 auto; margin-left: 0; }
     .row-actions { display: flex; align-items: center; justify-content: flex-end; gap: 6px; }
     .row-actions .danger, .row-actions .secondary { margin: 0; }
     .status-stack { display: inline-flex; flex-direction: column; align-items: flex-start; gap: 5px; }
@@ -169,6 +177,8 @@
         <option value="danger-full-access">danger-full-access（高风险，跳过沙箱限制）</option>
       </select>
       <div class="muted">仅影响 worker；任务拆分和汇总验收仍保持 read-only。若执行过程提示 Permission denied / sandbox denied，这通常不是任务本身失败，而是当前沙箱能力不足；在可信工作区可改用 danger-full-access。DNS / 网络失败则通常需要检查代理、VPN 或本地 evidence。</div>
+      <label><input id="planApproval" type="checkbox" style="width:auto; margin-right:6px;" />计划生成后手动确认后执行</label>
+      <div class="muted">开启后会停在“已拆分，待确认”；点击“开始执行”后继续自动执行到验收。</div>
       <label>任务说明</label><textarea id="taskText" placeholder="粘贴任务说明"></textarea>
       <div class="toolbar">
         <button onclick="createRun()">创建批次</button>
@@ -265,7 +275,6 @@ function userFacingErrorMessage(error) {
 const statusText = {
   created: '已创建', pending: '等待中', planning: '拆分中', planned: '已拆分',
   running: '执行中', completed: '已完成', failed: '失败', unknown: '未知',
-  workers_completed: '子任务完成', workers_failed: '子任务失败',
   batches_completed: '批次完成', batch_blocked: '批次阻塞', plan_empty: '拆分为空', stopped: '已停止',
   judging: '验收中', judged: '已验收', plan_failed: '拆分失败', judge_failed: '验收失败'
 };
@@ -273,6 +282,9 @@ const roleText = { planner: '任务拆分', judge: '汇总验收' };
 function displayStatus(s) { return statusText[s] || s || '-'; }
 function displayRole(s) { return roleText[s] || s || '-'; }
 function pill(s) { return `<span class="pill ${s || ''}">${displayStatus(s)}</span>`; }
+function planApprovalPending(state = currentState) { return !!(state?.gates?.planApproval?.required && !state.gates.planApproval.approved); }
+function runStatusLabel(state = currentState) { return planApprovalPending(state) ? '已拆分，待确认' : displayStatus(state?.status); }
+function statusPill(state = currentState) { return `<span class="pill ${state?.status || ''}">${esc(runStatusLabel(state))}</span>`; }
 function esc(s) { return String(s ?? '').replace(/[&<>]/g, c => ({'&':'&amp;','<':'&lt;','>':'&gt;'}[c])); }
 function formatDateTime(s) {
   if (!s) return '-';
@@ -314,6 +326,12 @@ function gitChip() { return '<span class="meta-chip" title="Git 工作区"><span
 function editIcon() {
   return '<svg class="icon-svg" viewBox="0 0 24 24" focusable="false" aria-hidden="true"><path d="M4 16.5V20h3.5L18.1 9.4l-3.5-3.5L4 16.5Z" fill="currentColor"/><path d="m16 4.5 3.5 3.5" stroke="currentColor" stroke-width="2" stroke-linecap="round"/></svg>';
 }
+function copyIcon() {
+  return '<svg class="icon-svg" viewBox="0 0 24 24" focusable="false" aria-hidden="true"><path d="M8 8h11v11H8V8Z" fill="none" stroke="currentColor" stroke-width="2"/><path d="M5 16H4V4h12v1" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round"/></svg>';
+}
+function titleCopyLabel(kind) {
+  return kind === 'tmux' ? `<span>tmux</span>${copyIcon()}` : `<span>Run ID</span>${copyIcon()}`;
+}
 function archiveIcon() {
   return '<svg class="icon-svg" viewBox="0 0 24 24" focusable="false" aria-hidden="true"><path d="M4 5h16v4H4V5Z" fill="currentColor"/><path d="M6 10h12v9H6v-9Z" fill="currentColor" opacity=".72"/><path d="M9 13h6" stroke="#020617" stroke-width="2" stroke-linecap="round"/></svg>';
 }
@@ -407,7 +425,7 @@ function initializeWorkerSandboxPreference() {
 }
 async function createRun() {
   saveWorkerSandboxPreference();
-  const body = { label: label.value, workspace: repo.value, repo: repo.value, maxParallel: maxParallel.value, workerSandbox: workerSandbox.value, taskText: taskText.value };
+  const body = { label: label.value, workspace: repo.value, repo: repo.value, maxParallel: maxParallel.value, workerSandbox: workerSandbox.value, planApproval: planApproval.checked, taskText: taskText.value };
   const r = await api('/api/runs', { method: 'POST', body: JSON.stringify(body) });
   selectedRun = r.runId; selectedTask = null; selectedFileName = null;
   clearFileView();
@@ -444,7 +462,7 @@ function renderRunList() {
   const visibleRuns = latestRuns.slice(0, runListVisibleCount);
   const cards = visibleRuns.map(r => `
     <div class="run-card ${selectedRun === r.runId ? 'active' : ''}" onclick="selectRun('${r.runId}')" onmouseleave="clearArchiveConfirm('${r.runId}')">
-      <div class="run-card-title"><span class="run-card-name-wrap"><span class="run-card-name" title="${esc(r.label)}">${esc(r.label)}</span><span class="run-card-title-actions"><button class="secondary copy-btn rename-btn" title="修改任务批次名称" onclick="renameRunLabel(event, '${r.runId}')">${editIcon()}</button><button class="secondary copy-btn rename-btn ${pendingArchiveRunId === r.runId ? 'archive-confirm-btn' : ''}" title="${pendingArchiveRunId === r.runId ? '再次点击确认归档' : '归档任务批次（运行中请先停止）'}" onclick="archiveRunFromCard(event, '${r.runId}')">${pendingArchiveRunId === r.runId ? '确认' : archiveIcon()}</button></span></span><span>${pill(r.status)}</span></div>
+      <div class="run-card-title"><span class="run-card-name-wrap"><span class="run-card-name" title="${esc(r.label)}">${esc(r.label)}</span><span class="run-card-title-actions"><button class="secondary copy-btn rename-btn" title="修改任务批次名称" onclick="renameRunLabel(event, '${r.runId}')">${editIcon()}</button><button class="secondary copy-btn rename-btn ${pendingArchiveRunId === r.runId ? 'archive-confirm-btn' : ''}" title="${pendingArchiveRunId === r.runId ? '再次点击确认归档' : '归档任务批次（运行中请先停止）'}" onclick="archiveRunFromCard(event, '${r.runId}')">${pendingArchiveRunId === r.runId ? '确认' : archiveIcon()}</button></span></span><span>${statusPill(r)}</span></div>
       <div class="run-card-meta">
         ${metaChip('工作区', basenamePath(r.workspacePath || r.repo), { title: r.workspacePath || r.repo, long: true, extra: `<button class="secondary copy-btn" title="复制工作区地址" onclick="copyRunRepoPath(event, '${r.runId}')">⧉</button>` })}
         ${r.git?.isGit ? gitChip() : ''}
@@ -522,8 +540,11 @@ async function refreshSelected({auto=false} = {}) {
 function renderSelectedHeader() {
   if (!currentState) return '<div class="muted">未选择任务批次</div>';
   const sandbox = currentState.workerSandbox || 'workspace-write';
+  const titleActions = [`<button class="secondary copy-btn title-copy-btn" title="复制 Run ID：${esc(currentState.runId)}" data-copy-kind="run-id" onclick="copyRunId(event)">${titleCopyLabel('run-id')}</button>`];
+  if (currentState.runner === 'tmux' && hasRunTmuxMetadata(currentState)) {
+    titleActions.push(`<button class="secondary copy-btn title-copy-btn" title="复制 tmux attach 指令" data-copy-kind="tmux" onclick="copyTmuxRunCommand(event)">${titleCopyLabel('tmux')}</button>`);
+  }
   const chips = [
-    metaChip('Run ID', currentState.runId, { long: true }),
     metaChip('工作区', basenamePath(currentState.workspacePath || currentState.repo), {
       title: currentState.workspacePath || currentState.repo,
       long: true,
@@ -536,18 +557,8 @@ function renderSelectedHeader() {
   if (currentState.git?.isGit || currentState.workspace?.git?.isGit) {
     chips.push(gitChip());
   }
-  if (currentState.runner === 'tmux') {
-    if (hasRunTmuxMetadata(currentState)) {
-      chips.push(metaChip('终端', tmuxSessionName(currentState), {
-        long: true,
-        extra: `<button class="secondary copy-btn" onclick="copyTmuxRunCommand(event)">复制tmux attach指令</button>`
-      }));
-    } else {
-      chips.push(metaChip('终端', 'tmux 现场尚未生成'));
-    }
-  }
   chips.push(refreshPulseChip());
-  return `<div class="build-title"><span>${esc(currentState.label)}</span><button class="secondary copy-btn rename-btn" title="修改任务批次名称" onclick="renameRunLabel(event, currentState.runId)">${editIcon()}</button>${pill(currentState.status)}</div><div class="build-meta">${chips.join('')}</div>`;
+  return `<div class="build-title"><div class="build-title-main"><span class="build-title-text">${esc(currentState.label)}</span><button class="secondary copy-btn rename-btn" title="修改任务批次名称" onclick="renameRunLabel(event, currentState.runId)">${editIcon()}</button>${statusPill(currentState)}</div><div class="build-title-tools">${titleActions.join('')}</div></div><div class="build-meta">${chips.join('')}</div>`;
 }
 async function loadTaskDescription() {
   if (!selectedRun) { document.getElementById('taskDescription').textContent = '未选择任务批次'; return; }
@@ -576,22 +587,22 @@ function runActionState(key) {
   const anyWorkerRunning = (currentState.tasks || []).some(t => t.status === 'running');
   if (key === 'plan') {
     if (status === 'planning' || currentState.planner?.status === 'running') return { label: '拆分中', disabled: true, state: 'active' };
-    if (status === 'planned' || status === 'running' || status === 'workers_completed' || status === 'batches_completed' || status === 'judging' || status === 'judged') return { label: '已拆分', disabled: true, state: 'done' };
+    if (status === 'planned' || status === 'running' || status === 'batches_completed' || status === 'judging' || status === 'judged') return { label: '已拆分', disabled: true, state: 'done' };
     if (status === 'plan_failed' || status === 'plan_empty') return { label: '重试拆分', disabled: false, state: 'retry' };
     return { label: '拆分', disabled: false, state: '' };
   }
   if (key === 'dispatch') {
     if (status === 'running' || anyWorkerRunning) return { label: '执行中', disabled: true, state: 'active' };
-    if (status === 'planned' || status === 'batch_blocked') return { label: '执行', disabled: false, state: status === 'batch_blocked' ? 'retry' : '' };
-    if (status === 'workers_failed') return { label: '重试执行', disabled: false, state: 'retry' };
-    if (['workers_completed','batches_completed','judging','judged'].includes(status)) return { label: '已完成', disabled: true, state: 'done' };
+    if (status === 'planned') return { label: planApprovalPending() ? '开始执行' : '执行', disabled: false, state: '' };
+    if (status === 'batch_blocked') return { label: '重试执行', disabled: false, state: 'retry' };
+    if (['batches_completed','judging','judged'].includes(status)) return { label: '已完成', disabled: true, state: 'done' };
     return { label: '执行', disabled: true, state: 'done' };
   }
   if (key === 'judge') {
     if (status === 'judging' || currentState.judge?.status === 'running') return { label: '验收中', disabled: true, state: 'active' };
     if (status === 'judged') return { label: '已验收', disabled: true, state: 'done' };
     if (status === 'judge_failed') return { label: '重试验收', disabled: false, state: 'retry' };
-    if (status === 'batches_completed' || status === 'workers_completed') return { label: '验收', disabled: false, state: '' };
+    if (status === 'batches_completed') return { label: '验收', disabled: false, state: '' };
     return { label: '验收', disabled: true, state: 'done' };
   }
   if (key === 'stop') {
@@ -705,7 +716,7 @@ function shortSessionId(thread) {
 }
 function sessionCell(thread) {
   if (!thread) return '-';
-  return `<span class="session-cell" title="${esc(thread)}">…${esc(shortSessionId(thread))}</span><button class="copy-btn" title="复制完整 Codex 会话ID" onclick="copySessionId(event, '${esc(thread)}')">⧉</button>`;
+  return `<span class="session-cell-wrap"><span class="session-cell" title="${esc(thread)}">…${esc(shortSessionId(thread))}</span><button class="copy-btn" title="复制完整 Codex 会话ID" onclick="copySessionId(event, '${esc(thread)}')">⧉</button></span>`;
 }
 function taskStartedCell(t) {
   return t?.startedAt ? formatDateTime(t.startedAt) : '-';
@@ -844,6 +855,19 @@ function hideExecutionSummary() {
   el.classList.add('hidden');
   el.innerHTML = '';
 }
+async function copyRunId(event) {
+  event.stopPropagation();
+  const runId = currentState?.runId || selectedRun || '';
+  if (!runId) return;
+  try {
+    await navigator.clipboard.writeText(runId);
+    const kind = event.currentTarget.dataset.copyKind || 'run-id';
+    event.currentTarget.textContent = '✓';
+    setTimeout(() => { event.currentTarget.innerHTML = titleCopyLabel(kind); }, 900);
+  } catch {
+    prompt('复制 Run ID', runId);
+  }
+}
 async function copyRepoPath(event, repoPath = currentState?.workspacePath || currentState?.repo || '') {
   event.stopPropagation();
   if (!repoPath) return;
@@ -864,9 +888,10 @@ async function copyTmuxRunCommand(event) {
   const command = runAttachCommand(currentState);
   if (!command) return;
   try {
+    const kind = event.currentTarget.dataset.copyKind || 'tmux';
     await navigator.clipboard.writeText(command);
-    event.currentTarget.textContent = '已复制';
-    setTimeout(() => { event.currentTarget.textContent = '复制tmux attach指令'; }, 900);
+    event.currentTarget.textContent = '✓';
+    setTimeout(() => { event.currentTarget.innerHTML = titleCopyLabel(kind); }, 900);
   } catch {
     prompt('复制 tmux attach 命令', command);
   }
@@ -1024,7 +1049,11 @@ async function renameRunLabel(event, runId = selectedRun) {
   });
 }
 async function planRun() { if (selectedRun) await runActionWithPending('plan', async () => { await api(`/api/runs/${selectedRun}/plan`, {method:'POST'}); await refreshSelected(); }); }
-async function dispatchRun() { if (selectedRun) await runActionWithPending('dispatch', async () => { await api(`/api/runs/${selectedRun}/dispatch`, {method:'POST'}); await refreshSelected(); }); }
+async function dispatchRun() {
+  if (!selectedRun) return;
+  const endpoint = currentState?.status === 'batch_blocked' ? 'retry' : 'dispatch';
+  await runActionWithPending('dispatch', async () => { await api(`/api/runs/${selectedRun}/${endpoint}`, {method:'POST'}); await refreshSelected(); });
+}
 async function judgeRun() { if (selectedRun) await runActionWithPending('judge', async () => { await api(`/api/runs/${selectedRun}/judge`, {method:'POST'}); await refreshSelected(); }); }
 async function stopSelectedRun() {
   if (!selectedRun) return;

package/src/orchestrator.js CHANGED Viewed

@@ -220,8 +220,28 @@ function deriveRunLabel(label, taskText) {
   return truncateDisplayWidth(cleaned, MAX_DERIVED_LABEL_DISPLAY_WIDTH) || 'task';
 }
+function normalizePlanApprovalGate(value = false) {
+  const required = !!value;
+  return { required, approved: !required, approvedAt: null, approvedBy: null };
+}
+function planApprovalGate(state) {
+  return state?.gates?.planApproval || { required: false, approved: true, approvedAt: null, approvedBy: null };
+}
+function requiresPlanApproval(state) {
+  const gate = planApprovalGate(state);
+  return !!gate.required && !gate.approved;
+}
-export async function createRun({ label = '', taskText = '', workspace = '', repo = DEFAULT_REPO, maxParallel = 3, workerSandbox = 'workspace-write' } = {}) {
+function approvePlanGate(state, approvedBy = 'local-user') {
+  const gate = planApprovalGate(state);
+  if (!gate.required || gate.approved) return false;
+  state.gates = { ...(state.gates || {}), planApproval: { ...gate, approved: true, approvedAt: nowIso(), approvedBy } };
+  return true;
+}
+export async function createRun({ label = '', taskText = '', workspace = '', repo = DEFAULT_REPO, maxParallel = 3, workerSandbox = 'workspace-write', planApproval = false, requiresPlanApproval = false } = {}) {
   const resolvedWorkspace = await assertWorkspacePath(workspace || repo || DEFAULT_WORKSPACE);
   const workspaceMeta = await detectWorkspaceMetadata(resolvedWorkspace);
   const runLabel = deriveRunLabel(label, taskText);
@@ -243,6 +263,7 @@ export async function createRun({ label = '', taskText = '', workspace = '', rep
     repo: resolvedWorkspace,
     maxParallel: Number(maxParallel) || 3,
     workerSandbox: normalizeSandbox(workerSandbox),
+    gates: { planApproval: normalizePlanApprovalGate(planApproval || requiresPlanApproval) },
     runner: RUNNER,
     status: 'created', createdAt: nowIso(), updatedAt: nowIso(),
     planner: { status: 'pending' }, batches: [], tasks: [], judge: { status: 'pending' }
@@ -479,6 +500,27 @@ async function rotatePlannerAttempt(state, runDir) {
   }];
 }
+async function rotateJudgeAttempt(state, runDir) {
+  const judgeDir = roleDir(runDir, 'judge');
+  if (!fs.existsSync(judgeDir)) return;
+  const attemptsDir = path.join(runDir, 'judge_attempts');
+  await ensureDir(attemptsDir);
+  const previousAttempts = (state.judgeAttempts || []).map(item => Number(item.attempt || 0)).filter(Number.isFinite);
+  const attempt = Number(state.judge?.attempt || 0) || Math.max(1, 1 + Math.max(0, ...previousAttempts));
+  const archivedDir = path.join(attemptsDir, `attempt-${String(attempt).padStart(2, '0')}`);
+  await fsp.rm(archivedDir, { recursive: true, force: true });
+  await fsp.rename(judgeDir, archivedDir);
+  state.judgeAttempts = [...(state.judgeAttempts || []), {
+    attempt,
+    status: state.judge?.status,
+    exitCode: state.judge?.exitCode ?? null,
+    startedAt: state.judge?.startedAt,
+    endedAt: state.judge?.endedAt,
+    archivedDir,
+    archivedAt: nowIso()
+  }];
+}
 async function rotateWorkerAttempt(state, task) {
   const runDir = pathForRun(state.runId);
   const workerDir = roleDir(runDir, 'worker', task.id);
@@ -580,6 +622,7 @@ export async function dispatchRun(runId) {
     if (!state.tasks?.length) throw new Error('no tasks in plan');
     if (state.status === 'batch_blocked') throw new Error('current batch is blocked by failed/unknown tasks');
     if (allBatchesCompleted(state)) throw new Error('all batches completed; run final judge next');
+    approvePlanGate(state);
     state.status = 'running';
     await scheduleMoreWorkers(state);
     recomputeRunStatus(state);
@@ -770,16 +813,26 @@ export async function startJudge(runId) {
   return await withRunStateLock(runId, async () => {
     const state = await loadRun(runId);
     if (!state) throw new Error(`run not found: ${runId}`);
+    if (state.archived) throw new Error('archived run cannot be judged');
+    if (state.status === 'stopped') throw new Error('stopped run cannot be judged');
     recomputeRunStatus(state);
+    if (state.judge?.status === 'running') throw new Error('judge already running');
+    if (state.judge?.status === 'completed') throw new Error('judge already completed');
     if (!allBatchesCompleted(state) && state.tasks?.length) throw new Error('final judge is allowed only after all batches completed');
-    const outDir = roleDir(pathForRun(runId), 'judge');
+    const runDir = pathForRun(runId);
+    if (state.judge?.status === 'failed') await rotateJudgeAttempt(state, runDir);
+    const outDir = roleDir(runDir, 'judge');
     await ensureDir(outDir);
     const judgeInputPath = path.join(outDir, 'judge_input.json');
     const judgeInput = await buildJudgeInput(state);
     await writeJsonAtomic(judgeInputPath, judgeInput);
     const prompt = defaultJudgePrompt(state, judgeInputPath);
-    const child = await runner.startCodexTask({ runId: state.runId, taskId: 'judge', batchId: 'judge', runStatePath: statePath(pathForRun(runId)), prompt, sandbox: 'read-only', cwd: workspacePathOf(state), outDir });
-    state.judge = { status: 'running', pid: child.pid, startedAt: nowIso(), dir: outDir };
+    const child = await runner.startCodexTask({ runId: state.runId, taskId: 'judge', batchId: 'judge', runStatePath: statePath(runDir), prompt, sandbox: 'read-only', cwd: workspacePathOf(state), outDir });
+    const previousJudgeAttempts = [
+      ...(state.judgeAttempts || []).map(item => Number(item.attempt || 0)),
+      Number(state.judge?.attempt || 0)
+    ].filter(Number.isFinite);
+    state.judge = { status: 'running', pid: child.pid, startedAt: nowIso(), dir: outDir, attempt: 1 + Math.max(0, ...previousJudgeAttempts) };
     state.status = 'judging';
     await saveRun(state);
     child.onExit(async code => {
@@ -816,6 +869,7 @@ export async function autoAdvanceRun(runId, { appClient = null, startCreated = f
     return await refreshRun(runId, appClient) || state;
   }
   if (state.status === 'planned') {
+    if (requiresPlanApproval(state)) return state;
     try { state = await dispatchRun(runId); }
     catch (error) { if (!/all batches completed|current batch is blocked/i.test(error.message || '')) throw error; }
     return await refreshRun(runId, appClient) || state;
@@ -1230,7 +1284,7 @@ export function summaryOfRun(s) {
   const tasks = s.tasks || [];
   const workspacePath = s.workspacePath || s.repo || '';
   const git = s.git || s.workspace?.git || null;
-  return { runId: s.runId, label: s.label, repo: s.repo || workspacePath, workspacePath, workspaceName: s.workspaceName || path.basename(workspacePath || ''), git, status: s.status, runner: s.runner || RUNNER, workerSandbox: s.workerSandbox || 'workspace-write', archived: !!s.archived, createdAt: s.createdAt, updatedAt: s.updatedAt, durationEnd: runDurationEndOfState(s), total: tasks.length, completed: tasks.filter(t => t.status === 'completed').length, failed: tasks.filter(t => ['failed','unknown'].includes(t.status)).length, running: tasks.filter(t => t.status === 'running').length, batches: (s.batches || []).map(b => ({ id: b.id, name: b.name, status: b.status, total: b.tasks?.length || 0, completed: (b.tasks || []).filter(t => t.status === 'completed').length })) };
+  return { runId: s.runId, label: s.label, repo: s.repo || workspacePath, workspacePath, workspaceName: s.workspaceName || path.basename(workspacePath || ''), git, status: s.status, runner: s.runner || RUNNER, workerSandbox: s.workerSandbox || 'workspace-write', gates: s.gates || {}, archived: !!s.archived, createdAt: s.createdAt, updatedAt: s.updatedAt, durationEnd: runDurationEndOfState(s), total: tasks.length, completed: tasks.filter(t => t.status === 'completed').length, failed: tasks.filter(t => ['failed','unknown'].includes(t.status)).length, running: tasks.filter(t => t.status === 'running').length, batches: (s.batches || []).map(b => ({ id: b.id, name: b.name, status: b.status, total: b.tasks?.length || 0, completed: (b.tasks || []).filter(t => t.status === 'completed').length })) };
 }
 export async function readRunTaskText(runId) {

package/src/server.js CHANGED Viewed

@@ -8,6 +8,8 @@ import { createRun, listRuns, startPlanner, dispatchRun, startJudge, refreshRun,
 import { startAutoScheduler } from './scheduler.js';
 const PUBLIC_DIR = path.join(APP_ROOT, 'public');
+const CODEX_INFO_TTL_MS = 30000;
+let codexInfoCache = null;
 function send(res, status, body, type = 'application/json') {
   const data = type === 'application/json' ? JSON.stringify(body, null, 2) : body;
@@ -26,6 +28,13 @@ async function readBody(req) {
 function notFound(res) { send(res, 404, { error: 'not found' }); }
 function methodNotAllowed(res) { send(res, 405, { error: 'method not allowed' }); }
+async function cachedCodexInfo(nowMs = Date.now()) {
+  if (codexInfoCache && codexInfoCache.expiresAt > nowMs) return codexInfoCache.value;
+  const value = await detectCodexInfo();
+  codexInfoCache = { value, expiresAt: nowMs + CODEX_INFO_TTL_MS };
+  return value;
+}
 async function serveStatic(req, res, pathname) {
   let file = pathname === '/' ? path.join(PUBLIC_DIR, 'index.html') : path.join(PUBLIC_DIR, pathname.replace(/^\/+/, ''));
   file = path.resolve(file);
@@ -46,7 +55,7 @@ async function handleApi(req, res, url, appClient) {
       return send(res, 200, { ok: true, version: PACKAGE_VERSION, appRoot: APP_ROOT, runsDir: RUNS_DIR, defaultWorkspace: DEFAULT_WORKSPACE, defaultRepo: DEFAULT_REPO, runner: RUNNER, codexBin: CODEX_BIN });
     }
     if (req.method === 'GET' && url.pathname === '/api/codex') {
-      return send(res, 200, { ok: true, codex: await detectCodexInfo() });
+      return send(res, 200, { ok: true, codex: await cachedCodexInfo() });
     }
     if (parts[1] === 'runs' && parts.length === 2) {
       if (req.method === 'GET') return send(res, 200, { runs: await listRuns({ includeArchived: url.searchParams.get('includeArchived') === '1', workspace: url.searchParams.get('workspace') || '' }) });