npm - pi-crew - Versions diffs - 0.1.46 → 0.1.49 - Mend

pi-crew 0.1.46 → 0.1.49

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (253) hide show

package/CHANGELOG.md +97 -0
package/agents/analyst.md +11 -11
package/agents/critic.md +11 -11
package/agents/executor.md +11 -11
package/agents/explorer.md +11 -11
package/agents/planner.md +11 -11
package/agents/reviewer.md +11 -11
package/agents/security-reviewer.md +11 -11
package/agents/test-engineer.md +11 -11
package/agents/verifier.md +11 -11
package/agents/writer.md +11 -11
package/docs/next-upgrade-roadmap.md +117 -42
package/docs/refactor-tasks-phase3.md +394 -394
package/docs/refactor-tasks-phase4.md +564 -564
package/docs/refactor-tasks-phase5.md +402 -402
package/docs/refactor-tasks-phase6.md +662 -662
package/docs/research/AGENT-EXECUTION-ARCHITECTURE.md +261 -0
package/docs/research/AGENT-LIFECYCLE-COMPARISON.md +111 -0
package/docs/research/AUDIT_OH_MY_PI.md +261 -0
package/docs/research/AUDIT_PI_CREW.md +457 -0
package/docs/research/CAVEMAN-DEEP-RESEARCH.md +281 -0
package/docs/research/COMPARISON_OH_MY_PI_VS_PI_CREW.md +264 -0
package/docs/research/DEEP-RESEARCH-PI-POWERBAR.md +343 -0
package/docs/research/DEEP_RESEARCH_SUBAGENT_ARCHITECTURE.md +480 -0
package/docs/research/GAP_CLOSURE_IMPLEMENTATION_PLAN.md +354 -0
package/docs/research/IMPLEMENTATION_PLAN.md +385 -0
package/docs/research/LIVE-SESSION-PRODUCTION-READY-PLAN.md +502 -0
package/docs/research/OH-MY-PI-DEEP-RESEARCH-v14.7.6.md +266 -0
package/docs/research/REMAINING-GAPS-PLAN.md +363 -0
package/docs/research/SESSION-SUMMARY-2026-05-08.md +146 -0
package/docs/research/UI-RESPONSIVENESS-AUDIT.md +173 -0
package/docs/research-awesome-agent-skills-distillation.md +100 -100
package/docs/research-extension-examples.md +297 -297
package/docs/research-extension-system.md +324 -324
package/docs/research-oh-my-pi-distillation.md +56 -9
package/docs/research-optimization-plan.md +548 -548
package/docs/research-phase10-distillation.md +198 -198
package/docs/research-phase11-distillation.md +201 -201
package/docs/research-pi-coding-agent.md +357 -357
package/docs/research-source-pi-crew-reference.md +174 -174
package/docs/runtime-flow.md +148 -148
package/docs/source-runtime-refactor-map.md +107 -107
package/index.ts +6 -6
package/package.json +99 -98
package/schema.json +8 -0
package/skills/async-worker-recovery/SKILL.md +42 -42
package/skills/context-artifact-hygiene/SKILL.md +52 -52
package/skills/delegation-patterns/SKILL.md +54 -54
package/skills/mailbox-interactive/SKILL.md +40 -40
package/skills/model-routing-context/SKILL.md +39 -39
package/skills/multi-perspective-review/SKILL.md +58 -58
package/skills/observability-reliability/SKILL.md +41 -41
package/skills/orchestration/SKILL.md +157 -0
package/skills/ownership-session-security/SKILL.md +41 -41
package/skills/pi-extension-lifecycle/SKILL.md +39 -39
package/skills/requirements-to-task-packet/SKILL.md +63 -63
package/skills/resource-discovery-config/SKILL.md +41 -41
package/skills/runtime-state-reader/SKILL.md +44 -44
package/skills/secure-agent-orchestration-review/SKILL.md +45 -45
package/skills/state-mutation-locking/SKILL.md +42 -42
package/skills/systematic-debugging/SKILL.md +67 -67
package/skills/ui-render-performance/SKILL.md +39 -39
package/skills/verification-before-done/SKILL.md +57 -57
package/skills/worktree-isolation/SKILL.md +39 -39
package/src/agents/agent-config.ts +6 -0
package/src/agents/agent-search.ts +98 -0
package/src/agents/agent-serializer.ts +4 -0
package/src/agents/discover-agents.ts +17 -4
package/src/config/config.ts +24 -0
package/src/config/defaults.ts +11 -0
package/src/extension/autonomous-policy.ts +26 -33
package/src/extension/cross-extension-rpc.ts +82 -82
package/src/extension/help.ts +1 -0
package/src/extension/management.ts +5 -0
package/src/extension/register.ts +58 -13
package/src/extension/registration/commands.ts +33 -1
package/src/extension/registration/compaction-guard.ts +125 -125
package/src/extension/registration/team-tool.ts +6 -4
package/src/extension/run-bundle-schema.ts +89 -89
package/src/extension/run-index.ts +24 -18
package/src/extension/run-maintenance.ts +68 -62
package/src/extension/team-tool/api.ts +23 -2
package/src/extension/team-tool/cancel.ts +86 -11
package/src/extension/team-tool/context.ts +3 -0
package/src/extension/team-tool/handle-settings.ts +188 -188
package/src/extension/team-tool/inspect.ts +41 -41
package/src/extension/team-tool/intent-policy.ts +42 -0
package/src/extension/team-tool/lifecycle-actions.ts +47 -18
package/src/extension/team-tool/parallel-dispatch.ts +156 -0
package/src/extension/team-tool/plan.ts +19 -19
package/src/extension/team-tool/respond.ts +10 -2
package/src/extension/team-tool/run.ts +3 -2
package/src/extension/team-tool/status.ts +1 -1
package/src/extension/team-tool-types.ts +1 -0
package/src/extension/team-tool.ts +13 -3
package/src/hooks/registry.ts +61 -0
package/src/hooks/types.ts +41 -0
package/src/i18n.ts +184 -184
package/src/observability/exporters/otlp-exporter.ts +77 -77
package/src/prompt/prompt-runtime.ts +72 -72
package/src/runtime/agent-control.ts +108 -2
package/src/runtime/agent-memory.ts +72 -72
package/src/runtime/agent-observability.ts +114 -114
package/src/runtime/async-marker.ts +26 -26
package/src/runtime/async-runner.ts +3 -1
package/src/runtime/attention-events.ts +28 -28
package/src/runtime/background-runner.ts +19 -0
package/src/runtime/cancellation-token.ts +89 -0
package/src/runtime/cancellation.ts +61 -51
package/src/runtime/capability-inventory.ts +116 -0
package/src/runtime/child-pi.ts +2 -1
package/src/runtime/code-summary.ts +247 -0
package/src/runtime/completion-guard.ts +190 -190
package/src/runtime/crash-recovery.ts +181 -0
package/src/runtime/crew-agent-records.ts +35 -7
package/src/runtime/crew-agent-runtime.ts +1 -0
package/src/runtime/custom-tools/irc-tool.ts +201 -0
package/src/runtime/custom-tools/submit-result-tool.ts +90 -0
package/src/runtime/delivery-coordinator.ts +3 -1
package/src/runtime/direct-run.ts +35 -35
package/src/runtime/effectiveness.ts +81 -76
package/src/runtime/event-stream-bridge.ts +90 -0
package/src/runtime/foreground-control.ts +82 -82
package/src/runtime/green-contract.ts +46 -46
package/src/runtime/group-join.ts +106 -106
package/src/runtime/heartbeat-gradient.ts +28 -28
package/src/runtime/heartbeat-watcher.ts +124 -124
package/src/runtime/live-agent-control.ts +88 -88
package/src/runtime/live-agent-manager.ts +78 -2
package/src/runtime/live-control-realtime.ts +36 -36
package/src/runtime/live-extension-bridge.ts +150 -0
package/src/runtime/live-irc.ts +92 -0
package/src/runtime/live-session-health.ts +100 -0
package/src/runtime/live-session-runtime.ts +297 -7
package/src/runtime/mcp-proxy.ts +113 -0
package/src/runtime/notebook-helpers.ts +90 -0
package/src/runtime/orphan-sentinel.ts +7 -0
package/src/runtime/output-validator.ts +187 -0
package/src/runtime/parallel-research.ts +44 -44
package/src/runtime/parallel-utils.ts +57 -0
package/src/runtime/parent-guard.ts +80 -0
package/src/runtime/pi-json-output.ts +111 -111
package/src/runtime/policy-engine.ts +79 -79
package/src/runtime/progress-event-coalescer.ts +43 -43
package/src/runtime/prose-compressor.ts +164 -0
package/src/runtime/recovery-recipes.ts +74 -74
package/src/runtime/result-extractor.ts +121 -0
package/src/runtime/role-permission.ts +39 -39
package/src/runtime/runtime-resolver.ts +1 -4
package/src/runtime/semaphore.ts +131 -0
package/src/runtime/sensitive-paths.ts +92 -0
package/src/runtime/session-resources.ts +25 -25
package/src/runtime/session-snapshot.ts +59 -59
package/src/runtime/session-usage.ts +79 -79
package/src/runtime/sidechain-output.ts +29 -29
package/src/runtime/stream-preview.ts +177 -0
package/src/runtime/subagent-manager.ts +3 -2
package/src/runtime/subprocess-tool-registry.ts +67 -0
package/src/runtime/supervisor-contact.ts +59 -59
package/src/runtime/task-display.ts +38 -38
package/src/runtime/task-output-context.ts +59 -9
package/src/runtime/task-runner/capabilities.ts +78 -78
package/src/runtime/task-runner/live-executor.ts +2 -0
package/src/runtime/task-runner/progress.ts +119 -119
package/src/runtime/task-runner/prompt-builder.ts +70 -8
package/src/runtime/task-runner/prompt-pipeline.ts +64 -64
package/src/runtime/task-runner/result-utils.ts +14 -14
package/src/runtime/task-runner/run-projection.ts +104 -0
package/src/runtime/task-runner/state-helpers.ts +22 -22
package/src/runtime/task-runner.ts +75 -4
package/src/runtime/team-runner.ts +60 -8
package/src/runtime/worker-heartbeat.ts +21 -21
package/src/runtime/worker-startup.ts +57 -57
package/src/runtime/workspace-tree.ts +298 -0
package/src/runtime/yield-handler.ts +189 -0
package/src/schema/config-schema.ts +6 -0
package/src/schema/team-tool-schema.ts +11 -1
package/src/skills/discover-skills.ts +67 -0
package/src/state/active-run-registry.ts +4 -2
package/src/state/artifact-store.ts +4 -1
package/src/state/atomic-write.ts +50 -1
package/src/state/blob-store.ts +117 -0
package/src/state/contracts.ts +1 -0
package/src/state/event-log-rotation.ts +158 -0
package/src/state/event-log.ts +52 -2
package/src/state/mailbox.ts +87 -7
package/src/state/state-store.ts +24 -4
package/src/state/task-claims.ts +44 -44
package/src/state/types.ts +20 -0
package/src/state/usage.ts +29 -29
package/src/subagents/async-entry.ts +1 -1
package/src/subagents/index.ts +3 -3
package/src/subagents/live/control.ts +1 -1
package/src/subagents/live/manager.ts +1 -1
package/src/subagents/live/realtime.ts +1 -1
package/src/subagents/live/session-runtime.ts +1 -1
package/src/subagents/manager.ts +1 -1
package/src/subagents/spawn.ts +1 -1
package/src/teams/team-serializer.ts +38 -38
package/src/types/diff.d.ts +18 -18
package/src/ui/agent-management-overlay.ts +144 -0
package/src/ui/crew-footer.ts +101 -101
package/src/ui/crew-select-list.ts +111 -111
package/src/ui/crew-widget.ts +11 -2
package/src/ui/dashboard-panes/cancellation-pane.ts +43 -0
package/src/ui/dashboard-panes/capability-pane.ts +60 -0
package/src/ui/dashboard-panes/mailbox-pane.ts +35 -11
package/src/ui/dashboard-panes/metrics-pane.ts +34 -34
package/src/ui/dynamic-border.ts +25 -25
package/src/ui/layout-primitives.ts +106 -106
package/src/ui/live-run-sidebar.ts +4 -0
package/src/ui/loaders.ts +158 -158
package/src/ui/powerbar-publisher.ts +77 -15
package/src/ui/render-coalescer.ts +51 -0
package/src/ui/render-diff.ts +119 -119
package/src/ui/render-scheduler.ts +143 -143
package/src/ui/run-dashboard.ts +4 -0
package/src/ui/run-event-bus.ts +209 -0
package/src/ui/run-snapshot-cache.ts +68 -16
package/src/ui/snapshot-types.ts +8 -0
package/src/ui/spinner.ts +17 -17
package/src/ui/status-colors.ts +58 -58
package/src/ui/syntax-highlight.ts +116 -116
package/src/ui/transcript-entries.ts +258 -0
package/src/utils/atomic-write.ts +33 -33
package/src/utils/completion-dedupe.ts +63 -63
package/src/utils/frontmatter.ts +68 -68
package/src/utils/git.ts +262 -262
package/src/utils/ids.ts +17 -12
package/src/utils/incremental-reader.ts +104 -0
package/src/utils/names.ts +27 -27
package/src/utils/redaction.ts +44 -44
package/src/utils/safe-paths.ts +47 -47
package/src/utils/scan-cache.ts +137 -0
package/src/utils/sleep.ts +32 -32
package/src/utils/sse-parser.ts +134 -0
package/src/utils/task-name-generator.ts +337 -0
package/src/utils/visual.ts +33 -2
package/src/workflows/validate-workflow.ts +40 -40
package/src/worktree/branch-freshness.ts +45 -45
package/src/worktree/cleanup.ts +2 -1
package/teams/default.team.md +12 -12
package/teams/fast-fix.team.md +11 -11
package/teams/implementation.team.md +18 -18
package/teams/parallel-research.team.md +14 -14
package/teams/research.team.md +11 -11
package/teams/review.team.md +12 -12
package/workflows/default.workflow.md +29 -29
package/workflows/fast-fix.workflow.md +22 -22
package/workflows/implementation.workflow.md +38 -38
package/workflows/parallel-research.workflow.md +46 -46
package/workflows/research.workflow.md +22 -22
package/workflows/review.workflow.md +30 -30

package/skills/ui-render-performance/SKILL.md CHANGED Viewed

@@ -1,39 +1,39 @@
----
-name: ui-render-performance
-description: Non-blocking Pi TUI render workflow. Use when changing widgets, powerbar/statusbar segments, dashboard panes, overlays, snapshot caches, or live UI refresh behavior.
----
-# ui-render-performance
-Use this skill for Pi/pi-crew TUI work.
-## Source patterns distilled
-- Pi TUI is synchronous immediate-mode/string rendering: `source/pi-mono/packages/coding-agent/src/modes/interactive/interactive-mode.ts`
-- Pi extension examples use event-driven state updates, not render-time loading.
-- pi-crew UI: `src/extension/register.ts`, `src/ui/run-dashboard.ts`, `src/ui/run-snapshot-cache.ts`, `src/ui/crew-widget.ts`, `src/ui/powerbar-publisher.ts`, `src/ui/render-scheduler.ts`
-## Rules
-- Treat every `render(width)` and widget/powerbar update as a hot synchronous path.
-- Render from in-memory snapshots only. Preload config, manifests, snapshots, agents, and mailbox counts asynchronously.
-- Use `RenderScheduler.schedule()` to coalesce renders; avoid direct repeated rendering.
-- Prefer `snapshotCache.get(runId)` in render paths. If a sync fallback is unavoidable, classify it as first-load/rare and document why.
-- Keep dashboard panes pure: accept a snapshot/model and format strings; do not call `fs.readFileSync`, `fs.readdirSync`, `fs.statSync`, or network APIs from pane render methods.
-- On session switch, cancel timers and ensure in-flight async preloads cannot update stale session UI.
-- Watch TTL interactions: a preload interval shorter than cache TTL prevents render-time refresh gaps.
-## Anti-patterns
-- Do not call `loadConfig()`, `manifestCache.list()`, or `refreshIfStale()` repeatedly inside `renderTick()` unless backed by preloaded frame data.
-- Do not do large JSON parsing or directory scans inside widget render/update functions.
-- Do not show stale health warnings for completed/cancelled/failed runs.
-## Verification
-```bash
-cd pi-crew
-npx tsc --noEmit
-node --experimental-strip-types --test test/unit/run-snapshot-cache.test.ts test/unit/crew-widget.test.ts test/unit/powerbar-publisher.test.ts test/unit/run-dashboard.test.ts
-npm test
-```
+---
+name: ui-render-performance
+description: Non-blocking Pi TUI render workflow. Use when changing widgets, powerbar/statusbar segments, dashboard panes, overlays, snapshot caches, or live UI refresh behavior.
+---
+# ui-render-performance
+Use this skill for Pi/pi-crew TUI work.
+## Source patterns distilled
+- Pi TUI is synchronous immediate-mode/string rendering: `source/pi-mono/packages/coding-agent/src/modes/interactive/interactive-mode.ts`
+- Pi extension examples use event-driven state updates, not render-time loading.
+- pi-crew UI: `src/extension/register.ts`, `src/ui/run-dashboard.ts`, `src/ui/run-snapshot-cache.ts`, `src/ui/crew-widget.ts`, `src/ui/powerbar-publisher.ts`, `src/ui/render-scheduler.ts`
+## Rules
+- Treat every `render(width)` and widget/powerbar update as a hot synchronous path.
+- Render from in-memory snapshots only. Preload config, manifests, snapshots, agents, and mailbox counts asynchronously.
+- Use `RenderScheduler.schedule()` to coalesce renders; avoid direct repeated rendering.
+- Prefer `snapshotCache.get(runId)` in render paths. If a sync fallback is unavoidable, classify it as first-load/rare and document why.
+- Keep dashboard panes pure: accept a snapshot/model and format strings; do not call `fs.readFileSync`, `fs.readdirSync`, `fs.statSync`, or network APIs from pane render methods.
+- On session switch, cancel timers and ensure in-flight async preloads cannot update stale session UI.
+- Watch TTL interactions: a preload interval shorter than cache TTL prevents render-time refresh gaps.
+## Anti-patterns
+- Do not call `loadConfig()`, `manifestCache.list()`, or `refreshIfStale()` repeatedly inside `renderTick()` unless backed by preloaded frame data.
+- Do not do large JSON parsing or directory scans inside widget render/update functions.
+- Do not show stale health warnings for completed/cancelled/failed runs.
+## Verification
+```bash
+cd pi-crew
+npx tsc --noEmit
+node --experimental-strip-types --test test/unit/run-snapshot-cache.test.ts test/unit/crew-widget.test.ts test/unit/powerbar-publisher.test.ts test/unit/run-dashboard.test.ts
+npm test
+```

package/skills/verification-before-done/SKILL.md CHANGED Viewed

@@ -1,57 +1,57 @@
----
-name: verification-before-done
-description: Use when about to claim work is complete, fixed, passing, reviewed, committed, or ready to hand off.
----
-# verification-before-done
-Core principle: evidence before claims. A worker report, green-looking log, or previous run is not fresh verification.
-Distilled from detailed reads of agent-skill patterns for verification-before-completion, TDD, review reception, and QA workflows.
-## Gate Function
-Before any completion claim:
-1. Identify the command or inspection that proves the claim.
-2. Run the full command fresh, or explicitly state why a command cannot be run.
-3. Read the output, including exit code and failure counts.
-4. Compare the output to the claim.
-5. Report the claim only with the evidence.
-## Claim-to-Evidence Table
-| Claim | Requires | Not sufficient |
-|---|---|---|
-| Tests pass | Fresh test output with zero failures | Prior run, “should pass” |
-| Typecheck passes | Typecheck command exit 0 | Lint or targeted tests only |
-| Bug fixed | Original symptom/regression test passes | Code changed |
-| Requirements met | Checklist against request/plan | Generic test success |
-| Agent completed | Worker output plus artifact/diff/state inspection | Worker says DONE |
-| Safe to commit | Relevant checks pass and status reviewed | Partial local confidence |
-## Verification Ladder
-Choose the smallest reliable gate, then escalate when risk requires it:
-1. Read-only inspection for plans/reviews.
-2. Targeted unit test for touched behavior.
-3. Typecheck for TypeScript/schema/API changes.
-4. Integration test for runtime, subprocess, state, filesystem, UI, config, or session behavior.
-5. Full suite before commit/release or broad changes.
-6. Real Pi smoke only when safe and needed.
-## Done Report
-Include:
-- changed files or read-only status;
-- commands run and pass/fail result;
-- artifacts, run IDs, logs, or state paths inspected;
-- behavior actually verified;
-- skipped checks and why;
-- risks and rollback notes.
-## Red Flags
-Stop before saying done if you are using words like “should”, “probably”, “looks”, “seems”, “I think”, or if you are trusting an agent report without checking evidence.
+---
+name: verification-before-done
+description: Use when about to claim work is complete, fixed, passing, reviewed, committed, or ready to hand off.
+---
+# verification-before-done
+Core principle: evidence before claims. A worker report, green-looking log, or previous run is not fresh verification.
+Distilled from detailed reads of agent-skill patterns for verification-before-completion, TDD, review reception, and QA workflows.
+## Gate Function
+Before any completion claim:
+1. Identify the command or inspection that proves the claim.
+2. Run the full command fresh, or explicitly state why a command cannot be run.
+3. Read the output, including exit code and failure counts.
+4. Compare the output to the claim.
+5. Report the claim only with the evidence.
+## Claim-to-Evidence Table
+| Claim | Requires | Not sufficient |
+|---|---|---|
+| Tests pass | Fresh test output with zero failures | Prior run, “should pass” |
+| Typecheck passes | Typecheck command exit 0 | Lint or targeted tests only |
+| Bug fixed | Original symptom/regression test passes | Code changed |
+| Requirements met | Checklist against request/plan | Generic test success |
+| Agent completed | Worker output plus artifact/diff/state inspection | Worker says DONE |
+| Safe to commit | Relevant checks pass and status reviewed | Partial local confidence |
+## Verification Ladder
+Choose the smallest reliable gate, then escalate when risk requires it:
+1. Read-only inspection for plans/reviews.
+2. Targeted unit test for touched behavior.
+3. Typecheck for TypeScript/schema/API changes.
+4. Integration test for runtime, subprocess, state, filesystem, UI, config, or session behavior.
+5. Full suite before commit/release or broad changes.
+6. Real Pi smoke only when safe and needed.
+## Done Report
+Include:
+- changed files or read-only status;
+- commands run and pass/fail result;
+- artifacts, run IDs, logs, or state paths inspected;
+- behavior actually verified;
+- skipped checks and why;
+- risks and rollback notes.
+## Red Flags
+Stop before saying done if you are using words like “should”, “probably”, “looks”, “seems”, “I think”, or if you are trusting an agent report without checking evidence.

package/skills/worktree-isolation/SKILL.md CHANGED Viewed

@@ -1,39 +1,39 @@
----
-name: worktree-isolation
-description: Conflict-safe git worktree workflow. Use when running parallel implementation workers, isolating risky edits, or cleaning up task worktrees.
----
-# worktree-isolation
-Use this skill for worktree-based execution or cleanup.
-## Source patterns distilled
-- pi-subagents worktree runner and cleanup patterns
-- pi-crew worktrees: `src/worktree/worktree-manager.ts`, `src/worktree/cleanup.ts`, `src/worktree/branch-freshness.ts`
-- Team runner workspace mode: `src/runtime/team-runner.ts`, workflow/team resource fields
-## Rules
-- Use worktree mode for parallel or risky code-changing tasks when the repository is clean enough and merge ownership is clear.
-- Assign one owner per file/symbol/migration path to avoid conflict-heavy merges.
-- Name branches/worktrees deterministically from run/task IDs; avoid user-controlled path fragments without sanitization.
-- Before cleanup, check dirty state. Preserve dirty worktrees unless `force` is explicitly set.
-- Record worktree paths and branch metadata in artifacts/events so the operator can inspect or recover.
-- Do not run destructive git operations without explicit confirmation and evidence of target path containment.
-## Anti-patterns
-- Parallel editing the same file in multiple worktrees without a merge plan.
-- Force-removing dirty worktrees by default.
-- Reusing stale worktrees after the base branch has moved without freshness checks.
-- Storing worktrees outside the intended contained workspace root.
-## Verification
-```bash
-cd pi-crew
-npx tsc --noEmit
-node --experimental-strip-types --test test/integration/worktree-mode.test.ts test/unit/run-index.test.ts
-npm test
-```
+---
+name: worktree-isolation
+description: Conflict-safe git worktree workflow. Use when running parallel implementation workers, isolating risky edits, or cleaning up task worktrees.
+---
+# worktree-isolation
+Use this skill for worktree-based execution or cleanup.
+## Source patterns distilled
+- pi-subagents worktree runner and cleanup patterns
+- pi-crew worktrees: `src/worktree/worktree-manager.ts`, `src/worktree/cleanup.ts`, `src/worktree/branch-freshness.ts`
+- Team runner workspace mode: `src/runtime/team-runner.ts`, workflow/team resource fields
+## Rules
+- Use worktree mode for parallel or risky code-changing tasks when the repository is clean enough and merge ownership is clear.
+- Assign one owner per file/symbol/migration path to avoid conflict-heavy merges.
+- Name branches/worktrees deterministically from run/task IDs; avoid user-controlled path fragments without sanitization.
+- Before cleanup, check dirty state. Preserve dirty worktrees unless `force` is explicitly set.
+- Record worktree paths and branch metadata in artifacts/events so the operator can inspect or recover.
+- Do not run destructive git operations without explicit confirmation and evidence of target path containment.
+## Anti-patterns
+- Parallel editing the same file in multiple worktrees without a merge plan.
+- Force-removing dirty worktrees by default.
+- Reusing stale worktrees after the base branch has moved without freshness checks.
+- Storing worktrees outside the intended contained workspace root.
+## Verification
+```bash
+cd pi-crew
+npx tsc --noEmit
+node --experimental-strip-types --test test/integration/worktree-mode.test.ts test/unit/run-index.test.ts
+npm test
+```

package/src/agents/agent-config.ts CHANGED Viewed

@@ -25,6 +25,12 @@ export interface AgentConfig {
 	inheritSkills?: boolean;
 	routing?: RoutingMetadata;
 	memory?: "user" | "project" | "local";
+	/** Tool loading strategy: "essential" = always load all tools, "lean" = only load tools in defaultTools list */
+	loadMode?: "essential" | "lean";
+	/** Explicit tool list when loadMode is "lean". null means all available tools. */
+	defaultTools?: string[] | null;
+	/** Context mode: "fresh" = clean start, "fork" = inherit parent session context */
+	contextMode?: "fresh" | "fork";
 	disabled?: boolean;
 	override?: { source: "config"; path: string };
 }

package/src/agents/agent-search.ts ADDED Viewed

@@ -0,0 +1,98 @@
+import type { AgentConfig } from "./agent-config.ts";
+// ─── BM25 Agent Search ──────────────────────────────────────────────────────
+// Lightweight BM25 search over agent descriptors for task-to-agent matching.
+// Based on the same BM25 algorithm used in oh-my-pi's tool-index.ts.
+export interface AgentSearchDocument {
+	agent: AgentConfig;
+	termFrequencies: Map<string, number>;
+	length: number;
+}
+export interface AgentSearchIndex {
+	documents: AgentSearchDocument[];
+	averageLength: number;
+	documentFrequencies: Map<string, number>;
+}
+export interface AgentSearchResult {
+	agent: AgentConfig;
+	score: number;
+}
+const BM25_K1 = 1.2;
+const BM25_B = 0.75;
+const FIELD_WEIGHTS = {
+	name: 6,
+	description: 2,
+	role: 3,
+} as const;
+function tokenize(value: string): string[] {
+	return value
+		.replace(/([a-z0-9])([A-Z])/g, "$1 $2")
+		.replace(/[^a-zA-Z0-9]+/g, " ")
+		.toLowerCase()
+		.trim()
+		.split(/\s+/)
+		.filter((token) => token.length > 0);
+}
+function addWeightedTokens(termFrequencies: Map<string, number>, value: string | undefined, weight: number): void {
+	if (!value) return;
+	for (const token of tokenize(value)) {
+		termFrequencies.set(token, (termFrequencies.get(token) ?? 0) + weight);
+	}
+}
+function buildAgentSearchDocument(agent: AgentConfig): AgentSearchDocument {
+	const termFrequencies = new Map<string, number>();
+	addWeightedTokens(termFrequencies, agent.name, FIELD_WEIGHTS.name);
+	addWeightedTokens(termFrequencies, agent.description, FIELD_WEIGHTS.description);
+	// Role from agent name heuristic
+	const roleHint = agent.name?.replace(/[-_]/g, " ") ?? "";
+	addWeightedTokens(termFrequencies, roleHint, FIELD_WEIGHTS.role);
+	const length = Array.from(termFrequencies.values()).reduce((sum, value) => sum + value, 0);
+	return { agent, termFrequencies, length };
+}
+export function buildAgentSearchIndex(agents: Iterable<AgentConfig>): AgentSearchIndex {
+	const documents = Array.from(agents, buildAgentSearchDocument);
+	const averageLength = documents.reduce((sum, document) => sum + document.length, 0) / documents.length || 1;
+	const documentFrequencies = new Map<string, number>();
+	for (const document of documents) {
+		for (const token of new Set(document.termFrequencies.keys())) {
+			documentFrequencies.set(token, (documentFrequencies.get(token) ?? 0) + 1);
+		}
+	}
+	return { documents, averageLength, documentFrequencies };
+}
+export function searchAgents(index: AgentSearchIndex, query: string, limit: number): AgentSearchResult[] {
+	const queryTokens = tokenize(query);
+	if (queryTokens.length === 0) return [];
+	if (index.documents.length === 0) return [];
+	const queryTermCounts = new Map<string, number>();
+	for (const token of queryTokens) {
+		queryTermCounts.set(token, (queryTermCounts.get(token) ?? 0) + 1);
+	}
+	return index.documents
+		.map((document) => {
+			let score = 0;
+			for (const [token, queryTermCount] of queryTermCounts) {
+				const termFrequency = document.termFrequencies.get(token) ?? 0;
+				if (termFrequency === 0) continue;
+				const documentFrequency = index.documentFrequencies.get(token) ?? 0;
+				const idf = Math.log(1 + (index.documents.length - documentFrequency + 0.5) / (documentFrequency + 0.5));
+				const normalization = BM25_K1 * (1 - BM25_B + BM25_B * (document.length / index.averageLength));
+				score += queryTermCount * idf * ((termFrequency * (BM25_K1 + 1)) / (termFrequency + normalization));
+			}
+			return { agent: document.agent, score };
+		})
+		.filter((result) => result.score > 0)
+		.sort((left, right) => right.score - left.score || left.agent.name.localeCompare(right.agent.name))
+		.slice(0, limit);
+}

package/src/agents/agent-serializer.ts CHANGED Viewed

@@ -20,6 +20,10 @@ export function serializeAgent(agent: AgentConfig): string {
 		line("systemPromptMode", agent.systemPromptMode),
 		line("inheritProjectContext", agent.inheritProjectContext),
 		line("inheritSkills", agent.inheritSkills),
+		line("memory", agent.memory),
+		line("loadMode", agent.loadMode),
+		line("defaultTools", agent.defaultTools ?? undefined),
+		line("contextMode", agent.contextMode),
 		line("triggers", agent.routing?.triggers),
 		line("useWhen", agent.routing?.useWhen),
 		line("avoidWhen", agent.routing?.avoidWhen),

package/src/agents/discover-agents.ts CHANGED Viewed

@@ -3,6 +3,7 @@ import * as path from "node:path";
 import type { AgentConfig, ResourceSource } from "./agent-config.ts";
 import { loadConfig, type LoadedPiTeamsConfig } from "../config/config.ts";
 import { parseCsv, parseFrontmatter } from "../utils/frontmatter.ts";
+import { logInternalError } from "../utils/internal-error.ts";
 import { packageRoot, projectCrewRoot, userPiRoot } from "../utils/paths.ts";
 export interface AgentDiscoveryResult {
@@ -19,6 +20,14 @@ function parseMemory(value: string | undefined): "user" | "project" | "local" |
 	return value === "user" || value === "project" || value === "local" ? value : undefined;
 }
+function parseLoadMode(value: string | undefined): "essential" | "lean" | undefined {
+	return value === "essential" || value === "lean" ? value : undefined;
+}
+function parseContextMode(value: string | undefined): "fresh" | "fork" | undefined {
+	return value === "fresh" || value === "fork" ? value : undefined;
+}
 function parseAgentFile(filePath: string, source: ResourceSource): AgentConfig | undefined {
 	try {
 		const content = fs.readFileSync(filePath, "utf-8");
@@ -43,13 +52,17 @@ function parseAgentFile(filePath: string, source: ResourceSource): AgentConfig |
 			extensions: frontmatter.extensions === "" ? [] : parseCsv(frontmatter.extensions),
 			skills: parseCsv(frontmatter.skills ?? frontmatter.skill),
 			systemPromptMode: frontmatter.systemPromptMode === "append" ? "append" : "replace",
-			inheritProjectContext: frontmatter.inheritProjectContext as unknown === true || frontmatter.inheritProjectContext === "true",
-		inheritSkills: frontmatter.inheritSkills as unknown === true || frontmatter.inheritSkills === "true",
+			inheritProjectContext: frontmatter.inheritProjectContext === "true",
+		inheritSkills: frontmatter.inheritSkills === "true",
 		memory: parseMemory(frontmatter.memory),
-		disabled: frontmatter.disabled as unknown === true || frontmatter.disabled === "true" || frontmatter.enabled as unknown === false || frontmatter.enabled === "false",
+		loadMode: parseLoadMode(frontmatter.loadMode),
+		defaultTools: frontmatter.defaultTools !== undefined ? parseCsv(frontmatter.defaultTools) ?? null : undefined,
+		contextMode: parseContextMode(frontmatter.contextMode),
+		disabled: frontmatter.disabled === "true" || frontmatter.enabled === "false",
 			routing: triggers || useWhen || avoidWhen || cost || category ? { triggers, useWhen, avoidWhen, cost, category } : undefined,
 		};
-	} catch {
+	} catch (error) {
+		logInternalError("discoverAgents.parseAgentFile", error, `filePath=${filePath}`);
 		return undefined;
 	}
 }

package/src/config/config.ts CHANGED Viewed

@@ -46,6 +46,7 @@ export interface CrewRuntimeConfig {
 	requirePlanApproval?: boolean;
 	completionMutationGuard?: CompletionMutationGuardMode;
 	effectivenessGuard?: EffectivenessGuardMode;
+	yield?: { enabled?: boolean; maxReminders?: number; reminderPrompt?: string };
 }
 export interface CrewControlConfig {
@@ -100,6 +101,11 @@ export interface CrewTelemetryConfig {
 	enabled?: boolean;
 }
+export interface CrewPolicyConfig {
+	requireIntentForDestructiveActions?: boolean;
+	disabledCapabilities?: string[];
+}
 export type CrewNotificationSeverity = "info" | "warning" | "error" | "critical";
 export interface CrewNotificationsConfig {
@@ -152,6 +158,7 @@ export interface PiTeamsConfig {
 	agents?: CrewAgentsConfig;
 	tools?: CrewToolsConfig;
 	telemetry?: CrewTelemetryConfig;
+	policy?: CrewPolicyConfig;
 	notifications?: CrewNotificationsConfig;
 	observability?: CrewObservabilityConfig;
 	reliability?: CrewReliabilityConfig;
@@ -363,6 +370,12 @@ function mergeConfig(base: PiTeamsConfig, override: PiTeamsConfig): PiTeamsConfi
 			...withoutUndefined((override.telemetry ?? {}) as Record<string, unknown>),
 		};
 	}
+	if (base.policy || override.policy) {
+		merged.policy = {
+			...(base.policy ?? {}),
+			...withoutUndefined((override.policy ?? {}) as Record<string, unknown>),
+		};
+	}
 	if (base.notifications || override.notifications) {
 		merged.notifications = {
 			...(base.notifications ?? {}),
@@ -619,6 +632,16 @@ function parseTelemetryConfig(value: unknown): CrewTelemetryConfig | undefined {
 	return Object.values(telemetry).some((entry) => entry !== undefined) ? telemetry : undefined;
 }
+function parsePolicyConfig(value: unknown): CrewPolicyConfig | undefined {
+	const obj = asRecord(value);
+	if (!obj) return undefined;
+	const policy: CrewPolicyConfig = {
+		requireIntentForDestructiveActions: parseWithSchema(Type.Boolean(), obj.requireIntentForDestructiveActions),
+		disabledCapabilities: parseWithSchema(Type.Array(Type.String()), obj.disabledCapabilities),
+	};
+	return Object.values(policy).some((entry) => entry !== undefined) ? policy : undefined;
+}
 function parseNotificationsConfig(value: unknown): CrewNotificationsConfig | undefined {
 	const obj = asRecord(value);
 	if (!obj) return undefined;
@@ -700,6 +723,7 @@ export function parseConfig(raw: unknown): PiTeamsConfig {
 		agents: parseAgentsConfig(obj.agents),
 		tools: parseToolsConfig(obj.tools),
 		telemetry: parseTelemetryConfig(obj.telemetry),
+		policy: parsePolicyConfig(obj.policy),
 		notifications: parseNotificationsConfig(obj.notifications),
 		observability: parseObservabilityConfig(obj.observability),
 		reliability: parseReliabilityConfig(obj.reliability),

package/src/config/defaults.ts CHANGED Viewed

@@ -12,6 +12,17 @@ export const DEFAULT_CHILD_PI = {
 	maxCompactContentChars: 4096,
 };
+export const DEFAULT_LIVE_SESSION = {
+	/** Maximum wall-clock time for a single live-session task before abort (ms). */
+	responseTimeoutMs: 5 * 60_000,
+	/** Maximum yield reminder attempts before accepting no-yield. */
+	maxYieldRetries: 3,
+	/** Polling interval for session idle check during yield enforcement (ms). */
+	yieldPollIntervalMs: 500,
+	/** Maximum time to wait for session idle after prompt (ms). */
+	idleWaitTimeoutMs: 60_000,
+};
 export const DEFAULT_LOCKS = {
 	staleMs: 30_000,
 };

package/src/extension/autonomous-policy.ts CHANGED Viewed

@@ -73,46 +73,39 @@ export function buildAutonomousPolicy(prompt: string, config: PiTeamsAutonomousC
 		? "Do not suggest worktree mode unless the user explicitly asks for it."
 		: "Consider workspaceMode: 'worktree' for parallel or risky code-changing work in clean git repositories.";
 	return [
-		"# pi-crew Autonomous Delegation Policy",
+		"# pi-crew Delegation Policy (MANDATORY)",
 		"",
 		`Autonomy profile: ${effective.profile}.`,
-		"You have access to the `team` tool for coordinated multi-agent work. Use it proactively when the task benefits from specialized roles, planning, review, verification, durable artifacts, async execution, or worktree isolation.",
+		"You MUST evaluate EVERY user request against the delegation decision table below BEFORE acting.",
 		"",
-		"Decision framework (not keyword-only):",
-		"- Treat a user-supplied task list with 2+ actionable bullets/numbered items as a delegation candidate even when no pi-crew keyword appears.",
-		"- Prefer `team` when tasks span multiple files/subsystems, require sequencing, combine implementation + tests/docs/review, or need independent exploration before edits.",
-		"- If unsure whether subtasks conflict, call `team` with action='recommend' first instead of manually splitting work.",
-		"- For assisted/aggressive autonomy and non-trivial multi-task work, prefer a team run or plan over direct single-agent execution.",
+		"## Decision Table — Do I delegate?",
 		"",
-		"Use `team` automatically when:",
-		"- The task spans multiple files, subsystems, or unclear code areas.",
-		"- The prompt contains a non-trivial task list, roadmap, checklist, migration plan, or ordered implementation plan.",
-		"- The task requires planning before implementation.",
-		"- The task asks for implementation plus tests, review, verification, migration, architecture, security review, or debugging.",
-		"- The task would benefit from explorer/planner/executor/reviewer/verifier roles.",
+		"| Task characteristic | Delegate? | Method |",
+		"|---|---|---|",
+		"| Read/edit 1-2 files, simple question | NO | Do it directly |",
+		"| User explicitly says \"do it yourself\" / \"no team\" | NO | Do it directly |",
+		"| Destructive action (delete/prune/forget) | NO | Ask for confirmation first |",
+		"| Research / deep-read / source audit / \"nghiên cứu\" | YES | `team action='research'` |",
+		"| Multi-file implementation / feature / refactor | YES | `team action='run', team='implementation'` |",
+		"| Small bug fix (1-2 files, clear cause) | YES | `team action='run', team='fast-fix'` |",
+		"| Code review / security review | YES | `team action='run', team='review'` |",
+		"| Task list with 2+ actionable items | YES | `team action='plan'` or `action='run'` |",
+		"| Need exploration before knowing scope | YES | `Agent(explorer)` or `team action='recommend'` |",
+		"| Parallel independent subtasks | YES | `team action='parallel'` or multiple `Agent` background |",
+		"| Unsure which team/workflow fits | YES | `team action='recommend'` first |",
 		"",
-		"Do not use `team` when:",
-		"- The user asks a simple factual question or tiny single-file edit.",
-		"- The user explicitly asks you to work directly without delegation.",
-		"- The tasks clearly modify the same small file region and can be completed safer by one agent without parallel fanout.",
-		"- The action is destructive (`delete`, `forget`, `prune`, forced cleanup) and the user has not explicitly confirmed it.",
+		"## Rules",
 		"",
-		"Recommended mappings:",
-		"- Complex feature/refactor/migration -> action='run', team='implementation'.",
-		"- Small bug fix -> action='run', team='fast-fix'.",
-		"- Code/security review -> action='run', team='review'.",
-		"- Research or documentation synthesis -> action='run', team='research'.",
-		"- Unsure which team/workflow to use -> call the `team` tool with action='recommend' and the user's goal, then follow the suggested plan/run call if appropriate.",
-		"- After delegating exploration/research/review, do not duplicate the same search manually. Continue only with non-overlapping work.",
-		"- Before claiming delegated work is complete, inspect the run with action='status' or action='summary'.",
-		"- Unsure or risky work -> action='plan' first, then run the selected team.",
+		"1. If the task needs reading >3 files OR editing >2 files OR has a research/review/planning component → you MUST delegate via `team` or `Agent`. Do not do it yourself.",
+		"2. After delegating, do NOT duplicate the same exploration/reading/review manually. Continue only with non-overlapping work.",
+		"3. Before claiming delegated work is complete, verify with `team action='status'` or `action='summary'`.",
+		"4. If unsure whether subtasks conflict, use `team action='recommend'` first.",
+		"5. For parallel implementation, prefer `workspaceMode: 'worktree'` in clean git repos.",
 		"",
-		"Conflict-safe task splitting:",
-		"- Do not parallelize subtasks that may edit the same file, same symbol, same migration path, package manifest, lockfile, or generated schema unless a planner explicitly sequences them.",
-		"- For potential overlap, use plan/recommend first, assign one owner per file/symbol, and require workers to report intended changed files before editing.",
-		"- Prefer workspaceMode: 'worktree' for parallel implementation in clean git repositories, but still avoid merging overlapping edits without review.",
-		"- If workers discover overlap, blockers, missing requirements, or need leader decisions, they must use mailbox/status artifacts to ask the leader/orchestrator and pause risky edits.",
-		"- The leader should resolve conflicts by sequencing, narrowing scope, or reassigning ownership before continuing.",
+		"## Conflict-safe task splitting",
+		"- Do not parallelize subtasks that may edit the same file, same symbol, or same lockfile.",
+		"- Assign one owner per file/symbol. Workers must report intended changed files before editing.",
+		"- If workers discover overlap, they must pause and ask the leader to resolve.",
 		"",
 		asyncGuidance,
 		worktreeGuidance,