npm - pi-crew - Versions diffs - 0.5.1 → 0.5.5 - Mend

pi-crew 0.5.1 → 0.5.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (132) hide show

package/CHANGELOG.md +95 -0
package/README.md +1 -1
package/docs/actions-reference.md +87 -0
package/docs/bugs/cross-session-notification-leakage.md +82 -0
package/docs/coding-agent-optimization.md +268 -0
package/docs/commands-reference.md +5 -0
package/docs/deep-review-report.md +384 -0
package/docs/distillation/cybersecurity-patterns.md +294 -0
package/docs/migration-v0.4-v0.5.md +191 -0
package/docs/optimization-plan.md +642 -0
package/docs/pi-crew-bugs.md +6 -0
package/docs/pi-mono-opportunities.md +969 -0
package/docs/pi-mono-review.md +291 -0
package/{skills → docs/skills}/REFERENCE.md +13 -5
package/index.ts +1 -1
package/package.json +19 -16
package/skills/artifact-analysis-loop/SKILL.md +302 -0
package/skills/async-worker-recovery/SKILL.md +19 -1
package/skills/child-pi-spawning/SKILL.md +19 -6
package/skills/context-artifact-hygiene/SKILL.md +19 -2
package/skills/delegation-patterns/SKILL.md +68 -3
package/skills/detection-pipeline-design/SKILL.md +285 -0
package/skills/event-log-tracing/SKILL.md +20 -6
package/skills/git-master/SKILL.md +20 -6
package/skills/hunting-investigation-loop/SKILL.md +401 -0
package/skills/incident-playbook-construction/SKILL.md +383 -0
package/skills/live-agent-lifecycle/SKILL.md +20 -6
package/skills/mailbox-interactive/SKILL.md +19 -6
package/skills/model-routing-context/SKILL.md +19 -1
package/skills/multi-perspective-review/SKILL.md +19 -4
package/skills/observability-reliability/SKILL.md +19 -2
package/skills/orchestration/SKILL.md +20 -2
package/skills/ownership-session-security/SKILL.md +20 -2
package/skills/pi-extension-lifecycle/SKILL.md +20 -2
package/skills/post-mortem/SKILL.md +7 -2
package/skills/read-only-explorer/SKILL.md +20 -6
package/skills/requirements-to-task-packet/SKILL.md +23 -3
package/skills/resource-discovery-config/SKILL.md +20 -2
package/skills/runtime-state-reader/SKILL.md +20 -2
package/skills/safe-bash/SKILL.md +21 -6
package/skills/scrutinize/SKILL.md +20 -2
package/skills/secure-agent-orchestration-review/SKILL.md +29 -2
package/skills/security-review/SKILL.md +560 -0
package/skills/state-mutation-locking/SKILL.md +22 -2
package/skills/systematic-debugging/SKILL.md +8 -6
package/skills/threat-hypothesis-framework/SKILL.md +175 -0
package/skills/ui-render-performance/SKILL.md +20 -2
package/skills/verification-before-done/SKILL.md +17 -2
package/skills/widget-rendering/SKILL.md +21 -6
package/skills/workspace-isolation/SKILL.md +20 -6
package/skills/worktree-isolation/SKILL.md +20 -6
package/src/agents/agent-config.ts +40 -1
package/src/benchmark/benchmark-runner.ts +245 -0
package/src/benchmark/feedback-loop.ts +66 -0
package/src/config/config.ts +22 -5
package/src/config/role-tools.ts +82 -0
package/src/config/types.ts +4 -0
package/src/extension/async-notifier.ts +1 -1
package/src/extension/autonomous-policy.ts +1 -1
package/src/extension/crew-cleanup.ts +114 -0
package/src/extension/cross-extension-rpc.ts +1 -1
package/src/extension/plan-orchestrate.ts +322 -0
package/src/extension/register.ts +46 -44
package/src/extension/registration/command-utils.ts +1 -1
package/src/extension/registration/commands.ts +1 -1
package/src/extension/registration/compaction-guard.ts +1 -1
package/src/extension/registration/subagent-helpers.ts +1 -1
package/src/extension/registration/subagent-tools.ts +1 -1
package/src/extension/registration/team-tool.ts +1 -1
package/src/extension/registration/viewers.ts +1 -1
package/src/extension/session-summary.ts +1 -1
package/src/extension/team-manager-command.ts +1 -1
package/src/extension/team-tool/context.ts +1 -1
package/src/extension/team-tool/handle-schedule.ts +183 -0
package/src/extension/team-tool/orchestrate.ts +102 -0
package/src/extension/team-tool/run.ts +222 -35
package/src/extension/team-tool.ts +10 -0
package/src/extension/tool-result.ts +1 -1
package/src/i18n.ts +1 -1
package/src/observability/event-bus.ts +60 -0
package/src/observability/event-to-metric.ts +1 -1
package/src/prompt/prompt-runtime.ts +1 -1
package/src/runtime/background-runner.ts +35 -7
package/src/runtime/child-pi.ts +122 -34
package/src/runtime/crash-recovery.ts +1 -1
package/src/runtime/crew-agent-runtime.ts +1 -0
package/src/runtime/crew-hooks.ts +240 -0
package/src/runtime/custom-tools/irc-tool.ts +1 -1
package/src/runtime/custom-tools/submit-result-tool.ts +1 -1
package/src/runtime/diagnostic-export.ts +38 -2
package/src/runtime/foreground-control.ts +87 -17
package/src/runtime/foreground-watchdog.ts +1 -1
package/src/runtime/live-session-runtime.ts +1 -1
package/src/runtime/mcp-proxy.ts +1 -1
package/src/runtime/pi-args.ts +11 -1
package/src/runtime/pi-json-output.ts +31 -0
package/src/runtime/pi-spawn.ts +20 -4
package/src/runtime/process-status.ts +15 -2
package/src/runtime/progress-tracker.ts +124 -0
package/src/runtime/runtime-resolver.ts +1 -1
package/src/runtime/session-resources.ts +1 -1
package/src/runtime/skill-effectiveness.ts +473 -0
package/src/runtime/skill-instructions.ts +37 -3
package/src/runtime/task-runner.ts +122 -18
package/src/runtime/team-runner.ts +17 -11
package/src/runtime/tool-progress.ts +10 -3
package/src/runtime/verification-gates.ts +367 -0
package/src/schema/team-tool-schema.ts +31 -1
package/src/state/crew-init.ts +56 -38
package/src/state/decision-ledger.ts +344 -0
package/src/state/event-log.ts +136 -10
package/src/state/hook-instinct-bridge.ts +90 -0
package/src/state/hook-integrations.ts +51 -0
package/src/state/instinct-store.ts +249 -0
package/src/state/run-metrics.ts +135 -0
package/src/state/state-store.ts +3 -1
package/src/state/tiered-eval.ts +471 -0
package/src/state/types-eval.ts +58 -0
package/src/state/types.ts +7 -0
package/src/tools/safe-bash-extension.ts +5 -5
package/src/types/new-api-types.ts +34 -0
package/src/ui/agent-management-overlay.ts +5 -1
package/src/ui/crew-widget.ts +30 -16
package/src/ui/pi-ui-compat.ts +1 -1
package/src/ui/powerbar-publisher.ts +100 -7
package/src/ui/run-action-dispatcher.ts +1 -1
package/src/ui/tool-render.ts +17 -17
package/src/utils/project-detector.ts +160 -0
package/src/utils/session-utils.ts +52 -0
package/src/worktree/worktree-manager.ts +32 -13
package/test-bugs-all.mjs +1 -1
package/skills/.gitkeep +0 -0

package/skills/detection-pipeline-design/SKILL.md ADDED Viewed

@@ -0,0 +1,285 @@
+---
+name: detection-pipeline-design
+description: "Design data pipelines for security monitoring and threat intelligence."
+triggers:
+  - "build pipeline"
+  - "design detection"
+  - "setup monitoring"
+  - "enrich data"
+  - "threat intelligence"
+---
+# detection-pipeline-design
+Use this skill when designing data pipelines for security detection and enrichment.
+## Source
+Distilled from `building-ioc-enrichment-pipeline-with-opencti` (Anthropic Cybersecurity Skills) and generalized for software/build context.
+## When to Use
+- Building detection and monitoring systems
+- Designing security data pipelines
+- Setting up automated threat intelligence
+- Creating alert enrichment workflows
+- Integrating security scanning into CI/CD
+## Pipeline Architecture
+```
+┌─────────┐    ┌──────────┐    ┌──────────┐    ┌─────────┐    ┌─────────┐
+│  Input  │ → │ Transform│ → │  Enrich  │ → │  Score  │ → │  Route  │
+│  Data   │    │  (Norm)  │    │ (Context)│    │ (Conf)  │    │(Action) │
+└─────────┘    └──────────┘    └──────────┘    └─────────┘    └─────────┘
+                                    ↓
+                              ┌──────────┐
+                              │  Output  │
+                              │ Findings │
+                              └──────────┘
+```
+## Pipeline Components
+### 1. Input Stage
+```yaml
+input:
+  types:
+    - name: file_change
+      sources: [git, filesystem]
+    - name: log_event
+      sources: [application, system]
+    - name: alert
+      sources: [scanner, monitor]
+    - name: dependency
+      sources: [npm, pip, cargo]
+  format: [json, plain_text, structured]
+  polling: [real_time, batch, scheduled]
+```
+### 2. Transform Stage
+```yaml
+transform:
+  operations:
+    - name: normalize
+      description: Convert to standard format
+      output: stix_like_object
+    - name: extract_indicators
+      description: Pull out IOCs
+      extract: [ips, domains, hashes, credentials, tokens]
+    - name: enrich_metadata
+      description: Add context
+      add: [file_type, language, framework, timestamp]
+  output_format: json
+```
+### 3. Enrich Stage
+```yaml
+enrich:
+  internal_sources:
+    - name: vulnerability_db
+      query: [cve_id, cwe]
+    - name: code_analysis
+      query: [pattern, structure]
+    - name: git_history
+      query: [author, commit, diff]
+  external_sources:
+    - name: npm_audit
+      api: npmjs.org
+    - name: osv
+      api: osv.dev
+    - name: gh_advisory
+      api: github.com/advisories
+  async: true
+  timeout_ms: 5000
+```
+### 4. Score Stage
+```yaml
+score:
+  confidence_calculation:
+    factors:
+      - name: source_reliability
+        weight: 0.3
+        scale: [0-10]
+      - name: contextual_evidence
+        weight: 0.4
+        scale: [0-10]
+      - name: historical_matches
+        weight: 0.3
+        scale: [0-10]
+  formula: >
+    (reliability * 0.3) +
+    (evidence * 0.4) +
+    (historical * 0.3)
+  thresholds:
+    critical: [90-100]
+    high: [70-89]
+    medium: [40-69]
+    low: [0-39]
+```
+### 5. Route Stage
+```yaml
+route:
+  paths:
+    - condition: "score >= 90"
+      action: [alert, block, notify]
+      destination: [security_team, incident_response]
+    - condition: "score >= 70"
+      action: [alert, review]
+      destination: [security_queue]
+    - condition: "score >= 40"
+      action: [log, monitor]
+      destination: [security_logs]
+    - condition: "score < 40"
+      action: [ignore]
+      destination: []
+```
+## Pipeline Design Patterns
+### Pattern 1: Real-time File Monitoring
+```yaml
+pipeline:
+  name: file-change-detection
+  trigger:
+    type: filesystem_watch
+    paths: ["src/**/*.ts", "src/**/*.js"]
+  transform:
+    - extract: [imports, function_calls, secrets]
+  enrich:
+    - check: npm_audit
+    - check: known_vulnerable_patterns
+  score:
+    - base: vulnerability_severity
+    - modifier: exploitability
+  route:
+    critical: slack_alert + block_merge
+    high: github_issue + notify
+    medium: log + track
+```
+### Pattern 2: Dependency Vulnerability Pipeline
+```yaml
+pipeline:
+  name: dependency-vuln-scan
+  trigger:
+    type: package_lock_change
+  transform:
+    - extract: [package_names, versions, sources]
+  enrich:
+    - query: osv_database
+    - query: npm_advisories
+    - query: github_advisories
+  score:
+    - base: cvss_score
+    - modifier: [has_exploit, is_dependencies]
+  route:
+    critical: [create_security_issue, alert_team]
+    high: [create_issue, schedule_fix]
+    medium: [add_to_backlog]
+    low: [note_in_changelog]
+```
+### Pattern 3: Secret Detection Pipeline
+```yaml
+pipeline:
+  name: secret-detection
+  trigger:
+    type: git_push
+  transform:
+    - extract: [api_keys, tokens, passwords, credentials]
+  enrich:
+    - validate: key_format
+    - check: blacklists
+  score:
+    - base: key_validity
+    - modifier: [key_age, exposure_scope]
+  route:
+    critical: [revoke_key, alert_security, block_push]
+    high: [notify_owner, rotate_key]
+    medium: [flag_for_review]
+    low: [log]
+```
+## Implementation Example
+```typescript
+interface DetectionPipeline {
+  name: string;
+  input: InputConfig;
+  transform: TransformConfig;
+  enrich: EnrichConfig;
+  score: ScoreConfig;
+  route: RouteConfig;
+}
+async function runPipeline(pipeline: DetectionPipeline, data: unknown): Promise<PipelineResult> {
+  // 1. Input validation
+  const normalized = normalizeInput(data, pipeline.input);
+  // 2. Transform - extract indicators
+  const indicators = extractIndicators(normalized, pipeline.transform);
+  // 3. Enrich - query external/internal sources
+  const enriched = await enrichIndicators(indicators, pipeline.enrich);
+  // 4. Score - calculate confidence
+  const scored = calculateScore(enriched, pipeline.score);
+  // 5. Route - determine action
+  const action = determineAction(scored, pipeline.route);
+  return { indicators, enriched, scored, action };
+}
+```
+## Enforcement — Detection Pipeline Design Gate
+**Before deploying detection pipelines, verify:**
+- [ ] Input format validated before transform stage
+- [ ] Scoring thresholds tuned to environment (not hardcoded defaults)
+- [ ] Confidence calculation includes multiple factors (reliability, evidence, history)
+- [ ] Route actions match score thresholds (critical → block, low → ignore)
+- [ ] False positive rate measured and acceptable
+- [ ] External API calls are async (non-blocking)
+If ANY answer is NO → Stop. Tune the pipeline before deploying.
+## Anti-Patterns
+- **Don't** skip input validation (garbage in, garbage out)
+- **Don't** skip enrichment (missing context leads to false positives)
+- **Don't** use fixed thresholds (tune based on environment)
+- **Don't** ignore false positive rates (kills analyst productivity)
+- **Don't** block on external APIs in synchronous path (use async)
+## Tools & Integrations
+| Tool | Pipeline Role |
+|------|---------------|
+| `semgrep` | Static analysis, pattern matching |
+| `npm audit` | Dependency vulnerability |
+| `trufflehog` | Secret scanning |
+| `grype` | Container vulnerability |
+| `syft` | SBOM generation |
+## Verification
+For pipeline design changes:
+```bash
+cd pi-crew
+npx tsc --noEmit
+node --experimental-strip-types --test test/unit/detection-pipeline.test.ts
+```
+*See also: `detection-signature-authoring` (in security-review) for detection rule patterns.*

package/skills/event-log-tracing/SKILL.md CHANGED Viewed

@@ -1,8 +1,13 @@
 ---
 name: event-log-tracing
-description: "Structured event logging for worker lifecycle, live agents, crash recovery. Use when debugging crashes, tracing agent lifecycle, investigating stale runs. Triggers: event log, trace events, worker crashed, agent died, stale run, events.jsonl."
+description: "Structured event logging for worker lifecycle, live agents, crash recovery."
+triggers:
+  - "event log"
+  - "trace events"
+  - "worker crashed"
+  - "agent died"
+  - "stale run"
 ---
 # event-log-tracing
 Every pi-crew run writes a persistent event log at `.crew/state/runs/<runId>/events.jsonl`. Events are the primary evidence for understanding what happened — especially when workers crash, agents get stuck, or runs become orphaned.
@@ -31,8 +36,6 @@ Every event is a JSON object on one line:
 **Optional fields:** `taskId`, `message`, `data`, `metadata`
 **Metadata auto-populated:** `seq` (line number), `provenance` (who wrote it), `fingerprint` (for terminal events)
----
 ## Event Taxonomy
 ### Worker Lifecycle Events (from child-pi.ts via onLifecycleEvent callback)
@@ -112,8 +115,6 @@ These track the full lifecycle from spawn to cleanup.
 | `crew.run.reconciled_stale` | `reconcileStaleRun` repaired a stale run | `{verdict}` |
 | `crew.run.orphan_cancelled` | `cancelOrphanedRuns` cancelled a run | `{ownerSessionId, cancelledTasks}` |
----
 ## appendEvent Pipeline
 ```
@@ -257,6 +258,19 @@ crew.run.reconciled_stale verdict=pid_dead
 ---
+## Enforcement — Event Log Tracing Gate
+**Before interpreting events or debugging crashes, verify:**
+- [ ] Event format validated (required fields: time, type, runId present)
+- [ ] runId correlation confirmed (all events have same runId for the trace)
+- [ ] Terminal events have fingerprints (completed/failed/cancelled)
+- [ ] Event sequence matches expected lifecycle pattern
+- [ ] Corrupt JSONL handled (skip malformed lines, don't fail entire read)
+- [ ] Secrets redacted in data fields before logging
+If ANY answer is NO → Stop. Re-examine event source and format.
 ## Anti-patterns
 - **`logInternalError` only logs in debug mode**: Production errors are silent — `events.jsonl` is the only durable evidence. Always emit events, never rely on `console.error`.

package/skills/git-master/SKILL.md CHANGED Viewed

@@ -1,8 +1,13 @@
 ---
 name: git-master
-description: Commit and release hygiene for safe version-control work. Use when preparing commits, releases, version bumps, publishing, or validating package installation.
+description: "Commit and release hygiene for safe version-control work."
+triggers:
+  - "commit this"
+  - "tag release"
+  - "bump version"
+  - "publish package"
+  - "prepare release"
 ---
 # git-master
 Use this skill for commit/release hygiene. This skill covers git workflow from local changes to published releases.
@@ -186,6 +191,19 @@ git stash drop          # remove latest stash
 git stash clear         # remove all stashes
 ```
+## Enforcement — Git Master Gate
+**Before committing or publishing, verify:**
+- [ ] `git status` reviewed — only related files staged
+- [ ] `git diff --staged` reviewed — no unintended changes
+- [ ] Tests pass locally (`npm test` or appropriate test command)
+- [ ] No secrets in staged changes (API keys, tokens, passwords)
+- [ ] Commit message follows format: `type(scope): subject` (50 chars or less)
+- [ ] No generated files staged unless intentional
+If ANY answer is NO → Stop. Fix issues before committing.
 ## Anti-patterns
 - **Committing generated files**: Don't commit `dist/`, `build/`, `*.min.js` unless intentional
@@ -195,8 +213,6 @@ git stash clear         # remove all stashes
 - **Committing secrets**: Check for `API_KEY`, `TOKEN`, `PASSWORD`, `SECRET` before staging
 - **Unclear messages**: "fix stuff" is not a valid commit message
----
 ## Source patterns
 - `src/state/atomic-write.ts` — atomic git-safe file writes
@@ -204,8 +220,6 @@ git stash clear         # remove all stashes
 - `src/utils/conflict-detect.ts` — git conflict detection
 - `package.json` — version field, publish scripts
----
 ## Verification
 ```bash