npm - pi-crew - Versions diffs - 0.5.2 → 0.5.6 - Mend

pi-crew 0.5.2 → 0.5.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (137) hide show

package/CHANGELOG.md +183 -0
package/README.md +17 -1
package/docs/architecture.md +2 -0
package/docs/bugs/cross-session-notification-leakage.md +82 -0
package/docs/coding-agent-optimization.md +268 -0
package/docs/deep-review-report.md +384 -0
package/docs/distillation/cybersecurity-patterns.md +294 -0
package/docs/migration-v0.4-v0.5.md +208 -0
package/docs/optimization-plan.md +642 -0
package/docs/pi-crew-v0.5.5-audit-fix-plan.md +133 -0
package/docs/pi-mono-opportunities.md +969 -0
package/docs/pi-mono-review.md +291 -0
package/docs/skills/REFERENCE.md +144 -0
package/package.json +12 -9
package/skills/artifact-analysis-loop/SKILL.md +302 -0
package/skills/async-worker-recovery/SKILL.md +19 -1
package/skills/child-pi-spawning/SKILL.md +19 -6
package/skills/context-artifact-hygiene/SKILL.md +19 -2
package/skills/delegation-patterns/SKILL.md +68 -3
package/skills/detection-pipeline-design/SKILL.md +285 -0
package/skills/event-log-tracing/SKILL.md +20 -6
package/skills/git-master/SKILL.md +20 -6
package/skills/hunting-investigation-loop/SKILL.md +401 -0
package/skills/incident-playbook-construction/SKILL.md +383 -0
package/skills/live-agent-lifecycle/SKILL.md +20 -6
package/skills/mailbox-interactive/SKILL.md +19 -6
package/skills/model-routing-context/SKILL.md +19 -1
package/skills/multi-perspective-review/SKILL.md +19 -4
package/skills/observability-reliability/SKILL.md +19 -2
package/skills/orchestration/SKILL.md +20 -2
package/skills/ownership-session-security/SKILL.md +20 -2
package/skills/pi-extension-lifecycle/SKILL.md +20 -2
package/skills/post-mortem/SKILL.md +7 -2
package/skills/read-only-explorer/SKILL.md +20 -6
package/skills/requirements-to-task-packet/SKILL.md +23 -3
package/skills/resource-discovery-config/SKILL.md +20 -2
package/skills/runtime-state-reader/SKILL.md +20 -2
package/skills/safe-bash/SKILL.md +21 -6
package/skills/scrutinize/SKILL.md +20 -2
package/skills/secure-agent-orchestration-review/SKILL.md +29 -2
package/skills/security-review/SKILL.md +560 -0
package/skills/state-mutation-locking/SKILL.md +22 -2
package/skills/systematic-debugging/SKILL.md +8 -6
package/skills/threat-hypothesis-framework/SKILL.md +175 -0
package/skills/ui-render-performance/SKILL.md +20 -2
package/skills/verification-before-done/SKILL.md +17 -2
package/skills/widget-rendering/SKILL.md +21 -6
package/skills/workspace-isolation/SKILL.md +20 -6
package/skills/worktree-isolation/SKILL.md +20 -6
package/src/agents/agent-config.ts +40 -1
package/src/benchmark/benchmark-runner.ts +45 -0
package/src/benchmark/feedback-loop.ts +5 -0
package/src/config/config.ts +32 -5
package/src/config/role-tools.ts +82 -0
package/src/config/suggestions.ts +8 -0
package/src/config/types.ts +4 -0
package/src/extension/async-notifier.ts +10 -1
package/src/extension/crew-cleanup.ts +114 -0
package/src/extension/cross-extension-rpc.ts +1 -1
package/src/extension/notification-router.ts +18 -0
package/src/extension/register.ts +27 -19
package/src/extension/registration/subagent-tools.ts +1 -1
package/src/extension/team-tool/anchor.ts +201 -0
package/src/extension/team-tool/api.ts +2 -1
package/src/extension/team-tool/auto-summarize.ts +154 -0
package/src/extension/team-tool/run.ts +42 -7
package/src/extension/team-tool.ts +44 -2
package/src/hooks/registry.ts +1 -3
package/src/observability/event-bus.ts +69 -0
package/src/observability/event-to-metric.ts +0 -2
package/src/runtime/anchor-manager.ts +473 -0
package/src/runtime/async-runner.ts +8 -4
package/src/runtime/auto-summarize.ts +350 -0
package/src/runtime/background-runner.ts +10 -3
package/src/runtime/budget-tracker.ts +354 -0
package/src/runtime/chain-runner.ts +507 -0
package/src/runtime/child-pi.ts +123 -35
package/src/runtime/crash-recovery.ts +5 -4
package/src/runtime/crew-agent-runtime.ts +1 -0
package/src/runtime/custom-tools/irc-tool.ts +13 -0
package/src/runtime/custom-tools/submit-result-tool.ts +3 -2
package/src/runtime/delivery-coordinator.ts +10 -3
package/src/runtime/dynamic-script-runner.ts +482 -0
package/src/runtime/foreground-control.ts +87 -17
package/src/runtime/handoff-manager.ts +589 -0
package/src/runtime/hidden-handoff.ts +424 -0
package/src/runtime/live-agent-manager.ts +20 -4
package/src/runtime/live-session-runtime.ts +39 -4
package/src/runtime/manifest-cache.ts +2 -1
package/src/runtime/model-resolver.ts +16 -4
package/src/runtime/phase-tracker.ts +373 -0
package/src/runtime/pi-args.ts +11 -1
package/src/runtime/pi-json-output.ts +31 -0
package/src/runtime/pipeline-runner.ts +514 -0
package/src/runtime/progress-tracker.ts +124 -0
package/src/runtime/retry-runner.ts +354 -0
package/src/runtime/sandbox.ts +252 -0
package/src/runtime/scheduler.ts +7 -2
package/src/runtime/skill-effectiveness.ts +473 -0
package/src/runtime/skill-instructions.ts +37 -3
package/src/runtime/subagent-manager.ts +1 -1
package/src/runtime/task-graph.ts +11 -1
package/src/runtime/task-runner.ts +92 -18
package/src/runtime/team-runner.ts +13 -12
package/src/runtime/tool-progress.ts +10 -3
package/src/runtime/verification-gates.ts +367 -0
package/src/schema/team-tool-schema.ts +37 -0
package/src/skills/discover-skills.ts +5 -0
package/src/state/active-run-registry.ts +9 -2
package/src/state/contracts.ts +9 -0
package/src/state/crew-init.ts +3 -3
package/src/state/decision-ledger.ts +98 -55
package/src/state/event-log-rotation.ts +2 -2
package/src/state/event-log.ts +144 -10
package/src/state/hook-instinct-bridge.ts +5 -5
package/src/state/mailbox.ts +10 -0
package/src/state/run-cache.ts +18 -8
package/src/state/state-store.ts +3 -1
package/src/state/types.ts +4 -0
package/src/tools/safe-bash-extension.ts +1 -0
package/src/tools/safe-bash.ts +152 -20
package/src/types/new-api-types.ts +34 -0
package/src/ui/agent-management-overlay.ts +5 -1
package/src/ui/crew-widget.ts +29 -15
package/src/ui/overlays/mailbox-detail-overlay.ts +13 -2
package/src/ui/powerbar-publisher.ts +101 -7
package/src/ui/tool-render.ts +15 -15
package/src/ui/transcript-cache.ts +13 -0
package/src/utils/bm25-search.ts +16 -8
package/src/utils/env-filter.ts +8 -5
package/src/utils/redaction.ts +169 -15
package/src/utils/session-utils.ts +52 -0
package/src/utils/sse-parser.ts +10 -1
package/src/worktree/cleanup.ts +6 -1
package/src/worktree/worktree-manager.ts +32 -13
package/workflows/chain.workflow.md +252 -0
package/workflows/pipeline.workflow.md +27 -0

package/skills/hunting-investigation-loop/SKILL.md ADDED Viewed

@@ -0,0 +1,401 @@
+---
+name: hunting-investigation-loop
+description: "Active hypothesis-driven investigation and threat hunting."
+triggers:
+  - "hunt for"
+  - "find evidence of"
+  - "investigate"
+  - "active search"
+  - "forensic hunt"
+---
+# hunting-investigation-loop
+Use this skill when conducting active, hypothesis-driven threat hunting and investigation.
+## Source
+Distilled from 28 `hunting-for-*` skills (Anthropic Cybersecurity Skills) and generalized for software/codebase context.
+## When to Use
+- Proactively hunting for indicators of compromise
+- Investigating suspicious patterns without clear incident
+- Periodic security assessments
+- After threat intelligence suggests specific patterns
+- Purple team exercises
+## Core Loop
+```
+┌─────────────┐    ┌─────────────┐    ┌─────────────┐    ┌─────────────┐
+│   Form      │ →  │  Locate     │ →  │   Query     │ →  │   Analyze   │
+│ Hypothesis  │    │ Data Sources│    │   Search    │    │   Results   │
+└─────────────┘    └─────────────┘    └─────────────┘    └─────────────┘
+                                                          ↓
+┌─────────────┐    ┌─────────────┐    ┌─────────────┐    ┌─────────────┐
+│   Report    │ ←  │  Document   │ ←  │   Scope     │ ←  │  Validate   │
+│  Findings   │    │  Evidence   │    │  Extent     │    │  Findings   │
+└─────────────┘    └─────────────┘    └─────────────┘    └─────────────┘
+```
+## Investigation Loop
+```markdown
+## Hunting Investigation Loop
+1. **Form Hypothesis** → "There might be [vulnerability/pattern] in [location]"
+2. **Identify Hunt** → Search location: [files, commits, logs, configs]
+3. **Execute Search** → Query: [grep, regex, pattern match]
+4. **Analyze Results** → Filter: [true_positive, false_positive, noise]
+5. **Validate** → Confirm: [secondary source, cross-reference]
+6. **Scope** → Extent: [how many files, lines, occurrences]
+7. **Document** → Findings: [file, line, pattern, severity]
+```
+## Hunt Structure
+```yaml
+hunt:
+  id: string                    # e.g., "HUNT-2026-001"
+  hypothesis: string            # What we're testing
+  technique: string             # e.g., "credential_theft", "injection"
+  status: [planned|running|completed|cancelled]
+  data_sources:
+    - name: string
+      type: [file|commit|log|config|database]
+      locations: [paths, globs, queries]
+      priority: [high|medium|low]
+  search_patterns:
+    - pattern: string
+      type: [regex|AST|signature|heuristic]
+      context_needed: int        # Lines before/after
+      expected_findings: int     # Estimated findings
+  validation:
+    methods:
+      - name: string
+        description: string
+        expected: string         # What validation should confirm
+    cross_references:
+      - source: string
+        query: string
+  findings:
+    - file: string
+      line: number
+      evidence: string
+      confidence: [high|medium|low]
+      validated: boolean
+  scope:
+    total_findings: int
+    files_affected: int
+    severity: [critical|high|medium|low]
+  next_actions:
+    - investigate: [further analysis needed]
+    - contain: [immediate action required]
+    - remediate: [fix required]
+    - close: [false positive, no action]
+```
+## Hypothesis Templates
+### Template 1: Credential Pattern Hunt
+```yaml
+hypothesis:
+  id: HUNT-2026-CRED-001
+  title: Hardcoded credentials in codebase
+  technique: credential_exposure
+  data_sources:
+    - name: source_code
+      type: file
+      locations: ["**/*.ts", "**/*.js", "**/*.py"]
+    - name: config_files
+      type: file
+      locations: ["**/*.json", "**/*.yaml", "**/*.env"]
+  search_patterns:
+    - pattern: '(password|secret|token|key)\s*[=:]\s*["\'][^"\']{10,}'
+      type: regex
+    - pattern: 'process\.env\.[A-Z_]{5,}'
+      type: regex
+  validation:
+    - method: git_history_check
+      description: Check if credentials were ever committed
+    - method: secret_scanner
+      description: Run trufflehog to confirm
+```
+### Template 2: Injection Pattern Hunt
+```yaml
+hypothesis:
+  id: HUNT-2026-INJ-001
+  title: Code injection vulnerabilities
+  technique: command_injection
+  data_sources:
+    - name: source_code
+      type: file
+      locations: ["**/*.ts", "**/*.js", "**/*.py", "**/*.go"]
+  search_patterns:
+    - pattern: '(eval|exec|Function|spawn)\s*\('
+      type: regex
+    - pattern: 'child_process.*exec.*template'
+      type: AST
+  validation:
+    - method: confirm_user_input_taint
+      description: Check if eval input includes user data
+    - method: test_in_sandbox
+      description: Execute with controlled input
+```
+### Template 3: Supply Chain Hunt
+```yaml
+hypothesis:
+  id: HUNT-2026-SUPPLY-001
+  title: Dependency confusion or typosquatting
+  technique: supply_chain_attack
+  data_sources:
+    - name: package_manifest
+      type: file
+      locations: ["package.json", "requirements.txt", "Cargo.toml"]
+  search_patterns:
+    - pattern: '"@private/.*"'
+      type: regex
+    - pattern: 'version.*>.*9999999'
+      type: regex
+  validation:
+    - method: npm_audit
+      description: Check for malicious packages
+    - method: typosquat_check
+      description: Check for similar package names
+```
+### Template 4: Persistence Mechanism Hunt
+```yaml
+hypothesis:
+  id: HUNT-2026-PERS-001
+  title: Malicious persistence mechanisms
+  technique: persistence
+  data_sources:
+    - name: startup_files
+      type: file
+      locations: ["**/startup/**", "**/init/**", "**/.profile"]
+    - name: cron_configs
+      type: file
+      locations: ["**/cron/**", "**/.crontab"]
+    - name: systemd
+      type: file
+      locations: ["**/*.service", "**/systemd/**"]
+  search_patterns:
+    - pattern: '(wget|curl).*\|.*(bash|sh)'
+      type: regex
+    - pattern: 'nohup.*background'
+      type: regex
+  validation:
+    - method: confirm_evil_binary
+      description: Check downloaded binary hash
+    - method: network_check
+      description: Check for suspicious network activity
+```
+## Hunt Execution
+### Phase 1: Form Hypothesis
+Before starting a hunt, clearly define:
+- What you're looking for
+- Why you think it might exist
+- Where to look
+- How to confirm
+```markdown
+## Hypothesis Formulation Checklist
+- [ ] Clear technique/pattern being hunted
+- [ ] Known attack chain context
+- [ ] Data sources identified
+- [ ] Search patterns defined
+- [ ] Validation method specified
+- [ ] False positive patterns identified
+```
+### Phase 2: Execute Search
+Run searches in priority order:
+```bash
+# High priority - common locations
+rg -n "pattern" --type ts src/ | head -50
+# Config files
+rg -n "pattern" --type json --type yaml config/ | head -20
+# Check for encoded/obfuscated
+rg -n "atob|b64decode|base64" --type js | head -20
+```
+### Phase 3: Analyze Results
+Filter findings by:
+1. **True Positive** - Actual vulnerability/indicator
+2. **False Positive** - Known benign pattern
+3. **Noise** - Irrelevant matches
+```yaml
+analysis:
+  true_positives:
+    count: int
+    examples:
+      - file: path
+        line: number
+        reason: why this is a finding
+  false_positives:
+    count: int
+    reasons:
+      - known_benign_pattern
+      - test_code
+      - excluded_by_validation
+  noise:
+    count: int
+    reasons:
+      - not_in_scope
+      - duplicate_findings
+```
+### Phase 4: Validate
+For each potential finding:
+1. Cross-reference with other data sources
+2. Check git history for context
+3. Verify with secondary method
+4. Assess exploitability
+```yaml
+validation:
+  method_1:
+    name: secondary_source_check
+    result: [confirmed|suspected|false_positive]
+    evidence: string
+  method_2:
+    name: git_history_check
+    result: [confirmed|suspected|false_positive]
+    evidence: string
+  method_3:
+    name: exploitability_assessment
+    result: [confirmed|suspected|false_positive]
+    evidence: string
+```
+### Phase 5: Scope and Document
+Document findings with:
+- Exact location (file:line)
+- Evidence (code snippet, pattern match)
+- Confidence level
+- Validation results
+- Recommended action
+## Hunt Report Format
+```
+Hunt Report: [HUNT-ID]
+==============
+Hypothesis: [what we tested]
+Hunt Date: [timestamp]
+Hypothesis: [technique/pattern]
+## Executive Summary
+- Total Findings: [N]
+- Critical: [N] | High: [N] | Medium: [N] | Low: [N]
+- Files Affected: [N]
+- Confidence: [Overall assessment]
+## Data Sources Searched
+- [source 1]: [findings count]
+- [source 2]: [findings count]
+## Findings
+### [Finding 1] - [Severity]
+Location: [file:line]
+Evidence:
+```
+[code snippet]
+```
+Validated: [YES/NO - how]
+Recommendation: [action]
+### [Finding 2]...
+## False Positives
+- [why certain matches were dismissed]
+## Next Actions
+- [ ] Investigate further: [specific items]
+- [ ] Remediate: [specific items]
+- [ ] Monitor: [specific items]
+## Conclusion
+[Overall assessment of hunt results]
+```
+## Hunt Status Tracking
+```yaml
+hunt_status:
+  planned:
+    - id: string
+      hypothesis: string
+      planned_date: date
+  running:
+    - id: string
+      start_time: timestamp
+      current_phase: [form|locate|query|analyze|validate|report]
+      findings_count: int
+  completed:
+    - id: string
+      end_time: timestamp
+      outcome: [findings_confirmed|no_findings|false_positive]
+      report_path: string
+```
+## Anti-Patterns
+- **Don't** hunt without clear hypothesis (scattershot searching)
+- **Don't** skip data source identification (missing coverage)
+- **Don't** skip validation (false positive flood)
+- **Don't** skip false positive documentation (repeating mistakes)
+- **Don't** report without confidence level (misleads stakeholders)
+## Tools
+| Tool | Purpose |
+|------|---------|
+| `rg` (ripgrep) | Pattern search in files |
+| `git log` | History investigation |
+| `semgrep` | AST-based pattern matching |
+| `grep` | Binary/encoded string search |
+| `jq` | JSON log analysis |
+## Verification
+For hunting framework changes:
+```bash
+cd pi-crew
+npx tsc --noEmit
+node --experimental-strip-types --test test/unit/hunting-patterns.test.ts
+```
+*See also: `threat-hypothesis-framework` for structured hypothesis creation, `read-only-explorer` for exploration fundamentals.*