npm - pi-planning-with-files - Versions diffs - 1.0.0 - Mend

pi-planning-with-files 1.0.0

This diff shows the content of publicly released package versions as they appear in the supported public registries. It is provided for informational purposes only.

Files changed (13)

package/README.md +81 -0
package/SKILL.md +204 -0
package/examples.md +202 -0
package/package.json +35 -0
package/reference.md +218 -0
package/scripts/check-complete.ps1 +42 -0
package/scripts/check-complete.sh +44 -0
package/scripts/init-session.ps1 +120 -0
package/scripts/init-session.sh +120 -0
package/scripts/session-catchup.py +288 -0
package/templates/findings.md +95 -0
package/templates/progress.md +114 -0
package/templates/task_plan.md +132 -0

package/reference.md ADDED

@@ -0,0 +1,218 @@
+# Reference: Manus Context Engineering Principles
+This skill is based on context engineering principles from Manus, the AI agent company acquired by Meta for $2 billion in December 2025.
+## The 6 Manus Principles
+### Principle 1: Design Around KV-Cache
+> "KV-cache hit rate is the single most important metric for a production-stage AI agent."
+**Statistics:**
+- ~100:1 input-to-output token ratio
+- Cached tokens: $0.30/MTok vs Uncached: $3/MTok
+- 10x cost difference!
+**Implementation:**
+- Keep prompt prefixes STABLE (single-token change invalidates cache)
+- NO timestamps in system prompts
+- Make context APPEND-ONLY with deterministic serialization
+### Principle 2: Mask, Don't Remove
+Don't dynamically remove tools (breaks KV-cache). Use logit masking instead.
+**Best Practice:** Use consistent action prefixes (e.g., `browser_`, `shell_`, `file_`) for easier masking.
+### Principle 3: Filesystem as External Memory
+> "Markdown is my 'working memory' on disk."
+**The Formula:**
+```
+Context Window = RAM (volatile, limited)
+Filesystem = Disk (persistent, unlimited)
+```
+**Compression Must Be Restorable:**
+- Keep URLs even if web content is dropped
+- Keep file paths when dropping document contents
+- Never lose the pointer to full data
+### Principle 4: Manipulate Attention Through Recitation
+> "Creates and updates todo.md throughout tasks to push global plan into model's recent attention span."
+**Problem:** After ~50 tool calls, models forget original goals ("lost in the middle" effect).
+**Solution:** Re-read `task_plan.md` before each decision. Goals appear in the attention window.
+```
+Start of context: [Original goal - far away, forgotten]
+...many tool calls...
+End of context: [Recently read task_plan.md - gets ATTENTION!]
+```
+### Principle 5: Keep the Wrong Stuff In
+> "Leave the wrong turns in the context."
+**Why:**
+- Failed actions with stack traces let model implicitly update beliefs
+- Reduces mistake repetition
+- Error recovery is "one of the clearest signals of TRUE agentic behavior"
+### Principle 6: Don't Get Few-Shotted
+> "Uniformity breeds fragility."
+**Problem:** Repetitive action-observation pairs cause drift and hallucination.
+**Solution:** Introduce controlled variation:
+- Vary phrasings slightly
+- Don't copy-paste patterns blindly
+- Recalibrate on repetitive tasks
+---
+## The 3 Context Engineering Strategies
+Based on Lance Martin's analysis of Manus architecture.
+### Strategy 1: Context Reduction
+**Compaction:**
+```
+Tool calls have TWO representations:
+├── FULL: Raw tool content (stored in filesystem)
+└── COMPACT: Reference/file path only
+RULES:
+- Apply compaction to STALE (older) tool results
+- Keep RECENT results FULL (to guide next decision)
+```
+**Summarization:**
+- Applied when compaction reaches diminishing returns
+- Generated using full tool results
+- Creates standardized summary objects
+### Strategy 2: Context Isolation (Multi-Agent)
+**Architecture:**
+```
+┌─────────────────────────────────┐
+│         PLANNER AGENT           │
+│  └─ Assigns tasks to sub-agents │
+├─────────────────────────────────┤
+│       KNOWLEDGE MANAGER         │
+│  └─ Reviews conversations       │
+│  └─ Determines filesystem store │
+├─────────────────────────────────┤
+│      EXECUTOR SUB-AGENTS        │
+│  └─ Perform assigned tasks      │
+│  └─ Have own context windows    │
+└─────────────────────────────────┘
+```
+**Key Insight:** Manus originally used `todo.md` for task planning but found that ~33% of actions were spent updating it, so it shifted to a dedicated planner agent that calls executor sub-agents.
+### Strategy 3: Context Offloading
+**Tool Design:**
+- Use <20 atomic functions total
+- Store full results in filesystem, not context
+- Use `glob` and `grep` for searching
+- Progressive disclosure: load information only as needed
+---
+## The Agent Loop
+Manus operates in a continuous 7-step loop:
+```
+┌─────────────────────────────────────────┐
+│  1. ANALYZE CONTEXT                      │
+│     - Understand user intent             │
+│     - Assess current state               │
+│     - Review recent observations         │
+├─────────────────────────────────────────┤
+│  2. THINK                                │
+│     - Should I update the plan?          │
+│     - What's the next logical action?    │
+│     - Are there blockers?                │
+├─────────────────────────────────────────┤
+│  3. SELECT TOOL                          │
+│     - Choose ONE tool                    │
+│     - Ensure parameters available        │
+├─────────────────────────────────────────┤
+│  4. EXECUTE ACTION                       │
+│     - Tool runs in sandbox               │
+├─────────────────────────────────────────┤
+│  5. RECEIVE OBSERVATION                  │
+│     - Result appended to context         │
+├─────────────────────────────────────────┤
+│  6. ITERATE                              │
+│     - Return to step 1                   │
+│     - Continue until complete            │
+├─────────────────────────────────────────┤
+│  7. DELIVER OUTCOME                      │
+│     - Send results to user               │
+│     - Attach all relevant files          │
+└─────────────────────────────────────────┘
+```
+---
+## File Types Manus Creates
+| File | Purpose | When Created | When Updated |
+|------|---------|--------------|--------------|
+| `task_plan.md` | Phase tracking, progress | Task start | After completing phases |
+| `findings.md` | Discoveries, decisions | After ANY discovery | After viewing images/PDFs |
+| `progress.md` | Session log, what's done | At breakpoints | Throughout session |
+| Code files | Implementation | Before execution | After errors |
+---
+## Critical Constraints
+- **Single-Action Execution:** ONE tool call per turn. No parallel execution.
+- **Plan is Required:** Agent must ALWAYS know: goal, current phase, remaining phases
+- **Files are Memory:** Context = volatile. Filesystem = persistent.
+- **Never Repeat Failures:** If action failed, next action MUST be different
+- **Communication is a Tool:** Message types: `info` (progress), `ask` (blocking), `result` (terminal)
+---
+## Manus Statistics
+| Metric | Value |
+|--------|-------|
+| Average tool calls per task | ~50 |
+| Input-to-output token ratio | 100:1 |
+| Acquisition price | $2 billion |
+| Time to $100M revenue | 8 months |
+| Framework refactors since launch | 5 times |
+---
+## Key Quotes
+> "Context window = RAM (volatile, limited). Filesystem = Disk (persistent, unlimited). Anything important gets written to disk."
+> "if action_failed: next_action != same_action. Track what you tried. Mutate the approach."
+> "Error recovery is one of the clearest signals of TRUE agentic behavior."
+> "KV-cache hit rate is the single most important metric for a production-stage AI agent."
+> "Leave the wrong turns in the context."
+---
+## Source
+Based on Manus's official context engineering documentation:
+https://manus.im/blog/Context-Engineering-for-AI-Agents-Lessons-from-Building-Manus
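
The compaction rule under Strategy 1 in `reference.md` (stale tool results shrink to a file path, recent ones stay full) can be sketched in Python. This is a minimal illustration and not code from the package: `ToolResult`, `compact`, and the `keep_recent` parameter are hypothetical names, and the on-disk file naming is an assumption.

```python
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import List, Optional


@dataclass
class ToolResult:
    """One tool observation; `path` is set once the full content is on disk."""
    name: str
    content: str
    path: Optional[Path] = None


def compact(results: List[ToolResult], workdir: Path, keep_recent: int = 2) -> List[str]:
    """Build context entries: stale results become path references, recent ones stay full."""
    entries = []
    cutoff = len(results) - keep_recent  # indices below this are treated as stale
    for index, result in enumerate(results):
        if index < cutoff:
            # FULL representation goes to the filesystem...
            result.path = workdir / f"{index:04d}_{result.name}.txt"
            result.path.write_text(result.content, encoding="utf-8")
            # ...and only the COMPACT pointer stays in context, so it is restorable.
            entries.append(f"[{result.name}] full output stored at {result.path}")
        else:
            entries.append(f"[{result.name}] {result.content}")
    return entries


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        history = [
            ToolResult("shell_ls", "a.txt\nb.txt"),
            ToolResult("file_read", "contents of a.txt"),
            ToolResult("browser_open", "<html>...</html>"),
        ]
        for line in compact(history, Path(tmp), keep_recent=1):
            print(line)
```

Because the pointer to the full data is kept in every compacted entry, nothing is lost: the agent can re-read the file when an old result becomes relevant again.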

package/scripts/check-complete.ps1 ADDED

@@ -0,0 +1,42 @@
+# Check if all phases in task_plan.md are complete
+# Exit 0 if complete, exit 1 if incomplete
+# Used by Stop hook to verify task completion
+param(
+    [string]$PlanFile = "task_plan.md"
+)
+if (-not (Test-Path $PlanFile)) {
+    Write-Host "ERROR: $PlanFile not found"
+    Write-Host "Cannot verify completion without a task plan."
+    exit 1
+}
+Write-Host "=== Task Completion Check ==="
+Write-Host ""
+# Read file content
+$content = Get-Content $PlanFile -Raw
+# Count phases by status
+$TOTAL = ([regex]::Matches($content, "### Phase")).Count
+$COMPLETE = ([regex]::Matches($content, "\*\*Status:\*\* complete")).Count
+$IN_PROGRESS = ([regex]::Matches($content, "\*\*Status:\*\* in_progress")).Count
+$PENDING = ([regex]::Matches($content, "\*\*Status:\*\* pending")).Count
+Write-Host "Total phases:   $TOTAL"
+Write-Host "Complete:       $COMPLETE"
+Write-Host "In progress:    $IN_PROGRESS"
+Write-Host "Pending:        $PENDING"
+Write-Host ""
+# Check completion
+if ($COMPLETE -eq $TOTAL -and $TOTAL -gt 0) {
+    Write-Host "ALL PHASES COMPLETE"
+    exit 0
+} else {
+    Write-Host "TASK NOT COMPLETE"
+    Write-Host ""
+    Write-Host "Do not stop until all phases are complete."
+    exit 1
+}

package/scripts/check-complete.sh ADDED

@@ -0,0 +1,44 @@
+#!/bin/bash
+# Check if all phases in task_plan.md are complete
+# Exit 0 if complete, exit 1 if incomplete
+# Used by Stop hook to verify task completion
+PLAN_FILE="${1:-task_plan.md}"
+if [ ! -f "$PLAN_FILE" ]; then
+    echo "ERROR: $PLAN_FILE not found"
+    echo "Cannot verify completion without a task plan."
+    exit 1
+fi
+echo "=== Task Completion Check ==="
+echo ""
+# Count phases by status (using -F for fixed string matching)
+TOTAL=$(grep -cF "### Phase" "$PLAN_FILE" || true)
+COMPLETE=$(grep -cF "**Status:** complete" "$PLAN_FILE" || true)
+IN_PROGRESS=$(grep -cF "**Status:** in_progress" "$PLAN_FILE" || true)
+PENDING=$(grep -cF "**Status:** pending" "$PLAN_FILE" || true)
+# Default to 0 if empty
+: "${TOTAL:=0}"
+: "${COMPLETE:=0}"
+: "${IN_PROGRESS:=0}"
+: "${PENDING:=0}"
+echo "Total phases:   $TOTAL"
+echo "Complete:       $COMPLETE"
+echo "In progress:    $IN_PROGRESS"
+echo "Pending:        $PENDING"
+echo ""
+# Check completion
+if [ "$COMPLETE" -eq "$TOTAL" ] && [ "$TOTAL" -gt 0 ]; then
+    echo "ALL PHASES COMPLETE"
+    exit 0
+else
+    echo "TASK NOT COMPLETE"
+    echo ""
+    echo "Do not stop until all phases are complete."
+    exit 1
+fi
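
The completion rule shared by `check-complete.ps1` and `check-complete.sh` (every `### Phase` heading must have a matching `**Status:** complete` marker, and there must be at least one phase) can be restated as a small Python sketch. It is not part of the package; `phase_counts` and `is_complete` are hypothetical names. Like the PowerShell script it counts occurrences, whereas `grep -c` counts matching lines; the two agree as long as each marker appears at most once per line.

```python
from typing import Dict


def phase_counts(plan_text: str) -> Dict[str, int]:
    """Count phase headings and status markers in task_plan.md-style text."""
    return {
        "total": plan_text.count("### Phase"),
        "complete": plan_text.count("**Status:** complete"),
        "in_progress": plan_text.count("**Status:** in_progress"),
        "pending": plan_text.count("**Status:** pending"),
    }


def is_complete(plan_text: str) -> bool:
    """True only when there is at least one phase and all phases are complete."""
    counts = phase_counts(plan_text)
    return counts["total"] > 0 and counts["complete"] == counts["total"]


if __name__ == "__main__":
    plan = (
        "### Phase 1: Requirements & Discovery\n"
        "- **Status:** complete\n"
        "### Phase 2: Planning & Structure\n"
        "- **Status:** in_progress\n"
    )
    print(phase_counts(plan))
    print("complete" if is_complete(plan) else "not complete")
```

The `total > 0` guard matters: without it, an empty or malformed plan with zero phases would satisfy `complete == total` and the Stop hook would let the agent finish.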

package/scripts/init-session.ps1 ADDED

@@ -0,0 +1,120 @@
+# Initialize planning files for a new session
+# Usage: .\init-session.ps1 [project-name]
+param(
+    [string]$ProjectName = "project"
+)
+$DATE = Get-Date -Format "yyyy-MM-dd"
+Write-Host "Initializing planning files for: $ProjectName"
+# Create task_plan.md if it doesn't exist
+if (-not (Test-Path "task_plan.md")) {
+    @"
+# Task Plan: [Brief Description]
+## Goal
+[One sentence describing the end state]
+## Current Phase
+Phase 1
+## Phases
+### Phase 1: Requirements & Discovery
+- [ ] Understand user intent
+- [ ] Identify constraints
+- [ ] Document in findings.md
+- **Status:** in_progress
+### Phase 2: Planning & Structure
+- [ ] Define approach
+- [ ] Create project structure
+- **Status:** pending
+### Phase 3: Implementation
+- [ ] Execute the plan
+- [ ] Write to files before executing
+- **Status:** pending
+### Phase 4: Testing & Verification
+- [ ] Verify requirements met
+- [ ] Document test results
+- **Status:** pending
+### Phase 5: Delivery
+- [ ] Review outputs
+- [ ] Deliver to user
+- **Status:** pending
+## Decisions Made
+| Decision | Rationale |
+|----------|-----------|
+## Errors Encountered
+| Error | Resolution |
+|-------|------------|
+"@ | Out-File -FilePath "task_plan.md" -Encoding UTF8
+    Write-Host "Created task_plan.md"
+} else {
+    Write-Host "task_plan.md already exists, skipping"
+}
+# Create findings.md if it doesn't exist
+if (-not (Test-Path "findings.md")) {
+    @"
+# Findings & Decisions
+## Requirements
+-
+## Research Findings
+-
+## Technical Decisions
+| Decision | Rationale |
+|----------|-----------|
+## Issues Encountered
+| Issue | Resolution |
+|-------|------------|
+## Resources
+-
+"@ | Out-File -FilePath "findings.md" -Encoding UTF8
+    Write-Host "Created findings.md"
+} else {
+    Write-Host "findings.md already exists, skipping"
+}
+# Create progress.md if it doesn't exist
+if (-not (Test-Path "progress.md")) {
+    @"
+# Progress Log
+## Session: $DATE
+### Current Status
+- **Phase:** 1 - Requirements & Discovery
+- **Started:** $DATE
+### Actions Taken
+-
+### Test Results
+| Test | Expected | Actual | Status |
+|------|----------|--------|--------|
+### Errors
+| Error | Resolution |
+|-------|------------|
+"@ | Out-File -FilePath "progress.md" -Encoding UTF8
+    Write-Host "Created progress.md"
+} else {
+    Write-Host "progress.md already exists, skipping"
+}
+Write-Host ""
+Write-Host "Planning files initialized!"
+Write-Host "Files: task_plan.md, findings.md, progress.md"

package/scripts/init-session.sh ADDED

@@ -0,0 +1,120 @@
+#!/bin/bash
+# Initialize planning files for a new session
+# Usage: ./init-session.sh [project-name]
+set -e
+PROJECT_NAME="${1:-project}"
+DATE=$(date +%Y-%m-%d)
+echo "Initializing planning files for: $PROJECT_NAME"
+# Create task_plan.md if it doesn't exist
+if [ ! -f "task_plan.md" ]; then
+    cat > task_plan.md << 'EOF'
+# Task Plan: [Brief Description]
+## Goal
+[One sentence describing the end state]
+## Current Phase
+Phase 1
+## Phases
+### Phase 1: Requirements & Discovery
+- [ ] Understand user intent
+- [ ] Identify constraints
+- [ ] Document in findings.md
+- **Status:** in_progress
+### Phase 2: Planning & Structure
+- [ ] Define approach
+- [ ] Create project structure
+- **Status:** pending
+### Phase 3: Implementation
+- [ ] Execute the plan
+- [ ] Write to files before executing
+- **Status:** pending
+### Phase 4: Testing & Verification
+- [ ] Verify requirements met
+- [ ] Document test results
+- **Status:** pending
+### Phase 5: Delivery
+- [ ] Review outputs
+- [ ] Deliver to user
+- **Status:** pending
+## Decisions Made
+| Decision | Rationale |
+|----------|-----------|
+## Errors Encountered
+| Error | Resolution |
+|-------|------------|
+EOF
+    echo "Created task_plan.md"
+else
+    echo "task_plan.md already exists, skipping"
+fi
+# Create findings.md if it doesn't exist
+if [ ! -f "findings.md" ]; then
+    cat > findings.md << 'EOF'
+# Findings & Decisions
+## Requirements
+-
+## Research Findings
+-
+## Technical Decisions
+| Decision | Rationale |
+|----------|-----------|
+## Issues Encountered
+| Issue | Resolution |
+|-------|------------|
+## Resources
+-
+EOF
+    echo "Created findings.md"
+else
+    echo "findings.md already exists, skipping"
+fi
+# Create progress.md if it doesn't exist
+if [ ! -f "progress.md" ]; then
+    cat > progress.md << EOF
+# Progress Log
+## Session: $DATE
+### Current Status
+- **Phase:** 1 - Requirements & Discovery
+- **Started:** $DATE
+### Actions Taken
+-
+### Test Results
+| Test | Expected | Actual | Status |
+|------|----------|--------|--------|
+### Errors
+| Error | Resolution |
+|-------|------------|
+EOF
+    echo "Created progress.md"
+else
+    echo "progress.md already exists, skipping"
+fi
+echo ""
+echo "Planning files initialized!"
+echo "Files: task_plan.md, findings.md, progress.md"
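
Both `init-session` scripts follow the same pattern: create each planning file only if it is missing, and stamp the current date into `progress.md` (in the Bash version, only that heredoc uses an unquoted `EOF`, so only there does `$DATE` expand). A minimal Python sketch of the pattern follows. It is not part of the package; `create_if_missing`, `init_progress`, and the shortened template are hypothetical.

```python
from datetime import date
from pathlib import Path
from typing import Optional

# Shortened stand-in for the progress.md template written by the scripts.
PROGRESS_TEMPLATE = """# Progress Log
## Session: {date}
### Current Status
- **Phase:** 1 - Requirements & Discovery
- **Started:** {date}
"""


def create_if_missing(path: Path, text: str) -> bool:
    """Write `text` to `path` only when the file does not exist; return True if created."""
    if path.exists():
        return False  # never overwrite an existing planning file
    path.write_text(text, encoding="utf-8")
    return True


def init_progress(workdir: Path, today: Optional[date] = None) -> bool:
    """Create progress.md in `workdir`, stamped with an ISO date (YYYY-MM-DD)."""
    stamp = (today or date.today()).isoformat()
    return create_if_missing(workdir / "progress.md", PROGRESS_TEMPLATE.format(date=stamp))


if __name__ == "__main__":
    created = init_progress(Path("."))
    print("Created progress.md" if created else "progress.md already exists, skipping")
```

Skipping existing files makes initialization safe to re-run mid-session: notes already written to `progress.md` survive a second invocation.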