npm - pi-agent-extensions - Versions diffs - 0.1.0 - Mend

pi-agent-extensions 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29) hide show

package/LICENSE +21 -0
package/README.md +181 -0
package/docs/README.md +32 -0
package/docs/dev/ask-user/test-cases.md +304 -0
package/docs/dev/handoff/eval-strategy.md +455 -0
package/docs/dev/handoff/implementation-log.md +330 -0
package/docs/dev/handoff/spec.md +567 -0
package/docs/extensions/ask-user.md +644 -0
package/docs/extensions/handoff.md +195 -0
package/docs/extensions/sessions.md +34 -0
package/docs/guides/manual-testing.md +98 -0
package/docs/guides/vertex-ai-setup.md +135 -0
package/extensions/ask-user/README.md +125 -0
package/extensions/ask-user/index.ts +103 -0
package/extensions/ask-user/modes/print.ts +62 -0
package/extensions/ask-user/tool.ts +121 -0
package/extensions/ask-user/types.ts +74 -0
package/extensions/ask-user/ui/index.ts +262 -0
package/extensions/handoff/config.ts +141 -0
package/extensions/handoff/extraction.ts +153 -0
package/extensions/handoff/index.ts +534 -0
package/extensions/handoff/metadata.ts +155 -0
package/extensions/handoff/parser.ts +180 -0
package/extensions/handoff/progress.ts +131 -0
package/extensions/handoff/prompt.ts +139 -0
package/extensions/handoff/types.ts +115 -0
package/extensions/sessions/index.ts +228 -0
package/extensions/sessions/sessions.ts +74 -0
package/package.json +51 -0

package/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

package/README.md ADDED Viewed

@@ -0,0 +1,181 @@
+# pi-agent-extensions
+A collection of extensions for the [pi coding agent](https://github.com/badlogic/pi-mono/tree/main/packages/coding-agent).
+## Extensions
+| Extension | Type | Description | Status |
+|-----------|------|-------------|--------|
+| **sessions** | Command | Quick session picker with `/sessions` command | ✅ Stable |
+| **ask_user** | Tool | LLM can ask structured questions with options | ⚙️ Beta (v0.1.0) |
+| **handoff** | Command | Transfer context to a new focused session with `/handoff` | ✅ Stable |
+## Install
+### From Source (Until Published)
+```bash
+# Clone the repository
+git clone https://github.com/jayshah5696/pi-agent-extensions.git
+cd pi-agent-extensions
+# Install globally
+pi install .
+# Or install to specific project
+cd ~/your-project
+pi install -l /path/to/pi-agent-extensions
+```
+### Quick Test Without Installing
+```bash
+pi -e /path/to/pi-agent-extensions/extensions/sessions/index.ts \
+   -e /path/to/pi-agent-extensions/extensions/ask-user/index.ts \
+   -e /path/to/pi-agent-extensions/extensions/handoff/index.ts
+```
+### From npm (When Published)
+```bash
+pi install npm:pi-agent-extensions
+```
+Both extensions will be available immediately after installation.
+## Verify Installation
+After installing, start pi and look for the startup message:
+```
+Extensions: sessions, ask_user, handoff
+```
+**Test sessions:**
+```bash
+pi
+/sessions
+```
+**Test ask_user:**
+```bash
+pi
+> Ask me which database I prefer: PostgreSQL or SQLite
+```
+The LLM should call the `ask_user` tool and show you options to select.
+**Test handoff:**
+```bash
+pi
+# Have a conversation first, then:
+/handoff implement the next feature with proper tests
+```
+You'll see a loader while context is extracted, then an editor to review the handoff prompt.
+## Uninstall
+```bash
+pi remove pi-agent-extensions
+```
+## Extensions
+### Sessions
+Quick session picker for the pi coding agent. Provides a compact `/sessions` selector (default 5 visible rows) with arrow navigation, Enter to switch, and Esc to cancel.
+**Usage:**
+```bash
+/sessions       # Show last 5 sessions
+/sessions 10    # Show last 10 sessions
+```
+**Features:**
+- Lists sessions from the **current project** only
+- Displays absolute timestamps (`YYYY-MM-DD HH:mm`)
+- Filter by typing (prefix match on session name or cwd)
+- In non-UI mode (`pi -p` or JSON/RPC), sessions are printed to stdout
+See [docs/extensions/sessions.md](docs/extensions/sessions.md) for details.
+### Ask User
+The LLM can call the `ask_user` tool to gather user input with structured questions and options.
+**Status:** ⚙️ Beta (v0.1.0) - Core features working, enhanced UI coming soon
+**Example:**
+```typescript
+ask_user({
+  questions: [{
+    question: "Which database should we use?",
+    header: "Database Selection",
+    options: [
+      { label: "PostgreSQL (Recommended)", description: "Battle-tested relational DB" },
+      { label: "SQLite", description: "Lightweight, file-based" },
+      { label: "MongoDB", description: "Document store" }
+    ]
+  }]
+})
+```
+**Features:**
+- ✅ Text input questions
+- ✅ Option selection with descriptions
+- ✅ "Other" option always available
+- ✅ Print mode (pending file workflow)
+- ✅ Session persistence
+- ⏸️ Custom TUI components (using built-in helpers for now)
+- ⏸️ Tabbed multi-question UI (sequential currently)
+See [extensions/ask-user/README.md](extensions/ask-user/README.md) and [docs/extensions/ask-user.md](docs/extensions/ask-user.md) for details.
+### Handoff
+Transfer context to a new focused session. Unlike `/compact` which summarizes everything, `/handoff` extracts only what's relevant to your next goal.
+**Usage:**
+```bash
+/handoff <goal>
+```
+**Examples:**
+```bash
+/handoff implement team-level handoff with proper tests
+/handoff fix the authentication bug in login flow
+/handoff add unit tests for the parser module
+```
+**Features:**
+- Goal-driven context extraction (files, commands, decisions, open questions)
+- Structured JSON extraction with LLM
+- Skill inheritance (preserves last `/skill:` used)
+- Git metadata (branch, dirty state)
+- Session metadata (model, tools, thinking level)
+- Interactive editor to review/edit before creating new session
+- Configurable via `.pi/settings.json`
+**What gets extracted:**
+- Relevant files with reasons
+- Commands that were run
+- Key context and decisions
+- Open questions/risks
+See [docs/extensions/handoff.md](docs/extensions/handoff.md) for full documentation.
+## Development
+```bash
+npm install
+npm test
+```
+## License
+MIT

package/docs/README.md ADDED Viewed

@@ -0,0 +1,32 @@
+# Documentation
+## Extensions
+User-facing documentation for each extension:
+| Extension | Description |
+|-----------|-------------|
+| [sessions](extensions/sessions.md) | Quick session picker with `/sessions` command |
+| [ask-user](extensions/ask-user.md) | LLM tool for structured user questions |
+| [handoff](extensions/handoff.md) | Context transfer to new focused sessions |
+## Guides
+Setup and usage guides:
+| Guide | Description |
+|-------|-------------|
+| [Manual Testing](guides/manual-testing.md) | How to manually test the extensions |
+| [Vertex AI Setup](guides/vertex-ai-setup.md) | Using Pi with Google Cloud Vertex AI |
+## Development
+Internal documentation for contributors:
+### Handoff
+- [Specification](dev/handoff/spec.md) - Full design spec and rationale
+- [Implementation Log](dev/handoff/implementation-log.md) - Development history
+- [Eval Strategy](dev/handoff/eval-strategy.md) - Testing and evaluation approach
+### Ask User
+- [Test Cases](dev/ask-user/test-cases.md) - Comprehensive test coverage plan

package/docs/dev/ask-user/test-cases.md ADDED Viewed

@@ -0,0 +1,304 @@
+# ask_user Test Cases
+**Purpose:** Define comprehensive test coverage before implementation
+**Status:** Draft - Awaiting approval
+---
+## Test Categories
+### 1. Schema Validation
+### 2. Interactive Mode (TUI)
+### 3. Non-Interactive Mode (Print)
+### 4. RPC Mode
+### 5. Session Persistence
+### 6. Edge Cases
+### 7. Integration
+---
+## 1. Schema Validation Tests
+### 1.1 Valid Parameters
+- ✓ Single text question (no options)
+- ✓ Single question with options
+- ✓ Single question with options + descriptions
+- ✓ Single question with header
+- ✓ Multiple questions
+- ✓ Question with multiSelect: true
+- ✓ Question with metadata
+### 1.2 Invalid Parameters
+- ✗ Empty questions array
+- ✗ Question with empty string
+- ✗ Option with empty label
+- ✗ Invalid multiSelect value (not boolean)
+- ✗ Malformed option (missing required fields)
+**Test File:** `tests/ask-user/schema.test.ts`
+---
+## 2. Interactive Mode (TUI) Tests
+### 2.1 Single Text Question
+**Setup:** Question with no options
+**Expected:**
+- Shows text input replacing editor
+- Header displayed if provided
+- User types answer
+- Enter submits → returns `{ answered: true, answers: [{answer: "typed text", wasCustom: true}] }`
+- Esc cancels → returns `{ answered: false, cancelled: true }`
+### 2.2 Single Select Question
+**Setup:** Question with 3 options
+**Expected:**
+- Shows numbered options (1, 2, 3, 4. Other)
+- Up/Down navigation works
+- Number keys (1-3) select directly
+- Enter on option returns selected
+- "Other" option allows text input
+- Descriptions shown below labels
+### 2.3 Multi-Select Question
+**Setup:** Question with `multiSelect: true`
+**Expected:**
+- Shows checkboxes `[ ]` and `[x]`
+- Space toggles selection
+- Multiple selections allowed
+- Enter submits all selected
+- Returns `answer` as string array
+### 2.4 Multiple Questions (Tabbed)
+**Setup:** 3 questions in array
+**Expected:**
+- Shows tab bar with question indicators (□ = unanswered, ■ = answered)
+- Tab/Arrow keys navigate between questions
+- Each question shows correctly
+- Submit tab appears after all questions
+- Can only submit when all answered
+- Esc shows confirmation if any answered
+### 2.5 Long Options List
+**Setup:** Question with 15 options
+**Expected:**
+- Scrollable list with "↓ N more..." indicator
+- Number keys 1-9 work for first 9 options
+- Scrolling reveals all options
+- "Other" is always last (accessible via 0 or scroll)
+### 2.6 Answer Length Warning
+**Setup:** User types 2500 character answer
+**Expected:**
+- Warning shows: "Answer is long (2,500 chars). Continue? [Y/n]"
+- Y allows submission
+- N returns to editing
+### 2.7 Cancellation Behavior
+**Setup:** Multi-question, user answers 2 of 3, presses Esc
+**Expected:**
+- Confirmation dialog: "Discard 2 answers? [Y/n]"
+- Y cancels all, returns `{ answered: false, cancelled: true }`
+- N returns to questions
+**Test File:** `tests/ask-user/interactive.test.ts` (requires TUI mocking)
+---
+## 3. Non-Interactive Mode (Print) Tests
+### 3.1 Pending File Creation
+**Setup:** Call ask_user in print mode (`pi -p`)
+**Expected:**
+- Creates `.pi/pending-questions.json`
+- Returns message with instructions
+- File contains: sessionId, timestamp, questions with answer: null
+- Tool result has `{ answered: false, pendingFile: ".pi/pending-questions.json" }`
+### 3.2 Natural Language Answer Parsing
+**Setup:** Pending file exists, run `pi -p @.pi/pending-questions.json "postgres and call it api-service"`
+**Expected:**
+- LLM parses natural language
+- Fills in answers correctly
+- Deletes pending file
+- Returns `{ answered: true, answers: [...] }`
+### 3.3 JSON Direct Edit
+**Setup:** User edits pending JSON, adds answers, runs `pi -c`
+**Expected:**
+- Reads answers from JSON
+- Validates format
+- Returns answers to LLM
+- Deletes pending file
+### 3.4 Inline Answers Flag
+**Setup:** `pi -p --answers '["PostgreSQL", "api-service"]'`
+**Expected:**
+- Parses JSON array
+- Matches to questions in order
+- Returns answers
+- No pending file created
+**Test File:** `tests/ask-user/print-mode.test.ts`
+---
+## 4. RPC Mode Tests
+### 4.1 Request Format
+**Setup:** ask_user called in RPC mode
+**Expected:**
+- Returns structured JSON with type: "ask_user_request"
+- Includes requestId, questions, metadata
+### 4.2 Response Handling
+**Setup:** RPC client sends ask_user_response
+**Expected:**
+- Validates response format
+- Matches requestId
+- Returns answers to LLM
+### 4.3 Client Disconnection
+**Setup:** Request sent, client disconnects before responding
+**Expected:**
+- Timeout after N seconds (configurable)
+- Returns `{ answered: false, connectionLost: true }`
+**Test File:** `tests/ask-user/rpc-mode.test.ts`
+---
+## 5. Session Persistence Tests
+### 5.1 Tool Result Storage
+**Setup:** User answers questions
+**Expected:**
+- Tool result stored in session with details:
+  - questions array
+  - answers array
+  - answeredAt timestamp
+  - mode: "interactive" | "print" | "rpc"
+  - metadata (if provided)
+### 5.2 Session Branching
+**Setup:** Navigate to before ask_user, continue from there
+**Expected:**
+- ask_user replays (no answer yet)
+- User can provide different answer
+- New branch created with new answer
+### 5.3 Session Resumption
+**Setup:** `pi -c` after answering questions
+**Expected:**
+- Previous Q&A visible in session history
+- Can reference answers later
+**Test File:** `tests/ask-user/persistence.test.ts`
+---
+## 6. Edge Cases Tests
+### 6.1 Session Interruption
+**Setup:** Terminal closed mid-question
+**Expected:**
+- On `pi -c`, question replays from beginning
+- No partial state saved
+### 6.2 Empty Options Array
+**Setup:** Question with `options: []`
+**Expected:**
+- Treated as text input question
+- No "Other" option shown
+### 6.3 Single Option
+**Setup:** Question with 1 option
+**Expected:**
+- Shows option + "Other"
+- User can still select option or type custom
+### 6.4 Extremely Long Question Text
+**Setup:** Question text is 500 characters
+**Expected:**
+- Text wraps properly
+- UI remains usable
+### 6.5 Unicode/Emoji in Options
+**Setup:** Options contain emoji and Unicode
+**Expected:**
+- Renders correctly
+- Selectable without issues
+### 6.6 Duplicate Tool Call
+**Setup:** LLM calls ask_user twice in same turn
+**Expected:**
+- Both questions shown (batched or sequential - TBD)
+- OR second call returns warning about duplicate
+### 6.7 No UI Available (ctx.hasUI = false)
+**Setup:** Extension loaded in non-interactive environment
+**Expected:**
+- Returns error message
+- Does not attempt to show UI
+**Test File:** `tests/ask-user/edge-cases.test.ts`
+---
+## 7. Integration Tests
+### 7.1 Compaction with Q&A
+**Setup:** Session with ask_user calls, trigger compaction
+**Expected:**
+- Q&A included in summary
+- Structured data preserved
+### 7.2 Tool Rendering
+**Setup:** View session with ask_user calls
+**Expected:**
+- Tool call renders with question text
+- Tool result renders with answers
+- Custom rendering works
+### 7.3 With Other Tools
+**Setup:** LLM calls ask_user, then bash, then edit
+**Expected:**
+- No interference between tools
+- State isolated correctly
+**Test File:** `tests/ask-user/integration.test.ts`
+---
+## Test Implementation Strategy
+### Unit Tests (High Priority)
+- Schema validation
+- Answer parsing logic
+- Session persistence logic
+- Print mode file I/O
+### Integration Tests (Medium Priority)
+- Tool registration
+- RPC protocol
+- Compaction integration
+### UI Tests (Low Priority / Manual)
+- Interactive TUI flows
+- Visual rendering
+- Keyboard navigation
+**Note:** TUI tests require mocking or headless testing framework. May start with manual testing for UI, automated for logic.
+---
+## Questions for Approval
+1. **TUI Testing Approach:** Should we mock the TUI for automated tests, or do manual testing for UI?
+2. **Duplicate ask_user calls:** Should we batch them or handle sequentially?
+3. **RPC timeout:** What should the default timeout be? 30s? 60s? Configurable?
+4. **Test coverage target:** Aim for 80%? 90%?
+---
+**Status:** Ready for review and approval before implementation begins.