npm - @gotgenes/pi-subagents - Versions diffs - 7.8.0 → 7.8.1 - Mend

@gotgenes/pi-subagents 7.8.0 → 7.8.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

package/CHANGELOG.md +10 -0
package/docs/plans/0219-reduce-test-duplication-top-3-clone-families.md +162 -0
package/docs/retro/0218-push-sdk-boundary-in-settings.md +37 -0
package/docs/retro/0219-reduce-test-duplication-top-3-clone-families.md +36 -0
package/package.json +1 -1

package/CHANGELOG.md CHANGED Viewed

@@ -5,6 +5,16 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [7.8.1](https://github.com/gotgenes/pi-packages/compare/pi-subagents-v7.8.0...pi-subagents-v7.8.1) (2026-05-26)
+### Documentation
+* plan reduce test duplication — top 3 clone families ([#219](https://github.com/gotgenes/pi-packages/issues/219)) ([c941b1b](https://github.com/gotgenes/pi-packages/commit/c941b1b3c047f6895eb57da3291b75082a2b99a3))
+* **retro:** add planning stage notes for issue [#219](https://github.com/gotgenes/pi-packages/issues/219) ([5122f7c](https://github.com/gotgenes/pi-packages/commit/5122f7cd666873abbbb6b6880fffb1e751beb9b5))
+* **retro:** add retro notes for issue [#218](https://github.com/gotgenes/pi-packages/issues/218) ([ef9187b](https://github.com/gotgenes/pi-packages/commit/ef9187ba8521d10212bd992cbfcf3d853886938b))
+* **retro:** add TDD stage notes for issue [#219](https://github.com/gotgenes/pi-packages/issues/219) ([975f94e](https://github.com/gotgenes/pi-packages/commit/975f94e5e868310765029050490098b335a67e1e))
 ## [7.8.0](https://github.com/gotgenes/pi-packages/compare/pi-subagents-v7.7.0...pi-subagents-v7.8.0) (2026-05-26)

package/docs/plans/0219-reduce-test-duplication-top-3-clone-families.md ADDED Viewed

@@ -0,0 +1,162 @@
+---
+issue: 219
+issue_title: "Reduce test duplication — top 3 clone families (Phase 13, Step 6)"
+---
+# Reduce test duplication — top 3 clone families
+## Problem statement
+After Phase 12, three test files carry the heaviest remaining clone families in pi-subagents:
+1. `test/lifecycle/agent-manager.test.ts` (929 lines) — 16 clone groups, ~160 duplicated lines.
+   Repeated inline runner stubs, worktree stubs, and manager-lifecycle boilerplate.
+2. `test/conversation-viewer.test.ts` (307 lines) — 8 clone groups, ~91 duplicated lines.
+   Near-identical `ConversationViewer` construction in every test, plus repeated width-loop assertion patterns.
+3. `test/ui/agent-config-editor.test.ts` (471 lines) — 5 clone groups, ~42 duplicated lines.
+   Repeated `makeEditor()` + `makeMenuUI()` + `fileOps.findAgentFile.mockReturnValue(...)` setup.
+Total target: reduce test duplication by ~200 lines (from ~1,046 combined test-setup lines to < 850).
+## Goals
+- Extract shared setup and assertion helpers for the three target test files.
+- Reduce test duplication by ~200 lines without changing test semantics.
+- Follow the existing `test/helpers/` convention (factory + matching `.test.ts` file).
+## Non-goals
+- No production code changes.
+- No new test coverage — this is purely a refactoring of existing test infrastructure.
+- Not consolidating clone families in other test files beyond the top 3.
+- Not changing any assertion logic or test structure beyond replacing inline stubs with factory calls.
+## Background
+The project already has several shared test helpers in `test/helpers/`: `make-record.ts`, `mock-session.ts`, `ui-stubs.ts`, `runner-io.ts`, `stub-ctx.ts`, `make-deps.ts`.
+Each helper has a companion `.test.ts` file — this convention must be followed.
+Dependencies #214 (closure-to-class conversions) and #216 (startAgent decomposition) are both closed, so the production code these tests cover is stable.
+## Design overview
+### File 1: `agent-manager.test.ts` — extract to `test/helpers/manager-stubs.ts`
+Five clone families to extract:
+1. **Never-resolving runner** — `{ run: vi.fn().mockImplementation(() => new Promise(() => {})), resume: vi.fn() }` appears 5 times.
+   Extract as `createBlockingRunner(): AgentRunner`.
+2. **Session-creating runner** — runner that calls `opts.onSessionCreated?.(session)` and resolves.
+   Appears 5+ times with minor variations (some emit events through the session, some don't).
+   Extract as `createSessionRunner(session?: MockSession): AgentRunner` that calls `onSessionCreated` and returns a standard result.
+3. **Worktree stubs with path+branch** — `{ create: vi.fn().mockReturnValue({ path, branch }), cleanup: vi.fn(() => ({ hasChanges: false })), prune: vi.fn() }` appears 4 times identically, plus 1 variant with `create` returning `undefined`.
+   Extract as `createMockWorktrees(overrides?)`.
+4. **Standard run result shape** — `{ responseText: "done", session, aborted: false, steered: false }` is repeated in many runner factories.
+   Extract as `createRunResult(overrides?)`.
+5. **Gated runner** — uses `Promise.withResolvers` to control when the runner completes.
+   Appears 2 times.
+   Keep inline — too tightly coupled to individual test flow-control to generalize cleanly.
+Tests that construct custom runners with unique behavior (event-emitting runners in the `lifetimeUsage` and `compactionCount` tests) keep their inline stubs — those encode test-specific emission sequences that a shared factory would obscure.
+### File 2: `conversation-viewer.test.ts` — inline factory + assertion helper
+Two clone families to extract:
+1. **`ConversationViewer` construction** — 15 near-identical constructor calls with the same 8 fields.
+   Extract as an inline `createTestViewer(overrides?)` factory at the top of the test file.
+   The factory provides defaults for `tui`, `session`, `record`, `activity`, `theme`, `done`, `registry`, and `wrapText`, and accepts overrides including a convenience `width` and `messages` parameter.
+2. **Width-loop assertion** — the `for (const w of widths) { create viewer; assertAllLinesFit(viewer.render(w), w) }` pattern repeats in 10 "render width safety" tests.
+   Extract as an inline `assertRenderFitsWidths(messages, widths?, viewerOverrides?)` helper.
+These helpers stay inline (not in `test/helpers/`) because they depend on file-local helpers (`mockTui`, `mockSession`, `ansiTheme`) and are only used by this one test file.
+### File 3: `agent-config-editor.test.ts` — inline setup helper
+One clone family to extract:
+1. **Detail-test setup** — `makeEditor()` + `makeMenuUI([...])` + `fileOps.findAgentFile.mockReturnValue(...)` + optional `fileOps.read.mockReturnValue(...)` appears in ~18 tests.
+   Extract as an inline `setupDetail(selectResults, options?)` factory that returns `{ fileOps, editor, ui }` with pre-configured mocks.
+   Options: `filePath`, `fileContent`, `config` (merged into default via `createTestAgentConfig`).
+This stays inline because it's specific to the `showAgentDetail` test suite and depends on file-local `testRegistry` setup.
+## Module-level changes
+### New files
+| File                                 | Purpose                                                                                 |
+| ------------------------------------ | --------------------------------------------------------------------------------------- |
+| `test/helpers/manager-stubs.ts`      | `createBlockingRunner`, `createSessionRunner`, `createMockWorktrees`, `createRunResult` |
+| `test/helpers/manager-stubs.test.ts` | Smoke tests for the factories                                                           |
+### Modified files
+| File                                   | Change                                                                      |
+| -------------------------------------- | --------------------------------------------------------------------------- |
+| `test/lifecycle/agent-manager.test.ts` | Replace inline runner/worktree stubs with `manager-stubs` factories         |
+| `test/conversation-viewer.test.ts`     | Add `createTestViewer` + `assertRenderFitsWidths` inline, migrate all tests |
+| `test/ui/agent-config-editor.test.ts`  | Add `setupDetail` inline, migrate `showAgentDetail` tests                   |
+### Unchanged files
+No production source files are modified.
+No other test files are modified.
+## Test impact analysis
+1. **New unit tests**: `manager-stubs.test.ts` adds smoke tests verifying factory return shapes (blocking runner never resolves, session runner calls `onSessionCreated`, worktree factory returns the expected interface, run result contains the correct fields).
+2. **Simplified tests**: ~30 tests across the three files replace 3–6 lines of inline stub construction with 1-line factory calls.
+3. **Unchanged tests**: All existing test assertions remain identical — only the setup code changes.
+   Tests with custom runner behavior (event-emitting, gated, error-throwing) keep their inline stubs.
+## TDD order
+1. **Create `test/helpers/manager-stubs.ts` + `manager-stubs.test.ts`** Add `createBlockingRunner`, `createSessionRunner`, `createMockWorktrees`, `createRunResult`.
+   Add smoke tests verifying each factory's return shape and basic behavior.
+   Commit: `test: add manager-stubs helper factories (#219)`
+2. **Migrate `agent-manager.test.ts` to use manager-stubs** Replace 5 inline never-resolving runners with `createBlockingRunner()`.
+   Replace 4 identical worktree stubs with `createMockWorktrees()` / `createMockWorktrees({ create: ... })`.
+   Replace inline session-creating runners with `createSessionRunner(session)` where the test only needs `onSessionCreated` wiring.
+   Replace inline run-result objects with `createRunResult()` where the default shape suffices.
+   Run `pnpm vitest run test/lifecycle/agent-manager.test.ts` to verify green.
+   Commit: `test: migrate agent-manager tests to manager-stubs (#219)`
+3. **Add inline factories to `conversation-viewer.test.ts` and migrate** Add `createTestViewer(overrides?)` inline factory with defaults for all 8 constructor fields.
+   Add `assertRenderFitsWidths(messages, widths?, overrides?)` inline helper.
+   Migrate all 10 "render width safety" tests to use `assertRenderFitsWidths`.
+   Migrate all 5 "safety net" tests to use `createTestViewer`.
+   Run `pnpm vitest run test/conversation-viewer.test.ts` to verify green.
+   Commit: `test: reduce conversation-viewer test duplication (#219)`
+4. **Add inline `setupDetail` to `agent-config-editor.test.ts` and migrate** Add `setupDetail(selectResults, options?)` returning `{ fileOps, editor, ui }`.
+   Migrate `showAgentDetail` tests to use `setupDetail`.
+   Run `pnpm vitest run test/ui/agent-config-editor.test.ts` to verify green.
+   Commit: `test: reduce agent-config-editor test duplication (#219)`
+5. **Final verification** Run `pnpm vitest run` (full suite) to confirm no regressions.
+   Run `pnpm run check` to confirm no type errors.
+   Commit is not needed — this is a verification-only step.
+## Risks and mitigations
+1. **Factory defaults diverge from test intent** — If a shared factory's defaults don't match what an individual test expects, assertions silently pass or fail for the wrong reason.
+   Mitigation: diff all inline stubs against the proposed factory defaults before writing the factory.
+   Keep tests with unique mock behavior inline rather than force-fitting them into a factory.
+2. **Over-abstraction obscures test intent** — Extracting too many details into helpers makes tests harder to read.
+   Mitigation: only extract truly duplicated boilerplate (stub construction); keep test-specific setup and assertions inline.
+   The gated runner pattern stays inline for this reason.
+3. **Intermediate broken state** — Partially migrated test files may have import conflicts.
+   Mitigation: each TDD step fully migrates one file before committing.
+## Open questions
+None — the issue scope is well-defined and the dependencies are resolved.

package/docs/retro/0218-push-sdk-boundary-in-settings.md CHANGED Viewed

@@ -33,3 +33,40 @@ All 970 tests pass; `settings.ts` now has 0 Pi SDK imports and all `PI_CODING_AG
 - **Test simplification was significant:** Removed `originalAgentDirEnv` save/restore scaffolding from 5 `describe` blocks; the test code shrank by 32 lines net.
 - **`/nonexistent` sentinel:** Tests that construct `SettingsManager` but never call `load()` pass `agentDir: "/nonexistent"` — a clear signal the field is unused in that scope.
 - Architecture doc Step 5 heading marked `✓` and folded into the last `feat:` commit by `pi-autoformat`.
+## Stage: Final Retrospective (2026-05-26T17:22:11Z)
+### Session summary
+Issue #218 went from plan to shipped release (`pi-subagents-v7.8.0`) in a single continuous session.
+Planning, TDD (2 feat commits + 1 doc commit), shipping, CI verification, issue close, and release-please merge all completed without user intervention beyond stage transitions.
+### Observations
+#### What went well
+- **Clean mechanical execution:** The entire change was 2 production files (`settings.ts`, `index.ts`) and 1 test file, with zero unexpected test breakage and zero rework commits.
+- **Test simplification payoff:** Removing `PI_CODING_AGENT_DIR` env var scaffolding from 5 `describe` blocks shrank the test file by 32 lines net — a tangible improvement in test readability.
+- **Ship stage model efficiency:** The `/ship-issue` stage ran on `deepseek-v4-flash`, which was appropriate for the purely mechanical push/CI/close/merge workflow.
+#### What caused friction (agent side)
+1. `wrong-abstraction` — The plan split steps 1 and 2 into separate commits, but changing `loadSettings(cwd)` to `loadSettings(agentDir, cwd)` immediately broke `SettingsManager.load()` which calls it.
+   The agent recognized this during the red phase and combined them into one commit.
+   The existing testing skill rule ("When a TDD plan lists separate steps that share a type definition… fold them into one step") already covers this — the plan just didn't apply it.
+   Impact: added friction but no rework; recognized on first test run.
+2. `missing-context` — Attempted to add `| ✓ #218 |` as an extra column to one row of the architecture doc's findings table, creating a column-count mismatch.
+   The autoformatter reverted the broken table.
+   The agent then spent ~5 tool calls (`git show --stat`, `git status`, `grep` ×2, `read`) investigating what happened before switching to the Step 5 heading approach.
+   Impact: ~2 minutes of investigation; no rework beyond the heading edit.
+#### What caused friction (user side)
+- The user asked "Are we ready for shipping?"
+  which surfaced that the TDD retro stage notes were still uncommitted.
+  This was a useful checkpoint — the ship stage committed them before pushing.
+  Opportunity: the `/tdd-plan` prompt could commit retro notes as part of its final step, but the current flow (write notes, then commit in ship) is lightweight enough that enforcing it would add complexity for marginal gain.
+### Changes made
+1. Retro file updated at `packages/pi-subagents/docs/retro/0218-push-sdk-boundary-in-settings.md` — no other files changed.

package/docs/retro/0219-reduce-test-duplication-top-3-clone-families.md ADDED Viewed

@@ -0,0 +1,36 @@
+---
+issue: 219
+issue_title: "Reduce test duplication — top 3 clone families (Phase 13, Step 6)"
+---
+# Retro: #219 — Reduce test duplication — top 3 clone families
+## Stage: Planning (2026-05-26T20:00:00Z)
+### Session summary
+Analyzed duplication patterns in the three target test files (`agent-manager.test.ts`, `conversation-viewer.test.ts`, `agent-config-editor.test.ts`).
+Produced a 5-step TDD plan with shared `manager-stubs.ts` helper for runner/worktree factories, plus inline factories for the two UI test files.
+### Observations
+- The agent-manager test has the most diverse clone families (runner stubs, worktree stubs, run-result shapes) — these benefit from a shared helper file since the patterns are reused across 15+ describe blocks.
+- The conversation-viewer and config-editor duplication is more localized — inline factories within each test file are the right granularity to avoid over-extraction.
+- Gated runners (using `Promise.withResolvers`) were deliberately kept inline since they encode test-specific flow control that a factory would obscure.
+- Both dependencies (#214, #216) are closed, so the production code is stable and the tests won't shift under us during implementation.
+## Stage: Implementation — TDD (2026-05-26T17:42:41Z)
+### Session summary
+Completed all 4 TDD cycles: created `test/helpers/manager-stubs.ts` + `manager-stubs.test.ts` (13 smoke tests), migrated `agent-manager.test.ts`, `conversation-viewer.test.ts`, and `agent-config-editor.test.ts`.
+Test count delta: 970 → 983 (+13 from smoke tests).
+All 4 commits landed cleanly; full suite green at every step.
+### Observations
+- Target file line savings: `agent-manager.test.ts` −63, `conversation-viewer.test.ts` −58, `agent-config-editor.test.ts` −16; offset by +211 for the new helper files.
+  Net LOC is positive, but the _clone_ lines fallow detects are eliminated — the metric the issue targets.
+- The `createSessionRunner` + `createRunResult` chain required careful identity-check verification: `createRunResult(sess)` calls `toAgentSession(sess)` which casts without creating a new object, so `toBe(session)` assertions in the execution-state tests still pass. ✓
+- ESLint auto-fixed two cosmetic issues on commit (`activity = undefined` → `activity` destructuring, `session as unknown` cast removal) — caught by pre-commit hooks, not a problem in practice.
+- The `assertRenderFitsWidths` helper in `conversation-viewer.test.ts` reduced the 10 render-safety tests from ~8 lines each to 1–4 lines each; the `setupDetail` helper in `agent-config-editor.test.ts` eliminated 3 repeated setup lines per test across 18 `showAgentDetail` tests.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@gotgenes/pi-subagents",
-  "version": "7.8.0",
+  "version": "7.8.1",
   "type": "module",
   "exports": {
     ".": "./src/service.ts"