npm - @trashcodermaker/pi-pr-review-handler - Versions diffs - 1.1.4 → 1.2.1 - Mend

@trashcodermaker/pi-pr-review-handler 1.1.4 → 1.2.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/package.json +1 -1
package/skills/pi-pr-review-handler/SKILL.md +99 -16
package/skills/pi-pr-review-handler/agents/triage-agent.md +4 -1

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@trashcodermaker/pi-pr-review-handler",
-  "version": "1.1.4",
+  "version": "1.2.1",
   "description": "Systematically process GitHub PR review comments: triage for validity, fix code, and post replies. Pi package — install with `pi install npm:@trashcodermaker/pi-pr-review-handler`.",
   "type": "module",
   "license": "MIT",

package/skills/pi-pr-review-handler/SKILL.md CHANGED Viewed

@@ -41,7 +41,7 @@ Agent specs live in `agents/` relative to this skill (`agents/triage-agent.md`,
 | Platform | Dispatch mechanism |
 |----------|-------------------|
-| Pi | inline fallback (no native subtask mechanism) |
+| Pi | `subagent` tool if [pi-subagents](https://www.npmjs.com/package/pi-subagents) is installed, else inline fallback |
 | Claude Code | Task tool |
 | Cursor | background agent |
 | Gemini CLI / OpenCode / others | native subtask mechanism if available |
@@ -49,7 +49,9 @@ Agent specs live in `agents/` relative to this skill (`agents/triage-agent.md`,
 **Dispatch pattern**: read the relevant agent spec, embed its instructions into the task prompt along with the thread-specific input data (thread info for triage, verdict data for implementation), and launch one subtask per thread. Triage is read-only so subtasks run in parallel; implementation writes files so it runs serially.
-**Inline fallback**: if your platform has no subtask mechanism, you (the orchestrator) read each spec and perform its steps yourself, one thread at a time. The specs are written as direct instructions, so inline execution is straightforward.
+**Pi dispatch capability**: Pi does not bundle a subtask mechanism — it depends on the optional `pi-subagents` package (recommended in the project README). To detect it, look in your available tools list for a tool **literally named `subagent`** (the exact tool name, not a description match). If you see a tool named `subagent`, it is available — use it (PARALLEL mode for triage, SINGLE mode serially for implementation). If no tool named `subagent` appears in your tool list, fall back to inline execution. Do not assume pi-subagents is installed, and do not error if it is missing — the skill degrades gracefully either way.
+**Inline fallback**: if the `subagent` tool is not available (e.g. pi-subagents not installed) or the platform has no subtask mechanism, you (the orchestrator) read each spec and perform its steps yourself, one thread at a time. The specs are written as direct instructions, so inline execution is straightforward.
 ## Phase 0: Setup
@@ -96,11 +98,13 @@ gh api graphql -f query='
 Filter `isResolved: false`. Include full thread (not just top-level) — follow-up replies often contain the real concern.
+**Deduplicate threads before triage.** Automated reviewers (React Doctor, claude[bot], etc.) often post the same concern multiple times across different reviews, and human reviewers may re-comment after a push. Before dispatching Triage Agents, cluster threads by `(path, line)` and collapse those whose top-level comment bodies share the same core concern (match on the first sentence or the rule identifier like `react-doctor/prefer-useReducer`). Keep one representative thread per cluster, but record all `databaseId`s — Phase 4 posts the reply to **every** original thread in the cluster so no reviewer comment is left unanswered. Report the dedup result to the user (e.g. "8 threads → 3 unique concerns") so the Checkpoint 1 table stays readable.
 **Fallback: REST** (when GraphQL unavailable):
 ```bash
 gh api repos/{owner}/{repo}/pulls/{pr_number}/comments \
-  --jq 'group_by(.path, .original_commit_id) | map({
+  --jq 'group_by([.path, (.original_commit_id // "HEAD")]) | map({
     thread_id: .[0].id, path: .[0].path,
     comments: [.[] | {id, body, user: .user.login, line, in_reply_to_id}]
   })'
@@ -117,16 +121,44 @@ gh api repos/{owner}/{repo}/pulls/{pr_number}/reviews \
   --jq 'sort_by(.submitted_at) | reverse | .[:5] | .[] | {id, state, user: .user.login, body}'
 ```
-Include any non-empty review bodies alongside the thread data when presenting to the user in Phase 2.
+Present any non-empty review bodies in Checkpoint 1 (Phase 1) as a
+separate **Review-level feedback** section. These are summary comments
+without line references, so they do not go through the Triage Agent
+automatically — the user decides how to handle each one (ignore / reply
+only / needs code change). If the user marks one as needing a code change,
+convert it into an Implementation task with `path: <overall PR>` and no
+specific line; the Implementation Agent then works from the review body
+text and the PR diff.
+### Fetch PR diff
+Triage needs to see what the PR actually changed — without it, the agent cannot
+distinguish a problem the PR introduced from one that already existed in the
+base branch.
+```bash
+gh pr diff {pr_number} > /tmp/pr-{pr_number}.diff
+```
+Cache the diff to a temp file. When dispatching each Triage Agent in Phase 1,
+pass only the hunks relevant to that thread's `path` as `pr_diff_context`. If
+the total diff is small (< 500 lines), pass the full diff to every agent for
+broader context.
 ## Phase 1: Triage (parallel dispatch)
 ### Dispatch strategy
-Triage is read-only — safe to parallelize.
+Triage is read-only — safe to parallelize. First detect dispatch capability:
+- **Pi with `subagent` tool available** (pi-subagents installed): use PARALLEL mode — spawn one Triage Agent subtask per thread simultaneously. For implementation (Phase 2), use SINGLE mode serially — each fix is a separate subtask, one at a time, because fixes write files.
+- **No `subagent` tool / no subtask mechanism**: run inline, same logic, one thread at a time.
+Then choose a strategy based on thread count:
 - **≥3 threads + parallel capability**: spawn one Triage Agent per thread simultaneously
 - **≤2 threads or no parallel capability**: run inline, same logic
+- **Large PR (> 15 threads)**: batch by file path to keep context manageable. Group threads sharing the same `path` into the same batch (they share `pr_diff_context`, saving tokens). Dispatch one batch at a time, 8–10 threads per batch. Collect verdicts across batches before presenting Checkpoint 1. This avoids spawning dozens of subagents at once, which can hit API rate limits and produce a verdict table too long for the user to review.
 If unsure about parallel capability, default to inline.
@@ -142,13 +174,14 @@ comments:
   - <top-level comment text>
   - <reply 1, if any>
   - <reply 2, if any>
+pr_diff_context: <diff hunks for {path} from /tmp/pr-{pr_number}.diff, or full diff if PR is small>
 ```
 Embed the Triage Agent spec (`agents/triage-agent.md`) into the task prompt so the subtask has the full role instructions and output format, then append the thread-specific data above. Collect structured verdicts from all agents.
 ### Checkpoint 1: User confirmation
-Present verdicts as a summary table:
+Present thread verdicts as a summary table:
 | # | File:Line | Reviewer | Summary | Verdict | Affected Files |
 |---|-----------|----------|---------|---------|----------------|
@@ -156,7 +189,21 @@ Present verdicts as a summary table:
 | 2 | src/ui/Button.tsx:18 | bob | Rename for clarity | valid-fix | Button.tsx |
 | 3 | src/api/handler.ts:99 | alice | Add rate limiting | invalid | — |
-User can adjust verdicts or skip threads. Proceed with confirmed plan.
+Then present the **Review-level feedback** section (summary comments
+without line references, fetched in Phase 0):
+| # | Reviewer | State | Body (excerpt) |
+|---|----------|-------|----------------|
+| R1 | alice | CHANGES_REQUESTED | "Overall solid, but the auth module needs a refactor — see thread #1" |
+| R2 | bob | COMMENTED | "Please add tests for the token expiry edge case" |
+For each review-level item, ask the user to choose:
+- **Ignore** — no action needed (e.g. summary of already-addressed threads)
+- **Reply only** — draft a clarification in Phase 3, no code change
+- **Needs code change** — convert to an Implementation task: `path: <overall PR>`, no line, `summary: <review body>`, `affected_files: <user-specified or all PR files>`, `suggested_fix: <user describes>`. Dispatch in Phase 2.
+User can adjust thread verdicts or skip threads. Proceed with confirmed plan.
 ### Quick exits
@@ -191,6 +238,8 @@ prior_changes: <list of previous fixes in this PR, if any>
 Embed the Implementation Agent spec (`agents/implementation-agent.md`) into the task prompt so the subtask has the full role instructions, then append the verdict data above.
+**Pi dispatch**: if the `subagent` tool is available, use SINGLE mode — one subtask per fix, awaited in turn (serial). Pass `prior_changes` by collecting each completed subtask's output and appending it to the next subtask's input. If `subagent` is unavailable, execute the Implementation Agent spec inline, one thread at a time.
 After each agent completes:
 ```bash
@@ -202,15 +251,23 @@ If commit is empty (agent made no changes), skip.
 ### After all fixes
-```bash
-npx tsc --noEmit
-```
+Run the project's type checker or equivalent verification. Detect the
+project type and run the matching command — do not assume TypeScript:
-If tsc fails:
+| Project marker | Command |
+| --- | --- |
+| `tsconfig.json` | `npx tsc --noEmit` |
+| `package.json` (no tsconfig) | `npm run lint --if-present` and `npm test --if-present` |
+| `pyproject.toml` / `setup.py` | `ruff check .` then `mypy .` (if configured) |
+| `go.mod` | `go build ./...` |
+| `Cargo.toml` | `cargo check` |
+| none recognized | skip; tell user to run the project's check manually |
+If the check fails:
 - Identify which commit introduced the error (`git bisect` or check error file paths)
 - `git revert --no-commit {commit}` → fix the error → recommit
-- Re-run tsc until clean
+- Re-run the check until clean
 Also check:
@@ -236,12 +293,38 @@ Draft one reply per thread, using `git diff origin/{branch}...HEAD` as ground tr
 **valid-fix (succeeded)**: describe what was changed, referencing the reviewer's concern. If the fix differs from what the reviewer suggested, explain why.
+Examples:
+> Good (EN): "Added the null check at line 42 as you suggested — `user` can indeed be undefined when the session expires."
+>
+> Good (中文): "已在 42 行加了空值判断，session 过期时 `user` 确实可能为 undefined。"
+>
+> Good (EN, diverged from suggestion): "Your point about rate limiting is valid. Instead of a fixed window I used a token bucket in `rateLimit.ts` — it handles burst traffic better and the existing tests cover it."
 **valid-fix (failed)**: acknowledge the concern was valid. Explain why the fix couldn't be applied (type conflict, dependency issue). Suggest next steps if possible ("will address in a follow-up PR"). Don't be apologetic — just factual.
+Examples:
+> Good (EN): "You're right that this should be typed more strictly, but the `User` interface is shared with the legacy auth module which expects `any`. I'll extract a `StrictUser` type in a follow-up PR to avoid breaking that module here."
+>
+> Good (中文): "这里确实该用更严格的类型，但 `User` 接口被旧 auth 模块共用且依赖 `any`。我会在后续 PR 里拆出 `StrictUser` 类型，避免这里改动波及该模块。"
 **valid-nofix**: acknowledge the concern is valid. Explain why no code change is needed. Provide clarification if the reviewer misunderstood the code.
+Examples:
+> Good (EN): "Fair point on the naming — `handleX` is a bit vague. It's part of the public API documented in `docs/api.md`, so renaming would be a breaking change. I'll add a deprecation alias in the next major."
+>
+> Good (中文): "命名确实不够清晰，不过 `handleX` 是 `docs/api.md` 里记录的公开 API，重命名属于破坏性变更，下个大版本会加弃用别名。"
 **invalid**: explain clearly why the premise doesn't apply. Reference specific code that already handles the concern. Acknowledge the reviewer's intent ("I see why you'd think X, but..."). Be respectful but direct — don't hedge if the concern is genuinely wrong.
+Examples:
+> Good (EN): "I see why you'd think the count could overflow here, but `items` is already capped at `MAX_ITEMS` (line 15) before this loop runs, so `i` stays well within `Number.MAX_SAFE_INTEGER`."
+>
+> Good (中文): "能理解你担心这里 count 溢出，但进入循环前 `items` 已在 15 行被 `MAX_ITEMS` 截断，`i` 始终远小于 `Number.MAX_SAFE_INTEGER`。"
 ### Reply guidelines
 - Match the reviewer's language (Chinese reviewer → Chinese reply)
@@ -261,14 +344,14 @@ Present all reply drafts. User can:
 ## Phase 4: Post & Push
-Post approved replies:
+Post approved replies. The reply endpoint requires `-X POST` and the PR number in the path:
 ```bash
-gh api repos/{owner}/{repo}/pulls/comments/{comment_id}/replies \
+gh api -X POST repos/{owner}/{repo}/pulls/{pr_number}/comments/{comment_id}/replies \
   -f body='The response text'
 ```
-Reply to the top-level comment of each thread (the one with `databaseId`, not a reply).
+Reply to the top-level comment of each thread (the one with `databaseId`, not a reply). The `{pr_number}` is the PR number (e.g. `561`), and `{comment_id}` is the `databaseId` from Phase 0. Note: the path is `pulls/{pr_number}/comments/...`, **not** `pulls/comments/...` — the latter returns 404.
 Push all commits:
@@ -310,7 +393,7 @@ Output final summary:
 | API rate limit | Wait for `X-RateLimit-Reset`, retry |
 | PR closed/merged | Warn user — replies have no effect |
 | GraphQL unsupported | Fall back to REST with resolved-thread caveat |
-| tsc fails after all fixes | Fix before posting replies |
+| Type check fails after all fixes | Fix before posting replies |
 ## Key Principles

package/skills/pi-pr-review-handler/agents/triage-agent.md CHANGED Viewed

@@ -23,14 +23,17 @@ comments:
   - [top-level comment]
   - [reply 1, if any]
   - [reply 2, if any]
+pr_diff_context: [diff hunks for this file from the PR, or full diff if PR is small]
 ```
 ## Steps
-### 1. Read the referenced code
+### 1. Read the referenced code and PR diff context
 Open `{path}` and examine the code at `{line}` plus surrounding context (±20 lines minimum). Understand what the code does, what it depends on, and what depends on it.
+Then read `pr_diff_context` — the diff hunks for this file from the PR. This tells you whether the code the reviewer commented on was introduced by this PR, modified by it, or already existed in the base branch. A concern about code the PR didn't touch is usually out of scope for this review.
 ### 2. Read the full thread carefully
 The top-level comment may be refined, overridden, or clarified by follow-up replies. The actual concern may be in a reply, not the original comment. Weight later comments appropriately — they often represent the reviewer's evolved thinking.