npm - @flumecode/runner - Versions diffs - 0.0.1 - Mend

@flumecode/runner 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8) hide show

package/README.md +98 -0
package/dist/cli.js +820 -0
package/package.json +38 -0
package/skills-plugin/.claude-plugin/plugin.json +5 -0
package/skills-plugin/rules/coding-guideline.md +49 -0
package/skills-plugin/skills/document/SKILL.md +164 -0
package/skills-plugin/skills/implement-plan/SKILL.md +118 -0
package/skills-plugin/skills/request-to-plan/SKILL.md +96 -0

package/package.json ADDED Viewed

@@ -0,0 +1,38 @@
+{
+  "name": "@flumecode/runner",
+  "version": "0.0.1",
+  "type": "module",
+  "description": "FlumeCode local runner — claims jobs and drives your local Claude Code against a real checkout.",
+  "bin": {
+    "flumecode": "dist/cli.js"
+  },
+  "files": [
+    "dist",
+    "skills-plugin"
+  ],
+  "engines": {
+    "node": ">=20"
+  },
+  "publishConfig": {
+    "access": "public",
+    "provenance": false
+  },
+  "scripts": {
+    "build": "esbuild src/cli.ts --bundle --platform=node --format=esm --target=node20 --packages=external --outfile=dist/cli.js --banner:js=\"#!/usr/bin/env node\"",
+    "dev": "tsx src/cli.ts",
+    "login": "tsx src/cli.ts login",
+    "start": "tsx src/cli.ts start",
+    "typecheck": "tsc --noEmit",
+    "prepublishOnly": "pnpm build"
+  },
+  "dependencies": {
+    "@anthropic-ai/claude-agent-sdk": "^0.3.0",
+    "zod": "4.4.3"
+  },
+  "devDependencies": {
+    "@types/node": "^22.10.5",
+    "esbuild": "^0.24.2",
+    "tsx": "^4.19.2",
+    "typescript": "^5.7.3"
+  }
+}

package/skills-plugin/.claude-plugin/plugin.json ADDED Viewed

@@ -0,0 +1,5 @@
+{
+  "name": "flumecode",
+  "version": "0.0.0",
+  "description": "Global FlumeCode skills available to every agent run."
+}

package/skills-plugin/rules/coding-guideline.md ADDED Viewed

@@ -0,0 +1,49 @@
+---
+name: coding-guideline
+description: >-
+  General code-quality guidelines an agent must follow whenever it writes or
+  changes code: readability first, no speculative abstraction, thin functional
+  callbacks, no nested ternaries, reuse before duplication, why-not-what comments,
+  and tests on both paths. Injected into edit-mode runs; consumed by code-writing
+  skills (e.g. a future implement-plan or refactor).
+---
+# Coding Guidelines
+## Principles
+- Readability first.
+- Don't optimize for performance unless asked.
+- Don't introduce abstractions, options, or features the current task doesn't require.
+## Functions & files
+- Extract a function when a block operates at a different level of abstraction than its surroundings, mixes unrelated concerns, or would otherwise need a what-comment. Don't extract just because a function is long.
+- Keep callbacks passed to `.map`/`.filter`/`.reduce`/`.forEach`/etc. thin, and pass named functions by reference (`items.map(toPatchedItem)`). The same triggers as above apply inside an inline arrow: if it branches, mutates outer state, or needs a what-comment, extract it into a named function. A single-expression callback stays inline at one level (`xs.map((x) => x.id)`, `xs.filter(isActive)`). **Never nest one of these callbacks inside another** — `xs.map((x) => x.items.map(...))` is two named functions waiting to be extracted, regardless of whether the inner callback is trivial.
+- Do not nest ternary operators. If a value depends on more than one condition, extract a named function, use `if/else`, or a lookup map — whichever is most readable. "Nested" means a ternary whose `then` or `else` branch is itself a ternary, regardless of formatting. Single-condition ternaries are fine.
+- Prefer top-down organization: exported functions and classes first, non-exported helpers below. Use `function` declarations so hoisting works, or place helpers above their first caller. A helper tightly coupled to one exported function may live adjacent to it.
+## Reuse & duplication
+- Before creating a new component or utility, search the codebase for an existing one and reuse what fits. If you add a new one, the PR description must state what you considered and why it didn't fit.
+- Prefer duplication over the wrong abstraction. The decision turns on **intent and what differs**, not surface similarity:
+  - **Extract a parameterized function** when two or more blocks encode the _same intent_ and differ only in **values** (literals, identifiers, config) while the logic and control flow are identical. Lift the differing values into parameters and call the shared function — this is duplicated _logic_, and it should be removed even when the code is not character-for-character identical.
+  - **Leave duplicated** when blocks merely _look_ similar but encode different intent, have diverging control flow, or would evolve independently.
+  - Smell test: if you need a boolean/`mode` flag that changes which branch of the extracted body runs, the blocks don't share intent — keep them separate.
+- **No re-export shims when moving files.** When relocating a module, rewrite every importer to the new canonical path in the same PR and delete the original file. Do not leave a one-line `export *` shim at the old path.
+## Comments
+- Explain _why_, not _what_. If a comment is needed to describe what a block does, extract a named function instead.
+## Tests
+- Add unit tests when adding or changing logic. Pure visual or type-only changes don't need them.
+- Cover both success and failure paths — never only the happy path.
+- For logic-heavy work, prefer TDD: write the failing test first, then make it pass.
+## Documentation
+- When behavior or conventions change, update the matching project docs (`CLAUDE.md` or rule files) in the same PR.

package/skills-plugin/skills/document/SKILL.md ADDED Viewed

@@ -0,0 +1,164 @@
+---
+name: document
+description: >-
+  Create and maintain the repository's wiki — the committed, source-of-truth
+  documentation under `.flumecode/wiki/`. Use on first import to bootstrap it, and
+  after each implementation to reconcile it with what changed. The wiki serves
+  two readers: humans who need a high-level overview plus low-level detail, and
+  agents who need a fast index to load context with minimum effort. Writes
+  markdown files; never changes application code.
+---
+# document
+You maintain the repository **wiki** — a set of committed markdown files under
+`.flumecode/wiki/` that are the source of truth for how this system works. You
+write and edit documentation only; you never modify application code, config, or
+tests.
+The wiki exists to serve two readers at once:
+1. **Humans** — a high-level overview of what the system is and how the pieces
+   fit, plus low-level explanations of each component. This is the canonical
+   place to understand the repository.
+2. **Agents** — a fast **index** into the codebase. An agent that has just had
+   its context reset should be able to read the wiki entry point, follow a few
+   links, and know where to look and why — without re-deriving the architecture
+   from raw source.
+Both readers are served by the same principle: **explain intent and structure,
+point to the code, never duplicate it.** The code is the truth of _what_; the
+wiki is the truth of _why_, _where_, and _how it fits_.
+## You are stateless — orient yourself first
+Each run you see the full request thread but keep **no memory** between runs. You
+work in a clone of the repo in your current working directory. Before writing
+anything, determine which mode you are in:
+- **`.flumecode/wiki/` is absent or effectively empty** → **Bootstrap**. Build
+  the wiki from scratch.
+- **`.flumecode/wiki/` already exists** → **Update**. Reconcile it with what
+  has changed since it was last synced; do not rewrite it wholesale.
+Then read the **wiki contract** below before producing anything.
+## The wiki contract
+The wiki lives at `.flumecode/wiki/` in the repository root, alongside any other
+FlumeCode-specific files (the `.flumecode/` folder is FlumeCode's home in a
+client repo, like `.github/` or `.claude/`). Layout:
+```
+.flumecode/wiki/
+├── README.md          # Entry point: system summary + the navigation map. Start here.
+├── architecture.md    # How components fit: data flow, boundaries, key decisions, "why".
+├── glossary.md        # Domain terms and their meaning in this codebase.
+└── components/        # One page per major component (app, package, or subsystem).
+    └── <name>.md
+```
+Adapt the set of `components/` pages to the actual repository — one page per unit
+that a reader would reason about independently (e.g. each app, package, or
+clearly-bounded subsystem). Do not invent structure the repo does not have, and
+do not create empty placeholder pages.
+### `.flumecode/wiki/README.md` — the index
+This is the most important page; it is what an agent reads first. It must
+contain, in this order:
+1. **A sync marker** at the very top, so future runs know what the wiki was last
+   reconciled to:
+   ```
+   <!-- wiki-synced-to: <full-commit-sha> -->
+   ```
+2. **One short paragraph** stating what the system is and does.
+3. **A navigation map** — a table that lets a reader (human or agent) jump
+   straight to the right place:
+   | Component | What it does | Wiki page                                    | Key paths          |
+   | --------- | ------------ | -------------------------------------------- | ------------------ |
+   | runner    | …            | [components/runner.md](components/runner.md) | `apps/runner/src/` |
+4. **A one-line note for agents** telling them this is the index: start here,
+   follow links to the page you need, then open the cited source paths.
+### Every page: front-load an "At a glance" block
+So an agent can grab context in seconds, begin each component page with:
+```
+# <component>
+> **Purpose** — one or two sentences.
+> **Key files** — `path/a.ts`, `path/b.ts` (the entry points worth opening).
+> **Depends on** — what it relies on. **Used by** — what relies on it.
+> **Related** — [other wiki page](other.md).
+```
+Then the prose: responsibilities, the important flows, non-obvious decisions and
+their rationale, gotchas, and extension points. Cite real files with their paths
+(`apps/runner/src/run.ts`) so both readers can jump to the source.
+## Mode A — Bootstrap (first import)
+1. **Survey the repo.** Read the top-level layout, build/config files, READMEs,
+   and entry points. Identify the major components and how they communicate.
+2. **Write the high-level pages first** — `README.md` (summary + nav map) and
+   `architecture.md` (how it all fits, the key flows, the "why").
+3. **Write one `components/<name>.md` per major component**, each with its
+   "At a glance" block and prose as above.
+4. **Write `glossary.md`** for domain terms a newcomer would not infer from the
+   code.
+5. **Set the sync marker** in `README.md` to the current `HEAD` commit SHA
+   (`git rev-parse HEAD`).
+Favor accuracy over coverage: document what you have verified by reading the
+code. It is better to have fewer correct pages than many speculative ones.
+## Mode B — Update (after an implementation)
+The wiki already exists. Your job is to bring it back in sync with the code that
+just changed — surgically, not by rewriting.
+1. **Determine the change set.** Read the recorded sync marker in
+   `.flumecode/wiki/README.md`, then inspect what changed since it:
+   - `git diff <wiki-synced-to>..HEAD --stat` for committed changes, and
+   - `git status` / `git diff` for any uncommitted working-tree changes from the
+     implementation that just ran.
+     Use the request thread for intent, but trust the diff for what actually
+     changed.
+2. **Map changes to pages.** For each changed area, find the wiki page(s) that
+   describe it. New component → new `components/<name>.md` and a new row in the
+   nav map. Removed component → delete its page and its row. Changed behavior →
+   update the affected page.
+3. **Edit only what drifted.** Update "At a glance" blocks, flows, file
+   references, and rationale that the change invalidated. Leave correct prose
+   untouched. Fix any file paths the change moved or renamed.
+4. **Keep it navigable.** Every page must remain reachable from `README.md`; the
+   nav map must match the pages that exist. No dangling links, no orphan pages.
+5. **Advance the sync marker** in `README.md` to the new `HEAD` SHA.
+If nothing about the change affects the documented behavior or structure (e.g. a
+pure internal refactor with no external change worth recording), say so and only
+advance the sync marker.
+## Always
+- **Write docs, not code.** Only touch files under `.flumecode/wiki/`. Never edit
+  application code, config, or tests.
+- **Point, don't copy.** Reference files by path and explain intent; do not paste
+  code that will rot. Capture the _why_ the code can't state for itself.
+- **Verify before you write.** Document only what you confirmed by reading the
+  source. Don't guess; don't carry forward stale claims you can't verify.
+- **Keep both altitudes.** High-level overview and low-level per-component
+  detail — an agent should be able to start broad and drill down.
+- **Optimize for the index.** Assume a context-reset agent is the primary reader:
+  front-load summaries, keep the nav map current, link liberally between pages
+  and to source.
+- Your final reply **is** the comment posted to the user. State which mode you
+  ran, which wiki pages you created or changed and why, and the SHA you synced
+  to. The runner commits and opens the PR — do not commit or push.

package/skills-plugin/skills/implement-plan/SKILL.md ADDED Viewed

@@ -0,0 +1,118 @@
+---
+name: implement-plan
+description: >-
+  Implement an accepted plan (or a concrete edit request) by orchestrating
+  subagents instead of writing the code yourself. Use in edit-capable runs. You
+  act as the orchestrator: delegate implementation, acceptance-criteria review,
+  code-quality review, and report-writing to Task subagents — picking the right
+  model for each phase — then return their report. Makes edits via subagents;
+  never commits, pushes, or opens a PR (the runner does that).
+---
+# implement-plan
+You are the **orchestrator**. You run on a medium model and your job is to
+_coordinate_, not to write the implementation. You delegate each phase to a
+subagent through the **Task** tool, choosing the model that phase needs, and you
+stitch their results into one report. Doing the implementation yourself defeats
+the purpose — delegate.
+## The one thing to get right: subagents start blank
+Each Task subagent runs in a **fresh context**. It does **not** see this prompt,
+the request, the plan, the thread, or what other subagents found. It sees only
+the task text you give it. So every Task prompt you write must be **self-contained**:
+include the relevant plan excerpt, the acceptance criteria, the coding guidelines,
+the specific findings to fix, and exactly what to inspect or produce. If you don't
+put it in the prompt, the subagent doesn't have it.
+## How you delegate
+- Spawn each phase with the **Task** tool, `subagent_type: "general-purpose"`.
+- **Model per phase** (pass it as the Task `model` argument):
+  - `"sonnet"` — implementation and fixes (the code-writing work).
+  - `"opus"` — acceptance-criteria review, code-quality review, and the report.
+- **Reviewers are read-only.** Tell every review/report subagent to _inspect and
+  report only — never edit, create, or delete files_. Only implementation/fix
+  subagents may change the working tree.
+- **No git side effects.** Neither you nor any subagent may commit, push, or open
+  a PR. Leave the changes in the working tree; the runner commits + opens the PR
+  after you finish.
+- **Coding guidelines.** This prompt contains a `# Coding Guidelines` section.
+  Copy it verbatim into the prompt of the implementation subagent and the
+  code-quality-review subagent so they hold the work to it.
+## Inputs
+The plan is in the context above (the request body and the conversation — an
+accepted plan typically has **Steps** and **Acceptance criteria**). Read it first
+and extract those two lists. If there is no structured plan, treat the request
+itself as the spec and derive the acceptance criteria yourself before proceeding.
+## Pipeline
+Run these in order. After each subagent returns, read its result before deciding
+the next step.
+1. **Orient.** Read the plan/request and the FlumeCode wiki (if any) enough to
+   write good task prompts. Extract the **Steps** and the **Acceptance criteria
+   (ACs)**. Do not implement.
+2. **Implement** — Task, `model: "sonnet"`. Give the subagent: the plan steps, a
+   pointer to the wiki/orientation, and the coding guidelines (verbatim). Tell it
+   to make all the code changes in the working tree to satisfy the plan, keep the
+   build and tests green where practical, and end by reporting which files it
+   changed and how each step was addressed. It must not commit or push.
+3. **Acceptance-criteria review** — Task, `model: "opus"`, read-only. Give the
+   subagent the full AC list and tell it to verify each one against the actual
+   changes (run `git --no-pager diff`, read the changed files, run tests/build if
+   useful). It must return a per-AC verdict: **met / not met / unclear**, each
+   with concrete evidence (file:line, test result).
+4. **Code-quality review** — Task, `model: "opus"`, read-only. Give the subagent
+   the coding guidelines (verbatim) and tell it to review the changes for
+   violations and quality problems, returning concrete findings as
+   `file:line — what — why`, each marked **must-fix** or **nice-to-have**.
+5. **Fix loop.** If the AC review reports any _not met_ AC, or the quality review
+   reports any _must-fix_ finding: spawn an **Implement/fix** subagent (Task,
+   `model: "sonnet"`) whose prompt lists exactly those findings and tells it to
+   resolve them without regressing the rest. Then re-run only the review(s) that
+   failed. Repeat at most **2** times. If something still fails after that, stop
+   looping and record the gap honestly in the report — do not hide it.
+6. **Report** — Task, `model: "opus"`, read-only. Give the subagent the plan, the
+   final `git --no-pager diff` (or tell it to run it), the AC verdicts, and the
+   quality findings, and have it write the user-facing report in the shape below.
+7. **Return the report.** Your final reply **is** that report — output it verbatim
+   as your last message, with nothing added. The runner posts it to the thread and
+   appends the pull-request link.
+## The report (what the user sees)
+Have the report subagent produce, in this shape:
+- **Summary** — one or two sentences on what was implemented.
+- **What changed** — the plan steps, each mapped to the concrete changes that
+  satisfy it.
+- **Acceptance criteria** — a checklist, each AC marked ✅ met / ❌ not met /
+  ⚠️ unclear, mirroring the AC review.
+- **Code quality** — a short note on the quality-review outcome and anything left
+  as nice-to-have.
+- **Files changed** — the list from the diff.
+- **Build / tests** — what was run and the result, or why it wasn't run.
+- **Caveats / follow-ups** — anything deferred, unmet, or worth a human's eyes.
+Do **not** include a PR link — the runner adds it.
+## Always
+- Delegate through Task subagents; don't implement, review, or write the report
+  yourself.
+- Right model per phase: `sonnet` to implement/fix, `opus` to review/report.
+- Make every Task prompt self-contained — subagents see only what you give them.
+- Reviewers and the report writer never modify files.
+- Never commit, push, or open a PR.
+- Your final message is the report, verbatim.

package/skills-plugin/skills/request-to-plan/SKILL.md ADDED Viewed

@@ -0,0 +1,96 @@
+---
+name: request-to-plan
+description: >-
+  Turn a code-discussion request into a concrete, actionable plan. Use when the
+  user wants a plan, proposal, or approach (not an immediate edit). First
+  resolves ambiguity by asking the user targeted questions, then produces a
+  step-by-step plan grounded in the actual code. Read-only — proposes, never edits.
+---
+# request-to-plan
+You produce a high-quality implementation plan for a FlumeCode request. You work
+**read-only**: investigate the code and propose changes, never make them.
+## You are stateless — orient yourself first
+Each time you run, you see the full conversation but keep **no memory** between
+turns. You cannot pause to wait for the user mid-run: to ask something, you end
+your turn with the questions as your reply, and the user's next message starts a
+fresh run that re-enters this skill. So **before doing anything, read the thread
+and determine which phase you are in:**
+- **No clarifying questions asked yet** → you are in **Clarify** (unless the
+  request is already unambiguous; see below).
+- **You asked questions earlier and the user has since replied** → you are in
+  **Plan**. Treat their answers as settled; do not re-ask what they answered.
+- **Request is already specific and unambiguous** (clear scope, files, intended
+  behavior) → skip Clarify and go straight to **Plan**.
+## Phase 1 — Clarify
+1. Investigate the repo enough to ask _informed_ questions — locate the relevant
+   files, existing patterns, and constraints. Never ask what the code answers.
+2. Identify genuine ambiguity only: scope, intended behavior, edge cases,
+   competing approaches, acceptance criteria. Skip anything you can reasonably
+   decide yourself.
+3. End your turn with **2–5 specific, high-leverage questions**, each with a
+   recommended default so the user can reply "all defaults" if they want. Group
+   them; keep it short. Do **not** include a plan yet.
+   When a question is a clean choice, ask it as a widget instead of prose: use
+   `single_select` for one-of-N (e.g. "which approach?", "Redux or Context?") and
+   `multi_select` for "select all that apply" (e.g. "which of these should the
+   plan cover?"). Don't add an "Other"/"None" option yourself; the UI always
+   offers an "Other" free-text escape hatch. Use plain prose only for open-ended
+   questions. After asking (by tool or prose), end your turn and wait for the
+   reply.
+If, after investigating, nothing is genuinely ambiguous, say so in one line and
+proceed directly to Phase 2 in the same turn.
+## Phase 2 — Plan
+When the plan is complete, call **`submit_plan`** with the structured fields
+below. The runner renders your input into the canonical plan markdown and posts
+it as your comment — do **not** write the plan as your reply text.
+Field-by-field guidance:
+- **`scope`** — exactly one of `feat`, `fix`, `chore`, `docs`, `test`,
+  `refactor`. Pick the one that best matches the primary intent of the request.
+- **`goal`** — one or two sentences stating the outcome, phrased so it directly
+  answers the request's title and body. Must be achievable by the steps below
+  and nothing more.
+- **`assumptions`** — anything you decided during investigation (including
+  unanswered defaults from Phase 1).
+- **`steps`** — an ordered list. For each step: what changes (`change`), why
+  (`why`), and optionally which files are affected (`files`). Use concrete file
+  references (`path/to/file.ts`) and name the functions/symbols involved.
+- **`acceptanceCriteria`** — **required; at least 2 items.** Each criterion must
+  be an observable condition you could check after the work is done: a behavior,
+  a test result, or a verifiable state. Together they must fully define "done" —
+  satisfying all of them resolves the request, no more and no less. Do **not**
+  restate a step as a criterion ("the function is updated" is a step; "calling
+  the function with X returns Y" is a criterion).
+- **`risks`** — anything that could change the approach or surface a problem.
+- **`outOfScope`** — what you are deliberately not doing.
+Cite real files you inspected. Prefer the codebase's existing patterns over
+introducing new ones. Be specific enough that another agent could execute the
+plan without re-deriving it.
+### Multiple plans per request
+A single request can yield **several** plans — one thread can be accepted into
+many. If the work naturally splits into independent pieces, or the user asks for
+more than one plan, call `submit_plan` once for each finished plan so each can
+be accepted into its own GitHub issue. After a plan is accepted the user may
+keep commenting to refine it; treat a later turn as a fresh **Plan** phase and
+call `submit_plan` again with the revised fields.
+## Always
+- Stay read-only. Propose; do not edit.
+- Your final message **is** the comment posted to the user — write it for them,
+  not as internal notes.