npm - ralphctl - Versions diffs - 0.2.1 → 0.2.2 - Mend

ralphctl 0.2.1 → 0.2.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10) hide show

package/README.md +105 -87
package/dist/{chunk-AXNZMHFQ.mjs → chunk-XXIHDQOH.mjs} +6 -2
package/dist/cli.mjs +3 -3
package/dist/prompts/ideate-auto.md +6 -0
package/dist/prompts/plan-auto.md +2 -2
package/dist/prompts/task-evaluation.md +10 -4
package/dist/prompts/task-execution.md +5 -4
package/dist/prompts/ticket-refine.md +1 -1
package/dist/{wizard-TFJXEYD2.mjs → wizard-D7N5WZ5H.mjs} +1 -1
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -4,53 +4,74 @@
 [![License: MIT](https://img.shields.io/badge/license-MIT-blue?style=flat&logo=opensourceinitiative&logoColor=white)](./LICENSE)
 [![TypeScript](https://img.shields.io/badge/TypeScript-5.9-3178c6?style=flat&logo=typescript&logoColor=white)](https://www.typescriptlang.org/)
 [![Node.js](https://img.shields.io/badge/node-%E2%89%A5_24-5fa04e?style=flat&logo=nodedotjs&logoColor=white)](https://nodejs.org/)
-[![code style: prettier](https://img.shields.io/badge/code_style-prettier-ff69b4?style=flat&logo=prettier&logoColor=white)](https://prettier.io/)
-[![ESLint](https://img.shields.io/badge/ESLint-4b32c3?style=flat&logo=eslint&logoColor=white)](https://eslint.org/)
 [![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen?style=flat&logo=git&logoColor=white)](./CONTRIBUTING.md)
 [![Claude Code](https://img.shields.io/badge/Claude_Code-191919?style=flat&logo=anthropic&logoColor=white)](https://docs.anthropic.com/en/docs/claude-code)
 [![GitHub Copilot](https://img.shields.io/badge/GitHub_Copilot-000?style=flat&logo=githubcopilot&logoColor=white)](https://docs.github.com/en/copilot/github-copilot-in-the-cli)
-[![Built with Donuts](https://img.shields.io/badge/%F0%9F%8D%A9-Built_with_Donuts-ff6f00?style=flat)](https://github.com/lukas-grigis/ralphctl)
 ```
-  🍩 ██████╗  █████╗ ██╗     ██████╗ ██╗  ██╗ ██████╗████████╗██╗     🍩
-     ██╔══██╗██╔══██╗██║     ██╔══██╗██║  ██║██╔════╝╚══██╔══╝██║
-     ██████╔╝███████║██║     ██████╔╝███████║██║        ██║   ██║
-     ██╔══██╗██╔══██║██║     ██╔═══╝ ██╔══██║██║        ██║   ██║
-     ██║  ██║██║  ██║███████╗██║     ██║  ██║╚██████╗   ██║   ███████╗
-     ╚═╝  ╚═╝╚═╝  ╚═╝╚══════╝╚═╝     ╚═╝  ╚═╝ ╚═════╝   ╚═╝   ╚══════╝
+  ██████╗  █████╗ ██╗     ██████╗ ██╗  ██╗ ██████╗████████╗██╗
+  ██╔══██╗██╔══██╗██║     ██╔══██╗██║  ██║██╔════╝╚══██╔══╝██║
+  ██████╔╝███████║██║     ██████╔╝███████║██║        ██║   ██║
+  ██╔══██╗██╔══██║██║     ██╔═══╝ ██╔══██║██║        ██║   ██║
+  ██║  ██║██║  ██║███████╗██║     ██║  ██║╚██████╗   ██║   ███████╗
+  ╚═╝  ╚═╝╚═╝  ╚═╝╚══════╝╚═╝     ╚═╝  ╚═╝ ╚═════╝   ╚═╝   ╚══════╝
 ```
-**Agent harness for long-running AI coding tasks — orchestrates [Claude Code](https://docs.anthropic.com/en/docs/claude-code) & [GitHub Copilot](https://docs.github.com/en/copilot/github-copilot-in-the-cli) across repositories.**
+**Agent harness for long-running AI coding tasks —
+orchestrates [Claude Code](https://docs.anthropic.com/en/docs/claude-code) & [GitHub Copilot](https://docs.github.com/en/copilot/github-copilot-in-the-cli)
+across repositories.**
 > _"I'm helping!"_ — Ralph Wiggum
 > [!NOTE]
 > **Early access.** RalphCTL is under active development. Things work, but expect rough edges and breaking changes
-> before 1.0. Read the [blog post](https://lukasgrigis.dev/blog/building-ralphctl) for the backstory.
+> before 1.0.
-RalphCTL decomposes work into dependency-ordered tasks, executes them through AI coding agents, and runs a
-[generator-evaluator loop](https://www.anthropic.com/engineering/harness-design-long-running-apps) to catch issues
-before moving on. It manages context across sessions so nothing gets lost — whether you're working on a single ticket
-or coordinating changes across multiple repositories. Ralph Wiggum personality included because why not.
+---
+## Why ralphctl?
+AI coding agents are powerful but lose context on long tasks, need babysitting when things break, and have no way to
+coordinate changes across multiple repositories. RalphCTL decomposes your work into dependency-ordered tasks, runs each
+one through a [generator-evaluator loop](https://www.anthropic.com/engineering/harness-design-long-running-apps) that
+catches issues before moving on, and persists context across sessions so nothing gets lost. You describe what to build —
+ralphctl handles the rest.
 ---
-## Install
+## How It Works
-```bash
-npm install -g ralphctl
 ```
+  You describe what to build           ralphctl handles the rest
+  ─────────────────────────           ─────────────────────────────────
+  ┌──────────┐   ┌──────────┐        ┌────────┐   ┌──────┐   ┌─────────┐
+  │  Create  │──>│   Add    │───────>│ Refine │──>│ Plan │──>│ Execute │
+  │  Sprint  │   │ Tickets  │        │ (WHAT) │   │(HOW) │   │  Loop   │
+  └──────────┘   └──────────┘        └────────┘   └──────┘   └─────────┘
+                                          │            │           │
+                                     AI clarifies  AI generates  AI implements
+                                     requirements  task graph    + AI reviews
+                                     with you      from specs    each task
+```
+- **Dependency-ordered execution** — tasks run in the right sequence, one per repo at a time, with parallel execution
+  where possible
+- **Generator-evaluator cycle** — an independent AI reviewer checks each task against its spec; if it fails, the
+  generator gets feedback and iterates
+- **Context persistence** — sprint state, progress history, and task context survive across sessions; interrupted work
+  resumes where it left off
-This installs the `ralphctl` command globally.
+---
-### Prerequisites
+## Quick Start
-- [Node.js](https://nodejs.org/) **>= 24.0.0**
-- [Git](https://git-scm.com/)
-- Either [Claude CLI](https://docs.anthropic.com/en/docs/claude-code)
-  or [GitHub Copilot CLI](https://docs.github.com/en/copilot/github-copilot-in-the-cli) installed and authenticated
+```bash
+npm install -g ralphctl
+```
-### 2-Minute Quick Start
+Requires [Node.js](https://nodejs.org/) >= 24, [Git](https://git-scm.com/), and
+either [Claude CLI](https://docs.anthropic.com/en/docs/claude-code)
+or [GitHub Copilot CLI](https://docs.github.com/en/copilot/github-copilot-in-the-cli) installed and authenticated.
 ```bash
 # 1. Register a project (points to your repo)
@@ -68,36 +89,65 @@ ralphctl sprint plan
 ralphctl sprint start
 ```
-Or just run `ralphctl` with no arguments for an interactive menu that walks you through everything.
+Or run `ralphctl` with no arguments for an interactive menu that walks you through everything.
 ---
-## Table of Contents
+## Features
-- [Features](#features)
-- [CLI Overview](#cli-overview)
-- [AI Provider Configuration](#ai-provider-configuration)
-- [Documentation](#documentation)
-- [Development](#development)
-- [Contributing](#contributing)
-- [License](#license)
+- **Break big tickets into small tasks** — dependency-ordered so they execute in the right sequence
+- **Catch mistakes before they compound** — independent AI review after each task, iterating until quality passes or
+  budget is exhausted
+- **Coordinate across repositories** — one sprint can span multiple repos with automatic dependency tracking
+- **Run tasks in parallel** — one per repo, with rate-limit backoff and automatic session resume
+- **Separate the what from the how** — AI clarifies requirements first, then generates implementation tasks, with human
+  approval gates
+- **Pick up where you left off** — full state persistence across sessions; interrupted work resumes automatically
+- **Pair or let it run** — work alongside your AI agent interactively, or let it execute unattended
+- **Zero-memorization start** — run `ralphctl` with no args for a guided menu
 ---
-## Features
+## Configuration
+RalphCTL supports **Claude Code** and **GitHub Copilot** as AI backends.
+```bash
+ralphctl config set provider claude      # Use Claude Code
+ralphctl config set provider copilot     # Use GitHub Copilot
+```
+Auto-prompts on first AI command if not set. Both CLIs must be in your PATH and authenticated.
+<details>
+<summary>Provider differences</summary>
+| Feature                     | Claude Code                          | GitHub Copilot                                                       |
+| --------------------------- | ------------------------------------ | -------------------------------------------------------------------- |
+| Status                      | GA                                   | Public preview                                                       |
+| Headless execution          | `-p --output-format json`            | `-p --output-format json --autopilot --no-ask-user`                  |
+| Session IDs                 | In JSON output (`session_id`)        | In JSON output (`session_id`), `--share` file as fallback            |
+| Session resume (`--resume`) | Full support                         | Full support                                                         |
+| Per-tool permissions        | Settings files + `--permission-mode` | `--allow-all-tools` (all-or-nothing by default)                      |
+| Fine-grained tool control   | `allow`/`deny` in settings files     | `--allow-tool`, `--deny-tool` flags (not yet used)                   |
+| Rate limit detection        | Validated patterns                   | Borrowed from Claude — not yet validated against real Copilot errors |
+</details>
+---
-- **Task decomposition** — breaks tickets into dependency-ordered tasks with topological sort
-- **Generator-evaluator loop** — independent AI review after each task; iterates until quality passes or budget exhausted
-- **Multi-repo orchestration** — coordinate changes across multiple repositories in a single run
-- **Parallel execution** — one task per repo at a time, with automatic rate limit backoff and session resume
-- **Two-phase planning** — clarify requirements first (what), then generate tasks (how), with a human approval gate
-- **Context persistence** — state survives across sessions; interrupted work resumes where it left off
-- **Interactive or headless** — pair with your AI agent in a session, or let it run unattended
-- **Menu mode** — run `ralphctl` with no arguments for an interactive menu
+## Data Directory
+All data lives in `~/.ralphctl/` by default. Override with:
+```bash
+export RALPHCTL_ROOT="/path/to/custom/data-dir"
+```
 ---
-## CLI Overview
+<details>
+<summary><strong>CLI Command Reference</strong></summary>
 ### Getting Started
@@ -135,7 +185,7 @@ Or just run `ralphctl` with no arguments for an interactive menu that walks you
 | ------------------------ | --------------------------------- |
 | `ralphctl sprint start`  | Execute tasks with AI             |
 | `ralphctl sprint health` | Diagnose blockers and stale tasks |
-| `ralphctl dashboard`     | Sprint overview with progress bar |
+| `ralphctl status`        | Sprint overview with progress bar |
 | `ralphctl task list`     | List tasks in the current sprint  |
 | `ralphctl task next`     | Show the next unblocked task      |
 | `ralphctl sprint close`  | Close an active sprint            |
@@ -143,54 +193,22 @@ Or just run `ralphctl` with no arguments for an interactive menu that walks you
 Run `ralphctl <command> --help` for details on any command.
----
-## AI Provider Configuration
-RalphCTL supports **Claude Code** and **GitHub Copilot** as AI backends. Both use the same prompt templates and
-workflow.
-```bash
-ralphctl config set provider claude      # Use Claude Code
-ralphctl config set provider copilot     # Use GitHub Copilot
-```
-Auto-prompts on first AI command if not set. Both CLIs must be in your PATH and authenticated.
-### Provider Differences
-| Feature                     | Claude Code                          | GitHub Copilot                                                       |
-| --------------------------- | ------------------------------------ | -------------------------------------------------------------------- |
-| Status                      | GA                                   | Public preview                                                       |
-| Headless execution          | `-p --output-format json`            | `-p --output-format json --autopilot --no-ask-user`                  |
-| Session IDs                 | In JSON output (`session_id`)        | In JSON output (`session_id`), `--share` file as fallback            |
-| Session resume (`--resume`) | Full support                         | Full support                                                         |
-| Per-tool permissions        | Settings files + `--permission-mode` | `--allow-all-tools` (all-or-nothing by default)                      |
-| Fine-grained tool control   | `allow`/`deny` in settings files     | `--allow-tool`, `--deny-tool` flags (not yet used)                   |
-| Rate limit detection        | Validated patterns                   | Borrowed from Claude — not yet validated against real Copilot errors |
+</details>
 ---
 ## Documentation
-| Document                                                    | Description                                    |
-| ----------------------------------------------------------- | ---------------------------------------------- |
-| [REQUIREMENTS.md](./.claude/docs/REQUIREMENTS.md)           | Acceptance criteria and feature requirements   |
-| [ARCHITECTURE.md](./.claude/docs/ARCHITECTURE.md)           | Data models, file storage, and error reference |
-| [CLAUDE.md](./CLAUDE.md)                                    | Developer guide and Claude Code project config |
-| [CONTRIBUTING.md](./CONTRIBUTING.md)                        | How to contribute                              |
-| [CHANGELOG.md](./CHANGELOG.md)                              | Version history                                |
-| [Blog post](https://lukasgrigis.dev/blog/building-ralphctl) | Background and motivation                      |
+| Resource                                       | Description                                |
+| ---------------------------------------------- | ------------------------------------------ |
+| [Architecture](./.claude/docs/ARCHITECTURE.md) | Data models, file storage, error reference |
+| [Requirements](./.claude/docs/REQUIREMENTS.md) | Acceptance criteria and feature checklist  |
+| [Contributing](./CONTRIBUTING.md)              | Dev setup, code style, PR process          |
+| [Changelog](./CHANGELOG.md)                    | Version history                            |
----
-## Data Directory
-RalphCTL stores all data in `~/.ralphctl/` by default. Override with `RALPHCTL_ROOT`:
-```bash
-export RALPHCTL_ROOT="/path/to/custom/data-dir"
-```
+**Blog posts:** [Building ralphctl](https://lukasgrigis.dev/blog/building-ralphctl) (
+backstory) | [From task CLI to agent harness](https://lukasgrigis.dev/blog/ralphctl-agent-harness/) (evaluator
+deep-dive)
 ---

package/dist/{chunk-AXNZMHFQ.mjs → chunk-XXIHDQOH.mjs} RENAMED Viewed

@@ -2680,11 +2680,15 @@ async function runEvaluationLoop(params) {
       {
         cwd: task.projectPath,
         args: ["--add-dir", sprintDir, ...buildProviderArgs(options, provider)],
-        prompt: `The evaluator found issues with your work:
+        prompt: `The evaluator found issues with your implementation:
 ${evalResult.output}
-Fix these issues, then verify${options.noCommit ? "" : ", commit your fix,"} and signal completion.`,
+Review the critique carefully. Fix each identified issue in the code, then:
+1. Re-run verification commands to confirm the fix
+${options.noCommit ? "" : "2. Commit the fix with a descriptive message\n"}${options.noCommit ? "2" : "3"}. Signal completion with <task-verified> and <task-complete>
+If the critique is about something outside your task scope, fix only what is within scope and signal completion.`,
         resumeSessionId: result.sessionId ?? void 0,
         env: provider.getSpawnEnv()
       },

package/dist/cli.mjs CHANGED Viewed

@@ -52,7 +52,7 @@ import {
   sprintStartCommand,
   updateTaskStatus,
   validateImportTasks
-} from "./chunk-AXNZMHFQ.mjs";
+} from "./chunk-XXIHDQOH.mjs";
 import {
   escapableSelect
 } from "./chunk-7LZ6GOGN.mjs";
@@ -3763,7 +3763,7 @@ async function interactiveMode() {
       continue;
     }
     if (command === "wizard") {
-      const { runWizard } = await import("./wizard-TFJXEYD2.mjs");
+      const { runWizard } = await import("./wizard-D7N5WZ5H.mjs");
       await runWizard();
       continue;
     }
@@ -4234,7 +4234,7 @@ Checks performed:
 // package.json
 var package_default = {
   name: "ralphctl",
-  version: "0.2.1",
+  version: "0.2.2",
   description: "Agent harness for long-running AI coding tasks \u2014 orchestrates Claude Code & GitHub Copilot across repositories",
   homepage: "https://github.com/lukas-grigis/ralphctl",
   type: "module",

package/dist/prompts/ideate-auto.md CHANGED Viewed

@@ -58,6 +58,12 @@ Explore the selected repositories and produce implementation tasks:
 3. **Create tasks** — Following the Planning Common Context guidelines below
 4. **Validate** — Ensure tasks are non-overlapping, properly ordered, and completable
+### Blocker Handling
+If you cannot produce a valid plan, signal the issue instead of outputting incomplete JSON:
+- `<planning-blocked>reason</planning-blocked>`
 ## Idea to Implement
 **Title:** {{IDEA_TITLE}}

package/dist/prompts/plan-auto.md CHANGED Viewed

@@ -79,11 +79,11 @@ Before outputting JSON, verify EVERY item on this checklist:
    instructions
 7. **projectPath assigned** — Every task has a `projectPath` from the project's repository paths
 8. **Clear done state** — For each task, the question "how do I know this is done?" has an obvious answer
-9. **Valid JSON** — The output parses as a JSON array of task objects matching the schema
+9. **Valid JSON** — The output parses as valid JSON matching the schema
 ## Output
-**IMPORTANT:** Output ONLY a valid JSON array. No markdown, no explanation, no commentary — just the JSON.
+**IMPORTANT:** Output ONLY valid JSON matching the schema below. No markdown, no explanation, no commentary — just the JSON.
 If you cannot produce tasks, output a `<planning-blocked>` signal instead.
 JSON Schema:

package/dist/prompts/task-evaluation.md CHANGED Viewed

@@ -20,10 +20,11 @@ You are working in this project directory:
 ### Investigation Steps
 1. Run `git log --oneline -10` to identify the commits from this task, then run `git diff <base>..HEAD` for the full range of changes (tasks may produce multiple commits — do not assume a single commit)
-2. Read the changed files carefully to understand the full implementation context
-3. Look at surrounding code to understand patterns and conventions
-4. Compare the actual changes against the task specification above
-5. Identify any issues:
+2. Run `git status` to check for uncommitted changes — uncommitted work may indicate the task is incomplete
+3. Read the changed files carefully to understand the full implementation context
+4. Look at surrounding code to understand patterns and conventions
+5. Compare the actual changes against the task specification above
+6. Identify any issues:
    - **Spec drift** — changes that go beyond or fall short of what was specified
    - **Missing edge cases** — error paths, boundary conditions, empty states
    - **Unnecessary changes** — modifications unrelated to the task
@@ -33,6 +34,11 @@ You are working in this project directory:
 Do NOT suggest improvements or refactoring beyond the task scope.
 Only evaluate what was asked vs what was delivered.
+### Pass Bar
+Pass the implementation if it satisfies the task specification without correctness or security issues.
+Do not fail for style preferences, naming opinions, or improvements beyond the task scope.
 {{CHECK_SCRIPT_SECTION}}
 ## Output

package/dist/prompts/task-execution.md CHANGED Viewed

@@ -35,9 +35,10 @@ Perform these checks IN ORDER before writing any code:
    discovered, and gotchas encountered. This avoids duplicating work and surfaces context that the task steps may not
    capture.
 3. **Check git state** — Run `git status` to check for uncommitted changes
-4. **Check environment** — Look at the "Check Script" and "Environment Status" sections in your context file. If a check
-   script is configured, the harness ran it at sprint start. If not configured, run the project's verification commands
-   yourself (check CLAUDE.md, .github/copilot-instructions.md, or project config). If ANY check fails, STOP:
+4. **Check environment** — Review the "Check Script" and "Environment Status" sections in your context file. If a check
+   script is configured, the harness already verified the environment — review those results rather than re-running.
+   If no check script is configured AND no environment status is recorded, run the project's verification commands
+   yourself (check CLAUDE.md, .github/copilot-instructions.md, or project config). If ANY check shows failure, STOP:
    ```
    <task-blocked>Pre-existing failure: [details of what failed and the output]</task-blocked>
    ```
@@ -101,7 +102,7 @@ Complete these steps IN ORDER:
    - Created src/schemas/date-range.ts with DateRangeSchema (Zod + .openapi())
    - Modified src/controllers/export.ts to accept optional `startDate`/`endDate` query params
-   - Added tests in src/schemas/**tests**/date-range.test.ts
+   - Added tests in `src/schemas/__tests__/date-range.test.ts`
    ### Learnings and Context

package/dist/prompts/ticket-refine.md CHANGED Viewed

@@ -105,7 +105,7 @@ approval. Iterate until approved.
 Before writing to file, verify ALL of these are true:
 - [ ] Problem statement is clear and agreed upon
-- [ ] Every requirement has 2+ acceptance criteria with multiple scenarios each (happy path + edge case minimum)
+- [ ] Every requirement has acceptance criteria covering key scenarios (happy path + edge/error cases at minimum)
 - [ ] Scope boundaries are explicit (what's in AND what's out)
 - [ ] Edge cases and error states are addressed
 - [ ] No implementation details leaked into requirements

package/dist/{wizard-TFJXEYD2.mjs → wizard-D7N5WZ5H.mjs} RENAMED Viewed

@@ -3,7 +3,7 @@ import {
   sprintPlanCommand,
   sprintRefineCommand,
   sprintStartCommand
-} from "./chunk-AXNZMHFQ.mjs";
+} from "./chunk-XXIHDQOH.mjs";
 import "./chunk-7LZ6GOGN.mjs";
 import {
   sprintCreateCommand

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "ralphctl",
-  "version": "0.2.1",
+  "version": "0.2.2",
   "description": "Agent harness for long-running AI coding tasks — orchestrates Claude Code & GitHub Copilot across repositories",
   "homepage": "https://github.com/lukas-grigis/ralphctl",
   "type": "module",