npm - @caseyharalson/orrery - Versions diffs - 0.7.1 - Mend

@caseyharalson/orrery 0.7.1

Files changed (40)

package/.devcontainer.example/Dockerfile +149 -0
package/.devcontainer.example/devcontainer.json +61 -0
package/.devcontainer.example/init-firewall.sh +175 -0
package/LICENSE +21 -0
package/README.md +139 -0
package/agent/skills/discovery/SKILL.md +428 -0
package/agent/skills/discovery/schemas/plan-schema.yaml +138 -0
package/agent/skills/orrery-execute/SKILL.md +107 -0
package/agent/skills/orrery-report/SKILL.md +119 -0
package/agent/skills/orrery-review/SKILL.md +105 -0
package/agent/skills/orrery-verify/SKILL.md +105 -0
package/agent/skills/refine-plan/SKILL.md +291 -0
package/agent/skills/simulate-plan/SKILL.md +244 -0
package/bin/orrery.js +5 -0
package/lib/cli/commands/help.js +21 -0
package/lib/cli/commands/ingest-plan.js +56 -0
package/lib/cli/commands/init.js +21 -0
package/lib/cli/commands/install-devcontainer.js +97 -0
package/lib/cli/commands/install-skills.js +182 -0
package/lib/cli/commands/orchestrate.js +27 -0
package/lib/cli/commands/resume.js +146 -0
package/lib/cli/commands/status.js +137 -0
package/lib/cli/commands/validate-plan.js +288 -0
package/lib/cli/index.js +57 -0
package/lib/orchestration/agent-invoker.js +595 -0
package/lib/orchestration/condensed-plan.js +128 -0
package/lib/orchestration/config.js +213 -0
package/lib/orchestration/dependency-resolver.js +149 -0
package/lib/orchestration/edit-invoker.js +115 -0
package/lib/orchestration/index.js +1065 -0
package/lib/orchestration/plan-loader.js +212 -0
package/lib/orchestration/progress-tracker.js +208 -0
package/lib/orchestration/report-format.js +80 -0
package/lib/orchestration/review-invoker.js +305 -0
package/lib/utils/agent-detector.js +47 -0
package/lib/utils/git.js +297 -0
package/lib/utils/paths.js +43 -0
package/lib/utils/plan-detect.js +24 -0
package/lib/utils/skill-copier.js +79 -0
package/package.json +58 -0

package/agent/skills/discovery/SKILL.md ADDED

@@ -0,0 +1,428 @@
+---
+name: discovery
+description: >
+  Create executable plans for building systems, features, or modules. Use for
+  planning requests, architectural decisions, or when decomposing big ideas
+  into concrete implementation steps.
+hooks:
+  PostToolUse:
+    - matcher: "Write"
+      hooks:
+        - type: command
+          command: "orrery validate-plan"
+---
+# Discovery Skill
+## When to Use
+Use this skill for **all planning requests**, regardless of size. Discovery transforms ideas into concrete, executable plans.
+**Triggers:**
+- "Build a [system/platform/module]"
+- Request spans multiple features or domains
+- Unclear what "done" looks like
+- Requires architectural decisions before implementation
+**Also use Discovery when:**
+- Request is a single, scoped feature
+- Outcomes and scope are already clear
+- You can articulate the task in 1-3 sentences
+---
+## The Decomposition Ladder
+Discovery works top-down through five levels:
+```
+IDEA        "We need better analytics"
+   ↓
+OUTCOMES    "Users can see trends; Admins can export reports"
+   ↓
+CAPABILITIES "Trend visualization; Data aggregation; Export service"
+   ↓
+FEATURES    "Line chart component; Daily rollup job; CSV export endpoint"
+   ↓
+STEPS       "1.1 Create chart component; 1.2 Add API route; 1.3 Wire up data"
+```
+Each level must be concrete enough that the next level can be derived.
+The final output is an **orchestrator-ready plan** with implementation steps.
+---
+## How to Do It
+### Step 1: Capture the Idea
+Get the raw vision from the user. Don't judge scope yet.
+- What problem are we solving?
+- Who is this for?
+- What sparked this idea?
+### Step 2: Define Outcomes (the "why")
+Outcomes are **user-visible results**, not technical deliverables.
+Ask:
+- "What will users be able to do that they can't today?"
+- "How will we know this succeeded?"
+- "What does 'done' look like from the user's perspective?"
+**Output:** 2-5 concrete outcome statements.
+Example:
+- BAD: "Implement caching" (a technical deliverable, not an outcome)
+- GOOD: "Dashboard loads in under 2 seconds" (a user-visible result)
+### Step 3: Map Capabilities (the "what")
+Capabilities are **system abilities** that enable outcomes.
+For each outcome, ask:
+- "What must the system be able to do to deliver this?"
+- "What new behaviors or services are needed?"
+**Output:** Capabilities grouped by outcome.
+Example:
+- Outcome: "Dashboard loads in under 2 seconds"
+  - Capability: Query result caching
+  - Capability: Incremental data loading
+  - Capability: Pre-computed aggregations
+### Step 4: Decompose into Features (the "how")
+Features are **implementable units of work** that deliver capabilities.
+For each capability, ask:
+- "What specific features implement this?"
+- "Can this be shipped independently?"
+- "What's the minimum viable version?"
+**Output:** Feature list with clear boundaries.
+### Step 5: Gather Context per Feature
+This is critical for "fire and forget" plans. For each feature:
+1. **Search the codebase** - find related files, patterns, dependencies
+2. **Identify constraints** - what must this integrate with?
+3. **Define acceptance criteria** - specific, testable conditions
+4. **Note risks** - what could go wrong?
+5. **List input files** - what must an agent read to understand context?
+### Step 6: Decompose Features into Implementation Steps
+Each feature becomes 2-5 concrete implementation steps. This is what makes the plan orchestrator-ready.
+For each feature, ask:
+- "What are the individual pieces of work?"
+- "What order must they happen in?"
+- "Can any run in parallel?"
+**Step characteristics:**
+- **Scoped** - completable in a single focused session
+- **Specific files** - lists exactly which files to create/modify
+- **Testable** - has clear acceptance criteria
+- **Self-contained** - an agent can execute without asking questions
+**Naming convention:**
+- Use `{feature-number}.{step-number}` format: `1.1`, `1.2`, `2.1`, etc.
+- Group related steps by feature for readability
+**Example decomposition:**
+```
+Feature: "Line chart component"
+  → Step 1.1: Create TrendChart.tsx with basic Chart.js setup
+  → Step 1.2: Add time range toggle (7d/30d/90d)
+  → Step 1.3: Connect to trends API endpoint
+  → Step 1.4: Add loading state and error handling
+```
+#### Dependency Installation Steps
+If a plan requires installing dependencies, handle them explicitly:
+- **Create a dedicated install step** (e.g., "0.1 Install project dependencies") as the first step in the plan (after project creation/structuring if relevant)
+- **All subsequent steps MUST list the install step in their `deps` array** - otherwise parallel execution or out-of-order execution will fail because dependencies aren't available
+- **Include test dependencies if acceptance criteria involve testing** - use `npm install --include=dev`, `pip install -e ".[test]"`, or the equivalent for the project's package manager
+Example:
+```
+Step 0.1: Install project dependencies (including test frameworks)
+  → deps: []
+Step 1.1: Create TrendChart component
+  → deps: ["0.1"]  ← Must reference the install step!
+Step 1.2: Add unit tests for TrendChart
+  → deps: ["0.1", "1.1"]  ← Also depends on install step
+```
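The rule that every later step lists the install step can be checked mechanically, and the `deps` arrays also determine a valid execution order. The following is a minimal Python sketch (3.9+ for `graphlib`), not orrery's own dependency resolver. It assumes the plan's `steps` have already been parsed into dictionaries, it ignores the `parallel` flag, and the helper names `missing_install_dep` and `execution_batches` are hypothetical.

```python
from graphlib import TopologicalSorter
from typing import Dict, List


def missing_install_dep(steps: List[Dict], install_id: str = "0.1") -> List[str]:
    """Return ids of steps that do not list the install step in `deps`."""
    return [
        step["id"]
        for step in steps
        if step["id"] != install_id and install_id not in step.get("deps", [])
    ]


def execution_batches(steps: List[Dict]) -> List[List[str]]:
    """Group step ids into batches; steps within a batch have no mutual deps."""
    sorter = TopologicalSorter({s["id"]: s.get("deps", []) for s in steps})
    sorter.prepare()  # raises graphlib.CycleError on circular deps
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        batches.append(ready)
        sorter.done(*ready)
    return batches


# Illustrative data only; mirrors the shape of the plan's `steps` array.
steps = [
    {"id": "0.1", "deps": []},
    {"id": "1.1", "deps": ["0.1"]},
    {"id": "1.2", "deps": ["0.1", "1.1"]},
    {"id": "2.1", "deps": ["1.1"]},  # install step omitted
]

print(missing_install_dep(steps))  # ['2.1']
print(execution_batches(steps))    # [['0.1'], ['1.1'], ['1.2', '2.1']]
```

Step `2.1` still runs after `0.1` here because the dependency is transitive, but it breaks the explicit-listing rule, which is what the first helper reports.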
+### Step 7: Validate with User
+Before producing the plan:
+- Present the step breakdown (not just features)
+- Confirm priorities and ordering
+- Resolve any remaining ambiguities
+- Get explicit sign-off that this captures the intent
+### Step 8: Output the Plan
+Generate an orchestrator-ready plan file with implementation steps.
+Each step must be **self-contained** - an agent should be able to execute
+it without asking questions.
+Use the schema defined in `./schemas/plan-schema.yaml`.
+**Output Location:**
+- Directory: `.agent-work/plans/`
+- Filename: `<date>-<plan-name>.yaml`
+- Date format: YYYY-MM-DD (e.g., `2026-01-11`)
+- Plan name: kebab-case description of the task (e.g., `fix-clone-agent-skills`)
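The naming convention above can be sketched as a small Python helper. This is an illustration under assumptions: `plan_path` is a hypothetical name, and the kebab-case rule used here (lowercase, runs of non-alphanumeric characters collapsed to one hyphen) is one reasonable reading of the convention rather than something orrery specifies.

```python
import re
from datetime import date
from pathlib import PurePosixPath


def plan_path(description: str, on: date) -> str:
    """Build `.agent-work/plans/<date>-<plan-name>.yaml` from a description."""
    # kebab-case: lowercase, then collapse non-alphanumeric runs to one hyphen
    name = re.sub(r"[^a-z0-9]+", "-", description.lower()).strip("-")
    filename = f"{on.isoformat()}-{name}.yaml"  # isoformat() yields YYYY-MM-DD
    return str(PurePosixPath(".agent-work/plans") / filename)


print(plan_path("Fix clone agent skills", date(2026, 1, 11)))
# .agent-work/plans/2026-01-11-fix-clone-agent-skills.yaml
```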
+**Document Project Context in Notes:**
+Use the `notes` field in metadata to capture project-specific context that agents need throughout execution. This is critical because when plans are broken into substeps, agents executing individual steps might not know project conventions.
+Include in notes:
+- **Testing commands**: How to run tests (e.g., `uv run pytest`, `npm test`, `make test`). Check the README, `pyproject.toml`, `package.json`, or similar files to determine the package manager, test commands, and project conventions.
+- **Build commands**: How to build the project (e.g., `uv run build`, `npm run build`)
+- **Linting/formatting**: How to check code quality (e.g., `uv run ruff check .`)
+- **Environment setup**: Required environment variables, virtual environments, etc.
+- **Project conventions**: Naming conventions, file organization, coding standards
+- **Tool-specific notes**: e.g., "Always prefix Python commands with 'uv run'"
+### Validate the Plan
+Plans are automatically validated via the PostToolUse hook when written.
+For manual validation, run:
+```bash
+orrery validate-plan .agent-work/plans/<plan>.yaml
+```
+This catches common YAML issues like unquoted colons and normalizes formatting.
+### YAML Formatting Rules
+- Always quote strings containing special characters (colons, brackets, etc.)
+- BAD: `criteria: Output shows: timestamp value`
+- GOOD: `criteria: "Output shows: timestamp value"`
+- Common gotchas: colons followed by space, special character prefixes, multi-line strings
+- Rule: When in doubt, use double quotes around the entire value
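The quoting rules above can be approximated in code. The sketch below is a heuristic in Python, not a YAML parser and not what `orrery validate-plan` does. The names `needs_quotes` and `yaml_scalar` are hypothetical, and the check does not cover values such as `true` or `123` that YAML would read as non-strings. It relies on the fact that a JSON string literal is also a valid YAML double-quoted scalar.

```python
import json

# Characters that give a plain (unquoted) YAML scalar special meaning when
# they appear first. A heuristic list for illustration only.
_SPECIAL_PREFIXES = tuple("[]{}&*!|>'\"%@`#,")


def needs_quotes(value: str) -> bool:
    """Heuristically decide whether a YAML string value should be quoted."""
    return (
        value == ""
        or value != value.strip()           # leading or trailing whitespace
        or ": " in value                    # colon followed by a space
        or value.endswith(":")
        or " #" in value                    # would start a comment
        or value.startswith(_SPECIAL_PREFIXES)
        or value.startswith(("- ", "? "))   # sequence or mapping-key indicators
    )


def yaml_scalar(value: str) -> str:
    """Return the value double-quoted when the heuristic flags it."""
    return json.dumps(value) if needs_quotes(value) else value


print("criteria: " + yaml_scalar("Output shows: timestamp value"))
# criteria: "Output shows: timestamp value"
print("criteria: " + yaml_scalar("All dependencies install without errors"))
# criteria: All dependencies install without errors
```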
+---
+## Output Format
+```yaml
+# plan.yaml
+metadata:
+  created_at: "2026-01-11T10:00:00Z"
+  created_by: "Discovery-Agent"
+  version: "1.0"
+  source_idea: "We need better analytics"
+  outcomes:
+    - "Users can see usage trends over time"
+    - "Admins can export reports for stakeholders"
+  notes: |
+    This project uses uv for dependency management.
+    - Run tests: uv run pytest
+    - Run linting: uv run ruff check .
+    Always prefix Python commands with 'uv run'.
+steps:
+  # ============================================================================
+  # Step 0: Setup (Dependencies)
+  # ============================================================================
+  - id: "0.1"
+    description: "Install project dependencies including test frameworks"
+    status: "pending"
+    deps: []
+    parallel: false
+    context: |
+      Install all dependencies required for the project. Include dev/test
+      dependencies since acceptance criteria involve running tests.
+    requirements:
+      - "Run npm install (or equivalent for the project)"
+      - "Include dev dependencies for testing"
+    criteria:
+      - "All dependencies install without errors"
+      - "Test framework is available (e.g., jest, pytest)"
+    files: []
+    commands:
+      - "npm install"
+  # ============================================================================
+  # Feature 1: Trends API Endpoint
+  # ============================================================================
+  - id: "1.1"
+    description: "Create trends service with data aggregation logic"
+    status: "pending"
+    deps: ["0.1"]
+    parallel: false
+    context: |
+      Stats are currently computed on-demand in statsService.ts. This step
+      creates a new service that aggregates historical data into time-series
+      format for the trends endpoint.
+    requirements:
+      - "Create src/api/services/trendsService.ts"
+      - "Function: getTrends(range: '7d' | '30d' | '90d')"
+      - "Returns { dates: string[], values: number[] }"
+      - "Query existing stats table, group by date"
+    criteria:
+      - "Service exports getTrends function"
+      - "Returns correctly shaped data for all range values"
+      - "Unit test with mocked data passes"
+    files:
+      - "src/api/services/trendsService.ts"
+      - "src/api/services/trendsService.test.ts"
+    context_files:
+      - "src/api/services/statsService.ts"
+  - id: "1.2"
+    description: "Add trends API route with caching"
+    status: "pending"
+    deps: ["0.1", "1.1"]
+    parallel: false
+    context: |
+      Wire up the trends service to an HTTP endpoint. Use existing cache
+      middleware pattern from other routes.
+    requirements:
+      - "GET /api/stats/trends?range=7d|30d|90d"
+      - "Cache responses for 1 hour"
+      - "Validate range parameter"
+    criteria:
+      - "Endpoint returns 200 with valid JSON"
+      - "Invalid range returns 400"
+      - "Response time < 200ms (cached)"
+    files:
+      - "src/api/routes/stats.ts"
+    context_files:
+      - "src/api/middleware/cache.ts"
+  # ============================================================================
+  # Feature 2: Trend Visualization Component
+  # ============================================================================
+  - id: "2.1"
+    description: "Create base TrendChart component with Chart.js"
+    status: "pending"
+    deps: ["0.1", "1.2"]
+    parallel: false
+    context: |
+      Users currently see only current-day stats. This adds a line chart
+      using the existing Chart.js setup in src/components/charts/.
+    requirements:
+      - "Create TrendChart.tsx extending BaseChart"
+      - "Line chart with responsive sizing"
+      - "Accept data prop: { dates: string[], values: number[] }"
+    criteria:
+      - "Component renders with mock data"
+      - "Chart displays correctly at mobile and desktop widths"
+    files:
+      - "src/components/TrendChart.tsx"
+    context_files:
+      - "src/components/charts/BaseChart.tsx"
+      - "src/styles/charts.css"
+    risk_notes: "Chart.js bundle size - verify no significant increase"
+  - id: "2.2"
+    description: "Add time range toggle to TrendChart"
+    status: "pending"
+    deps: ["0.1", "2.1"]
+    parallel: false
+    context: |
+      Add toggle buttons for 7d/30d/90d. Selecting a range should trigger
+      a data refetch.
+    requirements:
+      - "Toggle buttons: 7 days, 30 days, 90 days"
+      - "Active state styling for selected range"
+      - "onChange callback when range changes"
+    criteria:
+      - "Toggle switches time range"
+      - "Visual indication of selected range"
+    files:
+      - "src/components/TrendChart.tsx"
+  - id: "2.3"
+    description: "Connect TrendChart to API and add loading states"
+    status: "pending"
+    deps: ["0.1", "2.2"]
+    parallel: false
+    context: |
+      Wire up the component to fetch from the trends API. Handle loading
+      and error states gracefully.
+    requirements:
+      - "Fetch from GET /api/stats/trends on mount and range change"
+      - "Loading skeleton while fetching"
+      - "Error state if API fails"
+    criteria:
+      - "Data loads from API on initial render"
+      - "Range change triggers new API call"
+      - "Loading skeleton appears during fetch"
+      - "Error message displays on API failure"
+    files:
+      - "src/components/TrendChart.tsx"
+      - "src/components/TrendChart.test.tsx"
+```
+---
+## When Discovery is Complete
+Discovery is complete when:
+- [ ] All outcomes are defined and user-validated
+- [ ] Each outcome maps to concrete capabilities
+- [ ] Each capability has implementable features
+- [ ] Each feature is decomposed into implementation steps
+- [ ] Each step has sufficient context for autonomous execution
+- [ ] The plan file passes schema validation
+- [ ] User has approved the plan
+### Final Output
+When the plan is complete and validated, output the plan file path and present the user with their next options:
+```
+Plan created: .agent-work/plans/<date>-<plan-name>.yaml
+Next steps:
+- /refine-plan .agent-work/plans/<plan-file> — Analyze and improve the plan before execution
+- /simulate-plan .agent-work/plans/<plan-file> — Explore the plan through dialogue, ask "what if" questions
+- orrery exec — (Command run from the terminal) Execute the plan with the orrery orchestrator
+```
+This ensures the user knows exactly what they can do next and has the full path to reference the plan.
+---
+## Common Pitfalls
+- **Skipping outcome definition:** Jumping to features without knowing "why" leads to building the wrong thing
+- **Thin context:** A description like "Add caching" isn't enough. Include what, where, why, constraints.
+- **Implicit dependencies:** If feature B needs feature A's output, say so explicitly
+- **No user validation:** Don't assume you understood the idea correctly. Confirm the decomposition.
+- **Premature detail:** Don't write implementation code during Discovery. Just define what to build.
+- **Missing install-step dependencies:** If step "0.1" installs dependencies, all steps that use those dependencies must include "0.1" in their deps array. Otherwise, parallel execution or out-of-order execution will fail.
+- **Forgetting test dependencies:** If acceptance criteria include running tests, ensure the dependency installation step includes dev/test dependencies (e.g., `npm install --include=dev`, `pip install -e ".[test]"`).

package/agent/skills/discovery/schemas/plan-schema.yaml ADDED

@@ -0,0 +1,138 @@
+$schema: http://json-schema.org/draft-07/schema#
+type: object
+title: Plan Schema
+description: >
+  Extended plan schema for planning output. Includes additional fields for
+  rich context, enabling agents to execute steps autonomously without further
+  user input.
+properties:
+  metadata:
+    type: object
+    description: Plan metadata including creation info and high-level context
+    properties:
+      created_at:
+        type: string
+        format: date-time
+      created_by:
+        type: string
+      version:
+        type: string
+      source_idea:
+        type: string
+        description: The original idea or request that triggered discovery
+      outcomes:
+        type: array
+        description: User-visible results this plan delivers
+        items:
+          type: string
+      notes:
+        type: string
+        description: >
+          General notes for agents executing this plan. Include testing commands,
+          environment setup, project conventions, or any context that applies
+          across all steps.
+    required:
+      - created_at
+      - created_by
+      - outcomes
+  steps:
+    type: array
+    description: Array of plan steps defining the work to be done
+    items:
+      $ref: "#/definitions/Step"
+required:
+  - metadata
+  - steps
+definitions:
+  Step:
+    type: object
+    description: Individual step in the plan, representing a feature or work unit
+    required:
+      - id
+      - description
+      - context
+      - requirements
+      - criteria
+    properties:
+      id:
+        type: string
+        description: Unique identifier for the step
+      description:
+        type: string
+        description: Concise summary of what this step accomplishes
+      status:
+        type: string
+        enum:
+          - pending
+          - in_progress
+          - complete
+          - blocked
+        default: pending
+        description: Current status of the step
+      deps:
+        type: array
+        description: List of step IDs this step depends on
+        items:
+          type: string
+        default: []
+      parallel:
+        type: boolean
+        description: Whether this step can run in parallel with others
+        default: false
+      context:
+        type: string
+        description: >
+          Background information needed to execute this step. Should include
+          why this step exists, how it fits into the larger picture, and any
+          relevant technical context an agent needs to understand before starting.
+      requirements:
+        type: array
+        description: Specific requirements for this step
+        items:
+          type: string
+        minItems: 1
+      criteria:
+        type: array
+        description: Acceptance criteria - specific, testable conditions for completion
+        items:
+          type: string
+        minItems: 1
+      files:
+        type: array
+        description: Files this step will create or modify
+        items:
+          type: string
+        default: []
+      context_files:
+        type: array
+        description: >
+          Files the agent should read for context before starting. These are
+          not modified, but provide patterns, interfaces, or background needed
+          to complete the step.
+        items:
+          type: string
+        default: []
+      commands:
+        type: array
+        description: Specific commands to execute (build, test, etc.)
+        items:
+          type: string
+        default: []
+      risk_notes:
+        type: string
+        description: Warnings, edge cases, or things to watch out for
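
The required fields, `minItems` constraints, and `status` enum in this schema can be illustrated with a stdlib-only check. This is a partial sketch in Python, not the logic of `orrery validate-plan`, and a complete check would use a JSON Schema validator. It assumes the plan has already been parsed into a dictionary, and `check_plan` is a hypothetical name.

```python
from typing import Dict, List

# Field names and enum values copied from the schema above.
REQUIRED_METADATA = ("created_at", "created_by", "outcomes")
REQUIRED_STEP = ("id", "description", "context", "requirements", "criteria")
STATUSES = {"pending", "in_progress", "complete", "blocked"}


def check_plan(plan: Dict) -> List[str]:
    """Return human-readable problems; an empty list means none were found."""
    errors = []
    metadata = plan.get("metadata")
    if metadata is None:
        errors.append("missing top-level field: metadata")
    else:
        errors += [f"metadata missing: {k}" for k in REQUIRED_METADATA if k not in metadata]
    if "steps" not in plan:
        errors.append("missing top-level field: steps")
    for index, step in enumerate(plan.get("steps", [])):
        label = step.get("id", f"#{index}")
        errors += [f"step {label} missing: {k}" for k in REQUIRED_STEP if k not in step]
        for key in ("requirements", "criteria"):  # both declare minItems: 1
            if key in step and len(step[key]) < 1:
                errors.append(f"step {label}: {key} needs at least one item")
        status = step.get("status", "pending")  # schema default is pending
        if status not in STATUSES:
            errors.append(f"step {label}: invalid status {status!r}")
    return errors


# Illustrative plan with three deliberate problems.
plan = {
    "metadata": {"created_at": "2026-01-11T10:00:00Z", "created_by": "Discovery-Agent"},
    "steps": [
        {
            "id": "0.1",
            "description": "Install project dependencies",
            "context": "Setup step",
            "requirements": [],
            "criteria": ["All dependencies install without errors"],
            "status": "done",
        }
    ],
}
for problem in check_plan(plan):
    print(problem)
# metadata missing: outcomes
# step 0.1: requirements needs at least one item
# step 0.1: invalid status 'done'
```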

package/agent/skills/orrery-execute/SKILL.md ADDED

@@ -0,0 +1,107 @@
+---
+name: orrery-execute
+description: >
+  Write or modify code according to a plan step. Handle implementation.
+user-invocable: false
+---
+# Execute Skill
+## When to Use
+Use this skill to **implement code changes** defined in a plan step.
+**Triggers:**
+- You are invoked by the Orchestrator to work on specific `stepIds`.
+- A plan exists with pending steps.
+**Prerequisites:**
+- Plan exists with clear step descriptions.
+- You understand what needs to be built.
+---
+## How to Do It
+### Repository Guidelines
+Before implementing, check for project-specific guidelines:
+1. **Check for guideline files** - Look for project guideline files at the repo root (e.g., `CLAUDE.md`, `AGENTS.md`, `COPILOT.md`, or similar). Read and follow their working agreements (formatting commands, validation steps, changelog requirements, etc.).
+2. **Check Plan Notes** - Read the plan's `metadata.notes` field for project-specific commands and conventions (testing commands, build commands, tool-specific notes).
+3. **Check CONTRIBUTING.md** - If present, follow commit message conventions and coding standards.
+These guidelines override generic examples in this skill. For example, if a guideline file says "run `npm run fix` after changes", do that instead of generic lint commands.
+### Git State
+The orchestrator modifies `.agent-work/` files before you start (marking steps `in_progress`, creating temp files). This is expected. **Ignore changes in `.agent-work/`** when checking git status - these are orchestrator bookkeeping files, not unexpected changes.
+### Step 1: Read the Plan
+Read the plan file provided in your instructions.
+- Identify the steps you are assigned to (via `stepIds`).
+- Read the `description`, `criteria`, `files`, and `risk_notes` for those steps.
+- **Do not edit the plan file.**
+### Step 2: Implement the Change
+Write the code:
+- Follow project conventions and patterns.
+- Keep changes focused on the step's scope.
+- Write clean, readable code.
+- **Do not** add comments to the plan file.
+### Step 3: Initial Check
+Before handing off:
+- **Compile/Build:** Ensure no syntax errors.
+- **Smoke Test:** Does it run?
+### Step 4: Handoff to Verify
+Once implementation is complete, invoke the `orrery-verify` skill using the Skill tool.
+**Important:** Do NOT commit your changes. The orchestrator handles all commits after receiving your report.
+---
+## Example
+**Plan Step:**
+```yaml
+- id: "2"
+  description: "Implement backend API endpoint for CSV upload"
+```
+**Execution:**
+1. **Read** the plan to understand Step 2.
+2. **Implement** `src/api/routes/upload.ts`.
+3. **Run** `npm run build` -> Passes.
+4. **Invoke** the `orrery-verify` skill using the Skill tool.
+---
+## Error Handling
+### When Code Doesn't Work
+1. **Read error messages.**
+2. **Fix specific issues.**
+3. **If stuck:** You may mark the step as blocked by invoking the `orrery-report` skill directly using the Skill tool with a "Blocked" status (see the `orrery-report` skill for details).
+### Rollback Strategy
+If a change breaks things badly and you cannot fix it:
+1. `git stash` or `git checkout` to revert.
+2. Invoke the `orrery-report` skill using the Skill tool to report the blockage.