forge-orkes 0.3.4 → 0.3.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json
CHANGED
|
@@ -42,22 +42,63 @@ After 3 auto-fix attempts on a single task → STOP fixing. Document remaining i
|
|
|
42
42
|
### Scope Boundary
|
|
43
43
|
Only fix issues DIRECTLY caused by the current task. Pre-existing warnings, tech debt, unrelated bugs → log to `.forge/deferred-issues.md`, don't fix.
|
|
44
44
|
|
|
45
|
+
## Native Task Tracking (Hybrid Approach)
|
|
46
|
+
|
|
47
|
+
Use Claude Code's native task tools (`TaskCreate`, `TaskUpdate`, `TaskList`) for **in-session visibility** during execution. The `.forge/state/milestone-{id}.yml` remains the **cross-session source of truth** — native tasks are a UI layer on top.
|
|
48
|
+
|
|
49
|
+
### On Plan Start
|
|
50
|
+
|
|
51
|
+
After loading the plan file, create native tasks from the XML task blocks:
|
|
52
|
+
|
|
53
|
+
1. Parse all `<task>` XML blocks from the plan
|
|
54
|
+
2. For each task, call `TaskCreate`:
|
|
55
|
+
- `subject`: The task's `<name>` value
|
|
56
|
+
- `description`: Combine `<action>`, `<verify>`, and `<done>` fields
|
|
57
|
+
- `activeForm`: Present-continuous form of the task name (e.g., "Implementing login form")
|
|
58
|
+
3. If the plan has ordered tasks (task 2 depends on task 1), set dependencies via `TaskUpdate` with `addBlockedBy`
|
|
59
|
+
4. Store the mapping: native task ID → plan task number (track mentally or in first task's metadata)
|
|
60
|
+
|
|
61
|
+
### On Task Start
|
|
62
|
+
|
|
63
|
+
Before implementing each task:
|
|
64
|
+
1. Call `TaskUpdate` with `status: "in_progress"` on the current native task
|
|
65
|
+
2. This gives the user a visible spinner with the `activeForm` text
|
|
66
|
+
|
|
67
|
+
### On Task Complete
|
|
68
|
+
|
|
69
|
+
After each task's commit:
|
|
70
|
+
1. Call `TaskUpdate` with `status: "completed"` on the finished native task
|
|
71
|
+
2. The next task automatically becomes unblocked (if dependencies were set)
|
|
72
|
+
|
|
73
|
+
### On Plan Complete
|
|
74
|
+
|
|
75
|
+
Native tasks are session-scoped — they disappear on `/clear` or session end. No cleanup needed. The authoritative state has already been written to `.forge/state/milestone-{id}.yml`.
|
|
76
|
+
|
|
77
|
+
### When NOT to Create Native Tasks
|
|
78
|
+
|
|
79
|
+
- **Spawned fresh agents**: Subagents don't share the parent's task list. The parent agent creates the native tasks; the subagent just does the work.
|
|
80
|
+
- **Single-task plans**: If the plan has only 1 task, skip native task creation — the overhead isn't worth it.
|
|
81
|
+
|
|
82
|
+
---
|
|
83
|
+
|
|
45
84
|
## Task Execution Flow
|
|
46
85
|
|
|
47
86
|
For each task in the plan:
|
|
48
87
|
|
|
49
88
|
1. **Read** the task XML (name, files, action, verify, done)
|
|
50
|
-
2. **
|
|
51
|
-
3. **
|
|
52
|
-
4. **
|
|
53
|
-
5. **
|
|
54
|
-
6. **
|
|
89
|
+
2. **Mark in-progress** — `TaskUpdate` the native task to `in_progress`
|
|
90
|
+
3. **Check** context.md — does this task touch a locked decision? Honor it exactly.
|
|
91
|
+
4. **Implement** following the action instructions
|
|
92
|
+
5. **Verify** using the verify step (run tests, inspect output)
|
|
93
|
+
6. **Confirm** done criteria are met
|
|
94
|
+
7. **Commit** atomically
|
|
95
|
+
8. **Mark complete** — `TaskUpdate` the native task to `completed`
|
|
55
96
|
|
|
56
97
|
## TDD Flow (When task type="tdd")
|
|
57
98
|
|
|
58
99
|
### With Test Spec (from planning Step 7)
|
|
59
100
|
When the task has a `<spec>` field, test specs already exist:
|
|
60
|
-
1. **Copy spec to test location:** Move from `.forge/phases/` to the project's test directory
|
|
101
|
+
1. **Copy spec to test location:** Move from `.forge/phases/m{M}-{N}-{name}/specs/` to the project's test directory
|
|
61
102
|
2. **RED:** Remove `skip` markers from the first test. Confirm it fails. Commit: `test({scope}): activate spec tests for {feature}`
|
|
62
103
|
3. **GREEN:** Write minimal code to make it pass. Repeat for each test in the spec.
|
|
63
104
|
4. **REFACTOR:** Clean up. Commit: `feat({scope}): implement {feature}`
|
|
@@ -112,7 +153,7 @@ After each task, mentally assess context usage:
|
|
|
112
153
|
After completing all tasks in a plan, create a summary:
|
|
113
154
|
|
|
114
155
|
```markdown
|
|
115
|
-
# Execution Summary:
|
|
156
|
+
# Execution Summary: m{M}-{N}-{name}, Plan {NN}
|
|
116
157
|
|
|
117
158
|
## Completed Tasks
|
|
118
159
|
1. [Task name] — [one-line result]
|
|
@@ -498,22 +498,24 @@ If user explicitly says "Use Quick/Standard/Full tier" — honor it. No argument
|
|
|
498
498
|
|
|
499
499
|
## Step 3: Route to Next Skill
|
|
500
500
|
|
|
501
|
-
Based on detected tier and current state, tell the user which skill comes next and invoke it
|
|
501
|
+
Based on detected tier and current state, tell the user which skill comes next and **invoke it using the `Skill` tool**.
|
|
502
|
+
|
|
503
|
+
**CRITICAL: NEVER use `EnterPlanMode` or Claude Code's native plan mode.** All Forge phases are handled by Forge skills invoked via the `Skill` tool. When the workflow says "planning", that means invoke `Skill(planning)` — not enter native plan mode. Native plan mode writes to a different file format and bypasses Forge's constitutional gates, state management, and structured plan output.
|
|
502
504
|
|
|
503
505
|
If resuming mid-workflow:
|
|
504
506
|
- Read the selected milestone's state file (`.forge/state/milestone-{id}.yml`) for current position
|
|
505
507
|
- **Use `current.status` to determine the next skill** — this is the authoritative workflow position:
|
|
506
508
|
|
|
507
|
-
| `current.status` | Next Action |
|
|
509
|
+
| `current.status` | Next Action (invoke via `Skill` tool) |
|
|
508
510
|
|-------------------|-------------|
|
|
509
511
|
| `not_started` | Detect tier, start workflow |
|
|
510
|
-
| `researching` |
|
|
511
|
-
| `discussing` |
|
|
512
|
-
| `planning` |
|
|
513
|
-
| `executing` |
|
|
514
|
-
| `verifying` |
|
|
515
|
-
| `auditing` |
|
|
516
|
-
| `refactoring` |
|
|
512
|
+
| `researching` | Invoke `Skill(researching)`, then → `discussing` |
|
|
513
|
+
| `discussing` | Invoke `Skill(discussing)`, then → `planning` (or `architecting` for Full) |
|
|
514
|
+
| `planning` | Invoke `Skill(planning)`, then → `executing` |
|
|
515
|
+
| `executing` | Invoke `Skill(executing)`, then → `verifying` |
|
|
516
|
+
| `verifying` | Invoke `Skill(verifying)`, then → `auditing` |
|
|
517
|
+
| `auditing` | Invoke `Skill(auditing)`, then → `refactoring` |
|
|
518
|
+
| `refactoring` | Invoke `Skill(refactoring)`, then → `complete` |
|
|
517
519
|
| `complete` | Milestone is done. Ask user what's next. |
|
|
518
520
|
|
|
519
521
|
- **Never treat a milestone as complete just because `overall_percent` is 100%.** Task completion and workflow completion are different. All planned tasks being done (100%) means execution is finished — verification, auditing, and refactoring still need to run.
|
|
@@ -581,7 +583,7 @@ Each skill ends with a standard handoff message. The pattern is:
|
|
|
581
583
|
| researching | Research summary (markdown in conversation or `.forge/` files) | discussing reads research findings |
|
|
582
584
|
| discussing | Decision summary → carried into planning via context.md | planning reads context.md |
|
|
583
585
|
| architecting | ADRs in `.forge/decisions/`, data models, API contracts | planning reads decisions |
|
|
584
|
-
| planning | Plans in `.forge/phases/`, requirements.yml, roadmap.yml, context.md | executing reads plans |
|
|
586
|
+
| planning | Plans in `.forge/phases/m{M}-{N}-{name}/`, requirements.yml, roadmap.yml, context.md | executing reads plans |
|
|
585
587
|
| executing | Committed code, execution summary, milestone state updated | verifying reads must_haves from plans |
|
|
586
588
|
| verifying | Verification report, desire paths updated | auditing reads project.yml + source files |
|
|
587
589
|
| auditing | Health report in `.forge/audits/` | refactoring reads health report + git diff |
|
|
@@ -7,6 +7,8 @@ description: "Use when you need to break work into executable tasks with verific
|
|
|
7
7
|
|
|
8
8
|
Turn research and requirements into executable, verifiable plans.
|
|
9
9
|
|
|
10
|
+
> **IMPORTANT:** This skill replaces Claude Code's native plan mode. Do NOT use `EnterPlanMode` — all planning output goes to `.forge/phases/` as structured plan files, not to the native plan file. Follow the steps below directly in the conversation.
|
|
11
|
+
|
|
10
12
|
## Step 1: Resolution Gate
|
|
11
13
|
|
|
12
14
|
Read `.forge/context.md`. Check the **Needs Resolution** section.
|
package/template/CLAUDE.md
CHANGED
|
@@ -2,10 +2,16 @@
|
|
|
2
2
|
|
|
3
3
|
A lean meta-prompting framework for Claude Code. Synthesizes context engineering (GSD) and constitutional governance (Spec-Kit) on Claude Code's native primitives.
|
|
4
4
|
|
|
5
|
+
## Critical: No Native Plan Mode
|
|
6
|
+
|
|
7
|
+
**NEVER use the `EnterPlanMode` tool when the Forge framework is active** (i.e., when `.forge/` exists or a Forge skill is running). Forge has its own `planning` skill that writes structured plans to `.forge/phases/`. Claude Code's native plan mode writes to a separate plan file with a different format — this conflicts with Forge's workflow and state management.
|
|
8
|
+
|
|
9
|
+
When the workflow reaches the planning phase, **invoke the `planning` skill using the `Skill` tool** — do not enter native plan mode. This applies to all tiers (Standard and Full). The same rule applies to all other Forge phases: always invoke the corresponding Forge skill, never substitute a native Claude Code behavior.
|
|
10
|
+
|
|
5
11
|
## Core Principles
|
|
6
12
|
|
|
7
13
|
1. **Lean by default, powerful when needed.** Quick fixes skip ceremony. Complex features get full governance. The framework adapts — you don't.
|
|
8
|
-
2. **Native-first.** Skills, agents, hooks, plugins — use Claude Code's built-in systems. No custom JavaScript, no reinvented orchestration.
|
|
14
|
+
2. **Native-first.** Skills, agents, hooks, plugins — use Claude Code's built-in systems. No custom JavaScript, no reinvented orchestration. Periodically audit Forge features against Claude Code's current native capabilities — if a native tool now handles what a Forge feature does, deprecate the Forge version. Use native tools for session-scoped concerns (task UI, exploration) and Forge state for cross-session persistence. When in doubt, prefer native.
|
|
9
15
|
3. **Context is sacred.** Every token earns its place. Size-gate all artifacts, lazy-load skills, spawn fresh agents for isolated work.
|
|
10
16
|
4. **Decisions are contracts.** User decisions lock before building begins. Downstream agents honor contracts or flag violations — never silently override.
|
|
11
17
|
5. **Verify against goals, not tasks.** "Does it work?" beats "Did we complete the checklist?" Goal-backward verification at every tier.
|