forge-orkes 0.3.4 → 0.3.5

package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "forge-orkes",
- "version": "0.3.4",
+ "version": "0.3.5",
  "description": "Set up the Forge meta-prompting framework for Claude Code in your project",
  "bin": {
  "create-forge": "./bin/create-forge.js"
@@ -42,22 +42,63 @@ After 3 auto-fix attempts on a single task → STOP fixing. Document remaining i
  ### Scope Boundary
  Only fix issues DIRECTLY caused by the current task. Pre-existing warnings, tech debt, unrelated bugs → log to `.forge/deferred-issues.md`, don't fix.
 
+ ## Native Task Tracking (Hybrid Approach)
+
+ Use Claude Code's native task tools (`TaskCreate`, `TaskUpdate`, `TaskList`) for **in-session visibility** during execution. The `.forge/state/milestone-{id}.yml` remains the **cross-session source of truth** — native tasks are a UI layer on top.
+
+ ### On Plan Start
+
+ After loading the plan file, create native tasks from the XML task blocks:
+
+ 1. Parse all `<task>` XML blocks from the plan
+ 2. For each task, call `TaskCreate`:
+    - `subject`: The task's `<name>` value
+    - `description`: Combine `<action>`, `<verify>`, and `<done>` fields
+    - `activeForm`: Present-continuous form of the task name (e.g., "Implementing login form")
+ 3. If the plan has ordered tasks (task 2 depends on task 1), set dependencies via `TaskUpdate` with `addBlockedBy`
+ 4. Store the mapping: native task ID → plan task number (track mentally or in first task's metadata)
+
+ ### On Task Start
+
+ Before implementing each task:
+ 1. Call `TaskUpdate` with `status: "in_progress"` on the current native task
+ 2. This gives the user a visible spinner with the `activeForm` text
+
+ ### On Task Complete
+
+ After each task's commit:
+ 1. Call `TaskUpdate` with `status: "completed"` on the finished native task
+ 2. The next task automatically becomes unblocked (if dependencies were set)
+
+ ### On Plan Complete
+
+ Native tasks are session-scoped — they disappear on `/clear` or session end. No cleanup needed. The authoritative state has already been written to `.forge/state/milestone-{id}.yml`.
+
+ ### When NOT to Create Native Tasks
+
+ - **Spawned fresh agents**: Subagents don't share the parent's task list. The parent agent creates the native tasks; the subagent just does the work.
+ - **Single-task plans**: If the plan has only 1 task, skip native task creation — the overhead isn't worth it.
+
+ ---
+
  ## Task Execution Flow
 
  For each task in the plan:
 
  1. **Read** the task XML (name, files, action, verify, done)
- 2. **Check** context.md does this task touch a locked decision? Honor it exactly.
- 3. **Implement** following the action instructions
- 4. **Verify** using the verify step (run tests, inspect output)
- 5. **Confirm** done criteria are met
- 6. **Commit** atomically
+ 2. **Mark in-progress** — `TaskUpdate` the native task to `in_progress`
+ 3. **Check** context.md does this task touch a locked decision? Honor it exactly.
+ 4. **Implement** following the action instructions
+ 5. **Verify** using the verify step (run tests, inspect output)
+ 6. **Confirm** done criteria are met
+ 7. **Commit** atomically
+ 8. **Mark complete** — `TaskUpdate` the native task to `completed`
 
  ## TDD Flow (When task type="tdd")
 
  ### With Test Spec (from planning Step 7)
  When the task has a `<spec>` field, test specs already exist:
- 1. **Copy spec to test location:** Move from `.forge/phases/` to the project's test directory
+ 1. **Copy spec to test location:** Move from `.forge/phases/m{M}-{N}-{name}/specs/` to the project's test directory
  2. **RED:** Remove `skip` markers from the first test. Confirm it fails. Commit: `test({scope}): activate spec tests for {feature}`
  3. **GREEN:** Write minimal code to make it pass. Repeat for each test in the spec.
  4. **REFACTOR:** Clean up. Commit: `feat({scope}): implement {feature}`
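Reviewer note on the "On Plan Start" steps added in this hunk: the mapping from `<task>` blocks to `TaskCreate` fields can be sketched as below. This is only an illustration under stated assumptions. The plan excerpt, the helper names (`build_task_payloads`, `sequential_blockers`), the `plan_task_number` key, and the `active_forms` lookup are all hypothetical; only the tag names and the `subject`, `description`, and `activeForm` fields come from the diff. `TaskCreate` and `TaskUpdate` are Claude Code tools, not Python functions, so the sketch only builds the payloads an agent would pass to them.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical plan excerpt; the real plan file layout is not shown in the diff.
PLAN_TEXT = """
<task>
  <name>Implement login form</name>
  <action>Build the form component.</action>
  <verify>Run the form tests.</verify>
  <done>Form submits valid credentials.</done>
</task>

<task type="tdd">
  <name>Add session handling</name>
  <action>Store the session token.</action>
  <verify>Run the session tests.</verify>
  <done>Token survives a reload.</done>
</task>
"""

# Present-continuous labels are supplied by hand (assumption: no automatic inflection).
ACTIVE_FORMS = {
    "Implement login form": "Implementing login form",
    "Add session handling": "Adding session handling",
}

TASK_BLOCK = re.compile(r"<task\b.*?</task>", re.DOTALL)


def build_task_payloads(plan_text, active_forms):
    """Return one TaskCreate-style dict per <task> block, in plan order."""
    payloads = []
    for number, block in enumerate(TASK_BLOCK.findall(plan_text), start=1):
        task = ET.fromstring(block)
        name = (task.findtext("name") or "").strip()
        # Combine <action>, <verify>, and <done> into one description.
        description = "\n".join(
            f"{tag}: {(task.findtext(tag) or '').strip()}"
            for tag in ("action", "verify", "done")
        )
        payloads.append({
            "plan_task_number": number,  # kept for the ID -> plan number mapping
            "subject": name,
            "description": description,
            "activeForm": active_forms.get(name, name),
        })
    return payloads


def sequential_blockers(payloads):
    """For an ordered plan, each task is blocked by the one before it."""
    blockers = {}
    previous = None
    for payload in payloads:
        number = payload["plan_task_number"]
        blockers[number] = [] if previous is None else [previous]
        previous = number
    return blockers


payloads = build_task_payloads(PLAN_TEXT, ACTIVE_FORMS)
# Single-task plans skip native task creation, per the hunk above.
create_native_tasks = len(payloads) > 1
blockers = sequential_blockers(payloads)
```

Extracting each `<task>` block with a regular expression before parsing keeps the sketch usable when the blocks are embedded in a larger plan file that is not itself well-formed XML; that embedding is an assumption about the plan format.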
@@ -112,7 +153,7 @@ After each task, mentally assess context usage:
  After completing all tasks in a plan, create a summary:
 
  ```markdown
- # Execution Summary: Phase {N}, Plan {NN}
+ # Execution Summary: m{M}-{N}-{name}, Plan {NN}
 
  ## Completed Tasks
  1. [Task name] — [one-line result]
@@ -498,22 +498,24 @@ If user explicitly says "Use Quick/Standard/Full tier" — honor it. No argument
 
  ## Step 3: Route to Next Skill
 
- Based on detected tier and current state, tell the user which skill comes next and invoke it.
+ Based on detected tier and current state, tell the user which skill comes next and **invoke it using the `Skill` tool**.
+
+ **CRITICAL: NEVER use `EnterPlanMode` or Claude Code's native plan mode.** All Forge phases are handled by Forge skills invoked via the `Skill` tool. When the workflow says "planning", that means invoke `Skill(planning)` — not enter native plan mode. Native plan mode writes to a different file format and bypasses Forge's constitutional gates, state management, and structured plan output.
 
  If resuming mid-workflow:
  - Read the selected milestone's state file (`.forge/state/milestone-{id}.yml`) for current position
  - **Use `current.status` to determine the next skill** — this is the authoritative workflow position:
 
- | `current.status` | Next Action |
+ | `current.status` | Next Action (invoke via `Skill` tool) |
  |-------------------|-------------|
  | `not_started` | Detect tier, start workflow |
- | `researching` | Resume or complete `researching`, then → `discussing` |
- | `discussing` | Resume or complete `discussing`, then → `planning` (or `architecting` for Full) |
- | `planning` | Resume or complete `planning`, then → `executing` |
- | `executing` | Resume or complete `executing`, then → `verifying` |
- | `verifying` | Resume or complete `verifying`, then → `auditing` |
- | `auditing` | Resume or complete `auditing`, then → `refactoring` |
- | `refactoring` | Resume or complete `refactoring`, then → `complete` |
+ | `researching` | Invoke `Skill(researching)`, then → `discussing` |
+ | `discussing` | Invoke `Skill(discussing)`, then → `planning` (or `architecting` for Full) |
+ | `planning` | Invoke `Skill(planning)`, then → `executing` |
+ | `executing` | Invoke `Skill(executing)`, then → `verifying` |
+ | `verifying` | Invoke `Skill(verifying)`, then → `auditing` |
+ | `auditing` | Invoke `Skill(auditing)`, then → `refactoring` |
+ | `refactoring` | Invoke `Skill(refactoring)`, then → `complete` |
  | `complete` | Milestone is done. Ask user what's next. |
 
  - **Never treat a milestone as complete just because `overall_percent` is 100%.** Task completion and workflow completion are different. All planned tasks being done (100%) means execution is finished — verification, auditing, and refactoring still need to run.
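Reviewer note on the routing table in this hunk: the resume logic reads as a lookup keyed on `current.status`, with `overall_percent` deliberately ignored when deciding whether the milestone is finished. A minimal sketch follows, assuming the state file has already been loaded into a dict; the dict layout and the function names `next_action` and `is_workflow_complete` are hypothetical. The table has no row for an `architecting` status, so the sketch raises on any status it does not list rather than guessing.

```python
# Status -> following status, transcribed from the routing table in the hunk.
# The skill to invoke has the same name as the current status.
NEXT_STATUS = {
    "researching": "discussing",
    "discussing": "planning",  # Full tier routes to "architecting" instead
    "planning": "executing",
    "executing": "verifying",
    "verifying": "auditing",
    "auditing": "refactoring",
    "refactoring": "complete",
}


def next_action(status, tier="Standard"):
    """Describe the next step for a given current.status value."""
    if status == "not_started":
        return "Detect tier, start workflow"
    if status == "complete":
        return "Milestone is done. Ask user what's next."
    if status not in NEXT_STATUS:
        raise ValueError(f"status not covered by the routing table: {status!r}")
    following = NEXT_STATUS[status]
    if status == "discussing" and tier == "Full":
        following = "architecting"
    return f"Invoke Skill({status}), then -> {following}"


def is_workflow_complete(state):
    """Only current.status decides completion; overall_percent is not consulted."""
    return state["current"]["status"] == "complete"


# Hypothetical in-memory view of a milestone state file.
state = {"current": {"status": "executing"}, "overall_percent": 100}
action = next_action(state["current"]["status"])
done = is_workflow_complete(state)
```

Here all tasks are finished (`overall_percent` is 100) but the workflow still routes to `executing` and then `verifying`, which is the distinction the last bullet of the hunk draws.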
@@ -581,7 +583,7 @@ Each skill ends with a standard handoff message. The pattern is:
  | researching | Research summary (markdown in conversation or `.forge/` files) | discussing reads research findings |
  | discussing | Decision summary → carried into planning via context.md | planning reads context.md |
  | architecting | ADRs in `.forge/decisions/`, data models, API contracts | planning reads decisions |
- | planning | Plans in `.forge/phases/`, requirements.yml, roadmap.yml, context.md | executing reads plans |
+ | planning | Plans in `.forge/phases/m{M}-{N}-{name}/`, requirements.yml, roadmap.yml, context.md | executing reads plans |
  | executing | Committed code, execution summary, milestone state updated | verifying reads must_haves from plans |
  | verifying | Verification report, desire paths updated | auditing reads project.yml + source files |
  | auditing | Health report in `.forge/audits/` | refactoring reads health report + git diff |
@@ -7,6 +7,8 @@ description: "Use when you need to break work into executable tasks with verific
 
  Turn research and requirements into executable, verifiable plans.
 
+ > **IMPORTANT:** This skill replaces Claude Code's native plan mode. Do NOT use `EnterPlanMode` — all planning output goes to `.forge/phases/` as structured plan files, not to the native plan file. Follow the steps below directly in the conversation.
+
  ## Step 1: Resolution Gate
 
  Read `.forge/context.md`. Check the **Needs Resolution** section.
@@ -2,10 +2,16 @@
 
  A lean meta-prompting framework for Claude Code. Synthesizes context engineering (GSD) and constitutional governance (Spec-Kit) on Claude Code's native primitives.
 
+ ## Critical: No Native Plan Mode
+
+ **NEVER use the `EnterPlanMode` tool when the Forge framework is active** (i.e., when `.forge/` exists or a Forge skill is running). Forge has its own `planning` skill that writes structured plans to `.forge/phases/`. Claude Code's native plan mode writes to a separate plan file with a different format — this conflicts with Forge's workflow and state management.
+
+ When the workflow reaches the planning phase, **invoke the `planning` skill using the `Skill` tool** — do not enter native plan mode. This applies to all tiers (Standard and Full). The same rule applies to all other Forge phases: always invoke the corresponding Forge skill, never substitute a native Claude Code behavior.
+
  ## Core Principles
 
  1. **Lean by default, powerful when needed.** Quick fixes skip ceremony. Complex features get full governance. The framework adapts — you don't.
- 2. **Native-first.** Skills, agents, hooks, plugins — use Claude Code's built-in systems. No custom JavaScript, no reinvented orchestration.
+ 2. **Native-first.** Skills, agents, hooks, plugins — use Claude Code's built-in systems. No custom JavaScript, no reinvented orchestration. Periodically audit Forge features against Claude Code's current native capabilities — if a native tool now handles what a Forge feature does, deprecate the Forge version. Use native tools for session-scoped concerns (task UI, exploration) and Forge state for cross-session persistence. When in doubt, prefer native.
  3. **Context is sacred.** Every token earns its place. Size-gate all artifacts, lazy-load skills, spawn fresh agents for isolated work.
  4. **Decisions are contracts.** User decisions lock before building begins. Downstream agents honor contracts or flag violations — never silently override.
  5. **Verify against goals, not tasks.** "Does it work?" beats "Did we complete the checklist?" Goal-backward verification at every tier.