workspace-maxxing 0.1.0 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (38) hide show
  1. package/LICENSE +21 -0
  2. package/README.md +108 -35
  3. package/dist/index.js +46 -22
  4. package/dist/index.js.map +1 -1
  5. package/dist/install.d.ts +1 -1
  6. package/dist/install.d.ts.map +1 -1
  7. package/dist/install.js +7 -1
  8. package/dist/install.js.map +1 -1
  9. package/package.json +1 -1
  10. package/src/index.ts +52 -22
  11. package/src/install.ts +8 -1
  12. package/templates/SKILL.md +88 -0
  13. package/docs/superpowers/plans/2026-04-07-autonomous-iteration-plan.md +0 -1123
  14. package/docs/superpowers/plans/2026-04-07-autonomous-iteration-sub-agent-batches.md +0 -1923
  15. package/docs/superpowers/plans/2026-04-07-autonomous-workflow-sub-skill-plan.md +0 -1505
  16. package/docs/superpowers/plans/2026-04-07-benchmarking-multi-agent-plan.md +0 -854
  17. package/docs/superpowers/plans/2026-04-07-workspace-builder-logic-plan.md +0 -1426
  18. package/docs/superpowers/plans/2026-04-07-workspace-maxxing-plan.md +0 -1299
  19. package/docs/superpowers/plans/2026-04-08-session-294c-subagent-invocation-plan.md +0 -320
  20. package/docs/superpowers/plans/2026-04-08-workflow-prompt-hardening-plan.md +0 -1025
  21. package/docs/superpowers/plans/2026-04-12-workspace-agent-creation-plan.md +0 -992
  22. package/docs/superpowers/specs/2026-04-07-autonomous-iteration-design.md +0 -214
  23. package/docs/superpowers/specs/2026-04-07-autonomous-iteration-sub-agent-batches-design.md +0 -188
  24. package/docs/superpowers/specs/2026-04-07-autonomous-workflow-sub-skill-design.md +0 -137
  25. package/docs/superpowers/specs/2026-04-07-benchmarking-multi-agent-design.md +0 -105
  26. package/docs/superpowers/specs/2026-04-07-workspace-builder-logic-design.md +0 -179
  27. package/docs/superpowers/specs/2026-04-07-workspace-maxxing-design.md +0 -227
  28. package/docs/superpowers/specs/2026-04-08-session-294c-subagent-invocation-design.md +0 -265
  29. package/docs/superpowers/specs/2026-04-08-workflow-prompt-hardening-design.md +0 -146
  30. package/docs/superpowers/specs/2026-04-12-workspace-agent-creation-design.md +0 -239
  31. package/workspace/00-meta/CONTEXT.md +0 -3
  32. package/workspace/00-meta/execution-log.md +0 -17
  33. package/workspace/00-meta/tools.md +0 -11
  34. package/workspace/01-input/CONTEXT.md +0 -27
  35. package/workspace/CONTEXT.md +0 -35
  36. package/workspace/README.md +0 -14
  37. package/workspace/SYSTEM.md +0 -36
  38. package/workspace-maxxing-0.1.0.tgz +0 -0
@@ -1,146 +0,0 @@
1
- # Workflow Prompt Hardening for Scaffolded Workspaces — Design Spec
2
-
3
- ## Overview
4
-
5
- Harden generated workspace prompts so agents reliably follow the intended workflow stages and loading discipline. This design upgrades scaffolded `SYSTEM.md`, root `CONTEXT.md`, and stage-level `CONTEXT.md` generation, aligns shipped template docs to the same structure, and introduces stricter validation checks that enforce both structure and workflow semantics.
6
-
7
- ## Goals
8
-
9
- 1. Make scaffold-generated prompts robust enough to guide stage-correct execution.
10
- 2. Ensure prompt structures are consistent across generated workspaces and shipped templates.
11
- 3. Enforce workflow contracts through `validate.ts` so prompt regressions fail early.
12
- 4. Preserve existing script ergonomics (same CLI usage and pass/fail exit behavior).
13
-
14
- ## Non-Goals
15
-
16
- 1. No anti-rationalization or iron-law requirement in validation for this phase.
17
- 2. No orchestrator or benchmark behavior changes.
18
- 3. No package or dependency changes (Node builtins only).
19
-
20
- ## Architecture
21
-
22
- ### 1) Generation Contract (Scaffold)
23
-
24
- File: `src/scripts/scaffold.ts`
25
-
26
- #### SYSTEM.md (root) required sections
27
- - `## Role`
28
- - `## Folder Map`
29
- - `## Workflow Rules`
30
- - `## Stage Boundaries`
31
- - `## Tooling Policy`
32
-
33
- The generated content must explicitly communicate:
34
- - One-way stage dependencies.
35
- - Numbered stage progression.
36
- - Selective loading expectations.
37
- - Canonical-source behavior (avoid duplication).
38
-
39
- #### CONTEXT.md (root router) required sections
40
- - `## How to Use This File`
41
- - `## Task Routing`
42
- - `## Loading Order`
43
- - `## Stage Handoff Routing`
44
- - `## Escalation`
45
-
46
- The generated router must:
47
- - Route workspace intents to specific stage context files.
48
- - Include deterministic loading order (`SYSTEM.md` first, then root `CONTEXT.md`, then stage `CONTEXT.md`, then minimal task files).
49
- - Describe how to move from one stage to the next.
50
-
51
- #### Stage CONTEXT.md required sections
52
- - `## Purpose`
53
- - `## Inputs`
54
- - `## Outputs`
55
- - `## Dependencies`
56
- - `## Completion Criteria`
57
- - `## Handoff`
58
-
59
- Each stage context must include enough contract detail to prevent generic or cross-stage ambiguous behavior.
60
-
61
- ### 2) Template Alignment
62
-
63
- Files:
64
- - `templates/.workspace-templates/SYSTEM.md`
65
- - `templates/.workspace-templates/CONTEXT.md`
66
- - `templates/.workspace-templates/workspace/01-input/CONTEXT.md`
67
- - `templates/.workspace-templates/workspace/02-process/CONTEXT.md`
68
- - `templates/.workspace-templates/workspace/03-output/CONTEXT.md`
69
-
70
- These files will be updated to match the same generation contract so both:
71
- - scaffolded workspaces, and
72
- - packaged templates
73
-
74
- provide consistent guidance quality.
75
-
76
- ### 3) Validation Contract (Structural + Workflow Strictness)
77
-
78
- File: `src/scripts/validate.ts`
79
-
80
- #### Root checks
81
- - `SYSTEM.md` exists.
82
- - `SYSTEM.md` contains required section headings.
83
- - Root `CONTEXT.md` exists.
84
- - Root `CONTEXT.md` contains required section headings.
85
-
86
- #### Stage checks
87
- For each numbered stage folder:
88
- - `CONTEXT.md` exists and is non-empty.
89
- - Stage `CONTEXT.md` contains all required stage section headings.
90
-
91
- #### Workflow consistency checks
92
- - Every discovered numbered stage appears in root routing.
93
- - Routed stage targets resolve to existing stage `CONTEXT.md` files.
94
- - Root loading order enforces selective loading.
95
- - Dependency direction rule: a stage must not depend on a later-numbered stage.
96
-
97
- #### Output behavior
98
- - Keep check-by-check result structure and human-readable console reporting.
99
- - Keep exit semantics unchanged (`0` pass, `1` fail).
100
-
101
- ## Data Flow
102
-
103
- 1. `scaffold.ts` generates robust prompt contracts at root and stage levels.
104
- 2. Agents load root documents, route into stage contexts, and follow stage boundaries.
105
- 3. `validate.ts` enforces structural and workflow contracts.
106
- 4. Iteration/orchestration logic consumes stronger workflow prompts and fails fast on doc drift.
107
-
108
- ## Testing Strategy (TDD)
109
-
110
- ### Scaffold tests (`tests/scaffold.test.ts`)
111
- Add failing tests first for:
112
- - SYSTEM required section headings.
113
- - Root CONTEXT required section headings.
114
- - Stage CONTEXT required section headings including Completion Criteria and Handoff.
115
- - Root routing references all generated numbered stages.
116
-
117
- ### Validate tests (`tests/validate.test.ts`)
118
- Add failing tests first for:
119
- - Missing SYSTEM required sections.
120
- - Missing root CONTEXT required sections.
121
- - Missing stage required sections.
122
- - Missing stage routing coverage in root router.
123
- - Dependency direction violation (later-stage dependency).
124
-
125
- ### Regression checks
126
- - Existing scaffold/validate expectations that remain valid should still pass.
127
- - Run targeted suites first, then full suite and build.
128
-
129
- ## Risks and Mitigations
130
-
131
- 1. Risk: False negatives from strict heading checks.
132
- - Mitigation: validate by section headings that are stable and intentionally generated.
133
-
134
- 2. Risk: Overly rigid wording constraints.
135
- - Mitigation: enforce section presence and key workflow semantics, not full text equality.
136
-
137
- 3. Risk: Breakage in existing test fixtures.
138
- - Mitigation: update fixture content only where new contract explicitly requires it.
139
-
140
- ## Acceptance Criteria
141
-
142
- 1. Scaffold-generated `SYSTEM.md`, root `CONTEXT.md`, and stage `CONTEXT.md` files contain all required sections.
143
- 2. Shipped template docs use the same robust structure.
144
- 3. `validate.ts` fails when required sections or workflow consistency rules are violated.
145
- 4. Focused scaffold and validate test suites pass.
146
- 5. Full repository test suite and TypeScript build pass.
@@ -1,239 +0,0 @@
1
- # Workspace-Maxxing with Invokable Agent Creation - Design Spec
2
-
3
- ## Overview
4
-
5
- Extends `workspace-maxxing` skill to create, deliver, and iteratively improve an autonomous workflow agent inside the generated workspace. The skill builds both the ICM folder structure AND a self-improving agent that can be invoked with `@` in the workspace.
6
-
7
- ## Goals
8
-
9
- 1. Create invokable sub-agent inside the generated workspace
10
- 2. Agent implements the actual workflow use case (not just the workspace structure)
11
- 3. Self-improvement loop tests and strengthens the agent during build phase
12
- 4. Backward compatible with existing workspace-maxxing behavior
13
- 5. Multi-platform support (OpenCode, Claude Code, Copilot, Gemini)
14
-
15
- ## Non-Goals
16
-
17
- 1. Agent that persists outside the workspace (workspace-local only)
18
- 2. Real-time self-improvement after delivery (improvement only during build)
19
- 3. UI changes to workspace-maxxing skill
20
-
21
- ## Architecture
22
-
23
- ### Package Structure
24
-
25
- ```
26
- workspace-maxxing/
27
- ├── SKILL.md # Main skill entry
28
- ├── package.json
29
- ├── src/
30
- │ ├── index.ts # Main entry, orchestration
31
- │ ├── scaffold.ts # Creates ICM workspace
32
- │ ├── agent-creator.ts # Creates @agent implementation
33
- │ ├── agent-iterator.ts # Self-improvement loop
34
- │ └── installer.ts # Platform-specific install
35
- ├── scripts/
36
- │ ├── scaffold.ts # (existing)
37
- │ ├── validate.ts # (existing)
38
- │ ├── benchmark.ts # (existing)
39
- │ ├── dispatch.ts # (existing)
40
- │ └── orchestrator.ts # (existing)
41
- └── templates/
42
- ├── workspace/ # ICM workspace template
43
- └── agent/ # Agent template base
44
- ```
45
-
46
- ### Agent Structure (Created Inside Workspace)
47
-
48
- ```
49
- workspace/
50
- ├── .agents/
51
- │ └── skills/
52
- │ └── @<purpose>/ # The invokable agent
53
- │ ├── SKILL.md
54
- │ ├── prompts/
55
- │ │ ├── system.md
56
- │ │ └── tasks/
57
- │ ├── tools/
58
- │ │ └── index.ts
59
- │ ├── tests/
60
- │ │ └── cases.json
61
- │ └── config.json
62
- ├── 01-input/
63
- ├── 02-process/
64
- ├── 03-output/
65
- ├── SYSTEM.md
66
- └── CONTEXT.md
67
- ```
68
-
69
- ## Core Workflow
70
-
71
- ```
72
- ┌─────────────────────────────────────────────────────────────────┐
73
- │ workspace-maxxing execution │
74
- ├─────────────────────────────────────────────────────────────────┤
75
- │ 1. Research phase (dispatch research sub-skill) │
76
- │ 2. Architecture phase (dispatch architecture sub-skill) │
77
- │ 3. Build workspace (scaffold.ts) │
78
- │ 4. Create agent (agent-creator.ts) │
79
- │ - Generate agent name from purpose │
80
- │ - Create agent structure from template │
81
- │ - Initialize prompts based on workflow │
82
- │ 5. Iterate agent (agent-iterator.ts) - SELF-IMPROVING │
83
- │ a. Generate test cases (edge, empty, various) │
84
- │ b. Run agent with each test case │
85
- │ c. Assess robustness (score + issues) │
86
- │ d. If score < threshold: improve agent │
87
- │ e. Repeat until score >= threshold OR max-retries │
88
- │ 6. Validate workspace (validate.ts) │
89
- │ 7. Install agent to platform skill directory │
90
- │ 8. Deliver │
91
- └─────────────────────────────────────────────────────────────────┘
92
- ```
93
-
94
- ## Agent Self-Improvement Loop
95
-
96
- ### Input
97
-
98
- - Agent implementation (prompts, tools, config)
99
- - Test case generation rules (edge cases, empty states, varied inputs)
100
-
101
- ### Process
102
-
103
- 1. **Generate test cases** - Create diverse test inputs based on workflow type
104
- 2. **Execute agent** - Run agent with each test case, capture outputs
105
- 3. **Assess robustness** - Check for:
106
- - Crashes or errors
107
- - Invalid outputs
108
- - Edge case handling
109
- - Empty input handling
110
- 4. **Score** - Compute robustness score (0-100)
111
- 5. **Improve** - If score < threshold, modify prompts/tools to fix issues
112
- 6. **Repeat** - Max 3 improvement cycles per batch
113
-
114
- ### Output
115
-
116
- - Improved agent implementation
117
- - Test case results with pass/fail per case
118
- - Robustness score
119
- - Iteration log
120
-
121
- ### Thresholds
122
-
123
- - Default robustness threshold: 85
124
- - Max iteration cycles: 3 per batch
125
- - Batch size: 3 test cases
126
-
127
- ## Agent Naming
128
-
129
- - **Pattern**: `@<purpose>` derived from workspace purpose
130
- - **Generation**: Convert workspace name/purpose to lowercase, hyphenated
131
- - **Examples**:
132
- - "Daily Digest" → `@daily-digest`
133
- - "Project Tracker" → `@project-tracker`
134
- - "AI News Aggregator" → `@ai-news-aggregator`
135
-
136
- ## Platform Integration
137
-
138
- ### OpenCode
139
-
140
- - Location: `workspace/.agents/skills/@<name>/`
141
- - Invocation: `@agent-name`
142
- - Auto-load on first mention
143
-
144
- ### Claude Code
145
-
146
- - Location: `workspace/.claude/skills/<name>/`
147
- - Invocation: `{@name}` in prompts or instructions
148
- - Manual first invocation required
149
-
150
- ### GitHub Copilot
151
-
152
- - Location: `workspace/.github/copilot-instructions/<name>.md`
153
- - Invocation: Through inline instructions
154
-
155
- ### Gemini CLI
156
-
157
- - Location: `workspace/.gemini/skills/<name>/`
158
- - Invocation: Follows Gemini instruction format
159
-
160
- ## Backward Compatibility
161
-
162
- - **Default behavior**: Creates workspace + agent (new behavior)
163
- - **Opt-out**: `--no-agent` flag creates workspace-only (existing behavior)
164
- - **Detection**: If no agent flag specified, assumes with-agent
165
- - **Existing scripts**: Unchanged, work as before
166
- - **Existing workspaces**: No migration needed
167
-
168
- ## Configuration
169
-
170
- ### Workspace-Maxxing Options
171
-
172
- | Option | Default | Description |
173
- |--------|---------|-------------|
174
- | `--with-agent` | true | Create invokable agent |
175
- | `--agent-name` | auto | Custom agent name (overrides auto) |
176
- | `--robustness-threshold` | 85 | Min score to pass iteration |
177
- | `--max-agent-iterations` | 3 | Max improvement cycles |
178
-
179
- ### Agent Config
180
-
181
- ```json
182
- {
183
- "name": "@daily-digest",
184
- "purpose": "Create daily AI news digest",
185
- "platforms": ["opencode", "claude", "copilot", "gemini"],
186
- "robustnessThreshold": 85,
187
- "iterationCount": 0,
188
- "testCases": []
189
- }
190
- ```
191
-
192
- ## Anti-Patterns
193
-
194
- | Thought | Reality |
195
- |---------|---------|
196
- | "The agent works for basic cases" | Edge cases and empty states matter. Iterate until threshold. |
197
- | "One iteration is enough" | Self-improvement needs multiple cycles. |
198
- | "User will refine the agent later" | Goal is robust agent on delivery. |
199
- | "Agent validation is optional" | Robustness check is mandatory gate. |
200
-
201
- ## Acceptance Criteria
202
-
203
- 1. Workspace-maxxing creates both folder structure AND agent
204
- 2. Agent is invokable with `@<name>` in supported platforms
205
- 3. Self-improvement loop runs during build, achieves score >= 85
206
- 4. Backward compatible - existing usage unchanged with `--no-agent`
207
- 5. Agent persists in workspace, survives session restart
208
- 6. Iteration logs stored in workspace for debugging
209
- 7. Multi-platform installation works (OpenCode, Claude, Copilot, Gemini)
210
-
211
- ## Risks and Mitigations
212
-
213
- | Risk | Mitigation |
214
- |------|------------|
215
- | Agent name collision in workspace | Check existing agents, append number if needed |
216
- | Platform detection fails | Default to OpenCode, log warning |
217
- | Iteration infinite loop | Max 3 cycles, escalate after threshold not met |
218
- | Agent template incomplete | Provide minimal viable template, improve over time |
219
-
220
- ## File Changes
221
-
222
- ### New Files
223
-
224
- - `src/agent-creator.ts` - Creates agent implementation
225
- - `src/agent-iterator.ts` - Self-improvement loop
226
- - `templates/agent/` - Agent template base
227
-
228
- ### Modified Files
229
-
230
- - `SKILL.md` - Update workflow to include agent creation
231
- - `src/index.ts` - Add agent orchestration
232
- - `package.json` - Add dependencies if needed
233
-
234
- ## Timeline
235
-
236
- 1. Phase 1: Agent creator (scaffold agent structure from template)
237
- 2. Phase 2: Agent iterator (self-improvement loop)
238
- 3. Phase 3: Platform installer (multi-platform support)
239
- 4. Phase 4: Backward compatibility (--no-agent flag)
@@ -1,3 +0,0 @@
1
- # 00-meta Context
2
-
3
- Metadata and tool inventory for the NoAgent Test workspace.
@@ -1,17 +0,0 @@
1
- # Execution Log
2
-
3
- ## Stage Checklist
4
-
5
- - [ ] 01-input
6
-
7
- ## Rules
8
-
9
- 1. Mark a stage complete only after its completion criteria are satisfied.
10
- 2. Stages must be checked in ascending numerical order.
11
- 3. Every checked stage must have corresponding evidence notes.
12
-
13
- ## Evidence Notes
14
-
15
- ### 01-input
16
- - Artifacts:
17
- - Handoff Summary:
@@ -1,11 +0,0 @@
1
- ## Tool Inventory
2
-
3
- ## Installed Tools
4
-
5
- | Tool | Version | Manager | Installed |
6
- |------|---------|---------|-----------|
7
- | — | — | — | — |
8
-
9
- ## Pending Tools
10
-
11
- List tools that are proposed but not yet approved.
@@ -1,27 +0,0 @@
1
- # 01-input — Context
2
-
3
- ## Purpose
4
- This folder executes the 01-input stage of the NoAgent Test workflow.
5
-
6
- ## Inputs
7
- - Required data artifacts for 01-input
8
- - Upstream context from previous stage when applicable
9
-
10
- ## Outputs
11
- - Stage-specific deliverables for downstream consumption
12
- - Updated markdown artifacts needed by the next stage
13
-
14
- ## Dependencies
15
- - None (entry stage)
16
-
17
- ## Required Evidence
18
- - Update `00-meta/execution-log.md` to mark 01-input complete before handoff.
19
- - Link or reference the markdown artifacts produced in this stage.
20
-
21
- ## Completion Criteria
22
- - Required outputs are produced and non-empty
23
- - Outputs conform to stage purpose and markdown-first workflow format
24
- - Handoff notes are updated for downstream stage
25
-
26
- ## Handoff
27
- - This is the terminal stage. Package and deliver final output.
@@ -1,35 +0,0 @@
1
- # NoAgent Test — Context Router
2
-
3
- ## How to Use This File
4
- Use this file to route each task to the smallest required context scope.
5
-
6
- ## Task Routing
7
- This routing table maps task intent to the correct stage context.
8
-
9
- | When you need to... | Load | Why |
10
- |---------------------|------|-----|
11
- | Understand workspace constraints | `SYSTEM.md` | Global rules and stage boundaries |
12
- | Work in 01-input tasks | `01-input/CONTEXT.md` | Stage contract and required outputs |
13
- | Check available tools | `00-meta/tools.md` | Tool inventory and approval status |
14
-
15
- ## Loading Order
16
- 1. `SYSTEM.md` (always)
17
- 2. This root `CONTEXT.md`
18
- 3. One relevant stage `CONTEXT.md`
19
- 4. Only the task files needed for that stage
20
-
21
- ## Scope Guardrails
22
- - Route domain requests into workflow design steps and markdown deliverables.
23
- - Do not scaffold backend, frontend, or runtime product source files from this router.
24
- - Keep outputs file-structured and markdown-first across numbered workflow folders.
25
-
26
- ## Sequential Routing Contract
27
- - Route only to the earliest incomplete stage in `00-meta/execution-log.md`.
28
- - Refuse jumps to later stages when earlier stages are not marked complete.
29
- - Append handoff notes for each completed stage before routing onward.
30
-
31
- ## Stage Handoff Routing
32
- - `01-input` -> deliver final output and close loop
33
-
34
- ## Escalation
35
- Escalate when required sections are missing, dependencies are contradictory, or no valid stage route can satisfy the task.
@@ -1,14 +0,0 @@
1
- # NoAgent Test Workspace
2
-
3
- ## Structure
4
-
5
- - `01-input/`
6
- - `00-meta/`
7
-
8
- ## Usage
9
-
10
- 1. Follow the workflow stages in order
11
- 2. Load CONTEXT.md files selectively — only what you need
12
- 3. Update 00-meta/execution-log.md after each completed stage
13
- 4. Keep outputs in stage folders as markdown workflow artifacts
14
- 5. Run validate.ts to check ICM compliance
@@ -1,36 +0,0 @@
1
- # NoAgent Test — System Prompt
2
-
3
- ## Role
4
- You are an AI assistant operating inside the NoAgent Test workspace. Follow stage boundaries and route tasks through stage-specific CONTEXT files.
5
-
6
- ## Folder Map
7
-
8
- | Stage | Folder | Purpose |
9
- |------:|--------|---------|
10
- | 1 | `01-input/` | Input collection and validation |
11
- | meta | `00-meta/` | Workspace configuration, tool inventory, and session notes |
12
-
13
- ## Workflow Rules
14
- 1. Read `SYSTEM.md` first, then root `CONTEXT.md`.
15
- 2. Load only one stage `CONTEXT.md` at a time unless handoff explicitly requires another stage.
16
- 3. Keep information canonical; do not duplicate facts across files.
17
- 4. Maintain one-way stage dependencies from earlier stage numbers to later stage numbers.
18
-
19
- ## Scope Guardrails
20
- - Build and maintain workflow documentation, not product implementation code.
21
- - Keep stage outputs as markdown artifacts (plans, checklists, prompts, routing notes).
22
- - If asked to build the product itself, capture that request as workflow requirements and stay in ICM workspace scope.
23
-
24
- ## Sequential Execution Protocol
25
- 1. Complete stages strictly in ascending numeric order.
26
- 2. Record stage completion in `00-meta/execution-log.md` before moving to the next stage.
27
- 3. Do not produce final deliverables until all prior stage checkboxes are complete.
28
-
29
- ## Stage Boundaries
30
- - Each numbered folder is an execution stage.
31
- - A stage may consume upstream outputs but must not redefine upstream facts.
32
- - Cross-stage jumps require explicit routing through root `CONTEXT.md`.
33
-
34
- ## Tooling Policy
35
- - Check `00-meta/tools.md` before proposing tool installation.
36
- - Document approved tooling changes in `00-meta/tools.md`.
Binary file