codex-genesis-harness 0.1.6 → 0.1.7
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.codebase/CURRENT_STATE.md +8 -7
- package/.codebase/RECOVERY_POINTS.md +38 -393
- package/.codebase/beads.json +16 -0
- package/.codex/skills/genesis-ai-provider/SKILL.md +1 -1
- package/.codex/skills/genesis-api-contract/SKILL.md +1 -1
- package/.codex/skills/genesis-api-sync/SKILL.md +1 -1
- package/.codex/skills/genesis-architecture/SKILL.md +1 -1
- package/.codex/skills/genesis-codebase-map/SKILL.md +1 -1
- package/.codex/skills/genesis-debug-guide/SKILL.md +1 -1
- package/.codex/skills/genesis-design-spec/SKILL.md +3 -3
- package/.codex/skills/genesis-harness/SKILL.md +3 -2
- package/.codex/skills/genesis-harness-engineering/SKILL.md +1 -1
- package/.codex/skills/genesis-new-design/SKILL.md +2 -1
- package/.codex/skills/genesis-pipeline-orchestration/SKILL.md +1 -1
- package/.codex/skills/genesis-planning/SKILL.md +26 -1
- package/.codex/skills/genesis-release/SKILL.md +24 -1
- package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/SKILL.md +1 -1
- package/CHANGELOG.md +14 -0
- package/README.EN.md +7 -11
- package/README.VI.md +9 -13
- package/README.md +10 -10
- package/VERSION +1 -1
- package/bin/genesis-harness.js +606 -6
- package/package.json +1 -1
- package/scripts/install.sh +2 -6
- package/scripts/run-evals.sh +20 -25
- package/scripts/uninstall.sh +2 -6
- package/scripts/verify.sh +6 -66
- package/.codex/skills/genesis-docs/SKILL.md +0 -46
- package/.codex/skills/genesis-docs/agents/openai.yaml +0 -7
- package/.codex/skills/genesis-mvp-planning/SKILL.md +0 -114
- package/.codex/skills/genesis-mvp-planning/agents/openai.yaml +0 -6
- package/.codex/skills/genesis-release-orchestration/SKILL.md +0 -653
- package/.codex/skills/genesis-release-orchestration/agents/openai.yaml +0 -7
- package/.codex/skills/genesis-research/SKILL.md +0 -46
- package/.codex/skills/genesis-research/agents/openai.yaml +0 -7
- /package/.codex/skills/{genesis-docs/checklists/checklist.md → genesis-docs-automation/checklists/manual-docs-checklist.md} +0 -0
- /package/.codex/skills/{genesis-docs/examples/example.md → genesis-docs-automation/examples/manual-docs-example.md} +0 -0
- /package/.codex/skills/{genesis-docs → genesis-docs-automation}/templates/docs-update-template.md +0 -0
- /package/.codex/skills/{genesis-state-machine/SKILL.md → genesis-harness/references/state-machine.md} +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/checklists/mvp-readiness.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/examples/5-phase-roadmap-example.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-1-core.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-2-auth.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-3-features.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-4-integrations.md +0 -0
- /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-5-readiness.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/checklists/post-deployment-verification.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/checklists/pre-release-validation.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration/examples/example.md → genesis-release/examples/orchestration-example.md} +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/observability/release-tracking.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/playbooks/canary-deployment-orchestration.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/playbooks/semantic-versioning-automation.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/templates/deployment-strategy-template.md +0 -0
- /package/.codex/skills/{genesis-release-orchestration → genesis-release}/templates/release-runbook-template.md +0 -0
- /package/.codex/skills/{genesis-research → genesis-research-first}/checklists/checklist.md +0 -0
- /package/.codex/skills/{genesis-research/examples/example.md → genesis-research-first/examples/manual-research-example.md} +0 -0
- /package/.codex/skills/{genesis-research → genesis-research-first}/templates/research-note-template.md +0 -0
- /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/agents/openai.yaml +0 -0
- /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/checklists/checklist.md +0 -0
- /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/examples/example.md +0 -0
- /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/templates/playwright-test-template.md +0 -0
|
@@ -1,10 +1,11 @@
|
|
|
1
1
|
# Current State: COMPLETED
|
|
2
|
-
Last updated: Mon Jun 01
|
|
2
|
+
Last updated: Mon Jun 01 17:15:00 +07 2026
|
|
3
3
|
|
|
4
4
|
## Reason
|
|
5
|
-
|
|
6
|
-
-
|
|
7
|
-
-
|
|
8
|
-
-
|
|
9
|
-
-
|
|
10
|
-
|
|
5
|
+
Successfully completed the Visual Mockup Generation & Interactive TUI Mockup Viewer integration (v0.3.0), followed by Harness Engineering standardizations and preparation for release `0.1.7`:
|
|
6
|
+
- **Interactive Keyboard-Navigated CLI TUI**: Developed an elegant console interface for `genesis-harness view-mockup` capturing stdin keypresses.
|
|
7
|
+
- **Harness Verification Streamlining**: Refactored `scripts/verify.sh` and `scripts/run-evals.sh` to dynamically evaluate skill names, removing legacy hard-coded mapping logic. Cleaned up deprecated skills (e.g., `genesis-mvp-planning`, `genesis-release-orchestration`, `genesis-state-machine`, `genesis-research`, `genesis-docs`).
|
|
8
|
+
- **Skill Consolidation**: Merged overlapping skills to resolve duplicated slash commands and clean up the architecture.
|
|
9
|
+
- **Bead Memory Test Coverage**: Added rigorous CLI command validations in `scripts/run-evals.sh` to guarantee that `remember`, `recall`, `prime`, and `forget` function reliably.
|
|
10
|
+
- **Skill Enrichment Directives**: Packaged new visual contract requirements inside `genesis-design-spec` (utilizing `generate_image`) and visual alignment checks inside `genesis-new-design` (utilizing `view_file`).
|
|
11
|
+
- **Verification Evidence**: Structural checks and regression evaluations pass 100% cleanly, confirming absolute stability in the current codebase state. Ready for 0.1.7 release.
|
|
@@ -1,8 +1,8 @@
|
|
|
1
1
|
# Recovery Points
|
|
2
2
|
|
|
3
|
-
**Purpose**: Document where implementation can be paused and resumed without losing context or creating inconsistencies.
|
|
3
|
+
**Purpose**: Document where harness architecture implementation can be paused and resumed without losing context or creating inconsistencies.
|
|
4
4
|
|
|
5
|
-
**Use When**:
|
|
5
|
+
**Use When**: Evolution of the Codex harness (verification loops, CLI tools, scripts) needs to be paused, or when a rollback is necessary due to environment breakage.
|
|
6
6
|
|
|
7
7
|
---
|
|
8
8
|
|
|
@@ -10,429 +10,74 @@
|
|
|
10
10
|
|
|
11
11
|
| Phase | Status | Resumption File | Last Updated |
|
|
12
12
|
|-------|--------|-----------------|--------------|
|
|
13
|
-
|
|
|
14
|
-
|
|
|
15
|
-
|
|
|
16
|
-
|
|
|
17
|
-
| Implementation | ⏸️ In Progress | `src/auth/` | _date_ |
|
|
18
|
-
| Documentation | ⏸️ Pending | `docs/AUTH.md` | _date_ |
|
|
13
|
+
| TUI Mockup Viewer Integration | ✓ Complete | `.codebase/CURRENT_STATE.md` | 2026-06-01 |
|
|
14
|
+
| Harness Verification Streamlining | ✓ Complete | `.codebase/CURRENT_STATE.md` | 2026-06-01 |
|
|
15
|
+
| Bead Memory Regression Tests | ✓ Complete | `scripts/run-evals.sh` | 2026-06-01 |
|
|
16
|
+
| Harness Engineering Overhaul | ⏸️ Idle (Stable) | `scripts/verify.sh` | 2026-06-01 |
|
|
19
17
|
|
|
20
18
|
---
|
|
21
19
|
|
|
22
|
-
## Phase:
|
|
20
|
+
## Phase: Harness Verification Streamlining & Memory Evals
|
|
23
21
|
|
|
24
22
|
**Status**: ✓ Complete
|
|
25
|
-
**
|
|
26
|
-
**End Date**: _date_
|
|
27
|
-
**Resumption**: Not needed
|
|
23
|
+
**Last Updated**: 2026-06-01
|
|
28
24
|
|
|
29
25
|
### What Happened
|
|
30
26
|
|
|
31
|
-
|
|
27
|
+
- Cleaned up legacy/deprecated skills (e.g., `genesis-mvp-planning`, `genesis-release-orchestration`) from `scripts/verify.sh`, `scripts/uninstall.sh`, and `scripts/run-evals.sh`.
|
|
28
|
+
- Removed hard-coded skill name mappings (`expected_name` switch statements), enabling dynamic mapping directly based on directory names.
|
|
29
|
+
- Added test coverage in `run-evals.sh` for the local bead memory commands (`remember`, `recall`, `prime`, `forget`).
|
|
30
|
+
- Enforced `state-machine.md` presence in `verify_harness_skill()`.
|
|
32
31
|
|
|
33
|
-
###
|
|
34
|
-
|
|
35
|
-
Not applicable - planning is complete and locked.
|
|
36
|
-
|
|
37
|
-
---
|
|
38
|
-
|
|
39
|
-
## Phase: Design & Architecture
|
|
40
|
-
|
|
41
|
-
**Status**: ✓ Complete
|
|
42
|
-
**Start Date**: _date_
|
|
43
|
-
**End Date**: _date_
|
|
44
|
-
**Resumption**: Not needed
|
|
45
|
-
|
|
46
|
-
### What Happened
|
|
47
|
-
|
|
48
|
-
System design completed, contracts defined, technology choices made.
|
|
49
|
-
|
|
50
|
-
### Artifacts
|
|
51
|
-
|
|
52
|
-
- `.codebase/ARCHITECTURE.md` - Final design
|
|
53
|
-
- `.codebase/API_CONTRACTS.md` - API specifications
|
|
54
|
-
- `IMPLEMENTATION_HANDOFF.md` - Design decisions
|
|
55
|
-
|
|
56
|
-
### To Resume (If Needed)
|
|
57
|
-
|
|
58
|
-
Not applicable - design is frozen and approved.
|
|
59
|
-
|
|
60
|
-
---
|
|
61
|
-
|
|
62
|
-
## Phase: Test Contracts & Fixtures
|
|
63
|
-
|
|
64
|
-
**Status**: ✓ Complete
|
|
65
|
-
**Start Date**: _date_
|
|
66
|
-
**End Date**: _date_
|
|
67
|
-
**Resumption**: Not needed
|
|
68
|
-
|
|
69
|
-
### What Happened
|
|
70
|
-
|
|
71
|
-
Test contracts created, fixtures built, test infrastructure ready.
|
|
72
|
-
|
|
73
|
-
### Artifacts
|
|
74
|
-
|
|
75
|
-
```
|
|
76
|
-
tests/
|
|
77
|
-
├── contracts/
|
|
78
|
-
│ ├── auth-api.contract.json
|
|
79
|
-
│ └── session-manager.contract.json
|
|
80
|
-
├── fixtures/
|
|
81
|
-
│ ├── oauth-responses.json
|
|
82
|
-
│ └── user-data.json
|
|
83
|
-
└── setup/
|
|
84
|
-
├── test-db-init.sql
|
|
85
|
-
└── mock-providers.ts
|
|
86
|
-
```
|
|
87
|
-
|
|
88
|
-
### To Resume (If Needed)
|
|
89
|
-
|
|
90
|
-
Not applicable - test infrastructure is ready.
|
|
91
|
-
|
|
92
|
-
---
|
|
93
|
-
|
|
94
|
-
## Phase: Core Implementation (CURRENT PAUSE POINT)
|
|
95
|
-
|
|
96
|
-
**Status**: ⏸️ In Progress (50% complete)
|
|
97
|
-
**Start Date**: _date_
|
|
98
|
-
**Current Date**: _YYYY-MM-DD_
|
|
99
|
-
**Estimated Completion**: _date_
|
|
100
|
-
|
|
101
|
-
### What Was Done (Completed)
|
|
102
|
-
|
|
103
|
-
```
|
|
104
|
-
✓ src/auth/oauth-provider.ts
|
|
105
|
-
- OAuth 2.0 flow implementation
|
|
106
|
-
- Provider callback handling
|
|
107
|
-
- Token management
|
|
108
|
-
|
|
109
|
-
✓ src/db/migrations/
|
|
110
|
-
- New users table fields
|
|
111
|
-
- oauth_provider, oauth_id columns
|
|
112
|
-
|
|
113
|
-
✓ tests/auth.test.ts
|
|
114
|
-
- 15 tests written and passing
|
|
115
|
-
- 90% coverage of oauth-provider.ts
|
|
116
|
-
```
|
|
117
|
-
|
|
118
|
-
### What Remains (To Do)
|
|
119
|
-
|
|
120
|
-
```
|
|
121
|
-
⏳ src/auth/session-manager.ts
|
|
122
|
-
- Session creation and validation
|
|
123
|
-
- Session timeout handling
|
|
124
|
-
- Concurrent session management
|
|
125
|
-
|
|
126
|
-
⏳ src/ui/pages/
|
|
127
|
-
- Login component
|
|
128
|
-
- Registration component
|
|
129
|
-
- Error handling UI
|
|
130
|
-
|
|
131
|
-
⏳ API endpoints
|
|
132
|
-
- POST /auth/login
|
|
133
|
-
- POST /auth/register
|
|
134
|
-
- POST /auth/logout
|
|
135
|
-
|
|
136
|
-
⏳ Integration tests
|
|
137
|
-
- End-to-end OAuth flow
|
|
138
|
-
- Database state consistency
|
|
139
|
-
```
|
|
140
|
-
|
|
141
|
-
### Pause State
|
|
142
|
-
|
|
143
|
-
**What's Safe to Pause Here:**
|
|
144
|
-
|
|
145
|
-
The OAuth provider module is isolated and complete. Can pause safely at this point.
|
|
146
|
-
|
|
147
|
-
**Before Pausing:**
|
|
32
|
+
### Safe State Confirmation
|
|
148
33
|
|
|
34
|
+
The harness currently passes all structural tests cleanly.
|
|
149
35
|
```bash
|
|
150
|
-
#
|
|
151
|
-
|
|
152
|
-
git commit -m "feat: OAuth provider implementation (50% complete)"
|
|
36
|
+
# Verify structure
|
|
37
|
+
./scripts/verify.sh
|
|
153
38
|
|
|
154
|
-
#
|
|
155
|
-
|
|
39
|
+
# Verify regression
|
|
40
|
+
./scripts/run-evals.sh
|
|
156
41
|
|
|
157
|
-
#
|
|
158
|
-
|
|
159
|
-
|
|
160
|
-
# 4. Create recovery checklist
|
|
161
|
-
# (See section below)
|
|
162
|
-
```
|
|
163
|
-
|
|
164
|
-
### To Resume From Here
|
|
165
|
-
|
|
166
|
-
**Step 1: Environment Setup** (5 min)
|
|
167
|
-
|
|
168
|
-
```bash
|
|
169
|
-
# Pull latest changes
|
|
170
|
-
git pull origin main
|
|
171
|
-
|
|
172
|
-
# Verify you're on the right branch
|
|
173
|
-
git branch -v
|
|
174
|
-
|
|
175
|
-
# Install dependencies
|
|
176
|
-
npm install
|
|
177
|
-
|
|
178
|
-
# Check database is migrated
|
|
179
|
-
npm run db:status
|
|
180
|
-
```
|
|
181
|
-
|
|
182
|
-
**Step 2: Verify Previous Work** (5 min)
|
|
183
|
-
|
|
184
|
-
```bash
|
|
185
|
-
# Run existing tests
|
|
186
|
-
npm test -- auth.test.ts
|
|
187
|
-
# Expected: 15 passing
|
|
188
|
-
|
|
189
|
-
# Check module compiles
|
|
190
|
-
npm run build
|
|
191
|
-
|
|
192
|
-
# Verify no uncommitted changes
|
|
193
|
-
git status
|
|
194
|
-
```
|
|
195
|
-
|
|
196
|
-
**Step 3: Review What's Next** (10 min)
|
|
197
|
-
|
|
198
|
-
```bash
|
|
199
|
-
# Read the handoff document
|
|
200
|
-
cat IMPLEMENTATION_HANDOFF.md
|
|
201
|
-
|
|
202
|
-
# Check what was planned next
|
|
203
|
-
cat .codebase/CURRENT_STATE.md
|
|
204
|
-
|
|
205
|
-
# Review test contract
|
|
206
|
-
cat tests/contracts/auth-api.contract.json
|
|
207
|
-
```
|
|
208
|
-
|
|
209
|
-
**Step 4: Resume Implementation** (30 min+)
|
|
210
|
-
|
|
211
|
-
Start with session-manager.ts:
|
|
212
|
-
|
|
213
|
-
```bash
|
|
214
|
-
# Create new branch (if needed)
|
|
215
|
-
git checkout -b continue/auth-session
|
|
216
|
-
|
|
217
|
-
# Open the next module
|
|
218
|
-
code src/auth/session-manager.ts
|
|
219
|
-
|
|
220
|
-
# Reference the test contract
|
|
221
|
-
code tests/contracts/session-manager.contract.json
|
|
222
|
-
|
|
223
|
-
# Start implementing...
|
|
224
|
-
```
|
|
225
|
-
|
|
226
|
-
---
|
|
227
|
-
|
|
228
|
-
## Phase: Session Management (NEXT PHASE)
|
|
229
|
-
|
|
230
|
-
**Status**: ⏳ Ready to start
|
|
231
|
-
**Estimated Start**: _date_
|
|
232
|
-
**Estimated Duration**: 4-6 hours
|
|
233
|
-
**Estimator**: Original developer
|
|
234
|
-
|
|
235
|
-
### What Needs to Happen
|
|
236
|
-
|
|
237
|
-
1. **Implement session-manager.ts**
|
|
238
|
-
- Create session on login
|
|
239
|
-
- Validate session on requests
|
|
240
|
-
- Handle timeout
|
|
241
|
-
|
|
242
|
-
2. **Write session tests**
|
|
243
|
-
- Session creation tests
|
|
244
|
-
- Validation tests
|
|
245
|
-
- Timeout tests
|
|
246
|
-
- Concurrent session tests
|
|
247
|
-
|
|
248
|
-
3. **Update database**
|
|
249
|
-
- Add sessions table migration
|
|
250
|
-
- Add indexes for performance
|
|
251
|
-
|
|
252
|
-
### Acceptance Criteria
|
|
253
|
-
|
|
254
|
-
```
|
|
255
|
-
- [ ] Session manager module complete
|
|
256
|
-
- [ ] 12+ session tests written and passing
|
|
257
|
-
- [ ] Coverage ≥ 85%
|
|
258
|
-
- [ ] No performance regressions
|
|
259
|
-
- [ ] Integrated with OAuth provider
|
|
260
|
-
```
|
|
261
|
-
|
|
262
|
-
### Dependencies
|
|
263
|
-
|
|
264
|
-
- ✓ OAuth provider complete (already done)
|
|
265
|
-
- ✓ Database migrations (already done)
|
|
266
|
-
- ⏳ Needs: Node.js session library installed
|
|
267
|
-
|
|
268
|
-
### Blockers
|
|
269
|
-
|
|
270
|
-
None identified. Ready to proceed.
|
|
271
|
-
|
|
272
|
-
---
|
|
273
|
-
|
|
274
|
-
## Phase: UI Components (AFTER SESSION)
|
|
275
|
-
|
|
276
|
-
**Status**: ⏳ Ready to start
|
|
277
|
-
**Estimated Start**: _date_ (after session phase)
|
|
278
|
-
**Estimated Duration**: 6-8 hours
|
|
279
|
-
|
|
280
|
-
### What Needs to Happen
|
|
281
|
-
|
|
282
|
-
1. **Build Login Page**
|
|
283
|
-
- OAuth provider buttons
|
|
284
|
-
- Error messaging
|
|
285
|
-
- Loading states
|
|
286
|
-
|
|
287
|
-
2. **Build Register Page**
|
|
288
|
-
- Form validation
|
|
289
|
-
- Provider linking
|
|
290
|
-
- Success messaging
|
|
291
|
-
|
|
292
|
-
3. **Update Layout**
|
|
293
|
-
- Add auth header
|
|
294
|
-
- Add logout button
|
|
295
|
-
- Add user menu
|
|
296
|
-
|
|
297
|
-
### Files to Modify
|
|
298
|
-
|
|
299
|
-
```
|
|
300
|
-
src/ui/pages/
|
|
301
|
-
├── login.tsx (new)
|
|
302
|
-
├── register.tsx (new)
|
|
303
|
-
└── layout.tsx (modify)
|
|
304
|
-
|
|
305
|
-
src/ui/components/
|
|
306
|
-
├── oauth-button.tsx (new)
|
|
307
|
-
├── auth-error.tsx (new)
|
|
308
|
-
└── user-menu.tsx (modify)
|
|
309
|
-
|
|
310
|
-
styles/
|
|
311
|
-
├── auth.css (new)
|
|
312
|
-
```
|
|
313
|
-
|
|
314
|
-
### Acceptance Criteria
|
|
315
|
-
|
|
316
|
-
```
|
|
317
|
-
- [ ] Login page working
|
|
318
|
-
- [ ] Register page working
|
|
319
|
-
- [ ] OAuth provider flow works E2E
|
|
320
|
-
- [ ] Error messages display correctly
|
|
321
|
-
- [ ] Mobile responsive
|
|
322
|
-
- [ ] Browser compatibility tested
|
|
323
|
-
```
|
|
324
|
-
|
|
325
|
-
---
|
|
326
|
-
|
|
327
|
-
## Phase: Documentation & Testing (FINAL PHASE)
|
|
328
|
-
|
|
329
|
-
**Status**: ⏳ Ready to start
|
|
330
|
-
**Estimated Start**: _date_ (after UI phase)
|
|
331
|
-
**Estimated Duration**: 4 hours
|
|
332
|
-
|
|
333
|
-
### What Needs to Happen
|
|
334
|
-
|
|
335
|
-
1. **Update Documentation**
|
|
336
|
-
- README auth section
|
|
337
|
-
- Setup guide
|
|
338
|
-
- API documentation
|
|
339
|
-
- Troubleshooting guide
|
|
340
|
-
|
|
341
|
-
2. **Final Testing**
|
|
342
|
-
- Full E2E test suite
|
|
343
|
-
- Performance testing
|
|
344
|
-
- Security testing
|
|
345
|
-
- Browser compatibility
|
|
346
|
-
|
|
347
|
-
3. **Code Review & QA**
|
|
348
|
-
- Team code review
|
|
349
|
-
- QA testing
|
|
350
|
-
- Security review
|
|
351
|
-
|
|
352
|
-
### Acceptance Criteria
|
|
353
|
-
|
|
354
|
-
```
|
|
355
|
-
- [ ] All documentation complete
|
|
356
|
-
- [ ] All E2E tests passing
|
|
357
|
-
- [ ] Code review approved
|
|
358
|
-
- [ ] No security issues
|
|
359
|
-
- [ ] Performance acceptable
|
|
360
|
-
- [ ] Ready for production
|
|
42
|
+
# Dry-run package integrity
|
|
43
|
+
npm run pack:check
|
|
361
44
|
```
|
|
362
45
|
|
|
363
46
|
---
|
|
364
47
|
|
|
365
48
|
## Rollback Points
|
|
366
49
|
|
|
367
|
-
### If
|
|
50
|
+
### If A Future Harness Evolution Breaks the CLI/Environment
|
|
368
51
|
|
|
369
|
-
**Rollback Level 1: Last
|
|
52
|
+
**Rollback Level 1: Last Stable Run (Current State)**
|
|
53
|
+
If a new change to `bin/genesis-harness.js` or `scripts/verify.sh` creates infinite loops or immediate failures:
|
|
370
54
|
```bash
|
|
371
|
-
git
|
|
55
|
+
git checkout -- bin/genesis-harness.js scripts/verify.sh scripts/run-evals.sh
|
|
372
56
|
npm install
|
|
373
|
-
|
|
57
|
+
./scripts/verify.sh
|
|
374
58
|
```
|
|
375
59
|
|
|
376
|
-
**Rollback Level 2:
|
|
60
|
+
**Rollback Level 2: Full Repository Reset**
|
|
61
|
+
If tests are failing in a manner that contaminates local fixtures or memory:
|
|
377
62
|
```bash
|
|
378
|
-
git
|
|
63
|
+
git reset --hard HEAD
|
|
64
|
+
git clean -fd
|
|
379
65
|
npm install
|
|
380
|
-
|
|
381
|
-
# Verify everything works on main
|
|
382
|
-
```
|
|
383
|
-
|
|
384
|
-
**Rollback Level 3: Database Rollback**
|
|
385
|
-
```bash
|
|
386
|
-
npm run db:rollback -- auth-v1
|
|
387
|
-
# Removes auth tables/fields
|
|
388
|
-
# App reverts to state before auth changes
|
|
66
|
+
./scripts/verify.sh
|
|
389
67
|
```
|
|
390
68
|
|
|
391
69
|
---
|
|
392
70
|
|
|
393
|
-
## Checklist: Before Pausing Work
|
|
394
|
-
|
|
395
|
-
- [ ] All current tests passing
|
|
396
|
-
- [ ] Code builds without errors
|
|
397
|
-
- [ ] No uncommitted changes
|
|
398
|
-
- [ ] Commit message written (describe state)
|
|
399
|
-
- [ ] This Recovery Points file updated
|
|
400
|
-
- [ ] Next developer notified
|
|
401
|
-
- [ ] Slack message sent with status
|
|
402
|
-
|
|
403
|
-
---
|
|
404
|
-
|
|
405
|
-
## Checklist: When Resuming Work
|
|
71
|
+
## Checklist: Before Pausing Work on Harness Evolutions
|
|
406
72
|
|
|
407
|
-
- [ ]
|
|
408
|
-
- [ ]
|
|
409
|
-
- [ ]
|
|
410
|
-
- [ ]
|
|
411
|
-
- [ ]
|
|
412
|
-
- [ ] Update status in this file (what phase are you on?)
|
|
413
|
-
- [ ] Begin implementation from current phase
|
|
414
|
-
|
|
415
|
-
---
|
|
416
|
-
|
|
417
|
-
## Known Issues That Might Affect Resumption
|
|
418
|
-
|
|
419
|
-
| Issue | Severity | Workaround | Fixed? |
|
|
420
|
-
|-------|----------|-----------|--------|
|
|
421
|
-
| OAuth token refresh timing | Low | Add 5s buffer | ⏳ TODO |
|
|
422
|
-
| Database connection pooling | Medium | Use pool size 10 | ✓ Fixed |
|
|
423
|
-
| Rate limiting missing | Low | Not blocking | ⏳ TODO |
|
|
73
|
+
- [ ] `scripts/verify.sh` passing cleanly (Exit Code 0)
|
|
74
|
+
- [ ] `scripts/run-evals.sh` passing cleanly (Exit Code 0)
|
|
75
|
+
- [ ] Script files verified for POSIX/LF line endings
|
|
76
|
+
- [ ] No uncommitted changes in core scripts that break existing workflows
|
|
77
|
+
- [ ] `.codebase/CURRENT_STATE.md` updated with exact phase details
|
|
424
78
|
|
|
425
79
|
---
|
|
426
80
|
|
|
427
81
|
## Contact For Questions
|
|
428
|
-
|
|
429
|
-
**
|
|
430
|
-
**Current Owner**: _Name_ (_email_)
|
|
431
|
-
**Slack Channel**: #auth-feature
|
|
432
|
-
|
|
433
|
-
---
|
|
434
|
-
|
|
435
|
-
**Last Updated**: _YYYY-MM-DD HH:MM_
|
|
436
|
-
**Updated By**: _Name_
|
|
437
|
-
**Next Review**: _YYYY-MM-DD_
|
|
438
|
-
**Paused At**: _Phase: Core Implementation_
|
|
82
|
+
**Owner**: Codex Harness Engineering Team
|
|
83
|
+
**Last Validated**: 2026-06-01
|
|
@@ -0,0 +1,16 @@
|
|
|
1
|
+
[
|
|
2
|
+
{
|
|
3
|
+
"id": "212329",
|
|
4
|
+
"category": "general",
|
|
5
|
+
"fact": "Always use PORT 8080 when running development servers locally.",
|
|
6
|
+
"timestamp": "2026-06-01T07:22:12.428Z",
|
|
7
|
+
"git_commit": "92d8138"
|
|
8
|
+
},
|
|
9
|
+
{
|
|
10
|
+
"id": "25d72a",
|
|
11
|
+
"category": "api",
|
|
12
|
+
"fact": "Verify request.json schema before running visual regression tests.",
|
|
13
|
+
"timestamp": "2026-06-01T07:22:33.273Z",
|
|
14
|
+
"git_commit": "92d8138"
|
|
15
|
+
}
|
|
16
|
+
]
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: ai-provider
|
|
2
|
+
name: genesis-ai-provider
|
|
3
3
|
description: Define reliable AI provider harnesses with mock providers, schemas, retries, invalid JSON recovery, persistence tests, and provider contracts. Use for LLM, image, voice, subtitle, render, and pipeline provider work.
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: api-sync
|
|
2
|
+
name: genesis-api-sync
|
|
3
3
|
description: Automatic API contract synchronization. Detects API changes in implementation, updates API_CONTRACTS.md, regenerates test contracts, and maintains backward compatibility documentation. Use after API-related implementation or when contracts drift from actual code.
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: architecture
|
|
2
|
+
name: genesis-architecture
|
|
3
3
|
description: Guide Codex through architecture analysis, boundary decisions, dependency direction, module ownership, and architecture documentation. Use before structural changes, new subsystems, cross-module work, or when architecture memory must be updated.
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: codebase-map
|
|
2
|
+
name: genesis-codebase-map
|
|
3
3
|
description: Maintain repository memory and compressed codebase maps for token-efficient Codex operation. Use when files move, modules change, contracts change, tests change, or repository summaries need updating.
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: debug-guide
|
|
2
|
+
name: genesis-debug-guide
|
|
3
3
|
description: "Systematic debugging for test failures, runtime errors, and production bugs. Follows TDD debugging patterns, isolation strategies, and observability-driven root cause analysis. Auto-triggers after bug fixes to verify fix quality and prevent regressions."
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: design-spec
|
|
2
|
+
name: genesis-design-spec
|
|
3
3
|
description: Define UI design specs, route contracts, visual states, accessibility expectations, and UI/API synchronization before frontend implementation. Use for new screens, redesigns, visual QA, or UI contract updates.
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -18,7 +18,7 @@ Do not use for backend-only changes with no UI effect.
|
|
|
18
18
|
UI route, audience, state list, API dependencies, fixtures, and visual references if available.
|
|
19
19
|
|
|
20
20
|
## Outputs required
|
|
21
|
-
UI contract, Playwright fixture, acceptance states, and visual regression expectations.
|
|
21
|
+
UI contract, a generated visual mockup image representing the screen layout (saved as `mockup.png` in the active feature/bug directory), Playwright fixture, acceptance states, and visual regression expectations.
|
|
22
22
|
|
|
23
23
|
## Required tests
|
|
24
24
|
Create load, interaction, validation, API sync, and visual regression tests where applicable.
|
|
@@ -36,7 +36,7 @@ Update `.codebase/UI_ROUTES.md` and frontend summary.
|
|
|
36
36
|
Use state tables and route maps instead of long visual prose.
|
|
37
37
|
|
|
38
38
|
## Acceptance criteria
|
|
39
|
-
The UI can be implemented and tested without guessing states or API behavior.
|
|
39
|
+
The UI can be implemented and tested without guessing states or API behavior. A high-fidelity visual mockup `mockup.png` must be created using the `generate_image` tool and saved in the active task directory as the visual contract.
|
|
40
40
|
|
|
41
41
|
## Common mistakes
|
|
42
42
|
Designing only the happy path, skipping empty/error states, and omitting API synchronization.
|
|
@@ -19,7 +19,7 @@ Use for project initialization, planning, feature work, bug fixes, audits, revie
|
|
|
19
19
|
Do not use for simple read-only answers that do not require repository workflow or durable artifacts.
|
|
20
20
|
|
|
21
21
|
## Inputs required
|
|
22
|
-
|
|
22
|
+
Run `genesis-harness status` and `genesis-harness docs` (or `genesis-harness prime`) IMMEDIATELY on boot to retrieve a high-fidelity, token-minimized FSM state, active roadmap checklist, memory beads, and API contract catalog. Do NOT manually run search commands or read entire planning folders to discover the repository state. Only inspect specific relevant code files.
|
|
23
23
|
|
|
24
24
|
## Outputs required
|
|
25
25
|
Plan or implementation artifact, tests, fixtures, verification evidence, docs sync, and codebase memory updates.
|
|
@@ -37,7 +37,7 @@ Update API, agent, event, or UI contracts when public behavior changes.
|
|
|
37
37
|
Update `.codebase` memory after meaningful changes.
|
|
38
38
|
|
|
39
39
|
## Token saving rules
|
|
40
|
-
Read summaries before source files, maps before modules, and avoid loading the entire repository.
|
|
40
|
+
Always prioritize running standard CLI queries (`genesis-harness status`, `docs`, `recall`, and `prime`) to pull structured data rather than traversing directories manually. Read summaries before source files, maps before modules, and avoid loading the entire repository.
|
|
41
41
|
|
|
42
42
|
## Acceptance criteria
|
|
43
43
|
Work is complete only when tests pass, contracts and docs are current, and verification evidence is reported.
|
|
@@ -170,6 +170,7 @@ Bundled references live under `references/`. Load them only when needed:
|
|
|
170
170
|
- `references/planning-schema.md`: detailed `.planning/` file meanings and required subtrees.
|
|
171
171
|
- `references/research-rubric.md`: local/external evidence format for research.
|
|
172
172
|
- `references/quality-rubric.md`: scoring rubric for `QUALITY_SCORE.md`.
|
|
173
|
+
- `references/state-machine.md`: FSM state definitions, transitions, and verification gates.
|
|
173
174
|
|
|
174
175
|
Bundled scripts live under `scripts/`. Prefer copying or adapting these into `.planning/scripts/` or project scripts during `/init`:
|
|
175
176
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: harness-engineering
|
|
2
|
+
name: genesis-harness-engineering
|
|
3
3
|
description: "Evolve the Codex harness itself: verification loops, repository memory, test-first scaffolds, resumability, observability, and autonomous workflow reliability. Use for changes to this repository's skill system or harness architecture."
|
|
4
4
|
---
|
|
5
5
|
|
|
@@ -17,7 +17,7 @@ Use when building a new web page, app screen, dashboard, tool, landing page, or
|
|
|
17
17
|
Do not use for redesigning existing UI without first preserving behavior; use `genesis-upgrade-design` instead.
|
|
18
18
|
|
|
19
19
|
## Inputs required
|
|
20
|
-
Product intent, target users, primary workflow, stack details, route or entry point, state list, and visual
|
|
20
|
+
Product intent, target users, primary workflow, stack details, route or entry point, state list, visual constraints, and the generated visual contract image `mockup.png` (which MUST be loaded via the `view_file` tool to inspect layouts, color choices, and dimensions before coding).
|
|
21
21
|
|
|
22
22
|
## Outputs required
|
|
23
23
|
Implemented UI, UI contract, fixtures, responsive states, visual verification, and docs or memory updates.
|
|
@@ -55,6 +55,7 @@ If visual output fails, capture screenshot evidence, update the fixture or contr
|
|
|
55
55
|
|
|
56
56
|
2. Define the design intent from the request:
|
|
57
57
|
- Identify audience, product category, primary task, density, tone, and constraints.
|
|
58
|
+
- Proactively inspect `mockup.png` using the `view_file` tool to absorb the visual direction, color system, and placement details.
|
|
58
59
|
- Choose one clear visual direction and commit to it across typography, color, spacing, surfaces, iconography, and motion.
|
|
59
60
|
- For tools, dashboards, and operational apps, prioritize scanning, repeated use, compact controls, and predictable navigation over decorative hero layouts.
|
|
60
61
|
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
---
|
|
2
|
-
name: pipeline-orchestration
|
|
2
|
+
name: genesis-pipeline-orchestration
|
|
3
3
|
description: Run end-to-end Codex harness orchestration phases from repository analysis through tests, fixtures, implementation, contracts, memory, docs, and change summaries. Use for autonomous multi-phase work.
|
|
4
4
|
---
|
|
5
5
|
|