codex-genesis-harness 0.1.6 → 0.1.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (62) hide show
  1. package/.codebase/CURRENT_STATE.md +8 -7
  2. package/.codebase/RECOVERY_POINTS.md +38 -393
  3. package/.codebase/beads.json +16 -0
  4. package/.codex/skills/genesis-ai-provider/SKILL.md +1 -1
  5. package/.codex/skills/genesis-api-contract/SKILL.md +1 -1
  6. package/.codex/skills/genesis-api-sync/SKILL.md +1 -1
  7. package/.codex/skills/genesis-architecture/SKILL.md +1 -1
  8. package/.codex/skills/genesis-codebase-map/SKILL.md +1 -1
  9. package/.codex/skills/genesis-debug-guide/SKILL.md +1 -1
  10. package/.codex/skills/genesis-design-spec/SKILL.md +3 -3
  11. package/.codex/skills/genesis-harness/SKILL.md +3 -2
  12. package/.codex/skills/genesis-harness-engineering/SKILL.md +1 -1
  13. package/.codex/skills/genesis-new-design/SKILL.md +2 -1
  14. package/.codex/skills/genesis-pipeline-orchestration/SKILL.md +1 -1
  15. package/.codex/skills/genesis-planning/SKILL.md +26 -1
  16. package/.codex/skills/genesis-release/SKILL.md +24 -1
  17. package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/SKILL.md +1 -1
  18. package/CHANGELOG.md +14 -0
  19. package/README.EN.md +7 -11
  20. package/README.VI.md +9 -13
  21. package/README.md +10 -10
  22. package/VERSION +1 -1
  23. package/bin/genesis-harness.js +606 -6
  24. package/package.json +1 -1
  25. package/scripts/install.sh +2 -6
  26. package/scripts/run-evals.sh +20 -25
  27. package/scripts/uninstall.sh +2 -6
  28. package/scripts/verify.sh +6 -66
  29. package/.codex/skills/genesis-docs/SKILL.md +0 -46
  30. package/.codex/skills/genesis-docs/agents/openai.yaml +0 -7
  31. package/.codex/skills/genesis-mvp-planning/SKILL.md +0 -114
  32. package/.codex/skills/genesis-mvp-planning/agents/openai.yaml +0 -6
  33. package/.codex/skills/genesis-release-orchestration/SKILL.md +0 -653
  34. package/.codex/skills/genesis-release-orchestration/agents/openai.yaml +0 -7
  35. package/.codex/skills/genesis-research/SKILL.md +0 -46
  36. package/.codex/skills/genesis-research/agents/openai.yaml +0 -7
  37. /package/.codex/skills/{genesis-docs/checklists/checklist.md → genesis-docs-automation/checklists/manual-docs-checklist.md} +0 -0
  38. /package/.codex/skills/{genesis-docs/examples/example.md → genesis-docs-automation/examples/manual-docs-example.md} +0 -0
  39. /package/.codex/skills/{genesis-docs → genesis-docs-automation}/templates/docs-update-template.md +0 -0
  40. /package/.codex/skills/{genesis-state-machine/SKILL.md → genesis-harness/references/state-machine.md} +0 -0
  41. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/checklists/mvp-readiness.md +0 -0
  42. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/examples/5-phase-roadmap-example.md +0 -0
  43. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-1-core.md +0 -0
  44. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-2-auth.md +0 -0
  45. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-3-features.md +0 -0
  46. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-4-integrations.md +0 -0
  47. /package/.codex/skills/{genesis-mvp-planning → genesis-planning}/templates/phase-5-readiness.md +0 -0
  48. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/checklists/post-deployment-verification.md +0 -0
  49. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/checklists/pre-release-validation.md +0 -0
  50. /package/.codex/skills/{genesis-release-orchestration/examples/example.md → genesis-release/examples/orchestration-example.md} +0 -0
  51. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/observability/release-tracking.md +0 -0
  52. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/playbooks/canary-deployment-orchestration.md +0 -0
  53. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/playbooks/semantic-versioning-automation.md +0 -0
  54. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/templates/deployment-strategy-template.md +0 -0
  55. /package/.codex/skills/{genesis-release-orchestration → genesis-release}/templates/release-runbook-template.md +0 -0
  56. /package/.codex/skills/{genesis-research → genesis-research-first}/checklists/checklist.md +0 -0
  57. /package/.codex/skills/{genesis-research/examples/example.md → genesis-research-first/examples/manual-research-example.md} +0 -0
  58. /package/.codex/skills/{genesis-research → genesis-research-first}/templates/research-note-template.md +0 -0
  59. /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/agents/openai.yaml +0 -0
  60. /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/checklists/checklist.md +0 -0
  61. /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/examples/example.md +0 -0
  62. /package/.codex/skills/{ui-ux-test-skill → genesis-ui-ux-test}/templates/playwright-test-template.md +0 -0
@@ -1,10 +1,11 @@
1
1
  # Current State: COMPLETED
2
- Last updated: Mon Jun 01 10:30:00 +07 2026
2
+ Last updated: Mon Jun 01 17:15:00 +07 2026
3
3
 
4
4
  ## Reason
5
- Completed the comprehensive refactoring and bilingual polish of the release documentation (`README.md`, `README.EN.md`, `README.VI.md`) for the `0.1.6` release.
6
- - Highlighted the **Harness Architecture** core pillars and subsystems (Context Compaction, Tool Call Offloading, Self-Healing Verify-Fix Loops, 5-Phase MVP Roadmap, and Zero-Drift Gates).
7
- - Presented a crisp **Standard Agent vs. Harness** comparison table to clearly explain its architectural advantages.
8
- - Cleaned up duplicate installation segments and standardized all skill catalogs on 25 skills, including registering the missing `genesis-mvp-planning` skill in the Vietnamese documentation table.
9
- - Verified all structural, installation, and packaging pipelines successfully (`npm run verify && npm run eval && npm run pack:check` all pass 100% cleanly).
10
-
5
+ Successfully completed the Visual Mockup Generation & Interactive TUI Mockup Viewer integration (v0.3.0), followed by Harness Engineering standardizations and preparation for release `0.1.7`:
6
+ - **Interactive Keyboard-Navigated CLI TUI**: Developed an elegant console interface for `genesis-harness view-mockup` capturing stdin keypresses.
7
+ - **Harness Verification Streamlining**: Refactored `scripts/verify.sh` and `scripts/run-evals.sh` to dynamically evaluate skill names, removing legacy hard-coded mapping logic. Cleaned up deprecated skills (e.g., `genesis-mvp-planning`, `genesis-release-orchestration`, `genesis-state-machine`, `genesis-research`, `genesis-docs`).
8
+ - **Skill Consolidation**: Merged overlapping skills to resolve duplicated slash commands and clean up the architecture.
9
+ - **Bead Memory Test Coverage**: Added rigorous CLI command validations in `scripts/run-evals.sh` to guarantee that `remember`, `recall`, `prime`, and `forget` function reliably.
10
+ - **Skill Enrichment Directives**: Packaged new visual contract requirements inside `genesis-design-spec` (utilizing `generate_image`) and visual alignment checks inside `genesis-new-design` (utilizing `view_file`).
11
+ - **Verification Evidence**: Structural checks and regression evaluations pass 100% cleanly, confirming absolute stability in the current codebase state. Ready for 0.1.7 release.
@@ -1,8 +1,8 @@
1
1
  # Recovery Points
2
2
 
3
- **Purpose**: Document where implementation can be paused and resumed without losing context or creating inconsistencies.
3
+ **Purpose**: Document where harness architecture implementation can be paused and resumed without losing context or creating inconsistencies.
4
4
 
5
- **Use When**: Work needs to be paused, passed to another developer, or interrupted by higher priority work.
5
+ **Use When**: Evolution of the Codex harness (verification loops, CLI tools, scripts) needs to be paused, or when a rollback is necessary due to environment breakage.
6
6
 
7
7
  ---
8
8
 
@@ -10,429 +10,74 @@
10
10
 
11
11
  | Phase | Status | Resumption File | Last Updated |
12
12
  |-------|--------|-----------------|--------------|
13
- | Planning | ✓ Complete | `.codebase/CURRENT_STATE.md` | _date_ |
14
- | Design | ✓ Complete | IMPLEMENTATION_HANDOFF.md | _date_ |
15
- | Contracts | ✓ Complete | `.codebase/API_CONTRACTS.md` | _date_ |
16
- | Tests | ⏸️ In Progress | `tests/auth.test.ts` | _date_ |
17
- | Implementation | ⏸️ In Progress | `src/auth/` | _date_ |
18
- | Documentation | ⏸️ Pending | `docs/AUTH.md` | _date_ |
13
+ | TUI Mockup Viewer Integration | ✓ Complete | `.codebase/CURRENT_STATE.md` | 2026-06-01 |
14
+ | Harness Verification Streamlining | ✓ Complete | `.codebase/CURRENT_STATE.md` | 2026-06-01 |
15
+ | Bead Memory Regression Tests | ✓ Complete | `scripts/run-evals.sh` | 2026-06-01 |
16
+ | Harness Engineering Overhaul | ⏸️ Idle (Stable) | `scripts/verify.sh` | 2026-06-01 |
19
17
 
20
18
  ---
21
19
 
22
- ## Phase: Planning
20
+ ## Phase: Harness Verification Streamlining & Memory Evals
23
21
 
24
22
  **Status**: ✓ Complete
25
- **Start Date**: _date_
26
- **End Date**: _date_
27
- **Resumption**: Not needed
23
+ **Last Updated**: 2026-06-01
28
24
 
29
25
  ### What Happened
30
26
 
31
- All requirements gathered, validated, and documented.
27
+ - Cleaned up legacy/deprecated skills (e.g., `genesis-mvp-planning`, `genesis-release-orchestration`) from `scripts/verify.sh`, `scripts/uninstall.sh`, and `scripts/run-evals.sh`.
28
+ - Removed hard-coded skill name mappings (`expected_name` switch statements), enabling dynamic mapping directly based on directory names.
29
+ - Added test coverage in `run-evals.sh` for the local bead memory commands (`remember`, `recall`, `prime`, `forget`).
30
+ - Enforced `state-machine.md` presence in `verify_harness_skill()`.
32
31
 
33
- ### To Resume (If Needed)
34
-
35
- Not applicable - planning is complete and locked.
36
-
37
- ---
38
-
39
- ## Phase: Design & Architecture
40
-
41
- **Status**: ✓ Complete
42
- **Start Date**: _date_
43
- **End Date**: _date_
44
- **Resumption**: Not needed
45
-
46
- ### What Happened
47
-
48
- System design completed, contracts defined, technology choices made.
49
-
50
- ### Artifacts
51
-
52
- - `.codebase/ARCHITECTURE.md` - Final design
53
- - `.codebase/API_CONTRACTS.md` - API specifications
54
- - `IMPLEMENTATION_HANDOFF.md` - Design decisions
55
-
56
- ### To Resume (If Needed)
57
-
58
- Not applicable - design is frozen and approved.
59
-
60
- ---
61
-
62
- ## Phase: Test Contracts & Fixtures
63
-
64
- **Status**: ✓ Complete
65
- **Start Date**: _date_
66
- **End Date**: _date_
67
- **Resumption**: Not needed
68
-
69
- ### What Happened
70
-
71
- Test contracts created, fixtures built, test infrastructure ready.
72
-
73
- ### Artifacts
74
-
75
- ```
76
- tests/
77
- ├── contracts/
78
- │ ├── auth-api.contract.json
79
- │ └── session-manager.contract.json
80
- ├── fixtures/
81
- │ ├── oauth-responses.json
82
- │ └── user-data.json
83
- └── setup/
84
- ├── test-db-init.sql
85
- └── mock-providers.ts
86
- ```
87
-
88
- ### To Resume (If Needed)
89
-
90
- Not applicable - test infrastructure is ready.
91
-
92
- ---
93
-
94
- ## Phase: Core Implementation (CURRENT PAUSE POINT)
95
-
96
- **Status**: ⏸️ In Progress (50% complete)
97
- **Start Date**: _date_
98
- **Current Date**: _YYYY-MM-DD_
99
- **Estimated Completion**: _date_
100
-
101
- ### What Was Done (Completed)
102
-
103
- ```
104
- ✓ src/auth/oauth-provider.ts
105
- - OAuth 2.0 flow implementation
106
- - Provider callback handling
107
- - Token management
108
-
109
- ✓ src/db/migrations/
110
- - New users table fields
111
- - oauth_provider, oauth_id columns
112
-
113
- ✓ tests/auth.test.ts
114
- - 15 tests written and passing
115
- - 90% coverage of oauth-provider.ts
116
- ```
117
-
118
- ### What Remains (To Do)
119
-
120
- ```
121
- ⏳ src/auth/session-manager.ts
122
- - Session creation and validation
123
- - Session timeout handling
124
- - Concurrent session management
125
-
126
- ⏳ src/ui/pages/
127
- - Login component
128
- - Registration component
129
- - Error handling UI
130
-
131
- ⏳ API endpoints
132
- - POST /auth/login
133
- - POST /auth/register
134
- - POST /auth/logout
135
-
136
- ⏳ Integration tests
137
- - End-to-end OAuth flow
138
- - Database state consistency
139
- ```
140
-
141
- ### Pause State
142
-
143
- **What's Safe to Pause Here:**
144
-
145
- The OAuth provider module is isolated and complete. Can pause safely at this point.
146
-
147
- **Before Pausing:**
32
+ ### Safe State Confirmation
148
33
 
34
+ The harness currently passes all structural tests cleanly.
149
35
  ```bash
150
- # 1. Commit current progress
151
- git add -A
152
- git commit -m "feat: OAuth provider implementation (50% complete)"
36
+ # Verify structure
37
+ ./scripts/verify.sh
153
38
 
154
- # 2. Verify tests pass
155
- npm test
39
+ # Verify regression
40
+ ./scripts/run-evals.sh
156
41
 
157
- # 3. Document pause point
158
- # (You are reading this file!)
159
-
160
- # 4. Create recovery checklist
161
- # (See section below)
162
- ```
163
-
164
- ### To Resume From Here
165
-
166
- **Step 1: Environment Setup** (5 min)
167
-
168
- ```bash
169
- # Pull latest changes
170
- git pull origin main
171
-
172
- # Verify you're on the right branch
173
- git branch -v
174
-
175
- # Install dependencies
176
- npm install
177
-
178
- # Check database is migrated
179
- npm run db:status
180
- ```
181
-
182
- **Step 2: Verify Previous Work** (5 min)
183
-
184
- ```bash
185
- # Run existing tests
186
- npm test -- auth.test.ts
187
- # Expected: 15 passing
188
-
189
- # Check module compiles
190
- npm run build
191
-
192
- # Verify no uncommitted changes
193
- git status
194
- ```
195
-
196
- **Step 3: Review What's Next** (10 min)
197
-
198
- ```bash
199
- # Read the handoff document
200
- cat IMPLEMENTATION_HANDOFF.md
201
-
202
- # Check what was planned next
203
- cat .codebase/CURRENT_STATE.md
204
-
205
- # Review test contract
206
- cat tests/contracts/auth-api.contract.json
207
- ```
208
-
209
- **Step 4: Resume Implementation** (30 min+)
210
-
211
- Start with session-manager.ts:
212
-
213
- ```bash
214
- # Create new branch (if needed)
215
- git checkout -b continue/auth-session
216
-
217
- # Open the next module
218
- code src/auth/session-manager.ts
219
-
220
- # Reference the test contract
221
- code tests/contracts/session-manager.contract.json
222
-
223
- # Start implementing...
224
- ```
225
-
226
- ---
227
-
228
- ## Phase: Session Management (NEXT PHASE)
229
-
230
- **Status**: ⏳ Ready to start
231
- **Estimated Start**: _date_
232
- **Estimated Duration**: 4-6 hours
233
- **Estimator**: Original developer
234
-
235
- ### What Needs to Happen
236
-
237
- 1. **Implement session-manager.ts**
238
- - Create session on login
239
- - Validate session on requests
240
- - Handle timeout
241
-
242
- 2. **Write session tests**
243
- - Session creation tests
244
- - Validation tests
245
- - Timeout tests
246
- - Concurrent session tests
247
-
248
- 3. **Update database**
249
- - Add sessions table migration
250
- - Add indexes for performance
251
-
252
- ### Acceptance Criteria
253
-
254
- ```
255
- - [ ] Session manager module complete
256
- - [ ] 12+ session tests written and passing
257
- - [ ] Coverage ≥ 85%
258
- - [ ] No performance regressions
259
- - [ ] Integrated with OAuth provider
260
- ```
261
-
262
- ### Dependencies
263
-
264
- - ✓ OAuth provider complete (already done)
265
- - ✓ Database migrations (already done)
266
- - ⏳ Needs: Node.js session library installed
267
-
268
- ### Blockers
269
-
270
- None identified. Ready to proceed.
271
-
272
- ---
273
-
274
- ## Phase: UI Components (AFTER SESSION)
275
-
276
- **Status**: ⏳ Ready to start
277
- **Estimated Start**: _date_ (after session phase)
278
- **Estimated Duration**: 6-8 hours
279
-
280
- ### What Needs to Happen
281
-
282
- 1. **Build Login Page**
283
- - OAuth provider buttons
284
- - Error messaging
285
- - Loading states
286
-
287
- 2. **Build Register Page**
288
- - Form validation
289
- - Provider linking
290
- - Success messaging
291
-
292
- 3. **Update Layout**
293
- - Add auth header
294
- - Add logout button
295
- - Add user menu
296
-
297
- ### Files to Modify
298
-
299
- ```
300
- src/ui/pages/
301
- ├── login.tsx (new)
302
- ├── register.tsx (new)
303
- └── layout.tsx (modify)
304
-
305
- src/ui/components/
306
- ├── oauth-button.tsx (new)
307
- ├── auth-error.tsx (new)
308
- └── user-menu.tsx (modify)
309
-
310
- styles/
311
- ├── auth.css (new)
312
- ```
313
-
314
- ### Acceptance Criteria
315
-
316
- ```
317
- - [ ] Login page working
318
- - [ ] Register page working
319
- - [ ] OAuth provider flow works E2E
320
- - [ ] Error messages display correctly
321
- - [ ] Mobile responsive
322
- - [ ] Browser compatibility tested
323
- ```
324
-
325
- ---
326
-
327
- ## Phase: Documentation & Testing (FINAL PHASE)
328
-
329
- **Status**: ⏳ Ready to start
330
- **Estimated Start**: _date_ (after UI phase)
331
- **Estimated Duration**: 4 hours
332
-
333
- ### What Needs to Happen
334
-
335
- 1. **Update Documentation**
336
- - README auth section
337
- - Setup guide
338
- - API documentation
339
- - Troubleshooting guide
340
-
341
- 2. **Final Testing**
342
- - Full E2E test suite
343
- - Performance testing
344
- - Security testing
345
- - Browser compatibility
346
-
347
- 3. **Code Review & QA**
348
- - Team code review
349
- - QA testing
350
- - Security review
351
-
352
- ### Acceptance Criteria
353
-
354
- ```
355
- - [ ] All documentation complete
356
- - [ ] All E2E tests passing
357
- - [ ] Code review approved
358
- - [ ] No security issues
359
- - [ ] Performance acceptable
360
- - [ ] Ready for production
42
+ # Dry-run package integrity
43
+ npm run pack:check
361
44
  ```
362
45
 
363
46
  ---
364
47
 
365
48
  ## Rollback Points
366
49
 
367
- ### If Implementation Gets Stuck
50
+ ### If A Future Harness Evolution Breaks the CLI/Environment
368
51
 
369
- **Rollback Level 1: Last Commit**
52
+ **Rollback Level 1: Last Stable Run (Current State)**
53
+ If a new change to `bin/genesis-harness.js` or `scripts/verify.sh` creates infinite loops or immediate failures:
370
54
  ```bash
371
- git reset --hard HEAD
55
+ git checkout -- bin/genesis-harness.js scripts/verify.sh scripts/run-evals.sh
372
56
  npm install
373
- npm test # Should pass
57
+ ./scripts/verify.sh
374
58
  ```
375
59
 
376
- **Rollback Level 2: Before This Feature**
60
+ **Rollback Level 2: Full Repository Reset**
61
+ If tests are failing in a manner that contaminates local fixtures or memory:
377
62
  ```bash
378
- git checkout main
63
+ git reset --hard HEAD
64
+ git clean -fd
379
65
  npm install
380
- npm test
381
- # Verify everything works on main
382
- ```
383
-
384
- **Rollback Level 3: Database Rollback**
385
- ```bash
386
- npm run db:rollback -- auth-v1
387
- # Removes auth tables/fields
388
- # App reverts to state before auth changes
66
+ ./scripts/verify.sh
389
67
  ```
390
68
 
391
69
  ---
392
70
 
393
- ## Checklist: Before Pausing Work
394
-
395
- - [ ] All current tests passing
396
- - [ ] Code builds without errors
397
- - [ ] No uncommitted changes
398
- - [ ] Commit message written (describe state)
399
- - [ ] This Recovery Points file updated
400
- - [ ] Next developer notified
401
- - [ ] Slack message sent with status
402
-
403
- ---
404
-
405
- ## Checklist: When Resuming Work
71
+ ## Checklist: Before Pausing Work on Harness Evolutions
406
72
 
407
- - [ ] Pull latest git changes
408
- - [ ] Run `npm install` (if dependencies changed)
409
- - [ ] Run existing tests (verify they still pass)
410
- - [ ] Read IMPLEMENTATION_HANDOFF.md
411
- - [ ] Read this Recovery Points file
412
- - [ ] Update status in this file (what phase are you on?)
413
- - [ ] Begin implementation from current phase
414
-
415
- ---
416
-
417
- ## Known Issues That Might Affect Resumption
418
-
419
- | Issue | Severity | Workaround | Fixed? |
420
- |-------|----------|-----------|--------|
421
- | OAuth token refresh timing | Low | Add 5s buffer | ⏳ TODO |
422
- | Database connection pooling | Medium | Use pool size 10 | ✓ Fixed |
423
- | Rate limiting missing | Low | Not blocking | ⏳ TODO |
73
+ - [ ] `scripts/verify.sh` passing cleanly (Exit Code 0)
74
+ - [ ] `scripts/run-evals.sh` passing cleanly (Exit Code 0)
75
+ - [ ] Script files verified for POSIX/LF line endings
76
+ - [ ] No uncommitted changes in core scripts that break existing workflows
77
+ - [ ] `.codebase/CURRENT_STATE.md` updated with exact phase details
424
78
 
425
79
  ---
426
80
 
427
81
  ## Contact For Questions
428
-
429
- **Original Developer**: _Name_ (_email_)
430
- **Current Owner**: _Name_ (_email_)
431
- **Slack Channel**: #auth-feature
432
-
433
- ---
434
-
435
- **Last Updated**: _YYYY-MM-DD HH:MM_
436
- **Updated By**: _Name_
437
- **Next Review**: _YYYY-MM-DD_
438
- **Paused At**: _Phase: Core Implementation_
82
+ **Owner**: Codex Harness Engineering Team
83
+ **Last Validated**: 2026-06-01
@@ -0,0 +1,16 @@
1
+ [
2
+ {
3
+ "id": "212329",
4
+ "category": "general",
5
+ "fact": "Always use PORT 8080 when running development servers locally.",
6
+ "timestamp": "2026-06-01T07:22:12.428Z",
7
+ "git_commit": "92d8138"
8
+ },
9
+ {
10
+ "id": "25d72a",
11
+ "category": "api",
12
+ "fact": "Verify request.json schema before running visual regression tests.",
13
+ "timestamp": "2026-06-01T07:22:33.273Z",
14
+ "git_commit": "92d8138"
15
+ }
16
+ ]
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: ai-provider-skill
2
+ name: genesis-ai-provider
3
3
  description: Define reliable AI provider harnesses with mock providers, schemas, retries, invalid JSON recovery, persistence tests, and provider contracts. Use for LLM, image, voice, subtitle, render, and pipeline provider work.
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: api-contract-skill
2
+ name: genesis-api-contract
3
3
  description: Create and verify API contracts before endpoint implementation. Use for API routes, service interfaces, validation rules, error shapes, snapshots, and endpoint tests.
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: api-sync-skill
2
+ name: genesis-api-sync
3
3
  description: Automatic API contract synchronization. Detects API changes in implementation, updates API_CONTRACTS.md, regenerates test contracts, and maintains backward compatibility documentation. Use after API-related implementation or when contracts drift from actual code.
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: architecture-skill
2
+ name: genesis-architecture
3
3
  description: Guide Codex through architecture analysis, boundary decisions, dependency direction, module ownership, and architecture documentation. Use before structural changes, new subsystems, cross-module work, or when architecture memory must be updated.
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: codebase-map-skill
2
+ name: genesis-codebase-map
3
3
  description: Maintain repository memory and compressed codebase maps for token-efficient Codex operation. Use when files move, modules change, contracts change, tests change, or repository summaries need updating.
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: debug-guide-skill
2
+ name: genesis-debug-guide
3
3
  description: "Systematic debugging for test failures, runtime errors, and production bugs. Follows TDD debugging patterns, isolation strategies, and observability-driven root cause analysis. Auto-triggers after bug fixes to verify fix quality and prevent regressions."
4
4
  ---
5
5
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: design-spec-skill
2
+ name: genesis-design-spec
3
3
  description: Define UI design specs, route contracts, visual states, accessibility expectations, and UI/API synchronization before frontend implementation. Use for new screens, redesigns, visual QA, or UI contract updates.
4
4
  ---
5
5
 
@@ -18,7 +18,7 @@ Do not use for backend-only changes with no UI effect.
18
18
  UI route, audience, state list, API dependencies, fixtures, and visual references if available.
19
19
 
20
20
  ## Outputs required
21
- UI contract, Playwright fixture, acceptance states, and visual regression expectations.
21
+ UI contract, a generated visual mockup image representing the screen layout (saved as `mockup.png` in the active feature/bug directory), Playwright fixture, acceptance states, and visual regression expectations.
22
22
 
23
23
  ## Required tests
24
24
  Create load, interaction, validation, API sync, and visual regression tests where applicable.
@@ -36,7 +36,7 @@ Update `.codebase/UI_ROUTES.md` and frontend summary.
36
36
  Use state tables and route maps instead of long visual prose.
37
37
 
38
38
  ## Acceptance criteria
39
- The UI can be implemented and tested without guessing states or API behavior.
39
+ The UI can be implemented and tested without guessing states or API behavior. A high-fidelity visual mockup `mockup.png` must be created using the `generate_image` tool and saved in the active task directory as the visual contract.
40
40
 
41
41
  ## Common mistakes
42
42
  Designing only the happy path, skipping empty/error states, and omitting API synchronization.
@@ -19,7 +19,7 @@ Use for project initialization, planning, feature work, bug fixes, audits, revie
19
19
  Do not use for simple read-only answers that do not require repository workflow or durable artifacts.
20
20
 
21
21
  ## Inputs required
22
- Read `.codebase/state.json` (MANDATORY on boot), `.codebase/CURRENT_STATE.md`, `.codebase/MODULE_INDEX.md`, and `.codebase/TEST_MATRIX.md` when present, then inspect only relevant files.
22
+ Run `genesis-harness status` and `genesis-harness docs` (or `genesis-harness prime`) IMMEDIATELY on boot to retrieve a high-fidelity, token-minimized FSM state, active roadmap checklist, memory beads, and API contract catalog. Do NOT manually run search commands or read entire planning folders to discover the repository state. Only inspect specific relevant code files.
23
23
 
24
24
  ## Outputs required
25
25
  Plan or implementation artifact, tests, fixtures, verification evidence, docs sync, and codebase memory updates.
@@ -37,7 +37,7 @@ Update API, agent, event, or UI contracts when public behavior changes.
37
37
  Update `.codebase` memory after meaningful changes.
38
38
 
39
39
  ## Token saving rules
40
- Read summaries before source files, maps before modules, and avoid loading the entire repository.
40
+ Always prioritize running standard CLI queries (`genesis-harness status`, `docs`, `recall`, and `prime`) to pull structured data rather than traversing directories manually. Read summaries before source files, maps before modules, and avoid loading the entire repository.
41
41
 
42
42
  ## Acceptance criteria
43
43
  Work is complete only when tests pass, contracts and docs are current, and verification evidence is reported.
@@ -170,6 +170,7 @@ Bundled references live under `references/`. Load them only when needed:
170
170
  - `references/planning-schema.md`: detailed `.planning/` file meanings and required subtrees.
171
171
  - `references/research-rubric.md`: local/external evidence format for research.
172
172
  - `references/quality-rubric.md`: scoring rubric for `QUALITY_SCORE.md`.
173
+ - `references/state-machine.md`: FSM state definitions, transitions, and verification gates.
173
174
 
174
175
  Bundled scripts live under `scripts/`. Prefer copying or adapting these into `.planning/scripts/` or project scripts during `/init`:
175
176
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: harness-engineering-skill
2
+ name: genesis-harness-engineering
3
3
  description: "Evolve the Codex harness itself: verification loops, repository memory, test-first scaffolds, resumability, observability, and autonomous workflow reliability. Use for changes to this repository's skill system or harness architecture."
4
4
  ---
5
5
 
@@ -17,7 +17,7 @@ Use when building a new web page, app screen, dashboard, tool, landing page, or
17
17
  Do not use for redesigning existing UI without first preserving behavior; use `genesis-upgrade-design` instead.
18
18
 
19
19
  ## Inputs required
20
- Product intent, target users, primary workflow, stack details, route or entry point, state list, and visual constraints.
20
+ Product intent, target users, primary workflow, stack details, route or entry point, state list, visual constraints, and the generated visual contract image `mockup.png` (which MUST be loaded via the `view_file` tool to inspect layouts, color choices, and dimensions before coding).
21
21
 
22
22
  ## Outputs required
23
23
  Implemented UI, UI contract, fixtures, responsive states, visual verification, and docs or memory updates.
@@ -55,6 +55,7 @@ If visual output fails, capture screenshot evidence, update the fixture or contr
55
55
 
56
56
  2. Define the design intent from the request:
57
57
  - Identify audience, product category, primary task, density, tone, and constraints.
58
+ - Proactively inspect `mockup.png` using the `view_file` tool to absorb the visual direction, color system, and placement details.
58
59
  - Choose one clear visual direction and commit to it across typography, color, spacing, surfaces, iconography, and motion.
59
60
  - For tools, dashboards, and operational apps, prioritize scanning, repeated use, compact controls, and predictable navigation over decorative hero layouts.
60
61
 
@@ -1,5 +1,5 @@
1
1
  ---
2
- name: pipeline-orchestration-skill
2
+ name: genesis-pipeline-orchestration
3
3
  description: Run end-to-end Codex harness orchestration phases from repository analysis through tests, fixtures, implementation, contracts, memory, docs, and change summaries. Use for autonomous multi-phase work.
4
4
  ---
5
5