agentic-dev 0.2.2 → 0.2.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (70) hide show
  1. package/.claude/skills/sdd/SKILL.md +178 -7
  2. package/.claude/skills/sdd/agents/openai.yaml +4 -0
  3. package/.claude/skills/sdd/references/section-map.md +67 -0
  4. package/package.json +1 -1
  5. package/.claude/skills/commit/SKILL.md +0 -37
  6. package/.claude/skills/dev-browser/SKILL.md +0 -30
  7. package/.claude/skills/otro/SKILL.md +0 -43
  8. package/.claude/skills/planning-with-files/SKILL.md +0 -37
  9. package/.claude/skills/prd/SKILL.md +0 -27
  10. package/.claude/skills/ralph-loop/SKILL.md +0 -42
  11. package/.claude/skills/sdd-dev/SKILL.md +0 -71
  12. package/.claude/skills/sdd-development/SKILL.md +0 -13
  13. package/.codex/skills/agents/openai.yaml +0 -4
  14. package/.codex/skills/commit/SKILL.md +0 -219
  15. package/.codex/skills/commit/references/commit_examples.md +0 -292
  16. package/.codex/skills/dev-browser/SKILL.md +0 -211
  17. package/.codex/skills/dev-browser/bun.lock +0 -443
  18. package/.codex/skills/dev-browser/package-lock.json +0 -2988
  19. package/.codex/skills/dev-browser/package.json +0 -31
  20. package/.codex/skills/dev-browser/references/scraping.md +0 -155
  21. package/.codex/skills/dev-browser/scripts/start-relay.ts +0 -32
  22. package/.codex/skills/dev-browser/scripts/start-server.ts +0 -117
  23. package/.codex/skills/dev-browser/server.sh +0 -24
  24. package/.codex/skills/dev-browser/src/client.ts +0 -474
  25. package/.codex/skills/dev-browser/src/index.ts +0 -287
  26. package/.codex/skills/dev-browser/src/relay.ts +0 -731
  27. package/.codex/skills/dev-browser/src/snapshot/__tests__/snapshot.test.ts +0 -223
  28. package/.codex/skills/dev-browser/src/snapshot/browser-script.ts +0 -877
  29. package/.codex/skills/dev-browser/src/snapshot/index.ts +0 -14
  30. package/.codex/skills/dev-browser/src/snapshot/inject.ts +0 -13
  31. package/.codex/skills/dev-browser/src/types.ts +0 -34
  32. package/.codex/skills/dev-browser/tsconfig.json +0 -36
  33. package/.codex/skills/dev-browser/vitest.config.ts +0 -12
  34. package/.codex/skills/otro/SKILL.md +0 -74
  35. package/.codex/skills/otro/agents/openai.yaml +0 -4
  36. package/.codex/skills/otro/references/agent-prompts.md +0 -61
  37. package/.codex/skills/otro/references/contracts.md +0 -146
  38. package/.codex/skills/otro/references/orchestration-loop.md +0 -51
  39. package/.codex/skills/otro/references/runtime.md +0 -79
  40. package/.codex/skills/otro/runs/README.md +0 -11
  41. package/.codex/skills/otro/schemas/step_plan.schema.json +0 -289
  42. package/.codex/skills/otro/schemas/task_result.schema.json +0 -142
  43. package/.codex/skills/otro/schemas/wave_plan.schema.json +0 -4
  44. package/.codex/skills/otro/scripts/README.md +0 -38
  45. package/.codex/skills/otro/scripts/bump_validation_header.py +0 -179
  46. package/.codex/skills/otro/scripts/check_validation_header.py +0 -84
  47. package/.codex/skills/otro/scripts/common.py +0 -303
  48. package/.codex/skills/otro/scripts/init_run.sh +0 -68
  49. package/.codex/skills/otro/scripts/plan_loop.py +0 -8
  50. package/.codex/skills/otro/scripts/plan_step.py +0 -367
  51. package/.codex/skills/otro/scripts/plan_wave.py +0 -8
  52. package/.codex/skills/otro/scripts/reconcile_loop.py +0 -8
  53. package/.codex/skills/otro/scripts/reconcile_step.py +0 -37
  54. package/.codex/skills/otro/scripts/reconcile_wave.py +0 -8
  55. package/.codex/skills/otro/scripts/run_loop.py +0 -300
  56. package/.codex/skills/otro/scripts/run_loop_step.py +0 -8
  57. package/.codex/skills/otro/scripts/run_step.py +0 -246
  58. package/.codex/skills/otro/scripts/run_wave.py +0 -8
  59. package/.codex/skills/otro/validation/validation.md +0 -15
  60. package/.codex/skills/planning-with-files/SKILL.md +0 -42
  61. package/.codex/skills/planning-with-files/agents/openai.yaml +0 -4
  62. package/.codex/skills/planning-with-files/assets/plan-template.md +0 -37
  63. package/.codex/skills/planning-with-files/references/plan-rules.md +0 -35
  64. package/.codex/skills/planning-with-files/scripts/new_plan.sh +0 -65
  65. package/.codex/skills/prd/SKILL.md +0 -235
  66. package/.codex/skills/ralph-loop/SKILL.md +0 -46
  67. package/.codex/skills/ralph-loop/agents/openai.yaml +0 -4
  68. package/.codex/skills/ralph-loop/references/failure-triage.md +0 -32
  69. package/.codex/skills/ralph-loop/scripts/loop_until_success.sh +0 -97
  70. package/sdd/99_toolchain/02_policies/otro-orchestration-policy.md +0 -30
@@ -1,13 +1,184 @@
1
1
  ---
2
2
  name: sdd
3
- description: "Alias skill for repositories that use `sdd/` as the canonical delivery system."
4
- user_invocable: true
3
+ description: "Use for any software development request in a repository that treats `sdd/` as the canonical delivery system. Trigger on requests like develop, implement, build, code, work on, modify, fix, patch, refactor, test, verify, deploy, monitor, or screen/UI-driven prompts such as 화면명세서, 화면설계서, 화면 설계, 화면, 화면 스펙, UI, 디자인, 디자인 가이드, screen spec, screen design, or design guide. The workflow is always SDD-first: inspect and update `sdd/01_planning`, create or update the task plan under `sdd/02_plan`, implement the change, record execution in `sdd/03_build`, capture validation in `sdd/04_verify`, and record deployment or monitoring in `sdd/05_operate` when rollout happens."
5
4
  ---
6
5
 
7
- ## /sdd
6
+ # SDD Development
8
7
 
9
- 파일은 alias skill이다. canonical SDD workflow는 [`../sdd-dev/SKILL.md`](../sdd-dev/SKILL.md)를 따른다.
8
+ ## Overview
10
9
 
11
- Rules:
12
- - 실제 workflow와 section map은 `sdd-dev` 기준을 사용한다.
13
- - downstream repo가 `sdd` 이름으로 호출해도 동일한 current-state delivery 규칙으로 해석한다.
10
+ Use this skill for implementation work in repositories that treat `sdd/` as the canonical delivery record.
11
+
12
+ This skill enforces one workflow:
13
+ 1. inspect and fix relevant `sdd/01_planning` artifacts first,
14
+ 2. create or update the task plan under `sdd/02_plan/<section>/`,
15
+ 3. perform the code work,
16
+ 4. record the current implementation summary in `sdd/03_build`,
17
+ 5. record the current retained verification summary in `sdd/04_verify`,
18
+ 6. record deployment and monitoring outcomes in `sdd/05_operate` when rollout happens.
19
+
20
+ When rollout is explicitly in scope, and the repository has separate DEV and PROD environments, this skill enforces a staged release rule: deploy to DEV first, complete the retained full-layer validation surface there, promote to PROD only after DEV passes, then rerun the same retained validation surface in PROD. If PROD validation fails, rollback is required unless the user explicitly redirects and that risk is recorded.
21
+ For persistence-affecting work, this skill also enforces schema-parity verification. Always compare migration or model intent against the real DEV and PROD schema state for the affected database objects instead of assuming deployed reality matches the code.
22
+
23
+ Read [references/section-map.md](references/section-map.md) when you need the exact destination inside `sdd/`.
24
+ For screen-spec-driven UI work, reusable static assets from the spec must be extracted through the repo's canonical asset builder before being used in code.
25
+ For screen-spec-driven layout work, inspect the repo's canonical design guide builder first when one exists.
26
+ For local screen exactness, treat the repo's Playwright exactness runner and suite registry as the canonical automation gate when they exist.
27
+ For verification work, treat regression scope selection as a required retained artifact and carry it through `sdd/02_plan`, `sdd/03_build`, and `sdd/04_verify`.
28
+ Do not infer rollout scope from the existence of `sdd/05_operate` alone; require rollout only when the user asks for deployment or the repo's current policy/plan makes rollout part of completion.
29
+ When a repo or team treats DEV deployment from `main` as the completion bar, temporary branches or worktrees are only working space. Before calling the task deployed, land the final retained change on `main` and push `origin/main`.
30
+
31
+ ## When To Use
32
+
33
+ Use this skill when:
34
+ - the repository has `sdd/01_planning`, `02_plan`, `03_build`, `04_verify`, `05_operate`,
35
+ - the user gives a development instruction such as `개발해`, `작업해`, `구현해`, `수정해`, `고쳐`, `리팩토링해`, `테스트해`, `배포해`,
36
+ - the user asks for screen/UI work with prompts such as `화면명세서`, `화면설계서`, `화면 설계`, `화면`, `화면 스펙`, `UI`, `디자인`, `디자인 가이드`, `screen spec`, `screen design`, or `design guide`,
37
+ - the user wants work to be traceable through those folders,
38
+ - the task includes both implementation and documentation/verification updates,
39
+ - the task may end in deployment or operational follow-up.
40
+
41
+ Do not use this skill for:
42
+ - casual questions with no repo changes,
43
+ - one-off local debugging where no durable SDD record is needed,
44
+ - repositories that do not use `sdd/` as their primary document system.
45
+
46
+ ## Workflow
47
+
48
+ ### 1. Inspect Planning First
49
+
50
+ - Identify the impacted planning area before editing code.
51
+ - Open only the relevant artifacts in `sdd/01_planning`:
52
+ - feature
53
+ - screen
54
+ - architecture
55
+ - data
56
+ - api
57
+ - iac
58
+ - integration
59
+ - nonfunctional
60
+ - security
61
+ - test
62
+ - If the implementation has already drifted from planning, update the planning artifact first or at least record the drift before coding.
63
+
64
+ ### 2. Create Or Update The Plan
65
+
66
+ - Create or reuse a durable plan file under `sdd/02_plan/<section>/`.
67
+ - Prefer the repo's planning scaffold if available.
68
+ - The plan must include:
69
+ - scope
70
+ - assumptions
71
+ - acceptance criteria
72
+ - execution checklist
73
+ - current notes
74
+ - validation
75
+ - For any task that may end in deployment, acceptance criteria must explicitly include the DEV gate, the matching PROD gate, the retained full-layer test surface, and the rollback trigger/path.
76
+ - For any task that touches persistence, models, repositories, migrations, SQL, ORM mappings, or runtime failures that may involve schema drift, acceptance criteria must explicitly include DEV/PROD schema verification.
77
+ - Keep exactly one checklist item in progress.
78
+
79
+ ### 3. Implement Against The Plan
80
+
81
+ - Make code changes only after the impacted planning artifact and plan are aligned enough to proceed.
82
+ - Update the plan current notes after meaningful edits or decisions.
83
+ - When document generators or capture pipelines are involved, keep those tools under `sdd/99_toolchain`.
84
+ - For local screen exactness, treat Playwright as the canonical automation gate unless the repo documents a stronger exact gate for that surface.
85
+ - In repos created from this template, prefer `python3 sdd/99_toolchain/01_automation/run_playwright_exactness.py --suite <suite-id>` over ad-hoc `npx playwright test ...`.
86
+ - Keep the suite registry in `sdd/99_toolchain/01_automation/playwright_exactness_manifest.py`.
87
+ - Browser Use, manual screenshots, or semantic extraction can supplement diagnosis, but they do not replace the retained Playwright exactness gate when a suite exists.
88
+ - If the needed suite does not exist yet, add or extend it in the same task and register it before calling the work complete.
89
+ - When the task depends on icons, logos, illustrations, or other static assets visible in a screen spec, use the repo's canonical Asset Spec Builder first.
90
+ - In repos created from this template, this is typically `sdd/99_toolchain/01_automation/spec_asset_builder.py` or a wrapper/manifest around it.
91
+ - Reusable asset planning records belong under `sdd/01_planning/02_screen/assets/`.
92
+ - Build the runtime asset from the approved PDF/image source instead of hand-tracing or screenshot-cropping it.
93
+ - Use exact verification such as `--verify-exact` when the asset is expected to match the source crop exactly.
94
+ - Record the source, manifest, generated asset path, and any exception in `sdd/03_build` and `sdd/04_verify`.
95
+ - Only fall back to manual recreation when the builder cannot express the asset, and explicitly document that exception in the plan/build/verify trail.
96
+ - When the task depends on spacing, layout density, typography, color rhythm, or component hierarchy derived from a screen spec, inspect the repo's canonical design guide builder first.
97
+ - In repos created from this template, inspect `sdd/99_toolchain/01_automation/README.md` and use the actual builder, wrapper, or manifest that exists in the repo.
98
+ - Use the generated guide as the working baseline before manual spacing or palette tweaks.
99
+ - If the repo does not provide a design guide builder yet, document the manual interpretation source in plan/build/verify.
100
+ - Define the regression surface before calling implementation complete.
101
+ - Start from `sdd/02_plan/10_test/regression_verification.md`.
102
+ - Identify the direct target plus any upstream, downstream, and shared surfaces affected by the change.
103
+ - If the change touches shared routing, shell/auth, shared state, common components, contracts, generated assets, or builder output, widen the regression scope instead of validating only the edited module.
104
+ - Record the selected regression scope and any justified exclusions in plan/build/verify.
105
+ - When rollout is in scope, define one retained full-layer validation surface and reuse it in DEV and PROD.
106
+ - Include the relevant app/runtime entrypoints, API/contracts, persistence/schema, jobs or workflow side effects, shared integrations, and health/monitoring checks affected by the change.
107
+ - Do not promote to PROD with a narrower verification surface than the DEV gate unless the user explicitly approves that exception and it is recorded.
108
+
109
+ ### 4. Record Build Summary
110
+
111
+ - Record what you implemented in `sdd/03_build`.
112
+ - Use:
113
+ - `03_build/01_feature` for feature implementation summaries
114
+ - `03_build/02_screen` for screen implementation summaries
115
+ - `03_build/03_architecture`, `06_iac`, `10_test` for current-state cross-cutting summaries
116
+ - Keep entries factual and current-state only: implemented scope, modules, assets, contracts, and current user-visible behavior.
117
+
118
+ ### 5. Record Verification
119
+
120
+ - Record retained verification status in `sdd/04_verify`.
121
+ - Use:
122
+ - `04_verify/01_feature` for feature verification summaries
123
+ - `04_verify/02_screen` for screen verification summaries
124
+ - `04_verify/03_architecture`, `06_iac`, `10_test` for current retained checks and residual risk
125
+ - Never claim completion without command-level validation evidence.
126
+ - For staged DEV -> PROD rollout, verification must use the same retained full-layer validation surface in both environments.
127
+ - Run the full-layer validation in DEV after deployment and treat it as a hard gate before PROD promotion.
128
+ - After PROD deployment, rerun the same retained validation surface in PROD.
129
+ - If PROD validation fails, execute rollback or the approved recovery procedure immediately and record the failure and recovery outcome.
130
+ - For persistence-affecting work, verification must include real schema evidence from both DEV and PROD when those environments exist.
131
+ - Check migration state and actual runtime schema separately.
132
+ - Validate the tables, columns, indexes, constraints, triggers, defaults, and any legacy compatibility objects touched by the change.
133
+ - Record the commands or queries used, the environments checked, and the drift or parity result.
134
+ - Regression verification is mandatory.
135
+ - Verification must cover the direct surface and the retained upstream/downstream/shared surfaces selected from the regression baseline.
136
+ - If no automation exists for a needed regression slice yet, run the best available manual or command checks and record that gap as current residual risk.
137
+ - When a Playwright suite exists for the surface, record the canonical runner command, suite id, screenshot/json artifact paths, and any live-vs-local split explicitly.
138
+
139
+ ### 6. Record Operate Outcomes
140
+
141
+ - If deployment or runtime follow-up happens, update `sdd/05_operate`.
142
+ - Use:
143
+ - `05_operate/01_runbooks` for durable operating procedure changes
144
+ - `05_operate/02_delivery_status` for the current live state and monitoring baseline
145
+ - Record:
146
+ - what was deployed
147
+ - which live baseline is current
148
+ - how it is monitored
149
+ - any current residual risk
150
+ - For staged DEV -> PROD rollout, record the DEV gate, PROD gate, and any rollback outcome explicitly.
151
+ - When the DEV deployment baseline is tied to `main`, do not treat a side-branch push as sufficient. Merge, cherry-pick, or otherwise replay the final change onto `main`, push `origin/main`, then record the deployment evidence against that `main` baseline.
152
+
153
+ ## Guardrails
154
+
155
+ - Do not create or repopulate a parallel `docs/` tree when `sdd/` exists.
156
+ - Do not skip planning review just because the code change looks small.
157
+ - Do not leave build, verify, or operate evidence only in chat text when it should live in `sdd/`.
158
+ - When PROD rollout is in scope, do not promote to PROD before the retained full-layer DEV validation surface has passed.
159
+ - When PROD rollout is in scope, do not use a weaker PROD validation surface than the one that gated DEV unless the user explicitly approves and that risk is recorded.
160
+ - When PROD rollout is in scope, do not stop at "PROD deployment succeeded"; post-deploy PROD validation is mandatory.
161
+ - When PROD rollout is in scope, do not leave a failed PROD deployment unreconciled; rollback or the approved recovery procedure must be executed and recorded.
162
+ - Do not assume local tests, migration heads, or current model code prove deployed schema parity.
163
+ - Do not skip DEV/PROD schema inspection for persistence-affecting work when schema drift could influence behavior.
164
+ - Do not manually redraw screen-spec static assets when a canonical Asset Spec Builder exists for the repo; extract them first and use the generated asset.
165
+ - Do not skip a relevant screen automation builder just because the requested UI change looks like a small manual tweak.
166
+ - Do not stop verification at the edited file, route, or screen when the change can affect shared or adjacent behavior.
167
+ - Do not omit regression scope selection from the retained SDD trail.
168
+ - Do not update `05_operate` for tasks that never reached deployment; explicitly note that rollout did not happen instead.
169
+ - Do not call a DEV rollout complete from a temporary branch when the repo or team baseline expects the deployed change to be on `origin/main`.
170
+
171
+ ## Output Standard
172
+
173
+ By the end of an implementation task, the expected trail is:
174
+ - planning artifact reviewed or corrected,
175
+ - plan file updated,
176
+ - build summary written,
177
+ - verification summary written, including selected regression scope and residual risk,
178
+ - operate status updated if rollout occurred.
179
+
180
+ When PROD rollout is in scope, the retained completion state also requires:
181
+ - DEV deployment happened first and the retained full-layer DEV validation surface passed,
182
+ - the same retained full-layer validation surface was executed again in PROD,
183
+ - DEV/PROD schema state was checked and recorded when schema could influence behavior,
184
+ - any PROD validation failure produced rollback or recovery evidence in verify/operate.
@@ -0,0 +1,4 @@
1
+ interface:
2
+ display_name: "SDD Development"
3
+ short_description: "Use SDD-first flow with schema parity, regression scope, and staged rollout gates"
4
+ default_prompt: "Use $sdd to handle this development task through sdd planning, build, verify, and operate, including regression scope selection, schema parity checks for persistence-affecting work, and staged DEV-first validation when multiple environments exist."
@@ -0,0 +1,67 @@
1
+ # SDD Section Map
2
+
3
+ ## Planning
4
+
5
+ - `sdd/01_planning/01_feature`
6
+ - domain or service feature specifications
7
+ - `sdd/01_planning/02_screen`
8
+ - service-level screen specifications and PDFs
9
+ - `sdd/01_planning/03_architecture`
10
+ - bounded context, runtime, and structural design
11
+ - `sdd/01_planning/04_data`
12
+ - data model and relationship definitions
13
+ - `sdd/01_planning/05_api`
14
+ - transport contracts and API definitions
15
+ - `sdd/01_planning/06_iac`
16
+ - infrastructure planning and deployment design
17
+ - `sdd/01_planning/07_integration`
18
+ - integration contracts and external dependency planning
19
+ - `sdd/01_planning/08_nonfunctional`
20
+ - performance, reliability, scalability, and operational constraints
21
+ - `sdd/01_planning/09_security`
22
+ - security posture, threat model, and control planning
23
+ - `sdd/01_planning/10_test`
24
+ - test strategy and planned cases
25
+
26
+ ## Plan
27
+
28
+ - `sdd/02_plan/<section>`
29
+ - executable plan files and migration backlogs
30
+
31
+ ## Build
32
+
33
+ - `sdd/03_build/01_feature`
34
+ - feature implementation summaries
35
+ - `sdd/03_build/02_screen`
36
+ - screen implementation summaries
37
+ - `sdd/03_build/03_architecture`
38
+ - structural and governance implementation summaries
39
+ - `sdd/03_build/06_iac`
40
+ - current delivery/runtime implementation summaries
41
+ - `sdd/03_build/10_test`
42
+ - current harness and validation implementation summaries
43
+
44
+ ## Verify
45
+
46
+ - `sdd/04_verify/01_feature`
47
+ - feature verification summaries
48
+ - `sdd/04_verify/02_screen`
49
+ - screen verification summaries
50
+ - `sdd/04_verify/03_architecture`
51
+ - governance and structure verification summaries
52
+ - `sdd/04_verify/06_iac`
53
+ - delivery/runtime verification summaries
54
+ - `sdd/04_verify/10_test`
55
+ - current harness outputs and retained validation references
56
+
57
+ ## Operate
58
+
59
+ - `sdd/05_operate/01_runbooks`
60
+ - durable operating procedures
61
+ - `sdd/05_operate/02_delivery_status`
62
+ - current live state, monitoring baseline, and residual risk
63
+
64
+ ## Tooling
65
+
66
+ - `sdd/99_toolchain`
67
+ - generators, capture tooling, manifests, and other SDD automation
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agentic-dev",
3
- "version": "0.2.2",
3
+ "version": "0.2.3",
4
4
  "description": "Scaffold an agentic template repo with the selected frontend surface and shared toolchain assets.",
5
5
  "type": "module",
6
6
  "packageManager": "pnpm@10.32.0",
@@ -1,37 +0,0 @@
1
- ---
2
- name: commit
3
- description: "Write conventional commits with type, scope, and subject"
4
- user_invocable: true
5
- ---
6
-
7
- ## /commit
8
-
9
- Conventional Commit 형식으로 커밋 메시지를 만든다.
10
-
11
- ### Use when
12
-
13
- - 변경을 저장해야 할 때
14
- - PR 전 커밋 메시지를 정리할 때
15
- - 타입/스코프/subject를 일관되게 맞춰야 할 때
16
-
17
- ### Canonical asset
18
-
19
- - `.codex/skills/commit/SKILL.md`
20
-
21
- ### Rules
22
-
23
- - 형식: `type(scope): subject`
24
- - scope는 필수
25
- - subject는 현재형 명령문
26
- - subject 끝에 마침표 금지
27
- - vague message 금지
28
-
29
- ### Common types
30
-
31
- - `feat`
32
- - `fix`
33
- - `refactor`
34
- - `test`
35
- - `docs`
36
- - `ci`
37
- - `chore`
@@ -1,30 +0,0 @@
1
- ---
2
- name: dev-browser
3
- description: "Browser automation with persistent page state"
4
- user_invocable: true
5
- ---
6
-
7
- ## /dev-browser
8
-
9
- 브라우저 자동화와 반복 웹 검증을 수행한다.
10
-
11
- ### Use when
12
-
13
- - 페이지 이동, 클릭, 입력, 스크린샷이 필요할 때
14
- - 웹앱 검증을 자동화할 때
15
- - 반복 브라우저 작업을 스크립트로 굳힐 때
16
-
17
- ### Canonical asset
18
-
19
- - `.codex/skills/dev-browser/SKILL.md`
20
-
21
- ### Modes
22
-
23
- - standalone Chromium
24
- - extension relay via user browser
25
-
26
- ### Rules
27
-
28
- - 작은 스크립트 단위로 진행
29
- - 상태를 확인한 뒤 다음 스크립트로 넘어감
30
- - 재사용이 필요한 경우에만 `tmp/` 파일로 승격
@@ -1,43 +0,0 @@
1
- ---
2
- name: otro
3
- description: "OTRO loop orchestrator for repository-wide multi-step work"
4
- user_invocable: true
5
- ---
6
-
7
- ## /otro
8
-
9
- Repository-wide 작업을 OTRO 루프로 실행한다.
10
-
11
- ### Use when
12
-
13
- - 기능/리팩터/정합 작업이 단일 edit로 끝나지 않을 때
14
- - 전역 분석 -> TODO graph -> loop dispatch -> reconcile 이 필요할 때
15
- - 병렬 worker와 residual-based replan이 필요할 때
16
-
17
- ### Canonical runtime
18
-
19
- - Skill assets: `.codex/skills/otro/`
20
- - Run root: `.codex/skills/otro/runs/<run-name>/`
21
-
22
- ### Core contract
23
-
24
- - Canonical terms: `loop`, `step`, `loop_done`, `run_done`, `repository_done`
25
- - Task IDs are canonicalized per step as `T{step}{ordinal}`
26
- - Default planner/worker timeout is `600s`
27
- - Timeout means force-stop, record timeout, then split into at least two narrower next-step tasks
28
- - Replanning happens only at the loop boundary
29
- - If the repository uses `sdd/` or another canonical documentation tree, OTRO must use that system instead of creating parallel docs.
30
-
31
- ### Process
32
-
33
- 1. Initialize a run with `.codex/skills/otro/scripts/init_run.sh`.
34
- 2. Write the repository-wide objective in `goal.md`.
35
- 3. Generate the first global plan with `.codex/skills/otro/scripts/plan_loop.py`.
36
- 4. Execute with `.codex/skills/otro/scripts/run_loop.py <run-dir> --until-done`.
37
- 5. Inspect `plans/current-plan.json`, `results/loop-*.json`, and `state.json`.
38
- 6. Continue until `repository_done` is satisfied.
39
-
40
- ### Read first
41
-
42
- - `.codex/skills/otro/references/runtime.md`
43
- - `.codex/skills/otro/references/contracts.md`
@@ -1,37 +0,0 @@
1
- ---
2
- name: planning-with-files
3
- description: "Keep multi-step work auditable through a live markdown plan file"
4
- user_invocable: true
5
- ---
6
-
7
- ## /planning-with-files
8
-
9
- 작업을 plan file로 시작하고, 실행 내내 갱신하고, 검증 로그로 닫는다.
10
-
11
- ### Use when
12
-
13
- - 작업이 여러 단계로 길어질 때
14
- - edit/test/deploy 흔적을 남겨야 할 때
15
- - TODO를 durable하게 유지해야 할 때
16
-
17
- ### Canonical assets
18
-
19
- - Codex skill: `.codex/skills/planning-with-files/`
20
- - Template: `.codex/skills/planning-with-files/assets/plan-template.md`
21
- - Scaffolder: `.codex/skills/planning-with-files/scripts/new_plan.sh`
22
-
23
- ### Process
24
-
25
- 1. `scripts/new_plan.sh "<title>" [repo_root] [section]`로 plan을 만든다.
26
- 2. 저장소가 `sdd/02_plan/<section>/` 규칙을 쓰면 그 위치를 canonical로 사용한다.
27
- 3. `templates` 같은 저장소에서는 `docs/plans/`를 만들지 않고 `sdd/02_plan/<section>/` 아래 생성된 파일에 scope, assumptions, acceptance criteria를 적는다.
28
- 4. 5-10개의 concrete checklist로 쪼갠다.
29
- 5. 항상 하나의 active item만 둔다.
30
- 6. edit/test/deploy 뒤에는 evidence log를 짧게 남긴다.
31
- 7. 마지막에 validation status와 residual risk를 적고 닫는다.
32
-
33
- ### Guardrails
34
-
35
- - stale checklist를 남기지 않는다
36
- - evidence 없는 완료 표시는 금지
37
- - dead item을 쌓지 말고 현재 plan을 갱신한다
@@ -1,27 +0,0 @@
1
- ---
2
- name: prd
3
- description: "Generate a structured JSON PRD for executable delivery work"
4
- user_invocable: true
5
- ---
6
-
7
- ## /prd
8
-
9
- 실행 가능한 JSON PRD를 생성한다.
10
-
11
- ### Use when
12
-
13
- - 기능 요구사항을 정형화해야 할 때
14
- - stories, routes, quality gates를 한 파일로 고정해야 할 때
15
- - 구현 전에 scope를 잠가야 할 때
16
-
17
- ### Canonical asset
18
-
19
- - `.codex/skills/prd/SKILL.md`
20
-
21
- ### Rules
22
-
23
- - 구현하지 말고 PRD만 생성
24
- - quality gates는 반드시 묻고 포함
25
- - story는 single iteration 크기로 쪼갬
26
- - 새 프로젝트면 setup story를 먼저 둠
27
- - output은 JSON 정본 하나만 저장
@@ -1,42 +0,0 @@
1
- ---
2
- name: ralph-loop
3
- description: "Run a bounded implement-test-fix loop with explicit stop conditions"
4
- user_invocable: true
5
- ---
6
-
7
- ## /ralph-loop
8
-
9
- 짧은 reproduce -> edit -> validate -> decide 루프를 돌린다.
10
-
11
- ### Use when
12
-
13
- - 버그 수정이 반복 검증을 필요로 할 때
14
- - flaky 여부나 fail-same / fail-new 분류가 중요할 때
15
- - stop condition이 명확한 bounded loop가 필요할 때
16
-
17
- ### Canonical assets
18
-
19
- - Codex skill: `.codex/skills/ralph-loop/`
20
- - Script: `.codex/skills/ralph-loop/scripts/loop_until_success.sh`
21
- - Reference: `.codex/skills/ralph-loop/references/failure-triage.md`
22
-
23
- ### Loop contract
24
-
25
- 1. failing reproduction 또는 target validation command를 고정한다.
26
- 2. iteration마다 한 가지 change만 넣는다.
27
- 3. 즉시 validation을 돌린다.
28
- 4. 결과를 `pass`, `fail-same`, `fail-new`, `flaky`, `no-signal`로 분류한다.
29
- 5. 다음 action을 정한 뒤에만 다시 edit한다.
30
-
31
- ### Stop conditions
32
-
33
- - validation passes
34
- - max iterations reached
35
- - same failure repeats 3 times without new signal
36
- - missing dependency or external decision blocks progress
37
-
38
- ### Command example
39
-
40
- ```bash
41
- .codex/skills/ralph-loop/scripts/loop_until_success.sh --cmd "pytest -q tests/foo_test.py" --max 6
42
- ```
@@ -1,71 +0,0 @@
1
- ---
2
- name: sdd-dev
3
- description: "Use the SDD-first workflow for any development task in repos that use sdd/"
4
- user_invocable: true
5
- ---
6
-
7
- ## /sdd-dev
8
-
9
- 개발 작업을 항상 `sdd/` 산출물 기준으로 진행한다. 화면명세서, 화면설계서, 화면 스펙, 디자인, 디자인 가이드 같은 화면 개발 프롬프트도 이 스킬의 트리거로 본다.
10
- 배포가 실제 범위이고 개발계 배포 completion bar가 `main` 기준인 저장소에서는 임시 브랜치나 worktree를 써도 최종 반영은 반드시 `origin/main`까지 올라가야 완료로 본다.
11
- 배포가 실제 범위이고 개발계와 운영계가 분리된 저장소에서는 배포 순서를 `DEV 배포 -> DEV 전 레이어 검증 통과 -> PROD 배포 -> PROD 동일 레이어 검증`으로 고정한다. 운영 검증이 실패하면 롤백 또는 명시된 복구 절차가 필수다.
12
- 스키마가 동작에 영향을 줄 수 있는 작업은 코드나 migration head만 보지 않고 DEV/PROD 실제 스키마 상태를 확인해야 한다.
13
-
14
- ### Use when
15
-
16
- - 사용자가 `개발해`, `작업해`, `구현해`, `수정해`, `고쳐`, `리팩토링해`, `테스트해`, `배포해`처럼 개발을 지시할 때
17
- - 사용자가 `화면명세서`, `화면설계서`, `화면 설계`, `화면`, `화면 스펙`, `UI`, `디자인`, `디자인 가이드`처럼 화면 개발/정렬을 지시할 때
18
- - 저장소가 `sdd/01_planning`, `02_plan`, `03_build`, `04_verify`, `05_operate` 구조를 쓸 때
19
- - 코드 수정과 함께 계획, 검증, 운영 흔적도 남겨야 할 때
20
-
21
- ### Canonical assets
22
-
23
- - Codex skill: `.codex/skills/sdd/`
24
- - Section map: `.codex/skills/sdd/references/section-map.md`
25
-
26
- ### Workflow
27
-
28
- 1. 먼저 `sdd/01_planning`의 관련 산출물을 확인하고 구현과 어긋난 부분을 바로잡는다.
29
- 2. `sdd/02_plan/<section>/`에 작업 계획을 만들거나 기존 계획을 갱신한다.
30
- - 배포가 범위에 있으면 acceptance criteria에 `DEV 게이트`, `동일한 PROD 게이트`, `전 레이어 테스트 범위`, `롤백 조건/절차`를 명시한다.
31
- - persistence, migration, repository, ORM mapping, SQL, 운영 장애 대응이 걸린 작업이면 acceptance criteria에 `DEV/PROD 스키마 검증`을 넣는다.
32
- 3. 코드 작업을 수행하고, 의미 있는 결정은 계획 로그에 남긴다.
33
- - 화면명세서에 있는 아이콘, 로고, 일러스트, 기타 정적 자산이 필요하면 먼저 저장소의 canonical `스펙에셋빌더`로 추출한다.
34
- - 템플릿 기준 canonical 경로는 `sdd/99_toolchain/01_automation/spec_asset_builder.py` 또는 그 wrapper/manifest다.
35
- - reusable asset planning 기록은 `sdd/01_planning/02_screen/assets/` 아래에 둔다.
36
- - source crop과 동일해야 하는 자산은 가능하면 `--verify-exact`까지 수행한다.
37
- - builder로 표현 가능한 자산을 수동 redraw나 screenshot crop으로 대체하지 않는다. 예외가 필요하면 plan/build/verify에 이유를 남긴다.
38
- - 화면 여백, 레이아웃 밀도, 타이포, 색상 리듬, 컴포넌트 hierarchy가 명세 기반이면 저장소의 canonical 디자인 가이드 빌더가 있는지 먼저 확인하고, 있으면 그 산출물을 기준으로 구현한다.
39
- - 템플릿 기준으로는 `sdd/99_toolchain/01_automation/README.md`와 실제 존재하는 builder/wrapper/manifest를 canonical source로 본다.
40
- - local screen exactness는 Playwright를 canonical automation gate로 본다.
41
- - 템플릿 기준 canonical runner는 `python3 sdd/99_toolchain/01_automation/run_playwright_exactness.py --suite <suite-id>`이고, suite registry는 `sdd/99_toolchain/01_automation/playwright_exactness_manifest.py`다.
42
- - direct `npx playwright test ...` 호출은 디버깅 예외로만 쓰고, retained verification command는 toolchain wrapper 기준으로 남긴다.
43
- - Playwright suite가 이미 있으면 Browser Use/manual screenshot/semantic extraction은 보조 진단일 뿐 canonical gate를 대체하지 않는다.
44
- - 배포가 범위에 있으면 DEV와 PROD에 같은 `전 레이어 검증 surface`를 재사용하도록 미리 정의한다. 앱/라우트, API, persistence/schema, 배치/워크플로, 연동, 헬스/모니터링을 포함한다.
45
- 4. 구현/실행 기록은 `sdd/03_build`에 남긴다.
46
- 5. 테스트와 검증 근거는 `sdd/04_verify`에 남긴다.
47
- - DEV 배포가 있으면 DEV에서 먼저 전 레이어 검증을 수행하고, 통과 전에는 PROD로 올리지 않는다.
48
- - PROD 배포 후에는 DEV와 같은 검증 surface를 PROD에서 다시 실행한다.
49
- - persistence 영향 가능성이 있으면 DEV/PROD 모두에서 실제 스키마, index, constraint, trigger, default, legacy column drift를 확인한다.
50
- - PROD 검증 실패 시 즉시 롤백 또는 승인된 복구 절차를 실행하고 결과를 남긴다.
51
- - Playwright suite가 있는 surface는 suite id, canonical runner command, screenshot/json artifact path를 verify에 함께 남긴다.
52
- 6. 실제 배포나 운영 후속이 있으면 `sdd/05_operate`에 남긴다.
53
- - 개발계 배포 baseline이 `main`이면 완료 전에 `main`에 merge, cherry-pick, 또는 동등한 방식으로 반영한 뒤 `origin/main`을 push한다.
54
- - side branch에만 올라간 상태로는 개발계 배포 완료로 보지 않는다.
55
- - DEV 게이트 통과 근거, PROD 게이트 통과 근거, 롤백이 있었다면 그 트리거와 결과를 함께 기록한다.
56
-
57
- ### Guardrails
58
-
59
- - `sdd/`가 있는 저장소에서 별도 `docs/` 트리를 만들지 않는다.
60
- - planning review 없이 바로 코드부터 수정하지 않는다.
61
- - 검증 결과를 채팅에만 남기지 말고 `sdd/04_verify`에도 기록한다.
62
- - PROD 배포가 범위일 때 DEV 전 레이어 검증이 통과하기 전에는 PROD 배포를 진행하지 않는다.
63
- - PROD 배포가 범위일 때 DEV보다 약한 범위로 PROD 검증을 축소하지 않는다. 예외는 사용자 승인과 문서화가 필요하다.
64
- - PROD 배포가 범위일 때 PROD는 "배포 성공"만으로 끝내지 않는다. 배포 후 동일 레이어 검증이 필수다.
65
- - PROD 배포가 범위일 때 실패한 PROD 배포를 미해결 상태로 두지 않는다. 롤백 또는 승인된 복구 절차를 실행하고 기록한다.
66
- - local 테스트 통과, model 코드, migration head만으로 DEV/PROD 스키마가 같다고 가정하지 않는다.
67
- - schema drift 가능성이 있는 작업에서 DEV/PROD 실제 스키마 확인을 생략하지 않는다.
68
- - canonical `스펙에셋빌더`가 있는 저장소에서는 화면명세 정적 자산을 수동 재작성하지 않는다.
69
- - 화면 관련 자동화 builder가 있으면 작은 UI 수정처럼 보여도 먼저 확인한다.
70
- - 배포가 없었으면 `05_operate`에 허위 기록을 남기지 않는다.
71
- - 개발계 배포 baseline이 `main`인 저장소에서 side branch push만으로 배포 완료라고 판단하지 않는다.
@@ -1,13 +0,0 @@
1
- ---
2
- name: sdd-development
3
- description: "Alias skill for the canonical SDD-first development workflow."
4
- user_invocable: true
5
- ---
6
-
7
- ## /sdd-development
8
-
9
- 이 파일은 alias skill이다. canonical SDD workflow는 [`../sdd-dev/SKILL.md`](../sdd-dev/SKILL.md)를 따른다.
10
-
11
- Rules:
12
- - planning, build, verify, operate 기록은 `sdd-dev` 규칙과 동일하다.
13
- - downstream repo는 `sdd-development` 이름을 그대로 복제할 수 있다.
@@ -1,4 +0,0 @@
1
- interface:
2
- display_name: "OTRO"
3
- short_description: "Overlap-tolerant residual orchestration"
4
- default_prompt: "Use $OTRO to analyze the repository globally, dispatch bounded Codex workers, reconcile residual overlap, and replan until repository-level gates are satisfied."