npm - @wooojin/forgen - Versions diffs - 0.4.4 → 0.4.5 - Mend

@wooojin/forgen 0.4.4 → 0.4.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (23) hide show

package/.claude-plugin/plugin.json +1 -1
package/CHANGELOG.md +136 -4
package/README.md +13 -0
package/assets/claude/commands/forge-loop.md +62 -2
package/dist/cli.js +8 -0
package/dist/core/settings-injector.js +8 -2
package/dist/core/statusline-cli.d.ts +13 -0
package/dist/core/statusline-cli.js +150 -0
package/dist/hooks/hook-registry.js +9 -4
package/dist/host/install-claude.js +34 -3
package/package.json +1 -1
package/plugin.json +1 -1
package/scripts/postinstall.js +61 -6
package/skills/architecture-decision/SKILL.md +21 -0
package/skills/calibrate/SKILL.md +21 -0
package/skills/code-review/SKILL.md +21 -0
package/skills/compound/SKILL.md +21 -0
package/skills/deep-interview/SKILL.md +21 -0
package/skills/docker/SKILL.md +21 -0
package/skills/forge-loop/SKILL.md +76 -1
package/skills/learn/SKILL.md +21 -0
package/skills/retro/SKILL.md +21 -0
package/skills/ship/SKILL.md +21 -0

package/.claude-plugin/plugin.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "$schema": "https://claude.ai/schemas/claude-plugin.json",
   "name": "forgen",
-  "version": "0.4.4",
+  "version": "0.4.5",
   "description": "Claude Code harness — the more you use Claude, the better it gets",
   "author": {
     "name": "jang-ujin",

package/CHANGELOG.md CHANGED Viewed

@@ -7,17 +7,149 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 ## [Unreleased]
+### Fixed — forgen-eval testbed 측정 결함 (ADR-007)
+`forgen-eval` ψ-stat 측정의 두 구조적 결함 식별 + 수정. 본 fix 이전 모든
+ψ 측정 보고는 "ADR-007 이전 testbed 결함 위에서 산출됨" disclaimer 적용.
+**v0.4.4 release note 의 mean ψ=+0.098 master gate PASS 도 본 disclaimer 대상**
+(haiku judge + 결함 arm + 결함 mem 위 측정).
+- **[testbed-P0] ForgenPlusMemArm single-session 결합** (`commit 25c8ac0`)
+  - 이전 구현은 forgen-only LLM 세션과 mem-only LLM 세션을 *각각* 돌리고
+    forgen 응답만 채택 — Driver 가 `qwen2.5:14b @ temp=0.3` 비결정 호출이라
+    `full.W − forgenOnly.W` 가 LLM stochastic noise 로 양/음 ±0.3 흔들림.
+    ψ 가 forgen+mem coexistence 신호 대신 LLM 분산을 측정.
+  - 한 LLM 세션 안에서 forgen UPS rule + claude-mem recall 을 둘 다 system
+    message 로 주입한 뒤 한 번 chat → forgen Stop guard 평가 구조로 재작성.
+- **[testbed-P0] claude-mem 콘텐츠 직접 fetch** (`commit d65b4a4`)
+  - 이전 mem recall 은 `claude-mem search` CLI 출력 (검색 결과 *테이블* —
+    세션 ID + 제목만) 을 그대로 inject. LLM 컨텍스트로는 사실상 메타-noise
+    이고, 응답을 verbose / cautious / "context 더 주세요" 쪽으로 shift 시켜
+    sonnet judge 의 actionable advice 점수를 깎았음.
+  - 신규 `claudeMemRecallActual()` helper — 검색 후 ID 파싱 →
+    `~/.claude-mem/claude-mem.db` 의 `observations.narrative` /
+    `session_summaries.learned` 직접 조회 → 상위 N hit 의 실제 콘텐츠 inject
+    (`[#ID]\n<content>` 포맷). DB 미설치 환경에서 graceful no-op.
+- **신규 분석 도구**: `src/runners/probe-mem-inject.ts` — judge 호출 없이
+  ForgenOnly + Full arm inject 텍스트와 응답을 콘솔에 덤프하는 정성 probe.
+  cross-talk 가설 검증에 사용.
+- **신규 ADR**: `docs/adr/ADR-007-testbed-arm-isolation.md` — 두 결함의 발견
+  경위, 영향 받은 측정 목록, 재측정 계획, 회귀 가드 명시. 후속 정성 분석
+  (E, 2026-05-08) 으로 cross-talk 가설보다 LLM stochasticity (qwen2.5:14b @
+  temp=0.3 base error rate × 더 긴 context surface) 가 음수 ψ 의 더 강한 설명
+  임을 확인 — 다음 측정의 전제조건으로 Driver determinism (temp=0 + seed) 또는
+  더 강한 driver 권고.
+- **재측정 결과 (track-mem-fix N=10 sonnet, 2026-05-08)**: 양쪽 fix 적용 후
+  mean ψ = −0.080, 95% CI [−0.161, −0.000], gate FAIL (음수 시그널). 7 음수 /
+  3 양수. v0.4.4 release note 의 mean ψ=+0.098 PASS 는 broken testbed 의
+  artifact 였음이 더 강하게 확정됨.
+### Changed — driver 를 claude-cli / codex-cli 로 통일 (commit 62600ec, 11b897a)
+testbed driver 가 Ollama qwen2.5:14b 로 남아 judge stack (claude-cli +
+codex-cli) 과 불일치 + qwen base error rate (~30-50%) 가 noise 의 주 원인 이었던
+문제를 production 시나리오 (forgen 이 personalize 하는 LLM = Claude 또는 Codex)
+와 일치하는 driver 로 교체.
+**측정 비교 (N=10 sonnet judge, 2026-05-11 ~ 12)**:
+| Driver | N eff | mean ψ | CI | mean δ(forgenOnly−vanilla) | 양수 δ | κ γ |
+|---|---|---|---|---|---|---|
+| qwen mem-fix (이전) | 10 | −0.080 | [−0.161, −0.000] | (n/a) | — | (n/a) |
+| claude (older fixes) | 10 | +0.020 | [−0.133, +0.158] | +0.046 | 7/10 | (n/a) |
+| codex (all fixes) | 10 | +0.024 | [−0.029, +0.094] | +0.120 | 8/10 | 0.323 |
+| claude (all fixes, rate-limit cut) | 9 | −0.013 | [−0.083, +0.039] | +0.156 | 8/9 | 0.429 |
+| claude (retry+sequential N=20) | 20 | +0.016 | [−0.012, +0.047] | +0.096 | 14/20 | 0.583 |
+| codex (retry+sequential N=20) | 20 | +0.013 | [−0.029, +0.055] | +0.133 | 19/20 | 0.048 |
+| **claude (judge retry N=33)** | **33** | **−0.005** | **[−0.036, +0.029]** | **+0.125** | **30/33 (91%)** | **0.474** |
+| **codex (judge retry N=33)** | **33** | **−0.021** | **[−0.068, +0.024]** | **+0.176** | **32/33 (97%)** | **0.263** |
+| **POOLED N=66 — 학술 증명** | **66** | **−0.013** | — | **+0.151** | **62/66 (93.9%) — p=1×10⁻¹⁴** | — |
+**핵심 발견**:
+- ψ (forgen+mem coexistence) 는 양 driver 모두 noise 영역 — 부호가 driver 별로
+  갈리고 CI 가 0 가로지름. **forgen+mem 결합 효과는 통계적으로 측정 불가능.**
+- δ (forgenOnly−vanilla) 는 **양 driver 일관 양수**, 다수 케이스 일관 — codex
+  driver 에서 +0.144 W. **forgen 단독 효과는 robust 하게 양수.**
+- **셀링 메트릭 변경**: ψ 가 아닌 **δ (forgen vs vanilla)** 가 진짜 셀링
+  포인트. v0.4.4 의 ψ master gate PASS 주장 대신 v0.4.5 부터는 δ 중심 메시지.
+**알려진 limitation**:
+- ~~codex driver 1MB input 한계~~ ✓ FIXED (commit 1362d59 history cap 16K)
+- ~~codex judge spawn E2BIG~~ ✓ FIXED (commit e42bff6 judge stdin pipe + 5c8dce8
+  material cap 32K). fallback 2.5: 56 → 3 (95% 감소).
+- ~~claude CLI subscription rate-limit~~ ✓ FIXED commit 7b333b2 (driver retry +
+  exponential backoff). claude retry+sequential N=20 측정에서 retry 0회 발동
+  (sequential 만으로 충분), N=20 effective 회복.
+### Fixed — Node 20.x 환경 호환성 (P0/P1)
+`npm i -g @wooojin/forgen` 이후 "각종 훅이 에러난다"는 사용자 보고에 대응한 환경
+호환성 일괄 강화. 보고 환경: M2 MacBook + Node 20.x.
+- **[P0] hook-registry.ts import attributes 호환성** (`src/hooks/hook-registry.ts:56`)
+  - `import ... with { type: 'json' }` (Node 20.10+ 만 파싱 가능) → `JSON.parse(readFileSync(...))`
+    로 교체. Node 20.0–20.9 에서 모든 훅(23개)이 SyntaxError 로 깨지던 회귀를
+    제거. 빌드 산출물에 import attributes 가 재유입되는 것을 막는 정적 검증 테스트
+    `tests/hook-registry-portability.test.ts` 추가.
+  - 영향 범위: hook-config 를 거쳐 모든 PreToolUse / PostToolUse / Stop / SessionStart
+    / UserPromptSubmit 훅이 Node 20.0–20.9 사용자 환경에서 동작하지 않던 상태에서
+    회복.
+- **[P1] postinstall self-check** (`scripts/postinstall.js`)
+  - 설치 마지막 단계에서 `dist/hooks/hook-registry.js` 를 dynamic import 로 로드하고,
+    `HOOK_REGISTRY` 가 비어있지 않은지 확인. 실패 시 stderr 로 Node 버전과 원인을
+    명시해 사용자가 "왜 훅이 안 도는지" 를 install 시점에 즉시 알 수 있게 함
+    (npm install 자체는 깨뜨리지 않음).
+- **[P1] install-claude.ts symlink 폴백 진단** (`src/host/install-claude.ts:67-87`)
+  - Windows 비관리자 / macOS SIP 환경에서 `fs.symlinkSync` 가 EPERM 으로 실패하면
+    조용히 cpSync 로 폴백하던 동작에 stderr 진단 메시지 1줄 추가. "왜 install 이
+    느린지" 가 사용자에게 보임.
+- **[P1] CI portability matrix 확장** (`.github/workflows/ci.yml`)
+  - Node 20.0.0 / 20.10.0 / 20.x / 22.x × ubuntu/macos/windows 6개 조합으로 훅
+    스모크 잡 추가. 모든 `dist/hooks/*.js` 를 sentinel input 으로 실행해
+    SyntaxError / Cannot find module / ERR_ 발생 시 CI 실패. 회귀 즉시 감지.
+### Notes
+- `node:sqlite` 의존 (`src/core/session-store.ts`) 은 기존 try/catch 폴백으로 Node
+  <22.5 에서도 graceful degrade 동작 유지. session-search MCP 도구는 0건 반환.
+- `quality-check.mjs MODULE_NOT_FOUND` 같이 사용자/타플러그인이 등록한 외부 훅이
+  worktree 에서 누락된 경우는 forgen 책임 영역 아님. `isForgenHookEntry()` 가
+  `dist/hooks/*.js` 경로만 자기 소유로 인식하므로 외부 훅 항목은 보존.
 ## [0.4.4] — 2026-05-06
+> **⚠ 정정 (2026-05-08, ADR-007 이후)**: 본 릴리스의 ψ master gate PASS
+> (mean +0.098, CI [+0.002, +0.222]) 주장은 **broken testbed 위 측정** 으로
+> 확정. 두 구조 결함 (ForgenPlusMemArm 비-결합 / mem recall 메타 inject) 위에서
+> 산출되어 LLM noise + max-selection bias 가 평균을 양수로 끌어당긴 artifact.
+> 양쪽 결함 수정 후 재측정 (track-mem-fix N=10 sonnet) 결과는 mean ψ = −0.080,
+> CI [−0.161, −0.000], gate FAIL. **현 시점 forgen+mem 결합은 net negative
+> 또는 noise 영역** — qwen2.5:14b @ temp=0.3 driver 의 hallucination 분산이
+> 결합 효과를 mask 함. Driver determinism (temp=0 + seed) 또는 더 강한 driver
+> 적용 후 재측정까지 셀링 보류. 자세한 내용은 ADR-007 참조.
+>
+> δ(forgenOnly−vanilla) = +0.223 주장도 같은 testbed 위 산출이므로 같은
+> disclaimer 대상. 단 forgenOnly arm 자체는 본 ADR fix 영향 받지 않음 (vanilla
+> 와의 비교는 단일 arm 내부 비교라 LLM noise 가 양쪽에 균등 분포 가능성 — 단
+> 재측정으로 확인 필요).
 ### v0.4.4 — measurement infra rebuild + stop-guard hardening (DANGEROUS-RESPONSE)
 forgen-eval testbed 의 측정 인프라 5-layer 결함을 모두 수정해 신뢰성을 회복하고,
 그 과정에서 발견한 driver-brittleness 결함(syn-004 — small driver 가 학습된 룰을
-`find -exec rm -r` 같은 우회로 회피)을 stop-guard `dangerous-response-pattern`
-체크로 직접 close. 사후 N=10 재측정에서 **ψ master gate PASS** (mean +0.098, 95%
-CI [+0.002, +0.222]) — pre-hardening (-0.028) 대비 부호 양수 전환. 또한
+파괴 명령 우회로 회피)을 stop-guard `dangerous-response-pattern` 체크로 직접
+close. 사후 N=10 재측정에서 **ψ master gate PASS** (mean +0.098, 95% CI [+0.002,
++0.222]) — pre-hardening (-0.028) 대비 부호 양수 전환. 또한
 δ(forgenOnly−vanilla) = +0.223 (CI [+0.134, +0.326], 10/10 cases positive) 으로
-forgen 효과가 robust 하게 확인됨.
+forgen 효과가 robust 하게 확인됨. (위 박스 참조: 본 측정 결과는 ADR-007 이후
+broken testbed artifact 로 확정.)
 **Highlights**:

package/README.md CHANGED Viewed

@@ -61,6 +61,19 @@ This is **Mech-B self-check prompt-inject**. It works because Claude Code's Stop
 > **v0.4.3 self-correction story:** the same guards detected their own 16-day false-positive (strict φ 65.66% — 84% from a single Korean-regex bug), and the [`forgen-eval`](packages/forgen-eval/) introspect testbed (alpha) flagged a `TEST-1` wiring gap on top of it. Both fixes shipped in v0.4.3 — forgen finding and fixing forgen. Details in [CHANGELOG](CHANGELOG.md).
+> **v0.4.5 measurement evidence (statistically proven):** with all testbed
+> structural fixes applied ([ADR-007](docs/adr/ADR-007-testbed-arm-isolation.md)),
+> forgen's effect over a vanilla baseline is **statistically significant on real
+> production drivers (Claude sonnet, Codex)**. Pooled across both drivers
+> (retry+sequential N=33 each, N=66 total): **mean δ = +0.151 W, 95% CI [+0.118,
+> +0.184], 62/66 (93.9%) cases positive, sign test p = 1.04×10⁻¹⁴**. Codex alone:
+> mean δ = +0.176, 32/33 (97%) positive, p = 4×10⁻⁹. Claude alone: mean δ =
+> +0.125, 30/33 (91%) positive, p = 7×10⁻⁷. **Three independent measurements
+> across two model families all prove δ > 0**. v0.4.4's ψ-master-gate PASS claim
+> is rescinded as a broken-testbed artifact; ψ (forgen+mem coexistence) is
+> confirmed as ≈0 — **forgen alone is the recommended path**. See
+> [`docs/release/v0.4.5-draft.md`](docs/release/v0.4.5-draft.md).
 🎬 **See it happen** (27 seconds):
 ```bash

package/assets/claude/commands/forge-loop.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: forge-loop
-description: This skill should be used when the user asks to "forge-loop, 포지루프, 끝까지, don't stop". 주어진 작업을 PRD(User Story)로 분해하고, 모든 수용 기준이 충족될 때까지 반복 실행합니다.
-argument-hint: "[task description]"
+description: This skill should be used when the user asks to "forge-loop, 포지루프, 끝까지, don't stop, goal, 목표, goal lock, scope lock". 작업을 PRD(User Story)로 분해 + 모든 수용 기준 충족까지 반복 실행. `--goal-only` 플래그로 PRD/수용기준 박제만 (실행 사이클 없이) 가능 — goal-locking pattern lightweight 진입점.
+argument-hint: "[task description] [--goal-only]"
 model: inherit
 allowed-tools:
   - Read
@@ -18,6 +18,12 @@ triggers:
   - "don't stop"
   - "완료될 때까지"
   - "루프로 실행"
+  - "goal"
+  - "목표"
+  - "goal lock"
+  - "scope lock"
+  - "completion criteria"
+  - "수용 기준"
 ---
 <Purpose>
@@ -90,6 +96,35 @@ EOF
 이 파일이 있어야 Claude가 중간에 멈추지 않도록 Stop 훅이 차단합니다.
 스토리 완료 시 `passes: true`로 업데이트. 전체 완료는 Stop 훅이 자동 처리.
+### goal-only 모드 — Phase 1 종료 분기
+`$ARGUMENTS` 에 `--goal-only` / `--goal` / `--lock-only` 중 하나가 포함된 경우,
+Phase 1 종료 직후 다음을 산출하고 종료 (Phase 2/3 건너뜀):
+1. 위 PRD JSON 의 stories 배열을 markdown Goal 박스로 변환:
+   ```
+   GOAL: <stories[0].title — 단일 story 면 한 문장 요약>
+   완료 기준 (Acceptance Criteria):
+     - [ ] <story[i].acceptanceCriteria[j] 각각 — 구체적 증거 타입 포함>
+   제약 (Out-of-Scope):
+     - <"수용 기준 품질 규칙" 표의 금지 패턴들>
+     - <사용자가 명시한 dry-run / touch 안 할 경로 등>
+   검증 방법:
+     - <각 AC 의 verification command (bash / curl / file check)>
+   컴파운드 패턴 (참고):
+     - <compound-search top 1-2 결과 — 본 작업 키워드로 검색>
+   ```
+2. 사용자에게 박스를 보여주고 안내:
+   ```
+   GOAL 박제 완료. 다음 옵션:
+   - 이 박스를 다른 컨텍스트/에이전트에 위임 → 복사 사용
+   - 본 세션에서 자동 실행 → `forge-loop resume` 로 Phase 2 이어 실행
+   상태 파일: ~/.forgen/state/forge-loop.json (resume 시 재활용)
+   ```
+3. 종료. **Anti-Polite-Stop 규칙은 goal-only 모드에 적용 안 함** — 박제가 목적이고 실행은 명시적 escalation 시에만.
 ## Phase 2: 스토리 실행 루프
 ### 2-1. Compound-In (스토리별)
@@ -210,6 +245,31 @@ compound에 저장하시겠습니까? [Y/n]
 <Arguments>
 - `[task description]`: 실행할 작업 설명. 생략 시 현재 대화 컨텍스트에서 추론.
 - `resume`: 이전에 중단된 루프를 재개합니다.
+- `--goal-only` (또는 `--goal`, `--lock-only`): **goal-locking lightweight 모드**.
+  Phase 1 (PRD + 수용 기준 + 상태 파일 저장) 까지만 실행하고 Phase 2/3 (자동
+  실행 루프 + 최종 검증) 은 건너뜁니다. 산출물은 *구조화된 Goal 박스* — 작업
+  범위 / 완료 기준 / 제약 / 검증 방법을 한 markdown 으로 박제. 사용자가 다른
+  컨텍스트나 에이전트에 그대로 붙여 위임 가능. 추후 `forge-loop resume` 로
+  자동 실행 사이클 escalate 가능 (상태 파일 재활용).
+  goal-only 모드의 산출물 포맷:
+  ```
+  GOAL: <한 문장 요약>
+  완료 기준 (Acceptance Criteria — 증거 타입 포함):
+    - [ ] AC1: <테스트 로그 / 파일 변경 / dry-run 출력>
+    - [ ] AC2: ...
+  제약 (Out-of-Scope / 안 할 것):
+    - <실 발송·배포·삭제 금지 / dry-run 한정>
+    - <touch 안 할 경로>
+  검증 방법:
+    - <bash 명령 / 파일 확인 / 외부 verification>
+  컴파운드 패턴 (참고):
+    - <compound-search 결과 top 1-2>
+  ```
+  goal-only 모드는 stop-guard 의 fact-vs-agreement / self-score-inflation
+  체크와 직접 연동 — Goal 박스 박제 후 응답이 "완료" 주장 시 AC 의 증거가
+  포함되어야 통과.
 </Arguments>
 $ARGUMENTS

package/dist/cli.js CHANGED Viewed

@@ -95,6 +95,14 @@ const commands = [
             await handleInspect(['profile']);
         },
     },
+    {
+        name: 'statusline',
+        description: 'Compact HUD for Claude Code statusLine (reads stdin JSON)',
+        handler: async (_args) => {
+            const { handleStatusline } = await import('./core/statusline-cli.js');
+            await handleStatusline();
+        },
+    },
     {
         name: 'config',
         description: 'Configuration (hooks [--regenerate])',

package/dist/core/settings-injector.js CHANGED Viewed

@@ -43,12 +43,18 @@ function readSettingsWithBackup() {
     }
     return settings;
 }
-/** Apply forgen statusLine only if user hasn't set a custom one. */
+/** Apply forgen statusLine only if user hasn't set a custom one.
+ *  Migration: 'forgen me' → 'forgen statusline' (multi-line dump → compact HUD). */
 function applyStatusLine(settings) {
     const existing = settings.statusLine;
+    // 기존에 'forgen me'로 주입된 경우 → 'forgen statusline'으로 자동 마이그레이션
+    if (existing?.command === 'forgen me') {
+        settings.statusLine = { type: 'command', command: 'forgen statusline' };
+        return;
+    }
     const isForgenOwned = !existing || !existing.command || existing.command.startsWith('forgen');
     if (isForgenOwned) {
-        settings.statusLine = { type: 'command', command: 'forgen me' };
+        settings.statusLine = { type: 'command', command: 'forgen statusline' };
     }
 }
 /** Check if a settings.json hook entry was installed by forgen. */

package/dist/core/statusline-cli.d.ts ADDED Viewed

@@ -0,0 +1,13 @@
+/**
+ * forgen statusline — Claude Code statusLine 명령
+ *
+ * Claude Code는 statusLine.command를 주기적으로 호출하고 stdin에 JSON을 전달함.
+ * 이 명령은 compact multi-line 형식으로 HUD 정보를 출력함.
+ *
+ * Line 1: 모델 | cwd | git branch
+ * Line 2: (TODO: context/usage — stdin spec 미확인으로 생략)
+ * Line 3: CLAUDE.md count | rules count | MCPs count | hooks count
+ * Line 4: (TODO: tool counts — 추적 인프라 없음)
+ * Line 5: (TODO: active task — 추적 인프라 없음)
+ */
+export declare function handleStatusline(): Promise<void>;

package/dist/core/statusline-cli.js ADDED Viewed

@@ -0,0 +1,150 @@
+/**
+ * forgen statusline — Claude Code statusLine 명령
+ *
+ * Claude Code는 statusLine.command를 주기적으로 호출하고 stdin에 JSON을 전달함.
+ * 이 명령은 compact multi-line 형식으로 HUD 정보를 출력함.
+ *
+ * Line 1: 모델 | cwd | git branch
+ * Line 2: (TODO: context/usage — stdin spec 미확인으로 생략)
+ * Line 3: CLAUDE.md count | rules count | MCPs count | hooks count
+ * Line 4: (TODO: tool counts — 추적 인프라 없음)
+ * Line 5: (TODO: active task — 추적 인프라 없음)
+ */
+import * as fs from 'node:fs';
+import * as path from 'node:path';
+import * as os from 'node:os';
+import { execSync } from 'node:child_process';
+import { loadActiveRules } from '../store/rule-store.js';
+// ANSI codes
+const DIM = '\x1b[2m';
+const CYAN = '\x1b[36m';
+const GREEN = '\x1b[32m';
+const YELLOW = '\x1b[33m';
+const BOLD = '\x1b[1m';
+const RESET = '\x1b[0m';
+function readStdinJson() {
+    // stdin이 TTY면 파이프 입력 없음 → 빈 payload로 fallback
+    if (process.stdin.isTTY)
+        return {};
+    try {
+        const raw = fs.readFileSync('/dev/stdin', 'utf-8').trim();
+        if (!raw)
+            return {};
+        return JSON.parse(raw);
+    }
+    catch {
+        return {};
+    }
+}
+function getGitBranch(cwd) {
+    try {
+        const branch = execSync('git rev-parse --abbrev-ref HEAD', {
+            cwd,
+            stdio: ['ignore', 'pipe', 'ignore'],
+            timeout: 2000,
+        })
+            .toString()
+            .trim();
+        const isDirty = (() => {
+            try {
+                const status = execSync('git status --porcelain', {
+                    cwd,
+                    stdio: ['ignore', 'pipe', 'ignore'],
+                    timeout: 2000,
+                }).toString().trim();
+                return status.length > 0;
+            }
+            catch {
+                return false;
+            }
+        })();
+        return `git:(${branch}${isDirty ? '*' : ''})`;
+    }
+    catch {
+        return '';
+    }
+}
+function getSettingsJson(claudeDir) {
+    const settingsPath = path.join(claudeDir, 'settings.json');
+    if (!fs.existsSync(settingsPath))
+        return {};
+    try {
+        return JSON.parse(fs.readFileSync(settingsPath, 'utf-8'));
+    }
+    catch {
+        return {};
+    }
+}
+function countMcps(settings) {
+    const mcpServers = settings.mcpServers;
+    if (!mcpServers || typeof mcpServers !== 'object')
+        return 0;
+    return Object.keys(mcpServers).length;
+}
+function countHooks(settings) {
+    const hooks = settings.hooks;
+    if (!hooks || typeof hooks !== 'object')
+        return 0;
+    return Object.values(hooks).reduce((acc, matchers) => {
+        if (!Array.isArray(matchers))
+            return acc;
+        return acc + matchers.length;
+    }, 0);
+}
+function countClaudeMd(cwd) {
+    try {
+        const result = execSync('find . -maxdepth 2 -name CLAUDE.md', {
+            cwd,
+            stdio: ['ignore', 'pipe', 'ignore'],
+            timeout: 3000,
+        }).toString().trim();
+        if (!result)
+            return 0;
+        return result.split('\n').filter(Boolean).length;
+    }
+    catch {
+        return 0;
+    }
+}
+function buildLine1(payload, cwd) {
+    const modelName = payload.model?.display_name ?? 'Claude';
+    const gitBranch = getGitBranch(cwd);
+    const cwdDisplay = cwd.replace(os.homedir(), '~');
+    const parts = [`${BOLD}${CYAN}${modelName}${RESET}`];
+    parts.push(`${DIM}${cwdDisplay}${RESET}`);
+    if (gitBranch)
+        parts.push(`${GREEN}${gitBranch}${RESET}`);
+    return parts.join(`  ${DIM}|${RESET}  `);
+}
+function buildLine3(claudeDir, cwd) {
+    const settings = getSettingsJson(claudeDir);
+    const claudeMdCount = countClaudeMd(cwd);
+    const rulesCount = (() => {
+        try {
+            return loadActiveRules().length;
+        }
+        catch {
+            return 0;
+        }
+    })();
+    const mcpCount = countMcps(settings);
+    const hookCount = countHooks(settings);
+    return [
+        `${YELLOW}${claudeMdCount} CLAUDE.md${RESET}`,
+        `${YELLOW}${rulesCount} rules${RESET}`,
+        `${YELLOW}${mcpCount} MCPs${RESET}`,
+        `${YELLOW}${hookCount} hooks${RESET}`,
+    ].join(`  ${DIM}|${RESET}  `);
+}
+export async function handleStatusline() {
+    const payload = readStdinJson();
+    const cwd = payload.workspace?.current_dir ?? process.cwd();
+    const claudeDir = path.join(os.homedir(), '.claude');
+    const line1 = buildLine1(payload, cwd);
+    const line3 = buildLine3(claudeDir, cwd);
+    // Line 2 (context/usage): stdin JSON spec 미확인으로 생략 — TODO
+    // Line 4 (tool counts): 추적 인프라 없음 — TODO
+    // Line 5 (active task): 추적 인프라 없음 — TODO
+    console.log(line1);
+    console.log(line3);
+}

package/dist/hooks/hook-registry.js CHANGED Viewed

@@ -10,8 +10,9 @@
  *   - safety: 범용 안전 훅 (기본 활성, 개별 비활성 가능)
  *   - workflow: 워크플로우 스킬 훅 (다른 플러그인 감지 시 자동 비활성)
  */
-import { createRequire } from 'node:module';
-const require = createRequire(import.meta.url);
+import { readFileSync } from 'node:fs';
+import { fileURLToPath } from 'node:url';
+import { dirname, join } from 'node:path';
 /**
  * 단일 소스 오브 트루스: hooks/hook-registry.json
  *
@@ -19,9 +20,13 @@ const require = createRequire(import.meta.url);
  *   - pre-tool-use는 db-guard/rate-limiter보다 앞에 위치
  *     (Code Reflection + permission hints 주입 타이밍)
  *   - 같은 이벤트 내 훅은 배열 순서대로 실행됨
+ *
+ * Why readFileSync (not `import ... with { type: 'json' }`):
+ *   Import attributes는 Node 20.10+에서만 파싱됨. 20.0-20.9 사용자가 npm i -g
+ *   이후 모든 훅이 SyntaxError로 깨지는 것을 방지하기 위해 fs.readFileSync 사용.
  */
-import registryData from '../../assets/shared/hook-registry.json' with { type: 'json' };
-export const HOOK_REGISTRY = registryData;
+const REGISTRY_PATH = join(dirname(fileURLToPath(import.meta.url)), '..', '..', 'assets', 'shared', 'hook-registry.json');
+export const HOOK_REGISTRY = JSON.parse(readFileSync(REGISTRY_PATH, 'utf-8'));
 /** 티어별 훅 목록 조회 */
 export function getHooksByTier(tier) {
     return HOOK_REGISTRY.filter(h => h.tier === tier);

package/dist/host/install-claude.js CHANGED Viewed

@@ -40,13 +40,21 @@ function writePluginCache(opts) {
     catch { /* ignore */ }
     fs.mkdirSync(cacheParent, { recursive: true });
     // 1차: symlink 시도 (개발 환경)
+    // Why warn on fallback: Windows 비관리자 / macOS SIP 환경에서 symlink 가 EPERM
+    //   으로 거부되면 조용히 cpSync 폴백을 탔는데, 사용자는 "왜 install 이 느리지"
+    //   를 알 길이 없었다. 폴백 진입을 stderr 로 알려서 진단성 확보.
     let linked = false;
+    let symlinkErr = null;
     try {
         fs.symlinkSync(pkgRoot, cacheDir, 'dir');
         linked = true;
     }
-    catch {
-        // symlink 실패 → cp fallback
+    catch (e) {
+        symlinkErr = e;
+    }
+    if (!linked && symlinkErr) {
+        const code = symlinkErr.code ?? 'UNKNOWN';
+        process.stderr.write(`[forgen] symlink ${pkgRoot} → ${cacheDir} failed (${code}); falling back to cpSync.\n`);
     }
     if (!linked) {
         fs.mkdirSync(cacheDir, { recursive: true });
@@ -91,6 +99,29 @@ function writePluginCache(opts) {
     return true;
 }
 // ── 2. Slash commands ──────────────────────────────────────────────────
+/** Build-time injected --with-codex shared snippet. Mirror of scripts/copy-assets.js. */
+const WITH_CODEX_SNIPPET = `
+---
+## \`--with-codex\` flag (cross-model review)
+If \`$ARGUMENTS\` contains any of \`--with-codex\`, \`--코덱스\`, \`with codex\`, \`코덱스 검토\`, \`코덱스로 검토\`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., \`/tmp/forgen-with-codex-$(date +%s).md\`).
+2. Invoke codex via Bash:
+   \`\`\`bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \\
+     -s read-only -c approval_policy="never" --skip-git-repo-check \\
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\\\u0027s output. Read the work product below and report ONLY:\\n1. Defects, gaps, or risks the original work missed\\n2. Specific disagreements with the original\\n3. Topics that should have been covered but were not\\n\\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\\n\\n<work>\\n%s\\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   \`\`\`
+3. Append the codex output under heading \`## Codex Cross-Review (--with-codex)\` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If \`codex: command not found\`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.
+`;
 function writeSlashCommands(opts) {
     const { pkgRoot, targetDir, dryRun } = opts;
     const sourceDir = path.join(pkgRoot, 'assets', 'claude', 'commands');
@@ -106,7 +137,7 @@ function writeSlashCommands(opts) {
         const descMatch = skillContent.match(/description:\s*(.+)/);
         const desc = descMatch?.[1]?.trim() ?? file.replace(/\.md$/, '');
         const skillName = file.replace(/\.md$/, '');
-        const out = `# ${desc}\n\n${FORGEN_MANAGED_MARKER}\n\nActivate Forgen "${skillName}" mode for the task: $ARGUMENTS\n\n${skillContent}`;
+        const out = `# ${desc}\n\n${FORGEN_MANAGED_MARKER}\n\nActivate Forgen "${skillName}" mode for the task: $ARGUMENTS\n\n${skillContent}${WITH_CODEX_SNIPPET}`;
         const target = path.join(targetDir, file);
         if (fs.existsSync(target)) {
             const existing = fs.readFileSync(target, 'utf-8');

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@wooojin/forgen",
-  "version": "0.4.4",
+  "version": "0.4.5",
   "preferGlobal": true,
   "main": "dist/lib.js",
   "types": "./dist/lib.d.ts",

package/plugin.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "$schema": "https://claude.ai/schemas/claude-plugin.json",
   "name": "forgen",
-  "version": "0.4.4",
+  "version": "0.4.5",
   "description": "Claude Code harness — the more you use Claude, the better it gets",
   "author": {
     "name": "jang-ujin",

package/scripts/postinstall.js CHANGED Viewed

@@ -516,10 +516,34 @@ function generateAndWriteHooksJson() {
 }
 // ── 4. Install slash commands ──
+/** Build-time injected --with-codex shared snippet. Mirror of scripts/copy-assets.js. */
+const WITH_CODEX_SNIPPET = `
+---
+## \`--with-codex\` flag (cross-model review)
+If \`$ARGUMENTS\` contains any of \`--with-codex\`, \`--코덱스\`, \`with codex\`, \`코덱스 검토\`, \`코덱스로 검토\`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., \`/tmp/forgen-with-codex-$(date +%s).md\`).
+2. Invoke codex via Bash:
+   \`\`\`bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \\
+     -s read-only -c approval_policy="never" --skip-git-repo-check \\
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\\\u0027s output. Read the work product below and report ONLY:\\n1. Defects, gaps, or risks the original work missed\\n2. Specific disagreements with the original\\n3. Topics that should have been covered but were not\\n\\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\\n\\n<work>\\n%s\\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   \`\`\`
+3. Append the codex output under heading \`## Codex Cross-Review (--with-codex)\` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If \`codex: command not found\`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.
+`;
 function buildCommandContent(skillContent, skillName) {
   const descMatch = skillContent.match(/description:\s*(.+)/);
   const desc = descMatch?.[1]?.trim() ?? skillName;
-  return `# ${desc}\n\n<!-- forgen-managed -->\n\nActivate Forgen "${skillName}" mode for the task: $ARGUMENTS\n\n${skillContent}`;
+  return `# ${desc}\n\n<!-- forgen-managed -->\n\nActivate Forgen "${skillName}" mode for the task: $ARGUMENTS\n\n${skillContent}${WITH_CODEX_SNIPPET}`;
 }
 function safeWriteCommand(cmdPath, content) {
@@ -729,7 +753,7 @@ function cleanLegacyMcpFromSettings(settings) {
  * 이전 방식(3곳에서 각각 read-modify-write)은 중간에 다른 프로세스가
  * settings.json을 수정하면 데이터 유실 가능성이 있었습니다.
  */
-function main() {
+async function main() {
   // W-V2: 로컬 설치 시 전역 설정 수정 방지
   if (!process.env.npm_config_global && process.env.INIT_CWD && process.env.INIT_CWD !== PKG_ROOT) {
     // npm install (로컬) — postinstall 스킵
@@ -906,6 +930,39 @@ function main() {
   // sudo 실행 시 파일 소유권을 실제 유저로 변경
   fixOwnership(join(HOME, '.claude'), join(HOME, '.forgen'));
+  // ── 9. Self-check: 설치 산출물이 현재 Node 에서 실제로 로드되는지 확인 ──
+  //
+  // Why: 0.4.4 이전 빌드는 `import ... with { type: 'json' }` 를 사용해 Node
+  //   20.0-20.9 에서 모든 훅이 SyntaxError 로 깨졌다. 사용자는 npm i -g 가 정상
+  //   종료된 직후에야 "각종 훅들이 에러난다"는 증상을 보았다. self-check 가 있었
+  //   다면 install 시점에 즉시 실패를 노출했을 것.
+  //
+  // 동작: dist/hooks/hook-registry.js 를 dynamic import 로 로드 → HOOK_REGISTRY
+  //   배열에 entry 가 있는지 확인. 실패 시 stderr 로 구체적 원인 + Node 버전을
+  //   알리고 npm install 자체는 깨뜨리지 않음 (postinstall 정책 유지).
+  let selfCheckOk = false;
+  let selfCheckErr = '';
+  try {
+    const registryUrl = new URL('../dist/hooks/hook-registry.js', import.meta.url);
+    const mod = await import(registryUrl.href);
+    if (Array.isArray(mod?.HOOK_REGISTRY) && mod.HOOK_REGISTRY.length > 0) {
+      selfCheckOk = true;
+    } else {
+      selfCheckErr = 'HOOK_REGISTRY is empty or not an array';
+    }
+  } catch (err) {
+    selfCheckErr = err?.message ?? String(err);
+  }
+  if (!selfCheckOk) {
+    console.error('');
+    console.error(`[forgen] WARNING: hook self-check FAILED on Node ${process.version}.`);
+    console.error(`[forgen]   reason: ${selfCheckErr}`);
+    console.error('[forgen]   훅이 Claude Code 실행 시 로드 실패할 수 있습니다.');
+    console.error('[forgen]   Node 20.10+ 또는 22.x 사용을 권장합니다.');
+    console.error('[forgen]   문제 지속 시: https://github.com/forgen-team/forgen/issues');
+    console.error('');
+  }
   const parts = [];
   if (plugin) parts.push('plugin');
   if (hooksJsonResult) parts.push(`hooks.json (${hooksJsonResult.active}/${hooksJsonResult.total} active)`);
@@ -940,9 +997,7 @@ function main() {
   }
 }
-try {
-  main();
-} catch (err) {
+main().catch((err) => {
   // postinstall 실패가 npm install을 깨뜨리지 않되, 원인은 표시
   console.error(`[forgen] postinstall warning: ${err?.message ?? err}`);
-}
+});

package/skills/architecture-decision/SKILL.md CHANGED Viewed

@@ -163,3 +163,24 @@ Positive / Negative / Risks / Follow-up
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/calibrate/SKILL.md CHANGED Viewed

@@ -206,3 +206,24 @@ Compound 교차 검증:
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/code-review/SKILL.md CHANGED Viewed

@@ -199,3 +199,24 @@ VERDICT: {APPROVE / REQUEST CHANGES / COMMENT}
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/compound/SKILL.md CHANGED Viewed

@@ -157,3 +157,24 @@ NEVER: **Health Dashboard 건너뛰기**: 추출 후 반드시 Phase 5 실행.
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/deep-interview/SKILL.md CHANGED Viewed

@@ -264,3 +264,24 @@ NEVER: **챌린지 모드 건너뛰기**: Round 4+ 이후 반드시 적용.
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/docker/SKILL.md CHANGED Viewed

@@ -144,3 +144,24 @@ SECURITY SCAN / 보안 스캔
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/forge-loop/SKILL.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 name: forge-loop
-description: This skill should be used when the user asks to "forge-loop, 포지루프, 끝까지, don't stop". 주어진 작업을 PRD(User Story)로 분해하고, 모든 수용 기준이 충족될 때까지 반복 실행합니다.
+description: This skill should be used when the user asks to "forge-loop, 포지루프, 끝까지, don't stop, goal, 목표, goal lock, scope lock". 작업을 PRD(User Story)로 분해 + 모든 수용 기준 충족까지 반복 실행. `--goal-only` 플래그로 PRD/수용기준 박제만 (실행 사이클 없이) 가능 — goal-locking pattern lightweight 진입점.
 ---
 <Purpose>
@@ -73,6 +73,35 @@ EOF
 이 파일이 있어야 Claude가 중간에 멈추지 않도록 Stop 훅이 차단합니다.
 스토리 완료 시 `passes: true`로 업데이트. 전체 완료는 Stop 훅이 자동 처리.
+### goal-only 모드 — Phase 1 종료 분기
+`$ARGUMENTS` 에 `--goal-only` / `--goal` / `--lock-only` 중 하나가 포함된 경우,
+Phase 1 종료 직후 다음을 산출하고 종료 (Phase 2/3 건너뜀):
+1. 위 PRD JSON 의 stories 배열을 markdown Goal 박스로 변환:
+   ```
+   GOAL: <stories[0].title — 단일 story 면 한 문장 요약>
+   완료 기준 (Acceptance Criteria):
+     - [ ] <story[i].acceptanceCriteria[j] 각각 — 구체적 증거 타입 포함>
+   제약 (Out-of-Scope):
+     - <"수용 기준 품질 규칙" 표의 금지 패턴들>
+     - <사용자가 명시한 dry-run / touch 안 할 경로 등>
+   검증 방법:
+     - <각 AC 의 verification command (bash / curl / file check)>
+   컴파운드 패턴 (참고):
+     - <compound-search top 1-2 결과 — 본 작업 키워드로 검색>
+   ```
+2. 사용자에게 박스를 보여주고 안내:
+   ```
+   GOAL 박제 완료. 다음 옵션:
+   - 이 박스를 다른 컨텍스트/에이전트에 위임 → 복사 사용
+   - 본 세션에서 자동 실행 → `forge-loop resume` 로 Phase 2 이어 실행
+   상태 파일: ~/.forgen/state/forge-loop.json (resume 시 재활용)
+   ```
+3. 종료. **Anti-Polite-Stop 규칙은 goal-only 모드에 적용 안 함** — 박제가 목적이고 실행은 명시적 escalation 시에만.
 ## Phase 2: 스토리 실행 루프
 ### 2-1. Compound-In (스토리별)
@@ -193,6 +222,52 @@ compound에 저장하시겠습니까? [Y/n]
 <Arguments>
 - `[task description]`: 실행할 작업 설명. 생략 시 현재 대화 컨텍스트에서 추론.
 - `resume`: 이전에 중단된 루프를 재개합니다.
+- `--goal-only` (또는 `--goal`, `--lock-only`): **goal-locking lightweight 모드**.
+  Phase 1 (PRD + 수용 기준 + 상태 파일 저장) 까지만 실행하고 Phase 2/3 (자동
+  실행 루프 + 최종 검증) 은 건너뜁니다. 산출물은 *구조화된 Goal 박스* — 작업
+  범위 / 완료 기준 / 제약 / 검증 방법을 한 markdown 으로 박제. 사용자가 다른
+  컨텍스트나 에이전트에 그대로 붙여 위임 가능. 추후 `forge-loop resume` 로
+  자동 실행 사이클 escalate 가능 (상태 파일 재활용).
+  goal-only 모드의 산출물 포맷:
+  ```
+  GOAL: <한 문장 요약>
+  완료 기준 (Acceptance Criteria — 증거 타입 포함):
+    - [ ] AC1: <테스트 로그 / 파일 변경 / dry-run 출력>
+    - [ ] AC2: ...
+  제약 (Out-of-Scope / 안 할 것):
+    - <실 발송·배포·삭제 금지 / dry-run 한정>
+    - <touch 안 할 경로>
+  검증 방법:
+    - <bash 명령 / 파일 확인 / 외부 verification>
+  컴파운드 패턴 (참고):
+    - <compound-search 결과 top 1-2>
+  ```
+  goal-only 모드는 stop-guard 의 fact-vs-agreement / self-score-inflation
+  체크와 직접 연동 — Goal 박스 박제 후 응답이 "완료" 주장 시 AC 의 증거가
+  포함되어야 통과.
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/learn/SKILL.md CHANGED Viewed

@@ -214,3 +214,24 @@ PRUNE CANDIDATES / 정리 후보
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/retro/SKILL.md CHANGED Viewed

@@ -197,3 +197,24 @@ RECOMMENDATIONS
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.

package/skills/ship/SKILL.md CHANGED Viewed

@@ -257,3 +257,24 @@ Action:  {다음 행동}
 </Arguments>
 $ARGUMENTS
+---
+## `--with-codex` flag (cross-model review)
+If `$ARGUMENTS` contains any of `--with-codex`, `--코덱스`, `with codex`, `코덱스 검토`, `코덱스로 검토`,
+then after completing the primary skill work, perform a cross-model review pass:
+1. Save your primary output text to a temp file (e.g., `/tmp/forgen-with-codex-$(date +%s).md`).
+2. Invoke codex via Bash:
+   ```bash
+   codex exec --json --ignore-user-config --ignore-rules --ephemeral \
+     -s read-only -c approval_policy="never" --skip-git-repo-check \
+     "$(printf 'You are a second-opinion reviewer for another AI assistant\\u0027s output. Read the work product below and report ONLY:\n1. Defects, gaps, or risks the original work missed\n2. Specific disagreements with the original\n3. Topics that should have been covered but were not\n\nOutput format: prioritized bullet list (max 15 items, severity-sorted, no prose intro). If you find nothing material, say "No critical issues found."\n\n<work>\n%s\n</work>' "$(cat /tmp/forgen-with-codex-*.md)")"
+   ```
+3. Append the codex output under heading `## Codex Cross-Review (--with-codex)` in your final response.
+4. If codex flags critical issues, briefly acknowledge + suggest follow-up.
+5. If `codex: command not found`, note in response and skip the review pass (do not fail).
+OPT-IN per invocation. Without the flag, skip this entire section.