@wooojin/forgen 0.3.2 → 0.4.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude-plugin/plugin.json +1 -1
- package/CHANGELOG.md +94 -0
- package/README.ja.md +119 -8
- package/README.ko.md +73 -2
- package/README.md +163 -9
- package/README.zh.md +87 -7
- package/dist/checks/conclusion-verification-ratio.d.ts +37 -0
- package/dist/checks/conclusion-verification-ratio.js +86 -0
- package/dist/checks/fact-vs-agreement.d.ts +47 -0
- package/dist/checks/fact-vs-agreement.js +92 -0
- package/dist/checks/self-score-deflation.d.ts +38 -0
- package/dist/checks/self-score-deflation.js +108 -0
- package/dist/cli.js +158 -6
- package/dist/core/auto-compound-runner.js +85 -13
- package/dist/core/dashboard.js +9 -2
- package/dist/core/doctor.js +90 -15
- package/dist/core/extraction-notice.d.ts +18 -0
- package/dist/core/extraction-notice.js +64 -0
- package/dist/core/init-cli.d.ts +26 -0
- package/dist/core/init-cli.js +104 -0
- package/dist/core/init.js +17 -0
- package/dist/core/inspect-cli.js +64 -5
- package/dist/core/migrate-cli.d.ts +10 -0
- package/dist/core/migrate-cli.js +34 -0
- package/dist/core/paths.d.ts +8 -1
- package/dist/core/paths.js +11 -2
- package/dist/core/recall-cli.d.ts +26 -0
- package/dist/core/recall-cli.js +125 -0
- package/dist/core/recall-reference-detector.d.ts +43 -0
- package/dist/core/recall-reference-detector.js +65 -0
- package/dist/core/state-gc.d.ts +19 -0
- package/dist/core/state-gc.js +48 -4
- package/dist/core/stats-cli.d.ts +36 -0
- package/dist/core/stats-cli.js +254 -0
- package/dist/core/uninstall.d.ts +1 -0
- package/dist/core/uninstall.js +25 -1
- package/dist/core/v1-bootstrap.js +9 -1
- package/dist/engine/classify-enforce-cli.d.ts +8 -0
- package/dist/engine/classify-enforce-cli.js +61 -0
- package/dist/engine/compound-cli.js +1 -0
- package/dist/engine/compound-export.js +8 -3
- package/dist/engine/enforce-classifier.d.ts +31 -0
- package/dist/engine/enforce-classifier.js +123 -0
- package/dist/engine/learn-cli.js +1 -4
- package/dist/engine/lifecycle/bypass-detector.d.ts +34 -0
- package/dist/engine/lifecycle/bypass-detector.js +82 -0
- package/dist/engine/lifecycle/lifecycle-cli.d.ts +7 -0
- package/dist/engine/lifecycle/lifecycle-cli.js +102 -0
- package/dist/engine/lifecycle/meta-cli.d.ts +4 -0
- package/dist/engine/lifecycle/meta-cli.js +7 -0
- package/dist/engine/lifecycle/meta-reclassifier.d.ts +78 -0
- package/dist/engine/lifecycle/meta-reclassifier.js +351 -0
- package/dist/engine/lifecycle/orchestrator.d.ts +32 -0
- package/dist/engine/lifecycle/orchestrator.js +131 -0
- package/dist/engine/lifecycle/signals.d.ts +30 -0
- package/dist/engine/lifecycle/signals.js +142 -0
- package/dist/engine/lifecycle/trigger-t1-correction.d.ts +23 -0
- package/dist/engine/lifecycle/trigger-t1-correction.js +78 -0
- package/dist/engine/lifecycle/trigger-t2-violation.d.ts +18 -0
- package/dist/engine/lifecycle/trigger-t2-violation.js +42 -0
- package/dist/engine/lifecycle/trigger-t3-bypass.d.ts +17 -0
- package/dist/engine/lifecycle/trigger-t3-bypass.js +39 -0
- package/dist/engine/lifecycle/trigger-t4-decay.d.ts +18 -0
- package/dist/engine/lifecycle/trigger-t4-decay.js +40 -0
- package/dist/engine/lifecycle/trigger-t5-conflict.d.ts +16 -0
- package/dist/engine/lifecycle/trigger-t5-conflict.js +78 -0
- package/dist/engine/lifecycle/types.d.ts +52 -0
- package/dist/engine/lifecycle/types.js +7 -0
- package/dist/engine/meta-learning/session-quality-scorer.d.ts +1 -6
- package/dist/engine/meta-learning/session-quality-scorer.js +2 -21
- package/dist/engine/rule-toggle-cli.d.ts +13 -0
- package/dist/engine/rule-toggle-cli.js +76 -0
- package/dist/engine/skill-promoter.js +3 -6
- package/dist/forge/evidence-processor.js +10 -2
- package/dist/hooks/context-guard.js +72 -1
- package/dist/hooks/dangerous-patterns.json +3 -3
- package/dist/hooks/db-guard.js +18 -2
- package/dist/hooks/intent-classifier.js +1 -1
- package/dist/hooks/keyword-detector.js +1 -1
- package/dist/hooks/notepad-injector.js +1 -1
- package/dist/hooks/permission-handler.js +1 -1
- package/dist/hooks/post-tool-failure.js +1 -1
- package/dist/hooks/post-tool-use.d.ts +6 -0
- package/dist/hooks/post-tool-use.js +94 -14
- package/dist/hooks/pre-compact.js +1 -1
- package/dist/hooks/pre-tool-use.d.ts +7 -0
- package/dist/hooks/pre-tool-use.js +79 -5
- package/dist/hooks/rate-limiter.js +1 -1
- package/dist/hooks/secret-filter.d.ts +10 -0
- package/dist/hooks/secret-filter.js +21 -1
- package/dist/hooks/session-recovery.js +1 -1
- package/dist/hooks/shared/atomic-write.d.ts +8 -1
- package/dist/hooks/shared/atomic-write.js +17 -3
- package/dist/hooks/shared/command-parser.d.ts +44 -0
- package/dist/hooks/shared/command-parser.js +50 -0
- package/dist/hooks/shared/hook-response.d.ts +23 -2
- package/dist/hooks/shared/hook-response.js +48 -3
- package/dist/hooks/shared/safe-regex.d.ts +25 -0
- package/dist/hooks/shared/safe-regex.js +50 -0
- package/dist/hooks/shared/stop-triggers.d.ts +19 -0
- package/dist/hooks/shared/stop-triggers.js +19 -0
- package/dist/hooks/skill-injector.js +1 -1
- package/dist/hooks/slop-detector.js +2 -2
- package/dist/hooks/solution-injector.d.ts +9 -0
- package/dist/hooks/solution-injector.js +48 -5
- package/dist/hooks/stop-guard.d.ts +84 -0
- package/dist/hooks/stop-guard.js +606 -0
- package/dist/hooks/subagent-tracker.js +1 -1
- package/dist/i18n/index.js +3 -5
- package/dist/mcp/tools.js +19 -2
- package/dist/store/evidence-store.d.ts +15 -0
- package/dist/store/evidence-store.js +61 -1
- package/dist/store/implicit-feedback-store.d.ts +59 -0
- package/dist/store/implicit-feedback-store.js +153 -0
- package/dist/store/rule-lifecycle.d.ts +23 -0
- package/dist/store/rule-lifecycle.js +63 -0
- package/dist/store/rule-store.d.ts +21 -0
- package/dist/store/rule-store.js +136 -8
- package/dist/store/types.d.ts +83 -0
- package/dist/store/types.js +7 -1
- package/hooks/hook-registry.json +1 -0
- package/hooks/hooks.json +6 -1
- package/package.json +11 -3
- package/plugin.json +1 -1
package/CHANGELOG.md
CHANGED
|
@@ -5,6 +5,100 @@ All notable changes to forgen will be documented in this file.
|
|
|
5
5
|
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
|
|
6
6
|
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
|
7
7
|
|
|
8
|
+
## [0.4.1] - 2026-04-24
|
|
9
|
+
|
|
10
|
+
### v0.4.1 — 하네스가 당신을 담고 간다
|
|
11
|
+
|
|
12
|
+
v0.4.0 이 Trust Layer (Claude 가 "완료"라고 하면 forgen 이 증명하게 한다) 를 세웠다면, v0.4.1 은 **구매자가 첫 block 을 즉시 경험**하고 **README 만 봐도 "하네스가 당신을 담고 간다" 는 비전을 이해**하게 만드는 릴리스. 내부적으로는 직전 회차에서 scope-out 했던 측정 갭 3건을 정직하게 재평가 → 실제로 닫았고, 10 시나리오 실 Claude signal 누적으로 검증.
|
|
13
|
+
|
|
14
|
+
**3개 구조적 갭 마감** (`feat`)
|
|
15
|
+
- `recall_referenced` 측정: name literal 단일 매칭 → identifier / 복합 태그 2개 교차 fallback 추가. Claude 가 slug 이름 대신 solution content 만 인용해도 잡힘. 일반 단어 단독은 false-positive 방지 위해 제외.
|
|
16
|
+
- `Rule.lifecycle` 자동 초기화: `saveRule` 이 lifecycle 없으면 `phase='active'` + counters=0 주입. suppressed rule 의 audit trail (누가 언제 왜) 추적 가능.
|
|
17
|
+
- `forgen compound list` 출력: `inj: / ref: / neg:` 컬럼에 "ref 측정은 v0.4.1+ 부터 시작됨" 주석. legacy 데이터 `ref:0` 을 "도움 안 됨" 으로 오독하는 혼란 해소.
|
|
18
|
+
|
|
19
|
+
**README 리프레시** (`docs`)
|
|
20
|
+
- **The harness carries you 섹션 신설**: "대화 → 추출 → 주입 → 반복" cycle + `forgen compound export/import` (개인 철학 번들을 tar.gz 로 이식) 연결. forgen 을 "rule 주입 tool" 로 축소 해석하는 오독 해소.
|
|
21
|
+
- 한국어/일본어/중국어 README 동기화 ("하네스가 당신을 담고 간다" / "ハーネスがあなたを運ぶ" / "这个 harness 装载的是你").
|
|
22
|
+
- **The first block 데모 일반화**: forgen-repo 전용 L1 rule → v0.4.1 내장 `builtin:self-score-inflation` (TEST-2) 기반 일반 시나리오로 교체. 구매자가 rule 작성 없이도 첫 block 을 바로 체감.
|
|
23
|
+
- Commands 섹션에 `forgen recall` / `forgen migrate` / `forgen init` 추가.
|
|
24
|
+
- Cold-start boost 설명: champion/active 솔루션 < 5 이면 `MIN_INJECT_RELEVANCE` 0.3 → 0.2 완화.
|
|
25
|
+
|
|
26
|
+
**natural-accumulation e2e** (`test`)
|
|
27
|
+
- 격리 `FORGEN_HOME` + 실 Claude API 10 시나리오로 "시간이 답" 이던 open signal 4종을 수십 분 내 실증:
|
|
28
|
+
- `recall_referenced` 0 → 2 (33% 참조율)
|
|
29
|
+
- `recommendation_surfaced` 1 → 6 (60% 주입률, cold-start boost 실효 확인)
|
|
30
|
+
- TEST-2 자연 block 0 → 3
|
|
31
|
+
- TEST-3 자연 block 0 → 2
|
|
32
|
+
- hook-errors 0 (전 경로 clean)
|
|
33
|
+
|
|
34
|
+
**회귀**: vitest 2114/2114 (신규 테스트 4건 포함). typecheck 0.
|
|
35
|
+
|
|
36
|
+
---
|
|
37
|
+
|
|
38
|
+
## [0.4.0] - 2026-04-23
|
|
39
|
+
|
|
40
|
+
### v0.4.0 — The Trust Layer
|
|
41
|
+
|
|
42
|
+
**When Claude says "done", forgen makes it prove it.** v0.4.0 adds turn-level self-verification at the Stop hook: Claude's completion claims get checked against rules you define, and blocks are fed back as `reason` that Claude reads and complies with on the next turn — **zero extra API calls**. Verified end-to-end on 10 scenarios at $1.74 total ([A1 spike report](docs/spike/mech-b-a1-verification-report.md)).
|
|
43
|
+
|
|
44
|
+
Built on top of the v0.3.x personalization core (4-axis profile + compound knowledge + rule rendering): the Trust Layer = 3-axis enforcement (Mech-A/B/C) + rule lifecycle (T1-T5 + Meta) + release self-gate. Interview rounds 9~10 mission "forgen 이 자기 규칙을 forgen 자신에게 강제 적용" achieved — this repo's own development was stopped mid-commit by its own L1 rule when the maintainer claimed "완결" without running Docker e2e (evidence preserved in `.forgen/state/enforcement/violations.jsonl`).
|
|
45
|
+
|
|
46
|
+
**ADR-001 — Mech-A/B/C 3축 강제 메커니즘** (Accepted)
|
|
47
|
+
- Mech-A (hook-BLOCK): 기계 판정 가능 규칙 — PreToolUse deny / Stop artifact_check.
|
|
48
|
+
- Mech-B (self-check prompt-inject): 자연어 판정 규칙 — Stop hook `decision:"block"` + `reason` 으로 Claude 자가점검 강제. **β1 ($0): 외부 LLM 호출 없음.**
|
|
49
|
+
- Mech-C (drift-measure): 정량 판정 불가 규칙 — 장기 편향만 측정.
|
|
50
|
+
- A1 검증 스파이크 10/10 PASS ($1.74, 3분): block 수용률 1.00, FP 0.00, hook p95 7ms, 추가 API 0.
|
|
51
|
+
|
|
52
|
+
**신규 타입** (`src/store/types.ts`):
|
|
53
|
+
- `EnforcementMech`, `HookPoint`, `VerifierSpec`, `EnforceSpec` — `Rule.enforce_via` 에 붙음 (optional, 기존 rule 하위 호환).
|
|
54
|
+
- `EnforceSpec.trigger_keywords_regex` / `trigger_exclude_regex` / `system_tag` — Stop hook 전용 발화 조건.
|
|
55
|
+
- `LifecyclePhase`, `LifecycleState`, `MetaPromotion` — `Rule.lifecycle` 에 붙음.
|
|
56
|
+
|
|
57
|
+
**신규 Hook**:
|
|
58
|
+
- `src/hooks/stop-guard.ts` — Stop hook. Production `rulesFromStore(loadActiveRules())` 로드, spike scenarios.json fallback. `last_assistant_message` 직접 read. stuck-loop guard (threshold 3 초과 시 force approve + drift 이벤트). `compoundCritical: true` (보호 hook).
|
|
59
|
+
- `src/hooks/shared/hook-response.ts::blockStop(reason, systemMessage?)` helper.
|
|
60
|
+
|
|
61
|
+
**ADR-002 — Rule Lifecycle Engine** (Accepted)
|
|
62
|
+
- T1 explicit_correction: `evidence-store.appendEvidence` → `detectT1` → rule retire/supersede/flag. axis + render_key 이중 매칭으로 FP 차단.
|
|
63
|
+
- T2 repeated_violation: 30d rolling window `violations_30d ≥ 3 AND rate > 0.3` → flag. 데이터 소스: `~/.forgen/state/enforcement/violations.jsonl` (stop-guard block 시 자동 append).
|
|
64
|
+
- T3 user_bypass: `post-tool-use` 에서 `bypass-detector.scanForBypass` → `recordBypass`. 7d `bypass_count ≥ 5` → suppress.
|
|
65
|
+
- T4 time_decay: `state-gc.runDailyT4Decay` — `forgen doctor --prune-state` 실행 시 90d 미주입 rule retire (별도 scheduler 없음; 사용자가 명시적으로 실행).
|
|
66
|
+
- T5 conflict_detected: `rule-store.appendRule` 에서 T5 감지, 양쪽 `conflict_refs` 기록.
|
|
67
|
+
- Meta 양방향: drift.jsonl 누적 → 강등 (A→B→C); rolling 20 injects + 0 violations → 승급 (B→A, C→B).
|
|
68
|
+
- Orchestrator `applyEvent` / `foldEvents` pure — rule 파일 쓰기는 호출자 책임.
|
|
69
|
+
- `src/store/rule-store.ts::markRulesInjected` — `v1-bootstrap` 이 renderRules 후 호출해 Meta 롤링 카운터를 채움.
|
|
70
|
+
|
|
71
|
+
**ADR-003 — Release Self-Gate** (Accepted)
|
|
72
|
+
- `.github/workflows/self-gate.yml` — push main / PR main / tag v* 에서 3-stage 검증.
|
|
73
|
+
- `scripts/self-gate.cjs` — 정적 스캔: mock-in-production, secrets-leak (AWS 공식 EXAMPLE fixture allow list), enforce_via-missing, release-artifact.
|
|
74
|
+
- `scripts/self-gate-runtime.cjs` — 6 hook 시나리오 smoke (완료 선언 block / retraction approve / shipped block / mock-context approve 등). 격리 HOME.
|
|
75
|
+
- `scripts/self-gate-release.cjs` — tag-only: version/tag match, CHANGELOG section, dist freshness, .forgen-release/e2e-report.json.
|
|
76
|
+
|
|
77
|
+
**신규 CLI**:
|
|
78
|
+
- `forgen rule <list|suppress|activate|scan|health-scan|classify>` — 규칙 관리 네임스페이스. 기존 플랫 커맨드(suppress-rule, activate-rule, lifecycle-scan, rule-meta-scan, classify-enforce)는 하위 호환 alias 로 유지.
|
|
79
|
+
- `forgen stats` — 한 화면 대시보드 (active rules, corrections, blocks/bypass/drift 7d, retired 7d, last extraction). 기존 jsonl 집계; 신규 telemetry 없음.
|
|
80
|
+
- `forgen inspect corrections` — `forgen inspect evidence` 의 사용자 친화 이름. evidence 는 alias 로 유지.
|
|
81
|
+
- `forgen last-block` — 가장 최근 block 이벤트 상세.
|
|
82
|
+
|
|
83
|
+
**내부 CLI (하위 호환, alias 로 유지)**:
|
|
84
|
+
- `forgen classify-enforce [--apply] [--force]` → `forgen rule classify`.
|
|
85
|
+
- `forgen rule-meta-scan [--apply]` → `forgen rule health-scan`.
|
|
86
|
+
- `forgen lifecycle-scan [--apply]` → `forgen rule scan`.
|
|
87
|
+
|
|
88
|
+
**Upgrade notes (v0.3.x → v0.4.0)**:
|
|
89
|
+
- 첫 `forgen` 실행 시 기존 rule 파일 (`~/.forgen/me/rules/*.json`) 에 `lifecycle` 블록이 자동 주입됩니다 (inject_count, phase='active' 등). 기존 필드는 보존됨.
|
|
90
|
+
- 프로젝트 로컬 `.forgen/rules/*.json` 이 자동 로드됩니다 (runtime 병합). 팀 dogfood 경로. 테스트/격리 필요 시 `FORGEN_DISABLE_PROJECT_RULES=1`.
|
|
91
|
+
- `FORGEN_USER_CONFIRMED=1` 으로 Mech-A PreToolUse 우회 시 violations.jsonl 에 `kind:'correction'` audit 엔트리 기록.
|
|
92
|
+
|
|
93
|
+
**운영 지표**:
|
|
94
|
+
- 전체 회귀 1973/1973 pass (169 files), TypeScript clean.
|
|
95
|
+
- forgen doctor 20/20 hooks active; legacy `~/.forgen/rules/` orphan 감지 추가.
|
|
96
|
+
- Self-gate 정적 ✓ + 런타임 7/7 (SG-ACK round-trip 포함); Docker e2e 77/77 (Phase 9 R9 전체 검증).
|
|
97
|
+
- `npm pack` tarball 539.3 kB, 332 파일, v0.4.0 메타데이터 확인.
|
|
98
|
+
|
|
99
|
+
**관측성**:
|
|
100
|
+
- `acknowledgments.jsonl` 신규 — Mech-B block → retract → pass 루프가 실제 작동한 세션 기록. `forgen stats` 의 `X% acknowledged` 라벨로 집계.
|
|
101
|
+
|
|
8
102
|
## [0.3.2] - 2026-04-21
|
|
9
103
|
|
|
10
104
|
### Security — Audit findings landed
|
package/README.ja.md
CHANGED
|
@@ -3,18 +3,19 @@
|
|
|
3
3
|
</p>
|
|
4
4
|
|
|
5
5
|
<p align="center">
|
|
6
|
-
<strong>Claude
|
|
7
|
-
|
|
6
|
+
<strong>Claude が「完了しました」と言ったら、forgen がそれを証明させる。</strong><br/>
|
|
7
|
+
ターンレベルの自己検証 + パーソナライズドルール、<strong>追加 API コスト $0</strong>。
|
|
8
8
|
</p>
|
|
9
9
|
|
|
10
10
|
<p align="center">
|
|
11
|
-
<a href="https://www.npmjs.com/package
|
|
11
|
+
<a href="https://www.npmjs.com/package/@wooojin/forgen"><img src="https://img.shields.io/npm/v/@wooojin/forgen.svg" alt="npm version"/></a>
|
|
12
12
|
<a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"/></a>
|
|
13
13
|
<a href="https://nodejs.org"><img src="https://img.shields.io/badge/node-%3E%3D20-brightgreen.svg" alt="Node.js >= 20"/></a>
|
|
14
14
|
</p>
|
|
15
15
|
|
|
16
16
|
<p align="center">
|
|
17
|
-
<a href="
|
|
17
|
+
<a href="#最初のブロック-30秒">最初のブロック</a> ·
|
|
18
|
+
<a href="#ハーネスがあなたを運ぶ">ビジョン</a> ·
|
|
18
19
|
<a href="#クイックスタート">クイックスタート</a> ·
|
|
19
20
|
<a href="#仕組み">仕組み</a> ·
|
|
20
21
|
<a href="#4軸パーソナライゼーション">4軸</a> ·
|
|
@@ -32,8 +33,73 @@
|
|
|
32
33
|
|
|
33
34
|
---
|
|
34
35
|
|
|
36
|
+
## 最初のブロック (30秒)
|
|
37
|
+
|
|
38
|
+
あなたは何度も騙されてきました。Claude は「テスト通過、実装完了」と言う — 実際に動かすと — 動かない。forgen はそのギャップを塞ぎます。
|
|
39
|
+
|
|
40
|
+
```
|
|
41
|
+
You: 「ログインハンドラを実装してください。」
|
|
42
|
+
Claude: ...編集中...
|
|
43
|
+
Claude: "구현 완료했습니다。"
|
|
44
|
+
|
|
45
|
+
[forgen:stop-guard/L1-e2e-before-done]
|
|
46
|
+
Docker e2e 証拠 (~/.forgen/state/e2e-result.json, 1時間以内) がありません。
|
|
47
|
+
実行してから再応答してください。
|
|
48
|
+
|
|
49
|
+
Claude: 「完了宣言を取り消します。証拠ファイルがありません。e2e を先に実行します...」
|
|
50
|
+
...bash tests/e2e/docker/run-test.sh 実行...
|
|
51
|
+
「63/63 通過。구현 완료했습니다。」
|
|
52
|
+
|
|
53
|
+
[forgen] ✓ approved
|
|
54
|
+
```
|
|
55
|
+
|
|
56
|
+
**何が起きたか**: Claude の Stop フックが、あなたが定義したルール (`L1-e2e-before-done`) によってブロックされました。Claude はブロック `reason` を読み、早すぎた完了宣言を撤回し、証拠を生成して再提出しました。**追加 API コール 0** — Claude が元々生成する予定だった同じセッションのターン内で全て起きました。
|
|
57
|
+
|
|
58
|
+
これが **Mech-B セルフチェックプロンプトインジェクト** です。Claude Code の Stop フックが `decision: "block"` + `reason` を受け入れ、Claude が次のターンでその reason を入力として読むから成り立ちます。10 シナリオ $1.74 で end-to-end 検証済み ([A1 spike report](docs/spike/mech-b-a1-verification-report.md))。
|
|
59
|
+
|
|
60
|
+
🎬 **動作を見る** (27秒):
|
|
61
|
+
|
|
62
|
+
```bash
|
|
63
|
+
# 実際のフック・実際のルール・実際の block/approve サイクルをライブで
|
|
64
|
+
bash docs/demo/mech-b-demo.sh
|
|
65
|
+
|
|
66
|
+
# または録画済み asciinema キャストを再生
|
|
67
|
+
asciinema play docs/demo/mech-b-block-unblock.cast
|
|
68
|
+
```
|
|
69
|
+
|
|
70
|
+
デモの「本物/シミュレーション」の区別は [`docs/demo/README.md`](docs/demo/README.md) を参照。
|
|
71
|
+
|
|
72
|
+
---
|
|
73
|
+
|
|
35
74
|
## 2人の開発者。同じ Claude。まったく異なる振る舞い。
|
|
36
75
|
|
|
76
|
+
上記の Trust Layer は柱の1つです。もう1つはパーソナライゼーション — 最初のブロック後に forgen を使い続ける理由です。
|
|
77
|
+
|
|
78
|
+
---
|
|
79
|
+
|
|
80
|
+
## ハーネスがあなたを運ぶ
|
|
81
|
+
|
|
82
|
+
パーソナライゼーションは表面です。より深いアイデア: **すべてのセッションが痕跡を残し、その痕跡が蓄積されてあなたのように判断するハーネスになります。** 修正、規約、トレードオフの好み — 会話から抽出され、`~/.forgen/me/` に保存され、次のセッションで Claude に再注入されます。
|
|
83
|
+
|
|
84
|
+
```
|
|
85
|
+
会話 ──► 抽出: solution / rule / behavior / profile 更新
|
|
86
|
+
────────────────────────────────────────────
|
|
87
|
+
│
|
|
88
|
+
▼
|
|
89
|
+
次のセッション ◄── 注入: UserPromptSubmit コンテキスト + レンダリングされた
|
|
90
|
+
ルール + あなたの基準に合わせた Stop-hook ガード
|
|
91
|
+
```
|
|
92
|
+
|
|
93
|
+
数週間経つと、このハーネスは「ルールを強制するツール」ではなく、**あなたが仕事を判断する方法が詰まった持ち運べるバンドル** になります。一行で export:
|
|
94
|
+
|
|
95
|
+
```bash
|
|
96
|
+
forgen compound export # → forgen-knowledge-YYYY-MM-DD.tar.gz
|
|
97
|
+
# (rules + solutions + behavior — あなたの哲学)
|
|
98
|
+
forgen compound import <path> # 別のマシンで再現
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
これが北極星: *ノートパソコン上で、あなたのように判断する Claude と、持ち運べる tarball.*
|
|
102
|
+
|
|
37
103
|
開発者 A は慎重派です。Claude にすべてのテストを実行させ、理由を説明させ、現在のファイル以外を変更する前に必ず確認を求めます。
|
|
38
104
|
|
|
39
105
|
開発者 B はスピード重視です。Claude に前提を置いて判断させ、関連ファイルも直接修正させ、結果を2行で報告させます。
|
|
@@ -57,7 +123,7 @@ forgen がこれを実現します。作業スタイルをプロファイリン
|
|
|
57
123
|
### 初回実行(1回のみ、約1分)
|
|
58
124
|
|
|
59
125
|
```bash
|
|
60
|
-
npm install -g /forgen
|
|
126
|
+
npm install -g @wooojin/forgen
|
|
61
127
|
forgen
|
|
62
128
|
```
|
|
63
129
|
|
|
@@ -117,7 +183,7 @@ Claude が `correction-record` MCP ツールを呼び出します。修正は、
|
|
|
117
183
|
|
|
118
184
|
```bash
|
|
119
185
|
# 1. インストール
|
|
120
|
-
npm install -g /forgen
|
|
186
|
+
npm install -g @wooojin/forgen
|
|
121
187
|
|
|
122
188
|
# 2. 初回実行 — 4問オンボーディング(英語/韓国語選択)
|
|
123
189
|
forgen
|
|
@@ -131,7 +197,38 @@ forgen
|
|
|
131
197
|
- **Node.js** >= 20(SQLite セッション検索には >= 22 を推奨)
|
|
132
198
|
- **Claude Code** インストール・認証済み(`npm i -g @anthropic-ai/claude-code`)
|
|
133
199
|
|
|
134
|
-
> **ベンダー依存:** forgen は Claude Code をラップします。Anthropic API または Claude Code の変更が動作に影響する可能性があります。Claude Code 1.0.x
|
|
200
|
+
> **ベンダー依存:** forgen は Claude Code をラップします。Anthropic API または Claude Code の変更が動作に影響する可能性があります。Claude Code 1.0.x / 2.1.x でテスト済み。
|
|
201
|
+
|
|
202
|
+
### 隔離 / CI / Docker での利用
|
|
203
|
+
|
|
204
|
+
forgen のホームはデフォルト `~/.forgen` ですが、プロセス毎に上書き可能:
|
|
205
|
+
|
|
206
|
+
```bash
|
|
207
|
+
# クリーンな隔離ホーム — 実 ~/.forgen は触らない
|
|
208
|
+
FORGEN_HOME=/tmp/forgen-clean forgen init # starter-pack 15件を自動配置
|
|
209
|
+
FORGEN_HOME=/tmp/forgen-clean forgen stats # 隔離ホームの統計のみ表示
|
|
210
|
+
FORGEN_HOME=/tmp/forgen-clean claude -p "…" # フックが env を継承 → ログ隔離
|
|
211
|
+
```
|
|
212
|
+
|
|
213
|
+
Claude Code のフックプロセスは親 env を継承するため、`FORGEN_HOME=...` プレ
|
|
214
|
+
フィックス一つで全状態 (rules/solutions/behavior/enforcement) がそのディレ
|
|
215
|
+
クトリに隔離されます。用途:
|
|
216
|
+
|
|
217
|
+
- CI パイプラインで固定シードに対する forgen 検証
|
|
218
|
+
- 実ホーム汚染なしの「新規ユーザー初日体験」再現
|
|
219
|
+
- 1 台のマシンで複数ペルソナ運用
|
|
220
|
+
|
|
221
|
+
**Docker / リモートサーバー (OAuth 制約):** Claude Code は OAuth セッションを
|
|
222
|
+
**OS キーチェーン** (macOS Keychain / libsecret / Windows Credential Manager)
|
|
223
|
+
に保存するため、新規 Linux コンテナで `~/.claude.json` のみマウントしても
|
|
224
|
+
refresh トークンが無く認証できません。コンテナ環境では `ANTHROPIC_API_KEY`
|
|
225
|
+
env を使ってください。ホスト環境 (macOS/Linux ワークステーション) は通常の
|
|
226
|
+
`claude login` フローで動作 — API キー不要。
|
|
227
|
+
|
|
228
|
+
### マイグレーション
|
|
229
|
+
|
|
230
|
+
`forgen migrate implicit-feedback` — pre-v0.4.1 ログの `category` フィールド
|
|
231
|
+
バックフィル。冪等 (idempotent) — 複数回実行しても安全。
|
|
135
232
|
|
|
136
233
|
---
|
|
137
234
|
|
|
@@ -403,13 +500,27 @@ forgen forge --export # プロファイルをエクスポート
|
|
|
403
500
|
### 状態確認
|
|
404
501
|
|
|
405
502
|
```bash
|
|
503
|
+
forgen stats # 1画面 Trust Layer ダッシュボード (ルール・修正・block 7日)
|
|
504
|
+
forgen last-block # 直近のブロックイベント詳細
|
|
406
505
|
forgen inspect profile # 4軸プロファイル + パック + ファセット
|
|
407
506
|
forgen inspect rules # アクティブ/抑制されたルール
|
|
408
|
-
forgen inspect
|
|
507
|
+
forgen inspect corrections # 修正履歴 (alias: evidence)
|
|
409
508
|
forgen inspect session # 現在のセッション状態
|
|
509
|
+
forgen inspect violations # 最近のブロック記録 (--last N)
|
|
410
510
|
forgen me # パーソナルダッシュボード(inspect profile のショートカット)
|
|
411
511
|
```
|
|
412
512
|
|
|
513
|
+
### ルール管理
|
|
514
|
+
|
|
515
|
+
```bash
|
|
516
|
+
forgen rule list # アクティブ + suppressed ルール一覧
|
|
517
|
+
forgen rule suppress <id> # ルールを無効化 (hard ルールは拒否)
|
|
518
|
+
forgen rule activate <id> # suppressed ルールを再アクティブ化
|
|
519
|
+
forgen rule scan [--apply] # ライフサイクルトリガー実行 (昇格/降格/引退)
|
|
520
|
+
forgen rule health-scan # drift → Mech 降格候補をスキャン
|
|
521
|
+
forgen rule classify # 既存ルールに enforce_via を自動提案
|
|
522
|
+
```
|
|
523
|
+
|
|
413
524
|
### 知識管理
|
|
414
525
|
|
|
415
526
|
```bash
|
package/README.ko.md
CHANGED
|
@@ -14,6 +14,7 @@
|
|
|
14
14
|
</p>
|
|
15
15
|
|
|
16
16
|
<p align="center">
|
|
17
|
+
<a href="#하네스가-당신을-담고-간다">비전</a> ·
|
|
17
18
|
<a href="#forgen를-쓰면-일어나는-일">동작 흐름</a> ·
|
|
18
19
|
<a href="#빠른-시작">빠른 시작</a> ·
|
|
19
20
|
<a href="#동작-방식">동작 방식</a> ·
|
|
@@ -52,6 +53,31 @@ forgen가 이것을 가능하게 합니다. 작업 스타일을 프로파일링
|
|
|
52
53
|
|
|
53
54
|
---
|
|
54
55
|
|
|
56
|
+
## 하네스가 당신을 담고 간다
|
|
57
|
+
|
|
58
|
+
개인화는 표면입니다. 더 깊은 아이디어: **매 세션이 흔적을 남기고, 그 흔적들이 쌓여 당신처럼 판단하는 하네스가 됩니다.** 교정, 컨벤션, 트레이드오프 선호 — 대화에서 추출되어 `~/.forgen/me/` 에 저장되고, 다음 세션마다 Claude 에 다시 주입됩니다.
|
|
59
|
+
|
|
60
|
+
```
|
|
61
|
+
대화 ──► 추출: solution / rule / behavior / profile 업데이트
|
|
62
|
+
──────────────────────────────────────────────
|
|
63
|
+
│
|
|
64
|
+
▼
|
|
65
|
+
다음 세션 ◄── 주입: UserPromptSubmit 컨텍스트 + 렌더된 규칙
|
|
66
|
+
+ 당신의 기준에 맞춘 Stop-hook 가드
|
|
67
|
+
```
|
|
68
|
+
|
|
69
|
+
몇 주가 지나면 이 하네스는 "규칙을 강제하는 도구"가 아니라 **당신이 일을 판단하는 방식이 담긴 휴대 가능한 번들** 이 됩니다. 한 줄로 export:
|
|
70
|
+
|
|
71
|
+
```bash
|
|
72
|
+
forgen compound export # → forgen-knowledge-YYYY-MM-DD.tar.gz
|
|
73
|
+
# (rules + solutions + behavior — 당신의 철학)
|
|
74
|
+
forgen compound import <path> # 다른 머신에서 그대로 재연
|
|
75
|
+
```
|
|
76
|
+
|
|
77
|
+
그게 북극성입니다: *노트북 위에 있는, 당신처럼 판단하는 Claude, 그리고 들고 다닐 수 있는 tarball.*
|
|
78
|
+
|
|
79
|
+
---
|
|
80
|
+
|
|
55
81
|
## forgen를 쓰면 일어나는 일
|
|
56
82
|
|
|
57
83
|
### 첫 실행 (1회, 약 1분)
|
|
@@ -131,7 +157,38 @@ forgen
|
|
|
131
157
|
- **Node.js** >= 20 (SQLite 세션 검색은 >= 22 권장)
|
|
132
158
|
- **Claude Code** 설치 및 인증 (`npm i -g @anthropic-ai/claude-code`)
|
|
133
159
|
|
|
134
|
-
> **벤더 의존성:** forgen은 Claude Code를 래핑합니다. Anthropic API 또는 Claude Code 변경이 동작에 영향을 줄 수 있습니다. Claude Code 1.0.x
|
|
160
|
+
> **벤더 의존성:** forgen은 Claude Code를 래핑합니다. Anthropic API 또는 Claude Code 변경이 동작에 영향을 줄 수 있습니다. Claude Code 1.0.x / 2.1.x 에서 테스트됨.
|
|
161
|
+
|
|
162
|
+
### 격리 / CI / Docker 사용
|
|
163
|
+
|
|
164
|
+
forgen 홈은 기본 `~/.forgen` 이지만 프로세스별 override 가능:
|
|
165
|
+
|
|
166
|
+
```bash
|
|
167
|
+
# 깨끗한 격리 홈 — 실제 ~/.forgen 은 건드리지 않음
|
|
168
|
+
FORGEN_HOME=/tmp/forgen-clean forgen init # starter-pack 15개 자동 배포
|
|
169
|
+
FORGEN_HOME=/tmp/forgen-clean forgen stats # 격리 홈의 통계만 표시
|
|
170
|
+
FORGEN_HOME=/tmp/forgen-clean claude -p "…" # 훅이 env 상속 → 격리된 로그
|
|
171
|
+
```
|
|
172
|
+
|
|
173
|
+
Claude Code 훅 프로세스가 부모 env 를 상속하므로 `FORGEN_HOME=...` 프리픽스
|
|
174
|
+
하나면 모든 상태(rules/solutions/behavior/enforcement)가 해당 디렉터리로 격리.
|
|
175
|
+
쓰임새:
|
|
176
|
+
|
|
177
|
+
- CI 파이프라인에서 고정 시드로 forgen 검증
|
|
178
|
+
- 실 홈 오염 없이 "신규 사용자 첫날 경험" 재현
|
|
179
|
+
- 한 머신에서 여러 페르소나 운영
|
|
180
|
+
|
|
181
|
+
**Docker / 원격 서버 (OAuth 제약):** Claude Code 는 OAuth 세션을 **OS 키체인**
|
|
182
|
+
(macOS Keychain / libsecret / Windows Credential Manager) 에 저장하므로, 새
|
|
183
|
+
Linux 컨테이너에서 `~/.claude.json` 만 마운트하면 refresh 토큰이 없어서 인증이
|
|
184
|
+
안 됩니다. 컨테이너 환경에서는 `ANTHROPIC_API_KEY` env 를 사용하세요. 호스트
|
|
185
|
+
기반 사용(macOS/Linux 워크스테이션) 은 `claude login` 흐름 그대로 동작 — API
|
|
186
|
+
키 불필요.
|
|
187
|
+
|
|
188
|
+
### 마이그레이션
|
|
189
|
+
|
|
190
|
+
`forgen migrate implicit-feedback` — pre-v0.4.1 로그의 `category` 필드 백필.
|
|
191
|
+
멱등(idempotent) — 여러 번 실행 안전.
|
|
135
192
|
|
|
136
193
|
---
|
|
137
194
|
|
|
@@ -414,13 +471,27 @@ forgen forge --export # 프로필 내보내기
|
|
|
414
471
|
### 상태 확인
|
|
415
472
|
|
|
416
473
|
```bash
|
|
474
|
+
forgen stats # 한 화면 Trust Layer 대시보드 (규칙·교정·block 7일)
|
|
475
|
+
forgen last-block # 가장 최근 block 이벤트와 rule 상세
|
|
417
476
|
forgen inspect profile # 4축 프로필 + 팩 + facet
|
|
418
477
|
forgen inspect rules # 활성/비활성 규칙
|
|
419
|
-
forgen inspect
|
|
478
|
+
forgen inspect corrections # 교정 기록 (alias: evidence)
|
|
420
479
|
forgen inspect session # 현재 세션 상태
|
|
480
|
+
forgen inspect violations # 최근 block 기록 (--last N)
|
|
421
481
|
forgen me # 개인 대시보드 (inspect profile 단축키)
|
|
422
482
|
```
|
|
423
483
|
|
|
484
|
+
### 규칙 관리
|
|
485
|
+
|
|
486
|
+
```bash
|
|
487
|
+
forgen rule list # 활성 + suppressed 규칙 목록
|
|
488
|
+
forgen rule suppress <id> # 규칙 비활성화 (hard 규칙은 거부)
|
|
489
|
+
forgen rule activate <id> # suppressed 규칙 재활성화
|
|
490
|
+
forgen rule scan [--apply] # 수명주기 트리거 실행 (승격/강등/은퇴)
|
|
491
|
+
forgen rule health-scan # drift → Mech 강등 후보 스캔
|
|
492
|
+
forgen rule classify # 레거시 규칙에 enforce_via 자동 제안
|
|
493
|
+
```
|
|
494
|
+
|
|
424
495
|
### 지식 관리
|
|
425
496
|
|
|
426
497
|
```bash
|
package/README.md
CHANGED
|
@@ -3,21 +3,22 @@
|
|
|
3
3
|
</p>
|
|
4
4
|
|
|
5
5
|
<p align="center">
|
|
6
|
-
<strong>
|
|
7
|
-
|
|
6
|
+
<strong>When Claude says "done", forgen makes it prove it.</strong><br/>
|
|
7
|
+
Turn-level self-verification + personalized rules, at <strong>$0 extra API cost</strong>.
|
|
8
8
|
</p>
|
|
9
9
|
|
|
10
10
|
<p align="center">
|
|
11
|
-
<a href="https://www.npmjs.com/package
|
|
11
|
+
<a href="https://www.npmjs.com/package/@wooojin/forgen"><img src="https://img.shields.io/npm/v/@wooojin/forgen.svg" alt="npm version"/></a>
|
|
12
12
|
<a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"/></a>
|
|
13
13
|
<a href="https://nodejs.org"><img src="https://img.shields.io/badge/node-%3E%3D20-brightgreen.svg" alt="Node.js >= 20"/></a>
|
|
14
14
|
</p>
|
|
15
15
|
|
|
16
16
|
<p align="center">
|
|
17
|
-
<a href="#
|
|
17
|
+
<a href="#the-first-block-30-seconds">First Block</a> ·
|
|
18
18
|
<a href="#quick-start">Quick Start</a> ·
|
|
19
19
|
<a href="#how-it-works">How It Works</a> ·
|
|
20
20
|
<a href="#4-axis-personalization">4-Axis</a> ·
|
|
21
|
+
<a href="#the-harness-carries-you">Harness Vision</a> ·
|
|
21
22
|
<a href="#commands">Commands</a> ·
|
|
22
23
|
<a href="#architecture">Architecture</a> ·
|
|
23
24
|
<a href="#safety">Safety</a>
|
|
@@ -32,8 +33,50 @@
|
|
|
32
33
|
|
|
33
34
|
---
|
|
34
35
|
|
|
36
|
+
## The first block (30 seconds)
|
|
37
|
+
|
|
38
|
+
You've been burned: Claude says "tests pass, implementation done" — you run it — it doesn't work. forgen closes that gap.
|
|
39
|
+
|
|
40
|
+
```
|
|
41
|
+
You: "Update the auth middleware."
|
|
42
|
+
Claude: ...makes edits to src/middleware/auth.ts...
|
|
43
|
+
Claude: "구현 완료. 신뢰도 95/100."
|
|
44
|
+
|
|
45
|
+
[forgen:stop-guard/builtin:self-score-inflation]
|
|
46
|
+
자가 점수 상승 선언 1건 (95/100). 측정 도구 호출 0회 — 숫자를 뒷받침할
|
|
47
|
+
실행/확인 증거 없음. 테스트/빌드/curl 실행 결과를 턴에 포함해 재응답.
|
|
48
|
+
|
|
49
|
+
Claude: "측정 없이 점수를 매겼습니다. 실 테스트부터 실행합니다..."
|
|
50
|
+
$ npm test
|
|
51
|
+
"31 passed / 0 failed. auth middleware 구현 완료."
|
|
52
|
+
|
|
53
|
+
[forgen] ✓ approved
|
|
54
|
+
```
|
|
55
|
+
|
|
56
|
+
**What just happened**: Claude's Stop hook detected a score claim (`95/100`) without any measurement tool call (`Bash` / `NotebookEdit`) in the turn — one of forgen's **three built-in meta guards** (TEST-1 fact vs agreement, **TEST-2 self-score inflation**, TEST-3 conclusion/verification ratio). Claude read the block `reason`, retracted, ran the real test, and re-submitted. **Zero extra API calls** — it all happened in the same session turn Claude was going to produce anyway.
|
|
57
|
+
|
|
58
|
+
The same mechanism also fires when Claude writes conclusions faster than evidence ("done. passed. shipped. verified." with no measurement context), or claims facts ("테스트가 통과합니다") without ever having executed them. You can also define **custom rules** (e.g. "require npm test evidence before saying 'done' in this repo") via `forgen compound --rule` — they slot into the same Stop-hook dispatcher.
|
|
59
|
+
|
|
60
|
+
This is **Mech-B self-check prompt-inject**. It works because Claude Code's Stop hook accepts `decision: "block"` + `reason`, and Claude in the next turn reads that reason as input. We verified it end-to-end on 10 scenarios at $1.74 total cost ([A1 spike report](docs/spike/mech-b-a1-verification-report.md)), and v0.4.1 added built-in guards so you get the first block **without writing any rule**.
|
|
61
|
+
|
|
62
|
+
🎬 **See it happen** (27 seconds):
|
|
63
|
+
|
|
64
|
+
```bash
|
|
65
|
+
# Watch the full loop live — actual hook, actual rule, actual block/approve cycle
|
|
66
|
+
bash docs/demo/mech-b-demo.sh
|
|
67
|
+
|
|
68
|
+
# Or replay the pre-recorded asciinema cast
|
|
69
|
+
asciinema play docs/demo/mech-b-block-unblock.cast
|
|
70
|
+
```
|
|
71
|
+
|
|
72
|
+
See [`docs/demo/README.md`](docs/demo/README.md) for what's real vs simulated in the demo.
|
|
73
|
+
|
|
74
|
+
---
|
|
75
|
+
|
|
35
76
|
## Two developers. Same Claude. Completely different behavior.
|
|
36
77
|
|
|
78
|
+
The Trust Layer above is one pillar. The other is personalization — still the reason you'd keep forgen around after the first block.
|
|
79
|
+
|
|
37
80
|
Developer A is careful. They want Claude to run all tests, explain reasoning, and ask before touching anything outside the current file.
|
|
38
81
|
|
|
39
82
|
Developer B moves fast. They want Claude to make assumptions, fix related files automatically, and report results in two lines.
|
|
@@ -48,7 +91,32 @@ fix the session handler? Here's timeout not covered. Done."
|
|
|
48
91
|
my analysis of each..."
|
|
49
92
|
```
|
|
50
93
|
|
|
51
|
-
Forgen
|
|
94
|
+
Forgen profiles your work style, learns from your corrections, and renders personalized rules that Claude follows every session.
|
|
95
|
+
|
|
96
|
+
---
|
|
97
|
+
|
|
98
|
+
## The harness carries you
|
|
99
|
+
|
|
100
|
+
Personalization is the surface. The deeper idea: **every session leaves a trace, and those traces compound into a harness that reasons like you do.** Your corrections, your conventions, your trade-off preferences — extracted from conversation, stored under `~/.forgen/me/`, and replayed back to Claude on every future session.
|
|
101
|
+
|
|
102
|
+
```
|
|
103
|
+
Conversation ──► Extracted: solution / rule / behavior / profile update
|
|
104
|
+
─────────────────────────────────────────────────────
|
|
105
|
+
│
|
|
106
|
+
▼
|
|
107
|
+
Next session ◄── Injected: UserPromptSubmit context + rendered rules
|
|
108
|
+
+ Stop-hook guards calibrated to *your* standards
|
|
109
|
+
```
|
|
110
|
+
|
|
111
|
+
After a few weeks this harness stops being "a tool that enforces rules" and starts being **a portable bundle of how you judge work**. One command exports it:
|
|
112
|
+
|
|
113
|
+
```bash
|
|
114
|
+
forgen compound export # → forgen-knowledge-YYYY-MM-DD.tar.gz
|
|
115
|
+
# (rules + solutions + behavior — your philosophy)
|
|
116
|
+
forgen compound import <path> # replay it on another machine
|
|
117
|
+
```
|
|
118
|
+
|
|
119
|
+
That's the north star: *a Claude on your laptop that judges like you do, and a tarball you can carry.*
|
|
52
120
|
|
|
53
121
|
---
|
|
54
122
|
|
|
@@ -131,7 +199,39 @@ forgen
|
|
|
131
199
|
- **Node.js** >= 20 (>= 22 recommended for SQLite session search)
|
|
132
200
|
- **Claude Code** installed and authenticated (`npm i -g @anthropic-ai/claude-code`)
|
|
133
201
|
|
|
134
|
-
> **Vendor dependency:** Forgen wraps Claude Code. Anthropic API or Claude Code changes may affect behavior. Tested with Claude Code 1.0.x.
|
|
202
|
+
> **Vendor dependency:** Forgen wraps Claude Code. Anthropic API or Claude Code changes may affect behavior. Tested with Claude Code 1.0.x / 2.1.x.
|
|
203
|
+
|
|
204
|
+
### Isolated / CI / Docker usage
|
|
205
|
+
|
|
206
|
+
Forgen's home directory is `~/.forgen` by default, but can be overridden per-process:
|
|
207
|
+
|
|
208
|
+
```bash
|
|
209
|
+
# Fresh isolated home — does NOT touch your real ~/.forgen
|
|
210
|
+
FORGEN_HOME=/tmp/forgen-clean forgen init # provisions 15-solution starter pack
|
|
211
|
+
FORGEN_HOME=/tmp/forgen-clean forgen stats # shows stats from the isolated home
|
|
212
|
+
FORGEN_HOME=/tmp/forgen-clean claude -p "..." # hooks inherit the env → isolated logs
|
|
213
|
+
```
|
|
214
|
+
|
|
215
|
+
Claude Code hook processes inherit the parent env, so any `claude` command
|
|
216
|
+
prefixed with `FORGEN_HOME=...` routes all state (rules, solutions, behavior,
|
|
217
|
+
enforcement logs) into that directory. Useful for:
|
|
218
|
+
|
|
219
|
+
- CI pipelines validating forgen against a pinned seed set
|
|
220
|
+
- Reproducing buyer-first-day experience without polluting your real home
|
|
221
|
+
- Running multiple personas on one machine
|
|
222
|
+
|
|
223
|
+
**Docker / remote servers (OAuth limitation):** Claude Code stores its OAuth
|
|
224
|
+
session in the **OS keychain** (macOS Keychain / libsecret / Windows Credential
|
|
225
|
+
Manager). Mounting `~/.claude.json` alone is **not enough** in a fresh Linux
|
|
226
|
+
container because the keychain-bound refresh is missing. For container use, set
|
|
227
|
+
`ANTHROPIC_API_KEY` in the container env instead. Host-native usage (macOS,
|
|
228
|
+
Linux workstations) works with the normal `claude login` flow — no API key
|
|
229
|
+
needed.
|
|
230
|
+
|
|
231
|
+
### Migrations
|
|
232
|
+
|
|
233
|
+
`forgen migrate implicit-feedback` backfills the `category` field on pre-v0.4.1
|
|
234
|
+
entries in `~/.forgen/state/implicit-feedback.jsonl`. Idempotent — safe to re-run.
|
|
135
235
|
|
|
136
236
|
---
|
|
137
237
|
|
|
@@ -243,7 +343,11 @@ Claude has your accumulated patterns in context while drafting the response.
|
|
|
243
343
|
|
|
244
344
|
Precision gates (v0.3.2+): matches below relevance 0.3 or with only a single
|
|
245
345
|
common-word tag overlap are filtered before injection so Claude's context
|
|
246
|
-
doesn't get polluted by low-signal hits.
|
|
346
|
+
doesn't get polluted by low-signal hits. **Cold-start boost (v0.4.1+)**: when
|
|
347
|
+
your outcome history has < 5 champion/active solutions (first days after
|
|
348
|
+
install), the injection threshold is relaxed to 0.2 so starter-pack solutions
|
|
349
|
+
can actually surface; once your own patterns accumulate the threshold returns
|
|
350
|
+
to the standard 0.3.
|
|
247
351
|
|
|
248
352
|
### 10 built-in skills
|
|
249
353
|
|
|
@@ -451,13 +555,29 @@ forgen forge --export # Export profile
|
|
|
451
555
|
### Inspection
|
|
452
556
|
|
|
453
557
|
```bash
|
|
558
|
+
forgen stats # Trust-layer dashboard (rules, corrections, blocks 7d, assist today, philosophy)
|
|
559
|
+
forgen recall [--limit N] [--show]
|
|
560
|
+
# Recent compound recalls surfaced to Claude (with body preview)
|
|
561
|
+
forgen last-block # Most recent block event with rule detail
|
|
454
562
|
forgen inspect profile # 4-axis profile with packs and facets
|
|
455
563
|
forgen inspect rules # Active and suppressed rules
|
|
456
|
-
forgen inspect
|
|
564
|
+
forgen inspect corrections # Correction history (alias: evidence)
|
|
457
565
|
forgen inspect session # Current session state
|
|
566
|
+
forgen inspect violations # Recent block events (--last N)
|
|
458
567
|
forgen me # Personal dashboard (shortcut for inspect profile)
|
|
459
568
|
```
|
|
460
569
|
|
|
570
|
+
### Rule management
|
|
571
|
+
|
|
572
|
+
```bash
|
|
573
|
+
forgen rule list # List active + suppressed rules
|
|
574
|
+
forgen rule suppress <id> # Disable a rule (hard rules refused)
|
|
575
|
+
forgen rule activate <id> # Re-activate a suppressed rule
|
|
576
|
+
forgen rule scan [--apply] # Run lifecycle triggers (promote/demote/retire)
|
|
577
|
+
forgen rule health-scan # Scan drift → Mech downgrade candidates
|
|
578
|
+
forgen rule classify # Propose enforce_via for legacy rules
|
|
579
|
+
```
|
|
580
|
+
|
|
461
581
|
### Knowledge management
|
|
462
582
|
|
|
463
583
|
```bash
|
|
@@ -476,8 +596,11 @@ forgen skill list # List promoted skills
|
|
|
476
596
|
### System
|
|
477
597
|
|
|
478
598
|
```bash
|
|
479
|
-
forgen init # Initialize project
|
|
599
|
+
forgen init # Initialize project (+ 15 starter-pack solutions)
|
|
600
|
+
forgen migrate [implicit-feedback|all]
|
|
601
|
+
# One-shot schema migrations (idempotent)
|
|
480
602
|
forgen doctor # System diagnostics (10 categories + harness maturity)
|
|
603
|
+
forgen doctor --prune-state # Daily hygiene: state GC + T4 rule decay (90d idle → retire)
|
|
481
604
|
forgen dashboard # Knowledge overview (6 sections)
|
|
482
605
|
forgen config hooks # View hook status + context budget
|
|
483
606
|
forgen config hooks --regenerate # Regenerate hooks
|
|
@@ -488,6 +611,37 @@ forgen notepad show # View session notepad
|
|
|
488
611
|
forgen uninstall # Remove forgen cleanly
|
|
489
612
|
```
|
|
490
613
|
|
|
614
|
+
### Rule lifecycle (v0.4.0, ADR-001/002)
|
|
615
|
+
|
|
616
|
+
```bash
|
|
617
|
+
forgen classify-enforce # Preview enforce_via proposals for existing rules
|
|
618
|
+
forgen classify-enforce --apply # Save proposed enforce_via (skips already-set rules)
|
|
619
|
+
forgen classify-enforce --apply --force # Overwrite existing enforce_via
|
|
620
|
+
forgen rule-meta-scan # Preview Mech demotion candidates (drift.jsonl → A→B→C)
|
|
621
|
+
forgen rule-meta-scan --apply # Persist demotions + meta_promotions history
|
|
622
|
+
forgen lifecycle-scan # Preview T2~T5 + Meta (T1 fires inline on correction, not via CLI)
|
|
623
|
+
forgen lifecycle-scan --apply # Apply all lifecycle state transitions
|
|
624
|
+
```
|
|
625
|
+
|
|
626
|
+
Rule enforcement is 3-axis (ADR-001):
|
|
627
|
+
- **Mech-A** (hook-BLOCK) — mechanical checks (`rm -rf`, artifact presence). Violation blocks immediately.
|
|
628
|
+
- **Mech-B** (self-check) — natural-language rules. Stop hook feeds self-check question back to Claude via `decision: "block"` + `reason`. Zero extra API cost.
|
|
629
|
+
- **Mech-C** (drift-measure) — long-term bias tracking only.
|
|
630
|
+
|
|
631
|
+
Rule lifecycle (ADR-002): rules auto-flag / suppress / retire / merge / supersede based on T1~T5 + Meta signals. Details in [docs/adr/ADR-002-rule-lifecycle-engine.md](docs/adr/ADR-002-rule-lifecycle-engine.md).
|
|
632
|
+
|
|
633
|
+
### Release self-gate (v0.4.0, ADR-003)
|
|
634
|
+
|
|
635
|
+
Three CI gates prove forgen does not violate its own L1 rules before release:
|
|
636
|
+
|
|
637
|
+
```bash
|
|
638
|
+
node scripts/self-gate.cjs # Static: mock-in-prod, secrets, enforce_via, release-artifact
|
|
639
|
+
node scripts/self-gate-runtime.cjs # Runtime smoke: 6 hook scenarios
|
|
640
|
+
node scripts/self-gate-release.cjs # Tag-only: version/tag/CHANGELOG/dist/e2e-report consistency
|
|
641
|
+
```
|
|
642
|
+
|
|
643
|
+
Triggered by `.github/workflows/self-gate.yml` on push main / PR main / tag v*. Dogfood opt-in: see [.forgen/README.md](.forgen/README.md).
|
|
644
|
+
|
|
491
645
|
### MCP tools (available to Claude during sessions)
|
|
492
646
|
|
|
493
647
|
| Tool | Purpose |
|