role-os 2.0.0 → 2.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.ja.md CHANGED
@@ -2,10 +2,8 @@
2
2
  <a href="README.md">English</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.pt-BR.md">Português (BR)</a>
3
3
  </p>
4
4
 
5
- # Role OS
6
-
7
5
  <p align="center">
8
- <img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="400">
6
+ <img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
9
7
  </p>
10
8
 
11
9
  <p align="center">
@@ -52,6 +50,35 @@ roleos start "something completely novel"
52
50
 
53
51
  このシステムは、常に適切な抽象レベルでタスクを実行させます。各レベルを選択した理由を説明し、代替案も提示します。
54
52
 
53
+ **実行を開始するコマンド:**
54
+
55
+ ```bash
56
+ roleos run "fix the crash in save handler"
57
+ # → Created run: run-1234
58
+ # → Entry: MISSION (bugfix)
59
+ # → Started step 0: Repo Researcher → diagnosis-report
60
+ # → Guidance: Required sections: entrypoints, module-map, build-test-commands
61
+
62
+ roleos next # Start the next step
63
+ roleos complete diagnosis.md # Complete the active step with artifact
64
+ roleos explain # Show full run state and guidance
65
+ roleos resume # Continue an interrupted run
66
+ roleos report # Generate completion report
67
+ roleos friction # Measure operator touches
68
+ ```
69
+
70
+ **問題発生時の対応:**
71
+
72
+ ```bash
73
+ roleos retry 0 # Retry a failed step
74
+ roleos reroute 1 "Frontend Developer" "UI bug" # Swap a role
75
+ roleos escalate "Test Engineer" "Repo Researcher" "missed edge case" "re-diagnose"
76
+ roleos block 2 "waiting for API spec"
77
+ roleos reopen 0 "found issue in review"
78
+ ```
79
+
80
+ 実行結果はディスクに保存されます(`.claude/runs/`)。そのため、中断されたセッションも正常に再開できます。各ステップには、オペレーター向けのガイダンスが含まれており、生成すべき内容、必要なセクション、および停止条件が示されています。
81
+
55
82
  **ルーティング後:**
56
83
 
57
84
  1. **各役割は成果物を生成:** 構造化された出力で、次の役割が理解しやすいように、証拠となる情報が含まれています。
@@ -97,19 +124,24 @@ Role OSは、Claudeプロジェクトのメモリと連携します。置き換
97
124
  npx role-os init
98
125
 
99
126
  # Describe what you need — Role OS picks the right level:
100
- roleos start "fix the crash in save handler"
127
+ roleos run "fix the crash in save handler"
128
+ # → Creates run, picks bugfix mission, starts first step with guidance
129
+
130
+ # Step through:
131
+ roleos next # Start next step
132
+ roleos complete artifact.md # Complete with artifact
133
+ roleos explain # Show full state
134
+ roleos report # Completion report
101
135
 
102
136
  # Or go manual:
137
+ roleos start "fix the crash" # Entry decision only (no run)
103
138
  roleos packet new feature
104
139
  roleos route .claude/packets/my-feature.md
105
140
  roleos review .claude/packets/my-feature.md accept
106
- roleos status
107
141
 
108
142
  # Explore missions and packs:
109
143
  roleos mission list
110
- roleos mission show bugfix
111
144
  roleos packs list
112
- roleos packs show feature
113
145
  ```
114
146
 
115
147
  ## Role OSを使用しない場合
@@ -148,6 +180,12 @@ Role OSは、構造が異なる2つのリポジトリで、3つの異なるテ
148
180
  - 同じトリートメントパッケージを使用。構造は異なり、リポジトリの内容も異なる(クリエイティブワークスペース vs ゲーム)。
149
181
  - トリートメントパッケージは移植可能。契約の変更は不要。
150
182
 
183
+ **理想的な実行例(MCPサーバーマーケットプレイスのトピック)**
184
+ - 9つの役割を持つ連携、並行して4人の分析者。相互に質問し、反論するグラフ。
185
+ - 4つの課題が提示され、3つの主張が絞り込まれ、1つが未解決。健全なプレッシャーがかかっていますが、行き詰まりはありません。
186
+ - 生成された成果物から、真実の要素への16以上の追跡リンク。
187
+ - 完全なトレーサビリティが証明されています:真実 → 要素 → 反論 → 統合 → 拡張 → 評価 → 生成 → 追跡
188
+
151
189
  ## 主要な特性
152
190
 
153
191
  これらは変更できません。変更によってこれらのいずれかが弱体化する場合は、却下してください。
@@ -166,7 +204,9 @@ role-os/
166
204
  src/
167
205
  entry.mjs ← Unified entry: mission → pack → free routing
168
206
  entry-cmd.mjs ← `roleos start` CLI command
169
- mission.mjs 6 named mission types (feature, bugfix, treatment, docs, security, research)
207
+ run.mjs Persistent run engine: create step pause resume → report
208
+ run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
209
+ mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
170
210
  mission-run.mjs ← Mission runner: create → step → complete → report
171
211
  mission-cmd.mjs ← `roleos mission` CLI commands
172
212
  route.mjs ← 31-role routing + dynamic chain builder
@@ -175,14 +215,17 @@ role-os/
175
215
  escalation.mjs ← Auto-routing for blocked/rejected/split
176
216
  evidence.mjs ← Structured evidence + role-aware requirements
177
217
  dispatch.mjs ← Runtime dispatch manifests for multi-claude
178
- artifacts.mjs ← 20 per-role artifact contracts + 7 pack handoffs
218
+ artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
179
219
  decompose.mjs ← Composite task detection + splitting
180
220
  composite.mjs ← Dependency-ordered execution + recovery
181
221
  replan.mjs ← Mid-run adaptive replanning
182
222
  calibration.mjs ← Outcome recording + weight tuning
183
223
  hooks.mjs ← 5 lifecycle hooks for runtime enforcement
184
224
  session.mjs ← Session scaffolding + doctor
185
- test/ 527 tests across 20 test files
225
+ brainstorm.mjs Evidence modes, request validation, finding/synthesis/judge schemas
226
+ brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
227
+ brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
228
+ test/ ← 894 tests across 30 test files
186
229
  starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
187
230
  ```
188
231
 
@@ -212,6 +255,8 @@ Role OSは、**ローカルでのみ**動作します。Markdownテンプレー
212
255
  | **Mission library** | 6つの名前付きミッション(新機能追加、バグ修正、改善、ドキュメントのリリース、セキュリティ強化、研究開発)。それぞれが、パッケージ、ロールチェーン、成果物の流れ、エスカレーションのブランチ、正直で部分的な定義を宣言します。6つすべてが試行錯誤され、強化されています。 | ✓ 完了 |
213
256
  | **Mission runner** | 実行を開始し、追跡された状態とともにステップを進め、正直なレポートで完了または失敗。ブロックされたステップの伝播、チェーンからの逸脱に関する警告、最後のステップの再開。 | ✓ 完了 |
214
257
  | **Unified entry** | `roleos start`は、ミッション、パッケージ、または自由ルーティングを自動的に決定します。信頼度スコア、代替案、および複合検出を備えたフォールバックシステム。 | ✓ 完了 |
258
+ | **Persistent runs** | `roleos run` コマンドは、ディスクに保存された実行結果を作成します。`resume`(再開)、`next`(次へ)、`explain`(説明)、`complete`(完了)、`fail`(失敗)。対応:`reroute`(リダイレクト)、`escalate`(エスカレーション)、`retry`(再試行)、`block`(ブロック)、`reopen`(再開)。各ステップにローカルなガイダンスがあります。摩擦の測定。 | ✓ 完了 |
259
+ | **Brainstorm** | 2層のアーキテクチャ:真実層(役割固有のスキーマ、トレーサビリティを持つ要素、相互質問と反論のグラフ)+ 生成層(5つの異なる声、禁止語、議論の記録)。追跡リンクは、生成されたすべての主張が、真実の要素に対応していることを証明します。理想的な実行例:894件のテスト。 | ✓ 完了 |
215
260
 
216
261
  ## 6つのミッション
217
262
 
@@ -223,23 +268,47 @@ Role OSは、**ローカルでのみ**動作します。Markdownテンプレー
223
268
  | `docs-release` | ドキュメント | 2 | ドキュメントの作成/更新、リリースノート |
224
269
  | `security-hardening` | セキュリティ | 4 | 脅威モデルの作成、監査、脆弱性の修正、再監査、検証 |
225
270
  | `research-launch` | 研究 | 4 | 問題の定義、調査、結果の文書化、決定 |
271
+ | `brainstorm` | ブレインストーミング | 9 | 追跡可能な意見の相違と結論を持つ、構造化された多角的な調査 |
226
272
 
227
273
  各ミッションには、正直で部分的な定義が含まれています。作業が停滞した場合、システムは完了した内容と残りの内容を記録し、進捗を偽装することはありません。
228
274
 
275
+ ### ブレインストーミングミッション
276
+
277
+ これは「AIによるブレインストーミング」ではありません。ブレインストーミングミッションは、**法に基づいて定義された役割であり、追跡可能な意見の相違と、結論を導き出すための出力を持つ**ものです。
278
+
279
+ ```bash
280
+ roleos run "explore product directions for a developer tool discovery platform"
281
+ # → MISSION: Brainstorm (Structured Inquiry)
282
+ # Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
283
+ ```
284
+
285
+ **何が違うのか:**
286
+
287
+ - **層1(真実):** 4人の分析者が、役割固有のスキーマ(コンテキストマップ、ユーザーバリューマップ、メカニズムマップ、ポジショニングマップ)を生成します。これは、共有された文章ではありません。各役割には、盲点防止機能が組み込まれており、禁止語、禁止される主張の種類、およびフィルタリングされた入力セクションがあります。要素には、トレーサビリティ情報が含まれています。方向性のある相互質問グラフにより、ターゲットを絞った課題が生成されます。元の分析者は、プレッシャーの下で、主張を擁護したり、絞り込んだり、撤回したりします。
288
+
289
+ - **層2(生成):** 5つの異なる人間の声(境界メモ、フィールドノート、システムスケッチ、主張概要、相互質問記録)があり、禁止語により、声の統一を防ぎます。統合は、真実の要素を使用しますが、生成された文章は使用しません。両方の層は常に利用可能です。
290
+
291
+ - **トレーサビリティ:** 生成されたすべての文は、真実の要素にトレースバックできます。統合の指示には、要素が引用されています。相互質問は、実際の主張IDを対象としています。意見の相違グラフは、文章ではなく、その結果です。
292
+
293
+ **検証済み:** v0.4の理想的な実行例:894件のテスト、完全なトレーサビリティが検証済み。完全な成果物のチェーンについては、[`examples/golden-run.md`](examples/golden-run.md) を参照してください。
294
+
229
295
  ## ステータス
230
296
 
231
- - v0.1–v0.4: 基礎 - 試行、導入、改善パッケージ、スターターパッケージ
232
- - v1.0.0: 32のロール、完全なCLI、実績のある改善、マルチリポジトリの移植性
233
- - v1.0.2: ロールOSのロックダウン(ブートストラップの真実性の修正、init --force
234
- - v1.1.0: 31のロール、完全なルーティング機能、競合検出、エスカレーション、証拠、ディスパッチ、7つの実績のあるチームパッケージ。35回の実行テスト。212件のテスト。
235
- - v1.2.0: デフォルトとして推奨されるパッケージ。自動選択、不整合の検出、代替案の提案、自由ルーティングのフォールバック。246件のテスト。
236
- - v1.3.0: 結果の調整、タスクの細分化、複合実行、適応的な再計画。317件のテスト。
237
- - v1.4.0: セッションの基盤 - `roleos init claude`、`roleos doctor`、ルートカード、/roleos-route + /roleos-review + /roleos-status コマンド。335件のテスト。
238
- - v1.5.0: フックの基盤 - 実行時の強制のための5つのライフサイクルフック。358件のテスト。
239
- - v1.6.0: 成果物の基盤 - ロールごとの20件の成果物契約、7件のパッケージ引き継ぎ契約、構造検証。385件のテスト。
240
- - v1.7.0: 完了の証明 - 実際のタスクをフルスタックで実行。`roleos artifacts` CLI。構造的な修正に関する正直なエスカレーション。398件のテスト。
241
- - v1.8.0: ミッションライブラリ(Phase S)- 6つの名前付きミッション、ランナーエンジン、完了レポート。6回の実際の試行錯誤で強化されています。481件のテスト。
242
- - **v1.9.0**: 統合されたエントリパス(Phase T)- `roleos start`は、ミッション、パッケージ、または自由ルーティングを自動的に決定します。フォールバックシステム、複合検出、エントリパスの比較テスト。527件のテスト。
297
+ - v0.1–v0.4: 基礎機能 テスト、導入、トリートメントパック、スターターパック
298
+ - v1.0.0: 32種類のロール、フルCLI、実績のあるトリートメント、マルチリポジトリ対応
299
+ - v1.0.2: ロールOSのロックダウン(初期設定の修正、`init --force`コマンド)
300
+ - v1.1.0: 31種類のロール、フルルーティング機能、競合検出、エスカレーション、証拠収集、ディスパッチ、7種類の実績のあるチームパック。35回の実行テスト。212件のテスト。
301
+ - v1.2.0: キャリブレーションされたパックがデフォルト設定に。自動選択、不整合検出、代替案の提案、フリールーティングへのフォールバック。246件のテスト。
302
+ - v1.3.0: 結果のキャリブレーション、タスクの細分化、複合実行、適応的な再計画。317件のテスト。
303
+ - v1.4.0: セッション機能 `roleos init claude`、`roleos doctor`、ルートカード、`/roleos-route`、`/roleos-review`、`/roleos-status`コマンド。335件のテスト。
304
+ - v1.5.0: フック機能 実行時強制のための5つのライフサイクルフック。358件のテスト。
305
+ - v1.6.0: アーティファクト機能 各ロールごとの20種類のアーティファクト契約、7種類のパックハンドオフ契約、構造検証。385件のテスト。
306
+ - v1.7.0: 完了の検証 実際のタスクをフルスタックで実行。`roleos artifacts` CLI。構造的な修正に対する正直なエスカレーション。398件のテスト。
307
+ - v1.8.0: ミッションライブラリ(フェーズS 6種類の名前付きミッション、実行エンジン、完了レポート。6回の実際のテストで強化。481件のテスト。
308
+ - v1.9.0: 統合されたエントリーパス(フェーズT `roleos start`コマンドが、ミッション、パック、フリールーティングを自動的に選択。フォールバック機能、複合検出、エントリーパスの比較テスト。527件のテスト。
309
+ - **v2.0.0**: ユーザーエクスペリエンス改善(フェーズU) — `roleos run`コマンドが、永続的なディスクベースの実行を作成。再開、次へ、説明、完了、失敗。介入:リルーティング、エスカレーション、再試行、ブロック、再開。各ステップでの詳細なガイダンス。摩擦の測定。6件の摩擦テスト。613件のテスト。
310
+ - **v2.0.1**: マニュアルの監査、初心者向けドキュメント、テスト件数の修正。617件のテスト。
311
+ - **v2.1.0**: ブレインストーミングミッション(v0.4) — 法分野に特化したロール、追跡可能な意見の相違、判決を含む出力。2層アーキテクチャ(真実性 + レンダリング)、クロスエグザム権限マトリックス、紛争グラフ、黄金の実行の検証。7種類のミッション、50種類のロール、8種類のパック。894件のテスト。
243
312
 
244
313
  ## ライセンス
245
314
 
package/README.md CHANGED
@@ -2,7 +2,6 @@
2
2
  <a href="README.ja.md">日本語</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.pt-BR.md">Português (BR)</a>
3
3
  </p>
4
4
 
5
-
6
5
  <p align="center">
7
6
  <img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
8
7
  </p>
@@ -14,7 +13,7 @@
14
13
  <a href="https://mcp-tool-shop-org.github.io/role-os/"><img src="https://img.shields.io/badge/Landing_Page-live-brightgreen" alt="Landing Page"></a>
15
14
  </p>
16
15
 
17
- A multi-Claude operating system that staffs, routes, validates, and runs work through 31 specialized role contracts. Creates task packets, assembles the right team from scored role matching, detects broken chains before execution, auto-routes recovery when work is blocked or rejected, and requires structured evidence in every verdict.
16
+ A multi-Claude operating system that staffs, routes, validates, and runs work through 50 specialized role contracts. Creates task packets, assembles the right team from scored role matching, detects broken chains before execution, auto-routes recovery when work is blocked or rejected, and requires structured evidence in every verdict.
18
17
 
19
18
  ## What it does
20
19
 
@@ -104,7 +103,7 @@ Full treatment is a canonical 7-phase protocol defined in Claude project memory
104
103
 
105
104
  Order: Shipcheck first, then full treatment. No v1.0.0 without passing hard gates.
106
105
 
107
- ## 31 roles across 8 packs
106
+ ## 50 roles across 8 packs
108
107
 
109
108
  | Pack | Roles |
110
109
  |------|-------|
@@ -181,6 +180,12 @@ Role OS was proven across three trial shapes in two structurally different repos
181
180
  - Same treatment pack, structurally different repo (creative workspace vs game)
182
181
  - Treatment Pack portable — no contract modifications needed
183
182
 
183
+ **Brainstorm golden run** (MCP server marketplace topic)
184
+ - 9-role chain, 4 analysts in parallel, cross-examine + rebut dispute graph
185
+ - 4 challenges issued, 3 claims narrowed, 1 unresolved — healthy pressure, not deadlock
186
+ - 16+ trace links from rendered artifacts back to truth-layer atoms
187
+ - Full chain of custody proven: truth → atoms → dispute → synthesis → expand → judge → render → trace
188
+
184
189
  ## Core properties
185
190
 
186
191
  These are non-negotiable. If a change weakens any of them, reject it.
@@ -201,7 +206,7 @@ role-os/
201
206
  entry-cmd.mjs ← `roleos start` CLI command
202
207
  run.mjs ← Persistent run engine: create → step → pause → resume → report
203
208
  run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
204
- mission.mjs ← 6 named mission types (feature, bugfix, treatment, docs, security, research)
209
+ mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
205
210
  mission-run.mjs ← Mission runner: create → step → complete → report
206
211
  mission-cmd.mjs ← `roleos mission` CLI commands
207
212
  route.mjs ← 31-role routing + dynamic chain builder
@@ -210,14 +215,17 @@ role-os/
210
215
  escalation.mjs ← Auto-routing for blocked/rejected/split
211
216
  evidence.mjs ← Structured evidence + role-aware requirements
212
217
  dispatch.mjs ← Runtime dispatch manifests for multi-claude
213
- artifacts.mjs ← 20 per-role artifact contracts + 7 pack handoffs
218
+ artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
214
219
  decompose.mjs ← Composite task detection + splitting
215
220
  composite.mjs ← Dependency-ordered execution + recovery
216
221
  replan.mjs ← Mid-run adaptive replanning
217
222
  calibration.mjs ← Outcome recording + weight tuning
218
223
  hooks.mjs ← 5 lifecycle hooks for runtime enforcement
219
224
  session.mjs ← Session scaffolding + doctor
220
- test/ 613 tests across 25 test files
225
+ brainstorm.mjs Evidence modes, request validation, finding/synthesis/judge schemas
226
+ brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
227
+ brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
228
+ test/ ← 894 tests across 30 test files
221
229
  starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
222
230
  ```
223
231
 
@@ -243,13 +251,14 @@ Role OS operates **locally only**. It copies markdown templates and writes packe
243
251
  | **Adaptive replanning** | Mid-run scope changes, findings, or new requirements update the plan without restarting. | ✓ Shipped |
244
252
  | **Session spine** | `roleos init claude` scaffolds CLAUDE.md, /roleos-route, /roleos-review, /roleos-status. `roleos doctor` verifies wiring. Route cards prove engagement. | ✓ Shipped |
245
253
  | **Hook spine** | 5 lifecycle hooks (SessionStart, PromptSubmit, PreToolUse, SubagentStart, Stop). Advisory enforcement: route card reminders, write-tool gating, subagent role injection, completion audit. | ✓ Shipped |
246
- | **Artifact spine** | 20 per-role artifact contracts. 7 pack handoff contracts. Structural validation. Chain completeness checks. Downstream roles never guess what they received. | ✓ Shipped |
247
- | **Mission library** | 6 named missions (feature-ship, bugfix, treatment, docs-release, security-hardening, research-launch). Each declares pack, role chain, artifact flow, escalation branches, honest-partial definition. All 6 trial-run and hardened. | ✓ Shipped |
254
+ | **Artifact spine** | 30 per-role artifact contracts. 7 pack handoff contracts. Structural validation. Chain completeness checks. Downstream roles never guess what they received. | ✓ Shipped |
255
+ | **Mission library** | 7 named missions (feature-ship, bugfix, treatment, docs-release, security-hardening, research-launch, brainstorm). Each declares pack, role chain, artifact flow, escalation branches, honest-partial definition. All 7 trial-proven. | ✓ Shipped |
248
256
  | **Mission runner** | Create runs, step through with tracked state, complete/fail with honest reporting. Blocked-step propagation, out-of-chain escalation warnings, last-step re-opening. | ✓ Shipped |
249
257
  | **Unified entry** | `roleos start` decides mission vs pack vs free routing automatically. Fallback ladder with confidence scores, alternatives, and composite detection. | ✓ Shipped |
250
258
  | **Persistent runs** | `roleos run` creates disk-backed runs. `resume`, `next`, `explain`, `complete`, `fail`. Interventions: reroute, escalate, retry, block, reopen. Step-local guidance. Friction measurement. | ✓ Shipped |
259
+ | **Brainstorm** | Two-layer architecture: truth (role-native schemas, provenance atoms, cross-exam dispute graph) + render (5 distinct voices, lexical bans, debate transcript). Trace links prove every rendered claim maps to a truth atom. Golden run: 894 tests. | ✓ Shipped |
251
260
 
252
- ## 6 missions
261
+ ## 7 missions
253
262
 
254
263
  | Mission | Pack | Roles | When to use |
255
264
  |---------|------|-------|-------------|
@@ -259,9 +268,30 @@ Role OS operates **locally only**. It copies markdown templates and writes packe
259
268
  | `docs-release` | docs | 2 | Write/update documentation, release notes |
260
269
  | `security-hardening` | security | 4 | Threat model, audit, fix vulnerabilities, re-audit, verify |
261
270
  | `research-launch` | research | 4 | Frame question, research, document findings, decide |
271
+ | `brainstorm` | brainstorm | 9 | Structured multi-perspective inquiry with traceable disagreement and verdict |
262
272
 
263
273
  Each mission includes honest-partial definitions — when work stalls, the system documents what was completed and what remains instead of bluffing completion.
264
274
 
275
+ ### Brainstorm mission
276
+
277
+ Not "AI brainstorming." The brainstorm mission is **specialized roles under law, with traceable disagreement and verdict-bearing output.**
278
+
279
+ ```bash
280
+ roleos run "explore product directions for a developer tool discovery platform"
281
+ # → MISSION: Brainstorm (Structured Inquiry)
282
+ # Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
283
+ ```
284
+
285
+ **What makes it different:**
286
+
287
+ - **Layer 1 (truth):** Four analysts emit role-native schemas (ContextMap, UserValueMap, MechanicsMap, PositioningMap) — not shared prose. Each role is blindspot-enforced: forbidden phrases, forbidden claim kinds, filtered input partitions. Atoms carry provenance. A directed cross-examination graph produces targeted challenges. Original analysts defend, narrow, or retract under pressure.
288
+
289
+ - **Layer 2 (render):** Five distinct human voices (Boundary Memo, Field Notes, System Sketch, Claim Brief, Cross-Exam Transcript) with lexical bans preventing voice convergence. Synthesis consumes truth, never rendered prose. Both layers always available.
290
+
291
+ - **Chain of custody:** Every rendered sentence traces back to a truth-layer atom. Synthesis directions cite atoms. Cross-exam targets real claim IDs. The dispute graph is the product, not the prose.
292
+
293
+ **Proven:** v0.4 golden run — 894 tests, full chain of custody verified. See [`examples/golden-run.md`](examples/golden-run.md) for the complete artifact chain.
294
+
265
295
  ## Status
266
296
 
267
297
  - v0.1–v0.4: Foundation — trials, adoption, treatment pack, starter pack
@@ -277,6 +307,8 @@ Each mission includes honest-partial definitions — when work stalls, the syste
277
307
  - v1.8.0: Mission library (Phase S) — 6 named missions, runner engine, completion reports. Hardened from 6 real trial runs. 481 tests.
278
308
  - v1.9.0: Unified entry path (Phase T) — `roleos start` auto-decides mission vs pack vs free routing. Fallback ladder, composite detection, entry-path comparison trials. 527 tests.
279
309
  - **v2.0.0**: Operator friction pass (Phase U) — `roleos run` creates persistent disk-backed runs. Resume, next, explain, complete, fail. Interventions: reroute, escalate, retry, block, reopen. Step-local guidance at every step. Friction measurement. 6 friction trials. 613 tests.
310
+ - **v2.0.1**: Handbook audit, beginner docs, test count corrections. 617 tests.
311
+ - **v2.1.0**: Brainstorm mission (v0.4) — specialized roles under law, traceable disagreement, verdict-bearing output. Two-layer architecture (truth + render), cross-exam permission matrix, dispute graph, golden run proof. 7 missions, 50 roles, 8 packs. 894 tests.
280
312
 
281
313
  ## License
282
314
 
package/README.pt-BR.md CHANGED
@@ -2,10 +2,8 @@
2
2
  <a href="README.ja.md">日本語</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.md">English</a>
3
3
  </p>
4
4
 
5
- # Role OS
6
-
7
5
  <p align="center">
8
- <img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="400">
6
+ <img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
9
7
  </p>
10
8
 
11
9
  <p align="center">
@@ -52,6 +50,35 @@ roleos start "something completely novel"
52
50
 
53
51
  O sistema nunca força o trabalho a passar pela camada de abstração incorreta. Ele explica por que escolheu cada nível e oferece alternativas.
54
52
 
53
+ **Um comando para iniciar a execução:**
54
+
55
+ ```bash
56
+ roleos run "fix the crash in save handler"
57
+ # → Created run: run-1234
58
+ # → Entry: MISSION (bugfix)
59
+ # → Started step 0: Repo Researcher → diagnosis-report
60
+ # → Guidance: Required sections: entrypoints, module-map, build-test-commands
61
+
62
+ roleos next # Start the next step
63
+ roleos complete diagnosis.md # Complete the active step with artifact
64
+ roleos explain # Show full run state and guidance
65
+ roleos resume # Continue an interrupted run
66
+ roleos report # Generate completion report
67
+ roleos friction # Measure operator touches
68
+ ```
69
+
70
+ **Intervenções quando algo dá errado:**
71
+
72
+ ```bash
73
+ roleos retry 0 # Retry a failed step
74
+ roleos reroute 1 "Frontend Developer" "UI bug" # Swap a role
75
+ roleos escalate "Test Engineer" "Repo Researcher" "missed edge case" "re-diagnose"
76
+ roleos block 2 "waiting for API spec"
77
+ roleos reopen 0 "found issue in review"
78
+ ```
79
+
80
+ As execuções são persistidas no disco (em `.claude/runs/`), permitindo que as sessões interrompidas sejam retomadas sem problemas. Cada etapa inclui orientações para o operador: o que produzir, as seções necessárias e as condições de parada.
81
+
55
82
  **Depois de direcionado:**
56
83
 
57
84
  1. **Cada função produz uma transferência:** saída estruturada com itens de evidência que reduzem a ambiguidade para a próxima função.
@@ -97,19 +124,24 @@ Cada função tem um contrato completo: missão, quando usar, quando não usar,
97
124
  npx role-os init
98
125
 
99
126
  # Describe what you need — Role OS picks the right level:
100
- roleos start "fix the crash in save handler"
127
+ roleos run "fix the crash in save handler"
128
+ # → Creates run, picks bugfix mission, starts first step with guidance
129
+
130
+ # Step through:
131
+ roleos next # Start next step
132
+ roleos complete artifact.md # Complete with artifact
133
+ roleos explain # Show full state
134
+ roleos report # Completion report
101
135
 
102
136
  # Or go manual:
137
+ roleos start "fix the crash" # Entry decision only (no run)
103
138
  roleos packet new feature
104
139
  roleos route .claude/packets/my-feature.md
105
140
  roleos review .claude/packets/my-feature.md accept
106
- roleos status
107
141
 
108
142
  # Explore missions and packs:
109
143
  roleos mission list
110
- roleos mission show bugfix
111
144
  roleos packs list
112
- roleos packs show feature
113
145
  ```
114
146
 
115
147
  ## Quando não usar o Role OS
@@ -148,6 +180,12 @@ O Role OS foi comprovado em três modelos de teste em dois repositórios estrutu
148
180
  - Mesmo pacote de tratamento, repositório estruturalmente diferente (ambiente de criação vs. jogo)
149
181
  - Pacote de tratamento portátil — nenhuma modificação no contrato é necessária
150
182
 
183
+ **Sessão de brainstorming de alta qualidade** (tópico do mercado de servidores MCP)
184
+ - Cadeia de 9 papéis, 4 analistas em paralelo, análise cruzada + gráfico de refutação de disputas.
185
+ - 4 desafios propostos, 3 alegações refinadas, 1 não resolvida — pressão saudável, sem impasse.
186
+ - Mais de 16 links de rastreamento dos artefatos gerados até os átomos da camada de verdade.
187
+ - Cadeia de custódia completa comprovada: verdade → átomos → disputa → síntese → expandir → julgar → renderizar → rastrear.
188
+
151
189
  ## Propriedades essenciais
152
190
 
153
191
  Estas são inegociáveis. Se uma alteração enfraquecer qualquer uma delas, rejeite-a.
@@ -166,7 +204,9 @@ role-os/
166
204
  src/
167
205
  entry.mjs ← Unified entry: mission → pack → free routing
168
206
  entry-cmd.mjs ← `roleos start` CLI command
169
- mission.mjs 6 named mission types (feature, bugfix, treatment, docs, security, research)
207
+ run.mjs Persistent run engine: create step pause resume → report
208
+ run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
209
+ mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
170
210
  mission-run.mjs ← Mission runner: create → step → complete → report
171
211
  mission-cmd.mjs ← `roleos mission` CLI commands
172
212
  route.mjs ← 31-role routing + dynamic chain builder
@@ -175,14 +215,17 @@ role-os/
175
215
  escalation.mjs ← Auto-routing for blocked/rejected/split
176
216
  evidence.mjs ← Structured evidence + role-aware requirements
177
217
  dispatch.mjs ← Runtime dispatch manifests for multi-claude
178
- artifacts.mjs ← 20 per-role artifact contracts + 7 pack handoffs
218
+ artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
179
219
  decompose.mjs ← Composite task detection + splitting
180
220
  composite.mjs ← Dependency-ordered execution + recovery
181
221
  replan.mjs ← Mid-run adaptive replanning
182
222
  calibration.mjs ← Outcome recording + weight tuning
183
223
  hooks.mjs ← 5 lifecycle hooks for runtime enforcement
184
224
  session.mjs ← Session scaffolding + doctor
185
- test/ 527 tests across 20 test files
225
+ brainstorm.mjs Evidence modes, request validation, finding/synthesis/judge schemas
226
+ brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
227
+ brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
228
+ test/ ← 894 tests across 30 test files
186
229
  starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
187
230
  ```
188
231
 
@@ -212,6 +255,8 @@ O sistema "Role OS" opera **apenas localmente**. Ele copia modelos em formato Ma
212
255
  | **Mission library** | 6 missões nomeadas (feature-ship, bugfix, treatment, docs-release, security-hardening, research-launch). Cada uma define pacote, cadeia de papéis, fluxo de artefatos, ramificações de escalonamento, definição honesta e parcial. Todas as 6 foram testadas e aprimoradas. | ✓ Implementado. |
213
256
  | **Mission runner** | Criação de execuções, acompanhamento passo a passo com estado rastreado, conclusão/falha com relatórios precisos. Propagação de etapas bloqueadas, avisos de escalonamento fora da cadeia, reabertura da última etapa. | ✓ Implementado. |
214
257
  | **Unified entry** | `roleos start` decide automaticamente entre missão, pacote ou roteamento livre. Sistema de fallback com pontuações de confiança, alternativas e detecção composta. | ✓ Implementado. |
258
+ | **Persistent runs** | `roleos run` cria execuções com backup no disco. Comandos: `resume` (retomar), `next` (próximo), `explain` (explicar), `complete` (concluir), `fail` (falha). Intervenções: redirecionar, escalar, tentar novamente, bloquear, reabrir. Orientações específicas para cada etapa. Medição de atrito. | ✓ Implementado. |
259
+ | **Brainstorm** | Arquitetura de duas camadas: verdade (esquemas nativos do papel, átomos de procedência, gráfico de disputa de análise cruzada) + renderização (5 vozes distintas, restrições lexicais, transcrição do debate). Os links de rastreamento comprovam que cada alegação renderizada corresponde a um átomo de verdade. Sessão de brainstorming de alta qualidade: 894 testes. | ✓ Implementado. |
215
260
 
216
261
  ## 6 missões
217
262
 
@@ -223,23 +268,47 @@ O sistema "Role OS" opera **apenas localmente**. Ele copia modelos em formato Ma
223
268
  | `docs-release` | Documentação | 2 | Escrever/atualizar documentação, notas de lançamento |
224
269
  | `security-hardening` | Segurança | 4 | Modelo de ameaças, auditoria, correção de vulnerabilidades, re-auditoria, verificação |
225
270
  | `research-launch` | Pesquisa | 4 | Formular a pergunta, pesquisar, documentar os resultados, decidir |
271
+ | `brainstorm` | brainstorming | 9 | Investigação estruturada com múltiplas perspectivas, com desacordo rastreável e veredicto. |
226
272
 
227
273
  Cada missão inclui definições honestas e parciais — quando o trabalho é interrompido, o sistema documenta o que foi concluído e o que resta, em vez de apresentar uma conclusão falsa.
228
274
 
275
+ ### Missão de brainstorming
276
+
277
+ Não é "brainstorming de IA". A missão de brainstorming é **papéis especializados sob a lei, com desacordo rastreável e resultados que comprovam o veredicto.**
278
+
279
+ ```bash
280
+ roleos run "explore product directions for a developer tool discovery platform"
281
+ # → MISSION: Brainstorm (Structured Inquiry)
282
+ # Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
283
+ ```
284
+
285
+ **O que a diferencia:**
286
+
287
+ - **Camada 1 (verdade):** Quatro analistas emitem esquemas nativos do papel (ContextMap, UserValueMap, MechanicsMap, PositioningMap) — não é prosa compartilhada. Cada papel tem restrições para evitar pontos cegos: frases proibidas, tipos de alegações proibidas, partições de entrada filtradas. Os átomos carregam informações de procedência. Um gráfico de análise cruzada direcionada gera desafios específicos. Os analistas originais defendem, refinam ou retiram suas alegações sob pressão.
288
+
289
+ - **Camada 2 (renderização):** Cinco vozes humanas distintas (Boundary Memo, Field Notes, System Sketch, Claim Brief, Cross-Exam Transcript) com restrições lexicais para evitar a convergência das vozes. A síntese consome a verdade, nunca a prosa renderizada. Ambas as camadas estão sempre disponíveis.
290
+
291
+ - **Cadeia de custódia:** Cada frase renderizada rastreia até um átomo da camada de verdade. As instruções de síntese citam os átomos. Os alvos da análise cruzada são IDs de alegações reais. O gráfico de disputa é o produto, não a prosa.
292
+
293
+ **Comprovado:** versão 0.4 da sessão de brainstorming de alta qualidade — 894 testes, cadeia de custódia completa verificada. Consulte [`examples/golden-run.md`](examples/golden-run.md) para a cadeia completa de artefatos.
294
+
229
295
  ## Status
230
296
 
231
- - v0.1–v0.4: Fundação — testes, adoção, pacote de tratamento, pacote inicial
232
- - v1.0.0: 32 papéis, CLI completa, tratamento comprovado, portabilidade multi-repositório
233
- - v1.0.2: Bloqueio do sistema de papéis (correções de inicialização da verdade, init --force)
234
- - v1.1.0: 31 papéis, espinha dorsal de roteamento completa, detecção de conflitos, escalonamento, evidências, despacho, 7 pacotes de equipe comprovados. 35 execuções de teste. 212 testes.
235
- - v1.2.0: Pacotes calibrados promovidos à entrada padrão. Seleção automática, detecção de incompatibilidades, sugestão de alternativas, fallback de roteamento livre. 246 testes.
297
+ - v0.1–v0.4: Fundação — testes, adoção, pacote de tratamento, pacote inicial.
298
+ - v1.0.0: 32 funções, CLI completa, tratamento comprovado, portabilidade multi-repositório.
299
+ - v1.0.2: Bloqueio do sistema operacional para funções (correções de inicialização, `init --force`).
300
+ - v1.1.0: 31 funções, roteamento completo, detecção de conflitos, escalonamento, evidências, despacho, 7 pacotes de equipe comprovados. 35 testes de execução. 212 testes.
301
+ - v1.2.0: Pacotes calibrados promovidos a entrada padrão. Seleção automática, detecção de incompatibilidades, sugestão alternativa, fallback de roteamento livre. 246 testes.
236
302
  - v1.3.0: Calibração de resultados, decomposição de tarefas mistas, execução composta, replanejamento adaptativo. 317 testes.
237
- - v1.4.0: Espinha dorsal da sessão — `roleos init claude`, `roleos doctor`, cartões de roteamento, comandos /roleos-route + /roleos-review + /roleos-status. 335 testes.
238
- - v1.5.0: Espinha dorsal de ganchos — 5 ganchos de ciclo de vida para aplicação de políticas em tempo de execução. 358 testes.
239
- - v1.6.0: Espinha dorsal de artefatos — 20 contratos de artefatos por papel, 7 contratos de transferência de pacotes, validação estrutural. 385 testes.
240
- - v1.7.0: Prova de conclusão — tarefas reais executadas em toda a pilha. CLI `roleos artifacts`. Escalonamento honesto para correções estruturais. 398 testes.
241
- - v1.8.0: Biblioteca de missões (Fase S) — 6 missões nomeadas, motor de execução, relatórios de conclusão. Aprimorado a partir de 6 execuções de teste reais. 481 testes.
242
- - **v1.9.0**: Caminho de entrada unificado (Fase T) — `roleos start` decide automaticamente entre missão, pacote ou roteamento livre. Sistema de fallback, detecção composta, testes de comparação de caminho de entrada. 527 testes.
303
+ - v1.4.0: Espinha dorsal da sessão — `roleos init claude`, `roleos doctor`, cartões de rota, comandos `/roleos-route + /roleos-review + /roleos-status`. 335 testes.
304
+ - v1.5.0: Espinha dorsal de hooks — 5 hooks de ciclo de vida para aplicação em tempo de execução. 358 testes.
305
+ - v1.6.0: Espinha dorsal de artefatos — 20 contratos de artefatos por função, 7 contratos de transferência de pacotes, validação estrutural. 385 testes.
306
+ - v1.7.0: Prova de conclusão — tarefas reais executadas em toda a pilha. CLI `roleos artifacts`. Escalabilidade honesta para correções estruturais. 398 testes.
307
+ - v1.8.0: Biblioteca de missões (Fase S) — 6 missões nomeadas, motor de execução, relatórios de conclusão. Reforçado com 6 execuções de teste reais. 481 testes.
308
+ - v1.9.0: Caminho de entrada unificado (Fase T) — `roleos start` decide automaticamente entre missão, pacote ou roteamento livre. Escada de fallback, detecção composta, testes de comparação de caminho de entrada. 527 testes.
309
+ - **v2.0.0**: Otimização da experiência do usuário (Fase U) — `roleos run` cria execuções persistentes com backup em disco. Retomar, próximo, explicar, completar, falhar. Intervenções: redirecionar, escalar, tentar novamente, bloquear, reabrir. Orientação passo a passo em cada etapa. Medição de atrito. 6 testes de atrito. 613 testes.
310
+ - **v2.0.1**: Auditoria do manual, documentação para iniciantes, correções na contagem de testes. 617 testes.
311
+ - **v2.1.0**: Missão de brainstorming (v0.4) — funções especializadas sob a lei, desacordo rastreável, saída com valor de decisão. Arquitetura de duas camadas (verdade + renderização), matriz de permissão de interrogatório, grafo de disputas, prova de execução ideal. 7 missões, 50 funções, 8 pacotes. 894 testes.
243
312
 
244
313
  ## Licença
245
314