role-os 2.0.0 → 2.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +28 -0
- package/README.es.md +123 -54
- package/README.fr.md +90 -21
- package/README.hi.md +90 -21
- package/README.it.md +130 -61
- package/README.ja.md +91 -22
- package/README.md +41 -9
- package/README.pt-BR.md +90 -21
- package/README.zh.md +160 -88
- package/package.json +2 -2
- package/src/artifacts.mjs +526 -437
- package/src/brainstorm-render.mjs +462 -0
- package/src/brainstorm-roles.mjs +774 -0
- package/src/brainstorm.mjs +778 -0
- package/src/dispatch.mjs +333 -310
- package/src/mission.mjs +445 -388
- package/src/packs.mjs +397 -359
- package/src/route.mjs +685 -564
package/README.ja.md
CHANGED
|
@@ -2,10 +2,8 @@
|
|
|
2
2
|
<a href="README.md">English</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.pt-BR.md">Português (BR)</a>
|
|
3
3
|
</p>
|
|
4
4
|
|
|
5
|
-
# Role OS
|
|
6
|
-
|
|
7
5
|
<p align="center">
|
|
8
|
-
<img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="
|
|
6
|
+
<img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
|
|
9
7
|
</p>
|
|
10
8
|
|
|
11
9
|
<p align="center">
|
|
@@ -52,6 +50,35 @@ roleos start "something completely novel"
|
|
|
52
50
|
|
|
53
51
|
このシステムは、常に適切な抽象レベルでタスクを実行させます。各レベルを選択した理由を説明し、代替案も提示します。
|
|
54
52
|
|
|
53
|
+
**実行を開始するコマンド:**
|
|
54
|
+
|
|
55
|
+
```bash
|
|
56
|
+
roleos run "fix the crash in save handler"
|
|
57
|
+
# → Created run: run-1234
|
|
58
|
+
# → Entry: MISSION (bugfix)
|
|
59
|
+
# → Started step 0: Repo Researcher → diagnosis-report
|
|
60
|
+
# → Guidance: Required sections: entrypoints, module-map, build-test-commands
|
|
61
|
+
|
|
62
|
+
roleos next # Start the next step
|
|
63
|
+
roleos complete diagnosis.md # Complete the active step with artifact
|
|
64
|
+
roleos explain # Show full run state and guidance
|
|
65
|
+
roleos resume # Continue an interrupted run
|
|
66
|
+
roleos report # Generate completion report
|
|
67
|
+
roleos friction # Measure operator touches
|
|
68
|
+
```
|
|
69
|
+
|
|
70
|
+
**問題発生時の対応:**
|
|
71
|
+
|
|
72
|
+
```bash
|
|
73
|
+
roleos retry 0 # Retry a failed step
|
|
74
|
+
roleos reroute 1 "Frontend Developer" "UI bug" # Swap a role
|
|
75
|
+
roleos escalate "Test Engineer" "Repo Researcher" "missed edge case" "re-diagnose"
|
|
76
|
+
roleos block 2 "waiting for API spec"
|
|
77
|
+
roleos reopen 0 "found issue in review"
|
|
78
|
+
```
|
|
79
|
+
|
|
80
|
+
実行結果はディスクに保存されます(`.claude/runs/`)。そのため、中断されたセッションも正常に再開できます。各ステップには、オペレーター向けのガイダンスが含まれており、生成すべき内容、必要なセクション、および停止条件が示されています。
|
|
81
|
+
|
|
55
82
|
**ルーティング後:**
|
|
56
83
|
|
|
57
84
|
1. **各役割は成果物を生成:** 構造化された出力で、次の役割が理解しやすいように、証拠となる情報が含まれています。
|
|
@@ -97,19 +124,24 @@ Role OSは、Claudeプロジェクトのメモリと連携します。置き換
|
|
|
97
124
|
npx role-os init
|
|
98
125
|
|
|
99
126
|
# Describe what you need — Role OS picks the right level:
|
|
100
|
-
roleos
|
|
127
|
+
roleos run "fix the crash in save handler"
|
|
128
|
+
# → Creates run, picks bugfix mission, starts first step with guidance
|
|
129
|
+
|
|
130
|
+
# Step through:
|
|
131
|
+
roleos next # Start next step
|
|
132
|
+
roleos complete artifact.md # Complete with artifact
|
|
133
|
+
roleos explain # Show full state
|
|
134
|
+
roleos report # Completion report
|
|
101
135
|
|
|
102
136
|
# Or go manual:
|
|
137
|
+
roleos start "fix the crash" # Entry decision only (no run)
|
|
103
138
|
roleos packet new feature
|
|
104
139
|
roleos route .claude/packets/my-feature.md
|
|
105
140
|
roleos review .claude/packets/my-feature.md accept
|
|
106
|
-
roleos status
|
|
107
141
|
|
|
108
142
|
# Explore missions and packs:
|
|
109
143
|
roleos mission list
|
|
110
|
-
roleos mission show bugfix
|
|
111
144
|
roleos packs list
|
|
112
|
-
roleos packs show feature
|
|
113
145
|
```
|
|
114
146
|
|
|
115
147
|
## Role OSを使用しない場合
|
|
@@ -148,6 +180,12 @@ Role OSは、構造が異なる2つのリポジトリで、3つの異なるテ
|
|
|
148
180
|
- 同じトリートメントパッケージを使用。構造は異なり、リポジトリの内容も異なる(クリエイティブワークスペース vs ゲーム)。
|
|
149
181
|
- トリートメントパッケージは移植可能。契約の変更は不要。
|
|
150
182
|
|
|
183
|
+
**理想的な実行例(MCPサーバーマーケットプレイスのトピック)**
|
|
184
|
+
- 9つの役割を持つ連携、並行して4人の分析者。相互に質問し、反論するグラフ。
|
|
185
|
+
- 4つの課題が提示され、3つの主張が絞り込まれ、1つが未解決。健全なプレッシャーがかかっていますが、行き詰まりはありません。
|
|
186
|
+
- 生成された成果物から、真実の要素への16以上の追跡リンク。
|
|
187
|
+
- 完全なトレーサビリティが証明されています:真実 → 要素 → 反論 → 統合 → 拡張 → 評価 → 生成 → 追跡
|
|
188
|
+
|
|
151
189
|
## 主要な特性
|
|
152
190
|
|
|
153
191
|
これらは変更できません。変更によってこれらのいずれかが弱体化する場合は、却下してください。
|
|
@@ -166,7 +204,9 @@ role-os/
|
|
|
166
204
|
src/
|
|
167
205
|
entry.mjs ← Unified entry: mission → pack → free routing
|
|
168
206
|
entry-cmd.mjs ← `roleos start` CLI command
|
|
169
|
-
|
|
207
|
+
run.mjs ← Persistent run engine: create → step → pause → resume → report
|
|
208
|
+
run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
|
|
209
|
+
mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
|
|
170
210
|
mission-run.mjs ← Mission runner: create → step → complete → report
|
|
171
211
|
mission-cmd.mjs ← `roleos mission` CLI commands
|
|
172
212
|
route.mjs ← 31-role routing + dynamic chain builder
|
|
@@ -175,14 +215,17 @@ role-os/
|
|
|
175
215
|
escalation.mjs ← Auto-routing for blocked/rejected/split
|
|
176
216
|
evidence.mjs ← Structured evidence + role-aware requirements
|
|
177
217
|
dispatch.mjs ← Runtime dispatch manifests for multi-claude
|
|
178
|
-
artifacts.mjs ←
|
|
218
|
+
artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
|
|
179
219
|
decompose.mjs ← Composite task detection + splitting
|
|
180
220
|
composite.mjs ← Dependency-ordered execution + recovery
|
|
181
221
|
replan.mjs ← Mid-run adaptive replanning
|
|
182
222
|
calibration.mjs ← Outcome recording + weight tuning
|
|
183
223
|
hooks.mjs ← 5 lifecycle hooks for runtime enforcement
|
|
184
224
|
session.mjs ← Session scaffolding + doctor
|
|
185
|
-
|
|
225
|
+
brainstorm.mjs ← Evidence modes, request validation, finding/synthesis/judge schemas
|
|
226
|
+
brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
|
|
227
|
+
brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
|
|
228
|
+
test/ ← 894 tests across 30 test files
|
|
186
229
|
starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
|
|
187
230
|
```
|
|
188
231
|
|
|
@@ -212,6 +255,8 @@ Role OSは、**ローカルでのみ**動作します。Markdownテンプレー
|
|
|
212
255
|
| **Mission library** | 6つの名前付きミッション(新機能追加、バグ修正、改善、ドキュメントのリリース、セキュリティ強化、研究開発)。それぞれが、パッケージ、ロールチェーン、成果物の流れ、エスカレーションのブランチ、正直で部分的な定義を宣言します。6つすべてが試行錯誤され、強化されています。 | ✓ 完了 |
|
|
213
256
|
| **Mission runner** | 実行を開始し、追跡された状態とともにステップを進め、正直なレポートで完了または失敗。ブロックされたステップの伝播、チェーンからの逸脱に関する警告、最後のステップの再開。 | ✓ 完了 |
|
|
214
257
|
| **Unified entry** | `roleos start`は、ミッション、パッケージ、または自由ルーティングを自動的に決定します。信頼度スコア、代替案、および複合検出を備えたフォールバックシステム。 | ✓ 完了 |
|
|
258
|
+
| **Persistent runs** | `roleos run` コマンドは、ディスクに保存された実行結果を作成します。`resume`(再開)、`next`(次へ)、`explain`(説明)、`complete`(完了)、`fail`(失敗)。対応:`reroute`(リダイレクト)、`escalate`(エスカレーション)、`retry`(再試行)、`block`(ブロック)、`reopen`(再開)。各ステップにローカルなガイダンスがあります。摩擦の測定。 | ✓ 完了 |
|
|
259
|
+
| **Brainstorm** | 2層のアーキテクチャ:真実層(役割固有のスキーマ、トレーサビリティを持つ要素、相互質問と反論のグラフ)+ 生成層(5つの異なる声、禁止語、議論の記録)。追跡リンクは、生成されたすべての主張が、真実の要素に対応していることを証明します。理想的な実行例:894件のテスト。 | ✓ 完了 |
|
|
215
260
|
|
|
216
261
|
## 6つのミッション
|
|
217
262
|
|
|
@@ -223,23 +268,47 @@ Role OSは、**ローカルでのみ**動作します。Markdownテンプレー
|
|
|
223
268
|
| `docs-release` | ドキュメント | 2 | ドキュメントの作成/更新、リリースノート |
|
|
224
269
|
| `security-hardening` | セキュリティ | 4 | 脅威モデルの作成、監査、脆弱性の修正、再監査、検証 |
|
|
225
270
|
| `research-launch` | 研究 | 4 | 問題の定義、調査、結果の文書化、決定 |
|
|
271
|
+
| `brainstorm` | ブレインストーミング | 9 | 追跡可能な意見の相違と結論を持つ、構造化された多角的な調査 |
|
|
226
272
|
|
|
227
273
|
各ミッションには、正直で部分的な定義が含まれています。作業が停滞した場合、システムは完了した内容と残りの内容を記録し、進捗を偽装することはありません。
|
|
228
274
|
|
|
275
|
+
### ブレインストーミングミッション
|
|
276
|
+
|
|
277
|
+
これは「AIによるブレインストーミング」ではありません。ブレインストーミングミッションは、**法に基づいて定義された役割であり、追跡可能な意見の相違と、結論を導き出すための出力を持つ**ものです。
|
|
278
|
+
|
|
279
|
+
```bash
|
|
280
|
+
roleos run "explore product directions for a developer tool discovery platform"
|
|
281
|
+
# → MISSION: Brainstorm (Structured Inquiry)
|
|
282
|
+
# Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
|
|
283
|
+
```
|
|
284
|
+
|
|
285
|
+
**何が違うのか:**
|
|
286
|
+
|
|
287
|
+
- **層1(真実):** 4人の分析者が、役割固有のスキーマ(コンテキストマップ、ユーザーバリューマップ、メカニズムマップ、ポジショニングマップ)を生成します。これは、共有された文章ではありません。各役割には、盲点防止機能が組み込まれており、禁止語、禁止される主張の種類、およびフィルタリングされた入力セクションがあります。要素には、トレーサビリティ情報が含まれています。方向性のある相互質問グラフにより、ターゲットを絞った課題が生成されます。元の分析者は、プレッシャーの下で、主張を擁護したり、絞り込んだり、撤回したりします。
|
|
288
|
+
|
|
289
|
+
- **層2(生成):** 5つの異なる人間の声(境界メモ、フィールドノート、システムスケッチ、主張概要、相互質問記録)があり、禁止語により、声の統一を防ぎます。統合は、真実の要素を使用しますが、生成された文章は使用しません。両方の層は常に利用可能です。
|
|
290
|
+
|
|
291
|
+
- **トレーサビリティ:** 生成されたすべての文は、真実の要素にトレースバックできます。統合の指示には、要素が引用されています。相互質問は、実際の主張IDを対象としています。意見の相違グラフは、文章ではなく、その結果です。
|
|
292
|
+
|
|
293
|
+
**検証済み:** v0.4の理想的な実行例:894件のテスト、完全なトレーサビリティが検証済み。完全な成果物のチェーンについては、[`examples/golden-run.md`](examples/golden-run.md) を参照してください。
|
|
294
|
+
|
|
229
295
|
## ステータス
|
|
230
296
|
|
|
231
|
-
- v0.1–v0.4:
|
|
232
|
-
- v1.0.0: 32
|
|
233
|
-
- v1.0.2: ロールOS
|
|
234
|
-
- v1.1.0: 31
|
|
235
|
-
- v1.2.0:
|
|
236
|
-
- v1.3.0:
|
|
237
|
-
- v1.4.0:
|
|
238
|
-
- v1.5.0:
|
|
239
|
-
- v1.6.0:
|
|
240
|
-
- v1.7.0:
|
|
241
|
-
- v1.8.0:
|
|
242
|
-
-
|
|
297
|
+
- v0.1–v0.4: 基礎機能 — テスト、導入、トリートメントパック、スターターパック
|
|
298
|
+
- v1.0.0: 32種類のロール、フルCLI、実績のあるトリートメント、マルチリポジトリ対応
|
|
299
|
+
- v1.0.2: ロールOSのロックダウン(初期設定の修正、`init --force`コマンド)
|
|
300
|
+
- v1.1.0: 31種類のロール、フルルーティング機能、競合検出、エスカレーション、証拠収集、ディスパッチ、7種類の実績のあるチームパック。35回の実行テスト。212件のテスト。
|
|
301
|
+
- v1.2.0: キャリブレーションされたパックがデフォルト設定に。自動選択、不整合検出、代替案の提案、フリールーティングへのフォールバック。246件のテスト。
|
|
302
|
+
- v1.3.0: 結果のキャリブレーション、タスクの細分化、複合実行、適応的な再計画。317件のテスト。
|
|
303
|
+
- v1.4.0: セッション機能 — `roleos init claude`、`roleos doctor`、ルートカード、`/roleos-route`、`/roleos-review`、`/roleos-status`コマンド。335件のテスト。
|
|
304
|
+
- v1.5.0: フック機能 — 実行時強制のための5つのライフサイクルフック。358件のテスト。
|
|
305
|
+
- v1.6.0: アーティファクト機能 — 各ロールごとの20種類のアーティファクト契約、7種類のパックハンドオフ契約、構造検証。385件のテスト。
|
|
306
|
+
- v1.7.0: 完了の検証 — 実際のタスクをフルスタックで実行。`roleos artifacts` CLI。構造的な修正に対する正直なエスカレーション。398件のテスト。
|
|
307
|
+
- v1.8.0: ミッションライブラリ(フェーズS) — 6種類の名前付きミッション、実行エンジン、完了レポート。6回の実際のテストで強化。481件のテスト。
|
|
308
|
+
- v1.9.0: 統合されたエントリーパス(フェーズT) — `roleos start`コマンドが、ミッション、パック、フリールーティングを自動的に選択。フォールバック機能、複合検出、エントリーパスの比較テスト。527件のテスト。
|
|
309
|
+
- **v2.0.0**: ユーザーエクスペリエンス改善(フェーズU) — `roleos run`コマンドが、永続的なディスクベースの実行を作成。再開、次へ、説明、完了、失敗。介入:リルーティング、エスカレーション、再試行、ブロック、再開。各ステップでの詳細なガイダンス。摩擦の測定。6件の摩擦テスト。613件のテスト。
|
|
310
|
+
- **v2.0.1**: マニュアルの監査、初心者向けドキュメント、テスト件数の修正。617件のテスト。
|
|
311
|
+
- **v2.1.0**: ブレインストーミングミッション(v0.4) — 法分野に特化したロール、追跡可能な意見の相違、判決を含む出力。2層アーキテクチャ(真実性 + レンダリング)、クロスエグザム権限マトリックス、紛争グラフ、黄金の実行の検証。7種類のミッション、50種類のロール、8種類のパック。894件のテスト。
|
|
243
312
|
|
|
244
313
|
## ライセンス
|
|
245
314
|
|
package/README.md
CHANGED
|
@@ -2,7 +2,6 @@
|
|
|
2
2
|
<a href="README.ja.md">日本語</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.pt-BR.md">Português (BR)</a>
|
|
3
3
|
</p>
|
|
4
4
|
|
|
5
|
-
|
|
6
5
|
<p align="center">
|
|
7
6
|
<img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
|
|
8
7
|
</p>
|
|
@@ -14,7 +13,7 @@
|
|
|
14
13
|
<a href="https://mcp-tool-shop-org.github.io/role-os/"><img src="https://img.shields.io/badge/Landing_Page-live-brightgreen" alt="Landing Page"></a>
|
|
15
14
|
</p>
|
|
16
15
|
|
|
17
|
-
A multi-Claude operating system that staffs, routes, validates, and runs work through
|
|
16
|
+
A multi-Claude operating system that staffs, routes, validates, and runs work through 50 specialized role contracts. Creates task packets, assembles the right team from scored role matching, detects broken chains before execution, auto-routes recovery when work is blocked or rejected, and requires structured evidence in every verdict.
|
|
18
17
|
|
|
19
18
|
## What it does
|
|
20
19
|
|
|
@@ -104,7 +103,7 @@ Full treatment is a canonical 7-phase protocol defined in Claude project memory
|
|
|
104
103
|
|
|
105
104
|
Order: Shipcheck first, then full treatment. No v1.0.0 without passing hard gates.
|
|
106
105
|
|
|
107
|
-
##
|
|
106
|
+
## 50 roles across 8 packs
|
|
108
107
|
|
|
109
108
|
| Pack | Roles |
|
|
110
109
|
|------|-------|
|
|
@@ -181,6 +180,12 @@ Role OS was proven across three trial shapes in two structurally different repos
|
|
|
181
180
|
- Same treatment pack, structurally different repo (creative workspace vs game)
|
|
182
181
|
- Treatment Pack portable — no contract modifications needed
|
|
183
182
|
|
|
183
|
+
**Brainstorm golden run** (MCP server marketplace topic)
|
|
184
|
+
- 9-role chain, 4 analysts in parallel, cross-examine + rebut dispute graph
|
|
185
|
+
- 4 challenges issued, 3 claims narrowed, 1 unresolved — healthy pressure, not deadlock
|
|
186
|
+
- 16+ trace links from rendered artifacts back to truth-layer atoms
|
|
187
|
+
- Full chain of custody proven: truth → atoms → dispute → synthesis → expand → judge → render → trace
|
|
188
|
+
|
|
184
189
|
## Core properties
|
|
185
190
|
|
|
186
191
|
These are non-negotiable. If a change weakens any of them, reject it.
|
|
@@ -201,7 +206,7 @@ role-os/
|
|
|
201
206
|
entry-cmd.mjs ← `roleos start` CLI command
|
|
202
207
|
run.mjs ← Persistent run engine: create → step → pause → resume → report
|
|
203
208
|
run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
|
|
204
|
-
mission.mjs ←
|
|
209
|
+
mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
|
|
205
210
|
mission-run.mjs ← Mission runner: create → step → complete → report
|
|
206
211
|
mission-cmd.mjs ← `roleos mission` CLI commands
|
|
207
212
|
route.mjs ← 31-role routing + dynamic chain builder
|
|
@@ -210,14 +215,17 @@ role-os/
|
|
|
210
215
|
escalation.mjs ← Auto-routing for blocked/rejected/split
|
|
211
216
|
evidence.mjs ← Structured evidence + role-aware requirements
|
|
212
217
|
dispatch.mjs ← Runtime dispatch manifests for multi-claude
|
|
213
|
-
artifacts.mjs ←
|
|
218
|
+
artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
|
|
214
219
|
decompose.mjs ← Composite task detection + splitting
|
|
215
220
|
composite.mjs ← Dependency-ordered execution + recovery
|
|
216
221
|
replan.mjs ← Mid-run adaptive replanning
|
|
217
222
|
calibration.mjs ← Outcome recording + weight tuning
|
|
218
223
|
hooks.mjs ← 5 lifecycle hooks for runtime enforcement
|
|
219
224
|
session.mjs ← Session scaffolding + doctor
|
|
220
|
-
|
|
225
|
+
brainstorm.mjs ← Evidence modes, request validation, finding/synthesis/judge schemas
|
|
226
|
+
brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
|
|
227
|
+
brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
|
|
228
|
+
test/ ← 894 tests across 30 test files
|
|
221
229
|
starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
|
|
222
230
|
```
|
|
223
231
|
|
|
@@ -243,13 +251,14 @@ Role OS operates **locally only**. It copies markdown templates and writes packe
|
|
|
243
251
|
| **Adaptive replanning** | Mid-run scope changes, findings, or new requirements update the plan without restarting. | ✓ Shipped |
|
|
244
252
|
| **Session spine** | `roleos init claude` scaffolds CLAUDE.md, /roleos-route, /roleos-review, /roleos-status. `roleos doctor` verifies wiring. Route cards prove engagement. | ✓ Shipped |
|
|
245
253
|
| **Hook spine** | 5 lifecycle hooks (SessionStart, PromptSubmit, PreToolUse, SubagentStart, Stop). Advisory enforcement: route card reminders, write-tool gating, subagent role injection, completion audit. | ✓ Shipped |
|
|
246
|
-
| **Artifact spine** |
|
|
247
|
-
| **Mission library** |
|
|
254
|
+
| **Artifact spine** | 30 per-role artifact contracts. 7 pack handoff contracts. Structural validation. Chain completeness checks. Downstream roles never guess what they received. | ✓ Shipped |
|
|
255
|
+
| **Mission library** | 7 named missions (feature-ship, bugfix, treatment, docs-release, security-hardening, research-launch, brainstorm). Each declares pack, role chain, artifact flow, escalation branches, honest-partial definition. All 7 trial-proven. | ✓ Shipped |
|
|
248
256
|
| **Mission runner** | Create runs, step through with tracked state, complete/fail with honest reporting. Blocked-step propagation, out-of-chain escalation warnings, last-step re-opening. | ✓ Shipped |
|
|
249
257
|
| **Unified entry** | `roleos start` decides mission vs pack vs free routing automatically. Fallback ladder with confidence scores, alternatives, and composite detection. | ✓ Shipped |
|
|
250
258
|
| **Persistent runs** | `roleos run` creates disk-backed runs. `resume`, `next`, `explain`, `complete`, `fail`. Interventions: reroute, escalate, retry, block, reopen. Step-local guidance. Friction measurement. | ✓ Shipped |
|
|
259
|
+
| **Brainstorm** | Two-layer architecture: truth (role-native schemas, provenance atoms, cross-exam dispute graph) + render (5 distinct voices, lexical bans, debate transcript). Trace links prove every rendered claim maps to a truth atom. Golden run: 894 tests. | ✓ Shipped |
|
|
251
260
|
|
|
252
|
-
##
|
|
261
|
+
## 7 missions
|
|
253
262
|
|
|
254
263
|
| Mission | Pack | Roles | When to use |
|
|
255
264
|
|---------|------|-------|-------------|
|
|
@@ -259,9 +268,30 @@ Role OS operates **locally only**. It copies markdown templates and writes packe
|
|
|
259
268
|
| `docs-release` | docs | 2 | Write/update documentation, release notes |
|
|
260
269
|
| `security-hardening` | security | 4 | Threat model, audit, fix vulnerabilities, re-audit, verify |
|
|
261
270
|
| `research-launch` | research | 4 | Frame question, research, document findings, decide |
|
|
271
|
+
| `brainstorm` | brainstorm | 9 | Structured multi-perspective inquiry with traceable disagreement and verdict |
|
|
262
272
|
|
|
263
273
|
Each mission includes honest-partial definitions — when work stalls, the system documents what was completed and what remains instead of bluffing completion.
|
|
264
274
|
|
|
275
|
+
### Brainstorm mission
|
|
276
|
+
|
|
277
|
+
Not "AI brainstorming." The brainstorm mission is **specialized roles under law, with traceable disagreement and verdict-bearing output.**
|
|
278
|
+
|
|
279
|
+
```bash
|
|
280
|
+
roleos run "explore product directions for a developer tool discovery platform"
|
|
281
|
+
# → MISSION: Brainstorm (Structured Inquiry)
|
|
282
|
+
# Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
|
|
283
|
+
```
|
|
284
|
+
|
|
285
|
+
**What makes it different:**
|
|
286
|
+
|
|
287
|
+
- **Layer 1 (truth):** Four analysts emit role-native schemas (ContextMap, UserValueMap, MechanicsMap, PositioningMap) — not shared prose. Each role is blindspot-enforced: forbidden phrases, forbidden claim kinds, filtered input partitions. Atoms carry provenance. A directed cross-examination graph produces targeted challenges. Original analysts defend, narrow, or retract under pressure.
|
|
288
|
+
|
|
289
|
+
- **Layer 2 (render):** Five distinct human voices (Boundary Memo, Field Notes, System Sketch, Claim Brief, Cross-Exam Transcript) with lexical bans preventing voice convergence. Synthesis consumes truth, never rendered prose. Both layers always available.
|
|
290
|
+
|
|
291
|
+
- **Chain of custody:** Every rendered sentence traces back to a truth-layer atom. Synthesis directions cite atoms. Cross-exam targets real claim IDs. The dispute graph is the product, not the prose.
|
|
292
|
+
|
|
293
|
+
**Proven:** v0.4 golden run — 894 tests, full chain of custody verified. See [`examples/golden-run.md`](examples/golden-run.md) for the complete artifact chain.
|
|
294
|
+
|
|
265
295
|
## Status
|
|
266
296
|
|
|
267
297
|
- v0.1–v0.4: Foundation — trials, adoption, treatment pack, starter pack
|
|
@@ -277,6 +307,8 @@ Each mission includes honest-partial definitions — when work stalls, the syste
|
|
|
277
307
|
- v1.8.0: Mission library (Phase S) — 6 named missions, runner engine, completion reports. Hardened from 6 real trial runs. 481 tests.
|
|
278
308
|
- v1.9.0: Unified entry path (Phase T) — `roleos start` auto-decides mission vs pack vs free routing. Fallback ladder, composite detection, entry-path comparison trials. 527 tests.
|
|
279
309
|
- **v2.0.0**: Operator friction pass (Phase U) — `roleos run` creates persistent disk-backed runs. Resume, next, explain, complete, fail. Interventions: reroute, escalate, retry, block, reopen. Step-local guidance at every step. Friction measurement. 6 friction trials. 613 tests.
|
|
310
|
+
- **v2.0.1**: Handbook audit, beginner docs, test count corrections. 617 tests.
|
|
311
|
+
- **v2.1.0**: Brainstorm mission (v0.4) — specialized roles under law, traceable disagreement, verdict-bearing output. Two-layer architecture (truth + render), cross-exam permission matrix, dispute graph, golden run proof. 7 missions, 50 roles, 8 packs. 894 tests.
|
|
280
312
|
|
|
281
313
|
## License
|
|
282
314
|
|
package/README.pt-BR.md
CHANGED
|
@@ -2,10 +2,8 @@
|
|
|
2
2
|
<a href="README.ja.md">日本語</a> | <a href="README.zh.md">中文</a> | <a href="README.es.md">Español</a> | <a href="README.fr.md">Français</a> | <a href="README.hi.md">हिन्दी</a> | <a href="README.it.md">Italiano</a> | <a href="README.md">English</a>
|
|
3
3
|
</p>
|
|
4
4
|
|
|
5
|
-
# Role OS
|
|
6
|
-
|
|
7
5
|
<p align="center">
|
|
8
|
-
<img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="
|
|
6
|
+
<img src="https://raw.githubusercontent.com/mcp-tool-shop-org/brand/main/logos/role-os/readme.png" alt="Role OS" width="600">
|
|
9
7
|
</p>
|
|
10
8
|
|
|
11
9
|
<p align="center">
|
|
@@ -52,6 +50,35 @@ roleos start "something completely novel"
|
|
|
52
50
|
|
|
53
51
|
O sistema nunca força o trabalho a passar pela camada de abstração incorreta. Ele explica por que escolheu cada nível e oferece alternativas.
|
|
54
52
|
|
|
53
|
+
**Um comando para iniciar a execução:**
|
|
54
|
+
|
|
55
|
+
```bash
|
|
56
|
+
roleos run "fix the crash in save handler"
|
|
57
|
+
# → Created run: run-1234
|
|
58
|
+
# → Entry: MISSION (bugfix)
|
|
59
|
+
# → Started step 0: Repo Researcher → diagnosis-report
|
|
60
|
+
# → Guidance: Required sections: entrypoints, module-map, build-test-commands
|
|
61
|
+
|
|
62
|
+
roleos next # Start the next step
|
|
63
|
+
roleos complete diagnosis.md # Complete the active step with artifact
|
|
64
|
+
roleos explain # Show full run state and guidance
|
|
65
|
+
roleos resume # Continue an interrupted run
|
|
66
|
+
roleos report # Generate completion report
|
|
67
|
+
roleos friction # Measure operator touches
|
|
68
|
+
```
|
|
69
|
+
|
|
70
|
+
**Intervenções quando algo dá errado:**
|
|
71
|
+
|
|
72
|
+
```bash
|
|
73
|
+
roleos retry 0 # Retry a failed step
|
|
74
|
+
roleos reroute 1 "Frontend Developer" "UI bug" # Swap a role
|
|
75
|
+
roleos escalate "Test Engineer" "Repo Researcher" "missed edge case" "re-diagnose"
|
|
76
|
+
roleos block 2 "waiting for API spec"
|
|
77
|
+
roleos reopen 0 "found issue in review"
|
|
78
|
+
```
|
|
79
|
+
|
|
80
|
+
As execuções são persistidas no disco (em `.claude/runs/`), permitindo que as sessões interrompidas sejam retomadas sem problemas. Cada etapa inclui orientações para o operador: o que produzir, as seções necessárias e as condições de parada.
|
|
81
|
+
|
|
55
82
|
**Depois de direcionado:**
|
|
56
83
|
|
|
57
84
|
1. **Cada função produz uma transferência:** saída estruturada com itens de evidência que reduzem a ambiguidade para a próxima função.
|
|
@@ -97,19 +124,24 @@ Cada função tem um contrato completo: missão, quando usar, quando não usar,
|
|
|
97
124
|
npx role-os init
|
|
98
125
|
|
|
99
126
|
# Describe what you need — Role OS picks the right level:
|
|
100
|
-
roleos
|
|
127
|
+
roleos run "fix the crash in save handler"
|
|
128
|
+
# → Creates run, picks bugfix mission, starts first step with guidance
|
|
129
|
+
|
|
130
|
+
# Step through:
|
|
131
|
+
roleos next # Start next step
|
|
132
|
+
roleos complete artifact.md # Complete with artifact
|
|
133
|
+
roleos explain # Show full state
|
|
134
|
+
roleos report # Completion report
|
|
101
135
|
|
|
102
136
|
# Or go manual:
|
|
137
|
+
roleos start "fix the crash" # Entry decision only (no run)
|
|
103
138
|
roleos packet new feature
|
|
104
139
|
roleos route .claude/packets/my-feature.md
|
|
105
140
|
roleos review .claude/packets/my-feature.md accept
|
|
106
|
-
roleos status
|
|
107
141
|
|
|
108
142
|
# Explore missions and packs:
|
|
109
143
|
roleos mission list
|
|
110
|
-
roleos mission show bugfix
|
|
111
144
|
roleos packs list
|
|
112
|
-
roleos packs show feature
|
|
113
145
|
```
|
|
114
146
|
|
|
115
147
|
## Quando não usar o Role OS
|
|
@@ -148,6 +180,12 @@ O Role OS foi comprovado em três modelos de teste em dois repositórios estrutu
|
|
|
148
180
|
- Mesmo pacote de tratamento, repositório estruturalmente diferente (ambiente de criação vs. jogo)
|
|
149
181
|
- Pacote de tratamento portátil — nenhuma modificação no contrato é necessária
|
|
150
182
|
|
|
183
|
+
**Sessão de brainstorming de alta qualidade** (tópico do mercado de servidores MCP)
|
|
184
|
+
- Cadeia de 9 papéis, 4 analistas em paralelo, análise cruzada + gráfico de refutação de disputas.
|
|
185
|
+
- 4 desafios propostos, 3 alegações refinadas, 1 não resolvida — pressão saudável, sem impasse.
|
|
186
|
+
- Mais de 16 links de rastreamento dos artefatos gerados até os átomos da camada de verdade.
|
|
187
|
+
- Cadeia de custódia completa comprovada: verdade → átomos → disputa → síntese → expandir → julgar → renderizar → rastrear.
|
|
188
|
+
|
|
151
189
|
## Propriedades essenciais
|
|
152
190
|
|
|
153
191
|
Estas são inegociáveis. Se uma alteração enfraquecer qualquer uma delas, rejeite-a.
|
|
@@ -166,7 +204,9 @@ role-os/
|
|
|
166
204
|
src/
|
|
167
205
|
entry.mjs ← Unified entry: mission → pack → free routing
|
|
168
206
|
entry-cmd.mjs ← `roleos start` CLI command
|
|
169
|
-
|
|
207
|
+
run.mjs ← Persistent run engine: create → step → pause → resume → report
|
|
208
|
+
run-cmd.mjs ← `roleos run/resume/next/explain/complete/fail` + interventions
|
|
209
|
+
mission.mjs ← 7 named mission types (feature, bugfix, treatment, docs, security, research, brainstorm)
|
|
170
210
|
mission-run.mjs ← Mission runner: create → step → complete → report
|
|
171
211
|
mission-cmd.mjs ← `roleos mission` CLI commands
|
|
172
212
|
route.mjs ← 31-role routing + dynamic chain builder
|
|
@@ -175,14 +215,17 @@ role-os/
|
|
|
175
215
|
escalation.mjs ← Auto-routing for blocked/rejected/split
|
|
176
216
|
evidence.mjs ← Structured evidence + role-aware requirements
|
|
177
217
|
dispatch.mjs ← Runtime dispatch manifests for multi-claude
|
|
178
|
-
artifacts.mjs ←
|
|
218
|
+
artifacts.mjs ← 30 per-role artifact contracts + 7 pack handoffs
|
|
179
219
|
decompose.mjs ← Composite task detection + splitting
|
|
180
220
|
composite.mjs ← Dependency-ordered execution + recovery
|
|
181
221
|
replan.mjs ← Mid-run adaptive replanning
|
|
182
222
|
calibration.mjs ← Outcome recording + weight tuning
|
|
183
223
|
hooks.mjs ← 5 lifecycle hooks for runtime enforcement
|
|
184
224
|
session.mjs ← Session scaffolding + doctor
|
|
185
|
-
|
|
225
|
+
brainstorm.mjs ← Evidence modes, request validation, finding/synthesis/judge schemas
|
|
226
|
+
brainstorm-roles.mjs ← Role-native schemas, input partitioning, blindspot enforcement, cross-exam
|
|
227
|
+
brainstorm-render.mjs ← Two-layer rendering: lexical bans, render schemas, debate transcript
|
|
228
|
+
test/ ← 894 tests across 30 test files
|
|
186
229
|
starter-pack/ ← Drop-in role contracts, policies, schemas, workflows
|
|
187
230
|
```
|
|
188
231
|
|
|
@@ -212,6 +255,8 @@ O sistema "Role OS" opera **apenas localmente**. Ele copia modelos em formato Ma
|
|
|
212
255
|
| **Mission library** | 6 missões nomeadas (feature-ship, bugfix, treatment, docs-release, security-hardening, research-launch). Cada uma define pacote, cadeia de papéis, fluxo de artefatos, ramificações de escalonamento, definição honesta e parcial. Todas as 6 foram testadas e aprimoradas. | ✓ Implementado. |
|
|
213
256
|
| **Mission runner** | Criação de execuções, acompanhamento passo a passo com estado rastreado, conclusão/falha com relatórios precisos. Propagação de etapas bloqueadas, avisos de escalonamento fora da cadeia, reabertura da última etapa. | ✓ Implementado. |
|
|
214
257
|
| **Unified entry** | `roleos start` decide automaticamente entre missão, pacote ou roteamento livre. Sistema de fallback com pontuações de confiança, alternativas e detecção composta. | ✓ Implementado. |
|
|
258
|
+
| **Persistent runs** | `roleos run` cria execuções com backup no disco. Comandos: `resume` (retomar), `next` (próximo), `explain` (explicar), `complete` (concluir), `fail` (falha). Intervenções: redirecionar, escalar, tentar novamente, bloquear, reabrir. Orientações específicas para cada etapa. Medição de atrito. | ✓ Implementado. |
|
|
259
|
+
| **Brainstorm** | Arquitetura de duas camadas: verdade (esquemas nativos do papel, átomos de procedência, gráfico de disputa de análise cruzada) + renderização (5 vozes distintas, restrições lexicais, transcrição do debate). Os links de rastreamento comprovam que cada alegação renderizada corresponde a um átomo de verdade. Sessão de brainstorming de alta qualidade: 894 testes. | ✓ Implementado. |
|
|
215
260
|
|
|
216
261
|
## 6 missões
|
|
217
262
|
|
|
@@ -223,23 +268,47 @@ O sistema "Role OS" opera **apenas localmente**. Ele copia modelos em formato Ma
|
|
|
223
268
|
| `docs-release` | Documentação | 2 | Escrever/atualizar documentação, notas de lançamento |
|
|
224
269
|
| `security-hardening` | Segurança | 4 | Modelo de ameaças, auditoria, correção de vulnerabilidades, re-auditoria, verificação |
|
|
225
270
|
| `research-launch` | Pesquisa | 4 | Formular a pergunta, pesquisar, documentar os resultados, decidir |
|
|
271
|
+
| `brainstorm` | brainstorming | 9 | Investigação estruturada com múltiplas perspectivas, com desacordo rastreável e veredicto. |
|
|
226
272
|
|
|
227
273
|
Cada missão inclui definições honestas e parciais — quando o trabalho é interrompido, o sistema documenta o que foi concluído e o que resta, em vez de apresentar uma conclusão falsa.
|
|
228
274
|
|
|
275
|
+
### Missão de brainstorming
|
|
276
|
+
|
|
277
|
+
Não é "brainstorming de IA". A missão de brainstorming é **papéis especializados sob a lei, com desacordo rastreável e resultados que comprovam o veredicto.**
|
|
278
|
+
|
|
279
|
+
```bash
|
|
280
|
+
roleos run "explore product directions for a developer tool discovery platform"
|
|
281
|
+
# → MISSION: Brainstorm (Structured Inquiry)
|
|
282
|
+
# Chain: 4 Analysts (parallel) → Normalize → Cross-Examine → Rebut → Synthesize → Expand → Judge
|
|
283
|
+
```
|
|
284
|
+
|
|
285
|
+
**O que a diferencia:**
|
|
286
|
+
|
|
287
|
+
- **Camada 1 (verdade):** Quatro analistas emitem esquemas nativos do papel (ContextMap, UserValueMap, MechanicsMap, PositioningMap) — não é prosa compartilhada. Cada papel tem restrições para evitar pontos cegos: frases proibidas, tipos de alegações proibidas, partições de entrada filtradas. Os átomos carregam informações de procedência. Um gráfico de análise cruzada direcionada gera desafios específicos. Os analistas originais defendem, refinam ou retiram suas alegações sob pressão.
|
|
288
|
+
|
|
289
|
+
- **Camada 2 (renderização):** Cinco vozes humanas distintas (Boundary Memo, Field Notes, System Sketch, Claim Brief, Cross-Exam Transcript) com restrições lexicais para evitar a convergência das vozes. A síntese consome a verdade, nunca a prosa renderizada. Ambas as camadas estão sempre disponíveis.
|
|
290
|
+
|
|
291
|
+
- **Cadeia de custódia:** Cada frase renderizada rastreia até um átomo da camada de verdade. As instruções de síntese citam os átomos. Os alvos da análise cruzada são IDs de alegações reais. O gráfico de disputa é o produto, não a prosa.
|
|
292
|
+
|
|
293
|
+
**Comprovado:** versão 0.4 da sessão de brainstorming de alta qualidade — 894 testes, cadeia de custódia completa verificada. Consulte [`examples/golden-run.md`](examples/golden-run.md) para a cadeia completa de artefatos.
|
|
294
|
+
|
|
229
295
|
## Status
|
|
230
296
|
|
|
231
|
-
- v0.1–v0.4: Fundação — testes, adoção, pacote de tratamento, pacote inicial
|
|
232
|
-
- v1.0.0: 32
|
|
233
|
-
- v1.0.2: Bloqueio do sistema
|
|
234
|
-
- v1.1.0: 31
|
|
235
|
-
- v1.2.0: Pacotes calibrados promovidos
|
|
297
|
+
- v0.1–v0.4: Fundação — testes, adoção, pacote de tratamento, pacote inicial.
|
|
298
|
+
- v1.0.0: 32 funções, CLI completa, tratamento comprovado, portabilidade multi-repositório.
|
|
299
|
+
- v1.0.2: Bloqueio do sistema operacional para funções (correções de inicialização, `init --force`).
|
|
300
|
+
- v1.1.0: 31 funções, roteamento completo, detecção de conflitos, escalonamento, evidências, despacho, 7 pacotes de equipe comprovados. 35 testes de execução. 212 testes.
|
|
301
|
+
- v1.2.0: Pacotes calibrados promovidos a entrada padrão. Seleção automática, detecção de incompatibilidades, sugestão alternativa, fallback de roteamento livre. 246 testes.
|
|
236
302
|
- v1.3.0: Calibração de resultados, decomposição de tarefas mistas, execução composta, replanejamento adaptativo. 317 testes.
|
|
237
|
-
- v1.4.0: Espinha dorsal da sessão — `roleos init claude`, `roleos doctor`, cartões de
|
|
238
|
-
- v1.5.0: Espinha dorsal de
|
|
239
|
-
- v1.6.0: Espinha dorsal de artefatos — 20 contratos de artefatos por
|
|
240
|
-
- v1.7.0: Prova de conclusão — tarefas reais executadas em toda a pilha. CLI `roleos artifacts`.
|
|
241
|
-
- v1.8.0: Biblioteca de missões (Fase S) — 6 missões nomeadas, motor de execução, relatórios de conclusão.
|
|
242
|
-
-
|
|
303
|
+
- v1.4.0: Espinha dorsal da sessão — `roleos init claude`, `roleos doctor`, cartões de rota, comandos `/roleos-route + /roleos-review + /roleos-status`. 335 testes.
|
|
304
|
+
- v1.5.0: Espinha dorsal de hooks — 5 hooks de ciclo de vida para aplicação em tempo de execução. 358 testes.
|
|
305
|
+
- v1.6.0: Espinha dorsal de artefatos — 20 contratos de artefatos por função, 7 contratos de transferência de pacotes, validação estrutural. 385 testes.
|
|
306
|
+
- v1.7.0: Prova de conclusão — tarefas reais executadas em toda a pilha. CLI `roleos artifacts`. Escalabilidade honesta para correções estruturais. 398 testes.
|
|
307
|
+
- v1.8.0: Biblioteca de missões (Fase S) — 6 missões nomeadas, motor de execução, relatórios de conclusão. Reforçado com 6 execuções de teste reais. 481 testes.
|
|
308
|
+
- v1.9.0: Caminho de entrada unificado (Fase T) — `roleos start` decide automaticamente entre missão, pacote ou roteamento livre. Escada de fallback, detecção composta, testes de comparação de caminho de entrada. 527 testes.
|
|
309
|
+
- **v2.0.0**: Otimização da experiência do usuário (Fase U) — `roleos run` cria execuções persistentes com backup em disco. Retomar, próximo, explicar, completar, falhar. Intervenções: redirecionar, escalar, tentar novamente, bloquear, reabrir. Orientação passo a passo em cada etapa. Medição de atrito. 6 testes de atrito. 613 testes.
|
|
310
|
+
- **v2.0.1**: Auditoria do manual, documentação para iniciantes, correções na contagem de testes. 617 testes.
|
|
311
|
+
- **v2.1.0**: Missão de brainstorming (v0.4) — funções especializadas sob a lei, desacordo rastreável, saída com valor de decisão. Arquitetura de duas camadas (verdade + renderização), matriz de permissão de interrogatório, grafo de disputas, prova de execução ideal. 7 missões, 50 funções, 8 pacotes. 894 testes.
|
|
243
312
|
|
|
244
313
|
## Licença
|
|
245
314
|
|