oh-my-customcodex 0.5.8 → 0.5.9
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +4 -4
- package/dist/cli/index.js +1 -1
- package/dist/index.js +1 -1
- package/package.json +1 -1
- package/templates/.claude/hooks/scripts/agent-teams-advisor.sh +4 -1
- package/templates/.claude/rules/MUST-agent-teams.md +85 -246
- package/templates/.claude/skills/de-lead-routing/SKILL.md +6 -13
- package/templates/.claude/skills/dev-lead-routing/SKILL.md +6 -13
- package/templates/.claude/skills/intent-detection/SKILL.md +7 -9
- package/templates/.claude/skills/research/SKILL.md +8 -23
- package/templates/.claude/skills/roundtable-debate/SKILL.md +3 -4
- package/templates/.claude/skills/structured-dev-cycle/SKILL.md +7 -10
- package/templates/AGENTS.md.en +1 -2
- package/templates/AGENTS.md.ko +1 -2
- package/templates/CLAUDE.md +2 -2
- package/templates/CLAUDE.md.en +1 -2
- package/templates/CLAUDE.md.ko +1 -2
- package/templates/README.md +2 -2
- package/templates/guides/multi-agent-debate-patterns/README.md +1 -1
- package/templates/guides/multi-provider-exec/README.md +9 -79
- package/templates/manifest.json +2 -2
- package/templates/.claude/skills/agora/SKILL.md +0 -209
- package/templates/.claude/skills/codex-exec/SKILL.md +0 -218
- package/templates/.claude/skills/codex-exec/scripts/codex-wrapper.cjs +0 -433
- package/templates/.claude/skills/gemini-exec/SKILL.md +0 -215
- package/templates/.claude/skills/gemini-exec/scripts/gemini-wrapper.cjs +0 -485
|
@@ -12,7 +12,7 @@ version: 1.0.0
|
|
|
12
12
|
|
|
13
13
|
## Purpose
|
|
14
14
|
|
|
15
|
-
Run a bounded debate when convergence would hide useful disagreement.
|
|
15
|
+
Run a bounded debate when convergence would hide useful disagreement. Use this workflow when debate should preserve minority positions and require explicit justification before dismissing them.
|
|
16
16
|
|
|
17
17
|
## When To Use
|
|
18
18
|
|
|
@@ -54,11 +54,10 @@ Hard cap: two debate rounds. If the decision still depends on missing facts, sto
|
|
|
54
54
|
{adopt | defer | reject | gather-more-evidence}
|
|
55
55
|
```
|
|
56
56
|
|
|
57
|
-
## Relationship To
|
|
57
|
+
## Relationship To Other Review Workflows
|
|
58
58
|
|
|
59
59
|
| Workflow | Goal | Best For |
|
|
60
60
|
|----------|------|----------|
|
|
61
|
-
| `agora` | adversarial consensus | release gates, spec approval |
|
|
62
61
|
| `roundtable-debate` | dissent preservation | ambiguous strategy, architectural tradeoffs |
|
|
63
62
|
|
|
64
|
-
Use `
|
|
63
|
+
Use `roundtable-debate` when current guidance needs adversarial review plus explicit dissent handling.
|
|
@@ -93,15 +93,12 @@ A PreToolUse hook in `.codex/hooks/hooks.json` checks this marker and blocks Wri
|
|
|
93
93
|
└── Output: Implementation complete
|
|
94
94
|
```
|
|
95
95
|
|
|
96
|
-
**Codex
|
|
97
|
-
1.
|
|
98
|
-
2. If
|
|
99
|
-
|
|
100
|
-
|
|
101
|
-
|
|
102
|
-
3. If unavailable → display `[Codex] Unavailable — proceeding with Claude experts directly` and proceed with standard implementation via Claude experts
|
|
103
|
-
|
|
104
|
-
Suitable for codex hybrid: new files, boilerplate, test stubs, scaffolding
|
|
96
|
+
**Optional Codex Plugin Interop**: When entering Stage 3:
|
|
97
|
+
1. Use domain expert agents as the default implementation path.
|
|
98
|
+
2. If the native Claude Code plugin `openai/codex-plugin-cc` is explicitly installed and requested, it may provide Codex interop for new-file scaffolding before expert review.
|
|
99
|
+
3. Otherwise display `[Codex Plugin] Not requested — proceeding with expert agents directly` and proceed with standard implementation.
|
|
100
|
+
|
|
101
|
+
Suitable for optional plugin interop: new files, boilerplate, test stubs, scaffolding
|
|
105
102
|
Not suitable: modifying existing code, architecture-dependent changes
|
|
106
103
|
|
|
107
104
|
**Exit criteria**: All planned files created/modified, tests written.
|
|
@@ -157,7 +154,7 @@ The stage marker file (`/tmp/.codex-dev-stage`) is read by a PreToolUse hook tha
|
|
|
157
154
|
For complex tasks, Agent Teams is **preferred** when available (R018):
|
|
158
155
|
- Plan: architect agent
|
|
159
156
|
- Verify: reviewer agent(s) — multi-model-verification via Agent Teams
|
|
160
|
-
- Implement: domain expert agent (+ codex-
|
|
157
|
+
- Implement: domain expert agent (+ optional `openai/codex-plugin-cc` interop only when explicitly installed/requested)
|
|
161
158
|
- Compound: QA agent
|
|
162
159
|
|
|
163
160
|
When Agent Teams is enabled AND task involves 3+ agents or review→fix cycles, using Agent Teams is MANDATORY per R018.
|
package/templates/AGENTS.md.en
CHANGED
|
@@ -110,7 +110,6 @@ NO EXCEPTIONS. NO EXCUSES.
|
|
|
110
110
|
| `/omcustomcodex:npm-version` | Manage semantic versions |
|
|
111
111
|
| `/omcustomcodex:npm-audit` | Audit dependencies |
|
|
112
112
|
| `/omcustomcodex-release-notes` | Generate release notes from git history |
|
|
113
|
-
| `/codex-exec` | Execute Codex CLI prompts |
|
|
114
113
|
| `/optimize-analyze` | Analyze bundle and performance |
|
|
115
114
|
| `/optimize-bundle` | Optimize bundle size |
|
|
116
115
|
| `/optimize-report` | Generate optimization report |
|
|
@@ -134,7 +133,7 @@ project/
|
|
|
134
133
|
| +-- hooks/ # Hook scripts (security, validation, HUD)
|
|
135
134
|
| +-- contexts/ # Context files (ecomode)
|
|
136
135
|
+-- .agents/
|
|
137
|
-
| +-- skills/ # Installed skills (
|
|
136
|
+
| +-- skills/ # Installed skills (120 directories)
|
|
138
137
|
+-- guides/ # Reference docs (51 topics)
|
|
139
138
|
```
|
|
140
139
|
|
package/templates/AGENTS.md.ko
CHANGED
|
@@ -110,7 +110,6 @@ oh-my-customcodex로 구동됩니다.
|
|
|
110
110
|
| `/omcustomcodex:npm-version` | 시맨틱 버전 관리 |
|
|
111
111
|
| `/omcustomcodex:npm-audit` | 의존성 감사 |
|
|
112
112
|
| `/omcustomcodex-release-notes` | 릴리즈 노트 생성 (git 히스토리 기반) |
|
|
113
|
-
| `/codex-exec` | Codex CLI 프롬프트 실행 |
|
|
114
113
|
| `/optimize-analyze` | 번들 및 성능 분석 |
|
|
115
114
|
| `/optimize-bundle` | 번들 크기 최적화 |
|
|
116
115
|
| `/optimize-report` | 최적화 리포트 생성 |
|
|
@@ -134,7 +133,7 @@ project/
|
|
|
134
133
|
| +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
|
|
135
134
|
| +-- contexts/ # 컨텍스트 파일 (ecomode)
|
|
136
135
|
+-- .agents/
|
|
137
|
-
| +-- skills/ # 설치된 스킬 (
|
|
136
|
+
| +-- skills/ # 설치된 스킬 (120 디렉토리)
|
|
138
137
|
+-- guides/ # 레퍼런스 문서 (51 토픽)
|
|
139
138
|
```
|
|
140
139
|
|
package/templates/CLAUDE.md
CHANGED
|
@@ -102,7 +102,7 @@ oh-my-customcodex로 구동됩니다.
|
|
|
102
102
|
| 개발 | `/omcustomcodex:goal`, `/dev-review`, `/dev-refactor`, `/sdd-dev`, `/structured-dev-cycle` | 목표 실행, 코드 리뷰, 리팩토링, SDD |
|
|
103
103
|
| 검증 | `/omcustomcodex:sauron-watch`, `/deep-verify`, `/adversarial-review` | R017 검증, 릴리즈 품질, 보안 리뷰 |
|
|
104
104
|
| 릴리즈 | `/pipeline auto-dev`, `/omcustomcodex-release-notes`, `/release-plan` | 자동 개발, 릴리즈 노트 |
|
|
105
|
-
| 리서치 | `/research`, `/scout`, `/deep-plan`, `/
|
|
105
|
+
| 리서치 | `/research`, `/scout`, `/deep-plan`, `/roundtable-debate` | 병렬 분석, URL 평가, 토론형 검증 |
|
|
106
106
|
| 메모리 | `/memory-save`, `/memory-recall` | 세션 메모리 관리 |
|
|
107
107
|
| 시스템 | `/token-efficiency-audit`, `/pre-generation-arch-check`, `/omcustomcodex:lists`, `/omcustomcodex:status`, `/omcustomcodex:help` | 토큰 효율/아키텍처 감사, 전체 목록, 상태, 도움말 |
|
|
108
108
|
|
|
@@ -119,7 +119,7 @@ project/
|
|
|
119
119
|
| +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
|
|
120
120
|
| +-- contexts/ # 컨텍스트 파일 (ecomode)
|
|
121
121
|
+-- .agents/
|
|
122
|
-
| +-- skills/ # 스킬 (
|
|
122
|
+
| +-- skills/ # 스킬 (120 디렉토리)
|
|
123
123
|
+-- guides/ # 레퍼런스 문서 (51 토픽)
|
|
124
124
|
```
|
|
125
125
|
|
package/templates/CLAUDE.md.en
CHANGED
|
@@ -112,7 +112,6 @@ NO EXCEPTIONS. NO EXCUSES.
|
|
|
112
112
|
| `/omcustomcodex:npm-version` | Manage semantic versions |
|
|
113
113
|
| `/omcustomcodex:npm-audit` | Audit dependencies |
|
|
114
114
|
| `/omcustomcodex-release-notes` | Generate release notes from git history |
|
|
115
|
-
| `/codex-exec` | Execute Codex CLI prompts |
|
|
116
115
|
| `/optimize-analyze` | Analyze bundle and performance |
|
|
117
116
|
| `/optimize-bundle` | Optimize bundle size |
|
|
118
117
|
| `/optimize-report` | Generate optimization report |
|
|
@@ -133,7 +132,7 @@ project/
|
|
|
133
132
|
+-- AGENTS.md # Entry point
|
|
134
133
|
+-- .codex/
|
|
135
134
|
| +-- agents/ # Subagent definitions (50 files)
|
|
136
|
-
| +-- skills/ # Skills (
|
|
135
|
+
| +-- skills/ # Skills (120 directories)
|
|
137
136
|
| +-- rules/ # Global rules (22 files)
|
|
138
137
|
| +-- hooks/ # Hook scripts (security, validation, HUD)
|
|
139
138
|
| +-- contexts/ # Context files (4 files)
|
package/templates/CLAUDE.md.ko
CHANGED
|
@@ -112,7 +112,6 @@ oh-my-customcodex로 구동됩니다.
|
|
|
112
112
|
| `/omcustomcodex:npm-version` | 시맨틱 버전 관리 |
|
|
113
113
|
| `/omcustomcodex:npm-audit` | 의존성 감사 |
|
|
114
114
|
| `/omcustomcodex-release-notes` | 릴리즈 노트 생성 (git 히스토리 기반) |
|
|
115
|
-
| `/codex-exec` | Codex CLI 프롬프트 실행 |
|
|
116
115
|
| `/optimize-analyze` | 번들 및 성능 분석 |
|
|
117
116
|
| `/optimize-bundle` | 번들 크기 최적화 |
|
|
118
117
|
| `/optimize-report` | 최적화 리포트 생성 |
|
|
@@ -133,7 +132,7 @@ project/
|
|
|
133
132
|
+-- AGENTS.md # 진입점
|
|
134
133
|
+-- .codex/
|
|
135
134
|
| +-- agents/ # 서브에이전트 정의 (50 파일)
|
|
136
|
-
| +-- skills/ # 스킬 (
|
|
135
|
+
| +-- skills/ # 스킬 (120 디렉토리)
|
|
137
136
|
| +-- rules/ # 전역 규칙 (22 파일)
|
|
138
137
|
| +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
|
|
139
138
|
| +-- contexts/ # 컨텍스트 파일 (4 파일)
|
package/templates/README.md
CHANGED
|
@@ -46,7 +46,7 @@ templates/
|
|
|
46
46
|
+-- workflows/ # project-level pipeline definitions
|
|
47
47
|
+-- .claude/
|
|
48
48
|
| +-- agents/ # agent definitions (50 files)
|
|
49
|
-
| +-- skills/ # skill modules (
|
|
49
|
+
| +-- skills/ # skill modules (120 SKILL.md files)
|
|
50
50
|
| +-- rules/ # global rules (22 files)
|
|
51
51
|
| +-- hooks/ # hook registry and scripts (40 scripts)
|
|
52
52
|
| +-- contexts/ # context files
|
|
@@ -65,7 +65,7 @@ The counts below should stay aligned with `templates/manifest.json`, README comp
|
|
|
65
65
|
|
|
66
66
|
Flat agent definition files. During Codex installation these land under `.codex/agents/`.
|
|
67
67
|
|
|
68
|
-
### Skills (
|
|
68
|
+
### Skills (120)
|
|
69
69
|
|
|
70
70
|
`templates/.claude/skills/*/SKILL.md`
|
|
71
71
|
|
|
@@ -4,7 +4,7 @@
|
|
|
4
4
|
|
|
5
5
|
| Pattern | Goal | Use When |
|
|
6
6
|
|---------|------|----------|
|
|
7
|
-
| `
|
|
7
|
+
| `roundtable-debate` | Preserve dissent while reaching a bounded decision | Release gates, design approval, high-risk specs with minority risks |
|
|
8
8
|
| `roundtable-debate` | Preserve dissent | Strategy choices, tradeoffs, ambiguous product or architecture decisions |
|
|
9
9
|
|
|
10
10
|
## Failure Modes
|
|
@@ -1,83 +1,13 @@
|
|
|
1
|
-
#
|
|
1
|
+
# External Interop Guidance
|
|
2
2
|
|
|
3
|
-
|
|
3
|
+
The packaged multi-provider exec skills have been retired. For Codex interoperability, use the official Claude Code plugin `openai/codex-plugin-cc` only when it is explicitly installed and requested.
|
|
4
4
|
|
|
5
|
-
|
|
5
|
+
## Current Paths
|
|
6
6
|
|
|
7
|
-
|
|
7
|
+
| Need | Preferred path | Notes |
|
|
8
|
+
|------|----------------|-------|
|
|
9
|
+
| Codex interop | `openai/codex-plugin-cc` | Official plugin path; opt-in only. |
|
|
10
|
+
| Token-optimized local command output | `rtk-exec` | Existing RTK proxy remains supported. |
|
|
11
|
+
| Research or independent review | `researcher`, expert agents, or `roundtable-debate` | Prefer in-repo agent workflows unless plugin interop is requested. |
|
|
8
12
|
|
|
9
|
-
|
|
10
|
-
|
|
11
|
-
| Provider | Skill | CLI Dependency | Model | Strengths |
|
|
12
|
-
|----------|-------|---------------|-------|-----------|
|
|
13
|
-
| OpenAI (Codex) | `codex-exec` | `codex` CLI | GPT-5.4 | Code generation, broad knowledge |
|
|
14
|
-
| Google (Gemini) | `gemini-exec` | `gemini` CLI | Gemini 2.5 Pro | Long context, multimodal |
|
|
15
|
-
| RTK (proxy) | `rtk-exec` | `rtk` CLI | Configurable | Token-optimized output, cost reduction |
|
|
16
|
-
|
|
17
|
-
## Availability Detection
|
|
18
|
-
|
|
19
|
-
The `session-env-check.sh` hook (SessionStart) auto-detects available providers:
|
|
20
|
-
|
|
21
|
-
```
|
|
22
|
-
[SessionStart] Checking external CLI availability...
|
|
23
|
-
codex: ✓ available
|
|
24
|
-
gemini: ✗ not found
|
|
25
|
-
rtk: ✓ available
|
|
26
|
-
```
|
|
27
|
-
|
|
28
|
-
Providers are opt-in — missing CLIs are silently skipped.
|
|
29
|
-
|
|
30
|
-
## Usage Patterns
|
|
31
|
-
|
|
32
|
-
### Direct Invocation
|
|
33
|
-
|
|
34
|
-
```
|
|
35
|
-
/codex-exec "Review this function for security issues"
|
|
36
|
-
/gemini-exec "Analyze this architecture diagram"
|
|
37
|
-
/rtk-exec "List files matching pattern X"
|
|
38
|
-
```
|
|
39
|
-
|
|
40
|
-
### Provider Selection Guide
|
|
41
|
-
|
|
42
|
-
| Task | Recommended Provider | Rationale |
|
|
43
|
-
|------|---------------------|-----------|
|
|
44
|
-
| Second opinion on code review | codex-exec | Independent model reduces confirmation bias |
|
|
45
|
-
| Long document analysis | gemini-exec | 1M+ context window |
|
|
46
|
-
| Token-heavy batch operations | rtk-exec | Compressed output reduces context cost |
|
|
47
|
-
| Security audit cross-check | codex-exec | Different training data catches different patterns |
|
|
48
|
-
| Multi-model verification | All three | `/multi-model-verification` skill orchestrates this |
|
|
49
|
-
|
|
50
|
-
### Integration with Existing Skills
|
|
51
|
-
|
|
52
|
-
| Skill | Uses Provider | How |
|
|
53
|
-
|-------|--------------|-----|
|
|
54
|
-
| `multi-model-verification` | codex-exec + gemini-exec | Parallel verification with severity classification |
|
|
55
|
-
| `reasoning-sandwich` | Any exec skill | Pre/post reasoning with different models |
|
|
56
|
-
| `model-escalation` | Claude models only | Internal escalation (haiku→sonnet→opus), not cross-provider |
|
|
57
|
-
|
|
58
|
-
## Relationship to Multi-Model Routing
|
|
59
|
-
|
|
60
|
-
| Aspect | Multi-Model Routing | Multi-Provider Exec |
|
|
61
|
-
|--------|--------------------|--------------------|
|
|
62
|
-
| Scope | Claude model selection | Cross-provider execution |
|
|
63
|
-
| Models | haiku / sonnet / opus | GPT-5.4 / Gemini 2.5 / RTK proxy |
|
|
64
|
-
| Mechanism | `model` frontmatter field | Exec skill invocation |
|
|
65
|
-
| Use case | Cost/quality optimization within Claude | Independent verification, specialized tasks |
|
|
66
|
-
| Guide | `guides/multi-model-routing/` | `guides/multi-provider-exec/` |
|
|
67
|
-
|
|
68
|
-
## Configuration
|
|
69
|
-
|
|
70
|
-
No global configuration required. Each exec skill reads its own CLI configuration:
|
|
71
|
-
|
|
72
|
-
| Skill | Config Source |
|
|
73
|
-
|-------|-------------|
|
|
74
|
-
| codex-exec | `~/.codex/config` or CODEX_API_KEY env |
|
|
75
|
-
| gemini-exec | `~/.gemini/config` or GEMINI_API_KEY env |
|
|
76
|
-
| rtk-exec | RTK proxy running on localhost |
|
|
77
|
-
|
|
78
|
-
## Limitations
|
|
79
|
-
|
|
80
|
-
- Provider availability depends on user's CLI installations
|
|
81
|
-
- Cross-provider results are advisory — Claude remains the primary execution engine
|
|
82
|
-
- No automatic fallback between providers (by design — explicit selection preferred)
|
|
83
|
-
- Rate limits and costs are provider-specific and not tracked by oh-my-customcodex
|
|
13
|
+
Do not auto-delegate to retired provider wrapper skills. Keep expert agents responsible for reviewing any plugin-assisted output.
|
package/templates/manifest.json
CHANGED
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
{
|
|
2
|
-
"version": "0.5.
|
|
2
|
+
"version": "0.5.9",
|
|
3
3
|
"requiresCC": ">=2.1.121",
|
|
4
4
|
"claudeCode": {
|
|
5
5
|
"minimumVersion": "2.1.121",
|
|
@@ -23,7 +23,7 @@
|
|
|
23
23
|
"name": "skills",
|
|
24
24
|
"path": ".agents/skills",
|
|
25
25
|
"description": "Reusable skill modules (project-scoped repo skills)",
|
|
26
|
-
"files":
|
|
26
|
+
"files": 120
|
|
27
27
|
},
|
|
28
28
|
{
|
|
29
29
|
"name": "guides",
|
|
@@ -1,209 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: omcustomcodex:agora
|
|
3
|
-
description: "Multi-LLM adversarial consensus loop — 3+ LLMs compete to find flaws in designs/specs until unanimous agreement is reached"
|
|
4
|
-
user-invocable: true
|
|
5
|
-
argument-hint: "<document-path> [--rounds N] [--severity-threshold HIGH]"
|
|
6
|
-
effort: max
|
|
7
|
-
scope: core
|
|
8
|
-
version: 1.0.0
|
|
9
|
-
source:
|
|
10
|
-
type: external
|
|
11
|
-
origin: github
|
|
12
|
-
url: https://github.com/baekenough/baekenough-skills
|
|
13
|
-
version: 1.0.0
|
|
14
|
-
---
|
|
15
|
-
|
|
16
|
-
# Agora: Multi-LLM Adversarial Consensus
|
|
17
|
-
|
|
18
|
-
3개 이상의 LLM(Claude, Codex/GPT, Gemini)이 경쟁적으로 설계/문서의 결함을 찾고, 만장일치 합의에 도달할 때까지 반복하는 적대적 교차 검증 스킬.
|
|
19
|
-
|
|
20
|
-
## Prerequisites
|
|
21
|
-
|
|
22
|
-
- `codex-exec` skill (Codex/GPT 호출)
|
|
23
|
-
- `gemini-exec` skill (Gemini 호출)
|
|
24
|
-
- Agent Teams enabled (`OMCODEX_AGENT_TEAMS=1`) or Agent tool available
|
|
25
|
-
|
|
26
|
-
## Sensitive-Path Delegation
|
|
27
|
-
|
|
28
|
-
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
29
|
-
|
|
30
|
-
## Usage
|
|
31
|
-
|
|
32
|
-
```
|
|
33
|
-
/omcustomcodex:agora docs/design.md # Default: 3 LLMs, unlimited rounds
|
|
34
|
-
/omcustomcodex:agora docs/design.md --rounds 10 # Max 10 rounds
|
|
35
|
-
/omcustomcodex:agora docs/design.md --severity-threshold HIGH # Exit when no HIGH+ findings
|
|
36
|
-
/omcustomcodex:agora docs/design.md --models claude,codex # 2 LLMs only
|
|
37
|
-
```
|
|
38
|
-
|
|
39
|
-
## Workflow
|
|
40
|
-
|
|
41
|
-
### Phase 1: Setup
|
|
42
|
-
1. Read the target document
|
|
43
|
-
2. Create Agent Team: `TeamCreate("agora-review")`
|
|
44
|
-
3. Create review tasks per focus area
|
|
45
|
-
|
|
46
|
-
### Phase 2: Spawn Reviewers (parallel)
|
|
47
|
-
Spawn 3 reviewers as Agent Team members:
|
|
48
|
-
|
|
49
|
-
```
|
|
50
|
-
|
|
51
|
-
### Anti-Groupthink Mode
|
|
52
|
-
|
|
53
|
-
Use `--anti-groupthink` when consensus itself is a risk:
|
|
54
|
-
|
|
55
|
-
1. Reviewers submit independent findings before seeing peer output.
|
|
56
|
-
2. One reviewer is assigned as devil's advocate.
|
|
57
|
-
3. Minority findings are preserved unless the synthesis explicitly rejects them with evidence.
|
|
58
|
-
4. Debate is capped at two challenge rounds before the lead either decides or requests more facts.
|
|
59
|
-
|
|
60
|
-
For decisions where dissent preservation is the main goal, use `roundtable-debate` directly instead of `agora`.
|
|
61
|
-
Agent(name: "claude-critic", model: opus, effort: max)
|
|
62
|
-
→ 20-point deep adversarial review
|
|
63
|
-
|
|
64
|
-
Agent(name: "codex-critic", model: opus)
|
|
65
|
-
→ Invoke Skill(codex-exec) for GPT perspective + independent Claude analysis
|
|
66
|
-
|
|
67
|
-
Agent(name: "gemini-critic", model: opus)
|
|
68
|
-
→ Invoke Skill(gemini-exec) for Gemini perspective + independent Claude analysis
|
|
69
|
-
```
|
|
70
|
-
|
|
71
|
-
### Phase 3: Independent Review
|
|
72
|
-
Each reviewer performs adversarial review with this template:
|
|
73
|
-
|
|
74
|
-
```
|
|
75
|
-
For EACH review point:
|
|
76
|
-
### Round N: [Topic]
|
|
77
|
-
**Severity**: CRITICAL / HIGH / MEDIUM / LOW
|
|
78
|
-
**Flaw**: [Specific, concrete problem description]
|
|
79
|
-
**Evidence**: [Why this is real, not theoretical]
|
|
80
|
-
**Impact**: [What happens if not addressed]
|
|
81
|
-
**Counter-argument**: [Best case FOR the current design]
|
|
82
|
-
**Verdict**: KEEP / MODIFY / REJECT
|
|
83
|
-
```
|
|
84
|
-
|
|
85
|
-
Review areas (adapt to document type):
|
|
86
|
-
- Architecture fundamentals
|
|
87
|
-
- Component/service design
|
|
88
|
-
- Data architecture
|
|
89
|
-
- Security & resilience
|
|
90
|
-
- Feasibility & deployment
|
|
91
|
-
- Testing strategy
|
|
92
|
-
- Operational complexity
|
|
93
|
-
|
|
94
|
-
### Phase 4: Cross-Review (Peer-to-Peer)
|
|
95
|
-
Each reviewer sends findings to the other two via `SendMessage`.
|
|
96
|
-
|
|
97
|
-
Counter-review template:
|
|
98
|
-
1. Which findings do you **AGREE** with? (and why)
|
|
99
|
-
2. Which findings do you **DISAGREE** with? (evidence-based rebuttal)
|
|
100
|
-
3. What did they **MISS** that you caught?
|
|
101
|
-
4. What did they catch that you **MISSED**?
|
|
102
|
-
5. **SEVERITY** adjustments — upgrade or downgrade with justification
|
|
103
|
-
|
|
104
|
-
### Phase 5: Synthesis
|
|
105
|
-
Team lead aggregates all findings:
|
|
106
|
-
|
|
107
|
-
```
|
|
108
|
-
UNANIMOUS CRITICAL: [findings all 3 agreed on]
|
|
109
|
-
STRONG AGREEMENT: [findings 2/3 agreed on]
|
|
110
|
-
SPLIT DECISIONS: [findings with disagreement + resolution]
|
|
111
|
-
```
|
|
112
|
-
|
|
113
|
-
Determine verdict:
|
|
114
|
-
- **BUILD**: No CRITICAL, no unresolved HIGH
|
|
115
|
-
- **BUILD WITH CHANGES**: No CRITICAL, HIGH findings have accepted mitigations
|
|
116
|
-
- **REDESIGN**: Any unresolved CRITICAL findings
|
|
117
|
-
- **ABANDON**: Fundamental concept is flawed
|
|
118
|
-
|
|
119
|
-
### Phase 6: Loop (if REDESIGN)
|
|
120
|
-
1. Team lead produces/delegates redesign addressing ALL critical findings
|
|
121
|
-
2. New version sent to ALL reviewers: `SendMessage(to: "*")`
|
|
122
|
-
3. Reviewers re-review → GOTO Phase 4
|
|
123
|
-
4. Repeat until EXIT criteria met
|
|
124
|
-
|
|
125
|
-
### Phase 7: Exit (consensus reached)
|
|
126
|
-
When ALL reviewers agree BUILD or BUILD WITH CHANGES:
|
|
127
|
-
1. Produce final consensus report
|
|
128
|
-
2. Write to `.codex/outputs/sessions/{date}/agora-{topic}-{time}.md`
|
|
129
|
-
3. Shut down team: `SendMessage(to: "*", message: {type: "shutdown_request"})`
|
|
130
|
-
|
|
131
|
-
## Reviewer Principles
|
|
132
|
-
|
|
133
|
-
1. **NEUTRAL** — no reviewer has home team advantage
|
|
134
|
-
2. **COMPETITIVE** — find flaws others missed
|
|
135
|
-
3. **CRITICAL** — "fewer than 5 CRITICAL flaws = not looking hard enough"
|
|
136
|
-
4. **EVIDENCE-BASED** — every finding cites specific evidence
|
|
137
|
-
5. **CONSTRUCTIVE** — every flaw includes recommended fix
|
|
138
|
-
6. **CONVERGENT** — goal is consensus, not endless disagreement
|
|
139
|
-
|
|
140
|
-
## Consensus Criteria
|
|
141
|
-
|
|
142
|
-
| Condition | Required |
|
|
143
|
-
|-----------|----------|
|
|
144
|
-
| CRITICAL findings resolved | ALL |
|
|
145
|
-
| HIGH findings resolved or accepted | ALL |
|
|
146
|
-
| All reviewers rate BUILD or BUILD WITH CHANGES | YES |
|
|
147
|
-
| Cross-review disagreements resolved | ALL |
|
|
148
|
-
|
|
149
|
-
## Output Format
|
|
150
|
-
|
|
151
|
-
```markdown
|
|
152
|
-
# Agora Consensus Report
|
|
153
|
-
|
|
154
|
-
## Document: [path]
|
|
155
|
-
## Rounds: [N]
|
|
156
|
-
## Reviewers: [list with LLM models used]
|
|
157
|
-
|
|
158
|
-
## Verdict: [BUILD / BUILD WITH CHANGES / REDESIGN]
|
|
159
|
-
|
|
160
|
-
## Unanimous Findings
|
|
161
|
-
| # | Finding | Severity | All 3 Agree |
|
|
162
|
-
|---|---------|----------|-------------|
|
|
163
|
-
|
|
164
|
-
## Required Changes Before Build
|
|
165
|
-
1. [change with source reviewer]
|
|
166
|
-
2. ...
|
|
167
|
-
|
|
168
|
-
## Accepted Risks
|
|
169
|
-
- [finding accepted with justification]
|
|
170
|
-
|
|
171
|
-
## Unique Contributions Per Reviewer
|
|
172
|
-
| Reviewer | Findings Others Missed |
|
|
173
|
-
|----------|----------------------|
|
|
174
|
-
|
|
175
|
-
## Process Metrics
|
|
176
|
-
- Rounds: N
|
|
177
|
-
- Total findings: N
|
|
178
|
-
- Cross-adopted: N
|
|
179
|
-
- Severity upgrades: N
|
|
180
|
-
- Severity downgrades: N
|
|
181
|
-
- Disagreements raised: N
|
|
182
|
-
- Disagreements resolved: N/N
|
|
183
|
-
```
|
|
184
|
-
|
|
185
|
-
## Configuration
|
|
186
|
-
|
|
187
|
-
```yaml
|
|
188
|
-
# Default settings
|
|
189
|
-
agora:
|
|
190
|
-
max_rounds: unlimited # Set --rounds to limit
|
|
191
|
-
severity_threshold: HIGH # EXIT when no findings >= threshold
|
|
192
|
-
models:
|
|
193
|
-
- claude (opus, max effort)
|
|
194
|
-
- codex (via codex-exec skill)
|
|
195
|
-
- gemini (via gemini-exec skill)
|
|
196
|
-
review_points: 20 # Per reviewer
|
|
197
|
-
cross_review: true # Peer-to-peer sharing
|
|
198
|
-
auto_redesign: true # Auto-produce redesign on REDESIGN verdict
|
|
199
|
-
```
|
|
200
|
-
|
|
201
|
-
## Anti-Patterns
|
|
202
|
-
|
|
203
|
-
| Anti-Pattern | Why Wrong | Correct |
|
|
204
|
-
|-------------|-----------|---------|
|
|
205
|
-
| Single LLM review | Misses blind spots | 3+ LLMs find complementary flaws |
|
|
206
|
-
| No cross-review | Reviewers don't challenge each other | Peer-to-peer sharing surfaces disagreements |
|
|
207
|
-
| Accepting first BUILD | May miss edge cases | Loop until ALL agree |
|
|
208
|
-
| Ignoring split decisions | Unresolved disagreements fester | Resolve every split with evidence |
|
|
209
|
-
| Push for consensus too fast | Premature agreement | Let reviewers challenge freely |
|