agestra 4.15.0 → 4.15.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -12,7 +12,7 @@
12
12
  "name": "agestra",
13
13
  "source": "./",
14
14
  "description": "Multi-host MCP orchestration across Claude, Ollama, Gemini, and Codex for review, QA, and cross-validation",
15
- "version": "4.15.0",
15
+ "version": "4.15.1",
16
16
  "author": {
17
17
  "name": "mua-vtuber"
18
18
  },
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agestra",
3
- "version": "4.15.0",
3
+ "version": "4.15.1",
4
4
  "description": "Claude Code plugin — multi-host MCP orchestration across Claude, Ollama, Gemini, and Codex for review, QA, and cross-validation",
5
5
  "mcpServers": {
6
6
  "agestra": {
@@ -9,17 +9,5 @@
9
9
  "${CLAUDE_PLUGIN_ROOT}/dist/bundle.js"
10
10
  ]
11
11
  }
12
- },
13
- "hooks": {
14
- "UserPromptSubmit": [
15
- {
16
- "hooks": [
17
- {
18
- "type": "command",
19
- "command": "node ${CLAUDE_PLUGIN_ROOT}/hooks/user-prompt-submit.js"
20
- }
21
- ]
22
- }
23
- ]
24
12
  }
25
13
  }
@@ -1,16 +1,15 @@
1
1
  # Generated by Agestra. Managed file.
2
- description = "Run research using a selected investigation topology"
2
+ description = "Run host-owned Agestra research with an idea, QA, or security viewpoint"
3
3
  prompt = """
4
4
  You are executing the `/agestra research` Gemini command.
5
5
 
6
- - Start with `setup_status`, then `environment_check` and `provider_list`.
7
- - For investigation-including workflows that continue into workflow consensus, route through `agent_research_start`, then start debate separately with `agent_consensus_start`.
8
- - Host research/debate contract uses workflow profiles, `aggregation`, `questionSet`, and `evidencePolicy`:
6
+ - Start with `setup_status`; provider availability is not a gate for research.
7
+ - Ask for the research viewpoint when missing: Idea exploration, QA evidence set, or Security evidence set.
8
+ - Use host-owned `agestra-research` assignments. Do not ask for a research topology or provider investigation mode.
9
+ - Host research contract uses workflow profiles, `aggregation`, `questionSet`, and `evidencePolicy`:
9
10
  호스트가 조사한다.
10
11
  호스트가 정리한다.
11
- 시스템이 토론한다.
12
12
  호스트가 문서화한다.
13
- - External AI research and debate run in separate fresh sessions, even when the same provider participates in both phases.
14
13
 
15
14
  @{commands/research.md}
16
15
  """
package/AGENTS.md CHANGED
@@ -30,18 +30,17 @@ Use `host_assets_status` to inspect generated Codex host assets, and only call `
30
30
 
31
31
  ## Workflow Mapping
32
32
 
33
+ - Public slash commands are limited to setup, research, and review.
34
+ - Setup and research requests follow `commands/setup.md` and `commands/research.md`.
33
35
  - When Agestra is active, review requests follow `commands/review.md`
34
- - When Agestra is active, QA / verification requests follow `commands/qa.md`
35
- - When Agestra is active, security audit requests follow `commands/security.md`
36
+ - Internal QA, security, design, idea, and planning viewpoints remain available through workflow profiles and skill/lens resources; they are not shipped as public slash commands.
36
37
  - Review, QA, and security workflows write durable reports under `docs/reports/review/`, `docs/reports/qa/`, and `docs/reports/security/` unless the user asks for chat-only output.
37
- - When Agestra is active, design and architecture requests follow `commands/design.md`
38
- - When Agestra is active, idea discovery requests follow `commands/idea.md`
39
38
 
40
39
  ## Core MCP Tools
41
40
 
42
41
  - `setup_status`, `environment_check`, and `provider_list`: inspect installation, host, and provider state for Agestra health checks and active Agestra workflows
43
42
  - `agent_research_start`: research-only preprocessing with workflow profile, prompt pack, questionSet, evidencePolicy, research lenses, and investigator assignments; writes `research_submissions.json`, `research_transcript.json`, and `aggregation.json`; does not start debate
44
- - `agent_consensus_start` (with `agent_debate_approve`/`_continue`/`_reject`) and `agent_debate_review`: debate-only approval-gated consensus flows from prepared `aggregation`, supplied `questionSet`, and `evidencePolicy`; `workflow` is a report/artifact label only, not a debate routing branch
43
+ - `agent_consensus_start` (with `agent_debate_approve`/`_continue`/`_reject`): debate-only approval-gated consensus flows from prepared `aggregation`, supplied `questionSet`, and `evidencePolicy`; `workflow` is a report/artifact label only, not a debate routing branch
45
44
  - `host_assets_status`, `host_assets_install`, `host_assets_uninstall`: inspect and explicitly manage generated Codex host-native assets such as custom agents and skills
46
45
  - `qa_run`: run workspace build/test verification for QA evidence
47
46
 
@@ -50,3 +49,16 @@ Use `host_assets_status` to inspect generated Codex host assets, and only call `
50
49
  - `agents/`: canonical role prompts (`agestra-team-lead`, `agestra-research`, `agestra-debate`)
51
50
  - `skills/`: reusable workflow references
52
51
  - `GEMINI.md` and `.gemini/commands/`: Gemini-specific host assets; keep behavior aligned with them when updating shared workflows
52
+
53
+ ## graphify
54
+
55
+ This project has a knowledge graph at graphify-out/ with god nodes, community structure, and cross-file relationships.
56
+
57
+ When the user types `/graphify`, invoke the `skill` tool with `skill: "graphify"` before doing anything else.
58
+
59
+ Rules:
60
+ - For codebase questions, first run `graphify query "<question>"` when graphify-out/graph.json exists. Use `graphify path "<A>" "<B>"` for relationships and `graphify explain "<concept>"` for focused concepts. These return a scoped subgraph, usually much smaller than GRAPH_REPORT.md or raw grep output.
61
+ - Dirty graphify-out/ files are expected after hooks or incremental updates; dirty graph files are not a reason to skip graphify. Only skip graphify if the task is about stale or incorrect graph output, or the user explicitly says not to use it.
62
+ - If graphify-out/wiki/index.md exists, use it for broad navigation instead of raw source browsing.
63
+ - Read graphify-out/GRAPH_REPORT.md only for broad architecture review or when query/path/explain do not surface enough context.
64
+ - After modifying code, run `graphify update .` to keep the graph current (AST-only, no API cost).
package/GEMINI.md CHANGED
@@ -50,7 +50,7 @@ QA/review/security on the result.
50
50
  profile, prompt pack, `questionSet`, `evidencePolicy`, research lenses, and
51
51
  investigator assignments; writes `research_submissions.json`,
52
52
  `research_transcript.json`, and `aggregation.json`; does not start debate
53
- - debate-only `agent_consensus_start`, `agent_debate_approve`/`_continue`/`_reject`, `agent_debate_review`: sessions from prepared `aggregation`, supplied `questionSet`, `evidencePolicy`, and approval-gated debate artifacts
53
+ - debate-only `agent_consensus_start`, `agent_debate_approve`/`_continue`/`_reject`: sessions from prepared `aggregation`, supplied `questionSet`, `evidencePolicy`, and approval-gated debate artifacts
54
54
  - `workspace_*`: document-backed review and aggregation flows
55
55
  - `qa_run`: workspace build/test verification for QA evidence
56
56
 
package/README.ja.md CHANGED
@@ -21,29 +21,27 @@ Agestra は、1 つの問題を複数の AI 視点で検討し、整理するた
21
21
 
22
22
  インストール後、プロジェクトを開いて Agestra ワークフローを呼び出します。
23
23
 
24
- - Claude Code: `/agestra review`, `/agestra qa`, `/agestra security`, `/agestra design`, `/agestra idea`
25
- - Gemini CLI: `/agestra:review`, `/agestra:qa`, `/agestra:security`, `/agestra:design`, `/agestra:idea`
24
+ - Claude Code: `/agestra research ...` または `/agestra review ...`
25
+ - Gemini CLI: `/agestra:research ...` または `/agestra:review ...`
26
26
  - Codex CLI: `Use Agestra with Gemini and Codex to review this branch.` のように、Agestra や複数 AI を明示して依頼
27
27
 
28
28
  初回は使う provider を聞かれることがあります。provider が 1 つだけでもセットアップやホスト所有の作業はできますが、複数 AI 比較は 2 つ以上あるとより有効です。
29
29
 
30
30
  ## 何に使うか
31
31
 
32
- - `review`: コード品質、回帰リスク、UX、整理ポイントを複数 AI の視点で比較
33
- - `qa`: 設計書や計画を基準に実装を検証し、PASS/FAIL の根拠を集める
34
- - `security`: セキュリティ観点に絞って確認する
35
- - `design`: 実装前に構造やトレードオフを整理する
36
- - `idea`: 改善案、代替案、類似ツールを探る
32
+ - `research`: アイデア、QA、セキュリティの質問に必要な根拠を現在のホストだけで調査し整理します。この流れでは外部 provider は調査しません。
33
+ - `review`: 既存のコード、ドキュメント、diff、または準備済みのリサーチ結果をもとに討論し、意見を比較します。レビューは新しい調査を始めません。
34
+ - レビュー観点には、コード品質、回帰リスク、UX、整理、設計適合性、性能、信頼性、テスト、安全性のにおい、リリース準備状況を含められます。
37
35
 
38
36
  ## 実行すると何が起こるか
39
37
 
40
38
  1. Agestra が設定と利用可能な provider を確認します。
41
39
  2. 依頼を対象とスコープが明確なワークフローに整理します。
42
- 3. 調査が必要なら、ホストが先に証拠を集めて整理します。
43
- 4. 選ばれた provider は残っている論点だけをレビューまたは討論します。
40
+ 3. `research` では、現在のホストが根拠を調査し、整理し、文書化します。provider fan-out はありません。
41
+ 4. `review` では、選ばれた provider がスコープ内のコード、ドキュメント、diff、または準備済みのリサーチ結果について討論します。別途調査はしません。
44
42
  5. 結論、意見の違い、根拠を 1 つの結果として返します。
45
43
 
46
- 普通のレビューや QA の依頼が自動で Agestra になるわけではありません。`/agestra ...` を使うか、複数 AI や provider-backed のレビュー、QA、セキュリティ、設計、アイデア作業を明示したときに Agestra が動きます。
44
+ 普通のレビューや QA の依頼が自動で Agestra になるわけではありません。`/agestra ...` を使うか、複数 AI や provider-backed のリサーチ/レビュー作業を明示したときに Agestra が動きます。
47
45
 
48
46
  コード変更は、まず現在のホストで直接行うのが基本です。Agestra はその後で結果をレビューし、計画との一致を確認し、複数 provider の意見と根拠を記録するところで最も力を発揮します。
49
47
 
package/README.ko.md CHANGED
@@ -21,29 +21,27 @@ Agestra는 하나의 문제를 여러 AI 시각으로 검토하고 정리해 주
21
21
 
22
22
  설치 후 프로젝트를 열고 Agestra 워크플로우를 요청하면 됩니다.
23
23
 
24
- - Claude Code: `/agestra review`, `/agestra qa`, `/agestra security`, `/agestra design`, `/agestra idea`
25
- - Gemini CLI: `/agestra:review`, `/agestra:qa`, `/agestra:security`, `/agestra:design`, `/agestra:idea`
24
+ - Claude Code: `/agestra research ...` 또는 `/agestra review ...`
25
+ - Gemini CLI: `/agestra:research ...` 또는 `/agestra:review ...`
26
26
  - Codex CLI: `Agestra로 Gemini와 Codex를 같이 써서 이 브랜치 리뷰해줘`처럼 Agestra나 여러 AI를 명시해서 요청
27
27
 
28
28
  첫 실행에서는 사용할 provider를 물어볼 수 있습니다. provider가 하나만 있어도 설정과 호스트 소유 작업은 가능하지만, 멀티 AI 비교는 둘 이상일 때 가장 잘 살아납니다.
29
29
 
30
30
  ## 무엇에 쓰나
31
31
 
32
- - `review`: 코드 품질, 회귀 위험, UX, 정리 포인트를 여러 AI 의견으로 비교
33
- - `qa`: 설계 문서나 계획 기준으로 구현을 검증하고 PASS/FAIL 근거 수집
34
- - `security`: 보안 관점만 따로 집중해서 검토
35
- - `design`: 구현 전에 구조와 트레이드오프 논의
36
- - `idea`: 개선 아이디어, 대안, 유사 도구 탐색
32
+ - `research`: 아이디어, QA, 보안 질문에 필요한 근거를 현재 호스트만 조사하고 정리합니다. 외부 provider는 이 흐름에서 조사하지 않습니다.
33
+ - `review`: 이미 있는 코드, 문서, diff, 또는 준비된 리서치 결과를 두고 토론하고 의견을 비교합니다. 리뷰는 새 조사를 시작하지 않습니다.
34
+ - 리뷰 관점은 코드 품질, 회귀 위험, UX, 정리, 설계 적합성, 성능, 안정성, 테스트, 기본 안전 냄새, 배포 준비도를 다룰 수 있습니다.
37
35
 
38
36
  ## 실행하면 어떻게 되나
39
37
 
40
38
  1. Agestra가 설정과 사용 가능한 provider를 확인합니다.
41
39
  2. 요청을 대상과 범위가 분명한 워크플로우로 정리합니다.
42
- 3. 조사가 필요하면 호스트가 먼저 근거를 모으고 정리합니다.
43
- 4. 선택된 provider들이 남은 쟁점만 검토하거나 토론합니다.
40
+ 3. `research`에서는 현재 호스트가 근거를 조사하고 정리하고 문서화합니다. provider fan-out은 없습니다.
41
+ 4. `review`에서는 선택된 provider들이 범위 안의 코드, 문서, diff, 또는 준비된 리서치 결과를 놓고 토론합니다. 별도 조사는 하지 않습니다.
44
42
  5. 결론, 이견, 근거를 하나의 결과로 돌려줍니다.
45
43
 
46
- 평범한 리뷰나 QA 요청이 자동으로 Agestra가 되는 것은 아닙니다. `/agestra ...`를 쓰거나, 여러 AI나 provider-backed 리뷰/QA/보안/설계/아이디어 작업을 명시했을 때 Agestra 워크플로우가 시작됩니다.
44
+ 평범한 리뷰나 QA 요청이 자동으로 Agestra가 되는 것은 아닙니다. `/agestra ...`를 쓰거나, 여러 AI나 provider-backed 리서치/리뷰 작업을 명시했을 때 Agestra 워크플로우가 시작됩니다.
47
45
 
48
46
  코드 변경은 먼저 현재 호스트에서 직접 진행하는 편이 좋습니다. Agestra는 그 다음 결과를 리뷰하고, 계획과 맞는지 검증하고, 여러 provider 의견과 근거를 기록할 때 가장 강합니다.
49
47
 
package/README.md CHANGED
@@ -21,29 +21,27 @@ Install Agestra in the host you already use.
21
21
 
22
22
  Then open your project and ask for an Agestra workflow.
23
23
 
24
- - Claude Code: `/agestra review`, `/agestra qa`, `/agestra security`, `/agestra design`, `/agestra idea`
25
- - Gemini CLI: `/agestra:review`, `/agestra:qa`, `/agestra:security`, `/agestra:design`, `/agestra:idea`
24
+ - Claude Code: `/agestra research ...` or `/agestra review ...`
25
+ - Gemini CLI: `/agestra:research ...` or `/agestra:review ...`
26
26
  - Codex CLI: ask explicitly for Agestra or multiple providers, for example `Use Agestra with Gemini and Codex to review this branch.`
27
27
 
28
28
  The first workflow may ask which providers you want to use. Agestra works best with two or more providers, but setup and host-owned flows still work with one.
29
29
 
30
30
  ## What To Use It For
31
31
 
32
- - `review`: compare multiple AI opinions about code quality, regressions, UX, and cleanup
33
- - `qa`: verify implementation against a design or plan and collect PASS/FAIL evidence
34
- - `security`: run a dedicated security-focused review
35
- - `design`: discuss architecture and tradeoffs before coding
36
- - `idea`: explore improvements, alternatives, and similar tools
32
+ - `research`: host-only evidence gathering for idea, QA, or security questions. External providers do not investigate in this flow.
33
+ - `review`: debate and compare opinions about existing code, docs, diffs, or prepared research. Review does not start a fresh investigation.
34
+ - Review lenses can cover code quality, regressions, UX, cleanup, design fit, performance, reliability, tests, safety smells, and production readiness.
37
35
 
38
36
  ## How It Runs
39
37
 
40
38
  1. Agestra checks setup and available providers.
41
39
  2. It turns your request into a clear workflow with a target and scope.
42
- 3. When research is needed, the host gathers and organizes the evidence first.
43
- 4. Selected providers review or debate only the unresolved points.
40
+ 3. In `research`, the current host gathers, organizes, and documents evidence. There is no provider fan-out.
41
+ 4. In `review`, selected providers discuss the code, documents, diffs, or prepared research already in scope. They do not perform separate research.
44
42
  5. Agestra returns one result with conclusions, disagreements, and evidence.
45
43
 
46
- Plain review or QA requests do not automatically become Agestra workflows. Agestra starts when you use `/agestra ...` or explicitly ask for multi-AI or provider-backed review, QA, security, design, or idea work.
44
+ Plain review or QA requests do not automatically become Agestra workflows. Agestra starts when you use `/agestra ...` or explicitly ask for multi-AI or provider-backed research/review work.
47
45
 
48
46
  For code changes, use your current host directly first. Agestra is strongest after that: reviewing the result, checking it against a plan, comparing provider opinions, and recording the evidence.
49
47
 
package/README.zh.md CHANGED
@@ -21,29 +21,27 @@ Agestra 用来让多个 AI 从不同角度审视同一个问题,并把结果
21
21
 
22
22
  安装后,打开项目并发起 Agestra 工作流。
23
23
 
24
- - Claude Code: `/agestra review`, `/agestra qa`, `/agestra security`, `/agestra design`, `/agestra idea`
25
- - Gemini CLI: `/agestra:review`, `/agestra:qa`, `/agestra:security`, `/agestra:design`, `/agestra:idea`
24
+ - Claude Code: `/agestra research ...` `/agestra review ...`
25
+ - Gemini CLI: `/agestra:research ...` `/agestra:review ...`
26
26
  - Codex CLI: 像 `Use Agestra with Gemini and Codex to review this branch.` 这样明确提到 Agestra 或多个 AI
27
27
 
28
28
  第一次运行时,它可能会询问你要启用哪些 provider。只有一个 provider 也能完成设置和宿主自有流程,但 Multi-AI 比较在两个以上 provider 时效果最好。
29
29
 
30
30
  ## 用它做什么
31
31
 
32
- - `review`: 比较多个 AI 对代码质量、回归风险、UX 和整理点的看法
33
- - `qa`: 按设计文档或计划验证实现,并收集 PASS/FAIL 证据
34
- - `security`: 专门做安全视角的检查
35
- - `design`: 在写代码前讨论结构和取舍
36
- - `idea`: 探索改进方向、备选方案和相似工具
32
+ - `research`: 由当前宿主单独收集并整理 idea、QA 或安全问题所需的证据。外部 provider 不参与这个调查流程。
33
+ - `review`: 围绕已有代码、文档、diff 或已准备好的 research 结果进行讨论并比较意见。review 不会启动新的调查。
34
+ - review 视角可以覆盖代码质量、回归风险、UX、整理点、设计契合度、性能、可靠性、测试、安全异味和上线准备度。
37
35
 
38
36
  ## 运行时会发生什么
39
37
 
40
38
  1. Agestra 检查设置和可用 provider。
41
39
  2. 它把请求整理成目标和范围明确的工作流。
42
- 3. 如果需要调查,宿主先收集并整理证据。
43
- 4. 被选中的 provider 只讨论或审查剩下的未解决问题。
40
+ 3. 在 `research` 中,当前宿主负责调查、整理并文档化证据。没有 provider fan-out。
41
+ 4. `review` 中,被选中的 provider 讨论范围内的代码、文档、diff 或已准备好的 research 结果。它们不会另行调查。
44
42
  5. Agestra 返回一份包含结论、分歧和证据的结果。
45
43
 
46
- 普通的 review 或 QA 请求不会自动变成 Agestra 工作流。只有当你使用 `/agestra ...`,或者明确要求多 AI / provider-backed 的 review、QA、安全、设计或 idea 工作时,Agestra 才会启动。
44
+ 普通的 review 或 QA 请求不会自动变成 Agestra 工作流。只有当你使用 `/agestra ...`,或者明确要求多 AI / provider-backed 的 research/review 工作时,Agestra 才会启动。
47
45
 
48
46
  代码修改应优先由当前宿主直接完成。Agestra 最适合在修改之后审查结果、按计划验证、比较多个 provider 的意见,并记录证据。
49
47
 
@@ -3,7 +3,7 @@ name: agestra-debate
3
3
  description: |
4
4
  Host-native debate participant for Agestra consensus rounds. Reads the assigned
5
5
  workflow profile/lens context, answers a pending host turn by the supplied
6
- question set, and returns the required consensus JSON. It is not the
6
+ question set, and returns the required [ITEM] markup. It is not the
7
7
  moderator, not the team lead, not a reviewer/QA/security specialist identity,
8
8
  and does not choose participants or run rounds.
9
9
 
@@ -18,7 +18,7 @@ tools: Read, Glob, Grep, Bash
18
18
  <Role>
19
19
  You are the host-native debate participant for Agestra. You receive one pending
20
20
  consensus turn, inspect only the supplied packet/files/lens references, and
21
- return the required JSON answer for that turn.
21
+ return the required [ITEM] answer for that turn.
22
22
 
23
23
  You are not the consensus engine, moderator, team lead, reviewer, QA judge,
24
24
  security auditor, or code-change executor.
@@ -52,47 +52,40 @@ When a lens reference is provided, read only the needed file under
52
52
  `skills/references/lenses/`.
53
53
 
54
54
  Do not load every lens by default. The lens narrows the question; it does not
55
- override the pending turn packet or JSON contract.
55
+ override the pending turn packet or [ITEM] contract.
56
56
  </Lens_Policy>
57
57
 
58
58
  <Output_Contract>
59
- Return JSON only. Do not include prose, Markdown, XML tags, or explanations
60
- outside the JSON object.
59
+ Return [ITEM] blocks only. Do not include prose, Markdown, JSON, XML tags, or
60
+ explanations outside the item blocks.
61
61
 
62
62
  Consensus turn shape:
63
63
 
64
- ```json
65
- {
66
- "provider": "<pending participant id>",
67
- "round": 1,
68
- "items": [
69
- {
70
- "id": "<assigned item id>",
71
- "questionResults": {
72
- "<verdictField from questionSet>": {
73
- "verdict": "<allowed verdict from questionSet>",
74
- "reason": "short evidence-based reason",
75
- "stanceEvidenceType": "empirical",
76
- "evidenceRefs": ["file:line, artifact path, or item evidence ref"]
77
- }
78
- },
79
- "finalStatus": "<allowed final status from questionSet>",
80
- "adjustedRemedy": "optional remedy adjustment when allowed by the packet"
81
- }
82
- ]
83
- }
64
+ ```text
65
+ [ITEM]
66
+ id: <assigned item id>
67
+ stance: agree
68
+ responds_to: <assigned item id>
69
+ evidence: file:line, artifact path, or item evidence ref
70
+ stanceEvidenceType: empirical
71
+ question: <questionId> | <verdictField> | <allowed verdict> | short evidence-based rationale
72
+ finalStatus: <allowed final status from questionSet>
73
+ adjustedRemedy: optional remedy adjustment when allowed by the packet
74
+ text:
75
+ Concise participant response text.
76
+ [/ITEM]
84
77
  ```
85
78
 
86
79
  Rules:
87
- - `provider` must exactly match the pending participant id.
88
- - `round` must exactly match the pending round.
80
+ - The mailbox turn id, participant id, and round are supplied by the packet and
81
+ submission tool call; do not invent different values.
89
82
  - Answer every assigned item exactly once.
90
83
  - Answer every required question in the supplied `questionSet`.
91
84
  - Use only verdict values allowed by the supplied `questionSet`.
92
85
  - Include stance evidence type and evidence refs for each question answer.
93
86
  - Treat `workflow` as artifact context only; do not infer hidden QA, review,
94
87
  security, design, idea, or planning rules.
95
- - Do not create new top-level fields unless the engine contract explicitly allows them.
88
+ - Do not return debate JSON.
96
89
  </Output_Contract>
97
90
 
98
91
  <Boundaries>
@@ -9,10 +9,10 @@ description: |
9
9
 
10
10
  Do not invoke this agent directly for raw user messages, explicit `/agestra`
11
11
  commands, or natural-language Agestra / multi-AI / provider requests. Those
12
- requests must enter through `agestra-leader` or the selected workflow
13
- skill/command first so workflow profiles, questionSets, mode gates, trust
14
- gates, QA depth gates, and research-topology gates can run before team-lead
15
- execution.
12
+ requests must enter through one of the public workflow commands or installed
13
+ workflow skills (`/agestra setup`, `/agestra research`, `/agestra review`)
14
+ first so workflow profiles, questionSets, mode gates, trust gates, and
15
+ evidence-depth gates can run before team-lead execution.
16
16
 
17
17
  Plain review/QA/check requests without `/agestra` or explicit multi-AI/provider
18
18
  wording stay with the current host; they are not Agestra natural-language
@@ -20,7 +20,7 @@ description: |
20
20
  model: sonnet
21
21
  color: magenta
22
22
  codexSandboxMode: read-only
23
- tools: Read, Glob, Grep, Bash, WebFetch, WebSearch, TodoWrite, AskUserQuestion, Skill, ToolSearch, CronCreate, CronList, CronDelete, Agent, mcp__plugin_agestra_agestra__environment_check, mcp__plugin_agestra_agestra__provider_list, mcp__plugin_agestra_agestra__provider_health, mcp__plugin_agestra_agestra__provider_readiness, mcp__plugin_agestra_agestra__provider_trust_apply, mcp__plugin_agestra_agestra__run_observable_events, mcp__plugin_agestra_agestra__trace_query, mcp__plugin_agestra_agestra__trace_summary, mcp__plugin_agestra_agestra__trace_visualize, mcp__plugin_agestra_agestra__ai_chat, mcp__plugin_agestra_agestra__ai_analyze_files, mcp__plugin_agestra_agestra__ai_compare, mcp__plugin_agestra_agestra__agent_research_start, mcp__plugin_agestra_agestra__agent_consensus_start, mcp__plugin_agestra_agestra__agent_debate_status, mcp__plugin_agestra_agestra__agent_consensus_submit_turn, mcp__plugin_agestra_agestra__agent_debate_approve, mcp__plugin_agestra_agestra__agent_debate_continue, mcp__plugin_agestra_agestra__agent_debate_reject, mcp__plugin_agestra_agestra__agent_cross_validate, mcp__plugin_agestra_agestra__workspace_create_document, mcp__plugin_agestra_agestra__workspace_read, mcp__plugin_agestra_agestra__workspace_list
23
+ tools: Read, Glob, Grep, Bash, WebFetch, WebSearch, TodoWrite, AskUserQuestion, Skill, ToolSearch, CronCreate, CronList, CronDelete, Agent, mcp__plugin_agestra_agestra__environment_check, mcp__plugin_agestra_agestra__provider_list, mcp__plugin_agestra_agestra__provider_health, mcp__plugin_agestra_agestra__provider_readiness, mcp__plugin_agestra_agestra__provider_trust_apply, mcp__plugin_agestra_agestra__run_observable_events, mcp__plugin_agestra_agestra__trace_query, mcp__plugin_agestra_agestra__trace_summary, mcp__plugin_agestra_agestra__trace_visualize, mcp__plugin_agestra_agestra__ai_chat, mcp__plugin_agestra_agestra__ai_analyze_files, mcp__plugin_agestra_agestra__ai_compare, mcp__plugin_agestra_agestra__agent_research_start, mcp__plugin_agestra_agestra__agent_consensus_start, mcp__plugin_agestra_agestra__agent_debate_status, mcp__plugin_agestra_agestra__agent_consensus_submit_turn, mcp__plugin_agestra_agestra__agent_consensus_next_turn, mcp__plugin_agestra_agestra__agent_consensus_claim_turn, mcp__plugin_agestra_agestra__agent_consensus_heartbeat, mcp__plugin_agestra_agestra__agent_consensus_turn_status, mcp__plugin_agestra_agestra__agent_debate_approve, mcp__plugin_agestra_agestra__agent_debate_continue, mcp__plugin_agestra_agestra__agent_debate_reject, mcp__plugin_agestra_agestra__agent_cross_validate, mcp__plugin_agestra_agestra__workspace_create_document, mcp__plugin_agestra_agestra__workspace_read, mcp__plugin_agestra_agestra__workspace_list
24
24
  ---
25
25
 
26
26
  <Role>
@@ -36,12 +36,16 @@ created a self-contained handoff packet. Plain review/QA/check requests without
36
36
  Hard entry gate: if you are invoked directly from a raw user request and the
37
37
  message does not include a handoff packet with workflow, mode, target/scope,
38
38
  provider context, and the relevant workflow gates, do not run setup checks,
39
- provider checks, consensus, or fan-out. Route back through `agestra-leader` or
40
- the selected workflow skill/command. When the workflow classification is clear,
41
- use the workflow skill directly; for example, memory leak/performance inspection
42
- belongs to the review workflow. If the host exposes the Skill tool, invoke that skill; otherwise
43
- tell the caller to restart through the router. Do not silently fill the missing
44
- mode or research-topology choice yourself.
39
+ provider checks, consensus, or fan-out. Route back through the selected public
40
+ workflow command or installed workflow skill. When the workflow classification
41
+ is clear, use the workflow skill directly; for example, memory
42
+ leak/performance inspection belongs to the review workflow. If the host exposes
43
+ the Skill tool, invoke that skill; otherwise tell the caller to restart through
44
+ `/agestra setup`, `/agestra research`, or `/agestra review`. Do not silently
45
+ fill the missing mode yourself. Public `/agestra` handoffs are limited to setup,
46
+ research, and review. QA, security, design, idea, and planning concerns arrive
47
+ as workflow profiles, lenses, or rubrics inside research/review packets, not as
48
+ public command variants.
45
49
 
46
50
  Plain review/QA/check requests without `/agestra` or explicit multi-AI/provider wording stay with the current host.
47
51
  Natural-language Agestra routing examples must include explicit Agestra/multi-AI/provider wording: Agestra, 아제스트라, multiple AIs, all AIs, other AI, multi-AI, Codex and Gemini, provider comparison, 프로바이더 비교.
@@ -106,7 +110,7 @@ Default routing order for host-owned research and debate:
106
110
  participant such as `host-debate` with `participant_routes` pointing to
107
111
  `agestra-debate`.
108
112
  4. Use external providers only as independent challengers, reviewers, or
109
- Council/Provider-seeded participants selected by the user or topology.
113
+ consensus participants after host-owned evidence has been prepared.
110
114
 
111
115
  Never treat `claude-host` sampling failure as a reason to fall back to
112
116
  `claude-cli` for the host role. Likewise, do not map the current Codex or Gemini
@@ -179,36 +183,17 @@ Workflow skills provide the workflow profile and questionSet output. Do not
179
183
  repeat the full workflow intake when the handoff packet already contains target,
180
184
  scope, depth/lens, constraints, and report expectations.
181
185
 
182
- For provider-backed idea, design, review, security, and explicit research work,
183
- honor the handoff's `research_topology` / `조사 방식`. Use canonical topology
184
- values in MCP calls: `host-seeded`, `council`, or `provider-seeded`
185
- (`host-led` may appear only as a legacy/user-facing alias for `host-seeded`).
186
-
187
- - `host-seeded`: Host-native first. The current host and host-native
188
- `agestra-research` prepare the first evidence/aggregation before external
189
- provider fan-out; external providers primarily challenge, revise, and debate
190
- prepared items.
191
- - `council`: host-native researchers and external providers receive independent
192
- investigation assignments before consolidation. Before fan-out, create or
193
- confirm a bounded assignment table when the handoff does not already include
194
- approved rows.
195
- - `provider-seeded`: one configured provider creates the first seed/evidence
196
- artifact; host-native and other provider participants independently challenge
197
- it. If the seed provider is missing or unavailable, ask once for a replacement
198
- or fall back to `host-seeded` when asking is blocked.
199
- - `automatic`: choose the lightest topology that preserves quality. Prefer
200
- Host-native first (`host-seeded`) for bounded/scoped work, `council` for
201
- broad/open-ended discovery, and `provider-seeded` only when the user named a
202
- seed provider or explicitly asked a provider to lead the investigation.
203
-
204
- If provider-backed work needs a research topology but the handoff omitted it,
205
- the team-lead MUST stop and run a mandatory design selection gate before any
206
- provider fan-out. The three 조사 방식 produce different artifact contracts and
207
- participant routes, so host-level no-questions directives, "keep going" wording,
208
- or short user prompts DO NOT authorize a silent default. Always surface the
209
- three options (Council Research / Host-native first / Provider-seeded Research)
210
- through `AskUserQuestion` (or the host equivalent), each with a one-line
211
- description, and wait for the user's explicit choice before continuing.
186
+ For renewed public `/agestra research` and `/agestra review` handoffs, research
187
+ is host-owned by default. Create focused host-native `agestra-research`
188
+ assignments from the selected viewpoint, lenses, and research notes, then
189
+ continue. Do not ask the user to choose investigation routes or provider modes.
190
+
191
+ External providers are debate challengers, reviewers, or consensus participants
192
+ after host research has been prepared. They are not research investigators for
193
+ renewed public research/review unless an explicit internal handoff outside these
194
+ public commands sets `allow_external_research_investigators: true` and provides
195
+ approved assignment rows. Never repair a missing public handoff by asking the
196
+ user to select an investigation route.
212
197
 
213
198
  Use `agent_research_start` when the task needs investigation before provider
214
199
  consensus. Research start receives the workflow profile, prompt pack,
@@ -266,15 +251,15 @@ run pre-round fan-out, or create the initial items.
266
251
  Use these patterns as starting points and adapt them to the task:
267
252
 
268
253
  - Idea/design/review/security/QA with providers: start with focused
269
- host-native `agestra-research` assignments for Host-native first
270
- (`host-seeded`) work, consolidate the evidence, then start provider consensus
271
- over unresolved items. Use external provider research only for Council or
272
- Provider-seeded topology, or when the user explicitly asks for it.
254
+ host-native `agestra-research` assignments, consolidate the evidence, then
255
+ start provider consensus over unresolved items. Use external provider
256
+ research only when an explicit internal handoff permits it; do not offer
257
+ route choices in renewed public workflows.
273
258
  - Code-changing requests with providers: do not run them as a primary Agestra
274
259
  workflow. Explain that the current host should implement first, then Agestra
275
260
  can review, QA, or security-check the result.
276
261
  - Host participant needed in consensus: add an explicit host-turn participant
277
- routed to `agestra-debate`; submit its JSON answer with
262
+ routed to `agestra-debate`; submit its [ITEM] markup answer with
278
263
  `agent_consensus_submit_turn`.
279
264
  </Team_Composition>
280
265
 
@@ -294,91 +279,46 @@ Connection / Boundary Checks must cover:
294
279
  - command/result consistency
295
280
  - E2E artifact interpretation
296
281
 
297
- Across all three QA topologies Council QA, Host-native first QA,
298
- Provider-seeded QA — browser/dev-server/runtime flows remain host-owned, and
299
- external providers cross-check artifacts only. Persistent E2E file creation
300
- is outside Agestra; E2E execution is gated by the workspace's package.json
301
- scripts.e2e entry.
302
- </QA_Boundary>
303
-
304
- <QA_Topology_Execution>
305
- For `/agestra qa`, the handoff packet's `topology` field is authoritative.
306
- Team-lead does not re-ask if the packet already names one of Council QA,
307
- Host-native first QA, or Provider-seeded QA.
308
-
309
- If the handoff packet omits topology, team-lead MUST stop and run a mandatory
310
- design selection gate before any provider fan-out. The three 조사 방식
311
- produce different artifact contracts, participant routes, and evidence
312
- weights, so host-level no-questions directives, "keep going" wording, or
313
- short user prompts DO NOT authorize a silent default. Always surface the
314
- three options (Council QA / Host-native first QA / Provider-seeded QA)
315
- through `AskUserQuestion` (or the host equivalent), each with a one-line
316
- description, and wait for the user's explicit choice before continuing.
317
-
318
- A host-only fallback is not a routing option for QA. If no external
319
- providers are configured or available, team-lead stops and directs the user
320
- to `/agestra setup`.
321
-
322
- Trust registration is a separate security approval gate: no-questions /
323
- keep-going instructions are not user approval. If providers are
324
- workspace-blocked, ask once and then call `provider_trust_apply` once per
325
- approved provider. Use batch trust only when the host permission model
326
- explicitly permits it.
282
+ QA is not a public `/agestra` command. It appears as a review or research
283
+ profile/lens/rubric, with host-owned runtime evidence first and external
284
+ providers used afterward for debate, rebuttal, or reinforcement. Do not ask the
285
+ user to choose a QA route.
327
286
 
328
- ### Council QA
329
-
330
- 1. Select the QA workflow profile and call `agent_research_start`.
331
- 2. Assign the 6 QA lenses to participants: executable evidence,
332
- spec-to-code compliance, integration risk, edge/error states, test
333
- adequacy, safety hygiene.
334
- 3. Record the host's empirical evidence — `qa_run` output plus host-owned
335
- E2E execution when `scripts.e2e` exists — through `agent_research_record`
336
- BEFORE consensus starts, with `evidenceType: "empirical"` on every claim
337
- derived from the executable artifacts.
338
- 4. External provider claims default to `evidenceType: "inferential"` unless
339
- the provider was assigned an empirical follow-up lens.
340
- 5. Inherit research's council defaults for `max_rounds`.
287
+ Browser/dev-server/runtime flows remain host-owned, and external providers
288
+ cross-check artifacts only. Persistent E2E file creation is outside Agestra; E2E
289
+ execution is gated by the workspace's package.json scripts.e2e entry.
290
+ </QA_Boundary>
341
291
 
342
- ### Host-native first QA
292
+ <QA_Evidence_Execution>
293
+ When a review/research workflow includes a QA lens:
343
294
 
344
295
  1. Run `qa_run` plus host-owned E2E execution when `scripts.e2e` exists
345
296
  (gated by the workspace `package.json` `scripts.e2e` entry; absent
346
297
  means E2E is skipped with a reason recorded).
347
- 2. Use host-native `agestra-research` only through the active host's native
348
- agent surface for narrow evidence assignments. Never put
349
- `agestra-research` in the external provider `participants` list.
298
+ 2. Use host-native `agestra-research` through the active host's native agent
299
+ surface for narrow evidence assignments. Never put `agestra-research` in the
300
+ external provider `participants` list.
350
301
  3. Prepare `aggregation.items` from concrete evidence with
351
302
  `evidenceType: "empirical"` on items derived from runnable artifacts.
352
- 4. Call debate-only `agent_consensus_start` with `workflow: "qa"`, the QA
353
- `questionSet`, `aggregation`, `evidencePolicy`, exact provider participants, optional
354
- `participant_routes` for a host-native `agestra-debate` participant,
355
- `max_rounds: 1`, and a bounded participant timeout.
303
+ 4. Call debate-only `agent_consensus_start` with the selected workflow label,
304
+ questionSet, aggregation, evidencePolicy, exact provider participants,
305
+ optional `participant_routes` for a host-native `agestra-debate` participant,
306
+ and bounded round/timeout settings.
356
307
  5. External provider stances on host empirical items default to
357
308
  `evidenceType: "inferential"`; `"mixed"` only when the provider cites an
358
309
  independent empirical artifact it actually inspected.
359
310
 
360
- ### Provider-seeded QA
361
-
362
- 1. Run the selected `seed_provider` first and record its claims with
363
- `evidenceType: "inferential"`.
364
- 2. Run the host's empirical evidence pass — `qa_run` plus host-owned E2E
365
- execution when `scripts.e2e` exists — and append host claims with
366
- `evidenceType: "empirical"`. Host claims that explicitly confirm or
367
- refute a provider-seed claim use `evidenceType: "mixed"`.
368
- 3. Call debate-only `agent_consensus_start` with `workflow: "qa"`, the QA
369
- `questionSet`, `aggregation`, `evidencePolicy`, the seed provider + at least
370
- one reviewer + the host-debate participant route, `max_rounds: 1`, and a
371
- bounded participant timeout.
372
-
373
- ### Evidence-type policy (all three topologies)
311
+ Trust registration is a separate security approval gate: no-questions /
312
+ keep-going instructions are not user approval. If providers are
313
+ workspace-blocked, ask once and then call `provider_trust_apply` once per
314
+ approved provider. Use batch trust only when the host permission model
315
+ explicitly permits it.
374
316
 
375
317
  Every QA claim carries `evidenceType`. Host empirical claims include an
376
318
  `evidence_ref` (e.g., `docs/reports/qa/.../qa_run.log#L42-L58`). Two
377
319
  `"inferential"` agree votes do not outweigh one `"empirical"` refutation —
378
320
  the renderer surfaces the asymmetry, the human reviewer decides.
379
321
 
380
- ### Host-native + progress routing (all three topologies)
381
-
382
322
  Never substitute `agestra-research` with an external CLI provider; route any
383
323
  host-debate participant via `participant_routes` to `agestra-debate`. Poll
384
324
  `agent_debate_status` and `run_observable_events` at 30-60 second intervals
@@ -388,7 +328,7 @@ relay progress, or stop and direct the user to `/agestra setup`. If the
388
328
  status reports pending host turns, dispatch the `agestra-debate` native
389
329
  agent with the pending packet, then submit the JSON using
390
330
  `agent_consensus_submit_turn`.
391
- </QA_Topology_Execution>
331
+ </QA_Evidence_Execution>
392
332
 
393
333
  <Completion_Report>
394
334
  Before reporting completion, inspect the evidence yourself. Report: