agestra 4.15.0 → 4.15.1
- package/.claude-plugin/marketplace.json +1 -1
- package/.claude-plugin/plugin.json +1 -13
- package/.gemini/commands/agestra/research.toml +5 -6
- package/AGENTS.md +17 -5
- package/GEMINI.md +1 -1
- package/README.ja.md +8 -10
- package/README.ko.md +8 -10
- package/README.md +8 -10
- package/README.zh.md +8 -10
- package/agents/agestra-debate.md +21 -28
- package/agents/agestra-team-lead.md +55 -115
- package/commands/research.md +108 -298
- package/commands/review.md +17 -16
- package/commands/setup.md +33 -17
- package/dist/bundle.js +310 -362
- package/package.json +1 -1
- package/scripts/host-assets/categories.mjs +4 -0
- package/scripts/host-assets/codex-assets.mjs +183 -6
- package/scripts/host-assets/gemini-assets.mjs +110 -7
- package/scripts/host-assets/manifest.mjs +10 -0
- package/skills/leader.md +15 -16
- package/skills/provider-guide.md +88 -49
- package/skills/references/lenses/README.md +7 -5
- package/skills/research.md +93 -315
- package/skills/review.md +20 -18
- package/skills/setup.md +26 -15
- package/.gemini/commands/agestra/design.toml +0 -16
- package/.gemini/commands/agestra/idea.toml +0 -16
- package/.gemini/commands/agestra/qa.toml +0 -16
- package/.gemini/commands/agestra/security.toml +0 -16
- package/commands/design.md +0 -158
- package/commands/idea.md +0 -151
- package/commands/qa.md +0 -162
- package/commands/security.md +0 -122
package/.claude-plugin/marketplace.json
CHANGED
@@ -12,7 +12,7 @@
 "name": "agestra",
 "source": "./",
 "description": "Multi-host MCP orchestration across Claude, Ollama, Gemini, and Codex for review, QA, and cross-validation",
-"version": "4.15.
+"version": "4.15.1",
 "author": {
 "name": "mua-vtuber"
 },
package/.claude-plugin/plugin.json
CHANGED
@@ -1,6 +1,6 @@
 {
 "name": "agestra",
-"version": "4.15.
+"version": "4.15.1",
 "description": "Claude Code plugin — multi-host MCP orchestration across Claude, Ollama, Gemini, and Codex for review, QA, and cross-validation",
 "mcpServers": {
 "agestra": {
@@ -9,17 +9,5 @@
 "${CLAUDE_PLUGIN_ROOT}/dist/bundle.js"
 ]
 }
-},
-"hooks": {
-"UserPromptSubmit": [
-{
-"hooks": [
-{
-"type": "command",
-"command": "node ${CLAUDE_PLUGIN_ROOT}/hooks/user-prompt-submit.js"
-}
-]
-}
-]
 }
 }
package/.gemini/commands/agestra/research.toml
CHANGED
@@ -1,16 +1,15 @@
 # Generated by Agestra. Managed file.
-description = "Run research
+description = "Run host-owned Agestra research with an idea, QA, or security viewpoint"
 prompt = """
 You are executing the `/agestra research` Gemini command.
 
-- Start with `setup_status
--
--
+- Start with `setup_status`; provider availability is not a gate for research.
+- Ask for the research viewpoint when missing: Idea exploration, QA evidence set, or Security evidence set.
+- Use host-owned `agestra-research` assignments. Do not ask for a research topology or provider investigation mode.
+- Host research contract uses workflow profiles, `aggregation`, `questionSet`, and `evidencePolicy`:
 호스트가 조사한다.
 호스트가 정리한다.
-시스템이 토론한다.
 호스트가 문서화한다.
-- External AI research and debate run in separate fresh sessions, even when the same provider participates in both phases.
 
 @{commands/research.md}
 """
package/AGENTS.md
CHANGED
@@ -30,18 +30,17 @@ Use `host_assets_status` to inspect generated Codex host assets, and only call `
 
 ## Workflow Mapping
 
+- Public slash commands are limited to setup, research, and review.
+- Setup and research requests follow `commands/setup.md` and `commands/research.md`.
 - When Agestra is active, review requests follow `commands/review.md`
--
-- When Agestra is active, security audit requests follow `commands/security.md`
+- Internal QA, security, design, idea, and planning viewpoints remain available through workflow profiles and skill/lens resources; they are not shipped as public slash commands.
 - Review, QA, and security workflows write durable reports under `docs/reports/review/`, `docs/reports/qa/`, and `docs/reports/security/` unless the user asks for chat-only output.
-- When Agestra is active, design and architecture requests follow `commands/design.md`
-- When Agestra is active, idea discovery requests follow `commands/idea.md`
 
 ## Core MCP Tools
 
 - `setup_status`, `environment_check`, and `provider_list`: inspect installation, host, and provider state for Agestra health checks and active Agestra workflows
 - `agent_research_start`: research-only preprocessing with workflow profile, prompt pack, questionSet, evidencePolicy, research lenses, and investigator assignments; writes `research_submissions.json`, `research_transcript.json`, and `aggregation.json`; does not start debate
-- `agent_consensus_start` (with `agent_debate_approve`/`_continue`/`_reject`)
+- `agent_consensus_start` (with `agent_debate_approve`/`_continue`/`_reject`): debate-only approval-gated consensus flows from prepared `aggregation`, supplied `questionSet`, and `evidencePolicy`; `workflow` is a report/artifact label only, not a debate routing branch
 - `host_assets_status`, `host_assets_install`, `host_assets_uninstall`: inspect and explicitly manage generated Codex host-native assets such as custom agents and skills
 - `qa_run`: run workspace build/test verification for QA evidence
 
@@ -50,3 +49,16 @@ Use `host_assets_status` to inspect generated Codex host assets, and only call `
 - `agents/`: canonical role prompts (`agestra-team-lead`, `agestra-research`, `agestra-debate`)
 - `skills/`: reusable workflow references
 - `GEMINI.md` and `.gemini/commands/`: Gemini-specific host assets; keep behavior aligned with them when updating shared workflows
+
+## graphify
+
+This project has a knowledge graph at graphify-out/ with god nodes, community structure, and cross-file relationships.
+
+When the user types `/graphify`, invoke the `skill` tool with `skill: "graphify"` before doing anything else.
+
+Rules:
+- For codebase questions, first run `graphify query "<question>"` when graphify-out/graph.json exists. Use `graphify path "<A>" "<B>"` for relationships and `graphify explain "<concept>"` for focused concepts. These return a scoped subgraph, usually much smaller than GRAPH_REPORT.md or raw grep output.
+- Dirty graphify-out/ files are expected after hooks or incremental updates; dirty graph files are not a reason to skip graphify. Only skip graphify if the task is about stale or incorrect graph output, or the user explicitly says not to use it.
+- If graphify-out/wiki/index.md exists, use it for broad navigation instead of raw source browsing.
+- Read graphify-out/GRAPH_REPORT.md only for broad architecture review or when query/path/explain do not surface enough context.
+- After modifying code, run `graphify update .` to keep the graph current (AST-only, no API cost).
package/GEMINI.md
CHANGED
@@ -50,7 +50,7 @@ QA/review/security on the result.
 profile, prompt pack, `questionSet`, `evidencePolicy`, research lenses, and
 investigator assignments; writes `research_submissions.json`,
 `research_transcript.json`, and `aggregation.json`; does not start debate
-- debate-only `agent_consensus_start`, `agent_debate_approve`/`_continue`/`_reject
+- debate-only `agent_consensus_start`, `agent_debate_approve`/`_continue`/`_reject`: sessions from prepared `aggregation`, supplied `questionSet`, `evidencePolicy`, and approval-gated debate artifacts
 - `workspace_*`: document-backed review and aggregation flows
 - `qa_run`: workspace build/test verification for QA evidence
 
package/README.ja.md
CHANGED
@@ -21,29 +21,27 @@ Agestra は、1 つの問題を複数の AI 視点で検討し、整理するた
 
 インストール後、プロジェクトを開いて Agestra ワークフローを呼び出します。
 
-- Claude Code: `/agestra
-- Gemini CLI: `/agestra:
+- Claude Code: `/agestra research ...` または `/agestra review ...`
+- Gemini CLI: `/agestra:research ...` または `/agestra:review ...`
 - Codex CLI: `Use Agestra with Gemini and Codex to review this branch.` のように、Agestra や複数 AI を明示して依頼
 
 初回は使う provider を聞かれることがあります。provider が 1 つだけでもセットアップやホスト所有の作業はできますが、複数 AI 比較は 2 つ以上あるとより有効です。
 
 ## 何に使うか
 
-- `
-- `
--
-- `design`: 実装前に構造やトレードオフを整理する
-- `idea`: 改善案、代替案、類似ツールを探る
+- `research`: アイデア、QA、セキュリティの質問に必要な根拠を現在のホストだけで調査し整理します。この流れでは外部 provider は調査しません。
+- `review`: 既存のコード、ドキュメント、diff、または準備済みのリサーチ結果をもとに討論し、意見を比較します。レビューは新しい調査を始めません。
+- レビュー観点には、コード品質、回帰リスク、UX、整理、設計適合性、性能、信頼性、テスト、安全性のにおい、リリース準備状況を含められます。
 
 ## 実行すると何が起こるか
 
 1. Agestra が設定と利用可能な provider を確認します。
 2. 依頼を対象とスコープが明確なワークフローに整理します。
-3.
-4.
+3. `research` では、現在のホストが根拠を調査し、整理し、文書化します。provider fan-out はありません。
+4. `review` では、選ばれた provider がスコープ内のコード、ドキュメント、diff、または準備済みのリサーチ結果について討論します。別途調査はしません。
 5. 結論、意見の違い、根拠を 1 つの結果として返します。
 
-普通のレビューや QA の依頼が自動で Agestra になるわけではありません。`/agestra ...` を使うか、複数 AI や provider-backed
+普通のレビューや QA の依頼が自動で Agestra になるわけではありません。`/agestra ...` を使うか、複数 AI や provider-backed のリサーチ/レビュー作業を明示したときに Agestra が動きます。
 
 コード変更は、まず現在のホストで直接行うのが基本です。Agestra はその後で結果をレビューし、計画との一致を確認し、複数 provider の意見と根拠を記録するところで最も力を発揮します。
 
package/README.ko.md
CHANGED
@@ -21,29 +21,27 @@ Agestra는 하나의 문제를 여러 AI 시각으로 검토하고 정리해 주
 
 설치 후 프로젝트를 열고 Agestra 워크플로우를 요청하면 됩니다.
 
-- Claude Code: `/agestra
-- Gemini CLI: `/agestra:
+- Claude Code: `/agestra research ...` 또는 `/agestra review ...`
+- Gemini CLI: `/agestra:research ...` 또는 `/agestra:review ...`
 - Codex CLI: `Agestra로 Gemini와 Codex를 같이 써서 이 브랜치 리뷰해줘`처럼 Agestra나 여러 AI를 명시해서 요청
 
 첫 실행에서는 사용할 provider를 물어볼 수 있습니다. provider가 하나만 있어도 설정과 호스트 소유 작업은 가능하지만, 멀티 AI 비교는 둘 이상일 때 가장 잘 살아납니다.
 
 ## 무엇에 쓰나
 
-- `
-- `
--
-- `design`: 구현 전에 구조와 트레이드오프 논의
-- `idea`: 개선 아이디어, 대안, 유사 도구 탐색
+- `research`: 아이디어, QA, 보안 질문에 필요한 근거를 현재 호스트만 조사하고 정리합니다. 외부 provider는 이 흐름에서 조사하지 않습니다.
+- `review`: 이미 있는 코드, 문서, diff, 또는 준비된 리서치 결과를 두고 토론하고 의견을 비교합니다. 리뷰는 새 조사를 시작하지 않습니다.
+- 리뷰 관점은 코드 품질, 회귀 위험, UX, 정리, 설계 적합성, 성능, 안정성, 테스트, 기본 안전 냄새, 배포 준비도를 다룰 수 있습니다.
 
 ## 실행하면 어떻게 되나
 
 1. Agestra가 설정과 사용 가능한 provider를 확인합니다.
 2. 요청을 대상과 범위가 분명한 워크플로우로 정리합니다.
-3.
-4. 선택된 provider들이
+3. `research`에서는 현재 호스트가 근거를 조사하고 정리하고 문서화합니다. provider fan-out은 없습니다.
+4. `review`에서는 선택된 provider들이 범위 안의 코드, 문서, diff, 또는 준비된 리서치 결과를 놓고 토론합니다. 별도 조사는 하지 않습니다.
 5. 결론, 이견, 근거를 하나의 결과로 돌려줍니다.
 
-평범한 리뷰나 QA 요청이 자동으로 Agestra가 되는 것은 아닙니다. `/agestra ...`를 쓰거나, 여러 AI나 provider-backed
+평범한 리뷰나 QA 요청이 자동으로 Agestra가 되는 것은 아닙니다. `/agestra ...`를 쓰거나, 여러 AI나 provider-backed 리서치/리뷰 작업을 명시했을 때 Agestra 워크플로우가 시작됩니다.
 
 코드 변경은 먼저 현재 호스트에서 직접 진행하는 편이 좋습니다. Agestra는 그 다음 결과를 리뷰하고, 계획과 맞는지 검증하고, 여러 provider 의견과 근거를 기록할 때 가장 강합니다.
 
package/README.md
CHANGED
@@ -21,29 +21,27 @@ Install Agestra in the host you already use.
 
 Then open your project and ask for an Agestra workflow.
 
-- Claude Code: `/agestra
-- Gemini CLI: `/agestra:
+- Claude Code: `/agestra research ...` or `/agestra review ...`
+- Gemini CLI: `/agestra:research ...` or `/agestra:review ...`
 - Codex CLI: ask explicitly for Agestra or multiple providers, for example `Use Agestra with Gemini and Codex to review this branch.`
 
 The first workflow may ask which providers you want to use. Agestra works best with two or more providers, but setup and host-owned flows still work with one.
 
 ## What To Use It For
 
-- `
-- `
--
-- `design`: discuss architecture and tradeoffs before coding
-- `idea`: explore improvements, alternatives, and similar tools
+- `research`: host-only evidence gathering for idea, QA, or security questions. External providers do not investigate in this flow.
+- `review`: debate and compare opinions about existing code, docs, diffs, or prepared research. Review does not start a fresh investigation.
+- Review lenses can cover code quality, regressions, UX, cleanup, design fit, performance, reliability, tests, safety smells, and production readiness.
 
 ## How It Runs
 
 1. Agestra checks setup and available providers.
 2. It turns your request into a clear workflow with a target and scope.
-3.
-4.
+3. In `research`, the current host gathers, organizes, and documents evidence. There is no provider fan-out.
+4. In `review`, selected providers discuss the code, documents, diffs, or prepared research already in scope. They do not perform separate research.
 5. Agestra returns one result with conclusions, disagreements, and evidence.
 
-Plain review or QA requests do not automatically become Agestra workflows. Agestra starts when you use `/agestra ...` or explicitly ask for multi-AI or provider-backed review
+Plain review or QA requests do not automatically become Agestra workflows. Agestra starts when you use `/agestra ...` or explicitly ask for multi-AI or provider-backed research/review work.
 
 For code changes, use your current host directly first. Agestra is strongest after that: reviewing the result, checking it against a plan, comparing provider opinions, and recording the evidence.
 
package/README.zh.md
CHANGED
@@ -21,29 +21,27 @@ Agestra 用来让多个 AI 从不同角度审视同一个问题,并把结果
 
 安装后,打开项目并发起 Agestra 工作流。
 
-- Claude Code: `/agestra
-- Gemini CLI: `/agestra:
+- Claude Code: `/agestra research ...` 或 `/agestra review ...`
+- Gemini CLI: `/agestra:research ...` 或 `/agestra:review ...`
 - Codex CLI: 像 `Use Agestra with Gemini and Codex to review this branch.` 这样明确提到 Agestra 或多个 AI
 
 第一次运行时,它可能会询问你要启用哪些 provider。只有一个 provider 也能完成设置和宿主自有流程,但 Multi-AI 比较在两个以上 provider 时效果最好。
 
 ## 用它做什么
 
-- `
-- `
--
-- `design`: 在写代码前讨论结构和取舍
-- `idea`: 探索改进方向、备选方案和相似工具
+- `research`: 由当前宿主单独收集并整理 idea、QA 或安全问题所需的证据。外部 provider 不参与这个调查流程。
+- `review`: 围绕已有代码、文档、diff 或已准备好的 research 结果进行讨论并比较意见。review 不会启动新的调查。
+- review 视角可以覆盖代码质量、回归风险、UX、整理点、设计契合度、性能、可靠性、测试、安全异味和上线准备度。
 
 ## 运行时会发生什么
 
 1. Agestra 检查设置和可用 provider。
 2. 它把请求整理成目标和范围明确的工作流。
-3.
-4.
+3. 在 `research` 中,当前宿主负责调查、整理并文档化证据。没有 provider fan-out。
+4. 在 `review` 中,被选中的 provider 讨论范围内的代码、文档、diff 或已准备好的 research 结果。它们不会另行调查。
 5. Agestra 返回一份包含结论、分歧和证据的结果。
 
-普通的 review 或 QA 请求不会自动变成 Agestra 工作流。只有当你使用 `/agestra ...`,或者明确要求多 AI / provider-backed 的 review
+普通的 review 或 QA 请求不会自动变成 Agestra 工作流。只有当你使用 `/agestra ...`,或者明确要求多 AI / provider-backed 的 research/review 工作时,Agestra 才会启动。
 
 代码修改应优先由当前宿主直接完成。Agestra 最适合在修改之后审查结果、按计划验证、比较多个 provider 的意见,并记录证据。
 
package/agents/agestra-debate.md
CHANGED
@@ -3,7 +3,7 @@ name: agestra-debate
 description: |
 Host-native debate participant for Agestra consensus rounds. Reads the assigned
 workflow profile/lens context, answers a pending host turn by the supplied
-question set, and returns the required
+question set, and returns the required [ITEM] markup. It is not the
 moderator, not the team lead, not a reviewer/QA/security specialist identity,
 and does not choose participants or run rounds.
 
@@ -18,7 +18,7 @@ tools: Read, Glob, Grep, Bash
 <Role>
 You are the host-native debate participant for Agestra. You receive one pending
 consensus turn, inspect only the supplied packet/files/lens references, and
-return the required
+return the required [ITEM] answer for that turn.
 
 You are not the consensus engine, moderator, team lead, reviewer, QA judge,
 security auditor, or code-change executor.
@@ -52,47 +52,40 @@ When a lens reference is provided, read only the needed file under
 `skills/references/lenses/`.
 
 Do not load every lens by default. The lens narrows the question; it does not
-override the pending turn packet or
+override the pending turn packet or [ITEM] contract.
 </Lens_Policy>
 
 <Output_Contract>
-Return
-outside the
+Return [ITEM] blocks only. Do not include prose, Markdown, JSON, XML tags, or
+explanations outside the item blocks.
 
 Consensus turn shape:
 
-```
-
-
-
-
-
-
-
-
-
-
-
-
-}
-},
-"finalStatus": "<allowed final status from questionSet>",
-"adjustedRemedy": "optional remedy adjustment when allowed by the packet"
-}
-]
-}
+```text
+[ITEM]
+id: <assigned item id>
+stance: agree
+responds_to: <assigned item id>
+evidence: file:line, artifact path, or item evidence ref
+stanceEvidenceType: empirical
+question: <questionId> | <verdictField> | <allowed verdict> | short evidence-based rationale
+finalStatus: <allowed final status from questionSet>
+adjustedRemedy: optional remedy adjustment when allowed by the packet
+text:
+Concise participant response text.
+[/ITEM]
 ```
 
 Rules:
--
-
+- The mailbox turn id, participant id, and round are supplied by the packet and
+submission tool call; do not invent different values.
 - Answer every assigned item exactly once.
 - Answer every required question in the supplied `questionSet`.
 - Use only verdict values allowed by the supplied `questionSet`.
 - Include stance evidence type and evidence refs for each question answer.
 - Treat `workflow` as artifact context only; do not infer hidden QA, review,
 security, design, idea, or planning rules.
-- Do not
+- Do not return debate JSON.
 </Output_Contract>
 
 <Boundaries>
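Reviewer note: the Output_Contract hunk above swaps the JSON turn shape for line-oriented `[ITEM]` blocks. As an illustration only, the sketch below shows one way such a block could be read in Python. The parser, the `parse_items` name, and the sample values (`item-1`, `q1`, `confirmed`, `accepted`, `src/app.ts:42`) are assumptions made for this note and are not taken from the agestra bundle; only the field names follow the turn shape shown in the diff.

```python
import re

# Matches one [ITEM] ... [/ITEM] block and captures the lines in between.
ITEM_RE = re.compile(r"\[ITEM\]\n(.*?)\n\[/ITEM\]", re.DOTALL)


def parse_items(markup):
    """Parse [ITEM] blocks into dicts (hypothetical helper, not Agestra's parser)."""
    items = []
    for body in ITEM_RE.findall(markup):
        item = {"questions": []}
        lines = body.splitlines()
        for index, line in enumerate(lines):
            # Split on the first colon only, so values like "file:line" stay intact.
            key, sep, value = line.partition(":")
            if not sep:
                continue
            key, value = key.strip(), value.strip()
            if key == "text":
                # Everything after the `text:` line is free-form response text.
                item["text"] = "\n".join(lines[index + 1:]).strip()
                break
            if key == "question":
                # One `question:` line per answered question, pipe-separated.
                item["questions"].append([part.strip() for part in value.split("|")])
            else:
                item[key] = value
        items.append(item)
    return items


# Sample values are invented for illustration.
SAMPLE = """[ITEM]
id: item-1
stance: agree
responds_to: item-1
evidence: src/app.ts:42
stanceEvidenceType: empirical
question: q1 | verdict | confirmed | reproduced locally
finalStatus: accepted
text:
Concise participant response text.
[/ITEM]"""

parsed = parse_items(SAMPLE)
```

Splitting each line on its first colon keeps `file:line` evidence references whole, which is the main thing a naive `split(":")` would get wrong with this shape.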
package/agents/agestra-team-lead.md
CHANGED
@@ -9,10 +9,10 @@ description: |
 
 Do not invoke this agent directly for raw user messages, explicit `/agestra`
 commands, or natural-language Agestra / multi-AI / provider requests. Those
-requests must enter through
-
-
-execution.
+requests must enter through one of the public workflow commands or installed
+workflow skills (`/agestra setup`, `/agestra research`, `/agestra review`)
+first so workflow profiles, questionSets, mode gates, trust gates, and
+evidence-depth gates can run before team-lead execution.
 
 Plain review/QA/check requests without `/agestra` or explicit multi-AI/provider
 wording stay with the current host; they are not Agestra natural-language
@@ -20,7 +20,7 @@ description: |
 model: sonnet
 color: magenta
 codexSandboxMode: read-only
-tools: Read, Glob, Grep, Bash, WebFetch, WebSearch, TodoWrite, AskUserQuestion, Skill, ToolSearch, CronCreate, CronList, CronDelete, Agent, mcp__plugin_agestra_agestra__environment_check, mcp__plugin_agestra_agestra__provider_list, mcp__plugin_agestra_agestra__provider_health, mcp__plugin_agestra_agestra__provider_readiness, mcp__plugin_agestra_agestra__provider_trust_apply, mcp__plugin_agestra_agestra__run_observable_events, mcp__plugin_agestra_agestra__trace_query, mcp__plugin_agestra_agestra__trace_summary, mcp__plugin_agestra_agestra__trace_visualize, mcp__plugin_agestra_agestra__ai_chat, mcp__plugin_agestra_agestra__ai_analyze_files, mcp__plugin_agestra_agestra__ai_compare, mcp__plugin_agestra_agestra__agent_research_start, mcp__plugin_agestra_agestra__agent_consensus_start, mcp__plugin_agestra_agestra__agent_debate_status, mcp__plugin_agestra_agestra__agent_consensus_submit_turn, mcp__plugin_agestra_agestra__agent_debate_approve, mcp__plugin_agestra_agestra__agent_debate_continue, mcp__plugin_agestra_agestra__agent_debate_reject, mcp__plugin_agestra_agestra__agent_cross_validate, mcp__plugin_agestra_agestra__workspace_create_document, mcp__plugin_agestra_agestra__workspace_read, mcp__plugin_agestra_agestra__workspace_list
+tools: Read, Glob, Grep, Bash, WebFetch, WebSearch, TodoWrite, AskUserQuestion, Skill, ToolSearch, CronCreate, CronList, CronDelete, Agent, mcp__plugin_agestra_agestra__environment_check, mcp__plugin_agestra_agestra__provider_list, mcp__plugin_agestra_agestra__provider_health, mcp__plugin_agestra_agestra__provider_readiness, mcp__plugin_agestra_agestra__provider_trust_apply, mcp__plugin_agestra_agestra__run_observable_events, mcp__plugin_agestra_agestra__trace_query, mcp__plugin_agestra_agestra__trace_summary, mcp__plugin_agestra_agestra__trace_visualize, mcp__plugin_agestra_agestra__ai_chat, mcp__plugin_agestra_agestra__ai_analyze_files, mcp__plugin_agestra_agestra__ai_compare, mcp__plugin_agestra_agestra__agent_research_start, mcp__plugin_agestra_agestra__agent_consensus_start, mcp__plugin_agestra_agestra__agent_debate_status, mcp__plugin_agestra_agestra__agent_consensus_submit_turn, mcp__plugin_agestra_agestra__agent_consensus_next_turn, mcp__plugin_agestra_agestra__agent_consensus_claim_turn, mcp__plugin_agestra_agestra__agent_consensus_heartbeat, mcp__plugin_agestra_agestra__agent_consensus_turn_status, mcp__plugin_agestra_agestra__agent_debate_approve, mcp__plugin_agestra_agestra__agent_debate_continue, mcp__plugin_agestra_agestra__agent_debate_reject, mcp__plugin_agestra_agestra__agent_cross_validate, mcp__plugin_agestra_agestra__workspace_create_document, mcp__plugin_agestra_agestra__workspace_read, mcp__plugin_agestra_agestra__workspace_list
 ---
 
 <Role>
@@ -36,12 +36,16 @@ created a self-contained handoff packet. Plain review/QA/check requests without
 Hard entry gate: if you are invoked directly from a raw user request and the
 message does not include a handoff packet with workflow, mode, target/scope,
 provider context, and the relevant workflow gates, do not run setup checks,
-provider checks, consensus, or fan-out. Route back through
-
-use the workflow skill directly; for example, memory
-belongs to the review workflow. If the host exposes
-
-
+provider checks, consensus, or fan-out. Route back through the selected public
+workflow command or installed workflow skill. When the workflow classification
+is clear, use the workflow skill directly; for example, memory
+leak/performance inspection belongs to the review workflow. If the host exposes
+the Skill tool, invoke that skill; otherwise tell the caller to restart through
+`/agestra setup`, `/agestra research`, or `/agestra review`. Do not silently
+fill the missing mode yourself. Public `/agestra` handoffs are limited to setup,
+research, and review. QA, security, design, idea, and planning concerns arrive
+as workflow profiles, lenses, or rubrics inside research/review packets, not as
+public command variants.
 
 Plain review/QA/check requests without `/agestra` or explicit multi-AI/provider wording stay with the current host.
 Natural-language Agestra routing examples must include explicit Agestra/multi-AI/provider wording: Agestra, 아제스트라, multiple AIs, all AIs, other AI, multi-AI, Codex and Gemini, provider comparison, 프로바이더 비교.
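Reviewer note: the hard entry gate in the hunk above is written as prose for the agent, but its logic is simple enough to sketch. The Python below is an illustration under stated assumptions: the packet is modelled as a plain dict, and the key names (`workflow`, `mode`, `target`, `scope`, `provider_context`, `gates`) and the `entry_gate` function are hypothetical. The diff names the concepts but does not define a packet schema.

```python
# Public workflows named in the updated prompt; everything else routes back.
PUBLIC_WORKFLOWS = {"setup", "research", "review"}

# Hypothetical key names for the fields the prompt lists as required.
REQUIRED_FIELDS = ("workflow", "mode", "target", "scope", "provider_context", "gates")


def entry_gate(packet):
    """Return 'proceed' or a route-back reason for a handoff packet (sketch only)."""
    if packet is None:
        return "route-back: no handoff packet"
    # A field that is absent or empty counts as missing; the gate never fills it in.
    missing = [name for name in REQUIRED_FIELDS if not packet.get(name)]
    if missing:
        return "route-back: missing " + ", ".join(missing)
    if packet["workflow"] not in PUBLIC_WORKFLOWS:
        return "route-back: not a public workflow"
    return "proceed"


# Example packet with invented values.
complete_packet = {
    "workflow": "review",
    "mode": "debate",
    "target": "src/",
    "scope": "current branch",
    "provider_context": ["gemini", "codex"],
    "gates": {"trust": "approved"},
}
decision = entry_gate(complete_packet)
```

The point of the sketch is the ordering: a missing packet or missing field routes back before any setup, provider, or consensus call is made, and a QA or security label in `workflow` is rejected because those now arrive as profiles or lenses inside research/review packets.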
@@ -106,7 +110,7 @@ Default routing order for host-owned research and debate:
 participant such as `host-debate` with `participant_routes` pointing to
 `agestra-debate`.
 4. Use external providers only as independent challengers, reviewers, or
-
+consensus participants after host-owned evidence has been prepared.
 
 Never treat `claude-host` sampling failure as a reason to fall back to
 `claude-cli` for the host role. Likewise, do not map the current Codex or Gemini
@@ -179,36 +183,17 @@ Workflow skills provide the workflow profile and questionSet output. Do not
 repeat the full workflow intake when the handoff packet already contains target,
 scope, depth/lens, constraints, and report expectations.
 
-For
-
-
-
-
-
-
-
-
-
-
-confirm a bounded assignment table when the handoff does not already include
-approved rows.
-- `provider-seeded`: one configured provider creates the first seed/evidence
-artifact; host-native and other provider participants independently challenge
-it. If the seed provider is missing or unavailable, ask once for a replacement
-or fall back to `host-seeded` when asking is blocked.
-- `automatic`: choose the lightest topology that preserves quality. Prefer
-Host-native first (`host-seeded`) for bounded/scoped work, `council` for
-broad/open-ended discovery, and `provider-seeded` only when the user named a
-seed provider or explicitly asked a provider to lead the investigation.
-
-If provider-backed work needs a research topology but the handoff omitted it,
-the team-lead MUST stop and run a mandatory design selection gate before any
-provider fan-out. The three 조사 방식 produce different artifact contracts and
-participant routes, so host-level no-questions directives, "keep going" wording,
-or short user prompts DO NOT authorize a silent default. Always surface the
-three options (Council Research / Host-native first / Provider-seeded Research)
-through `AskUserQuestion` (or the host equivalent), each with a one-line
-description, and wait for the user's explicit choice before continuing.
+For renewed public `/agestra research` and `/agestra review` handoffs, research
+is host-owned by default. Create focused host-native `agestra-research`
+assignments from the selected viewpoint, lenses, and research notes, then
+continue. Do not ask the user to choose investigation routes or provider modes.
+
+External providers are debate challengers, reviewers, or consensus participants
+after host research has been prepared. They are not research investigators for
+renewed public research/review unless an explicit internal handoff outside these
+public commands sets `allow_external_research_investigators: true` and provides
+approved assignment rows. Never repair a missing public handoff by asking the
+user to select an investigation route.
 
 Use `agent_research_start` when the task needs investigation before provider
 consensus. Research start receives the workflow profile, prompt pack,
@@ -266,15 +251,15 @@ run pre-round fan-out, or create the initial items.
 Use these patterns as starting points and adapt them to the task:
 
 - Idea/design/review/security/QA with providers: start with focused
-host-native `agestra-research` assignments
-
-
-
+host-native `agestra-research` assignments, consolidate the evidence, then
+start provider consensus over unresolved items. Use external provider
+research only when an explicit internal handoff permits it; do not offer
+route choices in renewed public workflows.
 - Code-changing requests with providers: do not run them as a primary Agestra
 workflow. Explain that the current host should implement first, then Agestra
 can review, QA, or security-check the result.
 - Host participant needed in consensus: add an explicit host-turn participant
-routed to `agestra-debate`; submit its
+routed to `agestra-debate`; submit its [ITEM] markup answer with
 `agent_consensus_submit_turn`.
 </Team_Composition>
 
@@ -294,91 +279,46 @@ Connection / Boundary Checks must cover:
 - command/result consistency
 - E2E artifact interpretation
 
-
-
-
-
-scripts.e2e entry.
-</QA_Boundary>
-
-<QA_Topology_Execution>
-For `/agestra qa`, the handoff packet's `topology` field is authoritative.
-Team-lead does not re-ask if the packet already names one of Council QA,
-Host-native first QA, or Provider-seeded QA.
-
-If the handoff packet omits topology, team-lead MUST stop and run a mandatory
-design selection gate before any provider fan-out. The three 조사 방식
-produce different artifact contracts, participant routes, and evidence
-weights, so host-level no-questions directives, "keep going" wording, or
-short user prompts DO NOT authorize a silent default. Always surface the
-three options (Council QA / Host-native first QA / Provider-seeded QA)
-through `AskUserQuestion` (or the host equivalent), each with a one-line
-description, and wait for the user's explicit choice before continuing.
-
-A host-only fallback is not a routing option for QA. If no external
-providers are configured or available, team-lead stops and directs the user
-to `/agestra setup`.
-
-Trust registration is a separate security approval gate: no-questions /
-keep-going instructions are not user approval. If providers are
-workspace-blocked, ask once and then call `provider_trust_apply` once per
-approved provider. Use batch trust only when the host permission model
-explicitly permits it.
+QA is not a public `/agestra` command. It appears as a review or research
+profile/lens/rubric, with host-owned runtime evidence first and external
+providers used afterward for debate, rebuttal, or reinforcement. Do not ask the
+user to choose a QA route.
 
-
-
-
-
-spec-to-code compliance, integration risk, edge/error states, test
-adequacy, safety hygiene.
-3. Record the host's empirical evidence — `qa_run` output plus host-owned
-E2E execution when `scripts.e2e` exists — through `agent_research_record`
-BEFORE consensus starts, with `evidenceType: "empirical"` on every claim
-derived from the executable artifacts.
-4. External provider claims default to `evidenceType: "inferential"` unless
-the provider was assigned an empirical follow-up lens.
-5. Inherit research's council defaults for `max_rounds`.
+Browser/dev-server/runtime flows remain host-owned, and external providers
+cross-check artifacts only. Persistent E2E file creation is outside Agestra; E2E
+execution is gated by the workspace's package.json scripts.e2e entry.
+</QA_Boundary>
 
-
+<QA_Evidence_Execution>
+When a review/research workflow includes a QA lens:
 
 1. Run `qa_run` plus host-owned E2E execution when `scripts.e2e` exists
 (gated by the workspace `package.json` `scripts.e2e` entry; absent
 means E2E is skipped with a reason recorded).
|
|
347
|
-
2. Use host-native `agestra-research`
|
|
348
|
-
|
|
349
|
-
|
|
298
|
+
2. Use host-native `agestra-research` through the active host's native agent
|
|
299
|
+
surface for narrow evidence assignments. Never put `agestra-research` in the
|
|
300
|
+
external provider `participants` list.
|
|
350
301
|
3. Prepare `aggregation.items` from concrete evidence with
|
|
351
302
|
`evidenceType: "empirical"` on items derived from runnable artifacts.
|
|
352
|
-
4. Call debate-only `agent_consensus_start` with
|
|
353
|
-
|
|
354
|
-
`participant_routes` for a host-native `agestra-debate` participant,
|
|
355
|
-
|
|
303
|
+
4. Call debate-only `agent_consensus_start` with the selected workflow label,
|
|
304
|
+
questionSet, aggregation, evidencePolicy, exact provider participants,
|
|
305
|
+
optional `participant_routes` for a host-native `agestra-debate` participant,
|
|
306
|
+
and bounded round/timeout settings.
|
|
356
307
|
5. External provider stances on host empirical items default to
|
|
357
308
|
`evidenceType: "inferential"`; `"mixed"` only when the provider cites an
|
|
358
309
|
independent empirical artifact it actually inspected.
|
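The E2E gate in step 1 can be sketched as follows. This is a minimal illustration, not Agestra's implementation: the function name `e2e_gate` and the shape of the returned record are hypothetical, and the only behavior taken from the text is that E2E runs when the workspace `package.json` has a `scripts.e2e` entry and is otherwise skipped with a recorded reason.

```python
import json
import tempfile
from pathlib import Path


def e2e_gate(workspace: Path) -> dict:
    """Decide whether host-owned E2E execution should run (hypothetical helper)."""
    package_json = workspace / "package.json"
    if not package_json.is_file():
        return {"run_e2e": False, "reason": "package.json not found"}
    try:
        manifest = json.loads(package_json.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        return {"run_e2e": False, "reason": "package.json is not valid JSON"}
    # Only a non-empty string under scripts.e2e opens the gate.
    scripts = manifest.get("scripts") if isinstance(manifest, dict) else None
    command = scripts.get("e2e") if isinstance(scripts, dict) else None
    if not isinstance(command, str) or not command.strip():
        return {"run_e2e": False, "reason": "scripts.e2e is absent"}
    return {"run_e2e": True, "command": command}


if __name__ == "__main__":
    # Demo against a throwaway workspace; the script value is made up.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "package.json").write_text(
            json.dumps({"scripts": {"e2e": "playwright test"}}), encoding="utf-8"
        )
        print(e2e_gate(root))
```

Returning the skip reason alongside the decision keeps the "skipped with a reason recorded" requirement visible to whatever writes the QA report.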
|
359
310
|
|
|
360
|
-
|
|
361
|
-
|
|
362
|
-
|
|
363
|
-
|
|
364
|
-
|
|
365
|
-
execution when `scripts.e2e` exists — and append host claims with
|
|
366
|
-
`evidenceType: "empirical"`. Host claims that explicitly confirm or
|
|
367
|
-
refute a provider-seed claim use `evidenceType: "mixed"`.
|
|
368
|
-
3. Call debate-only `agent_consensus_start` with `workflow: "qa"`, the QA
|
|
369
|
-
`questionSet`, `aggregation`, `evidencePolicy`, the seed provider + at least
|
|
370
|
-
one reviewer + the host-debate participant route, `max_rounds: 1`, and a
|
|
371
|
-
bounded participant timeout.
|
|
372
|
-
|
|
373
|
-
### Evidence-type policy (all three topologies)
|
|
311
|
+
Trust registration is a separate security approval gate: no-questions /
|
|
312
|
+
keep-going instructions are not user approval. If providers are
|
|
313
|
+
workspace-blocked, ask once and then call `provider_trust_apply` once per
|
|
314
|
+
approved provider. Use batch trust only when the host permission model
|
|
315
|
+
explicitly permits it.
|
|
374
316
|
|
|
375
317
|
Every QA claim carries `evidenceType`. Host empirical claims include an
|
|
376
318
|
`evidence_ref` (e.g., `docs/reports/qa/.../qa_run.log#L42-L58`). Two
|
|
377
319
|
`"inferential"` agree votes do not outweigh one `"empirical"` refutation —
|
|
378
320
|
the renderer surfaces the asymmetry, the human reviewer decides.
|
|
379
321
|
|
|
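The asymmetry rule above might be checked along these lines. This is a sketch under assumptions: the `Stance` record, its `verdict` field, and `needs_human_decision` are hypothetical names, and only the `evidenceType` values and the rule that inferential agreement does not outweigh an empirical refutation come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stance:
    participant: str    # hypothetical participant label
    verdict: str        # "agree" or "refute" (assumed vocabulary)
    evidence_type: str  # "empirical", "inferential", or "mixed"


def needs_human_decision(stances: list[Stance]) -> bool:
    """Flag an item where an empirical refutation faces inference-only agreement."""
    empirical_refutation = any(
        s.verdict == "refute" and s.evidence_type == "empirical" for s in stances
    )
    agrees = [s for s in stances if s.verdict == "agree"]
    inference_only_agreement = bool(agrees) and all(
        s.evidence_type == "inferential" for s in agrees
    )
    # Vote counts are deliberately ignored: the asymmetry is surfaced, not resolved.
    return empirical_refutation and inference_only_agreement


if __name__ == "__main__":
    stances = [
        Stance("provider-a", "agree", "inferential"),
        Stance("provider-b", "agree", "inferential"),
        Stance("host", "refute", "empirical"),
    ]
    print(needs_human_decision(stances))
```

The function only flags the item; consistent with the policy, the final call is left to the human reviewer.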
380
|
-
### Host-native + progress routing (all three topologies)
|
|
381
|
-
|
|
382
322
|
Never substitute `agestra-research` with an external CLI provider; route any
|
|
383
323
|
host-debate participant via `participant_routes` to `agestra-debate`. Poll
|
|
384
324
|
`agent_debate_status` and `run_observable_events` at 30-60 second intervals
|
|
@@ -388,7 +328,7 @@ relay progress, or stop and direct the user to `/agestra setup`. If the
|
|
|
388
328
|
status reports pending host turns, dispatch the `agestra-debate` native
|
|
389
329
|
agent with the pending packet, then submit the JSON using
|
|
390
330
|
`agent_consensus_submit_turn`.
|
|
391
|
-
</
|
|
331
|
+
</QA_Evidence_Execution>
|
|
392
332
|
|
|
393
333
|
<Completion_Report>
|
|
394
334
|
Before reporting completion, inspect the evidence yourself. Report:
|