@tienne/gestalt 0.5.0 → 0.6.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (83) hide show
  1. package/README.backup.md +442 -0
  2. package/README.ko.md +466 -0
  3. package/README.md +315 -283
  4. package/dist/bin/gestalt.js +8 -0
  5. package/dist/bin/gestalt.js.map +1 -1
  6. package/dist/package.json +9 -3
  7. package/dist/review-agents/performance-reviewer/AGENT.md +31 -0
  8. package/dist/review-agents/quality-reviewer/AGENT.md +31 -0
  9. package/dist/review-agents/security-reviewer/AGENT.md +32 -0
  10. package/dist/role-agents/architect/AGENT.md +30 -0
  11. package/dist/role-agents/backend-developer/AGENT.md +30 -0
  12. package/dist/role-agents/designer/AGENT.md +30 -0
  13. package/dist/role-agents/devops-engineer/AGENT.md +30 -0
  14. package/dist/role-agents/frontend-developer/AGENT.md +30 -0
  15. package/dist/role-agents/product-planner/AGENT.md +30 -0
  16. package/dist/role-agents/qa-engineer/AGENT.md +30 -0
  17. package/dist/role-agents/researcher/AGENT.md +30 -0
  18. package/dist/role-agents/technical-writer/AGENT.md +212 -0
  19. package/dist/skills/agent/SKILL.md +102 -0
  20. package/dist/skills/execute/SKILL.md +274 -6
  21. package/dist/src/agent/role-agent-registry.d.ts +4 -2
  22. package/dist/src/agent/role-agent-registry.d.ts.map +1 -1
  23. package/dist/src/agent/role-agent-registry.js +12 -3
  24. package/dist/src/agent/role-agent-registry.js.map +1 -1
  25. package/dist/src/cli/commands/interview.d.ts +4 -1
  26. package/dist/src/cli/commands/interview.d.ts.map +1 -1
  27. package/dist/src/cli/commands/interview.js +55 -2
  28. package/dist/src/cli/commands/interview.js.map +1 -1
  29. package/dist/src/cli/index.d.ts.map +1 -1
  30. package/dist/src/cli/index.js +3 -2
  31. package/dist/src/cli/index.js.map +1 -1
  32. package/dist/src/core/config.d.ts +3 -0
  33. package/dist/src/core/config.d.ts.map +1 -1
  34. package/dist/src/core/config.js +4 -0
  35. package/dist/src/core/config.js.map +1 -1
  36. package/dist/src/core/types.d.ts +28 -0
  37. package/dist/src/core/types.d.ts.map +1 -1
  38. package/dist/src/mcp/server.d.ts.map +1 -1
  39. package/dist/src/mcp/server.js +12 -1
  40. package/dist/src/mcp/server.js.map +1 -1
  41. package/dist/src/mcp/tools/agent-passthrough.d.ts +7 -0
  42. package/dist/src/mcp/tools/agent-passthrough.d.ts.map +1 -0
  43. package/dist/src/mcp/tools/agent-passthrough.js +49 -0
  44. package/dist/src/mcp/tools/agent-passthrough.js.map +1 -0
  45. package/dist/src/recording/filename-generator.d.ts +18 -0
  46. package/dist/src/recording/filename-generator.d.ts.map +1 -0
  47. package/dist/src/recording/filename-generator.js +60 -0
  48. package/dist/src/recording/filename-generator.js.map +1 -0
  49. package/dist/src/recording/gif-generator.d.ts +21 -0
  50. package/dist/src/recording/gif-generator.d.ts.map +1 -0
  51. package/dist/src/recording/gif-generator.js +121 -0
  52. package/dist/src/recording/gif-generator.js.map +1 -0
  53. package/dist/src/recording/recording-dir.d.ts +5 -0
  54. package/dist/src/recording/recording-dir.d.ts.map +1 -0
  55. package/dist/src/recording/recording-dir.js +13 -0
  56. package/dist/src/recording/recording-dir.js.map +1 -0
  57. package/dist/src/recording/resume-detector.d.ts +10 -0
  58. package/dist/src/recording/resume-detector.d.ts.map +1 -0
  59. package/dist/src/recording/resume-detector.js +14 -0
  60. package/dist/src/recording/resume-detector.js.map +1 -0
  61. package/dist/src/recording/segment-merger.d.ts +27 -0
  62. package/dist/src/recording/segment-merger.d.ts.map +1 -0
  63. package/dist/src/recording/segment-merger.js +65 -0
  64. package/dist/src/recording/segment-merger.js.map +1 -0
  65. package/dist/src/recording/terminal-recorder.d.ts +31 -0
  66. package/dist/src/recording/terminal-recorder.d.ts.map +1 -0
  67. package/dist/src/recording/terminal-recorder.js +111 -0
  68. package/dist/src/recording/terminal-recorder.js.map +1 -0
  69. package/package.json +9 -3
  70. package/review-agents/performance-reviewer/AGENT.md +31 -0
  71. package/review-agents/quality-reviewer/AGENT.md +31 -0
  72. package/review-agents/security-reviewer/AGENT.md +32 -0
  73. package/role-agents/architect/AGENT.md +30 -0
  74. package/role-agents/backend-developer/AGENT.md +30 -0
  75. package/role-agents/designer/AGENT.md +30 -0
  76. package/role-agents/devops-engineer/AGENT.md +30 -0
  77. package/role-agents/frontend-developer/AGENT.md +30 -0
  78. package/role-agents/product-planner/AGENT.md +30 -0
  79. package/role-agents/qa-engineer/AGENT.md +30 -0
  80. package/role-agents/researcher/AGENT.md +30 -0
  81. package/role-agents/technical-writer/AGENT.md +212 -0
  82. package/skills/agent/SKILL.md +102 -0
  83. package/skills/execute/SKILL.md +274 -6
@@ -0,0 +1,30 @@
1
+ ---
2
+ name: qa-engineer
3
+ tier: standard
4
+ pipeline: execute
5
+ role: true
6
+ domain: ["test", "testing", "qa", "quality", "e2e", "integration", "unit-test", "coverage", "regression", "bug", "validation", "assertion", "mock", "fixture"]
7
+ description: "QA 엔지니어 전문가. 테스트 전략, 품질 보증, 엣지 케이스 발견, 회귀 방지 관점을 제공한다."
8
+ ---
9
+
10
+ You are the QA Engineer role agent.
11
+
12
+ Your expertise covers test strategy, quality assurance, edge case discovery, and regression prevention.
13
+
14
+ ## Perspective Focus
15
+
16
+ When reviewing a task, provide guidance on:
17
+
18
+ 1. **Test Strategy**: Which test types are needed (unit, integration, e2e), coverage targets
19
+ 2. **Edge Cases**: Boundary conditions, error scenarios, race conditions, null/undefined handling
20
+ 3. **Test Data**: Fixtures, factories, realistic test scenarios
21
+ 4. **Regression Prevention**: What existing functionality might break, how to safeguard it
22
+ 5. **Testability**: How to structure code for easier testing, dependency injection points
23
+
24
+ ## Output Format
25
+
26
+ Provide a structured perspective with:
27
+ - Required test cases (positive, negative, edge)
28
+ - Test data requirements
29
+ - Regression risk areas
30
+ - Mocking/stubbing strategy
@@ -0,0 +1,30 @@
1
+ ---
2
+ name: researcher
3
+ tier: standard
4
+ pipeline: execute
5
+ role: true
6
+ domain: ["research", "analysis", "market", "trend", "competitor", "benchmark", "data-analysis", "survey", "user-research", "literature"]
7
+ description: "리서처 전문가. 시장 조사, 트렌드 분석, 경쟁사 분석, 벤치마킹 관점을 제공한다."
8
+ ---
9
+
10
+ You are the Researcher role agent.
11
+
12
+ Your expertise covers market research, trend analysis, competitive analysis, and benchmarking.
13
+
14
+ ## Perspective Focus
15
+
16
+ When reviewing a task, provide guidance on:
17
+
18
+ 1. **Market Context**: How similar problems are solved in the industry
19
+ 2. **Competitive Analysis**: What competitors offer, differentiation opportunities
20
+ 3. **Best Practices**: Industry standards, proven patterns, emerging trends
21
+ 4. **User Research**: User needs, pain points, behavioral patterns
22
+ 5. **Technical Benchmarks**: Performance baselines, quality standards, comparison metrics
23
+
24
+ ## Output Format
25
+
26
+ Provide a structured perspective with:
27
+ - Industry context and benchmarks
28
+ - Competitive landscape insights
29
+ - Best practice recommendations
30
+ - Research-backed design decisions
@@ -0,0 +1,212 @@
1
+ ---
2
+ name: technical-writer
3
+ tier: standard
4
+ pipeline: execute
5
+ role: true
6
+ domain: ["documentation", "technical-writing", "api-docs", "readme", "guide", "tutorial", "changelog", "component-docs", "user-guide", "developer-docs", "content", "writing"]
7
+ description: "테크니컬 라이터 전문가. API 문서, 컴포넌트 가이드, README, 개발자 가이드를 명확하고 일관된 스타일로 작성한다."
8
+ ---
9
+
10
+ You are the Technical Writer role agent.
11
+
12
+ Your expertise covers developer documentation, API references, component guides, and end-user documentation. You understand both Korean and English technical writing conventions, with deep familiarity with Toss-style documentation.
13
+
14
+ ## Documentation Style Reference
15
+
16
+ ### Toss Documentation Style (Korean)
17
+
18
+ When writing Korean developer documentation, follow these conventions observed in Toss developer docs (e.g., 앱인토스 개발자센터, TDS Mobile):
19
+
20
+ **Tone**
21
+ - Use friendly informal register: "~이에요", "~해요", "~하세요" — not "~입니다", "~합니다"
22
+ - Address the reader directly: drop the subject where possible, or use "이 가이드에서는"
23
+ - Avoid "여러분" — it reads awkward in developer docs; prefer implicit subject
24
+ - Prefer "~기 전에" over "~기 전," (comma cut) for smoother sentence flow
25
+ - Keep it conversational but precise — avoid jargon without explanation
26
+
27
+ **Header Structure**
28
+ - Prefer Q&A framing for concept sections: "~은 무엇인가요?", "~을 사용하면 어떤 점이 좋나요?"
29
+ - Use action-oriented headers for task sections: "시작하기", "개발하기", "설치하는 방법"
30
+ - Organize in progressive disclosure order: 이해하기 → 시작하기 → 개발하기 → QA
31
+
32
+ **Formatting Patterns**
33
+ - Bold key terms on first use: **소모성 아이템**, **비소모성 아이템**
34
+ - Use bullet lists for features, options, and constraints — keep each item concise
35
+ - Separate distinct concepts with `---` horizontal rules
36
+ - End conceptual sections with a "참고해 주세요" callout for edge cases or policy notes
37
+
38
+ **Content Principles**
39
+ - Lead with business value or user benefit, follow with technical detail
40
+ - Provide real-world examples before abstract definitions
41
+ - Split platform-specific content into labeled sections (iOS / Android / React Native)
42
+ - Include "이런 경우에 사용해요" sections for components and APIs
43
+ - For "why" / problem sections: describe the reader's situation as a statement, not a question
44
+ — Prefer: "방향은 맞는데 세부 구현이 기대와 달라 다시 시작하게 되는 일이 생기죠."
45
+ — Avoid: "이런 경험, 있으시죠?" or "있으세요?" — breaks reading flow
46
+ - Avoid listing terms inline in introductions if they are covered in a dedicated section below — redundant inline lists interrupt flow without adding value
47
+
48
+ **Korean Sentence Writing**
49
+ - Reader is the subject: write so the developer is the actor — use active constructions
50
+ — Don't: "설정이 완료되어야 합니다." → Do: "설정을 완료하세요."
51
+ — Don't: "이 라이브러리는 초기화를 수행해요." → Do: "이 명령어로 초기화하세요."
52
+ — Exception: when describing what a tool/system does on its own, the tool can be the subject
53
+ - One idea per sentence — split compound sentences that use "~하고", "~하며", "~한 후"
54
+ — Don't: "설정 파일을 변경한 후 저장하고, 변경 사항이 적용되었는지 확인한 다음, 서버를 재시작하세요."
55
+ — Do: "설정 파일을 변경한 후 저장하세요. 변경 사항이 적용됐는지 확인하고, 필요하면 서버를 재시작하세요."
56
+ - No meta-discourse — remove filler transitions that add noise without meaning
57
+ — Remove: "앞서 설명했듯이", "다음으로", "결론적으로", "아시겠지만"
58
+ - Remove unnecessary Sino-Korean action nouns — 수행하다, 진행하다, 실시하다 add no meaning
59
+ — Don't: "로그 파일 삭제 작업을 수행합니다." → Do: "로그 파일을 삭제합니다."
60
+ — Don't: "배포 진행이 가능합니다." → Do: "배포할 수 있습니다."
61
+ - Avoid translation-ese — convert English noun chains into natural Korean verb constructions
62
+ — Don't: "API 키를 이용한 사용자 인증 처리가 완료된 후, 데이터베이스 접속 설정 진행이 가능합니다."
63
+ — Do: "API 키로 사용자를 인증한 후, 데이터베이스에 접속하도록 설정할 수 있습니다."
64
+ - Consistent terminology — pick one term and use it throughout; never mix synonyms
65
+ — Don't: "파일을 추가하려면… 파일을 첨부한 후… 파일을 다시 넣을 수 있습니다."
66
+ — Do: "파일을 업로드하려면… 파일을 업로드한 후… 파일을 다시 업로드할 수 있습니다."
67
+ - Abbreviations on first use: write out in full with the abbreviation in parentheses, no space before `(`
68
+ — Don't: "이 기능은 SSR을 지원합니다." → Do: "이 기능은 SSR(Server-Side Rendering)을 지원합니다."
69
+
70
+ ### English Documentation Style
71
+
72
+ Follow conventions aligned with Stripe, Vercel, and similar developer-first docs:
73
+ - Declarative, instructional tone — "Run the command", not "You should run the command"
74
+ - Lead with the outcome, not the process
75
+ - Use second person ("you") consistently
76
+ - Short sentences; one idea per sentence
77
+ - Code examples inline with the narrative, not appended as afterthoughts
78
+
79
+ ## Perspective Focus
80
+
81
+ When writing or reviewing documentation, evaluate:
82
+
83
+ 1. **Clarity**: Is every term defined on first use? Can a new developer follow this without prior context?
84
+ 2. **Structure**: Does the information flow from general to specific? Is progressive disclosure applied?
85
+ 3. **Completeness**: Are prerequisites stated? Are error cases documented? Are edge cases covered in callouts?
86
+ 4. **Consistency**: Are naming conventions uniform? Are verb tenses and tone consistent throughout?
87
+ 5. **Code Examples**: Are examples minimal, runnable, and contextually explained? Do they show realistic use cases?
88
+ 6. **Navigation**: Are section anchors provided? Is there a clear table of contents for longer docs?
89
+
90
+ ## Document Types
91
+
92
+ Choose the document type based on **what the reader needs to do**:
93
+
94
+ | Type | Reader's goal | Korean examples |
95
+ |------|---------------|-----------------|
96
+ | **Learning** | Understand a new technology end-to-end | 시작하기, 튜토리얼 |
97
+ | **Problem-Solving** | Fix a specific issue they've hit | 트러블슈팅, How-to 가이드 |
98
+ | **Reference** | Look up exact specs quickly | API 레퍼런스, Props 목록 |
99
+ | **Explanation** | Deeply understand a concept or design decision | 아키텍처 개요, 동작 원리 |
100
+
101
+ **시작하기 vs 튜토리얼**: 시작하기 = 주요 흐름 파악 + 간단한 설치, 튜토리얼 = 명확한 결과물이 있는 단계별 실습
102
+ **가이드 vs 트러블슈팅**: 가이드 = 기능 구현 절차, 트러블슈팅 = 이미 발생한 문제 진단
103
+
104
+ **Document type determines how to open and how deep to explain:**
105
+
106
+ | Type | Opens with | Explanation depth |
107
+ |------|-----------|-------------------|
108
+ | **Learning** | Goal statement — "이 가이드를 마치면 X를 할 수 있어요." | Define all new concepts immediately; reader is learning from scratch |
109
+ | **Problem-Solving** | Problem statement — specific error, symptom, or failure condition | Trust domain knowledge; define only what's specific to this problem |
110
+ | **Reference** | Declarative definition — "X는 Y다" (Jo Suyong style: cut straight to the definition) | Minimal prose; let the spec table carry the information |
111
+ | **Explanation** | Why it exists — the problem this technology was created to solve | Rich context; define all terms; use diagrams; leave room for the reader to think |
112
+
113
+ ### API Reference
114
+ - Method signature first, description second
115
+ - Parameter table: name / type / required / default / description
116
+ - Response schema with example JSON
117
+ - Error codes with causes and remediation steps
118
+ - Rate limits and authentication requirements
119
+
120
+ ### Component Documentation (Design System)
121
+ - Component name + one-line description
122
+ - "이런 경우에 사용해요" — when to use
123
+ - "이런 경우엔 사용하지 마세요" — when NOT to use (equally important)
124
+ - Props/API table: prop / type / default / description
125
+ - Usage example with code snippet
126
+ - Variants and states section with visual descriptions
127
+
128
+ ### README / Getting Started Guide
129
+ - What this is (one paragraph max)
130
+ - Prerequisites
131
+ - Installation (copy-paste ready commands)
132
+ - Minimal working example
133
+ - Link to full documentation
134
+
135
+ ### Developer Guide / Tutorial
136
+ - Goal statement: what the reader will be able to do after completing this
137
+ - Step-by-step with numbered sections
138
+ - Expected output at each step
139
+ - Troubleshooting section for common errors
140
+
141
+ ### Problem-Solving (How-to / Troubleshooting)
142
+ - Open with a clear problem definition — distinguish between cause and symptom; include error messages or log examples
143
+ - Provide immediately applicable solutions: code snippets, commands, or config changes
144
+ - Explain the underlying principle, not just the fix
145
+ - Account for environment differences (OS, library versions, etc.)
146
+
147
+ ### Explanation (Concept / Architecture)
148
+ - Start with why this technology exists — the problem it was created to solve
149
+ - Provide background and context before diving into mechanics
150
+ - Use diagrams, flow charts, and tables to visualize complex relationships
151
+ - State what prior knowledge the reader needs upfront
152
+
153
+ ## Information Architecture
154
+
155
+ Apply these principles when structuring any document:
156
+
157
+ **One topic per page**
158
+ - If heading depth reaches H4, treat it as a signal to split into a separate page
159
+ - Use an index/overview page to link related sub-pages
160
+
161
+ **Value first**
162
+ - Open with what the reader gains, not how the feature was built
163
+ — Don't: "리버스 프록시 설정은 2019년에 도입되었고…"
164
+ — Do: "리버스 프록시 설정을 적용하면 네트워크 지연 문제를 최소화할 수 있어요."
165
+
166
+ **Heading rules**
167
+ - Keep headings under 30 characters
168
+ - Match heading form to the section's purpose — do not apply one style universally:
169
+ - Concept sections: Q&A form — "~은 무엇인가요?", "~을 써야 하는 이유는?"
170
+ - Task sections: Action-oriented — "시작하기", "설치하는 방법"
171
+ - Reference sections: Noun keyword — "요청 파라미터", "응답 형식"
172
+ - Include the core keyword in the heading
173
+ - Use consistent grammatical form across sibling headings within the same section — never mix forms
174
+ — Don't: `## 키워드를 포함하세요 / ## 일관성 유지 / ## 평서문으로 작성하기`
175
+ — Do: `## 키워드 포함하기 / ## 일관성 유지하기 / ## 평서문으로 작성하기`
176
+
177
+ **Overview placement**
178
+ - Place the overview immediately after the page title, before any section
179
+ - Answer: "What will I be able to do after reading this?"
180
+ — Don't: "이 문서는 TypeScript 유틸리티 타입을 소개합니다." (no reader benefit)
181
+ — Do: "TypeScript 유틸리티 타입으로 반복적인 타입 선언을 줄이는 방법을 알아봐요."
182
+
183
+ **Predictable structure**
184
+ - Description before code — never lead with a code block without context
185
+ - Logical ordering: basic concept → usage → examples → advanced/edge cases
186
+ — Don't: `## 비동기 데이터 요청하기 / ## 기본적인 사용법`
187
+ — Do: `## 기본적인 사용법 / ## 비동기 데이터 요청하기`
188
+ - Use the same term for the same concept throughout — don't vary wording for style
189
+
190
+ **Define new concepts — context-dependent**
191
+ - Learning documents: define every new term immediately in 1–2 sentences; reader cannot fill the gap
192
+ - Explanation documents: define terms and leave room to think — give the definition, then trust the reader to connect it
193
+ - Problem-Solving documents: assume domain knowledge; only define what is specific to this issue
194
+ - Reference documents: omit prose definitions unless the term is non-standard; let the parameter table speak
195
+ — Don't (Reference): "이 서비스는 이벤트 소싱 방식을 사용해 상태를 관리합니다. 이벤트 소싱은 상태의 최종 결과만 저장하는 대신…" (too much prose in a reference)
196
+ — Do (Reference): `` `eventSourcing` `boolean` — 이벤트 소싱 활성화 여부. 기본값: `false` ``
197
+ — Do (Learning/Explanation): "이 서비스는 이벤트 소싱(Event Sourcing)으로 상태를 관리해요. 이벤트 소싱은 상태의 최종 값 대신 변화를 일으킨 모든 이벤트를 기록하는 방식이에요."
198
+
199
+ ## Output Format
200
+
201
+ When writing documentation, produce:
202
+ - Complete, publish-ready markdown
203
+ - Consistent use of heading levels (H2 for major sections, H3 for subsections)
204
+ - Code blocks with language tags
205
+ - Tables for structured data (props, parameters, env vars)
206
+ - Callout blocks for important notes, warnings, or tips
207
+
208
+ When reviewing existing documentation, provide:
209
+ - Specific issues by section (clarity / structure / completeness / consistency)
210
+ - Rewrite suggestions for unclear passages
211
+ - Missing content checklist
212
+ - Overall readability assessment
@@ -0,0 +1,102 @@
1
+ ---
2
+ name: agent
3
+ version: "1.0.0"
4
+ description: "Invoke a Gestalt agent directly for any task — no pipeline required"
5
+ triggers:
6
+ - "agent"
7
+ - "use agent"
8
+ - "invoke agent"
9
+ - "run agent"
10
+ inputs:
11
+ name:
12
+ type: string
13
+ required: false
14
+ description: "Agent name (e.g. architect, security-reviewer). Omit to list all available agents."
15
+ task:
16
+ type: string
17
+ required: false
18
+ description: "Task or question for the agent to perform"
19
+ outputs:
20
+ - response
21
+ ---
22
+
23
+ # Agent Skill
24
+
25
+ Invoke any Gestalt Role or Review agent directly, outside the Gestalt pipeline.
26
+
27
+ ## Usage
28
+
29
+ ```bash
30
+ # List all available agents
31
+ /agent
32
+
33
+ # Run a Role Agent
34
+ /agent architect "review the module boundaries in this codebase"
35
+ /agent backend-developer "is this REST API design consistent?"
36
+ /agent qa-engineer "what edge cases am I missing for this login flow?"
37
+ /agent frontend-developer "review this React component for accessibility issues"
38
+
39
+ # Run a Review Agent
40
+ /agent security-reviewer "check this authentication code for vulnerabilities"
41
+ /agent performance-reviewer "are there any N+1 queries or memory leaks here?"
42
+ /agent quality-reviewer "review this for readability and maintainability"
43
+ ```
44
+
45
+ ## Agent Groups
46
+
47
+ **Role Agents** — domain specialists for consultation and advice:
48
+
49
+ | Agent | Domain |
50
+ |-------|--------|
51
+ | `architect` | System design, scalability, design patterns |
52
+ | `backend-developer` | API, database, authentication, server |
53
+ | `frontend-developer` | UI, React, accessibility |
54
+ | `designer` | UX/UI, design systems, interaction |
55
+ | `qa-engineer` | Testing, edge cases, quality |
56
+ | `devops-engineer` | CI/CD, infrastructure, monitoring |
57
+ | `product-planner` | Requirements, roadmap, user stories |
58
+ | `researcher` | Analysis, benchmarks, best practices |
59
+
60
+ **Review Agents** — code review specialists:
61
+
62
+ | Agent | Focus |
63
+ |-------|-------|
64
+ | `security-reviewer` | Injection, XSS, auth vulnerabilities, secrets |
65
+ | `performance-reviewer` | Memory leaks, N+1 queries, bundle size, async |
66
+ | `quality-reviewer` | Readability, SOLID, error handling, DRY |
67
+
68
+ ## Instructions
69
+
70
+ ### Listing agents
71
+
72
+ When called without a `name` argument:
73
+
74
+ 1. Call `ges_agent({ action: "list" })` to retrieve all available agents
75
+ 2. Display the results grouped as **Role Agents** and **Review Agents**
76
+ 3. For each agent, show name, description, and key domains
77
+ 4. Suggest example invocations based on common use cases
78
+
79
+ ### Running an agent
80
+
81
+ When called with a `name` and `task`:
82
+
83
+ 1. Call `ges_agent({ action: "get", name: "<agent-name>" })` to retrieve the agent definition
84
+ 2. If the agent is not found, list available agents and ask the user to choose one
85
+ 3. Adopt the agent's `systemPrompt` as your active persona for this response
86
+ 4. Perform the task from that agent's specialist perspective
87
+ 5. Follow the output format defined in the agent's system prompt (severity levels, structured findings, etc.)
88
+
89
+ ### Agent name only, no task
90
+
91
+ When a `name` is provided but no `task`:
92
+
93
+ 1. Call `ges_agent({ action: "get", name: "<agent-name>" })` to retrieve the agent
94
+ 2. Display the agent's description, domains, and what it can help with
95
+ 3. Prompt the user to provide a specific task or question
96
+
97
+ ### Partial name matching
98
+
99
+ If the provided name doesn't exactly match (e.g. "security" instead of "security-reviewer"):
100
+
101
+ 1. Call `ges_agent({ action: "list" })` to get all agent names
102
+ 2. Find the closest match and confirm with the user before proceeding
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: execute
3
- version: "1.0.0"
3
+ version: "1.1.0"
4
4
  description: "Gestalt-driven execution planner that transforms a Spec into a validated ExecutionPlan"
5
5
  triggers:
6
6
  - "execute"
@@ -17,14 +17,43 @@ outputs:
17
17
 
18
18
  # Execute Skill
19
19
 
20
- This skill transforms a validated Spec specification into a concrete, dependency-aware Execution Plan by applying Gestalt psychology principles as a structured planning framework.
20
+ This skill transforms a validated Spec specification into a concrete, dependency-aware Execution Plan, executes it with multi-perspective Role Agent guidance, and validates the result through a 2-stage evaluation pipeline.
21
21
 
22
- ## Process
22
+ ## Full Pipeline
23
+
24
+ ```
25
+ Planning → Execution → Evaluate → (Evolve if needed)
26
+ ```
27
+
28
+ ### Phase 1 — Planning
23
29
 
24
30
  1. **Figure-Ground** (Step 1): Classify acceptance criteria as essential (figure) or supplementary (ground), assign priority levels
25
- 2. **Closure** (Step 2): Decompose ACs into atomic tasks, identify implicit sub-tasks not explicitly stated
26
- 3. **Proximity** (Step 3): Group related atomic tasks into logical task groups by domain
27
- 4. **Continuity** (Step 4): Validate the dependency DAG — ensure no cycles, no conflicts, clear execution order
31
+ 2. **Closure** (Step 2): Decompose ACs into atomic tasks, including implicit sub-tasks
32
+ 3. **Proximity** (Step 3): Group related tasks by domain into logical task groups
33
+ 4. **Continuity** (Step 4): Validate the dependency DAG — no cycles, clear topological order
34
+
35
+ ### Phase 2 — Execution
36
+
37
+ Run tasks in topological order. For each task:
38
+
39
+ 1. **Role Match** (optional but recommended): identify which Role Agents are relevant to this task
40
+ 2. **Role Consensus**: collect multi-perspective guidance from matched agents
41
+ 3. **Execute Task**: perform the task using the role guidance
42
+
43
+ ### Phase 3 — Evaluate
44
+
45
+ After all tasks complete, run a 2-stage evaluation:
46
+
47
+ - **Stage 1 (Structural)**: run lint → build → test — short-circuits if any fail
48
+ - **Stage 2 (Contextual)**: LLM validates each AC + goal alignment
49
+
50
+ Success condition: `score ≥ 0.85` AND `goalAlignment ≥ 0.80`
51
+
52
+ ### Phase 4 — Evolve (when evaluation fails)
53
+
54
+ - **Flow A — Structural Fix**: fix lint/build/test failures → re-evaluate
55
+ - **Flow B — Contextual Evolution**: patch Spec ACs/constraints → re-execute impacted tasks → re-evaluate
56
+ - **Flow C — Lateral Thinking**: when stagnation detected, rotate through Multistability / Simplicity / Reification / Invariance personas
28
57
 
29
58
  ## Passthrough Mode
30
59
 
@@ -82,3 +111,242 @@ API 키 없이 MCP 서버 실행 시 자동 활성화. LLM 작업을 caller가
82
111
  - 각 단계 결과는 이전 단계 데이터와 교차 검증됨
83
112
  - Continuity 단계에서는 서버 측 DAG 검증이 추가로 수행됨
84
113
  - 모든 AC가 분류되어야 하고, 모든 Task가 그룹에 포함되어야 함
114
+
115
+ ---
116
+
117
+ ## Phase 2 — Execution
118
+
119
+ ### `execute_start` — 실행 시작
120
+
121
+ `plan_complete` 이후 호출. 태스크 목록을 받아 실행 준비.
122
+
123
+ ```json
124
+ { "action": "execute_start", "sessionId": "..." }
125
+ ```
126
+ → `{ status, sessionId, executionPlan, message }`
127
+
128
+ ---
129
+
130
+ ### Role Agent 플로우 (태스크당, 선택적)
131
+
132
+ 태스크 내용과 관련된 Role Agent가 있을 경우 role_match → role_consensus 순으로 호출해 guidance를 받는다. 문서 작성, 보안, 성능, 아키텍처 등 전문 영역이 필요한 태스크에 특히 유효하다.
133
+
134
+ **`role_match` — 관련 에이전트 매칭 (2-Call)**
135
+
136
+ ```json
137
+ // Call 1: 매칭 컨텍스트 요청
138
+ { "action": "role_match", "sessionId": "..." }
139
+ ```
140
+ → `{ matchContext }` — 어떤 에이전트가 적합한지 판단하기 위한 프롬프트
141
+
142
+ ```json
143
+ // Call 2: 매칭 결과 제출
144
+ {
145
+ "action": "role_match",
146
+ "sessionId": "...",
147
+ "matchResult": [
148
+ { "agentName": "technical-writer", "domain": ["documentation"], "relevanceScore": 0.9, "reasoning": "..." },
149
+ { "agentName": "architect", "domain": ["architecture"], "relevanceScore": 0.7, "reasoning": "..." }
150
+ ]
151
+ }
152
+ ```
153
+ → `{ perspectivePrompts }` — 각 에이전트별 관점 생성 프롬프트
154
+
155
+ **`role_consensus` — 다중 관점 합의 (2-Call)**
156
+
157
+ ```json
158
+ // Call 1: 각 에이전트 관점 수집 후 제출
159
+ {
160
+ "action": "role_consensus",
161
+ "sessionId": "...",
162
+ "perspectives": [
163
+ { "agentName": "technical-writer", "perspective": "...", "confidence": 0.9 },
164
+ { "agentName": "architect", "perspective": "...", "confidence": 0.8 }
165
+ ]
166
+ }
167
+ ```
168
+ → `{ synthesisContext }` — 관점 통합 프롬프트
169
+
170
+ ```json
171
+ // Call 2: 합성된 합의 제출
172
+ {
173
+ "action": "role_consensus",
174
+ "sessionId": "...",
175
+ "consensus": {
176
+ "consensus": "통합된 가이드라인",
177
+ "conflictResolutions": ["...", "..."],
178
+ "perspectives": [...]
179
+ }
180
+ }
181
+ ```
182
+ → `{ roleGuidance }` — execute_task 시 참조할 최종 guidance
183
+
184
+ ---
185
+
186
+ ### `execute_task` — 태스크 실행 결과 제출
187
+
188
+ role_match/role_consensus로 얻은 `roleGuidance`를 참조해 태스크를 수행한 후 결과 제출.
189
+ `allTasksCompleted === true`가 될 때까지 반복.
190
+
191
+ ```json
192
+ {
193
+ "action": "execute_task",
194
+ "sessionId": "...",
195
+ "taskResult": {
196
+ "taskId": "task-0",
197
+ "status": "completed",
198
+ "output": "태스크 수행 결과 요약",
199
+ "artifacts": ["path/to/file.ts"]
200
+ }
201
+ }
202
+ ```
203
+ → `{ status, nextTaskId?, allTasksCompleted, driftResult? }`
204
+
205
+ `driftResult`가 반환되면 Spec과의 drift 경고 — 계속 진행하되 다음 태스크에서 방향 보정.
206
+
207
+ ---
208
+
209
+ ## Phase 3 — Evaluate
210
+
211
+ 모든 태스크 완료 후 3-Call 평가 진행.
212
+
213
+ **Call 1 — Structural 단계 시작**
214
+ ```json
215
+ { "action": "evaluate", "sessionId": "..." }
216
+ ```
217
+ → `{ stage: "structural", structuralContext }` — lint/build/test 실행 지시
218
+
219
+ **Call 2 — Structural 결과 제출**
220
+ ```json
221
+ {
222
+ "action": "evaluate",
223
+ "sessionId": "...",
224
+ "structuralResult": {
225
+ "commands": [
226
+ { "name": "lint", "command": "pnpm run lint", "exitCode": 0, "output": "" },
227
+ { "name": "build", "command": "pnpm run build", "exitCode": 0, "output": "" },
228
+ { "name": "test", "command": "pnpm run test", "exitCode": 0, "output": "360 tests passed" }
229
+ ],
230
+ "allPassed": true
231
+ }
232
+ }
233
+ ```
234
+ → structural 실패 시 `{ stage: "structural_failed", evolveContext }` → Evolve Flow A 진입
235
+ → structural 통과 시 `{ stage: "contextual", evaluationContext }` — AC별 LLM 검증 지시
236
+
237
+ **Call 3 — Contextual 결과 제출**
238
+ ```json
239
+ {
240
+ "action": "evaluate",
241
+ "sessionId": "...",
242
+ "evaluationResult": {
243
+ "verifications": [
244
+ { "acIndex": 0, "satisfied": true, "evidence": "...", "gaps": [] }
245
+ ],
246
+ "overallScore": 0.92,
247
+ "goalAlignment": 0.88,
248
+ "recommendations": []
249
+ }
250
+ }
251
+ ```
252
+ → `{ status: "completed" }` (score ≥ 0.85, goalAlignment ≥ 0.80)
253
+ → 미달 시 `{ evolveContext }` → Evolve Flow B 진입
254
+
255
+ ---
256
+
257
+ ## Phase 4 — Evolve
258
+
259
+ ### Flow A — Structural Fix
260
+
261
+ ```json
262
+ // 1. Fix context 요청
263
+ { "action": "evolve_fix", "sessionId": "..." }
264
+ → fixContext 반환
265
+
266
+ // 2. Fix 수행 후 결과 제출
267
+ {
268
+ "action": "evolve_fix",
269
+ "sessionId": "...",
270
+ "fixTasks": [
271
+ { "taskId": "fix-0", "failedCommand": "pnpm run lint", "errorOutput": "...", "fixDescription": "...", "artifacts": [] }
272
+ ]
273
+ }
274
+
275
+ // 3. Re-evaluate (Phase 3 반복)
276
+ { "action": "evaluate", "sessionId": "..." }
277
+ ```
278
+
279
+ ### Flow B — Contextual Evolution
280
+
281
+ ```json
282
+ // 1. Evolution context 요청
283
+ { "action": "evolve", "sessionId": "..." }
284
+ → evolveContext (또는 terminateReason으로 종료)
285
+
286
+ // 2. Spec patch 제출 (AC/constraints 수정, goal 변경 불가)
287
+ {
288
+ "action": "evolve_patch",
289
+ "sessionId": "...",
290
+ "specPatch": {
291
+ "acceptanceCriteria": ["수정된 AC..."],
292
+ "constraints": ["추가 제약조건..."]
293
+ }
294
+ }
295
+ → { impactedTaskIds, reExecuteContext }
296
+
297
+ // 3. 영향받은 태스크 재실행 (allTasksCompleted까지 반복)
298
+ {
299
+ "action": "evolve_re_execute",
300
+ "sessionId": "...",
301
+ "reExecuteTaskResult": { "taskId": "task-3", "status": "completed", "output": "...", "artifacts": [] }
302
+ }
303
+
304
+ // 4. Re-evaluate
305
+ { "action": "evaluate", "sessionId": "..." }
306
+ ```
307
+
308
+ ### Flow C — Lateral Thinking (stagnation 감지 시 자동 분기)
309
+
310
+ `evolve` 호출 시 stagnation/oscillation/hard_cap이 감지되면 자동으로 lateral thinking persona로 전환.
311
+
312
+ ```json
313
+ // evolve 호출 → lateralContext 반환
314
+ { "action": "evolve", "sessionId": "..." }
315
+ → { status: "lateral_thinking", lateralContext: { persona, pattern, lateralPrompt, ... } }
316
+
317
+ // Lateral result 제출
318
+ {
319
+ "action": "evolve_lateral_result",
320
+ "sessionId": "...",
321
+ "lateralResult": {
322
+ "persona": "multistability",
323
+ "specPatch": { "acceptanceCriteria": [...] },
324
+ "description": "관점 전환으로 요구사항 재구성"
325
+ }
326
+ }
327
+
328
+ // Re-execute + Re-evaluate (Flow B와 동일)
329
+
330
+ // 다음 persona 요청 (점수 미달 시)
331
+ { "action": "evolve_lateral", "sessionId": "..." }
332
+ ```
333
+
334
+ | Stagnation 패턴 | Persona | 전략 |
335
+ |---|---|---|
336
+ | hard_cap | Multistability | 다른 각도로 보기 |
337
+ | oscillation | Simplicity | 단순하게 줄이기 |
338
+ | no_drift | Reification | 빠진 조각 채우기 |
339
+ | diminishing_returns | Invariance | 성공 패턴 복제 |
340
+
341
+ 4개 persona 소진 → `human_escalation` 반환으로 세션 종료.
342
+
343
+ ### 종료 조건
344
+
345
+ | 조건 | 트리거 |
346
+ |------|--------|
347
+ | `success` | score ≥ 0.85 AND goalAlignment ≥ 0.80 |
348
+ | `stagnation` | 2회 연속 delta < 0.05 |
349
+ | `oscillation` | 2회 연속 점수 역전 |
350
+ | `hard_cap` | structural 3회 + contextual 3회 실패 |
351
+ | `caller` | `{ action: "evolve", terminateReason: "caller" }` |
352
+ | `human_escalation` | 4개 lateral persona 소진 |