autodoc-agent-kit 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (187) hide show
  1. package/README.md +362 -0
  2. package/package.json +49 -0
  3. package/src/core/module.yaml +5 -0
  4. package/src/modules/design/module.yaml +9 -0
  5. package/src/modules/design/skills/brand-guidelines/LICENSE.txt +202 -0
  6. package/src/modules/design/skills/brand-guidelines/SKILL.md +73 -0
  7. package/src/modules/design/skills/frontend-design/LICENSE.txt +177 -0
  8. package/src/modules/design/skills/frontend-design/SKILL.md +42 -0
  9. package/src/modules/design/skills/web-artifacts-builder/SKILL.md +229 -0
  10. package/src/modules/devops/module.yaml +10 -0
  11. package/src/modules/devops/skills/devops-helper/SKILL.md +60 -0
  12. package/src/modules/devops/skills/k8s-helm/SKILL.md +360 -0
  13. package/src/modules/devops/skills/monitoring-observability/SKILL.md +240 -0
  14. package/src/modules/devops/skills/security-auditor/SKILL.md +105 -0
  15. package/src/modules/engineering/module.yaml +22 -0
  16. package/src/modules/engineering/skills/ai-sdk/SKILL.md +314 -0
  17. package/src/modules/engineering/skills/api-designer/SKILL.md +77 -0
  18. package/src/modules/engineering/skills/code-reviewer/SKILL.md +71 -0
  19. package/src/modules/engineering/skills/db-architect/SKILL.md +50 -0
  20. package/src/modules/engineering/skills/debugger/SKILL.md +59 -0
  21. package/src/modules/engineering/skills/docs-generator/SKILL.md +51 -0
  22. package/src/modules/engineering/skills/git-workflow/SKILL.md +258 -0
  23. package/src/modules/engineering/skills/mcp-builder/LICENSE.txt +202 -0
  24. package/src/modules/engineering/skills/mcp-builder/SKILL.md +236 -0
  25. package/src/modules/engineering/skills/mcp-builder/reference/evaluation.md +602 -0
  26. package/src/modules/engineering/skills/mcp-builder/reference/mcp_best_practices.md +249 -0
  27. package/src/modules/engineering/skills/mcp-builder/reference/node_mcp_server.md +970 -0
  28. package/src/modules/engineering/skills/mcp-builder/reference/python_mcp_server.md +719 -0
  29. package/src/modules/engineering/skills/mcp-builder/scripts/connections.py +151 -0
  30. package/src/modules/engineering/skills/mcp-builder/scripts/evaluation.py +373 -0
  31. package/src/modules/engineering/skills/mcp-builder/scripts/example_evaluation.xml +22 -0
  32. package/src/modules/engineering/skills/mcp-builder/scripts/requirements.txt +2 -0
  33. package/src/modules/engineering/skills/nextjs-15/SKILL.md +312 -0
  34. package/src/modules/engineering/skills/perf-optimizer/SKILL.md +60 -0
  35. package/src/modules/engineering/skills/react-19/SKILL.md +257 -0
  36. package/src/modules/engineering/skills/refactorer/SKILL.md +60 -0
  37. package/src/modules/engineering/skills/skill-authoring-workflow/SKILL.md +183 -0
  38. package/src/modules/engineering/skills/skill-creator/LICENSE.txt +202 -0
  39. package/src/modules/engineering/skills/skill-creator/SKILL.md +356 -0
  40. package/src/modules/engineering/skills/skill-creator/references/output-patterns.md +82 -0
  41. package/src/modules/engineering/skills/skill-creator/references/workflows.md +28 -0
  42. package/src/modules/engineering/skills/skill-creator/scripts/__pycache__/quick_validate.cpython-313.pyc +0 -0
  43. package/src/modules/engineering/skills/skill-creator/scripts/init_skill.py +303 -0
  44. package/src/modules/engineering/skills/skill-creator/scripts/package_skill.py +110 -0
  45. package/src/modules/engineering/skills/skill-creator/scripts/quick_validate.py +95 -0
  46. package/src/modules/engineering/skills/typescript/SKILL.md +231 -0
  47. package/src/modules/engineering/skills/zod-4/SKILL.md +223 -0
  48. package/src/modules/product/module.yaml +51 -0
  49. package/src/modules/product/skills/acquisition-channel-advisor/SKILL.md +643 -0
  50. package/src/modules/product/skills/acquisition-channel-advisor/examples/conversation-flow.md +531 -0
  51. package/src/modules/product/skills/ai-shaped-readiness-advisor/SKILL.md +923 -0
  52. package/src/modules/product/skills/altitude-horizon-framework/SKILL.md +250 -0
  53. package/src/modules/product/skills/altitude-horizon-framework/examples/sample.md +85 -0
  54. package/src/modules/product/skills/business-health-diagnostic/SKILL.md +783 -0
  55. package/src/modules/product/skills/company-research/SKILL.md +385 -0
  56. package/src/modules/product/skills/company-research/examples/sample.md +164 -0
  57. package/src/modules/product/skills/company-research/template.md +60 -0
  58. package/src/modules/product/skills/context-engineering-advisor/SKILL.md +763 -0
  59. package/src/modules/product/skills/customer-journey-map/SKILL.md +346 -0
  60. package/src/modules/product/skills/customer-journey-map/examples/meta-product-manager-skills.md +40 -0
  61. package/src/modules/product/skills/customer-journey-map/examples/sample.md +33 -0
  62. package/src/modules/product/skills/customer-journey-map/template.md +28 -0
  63. package/src/modules/product/skills/customer-journey-mapping-workshop/SKILL.md +523 -0
  64. package/src/modules/product/skills/director-readiness-advisor/SKILL.md +351 -0
  65. package/src/modules/product/skills/director-readiness-advisor/examples/conversation-flow.md +96 -0
  66. package/src/modules/product/skills/discovery-interview-prep/SKILL.md +410 -0
  67. package/src/modules/product/skills/discovery-process/SKILL.md +504 -0
  68. package/src/modules/product/skills/discovery-process/examples/sample.md +60 -0
  69. package/src/modules/product/skills/discovery-process/template.md +39 -0
  70. package/src/modules/product/skills/eol-message/SKILL.md +348 -0
  71. package/src/modules/product/skills/eol-message/examples/sample.md +87 -0
  72. package/src/modules/product/skills/eol-message/template.md +74 -0
  73. package/src/modules/product/skills/epic-breakdown-advisor/SKILL.md +665 -0
  74. package/src/modules/product/skills/epic-hypothesis/SKILL.md +277 -0
  75. package/src/modules/product/skills/epic-hypothesis/examples/sample.md +104 -0
  76. package/src/modules/product/skills/epic-hypothesis/template.md +30 -0
  77. package/src/modules/product/skills/executive-onboarding-playbook/SKILL.md +280 -0
  78. package/src/modules/product/skills/executive-onboarding-playbook/examples/sample.md +116 -0
  79. package/src/modules/product/skills/feature-investment-advisor/SKILL.md +639 -0
  80. package/src/modules/product/skills/feature-investment-advisor/examples/conversation-flow.md +538 -0
  81. package/src/modules/product/skills/finance-based-pricing-advisor/SKILL.md +763 -0
  82. package/src/modules/product/skills/finance-metrics-quickref/SKILL.md +309 -0
  83. package/src/modules/product/skills/jobs-to-be-done/SKILL.md +370 -0
  84. package/src/modules/product/skills/jobs-to-be-done/examples/sample.md +80 -0
  85. package/src/modules/product/skills/jobs-to-be-done/template.md +65 -0
  86. package/src/modules/product/skills/lean-ux-canvas/SKILL.md +561 -0
  87. package/src/modules/product/skills/lean-ux-canvas/examples/sample.md +88 -0
  88. package/src/modules/product/skills/lean-ux-canvas/template.md +32 -0
  89. package/src/modules/product/skills/opportunity-solution-tree/SKILL.md +420 -0
  90. package/src/modules/product/skills/opportunity-solution-tree/examples/sample.md +104 -0
  91. package/src/modules/product/skills/opportunity-solution-tree/template.md +33 -0
  92. package/src/modules/product/skills/pestel-analysis/SKILL.md +376 -0
  93. package/src/modules/product/skills/pestel-analysis/examples/sample.md +143 -0
  94. package/src/modules/product/skills/pestel-analysis/template.md +53 -0
  95. package/src/modules/product/skills/pol-probe/SKILL.md +217 -0
  96. package/src/modules/product/skills/pol-probe/examples/sample.md +136 -0
  97. package/src/modules/product/skills/pol-probe/template.md +59 -0
  98. package/src/modules/product/skills/pol-probe-advisor/SKILL.md +492 -0
  99. package/src/modules/product/skills/positioning-statement/SKILL.md +230 -0
  100. package/src/modules/product/skills/positioning-statement/examples/sample.md +51 -0
  101. package/src/modules/product/skills/positioning-statement/template.md +25 -0
  102. package/src/modules/product/skills/positioning-workshop/SKILL.md +424 -0
  103. package/src/modules/product/skills/prd-development/SKILL.md +655 -0
  104. package/src/modules/product/skills/prd-development/examples/sample.md +43 -0
  105. package/src/modules/product/skills/prd-development/template.md +55 -0
  106. package/src/modules/product/skills/press-release/SKILL.md +269 -0
  107. package/src/modules/product/skills/press-release/examples/sample.md +73 -0
  108. package/src/modules/product/skills/press-release/template.md +39 -0
  109. package/src/modules/product/skills/prioritization-advisor/SKILL.md +448 -0
  110. package/src/modules/product/skills/problem-framing-canvas/SKILL.md +466 -0
  111. package/src/modules/product/skills/problem-framing-canvas/examples/sample.md +58 -0
  112. package/src/modules/product/skills/problem-framing-canvas/template.md +22 -0
  113. package/src/modules/product/skills/problem-statement/SKILL.md +246 -0
  114. package/src/modules/product/skills/problem-statement/examples/sample.md +82 -0
  115. package/src/modules/product/skills/problem-statement/template.md +37 -0
  116. package/src/modules/product/skills/product-strategy-session/SKILL.md +426 -0
  117. package/src/modules/product/skills/product-strategy-session/examples/sample.md +67 -0
  118. package/src/modules/product/skills/product-strategy-session/template.md +38 -0
  119. package/src/modules/product/skills/proto-persona/SKILL.md +326 -0
  120. package/src/modules/product/skills/proto-persona/examples/sample.md +97 -0
  121. package/src/modules/product/skills/proto-persona/template.md +45 -0
  122. package/src/modules/product/skills/recommendation-canvas/SKILL.md +375 -0
  123. package/src/modules/product/skills/recommendation-canvas/examples/sample.md +94 -0
  124. package/src/modules/product/skills/recommendation-canvas/template.md +86 -0
  125. package/src/modules/product/skills/roadmap-planning/SKILL.md +505 -0
  126. package/src/modules/product/skills/roadmap-planning/examples/sample.md +62 -0
  127. package/src/modules/product/skills/roadmap-planning/template.md +30 -0
  128. package/src/modules/product/skills/saas-economics-efficiency-metrics/SKILL.md +694 -0
  129. package/src/modules/product/skills/saas-economics-efficiency-metrics/examples/cash-trap.md +365 -0
  130. package/src/modules/product/skills/saas-economics-efficiency-metrics/examples/healthy-unit-economics.md +279 -0
  131. package/src/modules/product/skills/saas-economics-efficiency-metrics/template.md +263 -0
  132. package/src/modules/product/skills/saas-revenue-growth-metrics/SKILL.md +630 -0
  133. package/src/modules/product/skills/saas-revenue-growth-metrics/examples/healthy-saas.md +131 -0
  134. package/src/modules/product/skills/saas-revenue-growth-metrics/examples/warning-signs.md +229 -0
  135. package/src/modules/product/skills/saas-revenue-growth-metrics/template.md +192 -0
  136. package/src/modules/product/skills/storyboard/SKILL.md +252 -0
  137. package/src/modules/product/skills/storyboard/examples/sample.md +71 -0
  138. package/src/modules/product/skills/storyboard/template.md +41 -0
  139. package/src/modules/product/skills/tam-sam-som-calculator/SKILL.md +392 -0
  140. package/src/modules/product/skills/tam-sam-som-calculator/examples/sample.md +142 -0
  141. package/src/modules/product/skills/tam-sam-som-calculator/scripts/market-sizing.py +95 -0
  142. package/src/modules/product/skills/tam-sam-som-calculator/template.md +35 -0
  143. package/src/modules/product/skills/user-story/SKILL.md +272 -0
  144. package/src/modules/product/skills/user-story/examples/sample.md +110 -0
  145. package/src/modules/product/skills/user-story/scripts/user-story-template.py +65 -0
  146. package/src/modules/product/skills/user-story/template.md +32 -0
  147. package/src/modules/product/skills/user-story-mapping/SKILL.md +285 -0
  148. package/src/modules/product/skills/user-story-mapping/examples/sample.md +77 -0
  149. package/src/modules/product/skills/user-story-mapping/template.md +41 -0
  150. package/src/modules/product/skills/user-story-mapping-workshop/SKILL.md +477 -0
  151. package/src/modules/product/skills/user-story-mapping-workshop/template.md +28 -0
  152. package/src/modules/product/skills/user-story-splitting/SKILL.md +303 -0
  153. package/src/modules/product/skills/user-story-splitting/examples/sample.md +147 -0
  154. package/src/modules/product/skills/user-story-splitting/template.md +37 -0
  155. package/src/modules/product/skills/vp-cpo-readiness-advisor/SKILL.md +409 -0
  156. package/src/modules/product/skills/vp-cpo-readiness-advisor/examples/conversation-flow.md +95 -0
  157. package/src/modules/product/skills/workshop-facilitation/SKILL.md +87 -0
  158. package/src/modules/productivity/module.yaml +9 -0
  159. package/src/modules/productivity/skills/doc-coauthoring/SKILL.md +375 -0
  160. package/src/modules/productivity/skills/internal-comms/LICENSE.txt +202 -0
  161. package/src/modules/productivity/skills/internal-comms/SKILL.md +32 -0
  162. package/src/modules/productivity/skills/internal-comms/examples/3p-updates.md +47 -0
  163. package/src/modules/productivity/skills/internal-comms/examples/company-newsletter.md +65 -0
  164. package/src/modules/productivity/skills/internal-comms/examples/faq-answers.md +30 -0
  165. package/src/modules/productivity/skills/internal-comms/examples/general-comms.md +16 -0
  166. package/src/modules/productivity/skills/technical-writing/SKILL.md +266 -0
  167. package/src/modules/qa/module.yaml +9 -0
  168. package/src/modules/qa/skills/test-strategy/SKILL.md +263 -0
  169. package/src/modules/qa/skills/test-writer/SKILL.md +57 -0
  170. package/src/modules/qa/skills/webapp-testing/LICENSE.txt +202 -0
  171. package/src/modules/qa/skills/webapp-testing/SKILL.md +96 -0
  172. package/src/modules/qa/skills/webapp-testing/examples/console_logging.py +35 -0
  173. package/src/modules/qa/skills/webapp-testing/examples/element_discovery.py +40 -0
  174. package/src/modules/qa/skills/webapp-testing/examples/static_html_automation.py +33 -0
  175. package/src/modules/qa/skills/webapp-testing/scripts/with_server.py +106 -0
  176. package/tools/autodoc-npx-wrapper.js +34 -0
  177. package/tools/cli/autodoc-cli.js +55 -0
  178. package/tools/cli/commands/install.js +36 -0
  179. package/tools/cli/commands/status.js +35 -0
  180. package/tools/cli/commands/uninstall.js +60 -0
  181. package/tools/cli/installers/lib/core/installer.js +164 -0
  182. package/tools/cli/installers/lib/core/manifest.js +49 -0
  183. package/tools/cli/installers/lib/ide/manager.js +112 -0
  184. package/tools/cli/installers/lib/ide/platform-codes.yaml +207 -0
  185. package/tools/cli/installers/lib/modules/manager.js +59 -0
  186. package/tools/cli/lib/ui.js +199 -0
  187. package/tools/cli/lib/welcome.js +82 -0
@@ -0,0 +1,492 @@
1
+ ---
2
+ name: pol-probe-advisor
3
+ description: Select the right Proof of Life (PoL) probe based on hypothesis, risk, and resources. Use this to match the validation method to the real learning goal, not tooling comfort.
4
+ intent: >-
5
+ Guide product managers through selecting the right **Proof of Life (PoL) probe** type (of 5 flavors) based on their hypothesis, risk, and available resources. Use this when you need to eliminate a specific risk or test a narrow hypothesis, but aren't sure which validation method to use. This interactive skill ensures you match the cheapest prototype to the harshest truth—not the prototype you're most comfortable building.
6
+ type: interactive
7
+ best_for:
8
+ - "Choosing the cheapest useful validation method for a risky idea"
9
+ - "Matching a hypothesis to the right Proof of Life probe"
10
+ - "Avoiding overbuilding before learning the harsh truth"
11
+ scenarios:
12
+ - "Which Proof of Life probe should I use to test demand for this idea?"
13
+ - "Help me pick the right validation method for an onboarding hypothesis"
14
+ - "I have a risky AI concept. What PoL probe should I run first?"
15
+ ---
16
+
17
+ ## Purpose
18
+
19
+ Guide product managers through selecting the right **Proof of Life (PoL) probe** type (of 5 flavors) based on their hypothesis, risk, and available resources. Use this when you need to eliminate a specific risk or test a narrow hypothesis, but aren't sure which validation method to use. This interactive skill ensures you match the cheapest prototype to the harshest truth—not the prototype you're most comfortable building.
20
+
21
+ This is **not** a tool for deciding *if* you should validate (you should). It's a decision framework for choosing *how* to validate most effectively.
22
+
23
+ ## Key Concepts
24
+
25
+ ### The Core Problem: Method-Hypothesis Mismatch
26
+
27
+ **Common failure mode:** PMs choose validation methods based on tooling comfort ("I know Figma, so I'll design a prototype") rather than learning goal. Result: validate the wrong thing, miss the actual risk.
28
+
29
+ **Solution:** Work backwards from the hypothesis. Ask: "What specific risk am I eliminating? What's the cheapest path to harsh truth?"
30
+
31
+ ---
32
+
33
+ ### The 5 PoL Probe Flavors (Quick Reference)
34
+
35
+ | Type | Core Question | Best For | Timeline |
36
+ |------|---------------|----------|----------|
37
+ | **Feasibility Check** | "Can we build this?" | Technical unknowns, API dependencies, data integrity | 1-2 days |
38
+ | **Task-Focused Test** | "Can users complete this job without friction?" | Critical UI moments, field labels, decision points | 2-5 days |
39
+ | **Narrative Prototype** | "Does this workflow earn stakeholder buy-in?" | Storytelling, explaining complex flows, alignment | 1-3 days |
40
+ | **Synthetic Data Simulation** | "Can we model this without production risk?" | Edge cases, unknown-unknowns, statistical modeling | 2-4 days |
41
+ | **Vibe-Coded PoL Probe** | "Will this solution survive real user contact?" | Workflow/UX validation with real interactions | 2-3 days |
42
+
43
+ **Golden Rule:** *"Use the cheapest prototype that tells the harshest truth."*
44
+
45
+ ---
46
+
47
+ ### Anti-Patterns (What This Is NOT)
48
+
49
+ - **Not "build the prototype you're comfortable with":** Match method to hypothesis, not skillset
50
+ - **Not "pick based on stakeholder preference":** Optimize for learning, not internal politics
51
+ - **Not "choose the most impressive option":** Impressive ≠ informative
52
+ - **Not "default to code":** Writing code should be your last resort, not your first
53
+
54
+ ---
55
+
56
+ ### When to Use This Skill
57
+
58
+ ✅ **Use this when:**
59
+ - You have a clear hypothesis but don't know which validation method to use
60
+ - You're unsure whether to build code, create a video, or run a simulation
61
+ - You need to eliminate a specific risk quickly (within days)
62
+ - You want to avoid prototype theater
63
+
64
+ ❌ **Don't use this when:**
65
+ - You don't have a hypothesis yet (use `problem-statement.md` or `problem-framing-canvas.md` first)
66
+ - You're trying to impress executives (that's not validation)
67
+ - You already know the answer (confirmation bias)
68
+ - You need to ship an MVP (this is for pre-MVP reconnaissance)
69
+
70
+ ---
71
+
72
+ ### Facilitation Source of Truth
73
+
74
+ Use [`workshop-facilitation`](../workshop-facilitation/SKILL.md) as the default interaction protocol for this skill.
75
+
76
+ It defines:
77
+ - session heads-up + entry mode (Guided, Context dump, Best guess)
78
+ - one-question turns with plain-language prompts
79
+ - progress labels (for example, Context Qx/8 and Scoring Qx/5)
80
+ - interruption handling and pause/resume behavior
81
+ - numbered recommendations at decision points
82
+ - quick-select numbered response options for regular questions (include `Other (specify)` when useful)
83
+
84
+ This file defines the domain-specific assessment content. If there is a conflict, follow this file's domain logic.
85
+
86
+ ## Application
87
+
88
+ This interactive skill uses **adaptive questioning** to recommend the right PoL probe type based on your context.
89
+
90
+ ---
91
+
92
+ ### Step 0: Gather Context
93
+
94
+ **Agent asks:**
95
+
96
+ Let's figure out which PoL probe type is right for your validation needs. First, I need some context:
97
+
98
+ **1. What hypothesis are you testing?**
99
+ (Describe in one sentence, or use "If [we do X] for [persona], then [outcome]" format)
100
+
101
+ **2. What specific risk are you trying to eliminate?**
102
+ Examples:
103
+ - Technical feasibility ("Can our API handle real-time data?")
104
+ - User task completion ("Can users find the 'export' button?")
105
+ - Stakeholder alignment ("Will leadership approve this direction?")
106
+ - Edge case behavior ("How does the system handle duplicate entries?")
107
+ - Workflow validation ("Will users complete the 3-step onboarding?")
108
+
109
+ **3. What's your timeline?**
110
+ - Hours (same-day validation)
111
+ - 1-2 days (quick spike)
112
+ - 3-5 days (moderate effort)
113
+ - 1 week+ (too long—consider breaking into smaller probes)
114
+
115
+ **4. What resources do you have available?**
116
+ Examples:
117
+ - Engineering capacity (1 dev for 1 day)
118
+ - Design tools (Figma, Loom, Sora)
119
+ - AI/no-code tools (ChatGPT Canvas, Replit, Airtable)
120
+ - User access (10 users from waitlist, 5 beta customers, etc.)
121
+ - Budget (for UsabilityHub, Optimal Workshop, etc.)
122
+
123
+ ---
124
+
125
+ ### Step 1: Identify the Core Question
126
+
127
+ **Agent synthesizes user input and asks:**
128
+
129
+ Based on your hypothesis and risk, which of these core questions are you really trying to answer?
130
+
131
+ **Offer 5 options (aligned to probe types):**
132
+
133
+ 1. **"Can we build this?"** — You're uncertain about technical feasibility, API integration, data availability, or third-party dependencies
134
+ 2. **"Can users complete this job without friction?"** — You're validating critical UI moments, field labels, navigation, or decision points
135
+ 3. **"Does this workflow earn stakeholder buy-in?"** — You need to explain a complex flow, align leadership, or "tell vs. test" the story
136
+ 4. **"Can we model this without production risk?"** — You need to explore edge cases, simulate user behavior, or test prompt logic safely
137
+ 5. **"Will this solution survive real user contact?"** — You need users to interact with a semi-functional workflow to catch UX/workflow issues
138
+
139
+ **User response:** [Select one number, or describe if none fit]
140
+
141
+ ---
142
+
143
+ ### Step 2: Recommend PoL Probe Type
144
+
145
+ **Based on user selection, agent recommends the matching probe type:**
146
+
147
+ ---
148
+
149
+ #### Option 1 Selected: "Can we build this?"
150
+ → **Recommended Probe: Feasibility Check**
151
+
152
+ **What it is:**
153
+ A 1-2 day spike-and-delete test to surface technical risk. Not meant to impress anyone—meant to reveal blockers fast.
154
+
155
+ **Methods:**
156
+ - GenAI prompt chains (test if AI can handle your use case)
157
+ - API sniff tests (verify third-party integrations work)
158
+ - Data integrity sweeps (check if your data supports the feature)
159
+ - Third-party tool evaluation (test if Zapier/Stripe/Twilio does what you think)
160
+
161
+ **Timeline:** 1-2 days
162
+
163
+ **Tools:**
164
+ - ChatGPT/Claude (prompt testing)
165
+ - Postman/Insomnia (API testing)
166
+ - Jupyter notebooks (data exploration)
167
+ - Proof-of-concept scripts (throwaway code)
168
+
169
+ **Success Criteria Example:**
170
+ - **Pass:** API returns expected data format in <200ms
171
+ - **Fail:** API times out, or data structure incompatible with our schema
172
+ - **Learn:** Identify specific technical blocker
173
+
174
+ **Disposal Plan:** Delete all spike code after documenting findings.
175
+
176
+ **Next Step:** Would you like me to generate a `pol-probe` artifact documenting this feasibility check?
177
+
178
+ ---
179
+
180
+ #### Option 2 Selected: "Can users complete this job without friction?"
181
+ → **Recommended Probe: Task-Focused Test**
182
+
183
+ **What it is:**
184
+ Validate critical moments—field labels, decision points, navigation, drop-off zones—using specialized testing tools. Focus on **observable task completion**, not opinions.
185
+
186
+ **Methods:**
187
+ - Optimal Workshop (tree testing, card sorting)
188
+ - UsabilityHub (5-second tests, click tests, preference tests)
189
+ - Maze (prototype testing with heatmaps)
190
+ - Loom-recorded task walkthroughs (ask users to "think aloud")
191
+
192
+ **Timeline:** 2-5 days
193
+
194
+ **Tools:**
195
+ - Optimal Workshop ($200/month)
196
+ - UsabilityHub ($100-300/month)
197
+ - Maze (free tier available)
198
+ - Loom (free for basic)
199
+
200
+ **Success Criteria Example:**
201
+ - **Pass:** 80%+ users complete task in <2 minutes
202
+ - **Fail:** <60% completion, or 3+ users get stuck on same step
203
+ - **Learn:** Identify exact friction point (specific field, button, etc.)
204
+
205
+ **Disposal Plan:** Archive session recordings, document learnings, delete test prototype.
206
+
207
+ **Next Step:** Would you like me to generate a `pol-probe` artifact documenting this task-focused test?
208
+
209
+ ---
210
+
211
+ #### Option 3 Selected: "Does this workflow earn stakeholder buy-in?"
212
+ → **Recommended Probe: Narrative Prototype**
213
+
214
+ **What it is:**
215
+ Tell the story, don't test the interface. Use video walkthroughs or slideware storyboards to explain workflows and measure interest. This is "tell vs. test"—you're validating the narrative, not the UI.
216
+
217
+ **Methods:**
218
+ - Loom walkthroughs (screen recording with voiceover)
219
+ - Sora/Synthesia/Veo3 (AI-generated explainer videos)
220
+ - Slideware storyboards (PowerPoint/Keynote with illustrations)
221
+ - Storyboard sketches (use `storyboard.md` component skill)
222
+
223
+ **Timeline:** 1-3 days
224
+
225
+ **Tools:**
226
+ - Loom (free, fast)
227
+ - Sora/Synthesia (text-to-video, paid)
228
+ - PowerPoint/Keynote (slideware animation)
229
+ - Figma (static storyboard frames)
230
+
231
+ **Success Criteria Example:**
232
+ - **Pass:** 8/10 stakeholders say "I'd use this" or "This solves the problem"
233
+ - **Fail:** Stakeholders ask "Why would I use this?" or suggest alternative approaches
234
+ - **Learn:** Identify which part of the narrative resonates (or doesn't)
235
+
236
+ **Disposal Plan:** Archive video, document feedback, delete supporting files.
237
+
238
+ **Next Step:** Would you like me to generate a `pol-probe` artifact documenting this narrative prototype?
239
+
240
+ ---
241
+
242
+ #### Option 4 Selected: "Can we model this without production risk?"
243
+ → **Recommended Probe: Synthetic Data Simulation**
244
+
245
+ **What it is:**
246
+ Use simulated users, synthetic data, or prompt logic testing to explore edge cases and unknown-unknowns without touching production. Think "wind tunnel testing, cheaper than postmortem."
247
+
248
+ **Methods:**
249
+ - Synthea (synthetic patient data generation)
250
+ - DataStax LangFlow (test prompt logic without real users)
251
+ - Monte Carlo simulations (model probabilistic outcomes)
252
+ - Synthetic user behavior scripts (simulate click patterns, load testing)
253
+
254
+ **Timeline:** 2-4 days
255
+
256
+ **Tools:**
257
+ - Synthea (open-source, healthcare)
258
+ - DataStax LangFlow (prompt chain testing)
259
+ - Python + Faker library (generate synthetic data)
260
+ - Locust/k6 (load testing with synthetic users)
261
+
262
+ **Success Criteria Example:**
263
+ - **Pass:** System handles 10,000 synthetic users with <1% error rate
264
+ - **Fail:** Edge cases cause crashes or incorrect outputs
265
+ - **Learn:** Identify which edge cases break the system
266
+
267
+ **Disposal Plan:** Delete synthetic data, archive findings, document edge cases.
268
+
269
+ **Next Step:** Would you like me to generate a `pol-probe` artifact documenting this synthetic data simulation?
270
+
271
+ ---
272
+
273
+ #### Option 5 Selected: "Will this solution survive real user contact?"
274
+ → **Recommended Probe: Vibe-Coded PoL Probe**
275
+
276
+ **What it is:**
277
+ A Frankensoft stack (ChatGPT Canvas + Replit + Airtable) that creates just enough illusion for users to interact with a semi-functional workflow. Not production-grade—just enough to catch UX/workflow signals in 48 hours.
278
+
279
+ **⚠️ Warning:** This is the riskiest probe type. It looks real enough to confuse momentum with maturity. Use only when you need real user contact and other methods won't suffice.
280
+
281
+ **Methods:**
282
+ - ChatGPT Canvas (quick UI generation)
283
+ - Replit (host throwaway code)
284
+ - Airtable (fake database)
285
+ - Carrd/Webflow (landing page + workflow mockup)
286
+
287
+ **Timeline:** 2-3 days
288
+
289
+ **Stack Example:**
290
+ - ChatGPT Canvas: Generate form UI
291
+ - Replit: Host simple Flask/Node app
292
+ - Airtable: Capture form submissions
293
+ - Loom: Record user sessions for post-mortem analysis
294
+
295
+ **Success Criteria Example:**
296
+ - **Pass:** 8/10 users complete workflow, 0 critical confusion moments
297
+ - **Fail:** Users get stuck, ask "Is this broken?", or abandon mid-flow
298
+ - **Learn:** Identify exact step where users lose confidence
299
+
300
+ **Disposal Plan:** Delete all code after user sessions, archive Loom recordings, document learnings.
301
+
302
+ **Next Step:** Would you like me to generate a `pol-probe` artifact documenting this vibe-coded probe?
303
+
304
+ ---
305
+
306
+ ### Step 3: Apply Component Skill
307
+
308
+ **Agent offers:**
309
+
310
+ I recommend using **[selected probe type]** for your hypothesis. Would you like me to:
311
+
312
+ 1. **Generate a complete `pol-probe` artifact** (using the `pol-probe.md` template) with your hypothesis, success criteria, timeline, and disposal plan?
313
+ 2. **Walk through the execution steps** for this probe type (tools, methods, user recruitment)?
314
+ 3. **Refine the hypothesis** before proceeding (if it seems too broad or ambiguous)?
315
+
316
+ **User selects option, and agent executes accordingly.**
317
+
318
+ ---
319
+
320
+ ### Step 4: Refine or Iterate (Optional)
321
+
322
+ **If user is uncertain or hypothesis is too broad:**
323
+
324
+ **Agent asks:**
325
+
326
+ Your hypothesis seems broad or tests multiple things at once. PoL probes work best when they're narrow and falsifiable. Let me ask a few clarifying questions:
327
+
328
+ **Clarifying Questions:**
329
+ 1. **What's the smallest thing you could test first?** (Isolate one variable)
330
+ 2. **What would "failure" look like?** (If you can't describe failure, you can't measure success)
331
+ 3. **Is this testing user behavior, technical feasibility, or stakeholder alignment?** (Pick one focus)
332
+
333
+ **Agent then re-recommends probe type based on refined hypothesis.**
334
+
335
+ ---
336
+
337
+ ## Examples
338
+
339
+ ### Example 1: Choosing Task-Focused Test
340
+
341
+ **User Context:**
342
+ - **Hypothesis:** "Users can complete our checkout flow in under 3 minutes"
343
+ - **Risk:** High cart abandonment on mobile
344
+ - **Timeline:** 3 days
345
+ - **Resources:** UsabilityHub account, 20 users from waitlist
346
+
347
+ **Agent Reasoning:**
348
+ - Core question: "Can users complete this job without friction?"
349
+ - Risk is task completion (observable behavior), not technical feasibility
350
+ - Timeline fits Task-Focused Test (2-5 days)
351
+
352
+ **Recommendation:** Task-Focused Test using UsabilityHub
353
+ - Method: Maze prototype testing with mobile users
354
+ - Success criteria: 80%+ complete checkout in <3 minutes
355
+ - Disposal: Delete prototype after analyzing heatmaps
356
+
357
+ ---
358
+
359
+ ### Example 2: Choosing Feasibility Check
360
+
361
+ **User Context:**
362
+ - **Hypothesis:** "We can auto-tag support tickets using GPT-4 with <5% error rate"
363
+ - **Risk:** Unknown if AI can handle industry-specific jargon
364
+ - **Timeline:** 1 day
365
+ - **Resources:** 100 past support tickets, ChatGPT API access
366
+
367
+ **Agent Reasoning:**
368
+ - Core question: "Can we build this?"
369
+ - Risk is technical feasibility (AI capability), not user behavior
370
+ - Timeline is short (1 day = spike territory)
371
+
372
+ **Recommendation:** Feasibility Check using prompt engineering spike
373
+ - Method: Run 100 tickets through GPT-4 prompt chain, measure error rate
374
+ - Success criteria: <5% misclassified tickets
375
+ - Disposal: Delete spike code after documenting findings
376
+
377
+ **Outcome:** Error rate was 18%. Decided NOT to build feature. Saved 6 weeks of development.
378
+
379
+ ---
380
+
381
+ ### Example 3: Choosing Narrative Prototype (NOT Vibe-Coded)
382
+
383
+ **User Context:**
384
+ - **Hypothesis:** "Enterprise buyers will understand our multi-tenant security model"
385
+ - **Risk:** Complex technical concept needs explanation
386
+ - **Timeline:** 2 days
387
+ - **Resources:** Loom, 5 enterprise prospects
388
+
389
+ **Agent Reasoning:**
390
+ - Core question: "Does this workflow earn stakeholder buy-in?"
391
+ - Risk is comprehension/alignment, not task completion
392
+ - Building a functional prototype would over-invest (2 days = just explain it)
393
+
394
+ **Recommendation:** Narrative Prototype using Loom walkthrough
395
+ - Method: 5-minute Loom explaining security model with diagrams
396
+ - Success criteria: 4/5 prospects say "I understand this" without follow-up questions
397
+ - Disposal: Archive video after 5 sessions
398
+
399
+ **Why NOT Vibe-Coded Probe:** You don't need users to interact with security settings—you need them to *understand* the concept. Narrative is cheaper and faster.
400
+
401
+ ---
402
+
403
+ ### Example 4: Avoiding Prototype Theater
404
+
405
+ **User Context:**
406
+ - **Hypothesis:** "Executives will approve budget if we show a polished demo"
407
+ - **Risk:** Internal politics
408
+ - **Timeline:** 3 weeks
409
+ - **Resources:** Full design team
410
+
411
+ **Agent Response:**
412
+
413
+ ⚠️ **This is prototype theater, not a PoL probe.**
414
+
415
+ You're testing internal politics, not user behavior or technical feasibility. PoL probes eliminate product risk, not organizational risk.
416
+
417
+ **Recommendation:** Skip the prototype entirely. Instead:
418
+ 1. Use a **Narrative Prototype** (1-day Loom walkthrough) to explain the concept
419
+ 2. Test with **5 target users** (not executives) to validate the hypothesis
420
+ 3. Present user feedback to executives, not a polished demo
421
+
422
+ If executives need a demo, build it *after* you've validated the hypothesis with real users.
423
+
424
+ ---
425
+
426
+ ## Common Pitfalls
427
+
428
+ ### 1. **Choosing Based on Tooling Comfort**
429
+ **Failure Mode:** "I know Figma, so I'll design a UI prototype" (even if design isn't the risk).
430
+
431
+ **Consequence:** Validate the wrong thing; miss the actual risk.
432
+
433
+ **Fix:** Answer the core question *first*, then pick the method. If you need a Feasibility Check but only know design tools, pair with an engineer for 1 day.
434
+
435
+ ---
436
+
437
+ ### 2. **Defaulting to Code**
438
+ **Failure Mode:** "Let's just build it and see what happens."
439
+
440
+ **Consequence:** 2 weeks of development before learning you tested the wrong hypothesis.
441
+
442
+ **Fix:** Ask: "What's the cheapest prototype that tells the harshest truth?" Usually it's NOT code.
443
+
444
+ ---
445
+
446
+ ### 3. **Confusing Vibe-Coded Probes with MVPs**
447
+ **Failure Mode:** Vibe-Coded probe "looks real," so team treats it like production code.
448
+
449
+ **Consequence:** Scope creep, technical debt, resistance to disposal.
450
+
451
+ **Fix:** Set disposal date before building. Vibe-Coded probes are **Frankensoft by design**—celebrate the jank, delete after learning.
452
+
453
+ ---
454
+
455
+ ### 4. **Testing Multiple Things at Once**
456
+ **Failure Mode:** "Let's test the workflow, the pricing, and the UI in one probe."
457
+
458
+ **Consequence:** Ambiguous results—you won't know which variable caused failure.
459
+
460
+ **Fix:** One probe, one hypothesis. If you have 3 hypotheses, run 3 probes.
461
+
462
+ ---
463
+
464
+ ### 5. **Skipping Success Criteria**
465
+ **Failure Mode:** "We'll know it when we see it."
466
+
467
+ **Consequence:** No harsh truth—just opinions and vanity metrics.
468
+
469
+ **Fix:** Write success criteria *before* building. Define "pass," "fail," and "learn" thresholds.
470
+
471
+ ---
472
+
473
+ ## References
474
+
475
+ ### Related Skills
476
+ - **[pol-probe](../pol-probe/SKILL.md)** (Component) — Template for documenting PoL probes
477
+ - **[problem-statement](../problem-statement/SKILL.md)** (Component) — Frame problem before choosing validation method
478
+ - **[problem-framing-canvas](../problem-framing-canvas/SKILL.md)** (Interactive) — MITRE Problem Framing before validation
479
+ - **[discovery-process](../discovery-process/SKILL.md)** (Workflow) — Use PoL probes in validation phase
480
+ - **[epic-hypothesis](../epic-hypothesis/SKILL.md)** (Component) — Turn epics into testable hypotheses
481
+
482
+ ### External Frameworks
483
+ - **Jeff Patton** — *User Story Mapping* (lean validation principles)
484
+ - **Marty Cagan** — *Inspired* (2014 prototype flavors framework)
485
+ - **Dean Peters** — [*Vibe First, Validate Fast, Verify Fit*](https://deanpeters.substack.com/p/vibe-first-validate-fast-verify-fit) (Dean Peters' Substack, 2025)
486
+
487
+ ### Tools by Probe Type
488
+ - **Feasibility:** ChatGPT/Claude, Postman, Jupyter
489
+ - **Task-Focused:** Optimal Workshop, UsabilityHub, Maze
490
+ - **Narrative:** Loom, Sora, Synthesia, PowerPoint
491
+ - **Synthetic Data:** Synthea, DataStax LangFlow, Faker
492
+ - **Vibe-Coded:** ChatGPT Canvas, Replit, Airtable, Carrd