code-ai-installer 4.0.0 → 4.0.1-b

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (471) hide show
  1. package/README.md +83 -67
  2. package/dist/index.js +2 -0
  3. package/dist/mcp/audit_ledger.d.ts +12 -0
  4. package/dist/mcp/audit_ledger.js +82 -0
  5. package/dist/mcp/cli.js +7 -1
  6. package/dist/mcp/config.d.ts +23 -0
  7. package/dist/mcp/config.js +44 -6
  8. package/dist/mcp/index.d.ts +1 -2
  9. package/dist/mcp/index.js +1 -2
  10. package/dist/mcp/paths.d.ts +20 -2
  11. package/dist/mcp/paths.js +29 -5
  12. package/dist/mcp/proposal_dedup.d.ts +32 -0
  13. package/dist/mcp/proposal_dedup.js +102 -0
  14. package/dist/mcp/proposal_store.d.ts +18 -0
  15. package/dist/mcp/proposal_store.js +74 -0
  16. package/dist/mcp/scorecard.d.ts +140 -0
  17. package/dist/mcp/scorecard.js +103 -0
  18. package/dist/mcp/skill_invocations.d.ts +15 -0
  19. package/dist/mcp/skill_invocations.js +28 -0
  20. package/dist/mcp/task_state.d.ts +77 -2
  21. package/dist/mcp/tools/_subprocess.d.ts +16 -0
  22. package/dist/mcp/tools/_subprocess.js +56 -0
  23. package/dist/mcp/tools/advance_gate.js +2 -2
  24. package/dist/mcp/tools/aggregate_run_metrics.d.ts +19 -0
  25. package/dist/mcp/tools/aggregate_run_metrics.js +139 -0
  26. package/dist/mcp/tools/apply_diff.d.ts +2 -0
  27. package/dist/mcp/tools/apply_diff.js +29 -0
  28. package/dist/mcp/tools/audit_bilocale_parity.d.ts +2 -0
  29. package/dist/mcp/tools/audit_bilocale_parity.js +146 -0
  30. package/dist/mcp/tools/audit_budget_compliance.d.ts +35 -0
  31. package/dist/mcp/tools/audit_budget_compliance.js +172 -0
  32. package/dist/mcp/tools/build.d.ts +2 -0
  33. package/dist/mcp/tools/build.js +47 -0
  34. package/dist/mcp/tools/check_lint.d.ts +2 -0
  35. package/dist/mcp/tools/check_lint.js +23 -0
  36. package/dist/mcp/tools/classify_gate.js +2 -2
  37. package/dist/mcp/tools/current_gate.js +2 -2
  38. package/dist/mcp/tools/dependency_supply_chain.d.ts +2 -0
  39. package/dist/mcp/tools/dependency_supply_chain.js +59 -0
  40. package/dist/mcp/tools/docker_compose.d.ts +2 -0
  41. package/dist/mcp/tools/docker_compose.js +24 -0
  42. package/dist/mcp/tools/e2e_playwright.d.ts +2 -0
  43. package/dist/mcp/tools/e2e_playwright.js +88 -0
  44. package/dist/mcp/tools/get_skill.js +17 -0
  45. package/dist/mcp/tools/git_commit.d.ts +2 -0
  46. package/dist/mcp/tools/git_commit.js +30 -0
  47. package/dist/mcp/tools/list_proposals.d.ts +6 -0
  48. package/dist/mcp/tools/list_proposals.js +16 -0
  49. package/dist/mcp/tools/list_skills.js +9 -1
  50. package/dist/mcp/tools/load_role.d.ts +3 -4
  51. package/dist/mcp/tools/load_role.js +11 -13
  52. package/dist/mcp/tools/propose_change.d.ts +8 -0
  53. package/dist/mcp/tools/propose_change.js +36 -0
  54. package/dist/mcp/tools/record_decision.js +25 -25
  55. package/dist/mcp/tools/review_proposal.d.ts +17 -0
  56. package/dist/mcp/tools/review_proposal.js +99 -0
  57. package/dist/mcp/tools/run_drift_audit.d.ts +11 -0
  58. package/dist/mcp/tools/run_drift_audit.js +79 -0
  59. package/dist/mcp/tools/run_tests.d.ts +2 -0
  60. package/dist/mcp/tools/run_tests.js +92 -0
  61. package/dist/mcp/tools/sign_off.js +14 -2
  62. package/dist/mcp/tools/stubs.js +30 -9
  63. package/dist/mcp/tools/verify_claim.js +33 -6
  64. package/dist/mcp_setup.d.ts +14 -3
  65. package/dist/mcp_setup.js +15 -6
  66. package/dist/shared/frontmatter.d.ts +44 -2
  67. package/dist/shared/frontmatter.js +54 -6
  68. package/dist/shared/index.d.ts +0 -5
  69. package/dist/shared/index.js +0 -5
  70. package/dist/shared/persona.d.ts +2 -2
  71. package/dist/shared/persona.js +1 -1
  72. package/dist/shared/pipeline.d.ts +46 -1
  73. package/dist/shared/tools.d.ts +1382 -16
  74. package/dist/shared/tools.js +229 -0
  75. package/dist/shared/vocabulary.d.ts +99 -4
  76. package/dist/shared/vocabulary.js +94 -5
  77. package/domains/analytics/.agents/skills/ansoff-matrix/SKILL.md +316 -300
  78. package/domains/analytics/.agents/skills/bcg-matrix/SKILL.md +345 -329
  79. package/domains/analytics/.agents/skills/blue-ocean-strategy/SKILL.md +432 -416
  80. package/domains/analytics/.agents/skills/board/SKILL.md +22 -0
  81. package/domains/analytics/.agents/skills/cohort-analysis/SKILL.md +338 -322
  82. package/domains/analytics/.agents/skills/competitive-analysis/SKILL.md +413 -395
  83. package/domains/analytics/.agents/skills/customer-journey-mapping/SKILL.md +347 -331
  84. package/domains/analytics/.agents/skills/gates/SKILL.md +388 -366
  85. package/domains/analytics/.agents/skills/handoff/SKILL.md +402 -380
  86. package/domains/analytics/.agents/skills/html-pdf-report/SKILL.md +21 -289
  87. package/domains/analytics/.agents/skills/html-pdf-report-reference/SKILL.md +325 -0
  88. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/claude.json +17 -0
  89. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/copilot.json +17 -0
  90. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/gemini.json +17 -0
  91. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/kimi.yaml +15 -0
  92. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/openai.yaml +10 -0
  93. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/qwen.json +17 -0
  94. package/domains/analytics/.agents/skills/html-pdf-report-reference/agents/skill.yaml +23 -0
  95. package/domains/analytics/.agents/skills/icp-buyer-persona/SKILL.md +407 -390
  96. package/domains/analytics/.agents/skills/jtbd-analysis/SKILL.md +357 -341
  97. package/domains/analytics/.agents/skills/karpathy-guidelines/SKILL.md +32 -0
  98. package/domains/analytics/.agents/skills/pest-analysis/SKILL.md +324 -305
  99. package/domains/analytics/.agents/skills/porters-five-forces/SKILL.md +377 -361
  100. package/domains/analytics/.agents/skills/report-design/SKILL.md +416 -398
  101. package/domains/analytics/.agents/skills/rfm-analysis/SKILL.md +330 -314
  102. package/domains/analytics/.agents/skills/session-prompt-generator/SKILL.md +400 -378
  103. package/domains/analytics/.agents/skills/swot-analysis/SKILL.md +340 -324
  104. package/domains/analytics/.agents/skills/tam-sam-som/SKILL.md +329 -312
  105. package/domains/analytics/.agents/skills/trend-analysis/SKILL.md +347 -331
  106. package/domains/analytics/.agents/skills/unit-economics/SKILL.md +430 -413
  107. package/domains/analytics/.agents/skills/value-chain-analysis/SKILL.md +346 -330
  108. package/domains/analytics/.agents/skills/web-research/SKILL.md +323 -308
  109. package/domains/analytics/AGENTS.md +1 -0
  110. package/domains/analytics/agents/auditor.md +76 -0
  111. package/domains/analytics/agents/conductor.md +11 -0
  112. package/domains/analytics/agents/data_analyst.md +11 -0
  113. package/domains/analytics/agents/designer.md +11 -0
  114. package/domains/analytics/agents/interviewer.md +11 -0
  115. package/domains/analytics/agents/layouter.md +11 -0
  116. package/domains/analytics/agents/mediator.md +11 -0
  117. package/domains/analytics/agents/researcher.md +11 -0
  118. package/domains/analytics/agents/strategist.md +11 -0
  119. package/domains/analytics/locales/en/.agents/skills/ansoff-matrix/SKILL.md +316 -300
  120. package/domains/analytics/locales/en/.agents/skills/bcg-matrix/SKILL.md +345 -329
  121. package/domains/analytics/locales/en/.agents/skills/blue-ocean-strategy/SKILL.md +432 -416
  122. package/domains/analytics/locales/en/.agents/skills/board/SKILL.md +22 -0
  123. package/domains/analytics/locales/en/.agents/skills/cohort-analysis/SKILL.md +338 -322
  124. package/domains/analytics/locales/en/.agents/skills/competitive-analysis/SKILL.md +413 -395
  125. package/domains/analytics/locales/en/.agents/skills/customer-journey-mapping/SKILL.md +347 -331
  126. package/domains/analytics/locales/en/.agents/skills/gates/SKILL.md +388 -366
  127. package/domains/analytics/locales/en/.agents/skills/handoff/SKILL.md +402 -380
  128. package/domains/analytics/locales/en/.agents/skills/html-pdf-report/SKILL.md +21 -289
  129. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/SKILL.md +325 -0
  130. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/claude.json +17 -0
  131. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/copilot.json +17 -0
  132. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/gemini.json +17 -0
  133. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/kimi.yaml +15 -0
  134. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/openai.yaml +10 -0
  135. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/qwen.json +17 -0
  136. package/domains/analytics/locales/en/.agents/skills/html-pdf-report-reference/agents/skill.yaml +29 -0
  137. package/domains/analytics/locales/en/.agents/skills/icp-buyer-persona/SKILL.md +407 -390
  138. package/domains/analytics/locales/en/.agents/skills/jtbd-analysis/SKILL.md +357 -341
  139. package/domains/analytics/locales/en/.agents/skills/karpathy-guidelines/SKILL.md +32 -0
  140. package/domains/analytics/locales/en/.agents/skills/pest-analysis/SKILL.md +324 -305
  141. package/domains/analytics/locales/en/.agents/skills/porters-five-forces/SKILL.md +377 -361
  142. package/domains/analytics/locales/en/.agents/skills/report-design/SKILL.md +416 -398
  143. package/domains/analytics/locales/en/.agents/skills/rfm-analysis/SKILL.md +330 -314
  144. package/domains/analytics/locales/en/.agents/skills/session-prompt-generator/SKILL.md +400 -378
  145. package/domains/analytics/locales/en/.agents/skills/swot-analysis/SKILL.md +340 -324
  146. package/domains/analytics/locales/en/.agents/skills/tam-sam-som/SKILL.md +329 -312
  147. package/domains/analytics/locales/en/.agents/skills/trend-analysis/SKILL.md +347 -331
  148. package/domains/analytics/locales/en/.agents/skills/unit-economics/SKILL.md +430 -413
  149. package/domains/analytics/locales/en/.agents/skills/value-chain-analysis/SKILL.md +366 -350
  150. package/domains/analytics/locales/en/.agents/skills/web-research/SKILL.md +324 -309
  151. package/domains/analytics/locales/en/AGENTS.md +1 -0
  152. package/domains/analytics/locales/en/agents/auditor.md +76 -0
  153. package/domains/analytics/locales/en/agents/conductor.md +27 -0
  154. package/domains/analytics/locales/en/agents/data_analyst.md +29 -0
  155. package/domains/analytics/locales/en/agents/designer.md +27 -0
  156. package/domains/analytics/locales/en/agents/interviewer.md +11 -0
  157. package/domains/analytics/locales/en/agents/layouter.md +11 -0
  158. package/domains/analytics/locales/en/agents/mediator.md +11 -0
  159. package/domains/analytics/locales/en/agents/researcher.md +11 -0
  160. package/domains/analytics/locales/en/agents/strategist.md +11 -0
  161. package/domains/analytics/persona/persona-base.md +94 -0
  162. package/domains/analytics/pipeline.yaml +102 -0
  163. package/domains/content/.agents/skills/audience-analysis/SKILL.md +15 -0
  164. package/domains/content/.agents/skills/board/SKILL.md +20 -0
  165. package/domains/content/.agents/skills/brand-compliance/SKILL.md +15 -0
  166. package/domains/content/.agents/skills/brand-guidelines/SKILL.md +17 -0
  167. package/domains/content/.agents/skills/competitor-content-analysis/SKILL.md +15 -0
  168. package/domains/content/.agents/skills/content-brief/SKILL.md +15 -0
  169. package/domains/content/.agents/skills/content-calendar/SKILL.md +15 -0
  170. package/domains/content/.agents/skills/content-release-gate/SKILL.md +15 -0
  171. package/domains/content/.agents/skills/content-review-checklist/SKILL.md +15 -0
  172. package/domains/content/.agents/skills/cta-optimization/SKILL.md +15 -0
  173. package/domains/content/.agents/skills/data-storytelling/SKILL.md +15 -0
  174. package/domains/content/.agents/skills/email-copywriting/SKILL.md +15 -0
  175. package/domains/content/.agents/skills/email-engagement-tiers/SKILL.md +15 -0
  176. package/domains/content/.agents/skills/fact-checking/SKILL.md +15 -0
  177. package/domains/content/.agents/skills/gates/SKILL.md +20 -0
  178. package/domains/content/.agents/skills/google-stitch-content/SKILL.md +15 -0
  179. package/domains/content/.agents/skills/handoff/SKILL.md +24 -0
  180. package/domains/content/.agents/skills/headline-formulas/SKILL.md +15 -0
  181. package/domains/content/.agents/skills/image-prompt-engineering/SKILL.md +15 -0
  182. package/domains/content/.agents/skills/karpathy-guidelines/SKILL.md +28 -0
  183. package/domains/content/.agents/skills/mailerlite-email-ops/SKILL.md +15 -0
  184. package/domains/content/.agents/skills/marketing-psychology/SKILL.md +15 -0
  185. package/domains/content/.agents/skills/moodboard/SKILL.md +15 -0
  186. package/domains/content/.agents/skills/platform-compliance/SKILL.md +15 -0
  187. package/domains/content/.agents/skills/platform-strategy/SKILL.md +15 -0
  188. package/domains/content/.agents/skills/platform-visual-specs/SKILL.md +15 -0
  189. package/domains/content/.agents/skills/readability-scoring/SKILL.md +15 -0
  190. package/domains/content/.agents/skills/seo-copywriting/SKILL.md +15 -0
  191. package/domains/content/.agents/skills/social-media-formats/SKILL.md +15 -0
  192. package/domains/content/.agents/skills/source-verification/SKILL.md +15 -0
  193. package/domains/content/.agents/skills/storytelling-framework/SKILL.md +15 -0
  194. package/domains/content/.agents/skills/tone-of-voice/SKILL.md +15 -0
  195. package/domains/content/.agents/skills/topic-research/SKILL.md +15 -0
  196. package/domains/content/.agents/skills/trend-research/SKILL.md +15 -0
  197. package/domains/content/.agents/skills/visual-brief/SKILL.md +15 -0
  198. package/domains/content/AGENTS.md +4 -0
  199. package/domains/content/agents/auditor.md +76 -0
  200. package/domains/content/agents/conductor.md +11 -0
  201. package/domains/content/agents/copywriter.md +11 -0
  202. package/domains/content/agents/researcher.md +11 -0
  203. package/domains/content/agents/reviewer.md +11 -0
  204. package/domains/content/agents/strategist.md +11 -0
  205. package/domains/content/agents/visual_concept.md +11 -0
  206. package/domains/content/locales/en/.agents/skills/audience-analysis/SKILL.md +15 -0
  207. package/domains/content/locales/en/.agents/skills/board/SKILL.md +20 -0
  208. package/domains/content/locales/en/.agents/skills/brand-compliance/SKILL.md +15 -0
  209. package/domains/content/locales/en/.agents/skills/brand-guidelines/SKILL.md +17 -0
  210. package/domains/content/locales/en/.agents/skills/competitor-content-analysis/SKILL.md +15 -0
  211. package/domains/content/locales/en/.agents/skills/content-brief/SKILL.md +15 -0
  212. package/domains/content/locales/en/.agents/skills/content-calendar/SKILL.md +15 -0
  213. package/domains/content/locales/en/.agents/skills/content-release-gate/SKILL.md +15 -0
  214. package/domains/content/locales/en/.agents/skills/content-review-checklist/SKILL.md +15 -0
  215. package/domains/content/locales/en/.agents/skills/cta-optimization/SKILL.md +15 -0
  216. package/domains/content/locales/en/.agents/skills/data-storytelling/SKILL.md +15 -0
  217. package/domains/content/locales/en/.agents/skills/email-copywriting/SKILL.md +15 -0
  218. package/domains/content/locales/en/.agents/skills/email-engagement-tiers/SKILL.md +15 -0
  219. package/domains/content/locales/en/.agents/skills/fact-checking/SKILL.md +15 -0
  220. package/domains/content/locales/en/.agents/skills/gates/SKILL.md +20 -0
  221. package/domains/content/locales/en/.agents/skills/google-stitch-content/SKILL.md +15 -0
  222. package/domains/content/locales/en/.agents/skills/handoff/SKILL.md +24 -0
  223. package/domains/content/locales/en/.agents/skills/headline-formulas/SKILL.md +15 -0
  224. package/domains/content/locales/en/.agents/skills/image-prompt-engineering/SKILL.md +15 -0
  225. package/domains/content/locales/en/.agents/skills/karpathy-guidelines/SKILL.md +30 -1
  226. package/domains/content/locales/en/.agents/skills/mailerlite-email-ops/SKILL.md +15 -0
  227. package/domains/content/locales/en/.agents/skills/marketing-psychology/SKILL.md +15 -0
  228. package/domains/content/locales/en/.agents/skills/moodboard/SKILL.md +15 -0
  229. package/domains/content/locales/en/.agents/skills/platform-compliance/SKILL.md +15 -0
  230. package/domains/content/locales/en/.agents/skills/platform-strategy/SKILL.md +15 -0
  231. package/domains/content/locales/en/.agents/skills/platform-visual-specs/SKILL.md +15 -0
  232. package/domains/content/locales/en/.agents/skills/readability-scoring/SKILL.md +15 -0
  233. package/domains/content/locales/en/.agents/skills/seo-copywriting/SKILL.md +15 -0
  234. package/domains/content/locales/en/.agents/skills/social-media-formats/SKILL.md +15 -0
  235. package/domains/content/locales/en/.agents/skills/source-verification/SKILL.md +15 -0
  236. package/domains/content/locales/en/.agents/skills/storytelling-framework/SKILL.md +15 -0
  237. package/domains/content/locales/en/.agents/skills/tone-of-voice/SKILL.md +15 -0
  238. package/domains/content/locales/en/.agents/skills/topic-research/SKILL.md +15 -0
  239. package/domains/content/locales/en/.agents/skills/trend-research/SKILL.md +15 -0
  240. package/domains/content/locales/en/.agents/skills/visual-brief/SKILL.md +15 -0
  241. package/domains/content/locales/en/AGENTS.md +4 -0
  242. package/domains/content/locales/en/agents/auditor.md +76 -0
  243. package/domains/content/locales/en/agents/conductor.md +12 -0
  244. package/domains/content/locales/en/agents/copywriter.md +12 -0
  245. package/domains/content/locales/en/agents/researcher.md +12 -0
  246. package/domains/content/locales/en/agents/reviewer.md +12 -0
  247. package/domains/content/locales/en/agents/strategist.md +12 -0
  248. package/domains/content/locales/en/agents/visual_concept.md +12 -0
  249. package/domains/content/persona/persona-base.md +94 -0
  250. package/domains/content/pipeline.yaml +96 -0
  251. package/domains/development/.agents/skills/adr-log/SKILL.md +1 -0
  252. package/domains/development/.agents/skills/design-intake/SKILL.md +0 -4
  253. package/domains/development/.agents/skills/karpathy-guidelines/SKILL.md +2 -1
  254. package/domains/development/.agents/skills/lava-flow-legacy-detection/SKILL.md +15 -1
  255. package/domains/development/.agents/skills/mcp-integration/SKILL.md +211 -0
  256. package/domains/development/.agents/skills/mcp-integration/agents/claude.json +22 -0
  257. package/domains/development/.agents/skills/mcp-integration/agents/copilot.json +22 -0
  258. package/domains/development/.agents/skills/mcp-integration/agents/gemini.json +22 -0
  259. package/domains/development/.agents/skills/mcp-integration/agents/kimi.yaml +18 -0
  260. package/domains/development/.agents/skills/mcp-integration/agents/openai.yaml +8 -0
  261. package/domains/development/.agents/skills/mcp-integration/agents/qwen.json +22 -0
  262. package/domains/development/.agents/skills/mcp-integration/agents/skill.yaml +26 -0
  263. package/domains/development/.agents/skills/qa-ui-a11y-smoke/SKILL.md +1 -1
  264. package/domains/development/.agents/skills/ui-a11y-smoke-review/SKILL.md +1 -1
  265. package/domains/development/AGENTS.md +1 -0
  266. package/domains/development/AGENTS.yaml +1 -0
  267. package/domains/development/agents/architect.md +13 -1
  268. package/domains/development/agents/auditor.md +74 -0
  269. package/domains/development/agents/conductor.md +14 -3
  270. package/domains/development/agents/devops.md +8 -9
  271. package/domains/development/agents/reviewer.md +12 -0
  272. package/domains/development/agents/senior_full_stack.md +12 -0
  273. package/domains/development/agents/tester.md +10 -16
  274. package/domains/development/locales/en/.agents/skills/adr-log/SKILL.md +1 -0
  275. package/domains/development/locales/en/.agents/skills/current-state-analysis/SKILL.md +256 -172
  276. package/domains/development/locales/en/.agents/skills/karpathy-guidelines/SKILL.md +2 -1
  277. package/domains/development/locales/en/.agents/skills/lava-flow-legacy-detection/SKILL.md +15 -1
  278. package/domains/development/locales/en/.agents/skills/mcp-integration/SKILL.md +211 -0
  279. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/claude.json +22 -0
  280. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/copilot.json +22 -0
  281. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/gemini.json +22 -0
  282. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/kimi.yaml +18 -0
  283. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/openai.yaml +8 -0
  284. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/qwen.json +22 -0
  285. package/domains/development/locales/en/.agents/skills/mcp-integration/agents/skill.yaml +26 -0
  286. package/domains/development/locales/en/.agents/skills/qa-ui-a11y-smoke/SKILL.md +1 -1
  287. package/domains/development/locales/en/.agents/skills/ui-a11y-smoke-review/SKILL.md +1 -1
  288. package/domains/development/locales/en/AGENTS.md +5 -0
  289. package/domains/development/locales/en/AGENTS.yaml +1 -0
  290. package/domains/development/locales/en/agents/architect.md +13 -1
  291. package/domains/development/locales/en/agents/auditor.md +74 -0
  292. package/domains/development/locales/en/agents/conductor.md +14 -3
  293. package/domains/development/locales/en/agents/devops.md +8 -9
  294. package/domains/development/locales/en/agents/reviewer.md +12 -0
  295. package/domains/development/locales/en/agents/senior_full_stack.md +12 -0
  296. package/domains/development/locales/en/agents/tester.md +10 -16
  297. package/domains/development/persona/persona-base.md +94 -0
  298. package/domains/product/.agents/skills/aarrr-metrics/SKILL.md +451 -433
  299. package/domains/product/.agents/skills/ab-test-design/SKILL.md +428 -412
  300. package/domains/product/.agents/skills/acceptance-criteria/SKILL.md +422 -406
  301. package/domains/product/.agents/skills/assumption-mapping/SKILL.md +323 -307
  302. package/domains/product/.agents/skills/board/SKILL.md +24 -0
  303. package/domains/product/.agents/skills/design-brief/SKILL.md +433 -418
  304. package/domains/product/.agents/skills/epic-breakdown/SKILL.md +435 -420
  305. package/domains/product/.agents/skills/gates/SKILL.md +470 -446
  306. package/domains/product/.agents/skills/gtm-brief/SKILL.md +18 -321
  307. package/domains/product/.agents/skills/gtm-brief-reference/SKILL.md +348 -0
  308. package/domains/product/.agents/skills/gtm-brief-reference/agents/claude.json +17 -0
  309. package/domains/product/.agents/skills/gtm-brief-reference/agents/copilot.json +17 -0
  310. package/domains/product/.agents/skills/gtm-brief-reference/agents/gemini.json +17 -0
  311. package/domains/product/.agents/skills/gtm-brief-reference/agents/kimi.yaml +15 -0
  312. package/domains/product/.agents/skills/gtm-brief-reference/agents/openai.yaml +10 -0
  313. package/domains/product/.agents/skills/gtm-brief-reference/agents/qwen.json +17 -0
  314. package/domains/product/.agents/skills/gtm-brief-reference/agents/skill.yaml +22 -0
  315. package/domains/product/.agents/skills/handoff/SKILL.md +463 -439
  316. package/domains/product/.agents/skills/html-pdf-report/SKILL.md +21 -663
  317. package/domains/product/.agents/skills/html-pdf-report-reference/SKILL.md +699 -0
  318. package/domains/product/.agents/skills/html-pdf-report-reference/agents/claude.json +17 -0
  319. package/domains/product/.agents/skills/html-pdf-report-reference/agents/copilot.json +17 -0
  320. package/domains/product/.agents/skills/html-pdf-report-reference/agents/gemini.json +17 -0
  321. package/domains/product/.agents/skills/html-pdf-report-reference/agents/kimi.yaml +15 -0
  322. package/domains/product/.agents/skills/html-pdf-report-reference/agents/openai.yaml +10 -0
  323. package/domains/product/.agents/skills/html-pdf-report-reference/agents/qwen.json +17 -0
  324. package/domains/product/.agents/skills/html-pdf-report-reference/agents/skill.yaml +22 -0
  325. package/domains/product/.agents/skills/hypothesis-template/SKILL.md +484 -469
  326. package/domains/product/.agents/skills/jtbd-canvas/SKILL.md +274 -258
  327. package/domains/product/.agents/skills/kano-model/SKILL.md +370 -355
  328. package/domains/product/.agents/skills/karpathy-guidelines/SKILL.md +36 -0
  329. package/domains/product/.agents/skills/launch-checklist/SKILL.md +434 -419
  330. package/domains/product/.agents/skills/moscow-prioritization/SKILL.md +407 -392
  331. package/domains/product/.agents/skills/north-star-metric/SKILL.md +317 -301
  332. package/domains/product/.agents/skills/okr-framework/SKILL.md +299 -284
  333. package/domains/product/.agents/skills/opportunity-solution-tree/SKILL.md +472 -456
  334. package/domains/product/.agents/skills/prd-template/SKILL.md +18 -258
  335. package/domains/product/.agents/skills/prd-template-reference/SKILL.md +285 -0
  336. package/domains/product/.agents/skills/prd-template-reference/agents/claude.json +17 -0
  337. package/domains/product/.agents/skills/prd-template-reference/agents/copilot.json +17 -0
  338. package/domains/product/.agents/skills/prd-template-reference/agents/gemini.json +17 -0
  339. package/domains/product/.agents/skills/prd-template-reference/agents/kimi.yaml +16 -0
  340. package/domains/product/.agents/skills/prd-template-reference/agents/openai.yaml +10 -0
  341. package/domains/product/.agents/skills/prd-template-reference/agents/qwen.json +17 -0
  342. package/domains/product/.agents/skills/prd-template-reference/agents/skill.yaml +23 -0
  343. package/domains/product/.agents/skills/problem-statement/SKILL.md +327 -312
  344. package/domains/product/.agents/skills/product-roadmap/SKILL.md +320 -304
  345. package/domains/product/.agents/skills/product-vision/SKILL.md +409 -394
  346. package/domains/product/.agents/skills/release-notes/SKILL.md +18 -258
  347. package/domains/product/.agents/skills/release-notes-reference/SKILL.md +285 -0
  348. package/domains/product/.agents/skills/release-notes-reference/agents/claude.json +17 -0
  349. package/domains/product/.agents/skills/release-notes-reference/agents/copilot.json +17 -0
  350. package/domains/product/.agents/skills/release-notes-reference/agents/gemini.json +17 -0
  351. package/domains/product/.agents/skills/release-notes-reference/agents/kimi.yaml +15 -0
  352. package/domains/product/.agents/skills/release-notes-reference/agents/openai.yaml +10 -0
  353. package/domains/product/.agents/skills/release-notes-reference/agents/qwen.json +17 -0
  354. package/domains/product/.agents/skills/release-notes-reference/agents/skill.yaml +22 -0
  355. package/domains/product/.agents/skills/report-design/SKILL.md +17 -307
  356. package/domains/product/.agents/skills/report-design-reference/SKILL.md +331 -0
  357. package/domains/product/.agents/skills/report-design-reference/agents/claude.json +17 -0
  358. package/domains/product/.agents/skills/report-design-reference/agents/copilot.json +17 -0
  359. package/domains/product/.agents/skills/report-design-reference/agents/gemini.json +17 -0
  360. package/domains/product/.agents/skills/report-design-reference/agents/kimi.yaml +15 -0
  361. package/domains/product/.agents/skills/report-design-reference/agents/openai.yaml +10 -0
  362. package/domains/product/.agents/skills/report-design-reference/agents/qwen.json +17 -0
  363. package/domains/product/.agents/skills/report-design-reference/agents/skill.yaml +22 -0
  364. package/domains/product/.agents/skills/rice-scoring/SKILL.md +266 -251
  365. package/domains/product/.agents/skills/saas-metrics/SKILL.md +422 -404
  366. package/domains/product/.agents/skills/session-prompt-generator/SKILL.md +474 -450
  367. package/domains/product/.agents/skills/user-flow/SKILL.md +491 -476
  368. package/domains/product/.agents/skills/user-interview-script/SKILL.md +315 -298
  369. package/domains/product/.agents/skills/user-story/SKILL.md +401 -385
  370. package/domains/product/.agents/skills/wsjf-scoring/SKILL.md +333 -315
  371. package/domains/product/AGENTS.md +5 -0
  372. package/domains/product/AGENTS.yaml +1 -0
  373. package/domains/product/agents/auditor.md +76 -0
  374. package/domains/product/agents/conductor.md +11 -0
  375. package/domains/product/agents/data_analyst.md +11 -0
  376. package/domains/product/agents/designer.md +11 -0
  377. package/domains/product/agents/discovery.md +11 -0
  378. package/domains/product/agents/layouter.md +11 -0
  379. package/domains/product/agents/mediator.md +11 -0
  380. package/domains/product/agents/pm.md +11 -0
  381. package/domains/product/agents/product_strategist.md +11 -0
  382. package/domains/product/agents/tech_lead.md +11 -0
  383. package/domains/product/agents/ux_designer.md +11 -0
  384. package/domains/product/locales/en/.agents/skills/aarrr-metrics/SKILL.md +451 -433
  385. package/domains/product/locales/en/.agents/skills/ab-test-design/SKILL.md +428 -412
  386. package/domains/product/locales/en/.agents/skills/acceptance-criteria/SKILL.md +422 -406
  387. package/domains/product/locales/en/.agents/skills/assumption-mapping/SKILL.md +323 -307
  388. package/domains/product/locales/en/.agents/skills/board/SKILL.md +24 -0
  389. package/domains/product/locales/en/.agents/skills/design-brief/SKILL.md +433 -418
  390. package/domains/product/locales/en/.agents/skills/epic-breakdown/SKILL.md +435 -420
  391. package/domains/product/locales/en/.agents/skills/gates/SKILL.md +470 -446
  392. package/domains/product/locales/en/.agents/skills/gtm-brief/SKILL.md +18 -321
  393. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/SKILL.md +348 -0
  394. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/claude.json +17 -0
  395. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/copilot.json +17 -0
  396. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/gemini.json +17 -0
  397. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/kimi.yaml +15 -0
  398. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/openai.yaml +10 -0
  399. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/qwen.json +17 -0
  400. package/domains/product/locales/en/.agents/skills/gtm-brief-reference/agents/skill.yaml +22 -0
  401. package/domains/product/locales/en/.agents/skills/handoff/SKILL.md +463 -439
  402. package/domains/product/locales/en/.agents/skills/html-pdf-report/SKILL.md +21 -663
  403. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/SKILL.md +699 -0
  404. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/claude.json +17 -0
  405. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/copilot.json +17 -0
  406. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/gemini.json +17 -0
  407. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/kimi.yaml +15 -0
  408. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/openai.yaml +10 -0
  409. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/qwen.json +17 -0
  410. package/domains/product/locales/en/.agents/skills/html-pdf-report-reference/agents/skill.yaml +22 -0
  411. package/domains/product/locales/en/.agents/skills/hypothesis-template/SKILL.md +484 -469
  412. package/domains/product/locales/en/.agents/skills/jtbd-canvas/SKILL.md +273 -257
  413. package/domains/product/locales/en/.agents/skills/kano-model/SKILL.md +370 -355
  414. package/domains/product/locales/en/.agents/skills/karpathy-guidelines/SKILL.md +36 -0
  415. package/domains/product/locales/en/.agents/skills/launch-checklist/SKILL.md +434 -419
  416. package/domains/product/locales/en/.agents/skills/moscow-prioritization/SKILL.md +407 -392
  417. package/domains/product/locales/en/.agents/skills/north-star-metric/SKILL.md +317 -301
  418. package/domains/product/locales/en/.agents/skills/okr-framework/SKILL.md +299 -284
  419. package/domains/product/locales/en/.agents/skills/opportunity-solution-tree/SKILL.md +472 -456
  420. package/domains/product/locales/en/.agents/skills/prd-template/SKILL.md +18 -258
  421. package/domains/product/locales/en/.agents/skills/prd-template-reference/SKILL.md +285 -0
  422. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/claude.json +16 -0
  423. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/copilot.json +16 -0
  424. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/gemini.json +16 -0
  425. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/kimi.yaml +15 -0
  426. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/openai.yaml +10 -0
  427. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/qwen.json +16 -0
  428. package/domains/product/locales/en/.agents/skills/prd-template-reference/agents/skill.yaml +22 -0
  429. package/domains/product/locales/en/.agents/skills/problem-statement/SKILL.md +327 -312
  430. package/domains/product/locales/en/.agents/skills/product-roadmap/SKILL.md +321 -305
  431. package/domains/product/locales/en/.agents/skills/product-vision/SKILL.md +410 -395
  432. package/domains/product/locales/en/.agents/skills/release-notes/SKILL.md +18 -258
  433. package/domains/product/locales/en/.agents/skills/release-notes-reference/SKILL.md +285 -0
  434. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/claude.json +16 -0
  435. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/copilot.json +16 -0
  436. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/gemini.json +16 -0
  437. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/kimi.yaml +14 -0
  438. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/openai.yaml +10 -0
  439. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/qwen.json +16 -0
  440. package/domains/product/locales/en/.agents/skills/release-notes-reference/agents/skill.yaml +21 -0
  441. package/domains/product/locales/en/.agents/skills/report-design/SKILL.md +17 -307
  442. package/domains/product/locales/en/.agents/skills/report-design-reference/SKILL.md +331 -0
  443. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/claude.json +17 -0
  444. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/copilot.json +17 -0
  445. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/gemini.json +17 -0
  446. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/kimi.yaml +15 -0
  447. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/openai.yaml +10 -0
  448. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/qwen.json +17 -0
  449. package/domains/product/locales/en/.agents/skills/report-design-reference/agents/skill.yaml +22 -0
  450. package/domains/product/locales/en/.agents/skills/rice-scoring/SKILL.md +266 -251
  451. package/domains/product/locales/en/.agents/skills/saas-metrics/SKILL.md +422 -404
  452. package/domains/product/locales/en/.agents/skills/session-prompt-generator/SKILL.md +474 -450
  453. package/domains/product/locales/en/.agents/skills/user-flow/SKILL.md +491 -476
  454. package/domains/product/locales/en/.agents/skills/user-interview-script/SKILL.md +314 -297
  455. package/domains/product/locales/en/.agents/skills/user-story/SKILL.md +401 -385
  456. package/domains/product/locales/en/.agents/skills/wsjf-scoring/SKILL.md +333 -315
  457. package/domains/product/locales/en/AGENTS.md +5 -0
  458. package/domains/product/locales/en/agents/auditor.md +76 -0
  459. package/domains/product/locales/en/agents/conductor.md +11 -0
  460. package/domains/product/locales/en/agents/data_analyst.md +11 -0
  461. package/domains/product/locales/en/agents/designer.md +11 -0
  462. package/domains/product/locales/en/agents/discovery.md +11 -0
  463. package/domains/product/locales/en/agents/layouter.md +11 -0
  464. package/domains/product/locales/en/agents/mediator.md +11 -0
  465. package/domains/product/locales/en/agents/pm.md +11 -0
  466. package/domains/product/locales/en/agents/product_strategist.md +11 -0
  467. package/domains/product/locales/en/agents/tech_lead.md +11 -0
  468. package/domains/product/locales/en/agents/ux_designer.md +11 -0
  469. package/domains/product/persona/persona-base.md +94 -0
  470. package/domains/product/pipeline.yaml +115 -0
  471. package/package.json +72 -70
@@ -1,456 +1,472 @@
1
- ---
2
- name: opportunity-solution-tree
3
- description: Opportunity-Solution Tree by Teresa Torres — desired outcome → opportunities → solutions → experiments
4
- ---
5
- # Opportunity-Solution Tree (OST)
6
-
7
- > **Category:** Discovery · **Slug:** `opportunity-solution-tree`
8
-
9
- ## When to Use
10
-
11
- - When you have a desired outcome (from strategy or OKR) but it's unclear **how** to achieve it.
12
- - During continuous discovery — as an artifact that evolves over time.
13
- - When the team jumps into solution mode without understanding the opportunity space.
14
- - When handing off discovery to engineering — OST shows the rationale behind solutions.
15
-
16
- ## Input
17
-
18
- | Field | Required | Description |
19
- |-------|:--------:|-------------|
20
- | Desired outcome | ✅ | Business or product outcome (from NSM/OKR) |
21
- | Customer research | ✅ | Interviews, JTBD, problems |
22
- | Current assumptions | ⬚ | Which hypotheses already exist |
23
- | Time box | ⬚ | Usually 1-2 weeks for MVP tree |
24
-
25
- ## Data Sources
26
-
27
- 1. `$user-interview-script` transcripts opportunities from pain points.
28
- 2. `$jtbd-canvas`opportunity jobs.
29
- 3. Support tickets, NPS comments evidence for opportunities.
30
- 4. Product analyticsquantitative signal.
31
-
32
- ### Related Skills
33
-
34
- | Skill | What we take | When to invoke |
35
- |-------|-------------|----------------|
36
- | `jtbd-canvas` | Underserved jobs opportunities | When branching opportunities |
37
- | `problem-statement` | Specific problem → opportunity node | For articulating a specific opportunity |
38
- | `assumption-mapping` | Assumptions underlying solutions | For each solution node |
39
- | `hypothesis-template` | Solution testable hypothesis | For experiment nodes |
40
-
41
- ## Tree Structure
42
-
43
- ```
44
- Desired Outcome (top)
45
-
46
- ├── Opportunity 1 (customer problem / unmet job)
47
- │ ├── Solution 1a (product idea)
48
- │ │ └── Experiment 1a-i (test)
49
- │ └── Solution 1b
50
-
51
- ├── Opportunity 2
52
- │ └── Solution 2a
53
- │ └── Experiment 2a-i
54
-
55
- └── Opportunity 3
56
- ```
57
-
58
- ## Protocol
59
-
60
- ### Step 0 — Desired Outcome
61
-
62
- Formulate **one** outcome. Must be measurable, tied to a business metric.
63
-
64
- Good: «Increase weekly active teams by 30% in 2 quarters»
65
- Bad: «Improve engagement»
66
-
67
- ### Step 1 — Opportunities
68
-
69
- Opportunities are customer problems or unmet jobs. From JTBD + interviews.
70
-
71
- Rules:
72
- - Formulated as a **customer** statement, not a solution
73
- - Evidence-backed (evidence ≥ 2 sources or mark 🔮 assumed)
74
- - Mutually exclusive (no overlap)
75
-
76
- Limit: 3-7 opportunities at the top level. More → split into sub-branches.
77
-
78
- | # | Opportunity | Evidence | Estimated impact |
79
- |---|-------------|----------|-------------------|
80
- | O1 | [customer pain / unmet job] | JTBD F3, Interview 4-7 | High / Medium / Low |
81
-
82
- ### Step 2 — Opportunity Assessment
83
-
84
- For each — importance × reach × gap (where the current solution falls short).
85
-
86
- Prune opportunities with low impact — focus on top 3.
87
-
88
- ### Step 3 Solutions
89
-
90
- For each **top** opportunity — 2-5 solution candidates. These are product ideas, features, process changes.
91
-
92
- Rules:
93
- - Specific (not «improve onboarding», but «in-app checklist for first 3 tasks»)
94
- - One opportunity can have multiple solutions don't choose immediately
95
- - Solutions can be «non-product» (docs, training, pricing change)
96
-
97
- ### Step 4 — Experiments
98
-
99
- For each solution — an experiment (or several) for validation.
100
-
101
- | Solution | Experiment | Prediction | Measurement | Time box |
102
- |----------|------------|------------|-------------|-----------|
103
- | In-app checklist | A/B test with 50% new users | +20% first-week activation | Funnel metric | 4 weeks |
104
-
105
- Experiments — via `$hypothesis-template` + `$ab-test-design`.
106
-
107
- ### Step 5 — Prioritization
108
-
109
- Prioritize experiments, not solutions. Sometimes quick experiments reveal that an opportunity is weaker than it seemed.
110
-
111
- Use `$rice-scoring` at the experiment level.
112
-
113
- ### Step 6Iteration
114
-
115
- OST is a living document. After each experiment:
116
- - Update evidence
117
- - Prune debunked branches
118
- - Add new opportunities from findings
119
-
120
- ## Validation (Quality Gate)
121
-
122
- - [ ] Desired outcome — measurable, tied to a metric
123
- - [ ] 3-7 opportunities at the top level
124
- - [ ] Each opportunity — customer-framed, not a solution
125
- - [ ] Evidence (≥ 2 sources) or mark 🔮
126
- - [ ] Top 3 opportunities have ≥ 2 solutions
127
- - [ ] Each solution has ≥ 1 experiment
128
- - [ ] Experiments prioritized (RICE)
129
-
130
- ## Handoff
131
-
132
- The result is input for:
133
- - **`rice-scoring`** prioritization of experiments
134
- - **`hypothesis-template`** formalize experiments
135
- - **PM**: solutions → feature candidates for roadmap
136
- - **`assumption-mapping`** assumptions behind solutions
137
-
138
- Format: OST as markdown tree or Miro-style diagram (exported). Via `$handoff`.
139
-
140
- ## Anti-patterns
141
-
142
- | Error | Why it's bad | How to do it right |
143
- |-------|-------------|-------------------|
144
- | Solution-first tree | Skip opportunity space | Start with outcome → opportunities |
145
- | Opportunities = features | Just a renamed backlog | Opportunity = customer problem |
146
- | No experiments | Solutions not validated | Each solution has an experiment |
147
- | Tree never updated | Frozen OST = dead OST | Weekly/biweekly update |
148
- | Choosing solution before validation | Locked into first idea | Keep alternatives until experiment data |
149
-
150
- ## Worked Example TeamFlow OST (AI 1:1 Summarization initiative)
151
-
152
- **Context:** TeamFlow (B2B SaaS HR-tech, $8M ARR, NRR 105%). After Discovery (8 interviews + JTBD canvas), the team builds the OST for the chosen desired outcome. Question: «Should we build AI summarization?» OST should show the opportunity space and alternatives, not jump to a conclusion.
153
-
154
- ### Desired Outcome
155
-
156
- ```
157
- Grow Net Revenue Retention from 105% to 115% within 4 quarters
158
- ```
159
-
160
- **Why this metric:** NRR is the key SaaS metric for Board and investors. Growth in NRR drives ARR, moat, margin. A 10pp lift is tied to expansion + churn reduction, both achievable by increasing manager engagement with the product.
161
-
162
- **Why not another metric:**
163
- - «Activation rate» too leading, doesn't prove business outcome.
164
- - «Weekly active teams» vanity if not linked with revenue.
165
- - «ARR growth» — too lagging, NRR = leading indicator.
166
-
167
- ### Opportunity Layer (from JTBD canvas + Discovery Brief)
168
-
169
- ```
170
- Desired: Grow NRR 105% → 115%
171
-
172
- ├── O1: Managers spend 3-4 hrs/week on 1:1 admin overhead
173
- │ (buyer and end-user pain)
174
- │ Evidence: 6 of 8 interviews, internal analytics shows prep time
175
- │ Estimated reach: 3,000 active managers across customer base
176
- │ Impact: retention + expansion (see Opp assessment)
177
-
178
- ├── O2: VP HR lack visibility into manager 1:1 practice/quality across org
179
- │ (buyer pain)
180
- │ Evidence: 4 of 4 buyer interviews
181
- │ Estimated reach: 200 buyer accounts
182
- │ Impact: expansion opportunity (enterprise tier)
183
-
184
- ├── O3: 40% of 1:1 action items dropped / forgotten
185
- │ (end-user pain)
186
- │ Evidence: 5 of 8 end-user interviews, internal product data (completed/total)
187
- Estimated reach: 3,000 managers × 8 reports × 40% = significant
188
- │ Impact: user retention + NPS
189
-
190
- ├── O4: New-manager onboarding takes 6+ months until «effective 1:1»
191
- (buyer pain F4)
192
- Evidence: 3 of 4 buyer interviews
193
- Estimated reach: ~1,000 new managers/year across customer base
194
- │ Impact: smaller, but high-value for enterprise segment
195
-
196
- └── O5: Cross-manager 1:1 experience inconsistency
197
- (buyer + skip-level + employee pain)
198
- Evidence: 2 of 4 buyer interviews, 1 end-user correlation
199
- Estimated reach: Broad but fuzzy — hard to quantify
200
- Impact: employee NPS / retention, but indirect
201
- ```
202
-
203
- ### Opportunity Assessment
204
-
205
- | # | Opportunity | Importance (1-10) | Reach (users) | Gap (current vs desired) | Priority Score |
206
- |---|-------------|:-----------------:|:-------------:|:------------------------:|:--------------:|
207
- | O1 | Admin overhead | 10 | ~3,000 | Big (no tool today) | 52,000 |
208
- | O3 | Action items dropped | 9 | ~3,000 | Big | 46,800 |
209
- | O2 | Buyer visibility | 9 | ~200 (but $$$) | Big | 1,800 (absolute) but high deal size |
210
- | O4 | New-mgr onboarding | 7 | ~1,000/yr | Medium | 7,000 |
211
- | O5 | Cross-mgr consistency | 6 | Fuzzy | Medium | Hard to score |
212
-
213
- **Pruning:** focus on top 3: **O1 (admin overhead), O3 (action items), O2 (buyer visibility)** — all interconnected and one feature can address multiple.
214
-
215
- ### Solutions Layer (for top 3 opportunities)
216
-
217
- ```
218
- O1: Admin overhead
219
- ├── S1.1: AI-powered real-time transcription + summarization during 1:1
220
- │ (hypothesis: address capture-during-meeting pain + action items extraction)
221
- ├── S1.2: Template library + auto-population from previous 1:1s
222
- │ (lighter: not AI, but reduces prep time)
223
- ├── S1.3: 1:1 duration reduction through structured agendas + time-boxing UI
224
- │ (behavioral, not AI)
225
- └── S1.4: Async 1:1s (structured written async for some updates)
226
- (radical: reduce live meeting count)
227
-
228
- O3: Action items dropped
229
- ├── S3.1: AI-extracted action items with automated reminders in the next 1:1
230
- │ (ties to S1.1 — AI summarization output)
231
- ├── S3.2: Calendar / Slack integration — reminders 2 days before 1:1
232
- │ (lighter: rules-based)
233
- └── S3.3: Weekly digest email «here's what you committed to» with status checkboxes
234
-
235
- O2: Buyer visibility
236
- ├── S2.1: Aggregate dashboard: 1:1 frequency, duration, action items velocity per team
237
- │ (rule-based analytics, no AI needed for basic version)
238
- ├── S2.2: AI-inferred 1:1 health score (combining multiple signals)
239
- │ (more advanced, needs S1.1 infrastructure)
240
- └── S2.3: Quarterly People Ops report generator (auto-composed for Board)
241
- ```
242
-
243
- ### Experiments Layer (for top solutions)
244
-
245
- | Solution | Experiment | Prediction | Measurement | Time box | RICE score |
246
- |----------|------------|------------|-------------|----------|:----------:|
247
- | S1.1: AI summarization | Concierge MVP: human «AI» assists 10 beta customers manually for 2 weeks | Managers save 30 min/week; want to keep | Survey + retention | 3 weeks | 35 |
248
- | S1.1: AI summarization | Wizard-of-Oz: Real OpenAI API integration with 20 beta managers, tightly monitored | 20% drop-off acceptable; summarization quality > 70% acceptable | Manager survey + usage data | 4 weeks | 48 |
249
- | S1.2: Template library | Ship feature flag to 50% users, measure prep time via surveys | -30% prep time self-reported | Survey + funnel | 4 weeks | 28 |
250
- | S3.1: AI action items | Builds on S1.1 — separate evaluation: after AI summary generated, opt-in reminder system | +20% action items completed | Product analytics | 6 weeks | 30 |
251
- | S3.2: Rule-based reminders | Ship to all users, measure completion rate | +10% action items completed | Product analytics | 2 weeks | 25 |
252
- | S2.1: Aggregate dashboard | Beta 5 enterprise customers, qualitative interviews after 30 days | 4 of 5 say «would use weekly» | Interviews | 6 weeks | 22 |
253
-
254
- ### Experiment Prioritization
255
-
256
- 1. **S1.2 (Template library)** low-risk, fast, independent. **Ship this first** (4 weeks).
257
- 2. **S3.2 (Rule-based reminders)** — quick win, complements S1.2.
258
- 3. **S1.1 (AI summarization) — Concierge MVP first, then Wizard-of-Oz** — validates value before engineering investment.
259
- 4. **S2.1 (Dashboard)** parallel track, different customer (enterprise), different eng workstream.
260
- 5. **S3.1 (AI action items)** — deferred pending S1.1 results.
261
-
262
- ### Iteration Notes (how this OST evolves)
263
-
264
- **Week 4 update (hypothetical):** S1.2 Template library shipped — survey shows only -12% prep time (not -30%). Learning: templates help new managers but experienced managers don't adopt. Update OST: split O1 opportunity by manager tenure. New sub-opportunity: «New managers face steepest learning curve» (O4 recombined here).
265
-
266
- **Week 6 update:** S1.1 Concierge MVP results positive (9 of 10 want feature). Green-light Wizard-of-Oz + engineering investment. Branch expanded: add «privacy architecture» as a necessary sub-solution.
267
-
268
- **Week 10 update:** Wizard-of-Oz shows 72% summarization quality acceptable but privacy concerns from enterprise beta. Added new branch for compliance architecture.
269
-
270
- > **OST lesson:** Tree is a living document. The initial focus on «build AI» was correct, but through experiments it became clear that **template library + rule-based reminders** close 40-50% of value with 20% of effort. OST forced consideration of **alternatives** (S1.2, S3.2) that would otherwise have been skipped in favor of a better AI story.
271
-
272
- > **Reference to `$assumption-mapping`:** Each solution rests on assumptions. For S1.1 the top assumptions (privacy, LLM quality, willingness to pay) — tracked in the assumption map, validated through experiments.
273
-
274
- ## Experiment Card Templates (per branch)
275
-
276
- Each solution in the OST has a prescribed experiment design. Use these templates.
277
-
278
- ### Template A: Concierge MVP (validates Value)
279
-
280
- ```markdown
281
- # Experiment: S1.1 Concierge MVP — AI Summarization
282
-
283
- **Tree branch:** Desired Outcome → O1 Admin Overhead → S1.1 AI Summarization
284
- **Status:** Active Week 2-5 (4-week experiment)
285
-
286
- ## What we're testing
287
- Value of AI-summarized 1:1s — are managers willing to change their workflow?
288
-
289
- ## Method
290
- Concierge (human-powered «AI»): 10 beta customer-managers email us meeting audio; PM + Designer manually produce summary + action items within 24 hours, deliver back in TeamFlow UI marked «AI-generated» (actually human).
291
-
292
- ## Prediction
293
- - ≥ 7 of 10 managers want to continue after 2 weeks
294
- - Reported time-saving 2 hrs / week
295
- - Action items completion rate lifts ≥ 10pp
296
-
297
- ## Measurement
298
- - Weekly survey (5-min)
299
- - Exit interview week 4
300
- - Self-reported time diary
301
- - Action items completion tracked in product
302
-
303
- ## Success threshold
304
- ≥ 7 of 10 say «want to keep» → green-light Wizard-of-Oz
305
-
306
- ## Decision rules
307
- - 9+ positive → Strong signal, invest aggressively
308
- - 7-8 positive → Proceed to Wizard-of-Oz w/ standard investment
309
- - 5-6 positive Iterate concierge, retest
310
- - <5 positive Kill branch or major re-shape
311
-
312
- ## Ties to assumption map
313
- Validates V1 (willingness to pay — indirectly via willingness to use), V2 (adoption intent)
314
- ```
315
-
316
- ### Template B: Wizard-of-Oz (validates Feasibility + Usability)
317
-
318
- ```markdown
319
- # Experiment: S1.1 Wizard-of-Oz — AI Summarization (Real LLM)
320
-
321
- **Tree branch:** Same as Concierge, progression step
322
- **Status:** Scheduled Week 6-9 (after Concierge validation)
323
-
324
- ## What we're testing
325
- (a) LLM quality can real AI produce acceptable output?
326
- (b) User trust will managers edit vs abandon?
327
-
328
- ## Method
329
- Real OpenAI GPT-4 integration with 20 beta managers. Full pipeline: meeting recording transcription → LLM summarization → manager review UI. Heavy telemetry + feedback.
330
-
331
- ## Prediction
332
- - Summary quality ≥ 4 out of 5 avg (blind rating)
333
- - Edit rate 30-50% (not 0% = blind approve; not 100% = AI useless)
334
- - P95 latency ≤ 60s
335
- - Managers continue using after 4 weeks ( 80%)
336
-
337
- ## Measurement
338
- - Summary rating (1-5) per meeting via in-product prompt
339
- - Edit telemetry (what % of summary changed)
340
- - Generation latency (server metrics)
341
- - Drop-off tracking (weekly active managers in beta)
342
-
343
- ## Success thresholds
344
- - Quality: ≥ 85% of summaries rated ≥ 4
345
- - Edit rate: 25-55% (sweet spot)
346
- - Continuation: ≥ 80% still using at week 4
347
-
348
- ## Decision rules
349
- - All thresholds met Green-light GA investment
350
- - 2/3 met Iterate prompt + retry week 10-11
351
- - 1/3 met Consider feature gate (human-review layer)
352
- - 0/3 met → Branch kill
353
-
354
- ## Ties to assumption map
355
- Primary validation for F1 (LLM quality) + U1 (manager trust)
356
- ```
357
-
358
- ### Template C: A/B Test (validates Scale)
359
-
360
- ```markdown
361
- # Experiment: S3.2 Rule-Based Reminders — A/B Test
362
-
363
- **Tree branch:** Desired Outcome → O3 Action Items Dropped → S3.2 Rule-Based Reminders
364
- **Status:** Scheduled Week 12-16 (post-MVP, iteration)
365
-
366
- ## What we're testing
367
- Do simple reminders (without AI) meaningfully improve action items completion rate?
368
-
369
- ## Method
370
- Classic A/B. 50% of users receive auto-reminder 2 days before next 1:1 showing open action items. Control: no reminder.
371
-
372
- ## Prediction
373
- - Action items completion lifts from 60% (baseline) to 68%+ (+8pp)
374
- - No increase in «reminder fatigue» (measured via opt-out rate)
375
-
376
- ## Measurement
377
- - Action items completed / created ratio (weekly)
378
- - Opt-out rate of reminders
379
- - NPS impact (separate survey)
380
-
381
- ## Sample & duration
382
- - Eligible users / week: ~500
383
- - Sample per variant: 3200 (calculated via `$ab-test-design`)
384
- - Duration: 13 weeks
385
-
386
- ## Guardrails
387
- - Opt-out rate < 15%
388
- - NPS not declining
389
-
390
- ## Decision rules
391
- - +8pp AND guardrails ok → Ship to all
392
- - +4 to +8pp → Ship with iterated copy
393
- - <+4pp Kill, pivot to S3.1 AI reminders
394
-
395
- ## Ties to assumption map
396
- Validates U2 (usability — do reminders annoy?) + V4 (lower-case value of simple solution)
397
- ```
398
-
399
- ### Template D: Design Partner Sessions (validates Enterprise Value)
400
-
401
- ```markdown
402
- # Experiment: S2.1 Aggregate Dashboard — Enterprise Design Partner
403
-
404
- **Tree branch:** Desired Outcome → O2 Buyer Visibility → S2.1 Aggregate Dashboard
405
- **Status:** Week 6-12
406
-
407
- ## What we're testing
408
- Will VP HR / CPO at enterprise customers upgrade tier based on dashboard value?
409
-
410
- ## Method
411
- Concierge dashboard for 5 enterprise prospects. Manual data compilation weekly, delivered as «beta dashboard preview» Google Sheet. Track:
412
- - Usage frequency (do they check it?)
413
- - Reactions («would you pay for this?»)
414
- - Upgrade conversion (stated intent → actual upgrade)
415
-
416
- ## Prediction
417
- - 4 of 5 prospects say «would use weekly»
418
- - 3 of 5 express upgrade intent
419
- - 2 of 5 actually upgrade within 30 days
420
-
421
- ## Measurement
422
- - Qualitative interview notes
423
- - Upgrade CRM status
424
- - Dashboard access frequency (even if manual we track requests)
425
-
426
- ## Success thresholds
427
- - Use intent: 4 of 5
428
- - Upgrade conversion: 2 of 5
429
-
430
- ## Decision rules
431
- - 2+ upgrade → Green-light productization
432
- - 0-1 upgrade → Re-evaluate dashboard scope based on feedback
433
- - Stalled at «like it but not upgrade» → Investigate pricing / packaging
434
-
435
- ## Ties to assumption map
436
- Primary validation for V3 (buyer tier upgrade) + V4 (expansion driver)
437
- ```
438
-
439
- ### Experiment Portfolio Overview
440
-
441
- | # | Experiment | Method | Duration | Signal Type | Investment |
442
- |---|-----------|--------|:--------:|-------------|:----------:|
443
- | EXP-010 | S1.1 Concierge MVP | Manual | 4 weeks | Qualitative | Low ($2K) |
444
- | EXP-011 | S1.1 Wizard-of-Oz | Real LLM + heavy telemetry | 4 weeks | Quant + Qual | Medium ($15K) |
445
- | EXP-012 | S3.2 Rule-based reminders A/B | Production A/B | 13 weeks | Quantitative | Low ($5K — eng time only) |
446
- | EXP-013 | S2.1 Enterprise dashboard | Design partners | 6 weeks | Mixed | Medium ($10K) |
447
- | EXP-014 | S1.2 Template library A/B | Production A/B | 4 weeks | Quantitative | Low ($3K) |
448
-
449
- ### Portfolio Balance
450
-
451
- - **Mix of methods:** qualitative (Concierge, Design Partner) + quantitative (A/B) + engineering-heavy (Wizard-of-Oz)
452
- - **Mix of speed:** fast (Concierge 4wk, reminders A/B early results 6wk) + slow (full A/B 13wk)
453
- - **Mix of investment:** low-risk ($2-5K) + medium ($10-15K)
454
- - **Decision dependencies:** Concierge → Wizard → Full AI investment (stepped commitment)
455
-
456
- > **Experiment portfolio lesson:** Mixing methods + durations prevents analysis paralysis. Fast low-investment experiments (Concierge) unlock confidence for higher investment (Wizard-of-Oz). Clear **decision rules per experiment** — if threshold not met, action is pre-specified (kill / iterate / pivot). Without pre-committed decision rules, «we'll see» becomes chronic postponement.
1
+ ---
2
+ name: opportunity-solution-tree
3
+ description: Opportunity-Solution Tree by Teresa Torres — desired outcome → opportunities → solutions → experiments
4
+ type: triggered
5
+ domain: product
6
+ owners:
7
+ - discovery
8
+ gates:
9
+ - DISCOVERY
10
+ tech: []
11
+ topic: []
12
+ triggers:
13
+ - opportunity-solution-tree
14
+ - OST
15
+ - дерево возможностей
16
+ - Teresa Torres
17
+ related: []
18
+ budget_lines: 472
19
+ schema_version: 1
20
+ ---
21
+ # Opportunity-Solution Tree (OST)
22
+
23
+ > **Category:** Discovery · **Slug:** `opportunity-solution-tree`
24
+
25
+ ## When to Use
26
+
27
+ - When you have a desired outcome (from strategy or OKR) but it's unclear **how** to achieve it.
28
+ - During continuous discovery as an artifact that evolves over time.
29
+ - When the team jumps into solution mode without understanding the opportunity space.
30
+ - When handing off discovery to engineering OST shows the rationale behind solutions.
31
+
32
+ ## Input
33
+
34
+ | Field | Required | Description |
35
+ |-------|:--------:|-------------|
36
+ | Desired outcome | | Business or product outcome (from NSM/OKR) |
37
+ | Customer research | | Interviews, JTBD, problems |
38
+ | Current assumptions | | Which hypotheses already exist |
39
+ | Time box | | Usually 1-2 weeks for MVP tree |
40
+
41
+ ## Data Sources
42
+
43
+ 1. `$user-interview-script` transcripts — opportunities from pain points.
44
+ 2. `$jtbd-canvas` — opportunity jobs.
45
+ 3. Support tickets, NPS comments — evidence for opportunities.
46
+ 4. Product analytics quantitative signal.
47
+
48
+ ### Related Skills
49
+
50
+ | Skill | What we take | When to invoke |
51
+ |-------|-------------|----------------|
52
+ | `jtbd-canvas` | Underserved jobs → opportunities | When branching opportunities |
53
+ | `problem-statement` | Specific problem → opportunity node | For articulating a specific opportunity |
54
+ | `assumption-mapping` | Assumptions underlying solutions | For each solution node |
55
+ | `hypothesis-template` | Solution → testable hypothesis | For experiment nodes |
56
+
57
+ ## Tree Structure
58
+
59
+ ```
60
+ Desired Outcome (top)
61
+
62
+ ├── Opportunity 1 (customer problem / unmet job)
63
+ │ ├── Solution 1a (product idea)
64
+ │ │ └── Experiment 1a-i (test)
65
+ │ └── Solution 1b
66
+
67
+ ├── Opportunity 2
68
+ │ └── Solution 2a
69
+ │ └── Experiment 2a-i
70
+
71
+ └── Opportunity 3
72
+ ```
73
+
74
+ ## Protocol
75
+
76
+ ### Step 0 Desired Outcome
77
+
78
+ Formulate **one** outcome. Must be measurable, tied to a business metric.
79
+
80
+ Good: «Increase weekly active teams by 30% in 2 quarters»
81
+ ❌ Bad: «Improve engagement»
82
+
83
+ ### Step 1 — Opportunities
84
+
85
+ Opportunities are customer problems or unmet jobs. From JTBD + interviews.
86
+
87
+ Rules:
88
+ - Formulated as a **customer** statement, not a solution
89
+ - Evidence-backed (evidence ≥ 2 sources or mark 🔮 assumed)
90
+ - Mutually exclusive (no overlap)
91
+
92
+ Limit: 3-7 opportunities at the top level. More → split into sub-branches.
93
+
94
+ | # | Opportunity | Evidence | Estimated impact |
95
+ |---|-------------|----------|-------------------|
96
+ | O1 | [customer pain / unmet job] | JTBD F3, Interview 4-7 | High / Medium / Low |
97
+
98
+ ### Step 2 — Opportunity Assessment
99
+
100
+ For each — importance × reach × gap (where the current solution falls short).
101
+
102
+ Prune opportunities with low impact — focus on top 3.
103
+
104
+ ### Step 3 — Solutions
105
+
106
+ For each **top** opportunity — 2-5 solution candidates. These are product ideas, features, process changes.
107
+
108
+ Rules:
109
+ - Specific (not «improve onboarding», but «in-app checklist for first 3 tasks»)
110
+ - One opportunity can have multiple solutions — don't choose immediately
111
+ - Solutions can be «non-product» (docs, training, pricing change)
112
+
113
+ ### Step 4Experiments
114
+
115
+ For each solution an experiment (or several) for validation.
116
+
117
+ | Solution | Experiment | Prediction | Measurement | Time box |
118
+ |----------|------------|------------|-------------|-----------|
119
+ | In-app checklist | A/B test with 50% new users | +20% first-week activation | Funnel metric | 4 weeks |
120
+
121
+ Experiments — via `$hypothesis-template` + `$ab-test-design`.
122
+
123
+ ### Step 5 Prioritization
124
+
125
+ Prioritize experiments, not solutions. Sometimes quick experiments reveal that an opportunity is weaker than it seemed.
126
+
127
+ Use `$rice-scoring` at the experiment level.
128
+
129
+ ### Step 6 — Iteration
130
+
131
+ OST is a living document. After each experiment:
132
+ - Update evidence
133
+ - Prune debunked branches
134
+ - Add new opportunities from findings
135
+
136
+ ## Validation (Quality Gate)
137
+
138
+ - [ ] Desired outcome measurable, tied to a metric
139
+ - [ ] 3-7 opportunities at the top level
140
+ - [ ] Each opportunity — customer-framed, not a solution
141
+ - [ ] Evidence (≥ 2 sources) or mark 🔮
142
+ - [ ] Top 3 opportunities have 2 solutions
143
+ - [ ] Each solution has ≥ 1 experiment
144
+ - [ ] Experiments prioritized (RICE)
145
+
146
+ ## Handoff
147
+
148
+ The result is input for:
149
+ - **`rice-scoring`** — prioritization of experiments
150
+ - **`hypothesis-template`**formalize experiments
151
+ - **PM**: solutions → feature candidates for roadmap
152
+ - **`assumption-mapping`**assumptions behind solutions
153
+
154
+ Format: OST as markdown tree or Miro-style diagram (exported). Via `$handoff`.
155
+
156
+ ## Anti-patterns
157
+
158
+ | Error | Why it's bad | How to do it right |
159
+ |-------|-------------|-------------------|
160
+ | Solution-first tree | Skip opportunity space | Start with outcome opportunities |
161
+ | Opportunities = features | Just a renamed backlog | Opportunity = customer problem |
162
+ | No experiments | Solutions not validated | Each solution has an experiment |
163
+ | Tree never updated | Frozen OST = dead OST | Weekly/biweekly update |
164
+ | Choosing solution before validation | Locked into first idea | Keep alternatives until experiment data |
165
+
166
+ ## Worked Example — TeamFlow OST (AI 1:1 Summarization initiative)
167
+
168
+ **Context:** TeamFlow (B2B SaaS HR-tech, $8M ARR, NRR 105%). After Discovery (8 interviews + JTBD canvas), the team builds the OST for the chosen desired outcome. Question: «Should we build AI summarization?» — OST should show the opportunity space and alternatives, not jump to a conclusion.
169
+
170
+ ### Desired Outcome
171
+
172
+ ```
173
+ Grow Net Revenue Retention from 105% to 115% within 4 quarters
174
+ ```
175
+
176
+ **Why this metric:** NRR is the key SaaS metric for Board and investors. Growth in NRR drives ARR, moat, margin. A 10pp lift is tied to expansion + churn reduction, both achievable by increasing manager engagement with the product.
177
+
178
+ **Why not another metric:**
179
+ - «Activation rate» — too leading, doesn't prove business outcome.
180
+ - «Weekly active teams» vanity if not linked with revenue.
181
+ - «ARR growth» too lagging, NRR = leading indicator.
182
+
183
+ ### Opportunity Layer (from JTBD canvas + Discovery Brief)
184
+
185
+ ```
186
+ Desired: Grow NRR 105% 115%
187
+
188
+ ├── O1: Managers spend 3-4 hrs/week on 1:1 admin overhead
189
+ (buyer and end-user pain)
190
+ │ Evidence: 6 of 8 interviews, internal analytics shows prep time
191
+ Estimated reach: 3,000 active managers across customer base
192
+ Impact: retention + expansion (see Opp assessment)
193
+
194
+ ├── O2: VP HR lack visibility into manager 1:1 practice/quality across org
195
+ (buyer pain)
196
+ │ Evidence: 4 of 4 buyer interviews
197
+ │ Estimated reach: 200 buyer accounts
198
+ │ Impact: expansion opportunity (enterprise tier)
199
+
200
+ ├── O3: 40% of 1:1 action items dropped / forgotten
201
+ │ (end-user pain)
202
+ │ Evidence: 5 of 8 end-user interviews, internal product data (completed/total)
203
+ │ Estimated reach: 3,000 managers × 8 reports × 40% = significant
204
+ │ Impact: user retention + NPS
205
+
206
+ ├── O4: New-manager onboarding takes 6+ months until «effective 1:1»
207
+ (buyer pain F4)
208
+ │ Evidence: 3 of 4 buyer interviews
209
+ │ Estimated reach: ~1,000 new managers/year across customer base
210
+ │ Impact: smaller, but high-value for enterprise segment
211
+
212
+ └── O5: Cross-manager 1:1 experience inconsistency
213
+ (buyer + skip-level + employee pain)
214
+ Evidence: 2 of 4 buyer interviews, 1 end-user correlation
215
+ Estimated reach: Broad but fuzzy hard to quantify
216
+ Impact: employee NPS / retention, but indirect
217
+ ```
218
+
219
+ ### Opportunity Assessment
220
+
221
+ | # | Opportunity | Importance (1-10) | Reach (users) | Gap (current vs desired) | Priority Score |
222
+ |---|-------------|:-----------------:|:-------------:|:------------------------:|:--------------:|
223
+ | O1 | Admin overhead | 10 | ~3,000 | Big (no tool today) | 52,000 |
224
+ | O3 | Action items dropped | 9 | ~3,000 | Big | 46,800 |
225
+ | O2 | Buyer visibility | 9 | ~200 (but $$$) | Big | 1,800 (absolute) but high deal size |
226
+ | O4 | New-mgr onboarding | 7 | ~1,000/yr | Medium | 7,000 |
227
+ | O5 | Cross-mgr consistency | 6 | Fuzzy | Medium | Hard to score |
228
+
229
+ **Pruning:** focus on top 3: **O1 (admin overhead), O3 (action items), O2 (buyer visibility)** all interconnected and one feature can address multiple.
230
+
231
+ ### Solutions Layer (for top 3 opportunities)
232
+
233
+ ```
234
+ O1: Admin overhead
235
+ ├── S1.1: AI-powered real-time transcription + summarization during 1:1
236
+ │ (hypothesis: address capture-during-meeting pain + action items extraction)
237
+ ├── S1.2: Template library + auto-population from previous 1:1s
238
+ │ (lighter: not AI, but reduces prep time)
239
+ ├── S1.3: 1:1 duration reduction through structured agendas + time-boxing UI
240
+ (behavioral, not AI)
241
+ └── S1.4: Async 1:1s (structured written async for some updates)
242
+ (radical: reduce live meeting count)
243
+
244
+ O3: Action items dropped
245
+ ├── S3.1: AI-extracted action items with automated reminders in the next 1:1
246
+ │ (ties to S1.1 — AI summarization output)
247
+ ├── S3.2: Calendar / Slack integration reminders 2 days before 1:1
248
+ │ (lighter: rules-based)
249
+ └── S3.3: Weekly digest email «here's what you committed to» with status checkboxes
250
+
251
+ O2: Buyer visibility
252
+ ├── S2.1: Aggregate dashboard: 1:1 frequency, duration, action items velocity per team
253
+ │ (rule-based analytics, no AI needed for basic version)
254
+ ├── S2.2: AI-inferred 1:1 health score (combining multiple signals)
255
+ │ (more advanced, needs S1.1 infrastructure)
256
+ └── S2.3: Quarterly People Ops report generator (auto-composed for Board)
257
+ ```
258
+
259
+ ### Experiments Layer (for top solutions)
260
+
261
+ | Solution | Experiment | Prediction | Measurement | Time box | RICE score |
262
+ |----------|------------|------------|-------------|----------|:----------:|
263
+ | S1.1: AI summarization | Concierge MVP: human «AI» assists 10 beta customers manually for 2 weeks | Managers save 30 min/week; want to keep | Survey + retention | 3 weeks | 35 |
264
+ | S1.1: AI summarization | Wizard-of-Oz: Real OpenAI API integration with 20 beta managers, tightly monitored | 20% drop-off acceptable; summarization quality > 70% acceptable | Manager survey + usage data | 4 weeks | 48 |
265
+ | S1.2: Template library | Ship feature flag to 50% users, measure prep time via surveys | -30% prep time self-reported | Survey + funnel | 4 weeks | 28 |
266
+ | S3.1: AI action items | Builds on S1.1 separate evaluation: after AI summary generated, opt-in reminder system | +20% action items completed | Product analytics | 6 weeks | 30 |
267
+ | S3.2: Rule-based reminders | Ship to all users, measure completion rate | +10% action items completed | Product analytics | 2 weeks | 25 |
268
+ | S2.1: Aggregate dashboard | Beta 5 enterprise customers, qualitative interviews after 30 days | 4 of 5 say «would use weekly» | Interviews | 6 weeks | 22 |
269
+
270
+ ### Experiment Prioritization
271
+
272
+ 1. **S1.2 (Template library)**low-risk, fast, independent. **Ship this first** (4 weeks).
273
+ 2. **S3.2 (Rule-based reminders)** — quick win, complements S1.2.
274
+ 3. **S1.1 (AI summarization) Concierge MVP first, then Wizard-of-Oz** — validates value before engineering investment.
275
+ 4. **S2.1 (Dashboard)** — parallel track, different customer (enterprise), different eng workstream.
276
+ 5. **S3.1 (AI action items)** deferred pending S1.1 results.
277
+
278
+ ### Iteration Notes (how this OST evolves)
279
+
280
+ **Week 4 update (hypothetical):** S1.2 Template library shipped — survey shows only -12% prep time (not -30%). Learning: templates help new managers but experienced managers don't adopt. → Update OST: split O1 opportunity by manager tenure. New sub-opportunity: «New managers face steepest learning curve» (O4 recombined here).
281
+
282
+ **Week 6 update:** S1.1 Concierge MVP results positive (9 of 10 want feature). Green-light Wizard-of-Oz + engineering investment. Branch expanded: add «privacy architecture» as a necessary sub-solution.
283
+
284
+ **Week 10 update:** Wizard-of-Oz shows 72% summarization quality acceptable but privacy concerns from enterprise beta. Added new branch for compliance architecture.
285
+
286
+ > **OST lesson:** Tree is a living document. The initial focus on «build AI» was correct, but through experiments it became clear that **template library + rule-based reminders** close 40-50% of value with 20% of effort. OST forced consideration of **alternatives** (S1.2, S3.2) that would otherwise have been skipped in favor of a better AI story.
287
+
288
+ > **Reference to `$assumption-mapping`:** Each solution rests on assumptions. For S1.1 the top assumptions (privacy, LLM quality, willingness to pay) — tracked in the assumption map, validated through experiments.
289
+
290
+ ## Experiment Card Templates (per branch)
291
+
292
+ Each solution in the OST has a prescribed experiment design. Use these templates.
293
+
294
+ ### Template A: Concierge MVP (validates Value)
295
+
296
+ ```markdown
297
+ # Experiment: S1.1 Concierge MVP — AI Summarization
298
+
299
+ **Tree branch:** Desired Outcome → O1 Admin Overhead → S1.1 AI Summarization
300
+ **Status:** Active Week 2-5 (4-week experiment)
301
+
302
+ ## What we're testing
303
+ Value of AI-summarized 1:1s — are managers willing to change their workflow?
304
+
305
+ ## Method
306
+ Concierge (human-powered «AI»): 10 beta customer-managers email us meeting audio; PM + Designer manually produce summary + action items within 24 hours, deliver back in TeamFlow UI marked «AI-generated» (actually human).
307
+
308
+ ## Prediction
309
+ - 7 of 10 managers want to continue after 2 weeks
310
+ - Reported time-saving 2 hrs / week
311
+ - Action items completion rate lifts ≥ 10pp
312
+
313
+ ## Measurement
314
+ - Weekly survey (5-min)
315
+ - Exit interview week 4
316
+ - Self-reported time diary
317
+ - Action items completion tracked in product
318
+
319
+ ## Success threshold
320
+ ≥ 7 of 10 say «want to keep» → green-light Wizard-of-Oz
321
+
322
+ ## Decision rules
323
+ - 9+ positive → Strong signal, invest aggressively
324
+ - 7-8 positive → Proceed to Wizard-of-Oz w/ standard investment
325
+ - 5-6 positive Iterate concierge, retest
326
+ - <5 positive Kill branch or major re-shape
327
+
328
+ ## Ties to assumption map
329
+ Validates V1 (willingness to pay indirectly via willingness to use), V2 (adoption intent)
330
+ ```
331
+
332
+ ### Template B: Wizard-of-Oz (validates Feasibility + Usability)
333
+
334
+ ```markdown
335
+ # Experiment: S1.1 Wizard-of-Oz AI Summarization (Real LLM)
336
+
337
+ **Tree branch:** Same as Concierge, progression step
338
+ **Status:** Scheduled Week 6-9 (after Concierge validation)
339
+
340
+ ## What we're testing
341
+ (a) LLM quality can real AI produce acceptable output?
342
+ (b) User trust — will managers edit vs abandon?
343
+
344
+ ## Method
345
+ Real OpenAI GPT-4 integration with 20 beta managers. Full pipeline: meeting recording → transcription → LLM summarization → manager review UI. Heavy telemetry + feedback.
346
+
347
+ ## Prediction
348
+ - Summary quality ≥ 4 out of 5 avg (blind rating)
349
+ - Edit rate 30-50% (not 0% = blind approve; not 100% = AI useless)
350
+ - P95 latency 60s
351
+ - Managers continue using after 4 weeks ( 80%)
352
+
353
+ ## Measurement
354
+ - Summary rating (1-5) per meeting via in-product prompt
355
+ - Edit telemetry (what % of summary changed)
356
+ - Generation latency (server metrics)
357
+ - Drop-off tracking (weekly active managers in beta)
358
+
359
+ ## Success thresholds
360
+ - Quality: ≥ 85% of summaries rated ≥ 4
361
+ - Edit rate: 25-55% (sweet spot)
362
+ - Continuation: ≥ 80% still using at week 4
363
+
364
+ ## Decision rules
365
+ - All thresholds met → Green-light GA investment
366
+ - 2/3 met → Iterate prompt + retry week 10-11
367
+ - 1/3 met Consider feature gate (human-review layer)
368
+ - 0/3 met → Branch kill
369
+
370
+ ## Ties to assumption map
371
+ Primary validation for F1 (LLM quality) + U1 (manager trust)
372
+ ```
373
+
374
+ ### Template C: A/B Test (validates Scale)
375
+
376
+ ```markdown
377
+ # Experiment: S3.2 Rule-Based Reminders A/B Test
378
+
379
+ **Tree branch:** Desired Outcome → O3 Action Items Dropped → S3.2 Rule-Based Reminders
380
+ **Status:** Scheduled Week 12-16 (post-MVP, iteration)
381
+
382
+ ## What we're testing
383
+ Do simple reminders (without AI) meaningfully improve action items completion rate?
384
+
385
+ ## Method
386
+ Classic A/B. 50% of users receive auto-reminder 2 days before next 1:1 showing open action items. Control: no reminder.
387
+
388
+ ## Prediction
389
+ - Action items completion lifts from 60% (baseline) to 68%+ (+8pp)
390
+ - No increase in «reminder fatigue» (measured via opt-out rate)
391
+
392
+ ## Measurement
393
+ - Action items completed / created ratio (weekly)
394
+ - Opt-out rate of reminders
395
+ - NPS impact (separate survey)
396
+
397
+ ## Sample & duration
398
+ - Eligible users / week: ~500
399
+ - Sample per variant: 3200 (calculated via `$ab-test-design`)
400
+ - Duration: 13 weeks
401
+
402
+ ## Guardrails
403
+ - Opt-out rate < 15%
404
+ - NPS not declining
405
+
406
+ ## Decision rules
407
+ - +8pp AND guardrails ok → Ship to all
408
+ - +4 to +8pp Ship with iterated copy
409
+ - <+4pp → Kill, pivot to S3.1 AI reminders
410
+
411
+ ## Ties to assumption map
412
+ Validates U2 (usability do reminders annoy?) + V4 (lower-case value of simple solution)
413
+ ```
414
+
415
+ ### Template D: Design Partner Sessions (validates Enterprise Value)
416
+
417
+ ```markdown
418
+ # Experiment: S2.1 Aggregate Dashboard Enterprise Design Partner
419
+
420
+ **Tree branch:** Desired Outcome → O2 Buyer Visibility → S2.1 Aggregate Dashboard
421
+ **Status:** Week 6-12
422
+
423
+ ## What we're testing
424
+ Will VP HR / CPO at enterprise customers upgrade tier based on dashboard value?
425
+
426
+ ## Method
427
+ Concierge dashboard for 5 enterprise prospects. Manual data compilation weekly, delivered as «beta dashboard preview» Google Sheet. Track:
428
+ - Usage frequency (do they check it?)
429
+ - Reactions («would you pay for this?»)
430
+ - Upgrade conversion (stated intent → actual upgrade)
431
+
432
+ ## Prediction
433
+ - 4 of 5 prospects say «would use weekly»
434
+ - 3 of 5 express upgrade intent
435
+ - 2 of 5 actually upgrade within 30 days
436
+
437
+ ## Measurement
438
+ - Qualitative interview notes
439
+ - Upgrade CRM status
440
+ - Dashboard access frequency (even if manual — we track requests)
441
+
442
+ ## Success thresholds
443
+ - Use intent: 4 of 5
444
+ - Upgrade conversion: 2 of 5
445
+
446
+ ## Decision rules
447
+ - 2+ upgrade Green-light productization
448
+ - 0-1 upgrade → Re-evaluate dashboard scope based on feedback
449
+ - Stalled at «like it but not upgrade» → Investigate pricing / packaging
450
+
451
+ ## Ties to assumption map
452
+ Primary validation for V3 (buyer tier upgrade) + V4 (expansion driver)
453
+ ```
454
+
455
+ ### Experiment Portfolio Overview
456
+
457
+ | # | Experiment | Method | Duration | Signal Type | Investment |
458
+ |---|-----------|--------|:--------:|-------------|:----------:|
459
+ | EXP-010 | S1.1 Concierge MVP | Manual | 4 weeks | Qualitative | Low ($2K) |
460
+ | EXP-011 | S1.1 Wizard-of-Oz | Real LLM + heavy telemetry | 4 weeks | Quant + Qual | Medium ($15K) |
461
+ | EXP-012 | S3.2 Rule-based reminders A/B | Production A/B | 13 weeks | Quantitative | Low ($5K — eng time only) |
462
+ | EXP-013 | S2.1 Enterprise dashboard | Design partners | 6 weeks | Mixed | Medium ($10K) |
463
+ | EXP-014 | S1.2 Template library A/B | Production A/B | 4 weeks | Quantitative | Low ($3K) |
464
+
465
+ ### Portfolio Balance
466
+
467
+ - **Mix of methods:** qualitative (Concierge, Design Partner) + quantitative (A/B) + engineering-heavy (Wizard-of-Oz)
468
+ - **Mix of speed:** fast (Concierge 4wk, reminders A/B early results 6wk) + slow (full A/B 13wk)
469
+ - **Mix of investment:** low-risk ($2-5K) + medium ($10-15K)
470
+ - **Decision dependencies:** Concierge → Wizard → Full AI investment (stepped commitment)
471
+
472
+ > **Experiment portfolio lesson:** Mixing methods + durations prevents analysis paralysis. Fast low-investment experiments (Concierge) unlock confidence for higher investment (Wizard-of-Oz). Clear **decision rules per experiment** — if threshold not met, action is pre-specified (kill / iterate / pivot). Without pre-committed decision rules, «we'll see» becomes chronic postponement.