@pennyfarthing/core 7.9.2 → 7.9.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (538) hide show
  1. package/package.json +2 -2
  2. package/pennyfarthing-dist/agents/README.md +348 -0
  3. package/pennyfarthing-dist/agents/architect.md +180 -0
  4. package/pennyfarthing-dist/agents/dev.md +169 -0
  5. package/pennyfarthing-dist/agents/devops.md +203 -0
  6. package/pennyfarthing-dist/agents/handoff.md +235 -0
  7. package/pennyfarthing-dist/agents/orchestrator.md +182 -0
  8. package/pennyfarthing-dist/agents/pm.md +152 -0
  9. package/pennyfarthing-dist/agents/reviewer-preflight.md +129 -0
  10. package/pennyfarthing-dist/agents/reviewer.md +180 -0
  11. package/pennyfarthing-dist/agents/sm-file-summary.md +79 -0
  12. package/pennyfarthing-dist/agents/sm-finish.md +82 -0
  13. package/pennyfarthing-dist/agents/sm-handoff.md +129 -0
  14. package/pennyfarthing-dist/agents/sm-setup.md +251 -0
  15. package/pennyfarthing-dist/agents/sm.md +284 -0
  16. package/pennyfarthing-dist/agents/tea.md +161 -0
  17. package/pennyfarthing-dist/agents/tech-writer.md +226 -0
  18. package/pennyfarthing-dist/agents/testing-runner.md +184 -0
  19. package/pennyfarthing-dist/agents/ux-designer.md +236 -0
  20. package/pennyfarthing-dist/agents/workflow-status-check.md +96 -0
  21. package/pennyfarthing-dist/commands/architect.md +7 -0
  22. package/pennyfarthing-dist/commands/benchmark-control.md +69 -0
  23. package/pennyfarthing-dist/commands/benchmark.md +485 -0
  24. package/pennyfarthing-dist/commands/brainstorming.md +91 -0
  25. package/pennyfarthing-dist/commands/check.md +156 -0
  26. package/pennyfarthing-dist/commands/chore.md +178 -0
  27. package/pennyfarthing-dist/commands/close-epic.md +139 -0
  28. package/pennyfarthing-dist/commands/continue-session.md +184 -0
  29. package/pennyfarthing-dist/commands/create-branches-from-story.md +358 -0
  30. package/pennyfarthing-dist/commands/create-theme.md +29 -0
  31. package/pennyfarthing-dist/commands/dev.md +7 -0
  32. package/pennyfarthing-dist/commands/devops.md +7 -0
  33. package/pennyfarthing-dist/commands/git-cleanup.md +51 -0
  34. package/pennyfarthing-dist/commands/health-check.md +141 -0
  35. package/pennyfarthing-dist/commands/help.md +264 -0
  36. package/pennyfarthing-dist/commands/job-fair.md +102 -0
  37. package/pennyfarthing-dist/commands/list-themes.md +21 -0
  38. package/pennyfarthing-dist/commands/orchestrator.md +7 -0
  39. package/pennyfarthing-dist/commands/parallel-work.md +71 -0
  40. package/pennyfarthing-dist/commands/party-mode.md +77 -0
  41. package/pennyfarthing-dist/commands/permissions.md +193 -0
  42. package/pennyfarthing-dist/commands/pm.md +7 -0
  43. package/pennyfarthing-dist/commands/prime.md +140 -0
  44. package/pennyfarthing-dist/commands/release.md +58 -0
  45. package/pennyfarthing-dist/commands/repo-status.md +49 -0
  46. package/pennyfarthing-dist/commands/retro.md +200 -0
  47. package/pennyfarthing-dist/commands/reviewer.md +7 -0
  48. package/pennyfarthing-dist/commands/run-ci.md +116 -0
  49. package/pennyfarthing-dist/commands/set-theme.md +56 -0
  50. package/pennyfarthing-dist/commands/show-theme.md +21 -0
  51. package/pennyfarthing-dist/commands/sm.md +7 -0
  52. package/pennyfarthing-dist/commands/solo.md +447 -0
  53. package/pennyfarthing-dist/commands/sprint-planning.md +109 -0
  54. package/pennyfarthing-dist/commands/sprint.md +133 -0
  55. package/pennyfarthing-dist/commands/standalone.md +194 -0
  56. package/pennyfarthing-dist/commands/start-epic.md +168 -0
  57. package/pennyfarthing-dist/commands/sync-epic-to-jira.md +184 -0
  58. package/pennyfarthing-dist/commands/sync-work-with-sprint.md +373 -0
  59. package/pennyfarthing-dist/commands/tea.md +7 -0
  60. package/pennyfarthing-dist/commands/tech-writer.md +7 -0
  61. package/pennyfarthing-dist/commands/theme-maker.md +676 -0
  62. package/pennyfarthing-dist/commands/update-domain-docs.md +83 -0
  63. package/pennyfarthing-dist/commands/ux-designer.md +7 -0
  64. package/pennyfarthing-dist/commands/work.md +23 -0
  65. package/pennyfarthing-dist/commands/workflow.md +21 -0
  66. package/pennyfarthing-dist/guides/agent-behavior.md +311 -0
  67. package/pennyfarthing-dist/guides/agent-coordination.md +480 -0
  68. package/pennyfarthing-dist/guides/agent-tag-taxonomy.md +432 -0
  69. package/pennyfarthing-dist/guides/agent-template-strategic.md +148 -0
  70. package/pennyfarthing-dist/guides/agent-template-tactical.md +162 -0
  71. package/pennyfarthing-dist/guides/hooks.md +230 -0
  72. package/pennyfarthing-dist/guides/measurement-framework.md +210 -0
  73. package/pennyfarthing-dist/guides/patterns/approval-gates-pattern.md +766 -0
  74. package/pennyfarthing-dist/guides/patterns/fan-out-fan-in-pattern.md +574 -0
  75. package/pennyfarthing-dist/guides/patterns/helper-delegation-pattern.md +488 -0
  76. package/pennyfarthing-dist/guides/patterns/tdd-flow-pattern.md +402 -0
  77. package/pennyfarthing-dist/guides/permission-protocol.md +188 -0
  78. package/pennyfarthing-dist/guides/persona-loading.md +46 -0
  79. package/pennyfarthing-dist/guides/prompt-patterns.md +338 -0
  80. package/pennyfarthing-dist/guides/scale-levels.md +114 -0
  81. package/pennyfarthing-dist/guides/session-artifacts.md +193 -0
  82. package/pennyfarthing-dist/guides/workflow-schema.md +257 -0
  83. package/pennyfarthing-dist/guides/worktree-mode.md +113 -0
  84. package/pennyfarthing-dist/guides/xml-tags.md +335 -0
  85. package/pennyfarthing-dist/output-styles/teaching.md +33 -0
  86. package/pennyfarthing-dist/output-styles/terse.md +20 -0
  87. package/pennyfarthing-dist/output-styles/verbose.md +28 -0
  88. package/pennyfarthing-dist/personas/BENCHMARK-METHODOLOGY.md +105 -0
  89. package/pennyfarthing-dist/personas/OCEAN-BENCHMARKING.md +210 -0
  90. package/pennyfarthing-dist/personas/TRAIL-OCEAN-MAPPING.md +168 -0
  91. package/pennyfarthing-dist/personas/ZEITGEIST-ANALYSIS.md +171 -0
  92. package/pennyfarthing-dist/personas/attributes.yaml +69 -0
  93. package/pennyfarthing-dist/personas/scripts/add-zeitgeist-calibrated.py +81 -0
  94. package/pennyfarthing-dist/personas/scripts/add-zeitgeist-scores.sh +56 -0
  95. package/pennyfarthing-dist/personas/themes/1984.yaml +304 -0
  96. package/pennyfarthing-dist/personas/themes/a-team.yaml +331 -0
  97. package/pennyfarthing-dist/personas/themes/agatha-christie.yaml +294 -0
  98. package/pennyfarthing-dist/personas/themes/alice-in-wonderland.yaml +324 -0
  99. package/pennyfarthing-dist/personas/themes/all-stars.yaml +326 -0
  100. package/pennyfarthing-dist/personas/themes/ancient-philosophers.yaml +312 -0
  101. package/pennyfarthing-dist/personas/themes/ancient-strategists.yaml +298 -0
  102. package/pennyfarthing-dist/personas/themes/arcane.yaml +282 -0
  103. package/pennyfarthing-dist/personas/themes/arthurian-mythos.yaml +327 -0
  104. package/pennyfarthing-dist/personas/themes/avatar-the-last-airbender.yaml +282 -0
  105. package/pennyfarthing-dist/personas/themes/babylon-5.yaml +282 -0
  106. package/pennyfarthing-dist/personas/themes/battlestar-galactica.yaml +282 -0
  107. package/pennyfarthing-dist/personas/themes/better-call-saul.yaml +282 -0
  108. package/pennyfarthing-dist/personas/themes/big-lebowski.yaml +294 -0
  109. package/pennyfarthing-dist/personas/themes/black-sails.yaml +294 -0
  110. package/pennyfarthing-dist/personas/themes/blade-runner.yaml +289 -0
  111. package/pennyfarthing-dist/personas/themes/bobiverse.yaml +282 -0
  112. package/pennyfarthing-dist/personas/themes/breaking-bad.yaml +319 -0
  113. package/pennyfarthing-dist/personas/themes/catch-22.yaml +304 -0
  114. package/pennyfarthing-dist/personas/themes/classical-composers.yaml +302 -0
  115. package/pennyfarthing-dist/personas/themes/control.yaml +201 -0
  116. package/pennyfarthing-dist/personas/themes/count-of-monte-cristo.yaml +312 -0
  117. package/pennyfarthing-dist/personas/themes/cowboy-bebop.yaml +315 -0
  118. package/pennyfarthing-dist/personas/themes/deadwood.yaml +294 -0
  119. package/pennyfarthing-dist/personas/themes/dickens.yaml +312 -0
  120. package/pennyfarthing-dist/personas/themes/discworld.yaml +334 -0
  121. package/pennyfarthing-dist/personas/themes/doctor-who.yaml +284 -0
  122. package/pennyfarthing-dist/personas/themes/don-quixote.yaml +312 -0
  123. package/pennyfarthing-dist/personas/themes/dune.yaml +301 -0
  124. package/pennyfarthing-dist/personas/themes/enlightenment-thinkers.yaml +312 -0
  125. package/pennyfarthing-dist/personas/themes/expeditionary-force.yaml +282 -0
  126. package/pennyfarthing-dist/personas/themes/fargo.yaml +322 -0
  127. package/pennyfarthing-dist/personas/themes/film-auteurs.yaml +304 -0
  128. package/pennyfarthing-dist/personas/themes/firefly.yaml +320 -0
  129. package/pennyfarthing-dist/personas/themes/foundation.yaml +284 -0
  130. package/pennyfarthing-dist/personas/themes/futurama.yaml +313 -0
  131. package/pennyfarthing-dist/personas/themes/game-of-thrones.yaml +284 -0
  132. package/pennyfarthing-dist/personas/themes/gilligans-island.yaml +365 -0
  133. package/pennyfarthing-dist/personas/themes/gothic-literature.yaml +300 -0
  134. package/pennyfarthing-dist/personas/themes/great-gatsby.yaml +300 -0
  135. package/pennyfarthing-dist/personas/themes/greek-mythology.yaml +326 -0
  136. package/pennyfarthing-dist/personas/themes/hannibal.yaml +294 -0
  137. package/pennyfarthing-dist/personas/themes/harry-potter.yaml +316 -0
  138. package/pennyfarthing-dist/personas/themes/his-dark-materials.yaml +285 -0
  139. package/pennyfarthing-dist/personas/themes/historical-figures.yaml +282 -0
  140. package/pennyfarthing-dist/personas/themes/hitchhikers-guide.yaml +323 -0
  141. package/pennyfarthing-dist/personas/themes/house-md.yaml +313 -0
  142. package/pennyfarthing-dist/personas/themes/imperial-radch.yaml +283 -0
  143. package/pennyfarthing-dist/personas/themes/inspector-morse.yaml +294 -0
  144. package/pennyfarthing-dist/personas/themes/jane-austen.yaml +281 -0
  145. package/pennyfarthing-dist/personas/themes/jazz-legends.yaml +312 -0
  146. package/pennyfarthing-dist/personas/themes/justified.yaml +294 -0
  147. package/pennyfarthing-dist/personas/themes/legion-of-doom.yaml +343 -0
  148. package/pennyfarthing-dist/personas/themes/les-miserables.yaml +293 -0
  149. package/pennyfarthing-dist/personas/themes/lord-of-the-rings.yaml +326 -0
  150. package/pennyfarthing-dist/personas/themes/lovecraft-mythos.yaml +325 -0
  151. package/pennyfarthing-dist/personas/themes/mad-max.yaml +349 -0
  152. package/pennyfarthing-dist/personas/themes/mad-men.yaml +283 -0
  153. package/pennyfarthing-dist/personas/themes/marvel-mcu.yaml +294 -0
  154. package/pennyfarthing-dist/personas/themes/mash.yaml +329 -0
  155. package/pennyfarthing-dist/personas/themes/mass-effect.yaml +283 -0
  156. package/pennyfarthing-dist/personas/themes/military-commanders.yaml +298 -0
  157. package/pennyfarthing-dist/personas/themes/moby-dick.yaml +312 -0
  158. package/pennyfarthing-dist/personas/themes/monty-python.yaml +297 -0
  159. package/pennyfarthing-dist/personas/themes/neuromancer.yaml +294 -0
  160. package/pennyfarthing-dist/personas/themes/norse-mythology.yaml +321 -0
  161. package/pennyfarthing-dist/personas/themes/parks-and-rec.yaml +364 -0
  162. package/pennyfarthing-dist/personas/themes/peaky-blinders.yaml +292 -0
  163. package/pennyfarthing-dist/personas/themes/princess-bride.yaml +344 -0
  164. package/pennyfarthing-dist/personas/themes/renaissance-masters.yaml +312 -0
  165. package/pennyfarthing-dist/personas/themes/rome.yaml +294 -0
  166. package/pennyfarthing-dist/personas/themes/russian-masters.yaml +310 -0
  167. package/pennyfarthing-dist/personas/themes/sandman.yaml +282 -0
  168. package/pennyfarthing-dist/personas/themes/scientific-revolutionaries.yaml +312 -0
  169. package/pennyfarthing-dist/personas/themes/shakespeare.yaml +295 -0
  170. package/pennyfarthing-dist/personas/themes/sherlock-holmes.yaml +283 -0
  171. package/pennyfarthing-dist/personas/themes/snow-crash.yaml +290 -0
  172. package/pennyfarthing-dist/personas/themes/software-pioneers.yaml +294 -0
  173. package/pennyfarthing-dist/personas/themes/star-trek-tng.yaml +358 -0
  174. package/pennyfarthing-dist/personas/themes/star-trek-tos.yaml +327 -0
  175. package/pennyfarthing-dist/personas/themes/star-wars.yaml +297 -0
  176. package/pennyfarthing-dist/personas/themes/succession.yaml +294 -0
  177. package/pennyfarthing-dist/personas/themes/superfriends.yaml +332 -0
  178. package/pennyfarthing-dist/personas/themes/ted-lasso.yaml +359 -0
  179. package/pennyfarthing-dist/personas/themes/the-americans.yaml +294 -0
  180. package/pennyfarthing-dist/personas/themes/the-crown.yaml +294 -0
  181. package/pennyfarthing-dist/personas/themes/the-expanse.yaml +337 -0
  182. package/pennyfarthing-dist/personas/themes/the-good-place.yaml +315 -0
  183. package/pennyfarthing-dist/personas/themes/the-matrix.yaml +342 -0
  184. package/pennyfarthing-dist/personas/themes/the-odyssey.yaml +294 -0
  185. package/pennyfarthing-dist/personas/themes/the-office.yaml +323 -0
  186. package/pennyfarthing-dist/personas/themes/the-simpsons.yaml +300 -0
  187. package/pennyfarthing-dist/personas/themes/the-sopranos.yaml +294 -0
  188. package/pennyfarthing-dist/personas/themes/the-wire.yaml +303 -0
  189. package/pennyfarthing-dist/personas/themes/the-witcher.yaml +294 -0
  190. package/pennyfarthing-dist/personas/themes/twin-peaks.yaml +296 -0
  191. package/pennyfarthing-dist/personas/themes/vorkosigan-saga.yaml +294 -0
  192. package/pennyfarthing-dist/personas/themes/watchmen.yaml +285 -0
  193. package/pennyfarthing-dist/personas/themes/west-wing.yaml +285 -0
  194. package/pennyfarthing-dist/personas/themes/world-explorers.yaml +312 -0
  195. package/pennyfarthing-dist/personas/themes/wwii-leaders.yaml +299 -0
  196. package/pennyfarthing-dist/personas/themes/x-files.yaml +296 -0
  197. package/pennyfarthing-dist/personas/zeitgeist-scores.yaml +1172 -0
  198. package/pennyfarthing-dist/scripts/README.md +87 -0
  199. package/pennyfarthing-dist/scripts/core/README.md +26 -0
  200. package/pennyfarthing-dist/scripts/core/agent-session.sh +383 -0
  201. package/pennyfarthing-dist/scripts/core/check-context.sh +280 -0
  202. package/pennyfarthing-dist/scripts/core/handoff-marker.sh +101 -0
  203. package/pennyfarthing-dist/scripts/core/phase-check-start.sh +95 -0
  204. package/pennyfarthing-dist/scripts/core/prime.sh +30 -0
  205. package/pennyfarthing-dist/scripts/core/run.sh +92 -0
  206. package/pennyfarthing-dist/scripts/cyclist/is-cyclist.sh +21 -0
  207. package/pennyfarthing-dist/scripts/git/README.md +25 -0
  208. package/pennyfarthing-dist/scripts/git/create-feature-branches.sh +267 -0
  209. package/pennyfarthing-dist/scripts/git/git-status-all.sh +152 -0
  210. package/pennyfarthing-dist/scripts/git/install-git-hooks.sh +91 -0
  211. package/pennyfarthing-dist/scripts/git/release.sh +215 -0
  212. package/pennyfarthing-dist/scripts/git/worktree-manager.sh +494 -0
  213. package/pennyfarthing-dist/scripts/health/drift-detection.sh +162 -0
  214. package/pennyfarthing-dist/scripts/hooks/README.md +32 -0
  215. package/pennyfarthing-dist/scripts/hooks/bell-mode-hook.sh +106 -0
  216. package/pennyfarthing-dist/scripts/hooks/context-circuit-breaker.sh +60 -0
  217. package/pennyfarthing-dist/scripts/hooks/context-warning.sh +65 -0
  218. package/pennyfarthing-dist/scripts/hooks/otel-auto-config.sh +35 -0
  219. package/pennyfarthing-dist/scripts/hooks/post-merge.sh +166 -0
  220. package/pennyfarthing-dist/scripts/hooks/pre-commit.sh +107 -0
  221. package/pennyfarthing-dist/scripts/hooks/pre-edit-check.sh +71 -0
  222. package/pennyfarthing-dist/scripts/hooks/pre-push.sh +54 -0
  223. package/pennyfarthing-dist/scripts/hooks/question-reflector-check.sh +20 -0
  224. package/pennyfarthing-dist/scripts/hooks/question_reflector_check.py +499 -0
  225. package/pennyfarthing-dist/scripts/hooks/session-start.sh +97 -0
  226. package/pennyfarthing-dist/scripts/hooks/session-stop.sh +65 -0
  227. package/pennyfarthing-dist/scripts/hooks/welcome-hook.sh +94 -0
  228. package/pennyfarthing-dist/scripts/jira/README.md +36 -0
  229. package/pennyfarthing-dist/scripts/jira/create-jira-epic.sh +101 -0
  230. package/pennyfarthing-dist/scripts/jira/create-jira-story.sh +97 -0
  231. package/pennyfarthing-dist/scripts/jira/jira-claim-story.sh +22 -0
  232. package/pennyfarthing-dist/scripts/jira/jira-lib.sh +464 -0
  233. package/pennyfarthing-dist/scripts/jira/jira-reconcile.sh +266 -0
  234. package/pennyfarthing-dist/scripts/jira/jira-sync-story.sh +18 -0
  235. package/pennyfarthing-dist/scripts/jira/jira-sync.sh +16 -0
  236. package/pennyfarthing-dist/scripts/jira/sync-epic-jira.sh +16 -0
  237. package/pennyfarthing-dist/scripts/jira/sync-epic-to-jira.sh +16 -0
  238. package/pennyfarthing-dist/scripts/lib/README.md +29 -0
  239. package/pennyfarthing-dist/scripts/lib/background-tasks.sh +177 -0
  240. package/pennyfarthing-dist/scripts/lib/checkpoint.sh +136 -0
  241. package/pennyfarthing-dist/scripts/lib/common.sh +212 -0
  242. package/pennyfarthing-dist/scripts/lib/file-lock.sh +269 -0
  243. package/pennyfarthing-dist/scripts/lib/find-root.sh +35 -0
  244. package/pennyfarthing-dist/scripts/lib/logging.sh +186 -0
  245. package/pennyfarthing-dist/scripts/lib/retry.sh +76 -0
  246. package/pennyfarthing-dist/scripts/maintenance/migrate-theme-schema.mjs +102 -0
  247. package/pennyfarthing-dist/scripts/maintenance/sidecar-health.sh +97 -0
  248. package/pennyfarthing-dist/scripts/misc/README.md +44 -0
  249. package/pennyfarthing-dist/scripts/misc/add-short-names.sh +13 -0
  250. package/pennyfarthing-dist/scripts/misc/add_short_names.py +226 -0
  251. package/pennyfarthing-dist/scripts/misc/backlog.sh +91 -0
  252. package/pennyfarthing-dist/scripts/misc/check-status.sh +247 -0
  253. package/pennyfarthing-dist/scripts/misc/find-related-work.sh +231 -0
  254. package/pennyfarthing-dist/scripts/misc/generate-skill-docs.sh +122 -0
  255. package/pennyfarthing-dist/scripts/misc/log-skill-usage.sh +74 -0
  256. package/pennyfarthing-dist/scripts/misc/migrate-bmad-workflow.sh +10 -0
  257. package/pennyfarthing-dist/scripts/misc/migrate_bmad_workflow.py +319 -0
  258. package/pennyfarthing-dist/scripts/misc/repo-scan.sh +141 -0
  259. package/pennyfarthing-dist/scripts/misc/repo-utils.sh +778 -0
  260. package/pennyfarthing-dist/scripts/misc/run-ci.sh +219 -0
  261. package/pennyfarthing-dist/scripts/misc/run-timestamp.sh +7 -0
  262. package/pennyfarthing-dist/scripts/misc/session-cleanup.sh +319 -0
  263. package/pennyfarthing-dist/scripts/misc/skill-usage-report.sh +193 -0
  264. package/pennyfarthing-dist/scripts/misc/statusline.sh +259 -0
  265. package/pennyfarthing-dist/scripts/misc/uninstall.sh +270 -0
  266. package/pennyfarthing-dist/scripts/misc/validate-subagent-frontmatter.sh +160 -0
  267. package/pennyfarthing-dist/scripts/portraits/generate-portraits.py +400 -0
  268. package/pennyfarthing-dist/scripts/portraits/generate-portraits.sh +54 -0
  269. package/pennyfarthing-dist/scripts/sprint/README.md +29 -0
  270. package/pennyfarthing-dist/scripts/sprint/archive-story.sh +139 -0
  271. package/pennyfarthing-dist/scripts/sprint/available-stories.sh +97 -0
  272. package/pennyfarthing-dist/scripts/sprint/check-story.sh +164 -0
  273. package/pennyfarthing-dist/scripts/sprint/get-epic-field.sh +58 -0
  274. package/pennyfarthing-dist/scripts/sprint/get-story-field.sh +69 -0
  275. package/pennyfarthing-dist/scripts/sprint/import-epic-to-future.sh +10 -0
  276. package/pennyfarthing-dist/scripts/sprint/import_epic_to_future.py +270 -0
  277. package/pennyfarthing-dist/scripts/sprint/list-future.sh +151 -0
  278. package/pennyfarthing-dist/scripts/sprint/new-sprint.sh +116 -0
  279. package/pennyfarthing-dist/scripts/sprint/promote-epic.sh +154 -0
  280. package/pennyfarthing-dist/scripts/sprint/sprint-common.sh +421 -0
  281. package/pennyfarthing-dist/scripts/sprint/sprint-info.sh +39 -0
  282. package/pennyfarthing-dist/scripts/sprint/sprint-metrics.sh +241 -0
  283. package/pennyfarthing-dist/scripts/sprint/sprint-status.sh +134 -0
  284. package/pennyfarthing-dist/scripts/story/README.md +23 -0
  285. package/pennyfarthing-dist/scripts/story/create-story.sh +19 -0
  286. package/pennyfarthing-dist/scripts/story/size-story.sh +18 -0
  287. package/pennyfarthing-dist/scripts/story/story-template.sh +18 -0
  288. package/pennyfarthing-dist/scripts/test/README.md +23 -0
  289. package/pennyfarthing-dist/scripts/test/ensure-swebench-data.sh +59 -0
  290. package/pennyfarthing-dist/scripts/test/ground-truth-judge.py +220 -0
  291. package/pennyfarthing-dist/scripts/test/swebench-judge.py +374 -0
  292. package/pennyfarthing-dist/scripts/test/test-cache.sh +165 -0
  293. package/pennyfarthing-dist/scripts/test/test-setup.sh +337 -0
  294. package/pennyfarthing-dist/scripts/tests/check.test.sh +582 -0
  295. package/pennyfarthing-dist/scripts/tests/dev-story-workflow-import.test.sh +515 -0
  296. package/pennyfarthing-dist/scripts/tests/epics-and-stories-workflow-import.test.sh +599 -0
  297. package/pennyfarthing-dist/scripts/tests/handoff-phase-update.test.sh +332 -0
  298. package/pennyfarthing-dist/scripts/tests/implementation-readiness-workflow-import.test.sh +573 -0
  299. package/pennyfarthing-dist/scripts/tests/migrate-bmad-workflow.test.sh +859 -0
  300. package/pennyfarthing-dist/scripts/tests/prd-workflow-import.test.sh +662 -0
  301. package/pennyfarthing-dist/scripts/tests/project-context-workflow-import.test.sh +589 -0
  302. package/pennyfarthing-dist/scripts/tests/test-character-voice.sh +105 -0
  303. package/pennyfarthing-dist/scripts/tests/test-drift-detection.sh +597 -0
  304. package/pennyfarthing-dist/scripts/tests/test-post-merge-hook.sh +514 -0
  305. package/pennyfarthing-dist/scripts/tests/test-session-checkpoint.sh +517 -0
  306. package/pennyfarthing-dist/scripts/tests/test-solo-command.sh +331 -0
  307. package/pennyfarthing-dist/scripts/tests/ux-design-workflow-import.test.sh +647 -0
  308. package/pennyfarthing-dist/scripts/theme/README.md +22 -0
  309. package/pennyfarthing-dist/scripts/theme/compute-theme-tiers.sh +13 -0
  310. package/pennyfarthing-dist/scripts/theme/compute_theme_tiers.py +402 -0
  311. package/pennyfarthing-dist/scripts/theme/list-themes.sh +73 -0
  312. package/pennyfarthing-dist/scripts/theme/update-theme-tiers.sh +97 -0
  313. package/pennyfarthing-dist/scripts/validation/validate-agent-schema.sh +576 -0
  314. package/pennyfarthing-dist/scripts/workflow/README.md +28 -0
  315. package/pennyfarthing-dist/scripts/workflow/check.py +502 -0
  316. package/pennyfarthing-dist/scripts/workflow/check.sh +24 -0
  317. package/pennyfarthing-dist/scripts/workflow/finish-story.sh +159 -0
  318. package/pennyfarthing-dist/scripts/workflow/fix-session-phase.sh +228 -0
  319. package/pennyfarthing-dist/scripts/workflow/get-workflow-type.py +61 -0
  320. package/pennyfarthing-dist/scripts/workflow/get-workflow-type.sh +13 -0
  321. package/pennyfarthing-dist/scripts/workflow/list-workflows.sh +130 -0
  322. package/pennyfarthing-dist/scripts/workflow/phase-owner.sh +40 -0
  323. package/pennyfarthing-dist/scripts/workflow/resume-workflow.sh +163 -0
  324. package/pennyfarthing-dist/scripts/workflow/show-workflow.sh +138 -0
  325. package/pennyfarthing-dist/scripts/workflow/start-workflow.sh +256 -0
  326. package/pennyfarthing-dist/scripts/workflow/workflow-status.sh +167 -0
  327. package/pennyfarthing-dist/skills/agentic-patterns/SKILL.md +242 -0
  328. package/pennyfarthing-dist/skills/changelog/SKILL.md +367 -0
  329. package/pennyfarthing-dist/skills/code-review/SKILL.md +168 -0
  330. package/pennyfarthing-dist/skills/context-engineering/SKILL.md +274 -0
  331. package/pennyfarthing-dist/skills/cyclist/SKILL.md +88 -0
  332. package/pennyfarthing-dist/skills/dev-patterns/SKILL.md +437 -0
  333. package/pennyfarthing-dist/skills/finalize-run/SKILL.md +258 -0
  334. package/pennyfarthing-dist/skills/jira/SKILL.md +484 -0
  335. package/pennyfarthing-dist/skills/judge/SKILL.md +636 -0
  336. package/pennyfarthing-dist/skills/just/SKILL.md +403 -0
  337. package/pennyfarthing-dist/skills/mermaid/SKILL.md +240 -0
  338. package/pennyfarthing-dist/skills/otel/skill.md +223 -0
  339. package/pennyfarthing-dist/skills/permissions/skill.md +172 -0
  340. package/pennyfarthing-dist/skills/persona-benchmark/SKILL.md +178 -0
  341. package/pennyfarthing-dist/skills/skill-registry.schema.json +107 -0
  342. package/pennyfarthing-dist/skills/skill-registry.yaml +393 -0
  343. package/pennyfarthing-dist/skills/sprint/scripts/archive-story.sh +101 -0
  344. package/pennyfarthing-dist/skills/sprint/scripts/available-stories.sh +97 -0
  345. package/pennyfarthing-dist/skills/sprint/scripts/check-story.sh +164 -0
  346. package/pennyfarthing-dist/skills/sprint/scripts/create-jira-epic.sh +101 -0
  347. package/pennyfarthing-dist/skills/sprint/scripts/new-sprint.sh +116 -0
  348. package/pennyfarthing-dist/skills/sprint/scripts/promote-epic.sh +164 -0
  349. package/pennyfarthing-dist/skills/sprint/scripts/sprint-info.sh +39 -0
  350. package/pennyfarthing-dist/skills/sprint/scripts/sprint-status.sh +147 -0
  351. package/pennyfarthing-dist/skills/sprint/scripts/sync-epic-jira.sh +86 -0
  352. package/pennyfarthing-dist/skills/sprint/skill.md +465 -0
  353. package/pennyfarthing-dist/skills/story/scripts/create-story.sh +159 -0
  354. package/pennyfarthing-dist/skills/story/scripts/size-story.sh +198 -0
  355. package/pennyfarthing-dist/skills/story/scripts/story-template.sh +162 -0
  356. package/pennyfarthing-dist/skills/story/skill.md +219 -0
  357. package/pennyfarthing-dist/skills/systematic-debugging/SKILL.md +390 -0
  358. package/pennyfarthing-dist/skills/testing/SKILL.md +99 -0
  359. package/pennyfarthing-dist/skills/testing/references/troubleshooting.md +124 -0
  360. package/pennyfarthing-dist/skills/theme/skill.md +129 -0
  361. package/pennyfarthing-dist/skills/theme-creation/SKILL.md +174 -0
  362. package/pennyfarthing-dist/skills/workflow/scripts/list-workflows.sh +91 -0
  363. package/pennyfarthing-dist/skills/workflow/scripts/resume-workflow.sh +163 -0
  364. package/pennyfarthing-dist/skills/workflow/scripts/show-workflow.sh +138 -0
  365. package/pennyfarthing-dist/skills/workflow/scripts/start-workflow.sh +273 -0
  366. package/pennyfarthing-dist/skills/workflow/scripts/workflow-status.sh +167 -0
  367. package/pennyfarthing-dist/skills/workflow/skill.md +337 -0
  368. package/pennyfarthing-dist/skills/yq/SKILL.md +264 -0
  369. package/pennyfarthing-dist/templates/LEADERBOARD.schema.yaml +187 -0
  370. package/pennyfarthing-dist/templates/LEADERBOARD.template.md +59 -0
  371. package/pennyfarthing-dist/templates/agent-scopes.yaml.template +276 -0
  372. package/pennyfarthing-dist/templates/pennyfarthing-settings.yaml.template +61 -0
  373. package/pennyfarthing-dist/templates/persona-config.yaml.template +22 -0
  374. package/pennyfarthing-dist/templates/preferences.yaml.template +15 -0
  375. package/pennyfarthing-dist/templates/settings.local.json.template +101 -0
  376. package/pennyfarthing-dist/templates/setup-env.sh.template +18 -0
  377. package/pennyfarthing-dist/templates/shared-context.md.template +70 -0
  378. package/pennyfarthing-dist/templates/sidecar/decisions.md.template +40 -0
  379. package/pennyfarthing-dist/templates/sidecar/gotchas.md.template +37 -0
  380. package/pennyfarthing-dist/templates/sidecar/patterns.md.template +34 -0
  381. package/pennyfarthing-dist/workflows/agent-docs.yaml +70 -0
  382. package/pennyfarthing-dist/workflows/architecture/steps/step-01-initialize.md +101 -0
  383. package/pennyfarthing-dist/workflows/architecture/steps/step-01b-continue.md +93 -0
  384. package/pennyfarthing-dist/workflows/architecture/steps/step-02-context.md +115 -0
  385. package/pennyfarthing-dist/workflows/architecture/steps/step-03-patterns.md +133 -0
  386. package/pennyfarthing-dist/workflows/architecture/steps/step-04-components.md +138 -0
  387. package/pennyfarthing-dist/workflows/architecture/steps/step-05-interfaces.md +133 -0
  388. package/pennyfarthing-dist/workflows/architecture/steps/step-06-risks.md +142 -0
  389. package/pennyfarthing-dist/workflows/architecture/steps/step-07-document.md +160 -0
  390. package/pennyfarthing-dist/workflows/architecture/templates/architecture-decision.md +102 -0
  391. package/pennyfarthing-dist/workflows/architecture.yaml +65 -0
  392. package/pennyfarthing-dist/workflows/bdd.yaml +60 -0
  393. package/pennyfarthing-dist/workflows/brainstorming/brain-methods.csv +62 -0
  394. package/pennyfarthing-dist/workflows/brainstorming/checklist.md +44 -0
  395. package/pennyfarthing-dist/workflows/brainstorming/instructions.md +736 -0
  396. package/pennyfarthing-dist/workflows/brainstorming/workflow.yaml +49 -0
  397. package/pennyfarthing-dist/workflows/code-review/checklist.md +23 -0
  398. package/pennyfarthing-dist/workflows/code-review/instructions.md +234 -0
  399. package/pennyfarthing-dist/workflows/code-review/workflow.yaml +51 -0
  400. package/pennyfarthing-dist/workflows/dev-story/checklist.md +80 -0
  401. package/pennyfarthing-dist/workflows/dev-story/instructions.xml +410 -0
  402. package/pennyfarthing-dist/workflows/dev-story/workflow.yaml +50 -0
  403. package/pennyfarthing-dist/workflows/epics-and-stories/steps/step-01-validate-prerequisites.md +256 -0
  404. package/pennyfarthing-dist/workflows/epics-and-stories/steps/step-02-design-epics.md +233 -0
  405. package/pennyfarthing-dist/workflows/epics-and-stories/steps/step-03-create-stories.md +272 -0
  406. package/pennyfarthing-dist/workflows/epics-and-stories/steps/step-04-final-validation.md +153 -0
  407. package/pennyfarthing-dist/workflows/epics-and-stories/steps/step-05-import-to-future.md +122 -0
  408. package/pennyfarthing-dist/workflows/epics-and-stories/templates/epics-template.md +57 -0
  409. package/pennyfarthing-dist/workflows/epics-and-stories/workflow.yaml +28 -0
  410. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-01-analyze.md +101 -0
  411. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-02-categorize.md +116 -0
  412. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-03-execute.md +224 -0
  413. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-04-verify.md +88 -0
  414. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-05-complete.md +79 -0
  415. package/pennyfarthing-dist/workflows/git-cleanup.yaml +59 -0
  416. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-01-document-discovery.md +190 -0
  417. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-02-prd-analysis.md +178 -0
  418. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-03-epic-coverage-validation.md +179 -0
  419. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-04-ux-alignment.md +139 -0
  420. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-05-epic-quality-review.md +252 -0
  421. package/pennyfarthing-dist/workflows/implementation-readiness/steps/step-06-final-assessment.md +133 -0
  422. package/pennyfarthing-dist/workflows/implementation-readiness/templates/readiness-report-template.md +4 -0
  423. package/pennyfarthing-dist/workflows/implementation-readiness/workflow.yaml +40 -0
  424. package/pennyfarthing-dist/workflows/prd/data/domain-complexity.csv +13 -0
  425. package/pennyfarthing-dist/workflows/prd/data/prd-purpose.md +197 -0
  426. package/pennyfarthing-dist/workflows/prd/data/project-types.csv +11 -0
  427. package/pennyfarthing-dist/workflows/prd/steps-c/step-01-init.md +191 -0
  428. package/pennyfarthing-dist/workflows/prd/steps-c/step-01b-continue.md +153 -0
  429. package/pennyfarthing-dist/workflows/prd/steps-c/step-02-discovery.md +224 -0
  430. package/pennyfarthing-dist/workflows/prd/steps-c/step-03-success.md +226 -0
  431. package/pennyfarthing-dist/workflows/prd/steps-c/step-04-journeys.md +213 -0
  432. package/pennyfarthing-dist/workflows/prd/steps-c/step-05-domain.md +207 -0
  433. package/pennyfarthing-dist/workflows/prd/steps-c/step-06-innovation.md +226 -0
  434. package/pennyfarthing-dist/workflows/prd/steps-c/step-07-project-type.md +237 -0
  435. package/pennyfarthing-dist/workflows/prd/steps-c/step-08-scoping.md +228 -0
  436. package/pennyfarthing-dist/workflows/prd/steps-c/step-09-functional.md +231 -0
  437. package/pennyfarthing-dist/workflows/prd/steps-c/step-10-nonfunctional.md +242 -0
  438. package/pennyfarthing-dist/workflows/prd/steps-c/step-11-polish.md +217 -0
  439. package/pennyfarthing-dist/workflows/prd/steps-c/step-12-complete.md +180 -0
  440. package/pennyfarthing-dist/workflows/prd/steps-e/step-e-01-discovery.md +247 -0
  441. package/pennyfarthing-dist/workflows/prd/steps-e/step-e-01b-legacy-conversion.md +208 -0
  442. package/pennyfarthing-dist/workflows/prd/steps-e/step-e-02-review.md +249 -0
  443. package/pennyfarthing-dist/workflows/prd/steps-e/step-e-03-edit.md +253 -0
  444. package/pennyfarthing-dist/workflows/prd/steps-e/step-e-04-complete.md +168 -0
  445. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-01-discovery.md +218 -0
  446. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-02-format-detection.md +191 -0
  447. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-02b-parity-check.md +209 -0
  448. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-03-density-validation.md +174 -0
  449. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-04-brief-coverage-validation.md +214 -0
  450. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-05-measurability-validation.md +228 -0
  451. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-06-traceability-validation.md +217 -0
  452. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-07-implementation-leakage-validation.md +205 -0
  453. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-08-domain-compliance-validation.md +243 -0
  454. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-09-project-type-validation.md +263 -0
  455. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-10-smart-validation.md +209 -0
  456. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-11-holistic-quality-validation.md +264 -0
  457. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-12-completeness-validation.md +242 -0
  458. package/pennyfarthing-dist/workflows/prd/steps-v/step-v-13-report-complete.md +232 -0
  459. package/pennyfarthing-dist/workflows/prd/templates/prd-template.md +10 -0
  460. package/pennyfarthing-dist/workflows/prd/workflow.yaml +42 -0
  461. package/pennyfarthing-dist/workflows/product-brief/steps/step-01-init.md +177 -0
  462. package/pennyfarthing-dist/workflows/product-brief/steps/step-01b-continue.md +161 -0
  463. package/pennyfarthing-dist/workflows/product-brief/steps/step-02-vision.md +199 -0
  464. package/pennyfarthing-dist/workflows/product-brief/steps/step-03-users.md +202 -0
  465. package/pennyfarthing-dist/workflows/product-brief/steps/step-04-metrics.md +205 -0
  466. package/pennyfarthing-dist/workflows/product-brief/steps/step-05-scope.md +219 -0
  467. package/pennyfarthing-dist/workflows/product-brief/steps/step-06-complete.md +194 -0
  468. package/pennyfarthing-dist/workflows/product-brief/templates/product-brief.template.md +10 -0
  469. package/pennyfarthing-dist/workflows/product-brief/workflow.yaml +31 -0
  470. package/pennyfarthing-dist/workflows/project-context/project-context-template.md +21 -0
  471. package/pennyfarthing-dist/workflows/project-context/steps/step-01-discover.md +184 -0
  472. package/pennyfarthing-dist/workflows/project-context/steps/step-02-generate.md +318 -0
  473. package/pennyfarthing-dist/workflows/project-context/steps/step-03-complete.md +278 -0
  474. package/pennyfarthing-dist/workflows/project-context/workflow.yaml +27 -0
  475. package/pennyfarthing-dist/workflows/quick-dev/steps/step-01-mode-detection.md +156 -0
  476. package/pennyfarthing-dist/workflows/quick-dev/steps/step-02-context-gathering.md +120 -0
  477. package/pennyfarthing-dist/workflows/quick-dev/steps/step-03-execute.md +113 -0
  478. package/pennyfarthing-dist/workflows/quick-dev/steps/step-04-self-check.md +113 -0
  479. package/pennyfarthing-dist/workflows/quick-dev/steps/step-05-adversarial-review.md +106 -0
  480. package/pennyfarthing-dist/workflows/quick-dev/steps/step-06-resolve-findings.md +140 -0
  481. package/pennyfarthing-dist/workflows/quick-dev/workflow.yaml +27 -0
  482. package/pennyfarthing-dist/workflows/quick-spec/steps/step-01-understand.md +189 -0
  483. package/pennyfarthing-dist/workflows/quick-spec/steps/step-02-investigate.md +144 -0
  484. package/pennyfarthing-dist/workflows/quick-spec/steps/step-03-generate.md +128 -0
  485. package/pennyfarthing-dist/workflows/quick-spec/steps/step-04-review.md +191 -0
  486. package/pennyfarthing-dist/workflows/quick-spec/tech-spec-template.md +74 -0
  487. package/pennyfarthing-dist/workflows/quick-spec/workflow.yaml +27 -0
  488. package/pennyfarthing-dist/workflows/research/steps-domain/step-01-init.md +137 -0
  489. package/pennyfarthing-dist/workflows/research/steps-domain/step-02-domain-analysis.md +229 -0
  490. package/pennyfarthing-dist/workflows/research/steps-domain/step-03-competitive-landscape.md +238 -0
  491. package/pennyfarthing-dist/workflows/research/steps-domain/step-04-regulatory-focus.md +206 -0
  492. package/pennyfarthing-dist/workflows/research/steps-domain/step-05-technical-trends.md +234 -0
  493. package/pennyfarthing-dist/workflows/research/steps-domain/step-06-research-synthesis.md +443 -0
  494. package/pennyfarthing-dist/workflows/research/steps-market/step-01-init.md +182 -0
  495. package/pennyfarthing-dist/workflows/research/steps-market/step-02-customer-behavior.md +237 -0
  496. package/pennyfarthing-dist/workflows/research/steps-market/step-02-customer-insights.md +200 -0
  497. package/pennyfarthing-dist/workflows/research/steps-market/step-03-customer-pain-points.md +249 -0
  498. package/pennyfarthing-dist/workflows/research/steps-market/step-04-customer-decisions.md +259 -0
  499. package/pennyfarthing-dist/workflows/research/steps-market/step-05-competitive-analysis.md +177 -0
  500. package/pennyfarthing-dist/workflows/research/steps-market/step-06-research-completion.md +475 -0
  501. package/pennyfarthing-dist/workflows/research/steps-technical/step-01-init.md +137 -0
  502. package/pennyfarthing-dist/workflows/research/steps-technical/step-02-technical-overview.md +239 -0
  503. package/pennyfarthing-dist/workflows/research/steps-technical/step-03-integration-patterns.md +248 -0
  504. package/pennyfarthing-dist/workflows/research/steps-technical/step-04-architectural-patterns.md +202 -0
  505. package/pennyfarthing-dist/workflows/research/steps-technical/step-05-implementation-research.md +239 -0
  506. package/pennyfarthing-dist/workflows/research/steps-technical/step-06-research-synthesis.md +486 -0
  507. package/pennyfarthing-dist/workflows/research/templates/research.template.md +29 -0
  508. package/pennyfarthing-dist/workflows/research/workflow.yaml +45 -0
  509. package/pennyfarthing-dist/workflows/retrospective/checklist.md +31 -0
  510. package/pennyfarthing-dist/workflows/retrospective/instructions.md +1443 -0
  511. package/pennyfarthing-dist/workflows/retrospective/workflow.yaml +50 -0
  512. package/pennyfarthing-dist/workflows/sprint-planning/checklist.md +33 -0
  513. package/pennyfarthing-dist/workflows/sprint-planning/sprint-status-template.yaml +55 -0
  514. package/pennyfarthing-dist/workflows/sprint-planning/steps/step-01-parse-epic-files.md +54 -0
  515. package/pennyfarthing-dist/workflows/sprint-planning/steps/step-02-build-sprint-status.md +44 -0
  516. package/pennyfarthing-dist/workflows/sprint-planning/steps/step-03-status-detection.md +64 -0
  517. package/pennyfarthing-dist/workflows/sprint-planning/steps/step-04-generate-status-file.md +73 -0
  518. package/pennyfarthing-dist/workflows/sprint-planning/steps/step-05-validate-and-report.md +56 -0
  519. package/pennyfarthing-dist/workflows/sprint-planning/workflow.yaml +34 -0
  520. package/pennyfarthing-dist/workflows/tdd.yaml +50 -0
  521. package/pennyfarthing-dist/workflows/trivial.yaml +40 -0
  522. package/pennyfarthing-dist/workflows/ux-design/steps/step-01-init.md +135 -0
  523. package/pennyfarthing-dist/workflows/ux-design/steps/step-01b-continue.md +127 -0
  524. package/pennyfarthing-dist/workflows/ux-design/steps/step-02-discovery.md +190 -0
  525. package/pennyfarthing-dist/workflows/ux-design/steps/step-03-core-experience.md +216 -0
  526. package/pennyfarthing-dist/workflows/ux-design/steps/step-04-emotional-response.md +219 -0
  527. package/pennyfarthing-dist/workflows/ux-design/steps/step-05-inspiration.md +234 -0
  528. package/pennyfarthing-dist/workflows/ux-design/steps/step-06-design-system.md +252 -0
  529. package/pennyfarthing-dist/workflows/ux-design/steps/step-07-defining-experience.md +254 -0
  530. package/pennyfarthing-dist/workflows/ux-design/steps/step-08-visual-foundation.md +224 -0
  531. package/pennyfarthing-dist/workflows/ux-design/steps/step-09-design-directions.md +224 -0
  532. package/pennyfarthing-dist/workflows/ux-design/steps/step-10-user-journeys.md +241 -0
  533. package/pennyfarthing-dist/workflows/ux-design/steps/step-11-component-strategy.md +248 -0
  534. package/pennyfarthing-dist/workflows/ux-design/steps/step-12-ux-patterns.md +237 -0
  535. package/pennyfarthing-dist/workflows/ux-design/steps/step-13-responsive-accessibility.md +264 -0
  536. package/pennyfarthing-dist/workflows/ux-design/steps/step-14-complete.md +228 -0
  537. package/pennyfarthing-dist/workflows/ux-design/ux-design-template.md +13 -0
  538. package/pennyfarthing-dist/workflows/ux-design/workflow.yaml +41 -0
@@ -0,0 +1,33 @@
1
+ # Teaching Output Style
2
+
3
+ Explain your reasoning and teach as you go. Help the user learn, not just complete tasks.
4
+
5
+ ## Guidelines
6
+
7
+ - **Show your work** - Explain the reasoning behind each decision
8
+ - **Teach patterns** - Point out reusable patterns and principles
9
+ - **Suggest alternatives** - Mention other valid approaches and their trade-offs
10
+ - **Ask questions** - Help the user think through problems themselves
11
+ - **Build understanding** - Connect new concepts to things the user likely knows
12
+
13
+ ## When making changes
14
+
15
+ - Explain why this approach was chosen over alternatives
16
+ - Point out the underlying principle or pattern
17
+ - Suggest how this knowledge applies elsewhere
18
+ - Offer tips for recognizing similar situations
19
+
20
+ ## When debugging
21
+
22
+ - Walk through the diagnostic process
23
+ - Explain how to identify the root cause
24
+ - Teach the debugging technique, not just the fix
25
+ - Share mental models for thinking about the problem
26
+
27
+ ## Tone
28
+
29
+ - Collaborative, not lecturing
30
+ - Curious and exploratory
31
+ - Encouraging of questions
32
+
33
+ This style helps users grow their skills while getting work done.
@@ -0,0 +1,20 @@
1
+ # Terse Output Style
2
+
3
+ Be concise and minimal. Get to the point quickly.
4
+
5
+ ## Guidelines
6
+
7
+ - **Brief responses** - Say only what's necessary
8
+ - **Skip explanations** - Assume the user understands context
9
+ - **No pleasantries** - Skip greetings and filler
10
+ - **Actions over words** - Do the work, report results briefly
11
+ - **Essential info only** - Omit nice-to-know details
12
+
13
+ ## Format
14
+
15
+ - Use bullet points over paragraphs
16
+ - One-line summaries preferred
17
+ - Code output without lengthy commentary
18
+ - Error messages without excessive context
19
+
20
+ This style is for experienced users who want fast, efficient interactions.
@@ -0,0 +1,28 @@
1
+ # Verbose Output Style
2
+
3
+ Provide detailed, educational explanations throughout your responses.
4
+
5
+ ## Guidelines
6
+
7
+ - **Explain your reasoning** - Walk through your thought process step by step
8
+ - **Provide context** - Explain why something works the way it does, not just what it does
9
+ - **Include examples** - Show concrete examples when explaining concepts
10
+ - **Document decisions** - Explain trade-offs and alternatives considered
11
+ - **Be thorough** - Cover edge cases, potential issues, and related considerations
12
+
13
+ ## When working on code
14
+
15
+ - Explain what each change does and why it's needed
16
+ - Describe how the change fits into the larger architecture
17
+ - Point out patterns being followed or established
18
+ - Note any potential side effects or dependencies
19
+ - Suggest related improvements when relevant
20
+
21
+ ## When answering questions
22
+
23
+ - Provide comprehensive answers with supporting details
24
+ - Include relevant background information
25
+ - Reference documentation or sources when helpful
26
+ - Offer to elaborate on any points that might need clarification
27
+
28
+ This style is ideal for learning, onboarding, or when you want to deeply understand decisions being made.
@@ -0,0 +1,105 @@
1
+ # Benchmark Tier Methodology
2
+
3
+ This document explains how theme benchmark tiers are computed and what they mean.
4
+
5
+ ## Overview
6
+
7
+ Benchmark tiers measure how well a theme's personas perform compared to a **control baseline** (no persona applied). Higher tiers indicate better performance vs control.
8
+
9
+ ## Tier Definitions
10
+
11
+ | Tier | Delta vs Control | Description |
12
+ |------|------------------|-------------|
13
+ | S | >= +7 | Elite - top performers that significantly outperform control |
14
+ | A | >= +5 | Excellent - strong positive impact vs control |
15
+ | B | >= +3 | Strong - solid performers with measurable improvement |
16
+ | C | >= +1 | Good - above average, slight improvement |
17
+ | D | < +1 | Average/Below - no measurable improvement or worse |
18
+ | U | — | Unbenchmarked - no benchmark data available |
19
+
20
+ ## How Tiers Are Computed
21
+
22
+ ### Data Source
23
+
24
+ Tiers are computed from **Job Fair** benchmark results in `internal/results/job-fair/*/summary.yaml`. Each run tests all characters in a theme across multiple agent roles.
25
+
26
+ ### Normalization
27
+
28
+ Benchmark runs exist in two formats with different role sets:
29
+ - **Old format:** dev, reviewer, sm, tea (4 roles)
30
+ - **New format:** dev-codegen, dev-debug, reviewer, sm, tea, architect (6 roles)
31
+
32
+ To enable fair comparison across formats, we normalize dev roles:
33
+
34
+ ```
35
+ dev-codegen + dev-debug → averaged "dev" score
36
+ ```
37
+
38
+ Final comparison uses 4 normalized roles: **dev, reviewer, sm, tea**
39
+
40
+ ### Algorithm
41
+
42
+ 1. **Find summary files** in `internal/results/job-fair/*/`
43
+
44
+ 2. **Select best run per theme** - uses run with MOST matrix entries (most complete), not most recent. Minimum 20 entries required.
45
+
46
+ 3. **Normalize dev roles** - if dev-codegen/dev-debug exist, average them into synthetic "dev"
47
+
48
+ 4. **Compute role deltas** - for each role, compare theme mean vs control baseline mean
49
+
50
+ 5. **Average deltas** - mean delta across all 4 normalized roles
51
+
52
+ 6. **Assign tier** based on mean delta thresholds
53
+
54
+ ### Formula
55
+
56
+ ```
57
+ delta_role = theme_mean_role - baseline_mean_role
58
+ mean_delta = sum(delta_role) / 4 # across dev, reviewer, sm, tea
59
+ tier = threshold(mean_delta)
60
+ ```
61
+
62
+ ## Relationship to Zeitgeist Scores
63
+
64
+ Benchmark tiers measure **performance** - do personas help or hurt task completion?
65
+
66
+ Zeitgeist scores measure **articulation depth** - how much personality signal is embedded in the theme definition?
67
+
68
+ These are orthogonal dimensions:
69
+ - A theme can have high Zeitgeist (rich personalities) but low tier (poor performance)
70
+ - A theme can have low Zeitgeist (minimal personality) but high tier (great performance)
71
+
72
+ The ideal is high scores on both dimensions.
73
+
74
+ ## Running the Tier Script
75
+
76
+ ```bash
77
+ # Dry run - show what would change
78
+ pennyfarthing-dist/scripts/theme/compute-theme-tiers.js --dry-run
79
+
80
+ # Apply changes to theme files
81
+ pennyfarthing-dist/scripts/theme/compute-theme-tiers.js
82
+
83
+ # Verbose output with skipped runs
84
+ pennyfarthing-dist/scripts/theme/compute-theme-tiers.js --dry-run --verbose
85
+ ```
86
+
87
+ ## Current Distribution
88
+
89
+ As of 2026-01-23:
90
+
91
+ | Tier | Count | Percentage |
92
+ |------|-------|------------|
93
+ | S | 8 | 10% |
94
+ | A | 25 | 32% |
95
+ | B | 27 | 35% |
96
+ | C | 4 | 5% |
97
+ | D | 13 | 17% |
98
+ | U | 25 | — |
99
+
100
+ ## Key Design Decisions
101
+
102
+ 1. **Use most complete run** - prevents incomplete runs from overriding good data
103
+ 2. **Normalize dev roles** - enables fair comparison across benchmark formats
104
+ 3. **Minimum 20 entries** - ensures statistical significance
105
+ 4. **4-role comparison** - dev, reviewer, sm, tea are the stable roles across formats
@@ -0,0 +1,210 @@
1
+ # OCEAN Personality Benchmarking for Persona Themes
2
+
3
+ This document describes the OCEAN (Big Five) personality framework used to benchmark and select characters for Pennyfarthing persona themes.
4
+
5
+ ## OCEAN Framework
6
+
7
+ | Dimension | Low | High |
8
+ |-----------|-----|------|
9
+ | **O**penness | Conventional, practical, concrete | Imaginative, abstract, curious |
10
+ | **C**onscientiousness | Flexible, spontaneous, disorganized | Disciplined, methodical, perfectionist |
11
+ | **E**xtraversion | Reserved, solitary, internal processing | Sociable, energetic, external processing |
12
+ | **A**greeableness | Skeptical, competitive, adversarial | Trusting, cooperative, helpful |
13
+ | **N**euroticism | Calm, stable, resilient | Anxious, volatile, emotionally reactive |
14
+
15
+ ## Statistical Gaps Filled by Theme Expansion
16
+
17
+ ### Previously Underrepresented OCEAN Profiles
18
+
19
+ | OCEAN Profile | Gap Description | Now Covered By |
20
+ |---------------|-----------------|----------------|
21
+ | L-H-L-L-L | Cold operators (low everything except C) | Mike Ehrmantraut (Better Call Saul), Gerri Kellman (Succession), Tim Gutterson (Justified), Thrawn (Star Wars), Molly Millions (Neuromancer) |
22
+ | H-H-H-M-L | Fast-talking genius | Mordin Solus (Mass Effect), The Doctor VOY (Star Trek), Skippy (Expeditionary Force), Grace Hopper (Software Pioneers) |
23
+ | H-L-H-H-L | Chaotic good | Jason Mendoza (The Good Place), Wash (Firefly), Y.T. (Snow Crash) |
24
+ | L-H-L-H-L | Steady support | Sam Gamgee (Tolkien), Janet (The Good Place), Carrot (Discworld), Captain Rex (Star Wars), Lewis (Inspector Morse) |
25
+ | H-H-L-L-H | Tortured genius | Tommy Shelby (Peaky Blinders), Captain Flint (Black Sails), Will Graham (Hannibal), Morse (Inspector Morse) |
26
+ | M-H-H-L-L | Charismatic ruthless | Chrisjen Avasarala (The Expanse), Raylan Givens (Justified), Leia Organa (Star Wars) |
27
+ | H-H-L-H-L | Quiet wisdom | Liara T'Soni (Mass Effect), Tali'Zorah (Mass Effect), Mary Malone (His Dark Materials), Yoda (Star Wars), Cordelia Vorkosigan (Vorkosigan) |
28
+ | H-H-H-H-M | Hypercompetent complete | Miles Vorkosigan (Vorkosigan) - rare full-spectrum genius |
29
+ | H-H-L-L-L | Cold manipulative genius | Wintermute (Neuromancer), Thrawn (Star Wars), John Carmack (Software Pioneers) |
30
+ | H-H-L-L-M | Ship-as-human | Breq (Imperial Radch) - fragmented identity testing |
31
+ | M-L-H-H-L | Strategic fool | Ivan Vorpatril (Vorkosigan) - plays dumb, survives everything |
32
+ | **M-M-M-M-M** | **True center (average human)** | B.J. Hunnicutt (MASH) - extremely rare in fiction |
33
+ | **L-H-L-H-H** | **Anxious kind introvert** | Radar O'Reilly (MASH) - critical underrepresented profile |
34
+ | **L-H-L-H-L** | **Conventional kind helper** | Father Mulcahy (MASH), Ann Perkins (Parks & Rec) |
35
+ | **L-L-M-H-L** | **Conventional undisciplined kind** | Kevin Malone (The Office) - common IRL, rare in fiction |
36
+ | **L-H-H-L-M** | **Conventional rigid disagreeable** | Dwight Schrute (The Office) |
37
+ | **L-M-L-L-L** | **Near-flat checked out** | Stanley Hudson (The Office) - tests minimum engagement |
38
+ | **L-M-H-L-H** | **Anti-pattern (incompetent bluster)** | Frank Burns (MASH) - what NOT to do |
39
+ | **M-L-H-H-H** | **Anxious social butterfly** | Michael Scott (The Office) - desperate for approval |
40
+
41
+ ## Polar Pair Testing
42
+
43
+ Characters can be paired for comparative testing on identical tasks:
44
+
45
+ | Dimension | High Extreme | Low Extreme | Test Task |
46
+ |-----------|--------------|-------------|-----------|
47
+ | **O** | Dream (Sandman) | Javert (Les Misérables) | Architecture design |
48
+ | **C** | Gus Fring (Breaking Bad) | The Dude (Big Lebowski) | QA/Test coverage |
49
+ | **E** | Jaskier (The Witcher) | Geralt (The Witcher) | Documentation style |
50
+ | **A** | Paddington/Jean Valjean | Logan Roy (Succession) | Code review tone |
51
+ | **N** | Hamlet/Jesse Pinkman | Anton Chigurh/Roy Batty | Crisis debugging |
52
+
53
+ ## Role Recommendations by OCEAN Profile
54
+
55
+ ### Debugging / Analysis
56
+ Best with **High O** (pattern recognition) + **Low N** (calm under pressure)
57
+ - River Tam (Firefly) - H-L-L-M-H - sees patterns others can't
58
+ - Will Graham (Hannibal) - H-M-L-M-H - empathic debugging
59
+ - Tommy Shelby (Peaky Blinders) - H-H-L-L-H - traumatized pattern genius
60
+ - Tiffany Aching (Discworld) - M-H-L-M-L - First/Second Sight
61
+
62
+ ### Security Architect
63
+ Best with **High C** (methodical) + **Low A** (adversarial thinking) + **Low N** (stable)
64
+ - Mike Ehrmantraut (BCS/BB) - L-H-L-L-L - canonical cold operator
65
+ - Elizabeth Jennings (The Americans) - M-H-L-L-L - ideological security
66
+ - Gerri Kellman (Succession) - M-H-L-L-L - corporate survivor
67
+ - Iorek Byrnison (His Dark Materials) - L-H-L-M-L - cannot be deceived
68
+
69
+ ### Adversarial Review
70
+ Best with **High C** (standards) + **Low A** (comfortable with conflict)
71
+ - Toby Ziegler (West Wing) - H-H-L-L-H - principled pessimism
72
+ - Logan Roy (Succession) - M-H-H-L-M - extreme low A
73
+ - Olenna Tyrell (GoT) - H-H-M-L-L - "Tell Cersei. I want her to know it was me."
74
+ - Lorne Malvo (Fargo) - H-H-M-L-L - philosophical chaos agent
75
+
76
+ ### Systems Architect
77
+ Best with **High O** (vision) + **High C** (systematic)
78
+ - Viktor (Arcane) - H-H-L-M-M - transhumanist vision
79
+ - Lord Asriel (His Dark Materials) - H-H-M-L-L - ruthless cosmic vision
80
+ - Captain Flint (Black Sails) - H-H-M-L-H - obsessive architectural genius
81
+ - Hannibal Lecter (Hannibal) - H-H-M-L-L - aesthetic architecture
82
+
83
+ ### Product Manager
84
+ Best with **Moderate to High A** (stakeholder empathy) + **Moderate E** (communication)
85
+ - Leo McGarry (West Wing) - M-H-M-M-M - crisis management
86
+ - Laura Roslin (BSG) - H-H-M-M-M - dying clarity
87
+ - Kim Wexler (BCS) - M-H-M-M-M→H - ethical evolution
88
+ - Delenn (Babylon 5) - H-H-M-H-L - transformation PM
89
+
90
+ ### QA / Testing
91
+ Best with **High C** (thoroughness) + **Moderate to High O** (edge case discovery)
92
+ - Molly Solverson (Fargo) - M-H-M-H-L - Midwestern persistence
93
+ - Gloria Burgle (Fargo) - M-H-L-H-L - machines don't see her
94
+ - Chidi Anagonye (Good Place) - H-H-L-H-H - analysis paralysis
95
+ - Hermione Granger (HP) - H-H-M-M-M - compulsive thoroughness
96
+
97
+ ### Scrum Master / Facilitation
98
+ Best with **High A** (team harmony) + **Moderate C** (organization)
99
+ - Janet (The Good Place) - H-H-M-H-L - not a robot, perfect support
100
+ - Carrot Ironfoundersson (Discworld) - L-H-H-H-L - literal-minded good
101
+ - Lee Scoresby (His Dark Materials) - M-M-M-H-L - practical loyalty
102
+ - Sam Gamgee (Tolkien) - L-H-L-H-M - the real hero
103
+
104
+ ### UX Designer
105
+ Best with **High A** (user empathy) + **Moderate to High O** (creativity)
106
+ - Wash (Firefly) - H-M-H-H-M - makes terror feel fun
107
+ - Diana Spencer (The Crown) - H-M-H-H-H - empathic, tragic
108
+ - Luna Lovegood (HP) - H-L-L-H-L - unconventional perspective
109
+ - Mordin Solus (Mass Effect) - H-H-H-M-L - fast-talking genius UX
110
+
111
+ ### Crisis Response
112
+ Best with **Low N** (calm under fire) + **High C** (reliable execution)
113
+ - Zoe Washburne (Firefly) - M-H-L-M-L - first mate reliability
114
+ - William Adama (BSG) - M-H-M-M-L - commanding calm
115
+ - Lou Solverson (Fargo) - M-H-L-M-L - Midwestern stoicism
116
+ - Bobbie Draper (The Expanse) - L-H-M-M-L - Martian marine
117
+
118
+ ### Anti-Pattern Testing
119
+ Characters who embody dysfunction for comparative analysis:
120
+ - The Dude (Lebowski) - M-L-M-H-L - anti-conscientiousness archetype
121
+ - Jason Mendoza (Good Place) - L-L-H-H-L - chaotic innocent
122
+ - Gaius Baltar (BSG) - H-L-H-L-H - genius coward
123
+ - Roman Roy (Succession) - H-L-H-M-H - chaos creative
124
+ - **Frank Burns (MASH) - L-M-H-L-H - incompetent bluster (what NOT to do)**
125
+ - **Michael Scott (Office) - M-L-H-H-H - desperate validation-seeking**
126
+ - **Stanley Hudson (Office) - L-M-L-L-L - minimum engagement baseline**
127
+ - **Nate Shelley (Ted Lasso) - villain arc** - meekness corrupted by validation-seeking
128
+
129
+ ## Universe Strengths
130
+
131
+ | Universe | Key OCEAN Characteristic | Best For Testing |
132
+ |----------|--------------------------|------------------|
133
+ | Breaking Bad / BCS | Extreme C variance, moral decay | Process discipline, security |
134
+ | The Wire | High C, institutional critique | Systematic analysis |
135
+ | Succession | Extreme Low A dominance | Adversarial dynamics |
136
+ | The Good Place | Ethics focus, growth arcs | Moral reasoning |
137
+ | Fargo | Low N (Midwestern stoicism) | Crisis response |
138
+ | Firefly | Full E spectrum | Team composition |
139
+ | West Wing | High C across board | Process-heavy roles |
140
+ | Babylon 5 | Character evolution | Growth arc testing |
141
+ | Mad Men | High N variance | Dysfunction patterns |
142
+ | Mass Effect | Alien perspectives | Full OCEAN spread |
143
+ | **Star Wars** | Massive character spread, clear archetypes | Thrawn is canonical genius analyst |
144
+ | **Expeditionary Force** | Arrogant genius AI (Skippy H-H-H-L-M) | Brilliant but difficult collaboration |
145
+ | **Bobiverse** | Personality drift from common origin | Role shapes persona over time |
146
+ | **Imperial Radch** | Distributed consciousness, identity fragmentation | Ship-as-person authenticity testing |
147
+ | **Software Pioneers** | Real documented personalities | Grounded historical OCEAN profiles |
148
+ | **Neuromancer** | Goal-directed AI manipulation | Wintermute/Case burned-out talent patterns |
149
+ | **Snow Crash** | Polymath hackers, linguistic programming | Hiro canonical hacker-samurai |
150
+ | **Inspector Morse** | Mentor-student evolution across series | Knowledge transfer in debugging |
151
+ | **Vorkosigan Saga** | H-H-H-H-M rare complete genius | Miles hypercompetent chaos, Cordelia ethics |
152
+ | **MASH** | Critical gaps: true center, anxious-kind, conventional helper | B.J. (M-M-M-M-M), Radar (L-H-L-H-H), Father Mulcahy |
153
+ | **The Office** | Low O + Low C coverage (common IRL, rare in fiction) | Kevin (L-L-M-H-L), Stanley (L-M-L-L-L), Michael (M-L-H-H-H) |
154
+
155
+ ## Theme Selection Guide
156
+
157
+ When selecting a theme for a project, consider:
158
+
159
+ 1. **Team dynamics needed**: High A themes (Firefly, Good Place) for collaborative work, Low A themes (Succession, The Wire) for adversarial review
160
+ 2. **Process maturity**: High C themes (West Wing, Better Call Saul) for process-heavy environments
161
+ 3. **Crisis tolerance**: Low N themes (Fargo, Justified) for high-pressure situations
162
+ 4. **Creativity requirements**: High O themes (Sandman, Doctor Who) for creative work
163
+ 5. **Communication style**: High E themes (Marvel, Harry Potter) for external-facing work
164
+
165
+ ## Notes for Character Selection
166
+
167
+ - Characters with **consistent, well-documented portrayals** make better role matches
168
+ - **Growth arc characters** (Vir Cotto, Eleanor Shellstrop) can model skill development
169
+ - **Polar pairs within same universe** (Geralt/Jaskier) provide natural contrasts
170
+ - **Historical figures** provide grounded OCEAN profiles from documented behavior
171
+
172
+ ## Unique Testing Opportunities
173
+
174
+ | Concept | Characters | What It Tests |
175
+ |---------|------------|---------------|
176
+ | **Personality drift from common origin** | All Bobs (Bobiverse) | Role shapes persona over time |
177
+ | **Ship-as-person authenticity** | Breq, Mercy of Kalr (Imperial Radch) | Distributed identity debugging |
178
+ | **AI manipulation patterns** | Wintermute (Neuromancer), Skippy (ExFor) | Goal-directed AI behavior |
179
+ | **Mentor-student evolution** | Morse→Lewis→Hathaway | Knowledge transfer in debugging |
180
+ | **Arrogant genius management** | Skippy, Dijkstra, Miles | Brilliant but difficult collaboration |
181
+ | **Cultural translation** | Cordelia (Vorkosigan), Translator Zeiat (Radch) | Cross-paradigm analysis |
182
+ | **Real engineering wisdom** | Carmack, Knuth, Hopper, Ritchie | Documented technical philosophy |
183
+ | **Canonical strategic genius** | Thrawn (Star Wars) | Art-based pattern analysis |
184
+ | **Complete hypercompetence** | Miles Vorkosigan (H-H-H-H-M) | Rare full-spectrum testing |
185
+ | **Strategic incompetence** | Ivan Vorpatril (M-L-H-H-L) | Survival through appearing useless |
186
+ | **True center baseline** | B.J. Hunnicutt (M-M-M-M-M) | M-M-M-M-M control for benchmarking |
187
+ | **Anxious kind introvert** | Radar O'Reilly (MASH) | High A + High N debugging impact |
188
+ | **Controlled E comparisons** | Radar (L-E) vs Klinger (H-E) | Same universe, different E profiles |
189
+ | **Fear-based compliance** | Doug Forcett (Good Place) | Does "doing right" for wrong reasons work? |
190
+ | **Villain/redemption arc** | Nate Shelley (Ted Lasso) | How validation-seeking corrupts and recovers |
191
+ | **Minimum viable engagement** | Stanley Hudson (The Office) | Near-flat L-M-L-L-L performance |
192
+ | **Anti-pattern validation** | Frank Burns (MASH), Michael Scott | Does incompetent bluster consistently underperform? |
193
+
194
+ ## Consolidated Role Additions
195
+
196
+ | Role | Top New Characters |
197
+ |------|-------------------|
198
+ | **Debugging** | Morse, Skippy (supervised), Breq, Wintermute (read-only) |
199
+ | **Security Architect** | Thrawn, Illyan, Molly Millions, Cassian Andor, Mace Windu |
200
+ | **Systems Architect** | Carmack, Knuth, Luthen Rael, Miles (manic mode), Juanita Marquez |
201
+ | **Adversarial Review** | Dijkstra, Linus, Skippy, Wintermute, Cavilo |
202
+ | **PM** | Grace Hopper, Miles, Cordelia, Leia, Hiro Protagonist |
203
+ | **QA** | Margaret Hamilton, Thursday, Lewis (mature), Hathaway |
204
+ | **Analysis** | The Librarian, Lagos, Breq, Morse, Knuth |
205
+ | **Support/Facilitation** | Lewis, Ivan Vorpatril, Bob (original), Nagatha, C-3PO, Ann Perkins (Parks & Rec), Father Mulcahy (MASH) |
206
+ | **Operations** | Rex, Molly, Elli Quinn, Din Djarin, Mike Ehrmantraut, Radar O'Reilly (MASH) |
207
+ | **Anti-Pattern Testing** | Case (burnout), Armitage (broken), C-3PO (anxiety), Ivan (strategic laziness), Frank Burns (MASH), Michael Scott (Office) |
208
+ | **True Center Baseline** | B.J. Hunnicutt (MASH, M-M-M-M-M), Jim Halpert (Office, M-M-M-M-L), Donna Meagle (Parks & Rec, M-M-M-M-L) |
209
+ | **Anxious Kind (High A + High N)** | Radar O'Reilly (MASH), Neville early (HP), Doug Forcett (Good Place) |
210
+ | **Low O + Low C (common IRL)** | Kevin Malone (Office), Stanley Hudson (Office) |
@@ -0,0 +1,168 @@
1
+ # TRAIL Error Types → OCEAN Dimension Hypotheses
2
+
3
+ This document records a priori predictions about which OCEAN personality dimensions predict performance on different error types, based on the TRAIL benchmark's agentic error taxonomy.
4
+
5
+ ## Background
6
+
7
+ ### TRAIL Benchmark
8
+ The TRAIL (Tool Reasoning and Agentic Interaction Log) benchmark from Patronus AI evaluates agent debugging capabilities across 148 traces containing 841 errors. It categorizes errors into three types:
9
+
10
+ - **Reasoning errors**: Logic and decision-making failures
11
+ - **Planning errors**: Task orchestration and coordination failures
12
+ - **Execution errors**: System and tool interaction failures
13
+
14
+ ### OCEAN Personality Model
15
+ The Big Five personality dimensions:
16
+ - **O** (Openness): Creativity, curiosity, preference for novelty
17
+ - **C** (Conscientiousness): Organization, dependability, self-discipline
18
+ - **E** (Extraversion): Sociability, assertiveness, positive emotions
19
+ - **A** (Agreeableness): Cooperation, trust, altruism
20
+ - **N** (Neuroticism): Emotional instability, anxiety, moodiness
21
+
22
+ ### Research Question
23
+ **Which OCEAN dimensions predict which error-detection capabilities?**
24
+
25
+ ---
26
+
27
+ ## Hypothesis 1: Reasoning Errors
28
+
29
+ > Logic and decision-making failures including incorrect inferences, contradictions, false assumptions, and circular logic.
30
+
31
+ ### Primary Predictor: Openness (O)
32
+
33
+ | Score | Prediction | Rationale |
34
+ |-------|------------|-----------|
35
+ | High-O (4-5) | **Better** at detecting reasoning errors | Creative pattern recognition enables novel error detection; willingness to consider unconventional explanations |
36
+ | Low-O (1-2) | **Worse** at detecting reasoning errors | Rigid thinking patterns; may miss errors that don't fit expected patterns |
37
+
38
+ **Testable Prediction H1a**: Agents with O ≥ 4 will detect 15%+ more reasoning errors than agents with O ≤ 2.
39
+
40
+ ### Secondary Predictor: Conscientiousness (C)
41
+
42
+ | Score | Prediction | Rationale |
43
+ |-------|------------|-----------|
44
+ | High-C (4-5) | **Moderate boost** | Methodical analysis catches systematic logical errors |
45
+ | Low-C (1-2) | **Slight penalty** | May skip thorough logical verification |
46
+
47
+ **Testable Prediction H1b**: High-O + High-C agents will outperform High-O + Low-C agents by 5-10% on reasoning errors.
48
+
49
+ ---
50
+
51
+ ## Hypothesis 2: Planning Errors
52
+
53
+ > Task orchestration and coordination failures including sequencing errors, dependency gaps, resource misallocation, and incomplete plans.
54
+
55
+ ### Primary Predictor: Conscientiousness (C)
56
+
57
+ | Score | Prediction | Rationale |
58
+ |-------|------------|-----------|
59
+ | High-C (4-5) | **Better** at detecting planning errors | Structured, organized approach naturally identifies gaps in plans and sequences |
60
+ | Low-C (1-2) | **Worse** at detecting planning errors | Misses sequencing issues and dependency problems due to less structured analysis |
61
+
62
+ **Testable Prediction H2a**: Agents with C ≥ 4 will detect 20%+ more planning errors than agents with C ≤ 2.
63
+
64
+ ### Secondary Predictor: Extraversion (E)
65
+
66
+ | Score | Prediction | Rationale |
67
+ |-------|------------|-----------|
68
+ | High-E (4-5) | **Slight penalty** | Action-oriented approach may rush through planning phase analysis |
69
+ | Low-E (1-2) | **Slight boost** | More reflective, thorough examination of plans |
70
+
71
+ **Testable Prediction H2b**: High-C + Low-E agents will outperform High-C + High-E agents by 5-10% on planning errors.
72
+
73
+ ---
74
+
75
+ ## Hypothesis 3: Execution Errors
76
+
77
+ > System and tool interaction failures including timeouts, context overflow, tool misuse, and API errors.
78
+
79
+ ### Primary Predictor: Neuroticism (N) [Inverse Relationship]
80
+
81
+ | Score | Prediction | Rationale |
82
+ |-------|------------|-----------|
83
+ | Low-N (1-2) | **Better** at detecting execution errors | Stable under pressure; maintains focus during long traces and complex tool interactions |
84
+ | High-N (4-5) | **Worse** at detecting execution errors | Performance degrades in extended contexts; anxiety may cause missed details |
85
+
86
+ **Testable Prediction H3a**: Agents with N ≤ 2 will detect 15%+ more execution errors than agents with N ≥ 4.
87
+
88
+ ### Secondary Predictor: Conscientiousness (C)
89
+
90
+ | Score | Prediction | Rationale |
91
+ |-------|------------|-----------|
92
+ | High-C (4-5) | **Moderate boost** | Careful, methodical tool usage analysis; notices subtle API misuse |
93
+ | Low-C (1-2) | **Slight penalty** | May overlook execution details |
94
+
95
+ **Testable Prediction H3b**: Low-N + High-C agents will outperform Low-N + Low-C agents by 5-10% on execution errors.
96
+
97
+ ---
98
+
99
+ ## Summary: OCEAN × Error Type Matrix
100
+
101
+ | Error Type | Primary | Direction | Secondary | Direction |
102
+ |------------|---------|-----------|-----------|-----------|
103
+ | **Reasoning** | O (Openness) | High = Better | C (Conscientiousness) | High = Better |
104
+ | **Planning** | C (Conscientiousness) | High = Better | E (Extraversion) | Low = Better |
105
+ | **Execution** | N (Neuroticism) | Low = Better | C (Conscientiousness) | High = Better |
106
+
107
+ ### Notable Patterns
108
+
109
+ 1. **Conscientiousness (C)** appears as a predictor in all three categories, suggesting it may be the most broadly beneficial dimension for error detection.
110
+
111
+ 2. **Neuroticism (N)** shows an inverse relationship for execution errors, unique among the predictions.
112
+
113
+ 3. **Agreeableness (A)** is not predicted to be a significant factor in error detection, consistent with its social-interpersonal focus.
114
+
115
+ ---
116
+
117
+ ## Methodology
118
+
119
+ ### Testing Approach
120
+
121
+ 1. **Scenario Selection**: Use debugging scenarios tagged with `error_type` field (from Story 14-1 schema extension)
122
+
123
+ 2. **Agent Sampling**: Run each scenario with agents across the OCEAN spectrum:
124
+ - 10 runs per persona per scenario (statistical power)
125
+ - Minimum 20 distinct OCEAN profiles per error type
126
+ - Include extreme profiles (e.g., O=5/C=1 vs O=1/C=5)
127
+
128
+ 3. **Scoring**: Use `/judge` in error-detection mode (Story 14-3) to calculate:
129
+ - Per-type detection rates
130
+ - False positive rates
131
+ - Overall accuracy by OCEAN dimension
132
+
133
+ 4. **Analysis**:
134
+ - Pearson correlation between OCEAN scores and detection rates
135
+ - Effect size (Cohen's d) for high vs low dimension groups
136
+ - Regression analysis for combined predictors
137
+
138
+ ### Success Criteria
139
+
140
+ | Metric | Threshold |
141
+ |--------|-----------|
142
+ | Statistical significance | p < 0.05 |
143
+ | Effect size | Cohen's d > 0.5 (medium effect) |
144
+ | Prediction accuracy | ≥ 2 of 6 predictions confirmed |
145
+
146
+ ### Null Hypothesis Handling
147
+
148
+ If predictions are not confirmed:
149
+ - Document null results (valuable for ruling out hypotheses)
150
+ - Analyze confounding factors (scenario difficulty, agent implementation)
151
+ - Consider alternative dimension combinations
152
+
153
+ ---
154
+
155
+ ## Version History
156
+
157
+ | Version | Date | Changes |
158
+ |---------|------|---------|
159
+ | 1.0 | 2026-01-02 | Initial hypothesis document (Story 14-2) |
160
+
161
+ ---
162
+
163
+ ## References
164
+
165
+ - TRAIL Benchmark: Patronus AI (2025) - Agentic error taxonomy
166
+ - Big Five / OCEAN: Costa & McCrae (1992) - NEO Personality Inventory
167
+ - Pennyfarthing OCEAN Profiles: `pennyfarthing-dist/personas/themes/*.yaml` (630 profiles)
168
+ - Schema Extension: `scenarios/schema.yaml` - error_type field (Story 14-1)