@pennyfarthing/core 11.0.0 → 11.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (401) hide show
  1. package/README.md +81 -23
  2. package/package.json +1 -1
  3. package/packages/core/dist/cli/utils/010-detect-remove-old-packages.test.d.ts +20 -0
  4. package/packages/core/dist/cli/utils/010-detect-remove-old-packages.test.d.ts.map +1 -0
  5. package/packages/core/dist/cli/utils/010-detect-remove-old-packages.test.js +278 -0
  6. package/packages/core/dist/cli/utils/010-detect-remove-old-packages.test.js.map +1 -0
  7. package/packages/core/dist/cli/utils/constants.d.ts +8 -2
  8. package/packages/core/dist/cli/utils/constants.d.ts.map +1 -1
  9. package/packages/core/dist/cli/utils/constants.js +4 -1
  10. package/packages/core/dist/cli/utils/constants.js.map +1 -1
  11. package/packages/core/dist/cli/utils/constants.test.d.ts +10 -0
  12. package/packages/core/dist/cli/utils/constants.test.d.ts.map +1 -0
  13. package/packages/core/dist/cli/utils/constants.test.js +38 -0
  14. package/packages/core/dist/cli/utils/constants.test.js.map +1 -0
  15. package/packages/core/dist/consultation/consultation-protocol.d.ts +139 -0
  16. package/packages/core/dist/consultation/consultation-protocol.d.ts.map +1 -0
  17. package/packages/core/dist/consultation/consultation-protocol.js +178 -0
  18. package/packages/core/dist/consultation/consultation-protocol.js.map +1 -0
  19. package/packages/core/dist/consultation/consultation-protocol.test.d.ts +20 -0
  20. package/packages/core/dist/consultation/consultation-protocol.test.d.ts.map +1 -0
  21. package/packages/core/dist/consultation/consultation-protocol.test.js +474 -0
  22. package/packages/core/dist/consultation/consultation-protocol.test.js.map +1 -0
  23. package/packages/core/dist/consultation/dialogue-manager.d.ts +75 -0
  24. package/packages/core/dist/consultation/dialogue-manager.d.ts.map +1 -0
  25. package/packages/core/dist/consultation/dialogue-manager.js +334 -0
  26. package/packages/core/dist/consultation/dialogue-manager.js.map +1 -0
  27. package/packages/core/dist/consultation/dialogue-manager.test.d.ts +19 -0
  28. package/packages/core/dist/consultation/dialogue-manager.test.d.ts.map +1 -0
  29. package/packages/core/dist/consultation/dialogue-manager.test.js +444 -0
  30. package/packages/core/dist/consultation/dialogue-manager.test.js.map +1 -0
  31. package/packages/core/dist/public/js/react/react.js +3 -3
  32. package/packages/core/dist/scripts/theme-detail.test.d.ts +10 -0
  33. package/packages/core/dist/scripts/theme-detail.test.js +199 -0
  34. package/packages/core/dist/server/api/git.d.ts +13 -1
  35. package/packages/core/dist/server/api/git.d.ts.map +1 -1
  36. package/packages/core/dist/server/api/git.js +53 -34
  37. package/packages/core/dist/server/api/git.js.map +1 -1
  38. package/packages/core/dist/server/api/health-score.d.ts.map +1 -1
  39. package/packages/core/dist/server/api/health-score.js +25 -1
  40. package/packages/core/dist/server/api/health-score.js.map +1 -1
  41. package/packages/core/dist/server/api/settings.d.ts.map +1 -1
  42. package/packages/core/dist/server/api/settings.js +63 -1
  43. package/packages/core/dist/server/api/settings.js.map +1 -1
  44. package/packages/core/dist/server/api/theme-agents.d.ts.map +1 -1
  45. package/packages/core/dist/server/api/theme-agents.js +61 -0
  46. package/packages/core/dist/server/api/theme-agents.js.map +1 -1
  47. package/packages/core/dist/server/server.d.ts.map +1 -1
  48. package/packages/core/dist/server/server.js +17 -12
  49. package/packages/core/dist/server/server.js.map +1 -1
  50. package/packages/core/dist/shared/skill-search.test.js +2 -2
  51. package/packages/core/dist/workflow/gate-file-validation.d.ts +49 -0
  52. package/packages/core/dist/workflow/gate-file-validation.d.ts.map +1 -0
  53. package/packages/core/dist/workflow/gate-file-validation.js +157 -0
  54. package/packages/core/dist/workflow/gate-file-validation.js.map +1 -0
  55. package/packages/core/dist/workflow/gate-file-validation.test.d.ts +19 -0
  56. package/packages/core/dist/workflow/gate-file-validation.test.d.ts.map +1 -0
  57. package/packages/core/dist/workflow/gate-file-validation.test.js +536 -0
  58. package/packages/core/dist/workflow/gate-file-validation.test.js.map +1 -0
  59. package/packages/core/dist/workflow/gate-schema-validation.test.d.ts +14 -0
  60. package/packages/core/dist/workflow/gate-schema-validation.test.d.ts.map +1 -0
  61. package/packages/core/dist/workflow/gate-schema-validation.test.js +339 -0
  62. package/packages/core/dist/workflow/gate-schema-validation.test.js.map +1 -0
  63. package/packages/core/dist/workflow/handoff.js +2 -2
  64. package/packages/core/dist/workflow/handoff.js.map +1 -1
  65. package/packages/core/dist/workflow/handoff.test.js +16 -0
  66. package/packages/core/dist/workflow/handoff.test.js.map +1 -1
  67. package/packages/core/dist/workflow/workflow-schema.d.ts +4 -2
  68. package/packages/core/dist/workflow/workflow-schema.d.ts.map +1 -1
  69. package/packages/core/dist/workflow/workflow-schema.js +43 -8
  70. package/packages/core/dist/workflow/workflow-schema.js.map +1 -1
  71. package/pennyfarthing-dist/agents/README.md +6 -14
  72. package/pennyfarthing-dist/agents/architect.md +43 -29
  73. package/pennyfarthing-dist/agents/ba.md +30 -29
  74. package/pennyfarthing-dist/agents/dev.md +32 -43
  75. package/pennyfarthing-dist/agents/devops.md +57 -21
  76. package/pennyfarthing-dist/agents/orchestrator.md +3 -10
  77. package/pennyfarthing-dist/agents/pm.md +45 -31
  78. package/pennyfarthing-dist/agents/reviewer.md +20 -66
  79. package/pennyfarthing-dist/agents/sm-setup.md +2 -2
  80. package/pennyfarthing-dist/agents/sm.md +8 -30
  81. package/pennyfarthing-dist/agents/tea.md +25 -41
  82. package/pennyfarthing-dist/agents/tech-writer.md +33 -90
  83. package/pennyfarthing-dist/agents/ux-designer.md +39 -39
  84. package/pennyfarthing-dist/commands/benchmark-control.md +8 -64
  85. package/pennyfarthing-dist/commands/benchmark.md +8 -480
  86. package/pennyfarthing-dist/commands/job-fair.md +8 -97
  87. package/pennyfarthing-dist/commands/pf-benchmark-control.md +70 -0
  88. package/pennyfarthing-dist/commands/pf-benchmark.md +486 -0
  89. package/pennyfarthing-dist/commands/pf-chore.md +4 -4
  90. package/pennyfarthing-dist/commands/pf-ci.md +40 -0
  91. package/pennyfarthing-dist/commands/pf-close-epic.md +9 -27
  92. package/pennyfarthing-dist/commands/pf-continue-session.md +9 -213
  93. package/pennyfarthing-dist/commands/pf-create-branches-from-story.md +11 -353
  94. package/pennyfarthing-dist/commands/pf-docs.md +28 -0
  95. package/pennyfarthing-dist/commands/pf-epic.md +67 -0
  96. package/pennyfarthing-dist/commands/pf-git-cleanup.md +11 -52
  97. package/pennyfarthing-dist/commands/pf-git.md +75 -0
  98. package/pennyfarthing-dist/commands/pf-help.md +110 -128
  99. package/pennyfarthing-dist/commands/pf-job-fair.md +102 -0
  100. package/pennyfarthing-dist/commands/pf-new-work.md +9 -18
  101. package/pennyfarthing-dist/commands/pf-parallel-work.md +6 -66
  102. package/pennyfarthing-dist/commands/pf-release.md +11 -76
  103. package/pennyfarthing-dist/commands/pf-repo-status.md +11 -44
  104. package/pennyfarthing-dist/commands/pf-run-ci.md +8 -111
  105. package/pennyfarthing-dist/commands/pf-session.md +51 -0
  106. package/pennyfarthing-dist/commands/pf-solo.md +447 -0
  107. package/pennyfarthing-dist/commands/pf-sprint-planning.md +8 -104
  108. package/pennyfarthing-dist/commands/pf-standalone.md +1 -1
  109. package/pennyfarthing-dist/commands/pf-start-epic.md +9 -163
  110. package/pennyfarthing-dist/commands/pf-sync-epic-to-jira.md +8 -179
  111. package/pennyfarthing-dist/commands/pf-sync-work-with-sprint.md +8 -368
  112. package/pennyfarthing-dist/commands/pf-update-domain-docs.md +8 -78
  113. package/pennyfarthing-dist/commands/solo.md +8 -442
  114. package/pennyfarthing-dist/guides/agent-behavior.md +13 -13
  115. package/pennyfarthing-dist/guides/agent-coordination.md +7 -7
  116. package/pennyfarthing-dist/guides/agent-tag-taxonomy.md +6 -5
  117. package/pennyfarthing-dist/guides/bikerack.md +128 -0
  118. package/pennyfarthing-dist/guides/brownfield-tools.md +133 -0
  119. package/pennyfarthing-dist/guides/command-tag-taxonomy.md +2 -2
  120. package/pennyfarthing-dist/guides/gate-schema.md +227 -0
  121. package/pennyfarthing-dist/guides/gates.md +120 -0
  122. package/pennyfarthing-dist/guides/handoff-cli.md +116 -0
  123. package/pennyfarthing-dist/guides/hooks.md +86 -4
  124. package/pennyfarthing-dist/guides/output-styles.md +65 -0
  125. package/pennyfarthing-dist/guides/patterns/approval-gates-pattern.md +5 -5
  126. package/pennyfarthing-dist/guides/patterns/tdd-flow-pattern.md +4 -4
  127. package/pennyfarthing-dist/guides/prompt-patterns.md +5 -5
  128. package/pennyfarthing-dist/guides/reflector.md +4 -4
  129. package/pennyfarthing-dist/guides/session-artifacts.md +1 -1
  130. package/pennyfarthing-dist/guides/skill-schema.md +1 -1
  131. package/pennyfarthing-dist/guides/tandem-protocol.md +13 -1
  132. package/pennyfarthing-dist/guides/worktree-mode.md +3 -3
  133. package/pennyfarthing-dist/guides/xml-tags.md +5 -4
  134. package/pennyfarthing-dist/personas/themes/hogans-heroes.yaml +11 -22
  135. package/pennyfarthing-dist/personas/themes/stephen-king.yaml +13 -24
  136. package/pennyfarthing-dist/scripts/core/dialogue-manager.sh +322 -0
  137. package/pennyfarthing-dist/scripts/core/phase-check-start.sh +1 -1
  138. package/pennyfarthing-dist/scripts/hooks/otel-auto-config.sh +19 -14
  139. package/pennyfarthing-dist/scripts/portraits/generate-portraits.py +191 -57
  140. package/pennyfarthing-dist/scripts/portraits/generate-portraits.sh +26 -10
  141. package/pennyfarthing-dist/skills/pf-changelog/SKILL.md +4 -4
  142. package/pennyfarthing-dist/skills/pf-sprint/skill.md +1 -1
  143. package/pennyfarthing-dist/skills/skill-registry.schema.json +4 -0
  144. package/pennyfarthing-dist/skills/skill-registry.yaml +5 -0
  145. package/pennyfarthing-dist/workflows/2party-tdd.yaml +11 -0
  146. package/pennyfarthing-dist/workflows/agent-docs.yaml +2 -0
  147. package/pennyfarthing-dist/workflows/bdd-tandem.yaml +4 -0
  148. package/pennyfarthing-dist/workflows/bdd.yaml +4 -0
  149. package/pennyfarthing-dist/workflows/git-cleanup/steps/step-05-complete.md +1 -1
  150. package/pennyfarthing-dist/workflows/tdd-tandem.yaml +3 -0
  151. package/pennyfarthing-dist/workflows/tdd.yaml +3 -0
  152. package/pennyfarthing-dist/workflows/trivial.yaml +2 -0
  153. package/pennyfarthing_scripts/__pycache__/__init__.cpython-314.pyc +0 -0
  154. package/pennyfarthing_scripts/__pycache__/bellmode_hook.cpython-314.pyc +0 -0
  155. package/pennyfarthing_scripts/__pycache__/cli.cpython-314.pyc +0 -0
  156. package/pennyfarthing_scripts/__pycache__/config.cpython-314.pyc +0 -0
  157. package/pennyfarthing_scripts/__pycache__/context.cpython-314.pyc +0 -0
  158. package/pennyfarthing_scripts/__pycache__/hooks.cpython-314.pyc +0 -0
  159. package/pennyfarthing_scripts/__pycache__/jira_bidirectional_sync.cpython-314.pyc +0 -0
  160. package/pennyfarthing_scripts/__pycache__/jira_epic_creation.cpython-314.pyc +0 -0
  161. package/pennyfarthing_scripts/__pycache__/jira_sync.cpython-314.pyc +0 -0
  162. package/pennyfarthing_scripts/__pycache__/jira_sync_story.cpython-314.pyc +0 -0
  163. package/pennyfarthing_scripts/__pycache__/output.cpython-314.pyc +0 -0
  164. package/pennyfarthing_scripts/__pycache__/patch_mode.cpython-314.pyc +0 -0
  165. package/pennyfarthing_scripts/__pycache__/pretooluse_hook.cpython-314.pyc +0 -0
  166. package/pennyfarthing_scripts/__pycache__/schema_validation_hook.cpython-314.pyc +0 -0
  167. package/pennyfarthing_scripts/__pycache__/session_start_hook.cpython-314.pyc +0 -0
  168. package/pennyfarthing_scripts/__pycache__/workflow.cpython-314.pyc +0 -0
  169. package/pennyfarthing_scripts/bc/__pycache__/__init__.cpython-314.pyc +0 -0
  170. package/pennyfarthing_scripts/bc/__pycache__/cli.cpython-314.pyc +0 -0
  171. package/pennyfarthing_scripts/bc/__pycache__/focus.cpython-314.pyc +0 -0
  172. package/pennyfarthing_scripts/bikerack/__pycache__/__init__.cpython-314.pyc +0 -0
  173. package/pennyfarthing_scripts/bikerack/__pycache__/__main__.cpython-314.pyc +0 -0
  174. package/pennyfarthing_scripts/bikerack/__pycache__/background_panel.cpython-314.pyc +0 -0
  175. package/pennyfarthing_scripts/bikerack/__pycache__/base_panel.cpython-314.pyc +0 -0
  176. package/pennyfarthing_scripts/bikerack/__pycache__/changed_panel.cpython-314.pyc +0 -0
  177. package/pennyfarthing_scripts/bikerack/__pycache__/cli.cpython-314.pyc +0 -0
  178. package/pennyfarthing_scripts/bikerack/__pycache__/debug_panel.cpython-314.pyc +0 -0
  179. package/pennyfarthing_scripts/bikerack/__pycache__/diffs_panel.cpython-314.pyc +0 -0
  180. package/pennyfarthing_scripts/bikerack/__pycache__/git_panel.cpython-314.pyc +0 -0
  181. package/pennyfarthing_scripts/bikerack/__pycache__/launcher.cpython-314.pyc +0 -0
  182. package/pennyfarthing_scripts/bikerack/__pycache__/sprint_panel.cpython-314.pyc +0 -0
  183. package/pennyfarthing_scripts/bikerack/__pycache__/tui.cpython-314.pyc +0 -0
  184. package/pennyfarthing_scripts/bikerack/__pycache__/ws_client.cpython-314.pyc +0 -0
  185. package/pennyfarthing_scripts/bikerack/cli.py +10 -11
  186. package/pennyfarthing_scripts/bikerack/debug_panel.py +218 -0
  187. package/pennyfarthing_scripts/bikerack/diffs_panel.py +203 -27
  188. package/pennyfarthing_scripts/brownfield/__pycache__/__init__.cpython-314.pyc +0 -0
  189. package/pennyfarthing_scripts/brownfield/__pycache__/__main__.cpython-314.pyc +0 -0
  190. package/pennyfarthing_scripts/brownfield/__pycache__/cli.cpython-314.pyc +0 -0
  191. package/pennyfarthing_scripts/brownfield/__pycache__/discover.cpython-314.pyc +0 -0
  192. package/pennyfarthing_scripts/cli.py +114 -0
  193. package/pennyfarthing_scripts/codemarkers/__pycache__/__init__.cpython-314.pyc +0 -0
  194. package/pennyfarthing_scripts/codemarkers/__pycache__/__main__.cpython-314.pyc +0 -0
  195. package/pennyfarthing_scripts/codemarkers/__pycache__/analyze.cpython-314.pyc +0 -0
  196. package/pennyfarthing_scripts/codemarkers/__pycache__/cli.cpython-314.pyc +0 -0
  197. package/pennyfarthing_scripts/codemarkers/__pycache__/formatters.cpython-314.pyc +0 -0
  198. package/pennyfarthing_scripts/codemarkers/__pycache__/models.cpython-314.pyc +0 -0
  199. package/pennyfarthing_scripts/common/__pycache__/__init__.cpython-314.pyc +0 -0
  200. package/pennyfarthing_scripts/common/__pycache__/config.cpython-314.pyc +0 -0
  201. package/pennyfarthing_scripts/common/__pycache__/output.cpython-314.pyc +0 -0
  202. package/pennyfarthing_scripts/common/__pycache__/themes.cpython-314.pyc +0 -0
  203. package/pennyfarthing_scripts/complexity/__pycache__/__init__.cpython-314.pyc +0 -0
  204. package/pennyfarthing_scripts/complexity/__pycache__/__main__.cpython-314.pyc +0 -0
  205. package/pennyfarthing_scripts/complexity/__pycache__/analyze.cpython-314.pyc +0 -0
  206. package/pennyfarthing_scripts/complexity/__pycache__/cli.cpython-314.pyc +0 -0
  207. package/pennyfarthing_scripts/complexity/__pycache__/formatters.cpython-314.pyc +0 -0
  208. package/pennyfarthing_scripts/complexity/__pycache__/models.cpython-314.pyc +0 -0
  209. package/pennyfarthing_scripts/deadcode/__pycache__/__init__.cpython-314.pyc +0 -0
  210. package/pennyfarthing_scripts/deadcode/__pycache__/__main__.cpython-314.pyc +0 -0
  211. package/pennyfarthing_scripts/deadcode/__pycache__/analyze.cpython-314.pyc +0 -0
  212. package/pennyfarthing_scripts/deadcode/__pycache__/cli.cpython-314.pyc +0 -0
  213. package/pennyfarthing_scripts/deadcode/__pycache__/formatters.cpython-314.pyc +0 -0
  214. package/pennyfarthing_scripts/deadcode/__pycache__/models.cpython-314.pyc +0 -0
  215. package/pennyfarthing_scripts/dependencies/__pycache__/__init__.cpython-314.pyc +0 -0
  216. package/pennyfarthing_scripts/dependencies/__pycache__/__main__.cpython-314.pyc +0 -0
  217. package/pennyfarthing_scripts/dependencies/__pycache__/analyze.cpython-314.pyc +0 -0
  218. package/pennyfarthing_scripts/dependencies/__pycache__/cli.cpython-314.pyc +0 -0
  219. package/pennyfarthing_scripts/dependencies/__pycache__/formatters.cpython-314.pyc +0 -0
  220. package/pennyfarthing_scripts/dependencies/__pycache__/models.cpython-314.pyc +0 -0
  221. package/pennyfarthing_scripts/epic/__init__.py +0 -0
  222. package/pennyfarthing_scripts/epic/cli.py +64 -0
  223. package/pennyfarthing_scripts/gate/__init__.py +1 -0
  224. package/pennyfarthing_scripts/gate/__pycache__/__init__.cpython-314.pyc +0 -0
  225. package/pennyfarthing_scripts/gate/__pycache__/cli.cpython-314.pyc +0 -0
  226. package/pennyfarthing_scripts/gate/__pycache__/validate.cpython-314.pyc +0 -0
  227. package/pennyfarthing_scripts/gate/cli.py +56 -0
  228. package/pennyfarthing_scripts/gate/validate.py +266 -0
  229. package/pennyfarthing_scripts/git/__pycache__/__init__.cpython-314.pyc +0 -0
  230. package/pennyfarthing_scripts/git/__pycache__/create_branches.cpython-314.pyc +0 -0
  231. package/pennyfarthing_scripts/git/__pycache__/status_all.cpython-314.pyc +0 -0
  232. package/pennyfarthing_scripts/git_group/__init__.py +0 -0
  233. package/pennyfarthing_scripts/git_group/cli.py +100 -0
  234. package/pennyfarthing_scripts/handoff/__init__.py +1 -0
  235. package/pennyfarthing_scripts/handoff/__pycache__/__init__.cpython-314.pyc +0 -0
  236. package/pennyfarthing_scripts/handoff/__pycache__/cli.cpython-314.pyc +0 -0
  237. package/pennyfarthing_scripts/handoff/__pycache__/complete_phase.cpython-314.pyc +0 -0
  238. package/pennyfarthing_scripts/handoff/__pycache__/gate_file.cpython-314.pyc +0 -0
  239. package/pennyfarthing_scripts/handoff/__pycache__/gate_runner.cpython-314.pyc +0 -0
  240. package/pennyfarthing_scripts/handoff/__pycache__/marker.cpython-314.pyc +0 -0
  241. package/pennyfarthing_scripts/handoff/__pycache__/resolve_gate.cpython-314.pyc +0 -0
  242. package/pennyfarthing_scripts/handoff/cli.py +120 -0
  243. package/pennyfarthing_scripts/handoff/complete_phase.py +155 -0
  244. package/pennyfarthing_scripts/handoff/gate_file.py +105 -0
  245. package/pennyfarthing_scripts/handoff/gate_runner.py +152 -0
  246. package/pennyfarthing_scripts/handoff/marker.py +109 -0
  247. package/pennyfarthing_scripts/handoff/resolve_gate.py +152 -0
  248. package/pennyfarthing_scripts/healthscore/__pycache__/__init__.cpython-314.pyc +0 -0
  249. package/pennyfarthing_scripts/healthscore/__pycache__/__main__.cpython-314.pyc +0 -0
  250. package/pennyfarthing_scripts/healthscore/__pycache__/analyze.cpython-314.pyc +0 -0
  251. package/pennyfarthing_scripts/healthscore/__pycache__/cli.cpython-314.pyc +0 -0
  252. package/pennyfarthing_scripts/healthscore/__pycache__/formatters.cpython-314.pyc +0 -0
  253. package/pennyfarthing_scripts/healthscore/__pycache__/models.cpython-314.pyc +0 -0
  254. package/pennyfarthing_scripts/hotspots/__pycache__/__init__.cpython-314.pyc +0 -0
  255. package/pennyfarthing_scripts/hotspots/__pycache__/__main__.cpython-314.pyc +0 -0
  256. package/pennyfarthing_scripts/hotspots/__pycache__/analyze.cpython-314.pyc +0 -0
  257. package/pennyfarthing_scripts/hotspots/__pycache__/cli.cpython-314.pyc +0 -0
  258. package/pennyfarthing_scripts/hotspots/__pycache__/formatters.cpython-314.pyc +0 -0
  259. package/pennyfarthing_scripts/hotspots/__pycache__/models.cpython-314.pyc +0 -0
  260. package/pennyfarthing_scripts/jira/__pycache__/__init__.cpython-314.pyc +0 -0
  261. package/pennyfarthing_scripts/jira/__pycache__/__main__.cpython-314.pyc +0 -0
  262. package/pennyfarthing_scripts/jira/__pycache__/bidirectional.cpython-314.pyc +0 -0
  263. package/pennyfarthing_scripts/jira/__pycache__/claim.cpython-314.pyc +0 -0
  264. package/pennyfarthing_scripts/jira/__pycache__/cli.cpython-314.pyc +0 -0
  265. package/pennyfarthing_scripts/jira/__pycache__/client.cpython-314.pyc +0 -0
  266. package/pennyfarthing_scripts/jira/__pycache__/create.cpython-314.pyc +0 -0
  267. package/pennyfarthing_scripts/jira/__pycache__/epic.cpython-314.pyc +0 -0
  268. package/pennyfarthing_scripts/jira/__pycache__/operations.cpython-314.pyc +0 -0
  269. package/pennyfarthing_scripts/jira/__pycache__/reconcile.cpython-314.pyc +0 -0
  270. package/pennyfarthing_scripts/jira/__pycache__/story.cpython-314.pyc +0 -0
  271. package/pennyfarthing_scripts/jira/__pycache__/sync.cpython-314.pyc +0 -0
  272. package/pennyfarthing_scripts/launch/__pycache__/__init__.cpython-314.pyc +0 -0
  273. package/pennyfarthing_scripts/launch/__pycache__/cli.cpython-314.pyc +0 -0
  274. package/pennyfarthing_scripts/migration/__pycache__/__init__.cpython-314.pyc +0 -0
  275. package/pennyfarthing_scripts/migration/__pycache__/session.cpython-314.pyc +0 -0
  276. package/pennyfarthing_scripts/migration/__pycache__/skill.cpython-314.pyc +0 -0
  277. package/pennyfarthing_scripts/migration/__pycache__/step.cpython-314.pyc +0 -0
  278. package/pennyfarthing_scripts/migration/__pycache__/validate.cpython-314.pyc +0 -0
  279. package/pennyfarthing_scripts/preflight/__pycache__/__init__.cpython-314.pyc +0 -0
  280. package/pennyfarthing_scripts/preflight/__pycache__/__main__.cpython-314.pyc +0 -0
  281. package/pennyfarthing_scripts/preflight/__pycache__/cli.cpython-314.pyc +0 -0
  282. package/pennyfarthing_scripts/preflight/__pycache__/finish.cpython-314.pyc +0 -0
  283. package/pennyfarthing_scripts/prime/__pycache__/__init__.cpython-314.pyc +0 -0
  284. package/pennyfarthing_scripts/prime/__pycache__/cli.cpython-314.pyc +0 -0
  285. package/pennyfarthing_scripts/prime/__pycache__/loader.cpython-314.pyc +0 -0
  286. package/pennyfarthing_scripts/prime/__pycache__/models.cpython-314.pyc +0 -0
  287. package/pennyfarthing_scripts/prime/__pycache__/persona.cpython-314.pyc +0 -0
  288. package/pennyfarthing_scripts/prime/__pycache__/session.cpython-314.pyc +0 -0
  289. package/pennyfarthing_scripts/prime/__pycache__/tiers.cpython-314.pyc +0 -0
  290. package/pennyfarthing_scripts/prime/__pycache__/version_sentinel.cpython-314.pyc +0 -0
  291. package/pennyfarthing_scripts/prime/__pycache__/workflow.cpython-314.pyc +0 -0
  292. package/pennyfarthing_scripts/prime/workflow.py +39 -0
  293. package/pennyfarthing_scripts/session/__init__.py +0 -0
  294. package/pennyfarthing_scripts/session/cli.py +87 -0
  295. package/pennyfarthing_scripts/sprint/__pycache__/__init__.cpython-314.pyc +0 -0
  296. package/pennyfarthing_scripts/sprint/__pycache__/__main__.cpython-314.pyc +0 -0
  297. package/pennyfarthing_scripts/sprint/__pycache__/archive.cpython-314.pyc +0 -0
  298. package/pennyfarthing_scripts/sprint/__pycache__/archive_epic.cpython-314.pyc +0 -0
  299. package/pennyfarthing_scripts/sprint/__pycache__/cli.cpython-314.pyc +0 -0
  300. package/pennyfarthing_scripts/sprint/__pycache__/epic_add.cpython-314.pyc +0 -0
  301. package/pennyfarthing_scripts/sprint/__pycache__/epic_update.cpython-314.pyc +0 -0
  302. package/pennyfarthing_scripts/sprint/__pycache__/import_epic.cpython-314.pyc +0 -0
  303. package/pennyfarthing_scripts/sprint/__pycache__/loader.cpython-314.pyc +0 -0
  304. package/pennyfarthing_scripts/sprint/__pycache__/status.cpython-314.pyc +0 -0
  305. package/pennyfarthing_scripts/sprint/__pycache__/story_add.cpython-314.pyc +0 -0
  306. package/pennyfarthing_scripts/sprint/__pycache__/story_finish.cpython-314.pyc +0 -0
  307. package/pennyfarthing_scripts/sprint/__pycache__/story_update.cpython-314.pyc +0 -0
  308. package/pennyfarthing_scripts/sprint/__pycache__/validate_cmd.cpython-314.pyc +0 -0
  309. package/pennyfarthing_scripts/sprint/__pycache__/validator.cpython-314.pyc +0 -0
  310. package/pennyfarthing_scripts/sprint/__pycache__/work.cpython-314.pyc +0 -0
  311. package/pennyfarthing_scripts/sprint/__pycache__/yaml_io.cpython-314.pyc +0 -0
  312. package/pennyfarthing_scripts/sprint/story_finish.py +14 -0
  313. package/pennyfarthing_scripts/story/__pycache__/__init__.cpython-314.pyc +0 -0
  314. package/pennyfarthing_scripts/story/__pycache__/__main__.cpython-314.pyc +0 -0
  315. package/pennyfarthing_scripts/story/__pycache__/cli.cpython-314.pyc +0 -0
  316. package/pennyfarthing_scripts/story/__pycache__/create.cpython-314.pyc +0 -0
  317. package/pennyfarthing_scripts/story/__pycache__/size.cpython-314.pyc +0 -0
  318. package/pennyfarthing_scripts/story/__pycache__/template.cpython-314.pyc +0 -0
  319. package/pennyfarthing_scripts/tests/__pycache__/__init__.cpython-314.pyc +0 -0
  320. package/pennyfarthing_scripts/tests/__pycache__/conftest.cpython-314-pytest-9.0.2.pyc +0 -0
  321. package/pennyfarthing_scripts/tests/__pycache__/test_108_2_remove_handoff_fallback.cpython-314-pytest-9.0.2.pyc +0 -0
  322. package/pennyfarthing_scripts/tests/__pycache__/test_archive_epic.cpython-314-pytest-9.0.2.pyc +0 -0
  323. package/pennyfarthing_scripts/tests/__pycache__/test_bc.cpython-314-pytest-9.0.2.pyc +0 -0
  324. package/pennyfarthing_scripts/tests/__pycache__/test_bikerack.cpython-314-pytest-9.0.2.pyc +0 -0
  325. package/pennyfarthing_scripts/tests/__pycache__/test_brownfield.cpython-314-pytest-9.0.2.pyc +0 -0
  326. package/pennyfarthing_scripts/tests/__pycache__/test_cli_modules.cpython-314-pytest-9.0.2.pyc +0 -0
  327. package/pennyfarthing_scripts/tests/__pycache__/test_cli_normalization.cpython-314-pytest-9.0.2.pyc +0 -0
  328. package/pennyfarthing_scripts/tests/__pycache__/test_codemarkers.cpython-314-pytest-9.0.2.pyc +0 -0
  329. package/pennyfarthing_scripts/tests/__pycache__/test_common.cpython-314-pytest-9.0.2.pyc +0 -0
  330. package/pennyfarthing_scripts/tests/__pycache__/test_epic_shard_validation.cpython-314-pytest-9.0.2.pyc +0 -0
  331. package/pennyfarthing_scripts/tests/__pycache__/test_gate_file_resolution.cpython-314-pytest-9.0.2.pyc +0 -0
  332. package/pennyfarthing_scripts/tests/__pycache__/test_gate_runner.cpython-314-pytest-9.0.2.pyc +0 -0
  333. package/pennyfarthing_scripts/tests/__pycache__/test_git_utils.cpython-314-pytest-9.0.2.pyc +0 -0
  334. package/pennyfarthing_scripts/tests/__pycache__/test_handoff_cli.cpython-314-pytest-9.0.2.pyc +0 -0
  335. package/pennyfarthing_scripts/tests/__pycache__/test_handoff_e2e.cpython-314-pytest-9.0.2.pyc +0 -0
  336. package/pennyfarthing_scripts/tests/__pycache__/test_healthscore.cpython-314-pytest-9.0.2.pyc +0 -0
  337. package/pennyfarthing_scripts/tests/__pycache__/test_jira_package.cpython-314-pytest-9.0.2.pyc +0 -0
  338. package/pennyfarthing_scripts/tests/__pycache__/test_package_structure.cpython-314-pytest-9.0.2.pyc +0 -0
  339. package/pennyfarthing_scripts/tests/__pycache__/test_patch_mode.cpython-314-pytest-9.0.2.pyc +0 -0
  340. package/pennyfarthing_scripts/tests/__pycache__/test_prime.cpython-314-pytest-9.0.2.pyc +0 -0
  341. package/pennyfarthing_scripts/tests/__pycache__/test_resolve_gate_file_field.cpython-314-pytest-9.0.2.pyc +0 -0
  342. package/pennyfarthing_scripts/tests/__pycache__/test_sprint_package.cpython-314-pytest-9.0.2.pyc +0 -0
  343. package/pennyfarthing_scripts/tests/__pycache__/test_sprint_panel.cpython-314-pytest-9.0.2.pyc +0 -0
  344. package/pennyfarthing_scripts/tests/__pycache__/test_sprint_validator.cpython-314-pytest-9.0.2.pyc +0 -0
  345. package/pennyfarthing_scripts/tests/__pycache__/test_story_add.cpython-314-pytest-9.0.2.pyc +0 -0
  346. package/pennyfarthing_scripts/tests/__pycache__/test_story_package.cpython-314-pytest-9.0.2.pyc +0 -0
  347. package/pennyfarthing_scripts/tests/__pycache__/test_story_update.cpython-314-pytest-9.0.2.pyc +0 -0
  348. package/pennyfarthing_scripts/tests/__pycache__/test_tiers.cpython-314-pytest-9.0.2.pyc +0 -0
  349. package/pennyfarthing_scripts/tests/__pycache__/test_token_counting.cpython-314-pytest-9.0.2.pyc +0 -0
  350. package/pennyfarthing_scripts/tests/__pycache__/test_topology_loader.cpython-314-pytest-9.0.2.pyc +0 -0
  351. package/pennyfarthing_scripts/tests/__pycache__/test_tui_focus.cpython-314-pytest-9.0.2.pyc +0 -0
  352. package/pennyfarthing_scripts/tests/__pycache__/test_tui_panel_persistence.cpython-314-pytest-9.0.2.pyc +0 -0
  353. package/pennyfarthing_scripts/tests/__pycache__/test_validate_cmd.cpython-314-pytest-9.0.2.pyc +0 -0
  354. package/pennyfarthing_scripts/tests/__pycache__/test_version_sentinel.cpython-314-pytest-9.0.2.pyc +0 -0
  355. package/pennyfarthing_scripts/tests/__pycache__/test_workflow_check.cpython-314-pytest-9.0.2.pyc +0 -0
  356. package/pennyfarthing_scripts/tests/__pycache__/test_yaml_io.cpython-314-pytest-9.0.2.pyc +0 -0
  357. package/pennyfarthing_scripts/tests/test_108_1_gate_migration.py +540 -0
  358. package/pennyfarthing_scripts/tests/test_108_2_remove_handoff_fallback.py +339 -0
  359. package/pennyfarthing_scripts/tests/test_confidence_sm_evaluation.py +253 -0
  360. package/pennyfarthing_scripts/tests/test_confidence_sm_gate.py +315 -0
  361. package/pennyfarthing_scripts/tests/test_gate_file_resolution.py +341 -0
  362. package/pennyfarthing_scripts/tests/test_gate_runner.py +620 -0
  363. package/pennyfarthing_scripts/tests/test_handoff_cli.py +929 -0
  364. package/pennyfarthing_scripts/tests/test_handoff_e2e.py +454 -0
  365. package/pennyfarthing_scripts/tests/test_resolve_gate_file_field.py +464 -0
  366. package/pennyfarthing_scripts/theme/__pycache__/__init__.cpython-314.pyc +0 -0
  367. package/pennyfarthing_scripts/theme/__pycache__/cli.cpython-314.pyc +0 -0
  368. package/pennyfarthing_scripts/validate/__pycache__/__init__.cpython-314.pyc +0 -0
  369. package/pennyfarthing_scripts/validate/__pycache__/cli.cpython-314.pyc +0 -0
  370. package/pennyfarthing_scripts/validate/adapters/__pycache__/__init__.cpython-314.pyc +0 -0
  371. package/pennyfarthing_scripts/validate/adapters/__pycache__/agent.cpython-314.pyc +0 -0
  372. package/pennyfarthing_scripts/validate/adapters/__pycache__/schema.cpython-314.pyc +0 -0
  373. package/pennyfarthing_scripts/validate/adapters/__pycache__/skill_command.cpython-314.pyc +0 -0
  374. package/pennyfarthing_scripts/validate/adapters/__pycache__/sprint.cpython-314.pyc +0 -0
  375. package/pennyfarthing_scripts/validate/adapters/__pycache__/workflow.cpython-314.pyc +0 -0
  376. package/pennyfarthing_scripts/validate/adapters/skill_command.py +200 -0
  377. package/pennyfarthing_scripts/validate/adapters/workflow.py +64 -0
  378. package/pennyfarthing_scripts/validate/cli.py +15 -4
  379. package/packages/core/dist/scripts/benchmark-integration.d.ts +0 -182
  380. package/packages/core/dist/scripts/benchmark-integration.d.ts.map +0 -1
  381. package/packages/core/dist/scripts/benchmark-integration.js +0 -691
  382. package/packages/core/dist/scripts/benchmark-integration.js.map +0 -1
  383. package/packages/core/dist/scripts/job-fair-aggregator.d.ts +0 -150
  384. package/packages/core/dist/scripts/job-fair-aggregator.d.ts.map +0 -1
  385. package/packages/core/dist/scripts/job-fair-aggregator.js +0 -547
  386. package/packages/core/dist/scripts/job-fair-aggregator.js.map +0 -1
  387. package/pennyfarthing-dist/agents/handoff.md +0 -250
  388. package/pennyfarthing-dist/agents/sm-handoff.md +0 -152
  389. package/pennyfarthing-dist/scripts/core/handoff-marker.sh +0 -112
  390. package/pennyfarthing-dist/scripts/hooks/__pycache__/question_reflector_check.cpython-314.pyc +0 -0
  391. package/pennyfarthing_scripts/__pycache__/__init__.cpython-311.pyc +0 -0
  392. package/pennyfarthing_scripts/__pycache__/jira.cpython-314.pyc +0 -0
  393. package/pennyfarthing_scripts/__pycache__/sprint.cpython-314.pyc +0 -0
  394. package/pennyfarthing_scripts/__pycache__/workflow.cpython-311.pyc +0 -0
  395. package/pennyfarthing_scripts/jira/__pycache__/compat.cpython-314.pyc +0 -0
  396. package/pennyfarthing_scripts/jira/__pycache__/mappings.cpython-314.pyc +0 -0
  397. package/pennyfarthing_scripts/jira/__pycache__/models.cpython-314.pyc +0 -0
  398. package/pennyfarthing_scripts/migration/__pycache__/__main__.cpython-314.pyc +0 -0
  399. package/pennyfarthing_scripts/migration/__pycache__/cli.cpython-314.pyc +0 -0
  400. package/pennyfarthing_scripts/prime/__pycache__/__main__.cpython-314.pyc +0 -0
  401. package/pennyfarthing_scripts/tests/__pycache__/test_workflow_cli.cpython-314-pytest-9.0.2.pyc +0 -0
@@ -1,447 +1,13 @@
1
1
  ---
2
- description: Run a single agent on a scenario with absolute rubric scoring
3
- argument-hint: <theme:agent> --scenario <name> [--as <role>] [--runs N] [--no-judge]
2
+ deprecated: true
3
+ redirect: pf-solo
4
+ description: "DEPRECATED: Use /pf-solo instead."
4
5
  ---
5
6
 
6
- # Solo Benchmark
7
+ # /solo - DEPRECATED
7
8
 
8
- <purpose>
9
- Run a single agent on a scenario. This is the CANONICAL agent execution path.
9
+ Solo commands have been renamed. Use `/pf-solo` instead.
10
10
 
11
- **Modes:**
12
- - **Full (default):** Agent runs → `/judge` evaluates → `/finalize-run` saves
13
- - **No-judge (`--no-judge`):** Agent runs only, returns raw response (for /duel, /relay)
14
- </purpose>
15
-
16
- <architecture>
17
- ```
18
- /solo theme:agent --scenario X
19
-
20
- ├──► Execute agent via CLI
21
- │ └──► Response + tokens
22
-
23
- ├──► /judge --mode solo (if not --no-judge)
24
- │ └──► Score + verdict
25
-
26
- └──► /finalize-run --type solo
27
- └──► Validate + save
28
- ```
29
- </architecture>
30
-
31
- <usage>
32
- ```
33
- /solo <contestant> --scenario <name>
34
- /solo <contestant> --scenario <name> --runs 4
35
- /solo <contestant> --scenario <name> --no-judge
36
- /solo <contestant> --as <role> --scenario <name>
37
- ```
38
-
39
- **Arguments:**
40
- - `contestant` - `theme:agent` format (e.g., `discworld:reviewer`) OR `theme:character` with `--as`
41
- - `--scenario` - Scenario from `scenarios/` directory
42
- - `--as <role>` - Override role (use character's persona for different role's task)
43
- - `--runs N` - Number of runs (default: 1, max: 20)
44
- - `--no-judge` - Skip judging, return raw response
45
-
46
- **Cross-Role Testing with `--as`:**
47
-
48
- The `--as` flag enables running any persona as any role, useful for research:
49
-
50
- ```
51
- /solo shakespeare:prospero --as dev --scenario django-10554
52
- ```
53
-
54
- This uses Prospero's persona traits (wise orchestrator, magic metaphors) but gives him a dev task.
55
- The scenario's role determines what the agent is asked to do; the character determines how they do it.
56
- </usage>
57
-
58
- <on-invoke>
59
- The user invoked this command with: $ARGUMENTS
60
-
61
- ## Step 1: Parse Arguments
62
-
63
- Extract:
64
- - `contestant`: `theme:agent` spec (or `theme:character` if using `--as`)
65
- - `scenario_name`: After `--scenario`
66
- - `role_override`: After `--as` (optional - for cross-role testing)
67
- - `runs`: Number (default: 1)
68
- - `no_judge`: Boolean
69
-
70
- Validate spec contains `:`, scenario is required, runs is 1-20.
71
-
72
- **If `--as` is provided:**
73
- - The second part of the spec is a CHARACTER name, not a role
74
- - `role_override` becomes the effective role for scenario matching
75
- - Character persona is extracted by name lookup across all agents in theme
76
-
77
- ## Step 2: Load Scenario
78
-
79
- ```yaml
80
- Glob tool:
81
- pattern: "scenarios/**/{scenario_name}.yaml"
82
- ```
83
-
84
- Extract: `prompt`, `scenario_title`, `code_content` (if present)
85
-
86
- ## Step 3: Load Persona
87
-
88
- Read: `pennyfarthing-dist/personas/themes/{theme}.yaml`
89
-
90
- **Standard mode (no `--as`):**
91
- - Look up `agents.{agent}` section
92
- - Extract: `character`, `style`, `expertise`, `catchphrases`, `emoji`
93
- - `effective_role` = agent name from spec
94
-
95
- **Cross-role mode (with `--as`):**
96
- - The spec contains `theme:character_name` (e.g., `shakespeare:prospero`)
97
- - Search ALL agent sections for one where `character` field matches (case-insensitive, partial match OK)
98
- - Extract persona traits from that agent's config
99
- - `effective_role` = the `--as` value (NOT the role the character normally fills)
100
-
101
- ```python
102
- # Pseudocode for cross-role lookup
103
- if role_override:
104
- character_query = spec.split(':')[1].lower() # e.g., "prospero"
105
- for agent_name, agent_config in theme['agents'].items():
106
- char_name = agent_config.get('character', '').lower()
107
- if character_query in char_name or char_name.startswith(character_query):
108
- persona = agent_config
109
- source_role = agent_name # where character normally lives
110
- break
111
- effective_role = role_override # what we're asking them to do
112
- else:
113
- agent_name = spec.split(':')[1] # e.g., "dev"
114
- persona = theme['agents'][agent_name]
115
- effective_role = agent_name
116
- source_role = agent_name
117
- ```
118
-
119
- This enables running Prospero (normally SM) as a dev, or Gus Fring (normally orchestrator) as a reviewer.
120
-
121
- ## Step 3b: Build Agent Prompt
122
-
123
- Use the Write tool to create the prompt file with this template:
124
-
125
- ```
126
- You are {character}.
127
-
128
- **Style:** {style}
129
- **Expertise:** {expertise}
130
- **Catchphrases:** {catchphrases}
131
-
132
- ---
133
-
134
- ## Challenge
135
-
136
- {scenario_prompt}
137
-
138
- {code_content if present}
139
-
140
- ---
141
-
142
- Respond fully in character. Under 500 words.
143
-
144
- **IMPORTANT:** Provide your complete response directly. Do not attempt to use tools, read files, or make function calls.
145
- ```
146
-
147
- **Cross-role note:** When using `--as`, the scenario prompt comes from the `effective_role` (e.g., dev tasks),
148
- but the character/style/expertise come from the character's original role config. This tests whether
149
- personality traits affect task performance independent of role-specific training.
150
-
151
- The final instruction is critical - without it, the model may output tool-call syntax even with `--tools ""`, resulting in incomplete responses.
152
-
153
- ## Step 4: Execute Agent via CLI
154
-
155
- **RECOMMENDED: Use the shell script for reliable execution:**
156
-
157
- ```bash
158
- ./scripts/solo-runner.sh {theme}:{agent} {scenario} {output_dir}
159
- ```
160
-
161
- The shell script handles all escaping, temp files, and JSON parsing correctly.
162
- Use inline commands only for simple cases or when the script isn't available.
163
-
164
- ---
165
-
166
- **CRITICAL: The `--tools ""` flag is MANDATORY.**
167
-
168
- Without `--tools ""`, agents may use tools internally (Read, Write, Bash, etc.), causing:
169
- 1. Multi-turn conversations (num_turns > 1)
170
- 2. The `.result` field only captures the FINAL message (often just a summary)
171
- 3. Full response content is LOST - judges only see truncated output
172
- 4. Scores are INVALID because judges evaluate incomplete data
173
-
174
- **Evidence:** Miles Vorkosigan benchmark (2026-01-01) scored 76.69 with tools enabled vs Leo McGarry's 91.03 with `--tools ""`. Miles' runs had num_turns: 5-7 and judges only saw summaries, not full story breakdowns.
175
-
176
- **CRITICAL: Use PIPE syntax, NOT heredocs.**
177
-
178
- **NEVER USE HEREDOCS** - Heredoc syntax (`<<'EOF'`, `<<EOF`, `<<'PROMPT'`, etc.) FAILS in subagents.
179
- The permission system treats heredocs differently and they get auto-denied.
180
-
181
- **ALWAYS USE PIPE SYNTAX** - This works in both main sessions and subagents:
182
- - `echo "$PROMPT" | claude -p ...` - WORKS
183
- - `cat file.txt | claude -p ...` - WORKS
184
- - `printf '%s' "$PROMPT" | claude -p ...` - WORKS
185
- - `claude -p ... <<'EOF'` - **FAILS IN SUBAGENTS - DO NOT USE**
186
-
187
- **CRITICAL: Use FILE REDIRECTION, NOT variable capture.**
188
-
189
- **NEVER CAPTURE OUTPUT IN VARIABLES** - Command substitution with `$(...)` causes zsh parse errors
190
- when the JSON output contains parentheses or special characters:
191
- - `OUTPUT=$(cat file.txt | claude -p ...)` - **FAILS with `parse error near ')'`**
192
-
193
- **ALWAYS REDIRECT TO FILES** - This avoids shell parsing issues:
194
- - `cat file.txt | claude -p ... > output.json` - WORKS
195
- - Then read: `jq -r '.result' output.json` - WORKS
196
-
197
- ```bash
198
- # Step 1: Capture timestamp to file (avoid variable capture issues)
199
- date -u +%Y-%m-%dT%H:%M:%SZ > /tmp/timestamp_$$.txt
200
-
201
- # Step 2: Write prompt to file using Write tool (avoids escaping issues)
202
- # Use the Write tool to create: /tmp/prompt_$$.txt
203
-
204
- # Step 3: Execute with file redirection (NOT variable capture)
205
- cat /tmp/prompt_$$.txt | claude -p --output-format json --tools "" > /tmp/output_$$.json
206
-
207
- # Step 4: Extract results from files
208
- TIMESTAMP=$(cat /tmp/timestamp_$$.txt)
209
- RESPONSE=$(jq -r '.result' /tmp/output_$$.json)
210
- INPUT_TOKENS=$(jq -r '.usage.input_tokens // 0' /tmp/output_$$.json)
211
- OUTPUT_TOKENS=$(jq -r '.usage.output_tokens // 0' /tmp/output_$$.json)
212
-
213
- # Step 5: Cleanup
214
- rm -f /tmp/timestamp_$$.txt /tmp/prompt_$$.txt /tmp/output_$$.json
215
- ```
216
-
217
- **Why file redirection works:** The shell never tries to parse the JSON output.
218
- It goes directly to a file, then jq reads it safely.
219
-
220
- ## Step 5: Check Mode
221
-
222
- **If `--no-judge`:** Return raw response and metadata, STOP.
223
-
224
- ```markdown
225
- ## Solo Agent Response
226
-
227
- **Contestant:** {spec} ({character})
228
- **Scenario:** {scenario_name}
229
-
230
- ---
231
-
232
- {response}
233
-
234
- ---
235
-
236
- ```json
237
- {
238
- "spec": "{spec}",
239
- "character": "{character}",
240
- "cli_timestamp": "{TIMESTAMP}",
241
- "response_length": {length},
242
- "input_tokens": {INPUT_TOKENS},
243
- "output_tokens": {OUTPUT_TOKENS}
244
- }
245
- ```
246
- ```
247
-
248
- **If full mode:** Continue to Step 6.
249
-
250
- ## Step 6: Invoke Judge Skill
251
-
252
- **Detect SWE-bench scenarios for deterministic evaluation:**
253
-
254
- Check if the scenario is from SWE-bench by looking at its path or category:
255
- ```python
256
- is_swebench = (
257
- 'swe-bench' in scenario_path.lower() or
258
- scenario.get('category') == 'swe-bench' or
259
- scenario.get('source') == 'swe-bench'
260
- )
261
- ```
262
-
263
- **If SWE-bench scenario:**
264
-
265
- Use deterministic Python-based evaluation instead of LLM-as-judge:
266
-
267
- ```bash
268
- # Save response to temp file for Python judge
269
- echo '{"result": "{RESPONSE}"}' > /tmp/solo_response_$$.json
270
-
271
- # Run SWE-bench judge (deterministic scoring against ground truth)
272
- python3 .pennyfarthing/scripts/test/swebench-judge.py {scenario_name} /tmp/solo_response_$$.json
273
- ```
274
-
275
- The Python script returns:
276
- - `total`: Score out of 100
277
- - `scores`: Breakdown by category (root_cause, fix_quality, completeness, persona)
278
- - `details`: Specific findings and matches
279
-
280
- **If standard scenario (non-SWE-bench):**
281
-
282
- Use LLM-as-judge:
283
- ```
284
- /judge --mode solo --data {
285
- "spec": "{spec}",
286
- "character": "{character}",
287
- "challenge": "{prompt}",
288
- "response": "{RESPONSE}"
289
- }
290
- ```
291
-
292
- Capture: `score`, `judge_timestamp`, `judge_response`, `judge_tokens`
293
-
294
- ## Step 7: Invoke Finalize-Run Skill
295
-
296
- ```
297
- /finalize-run --type solo --data {
298
- "timestamp": "{ISO8601}",
299
- "scenario": {"name": "{scenario_name}", "title": "{title}"},
300
- "agents": [{
301
- "spec": "{spec}",
302
- "cli_timestamp": "{TIMESTAMP}",
303
- "response_text": "{RESPONSE}",
304
- "input_tokens": {INPUT_TOKENS},
305
- "output_tokens": {OUTPUT_TOKENS}
306
- }],
307
- "judge": {
308
- "cli_timestamp": "{judge_timestamp}",
309
- "response_text": "{judge_response}",
310
- "input_tokens": {judge_input},
311
- "output_tokens": {judge_output}
312
- },
313
- "scores": {"{spec}": {score}},
314
- "output_path": "internal/results/solo/{timestamp}-{theme}-{agent}.json"
315
- }
316
- ```
317
-
318
- ## Step 8: Display Results
319
-
320
- ```markdown
321
- ## Solo Evaluation
322
-
323
- {judge verdict}
324
-
325
- ---
326
-
327
- ## Efficiency
328
-
329
- | Metric | Value |
330
- |--------|-------|
331
- | Agent Tokens | {agent_total} |
332
- | Judge Tokens | {judge_total} |
333
- | Score | {score}/100 |
334
- | Tokens/Point | {tpp} |
335
-
336
- ---
337
-
338
- ✓ Saved to {output_path}
339
- ```
340
-
341
- ## Step 9: Multi-Run Mode (if runs > 1)
342
-
343
- 1. Create output directory (see Step 10 for path logic)
344
- 2. Repeat Steps 4-7 for each run
345
- 3. Save each to `runs/run_{i}.json` and `runs/judge_{i}.json`
346
- 4. Calculate statistics and save summary.yaml (Step 10)
347
-
348
- ## Step 10: Save Summary (ALWAYS - even for n=1)
349
-
350
- **Output path logic:**
351
-
352
- ```
353
- if theme == "control":
354
- base_path = "internal/results/baselines/{scenario}/{effective_role}/"
355
- elif role_override: # cross-role mode
356
- # Include character name and effective role for clarity
357
- base_path = "internal/results/benchmarks/{scenario}/{theme}-{character}-as-{effective_role}/"
358
- else:
359
- base_path = "internal/results/benchmarks/{scenario}/{theme}-{effective_role}/"
360
- ```
361
-
362
- **Cross-role example:** `/solo shakespeare:prospero --as dev --scenario django-10554`
363
- → saves to `internal/results/benchmarks/django-10554/shakespeare-prospero-as-dev/`
364
-
365
- **For ALL runs (including n=1):**
366
-
367
- 1. Create directory structure:
368
- ```bash
369
- mkdir -p "{base_path}/runs"
370
- ```
371
-
372
- 2. Save run files:
373
- - `runs/run_{i}.json` - Agent response + tokens
374
- - `runs/judge_{i}.json` - Judge evaluation
375
-
376
- 3. Calculate statistics:
377
- ```python
378
- scores = [run.score for run in runs]
379
- mean = sum(scores) / len(scores)
380
- std_dev = sqrt(sum((s - mean)^2 for s in scores) / len(scores))
381
- ```
382
-
383
- 4. **ALWAYS save summary.yaml:**
384
- ```yaml
385
- # {theme}:{character} on {scenario} (as {effective_role})
386
- # Generated: {ISO8601 timestamp}
387
-
388
- agent:
389
- theme: {theme}
390
- character: {character_name}
391
- effective_role: {effective_role} # role being performed
392
- source_role: {source_role} # role where character normally lives
393
- spec: {theme}:{character} # original spec
394
- cross_role: {true if role_override else false}
395
-
396
- scenario:
397
- name: {scenario_name}
398
- category: {category}
399
- difficulty: {difficulty}
400
-
401
- statistics:
402
- n: {run_count}
403
- mean: {mean:.2f}
404
- std_dev: {std_dev:.2f}
405
- min: {min_score}
406
- max: {max_score}
407
- scores: [{score1}, {score2}, ...]
408
-
409
- efficiency:
410
- avg_input_tokens: {avg_in}
411
- avg_output_tokens: {avg_out}
412
- tokens_per_point: {tpp:.2f}
413
-
414
- metadata:
415
- created_at: {ISO8601 timestamp}
416
- pennyfarthing_version: {version from package.json} # REQUIRED
417
- model: sonnet
418
-
419
- # Include baseline comparison if baseline exists and theme != control
420
- baseline_comparison:
421
- control_mean: {baseline_mean}
422
- control_stddev: {baseline_std}
423
- delta: {mean - baseline_mean:+.2f}
424
-
425
- runs:
426
- - run_1.json
427
- - run_2.json
428
- # ...
429
- ```
430
-
431
- 5. Display:
432
- ```
433
- ✓ Saved {n} run(s) to {base_path}
434
- ✓ Summary: {base_path}/summary.yaml
435
- ```
436
-
437
- </on-invoke>
438
-
439
- <reference>
440
- - **Judge Skill:** `.claude/project/skills/judge/SKILL.md`
441
- - **Finalize-Run Skill:** `.claude/project/skills/finalize-run/SKILL.md`
442
- - **Themes:** `pennyfarthing-dist/personas/themes/*.yaml`
443
- - **Scenarios:** `scenarios/**/*.yaml`
444
- - **Baselines:** `internal/results/baselines/{scenario}/{role}/` (control theme)
445
- - **Benchmarks:** `internal/results/benchmarks/{scenario}/{theme}-{role}/` (all other themes)
446
- - **Results README:** `internal/results/README.md`
447
- </reference>
11
+ | Old Command | New Command |
12
+ |-------------|-------------|
13
+ | `/solo` | `/pf-solo` |
@@ -3,9 +3,9 @@
3
3
 
4
4
  **Tests:** Use `testing-runner` subagent, never run directly.
5
5
 
6
- **Handoff:** Run `handoff-marker.sh {next_agent}` → extract markeremit → EXIT. See `<agent-exit-protocol>`.
6
+ **Handoff:** Run `pf handoff resolve-gate` → gate check`pf handoff complete-phase` `pf handoff marker` → EXIT. See `<agent-exit-protocol>`.
7
7
 
8
- **Sidecars:** Write learnings BEFORE spawning handoff subagent.
8
+ **Sidecars:** Write learnings BEFORE starting exit protocol.
9
9
 
10
10
  **Scripts:** Pennyfarthing scripts are Python-based (`pennyfarthing_scripts/`), not shell—check before assuming `.sh`.
11
11
  </critical>
@@ -117,16 +117,16 @@ See `.pennyfarthing/guides/tandem-protocol.md` for full protocol details.
117
117
 
118
118
  1. Write assessment to session
119
119
  2. Terminate tandem backseat (if active)
120
- 3. Spawn `handoff` subagent returns `HANDOFF_RESULT: {status, next_agent}`
121
- 4. If blocked → report error, stop
122
- 5. Run `.pennyfarthing/scripts/core/handoff-marker.sh {next_agent}`
123
- 6. Extract marker from YAML output, emit it:
124
- ```
125
- <!-- CYCLIST:HANDOFF:/dev -->
126
-
127
- Run `/dev` to continue
128
- ```
129
- 7. EXIT (nothing after marker)
120
+ 3. `pf handoff resolve-gate {story-id} {workflow} {phase}` RESOLVE_RESULT
121
+ 4. If blocked → report error, STOP
122
+ 5. If skip → jump to step 7
123
+ 6. If ready spawn gate subagent with gate file → GATE_RESULT
124
+ - If fail → fix issues, retry from step 3 (max 3 retries)
125
+ - If pass → continue
126
+ 7. `pf handoff complete-phase {story-id} {workflow} {from} {to} {gate-type}`
127
+ 8. `pf handoff marker {next_agent}` → emit marker → EXIT
128
+
129
+ **Agents drive exit directly — no handoff subagent.** Scripts handle routing and session updates atomically.
130
130
  </agent-exit-protocol>
131
131
 
132
132
  <wrong-phase-detection>
@@ -143,6 +143,6 @@ Output:
143
143
  ```
144
144
  <!-- CYCLIST:HANDOFF:/{OWNER} -->
145
145
 
146
- Run `/{OWNER}` to continue
146
+ Run `/pf-{OWNER}` to continue
147
147
  ```
148
148
  </wrong-phase-detection>
@@ -4,7 +4,7 @@
4
4
 
5
5
  This document describes how Pennyfarthing agents are coordinated. The framework supports both single-repo and multi-repo projects.
6
6
 
7
- **Key Principle:** Single entry point (`/new-work` or `/work`), state detection via session files, handoffs via Haiku subagents.
7
+ **Key Principle:** Single entry point (`/pf-session new` or `/work`), state detection via session files, handoffs via Haiku subagents.
8
8
 
9
9
  ## The TDD Flow
10
10
 
@@ -12,7 +12,7 @@ SM → TEA → Dev → Reviewer → SM (setup → red → green → review → f
12
12
 
13
13
  Handoffs between agents are managed by Haiku subagents.
14
14
 
15
- **Entry points:** `/new-work` (new story) or `/work` (smart resume/start)
15
+ **Entry points:** `/pf-session new` (new story) or `/work` (smart resume/start)
16
16
  **State detection:** Agents read session file on activation
17
17
  **Handoffs:** Agents spawn Haiku subagents to update session file
18
18
  **Finish:** SM handles when status = `approved`
@@ -311,7 +311,7 @@ SM → TEA → Dev → Reviewer → SM
311
311
  | **Reviewer Subagents** | | |
312
312
  | `reviewer-preflight.md` | Pre-flight checks before review | haiku |
313
313
  ### SM → TEA (Story Setup)
314
- **Trigger:** User selects story via `/new-work`
314
+ **Trigger:** User selects story via `/pf-session new`
315
315
  **Subagent:** `sm-setup MODE=setup` then `sm-handoff`
316
316
  **Action:** Claim Jira, write session file, create branches
317
317
  **Handoff phrase:** "TEA, Story X-Y needs tests. Write failing tests for these ACs."
@@ -423,13 +423,13 @@ Dev Agent Example (API story):
423
423
  ## Architecture History
424
424
 
425
425
  ### Previous Architecture (Pre-December 2025)
426
- - Separate commands: `/new-work`, `/pickup-work`, `/handoff-work`, `/finish-work`
426
+ - Separate commands: `/pf-session new`, `/pickup-work`, `/handoff-work`, `/finish-work`
427
427
  - Manual handoff documentation
428
428
  - No subagent extraction
429
429
 
430
430
  ### Current Architecture (January 2026)
431
431
  - Smart entry point: `/work` (resumes or starts new)
432
- - Alternative: `/new-work` (explicitly start new story)
432
+ - Alternative: `/pf-session new` (explicitly start new story)
433
433
  - State detection via session file in `.session/`
434
434
  - Handoffs via Haiku subagents in `.pennyfarthing/agents/`
435
435
  - SM handles finish-story when status = `approved`
@@ -448,7 +448,7 @@ sprint/ # Sprint tracking
448
448
  ```bash
449
449
  # Entry points
450
450
  /work # Smart entry - resume or start new
451
- /new-work # Explicitly start new story
451
+ /pf-session new # Explicitly start new story
452
452
 
453
453
  # TDD Flow agents
454
454
  /sm # Scrum Master (setup + finish)
@@ -465,7 +465,7 @@ sprint/ # Sprint tracking
465
465
  # Utility
466
466
  /check # Run quality gates before handoff
467
467
  /chore # Quick commit for small changes
468
- /release # Merge develop to main
468
+ /pf-git release # Merge develop to main
469
469
  ```
470
470
 
471
471
  ---
@@ -105,8 +105,8 @@ High-priority instruction that MUST be followed.
105
105
 
106
106
  ```xml
107
107
  <critical>
108
- **HANDOFF REQUIRES MARKER OUTPUT.** After `handoff` subagent returns:
109
- Run `handoff-marker.sh {next_agent}` as ABSOLUTE LAST ACTION, output result, EXIT.
108
+ **HANDOFF REQUIRES MARKER OUTPUT.** After exit protocol completes:
109
+ Run `pf handoff marker {next_agent}` as ABSOLUTE LAST ACTION, output result, EXIT.
110
110
  </critical>
111
111
  ```
112
112
  - **Validation:** First `<critical>` should be within line 30
@@ -178,7 +178,7 @@ Read `**Workflow:**` and `**Phase:**` from session. Query:
178
178
  OWNER=$(.pennyfarthing/scripts/workflow/phase-owner.sh {workflow} {phase})
179
179
  ```
180
180
 
181
- **If OWNER != "dev":** Run `handoff-marker.sh $OWNER`, output result, tell user.
181
+ **If OWNER != "dev":** Run `pf handoff marker $OWNER`, output result, tell user.
182
182
  </phase-check>
183
183
  ```
184
184
 
@@ -250,8 +250,9 @@ Tags that contain checklists with `- [ ]` items.
250
250
  ## MANDATORY: Complete Before Exiting
251
251
 
252
252
  - [ ] Write Dev Assessment to session file
253
- - [ ] Spawn `handoff` subagent
254
- - [ ] Verify handoff completed (subagent emits marker)
253
+ - [ ] Run `pf handoff resolve-gate` — verify gate status
254
+ - [ ] Run `pf handoff complete-phase` atomic session update
255
+ - [ ] Run `pf handoff marker {next_agent}` — emit marker and EXIT
255
256
  </handoff-gate>
256
257
  ```
257
258