xtrm-tools 0.7.3 → 0.7.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (533) hide show
  1. package/.xtrm/config/hooks.json +3 -0
  2. package/.xtrm/config/pi/extensions/xtrm-ui/format.ts +189 -0
  3. package/.xtrm/config/pi/extensions/xtrm-ui/index.ts +76 -17
  4. package/.xtrm/config/pi/extensions/xtrm-ui/package.json +16 -5
  5. package/.xtrm/ext-src/custom-footer/.pi/structured-returns/83051fe4-97da-4e2c-bdaa-343b32f4e714.combined.log +7 -0
  6. package/.xtrm/ext-src/custom-footer/.pi/structured-returns/83051fe4-97da-4e2c-bdaa-343b32f4e714.stderr.log +0 -0
  7. package/.xtrm/ext-src/custom-footer/.pi/structured-returns/83051fe4-97da-4e2c-bdaa-343b32f4e714.stdout.log +7 -0
  8. package/.xtrm/ext-src/xtrm-ui/format.ts +282 -0
  9. package/.xtrm/{extensions → ext-src}/xtrm-ui/index.ts +76 -17
  10. package/.xtrm/ext-src/xtrm-ui/package.json +21 -0
  11. package/.xtrm/hooks/specialists/specialists-complete.mjs +70 -0
  12. package/.xtrm/hooks/specialists/specialists-session-start.mjs +105 -0
  13. package/.xtrm/registry.json +397 -409
  14. package/.xtrm/skills/default/README.txt +31 -0
  15. package/.xtrm/skills/default/clean-code/SKILL.md +201 -0
  16. package/.xtrm/skills/default/creating-service-skills/SKILL.md +433 -0
  17. package/.xtrm/skills/default/creating-service-skills/references/script_quality_standards.md +425 -0
  18. package/.xtrm/skills/default/creating-service-skills/references/service_skill_system_guide.md +278 -0
  19. package/.xtrm/skills/default/creating-service-skills/scripts/bootstrap.py +326 -0
  20. package/.xtrm/skills/default/creating-service-skills/scripts/deep_dive.py +304 -0
  21. package/.xtrm/skills/default/creating-service-skills/scripts/scaffolder.py +482 -0
  22. package/.xtrm/skills/default/deepwiki/SKILL.md +50 -0
  23. package/.xtrm/skills/default/delegating/SKILL.md +196 -0
  24. package/.xtrm/skills/default/delegating/config.yaml +210 -0
  25. package/.xtrm/skills/default/delegating/references/orchestration-protocols.md +41 -0
  26. package/.xtrm/skills/default/documenting/CHANGELOG.md +23 -0
  27. package/.xtrm/skills/default/documenting/README.md +148 -0
  28. package/.xtrm/skills/default/documenting/SKILL.md +113 -0
  29. package/.xtrm/skills/default/documenting/examples/example_pattern.md +70 -0
  30. package/.xtrm/skills/default/documenting/examples/example_reference.md +70 -0
  31. package/.xtrm/skills/default/documenting/examples/example_ssot_analytics.md +64 -0
  32. package/.xtrm/skills/default/documenting/examples/example_workflow.md +141 -0
  33. package/.xtrm/skills/default/documenting/references/changelog-format.md +97 -0
  34. package/.xtrm/skills/default/documenting/references/metadata-schema.md +136 -0
  35. package/.xtrm/skills/default/documenting/references/taxonomy.md +81 -0
  36. package/.xtrm/skills/default/documenting/references/versioning-rules.md +78 -0
  37. package/.xtrm/skills/default/documenting/scripts/bump_version.sh +60 -0
  38. package/.xtrm/skills/default/documenting/scripts/changelog/__init__.py +0 -0
  39. package/.xtrm/skills/default/documenting/scripts/changelog/add_entry.py +216 -0
  40. package/.xtrm/skills/default/documenting/scripts/changelog/bump_release.py +117 -0
  41. package/.xtrm/skills/default/documenting/scripts/changelog/init_changelog.py +54 -0
  42. package/.xtrm/skills/default/documenting/scripts/changelog/validate_changelog.py +128 -0
  43. package/.xtrm/skills/default/documenting/scripts/drift_detector.py +266 -0
  44. package/.xtrm/skills/default/documenting/scripts/generate_template.py +311 -0
  45. package/.xtrm/skills/default/documenting/scripts/list_by_category.sh +84 -0
  46. package/.xtrm/skills/default/documenting/scripts/orchestrator.py +255 -0
  47. package/.xtrm/skills/default/documenting/scripts/validate_metadata.py +242 -0
  48. package/.xtrm/skills/default/documenting/templates/CHANGELOG.md.template +13 -0
  49. package/.xtrm/skills/default/find-docs/SKILL.md +175 -0
  50. package/.xtrm/skills/default/find-skills/SKILL.md +133 -0
  51. package/.xtrm/skills/default/github-search/SKILL.md +49 -0
  52. package/.xtrm/skills/default/gitnexus-debugging/SKILL.md +89 -0
  53. package/.xtrm/skills/default/gitnexus-impact-analysis/SKILL.md +97 -0
  54. package/.xtrm/skills/default/gitnexus-pr-review/SKILL.md +163 -0
  55. package/.xtrm/skills/default/gitnexus-refactoring/SKILL.md +121 -0
  56. package/.xtrm/skills/default/hook-development/SKILL.md +797 -0
  57. package/.xtrm/skills/default/hook-development/examples/load-context.sh +55 -0
  58. package/.xtrm/skills/default/hook-development/examples/quality-check.js +1168 -0
  59. package/.xtrm/skills/default/hook-development/examples/validate-bash.sh +43 -0
  60. package/.xtrm/skills/default/hook-development/examples/validate-write.sh +38 -0
  61. package/.xtrm/skills/default/hook-development/references/advanced.md +527 -0
  62. package/.xtrm/skills/default/hook-development/references/migration.md +369 -0
  63. package/.xtrm/skills/default/hook-development/references/patterns.md +412 -0
  64. package/.xtrm/skills/default/hook-development/scripts/README.md +164 -0
  65. package/.xtrm/skills/default/hook-development/scripts/hook-linter.sh +153 -0
  66. package/.xtrm/skills/default/hook-development/scripts/test-hook.sh +252 -0
  67. package/.xtrm/skills/default/hook-development/scripts/validate-hook-schema.sh +159 -0
  68. package/.xtrm/skills/default/init-session/SKILL.md +69 -0
  69. package/.xtrm/skills/default/last30days/SKILL.md +881 -0
  70. package/.xtrm/skills/default/last30days/scripts/briefing.py +260 -0
  71. package/.xtrm/skills/default/last30days/scripts/evaluate-synthesis.py +120 -0
  72. package/.xtrm/skills/default/last30days/scripts/evaluate_search_quality.py +641 -0
  73. package/.xtrm/skills/default/last30days/scripts/generate-synthesis-inputs.py +53 -0
  74. package/.xtrm/skills/default/last30days/scripts/last30days.py +2137 -0
  75. package/.xtrm/skills/default/last30days/scripts/lib/__init__.py +1 -0
  76. package/.xtrm/skills/default/last30days/scripts/lib/bird_x.py +458 -0
  77. package/.xtrm/skills/default/last30days/scripts/lib/bluesky.py +225 -0
  78. package/.xtrm/skills/default/last30days/scripts/lib/brave_search.py +329 -0
  79. package/.xtrm/skills/default/last30days/scripts/lib/cache.py +165 -0
  80. package/.xtrm/skills/default/last30days/scripts/lib/chrome_cookies.py +265 -0
  81. package/.xtrm/skills/default/last30days/scripts/lib/cookie_extract.py +295 -0
  82. package/.xtrm/skills/default/last30days/scripts/lib/dates.py +124 -0
  83. package/.xtrm/skills/default/last30days/scripts/lib/dedupe.py +290 -0
  84. package/.xtrm/skills/default/last30days/scripts/lib/entity_extract.py +127 -0
  85. package/.xtrm/skills/default/last30days/scripts/lib/env.py +807 -0
  86. package/.xtrm/skills/default/last30days/scripts/lib/exa_search.py +176 -0
  87. package/.xtrm/skills/default/last30days/scripts/lib/hackernews.py +266 -0
  88. package/.xtrm/skills/default/last30days/scripts/lib/http.py +174 -0
  89. package/.xtrm/skills/default/last30days/scripts/lib/instagram.py +365 -0
  90. package/.xtrm/skills/default/last30days/scripts/lib/models.py +221 -0
  91. package/.xtrm/skills/default/last30days/scripts/lib/normalize.py +489 -0
  92. package/.xtrm/skills/default/last30days/scripts/lib/openai_reddit.py +631 -0
  93. package/.xtrm/skills/default/last30days/scripts/lib/openrouter_search.py +216 -0
  94. package/.xtrm/skills/default/last30days/scripts/lib/parallel_search.py +139 -0
  95. package/.xtrm/skills/default/last30days/scripts/lib/polymarket.py +580 -0
  96. package/.xtrm/skills/default/last30days/scripts/lib/quality_nudge.py +201 -0
  97. package/.xtrm/skills/default/last30days/scripts/lib/query.py +117 -0
  98. package/.xtrm/skills/default/last30days/scripts/lib/query_type.py +111 -0
  99. package/.xtrm/skills/default/last30days/scripts/lib/reddit.py +617 -0
  100. package/.xtrm/skills/default/last30days/scripts/lib/reddit_enrich.py +325 -0
  101. package/.xtrm/skills/default/last30days/scripts/lib/reddit_public.py +259 -0
  102. package/.xtrm/skills/default/last30days/scripts/lib/relevance.py +148 -0
  103. package/.xtrm/skills/default/last30days/scripts/lib/render.py +1018 -0
  104. package/.xtrm/skills/default/last30days/scripts/lib/safari_cookies.py +182 -0
  105. package/.xtrm/skills/default/last30days/scripts/lib/schema.py +843 -0
  106. package/.xtrm/skills/default/last30days/scripts/lib/score.py +775 -0
  107. package/.xtrm/skills/default/last30days/scripts/lib/scrapecreators_x.py +182 -0
  108. package/.xtrm/skills/default/last30days/scripts/lib/setup_wizard.py +186 -0
  109. package/.xtrm/skills/default/last30days/scripts/lib/tiktok.py +349 -0
  110. package/.xtrm/skills/default/last30days/scripts/lib/truthsocial.py +183 -0
  111. package/.xtrm/skills/default/last30days/scripts/lib/ui.py +620 -0
  112. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/LICENSE +21 -0
  113. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/bird-search.mjs +134 -0
  114. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/cookies.js +191 -0
  115. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/features.json +17 -0
  116. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/paginate-cursor.js +37 -0
  117. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/query-ids.json +20 -0
  118. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/runtime-features.js +151 -0
  119. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/runtime-query-ids.js +264 -0
  120. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-base.js +129 -0
  121. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-constants.js +50 -0
  122. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-features.js +347 -0
  123. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-search.js +157 -0
  124. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-types.js +2 -0
  125. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/lib/twitter-client-utils.js +511 -0
  126. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/LICENSE +22 -0
  127. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/README.md +29 -0
  128. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/index.d.ts +3 -0
  129. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/index.d.ts.map +1 -0
  130. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/index.js +2 -0
  131. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/index.js.map +1 -0
  132. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chrome.d.ts +8 -0
  133. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chrome.d.ts.map +1 -0
  134. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chrome.js +27 -0
  135. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chrome.js.map +1 -0
  136. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/crypto.d.ts +11 -0
  137. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/crypto.d.ts.map +1 -0
  138. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/crypto.js +100 -0
  139. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/crypto.js.map +1 -0
  140. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/linuxKeyring.d.ts +25 -0
  141. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/linuxKeyring.d.ts.map +1 -0
  142. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/linuxKeyring.js +104 -0
  143. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/linuxKeyring.js.map +1 -0
  144. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/shared.d.ts +10 -0
  145. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/shared.d.ts.map +1 -0
  146. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/shared.js +293 -0
  147. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/shared.js.map +1 -0
  148. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/windowsDpapi.d.ts +10 -0
  149. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/windowsDpapi.d.ts.map +1 -0
  150. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/windowsDpapi.js +26 -0
  151. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqlite/windowsDpapi.js.map +1 -0
  152. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteLinux.d.ts +7 -0
  153. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteLinux.d.ts.map +1 -0
  154. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteLinux.js +51 -0
  155. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteLinux.js.map +1 -0
  156. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteMac.d.ts +7 -0
  157. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteMac.d.ts.map +1 -0
  158. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteMac.js +60 -0
  159. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteMac.js.map +1 -0
  160. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteWindows.d.ts +7 -0
  161. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteWindows.d.ts.map +1 -0
  162. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteWindows.js +38 -0
  163. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromeSqliteWindows.js.map +1 -0
  164. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/linuxPaths.d.ts +5 -0
  165. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/linuxPaths.d.ts.map +1 -0
  166. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/linuxPaths.js +33 -0
  167. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/linuxPaths.js.map +1 -0
  168. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/macosKeychain.d.ts +24 -0
  169. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/macosKeychain.d.ts.map +1 -0
  170. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/macosKeychain.js +30 -0
  171. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/macosKeychain.js.map +1 -0
  172. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/paths.d.ts +11 -0
  173. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/paths.d.ts.map +1 -0
  174. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/paths.js +43 -0
  175. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/paths.js.map +1 -0
  176. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsMasterKey.d.ts +8 -0
  177. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsMasterKey.d.ts.map +1 -0
  178. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsMasterKey.js +41 -0
  179. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsMasterKey.js.map +1 -0
  180. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsPaths.d.ts +8 -0
  181. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsPaths.d.ts.map +1 -0
  182. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsPaths.js +53 -0
  183. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/chromium/windowsPaths.js.map +1 -0
  184. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edge.d.ts +8 -0
  185. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edge.d.ts.map +1 -0
  186. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edge.js +27 -0
  187. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edge.js.map +1 -0
  188. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteLinux.d.ts +7 -0
  189. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteLinux.d.ts.map +1 -0
  190. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteLinux.js +53 -0
  191. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteLinux.js.map +1 -0
  192. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteMac.d.ts +8 -0
  193. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteMac.d.ts.map +1 -0
  194. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteMac.js +60 -0
  195. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteMac.js.map +1 -0
  196. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteWindows.d.ts +7 -0
  197. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteWindows.d.ts.map +1 -0
  198. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteWindows.js +38 -0
  199. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/edgeSqliteWindows.js.map +1 -0
  200. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/firefoxSqlite.d.ts +6 -0
  201. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/firefoxSqlite.d.ts.map +1 -0
  202. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/firefoxSqlite.js +257 -0
  203. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/firefoxSqlite.js.map +1 -0
  204. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/inline.d.ts +8 -0
  205. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/inline.d.ts.map +1 -0
  206. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/inline.js +71 -0
  207. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/inline.js.map +1 -0
  208. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/safariBinaryCookies.d.ts +6 -0
  209. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/safariBinaryCookies.d.ts.map +1 -0
  210. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/safariBinaryCookies.js +173 -0
  211. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/providers/safariBinaryCookies.js.map +1 -0
  212. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/public.d.ts +26 -0
  213. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/public.d.ts.map +1 -0
  214. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/public.js +195 -0
  215. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/public.js.map +1 -0
  216. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/types.d.ts +121 -0
  217. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/types.d.ts.map +1 -0
  218. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/types.js +2 -0
  219. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/types.js.map +1 -0
  220. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/base64.d.ts +2 -0
  221. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/base64.d.ts.map +1 -0
  222. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/base64.js +18 -0
  223. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/base64.js.map +1 -0
  224. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/exec.d.ts +8 -0
  225. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/exec.d.ts.map +1 -0
  226. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/exec.js +110 -0
  227. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/exec.js.map +1 -0
  228. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/expire.d.ts +2 -0
  229. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/expire.d.ts.map +1 -0
  230. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/expire.js +32 -0
  231. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/expire.js.map +1 -0
  232. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/fs.d.ts +2 -0
  233. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/fs.d.ts.map +1 -0
  234. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/fs.js +13 -0
  235. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/fs.js.map +1 -0
  236. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/hostMatch.d.ts +2 -0
  237. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/hostMatch.d.ts.map +1 -0
  238. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/hostMatch.js +7 -0
  239. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/hostMatch.js.map +1 -0
  240. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/nodeSqlite.d.ts +5 -0
  241. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/nodeSqlite.d.ts.map +1 -0
  242. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/nodeSqlite.js +58 -0
  243. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/nodeSqlite.js.map +1 -0
  244. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/origins.d.ts +2 -0
  245. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/origins.d.ts.map +1 -0
  246. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/origins.js +27 -0
  247. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/origins.js.map +1 -0
  248. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/runtime.d.ts +2 -0
  249. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/runtime.d.ts.map +1 -0
  250. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/runtime.js +8 -0
  251. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/dist/util/runtime.js.map +1 -0
  252. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/node_modules/@steipete/sweet-cookie/package.json +40 -0
  253. package/.xtrm/skills/default/last30days/scripts/lib/vendor/bird-search/package.json +13 -0
  254. package/.xtrm/skills/default/last30days/scripts/lib/websearch.py +401 -0
  255. package/.xtrm/skills/default/last30days/scripts/lib/xai_x.py +217 -0
  256. package/.xtrm/skills/default/last30days/scripts/lib/xiaohongshu_api.py +162 -0
  257. package/.xtrm/skills/default/last30days/scripts/lib/youtube_yt.py +538 -0
  258. package/.xtrm/skills/default/last30days/scripts/store.py +654 -0
  259. package/.xtrm/skills/default/last30days/scripts/sync.sh +50 -0
  260. package/.xtrm/skills/default/last30days/scripts/test-v1-vs-v2.sh +219 -0
  261. package/.xtrm/skills/default/last30days/scripts/watchlist.py +329 -0
  262. package/.xtrm/skills/default/planning/SKILL.md +405 -0
  263. package/.xtrm/skills/default/planning/evals/evals.json +19 -0
  264. package/.xtrm/skills/default/prompt-improving/README.md +162 -0
  265. package/.xtrm/skills/default/prompt-improving/SKILL.md +74 -0
  266. package/.xtrm/skills/default/prompt-improving/references/analysis_commands.md +24 -0
  267. package/.xtrm/skills/default/prompt-improving/references/chain_of_thought.md +24 -0
  268. package/.xtrm/skills/default/prompt-improving/references/mcp_definitions.md +20 -0
  269. package/.xtrm/skills/default/prompt-improving/references/multishot.md +23 -0
  270. package/.xtrm/skills/default/prompt-improving/references/xml_core.md +60 -0
  271. package/.xtrm/skills/default/quality-gates/.claude/hooks/hook-config.json +66 -0
  272. package/.xtrm/skills/default/quality-gates/.claude/hooks/quality-check.cjs +1286 -0
  273. package/.xtrm/skills/default/quality-gates/.claude/hooks/quality-check.py +334 -0
  274. package/.xtrm/skills/default/quality-gates/.claude/settings.json +3 -0
  275. package/.xtrm/skills/default/quality-gates/.claude/skills/using-quality-gates/SKILL.md +254 -0
  276. package/.xtrm/skills/default/quality-gates/README.md +109 -0
  277. package/.xtrm/skills/default/quality-gates/evals/evals.json +181 -0
  278. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/FINAL-EVAL-SUMMARY.md +75 -0
  279. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/edge-case-auto-fix-verification/with_skill/outputs/response.md +59 -0
  280. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/edge-case-mixed-language-project/with_skill/outputs/response.md +60 -0
  281. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/eval-summary.md +105 -0
  282. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/partial-install-python-only/with_skill/outputs/response.md +93 -0
  283. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/python-refactor-request/with_skill/outputs/response.md +104 -0
  284. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/quality-gate-error-fix/with_skill/outputs/response.md +74 -0
  285. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/should-not-trigger-general-chat/with_skill/outputs/response.md +18 -0
  286. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/should-not-trigger-math-question/with_skill/outputs/response.md +18 -0
  287. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/should-not-trigger-unrelated-coding/with_skill/outputs/response.md +56 -0
  288. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/tdd-guard-blocking-confusion/with_skill/outputs/response.md +67 -0
  289. package/.xtrm/skills/default/quality-gates/workspace/iteration-1/typescript-feature-with-tests/with_skill/outputs/response.md +97 -0
  290. package/.xtrm/skills/default/scoping-service-skills/SKILL.md +231 -0
  291. package/.xtrm/skills/default/scoping-service-skills/scripts/scope.py +74 -0
  292. package/.xtrm/skills/default/service-skills-set/README.md +93 -0
  293. package/.xtrm/skills/default/service-skills-set/git-hooks/doc_reminder.py +67 -0
  294. package/.xtrm/skills/default/service-skills-set/git-hooks/skill_staleness.py +194 -0
  295. package/.xtrm/skills/default/service-skills-set/install-service-skills.py +193 -0
  296. package/.xtrm/skills/default/service-skills-set/service-registry.json +4 -0
  297. package/.xtrm/skills/default/service-skills-set/service-skills-readme.md +236 -0
  298. package/.xtrm/skills/default/service-skills-set/settings.json +37 -0
  299. package/.xtrm/skills/default/session-close-report/SKILL.md +131 -0
  300. package/.xtrm/skills/default/skill-creator/LICENSE.txt +202 -0
  301. package/.xtrm/skills/default/skill-creator/SKILL.md +479 -0
  302. package/.xtrm/skills/default/skill-creator/agents/analyzer.md +274 -0
  303. package/.xtrm/skills/default/skill-creator/agents/comparator.md +202 -0
  304. package/.xtrm/skills/default/skill-creator/agents/grader.md +223 -0
  305. package/.xtrm/skills/default/skill-creator/assets/eval_review.html +146 -0
  306. package/.xtrm/skills/default/skill-creator/eval-viewer/generate_review.py +471 -0
  307. package/.xtrm/skills/default/skill-creator/eval-viewer/viewer.html +1325 -0
  308. package/.xtrm/skills/default/skill-creator/references/schemas.md +430 -0
  309. package/.xtrm/skills/default/skill-creator/scripts/__init__.py +0 -0
  310. package/.xtrm/skills/default/skill-creator/scripts/aggregate_benchmark.py +401 -0
  311. package/.xtrm/skills/default/skill-creator/scripts/generate_report.py +326 -0
  312. package/.xtrm/skills/default/skill-creator/scripts/improve_description.py +248 -0
  313. package/.xtrm/skills/default/skill-creator/scripts/package_skill.py +136 -0
  314. package/.xtrm/skills/default/skill-creator/scripts/quick_validate.py +103 -0
  315. package/.xtrm/skills/default/skill-creator/scripts/run_eval.py +310 -0
  316. package/.xtrm/skills/default/skill-creator/scripts/run_loop.py +332 -0
  317. package/.xtrm/skills/default/skill-creator/scripts/utils.py +47 -0
  318. package/.xtrm/skills/default/specialists-creator/SKILL.md +705 -0
  319. package/.xtrm/skills/default/specialists-creator/scripts/validate-specialist.ts +41 -0
  320. package/.xtrm/skills/default/sync-docs/SKILL.md +262 -0
  321. package/.xtrm/skills/default/sync-docs/evals/evals.json +89 -0
  322. package/.xtrm/skills/default/sync-docs/references/doc-structure.md +99 -0
  323. package/.xtrm/skills/default/sync-docs/references/schema.md +103 -0
  324. package/.xtrm/skills/default/sync-docs/scripts/changelog/add_entry.py +216 -0
  325. package/.xtrm/skills/default/sync-docs/scripts/context_gatherer.py +405 -0
  326. package/.xtrm/skills/default/sync-docs/scripts/doc_structure_analyzer.py +495 -0
  327. package/.xtrm/skills/default/sync-docs/scripts/drift_detector.py +563 -0
  328. package/.xtrm/skills/default/sync-docs/scripts/validate_doc.py +365 -0
  329. package/.xtrm/skills/default/sync-docs/scripts/validate_metadata.py +185 -0
  330. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/benchmark.json +293 -0
  331. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/benchmark.md +13 -0
  332. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/eval_metadata.json +27 -0
  333. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/with_skill/outputs/result.md +210 -0
  334. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/with_skill/run-1/grading.json +28 -0
  335. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/with_skill/run-1/timing.json +1 -0
  336. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/without_skill/outputs/result.md +101 -0
  337. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/without_skill/run-1/grading.json +28 -0
  338. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/without_skill/run-1/timing.json +5 -0
  339. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-doc-audit/without_skill/timing.json +5 -0
  340. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/eval_metadata.json +27 -0
  341. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/with_skill/outputs/result.md +198 -0
  342. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/with_skill/run-1/grading.json +28 -0
  343. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/with_skill/run-1/timing.json +1 -0
  344. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/without_skill/outputs/result.md +94 -0
  345. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/without_skill/run-1/grading.json +28 -0
  346. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-fix-mode/without_skill/run-1/timing.json +1 -0
  347. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/eval_metadata.json +27 -0
  348. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/with_skill/outputs/result.md +237 -0
  349. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/with_skill/run-1/grading.json +28 -0
  350. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/with_skill/run-1/timing.json +1 -0
  351. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/without_skill/outputs/result.md +134 -0
  352. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/without_skill/run-1/grading.json +28 -0
  353. package/.xtrm/skills/default/sync-docs-workspace/iteration-1/eval-sprint-closeout/without_skill/run-1/timing.json +1 -0
  354. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/benchmark.json +297 -0
  355. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/benchmark.md +13 -0
  356. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/eval_metadata.json +27 -0
  357. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/with_skill/outputs/result.md +137 -0
  358. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/with_skill/run-1/grading.json +92 -0
  359. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/with_skill/run-1/timing.json +1 -0
  360. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/without_skill/outputs/result.md +134 -0
  361. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/without_skill/run-1/grading.json +86 -0
  362. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-doc-audit/without_skill/run-1/timing.json +1 -0
  363. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/eval_metadata.json +27 -0
  364. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/with_skill/outputs/result.md +193 -0
  365. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/with_skill/run-1/grading.json +72 -0
  366. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/with_skill/run-1/timing.json +1 -0
  367. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/without_skill/outputs/result.md +211 -0
  368. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/without_skill/run-1/grading.json +91 -0
  369. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-fix-mode/without_skill/run-1/timing.json +5 -0
  370. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/eval_metadata.json +27 -0
  371. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/with_skill/outputs/result.md +182 -0
  372. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/with_skill/run-1/grading.json +95 -0
  373. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/with_skill/run-1/timing.json +1 -0
  374. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/without_skill/outputs/result.md +222 -0
  375. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/without_skill/run-1/grading.json +88 -0
  376. package/.xtrm/skills/default/sync-docs-workspace/iteration-2/eval-sprint-closeout/without_skill/run-1/timing.json +5 -0
  377. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/benchmark.json +298 -0
  378. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/benchmark.md +13 -0
  379. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/eval_metadata.json +27 -0
  380. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/with_skill/outputs/result.md +125 -0
  381. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/with_skill/run-1/grading.json +97 -0
  382. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/with_skill/run-1/timing.json +5 -0
  383. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/without_skill/outputs/result.md +144 -0
  384. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/without_skill/run-1/grading.json +78 -0
  385. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-doc-audit/without_skill/run-1/timing.json +5 -0
  386. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/eval_metadata.json +27 -0
  387. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/with_skill/outputs/result.md +104 -0
  388. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/with_skill/run-1/grading.json +91 -0
  389. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/with_skill/run-1/timing.json +5 -0
  390. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/without_skill/outputs/result.md +79 -0
  391. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/without_skill/run-1/grading.json +82 -0
  392. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-fix-mode/without_skill/run-1/timing.json +5 -0
  393. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/eval_metadata.json +27 -0
  394. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/phase1_context.json +302 -0
  395. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/phase2_drift.txt +33 -0
  396. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/phase3_analysis.json +114 -0
  397. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/phase4_fix.txt +118 -0
  398. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/phase5_validate.txt +38 -0
  399. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/outputs/result.md +158 -0
  400. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/run-1/grading.json +95 -0
  401. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/with_skill/run-1/timing.json +5 -0
  402. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/without_skill/outputs/result.md +71 -0
  403. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/without_skill/run-1/grading.json +90 -0
  404. package/.xtrm/skills/default/sync-docs-workspace/iteration-3/eval-sprint-closeout/without_skill/run-1/timing.json +5 -0
  405. package/.xtrm/skills/default/test-planning/SKILL.md +465 -0
  406. package/.xtrm/skills/default/test-planning/evals/evals.json +23 -0
  407. package/.xtrm/skills/default/updating-service-skills/SKILL.md +136 -0
  408. package/.xtrm/skills/default/updating-service-skills/scripts/drift_detector.py +222 -0
  409. package/.xtrm/skills/default/using-nodes/SKILL.md +333 -0
  410. package/.xtrm/skills/default/using-quality-gates/SKILL.md +254 -0
  411. package/.xtrm/skills/default/using-service-skills/SKILL.md +108 -0
  412. package/.xtrm/skills/default/using-service-skills/scripts/cataloger.py +74 -0
  413. package/.xtrm/skills/default/using-service-skills/scripts/skill_activator.py +152 -0
  414. package/.xtrm/skills/default/using-specialists/SKILL.md +848 -0
  415. package/.xtrm/skills/default/using-specialists/evals/evals.json +68 -0
  416. package/.xtrm/skills/default/using-tdd/SKILL.md +410 -0
  417. package/.xtrm/skills/default/using-xtrm/SKILL.md +127 -0
  418. package/.xtrm/skills/default/xt-debugging/SKILL.md +149 -0
  419. package/.xtrm/skills/default/xt-end/SKILL.md +297 -0
  420. package/.xtrm/skills/default/xt-merge/SKILL.md +326 -0
  421. package/.xtrm/skills/optional/README.txt +2 -0
  422. package/.xtrm/skills/optional/architecture-design/PACK.json +11 -0
  423. package/.xtrm/skills/optional/architecture-design/architecture-patterns/SKILL.md +494 -0
  424. package/.xtrm/skills/optional/architecture-design/architecture-patterns/references/advanced-patterns.md +391 -0
  425. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/SKILL.md +473 -0
  426. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/assets/few-shot-examples.json +106 -0
  427. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/assets/prompt-template-library.md +264 -0
  428. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/references/chain-of-thought.md +412 -0
  429. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/references/few-shot-learning.md +386 -0
  430. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/references/prompt-optimization.md +428 -0
  431. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/references/prompt-templates.md +484 -0
  432. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/references/system-prompts.md +195 -0
  433. package/.xtrm/skills/optional/architecture-design/prompt-engineering-patterns/scripts/optimize-prompt.py +279 -0
  434. package/.xtrm/skills/optional/architecture-design/subagent-driven-development/SKILL.md +277 -0
  435. package/.xtrm/skills/optional/architecture-design/subagent-driven-development/code-quality-reviewer-prompt.md +26 -0
  436. package/.xtrm/skills/optional/architecture-design/subagent-driven-development/implementer-prompt.md +113 -0
  437. package/.xtrm/skills/optional/architecture-design/subagent-driven-development/spec-reviewer-prompt.md +61 -0
  438. package/.xtrm/skills/optional/code-quality/PACK.json +12 -0
  439. package/.xtrm/skills/optional/code-quality/code-review-excellence/SKILL.md +529 -0
  440. package/.xtrm/skills/optional/code-quality/multi-reviewer-patterns/SKILL.md +127 -0
  441. package/.xtrm/skills/optional/code-quality/systematic-debugging/SKILL.md +296 -0
  442. package/.xtrm/skills/optional/code-quality/verification-before-completion/SKILL.md +139 -0
  443. package/.xtrm/skills/optional/data-engineering/PACK.json +9 -0
  444. package/.xtrm/skills/optional/data-engineering/data-analyst/SKILL.md +57 -0
  445. package/.xtrm/skills/optional/research-methods/PACK.json +12 -0
  446. package/.xtrm/skills/optional/research-methods/academic-researcher/SKILL.md +269 -0
  447. package/.xtrm/skills/optional/research-methods/brainstorming/SKILL.md +164 -0
  448. package/.xtrm/skills/optional/research-methods/brainstorming/scripts/frame-template.html +214 -0
  449. package/.xtrm/skills/optional/research-methods/brainstorming/scripts/helper.js +88 -0
  450. package/.xtrm/skills/optional/research-methods/brainstorming/scripts/server.cjs +354 -0
  451. package/.xtrm/skills/optional/research-methods/brainstorming/scripts/start-server.sh +148 -0
  452. package/.xtrm/skills/optional/research-methods/brainstorming/scripts/stop-server.sh +56 -0
  453. package/.xtrm/skills/optional/research-methods/brainstorming/spec-document-reviewer-prompt.md +49 -0
  454. package/.xtrm/skills/optional/research-methods/brainstorming/visual-companion.md +287 -0
  455. package/.xtrm/skills/optional/research-methods/deep-research/SKILL.md +192 -0
  456. package/.xtrm/skills/optional/research-methods/fact-checker/SKILL.md +182 -0
  457. package/.xtrm/skills/optional/security-ops/PACK.json +9 -0
  458. package/.xtrm/skills/optional/security-ops/security-auditor/SKILL.md +165 -0
  459. package/.xtrm/skills/optional/xt-optional/PACK.json +16 -0
  460. package/.xtrm/skills/optional/xt-optional/docker-expert/SKILL.md +409 -0
  461. package/.xtrm/skills/optional/xt-optional/obsidian-cli/SKILL.md +106 -0
  462. package/.xtrm/skills/optional/xt-optional/python-testing/SKILL.md +815 -0
  463. package/.xtrm/skills/optional/xt-optional/senior-backend/SKILL.md +209 -0
  464. package/.xtrm/skills/optional/xt-optional/senior-backend/references/api_design_patterns.md +103 -0
  465. package/.xtrm/skills/optional/xt-optional/senior-backend/references/backend_security_practices.md +103 -0
  466. package/.xtrm/skills/optional/xt-optional/senior-backend/references/database_optimization_guide.md +103 -0
  467. package/.xtrm/skills/optional/xt-optional/senior-backend/scripts/api_load_tester.py +114 -0
  468. package/.xtrm/skills/optional/xt-optional/senior-backend/scripts/api_scaffolder.py +114 -0
  469. package/.xtrm/skills/optional/xt-optional/senior-backend/scripts/database_migration_tool.py +114 -0
  470. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/SKILL.md +226 -0
  471. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/references/experiment_design_frameworks.md +80 -0
  472. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/references/feature_engineering_patterns.md +80 -0
  473. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/references/statistical_methods_advanced.md +80 -0
  474. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/scripts/experiment_designer.py +100 -0
  475. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/scripts/feature_engineering_pipeline.py +100 -0
  476. package/.xtrm/skills/optional/xt-optional/senior-data-scientist/scripts/model_evaluation_suite.py +100 -0
  477. package/.xtrm/skills/optional/xt-optional/senior-devops/SKILL.md +209 -0
  478. package/.xtrm/skills/optional/xt-optional/senior-devops/references/cicd_pipeline_guide.md +103 -0
  479. package/.xtrm/skills/optional/xt-optional/senior-devops/references/deployment_strategies.md +103 -0
  480. package/.xtrm/skills/optional/xt-optional/senior-devops/references/infrastructure_as_code.md +103 -0
  481. package/.xtrm/skills/optional/xt-optional/senior-devops/scripts/deployment_manager.py +114 -0
  482. package/.xtrm/skills/optional/xt-optional/senior-devops/scripts/pipeline_generator.py +114 -0
  483. package/.xtrm/skills/optional/xt-optional/senior-devops/scripts/terraform_scaffolder.py +114 -0
  484. package/.xtrm/skills/optional/xt-optional/senior-security/SKILL.md +209 -0
  485. package/.xtrm/skills/optional/xt-optional/senior-security/references/cryptography_implementation.md +103 -0
  486. package/.xtrm/skills/optional/xt-optional/senior-security/references/penetration_testing_guide.md +103 -0
  487. package/.xtrm/skills/optional/xt-optional/senior-security/references/security_architecture_patterns.md +103 -0
  488. package/.xtrm/skills/optional/xt-optional/senior-security/scripts/pentest_automator.py +114 -0
  489. package/.xtrm/skills/optional/xt-optional/senior-security/scripts/security_auditor.py +114 -0
  490. package/.xtrm/skills/optional/xt-optional/senior-security/scripts/threat_modeler.py +114 -0
  491. package/CHANGELOG.md +16 -0
  492. package/README.md +5 -0
  493. package/cli/dist/index.cjs +862 -614
  494. package/cli/dist/index.cjs.map +1 -1
  495. package/cli/package.json +1 -1
  496. package/package.json +4 -1
  497. package/.xtrm/extensions/xtrm-ui/format.ts +0 -93
  498. package/.xtrm/extensions/xtrm-ui/package.json +0 -10
  499. /package/.xtrm/{extensions → ext-src}/auto-session-name/index.ts +0 -0
  500. /package/.xtrm/{extensions → ext-src}/auto-session-name/package.json +0 -0
  501. /package/.xtrm/{extensions → ext-src}/auto-update/index.ts +0 -0
  502. /package/.xtrm/{extensions → ext-src}/auto-update/package.json +0 -0
  503. /package/.xtrm/{extensions → ext-src}/beads/index.ts +0 -0
  504. /package/.xtrm/{extensions → ext-src}/beads/package.json +0 -0
  505. /package/.xtrm/{extensions → ext-src}/compact-header/index.ts +0 -0
  506. /package/.xtrm/{extensions → ext-src}/compact-header/package.json +0 -0
  507. /package/.xtrm/{extensions → ext-src}/core/adapter.ts +0 -0
  508. /package/.xtrm/{extensions → ext-src}/core/guard-rules.ts +0 -0
  509. /package/.xtrm/{extensions → ext-src}/core/lib.ts +0 -0
  510. /package/.xtrm/{extensions → ext-src}/core/logger.ts +0 -0
  511. /package/.xtrm/{extensions → ext-src}/core/package.json +0 -0
  512. /package/.xtrm/{extensions → ext-src}/core/runner.ts +0 -0
  513. /package/.xtrm/{extensions → ext-src}/core/session-state.ts +0 -0
  514. /package/.xtrm/{extensions → ext-src}/custom-footer/index.ts +0 -0
  515. /package/.xtrm/{extensions → ext-src}/custom-footer/package.json +0 -0
  516. /package/.xtrm/{extensions → ext-src}/custom-provider-qwen-cli/index.ts +0 -0
  517. /package/.xtrm/{extensions → ext-src}/custom-provider-qwen-cli/package.json +0 -0
  518. /package/.xtrm/{extensions → ext-src}/git-checkpoint/index.ts +0 -0
  519. /package/.xtrm/{extensions → ext-src}/git-checkpoint/package.json +0 -0
  520. /package/.xtrm/{extensions → ext-src}/lsp-bootstrap/index.ts +0 -0
  521. /package/.xtrm/{extensions → ext-src}/lsp-bootstrap/package.json +0 -0
  522. /package/.xtrm/{extensions → ext-src}/pi-serena-compact/index.ts +0 -0
  523. /package/.xtrm/{extensions → ext-src}/pi-serena-compact/package.json +0 -0
  524. /package/.xtrm/{extensions → ext-src}/quality-gates/index.ts +0 -0
  525. /package/.xtrm/{extensions → ext-src}/quality-gates/package.json +0 -0
  526. /package/.xtrm/{extensions → ext-src}/service-skills/index.ts +0 -0
  527. /package/.xtrm/{extensions → ext-src}/service-skills/package.json +0 -0
  528. /package/.xtrm/{extensions → ext-src}/session-flow/index.ts +0 -0
  529. /package/.xtrm/{extensions → ext-src}/session-flow/package.json +0 -0
  530. /package/.xtrm/{extensions → ext-src}/xtrm-loader/index.ts +0 -0
  531. /package/.xtrm/{extensions → ext-src}/xtrm-loader/package.json +0 -0
  532. /package/.xtrm/{extensions → ext-src}/xtrm-ui/themes/pidex-dark.json +0 -0
  533. /package/.xtrm/{extensions → ext-src}/xtrm-ui/themes/pidex-light.json +0 -0
@@ -0,0 +1,279 @@
1
+ #!/usr/bin/env python3
2
+ """
3
+ Prompt Optimization Script
4
+
5
+ Automatically test and optimize prompts using A/B testing and metrics tracking.
6
+ """
7
+
8
+ import json
9
+ import time
10
+ from typing import List, Dict, Any
11
+ from dataclasses import dataclass
12
+ from concurrent.futures import ThreadPoolExecutor
13
+ import numpy as np
14
+
15
+
16
+ @dataclass
17
+ class TestCase:
18
+ input: Dict[str, Any]
19
+ expected_output: str
20
+ metadata: Dict[str, Any] = None
21
+
22
+
23
+ class PromptOptimizer:
24
+ def __init__(self, llm_client, test_suite: List[TestCase]):
25
+ self.client = llm_client
26
+ self.test_suite = test_suite
27
+ self.results_history = []
28
+ self.executor = ThreadPoolExecutor()
29
+
30
+ def shutdown(self):
31
+ """Shutdown the thread pool executor."""
32
+ self.executor.shutdown(wait=True)
33
+
34
+ def evaluate_prompt(self, prompt_template: str, test_cases: List[TestCase] = None) -> Dict[str, float]:
35
+ """Evaluate a prompt template against test cases in parallel."""
36
+ if test_cases is None:
37
+ test_cases = self.test_suite
38
+
39
+ metrics = {
40
+ 'accuracy': [],
41
+ 'latency': [],
42
+ 'token_count': [],
43
+ 'success_rate': []
44
+ }
45
+
46
+ def process_test_case(test_case):
47
+ start_time = time.time()
48
+
49
+ # Render prompt with test case inputs
50
+ prompt = prompt_template.format(**test_case.input)
51
+
52
+ # Get LLM response
53
+ response = self.client.complete(prompt)
54
+
55
+ # Measure latency
56
+ latency = time.time() - start_time
57
+
58
+ # Calculate individual metrics
59
+ token_count = len(prompt.split()) + len(response.split())
60
+ success = 1 if response else 0
61
+ accuracy = self.calculate_accuracy(response, test_case.expected_output)
62
+
63
+ return {
64
+ 'latency': latency,
65
+ 'token_count': token_count,
66
+ 'success_rate': success,
67
+ 'accuracy': accuracy
68
+ }
69
+
70
+ # Run test cases in parallel
71
+ results = list(self.executor.map(process_test_case, test_cases))
72
+
73
+ # Aggregate metrics
74
+ for result in results:
75
+ metrics['latency'].append(result['latency'])
76
+ metrics['token_count'].append(result['token_count'])
77
+ metrics['success_rate'].append(result['success_rate'])
78
+ metrics['accuracy'].append(result['accuracy'])
79
+
80
+ return {
81
+ 'avg_accuracy': np.mean(metrics['accuracy']),
82
+ 'avg_latency': np.mean(metrics['latency']),
83
+ 'p95_latency': np.percentile(metrics['latency'], 95),
84
+ 'avg_tokens': np.mean(metrics['token_count']),
85
+ 'success_rate': np.mean(metrics['success_rate'])
86
+ }
87
+
88
+ def calculate_accuracy(self, response: str, expected: str) -> float:
89
+ """Calculate accuracy score between response and expected output."""
90
+ # Simple exact match
91
+ if response.strip().lower() == expected.strip().lower():
92
+ return 1.0
93
+
94
+ # Partial match using word overlap
95
+ response_words = set(response.lower().split())
96
+ expected_words = set(expected.lower().split())
97
+
98
+ if not expected_words:
99
+ return 0.0
100
+
101
+ overlap = len(response_words & expected_words)
102
+ return overlap / len(expected_words)
103
+
104
+ def optimize(self, base_prompt: str, max_iterations: int = 5) -> Dict[str, Any]:
105
+ """Iteratively optimize a prompt."""
106
+ current_prompt = base_prompt
107
+ best_prompt = base_prompt
108
+ best_score = 0
109
+ current_metrics = None
110
+
111
+ for iteration in range(max_iterations):
112
+ print(f"\nIteration {iteration + 1}/{max_iterations}")
113
+
114
+ # Evaluate current prompt
115
+ # Bolt Optimization: Avoid re-evaluating if we already have metrics from previous iteration
116
+ if current_metrics:
117
+ metrics = current_metrics
118
+ else:
119
+ metrics = self.evaluate_prompt(current_prompt)
120
+
121
+ print(f"Accuracy: {metrics['avg_accuracy']:.2f}, Latency: {metrics['avg_latency']:.2f}s")
122
+
123
+ # Track results
124
+ self.results_history.append({
125
+ 'iteration': iteration,
126
+ 'prompt': current_prompt,
127
+ 'metrics': metrics
128
+ })
129
+
130
+ # Update best if improved
131
+ if metrics['avg_accuracy'] > best_score:
132
+ best_score = metrics['avg_accuracy']
133
+ best_prompt = current_prompt
134
+
135
+ # Stop if good enough
136
+ if metrics['avg_accuracy'] > 0.95:
137
+ print("Achieved target accuracy!")
138
+ break
139
+
140
+ # Generate variations for next iteration
141
+ variations = self.generate_variations(current_prompt, metrics)
142
+
143
+ # Test variations and pick best
144
+ best_variation = current_prompt
145
+ best_variation_score = metrics['avg_accuracy']
146
+ best_variation_metrics = metrics
147
+
148
+ for variation in variations:
149
+ var_metrics = self.evaluate_prompt(variation)
150
+ if var_metrics['avg_accuracy'] > best_variation_score:
151
+ best_variation_score = var_metrics['avg_accuracy']
152
+ best_variation = variation
153
+ best_variation_metrics = var_metrics
154
+
155
+ current_prompt = best_variation
156
+ current_metrics = best_variation_metrics
157
+
158
+ return {
159
+ 'best_prompt': best_prompt,
160
+ 'best_score': best_score,
161
+ 'history': self.results_history
162
+ }
163
+
164
+ def generate_variations(self, prompt: str, current_metrics: Dict) -> List[str]:
165
+ """Generate prompt variations to test."""
166
+ variations = []
167
+
168
+ # Variation 1: Add explicit format instruction
169
+ variations.append(prompt + "\n\nProvide your answer in a clear, concise format.")
170
+
171
+ # Variation 2: Add step-by-step instruction
172
+ variations.append("Let's solve this step by step.\n\n" + prompt)
173
+
174
+ # Variation 3: Add verification step
175
+ variations.append(prompt + "\n\nVerify your answer before responding.")
176
+
177
+ # Variation 4: Make more concise
178
+ concise = self.make_concise(prompt)
179
+ if concise != prompt:
180
+ variations.append(concise)
181
+
182
+ # Variation 5: Add examples (if none present)
183
+ if "example" not in prompt.lower():
184
+ variations.append(self.add_examples(prompt))
185
+
186
+ return variations[:3] # Return top 3 variations
187
+
188
+ def make_concise(self, prompt: str) -> str:
189
+ """Remove redundant words to make prompt more concise."""
190
+ replacements = [
191
+ ("in order to", "to"),
192
+ ("due to the fact that", "because"),
193
+ ("at this point in time", "now"),
194
+ ("in the event that", "if"),
195
+ ]
196
+
197
+ result = prompt
198
+ for old, new in replacements:
199
+ result = result.replace(old, new)
200
+
201
+ return result
202
+
203
+ def add_examples(self, prompt: str) -> str:
204
+ """Add example section to prompt."""
205
+ return f"""{prompt}
206
+
207
+ Example:
208
+ Input: Sample input
209
+ Output: Sample output
210
+ """
211
+
212
+ def compare_prompts(self, prompt_a: str, prompt_b: str) -> Dict[str, Any]:
213
+ """A/B test two prompts."""
214
+ print("Testing Prompt A...")
215
+ metrics_a = self.evaluate_prompt(prompt_a)
216
+
217
+ print("Testing Prompt B...")
218
+ metrics_b = self.evaluate_prompt(prompt_b)
219
+
220
+ return {
221
+ 'prompt_a_metrics': metrics_a,
222
+ 'prompt_b_metrics': metrics_b,
223
+ 'winner': 'A' if metrics_a['avg_accuracy'] > metrics_b['avg_accuracy'] else 'B',
224
+ 'improvement': abs(metrics_a['avg_accuracy'] - metrics_b['avg_accuracy'])
225
+ }
226
+
227
+ def export_results(self, filename: str):
228
+ """Export optimization results to JSON."""
229
+ with open(filename, 'w') as f:
230
+ json.dump(self.results_history, f, indent=2)
231
+
232
+
233
+ def main():
234
+ # Example usage
235
+ test_suite = [
236
+ TestCase(
237
+ input={'text': 'This movie was amazing!'},
238
+ expected_output='Positive'
239
+ ),
240
+ TestCase(
241
+ input={'text': 'Worst purchase ever.'},
242
+ expected_output='Negative'
243
+ ),
244
+ TestCase(
245
+ input={'text': 'It was okay, nothing special.'},
246
+ expected_output='Neutral'
247
+ )
248
+ ]
249
+
250
+ # Mock LLM client for demonstration
251
+ class MockLLMClient:
252
+ def complete(self, prompt):
253
+ # Simulate LLM response
254
+ if 'amazing' in prompt:
255
+ return 'Positive'
256
+ elif 'worst' in prompt.lower():
257
+ return 'Negative'
258
+ else:
259
+ return 'Neutral'
260
+
261
+ optimizer = PromptOptimizer(MockLLMClient(), test_suite)
262
+
263
+ try:
264
+ base_prompt = "Classify the sentiment of: {text}\nSentiment:"
265
+
266
+ results = optimizer.optimize(base_prompt)
267
+
268
+ print("\n" + "="*50)
269
+ print("Optimization Complete!")
270
+ print(f"Best Accuracy: {results['best_score']:.2f}")
271
+ print(f"Best Prompt:\n{results['best_prompt']}")
272
+
273
+ optimizer.export_results('optimization_results.json')
274
+ finally:
275
+ optimizer.shutdown()
276
+
277
+
278
+ if __name__ == '__main__':
279
+ main()
@@ -0,0 +1,277 @@
1
+ ---
2
+ name: subagent-driven-development
3
+ description: Use when executing implementation plans with independent tasks in the current session
4
+ ---
5
+
6
+ # Subagent-Driven Development
7
+
8
+ Execute plan by dispatching fresh subagent per task, with two-stage review after each: spec compliance review first, then code quality review.
9
+
10
+ **Why subagents:** You delegate tasks to specialized agents with isolated context. By precisely crafting their instructions and context, you ensure they stay focused and succeed at their task. They should never inherit your session's context or history — you construct exactly what they need. This also preserves your own context for coordination work.
11
+
12
+ **Core principle:** Fresh subagent per task + two-stage review (spec then quality) = high quality, fast iteration
13
+
14
+ ## When to Use
15
+
16
+ ```dot
17
+ digraph when_to_use {
18
+ "Have implementation plan?" [shape=diamond];
19
+ "Tasks mostly independent?" [shape=diamond];
20
+ "Stay in this session?" [shape=diamond];
21
+ "subagent-driven-development" [shape=box];
22
+ "executing-plans" [shape=box];
23
+ "Manual execution or brainstorm first" [shape=box];
24
+
25
+ "Have implementation plan?" -> "Tasks mostly independent?" [label="yes"];
26
+ "Have implementation plan?" -> "Manual execution or brainstorm first" [label="no"];
27
+ "Tasks mostly independent?" -> "Stay in this session?" [label="yes"];
28
+ "Tasks mostly independent?" -> "Manual execution or brainstorm first" [label="no - tightly coupled"];
29
+ "Stay in this session?" -> "subagent-driven-development" [label="yes"];
30
+ "Stay in this session?" -> "executing-plans" [label="no - parallel session"];
31
+ }
32
+ ```
33
+
34
+ **vs. Executing Plans (parallel session):**
35
+ - Same session (no context switch)
36
+ - Fresh subagent per task (no context pollution)
37
+ - Two-stage review after each task: spec compliance first, then code quality
38
+ - Faster iteration (no human-in-loop between tasks)
39
+
40
+ ## The Process
41
+
42
+ ```dot
43
+ digraph process {
44
+ rankdir=TB;
45
+
46
+ subgraph cluster_per_task {
47
+ label="Per Task";
48
+ "Dispatch implementer subagent (./implementer-prompt.md)" [shape=box];
49
+ "Implementer subagent asks questions?" [shape=diamond];
50
+ "Answer questions, provide context" [shape=box];
51
+ "Implementer subagent implements, tests, commits, self-reviews" [shape=box];
52
+ "Dispatch spec reviewer subagent (./spec-reviewer-prompt.md)" [shape=box];
53
+ "Spec reviewer subagent confirms code matches spec?" [shape=diamond];
54
+ "Implementer subagent fixes spec gaps" [shape=box];
55
+ "Dispatch code quality reviewer subagent (./code-quality-reviewer-prompt.md)" [shape=box];
56
+ "Code quality reviewer subagent approves?" [shape=diamond];
57
+ "Implementer subagent fixes quality issues" [shape=box];
58
+ "Mark task complete in TodoWrite" [shape=box];
59
+ }
60
+
61
+ "Read plan, extract all tasks with full text, note context, create TodoWrite" [shape=box];
62
+ "More tasks remain?" [shape=diamond];
63
+ "Dispatch final code reviewer subagent for entire implementation" [shape=box];
64
+ "Use superpowers:finishing-a-development-branch" [shape=box style=filled fillcolor=lightgreen];
65
+
66
+ "Read plan, extract all tasks with full text, note context, create TodoWrite" -> "Dispatch implementer subagent (./implementer-prompt.md)";
67
+ "Dispatch implementer subagent (./implementer-prompt.md)" -> "Implementer subagent asks questions?";
68
+ "Implementer subagent asks questions?" -> "Answer questions, provide context" [label="yes"];
69
+ "Answer questions, provide context" -> "Dispatch implementer subagent (./implementer-prompt.md)";
70
+ "Implementer subagent asks questions?" -> "Implementer subagent implements, tests, commits, self-reviews" [label="no"];
71
+ "Implementer subagent implements, tests, commits, self-reviews" -> "Dispatch spec reviewer subagent (./spec-reviewer-prompt.md)";
72
+ "Dispatch spec reviewer subagent (./spec-reviewer-prompt.md)" -> "Spec reviewer subagent confirms code matches spec?";
73
+ "Spec reviewer subagent confirms code matches spec?" -> "Implementer subagent fixes spec gaps" [label="no"];
74
+ "Implementer subagent fixes spec gaps" -> "Dispatch spec reviewer subagent (./spec-reviewer-prompt.md)" [label="re-review"];
75
+ "Spec reviewer subagent confirms code matches spec?" -> "Dispatch code quality reviewer subagent (./code-quality-reviewer-prompt.md)" [label="yes"];
76
+ "Dispatch code quality reviewer subagent (./code-quality-reviewer-prompt.md)" -> "Code quality reviewer subagent approves?";
77
+ "Code quality reviewer subagent approves?" -> "Implementer subagent fixes quality issues" [label="no"];
78
+ "Implementer subagent fixes quality issues" -> "Dispatch code quality reviewer subagent (./code-quality-reviewer-prompt.md)" [label="re-review"];
79
+ "Code quality reviewer subagent approves?" -> "Mark task complete in TodoWrite" [label="yes"];
80
+ "Mark task complete in TodoWrite" -> "More tasks remain?";
81
+ "More tasks remain?" -> "Dispatch implementer subagent (./implementer-prompt.md)" [label="yes"];
82
+ "More tasks remain?" -> "Dispatch final code reviewer subagent for entire implementation" [label="no"];
83
+ "Dispatch final code reviewer subagent for entire implementation" -> "Use superpowers:finishing-a-development-branch";
84
+ }
85
+ ```
86
+
87
+ ## Model Selection
88
+
89
+ Use the least powerful model that can handle each role to conserve cost and increase speed.
90
+
91
+ **Mechanical implementation tasks** (isolated functions, clear specs, 1-2 files): use a fast, cheap model. Most implementation tasks are mechanical when the plan is well-specified.
92
+
93
+ **Integration and judgment tasks** (multi-file coordination, pattern matching, debugging): use a standard model.
94
+
95
+ **Architecture, design, and review tasks**: use the most capable available model.
96
+
97
+ **Task complexity signals:**
98
+ - Touches 1-2 files with a complete spec → cheap model
99
+ - Touches multiple files with integration concerns → standard model
100
+ - Requires design judgment or broad codebase understanding → most capable model
101
+
102
+ ## Handling Implementer Status
103
+
104
+ Implementer subagents report one of four statuses. Handle each appropriately:
105
+
106
+ **DONE:** Proceed to spec compliance review.
107
+
108
+ **DONE_WITH_CONCERNS:** The implementer completed the work but flagged doubts. Read the concerns before proceeding. If the concerns are about correctness or scope, address them before review. If they're observations (e.g., "this file is getting large"), note them and proceed to review.
109
+
110
+ **NEEDS_CONTEXT:** The implementer needs information that wasn't provided. Provide the missing context and re-dispatch.
111
+
112
+ **BLOCKED:** The implementer cannot complete the task. Assess the blocker:
113
+ 1. If it's a context problem, provide more context and re-dispatch with the same model
114
+ 2. If the task requires more reasoning, re-dispatch with a more capable model
115
+ 3. If the task is too large, break it into smaller pieces
116
+ 4. If the plan itself is wrong, escalate to the human
117
+
118
+ **Never** ignore an escalation or force the same model to retry without changes. If the implementer said it's stuck, something needs to change.
119
+
120
+ ## Prompt Templates
121
+
122
+ - `./implementer-prompt.md` - Dispatch implementer subagent
123
+ - `./spec-reviewer-prompt.md` - Dispatch spec compliance reviewer subagent
124
+ - `./code-quality-reviewer-prompt.md` - Dispatch code quality reviewer subagent
125
+
126
+ ## Example Workflow
127
+
128
+ ```
129
+ You: I'm using Subagent-Driven Development to execute this plan.
130
+
131
+ [Read plan file once: docs/superpowers/plans/feature-plan.md]
132
+ [Extract all 5 tasks with full text and context]
133
+ [Create TodoWrite with all tasks]
134
+
135
+ Task 1: Hook installation script
136
+
137
+ [Get Task 1 text and context (already extracted)]
138
+ [Dispatch implementation subagent with full task text + context]
139
+
140
+ Implementer: "Before I begin - should the hook be installed at user or system level?"
141
+
142
+ You: "User level (~/.config/superpowers/hooks/)"
143
+
144
+ Implementer: "Got it. Implementing now..."
145
+ [Later] Implementer:
146
+ - Implemented install-hook command
147
+ - Added tests, 5/5 passing
148
+ - Self-review: Found I missed --force flag, added it
149
+ - Committed
150
+
151
+ [Dispatch spec compliance reviewer]
152
+ Spec reviewer: ✅ Spec compliant - all requirements met, nothing extra
153
+
154
+ [Get git SHAs, dispatch code quality reviewer]
155
+ Code reviewer: Strengths: Good test coverage, clean. Issues: None. Approved.
156
+
157
+ [Mark Task 1 complete]
158
+
159
+ Task 2: Recovery modes
160
+
161
+ [Get Task 2 text and context (already extracted)]
162
+ [Dispatch implementation subagent with full task text + context]
163
+
164
+ Implementer: [No questions, proceeds]
165
+ Implementer:
166
+ - Added verify/repair modes
167
+ - 8/8 tests passing
168
+ - Self-review: All good
169
+ - Committed
170
+
171
+ [Dispatch spec compliance reviewer]
172
+ Spec reviewer: ❌ Issues:
173
+ - Missing: Progress reporting (spec says "report every 100 items")
174
+ - Extra: Added --json flag (not requested)
175
+
176
+ [Implementer fixes issues]
177
+ Implementer: Removed --json flag, added progress reporting
178
+
179
+ [Spec reviewer reviews again]
180
+ Spec reviewer: ✅ Spec compliant now
181
+
182
+ [Dispatch code quality reviewer]
183
+ Code reviewer: Strengths: Solid. Issues (Important): Magic number (100)
184
+
185
+ [Implementer fixes]
186
+ Implementer: Extracted PROGRESS_INTERVAL constant
187
+
188
+ [Code reviewer reviews again]
189
+ Code reviewer: ✅ Approved
190
+
191
+ [Mark Task 2 complete]
192
+
193
+ ...
194
+
195
+ [After all tasks]
196
+ [Dispatch final code-reviewer]
197
+ Final reviewer: All requirements met, ready to merge
198
+
199
+ Done!
200
+ ```
201
+
202
+ ## Advantages
203
+
204
+ **vs. Manual execution:**
205
+ - Subagents follow TDD naturally
206
+ - Fresh context per task (no confusion)
207
+ - Parallel-safe (subagents don't interfere)
208
+ - Subagent can ask questions (before AND during work)
209
+
210
+ **vs. Executing Plans:**
211
+ - Same session (no handoff)
212
+ - Continuous progress (no waiting)
213
+ - Review checkpoints automatic
214
+
215
+ **Efficiency gains:**
216
+ - No file reading overhead (controller provides full text)
217
+ - Controller curates exactly what context is needed
218
+ - Subagent gets complete information upfront
219
+ - Questions surfaced before work begins (not after)
220
+
221
+ **Quality gates:**
222
+ - Self-review catches issues before handoff
223
+ - Two-stage review: spec compliance, then code quality
224
+ - Review loops ensure fixes actually work
225
+ - Spec compliance prevents over/under-building
226
+ - Code quality ensures implementation is well-built
227
+
228
+ **Cost:**
229
+ - More subagent invocations (implementer + 2 reviewers per task)
230
+ - Controller does more prep work (extracting all tasks upfront)
231
+ - Review loops add iterations
232
+ - But catches issues early (cheaper than debugging later)
233
+
234
+ ## Red Flags
235
+
236
+ **Never:**
237
+ - Start implementation on main/master branch without explicit user consent
238
+ - Skip reviews (spec compliance OR code quality)
239
+ - Proceed with unfixed issues
240
+ - Dispatch multiple implementation subagents in parallel (conflicts)
241
+ - Make subagent read plan file (provide full text instead)
242
+ - Skip scene-setting context (subagent needs to understand where task fits)
243
+ - Ignore subagent questions (answer before letting them proceed)
244
+ - Accept "close enough" on spec compliance (spec reviewer found issues = not done)
245
+ - Skip review loops (reviewer found issues = implementer fixes = review again)
246
+ - Let implementer self-review replace actual review (both are needed)
247
+ - **Start code quality review before spec compliance is ✅** (wrong order)
248
+ - Move to next task while either review has open issues
249
+
250
+ **If subagent asks questions:**
251
+ - Answer clearly and completely
252
+ - Provide additional context if needed
253
+ - Don't rush them into implementation
254
+
255
+ **If reviewer finds issues:**
256
+ - Implementer (same subagent) fixes them
257
+ - Reviewer reviews again
258
+ - Repeat until approved
259
+ - Don't skip the re-review
260
+
261
+ **If subagent fails task:**
262
+ - Dispatch fix subagent with specific instructions
263
+ - Don't try to fix manually (context pollution)
264
+
265
+ ## Integration
266
+
267
+ **Required workflow skills:**
268
+ - **superpowers:using-git-worktrees** - REQUIRED: Set up isolated workspace before starting
269
+ - **superpowers:writing-plans** - Creates the plan this skill executes
270
+ - **superpowers:requesting-code-review** - Code review template for reviewer subagents
271
+ - **superpowers:finishing-a-development-branch** - Complete development after all tasks
272
+
273
+ **Subagents should use:**
274
+ - **superpowers:test-driven-development** - Subagents follow TDD for each task
275
+
276
+ **Alternative workflow:**
277
+ - **superpowers:executing-plans** - Use for parallel session instead of same-session execution
@@ -0,0 +1,26 @@
1
+ # Code Quality Reviewer Prompt Template
2
+
3
+ Use this template when dispatching a code quality reviewer subagent.
4
+
5
+ **Purpose:** Verify implementation is well-built (clean, tested, maintainable)
6
+
7
+ **Only dispatch after spec compliance review passes.**
8
+
9
+ ```
10
+ Task tool (superpowers:code-reviewer):
11
+ Use template at requesting-code-review/code-reviewer.md
12
+
13
+ WHAT_WAS_IMPLEMENTED: [from implementer's report]
14
+ PLAN_OR_REQUIREMENTS: Task N from [plan-file]
15
+ BASE_SHA: [commit before task]
16
+ HEAD_SHA: [current commit]
17
+ DESCRIPTION: [task summary]
18
+ ```
19
+
20
+ **In addition to standard code quality concerns, the reviewer should check:**
21
+ - Does each file have one clear responsibility with a well-defined interface?
22
+ - Are units decomposed so they can be understood and tested independently?
23
+ - Is the implementation following the file structure from the plan?
24
+ - Did this implementation create new files that are already large, or significantly grow existing files? (Don't flag pre-existing file sizes — focus on what this change contributed.)
25
+
26
+ **Code reviewer returns:** Strengths, Issues (Critical/Important/Minor), Assessment
@@ -0,0 +1,113 @@
1
+ # Implementer Subagent Prompt Template
2
+
3
+ Use this template when dispatching an implementer subagent.
4
+
5
+ ```
6
+ Task tool (general-purpose):
7
+ description: "Implement Task N: [task name]"
8
+ prompt: |
9
+ You are implementing Task N: [task name]
10
+
11
+ ## Task Description
12
+
13
+ [FULL TEXT of task from plan - paste it here, don't make subagent read file]
14
+
15
+ ## Context
16
+
17
+ [Scene-setting: where this fits, dependencies, architectural context]
18
+
19
+ ## Before You Begin
20
+
21
+ If you have questions about:
22
+ - The requirements or acceptance criteria
23
+ - The approach or implementation strategy
24
+ - Dependencies or assumptions
25
+ - Anything unclear in the task description
26
+
27
+ **Ask them now.** Raise any concerns before starting work.
28
+
29
+ ## Your Job
30
+
31
+ Once you're clear on requirements:
32
+ 1. Implement exactly what the task specifies
33
+ 2. Write tests (following TDD if task says to)
34
+ 3. Verify implementation works
35
+ 4. Commit your work
36
+ 5. Self-review (see below)
37
+ 6. Report back
38
+
39
+ Work from: [directory]
40
+
41
+ **While you work:** If you encounter something unexpected or unclear, **ask questions**.
42
+ It's always OK to pause and clarify. Don't guess or make assumptions.
43
+
44
+ ## Code Organization
45
+
46
+ You reason best about code you can hold in context at once, and your edits are more
47
+ reliable when files are focused. Keep this in mind:
48
+ - Follow the file structure defined in the plan
49
+ - Each file should have one clear responsibility with a well-defined interface
50
+ - If a file you're creating is growing beyond the plan's intent, stop and report
51
+ it as DONE_WITH_CONCERNS — don't split files on your own without plan guidance
52
+ - If an existing file you're modifying is already large or tangled, work carefully
53
+ and note it as a concern in your report
54
+ - In existing codebases, follow established patterns. Improve code you're touching
55
+ the way a good developer would, but don't restructure things outside your task.
56
+
57
+ ## When You're in Over Your Head
58
+
59
+ It is always OK to stop and say "this is too hard for me." Bad work is worse than
60
+ no work. You will not be penalized for escalating.
61
+
62
+ **STOP and escalate when:**
63
+ - The task requires architectural decisions with multiple valid approaches
64
+ - You need to understand code beyond what was provided and can't find clarity
65
+ - You feel uncertain about whether your approach is correct
66
+ - The task involves restructuring existing code in ways the plan didn't anticipate
67
+ - You've been reading file after file trying to understand the system without progress
68
+
69
+ **How to escalate:** Report back with status BLOCKED or NEEDS_CONTEXT. Describe
70
+ specifically what you're stuck on, what you've tried, and what kind of help you need.
71
+ The controller can provide more context, re-dispatch with a more capable model,
72
+ or break the task into smaller pieces.
73
+
74
+ ## Before Reporting Back: Self-Review
75
+
76
+ Review your work with fresh eyes. Ask yourself:
77
+
78
+ **Completeness:**
79
+ - Did I fully implement everything in the spec?
80
+ - Did I miss any requirements?
81
+ - Are there edge cases I didn't handle?
82
+
83
+ **Quality:**
84
+ - Is this my best work?
85
+ - Are names clear and accurate (match what things do, not how they work)?
86
+ - Is the code clean and maintainable?
87
+
88
+ **Discipline:**
89
+ - Did I avoid overbuilding (YAGNI)?
90
+ - Did I only build what was requested?
91
+ - Did I follow existing patterns in the codebase?
92
+
93
+ **Testing:**
94
+ - Do tests actually verify behavior (not just mock behavior)?
95
+ - Did I follow TDD if required?
96
+ - Are tests comprehensive?
97
+
98
+ If you find issues during self-review, fix them now before reporting.
99
+
100
+ ## Report Format
101
+
102
+ When done, report:
103
+ - **Status:** DONE | DONE_WITH_CONCERNS | BLOCKED | NEEDS_CONTEXT
104
+ - What you implemented (or what you attempted, if blocked)
105
+ - What you tested and test results
106
+ - Files changed
107
+ - Self-review findings (if any)
108
+ - Any issues or concerns
109
+
110
+ Use DONE_WITH_CONCERNS if you completed the work but have doubts about correctness.
111
+ Use BLOCKED if you cannot complete the task. Use NEEDS_CONTEXT if you need
112
+ information that wasn't provided. Never silently produce work you're unsure about.
113
+ ```