@jokerized/getresearchdone 0.4.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (711) hide show
  1. package/.claude-plugin/plugin.json +103 -0
  2. package/README.md +211 -0
  3. package/agents/grd-baseline-assessor.md +684 -0
  4. package/agents/grd-code-reviewer.md +300 -0
  5. package/agents/grd-codebase-mapper.md +355 -0
  6. package/agents/grd-critique-agent.md +119 -0
  7. package/agents/grd-debugger.md +519 -0
  8. package/agents/grd-deep-diver.md +737 -0
  9. package/agents/grd-eval-planner.md +913 -0
  10. package/agents/grd-eval-reporter.md +717 -0
  11. package/agents/grd-executor.md +683 -0
  12. package/agents/grd-feasibility-analyst.md +624 -0
  13. package/agents/grd-integration-checker.md +367 -0
  14. package/agents/grd-knowledge-miner.md +81 -0
  15. package/agents/grd-migrator.md +88 -0
  16. package/agents/grd-phase-researcher.md +697 -0
  17. package/agents/grd-plan-checker.md +443 -0
  18. package/agents/grd-planner.md +1532 -0
  19. package/agents/grd-product-owner.md +562 -0
  20. package/agents/grd-project-researcher.md +513 -0
  21. package/agents/grd-research-synthesizer.md +273 -0
  22. package/agents/grd-roadmapper.md +798 -0
  23. package/agents/grd-surveyor.md +566 -0
  24. package/agents/grd-verifier.md +893 -0
  25. package/bin/gd.js +4 -0
  26. package/bin/gd.ts +227 -0
  27. package/bin/grd-manifest.js +4 -0
  28. package/bin/grd-manifest.ts +286 -0
  29. package/bin/grd-mcp-server.js +4 -0
  30. package/bin/grd-mcp-server.ts +124 -0
  31. package/bin/grd-tools.js +4 -0
  32. package/bin/grd-tools.ts +2471 -0
  33. package/bin/postinstall.js +4 -0
  34. package/bin/postinstall.ts +80 -0
  35. package/commands/add-phase.md +123 -0
  36. package/commands/add-todo.md +87 -0
  37. package/commands/assess-baseline.md +289 -0
  38. package/commands/autopilot.md +100 -0
  39. package/commands/autoplan.md +55 -0
  40. package/commands/check-todos.md +87 -0
  41. package/commands/compare-methods.md +262 -0
  42. package/commands/complete-milestone.md +225 -0
  43. package/commands/debug.md +372 -0
  44. package/commands/deep-dive.md +288 -0
  45. package/commands/discover.md +281 -0
  46. package/commands/discuss-phase.md +188 -0
  47. package/commands/discuss.md +55 -0
  48. package/commands/eval-report.md +310 -0
  49. package/commands/evolve.md +79 -0
  50. package/commands/execute-phase.md +1017 -0
  51. package/commands/feasibility.md +292 -0
  52. package/commands/help.md +407 -0
  53. package/commands/init.md +1508 -0
  54. package/commands/insert-phase.md +113 -0
  55. package/commands/iterate.md +327 -0
  56. package/commands/list-phase-assumptions.md +217 -0
  57. package/commands/long-term-roadmap.md +202 -0
  58. package/commands/map-codebase.md +111 -0
  59. package/commands/migrate.md +159 -0
  60. package/commands/new-milestone.md +169 -0
  61. package/commands/pause-work.md +83 -0
  62. package/commands/plan-milestone-gaps.md +373 -0
  63. package/commands/plan-phase.md +655 -0
  64. package/commands/principles.md +328 -0
  65. package/commands/product-plan.md +319 -0
  66. package/commands/progress.md +481 -0
  67. package/commands/quick.md +167 -0
  68. package/commands/reapply-patches.md +154 -0
  69. package/commands/remove-phase.md +97 -0
  70. package/commands/requirement.md +96 -0
  71. package/commands/resume-project.md +113 -0
  72. package/commands/settings.md +1144 -0
  73. package/commands/survey.md +242 -0
  74. package/commands/sync.md +246 -0
  75. package/commands/tracker-setup.md +322 -0
  76. package/commands/update.md +202 -0
  77. package/commands/verify-phase.md +335 -0
  78. package/commands/verify-work.md +701 -0
  79. package/commands/wireup.md +29 -0
  80. package/dist/bin/gd.d.ts +3 -0
  81. package/dist/bin/gd.d.ts.map +1 -0
  82. package/dist/bin/gd.js +178 -0
  83. package/dist/bin/gd.js.map +1 -0
  84. package/dist/bin/grd-manifest.d.ts +3 -0
  85. package/dist/bin/grd-manifest.d.ts.map +1 -0
  86. package/dist/bin/grd-manifest.js +202 -0
  87. package/dist/bin/grd-manifest.js.map +1 -0
  88. package/dist/bin/grd-mcp-server.d.ts +3 -0
  89. package/dist/bin/grd-mcp-server.d.ts.map +1 -0
  90. package/dist/bin/grd-mcp-server.js +71 -0
  91. package/dist/bin/grd-mcp-server.js.map +1 -0
  92. package/dist/bin/grd-tools.d.ts +3 -0
  93. package/dist/bin/grd-tools.d.ts.map +1 -0
  94. package/dist/bin/grd-tools.js +1680 -0
  95. package/dist/bin/grd-tools.js.map +1 -0
  96. package/dist/bin/postinstall.d.ts +3 -0
  97. package/dist/bin/postinstall.d.ts.map +1 -0
  98. package/dist/bin/postinstall.js +61 -0
  99. package/dist/bin/postinstall.js.map +1 -0
  100. package/dist/lib/autopilot-milestone.d.ts +2 -0
  101. package/dist/lib/autopilot-milestone.d.ts.map +1 -0
  102. package/dist/lib/autopilot-milestone.js +94 -0
  103. package/dist/lib/autopilot-milestone.js.map +1 -0
  104. package/dist/lib/autopilot-pipeline.d.ts +2 -0
  105. package/dist/lib/autopilot-pipeline.d.ts.map +1 -0
  106. package/dist/lib/autopilot-pipeline.js +830 -0
  107. package/dist/lib/autopilot-pipeline.js.map +1 -0
  108. package/dist/lib/autopilot-waves.d.ts +2 -0
  109. package/dist/lib/autopilot-waves.d.ts.map +1 -0
  110. package/dist/lib/autopilot-waves.js +266 -0
  111. package/dist/lib/autopilot-waves.js.map +1 -0
  112. package/dist/lib/autopilot.d.ts +2 -0
  113. package/dist/lib/autopilot.d.ts.map +1 -0
  114. package/dist/lib/autopilot.js +1314 -0
  115. package/dist/lib/autopilot.js.map +1 -0
  116. package/dist/lib/autoplan.d.ts +2 -0
  117. package/dist/lib/autoplan.d.ts.map +1 -0
  118. package/dist/lib/autoplan.js +198 -0
  119. package/dist/lib/autoplan.js.map +1 -0
  120. package/dist/lib/autoresearch.d.ts +2 -0
  121. package/dist/lib/autoresearch.d.ts.map +1 -0
  122. package/dist/lib/autoresearch.js +626 -0
  123. package/dist/lib/autoresearch.js.map +1 -0
  124. package/dist/lib/backend.d.ts +2 -0
  125. package/dist/lib/backend.d.ts.map +1 -0
  126. package/dist/lib/backend.js +1036 -0
  127. package/dist/lib/backend.js.map +1 -0
  128. package/dist/lib/benchmark.d.ts +99 -0
  129. package/dist/lib/benchmark.d.ts.map +1 -0
  130. package/dist/lib/benchmark.js +278 -0
  131. package/dist/lib/benchmark.js.map +1 -0
  132. package/dist/lib/citations.d.ts +2 -0
  133. package/dist/lib/citations.d.ts.map +1 -0
  134. package/dist/lib/citations.js +642 -0
  135. package/dist/lib/citations.js.map +1 -0
  136. package/dist/lib/cleanup.d.ts +2 -0
  137. package/dist/lib/cleanup.d.ts.map +1 -0
  138. package/dist/lib/cleanup.js +1222 -0
  139. package/dist/lib/cleanup.js.map +1 -0
  140. package/dist/lib/cli/adapters.d.ts +10 -0
  141. package/dist/lib/cli/adapters.d.ts.map +1 -0
  142. package/dist/lib/cli/adapters.js +27 -0
  143. package/dist/lib/cli/adapters.js.map +1 -0
  144. package/dist/lib/cli/agent.d.ts +17 -0
  145. package/dist/lib/cli/agent.d.ts.map +1 -0
  146. package/dist/lib/cli/agent.js +53 -0
  147. package/dist/lib/cli/agent.js.map +1 -0
  148. package/dist/lib/cli/index.d.ts +21 -0
  149. package/dist/lib/cli/index.d.ts.map +1 -0
  150. package/dist/lib/cli/index.js +264 -0
  151. package/dist/lib/cli/index.js.map +1 -0
  152. package/dist/lib/cli/output.d.ts +20 -0
  153. package/dist/lib/cli/output.d.ts.map +1 -0
  154. package/dist/lib/cli/output.js +22 -0
  155. package/dist/lib/cli/output.js.map +1 -0
  156. package/dist/lib/cli/scan-dispatch.d.ts +9 -0
  157. package/dist/lib/cli/scan-dispatch.d.ts.map +1 -0
  158. package/dist/lib/cli/scan-dispatch.js +107 -0
  159. package/dist/lib/cli/scan-dispatch.js.map +1 -0
  160. package/dist/lib/cli/tools.d.ts +16 -0
  161. package/dist/lib/cli/tools.d.ts.map +1 -0
  162. package/dist/lib/cli/tools.js +168 -0
  163. package/dist/lib/cli/tools.js.map +1 -0
  164. package/dist/lib/commands/_dashboard-parsers.d.ts +2 -0
  165. package/dist/lib/commands/_dashboard-parsers.d.ts.map +1 -0
  166. package/dist/lib/commands/_dashboard-parsers.js +192 -0
  167. package/dist/lib/commands/_dashboard-parsers.js.map +1 -0
  168. package/dist/lib/commands/analysis.d.ts +2 -0
  169. package/dist/lib/commands/analysis.d.ts.map +1 -0
  170. package/dist/lib/commands/analysis.js +1418 -0
  171. package/dist/lib/commands/analysis.js.map +1 -0
  172. package/dist/lib/commands/assumptions.d.ts +2 -0
  173. package/dist/lib/commands/assumptions.d.ts.map +1 -0
  174. package/dist/lib/commands/assumptions.js +166 -0
  175. package/dist/lib/commands/assumptions.js.map +1 -0
  176. package/dist/lib/commands/blame.d.ts +2 -0
  177. package/dist/lib/commands/blame.d.ts.map +1 -0
  178. package/dist/lib/commands/blame.js +133 -0
  179. package/dist/lib/commands/blame.js.map +1 -0
  180. package/dist/lib/commands/budget.d.ts +2 -0
  181. package/dist/lib/commands/budget.d.ts.map +1 -0
  182. package/dist/lib/commands/budget.js +100 -0
  183. package/dist/lib/commands/budget.js.map +1 -0
  184. package/dist/lib/commands/check-plans.d.ts +2 -0
  185. package/dist/lib/commands/check-plans.d.ts.map +1 -0
  186. package/dist/lib/commands/check-plans.js +190 -0
  187. package/dist/lib/commands/check-plans.js.map +1 -0
  188. package/dist/lib/commands/config.d.ts +2 -0
  189. package/dist/lib/commands/config.d.ts.map +1 -0
  190. package/dist/lib/commands/config.js +188 -0
  191. package/dist/lib/commands/config.js.map +1 -0
  192. package/dist/lib/commands/dashboard.d.ts +2 -0
  193. package/dist/lib/commands/dashboard.d.ts.map +1 -0
  194. package/dist/lib/commands/dashboard.js +466 -0
  195. package/dist/lib/commands/dashboard.js.map +1 -0
  196. package/dist/lib/commands/estimate.d.ts +2 -0
  197. package/dist/lib/commands/estimate.d.ts.map +1 -0
  198. package/dist/lib/commands/estimate.js +148 -0
  199. package/dist/lib/commands/estimate.js.map +1 -0
  200. package/dist/lib/commands/eval-diff.d.ts +2 -0
  201. package/dist/lib/commands/eval-diff.d.ts.map +1 -0
  202. package/dist/lib/commands/eval-diff.js +213 -0
  203. package/dist/lib/commands/eval-diff.js.map +1 -0
  204. package/dist/lib/commands/freshness.d.ts +2 -0
  205. package/dist/lib/commands/freshness.d.ts.map +1 -0
  206. package/dist/lib/commands/freshness.js +163 -0
  207. package/dist/lib/commands/freshness.js.map +1 -0
  208. package/dist/lib/commands/health.d.ts +2 -0
  209. package/dist/lib/commands/health.d.ts.map +1 -0
  210. package/dist/lib/commands/health.js +435 -0
  211. package/dist/lib/commands/health.js.map +1 -0
  212. package/dist/lib/commands/index.d.ts +2 -0
  213. package/dist/lib/commands/index.d.ts.map +1 -0
  214. package/dist/lib/commands/index.js +128 -0
  215. package/dist/lib/commands/index.js.map +1 -0
  216. package/dist/lib/commands/install.d.ts +56 -0
  217. package/dist/lib/commands/install.d.ts.map +1 -0
  218. package/dist/lib/commands/install.js +214 -0
  219. package/dist/lib/commands/install.js.map +1 -0
  220. package/dist/lib/commands/knowhow-aggregator.d.ts +2 -0
  221. package/dist/lib/commands/knowhow-aggregator.d.ts.map +1 -0
  222. package/dist/lib/commands/knowhow-aggregator.js +279 -0
  223. package/dist/lib/commands/knowhow-aggregator.js.map +1 -0
  224. package/dist/lib/commands/knowledge-search.d.ts +2 -0
  225. package/dist/lib/commands/knowledge-search.d.ts.map +1 -0
  226. package/dist/lib/commands/knowledge-search.js +113 -0
  227. package/dist/lib/commands/knowledge-search.js.map +1 -0
  228. package/dist/lib/commands/long-term-roadmap.d.ts +2 -0
  229. package/dist/lib/commands/long-term-roadmap.d.ts.map +1 -0
  230. package/dist/lib/commands/long-term-roadmap.js +272 -0
  231. package/dist/lib/commands/long-term-roadmap.js.map +1 -0
  232. package/dist/lib/commands/patterns.d.ts +91 -0
  233. package/dist/lib/commands/patterns.d.ts.map +1 -0
  234. package/dist/lib/commands/patterns.js +391 -0
  235. package/dist/lib/commands/patterns.js.map +1 -0
  236. package/dist/lib/commands/phase-info.d.ts +2 -0
  237. package/dist/lib/commands/phase-info.d.ts.map +1 -0
  238. package/dist/lib/commands/phase-info.js +509 -0
  239. package/dist/lib/commands/phase-info.js.map +1 -0
  240. package/dist/lib/commands/plan-lint.d.ts +56 -0
  241. package/dist/lib/commands/plan-lint.d.ts.map +1 -0
  242. package/dist/lib/commands/plan-lint.js +481 -0
  243. package/dist/lib/commands/plan-lint.js.map +1 -0
  244. package/dist/lib/commands/plan-phase.d.ts +53 -0
  245. package/dist/lib/commands/plan-phase.d.ts.map +1 -0
  246. package/dist/lib/commands/plan-phase.js +288 -0
  247. package/dist/lib/commands/plan-phase.js.map +1 -0
  248. package/dist/lib/commands/progress.d.ts +2 -0
  249. package/dist/lib/commands/progress.d.ts.map +1 -0
  250. package/dist/lib/commands/progress.js +266 -0
  251. package/dist/lib/commands/progress.js.map +1 -0
  252. package/dist/lib/commands/quality.d.ts +2 -0
  253. package/dist/lib/commands/quality.d.ts.map +1 -0
  254. package/dist/lib/commands/quality.js +80 -0
  255. package/dist/lib/commands/quality.js.map +1 -0
  256. package/dist/lib/commands/rollback.d.ts +2 -0
  257. package/dist/lib/commands/rollback.d.ts.map +1 -0
  258. package/dist/lib/commands/rollback.js +145 -0
  259. package/dist/lib/commands/rollback.js.map +1 -0
  260. package/dist/lib/commands/scan.d.ts +25 -0
  261. package/dist/lib/commands/scan.d.ts.map +1 -0
  262. package/dist/lib/commands/scan.js +28 -0
  263. package/dist/lib/commands/scan.js.map +1 -0
  264. package/dist/lib/commands/search.d.ts +2 -0
  265. package/dist/lib/commands/search.d.ts.map +1 -0
  266. package/dist/lib/commands/search.js +212 -0
  267. package/dist/lib/commands/search.js.map +1 -0
  268. package/dist/lib/commands/select-candidate.d.ts +128 -0
  269. package/dist/lib/commands/select-candidate.d.ts.map +1 -0
  270. package/dist/lib/commands/select-candidate.js +518 -0
  271. package/dist/lib/commands/select-candidate.js.map +1 -0
  272. package/dist/lib/commands/singularity.d.ts +2 -0
  273. package/dist/lib/commands/singularity.d.ts.map +1 -0
  274. package/dist/lib/commands/singularity.js +185 -0
  275. package/dist/lib/commands/singularity.js.map +1 -0
  276. package/dist/lib/commands/slug-timestamp.d.ts +2 -0
  277. package/dist/lib/commands/slug-timestamp.d.ts.map +1 -0
  278. package/dist/lib/commands/slug-timestamp.js +54 -0
  279. package/dist/lib/commands/slug-timestamp.js.map +1 -0
  280. package/dist/lib/commands/tail.d.ts +2 -0
  281. package/dist/lib/commands/tail.d.ts.map +1 -0
  282. package/dist/lib/commands/tail.js +100 -0
  283. package/dist/lib/commands/tail.js.map +1 -0
  284. package/dist/lib/commands/todo.d.ts +2 -0
  285. package/dist/lib/commands/todo.d.ts.map +1 -0
  286. package/dist/lib/commands/todo.js +200 -0
  287. package/dist/lib/commands/todo.js.map +1 -0
  288. package/dist/lib/commands/watch.d.ts +2 -0
  289. package/dist/lib/commands/watch.d.ts.map +1 -0
  290. package/dist/lib/commands/watch.js +72 -0
  291. package/dist/lib/commands/watch.js.map +1 -0
  292. package/dist/lib/complexity.d.ts +55 -0
  293. package/dist/lib/complexity.d.ts.map +1 -0
  294. package/dist/lib/complexity.js +80 -0
  295. package/dist/lib/complexity.js.map +1 -0
  296. package/dist/lib/context/agents.d.ts +2 -0
  297. package/dist/lib/context/agents.d.ts.map +1 -0
  298. package/dist/lib/context/agents.js +344 -0
  299. package/dist/lib/context/agents.js.map +1 -0
  300. package/dist/lib/context/base.d.ts +2 -0
  301. package/dist/lib/context/base.d.ts.map +1 -0
  302. package/dist/lib/context/base.js +81 -0
  303. package/dist/lib/context/base.js.map +1 -0
  304. package/dist/lib/context/execute.d.ts +2 -0
  305. package/dist/lib/context/execute.d.ts.map +1 -0
  306. package/dist/lib/context/execute.js +753 -0
  307. package/dist/lib/context/execute.js.map +1 -0
  308. package/dist/lib/context/index.d.ts +2 -0
  309. package/dist/lib/context/index.d.ts.map +1 -0
  310. package/dist/lib/context/index.js +88 -0
  311. package/dist/lib/context/index.js.map +1 -0
  312. package/dist/lib/context/progress.d.ts +2 -0
  313. package/dist/lib/context/progress.d.ts.map +1 -0
  314. package/dist/lib/context/progress.js +178 -0
  315. package/dist/lib/context/progress.js.map +1 -0
  316. package/dist/lib/context/project.d.ts +2 -0
  317. package/dist/lib/context/project.d.ts.map +1 -0
  318. package/dist/lib/context/project.js +413 -0
  319. package/dist/lib/context/project.js.map +1 -0
  320. package/dist/lib/context/research.d.ts +2 -0
  321. package/dist/lib/context/research.d.ts.map +1 -0
  322. package/dist/lib/context/research.js +466 -0
  323. package/dist/lib/context/research.js.map +1 -0
  324. package/dist/lib/dead-ends.d.ts +28 -0
  325. package/dist/lib/dead-ends.d.ts.map +1 -0
  326. package/dist/lib/dead-ends.js +451 -0
  327. package/dist/lib/dead-ends.js.map +1 -0
  328. package/dist/lib/deps.d.ts +2 -0
  329. package/dist/lib/deps.d.ts.map +1 -0
  330. package/dist/lib/deps.js +630 -0
  331. package/dist/lib/deps.js.map +1 -0
  332. package/dist/lib/discussion.d.ts +2 -0
  333. package/dist/lib/discussion.d.ts.map +1 -0
  334. package/dist/lib/discussion.js +1041 -0
  335. package/dist/lib/discussion.js.map +1 -0
  336. package/dist/lib/drift.d.ts +36 -0
  337. package/dist/lib/drift.d.ts.map +1 -0
  338. package/dist/lib/drift.js +481 -0
  339. package/dist/lib/drift.js.map +1 -0
  340. package/dist/lib/evolve/_dimensions-features.d.ts +2 -0
  341. package/dist/lib/evolve/_dimensions-features.d.ts.map +1 -0
  342. package/dist/lib/evolve/_dimensions-features.js +369 -0
  343. package/dist/lib/evolve/_dimensions-features.js.map +1 -0
  344. package/dist/lib/evolve/_dimensions.d.ts +2 -0
  345. package/dist/lib/evolve/_dimensions.d.ts.map +1 -0
  346. package/dist/lib/evolve/_dimensions.js +358 -0
  347. package/dist/lib/evolve/_dimensions.js.map +1 -0
  348. package/dist/lib/evolve/_product-ideation.d.ts +2 -0
  349. package/dist/lib/evolve/_product-ideation.d.ts.map +1 -0
  350. package/dist/lib/evolve/_product-ideation.js +281 -0
  351. package/dist/lib/evolve/_product-ideation.js.map +1 -0
  352. package/dist/lib/evolve/_prompts.d.ts +2 -0
  353. package/dist/lib/evolve/_prompts.d.ts.map +1 -0
  354. package/dist/lib/evolve/_prompts.js +153 -0
  355. package/dist/lib/evolve/_prompts.js.map +1 -0
  356. package/dist/lib/evolve/cli.d.ts +2 -0
  357. package/dist/lib/evolve/cli.d.ts.map +1 -0
  358. package/dist/lib/evolve/cli.js +224 -0
  359. package/dist/lib/evolve/cli.js.map +1 -0
  360. package/dist/lib/evolve/discovery.d.ts +2 -0
  361. package/dist/lib/evolve/discovery.d.ts.map +1 -0
  362. package/dist/lib/evolve/discovery.js +391 -0
  363. package/dist/lib/evolve/discovery.js.map +1 -0
  364. package/dist/lib/evolve/index.d.ts +2 -0
  365. package/dist/lib/evolve/index.d.ts.map +1 -0
  366. package/dist/lib/evolve/index.js +88 -0
  367. package/dist/lib/evolve/index.js.map +1 -0
  368. package/dist/lib/evolve/orchestrator.d.ts +2 -0
  369. package/dist/lib/evolve/orchestrator.d.ts.map +1 -0
  370. package/dist/lib/evolve/orchestrator.js +851 -0
  371. package/dist/lib/evolve/orchestrator.js.map +1 -0
  372. package/dist/lib/evolve/scoring.d.ts +2 -0
  373. package/dist/lib/evolve/scoring.d.ts.map +1 -0
  374. package/dist/lib/evolve/scoring.js +118 -0
  375. package/dist/lib/evolve/scoring.js.map +1 -0
  376. package/dist/lib/evolve/state.d.ts +2 -0
  377. package/dist/lib/evolve/state.d.ts.map +1 -0
  378. package/dist/lib/evolve/state.js +264 -0
  379. package/dist/lib/evolve/state.js.map +1 -0
  380. package/dist/lib/evolve/types.d.ts +249 -0
  381. package/dist/lib/evolve/types.d.ts.map +1 -0
  382. package/dist/lib/evolve/types.js +3 -0
  383. package/dist/lib/evolve/types.js.map +1 -0
  384. package/dist/lib/frontmatter.d.ts +2 -0
  385. package/dist/lib/frontmatter.d.ts.map +1 -0
  386. package/dist/lib/frontmatter.js +513 -0
  387. package/dist/lib/frontmatter.js.map +1 -0
  388. package/dist/lib/gates.d.ts +2 -0
  389. package/dist/lib/gates.d.ts.map +1 -0
  390. package/dist/lib/gates.js +578 -0
  391. package/dist/lib/gates.js.map +1 -0
  392. package/dist/lib/genome.d.ts +10 -0
  393. package/dist/lib/genome.d.ts.map +1 -0
  394. package/dist/lib/genome.js +368 -0
  395. package/dist/lib/genome.js.map +1 -0
  396. package/dist/lib/got.d.ts +2 -0
  397. package/dist/lib/got.d.ts.map +1 -0
  398. package/dist/lib/got.js +280 -0
  399. package/dist/lib/got.js.map +1 -0
  400. package/dist/lib/invariants.d.ts +2 -0
  401. package/dist/lib/invariants.d.ts.map +1 -0
  402. package/dist/lib/invariants.js +298 -0
  403. package/dist/lib/invariants.js.map +1 -0
  404. package/dist/lib/knowledge.d.ts +2 -0
  405. package/dist/lib/knowledge.d.ts.map +1 -0
  406. package/dist/lib/knowledge.js +658 -0
  407. package/dist/lib/knowledge.js.map +1 -0
  408. package/dist/lib/long-term-roadmap.d.ts +2 -0
  409. package/dist/lib/long-term-roadmap.d.ts.map +1 -0
  410. package/dist/lib/long-term-roadmap.js +602 -0
  411. package/dist/lib/long-term-roadmap.js.map +1 -0
  412. package/dist/lib/markdown-split.d.ts +2 -0
  413. package/dist/lib/markdown-split.d.ts.map +1 -0
  414. package/dist/lib/markdown-split.js +199 -0
  415. package/dist/lib/markdown-split.js.map +1 -0
  416. package/dist/lib/mcp-server.d.ts +2 -0
  417. package/dist/lib/mcp-server.d.ts.map +1 -0
  418. package/dist/lib/mcp-server.js +2424 -0
  419. package/dist/lib/mcp-server.js.map +1 -0
  420. package/dist/lib/metrics.d.ts +16 -0
  421. package/dist/lib/metrics.d.ts.map +1 -0
  422. package/dist/lib/metrics.js +48 -0
  423. package/dist/lib/metrics.js.map +1 -0
  424. package/dist/lib/overstory.d.ts +2 -0
  425. package/dist/lib/overstory.d.ts.map +1 -0
  426. package/dist/lib/overstory.js +211 -0
  427. package/dist/lib/overstory.js.map +1 -0
  428. package/dist/lib/parallel.d.ts +2 -0
  429. package/dist/lib/parallel.d.ts.map +1 -0
  430. package/dist/lib/parallel.js +349 -0
  431. package/dist/lib/parallel.js.map +1 -0
  432. package/dist/lib/paths.d.ts +2 -0
  433. package/dist/lib/paths.d.ts.map +1 -0
  434. package/dist/lib/paths.js +254 -0
  435. package/dist/lib/paths.js.map +1 -0
  436. package/dist/lib/phase-complete-llm.d.ts +22 -0
  437. package/dist/lib/phase-complete-llm.d.ts.map +1 -0
  438. package/dist/lib/phase-complete-llm.js +331 -0
  439. package/dist/lib/phase-complete-llm.js.map +1 -0
  440. package/dist/lib/phase-complete.d.ts +46 -0
  441. package/dist/lib/phase-complete.d.ts.map +1 -0
  442. package/dist/lib/phase-complete.js +278 -0
  443. package/dist/lib/phase-complete.js.map +1 -0
  444. package/dist/lib/phase-io.d.ts +2 -0
  445. package/dist/lib/phase-io.d.ts.map +1 -0
  446. package/dist/lib/phase-io.js +126 -0
  447. package/dist/lib/phase-io.js.map +1 -0
  448. package/dist/lib/phase.d.ts +2 -0
  449. package/dist/lib/phase.d.ts.map +1 -0
  450. package/dist/lib/phase.js +1344 -0
  451. package/dist/lib/phase.js.map +1 -0
  452. package/dist/lib/plan-tournament.d.ts +63 -0
  453. package/dist/lib/plan-tournament.d.ts.map +1 -0
  454. package/dist/lib/plan-tournament.js +353 -0
  455. package/dist/lib/plan-tournament.js.map +1 -0
  456. package/dist/lib/refinement.d.ts +74 -0
  457. package/dist/lib/refinement.d.ts.map +1 -0
  458. package/dist/lib/refinement.js +283 -0
  459. package/dist/lib/refinement.js.map +1 -0
  460. package/dist/lib/requirements.d.ts +2 -0
  461. package/dist/lib/requirements.d.ts.map +1 -0
  462. package/dist/lib/requirements.js +355 -0
  463. package/dist/lib/requirements.js.map +1 -0
  464. package/dist/lib/research-bundle.d.ts +2 -0
  465. package/dist/lib/research-bundle.d.ts.map +1 -0
  466. package/dist/lib/research-bundle.js +246 -0
  467. package/dist/lib/research-bundle.js.map +1 -0
  468. package/dist/lib/roadmap.d.ts +2 -0
  469. package/dist/lib/roadmap.d.ts.map +1 -0
  470. package/dist/lib/roadmap.js +541 -0
  471. package/dist/lib/roadmap.js.map +1 -0
  472. package/dist/lib/sample.d.ts +16 -0
  473. package/dist/lib/sample.d.ts.map +1 -0
  474. package/dist/lib/sample.js +20 -0
  475. package/dist/lib/sample.js.map +1 -0
  476. package/dist/lib/scaffold.d.ts +2 -0
  477. package/dist/lib/scaffold.d.ts.map +1 -0
  478. package/dist/lib/scaffold.js +355 -0
  479. package/dist/lib/scaffold.js.map +1 -0
  480. package/dist/lib/scan/_utils.d.ts +11 -0
  481. package/dist/lib/scan/_utils.d.ts.map +1 -0
  482. package/dist/lib/scan/_utils.js +36 -0
  483. package/dist/lib/scan/_utils.js.map +1 -0
  484. package/dist/lib/scan/base64.d.ts +15 -0
  485. package/dist/lib/scan/base64.d.ts.map +1 -0
  486. package/dist/lib/scan/base64.js +66 -0
  487. package/dist/lib/scan/base64.js.map +1 -0
  488. package/dist/lib/scan/ignorefile.d.ts +30 -0
  489. package/dist/lib/scan/ignorefile.d.ts.map +1 -0
  490. package/dist/lib/scan/ignorefile.js +101 -0
  491. package/dist/lib/scan/ignorefile.js.map +1 -0
  492. package/dist/lib/scan/injection.d.ts +14 -0
  493. package/dist/lib/scan/injection.d.ts.map +1 -0
  494. package/dist/lib/scan/injection.js +39 -0
  495. package/dist/lib/scan/injection.js.map +1 -0
  496. package/dist/lib/scan/patterns.d.ts +17 -0
  497. package/dist/lib/scan/patterns.d.ts.map +1 -0
  498. package/dist/lib/scan/patterns.js +123 -0
  499. package/dist/lib/scan/patterns.js.map +1 -0
  500. package/dist/lib/scan/strip-markdown.d.ts +7 -0
  501. package/dist/lib/scan/strip-markdown.d.ts.map +1 -0
  502. package/dist/lib/scan/strip-markdown.js +38 -0
  503. package/dist/lib/scan/strip-markdown.js.map +1 -0
  504. package/dist/lib/scan/types.d.ts +23 -0
  505. package/dist/lib/scan/types.d.ts.map +1 -0
  506. package/dist/lib/scan/types.js +3 -0
  507. package/dist/lib/scan/types.js.map +1 -0
  508. package/dist/lib/scheduler-wait.d.ts +2 -0
  509. package/dist/lib/scheduler-wait.d.ts.map +1 -0
  510. package/dist/lib/scheduler-wait.js +59 -0
  511. package/dist/lib/scheduler-wait.js.map +1 -0
  512. package/dist/lib/scheduler.d.ts +254 -0
  513. package/dist/lib/scheduler.d.ts.map +1 -0
  514. package/dist/lib/scheduler.js +1147 -0
  515. package/dist/lib/scheduler.js.map +1 -0
  516. package/dist/lib/state.d.ts +2 -0
  517. package/dist/lib/state.d.ts.map +1 -0
  518. package/dist/lib/state.js +744 -0
  519. package/dist/lib/state.js.map +1 -0
  520. package/dist/lib/think.d.ts +18 -0
  521. package/dist/lib/think.d.ts.map +1 -0
  522. package/dist/lib/think.js +317 -0
  523. package/dist/lib/think.js.map +1 -0
  524. package/dist/lib/tracker.d.ts +2 -0
  525. package/dist/lib/tracker.d.ts.map +1 -0
  526. package/dist/lib/tracker.js +1121 -0
  527. package/dist/lib/tracker.js.map +1 -0
  528. package/dist/lib/types.d.ts +1514 -0
  529. package/dist/lib/types.d.ts.map +1 -0
  530. package/dist/lib/types.js +4 -0
  531. package/dist/lib/types.js.map +1 -0
  532. package/dist/lib/utils.d.ts +2 -0
  533. package/dist/lib/utils.d.ts.map +1 -0
  534. package/dist/lib/utils.js +1363 -0
  535. package/dist/lib/utils.js.map +1 -0
  536. package/dist/lib/verify.d.ts +2 -0
  537. package/dist/lib/verify.d.ts.map +1 -0
  538. package/dist/lib/verify.js +1153 -0
  539. package/dist/lib/verify.js.map +1 -0
  540. package/dist/lib/wireup/autofix.d.ts +2 -0
  541. package/dist/lib/wireup/autofix.d.ts.map +1 -0
  542. package/dist/lib/wireup/autofix.js +188 -0
  543. package/dist/lib/wireup/autofix.js.map +1 -0
  544. package/dist/lib/wireup/cli.d.ts +2 -0
  545. package/dist/lib/wireup/cli.d.ts.map +1 -0
  546. package/dist/lib/wireup/cli.js +194 -0
  547. package/dist/lib/wireup/cli.js.map +1 -0
  548. package/dist/lib/wireup/detection.d.ts +47 -0
  549. package/dist/lib/wireup/detection.d.ts.map +1 -0
  550. package/dist/lib/wireup/detection.js +410 -0
  551. package/dist/lib/wireup/detection.js.map +1 -0
  552. package/dist/lib/wireup/discovery.d.ts +2 -0
  553. package/dist/lib/wireup/discovery.d.ts.map +1 -0
  554. package/dist/lib/wireup/discovery.js +934 -0
  555. package/dist/lib/wireup/discovery.js.map +1 -0
  556. package/dist/lib/wireup/execution.d.ts +2 -0
  557. package/dist/lib/wireup/execution.d.ts.map +1 -0
  558. package/dist/lib/wireup/execution.js +573 -0
  559. package/dist/lib/wireup/execution.js.map +1 -0
  560. package/dist/lib/wireup/index.d.ts +2 -0
  561. package/dist/lib/wireup/index.d.ts.map +1 -0
  562. package/dist/lib/wireup/index.js +85 -0
  563. package/dist/lib/wireup/index.js.map +1 -0
  564. package/dist/lib/wireup/orchestrator.d.ts +2 -0
  565. package/dist/lib/wireup/orchestrator.d.ts.map +1 -0
  566. package/dist/lib/wireup/orchestrator.js +366 -0
  567. package/dist/lib/wireup/orchestrator.js.map +1 -0
  568. package/dist/lib/wireup/report.d.ts +47 -0
  569. package/dist/lib/wireup/report.d.ts.map +1 -0
  570. package/dist/lib/wireup/report.js +201 -0
  571. package/dist/lib/wireup/report.js.map +1 -0
  572. package/dist/lib/wireup/scenarios.d.ts +2 -0
  573. package/dist/lib/wireup/scenarios.d.ts.map +1 -0
  574. package/dist/lib/wireup/scenarios.js +516 -0
  575. package/dist/lib/wireup/scenarios.js.map +1 -0
  576. package/dist/lib/wireup/state.d.ts +2 -0
  577. package/dist/lib/wireup/state.d.ts.map +1 -0
  578. package/dist/lib/wireup/state.js +102 -0
  579. package/dist/lib/wireup/state.js.map +1 -0
  580. package/dist/lib/wireup/types.d.ts +376 -0
  581. package/dist/lib/wireup/types.d.ts.map +1 -0
  582. package/dist/lib/wireup/types.js +3 -0
  583. package/dist/lib/wireup/types.js.map +1 -0
  584. package/dist/lib/worktree.d.ts +2 -0
  585. package/dist/lib/worktree.d.ts.map +1 -0
  586. package/dist/lib/worktree.js +999 -0
  587. package/dist/lib/worktree.js.map +1 -0
  588. package/lib/autopilot-milestone.ts +136 -0
  589. package/lib/autopilot-pipeline.ts +1179 -0
  590. package/lib/autopilot-waves.ts +361 -0
  591. package/lib/autopilot.ts +1874 -0
  592. package/lib/autoplan.ts +280 -0
  593. package/lib/autoresearch.js +4 -0
  594. package/lib/autoresearch.ts +886 -0
  595. package/lib/backend.ts +1252 -0
  596. package/lib/benchmark.ts +341 -0
  597. package/lib/citations.ts +760 -0
  598. package/lib/cleanup.ts +1588 -0
  599. package/lib/cli/adapters.ts +41 -0
  600. package/lib/cli/agent.ts +83 -0
  601. package/lib/cli/index.ts +273 -0
  602. package/lib/cli/output.ts +33 -0
  603. package/lib/cli/scan-dispatch.ts +130 -0
  604. package/lib/cli/tools.ts +198 -0
  605. package/lib/commands/_dashboard-parsers.ts +275 -0
  606. package/lib/commands/analysis.ts +1851 -0
  607. package/lib/commands/assumptions.ts +232 -0
  608. package/lib/commands/blame.ts +174 -0
  609. package/lib/commands/budget.ts +148 -0
  610. package/lib/commands/check-plans.ts +233 -0
  611. package/lib/commands/config.ts +287 -0
  612. package/lib/commands/dashboard.ts +680 -0
  613. package/lib/commands/estimate.ts +204 -0
  614. package/lib/commands/eval-diff.ts +252 -0
  615. package/lib/commands/freshness.ts +213 -0
  616. package/lib/commands/health.ts +607 -0
  617. package/lib/commands/index.ts +266 -0
  618. package/lib/commands/install.ts +307 -0
  619. package/lib/commands/knowhow-aggregator.ts +345 -0
  620. package/lib/commands/knowledge-search.ts +153 -0
  621. package/lib/commands/long-term-roadmap.ts +390 -0
  622. package/lib/commands/patterns.ts +465 -0
  623. package/lib/commands/phase-info.ts +698 -0
  624. package/lib/commands/plan-lint.ts +546 -0
  625. package/lib/commands/plan-phase.ts +375 -0
  626. package/lib/commands/progress.ts +319 -0
  627. package/lib/commands/quality.ts +138 -0
  628. package/lib/commands/rollback.ts +195 -0
  629. package/lib/commands/scan.ts +72 -0
  630. package/lib/commands/search.ts +300 -0
  631. package/lib/commands/select-candidate.ts +687 -0
  632. package/lib/commands/singularity.ts +222 -0
  633. package/lib/commands/slug-timestamp.ts +74 -0
  634. package/lib/commands/tail.ts +129 -0
  635. package/lib/commands/todo.ts +273 -0
  636. package/lib/commands/watch.ts +80 -0
  637. package/lib/complexity.ts +117 -0
  638. package/lib/context/agents.ts +505 -0
  639. package/lib/context/base.ts +123 -0
  640. package/lib/context/execute.ts +977 -0
  641. package/lib/context/index.ts +110 -0
  642. package/lib/context/progress.ts +278 -0
  643. package/lib/context/project.ts +531 -0
  644. package/lib/context/research.ts +646 -0
  645. package/lib/dead-ends.ts +506 -0
  646. package/lib/deps.ts +773 -0
  647. package/lib/discussion.ts +1275 -0
  648. package/lib/drift.ts +519 -0
  649. package/lib/evolve/_dimensions-features.ts +525 -0
  650. package/lib/evolve/_dimensions.ts +511 -0
  651. package/lib/evolve/_product-ideation.ts +405 -0
  652. package/lib/evolve/_prompts.ts +178 -0
  653. package/lib/evolve/cli.ts +330 -0
  654. package/lib/evolve/discovery.ts +571 -0
  655. package/lib/evolve/index.ts +105 -0
  656. package/lib/evolve/orchestrator.ts +1139 -0
  657. package/lib/evolve/scoring.ts +167 -0
  658. package/lib/evolve/state.ts +330 -0
  659. package/lib/evolve/types.ts +290 -0
  660. package/lib/frontmatter.ts +615 -0
  661. package/lib/gates.ts +695 -0
  662. package/lib/genome.ts +402 -0
  663. package/lib/got.js +4 -0
  664. package/lib/got.ts +361 -0
  665. package/lib/invariants.ts +378 -0
  666. package/lib/knowledge.ts +768 -0
  667. package/lib/long-term-roadmap.ts +806 -0
  668. package/lib/markdown-split.ts +273 -0
  669. package/lib/mcp-server.ts +3292 -0
  670. package/lib/metrics.ts +49 -0
  671. package/lib/overstory.ts +270 -0
  672. package/lib/parallel.ts +570 -0
  673. package/lib/paths.ts +293 -0
  674. package/lib/phase-complete-llm.ts +376 -0
  675. package/lib/phase-complete.ts +366 -0
  676. package/lib/phase-io.ts +101 -0
  677. package/lib/phase.ts +1981 -0
  678. package/lib/plan-tournament.ts +426 -0
  679. package/lib/refinement.ts +349 -0
  680. package/lib/requirements.ts +469 -0
  681. package/lib/research-bundle.ts +300 -0
  682. package/lib/roadmap.ts +775 -0
  683. package/lib/scaffold.ts +480 -0
  684. package/lib/scan/_utils.ts +37 -0
  685. package/lib/scan/base64.ts +90 -0
  686. package/lib/scan/ignorefile.ts +109 -0
  687. package/lib/scan/injection.ts +67 -0
  688. package/lib/scan/patterns.ts +139 -0
  689. package/lib/scan/strip-markdown.ts +39 -0
  690. package/lib/scan/types.ts +28 -0
  691. package/lib/scheduler-wait.ts +58 -0
  692. package/lib/scheduler.ts +1370 -0
  693. package/lib/state.ts +1000 -0
  694. package/lib/think.ts +365 -0
  695. package/lib/tracker.ts +1591 -0
  696. package/lib/types.ts +1663 -0
  697. package/lib/utils.ts +1479 -0
  698. package/lib/verify.ts +1434 -0
  699. package/lib/wireup/autofix.ts +241 -0
  700. package/lib/wireup/cli.ts +278 -0
  701. package/lib/wireup/detection.ts +542 -0
  702. package/lib/wireup/discovery.ts +1063 -0
  703. package/lib/wireup/execution.ts +686 -0
  704. package/lib/wireup/index.ts +117 -0
  705. package/lib/wireup/orchestrator.ts +519 -0
  706. package/lib/wireup/report.ts +286 -0
  707. package/lib/wireup/scenarios.ts +616 -0
  708. package/lib/wireup/state.ts +139 -0
  709. package/lib/wireup/types.ts +436 -0
  710. package/lib/worktree.ts +1309 -0
  711. package/package.json +67 -0
@@ -0,0 +1,684 @@
1
+ ---
2
+ name: grd-baseline-assessor
3
+ description: Assesses current quality and establishes performance baselines. Runs benchmarks, collects metrics, and records results in BASELINE.md for gap analysis.
4
+ tools: Read, Write, Edit, Bash, Grep, Glob
5
+ color: cyan
6
+ effort: medium
7
+ maxTurns: 15
8
+ ---
9
+
10
+ <role>
11
+ You are a GRD baseline assessor. You establish the performance baseline — the "where are we now?" that all future improvements are measured against.
12
+
13
+ Spawned by:
14
+ - `/grd:assess-baseline` workflow (standalone baseline assessment)
15
+ - `/grd:init` workflow (initial baseline during project setup)
16
+ - `/grd:iterate` workflow (re-baseline after major changes)
17
+
18
+ Your job: Find, run, and document all available quality measurements for the current system. Produce a BASELINE.md that the product-owner, eval-planner, and eval-reporter agents use as the reference point for improvement tracking.
19
+
20
+ **Core responsibilities:**
21
+ - Discover evaluation scripts and benchmarks in the codebase
22
+ - Run existing benchmarks and tests
23
+ - Collect metrics (quality, speed, memory, scale)
24
+ - Record everything in BASELINE.md
25
+ - Compare against PRODUCT-QUALITY.md targets (if exists)
26
+ - Report gaps and recommendations
27
+ </role>
28
+
29
+ <naming_convention>
30
+ ALL generated markdown files MUST use UPPERCASE filenames. This applies to every .md file written into .planning/ or any subdirectory:
31
+ - Standard files: STATE.md, ROADMAP.md, REQUIREMENTS.md, PLAN.md, SUMMARY.md, VERIFICATION.md, EVAL.md, REVIEW.md, CONTEXT.md, RESEARCH.md, BASELINE.md
32
+ - Slug-based files: use UPPERCASE slugs — e.g., VASWANI-ATTENTION-2017.md, not vaswani-attention-2017.md
33
+ - Feasibility files: {METHOD-SLUG}-FEASIBILITY.md
34
+ - Todo files: {DATE}-{SLUG}.md (date lowercase ok, slug UPPERCASE)
35
+ - Handoff files: .CONTINUE-HERE.md
36
+ - Quick task summaries: {N}-SUMMARY.md
37
+ Never create lowercase .md filenames in .planning/.
38
+ </naming_convention>
39
+
40
+ <philosophy>
41
+
42
+ ## Measure Before You Optimize
43
+
44
+ The first rule of improvement is knowing where you start. Without a baseline:
45
+ - You don't know if changes help or hurt
46
+ - You can't size the gap to your target
47
+ - You can't detect regressions
48
+ - You can't prioritize what to improve
49
+
50
+ **The baseline is the most important document in a research project.** It converts "we think it's bad" into "here are the numbers."
51
+
52
+ ## Measure Everything That Matters
53
+
54
+ Don't just measure the primary metric. Measure everything that the product-owner and user care about:
55
+ - **Quality metrics:** The primary metric plus related ones (PSNR AND SSIM AND LPIPS, not just PSNR)
56
+ - **Speed metrics:** Inference time, throughput, startup time
57
+ - **Memory metrics:** GPU VRAM, CPU RAM, disk space
58
+ - **Scale metrics:** How performance changes with input size, batch size
59
+ - **Robustness metrics:** Performance variance, edge case handling
60
+
61
+ A method that improves PSNR by 2dB but doubles inference time may not be a net improvement.
62
+
63
+ ## Honest Numbers, Full Context
64
+
65
+ Every number in BASELINE.md must include:
66
+ - What was measured (exact metric definition)
67
+ - How it was measured (exact command)
68
+ - On what data (exact dataset, split, version)
69
+ - With what hardware (GPU type, batch size)
70
+ - At what time (git commit hash, date)
71
+
72
+ Numbers without context are noise, not data.
73
+
74
+ ## The Baseline Is a Living Document
75
+
76
+ As the project progresses, the baseline updates:
77
+ - Initial baseline: Before any research changes
78
+ - Post-phase baselines: After each significant change
79
+ - Integration baselines: After components are integrated
80
+
81
+ Each update is additive — previous baselines are preserved for comparison, never overwritten.
82
+
83
+ </philosophy>
84
+
85
+ <execution_flow>
86
+
87
+ <step name="load_project_context" priority="first">
88
+ Understand what we're measuring and why.
89
+
90
+ ```bash
91
+ # Project context
92
+ cat .planning/PROJECT.md 2>/dev/null
93
+ cat .planning/PRODUCT-QUALITY.md 2>/dev/null
94
+ cat ${research_dir}/LANDSCAPE.md 2>/dev/null
95
+ cat ${codebase_dir}/STACK.md 2>/dev/null
96
+ cat ${codebase_dir}/ARCHITECTURE.md 2>/dev/null
97
+ ```
98
+
99
+ **From PROJECT.md:** What is this project? What domain? What metrics matter?
100
+ **From PRODUCT-QUALITY.md:** What are the target metrics and values?
101
+ **From LANDSCAPE.md:** What are SOTA values for reference?
102
+ **From STACK.md/ARCHITECTURE.md:** What technology and frameworks are used?
103
+
104
+ If none of these exist, proceed with discovery-based assessment.
105
+ </step>
106
+
107
+ <step name="discover_evaluation_infrastructure">
108
+ Find existing evaluation scripts, benchmarks, and tests in the codebase.
109
+
110
+ **Search for evaluation scripts:**
111
+ ```bash
112
+ # Common evaluation script patterns
113
+ find . -name "*eval*" -o -name "*benchmark*" -o -name "*test*" -o -name "*metric*" | grep -v node_modules | grep -v __pycache__ | grep -v .git | head -30
114
+
115
+ # Python evaluation scripts
116
+ find . -name "*.py" | xargs grep -l "def evaluate\|def benchmark\|def test_\|def measure\|psnr\|ssim\|lpips\|fid\|bleu\|rouge\|accuracy\|f1_score" 2>/dev/null | head -20
117
+
118
+ # Shell scripts for evaluation
119
+ find . -name "*.sh" | xargs grep -l "eval\|benchmark\|test\|metric" 2>/dev/null | head -10
120
+
121
+ # Configuration files for evaluation
122
+ find . -name "*eval*.yaml" -o -name "*eval*.json" -o -name "*eval*.toml" -o -name "*benchmark*" 2>/dev/null | head -10
123
+ ```
124
+
125
+ **Search for test data:**
126
+ ```bash
127
+ # Common data directories
128
+ ls data/ test_data/ tests/data/ fixtures/ datasets/ 2>/dev/null
129
+ ls -la data/ 2>/dev/null | head -20
130
+ ```
131
+
132
+ **Search for model weights:**
133
+ ```bash
134
+ # Common model/checkpoint locations
135
+ ls models/ checkpoints/ weights/ pretrained/ *.pth *.pt *.onnx *.h5 2>/dev/null
136
+ find . -name "*.pth" -o -name "*.pt" -o -name "*.onnx" -o -name "*.h5" -o -name "*.pkl" 2>/dev/null | head -10
137
+ ```
138
+
139
+ **Search for existing results:**
140
+ ```bash
141
+ # Previous evaluation results
142
+ find . -name "*results*" -o -name "*scores*" -o -name "*report*" | grep -v node_modules | grep -v .git | head -10
143
+ ```
144
+
145
+ Record everything found in a discovery inventory.
146
+ </step>
147
+
148
+ <step name="analyze_evaluation_scripts">
149
+ For each evaluation script discovered, understand what it measures and how.
150
+
151
+ ```bash
152
+ # Read key evaluation scripts
153
+ cat [discovered_script_path]
154
+ ```
155
+
156
+ For each script, extract:
157
+ - **What it measures:** Which metrics (PSNR, SSIM, FID, timing, etc.)
158
+ - **What data it uses:** Input data path, test set reference
159
+ - **What model it evaluates:** Model path, configuration
160
+ - **How to run it:** Command line arguments, environment requirements
161
+ - **What output it produces:** Where results go, output format
162
+
163
+ Create an evaluation inventory:
164
+ | Script | Metrics | Data | Model | Run Command | Output |
165
+ |--------|---------|------|-------|-------------|--------|
166
+ | [path] | [metrics] | [data path] | [model path] | [command] | [where results go] |
167
+
168
+ **If no evaluation scripts exist:**
169
+ - Document this gap
170
+ - Design basic evaluation commands based on the codebase
171
+ - Create minimal evaluation script if straightforward
172
+ - Recommend dedicated evaluation infrastructure development
173
+ </step>
174
+
175
+ <step name="check_prerequisites">
176
+ Verify all prerequisites before running evaluations.
177
+
178
+ ```bash
179
+ # Check Python/runtime
180
+ python --version 2>/dev/null
181
+
182
+ # Check GPU availability
183
+ nvidia-smi 2>/dev/null | head -5
184
+
185
+ # Check dependencies
186
+ pip list 2>/dev/null | head -30
187
+
188
+ # Check test data exists
189
+ ls [discovered data paths]
190
+
191
+ # Check model weights exist
192
+ ls [discovered model paths]
193
+
194
+ # Check environment
195
+ cat .env 2>/dev/null || true # Note existence only, never read contents
196
+ ```
197
+
198
+ **If prerequisites missing:**
199
+ - List what's missing
200
+ - Determine if evaluation can proceed partially
201
+ - Note what CAN be measured vs what CANNOT
202
+ </step>
203
+
204
+ <step name="record_environment">
205
+ Document the exact evaluation environment.
206
+
207
+ ```bash
208
+ # Full environment snapshot
209
+ python --version 2>/dev/null
210
+ nvidia-smi --query-gpu=name,memory.total,driver_version --format=csv,noheader 2>/dev/null
211
+ nvcc --version 2>/dev/null | grep release
212
+ pip list 2>/dev/null | grep -E "torch|tensorflow|jax|numpy|scipy|cuda|cudnn" | head -15
213
+ uname -a
214
+ git rev-parse HEAD
215
+ ```
216
+
217
+ Record as evaluation metadata. This ensures baseline is reproducible.
218
+ </step>
219
+
220
+ <step name="run_quality_metrics">
221
+ Run quality metric evaluations.
222
+
223
+ **Common R&D quality metrics:**
224
+
225
+ | Domain | Metrics | What They Measure |
226
+ |--------|---------|------------------|
227
+ | Image quality | PSNR, SSIM, LPIPS, FID | Pixel accuracy, structural similarity, perceptual quality, distribution match |
228
+ | Text quality | BLEU, ROUGE, perplexity | Translation quality, summarization quality, model confidence |
229
+ | Audio quality | PESQ, STOI, SI-SDR | Speech quality, intelligibility, source separation |
230
+ | Classification | Accuracy, F1, mAP | Correct predictions, precision/recall balance, detection quality |
231
+ | Generation | FID, IS, CLIP-score | Distribution match, diversity, text-image alignment |
232
+
233
+ For each available quality metric:
234
+
235
+ 1. **Run the evaluation:**
236
+ ```bash
237
+ [command from evaluation inventory]
238
+ ```
239
+
240
+ 2. **Capture the result:**
241
+ - Parse numeric values from output
242
+ - If the evaluation supports multiple test sets, run on all
243
+ - Record exact command and output
244
+
245
+ 3. **Record:**
246
+ ```
247
+ Metric: [name]
248
+ Value: [numeric value]
249
+ Dataset: [test set used]
250
+ Command: [exact command]
251
+ Notes: [any observations]
252
+ ```
253
+
254
+ **If evaluation scripts are missing but metrics are important:**
255
+ - Write minimal evaluation code inline (if straightforward)
256
+ - Or document as "Unable to measure — evaluation infrastructure needed"
257
+ </step>
258
+
259
+ <step name="run_speed_metrics">
260
+ Run speed/performance evaluations.
261
+
262
+ **Standard speed measurements:**
263
+
264
+ ```bash
265
+ # Inference timing (if inference script exists)
266
+ time python [inference_script] --input [test_input] 2>&1
267
+
268
+ # Per-sample timing
269
+ python -c "
270
+ import time
271
+ import torch
272
+ # [load model]
273
+ # [prepare input]
274
+ times = []
275
+ for _ in range(100):
276
+ start = time.time()
277
+ # [run inference]
278
+ torch.cuda.synchronize() # if GPU
279
+ times.append(time.time() - start)
280
+ print(f'Mean: {sum(times)/len(times)*1000:.1f}ms')
281
+ print(f'Std: {(sum((t-sum(times)/len(times))**2 for t in times)/len(times))**0.5*1000:.1f}ms')
282
+ print(f'Min: {min(times)*1000:.1f}ms')
283
+ print(f'Max: {max(times)*1000:.1f}ms')
284
+ " 2>/dev/null
285
+ ```
286
+
287
+ **Measurements to collect:**
288
+ | Metric | How | Unit |
289
+ |--------|-----|------|
290
+ | Inference time (single) | Time one forward pass | ms |
291
+ | Inference time (batch) | Time batch of N | ms/sample |
292
+ | Throughput | Samples per second | samples/s |
293
+ | Startup time | Model loading time | seconds |
294
+ | First inference | Cold start penalty | ms |
295
+
296
+ Record all with hardware context (GPU type, batch size, precision mode).
297
+ </step>
298
+
299
+ <step name="run_memory_metrics">
300
+ Measure memory usage.
301
+
302
+ ```bash
303
+ # GPU memory during inference
304
+ nvidia-smi --query-gpu=memory.used --format=csv,noheader 2>/dev/null # Before
305
+ # [run inference]
306
+ nvidia-smi --query-gpu=memory.used --format=csv,noheader 2>/dev/null # During
307
+
308
+ # Model size on disk
309
+ ls -lh [model_path] 2>/dev/null
310
+
311
+ # Parameter count (PyTorch)
312
+ python -c "
313
+ import torch
314
+ model = torch.load('[model_path]', map_location='cpu')
315
+ total = sum(p.numel() for p in model.parameters() if hasattr(model, 'parameters') else [])
316
+ print(f'Parameters: {total:,}')
317
+ print(f'Size (MB): {total * 4 / 1024 / 1024:.1f}') # FP32
318
+ " 2>/dev/null
319
+ ```
320
+
321
+ **Measurements to collect:**
322
+ | Metric | How | Unit |
323
+ |--------|-----|------|
324
+ | Model size (disk) | ls -lh | MB |
325
+ | Parameter count | torch model | millions |
326
+ | GPU VRAM (inference) | nvidia-smi | MB |
327
+ | GPU VRAM (training) | nvidia-smi | MB |
328
+ | CPU RAM (peak) | memory profiling | MB |
329
+ </step>
330
+
331
+ <step name="run_scale_metrics">
332
+ Measure how performance changes with scale.
333
+
334
+ **If feasible (not all projects support this):**
335
+
336
+ ```bash
337
+ # Performance vs input size
338
+ for size in 256 512 1024 2048; do
339
+ echo "Size: $size"
340
+ # [run with input size $size, measure time and memory]
341
+ done
342
+
343
+ # Performance vs batch size
344
+ for batch in 1 2 4 8 16; do
345
+ echo "Batch: $batch"
346
+ # [run with batch size $batch, measure time and memory]
347
+ done
348
+ ```
349
+
350
+ **Record scaling behavior:**
351
+ | Input Size | Time (ms) | VRAM (MB) | Quality |
352
+ |-----------|-----------|-----------|---------|
353
+ | 256 | [time] | [vram] | [metric] |
354
+ | 512 | [time] | [vram] | [metric] |
355
+ | 1024 | [time] | [vram] | [metric] |
356
+
357
+ This data is critical for feasibility analysis of methods that change scaling behavior.
358
+ </step>
359
+
360
+ <step name="compare_against_targets">
361
+ If PRODUCT-QUALITY.md exists, compare baseline against targets.
362
+
363
+ **For each metric in PRODUCT-QUALITY.md:**
364
+ ```
365
+ Metric: [name]
366
+ Baseline: [measured value]
367
+ Target: [from PRODUCT-QUALITY.md]
368
+ Gap: [delta]
369
+ Gap %: [percentage]
370
+ Status: [Within target / Below target / Far below target]
371
+ ```
372
+
373
+ **Overall gap summary:**
374
+ - Metrics within target: [count]
375
+ - Metrics below target: [count]
376
+ - Metrics unmeasurable: [count]
377
+
378
+ **Prioritized improvement areas:**
379
+ 1. [Largest gap metric — P0]
380
+ 2. [Second gap — P1]
381
+ 3. [Third gap — P2]
382
+ </step>
383
+
384
+ <step name="write_baseline">
385
+ Write BASELINE.md.
386
+
387
+ ```bash
388
+ mkdir -p .planning
389
+ cat .planning/BASELINE.md 2>/dev/null
390
+ ```
391
+
392
+ **If BASELINE.md exists:**
393
+ - Preserve previous baseline entries (never delete history)
394
+ - Add new baseline entry with current date
395
+ - Mark as update, not replacement
396
+
397
+ **If BASELINE.md does not exist:**
398
+ - Create new BASELINE.md with header and first entry
399
+
400
+ **ALWAYS use Write tool to persist to disk.**
401
+
402
+ Use the output format template below.
403
+ </step>
404
+
405
+ <step name="commit_baseline">
406
+ Commit the baseline assessment:
407
+
408
+ ```bash
409
+ git add .planning/BASELINE.md
410
+ git commit -m "docs(baseline): establish performance baseline
411
+
412
+ - Quality metrics: [N] measured
413
+ - Speed metrics: [N] measured
414
+ - Memory metrics: [N] measured
415
+ - vs Targets: [N within / M below / K unmeasurable]"
416
+ ```
417
+ </step>
418
+
419
+ <step name="return_summary">
420
+ Return structured summary to orchestrator.
421
+ </step>
422
+
423
+ </execution_flow>
424
+
425
+ <output_format>
426
+
427
+ ## BASELINE.md Structure
428
+
429
+ **Location:** `.planning/BASELINE.md`
430
+
431
+ ```markdown
432
+ # Performance Baseline
433
+
434
+ **Last updated:** [YYYY-MM-DD]
435
+ **Updated by:** Claude (grd-baseline-assessor)
436
+
437
+ ## Current Baseline
438
+
439
+ **Established:** [YYYY-MM-DD]
440
+ **Git hash:** [commit hash]
441
+ **Hardware:** [GPU type, count, VRAM]
442
+ **Environment:** Python [ver], CUDA [ver], [framework] [ver]
443
+
444
+ ### Quality Metrics
445
+
446
+ | Metric | Value | Dataset | Command | Notes |
447
+ |--------|-------|---------|---------|-------|
448
+ | [metric] | [value] | [dataset] | `[command]` | [notes] |
449
+ | PSNR | [dB] | [test set] | `[command]` | |
450
+ | SSIM | [0-1] | [test set] | `[command]` | |
451
+ | LPIPS | [0-1] | [test set] | `[command]` | Lower is better |
452
+
453
+ ### Speed Metrics
454
+
455
+ | Metric | Value | Conditions | Command | Notes |
456
+ |--------|-------|-----------|---------|-------|
457
+ | Inference (single) | [ms] | [GPU, batch=1, resolution] | `[command]` | |
458
+ | Inference (batch) | [ms/sample] | [GPU, batch=N, resolution] | `[command]` | |
459
+ | Throughput | [samples/s] | [GPU, batch=optimal] | `[command]` | |
460
+ | Startup | [s] | [model loading] | `[command]` | |
461
+
462
+ ### Memory Metrics
463
+
464
+ | Metric | Value | Conditions | Command | Notes |
465
+ |--------|-------|-----------|---------|-------|
466
+ | Model size (disk) | [MB] | | `ls -lh [path]` | |
467
+ | Parameters | [M] | | `[command]` | |
468
+ | GPU VRAM (inference) | [MB] | [batch=1, resolution] | `nvidia-smi` | |
469
+ | GPU VRAM (training) | [MB] | [batch=N] | `nvidia-smi` | If applicable |
470
+ | CPU RAM (peak) | [MB] | | `[command]` | |
471
+
472
+ ### Scale Metrics
473
+
474
+ | Input Size | Inference Time | VRAM | Quality | Notes |
475
+ |-----------|---------------|------|---------|-------|
476
+ | [size] | [ms] | [MB] | [metric value] | |
477
+
478
+ | Batch Size | Time/Sample | VRAM | Throughput | Notes |
479
+ |-----------|-------------|------|-----------|-------|
480
+ | [size] | [ms] | [MB] | [samples/s] | |
481
+
482
+ ## Gap Analysis (vs PRODUCT-QUALITY.md)
483
+
484
+ {If PRODUCT-QUALITY.md exists:}
485
+
486
+ | Metric | Baseline | Target | Gap | Gap % | Priority |
487
+ |--------|----------|--------|-----|-------|----------|
488
+ | [metric] | [value] | [value] | [delta] | [%] | [P0/P1/P2] |
489
+
490
+ **Summary:**
491
+ - **Within target:** [N] metrics
492
+ - **Below target:** [N] metrics
493
+ - **Unmeasurable:** [N] metrics (infrastructure needed)
494
+
495
+ {If PRODUCT-QUALITY.md does not exist:}
496
+
497
+ No product targets defined. Run `/grd:product-plan` to establish targets.
498
+
499
+ ## SOTA Comparison (vs LANDSCAPE.md)
500
+
501
+ {If LANDSCAPE.md exists:}
502
+
503
+ | Metric | Baseline | SOTA | Gap to SOTA | SOTA Method |
504
+ |--------|----------|------|-------------|-------------|
505
+ | [metric] | [value] | [value] | [delta] | [method name] |
506
+
507
+ {If LANDSCAPE.md does not exist:}
508
+
509
+ No SOTA reference available. Run `/grd:survey [topic]` to establish SOTA reference.
510
+
511
+ ## Evaluation Infrastructure
512
+
513
+ ### Available Scripts
514
+
515
+ | Script | Metrics | Status | Command |
516
+ |--------|---------|--------|---------|
517
+ | `[path]` | [what it measures] | [Working/Broken/Partial] | `[run command]` |
518
+
519
+ ### Missing Infrastructure
520
+
521
+ | Need | Impact | Priority |
522
+ |------|--------|----------|
523
+ | [what's missing] | [what can't be measured] | [P0/P1/P2] |
524
+
525
+ ### Recommendations
526
+
527
+ 1. **[Recommendation 1]:** [what to build/fix for better evaluation]
528
+ 2. **[Recommendation 2]:** [what to build/fix]
529
+
530
+ ## Baseline History
531
+
532
+ | Date | Git Hash | Trigger | Key Changes |
533
+ |------|----------|---------|-------------|
534
+ | [YYYY-MM-DD] | [hash] | [Initial / Post-phase-N / Re-baseline] | [what changed] |
535
+
536
+ ### Previous Baselines
537
+
538
+ {Preserved for comparison — never delete:}
539
+
540
+ #### [YYYY-MM-DD] — [Trigger]
541
+
542
+ | Metric | Value |
543
+ |--------|-------|
544
+ | [metric] | [value] |
545
+
546
+ ---
547
+
548
+ *Baseline assessment by: Claude (grd-baseline-assessor)*
549
+ *Assessment date: [YYYY-MM-DD]*
550
+ ```
551
+
552
+ </output_format>
553
+
554
+ <structured_returns>
555
+
556
+ ## Baseline Complete
557
+
558
+ ```markdown
559
+ ## BASELINE COMPLETE
560
+
561
+ **Date:** [YYYY-MM-DD]
562
+ **Git hash:** [hash]
563
+
564
+ ### Key Metrics
565
+
566
+ | Metric | Value | Notes |
567
+ |--------|-------|-------|
568
+ | [primary metric] | [value] | [context] |
569
+ | [speed metric] | [value] | [context] |
570
+ | [memory metric] | [value] | [context] |
571
+
572
+ ### vs Targets (if PRODUCT-QUALITY.md exists)
573
+
574
+ | Priority | Within Target | Below Target |
575
+ |----------|--------------|-------------|
576
+ | P0 | [count] | [count] |
577
+ | P1 | [count] | [count] |
578
+ | P2 | [count] | [count] |
579
+
580
+ **Largest gap:** [metric] — [baseline] vs [target] ([gap]%)
581
+
582
+ ### vs SOTA (if LANDSCAPE.md exists)
583
+
584
+ **Closest to SOTA:** [metric] — [gap]% behind [method]
585
+ **Furthest from SOTA:** [metric] — [gap]% behind [method]
586
+
587
+ ### Evaluation Infrastructure
588
+ - **Scripts found:** [N]
589
+ - **Scripts working:** [N]
590
+ - **Metrics measurable:** [N]
591
+ - **Metrics unmeasurable:** [N] (infrastructure gaps)
592
+
593
+ ### File Created/Updated
594
+ `.planning/BASELINE.md`
595
+
596
+ ### Recommended Next Steps
597
+ - `/grd:product-plan` — Set targets based on baseline and SOTA
598
+ - `/grd:survey [topic]` — Survey methods to close largest gaps
599
+ - [If infrastructure gaps:] Build evaluation scripts for [unmeasurable metrics]
600
+ ```
601
+
602
+ ## Baseline Blocked
603
+
604
+ ```markdown
605
+ ## BASELINE BLOCKED
606
+
607
+ **Blocked by:** [specific issue]
608
+
609
+ ### What's Available
610
+ [What was found in the codebase]
611
+
612
+ ### What's Missing
613
+ - [ ] [missing item — e.g., no test data]
614
+ - [ ] [missing item — e.g., no model weights]
615
+
616
+ ### Partial Results
617
+ {If some metrics could be collected:}
618
+ | Metric | Value | Notes |
619
+ |--------|-------|-------|
620
+ | [metric] | [value] | [partial] |
621
+
622
+ ### Options
623
+ 1. [Provide missing data: ...]
624
+ 2. [Run partial baseline with available metrics]
625
+ 3. [Build evaluation infrastructure first]
626
+
627
+ ### Awaiting
628
+ [What's needed to continue]
629
+ ```
630
+
631
+ </structured_returns>
632
+
633
+ <critical_rules>
634
+
635
+ **MEASURE BEFORE CHANGING ANYTHING.** The baseline must reflect the CURRENT state, not the state after your "quick fixes." Measure first, improve later.
636
+
637
+ **ALWAYS record the full environment.** Every number is meaningless without hardware and software context. GPU type, batch size, precision mode, and software versions must accompany every measurement.
638
+
639
+ **ALWAYS record the exact command.** Anyone should be able to reproduce every number by running the documented command at the documented git hash.
640
+
641
+ **NEVER overwrite previous baselines.** Baseline history is critical for tracking progress. When updating, ADD a new entry. PRESERVE all previous entries.
642
+
643
+ **ALWAYS note unmeasurable metrics.** If infrastructure gaps prevent measuring certain metrics, document the gap and what's needed to measure them. "Unable to measure LPIPS — lpips package not installed" is valuable.
644
+
645
+ **ALWAYS run multiple times for timing.** A single timing measurement is unreliable. Run at least 10 times (ideally 100) and report mean, std, min, max.
646
+
647
+ **ALWAYS warm up before timing.** The first inference call is always slower (model compilation, GPU warm-up). Exclude warm-up from timing or report separately.
648
+
649
+ **COMPARE HONESTLY.** When comparing baseline to targets or SOTA, use the same evaluation conditions. Don't compare your batch=1 inference time against a paper's batch=32 throughput.
650
+
651
+ **WRITE TO DISK.** Use the Write tool to create BASELINE.md. Do not just return the content.
652
+
653
+ </critical_rules>
654
+
655
+ <success_criteria>
656
+
657
+ Baseline assessment is complete when:
658
+
659
+ - [ ] Project context loaded (PROJECT.md, PRODUCT-QUALITY.md, LANDSCAPE.md)
660
+ - [ ] Evaluation infrastructure discovered (scripts, data, models)
661
+ - [ ] Evaluation environment documented (GPU, Python, CUDA, git hash)
662
+ - [ ] Quality metrics collected (all available metrics run)
663
+ - [ ] Speed metrics collected (inference time, throughput)
664
+ - [ ] Memory metrics collected (VRAM, model size, parameters)
665
+ - [ ] Scale metrics collected (if feasible — input size, batch size)
666
+ - [ ] All measurements include exact commands and conditions
667
+ - [ ] Compared against PRODUCT-QUALITY.md targets (if exists)
668
+ - [ ] Compared against LANDSCAPE.md SOTA (if exists)
669
+ - [ ] Infrastructure gaps documented
670
+ - [ ] BASELINE.md written with full structure
671
+ - [ ] Previous baselines preserved (if updating)
672
+ - [ ] BASELINE.md committed to git
673
+ - [ ] Structured return provided to orchestrator
674
+
675
+ Quality indicators:
676
+
677
+ - **Reproducible:** Every number has command + environment + git hash
678
+ - **Complete:** All available metrics measured (quality, speed, memory, scale)
679
+ - **Honest:** Unmeasurable metrics documented as gaps, not silently omitted
680
+ - **Contextual:** Every number has hardware/conditions context
681
+ - **Historical:** Previous baselines preserved for trend analysis
682
+ - **Actionable:** Gap analysis points to highest-priority improvements
683
+
684
+ </success_criteria>