@rubix0270/arboris 1.0.2 → 1.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (451) hide show
  1. package/package.json +8 -20
  2. package/run.mjs +10 -0
  3. package/dist/cli.mjs +0 -383
  4. package/manifest.json +0 -323
  5. package/prisma/skills/accessibility/SKILL.md +0 -147
  6. package/prisma/skills/agent-architecture-audit/SKILL.md +0 -257
  7. package/prisma/skills/agent-eval/SKILL.md +0 -146
  8. package/prisma/skills/agent-harness-construction/SKILL.md +0 -74
  9. package/prisma/skills/agent-introspection-debugging/SKILL.md +0 -154
  10. package/prisma/skills/agent-payment-x402/SKILL.md +0 -225
  11. package/prisma/skills/agent-self-evaluation/SKILL.md +0 -182
  12. package/prisma/skills/agent-self-evaluation/examples/high-score-example.md +0 -87
  13. package/prisma/skills/agent-self-evaluation/examples/low-score-example.md +0 -86
  14. package/prisma/skills/agent-self-evaluation/references/evaluation-criteria.md +0 -71
  15. package/prisma/skills/agent-self-evaluation/references/hook-integration.md +0 -64
  16. package/prisma/skills/agent-self-evaluation/scripts/evaluate.py +0 -408
  17. package/prisma/skills/agent-self-evaluation/templates/evaluation-report.md +0 -86
  18. package/prisma/skills/agent-sort/SKILL.md +0 -216
  19. package/prisma/skills/agentic-engineering/SKILL.md +0 -64
  20. package/prisma/skills/agentic-os/SKILL.md +0 -388
  21. package/prisma/skills/ai-first-engineering/SKILL.md +0 -52
  22. package/prisma/skills/ai-regression-testing/SKILL.md +0 -386
  23. package/prisma/skills/android-clean-architecture/SKILL.md +0 -340
  24. package/prisma/skills/angular-developer/SKILL.md +0 -155
  25. package/prisma/skills/angular-developer/references/angular-animations.md +0 -160
  26. package/prisma/skills/angular-developer/references/angular-aria.md +0 -410
  27. package/prisma/skills/angular-developer/references/cli.md +0 -86
  28. package/prisma/skills/angular-developer/references/component-harnesses.md +0 -59
  29. package/prisma/skills/angular-developer/references/component-styling.md +0 -91
  30. package/prisma/skills/angular-developer/references/components.md +0 -117
  31. package/prisma/skills/angular-developer/references/creating-services.md +0 -97
  32. package/prisma/skills/angular-developer/references/data-resolvers.md +0 -69
  33. package/prisma/skills/angular-developer/references/define-routes.md +0 -67
  34. package/prisma/skills/angular-developer/references/defining-providers.md +0 -72
  35. package/prisma/skills/angular-developer/references/di-fundamentals.md +0 -120
  36. package/prisma/skills/angular-developer/references/e2e-testing.md +0 -56
  37. package/prisma/skills/angular-developer/references/effects.md +0 -83
  38. package/prisma/skills/angular-developer/references/hierarchical-injectors.md +0 -43
  39. package/prisma/skills/angular-developer/references/host-elements.md +0 -80
  40. package/prisma/skills/angular-developer/references/injection-context.md +0 -63
  41. package/prisma/skills/angular-developer/references/inputs.md +0 -101
  42. package/prisma/skills/angular-developer/references/linked-signal.md +0 -59
  43. package/prisma/skills/angular-developer/references/loading-strategies.md +0 -61
  44. package/prisma/skills/angular-developer/references/mcp.md +0 -108
  45. package/prisma/skills/angular-developer/references/navigate-to-routes.md +0 -69
  46. package/prisma/skills/angular-developer/references/outputs.md +0 -86
  47. package/prisma/skills/angular-developer/references/reactive-forms.md +0 -122
  48. package/prisma/skills/angular-developer/references/rendering-strategies.md +0 -44
  49. package/prisma/skills/angular-developer/references/resource.md +0 -77
  50. package/prisma/skills/angular-developer/references/route-animations.md +0 -56
  51. package/prisma/skills/angular-developer/references/route-guards.md +0 -52
  52. package/prisma/skills/angular-developer/references/router-lifecycle.md +0 -45
  53. package/prisma/skills/angular-developer/references/router-testing.md +0 -87
  54. package/prisma/skills/angular-developer/references/show-routes-with-outlets.md +0 -68
  55. package/prisma/skills/angular-developer/references/signal-forms.md +0 -795
  56. package/prisma/skills/angular-developer/references/signals-overview.md +0 -94
  57. package/prisma/skills/angular-developer/references/tailwind-css.md +0 -69
  58. package/prisma/skills/angular-developer/references/template-driven-forms.md +0 -114
  59. package/prisma/skills/angular-developer/references/testing-fundamentals.md +0 -65
  60. package/prisma/skills/api-connector-builder/SKILL.md +0 -121
  61. package/prisma/skills/api-design/SKILL.md +0 -524
  62. package/prisma/skills/architecture-decision-records/SKILL.md +0 -180
  63. package/prisma/skills/article-writing/SKILL.md +0 -80
  64. package/prisma/skills/automation-audit-ops/SKILL.md +0 -143
  65. package/prisma/skills/autonomous-agent-harness/SKILL.md +0 -274
  66. package/prisma/skills/autonomous-loops/SKILL.md +0 -611
  67. package/prisma/skills/backend-patterns/SKILL.md +0 -562
  68. package/prisma/skills/benchmark/SKILL.md +0 -94
  69. package/prisma/skills/benchmark-methodology/SKILL.md +0 -190
  70. package/prisma/skills/benchmark-optimization-loop/SKILL.md +0 -70
  71. package/prisma/skills/blender-motion-state-inspection/SKILL.md +0 -165
  72. package/prisma/skills/blueprint/SKILL.md +0 -106
  73. package/prisma/skills/brand-discovery/SKILL.md +0 -145
  74. package/prisma/skills/brand-discovery/references/10_purpose-why.md +0 -40
  75. package/prisma/skills/brand-discovery/references/20_positioning.md +0 -44
  76. package/prisma/skills/brand-discovery/references/30_audience-niche.md +0 -52
  77. package/prisma/skills/brand-discovery/references/40_personality-archetype.md +0 -57
  78. package/prisma/skills/brand-discovery/references/50_voice-tone.md +0 -59
  79. package/prisma/skills/brand-discovery/references/60_narrative-story.md +0 -50
  80. package/prisma/skills/brand-discovery/references/70_founder-tension.md +0 -49
  81. package/prisma/skills/brand-discovery/references/90_SYNTHESIS.md +0 -133
  82. package/prisma/skills/brand-voice/SKILL.md +0 -98
  83. package/prisma/skills/brand-voice/references/voice-profile-schema.md +0 -55
  84. package/prisma/skills/browser-qa/SKILL.md +0 -105
  85. package/prisma/skills/bun-runtime/SKILL.md +0 -85
  86. package/prisma/skills/canary-watch/SKILL.md +0 -108
  87. package/prisma/skills/carrier-relationship-management/SKILL.md +0 -212
  88. package/prisma/skills/cisco-ios-patterns/SKILL.md +0 -164
  89. package/prisma/skills/ck/SKILL.md +0 -148
  90. package/prisma/skills/ck/commands/forget.mjs +0 -44
  91. package/prisma/skills/ck/commands/info.mjs +0 -24
  92. package/prisma/skills/ck/commands/init.mjs +0 -143
  93. package/prisma/skills/ck/commands/list.mjs +0 -40
  94. package/prisma/skills/ck/commands/migrate.mjs +0 -202
  95. package/prisma/skills/ck/commands/resume.mjs +0 -36
  96. package/prisma/skills/ck/commands/save.mjs +0 -210
  97. package/prisma/skills/ck/commands/shared.mjs +0 -387
  98. package/prisma/skills/ck/hooks/session-start.mjs +0 -224
  99. package/prisma/skills/claude-devfleet/SKILL.md +0 -112
  100. package/prisma/skills/click-path-audit/SKILL.md +0 -245
  101. package/prisma/skills/clickhouse-io/SKILL.md +0 -440
  102. package/prisma/skills/code-tour/SKILL.md +0 -254
  103. package/prisma/skills/codebase-onboarding/SKILL.md +0 -234
  104. package/prisma/skills/codehealth-mcp/SKILL.md +0 -167
  105. package/prisma/skills/coding-standards/SKILL.md +0 -551
  106. package/prisma/skills/competitive-platform-analysis/SKILL.md +0 -214
  107. package/prisma/skills/competitive-report-structure/SKILL.md +0 -162
  108. package/prisma/skills/compose-multiplatform-patterns/SKILL.md +0 -300
  109. package/prisma/skills/config-gc/SKILL.md +0 -120
  110. package/prisma/skills/configure-ecc/SKILL.md +0 -385
  111. package/prisma/skills/connections-optimizer/SKILL.md +0 -190
  112. package/prisma/skills/content-engine/SKILL.md +0 -132
  113. package/prisma/skills/content-hash-cache-pattern/SKILL.md +0 -162
  114. package/prisma/skills/context-budget/SKILL.md +0 -136
  115. package/prisma/skills/continuous-agent-loop/SKILL.md +0 -46
  116. package/prisma/skills/continuous-learning/SKILL.md +0 -132
  117. package/prisma/skills/continuous-learning/config.json +0 -18
  118. package/prisma/skills/continuous-learning/evaluate-session.sh +0 -69
  119. package/prisma/skills/continuous-learning-v2/SKILL.md +0 -361
  120. package/prisma/skills/continuous-learning-v2/agents/observer-loop.sh +0 -359
  121. package/prisma/skills/continuous-learning-v2/agents/observer.md +0 -189
  122. package/prisma/skills/continuous-learning-v2/agents/session-guardian.sh +0 -150
  123. package/prisma/skills/continuous-learning-v2/agents/start-observer.sh +0 -248
  124. package/prisma/skills/continuous-learning-v2/config.json +0 -8
  125. package/prisma/skills/continuous-learning-v2/hooks/observe.sh +0 -585
  126. package/prisma/skills/continuous-learning-v2/scripts/detect-project.sh +0 -322
  127. package/prisma/skills/continuous-learning-v2/scripts/instinct-cli.py +0 -1956
  128. package/prisma/skills/continuous-learning-v2/scripts/lib/homunculus-dir.sh +0 -31
  129. package/prisma/skills/continuous-learning-v2/scripts/migrate-homunculus.sh +0 -68
  130. package/prisma/skills/continuous-learning-v2/scripts/test_parse_instinct.py +0 -1421
  131. package/prisma/skills/cost-aware-llm-pipeline/SKILL.md +0 -184
  132. package/prisma/skills/cost-tracking/SKILL.md +0 -97
  133. package/prisma/skills/council/SKILL.md +0 -204
  134. package/prisma/skills/cpp-coding-standards/SKILL.md +0 -724
  135. package/prisma/skills/cpp-testing/SKILL.md +0 -325
  136. package/prisma/skills/crosspost/SKILL.md +0 -112
  137. package/prisma/skills/csharp-testing/SKILL.md +0 -322
  138. package/prisma/skills/customer-billing-ops/SKILL.md +0 -141
  139. package/prisma/skills/customs-trade-compliance/SKILL.md +0 -263
  140. package/prisma/skills/dart-flutter-patterns/SKILL.md +0 -564
  141. package/prisma/skills/dashboard-builder/SKILL.md +0 -109
  142. package/prisma/skills/data-scraper-agent/SKILL.md +0 -765
  143. package/prisma/skills/data-throughput-accelerator/SKILL.md +0 -73
  144. package/prisma/skills/database-migrations/SKILL.md +0 -430
  145. package/prisma/skills/deep-research/SKILL.md +0 -160
  146. package/prisma/skills/defi-amm-security/SKILL.md +0 -167
  147. package/prisma/skills/delivery-gate/SKILL.md +0 -126
  148. package/prisma/skills/delivery-gate/hooks/quality-gate.py +0 -220
  149. package/prisma/skills/deployment-patterns/SKILL.md +0 -428
  150. package/prisma/skills/design-system/SKILL.md +0 -83
  151. package/prisma/skills/django-celery/SKILL.md +0 -458
  152. package/prisma/skills/django-patterns/SKILL.md +0 -735
  153. package/prisma/skills/django-security/SKILL.md +0 -644
  154. package/prisma/skills/django-tdd/SKILL.md +0 -730
  155. package/prisma/skills/django-verification/SKILL.md +0 -470
  156. package/prisma/skills/dmux-workflows/SKILL.md +0 -192
  157. package/prisma/skills/docker-patterns/SKILL.md +0 -365
  158. package/prisma/skills/documentation-lookup/SKILL.md +0 -91
  159. package/prisma/skills/dotnet-patterns/SKILL.md +0 -322
  160. package/prisma/skills/dynamic-workflow-mode/SKILL.md +0 -124
  161. package/prisma/skills/e2e-testing/SKILL.md +0 -327
  162. package/prisma/skills/ecc-guide/SKILL.md +0 -190
  163. package/prisma/skills/ecc-recipes/SKILL.md +0 -149
  164. package/prisma/skills/ecc-tools-cost-audit/SKILL.md +0 -161
  165. package/prisma/skills/email-ops/SKILL.md +0 -122
  166. package/prisma/skills/energy-procurement/SKILL.md +0 -228
  167. package/prisma/skills/enterprise-agent-ops/SKILL.md +0 -51
  168. package/prisma/skills/error-handling/SKILL.md +0 -377
  169. package/prisma/skills/eval-harness/SKILL.md +0 -271
  170. package/prisma/skills/evm-token-decimals/SKILL.md +0 -131
  171. package/prisma/skills/exa-search/SKILL.md +0 -108
  172. package/prisma/skills/fal-ai-media/SKILL.md +0 -289
  173. package/prisma/skills/fastapi-patterns/SKILL.md +0 -514
  174. package/prisma/skills/finance-billing-ops/SKILL.md +0 -128
  175. package/prisma/skills/flox-environments/SKILL.md +0 -497
  176. package/prisma/skills/flutter-dart-code-review/SKILL.md +0 -436
  177. package/prisma/skills/foundation-models-on-device/SKILL.md +0 -243
  178. package/prisma/skills/frontend-a11y/SKILL.md +0 -446
  179. package/prisma/skills/frontend-design-direction/SKILL.md +0 -93
  180. package/prisma/skills/frontend-patterns/SKILL.md +0 -657
  181. package/prisma/skills/frontend-slides/SKILL.md +0 -185
  182. package/prisma/skills/frontend-slides/STYLE_PRESETS.md +0 -330
  183. package/prisma/skills/frontend-slides/animation-patterns.md +0 -122
  184. package/prisma/skills/frontend-slides/html-template.md +0 -419
  185. package/prisma/skills/frontend-slides/scripts/export-pdf.sh +0 -418
  186. package/prisma/skills/frontend-slides/scripts/extract-pptx.py +0 -96
  187. package/prisma/skills/frontend-slides/viewport-base.css +0 -153
  188. package/prisma/skills/fsharp-testing/SKILL.md +0 -281
  189. package/prisma/skills/gan-style-harness/SKILL.md +0 -279
  190. package/prisma/skills/gateguard/SKILL.md +0 -133
  191. package/prisma/skills/generating-python-installer/SKILL.md +0 -820
  192. package/prisma/skills/git-workflow/SKILL.md +0 -716
  193. package/prisma/skills/github-ops/SKILL.md +0 -145
  194. package/prisma/skills/golang-patterns/SKILL.md +0 -675
  195. package/prisma/skills/golang-testing/SKILL.md +0 -721
  196. package/prisma/skills/google-workspace-ops/SKILL.md +0 -96
  197. package/prisma/skills/growth-log/SKILL.md +0 -128
  198. package/prisma/skills/healthcare-cdss-patterns/SKILL.md +0 -246
  199. package/prisma/skills/healthcare-emr-patterns/SKILL.md +0 -160
  200. package/prisma/skills/healthcare-eval-harness/SKILL.md +0 -208
  201. package/prisma/skills/healthcare-phi-compliance/SKILL.md +0 -146
  202. package/prisma/skills/hermes-imports/SKILL.md +0 -89
  203. package/prisma/skills/hexagonal-architecture/SKILL.md +0 -277
  204. package/prisma/skills/hipaa-compliance/SKILL.md +0 -79
  205. package/prisma/skills/homelab-network-readiness/SKILL.md +0 -170
  206. package/prisma/skills/homelab-network-setup/SKILL.md +0 -130
  207. package/prisma/skills/homelab-pihole-dns/SKILL.md +0 -275
  208. package/prisma/skills/homelab-vlan-segmentation/SKILL.md +0 -312
  209. package/prisma/skills/homelab-wireguard-vpn/SKILL.md +0 -306
  210. package/prisma/skills/hookify-rules/SKILL.md +0 -128
  211. package/prisma/skills/inherit-legacy-style/SKILL.md +0 -157
  212. package/prisma/skills/intent-driven-development/SKILL.md +0 -360
  213. package/prisma/skills/inventory-demand-planning/SKILL.md +0 -247
  214. package/prisma/skills/investor-materials/SKILL.md +0 -97
  215. package/prisma/skills/investor-outreach/SKILL.md +0 -92
  216. package/prisma/skills/ios-icon-gen/SKILL.md +0 -158
  217. package/prisma/skills/ios-icon-gen/scripts/generate_icons.swift +0 -258
  218. package/prisma/skills/ios-icon-gen/scripts/iconify_gen.sh +0 -235
  219. package/prisma/skills/iterative-retrieval/SKILL.md +0 -212
  220. package/prisma/skills/ito-basket-compare/SKILL.md +0 -64
  221. package/prisma/skills/ito-data-atlas-agent/SKILL.md +0 -64
  222. package/prisma/skills/ito-market-intelligence/SKILL.md +0 -61
  223. package/prisma/skills/ito-trade-planner/SKILL.md +0 -68
  224. package/prisma/skills/java-coding-standards/SKILL.md +0 -384
  225. package/prisma/skills/jira-integration/SKILL.md +0 -303
  226. package/prisma/skills/jpa-patterns/SKILL.md +0 -152
  227. package/prisma/skills/knowledge-ops/SKILL.md +0 -155
  228. package/prisma/skills/kotlin-coroutines-flows/SKILL.md +0 -285
  229. package/prisma/skills/kotlin-exposed-patterns/SKILL.md +0 -720
  230. package/prisma/skills/kotlin-ktor-patterns/SKILL.md +0 -690
  231. package/prisma/skills/kotlin-patterns/SKILL.md +0 -712
  232. package/prisma/skills/kotlin-testing/SKILL.md +0 -825
  233. package/prisma/skills/kubernetes-patterns/SKILL.md +0 -756
  234. package/prisma/skills/laravel-patterns/SKILL.md +0 -416
  235. package/prisma/skills/laravel-plugin-discovery/SKILL.md +0 -230
  236. package/prisma/skills/laravel-security/SKILL.md +0 -948
  237. package/prisma/skills/laravel-tdd/SKILL.md +0 -675
  238. package/prisma/skills/laravel-verification/SKILL.md +0 -180
  239. package/prisma/skills/latency-critical-systems/SKILL.md +0 -74
  240. package/prisma/skills/lead-intelligence/SKILL.md +0 -322
  241. package/prisma/skills/lead-intelligence/agents/enrichment-agent.md +0 -85
  242. package/prisma/skills/lead-intelligence/agents/mutual-mapper.md +0 -75
  243. package/prisma/skills/lead-intelligence/agents/outreach-drafter.md +0 -98
  244. package/prisma/skills/lead-intelligence/agents/signal-scorer.md +0 -60
  245. package/prisma/skills/liquid-glass-design/SKILL.md +0 -279
  246. package/prisma/skills/llm-trading-agent-security/SKILL.md +0 -147
  247. package/prisma/skills/logistics-exception-management/SKILL.md +0 -222
  248. package/prisma/skills/loop-design-check/SKILL.md +0 -143
  249. package/prisma/skills/mailtrap-email-integration/SKILL.md +0 -77
  250. package/prisma/skills/make-interfaces-feel-better/SKILL.md +0 -152
  251. package/prisma/skills/manim-video/SKILL.md +0 -90
  252. package/prisma/skills/manim-video/assets/network_graph_scene.py +0 -52
  253. package/prisma/skills/market-research/SKILL.md +0 -76
  254. package/prisma/skills/marketing-campaign/SKILL.md +0 -114
  255. package/prisma/skills/mcp-server-patterns/SKILL.md +0 -70
  256. package/prisma/skills/messages-ops/SKILL.md +0 -105
  257. package/prisma/skills/ml-adoption-playbook/SKILL.md +0 -57
  258. package/prisma/skills/mle-workflow/SKILL.md +0 -347
  259. package/prisma/skills/motion-advanced/SKILL.md +0 -596
  260. package/prisma/skills/motion-foundations/SKILL.md +0 -299
  261. package/prisma/skills/motion-patterns/SKILL.md +0 -434
  262. package/prisma/skills/motion-ui/SKILL.md +0 -576
  263. package/prisma/skills/mysql-patterns/SKILL.md +0 -413
  264. package/prisma/skills/nanoclaw-repl/SKILL.md +0 -34
  265. package/prisma/skills/nestjs-patterns/SKILL.md +0 -231
  266. package/prisma/skills/netmiko-ssh-automation/SKILL.md +0 -174
  267. package/prisma/skills/network-bgp-diagnostics/SKILL.md +0 -168
  268. package/prisma/skills/network-config-validation/SKILL.md +0 -211
  269. package/prisma/skills/network-interface-health/SKILL.md +0 -153
  270. package/prisma/skills/nextjs-turbopack/SKILL.md +0 -58
  271. package/prisma/skills/nodejs-keccak256/SKILL.md +0 -103
  272. package/prisma/skills/nutrient-document-processing/SKILL.md +0 -168
  273. package/prisma/skills/nuxt4-patterns/SKILL.md +0 -101
  274. package/prisma/skills/openclaw-persona-forge/SKILL.md +0 -289
  275. package/prisma/skills/openclaw-persona-forge/gacha.py +0 -224
  276. package/prisma/skills/openclaw-persona-forge/gacha.sh +0 -5
  277. package/prisma/skills/openclaw-persona-forge/references/avatar-style.md +0 -124
  278. package/prisma/skills/openclaw-persona-forge/references/boundary-rules.md +0 -53
  279. package/prisma/skills/openclaw-persona-forge/references/error-handling.md +0 -53
  280. package/prisma/skills/openclaw-persona-forge/references/identity-tension.md +0 -48
  281. package/prisma/skills/openclaw-persona-forge/references/naming-system.md +0 -39
  282. package/prisma/skills/openclaw-persona-forge/references/output-template.md +0 -166
  283. package/prisma/skills/opensource-pipeline/SKILL.md +0 -256
  284. package/prisma/skills/orch-add-feature/SKILL.md +0 -45
  285. package/prisma/skills/orch-build-mvp/SKILL.md +0 -49
  286. package/prisma/skills/orch-change-feature/SKILL.md +0 -43
  287. package/prisma/skills/orch-fix-defect/SKILL.md +0 -43
  288. package/prisma/skills/orch-pipeline/SKILL.md +0 -121
  289. package/prisma/skills/orch-refine-code/SKILL.md +0 -44
  290. package/prisma/skills/parallel-execution-optimizer/SKILL.md +0 -73
  291. package/prisma/skills/perl-patterns/SKILL.md +0 -505
  292. package/prisma/skills/perl-security/SKILL.md +0 -504
  293. package/prisma/skills/perl-testing/SKILL.md +0 -476
  294. package/prisma/skills/plan-orchestrate/SKILL.md +0 -263
  295. package/prisma/skills/plankton-code-quality/SKILL.md +0 -237
  296. package/prisma/skills/postgres-patterns/SKILL.md +0 -148
  297. package/prisma/skills/prediction-market-oracle-research/SKILL.md +0 -64
  298. package/prisma/skills/prediction-market-risk-review/SKILL.md +0 -61
  299. package/prisma/skills/prisma-patterns/SKILL.md +0 -401
  300. package/prisma/skills/product-capability/SKILL.md +0 -142
  301. package/prisma/skills/product-lens/SKILL.md +0 -93
  302. package/prisma/skills/production-audit/SKILL.md +0 -207
  303. package/prisma/skills/production-scheduling/SKILL.md +0 -238
  304. package/prisma/skills/project-flow-ops/SKILL.md +0 -112
  305. package/prisma/skills/prompt-optimizer/SKILL.md +0 -398
  306. package/prisma/skills/python-patterns/SKILL.md +0 -751
  307. package/prisma/skills/python-testing/SKILL.md +0 -817
  308. package/prisma/skills/pytorch-patterns/SKILL.md +0 -397
  309. package/prisma/skills/quality-nonconformance/SKILL.md +0 -260
  310. package/prisma/skills/quarkus-patterns/SKILL.md +0 -723
  311. package/prisma/skills/quarkus-security/SKILL.md +0 -468
  312. package/prisma/skills/quarkus-tdd/SKILL.md +0 -812
  313. package/prisma/skills/quarkus-verification/SKILL.md +0 -480
  314. package/prisma/skills/ralphinho-rfc-pipeline/SKILL.md +0 -68
  315. package/prisma/skills/react-native-patterns/SKILL.md +0 -326
  316. package/prisma/skills/react-patterns/SKILL.md +0 -342
  317. package/prisma/skills/react-performance/SKILL.md +0 -575
  318. package/prisma/skills/react-testing/SKILL.md +0 -424
  319. package/prisma/skills/recsys-pipeline-architect/SKILL.md +0 -115
  320. package/prisma/skills/recursive-decision-ledger/SKILL.md +0 -80
  321. package/prisma/skills/redis-patterns/SKILL.md +0 -404
  322. package/prisma/skills/regex-vs-llm-structured-text/SKILL.md +0 -221
  323. package/prisma/skills/remotion-video-creation/SKILL.md +0 -43
  324. package/prisma/skills/remotion-video-creation/rules/3d.md +0 -86
  325. package/prisma/skills/remotion-video-creation/rules/animations.md +0 -29
  326. package/prisma/skills/remotion-video-creation/rules/assets/charts-bar-chart.tsx +0 -173
  327. package/prisma/skills/remotion-video-creation/rules/assets/text-animations-typewriter.tsx +0 -100
  328. package/prisma/skills/remotion-video-creation/rules/assets/text-animations-word-highlight.tsx +0 -108
  329. package/prisma/skills/remotion-video-creation/rules/assets.md +0 -78
  330. package/prisma/skills/remotion-video-creation/rules/audio.md +0 -172
  331. package/prisma/skills/remotion-video-creation/rules/calculate-metadata.md +0 -104
  332. package/prisma/skills/remotion-video-creation/rules/can-decode.md +0 -75
  333. package/prisma/skills/remotion-video-creation/rules/charts.md +0 -58
  334. package/prisma/skills/remotion-video-creation/rules/compositions.md +0 -146
  335. package/prisma/skills/remotion-video-creation/rules/display-captions.md +0 -126
  336. package/prisma/skills/remotion-video-creation/rules/extract-frames.md +0 -229
  337. package/prisma/skills/remotion-video-creation/rules/fonts.md +0 -152
  338. package/prisma/skills/remotion-video-creation/rules/get-audio-duration.md +0 -58
  339. package/prisma/skills/remotion-video-creation/rules/get-video-dimensions.md +0 -68
  340. package/prisma/skills/remotion-video-creation/rules/get-video-duration.md +0 -58
  341. package/prisma/skills/remotion-video-creation/rules/gifs.md +0 -138
  342. package/prisma/skills/remotion-video-creation/rules/images.md +0 -130
  343. package/prisma/skills/remotion-video-creation/rules/import-srt-captions.md +0 -67
  344. package/prisma/skills/remotion-video-creation/rules/lottie.md +0 -67
  345. package/prisma/skills/remotion-video-creation/rules/measuring-dom-nodes.md +0 -34
  346. package/prisma/skills/remotion-video-creation/rules/measuring-text.md +0 -143
  347. package/prisma/skills/remotion-video-creation/rules/sequencing.md +0 -106
  348. package/prisma/skills/remotion-video-creation/rules/tailwind.md +0 -11
  349. package/prisma/skills/remotion-video-creation/rules/text-animations.md +0 -20
  350. package/prisma/skills/remotion-video-creation/rules/timing.md +0 -179
  351. package/prisma/skills/remotion-video-creation/rules/transcribe-captions.md +0 -19
  352. package/prisma/skills/remotion-video-creation/rules/transitions.md +0 -122
  353. package/prisma/skills/remotion-video-creation/rules/trimming.md +0 -52
  354. package/prisma/skills/remotion-video-creation/rules/videos.md +0 -171
  355. package/prisma/skills/repo-scan/SKILL.md +0 -79
  356. package/prisma/skills/research-ops/SKILL.md +0 -113
  357. package/prisma/skills/returns-reverse-logistics/SKILL.md +0 -240
  358. package/prisma/skills/rules-distill/SKILL.md +0 -265
  359. package/prisma/skills/rules-distill/scripts/scan-rules.sh +0 -58
  360. package/prisma/skills/rules-distill/scripts/scan-skills.sh +0 -129
  361. package/prisma/skills/rust-patterns/SKILL.md +0 -500
  362. package/prisma/skills/rust-testing/SKILL.md +0 -501
  363. package/prisma/skills/safety-guard/SKILL.md +0 -76
  364. package/prisma/skills/santa-method/SKILL.md +0 -307
  365. package/prisma/skills/scientific-db-pubmed-database/SKILL.md +0 -176
  366. package/prisma/skills/scientific-db-uspto-database/SKILL.md +0 -178
  367. package/prisma/skills/scientific-pkg-gget/SKILL.md +0 -167
  368. package/prisma/skills/scientific-thinking-literature-review/SKILL.md +0 -193
  369. package/prisma/skills/scientific-thinking-scholar-evaluation/SKILL.md +0 -161
  370. package/prisma/skills/search-first/SKILL.md +0 -183
  371. package/prisma/skills/security-bounty-hunter/SKILL.md +0 -100
  372. package/prisma/skills/security-review/SKILL.md +0 -504
  373. package/prisma/skills/security-review/cloud-infrastructure-security.md +0 -361
  374. package/prisma/skills/security-scan/SKILL.md +0 -166
  375. package/prisma/skills/seo/SKILL.md +0 -155
  376. package/prisma/skills/skill-comply/SKILL.md +0 -59
  377. package/prisma/skills/skill-comply/fixtures/compliant_trace.jsonl +0 -5
  378. package/prisma/skills/skill-comply/fixtures/noncompliant_trace.jsonl +0 -3
  379. package/prisma/skills/skill-comply/fixtures/tdd_spec.yaml +0 -44
  380. package/prisma/skills/skill-comply/prompts/classifier.md +0 -24
  381. package/prisma/skills/skill-comply/prompts/scenario_generator.md +0 -62
  382. package/prisma/skills/skill-comply/prompts/spec_generator.md +0 -42
  383. package/prisma/skills/skill-comply/pyproject.toml +0 -15
  384. package/prisma/skills/skill-comply/scripts/__init__.py +0 -0
  385. package/prisma/skills/skill-comply/scripts/classifier.py +0 -85
  386. package/prisma/skills/skill-comply/scripts/grader.py +0 -124
  387. package/prisma/skills/skill-comply/scripts/parser.py +0 -107
  388. package/prisma/skills/skill-comply/scripts/report.py +0 -170
  389. package/prisma/skills/skill-comply/scripts/run.py +0 -127
  390. package/prisma/skills/skill-comply/scripts/runner.py +0 -194
  391. package/prisma/skills/skill-comply/scripts/scenario_generator.py +0 -70
  392. package/prisma/skills/skill-comply/scripts/spec_generator.py +0 -72
  393. package/prisma/skills/skill-comply/scripts/utils.py +0 -13
  394. package/prisma/skills/skill-comply/tests/test_grader.py +0 -197
  395. package/prisma/skills/skill-comply/tests/test_parser.py +0 -90
  396. package/prisma/skills/skill-comply/tests/test_runner.py +0 -172
  397. package/prisma/skills/skill-scout/SKILL.md +0 -141
  398. package/prisma/skills/skill-stocktake/SKILL.md +0 -195
  399. package/prisma/skills/skill-stocktake/scripts/quick-diff.sh +0 -87
  400. package/prisma/skills/skill-stocktake/scripts/save-results.sh +0 -56
  401. package/prisma/skills/skill-stocktake/scripts/scan.sh +0 -170
  402. package/prisma/skills/social-graph-ranker/SKILL.md +0 -155
  403. package/prisma/skills/social-publisher/SKILL.md +0 -130
  404. package/prisma/skills/springboot-patterns/SKILL.md +0 -315
  405. package/prisma/skills/springboot-security/SKILL.md +0 -273
  406. package/prisma/skills/springboot-tdd/SKILL.md +0 -159
  407. package/prisma/skills/springboot-verification/SKILL.md +0 -232
  408. package/prisma/skills/strategic-compact/SKILL.md +0 -136
  409. package/prisma/skills/swift-actor-persistence/SKILL.md +0 -144
  410. package/prisma/skills/swift-concurrency-6-2/SKILL.md +0 -216
  411. package/prisma/skills/swift-protocol-di-testing/SKILL.md +0 -191
  412. package/prisma/skills/swiftui-patterns/SKILL.md +0 -259
  413. package/prisma/skills/taste/SKILL.md +0 -264
  414. package/prisma/skills/taste/references/genre-taxonomy.md +0 -87
  415. package/prisma/skills/tdd-workflow/SKILL.md +0 -583
  416. package/prisma/skills/team-agent-orchestration/SKILL.md +0 -111
  417. package/prisma/skills/team-builder/SKILL.md +0 -169
  418. package/prisma/skills/terminal-ops/SKILL.md +0 -110
  419. package/prisma/skills/tinystruct-patterns/SKILL.md +0 -279
  420. package/prisma/skills/tinystruct-patterns/references/architecture.md +0 -90
  421. package/prisma/skills/tinystruct-patterns/references/data-handling.md +0 -60
  422. package/prisma/skills/tinystruct-patterns/references/database.md +0 -99
  423. package/prisma/skills/tinystruct-patterns/references/routing.md +0 -64
  424. package/prisma/skills/tinystruct-patterns/references/system-usage.md +0 -97
  425. package/prisma/skills/tinystruct-patterns/references/testing.md +0 -72
  426. package/prisma/skills/token-budget-advisor/SKILL.md +0 -134
  427. package/prisma/skills/ui-demo/SKILL.md +0 -466
  428. package/prisma/skills/ui-to-vue/SKILL.md +0 -135
  429. package/prisma/skills/uncloud/SKILL.md +0 -344
  430. package/prisma/skills/unified-notifications-ops/SKILL.md +0 -188
  431. package/prisma/skills/verification-loop/SKILL.md +0 -127
  432. package/prisma/skills/video-editing/SKILL.md +0 -311
  433. package/prisma/skills/videodb/SKILL.md +0 -375
  434. package/prisma/skills/videodb/reference/api-reference.md +0 -550
  435. package/prisma/skills/videodb/reference/capture-reference.md +0 -407
  436. package/prisma/skills/videodb/reference/capture.md +0 -101
  437. package/prisma/skills/videodb/reference/editor.md +0 -443
  438. package/prisma/skills/videodb/reference/generative.md +0 -331
  439. package/prisma/skills/videodb/reference/rtstream-reference.md +0 -564
  440. package/prisma/skills/videodb/reference/rtstream.md +0 -65
  441. package/prisma/skills/videodb/reference/search.md +0 -230
  442. package/prisma/skills/videodb/reference/streaming.md +0 -406
  443. package/prisma/skills/videodb/reference/use-cases.md +0 -118
  444. package/prisma/skills/videodb/scripts/ws_listener.py +0 -282
  445. package/prisma/skills/visa-doc-translate/README.md +0 -86
  446. package/prisma/skills/visa-doc-translate/SKILL.md +0 -117
  447. package/prisma/skills/vite-patterns/SKILL.md +0 -450
  448. package/prisma/skills/vue-patterns/SKILL.md +0 -471
  449. package/prisma/skills/windows-desktop-e2e/SKILL.md +0 -888
  450. package/prisma/skills/workspace-surface-audit/SKILL.md +0 -126
  451. package/prisma/skills/x-api/SKILL.md +0 -235
@@ -1,377 +0,0 @@
1
- ---
2
- name: error-handling
3
- description: Patterns for robust error handling across TypeScript, Python, and Go. Covers typed errors, error boundaries, retries, circuit breakers, and user-facing error messages.
4
- metadata:
5
- origin: ECC
6
- ---
7
-
8
- # Error Handling Patterns
9
-
10
- Consistent, robust error handling patterns for production applications.
11
-
12
- ## When to Activate
13
-
14
- - Designing error types or exception hierarchies for a new module or service
15
- - Adding retry logic or circuit breakers for unreliable external dependencies
16
- - Reviewing API endpoints for missing error handling
17
- - Implementing user-facing error messages and feedback
18
- - Debugging cascading failures or silent error swallowing
19
-
20
- ## Core Principles
21
-
22
- 1. **Fail fast and loudly** — surface errors at the boundary where they occur; don't bury them
23
- 2. **Typed errors over string messages** — errors are first-class values with structure
24
- 3. **User messages ≠ developer messages** — show friendly text to users, log full context server-side
25
- 4. **Never swallow errors silently** — every `catch` block must either handle, re-throw, or log
26
- 5. **Errors are part of your API contract** — document every error code a client may receive
27
-
28
- ## TypeScript / JavaScript
29
-
30
- ### Typed Error Classes
31
-
32
- ```typescript
33
- // Define an error hierarchy for your domain
34
- export class AppError extends Error {
35
- constructor(
36
- message: string,
37
- public readonly code: string,
38
- public readonly statusCode: number = 500,
39
- public readonly details?: unknown,
40
- ) {
41
- super(message)
42
- this.name = this.constructor.name
43
- // Maintain correct prototype chain in transpiled ES5 JavaScript.
44
- // Required for `instanceof` checks (e.g., `error instanceof NotFoundError`)
45
- // to work correctly when extending the built-in Error class.
46
- Object.setPrototypeOf(this, new.target.prototype)
47
- }
48
- }
49
-
50
- export class NotFoundError extends AppError {
51
- constructor(resource: string, id: string) {
52
- super(`${resource} not found: ${id}`, 'NOT_FOUND', 404)
53
- }
54
- }
55
-
56
- export class ValidationError extends AppError {
57
- constructor(message: string, details: { field: string; message: string }[]) {
58
- super(message, 'VALIDATION_ERROR', 422, details)
59
- }
60
- }
61
-
62
- export class UnauthorizedError extends AppError {
63
- constructor(reason = 'Authentication required') {
64
- super(reason, 'UNAUTHORIZED', 401)
65
- }
66
- }
67
-
68
- export class RateLimitError extends AppError {
69
- constructor(public readonly retryAfterMs: number) {
70
- super('Rate limit exceeded', 'RATE_LIMITED', 429)
71
- }
72
- }
73
- ```
74
-
75
- ### Result Pattern (no-throw style)
76
-
77
- For operations where failure is expected and common (parsing, external calls):
78
-
79
- ```typescript
80
- type Result<T, E = AppError> =
81
- | { ok: true; value: T }
82
- | { ok: false; error: E }
83
-
84
- function ok<T>(value: T): Result<T> {
85
- return { ok: true, value }
86
- }
87
-
88
- function err<E>(error: E): Result<never, E> {
89
- return { ok: false, error }
90
- }
91
-
92
- // Usage
93
- async function fetchUser(id: string): Promise<Result<User>> {
94
- try {
95
- const user = await db.users.findUnique({ where: { id } })
96
- if (!user) return err(new NotFoundError('User', id))
97
- return ok(user)
98
- } catch (e) {
99
- return err(new AppError('Database error', 'DB_ERROR'))
100
- }
101
- }
102
-
103
- const result = await fetchUser('abc-123')
104
- if (!result.ok) {
105
- // TypeScript knows result.error here
106
- logger.error('Failed to fetch user', { error: result.error })
107
- return
108
- }
109
- // TypeScript knows result.value here
110
- console.log(result.value.email)
111
- ```
112
-
113
- ### API Error Handler (Next.js / Express)
114
-
115
- ```typescript
116
- import { NextRequest, NextResponse } from 'next/server'
117
-
118
- function handleApiError(error: unknown): NextResponse {
119
- // Known application error
120
- if (error instanceof AppError) {
121
- return NextResponse.json(
122
- {
123
- error: {
124
- code: error.code,
125
- message: error.message,
126
- ...(error.details ? { details: error.details } : {}),
127
- },
128
- },
129
- { status: error.statusCode },
130
- )
131
- }
132
-
133
- // Zod validation error
134
- if (error instanceof z.ZodError) {
135
- return NextResponse.json(
136
- {
137
- error: {
138
- code: 'VALIDATION_ERROR',
139
- message: 'Request validation failed',
140
- details: error.issues.map(i => ({
141
- field: i.path.join('.'),
142
- message: i.message,
143
- })),
144
- },
145
- },
146
- { status: 422 },
147
- )
148
- }
149
-
150
- // Unexpected error — log details, return generic message
151
- console.error('Unexpected error:', error)
152
- return NextResponse.json(
153
- { error: { code: 'INTERNAL_ERROR', message: 'An unexpected error occurred' } },
154
- { status: 500 },
155
- )
156
- }
157
-
158
- export async function POST(req: NextRequest) {
159
- try {
160
- // ... handler logic
161
- } catch (error) {
162
- return handleApiError(error)
163
- }
164
- }
165
- ```
166
-
167
- ### React Error Boundary
168
-
169
- ```typescript
170
- import { Component, ErrorInfo, ReactNode } from 'react'
171
-
172
- interface Props {
173
- fallback: ReactNode
174
- onError?: (error: Error, info: ErrorInfo) => void
175
- children: ReactNode
176
- }
177
-
178
- interface State {
179
- hasError: boolean
180
- error: Error | null
181
- }
182
-
183
- export class ErrorBoundary extends Component<Props, State> {
184
- state: State = { hasError: false, error: null }
185
-
186
- static getDerivedStateFromError(error: Error): State {
187
- return { hasError: true, error }
188
- }
189
-
190
- componentDidCatch(error: Error, info: ErrorInfo) {
191
- this.props.onError?.(error, info)
192
- console.error('Unhandled React error:', error, info)
193
- }
194
-
195
- render() {
196
- if (this.state.hasError) return this.props.fallback
197
- return this.props.children
198
- }
199
- }
200
-
201
- // Usage
202
- <ErrorBoundary fallback={<p>Something went wrong. Please refresh.</p>}>
203
- <MyComponent />
204
- </ErrorBoundary>
205
- ```
206
-
207
- ## Python
208
-
209
- ### Custom Exception Hierarchy
210
-
211
- ```python
212
- class AppError(Exception):
213
- """Base application error."""
214
- def __init__(self, message: str, code: str, status_code: int = 500):
215
- super().__init__(message)
216
- self.code = code
217
- self.status_code = status_code
218
-
219
- class NotFoundError(AppError):
220
- def __init__(self, resource: str, id: str):
221
- super().__init__(f"{resource} not found: {id}", "NOT_FOUND", 404)
222
-
223
- class ValidationError(AppError):
224
- def __init__(self, message: str, details: list[dict] | None = None):
225
- super().__init__(message, "VALIDATION_ERROR", 422)
226
- self.details = details or []
227
- ```
228
-
229
- ### FastAPI Global Exception Handler
230
-
231
- ```python
232
- from fastapi import FastAPI, Request
233
- from fastapi.responses import JSONResponse
234
-
235
- app = FastAPI()
236
-
237
- @app.exception_handler(AppError)
238
- async def app_error_handler(request: Request, exc: AppError) -> JSONResponse:
239
- return JSONResponse(
240
- status_code=exc.status_code,
241
- content={"error": {"code": exc.code, "message": str(exc)}},
242
- )
243
-
244
- @app.exception_handler(Exception)
245
- async def generic_error_handler(request: Request, exc: Exception) -> JSONResponse:
246
- # Log full details, return generic message
247
- logger.exception("Unexpected error", exc_info=exc)
248
- return JSONResponse(
249
- status_code=500,
250
- content={"error": {"code": "INTERNAL_ERROR", "message": "An unexpected error occurred"}},
251
- )
252
- ```
253
-
254
- ## Go
255
-
256
- ### Sentinel Errors and Error Wrapping
257
-
258
- ```go
259
- package domain
260
-
261
- import "errors"
262
-
263
- // Sentinel errors for type-checking
264
- var (
265
- ErrNotFound = errors.New("not found")
266
- ErrUnauthorized = errors.New("unauthorized")
267
- ErrConflict = errors.New("conflict")
268
- )
269
-
270
- // Wrap errors with context — never lose the original
271
- func (r *UserRepository) FindByID(ctx context.Context, id string) (*User, error) {
272
- user, err := r.db.QueryRow(ctx, "SELECT * FROM users WHERE id = $1", id)
273
- if errors.Is(err, sql.ErrNoRows) {
274
- return nil, fmt.Errorf("user %s: %w", id, ErrNotFound)
275
- }
276
- if err != nil {
277
- return nil, fmt.Errorf("querying user %s: %w", id, err)
278
- }
279
- return user, nil
280
- }
281
-
282
- // At the handler level, unwrap to determine response
283
- func (h *Handler) GetUser(w http.ResponseWriter, r *http.Request) {
284
- user, err := h.service.GetUser(r.Context(), chi.URLParam(r, "id"))
285
- if err != nil {
286
- switch {
287
- case errors.Is(err, domain.ErrNotFound):
288
- writeError(w, http.StatusNotFound, "not_found", err.Error())
289
- case errors.Is(err, domain.ErrUnauthorized):
290
- writeError(w, http.StatusForbidden, "forbidden", "Access denied")
291
- default:
292
- slog.Error("unexpected error", "err", err)
293
- writeError(w, http.StatusInternalServerError, "internal_error", "An unexpected error occurred")
294
- }
295
- return
296
- }
297
- writeJSON(w, http.StatusOK, user)
298
- }
299
- ```
300
-
301
- ## Retry with Exponential Backoff
302
-
303
- ```typescript
304
- interface RetryOptions {
305
- maxAttempts?: number
306
- baseDelayMs?: number
307
- maxDelayMs?: number
308
- retryIf?: (error: unknown) => boolean
309
- }
310
-
311
- async function withRetry<T>(
312
- fn: () => Promise<T>,
313
- options: RetryOptions = {},
314
- ): Promise<T> {
315
- const {
316
- maxAttempts = 3,
317
- baseDelayMs = 500,
318
- maxDelayMs = 10_000,
319
- retryIf = () => true,
320
- } = options
321
-
322
- let lastError: unknown
323
-
324
- for (let attempt = 1; attempt <= maxAttempts; attempt++) {
325
- try {
326
- return await fn()
327
- } catch (error) {
328
- lastError = error
329
- if (attempt === maxAttempts || !retryIf(error)) throw error
330
-
331
- const jitter = Math.random() * baseDelayMs
332
- const delay = Math.min(baseDelayMs * 2 ** (attempt - 1) + jitter, maxDelayMs)
333
- await new Promise(resolve => setTimeout(resolve, delay))
334
- }
335
- }
336
-
337
- throw lastError
338
- }
339
-
340
- // Usage: retry transient network errors, not 4xx
341
- const data = await withRetry(() => fetch('/api/data').then(r => r.json()), {
342
- maxAttempts: 3,
343
- retryIf: (error) => !(error instanceof AppError && error.statusCode < 500),
344
- })
345
- ```
346
-
347
- ## User-Facing Error Messages
348
-
349
- Map error codes to human-readable messages. Keep technical details out of user-visible text.
350
-
351
- ```typescript
352
- const USER_ERROR_MESSAGES: Record<string, string> = {
353
- NOT_FOUND: 'The requested item could not be found.',
354
- UNAUTHORIZED: 'Please sign in to continue.',
355
- FORBIDDEN: "You don't have permission to do that.",
356
- VALIDATION_ERROR: 'Please check your input and try again.',
357
- RATE_LIMITED: 'Too many requests. Please wait a moment and try again.',
358
- INTERNAL_ERROR: 'Something went wrong on our end. Please try again later.',
359
- }
360
-
361
- export function getUserMessage(code: string): string {
362
- return USER_ERROR_MESSAGES[code] ?? USER_ERROR_MESSAGES.INTERNAL_ERROR
363
- }
364
- ```
365
-
366
- ## Error Handling Checklist
367
-
368
- Before merging any code that touches error handling:
369
-
370
- - [ ] Every `catch` block handles, re-throws, or logs — no silent swallowing
371
- - [ ] API errors follow the standard envelope `{ error: { code, message } }`
372
- - [ ] User-facing messages contain no stack traces or internal details
373
- - [ ] Full error context is logged server-side
374
- - [ ] Custom error classes extend a base `AppError` with a `code` field
375
- - [ ] Async functions surface errors to callers — no fire-and-forget without fallback
376
- - [ ] Retry logic only retries retriable errors (not 4xx client errors)
377
- - [ ] React components are wrapped in `ErrorBoundary` for rendering errors
@@ -1,271 +0,0 @@
1
- ---
2
- name: eval-harness
3
- description: Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles
4
- metadata:
5
- origin: ECC
6
- tools: Read, Write, Edit, Bash, Grep, Glob
7
- ---
8
-
9
- # Eval Harness Skill
10
-
11
- A formal evaluation framework for Claude Code sessions, implementing eval-driven development (EDD) principles.
12
-
13
- ## When to Activate
14
-
15
- - Setting up eval-driven development (EDD) for AI-assisted workflows
16
- - Defining pass/fail criteria for Claude Code task completion
17
- - Measuring agent reliability with pass@k metrics
18
- - Creating regression test suites for prompt or agent changes
19
- - Benchmarking agent performance across model versions
20
-
21
- ## Philosophy
22
-
23
- Eval-Driven Development treats evals as the "unit tests of AI development":
24
- - Define expected behavior BEFORE implementation
25
- - Run evals continuously during development
26
- - Track regressions with each change
27
- - Use pass@k metrics for reliability measurement
28
-
29
- ## Eval Types
30
-
31
- ### Capability Evals
32
- Test if Claude can do something it couldn't before:
33
- ```markdown
34
- [CAPABILITY EVAL: feature-name]
35
- Task: Description of what Claude should accomplish
36
- Success Criteria:
37
- - [ ] Criterion 1
38
- - [ ] Criterion 2
39
- - [ ] Criterion 3
40
- Expected Output: Description of expected result
41
- ```
42
-
43
- ### Regression Evals
44
- Ensure changes don't break existing functionality:
45
- ```markdown
46
- [REGRESSION EVAL: feature-name]
47
- Baseline: SHA or checkpoint name
48
- Tests:
49
- - existing-test-1: PASS/FAIL
50
- - existing-test-2: PASS/FAIL
51
- - existing-test-3: PASS/FAIL
52
- Result: X/Y passed (previously Y/Y)
53
- ```
54
-
55
- ## Grader Types
56
-
57
- ### 1. Code-Based Grader
58
- Deterministic checks using code:
59
- ```bash
60
- # Check if file contains expected pattern
61
- grep -q "export function handleAuth" src/auth.ts && echo "PASS" || echo "FAIL"
62
-
63
- # Check if tests pass
64
- npm test -- --testPathPattern="auth" && echo "PASS" || echo "FAIL"
65
-
66
- # Check if build succeeds
67
- npm run build && echo "PASS" || echo "FAIL"
68
- ```
69
-
70
- ### 2. Model-Based Grader
71
- Use Claude to evaluate open-ended outputs:
72
- ```markdown
73
- [MODEL GRADER PROMPT]
74
- Evaluate the following code change:
75
- 1. Does it solve the stated problem?
76
- 2. Is it well-structured?
77
- 3. Are edge cases handled?
78
- 4. Is error handling appropriate?
79
-
80
- Score: 1-5 (1=poor, 5=excellent)
81
- Reasoning: [explanation]
82
- ```
83
-
84
- ### 3. Human Grader
85
- Flag for manual review:
86
- ```markdown
87
- [HUMAN REVIEW REQUIRED]
88
- Change: Description of what changed
89
- Reason: Why human review is needed
90
- Risk Level: LOW/MEDIUM/HIGH
91
- ```
92
-
93
- ## Metrics
94
-
95
- ### pass@k
96
- "At least one success in k attempts"
97
- - pass@1: First attempt success rate
98
- - pass@3: Success within 3 attempts
99
- - Typical target: pass@3 > 90%
100
-
101
- ### pass^k
102
- "All k trials succeed"
103
- - Higher bar for reliability
104
- - pass^3: 3 consecutive successes
105
- - Use for critical paths
106
-
107
- ## Eval Workflow
108
-
109
- ### 1. Define (Before Coding)
110
- ```markdown
111
- ## EVAL DEFINITION: feature-xyz
112
-
113
- ### Capability Evals
114
- 1. Can create new user account
115
- 2. Can validate email format
116
- 3. Can hash password securely
117
-
118
- ### Regression Evals
119
- 1. Existing login still works
120
- 2. Session management unchanged
121
- 3. Logout flow intact
122
-
123
- ### Success Metrics
124
- - pass@3 > 90% for capability evals
125
- - pass^3 = 100% for regression evals
126
- ```
127
-
128
- ### 2. Implement
129
- Write code to pass the defined evals.
130
-
131
- ### 3. Evaluate
132
- ```bash
133
- # Run capability evals
134
- [Run each capability eval, record PASS/FAIL]
135
-
136
- # Run regression evals
137
- npm test -- --testPathPattern="existing"
138
-
139
- # Generate report
140
- ```
141
-
142
- ### 4. Report
143
- ```markdown
144
- EVAL REPORT: feature-xyz
145
- ========================
146
-
147
- Capability Evals:
148
- create-user: PASS (pass@1)
149
- validate-email: PASS (pass@2)
150
- hash-password: PASS (pass@1)
151
- Overall: 3/3 passed
152
-
153
- Regression Evals:
154
- login-flow: PASS
155
- session-mgmt: PASS
156
- logout-flow: PASS
157
- Overall: 3/3 passed
158
-
159
- Metrics:
160
- pass@1: 67% (2/3)
161
- pass@3: 100% (3/3)
162
-
163
- Status: READY FOR REVIEW
164
- ```
165
-
166
- ## Integration Patterns
167
-
168
- ### Pre-Implementation
169
- ```
170
- /eval define feature-name
171
- ```
172
- Creates eval definition file at `.claude/evals/feature-name.md`
173
-
174
- ### During Implementation
175
- ```
176
- /eval check feature-name
177
- ```
178
- Runs current evals and reports status
179
-
180
- ### Post-Implementation
181
- ```
182
- /eval report feature-name
183
- ```
184
- Generates full eval report
185
-
186
- ## Eval Storage
187
-
188
- Store evals in project:
189
- ```
190
- .claude/
191
- evals/
192
- feature-xyz.md # Eval definition
193
- feature-xyz.log # Eval run history
194
- baseline.json # Regression baselines
195
- ```
196
-
197
- ## Best Practices
198
-
199
- 1. **Define evals BEFORE coding** - Forces clear thinking about success criteria
200
- 2. **Run evals frequently** - Catch regressions early
201
- 3. **Track pass@k over time** - Monitor reliability trends
202
- 4. **Use code graders when possible** - Deterministic > probabilistic
203
- 5. **Human review for security** - Never fully automate security checks
204
- 6. **Keep evals fast** - Slow evals don't get run
205
- 7. **Version evals with code** - Evals are first-class artifacts
206
-
207
- ## Example: Adding Authentication
208
-
209
- ```markdown
210
- ## EVAL: add-authentication
211
-
212
- ### Phase 1: Define (10 min)
213
- Capability Evals:
214
- - [ ] User can register with email/password
215
- - [ ] User can login with valid credentials
216
- - [ ] Invalid credentials rejected with proper error
217
- - [ ] Sessions persist across page reloads
218
- - [ ] Logout clears session
219
-
220
- Regression Evals:
221
- - [ ] Public routes still accessible
222
- - [ ] API responses unchanged
223
- - [ ] Database schema compatible
224
-
225
- ### Phase 2: Implement (varies)
226
- [Write code]
227
-
228
- ### Phase 3: Evaluate
229
- Run: /eval check add-authentication
230
-
231
- ### Phase 4: Report
232
- EVAL REPORT: add-authentication
233
- ==============================
234
- Capability: 5/5 passed (pass@3: 100%)
235
- Regression: 3/3 passed (pass^3: 100%)
236
- Status: SHIP IT
237
- ```
238
-
239
- ## Product Evals (v1.8)
240
-
241
- Use product evals when behavior quality cannot be captured by unit tests alone.
242
-
243
- ### Grader Types
244
-
245
- 1. Code grader (deterministic assertions)
246
- 2. Rule grader (regex/schema constraints)
247
- 3. Model grader (LLM-as-judge rubric)
248
- 4. Human grader (manual adjudication for ambiguous outputs)
249
-
250
- ### pass@k Guidance
251
-
252
- - `pass@1`: direct reliability
253
- - `pass@3`: practical reliability under controlled retries
254
- - `pass^3`: stability test (all 3 runs must pass)
255
-
256
- Recommended thresholds:
257
- - Capability evals: pass@3 >= 0.90
258
- - Regression evals: pass^3 = 1.00 for release-critical paths
259
-
260
- ### Eval Anti-Patterns
261
-
262
- - Overfitting prompts to known eval examples
263
- - Measuring only happy-path outputs
264
- - Ignoring cost and latency drift while chasing pass rates
265
- - Allowing flaky graders in release gates
266
-
267
- ### Minimal Eval Artifact Layout
268
-
269
- - `.claude/evals/<feature>.md` definition
270
- - `.claude/evals/<feature>.log` run history
271
- - `docs/releases/<version>/eval-summary.md` release snapshot