arboris-cli 1.0.0 → 1.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (451) hide show
  1. package/dist/cli.mjs +420 -0
  2. package/manifest.json +602 -0
  3. package/package.json +22 -10
  4. package/prisma/skills/accessibility/SKILL.md +147 -0
  5. package/prisma/skills/agent-architecture-audit/SKILL.md +257 -0
  6. package/prisma/skills/agent-eval/SKILL.md +146 -0
  7. package/prisma/skills/agent-harness-construction/SKILL.md +74 -0
  8. package/prisma/skills/agent-introspection-debugging/SKILL.md +154 -0
  9. package/prisma/skills/agent-payment-x402/SKILL.md +225 -0
  10. package/prisma/skills/agent-self-evaluation/SKILL.md +182 -0
  11. package/prisma/skills/agent-self-evaluation/examples/high-score-example.md +87 -0
  12. package/prisma/skills/agent-self-evaluation/examples/low-score-example.md +86 -0
  13. package/prisma/skills/agent-self-evaluation/references/evaluation-criteria.md +71 -0
  14. package/prisma/skills/agent-self-evaluation/references/hook-integration.md +64 -0
  15. package/prisma/skills/agent-self-evaluation/scripts/evaluate.py +408 -0
  16. package/prisma/skills/agent-self-evaluation/templates/evaluation-report.md +86 -0
  17. package/prisma/skills/agent-sort/SKILL.md +216 -0
  18. package/prisma/skills/agentic-engineering/SKILL.md +64 -0
  19. package/prisma/skills/agentic-os/SKILL.md +388 -0
  20. package/prisma/skills/ai-first-engineering/SKILL.md +52 -0
  21. package/prisma/skills/ai-regression-testing/SKILL.md +386 -0
  22. package/prisma/skills/android-clean-architecture/SKILL.md +340 -0
  23. package/prisma/skills/angular-developer/SKILL.md +155 -0
  24. package/prisma/skills/angular-developer/references/angular-animations.md +160 -0
  25. package/prisma/skills/angular-developer/references/angular-aria.md +410 -0
  26. package/prisma/skills/angular-developer/references/cli.md +86 -0
  27. package/prisma/skills/angular-developer/references/component-harnesses.md +59 -0
  28. package/prisma/skills/angular-developer/references/component-styling.md +91 -0
  29. package/prisma/skills/angular-developer/references/components.md +117 -0
  30. package/prisma/skills/angular-developer/references/creating-services.md +97 -0
  31. package/prisma/skills/angular-developer/references/data-resolvers.md +69 -0
  32. package/prisma/skills/angular-developer/references/define-routes.md +67 -0
  33. package/prisma/skills/angular-developer/references/defining-providers.md +72 -0
  34. package/prisma/skills/angular-developer/references/di-fundamentals.md +120 -0
  35. package/prisma/skills/angular-developer/references/e2e-testing.md +56 -0
  36. package/prisma/skills/angular-developer/references/effects.md +83 -0
  37. package/prisma/skills/angular-developer/references/hierarchical-injectors.md +43 -0
  38. package/prisma/skills/angular-developer/references/host-elements.md +80 -0
  39. package/prisma/skills/angular-developer/references/injection-context.md +63 -0
  40. package/prisma/skills/angular-developer/references/inputs.md +101 -0
  41. package/prisma/skills/angular-developer/references/linked-signal.md +59 -0
  42. package/prisma/skills/angular-developer/references/loading-strategies.md +61 -0
  43. package/prisma/skills/angular-developer/references/mcp.md +108 -0
  44. package/prisma/skills/angular-developer/references/navigate-to-routes.md +69 -0
  45. package/prisma/skills/angular-developer/references/outputs.md +86 -0
  46. package/prisma/skills/angular-developer/references/reactive-forms.md +122 -0
  47. package/prisma/skills/angular-developer/references/rendering-strategies.md +44 -0
  48. package/prisma/skills/angular-developer/references/resource.md +77 -0
  49. package/prisma/skills/angular-developer/references/route-animations.md +56 -0
  50. package/prisma/skills/angular-developer/references/route-guards.md +52 -0
  51. package/prisma/skills/angular-developer/references/router-lifecycle.md +45 -0
  52. package/prisma/skills/angular-developer/references/router-testing.md +87 -0
  53. package/prisma/skills/angular-developer/references/show-routes-with-outlets.md +68 -0
  54. package/prisma/skills/angular-developer/references/signal-forms.md +795 -0
  55. package/prisma/skills/angular-developer/references/signals-overview.md +94 -0
  56. package/prisma/skills/angular-developer/references/tailwind-css.md +69 -0
  57. package/prisma/skills/angular-developer/references/template-driven-forms.md +114 -0
  58. package/prisma/skills/angular-developer/references/testing-fundamentals.md +65 -0
  59. package/prisma/skills/api-connector-builder/SKILL.md +121 -0
  60. package/prisma/skills/api-design/SKILL.md +524 -0
  61. package/prisma/skills/architecture-decision-records/SKILL.md +180 -0
  62. package/prisma/skills/article-writing/SKILL.md +80 -0
  63. package/prisma/skills/automation-audit-ops/SKILL.md +143 -0
  64. package/prisma/skills/autonomous-agent-harness/SKILL.md +274 -0
  65. package/prisma/skills/autonomous-loops/SKILL.md +611 -0
  66. package/prisma/skills/backend-patterns/SKILL.md +562 -0
  67. package/prisma/skills/benchmark/SKILL.md +94 -0
  68. package/prisma/skills/benchmark-methodology/SKILL.md +190 -0
  69. package/prisma/skills/benchmark-optimization-loop/SKILL.md +70 -0
  70. package/prisma/skills/blender-motion-state-inspection/SKILL.md +165 -0
  71. package/prisma/skills/blueprint/SKILL.md +106 -0
  72. package/prisma/skills/brand-discovery/SKILL.md +145 -0
  73. package/prisma/skills/brand-discovery/references/10_purpose-why.md +40 -0
  74. package/prisma/skills/brand-discovery/references/20_positioning.md +44 -0
  75. package/prisma/skills/brand-discovery/references/30_audience-niche.md +52 -0
  76. package/prisma/skills/brand-discovery/references/40_personality-archetype.md +57 -0
  77. package/prisma/skills/brand-discovery/references/50_voice-tone.md +59 -0
  78. package/prisma/skills/brand-discovery/references/60_narrative-story.md +50 -0
  79. package/prisma/skills/brand-discovery/references/70_founder-tension.md +49 -0
  80. package/prisma/skills/brand-discovery/references/90_SYNTHESIS.md +133 -0
  81. package/prisma/skills/brand-voice/SKILL.md +98 -0
  82. package/prisma/skills/brand-voice/references/voice-profile-schema.md +55 -0
  83. package/prisma/skills/browser-qa/SKILL.md +105 -0
  84. package/prisma/skills/bun-runtime/SKILL.md +85 -0
  85. package/prisma/skills/canary-watch/SKILL.md +108 -0
  86. package/prisma/skills/carrier-relationship-management/SKILL.md +212 -0
  87. package/prisma/skills/cisco-ios-patterns/SKILL.md +164 -0
  88. package/prisma/skills/ck/SKILL.md +148 -0
  89. package/prisma/skills/ck/commands/forget.mjs +44 -0
  90. package/prisma/skills/ck/commands/info.mjs +24 -0
  91. package/prisma/skills/ck/commands/init.mjs +143 -0
  92. package/prisma/skills/ck/commands/list.mjs +40 -0
  93. package/prisma/skills/ck/commands/migrate.mjs +202 -0
  94. package/prisma/skills/ck/commands/resume.mjs +36 -0
  95. package/prisma/skills/ck/commands/save.mjs +210 -0
  96. package/prisma/skills/ck/commands/shared.mjs +387 -0
  97. package/prisma/skills/ck/hooks/session-start.mjs +224 -0
  98. package/prisma/skills/claude-devfleet/SKILL.md +112 -0
  99. package/prisma/skills/click-path-audit/SKILL.md +245 -0
  100. package/prisma/skills/clickhouse-io/SKILL.md +440 -0
  101. package/prisma/skills/code-tour/SKILL.md +254 -0
  102. package/prisma/skills/codebase-onboarding/SKILL.md +234 -0
  103. package/prisma/skills/codehealth-mcp/SKILL.md +167 -0
  104. package/prisma/skills/coding-standards/SKILL.md +551 -0
  105. package/prisma/skills/competitive-platform-analysis/SKILL.md +214 -0
  106. package/prisma/skills/competitive-report-structure/SKILL.md +162 -0
  107. package/prisma/skills/compose-multiplatform-patterns/SKILL.md +300 -0
  108. package/prisma/skills/config-gc/SKILL.md +120 -0
  109. package/prisma/skills/configure-ecc/SKILL.md +385 -0
  110. package/prisma/skills/connections-optimizer/SKILL.md +190 -0
  111. package/prisma/skills/content-engine/SKILL.md +132 -0
  112. package/prisma/skills/content-hash-cache-pattern/SKILL.md +162 -0
  113. package/prisma/skills/context-budget/SKILL.md +136 -0
  114. package/prisma/skills/continuous-agent-loop/SKILL.md +46 -0
  115. package/prisma/skills/continuous-learning/SKILL.md +132 -0
  116. package/prisma/skills/continuous-learning/config.json +18 -0
  117. package/prisma/skills/continuous-learning/evaluate-session.sh +69 -0
  118. package/prisma/skills/continuous-learning-v2/SKILL.md +361 -0
  119. package/prisma/skills/continuous-learning-v2/agents/observer-loop.sh +359 -0
  120. package/prisma/skills/continuous-learning-v2/agents/observer.md +189 -0
  121. package/prisma/skills/continuous-learning-v2/agents/session-guardian.sh +150 -0
  122. package/prisma/skills/continuous-learning-v2/agents/start-observer.sh +248 -0
  123. package/prisma/skills/continuous-learning-v2/config.json +8 -0
  124. package/prisma/skills/continuous-learning-v2/hooks/observe.sh +585 -0
  125. package/prisma/skills/continuous-learning-v2/scripts/detect-project.sh +322 -0
  126. package/prisma/skills/continuous-learning-v2/scripts/instinct-cli.py +1956 -0
  127. package/prisma/skills/continuous-learning-v2/scripts/lib/homunculus-dir.sh +31 -0
  128. package/prisma/skills/continuous-learning-v2/scripts/migrate-homunculus.sh +68 -0
  129. package/prisma/skills/continuous-learning-v2/scripts/test_parse_instinct.py +1421 -0
  130. package/prisma/skills/cost-aware-llm-pipeline/SKILL.md +184 -0
  131. package/prisma/skills/cost-tracking/SKILL.md +97 -0
  132. package/prisma/skills/council/SKILL.md +204 -0
  133. package/prisma/skills/cpp-coding-standards/SKILL.md +724 -0
  134. package/prisma/skills/cpp-testing/SKILL.md +325 -0
  135. package/prisma/skills/crosspost/SKILL.md +112 -0
  136. package/prisma/skills/csharp-testing/SKILL.md +322 -0
  137. package/prisma/skills/customer-billing-ops/SKILL.md +141 -0
  138. package/prisma/skills/customs-trade-compliance/SKILL.md +263 -0
  139. package/prisma/skills/dart-flutter-patterns/SKILL.md +564 -0
  140. package/prisma/skills/dashboard-builder/SKILL.md +109 -0
  141. package/prisma/skills/data-scraper-agent/SKILL.md +765 -0
  142. package/prisma/skills/data-throughput-accelerator/SKILL.md +73 -0
  143. package/prisma/skills/database-migrations/SKILL.md +430 -0
  144. package/prisma/skills/deep-research/SKILL.md +160 -0
  145. package/prisma/skills/defi-amm-security/SKILL.md +167 -0
  146. package/prisma/skills/delivery-gate/SKILL.md +126 -0
  147. package/prisma/skills/delivery-gate/hooks/quality-gate.py +220 -0
  148. package/prisma/skills/deployment-patterns/SKILL.md +428 -0
  149. package/prisma/skills/design-system/SKILL.md +83 -0
  150. package/prisma/skills/django-celery/SKILL.md +458 -0
  151. package/prisma/skills/django-patterns/SKILL.md +735 -0
  152. package/prisma/skills/django-security/SKILL.md +644 -0
  153. package/prisma/skills/django-tdd/SKILL.md +730 -0
  154. package/prisma/skills/django-verification/SKILL.md +470 -0
  155. package/prisma/skills/dmux-workflows/SKILL.md +192 -0
  156. package/prisma/skills/docker-patterns/SKILL.md +365 -0
  157. package/prisma/skills/documentation-lookup/SKILL.md +91 -0
  158. package/prisma/skills/dotnet-patterns/SKILL.md +322 -0
  159. package/prisma/skills/dynamic-workflow-mode/SKILL.md +124 -0
  160. package/prisma/skills/e2e-testing/SKILL.md +327 -0
  161. package/prisma/skills/ecc-guide/SKILL.md +190 -0
  162. package/prisma/skills/ecc-recipes/SKILL.md +149 -0
  163. package/prisma/skills/ecc-tools-cost-audit/SKILL.md +161 -0
  164. package/prisma/skills/email-ops/SKILL.md +122 -0
  165. package/prisma/skills/energy-procurement/SKILL.md +228 -0
  166. package/prisma/skills/enterprise-agent-ops/SKILL.md +51 -0
  167. package/prisma/skills/error-handling/SKILL.md +377 -0
  168. package/prisma/skills/eval-harness/SKILL.md +271 -0
  169. package/prisma/skills/evm-token-decimals/SKILL.md +131 -0
  170. package/prisma/skills/exa-search/SKILL.md +108 -0
  171. package/prisma/skills/fal-ai-media/SKILL.md +289 -0
  172. package/prisma/skills/fastapi-patterns/SKILL.md +514 -0
  173. package/prisma/skills/finance-billing-ops/SKILL.md +128 -0
  174. package/prisma/skills/flox-environments/SKILL.md +497 -0
  175. package/prisma/skills/flutter-dart-code-review/SKILL.md +436 -0
  176. package/prisma/skills/foundation-models-on-device/SKILL.md +243 -0
  177. package/prisma/skills/frontend-a11y/SKILL.md +446 -0
  178. package/prisma/skills/frontend-design-direction/SKILL.md +93 -0
  179. package/prisma/skills/frontend-patterns/SKILL.md +657 -0
  180. package/prisma/skills/frontend-slides/SKILL.md +185 -0
  181. package/prisma/skills/frontend-slides/STYLE_PRESETS.md +330 -0
  182. package/prisma/skills/frontend-slides/animation-patterns.md +122 -0
  183. package/prisma/skills/frontend-slides/html-template.md +419 -0
  184. package/prisma/skills/frontend-slides/scripts/export-pdf.sh +418 -0
  185. package/prisma/skills/frontend-slides/scripts/extract-pptx.py +96 -0
  186. package/prisma/skills/frontend-slides/viewport-base.css +153 -0
  187. package/prisma/skills/fsharp-testing/SKILL.md +281 -0
  188. package/prisma/skills/gan-style-harness/SKILL.md +279 -0
  189. package/prisma/skills/gateguard/SKILL.md +133 -0
  190. package/prisma/skills/generating-python-installer/SKILL.md +820 -0
  191. package/prisma/skills/git-workflow/SKILL.md +716 -0
  192. package/prisma/skills/github-ops/SKILL.md +145 -0
  193. package/prisma/skills/golang-patterns/SKILL.md +675 -0
  194. package/prisma/skills/golang-testing/SKILL.md +721 -0
  195. package/prisma/skills/google-workspace-ops/SKILL.md +96 -0
  196. package/prisma/skills/growth-log/SKILL.md +128 -0
  197. package/prisma/skills/healthcare-cdss-patterns/SKILL.md +246 -0
  198. package/prisma/skills/healthcare-emr-patterns/SKILL.md +160 -0
  199. package/prisma/skills/healthcare-eval-harness/SKILL.md +208 -0
  200. package/prisma/skills/healthcare-phi-compliance/SKILL.md +146 -0
  201. package/prisma/skills/hermes-imports/SKILL.md +89 -0
  202. package/prisma/skills/hexagonal-architecture/SKILL.md +277 -0
  203. package/prisma/skills/hipaa-compliance/SKILL.md +79 -0
  204. package/prisma/skills/homelab-network-readiness/SKILL.md +170 -0
  205. package/prisma/skills/homelab-network-setup/SKILL.md +130 -0
  206. package/prisma/skills/homelab-pihole-dns/SKILL.md +275 -0
  207. package/prisma/skills/homelab-vlan-segmentation/SKILL.md +312 -0
  208. package/prisma/skills/homelab-wireguard-vpn/SKILL.md +306 -0
  209. package/prisma/skills/hookify-rules/SKILL.md +128 -0
  210. package/prisma/skills/inherit-legacy-style/SKILL.md +157 -0
  211. package/prisma/skills/intent-driven-development/SKILL.md +360 -0
  212. package/prisma/skills/inventory-demand-planning/SKILL.md +247 -0
  213. package/prisma/skills/investor-materials/SKILL.md +97 -0
  214. package/prisma/skills/investor-outreach/SKILL.md +92 -0
  215. package/prisma/skills/ios-icon-gen/SKILL.md +158 -0
  216. package/prisma/skills/ios-icon-gen/scripts/generate_icons.swift +258 -0
  217. package/prisma/skills/ios-icon-gen/scripts/iconify_gen.sh +235 -0
  218. package/prisma/skills/iterative-retrieval/SKILL.md +212 -0
  219. package/prisma/skills/ito-basket-compare/SKILL.md +64 -0
  220. package/prisma/skills/ito-data-atlas-agent/SKILL.md +64 -0
  221. package/prisma/skills/ito-market-intelligence/SKILL.md +61 -0
  222. package/prisma/skills/ito-trade-planner/SKILL.md +68 -0
  223. package/prisma/skills/java-coding-standards/SKILL.md +384 -0
  224. package/prisma/skills/jira-integration/SKILL.md +303 -0
  225. package/prisma/skills/jpa-patterns/SKILL.md +152 -0
  226. package/prisma/skills/knowledge-ops/SKILL.md +155 -0
  227. package/prisma/skills/kotlin-coroutines-flows/SKILL.md +285 -0
  228. package/prisma/skills/kotlin-exposed-patterns/SKILL.md +720 -0
  229. package/prisma/skills/kotlin-ktor-patterns/SKILL.md +690 -0
  230. package/prisma/skills/kotlin-patterns/SKILL.md +712 -0
  231. package/prisma/skills/kotlin-testing/SKILL.md +825 -0
  232. package/prisma/skills/kubernetes-patterns/SKILL.md +756 -0
  233. package/prisma/skills/laravel-patterns/SKILL.md +416 -0
  234. package/prisma/skills/laravel-plugin-discovery/SKILL.md +230 -0
  235. package/prisma/skills/laravel-security/SKILL.md +948 -0
  236. package/prisma/skills/laravel-tdd/SKILL.md +675 -0
  237. package/prisma/skills/laravel-verification/SKILL.md +180 -0
  238. package/prisma/skills/latency-critical-systems/SKILL.md +74 -0
  239. package/prisma/skills/lead-intelligence/SKILL.md +322 -0
  240. package/prisma/skills/lead-intelligence/agents/enrichment-agent.md +85 -0
  241. package/prisma/skills/lead-intelligence/agents/mutual-mapper.md +75 -0
  242. package/prisma/skills/lead-intelligence/agents/outreach-drafter.md +98 -0
  243. package/prisma/skills/lead-intelligence/agents/signal-scorer.md +60 -0
  244. package/prisma/skills/liquid-glass-design/SKILL.md +279 -0
  245. package/prisma/skills/llm-trading-agent-security/SKILL.md +147 -0
  246. package/prisma/skills/logistics-exception-management/SKILL.md +222 -0
  247. package/prisma/skills/loop-design-check/SKILL.md +143 -0
  248. package/prisma/skills/mailtrap-email-integration/SKILL.md +77 -0
  249. package/prisma/skills/make-interfaces-feel-better/SKILL.md +152 -0
  250. package/prisma/skills/manim-video/SKILL.md +90 -0
  251. package/prisma/skills/manim-video/assets/network_graph_scene.py +52 -0
  252. package/prisma/skills/market-research/SKILL.md +76 -0
  253. package/prisma/skills/marketing-campaign/SKILL.md +114 -0
  254. package/prisma/skills/mcp-server-patterns/SKILL.md +70 -0
  255. package/prisma/skills/messages-ops/SKILL.md +105 -0
  256. package/prisma/skills/ml-adoption-playbook/SKILL.md +57 -0
  257. package/prisma/skills/mle-workflow/SKILL.md +347 -0
  258. package/prisma/skills/motion-advanced/SKILL.md +596 -0
  259. package/prisma/skills/motion-foundations/SKILL.md +299 -0
  260. package/prisma/skills/motion-patterns/SKILL.md +434 -0
  261. package/prisma/skills/motion-ui/SKILL.md +576 -0
  262. package/prisma/skills/mysql-patterns/SKILL.md +413 -0
  263. package/prisma/skills/nanoclaw-repl/SKILL.md +34 -0
  264. package/prisma/skills/nestjs-patterns/SKILL.md +231 -0
  265. package/prisma/skills/netmiko-ssh-automation/SKILL.md +174 -0
  266. package/prisma/skills/network-bgp-diagnostics/SKILL.md +168 -0
  267. package/prisma/skills/network-config-validation/SKILL.md +211 -0
  268. package/prisma/skills/network-interface-health/SKILL.md +153 -0
  269. package/prisma/skills/nextjs-turbopack/SKILL.md +58 -0
  270. package/prisma/skills/nodejs-keccak256/SKILL.md +103 -0
  271. package/prisma/skills/nutrient-document-processing/SKILL.md +168 -0
  272. package/prisma/skills/nuxt4-patterns/SKILL.md +101 -0
  273. package/prisma/skills/openclaw-persona-forge/SKILL.md +289 -0
  274. package/prisma/skills/openclaw-persona-forge/gacha.py +224 -0
  275. package/prisma/skills/openclaw-persona-forge/gacha.sh +5 -0
  276. package/prisma/skills/openclaw-persona-forge/references/avatar-style.md +124 -0
  277. package/prisma/skills/openclaw-persona-forge/references/boundary-rules.md +53 -0
  278. package/prisma/skills/openclaw-persona-forge/references/error-handling.md +53 -0
  279. package/prisma/skills/openclaw-persona-forge/references/identity-tension.md +48 -0
  280. package/prisma/skills/openclaw-persona-forge/references/naming-system.md +39 -0
  281. package/prisma/skills/openclaw-persona-forge/references/output-template.md +166 -0
  282. package/prisma/skills/opensource-pipeline/SKILL.md +256 -0
  283. package/prisma/skills/orch-add-feature/SKILL.md +45 -0
  284. package/prisma/skills/orch-build-mvp/SKILL.md +49 -0
  285. package/prisma/skills/orch-change-feature/SKILL.md +43 -0
  286. package/prisma/skills/orch-fix-defect/SKILL.md +43 -0
  287. package/prisma/skills/orch-pipeline/SKILL.md +121 -0
  288. package/prisma/skills/orch-refine-code/SKILL.md +44 -0
  289. package/prisma/skills/parallel-execution-optimizer/SKILL.md +73 -0
  290. package/prisma/skills/perl-patterns/SKILL.md +505 -0
  291. package/prisma/skills/perl-security/SKILL.md +504 -0
  292. package/prisma/skills/perl-testing/SKILL.md +476 -0
  293. package/prisma/skills/plan-orchestrate/SKILL.md +263 -0
  294. package/prisma/skills/plankton-code-quality/SKILL.md +237 -0
  295. package/prisma/skills/postgres-patterns/SKILL.md +148 -0
  296. package/prisma/skills/prediction-market-oracle-research/SKILL.md +64 -0
  297. package/prisma/skills/prediction-market-risk-review/SKILL.md +61 -0
  298. package/prisma/skills/prisma-patterns/SKILL.md +401 -0
  299. package/prisma/skills/product-capability/SKILL.md +142 -0
  300. package/prisma/skills/product-lens/SKILL.md +93 -0
  301. package/prisma/skills/production-audit/SKILL.md +207 -0
  302. package/prisma/skills/production-scheduling/SKILL.md +238 -0
  303. package/prisma/skills/project-flow-ops/SKILL.md +112 -0
  304. package/prisma/skills/prompt-optimizer/SKILL.md +398 -0
  305. package/prisma/skills/python-patterns/SKILL.md +751 -0
  306. package/prisma/skills/python-testing/SKILL.md +817 -0
  307. package/prisma/skills/pytorch-patterns/SKILL.md +397 -0
  308. package/prisma/skills/quality-nonconformance/SKILL.md +260 -0
  309. package/prisma/skills/quarkus-patterns/SKILL.md +723 -0
  310. package/prisma/skills/quarkus-security/SKILL.md +468 -0
  311. package/prisma/skills/quarkus-tdd/SKILL.md +812 -0
  312. package/prisma/skills/quarkus-verification/SKILL.md +480 -0
  313. package/prisma/skills/ralphinho-rfc-pipeline/SKILL.md +68 -0
  314. package/prisma/skills/react-native-patterns/SKILL.md +326 -0
  315. package/prisma/skills/react-patterns/SKILL.md +342 -0
  316. package/prisma/skills/react-performance/SKILL.md +575 -0
  317. package/prisma/skills/react-testing/SKILL.md +424 -0
  318. package/prisma/skills/recsys-pipeline-architect/SKILL.md +115 -0
  319. package/prisma/skills/recursive-decision-ledger/SKILL.md +80 -0
  320. package/prisma/skills/redis-patterns/SKILL.md +404 -0
  321. package/prisma/skills/regex-vs-llm-structured-text/SKILL.md +221 -0
  322. package/prisma/skills/remotion-video-creation/SKILL.md +43 -0
  323. package/prisma/skills/remotion-video-creation/rules/3d.md +86 -0
  324. package/prisma/skills/remotion-video-creation/rules/animations.md +29 -0
  325. package/prisma/skills/remotion-video-creation/rules/assets/charts-bar-chart.tsx +173 -0
  326. package/prisma/skills/remotion-video-creation/rules/assets/text-animations-typewriter.tsx +100 -0
  327. package/prisma/skills/remotion-video-creation/rules/assets/text-animations-word-highlight.tsx +108 -0
  328. package/prisma/skills/remotion-video-creation/rules/assets.md +78 -0
  329. package/prisma/skills/remotion-video-creation/rules/audio.md +172 -0
  330. package/prisma/skills/remotion-video-creation/rules/calculate-metadata.md +104 -0
  331. package/prisma/skills/remotion-video-creation/rules/can-decode.md +75 -0
  332. package/prisma/skills/remotion-video-creation/rules/charts.md +58 -0
  333. package/prisma/skills/remotion-video-creation/rules/compositions.md +146 -0
  334. package/prisma/skills/remotion-video-creation/rules/display-captions.md +126 -0
  335. package/prisma/skills/remotion-video-creation/rules/extract-frames.md +229 -0
  336. package/prisma/skills/remotion-video-creation/rules/fonts.md +152 -0
  337. package/prisma/skills/remotion-video-creation/rules/get-audio-duration.md +58 -0
  338. package/prisma/skills/remotion-video-creation/rules/get-video-dimensions.md +68 -0
  339. package/prisma/skills/remotion-video-creation/rules/get-video-duration.md +58 -0
  340. package/prisma/skills/remotion-video-creation/rules/gifs.md +138 -0
  341. package/prisma/skills/remotion-video-creation/rules/images.md +130 -0
  342. package/prisma/skills/remotion-video-creation/rules/import-srt-captions.md +67 -0
  343. package/prisma/skills/remotion-video-creation/rules/lottie.md +67 -0
  344. package/prisma/skills/remotion-video-creation/rules/measuring-dom-nodes.md +34 -0
  345. package/prisma/skills/remotion-video-creation/rules/measuring-text.md +143 -0
  346. package/prisma/skills/remotion-video-creation/rules/sequencing.md +106 -0
  347. package/prisma/skills/remotion-video-creation/rules/tailwind.md +11 -0
  348. package/prisma/skills/remotion-video-creation/rules/text-animations.md +20 -0
  349. package/prisma/skills/remotion-video-creation/rules/timing.md +179 -0
  350. package/prisma/skills/remotion-video-creation/rules/transcribe-captions.md +19 -0
  351. package/prisma/skills/remotion-video-creation/rules/transitions.md +122 -0
  352. package/prisma/skills/remotion-video-creation/rules/trimming.md +52 -0
  353. package/prisma/skills/remotion-video-creation/rules/videos.md +171 -0
  354. package/prisma/skills/repo-scan/SKILL.md +79 -0
  355. package/prisma/skills/research-ops/SKILL.md +113 -0
  356. package/prisma/skills/returns-reverse-logistics/SKILL.md +240 -0
  357. package/prisma/skills/rules-distill/SKILL.md +265 -0
  358. package/prisma/skills/rules-distill/scripts/scan-rules.sh +58 -0
  359. package/prisma/skills/rules-distill/scripts/scan-skills.sh +129 -0
  360. package/prisma/skills/rust-patterns/SKILL.md +500 -0
  361. package/prisma/skills/rust-testing/SKILL.md +501 -0
  362. package/prisma/skills/safety-guard/SKILL.md +76 -0
  363. package/prisma/skills/santa-method/SKILL.md +307 -0
  364. package/prisma/skills/scientific-db-pubmed-database/SKILL.md +176 -0
  365. package/prisma/skills/scientific-db-uspto-database/SKILL.md +178 -0
  366. package/prisma/skills/scientific-pkg-gget/SKILL.md +167 -0
  367. package/prisma/skills/scientific-thinking-literature-review/SKILL.md +193 -0
  368. package/prisma/skills/scientific-thinking-scholar-evaluation/SKILL.md +161 -0
  369. package/prisma/skills/search-first/SKILL.md +183 -0
  370. package/prisma/skills/security-bounty-hunter/SKILL.md +100 -0
  371. package/prisma/skills/security-review/SKILL.md +504 -0
  372. package/prisma/skills/security-review/cloud-infrastructure-security.md +361 -0
  373. package/prisma/skills/security-scan/SKILL.md +166 -0
  374. package/prisma/skills/seo/SKILL.md +155 -0
  375. package/prisma/skills/skill-comply/SKILL.md +59 -0
  376. package/prisma/skills/skill-comply/fixtures/compliant_trace.jsonl +5 -0
  377. package/prisma/skills/skill-comply/fixtures/noncompliant_trace.jsonl +3 -0
  378. package/prisma/skills/skill-comply/fixtures/tdd_spec.yaml +44 -0
  379. package/prisma/skills/skill-comply/prompts/classifier.md +24 -0
  380. package/prisma/skills/skill-comply/prompts/scenario_generator.md +62 -0
  381. package/prisma/skills/skill-comply/prompts/spec_generator.md +42 -0
  382. package/prisma/skills/skill-comply/pyproject.toml +15 -0
  383. package/prisma/skills/skill-comply/scripts/__init__.py +0 -0
  384. package/prisma/skills/skill-comply/scripts/classifier.py +85 -0
  385. package/prisma/skills/skill-comply/scripts/grader.py +124 -0
  386. package/prisma/skills/skill-comply/scripts/parser.py +107 -0
  387. package/prisma/skills/skill-comply/scripts/report.py +170 -0
  388. package/prisma/skills/skill-comply/scripts/run.py +127 -0
  389. package/prisma/skills/skill-comply/scripts/runner.py +194 -0
  390. package/prisma/skills/skill-comply/scripts/scenario_generator.py +70 -0
  391. package/prisma/skills/skill-comply/scripts/spec_generator.py +72 -0
  392. package/prisma/skills/skill-comply/scripts/utils.py +13 -0
  393. package/prisma/skills/skill-comply/tests/test_grader.py +197 -0
  394. package/prisma/skills/skill-comply/tests/test_parser.py +90 -0
  395. package/prisma/skills/skill-comply/tests/test_runner.py +172 -0
  396. package/prisma/skills/skill-scout/SKILL.md +141 -0
  397. package/prisma/skills/skill-stocktake/SKILL.md +195 -0
  398. package/prisma/skills/skill-stocktake/scripts/quick-diff.sh +87 -0
  399. package/prisma/skills/skill-stocktake/scripts/save-results.sh +56 -0
  400. package/prisma/skills/skill-stocktake/scripts/scan.sh +170 -0
  401. package/prisma/skills/social-graph-ranker/SKILL.md +155 -0
  402. package/prisma/skills/social-publisher/SKILL.md +130 -0
  403. package/prisma/skills/springboot-patterns/SKILL.md +315 -0
  404. package/prisma/skills/springboot-security/SKILL.md +273 -0
  405. package/prisma/skills/springboot-tdd/SKILL.md +159 -0
  406. package/prisma/skills/springboot-verification/SKILL.md +232 -0
  407. package/prisma/skills/strategic-compact/SKILL.md +136 -0
  408. package/prisma/skills/swift-actor-persistence/SKILL.md +144 -0
  409. package/prisma/skills/swift-concurrency-6-2/SKILL.md +216 -0
  410. package/prisma/skills/swift-protocol-di-testing/SKILL.md +191 -0
  411. package/prisma/skills/swiftui-patterns/SKILL.md +259 -0
  412. package/prisma/skills/taste/SKILL.md +264 -0
  413. package/prisma/skills/taste/references/genre-taxonomy.md +87 -0
  414. package/prisma/skills/tdd-workflow/SKILL.md +583 -0
  415. package/prisma/skills/team-agent-orchestration/SKILL.md +111 -0
  416. package/prisma/skills/team-builder/SKILL.md +169 -0
  417. package/prisma/skills/terminal-ops/SKILL.md +110 -0
  418. package/prisma/skills/tinystruct-patterns/SKILL.md +279 -0
  419. package/prisma/skills/tinystruct-patterns/references/architecture.md +90 -0
  420. package/prisma/skills/tinystruct-patterns/references/data-handling.md +60 -0
  421. package/prisma/skills/tinystruct-patterns/references/database.md +99 -0
  422. package/prisma/skills/tinystruct-patterns/references/routing.md +64 -0
  423. package/prisma/skills/tinystruct-patterns/references/system-usage.md +97 -0
  424. package/prisma/skills/tinystruct-patterns/references/testing.md +72 -0
  425. package/prisma/skills/token-budget-advisor/SKILL.md +134 -0
  426. package/prisma/skills/ui-demo/SKILL.md +466 -0
  427. package/prisma/skills/ui-to-vue/SKILL.md +135 -0
  428. package/prisma/skills/uncloud/SKILL.md +344 -0
  429. package/prisma/skills/unified-notifications-ops/SKILL.md +188 -0
  430. package/prisma/skills/verification-loop/SKILL.md +127 -0
  431. package/prisma/skills/video-editing/SKILL.md +311 -0
  432. package/prisma/skills/videodb/SKILL.md +375 -0
  433. package/prisma/skills/videodb/reference/api-reference.md +550 -0
  434. package/prisma/skills/videodb/reference/capture-reference.md +407 -0
  435. package/prisma/skills/videodb/reference/capture.md +101 -0
  436. package/prisma/skills/videodb/reference/editor.md +443 -0
  437. package/prisma/skills/videodb/reference/generative.md +331 -0
  438. package/prisma/skills/videodb/reference/rtstream-reference.md +564 -0
  439. package/prisma/skills/videodb/reference/rtstream.md +65 -0
  440. package/prisma/skills/videodb/reference/search.md +230 -0
  441. package/prisma/skills/videodb/reference/streaming.md +406 -0
  442. package/prisma/skills/videodb/reference/use-cases.md +118 -0
  443. package/prisma/skills/videodb/scripts/ws_listener.py +282 -0
  444. package/prisma/skills/visa-doc-translate/README.md +86 -0
  445. package/prisma/skills/visa-doc-translate/SKILL.md +117 -0
  446. package/prisma/skills/vite-patterns/SKILL.md +450 -0
  447. package/prisma/skills/vue-patterns/SKILL.md +471 -0
  448. package/prisma/skills/windows-desktop-e2e/SKILL.md +888 -0
  449. package/prisma/skills/workspace-surface-audit/SKILL.md +126 -0
  450. package/prisma/skills/x-api/SKILL.md +235 -0
  451. package/run.mjs +0 -10
@@ -0,0 +1,147 @@
1
+ ---
2
+ name: accessibility
3
+ description: Design, implement, and audit inclusive digital products using WCAG 2.2 Level AA
4
+ standards. Use this skill to generate semantic ARIA for Web and accessibility traits for Web and Native platforms (iOS/Android).
5
+ metadata:
6
+ origin: ECC
7
+ ---
8
+
9
+ # Accessibility (WCAG 2.2)
10
+
11
+ This skill ensures that digital interfaces are Perceivable, Operable, Understandable, and Robust (POUR) for all users, including those using screen readers, switch controls, or keyboard navigation. It focuses on the technical implementation of WCAG 2.2 success criteria.
12
+
13
+ ## When to Use
14
+
15
+ - Defining UI component specifications for Web, iOS, or Android.
16
+ - Auditing existing code for accessibility barriers or compliance gaps.
17
+ - Implementing new WCAG 2.2 standards like Target Size (Minimum) and Focus Appearance.
18
+ - Mapping high-level design requirements to technical attributes (ARIA roles, traits, hints).
19
+
20
+ ## Core Concepts
21
+
22
+ - **POUR Principles**: The foundation of WCAG (Perceivable, Operable, Understandable, Robust).
23
+ - **Semantic Mapping**: Using native elements over generic containers to provide built-in accessibility.
24
+ - **Accessibility Tree**: The representation of the UI that assistive technologies actually "read."
25
+ - **Focus Management**: Controlling the order and visibility of the keyboard/screen reader cursor.
26
+ - **Labeling & Hints**: Providing context through `aria-label`, `accessibilityLabel`, and `contentDescription`.
27
+
28
+ ## How It Works
29
+
30
+ ### Step 1: Identify the Component Role
31
+
32
+ Determine the functional purpose (e.g., Is this a button, a link, or a tab?). Use the most semantic native element available before resorting to custom roles.
33
+
34
+ ### Step 2: Define Perceivable Attributes
35
+
36
+ - Ensure text contrast meets **4.5:1** (normal) or **3:1** (large/UI).
37
+ - Add text alternatives for non-text content (images, icons).
38
+ - Implement responsive reflow (up to 400% zoom without loss of function).
39
+
40
+ ### Step 3: Implement Operable Controls
41
+
42
+ - Ensure a minimum **24x24 CSS pixel** target size (WCAG 2.2 SC 2.5.8).
43
+ - Verify all interactive elements are reachable via keyboard and have a visible focus indicator (SC 2.4.11).
44
+ - Provide single-pointer alternatives for dragging movements.
45
+
46
+ ### Step 4: Ensure Understandable Logic
47
+
48
+ - Use consistent navigation patterns.
49
+ - Provide descriptive error messages and suggestions for correction (SC 3.3.3).
50
+ - Implement "Redundant Entry" (SC 3.3.7) to prevent asking for the same data twice.
51
+
52
+ ### Step 5: Verify Robust Compatibility
53
+
54
+ - Use correct `Name, Role, Value` patterns.
55
+ - Implement `aria-live` or live regions for dynamic status updates.
56
+
57
+ ## Accessibility Architecture Diagram
58
+
59
+ ```mermaid
60
+ flowchart TD
61
+ UI["UI Component"] --> Platform{Platform?}
62
+ Platform -->|Web| ARIA["WAI-ARIA + HTML5"]
63
+ Platform -->|iOS| SwiftUI["Accessibility Traits + Labels"]
64
+ Platform -->|Android| Compose["Semantics + ContentDesc"]
65
+
66
+ ARIA --> AT["Assistive Technology (Screen Readers, Switches)"]
67
+ SwiftUI --> AT
68
+ Compose --> AT
69
+ ```
70
+
71
+ ## Cross-Platform Mapping
72
+
73
+ | Feature | Web (HTML/ARIA) | iOS (SwiftUI) | Android (Compose) |
74
+ | :----------------- | :----------------------- | :----------------------------------- | :---------------------------------------------------------- |
75
+ | **Primary Label** | `aria-label` / `<label>` | `.accessibilityLabel()` | `contentDescription` |
76
+ | **Secondary Hint** | `aria-describedby` | `.accessibilityHint()` | `Modifier.semantics { stateDescription = ... }` |
77
+ | **Action Role** | `role="button"` | `.accessibilityAddTraits(.isButton)` | `Modifier.semantics { role = Role.Button }` |
78
+ | **Live Updates** | `aria-live="polite"` | `.accessibilityLiveRegion(.polite)` | `Modifier.semantics { liveRegion = LiveRegionMode.Polite }` |
79
+
80
+ ## Examples
81
+
82
+ ### Web: Accessible Search
83
+
84
+ ```html
85
+ <form role="search">
86
+ <label for="search-input" class="sr-only">Search products</label>
87
+ <input type="search" id="search-input" placeholder="Search..." />
88
+ <button type="submit" aria-label="Submit Search">
89
+ <svg aria-hidden="true">...</svg>
90
+ </button>
91
+ </form>
92
+ ```
93
+
94
+ ### iOS: Accessible Action Button
95
+
96
+ ```swift
97
+ Button(action: deleteItem) {
98
+ Image(systemName: "trash")
99
+ }
100
+ .accessibilityLabel("Delete item")
101
+ .accessibilityHint("Permanently removes this item from your list")
102
+ .accessibilityAddTraits(.isButton)
103
+ ```
104
+
105
+ ### Android: Accessible Toggle
106
+
107
+ ```kotlin
108
+ Switch(
109
+ checked = isEnabled,
110
+ onCheckedChange = { onToggle() },
111
+ modifier = Modifier.semantics {
112
+ contentDescription = "Enable notifications"
113
+ }
114
+ )
115
+ ```
116
+
117
+ ## Anti-Patterns to Avoid
118
+
119
+ - **Div-Buttons**: Using a `<div>` or `<span>` for a click event without adding a role and keyboard support.
120
+ - **Color-Only Meaning**: Indicating an error or status _only_ with a color change (e.g., turning a border red).
121
+ - **Uncontained Modal Focus**: Modals that don't trap focus, allowing keyboard users to navigate background content while the modal is open. Focus must be contained _and_ escapable via the `Escape` key or an explicit close button (WCAG SC 2.1.2).
122
+ - **Redundant Alt Text**: Using "Image of..." or "Picture of..." in alt text (screen readers already announce the role "Image").
123
+
124
+ ## Best Practices Checklist
125
+
126
+ - [ ] Interactive elements meet the **24x24px** (Web) or **44x44pt** (Native) target size.
127
+ - [ ] Focus indicators are clearly visible and high-contrast.
128
+ - [ ] Modals **contain focus** while open, and release it cleanly on close (`Escape` key or close button).
129
+ - [ ] Dropdowns and menus restore focus to the trigger element on close.
130
+ - [ ] Forms provide text-based error suggestions.
131
+ - [ ] All icon-only buttons have a descriptive text label.
132
+ - [ ] Content reflows properly when text is scaled.
133
+
134
+ ## References
135
+
136
+ - [WCAG 2.2 Guidelines](https://www.w3.org/TR/WCAG22/)
137
+ - [WAI-ARIA Authoring Practices](https://www.w3.org/TR/wai-aria-practices/)
138
+ - [iOS Accessibility Programming Guide](https://developer.apple.com/documentation/accessibility)
139
+ - [iOS Human Interface Guidelines - Accessibility](https://developer.apple.com/design/human-interface-guidelines/accessibility)
140
+ - [Android Accessibility Developer Guide](https://developer.android.com/guide/topics/ui/accessibility)
141
+
142
+ ## Related Skills
143
+
144
+ - `frontend-patterns`
145
+ - `design-system`
146
+ - `liquid-glass-design`
147
+ - `swiftui-patterns`
@@ -0,0 +1,257 @@
1
+ ---
2
+ name: agent-architecture-audit
3
+ description: Full-stack diagnostic for agent and LLM applications. Audits the 12-layer agent stack for wrapper regression, memory pollution, tool discipline failures, hidden repair loops, and rendering corruption. Produces severity-ranked findings with code-first fixes. Essential for developers building agent applications, autonomous loops, or any LLM-powered feature.
4
+ metadata:
5
+ origin: oh-my-agent-check
6
+ tools: Read, Write, Edit, Bash, Grep, Glob
7
+ ---
8
+
9
+ # Agent Architecture Audit
10
+
11
+ A diagnostic workflow for agent systems that hide failures behind wrapper layers, stale memory, retry loops, or transport/rendering mutations.
12
+
13
+ ## When to Activate
14
+
15
+ **MANDATORY for:**
16
+ - Releasing any agent or LLM-powered application to production
17
+ - Shipping features with tool calling, memory, or multi-step workflows
18
+ - Agent behavior degrades after adding wrapper layers
19
+ - User reports "the agent is getting worse" or "tools are flaky"
20
+ - Same model works in playground but breaks inside your wrapper
21
+ - Debugging agent behavior for more than 15 minutes without finding root cause
22
+
23
+ **Especially critical when:**
24
+ - You've added new prompt layers, tool definitions, or memory systems
25
+ - Different agents in your system behave inconsistently
26
+ - The model was fine yesterday but is hallucinating today
27
+ - You suspect hidden repair/retry loops silently mutating responses
28
+
29
+ **Do not use for:**
30
+ - General code debugging — use `agent-introspection-debugging`
31
+ - Code review — use language-specific reviewer agents
32
+ - Security scanning — use `security-review` or `security-review/scan`
33
+ - Agent performance benchmarking — use `agent-eval`
34
+ - Writing new features — use the appropriate workflow skill
35
+
36
+ ## The 12-Layer Stack
37
+
38
+ Every agent system has these layers. Any of them can corrupt the answer:
39
+
40
+ | # | Layer | What Goes Wrong |
41
+ |---|-------|----------------|
42
+ | 1 | System prompt | Conflicting instructions, instruction bloat |
43
+ | 2 | Session history | Stale context injection from previous turns |
44
+ | 3 | Long-term memory | Pollution across sessions, old topics in new conversations |
45
+ | 4 | Distillation | Compressed artifacts re-entering as pseudo-facts |
46
+ | 5 | Active recall | Redundant re-summary layers wasting context |
47
+ | 6 | Tool selection | Wrong tool routing, model skips required tools |
48
+ | 7 | Tool execution | Hallucinated execution — claims to call but doesn't |
49
+ | 8 | Tool interpretation | Misread or ignored tool output |
50
+ | 9 | Answer shaping | Format corruption in final response |
51
+ | 10 | Platform rendering | Transport-layer mutation (UI, API, CLI mutates valid answers) |
52
+ | 11 | Hidden repair loops | Silent fallback/retry agents running second LLM pass |
53
+ | 12 | Persistence | Expired state or cached artifacts reused as live evidence |
54
+
55
+ ## Common Failure Patterns
56
+
57
+ ### 1. Wrapper Regression
58
+
59
+ The base model produces correct answers, but the wrapper layers make it worse.
60
+
61
+ **Symptoms:**
62
+ - Model works fine in playground or direct API call, breaks in your agent
63
+ - Added a new prompt layer, existing behavior degraded
64
+ - Agent sounds confident but is confidently wrong
65
+ - "It was working before the last update"
66
+
67
+ ### 2. Memory Contamination
68
+
69
+ Old topics leak into new conversations through history, memory retrieval, or distillation.
70
+
71
+ **Symptoms:**
72
+ - Agent brings up unrelated past topics
73
+ - User corrections don't stick (old memory overwrites new)
74
+ - Same-session artifacts re-enter as pseudo-facts
75
+ - Memory grows without bound, degrading response quality over time
76
+
77
+ ### 3. Tool Discipline Failure
78
+
79
+ Tools are declared in the prompt but not enforced in code. The model skips them or hallucinates execution.
80
+
81
+ **Symptoms:**
82
+ - "Must use tool X" in prompt, but model answers without calling it
83
+ - Tool results look correct but were never actually executed
84
+ - Different tools fight over the same responsibility
85
+ - Model uses tool when it shouldn't, or skips it when it must
86
+
87
+ ### 4. Rendering/Transport Corruption
88
+
89
+ The agent's internal answer is correct, but the platform layer mutates it during delivery.
90
+
91
+ **Symptoms:**
92
+ - Logs show correct answer, user sees broken output
93
+ - Markdown rendering, JSON parsing, or streaming fragments corrupt valid responses
94
+ - Hidden fallback agent quietly replaces the answer before delivery
95
+ - Output differs between terminal and UI
96
+
97
+ ### 5. Hidden Agent Layers
98
+
99
+ Silent repair, retry, summarization, or recall agents run without explicit contracts.
100
+
101
+ **Symptoms:**
102
+ - Output changes between internal generation and user delivery
103
+ - "Auto-fix" loops run a second LLM pass the user doesn't know about
104
+ - Multiple agents modify the same output without coordination
105
+ - Answers get "smoothed" or "corrected" by invisible layers
106
+
107
+ ## Audit Workflow
108
+
109
+ ### Phase 1: Scope
110
+
111
+ Define what you're auditing:
112
+
113
+ - **Target system** — what agent application?
114
+ - **Entrypoints** — how do users interact with it?
115
+ - **Model stack** — which LLM(s) and providers?
116
+ - **Symptoms** — what does the user report?
117
+ - **Time window** — when did it start?
118
+ - **Layers to audit** — which of the 12 layers apply?
119
+
120
+ ### Phase 2: Evidence Collection
121
+
122
+ Gather evidence from the codebase:
123
+
124
+ - **Source code** — agent loop, tool router, memory admission, prompt assembly
125
+ - **Logs** — historical session traces, tool call records
126
+ - **Config** — prompt templates, tool schemas, provider settings
127
+ - **Memory files** — SOPs, knowledge bases, session archives
128
+
129
+ Use `rg` to search for anti-patterns:
130
+
131
+ ```bash
132
+ # Tool requirements expressed only in prompt text (not code)
133
+ rg "must.*tool|必须.*工具|required.*call" --type md
134
+
135
+ # Tool execution without validation
136
+ rg "tool_call|toolCall|tool_use" --type py --type ts
137
+
138
+ # Hidden LLM calls outside main agent loop
139
+ rg "completion|chat\.create|messages\.create|llm\.invoke"
140
+
141
+ # Memory admission without user-correction priority
142
+ rg "memory.*admit|long.*term.*update|persist.*memory" --type py --type ts
143
+
144
+ # Fallback loops that run additional LLM calls
145
+ rg "fallback|retry.*llm|repair.*prompt|re-?prompt" --type py --type ts
146
+
147
+ # Silent output mutation
148
+ rg "mutate|rewrite.*response|transform.*output|shap" --type py --type ts
149
+ ```
150
+
151
+ ### Phase 3: Failure Mapping
152
+
153
+ For each finding, document:
154
+
155
+ - **Symptom** — what the user sees
156
+ - **Mechanism** — how the wrapper causes it
157
+ - **Source layer** — which of the 12 layers
158
+ - **Root cause** — the deepest cause
159
+ - **Evidence** — file:line or log:row reference
160
+ - **Confidence** — 0.0 to 1.0
161
+
162
+ ### Phase 4: Fix Strategy
163
+
164
+ Default fix order (code-first, not prompt-first):
165
+
166
+ 1. **Code-gate tool requirements** — enforce in code, not just prompt text
167
+ 2. **Remove or narrow hidden repair agents** — make fallback explicit with contracts
168
+ 3. **Reduce context duplication** — same info through prompt + history + memory + distillation
169
+ 4. **Tighten memory admission** — user corrections > agent assertions
170
+ 5. **Tighten distillation triggers** — don't compress what shouldn't be compressed
171
+ 6. **Reduce rendering mutation** — pass-through, don't transform
172
+ 7. **Convert to typed JSON envelopes** — structured internal flow, not freeform prose
173
+
174
+ ## Severity Model
175
+
176
+ | Level | Meaning | Action |
177
+ |-------|---------|--------|
178
+ | `critical` | Agent can confidently produce wrong operational behavior | Fix before next release |
179
+ | `high` | Agent frequently degrades correctness or stability | Fix this sprint |
180
+ | `medium` | Correctness usually survives but output is fragile or wasteful | Plan for next cycle |
181
+ | `low` | Mostly cosmetic or maintainability issues | Backlog |
182
+
183
+ ## Output Format
184
+
185
+ Present findings to the user in this order:
186
+
187
+ 1. **Severity-ranked findings** (most critical first)
188
+ 2. **Architecture diagnosis** (which layer corrupted what, and why)
189
+ 3. **Ordered fix plan** (code-first, not prompt-first)
190
+
191
+ Do not lead with compliments or summaries. If the system is broken, say so directly.
192
+
193
+ ## Quick Diagnostic Questions
194
+
195
+ When auditing an agent system, answer these:
196
+
197
+ | # | Question | If Yes → |
198
+ |---|----------|----------|
199
+ | 1 | Can the model skip a required tool and still answer? | Tool not code-gated |
200
+ | 2 | Does old conversation content appear in new turns? | Memory contamination |
201
+ | 3 | Is the same info in system prompt AND memory AND history? | Context duplication |
202
+ | 4 | Does the platform run a second LLM pass before delivery? | Hidden repair loop |
203
+ | 5 | Does the output differ between internal generation and user delivery? | Rendering corruption |
204
+ | 6 | Are "must use tool X" rules only in prompt text? | Tool discipline failure |
205
+ | 7 | Can the agent's own monologue become persistent memory? | Memory poisoning |
206
+
207
+ ## Anti-Patterns to Avoid
208
+
209
+ - Avoid blaming the model before falsifying wrapper-layer regressions.
210
+ - Avoid blaming memory without showing the contamination path.
211
+ - Do not let a clean current state erase a dirty historical incident.
212
+ - Do not treat markdown prose as a trustworthy internal protocol.
213
+ - Do not accept "must use tool" in prompt text when code never enforces it.
214
+ - Keep findings direct, evidence-backed, and severity-ranked.
215
+
216
+ ## Report Schema
217
+
218
+ Audits should produce structured reports following this shape:
219
+
220
+ ```json
221
+ {
222
+ "schema_version": "ecc.agent-architecture-audit.report.v1",
223
+ "executive_verdict": {
224
+ "overall_health": "high_risk",
225
+ "primary_failure_mode": "string",
226
+ "most_urgent_fix": "string"
227
+ },
228
+ "scope": {
229
+ "target_name": "string",
230
+ "model_stack": ["string"],
231
+ "layers_to_audit": ["string"]
232
+ },
233
+ "findings": [
234
+ {
235
+ "severity": "critical|high|medium|low",
236
+ "title": "string",
237
+ "mechanism": "string",
238
+ "source_layer": "string",
239
+ "root_cause": "string",
240
+ "evidence_refs": ["file:line"],
241
+ "confidence": 0.0,
242
+ "recommended_fix": "string"
243
+ }
244
+ ],
245
+ "ordered_fix_plan": [
246
+ { "order": 1, "goal": "string", "why_now": "string", "expected_effect": "string" }
247
+ ]
248
+ }
249
+ ```
250
+
251
+ ## Related Skills
252
+
253
+ - `agent-introspection-debugging` — Debug agent runtime failures (loops, timeouts, state errors)
254
+ - `agent-eval` — Benchmark agent performance head-to-head
255
+ - `security-review` — Security audit for code and configuration
256
+ - `autonomous-agent-harness` — Set up autonomous agent operations
257
+ - `agent-harness-construction` — Build agent harnesses from scratch
@@ -0,0 +1,146 @@
1
+ ---
2
+ name: agent-eval
3
+ description: Head-to-head comparison of coding agents (Claude Code, Aider, Codex, etc.) on custom tasks with pass rate, cost, time, and consistency metrics
4
+ metadata:
5
+ origin: ECC
6
+ tools: Read, Write, Edit, Bash, Grep, Glob
7
+ ---
8
+
9
+ # Agent Eval Skill
10
+
11
+ A lightweight CLI tool for comparing coding agents head-to-head on reproducible tasks. Every "which coding agent is best?" comparison runs on vibes — this tool systematizes it.
12
+
13
+ ## When to Activate
14
+
15
+ - Comparing coding agents (Claude Code, Aider, Codex, etc.) on your own codebase
16
+ - Measuring agent performance before adopting a new tool or model
17
+ - Running regression checks when an agent updates its model or tooling
18
+ - Producing data-backed agent selection decisions for a team
19
+
20
+ ## Installation
21
+
22
+ > **Note:** Install agent-eval from its repository after reviewing the source.
23
+
24
+ ## Core Concepts
25
+
26
+ ### YAML Task Definitions
27
+
28
+ Define tasks declaratively. Each task specifies what to do, which files to touch, and how to judge success:
29
+
30
+ ```yaml
31
+ name: add-retry-logic
32
+ description: Add exponential backoff retry to the HTTP client
33
+ repo: ./my-project
34
+ files:
35
+ - src/http_client.py
36
+ prompt: |
37
+ Add retry logic with exponential backoff to all HTTP requests.
38
+ Max 3 retries. Initial delay 1s, max delay 30s.
39
+ judge:
40
+ - type: pytest
41
+ command: pytest tests/test_http_client.py -v
42
+ - type: grep
43
+ pattern: "exponential_backoff|retry"
44
+ files: src/http_client.py
45
+ commit: "abc1234" # pin to specific commit for reproducibility
46
+ ```
47
+
48
+ ### Git Worktree Isolation
49
+
50
+ Each agent run gets its own git worktree — no Docker required. This provides reproducibility isolation so agents cannot interfere with each other or corrupt the base repo.
51
+
52
+ ### Metrics Collected
53
+
54
+ | Metric | What It Measures |
55
+ |--------|-----------------|
56
+ | Pass rate | Did the agent produce code that passes the judge? |
57
+ | Cost | API spend per task (when available) |
58
+ | Time | Wall-clock seconds to completion |
59
+ | Consistency | Pass rate across repeated runs (e.g., 3/3 = 100%) |
60
+
61
+ ## Workflow
62
+
63
+ ### 1. Define Tasks
64
+
65
+ Create a `tasks/` directory with YAML files, one per task:
66
+
67
+ ```bash
68
+ mkdir tasks
69
+ # Write task definitions (see template above)
70
+ ```
71
+
72
+ ### 2. Run Agents
73
+
74
+ Execute agents against your tasks:
75
+
76
+ ```bash
77
+ agent-eval run --task tasks/add-retry-logic.yaml --agent claude-code --agent aider --runs 3
78
+ ```
79
+
80
+ Each run:
81
+ 1. Creates a fresh git worktree from the specified commit
82
+ 2. Hands the prompt to the agent
83
+ 3. Runs the judge criteria
84
+ 4. Records pass/fail, cost, and time
85
+
86
+ ### 3. Compare Results
87
+
88
+ Generate a comparison report:
89
+
90
+ ```bash
91
+ agent-eval report --format table
92
+ ```
93
+
94
+ ```
95
+ Task: add-retry-logic (3 runs each)
96
+ ┌──────────────┬───────────┬────────┬────────┬─────────────┐
97
+ │ Agent │ Pass Rate │ Cost │ Time │ Consistency │
98
+ ├──────────────┼───────────┼────────┼────────┼─────────────┤
99
+ │ claude-code │ 3/3 │ $0.12 │ 45s │ 100% │
100
+ │ aider │ 2/3 │ $0.08 │ 38s │ 67% │
101
+ └──────────────┴───────────┴────────┴────────┴─────────────┘
102
+ ```
103
+
104
+ ## Judge Types
105
+
106
+ ### Code-Based (deterministic)
107
+
108
+ ```yaml
109
+ judge:
110
+ - type: pytest
111
+ command: pytest tests/ -v
112
+ - type: command
113
+ command: npm run build
114
+ ```
115
+
116
+ ### Pattern-Based
117
+
118
+ ```yaml
119
+ judge:
120
+ - type: grep
121
+ pattern: "class.*Retry"
122
+ files: src/**/*.py
123
+ ```
124
+
125
+ ### Model-Based (LLM-as-judge)
126
+
127
+ ```yaml
128
+ judge:
129
+ - type: llm
130
+ prompt: |
131
+ Does this implementation correctly handle exponential backoff?
132
+ Check for: max retries, increasing delays, jitter.
133
+ ```
134
+
135
+ ## Best Practices
136
+
137
+ - **Start with 3-5 tasks** that represent your real workload, not toy examples
138
+ - **Run at least 3 trials** per agent to capture variance — agents are non-deterministic
139
+ - **Pin the commit** in your task YAML so results are reproducible across days/weeks
140
+ - **Include at least one deterministic judge** (tests, build) per task — LLM judges add noise
141
+ - **Track cost alongside pass rate** — a 95% agent at 10x the cost may not be the right choice
142
+ - **Version your task definitions** — they are test fixtures, treat them as code
143
+
144
+ ## Links
145
+
146
+ - Repository: [github.com/joaquinhuigomez/agent-eval](https://github.com/joaquinhuigomez/agent-eval)
@@ -0,0 +1,74 @@
1
+ ---
2
+ name: agent-harness-construction
3
+ description: Design and optimize AI agent action spaces, tool definitions, and observation formatting for higher completion rates.
4
+ metadata:
5
+ origin: ECC
6
+ ---
7
+
8
+ # Agent Harness Construction
9
+
10
+ Use this skill when you are improving how an agent plans, calls tools, recovers from errors, and converges on completion.
11
+
12
+ ## Core Model
13
+
14
+ Agent output quality is constrained by:
15
+ 1. Action space quality
16
+ 2. Observation quality
17
+ 3. Recovery quality
18
+ 4. Context budget quality
19
+
20
+ ## Action Space Design
21
+
22
+ 1. Use stable, explicit tool names.
23
+ 2. Keep inputs schema-first and narrow.
24
+ 3. Return deterministic output shapes.
25
+ 4. Avoid catch-all tools unless isolation is impossible.
26
+
27
+ ## Granularity Rules
28
+
29
+ - Use micro-tools for high-risk operations (deploy, migration, permissions).
30
+ - Use medium tools for common edit/read/search loops.
31
+ - Use macro-tools only when round-trip overhead is the dominant cost.
32
+
33
+ ## Observation Design
34
+
35
+ Every tool response should include:
36
+ - `status`: success|warning|error
37
+ - `summary`: one-line result
38
+ - `next_actions`: actionable follow-ups
39
+ - `artifacts`: file paths / IDs
40
+
41
+ ## Error Recovery Contract
42
+
43
+ For every error path, include:
44
+ - root cause hint
45
+ - safe retry instruction
46
+ - explicit stop condition
47
+
48
+ ## Context Budgeting
49
+
50
+ 1. Keep system prompt minimal and invariant.
51
+ 2. Move large guidance into skills loaded on demand.
52
+ 3. Prefer references to files over inlining long documents.
53
+ 4. Compact at phase boundaries, not arbitrary token thresholds.
54
+
55
+ ## Architecture Pattern Guidance
56
+
57
+ - ReAct: best for exploratory tasks with uncertain path.
58
+ - Function-calling: best for structured deterministic flows.
59
+ - Hybrid (recommended): ReAct planning + typed tool execution.
60
+
61
+ ## Benchmarking
62
+
63
+ Track:
64
+ - completion rate
65
+ - retries per task
66
+ - pass@1 and pass@3
67
+ - cost per successful task
68
+
69
+ ## Anti-Patterns
70
+
71
+ - Too many tools with overlapping semantics.
72
+ - Opaque tool output with no recovery hints.
73
+ - Error-only output without next steps.
74
+ - Context overloading with irrelevant references.