@shakudo/kaji-setup-external 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (411) hide show
  1. package/README.md +155 -0
  2. package/assets/skills/ci-cd/.claude-plugin/plugin.json +8 -0
  3. package/assets/skills/ci-cd/SKILL.md +573 -0
  4. package/assets/skills/ci-cd/assets/templates/github-actions/docker-build.yml +164 -0
  5. package/assets/skills/ci-cd/assets/templates/github-actions/go-ci.yml +420 -0
  6. package/assets/skills/ci-cd/assets/templates/github-actions/node-ci.yml +313 -0
  7. package/assets/skills/ci-cd/assets/templates/github-actions/python-ci.yml +388 -0
  8. package/assets/skills/ci-cd/assets/templates/github-actions/security-scan.yml +416 -0
  9. package/assets/skills/ci-cd/assets/templates/gitlab-ci/docker-build.yml +298 -0
  10. package/assets/skills/ci-cd/assets/templates/gitlab-ci/go-ci.yml +548 -0
  11. package/assets/skills/ci-cd/assets/templates/gitlab-ci/node-ci.yml +334 -0
  12. package/assets/skills/ci-cd/assets/templates/gitlab-ci/python-ci.yml +472 -0
  13. package/assets/skills/ci-cd/assets/templates/gitlab-ci/security-scan.yml +479 -0
  14. package/assets/skills/ci-cd/references/best_practices.md +675 -0
  15. package/assets/skills/ci-cd/references/devsecops.md +862 -0
  16. package/assets/skills/ci-cd/references/optimization.md +651 -0
  17. package/assets/skills/ci-cd/references/security.md +611 -0
  18. package/assets/skills/ci-cd/references/troubleshooting.md +656 -0
  19. package/assets/skills/ci-cd/scripts/ci_health.py +301 -0
  20. package/assets/skills/ci-cd/scripts/pipeline_analyzer.py +440 -0
  21. package/assets/skills/context-optimization/CONTRIBUTING.md +78 -0
  22. package/assets/skills/context-optimization/LICENSE +22 -0
  23. package/assets/skills/context-optimization/README.md +228 -0
  24. package/assets/skills/context-optimization/SKILL.md +104 -0
  25. package/assets/skills/context-optimization/docs/agentskills.md +1264 -0
  26. package/assets/skills/context-optimization/docs/blogs.md +1230 -0
  27. package/assets/skills/context-optimization/docs/claude_research.md +85 -0
  28. package/assets/skills/context-optimization/docs/compression.md +298 -0
  29. package/assets/skills/context-optimization/docs/gemini_research.md +22 -0
  30. package/assets/skills/context-optimization/docs/hncapsule.md +92 -0
  31. package/assets/skills/context-optimization/docs/netflix_context.md +10 -0
  32. package/assets/skills/context-optimization/docs/vercel_tool.md +140 -0
  33. package/assets/skills/context-optimization/examples/book-sft-pipeline/README.md +78 -0
  34. package/assets/skills/context-optimization/examples/book-sft-pipeline/SKILL.md +380 -0
  35. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/README.md +168 -0
  36. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/dataset_sample.jsonl +5 -0
  37. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/pangram/Screenshot 2025-12-27 at 3.05.04/342/200/257AM.png +0 -0
  38. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/pangram/Screenshot 2025-12-27 at 3.05.36/342/200/257AM.png +0 -0
  39. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/pangram/Screenshot 2025-12-27 at 3.07.18/342/200/257AM.png +0 -0
  40. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/sample_outputs.md +63 -0
  41. package/assets/skills/context-optimization/examples/book-sft-pipeline/examples/gertrude-stein/training_config.json +80 -0
  42. package/assets/skills/context-optimization/examples/book-sft-pipeline/references/segmentation-strategies.md +324 -0
  43. package/assets/skills/context-optimization/examples/book-sft-pipeline/references/tinker-format.md +211 -0
  44. package/assets/skills/context-optimization/examples/book-sft-pipeline/references/tinker.txt +3176 -0
  45. package/assets/skills/context-optimization/examples/book-sft-pipeline/scripts/pipeline_example.py +187 -0
  46. package/assets/skills/context-optimization/examples/digital-brain-skill/AGENT.md +35 -0
  47. package/assets/skills/context-optimization/examples/digital-brain-skill/HOW-SKILLS-BUILT-THIS.md +407 -0
  48. package/assets/skills/context-optimization/examples/digital-brain-skill/README.md +209 -0
  49. package/assets/skills/context-optimization/examples/digital-brain-skill/SKILL.md +203 -0
  50. package/assets/skills/context-optimization/examples/digital-brain-skill/SKILLS-MAPPING.md +219 -0
  51. package/assets/skills/context-optimization/examples/digital-brain-skill/agents/AGENTS.md +82 -0
  52. package/assets/skills/context-optimization/examples/digital-brain-skill/agents/scripts/content_ideas.py +132 -0
  53. package/assets/skills/context-optimization/examples/digital-brain-skill/agents/scripts/idea_to_draft.py +181 -0
  54. package/assets/skills/context-optimization/examples/digital-brain-skill/agents/scripts/stale_contacts.py +139 -0
  55. package/assets/skills/context-optimization/examples/digital-brain-skill/agents/scripts/weekly_review.py +121 -0
  56. package/assets/skills/context-optimization/examples/digital-brain-skill/content/CONTENT.md +88 -0
  57. package/assets/skills/context-optimization/examples/digital-brain-skill/content/calendar.md +108 -0
  58. package/assets/skills/context-optimization/examples/digital-brain-skill/content/engagement.jsonl +2 -0
  59. package/assets/skills/context-optimization/examples/digital-brain-skill/content/ideas.jsonl +2 -0
  60. package/assets/skills/context-optimization/examples/digital-brain-skill/content/posts.jsonl +2 -0
  61. package/assets/skills/context-optimization/examples/digital-brain-skill/content/templates/linkedin-post.md +102 -0
  62. package/assets/skills/context-optimization/examples/digital-brain-skill/content/templates/newsletter.md +92 -0
  63. package/assets/skills/context-optimization/examples/digital-brain-skill/content/templates/thread.md +73 -0
  64. package/assets/skills/context-optimization/examples/digital-brain-skill/examples/content-workflow.md +204 -0
  65. package/assets/skills/context-optimization/examples/digital-brain-skill/examples/meeting-prep.md +243 -0
  66. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/IDENTITY.md +46 -0
  67. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/bio-variants.md +101 -0
  68. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/brand.md +165 -0
  69. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/prompts/content-generation.xml +46 -0
  70. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/prompts/reply-generator.xml +40 -0
  71. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/values.yaml +60 -0
  72. package/assets/skills/context-optimization/examples/digital-brain-skill/identity/voice.md +165 -0
  73. package/assets/skills/context-optimization/examples/digital-brain-skill/knowledge/KNOWLEDGE.md +85 -0
  74. package/assets/skills/context-optimization/examples/digital-brain-skill/knowledge/bookmarks.jsonl +2 -0
  75. package/assets/skills/context-optimization/examples/digital-brain-skill/knowledge/competitors.md +117 -0
  76. package/assets/skills/context-optimization/examples/digital-brain-skill/knowledge/learning.yaml +74 -0
  77. package/assets/skills/context-optimization/examples/digital-brain-skill/knowledge/research/_template.md +79 -0
  78. package/assets/skills/context-optimization/examples/digital-brain-skill/network/NETWORK.md +110 -0
  79. package/assets/skills/context-optimization/examples/digital-brain-skill/network/circles.yaml +80 -0
  80. package/assets/skills/context-optimization/examples/digital-brain-skill/network/contacts.jsonl +2 -0
  81. package/assets/skills/context-optimization/examples/digital-brain-skill/network/interactions.jsonl +2 -0
  82. package/assets/skills/context-optimization/examples/digital-brain-skill/network/intros.md +92 -0
  83. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/OPERATIONS.md +75 -0
  84. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/goals.yaml +83 -0
  85. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/meetings.jsonl +2 -0
  86. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/metrics.jsonl +2 -0
  87. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/reviews/_weekly_template.md +114 -0
  88. package/assets/skills/context-optimization/examples/digital-brain-skill/operations/todos.md +76 -0
  89. package/assets/skills/context-optimization/examples/digital-brain-skill/package.json +41 -0
  90. package/assets/skills/context-optimization/examples/digital-brain-skill/references/file-formats.md +386 -0
  91. package/assets/skills/context-optimization/examples/digital-brain-skill/scripts/install.sh +79 -0
  92. package/assets/skills/context-optimization/examples/interleaved_thinking/README.md +620 -0
  93. package/assets/skills/context-optimization/examples/interleaved_thinking/SKILL.md +221 -0
  94. package/assets/skills/context-optimization/examples/interleaved_thinking/docs/agentthinking.md +63 -0
  95. package/assets/skills/context-optimization/examples/interleaved_thinking/docs/interleavedthinking.md +610 -0
  96. package/assets/skills/context-optimization/examples/interleaved_thinking/docs/m2-1.md +224 -0
  97. package/assets/skills/context-optimization/examples/interleaved_thinking/examples/01_basic_capture.py +76 -0
  98. package/assets/skills/context-optimization/examples/interleaved_thinking/examples/02_tool_usage.py +187 -0
  99. package/assets/skills/context-optimization/examples/interleaved_thinking/examples/03_full_optimization.py +1222 -0
  100. package/assets/skills/context-optimization/examples/interleaved_thinking/generated_skills/comprehensive-research-agent/SKILL.md +90 -0
  101. package/assets/skills/context-optimization/examples/interleaved_thinking/generated_skills/comprehensive-research-agent/references/optimization_summary.json +9 -0
  102. package/assets/skills/context-optimization/examples/interleaved_thinking/generated_skills/comprehensive-research-agent/references/optimized_prompt.txt +1 -0
  103. package/assets/skills/context-optimization/examples/interleaved_thinking/generated_skills/comprehensive-research-agent/references/patterns_found.json +205 -0
  104. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/final_prompt.txt +67 -0
  105. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_1/analysis.txt +48 -0
  106. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_1/optimization.txt +15 -0
  107. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_1/optimized_prompt.txt +1 -0
  108. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_1/trace.txt +178 -0
  109. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_10/analysis.txt +47 -0
  110. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_10/trace.txt +162 -0
  111. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_2/analysis.txt +48 -0
  112. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_2/optimization.txt +130 -0
  113. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_2/optimized_prompt.txt +72 -0
  114. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_2/trace.txt +156 -0
  115. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_3/analysis.txt +46 -0
  116. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_3/optimization.txt +147 -0
  117. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_3/optimized_prompt.txt +84 -0
  118. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_3/trace.txt +159 -0
  119. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_4/analysis.txt +46 -0
  120. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_4/optimization.txt +134 -0
  121. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_4/optimized_prompt.txt +67 -0
  122. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_4/trace.txt +165 -0
  123. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_5/analysis.txt +50 -0
  124. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_5/optimization.txt +135 -0
  125. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_5/optimized_prompt.txt +71 -0
  126. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_5/trace.txt +146 -0
  127. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_6/analysis.txt +15 -0
  128. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_6/optimization.txt +15 -0
  129. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_6/optimized_prompt.txt +1 -0
  130. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_6/trace.txt +147 -0
  131. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_7/analysis.txt +46 -0
  132. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_7/optimization.txt +103 -0
  133. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_7/optimized_prompt.txt +45 -0
  134. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_7/trace.txt +134 -0
  135. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_8/analysis.txt +47 -0
  136. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_8/optimization.txt +114 -0
  137. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_8/optimized_prompt.txt +60 -0
  138. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_8/trace.txt +135 -0
  139. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_9/analysis.txt +44 -0
  140. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_9/optimization.txt +106 -0
  141. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_9/optimized_prompt.txt +51 -0
  142. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/iteration_9/trace.txt +170 -0
  143. package/assets/skills/context-optimization/examples/interleaved_thinking/optimization_artifacts/summary.json +11 -0
  144. package/assets/skills/context-optimization/examples/interleaved_thinking/pyproject.toml +70 -0
  145. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/__init__.py +53 -0
  146. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/analyzer.py +465 -0
  147. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/capture.py +417 -0
  148. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/cli.py +271 -0
  149. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/loop.py +468 -0
  150. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/models.py +193 -0
  151. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/optimizer.py +449 -0
  152. package/assets/skills/context-optimization/examples/interleaved_thinking/reasoning_trace_optimizer/skill_generator.py +502 -0
  153. package/assets/skills/context-optimization/examples/interleaved_thinking/tests/__init__.py +1 -0
  154. package/assets/skills/context-optimization/examples/interleaved_thinking/tests/test_models.py +144 -0
  155. package/assets/skills/context-optimization/examples/llm-as-judge-skills/.prettierrc +8 -0
  156. package/assets/skills/context-optimization/examples/llm-as-judge-skills/CONTRIBUTING.md +78 -0
  157. package/assets/skills/context-optimization/examples/llm-as-judge-skills/LICENSE +21 -0
  158. package/assets/skills/context-optimization/examples/llm-as-judge-skills/README.md +659 -0
  159. package/assets/skills/context-optimization/examples/llm-as-judge-skills/agents/evaluator-agent/evaluator-agent.md +177 -0
  160. package/assets/skills/context-optimization/examples/llm-as-judge-skills/agents/index.md +114 -0
  161. package/assets/skills/context-optimization/examples/llm-as-judge-skills/agents/orchestrator-agent/orchestrator-agent.md +205 -0
  162. package/assets/skills/context-optimization/examples/llm-as-judge-skills/agents/research-agent/research-agent.md +183 -0
  163. package/assets/skills/context-optimization/examples/llm-as-judge-skills/env.example +6 -0
  164. package/assets/skills/context-optimization/examples/llm-as-judge-skills/eslint.config.js +18 -0
  165. package/assets/skills/context-optimization/examples/llm-as-judge-skills/examples/basic-evaluation.ts +89 -0
  166. package/assets/skills/context-optimization/examples/llm-as-judge-skills/examples/full-evaluation-workflow.ts +136 -0
  167. package/assets/skills/context-optimization/examples/llm-as-judge-skills/examples/generate-rubric.ts +67 -0
  168. package/assets/skills/context-optimization/examples/llm-as-judge-skills/examples/pairwise-comparison.ts +97 -0
  169. package/assets/skills/context-optimization/examples/llm-as-judge-skills/package.json +79 -0
  170. package/assets/skills/context-optimization/examples/llm-as-judge-skills/prompts/agent-system/orchestrator-prompt.md +197 -0
  171. package/assets/skills/context-optimization/examples/llm-as-judge-skills/prompts/evaluation/direct-scoring-prompt.md +153 -0
  172. package/assets/skills/context-optimization/examples/llm-as-judge-skills/prompts/evaluation/pairwise-comparison-prompt.md +200 -0
  173. package/assets/skills/context-optimization/examples/llm-as-judge-skills/prompts/index.md +138 -0
  174. package/assets/skills/context-optimization/examples/llm-as-judge-skills/prompts/research/research-synthesis-prompt.md +171 -0
  175. package/assets/skills/context-optimization/examples/llm-as-judge-skills/skills/context-fundamentals/context-fundamentals.md +114 -0
  176. package/assets/skills/context-optimization/examples/llm-as-judge-skills/skills/index.md +79 -0
  177. package/assets/skills/context-optimization/examples/llm-as-judge-skills/skills/llm-evaluator/llm-evaluator.md +77 -0
  178. package/assets/skills/context-optimization/examples/llm-as-judge-skills/skills/tool-design/tool-design.md +198 -0
  179. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/agents/evaluator.ts +112 -0
  180. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/agents/index.ts +3 -0
  181. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/config/index.ts +18 -0
  182. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/index.ts +19 -0
  183. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/tools/evaluation/direct-score.ts +164 -0
  184. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/tools/evaluation/generate-rubric.ts +161 -0
  185. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/tools/evaluation/index.ts +9 -0
  186. package/assets/skills/context-optimization/examples/llm-as-judge-skills/src/tools/evaluation/pairwise-compare.ts +255 -0
  187. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tests/evaluation.test.ts +233 -0
  188. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tests/setup.ts +27 -0
  189. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tests/skills.test.ts +213 -0
  190. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/evaluation/direct-score.md +159 -0
  191. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/evaluation/generate-rubric.md +189 -0
  192. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/evaluation/pairwise-compare.md +182 -0
  193. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/index.md +141 -0
  194. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/orchestration/delegate-to-agent.md +171 -0
  195. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/research/read-url.md +162 -0
  196. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tools/research/web-search.md +128 -0
  197. package/assets/skills/context-optimization/examples/llm-as-judge-skills/tsconfig.json +26 -0
  198. package/assets/skills/context-optimization/examples/llm-as-judge-skills/vitest.config.ts +20 -0
  199. package/assets/skills/context-optimization/examples/x-to-book-system/PRD.md +644 -0
  200. package/assets/skills/context-optimization/examples/x-to-book-system/README.md +181 -0
  201. package/assets/skills/context-optimization/examples/x-to-book-system/SKILLS-MAPPING.md +187 -0
  202. package/assets/skills/context-optimization/researcher/example_output.md +75 -0
  203. package/assets/skills/context-optimization/researcher/llm-as-a-judge.md +362 -0
  204. package/assets/skills/context-optimization/skills/advanced-evaluation/SKILL.md +454 -0
  205. package/assets/skills/context-optimization/skills/advanced-evaluation/references/bias-mitigation.md +288 -0
  206. package/assets/skills/context-optimization/skills/advanced-evaluation/references/implementation-patterns.md +315 -0
  207. package/assets/skills/context-optimization/skills/advanced-evaluation/references/metrics-guide.md +331 -0
  208. package/assets/skills/context-optimization/skills/advanced-evaluation/scripts/evaluation_example.py +337 -0
  209. package/assets/skills/context-optimization/skills/bdi-mental-states/SKILL.md +295 -0
  210. package/assets/skills/context-optimization/skills/bdi-mental-states/references/bdi-ontology-core.md +207 -0
  211. package/assets/skills/context-optimization/skills/bdi-mental-states/references/framework-integration.md +582 -0
  212. package/assets/skills/context-optimization/skills/bdi-mental-states/references/rdf-examples.md +315 -0
  213. package/assets/skills/context-optimization/skills/bdi-mental-states/references/sparql-competency.md +420 -0
  214. package/assets/skills/context-optimization/skills/context-compression/SKILL.md +265 -0
  215. package/assets/skills/context-optimization/skills/context-compression/references/evaluation-framework.md +213 -0
  216. package/assets/skills/context-optimization/skills/context-compression/scripts/compression_evaluator.py +658 -0
  217. package/assets/skills/context-optimization/skills/context-degradation/SKILL.md +231 -0
  218. package/assets/skills/context-optimization/skills/context-degradation/references/patterns.md +314 -0
  219. package/assets/skills/context-optimization/skills/context-degradation/scripts/degradation_detector.py +419 -0
  220. package/assets/skills/context-optimization/skills/context-fundamentals/SKILL.md +185 -0
  221. package/assets/skills/context-optimization/skills/context-fundamentals/references/context-components.md +283 -0
  222. package/assets/skills/context-optimization/skills/context-fundamentals/scripts/context_manager.py +370 -0
  223. package/assets/skills/context-optimization/skills/context-optimization/SKILL.md +179 -0
  224. package/assets/skills/context-optimization/skills/context-optimization/references/optimization_techniques.md +272 -0
  225. package/assets/skills/context-optimization/skills/context-optimization/scripts/compaction.py +379 -0
  226. package/assets/skills/context-optimization/skills/evaluation/SKILL.md +231 -0
  227. package/assets/skills/context-optimization/skills/evaluation/references/metrics.md +339 -0
  228. package/assets/skills/context-optimization/skills/evaluation/scripts/evaluator.py +474 -0
  229. package/assets/skills/context-optimization/skills/filesystem-context/SKILL.md +321 -0
  230. package/assets/skills/context-optimization/skills/filesystem-context/references/implementation-patterns.md +549 -0
  231. package/assets/skills/context-optimization/skills/filesystem-context/scripts/filesystem_context.py +353 -0
  232. package/assets/skills/context-optimization/skills/hosted-agents/SKILL.md +279 -0
  233. package/assets/skills/context-optimization/skills/hosted-agents/references/infrastructure-patterns.md +700 -0
  234. package/assets/skills/context-optimization/skills/hosted-agents/scripts/sandbox_manager.py +495 -0
  235. package/assets/skills/context-optimization/skills/memory-systems/SKILL.md +221 -0
  236. package/assets/skills/context-optimization/skills/memory-systems/references/implementation.md +458 -0
  237. package/assets/skills/context-optimization/skills/memory-systems/scripts/memory_store.py +396 -0
  238. package/assets/skills/context-optimization/skills/multi-agent-patterns/SKILL.md +255 -0
  239. package/assets/skills/context-optimization/skills/multi-agent-patterns/references/frameworks.md +433 -0
  240. package/assets/skills/context-optimization/skills/multi-agent-patterns/scripts/coordination.py +439 -0
  241. package/assets/skills/context-optimization/skills/project-development/SKILL.md +342 -0
  242. package/assets/skills/context-optimization/skills/project-development/references/case-studies.md +388 -0
  243. package/assets/skills/context-optimization/skills/project-development/references/pipeline-patterns.md +610 -0
  244. package/assets/skills/context-optimization/skills/project-development/scripts/pipeline_template.py +677 -0
  245. package/assets/skills/context-optimization/skills/tool-design/SKILL.md +311 -0
  246. package/assets/skills/context-optimization/skills/tool-design/references/architectural_reduction.md +210 -0
  247. package/assets/skills/context-optimization/skills/tool-design/references/best_practices.md +176 -0
  248. package/assets/skills/context-optimization/skills/tool-design/scripts/description_generator.py +237 -0
  249. package/assets/skills/context-optimization/template/SKILL.md +98 -0
  250. package/assets/skills/dremio-analytics/SKILL.md +287 -0
  251. package/assets/skills/elevenlabs-voice/SKILL.md +269 -0
  252. package/assets/skills/git-workflow/SKILL.md +266 -0
  253. package/assets/skills/gitops-workflows/.claude-plugin/plugin.json +8 -0
  254. package/assets/skills/gitops-workflows/SKILL.md +568 -0
  255. package/assets/skills/gitops-workflows/assets/applicationsets/cluster-generator.yaml +32 -0
  256. package/assets/skills/gitops-workflows/assets/argocd/install-argocd-3.x.yaml +92 -0
  257. package/assets/skills/gitops-workflows/assets/flux/flux-bootstrap-github.sh +49 -0
  258. package/assets/skills/gitops-workflows/assets/flux/oci-helmrelease.yaml +38 -0
  259. package/assets/skills/gitops-workflows/assets/progressive-delivery/argo-rollouts-canary.yaml +62 -0
  260. package/assets/skills/gitops-workflows/assets/secrets/sops-age-config.yaml +33 -0
  261. package/assets/skills/gitops-workflows/references/argocd_vs_flux.md +243 -0
  262. package/assets/skills/gitops-workflows/references/best_practices.md +160 -0
  263. package/assets/skills/gitops-workflows/references/multi_cluster.md +80 -0
  264. package/assets/skills/gitops-workflows/references/oci_artifacts.md +290 -0
  265. package/assets/skills/gitops-workflows/references/progressive_delivery.md +94 -0
  266. package/assets/skills/gitops-workflows/references/repo_patterns.md +184 -0
  267. package/assets/skills/gitops-workflows/references/secret_management.md +213 -0
  268. package/assets/skills/gitops-workflows/references/troubleshooting.md +134 -0
  269. package/assets/skills/gitops-workflows/scripts/applicationset_generator.py +156 -0
  270. package/assets/skills/gitops-workflows/scripts/check_argocd_health.py +275 -0
  271. package/assets/skills/gitops-workflows/scripts/check_flux_health.py +418 -0
  272. package/assets/skills/gitops-workflows/scripts/oci_artifact_checker.py +150 -0
  273. package/assets/skills/gitops-workflows/scripts/promotion_validator.py +88 -0
  274. package/assets/skills/gitops-workflows/scripts/secret_audit.py +178 -0
  275. package/assets/skills/gitops-workflows/scripts/sync_drift_detector.py +144 -0
  276. package/assets/skills/gitops-workflows/scripts/validate_gitops_repo.py +299 -0
  277. package/assets/skills/iac-terraform/.claude-plugin/plugin.json +8 -0
  278. package/assets/skills/iac-terraform/SKILL.md +653 -0
  279. package/assets/skills/iac-terraform/assets/templates/MODULE_TEMPLATE.md +386 -0
  280. package/assets/skills/iac-terraform/assets/workflows/github-actions-terraform.yml +224 -0
  281. package/assets/skills/iac-terraform/assets/workflows/github-actions-terragrunt.yml +236 -0
  282. package/assets/skills/iac-terraform/assets/workflows/gitlab-ci-terraform.yml +184 -0
  283. package/assets/skills/iac-terraform/references/best_practices.md +709 -0
  284. package/assets/skills/iac-terraform/references/cost_optimization.md +665 -0
  285. package/assets/skills/iac-terraform/references/troubleshooting.md +635 -0
  286. package/assets/skills/iac-terraform/scripts/init_module.py +319 -0
  287. package/assets/skills/iac-terraform/scripts/inspect_state.py +232 -0
  288. package/assets/skills/iac-terraform/scripts/validate_module.py +227 -0
  289. package/assets/skills/k8s-troubleshooter/.claude-plugin/plugin.json +8 -0
  290. package/assets/skills/k8s-troubleshooter/SKILL.md +336 -0
  291. package/assets/skills/k8s-troubleshooter/references/common_issues.md +582 -0
  292. package/assets/skills/k8s-troubleshooter/references/helm_troubleshooting.md +708 -0
  293. package/assets/skills/k8s-troubleshooter/references/incident_response.md +466 -0
  294. package/assets/skills/k8s-troubleshooter/references/performance_troubleshooting.md +687 -0
  295. package/assets/skills/k8s-troubleshooter/scripts/check_namespace.py +500 -0
  296. package/assets/skills/k8s-troubleshooter/scripts/cluster_health.py +223 -0
  297. package/assets/skills/k8s-troubleshooter/scripts/diagnose_pod.py +157 -0
  298. package/assets/skills/mattermost-notify/SKILL.md +248 -0
  299. package/assets/skills/monitoring-observability/SKILL.md +869 -0
  300. package/assets/skills/monitoring-observability/assets/templates/otel-config/collector-config.yaml +227 -0
  301. package/assets/skills/monitoring-observability/assets/templates/prometheus-alerts/kubernetes-alerts.yml +293 -0
  302. package/assets/skills/monitoring-observability/assets/templates/prometheus-alerts/webapp-alerts.yml +243 -0
  303. package/assets/skills/monitoring-observability/assets/templates/runbooks/incident-runbook-template.md +409 -0
  304. package/assets/skills/monitoring-observability/monitoring-observability.skill +0 -0
  305. package/assets/skills/monitoring-observability/references/alerting_best_practices.md +609 -0
  306. package/assets/skills/monitoring-observability/references/datadog_migration.md +649 -0
  307. package/assets/skills/monitoring-observability/references/dql_promql_translation.md +756 -0
  308. package/assets/skills/monitoring-observability/references/logging_guide.md +775 -0
  309. package/assets/skills/monitoring-observability/references/metrics_design.md +406 -0
  310. package/assets/skills/monitoring-observability/references/slo_sla_guide.md +652 -0
  311. package/assets/skills/monitoring-observability/references/tool_comparison.md +697 -0
  312. package/assets/skills/monitoring-observability/references/tracing_guide.md +663 -0
  313. package/assets/skills/monitoring-observability/scripts/alert_quality_checker.py +315 -0
  314. package/assets/skills/monitoring-observability/scripts/analyze_metrics.py +279 -0
  315. package/assets/skills/monitoring-observability/scripts/dashboard_generator.py +395 -0
  316. package/assets/skills/monitoring-observability/scripts/datadog_cost_analyzer.py +477 -0
  317. package/assets/skills/monitoring-observability/scripts/health_check_validator.py +297 -0
  318. package/assets/skills/monitoring-observability/scripts/log_analyzer.py +321 -0
  319. package/assets/skills/monitoring-observability/scripts/slo_calculator.py +365 -0
  320. package/assets/skills/neo4j-graph-rag/SKILL.md +258 -0
  321. package/assets/skills/pagerduty-ops/SKILL.md +380 -0
  322. package/assets/skills/playwright/API_REFERENCE.md +653 -0
  323. package/assets/skills/playwright/SKILL.md +453 -0
  324. package/assets/skills/playwright/lib/helpers.js +441 -0
  325. package/assets/skills/playwright/package.json +26 -0
  326. package/assets/skills/playwright/run.js +228 -0
  327. package/assets/skills/project-memory/README.md +687 -0
  328. package/assets/skills/project-memory/SKILL.md +298 -0
  329. package/assets/skills/project-memory/references/bugs_template.md +41 -0
  330. package/assets/skills/project-memory/references/decisions_template.md +92 -0
  331. package/assets/skills/project-memory/references/issues_template.md +76 -0
  332. package/assets/skills/project-memory/references/key_facts_template.md +158 -0
  333. package/assets/skills/recruit-workflow/SKILL.md +276 -0
  334. package/assets/skills/recruit-workflow/references/email-templates.md +347 -0
  335. package/assets/skills/recruit-workflow/references/workflow-stages.md +395 -0
  336. package/assets/skills/recruit-workflow/scripts/clay_client.py +188 -0
  337. package/assets/skills/recruit-workflow/scripts/lever_client.py +197 -0
  338. package/assets/skills/recruit-workflow/scripts/mailgun_client.py +245 -0
  339. package/assets/skills/recruit-workflow/scripts/minio_client.py +426 -0
  340. package/assets/skills/shakudo-microservice/SKILL.md +215 -0
  341. package/assets/skills/tmux/SKILL.md +631 -0
  342. package/assets/skills/tmux/references/direct-socket-control.md +108 -0
  343. package/assets/skills/tmux/references/session-lifecycle.md +503 -0
  344. package/assets/skills/tmux/references/session-registry.md +1484 -0
  345. package/assets/skills/tmux/tools/cleanup-sessions.sh +263 -0
  346. package/assets/skills/tmux/tools/create-session.sh +224 -0
  347. package/assets/skills/tmux/tools/find-sessions.sh +262 -0
  348. package/assets/skills/tmux/tools/kill-session.sh +308 -0
  349. package/assets/skills/tmux/tools/lib/registry.sh +437 -0
  350. package/assets/skills/tmux/tools/lib/time_utils.sh +54 -0
  351. package/assets/skills/tmux/tools/list-sessions.sh +255 -0
  352. package/assets/skills/tmux/tools/pane-health.sh +424 -0
  353. package/assets/skills/tmux/tools/safe-send.sh +503 -0
  354. package/assets/skills/tmux/tools/wait-for-text.sh +260 -0
  355. package/assets/skills/twilio-sms/SKILL.md +508 -0
  356. package/assets/skills/zellij/SKILL.md +274 -0
  357. package/assets/skills/zellij/references/actions.md +558 -0
  358. package/assets/skills/zellij/references/layouts.md +424 -0
  359. package/bin/cli.ts +46 -0
  360. package/package.json +43 -0
  361. package/src/alias.ts +108 -0
  362. package/src/backup.ts +51 -0
  363. package/src/config.ts +115 -0
  364. package/src/dependencies.ts +163 -0
  365. package/src/errors.ts +77 -0
  366. package/src/index.ts +207 -0
  367. package/src/prompts.ts +142 -0
  368. package/src/schemas.ts +21 -0
  369. package/src/skills.ts +45 -0
  370. package/src/speckit.ts +116 -0
  371. package/src/types.ts +106 -0
  372. package/src/utils.ts +110 -0
  373. package/src/vibe-git.ts +50 -0
  374. package/templates/.specify/memory/constitution.md +109 -0
  375. package/templates/.specify/scripts/bash/check-prerequisites.sh +262 -0
  376. package/templates/.specify/scripts/bash/common.sh +670 -0
  377. package/templates/.specify/scripts/bash/create-new-feature.sh +594 -0
  378. package/templates/.specify/scripts/bash/create-worktree-feature.sh +401 -0
  379. package/templates/.specify/scripts/bash/init-workspace.sh +433 -0
  380. package/templates/.specify/scripts/bash/list-spec-worktrees.sh +198 -0
  381. package/templates/.specify/scripts/bash/setup-plan.sh +105 -0
  382. package/templates/.specify/scripts/bash/test-workspace-rollup.sh +175 -0
  383. package/templates/.specify/scripts/bash/update-agent-context.sh +799 -0
  384. package/templates/.specify/templates/agent-file-template.md +28 -0
  385. package/templates/.specify/templates/checklist-template.md +40 -0
  386. package/templates/.specify/templates/commands/analyze.md +197 -0
  387. package/templates/.specify/templates/commands/checklist.md +306 -0
  388. package/templates/.specify/templates/commands/clarify.md +194 -0
  389. package/templates/.specify/templates/commands/constitution.md +97 -0
  390. package/templates/.specify/templates/commands/implement.md +149 -0
  391. package/templates/.specify/templates/commands/plan.md +123 -0
  392. package/templates/.specify/templates/commands/projects.md +48 -0
  393. package/templates/.specify/templates/commands/rollup.md +66 -0
  394. package/templates/.specify/templates/commands/specify.md +275 -0
  395. package/templates/.specify/templates/commands/specs.md +71 -0
  396. package/templates/.specify/templates/commands/tasks.md +151 -0
  397. package/templates/.specify/templates/commands/taskstoissues.md +35 -0
  398. package/templates/.specify/templates/commands/workspace.md +128 -0
  399. package/templates/.specify/templates/plan-template.md +104 -0
  400. package/templates/.specify/templates/spec-template.md +115 -0
  401. package/templates/.specify/templates/tasks-template.md +251 -0
  402. package/templates/.specify/templates/workspace.yaml +110 -0
  403. package/templates/.specify/workspace.yaml +95 -0
  404. package/templates/AGENTS.md +460 -0
  405. package/templates/oh-my-opencode.json +27 -0
  406. package/templates/opencode.json +383 -0
  407. package/templates/package.json +10 -0
  408. package/templates/project-memory/bugs.md +16 -0
  409. package/templates/project-memory/decisions.md +22 -0
  410. package/templates/project-memory/issues.md +15 -0
  411. package/templates/project-memory/key_facts.md +26 -0
@@ -0,0 +1,153 @@
1
+ # Direct Scoring Prompt
2
+
3
+ ## Purpose
4
+
5
+ System prompt for evaluating a single LLM response using direct scoring methodology.
6
+
7
+ ## Prompt Template
8
+
9
+ ```markdown
10
+ # Direct Scoring Evaluation
11
+
12
+ You are an expert evaluator assessing the quality of an AI-generated response.
13
+
14
+ ## Your Task
15
+
16
+ Evaluate the response below against the specified criteria. For each criterion:
17
+ 1. First, identify specific evidence from the response
18
+ 2. Then, determine the appropriate score based on the rubric
19
+ 3. Finally, provide actionable feedback
20
+
21
+ ## Important Guidelines
22
+
23
+ - Be objective and consistent
24
+ - Base scores on explicit evidence, not assumptions
25
+ - Consider the original task requirements
26
+ - Avoid length bias - a shorter, better answer outperforms a longer, weaker one
27
+ - When uncertain between two scores, explain your reasoning then choose
28
+
29
+ ## Original Prompt/Task
30
+
31
+ <task>
32
+ {{original_prompt}}
33
+ </task>
34
+
35
+ {{#if context}}
36
+ ## Additional Context
37
+
38
+ <context>
39
+ {{context}}
40
+ </context>
41
+ {{/if}}
42
+
43
+ ## Response to Evaluate
44
+
45
+ <response>
46
+ {{response}}
47
+ </response>
48
+
49
+ ## Evaluation Criteria
50
+
51
+ {{#each criteria}}
52
+ ### {{name}} (Weight: {{weight}})
53
+ {{description}}
54
+
55
+ {{#if rubric}}
56
+ **Rubric:**
57
+ {{#each rubric}}
58
+ - **{{score}}**: {{description}}
59
+ {{/each}}
60
+ {{/if}}
61
+ {{/each}}
62
+
63
+ ## Your Evaluation
64
+
65
+ For each criterion, provide:
66
+ 1. **Evidence**: Specific quotes or observations from the response
67
+ 2. **Score**: Your score according to the rubric
68
+ 3. **Justification**: Why this score is appropriate
69
+ 4. **Improvement**: Specific suggestion for improvement
70
+
71
+ Then provide:
72
+ - **Overall Assessment**: Summary of quality
73
+ - **Key Strengths**: What the response does well
74
+ - **Key Weaknesses**: What needs improvement
75
+ - **Priority Improvements**: Most impactful changes
76
+
77
+ Format your response as structured JSON:
78
+ ```json
79
+ {
80
+ "scores": [
81
+ {
82
+ "criterion": "{{name}}",
83
+ "evidence": ["quote1", "quote2"],
84
+ "score": {{score}},
85
+ "maxScore": {{maxScore}},
86
+ "justification": "...",
87
+ "improvement": "..."
88
+ }
89
+ ],
90
+ "overallScore": {{score}},
91
+ "summary": {
92
+ "assessment": "...",
93
+ "strengths": ["...", "..."],
94
+ "weaknesses": ["...", "..."],
95
+ "priorities": ["...", "..."]
96
+ }
97
+ }
98
+ ```
99
+ ```
100
+
101
+ ## Variables
102
+
103
+ | Variable | Description | Required |
104
+ |----------|-------------|----------|
105
+ | original_prompt | The prompt that generated the response | Yes |
106
+ | context | Additional context (RAG docs, history) | No |
107
+ | response | The response being evaluated | Yes |
108
+ | criteria | Array of evaluation criteria | Yes |
109
+ | criteria.name | Criterion name | Yes |
110
+ | criteria.weight | Criterion weight | Yes |
111
+ | criteria.description | What criterion measures | Yes |
112
+ | criteria.rubric | Score level descriptions | No |
113
+
114
+ ## Example Usage
115
+
116
+ ### Input
117
+ ```json
118
+ {
119
+ "original_prompt": "Explain quantum entanglement to a high school student",
120
+ "response": "Quantum entanglement is like having two magic coins...",
121
+ "criteria": [
122
+ {
123
+ "name": "Accuracy",
124
+ "weight": 0.4,
125
+ "description": "Scientific correctness of the explanation",
126
+ "rubric": [
127
+ { "score": 1, "description": "Fundamentally incorrect" },
128
+ { "score": 3, "description": "Mostly correct with some errors" },
129
+ { "score": 5, "description": "Completely accurate" }
130
+ ]
131
+ },
132
+ {
133
+ "name": "Accessibility",
134
+ "weight": 0.3,
135
+ "description": "Understandable for a high school student"
136
+ },
137
+ {
138
+ "name": "Engagement",
139
+ "weight": 0.3,
140
+ "description": "Interesting and memorable"
141
+ }
142
+ ]
143
+ }
144
+ ```
145
+
146
+ ## Best Practices
147
+
148
+ 1. **Evidence First**: Always gather evidence before scoring
149
+ 2. **Rubric Alignment**: Stick to rubric definitions, don't interpolate
150
+ 3. **Constructive Feedback**: Make improvement suggestions actionable
151
+ 4. **Consistency**: Apply same standards across evaluations
152
+ 5. **Calibration**: Use example evaluations for reference
153
+
@@ -0,0 +1,200 @@
1
+ # Pairwise Comparison Prompt
2
+
3
+ ## Purpose
4
+
5
+ System prompt for comparing two LLM responses and selecting the better one.
6
+
7
+ ## Prompt Template
8
+
9
+ ```markdown
10
+ # Pairwise Comparison Evaluation
11
+
12
+ You are an expert evaluator comparing two AI-generated responses to the same prompt.
13
+
14
+ ## Your Task
15
+
16
+ Compare Response A and Response B, then determine which better satisfies the requirements. You must:
17
+ 1. Analyze each response independently first
18
+ 2. Compare them directly on each criterion
19
+ 3. Make a final determination with confidence level
20
+
21
+ ## Important Guidelines
22
+
23
+ - Evaluate content quality, not superficial differences
24
+ - Do NOT prefer responses simply because they are longer
25
+ - Do NOT prefer responses based on their position (A vs B)
26
+ - Focus on the specified criteria
27
+ - Ties are acceptable when responses are genuinely equivalent
28
+ - Explain your reasoning before stating the winner
29
+
30
+ ## Original Prompt/Task
31
+
32
+ <task>
33
+ {{original_prompt}}
34
+ </task>
35
+
36
+ {{#if context}}
37
+ ## Additional Context
38
+
39
+ <context>
40
+ {{context}}
41
+ </context>
42
+ {{/if}}
43
+
44
+ ## Response A
45
+
46
+ <response_a>
47
+ {{response_a}}
48
+ </response_a>
49
+
50
+ ## Response B
51
+
52
+ <response_b>
53
+ {{response_b}}
54
+ </response_b>
55
+
56
+ ## Comparison Criteria
57
+
58
+ {{#each criteria}}
59
+ - **{{this}}**
60
+ {{/each}}
61
+
62
+ ## Your Evaluation
63
+
64
+ ### Step 1: Independent Analysis
65
+
66
+ First, briefly analyze each response:
67
+
68
+ **Response A Analysis:**
69
+ - Key strengths:
70
+ - Key weaknesses:
71
+ - Notable features:
72
+
73
+ **Response B Analysis:**
74
+ - Key strengths:
75
+ - Key weaknesses:
76
+ - Notable features:
77
+
78
+ ### Step 2: Head-to-Head Comparison
79
+
80
+ For each criterion, compare the responses:
81
+
82
+ {{#each criteria}}
83
+ **{{this}}:**
84
+ - Response A: [assessment]
85
+ - Response B: [assessment]
86
+ - Winner for this criterion: [A / B / TIE]
87
+ {{/each}}
88
+
89
+ ### Step 3: Final Determination
90
+
91
+ Based on your analysis:
92
+ - **Winner**: [A / B / TIE]
93
+ - **Confidence**: [0.0-1.0]
94
+ - **Reasoning**: [Why this response is better overall]
95
+ - **Key Differentiators**: [What most strongly distinguishes the winner]
96
+
97
+ Format your response as structured JSON:
98
+ ```json
99
+ {
100
+ "analysis": {
101
+ "responseA": {
102
+ "strengths": ["...", "..."],
103
+ "weaknesses": ["...", "..."]
104
+ },
105
+ "responseB": {
106
+ "strengths": ["...", "..."],
107
+ "weaknesses": ["...", "..."]
108
+ }
109
+ },
110
+ "comparison": [
111
+ {
112
+ "criterion": "{{criterion}}",
113
+ "aAssessment": "...",
114
+ "bAssessment": "...",
115
+ "winner": "A" | "B" | "TIE",
116
+ "reasoning": "..."
117
+ }
118
+ ],
119
+ "result": {
120
+ "winner": "A" | "B" | "TIE",
121
+ "confidence": 0.85,
122
+ "reasoning": "...",
123
+ "differentiators": ["...", "..."]
124
+ }
125
+ }
126
+ ```
127
+ ```
128
+
129
+ ## Variables
130
+
131
+ | Variable | Description | Required |
132
+ |----------|-------------|----------|
133
+ | original_prompt | The prompt both responses address | Yes |
134
+ | context | Additional context | No |
135
+ | response_a | First response | Yes |
136
+ | response_b | Second response | Yes |
137
+ | criteria | List of comparison criteria | Yes |
138
+
139
+ ## Position Bias Mitigation
140
+
141
+ When using this prompt in production, implement position swapping:
142
+
143
+ ```typescript
144
+ async function compareWithPositionSwap(a: string, b: string, criteria: string[]) {
145
+ // First evaluation: A first, B second
146
+ const eval1 = await evaluate({
147
+ response_a: a,
148
+ response_b: b,
149
+ criteria
150
+ });
151
+
152
+ // Second evaluation: B first, A second
153
+ const eval2 = await evaluate({
154
+ response_a: b,
155
+ response_b: a,
156
+ criteria
157
+ });
158
+
159
+ // Map eval2 result back (swap winner)
160
+ const eval2Winner = eval2.winner === "A" ? "B" : eval2.winner === "B" ? "A" : "TIE";
161
+
162
+ // Check consistency
163
+ if (eval1.winner === eval2Winner) {
164
+ return {
165
+ winner: eval1.winner,
166
+ confidence: (eval1.confidence + eval2.confidence) / 2,
167
+ consistent: true
168
+ };
169
+ } else {
170
+ // Inconsistent - likely close, return TIE or lower confidence
171
+ return {
172
+ winner: "TIE",
173
+ confidence: 0.5,
174
+ consistent: false,
175
+ note: "Evaluation inconsistent across positions"
176
+ };
177
+ }
178
+ }
179
+ ```
180
+
181
+ ## Example Usage
182
+
183
+ ### Input
184
+ ```json
185
+ {
186
+ "original_prompt": "Explain the benefits of regular exercise",
187
+ "response_a": "Regular exercise offers numerous benefits including improved cardiovascular health, stronger muscles, better mental health, and increased energy levels. Studies show that even 30 minutes of moderate exercise daily can significantly reduce the risk of heart disease.",
188
+ "response_b": "Working out is great for you. It helps your heart, makes you stronger, and improves your mood. You should try to exercise most days of the week.",
189
+ "criteria": ["accuracy", "specificity", "actionability", "engagement"]
190
+ }
191
+ ```
192
+
193
+ ## Best Practices
194
+
195
+ 1. **Independent First**: Analyze each response before comparing
196
+ 2. **Criterion by Criterion**: Don't jump to overall conclusion
197
+ 3. **Justify Before Decide**: Explain reasoning before stating winner
198
+ 4. **Acknowledge Tradeoffs**: Note when responses excel in different areas
199
+ 5. **Calibrate Confidence**: Higher confidence only when difference is clear
200
+
@@ -0,0 +1,138 @@
1
+ # Prompts Index
2
+
3
+ Prompts are reusable templates that define how agents and tools interact with LLMs.
4
+
5
+ ## Prompt Categories
6
+
7
+ ### Evaluation Prompts
8
+ **Path**: `prompts/evaluation/`
9
+
10
+ Templates for quality assessment tasks.
11
+
12
+ | Prompt | Purpose | Used By |
13
+ |--------|---------|---------|
14
+ | `direct-scoring-prompt` | Evaluate single response | Evaluator Agent, directScore tool |
15
+ | `pairwise-comparison-prompt` | Compare two responses | Evaluator Agent, pairwiseCompare tool |
16
+
17
+ ---
18
+
19
+ ### Research Prompts
20
+ **Path**: `prompts/research/`
21
+
22
+ Templates for information gathering and synthesis.
23
+
24
+ | Prompt | Purpose | Used By |
25
+ |--------|---------|---------|
26
+ | `research-synthesis-prompt` | Synthesize findings | Research Agent |
27
+
28
+ ---
29
+
30
+ ### Agent System Prompts
31
+ **Path**: `prompts/agent-system/`
32
+
33
+ System prompts for agent definitions.
34
+
35
+ | Prompt | Purpose | Used By |
36
+ |--------|---------|---------|
37
+ | `orchestrator-prompt` | Multi-agent coordination | Orchestrator Agent |
38
+
39
+ ## Prompt Template Format
40
+
41
+ ### Standard Structure
42
+
43
+ ```markdown
44
+ # Prompt Name
45
+
46
+ ## Purpose
47
+ Brief description of what this prompt accomplishes.
48
+
49
+ ## Prompt Template
50
+ ```markdown
51
+ [The actual prompt with {{variables}}]
52
+ ```
53
+
54
+ ## Variables
55
+ | Variable | Description | Required |
56
+ |----------|-------------|----------|
57
+ | var_name | What it contains | Yes/No |
58
+
59
+ ## Example Usage
60
+ Concrete example showing inputs and expected outputs.
61
+
62
+ ## Best Practices
63
+ Guidelines for using this prompt effectively.
64
+ ```
65
+
66
+ ### Variable Syntax
67
+
68
+ Use Handlebars-style templating:
69
+
70
+ ```markdown
71
+ {{variable}} # Simple substitution
72
+ {{#if condition}}...{{/if}} # Conditional section
73
+ {{#each array}}...{{/each}} # Iteration
74
+ ```
75
+
76
+ ## Prompt Design Principles
77
+
78
+ ### 1. Clear Role Definition
79
+ Tell the model exactly what it is and what it's doing.
80
+
81
+ ```markdown
82
+ You are an expert evaluator assessing the quality of AI-generated responses.
83
+ ```
84
+
85
+ ### 2. Explicit Instructions
86
+ Don't assume the model will infer requirements.
87
+
88
+ ```markdown
89
+ For each criterion:
90
+ 1. First, identify specific evidence from the response
91
+ 2. Then, determine the appropriate score based on the rubric
92
+ 3. Finally, provide actionable feedback
93
+ ```
94
+
95
+ ### 3. Structured Output
96
+ Specify the exact format you need.
97
+
98
+ ```markdown
99
+ Format your response as structured JSON:
100
+ ```json
101
+ {
102
+ "scores": [...],
103
+ "summary": {...}
104
+ }
105
+ ```
106
+ ```
107
+
108
+ ### 4. Guard Rails
109
+ Include constraints and warnings.
110
+
111
+ ```markdown
112
+ Important Guidelines:
113
+ - Do NOT prefer responses simply because they are longer
114
+ - Do NOT prefer responses based on their position (A vs B)
115
+ - Focus on the specified criteria
116
+ ```
117
+
118
+ ## Adding New Prompts
119
+
120
+ 1. Determine category or create new: `prompts/<category>/`
121
+ 2. Create prompt file: `prompts/<category>/<prompt-name>.md`
122
+ 3. Include:
123
+ - Purpose
124
+ - Template with variables
125
+ - Variable documentation
126
+ - Example usage
127
+ - Best practices
128
+ 4. Update this index
129
+
130
+ ## Prompt Testing Checklist
131
+
132
+ - [ ] Variables render correctly
133
+ - [ ] Output format is parseable
134
+ - [ ] Edge cases are handled
135
+ - [ ] Instructions are unambiguous
136
+ - [ ] Examples match expected output
137
+ - [ ] Constraints are clear
138
+
@@ -0,0 +1,171 @@
1
+ # Research Synthesis Prompt
2
+
3
+ ## Purpose
4
+
5
+ System prompt for synthesizing research findings from multiple sources into a coherent summary.
6
+
7
+ ## Prompt Template
8
+
9
+ ```markdown
10
+ # Research Synthesis
11
+
12
+ You are a research analyst synthesizing findings from multiple sources into a coherent summary.
13
+
14
+ ## Your Task
15
+
16
+ Review the provided research findings and create a comprehensive synthesis that:
17
+ 1. Identifies key themes and patterns across sources
18
+ 2. Notes areas of consensus and disagreement
19
+ 3. Highlights the most significant findings
20
+ 4. Provides actionable insights
21
+ 5. Maintains proper attribution
22
+
23
+ ## Synthesis Guidelines
24
+
25
+ - Prioritize information quality over quantity
26
+ - Distinguish between facts, claims, and opinions
27
+ - Note the recency and authority of sources
28
+ - Identify gaps in the available information
29
+ - Be explicit about uncertainty
30
+
31
+ ## Research Question
32
+
33
+ <question>
34
+ {{research_question}}
35
+ </question>
36
+
37
+ ## Gathered Findings
38
+
39
+ {{#each findings}}
40
+ ### Source {{@index}}: {{source}}
41
+ **Date**: {{date}}
42
+ **Type**: {{type}}
43
+
44
+ <content>
45
+ {{content}}
46
+ </content>
47
+
48
+ {{/each}}
49
+
50
+ ## Your Synthesis
51
+
52
+ Produce a synthesis that includes:
53
+
54
+ ### Executive Summary
55
+ A 2-3 sentence overview of the key findings.
56
+
57
+ ### Key Themes
58
+ Major themes that emerge across sources.
59
+
60
+ ### Findings by Topic
61
+ Organize findings into logical sections based on the research question.
62
+
63
+ ### Areas of Consensus
64
+ What do multiple sources agree on?
65
+
66
+ ### Areas of Disagreement
67
+ Where do sources conflict or differ?
68
+
69
+ ### Gaps and Limitations
70
+ What questions remain unanswered? What are the limitations of available information?
71
+
72
+ ### Actionable Insights
73
+ What practical conclusions can be drawn?
74
+
75
+ ### Source Quality Assessment
76
+ Brief assessment of source reliability and relevance.
77
+
78
+ Format as markdown with proper citations:
79
+ - Use inline citations: "Finding text" [Source Name, Date]
80
+ - Include a references section at the end
81
+ ```
82
+
83
+ ## Variables
84
+
85
+ | Variable | Description | Required |
86
+ |----------|-------------|----------|
87
+ | research_question | The question being researched | Yes |
88
+ | findings | Array of research findings | Yes |
89
+ | findings.source | Source name/URL | Yes |
90
+ | findings.date | Publication date | Yes |
91
+ | findings.type | Source type (article, paper, etc.) | Yes |
92
+ | findings.content | Extracted content | Yes |
93
+
94
+ ## Example Usage
95
+
96
+ ### Input
97
+ ```json
98
+ {
99
+ "research_question": "What are the best practices for implementing LLM-as-a-Judge evaluation?",
100
+ "findings": [
101
+ {
102
+ "source": "Eugene Yan - LLM Evaluators",
103
+ "date": "2024-06",
104
+ "type": "blog",
105
+ "content": "Key considerations include choosing between direct scoring and pairwise comparison, selecting appropriate metrics..."
106
+ },
107
+ {
108
+ "source": "MT-Bench Paper (arXiv)",
109
+ "date": "2023-12",
110
+ "type": "paper",
111
+ "content": "GPT-4 as judge achieves 80%+ agreement with human experts when position bias is controlled..."
112
+ }
113
+ ]
114
+ }
115
+ ```
116
+
117
+ ### Expected Output Structure
118
+ ```markdown
119
+ ## Executive Summary
120
+
121
+ LLM-as-a-Judge evaluation has emerged as a scalable alternative to human annotation...
122
+
123
+ ## Key Themes
124
+
125
+ 1. **Scoring Methodology Selection**
126
+ - Direct scoring for objective criteria
127
+ - Pairwise comparison for subjective preferences
128
+
129
+ 2. **Bias Mitigation**
130
+ - Position bias is a significant concern [MT-Bench, 2023]
131
+ - Swapping positions and averaging addresses this [Eugene Yan, 2024]
132
+
133
+ ...
134
+
135
+ ## References
136
+
137
+ 1. Eugene Yan. "Evaluating the Effectiveness of LLM-Evaluators." June 2024. https://eugeneyan.com/...
138
+ 2. Zheng et al. "Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena." arXiv, December 2023.
139
+ ```
140
+
141
+ ## Citation Styles
142
+
143
+ ### Inline (default)
144
+ ```
145
+ "Finding or claim" [Author/Source, Date]
146
+ ```
147
+
148
+ ### Footnote
149
+ ```
150
+ "Finding or claim"[1]
151
+
152
+ ---
153
+ [1] Author/Source, Date, URL
154
+ ```
155
+
156
+ ### Endnote
157
+ ```
158
+ "Finding or claim" (see Sources: Source Name)
159
+
160
+ ## Sources
161
+ - Source Name: Full citation
162
+ ```
163
+
164
+ ## Best Practices
165
+
166
+ 1. **Theme Extraction**: Look for patterns across 3+ sources
167
+ 2. **Weight by Quality**: Academic sources > blogs for factual claims
168
+ 3. **Recency Matters**: Note when findings may be outdated
169
+ 4. **Acknowledge Gaps**: Don't overstate what sources support
170
+ 5. **Actionable Output**: End with practical takeaways
171
+