@synsci/cli-darwin-x64 1.1.49

Files changed (373)
  1. package/bin/skills/accelerate/SKILL.md +332 -0
  2. package/bin/skills/accelerate/references/custom-plugins.md +453 -0
  3. package/bin/skills/accelerate/references/megatron-integration.md +489 -0
  4. package/bin/skills/accelerate/references/performance.md +525 -0
  5. package/bin/skills/audiocraft/SKILL.md +564 -0
  6. package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
  7. package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
  8. package/bin/skills/autogpt/SKILL.md +403 -0
  9. package/bin/skills/autogpt/references/advanced-usage.md +535 -0
  10. package/bin/skills/autogpt/references/troubleshooting.md +420 -0
  11. package/bin/skills/awq/SKILL.md +310 -0
  12. package/bin/skills/awq/references/advanced-usage.md +324 -0
  13. package/bin/skills/awq/references/troubleshooting.md +344 -0
  14. package/bin/skills/axolotl/SKILL.md +158 -0
  15. package/bin/skills/axolotl/references/api.md +5548 -0
  16. package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
  17. package/bin/skills/axolotl/references/index.md +15 -0
  18. package/bin/skills/axolotl/references/other.md +3563 -0
  19. package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
  20. package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
  21. package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
  22. package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
  23. package/bin/skills/bitsandbytes/SKILL.md +411 -0
  24. package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
  25. package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
  26. package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
  27. package/bin/skills/blip-2/SKILL.md +564 -0
  28. package/bin/skills/blip-2/references/advanced-usage.md +680 -0
  29. package/bin/skills/blip-2/references/troubleshooting.md +526 -0
  30. package/bin/skills/chroma/SKILL.md +406 -0
  31. package/bin/skills/chroma/references/integration.md +38 -0
  32. package/bin/skills/clip/SKILL.md +253 -0
  33. package/bin/skills/clip/references/applications.md +207 -0
  34. package/bin/skills/constitutional-ai/SKILL.md +290 -0
  35. package/bin/skills/crewai/SKILL.md +498 -0
  36. package/bin/skills/crewai/references/flows.md +438 -0
  37. package/bin/skills/crewai/references/tools.md +429 -0
  38. package/bin/skills/crewai/references/troubleshooting.md +480 -0
  39. package/bin/skills/deepspeed/SKILL.md +141 -0
  40. package/bin/skills/deepspeed/references/08.md +17 -0
  41. package/bin/skills/deepspeed/references/09.md +173 -0
  42. package/bin/skills/deepspeed/references/2020.md +378 -0
  43. package/bin/skills/deepspeed/references/2023.md +279 -0
  44. package/bin/skills/deepspeed/references/assets.md +179 -0
  45. package/bin/skills/deepspeed/references/index.md +35 -0
  46. package/bin/skills/deepspeed/references/mii.md +118 -0
  47. package/bin/skills/deepspeed/references/other.md +1191 -0
  48. package/bin/skills/deepspeed/references/tutorials.md +6554 -0
  49. package/bin/skills/dspy/SKILL.md +590 -0
  50. package/bin/skills/dspy/references/examples.md +663 -0
  51. package/bin/skills/dspy/references/modules.md +475 -0
  52. package/bin/skills/dspy/references/optimizers.md +566 -0
  53. package/bin/skills/faiss/SKILL.md +221 -0
  54. package/bin/skills/faiss/references/index_types.md +280 -0
  55. package/bin/skills/flash-attention/SKILL.md +367 -0
  56. package/bin/skills/flash-attention/references/benchmarks.md +215 -0
  57. package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
  58. package/bin/skills/gguf/SKILL.md +427 -0
  59. package/bin/skills/gguf/references/advanced-usage.md +504 -0
  60. package/bin/skills/gguf/references/troubleshooting.md +442 -0
  61. package/bin/skills/gptq/SKILL.md +450 -0
  62. package/bin/skills/gptq/references/calibration.md +337 -0
  63. package/bin/skills/gptq/references/integration.md +129 -0
  64. package/bin/skills/gptq/references/troubleshooting.md +95 -0
  65. package/bin/skills/grpo-rl-training/README.md +97 -0
  66. package/bin/skills/grpo-rl-training/SKILL.md +572 -0
  67. package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
  68. package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
  69. package/bin/skills/guidance/SKILL.md +572 -0
  70. package/bin/skills/guidance/references/backends.md +554 -0
  71. package/bin/skills/guidance/references/constraints.md +674 -0
  72. package/bin/skills/guidance/references/examples.md +767 -0
  73. package/bin/skills/hqq/SKILL.md +445 -0
  74. package/bin/skills/hqq/references/advanced-usage.md +528 -0
  75. package/bin/skills/hqq/references/troubleshooting.md +503 -0
  76. package/bin/skills/hugging-face-cli/SKILL.md +191 -0
  77. package/bin/skills/hugging-face-cli/references/commands.md +954 -0
  78. package/bin/skills/hugging-face-cli/references/examples.md +374 -0
  79. package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
  80. package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
  81. package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
  82. package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
  83. package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
  84. package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
  85. package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
  86. package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
  87. package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
  88. package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
  89. package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
  90. package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
  91. package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
  92. package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
  93. package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
  94. package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
  95. package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
  96. package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
  97. package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
  98. package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
  99. package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
  100. package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
  101. package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
  102. package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
  103. package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
  104. package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
  105. package/bin/skills/hugging-face-jobs/index.html +216 -0
  106. package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
  107. package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
  108. package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
  109. package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
  110. package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
  111. package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
  112. package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
  113. package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
  114. package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
  115. package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
  116. package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
  117. package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
  118. package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
  119. package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
  120. package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
  121. package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
  122. package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
  123. package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
  124. package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
  125. package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
  126. package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
  127. package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
  128. package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
  129. package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
  130. package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
  131. package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
  132. package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
  133. package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
  134. package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
  135. package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
  136. package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
  137. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
  138. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
  139. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
  140. package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
  141. package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
  142. package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
  143. package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
  144. package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
  145. package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
  146. package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
  147. package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
  148. package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
  149. package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
  150. package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
  151. package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
  152. package/bin/skills/instructor/SKILL.md +740 -0
  153. package/bin/skills/instructor/references/examples.md +107 -0
  154. package/bin/skills/instructor/references/providers.md +70 -0
  155. package/bin/skills/instructor/references/validation.md +606 -0
  156. package/bin/skills/knowledge-distillation/SKILL.md +458 -0
  157. package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
  158. package/bin/skills/lambda-labs/SKILL.md +545 -0
  159. package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
  160. package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
  161. package/bin/skills/langchain/SKILL.md +480 -0
  162. package/bin/skills/langchain/references/agents.md +499 -0
  163. package/bin/skills/langchain/references/integration.md +562 -0
  164. package/bin/skills/langchain/references/rag.md +600 -0
  165. package/bin/skills/langsmith/SKILL.md +422 -0
  166. package/bin/skills/langsmith/references/advanced-usage.md +548 -0
  167. package/bin/skills/langsmith/references/troubleshooting.md +537 -0
  168. package/bin/skills/litgpt/SKILL.md +469 -0
  169. package/bin/skills/litgpt/references/custom-models.md +568 -0
  170. package/bin/skills/litgpt/references/distributed-training.md +451 -0
  171. package/bin/skills/litgpt/references/supported-models.md +336 -0
  172. package/bin/skills/litgpt/references/training-recipes.md +619 -0
  173. package/bin/skills/llama-cpp/SKILL.md +258 -0
  174. package/bin/skills/llama-cpp/references/optimization.md +89 -0
  175. package/bin/skills/llama-cpp/references/quantization.md +213 -0
  176. package/bin/skills/llama-cpp/references/server.md +125 -0
  177. package/bin/skills/llama-factory/SKILL.md +80 -0
  178. package/bin/skills/llama-factory/references/_images.md +23 -0
  179. package/bin/skills/llama-factory/references/advanced.md +1055 -0
  180. package/bin/skills/llama-factory/references/getting_started.md +349 -0
  181. package/bin/skills/llama-factory/references/index.md +19 -0
  182. package/bin/skills/llama-factory/references/other.md +31 -0
  183. package/bin/skills/llamaguard/SKILL.md +337 -0
  184. package/bin/skills/llamaindex/SKILL.md +569 -0
  185. package/bin/skills/llamaindex/references/agents.md +83 -0
  186. package/bin/skills/llamaindex/references/data_connectors.md +108 -0
  187. package/bin/skills/llamaindex/references/query_engines.md +406 -0
  188. package/bin/skills/llava/SKILL.md +304 -0
  189. package/bin/skills/llava/references/training.md +197 -0
  190. package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
  191. package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
  192. package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
  193. package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
  194. package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
  195. package/bin/skills/long-context/SKILL.md +536 -0
  196. package/bin/skills/long-context/references/extension_methods.md +468 -0
  197. package/bin/skills/long-context/references/fine_tuning.md +611 -0
  198. package/bin/skills/long-context/references/rope.md +402 -0
  199. package/bin/skills/mamba/SKILL.md +260 -0
  200. package/bin/skills/mamba/references/architecture-details.md +206 -0
  201. package/bin/skills/mamba/references/benchmarks.md +255 -0
  202. package/bin/skills/mamba/references/training-guide.md +388 -0
  203. package/bin/skills/megatron-core/SKILL.md +366 -0
  204. package/bin/skills/megatron-core/references/benchmarks.md +249 -0
  205. package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
  206. package/bin/skills/megatron-core/references/production-examples.md +473 -0
  207. package/bin/skills/megatron-core/references/training-recipes.md +547 -0
  208. package/bin/skills/miles/SKILL.md +315 -0
  209. package/bin/skills/miles/references/api-reference.md +141 -0
  210. package/bin/skills/miles/references/troubleshooting.md +352 -0
  211. package/bin/skills/mlflow/SKILL.md +704 -0
  212. package/bin/skills/mlflow/references/deployment.md +744 -0
  213. package/bin/skills/mlflow/references/model-registry.md +770 -0
  214. package/bin/skills/mlflow/references/tracking.md +680 -0
  215. package/bin/skills/modal/SKILL.md +341 -0
  216. package/bin/skills/modal/references/advanced-usage.md +503 -0
  217. package/bin/skills/modal/references/troubleshooting.md +494 -0
  218. package/bin/skills/model-merging/SKILL.md +539 -0
  219. package/bin/skills/model-merging/references/evaluation.md +462 -0
  220. package/bin/skills/model-merging/references/examples.md +428 -0
  221. package/bin/skills/model-merging/references/methods.md +352 -0
  222. package/bin/skills/model-pruning/SKILL.md +495 -0
  223. package/bin/skills/model-pruning/references/wanda.md +347 -0
  224. package/bin/skills/moe-training/SKILL.md +526 -0
  225. package/bin/skills/moe-training/references/architectures.md +432 -0
  226. package/bin/skills/moe-training/references/inference.md +348 -0
  227. package/bin/skills/moe-training/references/training.md +425 -0
  228. package/bin/skills/nanogpt/SKILL.md +290 -0
  229. package/bin/skills/nanogpt/references/architecture.md +382 -0
  230. package/bin/skills/nanogpt/references/data.md +476 -0
  231. package/bin/skills/nanogpt/references/training.md +564 -0
  232. package/bin/skills/nemo-curator/SKILL.md +383 -0
  233. package/bin/skills/nemo-curator/references/deduplication.md +87 -0
  234. package/bin/skills/nemo-curator/references/filtering.md +102 -0
  235. package/bin/skills/nemo-evaluator/SKILL.md +494 -0
  236. package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
  237. package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
  238. package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
  239. package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
  240. package/bin/skills/nemo-guardrails/SKILL.md +297 -0
  241. package/bin/skills/nnsight/SKILL.md +436 -0
  242. package/bin/skills/nnsight/references/README.md +78 -0
  243. package/bin/skills/nnsight/references/api.md +344 -0
  244. package/bin/skills/nnsight/references/tutorials.md +300 -0
  245. package/bin/skills/openrlhf/SKILL.md +249 -0
  246. package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
  247. package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
  248. package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
  249. package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
  250. package/bin/skills/outlines/SKILL.md +652 -0
  251. package/bin/skills/outlines/references/backends.md +615 -0
  252. package/bin/skills/outlines/references/examples.md +773 -0
  253. package/bin/skills/outlines/references/json_generation.md +652 -0
  254. package/bin/skills/peft/SKILL.md +431 -0
  255. package/bin/skills/peft/references/advanced-usage.md +514 -0
  256. package/bin/skills/peft/references/troubleshooting.md +480 -0
  257. package/bin/skills/phoenix/SKILL.md +475 -0
  258. package/bin/skills/phoenix/references/advanced-usage.md +619 -0
  259. package/bin/skills/phoenix/references/troubleshooting.md +538 -0
  260. package/bin/skills/pinecone/SKILL.md +358 -0
  261. package/bin/skills/pinecone/references/deployment.md +181 -0
  262. package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
  263. package/bin/skills/pytorch-fsdp/references/index.md +7 -0
  264. package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
  265. package/bin/skills/pytorch-lightning/SKILL.md +346 -0
  266. package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
  267. package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
  268. package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
  269. package/bin/skills/pyvene/SKILL.md +473 -0
  270. package/bin/skills/pyvene/references/README.md +73 -0
  271. package/bin/skills/pyvene/references/api.md +383 -0
  272. package/bin/skills/pyvene/references/tutorials.md +376 -0
  273. package/bin/skills/qdrant/SKILL.md +493 -0
  274. package/bin/skills/qdrant/references/advanced-usage.md +648 -0
  275. package/bin/skills/qdrant/references/troubleshooting.md +631 -0
  276. package/bin/skills/ray-data/SKILL.md +326 -0
  277. package/bin/skills/ray-data/references/integration.md +82 -0
  278. package/bin/skills/ray-data/references/transformations.md +83 -0
  279. package/bin/skills/ray-train/SKILL.md +406 -0
  280. package/bin/skills/ray-train/references/multi-node.md +628 -0
  281. package/bin/skills/rwkv/SKILL.md +260 -0
  282. package/bin/skills/rwkv/references/architecture-details.md +344 -0
  283. package/bin/skills/rwkv/references/rwkv7.md +386 -0
  284. package/bin/skills/rwkv/references/state-management.md +369 -0
  285. package/bin/skills/saelens/SKILL.md +386 -0
  286. package/bin/skills/saelens/references/README.md +70 -0
  287. package/bin/skills/saelens/references/api.md +333 -0
  288. package/bin/skills/saelens/references/tutorials.md +318 -0
  289. package/bin/skills/segment-anything/SKILL.md +500 -0
  290. package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
  291. package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
  292. package/bin/skills/sentence-transformers/SKILL.md +255 -0
  293. package/bin/skills/sentence-transformers/references/models.md +123 -0
  294. package/bin/skills/sentencepiece/SKILL.md +235 -0
  295. package/bin/skills/sentencepiece/references/algorithms.md +200 -0
  296. package/bin/skills/sentencepiece/references/training.md +304 -0
  297. package/bin/skills/sglang/SKILL.md +442 -0
  298. package/bin/skills/sglang/references/deployment.md +490 -0
  299. package/bin/skills/sglang/references/radix-attention.md +413 -0
  300. package/bin/skills/sglang/references/structured-generation.md +541 -0
  301. package/bin/skills/simpo/SKILL.md +219 -0
  302. package/bin/skills/simpo/references/datasets.md +478 -0
  303. package/bin/skills/simpo/references/hyperparameters.md +452 -0
  304. package/bin/skills/simpo/references/loss-functions.md +350 -0
  305. package/bin/skills/skypilot/SKILL.md +509 -0
  306. package/bin/skills/skypilot/references/advanced-usage.md +491 -0
  307. package/bin/skills/skypilot/references/troubleshooting.md +570 -0
  308. package/bin/skills/slime/SKILL.md +464 -0
  309. package/bin/skills/slime/references/api-reference.md +392 -0
  310. package/bin/skills/slime/references/troubleshooting.md +386 -0
  311. package/bin/skills/speculative-decoding/SKILL.md +467 -0
  312. package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
  313. package/bin/skills/speculative-decoding/references/medusa.md +350 -0
  314. package/bin/skills/stable-diffusion/SKILL.md +519 -0
  315. package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
  316. package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
  317. package/bin/skills/tensorboard/SKILL.md +629 -0
  318. package/bin/skills/tensorboard/references/integrations.md +638 -0
  319. package/bin/skills/tensorboard/references/profiling.md +545 -0
  320. package/bin/skills/tensorboard/references/visualization.md +620 -0
  321. package/bin/skills/tensorrt-llm/SKILL.md +187 -0
  322. package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
  323. package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
  324. package/bin/skills/tensorrt-llm/references/serving.md +470 -0
  325. package/bin/skills/tinker/SKILL.md +362 -0
  326. package/bin/skills/tinker/references/api-reference.md +168 -0
  327. package/bin/skills/tinker/references/getting-started.md +157 -0
  328. package/bin/skills/tinker/references/loss-functions.md +163 -0
  329. package/bin/skills/tinker/references/models-and-lora.md +139 -0
  330. package/bin/skills/tinker/references/recipes.md +280 -0
  331. package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
  332. package/bin/skills/tinker/references/rendering.md +243 -0
  333. package/bin/skills/tinker/references/supervised-learning.md +232 -0
  334. package/bin/skills/tinker-training-cost/SKILL.md +187 -0
  335. package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
  336. package/bin/skills/torchforge/SKILL.md +433 -0
  337. package/bin/skills/torchforge/references/api-reference.md +327 -0
  338. package/bin/skills/torchforge/references/troubleshooting.md +409 -0
  339. package/bin/skills/torchtitan/SKILL.md +358 -0
  340. package/bin/skills/torchtitan/references/checkpoint.md +181 -0
  341. package/bin/skills/torchtitan/references/custom-models.md +258 -0
  342. package/bin/skills/torchtitan/references/float8.md +133 -0
  343. package/bin/skills/torchtitan/references/fsdp.md +126 -0
  344. package/bin/skills/transformer-lens/SKILL.md +346 -0
  345. package/bin/skills/transformer-lens/references/README.md +54 -0
  346. package/bin/skills/transformer-lens/references/api.md +362 -0
  347. package/bin/skills/transformer-lens/references/tutorials.md +339 -0
  348. package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
  349. package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
  350. package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
  351. package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
  352. package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
  353. package/bin/skills/unsloth/SKILL.md +80 -0
  354. package/bin/skills/unsloth/references/index.md +7 -0
  355. package/bin/skills/unsloth/references/llms-full.md +16799 -0
  356. package/bin/skills/unsloth/references/llms-txt.md +12044 -0
  357. package/bin/skills/unsloth/references/llms.md +82 -0
  358. package/bin/skills/verl/SKILL.md +391 -0
  359. package/bin/skills/verl/references/api-reference.md +301 -0
  360. package/bin/skills/verl/references/troubleshooting.md +391 -0
  361. package/bin/skills/vllm/SKILL.md +364 -0
  362. package/bin/skills/vllm/references/optimization.md +226 -0
  363. package/bin/skills/vllm/references/quantization.md +284 -0
  364. package/bin/skills/vllm/references/server-deployment.md +255 -0
  365. package/bin/skills/vllm/references/troubleshooting.md +447 -0
  366. package/bin/skills/weights-and-biases/SKILL.md +590 -0
  367. package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
  368. package/bin/skills/weights-and-biases/references/integrations.md +700 -0
  369. package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
  370. package/bin/skills/whisper/SKILL.md +317 -0
  371. package/bin/skills/whisper/references/languages.md +189 -0
  372. package/bin/synsc +0 -0
  373. package/package.json +10 -0
@@ -0,0 +1,299 @@
+ ---
+ title: {{TITLE}}
+ authors: {{AUTHORS}}
+ date: {{DATE}}
+ arxiv:
+ layout: arxiv
+ ---
+
+ # {{TITLE}}
+
+ <div class="arxiv-header">
+
+ **{{AUTHORS}}**
+
+ *Submitted to arXiv: {{DATE}}*
+
+ </div>
+
+ ---
+
+ **Abstract**—{{ABSTRACT}}
+
+ **Index Terms**—Machine Learning, Deep Learning, Neural Networks
+
+ ---
+
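The header above relies on `{{TITLE}}`, `{{AUTHORS}}`, `{{DATE}}`, and `{{ABSTRACT}}` placeholders. The diff does not show how the package substitutes them, so the following is only a minimal sketch that assumes plain string substitution. The function name `fill_template` and the sample values are hypothetical.

```python
import re


def fill_template(template: str, values: dict) -> str:
    """Replace {{KEY}} placeholders with values; unknown keys are left as-is.

    Hypothetical helper for illustration, not the package's own mechanism.
    """
    def substitute(match):
        key = match.group(1)
        # Keep the original placeholder text when no value is supplied.
        return str(values.get(key, match.group(0)))

    return re.sub(r"\{\{([A-Z_]+)\}\}", substitute, template)


header = "title: {{TITLE}}\nauthors: {{AUTHORS}}\ndate: {{DATE}}"
filled = fill_template(header, {"TITLE": "Example Paper", "AUTHORS": "A. Author"})
print(filled)
```

Leaving unmatched placeholders untouched makes a missing field easy to spot in the rendered file.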
+ ## I. INTRODUCTION
+
+ **T**HIS paper presents [brief overview of the contribution]. The main contributions of this work are:
+
+ - Contribution 1: Description
+ - Contribution 2: Description
+ - Contribution 3: Description
+
+ The rest of this paper is organized as follows: Section II reviews related work, Section III describes the proposed methodology, Section IV presents experimental results, Section V discusses the findings and their limitations, and Section VI concludes the paper.
+
+ ## II. RELATED WORK
+
+ ### A. Subarea 1
+
+ Discussion of relevant prior work in subarea 1.
+
+ ### B. Subarea 2
+
+ Discussion of relevant prior work in subarea 2.
+
+ ### C. Comparison with Prior Art
+
+ Table comparing existing methods:
+
+ | Method | Year | Approach | Limitation |
+ |--------|------|----------|------------|
+ | Method A [1] | 2020 | Description | Issue |
+ | Method B [2] | 2021 | Description | Issue |
+ | Method C [3] | 2023 | Description | Issue |
+
+ ## III. METHODOLOGY
+
+ ### A. Problem Formulation
+
+ Let $X = \{x_1, x_2, \dots, x_n\}$ be the input space and $Y = \{y_1, y_2, \dots, y_m\}$ be the output space. We aim to learn a function $f: X \rightarrow Y$ that minimizes:
+
+ $$
+ \mathcal{L}(\theta) = \sum_{i=1}^{N} \ell(f(x_i; \theta), y_i) + \lambda R(\theta)
+ $$
+
+ where $\theta$ represents model parameters, $\ell$ is the loss function, $R(\theta)$ is a regularization term, $\lambda$ is the regularization weight, and $N$ is the number of training examples.
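As a concrete reading of the objective above, here is a minimal sketch in plain Python. The template leaves $f$, $\ell$, and $R$ unspecified, so the linear model, squared-error loss, and L2 regularizer used here are assumptions chosen only for illustration.

```python
def regularized_loss(theta, xs, ys, lam):
    """Compute sum_i l(f(x_i; theta), y_i) + lam * R(theta).

    Assumed for illustration: f is linear, l is squared error, R is the
    squared L2 norm of theta.
    """
    def f(x):
        # Linear model: dot product of parameters and input features.
        return sum(t * xj for t, xj in zip(theta, x))

    data_term = sum((f(x) - y) ** 2 for x, y in zip(xs, ys))
    reg_term = sum(t * t for t in theta)
    return data_term + lam * reg_term


xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
ys = [1.0, 2.0, 3.0]
# theta = [1, 2] fits the data exactly, so only the penalty term remains.
loss = regularized_loss([1.0, 2.0], xs, ys, lam=0.1)
print(loss)
```

With a perfect fit the data term is zero, which isolates the contribution of $\lambda R(\theta)$.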
+
+ ### B. Model Architecture
+
+ Describe the model architecture in detail.
+
+ **Input Layer**: Description
+
+ **Hidden Layers**: Let $h^{(l)}$ denote the activation of layer $l$:
+
+ $$
+ h^{(l)} = \sigma(W^{(l)}h^{(l-1)} + b^{(l)})
+ $$
+
+ where $\sigma$ is the activation function, $W^{(l)}$ is the weight matrix, and $b^{(l)}$ is the bias vector.
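The layer recurrence above can be sketched in a few lines of plain Python. The template does not fix $\sigma$, so ReLU is assumed here, and the weights and inputs are made-up values.

```python
def relu(values):
    """Element-wise ReLU, used here as an assumed choice of sigma."""
    return [max(0.0, v) for v in values]


def layer_forward(W, h_prev, b, activation=relu):
    """Compute h^(l) = sigma(W^(l) h^(l-1) + b^(l)) for one layer."""
    pre_activation = [
        sum(w * h for w, h in zip(row, h_prev)) + bias
        for row, bias in zip(W, b)
    ]
    return activation(pre_activation)


W = [[1.0, -1.0], [0.5, 0.5]]   # illustrative 2x2 weight matrix
b = [0.0, -1.0]                 # illustrative bias vector
h1 = layer_forward(W, [2.0, 3.0], b)
print(h1)
```

Stacking layers amounts to feeding each output back in as `h_prev` for the next call.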
+
+ **Output Layer**: Description
+
+ ### C. Training Algorithm
+
+ **Algorithm 1**: Training Procedure
+
+ ```
+ 1: Input: Training data D = {(xi, yi)}
+ 2: Initialize parameters θ
+ 3: for epoch = 1 to max_epochs do
+ 4:   for each mini-batch B ⊂ D do
+ 5:     Compute loss: L(θ) = 1/|B| Σ ℓ(f(xi; θ), yi)
+ 6:     Update: θ ← θ - η∇θL(θ)
+ 7:   end for
+ 8: end for
+ 9: Return: Trained parameters θ*
+ ```
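Algorithm 1 can be made runnable with a small amount of code. This sketch assumes a one-parameter model $f(x; \theta) = \theta x$ with squared-error loss so the gradient can be written by hand. The data, learning rate, and batch size are illustrative values, not ones taken from the template.

```python
def train(data, theta=0.0, lr=0.05, max_epochs=50, batch_size=2):
    """Mini-batch gradient descent following the steps of Algorithm 1.

    Assumed for illustration: f(x; theta) = theta * x and squared-error loss.
    """
    for _ in range(max_epochs):                      # step 3
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]   # step 4
            # Step 5, differentiated: gradient of the mean batch loss.
            grad = sum(2.0 * (theta * x - y) * x for x, y in batch) / len(batch)
            theta -= lr * grad                       # step 6
    return theta                                     # step 9


# Samples of y = 2x, so the optimum is theta = 2.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
theta_star = train(data)
print(theta_star)
```

In practice the batches would be shuffled each epoch and the gradient would come from automatic differentiation. Both are omitted here to keep the run deterministic.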
+
+ ### D. Complexity Analysis
+
+ **Time Complexity**: The training algorithm has time complexity $O(NTE)$ where $N$ is the dataset size, $T$ is the number of epochs, and $E$ is the per-example computation cost.
+
+ **Space Complexity**: The model requires $O(P)$ space where $P$ is the number of parameters.
+
+ ## IV. EXPERIMENTS
+
+ ### A. Experimental Setup
+
+ **Datasets**: We evaluate on the following benchmarks:
+
+ 1. **Dataset A**: Description (size, splits, characteristics)
+ 2. **Dataset B**: Description
+ 3. **Dataset C**: Description
+
+ **Baselines**: We compare against:
+
+ - Baseline 1 [4]: Description
+ - Baseline 2 [5]: Description
+ - Baseline 3 [6]: Description
+
+ **Evaluation Metrics**: Performance is measured using:
+
+ - Metric 1: Definition
+ - Metric 2: Definition
+ - Metric 3: Definition
+
+ **Implementation Details**: All experiments are conducted using:
+
+ - Framework: PyTorch 2.0
+ - Hardware: NVIDIA A100 GPUs
+ - Hyperparameters: Learning rate $\eta = 10^{-4}$, batch size $B = 32$, epochs $T = 100$
+
+ ### B. Quantitative Results
+
+ **TABLE I: MAIN RESULTS**
+
+ | Method | Dataset A | Dataset B | Dataset C | Average |
+ |--------|-----------|-----------|-----------|---------|
+ | Baseline 1 [4] | 82.3 | 78.5 | 80.1 | 80.3 |
+ | Baseline 2 [5] | 85.7 | 82.1 | 83.9 | 83.9 |
+ | Baseline 3 [6] | 88.1 | 85.3 | 86.7 | 86.7 |
+ | **Ours** | **91.2** | **88.9** | **90.1** | **90.1** |
+
+ Our method achieves state-of-the-art performance across all three benchmarks, with an average improvement of 3.4 percentage points over the previous best method (Baseline 3 [6]).
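When the placeholder numbers in Table I are replaced with real results, the Average column and the quoted improvement are easy to get out of sync. A short check like the following recomputes both from the per-dataset scores. It uses the template's placeholder values, and the dictionary layout is an assumption.

```python
# Per-dataset scores copied from Table I (placeholder values in the template).
results = {
    "Baseline 1": [82.3, 78.5, 80.1],
    "Baseline 2": [85.7, 82.1, 83.9],
    "Baseline 3": [88.1, 85.3, 86.7],
    "Ours": [91.2, 88.9, 90.1],
}

# Average column, rounded to one decimal place as in the table.
averages = {name: round(sum(s) / len(s), 1) for name, s in results.items()}

# Improvement of "Ours" over the strongest baseline average.
best_baseline = max(avg for name, avg in averages.items() if name != "Ours")
improvement = round(averages["Ours"] - best_baseline, 1)

print(averages)
print(improvement)
```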
+
+ ### C. Ablation Study
+
+ **TABLE II: ABLATION STUDY RESULTS**
+
+ | Configuration | Dataset A | Δ |
+ |---------------|-----------|---|
+ | Full Model | 91.2 | - |
+ | w/o Component A | 88.7 | -2.5 |
+ | w/o Component B | 89.4 | -1.8 |
+ | w/o Component C | 90.5 | -0.7 |
+
+ The ablation study demonstrates that all components contribute to the final performance, with Component A having the largest impact (a 2.5-point drop on Dataset A when removed).
+
+ ### D. Qualitative Analysis
+
+ **Fig. 1**: Visualization of learned representations using t-SNE projection.
+
+ **Fig. 2**: Example predictions showing correct classifications and failure cases.
+
+ ### E. Computational Efficiency
+
+ **TABLE III: COMPUTATIONAL REQUIREMENTS**
+
+ | Method | Parameters | FLOPs | Inference (ms) |
+ |--------|------------|-------|----------------|
+ | Baseline 1 [4] | 50M | 10G | 8.2 |
+ | Baseline 2 [5] | 100M | 25G | 15.7 |
+ | Baseline 3 [6] | 200M | 50G | 28.3 |
+ | **Ours** | **80M** | **18G** | **12.1** |
+
+ Our method achieves the highest accuracy in Table I while using fewer parameters, fewer FLOPs, and lower inference latency than Baselines 2 and 3; only Baseline 1 is cheaper.
+
+ ## V. DISCUSSION
+
+ ### A. Analysis of Results
+
+ The experimental results demonstrate that [analysis].
+
+ ### B. Limitations
+
+ Current limitations include:
+
+ 1. Limitation 1: Description
+ 2. Limitation 2: Description
+ 3. Limitation 3: Description
+
+ ### C. Broader Impact
+
+ Potential applications include:
+
+ - Application 1: Description
+ - Application 2: Description
+ - Application 3: Description
+
+ **Ethical Considerations**: [Discussion of potential risks and mitigation strategies]
+
+ ## VI. CONCLUSION
+
+ This paper presented {{TITLE}}, which achieves [main achievement]. The key contributions are:
+
+ 1. Contribution 1: Summary
+ 2. Contribution 2: Summary
+ 3. Contribution 3: Summary
+
+ Future work will focus on [future directions].
+
+ ## ACKNOWLEDGMENTS
+
+ The authors thank [acknowledgments]. This work was supported by [funding sources].
217
+
218
+ ## REFERENCES
219
+
220
+ [1] Author A et al., "Paper Title," *Conference Name*, 2020.
221
+
222
+ [2] Author B et al., "Paper Title," *Journal Name*, vol. X, no. Y, pp. Z-W, 2021.
223
+
224
+ [3] Author C et al., "Paper Title," *arXiv preprint arXiv:XXXX.XXXXX*, 2023.
225
+
226
+ [4] Author D et al., "Baseline 1 Paper," *Conference*, 2019.
227
+
228
+ [5] Author E et al., "Baseline 2 Paper," *Conference*, 2021.
229
+
230
+ [6] Author F et al., "Baseline 3 Paper," *Conference*, 2023.
231
+
232
+ ---
233
+
234
+ ## APPENDIX A: ADDITIONAL EXPERIMENTS
235
+
236
+ Supplementary experimental results.
237
+
238
+ ## APPENDIX B: PROOF OF THEOREM
239
+
240
+ **Theorem 1**: Statement of theorem.
241
+
242
+ **Proof**: Detailed proof.
243
+
244
+ ## APPENDIX C: HYPERPARAMETERS
245
+
246
+ Complete list of hyperparameters used in all experiments:
247
+
248
+ | Hyperparameter | Value | Description |
249
+ |----------------|-------|-------------|
250
+ | Learning rate | $10^{-4}$ | Initial learning rate |
251
+ | Batch size | 32 | Training batch size |
252
+ | Epochs | 100 | Number of training epochs |
253
+ | Optimizer | AdamW | Optimization algorithm |
254
+ | Weight decay | 0.01 | Decoupled weight decay coefficient (AdamW) |
255
+ | Warmup steps | 1000 | LR warmup duration |
256
+ | Dropout | 0.1 | Dropout probability |
257
+
258
+ ---
259
+
260
+ <style>
261
+ .arxiv-header {
262
+ text-align: center;
263
+ margin-bottom: 2em;
264
+ }
265
+
266
+ body {
267
+ font-family: 'Computer Modern', serif;
268
+ line-height: 1.6;
269
+ }
270
+
271
+ h1 {
272
+ text-align: center;
273
+ font-size: 1.8em;
274
+ margin-top: 1em;
275
+ }
276
+
277
+ h2 {
278
+ font-size: 1.3em;
279
+ margin-top: 1.5em;
280
+ font-weight: bold;
281
+ }
282
+
283
+ h3 {
284
+ font-size: 1.1em;
285
+ font-style: italic;
286
+ margin-top: 1em;
287
+ }
288
+
289
+ table {
290
+ margin: 1em auto;
291
+ border-collapse: collapse;
292
+ }
293
+
294
+ th, td {
295
+ border: 1px solid #000;
296
+ padding: 0.5em;
297
+ text-align: center;
298
+ }
299
+ </style>
@@ -0,0 +1,358 @@
1
+ ---
2
+ title: {{TITLE}}
3
+ authors: {{AUTHORS}}
4
+ date: {{DATE}}
5
+ type: ml-experiment-report
6
+ tags: [machine-learning, experiment-report]
7
+ ---
8
+
9
+ # {{TITLE}}
10
+
11
+ **Machine Learning Experiment Report**
12
+
13
+ **Researchers**: {{AUTHORS}}
14
+ **Date**: {{DATE}}
15
+ **Status**: Draft / Final / In Review
16
+
17
+ ---
18
+
19
+ ## Executive Summary
20
+
21
+ {{ABSTRACT}}
22
+
23
+ ### Key Findings
24
+ - Finding 1
25
+ - Finding 2
26
+ - Finding 3
27
+
28
+ ### Recommendations
29
+ - Recommendation 1
30
+ - Recommendation 2
31
+
32
+ ---
33
+
34
+ ## 1. Objective
35
+
36
+ ### 1.1 Research Question
37
+
38
+ What specific question are we trying to answer?
39
+
40
+ ### 1.2 Success Criteria
41
+
42
+ How will we measure success?
43
+
44
+ - **Metric 1**: Target value
45
+ - **Metric 2**: Target value
46
+ - **Metric 3**: Target value
47
+
48
+ ### 1.3 Constraints
49
+
50
+ - Computational budget
51
+ - Time constraints
52
+ - Data availability
53
+
54
+ ---
55
+
56
+ ## 2. Dataset
57
+
58
+ ### 2.1 Data Description
59
+
60
+ | Property | Value |
61
+ |----------|-------|
62
+ | **Name** | Dataset name |
63
+ | **Source** | Origin of data |
64
+ | **Size** | Number of examples |
65
+ | **Features** | Feature count and types |
66
+ | **Target** | What we're predicting |
67
+ | **License** | Usage rights |
68
+
69
+ ### 2.2 Data Splits
70
+
71
+ | Split | Size | Percentage |
72
+ |-------|------|------------|
73
+ | Train | X examples | Y% |
74
+ | Validation | X examples | Y% |
75
+ | Test | X examples | Y% |
76
+
77
+ ### 2.3 Data Quality
78
+
79
+ - **Missing Values**: Analysis and handling
80
+ - **Outliers**: Detection and treatment
81
+ - **Imbalance**: Class distribution
82
+ - **Preprocessing**: Transformations applied
83
+
84
+ ### 2.4 Exploratory Analysis
85
+
86
+ Key insights from data exploration:
87
+
88
+ 1. Pattern 1
89
+ 2. Pattern 2
90
+ 3. Pattern 3
91
+
92
+ ---
93
+
94
+ ## 3. Model
95
+
96
+ ### 3.1 Architecture
97
+
98
+ Describe the model architecture:
99
+
100
+ ```
101
+ Input → Layer 1 → Layer 2 → ... → Output
102
+ ```
103
+
104
+ ### 3.2 Model Specifications
105
+
106
+ | Component | Configuration |
107
+ |-----------|--------------|
108
+ | **Type** | Model family |
109
+ | **Parameters** | Total count |
110
+ | **Layers** | Number and types |
111
+ | **Activation** | Functions used |
112
+ | **Dropout** | Regularization rate |
113
+
114
+ ### 3.3 Baseline Models
115
+
116
+ What are we comparing against?
117
+
118
+ 1. **Baseline 1**: Simple baseline (e.g., majority class)
119
+ 2. **Baseline 2**: Standard approach (e.g., logistic regression)
120
+ 3. **Baseline 3**: Previous best method
121
+
122
+ ---
123
+
124
+ ## 4. Training
125
+
126
+ ### 4.1 Hyperparameters
127
+
128
+ | Hyperparameter | Value | Rationale |
129
+ |----------------|-------|-----------|
130
+ | Learning Rate | 1e-4 | Tuned via grid search |
131
+ | Batch Size | 32 | GPU memory constraint |
132
+ | Epochs | 100 | Based on validation |
133
+ | Optimizer | AdamW | Standard for transformers |
134
+ | Weight Decay | 0.01 | Regularization |
135
+ | LR Schedule | Cosine | Smooth convergence |
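The table names only "Cosine" as the schedule. One common form is sketched below, assuming one schedule step per epoch, no warmup, and decay to a minimum of zero; the function name `cosine_lr` and the `min_lr` parameter are illustrative assumptions.

```python
import math


def cosine_lr(step, total_steps, base_lr=1e-4, min_lr=0.0):
    """Cosine-annealed learning rate at `step`, decaying base_lr to min_lr."""
    progress = min(step, total_steps) / total_steps  # fraction of training done
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# One value per epoch boundary, using the 100 epochs and 1e-4 rate from the table.
schedule = [cosine_lr(epoch, total_steps=100) for epoch in range(101)]

print(schedule[0], schedule[50], schedule[100])
```

The rate starts at the base value, passes through half of it at the midpoint, and reaches the minimum at the final step. If the actual run used warmup or a nonzero floor, the report should say so in the Rationale column.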
136
+
137
+ ### 4.2 Training Process
138
+
139
+ ```python
+ # Training pseudocode: checkpoint whenever validation loss improves
+ best_loss = float("inf")
+ for epoch in range(num_epochs):
+     train_loss = train_one_epoch(model, train_loader)
+     val_loss = validate(model, val_loader)
+     if val_loss < best_loss:
+         best_loss = val_loss
+         save_checkpoint(model)
+ ```
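For a version that runs end to end, the sketch below fills in the helper names from the loop above with toy stand-ins: a one-parameter linear model stored in a dict, plain gradient descent, and an in-memory copy in place of `save_checkpoint`. The data, learning rate, and epoch count are illustrative assumptions, not recommended settings.

```python
import copy


def train_one_epoch(model, train_data, lr):
    """One pass of per-example gradient descent on squared error for y = w * x."""
    total = 0.0
    for x, y in train_data:
        error = model["w"] * x - y
        model["w"] -= lr * 2 * error * x  # gradient of error**2 w.r.t. w
        total += error ** 2
    return total / len(train_data)


def validate(model, val_data):
    """Mean squared error on held-out data; does not modify the model."""
    return sum((model["w"] * x - y) ** 2 for x, y in val_data) / len(val_data)


# Toy data generated from y = 2x (hypothetical, for illustration only).
train_data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
val_data = [(4.0, 8.0), (5.0, 10.0)]

model = {"w": 0.0}
best_loss = float("inf")
best_checkpoint = None

for epoch in range(20):
    train_loss = train_one_epoch(model, train_data, lr=0.05)
    val_loss = validate(model, val_data)
    if val_loss < best_loss:
        best_loss = val_loss
        best_checkpoint = copy.deepcopy(model)  # stand-in for save_checkpoint

print(f"best val loss: {best_loss:.3e}, w = {best_checkpoint['w']:.4f}")
```

Keeping the checkpoint tied to the best validation loss, rather than to the last epoch, means the saved model is the one the reported validation numbers refer to.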
147
+
148
+ ### 4.3 Computational Resources
149
+
150
+ | Resource | Specification |
151
+ |----------|--------------|
152
+ | **Hardware** | GPU model and count |
153
+ | **Memory** | RAM and VRAM |
154
+ | **Training Time** | Hours/days |
155
+ | **Cost** | Estimated compute cost |
156
+
157
+ ### 4.4 Training Curves
158
+
159
+ Include plots of:
160
+ - Training loss over time
161
+ - Validation loss over time
162
+ - Learning rate schedule
163
+ - Other relevant metrics
164
+
165
+ ---
166
+
167
+ ## 5. Results
168
+
169
+ ### 5.1 Quantitative Results
170
+
171
+ | Model | Accuracy | Precision | Recall | F1 | AUC |
172
+ |-------|----------|-----------|--------|-------|-----|
173
+ | Baseline 1 | 0.65 | 0.64 | 0.66 | 0.65 | 0.70 |
174
+ | Baseline 2 | 0.78 | 0.77 | 0.79 | 0.78 | 0.82 |
175
+ | **Ours** | **0.89** | **0.88** | **0.90** | **0.89** | **0.93** |
176
+
177
+ ### 5.2 Statistical Significance
178
+
179
+ - **P-value**: Statistical test results
180
+ - **Confidence Intervals**: 95% CI for key metrics
181
+ - **Multiple Runs**: Mean ± std over N runs
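The "Mean ± std over N runs" and confidence-interval entries can be produced as sketched below. The run scores are hypothetical, and the interval uses a normal approximation; for a handful of runs a Student's t critical value gives a wider and more appropriate interval. The function name `summarize_runs` is illustrative.

```python
import math
import statistics


def summarize_runs(scores, confidence=0.95):
    """Return mean, sample std, and a normal-approximation CI for the mean."""
    n = len(scores)
    mean = statistics.mean(scores)
    std = statistics.stdev(scores)  # sample standard deviation (n - 1 denominator)
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * std / math.sqrt(n)
    return mean, std, (mean - half_width, mean + half_width)


# Hypothetical accuracies from five random seeds; replace with real results.
runs = [0.88, 0.89, 0.90, 0.89, 0.89]
mean, std, (low, high) = summarize_runs(runs)

print(f"{mean:.3f} ± {std:.3f} over {len(runs)} runs")
print(f"95% CI for the mean: [{low:.3f}, {high:.3f}]")
```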
182
+
183
+ ### 5.3 Per-Class Performance
184
+
185
+ | Class | Precision | Recall | F1 | Support |
186
+ |-------|-----------|--------|-----|---------|
187
+ | Class 1 | 0.90 | 0.88 | 0.89 | 500 |
188
+ | Class 2 | 0.87 | 0.91 | 0.89 | 450 |
189
+ | Class 3 | 0.88 | 0.89 | 0.88 | 550 |
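The F1 column is the harmonic mean of precision and recall, and the Support column allows a support-weighted average alongside the macro average. A sketch using the placeholder values from the table (names are illustrative; F1 computed from rounded precision and recall can differ slightly from F1 computed from raw counts):

```python
# (precision, recall, support) per class, copied from the table above.
per_class = {
    "Class 1": (0.90, 0.88, 500),
    "Class 2": (0.87, 0.91, 450),
    "Class 3": (0.88, 0.89, 550),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


f1 = {name: f1_score(p, r) for name, (p, r, _) in per_class.items()}
total_support = sum(s for _, _, s in per_class.values())

# Macro average treats classes equally; weighted average scales by support.
macro_f1 = sum(f1.values()) / len(f1)
weighted_f1 = sum(f1[name] * s for name, (_, _, s) in per_class.items()) / total_support

for name, value in f1.items():
    print(f"{name}: F1 = {value:.2f}")
print(f"macro F1 = {macro_f1:.3f}, weighted F1 = {weighted_f1:.3f}")
```

Reporting both averages is useful when classes are imbalanced, since the two can diverge noticeably.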
190
+
191
+ ### 5.4 Qualitative Results
192
+
193
+ #### Success Cases
194
+
195
+ Examples where the model performs well.
196
+
197
+ #### Failure Cases
198
+
199
+ Examples where the model fails and why.
200
+
201
+ ---
202
+
203
+ ## 6. Analysis
204
+
205
+ ### 6.1 Ablation Study
206
+
207
+ | Configuration | Score | Change |
208
+ |---------------|-------|--------|
209
+ | Full Model | 0.89 | - |
210
+ | - Feature Set A | 0.85 | -0.04 |
211
+ | - Feature Set B | 0.87 | -0.02 |
212
+ | - Augmentation | 0.86 | -0.03 |
213
+
214
+ ### 6.2 Error Analysis
215
+
216
+ What types of errors is the model making?
217
+
218
+ 1. **Error Type 1**: Frequency and cause
219
+ 2. **Error Type 2**: Frequency and cause
220
+ 3. **Error Type 3**: Frequency and cause
221
+
222
+ ### 6.3 Feature Importance
223
+
224
+ Which features matter most?
225
+
226
+ | Feature | Importance | Notes |
227
+ |---------|------------|-------|
228
+ | Feature 1 | 0.35 | Most predictive |
229
+ | Feature 2 | 0.28 | Secondary signal |
230
+ | Feature 3 | 0.15 | Marginal impact |
231
+
232
+ ---
233
+
234
+ ## 7. Robustness
235
+
236
+ ### 7.1 Cross-Dataset Evaluation
237
+
238
+ How does the model generalize to other datasets?
239
+
240
+ | Dataset | Score | Notes |
241
+ |---------|-------|-------|
242
+ | Original | 0.89 | Training distribution |
243
+ | Dataset A | 0.82 | Similar domain |
244
+ | Dataset B | 0.71 | Different domain |
245
+
246
+ ### 7.2 Adversarial Robustness
247
+
248
+ Performance under adversarial conditions.
249
+
250
+ ### 7.3 Fairness Analysis
251
+
252
+ Performance across demographic groups or sensitive attributes.
253
+
254
+ ---
255
+
256
+ ## 8. Deployment Considerations
257
+
258
+ ### 8.1 Model Size
259
+
260
+ - **Parameters**: Total count
261
+ - **Disk Size**: MB/GB on disk
262
+ - **Memory**: Runtime memory usage
263
+
264
+ ### 8.2 Inference Speed
265
+
266
+ | Batch Size | Latency | Throughput |
267
+ |------------|---------|------------|
268
+ | 1 | 10ms | 100 QPS |
269
+ | 8 | 45ms | 178 QPS |
270
+ | 32 | 150ms | 213 QPS |
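The Throughput column follows from batch size and latency if batches are processed one at a time with no overlap; that is an assumption, and pipelined or concurrent serving would give different numbers. A sketch with the table's placeholder values (the function name `throughput_qps` is illustrative):

```python
# (batch size, latency in ms) pairs copied from the table above.
measurements = [(1, 10.0), (8, 45.0), (32, 150.0)]


def throughput_qps(batch_size, latency_ms):
    """Queries per second when one batch completes every `latency_ms`."""
    return batch_size * 1000.0 / latency_ms


qps = [round(throughput_qps(b, ms)) for b, ms in measurements]

for (batch_size, latency_ms), value in zip(measurements, qps):
    per_query_ms = latency_ms / batch_size
    print(f"batch {batch_size}: {value} QPS, {per_query_ms:.2f} ms per query")
```

Larger batches raise throughput but also raise the latency each request sees, so the right batch size depends on the latency budget recorded under Production Requirements.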
271
+
272
+ ### 8.3 Production Requirements
273
+
274
+ - **Dependencies**: Software requirements
275
+ - **Infrastructure**: Hardware needs
276
+ - **Monitoring**: What to track in production
277
+ - **Fallback**: Backup strategy
278
+
279
+ ---
280
+
281
+ ## 9. Conclusions
282
+
283
+ ### 9.1 Summary
284
+
285
+ Key takeaways from the experiment.
286
+
287
+ ### 9.2 Did We Meet Objectives?
288
+
289
+ | Objective | Status | Notes |
290
+ |-----------|--------|-------|
291
+ | Objective 1 | ✅ Met | Achieved target |
292
+ | Objective 2 | ⚠️ Partial | Close to target |
293
+ | Objective 3 | ❌ Not Met | Needs more work |
294
+
295
+ ### 9.3 Lessons Learned
296
+
297
+ What did we learn from this experiment?
298
+
299
+ 1. Lesson 1
300
+ 2. Lesson 2
301
+ 3. Lesson 3
302
+
303
+ ---
304
+
305
+ ## 10. Next Steps
306
+
307
+ ### 10.1 Short-term (1-2 weeks)
308
+
309
+ - [ ] Task 1
310
+ - [ ] Task 2
311
+ - [ ] Task 3
312
+
313
+ ### 10.2 Medium-term (1-2 months)
314
+
315
+ - [ ] Task 1
316
+ - [ ] Task 2
317
+ - [ ] Task 3
318
+
319
+ ### 10.3 Long-term (3+ months)
320
+
321
+ - [ ] Task 1
322
+ - [ ] Task 2
323
+ - [ ] Task 3
324
+
325
+ ---
326
+
327
+ ## References
328
+
329
+ 1. Reference 1
330
+ 2. Reference 2
331
+ 3. Reference 3
332
+
333
+ ---
334
+
335
+ ## Appendix
336
+
337
+ ### A. Hyperparameter Search
338
+
339
+ Results from hyperparameter tuning.
340
+
341
+ ### B. Additional Experiments
342
+
343
+ Supplementary experiments not included in main text.
344
+
345
+ ### C. Code
346
+
347
+ Links to code repositories:
348
+ - Training code: [link]
349
+ - Evaluation code: [link]
350
+ - Model checkpoint: [link]
351
+
352
+ ### D. Data Card
353
+
354
+ Detailed data documentation following standard practices.
355
+
356
+ ### E. Model Card
357
+
358
+ Model documentation following responsible AI practices.