@synsci/cli-darwin-x64 1.1.49

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (373) hide show
  1. package/bin/skills/accelerate/SKILL.md +332 -0
  2. package/bin/skills/accelerate/references/custom-plugins.md +453 -0
  3. package/bin/skills/accelerate/references/megatron-integration.md +489 -0
  4. package/bin/skills/accelerate/references/performance.md +525 -0
  5. package/bin/skills/audiocraft/SKILL.md +564 -0
  6. package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
  7. package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
  8. package/bin/skills/autogpt/SKILL.md +403 -0
  9. package/bin/skills/autogpt/references/advanced-usage.md +535 -0
  10. package/bin/skills/autogpt/references/troubleshooting.md +420 -0
  11. package/bin/skills/awq/SKILL.md +310 -0
  12. package/bin/skills/awq/references/advanced-usage.md +324 -0
  13. package/bin/skills/awq/references/troubleshooting.md +344 -0
  14. package/bin/skills/axolotl/SKILL.md +158 -0
  15. package/bin/skills/axolotl/references/api.md +5548 -0
  16. package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
  17. package/bin/skills/axolotl/references/index.md +15 -0
  18. package/bin/skills/axolotl/references/other.md +3563 -0
  19. package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
  20. package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
  21. package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
  22. package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
  23. package/bin/skills/bitsandbytes/SKILL.md +411 -0
  24. package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
  25. package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
  26. package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
  27. package/bin/skills/blip-2/SKILL.md +564 -0
  28. package/bin/skills/blip-2/references/advanced-usage.md +680 -0
  29. package/bin/skills/blip-2/references/troubleshooting.md +526 -0
  30. package/bin/skills/chroma/SKILL.md +406 -0
  31. package/bin/skills/chroma/references/integration.md +38 -0
  32. package/bin/skills/clip/SKILL.md +253 -0
  33. package/bin/skills/clip/references/applications.md +207 -0
  34. package/bin/skills/constitutional-ai/SKILL.md +290 -0
  35. package/bin/skills/crewai/SKILL.md +498 -0
  36. package/bin/skills/crewai/references/flows.md +438 -0
  37. package/bin/skills/crewai/references/tools.md +429 -0
  38. package/bin/skills/crewai/references/troubleshooting.md +480 -0
  39. package/bin/skills/deepspeed/SKILL.md +141 -0
  40. package/bin/skills/deepspeed/references/08.md +17 -0
  41. package/bin/skills/deepspeed/references/09.md +173 -0
  42. package/bin/skills/deepspeed/references/2020.md +378 -0
  43. package/bin/skills/deepspeed/references/2023.md +279 -0
  44. package/bin/skills/deepspeed/references/assets.md +179 -0
  45. package/bin/skills/deepspeed/references/index.md +35 -0
  46. package/bin/skills/deepspeed/references/mii.md +118 -0
  47. package/bin/skills/deepspeed/references/other.md +1191 -0
  48. package/bin/skills/deepspeed/references/tutorials.md +6554 -0
  49. package/bin/skills/dspy/SKILL.md +590 -0
  50. package/bin/skills/dspy/references/examples.md +663 -0
  51. package/bin/skills/dspy/references/modules.md +475 -0
  52. package/bin/skills/dspy/references/optimizers.md +566 -0
  53. package/bin/skills/faiss/SKILL.md +221 -0
  54. package/bin/skills/faiss/references/index_types.md +280 -0
  55. package/bin/skills/flash-attention/SKILL.md +367 -0
  56. package/bin/skills/flash-attention/references/benchmarks.md +215 -0
  57. package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
  58. package/bin/skills/gguf/SKILL.md +427 -0
  59. package/bin/skills/gguf/references/advanced-usage.md +504 -0
  60. package/bin/skills/gguf/references/troubleshooting.md +442 -0
  61. package/bin/skills/gptq/SKILL.md +450 -0
  62. package/bin/skills/gptq/references/calibration.md +337 -0
  63. package/bin/skills/gptq/references/integration.md +129 -0
  64. package/bin/skills/gptq/references/troubleshooting.md +95 -0
  65. package/bin/skills/grpo-rl-training/README.md +97 -0
  66. package/bin/skills/grpo-rl-training/SKILL.md +572 -0
  67. package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
  68. package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
  69. package/bin/skills/guidance/SKILL.md +572 -0
  70. package/bin/skills/guidance/references/backends.md +554 -0
  71. package/bin/skills/guidance/references/constraints.md +674 -0
  72. package/bin/skills/guidance/references/examples.md +767 -0
  73. package/bin/skills/hqq/SKILL.md +445 -0
  74. package/bin/skills/hqq/references/advanced-usage.md +528 -0
  75. package/bin/skills/hqq/references/troubleshooting.md +503 -0
  76. package/bin/skills/hugging-face-cli/SKILL.md +191 -0
  77. package/bin/skills/hugging-face-cli/references/commands.md +954 -0
  78. package/bin/skills/hugging-face-cli/references/examples.md +374 -0
  79. package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
  80. package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
  81. package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
  82. package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
  83. package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
  84. package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
  85. package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
  86. package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
  87. package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
  88. package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
  89. package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
  90. package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
  91. package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
  92. package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
  93. package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
  94. package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
  95. package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
  96. package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
  97. package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
  98. package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
  99. package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
  100. package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
  101. package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
  102. package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
  103. package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
  104. package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
  105. package/bin/skills/hugging-face-jobs/index.html +216 -0
  106. package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
  107. package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
  108. package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
  109. package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
  110. package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
  111. package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
  112. package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
  113. package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
  114. package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
  115. package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
  116. package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
  117. package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
  118. package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
  119. package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
  120. package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
  121. package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
  122. package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
  123. package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
  124. package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
  125. package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
  126. package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
  127. package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
  128. package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
  129. package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
  130. package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
  131. package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
  132. package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
  133. package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
  134. package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
  135. package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
  136. package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
  137. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
  138. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
  139. package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
  140. package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
  141. package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
  142. package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
  143. package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
  144. package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
  145. package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
  146. package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
  147. package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
  148. package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
  149. package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
  150. package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
  151. package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
  152. package/bin/skills/instructor/SKILL.md +740 -0
  153. package/bin/skills/instructor/references/examples.md +107 -0
  154. package/bin/skills/instructor/references/providers.md +70 -0
  155. package/bin/skills/instructor/references/validation.md +606 -0
  156. package/bin/skills/knowledge-distillation/SKILL.md +458 -0
  157. package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
  158. package/bin/skills/lambda-labs/SKILL.md +545 -0
  159. package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
  160. package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
  161. package/bin/skills/langchain/SKILL.md +480 -0
  162. package/bin/skills/langchain/references/agents.md +499 -0
  163. package/bin/skills/langchain/references/integration.md +562 -0
  164. package/bin/skills/langchain/references/rag.md +600 -0
  165. package/bin/skills/langsmith/SKILL.md +422 -0
  166. package/bin/skills/langsmith/references/advanced-usage.md +548 -0
  167. package/bin/skills/langsmith/references/troubleshooting.md +537 -0
  168. package/bin/skills/litgpt/SKILL.md +469 -0
  169. package/bin/skills/litgpt/references/custom-models.md +568 -0
  170. package/bin/skills/litgpt/references/distributed-training.md +451 -0
  171. package/bin/skills/litgpt/references/supported-models.md +336 -0
  172. package/bin/skills/litgpt/references/training-recipes.md +619 -0
  173. package/bin/skills/llama-cpp/SKILL.md +258 -0
  174. package/bin/skills/llama-cpp/references/optimization.md +89 -0
  175. package/bin/skills/llama-cpp/references/quantization.md +213 -0
  176. package/bin/skills/llama-cpp/references/server.md +125 -0
  177. package/bin/skills/llama-factory/SKILL.md +80 -0
  178. package/bin/skills/llama-factory/references/_images.md +23 -0
  179. package/bin/skills/llama-factory/references/advanced.md +1055 -0
  180. package/bin/skills/llama-factory/references/getting_started.md +349 -0
  181. package/bin/skills/llama-factory/references/index.md +19 -0
  182. package/bin/skills/llama-factory/references/other.md +31 -0
  183. package/bin/skills/llamaguard/SKILL.md +337 -0
  184. package/bin/skills/llamaindex/SKILL.md +569 -0
  185. package/bin/skills/llamaindex/references/agents.md +83 -0
  186. package/bin/skills/llamaindex/references/data_connectors.md +108 -0
  187. package/bin/skills/llamaindex/references/query_engines.md +406 -0
  188. package/bin/skills/llava/SKILL.md +304 -0
  189. package/bin/skills/llava/references/training.md +197 -0
  190. package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
  191. package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
  192. package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
  193. package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
  194. package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
  195. package/bin/skills/long-context/SKILL.md +536 -0
  196. package/bin/skills/long-context/references/extension_methods.md +468 -0
  197. package/bin/skills/long-context/references/fine_tuning.md +611 -0
  198. package/bin/skills/long-context/references/rope.md +402 -0
  199. package/bin/skills/mamba/SKILL.md +260 -0
  200. package/bin/skills/mamba/references/architecture-details.md +206 -0
  201. package/bin/skills/mamba/references/benchmarks.md +255 -0
  202. package/bin/skills/mamba/references/training-guide.md +388 -0
  203. package/bin/skills/megatron-core/SKILL.md +366 -0
  204. package/bin/skills/megatron-core/references/benchmarks.md +249 -0
  205. package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
  206. package/bin/skills/megatron-core/references/production-examples.md +473 -0
  207. package/bin/skills/megatron-core/references/training-recipes.md +547 -0
  208. package/bin/skills/miles/SKILL.md +315 -0
  209. package/bin/skills/miles/references/api-reference.md +141 -0
  210. package/bin/skills/miles/references/troubleshooting.md +352 -0
  211. package/bin/skills/mlflow/SKILL.md +704 -0
  212. package/bin/skills/mlflow/references/deployment.md +744 -0
  213. package/bin/skills/mlflow/references/model-registry.md +770 -0
  214. package/bin/skills/mlflow/references/tracking.md +680 -0
  215. package/bin/skills/modal/SKILL.md +341 -0
  216. package/bin/skills/modal/references/advanced-usage.md +503 -0
  217. package/bin/skills/modal/references/troubleshooting.md +494 -0
  218. package/bin/skills/model-merging/SKILL.md +539 -0
  219. package/bin/skills/model-merging/references/evaluation.md +462 -0
  220. package/bin/skills/model-merging/references/examples.md +428 -0
  221. package/bin/skills/model-merging/references/methods.md +352 -0
  222. package/bin/skills/model-pruning/SKILL.md +495 -0
  223. package/bin/skills/model-pruning/references/wanda.md +347 -0
  224. package/bin/skills/moe-training/SKILL.md +526 -0
  225. package/bin/skills/moe-training/references/architectures.md +432 -0
  226. package/bin/skills/moe-training/references/inference.md +348 -0
  227. package/bin/skills/moe-training/references/training.md +425 -0
  228. package/bin/skills/nanogpt/SKILL.md +290 -0
  229. package/bin/skills/nanogpt/references/architecture.md +382 -0
  230. package/bin/skills/nanogpt/references/data.md +476 -0
  231. package/bin/skills/nanogpt/references/training.md +564 -0
  232. package/bin/skills/nemo-curator/SKILL.md +383 -0
  233. package/bin/skills/nemo-curator/references/deduplication.md +87 -0
  234. package/bin/skills/nemo-curator/references/filtering.md +102 -0
  235. package/bin/skills/nemo-evaluator/SKILL.md +494 -0
  236. package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
  237. package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
  238. package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
  239. package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
  240. package/bin/skills/nemo-guardrails/SKILL.md +297 -0
  241. package/bin/skills/nnsight/SKILL.md +436 -0
  242. package/bin/skills/nnsight/references/README.md +78 -0
  243. package/bin/skills/nnsight/references/api.md +344 -0
  244. package/bin/skills/nnsight/references/tutorials.md +300 -0
  245. package/bin/skills/openrlhf/SKILL.md +249 -0
  246. package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
  247. package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
  248. package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
  249. package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
  250. package/bin/skills/outlines/SKILL.md +652 -0
  251. package/bin/skills/outlines/references/backends.md +615 -0
  252. package/bin/skills/outlines/references/examples.md +773 -0
  253. package/bin/skills/outlines/references/json_generation.md +652 -0
  254. package/bin/skills/peft/SKILL.md +431 -0
  255. package/bin/skills/peft/references/advanced-usage.md +514 -0
  256. package/bin/skills/peft/references/troubleshooting.md +480 -0
  257. package/bin/skills/phoenix/SKILL.md +475 -0
  258. package/bin/skills/phoenix/references/advanced-usage.md +619 -0
  259. package/bin/skills/phoenix/references/troubleshooting.md +538 -0
  260. package/bin/skills/pinecone/SKILL.md +358 -0
  261. package/bin/skills/pinecone/references/deployment.md +181 -0
  262. package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
  263. package/bin/skills/pytorch-fsdp/references/index.md +7 -0
  264. package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
  265. package/bin/skills/pytorch-lightning/SKILL.md +346 -0
  266. package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
  267. package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
  268. package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
  269. package/bin/skills/pyvene/SKILL.md +473 -0
  270. package/bin/skills/pyvene/references/README.md +73 -0
  271. package/bin/skills/pyvene/references/api.md +383 -0
  272. package/bin/skills/pyvene/references/tutorials.md +376 -0
  273. package/bin/skills/qdrant/SKILL.md +493 -0
  274. package/bin/skills/qdrant/references/advanced-usage.md +648 -0
  275. package/bin/skills/qdrant/references/troubleshooting.md +631 -0
  276. package/bin/skills/ray-data/SKILL.md +326 -0
  277. package/bin/skills/ray-data/references/integration.md +82 -0
  278. package/bin/skills/ray-data/references/transformations.md +83 -0
  279. package/bin/skills/ray-train/SKILL.md +406 -0
  280. package/bin/skills/ray-train/references/multi-node.md +628 -0
  281. package/bin/skills/rwkv/SKILL.md +260 -0
  282. package/bin/skills/rwkv/references/architecture-details.md +344 -0
  283. package/bin/skills/rwkv/references/rwkv7.md +386 -0
  284. package/bin/skills/rwkv/references/state-management.md +369 -0
  285. package/bin/skills/saelens/SKILL.md +386 -0
  286. package/bin/skills/saelens/references/README.md +70 -0
  287. package/bin/skills/saelens/references/api.md +333 -0
  288. package/bin/skills/saelens/references/tutorials.md +318 -0
  289. package/bin/skills/segment-anything/SKILL.md +500 -0
  290. package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
  291. package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
  292. package/bin/skills/sentence-transformers/SKILL.md +255 -0
  293. package/bin/skills/sentence-transformers/references/models.md +123 -0
  294. package/bin/skills/sentencepiece/SKILL.md +235 -0
  295. package/bin/skills/sentencepiece/references/algorithms.md +200 -0
  296. package/bin/skills/sentencepiece/references/training.md +304 -0
  297. package/bin/skills/sglang/SKILL.md +442 -0
  298. package/bin/skills/sglang/references/deployment.md +490 -0
  299. package/bin/skills/sglang/references/radix-attention.md +413 -0
  300. package/bin/skills/sglang/references/structured-generation.md +541 -0
  301. package/bin/skills/simpo/SKILL.md +219 -0
  302. package/bin/skills/simpo/references/datasets.md +478 -0
  303. package/bin/skills/simpo/references/hyperparameters.md +452 -0
  304. package/bin/skills/simpo/references/loss-functions.md +350 -0
  305. package/bin/skills/skypilot/SKILL.md +509 -0
  306. package/bin/skills/skypilot/references/advanced-usage.md +491 -0
  307. package/bin/skills/skypilot/references/troubleshooting.md +570 -0
  308. package/bin/skills/slime/SKILL.md +464 -0
  309. package/bin/skills/slime/references/api-reference.md +392 -0
  310. package/bin/skills/slime/references/troubleshooting.md +386 -0
  311. package/bin/skills/speculative-decoding/SKILL.md +467 -0
  312. package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
  313. package/bin/skills/speculative-decoding/references/medusa.md +350 -0
  314. package/bin/skills/stable-diffusion/SKILL.md +519 -0
  315. package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
  316. package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
  317. package/bin/skills/tensorboard/SKILL.md +629 -0
  318. package/bin/skills/tensorboard/references/integrations.md +638 -0
  319. package/bin/skills/tensorboard/references/profiling.md +545 -0
  320. package/bin/skills/tensorboard/references/visualization.md +620 -0
  321. package/bin/skills/tensorrt-llm/SKILL.md +187 -0
  322. package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
  323. package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
  324. package/bin/skills/tensorrt-llm/references/serving.md +470 -0
  325. package/bin/skills/tinker/SKILL.md +362 -0
  326. package/bin/skills/tinker/references/api-reference.md +168 -0
  327. package/bin/skills/tinker/references/getting-started.md +157 -0
  328. package/bin/skills/tinker/references/loss-functions.md +163 -0
  329. package/bin/skills/tinker/references/models-and-lora.md +139 -0
  330. package/bin/skills/tinker/references/recipes.md +280 -0
  331. package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
  332. package/bin/skills/tinker/references/rendering.md +243 -0
  333. package/bin/skills/tinker/references/supervised-learning.md +232 -0
  334. package/bin/skills/tinker-training-cost/SKILL.md +187 -0
  335. package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
  336. package/bin/skills/torchforge/SKILL.md +433 -0
  337. package/bin/skills/torchforge/references/api-reference.md +327 -0
  338. package/bin/skills/torchforge/references/troubleshooting.md +409 -0
  339. package/bin/skills/torchtitan/SKILL.md +358 -0
  340. package/bin/skills/torchtitan/references/checkpoint.md +181 -0
  341. package/bin/skills/torchtitan/references/custom-models.md +258 -0
  342. package/bin/skills/torchtitan/references/float8.md +133 -0
  343. package/bin/skills/torchtitan/references/fsdp.md +126 -0
  344. package/bin/skills/transformer-lens/SKILL.md +346 -0
  345. package/bin/skills/transformer-lens/references/README.md +54 -0
  346. package/bin/skills/transformer-lens/references/api.md +362 -0
  347. package/bin/skills/transformer-lens/references/tutorials.md +339 -0
  348. package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
  349. package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
  350. package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
  351. package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
  352. package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
  353. package/bin/skills/unsloth/SKILL.md +80 -0
  354. package/bin/skills/unsloth/references/index.md +7 -0
  355. package/bin/skills/unsloth/references/llms-full.md +16799 -0
  356. package/bin/skills/unsloth/references/llms-txt.md +12044 -0
  357. package/bin/skills/unsloth/references/llms.md +82 -0
  358. package/bin/skills/verl/SKILL.md +391 -0
  359. package/bin/skills/verl/references/api-reference.md +301 -0
  360. package/bin/skills/verl/references/troubleshooting.md +391 -0
  361. package/bin/skills/vllm/SKILL.md +364 -0
  362. package/bin/skills/vllm/references/optimization.md +226 -0
  363. package/bin/skills/vllm/references/quantization.md +284 -0
  364. package/bin/skills/vllm/references/server-deployment.md +255 -0
  365. package/bin/skills/vllm/references/troubleshooting.md +447 -0
  366. package/bin/skills/weights-and-biases/SKILL.md +590 -0
  367. package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
  368. package/bin/skills/weights-and-biases/references/integrations.md +700 -0
  369. package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
  370. package/bin/skills/whisper/SKILL.md +317 -0
  371. package/bin/skills/whisper/references/languages.md +189 -0
  372. package/bin/synsc +0 -0
  373. package/package.json +10 -0
@@ -0,0 +1,392 @@
1
+ # slime API Reference
2
+
3
+ ## Architecture Overview
4
+
5
+ slime operates with a three-module architecture orchestrated by Ray:
6
+
7
+ ```
8
+ ┌─────────────────────────────────────────────────────────┐
9
+ │ Data Buffer │
10
+ │ - Prompt initialization and management │
11
+ │ - Custom data generation and filtering │
12
+ │ - Rollout sample storage │
13
+ └─────────────┬───────────────────────────┬───────────────┘
14
+ │ │
15
+ ┌─────────────▼───────────┐ ┌─────────────▼───────────────┐
16
+ │ Training (Megatron-LM) │ │ Rollout (SGLang + Router) │
17
+ │ - Actor model training │ │ - Response generation │
18
+ │ - Critic (optional) │ │ - Reward/verifier output │
19
+ │ - Weight sync to rollout│ │ - Multi-turn support │
20
+ └─────────────────────────┘ └─────────────────────────────┘
21
+ ```
22
+
23
+ ## Core Data Structures
24
+
25
+ ### Sample Object
26
+
27
+ The `Sample` object is the core data structure defined in `slime/utils/types.py`:
28
+
29
+ ```python
30
+ from slime.utils.types import Sample
31
+
32
+ @dataclass
33
+ class Sample:
34
+ # Core fields
35
+ group_index: Optional[int] # Group index for batching
36
+ index: Optional[int] # Sample index
37
+ prompt: str | list[dict] = "" # Input prompt or chat history
38
+ tokens: list[int] = field(default_factory=list) # Token IDs
39
+ response: str = "" # Generated response
40
+ response_length: int = 0 # Response length in tokens
41
+ label: Optional[str] = None # Ground truth label
42
+ reward: Optional[float | dict] = None # RL reward signal
43
+ loss_mask: Optional[list[int]] = None # 1=compute loss, 0=mask
44
+ status: Status = Status.PENDING # Sample status
45
+ metadata: dict = field(default_factory=dict) # Custom data
46
+
47
+ # Multimodal support
48
+ multimodal_inputs: Optional[Any] = None # Raw multimodal data (images, videos)
49
+ multimodal_train_inputs: Optional[Any] = None # Processed multimodal data (pixel_values)
50
+
51
+ # Rollout tracking
52
+ weight_versions: list[str] = field(default_factory=list)
53
+ rollout_log_probs: Optional[list[float]] = None # Log probs from SGLang
54
+ rollout_routed_experts: Optional[list[list[int]]] = None # Expert routing (MoE)
55
+
56
+ # Control fields
57
+ remove_sample: bool = False
58
+ generate_function_path: Optional[str] = None
59
+ train_metadata: Optional[dict] = None
60
+ non_generation_time: float = 0.0
61
+
62
+ # Speculative decoding info (nested dataclass)
63
+ @dataclass
64
+ class SpecInfo:
65
+ spec_accept_token_num: int = 0
66
+ spec_draft_token_num: int = 0
67
+ spec_verify_ct: int = 0
68
+ completion_token_num: int = 0
69
+ ```
70
+
71
+ ### Status Enum
72
+
73
+ ```python
74
+ class Status(Enum):
75
+ PENDING = "pending" # Not yet processed
76
+ COMPLETED = "completed" # Successfully generated
77
+ TRUNCATED = "truncated" # Hit max length
78
+ ABORTED = "aborted" # Failed generation
79
+ FAILED = "failed" # Generation failed
80
+ ```
81
+
82
+ ## Configuration System
83
+
84
+ slime uses three categories of command-line arguments:
85
+
86
+ ### 1. Megatron Arguments
87
+
88
+ All Megatron-LM arguments are supported directly:
89
+
90
+ ```bash
91
+ --tensor-model-parallel-size 2
92
+ --pipeline-model-parallel-size 1
93
+ --num-layers 32
94
+ --hidden-size 4096
95
+ --num-attention-heads 32
96
+ --seq-length 4096
97
+ --micro-batch-size 1
98
+ --global-batch-size 256
99
+ ```
100
+
101
+ ### 2. SGLang Arguments
102
+
103
+ SGLang arguments are prefixed with `--sglang-`:
104
+
105
+ ```bash
106
+ --sglang-mem-fraction-static 0.8 # GPU memory for KV cache
107
+ --sglang-context-length 8192 # Maximum context length
108
+ --sglang-log-level INFO # Logging verbosity
109
+ --sglang-tp-size 2 # Tensor parallelism
110
+ --sglang-disable-cuda-graph # Disable CUDA graphs
111
+ ```
112
+
113
+ ### 3. slime-Specific Arguments
114
+
115
+ Defined in `slime/utils/arguments.py`:
116
+
117
+ ```bash
118
+ # Resource Allocation
119
+ --actor-num-nodes 1 # Training nodes
120
+ --actor-num-gpus-per-node 8 # GPUs per training node
121
+ --rollout-num-gpus 8 # Total rollout GPUs
122
+ --rollout-num-gpus-per-engine 2 # GPUs per SGLang engine
123
+ --colocate # Share GPUs for train/inference
124
+
125
+ # Data Configuration
126
+ --prompt-data /path/to/data.jsonl # Training data path
127
+ --input-key prompt # Key for prompts in JSON
128
+ --label-key label # Key for labels in JSON
129
+ --apply-chat-template # Apply chat formatting
130
+
131
+ # Training Loop
132
+ --num-rollout 3000 # Total rollout iterations
133
+ --rollout-batch-size 32 # Prompts per rollout
134
+ --n-samples-per-prompt 8 # Responses per prompt
135
+ --global-batch-size 256 # Training batch size
136
+ --num-steps-per-rollout 1 # Training steps per rollout
137
+
138
+ # RL Algorithm
139
+ --advantage-estimator grpo # grpo, gspo, ppo, reinforce_plus_plus
140
+ --use-kl-loss # Enable KL loss
141
+ --kl-loss-coef 0.001 # KL coefficient
142
+ --calculate-per-token-loss # Token-level loss
143
+
144
+ # Off-Policy Options
145
+ --use-tis # Truncated Importance Sampling
146
+ --tis-threshold 0.9 # TIS threshold
147
+ --true-on-policy-mode # Force on-policy training
148
+ ```
149
+
150
+ ## Data Buffer System
151
+
152
+ ### RolloutDataSource (Base Class)
153
+
154
+ ```python
155
+ from slime.data import RolloutDataSource
156
+
157
+ class RolloutDataSource:
158
+ def __init__(self, dataset, args):
159
+ self.dataset = dataset
160
+ self.args = args
161
+
162
+ def get_samples(self, num_samples: int) -> list[Sample]:
163
+ """Fetch prompts from dataset."""
164
+ return [Sample(prompt=p) for p in self.dataset.sample(num_samples)]
165
+
166
+ def add_samples(self, samples: list[Sample]) -> None:
167
+ """Called after generation (no-op by default)."""
168
+ pass
169
+ ```
170
+
171
+ ### Buffered Data Source (Off-Policy)
172
+
173
+ ```python
174
+ from slime.data import RolloutDataSourceWithBuffer
175
+
176
+ class RolloutDataSourceWithBuffer(RolloutDataSource):
177
+ def __init__(self, dataset, args):
178
+ super().__init__(dataset, args)
179
+ self.buffer = []
180
+
181
+ def add_samples(self, samples: list[Sample]) -> None:
182
+ """Store generated samples for reuse."""
183
+ self.buffer.extend(samples)
184
+
185
+ def buffer_filter(self, args, buffer, num_samples) -> list[Sample]:
186
+ """Custom selection logic."""
187
+ # Example: prioritized sampling based on reward
188
+ sorted_buffer = sorted(buffer, key=lambda s: s.reward, reverse=True)
189
+ return sorted_buffer[:num_samples]
190
+ ```
191
+
192
+ ## Custom Functions
193
+
194
+ ### Custom Generate Function
195
+
196
+ For multi-turn or tool-calling scenarios:
197
+
198
+ ```python
199
+ # custom_generate.py
200
+ from slime.data import Sample
201
+
202
+ async def custom_generate(args, samples: list[Sample], evaluation: bool = False) -> list[Sample]:
203
+ """
204
+ Custom generation function for multi-turn interactions.
205
+
206
+ Args:
207
+ args: Training arguments
208
+ samples: List of Sample objects with prompts
209
+ evaluation: Whether this is an evaluation run
210
+
211
+ Returns:
212
+ List of Sample objects with responses and rewards
213
+ """
214
+ for sample in samples:
215
+ conversation = sample.prompt if isinstance(sample.prompt, list) else [
216
+ {"role": "user", "content": sample.prompt}
217
+ ]
218
+
219
+ for turn in range(args.max_turns):
220
+ # Generate response
221
+ response = await generate_single(conversation)
222
+
223
+ # Check for tool call
224
+ tool_call = extract_tool_call(response)
225
+ if tool_call:
226
+ # Execute tool
227
+ tool_result = await execute_tool(tool_call)
228
+ conversation.append({"role": "assistant", "content": response})
229
+ conversation.append({"role": "tool", "content": tool_result})
230
+ else:
231
+ # Final response
232
+ sample.response = response
233
+ break
234
+
235
+ # Compute reward
236
+ sample.reward = compute_reward(sample)
237
+
238
+ # Set loss mask (1 for model tokens, 0 for tool responses)
239
+ sample.loss_mask = build_loss_mask(sample)
240
+
241
+ return samples
242
+ ```
243
+
244
+ Usage:
245
+ ```bash
246
+ python train.py \
247
+ --custom-generate-function-path custom_generate.py \
248
+ --max-turns 5
249
+ ```
250
+
251
+ ### Custom Reward Function
252
+
253
+ ```python
254
+ # custom_rm.py
255
+ from slime.data import Sample
256
+
257
+ async def reward_func(args, sample: Sample, **kwargs) -> float:
258
+ """
259
+ Compute reward for a single sample.
260
+
261
+ Args:
262
+ args: Training arguments
263
+ sample: Sample object with response
264
+
265
+ Returns:
266
+ Reward score (float)
267
+ """
268
+ response = sample.response
269
+ ground_truth = sample.label or sample.metadata.get("answer", "")
270
+
271
+ # Example: exact match reward
272
+ if response.strip() == ground_truth.strip():
273
+ return 1.0
274
+ return 0.0
275
+
276
+ # For batched processing (more efficient)
277
+ async def batched_custom_rm(args, samples: list[Sample]) -> list[float]:
278
+ """Batch reward computation."""
279
+ rewards = []
280
+ for sample in samples:
281
+ reward = await reward_func(args, sample)
282
+ rewards.append(reward)
283
+ return rewards
284
+ ```
285
+
286
+ Usage:
287
+ ```bash
288
+ python train.py \
289
+ --custom-rm-path custom_rm.py \
290
+ --group-rm # Enable batched processing
291
+ ```
292
+
293
+ ## Model Configuration
294
+
295
+ ### Pre-configured Model Scripts
296
+
297
+ Located in `scripts/models/`:
298
+
299
+ ```bash
300
+ # List available models
301
+ ls scripts/models/
302
+ # glm4-9B.sh, qwen3-4B.sh, qwen3-30B-A3B.sh, deepseek-v3.sh, llama3-8B.sh
303
+
304
+ # Source model configuration
305
+ source scripts/models/qwen3-4B.sh
306
+ # This sets MODEL_ARGS and CKPT_ARGS arrays
307
+ ```
308
+
309
+ ### Example Model Script
310
+
311
+ ```bash
312
+ # scripts/models/qwen3-4B.sh
313
+ export MODEL_ARGS=(
314
+ --num-layers 36
315
+ --hidden-size 2560
316
+ --num-attention-heads 20
317
+ --num-query-groups 4
318
+ --ffn-hidden-size 6912
319
+ --max-position-embeddings 32768
320
+ --rotary-percent 1.0
321
+ --rotary-base 1000000
322
+ --swiglu
323
+ --untie-embeddings-and-output-weights
324
+ --no-position-embedding
325
+ --normalization RMSNorm
326
+ --tokenizer-type HuggingFaceTokenizer
327
+ --bf16
328
+ )
329
+
330
+ export CKPT_ARGS=(
331
+ --hf-checkpoint /path/to/qwen3-4b-hf
332
+ --initial-megatron-checkpoint /path/to/megatron/ckpt
333
+ )
334
+ ```
335
+
336
+ ## Async Training
337
+
338
+ ### Enabling Async Mode
339
+
340
+ ```bash
341
+ python train_async.py \
342
+ --actor-num-gpus-per-node 8 \
343
+ --rollout-num-gpus 8 \
344
+ --async-buffer-size 4 \
345
+ --update-weights-interval 2 \
346
+ ${MODEL_ARGS[@]}
347
+ ```
348
+
349
+ ### Async-Specific Parameters
350
+
351
+ ```bash
352
+ --async-buffer-size 4 # Number of rollouts to buffer
353
+ --update-weights-interval 2 # Sync weights every N rollouts
354
+ ```
355
+
356
+ **Note**: Colocated mode (`--colocate`) is NOT supported with async training.
357
+
358
+ ## Evaluation
359
+
360
+ ### Multi-Task Evaluation
361
+
362
+ ```bash
363
+ --eval-prompt-data aime /path/to/aime.jsonl \
364
+ --eval-prompt-data gsm8k /path/to/gsm8k.jsonl \
365
+ --n-samples-per-eval-prompt 16 \
366
+ --eval-interval 50
367
+ ```
368
+
369
+ ### Evaluation Configuration
370
+
371
+ ```bash
372
+ --eval-interval 50 # Evaluate every N rollouts
373
+ --n-samples-per-eval-prompt 16 # Samples for evaluation
374
+ --eval-temperature 0.0 # Greedy decoding for eval
375
+ ```
376
+
377
+ ## Supported Models
378
+
379
+ | Model Family | Configurations |
380
+ |--------------|----------------|
381
+ | GLM | GLM-4.5, GLM-4.6, GLM-4.7, GLM-Z1-9B |
382
+ | Qwen | Qwen3 (4B, 8B, 30B-A3B), Qwen3-MoE, Qwen2.5 |
383
+ | DeepSeek | V3, V3.1, R1 |
384
+ | Llama | Llama 3 (8B, 70B) |
385
+ | Others | Kimi K2, Moonlight-16B |
386
+
387
+ ## Resources
388
+
389
+ - Documentation: https://thudm.github.io/slime/
390
+ - GitHub: https://github.com/THUDM/slime
391
+ - Blog: https://lmsys.org/blog/2025-07-09-slime/
392
+ - Examples: `examples/` directory (14+ worked examples)