@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
|
@@ -0,0 +1,392 @@
|
|
|
1
|
+
# slime API Reference
|
|
2
|
+
|
|
3
|
+
## Architecture Overview
|
|
4
|
+
|
|
5
|
+
slime operates with a three-module architecture orchestrated by Ray:
|
|
6
|
+
|
|
7
|
+
```
|
|
8
|
+
┌─────────────────────────────────────────────────────────┐
|
|
9
|
+
│ Data Buffer │
|
|
10
|
+
│ - Prompt initialization and management │
|
|
11
|
+
│ - Custom data generation and filtering │
|
|
12
|
+
│ - Rollout sample storage │
|
|
13
|
+
└─────────────┬───────────────────────────┬───────────────┘
|
|
14
|
+
│ │
|
|
15
|
+
┌─────────────▼───────────┐ ┌─────────────▼───────────────┐
|
|
16
|
+
│ Training (Megatron-LM) │ │ Rollout (SGLang + Router) │
|
|
17
|
+
│ - Actor model training │ │ - Response generation │
|
|
18
|
+
│ - Critic (optional) │ │ - Reward/verifier output │
|
|
19
|
+
│ - Weight sync to rollout│ │ - Multi-turn support │
|
|
20
|
+
└─────────────────────────┘ └─────────────────────────────┘
|
|
21
|
+
```
|
|
22
|
+
|
|
23
|
+
## Core Data Structures
|
|
24
|
+
|
|
25
|
+
### Sample Object
|
|
26
|
+
|
|
27
|
+
The `Sample` object is the core data structure defined in `slime/utils/types.py`:
|
|
28
|
+
|
|
29
|
+
```python
|
|
30
|
+
from slime.utils.types import Sample
|
|
31
|
+
|
|
32
|
+
@dataclass
|
|
33
|
+
class Sample:
|
|
34
|
+
# Core fields
|
|
35
|
+
group_index: Optional[int] # Group index for batching
|
|
36
|
+
index: Optional[int] # Sample index
|
|
37
|
+
prompt: str | list[dict] = "" # Input prompt or chat history
|
|
38
|
+
tokens: list[int] = field(default_factory=list) # Token IDs
|
|
39
|
+
response: str = "" # Generated response
|
|
40
|
+
response_length: int = 0 # Response length in tokens
|
|
41
|
+
label: Optional[str] = None # Ground truth label
|
|
42
|
+
reward: Optional[float | dict] = None # RL reward signal
|
|
43
|
+
loss_mask: Optional[list[int]] = None # 1=compute loss, 0=mask
|
|
44
|
+
status: Status = Status.PENDING # Sample status
|
|
45
|
+
metadata: dict = field(default_factory=dict) # Custom data
|
|
46
|
+
|
|
47
|
+
# Multimodal support
|
|
48
|
+
multimodal_inputs: Optional[Any] = None # Raw multimodal data (images, videos)
|
|
49
|
+
multimodal_train_inputs: Optional[Any] = None # Processed multimodal data (pixel_values)
|
|
50
|
+
|
|
51
|
+
# Rollout tracking
|
|
52
|
+
weight_versions: list[str] = field(default_factory=list)
|
|
53
|
+
rollout_log_probs: Optional[list[float]] = None # Log probs from SGLang
|
|
54
|
+
rollout_routed_experts: Optional[list[list[int]]] = None # Expert routing (MoE)
|
|
55
|
+
|
|
56
|
+
# Control fields
|
|
57
|
+
remove_sample: bool = False
|
|
58
|
+
generate_function_path: Optional[str] = None
|
|
59
|
+
train_metadata: Optional[dict] = None
|
|
60
|
+
non_generation_time: float = 0.0
|
|
61
|
+
|
|
62
|
+
# Speculative decoding info (nested dataclass)
|
|
63
|
+
@dataclass
|
|
64
|
+
class SpecInfo:
|
|
65
|
+
spec_accept_token_num: int = 0
|
|
66
|
+
spec_draft_token_num: int = 0
|
|
67
|
+
spec_verify_ct: int = 0
|
|
68
|
+
completion_token_num: int = 0
|
|
69
|
+
```
|
|
70
|
+
|
|
71
|
+
### Status Enum
|
|
72
|
+
|
|
73
|
+
```python
|
|
74
|
+
class Status(Enum):
|
|
75
|
+
PENDING = "pending" # Not yet processed
|
|
76
|
+
COMPLETED = "completed" # Successfully generated
|
|
77
|
+
TRUNCATED = "truncated" # Hit max length
|
|
78
|
+
ABORTED = "aborted" # Failed generation
|
|
79
|
+
FAILED = "failed" # Generation failed
|
|
80
|
+
```
|
|
81
|
+
|
|
82
|
+
## Configuration System
|
|
83
|
+
|
|
84
|
+
slime uses three categories of command-line arguments:
|
|
85
|
+
|
|
86
|
+
### 1. Megatron Arguments
|
|
87
|
+
|
|
88
|
+
All Megatron-LM arguments are supported directly:
|
|
89
|
+
|
|
90
|
+
```bash
|
|
91
|
+
--tensor-model-parallel-size 2
|
|
92
|
+
--pipeline-model-parallel-size 1
|
|
93
|
+
--num-layers 32
|
|
94
|
+
--hidden-size 4096
|
|
95
|
+
--num-attention-heads 32
|
|
96
|
+
--seq-length 4096
|
|
97
|
+
--micro-batch-size 1
|
|
98
|
+
--global-batch-size 256
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
### 2. SGLang Arguments
|
|
102
|
+
|
|
103
|
+
SGLang arguments are prefixed with `--sglang-`:
|
|
104
|
+
|
|
105
|
+
```bash
|
|
106
|
+
--sglang-mem-fraction-static 0.8 # GPU memory for KV cache
|
|
107
|
+
--sglang-context-length 8192 # Maximum context length
|
|
108
|
+
--sglang-log-level INFO # Logging verbosity
|
|
109
|
+
--sglang-tp-size 2 # Tensor parallelism
|
|
110
|
+
--sglang-disable-cuda-graph # Disable CUDA graphs
|
|
111
|
+
```
|
|
112
|
+
|
|
113
|
+
### 3. slime-Specific Arguments
|
|
114
|
+
|
|
115
|
+
Defined in `slime/utils/arguments.py`:
|
|
116
|
+
|
|
117
|
+
```bash
|
|
118
|
+
# Resource Allocation
|
|
119
|
+
--actor-num-nodes 1 # Training nodes
|
|
120
|
+
--actor-num-gpus-per-node 8 # GPUs per training node
|
|
121
|
+
--rollout-num-gpus 8 # Total rollout GPUs
|
|
122
|
+
--rollout-num-gpus-per-engine 2 # GPUs per SGLang engine
|
|
123
|
+
--colocate # Share GPUs for train/inference
|
|
124
|
+
|
|
125
|
+
# Data Configuration
|
|
126
|
+
--prompt-data /path/to/data.jsonl # Training data path
|
|
127
|
+
--input-key prompt # Key for prompts in JSON
|
|
128
|
+
--label-key label # Key for labels in JSON
|
|
129
|
+
--apply-chat-template # Apply chat formatting
|
|
130
|
+
|
|
131
|
+
# Training Loop
|
|
132
|
+
--num-rollout 3000 # Total rollout iterations
|
|
133
|
+
--rollout-batch-size 32 # Prompts per rollout
|
|
134
|
+
--n-samples-per-prompt 8 # Responses per prompt
|
|
135
|
+
--global-batch-size 256 # Training batch size
|
|
136
|
+
--num-steps-per-rollout 1 # Training steps per rollout
|
|
137
|
+
|
|
138
|
+
# RL Algorithm
|
|
139
|
+
--advantage-estimator grpo # grpo, gspo, ppo, reinforce_plus_plus
|
|
140
|
+
--use-kl-loss # Enable KL loss
|
|
141
|
+
--kl-loss-coef 0.001 # KL coefficient
|
|
142
|
+
--calculate-per-token-loss # Token-level loss
|
|
143
|
+
|
|
144
|
+
# Off-Policy Options
|
|
145
|
+
--use-tis # Truncated Importance Sampling
|
|
146
|
+
--tis-threshold 0.9 # TIS threshold
|
|
147
|
+
--true-on-policy-mode # Force on-policy training
|
|
148
|
+
```
|
|
149
|
+
|
|
150
|
+
## Data Buffer System
|
|
151
|
+
|
|
152
|
+
### RolloutDataSource (Base Class)
|
|
153
|
+
|
|
154
|
+
```python
|
|
155
|
+
from slime.data import RolloutDataSource
|
|
156
|
+
|
|
157
|
+
class RolloutDataSource:
|
|
158
|
+
def __init__(self, dataset, args):
|
|
159
|
+
self.dataset = dataset
|
|
160
|
+
self.args = args
|
|
161
|
+
|
|
162
|
+
def get_samples(self, num_samples: int) -> list[Sample]:
|
|
163
|
+
"""Fetch prompts from dataset."""
|
|
164
|
+
return [Sample(prompt=p) for p in self.dataset.sample(num_samples)]
|
|
165
|
+
|
|
166
|
+
def add_samples(self, samples: list[Sample]) -> None:
|
|
167
|
+
"""Called after generation (no-op by default)."""
|
|
168
|
+
pass
|
|
169
|
+
```
|
|
170
|
+
|
|
171
|
+
### Buffered Data Source (Off-Policy)
|
|
172
|
+
|
|
173
|
+
```python
|
|
174
|
+
from slime.data import RolloutDataSourceWithBuffer
|
|
175
|
+
|
|
176
|
+
class RolloutDataSourceWithBuffer(RolloutDataSource):
|
|
177
|
+
def __init__(self, dataset, args):
|
|
178
|
+
super().__init__(dataset, args)
|
|
179
|
+
self.buffer = []
|
|
180
|
+
|
|
181
|
+
def add_samples(self, samples: list[Sample]) -> None:
|
|
182
|
+
"""Store generated samples for reuse."""
|
|
183
|
+
self.buffer.extend(samples)
|
|
184
|
+
|
|
185
|
+
def buffer_filter(self, args, buffer, num_samples) -> list[Sample]:
|
|
186
|
+
"""Custom selection logic."""
|
|
187
|
+
# Example: prioritized sampling based on reward
|
|
188
|
+
sorted_buffer = sorted(buffer, key=lambda s: s.reward, reverse=True)
|
|
189
|
+
return sorted_buffer[:num_samples]
|
|
190
|
+
```
|
|
191
|
+
|
|
192
|
+
## Custom Functions
|
|
193
|
+
|
|
194
|
+
### Custom Generate Function
|
|
195
|
+
|
|
196
|
+
For multi-turn or tool-calling scenarios:
|
|
197
|
+
|
|
198
|
+
```python
|
|
199
|
+
# custom_generate.py
|
|
200
|
+
from slime.data import Sample
|
|
201
|
+
|
|
202
|
+
async def custom_generate(args, samples: list[Sample], evaluation: bool = False) -> list[Sample]:
|
|
203
|
+
"""
|
|
204
|
+
Custom generation function for multi-turn interactions.
|
|
205
|
+
|
|
206
|
+
Args:
|
|
207
|
+
args: Training arguments
|
|
208
|
+
samples: List of Sample objects with prompts
|
|
209
|
+
evaluation: Whether this is an evaluation run
|
|
210
|
+
|
|
211
|
+
Returns:
|
|
212
|
+
List of Sample objects with responses and rewards
|
|
213
|
+
"""
|
|
214
|
+
for sample in samples:
|
|
215
|
+
conversation = sample.prompt if isinstance(sample.prompt, list) else [
|
|
216
|
+
{"role": "user", "content": sample.prompt}
|
|
217
|
+
]
|
|
218
|
+
|
|
219
|
+
for turn in range(args.max_turns):
|
|
220
|
+
# Generate response
|
|
221
|
+
response = await generate_single(conversation)
|
|
222
|
+
|
|
223
|
+
# Check for tool call
|
|
224
|
+
tool_call = extract_tool_call(response)
|
|
225
|
+
if tool_call:
|
|
226
|
+
# Execute tool
|
|
227
|
+
tool_result = await execute_tool(tool_call)
|
|
228
|
+
conversation.append({"role": "assistant", "content": response})
|
|
229
|
+
conversation.append({"role": "tool", "content": tool_result})
|
|
230
|
+
else:
|
|
231
|
+
# Final response
|
|
232
|
+
sample.response = response
|
|
233
|
+
break
|
|
234
|
+
|
|
235
|
+
# Compute reward
|
|
236
|
+
sample.reward = compute_reward(sample)
|
|
237
|
+
|
|
238
|
+
# Set loss mask (1 for model tokens, 0 for tool responses)
|
|
239
|
+
sample.loss_mask = build_loss_mask(sample)
|
|
240
|
+
|
|
241
|
+
return samples
|
|
242
|
+
```
|
|
243
|
+
|
|
244
|
+
Usage:
|
|
245
|
+
```bash
|
|
246
|
+
python train.py \
|
|
247
|
+
--custom-generate-function-path custom_generate.py \
|
|
248
|
+
--max-turns 5
|
|
249
|
+
```
|
|
250
|
+
|
|
251
|
+
### Custom Reward Function
|
|
252
|
+
|
|
253
|
+
```python
|
|
254
|
+
# custom_rm.py
|
|
255
|
+
from slime.data import Sample
|
|
256
|
+
|
|
257
|
+
async def reward_func(args, sample: Sample, **kwargs) -> float:
|
|
258
|
+
"""
|
|
259
|
+
Compute reward for a single sample.
|
|
260
|
+
|
|
261
|
+
Args:
|
|
262
|
+
args: Training arguments
|
|
263
|
+
sample: Sample object with response
|
|
264
|
+
|
|
265
|
+
Returns:
|
|
266
|
+
Reward score (float)
|
|
267
|
+
"""
|
|
268
|
+
response = sample.response
|
|
269
|
+
ground_truth = sample.label or sample.metadata.get("answer", "")
|
|
270
|
+
|
|
271
|
+
# Example: exact match reward
|
|
272
|
+
if response.strip() == ground_truth.strip():
|
|
273
|
+
return 1.0
|
|
274
|
+
return 0.0
|
|
275
|
+
|
|
276
|
+
# For batched processing (more efficient)
|
|
277
|
+
async def batched_custom_rm(args, samples: list[Sample]) -> list[float]:
|
|
278
|
+
"""Batch reward computation."""
|
|
279
|
+
rewards = []
|
|
280
|
+
for sample in samples:
|
|
281
|
+
reward = await reward_func(args, sample)
|
|
282
|
+
rewards.append(reward)
|
|
283
|
+
return rewards
|
|
284
|
+
```
|
|
285
|
+
|
|
286
|
+
Usage:
|
|
287
|
+
```bash
|
|
288
|
+
python train.py \
|
|
289
|
+
--custom-rm-path custom_rm.py \
|
|
290
|
+
--group-rm # Enable batched processing
|
|
291
|
+
```
|
|
292
|
+
|
|
293
|
+
## Model Configuration
|
|
294
|
+
|
|
295
|
+
### Pre-configured Model Scripts
|
|
296
|
+
|
|
297
|
+
Located in `scripts/models/`:
|
|
298
|
+
|
|
299
|
+
```bash
|
|
300
|
+
# List available models
|
|
301
|
+
ls scripts/models/
|
|
302
|
+
# glm4-9B.sh, qwen3-4B.sh, qwen3-30B-A3B.sh, deepseek-v3.sh, llama3-8B.sh
|
|
303
|
+
|
|
304
|
+
# Source model configuration
|
|
305
|
+
source scripts/models/qwen3-4B.sh
|
|
306
|
+
# This sets MODEL_ARGS and CKPT_ARGS arrays
|
|
307
|
+
```
|
|
308
|
+
|
|
309
|
+
### Example Model Script
|
|
310
|
+
|
|
311
|
+
```bash
|
|
312
|
+
# scripts/models/qwen3-4B.sh
|
|
313
|
+
export MODEL_ARGS=(
|
|
314
|
+
--num-layers 36
|
|
315
|
+
--hidden-size 2560
|
|
316
|
+
--num-attention-heads 20
|
|
317
|
+
--num-query-groups 4
|
|
318
|
+
--ffn-hidden-size 6912
|
|
319
|
+
--max-position-embeddings 32768
|
|
320
|
+
--rotary-percent 1.0
|
|
321
|
+
--rotary-base 1000000
|
|
322
|
+
--swiglu
|
|
323
|
+
--untie-embeddings-and-output-weights
|
|
324
|
+
--no-position-embedding
|
|
325
|
+
--normalization RMSNorm
|
|
326
|
+
--tokenizer-type HuggingFaceTokenizer
|
|
327
|
+
--bf16
|
|
328
|
+
)
|
|
329
|
+
|
|
330
|
+
export CKPT_ARGS=(
|
|
331
|
+
--hf-checkpoint /path/to/qwen3-4b-hf
|
|
332
|
+
--initial-megatron-checkpoint /path/to/megatron/ckpt
|
|
333
|
+
)
|
|
334
|
+
```
|
|
335
|
+
|
|
336
|
+
## Async Training
|
|
337
|
+
|
|
338
|
+
### Enabling Async Mode
|
|
339
|
+
|
|
340
|
+
```bash
|
|
341
|
+
python train_async.py \
|
|
342
|
+
--actor-num-gpus-per-node 8 \
|
|
343
|
+
--rollout-num-gpus 8 \
|
|
344
|
+
--async-buffer-size 4 \
|
|
345
|
+
--update-weights-interval 2 \
|
|
346
|
+
${MODEL_ARGS[@]}
|
|
347
|
+
```
|
|
348
|
+
|
|
349
|
+
### Async-Specific Parameters
|
|
350
|
+
|
|
351
|
+
```bash
|
|
352
|
+
--async-buffer-size 4 # Number of rollouts to buffer
|
|
353
|
+
--update-weights-interval 2 # Sync weights every N rollouts
|
|
354
|
+
```
|
|
355
|
+
|
|
356
|
+
**Note**: Colocated mode (`--colocate`) is NOT supported with async training.
|
|
357
|
+
|
|
358
|
+
## Evaluation
|
|
359
|
+
|
|
360
|
+
### Multi-Task Evaluation
|
|
361
|
+
|
|
362
|
+
```bash
|
|
363
|
+
--eval-prompt-data aime /path/to/aime.jsonl \
|
|
364
|
+
--eval-prompt-data gsm8k /path/to/gsm8k.jsonl \
|
|
365
|
+
--n-samples-per-eval-prompt 16 \
|
|
366
|
+
--eval-interval 50
|
|
367
|
+
```
|
|
368
|
+
|
|
369
|
+
### Evaluation Configuration
|
|
370
|
+
|
|
371
|
+
```bash
|
|
372
|
+
--eval-interval 50 # Evaluate every N rollouts
|
|
373
|
+
--n-samples-per-eval-prompt 16 # Samples for evaluation
|
|
374
|
+
--eval-temperature 0.0 # Greedy decoding for eval
|
|
375
|
+
```
|
|
376
|
+
|
|
377
|
+
## Supported Models
|
|
378
|
+
|
|
379
|
+
| Model Family | Configurations |
|
|
380
|
+
|--------------|----------------|
|
|
381
|
+
| GLM | GLM-4.5, GLM-4.6, GLM-4.7, GLM-Z1-9B |
|
|
382
|
+
| Qwen | Qwen3 (4B, 8B, 30B-A3B), Qwen3-MoE, Qwen2.5 |
|
|
383
|
+
| DeepSeek | V3, V3.1, R1 |
|
|
384
|
+
| Llama | Llama 3 (8B, 70B) |
|
|
385
|
+
| Others | Kimi K2, Moonlight-16B |
|
|
386
|
+
|
|
387
|
+
## Resources
|
|
388
|
+
|
|
389
|
+
- Documentation: https://thudm.github.io/slime/
|
|
390
|
+
- GitHub: https://github.com/THUDM/slime
|
|
391
|
+
- Blog: https://lmsys.org/blog/2025-07-09-slime/
|
|
392
|
+
- Examples: `examples/` directory (14+ worked examples)
|