@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
|
@@ -0,0 +1,287 @@
|
|
|
1
|
+
# Hybrid Engine Architecture
|
|
2
|
+
|
|
3
|
+
Complete guide to OpenRLHF's GPU resource sharing system for maximizing utilization during RLHF training.
|
|
4
|
+
|
|
5
|
+
## Overview
|
|
6
|
+
|
|
7
|
+
The Hybrid Engine allows Actor, Critic, Reward, Reference models and vLLM engines to share GPU resources, minimizing idle time and maximizing GPU utilization through dynamic sleep/wake cycles.
|
|
8
|
+
|
|
9
|
+
## Architecture
|
|
10
|
+
|
|
11
|
+
### Core Components
|
|
12
|
+
|
|
13
|
+
**Enable Hybrid Engine**:
|
|
14
|
+
```bash
|
|
15
|
+
--colocate_all_models # Enable GPU sharing across all models
|
|
16
|
+
```
|
|
17
|
+
|
|
18
|
+
**Components that share GPUs**:
|
|
19
|
+
1. **Actor Model** - Policy being trained
|
|
20
|
+
2. **Critic Model** - Value function for PPO
|
|
21
|
+
3. **Reward Model** - Scores completions
|
|
22
|
+
4. **Reference Model** - KL penalty baseline
|
|
23
|
+
5. **vLLM Engines** - Fast inference generation
|
|
24
|
+
|
|
25
|
+
### GPU Allocation Strategy
|
|
26
|
+
|
|
27
|
+
**Optimal ratio** (vLLM : Actor : Critic = 1:1:1):
|
|
28
|
+
```bash
|
|
29
|
+
# 70B model on 48× A100 GPUs
|
|
30
|
+
--vllm_num_engines 4 # 16 GPUs total
|
|
31
|
+
--vllm_tensor_parallel_size 4 # 4 GPUs per engine
|
|
32
|
+
--actor_num_nodes 1 # 16 GPUs
|
|
33
|
+
--actor_num_gpus_per_node 16
|
|
34
|
+
--critic_num_nodes 1 # 16 GPUs
|
|
35
|
+
--critic_num_gpus_per_node 16
|
|
36
|
+
```
|
|
37
|
+
|
|
38
|
+
**Constraint**: `actor_num_nodes * actor_num_gpus_per_node == vllm_num_engines * vllm_tensor_parallel_size`
|
|
39
|
+
|
|
40
|
+
## vLLM Sleep Mode
|
|
41
|
+
|
|
42
|
+
### How It Works
|
|
43
|
+
|
|
44
|
+
**Enable vLLM sleep**:
|
|
45
|
+
```bash
|
|
46
|
+
--vllm_enable_sleep
|
|
47
|
+
```
|
|
48
|
+
|
|
49
|
+
**Sleep/wake cycle**:
|
|
50
|
+
1. **Wake up** before generation: Load vLLM engines to GPU
|
|
51
|
+
2. **Generate** samples: vLLM performs inference
|
|
52
|
+
3. **Sleep** after generation: Offload vLLM engines to CPU
|
|
53
|
+
|
|
54
|
+
**Implementation**:
|
|
55
|
+
```python
|
|
56
|
+
# In SamplesGenerator.generate_samples()
|
|
57
|
+
batch_vllm_engine_call(self.vllm_engines, "wake_up") # GPU ← CPU
|
|
58
|
+
# ... generate samples ...
|
|
59
|
+
batch_vllm_engine_call(self.vllm_engines, "sleep") # CPU ← GPU
|
|
60
|
+
```
|
|
61
|
+
|
|
62
|
+
**When used**:
|
|
63
|
+
- Sample generation during PPO rollout
|
|
64
|
+
- Initial weight sync from actor to vLLM
|
|
65
|
+
- Evaluation phase
|
|
66
|
+
|
|
67
|
+
### Memory Management
|
|
68
|
+
|
|
69
|
+
**Control GPU memory**:
|
|
70
|
+
```bash
|
|
71
|
+
--vllm_gpu_memory_utilization 0.5 # Use 50% of GPU for vLLM
|
|
72
|
+
```
|
|
73
|
+
|
|
74
|
+
**Example**:
|
|
75
|
+
- A100 80GB × 0.5 = 40GB for vLLM
|
|
76
|
+
- Remaining 40GB for other models when colocated
|
|
77
|
+
|
|
78
|
+
## DeepSpeed Sleep Mode
|
|
79
|
+
|
|
80
|
+
### How It Works
|
|
81
|
+
|
|
82
|
+
**Enable DeepSpeed sleep**:
|
|
83
|
+
```bash
|
|
84
|
+
--deepspeed_enable_sleep
|
|
85
|
+
```
|
|
86
|
+
|
|
87
|
+
**Sleep/wake cycle**:
|
|
88
|
+
1. **Reload states** before training: Move model CPU → GPU
|
|
89
|
+
2. **Train** model: DeepSpeed performs optimization
|
|
90
|
+
3. **Offload states** after training: Move model GPU → CPU
|
|
91
|
+
|
|
92
|
+
**Implementation**:
|
|
93
|
+
```python
|
|
94
|
+
# In PPOTrainer.ppo_train()
|
|
95
|
+
# For actor model
|
|
96
|
+
self.actor.reload_states() # GPU ← CPU
|
|
97
|
+
# ... training loop ...
|
|
98
|
+
self.actor.offload_states() # CPU ← GPU
|
|
99
|
+
|
|
100
|
+
# For critic model
|
|
101
|
+
self.critic.reload_states() # GPU ← CPU
|
|
102
|
+
# ... training loop ...
|
|
103
|
+
self.critic.offload_states() # CPU ← GPU
|
|
104
|
+
```
|
|
105
|
+
|
|
106
|
+
**Synchronization**:
|
|
107
|
+
- Ray barriers ensure models don't reload simultaneously
|
|
108
|
+
- Prevents OOM from concurrent GPU memory usage
|
|
109
|
+
|
|
110
|
+
### Initial Offload
|
|
111
|
+
|
|
112
|
+
**Actor offload** (after initialization):
|
|
113
|
+
```python
|
|
114
|
+
if args.deepspeed_enable_sleep:
|
|
115
|
+
self.actor.offload_states() # Start in CPU
|
|
116
|
+
```
|
|
117
|
+
|
|
118
|
+
## OOM Prevention Strategies
|
|
119
|
+
|
|
120
|
+
### 1. Memory Utilization Control
|
|
121
|
+
|
|
122
|
+
**Limit vLLM memory**:
|
|
123
|
+
```bash
|
|
124
|
+
--vllm_gpu_memory_utilization 0.5 # Conservative
|
|
125
|
+
--vllm_gpu_memory_utilization 0.7 # Aggressive
|
|
126
|
+
```
|
|
127
|
+
|
|
128
|
+
### 2. Ray Barriers for Synchronization
|
|
129
|
+
|
|
130
|
+
**Prevent simultaneous loading**:
|
|
131
|
+
- vLLM wakes → generates → sleeps
|
|
132
|
+
- Then DeepSpeed reloads → trains → offloads
|
|
133
|
+
- Never both in GPU memory simultaneously
|
|
134
|
+
|
|
135
|
+
### 3. Disable Colocation for Large Models
|
|
136
|
+
|
|
137
|
+
**If OOM occurs**:
|
|
138
|
+
```bash
|
|
139
|
+
# Remove --colocate_all_models
|
|
140
|
+
# Allocate separate GPUs for each model
|
|
141
|
+
--actor_num_nodes 1 --actor_num_gpus_per_node 16
|
|
142
|
+
--critic_num_nodes 1 --critic_num_gpus_per_node 16
|
|
143
|
+
--reward_num_nodes 1 --reward_num_gpus_per_node 16
|
|
144
|
+
--ref_num_nodes 1 --ref_num_gpus_per_node 16
|
|
145
|
+
```
|
|
146
|
+
|
|
147
|
+
### 4. ZeRO-3 Sharding
|
|
148
|
+
|
|
149
|
+
**Memory efficiency**:
|
|
150
|
+
```bash
|
|
151
|
+
--zero_stage 3 # Shard parameters, gradients, optimizer states
|
|
152
|
+
```
|
|
153
|
+
|
|
154
|
+
Combined with Hybrid Engine for maximum efficiency.
|
|
155
|
+
|
|
156
|
+
## Complete Example (70B Model)
|
|
157
|
+
|
|
158
|
+
### With Hybrid Engine (48 GPUs)
|
|
159
|
+
|
|
160
|
+
```bash
|
|
161
|
+
ray job submit --address="http://127.0.0.1:8265" \
|
|
162
|
+
-- python3 -m openrlhf.cli.train_ppo_ray \
|
|
163
|
+
--colocate_all_models \
|
|
164
|
+
--vllm_enable_sleep \
|
|
165
|
+
--deepspeed_enable_sleep \
|
|
166
|
+
--vllm_num_engines 4 \
|
|
167
|
+
--vllm_tensor_parallel_size 4 \
|
|
168
|
+
--vllm_gpu_memory_utilization 0.5 \
|
|
169
|
+
--actor_num_nodes 1 --actor_num_gpus_per_node 16 \
|
|
170
|
+
--critic_num_nodes 1 --critic_num_gpus_per_node 16 \
|
|
171
|
+
--reward_num_nodes 1 --reward_num_gpus_per_node 8 \
|
|
172
|
+
--ref_num_nodes 1 --ref_num_gpus_per_node 8 \
|
|
173
|
+
--pretrain meta-llama/Llama-2-70b-hf \
|
|
174
|
+
--reward_pretrain ./reward-model-70b \
|
|
175
|
+
--zero_stage 3 --bf16
|
|
176
|
+
```
|
|
177
|
+
|
|
178
|
+
**GPU allocation**:
|
|
179
|
+
- vLLM: 4 engines × 4 GPUs = 16 GPUs
|
|
180
|
+
- Actor: 16 GPUs (shares with vLLM via sleep)
|
|
181
|
+
- Critic: 16 GPUs
|
|
182
|
+
- Reward: 8 GPUs
|
|
183
|
+
- Reference: 8 GPUs
|
|
184
|
+
- **Total**: 48 GPUs (16 shared efficiently)
|
|
185
|
+
|
|
186
|
+
### Without Hybrid Engine (64 GPUs)
|
|
187
|
+
|
|
188
|
+
```bash
|
|
189
|
+
ray job submit --address="http://127.0.0.1:8265" \
|
|
190
|
+
-- python3 -m openrlhf.cli.train_ppo_ray \
|
|
191
|
+
--vllm_num_engines 4 \
|
|
192
|
+
--vllm_tensor_parallel_size 4 \
|
|
193
|
+
--actor_num_nodes 1 --actor_num_gpus_per_node 16 \
|
|
194
|
+
--critic_num_nodes 1 --critic_num_gpus_per_node 16 \
|
|
195
|
+
--reward_num_nodes 1 --reward_num_gpus_per_node 16 \
|
|
196
|
+
--ref_num_nodes 1 --ref_num_gpus_per_node 16 \
|
|
197
|
+
--pretrain meta-llama/Llama-2-70b-hf \
|
|
198
|
+
--zero_stage 3 --bf16
|
|
199
|
+
```
|
|
200
|
+
|
|
201
|
+
**GPU allocation**:
|
|
202
|
+
- vLLM: 16 GPUs (dedicated)
|
|
203
|
+
- Actor: 16 GPUs (dedicated)
|
|
204
|
+
- Critic: 16 GPUs (dedicated)
|
|
205
|
+
- Reward: 16 GPUs (dedicated)
|
|
206
|
+
- **Total**: 64 GPUs (no sharing)
|
|
207
|
+
|
|
208
|
+
**Savings**: Hybrid Engine saves 25% GPUs (48 vs 64)
|
|
209
|
+
|
|
210
|
+
## Ray Placement Groups
|
|
211
|
+
|
|
212
|
+
### Automatic Creation
|
|
213
|
+
|
|
214
|
+
**When `--colocate_all_models` is enabled**:
|
|
215
|
+
```python
|
|
216
|
+
# Placement group created for GPU sharing
|
|
217
|
+
placement_group = {
|
|
218
|
+
"bundle": [{"GPU": actor_num_gpus_per_node}], # Shared GPUs
|
|
219
|
+
"strategy": "PACK" # Colocate on same nodes
|
|
220
|
+
}
|
|
221
|
+
```
|
|
222
|
+
|
|
223
|
+
**Resource constraints**:
|
|
224
|
+
- vLLM engines scheduled on actor node GPUs
|
|
225
|
+
- DeepSpeed models scheduled on same GPUs
|
|
226
|
+
- Ray ensures proper scheduling
|
|
227
|
+
|
|
228
|
+
## Performance Benefits
|
|
229
|
+
|
|
230
|
+
**GPU utilization**:
|
|
231
|
+
- **Without Hybrid**: ~60-70% (idle during generation or training)
|
|
232
|
+
- **With Hybrid**: ~90-95% (constant utilization)
|
|
233
|
+
|
|
234
|
+
**Cost savings**:
|
|
235
|
+
- 25-33% fewer GPUs needed
|
|
236
|
+
- Same throughput with Hybrid Engine
|
|
237
|
+
|
|
238
|
+
**Stability**:
|
|
239
|
+
- More stable than async training
|
|
240
|
+
- Ray barriers prevent race conditions
|
|
241
|
+
|
|
242
|
+
## Troubleshooting
|
|
243
|
+
|
|
244
|
+
### OOM During Sleep/Wake
|
|
245
|
+
|
|
246
|
+
**Symptom**: OOM when model wakes up
|
|
247
|
+
|
|
248
|
+
**Solution 1** - Lower vLLM memory:
|
|
249
|
+
```bash
|
|
250
|
+
--vllm_gpu_memory_utilization 0.4 # Reduce from 0.5
|
|
251
|
+
```
|
|
252
|
+
|
|
253
|
+
**Solution 2** - Disable colocation:
|
|
254
|
+
```bash
|
|
255
|
+
# Remove --colocate_all_models
|
|
256
|
+
```
|
|
257
|
+
|
|
258
|
+
### DeepSpeed GPU Index Error
|
|
259
|
+
|
|
260
|
+
**Symptom**: `RuntimeError: Index out of range`
|
|
261
|
+
|
|
262
|
+
**Solution**:
|
|
263
|
+
```bash
|
|
264
|
+
export RAY_EXPERIMENTAL_NOSET_CUDA_VISIBLE_DEVICES=1
|
|
265
|
+
```
|
|
266
|
+
|
|
267
|
+
### vLLM Engines Don't Share GPUs
|
|
268
|
+
|
|
269
|
+
**Symptom**: vLLM uses separate GPUs despite `--colocate_all_models`
|
|
270
|
+
|
|
271
|
+
**Check constraint**:
|
|
272
|
+
```bash
|
|
273
|
+
# This must be true:
|
|
274
|
+
actor_num_nodes * actor_num_gpus_per_node == vllm_num_engines * vllm_tensor_parallel_size
|
|
275
|
+
|
|
276
|
+
# Example (valid):
|
|
277
|
+
# Actor: 1 node × 16 GPUs = 16
|
|
278
|
+
# vLLM: 4 engines × 4 TP = 16
|
|
279
|
+
# ✓ Equal
|
|
280
|
+
```
|
|
281
|
+
|
|
282
|
+
## References
|
|
283
|
+
|
|
284
|
+
- OpenRLHF: https://github.com/OpenRLHF/OpenRLHF
|
|
285
|
+
- Ray: https://docs.ray.io/en/latest/ray-core/scheduling/placement-group.html
|
|
286
|
+
- vLLM: https://docs.vllm.ai/
|
|
287
|
+
- DeepSpeed ZeRO: https://www.deepspeed.ai/tutorials/zero/
|