@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
@@ -0,0 +1,422 @@
---
name: langsmith-observability
description: LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.
version: 1.0.0
author: Synthetic Sciences
license: MIT
tags: [Observability, LangSmith, Tracing, Evaluation, Monitoring, Debugging, Testing, LLM Ops, Production]
dependencies: [langsmith>=0.2.0]
---

# LangSmith - LLM Observability Platform

LangSmith is a development platform for debugging, evaluating, and monitoring language models and AI applications.

## When to use LangSmith

**Use LangSmith when:**
- Debugging LLM application issues (prompts, chains, agents)
- Evaluating model outputs systematically against datasets
- Monitoring production LLM systems
- Building regression tests for AI features
- Analyzing latency, token usage, and costs
- Collaborating on prompt engineering

**Key features:**
- **Tracing**: Capture inputs, outputs, and latency for all LLM calls
- **Evaluation**: Systematic testing with built-in and custom evaluators
- **Datasets**: Create test sets from production traces or manually
- **Monitoring**: Track metrics, errors, and costs in production
- **Integrations**: Works with OpenAI, Anthropic, LangChain, LlamaIndex

**Use alternatives instead:**
- **Weights & Biases**: Deep learning experiment tracking, model training
- **MLflow**: General ML lifecycle, model registry focus
- **Arize/WhyLabs**: ML monitoring, data drift detection

## Quick start

### Installation

```bash
pip install langsmith

# Set environment variables
export LANGSMITH_API_KEY="your-api-key"
export LANGSMITH_TRACING=true
```

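Before running traced code, it can help to confirm that the two variables from the installation step are actually set. The following is a minimal sketch: `missing_langsmith_env` is a hypothetical helper, not part of the LangSmith SDK, and it only checks the two variable names shown in this section.

```python
import os

# Variable names taken from the installation step
REQUIRED_VARS = ("LANGSMITH_API_KEY", "LANGSMITH_TRACING")

def missing_langsmith_env(env=None) -> list:
    """Hypothetical helper: return required variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]

# Example with an explicit mapping instead of the real environment
print(missing_langsmith_env({"LANGSMITH_API_KEY": "your-api-key"}))
# prints ['LANGSMITH_TRACING']
```

Calling `missing_langsmith_env()` with no argument inspects the real process environment.
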
### Basic tracing with @traceable

```python
from langsmith import traceable
from openai import OpenAI

client = OpenAI()

@traceable
def generate_response(prompt: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

# Automatically traced to LangSmith
result = generate_response("What is machine learning?")
```

### OpenAI wrapper (automatic tracing)

```python
from langsmith.wrappers import wrap_openai
from openai import OpenAI

# Wrap client for automatic tracing
client = wrap_openai(OpenAI())

# All calls automatically traced
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)
```

## Core concepts

### Runs and traces

A **run** is a single execution unit (LLM call, chain, tool). Runs nest into a hierarchical **trace** that shows the full execution flow of one request.

```python
from langsmith import traceable

# vector_store and llm are placeholders for your own retriever and model client

@traceable(run_type="chain")
def process_query(query: str) -> str:
    # Parent run
    context = retrieve_context(query)  # Child run
    response = generate_answer(query, context)  # Child run
    return response

@traceable(run_type="retriever")
def retrieve_context(query: str) -> list:
    return vector_store.search(query)

@traceable(run_type="llm")
def generate_answer(query: str, context: list) -> str:
    return llm.invoke(f"Context: {context}\n\nQuestion: {query}")
```

### Projects

Projects group related runs. Set the project through an environment variable or in code:

```python
import os
from langsmith import traceable

os.environ["LANGSMITH_PROJECT"] = "my-project"

# Or per-function
@traceable(project_name="my-project")
def my_function():
    pass
```

## Client API

```python
from langsmith import Client

client = Client()

# List runs
runs = list(client.list_runs(
    project_name="my-project",
    filter='eq(status, "success")',
    limit=100
))

# Get run details
run = client.read_run(run_id="...")

# Create feedback
client.create_feedback(
    run_id="...",
    key="correctness",
    score=0.9,
    comment="Good answer"
)
```

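Once runs have been listed, a common next step is a quick summary of latency, errors, and token usage. The sketch below assumes each run object exposes `start_time`, `end_time`, `error`, and `total_tokens` attributes; `summarize_runs` is a hypothetical helper, and the `SimpleNamespace` objects are stand-ins so the example runs without network access.

```python
from datetime import datetime, timedelta
from types import SimpleNamespace

def summarize_runs(runs) -> dict:
    """Hypothetical helper: error rate, mean latency, and token usage over finished runs."""
    runs = [r for r in runs if r.end_time is not None]  # skip runs still in progress
    if not runs:
        return {"count": 0, "error_rate": 0.0, "mean_latency_s": 0.0, "total_tokens": 0}
    latencies = [(r.end_time - r.start_time).total_seconds() for r in runs]
    errors = sum(1 for r in runs if r.error)
    return {
        "count": len(runs),
        "error_rate": errors / len(runs),
        "mean_latency_s": sum(latencies) / len(runs),
        "total_tokens": sum(r.total_tokens or 0 for r in runs),
    }

# Stand-in objects (assumed attribute names) instead of real API results
t0 = datetime(2024, 1, 1, 12, 0, 0)
fake_runs = [
    SimpleNamespace(start_time=t0, end_time=t0 + timedelta(seconds=2), error=None, total_tokens=120),
    SimpleNamespace(start_time=t0, end_time=t0 + timedelta(seconds=4), error="Timeout", total_tokens=None),
]
summary = summarize_runs(fake_runs)
print(summary)
# prints {'count': 2, 'error_rate': 0.5, 'mean_latency_s': 3.0, 'total_tokens': 120}
```

With real data, the list returned by `client.list_runs(...)` could be passed in place of `fake_runs`, provided the attribute names match.
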
## Datasets and evaluation

### Create dataset

```python
from langsmith import Client

client = Client()

# Create dataset
dataset = client.create_dataset("qa-test-set", description="QA evaluation")

# Add examples
client.create_examples(
    inputs=[
        {"question": "What is Python?"},
        {"question": "What is ML?"}
    ],
    outputs=[
        {"answer": "A programming language"},
        {"answer": "Machine learning"}
    ],
    dataset_id=dataset.id
)
```

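Test data often starts out as question/answer pairs rather than as two parallel lists. The sketch below shows one way to split such pairs into the `inputs` and `outputs` shape used in this section; `split_examples` is a hypothetical helper and the key names are assumptions matching the example dataset.

```python
def split_examples(pairs, input_key="question", output_key="answer"):
    """Hypothetical helper: turn (question, answer) tuples into parallel lists of dicts."""
    inputs = [{input_key: q} for q, _ in pairs]
    outputs = [{output_key: a} for _, a in pairs]
    return inputs, outputs

qa_pairs = [
    ("What is Python?", "A programming language"),
    ("What is ML?", "Machine learning"),
]
inputs, outputs = split_examples(qa_pairs)
print(inputs)
print(outputs)
```

The two lists can then be passed as `client.create_examples(inputs=inputs, outputs=outputs, dataset_id=dataset.id)`.
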
### Run evaluation

```python
from langsmith import evaluate

def my_model(inputs: dict) -> dict:
    # Your model logic; generate_response is the traced function from the quick start
    return {"answer": generate_response(inputs["question"])}

def correctness_evaluator(run, example):
    prediction = run.outputs["answer"]
    reference = example.outputs["answer"]
    score = 1.0 if reference.lower() in prediction.lower() else 0.0
    return {"key": "correctness", "score": score}

results = evaluate(
    my_model,
    data="qa-test-set",
    evaluators=[correctness_evaluator],
    experiment_prefix="v1"
)

# Each result row holds the run, the example, and its evaluation results;
# average the per-example "correctness" scores
scores = [
    r.score
    for row in results
    for r in row["evaluation_results"]["results"]
    if r.key == "correctness" and r.score is not None
]
if scores:
    print(f"Average score: {sum(scores) / len(scores):.2f}")
```

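A custom evaluator is an ordinary function, so it can be checked locally before it is used in an experiment. The sketch below is a stricter variant of the substring check in this section: it normalizes both strings and requires an exact match. `normalize` and `exact_match_evaluator` are hypothetical names, and the `SimpleNamespace` objects stand in for the `run` and `example` arguments, assuming only that each has an `outputs` dict.

```python
import string
from types import SimpleNamespace

def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return " ".join(text.lower().translate(table).split())

def exact_match_evaluator(run, example):
    """Hypothetical evaluator: 1.0 when normalized prediction equals normalized reference."""
    prediction = normalize(run.outputs["answer"])
    reference = normalize(example.outputs["answer"])
    return {"key": "exact_match", "score": 1.0 if prediction == reference else 0.0}

# Stand-ins for the run and example objects (assumed shape: an `outputs` dict)
fake_run = SimpleNamespace(outputs={"answer": "  Machine   learning. "})
fake_example = SimpleNamespace(outputs={"answer": "machine learning"})
result = exact_match_evaluator(fake_run, fake_example)
print(result)
# prints {'key': 'exact_match', 'score': 1.0}
```

Exact match is stricter than substring containment, so it suits short factual answers better than free-form text.
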
### Built-in evaluators

```python
from langsmith import evaluate
from langsmith.evaluation import LangChainStringEvaluator

# Use LangChain evaluators (requires the langchain package);
# my_model is the target function from the previous example
results = evaluate(
    my_model,
    data="qa-test-set",
    evaluators=[
        LangChainStringEvaluator("qa"),
        LangChainStringEvaluator("cot_qa")
    ]
)
```

## Advanced tracing
|
|
218
|
+
|
|
219
|
+
### Tracing context
|
|
220
|
+
|
|
221
|
+
```python
|
|
222
|
+
from langsmith import tracing_context
|
|
223
|
+
|
|
224
|
+
with tracing_context(
|
|
225
|
+
project_name="experiment-1",
|
|
226
|
+
tags=["production", "v2"],
|
|
227
|
+
metadata={"version": "2.0"}
|
|
228
|
+
):
|
|
229
|
+
# All traceable calls inherit context
|
|
230
|
+
result = my_function()
|
|
231
|
+
```

### Manual runs

```python
from langsmith import trace

with trace(
    name="custom_operation",
    run_type="tool",
    inputs={"query": "test"}
) as run:
    result = do_something()
    run.end(outputs={"result": result})
```

### Process inputs/outputs

```python
from langsmith import traceable

def sanitize_inputs(inputs: dict) -> dict:
    # Work on a copy so the original dict is left untouched
    inputs = dict(inputs)
    if "password" in inputs:
        inputs["password"] = "***"
    return inputs

@traceable(process_inputs=sanitize_inputs)
def login(username: str, password: str):
    return authenticate(username, password)
```
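
The sanitizer above only covers a top-level `password` key. When secrets may sit inside nested dictionaries or lists, a recursive variant is one option. The sketch below is an assumption-laden illustration: `redact` and the `SENSITIVE_KEYS` set are hypothetical names, and the key list should be adapted to the application.

```python
from typing import Any

# Assumed key names; adjust to whatever the application treats as secret.
SENSITIVE_KEYS = {"password", "api_key", "token"}


def redact(value: Any) -> Any:
    """Return a copy of value with sensitive dict entries masked, at any depth."""
    if isinstance(value, dict):
        return {
            key: "***"
            if isinstance(key, str) and key.lower() in SENSITIVE_KEYS
            else redact(item)
            for key, item in value.items()
        }
    if isinstance(value, list):
        return [redact(item) for item in value]
    return value


payload = {
    "username": "ann",
    "password": "hunter2",
    "session": {"token": "abc123", "scopes": ["read", "write"]},
}
print(redact(payload))
```

Since `redact` takes a dict and returns a new dict, it should be usable wherever `sanitize_inputs` is used, for example as the `process_inputs` argument.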

### Sampling

```python
import os
os.environ["LANGSMITH_TRACING_SAMPLING_RATE"] = "0.1"  # 10% sampling
```
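
The sampling rate is read from the environment as a string, so a typo or an out-of-range value is easy to miss. Below is a minimal sketch of a guard; `set_sampling_rate` is a hypothetical helper, and it assumes the variable is set before any LangSmith client is created.

```python
import os


def set_sampling_rate(rate: float) -> str:
    """Validate a sampling rate in [0, 1] and export it for the tracer to read."""
    if not 0.0 <= rate <= 1.0:
        raise ValueError(f"sampling rate must be between 0 and 1, got {rate}")
    value = str(rate)
    os.environ["LANGSMITH_TRACING_SAMPLING_RATE"] = value
    return value


set_sampling_rate(0.1)  # keep roughly 1 in 10 traces
print(os.environ["LANGSMITH_TRACING_SAMPLING_RATE"])
```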

## LangChain integration

```python
from langchain_openai import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate

# Tracing enabled automatically with LANGSMITH_TRACING=true
llm = ChatOpenAI(model="gpt-4o")
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant."),
    ("user", "{input}")
])

chain = prompt | llm

# All chain runs traced automatically
response = chain.invoke({"input": "Hello!"})
```

## Production monitoring

### Hub prompts

```python
from langsmith import Client

client = Client()

# Pull prompt from hub
prompt = client.pull_prompt("my-org/qa-prompt")

# Use in application
result = prompt.invoke({"question": "What is AI?"})
```

### Async client

```python
from langsmith import AsyncClient

async def main():
    client = AsyncClient()

    runs = []
    async for run in client.list_runs(project_name="my-project"):
        runs.append(run)

    return runs
```

### Feedback collection

```python
from typing import Optional

from langsmith import Client

client = Client()

# Collect user feedback
def record_feedback(run_id: str, user_rating: int, comment: Optional[str] = None):
    client.create_feedback(
        run_id=run_id,
        key="user_rating",
        score=user_rating / 5.0,  # Map a 1-5 rating to 0.2-1.0
        comment=comment
    )

# In your application
record_feedback(run_id="...", user_rating=4, comment="Helpful response")
```
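
Dividing a 1-5 rating by 5 maps the lowest rating to 0.2 rather than 0. If a score spanning the full 0-1 range is preferred, a min-max rescaling is one alternative. The sketch below is an illustration only; `normalize_rating` is a hypothetical helper, not an SDK function.

```python
def normalize_rating(rating: int, low: int = 1, high: int = 5) -> float:
    """Rescale a rating from [low, high] to [0.0, 1.0], rejecting out-of-range values."""
    if not low <= rating <= high:
        raise ValueError(f"rating must be between {low} and {high}, got {rating}")
    return (rating - low) / (high - low)


for stars in (1, 3, 5):
    print(stars, "->", normalize_rating(stars))
```

Whichever mapping is chosen, using it consistently matters more than the mapping itself, since scores recorded under different scales cannot be averaged meaningfully.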

## Testing integration

### Pytest integration

```python
from langsmith import test

@test
def test_qa_accuracy():
    result = my_qa_function("What is Python?")
    assert "programming" in result.lower()
```

### Evaluation in CI/CD

```python
from langsmith import evaluate

def run_evaluation():
    results = evaluate(
        my_model,
        data="regression-test-set",
        evaluators=[accuracy_evaluator]
    )

    # Average the per-example "accuracy" scores from the result rows
    scores = [
        r.score
        for row in results
        for r in row["evaluation_results"]["results"]
        if r.key == "accuracy"
    ]
    accuracy = sum(scores) / len(scores)

    # Fail CI if accuracy drops
    assert accuracy >= 0.9, f"Accuracy {accuracy} below threshold"
```
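
The threshold check itself can be kept separate from the evaluation call so that it is testable without network access. This is a minimal sketch under that assumption; `check_threshold` is a hypothetical helper that takes a plain list of per-example scores.

```python
from statistics import mean


def check_threshold(scores: list[float], threshold: float = 0.9) -> float:
    """Return the mean score, raising if it is below threshold or if no scores exist."""
    if not scores:
        raise ValueError("no scores collected; refusing to pass an empty evaluation")
    average = mean(scores)
    if average < threshold:
        raise AssertionError(f"Accuracy {average:.3f} below threshold {threshold}")
    return average


print(check_threshold([1.0, 1.0, 1.0, 0.0, 1.0], threshold=0.8))
```

Treating an empty score list as an error is a deliberate choice here: a run that evaluated nothing should not count as a passing build.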

## Best practices

1. **Structured naming** - Use consistent project/run naming conventions
2. **Add metadata** - Include version, environment, user info
3. **Sample in production** - Use sampling rate to control volume
4. **Create datasets** - Build test sets from interesting production cases
5. **Automate evaluation** - Run evaluations in CI/CD pipelines
6. **Monitor costs** - Track token usage and latency trends
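
For the metadata practice, building the dictionary in one place helps keep keys consistent across traces. The following is a minimal sketch; `build_trace_metadata` and the `APP_ENV` variable are hypothetical names chosen for illustration.

```python
import os
from typing import Optional


def build_trace_metadata(version: str, user_id: Optional[str] = None) -> dict[str, str]:
    """Assemble a consistent metadata dict: version, environment, and optional user."""
    metadata = {
        "version": version,
        # APP_ENV is an assumed variable name; falls back to "development".
        "environment": os.environ.get("APP_ENV", "development"),
    }
    if user_id is not None:
        metadata["user_id"] = user_id
    return metadata


print(build_trace_metadata("2.0", user_id="user-123"))
```

The resulting dict could then be passed wherever a `metadata` argument is accepted, such as the `tracing_context` call shown earlier.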

## Common issues

**Traces not appearing:**
```python
import os
# Ensure tracing is enabled
os.environ["LANGSMITH_TRACING"] = "true"
os.environ["LANGSMITH_API_KEY"] = "your-key"

# Verify connection
from langsmith import Client
client = Client()
# list_projects() is lazy; fetch one item to force a real request
print(next(iter(client.list_projects()), None))  # Should not raise
```

**High latency from tracing:**
```python
import os

# Enable background batching (default)
from langsmith import Client
client = Client(auto_batch_tracing=True)

# Or use sampling
os.environ["LANGSMITH_TRACING_SAMPLING_RATE"] = "0.1"
```

**Large payloads:**
```python
from langsmith import traceable

# Hide sensitive/large fields
@traceable(
    process_inputs=lambda x: {k: v for k, v in x.items() if k != "large_field"}
)
def my_function(data):
    pass
```
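
Dropping a field entirely loses all of its content. When a preview of the value is still useful, truncating long strings is an alternative. The sketch below illustrates that idea; `truncate_large_fields` and the 200-character limit are assumptions, not SDK defaults.

```python
from typing import Any

MAX_CHARS = 200  # assumed limit; tune to the payload sizes you see


def truncate_large_fields(
    inputs: dict[str, Any], max_chars: int = MAX_CHARS
) -> dict[str, Any]:
    """Return a copy of inputs with long string values cut to max_chars plus a marker."""
    trimmed: dict[str, Any] = {}
    for key, value in inputs.items():
        if isinstance(value, str) and len(value) > max_chars:
            dropped = len(value) - max_chars
            trimmed[key] = value[:max_chars] + f"... [{dropped} chars truncated]"
        else:
            trimmed[key] = value
    return trimmed


sample = {"query": "short text", "document": "x" * 1000}
print(truncate_large_fields(sample))
```

A function of this shape takes a dict and returns a dict, so it should fit the same `process_inputs` slot as the lambda above.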

## References

- **[Advanced Usage](references/advanced-usage.md)** - Custom evaluators, distributed tracing, hub prompts
- **[Troubleshooting](references/troubleshooting.md)** - Common issues, debugging, performance

## Resources

- **Documentation**: https://docs.smith.langchain.com
- **Python SDK**: https://github.com/langchain-ai/langsmith-sdk
- **Web App**: https://smith.langchain.com
- **Version**: 0.2.0+
- **License**: MIT