@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
|
@@ -0,0 +1,493 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: qdrant-vector-search
|
|
3
|
+
description: High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.
|
|
4
|
+
version: 1.0.0
|
|
5
|
+
author: Synthetic Sciences
|
|
6
|
+
license: MIT
|
|
7
|
+
tags: [RAG, Vector Search, Qdrant, Semantic Search, Embeddings, Similarity Search, HNSW, Production, Distributed]
|
|
8
|
+
dependencies: [qdrant-client>=1.12.0]
|
|
9
|
+
---
|
|
10
|
+
|
|
11
|
+
# Qdrant - Vector Similarity Search Engine
|
|
12
|
+
|
|
13
|
+
High-performance vector database written in Rust for production RAG and semantic search.
|
|
14
|
+
|
|
15
|
+
## When to use Qdrant
|
|
16
|
+
|
|
17
|
+
**Use Qdrant when:**
|
|
18
|
+
- Building production RAG systems requiring low latency
|
|
19
|
+
- Need hybrid search (vectors + metadata filtering)
|
|
20
|
+
- Require horizontal scaling with sharding/replication
|
|
21
|
+
- Want on-premise deployment with full data control
|
|
22
|
+
- Need multi-vector storage per record (dense + sparse)
|
|
23
|
+
- Building real-time recommendation systems
|
|
24
|
+
|
|
25
|
+
**Key features:**
|
|
26
|
+
- **Rust-powered**: Memory-safe, high performance
|
|
27
|
+
- **Rich filtering**: Filter by any payload field during search
|
|
28
|
+
- **Multiple vectors**: Dense, sparse, multi-dense per point
|
|
29
|
+
- **Quantization**: Scalar, product, binary for memory efficiency
|
|
30
|
+
- **Distributed**: Raft consensus, sharding, replication
|
|
31
|
+
- **REST + gRPC**: Both APIs with full feature parity
|
|
32
|
+
|
|
33
|
+
**Use alternatives instead:**
|
|
34
|
+
- **Chroma**: Simpler setup, embedded use cases
|
|
35
|
+
- **FAISS**: Maximum raw speed, research/batch processing
|
|
36
|
+
- **Pinecone**: Fully managed, zero ops preferred
|
|
37
|
+
- **Weaviate**: GraphQL preference, built-in vectorizers
|
|
38
|
+
|
|
39
|
+
## Quick start
|
|
40
|
+
|
|
41
|
+
### Installation
|
|
42
|
+
|
|
43
|
+
```bash
|
|
44
|
+
# Python client
|
|
45
|
+
pip install qdrant-client
|
|
46
|
+
|
|
47
|
+
# Docker (recommended for development)
|
|
48
|
+
docker run -p 6333:6333 -p 6334:6334 qdrant/qdrant
|
|
49
|
+
|
|
50
|
+
# Docker with persistent storage
|
|
51
|
+
docker run -p 6333:6333 -p 6334:6334 \
|
|
52
|
+
-v $(pwd)/qdrant_storage:/qdrant/storage \
|
|
53
|
+
qdrant/qdrant
|
|
54
|
+
```
|
|
55
|
+
|
|
56
|
+
### Basic usage
|
|
57
|
+
|
|
58
|
+
```python
|
|
59
|
+
from qdrant_client import QdrantClient
|
|
60
|
+
from qdrant_client.models import Distance, VectorParams, PointStruct
|
|
61
|
+
|
|
62
|
+
# Connect to Qdrant
|
|
63
|
+
client = QdrantClient(host="localhost", port=6333)
|
|
64
|
+
|
|
65
|
+
# Create collection
|
|
66
|
+
client.create_collection(
|
|
67
|
+
collection_name="documents",
|
|
68
|
+
vectors_config=VectorParams(size=384, distance=Distance.COSINE)
|
|
69
|
+
)
|
|
70
|
+
|
|
71
|
+
# Insert vectors with payload
|
|
72
|
+
client.upsert(
|
|
73
|
+
collection_name="documents",
|
|
74
|
+
points=[
|
|
75
|
+
PointStruct(
|
|
76
|
+
id=1,
|
|
77
|
+
vector=[0.1, 0.2, ...], # 384-dim vector
|
|
78
|
+
payload={"title": "Doc 1", "category": "tech"}
|
|
79
|
+
),
|
|
80
|
+
PointStruct(
|
|
81
|
+
id=2,
|
|
82
|
+
vector=[0.3, 0.4, ...],
|
|
83
|
+
payload={"title": "Doc 2", "category": "science"}
|
|
84
|
+
)
|
|
85
|
+
]
|
|
86
|
+
)
|
|
87
|
+
|
|
88
|
+
# Search with filtering
|
|
89
|
+
results = client.search(
|
|
90
|
+
collection_name="documents",
|
|
91
|
+
query_vector=[0.15, 0.25, ...],
|
|
92
|
+
query_filter={
|
|
93
|
+
"must": [{"key": "category", "match": {"value": "tech"}}]
|
|
94
|
+
},
|
|
95
|
+
limit=10
|
|
96
|
+
)
|
|
97
|
+
|
|
98
|
+
for point in results:
|
|
99
|
+
print(f"ID: {point.id}, Score: {point.score}, Payload: {point.payload}")
|
|
100
|
+
```
|
|
101
|
+
|
|
102
|
+
## Core concepts
|
|
103
|
+
|
|
104
|
+
### Points - Basic data unit
|
|
105
|
+
|
|
106
|
+
```python
|
|
107
|
+
from qdrant_client.models import PointStruct
|
|
108
|
+
|
|
109
|
+
# Point = ID + Vector(s) + Payload
|
|
110
|
+
point = PointStruct(
|
|
111
|
+
id=123, # Integer or UUID string
|
|
112
|
+
vector=[0.1, 0.2, 0.3, ...], # Dense vector
|
|
113
|
+
payload={ # Arbitrary JSON metadata
|
|
114
|
+
"title": "Document title",
|
|
115
|
+
"category": "tech",
|
|
116
|
+
"timestamp": 1699900000,
|
|
117
|
+
"tags": ["python", "ml"]
|
|
118
|
+
}
|
|
119
|
+
)
|
|
120
|
+
|
|
121
|
+
# Batch upsert (recommended)
|
|
122
|
+
client.upsert(
|
|
123
|
+
collection_name="documents",
|
|
124
|
+
points=[point1, point2, point3],
|
|
125
|
+
wait=True # Wait for indexing
|
|
126
|
+
)
|
|
127
|
+
```
|
|
128
|
+
|
|
129
|
+
### Collections - Vector containers
|
|
130
|
+
|
|
131
|
+
```python
|
|
132
|
+
from qdrant_client.models import VectorParams, Distance, HnswConfigDiff
|
|
133
|
+
|
|
134
|
+
# Create with HNSW configuration
|
|
135
|
+
client.create_collection(
|
|
136
|
+
collection_name="documents",
|
|
137
|
+
vectors_config=VectorParams(
|
|
138
|
+
size=384, # Vector dimensions
|
|
139
|
+
distance=Distance.COSINE # COSINE, EUCLID, DOT, MANHATTAN
|
|
140
|
+
),
|
|
141
|
+
hnsw_config=HnswConfigDiff(
|
|
142
|
+
m=16, # Connections per node (default 16)
|
|
143
|
+
ef_construct=100, # Build-time accuracy (default 100)
|
|
144
|
+
full_scan_threshold=10000 # Switch to brute force below this
|
|
145
|
+
),
|
|
146
|
+
on_disk_payload=True # Store payload on disk
|
|
147
|
+
)
|
|
148
|
+
|
|
149
|
+
# Collection info
|
|
150
|
+
info = client.get_collection("documents")
|
|
151
|
+
print(f"Points: {info.points_count}, Vectors: {info.vectors_count}")
|
|
152
|
+
```
|
|
153
|
+
|
|
154
|
+
### Distance metrics
|
|
155
|
+
|
|
156
|
+
| Metric | Use Case | Range |
|
|
157
|
+
|--------|----------|-------|
|
|
158
|
+
| `COSINE` | Text embeddings, normalized vectors | 0 to 2 |
|
|
159
|
+
| `EUCLID` | Spatial data, image features | 0 to ∞ |
|
|
160
|
+
| `DOT` | Recommendations, unnormalized | -∞ to ∞ |
|
|
161
|
+
| `MANHATTAN` | Sparse features, discrete data | 0 to ∞ |
|
|
162
|
+
|
|
163
|
+
## Search operations
|
|
164
|
+
|
|
165
|
+
### Basic search
|
|
166
|
+
|
|
167
|
+
```python
|
|
168
|
+
# Simple nearest neighbor search
|
|
169
|
+
results = client.search(
|
|
170
|
+
collection_name="documents",
|
|
171
|
+
query_vector=[0.1, 0.2, ...],
|
|
172
|
+
limit=10,
|
|
173
|
+
with_payload=True,
|
|
174
|
+
with_vectors=False # Don't return vectors (faster)
|
|
175
|
+
)
|
|
176
|
+
```
|
|
177
|
+
|
|
178
|
+
### Filtered search
|
|
179
|
+
|
|
180
|
+
```python
|
|
181
|
+
from qdrant_client.models import Filter, FieldCondition, MatchValue, Range
|
|
182
|
+
|
|
183
|
+
# Complex filtering
|
|
184
|
+
results = client.search(
|
|
185
|
+
collection_name="documents",
|
|
186
|
+
query_vector=query_embedding,
|
|
187
|
+
query_filter=Filter(
|
|
188
|
+
must=[
|
|
189
|
+
FieldCondition(key="category", match=MatchValue(value="tech")),
|
|
190
|
+
FieldCondition(key="timestamp", range=Range(gte=1699000000))
|
|
191
|
+
],
|
|
192
|
+
must_not=[
|
|
193
|
+
FieldCondition(key="status", match=MatchValue(value="archived"))
|
|
194
|
+
]
|
|
195
|
+
),
|
|
196
|
+
limit=10
|
|
197
|
+
)
|
|
198
|
+
|
|
199
|
+
# Shorthand filter syntax
|
|
200
|
+
results = client.search(
|
|
201
|
+
collection_name="documents",
|
|
202
|
+
query_vector=query_embedding,
|
|
203
|
+
query_filter={
|
|
204
|
+
"must": [
|
|
205
|
+
{"key": "category", "match": {"value": "tech"}},
|
|
206
|
+
{"key": "price", "range": {"gte": 10, "lte": 100}}
|
|
207
|
+
]
|
|
208
|
+
},
|
|
209
|
+
limit=10
|
|
210
|
+
)
|
|
211
|
+
```
|
|
212
|
+
|
|
213
|
+
### Batch search
|
|
214
|
+
|
|
215
|
+
```python
|
|
216
|
+
from qdrant_client.models import SearchRequest
|
|
217
|
+
|
|
218
|
+
# Multiple queries in one request
|
|
219
|
+
results = client.search_batch(
|
|
220
|
+
collection_name="documents",
|
|
221
|
+
requests=[
|
|
222
|
+
SearchRequest(vector=[0.1, ...], limit=5),
|
|
223
|
+
SearchRequest(vector=[0.2, ...], limit=5, filter={"must": [...]}),
|
|
224
|
+
SearchRequest(vector=[0.3, ...], limit=10)
|
|
225
|
+
]
|
|
226
|
+
)
|
|
227
|
+
```
|
|
228
|
+
|
|
229
|
+
## RAG integration
|
|
230
|
+
|
|
231
|
+
### With sentence-transformers
|
|
232
|
+
|
|
233
|
+
```python
|
|
234
|
+
from sentence_transformers import SentenceTransformer
|
|
235
|
+
from qdrant_client import QdrantClient
|
|
236
|
+
from qdrant_client.models import VectorParams, Distance, PointStruct
|
|
237
|
+
|
|
238
|
+
# Initialize
|
|
239
|
+
encoder = SentenceTransformer("all-MiniLM-L6-v2")
|
|
240
|
+
client = QdrantClient(host="localhost", port=6333)
|
|
241
|
+
|
|
242
|
+
# Create collection
|
|
243
|
+
client.create_collection(
|
|
244
|
+
collection_name="knowledge_base",
|
|
245
|
+
vectors_config=VectorParams(size=384, distance=Distance.COSINE)
|
|
246
|
+
)
|
|
247
|
+
|
|
248
|
+
# Index documents
|
|
249
|
+
documents = [
|
|
250
|
+
{"id": 1, "text": "Python is a programming language", "source": "wiki"},
|
|
251
|
+
{"id": 2, "text": "Machine learning uses algorithms", "source": "textbook"},
|
|
252
|
+
]
|
|
253
|
+
|
|
254
|
+
points = [
|
|
255
|
+
PointStruct(
|
|
256
|
+
id=doc["id"],
|
|
257
|
+
vector=encoder.encode(doc["text"]).tolist(),
|
|
258
|
+
payload={"text": doc["text"], "source": doc["source"]}
|
|
259
|
+
)
|
|
260
|
+
for doc in documents
|
|
261
|
+
]
|
|
262
|
+
client.upsert(collection_name="knowledge_base", points=points)
|
|
263
|
+
|
|
264
|
+
# RAG retrieval
|
|
265
|
+
def retrieve(query: str, top_k: int = 5) -> list[dict]:
|
|
266
|
+
query_vector = encoder.encode(query).tolist()
|
|
267
|
+
results = client.search(
|
|
268
|
+
collection_name="knowledge_base",
|
|
269
|
+
query_vector=query_vector,
|
|
270
|
+
limit=top_k
|
|
271
|
+
)
|
|
272
|
+
return [{"text": r.payload["text"], "score": r.score} for r in results]
|
|
273
|
+
|
|
274
|
+
# Use in RAG pipeline
|
|
275
|
+
context = retrieve("What is Python?")
|
|
276
|
+
prompt = f"Context: {context}\n\nQuestion: What is Python?"
|
|
277
|
+
```
|
|
278
|
+
|
|
279
|
+
### With LangChain
|
|
280
|
+
|
|
281
|
+
```python
|
|
282
|
+
from langchain_community.vectorstores import Qdrant
|
|
283
|
+
from langchain_community.embeddings import HuggingFaceEmbeddings
|
|
284
|
+
|
|
285
|
+
embeddings = HuggingFaceEmbeddings(model_name="all-MiniLM-L6-v2")
|
|
286
|
+
vectorstore = Qdrant.from_documents(documents, embeddings, url="http://localhost:6333", collection_name="docs")
|
|
287
|
+
retriever = vectorstore.as_retriever(search_kwargs={"k": 5})
|
|
288
|
+
```
|
|
289
|
+
|
|
290
|
+
### With LlamaIndex
|
|
291
|
+
|
|
292
|
+
```python
|
|
293
|
+
from llama_index.vector_stores.qdrant import QdrantVectorStore
|
|
294
|
+
from llama_index.core import VectorStoreIndex, StorageContext
|
|
295
|
+
|
|
296
|
+
vector_store = QdrantVectorStore(client=client, collection_name="llama_docs")
|
|
297
|
+
storage_context = StorageContext.from_defaults(vector_store=vector_store)
|
|
298
|
+
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
|
|
299
|
+
query_engine = index.as_query_engine()
|
|
300
|
+
```
|
|
301
|
+
|
|
302
|
+
## Multi-vector support
|
|
303
|
+
|
|
304
|
+
### Named vectors (different embedding models)
|
|
305
|
+
|
|
306
|
+
```python
|
|
307
|
+
from qdrant_client.models import VectorParams, Distance
|
|
308
|
+
|
|
309
|
+
# Collection with multiple vector types
|
|
310
|
+
client.create_collection(
|
|
311
|
+
collection_name="hybrid_search",
|
|
312
|
+
vectors_config={
|
|
313
|
+
"dense": VectorParams(size=384, distance=Distance.COSINE),
|
|
314
|
+
"sparse": VectorParams(size=30000, distance=Distance.DOT)
|
|
315
|
+
}
|
|
316
|
+
)
|
|
317
|
+
|
|
318
|
+
# Insert with named vectors
|
|
319
|
+
client.upsert(
|
|
320
|
+
collection_name="hybrid_search",
|
|
321
|
+
points=[
|
|
322
|
+
PointStruct(
|
|
323
|
+
id=1,
|
|
324
|
+
vector={
|
|
325
|
+
"dense": dense_embedding,
|
|
326
|
+
"sparse": sparse_embedding
|
|
327
|
+
},
|
|
328
|
+
payload={"text": "document text"}
|
|
329
|
+
)
|
|
330
|
+
]
|
|
331
|
+
)
|
|
332
|
+
|
|
333
|
+
# Search specific vector
|
|
334
|
+
results = client.search(
|
|
335
|
+
collection_name="hybrid_search",
|
|
336
|
+
query_vector=("dense", query_dense), # Specify which vector
|
|
337
|
+
limit=10
|
|
338
|
+
)
|
|
339
|
+
```
|
|
340
|
+
|
|
341
|
+
### Sparse vectors (BM25, SPLADE)
|
|
342
|
+
|
|
343
|
+
```python
|
|
344
|
+
from qdrant_client.models import SparseVectorParams, SparseIndexParams, SparseVector
|
|
345
|
+
|
|
346
|
+
# Collection with sparse vectors
|
|
347
|
+
client.create_collection(
|
|
348
|
+
collection_name="sparse_search",
|
|
349
|
+
vectors_config={},
|
|
350
|
+
sparse_vectors_config={"text": SparseVectorParams(index=SparseIndexParams(on_disk=False))}
|
|
351
|
+
)
|
|
352
|
+
|
|
353
|
+
# Insert sparse vector
|
|
354
|
+
client.upsert(
|
|
355
|
+
collection_name="sparse_search",
|
|
356
|
+
points=[PointStruct(id=1, vector={"text": SparseVector(indices=[1, 5, 100], values=[0.5, 0.8, 0.2])}, payload={"text": "document"})]
|
|
357
|
+
)
|
|
358
|
+
```
|
|
359
|
+
|
|
360
|
+
## Quantization (memory optimization)
|
|
361
|
+
|
|
362
|
+
```python
|
|
363
|
+
from qdrant_client.models import ScalarQuantization, ScalarQuantizationConfig, ScalarType
|
|
364
|
+
|
|
365
|
+
# Scalar quantization (4x memory reduction)
|
|
366
|
+
client.create_collection(
|
|
367
|
+
collection_name="quantized",
|
|
368
|
+
vectors_config=VectorParams(size=384, distance=Distance.COSINE),
|
|
369
|
+
quantization_config=ScalarQuantization(
|
|
370
|
+
scalar=ScalarQuantizationConfig(
|
|
371
|
+
type=ScalarType.INT8,
|
|
372
|
+
quantile=0.99, # Clip outliers
|
|
373
|
+
always_ram=True # Keep quantized in RAM
|
|
374
|
+
)
|
|
375
|
+
)
|
|
376
|
+
)
|
|
377
|
+
|
|
378
|
+
# Search with rescoring
|
|
379
|
+
results = client.search(
|
|
380
|
+
collection_name="quantized",
|
|
381
|
+
query_vector=query,
|
|
382
|
+
search_params={"quantization": {"rescore": True}}, # Rescore top results
|
|
383
|
+
limit=10
|
|
384
|
+
)
|
|
385
|
+
```
|
|
386
|
+
|
|
387
|
+
## Payload indexing
|
|
388
|
+
|
|
389
|
+
```python
|
|
390
|
+
from qdrant_client.models import PayloadSchemaType
|
|
391
|
+
|
|
392
|
+
# Create payload index for faster filtering
|
|
393
|
+
client.create_payload_index(
|
|
394
|
+
collection_name="documents",
|
|
395
|
+
field_name="category",
|
|
396
|
+
field_schema=PayloadSchemaType.KEYWORD
|
|
397
|
+
)
|
|
398
|
+
|
|
399
|
+
client.create_payload_index(
|
|
400
|
+
collection_name="documents",
|
|
401
|
+
field_name="timestamp",
|
|
402
|
+
field_schema=PayloadSchemaType.INTEGER
|
|
403
|
+
)
|
|
404
|
+
|
|
405
|
+
# Index types: KEYWORD, INTEGER, FLOAT, GEO, TEXT (full-text), BOOL
|
|
406
|
+
```
|
|
407
|
+
|
|
408
|
+
## Production deployment
|
|
409
|
+
|
|
410
|
+
### Qdrant Cloud
|
|
411
|
+
|
|
412
|
+
```python
|
|
413
|
+
from qdrant_client import QdrantClient
|
|
414
|
+
|
|
415
|
+
# Connect to Qdrant Cloud
|
|
416
|
+
client = QdrantClient(
|
|
417
|
+
url="https://your-cluster.cloud.qdrant.io",
|
|
418
|
+
api_key="your-api-key"
|
|
419
|
+
)
|
|
420
|
+
```
|
|
421
|
+
|
|
422
|
+
### Performance tuning
|
|
423
|
+
|
|
424
|
+
```python
|
|
425
|
+
# Optimize for search speed (higher recall)
|
|
426
|
+
client.update_collection(
|
|
427
|
+
collection_name="documents",
|
|
428
|
+
hnsw_config=HnswConfigDiff(ef_construct=200, m=32)
|
|
429
|
+
)
|
|
430
|
+
|
|
431
|
+
# Optimize for indexing speed (bulk loads)
|
|
432
|
+
client.update_collection(
|
|
433
|
+
collection_name="documents",
|
|
434
|
+
optimizer_config={"indexing_threshold": 20000}
|
|
435
|
+
)
|
|
436
|
+
```
|
|
437
|
+
|
|
438
|
+
## Best practices
|
|
439
|
+
|
|
440
|
+
1. **Batch operations** - Use batch upsert/search for efficiency
|
|
441
|
+
2. **Payload indexing** - Index fields used in filters
|
|
442
|
+
3. **Quantization** - Enable for large collections (>1M vectors)
|
|
443
|
+
4. **Sharding** - Use for collections >10M vectors
|
|
444
|
+
5. **On-disk storage** - Enable `on_disk_payload` for large payloads
|
|
445
|
+
6. **Connection pooling** - Reuse client instances
|
|
446
|
+
|
|
447
|
+
## Common issues
|
|
448
|
+
|
|
449
|
+
**Slow search with filters:**
|
|
450
|
+
```python
|
|
451
|
+
# Create payload index for filtered fields
|
|
452
|
+
client.create_payload_index(
|
|
453
|
+
collection_name="docs",
|
|
454
|
+
field_name="category",
|
|
455
|
+
field_schema=PayloadSchemaType.KEYWORD
|
|
456
|
+
)
|
|
457
|
+
```
|
|
458
|
+
|
|
459
|
+
**Out of memory:**
|
|
460
|
+
```python
|
|
461
|
+
# Enable quantization and on-disk storage
|
|
462
|
+
client.create_collection(
|
|
463
|
+
collection_name="large_collection",
|
|
464
|
+
vectors_config=VectorParams(size=384, distance=Distance.COSINE),
|
|
465
|
+
quantization_config=ScalarQuantization(...),
|
|
466
|
+
on_disk_payload=True
|
|
467
|
+
)
|
|
468
|
+
```
|
|
469
|
+
|
|
470
|
+
**Connection issues:**
|
|
471
|
+
```python
|
|
472
|
+
# Use timeout and retry
|
|
473
|
+
client = QdrantClient(
|
|
474
|
+
host="localhost",
|
|
475
|
+
port=6333,
|
|
476
|
+
timeout=30,
|
|
477
|
+
prefer_grpc=True # gRPC for better performance
|
|
478
|
+
)
|
|
479
|
+
```
|
|
480
|
+
|
|
481
|
+
## References
|
|
482
|
+
|
|
483
|
+
- **[Advanced Usage](references/advanced-usage.md)** - Distributed mode, hybrid search, recommendations
|
|
484
|
+
- **[Troubleshooting](references/troubleshooting.md)** - Common issues, debugging, performance tuning
|
|
485
|
+
|
|
486
|
+
## Resources
|
|
487
|
+
|
|
488
|
+
- **GitHub**: https://github.com/qdrant/qdrant (22k+ stars)
|
|
489
|
+
- **Docs**: https://qdrant.tech/documentation/
|
|
490
|
+
- **Python Client**: https://github.com/qdrant/qdrant-client
|
|
491
|
+
- **Cloud**: https://cloud.qdrant.io
|
|
492
|
+
- **Version**: 1.12.0+
|
|
493
|
+
- **License**: Apache 2.0
|