@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
|
@@ -0,0 +1,572 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: guidance
|
|
3
|
+
description: Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained generation framework
|
|
4
|
+
version: 1.0.0
|
|
5
|
+
author: Synthetic Sciences
|
|
6
|
+
license: MIT
|
|
7
|
+
tags: [Prompt Engineering, Guidance, Constrained Generation, Structured Output, JSON Validation, Grammar, Microsoft Research, Format Enforcement, Multi-Step Workflows]
|
|
8
|
+
dependencies: [guidance, transformers]
|
|
9
|
+
---
|
|
10
|
+
|
|
11
|
+
# Guidance: Constrained LLM Generation
|
|
12
|
+
|
|
13
|
+
## When to Use This Skill
|
|
14
|
+
|
|
15
|
+
Use Guidance when you need to:
|
|
16
|
+
- **Control LLM output syntax** with regex or grammars
|
|
17
|
+
- **Guarantee valid JSON/XML/code** generation
|
|
18
|
+
- **Reduce latency** vs traditional prompting approaches
|
|
19
|
+
- **Enforce structured formats** (dates, emails, IDs, etc.)
|
|
20
|
+
- **Build multi-step workflows** with Pythonic control flow
|
|
21
|
+
- **Prevent invalid outputs** through grammatical constraints
|
|
22
|
+
|
|
23
|
+
**GitHub Stars**: 18,000+ | **From**: Microsoft Research
|
|
24
|
+
|
|
25
|
+
## Installation
|
|
26
|
+
|
|
27
|
+
```bash
|
|
28
|
+
# Base installation
|
|
29
|
+
pip install guidance
|
|
30
|
+
|
|
31
|
+
# With specific backends
|
|
32
|
+
pip install guidance[transformers] # Hugging Face models
|
|
33
|
+
pip install guidance[llama_cpp] # llama.cpp models
|
|
34
|
+
```
|
|
35
|
+
|
|
36
|
+
## Quick Start
|
|
37
|
+
|
|
38
|
+
### Basic Example: Structured Generation
|
|
39
|
+
|
|
40
|
+
```python
|
|
41
|
+
from guidance import models, gen
|
|
42
|
+
|
|
43
|
+
# Load model (supports OpenAI, Transformers, llama.cpp)
|
|
44
|
+
lm = models.OpenAI("gpt-4")
|
|
45
|
+
|
|
46
|
+
# Generate with constraints
|
|
47
|
+
result = lm + "The capital of France is " + gen("capital", max_tokens=5)
|
|
48
|
+
|
|
49
|
+
print(result["capital"]) # "Paris"
|
|
50
|
+
```
|
|
51
|
+
|
|
52
|
+
### With Anthropic Claude
|
|
53
|
+
|
|
54
|
+
```python
|
|
55
|
+
from guidance import models, gen, system, user, assistant
|
|
56
|
+
|
|
57
|
+
# Configure Claude
|
|
58
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
59
|
+
|
|
60
|
+
# Use context managers for chat format
|
|
61
|
+
with system():
|
|
62
|
+
lm += "You are a helpful assistant."
|
|
63
|
+
|
|
64
|
+
with user():
|
|
65
|
+
lm += "What is the capital of France?"
|
|
66
|
+
|
|
67
|
+
with assistant():
|
|
68
|
+
lm += gen(max_tokens=20)
|
|
69
|
+
```
|
|
70
|
+
|
|
71
|
+
## Core Concepts
|
|
72
|
+
|
|
73
|
+
### 1. Context Managers
|
|
74
|
+
|
|
75
|
+
Guidance uses Pythonic context managers for chat-style interactions.
|
|
76
|
+
|
|
77
|
+
```python
|
|
78
|
+
from guidance import system, user, assistant, gen
|
|
79
|
+
|
|
80
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
81
|
+
|
|
82
|
+
# System message
|
|
83
|
+
with system():
|
|
84
|
+
lm += "You are a JSON generation expert."
|
|
85
|
+
|
|
86
|
+
# User message
|
|
87
|
+
with user():
|
|
88
|
+
lm += "Generate a person object with name and age."
|
|
89
|
+
|
|
90
|
+
# Assistant response
|
|
91
|
+
with assistant():
|
|
92
|
+
lm += gen("response", max_tokens=100)
|
|
93
|
+
|
|
94
|
+
print(lm["response"])
|
|
95
|
+
```
|
|
96
|
+
|
|
97
|
+
**Benefits:**
|
|
98
|
+
- Natural chat flow
|
|
99
|
+
- Clear role separation
|
|
100
|
+
- Easy to read and maintain
|
|
101
|
+
|
|
102
|
+
### 2. Constrained Generation
|
|
103
|
+
|
|
104
|
+
Guidance ensures outputs match specified patterns using regex or grammars.
|
|
105
|
+
|
|
106
|
+
#### Regex Constraints
|
|
107
|
+
|
|
108
|
+
```python
|
|
109
|
+
from guidance import models, gen
|
|
110
|
+
|
|
111
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
112
|
+
|
|
113
|
+
# Constrain to valid email format
|
|
114
|
+
lm += "Email: " + gen("email", regex=r"[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}")
|
|
115
|
+
|
|
116
|
+
# Constrain to date format (YYYY-MM-DD)
|
|
117
|
+
lm += "Date: " + gen("date", regex=r"\d{4}-\d{2}-\d{2}")
|
|
118
|
+
|
|
119
|
+
# Constrain to phone number
|
|
120
|
+
lm += "Phone: " + gen("phone", regex=r"\d{3}-\d{3}-\d{4}")
|
|
121
|
+
|
|
122
|
+
print(lm["email"]) # Guaranteed valid email
|
|
123
|
+
print(lm["date"]) # Guaranteed YYYY-MM-DD format
|
|
124
|
+
```
|
|
125
|
+
|
|
126
|
+
**How it works:**
|
|
127
|
+
- Regex converted to grammar at token level
|
|
128
|
+
- Invalid tokens filtered during generation
|
|
129
|
+
- Model can only produce matching outputs
|
|
130
|
+
|
|
131
|
+
#### Selection Constraints
|
|
132
|
+
|
|
133
|
+
```python
|
|
134
|
+
from guidance import models, gen, select
|
|
135
|
+
|
|
136
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
137
|
+
|
|
138
|
+
# Constrain to specific choices
|
|
139
|
+
lm += "Sentiment: " + select(["positive", "negative", "neutral"], name="sentiment")
|
|
140
|
+
|
|
141
|
+
# Multiple-choice selection
|
|
142
|
+
lm += "Best answer: " + select(
|
|
143
|
+
["A) Paris", "B) London", "C) Berlin", "D) Madrid"],
|
|
144
|
+
name="answer"
|
|
145
|
+
)
|
|
146
|
+
|
|
147
|
+
print(lm["sentiment"]) # One of: positive, negative, neutral
|
|
148
|
+
print(lm["answer"]) # One of: A, B, C, or D
|
|
149
|
+
```
|
|
150
|
+
|
|
151
|
+
### 3. Token Healing
|
|
152
|
+
|
|
153
|
+
Guidance automatically "heals" token boundaries between prompt and generation.
|
|
154
|
+
|
|
155
|
+
**Problem:** Tokenization creates unnatural boundaries.
|
|
156
|
+
|
|
157
|
+
```python
|
|
158
|
+
# Without token healing
|
|
159
|
+
prompt = "The capital of France is "
|
|
160
|
+
# Last token: " is "
|
|
161
|
+
# First generated token might be " Par" (with leading space)
|
|
162
|
+
# Result: "The capital of France is Paris" (double space!)
|
|
163
|
+
```
|
|
164
|
+
|
|
165
|
+
**Solution:** Guidance backs up one token and regenerates.
|
|
166
|
+
|
|
167
|
+
```python
|
|
168
|
+
from guidance import models, gen
|
|
169
|
+
|
|
170
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
171
|
+
|
|
172
|
+
# Token healing enabled by default
|
|
173
|
+
lm += "The capital of France is " + gen("capital", max_tokens=5)
|
|
174
|
+
# Result: "The capital of France is Paris" (correct spacing)
|
|
175
|
+
```
|
|
176
|
+
|
|
177
|
+
**Benefits:**
|
|
178
|
+
- Natural text boundaries
|
|
179
|
+
- No awkward spacing issues
|
|
180
|
+
- Better model performance (sees natural token sequences)
|
|
181
|
+
|
|
182
|
+
### 4. Grammar-Based Generation
|
|
183
|
+
|
|
184
|
+
Define complex structures using context-free grammars.
|
|
185
|
+
|
|
186
|
+
```python
|
|
187
|
+
from guidance import models, gen
|
|
188
|
+
|
|
189
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
190
|
+
|
|
191
|
+
# JSON grammar (simplified)
|
|
192
|
+
json_grammar = """
|
|
193
|
+
{
|
|
194
|
+
"name": <gen name regex="[A-Za-z ]+" max_tokens=20>,
|
|
195
|
+
"age": <gen age regex="[0-9]+" max_tokens=3>,
|
|
196
|
+
"email": <gen email regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}" max_tokens=50>
|
|
197
|
+
}
|
|
198
|
+
"""
|
|
199
|
+
|
|
200
|
+
# Generate valid JSON
|
|
201
|
+
lm += gen("person", grammar=json_grammar)
|
|
202
|
+
|
|
203
|
+
print(lm["person"]) # Guaranteed valid JSON structure
|
|
204
|
+
```
|
|
205
|
+
|
|
206
|
+
**Use cases:**
|
|
207
|
+
- Complex structured outputs
|
|
208
|
+
- Nested data structures
|
|
209
|
+
- Programming language syntax
|
|
210
|
+
- Domain-specific languages
|
|
211
|
+
|
|
212
|
+
### 5. Guidance Functions
|
|
213
|
+
|
|
214
|
+
Create reusable generation patterns with the `@guidance` decorator.
|
|
215
|
+
|
|
216
|
+
```python
|
|
217
|
+
from guidance import guidance, gen, models
|
|
218
|
+
|
|
219
|
+
@guidance
|
|
220
|
+
def generate_person(lm):
|
|
221
|
+
"""Generate a person with name and age."""
|
|
222
|
+
lm += "Name: " + gen("name", max_tokens=20, stop="\n")
|
|
223
|
+
lm += "\nAge: " + gen("age", regex=r"[0-9]+", max_tokens=3)
|
|
224
|
+
return lm
|
|
225
|
+
|
|
226
|
+
# Use the function
|
|
227
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
228
|
+
lm = generate_person(lm)
|
|
229
|
+
|
|
230
|
+
print(lm["name"])
|
|
231
|
+
print(lm["age"])
|
|
232
|
+
```
|
|
233
|
+
|
|
234
|
+
**Stateful Functions:**
|
|
235
|
+
|
|
236
|
+
```python
|
|
237
|
+
@guidance(stateless=False)
|
|
238
|
+
def react_agent(lm, question, tools, max_rounds=5):
|
|
239
|
+
"""ReAct agent with tool use."""
|
|
240
|
+
lm += f"Question: {question}\n\n"
|
|
241
|
+
|
|
242
|
+
for i in range(max_rounds):
|
|
243
|
+
# Thought
|
|
244
|
+
lm += f"Thought {i+1}: " + gen("thought", stop="\n")
|
|
245
|
+
|
|
246
|
+
# Action
|
|
247
|
+
lm += "\nAction: " + select(list(tools.keys()), name="action")
|
|
248
|
+
|
|
249
|
+
# Execute tool
|
|
250
|
+
tool_result = tools[lm["action"]]()
|
|
251
|
+
lm += f"\nObservation: {tool_result}\n\n"
|
|
252
|
+
|
|
253
|
+
# Check if done
|
|
254
|
+
lm += "Done? " + select(["Yes", "No"], name="done")
|
|
255
|
+
if lm["done"] == "Yes":
|
|
256
|
+
break
|
|
257
|
+
|
|
258
|
+
# Final answer
|
|
259
|
+
lm += "\nFinal Answer: " + gen("answer", max_tokens=100)
|
|
260
|
+
return lm
|
|
261
|
+
```
|
|
262
|
+
|
|
263
|
+
## Backend Configuration
|
|
264
|
+
|
|
265
|
+
### Anthropic Claude
|
|
266
|
+
|
|
267
|
+
```python
|
|
268
|
+
from guidance import models
|
|
269
|
+
|
|
270
|
+
lm = models.Anthropic(
|
|
271
|
+
model="claude-sonnet-4-5-20250929",
|
|
272
|
+
api_key="your-api-key" # Or set ANTHROPIC_API_KEY env var
|
|
273
|
+
)
|
|
274
|
+
```
|
|
275
|
+
|
|
276
|
+
### OpenAI
|
|
277
|
+
|
|
278
|
+
```python
|
|
279
|
+
lm = models.OpenAI(
|
|
280
|
+
model="gpt-4o-mini",
|
|
281
|
+
api_key="your-api-key" # Or set OPENAI_API_KEY env var
|
|
282
|
+
)
|
|
283
|
+
```
|
|
284
|
+
|
|
285
|
+
### Local Models (Transformers)
|
|
286
|
+
|
|
287
|
+
```python
|
|
288
|
+
from guidance.models import Transformers
|
|
289
|
+
|
|
290
|
+
lm = Transformers(
|
|
291
|
+
"microsoft/Phi-4-mini-instruct",
|
|
292
|
+
device="cuda" # Or "cpu"
|
|
293
|
+
)
|
|
294
|
+
```
|
|
295
|
+
|
|
296
|
+
### Local Models (llama.cpp)
|
|
297
|
+
|
|
298
|
+
```python
|
|
299
|
+
from guidance.models import LlamaCpp
|
|
300
|
+
|
|
301
|
+
lm = LlamaCpp(
|
|
302
|
+
model_path="/path/to/model.gguf",
|
|
303
|
+
n_ctx=4096,
|
|
304
|
+
n_gpu_layers=35
|
|
305
|
+
)
|
|
306
|
+
```
|
|
307
|
+
|
|
308
|
+
## Common Patterns
|
|
309
|
+
|
|
310
|
+
### Pattern 1: JSON Generation
|
|
311
|
+
|
|
312
|
+
```python
|
|
313
|
+
from guidance import models, gen, system, user, assistant
|
|
314
|
+
|
|
315
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
316
|
+
|
|
317
|
+
with system():
|
|
318
|
+
lm += "You generate valid JSON."
|
|
319
|
+
|
|
320
|
+
with user():
|
|
321
|
+
lm += "Generate a user profile with name, age, and email."
|
|
322
|
+
|
|
323
|
+
with assistant():
|
|
324
|
+
lm += """{
|
|
325
|
+
"name": """ + gen("name", regex=r'"[A-Za-z ]+"', max_tokens=30) + """,
|
|
326
|
+
"age": """ + gen("age", regex=r"[0-9]+", max_tokens=3) + """,
|
|
327
|
+
"email": """ + gen("email", regex=r'"[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}"', max_tokens=50) + """
|
|
328
|
+
}"""
|
|
329
|
+
|
|
330
|
+
print(lm) # Valid JSON guaranteed
|
|
331
|
+
```
|
|
332
|
+
|
|
333
|
+
### Pattern 2: Classification
|
|
334
|
+
|
|
335
|
+
```python
|
|
336
|
+
from guidance import models, gen, select
|
|
337
|
+
|
|
338
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
339
|
+
|
|
340
|
+
text = "This product is amazing! I love it."
|
|
341
|
+
|
|
342
|
+
lm += f"Text: {text}\n"
|
|
343
|
+
lm += "Sentiment: " + select(["positive", "negative", "neutral"], name="sentiment")
|
|
344
|
+
lm += "\nConfidence: " + gen("confidence", regex=r"[0-9]+", max_tokens=3) + "%"
|
|
345
|
+
|
|
346
|
+
print(f"Sentiment: {lm['sentiment']}")
|
|
347
|
+
print(f"Confidence: {lm['confidence']}%")
|
|
348
|
+
```
|
|
349
|
+
|
|
350
|
+
### Pattern 3: Multi-Step Reasoning
|
|
351
|
+
|
|
352
|
+
```python
|
|
353
|
+
from guidance import models, gen, guidance
|
|
354
|
+
|
|
355
|
+
@guidance
|
|
356
|
+
def chain_of_thought(lm, question):
|
|
357
|
+
"""Generate answer with step-by-step reasoning."""
|
|
358
|
+
lm += f"Question: {question}\n\n"
|
|
359
|
+
|
|
360
|
+
# Generate multiple reasoning steps
|
|
361
|
+
for i in range(3):
|
|
362
|
+
lm += f"Step {i+1}: " + gen(f"step_{i+1}", stop="\n", max_tokens=100) + "\n"
|
|
363
|
+
|
|
364
|
+
# Final answer
|
|
365
|
+
lm += "\nTherefore, the answer is: " + gen("answer", max_tokens=50)
|
|
366
|
+
|
|
367
|
+
return lm
|
|
368
|
+
|
|
369
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
370
|
+
lm = chain_of_thought(lm, "What is 15% of 200?")
|
|
371
|
+
|
|
372
|
+
print(lm["answer"])
|
|
373
|
+
```
|
|
374
|
+
|
|
375
|
+
### Pattern 4: ReAct Agent
|
|
376
|
+
|
|
377
|
+
```python
|
|
378
|
+
from guidance import models, gen, select, guidance
|
|
379
|
+
|
|
380
|
+
@guidance(stateless=False)
|
|
381
|
+
def react_agent(lm, question):
|
|
382
|
+
"""ReAct agent with tool use."""
|
|
383
|
+
tools = {
|
|
384
|
+
"calculator": lambda expr: eval(expr),
|
|
385
|
+
"search": lambda query: f"Search results for: {query}",
|
|
386
|
+
}
|
|
387
|
+
|
|
388
|
+
lm += f"Question: {question}\n\n"
|
|
389
|
+
|
|
390
|
+
for round in range(5):
|
|
391
|
+
# Thought
|
|
392
|
+
lm += f"Thought: " + gen("thought", stop="\n") + "\n"
|
|
393
|
+
|
|
394
|
+
# Action selection
|
|
395
|
+
lm += "Action: " + select(["calculator", "search", "answer"], name="action")
|
|
396
|
+
|
|
397
|
+
if lm["action"] == "answer":
|
|
398
|
+
lm += "\nFinal Answer: " + gen("answer", max_tokens=100)
|
|
399
|
+
break
|
|
400
|
+
|
|
401
|
+
# Action input
|
|
402
|
+
lm += "\nAction Input: " + gen("action_input", stop="\n") + "\n"
|
|
403
|
+
|
|
404
|
+
# Execute tool
|
|
405
|
+
if lm["action"] in tools:
|
|
406
|
+
result = tools[lm["action"]](lm["action_input"])
|
|
407
|
+
lm += f"Observation: {result}\n\n"
|
|
408
|
+
|
|
409
|
+
return lm
|
|
410
|
+
|
|
411
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
412
|
+
lm = react_agent(lm, "What is 25 * 4 + 10?")
|
|
413
|
+
print(lm["answer"])
|
|
414
|
+
```
|
|
415
|
+
|
|
416
|
+
### Pattern 5: Data Extraction
|
|
417
|
+
|
|
418
|
+
```python
|
|
419
|
+
from guidance import models, gen, guidance
|
|
420
|
+
|
|
421
|
+
@guidance
|
|
422
|
+
def extract_entities(lm, text):
|
|
423
|
+
"""Extract structured entities from text."""
|
|
424
|
+
lm += f"Text: {text}\n\n"
|
|
425
|
+
|
|
426
|
+
# Extract person
|
|
427
|
+
lm += "Person: " + gen("person", stop="\n", max_tokens=30) + "\n"
|
|
428
|
+
|
|
429
|
+
# Extract organization
|
|
430
|
+
lm += "Organization: " + gen("organization", stop="\n", max_tokens=30) + "\n"
|
|
431
|
+
|
|
432
|
+
# Extract date
|
|
433
|
+
lm += "Date: " + gen("date", regex=r"\d{4}-\d{2}-\d{2}", max_tokens=10) + "\n"
|
|
434
|
+
|
|
435
|
+
# Extract location
|
|
436
|
+
lm += "Location: " + gen("location", stop="\n", max_tokens=30) + "\n"
|
|
437
|
+
|
|
438
|
+
return lm
|
|
439
|
+
|
|
440
|
+
text = "Tim Cook announced at Apple Park on 2024-09-15 in Cupertino."
|
|
441
|
+
|
|
442
|
+
lm = models.Anthropic("claude-sonnet-4-5-20250929")
|
|
443
|
+
lm = extract_entities(lm, text)
|
|
444
|
+
|
|
445
|
+
print(f"Person: {lm['person']}")
|
|
446
|
+
print(f"Organization: {lm['organization']}")
|
|
447
|
+
print(f"Date: {lm['date']}")
|
|
448
|
+
print(f"Location: {lm['location']}")
|
|
449
|
+
```
|
|
450
|
+
|
|
451
|
+
## Best Practices
|
|
452
|
+
|
|
453
|
+
### 1. Use Regex for Format Validation
|
|
454
|
+
|
|
455
|
+
```python
|
|
456
|
+
# ✅ Good: Regex ensures valid format
|
|
457
|
+
lm += "Email: " + gen("email", regex=r"[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}")
|
|
458
|
+
|
|
459
|
+
# ❌ Bad: Free generation may produce invalid emails
|
|
460
|
+
lm += "Email: " + gen("email", max_tokens=50)
|
|
461
|
+
```
|
|
462
|
+
|
|
463
|
+
### 2. Use select() for Fixed Categories
|
|
464
|
+
|
|
465
|
+
```python
|
|
466
|
+
# ✅ Good: Guaranteed valid category
|
|
467
|
+
lm += "Status: " + select(["pending", "approved", "rejected"], name="status")
|
|
468
|
+
|
|
469
|
+
# ❌ Bad: May generate typos or invalid values
|
|
470
|
+
lm += "Status: " + gen("status", max_tokens=20)
|
|
471
|
+
```
|
|
472
|
+
|
|
473
|
+
### 3. Leverage Token Healing
|
|
474
|
+
|
|
475
|
+
```python
|
|
476
|
+
# Token healing is enabled by default
|
|
477
|
+
# No special action needed - just concatenate naturally
|
|
478
|
+
lm += "The capital is " + gen("capital") # Automatic healing
|
|
479
|
+
```
|
|
480
|
+
|
|
481
|
+
### 4. Use stop Sequences
|
|
482
|
+
|
|
483
|
+
```python
|
|
484
|
+
# ✅ Good: Stop at newline for single-line outputs
|
|
485
|
+
lm += "Name: " + gen("name", stop="\n")
|
|
486
|
+
|
|
487
|
+
# ❌ Bad: May generate multiple lines
|
|
488
|
+
lm += "Name: " + gen("name", max_tokens=50)
|
|
489
|
+
```
|
|
490
|
+
|
|
491
|
+
### 5. Create Reusable Functions
|
|
492
|
+
|
|
493
|
+
```python
|
|
494
|
+
# ✅ Good: Reusable pattern
|
|
495
|
+
@guidance
|
|
496
|
+
def generate_person(lm):
|
|
497
|
+
lm += "Name: " + gen("name", stop="\n")
|
|
498
|
+
lm += "\nAge: " + gen("age", regex=r"[0-9]+")
|
|
499
|
+
return lm
|
|
500
|
+
|
|
501
|
+
# Use multiple times
|
|
502
|
+
lm = generate_person(lm)
|
|
503
|
+
lm += "\n\n"
|
|
504
|
+
lm = generate_person(lm)
|
|
505
|
+
```
|
|
506
|
+
|
|
507
|
+
### 6. Balance Constraints
|
|
508
|
+
|
|
509
|
+
```python
|
|
510
|
+
# ✅ Good: Reasonable constraints
|
|
511
|
+
lm += gen("name", regex=r"[A-Za-z ]+", max_tokens=30)
|
|
512
|
+
|
|
513
|
+
# ❌ Too strict: May fail or be very slow
|
|
514
|
+
lm += gen("name", regex=r"^(John|Jane)$", max_tokens=10)
|
|
515
|
+
```
|
|
516
|
+
|
|
517
|
+
## Comparison to Alternatives
|
|
518
|
+
|
|
519
|
+
| Feature | Guidance | Instructor | Outlines | LMQL |
|
|
520
|
+
|---------|----------|------------|----------|------|
|
|
521
|
+
| Regex Constraints | ✅ Yes | ❌ No | ✅ Yes | ✅ Yes |
|
|
522
|
+
| Grammar Support | ✅ CFG | ❌ No | ✅ CFG | ✅ CFG |
|
|
523
|
+
| Pydantic Validation | ❌ No | ✅ Yes | ✅ Yes | ❌ No |
|
|
524
|
+
| Token Healing | ✅ Yes | ❌ No | ✅ Yes | ❌ No |
|
|
525
|
+
| Local Models | ✅ Yes | ⚠️ Limited | ✅ Yes | ✅ Yes |
|
|
526
|
+
| API Models | ✅ Yes | ✅ Yes | ⚠️ Limited | ✅ Yes |
|
|
527
|
+
| Pythonic Syntax | ✅ Yes | ✅ Yes | ✅ Yes | ❌ SQL-like |
|
|
528
|
+
| Learning Curve | Low | Low | Medium | High |
|
|
529
|
+
|
|
530
|
+
**When to choose Guidance:**
|
|
531
|
+
- Need regex/grammar constraints
|
|
532
|
+
- Want token healing
|
|
533
|
+
- Building complex workflows with control flow
|
|
534
|
+
- Using local models (Transformers, llama.cpp)
|
|
535
|
+
- Prefer Pythonic syntax
|
|
536
|
+
|
|
537
|
+
**When to choose alternatives:**
|
|
538
|
+
- Instructor: Need Pydantic validation with automatic retrying
|
|
539
|
+
- Outlines: Need JSON schema validation
|
|
540
|
+
- LMQL: Prefer declarative query syntax
|
|
541
|
+
|
|
542
|
+
## Performance Characteristics
|
|
543
|
+
|
|
544
|
+
**Latency Reduction:**
|
|
545
|
+
- 30-50% faster than traditional prompting for constrained outputs
|
|
546
|
+
- Token healing reduces unnecessary regeneration
|
|
547
|
+
- Grammar constraints prevent invalid token generation
|
|
548
|
+
|
|
549
|
+
**Memory Usage:**
|
|
550
|
+
- Minimal overhead vs unconstrained generation
|
|
551
|
+
- Grammar compilation cached after first use
|
|
552
|
+
- Efficient token filtering at inference time
|
|
553
|
+
|
|
554
|
+
**Token Efficiency:**
|
|
555
|
+
- Prevents wasted tokens on invalid outputs
|
|
556
|
+
- No need for retry loops
|
|
557
|
+
- Direct path to valid outputs
|
|
558
|
+
|
|
559
|
+
## Resources
|
|
560
|
+
|
|
561
|
+
- **Documentation**: https://guidance.readthedocs.io
|
|
562
|
+
- **GitHub**: https://github.com/guidance-ai/guidance (18k+ stars)
|
|
563
|
+
- **Notebooks**: https://github.com/guidance-ai/guidance/tree/main/notebooks
|
|
564
|
+
- **Discord**: Community support available
|
|
565
|
+
|
|
566
|
+
## See Also
|
|
567
|
+
|
|
568
|
+
- `references/constraints.md` - Comprehensive regex and grammar patterns
|
|
569
|
+
- `references/backends.md` - Backend-specific configuration
|
|
570
|
+
- `references/examples.md` - Production-ready examples
|
|
571
|
+
|
|
572
|
+
|