@synsci/cli-darwin-x64 1.1.49
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/skills/accelerate/SKILL.md +332 -0
- package/bin/skills/accelerate/references/custom-plugins.md +453 -0
- package/bin/skills/accelerate/references/megatron-integration.md +489 -0
- package/bin/skills/accelerate/references/performance.md +525 -0
- package/bin/skills/audiocraft/SKILL.md +564 -0
- package/bin/skills/audiocraft/references/advanced-usage.md +666 -0
- package/bin/skills/audiocraft/references/troubleshooting.md +504 -0
- package/bin/skills/autogpt/SKILL.md +403 -0
- package/bin/skills/autogpt/references/advanced-usage.md +535 -0
- package/bin/skills/autogpt/references/troubleshooting.md +420 -0
- package/bin/skills/awq/SKILL.md +310 -0
- package/bin/skills/awq/references/advanced-usage.md +324 -0
- package/bin/skills/awq/references/troubleshooting.md +344 -0
- package/bin/skills/axolotl/SKILL.md +158 -0
- package/bin/skills/axolotl/references/api.md +5548 -0
- package/bin/skills/axolotl/references/dataset-formats.md +1029 -0
- package/bin/skills/axolotl/references/index.md +15 -0
- package/bin/skills/axolotl/references/other.md +3563 -0
- package/bin/skills/bigcode-evaluation-harness/SKILL.md +405 -0
- package/bin/skills/bigcode-evaluation-harness/references/benchmarks.md +393 -0
- package/bin/skills/bigcode-evaluation-harness/references/custom-tasks.md +424 -0
- package/bin/skills/bigcode-evaluation-harness/references/issues.md +394 -0
- package/bin/skills/bitsandbytes/SKILL.md +411 -0
- package/bin/skills/bitsandbytes/references/memory-optimization.md +521 -0
- package/bin/skills/bitsandbytes/references/qlora-training.md +521 -0
- package/bin/skills/bitsandbytes/references/quantization-formats.md +447 -0
- package/bin/skills/blip-2/SKILL.md +564 -0
- package/bin/skills/blip-2/references/advanced-usage.md +680 -0
- package/bin/skills/blip-2/references/troubleshooting.md +526 -0
- package/bin/skills/chroma/SKILL.md +406 -0
- package/bin/skills/chroma/references/integration.md +38 -0
- package/bin/skills/clip/SKILL.md +253 -0
- package/bin/skills/clip/references/applications.md +207 -0
- package/bin/skills/constitutional-ai/SKILL.md +290 -0
- package/bin/skills/crewai/SKILL.md +498 -0
- package/bin/skills/crewai/references/flows.md +438 -0
- package/bin/skills/crewai/references/tools.md +429 -0
- package/bin/skills/crewai/references/troubleshooting.md +480 -0
- package/bin/skills/deepspeed/SKILL.md +141 -0
- package/bin/skills/deepspeed/references/08.md +17 -0
- package/bin/skills/deepspeed/references/09.md +173 -0
- package/bin/skills/deepspeed/references/2020.md +378 -0
- package/bin/skills/deepspeed/references/2023.md +279 -0
- package/bin/skills/deepspeed/references/assets.md +179 -0
- package/bin/skills/deepspeed/references/index.md +35 -0
- package/bin/skills/deepspeed/references/mii.md +118 -0
- package/bin/skills/deepspeed/references/other.md +1191 -0
- package/bin/skills/deepspeed/references/tutorials.md +6554 -0
- package/bin/skills/dspy/SKILL.md +590 -0
- package/bin/skills/dspy/references/examples.md +663 -0
- package/bin/skills/dspy/references/modules.md +475 -0
- package/bin/skills/dspy/references/optimizers.md +566 -0
- package/bin/skills/faiss/SKILL.md +221 -0
- package/bin/skills/faiss/references/index_types.md +280 -0
- package/bin/skills/flash-attention/SKILL.md +367 -0
- package/bin/skills/flash-attention/references/benchmarks.md +215 -0
- package/bin/skills/flash-attention/references/transformers-integration.md +293 -0
- package/bin/skills/gguf/SKILL.md +427 -0
- package/bin/skills/gguf/references/advanced-usage.md +504 -0
- package/bin/skills/gguf/references/troubleshooting.md +442 -0
- package/bin/skills/gptq/SKILL.md +450 -0
- package/bin/skills/gptq/references/calibration.md +337 -0
- package/bin/skills/gptq/references/integration.md +129 -0
- package/bin/skills/gptq/references/troubleshooting.md +95 -0
- package/bin/skills/grpo-rl-training/README.md +97 -0
- package/bin/skills/grpo-rl-training/SKILL.md +572 -0
- package/bin/skills/grpo-rl-training/examples/reward_functions_library.py +393 -0
- package/bin/skills/grpo-rl-training/templates/basic_grpo_training.py +228 -0
- package/bin/skills/guidance/SKILL.md +572 -0
- package/bin/skills/guidance/references/backends.md +554 -0
- package/bin/skills/guidance/references/constraints.md +674 -0
- package/bin/skills/guidance/references/examples.md +767 -0
- package/bin/skills/hqq/SKILL.md +445 -0
- package/bin/skills/hqq/references/advanced-usage.md +528 -0
- package/bin/skills/hqq/references/troubleshooting.md +503 -0
- package/bin/skills/hugging-face-cli/SKILL.md +191 -0
- package/bin/skills/hugging-face-cli/references/commands.md +954 -0
- package/bin/skills/hugging-face-cli/references/examples.md +374 -0
- package/bin/skills/hugging-face-datasets/SKILL.md +547 -0
- package/bin/skills/hugging-face-datasets/examples/diverse_training_examples.json +239 -0
- package/bin/skills/hugging-face-datasets/examples/system_prompt_template.txt +196 -0
- package/bin/skills/hugging-face-datasets/examples/training_examples.json +176 -0
- package/bin/skills/hugging-face-datasets/scripts/dataset_manager.py +522 -0
- package/bin/skills/hugging-face-datasets/scripts/sql_manager.py +844 -0
- package/bin/skills/hugging-face-datasets/templates/chat.json +55 -0
- package/bin/skills/hugging-face-datasets/templates/classification.json +62 -0
- package/bin/skills/hugging-face-datasets/templates/completion.json +51 -0
- package/bin/skills/hugging-face-datasets/templates/custom.json +75 -0
- package/bin/skills/hugging-face-datasets/templates/qa.json +54 -0
- package/bin/skills/hugging-face-datasets/templates/tabular.json +81 -0
- package/bin/skills/hugging-face-evaluation/SKILL.md +656 -0
- package/bin/skills/hugging-face-evaluation/examples/USAGE_EXAMPLES.md +382 -0
- package/bin/skills/hugging-face-evaluation/examples/artificial_analysis_to_hub.py +141 -0
- package/bin/skills/hugging-face-evaluation/examples/example_readme_tables.md +135 -0
- package/bin/skills/hugging-face-evaluation/examples/metric_mapping.json +50 -0
- package/bin/skills/hugging-face-evaluation/requirements.txt +20 -0
- package/bin/skills/hugging-face-evaluation/scripts/evaluation_manager.py +1374 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_eval_uv.py +104 -0
- package/bin/skills/hugging-face-evaluation/scripts/inspect_vllm_uv.py +317 -0
- package/bin/skills/hugging-face-evaluation/scripts/lighteval_vllm_uv.py +303 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_eval_job.py +98 -0
- package/bin/skills/hugging-face-evaluation/scripts/run_vllm_eval_job.py +331 -0
- package/bin/skills/hugging-face-evaluation/scripts/test_extraction.py +206 -0
- package/bin/skills/hugging-face-jobs/SKILL.md +1041 -0
- package/bin/skills/hugging-face-jobs/index.html +216 -0
- package/bin/skills/hugging-face-jobs/references/hardware_guide.md +336 -0
- package/bin/skills/hugging-face-jobs/references/hub_saving.md +352 -0
- package/bin/skills/hugging-face-jobs/references/token_usage.md +546 -0
- package/bin/skills/hugging-face-jobs/references/troubleshooting.md +475 -0
- package/bin/skills/hugging-face-jobs/scripts/cot-self-instruct.py +718 -0
- package/bin/skills/hugging-face-jobs/scripts/finepdfs-stats.py +546 -0
- package/bin/skills/hugging-face-jobs/scripts/generate-responses.py +587 -0
- package/bin/skills/hugging-face-model-trainer/SKILL.md +711 -0
- package/bin/skills/hugging-face-model-trainer/references/gguf_conversion.md +296 -0
- package/bin/skills/hugging-face-model-trainer/references/hardware_guide.md +283 -0
- package/bin/skills/hugging-face-model-trainer/references/hub_saving.md +364 -0
- package/bin/skills/hugging-face-model-trainer/references/reliability_principles.md +371 -0
- package/bin/skills/hugging-face-model-trainer/references/trackio_guide.md +189 -0
- package/bin/skills/hugging-face-model-trainer/references/training_methods.md +150 -0
- package/bin/skills/hugging-face-model-trainer/references/training_patterns.md +203 -0
- package/bin/skills/hugging-face-model-trainer/references/troubleshooting.md +282 -0
- package/bin/skills/hugging-face-model-trainer/scripts/convert_to_gguf.py +424 -0
- package/bin/skills/hugging-face-model-trainer/scripts/dataset_inspector.py +417 -0
- package/bin/skills/hugging-face-model-trainer/scripts/estimate_cost.py +150 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_dpo_example.py +106 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_grpo_example.py +89 -0
- package/bin/skills/hugging-face-model-trainer/scripts/train_sft_example.py +122 -0
- package/bin/skills/hugging-face-paper-publisher/SKILL.md +627 -0
- package/bin/skills/hugging-face-paper-publisher/examples/example_usage.md +327 -0
- package/bin/skills/hugging-face-paper-publisher/references/quick_reference.md +216 -0
- package/bin/skills/hugging-face-paper-publisher/scripts/paper_manager.py +508 -0
- package/bin/skills/hugging-face-paper-publisher/templates/arxiv.md +299 -0
- package/bin/skills/hugging-face-paper-publisher/templates/ml-report.md +358 -0
- package/bin/skills/hugging-face-paper-publisher/templates/modern.md +319 -0
- package/bin/skills/hugging-face-paper-publisher/templates/standard.md +201 -0
- package/bin/skills/hugging-face-tool-builder/SKILL.md +115 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.py +57 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.sh +40 -0
- package/bin/skills/hugging-face-tool-builder/references/baseline_hf_api.tsx +57 -0
- package/bin/skills/hugging-face-tool-builder/references/find_models_by_paper.sh +230 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_enrich_models.sh +96 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_card_frontmatter.sh +188 -0
- package/bin/skills/hugging-face-tool-builder/references/hf_model_papers_auth.sh +171 -0
- package/bin/skills/hugging-face-trackio/SKILL.md +65 -0
- package/bin/skills/hugging-face-trackio/references/logging_metrics.md +206 -0
- package/bin/skills/hugging-face-trackio/references/retrieving_metrics.md +223 -0
- package/bin/skills/huggingface-tokenizers/SKILL.md +516 -0
- package/bin/skills/huggingface-tokenizers/references/algorithms.md +653 -0
- package/bin/skills/huggingface-tokenizers/references/integration.md +637 -0
- package/bin/skills/huggingface-tokenizers/references/pipeline.md +723 -0
- package/bin/skills/huggingface-tokenizers/references/training.md +565 -0
- package/bin/skills/instructor/SKILL.md +740 -0
- package/bin/skills/instructor/references/examples.md +107 -0
- package/bin/skills/instructor/references/providers.md +70 -0
- package/bin/skills/instructor/references/validation.md +606 -0
- package/bin/skills/knowledge-distillation/SKILL.md +458 -0
- package/bin/skills/knowledge-distillation/references/minillm.md +334 -0
- package/bin/skills/lambda-labs/SKILL.md +545 -0
- package/bin/skills/lambda-labs/references/advanced-usage.md +611 -0
- package/bin/skills/lambda-labs/references/troubleshooting.md +530 -0
- package/bin/skills/langchain/SKILL.md +480 -0
- package/bin/skills/langchain/references/agents.md +499 -0
- package/bin/skills/langchain/references/integration.md +562 -0
- package/bin/skills/langchain/references/rag.md +600 -0
- package/bin/skills/langsmith/SKILL.md +422 -0
- package/bin/skills/langsmith/references/advanced-usage.md +548 -0
- package/bin/skills/langsmith/references/troubleshooting.md +537 -0
- package/bin/skills/litgpt/SKILL.md +469 -0
- package/bin/skills/litgpt/references/custom-models.md +568 -0
- package/bin/skills/litgpt/references/distributed-training.md +451 -0
- package/bin/skills/litgpt/references/supported-models.md +336 -0
- package/bin/skills/litgpt/references/training-recipes.md +619 -0
- package/bin/skills/llama-cpp/SKILL.md +258 -0
- package/bin/skills/llama-cpp/references/optimization.md +89 -0
- package/bin/skills/llama-cpp/references/quantization.md +213 -0
- package/bin/skills/llama-cpp/references/server.md +125 -0
- package/bin/skills/llama-factory/SKILL.md +80 -0
- package/bin/skills/llama-factory/references/_images.md +23 -0
- package/bin/skills/llama-factory/references/advanced.md +1055 -0
- package/bin/skills/llama-factory/references/getting_started.md +349 -0
- package/bin/skills/llama-factory/references/index.md +19 -0
- package/bin/skills/llama-factory/references/other.md +31 -0
- package/bin/skills/llamaguard/SKILL.md +337 -0
- package/bin/skills/llamaindex/SKILL.md +569 -0
- package/bin/skills/llamaindex/references/agents.md +83 -0
- package/bin/skills/llamaindex/references/data_connectors.md +108 -0
- package/bin/skills/llamaindex/references/query_engines.md +406 -0
- package/bin/skills/llava/SKILL.md +304 -0
- package/bin/skills/llava/references/training.md +197 -0
- package/bin/skills/lm-evaluation-harness/SKILL.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/api-evaluation.md +490 -0
- package/bin/skills/lm-evaluation-harness/references/benchmark-guide.md +488 -0
- package/bin/skills/lm-evaluation-harness/references/custom-tasks.md +602 -0
- package/bin/skills/lm-evaluation-harness/references/distributed-eval.md +519 -0
- package/bin/skills/long-context/SKILL.md +536 -0
- package/bin/skills/long-context/references/extension_methods.md +468 -0
- package/bin/skills/long-context/references/fine_tuning.md +611 -0
- package/bin/skills/long-context/references/rope.md +402 -0
- package/bin/skills/mamba/SKILL.md +260 -0
- package/bin/skills/mamba/references/architecture-details.md +206 -0
- package/bin/skills/mamba/references/benchmarks.md +255 -0
- package/bin/skills/mamba/references/training-guide.md +388 -0
- package/bin/skills/megatron-core/SKILL.md +366 -0
- package/bin/skills/megatron-core/references/benchmarks.md +249 -0
- package/bin/skills/megatron-core/references/parallelism-guide.md +404 -0
- package/bin/skills/megatron-core/references/production-examples.md +473 -0
- package/bin/skills/megatron-core/references/training-recipes.md +547 -0
- package/bin/skills/miles/SKILL.md +315 -0
- package/bin/skills/miles/references/api-reference.md +141 -0
- package/bin/skills/miles/references/troubleshooting.md +352 -0
- package/bin/skills/mlflow/SKILL.md +704 -0
- package/bin/skills/mlflow/references/deployment.md +744 -0
- package/bin/skills/mlflow/references/model-registry.md +770 -0
- package/bin/skills/mlflow/references/tracking.md +680 -0
- package/bin/skills/modal/SKILL.md +341 -0
- package/bin/skills/modal/references/advanced-usage.md +503 -0
- package/bin/skills/modal/references/troubleshooting.md +494 -0
- package/bin/skills/model-merging/SKILL.md +539 -0
- package/bin/skills/model-merging/references/evaluation.md +462 -0
- package/bin/skills/model-merging/references/examples.md +428 -0
- package/bin/skills/model-merging/references/methods.md +352 -0
- package/bin/skills/model-pruning/SKILL.md +495 -0
- package/bin/skills/model-pruning/references/wanda.md +347 -0
- package/bin/skills/moe-training/SKILL.md +526 -0
- package/bin/skills/moe-training/references/architectures.md +432 -0
- package/bin/skills/moe-training/references/inference.md +348 -0
- package/bin/skills/moe-training/references/training.md +425 -0
- package/bin/skills/nanogpt/SKILL.md +290 -0
- package/bin/skills/nanogpt/references/architecture.md +382 -0
- package/bin/skills/nanogpt/references/data.md +476 -0
- package/bin/skills/nanogpt/references/training.md +564 -0
- package/bin/skills/nemo-curator/SKILL.md +383 -0
- package/bin/skills/nemo-curator/references/deduplication.md +87 -0
- package/bin/skills/nemo-curator/references/filtering.md +102 -0
- package/bin/skills/nemo-evaluator/SKILL.md +494 -0
- package/bin/skills/nemo-evaluator/references/adapter-system.md +340 -0
- package/bin/skills/nemo-evaluator/references/configuration.md +447 -0
- package/bin/skills/nemo-evaluator/references/custom-benchmarks.md +315 -0
- package/bin/skills/nemo-evaluator/references/execution-backends.md +361 -0
- package/bin/skills/nemo-guardrails/SKILL.md +297 -0
- package/bin/skills/nnsight/SKILL.md +436 -0
- package/bin/skills/nnsight/references/README.md +78 -0
- package/bin/skills/nnsight/references/api.md +344 -0
- package/bin/skills/nnsight/references/tutorials.md +300 -0
- package/bin/skills/openrlhf/SKILL.md +249 -0
- package/bin/skills/openrlhf/references/algorithm-comparison.md +404 -0
- package/bin/skills/openrlhf/references/custom-rewards.md +530 -0
- package/bin/skills/openrlhf/references/hybrid-engine.md +287 -0
- package/bin/skills/openrlhf/references/multi-node-training.md +454 -0
- package/bin/skills/outlines/SKILL.md +652 -0
- package/bin/skills/outlines/references/backends.md +615 -0
- package/bin/skills/outlines/references/examples.md +773 -0
- package/bin/skills/outlines/references/json_generation.md +652 -0
- package/bin/skills/peft/SKILL.md +431 -0
- package/bin/skills/peft/references/advanced-usage.md +514 -0
- package/bin/skills/peft/references/troubleshooting.md +480 -0
- package/bin/skills/phoenix/SKILL.md +475 -0
- package/bin/skills/phoenix/references/advanced-usage.md +619 -0
- package/bin/skills/phoenix/references/troubleshooting.md +538 -0
- package/bin/skills/pinecone/SKILL.md +358 -0
- package/bin/skills/pinecone/references/deployment.md +181 -0
- package/bin/skills/pytorch-fsdp/SKILL.md +126 -0
- package/bin/skills/pytorch-fsdp/references/index.md +7 -0
- package/bin/skills/pytorch-fsdp/references/other.md +4249 -0
- package/bin/skills/pytorch-lightning/SKILL.md +346 -0
- package/bin/skills/pytorch-lightning/references/callbacks.md +436 -0
- package/bin/skills/pytorch-lightning/references/distributed.md +490 -0
- package/bin/skills/pytorch-lightning/references/hyperparameter-tuning.md +556 -0
- package/bin/skills/pyvene/SKILL.md +473 -0
- package/bin/skills/pyvene/references/README.md +73 -0
- package/bin/skills/pyvene/references/api.md +383 -0
- package/bin/skills/pyvene/references/tutorials.md +376 -0
- package/bin/skills/qdrant/SKILL.md +493 -0
- package/bin/skills/qdrant/references/advanced-usage.md +648 -0
- package/bin/skills/qdrant/references/troubleshooting.md +631 -0
- package/bin/skills/ray-data/SKILL.md +326 -0
- package/bin/skills/ray-data/references/integration.md +82 -0
- package/bin/skills/ray-data/references/transformations.md +83 -0
- package/bin/skills/ray-train/SKILL.md +406 -0
- package/bin/skills/ray-train/references/multi-node.md +628 -0
- package/bin/skills/rwkv/SKILL.md +260 -0
- package/bin/skills/rwkv/references/architecture-details.md +344 -0
- package/bin/skills/rwkv/references/rwkv7.md +386 -0
- package/bin/skills/rwkv/references/state-management.md +369 -0
- package/bin/skills/saelens/SKILL.md +386 -0
- package/bin/skills/saelens/references/README.md +70 -0
- package/bin/skills/saelens/references/api.md +333 -0
- package/bin/skills/saelens/references/tutorials.md +318 -0
- package/bin/skills/segment-anything/SKILL.md +500 -0
- package/bin/skills/segment-anything/references/advanced-usage.md +589 -0
- package/bin/skills/segment-anything/references/troubleshooting.md +484 -0
- package/bin/skills/sentence-transformers/SKILL.md +255 -0
- package/bin/skills/sentence-transformers/references/models.md +123 -0
- package/bin/skills/sentencepiece/SKILL.md +235 -0
- package/bin/skills/sentencepiece/references/algorithms.md +200 -0
- package/bin/skills/sentencepiece/references/training.md +304 -0
- package/bin/skills/sglang/SKILL.md +442 -0
- package/bin/skills/sglang/references/deployment.md +490 -0
- package/bin/skills/sglang/references/radix-attention.md +413 -0
- package/bin/skills/sglang/references/structured-generation.md +541 -0
- package/bin/skills/simpo/SKILL.md +219 -0
- package/bin/skills/simpo/references/datasets.md +478 -0
- package/bin/skills/simpo/references/hyperparameters.md +452 -0
- package/bin/skills/simpo/references/loss-functions.md +350 -0
- package/bin/skills/skypilot/SKILL.md +509 -0
- package/bin/skills/skypilot/references/advanced-usage.md +491 -0
- package/bin/skills/skypilot/references/troubleshooting.md +570 -0
- package/bin/skills/slime/SKILL.md +464 -0
- package/bin/skills/slime/references/api-reference.md +392 -0
- package/bin/skills/slime/references/troubleshooting.md +386 -0
- package/bin/skills/speculative-decoding/SKILL.md +467 -0
- package/bin/skills/speculative-decoding/references/lookahead.md +309 -0
- package/bin/skills/speculative-decoding/references/medusa.md +350 -0
- package/bin/skills/stable-diffusion/SKILL.md +519 -0
- package/bin/skills/stable-diffusion/references/advanced-usage.md +716 -0
- package/bin/skills/stable-diffusion/references/troubleshooting.md +555 -0
- package/bin/skills/tensorboard/SKILL.md +629 -0
- package/bin/skills/tensorboard/references/integrations.md +638 -0
- package/bin/skills/tensorboard/references/profiling.md +545 -0
- package/bin/skills/tensorboard/references/visualization.md +620 -0
- package/bin/skills/tensorrt-llm/SKILL.md +187 -0
- package/bin/skills/tensorrt-llm/references/multi-gpu.md +298 -0
- package/bin/skills/tensorrt-llm/references/optimization.md +242 -0
- package/bin/skills/tensorrt-llm/references/serving.md +470 -0
- package/bin/skills/tinker/SKILL.md +362 -0
- package/bin/skills/tinker/references/api-reference.md +168 -0
- package/bin/skills/tinker/references/getting-started.md +157 -0
- package/bin/skills/tinker/references/loss-functions.md +163 -0
- package/bin/skills/tinker/references/models-and-lora.md +139 -0
- package/bin/skills/tinker/references/recipes.md +280 -0
- package/bin/skills/tinker/references/reinforcement-learning.md +212 -0
- package/bin/skills/tinker/references/rendering.md +243 -0
- package/bin/skills/tinker/references/supervised-learning.md +232 -0
- package/bin/skills/tinker-training-cost/SKILL.md +187 -0
- package/bin/skills/tinker-training-cost/scripts/calculate_cost.py +123 -0
- package/bin/skills/torchforge/SKILL.md +433 -0
- package/bin/skills/torchforge/references/api-reference.md +327 -0
- package/bin/skills/torchforge/references/troubleshooting.md +409 -0
- package/bin/skills/torchtitan/SKILL.md +358 -0
- package/bin/skills/torchtitan/references/checkpoint.md +181 -0
- package/bin/skills/torchtitan/references/custom-models.md +258 -0
- package/bin/skills/torchtitan/references/float8.md +133 -0
- package/bin/skills/torchtitan/references/fsdp.md +126 -0
- package/bin/skills/transformer-lens/SKILL.md +346 -0
- package/bin/skills/transformer-lens/references/README.md +54 -0
- package/bin/skills/transformer-lens/references/api.md +362 -0
- package/bin/skills/transformer-lens/references/tutorials.md +339 -0
- package/bin/skills/trl-fine-tuning/SKILL.md +455 -0
- package/bin/skills/trl-fine-tuning/references/dpo-variants.md +227 -0
- package/bin/skills/trl-fine-tuning/references/online-rl.md +82 -0
- package/bin/skills/trl-fine-tuning/references/reward-modeling.md +122 -0
- package/bin/skills/trl-fine-tuning/references/sft-training.md +168 -0
- package/bin/skills/unsloth/SKILL.md +80 -0
- package/bin/skills/unsloth/references/index.md +7 -0
- package/bin/skills/unsloth/references/llms-full.md +16799 -0
- package/bin/skills/unsloth/references/llms-txt.md +12044 -0
- package/bin/skills/unsloth/references/llms.md +82 -0
- package/bin/skills/verl/SKILL.md +391 -0
- package/bin/skills/verl/references/api-reference.md +301 -0
- package/bin/skills/verl/references/troubleshooting.md +391 -0
- package/bin/skills/vllm/SKILL.md +364 -0
- package/bin/skills/vllm/references/optimization.md +226 -0
- package/bin/skills/vllm/references/quantization.md +284 -0
- package/bin/skills/vllm/references/server-deployment.md +255 -0
- package/bin/skills/vllm/references/troubleshooting.md +447 -0
- package/bin/skills/weights-and-biases/SKILL.md +590 -0
- package/bin/skills/weights-and-biases/references/artifacts.md +584 -0
- package/bin/skills/weights-and-biases/references/integrations.md +700 -0
- package/bin/skills/weights-and-biases/references/sweeps.md +847 -0
- package/bin/skills/whisper/SKILL.md +317 -0
- package/bin/skills/whisper/references/languages.md +189 -0
- package/bin/synsc +0 -0
- package/package.json +10 -0
|
@@ -0,0 +1,299 @@
|
|
|
1
|
+
---
|
|
2
|
+
title: {{TITLE}}
|
|
3
|
+
authors: {{AUTHORS}}
|
|
4
|
+
date: {{DATE}}
|
|
5
|
+
arxiv:
|
|
6
|
+
layout: arxiv
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# {{TITLE}}
|
|
10
|
+
|
|
11
|
+
<div class="arxiv-header">
|
|
12
|
+
|
|
13
|
+
**{{AUTHORS}}**
|
|
14
|
+
|
|
15
|
+
*Submitted to arXiv: {{DATE}}*
|
|
16
|
+
|
|
17
|
+
</div>
|
|
18
|
+
|
|
19
|
+
---
|
|
20
|
+
|
|
21
|
+
**Abstract**—{{ABSTRACT}}
|
|
22
|
+
|
|
23
|
+
**Index Terms**—Machine Learning, Deep Learning, Neural Networks
|
|
24
|
+
|
|
25
|
+
---
|
|
26
|
+
|
|
27
|
+
## I. INTRODUCTION
|
|
28
|
+
|
|
29
|
+
**T**HIS paper presents [brief overview of the contribution]. The main contributions of this work are:
|
|
30
|
+
|
|
31
|
+
- Contribution 1: Description
|
|
32
|
+
- Contribution 2: Description
|
|
33
|
+
- Contribution 3: Description
|
|
34
|
+
|
|
35
|
+
The rest of this paper is organized as follows: Section II reviews related work, Section III describes the proposed methodology, Section IV presents experimental results, and Section V concludes the paper.
|
|
36
|
+
|
|
37
|
+
## II. RELATED WORK
|
|
38
|
+
|
|
39
|
+
### A. Subarea 1
|
|
40
|
+
|
|
41
|
+
Discussion of relevant prior work in subarea 1.
|
|
42
|
+
|
|
43
|
+
### B. Subarea 2
|
|
44
|
+
|
|
45
|
+
Discussion of relevant prior work in subarea 2.
|
|
46
|
+
|
|
47
|
+
### C. Comparison with Prior Art
|
|
48
|
+
|
|
49
|
+
Table comparing existing methods:
|
|
50
|
+
|
|
51
|
+
| Method | Year | Approach | Limitation |
|
|
52
|
+
|--------|------|----------|------------|
|
|
53
|
+
| Method A [1] | 2020 | Description | Issue |
|
|
54
|
+
| Method B [2] | 2021 | Description | Issue |
|
|
55
|
+
| Method C [3] | 2023 | Description | Issue |
|
|
56
|
+
|
|
57
|
+
## III. METHODOLOGY
|
|
58
|
+
|
|
59
|
+
### A. Problem Formulation
|
|
60
|
+
|
|
61
|
+
Let $X = \{x_1, x_2, ..., x_n\}$ be the input space and $Y = \{y_1, y_2, ..., y_m\}$ be the output space. We aim to learn a function $f: X \rightarrow Y$ that minimizes:
|
|
62
|
+
|
|
63
|
+
$$
|
|
64
|
+
\mathcal{L}(\theta) = \sum_{i=1}^{N} \ell(f(x_i; \theta), y_i) + \lambda R(\theta)
|
|
65
|
+
$$
|
|
66
|
+
|
|
67
|
+
where $\theta$ represents model parameters, $\ell$ is the loss function, and $R(\theta)$ is a regularization term.
|
|
68
|
+
|
|
69
|
+
### B. Model Architecture
|
|
70
|
+
|
|
71
|
+
Describe the model architecture in detail.
|
|
72
|
+
|
|
73
|
+
**Input Layer**: Description
|
|
74
|
+
|
|
75
|
+
**Hidden Layers**: Let $h^{(l)}$ denote the activation of layer $l$:
|
|
76
|
+
|
|
77
|
+
$$
|
|
78
|
+
h^{(l)} = \sigma(W^{(l)}h^{(l-1)} + b^{(l)})
|
|
79
|
+
$$
|
|
80
|
+
|
|
81
|
+
where $\sigma$ is the activation function, $W^{(l)}$ is the weight matrix, and $b^{(l)}$ is the bias vector.
|
|
82
|
+
|
|
83
|
+
**Output Layer**: Description
|
|
84
|
+
|
|
85
|
+
### C. Training Algorithm
|
|
86
|
+
|
|
87
|
+
**Algorithm 1**: Training Procedure
|
|
88
|
+
|
|
89
|
+
```
|
|
90
|
+
1: Input: Training data D = {(xi, yi)}
|
|
91
|
+
2: Initialize parameters θ
|
|
92
|
+
3: for epoch = 1 to max_epochs do
|
|
93
|
+
4: for each mini-batch B ⊂ D do
|
|
94
|
+
5: Compute loss: L(θ) = 1/|B| Σ ℓ(f(xi; θ), yi)
|
|
95
|
+
6: Update: θ ← θ - η∇θL(θ)
|
|
96
|
+
7: end for
|
|
97
|
+
8: end for
|
|
98
|
+
9: Return: Trained parameters θ*
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
### D. Complexity Analysis
|
|
102
|
+
|
|
103
|
+
**Time Complexity**: The training algorithm has time complexity $O(NTE)$ where $N$ is the dataset size, $T$ is the number of epochs, and $E$ is the per-example computation cost.
|
|
104
|
+
|
|
105
|
+
**Space Complexity**: The model requires $O(P)$ space where $P$ is the number of parameters.
|
|
106
|
+
|
|
107
|
+
## IV. EXPERIMENTS
|
|
108
|
+
|
|
109
|
+
### A. Experimental Setup
|
|
110
|
+
|
|
111
|
+
**Datasets**: We evaluate on the following benchmarks:
|
|
112
|
+
|
|
113
|
+
1. **Dataset A**: Description (size, splits, characteristics)
|
|
114
|
+
2. **Dataset B**: Description
|
|
115
|
+
3. **Dataset C**: Description
|
|
116
|
+
|
|
117
|
+
**Baselines**: We compare against:
|
|
118
|
+
|
|
119
|
+
- Baseline 1 [4]: Description
|
|
120
|
+
- Baseline 2 [5]: Description
|
|
121
|
+
- Baseline 3 [6]: Description
|
|
122
|
+
|
|
123
|
+
**Evaluation Metrics**: Performance is measured using:
|
|
124
|
+
|
|
125
|
+
- Metric 1: Definition
|
|
126
|
+
- Metric 2: Definition
|
|
127
|
+
- Metric 3: Definition
|
|
128
|
+
|
|
129
|
+
**Implementation Details**: All experiments are conducted using:
|
|
130
|
+
|
|
131
|
+
- Framework: PyTorch 2.0
|
|
132
|
+
- Hardware: NVIDIA A100 GPUs
|
|
133
|
+
- Hyperparameters: Learning rate $\eta = 10^{-4}$, batch size $B = 32$, epochs $T = 100$
|
|
134
|
+
|
|
135
|
+
### B. Quantitative Results
|
|
136
|
+
|
|
137
|
+
**TABLE I: MAIN RESULTS**
|
|
138
|
+
|
|
139
|
+
| Method | Dataset A | Dataset B | Dataset C | Average |
|
|
140
|
+
|--------|-----------|-----------|-----------|---------|
|
|
141
|
+
| Baseline 1 [4] | 82.3 | 78.5 | 80.1 | 80.3 |
|
|
142
|
+
| Baseline 2 [5] | 85.7 | 82.1 | 83.9 | 83.9 |
|
|
143
|
+
| Baseline 3 [6] | 88.1 | 85.3 | 86.7 | 86.7 |
|
|
144
|
+
| **Ours** | **91.2** | **88.9** | **90.1** | **90.1** |
|
|
145
|
+
|
|
146
|
+
Our method achieves state-of-the-art performance across all three benchmarks, with an average improvement of 3.4 percentage points over the previous best method.
|
|
147
|
+
|
|
148
|
+
### C. Ablation Study
|
|
149
|
+
|
|
150
|
+
**TABLE II: ABLATION STUDY RESULTS**
|
|
151
|
+
|
|
152
|
+
| Configuration | Dataset A | Δ |
|
|
153
|
+
|---------------|-----------|---|
|
|
154
|
+
| Full Model | 91.2 | - |
|
|
155
|
+
| w/o Component A | 88.7 | -2.5 |
|
|
156
|
+
| w/o Component B | 89.4 | -1.8 |
|
|
157
|
+
| w/o Component C | 90.5 | -0.7 |
|
|
158
|
+
|
|
159
|
+
The ablation study demonstrates that all components contribute to the final performance, with Component A having the largest impact.
|
|
160
|
+
|
|
161
|
+
### D. Qualitative Analysis
|
|
162
|
+
|
|
163
|
+
**Fig. 1**: Visualization of learned representations using t-SNE projection.
|
|
164
|
+
|
|
165
|
+
**Fig. 2**: Example predictions showing correct classifications and failure cases.
|
|
166
|
+
|
|
167
|
+
### E. Computational Efficiency
|
|
168
|
+
|
|
169
|
+
**TABLE III: COMPUTATIONAL REQUIREMENTS**
|
|
170
|
+
|
|
171
|
+
| Method | Parameters | FLOPs | Inference (ms) |
|
|
172
|
+
|--------|------------|-------|----------------|
|
|
173
|
+
| Baseline 1 [4] | 50M | 10G | 8.2 |
|
|
174
|
+
| Baseline 2 [5] | 100M | 25G | 15.7 |
|
|
175
|
+
| Baseline 3 [6] | 200M | 50G | 28.3 |
|
|
176
|
+
| **Ours** | **80M** | **18G** | **12.1** |
|
|
177
|
+
|
|
178
|
+
Our method achieves superior performance while maintaining reasonable computational costs.
|
|
179
|
+
|
|
180
|
+
## V. DISCUSSION
|
|
181
|
+
|
|
182
|
+
### A. Analysis of Results
|
|
183
|
+
|
|
184
|
+
The experimental results demonstrate that [analysis].
|
|
185
|
+
|
|
186
|
+
### B. Limitations
|
|
187
|
+
|
|
188
|
+
Current limitations include:
|
|
189
|
+
|
|
190
|
+
1. Limitation 1: Description
|
|
191
|
+
2. Limitation 2: Description
|
|
192
|
+
3. Limitation 3: Description
|
|
193
|
+
|
|
194
|
+
### C. Broader Impact
|
|
195
|
+
|
|
196
|
+
Potential applications include:
|
|
197
|
+
|
|
198
|
+
- Application 1: Description
|
|
199
|
+
- Application 2: Description
|
|
200
|
+
- Application 3: Description
|
|
201
|
+
|
|
202
|
+
**Ethical Considerations**: [Discussion of potential risks and mitigation strategies]
|
|
203
|
+
|
|
204
|
+
## VI. CONCLUSION
|
|
205
|
+
|
|
206
|
+
This paper presented {{TITLE}}, which achieves [main achievement]. The key contributions are:
|
|
207
|
+
|
|
208
|
+
1. Contribution 1: Summary
|
|
209
|
+
2. Contribution 2: Summary
|
|
210
|
+
3. Contribution 3: Summary
|
|
211
|
+
|
|
212
|
+
Future work will focus on [future directions].
|
|
213
|
+
|
|
214
|
+
## ACKNOWLEDGMENTS
|
|
215
|
+
|
|
216
|
+
The authors thank [acknowledgments]. This work was supported by [funding sources].
|
|
217
|
+
|
|
218
|
+
## REFERENCES
|
|
219
|
+
|
|
220
|
+
[1] Author A et al., "Paper Title," *Conference Name*, 2020.
|
|
221
|
+
|
|
222
|
+
[2] Author B et al., "Paper Title," *Journal Name*, vol. X, no. Y, pp. Z-W, 2021.
|
|
223
|
+
|
|
224
|
+
[3] Author C et al., "Paper Title," *arXiv preprint arXiv:XXXX.XXXXX*, 2023.
|
|
225
|
+
|
|
226
|
+
[4] Author D et al., "Baseline 1 Paper," *Conference*, 2019.
|
|
227
|
+
|
|
228
|
+
[5] Author E et al., "Baseline 2 Paper," *Conference*, 2021.
|
|
229
|
+
|
|
230
|
+
[6] Author F et al., "Baseline 3 Paper," *Conference*, 2023.
|
|
231
|
+
|
|
232
|
+
---
|
|
233
|
+
|
|
234
|
+
## APPENDIX A: ADDITIONAL EXPERIMENTS
|
|
235
|
+
|
|
236
|
+
Supplementary experimental results.
|
|
237
|
+
|
|
238
|
+
## APPENDIX B: PROOF OF THEOREM
|
|
239
|
+
|
|
240
|
+
**Theorem 1**: Statement of theorem.
|
|
241
|
+
|
|
242
|
+
**Proof**: Detailed proof.
|
|
243
|
+
|
|
244
|
+
## APPENDIX C: HYPERPARAMETERS
|
|
245
|
+
|
|
246
|
+
Complete list of hyperparameters used in all experiments:
|
|
247
|
+
|
|
248
|
+
| Hyperparameter | Value | Description |
|
|
249
|
+
|----------------|-------|-------------|
|
|
250
|
+
| Learning rate | $10^{-4}$ | Initial learning rate |
|
|
251
|
+
| Batch size | 32 | Training batch size |
|
|
252
|
+
| Epochs | 100 | Number of training epochs |
|
|
253
|
+
| Optimizer | AdamW | Optimization algorithm |
|
|
254
|
+
| Weight decay | 0.01 | L2 regularization coefficient |
|
|
255
|
+
| Warmup steps | 1000 | LR warmup duration |
|
|
256
|
+
| Dropout | 0.1 | Dropout probability |
|
|
257
|
+
|
|
258
|
+
---
|
|
259
|
+
|
|
260
|
+
<style>
|
|
261
|
+
.arxiv-header {
|
|
262
|
+
text-align: center;
|
|
263
|
+
margin-bottom: 2em;
|
|
264
|
+
}
|
|
265
|
+
|
|
266
|
+
body {
|
|
267
|
+
font-family: 'Computer Modern', serif;
|
|
268
|
+
line-height: 1.6;
|
|
269
|
+
}
|
|
270
|
+
|
|
271
|
+
h1 {
|
|
272
|
+
text-align: center;
|
|
273
|
+
font-size: 1.8em;
|
|
274
|
+
margin-top: 1em;
|
|
275
|
+
}
|
|
276
|
+
|
|
277
|
+
h2 {
|
|
278
|
+
font-size: 1.3em;
|
|
279
|
+
margin-top: 1.5em;
|
|
280
|
+
font-weight: bold;
|
|
281
|
+
}
|
|
282
|
+
|
|
283
|
+
h3 {
|
|
284
|
+
font-size: 1.1em;
|
|
285
|
+
font-style: italic;
|
|
286
|
+
margin-top: 1em;
|
|
287
|
+
}
|
|
288
|
+
|
|
289
|
+
table {
|
|
290
|
+
margin: 1em auto;
|
|
291
|
+
border-collapse: collapse;
|
|
292
|
+
}
|
|
293
|
+
|
|
294
|
+
th, td {
|
|
295
|
+
border: 1px solid #000;
|
|
296
|
+
padding: 0.5em;
|
|
297
|
+
text-align: center;
|
|
298
|
+
}
|
|
299
|
+
</style>
|
|
@@ -0,0 +1,358 @@
|
|
|
1
|
+
---
|
|
2
|
+
title: {{TITLE}}
|
|
3
|
+
authors: {{AUTHORS}}
|
|
4
|
+
date: {{DATE}}
|
|
5
|
+
type: ml-experiment-report
|
|
6
|
+
tags: [machine-learning, experiment-report]
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# {{TITLE}}
|
|
10
|
+
|
|
11
|
+
**Machine Learning Experiment Report**
|
|
12
|
+
|
|
13
|
+
**Researchers**: {{AUTHORS}}
|
|
14
|
+
**Date**: {{DATE}}
|
|
15
|
+
**Status**: Draft / Final / In Review
|
|
16
|
+
|
|
17
|
+
---
|
|
18
|
+
|
|
19
|
+
## Executive Summary
|
|
20
|
+
|
|
21
|
+
{{ABSTRACT}}
|
|
22
|
+
|
|
23
|
+
### Key Findings
|
|
24
|
+
- Finding 1
|
|
25
|
+
- Finding 2
|
|
26
|
+
- Finding 3
|
|
27
|
+
|
|
28
|
+
### Recommendations
|
|
29
|
+
- Recommendation 1
|
|
30
|
+
- Recommendation 2
|
|
31
|
+
|
|
32
|
+
---
|
|
33
|
+
|
|
34
|
+
## 1. Objective
|
|
35
|
+
|
|
36
|
+
### 1.1 Research Question
|
|
37
|
+
|
|
38
|
+
What specific question are we trying to answer?
|
|
39
|
+
|
|
40
|
+
### 1.2 Success Criteria
|
|
41
|
+
|
|
42
|
+
How will we measure success?
|
|
43
|
+
|
|
44
|
+
- **Metric 1**: Target value
|
|
45
|
+
- **Metric 2**: Target value
|
|
46
|
+
- **Metric 3**: Target value
|
|
47
|
+
|
|
48
|
+
### 1.3 Constraints
|
|
49
|
+
|
|
50
|
+
- Computational budget
|
|
51
|
+
- Time constraints
|
|
52
|
+
- Data availability
|
|
53
|
+
|
|
54
|
+
---
|
|
55
|
+
|
|
56
|
+
## 2. Dataset
|
|
57
|
+
|
|
58
|
+
### 2.1 Data Description
|
|
59
|
+
|
|
60
|
+
| Property | Value |
|
|
61
|
+
|----------|-------|
|
|
62
|
+
| **Name** | Dataset name |
|
|
63
|
+
| **Source** | Origin of data |
|
|
64
|
+
| **Size** | Number of examples |
|
|
65
|
+
| **Features** | Feature count and types |
|
|
66
|
+
| **Target** | What we're predicting |
|
|
67
|
+
| **License** | Usage rights |
|
|
68
|
+
|
|
69
|
+
### 2.2 Data Splits
|
|
70
|
+
|
|
71
|
+
| Split | Size | Percentage |
|
|
72
|
+
|-------|------|------------|
|
|
73
|
+
| Train | X examples | Y% |
|
|
74
|
+
| Validation | X examples | Y% |
|
|
75
|
+
| Test | X examples | Y% |
|
|
76
|
+
|
|
77
|
+
### 2.3 Data Quality
|
|
78
|
+
|
|
79
|
+
- **Missing Values**: Analysis and handling
|
|
80
|
+
- **Outliers**: Detection and treatment
|
|
81
|
+
- **Imbalance**: Class distribution
|
|
82
|
+
- **Preprocessing**: Transformations applied
|
|
83
|
+
|
|
84
|
+
### 2.4 Exploratory Analysis
|
|
85
|
+
|
|
86
|
+
Key insights from data exploration:
|
|
87
|
+
|
|
88
|
+
1. Pattern 1
|
|
89
|
+
2. Pattern 2
|
|
90
|
+
3. Pattern 3
|
|
91
|
+
|
|
92
|
+
---
|
|
93
|
+
|
|
94
|
+
## 3. Model
|
|
95
|
+
|
|
96
|
+
### 3.1 Architecture
|
|
97
|
+
|
|
98
|
+
Describe the model architecture:
|
|
99
|
+
|
|
100
|
+
```
|
|
101
|
+
Input → Layer 1 → Layer 2 → ... → Output
|
|
102
|
+
```
|
|
103
|
+
|
|
104
|
+
### 3.2 Model Specifications
|
|
105
|
+
|
|
106
|
+
| Component | Configuration |
|
|
107
|
+
|-----------|--------------|
|
|
108
|
+
| **Type** | Model family |
|
|
109
|
+
| **Parameters** | Total count |
|
|
110
|
+
| **Layers** | Number and types |
|
|
111
|
+
| **Activation** | Functions used |
|
|
112
|
+
| **Dropout** | Regularization rate |
|
|
113
|
+
|
|
114
|
+
### 3.3 Baseline Models
|
|
115
|
+
|
|
116
|
+
What are we comparing against?
|
|
117
|
+
|
|
118
|
+
1. **Baseline 1**: Simple baseline (e.g., majority class)
|
|
119
|
+
2. **Baseline 2**: Standard approach (e.g., logistic regression)
|
|
120
|
+
3. **Baseline 3**: Previous best method
|
|
121
|
+
|
|
122
|
+
---
|
|
123
|
+
|
|
124
|
+
## 4. Training
|
|
125
|
+
|
|
126
|
+
### 4.1 Hyperparameters
|
|
127
|
+
|
|
128
|
+
| Hyperparameter | Value | Rationale |
|
|
129
|
+
|----------------|-------|-----------|
|
|
130
|
+
| Learning Rate | 1e-4 | Tuned via grid search |
|
|
131
|
+
| Batch Size | 32 | GPU memory constraint |
|
|
132
|
+
| Epochs | 100 | Based on validation |
|
|
133
|
+
| Optimizer | AdamW | Standard for transformers |
|
|
134
|
+
| Weight Decay | 0.01 | Regularization |
|
|
135
|
+
| LR Schedule | Cosine | Smooth convergence |
|
|
136
|
+
|
|
137
|
+
### 4.2 Training Process
|
|
138
|
+
|
|
139
|
+
```python
|
|
140
|
+
# Training pseudocode
|
|
141
|
+
for epoch in range(num_epochs):
|
|
142
|
+
train_loss = train_one_epoch(model, train_loader)
|
|
143
|
+
val_loss = validate(model, val_loader)
|
|
144
|
+
if val_loss < best_loss:
|
|
145
|
+
save_checkpoint(model)
|
|
146
|
+
```
|
|
147
|
+
|
|
148
|
+
### 4.3 Computational Resources
|
|
149
|
+
|
|
150
|
+
| Resource | Specification |
|
|
151
|
+
|----------|--------------|
|
|
152
|
+
| **Hardware** | GPU model and count |
|
|
153
|
+
| **Memory** | RAM and VRAM |
|
|
154
|
+
| **Training Time** | Hours/days |
|
|
155
|
+
| **Cost** | Estimated compute cost |
|
|
156
|
+
|
|
157
|
+
### 4.4 Training Curves
|
|
158
|
+
|
|
159
|
+
Include plots of:
|
|
160
|
+
- Training loss over time
|
|
161
|
+
- Validation loss over time
|
|
162
|
+
- Learning rate schedule
|
|
163
|
+
- Other relevant metrics
|
|
164
|
+
|
|
165
|
+
---
|
|
166
|
+
|
|
167
|
+
## 5. Results
|
|
168
|
+
|
|
169
|
+
### 5.1 Quantitative Results
|
|
170
|
+
|
|
171
|
+
| Model | Accuracy | Precision | Recall | F1 | AUC |
|
|
172
|
+
|-------|----------|-----------|--------|-------|-----|
|
|
173
|
+
| Baseline 1 | 0.65 | 0.64 | 0.66 | 0.65 | 0.70 |
|
|
174
|
+
| Baseline 2 | 0.78 | 0.77 | 0.79 | 0.78 | 0.82 |
|
|
175
|
+
| **Ours** | **0.89** | **0.88** | **0.90** | **0.89** | **0.93** |
|
|
176
|
+
|
|
177
|
+
### 5.2 Statistical Significance
|
|
178
|
+
|
|
179
|
+
- **P-value**: Statistical test results
|
|
180
|
+
- **Confidence Intervals**: 95% CI for key metrics
|
|
181
|
+
- **Multiple Runs**: Mean ± std over N runs
|
|
182
|
+
|
|
183
|
+
### 5.3 Per-Class Performance
|
|
184
|
+
|
|
185
|
+
| Class | Precision | Recall | F1 | Support |
|
|
186
|
+
|-------|-----------|--------|-----|---------|
|
|
187
|
+
| Class 1 | 0.90 | 0.88 | 0.89 | 500 |
|
|
188
|
+
| Class 2 | 0.87 | 0.91 | 0.89 | 450 |
|
|
189
|
+
| Class 3 | 0.88 | 0.89 | 0.88 | 550 |
|
|
190
|
+
|
|
191
|
+
### 5.4 Qualitative Results
|
|
192
|
+
|
|
193
|
+
#### Success Cases
|
|
194
|
+
|
|
195
|
+
Examples where the model performs well.
|
|
196
|
+
|
|
197
|
+
#### Failure Cases
|
|
198
|
+
|
|
199
|
+
Examples where the model fails and why.
|
|
200
|
+
|
|
201
|
+
---
|
|
202
|
+
|
|
203
|
+
## 6. Analysis
|
|
204
|
+
|
|
205
|
+
### 6.1 Ablation Study
|
|
206
|
+
|
|
207
|
+
| Configuration | Score | Change |
|
|
208
|
+
|---------------|-------|--------|
|
|
209
|
+
| Full Model | 0.89 | - |
|
|
210
|
+
| - Feature Set A | 0.85 | -0.04 |
|
|
211
|
+
| - Feature Set B | 0.87 | -0.02 |
|
|
212
|
+
| - Augmentation | 0.86 | -0.03 |
|
|
213
|
+
|
|
214
|
+
### 6.2 Error Analysis
|
|
215
|
+
|
|
216
|
+
What types of errors is the model making?
|
|
217
|
+
|
|
218
|
+
1. **Error Type 1**: Frequency and cause
|
|
219
|
+
2. **Error Type 2**: Frequency and cause
|
|
220
|
+
3. **Error Type 3**: Frequency and cause
|
|
221
|
+
|
|
222
|
+
### 6.3 Feature Importance
|
|
223
|
+
|
|
224
|
+
Which features matter most?
|
|
225
|
+
|
|
226
|
+
| Feature | Importance | Notes |
|
|
227
|
+
|---------|------------|-------|
|
|
228
|
+
| Feature 1 | 0.35 | Most predictive |
|
|
229
|
+
| Feature 2 | 0.28 | Secondary signal |
|
|
230
|
+
| Feature 3 | 0.15 | Marginal impact |
|
|
231
|
+
|
|
232
|
+
---
|
|
233
|
+
|
|
234
|
+
## 7. Robustness
|
|
235
|
+
|
|
236
|
+
### 7.1 Cross-Dataset Evaluation
|
|
237
|
+
|
|
238
|
+
How does the model generalize to other datasets?
|
|
239
|
+
|
|
240
|
+
| Dataset | Score | Notes |
|
|
241
|
+
|---------|-------|-------|
|
|
242
|
+
| Original | 0.89 | Training distribution |
|
|
243
|
+
| Dataset A | 0.82 | Similar domain |
|
|
244
|
+
| Dataset B | 0.71 | Different domain |
|
|
245
|
+
|
|
246
|
+
### 7.2 Adversarial Robustness
|
|
247
|
+
|
|
248
|
+
Performance under adversarial conditions.
|
|
249
|
+
|
|
250
|
+
### 7.3 Fairness Analysis
|
|
251
|
+
|
|
252
|
+
Performance across demographic groups or sensitive attributes.
|
|
253
|
+
|
|
254
|
+
---
|
|
255
|
+
|
|
256
|
+
## 8. Deployment Considerations
|
|
257
|
+
|
|
258
|
+
### 8.1 Model Size
|
|
259
|
+
|
|
260
|
+
- **Parameters**: Total count
|
|
261
|
+
- **Disk Size**: MB/GB on disk
|
|
262
|
+
- **Memory**: Runtime memory usage
|
|
263
|
+
|
|
264
|
+
### 8.2 Inference Speed
|
|
265
|
+
|
|
266
|
+
| Batch Size | Latency | Throughput |
|
|
267
|
+
|------------|---------|------------|
|
|
268
|
+
| 1 | 10ms | 100 QPS |
|
|
269
|
+
| 8 | 45ms | 178 QPS |
|
|
270
|
+
| 32 | 150ms | 213 QPS |
|
|
271
|
+
|
|
272
|
+
### 8.3 Production Requirements
|
|
273
|
+
|
|
274
|
+
- **Dependencies**: Software requirements
|
|
275
|
+
- **Infrastructure**: Hardware needs
|
|
276
|
+
- **Monitoring**: What to track in production
|
|
277
|
+
- **Fallback**: Backup strategy
|
|
278
|
+
|
|
279
|
+
---
|
|
280
|
+
|
|
281
|
+
## 9. Conclusions
|
|
282
|
+
|
|
283
|
+
### 9.1 Summary
|
|
284
|
+
|
|
285
|
+
Key takeaways from the experiment.
|
|
286
|
+
|
|
287
|
+
### 9.2 Did We Meet Objectives?
|
|
288
|
+
|
|
289
|
+
| Objective | Status | Notes |
|
|
290
|
+
|-----------|--------|-------|
|
|
291
|
+
| Objective 1 | ✅ Met | Achieved target |
|
|
292
|
+
| Objective 2 | ⚠️ Partial | Close to target |
|
|
293
|
+
| Objective 3 | ❌ Not Met | Needs more work |
|
|
294
|
+
|
|
295
|
+
### 9.3 Lessons Learned
|
|
296
|
+
|
|
297
|
+
What did we learn from this experiment?
|
|
298
|
+
|
|
299
|
+
1. Lesson 1
|
|
300
|
+
2. Lesson 2
|
|
301
|
+
3. Lesson 3
|
|
302
|
+
|
|
303
|
+
---
|
|
304
|
+
|
|
305
|
+
## 10. Next Steps
|
|
306
|
+
|
|
307
|
+
### 10.1 Short-term (1-2 weeks)
|
|
308
|
+
|
|
309
|
+
- [ ] Task 1
|
|
310
|
+
- [ ] Task 2
|
|
311
|
+
- [ ] Task 3
|
|
312
|
+
|
|
313
|
+
### 10.2 Medium-term (1-2 months)
|
|
314
|
+
|
|
315
|
+
- [ ] Task 1
|
|
316
|
+
- [ ] Task 2
|
|
317
|
+
- [ ] Task 3
|
|
318
|
+
|
|
319
|
+
### 10.3 Long-term (3+ months)
|
|
320
|
+
|
|
321
|
+
- [ ] Task 1
|
|
322
|
+
- [ ] Task 2
|
|
323
|
+
- [ ] Task 3
|
|
324
|
+
|
|
325
|
+
---
|
|
326
|
+
|
|
327
|
+
## References
|
|
328
|
+
|
|
329
|
+
1. Reference 1
|
|
330
|
+
2. Reference 2
|
|
331
|
+
3. Reference 3
|
|
332
|
+
|
|
333
|
+
---
|
|
334
|
+
|
|
335
|
+
## Appendix
|
|
336
|
+
|
|
337
|
+
### A. Hyperparameter Search
|
|
338
|
+
|
|
339
|
+
Results from hyperparameter tuning.
|
|
340
|
+
|
|
341
|
+
### B. Additional Experiments
|
|
342
|
+
|
|
343
|
+
Supplementary experiments not included in main text.
|
|
344
|
+
|
|
345
|
+
### C. Code
|
|
346
|
+
|
|
347
|
+
Links to code repositories:
|
|
348
|
+
- Training code: [link]
|
|
349
|
+
- Evaluation code: [link]
|
|
350
|
+
- Model checkpoint: [link]
|
|
351
|
+
|
|
352
|
+
### D. Data Card
|
|
353
|
+
|
|
354
|
+
Detailed data documentation following standard practices.
|
|
355
|
+
|
|
356
|
+
### E. Model Card
|
|
357
|
+
|
|
358
|
+
Model documentation following responsible AI practices.
|