@ruaruababa/vibe-kit 1.0.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CATALOG.md +317 -0
- package/README.md +121 -0
- package/aliases.json +65 -0
- package/bin/vibe.js +2 -0
- package/bundles.json +265 -0
- package/catalog.json +1560 -0
- package/dist/antigravity-skills/bin/cli.js +438 -0
- package/dist/antigravity-skills/lib/skill-utils.js +158 -0
- package/dist/antigravity-skills/scripts/build-catalog.js +305 -0
- package/dist/antigravity-skills/scripts/normalize-frontmatter.js +144 -0
- package/dist/antigravity-skills/scripts/validate-skills.js +230 -0
- package/dist/bin/vibe.js +2 -0
- package/dist/dist/src/cli/index.js +26 -0
- package/dist/lib/skill-utils.js +158 -0
- package/dist/scripts/build-catalog.js +50 -0
- package/dist/scripts/normalize-frontmatter.js +144 -0
- package/dist/scripts/validate-skills.js +56 -0
- package/dist/src/cli/index.js +146 -0
- package/dist/src/types/index.js +13 -0
- package/dist/src/utils/fs.js +1 -0
- package/package.json +43 -0
- package/skills/accessibility-compliance-accessibility-audit/SKILL.md +42 -0
- package/skills/accessibility-compliance-accessibility-audit/resources/implementation-playbook.md +502 -0
- package/skills/agent-orchestration-improve-agent/SKILL.md +349 -0
- package/skills/agent-orchestration-multi-agent-optimize/SKILL.md +239 -0
- package/skills/agent-orchestrator/SKILL.md +24 -0
- package/skills/ai-engineer/SKILL.md +171 -0
- package/skills/airflow-dag-patterns/SKILL.md +41 -0
- package/skills/airflow-dag-patterns/resources/implementation-playbook.md +509 -0
- package/skills/angular-migration/SKILL.md +428 -0
- package/skills/anti-reversing-techniques/SKILL.md +42 -0
- package/skills/anti-reversing-techniques/resources/implementation-playbook.md +539 -0
- package/skills/api-design-principles/SKILL.md +37 -0
- package/skills/api-design-principles/assets/api-design-checklist.md +155 -0
- package/skills/api-design-principles/assets/rest-api-template.py +182 -0
- package/skills/api-design-principles/references/graphql-schema-design.md +583 -0
- package/skills/api-design-principles/references/rest-best-practices.md +408 -0
- package/skills/api-design-principles/resources/implementation-playbook.md +513 -0
- package/skills/api-documenter/SKILL.md +184 -0
- package/skills/api-testing-observability-api-mock/SKILL.md +46 -0
- package/skills/api-testing-observability-api-mock/resources/implementation-playbook.md +1327 -0
- package/skills/application-performance-performance-optimization/SKILL.md +154 -0
- package/skills/architect-review/SKILL.md +174 -0
- package/skills/architecture-decision-records/SKILL.md +441 -0
- package/skills/architecture-patterns/SKILL.md +37 -0
- package/skills/architecture-patterns/resources/implementation-playbook.md +479 -0
- package/skills/arm-cortex-expert/SKILL.md +306 -0
- package/skills/async-python-patterns/SKILL.md +39 -0
- package/skills/async-python-patterns/resources/implementation-playbook.md +678 -0
- package/skills/attack-tree-construction/SKILL.md +38 -0
- package/skills/attack-tree-construction/resources/implementation-playbook.md +671 -0
- package/skills/auth-implementation-patterns/SKILL.md +39 -0
- package/skills/auth-implementation-patterns/resources/implementation-playbook.md +618 -0
- package/skills/backend-architect/SKILL.md +333 -0
- package/skills/backend-development-feature-development/SKILL.md +180 -0
- package/skills/backend-security-coder/SKILL.md +156 -0
- package/skills/backtesting-frameworks/SKILL.md +39 -0
- package/skills/backtesting-frameworks/resources/implementation-playbook.md +647 -0
- package/skills/bash-defensive-patterns/SKILL.md +43 -0
- package/skills/bash-defensive-patterns/resources/implementation-playbook.md +517 -0
- package/skills/bash-pro/SKILL.md +310 -0
- package/skills/bats-testing-patterns/SKILL.md +34 -0
- package/skills/bats-testing-patterns/resources/implementation-playbook.md +614 -0
- package/skills/bazel-build-optimization/SKILL.md +397 -0
- package/skills/billing-automation/SKILL.md +42 -0
- package/skills/billing-automation/resources/implementation-playbook.md +544 -0
- package/skills/binary-analysis-patterns/SKILL.md +450 -0
- package/skills/blockchain-developer/SKILL.md +208 -0
- package/skills/business-analyst/SKILL.md +182 -0
- package/skills/c-pro/SKILL.md +56 -0
- package/skills/c4-architecture-c4-architecture/SKILL.md +389 -0
- package/skills/c4-code/SKILL.md +244 -0
- package/skills/c4-component/SKILL.md +153 -0
- package/skills/c4-container/SKILL.md +171 -0
- package/skills/c4-context/SKILL.md +150 -0
- package/skills/changelog-automation/SKILL.md +38 -0
- package/skills/changelog-automation/resources/implementation-playbook.md +538 -0
- package/skills/cicd-automation-workflow-automate/SKILL.md +51 -0
- package/skills/cicd-automation-workflow-automate/resources/implementation-playbook.md +1333 -0
- package/skills/clean-markdown/SKILL.md +23 -0
- package/skills/cloud-architect/SKILL.md +135 -0
- package/skills/code-documentation-code-explain/SKILL.md +46 -0
- package/skills/code-documentation-code-explain/resources/implementation-playbook.md +802 -0
- package/skills/code-documentation-doc-generate/SKILL.md +48 -0
- package/skills/code-documentation-doc-generate/resources/implementation-playbook.md +640 -0
- package/skills/code-refactoring-context-restore/SKILL.md +179 -0
- package/skills/code-refactoring-refactor-clean/SKILL.md +51 -0
- package/skills/code-refactoring-refactor-clean/resources/implementation-playbook.md +879 -0
- package/skills/code-refactoring-tech-debt/SKILL.md +386 -0
- package/skills/code-review-ai-ai-review/SKILL.md +450 -0
- package/skills/code-review-excellence/SKILL.md +40 -0
- package/skills/code-review-excellence/resources/implementation-playbook.md +515 -0
- package/skills/code-reviewer/SKILL.md +178 -0
- package/skills/codebase-cleanup-deps-audit/SKILL.md +51 -0
- package/skills/codebase-cleanup-deps-audit/resources/implementation-playbook.md +766 -0
- package/skills/codebase-cleanup-refactor-clean/SKILL.md +51 -0
- package/skills/codebase-cleanup-refactor-clean/resources/implementation-playbook.md +879 -0
- package/skills/codebase-cleanup-tech-debt/SKILL.md +386 -0
- package/skills/competitive-landscape/SKILL.md +34 -0
- package/skills/competitive-landscape/resources/implementation-playbook.md +494 -0
- package/skills/comprehensive-review-full-review/SKILL.md +146 -0
- package/skills/comprehensive-review-pr-enhance/SKILL.md +46 -0
- package/skills/comprehensive-review-pr-enhance/resources/implementation-playbook.md +691 -0
- package/skills/conductor-implement/SKILL.md +388 -0
- package/skills/conductor-manage/SKILL.md +39 -0
- package/skills/conductor-manage/resources/implementation-playbook.md +1120 -0
- package/skills/conductor-new-track/SKILL.md +433 -0
- package/skills/conductor-revert/SKILL.md +372 -0
- package/skills/conductor-setup/SKILL.md +426 -0
- package/skills/conductor-status/SKILL.md +338 -0
- package/skills/conductor-validator/SKILL.md +62 -0
- package/skills/content-marketer/SKILL.md +170 -0
- package/skills/context-driven-development/SKILL.md +400 -0
- package/skills/context-management-context-restore/SKILL.md +179 -0
- package/skills/context-management-context-save/SKILL.md +177 -0
- package/skills/context-manager/SKILL.md +185 -0
- package/skills/cost-optimization/SKILL.md +286 -0
- package/skills/cpp-pro/SKILL.md +59 -0
- package/skills/cqrs-implementation/SKILL.md +35 -0
- package/skills/cqrs-implementation/resources/implementation-playbook.md +540 -0
- package/skills/csharp-pro/SKILL.md +59 -0
- package/skills/customer-support/SKILL.md +170 -0
- package/skills/data-engineer/SKILL.md +224 -0
- package/skills/data-engineering-data-driven-feature/SKILL.md +182 -0
- package/skills/data-engineering-data-pipeline/SKILL.md +201 -0
- package/skills/data-quality-frameworks/SKILL.md +40 -0
- package/skills/data-quality-frameworks/resources/implementation-playbook.md +573 -0
- package/skills/data-scientist/SKILL.md +199 -0
- package/skills/data-storytelling/SKILL.md +465 -0
- package/skills/database-admin/SKILL.md +165 -0
- package/skills/database-architect/SKILL.md +268 -0
- package/skills/database-cloud-optimization-cost-optimize/SKILL.md +44 -0
- package/skills/database-cloud-optimization-cost-optimize/resources/implementation-playbook.md +1441 -0
- package/skills/database-migration/SKILL.md +436 -0
- package/skills/database-migrations-migration-observability/SKILL.md +420 -0
- package/skills/database-migrations-sql-migrations/SKILL.md +53 -0
- package/skills/database-migrations-sql-migrations/resources/implementation-playbook.md +499 -0
- package/skills/database-optimizer/SKILL.md +167 -0
- package/skills/dbt-transformation-patterns/SKILL.md +34 -0
- package/skills/dbt-transformation-patterns/resources/implementation-playbook.md +547 -0
- package/skills/debugger/SKILL.md +49 -0
- package/skills/debugging-strategies/SKILL.md +34 -0
- package/skills/debugging-strategies/resources/implementation-playbook.md +511 -0
- package/skills/debugging-toolkit-smart-debug/SKILL.md +197 -0
- package/skills/defi-protocol-templates/SKILL.md +466 -0
- package/skills/dependency-management-deps-audit/SKILL.md +44 -0
- package/skills/dependency-management-deps-audit/resources/implementation-playbook.md +766 -0
- package/skills/dependency-upgrade/SKILL.md +421 -0
- package/skills/deployment-engineer/SKILL.md +170 -0
- package/skills/deployment-pipeline-design/SKILL.md +371 -0
- package/skills/deployment-validation-config-validate/SKILL.md +496 -0
- package/skills/devops-troubleshooter/SKILL.md +161 -0
- package/skills/distributed-debugging-debug-trace/SKILL.md +44 -0
- package/skills/distributed-debugging-debug-trace/resources/implementation-playbook.md +1307 -0
- package/skills/distributed-tracing/SKILL.md +450 -0
- package/skills/django-pro/SKILL.md +180 -0
- package/skills/docs-architect/SKILL.md +98 -0
- package/skills/documentation-generation-doc-generate/SKILL.md +48 -0
- package/skills/documentation-generation-doc-generate/resources/implementation-playbook.md +640 -0
- package/skills/dotnet-architect/SKILL.md +197 -0
- package/skills/dotnet-backend-patterns/SKILL.md +37 -0
- package/skills/dotnet-backend-patterns/assets/repository-template.cs +523 -0
- package/skills/dotnet-backend-patterns/assets/service-template.cs +336 -0
- package/skills/dotnet-backend-patterns/references/dapper-patterns.md +544 -0
- package/skills/dotnet-backend-patterns/references/ef-core-best-practices.md +355 -0
- package/skills/dotnet-backend-patterns/resources/implementation-playbook.md +799 -0
- package/skills/dummy-skill/SKILL.md +5 -0
- package/skills/dx-optimizer/SKILL.md +83 -0
- package/skills/e2e-testing-patterns/SKILL.md +41 -0
- package/skills/e2e-testing-patterns/resources/implementation-playbook.md +531 -0
- package/skills/elixir-pro/SKILL.md +59 -0
- package/skills/embedding-strategies/SKILL.md +491 -0
- package/skills/employment-contract-templates/SKILL.md +39 -0
- package/skills/employment-contract-templates/resources/implementation-playbook.md +493 -0
- package/skills/error-debugging-error-analysis/SKILL.md +47 -0
- package/skills/error-debugging-error-analysis/resources/implementation-playbook.md +1143 -0
- package/skills/error-debugging-error-trace/SKILL.md +43 -0
- package/skills/error-debugging-error-trace/resources/implementation-playbook.md +1361 -0
- package/skills/error-debugging-multi-agent-review/SKILL.md +216 -0
- package/skills/error-detective/SKILL.md +53 -0
- package/skills/error-diagnostics-error-analysis/SKILL.md +47 -0
- package/skills/error-diagnostics-error-analysis/resources/implementation-playbook.md +1143 -0
- package/skills/error-diagnostics-error-trace/SKILL.md +48 -0
- package/skills/error-diagnostics-error-trace/resources/implementation-playbook.md +1371 -0
- package/skills/error-diagnostics-smart-debug/SKILL.md +197 -0
- package/skills/error-handling-patterns/SKILL.md +35 -0
- package/skills/error-handling-patterns/resources/implementation-playbook.md +635 -0
- package/skills/event-sourcing-architect/SKILL.md +58 -0
- package/skills/event-store-design/SKILL.md +449 -0
- package/skills/fastapi-pro/SKILL.md +192 -0
- package/skills/fastapi-templates/SKILL.md +32 -0
- package/skills/fastapi-templates/resources/implementation-playbook.md +566 -0
- package/skills/final-test/SKILL.md +5 -0
- package/skills/firmware-analyst/SKILL.md +320 -0
- package/skills/flutter-expert/SKILL.md +200 -0
- package/skills/framework-migration-code-migrate/SKILL.md +48 -0
- package/skills/framework-migration-code-migrate/resources/implementation-playbook.md +1052 -0
- package/skills/framework-migration-deps-upgrade/SKILL.md +48 -0
- package/skills/framework-migration-deps-upgrade/resources/implementation-playbook.md +755 -0
- package/skills/framework-migration-legacy-modernize/SKILL.md +132 -0
- package/skills/frontend-developer/SKILL.md +171 -0
- package/skills/frontend-mobile-development-component-scaffold/SKILL.md +403 -0
- package/skills/frontend-mobile-security-xss-scan/SKILL.md +322 -0
- package/skills/frontend-security-coder/SKILL.md +170 -0
- package/skills/full-stack-orchestration-full-stack-feature/SKILL.md +135 -0
- package/skills/gdpr-data-handling/SKILL.md +33 -0
- package/skills/gdpr-data-handling/resources/implementation-playbook.md +615 -0
- package/skills/git-advanced-workflows/SKILL.md +412 -0
- package/skills/git-pr-workflows-git-workflow/SKILL.md +140 -0
- package/skills/git-pr-workflows-onboard/SKILL.md +416 -0
- package/skills/git-pr-workflows-pr-enhance/SKILL.md +48 -0
- package/skills/git-pr-workflows-pr-enhance/resources/implementation-playbook.md +701 -0
- package/skills/github-actions-templates/SKILL.md +345 -0
- package/skills/gitlab-ci-patterns/SKILL.md +283 -0
- package/skills/gitops-workflow/SKILL.md +303 -0
- package/skills/gitops-workflow/references/argocd-setup.md +134 -0
- package/skills/gitops-workflow/references/sync-policies.md +131 -0
- package/skills/go-concurrency-patterns/SKILL.md +33 -0
- package/skills/go-concurrency-patterns/resources/implementation-playbook.md +654 -0
- package/skills/godot-gdscript-patterns/SKILL.md +33 -0
- package/skills/godot-gdscript-patterns/resources/implementation-playbook.md +804 -0
- package/skills/golang-pro/SKILL.md +179 -0
- package/skills/grafana-dashboards/SKILL.md +381 -0
- package/skills/graphql-architect/SKILL.md +182 -0
- package/skills/haskell-pro/SKILL.md +56 -0
- package/skills/helm-chart-scaffolding/SKILL.md +34 -0
- package/skills/helm-chart-scaffolding/assets/Chart.yaml.template +42 -0
- package/skills/helm-chart-scaffolding/assets/values.yaml.template +185 -0
- package/skills/helm-chart-scaffolding/references/chart-structure.md +500 -0
- package/skills/helm-chart-scaffolding/resources/implementation-playbook.md +543 -0
- package/skills/helm-chart-scaffolding/scripts/validate-chart.sh +244 -0
- package/skills/hr-pro/SKILL.md +126 -0
- package/skills/hybrid-cloud-architect/SKILL.md +168 -0
- package/skills/hybrid-cloud-networking/SKILL.md +238 -0
- package/skills/hybrid-search-implementation/SKILL.md +32 -0
- package/skills/hybrid-search-implementation/resources/implementation-playbook.md +567 -0
- package/skills/incident-responder/SKILL.md +213 -0
- package/skills/incident-response-incident-response/SKILL.md +168 -0
- package/skills/incident-response-smart-fix/SKILL.md +29 -0
- package/skills/incident-response-smart-fix/resources/implementation-playbook.md +838 -0
- package/skills/incident-runbook-templates/SKILL.md +395 -0
- package/skills/ios-developer/SKILL.md +219 -0
- package/skills/istio-traffic-management/SKILL.md +337 -0
- package/skills/java-pro/SKILL.md +177 -0
- package/skills/javascript-pro/SKILL.md +57 -0
- package/skills/javascript-testing-patterns/SKILL.md +35 -0
- package/skills/javascript-testing-patterns/resources/implementation-playbook.md +1024 -0
- package/skills/javascript-typescript-typescript-scaffold/SKILL.md +361 -0
- package/skills/julia-pro/SKILL.md +209 -0
- package/skills/k8s-manifest-generator/SKILL.md +35 -0
- package/skills/k8s-manifest-generator/assets/configmap-template.yaml +296 -0
- package/skills/k8s-manifest-generator/assets/deployment-template.yaml +203 -0
- package/skills/k8s-manifest-generator/assets/service-template.yaml +171 -0
- package/skills/k8s-manifest-generator/references/deployment-spec.md +753 -0
- package/skills/k8s-manifest-generator/references/service-spec.md +724 -0
- package/skills/k8s-manifest-generator/resources/implementation-playbook.md +510 -0
- package/skills/k8s-security-policies/SKILL.md +346 -0
- package/skills/k8s-security-policies/assets/network-policy-template.yaml +177 -0
- package/skills/k8s-security-policies/references/rbac-patterns.md +187 -0
- package/skills/kpi-dashboard-design/SKILL.md +440 -0
- package/skills/kubernetes-architect/SKILL.md +170 -0
- package/skills/langchain-architecture/SKILL.md +350 -0
- package/skills/legacy-modernizer/SKILL.md +53 -0
- package/skills/legal-advisor/SKILL.md +70 -0
- package/skills/linkerd-patterns/SKILL.md +321 -0
- package/skills/llm-application-dev-ai-assistant/SKILL.md +35 -0
- package/skills/llm-application-dev-ai-assistant/resources/implementation-playbook.md +1236 -0
- package/skills/llm-application-dev-langchain-agent/SKILL.md +246 -0
- package/skills/llm-application-dev-prompt-optimize/SKILL.md +37 -0
- package/skills/llm-application-dev-prompt-optimize/resources/implementation-playbook.md +591 -0
- package/skills/llm-evaluation/SKILL.md +483 -0
- package/skills/machine-learning-ops-ml-pipeline/SKILL.md +314 -0
- package/skills/malware-analyst/SKILL.md +247 -0
- package/skills/market-sizing-analysis/SKILL.md +425 -0
- package/skills/market-sizing-analysis/examples/saas-market-sizing.md +349 -0
- package/skills/market-sizing-analysis/references/data-sources.md +360 -0
- package/skills/memory-forensics/SKILL.md +491 -0
- package/skills/memory-safety-patterns/SKILL.md +33 -0
- package/skills/memory-safety-patterns/resources/implementation-playbook.md +603 -0
- package/skills/mermaid-expert/SKILL.md +59 -0
- package/skills/microservices-patterns/SKILL.md +35 -0
- package/skills/microservices-patterns/resources/implementation-playbook.md +607 -0
- package/skills/minecraft-bukkit-pro/SKILL.md +126 -0
- package/skills/ml-engineer/SKILL.md +168 -0
- package/skills/ml-pipeline-workflow/SKILL.md +257 -0
- package/skills/mlops-engineer/SKILL.md +219 -0
- package/skills/mobile-developer/SKILL.md +205 -0
- package/skills/mobile-security-coder/SKILL.md +184 -0
- package/skills/modern-javascript-patterns/SKILL.md +35 -0
- package/skills/modern-javascript-patterns/resources/implementation-playbook.md +910 -0
- package/skills/monorepo-architect/SKILL.md +61 -0
- package/skills/monorepo-management/SKILL.md +35 -0
- package/skills/monorepo-management/resources/implementation-playbook.md +621 -0
- package/skills/mtls-configuration/SKILL.md +359 -0
- package/skills/multi-cloud-architecture/SKILL.md +189 -0
- package/skills/multi-platform-apps-multi-platform/SKILL.md +203 -0
- package/skills/network-engineer/SKILL.md +169 -0
- package/skills/nextjs-app-router-patterns/SKILL.md +33 -0
- package/skills/nextjs-app-router-patterns/resources/implementation-playbook.md +543 -0
- package/skills/nft-standards/SKILL.md +395 -0
- package/skills/node-expert/SKILL.md +23 -0
- package/skills/nodejs-backend-patterns/SKILL.md +35 -0
- package/skills/nodejs-backend-patterns/resources/implementation-playbook.md +1019 -0
- package/skills/nx-workspace-patterns/SKILL.md +464 -0
- package/skills/observability-engineer/SKILL.md +237 -0
- package/skills/observability-monitoring-monitor-setup/SKILL.md +48 -0
- package/skills/observability-monitoring-monitor-setup/resources/implementation-playbook.md +505 -0
- package/skills/observability-monitoring-slo-implement/SKILL.md +43 -0
- package/skills/observability-monitoring-slo-implement/resources/implementation-playbook.md +1077 -0
- package/skills/on-call-handoff-patterns/SKILL.md +453 -0
- package/skills/openapi-spec-generation/SKILL.md +33 -0
- package/skills/openapi-spec-generation/resources/implementation-playbook.md +1027 -0
- package/skills/payment-integration/SKILL.md +77 -0
- package/skills/paypal-integration/SKILL.md +479 -0
- package/skills/pci-compliance/SKILL.md +478 -0
- package/skills/performance-engineer/SKILL.md +180 -0
- package/skills/performance-testing-review-ai-review/SKILL.md +450 -0
- package/skills/performance-testing-review-multi-agent-review/SKILL.md +216 -0
- package/skills/php-pro/SKILL.md +63 -0
- package/skills/posix-shell-pro/SKILL.md +304 -0
- package/skills/postgresql/SKILL.md +230 -0
- package/skills/postmortem-writing/SKILL.md +386 -0
- package/skills/projection-patterns/SKILL.md +33 -0
- package/skills/projection-patterns/resources/implementation-playbook.md +501 -0
- package/skills/prometheus-configuration/SKILL.md +404 -0
- package/skills/prompt-engineer/SKILL.md +272 -0
- package/skills/prompt-engineering-patterns/SKILL.md +213 -0
- package/skills/prompt-engineering-patterns/assets/few-shot-examples.json +106 -0
- package/skills/prompt-engineering-patterns/assets/prompt-template-library.md +246 -0
- package/skills/prompt-engineering-patterns/references/chain-of-thought.md +399 -0
- package/skills/prompt-engineering-patterns/references/few-shot-learning.md +369 -0
- package/skills/prompt-engineering-patterns/references/prompt-optimization.md +414 -0
- package/skills/prompt-engineering-patterns/references/prompt-templates.md +470 -0
- package/skills/prompt-engineering-patterns/references/system-prompts.md +189 -0
- package/skills/prompt-engineering-patterns/scripts/optimize-prompt.py +279 -0
- package/skills/protocol-reverse-engineering/SKILL.md +29 -0
- package/skills/protocol-reverse-engineering/resources/implementation-playbook.md +509 -0
- package/skills/python-development-python-scaffold/SKILL.md +331 -0
- package/skills/python-packaging/SKILL.md +36 -0
- package/skills/python-packaging/resources/implementation-playbook.md +869 -0
- package/skills/python-performance-optimization/SKILL.md +36 -0
- package/skills/python-performance-optimization/resources/implementation-playbook.md +868 -0
- package/skills/python-pro/SKILL.md +158 -0
- package/skills/python-testing-patterns/SKILL.md +37 -0
- package/skills/python-testing-patterns/resources/implementation-playbook.md +906 -0
- package/skills/quant-analyst/SKILL.md +53 -0
- package/skills/rag-implementation/SKILL.md +421 -0
- package/skills/react-modernization/SKILL.md +34 -0
- package/skills/react-modernization/resources/implementation-playbook.md +512 -0
- package/skills/react-native-architecture/SKILL.md +33 -0
- package/skills/react-native-architecture/resources/implementation-playbook.md +670 -0
- package/skills/react-state-management/SKILL.md +441 -0
- package/skills/reference-builder/SKILL.md +188 -0
- package/skills/reverse-engineer/SKILL.md +173 -0
- package/skills/risk-manager/SKILL.md +61 -0
- package/skills/risk-metrics-calculation/SKILL.md +33 -0
- package/skills/risk-metrics-calculation/resources/implementation-playbook.md +554 -0
- package/skills/ruby-pro/SKILL.md +56 -0
- package/skills/rust-async-patterns/SKILL.md +33 -0
- package/skills/rust-async-patterns/resources/implementation-playbook.md +516 -0
- package/skills/rust-pro/SKILL.md +178 -0
- package/skills/saga-orchestration/SKILL.md +496 -0
- package/skills/sales-automator/SKILL.md +55 -0
- package/skills/sast-configuration/SKILL.md +212 -0
- package/skills/scala-pro/SKILL.md +82 -0
- package/skills/screen-reader-testing/SKILL.md +33 -0
- package/skills/screen-reader-testing/resources/implementation-playbook.md +544 -0
- package/skills/search-specialist/SKILL.md +80 -0
- package/skills/secrets-management/SKILL.md +364 -0
- package/skills/security-auditor/SKILL.md +169 -0
- package/skills/security-compliance-compliance-check/SKILL.md +55 -0
- package/skills/security-compliance-compliance-check/resources/implementation-playbook.md +963 -0
- package/skills/security-requirement-extraction/SKILL.md +33 -0
- package/skills/security-requirement-extraction/resources/implementation-playbook.md +676 -0
- package/skills/security-scanning-security-dependencies/SKILL.md +43 -0
- package/skills/security-scanning-security-dependencies/resources/implementation-playbook.md +544 -0
- package/skills/security-scanning-security-hardening/SKILL.md +147 -0
- package/skills/security-scanning-security-sast/SKILL.md +495 -0
- package/skills/seo-authority-builder/SKILL.md +136 -0
- package/skills/seo-cannibalization-detector/SKILL.md +123 -0
- package/skills/seo-content-auditor/SKILL.md +83 -0
- package/skills/seo-content-planner/SKILL.md +108 -0
- package/skills/seo-content-refresher/SKILL.md +118 -0
- package/skills/seo-content-writer/SKILL.md +96 -0
- package/skills/seo-keyword-strategist/SKILL.md +95 -0
- package/skills/seo-meta-optimizer/SKILL.md +92 -0
- package/skills/seo-snippet-hunter/SKILL.md +114 -0
- package/skills/seo-structure-architect/SKILL.md +108 -0
- package/skills/service-mesh-expert/SKILL.md +58 -0
- package/skills/service-mesh-observability/SKILL.md +395 -0
- package/skills/shellcheck-configuration/SKILL.md +466 -0
- package/skills/similarity-search-patterns/SKILL.md +33 -0
- package/skills/similarity-search-patterns/resources/implementation-playbook.md +557 -0
- package/skills/slo-implementation/SKILL.md +341 -0
- package/skills/solidity-security/SKILL.md +34 -0
- package/skills/solidity-security/resources/implementation-playbook.md +524 -0
- package/skills/spark-optimization/SKILL.md +427 -0
- package/skills/sql-optimization-patterns/SKILL.md +35 -0
- package/skills/sql-optimization-patterns/resources/implementation-playbook.md +504 -0
- package/skills/sql-pro/SKILL.md +173 -0
- package/skills/startup-analyst/SKILL.md +328 -0
- package/skills/startup-business-analyst-business-case/SKILL.md +487 -0
- package/skills/startup-business-analyst-financial-projections/SKILL.md +353 -0
- package/skills/startup-business-analyst-market-opportunity/SKILL.md +240 -0
- package/skills/startup-financial-modeling/SKILL.md +467 -0
- package/skills/startup-metrics-framework/SKILL.md +34 -0
- package/skills/startup-metrics-framework/resources/implementation-playbook.md +500 -0
- package/skills/stride-analysis-patterns/SKILL.md +33 -0
- package/skills/stride-analysis-patterns/resources/implementation-playbook.md +655 -0
- package/skills/stripe-integration/SKILL.md +454 -0
- package/skills/systems-programming-rust-project/SKILL.md +440 -0
- package/skills/tailwind-design-system/SKILL.md +33 -0
- package/skills/tailwind-design-system/resources/implementation-playbook.md +665 -0
- package/skills/tdd-orchestrator/SKILL.md +205 -0
- package/skills/tdd-workflows-tdd-cycle/SKILL.md +221 -0
- package/skills/tdd-workflows-tdd-green/SKILL.md +73 -0
- package/skills/tdd-workflows-tdd-green/resources/implementation-playbook.md +870 -0
- package/skills/tdd-workflows-tdd-red/SKILL.md +164 -0
- package/skills/tdd-workflows-tdd-refactor/SKILL.md +187 -0
- package/skills/team-collaboration-issue/SKILL.md +37 -0
- package/skills/team-collaboration-issue/resources/implementation-playbook.md +640 -0
- package/skills/team-collaboration-standup-notes/SKILL.md +44 -0
- package/skills/team-collaboration-standup-notes/resources/implementation-playbook.md +768 -0
- package/skills/team-composition-analysis/SKILL.md +413 -0
- package/skills/temporal-python-pro/SKILL.md +370 -0
- package/skills/temporal-python-testing/SKILL.md +170 -0
- package/skills/temporal-python-testing/resources/integration-testing.md +455 -0
- package/skills/temporal-python-testing/resources/local-setup.md +553 -0
- package/skills/temporal-python-testing/resources/replay-testing.md +462 -0
- package/skills/temporal-python-testing/resources/unit-testing.md +328 -0
- package/skills/terraform-module-library/SKILL.md +261 -0
- package/skills/terraform-module-library/references/aws-modules.md +63 -0
- package/skills/terraform-specialist/SKILL.md +166 -0
- package/skills/test-automator/SKILL.md +224 -0
- package/skills/threat-mitigation-mapping/SKILL.md +33 -0
- package/skills/threat-mitigation-mapping/resources/implementation-playbook.md +744 -0
- package/skills/threat-modeling-expert/SKILL.md +60 -0
- package/skills/track-management/SKILL.md +38 -0
- package/skills/track-management/resources/implementation-playbook.md +591 -0
- package/skills/turborepo-caching/SKILL.md +419 -0
- package/skills/tutorial-engineer/SKILL.md +139 -0
- package/skills/typescript-advanced-types/SKILL.md +35 -0
- package/skills/typescript-advanced-types/resources/implementation-playbook.md +716 -0
- package/skills/typescript-pro/SKILL.md +55 -0
- package/skills/ui-minimal/SKILL.md +23 -0
- package/skills/ui-ux-designer/SKILL.md +209 -0
- package/skills/ui-visual-validator/SKILL.md +214 -0
- package/skills/unit-testing-test-generate/SKILL.md +319 -0
- package/skills/unity-developer/SKILL.md +230 -0
- package/skills/unity-ecs-patterns/SKILL.md +33 -0
- package/skills/unity-ecs-patterns/resources/implementation-playbook.md +625 -0
- package/skills/uv-package-manager/SKILL.md +37 -0
- package/skills/uv-package-manager/resources/implementation-playbook.md +830 -0
- package/skills/vector-database-engineer/SKILL.md +60 -0
- package/skills/vector-index-tuning/SKILL.md +42 -0
- package/skills/vector-index-tuning/resources/implementation-playbook.md +507 -0
- package/skills/wcag-audit-patterns/SKILL.md +41 -0
- package/skills/wcag-audit-patterns/resources/implementation-playbook.md +541 -0
- package/skills/web3-testing/SKILL.md +427 -0
- package/skills/workflow-orchestration-patterns/SKILL.md +333 -0
- package/skills/workflow-patterns/SKILL.md +38 -0
- package/skills/workflow-patterns/resources/implementation-playbook.md +621 -0
|
@@ -0,0 +1,414 @@
|
|
|
1
|
+
# Prompt Optimization Guide
|
|
2
|
+
|
|
3
|
+
## Systematic Refinement Process
|
|
4
|
+
|
|
5
|
+
### 1. Baseline Establishment
|
|
6
|
+
```python
|
|
7
|
+
def establish_baseline(prompt, test_cases):
|
|
8
|
+
results = {
|
|
9
|
+
'accuracy': 0,
|
|
10
|
+
'avg_tokens': 0,
|
|
11
|
+
'avg_latency': 0,
|
|
12
|
+
'success_rate': 0
|
|
13
|
+
}
|
|
14
|
+
|
|
15
|
+
for test_case in test_cases:
|
|
16
|
+
response = llm.complete(prompt.format(**test_case['input']))
|
|
17
|
+
|
|
18
|
+
results['accuracy'] += evaluate_accuracy(response, test_case['expected'])
|
|
19
|
+
results['avg_tokens'] += count_tokens(response)
|
|
20
|
+
results['avg_latency'] += measure_latency(response)
|
|
21
|
+
results['success_rate'] += is_valid_response(response)
|
|
22
|
+
|
|
23
|
+
# Average across test cases
|
|
24
|
+
n = len(test_cases)
|
|
25
|
+
return {k: v/n for k, v in results.items()}
|
|
26
|
+
```
|
|
27
|
+
|
|
28
|
+
### 2. Iterative Refinement Workflow
|
|
29
|
+
```
|
|
30
|
+
Initial Prompt → Test → Analyze Failures → Refine → Test → Repeat
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
```python
|
|
34
|
+
class PromptOptimizer:
|
|
35
|
+
def __init__(self, initial_prompt, test_suite):
|
|
36
|
+
self.prompt = initial_prompt
|
|
37
|
+
self.test_suite = test_suite
|
|
38
|
+
self.history = []
|
|
39
|
+
|
|
40
|
+
def optimize(self, max_iterations=10):
|
|
41
|
+
for i in range(max_iterations):
|
|
42
|
+
# Test current prompt
|
|
43
|
+
results = self.evaluate_prompt(self.prompt)
|
|
44
|
+
self.history.append({
|
|
45
|
+
'iteration': i,
|
|
46
|
+
'prompt': self.prompt,
|
|
47
|
+
'results': results
|
|
48
|
+
})
|
|
49
|
+
|
|
50
|
+
# Stop if good enough
|
|
51
|
+
if results['accuracy'] > 0.95:
|
|
52
|
+
break
|
|
53
|
+
|
|
54
|
+
# Analyze failures
|
|
55
|
+
failures = self.analyze_failures(results)
|
|
56
|
+
|
|
57
|
+
# Generate refinement suggestions
|
|
58
|
+
refinements = self.generate_refinements(failures)
|
|
59
|
+
|
|
60
|
+
# Apply best refinement
|
|
61
|
+
self.prompt = self.select_best_refinement(refinements)
|
|
62
|
+
|
|
63
|
+
return self.get_best_prompt()
|
|
64
|
+
```
|
|
65
|
+
|
|
66
|
+
### 3. A/B Testing Framework
|
|
67
|
+
```python
|
|
68
|
+
class PromptABTest:
|
|
69
|
+
def __init__(self, variant_a, variant_b):
|
|
70
|
+
self.variant_a = variant_a
|
|
71
|
+
self.variant_b = variant_b
|
|
72
|
+
|
|
73
|
+
def run_test(self, test_queries, metrics=['accuracy', 'latency']):
|
|
74
|
+
results = {
|
|
75
|
+
'A': {m: [] for m in metrics},
|
|
76
|
+
'B': {m: [] for m in metrics}
|
|
77
|
+
}
|
|
78
|
+
|
|
79
|
+
for query in test_queries:
|
|
80
|
+
# Randomly assign variant (50/50 split)
|
|
81
|
+
variant = 'A' if random.random() < 0.5 else 'B'
|
|
82
|
+
prompt = self.variant_a if variant == 'A' else self.variant_b
|
|
83
|
+
|
|
84
|
+
response, metrics_data = self.execute_with_metrics(
|
|
85
|
+
prompt.format(query=query['input'])
|
|
86
|
+
)
|
|
87
|
+
|
|
88
|
+
for metric in metrics:
|
|
89
|
+
results[variant][metric].append(metrics_data[metric])
|
|
90
|
+
|
|
91
|
+
return self.analyze_results(results)
|
|
92
|
+
|
|
93
|
+
def analyze_results(self, results):
|
|
94
|
+
from scipy import stats
|
|
95
|
+
|
|
96
|
+
analysis = {}
|
|
97
|
+
for metric in results['A'].keys():
|
|
98
|
+
a_values = results['A'][metric]
|
|
99
|
+
b_values = results['B'][metric]
|
|
100
|
+
|
|
101
|
+
# Statistical significance test
|
|
102
|
+
t_stat, p_value = stats.ttest_ind(a_values, b_values)
|
|
103
|
+
|
|
104
|
+
analysis[metric] = {
|
|
105
|
+
'A_mean': np.mean(a_values),
|
|
106
|
+
'B_mean': np.mean(b_values),
|
|
107
|
+
'improvement': (np.mean(b_values) - np.mean(a_values)) / np.mean(a_values),
|
|
108
|
+
'statistically_significant': p_value < 0.05,
|
|
109
|
+
'p_value': p_value,
|
|
110
|
+
'winner': 'B' if np.mean(b_values) > np.mean(a_values) else 'A'
|
|
111
|
+
}
|
|
112
|
+
|
|
113
|
+
return analysis
|
|
114
|
+
```
|
|
115
|
+
|
|
116
|
+
## Optimization Strategies
|
|
117
|
+
|
|
118
|
+
### Token Reduction
|
|
119
|
+
```python
|
|
120
|
+
def optimize_for_tokens(prompt):
|
|
121
|
+
optimizations = [
|
|
122
|
+
# Remove redundant phrases
|
|
123
|
+
('in order to', 'to'),
|
|
124
|
+
('due to the fact that', 'because'),
|
|
125
|
+
('at this point in time', 'now'),
|
|
126
|
+
|
|
127
|
+
# Consolidate instructions
|
|
128
|
+
('First, ...\\nThen, ...\\nFinally, ...', 'Steps: 1) ... 2) ... 3) ...'),
|
|
129
|
+
|
|
130
|
+
# Use abbreviations (after first definition)
|
|
131
|
+
('Natural Language Processing (NLP)', 'NLP'),
|
|
132
|
+
|
|
133
|
+
# Remove filler words
|
|
134
|
+
(' actually ', ' '),
|
|
135
|
+
(' basically ', ' '),
|
|
136
|
+
(' really ', ' ')
|
|
137
|
+
]
|
|
138
|
+
|
|
139
|
+
optimized = prompt
|
|
140
|
+
for old, new in optimizations:
|
|
141
|
+
optimized = optimized.replace(old, new)
|
|
142
|
+
|
|
143
|
+
return optimized
|
|
144
|
+
```
|
|
145
|
+
|
|
146
|
+
### Latency Reduction
|
|
147
|
+
```python
|
|
148
|
+
def optimize_for_latency(prompt):
|
|
149
|
+
strategies = {
|
|
150
|
+
'shorter_prompt': reduce_token_count(prompt),
|
|
151
|
+
'streaming': enable_streaming_response(prompt),
|
|
152
|
+
'caching': add_cacheable_prefix(prompt),
|
|
153
|
+
'early_stopping': add_stop_sequences(prompt)
|
|
154
|
+
}
|
|
155
|
+
|
|
156
|
+
# Test each strategy
|
|
157
|
+
best_strategy = None
|
|
158
|
+
best_latency = float('inf')
|
|
159
|
+
|
|
160
|
+
for name, modified_prompt in strategies.items():
|
|
161
|
+
latency = measure_average_latency(modified_prompt)
|
|
162
|
+
if latency < best_latency:
|
|
163
|
+
best_latency = latency
|
|
164
|
+
best_strategy = modified_prompt
|
|
165
|
+
|
|
166
|
+
return best_strategy
|
|
167
|
+
```
|
|
168
|
+
|
|
169
|
+
### Accuracy Improvement
|
|
170
|
+
```python
|
|
171
|
+
def improve_accuracy(prompt, failure_cases):
|
|
172
|
+
improvements = []
|
|
173
|
+
|
|
174
|
+
# Add constraints for common failures
|
|
175
|
+
if has_format_errors(failure_cases):
|
|
176
|
+
improvements.append("Output must be valid JSON with no additional text.")
|
|
177
|
+
|
|
178
|
+
# Add examples for edge cases
|
|
179
|
+
edge_cases = identify_edge_cases(failure_cases)
|
|
180
|
+
if edge_cases:
|
|
181
|
+
improvements.append(f"Examples of edge cases:\\n{format_examples(edge_cases)}")
|
|
182
|
+
|
|
183
|
+
# Add verification step
|
|
184
|
+
if has_logical_errors(failure_cases):
|
|
185
|
+
improvements.append("Before responding, verify your answer is logically consistent.")
|
|
186
|
+
|
|
187
|
+
# Strengthen instructions
|
|
188
|
+
if has_ambiguity_errors(failure_cases):
|
|
189
|
+
improvements.append(clarify_ambiguous_instructions(prompt))
|
|
190
|
+
|
|
191
|
+
return integrate_improvements(prompt, improvements)
|
|
192
|
+
```
|
|
193
|
+
|
|
194
|
+
## Performance Metrics
|
|
195
|
+
|
|
196
|
+
### Core Metrics
|
|
197
|
+
```python
|
|
198
|
+
class PromptMetrics:
|
|
199
|
+
@staticmethod
|
|
200
|
+
def accuracy(responses, ground_truth):
|
|
201
|
+
return sum(r == gt for r, gt in zip(responses, ground_truth)) / len(responses)
|
|
202
|
+
|
|
203
|
+
@staticmethod
|
|
204
|
+
def consistency(responses):
|
|
205
|
+
# Measure how often identical inputs produce identical outputs
|
|
206
|
+
from collections import defaultdict
|
|
207
|
+
input_responses = defaultdict(list)
|
|
208
|
+
|
|
209
|
+
for inp, resp in responses:
|
|
210
|
+
input_responses[inp].append(resp)
|
|
211
|
+
|
|
212
|
+
consistency_scores = []
|
|
213
|
+
for inp, resps in input_responses.items():
|
|
214
|
+
if len(resps) > 1:
|
|
215
|
+
# Percentage of responses that match the most common response
|
|
216
|
+
most_common_count = Counter(resps).most_common(1)[0][1]
|
|
217
|
+
consistency_scores.append(most_common_count / len(resps))
|
|
218
|
+
|
|
219
|
+
return np.mean(consistency_scores) if consistency_scores else 1.0
|
|
220
|
+
|
|
221
|
+
@staticmethod
|
|
222
|
+
def token_efficiency(prompt, responses):
|
|
223
|
+
avg_prompt_tokens = np.mean([count_tokens(prompt.format(**r['input'])) for r in responses])
|
|
224
|
+
avg_response_tokens = np.mean([count_tokens(r['output']) for r in responses])
|
|
225
|
+
return avg_prompt_tokens + avg_response_tokens
|
|
226
|
+
|
|
227
|
+
@staticmethod
|
|
228
|
+
def latency_p95(latencies):
|
|
229
|
+
return np.percentile(latencies, 95)
|
|
230
|
+
```
|
|
231
|
+
|
|
232
|
+
### Automated Evaluation
|
|
233
|
+
```python
|
|
234
|
+
def evaluate_prompt_comprehensively(prompt, test_suite):
|
|
235
|
+
results = {
|
|
236
|
+
'accuracy': [],
|
|
237
|
+
'consistency': [],
|
|
238
|
+
'latency': [],
|
|
239
|
+
'tokens': [],
|
|
240
|
+
'success_rate': []
|
|
241
|
+
}
|
|
242
|
+
|
|
243
|
+
# Run each test case multiple times for consistency measurement
|
|
244
|
+
for test_case in test_suite:
|
|
245
|
+
runs = []
|
|
246
|
+
for _ in range(3): # 3 runs per test case
|
|
247
|
+
start = time.time()
|
|
248
|
+
response = llm.complete(prompt.format(**test_case['input']))
|
|
249
|
+
latency = time.time() - start
|
|
250
|
+
|
|
251
|
+
runs.append(response)
|
|
252
|
+
results['latency'].append(latency)
|
|
253
|
+
results['tokens'].append(count_tokens(prompt) + count_tokens(response))
|
|
254
|
+
|
|
255
|
+
# Accuracy (best of 3 runs)
|
|
256
|
+
accuracies = [evaluate_accuracy(r, test_case['expected']) for r in runs]
|
|
257
|
+
results['accuracy'].append(max(accuracies))
|
|
258
|
+
|
|
259
|
+
# Consistency (how similar are the 3 runs?)
|
|
260
|
+
results['consistency'].append(calculate_similarity(runs))
|
|
261
|
+
|
|
262
|
+
# Success rate (all runs successful?)
|
|
263
|
+
results['success_rate'].append(all(is_valid(r) for r in runs))
|
|
264
|
+
|
|
265
|
+
return {
|
|
266
|
+
'avg_accuracy': np.mean(results['accuracy']),
|
|
267
|
+
'avg_consistency': np.mean(results['consistency']),
|
|
268
|
+
'p95_latency': np.percentile(results['latency'], 95),
|
|
269
|
+
'avg_tokens': np.mean(results['tokens']),
|
|
270
|
+
'success_rate': np.mean(results['success_rate'])
|
|
271
|
+
}
|
|
272
|
+
```
|
|
273
|
+
|
|
274
|
+
## Failure Analysis
|
|
275
|
+
|
|
276
|
+
### Categorizing Failures
|
|
277
|
+
```python
|
|
278
|
+
class FailureAnalyzer:
|
|
279
|
+
def categorize_failures(self, test_results):
|
|
280
|
+
categories = {
|
|
281
|
+
'format_errors': [],
|
|
282
|
+
'factual_errors': [],
|
|
283
|
+
'logic_errors': [],
|
|
284
|
+
'incomplete_responses': [],
|
|
285
|
+
'hallucinations': [],
|
|
286
|
+
'off_topic': []
|
|
287
|
+
}
|
|
288
|
+
|
|
289
|
+
for result in test_results:
|
|
290
|
+
if not result['success']:
|
|
291
|
+
category = self.determine_failure_type(
|
|
292
|
+
result['response'],
|
|
293
|
+
result['expected']
|
|
294
|
+
)
|
|
295
|
+
categories[category].append(result)
|
|
296
|
+
|
|
297
|
+
return categories
|
|
298
|
+
|
|
299
|
+
def generate_fixes(self, categorized_failures):
|
|
300
|
+
fixes = []
|
|
301
|
+
|
|
302
|
+
if categorized_failures['format_errors']:
|
|
303
|
+
fixes.append({
|
|
304
|
+
'issue': 'Format errors',
|
|
305
|
+
'fix': 'Add explicit format examples and constraints',
|
|
306
|
+
'priority': 'high'
|
|
307
|
+
})
|
|
308
|
+
|
|
309
|
+
if categorized_failures['hallucinations']:
|
|
310
|
+
fixes.append({
|
|
311
|
+
'issue': 'Hallucinations',
|
|
312
|
+
'fix': 'Add grounding instruction: "Base your answer only on provided context"',
|
|
313
|
+
'priority': 'critical'
|
|
314
|
+
})
|
|
315
|
+
|
|
316
|
+
if categorized_failures['incomplete_responses']:
|
|
317
|
+
fixes.append({
|
|
318
|
+
'issue': 'Incomplete responses',
|
|
319
|
+
'fix': 'Add: "Ensure your response fully addresses all parts of the question"',
|
|
320
|
+
'priority': 'medium'
|
|
321
|
+
})
|
|
322
|
+
|
|
323
|
+
return fixes
|
|
324
|
+
```
|
|
325
|
+
|
|
326
|
+
## Versioning and Rollback
|
|
327
|
+
|
|
328
|
+
### Prompt Version Control
|
|
329
|
+
```python
|
|
330
|
+
class PromptVersionControl:
|
|
331
|
+
def __init__(self, storage_path):
|
|
332
|
+
self.storage = storage_path
|
|
333
|
+
self.versions = []
|
|
334
|
+
|
|
335
|
+
def save_version(self, prompt, metadata):
|
|
336
|
+
version = {
|
|
337
|
+
'id': len(self.versions),
|
|
338
|
+
'prompt': prompt,
|
|
339
|
+
'timestamp': datetime.now(),
|
|
340
|
+
'metrics': metadata.get('metrics', {}),
|
|
341
|
+
'description': metadata.get('description', ''),
|
|
342
|
+
'parent_id': metadata.get('parent_id')
|
|
343
|
+
}
|
|
344
|
+
self.versions.append(version)
|
|
345
|
+
self.persist()
|
|
346
|
+
return version['id']
|
|
347
|
+
|
|
348
|
+
def rollback(self, version_id):
|
|
349
|
+
if version_id < len(self.versions):
|
|
350
|
+
return self.versions[version_id]['prompt']
|
|
351
|
+
raise ValueError(f"Version {version_id} not found")
|
|
352
|
+
|
|
353
|
+
def compare_versions(self, v1_id, v2_id):
|
|
354
|
+
v1 = self.versions[v1_id]
|
|
355
|
+
v2 = self.versions[v2_id]
|
|
356
|
+
|
|
357
|
+
return {
|
|
358
|
+
'diff': generate_diff(v1['prompt'], v2['prompt']),
|
|
359
|
+
'metrics_comparison': {
|
|
360
|
+
metric: {
|
|
361
|
+
'v1': v1['metrics'].get(metric),
|
|
362
|
+
'v2': v2['metrics'].get(metric'),
|
|
363
|
+
'change': v2['metrics'].get(metric, 0) - v1['metrics'].get(metric, 0)
|
|
364
|
+
}
|
|
365
|
+
for metric in set(v1['metrics'].keys()) | set(v2['metrics'].keys())
|
|
366
|
+
}
|
|
367
|
+
}
|
|
368
|
+
```
|
|
369
|
+
|
|
370
|
+
## Best Practices
|
|
371
|
+
|
|
372
|
+
1. **Establish Baseline**: Always measure initial performance
|
|
373
|
+
2. **Change One Thing**: Isolate variables for clear attribution
|
|
374
|
+
3. **Test Thoroughly**: Use diverse, representative test cases
|
|
375
|
+
4. **Track Metrics**: Log all experiments and results
|
|
376
|
+
5. **Validate Significance**: Use statistical tests for A/B comparisons
|
|
377
|
+
6. **Document Changes**: Keep detailed notes on what and why
|
|
378
|
+
7. **Version Everything**: Enable rollback to previous versions
|
|
379
|
+
8. **Monitor Production**: Continuously evaluate deployed prompts
|
|
380
|
+
|
|
381
|
+
## Common Optimization Patterns
|
|
382
|
+
|
|
383
|
+
### Pattern 1: Add Structure
|
|
384
|
+
```
|
|
385
|
+
Before: "Analyze this text"
|
|
386
|
+
After: "Analyze this text for:\n1. Main topic\n2. Key arguments\n3. Conclusion"
|
|
387
|
+
```
|
|
388
|
+
|
|
389
|
+
### Pattern 2: Add Examples
|
|
390
|
+
```
|
|
391
|
+
Before: "Extract entities"
|
|
392
|
+
After: "Extract entities\\n\\nExample:\\nText: Apple released iPhone\\nEntities: {company: Apple, product: iPhone}"
|
|
393
|
+
```
|
|
394
|
+
|
|
395
|
+
### Pattern 3: Add Constraints
|
|
396
|
+
```
|
|
397
|
+
Before: "Summarize this"
|
|
398
|
+
After: "Summarize in exactly 3 bullet points, 15 words each"
|
|
399
|
+
```
|
|
400
|
+
|
|
401
|
+
### Pattern 4: Add Verification
|
|
402
|
+
```
|
|
403
|
+
Before: "Calculate..."
|
|
404
|
+
After: "Calculate... Then verify your calculation is correct before responding."
|
|
405
|
+
```
|
|
406
|
+
|
|
407
|
+
## Tools and Utilities
|
|
408
|
+
|
|
409
|
+
- Prompt diff tools for version comparison
|
|
410
|
+
- Automated test runners
|
|
411
|
+
- Metric dashboards
|
|
412
|
+
- A/B testing frameworks
|
|
413
|
+
- Token counting utilities
|
|
414
|
+
- Latency profilers
|