@heyai-rules/pilo-masterkit 1.2.2 → 3.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.agent/agents/architect.md +211 -211
- package/.agent/agents/build-error-resolver.md +114 -114
- package/.agent/agents/chief-of-staff.md +151 -151
- package/.agent/agents/code-reviewer.md +237 -237
- package/.agent/agents/cpp-build-resolver.md +90 -90
- package/.agent/agents/cpp-reviewer.md +72 -72
- package/.agent/agents/csharp-reviewer.md +101 -0
- package/.agent/agents/dart-build-resolver.md +201 -0
- package/.agent/agents/database-reviewer.md +91 -91
- package/.agent/agents/doc-updater.md +107 -107
- package/.agent/agents/docs-lookup.md +68 -68
- package/.agent/agents/e2e-runner.md +107 -107
- package/.agent/agents/flutter-reviewer.md +243 -243
- package/.agent/agents/gan-evaluator.md +209 -0
- package/.agent/agents/gan-generator.md +131 -0
- package/.agent/agents/gan-planner.md +99 -0
- package/.agent/agents/go-build-resolver.md +94 -94
- package/.agent/agents/go-reviewer.md +76 -76
- package/.agent/agents/harness-optimizer.md +35 -35
- package/.agent/agents/healthcare-reviewer.md +83 -0
- package/.agent/agents/java-build-resolver.md +153 -153
- package/.agent/agents/java-reviewer.md +92 -92
- package/.agent/agents/kotlin-build-resolver.md +118 -118
- package/.agent/agents/kotlin-reviewer.md +159 -159
- package/.agent/agents/loop-operator.md +36 -36
- package/.agent/agents/opensource-forker.md +198 -0
- package/.agent/agents/opensource-packager.md +249 -0
- package/.agent/agents/opensource-sanitizer.md +188 -0
- package/.agent/agents/performance-optimizer.md +392 -133
- package/.agent/agents/personas/athena-agent/agent.json +10 -0
- package/.agent/agents/personas/athena-agent/athena-backend-logic-architecture-profile.md +189 -0
- package/.agent/agents/personas/athena-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/athena-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/athena-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/athena-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/athena-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/athena-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/da-vinci-agent/agent.json +10 -0
- package/.agent/agents/personas/da-vinci-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/da-vinci-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/da-vinci-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/da-vinci-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/da-vinci-agent/da-vinci-frontend-ui-ux-design-profile.md +189 -0
- package/.agent/agents/personas/da-vinci-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/da-vinci-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/duong-tang-agent/agent.json +10 -0
- package/.agent/agents/personas/duong-tang-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/duong-tang-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/duong-tang-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/duong-tang-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/duong-tang-agent/tang-monk-quality-testing-documentation-profile.md +189 -0
- package/.agent/agents/personas/duong-tang-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/duong-tang-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/gia-cat-luong-agent/agent.json +10 -0
- package/.agent/agents/personas/gia-cat-luong-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/gia-cat-luong-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/gia-cat-luong-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/gia-cat-luong-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/gia-cat-luong-agent/kongming-research-strategy-analysis-profile.md +189 -0
- package/.agent/agents/personas/gia-cat-luong-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/gia-cat-luong-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/mihata-agent/agent.json +10 -0
- package/.agent/agents/personas/mihata-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/mihata-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/mihata-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/mihata-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/mihata-agent/mihata-multi-agent-orchestration-profile.md +189 -0
- package/.agent/agents/personas/mihata-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/mihata-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/tesla-agent/agent.json +10 -0
- package/.agent/agents/personas/tesla-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/tesla-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/tesla-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/tesla-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/tesla-agent/tesla-fullstack-system-optimization-profile.md +189 -0
- package/.agent/agents/personas/tesla-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/tesla-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/tu-ma-y-agent/agent.json +10 -0
- package/.agent/agents/personas/tu-ma-y-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/tu-ma-y-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/tu-ma-y-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/tu-ma-y-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/tu-ma-y-agent/simayi-feasibility-risk-control-profile.md +189 -0
- package/.agent/agents/personas/tu-ma-y-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/tu-ma-y-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/venti-agent/agent.json +10 -0
- package/.agent/agents/personas/venti-agent/context-files/agents.md +55 -0
- package/.agent/agents/personas/venti-agent/context-files/identity.md +23 -0
- package/.agent/agents/personas/venti-agent/context-files/soul.md +51 -0
- package/.agent/agents/personas/venti-agent/context-files/user-predefined.md +15 -0
- package/.agent/agents/personas/venti-agent/user-context-files/system/bootstrap.md +37 -0
- package/.agent/agents/personas/venti-agent/user-context-files/system/user.md +45 -0
- package/.agent/agents/personas/venti-agent/venti-learning-communication-mentoring-profile.md +189 -0
- package/.agent/agents/planner.md +212 -212
- package/.agent/agents/python-reviewer.md +98 -98
- package/.agent/agents/pytorch-build-resolver.md +120 -120
- package/.agent/agents/refactor-cleaner.md +85 -85
- package/.agent/agents/rust-build-resolver.md +148 -148
- package/.agent/agents/rust-reviewer.md +94 -94
- package/.agent/agents/security-reviewer.md +108 -108
- package/.agent/agents/tdd-guide.md +91 -91
- package/.agent/agents/typescript-reviewer.md +112 -112
- package/.agent/contexts/dev.md +20 -0
- package/.agent/contexts/research.md +26 -0
- package/.agent/contexts/review.md +22 -0
- package/.agent/hooks/hooks.json +395 -0
- package/.agent/hooks/readme.md +222 -0
- package/.agent/mcp-configs/mcp-servers.json +181 -0
- package/.agent/rules/common/agents.md +50 -0
- package/.agent/rules/common/code-review.md +124 -0
- package/.agent/rules/common/coding-style.md +48 -0
- package/.agent/rules/common/development-workflow.md +44 -0
- package/.agent/rules/common/git-workflow.md +24 -0
- package/.agent/rules/common/hooks.md +30 -0
- package/.agent/rules/common/patterns.md +31 -0
- package/.agent/rules/common/performance.md +55 -0
- package/.agent/rules/common/security.md +29 -0
- package/.agent/rules/common/testing.md +29 -0
- package/.agent/rules/cpp/coding-style.md +44 -0
- package/.agent/rules/cpp/hooks.md +39 -0
- package/.agent/rules/cpp/patterns.md +51 -0
- package/.agent/rules/cpp/security.md +51 -0
- package/.agent/rules/cpp/testing.md +44 -0
- package/.agent/rules/csharp/coding-style.md +72 -0
- package/.agent/rules/csharp/hooks.md +25 -0
- package/.agent/rules/csharp/patterns.md +50 -0
- package/.agent/rules/csharp/security.md +58 -0
- package/.agent/rules/csharp/testing.md +46 -0
- package/.agent/rules/dart/coding-style.md +159 -0
- package/.agent/rules/dart/hooks.md +66 -0
- package/.agent/rules/dart/patterns.md +261 -0
- package/.agent/rules/dart/security.md +135 -0
- package/.agent/rules/dart/testing.md +215 -0
- package/.agent/rules/golang/coding-style.md +32 -0
- package/.agent/rules/golang/hooks.md +17 -0
- package/.agent/rules/golang/patterns.md +45 -0
- package/.agent/rules/golang/security.md +34 -0
- package/.agent/rules/golang/testing.md +31 -0
- package/.agent/rules/java/coding-style.md +114 -0
- package/.agent/rules/java/hooks.md +18 -0
- package/.agent/rules/java/patterns.md +146 -0
- package/.agent/rules/java/security.md +100 -0
- package/.agent/rules/java/testing.md +131 -0
- package/.agent/rules/kotlin/coding-style.md +86 -0
- package/.agent/rules/kotlin/hooks.md +17 -0
- package/.agent/rules/kotlin/patterns.md +146 -0
- package/.agent/rules/kotlin/security.md +82 -0
- package/.agent/rules/kotlin/testing.md +128 -0
- package/.agent/rules/perl/coding-style.md +46 -0
- package/.agent/rules/perl/hooks.md +22 -0
- package/.agent/rules/perl/patterns.md +76 -0
- package/.agent/rules/perl/security.md +69 -0
- package/.agent/rules/perl/testing.md +54 -0
- package/.agent/rules/php/coding-style.md +40 -0
- package/.agent/rules/php/hooks.md +24 -0
- package/.agent/rules/php/patterns.md +33 -0
- package/.agent/rules/php/security.md +37 -0
- package/.agent/rules/php/testing.md +39 -0
- package/.agent/rules/python/coding-style.md +42 -0
- package/.agent/rules/python/hooks.md +19 -0
- package/.agent/rules/python/patterns.md +39 -0
- package/.agent/rules/python/security.md +30 -0
- package/.agent/rules/python/testing.md +38 -0
- package/.agent/rules/readme.md +111 -0
- package/.agent/rules/rust/coding-style.md +151 -0
- package/.agent/rules/rust/hooks.md +16 -0
- package/.agent/rules/rust/patterns.md +168 -0
- package/.agent/rules/rust/security.md +141 -0
- package/.agent/rules/rust/testing.md +154 -0
- package/.agent/rules/swift/coding-style.md +47 -0
- package/.agent/rules/swift/hooks.md +20 -0
- package/.agent/rules/swift/patterns.md +66 -0
- package/.agent/rules/swift/security.md +33 -0
- package/.agent/rules/swift/testing.md +45 -0
- package/.agent/rules/typescript/coding-style.md +199 -0
- package/.agent/rules/typescript/hooks.md +22 -0
- package/.agent/rules/typescript/patterns.md +52 -0
- package/.agent/rules/typescript/security.md +28 -0
- package/.agent/rules/typescript/testing.md +18 -0
- package/.agent/rules/web/coding-style.md +96 -0
- package/.agent/rules/web/design-quality.md +63 -0
- package/.agent/rules/web/hooks.md +120 -0
- package/.agent/rules/web/patterns.md +79 -0
- package/.agent/rules/web/performance.md +64 -0
- package/.agent/rules/web/security.md +57 -0
- package/.agent/rules/web/testing.md +55 -0
- package/.agent/rules/zh/agents.md +50 -0
- package/.agent/rules/zh/code-review.md +124 -0
- package/.agent/rules/zh/coding-style.md +48 -0
- package/.agent/rules/zh/development-workflow.md +44 -0
- package/.agent/rules/zh/git-workflow.md +24 -0
- package/.agent/rules/zh/hooks.md +30 -0
- package/.agent/rules/zh/patterns.md +31 -0
- package/.agent/rules/zh/performance.md +55 -0
- package/.agent/rules/zh/readme.md +108 -0
- package/.agent/rules/zh/security.md +29 -0
- package/.agent/rules/zh/testing.md +29 -0
- package/.agent/skills/agent-eval/SKILL.md +145 -0
- package/.agent/skills/agent-harness-construction/SKILL.md +73 -0
- package/.agent/skills/agent-payment-x402/SKILL.md +178 -0
- package/.agent/skills/agentic-engineering/SKILL.md +63 -0
- package/.agent/skills/ai-first-engineering/SKILL.md +51 -0
- package/.agent/skills/ai-regression-testing/SKILL.md +385 -0
- package/.agent/skills/android-clean-architecture/SKILL.md +339 -0
- package/.agent/skills/api-design/SKILL.md +523 -0
- package/.agent/skills/architecture-decision-records/SKILL.md +179 -0
- package/.agent/skills/article-writing/SKILL.md +79 -0
- package/.agent/skills/autonomous-agent-harness/SKILL.md +267 -0
- package/.agent/skills/autonomous-loops/SKILL.md +610 -0
- package/.agent/skills/backend-patterns/SKILL.md +598 -0
- package/.agent/skills/benchmark/SKILL.md +93 -0
- package/.agent/skills/blueprint/SKILL.md +105 -0
- package/.agent/skills/brand-voice/SKILL.md +97 -0
- package/.agent/skills/brand-voice/references/voice-profile-schema.md +55 -0
- package/.agent/skills/browser-qa/SKILL.md +87 -0
- package/.agent/skills/bun-runtime/SKILL.md +84 -0
- package/.agent/skills/canary-watch/SKILL.md +99 -0
- package/.agent/skills/carrier-relationship-management/SKILL.md +212 -0
- package/.agent/skills/ck/SKILL.md +147 -0
- package/.agent/skills/ck/commands/forget.mjs +44 -0
- package/.agent/skills/ck/commands/info.mjs +24 -0
- package/.agent/skills/ck/commands/init.mjs +143 -0
- package/.agent/skills/ck/commands/list.mjs +40 -0
- package/.agent/skills/ck/commands/migrate.mjs +202 -0
- package/.agent/skills/ck/commands/resume.mjs +36 -0
- package/.agent/skills/ck/commands/save.mjs +210 -0
- package/.agent/skills/ck/commands/shared.mjs +387 -0
- package/.agent/skills/ck/hooks/session-start.mjs +224 -0
- package/.agent/skills/claude-api/SKILL.md +337 -0
- package/.agent/skills/claude-devfleet/SKILL.md +103 -0
- package/.agent/skills/click-path-audit/SKILL.md +244 -0
- package/.agent/skills/clickhouse-io/SKILL.md +439 -0
- package/.agent/skills/codebase-onboarding/SKILL.md +233 -0
- package/.agent/skills/coding-standards/SKILL.md +530 -0
- package/.agent/skills/compose-multiplatform-patterns/SKILL.md +299 -0
- package/.agent/skills/configure-ecc/SKILL.md +367 -0
- package/.agent/skills/connections-optimizer/SKILL.md +189 -0
- package/.agent/skills/content-engine/SKILL.md +131 -0
- package/.agent/skills/content-hash-cache-pattern/SKILL.md +161 -0
- package/.agent/skills/context-budget/SKILL.md +135 -0
- package/.agent/skills/continuous-agent-loop/SKILL.md +45 -0
- package/.agent/skills/continuous-learning/SKILL.md +119 -0
- package/.agent/skills/continuous-learning/config.json +18 -0
- package/.agent/skills/continuous-learning/evaluate-session.sh +69 -0
- package/.agent/skills/continuous-learning-v2/SKILL.md +365 -0
- package/.agent/skills/continuous-learning-v2/agents/observer-loop.sh +271 -0
- package/.agent/skills/continuous-learning-v2/agents/observer.md +198 -0
- package/.agent/skills/continuous-learning-v2/agents/session-guardian.sh +150 -0
- package/.agent/skills/continuous-learning-v2/agents/start-observer.sh +244 -0
- package/.agent/skills/continuous-learning-v2/config.json +8 -0
- package/.agent/skills/continuous-learning-v2/hooks/observe.sh +428 -0
- package/.agent/skills/continuous-learning-v2/scripts/detect-project.sh +228 -0
- package/.agent/skills/continuous-learning-v2/scripts/instinct-cli.py +1426 -0
- package/.agent/skills/continuous-learning-v2/scripts/test-parse-instinct.py +984 -0
- package/.agent/skills/cost-aware-llm-pipeline/SKILL.md +183 -0
- package/.agent/skills/cpp-coding-standards/SKILL.md +723 -0
- package/.agent/skills/cpp-testing/SKILL.md +324 -0
- package/.agent/skills/crosspost/SKILL.md +111 -0
- package/.agent/skills/csharp-testing/SKILL.md +321 -0
- package/.agent/skills/customer-billing-ops/SKILL.md +140 -0
- package/.agent/skills/customs-trade-compliance/SKILL.md +263 -0
- package/.agent/skills/dart-flutter-patterns/SKILL.md +563 -0
- package/.agent/skills/data-scraper-agent/SKILL.md +764 -0
- package/.agent/skills/database-migrations/SKILL.md +429 -0
- package/.agent/skills/deep-research/SKILL.md +155 -0
- package/.agent/skills/deployment-patterns/SKILL.md +427 -0
- package/.agent/skills/design-system/SKILL.md +82 -0
- package/.agent/skills/django-patterns/SKILL.md +734 -0
- package/.agent/skills/django-security/SKILL.md +593 -0
- package/.agent/skills/django-tdd/SKILL.md +729 -0
- package/.agent/skills/django-verification/SKILL.md +469 -0
- package/.agent/skills/dmux-workflows/SKILL.md +191 -0
- package/.agent/skills/docker-patterns/SKILL.md +364 -0
- package/.agent/skills/documentation-lookup/SKILL.md +90 -0
- package/.agent/skills/dotnet-patterns/SKILL.md +321 -0
- package/.agent/skills/e2e-testing/SKILL.md +326 -0
- package/.agent/skills/energy-procurement/SKILL.md +228 -0
- package/.agent/skills/enterprise-agent-ops/SKILL.md +50 -0
- package/.agent/skills/eval-harness/SKILL.md +270 -0
- package/.agent/skills/exa-search/SKILL.md +103 -0
- package/.agent/skills/fal-ai-media/SKILL.md +284 -0
- package/.agent/skills/flutter-dart-code-review/SKILL.md +435 -0
- package/.agent/skills/foundation-models-on-device/SKILL.md +243 -0
- package/.agent/skills/frontend-patterns/SKILL.md +642 -0
- package/.agent/skills/frontend-slides/SKILL.md +184 -0
- package/.agent/skills/frontend-slides/style-presets.md +330 -0
- package/.agent/skills/gan-style-harness/SKILL.md +278 -0
- package/.agent/skills/git-workflow/SKILL.md +715 -0
- package/.agent/skills/golang-patterns/SKILL.md +674 -0
- package/.agent/skills/golang-testing/SKILL.md +720 -0
- package/.agent/skills/google-workspace-ops/SKILL.md +95 -0
- package/.agent/skills/healthcare-cdss-patterns/SKILL.md +245 -0
- package/.agent/skills/healthcare-emr-patterns/SKILL.md +159 -0
- package/.agent/skills/healthcare-eval-harness/SKILL.md +207 -0
- package/.agent/skills/healthcare-phi-compliance/SKILL.md +145 -0
- package/.agent/skills/hexagonal-architecture/SKILL.md +276 -0
- package/.agent/skills/inventory-demand-planning/SKILL.md +247 -0
- package/.agent/skills/investor-materials/SKILL.md +96 -0
- package/.agent/skills/investor-outreach/SKILL.md +91 -0
- package/.agent/skills/iterative-retrieval/SKILL.md +211 -0
- package/.agent/skills/java-coding-standards/SKILL.md +147 -0
- package/.agent/skills/jira-integration/SKILL.md +293 -0
- package/.agent/skills/jpa-patterns/SKILL.md +151 -0
- package/.agent/skills/kotlin-coroutines-flows/SKILL.md +284 -0
- package/.agent/skills/kotlin-exposed-patterns/SKILL.md +719 -0
- package/.agent/skills/kotlin-ktor-patterns/SKILL.md +689 -0
- package/.agent/skills/kotlin-patterns/SKILL.md +711 -0
- package/.agent/skills/kotlin-testing/SKILL.md +824 -0
- package/.agent/skills/laravel-patterns/SKILL.md +415 -0
- package/.agent/skills/laravel-plugin-discovery/SKILL.md +229 -0
- package/.agent/skills/laravel-security/SKILL.md +285 -0
- package/.agent/skills/laravel-tdd/SKILL.md +283 -0
- package/.agent/skills/laravel-verification/SKILL.md +179 -0
- package/.agent/skills/lead-intelligence/SKILL.md +321 -0
- package/.agent/skills/lead-intelligence/agents/enrichment-agent.md +85 -0
- package/.agent/skills/lead-intelligence/agents/mutual-mapper.md +75 -0
- package/.agent/skills/lead-intelligence/agents/outreach-drafter.md +98 -0
- package/.agent/skills/lead-intelligence/agents/signal-scorer.md +60 -0
- package/.agent/skills/liquid-glass-design/SKILL.md +279 -0
- package/.agent/skills/logistics-exception-management/SKILL.md +222 -0
- package/.agent/skills/manim-video/SKILL.md +89 -0
- package/.agent/skills/manim-video/assets/network-graph-scene.py +52 -0
- package/.agent/skills/market-research/SKILL.md +75 -0
- package/.agent/skills/mcp-builder/SKILL.md +173 -113
- package/.agent/skills/mcp-builder/license.txt +202 -0
- package/.agent/skills/mcp-builder/reference/evaluation.md +602 -0
- package/.agent/skills/mcp-builder/reference/mcp-best-practices.md +249 -0
- package/.agent/skills/mcp-builder/reference/node-mcp-server.md +970 -0
- package/.agent/skills/mcp-builder/reference/python-mcp-server.md +719 -0
- package/.agent/skills/mcp-builder/scripts/connections.py +151 -0
- package/.agent/skills/mcp-builder/scripts/evaluation.py +373 -0
- package/.agent/skills/mcp-builder/scripts/example-evaluation.xml +22 -0
- package/.agent/skills/mcp-builder/scripts/requirements.txt +2 -0
- package/.agent/skills/mcp-server-patterns/SKILL.md +67 -0
- package/.agent/skills/nanoclaw-repl/SKILL.md +33 -0
- package/.agent/skills/nestjs-patterns/SKILL.md +230 -0
- package/.agent/skills/nextjs-turbopack/SKILL.md +44 -0
- package/.agent/skills/nutrient-document-processing/SKILL.md +167 -0
- package/.agent/skills/nuxt4-patterns/SKILL.md +100 -0
- package/.agent/skills/openclaw-persona-forge/SKILL.md +296 -0
- package/.agent/skills/openclaw-persona-forge/gacha.py +224 -0
- package/.agent/skills/openclaw-persona-forge/gacha.sh +5 -0
- package/.agent/skills/openclaw-persona-forge/references/avatar-style.md +124 -0
- package/.agent/skills/openclaw-persona-forge/references/boundary-rules.md +53 -0
- package/.agent/skills/openclaw-persona-forge/references/error-handling.md +53 -0
- package/.agent/skills/openclaw-persona-forge/references/identity-tension.md +48 -0
- package/.agent/skills/openclaw-persona-forge/references/naming-system.md +39 -0
- package/.agent/skills/openclaw-persona-forge/references/output-template.md +166 -0
- package/.agent/skills/opensource-pipeline/SKILL.md +255 -0
- package/.agent/skills/perl-patterns/SKILL.md +504 -0
- package/.agent/skills/perl-security/SKILL.md +503 -0
- package/.agent/skills/perl-testing/SKILL.md +475 -0
- package/.agent/skills/plankton-code-quality/SKILL.md +236 -0
- package/.agent/skills/postgres-patterns/SKILL.md +147 -0
- package/.agent/skills/product-lens/SKILL.md +85 -0
- package/.agent/skills/production-scheduling/SKILL.md +238 -0
- package/.agent/skills/project-flow-ops/SKILL.md +111 -0
- package/.agent/skills/project-guidelines-example/SKILL.md +349 -0
- package/.agent/skills/prompt-optimizer/SKILL.md +397 -0
- package/.agent/skills/python-patterns/SKILL.md +622 -313
- package/.agent/skills/python-testing/SKILL.md +816 -0
- package/.agent/skills/pytorch-patterns/SKILL.md +396 -0
- package/.agent/skills/quality-nonconformance/SKILL.md +260 -0
- package/.agent/skills/ralphinho-rfc-pipeline/SKILL.md +67 -0
- package/.agent/skills/regex-vs-llm-structured-text/SKILL.md +220 -0
- package/.agent/skills/remotion-video-creation/SKILL.md +43 -0
- package/.agent/skills/remotion-video-creation/rules/3d.md +86 -0
- package/.agent/skills/remotion-video-creation/rules/animations.md +29 -0
- package/.agent/skills/remotion-video-creation/rules/assets/charts-bar-chart.tsx +173 -0
- package/.agent/skills/remotion-video-creation/rules/assets/text-animations-typewriter.tsx +100 -0
- package/.agent/skills/remotion-video-creation/rules/assets/text-animations-word-highlight.tsx +108 -0
- package/.agent/skills/remotion-video-creation/rules/assets.md +78 -0
- package/.agent/skills/remotion-video-creation/rules/audio.md +172 -0
- package/.agent/skills/remotion-video-creation/rules/calculate-metadata.md +104 -0
- package/.agent/skills/remotion-video-creation/rules/can-decode.md +75 -0
- package/.agent/skills/remotion-video-creation/rules/charts.md +58 -0
- package/.agent/skills/remotion-video-creation/rules/compositions.md +146 -0
- package/.agent/skills/remotion-video-creation/rules/display-captions.md +126 -0
- package/.agent/skills/remotion-video-creation/rules/extract-frames.md +229 -0
- package/.agent/skills/remotion-video-creation/rules/fonts.md +152 -0
- package/.agent/skills/remotion-video-creation/rules/get-audio-duration.md +58 -0
- package/.agent/skills/remotion-video-creation/rules/get-video-dimensions.md +68 -0
- package/.agent/skills/remotion-video-creation/rules/get-video-duration.md +58 -0
- package/.agent/skills/remotion-video-creation/rules/gifs.md +138 -0
- package/.agent/skills/remotion-video-creation/rules/images.md +130 -0
- package/.agent/skills/remotion-video-creation/rules/import-srt-captions.md +67 -0
- package/.agent/skills/remotion-video-creation/rules/lottie.md +67 -0
- package/.agent/skills/remotion-video-creation/rules/measuring-dom-nodes.md +34 -0
- package/.agent/skills/remotion-video-creation/rules/measuring-text.md +143 -0
- package/.agent/skills/remotion-video-creation/rules/sequencing.md +106 -0
- package/.agent/skills/remotion-video-creation/rules/tailwind.md +11 -0
- package/.agent/skills/remotion-video-creation/rules/text-animations.md +20 -0
- package/.agent/skills/remotion-video-creation/rules/timing.md +179 -0
- package/.agent/skills/remotion-video-creation/rules/transcribe-captions.md +19 -0
- package/.agent/skills/remotion-video-creation/rules/transitions.md +122 -0
- package/.agent/skills/remotion-video-creation/rules/trimming.md +52 -0
- package/.agent/skills/remotion-video-creation/rules/videos.md +171 -0
- package/.agent/skills/repo-scan/SKILL.md +78 -0
- package/.agent/skills/returns-reverse-logistics/SKILL.md +240 -0
- package/.agent/skills/rules-distill/SKILL.md +264 -0
- package/.agent/skills/rules-distill/scripts/scan-rules.sh +58 -0
- package/.agent/skills/rules-distill/scripts/scan-skills.sh +129 -0
- package/.agent/skills/rust-patterns/SKILL.md +499 -0
- package/.agent/skills/rust-testing/SKILL.md +500 -0
- package/.agent/skills/safety-guard/SKILL.md +75 -0
- package/.agent/skills/santa-method/SKILL.md +306 -0
- package/.agent/skills/search-first/SKILL.md +161 -0
- package/.agent/skills/security-review/SKILL.md +495 -0
- package/.agent/skills/security-review/cloud-infrastructure-security.md +361 -0
- package/.agent/skills/security-scan/SKILL.md +165 -0
- package/.agent/skills/skill-comply/SKILL.md +58 -0
- package/.agent/skills/skill-comply/fixtures/compliant-trace.jsonl +5 -0
- package/.agent/skills/skill-comply/fixtures/noncompliant-trace.jsonl +3 -0
- package/.agent/skills/skill-comply/fixtures/tdd-spec.yaml +44 -0
- package/.agent/skills/skill-comply/prompts/classifier.md +24 -0
- package/.agent/skills/skill-comply/prompts/scenario-generator.md +62 -0
- package/.agent/skills/skill-comply/prompts/spec-generator.md +42 -0
- package/.agent/skills/skill-comply/pyproject.toml +15 -0
- package/.agent/skills/skill-comply/scripts/classifier.py +85 -0
- package/.agent/skills/skill-comply/scripts/grader.py +122 -0
- package/.agent/skills/skill-comply/scripts/init.py +0 -0
- package/.agent/skills/skill-comply/scripts/parser.py +107 -0
- package/.agent/skills/skill-comply/scripts/report.py +170 -0
- package/.agent/skills/skill-comply/scripts/run.py +127 -0
- package/.agent/skills/skill-comply/scripts/runner.py +161 -0
- package/.agent/skills/skill-comply/scripts/scenario-generator.py +70 -0
- package/.agent/skills/skill-comply/scripts/spec-generator.py +72 -0
- package/.agent/skills/skill-comply/scripts/utils.py +13 -0
- package/.agent/skills/skill-comply/tests/test-grader.py +137 -0
- package/.agent/skills/skill-comply/tests/test-parser.py +90 -0
- package/.agent/skills/skill-creator/SKILL.md +485 -0
- package/.agent/skills/skill-creator/agents/analyzer.md +274 -0
- package/.agent/skills/skill-creator/agents/comparator.md +202 -0
- package/.agent/skills/skill-creator/agents/grader.md +223 -0
- package/.agent/skills/skill-creator/assets/eval-review.html +146 -0
- package/.agent/skills/skill-creator/eval-viewer/generate-review.py +471 -0
- package/.agent/skills/skill-creator/eval-viewer/viewer.html +1325 -0
- package/.agent/skills/skill-creator/license.txt +202 -0
- package/.agent/skills/skill-creator/references/schemas.md +430 -0
- package/.agent/skills/skill-creator/scripts/aggregate-benchmark.py +401 -0
- package/.agent/skills/skill-creator/scripts/generate-report.py +326 -0
- package/.agent/skills/skill-creator/scripts/improve-description.py +247 -0
- package/.agent/skills/skill-creator/scripts/init.py +0 -0
- package/.agent/skills/skill-creator/scripts/package-skill.py +136 -0
- package/.agent/skills/skill-creator/scripts/quick-validate.py +103 -0
- package/.agent/skills/skill-creator/scripts/run-eval.py +310 -0
- package/.agent/skills/skill-creator/scripts/run-loop.py +328 -0
- package/.agent/skills/skill-creator/scripts/utils.py +47 -0
- package/.agent/skills/skill-stocktake/SKILL.md +193 -0
- package/.agent/skills/skill-stocktake/scripts/quick-diff.sh +87 -0
- package/.agent/skills/skill-stocktake/scripts/save-results.sh +56 -0
- package/.agent/skills/skill-stocktake/scripts/scan.sh +170 -0
- package/.agent/skills/social-graph-ranker/SKILL.md +154 -0
- package/.agent/skills/springboot-patterns/SKILL.md +314 -0
- package/.agent/skills/springboot-security/SKILL.md +272 -0
- package/.agent/skills/springboot-tdd/SKILL.md +158 -0
- package/.agent/skills/springboot-verification/SKILL.md +231 -0
- package/.agent/skills/strategic-compact/SKILL.md +131 -0
- package/.agent/skills/strategic-compact/suggest-compact.sh +54 -0
- package/.agent/skills/swift-actor-persistence/SKILL.md +143 -0
- package/.agent/skills/swift-concurrency-6-2/SKILL.md +216 -0
- package/.agent/skills/swift-protocol-di-testing/SKILL.md +190 -0
- package/.agent/skills/swiftui-patterns/SKILL.md +259 -0
- package/.agent/skills/tdd-workflow/SKILL.md +412 -98
- package/.agent/skills/team-builder/SKILL.md +168 -0
- package/.agent/skills/token-budget-advisor/SKILL.md +133 -0
- package/.agent/skills/ui-demo/SKILL.md +465 -0
- package/.agent/skills/ui-ux-pro-max/data/charts.csv +26 -26
- package/.agent/skills/ui-ux-pro-max/data/colors.csv +97 -97
- package/.agent/skills/ui-ux-pro-max/data/landing.csv +28 -28
- package/.agent/skills/ui-ux-pro-max/data/products.csv +96 -96
- package/.agent/skills/ui-ux-pro-max/data/stacks/flutter.csv +53 -53
- package/.agent/skills/ui-ux-pro-max/data/stacks/html-tailwind.csv +56 -56
- package/.agent/skills/ui-ux-pro-max/data/stacks/nextjs.csv +53 -53
- package/.agent/skills/ui-ux-pro-max/data/stacks/react-native.csv +52 -52
- package/.agent/skills/ui-ux-pro-max/data/stacks/react.csv +54 -54
- package/.agent/skills/ui-ux-pro-max/data/stacks/svelte.csv +54 -54
- package/.agent/skills/ui-ux-pro-max/data/stacks/swiftui.csv +51 -51
- package/.agent/skills/ui-ux-pro-max/data/stacks/vue.csv +50 -50
- package/.agent/skills/ui-ux-pro-max/data/styles.csv +68 -68
- package/.agent/skills/ui-ux-pro-max/data/ux-guidelines.csv +99 -99
- package/.agent/skills/ui-ux-pro-max/scripts/search.py +114 -114
- package/.agent/skills/verification-loop/SKILL.md +126 -0
- package/.agent/skills/video-editing/SKILL.md +310 -0
- package/.agent/skills/videodb/SKILL.md +374 -0
- package/.agent/skills/videodb/reference/api-reference.md +550 -0
- package/.agent/skills/videodb/reference/capture-reference.md +407 -0
- package/.agent/skills/videodb/reference/capture.md +101 -0
- package/.agent/skills/videodb/reference/editor.md +443 -0
- package/.agent/skills/videodb/reference/generative.md +331 -0
- package/.agent/skills/videodb/reference/rtstream-reference.md +564 -0
- package/.agent/skills/videodb/reference/rtstream.md +65 -0
- package/.agent/skills/videodb/reference/search.md +230 -0
- package/.agent/skills/videodb/reference/streaming.md +406 -0
- package/.agent/skills/videodb/reference/use-cases.md +118 -0
- package/.agent/skills/videodb/scripts/ws-listener.py +282 -0
- package/.agent/skills/visa-doc-translate/SKILL.md +117 -0
- package/.agent/skills/visa-doc-translate/readme.md +86 -0
- package/.agent/skills/workspace-surface-audit/SKILL.md +125 -0
- package/.agent/skills/x-api/SKILL.md +230 -0
- package/.agent/tasks/two-track-merge-contract.md +29 -0
- package/.agent/workflows/aside.md +164 -164
- package/.agent/workflows/build-fix.md +62 -62
- package/.agent/workflows/checkpoint.md +74 -74
- package/.agent/workflows/claw.md +23 -51
- package/.agent/workflows/clean-memory.md +34 -0
- package/.agent/workflows/code-review.md +289 -40
- package/.agent/workflows/context-budget.md +23 -29
- package/.agent/workflows/cpp-build.md +173 -173
- package/.agent/workflows/cpp-review.md +132 -132
- package/.agent/workflows/cpp-test.md +251 -251
- package/.agent/workflows/devfleet.md +23 -92
- package/.agent/workflows/docs.md +23 -31
- package/.agent/workflows/e2e.md +268 -365
- package/.agent/workflows/eval.md +23 -120
- package/.agent/workflows/evolve.md +178 -178
- package/.agent/workflows/flutter-build.md +164 -0
- package/.agent/workflows/flutter-review.md +116 -0
- package/.agent/workflows/flutter-test.md +144 -0
- package/.agent/workflows/gan-build.md +99 -0
- package/.agent/workflows/gan-design.md +35 -0
- package/.agent/workflows/go-build.md +183 -183
- package/.agent/workflows/go-review.md +148 -148
- package/.agent/workflows/go-test.md +268 -268
- package/.agent/workflows/gradle-build.md +70 -70
- package/.agent/workflows/harness-audit.md +73 -71
- package/.agent/workflows/instinct-export.md +66 -66
- package/.agent/workflows/instinct-import.md +114 -114
- package/.agent/workflows/instinct-status.md +59 -59
- package/.agent/workflows/jira.md +106 -0
- package/.agent/workflows/kotlin-build.md +174 -174
- package/.agent/workflows/kotlin-review.md +140 -140
- package/.agent/workflows/kotlin-test.md +312 -312
- package/.agent/workflows/learn-eval.md +116 -116
- package/.agent/workflows/learn.md +70 -70
- package/.agent/workflows/loop-start.md +32 -32
- package/.agent/workflows/loop-status.md +24 -24
- package/.agent/workflows/model-route.md +26 -26
- package/.agent/workflows/multi-backend.md +158 -158
- package/.agent/workflows/multi-execute.md +315 -315
- package/.agent/workflows/multi-frontend.md +158 -158
- package/.agent/workflows/multi-plan.md +268 -268
- package/.agent/workflows/multi-workflow.md +191 -191
- package/.agent/workflows/orchestrate.md +135 -231
- package/.agent/workflows/plan.md +117 -115
- package/.agent/workflows/pm2.md +272 -272
- package/.agent/workflows/projects.md +39 -39
- package/.agent/workflows/promote.md +41 -41
- package/.agent/workflows/prompt-optimize.md +23 -38
- package/.agent/workflows/prp-commit.md +112 -0
- package/.agent/workflows/prp-implement.md +385 -0
- package/.agent/workflows/prp-plan.md +502 -0
- package/.agent/workflows/prp-pr.md +184 -0
- package/.agent/workflows/prp-prd.md +447 -0
- package/.agent/workflows/prune.md +31 -31
- package/.agent/workflows/python-review.md +297 -297
- package/.agent/workflows/quality-gate.md +29 -29
- package/.agent/workflows/refactor-clean.md +80 -80
- package/.agent/workflows/resume-session.md +156 -156
- package/.agent/workflows/rules-distill.md +20 -11
- package/.agent/workflows/rust-build.md +187 -187
- package/.agent/workflows/rust-review.md +142 -142
- package/.agent/workflows/rust-test.md +308 -308
- package/.agent/workflows/santa-loop.md +175 -0
- package/.agent/workflows/save-session.md +275 -275
- package/.agent/workflows/sessions.md +333 -333
- package/.agent/workflows/setup-pm.md +80 -80
- package/.agent/workflows/skill-create.md +174 -174
- package/.agent/workflows/skill-health.md +54 -54
- package/.agent/workflows/tdd.md +231 -328
- package/.agent/workflows/test-coverage.md +69 -69
- package/.agent/workflows/update-codemaps.md +72 -72
- package/.agent/workflows/update-docs.md +84 -84
- package/.agent/workflows/verify.md +23 -59
- package/LICENSE +176 -176
- package/README.md +28 -20
- package/RELEASE.md +32 -36
- package/package.json +87 -79
- package/scripts/release-check.js +55 -55
- package/src/bin/cli.js +399 -53
- package/src/lib/installer.js +360 -114
- package/src/lib/manifests/stacks.js +122 -0
- package/src/lib/slash-commands.js +28 -0
- package/src/templates/claude/CLAUDE.en.md +42 -0
- package/src/templates/claude/CLAUDE.md +42 -0
- package/src/templates/claude/CLAUDE.vi.md +42 -0
- package/src/templates/codex/AGENTS.en.md +40 -0
- package/src/templates/codex/AGENTS.md +40 -0
- package/src/templates/codex/AGENTS.vi.md +40 -0
- package/src/templates/cursor/pilo-masterkit.mdc +20 -0
- package/src/templates/gemini/GEMINI.en.md +56 -0
- package/src/templates/gemini/GEMINI.md +56 -0
- package/src/templates/gemini/GEMINI.vi.md +56 -0
- package/src/templates/github/copilot-instructions.md +16 -0
|
@@ -0,0 +1,178 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: agent-payment-x402
|
|
3
|
+
description: Add x402 payment execution to AI agents — per-task budgets, spending controls, and non-custodial wallets via MCP tools. Use when agents need to pay for APIs, services, or other agents.
|
|
4
|
+
origin: community
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# Agent Payment Execution (x402)
|
|
8
|
+
|
|
9
|
+
Enable AI agents to make autonomous payments with built-in spending controls. Uses the x402 HTTP payment protocol and MCP tools so agents can pay for external services, APIs, or other agents without custodial risk.
|
|
10
|
+
|
|
11
|
+
## When to Use
|
|
12
|
+
|
|
13
|
+
Use when: your agent needs to pay for an API call, purchase a service, settle with another agent, enforce per-task spending limits, or manage a non-custodial wallet. Pairs naturally with cost-aware-llm-pipeline and security-review skills.
|
|
14
|
+
|
|
15
|
+
## How It Works
|
|
16
|
+
|
|
17
|
+
### x402 Protocol
|
|
18
|
+
x402 extends HTTP 402 (Payment Required) into a machine-negotiable flow. When a server returns `402`, the agent's payment tool automatically negotiates price, checks budget, signs a transaction, and retries — no human in the loop.
|
|
19
|
+
|
|
20
|
+
### Spending Controls
|
|
21
|
+
Every payment tool call enforces a `SpendingPolicy`:
|
|
22
|
+
- **Per-task budget** — max spend for a single agent action
|
|
23
|
+
- **Per-session budget** — cumulative limit across an entire session
|
|
24
|
+
- **Allowlisted recipients** — restrict which addresses/services the agent can pay
|
|
25
|
+
- **Rate limits** — max transactions per minute/hour
|
|
26
|
+
|
|
27
|
+
### Non-Custodial Wallets
|
|
28
|
+
Agents hold their own keys via ERC-4337 smart accounts. The orchestrator sets policy before delegation; the agent can only spend within bounds. No pooled funds, no custodial risk.
|
|
29
|
+
|
|
30
|
+
## MCP Integration
|
|
31
|
+
|
|
32
|
+
The payment layer exposes standard MCP tools that slot into any Claude Code or agent harness setup.
|
|
33
|
+
|
|
34
|
+
> **Security note**: Always pin the package version. This tool manages private keys — unpinned `npx` installs introduce supply-chain risk.
|
|
35
|
+
|
|
36
|
+
```json
|
|
37
|
+
{
|
|
38
|
+
"mcpServers": {
|
|
39
|
+
"agentpay": {
|
|
40
|
+
"command": "npx",
|
|
41
|
+
"args": ["agentwallet-sdk@6.0.0"]
|
|
42
|
+
}
|
|
43
|
+
}
|
|
44
|
+
}
|
|
45
|
+
```
|
|
46
|
+
|
|
47
|
+
### Available Tools (agent-callable)
|
|
48
|
+
|
|
49
|
+
| Tool | Purpose |
|
|
50
|
+
|------|---------|
|
|
51
|
+
| `get_balance` | Check agent wallet balance |
|
|
52
|
+
| `send_payment` | Send payment to address or ENS |
|
|
53
|
+
| `check_spending` | Query remaining budget |
|
|
54
|
+
| `list_transactions` | Audit trail of all payments |
|
|
55
|
+
|
|
56
|
+
> **Note**: Spending policy is set by the **orchestrator** before delegating to the agent — not by the agent itself. This prevents agents from escalating their own spending limits. Configure policy via `set_policy` in your orchestration layer or pre-task hook, never as an agent-callable tool.
|
|
57
|
+
|
|
58
|
+
## Examples
|
|
59
|
+
|
|
60
|
+
### Budget enforcement in an MCP client
|
|
61
|
+
|
|
62
|
+
When building an orchestrator that calls the agentpay MCP server, enforce budgets before dispatching paid tool calls.
|
|
63
|
+
|
|
64
|
+
> **Prerequisites**: Install the package before adding the MCP config — `npx` without `-y` will prompt for confirmation in non-interactive environments, causing the server to hang: `npm install -g agentwallet-sdk@6.0.0`
|
|
65
|
+
|
|
66
|
+
```typescript
|
|
67
|
+
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
|
|
68
|
+
import { StdioClientTransport } from "@modelcontextprotocol/sdk/client/stdio.js";
|
|
69
|
+
|
|
70
|
+
async function main() {
|
|
71
|
+
// 1. Validate credentials before constructing the transport.
|
|
72
|
+
// A missing key must fail immediately — never let the subprocess start without auth.
|
|
73
|
+
const walletKey = process.env.WALLET_PRIVATE_KEY;
|
|
74
|
+
if (!walletKey) {
|
|
75
|
+
throw new Error("WALLET_PRIVATE_KEY is not set — refusing to start payment server");
|
|
76
|
+
}
|
|
77
|
+
|
|
78
|
+
// Connect to the agentpay MCP server via stdio transport.
|
|
79
|
+
// Whitelist only the env vars the server needs — never forward all of process.env
|
|
80
|
+
// to a third-party subprocess that manages private keys.
|
|
81
|
+
const transport = new StdioClientTransport({
|
|
82
|
+
command: "npx",
|
|
83
|
+
args: ["agentwallet-sdk@6.0.0"],
|
|
84
|
+
env: {
|
|
85
|
+
PATH: process.env.PATH ?? "",
|
|
86
|
+
NODE_ENV: process.env.NODE_ENV ?? "production",
|
|
87
|
+
WALLET_PRIVATE_KEY: walletKey,
|
|
88
|
+
},
|
|
89
|
+
});
|
|
90
|
+
const agentpay = new Client({ name: "orchestrator", version: "1.0.0" });
|
|
91
|
+
await agentpay.connect(transport);
|
|
92
|
+
|
|
93
|
+
// 2. Set spending policy before delegating to the agent.
|
|
94
|
+
// Always verify success — a silent failure means no controls are active.
|
|
95
|
+
const policyResult = await agentpay.callTool({
|
|
96
|
+
name: "set_policy",
|
|
97
|
+
arguments: {
|
|
98
|
+
per_task_budget: 0.50,
|
|
99
|
+
per_session_budget: 5.00,
|
|
100
|
+
allowlisted_recipients: ["api.example.com"],
|
|
101
|
+
},
|
|
102
|
+
});
|
|
103
|
+
if (policyResult.isError) {
|
|
104
|
+
throw new Error(
|
|
105
|
+
`Failed to set spending policy — do not delegate: ${JSON.stringify(policyResult.content)}`
|
|
106
|
+
);
|
|
107
|
+
}
|
|
108
|
+
|
|
109
|
+
// 3. Use preToolCheck before any paid action
|
|
110
|
+
await preToolCheck(agentpay, 0.01);
|
|
111
|
+
}
|
|
112
|
+
|
|
113
|
+
// Pre-tool hook: fail-closed budget enforcement with four distinct error paths.
|
|
114
|
+
async function preToolCheck(agentpay: Client, apiCost: number): Promise<void> {
|
|
115
|
+
// Path 1: Reject invalid input (NaN/Infinity bypass the < comparison)
|
|
116
|
+
if (!Number.isFinite(apiCost) || apiCost < 0) {
|
|
117
|
+
throw new Error(`Invalid apiCost: ${apiCost} — action blocked`);
|
|
118
|
+
}
|
|
119
|
+
|
|
120
|
+
// Path 2: Transport/connectivity failure
|
|
121
|
+
let result;
|
|
122
|
+
try {
|
|
123
|
+
result = await agentpay.callTool({ name: "check_spending" });
|
|
124
|
+
} catch (err) {
|
|
125
|
+
throw new Error(`Payment service unreachable — action blocked: ${err}`);
|
|
126
|
+
}
|
|
127
|
+
|
|
128
|
+
// Path 3: Tool returned an error (e.g., auth failure, wallet not initialised)
|
|
129
|
+
if (result.isError) {
|
|
130
|
+
throw new Error(
|
|
131
|
+
`check_spending failed — action blocked: ${JSON.stringify(result.content)}`
|
|
132
|
+
);
|
|
133
|
+
}
|
|
134
|
+
|
|
135
|
+
// Path 4: Parse and validate the response shape
|
|
136
|
+
let remaining: number;
|
|
137
|
+
try {
|
|
138
|
+
const parsed = JSON.parse(
|
|
139
|
+
(result.content as Array<{ text: string }>)[0].text
|
|
140
|
+
);
|
|
141
|
+
if (!Number.isFinite(parsed?.remaining)) {
|
|
142
|
+
throw new TypeError("missing or non-finite 'remaining' field");
|
|
143
|
+
}
|
|
144
|
+
remaining = parsed.remaining;
|
|
145
|
+
} catch (err) {
|
|
146
|
+
throw new Error(
|
|
147
|
+
`check_spending returned unexpected format — action blocked: ${err}`
|
|
148
|
+
);
|
|
149
|
+
}
|
|
150
|
+
|
|
151
|
+
// Path 5: Budget exceeded
|
|
152
|
+
if (remaining < apiCost) {
|
|
153
|
+
throw new Error(
|
|
154
|
+
`Budget exceeded: need $${apiCost} but only $${remaining} remaining`
|
|
155
|
+
);
|
|
156
|
+
}
|
|
157
|
+
}
|
|
158
|
+
|
|
159
|
+
main().catch((err) => {
|
|
160
|
+
console.error(err);
|
|
161
|
+
process.exitCode = 1;
|
|
162
|
+
});
|
|
163
|
+
```
|
|
164
|
+
|
|
165
|
+
## Best Practices
|
|
166
|
+
|
|
167
|
+
- **Set budgets before delegation**: When spawning sub-agents, attach a SpendingPolicy via your orchestration layer. Never give an agent unlimited spend.
|
|
168
|
+
- **Pin your dependencies**: Always specify an exact version in your MCP config (e.g., `agentwallet-sdk@6.0.0`). Verify package integrity before deploying to production.
|
|
169
|
+
- **Audit trails**: Use `list_transactions` in post-task hooks to log what was spent and why.
|
|
170
|
+
- **Fail closed**: If the payment tool is unreachable, block the paid action — don't fall back to unmetered access.
|
|
171
|
+
- **Pair with security-review**: Payment tools are high-privilege. Apply the same scrutiny as shell access.
|
|
172
|
+
- **Test with testnets first**: Use Base Sepolia for development; switch to Base mainnet for production.
|
|
173
|
+
|
|
174
|
+
## Production Reference
|
|
175
|
+
|
|
176
|
+
- **npm**: [`agentwallet-sdk`](https://www.npmjs.com/package/agentwallet-sdk)
|
|
177
|
+
- **Merged into NVIDIA NeMo Agent Toolkit**: [PR #17](https://github.com/NVIDIA/NeMo-Agent-Toolkit-Examples/pull/17) — x402 payment tool for NVIDIA's agent examples
|
|
178
|
+
- **Protocol spec**: [x402.org](https://x402.org)
|
|
@@ -0,0 +1,63 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: agentic-engineering
|
|
3
|
+
description: Operate as an agentic engineer using eval-first execution, decomposition, and cost-aware model routing.
|
|
4
|
+
origin: ECC
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# Agentic Engineering
|
|
8
|
+
|
|
9
|
+
Use this skill for engineering workflows where AI agents perform most implementation work and humans enforce quality and risk controls.
|
|
10
|
+
|
|
11
|
+
## Operating Principles
|
|
12
|
+
|
|
13
|
+
1. Define completion criteria before execution.
|
|
14
|
+
2. Decompose work into agent-sized units.
|
|
15
|
+
3. Route model tiers by task complexity.
|
|
16
|
+
4. Measure with evals and regression checks.
|
|
17
|
+
|
|
18
|
+
## Eval-First Loop
|
|
19
|
+
|
|
20
|
+
1. Define capability eval and regression eval.
|
|
21
|
+
2. Run baseline and capture failure signatures.
|
|
22
|
+
3. Execute implementation.
|
|
23
|
+
4. Re-run evals and compare deltas.
|
|
24
|
+
|
|
25
|
+
## Task Decomposition
|
|
26
|
+
|
|
27
|
+
Apply the 15-minute unit rule:
|
|
28
|
+
- each unit should be independently verifiable
|
|
29
|
+
- each unit should have a single dominant risk
|
|
30
|
+
- each unit should expose a clear done condition
|
|
31
|
+
|
|
32
|
+
## Model Routing
|
|
33
|
+
|
|
34
|
+
- Haiku: classification, boilerplate transforms, narrow edits
|
|
35
|
+
- Sonnet: implementation and refactors
|
|
36
|
+
- Opus: architecture, root-cause analysis, multi-file invariants
|
|
37
|
+
|
|
38
|
+
## Session Strategy
|
|
39
|
+
|
|
40
|
+
- Continue session for closely-coupled units.
|
|
41
|
+
- Start fresh session after major phase transitions.
|
|
42
|
+
- Compact after milestone completion, not during active debugging.
|
|
43
|
+
|
|
44
|
+
## Review Focus for AI-Generated Code
|
|
45
|
+
|
|
46
|
+
Prioritize:
|
|
47
|
+
- invariants and edge cases
|
|
48
|
+
- error boundaries
|
|
49
|
+
- security and auth assumptions
|
|
50
|
+
- hidden coupling and rollout risk
|
|
51
|
+
|
|
52
|
+
Do not waste review cycles on style-only disagreements when automated format/lint already enforce style.
|
|
53
|
+
|
|
54
|
+
## Cost Discipline
|
|
55
|
+
|
|
56
|
+
Track per task:
|
|
57
|
+
- model
|
|
58
|
+
- token estimate
|
|
59
|
+
- retries
|
|
60
|
+
- wall-clock time
|
|
61
|
+
- success/failure
|
|
62
|
+
|
|
63
|
+
Escalate model tier only when lower tier fails with a clear reasoning gap.
|
|
@@ -0,0 +1,51 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: ai-first-engineering
|
|
3
|
+
description: Engineering operating model for teams where AI agents generate a large share of implementation output.
|
|
4
|
+
origin: ECC
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# AI-First Engineering
|
|
8
|
+
|
|
9
|
+
Use this skill when designing process, reviews, and architecture for teams shipping with AI-assisted code generation.
|
|
10
|
+
|
|
11
|
+
## Process Shifts
|
|
12
|
+
|
|
13
|
+
1. Planning quality matters more than typing speed.
|
|
14
|
+
2. Eval coverage matters more than anecdotal confidence.
|
|
15
|
+
3. Review focus shifts from syntax to system behavior.
|
|
16
|
+
|
|
17
|
+
## Architecture Requirements
|
|
18
|
+
|
|
19
|
+
Prefer architectures that are agent-friendly:
|
|
20
|
+
- explicit boundaries
|
|
21
|
+
- stable contracts
|
|
22
|
+
- typed interfaces
|
|
23
|
+
- deterministic tests
|
|
24
|
+
|
|
25
|
+
Avoid implicit behavior spread across hidden conventions.
|
|
26
|
+
|
|
27
|
+
## Code Review in AI-First Teams
|
|
28
|
+
|
|
29
|
+
Review for:
|
|
30
|
+
- behavior regressions
|
|
31
|
+
- security assumptions
|
|
32
|
+
- data integrity
|
|
33
|
+
- failure handling
|
|
34
|
+
- rollout safety
|
|
35
|
+
|
|
36
|
+
Minimize time spent on style issues already covered by automation.
|
|
37
|
+
|
|
38
|
+
## Hiring and Evaluation Signals
|
|
39
|
+
|
|
40
|
+
Strong AI-first engineers:
|
|
41
|
+
- decompose ambiguous work cleanly
|
|
42
|
+
- define measurable acceptance criteria
|
|
43
|
+
- produce high-signal prompts and evals
|
|
44
|
+
- enforce risk controls under delivery pressure
|
|
45
|
+
|
|
46
|
+
## Testing Standard
|
|
47
|
+
|
|
48
|
+
Raise testing bar for generated code:
|
|
49
|
+
- required regression coverage for touched domains
|
|
50
|
+
- explicit edge-case assertions
|
|
51
|
+
- integration checks for interface boundaries
|
|
@@ -0,0 +1,385 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: ai-regression-testing
|
|
3
|
+
description: Regression testing strategies for AI-assisted development. Sandbox-mode API testing without database dependencies, automated bug-check workflows, and patterns to catch AI blind spots where the same model writes and reviews code.
|
|
4
|
+
origin: ECC
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# AI Regression Testing
|
|
8
|
+
|
|
9
|
+
Testing patterns specifically designed for AI-assisted development, where the same model writes code and reviews it — creating systematic blind spots that only automated tests can catch.
|
|
10
|
+
|
|
11
|
+
## When to Activate
|
|
12
|
+
|
|
13
|
+
- AI agent (Claude Code, Cursor, Codex) has modified API routes or backend logic
|
|
14
|
+
- A bug was found and fixed — need to prevent re-introduction
|
|
15
|
+
- Project has a sandbox/mock mode that can be leveraged for DB-free testing
|
|
16
|
+
- Running `/bug-check` or similar review commands after code changes
|
|
17
|
+
- Multiple code paths exist (sandbox vs production, feature flags, etc.)
|
|
18
|
+
|
|
19
|
+
## The Core Problem
|
|
20
|
+
|
|
21
|
+
When an AI writes code and then reviews its own work, it carries the same assumptions into both steps. This creates a predictable failure pattern:
|
|
22
|
+
|
|
23
|
+
```
|
|
24
|
+
AI writes fix → AI reviews fix → AI says "looks correct" → Bug still exists
|
|
25
|
+
```
|
|
26
|
+
|
|
27
|
+
**Real-world example** (observed in production):
|
|
28
|
+
|
|
29
|
+
```
|
|
30
|
+
Fix 1: Added notification_settings to API response
|
|
31
|
+
→ Forgot to add it to the SELECT query
|
|
32
|
+
→ AI reviewed and missed it (same blind spot)
|
|
33
|
+
|
|
34
|
+
Fix 2: Added it to SELECT query
|
|
35
|
+
→ TypeScript build error (column not in generated types)
|
|
36
|
+
→ AI reviewed Fix 1 but didn't catch the SELECT issue
|
|
37
|
+
|
|
38
|
+
Fix 3: Changed to SELECT *
|
|
39
|
+
→ Fixed production path, forgot sandbox path
|
|
40
|
+
→ AI reviewed and missed it AGAIN (4th occurrence)
|
|
41
|
+
|
|
42
|
+
Fix 4: Test caught it instantly on first run PASS:
|
|
43
|
+
```
|
|
44
|
+
|
|
45
|
+
The pattern: **sandbox/production path inconsistency** is the #1 AI-introduced regression.
|
|
46
|
+
|
|
47
|
+
## Sandbox-Mode API Testing
|
|
48
|
+
|
|
49
|
+
Most projects with AI-friendly architecture have a sandbox/mock mode. This is the key to fast, DB-free API testing.
|
|
50
|
+
|
|
51
|
+
### Setup (Vitest + Next.js App Router)
|
|
52
|
+
|
|
53
|
+
```typescript
|
|
54
|
+
// vitest.config.ts
|
|
55
|
+
import { defineConfig } from "vitest/config";
|
|
56
|
+
import path from "path";
|
|
57
|
+
|
|
58
|
+
export default defineConfig({
|
|
59
|
+
test: {
|
|
60
|
+
environment: "node",
|
|
61
|
+
globals: true,
|
|
62
|
+
include: ["__tests__/**/*.test.ts"],
|
|
63
|
+
setupFiles: ["__tests__/setup.ts"],
|
|
64
|
+
},
|
|
65
|
+
resolve: {
|
|
66
|
+
alias: {
|
|
67
|
+
"@": path.resolve(__dirname, "."),
|
|
68
|
+
},
|
|
69
|
+
},
|
|
70
|
+
});
|
|
71
|
+
```
|
|
72
|
+
|
|
73
|
+
```typescript
|
|
74
|
+
// __tests__/setup.ts
|
|
75
|
+
// Force sandbox mode — no database needed
|
|
76
|
+
process.env.SANDBOX_MODE = "true";
|
|
77
|
+
process.env.NEXT_PUBLIC_SUPABASE_URL = "";
|
|
78
|
+
process.env.NEXT_PUBLIC_SUPABASE_ANON_KEY = "";
|
|
79
|
+
```
|
|
80
|
+
|
|
81
|
+
### Test Helper for Next.js API Routes
|
|
82
|
+
|
|
83
|
+
```typescript
|
|
84
|
+
// __tests__/helpers.ts
|
|
85
|
+
import { NextRequest } from "next/server";
|
|
86
|
+
|
|
87
|
+
export function createTestRequest(
|
|
88
|
+
url: string,
|
|
89
|
+
options?: {
|
|
90
|
+
method?: string;
|
|
91
|
+
body?: Record<string, unknown>;
|
|
92
|
+
headers?: Record<string, string>;
|
|
93
|
+
sandboxUserId?: string;
|
|
94
|
+
},
|
|
95
|
+
): NextRequest {
|
|
96
|
+
const { method = "GET", body, headers = {}, sandboxUserId } = options || {};
|
|
97
|
+
const fullUrl = url.startsWith("http") ? url : `http://localhost:3000${url}`;
|
|
98
|
+
const reqHeaders: Record<string, string> = { ...headers };
|
|
99
|
+
|
|
100
|
+
if (sandboxUserId) {
|
|
101
|
+
reqHeaders["x-sandbox-user-id"] = sandboxUserId;
|
|
102
|
+
}
|
|
103
|
+
|
|
104
|
+
const init: { method: string; headers: Record<string, string>; body?: string } = {
|
|
105
|
+
method,
|
|
106
|
+
headers: reqHeaders,
|
|
107
|
+
};
|
|
108
|
+
|
|
109
|
+
if (body) {
|
|
110
|
+
init.body = JSON.stringify(body);
|
|
111
|
+
reqHeaders["content-type"] = "application/json";
|
|
112
|
+
}
|
|
113
|
+
|
|
114
|
+
return new NextRequest(fullUrl, init);
|
|
115
|
+
}
|
|
116
|
+
|
|
117
|
+
export async function parseResponse(response: Response) {
|
|
118
|
+
const json = await response.json();
|
|
119
|
+
return { status: response.status, json };
|
|
120
|
+
}
|
|
121
|
+
```
|
|
122
|
+
|
|
123
|
+
### Writing Regression Tests
|
|
124
|
+
|
|
125
|
+
The key principle: **write tests for bugs that were found, not for code that works**.
|
|
126
|
+
|
|
127
|
+
```typescript
|
|
128
|
+
// __tests__/api/user/profile.test.ts
|
|
129
|
+
import { describe, it, expect } from "vitest";
|
|
130
|
+
import { createTestRequest, parseResponse } from "../../helpers";
|
|
131
|
+
import { GET, PATCH } from "@/app/api/user/profile/route";
|
|
132
|
+
|
|
133
|
+
// Define the contract — what fields MUST be in the response
|
|
134
|
+
const REQUIRED_FIELDS = [
|
|
135
|
+
"id",
|
|
136
|
+
"email",
|
|
137
|
+
"full_name",
|
|
138
|
+
"phone",
|
|
139
|
+
"role",
|
|
140
|
+
"created_at",
|
|
141
|
+
"avatar_url",
|
|
142
|
+
"notification_settings", // ← Added after bug found it missing
|
|
143
|
+
];
|
|
144
|
+
|
|
145
|
+
describe("GET /api/user/profile", () => {
|
|
146
|
+
it("returns all required fields", async () => {
|
|
147
|
+
const req = createTestRequest("/api/user/profile");
|
|
148
|
+
const res = await GET(req);
|
|
149
|
+
const { status, json } = await parseResponse(res);
|
|
150
|
+
|
|
151
|
+
expect(status).toBe(200);
|
|
152
|
+
for (const field of REQUIRED_FIELDS) {
|
|
153
|
+
expect(json.data).toHaveProperty(field);
|
|
154
|
+
}
|
|
155
|
+
});
|
|
156
|
+
|
|
157
|
+
// Regression test — this exact bug was introduced by AI 4 times
|
|
158
|
+
it("notification_settings is not undefined (BUG-R1 regression)", async () => {
|
|
159
|
+
const req = createTestRequest("/api/user/profile");
|
|
160
|
+
const res = await GET(req);
|
|
161
|
+
const { json } = await parseResponse(res);
|
|
162
|
+
|
|
163
|
+
expect("notification_settings" in json.data).toBe(true);
|
|
164
|
+
const ns = json.data.notification_settings;
|
|
165
|
+
expect(ns === null || typeof ns === "object").toBe(true);
|
|
166
|
+
});
|
|
167
|
+
});
|
|
168
|
+
```
|
|
169
|
+
|
|
170
|
+
### Testing Sandbox/Production Parity
|
|
171
|
+
|
|
172
|
+
The most common AI regression: fixing production path but forgetting sandbox path (or vice versa).
|
|
173
|
+
|
|
174
|
+
```typescript
|
|
175
|
+
// Test that sandbox responses match the expected contract
|
|
176
|
+
describe("GET /api/user/messages (conversation list)", () => {
|
|
177
|
+
it("includes partner_name in sandbox mode", async () => {
|
|
178
|
+
const req = createTestRequest("/api/user/messages", {
|
|
179
|
+
sandboxUserId: "user-001",
|
|
180
|
+
});
|
|
181
|
+
const res = await GET(req);
|
|
182
|
+
const { json } = await parseResponse(res);
|
|
183
|
+
|
|
184
|
+
// This caught a bug where partner_name was added
|
|
185
|
+
// to production path but not sandbox path
|
|
186
|
+
if (json.data.length > 0) {
|
|
187
|
+
for (const conv of json.data) {
|
|
188
|
+
expect("partner_name" in conv).toBe(true);
|
|
189
|
+
}
|
|
190
|
+
}
|
|
191
|
+
});
|
|
192
|
+
});
|
|
193
|
+
```
|
|
194
|
+
|
|
195
|
+
## Integrating Tests into Bug-Check Workflow
|
|
196
|
+
|
|
197
|
+
### Custom Command Definition
|
|
198
|
+
|
|
199
|
+
```markdown
|
|
200
|
+
<!-- .claude/commands/bug-check.md -->
|
|
201
|
+
# Bug Check
|
|
202
|
+
|
|
203
|
+
## Step 1: Automated Tests (mandatory, cannot skip)
|
|
204
|
+
|
|
205
|
+
Run these commands FIRST before any code review:
|
|
206
|
+
|
|
207
|
+
npm run test # Vitest test suite
|
|
208
|
+
npm run build # TypeScript type check + build
|
|
209
|
+
|
|
210
|
+
- If tests fail → report as highest priority bug
|
|
211
|
+
- If build fails → report type errors as highest priority
|
|
212
|
+
- Only proceed to Step 2 if both pass
|
|
213
|
+
|
|
214
|
+
## Step 2: Code Review (AI review)
|
|
215
|
+
|
|
216
|
+
1. Sandbox / production path consistency
|
|
217
|
+
2. API response shape matches frontend expectations
|
|
218
|
+
3. SELECT clause completeness
|
|
219
|
+
4. Error handling with rollback
|
|
220
|
+
5. Optimistic update race conditions
|
|
221
|
+
|
|
222
|
+
## Step 3: For each bug fixed, propose a regression test
|
|
223
|
+
```
|
|
224
|
+
|
|
225
|
+
### The Workflow
|
|
226
|
+
|
|
227
|
+
```
|
|
228
|
+
User: "バグチェックして" (or "/bug-check")
|
|
229
|
+
│
|
|
230
|
+
├─ Step 1: npm run test
|
|
231
|
+
│ ├─ FAIL → Bug found mechanically (no AI judgment needed)
|
|
232
|
+
│ └─ PASS → Continue
|
|
233
|
+
│
|
|
234
|
+
├─ Step 2: npm run build
|
|
235
|
+
│ ├─ FAIL → Type error found mechanically
|
|
236
|
+
│ └─ PASS → Continue
|
|
237
|
+
│
|
|
238
|
+
├─ Step 3: AI code review (with known blind spots in mind)
|
|
239
|
+
│ └─ Findings reported
|
|
240
|
+
│
|
|
241
|
+
└─ Step 4: For each fix, write a regression test
|
|
242
|
+
└─ Next bug-check catches if fix breaks
|
|
243
|
+
```
|
|
244
|
+
|
|
245
|
+
## Common AI Regression Patterns
|
|
246
|
+
|
|
247
|
+
### Pattern 1: Sandbox/Production Path Mismatch
|
|
248
|
+
|
|
249
|
+
**Frequency**: Most common (observed in 3 out of 4 regressions)
|
|
250
|
+
|
|
251
|
+
```typescript
|
|
252
|
+
// FAIL: AI adds field to production path only
|
|
253
|
+
if (isSandboxMode()) {
|
|
254
|
+
return { data: { id, email, name } }; // Missing new field
|
|
255
|
+
}
|
|
256
|
+
// Production path
|
|
257
|
+
return { data: { id, email, name, notification_settings } };
|
|
258
|
+
|
|
259
|
+
// PASS: Both paths must return the same shape
|
|
260
|
+
if (isSandboxMode()) {
|
|
261
|
+
return { data: { id, email, name, notification_settings: null } };
|
|
262
|
+
}
|
|
263
|
+
return { data: { id, email, name, notification_settings } };
|
|
264
|
+
```
|
|
265
|
+
|
|
266
|
+
**Test to catch it**:
|
|
267
|
+
|
|
268
|
+
```typescript
|
|
269
|
+
it("sandbox and production return same fields", async () => {
|
|
270
|
+
// In test env, sandbox mode is forced ON
|
|
271
|
+
const res = await GET(createTestRequest("/api/user/profile"));
|
|
272
|
+
const { json } = await parseResponse(res);
|
|
273
|
+
|
|
274
|
+
for (const field of REQUIRED_FIELDS) {
|
|
275
|
+
expect(json.data).toHaveProperty(field);
|
|
276
|
+
}
|
|
277
|
+
});
|
|
278
|
+
```
|
|
279
|
+
|
|
280
|
+
### Pattern 2: SELECT Clause Omission
|
|
281
|
+
|
|
282
|
+
**Frequency**: Common with Supabase/Prisma when adding new columns
|
|
283
|
+
|
|
284
|
+
```typescript
|
|
285
|
+
// FAIL: New column added to response but not to SELECT
|
|
286
|
+
const { data } = await supabase
|
|
287
|
+
.from("users")
|
|
288
|
+
.select("id, email, name") // notification_settings not here
|
|
289
|
+
.single();
|
|
290
|
+
|
|
291
|
+
return { data: { ...data, notification_settings: data.notification_settings } };
|
|
292
|
+
// → notification_settings is always undefined
|
|
293
|
+
|
|
294
|
+
// PASS: Use SELECT * or explicitly include new columns
|
|
295
|
+
const { data } = await supabase
|
|
296
|
+
.from("users")
|
|
297
|
+
.select("*")
|
|
298
|
+
.single();
|
|
299
|
+
```
|
|
300
|
+
|
|
301
|
+
### Pattern 3: Error State Leakage
|
|
302
|
+
|
|
303
|
+
**Frequency**: Moderate — when adding error handling to existing components
|
|
304
|
+
|
|
305
|
+
```typescript
|
|
306
|
+
// FAIL: Error state set but old data not cleared
|
|
307
|
+
catch (err) {
|
|
308
|
+
setError("Failed to load");
|
|
309
|
+
// reservations still shows data from previous tab!
|
|
310
|
+
}
|
|
311
|
+
|
|
312
|
+
// PASS: Clear related state on error
|
|
313
|
+
catch (err) {
|
|
314
|
+
setReservations([]); // Clear stale data
|
|
315
|
+
setError("Failed to load");
|
|
316
|
+
}
|
|
317
|
+
```
|
|
318
|
+
|
|
319
|
+
### Pattern 4: Optimistic Update Without Proper Rollback
|
|
320
|
+
|
|
321
|
+
```typescript
|
|
322
|
+
// FAIL: No rollback on failure
|
|
323
|
+
const handleRemove = async (id: string) => {
|
|
324
|
+
setItems(prev => prev.filter(i => i.id !== id));
|
|
325
|
+
await fetch(`/api/items/${id}`, { method: "DELETE" });
|
|
326
|
+
// If API fails, item is gone from UI but still in DB
|
|
327
|
+
};
|
|
328
|
+
|
|
329
|
+
// PASS: Capture previous state and rollback on failure
|
|
330
|
+
const handleRemove = async (id: string) => {
|
|
331
|
+
const prevItems = [...items];
|
|
332
|
+
setItems(prev => prev.filter(i => i.id !== id));
|
|
333
|
+
try {
|
|
334
|
+
const res = await fetch(`/api/items/${id}`, { method: "DELETE" });
|
|
335
|
+
if (!res.ok) throw new Error("API error");
|
|
336
|
+
} catch {
|
|
337
|
+
setItems(prevItems); // Rollback
|
|
338
|
+
alert("削除に失敗しました");
|
|
339
|
+
}
|
|
340
|
+
};
|
|
341
|
+
```
|
|
342
|
+
|
|
343
|
+
## Strategy: Test Where Bugs Were Found
|
|
344
|
+
|
|
345
|
+
Don't aim for 100% coverage. Instead:
|
|
346
|
+
|
|
347
|
+
```
|
|
348
|
+
Bug found in /api/user/profile → Write test for profile API
|
|
349
|
+
Bug found in /api/user/messages → Write test for messages API
|
|
350
|
+
Bug found in /api/user/favorites → Write test for favorites API
|
|
351
|
+
No bug in /api/user/notifications → Don't write test (yet)
|
|
352
|
+
```
|
|
353
|
+
|
|
354
|
+
**Why this works with AI development:**
|
|
355
|
+
|
|
356
|
+
1. AI tends to make the **same category of mistake** repeatedly
|
|
357
|
+
2. Bugs cluster in complex areas (auth, multi-path logic, state management)
|
|
358
|
+
3. Once tested, that exact regression **cannot happen again**
|
|
359
|
+
4. Test count grows organically with bug fixes — no wasted effort
|
|
360
|
+
|
|
361
|
+
## Quick Reference
|
|
362
|
+
|
|
363
|
+
| AI Regression Pattern | Test Strategy | Priority |
|
|
364
|
+
|---|---|---|
|
|
365
|
+
| Sandbox/production mismatch | Assert same response shape in sandbox mode | High |
|
|
366
|
+
| SELECT clause omission | Assert all required fields in response | High |
|
|
367
|
+
| Error state leakage | Assert state cleanup on error | Medium |
|
|
368
|
+
| Missing rollback | Assert state restored on API failure | Medium |
|
|
369
|
+
| Type cast masking null | Assert field is not undefined | Medium |
|
|
370
|
+
|
|
371
|
+
## DO / DON'T
|
|
372
|
+
|
|
373
|
+
**DO:**
|
|
374
|
+
- Write tests immediately after finding a bug (before fixing it if possible)
|
|
375
|
+
- Test the API response shape, not the implementation
|
|
376
|
+
- Run tests as the first step of every bug-check
|
|
377
|
+
- Keep tests fast (< 1 second total with sandbox mode)
|
|
378
|
+
- Name tests after the bug they prevent (e.g., "BUG-R1 regression")
|
|
379
|
+
|
|
380
|
+
**DON'T:**
|
|
381
|
+
- Write tests for code that has never had a bug
|
|
382
|
+
- Trust AI self-review as a substitute for automated tests
|
|
383
|
+
- Skip sandbox path testing because "it's just mock data"
|
|
384
|
+
- Write integration tests when unit tests suffice
|
|
385
|
+
- Aim for coverage percentage — aim for regression prevention
|