claude-code-pilot 2.0.0 → 3.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +76 -97
- package/bin/install.js +267 -250
- package/manifest.json +5 -18
- package/package.json +5 -7
- package/src/agents/build-error-resolver.md +114 -0
- package/src/agents/ccp-advisor-researcher.md +104 -0
- package/src/agents/ccp-assumptions-analyzer.md +105 -0
- package/{gsd/agents/gsd-codebase-mapper.md → src/agents/ccp-codebase-mapper.md} +7 -7
- package/{gsd/agents/gsd-debugger.md → src/agents/ccp-debugger.md} +125 -8
- package/{gsd/agents/gsd-executor.md → src/agents/ccp-executor.md} +31 -20
- package/{gsd/agents/gsd-integration-checker.md → src/agents/ccp-integration-checker.md} +2 -2
- package/{gsd/agents/gsd-nyquist-auditor.md → src/agents/ccp-nyquist-auditor.md} +3 -3
- package/{gsd/agents/gsd-phase-researcher.md → src/agents/ccp-phase-researcher.md} +127 -13
- package/{gsd/agents/gsd-plan-checker.md → src/agents/ccp-plan-checker.md} +57 -21
- package/{gsd/agents/gsd-planner.md → src/agents/ccp-planner.md} +61 -23
- package/{gsd/agents/gsd-project-researcher.md → src/agents/ccp-project-researcher.md} +33 -6
- package/{gsd/agents/gsd-research-synthesizer.md → src/agents/ccp-research-synthesizer.md} +11 -11
- package/{gsd/agents/gsd-roadmapper.md → src/agents/ccp-roadmapper.md} +39 -10
- package/src/agents/ccp-ui-auditor.md +439 -0
- package/src/agents/ccp-ui-checker.md +300 -0
- package/src/agents/ccp-ui-researcher.md +357 -0
- package/{gsd/agents/gsd-verifier.md → src/agents/ccp-verifier.md} +81 -15
- package/src/agents/cpp-build-resolver.md +90 -0
- package/src/agents/cpp-reviewer.md +72 -0
- package/src/agents/database-reviewer.md +91 -0
- package/{ecc → src}/agents/doc-updater.md +1 -1
- package/src/agents/docs-lookup.md +68 -0
- package/src/agents/flutter-reviewer.md +243 -0
- package/src/agents/gan-evaluator.md +209 -0
- package/src/agents/gan-generator.md +131 -0
- package/src/agents/gan-planner.md +99 -0
- package/src/agents/go-build-resolver.md +94 -0
- package/src/agents/go-reviewer.md +76 -0
- package/src/agents/harness-optimizer.md +35 -0
- package/src/agents/java-build-resolver.md +153 -0
- package/src/agents/java-reviewer.md +92 -0
- package/src/agents/kotlin-build-resolver.md +118 -0
- package/src/agents/kotlin-reviewer.md +159 -0
- package/src/agents/loop-operator.md +36 -0
- package/src/agents/opensource-forker.md +198 -0
- package/src/agents/opensource-packager.md +249 -0
- package/src/agents/opensource-sanitizer.md +188 -0
- package/src/agents/performance-optimizer.md +446 -0
- package/src/agents/planner.md +212 -0
- package/src/agents/python-reviewer.md +98 -0
- package/src/agents/pytorch-build-resolver.md +120 -0
- package/src/agents/refactor-cleaner.md +85 -0
- package/src/agents/rust-build-resolver.md +148 -0
- package/src/agents/rust-reviewer.md +94 -0
- package/src/agents/typescript-reviewer.md +112 -0
- package/src/available-rules/README.md +80 -0
- package/src/available-rules/cpp/coding-style.md +44 -0
- package/src/available-rules/cpp/hooks.md +39 -0
- package/src/available-rules/cpp/patterns.md +51 -0
- package/src/available-rules/cpp/security.md +51 -0
- package/src/available-rules/cpp/testing.md +44 -0
- package/src/available-rules/csharp/coding-style.md +72 -0
- package/src/available-rules/csharp/hooks.md +25 -0
- package/src/available-rules/csharp/patterns.md +50 -0
- package/src/available-rules/csharp/security.md +58 -0
- package/src/available-rules/csharp/testing.md +46 -0
- package/src/available-rules/java/coding-style.md +114 -0
- package/src/available-rules/java/hooks.md +18 -0
- package/src/available-rules/java/patterns.md +146 -0
- package/src/available-rules/java/security.md +100 -0
- package/src/available-rules/java/testing.md +131 -0
- package/src/available-rules/kotlin/hooks.md +17 -0
- package/src/available-rules/rust/coding-style.md +151 -0
- package/src/available-rules/rust/hooks.md +16 -0
- package/src/available-rules/rust/patterns.md +168 -0
- package/src/available-rules/rust/security.md +141 -0
- package/src/available-rules/rust/testing.md +154 -0
- package/src/commands/ccp/add-backlog.md +76 -0
- package/{gsd/commands-gsd → src/commands/ccp}/add-phase.md +3 -3
- package/{gsd/commands-gsd → src/commands/ccp}/add-tests.md +5 -5
- package/{gsd/commands-gsd → src/commands/ccp}/add-todo.md +4 -4
- package/src/commands/ccp/aside.md +165 -0
- package/{gsd/commands-gsd → src/commands/ccp}/audit-milestone.md +3 -3
- package/src/commands/ccp/audit-uat.md +24 -0
- package/src/commands/ccp/autonomous.md +41 -0
- package/src/commands/ccp/build-fix.md +67 -0
- package/{gsd/commands-gsd → src/commands/ccp}/check-todos.md +3 -3
- package/{ecc/commands → src/commands/ccp}/checkpoint.md +12 -7
- package/{gsd/commands-gsd → src/commands/ccp}/cleanup.md +3 -3
- package/src/commands/ccp/code-review.md +45 -0
- package/{gsd/commands-gsd → src/commands/ccp}/complete-milestone.md +9 -9
- package/src/commands/ccp/context-budget.md +30 -0
- package/src/commands/ccp/cpp-build.md +174 -0
- package/src/commands/ccp/cpp-review.md +133 -0
- package/src/commands/ccp/cpp-test.md +252 -0
- package/{gsd/commands-gsd → src/commands/ccp}/debug.md +14 -9
- package/src/commands/ccp/discuss-phase.md +64 -0
- package/src/commands/ccp/do.md +30 -0
- package/src/commands/ccp/docs-update.md +48 -0
- package/src/commands/ccp/docs.md +32 -0
- package/src/commands/ccp/e2e.md +365 -0
- package/src/commands/ccp/eval.md +125 -0
- package/{ecc/commands → src/commands/ccp}/evolve.md +5 -5
- package/src/commands/ccp/execute-phase.md +59 -0
- package/src/commands/ccp/fast.md +30 -0
- package/src/commands/ccp/forensics.md +56 -0
- package/src/commands/ccp/go-build.md +184 -0
- package/src/commands/ccp/go-review.md +149 -0
- package/src/commands/ccp/go-test.md +269 -0
- package/src/commands/ccp/gradle-build.md +71 -0
- package/src/commands/ccp/harness-audit.md +76 -0
- package/{gsd/commands-gsd → src/commands/ccp}/health.md +3 -3
- package/{gsd/commands-gsd → src/commands/ccp}/help.md +5 -5
- package/{gsd/commands-gsd → src/commands/ccp}/insert-phase.md +3 -3
- package/src/commands/ccp/kotlin-build.md +175 -0
- package/src/commands/ccp/kotlin-review.md +141 -0
- package/src/commands/ccp/kotlin-test.md +313 -0
- package/{ecc/commands → src/commands/ccp}/learn.md +7 -2
- package/{gsd/commands-gsd → src/commands/ccp}/list-phase-assumptions.md +2 -2
- package/src/commands/ccp/manager.md +39 -0
- package/{gsd/commands-gsd → src/commands/ccp}/map-codebase.md +7 -7
- package/src/commands/ccp/milestone-summary.md +51 -0
- package/{ecc/commands → src/commands/ccp}/model-route.md +6 -1
- package/{gsd/commands-gsd → src/commands/ccp}/new-milestone.md +8 -8
- package/{gsd/commands-gsd → src/commands/ccp}/new-project.md +8 -8
- package/src/commands/ccp/next.md +24 -0
- package/src/commands/ccp/note.md +34 -0
- package/src/commands/ccp/orchestrate.md +232 -0
- package/{gsd/commands-gsd → src/commands/ccp}/pause-work.md +3 -3
- package/{gsd/commands-gsd → src/commands/ccp}/plan-milestone-gaps.md +5 -5
- package/{gsd/commands-gsd → src/commands/ccp}/plan-phase.md +9 -7
- package/src/commands/ccp/plan.md +115 -0
- package/src/commands/ccp/plant-seed.md +28 -0
- package/src/commands/ccp/pr-branch.md +25 -0
- package/src/commands/ccp/profile-user.md +46 -0
- package/{gsd/commands-gsd → src/commands/ccp}/progress.md +3 -3
- package/src/commands/ccp/prompt-optimize.md +39 -0
- package/src/commands/ccp/prune.md +25 -0
- package/src/commands/ccp/python-review.md +298 -0
- package/{ecc/commands → src/commands/ccp}/quality-gate.md +7 -2
- package/{gsd/commands-gsd → src/commands/ccp}/quick.md +10 -8
- package/src/commands/ccp/refactor-clean.md +85 -0
- package/{gsd/commands-gsd → src/commands/ccp}/remove-phase.md +3 -3
- package/{gsd/commands-gsd → src/commands/ccp}/research-phase.md +17 -12
- package/{ecc/commands → src/commands/ccp}/resume-session.md +9 -8
- package/{gsd/commands-gsd → src/commands/ccp}/resume-work.md +3 -3
- package/src/commands/ccp/review-backlog.md +61 -0
- package/src/commands/ccp/review.md +37 -0
- package/src/commands/ccp/rules-distill.md +12 -0
- package/src/commands/ccp/rust-build.md +188 -0
- package/src/commands/ccp/rust-review.md +143 -0
- package/src/commands/ccp/rust-test.md +309 -0
- package/{ecc/commands → src/commands/ccp}/save-session.md +2 -1
- package/src/commands/ccp/secure-phase.md +35 -0
- package/src/commands/ccp/session-report.md +19 -0
- package/{ecc/commands → src/commands/ccp}/sessions.md +39 -34
- package/src/commands/ccp/set-profile.md +12 -0
- package/{gsd/commands-gsd → src/commands/ccp}/settings.md +5 -5
- package/src/commands/ccp/setup-pm.md +81 -0
- package/{kit/commands → src/commands/ccp}/setup-refresh.md +4 -3
- package/{kit/commands → src/commands/ccp}/setup.md +67 -40
- package/src/commands/ccp/ship.md +23 -0
- package/src/commands/ccp/skill-create.md +172 -0
- package/src/commands/ccp/skill-health.md +51 -0
- package/src/commands/ccp/stats.md +18 -0
- package/src/commands/ccp/tdd.md +329 -0
- package/src/commands/ccp/test-coverage.md +74 -0
- package/src/commands/ccp/thread.md +127 -0
- package/{kit/commands → src/commands/ccp}/tool-guide.md +2 -1
- package/src/commands/ccp/ui-phase.md +34 -0
- package/src/commands/ccp/ui-review.md +32 -0
- package/src/commands/ccp/update-codemaps.md +77 -0
- package/src/commands/ccp/update-docs.md +89 -0
- package/{gsd/commands-gsd → src/commands/ccp}/update.md +5 -5
- package/{gsd/commands-gsd → src/commands/ccp}/validate-phase.md +3 -3
- package/{gsd/commands-gsd → src/commands/ccp}/verify-work.md +5 -5
- package/{ecc/commands → src/commands/ccp}/verify.md +5 -0
- package/src/commands/ccp/workstreams.md +68 -0
- package/{ecc → src}/examples/CLAUDE.md +4 -4
- package/{ecc → src}/examples/django-api-CLAUDE.md +5 -5
- package/{ecc → src}/examples/go-microservice-CLAUDE.md +6 -6
- package/{ecc → src}/examples/rust-api-CLAUDE.md +4 -4
- package/{ecc → src}/examples/saas-nextjs-CLAUDE.md +8 -8
- package/{gsd/hooks/gsd-context-monitor.js → src/hooks/ccp-context-monitor.js} +3 -3
- package/src/hooks/ccp-prompt-guard.js +96 -0
- package/{gsd/hooks/gsd-statusline.js → src/hooks/ccp-statusline.js} +7 -7
- package/src/hooks/ccp-workflow-guard.js +94 -0
- package/src/hooks/config-protection.js +141 -0
- package/{kit → src}/hooks/kit-check-update.js +7 -4
- package/src/hooks/mcp-health-check.js +620 -0
- package/{ecc/scripts → src}/hooks/run-with-flags-shell.sh +1 -1
- package/{ecc/scripts → src}/hooks/run-with-flags.js +74 -13
- package/src/hooks/session-end-marker.js +29 -0
- package/{ecc/scripts → src}/hooks/session-end.js +83 -40
- package/{ecc/scripts → src}/hooks/session-start.js +76 -10
- package/{ecc/scripts → src}/lib/hook-flags.js +8 -4
- package/{ecc/scripts → src}/lib/project-detect.js +2 -1
- package/{ecc/scripts → src}/lib/session-manager.d.ts +5 -1
- package/{ecc/scripts → src}/lib/session-manager.js +202 -92
- package/{ecc/scripts → src}/lib/utils.d.ts +23 -1
- package/{ecc/scripts → src}/lib/utils.js +91 -3
- package/{gsd/get-shit-done/bin/gsd-tools.cjs → src/pilot/bin/ccp-tools.cjs} +257 -86
- package/{gsd/get-shit-done → src/pilot}/bin/lib/commands.cjs +1 -1
- package/src/pilot/bin/lib/config.cjs +444 -0
- package/src/pilot/bin/lib/core.cjs +1190 -0
- package/src/pilot/bin/lib/init.cjs +1281 -0
- package/src/pilot/bin/lib/model-profiles.cjs +67 -0
- package/{gsd/get-shit-done → src/pilot}/bin/lib/phase.cjs +2 -2
- package/src/pilot/bin/lib/security.cjs +382 -0
- package/{gsd/get-shit-done → src/pilot}/bin/lib/state.cjs +1 -1
- package/src/pilot/bin/lib/uat.cjs +282 -0
- package/{gsd/get-shit-done → src/pilot}/bin/lib/verify.cjs +10 -10
- package/{gsd/get-shit-done → src/pilot}/references/continuation-format.md +16 -16
- package/{gsd/get-shit-done → src/pilot}/references/decimal-phase-calculation.md +5 -5
- package/{gsd/get-shit-done → src/pilot}/references/git-integration.md +5 -5
- package/{gsd/get-shit-done → src/pilot}/references/git-planning-commit.md +4 -4
- package/src/pilot/references/mcp-servers.json +153 -0
- package/{gsd/get-shit-done → src/pilot}/references/model-profile-resolution.md +2 -2
- package/{gsd/get-shit-done → src/pilot}/references/model-profiles.md +20 -20
- package/{gsd/get-shit-done → src/pilot}/references/phase-argument-parsing.md +4 -4
- package/{gsd/get-shit-done → src/pilot}/references/planning-config.md +15 -15
- package/{gsd/get-shit-done → src/pilot}/references/ui-brand.md +5 -5
- package/{gsd/get-shit-done → src/pilot}/references/verification-patterns.md +1 -1
- package/{gsd/get-shit-done → src/pilot}/templates/DEBUG.md +1 -1
- package/{gsd/get-shit-done → src/pilot}/templates/UAT.md +3 -3
- package/src/pilot/templates/UI-SPEC.md +100 -0
- package/{gsd/get-shit-done → src/pilot}/templates/VALIDATION.md +1 -1
- package/src/pilot/templates/claude-md.md +122 -0
- package/{gsd/get-shit-done → src/pilot}/templates/codebase/architecture.md +2 -2
- package/{gsd/get-shit-done → src/pilot}/templates/codebase/structure.md +13 -13
- package/{gsd/get-shit-done → src/pilot}/templates/context.md +4 -4
- package/src/pilot/templates/copilot-instructions.md +7 -0
- package/{gsd/get-shit-done → src/pilot}/templates/debug-subagent-prompt.md +4 -4
- package/src/pilot/templates/dev-preferences.md +21 -0
- package/{gsd/get-shit-done → src/pilot}/templates/discovery.md +2 -2
- package/src/pilot/templates/discussion-log.md +63 -0
- package/{gsd/get-shit-done → src/pilot}/templates/phase-prompt.md +12 -12
- package/{gsd/get-shit-done → src/pilot}/templates/planner-subagent-prompt.md +7 -7
- package/{gsd/get-shit-done → src/pilot}/templates/project.md +1 -1
- package/{gsd/get-shit-done → src/pilot}/templates/research.md +2 -2
- package/{gsd/get-shit-done → src/pilot}/templates/state.md +2 -2
- package/{gsd/get-shit-done → src/pilot}/templates/summary-complex.md +1 -1
- package/{gsd/get-shit-done → src/pilot}/workflows/add-phase.md +11 -11
- package/{gsd/get-shit-done → src/pilot}/workflows/add-tests.md +15 -15
- package/{gsd/get-shit-done → src/pilot}/workflows/add-todo.md +7 -7
- package/{gsd/get-shit-done → src/pilot}/workflows/audit-milestone.md +24 -16
- package/src/pilot/workflows/audit-uat.md +109 -0
- package/src/pilot/workflows/autonomous.md +891 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/check-todos.md +10 -10
- package/{gsd/get-shit-done → src/pilot}/workflows/cleanup.md +3 -3
- package/{gsd/get-shit-done → src/pilot}/workflows/complete-milestone.md +19 -16
- package/{gsd/get-shit-done → src/pilot}/workflows/diagnose-issues.md +9 -4
- package/{gsd/get-shit-done → src/pilot}/workflows/discovery-phase.md +8 -8
- package/src/pilot/workflows/discuss-phase-assumptions.md +653 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/discuss-phase.md +407 -49
- package/src/pilot/workflows/do.md +104 -0
- package/src/pilot/workflows/docs-update.md +1165 -0
- package/src/pilot/workflows/execute-phase.md +821 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/execute-plan.md +79 -28
- package/src/pilot/workflows/fast.md +105 -0
- package/src/pilot/workflows/forensics.md +265 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/health.md +34 -11
- package/src/pilot/workflows/help.md +767 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/insert-phase.md +10 -10
- package/{gsd/get-shit-done → src/pilot}/workflows/list-phase-assumptions.md +4 -4
- package/src/pilot/workflows/manager.md +362 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/map-codebase.md +27 -17
- package/src/pilot/workflows/milestone-summary.md +223 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/new-milestone.md +135 -33
- package/{gsd/get-shit-done → src/pilot}/workflows/new-project.md +152 -79
- package/src/pilot/workflows/next.md +97 -0
- package/src/pilot/workflows/node-repair.md +92 -0
- package/src/pilot/workflows/note.md +156 -0
- package/src/pilot/workflows/pause-work.md +177 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/plan-milestone-gaps.md +10 -11
- package/src/pilot/workflows/plan-phase.md +859 -0
- package/src/pilot/workflows/plant-seed.md +169 -0
- package/src/pilot/workflows/pr-branch.md +129 -0
- package/src/pilot/workflows/profile-user.md +452 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/progress.md +95 -34
- package/{gsd/get-shit-done → src/pilot}/workflows/quick.md +33 -21
- package/{gsd/get-shit-done → src/pilot}/workflows/remove-phase.md +14 -14
- package/{gsd/get-shit-done → src/pilot}/workflows/research-phase.md +18 -10
- package/{gsd/get-shit-done → src/pilot}/workflows/resume-project.md +37 -18
- package/src/pilot/workflows/review.md +244 -0
- package/src/pilot/workflows/secure-phase.md +164 -0
- package/src/pilot/workflows/session-report.md +146 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/set-profile.md +7 -7
- package/{gsd/get-shit-done → src/pilot}/workflows/settings.md +75 -22
- package/src/pilot/workflows/ship.md +228 -0
- package/src/pilot/workflows/stats.md +60 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/transition.md +57 -17
- package/src/pilot/workflows/ui-phase.md +302 -0
- package/src/pilot/workflows/ui-review.md +165 -0
- package/{gsd/get-shit-done → src/pilot}/workflows/update.md +88 -58
- package/{gsd/get-shit-done → src/pilot}/workflows/validate-phase.md +24 -17
- package/{gsd/get-shit-done → src/pilot}/workflows/verify-phase.md +26 -15
- package/{gsd/get-shit-done → src/pilot}/workflows/verify-work.md +89 -37
- package/{ecc → src}/rules/common/agents.md +1 -0
- package/src/rules/common/code-review.md +124 -0
- package/{ecc → src}/rules/common/coding-style.md +21 -0
- package/src/rules/zh/README.md +108 -0
- package/src/rules/zh/agents.md +50 -0
- package/src/rules/zh/code-review.md +124 -0
- package/src/rules/zh/coding-style.md +48 -0
- package/src/rules/zh/development-workflow.md +44 -0
- package/src/rules/zh/git-workflow.md +24 -0
- package/src/rules/zh/hooks.md +30 -0
- package/src/rules/zh/patterns.md +31 -0
- package/src/rules/zh/performance.md +55 -0
- package/src/rules/zh/security.md +29 -0
- package/src/rules/zh/testing.md +29 -0
- package/src/skills/agentic-engineering/SKILL.md +63 -0
- package/src/skills/ai-first-engineering/SKILL.md +51 -0
- package/src/skills/ai-regression-testing/SKILL.md +385 -0
- package/src/skills/api-design/SKILL.md +523 -0
- package/src/skills/architecture-decision-records/SKILL.md +179 -0
- package/src/skills/autonomous-agent-harness/SKILL.md +267 -0
- package/src/skills/autonomous-loops/SKILL.md +610 -0
- package/src/skills/backend-patterns/SKILL.md +598 -0
- package/src/skills/benchmark/SKILL.md +87 -0
- package/src/skills/blueprint/SKILL.md +90 -0
- package/src/skills/browser-qa/SKILL.md +81 -0
- package/src/skills/bun-runtime/SKILL.md +84 -0
- package/src/skills/claude-api/SKILL.md +337 -0
- package/src/skills/codebase-onboarding/SKILL.md +233 -0
- package/src/skills/coding-standards/SKILL.md +530 -0
- package/src/skills/content-hash-cache-pattern/SKILL.md +161 -0
- package/src/skills/context-budget/SKILL.md +135 -0
- package/{ecc → src}/skills/continuous-learning-v2/SKILL.md +6 -6
- package/{ecc → src}/skills/continuous-learning-v2/agents/observer-loop.sh +1 -1
- package/{ecc → src}/skills/continuous-learning-v2/agents/observer.md +1 -1
- package/src/skills/cost-aware-llm-pipeline/SKILL.md +183 -0
- package/src/skills/cpp-coding-standards/SKILL.md +723 -0
- package/src/skills/cpp-testing/SKILL.md +324 -0
- package/src/skills/database-migrations/SKILL.md +429 -0
- package/src/skills/deep-research/SKILL.md +155 -0
- package/src/skills/deployment-patterns/SKILL.md +427 -0
- package/src/skills/design-system/SKILL.md +82 -0
- package/src/skills/django-patterns/SKILL.md +734 -0
- package/src/skills/django-security/SKILL.md +593 -0
- package/src/skills/django-tdd/SKILL.md +729 -0
- package/src/skills/django-verification/SKILL.md +469 -0
- package/src/skills/docker-patterns/SKILL.md +364 -0
- package/src/skills/documentation-lookup/SKILL.md +90 -0
- package/src/skills/e2e-testing/SKILL.md +326 -0
- package/src/skills/eval-harness/SKILL.md +270 -0
- package/src/skills/exa-search/SKILL.md +103 -0
- package/src/skills/flutter-dart-code-review/SKILL.md +435 -0
- package/src/skills/frontend-patterns/SKILL.md +642 -0
- package/src/skills/gan-style-harness/SKILL.md +278 -0
- package/src/skills/git-workflow/SKILL.md +715 -0
- package/src/skills/golang-patterns/SKILL.md +674 -0
- package/src/skills/golang-testing/SKILL.md +720 -0
- package/src/skills/hexagonal-architecture/SKILL.md +276 -0
- package/src/skills/iterative-retrieval/SKILL.md +211 -0
- package/src/skills/java-coding-standards/SKILL.md +147 -0
- package/src/skills/jpa-patterns/SKILL.md +151 -0
- package/src/skills/kotlin-coroutines-flows/SKILL.md +284 -0
- package/src/skills/kotlin-exposed-patterns/SKILL.md +719 -0
- package/src/skills/kotlin-ktor-patterns/SKILL.md +689 -0
- package/src/skills/kotlin-patterns/SKILL.md +711 -0
- package/src/skills/kotlin-testing/SKILL.md +824 -0
- package/src/skills/laravel-patterns/SKILL.md +415 -0
- package/src/skills/laravel-plugin-discovery/SKILL.md +229 -0
- package/src/skills/laravel-security/SKILL.md +285 -0
- package/src/skills/laravel-tdd/SKILL.md +283 -0
- package/src/skills/laravel-verification/SKILL.md +179 -0
- package/src/skills/mcp-server-patterns/SKILL.md +67 -0
- package/src/skills/nextjs-turbopack/SKILL.md +44 -0
- package/src/skills/nuxt4-patterns/SKILL.md +100 -0
- package/src/skills/opensource-pipeline/SKILL.md +255 -0
- package/src/skills/perl-patterns/SKILL.md +504 -0
- package/src/skills/perl-security/SKILL.md +503 -0
- package/src/skills/perl-testing/SKILL.md +475 -0
- package/src/skills/postgres-patterns/SKILL.md +147 -0
- package/src/skills/project-flow-ops/SKILL.md +111 -0
- package/src/skills/project-guidelines-example/SKILL.md +349 -0
- package/src/skills/prompt-optimizer/SKILL.md +397 -0
- package/src/skills/python-patterns/SKILL.md +750 -0
- package/src/skills/python-testing/SKILL.md +816 -0
- package/src/skills/pytorch-patterns/SKILL.md +396 -0
- package/src/skills/regex-vs-llm-structured-text/SKILL.md +220 -0
- package/src/skills/repo-scan/SKILL.md +78 -0
- package/src/skills/rules-distill/SKILL.md +264 -0
- package/src/skills/rules-distill/scripts/scan-rules.sh +58 -0
- package/src/skills/rules-distill/scripts/scan-skills.sh +129 -0
- package/src/skills/rust-patterns/SKILL.md +499 -0
- package/src/skills/rust-testing/SKILL.md +500 -0
- package/src/skills/safety-guard/SKILL.md +69 -0
- package/src/skills/search-first/SKILL.md +161 -0
- package/src/skills/security-review/SKILL.md +495 -0
- package/src/skills/security-review/cloud-infrastructure-security.md +361 -0
- package/src/skills/security-scan/SKILL.md +165 -0
- package/src/skills/springboot-patterns/SKILL.md +314 -0
- package/src/skills/springboot-security/SKILL.md +272 -0
- package/src/skills/springboot-tdd/SKILL.md +158 -0
- package/src/skills/springboot-verification/SKILL.md +231 -0
- package/src/skills/swift-concurrency-6-2/SKILL.md +216 -0
- package/src/skills/tdd-workflow/SKILL.md +410 -0
- package/src/skills/token-budget-advisor/SKILL.md +133 -0
- package/{ecc/skills/verification-loop-SKILL.md → src/skills/verification-loop/SKILL.md} +1 -1
- package/src/skills/workspace-surface-audit/SKILL.md +125 -0
- package/ecc/scripts/hooks/session-end-marker.js +0 -15
- package/gsd/LICENSE +0 -21
- package/gsd/commands-gsd/discuss-phase.md +0 -90
- package/gsd/commands-gsd/execute-phase.md +0 -41
- package/gsd/commands-gsd/join-discord.md +0 -18
- package/gsd/commands-gsd/reapply-patches.md +0 -123
- package/gsd/commands-gsd/set-profile.md +0 -34
- package/gsd/get-shit-done/bin/lib/config.cjs +0 -169
- package/gsd/get-shit-done/bin/lib/core.cjs +0 -492
- package/gsd/get-shit-done/bin/lib/init.cjs +0 -710
- package/gsd/get-shit-done/workflows/execute-phase.md +0 -459
- package/gsd/get-shit-done/workflows/help.md +0 -489
- package/gsd/get-shit-done/workflows/pause-work.md +0 -122
- package/gsd/get-shit-done/workflows/plan-phase.md +0 -560
- package/gsd/hooks/gsd-check-update.js +0 -81
- package/kit/CLAUDE.md +0 -43
- package/kit/commands/kit/update.md +0 -46
- package/kit/mcp.json +0 -10
- package/kit/rules/code-style.md +0 -24
- /package/{ecc → src}/agents/architect.md +0 -0
- /package/{ecc → src}/agents/code-reviewer.md +0 -0
- /package/{ecc → src}/agents/e2e-runner.md +0 -0
- /package/{ecc → src}/agents/security-reviewer.md +0 -0
- /package/{ecc → src}/agents/tdd-guide.md +0 -0
- /package/{ecc/rules → src/available-rules}/golang/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/golang/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/golang/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/golang/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/golang/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/kotlin/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/kotlin/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/kotlin/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/kotlin/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/perl/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/perl/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/perl/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/perl/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/perl/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/php/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/php/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/php/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/php/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/php/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/python/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/python/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/python/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/python/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/python/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/swift/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/swift/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/swift/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/swift/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/swift/testing.md +0 -0
- /package/{ecc/rules → src/available-rules}/typescript/coding-style.md +0 -0
- /package/{ecc/rules → src/available-rules}/typescript/hooks.md +0 -0
- /package/{ecc/rules → src/available-rules}/typescript/patterns.md +0 -0
- /package/{ecc/rules → src/available-rules}/typescript/security.md +0 -0
- /package/{ecc/rules → src/available-rules}/typescript/testing.md +0 -0
- /package/{ecc → src}/contexts/dev.md +0 -0
- /package/{ecc → src}/contexts/research.md +0 -0
- /package/{ecc → src}/contexts/review.md +0 -0
- /package/{ecc → src}/examples/user-CLAUDE.md +0 -0
- /package/{ecc/scripts → src}/hooks/check-hook-enabled.js +0 -0
- /package/{ecc/scripts → src}/hooks/evaluate-session.js +0 -0
- /package/{ecc/scripts → src}/hooks/pre-compact.js +0 -0
- /package/{ecc/scripts → src}/hooks/suggest-compact.js +0 -0
- /package/{ecc/scripts → src}/lib/package-manager.d.ts +0 -0
- /package/{ecc/scripts → src}/lib/package-manager.js +0 -0
- /package/{ecc/scripts → src}/lib/resolve-formatter.js +0 -0
- /package/{ecc/scripts → src}/lib/session-aliases.d.ts +0 -0
- /package/{ecc/scripts → src}/lib/session-aliases.js +0 -0
- /package/{ecc/scripts → src}/lib/shell-split.js +0 -0
- /package/{gsd/get-shit-done → src/pilot}/bin/lib/frontmatter.cjs +0 -0
- /package/{gsd/get-shit-done → src/pilot}/bin/lib/milestone.cjs +0 -0
- /package/{gsd/get-shit-done → src/pilot}/bin/lib/roadmap.cjs +0 -0
- /package/{gsd/get-shit-done → src/pilot}/bin/lib/template.cjs +0 -0
- /package/{gsd/get-shit-done → src/pilot}/references/checkpoints.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/references/questioning.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/references/tdd.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/codebase/concerns.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/codebase/conventions.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/codebase/integrations.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/codebase/stack.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/codebase/testing.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/config.json +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/continue-here.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/milestone-archive.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/milestone.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/requirements.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/research-project/ARCHITECTURE.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/research-project/FEATURES.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/research-project/PITFALLS.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/research-project/STACK.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/research-project/SUMMARY.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/retrospective.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/roadmap.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/summary-minimal.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/summary-standard.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/summary.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/user-setup.md +0 -0
- /package/{gsd/get-shit-done → src/pilot}/templates/verification-report.md +0 -0
- /package/{ecc → src}/rules/common/development-workflow.md +0 -0
- /package/{ecc → src}/rules/common/git-workflow.md +0 -0
- /package/{ecc → src}/rules/common/hooks.md +0 -0
- /package/{ecc → src}/rules/common/patterns.md +0 -0
- /package/{ecc → src}/rules/common/performance.md +0 -0
- /package/{ecc → src}/rules/common/security.md +0 -0
- /package/{ecc → src}/rules/common/testing.md +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/agents/start-observer.sh +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/config.json +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/hooks/observe.sh +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/scripts/detect-project.sh +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/scripts/instinct-cli.py +0 -0
- /package/{ecc → src}/skills/continuous-learning-v2/scripts/test_parse_instinct.py +0 -0
- /package/{ecc → src}/skills/strategic-compact/SKILL.md +0 -0
- /package/{ecc → src}/skills/strategic-compact/suggest-compact.sh +0 -0
|
@@ -0,0 +1,326 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: e2e-testing
|
|
3
|
+
description: Playwright E2E testing patterns, Page Object Model, configuration, CI/CD integration, artifact management, and flaky test strategies.
|
|
4
|
+
origin: ECC
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# E2E Testing Patterns
|
|
8
|
+
|
|
9
|
+
Comprehensive Playwright patterns for building stable, fast, and maintainable E2E test suites.
|
|
10
|
+
|
|
11
|
+
## Test File Organization
|
|
12
|
+
|
|
13
|
+
```
|
|
14
|
+
tests/
|
|
15
|
+
├── e2e/
|
|
16
|
+
│ ├── auth/
|
|
17
|
+
│ │ ├── login.spec.ts
|
|
18
|
+
│ │ ├── logout.spec.ts
|
|
19
|
+
│ │ └── register.spec.ts
|
|
20
|
+
│ ├── features/
|
|
21
|
+
│ │ ├── browse.spec.ts
|
|
22
|
+
│ │ ├── search.spec.ts
|
|
23
|
+
│ │ └── create.spec.ts
|
|
24
|
+
│ └── api/
|
|
25
|
+
│ └── endpoints.spec.ts
|
|
26
|
+
├── fixtures/
|
|
27
|
+
│ ├── auth.ts
|
|
28
|
+
│ └── data.ts
|
|
29
|
+
└── playwright.config.ts
|
|
30
|
+
```
|
|
31
|
+
|
|
32
|
+
## Page Object Model (POM)
|
|
33
|
+
|
|
34
|
+
```typescript
|
|
35
|
+
import { Page, Locator } from '@playwright/test'
|
|
36
|
+
|
|
37
|
+
export class ItemsPage {
|
|
38
|
+
readonly page: Page
|
|
39
|
+
readonly searchInput: Locator
|
|
40
|
+
readonly itemCards: Locator
|
|
41
|
+
readonly createButton: Locator
|
|
42
|
+
|
|
43
|
+
constructor(page: Page) {
|
|
44
|
+
this.page = page
|
|
45
|
+
this.searchInput = page.locator('[data-testid="search-input"]')
|
|
46
|
+
this.itemCards = page.locator('[data-testid="item-card"]')
|
|
47
|
+
this.createButton = page.locator('[data-testid="create-btn"]')
|
|
48
|
+
}
|
|
49
|
+
|
|
50
|
+
async goto() {
|
|
51
|
+
await this.page.goto('/items')
|
|
52
|
+
await this.page.waitForLoadState('networkidle')
|
|
53
|
+
}
|
|
54
|
+
|
|
55
|
+
async search(query: string) {
|
|
56
|
+
await this.searchInput.fill(query)
|
|
57
|
+
await this.page.waitForResponse(resp => resp.url().includes('/api/search'))
|
|
58
|
+
await this.page.waitForLoadState('networkidle')
|
|
59
|
+
}
|
|
60
|
+
|
|
61
|
+
async getItemCount() {
|
|
62
|
+
return await this.itemCards.count()
|
|
63
|
+
}
|
|
64
|
+
}
|
|
65
|
+
```
|
|
66
|
+
|
|
67
|
+
## Test Structure
|
|
68
|
+
|
|
69
|
+
```typescript
|
|
70
|
+
import { test, expect } from '@playwright/test'
|
|
71
|
+
import { ItemsPage } from '../../pages/ItemsPage'
|
|
72
|
+
|
|
73
|
+
test.describe('Item Search', () => {
|
|
74
|
+
let itemsPage: ItemsPage
|
|
75
|
+
|
|
76
|
+
test.beforeEach(async ({ page }) => {
|
|
77
|
+
itemsPage = new ItemsPage(page)
|
|
78
|
+
await itemsPage.goto()
|
|
79
|
+
})
|
|
80
|
+
|
|
81
|
+
test('should search by keyword', async ({ page }) => {
|
|
82
|
+
await itemsPage.search('test')
|
|
83
|
+
|
|
84
|
+
const count = await itemsPage.getItemCount()
|
|
85
|
+
expect(count).toBeGreaterThan(0)
|
|
86
|
+
|
|
87
|
+
await expect(itemsPage.itemCards.first()).toContainText(/test/i)
|
|
88
|
+
await page.screenshot({ path: 'artifacts/search-results.png' })
|
|
89
|
+
})
|
|
90
|
+
|
|
91
|
+
test('should handle no results', async ({ page }) => {
|
|
92
|
+
await itemsPage.search('xyznonexistent123')
|
|
93
|
+
|
|
94
|
+
await expect(page.locator('[data-testid="no-results"]')).toBeVisible()
|
|
95
|
+
expect(await itemsPage.getItemCount()).toBe(0)
|
|
96
|
+
})
|
|
97
|
+
})
|
|
98
|
+
```
|
|
99
|
+
|
|
100
|
+
## Playwright Configuration
|
|
101
|
+
|
|
102
|
+
```typescript
|
|
103
|
+
import { defineConfig, devices } from '@playwright/test'
|
|
104
|
+
|
|
105
|
+
export default defineConfig({
|
|
106
|
+
testDir: './tests/e2e',
|
|
107
|
+
fullyParallel: true,
|
|
108
|
+
forbidOnly: !!process.env.CI,
|
|
109
|
+
retries: process.env.CI ? 2 : 0,
|
|
110
|
+
workers: process.env.CI ? 1 : undefined,
|
|
111
|
+
reporter: [
|
|
112
|
+
['html', { outputFolder: 'playwright-report' }],
|
|
113
|
+
['junit', { outputFile: 'playwright-results.xml' }],
|
|
114
|
+
['json', { outputFile: 'playwright-results.json' }]
|
|
115
|
+
],
|
|
116
|
+
use: {
|
|
117
|
+
baseURL: process.env.BASE_URL || 'http://localhost:3000',
|
|
118
|
+
trace: 'on-first-retry',
|
|
119
|
+
screenshot: 'only-on-failure',
|
|
120
|
+
video: 'retain-on-failure',
|
|
121
|
+
actionTimeout: 10000,
|
|
122
|
+
navigationTimeout: 30000,
|
|
123
|
+
},
|
|
124
|
+
projects: [
|
|
125
|
+
{ name: 'chromium', use: { ...devices['Desktop Chrome'] } },
|
|
126
|
+
{ name: 'firefox', use: { ...devices['Desktop Firefox'] } },
|
|
127
|
+
{ name: 'webkit', use: { ...devices['Desktop Safari'] } },
|
|
128
|
+
{ name: 'mobile-chrome', use: { ...devices['Pixel 5'] } },
|
|
129
|
+
],
|
|
130
|
+
webServer: {
|
|
131
|
+
command: 'npm run dev',
|
|
132
|
+
url: 'http://localhost:3000',
|
|
133
|
+
reuseExistingServer: !process.env.CI,
|
|
134
|
+
timeout: 120000,
|
|
135
|
+
},
|
|
136
|
+
})
|
|
137
|
+
```
|
|
138
|
+
|
|
139
|
+
## Flaky Test Patterns
|
|
140
|
+
|
|
141
|
+
### Quarantine
|
|
142
|
+
|
|
143
|
+
```typescript
|
|
144
|
+
test('flaky: complex search', async ({ page }) => {
|
|
145
|
+
test.fixme(true, 'Flaky - Issue #123')
|
|
146
|
+
// test code...
|
|
147
|
+
})
|
|
148
|
+
|
|
149
|
+
test('conditional skip', async ({ page }) => {
|
|
150
|
+
test.skip(process.env.CI, 'Flaky in CI - Issue #123')
|
|
151
|
+
// test code...
|
|
152
|
+
})
|
|
153
|
+
```
|
|
154
|
+
|
|
155
|
+
### Identify Flakiness
|
|
156
|
+
|
|
157
|
+
```bash
|
|
158
|
+
npx playwright test tests/search.spec.ts --repeat-each=10
|
|
159
|
+
npx playwright test tests/search.spec.ts --retries=3
|
|
160
|
+
```
|
|
161
|
+
|
|
162
|
+
### Common Causes & Fixes
|
|
163
|
+
|
|
164
|
+
**Race conditions:**
|
|
165
|
+
```typescript
|
|
166
|
+
// Bad: assumes element is ready
|
|
167
|
+
await page.click('[data-testid="button"]')
|
|
168
|
+
|
|
169
|
+
// Good: auto-wait locator
|
|
170
|
+
await page.locator('[data-testid="button"]').click()
|
|
171
|
+
```
|
|
172
|
+
|
|
173
|
+
**Network timing:**
|
|
174
|
+
```typescript
|
|
175
|
+
// Bad: arbitrary timeout
|
|
176
|
+
await page.waitForTimeout(5000)
|
|
177
|
+
|
|
178
|
+
// Good: wait for specific condition
|
|
179
|
+
await page.waitForResponse(resp => resp.url().includes('/api/data'))
|
|
180
|
+
```
|
|
181
|
+
|
|
182
|
+
**Animation timing:**
|
|
183
|
+
```typescript
|
|
184
|
+
// Bad: click during animation
|
|
185
|
+
await page.click('[data-testid="menu-item"]')
|
|
186
|
+
|
|
187
|
+
// Good: wait for stability
|
|
188
|
+
await page.locator('[data-testid="menu-item"]').waitFor({ state: 'visible' })
|
|
189
|
+
await page.waitForLoadState('networkidle')
|
|
190
|
+
await page.locator('[data-testid="menu-item"]').click()
|
|
191
|
+
```
|
|
192
|
+
|
|
193
|
+
## Artifact Management
|
|
194
|
+
|
|
195
|
+
### Screenshots
|
|
196
|
+
|
|
197
|
+
```typescript
|
|
198
|
+
await page.screenshot({ path: 'artifacts/after-login.png' })
|
|
199
|
+
await page.screenshot({ path: 'artifacts/full-page.png', fullPage: true })
|
|
200
|
+
await page.locator('[data-testid="chart"]').screenshot({ path: 'artifacts/chart.png' })
|
|
201
|
+
```
|
|
202
|
+
|
|
203
|
+
### Traces
|
|
204
|
+
|
|
205
|
+
```typescript
|
|
206
|
+
await browser.startTracing(page, {
|
|
207
|
+
path: 'artifacts/trace.json',
|
|
208
|
+
screenshots: true,
|
|
209
|
+
snapshots: true,
|
|
210
|
+
})
|
|
211
|
+
// ... test actions ...
|
|
212
|
+
await browser.stopTracing()
|
|
213
|
+
```
|
|
214
|
+
|
|
215
|
+
### Video
|
|
216
|
+
|
|
217
|
+
```typescript
|
|
218
|
+
// In playwright.config.ts
|
|
219
|
+
use: {
|
|
220
|
+
video: 'retain-on-failure',
|
|
221
|
+
videosPath: 'artifacts/videos/'
|
|
222
|
+
}
|
|
223
|
+
```
|
|
224
|
+
|
|
225
|
+
## CI/CD Integration
|
|
226
|
+
|
|
227
|
+
```yaml
|
|
228
|
+
# .github/workflows/e2e.yml
|
|
229
|
+
name: E2E Tests
|
|
230
|
+
on: [push, pull_request]
|
|
231
|
+
|
|
232
|
+
jobs:
|
|
233
|
+
test:
|
|
234
|
+
runs-on: ubuntu-latest
|
|
235
|
+
steps:
|
|
236
|
+
- uses: actions/checkout@v4
|
|
237
|
+
- uses: actions/setup-node@v4
|
|
238
|
+
with:
|
|
239
|
+
node-version: 20
|
|
240
|
+
- run: npm ci
|
|
241
|
+
- run: npx playwright install --with-deps
|
|
242
|
+
- run: npx playwright test
|
|
243
|
+
env:
|
|
244
|
+
BASE_URL: ${{ vars.STAGING_URL }}
|
|
245
|
+
- uses: actions/upload-artifact@v4
|
|
246
|
+
if: always()
|
|
247
|
+
with:
|
|
248
|
+
name: playwright-report
|
|
249
|
+
path: playwright-report/
|
|
250
|
+
retention-days: 30
|
|
251
|
+
```
|
|
252
|
+
|
|
253
|
+
## Test Report Template
|
|
254
|
+
|
|
255
|
+
```markdown
|
|
256
|
+
# E2E Test Report
|
|
257
|
+
|
|
258
|
+
**Date:** YYYY-MM-DD HH:MM
|
|
259
|
+
**Duration:** Xm Ys
|
|
260
|
+
**Status:** PASSING / FAILING
|
|
261
|
+
|
|
262
|
+
## Summary
|
|
263
|
+
- Total: X | Passed: Y (Z%) | Failed: A | Flaky: B | Skipped: C
|
|
264
|
+
|
|
265
|
+
## Failed Tests
|
|
266
|
+
|
|
267
|
+
### test-name
|
|
268
|
+
**File:** `tests/e2e/feature.spec.ts:45`
|
|
269
|
+
**Error:** Expected element to be visible
|
|
270
|
+
**Screenshot:** artifacts/failed.png
|
|
271
|
+
**Recommended Fix:** [description]
|
|
272
|
+
|
|
273
|
+
## Artifacts
|
|
274
|
+
- HTML Report: playwright-report/index.html
|
|
275
|
+
- Screenshots: artifacts/*.png
|
|
276
|
+
- Videos: artifacts/videos/*.webm
|
|
277
|
+
- Traces: artifacts/*.zip
|
|
278
|
+
```
|
|
279
|
+
|
|
280
|
+
## Wallet / Web3 Testing
|
|
281
|
+
|
|
282
|
+
```typescript
|
|
283
|
+
test('wallet connection', async ({ page, context }) => {
|
|
284
|
+
// Mock wallet provider
|
|
285
|
+
await context.addInitScript(() => {
|
|
286
|
+
window.ethereum = {
|
|
287
|
+
isMetaMask: true,
|
|
288
|
+
request: async ({ method }) => {
|
|
289
|
+
if (method === 'eth_requestAccounts')
|
|
290
|
+
return ['0x1234567890123456789012345678901234567890']
|
|
291
|
+
if (method === 'eth_chainId') return '0x1'
|
|
292
|
+
}
|
|
293
|
+
}
|
|
294
|
+
})
|
|
295
|
+
|
|
296
|
+
await page.goto('/')
|
|
297
|
+
await page.locator('[data-testid="connect-wallet"]').click()
|
|
298
|
+
await expect(page.locator('[data-testid="wallet-address"]')).toContainText('0x1234')
|
|
299
|
+
})
|
|
300
|
+
```
|
|
301
|
+
|
|
302
|
+
## Financial / Critical Flow Testing
|
|
303
|
+
|
|
304
|
+
```typescript
|
|
305
|
+
test('trade execution', async ({ page }) => {
|
|
306
|
+
// Skip on production — real money
|
|
307
|
+
test.skip(process.env.NODE_ENV === 'production', 'Skip on production')
|
|
308
|
+
|
|
309
|
+
await page.goto('/markets/test-market')
|
|
310
|
+
await page.locator('[data-testid="position-yes"]').click()
|
|
311
|
+
await page.locator('[data-testid="trade-amount"]').fill('1.0')
|
|
312
|
+
|
|
313
|
+
// Verify preview
|
|
314
|
+
const preview = page.locator('[data-testid="trade-preview"]')
|
|
315
|
+
await expect(preview).toContainText('1.0')
|
|
316
|
+
|
|
317
|
+
// Confirm and wait for blockchain
|
|
318
|
+
await page.locator('[data-testid="confirm-trade"]').click()
|
|
319
|
+
await page.waitForResponse(
|
|
320
|
+
resp => resp.url().includes('/api/trade') && resp.status() === 200,
|
|
321
|
+
{ timeout: 30000 }
|
|
322
|
+
)
|
|
323
|
+
|
|
324
|
+
await expect(page.locator('[data-testid="trade-success"]')).toBeVisible()
|
|
325
|
+
})
|
|
326
|
+
```
|
|
@@ -0,0 +1,270 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: eval-harness
|
|
3
|
+
description: Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles
|
|
4
|
+
origin: ECC
|
|
5
|
+
tools: Read, Write, Edit, Bash, Grep, Glob
|
|
6
|
+
---
|
|
7
|
+
|
|
8
|
+
# Eval Harness Skill
|
|
9
|
+
|
|
10
|
+
A formal evaluation framework for Claude Code sessions, implementing eval-driven development (EDD) principles.
|
|
11
|
+
|
|
12
|
+
## When to Activate
|
|
13
|
+
|
|
14
|
+
- Setting up eval-driven development (EDD) for AI-assisted workflows
|
|
15
|
+
- Defining pass/fail criteria for Claude Code task completion
|
|
16
|
+
- Measuring agent reliability with pass@k metrics
|
|
17
|
+
- Creating regression test suites for prompt or agent changes
|
|
18
|
+
- Benchmarking agent performance across model versions
|
|
19
|
+
|
|
20
|
+
## Philosophy
|
|
21
|
+
|
|
22
|
+
Eval-Driven Development treats evals as the "unit tests of AI development":
|
|
23
|
+
- Define expected behavior BEFORE implementation
|
|
24
|
+
- Run evals continuously during development
|
|
25
|
+
- Track regressions with each change
|
|
26
|
+
- Use pass@k metrics for reliability measurement
|
|
27
|
+
|
|
28
|
+
## Eval Types
|
|
29
|
+
|
|
30
|
+
### Capability Evals
|
|
31
|
+
Test if Claude can do something it couldn't before:
|
|
32
|
+
```markdown
|
|
33
|
+
[CAPABILITY EVAL: feature-name]
|
|
34
|
+
Task: Description of what Claude should accomplish
|
|
35
|
+
Success Criteria:
|
|
36
|
+
- [ ] Criterion 1
|
|
37
|
+
- [ ] Criterion 2
|
|
38
|
+
- [ ] Criterion 3
|
|
39
|
+
Expected Output: Description of expected result
|
|
40
|
+
```
|
|
41
|
+
|
|
42
|
+
### Regression Evals
|
|
43
|
+
Ensure changes don't break existing functionality:
|
|
44
|
+
```markdown
|
|
45
|
+
[REGRESSION EVAL: feature-name]
|
|
46
|
+
Baseline: SHA or checkpoint name
|
|
47
|
+
Tests:
|
|
48
|
+
- existing-test-1: PASS/FAIL
|
|
49
|
+
- existing-test-2: PASS/FAIL
|
|
50
|
+
- existing-test-3: PASS/FAIL
|
|
51
|
+
Result: X/Y passed (previously Y/Y)
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
## Grader Types
|
|
55
|
+
|
|
56
|
+
### 1. Code-Based Grader
|
|
57
|
+
Deterministic checks using code:
|
|
58
|
+
```bash
|
|
59
|
+
# Check if file contains expected pattern
|
|
60
|
+
grep -q "export function handleAuth" src/auth.ts && echo "PASS" || echo "FAIL"
|
|
61
|
+
|
|
62
|
+
# Check if tests pass
|
|
63
|
+
npm test -- --testPathPattern="auth" && echo "PASS" || echo "FAIL"
|
|
64
|
+
|
|
65
|
+
# Check if build succeeds
|
|
66
|
+
npm run build && echo "PASS" || echo "FAIL"
|
|
67
|
+
```
|
|
68
|
+
|
|
69
|
+
### 2. Model-Based Grader
|
|
70
|
+
Use Claude to evaluate open-ended outputs:
|
|
71
|
+
```markdown
|
|
72
|
+
[MODEL GRADER PROMPT]
|
|
73
|
+
Evaluate the following code change:
|
|
74
|
+
1. Does it solve the stated problem?
|
|
75
|
+
2. Is it well-structured?
|
|
76
|
+
3. Are edge cases handled?
|
|
77
|
+
4. Is error handling appropriate?
|
|
78
|
+
|
|
79
|
+
Score: 1-5 (1=poor, 5=excellent)
|
|
80
|
+
Reasoning: [explanation]
|
|
81
|
+
```
|
|
82
|
+
|
|
83
|
+
### 3. Human Grader
|
|
84
|
+
Flag for manual review:
|
|
85
|
+
```markdown
|
|
86
|
+
[HUMAN REVIEW REQUIRED]
|
|
87
|
+
Change: Description of what changed
|
|
88
|
+
Reason: Why human review is needed
|
|
89
|
+
Risk Level: LOW/MEDIUM/HIGH
|
|
90
|
+
```
|
|
91
|
+
|
|
92
|
+
## Metrics
|
|
93
|
+
|
|
94
|
+
### pass@k
|
|
95
|
+
"At least one success in k attempts"
|
|
96
|
+
- pass@1: First attempt success rate
|
|
97
|
+
- pass@3: Success within 3 attempts
|
|
98
|
+
- Typical target: pass@3 > 90%
|
|
99
|
+
|
|
100
|
+
### pass^k
|
|
101
|
+
"All k trials succeed"
|
|
102
|
+
- Higher bar for reliability
|
|
103
|
+
- pass^3: 3 consecutive successes
|
|
104
|
+
- Use for critical paths
|
|
105
|
+
|
|
106
|
+
## Eval Workflow
|
|
107
|
+
|
|
108
|
+
### 1. Define (Before Coding)
|
|
109
|
+
```markdown
|
|
110
|
+
## EVAL DEFINITION: feature-xyz
|
|
111
|
+
|
|
112
|
+
### Capability Evals
|
|
113
|
+
1. Can create new user account
|
|
114
|
+
2. Can validate email format
|
|
115
|
+
3. Can hash password securely
|
|
116
|
+
|
|
117
|
+
### Regression Evals
|
|
118
|
+
1. Existing login still works
|
|
119
|
+
2. Session management unchanged
|
|
120
|
+
3. Logout flow intact
|
|
121
|
+
|
|
122
|
+
### Success Metrics
|
|
123
|
+
- pass@3 > 90% for capability evals
|
|
124
|
+
- pass^3 = 100% for regression evals
|
|
125
|
+
```
|
|
126
|
+
|
|
127
|
+
### 2. Implement
|
|
128
|
+
Write code to pass the defined evals.
|
|
129
|
+
|
|
130
|
+
### 3. Evaluate
|
|
131
|
+
```bash
|
|
132
|
+
# Run capability evals
|
|
133
|
+
[Run each capability eval, record PASS/FAIL]
|
|
134
|
+
|
|
135
|
+
# Run regression evals
|
|
136
|
+
npm test -- --testPathPattern="existing"
|
|
137
|
+
|
|
138
|
+
# Generate report
|
|
139
|
+
```
|
|
140
|
+
|
|
141
|
+
### 4. Report
|
|
142
|
+
```markdown
|
|
143
|
+
EVAL REPORT: feature-xyz
|
|
144
|
+
========================
|
|
145
|
+
|
|
146
|
+
Capability Evals:
|
|
147
|
+
create-user: PASS (pass@1)
|
|
148
|
+
validate-email: PASS (pass@2)
|
|
149
|
+
hash-password: PASS (pass@1)
|
|
150
|
+
Overall: 3/3 passed
|
|
151
|
+
|
|
152
|
+
Regression Evals:
|
|
153
|
+
login-flow: PASS
|
|
154
|
+
session-mgmt: PASS
|
|
155
|
+
logout-flow: PASS
|
|
156
|
+
Overall: 3/3 passed
|
|
157
|
+
|
|
158
|
+
Metrics:
|
|
159
|
+
pass@1: 67% (2/3)
|
|
160
|
+
pass@3: 100% (3/3)
|
|
161
|
+
|
|
162
|
+
Status: READY FOR REVIEW
|
|
163
|
+
```
|
|
164
|
+
|
|
165
|
+
## Integration Patterns
|
|
166
|
+
|
|
167
|
+
### Pre-Implementation
|
|
168
|
+
```
|
|
169
|
+
/eval define feature-name
|
|
170
|
+
```
|
|
171
|
+
Creates eval definition file at `.claude/evals/feature-name.md`
|
|
172
|
+
|
|
173
|
+
### During Implementation
|
|
174
|
+
```
|
|
175
|
+
/eval check feature-name
|
|
176
|
+
```
|
|
177
|
+
Runs current evals and reports status
|
|
178
|
+
|
|
179
|
+
### Post-Implementation
|
|
180
|
+
```
|
|
181
|
+
/eval report feature-name
|
|
182
|
+
```
|
|
183
|
+
Generates full eval report
|
|
184
|
+
|
|
185
|
+
## Eval Storage
|
|
186
|
+
|
|
187
|
+
Store evals in project:
|
|
188
|
+
```
|
|
189
|
+
.claude/
|
|
190
|
+
evals/
|
|
191
|
+
feature-xyz.md # Eval definition
|
|
192
|
+
feature-xyz.log # Eval run history
|
|
193
|
+
baseline.json # Regression baselines
|
|
194
|
+
```
|
|
195
|
+
|
|
196
|
+
## Best Practices
|
|
197
|
+
|
|
198
|
+
1. **Define evals BEFORE coding** - Forces clear thinking about success criteria
|
|
199
|
+
2. **Run evals frequently** - Catch regressions early
|
|
200
|
+
3. **Track pass@k over time** - Monitor reliability trends
|
|
201
|
+
4. **Use code graders when possible** - Deterministic > probabilistic
|
|
202
|
+
5. **Human review for security** - Never fully automate security checks
|
|
203
|
+
6. **Keep evals fast** - Slow evals don't get run
|
|
204
|
+
7. **Version evals with code** - Evals are first-class artifacts
|
|
205
|
+
|
|
206
|
+
## Example: Adding Authentication
|
|
207
|
+
|
|
208
|
+
```markdown
|
|
209
|
+
## EVAL: add-authentication
|
|
210
|
+
|
|
211
|
+
### Phase 1: Define (10 min)
|
|
212
|
+
Capability Evals:
|
|
213
|
+
- [ ] User can register with email/password
|
|
214
|
+
- [ ] User can login with valid credentials
|
|
215
|
+
- [ ] Invalid credentials rejected with proper error
|
|
216
|
+
- [ ] Sessions persist across page reloads
|
|
217
|
+
- [ ] Logout clears session
|
|
218
|
+
|
|
219
|
+
Regression Evals:
|
|
220
|
+
- [ ] Public routes still accessible
|
|
221
|
+
- [ ] API responses unchanged
|
|
222
|
+
- [ ] Database schema compatible
|
|
223
|
+
|
|
224
|
+
### Phase 2: Implement (varies)
|
|
225
|
+
[Write code]
|
|
226
|
+
|
|
227
|
+
### Phase 3: Evaluate
|
|
228
|
+
Run: /eval check add-authentication
|
|
229
|
+
|
|
230
|
+
### Phase 4: Report
|
|
231
|
+
EVAL REPORT: add-authentication
|
|
232
|
+
==============================
|
|
233
|
+
Capability: 5/5 passed (pass@3: 100%)
|
|
234
|
+
Regression: 3/3 passed (pass^3: 100%)
|
|
235
|
+
Status: SHIP IT
|
|
236
|
+
```
|
|
237
|
+
|
|
238
|
+
## Product Evals (v1.8)
|
|
239
|
+
|
|
240
|
+
Use product evals when behavior quality cannot be captured by unit tests alone.
|
|
241
|
+
|
|
242
|
+
### Grader Types
|
|
243
|
+
|
|
244
|
+
1. Code grader (deterministic assertions)
|
|
245
|
+
2. Rule grader (regex/schema constraints)
|
|
246
|
+
3. Model grader (LLM-as-judge rubric)
|
|
247
|
+
4. Human grader (manual adjudication for ambiguous outputs)
|
|
248
|
+
|
|
249
|
+
### pass@k Guidance
|
|
250
|
+
|
|
251
|
+
- `pass@1`: direct reliability
|
|
252
|
+
- `pass@3`: practical reliability under controlled retries
|
|
253
|
+
- `pass^3`: stability test (all 3 runs must pass)
|
|
254
|
+
|
|
255
|
+
Recommended thresholds:
|
|
256
|
+
- Capability evals: pass@3 >= 0.90
|
|
257
|
+
- Regression evals: pass^3 = 1.00 for release-critical paths
|
|
258
|
+
|
|
259
|
+
### Eval Anti-Patterns
|
|
260
|
+
|
|
261
|
+
- Overfitting prompts to known eval examples
|
|
262
|
+
- Measuring only happy-path outputs
|
|
263
|
+
- Ignoring cost and latency drift while chasing pass rates
|
|
264
|
+
- Allowing flaky graders in release gates
|
|
265
|
+
|
|
266
|
+
### Minimal Eval Artifact Layout
|
|
267
|
+
|
|
268
|
+
- `.claude/evals/<feature>.md` definition
|
|
269
|
+
- `.claude/evals/<feature>.log` run history
|
|
270
|
+
- `docs/releases/<version>/eval-summary.md` release snapshot
|
|
@@ -0,0 +1,103 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: exa-search
|
|
3
|
+
description: Neural search via Exa MCP for web, code, and company research. Use when the user needs web search, code examples, company intel, people lookup, or AI-powered deep research with Exa's neural search engine.
|
|
4
|
+
origin: ECC
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
# Exa Search
|
|
8
|
+
|
|
9
|
+
Neural search for web content, code, companies, and people via the Exa MCP server.
|
|
10
|
+
|
|
11
|
+
## When to Activate
|
|
12
|
+
|
|
13
|
+
- User needs current web information or news
|
|
14
|
+
- Searching for code examples, API docs, or technical references
|
|
15
|
+
- Researching companies, competitors, or market players
|
|
16
|
+
- Finding professional profiles or people in a domain
|
|
17
|
+
- Running background research for any development task
|
|
18
|
+
- User says "search for", "look up", "find", or "what's the latest on"
|
|
19
|
+
|
|
20
|
+
## MCP Requirement
|
|
21
|
+
|
|
22
|
+
Exa MCP server must be configured. Add to `~/.claude.json`:
|
|
23
|
+
|
|
24
|
+
```json
|
|
25
|
+
"exa-web-search": {
|
|
26
|
+
"command": "npx",
|
|
27
|
+
"args": ["-y", "exa-mcp-server"],
|
|
28
|
+
"env": { "EXA_API_KEY": "YOUR_EXA_API_KEY_HERE" }
|
|
29
|
+
}
|
|
30
|
+
```
|
|
31
|
+
|
|
32
|
+
Get an API key at [exa.ai](https://exa.ai).
|
|
33
|
+
This repo's current Exa setup documents the tool surface exposed here: `web_search_exa` and `get_code_context_exa`.
|
|
34
|
+
If your Exa server exposes additional tools, verify their exact names before depending on them in docs or prompts.
|
|
35
|
+
|
|
36
|
+
## Core Tools
|
|
37
|
+
|
|
38
|
+
### web_search_exa
|
|
39
|
+
General web search for current information, news, or facts.
|
|
40
|
+
|
|
41
|
+
```
|
|
42
|
+
web_search_exa(query: "latest AI developments 2026", numResults: 5)
|
|
43
|
+
```
|
|
44
|
+
|
|
45
|
+
**Parameters:**
|
|
46
|
+
|
|
47
|
+
| Param | Type | Default | Notes |
|
|
48
|
+
|-------|------|---------|-------|
|
|
49
|
+
| `query` | string | required | Search query |
|
|
50
|
+
| `numResults` | number | 8 | Number of results |
|
|
51
|
+
| `type` | string | `auto` | Search mode |
|
|
52
|
+
| `livecrawl` | string | `fallback` | Prefer live crawling when needed |
|
|
53
|
+
| `category` | string | none | Optional focus such as `company` or `research paper` |
|
|
54
|
+
|
|
55
|
+
### get_code_context_exa
|
|
56
|
+
Find code examples and documentation from GitHub, Stack Overflow, and docs sites.
|
|
57
|
+
|
|
58
|
+
```
|
|
59
|
+
get_code_context_exa(query: "Python asyncio patterns", tokensNum: 3000)
|
|
60
|
+
```
|
|
61
|
+
|
|
62
|
+
**Parameters:**
|
|
63
|
+
|
|
64
|
+
| Param | Type | Default | Notes |
|
|
65
|
+
|-------|------|---------|-------|
|
|
66
|
+
| `query` | string | required | Code or API search query |
|
|
67
|
+
| `tokensNum` | number | 5000 | Content tokens (1000-50000) |
|
|
68
|
+
|
|
69
|
+
## Usage Patterns
|
|
70
|
+
|
|
71
|
+
### Quick Lookup
|
|
72
|
+
```
|
|
73
|
+
web_search_exa(query: "Node.js 22 new features", numResults: 3)
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
### Code Research
|
|
77
|
+
```
|
|
78
|
+
get_code_context_exa(query: "Rust error handling patterns Result type", tokensNum: 3000)
|
|
79
|
+
```
|
|
80
|
+
|
|
81
|
+
### Company or People Research
|
|
82
|
+
```
|
|
83
|
+
web_search_exa(query: "Vercel funding valuation 2026", numResults: 3, category: "company")
|
|
84
|
+
web_search_exa(query: "site:linkedin.com/in AI safety researchers Anthropic", numResults: 5)
|
|
85
|
+
```
|
|
86
|
+
|
|
87
|
+
### Technical Deep Dive
|
|
88
|
+
```
|
|
89
|
+
web_search_exa(query: "WebAssembly component model status and adoption", numResults: 5)
|
|
90
|
+
get_code_context_exa(query: "WebAssembly component model examples", tokensNum: 4000)
|
|
91
|
+
```
|
|
92
|
+
|
|
93
|
+
## Tips
|
|
94
|
+
|
|
95
|
+
- Use `web_search_exa` for current information, company lookups, and broad discovery
|
|
96
|
+
- Use search operators like `site:`, quoted phrases, and `intitle:` to narrow results
|
|
97
|
+
- Lower `tokensNum` (1000-2000) for focused code snippets, higher (5000+) for comprehensive context
|
|
98
|
+
- Use `get_code_context_exa` when you need API usage or code examples rather than general web pages
|
|
99
|
+
|
|
100
|
+
## Related Skills
|
|
101
|
+
|
|
102
|
+
- `deep-research` — Full research workflow using firecrawl + exa together
|
|
103
|
+
- `market-research` — Business-oriented research with decision frameworks
|