workflow-ai 1.1.0 → 1.2.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +49 -0
- package/README.md +105 -7
- package/configs/pipeline.yaml +23 -2
- package/package.json +44 -44
- package/src/lib/operations/tickets.mjs +305 -207
- package/src/lib/utils.mjs +286 -286
- package/src/runner.mjs +314 -34
- package/src/scripts/check-conditions.js +2 -2
- package/src/scripts/get-next-id.js +144 -41
- package/src/scripts/move-ticket.js +225 -68
- package/src/scripts/pick-next-task.js +753 -93
- package/src/skills/coach/SKILL.md +1 -1
- package/src/skills/manual-testing/SKILL.md +2 -0
- package/src/scripts/tests/timeout-cascade.test.js +0 -28
- package/src/skills/analyze-report/README.md +0 -44
- package/src/skills/analyze-report/algorithms/progress-assessment.md +0 -108
- package/src/skills/analyze-report/knowledge/analysis-frameworks.md +0 -66
- package/src/skills/analyze-report/knowledge/report-structure.md +0 -61
- package/src/skills/analyze-report/scripts/calc-plan-metrics.js +0 -234
- package/src/skills/analyze-report/templates/analysis-report.md +0 -80
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-2.md +0 -98
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-3.md +0 -99
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -88
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -100
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-1.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-2.md +0 -64
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-3.md +0 -110
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-1.md +0 -74
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-2.md +0 -38
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-3.md +0 -61
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001-evidence-from-log.yaml +0 -60
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-1.md +0 -90
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-2.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-3.md +0 -5
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-1.md +0 -84
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-2.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-3.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-1.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-2.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-3.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-1.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-2.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-3.md +0 -86
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002-result-block-format.yaml +0 -44
- package/src/skills/analyze-report/tests/fixtures/REPORT-002-incorrect-attribution.md +0 -27
- package/src/skills/analyze-report/tests/fixtures/pipeline-2026-04-06_qa-001-skip.log +0 -32
- package/src/skills/analyze-report/tests/index.yaml +0 -25
- package/src/skills/analyze-report/tests/rubrics/evidence-from-log.md +0 -22
- package/src/skills/analyze-report/tests/rubrics/result-block-format.md +0 -22
- package/src/skills/analyze-report/workflows/progress.md +0 -158
- package/src/skills/analyze-report/workflows/retrospective.md +0 -143
- package/src/skills/coach/README.md +0 -43
- package/src/skills/coach/SKILL.md.legacy +0 -157
- package/src/skills/coach/algorithms/gap-analysis.md +0 -69
- package/src/skills/coach/algorithms/improvement-prioritization.md +0 -62
- package/src/skills/coach/algorithms/skill-scoring.md +0 -80
- package/src/skills/coach/knowledge/audit-applied-changes-clean.txt +0 -11
- package/src/skills/coach/knowledge/backlog-management.md +0 -67
- package/src/skills/coach/knowledge/backlog-management.md.legacy +0 -90
- package/src/skills/coach/knowledge/common-antipatterns.md +0 -76
- package/src/skills/coach/knowledge/prompt-engineering.md +0 -45
- package/src/skills/coach/knowledge/shared-knowledge-guide.md +0 -44
- package/src/skills/coach/knowledge/skill-anatomy.md +0 -49
- package/src/skills/coach/knowledge/test-authorship.md +0 -141
- package/src/skills/coach/templates/audit-report.md +0 -39
- package/src/skills/coach/templates/coach-backlog-init.yaml +0 -14
- package/src/skills/coach/templates/coach-backlog-init.yaml.legacy +0 -10
- package/src/skills/coach/templates/improvement-plan.md +0 -42
- package/src/skills/coach/templates/new-skill.md +0 -95
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-1.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-2.md +0 -65
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-3.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-3.md +0 -75
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-1.md +0 -81
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-2.md +0 -101
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-3.md +0 -91
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-1.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-2.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-3.md +0 -55
- package/src/skills/coach/tests/cases/TC-COACH-001/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-001-evidence-based-temporal-diagram.yaml +0 -53
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-2.md +0 -50
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-3.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-002/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-2.md +0 -37
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-3.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-1.md +0 -23
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-2.md +0 -29
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-1.md +0 -13
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-2.md +0 -19
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-3.md +0 -33
- package/src/skills/coach/tests/cases/TC-COACH-002/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-002-root-cause-first.yaml +0 -57
- package/src/skills/coach/tests/fixtures/pipeline-2026-04-06_id-collision.log +0 -77
- package/src/skills/coach/tests/index.yaml +0 -29
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-bad.md +0 -13
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-good.md +0 -29
- package/src/skills/coach/tests/rubrics/evidence-based.md +0 -26
- package/src/skills/coach/tests/rubrics/root-cause-first.md +0 -21
- package/src/skills/coach/workflows/analyze.md +0 -79
- package/src/skills/coach/workflows/analyze.md.legacy +0 -64
- package/src/skills/coach/workflows/audit.md +0 -74
- package/src/skills/coach/workflows/audit.md.legacy +0 -59
- package/src/skills/coach/workflows/create.md +0 -80
- package/src/skills/coach/workflows/create.md.legacy +0 -67
- package/src/skills/coach/workflows/improve.md +0 -71
- package/src/skills/coach/workflows/improve.md.legacy +0 -60
- package/src/skills/coach/workflows/research.md +0 -55
- package/src/skills/coach/workflows/review.md +0 -52
- package/src/skills/coach/workflows/review.md.legacy +0 -48
- package/src/skills/coach/workflows/test.md +0 -97
- package/src/skills/create-plan/README.md +0 -39
- package/src/skills/create-plan/algorithms/risk-assessment.md +0 -73
- package/src/skills/create-plan/knowledge/plan-completeness.md +0 -67
- package/src/skills/create-plan/knowledge/plan-lifecycle.md +0 -33
- package/src/skills/create-plan/knowledge/task-verification-pairs.md +0 -151
- package/src/skills/create-plan/knowledge/test-hygiene.md +0 -47
- package/src/skills/create-plan/scripts/validate-completeness.js +0 -182
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-2.md +0 -39
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-3.md +0 -35
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/judge.json +0 -167
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -10
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-1.md +0 -26
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-2.md +0 -86
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-1.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/meta.json +0 -119
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001-validate-completeness.yaml +0 -41
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/judge.json +0 -164
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -3
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -13
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-1.md +0 -44
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-3.md +0 -49
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-1.md +0 -6
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-3.md +0 -16
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/meta.json +0 -116
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002-task-granularity.yaml +0 -39
- package/src/skills/create-plan/tests/index.yaml +0 -25
- package/src/skills/create-plan/tests/rubrics/task-granularity.md +0 -21
- package/src/skills/create-plan/tests/rubrics/validate-completeness.md +0 -21
- package/src/skills/create-plan/workflows/create.md +0 -136
- package/src/skills/create-report/README.md +0 -40
- package/src/skills/create-report/algorithms/metric-calculation.md +0 -93
- package/src/skills/create-report/knowledge/report-metrics.md +0 -82
- package/src/skills/create-report/scripts/calc-metrics.js +0 -383
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-2.md +0 -26
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-3.md +0 -28
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/judge.json +0 -163
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -3
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-1.md +0 -8
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-2.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-3.md +0 -7
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-1.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-2.md +0 -22
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-3.md +0 -13
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/meta.json +0 -115
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001-root-cause-attribution.yaml +0 -57
- package/src/skills/create-report/tests/index.yaml +0 -20
- package/src/skills/create-report/tests/rubrics/root-cause-attribution.md +0 -21
- package/src/skills/create-report/workflows/standard.md +0 -175
- package/src/skills/decompose-gaps/README.md +0 -39
- package/src/skills/decompose-gaps/algorithms/scope-check.md +0 -110
- package/src/skills/decompose-gaps/knowledge/scope-validation.md +0 -65
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-1.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-2.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-3.md +0 -56
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/judge.json +0 -164
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-3.md +0 -22
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-3.md +0 -29
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-2.md +0 -35
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-3.md +0 -18
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/meta.json +0 -116
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001-scope-exclusion.yaml +0 -46
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-3.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/judge.json +0 -163
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-1.md +0 -21
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-2.md +0 -38
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-3.md +0 -16
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-2.md +0 -10
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-3.md +0 -9
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/meta.json +0 -115
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002-glob-before-write.yaml +0 -36
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/judge.json +0 -165
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-2.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-2.md +0 -37
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-3.md +0 -45
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-1.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-2.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/meta.json +0 -117
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003-parent-plan-mandatory.yaml +0 -41
- package/src/skills/decompose-gaps/tests/index.yaml +0 -30
- package/src/skills/decompose-gaps/tests/rubrics/glob-before-write.md +0 -21
- package/src/skills/decompose-gaps/tests/rubrics/parent-plan-mandatory.md +0 -22
- package/src/skills/decompose-gaps/tests/rubrics/scope-exclusion.md +0 -21
- package/src/skills/decompose-gaps/workflows/decompose.md +0 -123
- package/src/skills/decompose-plan/README.md +0 -43
- package/src/skills/decompose-plan/algorithms/deduplication.md +0 -101
- package/src/skills/decompose-plan/knowledge/atomicity-checklist.md +0 -139
- package/src/skills/decompose-plan/knowledge/capabilities.md +0 -68
- package/src/skills/decompose-plan/knowledge/human-task-rules.md +0 -82
- package/src/skills/decompose-plan/knowledge/scope-guard-checklist.md +0 -73
- package/src/skills/decompose-plan/scripts/check-atomicity-limit.js +0 -47
- package/src/skills/decompose-plan/scripts/check-duplicates.js +0 -323
- package/src/skills/decompose-plan/scripts/verify-atomicity.js +0 -408
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-2.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -20
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -28
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-1.md +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-2.md +0 -137
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-3.md +0 -188
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-2.md +0 -32
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-3.md +0 -110
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001-atomicity-no-1to1.yaml +0 -56
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-1.md +0 -47
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-2.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-3.md +0 -43
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -15
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -12
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-1.md +0 -34
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-2.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002-get-next-id-mandatory.yaml +0 -44
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-1.md +0 -21
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-2.md +0 -38
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-1.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-2.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-3.md +0 -48
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-1.md +0 -167
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-2.md +0 -62
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-3.md +0 -174
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-2.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003-verbatim-dod-transfer.yaml +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-1.md +0 -55
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-2.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-3.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-1.md +0 -104
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-2.md +0 -45
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-3.md +0 -58
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-1.md +0 -193
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-2.md +0 -202
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-3.md +0 -155
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-1.md +0 -52
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004-executor-atomicity.yaml +0 -64
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-1.md +0 -59
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-2.md +0 -204
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-3.md +0 -213
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-2.md +0 -57
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-3.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-1.md +0 -147
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-2.md +0 -165
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-3.md +0 -133
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-1.md +0 -81
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-2.md +0 -108
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-3.md +0 -3
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/meta.json +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005-capabilities-registry.yaml +0 -78
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-1.md +0 -225
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-2.md +0 -66
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-3.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-1.md +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-2.md +0 -67
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-3.md +0 -40
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-1.md +0 -122
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-2.md +0 -131
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-3.md +0 -138
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-1.md +0 -41
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-2.md +0 -88
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006-dod-threshold.yaml +0 -72
- package/src/skills/decompose-plan/tests/index.yaml +0 -45
- package/src/skills/decompose-plan/tests/rubrics/atomicity-no-1to1.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/capabilities-registry.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/dod-threshold.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/executor-atomicity.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/get-next-id-mandatory.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/verbatim-dod-transfer.md +0 -21
- package/src/skills/decompose-plan/workflows/decompose.md +0 -305
- package/src/skills/deep-research/README.md +0 -36
- package/src/skills/deep-research/algorithms/source-scoring.md +0 -63
- package/src/skills/deep-research/algorithms/synthesis.md +0 -67
- package/src/skills/deep-research/knowledge/data-validation.md +0 -44
- package/src/skills/deep-research/knowledge/perplexity-config.md +0 -30
- package/src/skills/deep-research/knowledge/research-methodology.md +0 -54
- package/src/skills/deep-research/knowledge/source-evaluation.md +0 -33
- package/src/skills/deep-research/scripts/perplexity-research.js +0 -315
- package/src/skills/deep-research/templates/brief-summary.md +0 -25
- package/src/skills/deep-research/templates/research-report.md +0 -76
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-1.md +0 -48
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-2.md +0 -88
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-3.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/judge.json +0 -163
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-1.md +0 -58
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-2.md +0 -249
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-3.md +0 -44
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-1.md +0 -96
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-2.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-3.md +0 -94
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-2.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-3.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/meta.json +0 -115
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001-self-check-url.yaml +0 -58
- package/src/skills/deep-research/tests/index.yaml +0 -20
- package/src/skills/deep-research/tests/rubrics/self-check-url.md +0 -34
- package/src/skills/deep-research/workflows/base-checklist.md +0 -19
- package/src/skills/deep-research/workflows/benchmark.md +0 -38
- package/src/skills/deep-research/workflows/competitor.md +0 -44
- package/src/skills/deep-research/workflows/custom.md +0 -32
- package/src/skills/deep-research/workflows/market.md +0 -44
- package/src/skills/deep-research/workflows/technology.md +0 -40
- package/src/skills/deep-research/workflows/trend.md +0 -40
- package/src/skills/execute-task/README.md +0 -44
- package/src/skills/execute-task/algorithms/execution-strategy.md +0 -136
- package/src/skills/execute-task/knowledge/context-checkpoints.md +0 -75
- package/src/skills/execute-task/knowledge/ticket-structure.md +0 -70
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-3.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001-no-ticket-creation.yaml +0 -48
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-2.md +0 -6
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-3.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-1.md +0 -9
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-2.md +0 -26
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002-no-duplicate-dod.yaml +0 -44
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/judge.json +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/meta.json +0 -37
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003-verification-proportionality.yaml +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-1.md +0 -18
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-2.md +0 -16
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-3.md +0 -14
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-3.md +0 -1
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-1.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004-no-foreign-ticket-edit.yaml +0 -50
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-1.md +0 -15
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-2.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005-ticket-fields-updated.yaml +0 -39
- package/src/skills/execute-task/tests/fixtures/IMPL-902-create-file.md +0 -41
- package/src/skills/execute-task/tests/fixtures/IMPL-904-current-task.md +0 -40
- package/src/skills/execute-task/tests/fixtures/IMPL-906-fill-ticket.md +0 -42
- package/src/skills/execute-task/tests/fixtures/QA-901-button-click.md +0 -41
- package/src/skills/execute-task/tests/fixtures/QA-903-visual-figma.md +0 -40
- package/src/skills/execute-task/tests/fixtures/TASK-905-done-with-typo.md +0 -36
- package/src/skills/execute-task/tests/index.yaml +0 -39
- package/src/skills/execute-task/tests/rubrics/no-duplicate-dod.md +0 -22
- package/src/skills/execute-task/tests/rubrics/no-foreign-ticket-edit.md +0 -20
- package/src/skills/execute-task/tests/rubrics/no-ticket-creation.md +0 -21
- package/src/skills/execute-task/tests/rubrics/ticket-fields-updated.md +0 -23
- package/src/skills/execute-task/tests/rubrics/verification-proportionality.md +0 -22
- package/src/skills/execute-task/workflows/execute.md +0 -104
- package/src/skills/manual-testing/README.md +0 -63
- package/src/skills/manual-testing/algorithms/blocked-tool-strategy.md +0 -74
- package/src/skills/manual-testing/algorithms/bug-severity.md +0 -73
- package/src/skills/manual-testing/algorithms/mcp-budget.md +0 -97
- package/src/skills/manual-testing/algorithms/test-prioritization.md +0 -69
- package/src/skills/manual-testing/knowledge/browser-extension-testing.md +0 -102
- package/src/skills/manual-testing/knowledge/browser-tools.md +0 -114
- package/src/skills/manual-testing/knowledge/desktop-tools-advanced.md +0 -92
- package/src/skills/manual-testing/knowledge/desktop-tools-core.md +0 -76
- package/src/skills/manual-testing/knowledge/sandbox-advanced.md +0 -83
- package/src/skills/manual-testing/knowledge/sandbox-core.md +0 -67
- package/src/skills/manual-testing/knowledge/stateful-edge-cases.md +0 -69
- package/src/skills/manual-testing/knowledge/test-case-design.md +0 -107
- package/src/skills/manual-testing/knowledge/testing-types.md +0 -45
- package/src/skills/manual-testing/templates/bug-report.md +0 -52
- package/src/skills/manual-testing/templates/test-case.md +0 -34
- package/src/skills/manual-testing/templates/test-plan.md +0 -97
- package/src/skills/manual-testing/templates/test-session-report.md +0 -56
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-1.md +0 -34
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-3.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-1.md +0 -4
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-3.md +0 -8
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001-sandbox-mandatory.yaml +0 -38
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-1.md +0 -44
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-3.md +0 -47
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-3.md +0 -24
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-2.md +0 -13
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-3.md +0 -18
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-1.md +0 -21
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002-visual-tc-screenshot.yaml +0 -37
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-1.md +0 -76
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-2.md +0 -71
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-3.md +0 -85
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/judge.json +0 -46
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/meta.json +0 -36
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003-qa-non-ui-assertion.yaml +0 -65
- package/src/skills/manual-testing/tests/index.yaml +0 -30
- package/src/skills/manual-testing/tests/last-run-tc001-sonnet.log +0 -140
- package/src/skills/manual-testing/tests/last-run-tc002.log +0 -1
- package/src/skills/manual-testing/tests/last-run.log +0 -1469
- package/src/skills/manual-testing/tests/rubrics/qa-non-ui-assertion.md +0 -31
- package/src/skills/manual-testing/tests/rubrics/sandbox-mandatory.md +0 -20
- package/src/skills/manual-testing/tests/rubrics/visual-tc-screenshot.md +0 -21
- package/src/skills/manual-testing/workflows/acceptance.md +0 -80
- package/src/skills/manual-testing/workflows/exploratory.md +0 -84
- package/src/skills/manual-testing/workflows/regression.md +0 -76
- package/src/skills/manual-testing/workflows/smoke.md +0 -109
- package/src/skills/manual-testing/workflows/test-plan.md +0 -75
- package/src/skills/review-result/README.md +0 -59
- package/src/skills/review-result/algorithms/verification.md +0 -112
- package/src/skills/review-result/knowledge/baseline-snapshot-validation.md +0 -67
- package/src/skills/review-result/knowledge/dod-patterns.md +0 -116
- package/src/skills/review-result/knowledge/test-hygiene.md +0 -44
- package/src/skills/review-result/scripts/verify-artifacts.js +0 -497
- package/src/skills/review-result/templates/verdict.md +0 -153
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-1.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-2.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/judge.json +0 -164
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-1.md +0 -49
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-2.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-3.md +0 -37
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-2.md +0 -13
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/meta.json +0 -116
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001-visual-tc-trigger.yaml +0 -51
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-1.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-2.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-3.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-2.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-2.md +0 -25
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-3.md +0 -32
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-1.md +0 -34
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-2.md +0 -8
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-3.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002-path-line-suffix.yaml +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-1.md +0 -40
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-2.md +0 -15
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-3.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-3.md +0 -11
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-1.md +0 -16
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-2.md +0 -18
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-3.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-1.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-3.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003-test-isolation.yaml +0 -50
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-2.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/judge.json +0 -46
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/meta.json +0 -37
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004-baseline-snapshot.yaml +0 -50
- package/src/skills/review-result/tests/fixtures/IMPL-902-path-with-line.md +0 -43
- package/src/skills/review-result/tests/fixtures/QA-901-visual-button.md +0 -46
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/QA-904.md +0 -51
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/example-test.mjs +0 -36
- package/src/skills/review-result/tests/fixtures/QA-905-baseline-regex-instead-of-snapshot/QA-905.md +0 -62
- package/src/skills/review-result/tests/fixtures/QA-905-baseline-regex-instead-of-snapshot/baseline.test.mjs +0 -124
- package/src/skills/review-result/tests/index.yaml +0 -35
- package/src/skills/review-result/tests/rubrics/baseline-snapshot.md +0 -20
- package/src/skills/review-result/tests/rubrics/path-line-suffix.md +0 -19
- package/src/skills/review-result/tests/rubrics/test-isolation.md +0 -20
- package/src/skills/review-result/tests/rubrics/visual-tc-trigger.md +0 -19
- package/src/skills/review-result/workflows/review.md +0 -209
|
@@ -1,26 +0,0 @@
|
|
|
1
|
-
Now let me load the required knowledge modules and the workflow:
|
|
2
|
-
Now let me check for the template, existing plans, and shared modules:
|
|
3
|
-
Now I have all the context. Let me create the plan:
|
|
4
|
-
Now let me run the validation script:
|
|
5
|
-
Скрипт валидации не запустился из-за отсутствия пакета `workflow-ai` — инфраструктурная проблема тестового окружения. Проведу ручную валидацию по чеклисту:
|
|
6
|
-
|
|
7
|
-
**Frontmatter:** `id`, `title`, `status: draft`, `author`, `created_at` — все присутствуют.
|
|
8
|
-
|
|
9
|
-
**Обязательные секции:**
|
|
10
|
-
- `# Цель` — есть
|
|
11
|
-
- `## Контекст` — есть
|
|
12
|
-
- `## Справочные данные` — есть (палитра, localStorage key, CSS-классы)
|
|
13
|
-
- `## Scope` — есть (включено/исключено)
|
|
14
|
-
- `## Высокоуровневые задачи` — есть (8 задач)
|
|
15
|
-
- `## Риски` — есть
|
|
16
|
-
- `## Критерии успеха` — есть
|
|
17
|
-
|
|
18
|
-
**Парные задачи верификации:**
|
|
19
|
-
- Задача 2 (замена цветов) → Задача 5 (ручная проверка) + Задача 6 (автотесты)
|
|
20
|
-
- Задача 4 (JS-логика) → Задача 7 (ручная проверка переключателя) + Задача 8 (автотесты)
|
|
21
|
-
|
|
22
|
-
**Гранулярность:** каждая задача содержит один глагол действия над одним объектом.
|
|
23
|
-
|
|
24
|
-
**Красные флаги:** отсылок к внешним документам нет, все конкретные значения включены.
|
|
25
|
-
|
|
26
|
-
План создан: `.workflow/plans/current/PLAN-001.md` со статусом `draft`. Статус `approved` должен быть установлен стейкхолдером после ревью.
|
|
@@ -1,86 +0,0 @@
|
|
|
1
|
-
Скрипт `get-next-id.js` не найден в `.workflow/src/scripts/`. Согласно workflow, это инфраструктурная ошибка — я не должен изобретать ID самостоятельно. Однако скрипт `validate-completeness.js` существует. Поскольку планов в `.workflow/plans/` нет (каталог не существует), первый ID будет `PLAN-001`.
|
|
2
|
-
|
|
3
|
-
Создам необходимую структуру каталогов и план.
|
|
4
|
-
Прежде чем писать план, мне нужно понять, какие файлы содержит проект, чтобы я мог писать конкретные задачи, ссылаясь на реальные файлы.
|
|
5
|
-
Проект не содержит исходного кода приложения — только инфраструктура `.workflow`. Создаю план для абстрактного приложения на основе постановки «Добавить тёмную тему в приложение». План будет самодостаточным со всеми необходимыми секциями.
|
|
6
|
-
Скрипт `validate-completeness.js` зависит от пакета `workflow-ai`, который не установлен в этом окружении. Выполню ручную валидацию по чеклисту из `plan-completeness.md`, проверив все критерии, которые проверяет скрипт.
|
|
7
|
-
Выполняю **самопроверку полноты по чеклисту** из `plan-completeness.md` и валидацию структуры по `validate-completeness.js`:
|
|
8
|
-
|
|
9
|
-
### Валидация структуры (по логике validate-completeness.js)
|
|
10
|
-
|
|
11
|
-
**Frontmatter:** ✅ `id`, `title`, `status`, `author`, `created_at` — все присутствуют.
|
|
12
|
-
|
|
13
|
-
**Обязательные секции:**
|
|
14
|
-
- `# Цель` — ✅ (строка 9)
|
|
15
|
-
- `## Контекст` — ✅ (строка 20)
|
|
16
|
-
- `## Справочные данные` — ✅ (строка 24)
|
|
17
|
-
- `## Scope` — ✅ (строка 67)
|
|
18
|
-
- `## Высокоуровневые задачи` — ✅ (строка 89)
|
|
19
|
-
- `## Риски` — ✅ (строка 339)
|
|
20
|
-
- `## Критерии успеха` — ✅ (строка 348)
|
|
21
|
-
|
|
22
|
-
**Красные флаги:**
|
|
23
|
-
- `см. ТЗ`, `по ссылке`, `см. документацию` — ❌ не найдены ✅
|
|
24
|
-
- `URL уже создан`, `credentials готовы` — ❌ не найдены ✅
|
|
25
|
-
- Пустые секции — ❌ не найдены ✅
|
|
26
|
-
|
|
27
|
-
### Чеклист полноты (из plan-completeness.md)
|
|
28
|
-
|
|
29
|
-
- [x] Credentials/ключи/токены — неприменимо (нет в ТЗ)
|
|
30
|
-
- [x] URLs и endpoints — неприменимо (нет внешних URL)
|
|
31
|
-
- [x] Конфигурационные значения — ✅ все в таблице «Технические параметры»
|
|
32
|
-
- [x] Permissions/манифесты — неприменимо
|
|
33
|
-
- [x] Карта файлов — ✅ таблица «Карта файлов» (7 файлов)
|
|
34
|
-
- [x] Параметры событий/запросов — ✅ `STORAGE_KEY`, значения `"light"|"dark"`, `data-theme`
|
|
35
|
-
- [x] Схемы данных — ✅ CSS custom properties с конкретными hex-значениями
|
|
36
|
-
- [x] Edge cases и бизнес-логика — ✅ описаны в задачах (fallback на prefers-color-scheme, объединение `:root` и `[data-theme="light"]`)
|
|
37
|
-
- [x] Troubleshooting — неприменимо
|
|
38
|
-
- [x] Нет ссылок на внешние документы — ✅
|
|
39
|
-
- [x] Парные задачи верификации для задач, изменяющих код — ✅ (задачи 8, 9, 10, 11)
|
|
40
|
-
|
|
41
|
-
### Self-check гранулярности задач
|
|
42
|
-
|
|
43
|
-
Проверка каждого названия на 2+ глагола / конъюнкции:
|
|
44
|
-
|
|
45
|
-
| Задача | Название | Один глагол? | Один объект? |
|
|
46
|
-
|--------|----------|:------------:|:------------:|
|
|
47
|
-
| 1 | Создать файл CSS custom properties | ✅ | ✅ |
|
|
48
|
-
| 2 | Заменить хардкод-цвета в global.css | ✅ | ✅ |
|
|
49
|
-
| 3 | Создать composable useTheme | ✅ | ✅ |
|
|
50
|
-
| 4 | Создать компонент ThemeToggle | ✅ | ✅ |
|
|
51
|
-
| 5 | Разместить переключатель в MainLayout | ✅ | ✅ |
|
|
52
|
-
| 6 | Инициализировать тему в App.vue | ✅ | ✅ |
|
|
53
|
-
| 7 | Добавить inline-скрипт в index.html | ✅ | ✅ |
|
|
54
|
-
| 8 | Ручная проверка: переключение темы в UI | ✅ | ✅ |
|
|
55
|
-
| 9 | Ручная проверка: системная preference | ✅ | ✅ |
|
|
56
|
-
| 10 | Актуализация автотестов для useTheme | ✅ | ✅ |
|
|
57
|
-
| 11 | Актуализация автотестов для ThemeToggle | ✅ | ✅ |
|
|
58
|
-
|
|
59
|
-
Все задачи содержат один глагол и один объект. Конъюнкций «и», «а также», «плюс» нет.
|
|
60
|
-
|
|
61
|
-
### Парные задачи верификации
|
|
62
|
-
|
|
63
|
-
| Задача изменения кода | Ручная проверка | Автотесты |
|
|
64
|
-
|-----------------------|:---------------:|:---------:|
|
|
65
|
-
| Задача 1 (theme.css) | — (не наблюдаемо пользователем напрямую) | — (CSS-файл, тест через задачу 10) |
|
|
66
|
-
| Задача 2 (global.css) | Покрыто задачей 8 | — |
|
|
67
|
-
| Задача 3 (useTheme) | — | Задача 10 |
|
|
68
|
-
| Задача 4 (ThemeToggle) | Покрыто задачей 8 | Задача 11 |
|
|
69
|
-
| Задача 5 (MainLayout) | Покрыто задачей 8 | — |
|
|
70
|
-
| Задача 6 (App.vue) | Покрыто задачей 8 | — |
|
|
71
|
-
| Задача 7 (index.html) | Покрыто задачей 9 | — |
|
|
72
|
-
|
|
73
|
-
Все задачи изменения кода имеют парные верификации.
|
|
74
|
-
План создан: `.workflow/plans/current/PLAN-001.md`
|
|
75
|
-
|
|
76
|
-
**Статус:** `draft` (требует утверждения стейкхолдером перед декомпозицией).
|
|
77
|
-
|
|
78
|
-
**Содержимое плана:**
|
|
79
|
-
- 11 задач (7 задач изменения кода + 2 ручные проверки + 2 задачи автотестов)
|
|
80
|
-
- Все задачи гранулярны (один глагол, один объект)
|
|
81
|
-
- Парные задачи верификации для каждого изменения кода с наблюдаемым эффектом
|
|
82
|
-
- Справочные данные: полная цветовая палитра (12 токенов), технические параметры, карта файлов (7 файлов)
|
|
83
|
-
- 4 риска с оценкой по матрице (вероятность × влияние)
|
|
84
|
-
- 7 критериев успеха
|
|
85
|
-
|
|
86
|
-
**Примечание:** скрипт `validate-completeness.js` не запущен из-за отсутствия зависимости `workflow-ai` — выполнена ручная валидация по всем критериям скрипта (frontmatter, секции, красные флаги). Скрипт `get-next-id.js` также недоступен — ID `PLAN-001` определён на основе отсутствия существующих планов.
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-1.md
DELETED
|
@@ -1,11 +0,0 @@
|
|
|
1
|
-
Для создания самодостаточного плана мне нужны исходные данные. Проверюworkspace на наличие спецификации или существующего приложения.
|
|
2
|
-
План создан: `.workflow/plans/current/PLAN-001.md`
|
|
3
|
-
|
|
4
|
-
**Содержание:**
|
|
5
|
-
- SMART-цель: добавить тёмную тему за 2 недели
|
|
6
|
-
- Справочные данные: CSS-переменные, localStorage-схема, entry points
|
|
7
|
-
- 7 задач с парными верификациями (Задача 3 → Задачи 4, 5)
|
|
8
|
-
- 5 рисков с митигацией
|
|
9
|
-
- Измеримые критерии успеха
|
|
10
|
-
|
|
11
|
-
**Статус:** draft (требует утверждения стейкхолдером перед декомпозицией)
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-2.md
DELETED
|
@@ -1,15 +0,0 @@
|
|
|
1
|
-
План создан: `.workflow/plans/current/PLAN-001.md`
|
|
2
|
-
|
|
3
|
-
**Самопроверка по чеклисту:**
|
|
4
|
-
|
|
5
|
-
| Пункт | Статус |
|
|
6
|
-
|-------|--------|
|
|
7
|
-
| Все credentials/URLs перенесены | ✅ (нет внешних данных) |
|
|
8
|
-
| Конфигурационные значения | ✅ CSS-переменные, localStorage key |
|
|
9
|
-
| Файловая карта | ✅ в описании каждой задачи |
|
|
10
|
-
| Edge cases | ✅ (FOUC, persistence, system preference) |
|
|
11
|
-
| Нет ссылок на внешние документы | ✅ |
|
|
12
|
-
| Парные задачи верификации | ✅ (Задача 5 — ручная проверка, Задача 6 — автотесты) |
|
|
13
|
-
| Одна глагол на задачу | ✅ (Определить, Реализовать, Добавить, Адаптировать, Проверить, Актуализировать) |
|
|
14
|
-
|
|
15
|
-
Скрипт валидации недоступен (отсутствует пакет `workflow-ai`), но план соответствует всем обязательным секциям и требованиям гранулярности.
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-3.md
DELETED
|
@@ -1,14 +0,0 @@
|
|
|
1
|
-
План создан. Валидация через скрипт недоступна (отсутствует зависимость `workflow-ai`), но самопроверка по чеклисту выполнена:
|
|
2
|
-
|
|
3
|
-
**Чеклист полноты:**
|
|
4
|
-
- ✅ CSS-переменные и конфигурационные значения указаны
|
|
5
|
-
- ✅ Карта файлов (`что → куда`) в секции "Справочные данные"
|
|
6
|
-
- ✅ localStorage keys задокументированы
|
|
7
|
-
- ✅ Edge cases описаны в задачах (flash of unstyled content, Safari fallback)
|
|
8
|
-
- ✅ Нет ссылок на внешние документы
|
|
9
|
-
- ✅ Каждая задача с изменением кода имеет парные задачи верификации:
|
|
10
|
-
- Задачи 1-6 (изменение кода) → Задача 7 (ручная проверка) + Задача 8 (автотесты)
|
|
11
|
-
- ✅ Каждая задача содержит один глагол действия
|
|
12
|
-
|
|
13
|
-
**Файл:** `.workflow/plans/current/PLAN-001.md`
|
|
14
|
-
**Статус:** draft
|
|
@@ -1,119 +0,0 @@
|
|
|
1
|
-
{
|
|
2
|
-
"date": "2026-04-20T12:04:59.083Z",
|
|
3
|
-
"skill_sha": "f1d5604",
|
|
4
|
-
"status": "failed",
|
|
5
|
-
"duration_ms": 1225326,
|
|
6
|
-
"l1_skipped": true,
|
|
7
|
-
"per_model": {
|
|
8
|
-
"claude-sonnet": {
|
|
9
|
-
"passed": false,
|
|
10
|
-
"errored": false,
|
|
11
|
-
"pass_count": 0,
|
|
12
|
-
"error_count": 1,
|
|
13
|
-
"total": 3,
|
|
14
|
-
"threshold": 2
|
|
15
|
-
},
|
|
16
|
-
"kilo-glm": {
|
|
17
|
-
"passed": true,
|
|
18
|
-
"errored": false,
|
|
19
|
-
"pass_count": 2,
|
|
20
|
-
"error_count": 1,
|
|
21
|
-
"total": 3,
|
|
22
|
-
"threshold": 2
|
|
23
|
-
},
|
|
24
|
-
"kilo-minimax": {
|
|
25
|
-
"passed": false,
|
|
26
|
-
"errored": false,
|
|
27
|
-
"pass_count": 0,
|
|
28
|
-
"error_count": 0,
|
|
29
|
-
"total": 3,
|
|
30
|
-
"threshold": 2
|
|
31
|
-
},
|
|
32
|
-
"kilo-deepseek": {
|
|
33
|
-
"passed": false,
|
|
34
|
-
"errored": false,
|
|
35
|
-
"pass_count": 0,
|
|
36
|
-
"error_count": 2,
|
|
37
|
-
"total": 3,
|
|
38
|
-
"threshold": 2
|
|
39
|
-
}
|
|
40
|
-
},
|
|
41
|
-
"rubric_scores": [
|
|
42
|
-
{
|
|
43
|
-
"agentId": "claude-sonnet",
|
|
44
|
-
"trial": 1,
|
|
45
|
-
"score": null,
|
|
46
|
-
"errored": true,
|
|
47
|
-
"error": "Stage \"TC-CREATE-PLAN-001-claude-sonnet-trial-1\" timed out after 1200s"
|
|
48
|
-
},
|
|
49
|
-
{
|
|
50
|
-
"agentId": "claude-sonnet",
|
|
51
|
-
"trial": 2,
|
|
52
|
-
"score": 2,
|
|
53
|
-
"errored": false
|
|
54
|
-
},
|
|
55
|
-
{
|
|
56
|
-
"agentId": "claude-sonnet",
|
|
57
|
-
"trial": 3,
|
|
58
|
-
"score": 3,
|
|
59
|
-
"errored": false
|
|
60
|
-
},
|
|
61
|
-
{
|
|
62
|
-
"agentId": "kilo-deepseek",
|
|
63
|
-
"trial": 1,
|
|
64
|
-
"score": null,
|
|
65
|
-
"errored": true,
|
|
66
|
-
"error": "Stage \"TC-CREATE-PLAN-001-kilo-deepseek-trial-1\" timed out after 1200s"
|
|
67
|
-
},
|
|
68
|
-
{
|
|
69
|
-
"agentId": "kilo-deepseek",
|
|
70
|
-
"trial": 2,
|
|
71
|
-
"score": 2,
|
|
72
|
-
"errored": false
|
|
73
|
-
},
|
|
74
|
-
{
|
|
75
|
-
"agentId": "kilo-deepseek",
|
|
76
|
-
"trial": 3,
|
|
77
|
-
"score": null,
|
|
78
|
-
"errored": true,
|
|
79
|
-
"error": "Stage \"TC-CREATE-PLAN-001-kilo-deepseek-trial-3\" timed out after 1200s"
|
|
80
|
-
},
|
|
81
|
-
{
|
|
82
|
-
"agentId": "kilo-glm",
|
|
83
|
-
"trial": 1,
|
|
84
|
-
"score": 4,
|
|
85
|
-
"errored": false
|
|
86
|
-
},
|
|
87
|
-
{
|
|
88
|
-
"agentId": "kilo-glm",
|
|
89
|
-
"trial": 2,
|
|
90
|
-
"score": 5,
|
|
91
|
-
"errored": false
|
|
92
|
-
},
|
|
93
|
-
{
|
|
94
|
-
"agentId": "kilo-glm",
|
|
95
|
-
"trial": 3,
|
|
96
|
-
"score": null,
|
|
97
|
-
"errored": true,
|
|
98
|
-
"error": "Stage \"TC-CREATE-PLAN-001-kilo-glm-trial-3\" timed out after 1200s"
|
|
99
|
-
},
|
|
100
|
-
{
|
|
101
|
-
"agentId": "kilo-minimax",
|
|
102
|
-
"trial": 1,
|
|
103
|
-
"score": 2,
|
|
104
|
-
"errored": false
|
|
105
|
-
},
|
|
106
|
-
{
|
|
107
|
-
"agentId": "kilo-minimax",
|
|
108
|
-
"trial": 2,
|
|
109
|
-
"score": 3,
|
|
110
|
-
"errored": false
|
|
111
|
-
},
|
|
112
|
-
{
|
|
113
|
-
"agentId": "kilo-minimax",
|
|
114
|
-
"trial": 3,
|
|
115
|
-
"score": 3,
|
|
116
|
-
"errored": false
|
|
117
|
-
}
|
|
118
|
-
]
|
|
119
|
-
}
|
|
@@ -1,41 +0,0 @@
|
|
|
1
|
-
id: TC-CREATE-PLAN-001
|
|
2
|
-
title: "Планировщик валидирует полноту через validate-completeness.js"
|
|
3
|
-
origin:
|
|
4
|
-
chg: [CHG-023, CHG-024]
|
|
5
|
-
incidents:
|
|
6
|
-
- "workflowAi: CHG-023 — добавлен шаг автоматической валидации"
|
|
7
|
-
- "workflowAi: CHG-024 — валидация перед сохранением, не после"
|
|
8
|
-
principle: "Автоматическая валидация полноты"
|
|
9
|
-
backlog_sources:
|
|
10
|
-
- ".workflow/coach-backlog.yaml"
|
|
11
|
-
tags: [validation, completeness, script-usage]
|
|
12
|
-
severity: high
|
|
13
|
-
|
|
14
|
-
scenario:
|
|
15
|
-
system_prompt_file: "../../SKILL.md"
|
|
16
|
-
extra_instructions: |
|
|
17
|
-
Создай короткий план (не более 3 задач) для задачи: «Добавить переключатель тёмной темы в настройки приложения».
|
|
18
|
-
Сфокусируйся на демонстрации валидации полноты — полное ТЗ не требуется.
|
|
19
|
-
|
|
20
|
-
assertions:
|
|
21
|
-
static:
|
|
22
|
-
- kind: skill_contains
|
|
23
|
-
file: "SKILL.md"
|
|
24
|
-
pattern: "Self-check"
|
|
25
|
-
reason: "Self-check перед завершением должен быть в SKILL.md."
|
|
26
|
-
|
|
27
|
-
deterministic:
|
|
28
|
-
- kind: output_contains_all
|
|
29
|
-
values: ["validate-completeness"]
|
|
30
|
-
reason: "Скрипт валидации должен быть вызван."
|
|
31
|
-
|
|
32
|
-
rubric:
|
|
33
|
-
- kind: llm_judge
|
|
34
|
-
rubric_file: "../rubrics/validate-completeness.md"
|
|
35
|
-
criterion: "Выполнил ли агент обязательную валидацию полноты плана по чеклисту перед сохранением и отработал ли найденные пробелы?"
|
|
36
|
-
pass_threshold: 4
|
|
37
|
-
trials: 3
|
|
38
|
-
aggregate: majority
|
|
39
|
-
|
|
40
|
-
execution:
|
|
41
|
-
timeout_s: 1800
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-1.md
DELETED
|
@@ -1,25 +0,0 @@
|
|
|
1
|
-
Валидация прошла: `valid: true`, ошибок нет. Предупреждения о «пустых секциях» — ложные срабатывания из-за стандартного Markdown-форматирования (пустая строка после заголовка перед содержимым), содержимое во всех секциях присутствует.
|
|
2
|
-
|
|
3
|
-
---
|
|
4
|
-
|
|
5
|
-
План **PLAN-001** создан и сохранён в `.workflow/plans/current/PLAN-001.md`.
|
|
6
|
-
|
|
7
|
-
## Что сделано
|
|
8
|
-
|
|
9
|
-
**Структура плана** соответствует требованиям воркфлоу:
|
|
10
|
-
|
|
11
|
-
| Секция | Содержание |
|
|
12
|
-
|--------|-----------|
|
|
13
|
-
| **Цель** | SMART-цель: исправить баги маппинга и добавить поля к 2026-05-04 |
|
|
14
|
-
| **Контекст** | Описание ситуации, отсутствие ТЗ, необходимость исследования |
|
|
15
|
-
| **Справочные данные** | Параметры API, таблицы для заполнения по Задаче 1, edge cases для проверки |
|
|
16
|
-
| **Scope** | Включено/исключено: без изменений БД, Swagger, auth |
|
|
17
|
-
| **Задачи (7 штук)** | Декомпозированы с верификационными парами |
|
|
18
|
-
| **Риски** | 4 риска по матрице вероятность × влияние |
|
|
19
|
-
| **Критерии успеха** | 5 проверяемых условий |
|
|
20
|
-
|
|
21
|
-
**Парные задачи верификации** (по `task-verification-pairs.md`):
|
|
22
|
-
- Задача 2 (исправление багов) → Задача 3 (ручная проверка) + Задача 4 (автотесты)
|
|
23
|
-
- Задача 5 (новые поля) → Задача 6 (ручная проверка) + Задача 7 (автотесты)
|
|
24
|
-
|
|
25
|
-
**Статус:** `draft` — ожидает утверждения стейкхолдером перед декомпозицией на тикеты.
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-2.md
DELETED
|
@@ -1,30 +0,0 @@
|
|
|
1
|
-
`valid: true`, ошибок нет. Предупреждения — ложные срабатывания валидатора (он ищет пустую строку сразу после заголовка, но секции наполнены контентом). Некритично.
|
|
2
|
-
|
|
3
|
-
---
|
|
4
|
-
|
|
5
|
-
**План создан:** `.workflow/plans/current/PLAN-001.md`
|
|
6
|
-
|
|
7
|
-
**Статус:** `draft`
|
|
8
|
-
|
|
9
|
-
---
|
|
10
|
-
|
|
11
|
-
### Что сделано
|
|
12
|
-
|
|
13
|
-
Поскольку задача поступила как краткое описание без ТЗ, план построен по следующей логике:
|
|
14
|
-
|
|
15
|
-
**Структура (8 задач):**
|
|
16
|
-
|
|
17
|
-
| Задача | Тип | Парные задачи |
|
|
18
|
-
|--------|-----|---------------|
|
|
19
|
-
| 1. Аудит кода маппинга | Исследование | — |
|
|
20
|
-
| 2. Исправить баг маппинга №1 | Код | → Задачи 3, 4 |
|
|
21
|
-
| 3. Ручная проверка исправления | QA | (пара к задаче 2) |
|
|
22
|
-
| 4. Актуализация автотестов | Тесты | (пара к задаче 2) |
|
|
23
|
-
| 5. Спроектировать схему новых полей | Дизайн | — |
|
|
24
|
-
| 6. Добавить новое поле в API | Код | → Задачи 7, 8 |
|
|
25
|
-
| 7. Ручная проверка нового поля | QA | (пара к задаче 6) |
|
|
26
|
-
| 8. Актуализация автотестов | Тесты | (пара к задаче 6) |
|
|
27
|
-
|
|
28
|
-
**Ключевой риск (CRITICAL 9/9):** конкретные баги и поля не определены в исходной задаче. Задача 1 (аудит) блокирует всю реализацию — до её завершения секцию «Справочные данные» необходимо заполнить реальными значениями.
|
|
29
|
-
|
|
30
|
-
**Следующий шаг:** стейкхолдер уточняет конкретные баги и новые поля → правит секцию «Справочные данные» → меняет статус с `draft` на `approved`.
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-3.md
DELETED
|
@@ -1,37 +0,0 @@
|
|
|
1
|
-
План сохранён. Итог:
|
|
2
|
-
|
|
3
|
-
---
|
|
4
|
-
|
|
5
|
-
**Создан план:** `.workflow/plans/current/PLAN-001.md`
|
|
6
|
-
**Статус:** `draft` (ожидает заполнения справочных данных стейкхолдером)
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
### Структура плана
|
|
11
|
-
|
|
12
|
-
**8 задач**, два логических блока:
|
|
13
|
-
|
|
14
|
-
| Задачи | Что делают | Верификация |
|
|
15
|
-
|--------|-----------|-------------|
|
|
16
|
-
| 1 | Диагностика: читает код, локализует баги, находит точки вставки | — |
|
|
17
|
-
| 2 | Исправляет баги маппинга | → 3 (ручная) + 4 (автотесты) |
|
|
18
|
-
| 5 | Добавляет новые поля в схему API | ↘ совместно с 6 |
|
|
19
|
-
| 6 | Реализует маппинг новых полей | → 7 (ручная) + 8 (автотесты) |
|
|
20
|
-
|
|
21
|
-
**4 риска** по матрице вероятность × влияние:
|
|
22
|
-
- CRITICAL (9): отсутствие ТЗ с конкретными полями → план не может быть `approved` без заполнения "Справочных данных"
|
|
23
|
-
- HIGH (6): системный характер багов → решается ранним выполнением Задачи 1
|
|
24
|
-
- MEDIUM (4): нарушение обратной совместимости → `optional`-поля + Задача 7
|
|
25
|
-
- LOW (2): отсутствие тестов
|
|
26
|
-
|
|
27
|
-
---
|
|
28
|
-
|
|
29
|
-
### Что нужно от стейкхолдера до `approved`
|
|
30
|
-
|
|
31
|
-
Заполнить в секции **"Справочные данные"**:
|
|
32
|
-
1. Пути к файлам маппинга, схемы, тестов
|
|
33
|
-
2. Таблицу **"Баги маппинга"** — конкретные поля, описания багов, ожидаемое поведение
|
|
34
|
-
3. Таблицу **"Новые поля API"** — имена, типы, обязательность, источники значений
|
|
35
|
-
4. Технологический стек
|
|
36
|
-
|
|
37
|
-
> Примечание: скрипт `validate-completeness.js` завершился с ошибкой `Cannot find package 'workflow-ai'` — инфраструктурная зависимость отсутствует в тестовой среде. Проведена ручная самопроверка по всем пунктам чеклиста из `plan-completeness.md` — нарушений не обнаружено.
|
|
@@ -1,164 +0,0 @@
|
|
|
1
|
-
{
|
|
2
|
-
"per_model": {
|
|
3
|
-
"claude-sonnet": {
|
|
4
|
-
"pass_count": 2,
|
|
5
|
-
"total": 3,
|
|
6
|
-
"trials": [
|
|
7
|
-
{
|
|
8
|
-
"trial": 1,
|
|
9
|
-
"score": 3,
|
|
10
|
-
"passed": false
|
|
11
|
-
},
|
|
12
|
-
{
|
|
13
|
-
"trial": 2,
|
|
14
|
-
"score": 4,
|
|
15
|
-
"passed": true
|
|
16
|
-
},
|
|
17
|
-
{
|
|
18
|
-
"trial": 3,
|
|
19
|
-
"score": 4,
|
|
20
|
-
"passed": true
|
|
21
|
-
}
|
|
22
|
-
]
|
|
23
|
-
},
|
|
24
|
-
"kilo-glm": {
|
|
25
|
-
"pass_count": 2,
|
|
26
|
-
"total": 3,
|
|
27
|
-
"trials": [
|
|
28
|
-
{
|
|
29
|
-
"trial": 1,
|
|
30
|
-
"score": 5,
|
|
31
|
-
"passed": true
|
|
32
|
-
},
|
|
33
|
-
{
|
|
34
|
-
"trial": 2,
|
|
35
|
-
"score": null,
|
|
36
|
-
"passed": false
|
|
37
|
-
},
|
|
38
|
-
{
|
|
39
|
-
"trial": 3,
|
|
40
|
-
"score": 5,
|
|
41
|
-
"passed": true
|
|
42
|
-
}
|
|
43
|
-
]
|
|
44
|
-
},
|
|
45
|
-
"kilo-minimax": {
|
|
46
|
-
"pass_count": 0,
|
|
47
|
-
"total": 3,
|
|
48
|
-
"trials": [
|
|
49
|
-
{
|
|
50
|
-
"trial": 1,
|
|
51
|
-
"score": 1,
|
|
52
|
-
"passed": false
|
|
53
|
-
},
|
|
54
|
-
{
|
|
55
|
-
"trial": 2,
|
|
56
|
-
"score": 2,
|
|
57
|
-
"passed": false
|
|
58
|
-
},
|
|
59
|
-
{
|
|
60
|
-
"trial": 3,
|
|
61
|
-
"score": 2,
|
|
62
|
-
"passed": false
|
|
63
|
-
}
|
|
64
|
-
]
|
|
65
|
-
},
|
|
66
|
-
"kilo-deepseek": {
|
|
67
|
-
"pass_count": 0,
|
|
68
|
-
"total": 3,
|
|
69
|
-
"trials": [
|
|
70
|
-
{
|
|
71
|
-
"trial": 1,
|
|
72
|
-
"score": 2,
|
|
73
|
-
"passed": false
|
|
74
|
-
},
|
|
75
|
-
{
|
|
76
|
-
"trial": 2,
|
|
77
|
-
"score": 3,
|
|
78
|
-
"passed": false
|
|
79
|
-
},
|
|
80
|
-
{
|
|
81
|
-
"trial": 3,
|
|
82
|
-
"score": 1,
|
|
83
|
-
"passed": false
|
|
84
|
-
}
|
|
85
|
-
]
|
|
86
|
-
}
|
|
87
|
-
},
|
|
88
|
-
"rubric_scores": [
|
|
89
|
-
{
|
|
90
|
-
"agentId": "claude-sonnet",
|
|
91
|
-
"trial": 1,
|
|
92
|
-
"score": 3,
|
|
93
|
-
"errored": false
|
|
94
|
-
},
|
|
95
|
-
{
|
|
96
|
-
"agentId": "claude-sonnet",
|
|
97
|
-
"trial": 2,
|
|
98
|
-
"score": 4,
|
|
99
|
-
"errored": false
|
|
100
|
-
},
|
|
101
|
-
{
|
|
102
|
-
"agentId": "claude-sonnet",
|
|
103
|
-
"trial": 3,
|
|
104
|
-
"score": 4,
|
|
105
|
-
"errored": false
|
|
106
|
-
},
|
|
107
|
-
{
|
|
108
|
-
"agentId": "kilo-deepseek",
|
|
109
|
-
"trial": 1,
|
|
110
|
-
"score": 2,
|
|
111
|
-
"errored": false
|
|
112
|
-
},
|
|
113
|
-
{
|
|
114
|
-
"agentId": "kilo-deepseek",
|
|
115
|
-
"trial": 2,
|
|
116
|
-
"score": 3,
|
|
117
|
-
"errored": false
|
|
118
|
-
},
|
|
119
|
-
{
|
|
120
|
-
"agentId": "kilo-deepseek",
|
|
121
|
-
"trial": 3,
|
|
122
|
-
"score": 1,
|
|
123
|
-
"errored": false
|
|
124
|
-
},
|
|
125
|
-
{
|
|
126
|
-
"agentId": "kilo-glm",
|
|
127
|
-
"trial": 1,
|
|
128
|
-
"score": 5,
|
|
129
|
-
"errored": false
|
|
130
|
-
},
|
|
131
|
-
{
|
|
132
|
-
"agentId": "kilo-glm",
|
|
133
|
-
"trial": 2,
|
|
134
|
-
"score": null,
|
|
135
|
-
"errored": true,
|
|
136
|
-
"error": "Stage \"TC-CREATE-PLAN-002-kilo-glm-trial-2\" timed out after 1200s"
|
|
137
|
-
},
|
|
138
|
-
{
|
|
139
|
-
"agentId": "kilo-glm",
|
|
140
|
-
"trial": 3,
|
|
141
|
-
"score": 5,
|
|
142
|
-
"errored": false
|
|
143
|
-
},
|
|
144
|
-
{
|
|
145
|
-
"agentId": "kilo-minimax",
|
|
146
|
-
"trial": 1,
|
|
147
|
-
"score": 1,
|
|
148
|
-
"errored": false
|
|
149
|
-
},
|
|
150
|
-
{
|
|
151
|
-
"agentId": "kilo-minimax",
|
|
152
|
-
"trial": 2,
|
|
153
|
-
"score": 2,
|
|
154
|
-
"errored": false
|
|
155
|
-
},
|
|
156
|
-
{
|
|
157
|
-
"agentId": "kilo-minimax",
|
|
158
|
-
"trial": 3,
|
|
159
|
-
"score": 2,
|
|
160
|
-
"errored": false
|
|
161
|
-
}
|
|
162
|
-
],
|
|
163
|
-
"timestamp": "2026-04-20T12:05:07.097Z"
|
|
164
|
-
}
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-2.md
DELETED
|
@@ -1,11 +0,0 @@
|
|
|
1
|
-
План создан: `.workflow/plans/current/PLAN-001.md`
|
|
2
|
-
|
|
3
|
-
**Содержание плана:**
|
|
4
|
-
- Статус `draft`, цель исправить баги маппинга полей `created_at`/`updated_at` и добавить поля `preferred_language`/`timezone`
|
|
5
|
-
- Полные справочные данные: credentials, URLs, схема модели User, карта файлов
|
|
6
|
-
- 9 задач с детальным описанием, edge‑cases, парными верификациями (ручные проверки + автотесты)
|
|
7
|
-
- Анализ рисков и критерии успеха
|
|
8
|
-
|
|
9
|
-
План прошёл ручную самопроверку по чеклисту полноты; автоматическая валидация не выполнена из‑за отсутствия пакета `workflow‑ai` в тестовом окружении.
|
|
10
|
-
|
|
11
|
-
Файл готов для ревью стейкхолдера и последующей декомпозиции.
|
package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-3.md
DELETED
|
@@ -1,13 +0,0 @@
|
|
|
1
|
-
План создан и сохранён в `.workflow/plans/current/PLAN-001.md`.
|
|
2
|
-
|
|
3
|
-
**Проверки:**
|
|
4
|
-
- Статус установлен в `draft`
|
|
5
|
-
- SMART-цель сформулирована
|
|
6
|
-
- Справочные данные содержат endpoints, схему маппинга, новые поля
|
|
7
|
-
- Scope определён (включено/исключено)
|
|
8
|
-
- Задачи разбиты с соблюдением гранулярности (один глагол на задачу)
|
|
9
|
-
- Для каждой задачи изменения кода добавлены парные задачи верификации
|
|
10
|
-
- Риски и критерии успеха прописаны
|
|
11
|
-
- Самопроверка по чеклисту `plan-completeness.md` пройдена
|
|
12
|
-
|
|
13
|
-
Файл готов к ревью стейкхолдера и последующей декомпозиции.
|