workflow-ai 1.1.0 → 1.3.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +61 -0
- package/README.md +37 -377
- package/configs/pipeline.yaml +113 -4
- package/package.json +54 -44
- package/src/lib/operations/tickets.mjs +305 -207
- package/src/lib/utils.mjs +286 -286
- package/src/runner.mjs +316 -35
- package/src/scripts/check-conditions.js +2 -2
- package/src/scripts/check-relevance.js +3 -1
- package/src/scripts/get-next-id.js +144 -41
- package/src/scripts/mark-blocked.js +160 -0
- package/src/scripts/move-ticket.js +305 -83
- package/src/scripts/pick-next-task.js +782 -93
- package/src/skills/__test-cal-001-1777553217513/SKILL.md +2 -0
- package/src/skills/__test-runner-1777553217483/SKILL.md +5 -0
- package/src/skills/coach/SKILL.md +2 -2
- package/src/skills/execute-task/SKILL.md +1 -1
- package/src/skills/manual-testing/SKILL.md +2 -0
- package/src/skills/review-result/SKILL.md +23 -1
- package/src/scripts/tests/timeout-cascade.test.js +0 -28
- package/src/skills/analyze-report/README.md +0 -44
- package/src/skills/analyze-report/algorithms/progress-assessment.md +0 -108
- package/src/skills/analyze-report/knowledge/analysis-frameworks.md +0 -66
- package/src/skills/analyze-report/knowledge/report-structure.md +0 -61
- package/src/skills/analyze-report/scripts/calc-plan-metrics.js +0 -234
- package/src/skills/analyze-report/templates/analysis-report.md +0 -80
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-2.md +0 -98
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-3.md +0 -99
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -88
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -100
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-1.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-2.md +0 -64
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-3.md +0 -110
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-1.md +0 -74
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-2.md +0 -38
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-3.md +0 -61
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001-evidence-from-log.yaml +0 -60
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-1.md +0 -90
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-2.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-3.md +0 -5
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-1.md +0 -84
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-2.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-3.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-1.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-2.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-3.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-1.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-2.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-3.md +0 -86
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002-result-block-format.yaml +0 -44
- package/src/skills/analyze-report/tests/fixtures/REPORT-002-incorrect-attribution.md +0 -27
- package/src/skills/analyze-report/tests/fixtures/pipeline-2026-04-06_qa-001-skip.log +0 -32
- package/src/skills/analyze-report/tests/index.yaml +0 -25
- package/src/skills/analyze-report/tests/rubrics/evidence-from-log.md +0 -22
- package/src/skills/analyze-report/tests/rubrics/result-block-format.md +0 -22
- package/src/skills/analyze-report/workflows/progress.md +0 -158
- package/src/skills/analyze-report/workflows/retrospective.md +0 -143
- package/src/skills/coach/README.md +0 -43
- package/src/skills/coach/SKILL.md.legacy +0 -157
- package/src/skills/coach/algorithms/gap-analysis.md +0 -69
- package/src/skills/coach/algorithms/improvement-prioritization.md +0 -62
- package/src/skills/coach/algorithms/skill-scoring.md +0 -80
- package/src/skills/coach/knowledge/audit-applied-changes-clean.txt +0 -11
- package/src/skills/coach/knowledge/backlog-management.md +0 -67
- package/src/skills/coach/knowledge/backlog-management.md.legacy +0 -90
- package/src/skills/coach/knowledge/common-antipatterns.md +0 -76
- package/src/skills/coach/knowledge/prompt-engineering.md +0 -45
- package/src/skills/coach/knowledge/shared-knowledge-guide.md +0 -44
- package/src/skills/coach/knowledge/skill-anatomy.md +0 -49
- package/src/skills/coach/knowledge/test-authorship.md +0 -141
- package/src/skills/coach/templates/audit-report.md +0 -39
- package/src/skills/coach/templates/coach-backlog-init.yaml +0 -14
- package/src/skills/coach/templates/coach-backlog-init.yaml.legacy +0 -10
- package/src/skills/coach/templates/improvement-plan.md +0 -42
- package/src/skills/coach/templates/new-skill.md +0 -95
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-1.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-2.md +0 -65
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-3.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-3.md +0 -75
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-1.md +0 -81
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-2.md +0 -101
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-3.md +0 -91
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-1.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-2.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-3.md +0 -55
- package/src/skills/coach/tests/cases/TC-COACH-001/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-001-evidence-based-temporal-diagram.yaml +0 -53
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-2.md +0 -50
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-3.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-002/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-2.md +0 -37
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-3.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-1.md +0 -23
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-2.md +0 -29
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-1.md +0 -13
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-2.md +0 -19
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-3.md +0 -33
- package/src/skills/coach/tests/cases/TC-COACH-002/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-002-root-cause-first.yaml +0 -57
- package/src/skills/coach/tests/fixtures/pipeline-2026-04-06_id-collision.log +0 -77
- package/src/skills/coach/tests/index.yaml +0 -29
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-bad.md +0 -13
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-good.md +0 -29
- package/src/skills/coach/tests/rubrics/evidence-based.md +0 -26
- package/src/skills/coach/tests/rubrics/root-cause-first.md +0 -21
- package/src/skills/coach/workflows/analyze.md +0 -79
- package/src/skills/coach/workflows/analyze.md.legacy +0 -64
- package/src/skills/coach/workflows/audit.md +0 -74
- package/src/skills/coach/workflows/audit.md.legacy +0 -59
- package/src/skills/coach/workflows/create.md +0 -80
- package/src/skills/coach/workflows/create.md.legacy +0 -67
- package/src/skills/coach/workflows/improve.md +0 -71
- package/src/skills/coach/workflows/improve.md.legacy +0 -60
- package/src/skills/coach/workflows/research.md +0 -55
- package/src/skills/coach/workflows/review.md +0 -52
- package/src/skills/coach/workflows/review.md.legacy +0 -48
- package/src/skills/coach/workflows/test.md +0 -97
- package/src/skills/create-plan/README.md +0 -39
- package/src/skills/create-plan/algorithms/risk-assessment.md +0 -73
- package/src/skills/create-plan/knowledge/plan-completeness.md +0 -67
- package/src/skills/create-plan/knowledge/plan-lifecycle.md +0 -33
- package/src/skills/create-plan/knowledge/task-verification-pairs.md +0 -151
- package/src/skills/create-plan/knowledge/test-hygiene.md +0 -47
- package/src/skills/create-plan/scripts/validate-completeness.js +0 -182
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-2.md +0 -39
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-3.md +0 -35
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/judge.json +0 -167
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -10
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-1.md +0 -26
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-2.md +0 -86
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-1.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/meta.json +0 -119
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001-validate-completeness.yaml +0 -41
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/judge.json +0 -164
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -3
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -13
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-1.md +0 -44
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-3.md +0 -49
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-1.md +0 -6
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-3.md +0 -16
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/meta.json +0 -116
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002-task-granularity.yaml +0 -39
- package/src/skills/create-plan/tests/index.yaml +0 -25
- package/src/skills/create-plan/tests/rubrics/task-granularity.md +0 -21
- package/src/skills/create-plan/tests/rubrics/validate-completeness.md +0 -21
- package/src/skills/create-plan/workflows/create.md +0 -136
- package/src/skills/create-report/README.md +0 -40
- package/src/skills/create-report/algorithms/metric-calculation.md +0 -93
- package/src/skills/create-report/knowledge/report-metrics.md +0 -82
- package/src/skills/create-report/scripts/calc-metrics.js +0 -383
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-2.md +0 -26
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-3.md +0 -28
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/judge.json +0 -163
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -3
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-1.md +0 -8
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-2.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-3.md +0 -7
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-1.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-2.md +0 -22
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-3.md +0 -13
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/meta.json +0 -115
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001-root-cause-attribution.yaml +0 -57
- package/src/skills/create-report/tests/index.yaml +0 -20
- package/src/skills/create-report/tests/rubrics/root-cause-attribution.md +0 -21
- package/src/skills/create-report/workflows/standard.md +0 -175
- package/src/skills/decompose-gaps/README.md +0 -39
- package/src/skills/decompose-gaps/algorithms/scope-check.md +0 -110
- package/src/skills/decompose-gaps/knowledge/scope-validation.md +0 -65
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-1.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-2.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-3.md +0 -56
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/judge.json +0 -164
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-3.md +0 -22
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-3.md +0 -29
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-2.md +0 -35
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-3.md +0 -18
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/meta.json +0 -116
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001-scope-exclusion.yaml +0 -46
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-3.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/judge.json +0 -163
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-1.md +0 -21
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-2.md +0 -38
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-3.md +0 -16
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-2.md +0 -10
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-3.md +0 -9
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/meta.json +0 -115
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002-glob-before-write.yaml +0 -36
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/judge.json +0 -165
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-2.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-2.md +0 -37
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-3.md +0 -45
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-1.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-2.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/meta.json +0 -117
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003-parent-plan-mandatory.yaml +0 -41
- package/src/skills/decompose-gaps/tests/index.yaml +0 -30
- package/src/skills/decompose-gaps/tests/rubrics/glob-before-write.md +0 -21
- package/src/skills/decompose-gaps/tests/rubrics/parent-plan-mandatory.md +0 -22
- package/src/skills/decompose-gaps/tests/rubrics/scope-exclusion.md +0 -21
- package/src/skills/decompose-gaps/workflows/decompose.md +0 -123
- package/src/skills/decompose-plan/README.md +0 -43
- package/src/skills/decompose-plan/algorithms/deduplication.md +0 -101
- package/src/skills/decompose-plan/knowledge/atomicity-checklist.md +0 -139
- package/src/skills/decompose-plan/knowledge/capabilities.md +0 -68
- package/src/skills/decompose-plan/knowledge/human-task-rules.md +0 -82
- package/src/skills/decompose-plan/knowledge/scope-guard-checklist.md +0 -73
- package/src/skills/decompose-plan/scripts/check-atomicity-limit.js +0 -47
- package/src/skills/decompose-plan/scripts/check-duplicates.js +0 -323
- package/src/skills/decompose-plan/scripts/verify-atomicity.js +0 -408
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-2.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -20
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -28
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-1.md +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-2.md +0 -137
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-3.md +0 -188
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-2.md +0 -32
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-3.md +0 -110
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001-atomicity-no-1to1.yaml +0 -56
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-1.md +0 -47
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-2.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-3.md +0 -43
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -15
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -12
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-1.md +0 -34
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-2.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002-get-next-id-mandatory.yaml +0 -44
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-1.md +0 -21
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-2.md +0 -38
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-1.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-2.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-3.md +0 -48
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-1.md +0 -167
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-2.md +0 -62
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-3.md +0 -174
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-2.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003-verbatim-dod-transfer.yaml +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-1.md +0 -55
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-2.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-3.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-1.md +0 -104
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-2.md +0 -45
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-3.md +0 -58
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-1.md +0 -193
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-2.md +0 -202
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-3.md +0 -155
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-1.md +0 -52
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004-executor-atomicity.yaml +0 -64
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-1.md +0 -59
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-2.md +0 -204
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-3.md +0 -213
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-2.md +0 -57
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-3.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-1.md +0 -147
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-2.md +0 -165
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-3.md +0 -133
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-1.md +0 -81
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-2.md +0 -108
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-3.md +0 -3
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/meta.json +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005-capabilities-registry.yaml +0 -78
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-1.md +0 -225
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-2.md +0 -66
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-3.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-1.md +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-2.md +0 -67
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-3.md +0 -40
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-1.md +0 -122
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-2.md +0 -131
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-3.md +0 -138
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-1.md +0 -41
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-2.md +0 -88
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006-dod-threshold.yaml +0 -72
- package/src/skills/decompose-plan/tests/index.yaml +0 -45
- package/src/skills/decompose-plan/tests/rubrics/atomicity-no-1to1.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/capabilities-registry.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/dod-threshold.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/executor-atomicity.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/get-next-id-mandatory.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/verbatim-dod-transfer.md +0 -21
- package/src/skills/decompose-plan/workflows/decompose.md +0 -305
- package/src/skills/deep-research/README.md +0 -36
- package/src/skills/deep-research/algorithms/source-scoring.md +0 -63
- package/src/skills/deep-research/algorithms/synthesis.md +0 -67
- package/src/skills/deep-research/knowledge/data-validation.md +0 -44
- package/src/skills/deep-research/knowledge/perplexity-config.md +0 -30
- package/src/skills/deep-research/knowledge/research-methodology.md +0 -54
- package/src/skills/deep-research/knowledge/source-evaluation.md +0 -33
- package/src/skills/deep-research/scripts/perplexity-research.js +0 -315
- package/src/skills/deep-research/templates/brief-summary.md +0 -25
- package/src/skills/deep-research/templates/research-report.md +0 -76
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-1.md +0 -48
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-2.md +0 -88
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-3.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/judge.json +0 -163
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-1.md +0 -58
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-2.md +0 -249
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-3.md +0 -44
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-1.md +0 -96
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-2.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-3.md +0 -94
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-2.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-3.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/meta.json +0 -115
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001-self-check-url.yaml +0 -58
- package/src/skills/deep-research/tests/index.yaml +0 -20
- package/src/skills/deep-research/tests/rubrics/self-check-url.md +0 -34
- package/src/skills/deep-research/workflows/base-checklist.md +0 -19
- package/src/skills/deep-research/workflows/benchmark.md +0 -38
- package/src/skills/deep-research/workflows/competitor.md +0 -44
- package/src/skills/deep-research/workflows/custom.md +0 -32
- package/src/skills/deep-research/workflows/market.md +0 -44
- package/src/skills/deep-research/workflows/technology.md +0 -40
- package/src/skills/deep-research/workflows/trend.md +0 -40
- package/src/skills/execute-task/README.md +0 -44
- package/src/skills/execute-task/algorithms/execution-strategy.md +0 -136
- package/src/skills/execute-task/knowledge/context-checkpoints.md +0 -75
- package/src/skills/execute-task/knowledge/ticket-structure.md +0 -70
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-3.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001-no-ticket-creation.yaml +0 -48
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-2.md +0 -6
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-3.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-1.md +0 -9
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-2.md +0 -26
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002-no-duplicate-dod.yaml +0 -44
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/judge.json +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/meta.json +0 -37
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003-verification-proportionality.yaml +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-1.md +0 -18
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-2.md +0 -16
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-3.md +0 -14
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-3.md +0 -1
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-1.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004-no-foreign-ticket-edit.yaml +0 -50
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-1.md +0 -15
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-2.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005-ticket-fields-updated.yaml +0 -39
- package/src/skills/execute-task/tests/fixtures/IMPL-902-create-file.md +0 -41
- package/src/skills/execute-task/tests/fixtures/IMPL-904-current-task.md +0 -40
- package/src/skills/execute-task/tests/fixtures/IMPL-906-fill-ticket.md +0 -42
- package/src/skills/execute-task/tests/fixtures/QA-901-button-click.md +0 -41
- package/src/skills/execute-task/tests/fixtures/QA-903-visual-figma.md +0 -40
- package/src/skills/execute-task/tests/fixtures/TASK-905-done-with-typo.md +0 -36
- package/src/skills/execute-task/tests/index.yaml +0 -39
- package/src/skills/execute-task/tests/rubrics/no-duplicate-dod.md +0 -22
- package/src/skills/execute-task/tests/rubrics/no-foreign-ticket-edit.md +0 -20
- package/src/skills/execute-task/tests/rubrics/no-ticket-creation.md +0 -21
- package/src/skills/execute-task/tests/rubrics/ticket-fields-updated.md +0 -23
- package/src/skills/execute-task/tests/rubrics/verification-proportionality.md +0 -22
- package/src/skills/execute-task/workflows/execute.md +0 -104
- package/src/skills/manual-testing/README.md +0 -63
- package/src/skills/manual-testing/algorithms/blocked-tool-strategy.md +0 -74
- package/src/skills/manual-testing/algorithms/bug-severity.md +0 -73
- package/src/skills/manual-testing/algorithms/mcp-budget.md +0 -97
- package/src/skills/manual-testing/algorithms/test-prioritization.md +0 -69
- package/src/skills/manual-testing/knowledge/browser-extension-testing.md +0 -102
- package/src/skills/manual-testing/knowledge/browser-tools.md +0 -114
- package/src/skills/manual-testing/knowledge/desktop-tools-advanced.md +0 -92
- package/src/skills/manual-testing/knowledge/desktop-tools-core.md +0 -76
- package/src/skills/manual-testing/knowledge/sandbox-advanced.md +0 -83
- package/src/skills/manual-testing/knowledge/sandbox-core.md +0 -67
- package/src/skills/manual-testing/knowledge/stateful-edge-cases.md +0 -69
- package/src/skills/manual-testing/knowledge/test-case-design.md +0 -107
- package/src/skills/manual-testing/knowledge/testing-types.md +0 -45
- package/src/skills/manual-testing/templates/bug-report.md +0 -52
- package/src/skills/manual-testing/templates/test-case.md +0 -34
- package/src/skills/manual-testing/templates/test-plan.md +0 -97
- package/src/skills/manual-testing/templates/test-session-report.md +0 -56
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-1.md +0 -34
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-3.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-1.md +0 -4
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-3.md +0 -8
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001-sandbox-mandatory.yaml +0 -38
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-1.md +0 -44
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-3.md +0 -47
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-3.md +0 -24
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-2.md +0 -13
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-3.md +0 -18
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-1.md +0 -21
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002-visual-tc-screenshot.yaml +0 -37
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-1.md +0 -76
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-2.md +0 -71
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-3.md +0 -85
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/judge.json +0 -46
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/meta.json +0 -36
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003-qa-non-ui-assertion.yaml +0 -65
- package/src/skills/manual-testing/tests/index.yaml +0 -30
- package/src/skills/manual-testing/tests/last-run-tc001-sonnet.log +0 -140
- package/src/skills/manual-testing/tests/last-run-tc002.log +0 -1
- package/src/skills/manual-testing/tests/last-run.log +0 -1469
- package/src/skills/manual-testing/tests/rubrics/qa-non-ui-assertion.md +0 -31
- package/src/skills/manual-testing/tests/rubrics/sandbox-mandatory.md +0 -20
- package/src/skills/manual-testing/tests/rubrics/visual-tc-screenshot.md +0 -21
- package/src/skills/manual-testing/workflows/acceptance.md +0 -80
- package/src/skills/manual-testing/workflows/exploratory.md +0 -84
- package/src/skills/manual-testing/workflows/regression.md +0 -76
- package/src/skills/manual-testing/workflows/smoke.md +0 -109
- package/src/skills/manual-testing/workflows/test-plan.md +0 -75
- package/src/skills/review-result/README.md +0 -59
- package/src/skills/review-result/algorithms/verification.md +0 -112
- package/src/skills/review-result/knowledge/baseline-snapshot-validation.md +0 -67
- package/src/skills/review-result/knowledge/dod-patterns.md +0 -116
- package/src/skills/review-result/knowledge/test-hygiene.md +0 -44
- package/src/skills/review-result/scripts/verify-artifacts.js +0 -497
- package/src/skills/review-result/templates/verdict.md +0 -153
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-1.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-2.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/judge.json +0 -164
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-1.md +0 -49
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-2.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-3.md +0 -37
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-2.md +0 -13
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/meta.json +0 -116
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001-visual-tc-trigger.yaml +0 -51
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-1.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-2.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-3.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-2.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-2.md +0 -25
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-3.md +0 -32
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-1.md +0 -34
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-2.md +0 -8
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-3.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002-path-line-suffix.yaml +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-1.md +0 -40
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-2.md +0 -15
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-3.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-3.md +0 -11
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-1.md +0 -16
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-2.md +0 -18
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-3.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-1.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-3.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003-test-isolation.yaml +0 -50
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-2.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/claude-sonnet/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/judge.json +0 -46
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004/current/meta.json +0 -37
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-004-baseline-snapshot.yaml +0 -50
- package/src/skills/review-result/tests/fixtures/IMPL-902-path-with-line.md +0 -43
- package/src/skills/review-result/tests/fixtures/QA-901-visual-button.md +0 -46
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/QA-904.md +0 -51
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/example-test.mjs +0 -36
- package/src/skills/review-result/tests/fixtures/QA-905-baseline-regex-instead-of-snapshot/QA-905.md +0 -62
- package/src/skills/review-result/tests/fixtures/QA-905-baseline-regex-instead-of-snapshot/baseline.test.mjs +0 -124
- package/src/skills/review-result/tests/index.yaml +0 -35
- package/src/skills/review-result/tests/rubrics/baseline-snapshot.md +0 -20
- package/src/skills/review-result/tests/rubrics/path-line-suffix.md +0 -19
- package/src/skills/review-result/tests/rubrics/test-isolation.md +0 -20
- package/src/skills/review-result/tests/rubrics/visual-tc-trigger.md +0 -19
- package/src/skills/review-result/workflows/review.md +0 -209
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-3.md
DELETED
|
@@ -1,21 +0,0 @@
|
|
|
1
|
-
**Вердикт: FAILED**
|
|
2
|
-
|
|
3
|
-
### Замечания по DoD
|
|
4
|
-
|
|
5
|
-
| № | Критерий | Статус | Причина |
|
|
6
|
-
|----|----------|--------|---------|
|
|
7
|
-
| 1 | Кнопка визуально соответствует макету | ❌ | Скриншот не приложен. Невозможно верифицировать соответствие макету Figma без визуального подтверждения. |
|
|
8
|
-
| 2 | Цвет совпадает с design tokens | ❌ | Скриншот не приложен. Невозможно подтвердить совпадение цвета визуально. |
|
|
9
|
-
| 3 | Скругление углов 8px | ❌ | Скриншот не приложен. Хотя это можно верифицировать из кода, исполнитель не предоставил визуальное подтверждение. |
|
|
10
|
-
|
|
11
|
-
### Итог
|
|
12
|
-
|
|
13
|
-
Исполнитель отметил все критерии как выполненные, но не приложил **ни одного скриншота**. Это нарушает принцип **Evidence-Based** ревью: для визуальных критериев требуется реальное визуальное доказательство (скриншот из браузера/dev-tools, сравнение с макетом).
|
|
14
|
-
|
|
15
|
-
---RESULT---
|
|
16
|
-
status: failed
|
|
17
|
-
issues:
|
|
18
|
-
- "Критерий 1 (Кнопка визуально соответствует макету): требуется скриншот для верификации соответствия макету"
|
|
19
|
-
- "Критерий 2 (Цвет совпадает с design tokens): требуется скриншот для подтверждения цвета"
|
|
20
|
-
- "Критерий 3 (Скругление углов 8px): требуется скриншот или код-inspection для визуального подтверждения"
|
|
21
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-1.md
DELETED
|
@@ -1,6 +0,0 @@
|
|
|
1
|
-
---RESULT---
|
|
2
|
-
status: failed
|
|
3
|
-
issues:
|
|
4
|
-
- "DoD пункт 1 «Кнопка визуально соответствует макету» не верифицирован: визуальный критерий требует скриншот как evidence (шаг 4.7, триггер B), однако скриншоты не приложены. Исполнитель явно подтвердил: «Скриншоты не прикладывал — проверял локально в браузере». Визуальное соответствие макету не может быть подтверждено без визуального артефакта. [HIGH]"
|
|
5
|
-
- "QA-тикет не содержит требуемого evidence: отсутствуют a11y assertion и детальное описание шагов воспроизведения. Фраза «проверял локально в браузере» не является достаточным evidence по правилам шага 5.1."
|
|
6
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-2.md
DELETED
|
@@ -1,6 +0,0 @@
|
|
|
1
|
-
---RESULT---
|
|
2
|
-
status: failed
|
|
3
|
-
issues:
|
|
4
|
-
- "DoD п.1 «Кнопка визуально соответствует макету» не подтверждён: ожидалось визуальное evidence (скриншот или a11y assertion), получено только текстовое утверждение «проверял локально в браузере» — без артефактов. Файл скриншота не приложен и физически не существует."
|
|
5
|
-
- "Файл-артефакт src/components/SubmitButton.tsx физически отсутствует в репозитории."
|
|
6
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-3.md
DELETED
|
@@ -1,6 +0,0 @@
|
|
|
1
|
-
---RESULT---
|
|
2
|
-
status: failed
|
|
3
|
-
issues:
|
|
4
|
-
- "Пункт DoD 1 не выполнен: визуальный критерий «Кнопка визуально соответствует макету» не подтверждён — скриншоты отсутствуют физически, executor явно указал «скриншоты не прикладывал». Визуальное свойство не может быть верифицировано без визуального артефакта. [HIGH, шаг 4.7.0]"
|
|
5
|
-
- "Отсутствует evidence для всех трёх пунктов DoD: в секции Result нет a11y tree assertion, DOM-проверок, пошагового описания тест-шагов с ожидаемым/фактическим результатом. Указано только «проверял локально в браузере» — недостаточно для QA-тикета (шаг 5.1: «не приложил evidence»)."
|
|
6
|
-
---RESULT---
|
|
@@ -1,164 +0,0 @@
|
|
|
1
|
-
{
|
|
2
|
-
"per_model": {
|
|
3
|
-
"claude-sonnet": {
|
|
4
|
-
"pass_count": 3,
|
|
5
|
-
"total": 3,
|
|
6
|
-
"trials": [
|
|
7
|
-
{
|
|
8
|
-
"trial": 1,
|
|
9
|
-
"score": 5,
|
|
10
|
-
"passed": true
|
|
11
|
-
},
|
|
12
|
-
{
|
|
13
|
-
"trial": 2,
|
|
14
|
-
"score": 5,
|
|
15
|
-
"passed": true
|
|
16
|
-
},
|
|
17
|
-
{
|
|
18
|
-
"trial": 3,
|
|
19
|
-
"score": 5,
|
|
20
|
-
"passed": true
|
|
21
|
-
}
|
|
22
|
-
]
|
|
23
|
-
},
|
|
24
|
-
"kilo-glm": {
|
|
25
|
-
"pass_count": 3,
|
|
26
|
-
"total": 3,
|
|
27
|
-
"trials": [
|
|
28
|
-
{
|
|
29
|
-
"trial": 1,
|
|
30
|
-
"score": 5,
|
|
31
|
-
"passed": true
|
|
32
|
-
},
|
|
33
|
-
{
|
|
34
|
-
"trial": 2,
|
|
35
|
-
"score": 5,
|
|
36
|
-
"passed": true
|
|
37
|
-
},
|
|
38
|
-
{
|
|
39
|
-
"trial": 3,
|
|
40
|
-
"score": 5,
|
|
41
|
-
"passed": true
|
|
42
|
-
}
|
|
43
|
-
]
|
|
44
|
-
},
|
|
45
|
-
"kilo-minimax": {
|
|
46
|
-
"pass_count": 3,
|
|
47
|
-
"total": 3,
|
|
48
|
-
"trials": [
|
|
49
|
-
{
|
|
50
|
-
"trial": 1,
|
|
51
|
-
"score": 5,
|
|
52
|
-
"passed": true
|
|
53
|
-
},
|
|
54
|
-
{
|
|
55
|
-
"trial": 2,
|
|
56
|
-
"score": 5,
|
|
57
|
-
"passed": true
|
|
58
|
-
},
|
|
59
|
-
{
|
|
60
|
-
"trial": 3,
|
|
61
|
-
"score": 5,
|
|
62
|
-
"passed": true
|
|
63
|
-
}
|
|
64
|
-
]
|
|
65
|
-
},
|
|
66
|
-
"kilo-deepseek": {
|
|
67
|
-
"pass_count": 2,
|
|
68
|
-
"total": 3,
|
|
69
|
-
"trials": [
|
|
70
|
-
{
|
|
71
|
-
"trial": 1,
|
|
72
|
-
"score": null,
|
|
73
|
-
"passed": false
|
|
74
|
-
},
|
|
75
|
-
{
|
|
76
|
-
"trial": 2,
|
|
77
|
-
"score": 5,
|
|
78
|
-
"passed": true
|
|
79
|
-
},
|
|
80
|
-
{
|
|
81
|
-
"trial": 3,
|
|
82
|
-
"score": 5,
|
|
83
|
-
"passed": true
|
|
84
|
-
}
|
|
85
|
-
]
|
|
86
|
-
}
|
|
87
|
-
},
|
|
88
|
-
"rubric_scores": [
|
|
89
|
-
{
|
|
90
|
-
"agentId": "kilo-deepseek",
|
|
91
|
-
"trial": 1,
|
|
92
|
-
"score": null,
|
|
93
|
-
"errored": true,
|
|
94
|
-
"error": "Stage \"TC-REVIEW-RESULT-001-kilo-deepseek-trial-1\" timed out after 1200s"
|
|
95
|
-
},
|
|
96
|
-
{
|
|
97
|
-
"agentId": "kilo-deepseek",
|
|
98
|
-
"trial": 2,
|
|
99
|
-
"score": 5,
|
|
100
|
-
"errored": false
|
|
101
|
-
},
|
|
102
|
-
{
|
|
103
|
-
"agentId": "kilo-deepseek",
|
|
104
|
-
"trial": 3,
|
|
105
|
-
"score": 5,
|
|
106
|
-
"errored": false
|
|
107
|
-
},
|
|
108
|
-
{
|
|
109
|
-
"agentId": "kilo-glm",
|
|
110
|
-
"trial": 1,
|
|
111
|
-
"score": 5,
|
|
112
|
-
"errored": false
|
|
113
|
-
},
|
|
114
|
-
{
|
|
115
|
-
"agentId": "kilo-glm",
|
|
116
|
-
"trial": 2,
|
|
117
|
-
"score": 5,
|
|
118
|
-
"errored": false
|
|
119
|
-
},
|
|
120
|
-
{
|
|
121
|
-
"agentId": "kilo-glm",
|
|
122
|
-
"trial": 3,
|
|
123
|
-
"score": 5,
|
|
124
|
-
"errored": false
|
|
125
|
-
},
|
|
126
|
-
{
|
|
127
|
-
"agentId": "kilo-minimax",
|
|
128
|
-
"trial": 1,
|
|
129
|
-
"score": 5,
|
|
130
|
-
"errored": false
|
|
131
|
-
},
|
|
132
|
-
{
|
|
133
|
-
"agentId": "kilo-minimax",
|
|
134
|
-
"trial": 2,
|
|
135
|
-
"score": 5,
|
|
136
|
-
"errored": false
|
|
137
|
-
},
|
|
138
|
-
{
|
|
139
|
-
"agentId": "kilo-minimax",
|
|
140
|
-
"trial": 3,
|
|
141
|
-
"score": 5,
|
|
142
|
-
"errored": false
|
|
143
|
-
},
|
|
144
|
-
{
|
|
145
|
-
"agentId": "claude-sonnet",
|
|
146
|
-
"trial": 1,
|
|
147
|
-
"score": 5,
|
|
148
|
-
"errored": false
|
|
149
|
-
},
|
|
150
|
-
{
|
|
151
|
-
"agentId": "claude-sonnet",
|
|
152
|
-
"trial": 2,
|
|
153
|
-
"score": 5,
|
|
154
|
-
"errored": false
|
|
155
|
-
},
|
|
156
|
-
{
|
|
157
|
-
"agentId": "claude-sonnet",
|
|
158
|
-
"trial": 3,
|
|
159
|
-
"score": 5,
|
|
160
|
-
"errored": false
|
|
161
|
-
}
|
|
162
|
-
],
|
|
163
|
-
"timestamp": "2026-04-25T07:11:20.099Z"
|
|
164
|
-
}
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-2.md
DELETED
|
@@ -1,7 +0,0 @@
|
|
|
1
|
-
---RESULT---
|
|
2
|
-
status: failed
|
|
3
|
-
issues:
|
|
4
|
-
- "Пункт DoD «Кнопка визуально соответствует макету» не выполнен: ожидалось визуальное evidence (скриншот), скриншоты отсутствуют, файл .workflow/tickets/review/QA-901.md:45"
|
|
5
|
-
- "Пункт DoD «Цвет кнопки совпадает с design tokens» не выполнен: файл src/components/SubmitButton.tsx из списка изменённых файлов отсутствует, .workflow/tickets/review/QA-901.md:41"
|
|
6
|
-
- "Пункт DoD «Скругление углов 8px» не выполнен: файл src/components/SubmitButton.tsx отсутствует, .workflow/tickets/review/QA-901.md:41"
|
|
7
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-3.md
DELETED
|
@@ -1,6 +0,0 @@
|
|
|
1
|
-
---RESULT---
|
|
2
|
-
status: failed
|
|
3
|
-
issues:
|
|
4
|
-
- "Пункты DoD 1-3 (визуальные критерии) не выполнены: ожидалось визуальное evidence (скриншоты или a11y assertions), получено только текстовое утверждение без подтверждения"
|
|
5
|
-
- "Файл-артефакт src/components/SubmitButton.tsx отсутствует"
|
|
6
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-1.md
DELETED
|
@@ -1,49 +0,0 @@
|
|
|
1
|
-
Теперь загружаю необходимые знания и шаблоны.
|
|
2
|
-
Скрипт недоступен — перехожу к ручной проверке (fallback по воркфлоу). Проверяю артефакты.
|
|
3
|
-
Выполняю ревью по полному воркфлоу.
|
|
4
|
-
|
|
5
|
-
**Шаг 0:** Секции `## Ревью` нет — полная проверка.
|
|
6
|
-
|
|
7
|
-
**Шаг 1:** Скрипт verify-artifacts недоступен — ручная проверка (fallback).
|
|
8
|
-
|
|
9
|
-
**Шаг 2:** Парсинг тикета:
|
|
10
|
-
- Тип: `qa`
|
|
11
|
-
- DoD: 3 пункта, все отмечены `[x]`
|
|
12
|
-
- Изменённые файлы: `src/components/SubmitButton.tsx`
|
|
13
|
-
- Result заполнен
|
|
14
|
-
|
|
15
|
-
**Шаг 3:** Проверка DoD — файл `src/components/SubmitButton.tsx` не существует (Glob не нашёл).
|
|
16
|
-
|
|
17
|
-
**Шаг 4.7:** Визуальная верификация:
|
|
18
|
-
- Условие **B** сработало: DoD содержит визуальный критерий «Кнопка визуально соответствует макету» (формулировка «визуально соответствует»).
|
|
19
|
-
- Шаг 4.7.0: Glob по `**/*.png`, `**/*QA-901*.png`, `**/*submit*button*.png` — скриншоты не найдены.
|
|
20
|
-
- Finding: визуальный критерий без визуального evidence → issue уровня HIGH.
|
|
21
|
-
|
|
22
|
-
**Шаг 5:** Тип тикета `qa`, но:
|
|
23
|
-
- HIGH issue: отсутствует файл-артефакт `src/components/SubmitButton.tsx` → блокирует `passed` (шаг 5.1).
|
|
24
|
-
- HIGH issue: визуальный критерий DoD не подтверждён скриншотом.
|
|
25
|
-
|
|
26
|
-
Записываю секцию ревью в тикет:
|
|
27
|
-
## Ревью тикета QA-901
|
|
28
|
-
|
|
29
|
-
**Вердикт: ❌ failed**
|
|
30
|
-
|
|
31
|
-
### Проверенные критерии
|
|
32
|
-
|
|
33
|
-
| # | Критерий DoD | Результат | Обоснование |
|
|
34
|
-
|---|-------------|-----------|-------------|
|
|
35
|
-
| 1 | Кнопка визуально соответствует макету | ❌ | Визуальный критерий без визуального evidence. Скриншоты не приложены, не найдены в проекте. Текстовое заявление «проверял локально» не подтверждает визуальное соответствие |
|
|
36
|
-
| 2 | Цвет кнопки совпадает с design tokens | ⚠️ | Файл `src/components/SubmitButton.tsx` не существует — невозможно верифицировать |
|
|
37
|
-
| 3 | Скругление углов 8px | ⚠️ | Файл `src/components/SubmitButton.tsx` не существует — невозможно верифицировать |
|
|
38
|
-
|
|
39
|
-
### Issues
|
|
40
|
-
|
|
41
|
-
1. **Файл-артефакт отсутствует:** `src/components/SubmitButton.tsx` из секции «Изменённые файлы» не существует физически (Glob + Bash fallback подтвердили отсутствие)
|
|
42
|
-
2. **Визуальный критерий без evidence:** DoD «Кнопка визуально соответствует макету» содержит визуальную формулировку, но скриншоты исполнителем не приложены. Evidence — только текстовое заявление без визуального артефакта. Визуальное свойство не может быть подтверждено без скриншота
|
|
43
|
-
|
|
44
|
-
---RESULT---
|
|
45
|
-
status: failed
|
|
46
|
-
issues:
|
|
47
|
-
- "Файл src/components/SubmitButton.tsx из секции «Изменённые файлы» не существует физически"
|
|
48
|
-
- "DoD «Кнопка визуально соответствует макету»: визуальный критерий без скриншота — визуальное evidence отсутствует, подтверждение невозможно"
|
|
49
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-2.md
DELETED
|
@@ -1,28 +0,0 @@
|
|
|
1
|
-
Тикет прочитан, модули знаний не найдены в изолированном воркспейсе. Приступаю к ревью.
|
|
2
|
-
|
|
3
|
-
**Анализ тикета QA-901:**
|
|
4
|
-
|
|
5
|
-
Тикет содержит 3 пункта DoD, все отмечены `[x]`. Тип задачи — визуальная проверка (QA). В секции Result исполнитель подтверждает выполнение, но явно указывает: «Скриншоты не прикладывал — проверял локально в браузере».
|
|
6
|
-
|
|
7
|
-
**Проверка критериев DoD:**
|
|
8
|
-
|
|
9
|
-
| # | Критерий | Статус | Обоснование |
|
|
10
|
-
|---|----------|--------|-------------|
|
|
11
|
-
| 1 | Кнопка визуально соответствует макету | ❌ | Визуальный критерий требует объективного подтверждения (скриншот, ссылку на макет). Исполнитель не приложил скриншоты. |
|
|
12
|
-
| 2 | Цвет кнопки совпадает с design tokens | ❓ | Нет артефакта (скриншота, код-ревью, diff), подтверждающего совпадение цвета с токенами. |
|
|
13
|
-
| 3 | Скругление углов 8px | ❓ | Нет артефакта, подтверждающего значение. |
|
|
14
|
-
|
|
15
|
-
Для визуальных критериев DoD требуется evidence-based подтверждение — скриншоты до/после или иные объективные артефакты. Принимать на веру слово исполнителя нельзя (принцип #2: Evidence-Based). Файл `src/components/SubmitButton.tsx` заявлен, но в изолированном воркспейсе его нет — верификация по коду также невозможна.
|
|
16
|
-
|
|
17
|
-
При этом критерий «Кнопка визуально соответствует макету» — ключевой визуальный критерий, по которому скриншоты обязательны и явно не приложены. Остальные два пункта производны и без подтверждения первого тоже не могут быть приняты.
|
|
18
|
-
|
|
19
|
-
---
|
|
20
|
-
|
|
21
|
-
```
|
|
22
|
-
---RESULT---
|
|
23
|
-
status: failed
|
|
24
|
-
issues:
|
|
25
|
-
- "Пункт DoD 1 не выполнен: визуальный критерий «Кнопка визуально соответствует макету» требует объективного подтверждения (скриншот/ссылка на макет), но скриншоты исполнителем не приложены. Result содержит лишь словесное утверждение без артефактов."
|
|
26
|
-
- "Пункты DoD 2 и 3 не подтверждены: нет скриншотов, diff, или иных артефактов, подтверждающих совпадение цвета с design tokens и скругление 8px."
|
|
27
|
-
---RESULT---
|
|
28
|
-
```
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-3.md
DELETED
|
@@ -1,37 +0,0 @@
|
|
|
1
|
-
Скрипт верификации недоступен — fallback на ручную проверку. Файл-артефакт и скриншоты отсутствуют.
|
|
2
|
-
|
|
3
|
-
**Анализ по шагам:**
|
|
4
|
-
|
|
5
|
-
- **Шаг 0:** Секции `## Ревью` нет → полная проверка.
|
|
6
|
-
- **Шаг 1:** Скрипт недоступен → ручной fallback.
|
|
7
|
-
- **Шаг 2:** Тип `qa`, DoD содержит 3 пункта, все `[x]`. Result заполнен, но без evidence.
|
|
8
|
-
- **Шаг 4.7:** DoD содержит визуальный критерий («визуально соответствует макету») → шаг 4.7 обязательный. Скриншоты не найдены (Glob `**/*.png` = 0). Исполнитель прямо заявил: «Скриншоты не прикладывал». → **finding**: визуальный DoD без визуального evidence (HIGH).
|
|
9
|
-
- **Шаг 5.1 (QA-правило):** Исполнитель не приложил evidence — ни скриншотов, ни a11y assertions, ни описания шагов. Только текстовое заявление «проверял локально». Для QA-тикета это FAIL.
|
|
10
|
-
|
|
11
|
-
Теперь запишу секцию ревью в тикет:
|
|
12
|
-
## Вердикт ревью QA-901
|
|
13
|
-
|
|
14
|
-
**Тип:** `qa` (тестирование)
|
|
15
|
-
|
|
16
|
-
**Проверенные критерии DoD:**
|
|
17
|
-
|
|
18
|
-
| # | Критерий | Результат | Обоснование |
|
|
19
|
-
|---|----------|-----------|-------------|
|
|
20
|
-
| 1 | Кнопка визуально соответствует макету | ❌ failed | Визуальный критерий не подтверждён: скриншоты отсутствуют, a11y assertions отсутствуют. Evidence — только текстовое заявление «проверял локально в браузере». Шаг 4.7 review.md: визуальный TC без визуального артефакта = HIGH issue |
|
|
21
|
-
| 2 | Цвет кнопки совпадает с design tokens | ❌ failed | Артефакт `src/components/SubmitButton.tsx` физически не существует, верификация невозможна |
|
|
22
|
-
| 3 | Скругление углов 8px | ❌ failed | Артефакт не найден, evidence не предоставлено |
|
|
23
|
-
|
|
24
|
-
**Дополнительные проблемы:**
|
|
25
|
-
- Файл `src/components/SubmitButton.tsx` из «Изменённые файлы» не существует в workdir
|
|
26
|
-
- Для QA-тикета с визуальным DoD обязательны скриншоты или иное визуальное evidence (п. 5.1 review.md)
|
|
27
|
-
|
|
28
|
-
**Рекомендации:**
|
|
29
|
-
- Приложить скриншоты кнопки для подтверждения визуального соответствия макету
|
|
30
|
-
- Убедиться, что `src/components/SubmitButton.tsx` существует в workdir
|
|
31
|
-
|
|
32
|
-
---RESULT---
|
|
33
|
-
status: failed
|
|
34
|
-
issues:
|
|
35
|
-
- "Пункт DoD #1 не выполнен: визуальный критерий «Кнопка визуально соответствует макету» не подтверждён — скриншоты отсутствуют, a11y assertions нет, evidence только текстовое заявление исполнителя (шаг 4.7 review.md)"
|
|
36
|
-
- "Изменённый файл src/components/SubmitButton.tsx не найден в workdir — верификация DoD #2 и #3 невозможна"
|
|
37
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-1.md
DELETED
|
@@ -1,22 +0,0 @@
|
|
|
1
|
-
Тикет QA-901 не прошёл ревью.
|
|
2
|
-
|
|
3
|
-
**Проблема:** Визуальный критерий DoD «Кнопка визуально соответствует макету» отмечен как выполненный, однако:
|
|
4
|
-
- Скриншоты не приложены
|
|
5
|
-
- Ссылка на макет Figma отсутствует в `context.references`
|
|
6
|
-
- Нет никаких артефактов для верификации
|
|
7
|
-
|
|
8
|
-
Принцип Evidence-Based нарушен: отметка `[x]` не подтверждена реальным доказательством.
|
|
9
|
-
|
|
10
|
-
```markdown
|
|
11
|
-
## Ревью
|
|
12
|
-
|
|
13
|
-
| Дата | Статус | Самари |
|
|
14
|
-
|------|--------|--------|
|
|
15
|
-
| 2026-04-20 10:58 | ❌ failed | Визуальный критерий без скриншотов/макета — неверифицируем |
|
|
16
|
-
```
|
|
17
|
-
|
|
18
|
-
---RESULT---
|
|
19
|
-
status: failed
|
|
20
|
-
issues:
|
|
21
|
-
- "Пункт DoD 'Кнопка визуально соответствует макету': ожидался скриншот или ссылка на макет, получено — ничего. Result не содержит доказательств."
|
|
22
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-2.md
DELETED
|
@@ -1,13 +0,0 @@
|
|
|
1
|
-
Тикет QA-901. Визуальный критерий DoD без скриншотов не может быть верифицирован.
|
|
2
|
-
Тикет не содержит секции `## Ревью` — переходу к полной проверке.
|
|
3
|
-
|
|
4
|
-
**Issues:**
|
|
5
|
-
1. Файл `src/components/SubmitButton.tsx` не существует
|
|
6
|
-
2. DoD содержит визуальный критерий «Кнопка визуально соответствует макету» — скриншоты не приложены, визуальное соответствие не подтверждено
|
|
7
|
-
|
|
8
|
-
---RESULT---
|
|
9
|
-
status: failed
|
|
10
|
-
issues:
|
|
11
|
-
- "Пункт DoD 'Кнопка визуально соответствует макету' не выполнен: ожидалось скриншот evidence, получено отсутствие файлов-артефактов. Согласно шагу 4.7 workflow, визуальный критерий требует визуального подтверждения"
|
|
12
|
-
- "Файл src/components/SubmitButton.tsx не существует: ожидалось наличие изменённого файла, получено отсутствие файла по указанному пути"
|
|
13
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-3.md
DELETED
|
@@ -1,21 +0,0 @@
|
|
|
1
|
-
Анализирую тикет QA-901 согласно воркфлоу review.md.
|
|
2
|
-
|
|
3
|
-
**Шаг 0 (Быстрый выход):** Секция `## Ревью` отсутствует → переходим к полной проверке.
|
|
4
|
-
|
|
5
|
-
**Шаг 4.7 (Визуальная верификация):**
|
|
6
|
-
DoD содержит визуальный критерий: «Кнопка визуально соответствует макету».
|
|
7
|
-
Result исполнителя: «Скриншоты не прикладывал — проверял локально в браузере».
|
|
8
|
-
|
|
9
|
-
Согласно шагу 4.7.0: если визуальный TC есть, а скриншоты не найдены — это **finding**: evidence визуального TC является только текстовым без визуального артефакта.
|
|
10
|
-
|
|
11
|
-
Согласно `dod-patterns.md` (раздел qa): «Скриншоты ОБЯЗАТЕЛЬНЫ, если указаны в DoD OR в разделе Result». Исполнитель сам указал в Result отсутствие скриншотов.
|
|
12
|
-
|
|
13
|
-
Утверждение «проверено локально в браузере» не является достаточным визуальным evidence для DoD-пункта «визуально соответствует макету» — визуальное свойство не может быть подтверждено без скриншота.
|
|
14
|
-
|
|
15
|
-
---
|
|
16
|
-
|
|
17
|
-
---RESULT---
|
|
18
|
-
status: failed
|
|
19
|
-
issues:
|
|
20
|
-
- "Пункт DoD 'Кнопка визуально соответствует макету': визуальный критерий без скриншота. Ожидалось: скриншот сравнения с макетом Figma. Получено: текстовое утверждение 'проверено локально в браузере'. Файл: QA-901.md:45"
|
|
21
|
-
---RESULT---
|
|
@@ -1,116 +0,0 @@
|
|
|
1
|
-
{
|
|
2
|
-
"date": "2026-04-25T07:11:20.108Z",
|
|
3
|
-
"skill_sha": "b6cda8a",
|
|
4
|
-
"status": "passed",
|
|
5
|
-
"duration_ms": 138941,
|
|
6
|
-
"l1_skipped": true,
|
|
7
|
-
"per_model": {
|
|
8
|
-
"claude-sonnet": {
|
|
9
|
-
"passed": true,
|
|
10
|
-
"errored": false,
|
|
11
|
-
"pass_count": 3,
|
|
12
|
-
"error_count": 0,
|
|
13
|
-
"total": 3,
|
|
14
|
-
"threshold": 2
|
|
15
|
-
},
|
|
16
|
-
"kilo-glm": {
|
|
17
|
-
"passed": true,
|
|
18
|
-
"errored": false,
|
|
19
|
-
"pass_count": 3,
|
|
20
|
-
"error_count": 0,
|
|
21
|
-
"total": 3,
|
|
22
|
-
"threshold": 2
|
|
23
|
-
},
|
|
24
|
-
"kilo-minimax": {
|
|
25
|
-
"passed": true,
|
|
26
|
-
"errored": false,
|
|
27
|
-
"pass_count": 3,
|
|
28
|
-
"error_count": 0,
|
|
29
|
-
"total": 3,
|
|
30
|
-
"threshold": 2
|
|
31
|
-
},
|
|
32
|
-
"kilo-deepseek": {
|
|
33
|
-
"passed": true,
|
|
34
|
-
"errored": false,
|
|
35
|
-
"pass_count": 2,
|
|
36
|
-
"error_count": 1,
|
|
37
|
-
"total": 3,
|
|
38
|
-
"threshold": 2
|
|
39
|
-
}
|
|
40
|
-
},
|
|
41
|
-
"rubric_scores": [
|
|
42
|
-
{
|
|
43
|
-
"agentId": "kilo-deepseek",
|
|
44
|
-
"trial": 1,
|
|
45
|
-
"score": null,
|
|
46
|
-
"errored": true,
|
|
47
|
-
"error": "Stage \"TC-REVIEW-RESULT-001-kilo-deepseek-trial-1\" timed out after 1200s"
|
|
48
|
-
},
|
|
49
|
-
{
|
|
50
|
-
"agentId": "kilo-deepseek",
|
|
51
|
-
"trial": 2,
|
|
52
|
-
"score": 5,
|
|
53
|
-
"errored": false
|
|
54
|
-
},
|
|
55
|
-
{
|
|
56
|
-
"agentId": "kilo-deepseek",
|
|
57
|
-
"trial": 3,
|
|
58
|
-
"score": 5,
|
|
59
|
-
"errored": false
|
|
60
|
-
},
|
|
61
|
-
{
|
|
62
|
-
"agentId": "kilo-glm",
|
|
63
|
-
"trial": 1,
|
|
64
|
-
"score": 5,
|
|
65
|
-
"errored": false
|
|
66
|
-
},
|
|
67
|
-
{
|
|
68
|
-
"agentId": "kilo-glm",
|
|
69
|
-
"trial": 2,
|
|
70
|
-
"score": 5,
|
|
71
|
-
"errored": false
|
|
72
|
-
},
|
|
73
|
-
{
|
|
74
|
-
"agentId": "kilo-glm",
|
|
75
|
-
"trial": 3,
|
|
76
|
-
"score": 5,
|
|
77
|
-
"errored": false
|
|
78
|
-
},
|
|
79
|
-
{
|
|
80
|
-
"agentId": "kilo-minimax",
|
|
81
|
-
"trial": 1,
|
|
82
|
-
"score": 5,
|
|
83
|
-
"errored": false
|
|
84
|
-
},
|
|
85
|
-
{
|
|
86
|
-
"agentId": "kilo-minimax",
|
|
87
|
-
"trial": 2,
|
|
88
|
-
"score": 5,
|
|
89
|
-
"errored": false
|
|
90
|
-
},
|
|
91
|
-
{
|
|
92
|
-
"agentId": "kilo-minimax",
|
|
93
|
-
"trial": 3,
|
|
94
|
-
"score": 5,
|
|
95
|
-
"errored": false
|
|
96
|
-
},
|
|
97
|
-
{
|
|
98
|
-
"agentId": "claude-sonnet",
|
|
99
|
-
"trial": 1,
|
|
100
|
-
"score": 5,
|
|
101
|
-
"errored": false
|
|
102
|
-
},
|
|
103
|
-
{
|
|
104
|
-
"agentId": "claude-sonnet",
|
|
105
|
-
"trial": 2,
|
|
106
|
-
"score": 5,
|
|
107
|
-
"errored": false
|
|
108
|
-
},
|
|
109
|
-
{
|
|
110
|
-
"agentId": "claude-sonnet",
|
|
111
|
-
"trial": 3,
|
|
112
|
-
"score": 5,
|
|
113
|
-
"errored": false
|
|
114
|
-
}
|
|
115
|
-
]
|
|
116
|
-
}
|
|
@@ -1,51 +0,0 @@
|
|
|
1
|
-
id: TC-REVIEW-RESULT-001
|
|
2
|
-
title: "Ревьюер запускает визуальную проверку при визуальных TC в DoD"
|
|
3
|
-
origin:
|
|
4
|
-
chg: [CHG-043, CHG-044]
|
|
5
|
-
incidents:
|
|
6
|
-
- "PulseProxy: CHG-043 — триггер шага 4.7 расширен на визуальные критерии"
|
|
7
|
-
- "PulseProxy: CHG-044 — уточнение «скриншоты не обязательны от исполнителя»"
|
|
8
|
-
principle: "Визуальный TC триггерит шаг 4.7"
|
|
9
|
-
backlog_sources:
|
|
10
|
-
- "d:/Dev/PulseProxy/.workflow/coach-backlog.yaml"
|
|
11
|
-
tags: [visual-testing, review-trigger]
|
|
12
|
-
severity: high
|
|
13
|
-
|
|
14
|
-
scenario:
|
|
15
|
-
system_prompt_file: "../../SKILL.md"
|
|
16
|
-
extra_instructions: |
|
|
17
|
-
Проведи ревью тикета QA-901, который лежит в .workflow/tickets/review/QA-901.md.
|
|
18
|
-
DoD содержит визуальный критерий «Кнопка визуально соответствует макету».
|
|
19
|
-
Скриншоты исполнителем не приложены.
|
|
20
|
-
inputs:
|
|
21
|
-
- kind: ticket_file
|
|
22
|
-
path: "../fixtures/QA-901-visual-button.md"
|
|
23
|
-
ticket_id: QA-901
|
|
24
|
-
dest_dir: review
|
|
25
|
-
|
|
26
|
-
assertions:
|
|
27
|
-
static:
|
|
28
|
-
- kind: skill_contains
|
|
29
|
-
file: "SKILL.md"
|
|
30
|
-
pattern: "RESULT"
|
|
31
|
-
reason: "Формат RESULT обязателен."
|
|
32
|
-
- kind: skill_contains
|
|
33
|
-
file: "SKILL.md"
|
|
34
|
-
pattern: "Evidence"
|
|
35
|
-
reason: "Evidence-based подход."
|
|
36
|
-
|
|
37
|
-
deterministic:
|
|
38
|
-
- kind: output_contains_all
|
|
39
|
-
values: ["---RESULT---"]
|
|
40
|
-
reason: "Результат должен быть в формате пайплайна."
|
|
41
|
-
|
|
42
|
-
rubric:
|
|
43
|
-
- kind: llm_judge
|
|
44
|
-
rubric_file: "../rubrics/visual-tc-trigger.md"
|
|
45
|
-
criterion: "Запустил ли ревьюер визуальную проверку (потребовал скриншот / зафиксировал отсутствие визуального evidence) для DoD-пункта с визуальным критерием?"
|
|
46
|
-
pass_threshold: 4
|
|
47
|
-
trials: 3
|
|
48
|
-
aggregate: majority
|
|
49
|
-
|
|
50
|
-
execution:
|
|
51
|
-
timeout_s: 1200
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-1.md
DELETED
|
@@ -1,23 +0,0 @@
|
|
|
1
|
-
**Анализ результата:**
|
|
2
|
-
|
|
3
|
-
Согласно контексту тикета (примечание в секции context), это **регрессионный тест скила** (`TC-REVIEW-RESULT-002`). Рабочая директория изолирована, настоящего файла `src/background/index.ts` нет — это преднамеренно. Путь указан в отраслевой нотации (path:line-range формат), и ревьюер не должен помечать его как отсутствующий.
|
|
4
|
-
|
|
5
|
-
**Проверка DoD:**
|
|
6
|
-
|
|
7
|
-
| Критерий | Статус | Обоснование |
|
|
8
|
-
|----------|--------|------------|
|
|
9
|
-
| Парсер обрабатывает URL с query string | ✅ [x] | Summary заявляет: "Логика парсинга вынесена в отдельную функцию" |
|
|
10
|
-
| Unit-тест на edge-case `redirect_to` | ✅ [x] | Summary заявляет: "добавлен тест" |
|
|
11
|
-
|
|
12
|
-
**Проверка Result:**
|
|
13
|
-
|
|
14
|
-
- Summary заполнен содержательно (19 слов)
|
|
15
|
-
- Охватывает оба пункта DoD
|
|
16
|
-
- Нет противоречий между заявленным и DoD
|
|
17
|
-
|
|
18
|
-
**Вывод:** Все критерии DoD выполнены, Summary содержателен, тикет соответствует требованиям.
|
|
19
|
-
|
|
20
|
-
---RESULT---
|
|
21
|
-
status: passed
|
|
22
|
-
issues: []
|
|
23
|
-
---RESULT---
|
package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-2.md
DELETED
|
@@ -1,22 +0,0 @@
|
|
|
1
|
-
## Анализ тикета IMPL-902
|
|
2
|
-
|
|
3
|
-
**Шаг 0 — Быстрый выход:** Секция "Ревью" отсутствует → переходу к полной проверке.
|
|
4
|
-
|
|
5
|
-
**Шаг 1-6 — Полная проверка по DoD:**
|
|
6
|
-
|
|
7
|
-
| Критерий | Статус | Примечание |
|
|
8
|
-
|----------|--------|-----------|
|
|
9
|
-
| Парсер обрабатывает URL с query string без потери параметров | ✅ | Помечен [x], указано в Result |
|
|
10
|
-
| Добавлен unit-тест на edge-case с пустым `redirect_to` | ✅ | Помечен [x], указано в Result |
|
|
11
|
-
|
|
12
|
-
**Верификация артефактов:**
|
|
13
|
-
- Изменённые файлы указаны в формате `path:line-range` (src/background/index.ts:204-211) ✅
|
|
14
|
-
- Result содержит описание реализации (вынесена логика парсинга в отдельную функцию, добавлен тест) ✅
|
|
15
|
-
- Контекст тикета указывает что это регрессионный тест в изолированной директории — корректное состояние ✅
|
|
16
|
-
|
|
17
|
-
**Вердикт:** Все пункты DoD выполнены, артефакты в порядке.
|
|
18
|
-
|
|
19
|
-
---RESULT---
|
|
20
|
-
status: passed
|
|
21
|
-
issues: []
|
|
22
|
-
---RESULT---
|