workflow-ai 1.0.68 → 1.2.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +49 -0
- package/README.md +105 -7
- package/configs/pipeline.yaml +23 -2
- package/package.json +44 -41
- package/src/lib/operations/plans.mjs +85 -0
- package/src/lib/operations/skills.mjs +124 -0
- package/src/lib/operations/tickets.mjs +430 -0
- package/src/lib/utils.mjs +286 -286
- package/src/runner.mjs +314 -34
- package/src/scripts/check-conditions.js +2 -2
- package/src/scripts/get-next-id.js +8 -31
- package/src/scripts/pick-next-task.js +3 -9
- package/src/skills/coach/SKILL.md +1 -1
- package/src/skills/manual-testing/SKILL.md +2 -0
- package/src/skills/review-result/SKILL.md +1 -0
- package/src/scripts/tests/timeout-cascade.test.js +0 -28
- package/src/skills/analyze-report/README.md +0 -44
- package/src/skills/analyze-report/algorithms/progress-assessment.md +0 -108
- package/src/skills/analyze-report/knowledge/analysis-frameworks.md +0 -66
- package/src/skills/analyze-report/knowledge/report-structure.md +0 -61
- package/src/skills/analyze-report/scripts/calc-plan-metrics.js +0 -234
- package/src/skills/analyze-report/templates/analysis-report.md +0 -80
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-1.md +0 -69
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-2.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-3.md +0 -99
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -88
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -100
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-1.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-2.md +0 -64
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-glm/trial-3.md +0 -110
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-1.md +0 -74
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-2.md +0 -38
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-minimax/trial-3.md +0 -61
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001-evidence-from-log.yaml +0 -60
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-1.md +0 -90
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-2.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/claude-sonnet/trial-3.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/judge.json +0 -163
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-1.md +0 -84
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-2.md +0 -77
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-deepseek/trial-3.md +0 -89
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-1.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-2.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-glm/trial-3.md +0 -103
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-1.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-2.md +0 -93
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/kilo-minimax/trial-3.md +0 -86
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002/current/meta.json +0 -115
- package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-002-result-block-format.yaml +0 -44
- package/src/skills/analyze-report/tests/fixtures/REPORT-002-incorrect-attribution.md +0 -27
- package/src/skills/analyze-report/tests/fixtures/pipeline-2026-04-06_qa-001-skip.log +0 -32
- package/src/skills/analyze-report/tests/index.yaml +0 -25
- package/src/skills/analyze-report/tests/rubrics/evidence-from-log.md +0 -22
- package/src/skills/analyze-report/tests/rubrics/result-block-format.md +0 -22
- package/src/skills/analyze-report/workflows/progress.md +0 -158
- package/src/skills/analyze-report/workflows/retrospective.md +0 -143
- package/src/skills/coach/README.md +0 -43
- package/src/skills/coach/SKILL.md.legacy +0 -157
- package/src/skills/coach/algorithms/gap-analysis.md +0 -69
- package/src/skills/coach/algorithms/improvement-prioritization.md +0 -62
- package/src/skills/coach/algorithms/skill-scoring.md +0 -80
- package/src/skills/coach/knowledge/audit-applied-changes-clean.txt +0 -11
- package/src/skills/coach/knowledge/backlog-management.md +0 -67
- package/src/skills/coach/knowledge/backlog-management.md.legacy +0 -90
- package/src/skills/coach/knowledge/common-antipatterns.md +0 -76
- package/src/skills/coach/knowledge/prompt-engineering.md +0 -45
- package/src/skills/coach/knowledge/shared-knowledge-guide.md +0 -44
- package/src/skills/coach/knowledge/skill-anatomy.md +0 -49
- package/src/skills/coach/knowledge/test-authorship.md +0 -141
- package/src/skills/coach/templates/audit-report.md +0 -39
- package/src/skills/coach/templates/coach-backlog-init.yaml +0 -14
- package/src/skills/coach/templates/coach-backlog-init.yaml.legacy +0 -10
- package/src/skills/coach/templates/improvement-plan.md +0 -42
- package/src/skills/coach/templates/new-skill.md +0 -95
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-1.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-2.md +0 -65
- package/src/skills/coach/tests/cases/TC-COACH-001/current/claude-sonnet/trial-3.md +0 -58
- package/src/skills/coach/tests/cases/TC-COACH-001/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-deepseek/trial-3.md +0 -75
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-1.md +0 -81
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-2.md +0 -101
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-glm/trial-3.md +0 -91
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-1.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-2.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-001/current/kilo-minimax/trial-3.md +0 -55
- package/src/skills/coach/tests/cases/TC-COACH-001/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-001-evidence-based-temporal-diagram.yaml +0 -53
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-1.md +0 -46
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-2.md +0 -50
- package/src/skills/coach/tests/cases/TC-COACH-002/current/claude-sonnet/trial-3.md +0 -48
- package/src/skills/coach/tests/cases/TC-COACH-002/current/judge.json +0 -151
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-2.md +0 -37
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-deepseek/trial-3.md +0 -30
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-1.md +0 -23
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-2.md +0 -29
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-1.md +0 -13
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-2.md +0 -19
- package/src/skills/coach/tests/cases/TC-COACH-002/current/kilo-minimax/trial-3.md +0 -33
- package/src/skills/coach/tests/cases/TC-COACH-002/current/meta.json +0 -94
- package/src/skills/coach/tests/cases/TC-COACH-002-root-cause-first.yaml +0 -57
- package/src/skills/coach/tests/fixtures/pipeline-2026-04-06_id-collision.log +0 -77
- package/src/skills/coach/tests/index.yaml +0 -29
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-bad.md +0 -13
- package/src/skills/coach/tests/rubrics/calibration/evidence-based-good.md +0 -29
- package/src/skills/coach/tests/rubrics/evidence-based.md +0 -26
- package/src/skills/coach/tests/rubrics/root-cause-first.md +0 -21
- package/src/skills/coach/workflows/analyze.md +0 -79
- package/src/skills/coach/workflows/analyze.md.legacy +0 -64
- package/src/skills/coach/workflows/audit.md +0 -74
- package/src/skills/coach/workflows/audit.md.legacy +0 -59
- package/src/skills/coach/workflows/create.md +0 -80
- package/src/skills/coach/workflows/create.md.legacy +0 -67
- package/src/skills/coach/workflows/improve.md +0 -71
- package/src/skills/coach/workflows/improve.md.legacy +0 -60
- package/src/skills/coach/workflows/research.md +0 -55
- package/src/skills/coach/workflows/review.md +0 -52
- package/src/skills/coach/workflows/review.md.legacy +0 -48
- package/src/skills/coach/workflows/test.md +0 -97
- package/src/skills/create-plan/README.md +0 -39
- package/src/skills/create-plan/algorithms/risk-assessment.md +0 -73
- package/src/skills/create-plan/knowledge/plan-completeness.md +0 -67
- package/src/skills/create-plan/knowledge/plan-lifecycle.md +0 -33
- package/src/skills/create-plan/knowledge/task-verification-pairs.md +0 -151
- package/src/skills/create-plan/knowledge/test-hygiene.md +0 -47
- package/src/skills/create-plan/scripts/validate-completeness.js +0 -182
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-2.md +0 -39
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/claude-sonnet/trial-3.md +0 -35
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/judge.json +0 -167
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -10
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-1.md +0 -26
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-2.md +0 -86
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-glm/trial-3.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-1.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001/current/meta.json +0 -119
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-001-validate-completeness.yaml +0 -41
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/judge.json +0 -164
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -3
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -13
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-1.md +0 -44
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-glm/trial-3.md +0 -49
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-1.md +0 -6
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-2.md +0 -11
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/kilo-minimax/trial-3.md +0 -16
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002/current/meta.json +0 -116
- package/src/skills/create-plan/tests/cases/TC-CREATE-PLAN-002-task-granularity.yaml +0 -39
- package/src/skills/create-plan/tests/index.yaml +0 -25
- package/src/skills/create-plan/tests/rubrics/task-granularity.md +0 -21
- package/src/skills/create-plan/tests/rubrics/validate-completeness.md +0 -21
- package/src/skills/create-plan/workflows/create.md +0 -136
- package/src/skills/create-report/README.md +0 -40
- package/src/skills/create-report/algorithms/metric-calculation.md +0 -93
- package/src/skills/create-report/knowledge/report-metrics.md +0 -82
- package/src/skills/create-report/scripts/calc-metrics.js +0 -383
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-1.md +0 -25
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-2.md +0 -26
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/claude-sonnet/trial-3.md +0 -28
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/judge.json +0 -163
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-2.md +0 -3
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-1.md +0 -8
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-2.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-glm/trial-3.md +0 -7
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-1.md +0 -12
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-2.md +0 -22
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/kilo-minimax/trial-3.md +0 -13
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001/current/meta.json +0 -115
- package/src/skills/create-report/tests/cases/TC-CREATE-REPORT-001-root-cause-attribution.yaml +0 -57
- package/src/skills/create-report/tests/index.yaml +0 -20
- package/src/skills/create-report/tests/rubrics/root-cause-attribution.md +0 -21
- package/src/skills/create-report/workflows/standard.md +0 -175
- package/src/skills/decompose-gaps/README.md +0 -39
- package/src/skills/decompose-gaps/algorithms/scope-check.md +0 -110
- package/src/skills/decompose-gaps/knowledge/scope-validation.md +0 -65
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-1.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-2.md +0 -41
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/claude-sonnet/trial-3.md +0 -56
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/judge.json +0 -164
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-deepseek/trial-3.md +0 -22
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-1.md +0 -25
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-2.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-glm/trial-3.md +0 -29
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-2.md +0 -35
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/kilo-minimax/trial-3.md +0 -18
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001/current/meta.json +0 -116
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-001-scope-exclusion.yaml +0 -46
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-1.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/claude-sonnet/trial-3.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/judge.json +0 -163
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-deepseek/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-1.md +0 -21
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-2.md +0 -38
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-glm/trial-3.md +0 -16
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-2.md +0 -10
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/kilo-minimax/trial-3.md +0 -9
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002/current/meta.json +0 -115
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-002-glob-before-write.yaml +0 -36
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-2.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/judge.json +0 -165
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-2.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-deepseek/trial-3.md +0 -5
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-2.md +0 -37
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-glm/trial-3.md +0 -45
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-1.md +0 -26
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-2.md +0 -27
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003/current/meta.json +0 -117
- package/src/skills/decompose-gaps/tests/cases/TC-DECOMPOSE-GAPS-003-parent-plan-mandatory.yaml +0 -41
- package/src/skills/decompose-gaps/tests/index.yaml +0 -30
- package/src/skills/decompose-gaps/tests/rubrics/glob-before-write.md +0 -21
- package/src/skills/decompose-gaps/tests/rubrics/parent-plan-mandatory.md +0 -22
- package/src/skills/decompose-gaps/tests/rubrics/scope-exclusion.md +0 -21
- package/src/skills/decompose-gaps/workflows/decompose.md +0 -123
- package/src/skills/decompose-plan/README.md +0 -43
- package/src/skills/decompose-plan/algorithms/deduplication.md +0 -101
- package/src/skills/decompose-plan/knowledge/atomicity-checklist.md +0 -139
- package/src/skills/decompose-plan/knowledge/capabilities.md +0 -68
- package/src/skills/decompose-plan/knowledge/human-task-rules.md +0 -82
- package/src/skills/decompose-plan/knowledge/scope-guard-checklist.md +0 -73
- package/src/skills/decompose-plan/scripts/check-atomicity-limit.js +0 -47
- package/src/skills/decompose-plan/scripts/check-duplicates.js +0 -323
- package/src/skills/decompose-plan/scripts/verify-atomicity.js +0 -408
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-1.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-2.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/claude-sonnet/trial-3.md +0 -37
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-1.md +0 -20
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-deepseek/trial-3.md +0 -28
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-1.md +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-2.md +0 -137
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-glm/trial-3.md +0 -188
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-2.md +0 -32
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/kilo-minimax/trial-3.md +0 -110
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-001-atomicity-no-1to1.yaml +0 -56
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-1.md +0 -47
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-2.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/claude-sonnet/trial-3.md +0 -43
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-1.md +0 -15
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-deepseek/trial-3.md +0 -12
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-1.md +0 -34
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-2.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-glm/trial-3.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-002-get-next-id-mandatory.yaml +0 -44
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-1.md +0 -21
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-2.md +0 -38
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-1.md +0 -31
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-2.md +0 -35
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-deepseek/trial-3.md +0 -48
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-1.md +0 -167
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-2.md +0 -62
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-glm/trial-3.md +0 -174
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-2.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-003-verbatim-dod-transfer.yaml +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-1.md +0 -55
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-2.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/claude-sonnet/trial-3.md +0 -49
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-1.md +0 -104
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-2.md +0 -45
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-deepseek/trial-3.md +0 -58
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-1.md +0 -193
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-2.md +0 -202
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-glm/trial-3.md +0 -155
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-1.md +0 -52
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-2.md +0 -17
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-004-executor-atomicity.yaml +0 -64
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-1.md +0 -59
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-2.md +0 -204
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/claude-sonnet/trial-3.md +0 -213
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-2.md +0 -57
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-deepseek/trial-3.md +0 -54
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-1.md +0 -147
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-2.md +0 -165
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-glm/trial-3.md +0 -133
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-1.md +0 -81
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-2.md +0 -108
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/kilo-minimax/trial-3.md +0 -3
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005/current/meta.json +0 -114
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-005-capabilities-registry.yaml +0 -78
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-1.md +0 -225
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-2.md +0 -66
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/claude-sonnet/trial-3.md +0 -36
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/judge.json +0 -163
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-1.md +0 -42
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-2.md +0 -67
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-deepseek/trial-3.md +0 -40
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-1.md +0 -122
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-2.md +0 -131
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-glm/trial-3.md +0 -138
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-1.md +0 -41
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-2.md +0 -88
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/kilo-minimax/trial-3.md +0 -0
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006/current/meta.json +0 -115
- package/src/skills/decompose-plan/tests/cases/TC-DECOMPOSE-PLAN-006-dod-threshold.yaml +0 -72
- package/src/skills/decompose-plan/tests/index.yaml +0 -45
- package/src/skills/decompose-plan/tests/rubrics/atomicity-no-1to1.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/capabilities-registry.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/dod-threshold.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/executor-atomicity.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/get-next-id-mandatory.md +0 -21
- package/src/skills/decompose-plan/tests/rubrics/verbatim-dod-transfer.md +0 -21
- package/src/skills/decompose-plan/workflows/decompose.md +0 -305
- package/src/skills/deep-research/README.md +0 -36
- package/src/skills/deep-research/algorithms/source-scoring.md +0 -63
- package/src/skills/deep-research/algorithms/synthesis.md +0 -67
- package/src/skills/deep-research/knowledge/data-validation.md +0 -44
- package/src/skills/deep-research/knowledge/perplexity-config.md +0 -30
- package/src/skills/deep-research/knowledge/research-methodology.md +0 -54
- package/src/skills/deep-research/knowledge/source-evaluation.md +0 -33
- package/src/skills/deep-research/scripts/perplexity-research.js +0 -315
- package/src/skills/deep-research/templates/brief-summary.md +0 -25
- package/src/skills/deep-research/templates/research-report.md +0 -76
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-1.md +0 -48
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-2.md +0 -88
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/claude-haiku/trial-3.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/judge.json +0 -163
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-1.md +0 -58
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-2.md +0 -249
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-free/trial-3.md +0 -44
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-1.md +0 -96
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-2.md +0 -56
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm/trial-3.md +0 -94
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-2.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/kilo-glm-air/trial-3.md +0 -1
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001/current/meta.json +0 -115
- package/src/skills/deep-research/tests/cases/TC-DEEP-RESEARCH-001-self-check-url.yaml +0 -58
- package/src/skills/deep-research/tests/index.yaml +0 -20
- package/src/skills/deep-research/tests/rubrics/self-check-url.md +0 -34
- package/src/skills/deep-research/workflows/base-checklist.md +0 -19
- package/src/skills/deep-research/workflows/benchmark.md +0 -38
- package/src/skills/deep-research/workflows/competitor.md +0 -44
- package/src/skills/deep-research/workflows/custom.md +0 -32
- package/src/skills/deep-research/workflows/market.md +0 -44
- package/src/skills/deep-research/workflows/technology.md +0 -40
- package/src/skills/deep-research/workflows/trend.md +0 -40
- package/src/skills/execute-task/README.md +0 -44
- package/src/skills/execute-task/algorithms/execution-strategy.md +0 -136
- package/src/skills/execute-task/knowledge/context-checkpoints.md +0 -75
- package/src/skills/execute-task/knowledge/ticket-structure.md +0 -70
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-free/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/kilo-glm-air/trial-3.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-001-no-ticket-creation.yaml +0 -48
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-2.md +0 -6
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-1.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-free/trial-3.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-1.md +0 -9
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-2.md +0 -26
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-002-no-duplicate-dod.yaml +0 -44
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/judge.json +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003/current/meta.json +0 -37
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-003-verification-proportionality.yaml +0 -46
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-1.md +0 -18
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-2.md +0 -16
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/claude-haiku/trial-3.md +0 -14
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-free/trial-3.md +0 -1
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-1.md +0 -8
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004/current/meta.json +0 -89
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-004-no-foreign-ticket-edit.yaml +0 -50
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-1.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-2.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/claude-haiku/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/judge.json +0 -124
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-1.md +0 -15
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-2.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-free/trial-3.md +0 -5
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-1.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-2.md +0 -11
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/kilo-glm-air/trial-3.md +0 -4
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005/current/meta.json +0 -88
- package/src/skills/execute-task/tests/cases/TC-EXECUTE-TASK-005-ticket-fields-updated.yaml +0 -39
- package/src/skills/execute-task/tests/fixtures/IMPL-902-create-file.md +0 -41
- package/src/skills/execute-task/tests/fixtures/IMPL-904-current-task.md +0 -40
- package/src/skills/execute-task/tests/fixtures/IMPL-906-fill-ticket.md +0 -42
- package/src/skills/execute-task/tests/fixtures/QA-901-button-click.md +0 -41
- package/src/skills/execute-task/tests/fixtures/QA-903-visual-figma.md +0 -40
- package/src/skills/execute-task/tests/fixtures/TASK-905-done-with-typo.md +0 -36
- package/src/skills/execute-task/tests/index.yaml +0 -39
- package/src/skills/execute-task/tests/rubrics/no-duplicate-dod.md +0 -22
- package/src/skills/execute-task/tests/rubrics/no-foreign-ticket-edit.md +0 -20
- package/src/skills/execute-task/tests/rubrics/no-ticket-creation.md +0 -21
- package/src/skills/execute-task/tests/rubrics/ticket-fields-updated.md +0 -23
- package/src/skills/execute-task/tests/rubrics/verification-proportionality.md +0 -22
- package/src/skills/execute-task/workflows/execute.md +0 -104
- package/src/skills/manual-testing/README.md +0 -63
- package/src/skills/manual-testing/algorithms/blocked-tool-strategy.md +0 -74
- package/src/skills/manual-testing/algorithms/bug-severity.md +0 -73
- package/src/skills/manual-testing/algorithms/mcp-budget.md +0 -97
- package/src/skills/manual-testing/algorithms/test-prioritization.md +0 -69
- package/src/skills/manual-testing/knowledge/browser-extension-testing.md +0 -102
- package/src/skills/manual-testing/knowledge/browser-tools.md +0 -114
- package/src/skills/manual-testing/knowledge/desktop-tools-advanced.md +0 -92
- package/src/skills/manual-testing/knowledge/desktop-tools-core.md +0 -76
- package/src/skills/manual-testing/knowledge/sandbox-advanced.md +0 -83
- package/src/skills/manual-testing/knowledge/sandbox-core.md +0 -67
- package/src/skills/manual-testing/knowledge/stateful-edge-cases.md +0 -69
- package/src/skills/manual-testing/knowledge/test-case-design.md +0 -107
- package/src/skills/manual-testing/knowledge/testing-types.md +0 -45
- package/src/skills/manual-testing/templates/bug-report.md +0 -52
- package/src/skills/manual-testing/templates/test-case.md +0 -34
- package/src/skills/manual-testing/templates/test-plan.md +0 -97
- package/src/skills/manual-testing/templates/test-session-report.md +0 -56
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-1.md +0 -34
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/claude-sonnet/trial-3.md +0 -30
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-1.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-deepseek/trial-3.md +0 -0
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-1.md +0 -4
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-glm/trial-3.md +0 -8
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-1.md +0 -5
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-2.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/kilo-minimax/trial-3.md +0 -7
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-001-sandbox-mandatory.yaml +0 -38
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-1.md +0 -44
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-2.md +0 -32
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/claude-sonnet/trial-3.md +0 -47
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/judge.json +0 -163
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-deepseek/trial-3.md +0 -24
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-1.md +0 -19
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-2.md +0 -13
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-glm/trial-3.md +0 -18
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-1.md +0 -21
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-2.md +0 -15
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/kilo-minimax/trial-3.md +0 -14
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002/current/meta.json +0 -114
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-002-visual-tc-screenshot.yaml +0 -37
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-1.md +0 -76
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-2.md +0 -71
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/claude-sonnet/trial-3.md +0 -85
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/judge.json +0 -46
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003/current/meta.json +0 -36
- package/src/skills/manual-testing/tests/cases/TC-MANUAL-TESTING-003-qa-non-ui-assertion.yaml +0 -65
- package/src/skills/manual-testing/tests/index.yaml +0 -30
- package/src/skills/manual-testing/tests/last-run-tc001-sonnet.log +0 -140
- package/src/skills/manual-testing/tests/last-run-tc002.log +0 -1
- package/src/skills/manual-testing/tests/last-run.log +0 -1469
- package/src/skills/manual-testing/tests/rubrics/qa-non-ui-assertion.md +0 -31
- package/src/skills/manual-testing/tests/rubrics/sandbox-mandatory.md +0 -20
- package/src/skills/manual-testing/tests/rubrics/visual-tc-screenshot.md +0 -21
- package/src/skills/manual-testing/workflows/acceptance.md +0 -80
- package/src/skills/manual-testing/workflows/exploratory.md +0 -84
- package/src/skills/manual-testing/workflows/regression.md +0 -76
- package/src/skills/manual-testing/workflows/smoke.md +0 -109
- package/src/skills/manual-testing/workflows/test-plan.md +0 -75
- package/src/skills/review-result/README.md +0 -59
- package/src/skills/review-result/algorithms/verification.md +0 -112
- package/src/skills/review-result/knowledge/dod-patterns.md +0 -115
- package/src/skills/review-result/knowledge/test-hygiene.md +0 -44
- package/src/skills/review-result/scripts/verify-artifacts.js +0 -497
- package/src/skills/review-result/templates/verdict.md +0 -153
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-haiku/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-1.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-2.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/claude-sonnet/trial-3.md +0 -18
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/judge.json +0 -164
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-deepseek/trial-3.md +0 -6
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-1.md +0 -49
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-2.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-glm/trial-3.md +0 -37
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-1.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-2.md +0 -13
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/kilo-minimax/trial-3.md +0 -21
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001/current/meta.json +0 -116
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-001-visual-tc-trigger.yaml +0 -51
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-1.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-2.md +0 -22
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-haiku/trial-3.md +0 -28
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-2.md +0 -36
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/claude-sonnet/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-1.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-2.md +0 -0
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-deepseek/trial-3.md +0 -4
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-1.md +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-2.md +0 -25
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-glm/trial-3.md +0 -32
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-1.md +0 -34
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-2.md +0 -8
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/kilo-minimax/trial-3.md +0 -23
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-002-path-line-suffix.yaml +0 -39
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-1.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-2.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/claude-sonnet/trial-3.md +0 -7
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/judge.json +0 -163
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-1.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-2.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-deepseek/trial-3.md +0 -11
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-1.md +0 -16
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-2.md +0 -18
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-glm/trial-3.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-1.md +0 -17
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-2.md +0 -31
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/kilo-minimax/trial-3.md +0 -5
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003/current/meta.json +0 -115
- package/src/skills/review-result/tests/cases/TC-REVIEW-RESULT-003-test-isolation.yaml +0 -50
- package/src/skills/review-result/tests/fixtures/IMPL-902-path-with-line.md +0 -43
- package/src/skills/review-result/tests/fixtures/QA-901-visual-button.md +0 -46
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/QA-904.md +0 -51
- package/src/skills/review-result/tests/fixtures/QA-904-test-isolation-violation/example-test.mjs +0 -36
- package/src/skills/review-result/tests/index.yaml +0 -30
- package/src/skills/review-result/tests/rubrics/path-line-suffix.md +0 -19
- package/src/skills/review-result/tests/rubrics/test-isolation.md +0 -20
- package/src/skills/review-result/tests/rubrics/visual-tc-trigger.md +0 -19
- package/src/skills/review-result/workflows/review.md +0 -209
package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-2.md
DELETED
|
@@ -1,103 +0,0 @@
|
|
|
1
|
-
## Анализ прогресса PLAN-001 — Верификация атрибуции QA-001
|
|
2
|
-
|
|
3
|
-
---
|
|
4
|
-
|
|
5
|
-
## Executive Summary
|
|
6
|
-
|
|
7
|
-
Отчёт REPORT-002 содержит **некорректную атрибуцию root cause** для пропуска тикета QA-001. По данным лога пайплайна, стейдж `check-conditions` (файл `check-conditions.js`) отработал корректно и вернул `conditions_ok`. Решение о пропуске принял следующий стейдж — `check-relevance` (`check-relevance.js`), выставив статус `irrelevant` по причине `dependencies_inactive`. Рекомендация из REPORT-002 направлена на исправление не того компонента, что делает её недействительной и потенциально опасной.
|
|
8
|
-
|
|
9
|
-
---
|
|
10
|
-
|
|
11
|
-
## Верификация findings по логу пайплайна
|
|
12
|
-
|
|
13
|
-
### Трассировка шагов QA-001
|
|
14
|
-
|
|
15
|
-
| Step | Стейдж | Скрипт | Результат | Верификация |
|
|
16
|
-
|------|--------|--------|-----------|-------------|
|
|
17
|
-
| 312 | `pick-next-task` | `script-pick` | ticket=QA-001, status=picked | ✅ Лог строка 4: `Selected ticket: QA-001 (plan PLAN-001, status=ready)` |
|
|
18
|
-
| 313 | `check-conditions` | `check-conditions.js` | `conditions_ok` | ✅ Лог строки 13–18: все три проверки прошли, result=conditions_ok |
|
|
19
|
-
| 314 | `check-relevance` | `check-relevance.js` | `irrelevant`, reason=`dependencies_inactive` | ✅ Лог строка 29: `COMPLETE ... status="irrelevant" reason="dependencies_inactive"` |
|
|
20
|
-
| 315 | `skip-ticket` | `script-skip` | QA-001 → skipped/ | ✅ Лог строка 31: `Moving QA-001 → skipped/ (reason from check-relevance: dependencies_inactive)` |
|
|
21
|
-
|
|
22
|
-
### Детальный анализ стейджа check-conditions (Step 313)
|
|
23
|
-
|
|
24
|
-
```
|
|
25
|
-
[check-conditions] dependencies.resolved: true ← зависимости разрешены
|
|
26
|
-
[check-conditions] prerequisites.met: true ← предусловия выполнены
|
|
27
|
-
[check-conditions] blocking_tickets: [] ← блокеров нет
|
|
28
|
-
[check-conditions] Result: conditions_ok
|
|
29
|
-
```
|
|
30
|
-
|
|
31
|
-
**Вывод `[HIGH]`:** `check-conditions.js` принял корректное решение. Атрибуция root cause в REPORT-002 на этот компонент — **ошибочна**.
|
|
32
|
-
|
|
33
|
-
### Детальный анализ стейджа check-relevance (Step 314)
|
|
34
|
-
|
|
35
|
-
```
|
|
36
|
-
[check-relevance] dependencies.status: inactive
|
|
37
|
-
[check-relevance] decision: irrelevant (dependencies inactive)
|
|
38
|
-
[check-relevance] COMPLETE status="irrelevant" reason="dependencies_inactive"
|
|
39
|
-
```
|
|
40
|
-
|
|
41
|
-
**Вывод `[HIGH]`:** Именно `check-relevance.js` принял решение о нерелевантности тикета. Причина — `dependencies.status: inactive`, тогда как на предыдущем стейдже `check-conditions` те же зависимости были оценены как `resolved: true`.
|
|
42
|
-
|
|
43
|
-
---
|
|
44
|
-
|
|
45
|
-
## Корневая причина (скорректированная)
|
|
46
|
-
|
|
47
|
-
### Противоречие между стейджами `[HIGH]`
|
|
48
|
-
|
|
49
|
-
| Стейдж | Компонент | Оценка зависимостей | Решение |
|
|
50
|
-
|--------|-----------|---------------------|---------|
|
|
51
|
-
| `check-conditions` (Step 313) | `check-conditions.js` | `dependencies.resolved: true` | `conditions_ok` — пропускает дальше |
|
|
52
|
-
| `check-relevance` (Step 314) | `check-relevance.js` | `dependencies.status: inactive` | `irrelevant` → skip |
|
|
53
|
-
|
|
54
|
-
Два стейджа оценивают зависимости одного и того же тикета QA-001 **противоположно**: один видит их как resolved, другой — как inactive. Это указывает на одну из следующих причин:
|
|
55
|
-
|
|
56
|
-
1. **`[MEDIUM]`** `check-relevance.js` использует иной источник данных или иную семантику поля `dependencies` — без доступа к исходному коду не верифицируемо.
|
|
57
|
-
2. **`[MEDIUM]`** Между стейджами 313 и 314 произошло изменение состояния зависимостей — маловероятно при timestamps с разницей < 1 секунды.
|
|
58
|
-
3. **`[LOW]`** Оба стейджа смотрят на один источник, но интерпретируют разные поля (`resolved` vs `status`) — логическая несогласованность в дизайне.
|
|
59
|
-
|
|
60
|
-
**Реальный root cause:** `check-relevance.js` — единственный компонент, принявший решение о пропуске QA-001.
|
|
61
|
-
|
|
62
|
-
---
|
|
63
|
-
|
|
64
|
-
## Findings: исправленная атрибуция
|
|
65
|
-
|
|
66
|
-
### Finding 1 — Некорректный skip QA-001
|
|
67
|
-
|
|
68
|
-
- **Статус в итерации:** skipped
|
|
69
|
-
- **Root cause (REPORT-002):** `check-conditions.js` — **НЕВЕРНО** `[HIGH]`
|
|
70
|
-
- **Root cause (скорректированный):** `check-relevance.js` — стейдж вернул `irrelevant` с reason=`dependencies_inactive`, несмотря на то что `check-conditions` подтвердил `conditions_ok` `[HIGH]`
|
|
71
|
-
- **Затронутый компонент:** `.workflow/src/scripts/check-relevance.js`
|
|
72
|
-
- **Приоритет:** HIGH
|
|
73
|
-
|
|
74
|
-
### Finding 2 — Противоречивая семантика зависимостей между стейджами
|
|
75
|
-
|
|
76
|
-
- **Описание:** `check-conditions.js` и `check-relevance.js` используют несогласованные метрики/источники для оценки зависимостей одного тикета `[MEDIUM]`
|
|
77
|
-
- **Риск:** любой тикет с «частично разрешёнными» зависимостями может быть ошибочно пропущен
|
|
78
|
-
- **Приоритет:** HIGH
|
|
79
|
-
|
|
80
|
-
---
|
|
81
|
-
|
|
82
|
-
## Рекомендации
|
|
83
|
-
|
|
84
|
-
| # | Действие | Компонент | Приоритет | Ожидаемый результат |
|
|
85
|
-
|---|----------|-----------|-----------|---------------------|
|
|
86
|
-
| 1 | Исправить логику оценки зависимостей в `check-relevance.js`: привести семантику `dependencies.status` в соответствие с `check-conditions.js` | `check-relevance.js` | **HIGH** | Устранение ложных срабатываний `irrelevant` при корректных зависимостях |
|
|
87
|
-
| 2 | Добавить cross-check: если `check-conditions` вернул `conditions_ok`, `check-relevance` не должен принимать решение на основе `dependencies` без явного изменения их состояния | Архитектура пайплайна | **HIGH** | Устранение класса противоречий между стейджами |
|
|
88
|
-
| 3 | Отозвать рекомендацию из REPORT-002 («пересмотреть пороги `check-conditions.js`») — она основана на ошибочной атрибуции и может привести к регрессии | REPORT-002 | **MEDIUM** | Предотвращение ненужных изменений в корректно работающем компоненте |
|
|
89
|
-
| 4 | Добавить интеграционный тест: сценарий, где `check-conditions` → `conditions_ok`, а `check-relevance` → `irrelevant`, должен логировать предупреждение или требовать явного объяснения | Тест-сьют | **MEDIUM** | Детектирование аналогичных противоречий в будущих итерациях |
|
|
90
|
-
|
|
91
|
-
---
|
|
92
|
-
|
|
93
|
-
## За пределами скоупа
|
|
94
|
-
|
|
95
|
-
Остальные 4 тикета итерации не анализировались — данных по ним в предоставленных артефактах нет.
|
|
96
|
-
|
|
97
|
-
---
|
|
98
|
-
|
|
99
|
-
---RESULT---
|
|
100
|
-
status: has_gaps
|
|
101
|
-
report_id: REPORT-002
|
|
102
|
-
gaps: "Некорректная атрибуция root cause в REPORT-002: `check-conditions.js` не причастен к пропуску QA-001 — решение принял `check-relevance.js` (reason=dependencies_inactive). Рекомендация из REPORT-002 направлена на неверный компонент; требуется исправление `check-relevance.js` и согласование семантики зависимостей между стейджами."
|
|
103
|
-
---RESULT---
|
package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/claude-sonnet/trial-3.md
DELETED
|
@@ -1,99 +0,0 @@
|
|
|
1
|
-
## Анализ прогресса PLAN-001 — Верификация атрибуции QA-001
|
|
2
|
-
|
|
3
|
-
**Дата анализа:** 2026-04-20
|
|
4
|
-
**Источники:** REPORT-002-incorrect-attribution.md, pipeline-2026-04-06_qa-001-skip.log
|
|
5
|
-
|
|
6
|
-
---
|
|
7
|
-
|
|
8
|
-
## Executive Summary
|
|
9
|
-
|
|
10
|
-
Верификация по логу пайплайна показала, что root cause в REPORT-002 атрибутирован **некорректно**. Стейдж `check-conditions` (шаг 313) вернул `conditions_ok` — QA-001 прошёл все проверки условий. Решение о skip принял стейдж `check-relevance` (шаг 314, строка с `decision=irrelevant, reason=dependencies_inactive`). REPORT-002 содержит ошибочное указание на `check-conditions.js` как источник проблемы, что ведёт к некорректной рекомендации. QA-001 остаётся невыполненным — план имеет пробел.
|
|
11
|
-
|
|
12
|
-
---
|
|
13
|
-
|
|
14
|
-
## Верификация findings по логу
|
|
15
|
-
|
|
16
|
-
### Трассировка обработки QA-001
|
|
17
|
-
|
|
18
|
-
| Шаг | Стейдж | Компонент | Результат | Атрибуция проблемы |
|
|
19
|
-
|-----|--------|-----------|-----------|-------------------|
|
|
20
|
-
| 312 | `pick-next-task` | `script-pick` | `status=picked` — QA-001 выбран | — |
|
|
21
|
-
| 313 | `check-conditions` | `check-conditions.js` | `conditions_ok` — все условия пройдены | ❌ **Ложная атрибуция в REPORT-002** |
|
|
22
|
-
| 314 | `check-relevance` | `check-relevance.js` | `status=irrelevant`, `reason=dependencies_inactive` | ✅ **Реальный источник skip** |
|
|
23
|
-
| 315 | `skip-ticket` | `script-skip` | QA-001 → skipped/, причина из `check-relevance` | Следствие шага 314 |
|
|
24
|
-
|
|
25
|
-
### Детали шага 313 — `check-conditions` (лог)
|
|
26
|
-
|
|
27
|
-
```
|
|
28
|
-
- dependencies.resolved: true
|
|
29
|
-
- prerequisites.met: true
|
|
30
|
-
- blocking_tickets: []
|
|
31
|
-
Result: conditions_ok
|
|
32
|
-
```
|
|
33
|
-
|
|
34
|
-
**Вывод [HIGH]:** `check-conditions.js` выполнил проверку корректно. Зависимости резолвлены, пререквизиты выполнены, блокирующих тикетов нет. Компонент не является источником проблемы.
|
|
35
|
-
|
|
36
|
-
### Детали шага 314 — `check-relevance` (лог)
|
|
37
|
-
|
|
38
|
-
```
|
|
39
|
-
- dependencies.status: inactive
|
|
40
|
-
- decision: irrelevant (dependencies inactive)
|
|
41
|
-
COMPLETE status="irrelevant" reason="dependencies_inactive"
|
|
42
|
-
```
|
|
43
|
-
|
|
44
|
-
**Вывод [HIGH]:** Стейдж `check-relevance` получил `dependencies.status: inactive` и принял решение `irrelevant`, инициировав skip. Это противоречит результату шага 313, где `dependencies.resolved: true`.
|
|
45
|
-
|
|
46
|
-
---
|
|
47
|
-
|
|
48
|
-
## Ключевые находки
|
|
49
|
-
|
|
50
|
-
### F1 — Некорректная атрибуция root cause в REPORT-002 [HIGH]
|
|
51
|
-
|
|
52
|
-
- **Факт:** REPORT-002 указывает `check-conditions.js` как причину skip QA-001
|
|
53
|
-
- **Опровержение по логу:** `check-conditions.js` вернул `conditions_ok` на шаге 313
|
|
54
|
-
- **Реальная причина:** `check-relevance.js` на шаге 314 вернул `irrelevant` по `dependencies_inactive`
|
|
55
|
-
- **Следствие:** Рекомендация из REPORT-002 («пересмотреть пороги `check-conditions.js`») — **неверная**; применение этой рекомендации не устранит проблему
|
|
56
|
-
|
|
57
|
-
### F2 — Противоречие между `check-conditions` и `check-relevance` [HIGH]
|
|
58
|
-
|
|
59
|
-
- **Факт:** Шаг 313 — `dependencies.resolved: true`; Шаг 314 — `dependencies.status: inactive`
|
|
60
|
-
- **Интерпретация [MEDIUM]:** Два стейджа используют разные источники или разную семантику поля `dependencies`. Либо `check-conditions` проверяет факт наличия зависимостей (есть/нет), а `check-relevance` — их активность (active/inactive). Либо рассинхронизация данных между шагами.
|
|
61
|
-
- **Следствие:** Тикет QA-001 прошёл conditions-проверку, но был отклонён по relevance на основании состояния, которое conditions-проверка не охватывает
|
|
62
|
-
|
|
63
|
-
### F3 — QA-001 не выполнен, план имеет пробел [HIGH]
|
|
64
|
-
|
|
65
|
-
- **Факт:** QA-001 находится в `skipped/`, задача не выполнена
|
|
66
|
-
- **Следствие для PLAN-001:** Итерация закрыта без выполнения QA-001; прогресс по плану неполный
|
|
67
|
-
|
|
68
|
-
---
|
|
69
|
-
|
|
70
|
-
## Скорректированная атрибуция
|
|
71
|
-
|
|
72
|
-
| Компонент | Атрибуция в REPORT-002 | Корректная атрибуция | Обоснование |
|
|
73
|
-
|-----------|----------------------|---------------------|-------------|
|
|
74
|
-
| `check-conditions.js` | Root cause | Не причастен | Вернул `conditions_ok`, шаг 313 |
|
|
75
|
-
| `check-relevance.js` | Не упомянут | **Root cause** | Вернул `irrelevant/dependencies_inactive`, шаг 314 |
|
|
76
|
-
| `skip-ticket` | Не упомянут | Исполнитель skip | Выполнил skip по решению `check-relevance`, шаг 315 |
|
|
77
|
-
|
|
78
|
-
---
|
|
79
|
-
|
|
80
|
-
## Рекомендации
|
|
81
|
-
|
|
82
|
-
| # | Действие | Приоритет | Ожидаемый результат |
|
|
83
|
-
|---|----------|-----------|---------------------|
|
|
84
|
-
| 1 | Исследовать `check-relevance.js`: почему `dependencies.status=inactive` при уже резолвленных `dependencies.resolved=true` | **CRITICAL** | Устранение реальной причины skip QA-001 |
|
|
85
|
-
| 2 | Выявить семантическое расхождение между полями `dependencies.resolved` и `dependencies.status` в двух стейджах | **HIGH** | Устранение потенциальной системной ошибки классификации тикетов |
|
|
86
|
-
| 3 | Переоткрыть QA-001 и выполнить в следующей итерации после устранения причины | **HIGH** | Закрытие пробела в PLAN-001 |
|
|
87
|
-
| 4 | Обновить REPORT-002 с корректной атрибуцией (или создать REPORT-002-corrected) | **MEDIUM** | Устранение ложного следа для будущих ретроспектив |
|
|
88
|
-
|
|
89
|
-
---
|
|
90
|
-
|
|
91
|
-
## За пределами скоупа
|
|
92
|
-
|
|
93
|
-
Лог содержит только итерацию обработки QA-001 (шаги 312–315). Состояние остальных 4 тикетов итерации не верифицировано по логу — данных нет.
|
|
94
|
-
|
|
95
|
-
---RESULT---
|
|
96
|
-
status: has_gaps
|
|
97
|
-
report_id: REPORT-002
|
|
98
|
-
gaps: "QA-001 не выполнен (skipped по некорректной причине в check-relevance.js); root cause атрибуция в REPORT-002 ошибочна — требуется исследование check-relevance.js и переоткрытие QA-001"
|
|
99
|
-
---RESULT---
|
|
@@ -1,163 +0,0 @@
|
|
|
1
|
-
{
|
|
2
|
-
"per_model": {
|
|
3
|
-
"claude-sonnet": {
|
|
4
|
-
"pass_count": 3,
|
|
5
|
-
"total": 3,
|
|
6
|
-
"trials": [
|
|
7
|
-
{
|
|
8
|
-
"trial": 1,
|
|
9
|
-
"score": 5,
|
|
10
|
-
"passed": true
|
|
11
|
-
},
|
|
12
|
-
{
|
|
13
|
-
"trial": 2,
|
|
14
|
-
"score": 5,
|
|
15
|
-
"passed": true
|
|
16
|
-
},
|
|
17
|
-
{
|
|
18
|
-
"trial": 3,
|
|
19
|
-
"score": 5,
|
|
20
|
-
"passed": true
|
|
21
|
-
}
|
|
22
|
-
]
|
|
23
|
-
},
|
|
24
|
-
"kilo-glm": {
|
|
25
|
-
"pass_count": 3,
|
|
26
|
-
"total": 3,
|
|
27
|
-
"trials": [
|
|
28
|
-
{
|
|
29
|
-
"trial": 1,
|
|
30
|
-
"score": 5,
|
|
31
|
-
"passed": true
|
|
32
|
-
},
|
|
33
|
-
{
|
|
34
|
-
"trial": 2,
|
|
35
|
-
"score": 5,
|
|
36
|
-
"passed": true
|
|
37
|
-
},
|
|
38
|
-
{
|
|
39
|
-
"trial": 3,
|
|
40
|
-
"score": 5,
|
|
41
|
-
"passed": true
|
|
42
|
-
}
|
|
43
|
-
]
|
|
44
|
-
},
|
|
45
|
-
"kilo-minimax": {
|
|
46
|
-
"pass_count": 3,
|
|
47
|
-
"total": 3,
|
|
48
|
-
"trials": [
|
|
49
|
-
{
|
|
50
|
-
"trial": 1,
|
|
51
|
-
"score": 5,
|
|
52
|
-
"passed": true
|
|
53
|
-
},
|
|
54
|
-
{
|
|
55
|
-
"trial": 2,
|
|
56
|
-
"score": 4,
|
|
57
|
-
"passed": true
|
|
58
|
-
},
|
|
59
|
-
{
|
|
60
|
-
"trial": 3,
|
|
61
|
-
"score": 5,
|
|
62
|
-
"passed": true
|
|
63
|
-
}
|
|
64
|
-
]
|
|
65
|
-
},
|
|
66
|
-
"kilo-deepseek": {
|
|
67
|
-
"pass_count": 3,
|
|
68
|
-
"total": 3,
|
|
69
|
-
"trials": [
|
|
70
|
-
{
|
|
71
|
-
"trial": 1,
|
|
72
|
-
"score": 5,
|
|
73
|
-
"passed": true
|
|
74
|
-
},
|
|
75
|
-
{
|
|
76
|
-
"trial": 2,
|
|
77
|
-
"score": 5,
|
|
78
|
-
"passed": true
|
|
79
|
-
},
|
|
80
|
-
{
|
|
81
|
-
"trial": 3,
|
|
82
|
-
"score": 5,
|
|
83
|
-
"passed": true
|
|
84
|
-
}
|
|
85
|
-
]
|
|
86
|
-
}
|
|
87
|
-
},
|
|
88
|
-
"rubric_scores": [
|
|
89
|
-
{
|
|
90
|
-
"agentId": "claude-sonnet",
|
|
91
|
-
"trial": 1,
|
|
92
|
-
"score": 5,
|
|
93
|
-
"errored": false
|
|
94
|
-
},
|
|
95
|
-
{
|
|
96
|
-
"agentId": "claude-sonnet",
|
|
97
|
-
"trial": 2,
|
|
98
|
-
"score": 5,
|
|
99
|
-
"errored": false
|
|
100
|
-
},
|
|
101
|
-
{
|
|
102
|
-
"agentId": "claude-sonnet",
|
|
103
|
-
"trial": 3,
|
|
104
|
-
"score": 5,
|
|
105
|
-
"errored": false
|
|
106
|
-
},
|
|
107
|
-
{
|
|
108
|
-
"agentId": "kilo-deepseek",
|
|
109
|
-
"trial": 1,
|
|
110
|
-
"score": 5,
|
|
111
|
-
"errored": false
|
|
112
|
-
},
|
|
113
|
-
{
|
|
114
|
-
"agentId": "kilo-deepseek",
|
|
115
|
-
"trial": 2,
|
|
116
|
-
"score": 5,
|
|
117
|
-
"errored": false
|
|
118
|
-
},
|
|
119
|
-
{
|
|
120
|
-
"agentId": "kilo-deepseek",
|
|
121
|
-
"trial": 3,
|
|
122
|
-
"score": 5,
|
|
123
|
-
"errored": false
|
|
124
|
-
},
|
|
125
|
-
{
|
|
126
|
-
"agentId": "kilo-glm",
|
|
127
|
-
"trial": 1,
|
|
128
|
-
"score": 5,
|
|
129
|
-
"errored": false
|
|
130
|
-
},
|
|
131
|
-
{
|
|
132
|
-
"agentId": "kilo-glm",
|
|
133
|
-
"trial": 2,
|
|
134
|
-
"score": 5,
|
|
135
|
-
"errored": false
|
|
136
|
-
},
|
|
137
|
-
{
|
|
138
|
-
"agentId": "kilo-glm",
|
|
139
|
-
"trial": 3,
|
|
140
|
-
"score": 5,
|
|
141
|
-
"errored": false
|
|
142
|
-
},
|
|
143
|
-
{
|
|
144
|
-
"agentId": "kilo-minimax",
|
|
145
|
-
"trial": 1,
|
|
146
|
-
"score": 5,
|
|
147
|
-
"errored": false
|
|
148
|
-
},
|
|
149
|
-
{
|
|
150
|
-
"agentId": "kilo-minimax",
|
|
151
|
-
"trial": 2,
|
|
152
|
-
"score": 4,
|
|
153
|
-
"errored": false
|
|
154
|
-
},
|
|
155
|
-
{
|
|
156
|
-
"agentId": "kilo-minimax",
|
|
157
|
-
"trial": 3,
|
|
158
|
-
"score": 5,
|
|
159
|
-
"errored": false
|
|
160
|
-
}
|
|
161
|
-
],
|
|
162
|
-
"timestamp": "2026-04-20T12:18:36.988Z"
|
|
163
|
-
}
|
package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-1.md
DELETED
|
@@ -1,89 +0,0 @@
|
|
|
1
|
-
# Анализ: прогресс плана PLAN-001
|
|
2
|
-
|
|
3
|
-
**Дата анализа:** 2026-04-20
|
|
4
|
-
**Анализируемый план:** PLAN-001
|
|
5
|
-
**Анализируемый отчёт:** REPORT-002
|
|
6
|
-
**Тип анализа:** PROGRESS
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
## Executive Summary
|
|
11
|
-
|
|
12
|
-
За итерацию обработано 5 тикетов. Прогресс плана формально соответствует целевым показателям (Completion Rate 80%, Block Rate 0%), однако один тикет (QA-001) был ошибочно пропущен пайплайном. В предыдущем отчёте root cause проблемы неверно атрибутирован скрипту `check-conditions.js`. Верификация по логу пайплайна показывает, что корректный root cause — скрипт `check-relevance.js`, который принял решение о неактуальности тикета из-за статуса зависимостей. Требуется исправление логики определения актуальности для предотвращения подобных пропусков.
|
|
13
|
-
|
|
14
|
-
---
|
|
15
|
-
|
|
16
|
-
## Метрики
|
|
17
|
-
|
|
18
|
-
| Метрика | Значение | Статус |
|
|
19
|
-
|---------|----------|--------|
|
|
20
|
-
| Completion Rate | 80% | 🟢 |
|
|
21
|
-
| Weighted Completion | N/A (нет данных о сложности) | — |
|
|
22
|
-
| First-Pass Rate | N/A (нет данных о ревью) | — |
|
|
23
|
-
| Block Rate | 0% | 🟢 |
|
|
24
|
-
| Тренд | ➡️ (данных за предыдущие периоды нет) | — |
|
|
25
|
-
|
|
26
|
-
## Распределение задач
|
|
27
|
-
|
|
28
|
-
| Статус | Количество | % |
|
|
29
|
-
|--------|-----------|---|
|
|
30
|
-
| Done | 4 | 80% |
|
|
31
|
-
| In Progress | 0 | 0% |
|
|
32
|
-
| Ready | 0 | 0% |
|
|
33
|
-
| Blocked | 0 | 0% |
|
|
34
|
-
| Backlog | 0 | 0% |
|
|
35
|
-
| Skipped | 1 | 20% |
|
|
36
|
-
|
|
37
|
-
*Примечание: распределение восстановлено на основе утверждения «обработано 5 тикетов» из REPORT-002 и факта пропуска QA-001.*
|
|
38
|
-
|
|
39
|
-
---
|
|
40
|
-
|
|
41
|
-
## Ключевые находки
|
|
42
|
-
|
|
43
|
-
### 1. Неверная атрибуция root cause в предыдущем отчёте
|
|
44
|
-
**Уверенность:** [HIGH]
|
|
45
|
-
**Данные:** В отчёте REPORT-002 root cause проблемы пропуска QA-001 указан как `check-conditions.js` (стр. 16–18). Лог пайплайна (стр. 12–17) показывает, что `check-conditions.js` корректно определил условия как выполненные (`dependencies.resolved: true`, `prerequisites.met: true`, `blocking_tickets: []`). Решение о пропуске принял стейдж `check-relevance` (стр. 23–26 лога: `decision: irrelevant (dependencies inactive)`).
|
|
46
|
-
|
|
47
|
-
### 2. Ошибочное определение актуальности тикета
|
|
48
|
-
**Уверенность:** [HIGH]
|
|
49
|
-
**Данные:** Стейдж `check-relevance` вернул статус `irrelevant` с причиной `dependencies_inactive` (стр. 24–25 лога). Это привело к переходу на стейдж `skip-ticket` и перемещению тикета в `skipped/` (стр. 31). При этом тикет был отобран для обработки (`status=ready`), а его зависимости разрешены согласно `check-conditions`. Противоречие между «разрешены» и «неактивны» указывает на ошибку логики в `check-relevance.js`.
|
|
50
|
-
|
|
51
|
-
---
|
|
52
|
-
|
|
53
|
-
## Проблемы и риски
|
|
54
|
-
|
|
55
|
-
| # | Проблема | Серьёзность | Данные | Рекомендация |
|
|
56
|
-
|---|---------|-------------|--------|-------------|
|
|
57
|
-
| 1 | Ошибочный пропуск тикета из-за некорректной работы `check-relevance.js` | HIGH | Лог пайплайна (стр. 23–26): `decision: irrelevant (dependencies inactive)` при разрешённых зависимостях. | Исправить логику определения актуальности зависимостей в `check-relevance.js`. |
|
|
58
|
-
| 2 | Неверная атрибуция root cause в отчёте REPORT-002 | MEDIUM | REPORT-002, стр. 16–18 vs лог пайплайна. | Обновить отчёт с корректной атрибуцией для точного отслеживания проблем. |
|
|
59
|
-
|
|
60
|
-
---
|
|
61
|
-
|
|
62
|
-
## Рекомендации
|
|
63
|
-
|
|
64
|
-
| # | Действие | Приоритет | Почему | Ожидаемый результат |
|
|
65
|
-
|---|---------|-----------|--------|-------------------|
|
|
66
|
-
| 1 | Исправить логику определения актуальности в `check-relevance.js` | HIGH | Скрипт ошибочно помечает тикеты с разрешёнными зависимостями как неактуальные, приводя к пропуску. | Тикеты с выполненными условиями не будут пропускаться на этапе проверки актуальности. |
|
|
67
|
-
| 2 | Обновить отчёт REPORT-002 с корректной атрибуцией root cause | MEDIUM | Неверная атрибуция мешает точному анализу и исправлению проблем пайплайна. | История инцидентов будет отражать реальные причины, упрощая диагностику. |
|
|
68
|
-
| 3 | Добавить валидацию согласованности статусов зависимостей между `check-conditions` и `check-relevance` | MEDIUM | Противоречие между «разрешены» и «неактивны» должно обнаруживаться автоматически. | Раннее выявление конфликтующих статусов предотвратит ошибочные пропуски. |
|
|
69
|
-
|
|
70
|
-
---
|
|
71
|
-
|
|
72
|
-
## За пределами скоупа
|
|
73
|
-
|
|
74
|
-
*Данных о других тикетах плана, их сложности и результатах ревью недостаточно для расчёта взвешенного Completion Rate и First-Pass Rate. Для полной оценки прогресса необходимы исходные файлы плана и детальные отчёты по каждому тикету.*
|
|
75
|
-
|
|
76
|
-
---
|
|
77
|
-
|
|
78
|
-
## Решение
|
|
79
|
-
|
|
80
|
-
**Статус плана:** 🟢 ON_TRACK (формальные метрики в норме, но есть проблема качества пайплайна)
|
|
81
|
-
**Рекомендуемое действие:** Исправить `check-relevance.js` и обновить отчётность.
|
|
82
|
-
|
|
83
|
-
---
|
|
84
|
-
|
|
85
|
-
---RESULT---
|
|
86
|
-
status: has_gaps
|
|
87
|
-
report_id: REPORT-002
|
|
88
|
-
gaps: "Ошибочный пропуск тикета QA-001 из-за некорректной логики в check-relevance.js требует исправления скрипта и обновления атрибуции в отчёте."
|
|
89
|
-
---RESULT---
|
package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-2.md
DELETED
|
@@ -1,88 +0,0 @@
|
|
|
1
|
-
# Анализ: прогресс
|
|
2
|
-
|
|
3
|
-
**Дата анализа:** 2026-04-20
|
|
4
|
-
**Анализируемый план:** PLAN-001
|
|
5
|
-
**Анализируемый отчёт:** REPORT-002
|
|
6
|
-
**Тип анализа:** PROGRESS
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
## Executive Summary
|
|
11
|
-
|
|
12
|
-
Анализ лога пайплайна итерации от 2026-04-06 выявил ошибку атрибуции root cause в отчёте REPORT-002. Тикет QA-001 был пропущен из-за решения стейджа `check-relevance` (неактивные зависимости), а не `check-conditions`. Прогресс плана PLAN-001 затруднён из-за некорректной логики определения актуальности, что требует корректировки скрипта `check-relevance.js`.
|
|
13
|
-
|
|
14
|
-
---
|
|
15
|
-
|
|
16
|
-
## Метрики
|
|
17
|
-
|
|
18
|
-
Данных для расчёта метрик недостаточно (отсутствует общее количество тикетов плана, распределение по статусам). Из отчёта известно: обработано 5 тикетов, 1 пропущен (QA-001).
|
|
19
|
-
|
|
20
|
-
| Метрика | Значение | Статус |
|
|
21
|
-
|---------|----------|--------|
|
|
22
|
-
| Completion Rate | Данных нет | — |
|
|
23
|
-
| Weighted Completion | Данных нет | — |
|
|
24
|
-
| First-Pass Rate | Данных нет | — |
|
|
25
|
-
| Block Rate | Данных нет | — |
|
|
26
|
-
| Тренд | Данных нет | — |
|
|
27
|
-
|
|
28
|
-
## Распределение задач
|
|
29
|
-
|
|
30
|
-
| Статус | Количество | % |
|
|
31
|
-
|--------|-----------|---|
|
|
32
|
-
| Done | Нет данных | — |
|
|
33
|
-
| In Progress | Нет данных | — |
|
|
34
|
-
| Ready | Нет данных | — |
|
|
35
|
-
| Blocked | Нет данных | — |
|
|
36
|
-
| Backlog | Нет данных | — |
|
|
37
|
-
|
|
38
|
-
---
|
|
39
|
-
|
|
40
|
-
## Ключевые находки
|
|
41
|
-
|
|
42
|
-
### 1. Ошибка атрибуции root cause в отчёте REPORT-002
|
|
43
|
-
**Уверенность:** [HIGH]
|
|
44
|
-
**Данные:** В отчёте root cause для QA-001 атрибутирован `check-conditions.js`. Лог пайплайна (строка 29) показывает, что решение о skip принял стейдж `check-relevance` с reason=`dependencies_inactive`. Стейдж `check-conditions` завершился со статусом `conditions_ok`.
|
|
45
|
-
|
|
46
|
-
### 2. Некорректная логика определения актуальности в check-relevance
|
|
47
|
-
**Уверенность:** [MEDIUM]
|
|
48
|
-
**Данные:** `check-conditions` определил `dependencies.resolved: true`, тогда как `check-relevance` сообщил `dependencies.status: inactive`. Противоречие указывает на разные критерии оценки зависимостей либо на ошибку в `check-relevance`.
|
|
49
|
-
|
|
50
|
-
---
|
|
51
|
-
|
|
52
|
-
## Проблемы и риски
|
|
53
|
-
|
|
54
|
-
| # | Проблема | Серьёзность | Данные | Рекомендация |
|
|
55
|
-
|---|---------|-------------|--------|-------------|
|
|
56
|
-
| 1 | Ошибочная атрибуция root cause в отчёте | HIGH | Отчёт REPORT-002 указывает на `check-conditions.js`, лог доказывает, что решение принял `check-relevance`. | Исправить отчёт и пересмотреть методологию атрибуции root cause. |
|
|
57
|
-
| 2 | Логика стейджа `check-relevance` приводит к неправильному пропуску тикетов | HIGH | Тикет QA-001 пропущен из-за `dependencies_inactive`, хотя `check-conditions` показал resolved. | Пересмотреть алгоритм проверки зависимостей в `check-relevance.js`. |
|
|
58
|
-
|
|
59
|
-
---
|
|
60
|
-
|
|
61
|
-
## Рекомендации
|
|
62
|
-
|
|
63
|
-
| # | Действие | Приоритет | Почему | Ожидаемый результат |
|
|
64
|
-
|---|---------|-----------|--------|-------------------|
|
|
65
|
-
| 1 | Исправить root cause в отчёте REPORT-002 на `check-relevance.js` | HIGH | Ошибка атрибуции ведёт к неправильным корректирующим действиям. | Корректное определение причины пропуска тикета для последующих исправлений. |
|
|
66
|
-
| 2 | Пересмотреть логику `check-relevance.js` — согласовать с `check-conditions` по критериям зависимостей | HIGH | Противоречивые критерии приводят к некорректным пропускам. | Устранение ложных пропусков тикетов. |
|
|
67
|
-
| 3 | Добавить верификацию атрибуции root cause в процесс анализа отчётов | MEDIUM | Предотвращение аналогичных ошибок в будущем. | Повышение точности анализа. |
|
|
68
|
-
|
|
69
|
-
---
|
|
70
|
-
|
|
71
|
-
## За пределами скоупа
|
|
72
|
-
|
|
73
|
-
Нет данных о других тикетах плана, их статусах и сложности. Для полной оценки прогресса необходимо получить актуальный список тикетов PLAN-001.
|
|
74
|
-
|
|
75
|
-
---
|
|
76
|
-
|
|
77
|
-
## Решение
|
|
78
|
-
|
|
79
|
-
**Статус плана:** AT_RISK (обнаружена HIGH проблема, влияющая на корректность обработки задач)
|
|
80
|
-
**Рекомендуемое действие:** скорректировать (исправить `check-relevance.js` и обновить отчёт)
|
|
81
|
-
|
|
82
|
-
---
|
|
83
|
-
|
|
84
|
-
---RESULT---
|
|
85
|
-
status: has_gaps
|
|
86
|
-
report_id: REPORT-002
|
|
87
|
-
gaps: "Некорректная атрибуция root cause и ошибка логики check-relevance приводят к пропуску тикетов, требуют исправления отчёта и скрипта."
|
|
88
|
-
---RESULT---
|
package/src/skills/analyze-report/tests/cases/TC-ANALYZE-REPORT-001/current/kilo-deepseek/trial-3.md
DELETED
|
@@ -1,100 +0,0 @@
|
|
|
1
|
-
# Анализ: прогресс
|
|
2
|
-
|
|
3
|
-
**Дата анализа:** 2026-04-20
|
|
4
|
-
**Анализируемый план:** PLAN-001
|
|
5
|
-
**Анализируемый отчёт:** REPORT-002
|
|
6
|
-
**Тип анализа:** PROGRESS
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
## Executive Summary
|
|
11
|
-
|
|
12
|
-
Прогресс плана PLAN-001 составляет 80% по количеству выполненных тикетов (4 из 5). Обнаружена критическая ошибка атрибуции root cause в отчёте REPORT-002: причиной пропуска тикета QA-001 является стейдж `check-relevance`, а не `check-conditions`. Данные из лога пайплайна опровергают вывод предыдущего отчёта. Рекомендуется немедленно исправить логику проверки релевантности и обновить отчёт.
|
|
13
|
-
|
|
14
|
-
---
|
|
15
|
-
|
|
16
|
-
## Метрики
|
|
17
|
-
|
|
18
|
-
| Метрика | Значение | Статус |
|
|
19
|
-
|---------|----------|--------|
|
|
20
|
-
| Completion Rate | 80% | 🟢 |
|
|
21
|
-
| Weighted Completion | Данных недостаточно | — |
|
|
22
|
-
| First-Pass Rate | Данных недостаточно | — |
|
|
23
|
-
| Block Rate | 0% | 🟢 |
|
|
24
|
-
| Тренд | Данных недостаточно | — |
|
|
25
|
-
|
|
26
|
-
## Распределение задач
|
|
27
|
-
|
|
28
|
-
| Статус | Количество | % |
|
|
29
|
-
|--------|-----------|---|
|
|
30
|
-
| Done | 4 | 80% |
|
|
31
|
-
| In Progress | 0 | 0% |
|
|
32
|
-
| Ready | 0 | 0% |
|
|
33
|
-
| Blocked | 0 | 0% |
|
|
34
|
-
| Backlog | 0 | 0% |
|
|
35
|
-
| Skipped | 1 | 20% |
|
|
36
|
-
|
|
37
|
-
---
|
|
38
|
-
|
|
39
|
-
## Ключевые находки
|
|
40
|
-
|
|
41
|
-
### 1. Неверная атрибуция root cause в отчёте REPORT-002
|
|
42
|
-
**Уверенность:** [HIGH]
|
|
43
|
-
**Данные:**
|
|
44
|
-
- В отчёте REPORT-002 указан root cause: `check-conditions.js` (строка 17-18).
|
|
45
|
-
- Лог пайплайна (строки 9-17) показывает, что стейдж `check-conditions` завершился со статусом `conditions_ok` (`dependencies.resolved: true`, `prerequisites.met: true`).
|
|
46
|
-
- Решение о пропуске принял стейдж `check-relevance` (строка 24-26: `decision: irrelevant (dependencies inactive)`).
|
|
47
|
-
|
|
48
|
-
### 2. Ошибка логики стейджа check-relevance
|
|
49
|
-
**Уверенность:** [HIGH]
|
|
50
|
-
**Данные:**
|
|
51
|
-
- `check-conditions` подтвердил разрешённые зависимости (`dependencies.resolved: true`).
|
|
52
|
-
- `check-relevance` оценил те же зависимости как неактивные (`dependencies.status: inactive`).
|
|
53
|
-
- Это противоречие привело к ошибочному решению `irrelevant` и пропуску тикета QA-001.
|
|
54
|
-
|
|
55
|
-
---
|
|
56
|
-
|
|
57
|
-
## Проблемы и риски
|
|
58
|
-
|
|
59
|
-
| # | Проблема | Серьёзность | Данные | Рекомендация |
|
|
60
|
-
|---|---------|-------------|--------|-------------|
|
|
61
|
-
| 1 | Некорректная атрибуция root cause в отчёте | HIGH | Отчёт указывает на `check-conditions`, лог показывает `check-relevance` | Обновить отчёт REPORT-002 с корректной атрибуцией |
|
|
62
|
-
| 2 | Ошибочная логика проверки релевантности | HIGH | Противоречие между `dependencies.resolved: true` и `dependencies.status: inactive` | Исправить алгоритм в `check-relevance.js` |
|
|
63
|
-
|
|
64
|
-
---
|
|
65
|
-
|
|
66
|
-
## Рекомендации
|
|
67
|
-
|
|
68
|
-
| # | Действие | Приоритет | Почему | Ожидаемый результат |
|
|
69
|
-
|---|---------|-----------|--------|-------------------|
|
|
70
|
-
| 1 | Исправить алгоритм определения статуса зависимостей в `check-relevance.js` | HIGH | Логика стейджа противоречит данным из `check-conditions` | Стейдж будет принимать корректные решения о релевантности |
|
|
71
|
-
| 2 | Обновить отчёт REPORT-002 с корректным root cause (`check-relevance.js`) | HIGH | Отчёт содержит неверные данные, что вводит в заблуждение | Отчёт будет отражать реальную причину пропуска тикета |
|
|
72
|
-
|
|
73
|
-
---
|
|
74
|
-
|
|
75
|
-
## За пределами скоупа
|
|
76
|
-
|
|
77
|
-
- Нет данных о сложности тикетов и результатах ревью для расчёта взвешенных метрик и First-Pass Rate.
|
|
78
|
-
- Неизвестен общий состав тикетов плана PLAN-001 (только факт обработки 5 тикетов в итерации).
|
|
79
|
-
|
|
80
|
-
---
|
|
81
|
-
|
|
82
|
-
## Решение
|
|
83
|
-
|
|
84
|
-
**Статус плана:** ATTENTION (обнаружена HIGH проблема, влияющая на корректность обработки тикетов)
|
|
85
|
-
**Рекомендуемое действие:** скорректировать (исправить логику стейджа и обновить отчёт)
|
|
86
|
-
|
|
87
|
-
---
|
|
88
|
-
|
|
89
|
-
**DoD проверка:**
|
|
90
|
-
- [x] Секция Result заполнена
|
|
91
|
-
- [x] Каждый вывод подкреплён конкретными данными из отчёта/лога
|
|
92
|
-
- [x] Рекомендации actionable (содержат конкретное действие)
|
|
93
|
-
- [x] Указан уровень уверенности для ключевых выводов
|
|
94
|
-
- [x] Использован требуемый формат вывода
|
|
95
|
-
|
|
96
|
-
---RESULT---
|
|
97
|
-
status: has_gaps
|
|
98
|
-
report_id: REPORT-002
|
|
99
|
-
gaps: "Ошибка атрибуции root cause в отчёте REPORT-002; некорректная логика стейджа check-relevance приводит к пропуску тикетов."
|
|
100
|
-
---RESULT---
|