@chongyan/autospec 1.0.1 → 1.0.2
- package/LICENSE +21 -21
- package/README.en.md +447 -321
- package/README.md +418 -286
- package/knowledge/01-principles/00-principles-hierarchy.md +247 -0
- package/knowledge/01-principles/01-first-principles.md +241 -0
- package/knowledge/01-principles/02-strategic-principles.md +286 -0
- package/knowledge/01-principles/03-tactical-principles.md +385 -0
- package/knowledge/01-principles/04-operational-principles.md +275 -0
- package/knowledge/01-principles/05-domain-principles.md +539 -0
- package/knowledge/01-principles/06-methodology-principles.md +281 -0
- package/knowledge/01-principles/07-cognitive-principles.md +277 -0
- package/knowledge/01-principles/08-auto-fix-principles.md +320 -0
- package/knowledge/01-principles/09-constitution.md +220 -0
- package/knowledge/{principles/evolution.md → 01-principles/10-evolution-mechanism.md} +160 -14
- package/knowledge/01-principles/README.en.md +385 -0
- package/knowledge/01-principles/README.md +385 -0
- package/knowledge/{process/overview.md → 02-process/00-overview.md} +90 -5
- package/knowledge/02-process/README.en.md +143 -0
- package/knowledge/02-process/README.md +186 -0
- package/knowledge/{guides/support/pipeline-protocol.md → 03-guides/00-pipeline-protocol.md} +10 -10
- package/knowledge/{guides/support/team-orchestrator.md → 03-guides/01-team-orchestrator.md} +53 -8
- package/knowledge/{guides/stages/requirement-analyzer.md → 03-guides/02-analyze-requirement.md} +3 -3
- package/knowledge/{guides/stages/ai-effect-evaluator.md → 03-guides/08-evaluate-ai-effect.md} +14 -7
- package/knowledge/{guides/support/skill-distiller.md → 03-guides/19-distill-skill.md} +3 -3
- package/knowledge/{guides/support/skill-updater.md → 03-guides/20-update-skill.md} +1 -1
- package/knowledge/{guides/support/methodology-extractor.md → 03-guides/22-extract-methodology.md} +2 -2
- package/knowledge/{guides/support/complexity-assessor.md → 03-guides/24-assess-complexity.md} +6 -4
- package/knowledge/{guides/support/tech-stack-analyzer.md → 03-guides/26-analyze-tech-stack.md} +1 -1
- package/knowledge/{guides/domain-driven-design.md → 03-guides/42-apply-ddd.md} +1 -1
- package/knowledge/{process/ai-sdlc.md → 03-guides/43-run-ai-sdlc.md} +1 -1
- package/knowledge/{guides/knowledge-management.md → 03-guides/44-manage-knowledge.md} +4 -4
- package/knowledge/03-guides/README.en.md +212 -0
- package/knowledge/03-guides/README.md +212 -0
- package/knowledge/{checklists/requirement.md → 04-checklists/00-requirement.md} +1 -1
- package/knowledge/{checklists/design.md → 04-checklists/01-design.md} +1 -1
- package/knowledge/{checklists/code.md → 04-checklists/02-code.md} +16 -1
- package/knowledge/{checklists/release.md → 04-checklists/04-release.md} +1 -1
- package/knowledge/04-checklists/README.en.md +119 -0
- package/knowledge/04-checklists/README.md +123 -0
- package/knowledge/{config/validation-patterns.yaml → 05-config/00-validation-patterns.yaml} +1 -1
- package/knowledge/{config/team-tasks.yaml → 05-config/02-team-tasks.yaml} +2 -2
- package/knowledge/05-config/03-role-composition.yaml +346 -0
- package/knowledge/{config/skill-compositions.yaml → 05-config/05-skill-compositions.yaml} +24 -24
- package/knowledge/05-config/README.en.md +54 -0
- package/knowledge/05-config/README.md +132 -0
- package/knowledge/06-environment/00-template-registry.md +310 -0
- package/knowledge/06-environment/01-detection-patterns.yaml +1692 -0
- package/knowledge/{environment → 06-environment}/README.en.md +4 -0
- package/knowledge/{environment → 06-environment}/README.md +66 -25
- package/knowledge/{standards/coding-style.md → 07-standards/00-coding-style.md} +123 -4
- package/knowledge/{standards/code-review.md → 07-standards/01-code-review.md} +3 -3
- package/knowledge/{standards/data-consistency.md → 07-standards/02-data-consistency.md} +1 -1
- package/knowledge/{standards/document-versioning.md → 07-standards/03-document-versioning.md} +1 -1
- package/knowledge/{standards/risk-detection.md → 07-standards/04-risk-detection.md} +5 -5
- package/knowledge/07-standards/README.en.md +119 -0
- package/knowledge/07-standards/README.md +123 -0
- package/knowledge/08-organization/00-vision-mission.md +113 -0
- package/knowledge/{organization/ai-native-team.md → 08-organization/01-ai-native-culture.md} +1 -1
- package/knowledge/{organization/team-metrics.md → 08-organization/02-team-metrics.md} +1 -1
- package/knowledge/08-organization/03-committee-structure.md +54 -0
- package/knowledge/08-organization/04-governance-metrics.md +55 -0
- package/knowledge/08-organization/05-improvement-process.md +71 -0
- package/knowledge/08-organization/README.en.md +165 -0
- package/knowledge/08-organization/README.md +165 -0
- package/knowledge/09-templates/00-requirement-proposal.md +344 -0
- package/knowledge/09-templates/01-architecture-design.md +494 -0
- package/knowledge/09-templates/02-api-design.md +408 -0
- package/knowledge/09-templates/03-database-design.md +313 -0
- package/knowledge/09-templates/04-product-design.md +237 -0
- package/knowledge/09-templates/05-domain-business.md +388 -0
- package/knowledge/09-templates/06-test-design.md +268 -0
- package/knowledge/09-templates/07-evaluation-design.md +372 -0
- package/knowledge/09-templates/08-component-knowledge.md +272 -0
- package/knowledge/09-templates/09-best-practices.md +218 -0
- package/knowledge/{environment/middleware-knowledge.md → 09-templates/10-middleware-knowledge.md} +106 -1
- package/knowledge/09-templates/README.en.md +222 -0
- package/knowledge/09-templates/README.md +216 -0
- package/knowledge/README.en.md +372 -0
- package/knowledge/README.md +354 -99
- package/package.json +1 -1
- package/plugins/.claude-plugin/plugin.json +460 -81
- package/plugins/agents/roles/ceo.md +1 -1
- package/plugins/agents/roles/product-owner.md +1 -1
- package/plugins/agents/roles/tech-lead.md +1 -1
- package/plugins/agents/support/consistency-checker.md +36 -3
- package/plugins/agents/support/monitoring-agent.md +215 -0
- package/plugins/agents/support/safety-auditor.md +2 -2
- package/plugins/agents/support/stage-gate-evaluator.md +95 -11
- package/plugins/agents/support/test-coverage-reviewer.md +1 -1
- package/plugins/benchmarks/templates/README.md +165 -13
- package/plugins/benchmarks/templates/commands/apply-template.yaml +108 -0
- package/plugins/benchmarks/templates/commands/archive-template.yaml +65 -0
- package/plugins/benchmarks/templates/commands/env-export-template.yaml +64 -0
- package/plugins/benchmarks/templates/commands/env-sync-template.yaml +104 -0
- package/plugins/benchmarks/templates/commands/env-template-template.yaml +96 -0
- package/plugins/benchmarks/templates/commands/env-template.yaml +58 -0
- package/plugins/benchmarks/templates/commands/env-update-template.yaml +110 -0
- package/plugins/benchmarks/templates/commands/env-validate-template.yaml +95 -0
- package/plugins/benchmarks/templates/commands/field-evolve-template.yaml +104 -0
- package/plugins/benchmarks/templates/commands/project-evolve-template.yaml +104 -0
- package/plugins/benchmarks/templates/commands/propose-template.yaml +88 -0
- package/plugins/benchmarks/templates/commands/review-template.yaml +124 -0
- package/plugins/benchmarks/templates/commands/run-template.yaml +127 -0
- package/plugins/benchmarks/templates/commands/test-template.yaml +149 -0
- package/plugins/benchmarks/templates/pipeline/experiment-template.yaml +92 -0
- package/plugins/benchmarks/templates/pipeline/hotfix-template.yaml +81 -0
- package/plugins/benchmarks/templates/skills/agile-iteration-template.yaml +78 -0
- package/plugins/benchmarks/templates/skills/benchmark-executor-template.yaml +114 -0
- package/plugins/benchmarks/templates/skills/benchmark-generator-template.yaml +52 -0
- package/plugins/benchmarks/templates/skills/delivery-stage-template.yaml +130 -0
- package/plugins/benchmarks/templates/skills/design-stage-template.yaml +131 -0
- package/plugins/benchmarks/templates/skills/experiment-iteration-template.yaml +60 -0
- package/plugins/benchmarks/templates/skills/exploration-phase-template.yaml +114 -0
- package/plugins/benchmarks/templates/skills/field-evolve-analyzer-template.yaml +51 -0
- package/plugins/benchmarks/templates/skills/field-evolve-distiller-template.yaml +34 -0
- package/plugins/benchmarks/templates/skills/field-evolve-executor-template.yaml +50 -0
- package/plugins/benchmarks/templates/skills/field-evolve-fixer-template.yaml +52 -0
- package/plugins/benchmarks/templates/skills/field-evolve-learner-template.yaml +33 -0
- package/plugins/benchmarks/templates/skills/field-evolve-scanner-template.yaml +74 -0
- package/plugins/benchmarks/templates/skills/field-evolve-template.yaml +71 -0
- package/plugins/benchmarks/templates/skills/field-evolve-verifier-template.yaml +51 -0
- package/plugins/benchmarks/templates/skills/hotfix-iteration-template.yaml +54 -0
- package/plugins/benchmarks/templates/skills/implementation-stage-template.yaml +127 -0
- package/plugins/benchmarks/templates/skills/layer1-validation-template.yaml +121 -0
- package/plugins/benchmarks/templates/skills/project-evolve-analyzer-template.yaml +51 -0
- package/plugins/benchmarks/templates/skills/project-evolve-fixer-template.yaml +52 -0
- package/plugins/benchmarks/templates/skills/project-evolve-generator-template.yaml +34 -0
- package/plugins/benchmarks/templates/skills/project-evolve-learner-template.yaml +50 -0
- package/plugins/benchmarks/templates/skills/project-evolve-reviewer-template.yaml +50 -0
- package/plugins/benchmarks/templates/skills/project-evolve-scanner-template.yaml +75 -0
- package/plugins/benchmarks/templates/skills/project-evolve-template.yaml +72 -0
- package/plugins/benchmarks/templates/skills/project-evolve-verifier-template.yaml +51 -0
- package/plugins/benchmarks/templates/skills/skill-forge-template.yaml +117 -0
- package/plugins/benchmarks/templates/skills/startup-guard-template.yaml +103 -0
- package/plugins/benchmarks/templates/skills/testing-stage-template.yaml +146 -0
- package/plugins/benchmarks/templates/skills/waterfall-iteration-template.yaml +55 -0
- package/plugins/commands/README.en.md +2 -2
- package/plugins/commands/README.md +2 -2
- package/plugins/commands/apply.md +102 -16
- package/plugins/commands/archive.md +60 -4
- package/plugins/commands/env-sync.md +1047 -406
- package/plugins/commands/env-template.md +11 -135
- package/plugins/commands/env-update.md +1 -1
- package/plugins/commands/env-validate.md +3 -3
- package/plugins/commands/explore.md +118 -1
- package/plugins/commands/field-evolve.md +51 -175
- package/plugins/commands/project-evolve.md +167 -68
- package/plugins/commands/propose.md +97 -6
- package/plugins/commands/review.md +5 -5
- package/plugins/commands/run.md +841 -13
- package/plugins/commands/status.md +138 -17
- package/plugins/commands/test.md +389 -0
- package/plugins/hooks/constitution-guard.js +1 -1
- package/plugins/hooks/environment-autocommit.js +366 -24
- package/plugins/hooks/environment-manager.js +3 -2
- package/plugins/hooks/execution-tracker.js +109 -4
- package/plugins/hooks/layer1-validator.js +117 -1
- package/plugins/hooks/lib/auto-fix-loop.js +605 -0
- package/plugins/hooks/lib/environment-config-loader.js +11 -7
- package/plugins/hooks/lib/hook-state-manager.js +98 -0
- package/plugins/hooks/lib/memory-extractor.js +27 -5
- package/plugins/hooks/lib/memory-manager.js +1 -1
- package/plugins/hooks/lib/test-auto-fix.test.js +194 -0
- package/plugins/hooks/monitoring-trigger.js +467 -0
- package/plugins/skills/README.en.md +15 -3
- package/plugins/skills/README.md +21 -11
- package/plugins/skills/agile-iteration/SKILL.md +187 -0
- package/plugins/skills/delivery-stage/SKILL.md +133 -12
- package/plugins/skills/design-stage/SKILL.md +103 -12
- package/plugins/skills/experiment-evaluator/SKILL.md +271 -0
- package/plugins/skills/experiment-iteration/SKILL.md +154 -0
- package/plugins/skills/exploration-phase/SKILL.md +93 -10
- package/plugins/skills/field-evolve-analyzer/SKILL.md +65 -0
- package/plugins/skills/field-evolve-distiller/SKILL.md +66 -0
- package/plugins/skills/field-evolve-executor/SKILL.md +94 -0
- package/plugins/skills/field-evolve-executor/executor.js +342 -0
- package/plugins/skills/field-evolve-fixer/SKILL.md +69 -0
- package/plugins/skills/field-evolve-learner/SKILL.md +65 -0
- package/plugins/skills/field-evolve-scanner/SKILL.md +87 -0
- package/plugins/skills/field-evolve-scanner/scripts/fallback-scanner.js +288 -0
- package/plugins/skills/field-evolve-verifier/SKILL.md +64 -0
- package/plugins/skills/hotfix-iteration/SKILL.md +279 -0
- package/plugins/skills/implementation-stage/SKILL.md +156 -15
- package/plugins/skills/layer1-validation/SKILL.md +1 -1
- package/plugins/skills/pending-dashboard/SKILL.md +9 -8
- package/plugins/skills/project-evolve-analyzer/SKILL.md +95 -0
- package/plugins/skills/project-evolve-fixer/SKILL.md +99 -0
- package/plugins/skills/project-evolve-generator/SKILL.md +149 -0
- package/plugins/skills/project-evolve-learner/SKILL.md +103 -0
- package/plugins/skills/project-evolve-reviewer/SKILL.md +104 -0
- package/plugins/skills/project-evolve-scanner/SKILL.md +95 -0
- package/plugins/skills/project-evolve-scanner/scripts/dependency-reuse-checker.js +395 -0
- package/plugins/skills/project-evolve-scanner/scripts/subsystem-coverage.js +315 -0
- package/plugins/skills/project-evolve-verifier/SKILL.md +105 -0
- package/plugins/skills/requirement-stage/SKILL.md +47 -13
- package/plugins/skills/skill-forge/SKILL.md +2 -2
- package/plugins/skills/testing-stage/SKILL.md +583 -8
- package/plugins/skills/waterfall-iteration/SKILL.md +115 -0
- package/scripts/cli/index.js +1 -1
- package/scripts/cli/init.js +30 -4
- package/scripts/cli/list.js +3 -2
- package/scripts/config/commands.config.js +8 -8
- package/scripts/config/hooks.config.js +1 -1
- package/scripts/install/constants.js +204 -165
- package/scripts/state.js +210 -1
- package/knowledge/config/README.en.md +0 -44
- package/knowledge/config/README.md +0 -44
- package/knowledge/config/role-composition.yaml +0 -98
- package/knowledge/config/team-triggers.yaml +0 -198
- package/knowledge/domain/README.md +0 -115
- package/knowledge/domain/flows/README.md +0 -194
- package/knowledge/domain/glossary.md +0 -143
- package/knowledge/domain/rules.md +0 -138
- package/knowledge/environment/component-knowledge.md +0 -316
- package/knowledge/environment/detection-patterns.yaml +0 -502
- package/knowledge/environment/template-registry.md +0 -321
- package/knowledge/guides/requirement-engineering.md +0 -329
- package/knowledge/guides/system-design.md +0 -352
- package/knowledge/principles/constitution.md +0 -134
- package/knowledge/principles/core-principles.md +0 -368
- package/knowledge/principles/design-philosophy.md +0 -877
- package/knowledge/process/README.en.md +0 -38
- package/knowledge/process/README.md +0 -48
- package/knowledge/templates/ai-evaluation.md +0 -150
- package/knowledge/templates/api-design.md +0 -117
- package/knowledge/templates/database-design.md +0 -132
- package/knowledge/templates/domain-driven-design.md +0 -321
- package/knowledge/templates/product-proposal.md +0 -201
- package/knowledge/templates/system-design.md +0 -227
- package/knowledge/templates/task-breakdown.md +0 -107
- package/knowledge/templates/test-case.md +0 -170
- package/plugins/commands/validate.md +0 -108
- package/plugins/skills/benchmark-executor/README.md +0 -93
- package/plugins/skills/evolution-process/SKILL.md +0 -291
- package/plugins/skills/project-evolution/SKILL.md +0 -847
- package/scripts/evolution/evolution-router.js +0 -273
- package/scripts/evolution/evolution-signal-collector.js +0 -307
- package/scripts/evolution/knowledge-loader.js +0 -346
- package/scripts/evolution/marketplace.js +0 -317
- package/scripts/evolution/version-manager.js +0 -371
- /package/knowledge/{process → 02-process}/01-requirement.md +0 -0
- /package/knowledge/{process → 02-process}/02-design.md +0 -0
- /package/knowledge/{process → 02-process}/03-implementation.md +0 -0
- /package/knowledge/{process → 02-process}/04-review.md +0 -0
- /package/knowledge/{process → 02-process}/05-testing.md +0 -0
- /package/knowledge/{process → 02-process}/06-delivery.md +0 -0
- /package/knowledge/{guides/stages/design-planner.md → 03-guides/03-design-solution.md} +0 -0
- /package/knowledge/{guides/stages/code-implementer.md → 03-guides/04-implement-code.md} +0 -0
- /package/knowledge/{guides/stages/test-planner.md → 03-guides/05-plan-testing.md} +0 -0
- /package/knowledge/{guides/stages/test-generator.md → 03-guides/06-generate-tests.md} +0 -0
- /package/knowledge/{guides/stages/release-checker.md → 03-guides/07-check-release.md} +0 -0
- /package/knowledge/{guides/stages/requirement-reviewer.md → 03-guides/09-review-requirement.md} +0 -0
- /package/knowledge/{guides/stages/design-reviewer.md → 03-guides/10-review-design.md} +0 -0
- /package/knowledge/{guides/stages/code-reviewer.md → 03-guides/11-review-code.md} +0 -0
- /package/knowledge/{guides/stages/test-reviewer.md → 03-guides/12-review-testing.md} +0 -0
- /package/knowledge/{guides/stages/security-reviewer.md → 03-guides/13-audit-security.md} +0 -0
- /package/knowledge/{guides/stages/consistency-checker.md → 03-guides/14-check-consistency.md} +0 -0
- /package/knowledge/{guides/stages/unit-test-runner.md → 03-guides/15-run-unit-tests.md} +0 -0
- /package/knowledge/{guides/stages/integration-test-runner.md → 03-guides/16-run-integration-tests.md} +0 -0
- /package/knowledge/{guides/stages/test-context-analyzer.md → 03-guides/17-analyze-test-context.md} +0 -0
- /package/knowledge/{guides/support/practice-logger.md → 03-guides/18-log-practice.md} +0 -0
- /package/knowledge/{guides/support/skill-validator.md → 03-guides/21-validate-skill.md} +0 -0
- /package/knowledge/{guides/support/scope-inference.md → 03-guides/23-infer-scope.md} +0 -0
- /package/knowledge/{guides/support/component-discovery.md → 03-guides/25-discover-component.md} +0 -0
- /package/knowledge/{guides/support/environment-scanner.md → 03-guides/27-scan-environment.md} +0 -0
- /package/knowledge/{guides/support/environment-validator.md → 03-guides/28-validate-environment.md} +0 -0
- /package/knowledge/{guides/support/knowledge-generator.md → 03-guides/29-generate-knowledge.md} +0 -0
- /package/knowledge/{guides/support/ai-capability-analyzer.md → 03-guides/30-analyze-ai-capability.md} +0 -0
- /package/knowledge/{guides/support/ai-component-analyzer.md → 03-guides/31-analyze-ai-component.md} +0 -0
- /package/knowledge/{guides/support/ai-agent-analyzer.md → 03-guides/32-analyze-ai-agent.md} +0 -0
- /package/knowledge/{guides/support/ai-rag-analyzer.md → 03-guides/33-analyze-ai-rag.md} +0 -0
- /package/knowledge/{guides/support/ai-task-assessor.md → 03-guides/34-assess-ai-task.md} +0 -0
- /package/knowledge/{guides/support/ai-pipeline-evaluator.md → 03-guides/35-evaluate-ai-pipeline.md} +0 -0
- /package/knowledge/{guides/support/ai-artifact-evaluator.md → 03-guides/36-evaluate-ai-artifact.md} +0 -0
- /package/knowledge/{guides/support/ai-evaluation-planner.md → 03-guides/37-plan-ai-evaluation.md} +0 -0
- /package/knowledge/{guides/support/ai-path-evaluator.md → 03-guides/38-evaluate-ai-path.md} +0 -0
- /package/knowledge/{guides/support/ai-data-validator.md → 03-guides/39-validate-ai-data.md} +0 -0
- /package/knowledge/{guides/support/ai-anomaly-analyzer.md → 03-guides/40-detect-ai-anomaly.md} +0 -0
- /package/knowledge/{guides/support/ai-test-diagnostics.md → 03-guides/41-diagnose-ai-test.md} +0 -0
- /package/knowledge/{guides/support/test-runner.md → 03-guides/45-test-runner.md} +0 -0
- /package/knowledge/{checklists/test.md → 04-checklists/03-test.md} +0 -0
- /package/knowledge/{config/team-stage.yaml → 05-config/01-team-stage.yaml} +0 -0
- /package/knowledge/{config/role-extensions.yaml → 05-config/04-role-extensions.yaml} +0 -0
@@ -0,0 +1,117 @@
+# AutoSpec Skill Benchmark Template - Skill-Forge
+# Applies to: testing the skill-forge skill
+# After init, copy to .autospec/benchmarks/ and modify as needed
+
+version: "1.0"
+name: "skill-skill-forge"
+description: "Skill-Forge Skill benchmark - skill forging system"
+
+type: skill
+target: skill-forge
+
+testCases:
+  - name: "distill-new-skill"
+    input:
+      context: "practice-log contains 5 recorded occurrences of a pattern"
+      complexity: 3
+    expectedBehaviors:
+      - "Collect at least 3 pieces of supporting evidence"
+      - "Run process distillation"
+      - "Generate a skill draft that conforms to the template"
+      - "Include a CSO description and an anti-pattern list"
+      - "Run Layer1+Layer2 validation"
+    expectedOutput:
+      - "New skill file"
+      - "Forge report"
+      - "Evidence list"
+    successCriteria:
+      - "Evidence collected >= 3 items"
+      - "Skill meets the template standard"
+      - "Anti-pattern list >= 5 items"
+    qualityMetrics:
+      - "Evidence collection completeness = 100%"
+      - "Skill conformance rate >= 90%"
+    maxDuration: 600
+
+  - name: "iterate-skill"
+    input:
+      context: "A skill missed issues 3 times in a row"
+      complexity: 3
+    expectedBehaviors:
+      - "Collect all feedback data related to the skill"
+      - "Analyze problem patterns"
+      - "Generate modification suggestions"
+      - "Run an arena comparison"
+      - "Maintain backward compatibility"
+    expectedOutput:
+      - "Upgraded skill file"
+      - "Change notes"
+      - "A/B comparison report"
+    successCriteria:
+      - "Problem analysis is thorough"
+      - "Arena comparison is fair"
+      - "Backward compatible"
+    qualityMetrics:
+      - "Problem analysis accuracy >= 90%"
+      - "Compatibility retention rate = 100%"
+    maxDuration: 600
+
+  - name: "optimize-description"
+    input:
+      context: "A skill's trigger accuracy is only 55%"
+      complexity: 1
+    expectedBehaviors:
+      - "Analyze false-trigger cases"
+      - "Analyze missed-trigger cases"
+      - "Generate an optimized description"
+      - "Validate the CSO format"
+    expectedOutput:
+      - "Optimized description"
+      - "A/B test report"
+    successCriteria:
+      - "description conforms to the CSO format"
+      - "Trigger accuracy improvement >= 15%"
+    qualityMetrics:
+      - "CSO format conformance rate = 100%"
+      - "Accuracy improvement >= 15%"
+    maxDuration: 300
+
+  - name: "frozen-zone-skip"
+    input:
+      context: "Suggestion to modify constitution.md"
+      complexity: 1
+    expectedBehaviors:
+      - "Recognize that the target file is in the frozen zone"
+      - "Stop automatic execution"
+      - "Only record the suggestion"
+    expectedOutput:
+      - "Suggestion record"
+      - "Frozen-zone notice"
+    successCriteria:
+      - "Frozen zone identified correctly"
+      - "No modification performed"
+    qualityMetrics:
+      - "Frozen-zone identification rate = 100%"
+      - "No-violating-modification rate = 100%"
+    maxDuration: 60
+
+  - name: "insufficient-evidence"
+    input:
+      context: "A pattern appears only once in practice-log"
+      complexity: 1
+    expectedBehaviors:
+      - "Detect that the evidence count is < 3"
+      - "Do not trigger the forge process"
+      - "Report insufficient evidence"
+    expectedOutput:
+      - "Insufficient-evidence notice"
+    successCriteria:
+      - "Evidence count detected correctly"
+      - "Forge not triggered"
+    qualityMetrics:
+      - "Evidence detection rate = 100%"
+    maxDuration: 60
+
+successCriteria:
+  passRate: 85
+  avgFieldCompletion: 90
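The `frozen-zone-skip` and `insufficient-evidence` cases in the Skill-Forge template describe two gates that run before any forging happens. The sketch below is one possible reading of those expectations, not the skill's actual implementation: `decide_forge`, `ForgeDecision`, `MIN_EVIDENCE`, and `FROZEN_FILES` are hypothetical names, and the threshold of 3 and the `constitution.md` entry are taken only from the benchmark text.

```python
from dataclasses import dataclass

# Assumed values, read off the benchmark expectations rather than the skill itself.
MIN_EVIDENCE = 3
FROZEN_FILES = {"constitution.md"}


@dataclass
class ForgeDecision:
    action: str  # "forge", "record_only", or "skip"
    reason: str


def decide_forge(evidence_count: int, target_file: str) -> ForgeDecision:
    """Decide what a forge run should do for a single suggestion."""
    # Frozen-zone targets are never modified automatically; only record the suggestion.
    basename = target_file.rsplit("/", 1)[-1]
    if basename in FROZEN_FILES:
        return ForgeDecision("record_only", "target is in the frozen zone")
    # Too little supporting evidence: do not start the forge process.
    if evidence_count < MIN_EVIDENCE:
        return ForgeDecision("skip", "insufficient evidence")
    return ForgeDecision("forge", "enough evidence collected")


if __name__ == "__main__":
    print(decide_forge(1, "skills/example/SKILL.md"))
    print(decide_forge(5, "knowledge/constitution.md"))
```

Checking the frozen zone before the evidence count is a design choice made here so that a frozen target is always reported as such, regardless of how much evidence exists.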
@@ -0,0 +1,103 @@
+# AutoSpec Skill Benchmark Template - Startup-Guard
+# Applies to: testing the startup-guard skill
+# After init, copy to .autospec/benchmarks/ and modify as needed
+
+version: "1.0"
+name: "skill-startup-guard"
+description: "Startup-Guard Skill benchmark - startup gate"
+
+type: skill
+target: startup-guard
+
+testCases:
+  - name: "new-feature-guard"
+    input:
+      context: "Implement the {feature-name} feature"
+      complexity: 1
+    expectedBehaviors:
+      - "Detect that the user did not use an /autospec: command"
+      - "Confirm that .autospec/knowledge/ exists"
+      - "Identify the context"
+      - "Recommend /autospec:run or /autospec:explore"
+    expectedOutput:
+      - "Gate notice"
+      - "List of recommended commands"
+    successCriteria:
+      - "Non-command trigger identified correctly"
+      - "Context identified accurately"
+      - "Recommended commands are reasonable"
+    qualityMetrics:
+      - "Gate trigger rate = 100%"
+      - "Context identification accuracy >= 90%"
+    maxDuration: 60
+
+  - name: "hotfix-guard"
+    input:
+      context: "A critical bug was found in production and needs an immediate fix"
+      complexity: 1
+    expectedBehaviors:
+      - "Identify the urgent context"
+      - "Recommend /autospec:run --workflow=hotfix"
+      - "Explain the characteristics of hotfix mode"
+    expectedOutput:
+      - "Urgent-response notice"
+      - "Hotfix mode recommendation"
+    successCriteria:
+      - "Urgent context identified correctly"
+      - "Hotfix recommendation is reasonable"
+    qualityMetrics:
+      - "Urgency identification rate = 100%"
+      - "Hotfix recommendation accuracy = 100%"
+    maxDuration: 60
+
+  - name: "experiment-guard"
+    input:
+      context: "Want to verify whether a new technical approach is feasible"
+      complexity: 1
+    expectedBehaviors:
+      - "Identify the technical-exploration context"
+      - "Recommend /autospec:run --workflow=experiment"
+    expectedOutput:
+      - "Technical-exploration notice"
+      - "Experiment mode recommendation"
+    successCriteria:
+      - "Exploration context identified correctly"
+    qualityMetrics:
+      - "Exploration identification rate = 100%"
+    maxDuration: 60
+
+  - name: "already-in-flow"
+    input:
+      context: "/autospec:run --workflow=waterfall implement {feature}"
+      complexity: 1
+    expectedBehaviors:
+      - "Detect that the user already used an /autospec: command"
+      - "This skill does not take effect"
+    successCriteria:
+      - "Already-in-flow state identified correctly"
+      - "Gate notice is not repeated"
+    qualityMetrics:
+      - "Command identification rate = 100%"
+      - "No-duplicate-notice rate = 100%"
+    maxDuration: 30
+
+  - name: "knowledge-missing"
+    input:
+      context: "Implement a feature"
+      knowledgeMissing: true
+      complexity: 1
+    expectedBehaviors:
+      - "Detect that .autospec/knowledge/ does not exist"
+      - "Prompt the user to run autospec init"
+    expectedOutput:
+      - "Initialization notice"
+      - "autospec init command"
+    successCriteria:
+      - "Missing knowledge directory identified correctly"
+    qualityMetrics:
+      - "Missing-directory identification rate = 100%"
+    maxDuration: 30
+
+successCriteria:
+  passRate: 95
+  avgFieldCompletion: 95
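Read together, the five Startup-Guard cases suggest a small decision sequence: stay silent when the message is already an `/autospec:` command, ask for initialization when the knowledge directory is missing, and otherwise recommend a workflow based on the context. The following is a minimal sketch of that sequence under stated assumptions: `startup_guard` and the keyword tuples are hypothetical, and simple substring matching stands in for whatever context recognition the real skill performs. Only the recommended commands come from the template.

```python
from typing import Optional

# Hypothetical keyword hints; the skill's real context recognition is not shown in this diff.
URGENT_HINTS = ("hotfix", "critical bug", "production")
EXPLORE_HINTS = ("feasible", "verify", "experiment", "prototype")


def startup_guard(message: str, knowledge_exists: bool) -> Optional[str]:
    """Return a recommendation, or None when the guard should not take effect."""
    text = message.strip().lower()
    # The user is already inside an AutoSpec flow, so no gate notice is shown.
    if text.startswith("/autospec:"):
        return None
    # Without .autospec/knowledge/ the project has to be initialized first.
    if not knowledge_exists:
        return "autospec init"
    if any(hint in text for hint in URGENT_HINTS):
        return "/autospec:run --workflow=hotfix"
    if any(hint in text for hint in EXPLORE_HINTS):
        return "/autospec:run --workflow=experiment"
    return "/autospec:run or /autospec:explore"


if __name__ == "__main__":
    print(startup_guard("Critical bug in production, fix it now", True))
    print(startup_guard("Implement the login feature", False))
```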
@@ -0,0 +1,146 @@
+# AutoSpec Skill Benchmark Template - Testing-Stage
+# Applies to: testing the testing-stage skill
+# After init, copy to .autospec/benchmarks/ and modify as needed
+
+version: "1.0"
+name: "skill-testing-stage"
+description: "Testing-Stage Skill benchmark"
+
+type: skill
+target: testing-stage
+
+testCases:
+  - name: "simple-test"
+    input:
+      context: "Generate test cases for the {feature-name} feature"
+      complexity: 1
+    expectedBehaviors:
+      - "Read the requirements document (acceptance criteria)"
+      - "Generate test cases from the acceptance criteria"
+      - "Run boundary tests"
+      - "Run exception tests"
+      - "Generate a test report"
+    expectedOutput:
+      - "tests/{feature}.test.js"
+      - "test-cases/{feature}-cases.md"
+      - "test-report.md"
+    successCriteria:
+      - "Test cases cover all acceptance criteria"
+      - "Boundary-condition tests are complete"
+      - "Test pass rate >= 90%"
+    qualityMetrics:
+      - "Functional coverage = 100%"
+      - "Boundary test coverage >= 80%"
+    maxDuration: 450
+
+  - name: "multi-system-integration-test"
+    input:
+      context: "Run integration tests for the {feature} system"
+      complexity: 5
+    expectedBehaviors:
+      - "Identify subsystems"
+      - "Design integration test scenarios"
+      - "Run API integration tests"
+      - "Run end-to-end (E2E) tests"
+    expectedOutput:
+      - "tests/integration/*.test.js"
+      - "tests/e2e/*.spec.js"
+      - "integration-report.md"
+    successCriteria:
+      - "Integration test scenarios cover cross-system interactions"
+      - "E2E tests pass"
+    qualityMetrics:
+      - "Integration test coverage >= 85%"
+      - "Defect detection rate >= 85%"
+    maxDuration: 900
+
+  - name: "ai-evaluation"
+    input:
+      context: "Evaluate the effectiveness of an AI feature"
+      complexity: 5
+    expectedBehaviors:
+      - "Extract evaluation metrics from the requirements"
+      - "Build an evaluation dataset"
+      - "Develop evaluation scripts"
+      - "Run the effectiveness evaluation"
+      - "Generate a Badcase analysis"
+    expectedOutput:
+      - "evaluation/dataset.jsonl"
+      - "evaluation/evaluator.py"
+      - "evaluation-report.md"
+      - "badcase-analysis.md"
+    successCriteria:
+      - "Evaluation metrics are complete"
+      - "Evaluation dataset is representative"
+      - "Badcase analysis is thorough"
+    qualityMetrics:
+      - "Evaluation metric completeness >= 90%"
+      - "Evaluation execution rate = 100%"
+    maxDuration: 900
+
+  - name: "data-quality-test"
+    input:
+      context: "Run data-quality tests on a data system"
+      complexity: 5
+    expectedBehaviors:
+      - "Design data-quality checks"
+      - "Run ETL pipeline verification"
+      - "Run data-lineage verification"
+      - "Run performance tests"
+    expectedOutput:
+      - "tests/data-quality/*.test.py"
+      - "data-quality-report.md"
+      - "performance-report.md"
+    successCriteria:
+      - "Data-quality checks are complete"
+      - "ETL pipeline verification passes"
+    qualityMetrics:
+      - "Data-quality check rate = 100%"
+      - "Performance target attainment rate >= 90%"
+    maxDuration: 600
+
+  - name: "security-test"
+    input:
+      context: "Run security tests on a user system"
+      complexity: 3
+    expectedBehaviors:
+      - "Run OWASP Top 10 checks"
+      - "Run authentication and authorization tests"
+      - "Run input-validation tests"
+      - "Run sensitive-data encryption tests"
+    expectedOutput:
+      - "tests/security/*.test.js"
+      - "security-report.md"
+      - "vulnerability-list.md"
+    successCriteria:
+      - "OWASP Top 10 checks are complete"
+      - "Injection vulnerability detection rate >= 90%"
+    qualityMetrics:
+      - "Security test coverage >= 90%"
+      - "Vulnerability detection rate >= 90%"
+    maxDuration: 600
+
+  - name: "performance-test"
+    input:
+      context: "Run performance tests on an API and provide optimization suggestions"
+      complexity: 3
+    expectedBehaviors:
+      - "Design performance test scenarios"
+      - "Run performance tests"
+      - "Identify performance bottlenecks"
+      - "Generate optimization suggestions"
+    expectedOutput:
+      - "tests/performance/*.test.js"
+      - "performance-report.md"
+      - "optimization-guide.md"
+    successCriteria:
+      - "Performance bottlenecks identified accurately"
+      - "Optimization suggestions are actionable"
+    qualityMetrics:
+      - "Performance test completeness >= 90%"
+      - "Bottleneck identification accuracy >= 85%"
+    maxDuration: 600
+
+successCriteria:
+  passRate: 85
+  avgFieldCompletion: 90
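Each template closes with a top-level `successCriteria` block holding `passRate` and `avgFieldCompletion`. The diff does not show how a benchmark runner interprets these numbers, so the sketch below assumes the simplest reading: `passRate` is the percentage of test cases that passed, and `avgFieldCompletion` is the mean of a per-case completion percentage. The helper names `pass_rate` and `meets_criteria` are hypothetical.

```python
from typing import Mapping, Sequence


def pass_rate(results: Sequence[bool]) -> float:
    """Percentage of test cases that passed (0.0 for an empty run)."""
    if not results:
        return 0.0
    return 100.0 * sum(results) / len(results)


def meets_criteria(results: Sequence[bool],
                   field_completion: Sequence[float],
                   criteria: Mapping[str, float]) -> bool:
    """Compare aggregate results against the passRate and avgFieldCompletion thresholds."""
    avg_completion = (sum(field_completion) / len(field_completion)
                      if field_completion else 0.0)
    return (pass_rate(results) >= criteria["passRate"]
            and avg_completion >= criteria["avgFieldCompletion"])


if __name__ == "__main__":
    # Thresholds copied from the Testing-Stage template.
    criteria = {"passRate": 85, "avgFieldCompletion": 90}
    # Five of six cases passing falls below the 85% threshold.
    print(meets_criteria([True] * 5 + [False], [95.0] * 6, criteria))
```

Under this reading, a six-case template such as Testing-Stage needs every case to pass, since five of six is roughly 83.3%.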
@@ -0,0 +1,55 @@
+# AutoSpec Skill Benchmark Template - Waterfall-Iteration
+# Applies to: testing the waterfall-iteration skill
+# After init, copy to .autospec/benchmarks/ and adjust as needed
+
+version: "1.0"
+name: "skill-waterfall-iteration"
+description: "Waterfall-Iteration Skill benchmark - sequential execution in waterfall mode"
+
+type: skill
+target: waterfall-iteration
+
+testCases:
+  - name: "simple-waterfall"
+    input:
+      context: "Develop {feature-name} in waterfall mode"
+      complexity: 1
+    expectedBehaviors:
+      - "Run the requirements analysis phase"
+      - "Run the design phase"
+      - "Run the implementation phase"
+      - "Run the testing phase"
+      - "Run the delivery phase"
+    expectedOutput:
+      - "requirement.md"
+      - "design.md"
+      - "Source code"
+      - "Test report"
+      - "Delivery documentation"
+    successCriteria:
+      - "All phases complete in order"
+      - "Every phase passes validation"
+    qualityMetrics:
+      - "Phase completion rate = 100%"
+      - "Validation pass rate >= 90%"
+    maxDuration: 1800
+
+  - name: "multi-system-waterfall"
+    input:
+      context: "Develop multi-system {feature} in waterfall mode"
+      complexity: 5
+    expectedBehaviors:
+      - "Multi-system requirements analysis"
+      - "Multi-system design"
+      - "Implement in dependency order"
+      - "Integration testing"
+    successCriteria:
+      - "All subsystems are identified"
+      - "Dependency order is correct"
+    qualityMetrics:
+      - "Subsystem identification rate = 100%"
+    maxDuration: 3600
+
+successCriteria:
+  passRate: 85
+  avgFieldCompletion: 90
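The `simple-waterfall` case above expects five phases (requirements analysis, design, implementation, testing, delivery) to finish strictly in order, with a phase completion rate of 100%. A check along those lines could look like the Python sketch below. It assumes the benchmark runner records completed phases as a list of names; the phase labels and both function names are hypothetical and do not come from the package.

```python
# Phase labels are illustrative stand-ins for the five expected behaviors.
WATERFALL_PHASES = ["requirements", "design", "implementation", "testing", "delivery"]


def phases_completed_in_order(completed):
    """True only if every phase ran exactly once, in waterfall order."""
    return list(completed) == WATERFALL_PHASES


def phase_completion_rate(completed):
    """Percentage of the expected phases that appear in the completed list."""
    done = sum(1 for phase in WATERFALL_PHASES if phase in completed)
    return 100 * done / len(WATERFALL_PHASES)
```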
@@ -50,12 +50,12 @@ Templates use `{{variable}}` syntax for variables that are replaced at generatio
 |---------|----------|-------------|
 | `/evolve` | `evolve.md` | Trigger self-evolution cycle |

-### Status Commands
+### Status & Validation Commands

 | Command | Template | Description |
 |---------|----------|-------------|
 | `/autospec:status` | `status.md` | View current AutoSpec process status |
-| `/autospec:
+| `/autospec:test` | `test.md` | Unified test command: run tests → analyze issues → auto-fix → evaluate |
 | `/autospec:review` | `review.md` | Review deliverables at each stage |

 ### Environment Commands
@@ -50,12 +50,12 @@ Step-by-step instructions executed by Claude.
 |------|------|------|
 | `/evolve` | `evolve.md` | Trigger the self-evolution cycle |

-###
+### Status and Validation Commands

 | Command | Template | Description |
 |------|------|------|
 | `/autospec:status` | `status.md` | View the current AutoSpec process status |
-| `/autospec:
+| `/autospec:test` | `test.md` | Unified test command: run tests → analyze issues → auto-fix → evaluate results |
 | `/autospec:review` | `review.md` | Review the deliverables of each stage |

 ### Environment Commands
@@ -12,11 +12,30 @@ $ARGUMENTS
 ---

+**[Capability Guides]**
+This command consults the following guides during execution (loaded on demand by the Skill layer):
+
+| Guide file | Purpose |
+|---------|------|
+| 04-implement-code.md | Code implementation guide |
+| 05-plan-testing.md | Test planning method |
+| 06-generate-tests.md | Test generation guide |
+| 08-evaluate-ai-effect.md | AI effect evaluation |
+| 11-review-code.md | Code review standard |
+| 12-review-testing.md | Test review standard |
+| 13-audit-security.md | Security audit standard |
+| 15-run-unit-tests.md | Unit test execution |
+| 16-run-integration-tests.md | Integration test execution |
+| 17-analyze-test-context.md | Test context analysis |
+| 37-plan-ai-evaluation.md | AI evaluation plan |
+
+---
+
 ## Preprocessing

 [Prerequisite] Read the pipeline protocol:
 > **[Important] Read the pipeline protocol before executing this command**
-> 1. Read `knowledge/guides/
+> 1. Read the 「流程能力」 (Process Capabilities) section in `knowledge/03-guides/00-pipeline-protocol.md`
 > 2. Perform the following steps:
 >    - Parse the command and arguments
 >    - Load the required files
@@ -31,11 +50,39 @@ $ARGUMENTS
 ## Execution Instructions

+**[Blocking check]** Before starting, confirm that the following conditions hold:
+
+```
+□ The upstream solution design stage is complete (check state.json)
+□ design.md exists and is complete
+□ Ready to run Layer1 validation
+```
+
+**[Skip detection]** The following behaviors are detected and blocked:
+
+| Behavior | Detection method | Handling |
+|------|----------|------|
+| Coding directly without a design document | Check whether design.md exists | Block; return to solution design |
+| Skipping Layer1 validation | Check the build/test logs | Block; require validation to run |
+| Skipping the security review (Skill/Agent code) | Check the security review records | Block; require the security review to run |
+
+**[Prerequisite] Dynamic agent team formation (enhanced):**
+1. Read `knowledge/05-config/03-role-composition.yaml`
+2. Check the AI/data requirements flagged during the propose stage
+3. Select the engineering team by complexity:
+   - Simple: 1 Engineer
+   - Medium: 2-3 Engineers
+   - Complex: + QA + Security
+4. **AI Native awareness**: detect whether the code involves Prompt/RAG/Agent/model
+   - AI involved → ai-engineer must join the coding work
+5. **Data awareness**: detect whether data schema or pipeline changes are involved
+   - Data involved → data-engineer must join the coding work
+
 ### Step 1: Code Implementation

 - Check upstream dependencies (confirm the solution design is complete)
 - **[Mandatory] Read code-implementer and follow it**
-  - Use the Read tool to read `knowledge/guides/
+  - Use the Read tool to read `knowledge/03-guides/04-implement-code.md`
 - Search the existing codebase for reusable components
 - Implement module by module as laid out in the design
 - **[Mandatory] Run Layer1 validation**
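The team-formation rules added in this hunk map a complexity tier plus two detection flags to a set of roles. A minimal Python sketch of that mapping follows. It assumes the tier arrives as one of the strings `simple`, `medium`, or `complex`. The function name and the role labels `engineer`, `qa`, and `security` are hypothetical; only `ai-engineer` and `data-engineer` appear in the source. The real rules live in `03-role-composition.yaml`, which this diff does not show.

```python
def compose_team(complexity, involves_ai=False, involves_data=False):
    """Return role names for the coding stage (illustrative mapping only)."""
    if complexity == "simple":
        team = ["engineer"]
    elif complexity == "medium":
        team = ["engineer", "engineer"]  # the source allows 2-3; 2 is an arbitrary pick
    elif complexity == "complex":
        team = ["engineer", "engineer", "qa", "security"]
    else:
        raise ValueError(f"unknown complexity tier: {complexity!r}")
    if involves_ai:
        team.append("ai-engineer")  # mandatory when AI code is detected
    if involves_data:
        team.append("data-engineer")  # mandatory when data changes are detected
    return team
```

For example, `compose_team("simple", involves_ai=True)` yields one engineer plus `ai-engineer`, reflecting that the AI and data rules apply at every complexity tier.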
@@ -47,9 +94,15 @@ $ARGUMENTS
 ### Step 2: Code Review

-
+**[Recommended] First invoke team-orchestrator for a multi-role review**
+- Use the Agent tool with subagent_type=team-orchestrator
+- Pass the code under review: the list of changed files
+- Pass the current stage: implementation
+- **Efficiency optimization**: simple tasks may use fewer roles
+
+**[Mandatory] Then invoke the Layer2 independent review**
 - Use the Agent tool with subagent_type=independent-reviewer
-- Pass the review standard: `knowledge/guides/
+- Pass the review standard: `knowledge/03-guides/11-review-code.md`
 - Pass the code under review: the list of changed files
 - Wait for the review to finish before continuing
@@ -57,16 +110,16 @@ $ARGUMENTS
 - Review passes → proceed to Step 2.5 (security review)
 - Review fails → run the fix loop (at most 3 times):
   1. Extract the reviewer's "must fix (blocking)" list
-  2. Read code
+  2. Read 04-implement-code.md and apply the fixes
   3. Re-run Layer1 validation
   4. Re-invoke the Layer2 review
-  5. Still failing after the 3-attempt limit →
+  5. Still failing after the 3-attempt limit → invoke failure-diagnostician for root-cause analysis

 ### Step 2.5: Security Review [Mandatory]

 **[Mandatory] Must run when the code contains Skill/Agent-related implementation**:
 - Use the Agent tool with subagent_type=independent-reviewer
-- Pass the review standard: `knowledge/guides/
+- Pass the review standard: `knowledge/03-guides/13-audit-security.md`
 - Pass the code under review: the list of changed files
 - Wait for the review to finish before continuing
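The fix loop in this hunk (extract blocking issues, fix, re-validate, re-review, escalate after three attempts) can be sketched in Python as below. This is an illustration under assumptions, not the package's implementation: the four callables are hypothetical stand-ins for the reviewer, the fixer, Layer1 validation, and failure-diagnostician, and the return strings are invented labels.

```python
MAX_FIX_ATTEMPTS = 3  # the "at most 3 times" limit from the instructions


def run_fix_loop(review, apply_fixes, validate, diagnose):
    """Fix and re-review until the review passes or the attempts run out.

    All four callables are hypothetical stand-ins:
      review()         -> list of blocking issues (empty list means pass)
      apply_fixes(iss) -> applies fixes for the given issues
      validate()       -> True when Layer1 validation passes
      diagnose(iss)    -> root-cause analysis once the limit is reached
    """
    issues = review()
    for _ in range(MAX_FIX_ATTEMPTS):
        if not issues:
            return "passed"
        apply_fixes(issues)
        if validate():
            # Only re-invoke the Layer2 review once Layer1 is green again.
            issues = review()
    if not issues:
        return "passed"
    diagnose(issues)
    return "escalated"
```

An attempt whose Layer1 validation fails still counts toward the limit in this sketch; the source does not say whether it should, so that choice is an assumption.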
@@ -79,22 +132,22 @@ $ARGUMENTS
 **[Mandatory] Must be executed in order:**

 1. **[Mandatory] Read test-context-analyzer.md**
-   - Use the Read tool to read `knowledge/guides/
+   - Use the Read tool to read `knowledge/03-guides/17-analyze-test-context.md`
    - Learn the existing test style

 2. **[Mandatory] Read test-generator and follow it**
-   - Use the Read tool to read `knowledge/guides/
+   - Use the Read tool to read `knowledge/03-guides/06-generate-tests.md`
    - Generate test cases and test code

 3. **[Mandatory] Invoke the Layer2 test review**
    - Use the Agent tool with subagent_type=test-coverage-reviewer
-   - Pass the review standard: `knowledge/guides/
+   - Pass the review standard: `knowledge/03-guides/12-review-testing.md`

 4. **[Mandatory] Handle the review result**
    - Review passes → proceed to step 4
    - Review fails → run the fix loop (at most 3 times):
      1. Extract the reviewer's "must fix (blocking)" list
-     2. Read
+     2. Read 06-generate-tests.md and apply the fixes
      3. Re-run Layer1 validation
      4. Re-invoke the Layer2 review
      5. Still failing after the 3-attempt limit → record the issue and flag it for human intervention
@@ -120,7 +173,7 @@ $ARGUMENTS
 Project directory: {project_path}
 ```
-- Load the unit-test-runner skill: `knowledge/guides/
+- Load the unit-test-runner skill: `knowledge/03-guides/15-run-unit-tests.md`
 - Run multiple test files in parallel (for efficiency)
 - Wait for all unit tests to finish
@@ -137,7 +190,7 @@ $ARGUMENTS
 Project directory: {project_path}
 ```
-- Load the integration-test-runner skill: `knowledge/guides/
+- Load the integration-test-runner skill: `knowledge/03-guides/16-run-integration-tests.md`
 - Intelligently detect integration tests in the project (not limited to specific directories)
 - Run the integration tests
 - Tests fail → fix loop (at most 3 times)
@@ -155,7 +208,7 @@ $ARGUMENTS
 Project directory: {project_path}
 ```
-- Load the ai-effect-evaluator skill: `knowledge/guides/
+- Load the ai-effect-evaluator skill: `knowledge/03-guides/08-evaluate-ai-effect.md`
 - Run the effect evaluation
 - Use the evaluation result as a quality gate
@@ -169,7 +222,7 @@ $ARGUMENTS
 ### Step 4: Effect Evaluation [Mandatory] (if AI/model components are present)

 **[Mandatory] Must run:**
-- Read `knowledge/guides/
+- Read `knowledge/03-guides/37-plan-ai-evaluation.md`
 - Build the evaluation dataset
 - Develop the evaluation scripts
 - Run the evaluation and generate a report
@@ -178,7 +231,40 @@ $ARGUMENTS
 - Level 1: inline retry
 - Level 2: correction loop (at most 3 times)
-- Level 3: escalate to a human
+- Level 3: escalate to a human → invoke failure-diagnostician for root-cause analysis
+
+**[Recommended] Trigger failure-diagnostician when the fix loop has failed 3 times**:
+- Use the Agent tool with subagent_type=failure-diagnostician
+- Pass the failure context: fix history, error messages, current state
+- Analyze the root cause and propose fixes
+
+### Efficiency Optimization Settings
+
+**Dynamic formation selection**:
+- Simple tasks (complexity ≤ 3): use fewer roles with the simple formation
+- AI projects: use the ai-focused formation, which strengthens the ai-engineer role
+- Security-sensitive projects: use the security formation, which strengthens the security-engineer role
+
+**Quality red lines (cannot be skipped)**:
+- Security-related code → must include safety-auditor
+- User data involved → must include security-engineer
+- AI components → must include ai-engineer in the effect evaluation
+- Before delivery → must include consistency-checker
+
+**[Stage completion check]** After the code implementation stage finishes, verify:
+
+```
+Checklist:
+□ Code is implemented according to the design
+□ Layer1 validation passes (build/tests/lint)
+□ Layer2 review passes (independent-reviewer)
+□ Security review passes (if there is Skill/Agent code)
+□ Test coverage meets the requirement
+```
+
+**If verification fails**:
+- Layer1 fails → run the fix loop
+- Insufficient test coverage → add tests

 ### Update state.json