prizmkit 1.1.7 → 1.1.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (132) hide show
  1. package/bundled/VERSION.json +3 -3
  2. package/bundled/adapters/codebuddy/skill-adapter.js +21 -7
  3. package/bundled/agents/prizm-dev-team-reviewer.md +53 -173
  4. package/bundled/dev-pipeline/.env.example +45 -0
  5. package/bundled/dev-pipeline/README.md +64 -64
  6. package/bundled/dev-pipeline/SCHEMA_ANALYSIS.md +535 -0
  7. package/bundled/dev-pipeline/assets/feature-list-example.json +0 -1
  8. package/bundled/dev-pipeline/launch-bugfix-daemon.sh +64 -18
  9. package/bundled/dev-pipeline/launch-feature-daemon.sh +15 -12
  10. package/bundled/dev-pipeline/launch-refactor-daemon.sh +64 -18
  11. package/bundled/dev-pipeline/lib/branch.sh +6 -1
  12. package/bundled/dev-pipeline/lib/common.sh +71 -0
  13. package/bundled/dev-pipeline/lib/heartbeat.sh +2 -2
  14. package/bundled/dev-pipeline/reset-bug.sh +10 -9
  15. package/bundled/dev-pipeline/reset-feature.sh +9 -8
  16. package/bundled/dev-pipeline/reset-refactor.sh +10 -9
  17. package/bundled/dev-pipeline/retry-bugfix.sh +67 -29
  18. package/bundled/dev-pipeline/retry-feature.sh +54 -18
  19. package/bundled/dev-pipeline/retry-refactor.sh +112 -29
  20. package/bundled/dev-pipeline/run-bugfix.sh +281 -59
  21. package/bundled/dev-pipeline/run-feature.sh +53 -18
  22. package/bundled/dev-pipeline/run-refactor.sh +392 -66
  23. package/bundled/dev-pipeline/scripts/check-session-status.py +24 -1
  24. package/bundled/dev-pipeline/scripts/cleanup-logs.py +2 -2
  25. package/bundled/dev-pipeline/scripts/detect-stuck.py +195 -85
  26. package/bundled/dev-pipeline/scripts/generate-bootstrap-prompt.py +57 -33
  27. package/bundled/dev-pipeline/scripts/generate-bugfix-prompt.py +25 -9
  28. package/bundled/dev-pipeline/scripts/generate-refactor-prompt.py +104 -17
  29. package/bundled/dev-pipeline/scripts/init-bugfix-pipeline.py +34 -9
  30. package/bundled/dev-pipeline/scripts/init-pipeline.py +10 -10
  31. package/bundled/dev-pipeline/scripts/init-refactor-pipeline.py +19 -8
  32. package/bundled/dev-pipeline/scripts/parse-stream-progress.py +1 -5
  33. package/bundled/dev-pipeline/scripts/patch-completion-notes.py +191 -0
  34. package/bundled/dev-pipeline/scripts/update-bug-status.py +167 -22
  35. package/bundled/dev-pipeline/scripts/update-feature-status.py +104 -62
  36. package/bundled/dev-pipeline/scripts/update-refactor-status.py +351 -21
  37. package/bundled/dev-pipeline/templates/agent-prompts/dev-fix.md +1 -1
  38. package/bundled/dev-pipeline/templates/agent-prompts/reviewer-review.md +7 -11
  39. package/bundled/dev-pipeline/templates/bootstrap-prompt.md +41 -7
  40. package/bundled/dev-pipeline/templates/bootstrap-tier1.md +27 -3
  41. package/bundled/dev-pipeline/templates/bootstrap-tier2.md +43 -19
  42. package/bundled/dev-pipeline/templates/bootstrap-tier3.md +54 -26
  43. package/bundled/dev-pipeline/templates/bug-fix-list-schema.json +6 -15
  44. package/bundled/dev-pipeline/templates/bugfix-bootstrap-prompt.md +36 -25
  45. package/bundled/dev-pipeline/templates/feature-list-schema.json +109 -31
  46. package/bundled/dev-pipeline/templates/refactor-bootstrap-prompt.md +270 -0
  47. package/bundled/dev-pipeline/templates/refactor-list-schema.json +11 -3
  48. package/bundled/dev-pipeline/templates/sections/context-budget-rules.md +3 -1
  49. package/bundled/dev-pipeline/templates/sections/critical-paths-agent.md +1 -0
  50. package/bundled/dev-pipeline/templates/sections/feature-context.md +2 -0
  51. package/bundled/dev-pipeline/templates/sections/phase-commit-full.md +29 -2
  52. package/bundled/dev-pipeline/templates/sections/phase-commit.md +22 -0
  53. package/bundled/dev-pipeline/templates/sections/phase-deploy-verification.md +2 -2
  54. package/bundled/dev-pipeline/templates/sections/phase-review-agent.md +8 -6
  55. package/bundled/dev-pipeline/templates/sections/phase-review-full.md +7 -5
  56. package/bundled/dev-pipeline/templates/sections/phase-specify-plan-full.md +3 -3
  57. package/bundled/skills/_metadata.json +5 -22
  58. package/bundled/skills/app-planner/SKILL.md +98 -72
  59. package/bundled/skills/app-planner/assets/app-design-guide.md +1 -1
  60. package/bundled/skills/app-planner/references/architecture-decisions.md +1 -1
  61. package/bundled/skills/app-planner/references/project-brief-guide.md +69 -66
  62. package/bundled/skills/bug-fix-workflow/SKILL.md +52 -9
  63. package/bundled/skills/bug-planner/SKILL.md +139 -197
  64. package/bundled/skills/bug-planner/assets/bug-confirmation-template.md +43 -0
  65. package/bundled/skills/bug-planner/references/critic-and-verification.md +44 -0
  66. package/bundled/skills/bug-planner/references/error-recovery.md +73 -0
  67. package/bundled/skills/bug-planner/references/input-formats.md +53 -0
  68. package/bundled/skills/bug-planner/references/schema-validation.md +25 -0
  69. package/bundled/skills/bug-planner/references/severity-rules.md +16 -0
  70. package/bundled/skills/bug-planner/scripts/validate-bug-list.py +4 -8
  71. package/bundled/skills/bugfix-pipeline-launcher/SKILL.md +34 -39
  72. package/bundled/skills/feature-pipeline-launcher/SKILL.md +49 -36
  73. package/bundled/skills/feature-pipeline-launcher/scripts/preflight-check.py +3 -3
  74. package/bundled/skills/feature-planner/SKILL.md +53 -142
  75. package/bundled/skills/feature-planner/assets/evaluation-guide.md +1 -1
  76. package/bundled/skills/feature-planner/assets/planning-guide.md +21 -5
  77. package/bundled/skills/feature-planner/references/browser-interaction.md +2 -4
  78. package/bundled/skills/feature-planner/references/completeness-review.md +57 -0
  79. package/bundled/skills/feature-planner/references/error-recovery.md +16 -35
  80. package/bundled/skills/feature-planner/references/incremental-feature-planning.md +1 -1
  81. package/bundled/skills/feature-planner/references/new-project-planning.md +2 -2
  82. package/bundled/skills/feature-planner/scripts/validate-and-generate.py +19 -20
  83. package/bundled/skills/feature-workflow/SKILL.md +24 -25
  84. package/bundled/skills/prizm-kit/SKILL.md +39 -49
  85. package/bundled/skills/prizmkit-code-review/SKILL.md +51 -64
  86. package/bundled/skills/prizmkit-code-review/rules/dimensions.md +85 -0
  87. package/bundled/skills/prizmkit-code-review/rules/fix-strategy.md +11 -11
  88. package/bundled/skills/prizmkit-committer/SKILL.md +3 -31
  89. package/bundled/skills/prizmkit-deploy/SKILL.md +34 -31
  90. package/bundled/skills/prizmkit-deploy/assets/deploy-template.md +1 -1
  91. package/bundled/skills/prizmkit-implement/SKILL.md +35 -68
  92. package/bundled/skills/prizmkit-init/SKILL.md +112 -65
  93. package/bundled/skills/prizmkit-init/assets/project-brief-template.md +82 -0
  94. package/bundled/skills/prizmkit-plan/SKILL.md +120 -79
  95. package/bundled/skills/prizmkit-plan/assets/plan-template.md +28 -18
  96. package/bundled/skills/prizmkit-plan/assets/spec-template.md +28 -11
  97. package/bundled/skills/prizmkit-plan/references/clarify-guide.md +3 -3
  98. package/bundled/skills/prizmkit-plan/references/verification-checklist.md +60 -0
  99. package/bundled/skills/prizmkit-prizm-docs/SKILL.md +10 -81
  100. package/bundled/skills/prizmkit-prizm-docs/assets/{PRIZM-SPEC.md → prizm-docs-format.md} +41 -526
  101. package/bundled/skills/prizmkit-prizm-docs/references/op-init.md +46 -0
  102. package/bundled/skills/prizmkit-prizm-docs/references/op-rebuild.md +16 -0
  103. package/bundled/skills/prizmkit-prizm-docs/references/op-status.md +14 -0
  104. package/bundled/skills/prizmkit-prizm-docs/references/op-update.md +19 -0
  105. package/bundled/skills/prizmkit-prizm-docs/references/op-validate.md +17 -0
  106. package/bundled/skills/prizmkit-retrospective/SKILL.md +27 -65
  107. package/bundled/skills/prizmkit-retrospective/references/knowledge-injection-steps.md +3 -4
  108. package/bundled/skills/prizmkit-retrospective/references/structural-sync-steps.md +7 -25
  109. package/bundled/skills/recovery-workflow/SKILL.md +22 -22
  110. package/bundled/skills/recovery-workflow/evals/evals.json +5 -5
  111. package/bundled/skills/recovery-workflow/scripts/detect-recovery-state.py +43 -10
  112. package/bundled/skills/refactor-pipeline-launcher/SKILL.md +48 -40
  113. package/bundled/skills/refactor-planner/SKILL.md +43 -61
  114. package/bundled/skills/refactor-planner/scripts/validate-and-generate-refactor.py +17 -17
  115. package/bundled/skills/refactor-workflow/SKILL.md +23 -24
  116. package/bundled/team/prizm-dev-team.json +1 -1
  117. package/bundled/{skills/prizm-kit/assets → templates}/project-memory-template.md +1 -1
  118. package/package.json +1 -1
  119. package/src/clean.js +3 -4
  120. package/src/gitignore-template.js +7 -9
  121. package/src/scaffold.js +14 -5
  122. package/bundled/dev-pipeline/templates/agent-prompts/reviewer-analyze.md +0 -5
  123. package/bundled/dev-pipeline/templates/sections/phase-analyze-agent.md +0 -19
  124. package/bundled/dev-pipeline/templates/sections/phase-analyze-full.md +0 -19
  125. package/bundled/skills/app-planner/references/project-conventions.md +0 -93
  126. package/bundled/skills/prizmkit-analyze/SKILL.md +0 -207
  127. package/bundled/skills/prizmkit-code-review/rules/dimensions-bugfix.md +0 -25
  128. package/bundled/skills/prizmkit-code-review/rules/dimensions-feature.md +0 -43
  129. package/bundled/skills/prizmkit-code-review/rules/dimensions-refactor.md +0 -25
  130. package/bundled/skills/prizmkit-implement/references/deploy-guide-protocol.md +0 -69
  131. package/bundled/skills/prizmkit-verify/SKILL.md +0 -281
  132. package/bundled/skills/prizmkit-verify/scripts/verify-light.py +0 -402
@@ -1,43 +0,0 @@
1
- # Review Dimensions — Feature Mode
2
-
3
- Review code against `spec.md` and `plan.md` across 6 dimensions:
4
-
5
- ## 1. Spec Compliance
6
- Does code implement all acceptance criteria? Missing criteria are the #1 source of "it works but it's wrong" bugs.
7
- - Check every acceptance criterion in spec.md has a corresponding implementation
8
- - Verify edge cases mentioned in spec are handled
9
- - Confirm scope boundaries are respected (no over-implementation, no under-implementation)
10
-
11
- ## 2. Plan Adherence
12
- Does implementation follow architectural decisions in plan.md? Deviations may be improvements or may break assumptions other components depend on.
13
- - Check component structure matches plan's architecture approach
14
- - Verify data model matches plan's schema design
15
- - Confirm API contracts (endpoints, request/response) match plan
16
-
17
- ## 3. Code Quality
18
- Naming, structure, complexity, DRY. Focus on maintainability — will someone understand this code in 6 months?
19
- - Function/variable names are descriptive and consistent with project conventions
20
- - No unnecessary complexity (cyclomatic complexity, deep nesting)
21
- - No copy-paste duplication that should be abstracted
22
- - Error messages are informative for debugging
23
-
24
- ## 4. Security
25
- Injection (SQL, XSS, command), auth/authz gaps, sensitive data exposure, insecure defaults. Security issues are always HIGH+ because they're the hardest to catch later.
26
- - User input is validated and sanitized before use in queries, HTML, or commands
27
- - Authentication and authorization checks are present on protected routes
28
- - Sensitive data (passwords, tokens, PII) is not logged or exposed in responses
29
- - Cryptographic operations use established libraries, not custom implementations
30
-
31
- ## 5. Consistency
32
- Follows project patterns from `.prizm-docs/` PATTERNS section. Inconsistent patterns increase cognitive load for every future reader.
33
- - Code style matches existing codebase conventions
34
- - Error handling follows established patterns
35
- - File organization follows project structure conventions
36
- - Naming conventions align with `.prizm-docs/` RULES
37
-
38
- ## 6. Test Coverage
39
- Are critical paths tested? Focus on paths that handle user input, money, or state transitions.
40
- - Happy path tests exist for each user story
41
- - Error/edge case tests for critical paths
42
- - Tests are deterministic (no flaky timing dependencies)
43
- - Test names clearly describe what they verify
@@ -1,25 +0,0 @@
1
- # Review Dimensions — Refactor Mode
2
-
3
- Review code against `refactor-analysis.md` goals. Standard dimensions (code quality, security, consistency, test coverage) still apply — load `${SKILL_DIR}/rules/dimensions-feature.md` §3–§6 for those.
4
-
5
- ## Refactor-Specific Dimensions
6
-
7
- ### Behavior Preservation (replaces Spec Compliance)
8
- Observable behavior must remain unchanged. This is the #1 refactor risk — "improving" code that subtly changes behavior.
9
- - All existing tests still pass without modification (test changes during refactor are a red flag)
10
- - Public API signatures are unchanged (parameter types, return types, error types)
11
- - Side effects (logging, metrics, events) are preserved unless explicitly scoped for removal
12
- - Edge case handling is preserved — refactors often silently drop edge case branches
13
-
14
- ### Structural Improvement (replaces Plan Adherence)
15
- Is the code measurably better against the refactor goals?
16
- - Complexity metrics improved (fewer nested conditions, shorter functions)
17
- - Coupling reduced (fewer cross-module imports, clearer boundaries)
18
- - Duplication reduced (DRY violations eliminated per refactor-analysis.md scope)
19
- - Readability improved (naming, organization, documentation)
20
-
21
- ### Test Integrity
22
- Refactors must not weaken the test suite.
23
- - No tests were deleted or skipped
24
- - Test coverage percentage is equal or higher
25
- - If tests were rewritten, they cover the same behavioral scenarios
@@ -1,69 +0,0 @@
1
- # Deploy Guide Update Protocol
2
-
3
- When dependency manifests change during implementation, update `DEPLOY.md` at the project root.
4
-
5
- ## Detection
6
-
7
- 1. Check if any dependency manifests were modified in this session:
8
- ```bash
9
- git diff --name-only HEAD -- package.json requirements*.txt Pipfile pyproject.toml go.mod Cargo.toml pom.xml build.gradle Gemfile composer.json docker-compose*.yml Dockerfile .tool-versions 2>/dev/null
10
- ```
11
- 2. If no manifest files changed → skip this step entirely
12
- 3. If manifest files changed, scan for **newly added** dependencies (not version bumps):
13
- ```bash
14
- git diff -U0 HEAD -- package.json requirements*.txt Pipfile pyproject.toml go.mod Cargo.toml pom.xml build.gradle Gemfile composer.json docker-compose*.yml Dockerfile .tool-versions 2>/dev/null | grep '^\+' | grep -v '^\+\+\+' | head -30
15
- ```
16
-
17
- ## Recording
18
-
19
- For each genuinely new framework/tool, record in `DEPLOY.md` at project root:
20
-
21
- | Field | Description | Source |
22
- |-------|-------------|--------|
23
- | **Name** | Framework/tool name | Package name from manifest |
24
- | **Version** | Installed version or constraint | Version spec from manifest |
25
- | **Purpose** | Why it was introduced | You just added it — you know why |
26
- | **Install Command** | How to install locally | Standard install command for the ecosystem |
27
- | **Key Config** | Config files or env vars needed | Config files you just created/modified |
28
- | **Notes** | Setup gotchas, required services | Docker services, manual steps, env vars |
29
-
30
- ## Template for `DEPLOY.md`
31
-
32
- ```markdown
33
- # Deploy Guide
34
-
35
- > Auto-maintained by PrizmKit. Manual edits are preserved.
36
- > Last updated: YYYY-MM-DD
37
-
38
- ## Frameworks & Tools
39
-
40
- ### <Framework Name>
41
-
42
- - **Version**: <version constraint>
43
- - **Purpose**: <why this framework is used>
44
- - **Install**:
45
- ```bash
46
- <install command>
47
- ```
48
- - **Key Config**:
49
- - `<config file or env var>`: <description>
50
- - **Notes**:
51
- - <any setup gotchas, required external services, manual steps>
52
- ```
53
-
54
- ## Update Rules
55
-
56
- - Create file if absent; append new sections if file exists
57
- - Update version if framework already documented
58
- - Preserve manually added content
59
- - Keep entries sorted alphabetically
60
-
61
- ## Filter Out
62
-
63
- - Patch version bumps of existing deps
64
- - Dev-only tools needing no setup (linters, formatters)
65
- - Transitive/lock-file-only changes
66
-
67
- ## Final Step
68
-
69
- Stage the file: `git add DEPLOY.md`
@@ -1,281 +0,0 @@
1
- ---
2
- name: "prizmkit-verify"
3
- description: "Automated framework verification for PrizmKit's 4-layer architecture (Skill → Pipeline → Workflow → Script). Use this skill to validate structural integrity, contract consistency between layers, cross-scenario alignment, and adapter/installer correctness. Trigger whenever: verifying framework health, checking skill contracts, auditing layer integration, running pre-release checks, or after significant framework modifications."
4
- ---
5
-
6
- # PrizmKit Framework Verification
7
-
8
- Verify PrizmKit's four-layer architecture for structural integrity, contract consistency, and cross-scenario alignment.
9
-
10
- ## Execution Modes
11
-
12
- Parse the user's arguments to determine scope and depth:
13
-
14
- | Command | What Runs | When to Use |
15
- |---------|-----------|-------------|
16
- | `/prizmkit-verify light` | Light checks only (R1-R4) | Quick CI gate, seconds |
17
- | `/prizmkit-verify medium` | Light + Medium (R1-R4) | Regular development, recommended default |
18
- | `/prizmkit-verify full` | All rounds (R1-R6) including E2E | Pre-release, after major changes |
19
- | `/prizmkit-verify --round R1` | Specific round only | Targeted check after editing a layer |
20
- | `/prizmkit-verify` (no args) | Same as `medium` | Default |
21
-
22
- ## Architecture Context
23
-
24
- PrizmKit has four layers, bottom to top:
25
-
26
- ```
27
- L1 Skill layer → core/skills/prizmkit-skill/ (10 skills, dev lifecycle)
28
- L2 Pipeline layer → core/skills/orchestration-skill/pipelines/ (7 skills, planners + launchers)
29
- L3 Workflow layer → core/skills/orchestration-skill/workflows/ (4 skills, scenario entry points)
30
- L4 Script layer → dev-pipeline/ (Bash + Python runtime engine)
31
- ```
32
-
33
- Three verification depths, cumulative:
34
- - **Light**: Structural/static checks, scriptable, zero tokens
35
- - **Medium**: AI reads SKILL.md content, verifies inter-layer contracts
36
- - **Deep**: End-to-end scenario traces through all 4 layers
37
-
38
- ## Execution
39
-
40
- ### Phase 0: Setup
41
-
42
- 1. Create output directory: `.prizmkit/verify/` (timestamp-based run dir)
43
- 2. Determine which rounds to run based on user args
44
- 3. Initialize the report structure
45
-
46
- ### Phase 1: Horizontal Verification (R1-R4)
47
-
48
- Run rounds sequentially. For each round, run Light first, then Medium if in scope.
49
-
50
- ---
51
-
52
- #### R1: L1 Skill Layer
53
-
54
- **Target**: All skills in `core/skills/prizmkit-skill/` (expect ~10)
55
-
56
- **Light checks** — run `${SKILL_DIR}/scripts/verify-light.py --round R1` if available, otherwise check manually:
57
-
58
- | Check | How | Pass Criteria |
59
- |-------|-----|---------------|
60
- | Frontmatter validity | Read each SKILL.md YAML header | `name` + `description` present; `name` is lowercase-hyphens; `name` matches directory name |
61
- | Asset reference integrity | Grep `${SKILL_DIR}` in each SKILL.md, resolve path relative to skill dir | Every referenced file exists on disk |
62
- | Cross-skill references | Grep `/prizmkit-*` references in each SKILL.md | Every referenced skill exists under `core/skills/` |
63
- | Orphaned assets | List files in each skill's `assets/`, `scripts/`, `references/` dirs | Every file is referenced by SKILL.md (warn if not) |
64
- | Lint: ESLint | Run `npm run lint` | 0 errors (warnings OK) |
65
- | Lint: Ruff | Run `npm run lint:py` | 0 errors |
66
-
67
- **Medium checks** — read each SKILL.md FULLY and verify artifact chain contracts:
68
-
69
- For each adjacent skill pair in the chain:
70
- `prizmkit-plan → prizmkit-analyze → prizmkit-implement → prizmkit-code-review → prizmkit-retrospective → prizmkit-committer → prizmkit-deploy`
71
-
72
- | Verification Point | What to Check |
73
- |-------------------|---------------|
74
- | Output format defined? | Does the producer skill explicitly describe its output structure (file name, format, key sections)? |
75
- | Input expectation defined? | Does the consumer skill explicitly state what it reads and in what format? |
76
- | Format match | Do output and input specifications agree on structure, field names, file paths? |
77
- | Path convention | Do both skills reference the same directory convention (e.g., `.prizmkit/specs/###-feature-name/`)? |
78
- | Error handling | What happens if upstream artifact is missing or malformed? Is fallback defined? |
79
- | Quality gate | Does each consumer check preconditions before proceeding? |
80
-
81
- Record each pair as: `PASS` / `WARN (gap described)` / `FAIL (mismatch described)`
82
-
83
- ---
84
-
85
- #### R2: L2 Pipeline Layer
86
-
87
- **Target**: All skills in `core/skills/orchestration-skill/pipelines/` (expect 7)
88
-
89
- **Light checks**:
90
-
91
- | Check | How | Pass Criteria |
92
- |-------|-----|---------------|
93
- | Frontmatter validity | Same as R1 | `name` + `description`; name matches dir |
94
- | Asset reference integrity | Same as R1 | All `${SKILL_DIR}` refs resolve |
95
- | Planner→Launcher coupling | For each scenario (feature/bugfix/refactor): check planner mentions correct JSON filename, launcher references correct script | All 3 pairs coupled correctly |
96
- | Script path references | Launchers reference `dev-pipeline/run-*.sh` | All referenced scripts exist and are executable |
97
- | Schema consistency | Compare planner SKILL.md output schema with `dev-pipeline/templates/*-list-schema.json` | Field names, types, required fields match |
98
- | Schema version match | Compare `dev-pipeline/templates/*-list-schema.json` const value with `dev-pipeline/scripts/init-*-pipeline.py` EXPECTED_SCHEMA | Versions identical |
99
-
100
- **Medium checks** — read each SKILL.md FULLY:
101
-
102
- For each scenario (feature, bugfix, refactor):
103
- 1. What JSON format does the planner produce? What fields, what validation?
104
- 2. What JSON format does the launcher expect? What fields does it read?
105
- 3. Do they match? Any fields the launcher expects but planner doesn't produce?
106
- 4. What command does the launcher assemble? What env vars? Do they match `run-*.sh` expectations?
107
-
108
- ---
109
-
110
- #### R3: L3 Workflow Layer
111
-
112
- **Target**: All skills in `core/skills/orchestration-skill/workflows/` (expect 4)
113
-
114
- **Light checks**:
115
-
116
- | Check | How | Pass Criteria |
117
- |-------|-----|---------------|
118
- | Frontmatter validity | Same as R1 | `name` + `description`; name matches dir |
119
- | Pipeline skill references | Grep each workflow for planner/launcher skill names | All referenced pipeline skills exist |
120
- | Checkpoint naming | Grep for `CP-` patterns | Follows `CP-{XW}-{N}` convention |
121
- | Flow completeness markers | Each workflow has: When to Use, Execution phases, Error handling table, Relationship table | All sections present |
122
-
123
- **Medium checks** — read each SKILL.md FULLY:
124
-
125
- | Verification Point | What to Check |
126
- |-------------------|---------------|
127
- | Workflow → Pipeline handoff | What does workflow pass to each pipeline skill? Is the input format documented? |
128
- | Flow completeness | feature-workflow: requirements→planning→launch→monitor? bug-fix-workflow: diagnosis→triage→fix→review→commit? refactor-workflow: goals→planning→launch→monitor? recovery-workflow: detect→diagnose→recover? |
129
- | Fast path integration | If fast path exists, is there a decision tree for when to use it? |
130
- | Cross-workflow consistency | Same checkpoint naming, phase structure, error handling patterns? |
131
-
132
- ---
133
-
134
- #### R4: L4 Script Layer
135
-
136
- **Target**: `dev-pipeline/` directory
137
-
138
- **Light checks**:
139
-
140
- | Check | How | Pass Criteria |
141
- |-------|-----|---------------|
142
- | Shell scripts executable | `ls -la dev-pipeline/*.sh` | All `.sh` files have `+x` permission |
143
- | Python scripts compile | `python3 -m py_compile` each `.py` file | All compile without error |
144
- | Bash lib sourcing | Grep `run-*.sh` for `source.*lib/` | Each sources common.sh, heartbeat.sh, branch.sh |
145
- | JSON templates valid | `python3 -c "import json; json.load(open(f))"` for each `.json` in `templates/` | All parse without error |
146
- | Ruff Python lint | `ruff check dev-pipeline/scripts/` | 0 errors |
147
- | Test file existence | List `dev-pipeline/tests/` | At least test files exist for prompt generation |
148
-
149
- **Medium checks** — read key scripts:
150
-
151
- | Verification Point | What to Check |
152
- |-------------------|---------------|
153
- | Shell → Python integration | For each `run-*.sh`: list all Python scripts invoked, verify each exists, verify argument names match |
154
- | Bootstrap prompt generation | Read `generate-bootstrap-prompt.py` (or per-scenario variant): does it reference correct skill names? Are all `{{PLACEHOLDER}}` variables populated? |
155
- | State management | Read `init-*-pipeline.py`: correct schema validation? Proper state dir initialization? |
156
- | Cross-scenario consistency | Compare `run-feature.sh` vs `run-bugfix.sh` vs `run-refactor.sh`: same structure? Same error handling patterns? Note intentional divergences vs unintentional gaps |
157
-
158
- ---
159
-
160
- ### Phase 2: Vertical E2E Verification (R5-R6)
161
-
162
- Only runs in `full` mode. For each scenario, trace data flow through all 4 layers.
163
-
164
- #### R5: Feature Scenario
165
-
166
- Trace "Add user authentication with JWT" through:
167
-
168
- 1. **L3→L2**: feature-workflow → feature-planner → feature-pipeline-launcher
169
- - What format does workflow pass to planner?
170
- - What JSON does planner produce?
171
- - What command does launcher assemble?
172
-
173
- 2. **L2→L4**: feature-pipeline-launcher → run-feature.sh
174
- - What args/env vars are passed?
175
- - How does init-pipeline.py consume feature-list.json?
176
- - What bootstrap prompt does generate-bootstrap-prompt.py produce?
177
-
178
- 3. **L4→L1**: run-feature.sh → AI CLI → Skills
179
- - What skills does the bootstrap prompt reference?
180
- - In what order?
181
- - Are skill names correct?
182
-
183
- At each transition, verify: **format match**, **field completeness**, **path consistency**, **error propagation**.
184
-
185
- #### R6: Bugfix + Refactor Scenarios
186
-
187
- Same trace for:
188
- - Bugfix: "Fix login crash when email is empty"
189
- - Refactor: "Extract authentication logic into shared module"
190
-
191
- Then cross-scenario comparison:
192
- - Layer transition patterns structurally similar?
193
- - Schema conventions consistent (F-NNN / B-NNN / R-NNN)?
194
- - Bootstrap prompt patterns consistent?
195
- - State management patterns consistent?
196
- - Intentional divergences documented?
197
-
198
- ---
199
-
200
- ### Phase 3: Supplementary Checks
201
-
202
- These address gaps identified in the first verification run. Include in `medium` and `full` modes.
203
-
204
- #### S1: Adapter Layer Verification
205
-
206
- | Check | How |
207
- |-------|-----|
208
- | Claude adapter output | Run `node adapters/claude/command-adapter.js` on a sample skill, verify output format |
209
- | CodeBuddy adapter output | Run `node adapters/codebuddy/skill-adapter.js` on same skill, verify output |
210
- | Cross-platform equivalence | Diff the two outputs for semantic equivalence |
211
- | ${SKILL_DIR} resolution | Verify Claude resolves to `.claude/command-assets/`, CodeBuddy to `.codebuddy/skills/` |
212
-
213
- #### S2: Installer & Distribution Verification
214
-
215
- | Check | How |
216
- |-------|-----|
217
- | _metadata.json completeness | All skills listed in `core/skills/_metadata.json` suites? |
218
- | Bundle sync | `npm run bundle && npm run bundle:verify` passes |
219
- | Skill count | Bundled skill count matches source skill count |
220
-
221
- #### S3: Rules Consistency
222
-
223
- | Check | How |
224
- |-------|-----|
225
- | Rules exist | All files in `core/rules/` referenced by `_rules-metadata.json` |
226
- | Rule-skill alignment | Rules reference skill names that exist |
227
-
228
- ---
229
-
230
- ## Report Format
231
-
232
- Generate the verification report at `.prizmkit/verify/verify-report-{timestamp}.md`:
233
-
234
- ```markdown
235
- # PrizmKit Verification Report
236
- **Date**: {date} **Mode**: {light|medium|full}
237
-
238
- ## Summary
239
- | Round | Layer | Light | Medium | Deep | Issues |
240
- |-------|-------|-------|--------|------|--------|
241
- | R1 | L1 Skill | PASS/WARN/FAIL | PASS/WARN/FAIL | — | count |
242
- | R2 | L2 Pipeline | ... | ... | — | count |
243
- | R3 | L3 Workflow | ... | ... | — | count |
244
- | R4 | L4 Script | ... | ... | — | count |
245
- | R5 | E2E Feature | — | — | PASS/WARN/FAIL | count |
246
- | R6 | E2E Bug+Refactor | — | — | PASS/WARN/FAIL | count |
247
- | S1 | Adapters | ... | ... | — | count |
248
- | S2 | Distribution | ... | ... | — | count |
249
- | S3 | Rules | ... | ... | — | count |
250
-
251
- ## Issues by Severity
252
- ### Critical
253
- - [C1] Description... (Round, Layer, Location)
254
-
255
- ### High
256
- - [H1] Description...
257
-
258
- ### Medium
259
- - [M1] Description...
260
-
261
- ## Cross-Scenario Comparison (full mode only)
262
- | Dimension | Feature | Bugfix | Refactor | Consistent? |
263
- |-----------|---------|--------|----------|-------------|
264
- | ... | ... | ... | ... | ✅/⚠️ |
265
-
266
- ## Recommendations
267
- 1. **P0**: ...
268
- 2. **P1**: ...
269
- ```
270
-
271
- ## Parallelization Strategy
272
-
273
- For efficiency, run independent checks concurrently:
274
- - R1 Light + R2 Light + R3 Light + R4 Light can run in parallel
275
- - R1 Medium depends on R1 Light passing
276
- - R5/R6 depend on R1-R4 completing (need full picture of each layer)
277
- - S1/S2/S3 can run in parallel with R1-R4
278
-
279
- ## Previous Verification Findings
280
-
281
- Reference `docs/framework-verification-report.md` for the baseline findings from the initial verification run. This helps track whether previously identified issues have been resolved.