prizmkit 1.1.7 → 1.1.9
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bundled/VERSION.json +3 -3
- package/bundled/adapters/codebuddy/skill-adapter.js +21 -7
- package/bundled/agents/prizm-dev-team-reviewer.md +53 -173
- package/bundled/dev-pipeline/.env.example +45 -0
- package/bundled/dev-pipeline/README.md +64 -64
- package/bundled/dev-pipeline/SCHEMA_ANALYSIS.md +535 -0
- package/bundled/dev-pipeline/assets/feature-list-example.json +0 -1
- package/bundled/dev-pipeline/launch-bugfix-daemon.sh +64 -18
- package/bundled/dev-pipeline/launch-feature-daemon.sh +15 -12
- package/bundled/dev-pipeline/launch-refactor-daemon.sh +64 -18
- package/bundled/dev-pipeline/lib/branch.sh +6 -1
- package/bundled/dev-pipeline/lib/common.sh +71 -0
- package/bundled/dev-pipeline/lib/heartbeat.sh +2 -2
- package/bundled/dev-pipeline/reset-bug.sh +10 -9
- package/bundled/dev-pipeline/reset-feature.sh +9 -8
- package/bundled/dev-pipeline/reset-refactor.sh +10 -9
- package/bundled/dev-pipeline/retry-bugfix.sh +67 -29
- package/bundled/dev-pipeline/retry-feature.sh +54 -18
- package/bundled/dev-pipeline/retry-refactor.sh +112 -29
- package/bundled/dev-pipeline/run-bugfix.sh +281 -59
- package/bundled/dev-pipeline/run-feature.sh +53 -18
- package/bundled/dev-pipeline/run-refactor.sh +392 -66
- package/bundled/dev-pipeline/scripts/check-session-status.py +24 -1
- package/bundled/dev-pipeline/scripts/cleanup-logs.py +2 -2
- package/bundled/dev-pipeline/scripts/detect-stuck.py +195 -85
- package/bundled/dev-pipeline/scripts/generate-bootstrap-prompt.py +57 -33
- package/bundled/dev-pipeline/scripts/generate-bugfix-prompt.py +25 -9
- package/bundled/dev-pipeline/scripts/generate-refactor-prompt.py +104 -17
- package/bundled/dev-pipeline/scripts/init-bugfix-pipeline.py +34 -9
- package/bundled/dev-pipeline/scripts/init-pipeline.py +10 -10
- package/bundled/dev-pipeline/scripts/init-refactor-pipeline.py +19 -8
- package/bundled/dev-pipeline/scripts/parse-stream-progress.py +1 -5
- package/bundled/dev-pipeline/scripts/patch-completion-notes.py +191 -0
- package/bundled/dev-pipeline/scripts/update-bug-status.py +167 -22
- package/bundled/dev-pipeline/scripts/update-feature-status.py +104 -62
- package/bundled/dev-pipeline/scripts/update-refactor-status.py +351 -21
- package/bundled/dev-pipeline/templates/agent-prompts/dev-fix.md +1 -1
- package/bundled/dev-pipeline/templates/agent-prompts/reviewer-review.md +7 -11
- package/bundled/dev-pipeline/templates/bootstrap-prompt.md +41 -7
- package/bundled/dev-pipeline/templates/bootstrap-tier1.md +27 -3
- package/bundled/dev-pipeline/templates/bootstrap-tier2.md +43 -19
- package/bundled/dev-pipeline/templates/bootstrap-tier3.md +54 -26
- package/bundled/dev-pipeline/templates/bug-fix-list-schema.json +6 -15
- package/bundled/dev-pipeline/templates/bugfix-bootstrap-prompt.md +36 -25
- package/bundled/dev-pipeline/templates/feature-list-schema.json +109 -31
- package/bundled/dev-pipeline/templates/refactor-bootstrap-prompt.md +270 -0
- package/bundled/dev-pipeline/templates/refactor-list-schema.json +11 -3
- package/bundled/dev-pipeline/templates/sections/context-budget-rules.md +3 -1
- package/bundled/dev-pipeline/templates/sections/critical-paths-agent.md +1 -0
- package/bundled/dev-pipeline/templates/sections/feature-context.md +2 -0
- package/bundled/dev-pipeline/templates/sections/phase-commit-full.md +29 -2
- package/bundled/dev-pipeline/templates/sections/phase-commit.md +22 -0
- package/bundled/dev-pipeline/templates/sections/phase-deploy-verification.md +2 -2
- package/bundled/dev-pipeline/templates/sections/phase-review-agent.md +8 -6
- package/bundled/dev-pipeline/templates/sections/phase-review-full.md +7 -5
- package/bundled/dev-pipeline/templates/sections/phase-specify-plan-full.md +3 -3
- package/bundled/skills/_metadata.json +5 -22
- package/bundled/skills/app-planner/SKILL.md +98 -72
- package/bundled/skills/app-planner/assets/app-design-guide.md +1 -1
- package/bundled/skills/app-planner/references/architecture-decisions.md +1 -1
- package/bundled/skills/app-planner/references/project-brief-guide.md +69 -66
- package/bundled/skills/bug-fix-workflow/SKILL.md +52 -9
- package/bundled/skills/bug-planner/SKILL.md +139 -197
- package/bundled/skills/bug-planner/assets/bug-confirmation-template.md +43 -0
- package/bundled/skills/bug-planner/references/critic-and-verification.md +44 -0
- package/bundled/skills/bug-planner/references/error-recovery.md +73 -0
- package/bundled/skills/bug-planner/references/input-formats.md +53 -0
- package/bundled/skills/bug-planner/references/schema-validation.md +25 -0
- package/bundled/skills/bug-planner/references/severity-rules.md +16 -0
- package/bundled/skills/bug-planner/scripts/validate-bug-list.py +4 -8
- package/bundled/skills/bugfix-pipeline-launcher/SKILL.md +34 -39
- package/bundled/skills/feature-pipeline-launcher/SKILL.md +49 -36
- package/bundled/skills/feature-pipeline-launcher/scripts/preflight-check.py +3 -3
- package/bundled/skills/feature-planner/SKILL.md +53 -142
- package/bundled/skills/feature-planner/assets/evaluation-guide.md +1 -1
- package/bundled/skills/feature-planner/assets/planning-guide.md +21 -5
- package/bundled/skills/feature-planner/references/browser-interaction.md +2 -4
- package/bundled/skills/feature-planner/references/completeness-review.md +57 -0
- package/bundled/skills/feature-planner/references/error-recovery.md +16 -35
- package/bundled/skills/feature-planner/references/incremental-feature-planning.md +1 -1
- package/bundled/skills/feature-planner/references/new-project-planning.md +2 -2
- package/bundled/skills/feature-planner/scripts/validate-and-generate.py +19 -20
- package/bundled/skills/feature-workflow/SKILL.md +24 -25
- package/bundled/skills/prizm-kit/SKILL.md +39 -49
- package/bundled/skills/prizmkit-code-review/SKILL.md +51 -64
- package/bundled/skills/prizmkit-code-review/rules/dimensions.md +85 -0
- package/bundled/skills/prizmkit-code-review/rules/fix-strategy.md +11 -11
- package/bundled/skills/prizmkit-committer/SKILL.md +3 -31
- package/bundled/skills/prizmkit-deploy/SKILL.md +34 -31
- package/bundled/skills/prizmkit-deploy/assets/deploy-template.md +1 -1
- package/bundled/skills/prizmkit-implement/SKILL.md +35 -68
- package/bundled/skills/prizmkit-init/SKILL.md +112 -65
- package/bundled/skills/prizmkit-init/assets/project-brief-template.md +82 -0
- package/bundled/skills/prizmkit-plan/SKILL.md +120 -79
- package/bundled/skills/prizmkit-plan/assets/plan-template.md +28 -18
- package/bundled/skills/prizmkit-plan/assets/spec-template.md +28 -11
- package/bundled/skills/prizmkit-plan/references/clarify-guide.md +3 -3
- package/bundled/skills/prizmkit-plan/references/verification-checklist.md +60 -0
- package/bundled/skills/prizmkit-prizm-docs/SKILL.md +10 -81
- package/bundled/skills/prizmkit-prizm-docs/assets/{PRIZM-SPEC.md → prizm-docs-format.md} +41 -526
- package/bundled/skills/prizmkit-prizm-docs/references/op-init.md +46 -0
- package/bundled/skills/prizmkit-prizm-docs/references/op-rebuild.md +16 -0
- package/bundled/skills/prizmkit-prizm-docs/references/op-status.md +14 -0
- package/bundled/skills/prizmkit-prizm-docs/references/op-update.md +19 -0
- package/bundled/skills/prizmkit-prizm-docs/references/op-validate.md +17 -0
- package/bundled/skills/prizmkit-retrospective/SKILL.md +27 -65
- package/bundled/skills/prizmkit-retrospective/references/knowledge-injection-steps.md +3 -4
- package/bundled/skills/prizmkit-retrospective/references/structural-sync-steps.md +7 -25
- package/bundled/skills/recovery-workflow/SKILL.md +22 -22
- package/bundled/skills/recovery-workflow/evals/evals.json +5 -5
- package/bundled/skills/recovery-workflow/scripts/detect-recovery-state.py +43 -10
- package/bundled/skills/refactor-pipeline-launcher/SKILL.md +48 -40
- package/bundled/skills/refactor-planner/SKILL.md +43 -61
- package/bundled/skills/refactor-planner/scripts/validate-and-generate-refactor.py +17 -17
- package/bundled/skills/refactor-workflow/SKILL.md +23 -24
- package/bundled/team/prizm-dev-team.json +1 -1
- package/bundled/{skills/prizm-kit/assets → templates}/project-memory-template.md +1 -1
- package/package.json +1 -1
- package/src/clean.js +3 -4
- package/src/gitignore-template.js +7 -9
- package/src/scaffold.js +14 -5
- package/bundled/dev-pipeline/templates/agent-prompts/reviewer-analyze.md +0 -5
- package/bundled/dev-pipeline/templates/sections/phase-analyze-agent.md +0 -19
- package/bundled/dev-pipeline/templates/sections/phase-analyze-full.md +0 -19
- package/bundled/skills/app-planner/references/project-conventions.md +0 -93
- package/bundled/skills/prizmkit-analyze/SKILL.md +0 -207
- package/bundled/skills/prizmkit-code-review/rules/dimensions-bugfix.md +0 -25
- package/bundled/skills/prizmkit-code-review/rules/dimensions-feature.md +0 -43
- package/bundled/skills/prizmkit-code-review/rules/dimensions-refactor.md +0 -25
- package/bundled/skills/prizmkit-implement/references/deploy-guide-protocol.md +0 -69
- package/bundled/skills/prizmkit-verify/SKILL.md +0 -281
- package/bundled/skills/prizmkit-verify/scripts/verify-light.py +0 -402
|
@@ -1,43 +0,0 @@
|
|
|
1
|
-
# Review Dimensions — Feature Mode
|
|
2
|
-
|
|
3
|
-
Review code against `spec.md` and `plan.md` across 6 dimensions:
|
|
4
|
-
|
|
5
|
-
## 1. Spec Compliance
|
|
6
|
-
Does code implement all acceptance criteria? Missing criteria are the #1 source of "it works but it's wrong" bugs.
|
|
7
|
-
- Check every acceptance criterion in spec.md has a corresponding implementation
|
|
8
|
-
- Verify edge cases mentioned in spec are handled
|
|
9
|
-
- Confirm scope boundaries are respected (no over-implementation, no under-implementation)
|
|
10
|
-
|
|
11
|
-
## 2. Plan Adherence
|
|
12
|
-
Does implementation follow architectural decisions in plan.md? Deviations may be improvements or may break assumptions other components depend on.
|
|
13
|
-
- Check component structure matches plan's architecture approach
|
|
14
|
-
- Verify data model matches plan's schema design
|
|
15
|
-
- Confirm API contracts (endpoints, request/response) match plan
|
|
16
|
-
|
|
17
|
-
## 3. Code Quality
|
|
18
|
-
Naming, structure, complexity, DRY. Focus on maintainability — will someone understand this code in 6 months?
|
|
19
|
-
- Function/variable names are descriptive and consistent with project conventions
|
|
20
|
-
- No unnecessary complexity (cyclomatic complexity, deep nesting)
|
|
21
|
-
- No copy-paste duplication that should be abstracted
|
|
22
|
-
- Error messages are informative for debugging
|
|
23
|
-
|
|
24
|
-
## 4. Security
|
|
25
|
-
Injection (SQL, XSS, command), auth/authz gaps, sensitive data exposure, insecure defaults. Security issues are always HIGH+ because they're the hardest to catch later.
|
|
26
|
-
- User input is validated and sanitized before use in queries, HTML, or commands
|
|
27
|
-
- Authentication and authorization checks are present on protected routes
|
|
28
|
-
- Sensitive data (passwords, tokens, PII) is not logged or exposed in responses
|
|
29
|
-
- Cryptographic operations use established libraries, not custom implementations
|
|
30
|
-
|
|
31
|
-
## 5. Consistency
|
|
32
|
-
Follows project patterns from `.prizm-docs/` PATTERNS section. Inconsistent patterns increase cognitive load for every future reader.
|
|
33
|
-
- Code style matches existing codebase conventions
|
|
34
|
-
- Error handling follows established patterns
|
|
35
|
-
- File organization follows project structure conventions
|
|
36
|
-
- Naming conventions align with `.prizm-docs/` RULES
|
|
37
|
-
|
|
38
|
-
## 6. Test Coverage
|
|
39
|
-
Are critical paths tested? Focus on paths that handle user input, money, or state transitions.
|
|
40
|
-
- Happy path tests exist for each user story
|
|
41
|
-
- Error/edge case tests for critical paths
|
|
42
|
-
- Tests are deterministic (no flaky timing dependencies)
|
|
43
|
-
- Test names clearly describe what they verify
|
|
@@ -1,25 +0,0 @@
|
|
|
1
|
-
# Review Dimensions — Refactor Mode
|
|
2
|
-
|
|
3
|
-
Review code against `refactor-analysis.md` goals. Standard dimensions (code quality, security, consistency, test coverage) still apply — load `${SKILL_DIR}/rules/dimensions-feature.md` §3–§6 for those.
|
|
4
|
-
|
|
5
|
-
## Refactor-Specific Dimensions
|
|
6
|
-
|
|
7
|
-
### Behavior Preservation (replaces Spec Compliance)
|
|
8
|
-
Observable behavior must remain unchanged. This is the #1 refactor risk — "improving" code that subtly changes behavior.
|
|
9
|
-
- All existing tests still pass without modification (test changes during refactor are a red flag)
|
|
10
|
-
- Public API signatures are unchanged (parameter types, return types, error types)
|
|
11
|
-
- Side effects (logging, metrics, events) are preserved unless explicitly scoped for removal
|
|
12
|
-
- Edge case handling is preserved — refactors often silently drop edge case branches
|
|
13
|
-
|
|
14
|
-
### Structural Improvement (replaces Plan Adherence)
|
|
15
|
-
Is the code measurably better against the refactor goals?
|
|
16
|
-
- Complexity metrics improved (fewer nested conditions, shorter functions)
|
|
17
|
-
- Coupling reduced (fewer cross-module imports, clearer boundaries)
|
|
18
|
-
- Duplication reduced (DRY violations eliminated per refactor-analysis.md scope)
|
|
19
|
-
- Readability improved (naming, organization, documentation)
|
|
20
|
-
|
|
21
|
-
### Test Integrity
|
|
22
|
-
Refactors must not weaken the test suite.
|
|
23
|
-
- No tests were deleted or skipped
|
|
24
|
-
- Test coverage percentage is equal or higher
|
|
25
|
-
- If tests were rewritten, they cover the same behavioral scenarios
|
|
@@ -1,69 +0,0 @@
|
|
|
1
|
-
# Deploy Guide Update Protocol
|
|
2
|
-
|
|
3
|
-
When dependency manifests change during implementation, update `DEPLOY.md` at the project root.
|
|
4
|
-
|
|
5
|
-
## Detection
|
|
6
|
-
|
|
7
|
-
1. Check if any dependency manifests were modified in this session:
|
|
8
|
-
```bash
|
|
9
|
-
git diff --name-only HEAD -- package.json requirements*.txt Pipfile pyproject.toml go.mod Cargo.toml pom.xml build.gradle Gemfile composer.json docker-compose*.yml Dockerfile .tool-versions 2>/dev/null
|
|
10
|
-
```
|
|
11
|
-
2. If no manifest files changed → skip this step entirely
|
|
12
|
-
3. If manifest files changed, scan for **newly added** dependencies (not version bumps):
|
|
13
|
-
```bash
|
|
14
|
-
git diff -U0 HEAD -- package.json requirements*.txt Pipfile pyproject.toml go.mod Cargo.toml pom.xml build.gradle Gemfile composer.json docker-compose*.yml Dockerfile .tool-versions 2>/dev/null | grep '^\+' | grep -v '^\+\+\+' | head -30
|
|
15
|
-
```
|
|
16
|
-
|
|
17
|
-
## Recording
|
|
18
|
-
|
|
19
|
-
For each genuinely new framework/tool, record in `DEPLOY.md` at project root:
|
|
20
|
-
|
|
21
|
-
| Field | Description | Source |
|
|
22
|
-
|-------|-------------|--------|
|
|
23
|
-
| **Name** | Framework/tool name | Package name from manifest |
|
|
24
|
-
| **Version** | Installed version or constraint | Version spec from manifest |
|
|
25
|
-
| **Purpose** | Why it was introduced | You just added it — you know why |
|
|
26
|
-
| **Install Command** | How to install locally | Standard install command for the ecosystem |
|
|
27
|
-
| **Key Config** | Config files or env vars needed | Config files you just created/modified |
|
|
28
|
-
| **Notes** | Setup gotchas, required services | Docker services, manual steps, env vars |
|
|
29
|
-
|
|
30
|
-
## Template for `DEPLOY.md`
|
|
31
|
-
|
|
32
|
-
```markdown
|
|
33
|
-
# Deploy Guide
|
|
34
|
-
|
|
35
|
-
> Auto-maintained by PrizmKit. Manual edits are preserved.
|
|
36
|
-
> Last updated: YYYY-MM-DD
|
|
37
|
-
|
|
38
|
-
## Frameworks & Tools
|
|
39
|
-
|
|
40
|
-
### <Framework Name>
|
|
41
|
-
|
|
42
|
-
- **Version**: <version constraint>
|
|
43
|
-
- **Purpose**: <why this framework is used>
|
|
44
|
-
- **Install**:
|
|
45
|
-
```bash
|
|
46
|
-
<install command>
|
|
47
|
-
```
|
|
48
|
-
- **Key Config**:
|
|
49
|
-
- `<config file or env var>`: <description>
|
|
50
|
-
- **Notes**:
|
|
51
|
-
- <any setup gotchas, required external services, manual steps>
|
|
52
|
-
```
|
|
53
|
-
|
|
54
|
-
## Update Rules
|
|
55
|
-
|
|
56
|
-
- Create file if absent; append new sections if file exists
|
|
57
|
-
- Update version if framework already documented
|
|
58
|
-
- Preserve manually added content
|
|
59
|
-
- Keep entries sorted alphabetically
|
|
60
|
-
|
|
61
|
-
## Filter Out
|
|
62
|
-
|
|
63
|
-
- Patch version bumps of existing deps
|
|
64
|
-
- Dev-only tools needing no setup (linters, formatters)
|
|
65
|
-
- Transitive/lock-file-only changes
|
|
66
|
-
|
|
67
|
-
## Final Step
|
|
68
|
-
|
|
69
|
-
Stage the file: `git add DEPLOY.md`
|
|
@@ -1,281 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: "prizmkit-verify"
|
|
3
|
-
description: "Automated framework verification for PrizmKit's 4-layer architecture (Skill → Pipeline → Workflow → Script). Use this skill to validate structural integrity, contract consistency between layers, cross-scenario alignment, and adapter/installer correctness. Trigger whenever: verifying framework health, checking skill contracts, auditing layer integration, running pre-release checks, or after significant framework modifications."
|
|
4
|
-
---
|
|
5
|
-
|
|
6
|
-
# PrizmKit Framework Verification
|
|
7
|
-
|
|
8
|
-
Verify PrizmKit's four-layer architecture for structural integrity, contract consistency, and cross-scenario alignment.
|
|
9
|
-
|
|
10
|
-
## Execution Modes
|
|
11
|
-
|
|
12
|
-
Parse the user's arguments to determine scope and depth:
|
|
13
|
-
|
|
14
|
-
| Command | What Runs | When to Use |
|
|
15
|
-
|---------|-----------|-------------|
|
|
16
|
-
| `/prizmkit-verify light` | Light checks only (R1-R4) | Quick CI gate, seconds |
|
|
17
|
-
| `/prizmkit-verify medium` | Light + Medium (R1-R4) | Regular development, recommended default |
|
|
18
|
-
| `/prizmkit-verify full` | All rounds (R1-R6) including E2E | Pre-release, after major changes |
|
|
19
|
-
| `/prizmkit-verify --round R1` | Specific round only | Targeted check after editing a layer |
|
|
20
|
-
| `/prizmkit-verify` (no args) | Same as `medium` | Default |
|
|
21
|
-
|
|
22
|
-
## Architecture Context
|
|
23
|
-
|
|
24
|
-
PrizmKit has four layers, bottom to top:
|
|
25
|
-
|
|
26
|
-
```
|
|
27
|
-
L1 Skill layer → core/skills/prizmkit-skill/ (10 skills, dev lifecycle)
|
|
28
|
-
L2 Pipeline layer → core/skills/orchestration-skill/pipelines/ (7 skills, planners + launchers)
|
|
29
|
-
L3 Workflow layer → core/skills/orchestration-skill/workflows/ (4 skills, scenario entry points)
|
|
30
|
-
L4 Script layer → dev-pipeline/ (Bash + Python runtime engine)
|
|
31
|
-
```
|
|
32
|
-
|
|
33
|
-
Three verification depths, cumulative:
|
|
34
|
-
- **Light**: Structural/static checks, scriptable, zero tokens
|
|
35
|
-
- **Medium**: AI reads SKILL.md content, verifies inter-layer contracts
|
|
36
|
-
- **Deep**: End-to-end scenario traces through all 4 layers
|
|
37
|
-
|
|
38
|
-
## Execution
|
|
39
|
-
|
|
40
|
-
### Phase 0: Setup
|
|
41
|
-
|
|
42
|
-
1. Create output directory: `.prizmkit/verify/` (timestamp-based run dir)
|
|
43
|
-
2. Determine which rounds to run based on user args
|
|
44
|
-
3. Initialize the report structure
|
|
45
|
-
|
|
46
|
-
### Phase 1: Horizontal Verification (R1-R4)
|
|
47
|
-
|
|
48
|
-
Run rounds sequentially. For each round, run Light first, then Medium if in scope.
|
|
49
|
-
|
|
50
|
-
---
|
|
51
|
-
|
|
52
|
-
#### R1: L1 Skill Layer
|
|
53
|
-
|
|
54
|
-
**Target**: All skills in `core/skills/prizmkit-skill/` (expect ~10)
|
|
55
|
-
|
|
56
|
-
**Light checks** — run `${SKILL_DIR}/scripts/verify-light.py --round R1` if available, otherwise check manually:
|
|
57
|
-
|
|
58
|
-
| Check | How | Pass Criteria |
|
|
59
|
-
|-------|-----|---------------|
|
|
60
|
-
| Frontmatter validity | Read each SKILL.md YAML header | `name` + `description` present; `name` is lowercase-hyphens; `name` matches directory name |
|
|
61
|
-
| Asset reference integrity | Grep `${SKILL_DIR}` in each SKILL.md, resolve path relative to skill dir | Every referenced file exists on disk |
|
|
62
|
-
| Cross-skill references | Grep `/prizmkit-*` references in each SKILL.md | Every referenced skill exists under `core/skills/` |
|
|
63
|
-
| Orphaned assets | List files in each skill's `assets/`, `scripts/`, `references/` dirs | Every file is referenced by SKILL.md (warn if not) |
|
|
64
|
-
| Lint: ESLint | Run `npm run lint` | 0 errors (warnings OK) |
|
|
65
|
-
| Lint: Ruff | Run `npm run lint:py` | 0 errors |
|
|
66
|
-
|
|
67
|
-
**Medium checks** — read each SKILL.md FULLY and verify artifact chain contracts:
|
|
68
|
-
|
|
69
|
-
For each adjacent skill pair in the chain:
|
|
70
|
-
`prizmkit-plan → prizmkit-analyze → prizmkit-implement → prizmkit-code-review → prizmkit-retrospective → prizmkit-committer → prizmkit-deploy`
|
|
71
|
-
|
|
72
|
-
| Verification Point | What to Check |
|
|
73
|
-
|-------------------|---------------|
|
|
74
|
-
| Output format defined? | Does the producer skill explicitly describe its output structure (file name, format, key sections)? |
|
|
75
|
-
| Input expectation defined? | Does the consumer skill explicitly state what it reads and in what format? |
|
|
76
|
-
| Format match | Do output and input specifications agree on structure, field names, file paths? |
|
|
77
|
-
| Path convention | Do both skills reference the same directory convention (e.g., `.prizmkit/specs/###-feature-name/`)? |
|
|
78
|
-
| Error handling | What happens if upstream artifact is missing or malformed? Is fallback defined? |
|
|
79
|
-
| Quality gate | Does each consumer check preconditions before proceeding? |
|
|
80
|
-
|
|
81
|
-
Record each pair as: `PASS` / `WARN (gap described)` / `FAIL (mismatch described)`
|
|
82
|
-
|
|
83
|
-
---
|
|
84
|
-
|
|
85
|
-
#### R2: L2 Pipeline Layer
|
|
86
|
-
|
|
87
|
-
**Target**: All skills in `core/skills/orchestration-skill/pipelines/` (expect 7)
|
|
88
|
-
|
|
89
|
-
**Light checks**:
|
|
90
|
-
|
|
91
|
-
| Check | How | Pass Criteria |
|
|
92
|
-
|-------|-----|---------------|
|
|
93
|
-
| Frontmatter validity | Same as R1 | `name` + `description`; name matches dir |
|
|
94
|
-
| Asset reference integrity | Same as R1 | All `${SKILL_DIR}` refs resolve |
|
|
95
|
-
| Planner→Launcher coupling | For each scenario (feature/bugfix/refactor): check planner mentions correct JSON filename, launcher references correct script | All 3 pairs coupled correctly |
|
|
96
|
-
| Script path references | Launchers reference `dev-pipeline/run-*.sh` | All referenced scripts exist and are executable |
|
|
97
|
-
| Schema consistency | Compare planner SKILL.md output schema with `dev-pipeline/templates/*-list-schema.json` | Field names, types, required fields match |
|
|
98
|
-
| Schema version match | Compare `dev-pipeline/templates/*-list-schema.json` const value with `dev-pipeline/scripts/init-*-pipeline.py` EXPECTED_SCHEMA | Versions identical |
|
|
99
|
-
|
|
100
|
-
**Medium checks** — read each SKILL.md FULLY:
|
|
101
|
-
|
|
102
|
-
For each scenario (feature, bugfix, refactor):
|
|
103
|
-
1. What JSON format does the planner produce? What fields, what validation?
|
|
104
|
-
2. What JSON format does the launcher expect? What fields does it read?
|
|
105
|
-
3. Do they match? Any fields the launcher expects but planner doesn't produce?
|
|
106
|
-
4. What command does the launcher assemble? What env vars? Do they match `run-*.sh` expectations?
|
|
107
|
-
|
|
108
|
-
---
|
|
109
|
-
|
|
110
|
-
#### R3: L3 Workflow Layer
|
|
111
|
-
|
|
112
|
-
**Target**: All skills in `core/skills/orchestration-skill/workflows/` (expect 4)
|
|
113
|
-
|
|
114
|
-
**Light checks**:
|
|
115
|
-
|
|
116
|
-
| Check | How | Pass Criteria |
|
|
117
|
-
|-------|-----|---------------|
|
|
118
|
-
| Frontmatter validity | Same as R1 | `name` + `description`; name matches dir |
|
|
119
|
-
| Pipeline skill references | Grep each workflow for planner/launcher skill names | All referenced pipeline skills exist |
|
|
120
|
-
| Checkpoint naming | Grep for `CP-` patterns | Follows `CP-{XW}-{N}` convention |
|
|
121
|
-
| Flow completeness markers | Each workflow has: When to Use, Execution phases, Error handling table, Relationship table | All sections present |
|
|
122
|
-
|
|
123
|
-
**Medium checks** — read each SKILL.md FULLY:
|
|
124
|
-
|
|
125
|
-
| Verification Point | What to Check |
|
|
126
|
-
|-------------------|---------------|
|
|
127
|
-
| Workflow → Pipeline handoff | What does workflow pass to each pipeline skill? Is the input format documented? |
|
|
128
|
-
| Flow completeness | feature-workflow: requirements→planning→launch→monitor? bug-fix-workflow: diagnosis→triage→fix→review→commit? refactor-workflow: goals→planning→launch→monitor? recovery-workflow: detect→diagnose→recover? |
|
|
129
|
-
| Fast path integration | If fast path exists, is there a decision tree for when to use it? |
|
|
130
|
-
| Cross-workflow consistency | Same checkpoint naming, phase structure, error handling patterns? |
|
|
131
|
-
|
|
132
|
-
---
|
|
133
|
-
|
|
134
|
-
#### R4: L4 Script Layer
|
|
135
|
-
|
|
136
|
-
**Target**: `dev-pipeline/` directory
|
|
137
|
-
|
|
138
|
-
**Light checks**:
|
|
139
|
-
|
|
140
|
-
| Check | How | Pass Criteria |
|
|
141
|
-
|-------|-----|---------------|
|
|
142
|
-
| Shell scripts executable | `ls -la dev-pipeline/*.sh` | All `.sh` files have `+x` permission |
|
|
143
|
-
| Python scripts compile | `python3 -m py_compile` each `.py` file | All compile without error |
|
|
144
|
-
| Bash lib sourcing | Grep `run-*.sh` for `source.*lib/` | Each sources common.sh, heartbeat.sh, branch.sh |
|
|
145
|
-
| JSON templates valid | `python3 -c "import json; json.load(open(f))"` for each `.json` in `templates/` | All parse without error |
|
|
146
|
-
| Ruff Python lint | `ruff check dev-pipeline/scripts/` | 0 errors |
|
|
147
|
-
| Test file existence | List `dev-pipeline/tests/` | At least test files exist for prompt generation |
|
|
148
|
-
|
|
149
|
-
**Medium checks** — read key scripts:
|
|
150
|
-
|
|
151
|
-
| Verification Point | What to Check |
|
|
152
|
-
|-------------------|---------------|
|
|
153
|
-
| Shell → Python integration | For each `run-*.sh`: list all Python scripts invoked, verify each exists, verify argument names match |
|
|
154
|
-
| Bootstrap prompt generation | Read `generate-bootstrap-prompt.py` (or per-scenario variant): does it reference correct skill names? Are all `{{PLACEHOLDER}}` variables populated? |
|
|
155
|
-
| State management | Read `init-*-pipeline.py`: correct schema validation? Proper state dir initialization? |
|
|
156
|
-
| Cross-scenario consistency | Compare `run-feature.sh` vs `run-bugfix.sh` vs `run-refactor.sh`: same structure? Same error handling patterns? Note intentional divergences vs unintentional gaps |
|
|
157
|
-
|
|
158
|
-
---
|
|
159
|
-
|
|
160
|
-
### Phase 2: Vertical E2E Verification (R5-R6)
|
|
161
|
-
|
|
162
|
-
Only runs in `full` mode. For each scenario, trace data flow through all 4 layers.
|
|
163
|
-
|
|
164
|
-
#### R5: Feature Scenario
|
|
165
|
-
|
|
166
|
-
Trace "Add user authentication with JWT" through:
|
|
167
|
-
|
|
168
|
-
1. **L3→L2**: feature-workflow → feature-planner → feature-pipeline-launcher
|
|
169
|
-
- What format does workflow pass to planner?
|
|
170
|
-
- What JSON does planner produce?
|
|
171
|
-
- What command does launcher assemble?
|
|
172
|
-
|
|
173
|
-
2. **L2→L4**: feature-pipeline-launcher → run-feature.sh
|
|
174
|
-
- What args/env vars are passed?
|
|
175
|
-
- How does init-pipeline.py consume feature-list.json?
|
|
176
|
-
- What bootstrap prompt does generate-bootstrap-prompt.py produce?
|
|
177
|
-
|
|
178
|
-
3. **L4→L1**: run-feature.sh → AI CLI → Skills
|
|
179
|
-
- What skills does the bootstrap prompt reference?
|
|
180
|
-
- In what order?
|
|
181
|
-
- Are skill names correct?
|
|
182
|
-
|
|
183
|
-
At each transition, verify: **format match**, **field completeness**, **path consistency**, **error propagation**.
|
|
184
|
-
|
|
185
|
-
#### R6: Bugfix + Refactor Scenarios
|
|
186
|
-
|
|
187
|
-
Same trace for:
|
|
188
|
-
- Bugfix: "Fix login crash when email is empty"
|
|
189
|
-
- Refactor: "Extract authentication logic into shared module"
|
|
190
|
-
|
|
191
|
-
Then cross-scenario comparison:
|
|
192
|
-
- Layer transition patterns structurally similar?
|
|
193
|
-
- Schema conventions consistent (F-NNN / B-NNN / R-NNN)?
|
|
194
|
-
- Bootstrap prompt patterns consistent?
|
|
195
|
-
- State management patterns consistent?
|
|
196
|
-
- Intentional divergences documented?
|
|
197
|
-
|
|
198
|
-
---
|
|
199
|
-
|
|
200
|
-
### Phase 3: Supplementary Checks
|
|
201
|
-
|
|
202
|
-
These address gaps identified in the first verification run. Include in `medium` and `full` modes.
|
|
203
|
-
|
|
204
|
-
#### S1: Adapter Layer Verification
|
|
205
|
-
|
|
206
|
-
| Check | How |
|
|
207
|
-
|-------|-----|
|
|
208
|
-
| Claude adapter output | Run `node adapters/claude/command-adapter.js` on a sample skill, verify output format |
|
|
209
|
-
| CodeBuddy adapter output | Run `node adapters/codebuddy/skill-adapter.js` on same skill, verify output |
|
|
210
|
-
| Cross-platform equivalence | Diff the two outputs for semantic equivalence |
|
|
211
|
-
| ${SKILL_DIR} resolution | Verify Claude resolves to `.claude/command-assets/`, CodeBuddy to `.codebuddy/skills/` |
|
|
212
|
-
|
|
213
|
-
#### S2: Installer & Distribution Verification
|
|
214
|
-
|
|
215
|
-
| Check | How |
|
|
216
|
-
|-------|-----|
|
|
217
|
-
| _metadata.json completeness | All skills listed in `core/skills/_metadata.json` suites? |
|
|
218
|
-
| Bundle sync | `npm run bundle && npm run bundle:verify` passes |
|
|
219
|
-
| Skill count | Bundled skill count matches source skill count |
|
|
220
|
-
|
|
221
|
-
#### S3: Rules Consistency
|
|
222
|
-
|
|
223
|
-
| Check | How |
|
|
224
|
-
|-------|-----|
|
|
225
|
-
| Rules exist | All files in `core/rules/` referenced by `_rules-metadata.json` |
|
|
226
|
-
| Rule-skill alignment | Rules reference skill names that exist |
|
|
227
|
-
|
|
228
|
-
---
|
|
229
|
-
|
|
230
|
-
## Report Format
|
|
231
|
-
|
|
232
|
-
Generate the verification report at `.prizmkit/verify/verify-report-{timestamp}.md`:
|
|
233
|
-
|
|
234
|
-
```markdown
|
|
235
|
-
# PrizmKit Verification Report
|
|
236
|
-
**Date**: {date} **Mode**: {light|medium|full}
|
|
237
|
-
|
|
238
|
-
## Summary
|
|
239
|
-
| Round | Layer | Light | Medium | Deep | Issues |
|
|
240
|
-
|-------|-------|-------|--------|------|--------|
|
|
241
|
-
| R1 | L1 Skill | PASS/WARN/FAIL | PASS/WARN/FAIL | — | count |
|
|
242
|
-
| R2 | L2 Pipeline | ... | ... | — | count |
|
|
243
|
-
| R3 | L3 Workflow | ... | ... | — | count |
|
|
244
|
-
| R4 | L4 Script | ... | ... | — | count |
|
|
245
|
-
| R5 | E2E Feature | — | — | PASS/WARN/FAIL | count |
|
|
246
|
-
| R6 | E2E Bug+Refactor | — | — | PASS/WARN/FAIL | count |
|
|
247
|
-
| S1 | Adapters | ... | ... | — | count |
|
|
248
|
-
| S2 | Distribution | ... | ... | — | count |
|
|
249
|
-
| S3 | Rules | ... | ... | — | count |
|
|
250
|
-
|
|
251
|
-
## Issues by Severity
|
|
252
|
-
### Critical
|
|
253
|
-
- [C1] Description... (Round, Layer, Location)
|
|
254
|
-
|
|
255
|
-
### High
|
|
256
|
-
- [H1] Description...
|
|
257
|
-
|
|
258
|
-
### Medium
|
|
259
|
-
- [M1] Description...
|
|
260
|
-
|
|
261
|
-
## Cross-Scenario Comparison (full mode only)
|
|
262
|
-
| Dimension | Feature | Bugfix | Refactor | Consistent? |
|
|
263
|
-
|-----------|---------|--------|----------|-------------|
|
|
264
|
-
| ... | ... | ... | ... | ✅/⚠️ |
|
|
265
|
-
|
|
266
|
-
## Recommendations
|
|
267
|
-
1. **P0**: ...
|
|
268
|
-
2. **P1**: ...
|
|
269
|
-
```
|
|
270
|
-
|
|
271
|
-
## Parallelization Strategy
|
|
272
|
-
|
|
273
|
-
For efficiency, run independent checks concurrently:
|
|
274
|
-
- R1 Light + R2 Light + R3 Light + R4 Light can run in parallel
|
|
275
|
-
- R1 Medium depends on R1 Light passing
|
|
276
|
-
- R5/R6 depend on R1-R4 completing (need full picture of each layer)
|
|
277
|
-
- S1/S2/S3 can run in parallel with R1-R4
|
|
278
|
-
|
|
279
|
-
## Previous Verification Findings
|
|
280
|
-
|
|
281
|
-
Reference `docs/framework-verification-report.md` for the baseline findings from the initial verification run. This helps track whether previously identified issues have been resolved.
|