onto-mcp 0.3.1 → 0.3.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.onto/authority/core-lexicon.yaml +1 -0
- package/.onto/domains/software-engineering/competency_qs.md +192 -63
- package/.onto/domains/software-engineering/concepts.md +67 -5
- package/.onto/domains/software-engineering/conciseness_rules.md +22 -2
- package/.onto/domains/software-engineering/dependency_rules.md +78 -8
- package/.onto/domains/software-engineering/domain_scope.md +181 -150
- package/.onto/domains/software-engineering/extension_cases.md +318 -542
- package/.onto/domains/software-engineering/logic_rules.md +75 -3
- package/.onto/domains/software-engineering/problem_framing_profile.md +29 -2
- package/.onto/domains/software-engineering/prompt_interface.md +122 -0
- package/.onto/domains/software-engineering/structure_spec.md +53 -4
- package/.onto/principles/llm-native-development-guideline.md +20 -0
- package/.onto/principles/productization-charter.md +6 -0
- package/.onto/processes/reconstruct/reconstruct-boundary-contract.md +278 -91
- package/.onto/processes/reconstruct/reconstruct-execution-ux-contract.md +45 -12
- package/.onto/processes/reconstruct/source-profile-contract.md +39 -6
- package/.onto/processes/reconstruct/top-level-concept-discovery-contract.md +387 -0
- package/.onto/processes/review/lens-registry.md +16 -0
- package/.onto/processes/shared/target-material-kind-contract.md +18 -2
- package/.onto/roles/axiology.md +7 -2
- package/AGENTS.md +3 -2
- package/README.md +39 -33
- package/dist/core-api/reconstruct-api.js +22 -5
- package/dist/core-api/review-api.js +1288 -533
- package/dist/core-runtime/cli/mock-review-unit-executor.js +17 -0
- package/dist/core-runtime/cli/review-invoke.js +23 -48
- package/dist/core-runtime/cli/run-review-prompt-execution.js +122 -0
- package/dist/core-runtime/path-boundary.js +58 -0
- package/dist/core-runtime/reconstruct/artifact-types.js +5 -0
- package/dist/core-runtime/reconstruct/materialize-preparation.js +54 -4
- package/dist/core-runtime/reconstruct/pipeline-execution-ledger.js +38 -2
- package/dist/core-runtime/reconstruct/post-seed-validation.js +13 -0
- package/dist/core-runtime/reconstruct/record.js +11 -0
- package/dist/core-runtime/reconstruct/run.js +1133 -26
- package/dist/core-runtime/reconstruct/seed-candidate-validation.js +29 -0
- package/dist/core-runtime/review/execution-plan-boundary.js +123 -0
- package/dist/core-runtime/review/materializers.js +8 -3
- package/dist/core-runtime/review/review-artifact-utils.js +15 -2
- package/dist/core-runtime/review/review-invocation-runner.js +604 -0
- package/dist/core-runtime/target-material-kind.js +43 -5
- package/dist/mcp/server.js +158 -39
- package/dist/mcp/tool-schemas.js +22 -2
- package/package.json +3 -1
- package/.onto/domains/llm-native-development/competency_qs.md +0 -430
- package/.onto/domains/llm-native-development/concepts.md +0 -242
- package/.onto/domains/llm-native-development/conciseness_rules.md +0 -163
- package/.onto/domains/llm-native-development/dependency_rules.md +0 -216
- package/.onto/domains/llm-native-development/domain_scope.md +0 -197
- package/.onto/domains/llm-native-development/extension_cases.md +0 -474
- package/.onto/domains/llm-native-development/logic_rules.md +0 -123
- package/.onto/domains/llm-native-development/prompt_interface.md +0 -49
- package/.onto/domains/llm-native-development/structure_spec.md +0 -245
|
@@ -1108,6 +1108,7 @@ terms:
|
|
|
1108
1108
|
- "Orthogonal to medium: medium is a cross-product reference and learning frame such as excel or cli; target_material_kind is a per-run target classification."
|
|
1109
1109
|
- "Orthogonal to target_input_kind: target_input_kind says how the target entered the runtime, such as single_file or git_diff; target_material_kind says what kind of material is being handled."
|
|
1110
1110
|
- "Do not use review context `source_kind` for this axis. In review artifacts, source_kind identifies context-source artifacts such as materialized_input or review_target_profile."
|
|
1111
|
+
- "`mixed` is a classification value for bundles with multiple member material kinds, not a generic adapter target. A process exposing `mixed` must define supported_composite, partial_composite, unsupported, or reserved_future behavior in its process contract before dispatch."
|
|
1111
1112
|
- term_id: "pipeline_execution_ledger"
|
|
1112
1113
|
canonical_label: "PipelineExecutionLedger"
|
|
1113
1114
|
korean_label: "파이프라인 실행 단위 원장"
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
---
|
|
2
|
-
version:
|
|
3
|
-
last_updated: "2026-
|
|
2
|
+
version: 8
|
|
3
|
+
last_updated: "2026-05-28"
|
|
4
4
|
source: bundled-domain-baseline
|
|
5
5
|
status: established
|
|
6
6
|
---
|
|
@@ -19,7 +19,9 @@ Priority levels:
|
|
|
19
19
|
- **P2** — Should be answerable for production systems. Failure indicates a quality gap.
|
|
20
20
|
- **P3** — Recommended for mature systems. Failure indicates a refinement opportunity.
|
|
21
21
|
|
|
22
|
-
Prefix allocation protocol: New CQ sections use 1–2 character alphabetic prefix codes with mandatory `-` separator (e.g., CQ-XX-01). Prefixes must not be string prefixes of existing prefixes. Current allocations: S, D, I, E, T, V, O, A, SE, P, C, DE, B, R, MT, M.
|
|
22
|
+
Prefix allocation protocol: New CQ sections use 1–2 character alphabetic prefix codes with mandatory `-` separator (e.g., CQ-XX-01). Prefixes must not be string prefixes of existing prefixes. Current allocations: S, D, I, E, T, V, O, A, SE, P, C, DE, B, R, MT, M, G.
|
|
23
|
+
|
|
24
|
+
Applicability verdict protocol: Every CQ has an applicability result before PASS/FAIL. Use `N/A` only when the target lacks the pattern, scale, risk, consumer, or behavior named by the CQ and that absence is supported by architecture, dependency, product, or scope evidence. A missing artifact is `FAIL`, not `N/A`, when the target actually uses the pattern or claims the obligation.
|
|
23
25
|
|
|
24
26
|
---
|
|
25
27
|
|
|
@@ -32,8 +34,8 @@ Verifies that the system's module structure, boundaries, and public interfaces a
|
|
|
32
34
|
- Verification criteria: PASS if a complete list of modules can be produced, each with a declared role. FAIL if modules exist that cannot be classified into a structural role
|
|
33
35
|
|
|
34
36
|
- **CQ-S-02** [P1] Can the modules that a specific module depends on be derived?
|
|
35
|
-
- Inference path: dependency_rules.md 'Acyclic Dependencies' → module dependency graph must be a DAG → dependencies must be traceable;
|
|
36
|
-
- Verification criteria: PASS if for any module, direct and transitive dependencies can be enumerated from import statements or
|
|
37
|
+
- Inference path: dependency_rules.md 'Acyclic Dependencies' → module dependency graph must be a DAG → dependencies must be traceable; dependency_rules.md 'Direction Rules' → dependency kind and declared architecture determine direction rules
|
|
38
|
+
- Verification criteria: PASS if for any module, direct and transitive dependencies can be enumerated from import statements, build configuration, or declared registration artifacts. FAIL if dependencies are implicit (global state, service locator without registration) or cannot be classified by dependency kind
|
|
37
39
|
|
|
38
40
|
- **CQ-S-03** [P1] Can the list of public APIs (externally exposed interfaces) be extracted?
|
|
39
41
|
- Inference path: structure_spec.md 'Golden Relationships' → Module-Interface coherence → every module must expose its contract via a public interface; structure_spec.md 'Required Relationships' → every public function/class must have at least one caller or test
|
|
@@ -44,7 +46,7 @@ Verifies that the system's module structure, boundaries, and public interfaces a
|
|
|
44
46
|
- Verification criteria: PASS if consumers only import from the module's public API surface. FAIL if consumers directly import internal files or private functions
|
|
45
47
|
|
|
46
48
|
- **CQ-S-05** [P2] Is the architectural pattern explicitly declared and consistently applied?
|
|
47
|
-
- Inference path: structure_spec.md 'Architectural Patterns' → each pattern defines specific dependency rules; domain_scope.md '
|
|
49
|
+
- Inference path: structure_spec.md 'Architectural Patterns' → each pattern defines specific dependency rules; domain_scope.md 'Major Sub-areas' → Architecture and structure
|
|
48
50
|
- Verification criteria: PASS if the codebase declares its architectural pattern and conforms to that pattern's rules. FAIL if no pattern is declared or the codebase violates its declared pattern
|
|
49
51
|
|
|
50
52
|
- **CQ-S-06** [P2] Is the package/module organization axis consistently applied at each directory level?
|
|
@@ -55,9 +57,9 @@ Verifies that the system's module structure, boundaries, and public interfaces a
|
|
|
55
57
|
- Inference path: structure_spec.md 'Quantitative Thresholds' → module size >500 lines, function size >50 lines, cyclomatic complexity >10, dependency fan-out >7, inheritance depth >5
|
|
56
58
|
- Verification criteria: PASS if structural metrics are measured and threshold violations are addressed or justified. FAIL if no metrics are measured or violations exist without justification
|
|
57
59
|
|
|
58
|
-
- **CQ-S-08** [P2] Are
|
|
59
|
-
- Inference path: structure_spec.md 'Isolated Node Prohibition' → public function with no callers
|
|
60
|
-
- Verification criteria: PASS if
|
|
60
|
+
- **CQ-S-08** [P2] Are isolated nodes or intentionally retained unreachable modules identifiable?
|
|
61
|
+
- Inference path: structure_spec.md 'Isolated Node Prohibition' → public function/class with no callers and modules with no imports are dead-code or retention signals; CQ-V-01 owns public API test existence
|
|
62
|
+
- Verification criteria: PASS if public functions/classes have callers or intentional-retention rationale, and modules are imported, exposed, or intentionally retained. FAIL if unreachable public APIs or modules exist with no caller/import/export/retention rationale
|
|
61
63
|
|
|
62
64
|
- **CQ-S-09** [P3] Is an intermediate abstraction layer justified by having 2+ consumers?
|
|
63
65
|
- Inference path: structure_spec.md 'Authority and Layer Separation' → abstraction layer justified only when 2+ consumers directly consume it
|
|
@@ -74,11 +76,11 @@ Verifies that the system's module structure, boundaries, and public interfaces a
|
|
|
74
76
|
Verifies that data paths are traceable, transformations identifiable, and sources of truth designated.
|
|
75
77
|
|
|
76
78
|
- **CQ-D-01** [P1] Can the path that a specific user input takes through the system be traced?
|
|
77
|
-
- Inference path: domain_scope.md '
|
|
79
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Architecture and structure; structure_spec.md 'Required Module Structure Elements' → Entry Point → Business Logic → Data Access
|
|
78
80
|
- Verification criteria: PASS if for any user input, the complete processing chain (entry → validation → logic → data access → response) can be traced. FAIL if any input path has untraceable segments
|
|
79
81
|
|
|
80
82
|
- **CQ-D-02** [P1] Can it be identified where specific data is created, transformed, and consumed?
|
|
81
|
-
- Inference path: domain_scope.md '
|
|
83
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Data and state + Architecture and structure; concepts.md 'Data/State Management Terms' → State, Mutation, Transaction
|
|
82
84
|
- Verification criteria: PASS if every data entity has identifiable creation, transformation, and consumption points. FAIL if data appears in a consumer without a traceable origin
|
|
83
85
|
|
|
84
86
|
- **CQ-D-03** [P1] Can the scope of impact when a state change occurs be determined?
|
|
@@ -94,17 +96,21 @@ Verifies that data paths are traceable, transformations identifiable, and source
|
|
|
94
96
|
- Verification criteria: PASS if maximum propagation delay from write to read model is documented. FAIL if CQRS is applied with no consistency guarantee
|
|
95
97
|
|
|
96
98
|
- **CQ-D-06** [P2] When Event Sourcing is used, are terminal states, projections, and partial commit prevention defined?
|
|
97
|
-
- Inference path: domain_scope.md '
|
|
99
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Data and state; logic_rules.md 'State Management Logic' → single business operation with multiple events must choose partial commit prevention strategy
|
|
98
100
|
- Verification criteria: PASS if event-sourced aggregates define terminal states, projector branches, and partial commit prevention strategy. FAIL if any of the three is missing
|
|
99
101
|
|
|
100
102
|
- **CQ-D-07** [P2] Is the schema strategy (schema-on-write vs schema-on-read) explicitly declared?
|
|
101
|
-
- Inference path: domain_scope.md '
|
|
103
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Data and state; schema strategy determines migration and consistency guarantees
|
|
102
104
|
- Verification criteria: PASS if schema strategy is declared with implications for migration and consistency. FAIL if implicit or mixed without documentation
|
|
103
105
|
|
|
104
106
|
- **CQ-D-08** [P3] When data is delivered through multiple paths, is the priority for inconsistency resolution specified?
|
|
105
107
|
- Inference path: dependency_rules.md 'Source of Truth Management' → multi-path data delivery priority must be specified as a contract
|
|
106
108
|
- Verification criteria: PASS if multi-path data delivery includes a documented priority contract. FAIL if resolution is ad-hoc
|
|
107
109
|
|
|
110
|
+
- **CQ-D-09** [P2] Are retention and deletion obligations traceable across primary and derived data stores?
|
|
111
|
+
- Inference path: concepts.md 'Security Terms' → Data Classification, Retention Policy, Deletion/Erasure Path; domain_scope.md 'Major Sub-areas' → Data and state
|
|
112
|
+
- Verification criteria: N/A if the system stores no user, tenant, regulated, or operationally retained data. PASS if classified data has retention/deletion rules covering primary stores, indexes, caches, logs, backups, generated artifacts, and downstream processors. FAIL if data can outlive its declared purpose or deletion is not traceable across derived stores
|
|
113
|
+
|
|
108
114
|
---
|
|
109
115
|
|
|
110
116
|
## 3. Change Impact (CQ-I)
|
|
@@ -140,7 +146,7 @@ Verifies that the system supports safe evolution — changes can be assessed for
|
|
|
140
146
|
- Verification criteria: PASS if dependency analysis tools run in CI and reject circular dependencies. FAIL if no automated cycle detection exists
|
|
141
147
|
|
|
142
148
|
- **CQ-I-08** [P3] When feature toggles are introduced, is the toggle lifecycle managed?
|
|
143
|
-
- Inference path: concepts.md 'Change Management Terms' → Feature Toggle → toggle debt risk; domain_scope.md '
|
|
149
|
+
- Inference path: concepts.md 'Change Management Terms' → Feature Toggle → toggle debt risk; domain_scope.md 'Major Sub-areas' → Operations and maintenance
|
|
144
150
|
- Verification criteria: PASS if toggles have owners, expiration dates, and cleanup procedures. FAIL if toggles persist indefinitely (toggle debt)
|
|
145
151
|
|
|
146
152
|
---
|
|
@@ -150,7 +156,7 @@ Verifies that the system supports safe evolution — changes can be assessed for
|
|
|
150
156
|
Verifies that error paths are defined, recovery strategies exist, and errors propagate with sufficient diagnostic information.
|
|
151
157
|
|
|
152
158
|
- **CQ-E-01** [P1] Are errors classified into operational (recoverable) and programmer (non-recoverable)?
|
|
153
|
-
- Inference path: domain_scope.md '
|
|
159
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Error and failure posture; concepts.md 'Architecture Core Terms' → Middleware for error handling
|
|
154
160
|
- Verification criteria: PASS if errors are classified: recoverable (network timeout, validation failure) vs non-recoverable (null dereference, assertion violation). FAIL if all errors are treated uniformly
|
|
155
161
|
|
|
156
162
|
- **CQ-E-02** [P1] Can the system's recovery path in a specific failure scenario be traced?
|
|
@@ -162,11 +168,11 @@ Verifies that error paths are defined, recovery strategies exist, and errors pro
|
|
|
162
168
|
- Verification criteria: PASS if error propagation follows a defined pattern (Result types, exception hierarchy, error middleware) and is traceable. FAIL if errors are swallowed, re-thrown without context, or propagate through undefined channels
|
|
163
169
|
|
|
164
170
|
- **CQ-E-04** [P1] Do user-facing error messages include cause and recommended actions?
|
|
165
|
-
- Inference path: domain_scope.md 'Required Concept Categories' → Error path
|
|
171
|
+
- Inference path: domain_scope.md 'Required Concept Categories' → Error path; domain_scope.md 'Major Sub-areas' → Error and failure posture
|
|
166
172
|
- Verification criteria: PASS if error messages include (1) what went wrong, (2) what the user can do, and (3) a correlation ID. FAIL if users see raw exceptions or generic messages
|
|
167
173
|
|
|
168
174
|
- **CQ-E-05** [P2] Are circuit breaker patterns applied to external dependency calls?
|
|
169
|
-
- Inference path: dependency_rules.md 'Runtime Dependency Rules' → Circuit Breaker; domain_scope.md '
|
|
175
|
+
- Inference path: dependency_rules.md 'Runtime Dependency Rules' → Circuit Breaker; domain_scope.md 'Major Sub-areas' → Error and failure posture
|
|
170
176
|
- Verification criteria: PASS if external calls implement circuit breaker with defined thresholds. FAIL if no failure isolation exists
|
|
171
177
|
|
|
172
178
|
- **CQ-E-06** [P2] Are retry policies defined with limits and backoff strategies?
|
|
@@ -177,9 +183,9 @@ Verifies that error paths are defined, recovery strategies exist, and errors pro
|
|
|
177
183
|
- Inference path: dependency_rules.md 'Runtime Dependency Rules' → Bulkhead → isolate thread/connection pools per dependency
|
|
178
184
|
- Verification criteria: PASS if resource pools are isolated per dependency. FAIL if all dependencies share a single pool
|
|
179
185
|
|
|
180
|
-
- **CQ-E-08** [P3] Are
|
|
181
|
-
- Inference path: dependency_rules.md 'External Dependency Management' → external calls
|
|
182
|
-
- Verification criteria: PASS if critical flows define
|
|
186
|
+
- **CQ-E-08** [P3] Are dependency failure semantics defined for degraded or halted operation?
|
|
187
|
+
- Inference path: dependency_rules.md 'External Dependency Management' → external calls declare retry, timeout, circuit breaker, explicit degraded behavior, or fail-loud/fail-close behavior; domain_scope.md 'Major Sub-areas' → Error and failure posture
|
|
188
|
+
- Verification criteria: PASS if critical flows define whether they retry, degrade visibly, queue, fail-close, or fail-loud with diagnostics. FAIL if unavailability triggers hidden fallback or has no declared behavior
|
|
183
189
|
|
|
184
190
|
---
|
|
185
191
|
|
|
@@ -200,7 +206,7 @@ Verifies that the type system and constraint design enforce correctness at the e
|
|
|
200
206
|
- Verification criteria: PASS if every terminal state has a transition event, handler branch, and explicit subsequent action list. FAIL if terminal states lack processing branches
|
|
201
207
|
|
|
202
208
|
- **CQ-T-04** [P2] Are type-level safety mechanisms used to prevent invalid states at compile time?
|
|
203
|
-
- Inference path: logic_rules.md 'Type System Logic' → excluding fields from failure branches at compile time; domain_scope.md '
|
|
209
|
+
- Inference path: logic_rules.md 'Type System Logic' → excluding fields from failure branches at compile time; domain_scope.md 'Major Sub-areas' → Interface and contract
|
|
204
210
|
- Verification criteria: PASS if types make invalid states unrepresentable (separate validated/unvalidated types, Result types, branded types). FAIL if validity is checked only at runtime when compile-time is possible
|
|
205
211
|
|
|
206
212
|
- **CQ-T-05** [P2] Are database constraints aligned with application-level validation?
|
|
@@ -238,16 +244,16 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
|
|
|
238
244
|
- Verification criteria: PASS if every public function has at least one test exercising its primary behavior. FAIL if public functions have no corresponding test
|
|
239
245
|
|
|
240
246
|
- **CQ-V-02** [P1] Can uncovered code paths be identified?
|
|
241
|
-
- Inference path: structure_spec.md 'Quantitative Thresholds' → coverage <80% insufficient, <60% critical; domain_scope.md '
|
|
247
|
+
- Inference path: structure_spec.md 'Quantitative Thresholds' → coverage <80% insufficient, <60% critical; domain_scope.md 'Major Sub-areas' → Verification and quality
|
|
242
248
|
- Verification criteria: PASS if coverage is measured with uncovered paths documented. FAIL if no coverage measurement exists or coverage <60% without remediation plan
|
|
243
249
|
|
|
244
250
|
- **CQ-V-03** [P1] Are happy path and error path each verified separately?
|
|
245
251
|
- Inference path: domain_scope.md 'Required Concept Categories' → Happy path and Error path are separate categories; logic_rules.md 'Testing Logic' → Test Boundary Rules
|
|
246
252
|
- Verification criteria: PASS if both success and error paths have dedicated tests per feature. FAIL if only happy-path tests exist
|
|
247
253
|
|
|
248
|
-
- **CQ-V-04** [P1] Are test levels
|
|
249
|
-
- Inference path: domain_scope.md '
|
|
250
|
-
- Verification criteria: PASS if
|
|
254
|
+
- **CQ-V-04** [P1] Are applicable test levels correctly classified and justified?
|
|
255
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; logic_rules.md 'Testing Logic' → Test Boundary Rules → unit vs integration boundary
|
|
256
|
+
- Verification criteria: PASS if unit, integration, E2E, contract, semantic, or other relevant verification levels are classified according to target risk and each applicable level is present or explicitly N/A with rationale. FAIL if required levels are missing, all tests are collapsed into one level, or integration/E2E/semantic checks are mislabeled as unit tests
|
|
251
257
|
|
|
252
258
|
- **CQ-V-05** [P2] Are tests independent of execution order?
|
|
253
259
|
- Inference path: logic_rules.md 'Testing Logic' → Test Independence → no order dependency; Test Determinism → flaky tests worse than no tests
|
|
@@ -266,13 +272,17 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
|
|
|
266
272
|
- Verification criteria: PASS if linting, type checking, and dependency analysis run automatically in CI. FAIL if static analysis is optional or violations ignored
|
|
267
273
|
|
|
268
274
|
- **CQ-V-09** [P3] Is mutation testing applied to critical business logic?
|
|
269
|
-
- Inference path: domain_scope.md '
|
|
275
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Testing' → Mutation Test; logic_rules.md 'Testing Logic' → Mutation Testing → apply selectively
|
|
270
276
|
- Verification criteria: PASS if mutation testing covers critical paths with surviving mutants reviewed. WARNING if not applied to critical logic
|
|
271
277
|
|
|
272
278
|
- **CQ-V-10** [P3] Is property-based testing used for functions with broad input domains?
|
|
273
|
-
- Inference path: domain_scope.md '
|
|
279
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Testing' → Property-Based Test
|
|
274
280
|
- Verification criteria: PASS if broad-domain functions (parsers, validators, serializers) use property-based testing. WARNING if tested only with hand-picked examples
|
|
275
281
|
|
|
282
|
+
- **CQ-V-11** [P2] Are critical user-facing flows tested for accessibility and locale behavior when applicable?
|
|
283
|
+
- Inference path: concepts.md 'Internationalization/Accessibility Terms' → WCAG, ARIA, Screen Reader, Locale, ICU; domain_scope.md 'Major Sub-areas' → Accessibility and internationalization
|
|
284
|
+
- Verification criteria: N/A if the system has no user-facing UI/API text, locale-sensitive output, or accessibility obligation. PASS if critical flows have accessibility checks and locale/formatting tests appropriate to their exposure. FAIL if public or regulated user-facing flows lack accessibility/i18n verification
|
|
285
|
+
|
|
276
286
|
---
|
|
277
287
|
|
|
278
288
|
## 7. Deployment/Operations (CQ-O)
|
|
@@ -280,7 +290,7 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
|
|
|
280
290
|
Verifies that the system is deployable, observable, and operable in production.
|
|
281
291
|
|
|
282
292
|
- **CQ-O-01** [P1] Can the running code version be determined per environment?
|
|
283
|
-
- Inference path: domain_scope.md '
|
|
293
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Operations and maintenance; concepts.md 'DevOps Terms' → Build Artifact
|
|
284
294
|
- Verification criteria: PASS if each environment has a mechanism to identify the exact code version (build hash, image tag, version endpoint). FAIL if no version identification exists
|
|
285
295
|
|
|
286
296
|
- **CQ-O-02** [P1] Are configurations separated from code and managed per environment?
|
|
@@ -292,25 +302,29 @@ Verifies that the system is deployable, observable, and operable in production.
|
|
|
292
302
|
- Verification criteria: PASS if structured logging, metrics, and distributed tracing are present. FAIL if production issues cannot be diagnosed from telemetry
|
|
293
303
|
|
|
294
304
|
- **CQ-O-04** [P2] Are deployment strategies defined with rollback procedures?
|
|
295
|
-
- Inference path: concepts.md 'DevOps' → Blue-Green, Canary, Rolling; domain_scope.md '
|
|
305
|
+
- Inference path: concepts.md 'DevOps' → Blue-Green, Canary, Rolling; domain_scope.md 'Major Sub-areas' → Operations and maintenance
|
|
296
306
|
- Verification criteria: PASS if deployment strategy includes rollback procedures and health check criteria. FAIL if deployments are all-or-nothing
|
|
297
307
|
|
|
298
308
|
- **CQ-O-05** [P2] Are SLIs/SLOs defined for critical paths?
|
|
299
|
-
- Inference path: concepts.md 'Observability' → SLI/SLO/SLA; domain_scope.md '
|
|
309
|
+
- Inference path: concepts.md 'Observability' → SLI/SLO/SLA; domain_scope.md 'Major Sub-areas' → Operations and maintenance
|
|
300
310
|
- Verification criteria: PASS if critical paths have defined SLIs with target SLOs. FAIL if no quantitative service level targets exist
|
|
301
311
|
|
|
302
312
|
- **CQ-O-06** [P3] Are the 12-Factor App principles addressed?
|
|
303
|
-
- Inference path: domain_scope.md '
|
|
313
|
+
- Inference path: domain_scope.md 'Reference Standards and Frameworks' → 12-Factor App, SRE, DORA
|
|
304
314
|
- Verification criteria: PASS if each factor is addressed (compliance or documented deviation). FAIL if not assessed for cloud-deployed services
|
|
305
315
|
|
|
316
|
+
- **CQ-O-07** [P2] Can each deployed release artifact be traced back to source, build, verification, approval, and environment?
|
|
317
|
+
- Inference path: concepts.md 'DevOps Terms' → Build Artifact, Artifact Attestation, Release Artifact Traceability; dependency_rules.md 'AI Supply Chain and Provenance' for AI-generated release inputs
|
|
318
|
+
- Verification criteria: PASS if deployed artifacts can be traced to source revision, build pipeline, dependency set/SBOM where applicable, verification results, approval gate, and deployment environment. FAIL if production code identity or release provenance cannot be reconstructed
|
|
319
|
+
|
|
306
320
|
---
|
|
307
321
|
|
|
308
|
-
## 8. AI Agent Collaboration (CQ-A)
|
|
322
|
+
## 8. AI Agent and LLM-Native Collaboration (CQ-A)
|
|
309
323
|
|
|
310
|
-
Verifies that specifications support AI agent execution — self-contained, unambiguous, and
|
|
324
|
+
Verifies that specifications support AI agent execution and LLM-native software workflows — self-contained, unambiguous, verifiable, and diagnosable. Applicable when AI agents are consumers/executors or when product/development/review behavior depends on model calls, prompt/context contracts, retrieval, tool use, or semantic evaluation.
|
|
311
325
|
|
|
312
326
|
- **CQ-A-01** [P1] Is the specification that an AI agent executes self-contained?
|
|
313
|
-
- Inference path: concepts.md 'Document Design Terms' → Self-contained Spec required when AI agents are executors; domain_scope.md '
|
|
327
|
+
- Inference path: concepts.md 'Document Design Terms' → Self-contained Spec required when AI agents are executors; domain_scope.md 'Major Sub-areas' → Documentation and consumers
|
|
314
328
|
- Verification criteria: PASS if every AI-executed spec includes all necessary context without depending on other sessions or implicit knowledge. FAIL if the spec requires absent context
|
|
315
329
|
|
|
316
330
|
- **CQ-A-02** [P1] Are verification criteria for AI-generated output defined at the "ambiguity detection" level?
|
|
@@ -318,7 +332,7 @@ Verifies that specifications support AI agent execution — self-contained, unam
|
|
|
318
332
|
- Verification criteria: PASS if verification uses concrete, measurable criteria (not subjective assessment). FAIL if criteria allow partial fulfillment or arbitrary interpretation
|
|
319
333
|
|
|
320
334
|
- **CQ-A-03** [P2] Are reading paths specified for AI vs human documentation consumers?
|
|
321
|
-
- Inference path: domain_scope.md '
|
|
335
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Documentation and consumers; concepts.md 'Document Design Terms' → Contract Document vs Guide Document
|
|
322
336
|
- Verification criteria: PASS if contract (machine-readable) and guide (human-readable) content are structurally separated. FAIL if specs and explanations are mixed without separation
|
|
323
337
|
|
|
324
338
|
- **CQ-A-04** [P2] Is constraint design for AI tasks using pre-inclusion rather than post-verification?
|
|
@@ -330,9 +344,65 @@ Verifies that specifications support AI agent execution — self-contained, unam
|
|
|
330
344
|
- Verification criteria: PASS if inter-agent communication uses typed schemas. FAIL if agents communicate via unstructured natural language
|
|
331
345
|
|
|
332
346
|
- **CQ-A-06** [P3] Is AI agent autonomy explicitly bounded?
|
|
333
|
-
- Inference path: domain_scope.md '
|
|
347
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → LLM-native and agentic behavior; prompt_interface.md 'Agent Permission and Autonomy'
|
|
334
348
|
- Verification criteria: PASS if agent permissions define autonomous scope, human-approval scope, and prohibited actions. FAIL if scope is undefined
|
|
335
349
|
|
|
350
|
+
- **CQ-A-07** [P1] Are LLM model/provider/version facts recorded where behavior must be reproducible?
|
|
351
|
+
- Inference path: concepts.md 'LLM-Native Engineering Terms' → Model Provider and Model Version; logic_rules.md 'LLM Boundary Logic' → model/provider facts are external dependency facts
|
|
352
|
+
- Verification criteria: PASS if runs or configs record provider, model id/version or alias status, auth/route realization, and capability requirements. FAIL if a model behavior change cannot be traced to a concrete provider/model fact
|
|
353
|
+
|
|
354
|
+
- **CQ-A-08** [P1] Are prompt templates, context assembly rules, and output schemas versioned and reviewable?
|
|
355
|
+
- Inference path: logic_rules.md 'LLM Boundary Logic' → prompt/context artifacts are behavior-affecting; concepts.md 'Prompt Template' and 'Context Assembly'
|
|
356
|
+
- Verification criteria: PASS if prompt/context/schema changes are tracked like code/config and have an owner or artifact seat. FAIL if runtime behavior depends on unversioned prompt text or hidden context assembly logic
|
|
357
|
+
|
|
358
|
+
- **CQ-A-09** [P1] Does LLM-native failure handling fail-loud instead of silently degrading in development, review, or authority paths?
|
|
359
|
+
- Inference path: logic_rules.md 'LLM-Native Failure Posture' → default fail-loud; concepts.md 'Silent Degradation'
|
|
360
|
+
- Verification criteria: PASS if malformed output, missing context, invalid tool result, schema mismatch, provider preflight failure, and unbudgeted truncation halt or produce structured diagnostics. FAIL if fallback/repair hides the origin or presents degraded output as complete
|
|
361
|
+
|
|
362
|
+
- **CQ-A-10** [P2] Is semantic evaluation defined for non-deterministic model or agent output?
|
|
363
|
+
- Inference path: concepts.md 'Semantic Evaluation'; domain_scope.md 'Major Sub-areas' → LLM-native and agentic behavior
|
|
364
|
+
- Verification criteria: PASS if golden set, rubric, baseline, comparison method, or human review path exists for output quality. FAIL if only route success, schema validity, or exact-match tests are used to claim quality
|
|
365
|
+
|
|
366
|
+
- **CQ-A-11** [P2] Do agent tools and MCP boundaries have self-describing schemas and validated results?
|
|
367
|
+
- Inference path: logic_rules.md 'LLM Boundary Logic' → Agent and Tool Rules; structure_spec.md 'LLM-Native System Structure'
|
|
368
|
+
- Verification criteria: PASS if tool schemas include applicability, parameters, result shape, failure semantics, and result validation. FAIL if agents must infer tool use from hidden docs or consume unchecked tool output
|
|
369
|
+
|
|
370
|
+
- **CQ-A-12** [P2] Can evidence-backed LLM output trace claims to source provenance?
|
|
371
|
+
- Inference path: logic_rules.md 'LLM Boundary Logic' → Retrieved context used as evidence must carry source provenance
|
|
372
|
+
- Verification criteria: PASS if evidence-backed claims cite retrieved context or artifact refs. FAIL if generated claims are treated as evidence without source provenance
|
|
373
|
+
|
|
374
|
+
- **CQ-A-13** [P3] Are graceful degradation paths explicit product behavior rather than hidden development shortcuts?
|
|
375
|
+
- Inference path: logic_rules.md 'LLM-Native Failure Posture' → graceful degradation allowed only as explicit product behavior
|
|
376
|
+
- Verification criteria: PASS if degraded behavior declares trigger, lost capability, visible marker, diagnostic artifact, and recovery path. FAIL if fallback exists only to keep the run passing
|
|
377
|
+
|
|
378
|
+
- **CQ-A-14** [P1] Are LLM, runtime, and middleware ownership boundaries explicit and non-overlapping?
|
|
379
|
+
- Inference path: concepts.md 'LLM-Native Engineering Terms' → domain projections for LLM Boundary, Runtime Boundary, Middleware Boundary, Ownership Non-Interference; logic_rules.md 'LLM Boundary Logic' → operational prohibitions; prompt_interface.md 'Ownership Boundary Structure' → interface obligations
|
|
380
|
+
- Verification criteria: PASS if semantic judgment, deterministic binding/validation/persistence, and transport/adaptation responsibilities are declared with forbidden crossovers. FAIL if runtime or middleware silently performs semantic repair, middleware becomes hidden policy authority, or LLM output bypasses runtime-owned authority, persistence, authorization, idempotency, or cost/security gates
|
|
381
|
+
|
|
382
|
+
- **CQ-A-15** [P1] Is LLM output treated as untrusted until validated for the concrete downstream sink?
|
|
383
|
+
- Inference path: concepts.md 'Output Zero-Trust' and 'Sink Validation'; logic_rules.md 'Output Zero-Trust Rules'; prompt_interface.md 'Output Sink Constraints'
|
|
384
|
+
- Verification criteria: PASS if every shell, SQL, HTML, file, email, API/tool, and authority-artifact sink has explicit validation/encoding/authorization before use. FAIL if schema validity or prompt instructions are treated as sufficient safety for downstream use
|
|
385
|
+
|
|
386
|
+
- **CQ-A-16** [P1] Is external content prevented from injecting instructions that change tool permission, role authority, output authority, or secret disclosure behavior?
|
|
387
|
+
- Inference path: concepts.md 'Prompt Injection Boundary'; logic_rules.md 'External Content and Prompt Injection Rules'; prompt_interface.md 'External Content Handling'
|
|
388
|
+
- Verification criteria: PASS if external content is framed as data, instruction hierarchy is runtime-enforced, exfiltration sinks are blocked or approved, and at least one prompt-injection scenario is tested when tool/authority impact exists. FAIL if the model alone is trusted to ignore hostile retrieved/user/web/file content
|
|
389
|
+
|
|
390
|
+
- **CQ-A-17** [P1] Does retrieval/RAG preserve permission, source validation, poisoning resistance, provenance, and auditability before context injection?
|
|
391
|
+
- Inference path: concepts.md 'RAG Permission Boundary' and 'Embedding Index Compatibility'; logic_rules.md 'Retrieval and Vector Rules'; structure_spec.md 'LLM-Native System Structure'
|
|
392
|
+
- Verification criteria: PASS if retrieval filters by user/tenant/sensitivity before context assembly, records source/retrieval provenance, validates ingestion, handles embedding/index compatibility, and audits retrieval paths. FAIL if retrieved text can cross permission boundaries or become evidence without provenance
|
|
393
|
+
|
|
394
|
+
- **CQ-A-18** [P2] Are agent functionality, permission, and autonomy minimized separately?
|
|
395
|
+
- Inference path: concepts.md 'Agent Functionality', 'Agent Permission', and 'Agent Autonomy'; logic_rules.md 'Agent and Tool Rules'; prompt_interface.md 'Agent Permission and Autonomy'
|
|
396
|
+
- Verification criteria: PASS if available tools, authorized scope, and no-approval autonomy are separately declared with high-impact approval gates. FAIL if broad tool availability is treated as permission or permission is treated as autonomous approval
|
|
397
|
+
|
|
398
|
+
- **CQ-A-19** [P2] Can a model/provider, prompt/context, tool, or retrieval-corpus change be evaluated as a behavior migration?
|
|
399
|
+
- Inference path: dependency_rules.md 'LLM/Agent Dependency Management' and 'AI Supply Chain and Provenance'; concepts.md 'Behavior-Affecting Artifact'
|
|
400
|
+
- Verification criteria: PASS if impacted prompts, tool schemas, retrieval indexes, eval baselines, dashboards, cost/rate assumptions, and release gates are listed or explicitly non-applicable. FAIL if the change is treated as an implementation detail with no semantic regression path
|
|
401
|
+
|
|
402
|
+
- **CQ-A-20** [P2] Does semantic evaluation distinguish route success, schema validity, quality, safety, and production drift?
|
|
403
|
+
- Inference path: concepts.md 'Semantic Evaluation', 'Evaluation Baseline', and 'Production Drift'; logic_rules.md 'AI Governance and Risk Rules'
|
|
404
|
+
- Verification criteria: PASS if the system separates deterministic pass/fail, schema validation, semantic quality, safety/security testing, and drift monitoring. FAIL if successful model invocation is used as evidence of output quality
|
|
405
|
+
|
|
336
406
|
---
|
|
337
407
|
|
|
338
408
|
## 9. Security (CQ-SE)
|
|
@@ -340,11 +410,11 @@ Verifies that specifications support AI agent execution — self-contained, unam
|
|
|
340
410
|
Verifies that authentication, authorization, input validation, and supply chain security are addressed.
|
|
341
411
|
|
|
342
412
|
- **CQ-SE-01** [P1] Are authentication mechanisms correctly implemented?
|
|
343
|
-
- Inference path: logic_rules.md 'Security Logic' → Authentication Logic → token validation: structure, signature, expiration, issuer, audience → skipping any = vulnerability; domain_scope.md '
|
|
413
|
+
- Inference path: logic_rules.md 'Security Logic' → Authentication Logic → token validation: structure, signature, expiration, issuer, audience → skipping any = vulnerability; domain_scope.md 'Major Sub-areas' → Security and authorization
|
|
344
414
|
- Verification criteria: PASS if token validation checks all five steps and sessions have absolute/idle timeouts with invalidation on password change. FAIL if any step is skipped or sessions lack expiration
|
|
345
415
|
|
|
346
416
|
- **CQ-SE-02** [P1] Is authorization enforced at the correct granularity (not just authentication)?
|
|
347
|
-
- Inference path: domain_scope.md '
|
|
417
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Security and authorization; domain_scope.md 'Bias Detection Criteria' → security scope bias: auth without authz = full access
|
|
348
418
|
- Verification criteria: PASS if every protected resource checks user permission level. FAIL if only "is logged in" is checked without "has permission"
|
|
349
419
|
|
|
350
420
|
- **CQ-SE-03** [P1] Is input validation applied at system boundaries with defense in depth?
|
|
@@ -356,13 +426,17 @@ Verifies that authentication, authorization, input validation, and supply chain
|
|
|
356
426
|
- Verification criteria: PASS if authorization model is documented with explicit conflict resolution strategy. FAIL if multiple policies apply with no resolution order
|
|
357
427
|
|
|
358
428
|
- **CQ-SE-05** [P2] Are dependency security audits automated in CI?
|
|
359
|
-
- Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security → audit in CI; domain_scope.md '
|
|
429
|
+
- Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security → audit in CI; domain_scope.md 'Major Sub-areas' → Security and authorization
|
|
360
430
|
- Verification criteria: PASS if `npm audit` / `pip audit` or equivalent runs in CI with critical vulnerabilities blocking merge. FAIL if no automated scanning
|
|
361
431
|
|
|
362
432
|
- **CQ-SE-06** [P2] Are OWASP Top 10 risks systematically addressed?
|
|
363
|
-
- Inference path: domain_scope.md '
|
|
433
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Security and authorization; domain_scope.md 'Reference Standards and Frameworks' → OWASP Top 10 for web-facing systems
|
|
364
434
|
- Verification criteria: PASS if assessed against OWASP Top 10 with mitigations documented. FAIL if no assessment for web-facing systems
|
|
365
435
|
|
|
436
|
+
- **CQ-SE-07** [P2] Are sensitive and personal data classified, minimized, and protected according to their handling obligations?
|
|
437
|
+
- Inference path: concepts.md 'Security Terms' → Data Classification, Data Minimization; domain_scope.md 'Major Sub-areas' → Security and authorization
|
|
438
|
+
- Verification criteria: N/A if the system processes no personal, sensitive, tenant-scoped, regulated, or secret-bearing data. PASS if data classes, minimization rationale, access controls, encryption/redaction, logging policy, and retention/deletion obligations are documented and enforced. FAIL if sensitive data is collected, stored, logged, or exposed without classification and handling rules
|
|
439
|
+
|
|
366
440
|
---
|
|
367
441
|
|
|
368
442
|
## 10. Performance (CQ-P)
|
|
@@ -370,24 +444,24 @@ Verifies that authentication, authorization, input validation, and supply chain
|
|
|
370
444
|
Verifies that performance requirements are defined, measured, and validated. Scale-dependent — small systems may not need all of these.
|
|
371
445
|
|
|
372
446
|
- **CQ-P-01** [P2] Are performance targets defined as measurable SLOs?
|
|
373
|
-
- Inference path: domain_scope.md '
|
|
447
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Observability' → SLI/SLO
|
|
374
448
|
- Verification criteria: PASS if critical paths have targets (e.g., p99 < 500ms). FAIL if no performance targets for production endpoints
|
|
375
449
|
|
|
376
|
-
- **CQ-P-02** [P2] Is
|
|
450
|
+
- **CQ-P-02** [P2] Is cached data tied to a declared source of truth and staleness tolerance?
|
|
377
451
|
- Inference path: domain_scope.md 'Required Concept Categories' → Source of truth; concepts.md 'Data/State Management Terms' → Eventual Consistency
|
|
378
|
-
- Verification criteria: PASS if
|
|
452
|
+
- Verification criteria: PASS if each cache declares its authoritative source, owner, consumers, allowed staleness, and inconsistency resolution rule. FAIL if cached data exists without source-of-truth or staleness semantics
|
|
379
453
|
|
|
380
454
|
- **CQ-P-03** [P2] Is load testing performed for expected and peak traffic?
|
|
381
|
-
- Inference path: domain_scope.md '
|
|
455
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; structure_spec.md 'Quantitative Thresholds' → P99 > 1s triggers review
|
|
382
456
|
- Verification criteria: PASS if load tests simulate expected/peak traffic with results compared to SLOs. FAIL if no load testing for concurrent-user systems
|
|
383
457
|
|
|
384
458
|
- **CQ-P-04** [P3] Are common performance anti-patterns (N+1, unindexed, unbounded) identified?
|
|
385
|
-
- Inference path: structure_spec.md 'Storage/Data Layer'; domain_scope.md '
|
|
459
|
+
- Inference path: structure_spec.md 'Storage/Data Layer'; domain_scope.md 'Major Sub-areas' → Verification and quality
|
|
386
460
|
- Verification criteria: PASS if anti-patterns detected by tooling or review. FAIL if known anti-patterns exist in production without mitigation
|
|
387
461
|
|
|
388
462
|
- **CQ-P-05** [P2] Is caching invalidation strategy defined with consistency guarantees?
|
|
389
463
|
- Inference path: logic_rules.md §Performance Logic → Caching Rules → cache invalidation strategies; dependency_rules.md §Source of Truth Management
|
|
390
|
-
- Verification criteria: PASS if
|
|
464
|
+
- Verification criteria: PASS if each cache has an invalidation or refresh mechanism (TTL, event-based invalidation, versioned keys, write-through/update, or documented manual refresh) that is compatible with its declared staleness tolerance. FAIL if cache entries can remain stale beyond the declared tolerance or invalidation triggers are undefined
|
|
391
465
|
|
|
392
466
|
- **CQ-P-06** [P2] Are N+1 query patterns detected and prevented?
|
|
393
467
|
- Inference path: logic_rules.md §Performance Logic → N+1 Query Detection; structure_spec.md §Storage/Data Layer
|
|
@@ -419,6 +493,10 @@ Verifies that concurrent access is handled safely — deadlocks prevented, race
|
|
|
419
493
|
- Inference path: logic_rules.md 'State Management Logic' → Distributed State Rules → optimistic for low contention, pessimistic for high; concepts.md 'Data/State Management Terms' → Optimistic vs Pessimistic Locking
|
|
420
494
|
- Verification criteria: PASS if strategy is documented with justification based on conflict probability. FAIL if implicit or mismatched to contention level
|
|
421
495
|
|
|
496
|
+
- **CQ-C-05** [P2] Are concurrent resource limits defined with behavior at capacity?
|
|
497
|
+
- Inference path: domain_scope.md 'Required Concept Categories' → Concurrency; logic_rules.md §Concurrency Logic → Concurrency Model Rules; dependency_rules.md §Runtime Dependency Rules → Bulkhead and Timeout and Retry Policies
|
|
498
|
+
- Verification criteria: PASS if connection pools, thread pools, queues, actor mailboxes, worker pools, or other concurrent capacity limits declare behavior at saturation. FAIL if resource exhaustion under concurrent load leads to undefined blocking, dropped work, retry storms, or unbounded memory growth
|
|
499
|
+
|
|
422
500
|
---
|
|
423
501
|
|
|
424
502
|
## 12. Dependencies (CQ-DE)
|
|
@@ -429,9 +507,9 @@ Verifies that dependency management is sound — direction rules enforced, exter
|
|
|
429
507
|
- Inference path: dependency_rules.md 'Acyclic Dependencies' → ADP → DAG required; dependency_rules.md 'Breaking Cycles' → inversion, events, shared kernel strategies
|
|
430
508
|
- Verification criteria: PASS if dependency graph is verified acyclic by tooling. FAIL if circular dependencies exist. Type-only circular imports are WARNING
|
|
431
509
|
|
|
432
|
-
- **CQ-DE-02** [P1] Do dependencies
|
|
433
|
-
- Inference path: dependency_rules.md 'Direction Rules' →
|
|
434
|
-
- Verification criteria: PASS if
|
|
510
|
+
- **CQ-DE-02** [P1] Do dependencies follow the declared architecture's direction rule and dependency kind?
|
|
511
|
+
- Inference path: dependency_rules.md 'Direction Rules' → dependency direction vocabulary, pattern selection rule, DIP, Clean Architecture, Hexagonal Architecture, Stable Dependencies Principle
|
|
512
|
+
- Verification criteria: PASS if source-code, runtime-flow, and contract dependencies are classified and each source-code edge follows the declared architecture's rule. FAIL if a single global upper/lower rule is applied across Layered, Clean, Hexagonal, and DIP contexts or if business policy depends on concrete infrastructure without accepted rationale
|
|
435
513
|
|
|
436
514
|
- **CQ-DE-03** [P1] Are lock files committed and deterministic builds guaranteed?
|
|
437
515
|
- Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Lock File Management → lock files ensure deterministic builds
|
|
@@ -449,6 +527,14 @@ Verifies that dependency management is sound — direction rules enforced, exter
|
|
|
449
527
|
- Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Transitive Dependency Management → phantom dependencies from hoisting; dependency_rules.md 'Referential Integrity' → imported modules must be declared
|
|
450
528
|
- Verification criteria: PASS if only declared dependencies can be imported. FAIL if code imports undeclared packages relying on hoisting
|
|
451
529
|
|
|
530
|
+
- **CQ-DE-07** [P2] Are event schema contract dependencies owned, versioned, and decommissionable?
|
|
531
|
+
- Inference path: dependency_rules.md 'Breaking Cycles' → Event-based decoupling; dependency_rules.md 'Event Schema Contract Dependencies'; competency_qs.md 'Event/Messaging'
|
|
532
|
+
- Verification criteria: N/A if the system has no event/message/pub-sub path. PASS if each event/message schema has an owner, versioning policy, compatibility rules, producer/consumer direction, migration window for breaking changes, and decommissioning path. FAIL if event-based decoupling hides consumer coupling, stale subscribers, or schema ownership gaps
|
|
533
|
+
|
|
534
|
+
- **CQ-DE-08** [P2] Are general software supply-chain artifacts traceable and verifiable outside AI-specific provenance?
|
|
535
|
+
- Inference path: concepts.md 'DevOps Terms' → SBOM, Artifact Attestation, Release Artifact Traceability; dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security; domain_scope.md 'Reference Standards and Frameworks' → SLSA and NIST SSDF
|
|
536
|
+
- Verification criteria: PASS if build/release artifacts have dependency inventory, license/security checks, signature or attestation where required, and provenance verification tied to CI/release gates. FAIL if only AI artifacts have provenance while conventional build/release artifacts lack traceability
|
|
537
|
+
|
|
452
538
|
## 13. Boundary Conditions (CQ-B)
|
|
453
539
|
|
|
454
540
|
Verifies that edge cases, limits, and boundary values are identified and handled.
|
|
@@ -463,11 +549,7 @@ Scope boundary: CQ-B addresses **value-level boundaries** (min/max, empty, overf
|
|
|
463
549
|
- Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → integer overflow in payment calculation; logic_rules.md §Type System Logic → type safety
|
|
464
550
|
- Verification criteria: PASS if arithmetic operations on user-influenced values use overflow-safe types or explicit checks. FAIL if arithmetic can overflow silently (e.g., 32-bit integer for monetary values)
|
|
465
551
|
|
|
466
|
-
- **CQ-B-03** [P2] Are
|
|
467
|
-
- Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → concurrent access; logic_rules.md §Concurrency Logic → Race Condition Patterns
|
|
468
|
-
- Verification criteria: PASS if resource limits (connection pools, thread pools, queue depths) have defined behavior at capacity. FAIL if resource exhaustion leads to undefined behavior
|
|
469
|
-
|
|
470
|
-
- **CQ-B-04** [P2] Are collection size boundaries handled (empty collections, single-element, maximum size)?
|
|
552
|
+
- **CQ-B-03** [P2] Are collection size boundaries handled (empty collections, single-element, maximum size)?
|
|
471
553
|
- Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → empty array dereference; logic_rules.md §Constraint Design Logic → schema validation at entry point
|
|
472
554
|
- Verification criteria: PASS if operations on collections handle empty, single-element, and maximum-size cases. FAIL if code assumes non-empty collections without validation
|
|
473
555
|
|
|
@@ -480,13 +562,21 @@ Verifies that requirements are captured, testable, and traceable. Applicable whe
|
|
|
480
562
|
Scope boundary: CQ-R addresses **requirements traceability and quantification**. Test strategy and coverage belong to CQ-V.
|
|
481
563
|
|
|
482
564
|
- **CQ-R-01** [P2] Is each functional requirement traceable to at least one test or verification method?
|
|
483
|
-
- Inference path: domain_scope.md §
|
|
565
|
+
- Inference path: domain_scope.md §Major Sub-areas → Interface and contract; structure_spec.md §Golden Relationships → Test-Code coherence
|
|
484
566
|
- Verification criteria: PASS if a traceability matrix (or equivalent) links requirements to tests. FAIL if requirements exist without corresponding verification
|
|
485
567
|
|
|
486
568
|
- **CQ-R-02** [P2] Are non-functional requirements quantified with measurable targets?
|
|
487
|
-
- Inference path: domain_scope.md §
|
|
569
|
+
- Inference path: domain_scope.md §Major Sub-areas → Interface and contract + Verification and quality
|
|
488
570
|
- Verification criteria: PASS if NFRs have numeric targets (e.g., p99 < 500ms, 99.9% availability). FAIL if NFRs are stated qualitatively ("should be fast")
|
|
489
571
|
|
|
572
|
+
- **CQ-R-03** [P2] Are accessibility and internationalization requirements explicit when user-facing scope requires them?
|
|
573
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Accessibility and internationalization; concepts.md 'Internationalization/Accessibility Terms'
|
|
574
|
+
- Verification criteria: N/A if the system has no user-facing surface, locale-sensitive output, public/regulatory exposure, or assistive-technology obligation. PASS if accessibility level, supported locales, text direction, formatting, translation ownership, and acceptance criteria are explicit. FAIL if user-facing scope exists but accessibility/i18n requirements are absent or purely qualitative
|
|
575
|
+
|
|
576
|
+
- **CQ-R-04** [P1] Are user-visible, API, or release-affecting behaviors expressed as testable acceptance criteria?
|
|
577
|
+
- Inference path: domain_scope.md 'Major Sub-areas' → Interface and contract; structure_spec.md 'Golden Relationships' → Documentation-Code alignment; logic_rules.md 'Constraint Design Logic' → free-text pass_criteria failure modes
|
|
578
|
+
- Verification criteria: PASS if externally observable or release-affecting behavior has acceptance criteria specific enough to verify. FAIL if behavior is described only as intent, narrative, or qualitative preference with no concrete expected result, non-goal, or verification method
|
|
579
|
+
|
|
490
580
|
---
|
|
491
581
|
|
|
492
582
|
## 15. Maintenance (CQ-MT)
|
|
@@ -494,13 +584,17 @@ Scope boundary: CQ-R addresses **requirements traceability and quantification**.
|
|
|
494
584
|
Verifies that maintenance processes are defined and classified. Applicable when the system has an established user base with ongoing development.
|
|
495
585
|
|
|
496
586
|
- **CQ-MT-01** [P2] Are maintenance activities classified by type (corrective/adaptive/perfective/preventive)?
|
|
497
|
-
- Inference path: domain_scope.md §
|
|
587
|
+
- Inference path: domain_scope.md §Domain Purpose → full lifecycle; ISO/IEC/IEEE 12207 in domain_scope.md §Reference Standards and Frameworks; concepts.md §Change Management Terms → Technical Debt
|
|
498
588
|
- Verification criteria: PASS if maintenance work is classified and tracked by type with distinct workflows. FAIL if all maintenance is treated as undifferentiated "bug fixes"
|
|
499
589
|
|
|
500
590
|
- **CQ-MT-02** [P2] Is technical debt tracked with remediation plans?
|
|
501
|
-
- Inference path: domain_scope.md §
|
|
591
|
+
- Inference path: domain_scope.md §Top-Down Concern Stack → Lifecycle and governance; concepts.md §Interpretation Principles → "Technical debt" is not "code I don't like"
|
|
502
592
|
- Verification criteria: PASS if technical debt items are documented with estimated cost and remediation priority. FAIL if technical debt is acknowledged informally but not tracked
|
|
503
593
|
|
|
594
|
+
- **CQ-MT-03** [P2] Is retirement or decommissioning complete across runtime, docs, data, dependencies, and operations?
|
|
595
|
+
- Inference path: domain_scope.md §Domain Purpose → decommissioning and retirement; extension_cases.md Case SE-07; concepts.md §Change Management Terms → Deprecation and Feature Toggle
|
|
596
|
+
- Verification criteria: N/A if no feature, service, API, model, prompt, tool, index, datastore, or dependency is being retired. PASS if communication, dependency removal, route/flag cleanup, documentation updates, data retention/deletion, monitoring cleanup, and final verification are complete or explicitly retained with rationale. FAIL if retired behavior survives as stale routes, flags, docs, alerts, data obligations, indexes, or consumers without rationale
|
|
597
|
+
|
|
504
598
|
---
|
|
505
599
|
|
|
506
600
|
## 16. Event/Messaging (CQ-M)
|
|
@@ -510,7 +604,7 @@ Verifies that asynchronous messaging patterns are correctly implemented. Applica
|
|
|
510
604
|
Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics**. Data flow traceability belongs to CQ-D.
|
|
511
605
|
|
|
512
606
|
- **CQ-M-01** [P1] Is the message delivery guarantee explicitly chosen and its trade-offs documented?
|
|
513
|
-
- Inference path: domain_scope.md §
|
|
607
|
+
- Inference path: domain_scope.md §Major Sub-areas → Architecture and structure; logic_rules.md §State Management Logic → determinism
|
|
514
608
|
- Verification criteria: PASS if delivery guarantee is documented with justification. FAIL if the guarantee is implicit or unknown
|
|
515
609
|
|
|
516
610
|
- **CQ-M-02** [P1] Are message consumers idempotent when at-least-once delivery is used?
|
|
@@ -518,7 +612,7 @@ Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics
|
|
|
518
612
|
- Verification criteria: PASS if consumers handle duplicate messages safely (idempotency keys, deduplication). FAIL if duplicate processing causes side effects
|
|
519
613
|
|
|
520
614
|
- **CQ-M-03** [P2] Is message ordering guaranteed where business logic requires it?
|
|
521
|
-
- Inference path: domain_scope.md §
|
|
615
|
+
- Inference path: domain_scope.md §Major Sub-areas → Architecture and structure; logic_rules.md §State Management Logic → Fundamental State Rules → same input must produce same output
|
|
522
616
|
- Verification criteria: PASS if ordering-sensitive operations use partitioned/ordered channels. FAIL if ordering is assumed but not enforced
|
|
523
617
|
|
|
524
618
|
- **CQ-M-04** [P2] Are dead-letter queues configured for unprocessable messages?
|
|
@@ -527,12 +621,47 @@ Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics
|
|
|
527
621
|
|
|
528
622
|
---
|
|
529
623
|
|
|
624
|
+
## 17. AI Governance, Provenance, and Value Alignment (CQ-G)
|
|
625
|
+
|
|
626
|
+
Verifies that AI-era software behavior has accountable ownership, evidence, incident feedback, and explicit value tradeoffs. Applicable when AI behavior materially affects users, operators, release decisions, security/privacy, authority artifacts, or engineering workflow.
|
|
627
|
+
|
|
628
|
+
- **CQ-G-01** [P1] Is there an accountable owner for material AI risk?
|
|
629
|
+
- Inference path: concepts.md 'AI Governance' and 'AI Risk Owner'; domain_scope.md 'Axiology Input'; logic_rules.md 'AI Governance and Risk Rules'
|
|
630
|
+
- Verification criteria: PASS if the risk owner, risk treatment, approval/acceptance gate, and review cadence are named or explicitly non-applicable. FAIL if AI risk is treated as an implicit engineering side effect with no accountable owner
|
|
631
|
+
|
|
632
|
+
- **CQ-G-02** [P1] Do generated or retrieved authority-affecting artifacts preserve provenance?
|
|
633
|
+
- Inference path: concepts.md 'Provenance' and 'Generated Artifact'; dependency_rules.md 'AI Supply Chain and Provenance'; structure_spec.md 'Provenance record'
|
|
634
|
+
- Verification criteria: PASS if source refs, builder/agent, inputs, transformation path, verification state, and relevant model/provider facts are recorded. FAIL if generated or retrieved claims become authority without traceable evidence
|
|
635
|
+
|
|
636
|
+
- **CQ-G-03** [P2] Do red-team findings, semantic eval failures, incidents, and drift signals feed back into controls?
|
|
637
|
+
- Inference path: concepts.md 'Red-Team/Eval Loop'; structure_spec.md 'Red-team/incident loop'; logic_rules.md 'AI Governance and Risk Rules'
|
|
638
|
+
- Verification criteria: PASS if findings can update prompts, policies, tests, evals, guardrails, release gates, or incident playbooks with an owner. FAIL if findings are recorded but cannot change the system
|
|
639
|
+
|
|
640
|
+
- **CQ-G-04** [P2] Is fail-loud chosen where diagnosability and artifact truth matter more than apparent continuity?
|
|
641
|
+
- Inference path: domain_scope.md 'Axiology Input'; logic_rules.md 'LLM-Native Failure Posture'; concepts.md 'Fail-Loud' and 'Silent Degradation'
|
|
642
|
+
- Verification criteria: PASS if development/review/authority paths halt or emit diagnostic artifacts on contract failure, and product degradation visibly marks loss and recovery. FAIL if the system optimizes for a smooth demo by hiding trust loss, missing evidence, or failing boundaries
|
|
643
|
+
|
|
644
|
+
- **CQ-G-05** [P2] Are human approval and user/operator agency preserved for high-impact AI actions?
|
|
645
|
+
- Inference path: concepts.md 'Human Approval Gate' and 'Agent Autonomy'; structure_spec.md 'Human approval gate'; prompt_interface.md 'Agent Permission and Autonomy'
|
|
646
|
+
- Verification criteria: PASS if high-impact actions have approval, denial, audit, and recovery paths. FAIL if an agent can independently perform irreversible, external, privileged, or user-affecting actions without an explicit gate or risk acceptance
|
|
647
|
+
|
|
648
|
+
- **CQ-G-06** [P3] Are lifecycle and standards anchors current enough for the target's risk?
|
|
649
|
+
- Inference path: domain_scope.md 'Reference Standards and Frameworks' and 'Top-Down Concern Stack'
|
|
650
|
+
- Verification criteria: PASS if requirements, quality, accessibility, AI governance, secure development, and lifecycle references use current anchors or justify older references. FAIL if outdated standards or implementation-only scope hide acquisition, operation, maintenance, retirement, accessibility, or governance obligations
|
|
651
|
+
|
|
652
|
+
- **CQ-G-07** [P2] Are AI transparency and incident disclosure paths defined for affected stakeholders?
|
|
653
|
+
- Inference path: concepts.md 'AI Incident Disclosure'; domain_scope.md 'AI governance and risk'; extension_cases.md Case AI-06
|
|
654
|
+
- Verification criteria: N/A if AI behavior cannot materially affect users, operators, release decisions, security/privacy, authority artifacts, or trust claims. PASS if disclosure audience, trigger, timing, content, artifact/audit refs, remediation communication, and trust-status display are defined. FAIL if material AI failures can occur without a communication path to affected users, operators, or internal stakeholders
|
|
655
|
+
|
|
656
|
+
---
|
|
657
|
+
|
|
530
658
|
## Related Documents
|
|
531
659
|
- domain_scope.md — top-level scope definition and bias detection criteria
|
|
532
660
|
- concepts.md — term definitions for type system, architecture, testing, security, etc.
|
|
533
|
-
- logic_rules.md — rules for types (CQ-T), state (CQ-D), security (CQ-SE), concurrency (CQ-C), testing (CQ-V), AI agent collaboration (CQ-A)
|
|
661
|
+
- logic_rules.md — rules for types (CQ-T), state (CQ-D), security (CQ-SE), concurrency (CQ-C), testing (CQ-V), AI agent and LLM-native collaboration (CQ-A)
|
|
534
662
|
- structure_spec.md — rules for module structure (CQ-S), golden relationships (CQ-V), verification pipeline (CQ-V), thresholds (CQ-S)
|
|
535
663
|
- logic_rules.md §Error Handling Logic — inference path target for CQ-B boundary condition questions
|
|
536
|
-
- domain_scope.md §
|
|
664
|
+
- domain_scope.md §Domain Purpose and §Top-Down Concern Stack — inference path targets for CQ-MT maintenance questions
|
|
537
665
|
- dependency_rules.md — rules for dependency direction (CQ-DE), API management (CQ-I), build dependencies (CQ-DE), runtime dependencies (CQ-E)
|
|
538
|
-
- domain_scope.md §Sub-area to CQ
|
|
666
|
+
- domain_scope.md §Sub-area to CQ Mapping — mapping between scope sub-areas and CQ sections
|
|
667
|
+
- prompt_interface.md — interface criteria for prompt/context/tool/output/sink boundaries used by CQ-A and CQ-G
|