onto-mcp 0.3.1 → 0.3.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (52) hide show
  1. package/.onto/authority/core-lexicon.yaml +1 -0
  2. package/.onto/domains/software-engineering/competency_qs.md +192 -63
  3. package/.onto/domains/software-engineering/concepts.md +67 -5
  4. package/.onto/domains/software-engineering/conciseness_rules.md +22 -2
  5. package/.onto/domains/software-engineering/dependency_rules.md +78 -8
  6. package/.onto/domains/software-engineering/domain_scope.md +181 -150
  7. package/.onto/domains/software-engineering/extension_cases.md +318 -542
  8. package/.onto/domains/software-engineering/logic_rules.md +75 -3
  9. package/.onto/domains/software-engineering/problem_framing_profile.md +29 -2
  10. package/.onto/domains/software-engineering/prompt_interface.md +122 -0
  11. package/.onto/domains/software-engineering/structure_spec.md +53 -4
  12. package/.onto/principles/llm-native-development-guideline.md +20 -0
  13. package/.onto/principles/productization-charter.md +6 -0
  14. package/.onto/processes/reconstruct/reconstruct-boundary-contract.md +278 -91
  15. package/.onto/processes/reconstruct/reconstruct-execution-ux-contract.md +45 -12
  16. package/.onto/processes/reconstruct/source-profile-contract.md +39 -6
  17. package/.onto/processes/reconstruct/top-level-concept-discovery-contract.md +387 -0
  18. package/.onto/processes/review/lens-registry.md +16 -0
  19. package/.onto/processes/shared/target-material-kind-contract.md +18 -2
  20. package/.onto/roles/axiology.md +7 -2
  21. package/AGENTS.md +3 -2
  22. package/README.md +39 -33
  23. package/dist/core-api/reconstruct-api.js +22 -5
  24. package/dist/core-api/review-api.js +1288 -533
  25. package/dist/core-runtime/cli/mock-review-unit-executor.js +17 -0
  26. package/dist/core-runtime/cli/review-invoke.js +23 -48
  27. package/dist/core-runtime/cli/run-review-prompt-execution.js +122 -0
  28. package/dist/core-runtime/path-boundary.js +58 -0
  29. package/dist/core-runtime/reconstruct/artifact-types.js +5 -0
  30. package/dist/core-runtime/reconstruct/materialize-preparation.js +54 -4
  31. package/dist/core-runtime/reconstruct/pipeline-execution-ledger.js +38 -2
  32. package/dist/core-runtime/reconstruct/post-seed-validation.js +13 -0
  33. package/dist/core-runtime/reconstruct/record.js +11 -0
  34. package/dist/core-runtime/reconstruct/run.js +1133 -26
  35. package/dist/core-runtime/reconstruct/seed-candidate-validation.js +29 -0
  36. package/dist/core-runtime/review/execution-plan-boundary.js +123 -0
  37. package/dist/core-runtime/review/materializers.js +8 -3
  38. package/dist/core-runtime/review/review-artifact-utils.js +15 -2
  39. package/dist/core-runtime/review/review-invocation-runner.js +604 -0
  40. package/dist/core-runtime/target-material-kind.js +43 -5
  41. package/dist/mcp/server.js +158 -39
  42. package/dist/mcp/tool-schemas.js +22 -2
  43. package/package.json +3 -1
  44. package/.onto/domains/llm-native-development/competency_qs.md +0 -430
  45. package/.onto/domains/llm-native-development/concepts.md +0 -242
  46. package/.onto/domains/llm-native-development/conciseness_rules.md +0 -163
  47. package/.onto/domains/llm-native-development/dependency_rules.md +0 -216
  48. package/.onto/domains/llm-native-development/domain_scope.md +0 -197
  49. package/.onto/domains/llm-native-development/extension_cases.md +0 -474
  50. package/.onto/domains/llm-native-development/logic_rules.md +0 -123
  51. package/.onto/domains/llm-native-development/prompt_interface.md +0 -49
  52. package/.onto/domains/llm-native-development/structure_spec.md +0 -245
@@ -1108,6 +1108,7 @@ terms:
1108
1108
  - "Orthogonal to medium: medium is a cross-product reference and learning frame such as excel or cli; target_material_kind is a per-run target classification."
1109
1109
  - "Orthogonal to target_input_kind: target_input_kind says how the target entered the runtime, such as single_file or git_diff; target_material_kind says what kind of material is being handled."
1110
1110
  - "Do not use review context `source_kind` for this axis. In review artifacts, source_kind identifies context-source artifacts such as materialized_input or review_target_profile."
1111
+ - "`mixed` is a classification value for bundles with multiple member material kinds, not a generic adapter target. A process exposing `mixed` must define supported_composite, partial_composite, unsupported, or reserved_future behavior in its process contract before dispatch."
1111
1112
  - term_id: "pipeline_execution_ledger"
1112
1113
  canonical_label: "PipelineExecutionLedger"
1113
1114
  korean_label: "파이프라인 실행 단위 원장"
@@ -1,6 +1,6 @@
1
1
  ---
2
- version: 3
3
- last_updated: "2026-03-31"
2
+ version: 8
3
+ last_updated: "2026-05-28"
4
4
  source: bundled-domain-baseline
5
5
  status: established
6
6
  ---
@@ -19,7 +19,9 @@ Priority levels:
19
19
  - **P2** — Should be answerable for production systems. Failure indicates a quality gap.
20
20
  - **P3** — Recommended for mature systems. Failure indicates a refinement opportunity.
21
21
 
22
- Prefix allocation protocol: New CQ sections use 1–2 character alphabetic prefix codes with mandatory `-` separator (e.g., CQ-XX-01). Prefixes must not be string prefixes of existing prefixes. Current allocations: S, D, I, E, T, V, O, A, SE, P, C, DE, B, R, MT, M.
22
+ Prefix allocation protocol: New CQ sections use 1–2 character alphabetic prefix codes with mandatory `-` separator (e.g., CQ-XX-01). Prefixes must not be string prefixes of existing prefixes. Current allocations: S, D, I, E, T, V, O, A, SE, P, C, DE, B, R, MT, M, G.
23
+
24
+ Applicability verdict protocol: Every CQ has an applicability result before PASS/FAIL. Use `N/A` only when the target lacks the pattern, scale, risk, consumer, or behavior named by the CQ and that absence is supported by architecture, dependency, product, or scope evidence. A missing artifact is `FAIL`, not `N/A`, when the target actually uses the pattern or claims the obligation.
23
25
 
24
26
  ---
25
27
 
@@ -32,8 +34,8 @@ Verifies that the system's module structure, boundaries, and public interfaces a
32
34
  - Verification criteria: PASS if a complete list of modules can be produced, each with a declared role. FAIL if modules exist that cannot be classified into a structural role
33
35
 
34
36
  - **CQ-S-02** [P1] Can the modules that a specific module depends on be derived?
35
- - Inference path: dependency_rules.md 'Acyclic Dependencies' → module dependency graph must be a DAG → dependencies must be traceable; structure_spec.md 'Layer Structure Principles' → upper layers depend on lower layers
36
- - Verification criteria: PASS if for any module, direct and transitive dependencies can be enumerated from import statements or build configuration. FAIL if dependencies are implicit (global state, service locator without registration)
37
+ - Inference path: dependency_rules.md 'Acyclic Dependencies' → module dependency graph must be a DAG → dependencies must be traceable; dependency_rules.md 'Direction Rules' → dependency kind and declared architecture determine direction rules
38
+ - Verification criteria: PASS if for any module, direct and transitive dependencies can be enumerated from import statements, build configuration, or declared registration artifacts. FAIL if dependencies are implicit (global state, service locator without registration) or cannot be classified by dependency kind
37
39
 
38
40
  - **CQ-S-03** [P1] Can the list of public APIs (externally exposed interfaces) be extracted?
39
41
  - Inference path: structure_spec.md 'Golden Relationships' → Module-Interface coherence → every module must expose its contract via a public interface; structure_spec.md 'Required Relationships' → every public function/class must have at least one caller or test
@@ -44,7 +46,7 @@ Verifies that the system's module structure, boundaries, and public interfaces a
44
46
  - Verification criteria: PASS if consumers only import from the module's public API surface. FAIL if consumers directly import internal files or private functions
45
47
 
46
48
  - **CQ-S-05** [P2] Is the architectural pattern explicitly declared and consistently applied?
47
- - Inference path: structure_spec.md 'Architectural Patterns' → each pattern defines specific dependency rules; domain_scope.md 'Structure & Architecture' → Module Separation is required
49
+ - Inference path: structure_spec.md 'Architectural Patterns' → each pattern defines specific dependency rules; domain_scope.md 'Major Sub-areas' → Architecture and structure
48
50
  - Verification criteria: PASS if the codebase declares its architectural pattern and conforms to that pattern's rules. FAIL if no pattern is declared or the codebase violates its declared pattern
49
51
 
50
52
  - **CQ-S-06** [P2] Is the package/module organization axis consistently applied at each directory level?
@@ -55,9 +57,9 @@ Verifies that the system's module structure, boundaries, and public interfaces a
55
57
  - Inference path: structure_spec.md 'Quantitative Thresholds' → module size >500 lines, function size >50 lines, cyclomatic complexity >10, dependency fan-out >7, inheritance depth >5
56
58
  - Verification criteria: PASS if structural metrics are measured and threshold violations are addressed or justified. FAIL if no metrics are measured or violations exist without justification
57
59
 
58
- - **CQ-S-08** [P2] Are there isolated nodes (dead code, modules with no imports, public APIs without tests)?
59
- - Inference path: structure_spec.md 'Isolated Node Prohibition' → public function with no callers = dead code warning; public API without tests = verification gap
60
- - Verification criteria: PASS if no public functions or modules exist without at least one caller or test. FAIL if dead code or untested public APIs exist without intentional retention documentation
60
+ - **CQ-S-08** [P2] Are isolated nodes or intentionally retained unreachable modules identifiable?
61
+ - Inference path: structure_spec.md 'Isolated Node Prohibition' → public function/class with no callers and modules with no imports are dead-code or retention signals; CQ-V-01 owns public API test existence
62
+ - Verification criteria: PASS if public functions/classes have callers or intentional-retention rationale, and modules are imported, exposed, or intentionally retained. FAIL if unreachable public APIs or modules exist with no caller/import/export/retention rationale
61
63
 
62
64
  - **CQ-S-09** [P3] Is an intermediate abstraction layer justified by having 2+ consumers?
63
65
  - Inference path: structure_spec.md 'Authority and Layer Separation' → abstraction layer justified only when 2+ consumers directly consume it
@@ -74,11 +76,11 @@ Verifies that the system's module structure, boundaries, and public interfaces a
74
76
  Verifies that data paths are traceable, transformations identifiable, and sources of truth designated.
75
77
 
76
78
  - **CQ-D-01** [P1] Can the path that a specific user input takes through the system be traced?
77
- - Inference path: domain_scope.md 'Structure & Architecture' → Data Flow → input-to-processing-to-output paths; structure_spec.md 'Required Module Structure Elements' → Entry Point → Business Logic → Data Access
79
+ - Inference path: domain_scope.md 'Major Sub-areas' → Architecture and structure; structure_spec.md 'Required Module Structure Elements' → Entry Point → Business Logic → Data Access
78
80
  - Verification criteria: PASS if for any user input, the complete processing chain (entry → validation → logic → data access → response) can be traced. FAIL if any input path has untraceable segments
79
81
 
80
82
  - **CQ-D-02** [P1] Can it be identified where specific data is created, transformed, and consumed?
81
- - Inference path: domain_scope.md 'Structure & Architecture' → Data Flow transformation chains; concepts.md 'Data/State Management Terms' → State, Mutation, Transaction
83
+ - Inference path: domain_scope.md 'Major Sub-areas' → Data and state + Architecture and structure; concepts.md 'Data/State Management Terms' → State, Mutation, Transaction
82
84
  - Verification criteria: PASS if every data entity has identifiable creation, transformation, and consumption points. FAIL if data appears in a consumer without a traceable origin
83
85
 
84
86
  - **CQ-D-03** [P1] Can the scope of impact when a state change occurs be determined?
@@ -94,17 +96,21 @@ Verifies that data paths are traceable, transformations identifiable, and source
94
96
  - Verification criteria: PASS if maximum propagation delay from write to read model is documented. FAIL if CQRS is applied with no consistency guarantee
95
97
 
96
98
  - **CQ-D-06** [P2] When Event Sourcing is used, are terminal states, projections, and partial commit prevention defined?
97
- - Inference path: domain_scope.md 'Data & State' → Event Sourcing; logic_rules.md 'State Management Logic' → single business operation with multiple events must choose partial commit prevention strategy
99
+ - Inference path: domain_scope.md 'Major Sub-areas' → Data and state; logic_rules.md 'State Management Logic' → single business operation with multiple events must choose partial commit prevention strategy
98
100
  - Verification criteria: PASS if event-sourced aggregates define terminal states, projector branches, and partial commit prevention strategy. FAIL if any of the three is missing
99
101
 
100
102
  - **CQ-D-07** [P2] Is the schema strategy (schema-on-write vs schema-on-read) explicitly declared?
101
- - Inference path: domain_scope.md 'Data & State' → Data Modeling schema strategy determines migration and consistency guarantees
103
+ - Inference path: domain_scope.md 'Major Sub-areas' → Data and state; schema strategy determines migration and consistency guarantees
102
104
  - Verification criteria: PASS if schema strategy is declared with implications for migration and consistency. FAIL if implicit or mixed without documentation
103
105
 
104
106
  - **CQ-D-08** [P3] When data is delivered through multiple paths, is the priority for inconsistency resolution specified?
105
107
  - Inference path: dependency_rules.md 'Source of Truth Management' → multi-path data delivery priority must be specified as a contract
106
108
  - Verification criteria: PASS if multi-path data delivery includes a documented priority contract. FAIL if resolution is ad-hoc
107
109
 
110
+ - **CQ-D-09** [P2] Are retention and deletion obligations traceable across primary and derived data stores?
111
+ - Inference path: concepts.md 'Security Terms' → Data Classification, Retention Policy, Deletion/Erasure Path; domain_scope.md 'Major Sub-areas' → Data and state
112
+ - Verification criteria: N/A if the system stores no user, tenant, regulated, or operationally retained data. PASS if classified data has retention/deletion rules covering primary stores, indexes, caches, logs, backups, generated artifacts, and downstream processors. FAIL if data can outlive its declared purpose or deletion is not traceable across derived stores
113
+
108
114
  ---
109
115
 
110
116
  ## 3. Change Impact (CQ-I)
@@ -140,7 +146,7 @@ Verifies that the system supports safe evolution — changes can be assessed for
140
146
  - Verification criteria: PASS if dependency analysis tools run in CI and reject circular dependencies. FAIL if no automated cycle detection exists
141
147
 
142
148
  - **CQ-I-08** [P3] When feature toggles are introduced, is the toggle lifecycle managed?
143
- - Inference path: concepts.md 'Change Management Terms' → Feature Toggle → toggle debt risk; domain_scope.md 'Operations & Deployment' → feature flags
149
+ - Inference path: concepts.md 'Change Management Terms' → Feature Toggle → toggle debt risk; domain_scope.md 'Major Sub-areas' → Operations and maintenance
144
150
  - Verification criteria: PASS if toggles have owners, expiration dates, and cleanup procedures. FAIL if toggles persist indefinitely (toggle debt)
145
151
 
146
152
  ---
@@ -150,7 +156,7 @@ Verifies that the system supports safe evolution — changes can be assessed for
150
156
  Verifies that error paths are defined, recovery strategies exist, and errors propagate with sufficient diagnostic information.
151
157
 
152
158
  - **CQ-E-01** [P1] Are errors classified into operational (recoverable) and programmer (non-recoverable)?
153
- - Inference path: domain_scope.md 'Interface & Contract' → Error Handling classification required; concepts.md 'Architecture Core Terms' → Middleware for error handling
159
+ - Inference path: domain_scope.md 'Major Sub-areas' → Error and failure posture; concepts.md 'Architecture Core Terms' → Middleware for error handling
154
160
  - Verification criteria: PASS if errors are classified: recoverable (network timeout, validation failure) vs non-recoverable (null dereference, assertion violation). FAIL if all errors are treated uniformly
155
161
 
156
162
  - **CQ-E-02** [P1] Can the system's recovery path in a specific failure scenario be traced?
@@ -162,11 +168,11 @@ Verifies that error paths are defined, recovery strategies exist, and errors pro
162
168
  - Verification criteria: PASS if error propagation follows a defined pattern (Result types, exception hierarchy, error middleware) and is traceable. FAIL if errors are swallowed, re-thrown without context, or propagate through undefined channels
163
169
 
164
170
  - **CQ-E-04** [P1] Do user-facing error messages include cause and recommended actions?
165
- - Inference path: domain_scope.md 'Required Concept Categories' → Error path → user sees raw stack trace; domain_scope.md 'Interface & Contract' → Error Handling user guidance
171
+ - Inference path: domain_scope.md 'Required Concept Categories' → Error path; domain_scope.md 'Major Sub-areas' → Error and failure posture
166
172
  - Verification criteria: PASS if error messages include (1) what went wrong, (2) what the user can do, and (3) a correlation ID. FAIL if users see raw exceptions or generic messages
167
173
 
168
174
  - **CQ-E-05** [P2] Are circuit breaker patterns applied to external dependency calls?
169
- - Inference path: dependency_rules.md 'Runtime Dependency Rules' → Circuit Breaker; domain_scope.md 'Interface & Contract' → Error Handling → cascading failure prevention
175
+ - Inference path: dependency_rules.md 'Runtime Dependency Rules' → Circuit Breaker; domain_scope.md 'Major Sub-areas' → Error and failure posture
170
176
  - Verification criteria: PASS if external calls implement circuit breaker with defined thresholds. FAIL if no failure isolation exists
171
177
 
172
178
  - **CQ-E-06** [P2] Are retry policies defined with limits and backoff strategies?
@@ -177,9 +183,9 @@ Verifies that error paths are defined, recovery strategies exist, and errors pro
177
183
  - Inference path: dependency_rules.md 'Runtime Dependency Rules' → Bulkhead → isolate thread/connection pools per dependency
178
184
  - Verification criteria: PASS if resource pools are isolated per dependency. FAIL if all dependencies share a single pool
179
185
 
180
- - **CQ-E-08** [P3] Are fallback paths defined for degraded operation?
181
- - Inference path: dependency_rules.md 'External Dependency Management' → external calls must have fallback; domain_scope.md 'Interface & Contract' → Error Handling fallback paths
182
- - Verification criteria: PASS if critical flows define fallback behavior (cached data, defaults, queuing). FAIL if unavailability produces only errors
186
+ - **CQ-E-08** [P3] Are dependency failure semantics defined for degraded or halted operation?
187
+ - Inference path: dependency_rules.md 'External Dependency Management' → external calls declare retry, timeout, circuit breaker, explicit degraded behavior, or fail-loud/fail-close behavior; domain_scope.md 'Major Sub-areas' → Error and failure posture
188
+ - Verification criteria: PASS if critical flows define whether they retry, degrade visibly, queue, fail-close, or fail-loud with diagnostics. FAIL if unavailability triggers hidden fallback or has no declared behavior
183
189
 
184
190
  ---
185
191
 
@@ -200,7 +206,7 @@ Verifies that the type system and constraint design enforce correctness at the e
200
206
  - Verification criteria: PASS if every terminal state has a transition event, handler branch, and explicit subsequent action list. FAIL if terminal states lack processing branches
201
207
 
202
208
  - **CQ-T-04** [P2] Are type-level safety mechanisms used to prevent invalid states at compile time?
203
- - Inference path: logic_rules.md 'Type System Logic' → excluding fields from failure branches at compile time; domain_scope.md 'Interface & Contract' → Type System → type-level safety
209
+ - Inference path: logic_rules.md 'Type System Logic' → excluding fields from failure branches at compile time; domain_scope.md 'Major Sub-areas' → Interface and contract
204
210
  - Verification criteria: PASS if types make invalid states unrepresentable (separate validated/unvalidated types, Result types, branded types). FAIL if validity is checked only at runtime when compile-time is possible
205
211
 
206
212
  - **CQ-T-05** [P2] Are database constraints aligned with application-level validation?
@@ -238,16 +244,16 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
238
244
  - Verification criteria: PASS if every public function has at least one test exercising its primary behavior. FAIL if public functions have no corresponding test
239
245
 
240
246
  - **CQ-V-02** [P1] Can uncovered code paths be identified?
241
- - Inference path: structure_spec.md 'Quantitative Thresholds' → coverage <80% insufficient, <60% critical; domain_scope.md 'Verification & Quality' → Test Strategy
247
+ - Inference path: structure_spec.md 'Quantitative Thresholds' → coverage <80% insufficient, <60% critical; domain_scope.md 'Major Sub-areas' → Verification and quality
242
248
  - Verification criteria: PASS if coverage is measured with uncovered paths documented. FAIL if no coverage measurement exists or coverage <60% without remediation plan
243
249
 
244
250
  - **CQ-V-03** [P1] Are happy path and error path each verified separately?
245
251
  - Inference path: domain_scope.md 'Required Concept Categories' → Happy path and Error path are separate categories; logic_rules.md 'Testing Logic' → Test Boundary Rules
246
252
  - Verification criteria: PASS if both success and error paths have dedicated tests per feature. FAIL if only happy-path tests exist
247
253
 
248
- - **CQ-V-04** [P1] Are test levels (unit, integration, E2E) correctly classified and each present?
249
- - Inference path: domain_scope.md 'Verification & Quality' → Testing Pyramid; logic_rules.md 'Testing Logic' → Test Boundary Rules → unit vs integration boundary
250
- - Verification criteria: PASS if tests are classified by level with each present. FAIL if all tests are at one level, or if integration tests are mislabeled as unit tests
254
+ - **CQ-V-04** [P1] Are applicable test levels correctly classified and justified?
255
+ - Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; logic_rules.md 'Testing Logic' → Test Boundary Rules → unit vs integration boundary
256
+ - Verification criteria: PASS if unit, integration, E2E, contract, semantic, or other relevant verification levels are classified according to target risk and each applicable level is present or explicitly N/A with rationale. FAIL if required levels are missing, all tests are collapsed into one level, or integration/E2E/semantic checks are mislabeled as unit tests
251
257
 
252
258
  - **CQ-V-05** [P2] Are tests independent of execution order?
253
259
  - Inference path: logic_rules.md 'Testing Logic' → Test Independence → no order dependency; Test Determinism → flaky tests worse than no tests
@@ -266,13 +272,17 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
266
272
  - Verification criteria: PASS if linting, type checking, and dependency analysis run automatically in CI. FAIL if static analysis is optional or violations ignored
267
273
 
268
274
  - **CQ-V-09** [P3] Is mutation testing applied to critical business logic?
269
- - Inference path: domain_scope.md 'Verification & Quality' → mutation testing; concepts.md 'Testing' → Mutation Test; logic_rules.md 'Testing Logic' → Mutation Testing → apply selectively
275
+ - Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Testing' → Mutation Test; logic_rules.md 'Testing Logic' → Mutation Testing → apply selectively
270
276
  - Verification criteria: PASS if mutation testing covers critical paths with surviving mutants reviewed. WARNING if not applied to critical logic
271
277
 
272
278
  - **CQ-V-10** [P3] Is property-based testing used for functions with broad input domains?
273
- - Inference path: domain_scope.md 'Verification & Quality' → property-based testing; concepts.md 'Testing' → Property-Based Test
279
+ - Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Testing' → Property-Based Test
274
280
  - Verification criteria: PASS if broad-domain functions (parsers, validators, serializers) use property-based testing. WARNING if tested only with hand-picked examples
275
281
 
282
+ - **CQ-V-11** [P2] Are critical user-facing flows tested for accessibility and locale behavior when applicable?
283
+ - Inference path: concepts.md 'Internationalization/Accessibility Terms' → WCAG, ARIA, Screen Reader, Locale, ICU; domain_scope.md 'Major Sub-areas' → Accessibility and internationalization
284
+ - Verification criteria: N/A if the system has no user-facing UI/API text, locale-sensitive output, or accessibility obligation. PASS if critical flows have accessibility checks and locale/formatting tests appropriate to their exposure. FAIL if public or regulated user-facing flows lack accessibility/i18n verification
285
+
276
286
  ---
277
287
 
278
288
  ## 7. Deployment/Operations (CQ-O)
@@ -280,7 +290,7 @@ Verifies that the test strategy covers necessary levels, boundaries are correctl
280
290
  Verifies that the system is deployable, observable, and operable in production.
281
291
 
282
292
  - **CQ-O-01** [P1] Can the running code version be determined per environment?
283
- - Inference path: domain_scope.md 'Operations & Deployment' → environment separation, monitoring; concepts.md 'DevOps' → Artifact = versioned immutable build output
293
+ - Inference path: domain_scope.md 'Major Sub-areas' → Operations and maintenance; concepts.md 'DevOps Terms' → Build Artifact
284
294
  - Verification criteria: PASS if each environment has a mechanism to identify the exact code version (build hash, image tag, version endpoint). FAIL if no version identification exists
285
295
 
286
296
  - **CQ-O-02** [P1] Are configurations separated from code and managed per environment?
@@ -292,25 +302,29 @@ Verifies that the system is deployable, observable, and operable in production.
292
302
  - Verification criteria: PASS if structured logging, metrics, and distributed tracing are present. FAIL if production issues cannot be diagnosed from telemetry
293
303
 
294
304
  - **CQ-O-04** [P2] Are deployment strategies defined with rollback procedures?
295
- - Inference path: concepts.md 'DevOps' → Blue-Green, Canary, Rolling; domain_scope.md 'Operations & Deployment' → progressive rollouts
305
+ - Inference path: concepts.md 'DevOps' → Blue-Green, Canary, Rolling; domain_scope.md 'Major Sub-areas' → Operations and maintenance
296
306
  - Verification criteria: PASS if deployment strategy includes rollback procedures and health check criteria. FAIL if deployments are all-or-nothing
297
307
 
298
308
  - **CQ-O-05** [P2] Are SLIs/SLOs defined for critical paths?
299
- - Inference path: concepts.md 'Observability' → SLI/SLO/SLA; domain_scope.md 'Operations & Deployment' → error budgets
309
+ - Inference path: concepts.md 'Observability' → SLI/SLO/SLA; domain_scope.md 'Major Sub-areas' → Operations and maintenance
300
310
  - Verification criteria: PASS if critical paths have defined SLIs with target SLOs. FAIL if no quantitative service level targets exist
301
311
 
302
312
  - **CQ-O-06** [P3] Are the 12-Factor App principles addressed?
303
- - Inference path: domain_scope.md 'Operations & Deployment' → 12-Factor App; domain_scope.md 'Normative System Classification' → Layer 3 standards
313
+ - Inference path: domain_scope.md 'Reference Standards and Frameworks' → 12-Factor App, SRE, DORA
304
314
  - Verification criteria: PASS if each factor is addressed (compliance or documented deviation). FAIL if not assessed for cloud-deployed services
305
315
 
316
+ - **CQ-O-07** [P2] Can each deployed release artifact be traced back to source, build, verification, approval, and environment?
317
+ - Inference path: concepts.md 'DevOps Terms' → Build Artifact, Artifact Attestation, Release Artifact Traceability; dependency_rules.md 'AI Supply Chain and Provenance' for AI-generated release inputs
318
+ - Verification criteria: PASS if deployed artifacts can be traced to source revision, build pipeline, dependency set/SBOM where applicable, verification results, approval gate, and deployment environment. FAIL if production code identity or release provenance cannot be reconstructed
319
+
306
320
  ---
307
321
 
308
- ## 8. AI Agent Collaboration (CQ-A)
322
+ ## 8. AI Agent and LLM-Native Collaboration (CQ-A)
309
323
 
310
- Verifies that specifications support AI agent execution — self-contained, unambiguous, and verifiable. Applicable when AI agents are consumers or executors.
324
+ Verifies that specifications support AI agent execution and LLM-native software workflows — self-contained, unambiguous, verifiable, and diagnosable. Applicable when AI agents are consumers/executors or when product/development/review behavior depends on model calls, prompt/context contracts, retrieval, tool use, or semantic evaluation.
311
325
 
312
326
  - **CQ-A-01** [P1] Is the specification that an AI agent executes self-contained?
313
- - Inference path: concepts.md 'Document Design Terms' → Self-contained Spec required when AI agents are executors; domain_scope.md 'Documentation & Consumers' → dual-consumer handling
327
+ - Inference path: concepts.md 'Document Design Terms' → Self-contained Spec required when AI agents are executors; domain_scope.md 'Major Sub-areas' → Documentation and consumers
314
328
  - Verification criteria: PASS if every AI-executed spec includes all necessary context without depending on other sessions or implicit knowledge. FAIL if the spec requires absent context
315
329
 
316
330
  - **CQ-A-02** [P1] Are verification criteria for AI-generated output defined at the "ambiguity detection" level?
@@ -318,7 +332,7 @@ Verifies that specifications support AI agent execution — self-contained, unam
318
332
  - Verification criteria: PASS if verification uses concrete, measurable criteria (not subjective assessment). FAIL if criteria allow partial fulfillment or arbitrary interpretation
319
333
 
320
334
  - **CQ-A-03** [P2] Are reading paths specified for AI vs human documentation consumers?
321
- - Inference path: domain_scope.md 'Documentation & Consumers' → dual-consumer handling; concepts.md 'Document Design Terms' → Contract Document vs Guide Document
335
+ - Inference path: domain_scope.md 'Major Sub-areas' → Documentation and consumers; concepts.md 'Document Design Terms' → Contract Document vs Guide Document
322
336
  - Verification criteria: PASS if contract (machine-readable) and guide (human-readable) content are structurally separated. FAIL if specs and explanations are mixed without separation
323
337
 
324
338
  - **CQ-A-04** [P2] Is constraint design for AI tasks using pre-inclusion rather than post-verification?
@@ -330,9 +344,65 @@ Verifies that specifications support AI agent execution — self-contained, unam
330
344
  - Verification criteria: PASS if inter-agent communication uses typed schemas. FAIL if agents communicate via unstructured natural language
331
345
 
332
346
  - **CQ-A-06** [P3] Is AI agent autonomy explicitly bounded?
333
- - Inference path: domain_scope.md 'Documentation & Consumers' → Constraint Design hard/soft classification; concepts.md 'Constraint Design Terms' → Hard vs Soft Constraint
347
+ - Inference path: domain_scope.md 'Major Sub-areas' → LLM-native and agentic behavior; prompt_interface.md 'Agent Permission and Autonomy'
334
348
  - Verification criteria: PASS if agent permissions define autonomous scope, human-approval scope, and prohibited actions. FAIL if scope is undefined
335
349
 
350
+ - **CQ-A-07** [P1] Are LLM model/provider/version facts recorded where behavior must be reproducible?
351
+ - Inference path: concepts.md 'LLM-Native Engineering Terms' → Model Provider and Model Version; logic_rules.md 'LLM Boundary Logic' → model/provider facts are external dependency facts
352
+ - Verification criteria: PASS if runs or configs record provider, model id/version or alias status, auth/route realization, and capability requirements. FAIL if a model behavior change cannot be traced to a concrete provider/model fact
353
+
354
+ - **CQ-A-08** [P1] Are prompt templates, context assembly rules, and output schemas versioned and reviewable?
355
+ - Inference path: logic_rules.md 'LLM Boundary Logic' → prompt/context artifacts are behavior-affecting; concepts.md 'Prompt Template' and 'Context Assembly'
356
+ - Verification criteria: PASS if prompt/context/schema changes are tracked like code/config and have an owner or artifact seat. FAIL if runtime behavior depends on unversioned prompt text or hidden context assembly logic
357
+
358
+ - **CQ-A-09** [P1] Does LLM-native failure handling fail-loud instead of silently degrading in development, review, or authority paths?
359
+ - Inference path: logic_rules.md 'LLM-Native Failure Posture' → default fail-loud; concepts.md 'Silent Degradation'
360
+ - Verification criteria: PASS if malformed output, missing context, invalid tool result, schema mismatch, provider preflight failure, and unbudgeted truncation halt or produce structured diagnostics. FAIL if fallback/repair hides the origin or presents degraded output as complete
361
+
362
+ - **CQ-A-10** [P2] Is semantic evaluation defined for non-deterministic model or agent output?
363
+ - Inference path: concepts.md 'Semantic Evaluation'; domain_scope.md 'Major Sub-areas' → LLM-native and agentic behavior
364
+ - Verification criteria: PASS if golden set, rubric, baseline, comparison method, or human review path exists for output quality. FAIL if only route success, schema validity, or exact-match tests are used to claim quality
365
+
366
+ - **CQ-A-11** [P2] Do agent tools and MCP boundaries have self-describing schemas and validated results?
367
+ - Inference path: logic_rules.md 'LLM Boundary Logic' → Agent and Tool Rules; structure_spec.md 'LLM-Native System Structure'
368
+ - Verification criteria: PASS if tool schemas include applicability, parameters, result shape, failure semantics, and result validation. FAIL if agents must infer tool use from hidden docs or consume unchecked tool output
369
+
370
+ - **CQ-A-12** [P2] Can evidence-backed LLM output trace claims to source provenance?
371
+ - Inference path: logic_rules.md 'LLM Boundary Logic' → Retrieved context used as evidence must carry source provenance
372
+ - Verification criteria: PASS if evidence-backed claims cite retrieved context or artifact refs. FAIL if generated claims are treated as evidence without source provenance
373
+
374
+ - **CQ-A-13** [P3] Are graceful degradation paths explicit product behavior rather than hidden development shortcuts?
375
+ - Inference path: logic_rules.md 'LLM-Native Failure Posture' → graceful degradation allowed only as explicit product behavior
376
+ - Verification criteria: PASS if degraded behavior declares trigger, lost capability, visible marker, diagnostic artifact, and recovery path. FAIL if fallback exists only to keep the run passing
377
+
378
+ - **CQ-A-14** [P1] Are LLM, runtime, and middleware ownership boundaries explicit and non-overlapping?
379
+ - Inference path: concepts.md 'LLM-Native Engineering Terms' → domain projections for LLM Boundary, Runtime Boundary, Middleware Boundary, Ownership Non-Interference; logic_rules.md 'LLM Boundary Logic' → operational prohibitions; prompt_interface.md 'Ownership Boundary Structure' → interface obligations
380
+ - Verification criteria: PASS if semantic judgment, deterministic binding/validation/persistence, and transport/adaptation responsibilities are declared with forbidden crossovers. FAIL if runtime or middleware silently performs semantic repair, middleware becomes hidden policy authority, or LLM output bypasses runtime-owned authority, persistence, authorization, idempotency, or cost/security gates
381
+
382
+ - **CQ-A-15** [P1] Is LLM output treated as untrusted until validated for the concrete downstream sink?
383
+ - Inference path: concepts.md 'Output Zero-Trust' and 'Sink Validation'; logic_rules.md 'Output Zero-Trust Rules'; prompt_interface.md 'Output Sink Constraints'
384
+ - Verification criteria: PASS if every shell, SQL, HTML, file, email, API/tool, and authority-artifact sink has explicit validation/encoding/authorization before use. FAIL if schema validity or prompt instructions are treated as sufficient safety for downstream use
385
+
386
+ - **CQ-A-16** [P1] Is external content prevented from injecting instructions that change tool permission, role authority, output authority, or secret disclosure behavior?
387
+ - Inference path: concepts.md 'Prompt Injection Boundary'; logic_rules.md 'External Content and Prompt Injection Rules'; prompt_interface.md 'External Content Handling'
388
+ - Verification criteria: PASS if external content is framed as data, instruction hierarchy is runtime-enforced, exfiltration sinks are blocked or approved, and at least one prompt-injection scenario is tested when tool/authority impact exists. FAIL if the model alone is trusted to ignore hostile retrieved/user/web/file content
389
+
390
+ - **CQ-A-17** [P1] Does retrieval/RAG preserve permission, source validation, poisoning resistance, provenance, and auditability before context injection?
391
+ - Inference path: concepts.md 'RAG Permission Boundary' and 'Embedding Index Compatibility'; logic_rules.md 'Retrieval and Vector Rules'; structure_spec.md 'LLM-Native System Structure'
392
+ - Verification criteria: PASS if retrieval filters by user/tenant/sensitivity before context assembly, records source/retrieval provenance, validates ingestion, handles embedding/index compatibility, and audits retrieval paths. FAIL if retrieved text can cross permission boundaries or become evidence without provenance
393
+
394
+ - **CQ-A-18** [P2] Are agent functionality, permission, and autonomy minimized separately?
395
+ - Inference path: concepts.md 'Agent Functionality', 'Agent Permission', and 'Agent Autonomy'; logic_rules.md 'Agent and Tool Rules'; prompt_interface.md 'Agent Permission and Autonomy'
396
+ - Verification criteria: PASS if available tools, authorized scope, and no-approval autonomy are separately declared with high-impact approval gates. FAIL if broad tool availability is treated as permission or permission is treated as autonomous approval
397
+
398
+ - **CQ-A-19** [P2] Can a model/provider, prompt/context, tool, or retrieval-corpus change be evaluated as a behavior migration?
399
+ - Inference path: dependency_rules.md 'LLM/Agent Dependency Management' and 'AI Supply Chain and Provenance'; concepts.md 'Behavior-Affecting Artifact'
400
+ - Verification criteria: PASS if impacted prompts, tool schemas, retrieval indexes, eval baselines, dashboards, cost/rate assumptions, and release gates are listed or explicitly non-applicable. FAIL if the change is treated as an implementation detail with no semantic regression path
401
+
402
+ - **CQ-A-20** [P2] Does semantic evaluation distinguish route success, schema validity, quality, safety, and production drift?
403
+ - Inference path: concepts.md 'Semantic Evaluation', 'Evaluation Baseline', and 'Production Drift'; logic_rules.md 'AI Governance and Risk Rules'
404
+ - Verification criteria: PASS if the system separates deterministic pass/fail, schema validation, semantic quality, safety/security testing, and drift monitoring. FAIL if successful model invocation is used as evidence of output quality
405
+
336
406
  ---
337
407
 
338
408
  ## 9. Security (CQ-SE)
@@ -340,11 +410,11 @@ Verifies that specifications support AI agent execution — self-contained, unam
340
410
  Verifies that authentication, authorization, input validation, and supply chain security are addressed.
341
411
 
342
412
  - **CQ-SE-01** [P1] Are authentication mechanisms correctly implemented?
343
- - Inference path: logic_rules.md 'Security Logic' → Authentication Logic → token validation: structure, signature, expiration, issuer, audience → skipping any = vulnerability; domain_scope.md 'Security & Auth' → OAuth 2.0/OIDC
413
+ - Inference path: logic_rules.md 'Security Logic' → Authentication Logic → token validation: structure, signature, expiration, issuer, audience → skipping any = vulnerability; domain_scope.md 'Major Sub-areas' → Security and authorization
344
414
  - Verification criteria: PASS if token validation checks all five steps and sessions have absolute/idle timeouts with invalidation on password change. FAIL if any step is skipped or sessions lack expiration
345
415
 
346
416
  - **CQ-SE-02** [P1] Is authorization enforced at the correct granularity (not just authentication)?
347
- - Inference path: domain_scope.md 'Security & Auth' → RBAC vs ABAC; domain_scope.md 'Bias Detection Criteria' → security scope bias: auth without authz = full access (OWASP A01)
417
+ - Inference path: domain_scope.md 'Major Sub-areas' → Security and authorization; domain_scope.md 'Bias Detection Criteria' → security scope bias: auth without authz = full access
348
418
  - Verification criteria: PASS if every protected resource checks user permission level. FAIL if only "is logged in" is checked without "has permission"
349
419
 
350
420
  - **CQ-SE-03** [P1] Is input validation applied at system boundaries with defense in depth?
@@ -356,13 +426,17 @@ Verifies that authentication, authorization, input validation, and supply chain
356
426
  - Verification criteria: PASS if authorization model is documented with explicit conflict resolution strategy. FAIL if multiple policies apply with no resolution order
357
427
 
358
428
  - **CQ-SE-05** [P2] Are dependency security audits automated in CI?
359
- - Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security → audit in CI; domain_scope.md 'Security & Auth' → supply chain security
429
+ - Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security → audit in CI; domain_scope.md 'Major Sub-areas' → Security and authorization
360
430
  - Verification criteria: PASS if `npm audit` / `pip audit` or equivalent runs in CI with critical vulnerabilities blocking merge. FAIL if no automated scanning
361
431
 
362
432
  - **CQ-SE-06** [P2] Are OWASP Top 10 risks systematically addressed?
363
- - Inference path: domain_scope.md 'Security & Auth' → OWASP Top 10 (2021); domain_scope.md 'Reference Standards/Frameworks' → OWASP for web-facing systems
433
+ - Inference path: domain_scope.md 'Major Sub-areas' → Security and authorization; domain_scope.md 'Reference Standards and Frameworks' → OWASP Top 10 for web-facing systems
364
434
  - Verification criteria: PASS if assessed against OWASP Top 10 with mitigations documented. FAIL if no assessment for web-facing systems
365
435
 
436
+ - **CQ-SE-07** [P2] Are sensitive and personal data classified, minimized, and protected according to their handling obligations?
437
+ - Inference path: concepts.md 'Security Terms' → Data Classification, Data Minimization; domain_scope.md 'Major Sub-areas' → Security and authorization
438
+ - Verification criteria: N/A if the system processes no personal, sensitive, tenant-scoped, regulated, or secret-bearing data. PASS if data classes, minimization rationale, access controls, encryption/redaction, logging policy, and retention/deletion obligations are documented and enforced. FAIL if sensitive data is collected, stored, logged, or exposed without classification and handling rules
439
+
366
440
  ---
367
441
 
368
442
  ## 10. Performance (CQ-P)
@@ -370,24 +444,24 @@ Verifies that authentication, authorization, input validation, and supply chain
370
444
  Verifies that performance requirements are defined, measured, and validated. Scale-dependent — small systems may not need all of these.
371
445
 
372
446
  - **CQ-P-01** [P2] Are performance targets defined as measurable SLOs?
373
- - Inference path: domain_scope.md 'Verification & Quality' → Performance p50/p95/p99 latency, RPS, error rate; concepts.md 'Observability' → SLI/SLO
447
+ - Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; concepts.md 'Observability' → SLI/SLO
374
448
  - Verification criteria: PASS if critical paths have targets (e.g., p99 < 500ms). FAIL if no performance targets for production endpoints
375
449
 
376
- - **CQ-P-02** [P2] Is caching applied with explicit invalidation and consistency guarantees?
450
+ - **CQ-P-02** [P2] Is cached data tied to a declared source of truth and staleness tolerance?
377
451
  - Inference path: domain_scope.md 'Required Concept Categories' → Source of truth; concepts.md 'Data/State Management Terms' → Eventual Consistency
378
- - Verification criteria: PASS if cached data has TTL or invalidation trigger and documented source-of-truth relationship. FAIL if caching lacks invalidation strategy
452
+ - Verification criteria: PASS if each cache declares its authoritative source, owner, consumers, allowed staleness, and inconsistency resolution rule. FAIL if cached data exists without source-of-truth or staleness semantics
379
453
 
380
454
  - **CQ-P-03** [P2] Is load testing performed for expected and peak traffic?
381
- - Inference path: domain_scope.md 'Verification & Quality' → load testing; structure_spec.md 'Quantitative Thresholds' → P99 > 1s triggers review
455
+ - Inference path: domain_scope.md 'Major Sub-areas' → Verification and quality; structure_spec.md 'Quantitative Thresholds' → P99 > 1s triggers review
382
456
  - Verification criteria: PASS if load tests simulate expected/peak traffic with results compared to SLOs. FAIL if no load testing for concurrent-user systems
383
457
 
384
458
  - **CQ-P-04** [P3] Are common performance anti-patterns (N+1, unindexed, unbounded) identified?
385
- - Inference path: structure_spec.md 'Storage/Data Layer'; domain_scope.md 'Verification & Quality' → identify bottlenecks before production
459
+ - Inference path: structure_spec.md 'Storage/Data Layer'; domain_scope.md 'Major Sub-areas' → Verification and quality
386
460
  - Verification criteria: PASS if anti-patterns detected by tooling or review. FAIL if known anti-patterns exist in production without mitigation
387
461
 
388
462
  - **CQ-P-05** [P2] Is caching invalidation strategy defined with consistency guarantees?
389
463
  - Inference path: logic_rules.md §Performance Logic → Caching Rules → cache invalidation strategies; dependency_rules.md §Source of Truth Management
390
- - Verification criteria: PASS if cached data has explicit invalidation strategy (TTL, event-based, versioned) with documented staleness tolerance. FAIL if caching has no invalidation strategy
464
+ - Verification criteria: PASS if each cache has an invalidation or refresh mechanism (TTL, event-based invalidation, versioned keys, write-through/update, or documented manual refresh) that is compatible with its declared staleness tolerance. FAIL if cache entries can remain stale beyond the declared tolerance or invalidation triggers are undefined
391
465
 
392
466
  - **CQ-P-06** [P2] Are N+1 query patterns detected and prevented?
393
467
  - Inference path: logic_rules.md §Performance Logic → N+1 Query Detection; structure_spec.md §Storage/Data Layer
@@ -419,6 +493,10 @@ Verifies that concurrent access is handled safely — deadlocks prevented, race
419
493
  - Inference path: logic_rules.md 'State Management Logic' → Distributed State Rules → optimistic for low contention, pessimistic for high; concepts.md 'Data/State Management Terms' → Optimistic vs Pessimistic Locking
420
494
  - Verification criteria: PASS if strategy is documented with justification based on conflict probability. FAIL if implicit or mismatched to contention level
421
495
 
496
+ - **CQ-C-05** [P2] Are concurrent resource limits defined with behavior at capacity?
497
+ - Inference path: domain_scope.md 'Required Concept Categories' → Concurrency; logic_rules.md §Concurrency Logic → Concurrency Model Rules; dependency_rules.md §Runtime Dependency Rules → Bulkhead and Timeout and Retry Policies
498
+ - Verification criteria: PASS if connection pools, thread pools, queues, actor mailboxes, worker pools, or other concurrent capacity limits declare behavior at saturation. FAIL if resource exhaustion under concurrent load leads to undefined blocking, dropped work, retry storms, or unbounded memory growth
499
+
422
500
  ---
423
501
 
424
502
  ## 12. Dependencies (CQ-DE)
@@ -429,9 +507,9 @@ Verifies that dependency management is sound — direction rules enforced, exter
429
507
  - Inference path: dependency_rules.md 'Acyclic Dependencies' → ADP → DAG required; dependency_rules.md 'Breaking Cycles' → inversion, events, shared kernel strategies
430
508
  - Verification criteria: PASS if dependency graph is verified acyclic by tooling. FAIL if circular dependencies exist. Type-only circular imports are WARNING
431
509
 
432
- - **CQ-DE-02** [P1] Do dependencies point in the correct direction (stable inward, volatile outward)?
433
- - Inference path: dependency_rules.md 'Direction Rules' → DIP; dependency_rules.md 'Stable Dependencies Principle'; dependency_rules.md 'Clean Architecture Dependency Rule'
434
- - Verification criteria: PASS if business logic depends on abstractions, not infrastructure. FAIL if business logic imports framework classes without abstraction
510
+ - **CQ-DE-02** [P1] Do dependencies follow the declared architecture's direction rule and dependency kind?
511
+ - Inference path: dependency_rules.md 'Direction Rules' → dependency direction vocabulary, pattern selection rule, DIP, Clean Architecture, Hexagonal Architecture, Stable Dependencies Principle
512
+ - Verification criteria: PASS if source-code, runtime-flow, and contract dependencies are classified and each source-code edge follows the declared architecture's rule. FAIL if a single global upper/lower rule is applied across Layered, Clean, Hexagonal, and DIP contexts or if business policy depends on concrete infrastructure without accepted rationale
435
513
 
436
514
  - **CQ-DE-03** [P1] Are lock files committed and deterministic builds guaranteed?
437
515
  - Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Lock File Management → lock files ensure deterministic builds
@@ -449,6 +527,14 @@ Verifies that dependency management is sound — direction rules enforced, exter
449
527
  - Inference path: dependency_rules.md 'Build/Package Dependency Rules' → Transitive Dependency Management → phantom dependencies from hoisting; dependency_rules.md 'Referential Integrity' → imported modules must be declared
450
528
  - Verification criteria: PASS if only declared dependencies can be imported. FAIL if code imports undeclared packages relying on hoisting
451
529
 
530
+ - **CQ-DE-07** [P2] Are event schema contract dependencies owned, versioned, and decommissionable?
531
+ - Inference path: dependency_rules.md 'Breaking Cycles' → Event-based decoupling; dependency_rules.md 'Event Schema Contract Dependencies'; competency_qs.md 'Event/Messaging'
532
+ - Verification criteria: N/A if the system has no event/message/pub-sub path. PASS if each event/message schema has an owner, versioning policy, compatibility rules, producer/consumer direction, migration window for breaking changes, and decommissioning path. FAIL if event-based decoupling hides consumer coupling, stale subscribers, or schema ownership gaps
533
+
534
+ - **CQ-DE-08** [P2] Are general software supply-chain artifacts traceable and verifiable outside AI-specific provenance?
535
+ - Inference path: concepts.md 'DevOps Terms' → SBOM, Artifact Attestation, Release Artifact Traceability; dependency_rules.md 'Build/Package Dependency Rules' → Dependency Security; domain_scope.md 'Reference Standards and Frameworks' → SLSA and NIST SSDF
536
+ - Verification criteria: PASS if build/release artifacts have dependency inventory, license/security checks, signature or attestation where required, and provenance verification tied to CI/release gates. FAIL if only AI artifacts have provenance while conventional build/release artifacts lack traceability
537
+
452
538
  ## 13. Boundary Conditions (CQ-B)
453
539
 
454
540
  Verifies that edge cases, limits, and boundary values are identified and handled.
@@ -463,11 +549,7 @@ Scope boundary: CQ-B addresses **value-level boundaries** (min/max, empty, overf
463
549
  - Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → integer overflow in payment calculation; logic_rules.md §Type System Logic → type safety
464
550
  - Verification criteria: PASS if arithmetic operations on user-influenced values use overflow-safe types or explicit checks. FAIL if arithmetic can overflow silently (e.g., 32-bit integer for monetary values)
465
551
 
466
- - **CQ-B-03** [P2] Are concurrent access boundaries addressed at resource limits?
467
- - Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → concurrent access; logic_rules.md §Concurrency Logic → Race Condition Patterns
468
- - Verification criteria: PASS if resource limits (connection pools, thread pools, queue depths) have defined behavior at capacity. FAIL if resource exhaustion leads to undefined behavior
469
-
470
- - **CQ-B-04** [P2] Are collection size boundaries handled (empty collections, single-element, maximum size)?
552
+ - **CQ-B-03** [P2] Are collection size boundaries handled (empty collections, single-element, maximum size)?
471
553
  - Inference path: domain_scope.md 'Required Concept Categories' → Boundary condition → empty array dereference; logic_rules.md §Constraint Design Logic → schema validation at entry point
472
554
  - Verification criteria: PASS if operations on collections handle empty, single-element, and maximum-size cases. FAIL if code assumes non-empty collections without validation
473
555
 
@@ -480,13 +562,21 @@ Verifies that requirements are captured, testable, and traceable. Applicable whe
480
562
  Scope boundary: CQ-R addresses **requirements traceability and quantification**. Test strategy and coverage belong to CQ-V.
481
563
 
482
564
  - **CQ-R-01** [P2] Is each functional requirement traceable to at least one test or verification method?
483
- - Inference path: domain_scope.md §Interface & Contract Requirements & Specification → testable requirements; structure_spec.md §Golden Relationships → Test-Code coherence
565
+ - Inference path: domain_scope.md §Major Sub-areasInterface and contract; structure_spec.md §Golden Relationships → Test-Code coherence
484
566
  - Verification criteria: PASS if a traceability matrix (or equivalent) links requirements to tests. FAIL if requirements exist without corresponding verification
485
567
 
486
568
  - **CQ-R-02** [P2] Are non-functional requirements quantified with measurable targets?
487
- - Inference path: domain_scope.md §Interface & Contract Requirements & Specification non-functional requirements must be quantified; domain_scope.md §Verification & Quality → Performance → p50/p95/p99
569
+ - Inference path: domain_scope.md §Major Sub-areasInterface and contract + Verification and quality
488
570
  - Verification criteria: PASS if NFRs have numeric targets (e.g., p99 < 500ms, 99.9% availability). FAIL if NFRs are stated qualitatively ("should be fast")
489
571
 
572
+ - **CQ-R-03** [P2] Are accessibility and internationalization requirements explicit when user-facing scope requires them?
573
+ - Inference path: domain_scope.md 'Major Sub-areas' → Accessibility and internationalization; concepts.md 'Internationalization/Accessibility Terms'
574
+ - Verification criteria: N/A if the system has no user-facing surface, locale-sensitive output, public/regulatory exposure, or assistive-technology obligation. PASS if accessibility level, supported locales, text direction, formatting, translation ownership, and acceptance criteria are explicit. FAIL if user-facing scope exists but accessibility/i18n requirements are absent or purely qualitative
575
+
576
+ - **CQ-R-04** [P1] Are user-visible, API, or release-affecting behaviors expressed as testable acceptance criteria?
577
+ - Inference path: domain_scope.md 'Major Sub-areas' → Interface and contract; structure_spec.md 'Golden Relationships' → Documentation-Code alignment; logic_rules.md 'Constraint Design Logic' → free-text pass_criteria failure modes
578
+ - Verification criteria: PASS if externally observable or release-affecting behavior has acceptance criteria specific enough to verify. FAIL if behavior is described only as intent, narrative, or qualitative preference with no concrete expected result, non-goal, or verification method
579
+
490
580
  ---
491
581
 
492
582
  ## 15. Maintenance (CQ-MT)
@@ -494,13 +584,17 @@ Scope boundary: CQ-R addresses **requirements traceability and quantification**.
494
584
  Verifies that maintenance processes are defined and classified. Applicable when the system has an established user base with ongoing development.
495
585
 
496
586
  - **CQ-MT-01** [P2] Are maintenance activities classified by type (corrective/adaptive/perfective/preventive)?
497
- - Inference path: domain_scope.md §Operations, Deployment & Maintenance Maintenance IEEE 14764 four categories; concepts.md §Change Management Terms → Technical Debt
587
+ - Inference path: domain_scope.md §Domain Purposefull lifecycle; ISO/IEC/IEEE 12207 in domain_scope.md §Reference Standards and Frameworks; concepts.md §Change Management Terms → Technical Debt
498
588
  - Verification criteria: PASS if maintenance work is classified and tracked by type with distinct workflows. FAIL if all maintenance is treated as undifferentiated "bug fixes"
499
589
 
500
590
  - **CQ-MT-02** [P2] Is technical debt tracked with remediation plans?
501
- - Inference path: domain_scope.md §Operations, Deployment & Maintenance Maintenance preventive maintenance; concepts.md §Interpretation Principles → "Technical debt" is not "code I don't like"
591
+ - Inference path: domain_scope.md §Top-Down Concern StackLifecycle and governance; concepts.md §Interpretation Principles → "Technical debt" is not "code I don't like"
502
592
  - Verification criteria: PASS if technical debt items are documented with estimated cost and remediation priority. FAIL if technical debt is acknowledged informally but not tracked
503
593
 
594
+ - **CQ-MT-03** [P2] Is retirement or decommissioning complete across runtime, docs, data, dependencies, and operations?
595
+ - Inference path: domain_scope.md §Domain Purpose → decommissioning and retirement; extension_cases.md Case SE-07; concepts.md §Change Management Terms → Deprecation and Feature Toggle
596
+ - Verification criteria: N/A if no feature, service, API, model, prompt, tool, index, datastore, or dependency is being retired. PASS if communication, dependency removal, route/flag cleanup, documentation updates, data retention/deletion, monitoring cleanup, and final verification are complete or explicitly retained with rationale. FAIL if retired behavior survives as stale routes, flags, docs, alerts, data obligations, indexes, or consumers without rationale
597
+
504
598
  ---
505
599
 
506
600
  ## 16. Event/Messaging (CQ-M)
@@ -510,7 +604,7 @@ Verifies that asynchronous messaging patterns are correctly implemented. Applica
510
604
  Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics**. Data flow traceability belongs to CQ-D.
511
605
 
512
606
  - **CQ-M-01** [P1] Is the message delivery guarantee explicitly chosen and its trade-offs documented?
513
- - Inference path: domain_scope.md §Structure & Architecture Event/Messaging at-most-once, at-least-once, exactly-once trade-offs; logic_rules.md §State Management Logic → determinism
607
+ - Inference path: domain_scope.md §Major Sub-areasArchitecture and structure; logic_rules.md §State Management Logic → determinism
514
608
  - Verification criteria: PASS if delivery guarantee is documented with justification. FAIL if the guarantee is implicit or unknown
515
609
 
516
610
  - **CQ-M-02** [P1] Are message consumers idempotent when at-least-once delivery is used?
@@ -518,7 +612,7 @@ Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics
518
612
  - Verification criteria: PASS if consumers handle duplicate messages safely (idempotency keys, deduplication). FAIL if duplicate processing causes side effects
519
613
 
520
614
  - **CQ-M-03** [P2] Is message ordering guaranteed where business logic requires it?
521
- - Inference path: domain_scope.md §Structure & Architecture Event/Messaging pipeline scalability; logic_rules.md §State Management Logic → Fundamental State Rules → same input must produce same output
615
+ - Inference path: domain_scope.md §Major Sub-areasArchitecture and structure; logic_rules.md §State Management Logic → Fundamental State Rules → same input must produce same output
522
616
  - Verification criteria: PASS if ordering-sensitive operations use partitioned/ordered channels. FAIL if ordering is assumed but not enforced
523
617
 
524
618
  - **CQ-M-04** [P2] Are dead-letter queues configured for unprocessable messages?
@@ -527,12 +621,47 @@ Scope boundary: CQ-M addresses **messaging infrastructure and delivery semantics
527
621
 
528
622
  ---
529
623
 
624
+ ## 17. AI Governance, Provenance, and Value Alignment (CQ-G)
625
+
626
+ Verifies that AI-era software behavior has accountable ownership, evidence, incident feedback, and explicit value tradeoffs. Applicable when AI behavior materially affects users, operators, release decisions, security/privacy, authority artifacts, or engineering workflow.
627
+
628
+ - **CQ-G-01** [P1] Is there an accountable owner for material AI risk?
629
+ - Inference path: concepts.md 'AI Governance' and 'AI Risk Owner'; domain_scope.md 'Axiology Input'; logic_rules.md 'AI Governance and Risk Rules'
630
+ - Verification criteria: PASS if the risk owner, risk treatment, approval/acceptance gate, and review cadence are named or explicitly non-applicable. FAIL if AI risk is treated as an implicit engineering side effect with no accountable owner
631
+
632
+ - **CQ-G-02** [P1] Do generated or retrieved authority-affecting artifacts preserve provenance?
633
+ - Inference path: concepts.md 'Provenance' and 'Generated Artifact'; dependency_rules.md 'AI Supply Chain and Provenance'; structure_spec.md 'Provenance record'
634
+ - Verification criteria: PASS if source refs, builder/agent, inputs, transformation path, verification state, and relevant model/provider facts are recorded. FAIL if generated or retrieved claims become authority without traceable evidence
635
+
636
+ - **CQ-G-03** [P2] Do red-team findings, semantic eval failures, incidents, and drift signals feed back into controls?
637
+ - Inference path: concepts.md 'Red-Team/Eval Loop'; structure_spec.md 'Red-team/incident loop'; logic_rules.md 'AI Governance and Risk Rules'
638
+ - Verification criteria: PASS if findings can update prompts, policies, tests, evals, guardrails, release gates, or incident playbooks with an owner. FAIL if findings are recorded but cannot change the system
639
+
640
+ - **CQ-G-04** [P2] Is fail-loud chosen where diagnosability and artifact truth matter more than apparent continuity?
641
+ - Inference path: domain_scope.md 'Axiology Input'; logic_rules.md 'LLM-Native Failure Posture'; concepts.md 'Fail-Loud' and 'Silent Degradation'
642
+ - Verification criteria: PASS if development/review/authority paths halt or emit diagnostic artifacts on contract failure, and product degradation visibly marks loss and recovery. FAIL if the system optimizes for a smooth demo by hiding trust loss, missing evidence, or failing boundaries
643
+
644
+ - **CQ-G-05** [P2] Are human approval and user/operator agency preserved for high-impact AI actions?
645
+ - Inference path: concepts.md 'Human Approval Gate' and 'Agent Autonomy'; structure_spec.md 'Human approval gate'; prompt_interface.md 'Agent Permission and Autonomy'
646
+ - Verification criteria: PASS if high-impact actions have approval, denial, audit, and recovery paths. FAIL if an agent can independently perform irreversible, external, privileged, or user-affecting actions without an explicit gate or risk acceptance
647
+
648
+ - **CQ-G-06** [P3] Are lifecycle and standards anchors current enough for the target's risk?
649
+ - Inference path: domain_scope.md 'Reference Standards and Frameworks' and 'Top-Down Concern Stack'
650
+ - Verification criteria: PASS if requirements, quality, accessibility, AI governance, secure development, and lifecycle references use current anchors or justify older references. FAIL if outdated standards or implementation-only scope hide acquisition, operation, maintenance, retirement, accessibility, or governance obligations
651
+
652
+ - **CQ-G-07** [P2] Are AI transparency and incident disclosure paths defined for affected stakeholders?
653
+ - Inference path: concepts.md 'AI Incident Disclosure'; domain_scope.md 'AI governance and risk'; extension_cases.md Case AI-06
654
+ - Verification criteria: N/A if AI behavior cannot materially affect users, operators, release decisions, security/privacy, authority artifacts, or trust claims. PASS if disclosure audience, trigger, timing, content, artifact/audit refs, remediation communication, and trust-status display are defined. FAIL if material AI failures can occur without a communication path to affected users, operators, or internal stakeholders
655
+
656
+ ---
657
+
530
658
  ## Related Documents
531
659
  - domain_scope.md — top-level scope definition and bias detection criteria
532
660
  - concepts.md — term definitions for type system, architecture, testing, security, etc.
533
- - logic_rules.md — rules for types (CQ-T), state (CQ-D), security (CQ-SE), concurrency (CQ-C), testing (CQ-V), AI agent collaboration (CQ-A)
661
+ - logic_rules.md — rules for types (CQ-T), state (CQ-D), security (CQ-SE), concurrency (CQ-C), testing (CQ-V), AI agent and LLM-native collaboration (CQ-A)
534
662
  - structure_spec.md — rules for module structure (CQ-S), golden relationships (CQ-V), verification pipeline (CQ-V), thresholds (CQ-S)
535
663
  - logic_rules.md §Error Handling Logic — inference path target for CQ-B boundary condition questions
536
- - domain_scope.md §Operations, Deployment & Maintenance — inference path target for CQ-MT maintenance questions
664
+ - domain_scope.md §Domain Purpose and §Top-Down Concern Stack — inference path targets for CQ-MT maintenance questions
537
665
  - dependency_rules.md — rules for dependency direction (CQ-DE), API management (CQ-I), build dependencies (CQ-DE), runtime dependencies (CQ-E)
538
- - domain_scope.md §Sub-area to CQ Section Mapping — mapping between scope sub-areas and CQ sections
666
+ - domain_scope.md §Sub-area to CQ Mapping — mapping between scope sub-areas and CQ sections
667
+ - prompt_interface.md — interface criteria for prompt/context/tool/output/sink boundaries used by CQ-A and CQ-G