onto-mcp 0.3.0 → 0.3.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (61) hide show
  1. package/.onto/authority/core-lexicon.yaml +12 -0
  2. package/.onto/domains/software-engineering/competency_qs.md +192 -63
  3. package/.onto/domains/software-engineering/concepts.md +67 -5
  4. package/.onto/domains/software-engineering/conciseness_rules.md +22 -2
  5. package/.onto/domains/software-engineering/dependency_rules.md +78 -8
  6. package/.onto/domains/software-engineering/domain_scope.md +181 -150
  7. package/.onto/domains/software-engineering/extension_cases.md +318 -542
  8. package/.onto/domains/software-engineering/logic_rules.md +75 -3
  9. package/.onto/domains/software-engineering/problem_framing_profile.md +29 -2
  10. package/.onto/domains/software-engineering/prompt_interface.md +122 -0
  11. package/.onto/domains/software-engineering/structure_spec.md +53 -4
  12. package/.onto/principles/llm-native-development-guideline.md +20 -0
  13. package/.onto/principles/productization-charter.md +6 -0
  14. package/.onto/processes/evolve/material-kind-adapter-contract.md +6 -0
  15. package/.onto/processes/reconstruct/reconstruct-boundary-contract.md +468 -81
  16. package/.onto/processes/reconstruct/reconstruct-execution-ux-contract.md +177 -0
  17. package/.onto/processes/reconstruct/source-profile-contract.md +39 -6
  18. package/.onto/processes/reconstruct/top-level-concept-discovery-contract.md +387 -0
  19. package/.onto/processes/review/binding-contract.md +8 -0
  20. package/.onto/processes/review/lens-registry.md +16 -0
  21. package/.onto/processes/review/pre-dispatch-contracts.md +34 -13
  22. package/.onto/processes/review/productized-live-path.md +3 -1
  23. package/.onto/processes/shared/pipeline-execution-ledger-contract.md +185 -0
  24. package/.onto/processes/shared/target-material-kind-contract.md +24 -2
  25. package/.onto/roles/axiology.md +7 -2
  26. package/AGENTS.md +4 -2
  27. package/README.md +52 -29
  28. package/dist/core-api/reconstruct-api.js +92 -5
  29. package/dist/core-api/review-api.js +1744 -371
  30. package/dist/core-runtime/cli/mock-review-unit-executor.js +17 -0
  31. package/dist/core-runtime/cli/render-review-final-output.js +9 -0
  32. package/dist/core-runtime/cli/review-invoke.js +387 -55
  33. package/dist/core-runtime/cli/run-review-prompt-execution.js +361 -90
  34. package/dist/core-runtime/path-boundary.js +58 -0
  35. package/dist/core-runtime/pipeline-execution-ledger.js +100 -0
  36. package/dist/core-runtime/reconstruct/artifact-types.js +33 -1
  37. package/dist/core-runtime/reconstruct/materialize-preparation.js +54 -4
  38. package/dist/core-runtime/reconstruct/pipeline-execution-ledger.js +342 -0
  39. package/dist/core-runtime/reconstruct/post-seed-validation.js +630 -0
  40. package/dist/core-runtime/reconstruct/record.js +105 -1
  41. package/dist/core-runtime/reconstruct/run.js +1594 -38
  42. package/dist/core-runtime/reconstruct/seed-candidate-validation.js +29 -0
  43. package/dist/core-runtime/review/continuation-plan.js +160 -0
  44. package/dist/core-runtime/review/execution-plan-boundary.js +123 -0
  45. package/dist/core-runtime/review/materializers.js +8 -3
  46. package/dist/core-runtime/review/pipeline-execution-ledger.js +250 -0
  47. package/dist/core-runtime/review/review-artifact-utils.js +15 -2
  48. package/dist/core-runtime/review/review-invocation-runner.js +604 -0
  49. package/dist/core-runtime/target-material-kind.js +43 -5
  50. package/dist/mcp/server.js +289 -59
  51. package/dist/mcp/tool-schemas.js +28 -2
  52. package/package.json +4 -2
  53. package/.onto/domains/llm-native-development/competency_qs.md +0 -430
  54. package/.onto/domains/llm-native-development/concepts.md +0 -242
  55. package/.onto/domains/llm-native-development/conciseness_rules.md +0 -163
  56. package/.onto/domains/llm-native-development/dependency_rules.md +0 -216
  57. package/.onto/domains/llm-native-development/domain_scope.md +0 -197
  58. package/.onto/domains/llm-native-development/extension_cases.md +0 -474
  59. package/.onto/domains/llm-native-development/logic_rules.md +0 -123
  60. package/.onto/domains/llm-native-development/prompt_interface.md +0 -49
  61. package/.onto/domains/llm-native-development/structure_spec.md +0 -245
@@ -1,242 +0,0 @@
1
- ---
2
- version: 2
3
- last_updated: "2026-03-30"
4
- source: manual
5
- status: established
6
- ---
7
-
8
- # LLM-Native Development Domain — Concept Dictionary and Interpretation Rules
9
-
10
- Classification axis: **normative system** — classification by the standard/protocol/framework layer to which each term belongs.
11
-
12
- ## Normative System Classification
13
-
14
- Standards and frameworks in LLM-powered system development form a layered system. Each layer builds on the one below it.
15
-
16
- ### Layer 1 — Foundation Protocols
17
-
18
- These define how systems communicate with models and with each other.
19
-
20
- - HTTP/REST = the transport protocol for model API calls. All commercial LLM providers expose REST endpoints
21
- - MCP (Model Context Protocol) = an open protocol standardizing how agents connect to external tools, data sources, and resources. Three primitives: Tools (executable functions), Resources (read-only data), Prompts (reusable prompt templates)
22
- - A2A (Agent-to-Agent Protocol) = a protocol for inter-agent communication. Complements MCP (agents↔tools) by connecting agents↔agents
23
- - OpenAPI / JSON Schema = specification formats for API contracts and structured data shapes. Used for function calling schemas and MCP tool parameter definitions
24
-
25
- ### Layer 2 — Framework Abstractions
26
-
27
- Reusable libraries and SDKs that wrap Layer 1 protocols into higher-level programming constructs.
28
-
29
- - OpenAI SDK / Anthropic SDK = vendor-specific client libraries for model API access. Handle authentication, request formatting, streaming, retry logic
30
- - LangChain = a framework providing abstractions for chains (sequential prompt pipelines), agents (decision loops), and memory (conversation state)
31
- - LlamaIndex = a framework for data ingestion, chunking, indexing, and querying. Specialized for RAG pipeline construction
32
- - LangGraph = a framework for stateful multi-step agent workflows as graphs. Adds explicit state management and cycle support to LangChain
33
- - Other notable SDKs: Semantic Kernel (Microsoft, enterprise .NET/Python), Vercel AI SDK (TypeScript, streaming UI)
34
-
35
- ### Layer 3 — Application Patterns
36
-
37
- Architectural patterns that combine Layer 1 and Layer 2 into recognizable system designs.
38
-
39
- - RAG (Retrieval Augmented Generation) = retrieving relevant documents and including them in model context before generation
40
- - Agent = a system where an LLM autonomously decides actions to accomplish a goal, distinguished by the presence of a decision loop
41
- - Chatbot = a conversational interface backed by an LLM, maintaining history across turns
42
- - Code Assistant = an LLM-powered system that reads, generates, and modifies code
43
- - Multi-Agent System = multiple specialized agents collaborating with distinct roles, tools, and instructions
44
-
45
- ### Layer 4 — Quality and Governance
46
-
47
- Frameworks for measuring, controlling, and governing LLM system behavior.
48
-
49
- - Evaluation Frameworks = tools for measuring LLM output quality (RAGAS, DeepEval, Promptfoo)
50
- - EU AI Act = EU regulation classifying AI systems by risk level with corresponding obligations
51
- - NIST AI RMF = voluntary framework for managing AI risks. Four functions: Govern, Map, Measure, Manage
52
- - Model Cards / System Cards = standardized documentation for model capabilities, limitations, and evaluation results
53
-
54
- ### Relationship Between Layers
55
-
56
- Layer 1 defines communication. Layer 2 wraps it into developer abstractions. Layer 3 combines abstractions into architectures. Layer 4 governs quality and safety. Skipping Layer 1 understanding makes debugging opaque; ignoring Layer 4 leaves quality and compliance unmanaged.
57
-
58
- ## Model Integration Terms
59
-
60
- - Model = a trained neural network that generates text given input. In this domain, unqualified "model" refers to an LLM
61
- - API Endpoint = a URL that accepts model requests and returns responses
62
- - Model Routing = directing requests to different models based on task complexity, cost, or latency requirements
63
- - Fallback = switching to an alternative model when the primary is unavailable or rate-limited
64
- - Model Version = a specific model release (e.g., `gpt-4-0613`, `claude-3-5-sonnet-20241022`). Version pinning prevents unexpected behavior changes
65
- - Inference = the process of generating output from input. All API-based model usage is inference (vs. training, which updates weights)
66
- - Latency = time between request and complete response. For streaming, Time to First Token (TTFT) measures perceived responsiveness
67
- - Throughput = requests or tokens processed per unit time
68
- - Token = the atomic text unit a model processes. Subword units whose boundaries are model-specific. ~1.3 tokens per English word; non-Latin scripts require more
69
- - Context Window = maximum tokens a model processes per request (input + output). Ranges from 4K to 1M+
70
- - Rate Limit = maximum requests or tokens per time period (RPM, TPM, RPD). Exceeding returns HTTP 429
71
- - Model Capability = what a model can do (text generation, tool use, vision, structured output). Varies by model and version
72
-
73
- ## Prompt & Context Design Terms
74
-
75
- - System Prompt = instructions defining model role, behavior constraints, and output format. Processed before user input
76
- - User Prompt = the end-user's input, combined with system prompt and retrieved context
77
- - Few-Shot Prompting = including example input-output pairs to demonstrate desired behavior. Contrasted with zero-shot (no examples)
78
- - Zero-Shot Prompting = providing instructions without examples, relying on pre-trained knowledge
79
- - Chain-of-Thought (CoT) = instructing the model to show intermediate reasoning steps before the final answer. Variants: zero-shot CoT ("think step by step"), few-shot CoT (with reasoning examples)
80
- - Instruction Hierarchy = precedence order when instructions conflict: system prompt > developer instructions > user prompt
81
- - Structured Output = constraining output to a specific format (JSON, XML). Implemented via JSON mode, response format schemas, or tool call schemas
82
- - JSON Mode = a model feature guaranteeing valid JSON output. Does not guarantee schema conformance
83
- - Tool Call Schema = a JSON Schema describing a tool's name, description, and parameters for the model's function calling
84
- - Token Budget = maximum tokens allocated to a specific prompt component, preventing any single part from consuming the entire context window
85
- - Context Rot = output quality degradation when the context contains too much irrelevant or contradictory information
86
- - Prompt Template = a reusable prompt structure with variable placeholders filled at runtime
87
- - Prompt Injection (as input design concern) = the risk that user input overrides the system prompt. Addressed via input validation, delimiters, and instruction hierarchy
88
-
89
- ## Retrieval & Knowledge Systems Terms
90
-
91
- - RAG (Retrieval Augmented Generation) = augmenting model input with retrieved external information. Three stages: indexing, retrieval, generation. A pattern, not a product — implementation varies widely
92
- - Chunking = splitting documents into segments for indexing. Strategies: fixed-size, semantic, recursive, sentence-level. Chunk size affects retrieval precision and recall
93
- - Embedding = a dense vector representation capturing semantic meaning, generated by embedding models. Similar texts → similar vectors. Model-specific and not interchangeable across models
94
- - Vector Database = a database for storing and querying high-dimensional vectors via approximate nearest neighbor (ANN) search. Examples: Pinecone, Weaviate, Chroma, pgvector
95
- - Semantic Search = finding documents by meaning similarity using embedding distance metrics (cosine similarity, dot product)
96
- - Hybrid Search = combining keyword search (BM25) with semantic search to leverage strengths of both
97
- - Reranking = second-pass scoring of retrieved documents using a cross-encoder model for improved relevance
98
- - Knowledge Base = a structured information collection for LLM retrieval. Distinguished from the model's parametric knowledge (learned during training)
99
- - Knowledge Graph = a structured representation of entities and relationships, providing relational knowledge that complements unstructured retrieval
100
- - File=Concept = each file defines exactly one concept; each concept is defined in exactly one file. Applies to concept files, not meta files
101
- - Frontmatter = YAML metadata block at file top. Source of truth is structure_spec.md
102
- - Navigation Index = a meta file (INDEX.md) summarizing directory contents and each file's role
103
- - System Map = a meta file (ARCHITECTURE.md, llms.txt) representing entire system structure as one document
104
- - llms.txt = a specification for LLM-friendly project information, placed at project root
105
- - CLAUDE.md = a project-level instruction file for Claude-based tools. Analogous to .cursorrules (Cursor), AGENTS.md (general AI agents)
106
- - Persistent Memory = information storage persisting across agent sessions. Distinguished from conversation history (single-session) and model parameters (training-time)
107
-
108
- ## Agentic Systems Terms
109
-
110
- - Agent = an LLM-powered system that autonomously decides actions. Core loop: observe → think → act. Distinguished from prompt-response by iterative decision-making
111
- - Tool = a function an agent invokes to interact with external systems. Defined by schema (name, description, parameters)
112
- - Tool Use = the model generating structured tool call requests (function name + arguments). The model does not execute tools — application code does
113
- - ReAct (Reasoning + Acting) = alternating reasoning steps and action steps, with the reasoning trace in context for self-correction
114
- - Planning = decomposing a complex goal into steps. May be explicit (output a plan) or implicit (decide next step from current state)
115
- - Reflection = the agent evaluating its own outputs and deciding to revise, retry, or proceed
116
- - Workflow = a predefined sequence of LLM calls where control flow is code-determined, not model-determined. Patterns: prompt chaining, routing, parallelization, orchestrator-workers, evaluator-optimizer
117
- - Orchestration = coordinating multiple LLM calls, tool invocations, or agent interactions. Code-driven (workflows) or model-driven (agents)
118
- - Multi-Agent = multiple agents with distinct roles collaborating. Patterns: orchestrator-specialist, peer-to-peer, hierarchical
119
- - MCP Server / MCP Client = server exposes tools/resources/prompts; client discovers and invokes them. One agent can connect to multiple servers
120
- - ACI (Agent-Computer Interface) = designing tools optimized for agent use: clear descriptions, specific parameters, structured results, informative errors
121
- - Agent State = information maintained during execution: goal, completed steps, pending actions, results
122
- - Scratchpad = working memory for intermediate reasoning and partial results, included in model context
123
- - Long-Running Agent = a task spanning multiple sessions. Requires context compaction, progress persistence, and continuity strategies
124
- - Context Compaction = summarizing accumulated context to fit the window. Techniques: summarization, selective retention, structured progress notes
125
- - Sub-Agent = an agent spawned by a parent to handle a delegated sub-task, returning results to the parent
126
-
127
- ## Evaluation & Testing Terms
128
-
129
- - Evaluation (Eval) = systematic measurement of LLM output quality. Non-deterministic outputs require different methods than traditional testing
130
- - Golden Set = curated input-output pairs representing expected behavior, used as a regression suite
131
- - AI-as-Judge = using an LLM to evaluate another LLM's output on criteria like relevance, accuracy, safety
132
- - Comparative Evaluation = evaluating system variants side-by-side on the same inputs. Less susceptible to absolute scoring biases
133
- - A/B Testing = deploying variants to real users and measuring outcomes from actual interactions
134
- - Regression Testing = re-running a golden set after changes to detect quality degradation
135
- - Benchmark = standardized test suite for model capabilities (MMLU, HumanEval, MATH). Useful for model selection, not application-specific quality
136
- - Hallucination = fluent, confident output that is factually incorrect or unsupported by context. Intrinsic (contradicts source) vs extrinsic (unverifiable)
137
- - Faithfulness = degree to which output is supported by provided context. RAG-specific metric
138
- - Relevance = degree to which output addresses the user's question. Distinct from faithfulness
139
- - HITL (Human-in-the-Loop) = incorporating human judgment for evaluation, feedback, or high-stakes approval
140
- - Evaluation Pipeline = automated system for test case management, scoring, result aggregation, and trend tracking
141
-
142
- ## Safety & Alignment Terms
143
-
144
- - Prompt Injection = malicious input causing the model to ignore its system prompt. Direct (attacker input) or indirect (embedded in retrieved documents)
145
- - Jailbreak = bypassing a model's training-time safety constraints. Targets the model itself, not the application's system prompt
146
- - Guardrail = runtime validation of inputs/outputs against rules. Input guardrails filter prompts; output guardrails filter responses
147
- - Content Policy = rules defining what the system should and should not generate
148
- - Output Filtering = post-processing to detect and handle policy-violating content
149
- - Red Teaming = adversarial testing to identify vulnerabilities standard testing misses
150
- - Alignment = degree to which model behavior matches intended purpose and values
151
- - PII (Personally Identifiable Information) = data identifying an individual. Must be detected and handled in both inputs and outputs
152
- - Responsible AI = developing AI that is fair, transparent, accountable, safe, and privacy-preserving
153
- - EU AI Act = EU regulation with risk-tier classification. High-risk systems require conformity assessments, documentation, and human oversight
154
-
155
- ## Production Operations Terms
156
-
157
- - Observability = understanding system state from external outputs. Three pillars: logs, metrics, traces
158
- - Logging = recording LLM call details (prompts, outputs, model version, tokens, latency, errors)
159
- - Tracing = tracking a single request through all processing steps for end-to-end debugging
160
- - Monitoring = real-time observation of latency, error rates, throughput, token consumption, and cost
161
- - Cost Tracking = measuring model usage cost from token consumption × per-token pricing. Often the largest variable cost
162
- - Latency Management = reducing response time via model selection, caching, streaming, prompt optimization, batching
163
- - Quality Drift = gradual output quality degradation from model updates, knowledge staleness, or changing user patterns
164
- - Feedback Loop = collecting user signals (ratings, edits, regenerations) for system improvement
165
- - Incident Response = handling LLM-specific failures (outages, safety violations, quality degradation, cost spikes)
166
- - Deployment Strategy = rolling out changes via blue-green, canary, or shadow deployment with evaluation gates
167
- - Caching = reusing results at two levels: response-level (identical inputs) and KV cache (prompt prefix reuse, exposed as "prompt caching")
168
-
169
- ## Data & Model Adaptation Terms
170
-
171
- - Fine-Tuning = further training a pre-trained model on task-specific data. Modifies weights, unlike prompting
172
- - Full Fine-Tuning = updating all parameters. Highest quality but most compute; risks catastrophic forgetting
173
- - LoRA (Low-Rank Adaptation) = parameter-efficient fine-tuning via low-rank matrices added to frozen model weights
174
- - QLoRA = LoRA on a quantized base model. Enables fine-tuning large models on consumer hardware
175
- - Adapter = a small trainable module inserted into a pre-trained model. Multiple adapters can be swapped at inference time
176
- - RLHF (Reinforcement Learning from Human Feedback) = training via human preference rankings → reward model → policy optimization
177
- - DPO (Direct Preference Optimization) = optimizing directly on preference pairs without a separate reward model
178
- - RLAIF (Reinforcement Learning from AI Feedback) = using AI instead of humans for the feedback signal
179
- - Dataset Engineering = curating, cleaning, and constructing training datasets. Data quality often matters more than model architecture
180
- - Data Curation = selecting training examples by relevance, diversity, accuracy, and representativeness
181
- - Data Augmentation = creating training examples by transforming existing ones (paraphrasing, back-translation, substitution)
182
- - Synthetic Data = training data generated by AI. Scalable and privacy-safe but risks bias amplification
183
- - Distillation = training a smaller "student" model to reproduce a larger "teacher" model's outputs
184
- - Model Compression = reducing model size via quantization, pruning, or distillation
185
-
186
- ## Cross-Cutting Terms
187
-
188
- These span multiple sub-areas and are not attributed to any single area.
189
-
190
- - CI/CD for LLM Systems = CI/CD adapted for LLM applications: evaluation gates, prompt regression testing, model version tracking, non-deterministic output validation
191
- - Version Management = tracking prompt versions, tool schema versions, model versions, and agent configuration versions independently
192
- - Experiment Tracking = recording experiment configurations and metrics. Attribution: model parameters → Area 8, output quality → Area 5, infrastructure → Area 7
193
- - Prompt Versioning = managing prompt template evolution with identifiers, changelogs, and rollback capability
194
- - Spec-First Development = defining specifications (tool schemas, prompt templates, evaluation criteria) before implementation
195
-
196
- ## Domain Inheritance
197
-
198
- This domain inherits from `software-engineering/concepts.md`. Inherited terms follow the parent domain's definitions unless explicitly redefined.
199
-
200
- | Term | Parent Domain Definition | This Domain Redefinition | Change Scope |
201
- |------|------------------------|-------------------------|-------------|
202
- | model | domain model (business object) | LLM model (trained neural network for text generation) | Default meaning changed — unqualified "model" refers to LLM |
203
- | agent | software agent (generic autonomous program) | LLM agent (autonomous system with LLM-driven decision loop) | Narrowed to LLM-powered agents |
204
- | token | authentication token / API key | LLM token (subword unit processed by a model) | Default meaning changed — unqualified "token" refers to LLM token |
205
- | pipeline | CI/CD pipeline | May refer to CI/CD, RAG, or evaluation pipeline. Qualification required | Ambiguity introduced — context-dependent |
206
- | context | execution context (runtime state) | Context window content (information provided to the model) | Default meaning changed — unqualified "context" refers to model input |
207
- | memory | system memory (RAM) | Agent memory (information persisted across interactions) | Default meaning changed — unqualified "memory" refers to agent memory |
208
-
209
- ## Homonyms Requiring Attention
210
-
211
- - "context": context window (model input capacity) ≠ execution context (runtime state) ≠ bounded context (DDD) ≠ React context
212
- - "model": LLM model (neural network) ≠ domain model (DDD business objects) ≠ data model (schema) ≠ ML model (non-LLM)
213
- - "agent": LLM agent (autonomous, LLM decision loop) ≠ software agent (generic) ≠ user agent (browser HTTP header)
214
- - "embedding": vector embedding (dense semantic vector) ≠ UI embedding (iframe) ≠ word embedding (static, Word2Vec/GloVe)
215
- - "token": LLM token (subword unit) ≠ authentication token (JWT, OAuth) ≠ API token (API key)
216
- - "prompt": system prompt ≠ user prompt ≠ prompt template ≠ MCP prompt primitive
217
- - "memory": agent memory (cross-session) ≠ system memory (RAM) ≠ persistent storage (disk/DB) ≠ conversation history (single-session)
218
- - "tool": MCP tool (agent-invocable function) ≠ development tool (IDE, linter) ≠ CLI tool
219
- - "chain": prompt chain (sequential LLM calls) ≠ blockchain ≠ certificate chain (TLS) ≠ LangChain chain
220
- - "index": navigation index (INDEX.md) ≠ database index (B-tree) ≠ vector index (HNSW, IVF) ≠ array index
221
- - "alignment": model alignment (values/behavior) ≠ text alignment (visual) ≠ ontology alignment (concept mapping)
222
-
223
- ## Interpretation Principles
224
-
225
- These principles apply when interpreting terms and concepts within this domain.
226
-
227
- - Model capability descriptions are version-specific. Documentation must specify the model version when describing capabilities
228
- - Prompt engineering patterns are empirical, not universal. A technique effective for one model may degrade performance on another
229
- - "RAG" is a pattern, not a product. Two "RAG systems" may share almost no implementation details
230
- - Agent architecture patterns (ReAct, planning, reflection, tool use) are not mutually exclusive. They are composable design techniques
231
- - Token counts are model-specific because tokenization differs between models. Budget calculations must use the target model's tokenizer
232
- - "Best practices" in LLM development have short half-lives. Treat patterns as empirically validated, not permanent
233
- - The boundary between prompting and fine-tuning is a design decision based on cost, quality, data, and latency — not task nature
234
- - Evaluation results are meaningful only relative to the evaluation methodology
235
- - LLM architecture decisions are interdependent. Changing retrieval (Area 3) affects prompt design (Area 2) and evaluation (Area 5)
236
-
237
- ## Related Documents
238
- - domain_scope.md — the scope definition where these terms are used, including sub-area membership criteria
239
- - structure_spec.md — source of truth for frontmatter specifications and structural rules
240
- - dependency_rules.md — details of reference chains and direction rules
241
- - prompt_interface.md — prompt/interface design criteria
242
- - competency_qs.md — questions these concepts must be able to answer
@@ -1,163 +0,0 @@
1
- ---
2
- version: 2
3
- last_updated: "2026-05-27"
4
- source: manual
5
- status: established
6
- ---
7
-
8
- # Conciseness Rules (llm-native-development)
9
-
10
- This document contains the domain-specific rules that conciseness references during conciseness verification.
11
- It is organized in the order of **type (allow/remove) → verification criteria → role boundaries → measurement method**.
12
-
13
- ---
14
-
15
- ## 1. Allowed Duplication
16
-
17
- Each rule is tagged with a strength level:
18
- - **[MUST-ALLOW]**: Duplication that breaks the system if removed. Must be retained.
19
- - **[MAY-ALLOW]**: Duplication retained for convenience. Can be consolidated, but only remove when the benefit clearly outweighs the consolidation cost.
20
-
21
- ### Traceability
22
-
23
- - [MUST-ALLOW] Spec → implementation → frontmatter triple traceability — spec.md requirements, implementation file code, and frontmatter metadata express the same facts in different formats. The three layers have different consumers (human judgment, LLM generation, machine verification), so removing any one severs the traceability chain.
24
- - [MUST-ALLOW] AI/human dual consumption paths — The same information coexists in human-facing documents (README, explanatory text) and LLM-facing structures (frontmatter, system map). Since consumers read differently, consolidation destroys accessibility for one side.
25
-
26
- ### Navigation Structure
27
-
28
- - [MUST-ALLOW] File list overlap between system map (ARCHITECTURE.md) and navigation index (INDEX.md) — The system map provides a structural overview of the whole, while the navigation index provides per-directory detail. Since their purposes and navigation depth differ, independent maintenance is mandatory.
29
- - [MAY-ALLOW] Frontmatter relationship declarations and body "Related Documents" links pointing to the same file. Frontmatter is for machine verification, body links are for human navigation, so retention is acceptable, but there is a risk of inconsistency during updates.
30
-
31
- ### Role and Domain Document (Role-Document Mapping)
32
-
33
- - [MUST-ALLOW] Agent role definitions (roles/*.md) and process document (process.md) role-document mapping both mentioning role names. Role files define judgment criteria while process documents define execution order, so their concerns differ.
34
- - [MAY-ALLOW] The same file (e.g., concepts.md) appearing in multiple agents' domain document lists. If the reference context differs per agent, retain; if it is mere listing, extraction into a common reference is possible.
35
-
36
- ### Learning and Promotion
37
-
38
- - [MAY-ALLOW] The same content temporarily coexisting in project learning items and promoted domain documents. Project learning items should be removed after promotion is complete, but temporary duplication during the promotion process is allowed.
39
-
40
- ### Model and Configuration Overlap (→Area 1, Area 4)
41
-
42
- - [MUST-ALLOW] Model fallback configuration overlap: A model routing table (→Area 1: Model Integration) and an agent tool configuration (→Area 4: Agentic Systems) may both reference the same model identifier. The routing table determines which model to call and when to fall back, while the agent configuration determines which model the agent is permitted to use for tool calls. Different consumers (router vs agent) justify the duplication — removing either breaks its consumer's decision logic.
43
-
44
- ### Safety Defense-in-Depth (→Area 2, Area 6)
45
-
46
- - [MUST-ALLOW] Safety policy in prompt and guardrails: The same safety rule may appear in both the system prompt (→Area 2: Prompt & Context Design) and the output guardrail layer (→Area 6: Safety & Alignment). The system prompt constrains model generation behavior, while the guardrail filters the output post-generation. This is defense-in-depth — removing the prompt-side rule increases the chance of harmful generation; removing the guardrail-side rule removes the safety net when prompt-side prevention fails.
47
-
48
- ### Evaluation and Operations Threshold Overlap (→Area 5, Area 7)
49
-
50
- - [MAY-ALLOW] Evaluation criteria in Area 5 and monitoring thresholds in Area 7: Both reference quality thresholds (e.g., hallucination rate, response relevance score), but they serve different lifecycle stages. Area 5 (Evaluation & Testing) applies thresholds pre-deployment to determine release readiness, while Area 7 (Production Operations) applies thresholds at runtime to trigger alerts and rollback. Consolidation into a shared threshold definition is preferred, but independent maintenance is acceptable when evaluation and operations teams have different update cadences.
51
-
52
- ---
53
-
54
- ## 2. Removal Target Patterns
55
-
56
- Each rule is tagged with a strength level:
57
- - **[MUST-REMOVE]**: Duplication whose existence causes errors or incorrect reasoning.
58
- - **[SHOULD-REMOVE]**: Duplication that is not significantly harmful but adds unnecessary complexity.
59
-
60
- ### Role Duplication
61
-
62
- - [MUST-REMOVE] Duplicate agent definitions for the same role — Agents with different names but identical judgment criteria and assigned documents existing in 2 or more instances. Maintaining both causes verification result conflicts or unnecessary repeated execution.
63
- - [SHOULD-REMOVE] Judgment criteria duplicately described in both role definition files and process documents — The authoritative source for judgment criteria is the role definition file, so process documents should reference only the role name.
64
-
65
- ### Navigation and History Mixing
66
-
67
- - [MUST-REMOVE] Change history mixed into navigation index — INDEX.md describing both file lists (navigation) and change logs (history). Since navigation structure and change history have different update cycles and consumption purposes, separation is mandatory.
68
- - [SHOULD-REMOVE] Change history fields and navigation relationship fields mixed in frontmatter — Frontmatter describes current-state metadata, so change history should be delegated to git or a separate history file.
69
- - [MUST-REMOVE] Backward-compatibility notes, deprecated behavior, migration rationale, and historical alternatives in documents loaded for current execution — These materials inflate model context with non-current behavior. Keep execution context focused on current behavior, current contracts, current authority, and current failure handling; place historical material in isolated archive, deprecated, or development-record paths and link only when a task needs that history.
70
-
71
- ### Structural Duplication
72
-
73
- - [MUST-REMOVE] Multiple paths to the same concept file — The same concept existing in different directories under different filenames. By the "File = Concept" equation, one concept must be expressed as exactly one file.
74
- - [SHOULD-REMOVE] System map structural descriptions that list only information completely identical to individual file frontmatter — The system map should show overviews and relationships; simple copies of frontmatter cause update omissions.
75
-
76
- ### Constraint Duplication
77
-
78
- - [SHOULD-REMOVE] Rules from structure_spec.md re-described in individual file bodies — The authoritative source for constraints is structure_spec.md, so individual files should maintain only references.
79
-
80
- ### Tool and MCP Duplication (→Area 4)
81
-
82
- - [MUST-REMOVE] Same tool defined in multiple MCP servers with identical schemas — The agent cannot deterministically select which server to call, producing inconsistent behavior across invocations. Consolidate into a single MCP server, or differentiate schemas if the tools genuinely serve different purposes.
83
-
84
- ### Prompt Template Duplication (→Area 2, Area 4)
85
-
86
- - [MUST-REMOVE] Same prompt template defined in both Area 2 (prompt design) and Area 4 (agent instructions) — Violates single source of truth. When the template is updated in one location but not the other, the agent's behavior diverges from the intended design. Define the template in one authoritative location and reference it from the other.
87
-
88
- ### Metric Definition Duplication (→Area 5, Area 7)
89
-
90
- - [SHOULD-REMOVE] Same evaluation metric computed in both Area 5 (evaluation pipeline) and Area 7 (monitoring) — Independent definition in both causes definition drift. Consolidate the metric definition (formula, thresholds, labels) into a shared location. Different computation frequencies (batch vs real-time) are acceptable.
91
-
92
- ---
93
-
94
- ## 3. Minimum Granularity Criteria
95
-
96
- A sub-classification is allowed only if it satisfies **one or more** of the following. If none are satisfied, merge with the parent.
97
-
98
- 1. **Competency question difference**: Does it produce a different answer to a question in competency_qs.md? Each of the 8 sub-areas in domain_scope.md is designed to answer distinct competency questions — if two classifications answer the same question set, they are merge candidates.
99
- 2. **Constraint difference**: Do different rules from logic_rules.md or structure_spec.md apply? For example, Area 2 (Prompt & Context Design) has token budget constraints that do not apply to Area 1 (Model Integration), and Area 6 (Safety & Alignment) has compliance-specific constraints absent from Area 5 (Evaluation & Testing).
100
- 3. **Consumer difference**: Do different sub-areas consume the classification? A concept consumed only by Area 3 (Retrieval & Knowledge Systems) and a concept consumed only by Area 4 (Agentic Systems) justify separate classifications even if they appear similar, because their consumers apply different logic.
101
-
102
- Examples:
103
- - `Concept file` and `meta file` have different constraints (frontmatter schema, "File = Concept" equation applicability), so the classification is justified.
104
- - If `navigation index` and `system map` list only the same files in the same directory with no additional information, they are candidates for merging.
105
- - A "model capability assessment" concept consumed by Area 1 (routing decisions) and Area 5 (evaluation benchmarks) may justify separate classifications if the competency questions and constraints differ. If they are identical in both respects, consolidate.
106
-
107
- ---
108
-
109
- ## 4. Boundaries — Domain-Specific Application Cases
110
-
111
- The authoritative source for boundary definitions is `roles/conciseness.md`. This section describes only the specific application cases in the llm-native-development domain.
112
-
113
- ### pragmatics Boundary
114
-
115
- - conciseness: Does an unnecessary element **exist**? (structural level)
116
- - pragmatics: Does unnecessary information **waste** the LLM's context window? (execution level)
117
- - In the 8-area structure: conciseness asks "does this element need to exist in this area?" while pragmatics asks "does including this element in the current execution context consume tokens without contributing to the task?"
118
- - Example: A deprecated concept file remains in the directory → conciseness. A valid but currently unnecessary file is included in the reference chain → pragmatics.
119
- - Example (8-area): Agent config (→Area 4) embeds a full prompt template copy → conciseness. Agent loads unused MCP tool schemas → pragmatics.
120
-
121
- ### coverage Boundary
122
-
123
- - conciseness: Is there something that should not be there? (reduction direction)
124
- - coverage: Is there something missing that should be there? (expansion direction)
125
- - In the 8-area structure: coverage checks whether all 8 areas are represented (missing area = gap), while conciseness checks whether any area contains duplicated or misplaced elements.
126
- - Example: An agent role is defined but has no assigned domain document → coverage. Two agents with identical judgment criteria are defined → conciseness.
127
- - Example (8-area): Area 5 has no evaluation methodology → coverage. Same metric defined in Area 5 and Area 7 with identical formulas → conciseness.
128
-
129
- ### logic Boundary (preceding/following relationship)
130
-
131
- - logic precedes: Determines logical equivalence (implication)
132
- - conciseness follows: Decides whether to remove after equivalence is confirmed
133
- - In the 8-area structure: logic determines whether two rules across areas are logically equivalent; conciseness then decides whether one should be removed. Particularly relevant for cross-area constraints (e.g., Area 6 safety rule implied by Area 2 prompt instruction).
134
- - Example: A rule in structure_spec.md implies a constraint in an individual file body → logic determines equivalence → conciseness determines "re-description in individual file is unnecessary."
135
- - Example (8-area): Area 1 fallback rule "if X fails, use Y" and Area 4 agent config "try X first, then Y" → logic determines equivalence → conciseness removes the non-authoritative copy.
136
-
137
- ### semantics Boundary (preceding/following relationship)
138
-
139
- - semantics precedes: Determines semantic identity (synonym status)
140
- - conciseness follows: Decides whether merging is needed after synonym is confirmed
141
- - In the 8-area structure: semantics identifies when two terms across different areas refer to the same concept; conciseness then decides whether to merge into a single canonical definition with cross-area references.
142
- - Example: system map / architecture overview / structure guide are the same concept → semantics determines synonym → conciseness determines "consolidate into one canonical term."
143
- - Example (8-area): "response quality score" (→Area 5) and "output quality metric" (→Area 7) are the same measurement → semantics determines synonym → conciseness consolidates the definition.
144
-
145
- ---
146
-
147
- ## 5. Quantitative Criteria
148
-
149
- Thresholds observed in this domain are recorded as they accumulate.
150
-
151
- - **Tool overlap threshold** (→Area 4): If 2 tools share >80% of their parameter schemas (measured by field name and type overlap), they are candidates for consolidation. Review whether they serve genuinely different purposes before removing.
152
- - **Prompt template similarity** (→Area 2): If 2 prompt templates differ by <10% of tokens (measured by edit distance / max token count), they should be consolidated into a single parameterized template. The remaining differences should be expressed as template variables.
153
- - (Additional thresholds are accumulated through reviews)
154
-
155
- ---
156
-
157
- ## Related Documents
158
-
159
- - `concepts.md` — Term definitions, synonym mappings, homonym list (semantic criteria for duplication determination)
160
- - `structure_spec.md` — Frontmatter specifications, file type classification (structural-perspective removal criteria)
161
- - `competency_qs.md` — Competency question list (criteria for judging "actual difference" in minimum granularity)
162
- - `dependency_rules.md` — Reference chains and direction rules (basis for allowing reference copies)
163
- - `domain_scope.md` — Domain scope, required concept categories (scope reference for duplication determination)
@@ -1,216 +0,0 @@
1
- ---
2
- version: 2
3
- last_updated: "2026-03-30"
4
- source: manual
5
- status: established
6
- ---
7
-
8
- # LLM-Native Development Domain — Dependency Rules
9
-
10
- Classification axis: **linkage type** — dependencies and connections classified by the type of relationship between components, documents, and sub-areas within LLM-powered systems.
11
-
12
- ## Inter-Area Dependency Map
13
-
14
- The 8 sub-areas defined in domain_scope.md interact through three distinct dependency flows: runtime data flow, design-time constraint flow, and feedback loops.
15
-
16
- ### Runtime Data Flow
17
-
18
- Direction: data flows from left to right toward model output.
19
-
20
- ```
21
- 3 (Retrieval) → 2 (Prompt) → 1 (Model) → output
22
-
23
- 4 (Agentic) ──── tool calls ──────┘
24
- ```
25
-
26
- - Area 3 (Retrieval & Knowledge Systems) produces retrieved content that Area 2 (Prompt & Context Design) assembles into model input
27
- - Area 2 constructs the complete prompt and sends it to Area 1 (Model Integration) for inference
28
- - Area 4 (Agentic Systems) orchestrates multi-step workflows by issuing tool calls through Area 1
29
-
30
- ### Design-Time Constraint Flow
31
-
32
- Direction: constraints propagate from the constraining area to the constrained area.
33
-
34
- ```
35
- 1 (Model capability) → constrains → 2 (Prompt design)
36
- 1 (Model capability) → constrains → 3 (Retrieval chunk size)
37
- 6 (Safety policy) → constrains → 2 (Prompt design), 4 (Agent behavior)
38
- 5 (Eval criteria) → constrains → all areas
39
- ```
40
-
41
- - Area 1 constraints (context window size, supported output formats, tool calling capabilities) determine what Area 2 can construct and what chunk sizes Area 3 must produce
42
- - Area 6 (Safety & Alignment) constraints (content policies, guardrail requirements) restrict what Area 2 may include in prompts and what actions Area 4 agents may take
43
- - Area 5 (Evaluation & Testing) criteria serve as acceptance conditions for all areas — each area's output must satisfy its relevant evaluation criteria
44
-
45
- ### Feedback Loops
46
-
47
- Intentional cycles that exist between areas. Each must have an explicit termination condition.
48
-
49
- **Loop 1 — Evaluation ↔ Adaptation (→Area 5, →Area 8)**:
50
- - Area 5 evaluation results drive fine-tuning decisions in Area 8 (Data & Model Adaptation). The fine-tuned model is then re-evaluated by Area 5.
51
- - Termination: quality threshold met OR training budget exhausted.
52
-
53
- **Loop 2 — Drift Correction (→Area 7, →Area 2, →Area 1)**:
54
- - Area 7 (Production Operations) detects output quality drift → Area 2 adjusts prompts → Area 1 produces new output → Area 7 monitors the result.
55
- - Termination: drift metric returns to baseline.
56
-
57
- **Loop 3 — Production Re-evaluation (→Area 7, →Area 5)**:
58
- - Area 7 production metrics trigger re-evaluation by Area 5. This is one-directional (not a cycle): Area 7 informs Area 5, and Area 5 may then inform other areas independently.
59
-
60
- ## Inter-Area Direction Rules
61
-
62
- Each rule specifies the direction of dependency or data flow between two areas.
63
-
64
- | Source | Target | Relationship | Direction |
65
- |--------|--------|-------------|-----------|
66
- | Area 3 (Retrieval) | Area 2 (Prompt) | Retrieval provides content to prompt construction | 3 feeds 2 |
67
- | Area 2 (Prompt) | Area 1 (Model) | Constructed prompt is sent to model for inference | 2 feeds 1 |
68
- | Area 4 (Agentic) | Area 1 (Model) | Agent issues tool calls through model | 4 orchestrates 1 |
69
- | Area 6 (Safety) | Area 2 (Prompt), Area 4 (Agentic) | Safety constrains prompt design and agent behavior | 6 constrains 2, 4 |
70
- | Area 5 (Evaluation) | All areas | Evaluation criteria are references, not hard dependencies. All areas may be evaluated | 5 evaluates all |
71
- | Area 8 (Adaptation) | Area 1 (Model) | Adapted model replaces or supplements base model | 8 produces, 1 consumes |
72
- | Area 7 (Operations) | Area 5 (Evaluation) | Operational metrics inform evaluation priorities | 7 informs 5 |
73
-
74
- **Reverse direction prohibition**: An area that is fed, constrained, or consumed must not impose structural requirements back on the feeding/constraining area. For example, Area 1 (Model) must not require Area 2 (Prompt) to use a specific prompt template — Area 1 exposes capabilities, and Area 2 decides how to use them.
75
-
76
- ## Diamond Dependencies
77
-
78
- A diamond dependency occurs when two areas share a common upstream dependency, creating a potential for conflicting requirements.
79
-
80
- **Primary diamond: 4 ← {2, 3} ← 1**
81
-
82
- Area 4 (Agentic Systems) depends on both Area 2 (Prompt & Context Design) and Area 3 (Retrieval & Knowledge Systems). Both Area 2 and Area 3 are constrained by Area 1 (Model Integration) — specifically by context window size, supported output formats, and tool calling capabilities.
83
-
84
- Resolution:
85
- - Area 1 constraints propagate independently to Area 2 and Area 3. Each area adapts to model constraints within its own scope.
86
- - Conflict scenario: Area 2 and Area 3 could impose contradictory requirements on each other (e.g., Area 2 demands more token budget for instructions while Area 3 demands more for retrieved content).
87
- - Conflict prevention: The handoff point defined in domain_scope.md (Area 3 boundary ends at retrieval results returned; Area 2 boundary begins at assembling results into prompt) ensures that token budget allocation is Area 2's responsibility, not Area 3's. Area 3 returns ranked results, and Area 2 decides how many to include.
88
-
89
- ## Truth Source Hierarchy
90
-
91
- When multiple sources provide guidance on the same LLM system design question, defer to the higher-priority source. Document any conflict and its resolution.
92
-
93
- | Priority | Source | Authority Level |
94
- |----------|--------|----------------|
95
- | 1 | Model provider documentation (Anthropic, OpenAI, Google) | Normative for model behavior. Defines what the model can and cannot do, correct API usage, and known limitations. Overrides all other sources for model-specific questions |
96
- | 2 | Protocol specifications (MCP, A2A, OpenAPI for tool schemas) | Normative for integration interfaces. Defines correct interaction patterns between system components |
97
- | 3 | Framework documentation (LangChain, LlamaIndex, LangGraph) | Informative for implementation. Describes available abstractions and their intended usage. May deviate from model provider guidance — when conflict occurs, defer to level 1 |
98
- | 4 | Community best practices (Chip Huyen "AI Engineering", applied-llms.org, Anthropic/OpenAI guides) | Informative for patterns and trade-offs. Reflects practitioner experience. Follow unless domain-specific requirements dictate otherwise |
99
- | 5 | Internal conventions (project CLAUDE.md, spec.md, onto domain files) | Local authority. Must not contradict levels 1-2. May deviate from levels 3-4 with documented rationale |
100
-
101
- ## External Dependency Management
102
-
103
- LLM-powered systems depend on external services and libraries that evolve independently. Each dependency type has specific management requirements.
104
-
105
- ### Model API Dependencies (→Area 1)
106
-
107
- - Model version must be pinned (e.g., `claude-sonnet-4-20250514`, not `claude-sonnet-4-latest`) in production environments. Unpinned versions may change behavior without warning.
108
- - Breaking changes include: model deprecation, behavior changes between versions, pricing changes, rate limit changes, API format changes.
109
- - Each breaking change requires a migration plan: (1) identify affected components, (2) evaluate replacement model, (3) run Area 5 evaluation suite against new model, (4) deploy with canary rollout via Area 7.
110
-
111
- ### MCP Server Dependencies (→Area 4)
112
-
113
- - Server availability must be checked at startup. The agent must define fallback behavior for each MCP server: degrade gracefully (skip tools provided by the unavailable server) or halt (if the server provides critical capabilities).
114
- - MCP server schema changes (tool parameter changes, new required fields) are breaking changes. Pin server versions or validate schemas at connection time.
115
-
116
- ### Embedding Model Dependencies (→Area 3)
117
-
118
- - Changing the embedding model requires re-indexing all stored vectors. Embeddings from different models are incompatible — cosine similarity between vectors from different models is meaningless.
119
- - Migration path: (1) generate new embeddings with new model, (2) run parallel retrieval evaluation (Area 5) comparing old and new, (3) switch over atomically, (4) delete old embeddings after validation period.
120
-
121
- ### Framework Dependencies (→Area 4, →Area 3)
122
-
123
- - LangChain/LlamaIndex/LangGraph version upgrades may change default behavior (e.g., default prompt templates, default chunking strategies, default retrieval parameters).
124
- - Pin framework versions in production. Upgrade only with explicit testing against Area 5 evaluation suite.
125
- - When a framework abstracts away model-specific behavior, verify that the abstraction matches model provider documentation (Truth Source Hierarchy level 1 takes precedence over level 3).
126
-
127
- ## Metric Ownership Rules
128
-
129
- Each metric in the system has exactly one owning area. Ownership is determined by the measurement purpose, not by the observable being measured.
130
-
131
- | Measurement Purpose | Owning Area | Example Metric |
132
- |---|---|---|
133
- | Is the output correct? | Area 5 (Evaluation) | Answer accuracy, hallucination rate, factual grounding score |
134
- | Is the output safe? | Area 6 (Safety) | Prompt injection detection rate, PII leak rate, policy violation rate |
135
- | Is the system running efficiently? | Area 7 (Operations) | Response latency (p50/p95/p99), error rate, cost per request, throughput |
136
- | Is the model training improving? | Area 8 (Adaptation) | Training loss, validation accuracy, fine-tuning convergence rate |
137
-
138
- **Shared observable rule**: When the same observable (e.g., response latency) is measured for different purposes, each purpose's metric is owned by the respective area. For example:
139
- - Response latency measured for SLA compliance → Area 7 (operational concern)
140
- - Response latency measured as part of model comparison evaluation → Area 5 (evaluation concern)
141
-
142
- These are distinct metrics with distinct owners, even though they measure the same underlying quantity.
143
-
144
- ## Document Reference Direction Rules
145
-
146
- ### Inter-Layer Direction
147
-
148
- - System Map (ARCHITECTURE.md) → Directory Index (INDEX.md) → Individual Documents: Allowed (upper → lower)
149
- - Individual Documents → System Map: Prohibited (lower → upper reversal). Individual documents may reference only same-level or lower-level documents
150
-
151
- ### Inter-Type Direction
152
-
153
- - Domain Documents → Process Documents: Prohibited. Domain must be independent of process
154
- - Process Documents → Domain Documents: Allowed. Processes reference domain knowledge
155
- - Role Documents → Domain Documents: Allowed. Roles read domain documents as judgment criteria
156
-
157
- ### Same-Type Document Direction
158
-
159
- - References between domain documents of the same type: Allowed (e.g., concepts.md ↔ structure_spec.md)
160
- - Bidirectional references are allowed, but cycles are prohibited (see Cycle Exception Clause below)
161
- - When referencing, specify in the Related Documents section
162
-
163
- ### Inter-Domain Direction
164
-
165
- - Child (specialized) domain → Parent (general) domain: Allowed (e.g., llm-native-development → software-engineering)
166
- - Parent (general) domain → Child (specialized) domain: Prohibited
167
- - Same-level domains: Bidirectional allowed, but cycles are prohibited
168
-
169
- ## Acyclicity
170
-
171
- - Document reference graph: Acyclic (DAG). Circular references cause LLM infinite traversal
172
- - Inter-area dependency graph: Acyclic except for declared feedback loops (see Feedback Loops above). Each declared loop must specify a termination condition
173
-
174
- ### Cycle Exception Clause
175
-
176
- - Bidirectional references (A↔B) arising from the bidirectional relationship requirement in logic_rules.md are cycle exceptions
177
- - When cycles are unavoidable: Declare bidirectional references in frontmatter and record the reference reason
178
- - Cycle exceptions allow only 2-node bidirectional (A↔B). 3-node or more cycles (A→B→C→A) are prohibited
179
- - Feedback loops between areas (declared in Inter-Area Dependency Map) are also cycle exceptions, but must include termination conditions
180
-
181
- ## Referential Integrity
182
-
183
- - File paths listed in frontmatter's `depends_on`, `related_to`, etc. must actually exist
184
- - When a file is moved/deleted, all frontmatter referencing that file must be updated
185
- - Files registered in INDEX.md must actually exist in that directory
186
-
187
- ### Inheritance Referential Integrity
188
-
189
- - The parent domain path referenced in concepts.md's inheritance declaration must actually exist
190
- - Terms in the redefinition list must actually exist in the parent domain's concepts.md
191
- - When parent domain terms are deleted/moved, child domain inheritance declarations must also be updated
192
-
193
- ### Source of Truth for Redefinitions
194
-
195
- - Terms in the redefinition list: The child domain (llm-native-development) is the source of truth
196
- - Terms not in the redefinition list: The parent domain (software-engineering) is the source of truth
197
-
198
- ## Duplication Prevention Rules
199
-
200
- - If the same content exists in 2 or more files → Designate one as the source of truth and replace others with references
201
- - If similar concepts are defined under different names in separate files → Consolidate or explicitly state the differences
202
- - If INDEX.md descriptions and the original file's frontmatter description conflict → Frontmatter takes precedence (INDEX.md is the source of truth for file existence and location; frontmatter is the source of truth for file attributes)
203
-
204
- ## Change Atomicity
205
-
206
- - When a single concept change spans multiple files, all changes are reflected in a single commit
207
- - The unit of change is "one concept addition/modification/deletion"
208
- - When an inter-area dependency changes (e.g., Area 1 model capability changes that affect Area 2 prompt design and Area 3 chunk size), all affected areas' documentation must be updated atomically
209
-
210
- ## Related Documents
211
-
212
- - domain_scope.md — Sub-area definitions, membership criteria, and handoff points referenced by inter-area rules
213
- - logic_rules.md — Bidirectional relationship requirement (linked to this file's cycle exception clause)
214
- - structure_spec.md — Frontmatter specifications, INDEX.md role definition
215
- - concepts.md — Domain inheritance declaration, term definitions
216
- - competency_qs.md — Dependency verification questions