npm - @creativeaitools/agent-wiki - Versions diffs - 2.0.0 - Mend

@creativeaitools/agent-wiki 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (30) hide show

package/AGENT-WIKI-SPEC-v2.md +2584 -0
package/AGENTS.md +314 -0
package/INBOX.md +19 -0
package/LICENSE +21 -0
package/ONBOARD.md +373 -0
package/README.md +429 -0
package/WIKI.md +706 -0
package/_system/config.example.json +105 -0
package/dist/src/catalog.js +66 -0
package/dist/src/cli.js +330 -0
package/dist/src/compile.js +104 -0
package/dist/src/config.js +84 -0
package/dist/src/lifecycle.js +171 -0
package/dist/src/migrate.js +26 -0
package/dist/src/onboard.js +159 -0
package/dist/src/page.js +188 -0
package/dist/src/registry.js +74 -0
package/dist/src/schedule-prompts.js +74 -0
package/dist/src/upgrade.js +215 -0
package/dist/src/wiki-utils.js +112 -0
package/dist/src/workspace.js +198 -0
package/package.json +54 -0
package/skills/compile-wiki/SKILL.md +140 -0
package/skills/extract-knowledge-primitives/SKILL.md +350 -0
package/skills/import-link/SKILL.md +101 -0
package/skills/import-link/config.json +12 -0
package/skills/process-inbox/SKILL.md +255 -0
package/skills/process-workspace-sources/SKILL.md +127 -0
package/skills/update-overview/SKILL.md +140 -0
package/skills/write-synthesis/SKILL.md +154 -0

package/AGENT-WIKI-SPEC-v2.md ADDED Viewed

@@ -0,0 +1,2584 @@
+# Agent Wiki v2 Specification
+Version: 2.0.0
+Last Updated: 2026-06-27
+---
+## 1. Purpose
+This specification defines the **v2 format, rules, and runtime expectations** for an AI-agent-compatible wiki.
+The goal of the system is to make the wiki useful for both:
+- **humans**, who need readable pages, durable notes, summaries, and workflows
+- **agents**, who need stable structure, normalized records, explicit claims, and machine-facing cache artifacts
+This spec merges two requirements:
+1. a **knowledge ontology** that distinguishes entities, concepts, sources, claims, evidence, relationships, contradictions, and questions
+2. a **practical wiki architecture** that works as a markdown-first knowledge system with compile-time normalization in either standalone vault mode or embedded workspace mode
+This document defines the v2 contract for:
+- operating modes
+- folder layout
+- page types
+- frontmatter fields
+- structured claims and evidence
+- relationship representation
+- compile output files
+- dashboard generation
+- freshness and health rules
+- minimum validation rules
+---
+## 2. Design Principles
+The wiki must separate:
+- **things** from **ideas**
+- **claims** from **evidence**
+- **sources** from **summaries**
+- **facts** from **interpretations**
+- **confidence** from **certainty theater**
+- **human-edited content** from **compiled/generated artifacts**
+- **page structure** from **compiled machine caches**
+The wiki is intended to act as:
+- a human-readable knowledge base
+- a belief-tracking layer
+- an agent-friendly context substrate
+- a source-traceable research system
+- a maintenance surface for contradictions, stale content, and open questions
+---
+## 3. Normative Language
+The keywords **MUST**, **MUST NOT**, **SHOULD**, **SHOULD NOT**, and **MAY** in this document are to be interpreted as requirement levels.
+- **MUST**: required for v2 compliance
+- **SHOULD**: strongly recommended unless there is a justified reason not to
+- **MAY**: optional
+---
+## 4. Scope of v2
+v2 is intentionally constrained.
+v2 includes:
+- vault and workspace operating modes
+- lifecycle CLI commands for initialization and health checks
+- page typing
+- structured claims
+- structured evidence
+- aliases
+- relations
+- generated reports
+- machine-facing compile outputs
+v2 does **not** require:
+- a dedicated top-level timeline folder
+- full contradiction pages as primary authoring surfaces
+- a separate metrics/state object system
+- automatic semantic deduplication
+- ontology inference beyond explicit page metadata and claims
+Those can be added in a later version.
+---
+## 5. Knowledge Model
+The system recognizes the following knowledge object types.
+### 5.1 Primary object types
+#### Entity
+A durable thing in the world or system.
+Examples:
+- person
+- organization
+- project
+- product
+- system
+- place
+- event
+- artifact
+- document-as-thing
+#### Concept
+An abstract idea, reusable pattern, definition, method, workflow, runbook, checklist, or operational playbook.
+Examples:
+- principle
+- method
+- workflow pattern
+- theory
+- policy
+- standard
+- abstraction
+- taxonomy definition
+- runbook
+- checklist
+- workflow
+- playbook
+#### Source
+An origin of information.
+Examples:
+- PDF
+- webpage
+- article
+- transcript
+- meeting notes
+- email
+- dataset
+- screenshot
+- raw imported file
+- source bridge page
+#### Claim
+A statement that can be evaluated for support, confidence, freshness, and conflict.
+#### Evidence
+A bounded support, challenge, or context record attached to a claim.
+#### Relationship
+A typed connection between two objects.
+#### Contradiction
+A tracked conflict between claims, sources, definitions, dates, or interpretations.
+#### Question
+An unresolved uncertainty or research gap.
+### 5.2 Secondary object types
+#### Synthesis
+A maintained summary, overview, comparison, timeline, or analysis derived from other pages or sources.
+#### Timeline Event
+A dated event record represented inside an entity page, synthesis page, or compiled cache.
+#### Alias
+An alternate name for a page/object.
+#### Metric / State
+Optional quantitative or stateful information. Not a required first-class authored page type in v2.
+---
+## 6. Operating Modes and Wiki Layout
+Agent Wiki supports two operating modes.
+- **Vault mode**: the wiki root is the primary repository or folder. Source material enters through `_inbox/`, `import-link`, or direct source-page creation. Original inbox files are retained in `raw/` after promotion.
+- **Workspace mode**: the wiki root is embedded inside a larger workspace, normally at `workspace/wiki`. Source candidates may live outside the wiki directory and are discovered by workspace scanning, while deliberate captures may still enter through the wiki's `_inbox/`. Original workspace files stay in place and are referenced by canonical source pages through workspace-relative `originPath` values.
+A v2-compliant wiki MUST have a `wikiType` of `vault` or `workspace`. If `_system/config.json` is absent, tools SHOULD default to `vault` mode for backward compatibility.
+### 6.1 Vault Mode Layout
+A vault-mode wiki SHOULD use the following top-level structure when initialized.
+```text
+<wiki>/
+  AGENTS.md
+  WIKI.md
+  overview.md
+  index.md
+  INBOX.md
+  sources/
+  entities/
+  concepts/
+  claims/
+  syntheses/
+  questions/
+  reports/
+  skills/
+  _inbox/
+  raw/
+  _attachments/
+  _archive/
+  _system/
+    config.example.json
+    cache/
+    indexes/
+    logs/
+```
+Fresh template repositories MAY omit empty runtime/content directories. Initialization tooling and workflows SHOULD create missing directories when they are needed.
+### 6.2 Workspace Mode Layout
+A workspace-mode wiki is stored inside a larger workspace. The default wiki directory is `wiki/`.
+```text
+<workspace>/
+  docs/
+  research/
+  decisions/
+  wiki/
+    AGENTS.md
+    WIKI.md
+    overview.md
+    index.md
+    INBOX.md
+    sources/
+    entities/
+    concepts/
+    claims/
+    syntheses/
+    questions/
+    reports/
+    skills/
+    _attachments/
+    _archive/
+    _system/
+      config.json
+      config.example.json
+      cache/
+      indexes/
+      logs/
+      state/
+```
+Workspace mode MUST include `_inbox/`, `_inbox/trash/`, and `raw/` inside the wiki root for deliberate external captures and notes. Workspace discovery MUST still exclude the wiki directory itself, so these inbox/raw folders are not treated as workspace source candidates.
+Workspace discovery state SHOULD live under `_system/state/` or another deterministic local runtime location inside the wiki root. It is local operational state, not canonical wiki knowledge.
+### 6.3 Required top-level files
+#### `AGENTS.md`
+MUST describe how agents are expected to behave in the wiki.
+Typical contents:
+- editing conventions
+- generated artifact rules
+- compile expectations
+- page ownership expectations
+- naming conventions
+- what agents may or may not rewrite
+#### `WIKI.md`
+MUST describe the wiki schema and editorial rules in human-readable form.
+Typical contents:
+- folder meanings
+- page types
+- claim/evidence rules
+- confidence meanings
+- status vocabularies
+- report meanings
+#### `index.md`
+SHOULD be the deterministic root-level page catalog.
+The file SHOULD be regenerated as a whole by `agent-wiki index` from compiled page metadata. It is not a place for durable human-authored prose; use `README.md`, `WIKI.md`, `ONBOARD.md`, or other root documentation for that.
+#### `overview.md`
+SHOULD be the human-facing landing page for the wiki.
+The file SHOULD provide a long-form narrative overview of the wiki, including a wiki summary and paragraph-form summaries for each active page type. It MAY be AI-authored or AI-maintained, but it is durable orientation prose and SHOULD NOT be regenerated automatically on every compile.
+`overview.md` is not evidence, not a generated report, and not a replacement for compiled caches. Claims in `overview.md` SHOULD be treated as orientation unless they are represented in canonical pages, claims, evidence records, or source pages.
+#### `INBOX.md`
+SHOULD be a short navigation pointer to the durable inbox rules in `WIKI.md` and the operational `process-inbox` skill. It MUST NOT duplicate the full inbox workflow; `WIKI.md` owns lifecycle concepts and the skill owns exact commands.
+### 6.3.1 Optional top-level files
+#### `log.md`
+Operational log entries belong in `_system/logs/log.md`.
+### 6.4 Required directories
+#### `sources/`
+Stores canonical verbatim source pages.
+Large sources MAY be represented by one parent source page and multiple source part pages under `sources/parts/`.
+#### `entities/`
+Stores durable thing pages.
+#### `concepts/`
+Stores concept pages, including workflow, runbook, checklist, and playbook concepts.
+#### `claims/`
+Stores standalone claim pages representing atomic propositions with dedicated evidence tracking.
+#### `syntheses/`
+Stores maintained rollups, analyses, comparisons, summaries, and timeline-style syntheses.
+#### `questions/`
+Stores open question pages.
+#### `reports/`
+Stores generated dashboard pages and maintenance views.
+#### `skills/`
+Stores agent skill definitions at the wiki root so the wiki follows common portable skill conventions. Skills are human-authored operational instructions and supporting files, not authored knowledge pages.
+#### `_inbox/`
+Stores raw files waiting to be promoted into canonical source pages. This folder exists in both vault and workspace mode. Files in `_inbox/` are not canonical source pages and MUST NOT be treated as evidence for claims.
+#### `raw/`
+Stores retained original raw files after inbox promotion. This folder exists in both vault and workspace mode. Files in `raw/` are not canonical source pages and MUST NOT be treated as evidence for claims.
+#### `_attachments/`
+Stores binary assets and attachments referenced by source pages or other pages (PDFs, images, raw files). Created on vault initialization; MAY be empty.
+#### `_archive/`
+Stores deprecated or no-longer-maintained pages that have been removed from active content folders. Created on vault initialization; MAY be empty.
+#### `_system/`
+Stores machine-generated runtime and compile artifacts.
+Sub-directories:
+- `cache/` — compiled artifact outputs (do not hand-edit)
+- `indexes/` — generated index files (do not hand-edit)
+- `logs/` — compile run logs (do not hand-edit)
+- `state/` — local runtime state for deterministic workflows, including workspace source discovery (do not hand-edit)
+Files:
+- `config.example.json` — tracked example for optional local system configuration
+The compile pipeline reads from the wiki and writes to `_system/cache/`, `_system/indexes/`, and `_system/logs/`. Utility commands exposed by `agent-wiki` MAY update deterministic generated catalog pages or scaffold new authored pages when explicitly invoked. The root `skills/` directory is not a compile output and is not scanned for page frontmatter.
+`_system/config.json`, when present, is local operational configuration, not canonical vault knowledge. It SHOULD be ignored by version control and SHOULD NOT be committed to shared template repositories. `_system/config.example.json` SHOULD be tracked when the project wants to document the supported shape of local configuration.
+Local config SHOULD NOT contain secrets, API keys, access tokens, private credentials, or machine-specific state that changes on every run. Detection results such as whether a converter is currently installed SHOULD be checked at runtime rather than stored as durable truth.
+Each skill SHOULD live in its own sub-directory under `skills/`, containing at minimum an instruction file. Deterministic operations SHOULD be exposed through the TypeScript `agent-wiki` CLI instead of bundled Python-era script folders. Example layout:
+```text
+skills/
+  compile-wiki/
+    SKILL.md
+  process-inbox/
+    SKILL.md
+```
+### 6.5 Local system configuration
+`_system/config.json` MAY define local tool policy and command preferences used by deterministic scripts and skills. The file is optional. Tools SHOULD use conservative defaults when it is absent.
+Shared repositories SHOULD track `_system/config.example.json` and ignore `_system/config.json`. Operators MAY create `_system/config.json` by copying the example or by approving an onboarding setup action. Agents MUST NOT write `_system/config.json` unless the operator explicitly approves the local choices to persist.
+Recommended shape:
+```json
+{
+  "schemaVersion": 1,
+  "wikiType": "vault",
+  "pythonCommand": null,
+  "knownVaults": {
+    "my-vault-name": "/absolute/path/to/vault"
+  },
+  "workspace": {
+    "root": null,
+    "wikiDir": "wiki",
+    "scan": {
+      "includeExtensions": [".md", ".markdown", ".txt", ".pdf", ".docx", ".csv", ".json", ".yaml", ".yml"],
+      "excludeDirs": [".git", ".hg", ".svn", ".obsidian", ".venv", "venv", "env", "__pycache__", "node_modules", "dist", "build", "_system", "reports"],
+      "excludeFileGlobs": ["*.lock", "package-lock.json", "pnpm-lock.yaml", "yarn.lock"]
+    }
+  },
+  "conversion": {
+    "enabled": true,
+    "defaultBackend": "auto",
+    "backendOrder": ["pymupdf4llm", "markitdown"],
+    "allowNetwork": false,
+    "allowOcr": false,
+    "allowLlm": false,
+    "allowTranscription": false,
+    "allowHostedDocumentIntelligence": false,
+    "backends": {
+      "pymupdf4llm": {
+        "enabled": true,
+        "command": null,
+        "formats": ["pdf"]
+      },
+      "markitdown": {
+        "enabled": true,
+        "command": "markitdown",
+        "formats": ["pdf", "docx", "pptx", "xlsx", "html", "csv", "json", "xml", "epub"]
+      },
+      "arxiv2md": {
+        "enabled": false,
+        "command": null,
+        "formats": ["pdf"]
+      },
+      "marker": {
+        "enabled": false,
+        "command": null,
+        "formats": ["pdf"]
+      }
+    }
+  }
+}
+```
+`wikiType` MUST be either `vault` or `workspace` when present. Missing `wikiType` SHOULD be interpreted as `vault`.
+`workspace.root` MAY be `null` for vault mode. In workspace mode, it SHOULD be the absolute workspace root when known. `workspace.wikiDir` is the path from the workspace root to the wiki root, defaulting to `wiki`.
+`workspace.scan` MAY define deterministic workspace discovery policy. Discovery policy is local operational policy, not canonical wiki knowledge. Workspace scanning MUST exclude the wiki directory itself and SHOULD exclude source-control, dependency, build, cache, and generated-output directories.
+`knownVaults` is an optional object that maps Obsidian vault names (as registered in the Obsidian app) to their absolute paths on the local file system. When present, agents MAY use this map to resolve `obsidian://` cross-vault links to readable file paths. Keys SHOULD match the vault folder name exactly as Obsidian registers it. Values MUST be absolute paths. This field is local operator configuration and MUST NOT be committed to shared template repositories.
+`pythonCommand` MAY be `null` to use the active environment, `python3`, `python`, or a project-local virtual environment path such as `.venv/bin/python`.
+Each wiki root remains a single Agent Wiki. In vault mode, the repository or selected root is the wiki root. In workspace mode, the wiki root is the configured wiki directory inside a larger workspace. Skills, scripts, and config files MUST read and write wiki content relative to the selected wiki root unless a workspace-mode command explicitly reads source candidates from the workspace root.
+The lifecycle CLI MAY track multiple local Agent Wiki roots through a machine-local registry outside any wiki root, conventionally `~/.config/agent-wiki/registry.json`. Registry entries MUST refer only to Agent Wiki roots created or migrated by the CLI. The registry is local operator state, not canonical wiki knowledge, and MUST NOT be stored inside a wiki. Operators MAY target a registered wiki with `agent-wiki --wiki NAME <command>`.
+Obsidian is an optional editor for the wiki root. Opening the wiki root as an Obsidian vault MUST NOT change where skills or scripts read and write content.
+Configuration SHOULD express operator policy and preferences, not transient detection state. For example, a backend can be enabled in config but still unavailable at runtime if the command or Python package is not installed. Tools SHOULD detect that condition during execution and report it clearly.
+Operating system, platform, and shell detection SHOULD be reported by onboarding probes as runtime environment state and SHOULD NOT be persisted in `_system/config.json` unless a future operator policy field requires an explicit override.
+If `.venv/` is used, it SHOULD be project-local and ignored by version control. Shared template repositories SHOULD NOT require it to exist.
+`.gitignore` SHOULD include `_system/config.json` and `.venv/` so local setup choices and installed packages do not become shared vault content.
+### 6.6 Lifecycle CLI
+The system SHOULD provide an agent-independent lifecycle CLI. The command name MAY be installed as `agent-wiki`, and the package SHOULD support Node/npm-based local development:
+```bash
+npm install
+npm run build
+npm link
+```
+The lifecycle CLI SHOULD support initializing a vault-mode wiki:
+```bash
+agent-wiki init --type vault --root /path/to/wiki
+agent-wiki registry add MyWiki --root /path/to/wiki --type vault
+agent-wiki --wiki MyWiki onboard --check
+```
+It SHOULD support initializing a workspace-mode wiki:
+```bash
+agent-wiki init --type workspace --workspace-root /path/to/workspace --wiki-dir wiki
+agent-wiki registry add MyProject --root /path/to/workspace/wiki --type workspace
+agent-wiki --wiki MyProject onboard --check
+```
+`init` SHOULD create the required content, generated, system, and inbox lifecycle directories for the selected mode. Both vault mode and workspace mode SHOULD create `_inbox/`, `_inbox/trash/`, and `raw/`.
+By default, `init` SHOULD write `_system/config.json` with `schemaVersion`, `wikiType`, and workspace settings appropriate to the selected mode. It SHOULD preserve unrelated existing config fields when updating an existing local config. A `--no-config` flag MAY suppress this for advanced bare-skeleton setup or tests.
+By default, `init` SHOULD copy missing bundled root documentation, root-level `skills/`, package metadata, and `_system/config.example.json` into the wiki. It MUST NOT overwrite existing files. A `--no-template` flag MAY suppress this for advanced bare-skeleton setup or tests.
+The lifecycle CLI SHOULD provide a read-only health check:
+```bash
+agent-wiki doctor --wiki-root /path/to/wiki --type vault
+agent-wiki doctor --wiki-root /path/to/workspace/wiki --type workspace
+agent-wiki --wiki MyWiki doctor
+```
+`doctor` SHOULD verify mode-specific required folders, local config sanity, required template script/skill availability, and whether `wikiType` is valid. It MUST NOT create files, write config, install packages, run conversion, or mutate wiki content. It SHOULD return a non-zero exit code only for errors, not warnings or informational findings.
+The lifecycle CLI SHOULD provide machine-local registry commands:
+```bash
+agent-wiki registry add MyWiki --root /path/to/wiki --type vault
+agent-wiki registry show MyWiki
+agent-wiki registry remove MyWiki
+agent-wiki list
+agent-wiki check --all
+agent-wiki check --all --full
+```
+`agent-wiki list` SHOULD list registered wiki names, types, and paths. `agent-wiki check --all` SHOULD run a light read-only check across registered wikis using `doctor` and the deterministic onboarding summary. `agent-wiki check --all --full` MAY also run compile and index validation and therefore MAY write generated cache/index files.
+The lifecycle CLI SHOULD provide scheduled-agent prompt generation for recurring skill-based maintenance:
+```bash
+agent-wiki schedule prompt process-inbox
+agent-wiki schedule prompt extract-primitives
+agent-wiki schedule prompt update-overview
+agent-wiki schedule prompt process-inbox MyWiki OtherWiki
+agent-wiki schedule prompt update-overview --wiki MyWiki
+```
+Schedule prompt commands MUST print prompts for an external scheduled-agent harness. They MUST NOT execute the skill workflow themselves. By default, schedule prompt commands SHOULD target all registered Agent Wiki roots. Operators MAY target one or more registered wikis by name. Generated prompts SHOULD instruct the scheduled agent to read each wiki's `AGENTS.md` and `WIKI.md`, follow the local skill instructions, log per-wiki failures, and continue to the next wiki.
+### 6.7 Workspace Discovery CLI
+Workspace mode SHOULD provide deterministic discovery commands that operate from the workspace root while storing local state under the wiki root.
+The CLI SHOULD support:
+```bash
+agent-wiki workspace scan --workspace-root /path/to/workspace --wiki-dir wiki --json
+agent-wiki workspace pending --workspace-root /path/to/workspace --wiki-dir wiki --json
+agent-wiki workspace mark-sourced --workspace-root /path/to/workspace --path docs/example.md --source-id source.2026-06-27.document.example --source-path sources/2026-06-27-document-example.md
+```
+Workspace scanning SHOULD identify candidate non-code files outside the wiki directory using deterministic include/exclude rules. It SHOULD report file path, modified time, size, extension, content hash, recommended source type, and any known source-page mapping.
+Workspace discovery MUST NOT semantically read files, create source pages, modify workspace files, move files, delete files, or treat workspace files as canonical evidence. It only reports candidates and records local mapping state.
+After an agent creates a canonical source page for a workspace file, `mark-sourced` MAY record the relationship between the workspace-relative source path and the wiki source page. That mapping is local operational state and SHOULD NOT replace source-page metadata.
+### 6.8 Onboarding probe
+The system SHOULD provide a deterministic onboarding probe at `agent-wiki onboard`.
+The probe SHOULD support:
+```bash
+agent-wiki onboard --check
+```
+It MAY also support a read-only human question helper:
+```bash
+agent-wiki onboard --check --questions
+```
+`agent-wiki onboard --check` SHOULD inspect local environment capabilities and print a structured report. The report SHOULD be suitable for both human review and agent-guided setup.
+The probe SHOULD check:
+- operating system and platform details needed for setup guidance
+- available Python commands, including `python3`, `python`, and `.venv/bin/python`
+- Python versions for available commands
+- whether `.venv/` exists
+- whether `_system/config.json` exists
+- whether `_system/config.json` declares `wikiType`
+- whether `.obsidian/` exists at the wiki root
+- whether mode-specific required runtime/content folders exist
+- whether `import-link` has a local config file and whether it appears configured
+- available local conversion CLI commands, such as `markitdown`, `marker`, and `arxiv2md`
+- importable Python conversion packages under each available Python command, such as `pymupdf4llm`, `markitdown`, and `marker`
+The probe MUST NOT install packages, create virtual environments, write `_system/config.json`, create folders, modify skill config, or mutate vault content when run with `--check`.
+Onboarding decisions SHOULD remain operator-driven. Agents MAY use the probe output to ask the operator a short series of setup questions, then run lifecycle commands or write local config only after the operator has approved those actions.
+`agent-wiki onboard` MAY support an explicit mutating config writer:
+```bash
+agent-wiki onboard --write-config --python-command python3 --conversion disabled
+```
+`--write-config` MAY create or update local `_system/config.json`. It MUST NOT be implied by `--check` or `--questions`. Agents MUST run it only after the operator has approved the specific local choices to persist.
+When `_system/config.example.json` exists, `--write-config` SHOULD start from the example shape and update only approved local policy fields. It SHOULD preserve unrelated existing config fields when updating an existing `_system/config.json`.
+`--write-config` MUST write operator policy and command preferences only. It MUST NOT write transient detection state such as whether a package or command is currently installed. It MUST NOT create `.venv/`, install packages, create folders, modify `skills/import-link/config.json`, or run the compile pipeline.
+The config writer SHOULD require explicit flags for choices that materially change behavior, including:
+- `--python-command <command>` for the preferred Python command
+- `--conversion disabled` to keep inbox conversion disabled
+- `--conversion available-local` to enable conversion using only already installed local backends
+- `--conversion custom` when explicit backend choices or policy flags are supplied
+The supported wiki root for the probe is the current working directory. In workspace mode, callers SHOULD run the probe from the embedded wiki root, not from the workspace root.
+Network, OCR, LLM, transcription, and hosted document-intelligence conversion behavior MUST remain disabled unless explicitly enabled by dedicated flags. The writer SHOULD report exactly which fields were written.
+Onboarding questions SHOULD be compact multiple-choice prompts. The operator should be able to answer quickly with letter choices such as `1A 2B 3A`. Agents SHOULD avoid long open-ended setup questions unless a path, command, or credential must be supplied by the operator.
+Each setup question SHOULD include:
+- a short label
+- two to four lettered choices
+- a clear recommended choice when one exists
+- a one-sentence consequence for each choice
+- a short answer format, such as `Reply with: 1A 2B 3A 4C`
+Question text SHOULD be friendly and operational. It SHOULD describe what the choice will do, not narrate internal implementation details.
+Recommended setup questions include:
+- which Python command to use
+- whether to use or create a project-local `.venv/`
+- whether inbox conversion should be enabled
+- which conversion backend policy to use
+- whether network, OCR, LLM, transcription, or hosted document-intelligence behavior is allowed
+- whether mode-specific missing runtime folders should be created with `agent-wiki init`
+- whether `_system/config.json` should be written
+After core onboarding, agents SHOULD recommend optional Obsidian setup when the operator wants an Obsidian workflow. The recommendation SHOULD be concise and operational:
+1. Open Obsidian.
+2. Click the current vault name at the bottom of the file explorer pane, or use Obsidian's vault switcher if the control is not visible.
+3. Click "Manage vaults..."
+4. Click "Open folder as vault".
+5. Navigate to the wiki root.
+6. Click "Select Folder".
+Opening the wiki root as an Obsidian vault may create local `.obsidian/` settings. `.obsidian/` is local application state and SHOULD be ignored by version control.
+### 6.9 Project development workflow
+Changes to this project SHOULD move from contract to implementation in a consistent order.
+When adding a feature or changing project behavior, the recommended workflow is:
+1. Update this specification.
+2. Update configuration files or configuration templates when the change affects operator policy, defaults, or local setup.
+3. Update deterministic scripts.
+4. Update skill instructions and skill-local support files.
+5. Update root-level Markdown documentation other than this specification.
+Each step SHOULD be skipped when the change does not affect that surface. The specification SHOULD be reviewed first because it defines the contract that configuration, scripts, skills, and root-level documentation implement.
+### 6.10 Deterministic page scaffolding
+The system SHOULD provide a deterministic page scaffolding utility at `agent-wiki create-page`.
+The page scaffolder exists to reduce schema drift when agents create new authored knowledge pages. It is an operational helper, not an authorship engine. It MUST NOT decide what a page means, invent claims, write interpretations, choose evidence, or synthesize source material on its own. The caller remains responsible for supplying the title, page type, subtype where applicable, body prose, source references, claim references, and other semantic fields.
+The scaffolder SHOULD support canonical page types:
+- `source`
+- `entity`
+- `concept`
+- `claim`
+- `question`
+- `synthesis`
+The scaffolder MUST NOT create generated page types such as `index` or `report`.
+For `source` pages, the scaffolder is only a deterministic source-page writer. It MUST support ordinary whole source pages, large-source parent manifest pages, and large-source part pages. It MUST NOT fetch links, capture web pages, convert binary files, move raw files, decide split points, perform OCR, call LLMs, or determine whether a large source should be partitioned. Source-oriented workflows such as `import-link` and `process-inbox` own acquisition, conversion, raw-file lifecycle, large-source partitioning decisions, source segment preparation, and provenance gathering. Those workflows MAY call the scaffolder to write validated canonical source pages once they have prepared the verbatim Markdown body and required source metadata.
+The scaffolder SHOULD provide a command-line interface shaped like:
+```bash
+agent-wiki create-page \
+  --type synthesis \
+  --subtype analysis \
+  --slug large-document-ingestion \
+  --title "Large Document Ingestion" \
+  --body-file /tmp/body.md
+```
+The exact option set MAY evolve, but the script SHOULD support:
+- `--type <pageType>` for the page type.
+- `--subtype <subtype>` for the page-type-specific subtype when applicable, such as `sourceType`, `entityType`, `conceptType`, `claimType`, or `synthesisType`.
+- `--slug <slug>` for the stable filename and ID suffix.
+- `--title <title>` for the page title.
+- `--body-file <path>` for substantive Markdown body prose.
+- `--body <text>` for short body text when shell quoting is safe.
+- repeated reference flags where useful, such as `--source-page <id>`, `--derived-claim <id>`, `--related-page <id>`, or `--tag <tag>`.
+- source-specific flags where useful, such as `--source-url <url>`, `--origin-path <path>`, `--retrieved-at <date>`, `--source-role <whole|parent|part>`, `--source-part <id>`, `--parent-source-id <id>`, `--part-index <n>`, `--part-count <n>`, or `--locator <locator>`.
+- `--dry-run` to print the resolved path and frontmatter without writing.
+- `--no-log` so skills can create several pages and then write one batch log entry.
+For each created page, the scaffolder MUST:
+- resolve wiki paths relative to the wiki root
+- select the correct folder for the requested `pageType`
+- construct the stable `id` using this specification's naming rules
+- create the required frontmatter for the page type using the current date for `createdAt` and `updatedAt`
+- map `--subtype` to the correct page-type-specific field
+- derive the filename from the stable ID using the filename rules in Section 8.2
+- refuse to overwrite an existing file unless a future explicit update mode is specified
+- check for duplicate IDs in existing wiki pages before writing
+- require verbatim Markdown body content for `source` pages
+- validate source role requirements: `whole` source pages stand alone; `parent` source pages carry ordered `sourceParts`; `part` source pages carry `parentSourceId`, `partIndex`, `partCount`, and a stable `locator`
+- require substantive Markdown body prose for `entity`, `concept`, `claim`, `question`, and `synthesis` pages
+- write valid Markdown with YAML frontmatter followed by the supplied body prose
+The scaffolder SHOULD produce predictable, machine-readable console output for success and failure so skills can report results clearly. It SHOULD return a non-zero exit code when validation fails, when the target path already exists, or when the requested ID already exists elsewhere in the vault.
+Skills that use the scaffolder SHOULD still write one operational log entry after the meaningful skill run or change batch through `agent-wiki log`. They SHOULD NOT rely on the scaffolder to log every individual page when multiple pages are created as part of one operation.
+---
+## 7. Folder Semantics
+### 7.1 `sources/`
+`source` pages represent verbatim source material. They are created by the `import-link` and `process-inbox` skills.
+A `source` page SHOULD include:
+- verbatim content (text and images)
+- source metadata
+- attachments (images, pdfs, etc.)
+- retrieval information
+A page in `sources/` MUST have `pageType: source`.
+#### 7.1.1 Large sources
+Large sources SHOULD NOT be stored as one giant markdown body when doing so would make extraction, review, or evidence citation unreliable.
+When captured or converted source text exceeds the large-source threshold, agents SHOULD create:
+- one parent source page for the whole source
+- multiple child source part pages for bounded verbatim text segments
+The parent source page represents the document, transcript, webpage capture, or other source as a whole. It SHOULD include bibliographic metadata, retrieval metadata, attachment references, the retained raw file path when applicable, and a manifest of child source part paths. Its body SHOULD stay short and SHOULD NOT contain the full long-form source text.
+Source part pages are canonical source pages scoped to a deterministic segment of the parent source. They SHOULD contain the verbatim extracted text for that segment, preserve available locators, and point back to the parent source.
+Source part pages SHOULD live under:
+```text
+sources/parts/
+```
+Large-source partitioning SHOULD use deterministic split rules:
+1. Prefer semantic boundaries such as chapters, sections, headings, appendix boundaries, transcript topic blocks, or slide boundaries.
+2. Fall back to page ranges, timestamps, or other stable locators when semantic structure is unavailable.
+3. Keep each part near a target size of 8,000-15,000 words.
+4. Do not exceed 20,000 words in one part unless preserving an indivisible structure requires it.
+5. Merge very small adjacent sections when that preserves meaning and stays within the target size.
+6. Avoid splitting inside tables, code blocks, quoted blocks, or list structures when possible.
+A source SHOULD be partitioned when converted text is larger than roughly 25,000 words or when an agent cannot reliably process the full source in one extraction pass. Tools MAY use token estimates instead of word counts, but the chosen threshold SHOULD be stable and documented.
+Extraction workflows SHOULD process child source part pages, not the parent source body. Evidence SHOULD cite the most specific available source part and locator.
+The parent source page SHOULD use `status: partitioned` while one or more child parts remain `status: unprocessed`. It SHOULD use `status: processed` only after all child parts have been processed or intentionally archived.
+#### 7.1.2 Source conversion
+Raw inbox files MAY be converted to Markdown before source page creation. Conversion is an intake step that produces the text used for canonical `source` pages or source part pages.
+Plain text and Markdown inbox files SHOULD be treated as already prepared source body files. `process-inbox` SHOULD pass those files, or prepared source-part files derived from them, to `agent-wiki create-page` with `--body-file` rather than copying the body into `--body`. This preserves formatting, avoids shell quoting failures, and lets the scaffolder validate canonical `source` pages from file-backed body content.
+In vault mode, original raw files SHOULD be retained in `raw/` after successful inbox promotion. Raw files in `_inbox/` or `raw/` are not canonical evidence. The converted source page or source part page is the canonical evidence surface.
+In workspace mode, original workspace files MUST remain in place. A workspace source page SHOULD use `originPath` to point to the workspace-relative file path. Workspace files outside the wiki directory are discovery inputs until promoted into canonical source pages; they MUST NOT be treated as canonical evidence merely because discovery reported them.
+Conversion tools SHOULD preserve document structure and stable locators when available, including headings, page ranges, slide numbers, table boundaries, timestamps, and section paths. When a source is partitioned, partition locators SHOULD use the most specific stable locator available from the conversion output.
+Conversion behavior SHOULD be deterministic:
+1. Tools SHOULD use a stable backend order for automatic conversion.
+2. Tools MUST NOT install converters, model dependencies, or system packages during a skill run.
+3. Tools MUST NOT call network, cloud OCR, LLM, transcription, or hosted document-intelligence services unless the operator explicitly requests or configures that behavior.
+4. If no configured local conversion path exists, the source candidate MUST remain where it is and the failure reason SHOULD be reported to the operator. In vault mode that means the raw file remains in `_inbox/`; in workspace mode that means the workspace file remains untouched.
+5. If conversion succeeds but produces warnings or degraded output, those warnings SHOULD be recorded in source metadata.
+Automatic conversion tools SHOULD read local policy from `_system/config.json` when that file exists. The config MAY define the preferred Python command, whether conversion is enabled, the automatic backend order, backend-specific command names, and whether network, OCR, LLM, transcription, or hosted document-intelligence behavior is allowed. Missing config SHOULD fall back to conservative local-only defaults.
+When optional Python converter packages are installed, they SHOULD be installed in a project-local virtual environment such as `.venv/`. Agents MUST NOT create virtual environments or install packages unless the operator explicitly asks them to do so. `.venv/` is local environment state and MUST NOT be treated as vault content.
+Common local converter backends MAY include:
+- `pymupdf4llm` for fast local extraction from native-text PDFs
+- `markitdown` for general document-to-Markdown conversion
+- `arxiv2md` for arXiv or academic sources where a structured arXiv source can be identified
+- `marker` for complex PDFs where higher-fidelity local extraction is needed
+These backend names are examples, not required dependencies. The wiki schema MUST remain stable regardless of the converter used.
+### 7.2 `entities/`
+An `entity` page represents a durable thing.
+Typical entity kinds:
+- person
+- organization
+- project
+- product
+- system
+- place
+- event
+- artifact
+A page in `entities/` MUST have `pageType: entity`.
+### 7.3 `concepts/`
+A `concept` page represents a definition, method, abstraction, policy, standard, workflow, runbook, checklist, or operational playbook.
+A page in `concepts/` MUST have `pageType: concept`.
+### 7.4 `syntheses/`
+A `synthesis` page represents maintained cross-source interpretation or rollup.
+Examples:
+- overview
+- analysis
+- comparison
+- brief
+- timeline
+- summary
+A page in `syntheses/` MUST have `pageType: synthesis`.
+Agents SHOULD create a synthesis page when the user asks for durable interpretation across multiple sources, claims, entities, concepts, or time periods. Synthesis pages are appropriate for briefs, comparisons, literature-style summaries, chronological narratives, decision memos, and maintained analyses that should remain available as authored knowledge.
+Agents SHOULD NOT create a synthesis page for:
+- a single atomic proposition that belongs in `claims/`
+- a raw or verbatim captured item that belongs in `sources/`
+- an unresolved unknown that belongs in `questions/`
+- a deterministic maintenance output that belongs in `reports/`
+- whole-wiki orientation that belongs in root `overview.md`
+Synthesis pages are secondary authored interpretation. They MUST preserve uncertainty, identify their source basis, and avoid presenting unsupported conclusions as established fact.
+### 7.5 `questions/`
+A `question` page represents an unresolved issue.
+A page in `questions/` MUST have `pageType: question`.
+### 7.6 `claims/`
+A `claim` page represents a standalone atomic proposition that tracks its own evidence independent of any one source.
+A page in `claims/` MUST have `pageType: claim`.
+### 7.7 `reports/`
+A `report` page is generated and SHOULD NOT be treated as an authoritative source of truth.
+A page in `reports/` MUST have `pageType: report` if it includes frontmatter.
+Reports are views over compiled or source page data.
+### 7.8 `index.md`
+`index.md` is the deterministic root-level page catalog for the wiki.
+It SHOULD have `pageType: index`. It MUST NOT be typed as `report`.
+The `index` page type is reserved for wiki-level navigation and page discovery. There is typically only one `index` page per wiki root.
+`index.md` SHOULD be regenerated as a whole by `agent-wiki index` from `_system/cache/pages.json`. The script MUST NOT independently define page truth; it only renders a deterministic catalog from compiled page metadata.
+The script SHOULD support:
+- `--write` to rewrite `index.md`
+- `--check` to verify that `index.md` matches the deterministic rendered output
+The generated page SHOULD include frontmatter and grouped page tables by `pageType`. It MAY include root documentation files as a separate documentation section when those files are intentionally outside the normal page catalog.
+Because the whole file is deterministic, agents and humans SHOULD NOT place durable manual prose in `index.md`. Durable orientation content belongs in root documentation files such as `README.md`, `WIKI.md`, `ONBOARD.md`, and `AGENTS.md`.
+### 7.9 `overview.md`
+`overview.md` is the root-level narrative landing page for the wiki.
+It SHOULD have `pageType: overview`. It MUST NOT be typed as `report`, `index`, or `synthesis`.
+The `overview` page type is reserved for wiki-level orientation. There is typically only one `overview` page per wiki root, and it SHOULD live at root `overview.md`.
+The page SHOULD include:
+- a human-facing summary of the wiki
+- paragraph-form summaries of each active page type
+- enough context for a new human reader to understand what is in the wiki and where to go next
+`overview.md` MAY be written or refreshed by an agent, but it SHOULD be updated intentionally after meaningful content changes rather than regenerated as part of every compile run. It is durable orientation prose, not a deterministic artifact.
+`overview.md` MUST NOT be treated as primary evidence for claims unless the relevant material has been promoted into canonical source, claim, evidence, or page metadata records.
+### 7.10 Authored knowledge page bodies
+When an agent or human creates an `entity`, `concept`, `claim`, `question`, or `synthesis` page, the page MUST include a substantive Markdown body after the frontmatter.
+The body SHOULD be detailed, human-facing prose that explains what the page represents, why it matters, and how the structured fields should be understood. It SHOULD NOT be a placeholder, a one-line restatement of the title, or only a machine-readable metadata dump.
+For each page type, the body SHOULD cover the natural human context for that page:
+- `entity` pages SHOULD describe the entity, its role in the vault, important identifiers or aliases, and known context or uncertainty.
+- `concept` pages SHOULD explain the concept, its meaning, boundaries, related methods or examples, and any important distinctions.
+- `claim` pages SHOULD restate the proposition in prose, summarize the evidence posture, and note important caveats or uncertainty.
+- `question` pages SHOULD explain why the question exists, what is already known, what remains unresolved, and what would count as resolution.
+- `synthesis` pages SHOULD provide maintained narrative interpretation, scope, source basis, and current conclusions or open tensions.
+Agents MUST preserve existing human-authored body prose unless the operator explicitly asks for a rewrite.
+---
+## 8. Page Identity and Naming
+Each page MUST have a stable `id`.
+### 8.1 Requirements
+- `id` MUST be globally unique within the wiki root.
+  - *Note: Duplicate IDs will not self-repair. The compiler flags collisions in the console and logs the offending file paths in `_system/logs/`. In the compiled indexes, the last processed file with the duplicate ID will overwrite previous entries.*
+- `id` SHOULD be stable over time
+- `id` SHOULD NOT depend on the page filename alone
+- `id` SHOULD use dotted lowercase namespace-style format
+  - *Exception for Source Pages:* Source pages use the format `source.<yyyy-mm-dd>.<sourceType>.<sourceSlug>` to balance semantic density with chronological sorting and collision prevention.
+  - *Exception for attachments:* Attachment IDs are generated using `agent-wiki uuid` and stored in the frontmatter of the source page as the value of the `attachments` field. This allows for easy reference to attachments from the source page and ensures that attachments are properly linked to their sources.
+  - *Exception for evidence blocks:* Evidence block IDs are generated using `agent-wiki uuid` and stored in the frontmatter of the source page as the value of the `evidence` field. This allows for easy reference to evidence blocks from the source page and ensures that evidence blocks are properly linked to their sources.
+Examples:
+- `entity.place.riverside-community-garden`
+- `concept.policy.watershed-management`
+- `source.2026-04-12.webpage.urban-tree-canopy`
+- `synthesis.overview.coastal-resilience`
+- `question.accessibility.evacuation-routing`
+#### Rationale: Dotted Namespaces vs. UUIDs
+While UUIDs guarantee mathematical uniqueness without central coordination, the dotted lowercase namespace format prioritizes **semantic density** and **agent ergonomics**:
+- **Context at a Glance:** Humans and agents can immediately infer what an ID points to without needing to resolve the node.
+- **Token Efficiency:** Descriptive IDs like `synthesis.overview.coastal-resilience` provide rich metadata at a low token cost.
+- **Collision Prevention:** Scoping IDs by `<pageType>.<namespace>.<slug>` prevents common naming collisions in a flat namespace.
+### 8.2 Filenames
+Filenames MAY change. IDs SHOULD remain stable. The id is used to generate filenames, dots are replaced with hyphens. filename format: `source-<yyyy-mm-dd>-<sourceType>-<sourceSlug>.md` or `<idWithHyphens>.md`.
+### 8.3 Canonical names
+Entities and concepts SHOULD include `canonicalName`.
+### 8.4 Internal linking convention
+Internal references in wiki-native pages and wiki-native documentation MUST use Obsidian-style wikilinks.
+```md
+[[page-slug]]
+[[page-slug|Display Text]]
+[[page-slug#section-heading]]
+```
+Standard markdown links (`[text](path)`) MUST NOT be used for internal vault-page references. They MAY be used for external URLs.
+This convention applies to:
+- page body content
+- wiki-native root docs (`AGENTS.md`, `WIKI.md`, `INBOX.md`, `ONBOARD.md`, `CLAUDE.md`, etc.)
+- the **navigation/display reference fields** in frontmatter: `sourcePages`, `derivedClaims`, `relatedPages`, `relatedClaims`, `extractedEntities`, `extractedConcepts`, `extractedClaims`, `extractedQuestions`, `originPath`
+Public repository documentation MAY use standard markdown links for repository readability, especially `README.md` when it is intended to render cleanly on GitHub.
+#### 8.4.1 Reference target vs. display text
+The wikilink **target is the filename stem, not the page ID** (see §8.2: the id is hyphenated, and the `source.` prefix is dropped for source pages). The page ID is carried as the wikilink **alias** (display text) so the type-prefixed ID stays human-legible:
+```yaml
+# id source.2026-04-12.webpage.tidal-flood-map lives in
+# sources/2026-04-12-webpage-tidal-flood-map.md
+sourcePages: ["[[2026-04-12-webpage-tidal-flood-map|source.2026-04-12.webpage.tidal-flood-map]]"]
+derivedClaims: ["[[claim-descriptive-high-tide-risk|claim.descriptive.high-tide-risk]]"]
+originPath: "[[raw/2026-04-12-report|raw/2026-04-12-report.md]]"
+```
+Writing the dotted ID directly as the target (`[[source.2026-04-12.webpage.tidal-flood-map]]`) does **not** resolve in Obsidian, because no file is named that. `agent-wiki create-page` wraps supported fields automatically; `agent-wiki migrate-refs-to-links` converts existing pages.
+#### 8.4.2 Raw-ID fields (MUST NOT be wikilinked)
+The following frontmatter fields are resolved by **exact ID match** during compilation (`agent-wiki compile` builds an id→page map and looks these up). They MUST remain bare IDs — wrapping them in `[[ ]]` breaks resolution:
+- `id`, `parentSourceId`, `subjectPageId`, `sourceIds`, `sourceParts`
+- `evidence[].sourceId`, relation `sourceClaimIds`, timeline `sourceIds`
+In short: **grounding/relationship lists are links; structural pointers used by the compiler are raw IDs.**
+Skill instruction files SHOULD use explicit relative paths when directing agents to project files, schemas, scripts, or examples. Skills MAY mention wikilinks only when the desired output is wiki-native content that should contain wikilinks.
+Rationale: wikilinks decouple vault references from file system paths, survive renames, and are resolved natively by Obsidian and compatible tooling. Public repository docs have a different audience and SHOULD remain readable in standard markdown renderers.
+### 8.5 Attachment IDs
+Attachments (binary assets like images, PDFs, etc.) stored in `_attachments/` do not use frontmatter IDs. Instead, their **filename** acts as their unique identifier for internal linking (e.g., via Obsidian wikilinks).
+To prevent silent overwrites in the flat `_attachments/` directory, attachment IDs MUST use the following pattern:
+`yyyy-mm-dd-<source-slug>-<UUID>-<index>.<ext>`
+- `yyyy-mm-dd`: The date of capture.
+- `<source-slug>`: The same 4-word summary as the source file.
+- `<UUID>`: A unique identifier generated specifically for the attachment.
+- `<index>`: An incremental index (starting at 1) for sources containing multiple attachments.
+### 8.6 Cross-vault linking
+Pages in one vault MAY reference pages in a separate Obsidian vault using an `obsidian://` URI link.
+An `obsidian://` URI has the form:
+```
+obsidian://open?vault=<vault-name>&file=<url-encoded-file-path>
+```
+- `<vault-name>` is the name of the target vault as registered in Obsidian (the folder name Obsidian uses to identify the vault).
+- `<url-encoded-file-path>` is the path to the target file within that vault, URL-encoded (spaces become `%20`, slashes remain `/`).
+To obtain the URI for a target page, open the target vault in Obsidian, right-click the file in the file explorer, and select **Copy Obsidian URL**.
+In the linking page, write the cross-vault reference as a standard markdown link (NOT a wikilink, since wikilinks only resolve within the same vault):
+```md
+[Display text](obsidian://open?vault=my-other-vault&file=folder%2Fpage-slug)
+```
+Example:
+```md
+[Working with multiple vaults](obsidian://open?vault=o4e-06&file=00%20Obsidian%20for%20Everyone%20course%2F00-02%20First%20steps%20with%20Obsidian%2FWorking%20with%20multiple%20vaults)
+```
+Cross-vault links are Obsidian-local and will not resolve in GitHub, plain markdown renderers, or agent contexts. Pages that use cross-vault links SHOULD note this limitation in a comment or body prose so future readers and agents do not treat broken links as vault errors.
+#### Agent resolution of `obsidian://` URIs
+Agents MUST NOT attempt to launch or dispatch `obsidian://` URIs through the OS protocol handler.
+An agent MAY resolve an `obsidian://` URI to a readable file path when `knownVaults` is present in `_system/config.json`. The resolution procedure is:
+1. Parse the URI query string and extract the `vault` and `file` parameters.
+2. URL-decode the `file` parameter (replace `%20` with space, `%2F` with `/`, etc.) to obtain the relative file path within the target vault.
+3. Append `.md` if the decoded path has no file extension.
+4. Look up the `vault` value as a key in `knownVaults`. If the key is absent, stop and report that the vault is not configured locally.
+5. Construct the full absolute file path: `<knownVaults[vault]>/<decoded-file-path>`.
+6. Verify the file exists before reading. If it does not exist, report the missing path rather than silently failing.
+When `knownVaults` is absent or the target vault is not listed, agents MUST treat the `obsidian://` URI as an opaque external reference and MUST NOT guess or scan for the target vault root.
+---
+## 9. Required Universal Frontmatter
+Every authored page except purely generated disposable report pages SHOULD include frontmatter.
+Minimum universal frontmatter:
+```yaml
+id: entity.place.riverside-community-garden
+pageType: entity
+title: Riverside Community Garden
+status: active
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: []
+tags: []
+```
+### 9.1 Universal fields
+#### `id`
+Type: string
+Required: yes
+#### `pageType`
+Type: enum
+Required: yes
+Allowed values:
+- `source`
+- `entity`
+- `concept`
+- `claim`
+- `synthesis`
+- `question`
+- `report`
+- `index`
+- `overview`
+#### `title`
+Type: string
+Required: yes
+#### `status`
+Type: string
+Required: yes
+Interpretation depends partly on page type.
+#### `createdAt`
+Type: date (`YYYY-MM-DD`)
+Required: yes
+#### `updatedAt`
+Type: date (`YYYY-MM-DD`)
+Required: yes
+#### `aliases`
+Type: string[]
+Required: yes, but MAY be empty
+#### `tags`
+Type: string[]
+Required: yes, but MAY be empty
+### 9.2 Recommended universal fields
+```yaml
+canonicalName: <Canonical Name>
+owner:
+summary:
+sourcePages: []
+relatedPages: []
+confidence:
+freshness:
+```
+These are optional in v2, but strongly recommended where applicable.
+---
+## 10. Page-Type Specific Frontmatter
+This section defines the pure schema templates for each page type, followed by a concrete example.
+### 10.1 Source pages
+**Schema:**
+```yaml
+id: source.<yyyy-mm-dd>.<sourceType>.<sourceSlug>
+pageType: source
+title: <title>
+status: <status>
+sourceType: <sourceType>
+sourceRole: <sourceRole>
+parentSourceId: <sourceId>
+partIndex: <number>
+partCount: <number>
+locator: <locator>
+sourceParts: []
+originUrl: <url>
+originPath: <wikilink-to-local-raw-file>
+convertedAt: <yyyy-mm-dd>
+conversionTool: <tool>
+conversionToolVersion: <version>
+conversionBackend: <backend>
+conversionWarnings: []
+publishedAt: <yyyy-mm-dd>
+retrievedAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+createdAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+attachments: []
+```
+**Example:**
+```yaml
+id: source.2026-04-28.webpage.urban-tree-canopy
+pageType: source
+title: Urban Tree Canopy Assessment
+status: processed
+sourceType: webpage
+sourceRole: whole
+parentSourceId:
+partIndex:
+partCount:
+locator:
+sourceParts: []
+originUrl: https://example.com/reports/urban-tree-canopy
+originPath:
+convertedAt:
+conversionTool:
+conversionToolVersion:
+conversionBackend:
+conversionWarnings: []
+publishedAt: 2026-04-25
+retrievedAt: 2026-04-28
+updatedAt: 2026-04-28
+createdAt: 2026-04-28
+aliases: []
+tags: [urban-planning, tree-canopy]
+attachments: []
+```
+#### `status`
+Allowed values:
+- `unprocessed`
+- `partitioned`
+- `processed`
+- `archived`
+#### `sourceType`
+Allowed values:
+- `webpage`
+- `article`
+- `document`
+- `pdf`
+- `transcript`
+- `email`
+- `meeting-notes`
+- `dataset`
+- `screenshot`
+- `bridge`
+- `import`
+- `other`
+#### `sourceRole`
+Allowed values:
+- `whole`
+- `parent`
+- `part`
+Use `whole` for ordinary source pages that contain the complete captured source body in one page.
+Use `parent` for the parent page of a large partitioned source. Parent source pages SHOULD include `sourceParts` and SHOULD NOT include the full long-form verbatim source body.
+Use `part` for child source part pages. Part source pages SHOULD include `parentSourceId`, `partIndex`, `partCount`, and `locator` when available.
+#### `sourceParts`
+Ordered relative paths to child source part pages. This field SHOULD be present on parent source pages and empty or omitted on ordinary source pages and part pages.
+#### `parentSourceId`
+The source ID of the parent source page. This field SHOULD be present on part source pages and empty or omitted on ordinary source pages and parent source pages.
+#### `partIndex`
+One-based ordinal for a source part within its parent source. This field SHOULD be present on part source pages.
+#### `partCount`
+Total number of source parts for the parent source. This field SHOULD be present on part source pages and MAY be present on parent source pages.
+#### `locator`
+A stable locator for the part within the parent source, such as page range, heading path, timestamp range, slide range, or section range.
+Source pages SHOULD include `originUrl` for externally retrieved material. Source pages promoted from local raw inbox files MAY use `originPath` instead. At least one of `originUrl` or `originPath` SHOULD be present.
+#### Conversion provenance
+Source pages SHOULD include conversion provenance when the canonical source body was produced by converting a raw file or external asset into Markdown.
+Recommended fields:
+- `convertedAt` - date the conversion was performed
+- `conversionTool` - converter or wrapper used
+- `conversionToolVersion` - converter version when available
+- `conversionBackend` - selected backend when the tool supports multiple backends
+- `conversionWarnings` - ordered list of warnings, quality notes, skipped content, or degraded extraction notices
+For partitioned sources, conversion provenance SHOULD appear on the parent source page and MAY also appear on child source part pages when part-level conversion details differ. Child source parts SHOULD still include locators that let evidence point back to the relevant portion of the converted source.
+Large source parent IDs SHOULD use the ordinary source ID format:
+```text
+source.<yyyy-mm-dd>.<sourceType>.<sourceSlug>
+```
+Large source part IDs SHOULD append a stable part suffix:
+```text
+source.<yyyy-mm-dd>.<sourceType>.<sourceSlug>.part<nnn>
+```
+Large source part filenames SHOULD preserve the same ordering:
+```text
+sources/parts/<yyyy-mm-dd>-<sourceType>-<sourceSlug>-part<nnn>.md
+```
+### 10.2 Entity pages
+**Schema:**
+```yaml
+id: entity.<entityType>.<entitySlug>
+pageType: entity
+title: <title>
+entityType: <entityType>
+canonicalName: <canonicalName>
+status: active
+createdAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+```
+**Example:**
+```yaml
+id: entity.place.riverside-community-garden
+pageType: entity
+title: Riverside Community Garden
+entityType: place
+canonicalName: Riverside Community Garden
+status: active
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: [riverside-garden]
+tags: [urban-agriculture]
+```
+#### `entityType`
+Allowed values:
+- `person`
+- `organization`
+- `project`
+- `product`
+- `system`
+- `place`
+- `event`
+- `artifact`
+- `document`
+- `other`
+### 10.3 Concept pages
+**Schema:**
+```yaml
+id: concept.<conceptType>.<conceptSlug>
+pageType: concept
+title: <title>
+conceptType: <conceptType>
+status: active
+createdAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+```
+**Example:**
+```yaml
+id: concept.method.adaptive-reuse
+pageType: concept
+title: Adaptive Reuse
+conceptType: method
+status: active
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: [building-reuse]
+tags: [architecture]
+```
+#### `conceptType`
+Allowed values:
+- `definition`
+- `principle`
+- `framework`
+- `method`
+- `policy`
+- `standard`
+- `pattern`
+- `workflow`
+- `runbook`
+- `checklist`
+- `playbook`
+- `theory`
+- `taxonomy`
+- `other`
+### 10.4 Synthesis pages
+**Schema:**
+```yaml
+id: synthesis.<synthesisType>.<synthesisSlug>
+pageType: synthesis
+title: <title>
+synthesisType: <synthesisType>
+scope: <scope>
+status: active
+sourcePages: []
+derivedClaims: []
+createdAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+```
+**Example:**
+```yaml
+id: synthesis.overview.coastal-resilience
+pageType: synthesis
+title: Coastal Resilience Overview
+synthesisType: overview
+scope: coastal flood mitigation
+status: active
+sourcePages: ["[[2026-04-12-webpage-tidal-flood-map|source.2026-04-12.webpage.tidal-flood-map]]"]
+derivedClaims: ["[[claim-descriptive-high-tide-risk|claim.descriptive.high-tide-risk]]"]
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: []
+tags: [climate-resilience]
+```
+#### `synthesisType`
+Allowed values:
+- `summary`
+- `overview`
+- `analysis`
+- `timeline`
+- `brief`
+- `comparison`
+### 10.5 Synthesis workflow rules
+Synthesis pages are durable authored knowledge, not deterministic reports. They combine judgment, source selection, prose, and uncertainty management. A synthesis page MAY cite source pages, derived claim pages, related entities, related concepts, questions, or prior syntheses, but it MUST remain clear about what is directly sourced and what is interpretive.
+#### When to create a synthesis
+Agents SHOULD create a synthesis page when at least one of the following is true:
+- the user asks to synthesize, compare, summarize, brief, analyze, or narrate across more than one source or page
+- several claims or sources need an integrated explanation that is more useful than a flat list
+- the vault needs a durable current-state brief for a topic, project, decision area, or research thread
+- a chronological account is needed and the chronology is more naturally maintained as a narrative than as isolated timeline records
+- contradictions, open questions, or competing interpretations need to be held together in one maintained reading
+Agents SHOULD update an existing synthesis instead of creating a new one when the existing page has the same scope, audience, and synthesis type. Agents SHOULD create a new synthesis when the scope, time horizon, audience, or analytical question is materially different.
+#### Expected body structure
+The body of a synthesis page MUST be substantive Markdown prose. It SHOULD be written for a human reader and SHOULD contain enough context to stand alone without requiring the reader to inspect every referenced source first.
+A synthesis body SHOULD normally include:
+- scope and purpose
+- source basis or coverage
+- main synthesis in paragraph form
+- important evidence, claims, or examples
+- uncertainty, limits, contradictions, or unresolved questions
+- current conclusion or next-step implication, when appropriate
+Timeline syntheses SHOULD include a chronological narrative and MAY also include a structured `timeline:` field when individual events need deterministic extraction into `_system/cache/timeline-events.json`.
+Comparison syntheses SHOULD make comparison dimensions explicit. Brief syntheses SHOULD prioritize concise conclusions and decision-relevant context. Analysis syntheses SHOULD explain reasoning and uncertainty instead of only listing findings.
+#### Source and evidence grounding
+Synthesis pages MUST list their source basis in `sourcePages` when source pages are used. If the synthesis relies on established claim pages, it SHOULD list them in `derivedClaims`.
+Synthesis prose SHOULD cite the most specific canonical source page, source part, claim page, or question page needed to support the discussion. Large-document syntheses SHOULD cite source part pages rather than only parent source manifests when the relevant evidence came from a specific part.
+Synthesis pages MUST NOT be used to launder unsupported assertions into accepted knowledge. If a synthesis introduces an atomic proposition that should be tracked independently, the agent SHOULD create or update a claim page and reference it from `derivedClaims`.
+Evidence entries for claim pages SHOULD point back to canonical source pages whenever possible. They SHOULD NOT point only to a synthesis page unless the synthesis itself is the best available authored source for an interpretive claim about the wiki's analysis.
+When the evidence base is incomplete, contested, or weak, the synthesis body MUST say so plainly. Agents MUST preserve minority views, contradictions, and caveats that matter to the synthesis question.
+#### Maintenance rules
+When a synthesis page is meaningfully changed, the agent MUST update `updatedAt`. If the source basis changes, the agent SHOULD update `sourcePages`, `derivedClaims`, and any relevant relationships at the same time.
+Agents SHOULD maintain synthesis pages by revising the existing body prose in place, while preserving human-authored material unless the operator explicitly asks for a rewrite. If a prior conclusion becomes stale or contradicted, agents SHOULD revise the conclusion and record the reason in prose rather than silently removing the older context.
+Agents SHOULD create question pages for unresolved issues discovered during synthesis when the question is important enough to track independently. Agents SHOULD create or update claim pages for important atomic assertions that need evidence tracking outside the synthesis body.
+Synthesis pages SHOULD be refreshed intentionally after meaningful new sources or claims are added to their scope. They SHOULD NOT be regenerated automatically during every compile run.
+#### Skill boundary
+The deterministic page scaffolder MAY create the initial synthesis page file and required frontmatter, but it does not decide what to synthesize or write the synthesis body.
+A dedicated synthesis skill SHOULD be added if agents are expected to frequently handle requests such as "synthesize these sources", "write a brief", "compare these documents", "summarize this research thread", or "make a timeline synthesis". Such a skill SHOULD own source and claim selection, synthesis type selection, body prose, uncertainty handling, updates to related claim/question records, and operational logging.
+### 10.6 Question pages
+Questions are first-class authored pages in v2.
+They represent known unknowns, unresolved research tasks, or ambiguity the system should not erase.
+#### Question rules
+- Questions MUST have stable IDs.
+- Questions MUST link related pages or claims.
+- Resolved questions MUST remain in the vault with updated status, not be deleted.
+**Schema:**
+```yaml
+id: question.<domain>.<questionSlug>
+pageType: question
+title: <title>
+priority: <priority>
+status: open
+relatedClaims: []
+relatedPages: []
+openedAt: <yyyy-mm-dd>
+createdAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+```
+**Example:**
+```yaml
+id: question.accessibility.evacuation-routing
+pageType: question
+title: Which evacuation routes are accessible during high-water events?
+priority: high
+status: open
+relatedClaims: []
+relatedPages: []
+openedAt: 2026-04-12
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: []
+tags: [emergency-planning]
+```
+#### `priority`
+Allowed values:
+- `low`
+- `medium`
+- `high`
+- `critical`
+#### `status`
+Allowed values for question pages:
+- `open`
+- `researching`
+- `blocked`
+- `resolved`
+- `dropped`
+### 10.7 Claim pages
+See also: Section 11. Structured Claims.
+**Schema:**
+```yaml
+id: claim.<claimType>.<claimSlug>
+pageType: claim
+title: <title>
+claimType: <claimType>
+status: <status>
+confidence: <float>
+text: <text>
+subjectPageId: <page-id>
+sourceIds: []
+evidence: []
+createdAt: <yyyy-mm-dd>
+updatedAt: <yyyy-mm-dd>
+aliases: []
+tags: []
+```
+**Example:**
+```yaml
+id: claim.historical.library-reopened-2024
+pageType: claim
+title: Northside Library reopened in 2024
+claimType: historical
+status: supported
+confidence: 0.90
+text: Northside Library reopened to the public in 2024 after seismic upgrades were completed.
+subjectPageId: entity.place.northside-library
+sourceIds:
+  - source.2026-04-12.library-renovation-notice
+evidence: []
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: []
+tags: []
+```
+---
+## 11. Structured Claims
+Claims are a primary **pagetype** in the system. They are authored as top-level, standalone files in the `claims/` directory.
+For v2, Standalone Claim Pages are the normative shape. However, pages MAY also contain zero or more embedded claims in their frontmatter under the `claims:` key for convenience. Both formats are parsed identically by the compile pipeline.
+### 11.1 Claim shape
+**Schema:**
+```yaml
+claims:
+  - id: claim.<claimType>.<claimSlug>
+    text: <text>
+    status: <status>
+    confidence: <float>
+    claimType: <claimType>
+    relatedClaimIds: []
+    evidence:
+      - id: <evidenceId>
+        sourceId: <sourceId>
+        path: <sourcePath>
+        lines: <lineRange>
+        kind: <kind>
+        relation: <relation>
+        weight: <float>
+        note: <note>
+        excerpt: <text>
+        retrievedAt: <yyyy-mm-dd>
+        updatedAt: <yyyy-mm-dd>
+    createdAt: <yyyy-mm-dd>
+    updatedAt: <yyyy-mm-dd>
+    validFrom: <yyyy-mm-dd>
+    validTo: <yyyy-mm-dd>
+```
+**Example:**
+```yaml
+claims:
+  - id: claim.descriptive.school-energy-retrofit
+    text: The Lincoln Middle School heat-pump retrofit reduced annual building energy use by 18 percent.
+    status: supported
+    confidence: 0.91
+    claimType: descriptive
+    relatedClaimIds: []
+    evidence:
+      - id: evidence.quote.supports.a1b2c3d4
+        sourceId: source.2026-04-12.webpage.school-energy-audit
+        path: sources/2026-04-12.webpage.school-energy-audit.md
+        lines: 55-79
+        kind: quote
+        relation: supports
+        weight: 0.86
+        note: The audit compares normalized energy use before and after the retrofit.
+        excerpt: "Weather-normalized annual energy consumption fell by 18 percent after commissioning."
+        retrievedAt: 2026-04-12
+        updatedAt: 2026-04-12
+    createdAt: 2026-04-12
+    updatedAt: 2026-04-12
+    validFrom: 2026-04-12
+    validTo:
+```
+### 11.2 Required claim fields
+#### `id`
+Type: string
+Required: yes
+Must be globally unique.
+#### `text`
+Type: string
+Required: yes
+#### `status`
+Type: enum
+Required: yes
+Allowed values:
+- `supported`
+- `weakly_supported`
+- `inferred`
+- `unverified`
+- `contested`
+- `contradicted`
+- `deprecated`
+#### `confidence`
+Type: number
+Required: yes
+Range: `0.0` to `1.0`
+#### `claimType`
+Type: enum
+Required: yes
+Allowed values:
+- `descriptive`
+- `historical`
+- `causal`
+- `interpretive`
+- `normative`
+- `forecast`
+#### `evidence`
+Type: array
+Required: yes, but MAY be empty in draft states
+#### `createdAt`
+Type: date
+Required: yes
+#### `updatedAt`
+Type: date
+Required: yes
+### 11.3 Optional claim fields
+- `relatedClaimIds: string[]`
+- `validFrom: date | null`
+- `validTo: date | null`
+- `tags: string[]`
+- `note: string`
+### 11.4 Claim rules
+- Claim IDs MUST be stable.
+- Claim IDs MUST be unique across the vault.
+- Claims SHOULD be atomic and not overloaded.
+- A claim SHOULD express one proposition, not several glued together.
+- A claim MAY be attached to entity, concept, source, synthesis, or question pages when appropriate.
+- Pages SHOULD NOT hide all important assertions in prose if those assertions matter for machine use.
+---
+## 12. Evidence
+Evidence entries attach provenance and support semantics to a claim.
+### 12.1 Evidence shape
+**Schema:**
+```yaml
+evidence:
+  - id: evidence.<kind>.<relation>.<uuid>
+    sourceId: <source-id>
+    path: <source-path>
+    lines: <line-range>
+    kind: <kind>
+    relation: <relation>
+    weight: <float>
+    note: <note>
+    excerpt: <text>
+    retrievedAt: <yyyy-mm-dd>
+    updatedAt: <yyyy-mm-dd>
+```
+**Example:**
+```yaml
+evidence:
+  - id: evidence.quote.supports.a1b2c3d4
+    sourceId: source.2026-04-28.article.urban-tree-canopy
+    path: sources/2026-04-28.article.urban-tree-canopy.md
+    lines: 10-18
+    kind: quote
+    relation: supports
+    weight: 0.82
+    note: Direct statement from the canopy assessment
+    excerpt: "..."
+    retrievedAt: 2026-04-12
+    updatedAt: 2026-04-12
+```
+Nested block YAML is the canonical representation for evidence records. Obsidian's Properties UI MAY display nested evidence lists as JSON-like inline objects instead of editable nested fields. That display is cosmetic and MUST NOT be treated as a schema violation when the underlying Markdown frontmatter is valid YAML matching this shape.
+### 12.2 Required evidence fields
+#### `id`
+Type: string
+Required: yes
+#### `sourceId`
+Type: string
+Required: yes
+Must reference an existing source page ID when possible.
+#### `path`
+Type: string
+Required: yes
+Path to the supporting page or source page.
+#### `kind`
+Type: enum
+Required: yes
+Allowed values:
+- `quote`
+- `summary`
+- `measurement`
+- `observation`
+- `screenshot`
+- `transcript`
+- `inference`
+#### `relation`
+Type: enum
+Required: yes
+Allowed values:
+- `supports`
+- `weakens`
+- `contradicts`
+- `context_only`
+#### `weight`
+Type: number
+Required: yes
+Range: `0.0` to `1.0`
+#### `updatedAt`
+Type: date
+Required: yes
+### 12.3 Optional evidence fields
+- `lines: string`
+- `note: string`
+- `excerpt: string`
+- `retrievedAt: date`
+- `locatorText: string`
+### 12.4 Evidence rules
+- Evidence MUST not imply stronger support than it actually provides.
+- `context_only` evidence MUST NOT be treated as direct support during compile scoring.
+- Evidence SHOULD point back to a source page, not only to a synthesis page, whenever possible.
+- Claims SHOULD have at least one evidence item to avoid appearing in evidence-gap reports.
+- Evidence entries MAY represent negative evidence using `weakens` or `contradicts`.
+---
+## 13. Relationships
+Relationships are explicit machine-readable edges between objects.
+Relationships MAY be authored in page frontmatter under `relations:`.
+### 13.1 Relationship shape
+**Schema:**
+```yaml
+relations:
+  - subject: <subject-id>
+    predicate: <predicate>
+    object: <object-id>
+    confidence: <float>
+    sourceClaimIds: []
+```
+**Example:**
+```yaml
+relations:
+  - subject: entity.place.lincoln-middle-school
+    predicate: uses
+    object: entity.system.ground-source-heat-pump
+    confidence: 0.88
+    sourceClaimIds: ["claim.descriptive.school-energy-retrofit"]
+```
+### 13.2 Required relationship fields
+#### `subject`
+Type: string
+Required: yes
+#### `predicate`
+Type: enum/string
+Required: yes
+#### `object`
+Type: string
+Required: yes
+#### `confidence`
+Type: number
+Required: yes
+Range: `0.0` to `1.0`
+### 13.3 Optional relationship fields
+- `sourceClaimIds: string[]`
+- `note: string`
+- `updatedAt: date`
+### 13.4 Recommended predicates
+v2 SHOULD use a controlled predicate set:
+- `is_a`
+- `part_of`
+- `depends_on`
+- `uses`
+- `produces`
+- `founded_by`
+- `owned_by`
+- `located_in`
+- `related_to`
+- `supports`
+- `contradicts`
+- `mentions`
+- `applies_to`
+- `derived_from`
+### 13.5 Relationship rules
+- Relationship IDs are optional in v2, but compiled output MAY assign normalized IDs.
+- Relationships SHOULD be grounded by source claims where possible.
+- Freeform predicates SHOULD be avoided in v2.
+---
+## 14. Contradictions
+v2 tracks contradictions primarily through compiled outputs and reports.
+Contradictions MAY also be represented in page content.
+v2 does not require contradiction pages, but the compiler MUST be able to surface contradiction records.
+### 14.1 Compiled contradiction shape
+**Schema:**
+```yaml
+id: contradiction.<contradictionType>.<contradictionSlug>
+type: <type>
+status: <status>
+summary: <summary>
+claimIds: []
+sourceIds: []
+resolution: <resolution>
+updatedAt: <yyyy-mm-dd>
+```
+**Example:**
+```yaml
+id: contradiction.interpretation-conflict.ferry-ridership
+type: interpretation_conflict
+status: open
+summary: Two claims disagree on whether weekend ferry ridership has recovered to pre-closure levels.
+claimIds:
+  - claim.descriptive.ferry-ridership-recovered
+  - claim.descriptive.ferry-ridership-still-depressed
+sourceIds:
+  - source.2026-04-20.ferry-ridership-dashboard
+resolution:
+updatedAt: 2026-04-29
+```
+### 14.2 Required fields
+- `id`
+- `type`
+- `status`
+- `summary`
+- `claimIds`
+- `updatedAt`
+### 14.3 Allowed contradiction types
+- `direct-conflict`
+- `date-conflict`
+- `scope-conflict`
+- `definition-conflict`
+- `interpretation-conflict`
+### 14.4 Allowed contradiction status values
+- `open`
+- `under-review`
+- `resolved`
+- `dismissed`
+### 14.5 Detection strategy
+**Explicit detection (compiled from flags):**
+- Claims with `status: contradicted`
+- Evidence entries with `relation: contradicts`
+**Semantic conflict detection (cross-claim analysis):**
+The compiler SHOULD also detect implicit conflicts by comparing claims that share the same `subjectPageId`.
+- **Date conflict** (`type: date-conflict`): Two or more `claimType: historical` claims on the same subject that have different `date` field values and are both in an active (non-deprecated, non-contradicted) status.
+- **Scope conflict** (`type: scope-conflict`): Claims with `status: contested` coexisting with claims of `status: supported` or `weakly-supported` on the same subject, indicating active unresolved disagreement.
+Semantic contradiction detection operates on structured fields only. It does not perform natural-language text comparison.
+---
+## 15. Timelines
+Timelines represent dated events and temporal changes tied to pages in the wiki. They exist to support chronology, historical tracking, date-based retrieval, and temporal conflict detection.
+Timeline data does not require a top-level `timelines/` folder in v2. It is represented through page-level `timeline:` records, synthesis pages with `synthesisType: timeline`, and compiled timeline cache output.
+### 15.1 Structure
+Timeline entries MUST be represented under a `timeline:` field.
+**Schema:**
+```yaml
+timeline:
+  - id: tl.<slug>.<index>
+    date: <yyyy-mm-dd>
+    endDate: <yyyy-mm-dd>
+    text: <text>
+    eventType: <eventType>
+    status: <status>
+    confidence: <float>
+    relatedClaims: []
+    sourceIds: []
+    updatedAt: <yyyy-mm-dd>
+```
+**Example:**
+```yaml
+timeline:
+  - id: tl.riverside-garden.001
+    date: 2026-04-12
+    endDate:
+    text: Riverside Community Garden opened its spring seedling exchange.
+    eventType: community-event
+    status: supported
+    confidence: 0.90
+    relatedClaims:
+      - "[[claim.historical.seedling-exchange-opened]]"
+    sourceIds:
+      - source.2026-04-12.webpage.garden-newsletter
+    updatedAt: 2026-04-12
+```
+### 15.2 Required and Optional Fields
+**Required fields:**
+- `id`
+- `date`
+- `text`
+**Optional fields:**
+- `endDate`
+- `eventType`
+- `status`
+- `confidence`
+- `relatedClaims`
+- `sourceIds`
+- `relatedPages`
+- `note`
+- `createdAt`
+- `updatedAt`
+### 15.3 Placement and Semantics
+Timeline entries MAY appear on any authored page type when that page is the natural owner of the event, including entity, concept, source, synthesis, and question pages.
+A timeline entry SHALL be authored on the page that most naturally owns the event. It SHOULD reference related claims and source IDs when the event matters for reasoning, retrieval, or contradiction analysis.
+For a single-point event, use `date`. For a bounded range, use both `date` and `endDate`.
+A synthesis page that acts as a dedicated chronology SHALL use:
+```yaml
+pageType: synthesis
+synthesisType: timeline
+```
+### 15.4 Compile and Validation
+The compile pipeline SHOULD extract timeline entries into:
+```text
+_system/cache/timeline-events.json
+```
+This cache is used for chronological queries, filtering, timeline reports, and temporal conflict detection.
+A v2 validator SHOULD check:
+- every timeline entry has an `id`
+- every timeline entry has a valid `date`
+- every timeline entry has `text`
+- timeline IDs are unique
+- `endDate`, if present, is not earlier than `date`
+- referenced claim IDs and source IDs exist when possible
+The compiler SHOULD flag timeline conflicts when multiple entries appear to describe the same event but disagree on date, range, or ordering.
+---
+## 16. Aliases
+Entities and concepts SHOULD include aliases when relevant.
+**Example:**
+```yaml
+canonicalName: Riverside Community Garden
+aliases:
+  - riverside-garden
+  - river-garden
+```
+Alias support exists to improve:
+- search
+- deduplication
+- matching
+- claim linking
+- prompt grounding
+---
+## 17. Authoritative Sources of Truth
+The system has multiple layers with different authorities.
+### 17.1 Authoritative layers
+Primary truth sources:
+1. page frontmatter
+2. authored page content where structured references exist
+3. compiled caches derived from the above
+### 17.2 Non-authoritative layers
+These are views, not truth sources:
+- `reports/`
+- `_system/logs/log.md`
+- ad hoc dashboard summaries
+- search indexes
+- prompt supplements that do not round-trip back to pages
+### 17.3 Rule
+Compiled outputs SHALL reflect page truth.
+Reports SHALL reflect compiled or page truth.
+Reports SHALL NOT silently become the canonical data layer.
+### 17.4 Documentation layers
+The project documentation has separate audiences. Agents SHOULD load the smallest authoritative document that can answer the current task.
+Documentation layers:
+- `AGENTS.md` — agent behavior contract for editing, compiling, linking, logging, and preserving human content.
+- `WIKI.md` — compact runtime schema, editorial guide, page type summary, ID formats, status enums, and examples for ordinary vault operations.
+- `INBOX.md` — short pointer to `WIKI.md` inbox rules and the `process-inbox` skill.
+- `ONBOARD.md` — first-run setup and local environment configuration workflow.
+- `AGENT-WIKI-SPEC-v2.md` — full project and development contract for maintainers, system changes, script behavior, validation rules, compatibility rules, and unresolved ambiguity.
+Skills and ordinary wiki operations SHOULD prefer `WIKI.md` for schema, allowed enums, ID formats, and examples. They SHOULD consult `AGENT-WIKI-SPEC-v2.md` only when changing project behavior, updating scripts, updating skills, modifying configuration policy, resolving ambiguity, or when `WIKI.md` does not contain enough detail.
+If `WIKI.md` conflicts with `AGENT-WIKI-SPEC-v2.md`, the full specification remains canonical until the conflict is resolved.
+---
+## 18. Compile Pipeline
+The compile step reads the authored wiki and emits stable machine-facing artifacts.
+### 18.1 Compile goals
+The compile pipeline exists so agents and runtime code do not need to scrape arbitrary markdown.
+It MUST:
+- normalize page metadata
+- extract claims
+- extract evidence
+- extract relations
+- compute health signals
+- emit stable cache files
+- generate reports
+### 18.2 Minimum v2 compile outputs
+The following files MUST be emitted under `_system/cache/`:
+- `agent-digest.json`
+- `claims.jsonl`
+- `pages.json`
+- `relations.jsonl`
+The following files SHOULD also be emitted:
+- `contradictions.json`
+- `questions.json`
+- `timeline-events.json`
+- `source-index.json`
+### 18.3 Required cache files
+#### `agent-digest.json`
+Purpose:
+- compact high-signal prompt supplement
+- runtime context pack
+- first-pass retrieval layer
+This file SHOULD contain:
+- key page summaries
+- important claims
+- notable open questions
+- notable contradictions
+- high-priority entity/concept summaries
+#### `claims.jsonl`
+Purpose:
+- claim-level retrieval
+- fast lookup by claim ID
+- status/confidence filtering
+- backlinks to owning pages
+Each line SHOULD contain:
+- normalized claim record
+- owning page ID
+- page path
+- evidence summary
+- freshness info if available
+#### `pages.json`
+Purpose:
+- normalized metadata index for all pages
+Each page record SHOULD include:
+- `id`
+- `pageType`
+- `title`
+- `path`
+- `status`
+- `updatedAt`
+- `aliases`
+- `tags`
+- page-type-specific metadata
+- counts for claims/relations/questions if available
+#### `relations.jsonl`
+Purpose:
+- graph edge retrieval
+- relationship traversal
+- cheap graph context generation
+Each line SHOULD contain:
+- normalized subject
+- predicate
+- object
+- page source
+- source claim IDs if present
+- confidence
+### 18.4 Recommended cache files
+#### `contradictions.json`
+Conflict registry.
+#### `questions.json`
+Open question registry.
+#### `timeline-events.json`
+Chronological event index.
+#### `source-index.json`
+Source metadata registry.
+The source index SHOULD preserve large-source structure when present. Source records SHOULD include `sourceRole`, `parentSourceId`, `sourceParts`, `partIndex`, `partCount`, and `locator` when those fields exist on source pages.
+Tools SHOULD be able to answer:
+- which source parts belong to a parent source
+- which parent source owns a source part
+- which source parts remain `status: unprocessed`
+- which locator should be used for evidence citation
+### 18.5 Agent digest limits
+The `agent-digest.json` output truncates content to keep the file compact for use as a prompt supplement. Implementations SHOULD define these as named constants so they can be tuned as vault size grows.
+| Constant | Default | Description |
+|---|---|---|
+| `MAX_DIGEST_KEY_PAGES` | `50` | Max entity/concept pages included |
+| `MAX_DIGEST_CLAIMS` | `30` | Max top supported claims included |
+| `MAX_DIGEST_QUESTIONS` | `20` | Max open question pages included |
+| `MAX_DIGEST_CONTRADICTIONS` | `10` | Max open contradictions included |
+Implementations MUST NOT silently discard high-value pages due to truncation without surfacing the total counts in `vaultStats`. Operators SHOULD increase limits if `vaultStats` shows totals significantly exceeding the defaults.
+---
+## 19. Search and Indexes
+The compiler MAY emit additional indexes under `_system/indexes/`.
+Examples:
+- alias index
+- tag index
+- page type index
+- stale page index
+- path-to-id index
+- id-to-path index
+These indexes are implementation details and not normative v2 authored data.
+---
+## 20. Reports
+Reports are generated maintenance views.
+### 20.1 Required reports
+When dashboard generation is enabled, the system SHOULD generate:
+- `reports/open-questions.md`
+- `reports/contradictions.md`
+- `reports/low-confidence.md`
+- `reports/claim-health.md`
+- `reports/stale-pages.md`
+### 20.2 Recommended additional reports
+- `reports/orphaned-claims.md`
+- `reports/evidence-gaps.md`
+- `reports/relationship-gaps.md`
+- `reports/timeline-conflicts.md`
+#### 20.3 Report rules
+- Reports SHOULD be fully regenerable.
+- Reports SHOULD NOT be treated as primary truth.
+- Compiler-generated reports SHOULD be treated as fully replaceable generated files.
+- Reports SHOULD identify the compile timestamp.
+---
+## 21. Logs
+Logs capture operational history. They do not replace page frontmatter, structured claims, evidence, or compile caches.
+### 21.1 Log locations
+- `_system/logs/log.md` is the canonical operational log for generated compile events and meaningful skill runs or change batches.
+- Files in `_system/logs/` SHOULD be written by tooling and MUST NOT be hand-edited.
+### 21.2 Log authority
+Logs are non-authoritative operational records. Agents and tooling MUST NOT treat log entries as primary evidence for claims unless the relevant material is promoted into a canonical `source` page.
+### 21.3 `_system/logs/log.md` entries
+Entries in `_system/logs/log.md` SHOULD be prepended so the most recent entry appears first. Entries SHOULD be concise. Each entry SHOULD include:
+- date
+- actor or tool, when known
+- changed area
+- short reason or outcome
+Skills SHOULD write one log entry after each meaningful skill run or change batch. They SHOULD NOT log every individual file write when those writes are part of one coherent operation.
+Trivial report/cache regeneration does not need a log entry unless it records a meaningful vault change or operational incident.
+### 21.4 Log writer
+Operational log entries SHOULD be written through `agent-wiki log`.
+The log writer MUST support:
+```bash
+agent-wiki log --message "<message>"
+```
+The log writer MUST prepend the new entry to `_system/logs/log.md`.
+---
+## 22. Health Rules
+The system SHOULD compute health signals at compile time.
+### 22.1 Low confidence
+A claim SHOULD be considered low confidence when:
+- `confidence < 0.50`
+- or status is `weakly_supported`, `unverified`, or `contested`
+Exact threshold MAY be configurable, but SHOULD be stable.
+#### 22.2 Evidence gaps
+A claim SHOULD appear in evidence-gap reporting when:
+- it has zero evidence entries
+- or only `context_only` evidence exists
+### 22.3 Staleness
+A page or claim MAY be considered stale when:
+- `updatedAt` exceeds configured freshness expectations
+- or linked source retrieval dates are old
+- or evidence is old relative to domain expectations
+v2 does not prescribe one universal stale threshold because domains vary.
+#### 22.4 Contradictions
+A contradiction SHOULD be surfaced when:
+- two claims with overlapping scope conflict materially
+- evidence relations include `contradicts`
+- a claim status is `contradicted`
+- multiple source-backed dates or definitions disagree
+---
+## 23. Freshness Model
+Freshness SHOULD be tracked at multiple levels when possible.
+### 23.1 Recommended fields
+- page `updatedAt`
+- claim `updatedAt`
+- evidence `updatedAt`
+- source `publishedAt`
+- source `retrievedAt`
+#### 23.2 Rule
+A recently edited page is not automatically a fresh page.
+Compile SHOULD distinguish between recent edits and recent underlying evidence.
+---
+## 24. Validation Rules
+A v2 validator SHOULD check the following.
+### 24.1 Required validation
+- every page has a valid `pageType`
+- every page has a unique `id`
+- required frontmatter fields are present
+- claims have unique IDs
+- claims have required fields
+- confidence fields are numeric and in range
+- evidence entries have required fields
+- relation entries have required fields
+- question pages use allowed status enums
+- pages are stored in folders consistent with `pageType`
+#### 24.2 Recommended validation
+- source IDs referenced by evidence exist
+- related page references exist
+- claim IDs referenced by relationships exist when provided
+- aliases do not duplicate canonical title unnecessarily
+- source part pages have valid `parentSourceId`, `partIndex`, `partCount`, and `locator` when `sourceRole: part`
+- parent source pages have ordered, existing `sourceParts` when `sourceRole: parent`
+- partitioned parent source pages do not contain the full long-form source body
+- source pages promoted from non-Markdown raw files include conversion provenance when available
+- conversion warnings are preserved when conversion output is degraded, incomplete, or produced with fallback behavior
+- large converted sources preserve stable locators on source part pages
+---
+## 25. Human Editing Expectations
+Humans MAY:
+- add prose
+- add notes and commentary
+- create new pages
+- update frontmatter
+- add or revise claims manually
+- add questions
+Humans SHOULD NOT:
+- directly hand-edit cache files
+- treat reports as canonical data
+- bypass IDs for important pages
+- mix unrelated claims into one compound claim
+---
+## 26. Agent Editing Expectations
+Agents MUST:
+- preserve human-authored content unless explicitly directed otherwise
+- use stable IDs when generating claims or pages
+- update `updatedAt` when meaningfully changing structured content
+- avoid inventing unsupported certainty
+- update `_system/logs/log.md` through `agent-wiki log` after each meaningful skill run or change batch
+Agents SHOULD:
+- create question pages for unresolved important unknowns
+- attach evidence to claims where possible
+- reuse canonical IDs instead of duplicating objects
+Agents MUST NOT:
+- silently rewrite human commentary unless explicitly directed otherwise
+- delete unresolved uncertainty by omission
+- convert weak evidence into strong support semantics
+- treat reports as primary truth records
+---
+## 27. Example Minimal Entity Page
+```md
+---
+id: entity.place.riverside-community-garden
+pageType: entity
+title: Riverside Community Garden
+entityType: place
+canonicalName: Riverside Community Garden
+status: active
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases:
+  - riverside-garden
+tags:
+  - urban-agriculture
+  - community
+claims:
+  - id: claim.descriptive.garden-weekly-produce-donations
+    text: Riverside Community Garden donates a portion of its weekly produce harvest to the neighborhood food pantry.
+    status: supported
+    confidence: 0.91
+    claimType: descriptive
+    relatedClaimIds: []
+    evidence:
+      - id: evidence.quote.supports.a1b2c3d4
+        sourceId: source.2026-04-12.webpage.garden-newsletter
+        path: sources/2026-04-12.webpage.garden-newsletter.md
+        lines: 55-79
+        kind: quote
+        relation: supports
+        weight: 0.86
+        note: The newsletter describes the weekly donation arrangement.
+        excerpt: "Each Friday harvest includes a pantry donation box."
+        retrievedAt: 2026-04-12
+        updatedAt: 2026-04-12
+    createdAt: 2026-04-12
+    updatedAt: 2026-04-12
+relations:
+  - subject: entity.place.riverside-community-garden
+    predicate: supports
+    object: entity.organization.neighborhood-food-pantry
+    confidence: 0.88
+    sourceClaimIds:
+      - "[[claim.descriptive.garden-weekly-produce-donations]]"
+---
+# Riverside Community Garden
+Riverside Community Garden is a neighborhood garden that coordinates volunteer planting, harvest tracking, and weekly produce donations.
+```
+---
+## 28. Example Question Page
+```md
+---
+id: question.maintenance.flood-sensor-calibration
+pageType: question
+title: Which flood sensors need calibration before storm season?
+priority: high
+status: open
+relatedClaims:
+  - "[[claim.descriptive.sensor-readings-drifted]]"
+relatedPages:
+  - "[[coastal-resilience-overview]]"
+openedAt: 2026-04-12
+createdAt: 2026-04-12
+updatedAt: 2026-04-12
+aliases: []
+tags:
+  - flood-monitoring
+  - open-question
+---
+# Which flood sensors need calibration before storm season?
+## Context
+This question exists because several river gauge readings drifted from manual spot checks during the spring inspection.
+## Current concern
+We need to identify which sensors require calibration before they are used for storm-season alerting.
+```
+---
+## 29. Compatibility Notes
+v2 implementations MAY add fields beyond this spec, provided they do not break:
+- required fields
+- required enum values
+- compile output expectations
+Unknown fields MUST be preserved by conforming tooling when possible.