PyPI - mneme-cli - Versions diffs - 0.5.1__tar.gz → 0.5.3__tar.gz - Mend

mneme-cli 0.5.1tar.gz → 0.5.3tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (50) hide show

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/AGENTS.md RENAMED Viewed

@@ -280,6 +280,93 @@ plain-markdown `<details>` fallback so the page is useful outside
 Obsidian. Run after a large ingest, or whenever the wiki's shape
 changes meaningfully.
+### 3.9 TRACE — linking the full V-model chain
+The trace chain a notified body expects has two legs that both terminate
+at code and tests:
+```
+UN  ──implemented-by──┐
+                      ├──> REQ ──detailed-in──> DDS ──implemented-in──> codebase
+RMA ──mitigated-by────┘                             └──verified-by───> tests
+```
+The first three links (UN→REQ, RMA→REQ, REQ→DDS) are created
+automatically by the CSV mappings in `profiles/mappings/` (or by
+`mneme trace add` when ingesting structured sources). The last two
+links (DDS→codebase, DDS→tests) close the V-model and are the agent's
+responsibility when a user passes you one or more repositories.
+**When the user passes you a repo path, you must:**
+```bash
+# 1. Inventory: what code modules / test files exist?
+mneme scan-repo <repo-path> <client>
+# → reports which wiki pages reference the repo's modules, and which do not.
+# 2. For each DDS page that corresponds to a code module, add the link.
+#    The target is a git URL or an absolute repo path; mneme treats it
+#    as an opaque string (not a wiki slug) — the target may live outside
+#    the workspace.
+mneme trace add <client>/dds-cyb-001 \
+                "github.com/<org>/<repo>/blob/main/src/auth/password_policy.py" \
+                implemented-in
+# 3. For each DDS page that has a corresponding test, add the link.
+#    The test target can be a wiki page (for test-plan docs) or an
+#    external path (for a test file in a repo).
+mneme trace add <client>/dds-cyb-001 <client>/test-auth-001 verified-by
+mneme trace add <client>/dds-cyb-001 \
+                "github.com/<org>/<repo>/blob/main/tests/test_password_policy.py" \
+                verified-by
+```
+Do this for every DDS page that has implementing code or a verifying
+test. When there are tens or hundreds of links to create (typical for
+a real medical-device codebase):
+```bash
+# Batch approach — the agent parses the repo, maps DDS → files,
+# then writes a shell script of `mneme trace add` lines and runs it.
+# mneme has no bulk-trace-add subcommand yet; scripting is the way.
+for pair in dds-cyb-001:src/auth/password_policy.py \
+            dds-cyb-002:src/auth/mfa.py \
+            dds-cyb-003:src/auth/rate_limiter.py; do
+  dds=${pair%%:*}; file=${pair##*:}
+  mneme trace add <client>/$dds "<repo-url>/$file" implemented-in
+done
+```
+**Verify the chain is now complete:**
+```bash
+mneme trace gaps <client>
+# → should report 0 hazards without mitigation, 0 DDS without
+#   implementation link, 0 DDS without verification link
+mneme trace show <client>/un-001
+# → UN.001
+#     implemented-by -> REQ.SYS.001
+#         detailed-in -> DDS.CYB.001
+#             implemented-in -> github.com/.../password_policy.py
+#             verified-by    -> github.com/.../test_password_policy.py
+```
+**Relationship vocabulary — use exactly these strings:**
+| Relationship | From → To | Semantics |
+|---|---|---|
+| `implemented-by` | UN → REQ | The user need is met by this requirement |
+| `mitigated-by` | RMA → REQ | The hazard is mitigated by this requirement |
+| `derived-from` | REQ → UN / REQ → higher-level REQ | Parent requirement |
+| `detailed-in` | REQ → DDS | The requirement is elaborated by this design spec |
+| `implemented-in` | DDS → codebase | The design spec is realised by this source file / module |
+| `verified-by` | DDS → test / REQ → test | The spec/requirement is verified by this test |
+| `validated-by` | DDS → clinical/usability study | Validation (not verification) evidence |
+Stick to this vocabulary. Custom relationships confuse downstream
+matrix exports and break the default `trace gaps` heuristics.
 ---
 ## 4. Profiles and the writing-style contract
@@ -572,7 +659,306 @@ file.
 Stop conditions: inbox is empty, `mneme stats` shows a plausible page
 count, and `mneme lint` reports no critical issues.
-### 6.6 Pre-submission readiness check before sending to a notified body
+### 6.6 Close the V-model by linking DDS to codebase and tests
+The user has just handed you one or more repositories. Your job is to
+connect every DDS page to the implementing source file(s) and the
+verifying test file(s) so `mneme trace show` walks end-to-end from a
+user need / hazard all the way to the exact line of code and the exact
+test that exercises it.
+```
+1. mneme profile show                            # sanity check
+2. mneme trace matrix <client>                   # baseline — which DDS exist?
+3. For each repo the user passes:
+     a. mneme scan-repo <repo-path> <client>     # surface module gaps
+     b. Read the repo tree and README yourself.
+        Build a mapping: DDS ID -> [source files]
+                         DDS ID -> [test files]
+        Prefer explicit evidence (comments referencing the DDS ID,
+        module/function names that mirror the DDS title, docstrings
+        that cite the requirement). When evidence is weak, flag the
+        DDS as ambiguous and surface it — do not guess.
+4. For each confident (DDS, file) pair:
+     mneme trace add <client>/<dds-slug> "<repo-url-or-path>/<file>" implemented-in
+     mneme trace add <client>/<dds-slug> "<repo-url-or-path>/<test-file>" verified-by
+   Batch these in a shell loop — there is no bulk-trace-add subcommand.
+5. mneme trace gaps <client>                     # should trend to zero
+6. mneme trace show <client>/un-001              # spot-check: full chain
+                                                   from UN to test file?
+7. mneme trace matrix <client> --csv --out trace-matrix.csv
+                                                 # DHF-ready export
+```
+Stop conditions: (a) every DDS page either has both `implemented-in`
+and `verified-by` trace links OR is explicitly flagged ambiguous in a
+report to the user, AND (b) `trace gaps` reports zero open chains.
+Hard rules:
+- Do not fabricate file paths. If the repo has no file matching a DDS,
+  report the gap and stop — the user must either point you at another
+  repo or add the link manually.
+- Trace targets for external files are opaque strings. Use a stable
+  form the team can resolve later (a git URL with a pinned commit is
+  ideal; a bare relative path is fine when the repo lives alongside
+  the workspace).
+- Never rewrite a DDS page's body to embed the code link. The link
+  lives in `schema/traceability.json` only. Wiki pages stay prose.
+### 6.7 Ingest a code repo into the wiki as searchable module summaries
+The user has handed you a code repo. Your job is to produce one wiki page
+per logical module so future agents can answer "how does this codebase do
+X?" through `mneme search` instead of re-reading the source.
+This is the foundation for any later code-aware work (style-matched
+extension, refactor planning, gap analysis). It does not modify the repo —
+read-only ingestion.
+```
+1. Walk <REPO_PATH>. Skip: .git, node_modules, .venv, dist, build,
+   __pycache__, anything in .gitignore.
+2. Group files into logical modules. Heuristics:
+   - A directory containing __init__.py / mod.rs / index.ts / mod.go
+     is one module.
+   - A standalone script with no siblings is one module.
+   - Tests (tests/ or *_test.* alongside) are part of the module they
+     test, not separate modules.
+3. For each module, write a summary file at
+     /tmp/mneme-summaries/<module-path>.md
+   with this exact frontmatter and section structure:
+   ---
+   title: <Module Name>
+   type: code-summary
+   client: <CLIENT_SLUG>
+   sources:
+     - <repo-relative path of every file in the module>
+   tags:
+     - code
+     - <language>
+     - <one-or-two-domain-tags>
+   ---
+   ## Purpose
+   One paragraph in plain English. No code.
+   ## Public API
+   List of exported functions / classes / types, one line each.
+   Format: `name(args) -> return_type` then a sentence.
+   ## Key data structures
+   Non-trivial types or schemas this module owns. Skip if none.
+   ## Dependencies
+   - Internal: which other modules in this repo it imports
+   - External: which libraries (with pinned version if any)
+   ## Tests
+   Path to test file(s) + one sentence on coverage shape.
+   ## Conventions observed
+   3-5 bullets: error style, async/sync, naming, comment density, etc.
+4. For files too large to read in one pass:
+   a. Read the first 200 lines.
+   b. Read the last 100 lines.
+   c. If the file has a clear table-of-contents (a __all__, an exports
+      block, a class index near the top), use it to guide which middle
+      sections to read in additional 200-line chunks.
+   d. State in the summary's Purpose section that this was a partial
+      read, and tag the page `partial-read` so a future pass can
+      revisit.
+5. Ingest the summaries in one pass:
+     mneme ingest-dir /tmp/mneme-summaries <CLIENT_SLUG> --recursive --flat
+   Use --flat: the summaries already encode their path in the slug, and
+   they don't live under sources/<CLIENT_SLUG>/ so subpath auto-detection
+   won't help.
+6. Smoke-test:
+     mneme stats
+     mneme search "<a real concept from the repo>" --client <CLIENT_SLUG>
+     mneme tags list
+```
+Stop conditions: every module in the repo (modulo the skip list) has a
+wiki summary, and a search for a known concept returns the right module.
+Hard rules:
+- Do not generate summaries for files you did not actually read. Partial
+  reads must be tagged `partial-read` in the page's frontmatter.
+- Do not speculate. If a module's purpose is unclear from the code, write
+  "unclear, needs human review" and tag the page `needs-review`.
+- Do not modify the repo. Read-only.
+- Keep summaries under 300 lines. They are pointers, not replacements.
+- One module = one wiki page. Do not split a module across pages, and
+  do not merge unrelated modules into one page.
+Report when done: total modules summarized, count tagged `partial-read`,
+count tagged `needs-review`, directories skipped and why, and the three
+search queries you used to verify the ingest.
+### 6.8 Augment a wiki page with knowledge from ingested code summaries
+Pre-condition: 6.7 has run, so the repo is in the wiki as `code-summary`
+pages. You now have a target wiki page (sparse, half-finished, or
+explicitly marked TBD) and you want to enrich it with sections that
+draw on the code knowledge — in the page's existing voice, with every
+claim cited.
+This is selective augment, not regeneration. Existing prose is sacred.
+```
+1. Read the target page in full at <WORKSPACE>/wiki/<client>/<page>.md.
+   Note: existing tone, sentence length, citation density, heading depth,
+   table-of-contents shape. These define the local style you must match.
+2. Decide what to add. Two paths:
+   a. Human-driven: the user told you "add a Performance Characteristics
+      section drawing latency data from the codebase." Skip to step 3.
+   b. Agent-driven: gap analysis. Compare the target's actual sections
+      against (i) the active profile's expected sections for this
+      doc-type (run `mneme profile show`), and (ii) topics covered by
+      code-summary pages that the target does not cite. Propose 1-5
+      candidate sections to the human and wait for confirmation. Do not
+      add sections without confirmation.
+3. For each agreed section, gather evidence:
+     mneme search "<topic keywords>" --client <client> -k 20
+   Prefer hits with the `code` tag for implementation details. Prefer
+   regulatory wiki pages for context and definitions. Read the top hits
+   in full before writing.
+4. Draft the section. Hard requirements:
+   - Match the target's local style. Local consistency wins over the
+     active profile's global rules within a single page.
+   - Every non-trivial claim cites its source as
+     `(wiki: <client>/<page>)` or `(source: <repo-relative-path>)`.
+   - When evidence is insufficient for a claim, do not invent it.
+     Insert `[TO ADD REF]` and continue.
+5. Insert at the structurally correct location. Read the target's TOC.
+   The new section's heading depth and ordering must follow the
+   document's own logic, not your intuition.
+6. Update the target's frontmatter:
+   - Append every newly cited source to the `sources:` list.
+   - Bump `updated:` to today.
+   - If the page was previously marked draft / TBD and is now complete,
+     update `confidence:` accordingly.
+7. Re-ingest the target so search picks up the new content. Two options:
+   a. If the page has a corresponding source file in sources/<client>/,
+      mirror your wiki edits back to it and run:
+        mneme resync sources/<client>/<path-to-source> <client>
+   b. Otherwise, edit the wiki page directly and run:
+        mneme reindex
+```
+Stop conditions: every agreed section is either (a) written with full
+citations, or (b) explicitly flagged as evidence-insufficient and
+reported back to the human. The page passes
+`mneme validate writing-style <client>/<page>` against the active
+profile.
+Hard rules:
+- Do NOT rewrite existing prose. Augment only — add new sections, do not
+  edit current ones unless explicitly asked.
+- Do NOT fabricate citations. Every `(wiki: ...)` and `(source: ...)`
+  reference must resolve to an actual page or file.
+- Do NOT exceed the human-confirmed scope. If gap analysis surfaced 5
+  candidate sections and the human approved 2, write only those 2.
+- Do NOT touch the page's frontmatter `created:` or `client:` fields.
+Report when done: sections added (with line counts), sources cited
+(deduplicated list), any sections you were asked to write but skipped
+because evidence was insufficient (with a one-line explanation per skip),
+and the result of the post-edit `mneme validate writing-style` run.
+### 6.9 Validate a claim against the literature wiki
+You are about to write or have already written a factual claim in a
+deliverable (DVR, CER, technical documentation, etc.). Before the
+claim ships to a notified body, it must be backed by an authoritative
+source — or explicitly carry `[TO ADD REF]` so the gap is visible.
+Pre-condition: the relevant literature has been ingested into the wiki
+(typically under `research-questions/` or similar) and tagged with
+`literature` plus an authority marker (`authority` / `non-authority`).
+If those tags don't exist, run a one-time `mneme tags bulk-suggest` /
+`bulk-apply` pass to add them — see Step 3 in the README.
+```
+1. Identify the claim. Reduce it to its load-bearing assertion.
+   "Parkinsonian tremor manifests primarily in the 4-6 Hz band" is a
+   claim. "Tremor is a problem" is not — too vague to validate.
+2. Search the literature for evidence. Be specific in the query:
+     mneme search "<claim keywords>" --client <client> -k 30
+   When `mneme search --tag` is available (planned), prefer:
+     mneme search "<claim keywords>" --client <client> --tag authority -k 20
+3. Read the top hits in full. Sort the relevant ones into three buckets:
+   a. AUTHORITY supports the claim (peer-reviewed, recent, on-topic)
+   b. NON-AUTHORITY supports the claim (preprints, blog posts, secondary)
+   c. Nothing relevant, or hits contradict the claim
+4. Decide based on the bucket:
+   a. AUTHORITY support
+      -> Write the claim with the citation:
+         "...4-6 Hz band (wiki: <client>/research-questions/.../<page>)."
+      -> Append the cited page to the deliverable's frontmatter
+         `sources:` list if not already present.
+   b. NON-AUTHORITY support only
+      -> Either soften the claim ("Preliminary reports suggest..."),
+         OR keep the strong form with [TO ADD REF] and find an
+         authority source separately.
+      -> Do NOT cite a non-authority source as if it were authoritative.
+   c. No support / contradicting evidence
+      -> Three options, in order of preference:
+         i.  Drop the claim. The deliverable doesn't need it.
+         ii. Find a new authority source. Drop the PDF into
+             sources/<client>/<literature-path>/, summarize and ingest
+             it (run a single-page version of 6.7), then return to step 2.
+         iii. Keep the claim but mark it [TO ADD REF] AND open a
+              tracked TODO so the gap doesn't ship by accident.
+5. After resolving the claim (or marking it), run:
+     mneme validate writing-style <client>/<deliverable-page>
+   The review packet flags every remaining [TO ADD REF] and every
+   uncited factual claim. Address them or hand the page back to the
+   human reviewer with the gaps surfaced.
+```
+Stop conditions: the claim is either (a) cited with an authority
+source, (b) softened to match the strength of the available evidence,
+(c) dropped, or (d) explicitly marked `[TO ADD REF]` AND tracked for
+follow-up. Never (e) cited with fabricated or non-authoritative
+evidence dressed as authoritative.
+Hard rules:
+- Do NOT cite a wiki page you did not read. Read every page you cite.
+- Do NOT cite a non-authority source as `(wiki: ...)` without making
+  its non-authority status visible in the surrounding prose.
+- Do NOT silently weaken or rewrite the claim to dodge the citation
+  requirement. If the evidence is weak, say so.
+- Do NOT bulk-clear `[TO ADD REF]` markers without going through this
+  procedure for each one. Each marker is a discrete claim that needs
+  individual evidence.
+Report when done: the original claim, the final form of the claim
+(verbatim if changed), the citation added (or the [TO ADD REF] marker
+left in place), the wiki pages read, and a one-line note on whether
+this gap should be tracked for human follow-up.
+### 6.10 Pre-submission readiness check before sending to a notified body
 ```
 1. mneme profile show                            # confirm active profile

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/CHANGELOG.md RENAMED Viewed

@@ -4,6 +4,71 @@ All notable changes to this project are documented here.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [0.5.2] - 2026-04-14
+### Changed
+- **`ingest-dir --preserve-structure` is now the default.** The wiki now
+  mirrors the source directory layout unless you pass `--flat`. This avoids
+  silent same-basename collisions (e.g. multiple `INSTRUCTIONS.md` files from
+  different source directories overwriting each other). Closes suggestion #15.
+- **`mneme ingest` (single-file) also mirrors by default.** When the source
+  lives under `sources/<client>/`, its relative position becomes a wiki
+  subpath automatically. Pass `--flat` to opt out.
+### Fixed
+- **`mneme profile list`** now discovers profiles correctly. Previously it
+  filtered files by `.json` (wrong extension — profiles are markdown) and
+  only checked the bundled directory, which meant the shipped `eu-mdr.md`
+  and `iso-13485.md` profiles appeared as "No profiles found". Now unions
+  workspace + bundled, marks origin, and flags shadowed bundled profiles.
+  Closes suggestion #25 discovery bug.
+### Added
+- **`ingest-dir --flat`** — explicit opt-out for the new preserve-structure
+  default.
+- **`ingest --flat`** — opt-out for the single-file command.
+- **xlsx support is now built-in.** `openpyxl` moved from
+  `[project.optional-dependencies].xlsx` to `dependencies`. The `[xlsx]`
+  extra is kept for backwards compatibility but is no longer required.
+### Documentation
+- **README**: expanded the agent end-to-end example. Step 3 now covers
+  bulk tagging (`tags bulk-suggest` + `bulk-apply`), Step 3b adds entity
+  typing (`entity suggest` + `bulk-apply`), and Step 3c walks the full
+  V-model trace chain (UN→REQ→DDS and RMA→REQ→DDS, terminating at code
+  and tests).
+- **AGENTS.md**: new section 3.9 "TRACE — linking the full V-model
+  chain" documents the `implemented-in` / `verified-by` relationships
+  and the DDS-to-codebase linking agents must perform when the user
+  passes repositories. New task template 6.6 "Close the V-model by
+  linking DDS to codebase and tests" gives the exact procedure, stop
+  conditions, and hard rules (no fabricated paths, trace targets are
+  opaque strings, never embed code links in page bodies).
+## [0.5.3] - 2026-04-15
+### Documentation
+- **AGENTS.md**: new task template 6.7 "Ingest a code repo into the
+  wiki as searchable module summaries" — the foundation for any
+  code-aware agent work. One wiki page per logical module, chunked
+  reading for large files, explicit tagging for partial/unclear pages,
+  and `mneme ingest-dir --flat` for the bulk write.
+- **AGENTS.md**: new task template 6.8 "Augment a wiki page with
+  knowledge from ingested code summaries" — selective enrichment of a
+  target page using evidence drawn from the code summaries produced by
+  6.7. Existing prose is sacred; agent only adds new sections, in the
+  page's local style, with every claim cited.
+- **AGENTS.md**: new task template 6.9 "Validate a claim against the
+  literature wiki" — the discipline an agent applies before any
+  factual claim ships to a notified body. Three buckets (authority /
+  non-authority / no support), four resolutions (cite / soften / drop /
+  mark `[TO ADD REF]`), zero tolerance for non-authority dressed as
+  authoritative.
 ## [0.5.0] - 2026-04-13
 ### Breaking Changes

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: mneme-cli
-Version: 0.5.1
+Version: 0.5.3
 Summary: mneme - CLI tool that turns documents into a searchable second brain. Ingest once, query forever.
 Author-email: Tolis Moustaklis <apostolos.moustaklis@gmail.com>
 License-Expression: MIT
@@ -29,13 +29,13 @@ Requires-Python: >=3.9
 Description-Content-Type: text/markdown
 License-File: LICENSE
 Requires-Dist: portalocker>=2.0.0
+Requires-Dist: openpyxl>=3.1.0
 Provides-Extra: pdf
 Requires-Dist: pymupdf>=1.23.0; extra == "pdf"
 Provides-Extra: xlsx
 Requires-Dist: openpyxl>=3.1.0; extra == "xlsx"
 Provides-Extra: all
 Requires-Dist: pymupdf>=1.23.0; extra == "all"
-Requires-Dist: openpyxl>=3.1.0; extra == "all"
 Provides-Extra: release
 Requires-Dist: build>=1.0.0; extra == "release"
 Requires-Dist: twine>=5.0.0; extra == "release"
@@ -183,7 +183,7 @@ One installed CLI serves many projects — each workspace is just a directory.
 | `mneme stats` | Health overview |
 | `mneme repair` | Fix corrupted archives |
-**Formats:** `.md`, `.txt`, `.pdf`, `.xlsx` (with `pip install "mneme-cli[xlsx]"`)
+**Formats:** `.md`, `.txt`, `.pdf`, `.xlsx` (built-in), plus `.csv` via `mneme ingest-csv`
 ---
@@ -238,38 +238,115 @@ Creates the workspace tree, sets the EU MDR writing-style profile, and initializ
 cp -r ~/Downloads/parkinson-research/* inbox/
 mneme tornado --client parkiwatch
-# Or ingest individual files
+# Or ingest individual files (auto-mirrors sources/<client>/ layout into wiki/)
 mneme ingest research-paper.pdf parkiwatch
-mneme ingest-csv risk-register.csv parkiwatch --mapping risk-register
 mneme ingest spec-table.xlsx parkiwatch          # .xlsx renders sheets as markdown tables
-mneme ingest-dir docs/ parkiwatch --recursive    # walk subdirectories
+mneme ingest-dir docs/ parkiwatch --recursive    # walk subdirectories, preserve structure
+# Structured CSV ingestion — one row becomes one wiki page + trace links.
+# Mappings live in <workspace>/profiles/mappings/ or are auto-detected.
+mneme ingest-csv user-needs.csv    parkiwatch --mapping parkiwatch-user-needs
+mneme ingest-csv requirements.csv  parkiwatch --mapping parkiwatch-req
+mneme ingest-csv design-specs.csv  parkiwatch --mapping parkiwatch-dds
+mneme ingest-csv risk-register.csv parkiwatch --mapping parkiwatch-rma
 ```
-What happens per ingest: source file → wiki page in `wiki/parkiwatch/` → frontmatter with auto-extracted entities → entry in `index.md` → row in the FTS5 search DB → log entry.
+What happens per ingest: source file → wiki page in `wiki/parkiwatch/<mirrored-subpath>/` → frontmatter with auto-extracted proper-noun entities → entry in `index.md` → row in the FTS5 search DB → log entry. CSV ingests additionally create trace links (e.g. UN→REQ `implemented-by`, REQ→DDS `detailed-in`) in `schema/traceability.json`.
-### Step 3 — Tag the new pages (LLM agent)
+### Step 3 — Tag many pages at once (LLM agent, bulk)
-The new pages have only the auto-applied `parkiwatch` client tag. The agent now adds meaningful tags:
+New pages have only the auto-applied `parkiwatch` client tag. The agent tags them in batches:
 ```bash
-# For each new page, the agent runs:
-mneme tags suggest parkiwatch/research-paper > /tmp/packet.md
+# 1. Pack up to 30 untagged pages into a single review packet.
+#    --filter scopes by wiki_path substring; omit for everything.
+mneme tags bulk-suggest --filter indicators --limit 30 \
+                        --json --out /tmp/tag-packet.json
 ```
-The packet contains the page body, the current tag taxonomy (every tag in the workspace + usage counts), and a ready-to-paste prompt. **The LLM reads the packet** — it understands the content and decides on tags, preferring existing taxonomy entries when they fit. The LLM's response is JSON:
+The packet contains, for each page: wiki_path, title, current tags, body excerpt, and the existing taxonomy with usage counts. **The LLM reads the packet** and returns a response JSON:
 ```json
-{"tags": ["clinical-trial", "iso-13485"], "new_tags": ["bradykinesia-detection"]}
+{
+  "pages": [
+    {"wiki_path": "parkiwatch/indicators/bda_algorithm_description.md",
+     "add": ["bradykinesia", "algorithm", "imu", "medical-device"]},
+    {"wiki_path": "parkiwatch/indicators/tremor_indicator_dataflow.md",
+     "add": ["tremor", "dataflow", "imu", "algorithm"]}
+  ]
+}
 ```
-The agent then runs:
+```bash
+# 2. Apply all decisions in one atomic call
+mneme tags bulk-apply /tmp/tag-response.json
+# → Pages updated: 9   Tags added: 42   Tags removed: 0
+```
+Each application rewrites the wiki page frontmatter, updates `schema/tags.json`, re-indexes the page in FTS5, and appends a log entry. Subsequent packets reuse the growing taxonomy, so the vocabulary converges.
+For single pages use `mneme tags suggest <slug>` + `mneme tags apply <slug> --add a,b,c`.
+### Step 3b — Classify entities by type (LLM agent)
+Ingest auto-extracts capitalized proper nouns (e.g. "Parkiwatch", "IEC 62304") into `schema/entities.json` with `type: unknown`. Typing is an LLM judgement call, handled the same packet way as tags:
 ```bash
-mneme tags apply parkiwatch/research-paper \
-  --add clinical-trial,iso-13485,bradykinesia-detection
+# 1. Build an entity-classification packet (up to 50 unclassified entities)
+mneme entity suggest --client parkiwatch --limit 50 \
+                     --json --out /tmp/entity-packet.json
+# 2. LLM reads the packet and returns classifications:
+#    [{"id": "iec-62304", "type": "standard"},
+#     {"id": "notified-body", "type": "organization"},
+#     {"id": "bradykinesia", "type": "concept"}, ...]
+# 3. Apply atomically
+mneme entity bulk-apply /tmp/entity-response.json
+# → Entities typed: 47   Errors: 0
+```
+Supported types include `standard`, `organization`, `person`, `concept`, `technology`, `regulation`, or any custom type the profile defines. Typed entities power filtered search and the knowledge graph.
+### Step 3c — Verify the trace chain (human, on demand)
+The CSV ingests in Step 2 created two parallel trace chains. Both converge at a requirement, drill into design specs, and finally terminate at **code** and **tests** — the complete QMS traceability an auditor expects:
+```
+Chain A:  UN ─┐
+              ├─> REQ ──> DDS ──┬─> codebase  (via `implemented-in`)
+Chain B:  RMA ┘                 └─> tests     (via `verified-by`)
 ```
-Atomic operation: rewrites the wiki page frontmatter, updates `schema/tags.json`, re-indexes the page in FTS5 (so search picks up the new tags immediately), appends a log entry. **Repeat for every page** — the taxonomy grows, and subsequent pages tend to reuse existing tags (consistency).
+Each arrow is a trace-link relationship type (`implemented-by`, `mitigated-by`, `detailed-in`, `implemented-in`, `verified-by`). The DDS→codebase link is stored as a frontmatter field on each DDS page (e.g. a git URL pointing at the implementing module). The DDS→tests link is a standard trace relationship added either by CSV ingest or by `mneme trace add`.
+Walk either chain from any root page:
+```bash
+# Chain A — from a user need forward to the specs that implement it
+mneme trace show parkiwatch/un-001
+# → UN.001 (secure sign-in)
+#     implemented-by -> REQ.SYS.001 (User Authentication)
+#         detailed-in -> DDS.CYB.001 (Strong Password Policy)
+#         detailed-in -> DDS.CYB.002 (Multi-Factor Authentication)
+#         ...
+# Chain B — from a hazard forward to the specs that mitigate it
+mneme trace show parkiwatch/rma-cyb-002
+# → RMA.CYB.002 (Unauthorized access -- weak passwords)
+#     mitigated-by -> REQ.SYS.001 (User Authentication)
+#         detailed-in -> DDS.CYB.001, DDS.CYB.002, ...
+#             implemented-in -> src/auth/password_policy.py   (codebase)
+#             verified-by    -> TEST.AUTH.001                 (tests)
+# Trace gaps for a notified body audit
+mneme trace gaps parkiwatch
+# → Hazards with no mitigation: ...
+#   User needs with no requirements: ...
+# Export the full traceability matrix for the DHF
+mneme trace matrix parkiwatch --csv --out trace-matrix.csv
+```
 ### Step 4 — Search the knowledge base (anyone)

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/README.md RENAMED Viewed

@@ -140,7 +140,7 @@ One installed CLI serves many projects — each workspace is just a directory.
 | `mneme stats` | Health overview |
 | `mneme repair` | Fix corrupted archives |
-**Formats:** `.md`, `.txt`, `.pdf`, `.xlsx` (with `pip install "mneme-cli[xlsx]"`)
+**Formats:** `.md`, `.txt`, `.pdf`, `.xlsx` (built-in), plus `.csv` via `mneme ingest-csv`
 ---
@@ -195,38 +195,115 @@ Creates the workspace tree, sets the EU MDR writing-style profile, and initializ
 cp -r ~/Downloads/parkinson-research/* inbox/
 mneme tornado --client parkiwatch
-# Or ingest individual files
+# Or ingest individual files (auto-mirrors sources/<client>/ layout into wiki/)
 mneme ingest research-paper.pdf parkiwatch
-mneme ingest-csv risk-register.csv parkiwatch --mapping risk-register
 mneme ingest spec-table.xlsx parkiwatch          # .xlsx renders sheets as markdown tables
-mneme ingest-dir docs/ parkiwatch --recursive    # walk subdirectories
+mneme ingest-dir docs/ parkiwatch --recursive    # walk subdirectories, preserve structure
+# Structured CSV ingestion — one row becomes one wiki page + trace links.
+# Mappings live in <workspace>/profiles/mappings/ or are auto-detected.
+mneme ingest-csv user-needs.csv    parkiwatch --mapping parkiwatch-user-needs
+mneme ingest-csv requirements.csv  parkiwatch --mapping parkiwatch-req
+mneme ingest-csv design-specs.csv  parkiwatch --mapping parkiwatch-dds
+mneme ingest-csv risk-register.csv parkiwatch --mapping parkiwatch-rma
 ```
-What happens per ingest: source file → wiki page in `wiki/parkiwatch/` → frontmatter with auto-extracted entities → entry in `index.md` → row in the FTS5 search DB → log entry.
+What happens per ingest: source file → wiki page in `wiki/parkiwatch/<mirrored-subpath>/` → frontmatter with auto-extracted proper-noun entities → entry in `index.md` → row in the FTS5 search DB → log entry. CSV ingests additionally create trace links (e.g. UN→REQ `implemented-by`, REQ→DDS `detailed-in`) in `schema/traceability.json`.
-### Step 3 — Tag the new pages (LLM agent)
+### Step 3 — Tag many pages at once (LLM agent, bulk)
-The new pages have only the auto-applied `parkiwatch` client tag. The agent now adds meaningful tags:
+New pages have only the auto-applied `parkiwatch` client tag. The agent tags them in batches:
 ```bash
-# For each new page, the agent runs:
-mneme tags suggest parkiwatch/research-paper > /tmp/packet.md
+# 1. Pack up to 30 untagged pages into a single review packet.
+#    --filter scopes by wiki_path substring; omit for everything.
+mneme tags bulk-suggest --filter indicators --limit 30 \
+                        --json --out /tmp/tag-packet.json
 ```
-The packet contains the page body, the current tag taxonomy (every tag in the workspace + usage counts), and a ready-to-paste prompt. **The LLM reads the packet** — it understands the content and decides on tags, preferring existing taxonomy entries when they fit. The LLM's response is JSON:
+The packet contains, for each page: wiki_path, title, current tags, body excerpt, and the existing taxonomy with usage counts. **The LLM reads the packet** and returns a response JSON:
 ```json
-{"tags": ["clinical-trial", "iso-13485"], "new_tags": ["bradykinesia-detection"]}
+{
+  "pages": [
+    {"wiki_path": "parkiwatch/indicators/bda_algorithm_description.md",
+     "add": ["bradykinesia", "algorithm", "imu", "medical-device"]},
+    {"wiki_path": "parkiwatch/indicators/tremor_indicator_dataflow.md",
+     "add": ["tremor", "dataflow", "imu", "algorithm"]}
+  ]
+}
 ```
-The agent then runs:
+```bash
+# 2. Apply all decisions in one atomic call
+mneme tags bulk-apply /tmp/tag-response.json
+# → Pages updated: 9   Tags added: 42   Tags removed: 0
+```
+Each application rewrites the wiki page frontmatter, updates `schema/tags.json`, re-indexes the page in FTS5, and appends a log entry. Subsequent packets reuse the growing taxonomy, so the vocabulary converges.
+For single pages use `mneme tags suggest <slug>` + `mneme tags apply <slug> --add a,b,c`.
+### Step 3b — Classify entities by type (LLM agent)
+Ingest auto-extracts capitalized proper nouns (e.g. "Parkiwatch", "IEC 62304") into `schema/entities.json` with `type: unknown`. Typing is an LLM judgement call, handled the same packet way as tags:
 ```bash
-mneme tags apply parkiwatch/research-paper \
-  --add clinical-trial,iso-13485,bradykinesia-detection
+# 1. Build an entity-classification packet (up to 50 unclassified entities)
+mneme entity suggest --client parkiwatch --limit 50 \
+                     --json --out /tmp/entity-packet.json
+# 2. LLM reads the packet and returns classifications:
+#    [{"id": "iec-62304", "type": "standard"},
+#     {"id": "notified-body", "type": "organization"},
+#     {"id": "bradykinesia", "type": "concept"}, ...]
+# 3. Apply atomically
+mneme entity bulk-apply /tmp/entity-response.json
+# → Entities typed: 47   Errors: 0
+```
+Supported types include `standard`, `organization`, `person`, `concept`, `technology`, `regulation`, or any custom type the profile defines. Typed entities power filtered search and the knowledge graph.
+### Step 3c — Verify the trace chain (human, on demand)
+The CSV ingests in Step 2 created two parallel trace chains. Both converge at a requirement, drill into design specs, and finally terminate at **code** and **tests** — the complete QMS traceability an auditor expects:
+```
+Chain A:  UN ─┐
+              ├─> REQ ──> DDS ──┬─> codebase  (via `implemented-in`)
+Chain B:  RMA ┘                 └─> tests     (via `verified-by`)
 ```
-Atomic operation: rewrites the wiki page frontmatter, updates `schema/tags.json`, re-indexes the page in FTS5 (so search picks up the new tags immediately), appends a log entry. **Repeat for every page** — the taxonomy grows, and subsequent pages tend to reuse existing tags (consistency).
+Each arrow is a trace-link relationship type (`implemented-by`, `mitigated-by`, `detailed-in`, `implemented-in`, `verified-by`). The DDS→codebase link is stored as a frontmatter field on each DDS page (e.g. a git URL pointing at the implementing module). The DDS→tests link is a standard trace relationship added either by CSV ingest or by `mneme trace add`.
+Walk either chain from any root page:
+```bash
+# Chain A — from a user need forward to the specs that implement it
+mneme trace show parkiwatch/un-001
+# → UN.001 (secure sign-in)
+#     implemented-by -> REQ.SYS.001 (User Authentication)
+#         detailed-in -> DDS.CYB.001 (Strong Password Policy)
+#         detailed-in -> DDS.CYB.002 (Multi-Factor Authentication)
+#         ...
+# Chain B — from a hazard forward to the specs that mitigate it
+mneme trace show parkiwatch/rma-cyb-002
+# → RMA.CYB.002 (Unauthorized access -- weak passwords)
+#     mitigated-by -> REQ.SYS.001 (User Authentication)
+#         detailed-in -> DDS.CYB.001, DDS.CYB.002, ...
+#             implemented-in -> src/auth/password_policy.py   (codebase)
+#             verified-by    -> TEST.AUTH.001                 (tests)
+# Trace gaps for a notified body audit
+mneme trace gaps parkiwatch
+# → Hazards with no mitigation: ...
+#   User needs with no requirements: ...
+# Export the full traceability matrix for the DHF
+mneme trace matrix parkiwatch --csv --out trace-matrix.csv
+```
 ### Step 4 — Search the knowledge base (anyone)

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/mneme/__init__.py RENAMED Viewed

@@ -5,4 +5,4 @@ Public API:
     from mneme.core import ingest_source_to_both, dual_search, ...
 """
-__version__ = "0.5.1"
+__version__ = "0.5.3"

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/mneme/core.py RENAMED Viewed

@@ -565,7 +565,7 @@ def ingest_source_to_both(source_path: str, client_slug: str, force: bool = Fals
             raw_content = '\n\n'.join(sheets)
         except ImportError:
             raise ValueError(
-                'Excel extraction requires openpyxl. Install: pip install "mneme-cli[xlsx]"'
+                'Excel extraction requires openpyxl. Install: pip install openpyxl'
             )
     else:
         # Generic text fallback
@@ -2103,7 +2103,7 @@ def lint() -> dict:
 def ingest_dir(directory: str, client_slug: str, force: bool = False,
-               recursive: bool = False, preserve_structure: bool = False) -> dict:
+               recursive: bool = False, preserve_structure: bool = True) -> dict:
     """
     Batch ingest all supported files from a directory.
@@ -2112,9 +2112,10 @@ def ingest_dir(directory: str, client_slug: str, force: bool = False,
     When recursive=True, walks subdirectories as well.
-    When preserve_structure=True, each file's directory position relative to
-    ``directory`` becomes a wiki subdirectory under ``wiki/<client>/``. Also
-    naturally resolves same-basename collisions (suggestion #15).
+    When preserve_structure=True (the default), each file's directory position
+    relative to ``directory`` becomes a wiki subdirectory under
+    ``wiki/<client>/``. Also naturally resolves same-basename collisions
+    (suggestion #15). Pass preserve_structure=False for a flat wiki.
     Returns a summary of all ingestions.
     """
@@ -2154,10 +2155,18 @@ def ingest_dir(directory: str, client_slug: str, force: bool = False,
     for fpath in files:
         fname = os.path.basename(fpath)
-        # Compute subpath relative to the input directory when preserving structure
         if preserve_structure:
-            sub_rel = os.path.relpath(os.path.dirname(fpath), directory)
-            subpath = '' if sub_rel in ('', '.') else sub_rel
+            # Prefer the path relative to sources/<client>/ when the input lives
+            # there, so callers running `ingest-dir sources/<client>/SUBDIR` get
+            # the SUBDIR prefix in the wiki tree (rather than silently flattening
+            # because SUBDIR itself has no nested subdirectories). Falls back to
+            # relative-to-input-directory for sources outside the canonical tree.
+            auto = _auto_detect_subpath(fpath, client_slug)
+            if auto:
+                subpath = auto
+            else:
+                sub_rel = os.path.relpath(os.path.dirname(fpath), directory)
+                subpath = '' if sub_rel in ('', '.') else sub_rel
         else:
             subpath = ''
         try:
@@ -6241,6 +6250,8 @@ def main() -> None:
     ingest_parser.add_argument('file', help='Path to source file (.md, .txt, .pdf)')
     ingest_parser.add_argument('client_slug', help='Client slug (e.g. demo-retail, my-client)')
     ingest_parser.add_argument('--force', action='store_true', help='Re-ingest even if source was previously ingested')
+    ingest_parser.add_argument('--flat', action='store_true',
+                               help='Write the page directly to wiki/<client>/ without mirroring source subpath')
     # init
     init_parser = subparsers.add_parser('init', help='Initialize a new mneme workspace')
@@ -6256,8 +6267,13 @@ def main() -> None:
     ingest_dir_parser.add_argument('client_slug', help='Client slug (e.g. demo-retail, my-client)')
     ingest_dir_parser.add_argument('--force', action='store_true', help='Re-ingest even if sources were previously ingested')
     ingest_dir_parser.add_argument('--recursive', '-r', action='store_true', help='Recurse into subdirectories')
-    ingest_dir_parser.add_argument('--preserve-structure', dest='preserve_structure', action='store_true',
-                                   help='Mirror source directory structure into wiki/<client>/ subdirectories')
+    # Default-on since v0.5.2: mirror source directory structure into the wiki.
+    # --flat is the explicit opt-out for users who want a single-directory wiki.
+    ingest_dir_parser.add_argument('--preserve-structure', dest='preserve_structure',
+                                   action='store_true', default=True,
+                                   help='(default) Mirror source directory structure into wiki/<client>/ subdirectories')
+    ingest_dir_parser.add_argument('--flat', dest='preserve_structure', action='store_false',
+                                   help='Write all pages to wiki/<client>/ without subdirectories')
     # tornado
     tornado_parser = subparsers.add_parser('tornado', help='Process inbox: auto-detect, ingest, archive')
@@ -6546,7 +6562,12 @@ def main() -> None:
             print(f'Error: invalid client slug "{args.client_slug}". Use lowercase letters, numbers, hyphens only.', file=sys.stderr)
             sys.exit(1)
         try:
-            result = ingest_source_to_both(args.file, args.client_slug, force=args.force)
+            # Auto-mirror the source's position under sources/<client>/ into the
+            # wiki, unless --flat is passed. This makes single-file ingest match
+            # the default ingest-dir behavior (preserve structure) and avoids
+            # same-basename collisions across different source directories.
+            auto_sub = '' if args.flat else _auto_detect_subpath(args.file, args.client_slug)
+            result = ingest_source_to_both(args.file, args.client_slug, force=args.force, subpath=auto_sub)
             if not result:
                 # Skipped due to duplicate detection
                 sys.exit(0)
@@ -6821,21 +6842,42 @@ def main() -> None:
     elif args.command == 'profile':
         if args.profile_command == 'list':
-            if os.path.exists(PROFILES_DIR):
-                profiles = [f[:-5] for f in os.listdir(PROFILES_DIR) if f.endswith('.json')]
-                active = None
-                if os.path.exists(ACTIVE_PROFILE_FILE):
-                    with open(ACTIVE_PROFILE_FILE, 'r') as f:
-                        active = f.read().strip()
-                if profiles:
-                    print('Available profiles:\n')
-                    for p in sorted(profiles):
-                        marker = ' (active)' if p == active else ''
-                        print(f'  - {p}{marker}')
-                else:
-                    print('No profiles found in profiles/ directory.')
+            # Profiles are markdown files. Look in both the workspace profiles
+            # directory (per-project overrides) and the bundled profiles
+            # directory (shipped with mneme). Workspace entries shadow bundled
+            # ones with the same name.
+            workspace_profiles: dict[str, str] = {}
+            bundled_profiles: dict[str, str] = {}
+            if os.path.isdir(WORKSPACE_PROFILES_DIR):
+                for f in os.listdir(WORKSPACE_PROFILES_DIR):
+                    if f.endswith('.md'):
+                        workspace_profiles[f[:-3]] = 'workspace'
+            if os.path.isdir(PROFILES_DIR):
+                for f in os.listdir(PROFILES_DIR):
+                    if f.endswith('.md'):
+                        bundled_profiles[f[:-3]] = 'bundled'
+            merged = {**bundled_profiles, **workspace_profiles}  # workspace overrides
+            active = None
+            if os.path.exists(ACTIVE_PROFILE_FILE):
+                with open(ACTIVE_PROFILE_FILE, 'r') as f:
+                    active = f.read().strip()
+            if merged:
+                print('Available profiles:\n')
+                for p in sorted(merged):
+                    origin = merged[p]
+                    shadowed = origin == 'workspace' and p in bundled_profiles
+                    markers = []
+                    if p == active:
+                        markers.append('active')
+                    markers.append(origin)
+                    if shadowed:
+                        markers.append('shadows bundled')
+                    print(f'  - {p}  [{", ".join(markers)}]')
             else:
-                print('No profiles/ directory found.')
+                print('No profiles found.')
+                print(f'  Searched: {WORKSPACE_PROFILES_DIR} (workspace) and {PROFILES_DIR} (bundled).')
         elif args.profile_command == 'set':
             try:
                 set_active_profile(args.name)

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/pyproject.toml RENAMED Viewed

@@ -32,12 +32,14 @@ classifiers = [
 ]
 dependencies = [
     "portalocker>=2.0.0",
+    "openpyxl>=3.1.0",
 ]
 [project.optional-dependencies]
 pdf = ["pymupdf>=1.23.0"]
+# Kept for backwards compatibility — xlsx support is now built-in.
 xlsx = ["openpyxl>=3.1.0"]
-all = ["pymupdf>=1.23.0", "openpyxl>=3.1.0"]
+all = ["pymupdf>=1.23.0"]
 release = [
     "build>=1.0.0",
     "twine>=5.0.0",

{mneme_cli-0.5.1 → mneme_cli-0.5.3}/tests/test_core.py RENAMED Viewed

@@ -398,7 +398,13 @@ class TestIngestDirPreserveStructure:
             with open(full, 'w') as f:
                 f.write(content)
-    def test_flat_default_unchanged(self, sync_workspace):
+    def test_preserve_structure_is_default(self, sync_workspace):
+        """v0.5.2+: ingest-dir mirrors source structure by default.
+        Previously flat by default; flipped because flat wikis silently
+        overwrite pages that share a basename across different source dirs
+        (suggestion #15).
+        """
         from mneme.core import ingest_dir
         self._make_source_tree(sync_workspace, {
             'demo/REQUIREMENTS/req-001.md': '# req 1',
@@ -406,9 +412,25 @@ class TestIngestDirPreserveStructure:
         })
         ingest_dir(os.path.join(sync_workspace, 'sources', 'demo'),
                    'demo', recursive=True)
-        # Without --preserve-structure, both pages flatten
+        # Default now mirrors source layout
+        assert os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'requirements', 'req-001.md'))
+        assert os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'design', 'dds-001.md'))
+        # Flat-mode pages should NOT exist
+        assert not os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'req-001.md'))
+        assert not os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'dds-001.md'))
+    def test_flat_opt_out_still_works(self, sync_workspace):
+        """Callers can still request a flat wiki with preserve_structure=False."""
+        from mneme.core import ingest_dir
+        self._make_source_tree(sync_workspace, {
+            'demo/REQUIREMENTS/req-001.md': '# req 1',
+            'demo/DESIGN/dds-001.md': '# dds 1',
+        })
+        ingest_dir(os.path.join(sync_workspace, 'sources', 'demo'),
+                   'demo', recursive=True, preserve_structure=False)
         assert os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'req-001.md'))
         assert os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'dds-001.md'))
+        assert not os.path.exists(os.path.join(sync_workspace, 'wiki', 'demo', 'requirements', 'req-001.md'))
     def test_preserve_structure_creates_subdirs(self, sync_workspace):
         from mneme.core import ingest_dir
@@ -830,6 +852,127 @@ class TestCLI:
         rc, out, err = _run_mnemo('search', 'xyznonexistent12345qqqzzz')
         assert rc == 0
+    def test_profile_list_discovers_bundled_markdown_profiles(self):
+        """Regression: profile list used to filter .json (wrong ext) and only
+        look at the bundled dir. Bundled profiles ship as .md files."""
+        td = tempfile.mkdtemp(prefix='mneme-profile-list-')
+        try:
+            for sub in ('wiki', 'sources', 'schema'):
+                os.makedirs(os.path.join(td, sub), exist_ok=True)
+            with open(os.path.join(td, 'index.md'), 'w') as f:
+                f.write('# Index\n')
+            with open(os.path.join(td, 'log.md'), 'w') as f:
+                f.write('# Log\n')
+            for name, empty in (
+                ('entities.json', {'version': 1, 'updated': '2026-01-01', 'entities': []}),
+                ('tags.json', {'version': 1, 'updated': '2026-01-01', 'tags': {}}),
+                ('graph.json', {'version': 1, 'updated': '2026-01-01', 'nodes': [], 'edges': []}),
+            ):
+                with open(os.path.join(td, 'schema', name), 'w') as f:
+                    json.dump(empty, f)
+            rc, out, err = _run_mnemo('--workspace', td, 'profile', 'list')
+            assert rc == 0, f'profile list failed: {err}'
+            assert 'eu-mdr' in out, f'bundled eu-mdr not listed. got: {out}'
+            assert 'iso-13485' in out, f'bundled iso-13485 not listed. got: {out}'
+            assert 'bundled' in out, f'origin marker missing. got: {out}'
+        finally:
+            shutil.rmtree(td, ignore_errors=True)
+    def test_single_ingest_auto_detects_subpath_from_sources_tree(self):
+        """`mneme ingest` should mirror a file's position under sources/<client>/
+        by default (suggestion #15 — avoid silent basename collisions)."""
+        td = tempfile.mkdtemp(prefix='mneme-ingest-auto-')
+        try:
+            for sub in ('wiki', 'sources', 'schema'):
+                os.makedirs(os.path.join(td, sub), exist_ok=True)
+            with open(os.path.join(td, 'index.md'), 'w') as f:
+                f.write('# Index\n')
+            with open(os.path.join(td, 'log.md'), 'w') as f:
+                f.write('# Log\n')
+            for name, empty in (
+                ('entities.json', {'version': 1, 'updated': '2026-01-01', 'entities': []}),
+                ('tags.json', {'version': 1, 'updated': '2026-01-01', 'tags': {}}),
+                ('graph.json', {'version': 1, 'updated': '2026-01-01', 'nodes': [], 'edges': []}),
+            ):
+                with open(os.path.join(td, 'schema', name), 'w') as f:
+                    json.dump(empty, f)
+            src = os.path.join(td, 'sources', 'demo', 'TRACE', 'REQ', 'req-001.md')
+            os.makedirs(os.path.dirname(src), exist_ok=True)
+            with open(src, 'w') as f:
+                f.write('# Req 1\n\nFiller body content for a test.\n')
+            rc, out, err = _run_mnemo('--workspace', td, 'ingest', src, 'demo')
+            assert rc == 0, f'ingest failed: {err}'
+            nested = os.path.join(td, 'wiki', 'demo', 'trace', 'req', 'req-001.md')
+            flat = os.path.join(td, 'wiki', 'demo', 'req-001.md')
+            assert os.path.exists(nested), f'nested page missing at {nested}'
+            assert not os.path.exists(flat), 'flat page should not exist by default'
+        finally:
+            shutil.rmtree(td, ignore_errors=True)
+    def test_single_ingest_flat_flag_opts_out(self):
+        """`mneme ingest --flat` keeps the old flat-wiki behavior."""
+        td = tempfile.mkdtemp(prefix='mneme-ingest-flat-')
+        try:
+            for sub in ('wiki', 'sources', 'schema'):
+                os.makedirs(os.path.join(td, sub), exist_ok=True)
+            with open(os.path.join(td, 'index.md'), 'w') as f:
+                f.write('# Index\n')
+            with open(os.path.join(td, 'log.md'), 'w') as f:
+                f.write('# Log\n')
+            for name, empty in (
+                ('entities.json', {'version': 1, 'updated': '2026-01-01', 'entities': []}),
+                ('tags.json', {'version': 1, 'updated': '2026-01-01', 'tags': {}}),
+                ('graph.json', {'version': 1, 'updated': '2026-01-01', 'nodes': [], 'edges': []}),
+            ):
+                with open(os.path.join(td, 'schema', name), 'w') as f:
+                    json.dump(empty, f)
+            src = os.path.join(td, 'sources', 'demo', 'TRACE', 'req-001.md')
+            os.makedirs(os.path.dirname(src), exist_ok=True)
+            with open(src, 'w') as f:
+                f.write('# Req 1\n\nFiller body content for a test.\n')
+            rc, out, err = _run_mnemo('--workspace', td, 'ingest', src, 'demo', '--flat')
+            assert rc == 0, f'ingest failed: {err}'
+            flat = os.path.join(td, 'wiki', 'demo', 'req-001.md')
+            nested = os.path.join(td, 'wiki', 'demo', 'trace', 'req-001.md')
+            assert os.path.exists(flat)
+            assert not os.path.exists(nested)
+        finally:
+            shutil.rmtree(td, ignore_errors=True)
+    def test_profile_list_shows_workspace_override(self):
+        """Workspace profiles should shadow bundled ones of the same name."""
+        td = tempfile.mkdtemp(prefix='mneme-profile-override-')
+        try:
+            for sub in ('wiki', 'sources', 'schema', 'profiles'):
+                os.makedirs(os.path.join(td, sub), exist_ok=True)
+            with open(os.path.join(td, 'index.md'), 'w') as f:
+                f.write('# Index\n')
+            with open(os.path.join(td, 'log.md'), 'w') as f:
+                f.write('# Log\n')
+            for name, empty in (
+                ('entities.json', {'version': 1, 'updated': '2026-01-01', 'entities': []}),
+                ('tags.json', {'version': 1, 'updated': '2026-01-01', 'tags': {}}),
+                ('graph.json', {'version': 1, 'updated': '2026-01-01', 'nodes': [], 'edges': []}),
+            ):
+                with open(os.path.join(td, 'schema', name), 'w') as f:
+                    json.dump(empty, f)
+            with open(os.path.join(td, 'profiles', 'eu-mdr.md'), 'w') as f:
+                f.write('---\nname: Override\ndescription: local\n---\n# Principles\n- test\n')
+            with open(os.path.join(td, 'profiles', 'custom.md'), 'w') as f:
+                f.write('---\nname: Custom\ndescription: workspace-only\n---\n# Principles\n- test\n')
+            rc, out, err = _run_mnemo('--workspace', td, 'profile', 'list')
+            assert rc == 0
+            assert 'custom' in out
+            assert 'shadows bundled' in out, f'override marker missing. got: {out}'
+        finally:
+            shutil.rmtree(td, ignore_errors=True)
     def test_ingest_missing_file_exits_nonzero(self):
         rc, out, err = _run_mnemo('ingest', '/tmp/nonexistent_file_xyz_mnemo.md', 'test')
         assert rc == 1