@swarmvaultai/engine 0.1.21 → 0.1.23

package/README.md CHANGED
@@ -169,8 +169,8 @@ This matters because many "OpenAI-compatible" backends only implement part of th
  ### Compile + Query
 
  - `compileVault(rootDir, { approve })` writes wiki pages, graph data, and search state using the vault schema as guidance, or stages a review bundle
- - compile also writes graph orientation pages such as `wiki/graph/report.md` and `wiki/graph/communities/<community>.md`
- - `benchmarkVault(rootDir, { questions })` writes `state/benchmark.json` and folds the latest benchmark summary into `wiki/graph/report.md`
+ - compile also writes graph orientation pages such as `wiki/graph/report.md`, `wiki/graph/report.json`, and `wiki/graph/communities/<community>.md`
+ - `benchmarkVault(rootDir, { questions })` writes `state/benchmark.json` and folds the latest benchmark summary into `wiki/graph/report.md` and `wiki/graph/report.json`
  - `queryVault(rootDir, { question, save, format, review })` answers against the compiled vault using the same schema layer and saves by default
  - `exploreVault(rootDir, { question, steps, format, review })` runs a save-first multi-step exploration loop and writes a hub page plus step outputs
  - `searchVault(rootDir, query, limit)` searches compiled pages directly
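
The artifact paths named in this hunk can be collected in one place so a consumer knows where to look after a compile or benchmark run. The following is a minimal sketch, not part of the package: `graphArtifactPaths` and `GraphArtifactPaths` are hypothetical names, and it assumes a community identifier maps directly onto the `<community>` segment of the file name, which the README does not confirm.

```typescript
// Hypothetical helper: resolves the report artifacts this hunk mentions
// relative to a vault root. Only the relative paths come from the README.
interface GraphArtifactPaths {
  reportMarkdown: string;
  reportJson: string;
  communityPages: string[];
  benchmark: string;
}

function graphArtifactPaths(rootDir: string, communityIds: string[]): GraphArtifactPaths {
  // Normalize the root so that exactly one separator precedes the relative path.
  const root = rootDir === "" ? "" : rootDir.endsWith("/") ? rootDir : `${rootDir}/`;
  const at = (relative: string): string => `${root}${relative}`;

  return {
    reportMarkdown: at("wiki/graph/report.md"),
    // Added in 0.1.23 according to this diff.
    reportJson: at("wiki/graph/report.json"),
    // Assumption: the community id is used verbatim as the file name.
    communityPages: communityIds.map((id) => at(`wiki/graph/communities/${id}.md`)),
    benchmark: at("state/benchmark.json"),
  };
}
```

A caller might pass the same `rootDir` it gives to `compileVault` and then read `reportJson` rather than parsing `report.md`, since the diff describes the JSON file as the machine-readable form.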
@@ -220,12 +220,13 @@ Running the engine produces a local workspace with these main areas:
  - `raw/assets/`: copied attachments referenced by ingested markdown bundles and remote URL ingests
  - `wiki/`: generated markdown pages, the append-only `log.md` activity trail, staged candidates, saved query outputs, exploration hub pages, and a human-only `insights/` area
  - `wiki/graph/`: generated graph report pages and per-community summaries derived from `state/graph.json`
+ - `wiki/graph/report.json`: machine-readable graph report data used by the viewer and export surfaces
  - `wiki/outputs/assets/`: local chart/image artifacts and JSON manifests for saved visual outputs
  - `wiki/code/`: generated module pages for ingested code sources
  - `wiki/projects/`: generated project rollups over canonical pages
  - `wiki/candidates/`: staged concept and entity pages awaiting confirmation on a later compile
  - `state/manifests/`: source manifests
- - `state/extracts/`: extracted text
+ - `state/extracts/`: extracted markdown plus JSON sidecars describing extractor kind, warnings, PDF page counts, and image-vision metadata
  - `state/analyses/`: model analysis output
  - `state/code-index.json`: repo-aware code module aliases and local resolution data
  - `state/benchmark.json`: latest benchmark/trust summary for the current vault
@@ -240,6 +241,8 @@ Running the engine produces a local workspace with these main areas:
  Saved outputs are indexed immediately into the graph page registry and search index, then linked back into compiled source, concept, and entity pages immediately through the lightweight artifact sync path. New concept and entity pages stage into `wiki/candidates/` first and promote to active pages on the next matching compile. Insight pages are indexed into search and page reads, but compile does not mutate them. Project-scoped pages receive `project_ids`, project tags, and layered root-plus-project schema hashes when all contributing sources resolve to the same configured project.
  Code sources also emit module, symbol, and parser-backed rationale nodes into `state/graph.json`, so local imports, exports, inheritance, same-module call edges, and rationale links are queryable through the same viewer and search pipeline.
  Ingest, inbox import, compile, query, lint, review, and candidate operations also append human-readable entries to `wiki/log.md`.
+ PDF sources now go through a local text-extraction pass before analysis, and image sources use the configured `visionProvider` for structured OCR/diagram extraction when a real multimodal provider is available. When image extraction is unavailable, SwarmVault records an explicit warning in the extraction sidecar and carries that warning forward into analysis instead of silently treating the source as empty.
+ Compile and repo-refresh runs also keep benchmark artifacts current by default, so graph report consumers can show freshness and stale-state without requiring a separate benchmark command first.
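
The warning behaviour described for image sources can be illustrated with a small sketch. It is not the engine's implementation: the sidecar shape, the field names (`extractorKind`, `warnings`), and the functions `extractImage` and `toAnalysisInput` are all hypothetical, chosen only to mirror the sidecar contents the README lists.

```typescript
// Hypothetical sidecar shape; the README names the kinds of data stored
// (extractor kind, warnings) but not the actual field names.
interface ExtractionSidecar {
  extractorKind: string;
  warnings: string[];
}

interface Extraction {
  markdown: string;
  sidecar: ExtractionSidecar;
}

interface AnalysisInput {
  text: string;
  warnings: string[];
}

// Runs the vision step when a provider is available; otherwise records an
// explicit warning instead of returning a bare empty result.
function extractImage(runVision: (() => string) | null): Extraction {
  if (runVision === null) {
    return {
      markdown: "",
      sidecar: {
        extractorKind: "image-vision",
        warnings: ["image extraction unavailable: no multimodal provider configured"],
      },
    };
  }
  return { markdown: runVision(), sidecar: { extractorKind: "image-vision", warnings: [] } };
}

// Carries sidecar warnings forward so the analysis step can tell
// "nothing could be extracted" apart from "the source is empty".
function toAnalysisInput(extraction: Extraction): AnalysisInput {
  return { text: extraction.markdown, warnings: [...extraction.sidecar.warnings] };
}
```

With this shape, `toAnalysisInput(extractImage(null))` yields empty text together with one warning, so a downstream consumer can surface the reason rather than treating the image as blank.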
 
  ## Notes