@gmickel/gno 0.22.3 → 0.22.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2)
  1. package/README.md +49 -0
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -27,11 +27,42 @@ GNO is a local knowledge engine that turns your documents into a searchable, con
  - [How It Works](#how-it-works)
  - [Features](#features)
  - [Local Models](#local-models)
+ - [Fine-Tuned Models](#fine-tuned-models)
  - [Architecture](#architecture)
  - [Development](#development)
 
  ---
 
+ ## What's New in v0.22
+
+ - **Promoted Slim Retrieval Model**: published `slim-retrieval-v1` on Hugging Face for direct `hf:` installation in GNO
+ - **Fine-Tuning Workflow**: local MLX LoRA training, portable GGUF export, automatic checkpoint selection, promotion bundles, and repeatable benchmark comparisons
+ - **Autonomous Search Harness**: bounded candidate search with early-stop guards, repeated incumbent confirmation, and promotion targets
+ - **Public Docs & Site**: fine-tuned model docs and feature pages now point at the published HF model and the `slim-tuned` preset
+
+ ### Fine-Tuned Model Quick Use
+
+ ```yaml
+ models:
+   activePreset: slim-tuned
+   presets:
+     - id: slim-tuned
+       name: GNO Slim Retrieval v1
+       embed: hf:gpustack/bge-m3-GGUF/bge-m3-Q4_K_M.gguf
+       rerank: hf:ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF/qwen3-reranker-0.6b-q8_0.gguf
+       gen: hf:guiltylemon/gno-expansion-slim-retrieval-v1/gno-expansion-auto-entity-lock-default-mix-lr95-f16.gguf
+ ```
+
+ Then:
+
+ ```bash
+ gno models use slim-tuned
+ gno models pull --gen
+ gno query "ECONNREFUSED 127.0.0.1:5432" --thorough
+ ```
+
+ > Full guide: [Fine-Tuned Models](https://gno.sh/docs/FINE-TUNED-MODELS/) · [Feature page](https://gno.sh/features/fine-tuned-models/)
+
 
  ## What's New in v0.21
 
  - **Ask CLI Query Modes**: `gno ask` now accepts repeatable `--query-mode term|intent|hyde` entries, matching the existing Ask API and Web controls
@@ -447,6 +478,24 @@ gno models use slim
  gno models pull --all # Optional: pre-download models (auto-downloads on first use)
  ```
 
+ ## Fine-Tuned Models
+
+ GNO now has a published, promoted retrieval model for the default slim path:
+
+ - model repo: `guiltylemon/gno-expansion-slim-retrieval-v1`
+ - recommended preset id: `slim-tuned`
+ - runtime URI:
+   - `hf:guiltylemon/gno-expansion-slim-retrieval-v1/gno-expansion-auto-entity-lock-default-mix-lr95-f16.gguf`
+
+ Use it when you want the tuned retrieval expansion path immediately, without running local fine-tuning yourself.
+
+ For private/internal products, use the same workflow but keep the final GGUF private and point `gen:` at a `file:` URI instead of publishing to Hugging Face.
+
+ See:
+
+ - [Fine-Tuned Models docs](https://gno.sh/docs/FINE-TUNED-MODELS/)
+ - [Fine-Tuned Models feature page](https://gno.sh/features/fine-tuned-models/)
+
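The private `file:`-based variant described above can be sketched as a preset whose `gen:` points at a local GGUF instead of a Hugging Face repo. This is only an illustrative sketch: the preset id, name, and local path below are hypothetical placeholders, while the `embed:` and `rerank:` URIs are the same ones the published `slim-tuned` preset uses.

```yaml
models:
  activePreset: slim-tuned-private
  presets:
    - id: slim-tuned-private        # hypothetical preset id
      name: Private Slim Retrieval  # hypothetical display name
      embed: hf:gpustack/bge-m3-GGUF/bge-m3-Q4_K_M.gguf
      rerank: hf:ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF/qwen3-reranker-0.6b-q8_0.gguf
      gen: file:/srv/models/private-retrieval.gguf  # hypothetical local GGUF path
```

With a `file:` URI there is nothing to pull for `gen:`; the GGUF just has to exist at that path on the machine running GNO.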
  ### HTTP Backends (Remote GPU)
 
  Offload inference to a GPU server on your network:
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
    "name": "@gmickel/gno",
-   "version": "0.22.3",
+   "version": "0.22.5",
    "description": "Local semantic search for your documents. Index Markdown, PDF, and Office files with hybrid BM25 + vector search.",
    "keywords": [
      "embeddings",