npm - audrey - Versions diffs - 1.0.2 → 1.0.3 - Mend

audrey 1.0.2 → 1.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (57) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,32 @@
 # Changelog
+## 1.0.3 - 2026-05-28
+Housekeeping release. Nothing about how Audrey behaves has changed — this is
+all under-the-hood tidying plus a friendlier README. Safe to upgrade from 1.0.2
+without touching anything.
+### Cleaner code under the hood
+- Started breaking up the big `mcp-server/index.ts` file (it had grown to ~3,600
+  lines that did everything at once). The memory-tool input schemas and the
+  shared validation helpers now live in their own small files
+  (`tool-schemas.ts`, `tool-validation.ts`). Same behavior, just easier to read
+  and work on. More of this tidying will follow.
+### More reliable tests
+- The test suite used to need a slow, multi-step "build all the benchmark and
+  paper files first" step before it could run. It now sets those up
+  automatically, so `npm test` (or a plain `vitest run`) just works from a fresh
+  checkout. 785 tests pass with nothing extra to remember.
+### Friendlier docs
+- The README now opens with a short "In Plain English" section that explains
+  what Audrey is for in everyday language, before diving into the technical
+  detail.
 ## 1.0.2 - 2026-05-28
 Maintenance and engineering-quality release. No runtime behavior change — the

package/README.md CHANGED Viewed

@@ -15,6 +15,14 @@
   </p>
 </div>
+## In Plain English
+AI coding assistants are brilliant but forgetful. They'll happily rerun the same broken command they ran yesterday, forget the rules your team agreed on last week, and treat every new session like it's day one.
+Audrey is the memory they're missing. It quietly keeps track of what worked, what failed, and what you told it — then checks that memory **before** the agent does something, so it can say "hold on, this exact command failed last time, and here's what fixed it" instead of repeating the mistake. Everything lives in one local file on your machine: no cloud, no account, and nothing about your code ever leaves your computer.
+That's the whole idea. The rest of this README is the detail.
 ## Why Audrey Exists
 Agents forget the exact mistakes they made yesterday. They repeat broken commands, lose project-specific rules, miss contradictions, and treat every new session like a cold start.
@@ -296,7 +304,7 @@ output shapes are validated by JSON schemas under `benchmarks/schemas/`.
 Latest local result in this checkout: 10/10 scenarios passed, 100% prevention
 rate, 0% false-block rate, 0 raw secret leaks, 0 published artifact leaks in
-the raw-secret sweep, and 2.916ms / 21.17ms
+the raw-secret sweep, and 3.09ms / 28.181ms
 p50/p95 guard latency under the mock-provider methodology.
 **Methodology caveats, on purpose.** All numbers above are produced against

package/benchmarks/output/adapter-self-test/guardbench-adapter-self-test.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "schemaVersion": "1.0.0",
   "suite": "GuardBench adapter self-test",
-  "generatedAt": "2026-05-29T03:45:40.969Z",
+  "generatedAt": "2026-05-29T13:33:27.293Z",
   "ok": true,
   "adapter": {
     "name": "Example Allow Adapter",
@@ -27,9 +27,9 @@
     "evidenceRecall": 0.1,
     "redactionLeaks": 0,
     "latency": {
-      "p50Ms": 0.01,
-      "p95Ms": 0.043,
-      "maxMs": 0.043
+      "p50Ms": 0.012,
+      "p95Ms": 0.042,
+      "maxMs": 0.042
     }
   },
   "contract": {

package/benchmarks/output/external/guardbench-external-dry-run.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "schemaVersion": "1.0.0",
   "suite": "GuardBench external adapter dry-run matrix",
-  "generatedAt": "2026-05-29T03:45:41.522Z",
+  "generatedAt": "2026-05-29T13:33:27.818Z",
   "ok": true,
   "registry": "benchmarks/adapters/registry.json",
   "outRoot": "benchmarks/output/external",

package/benchmarks/output/external/guardbench-external-evidence.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "schemaVersion": "1.0.0",
   "suite": "GuardBench external evidence verification",
-  "generatedAt": "2026-05-29T03:45:41.794Z",
+  "generatedAt": "2026-05-29T13:33:28.076Z",
   "ok": true,
   "allowPending": true,
   "registry": "benchmarks/adapters/registry.json",

package/benchmarks/output/guardbench-conformance-card.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "schemaVersion": "1.0.0",
   "suite": "GuardBench conformance card",
-  "generatedAt": "2026-05-29T03:45:36.958Z",
+  "generatedAt": "2026-05-29T13:33:23.522Z",
   "sourceDir": "benchmarks/output",
   "manifestVersion": "0.2.0",
   "suiteId": "guardbench-local-comparative",
@@ -25,9 +25,9 @@
     "evidenceRecall": 1,
     "redactionLeaks": 0,
     "latency": {
-      "p50Ms": 2.916,
-      "p95Ms": 21.17,
-      "maxMs": 21.17
+      "p50Ms": 3.09,
+      "p95Ms": 28.181,
+      "maxMs": 28.181
     }
   },
   "conformance": {
@@ -39,21 +39,21 @@
   "integrity": {
     "artifactHashes": {
       "guardbench-manifest.json": "57636ce19fdaa6e50fc3fc961d9e499a9f43632f588c713a9fefe8e8a6fa724c",
-      "guardbench-summary.json": "e8669cd6c80dc3dc849b3c4fcc473ea706eb3a760bced69682d0dc2396b2e233",
-      "guardbench-raw.json": "15b39fd1a65709a89455fbfcaf815daf364b204fa526d5065cc12fcaed281d28"
+      "guardbench-summary.json": "91f264dd889e2c639a6fc6d1b867bc228b94c84ed5120345e23dddb79c11ee74",
+      "guardbench-raw.json": "66d4b69087258638f3572a40e1fd59bb84067034f899eaa2c27eed2dde554b2b"
     },
     "externalRunMetadataHash": null
   },
   "provenance": {
-    "generatedAt": "2026-05-29T03:45:36.607Z",
-    "gitSha": "ceed2f51b615175c8bb412b96b5e5a501561189f",
+    "generatedAt": "2026-05-29T13:33:23.189Z",
+    "gitSha": "9f771bae94f5ce4cfd5d5425e300a6a440c833d2",
     "gitDirty": false,
     "node": "v24.16.0",
     "v8": "13.6.233.17-node.49",
     "platform": "linux",
     "arch": "x64",
     "osRelease": "6.17.0-1015-azure",
-    "cpuModel": "AMD EPYC 9V74 80-Core Processor",
+    "cpuModel": "Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz",
     "cpuCount": 4,
     "totalMemoryGb": 15.61,
     "embeddingProvider": "mock",