npm - ai-wiki-toolkit-linux-arm64 - Versions diffs - 0.1.26 → 0.1.28 - Mend

ai-wiki-toolkit-linux-arm64 0.1.26 → 0.1.28

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 # ai-wiki-toolkit-linux-arm64
 This package contains the `aiwiki-toolkit` executable for `linux-arm64-glibc`.
-It is published as the platform-specific binary package for `ai-wiki-toolkit` `0.1.26`.
+It is published as the platform-specific binary package for `ai-wiki-toolkit` `0.1.28`.
 Most users should install `ai-wiki-toolkit` instead of using this package directly.
 ---
@@ -110,6 +110,7 @@ The command emits a transient AI Wiki Context Packet with:
 - a coarse task type and risk tags
 - an effort level so simple operational tasks can stay lightweight
+- generated success criteria and verification checks so non-trivial tasks start with a clear finish line
 - index cards with short descriptions and reference links for relevant memory
 - `must_load` docs to consult first when direct context is required
 - source-cited `must_follow` rules extracted from authoritative user-owned docs
@@ -420,10 +421,17 @@ To inspect memory quality from local reuse and task-check evidence:
 ```bash
 aiwiki-toolkit diagnose memory
 aiwiki-toolkit diagnose memory --since 14d --handle your-handle
+aiwiki-toolkit diagnose memory --focus trial-error
 ```
 This writes regenerated local reports under `ai-wiki/_toolkit/diagnostics/` and prints the report to stdout. The report highlights high-ROI memory, noisy memory, stale or missing docs, conflict notes, missed-memory signals, and coverage gaps such as document reuse events that were never paired with a task-level reuse check. It does not edit user-owned AI wiki docs.
+Use `--focus trial-error` to generate a focused trial/error reduction report from existing
+AI wiki evidence. It summarizes material effects such as `avoided_retry`,
+`blocked_wrong_path`, `changed_plan`, and `faster_resolution`, separates missed or repeated issue
+signals from unproven wiki use, and lists replay candidates that still need source incident
+artifacts before becoming formal impact-eval families.
 To turn diagnostics and handle-local drafts into a human-reviewable consolidation queue:
 ```bash
@@ -433,6 +441,28 @@ aiwiki-toolkit consolidate queue --since 14d --handle your-handle
 This writes regenerated local reports under `ai-wiki/_toolkit/consolidation/` and prints the queue to stdout. The queue suggests one action per draft cluster: keep, refine, promotion candidate, conflict, or supersession. It does not edit user-owned AI wiki docs or create shared conventions, review patterns, problems, features, or decisions; those still require human confirmation.
+To summarize first-attempt product impact from a captured eval run:
+```bash
+aiwiki-toolkit eval impact report --run-dir /path/to/eval-run
+aiwiki-toolkit eval impact report --run-dir /path/to/eval-run --format json
+aiwiki-toolkit eval impact summarize --run-dir /path/to/eval-run --run-dir /path/to/another-run
+aiwiki-toolkit eval impact summarize --runs-file evals/impact/runs.json
+```
+This reads an existing run directory with `metadata.json`, result captures, optional
+`score.json` files, and optional `confounds.json`. It compares the run's primary variants,
+normally `no_aiwiki_workflow` versus `aiwiki_ambient_memory_workflow`, using first-attempt
+metrics only: `first_pass` captures count toward the signal, while `final` repair captures
+stay diagnostic. The command reports first-attempt success rate, average score, attempts, human
+nudges, changed files, untracked files, change-profile splits for project files versus AI wiki
+telemetry and user-owned wiki churn, and whether the run is ready for shareable causal claims.
+It does not run agents or mutate eval artifacts.
+Use `eval impact summarize` to aggregate multiple captured runs into a product-level dashboard.
+It reports each family's primary outcome, product signal, shareability, success and score deltas,
+and change-profile deltas so neutral success-rate runs can still surface quality or churn signals.
 To diagnose missing starter pointers, stale managed prompt blocks, or rule drift and print copy-paste upgrade starters:
 ```bash

package/bin/aiwiki-toolkit CHANGED Viewed

Binary file

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "ai-wiki-toolkit-linux-arm64",
-  "version": "0.1.26",
+  "version": "0.1.28",
   "description": "Platform binary package for ai-wiki-toolkit (linux-arm64-glibc).",
   "license": "MIT",
   "homepage": "https://github.com/BochengYin/ai-wiki-toolkit#readme",