npm - @sanity/ailf-studio - Versions diffs - 0.1.5 → 0.1.7 - Mend

@sanity/ailf-studio 0.1.5 → 0.1.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (4) hide show

package/README.md CHANGED Viewed

@@ -13,28 +13,13 @@ AILF reports are stored.
 ### 1. Add the dependency
-#### Continuous releases (recommended for external projects)
-Every merge to `main` that touches `packages/studio/` automatically publishes
-via [pkg.pr.new](https://pkg.pr.new). Install the latest main build:
-```bash
-pnpm add https://pkg.pr.new/sanity-labs/ai-literacy-framework/@sanity/ailf-studio@main
-```
-Or pin to a specific commit:
 ```bash
-pnpm add https://pkg.pr.new/sanity-labs/ai-literacy-framework/@sanity/ailf-studio@<commit-sha>
+pnpm add @sanity/ailf-studio
 ```
-To update to the latest build, re-run the install command — the `@main` URL
-always resolves to the most recent build.
-#### PR preview packages
-PRs labeled `trigger: preview` also publish preview packages. Install URLs are
-posted as PR comments automatically.
+> **Note:** The package is published with `restricted` access to the `@sanity`
+> npm scope. You need an npm token with read access — see the root
+> [README](../../README.md#obtain-secrets) for how to obtain one.
 #### Within the monorepo
@@ -70,7 +55,7 @@ This registers:
 - The `ailf.referenceSolution` document type (gold-standard reference
   implementations)
 - The `ailf.evalRequest` document type (evaluation request triggers)
-- The **AI Literacy** dashboard tool in the Studio sidebar
+- The **AI Literacy Framework** dashboard tool in the Studio sidebar
 ### 3. Alternative: tool-only installation
@@ -120,7 +105,8 @@ export default defineConfig({
 ## Dashboard Views
-The plugin provides five views accessible from tabs in the dashboard:
+The plugin provides three tab views plus a detail drill-down, accessible from
+the **AI Literacy Framework** tool in the Studio sidebar.
 ### Latest Reports
@@ -128,11 +114,14 @@ A card list of the most recent evaluation reports. Each card shows:
 - Overall score, doc lift, and lowest-scoring area
 - Evaluation mode, source, and trigger type
-- Git metadata (branch, PR number) when available
+- Git metadata (branch, PR number, origin repo) when available
 - Auto-comparison delta against the previous run
 Click any card to navigate to the Report Detail view.
+The view includes a **search bar** for filtering reports by document slug, area,
+or content release perspective.
 ### Score Timeline
 A line chart of overall and per-area scores over time. Filterable by:
@@ -153,25 +142,22 @@ report from dropdowns, then view:
 - Per-model deltas (when both reports include per-model breakdowns)
 - Noise threshold classification
-### Content Impact
-Find all evaluation reports related to a specific Sanity document. Enter a
-document ID to see:
-- Which evaluations included that document in their target set
-- Score trends for that document's feature area over time
-- Whether edits to the document improved or regressed scores
 ### Report Detail
-Full drill-down into a single report:
-- Per-area score table with all dimensions (task completion, code correctness,
-  doc coverage, lift from docs)
-- Per-model breakdowns with cost-per-quality-point
-- Provenance metadata (trigger, git info, grader model, context hash)
-- Auto-comparison summary against the previous comparable run
-- Link to the Promptfoo web viewer for raw evaluation output
+Full drill-down into a single report (navigated from Latest Reports or a direct
+URL):
+- **Overview stats** — composite score, doc lift, cost, duration
+- **Per-area score table** with all dimensions (task completion, code
+  correctness, doc coverage, lift from docs)
+- **Three-layer table** — floor / ceiling / actual decomposition (when
+  available)
+- **Per-model breakdowns** with cost-per-quality-point
+- **Judgment list** — individual grader verdicts with reasoning
+- **Recommendations** — gap analysis remediation suggestions (when available)
+- **Provenance card** — trigger, git info (branch, PR, origin repo), grader
+  model, context hash, eval fingerprint
+- **Auto-comparison summary** against the previous comparable run
 ## Filtering
@@ -198,13 +184,13 @@ export default defineConfig({
 })
 ```
-Reports are written by the evaluation pipeline (`turbo pipeline -- --publish`).
-See the [report store design docs](../../docs/design-docs/report-store/index.md)
-for the full architecture.
+Reports are written by the evaluation pipeline (`ailf pipeline --publish`). See
+the [report store design docs](../../docs/design-docs/report-store/index.md) for
+the full architecture.
 ## Exported API
-The plugin exports building blocks for custom views or extensions:
+The plugin exports building blocks for custom views or extensions.
 ### Plugin & Tool
@@ -230,6 +216,7 @@ The plugin exports building blocks for custom views or extensions:
 | ------------------- | --------------------------------------------------------------------------------------------- |
 | `AssertionInput`    | Custom input for task assertions with contextual type descriptions and monospace code styling |
 | `CanonicalDocInput` | Custom input for canonical doc references with polymorphic resolution type help               |
+| `ReleasePicker`     | Content release perspective picker for evaluation scoping                                     |
 | `MirrorBanner`      | Banner showing repo source, sync status, and provenance for mirrored tasks                    |
 | `SyncStatusBadge`   | Colored badge (green/yellow/red) showing sync freshness of mirrored tasks                     |
@@ -240,31 +227,58 @@ The plugin exports building blocks for custom views or extensions:
 | `GraduateToNativeAction`    | Converts a mirrored (read-only) task to a native (editable) task by removing origin |
 | `createRunEvaluationAction` | Factory for creating a Studio action that triggers evaluations                      |
+### Glossary
+| Export     | Description                                                              |
+| ---------- | ------------------------------------------------------------------------ |
+| `GLOSSARY` | Centralized tooltip descriptions for all evaluation metrics and concepts |
 ### GROQ Queries
-| Export                 | Description                             |
-| ---------------------- | --------------------------------------- |
-| `latestReportsQuery`   | N most recent reports (filterable)      |
-| `scoreTimelineQuery`   | Score data points over time             |
-| `reportDetailQuery`    | Full report with all fields             |
-| `comparisonPairQuery`  | Two reports for side-by-side comparison |
-| `contentImpactQuery`   | Reports related to a document ID        |
-| `distinctSourcesQuery` | All unique source names                 |
-| `distinctModesQuery`   | All unique evaluation modes             |
-| `distinctAreasQuery`   | All unique feature areas                |
+| Export                         | Description                                |
+| ------------------------------ | ------------------------------------------ |
+| `latestReportsQuery`           | N most recent reports (filterable)         |
+| `scoreTimelineQuery`           | Score data points over time                |
+| `reportDetailQuery`            | Full report with all fields                |
+| `comparisonPairQuery`          | Two reports for side-by-side comparison    |
+| `contentImpactQuery`           | Reports related to a document ID           |
+| `recentDocumentEvalsQuery`     | Recent evaluations for a specific document |
+| `articleSearchQuery`           | Full-text search across article documents  |
+| `distinctSourcesQuery`         | All unique source names                    |
+| `distinctModesQuery`           | All unique evaluation modes                |
+| `distinctAreasQuery`           | All unique feature areas                   |
+| `distinctModelsQuery`          | All unique model identifiers               |
+| `distinctPerspectivesQuery`    | All unique content release perspectives    |
+| `distinctTargetDocumentsQuery` | All unique target document slugs           |
 ### Types
-| Export              | Description                                    |
-| ------------------- | ---------------------------------------------- |
-| `ReportListItem`    | Shape returned by `latestReportsQuery`         |
-| `ReportDetail`      | Shape returned by `reportDetailQuery`          |
-| `TimelineDataPoint` | Shape returned by `scoreTimelineQuery`         |
-| `ComparisonData`    | Auto-comparison data embedded in reports       |
-| `ContentImpactItem` | Shape returned by `contentImpactQuery`         |
-| `ProvenanceData`    | Report provenance metadata                     |
-| `SummaryData`       | Score summary (overall + per-area + per-model) |
-| `ScoreItem`         | Individual area score entry                    |
+| Export                       | Description                                                           |
+| ---------------------------- | --------------------------------------------------------------------- |
+| `ReportListItem`             | Shape returned by `latestReportsQuery`                                |
+| `ReportDetail`               | Shape returned by `reportDetailQuery`                                 |
+| `TimelineDataPoint`          | Shape returned by `scoreTimelineQuery`                                |
+| `ComparisonData`             | Auto-comparison data embedded in reports                              |
+| `ContentImpactItem`          | Shape returned by `contentImpactQuery`                                |
+| `ProvenanceData`             | Report provenance metadata                                            |
+| `SummaryData`                | Score summary (overall + per-area + per-model)                        |
+| `ScoreItem`                  | Individual area score entry                                           |
+| `RecommendationGap`          | Single gap analysis recommendation                                    |
+| `RecommendationsData`        | Full recommendations payload                                          |
+| `JudgmentData`               | Individual grader judgment with reasoning                             |
+| `DocumentRef`                | Canonical document reference (re-exported from `@sanity/ailf-shared`) |
+| `ScoreGrade`                 | Letter grade type (re-exported from `@sanity/ailf-shared`)            |
+| `scoreGrade`                 | Function to compute letter grade from numeric score                   |
+| `RunEvaluationActionOptions` | Options for `createRunEvaluationAction` factory                       |
+### Utility Functions
+| Export               | Description                                               |
+| -------------------- | --------------------------------------------------------- |
+| `formatPercent`      | Format a number as a percentage string                    |
+| `formatRelativeTime` | Format an ISO timestamp as relative time (e.g., "2h ago") |
+| `formatDelta`        | Format a score delta with +/− sign                        |
+| `formatDuration`     | Format milliseconds as human-readable duration            |
 ## Development
@@ -279,8 +293,8 @@ pnpm --filter @sanity/ailf-studio dev
 turbo build
 ```
-The plugin is pure TypeScript (TSC compilation, no bundler). The consuming
-Studio's bundler (Vite) handles the final bundle.
+The plugin uses [tsup](https://github.com/egoist/tsup) for bundling. The
+consuming Studio's bundler (Vite) handles the final bundle.
 ## Related Documentation

package/dist/index.d.ts CHANGED Viewed

@@ -436,10 +436,10 @@ declare const webhookConfigSchema: {
  * supports browser back/forward navigation.
  *
  * Route structure:
- *   /ai-literacy                        → Latest Reports (home)
- *   /ai-literacy/report/:reportId       → Report Detail
- *   /ai-literacy/timeline               → Score Timeline
- *   /ai-literacy/compare                → Compare
+ *   /ailf                        → Latest Reports (home)
+ *   /ailf/report/:reportId       → Report Detail
+ *   /ailf/timeline               → Score Timeline
+ *   /ailf/compare                → Compare
  */
 /**