npm - @tangle-network/agent-eval - Versions diffs - 0.45.0 → 0.47.0 - Mend

@tangle-network/agent-eval 0.45.0 → 0.47.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (24) hide show

package/dist/adapters/http.d.ts +1 -1
package/dist/adapters/http.js +11 -4
package/dist/adapters/http.js.map +1 -1
package/dist/adapters/langchain.d.ts +1 -1
package/dist/campaign/index.d.ts +3 -3
package/dist/chunk-ZQABFCVJ.js +85 -0
package/dist/chunk-ZQABFCVJ.js.map +1 -0
package/dist/contract/index.d.ts +217 -2
package/dist/contract/index.js +206 -1
package/dist/contract/index.js.map +1 -1
package/dist/hosted/index.d.ts +192 -0
package/dist/hosted/index.js +10 -0
package/dist/hosted/index.js.map +1 -0
package/dist/openapi.json +1 -1
package/dist/rl.d.ts +1 -1
package/dist/{run-improvement-loop-pJ4yrx4X.d.ts → run-improvement-loop-Bfam3MT1.d.ts} +2 -2
package/dist/{types-BURGZ8Ug.d.ts → types-8u72Gc76.d.ts} +1 -1
package/docs/design/external-agent-wedge.md +2 -2
package/docs/design/phase-d-rfc.md +125 -0
package/docs/hosted-ingest-spec.md +204 -0
package/docs/phase-b-pairing-kit.md +188 -0
package/docs/phase-b-runbook.md +176 -0
package/docs/quickstart-external.md +43 -4
package/package.json +6 -1

package/docs/quickstart-external.md CHANGED Viewed

@@ -13,12 +13,51 @@ Tangle sandbox, no Tangle account, and no hosted infrastructure.
 ## Install
 ```sh
-npm i @tangle-network/agent-eval@^0.44.0
+npm i @tangle-network/agent-eval@^0.46.0
 ```
-The package's `@tangle-network/sandbox` peer is `optional` (as of
-0.44.0). Foreign consumers can install agent-eval and run the full LAND
-tier without our sandbox or its dependencies.
+The package's `@tangle-network/sandbox` peer is `optional`. Foreign
+consumers install agent-eval and run the full LAND tier without our
+sandbox or its dependencies.
+## The one-shot happy path
+If you don't want to learn the substrate, the entire LAND tier reduces
+to one function call:
+```ts
+import { selfImprove } from '@tangle-network/agent-eval/contract'
+const result = await selfImprove({
+  agent: (surface, scenario, ctx) =>
+    runYourAgent({ systemPrompt: surface as string, scenario, signal: ctx.signal }),
+  scenarios,
+  judge,
+  baselineSurface: 'You are a senior copywriter…',
+  budget: { dollars: 10, generations: 3 },
+})
+console.log(`lift: ${result.lift.toFixed(3)} (${result.gateDecision})`)
+if (result.gateDecision === 'ship') {
+  // result.winner.surface is the optimized prompt
+}
+```
+That's the LAND happy path. Smart defaults pick: in-memory storage,
+`gepaDriver` with copywriting-flavored mutation primitives,
+`defaultProductionGate` with `deltaThreshold: 0.05`, 25% deterministic
+train/holdout split.
+Every escape hatch the substrate exposes is reachable from
+`selfImprove` — custom `driver`, custom `gate`, distributed-driver
+`cellPlacement`, `onProgress` streaming callback, `autoOnPromote: 'pr'`
+to open a GitHub PR with the winner. See the type signatures in
+[`src/contract/self-improve.ts`](../src/contract/self-improve.ts) for
+the full surface.
+The sections below are the lower-level path — useful when you want
+fine-grained control over each piece. Read those next if `selfImprove`
+isn't enough.
 ## Five types, four functions

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@tangle-network/agent-eval",
-  "version": "0.45.0",
+  "version": "0.47.0",
   "description": "Substrate for self-improving agents: traces, verifiable rewards, preferences, GEPA / reflective mutation, auto-research, replay, sequential anytime-valid stats, and release gates.",
   "homepage": "https://github.com/tangle-network/agent-eval#readme",
   "repository": {
@@ -119,6 +119,11 @@
       "import": "./dist/adapters/http.js",
       "default": "./dist/adapters/http.js"
     },
+    "./hosted": {
+      "types": "./dist/hosted/index.d.ts",
+      "import": "./dist/hosted/index.js",
+      "default": "./dist/hosted/index.js"
+    },
     "./openapi.json": {
       "default": "./dist/openapi.json"
     }