@modelrelay/sdk 0.7.0 → 0.17.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -82,11 +82,87 @@ const stream = await mr.chat.completions.create(
82
82
  );
83
83
  ```
84
84
 
85
- ### Typed models, providers, and stop reasons
85
+ ### Typed models and stop reasons
86
86
 
87
- - Models and providers use string literal unions with an `Other` escape hatch: pass `{ other: "my-provider" }` or `{ other: "custom/model-x" }` to preserve custom IDs while benefiting from autocomplete on known values (e.g., `Models.OpenAIGpt4o`, `Providers.Anthropic`).
87
+ - Models are plain strings (e.g., `"gpt-4o"`), so new models do not require SDK updates.
88
88
  - Stop reasons are parsed into the `StopReason` union (e.g., `StopReasons.EndTurn`); unknown values surface as `{ other: "<raw>" }`.
89
- - Usage backfills `totalTokens` when providers omit it, ensuring consistent accounting.
89
+ - Usage backfills `totalTokens` when the backend omits it, ensuring consistent accounting.
90
+
91
+ ### Structured outputs (`response_format`)
92
+
93
+ Request structured JSON instead of free-form text when the backend supports it:
94
+
95
+ ```ts
96
+ import { ModelRelay, type ResponseFormat } from "@modelrelay/sdk";
97
+
98
+ const mr = new ModelRelay({ key: "mr_sk_..." });
99
+
100
+ const format: ResponseFormat = {
101
+ type: "json_schema",
102
+ json_schema: {
103
+ name: "summary",
104
+ schema: {
105
+ type: "object",
106
+ properties: { headline: { type: "string" } },
107
+ additionalProperties: false,
108
+ },
109
+ strict: true,
110
+ },
111
+ };
112
+
113
+ const completion = await mr.chat.completions.create(
114
+ {
115
+ model: "gpt-4o-mini",
116
+ messages: [{ role: "user", content: "Summarize ModelRelay" }],
117
+ responseFormat: format,
118
+ stream: false,
119
+ },
120
+ { stream: false },
121
+ );
122
+
123
+ console.log(completion.content[0]); // JSON string matching your schema
124
+ ```
125
+
126
+ ### Structured streaming (NDJSON + response_format)
127
+
128
+ Use the structured streaming contract for `/llm/proxy` to stream schema-valid
129
+ JSON payloads over NDJSON:
130
+
131
+ ```ts
132
+ type Item = { id: string; label: string };
133
+ type RecommendationPayload = { items: Item[] };
134
+
135
+ const format: ResponseFormat = {
136
+ type: "json_schema",
137
+ json_schema: {
138
+ name: "recommendations",
139
+ schema: {
140
+ type: "object",
141
+ properties: { items: { type: "array", items: { type: "object" } } },
142
+ },
143
+ },
144
+ };
145
+
146
+ const stream = await mr.chat.completions.streamJSON<RecommendationPayload>({
147
+ model: "grok-4-1-fast",
148
+ messages: [{ role: "user", content: "Recommend items for my user" }],
149
+ responseFormat: format,
150
+ });
151
+
152
+ for await (const evt of stream) {
153
+ if (evt.type === "update") {
154
+ // Progressive UI: evt.payload is a partial but schema-valid result.
155
+ renderPartial(evt.payload.items);
156
+ }
157
+ if (evt.type === "completion") {
158
+ renderFinal(evt.payload.items);
159
+ }
160
+ }
161
+
162
+ // Prefer a single blocking result while keeping structured validation?
163
+ const final = await stream.collect();
164
+ console.log(final.items.length);
165
+ ```
90
166
 
91
167
  ### Telemetry & metrics hooks
92
168