npm - @salesforce/sfdx-agent-sdk - Versions diffs - 0.18.0 → 0.20.0 - Mend

@salesforce/sfdx-agent-sdk 0.18.0 → 0.20.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -3,6 +3,20 @@
 All notable changes to `@salesforce/sfdx-agent-sdk` are documented in this file.
 Format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/).
+## [0.20.0] - 2026-06-12
+### Features
+- **agent-sdk,harness-mastra,harness-claude**: add tool-call-delta ChatEvent for streaming tool-call args @W-22965697@ ([#597](https://github.com/forcedotcom/agentic-dx/pull/597))
+- **agent-sdk,harness-mastra,harness-claude**: add tool-progress ChatEvent variant @W-22951127@ ([#594](https://github.com/forcedotcom/agentic-dx/pull/594))
+- **harness-claude**: map SDKThinkingTokensMessage to UsageMetadata.reasoningTokens ([#595](https://github.com/forcedotcom/agentic-dx/pull/595))
+### Fixes
+- **harness-claude**: map SDKResultSuccess.is_error to ChatEvent.error ([#593](https://github.com/forcedotcom/agentic-dx/pull/593))
+## [0.19.0] - 2026-06-11
+_No changes — released alongside dependent packages._
 ## [0.18.0] - 2026-06-09
 ### Tests

package/README.md CHANGED Viewed

@@ -190,18 +190,20 @@ iterating the same `eventStream` until it sees a terminal `finish` event.
 Discriminated union (`event.type`) of streaming events:
-| Type                    | Key Fields                                                                              | Description                                                                                                                                                                                                                                                                                                                      |
-| ----------------------- | --------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| `start`                 | —                                                                                       | Stream has begun.                                                                                                                                                                                                                                                                                                                |
-| `text-delta`            | `text`                                                                                  | Incremental response text.                                                                                                                                                                                                                                                                                                       |
-| `reasoning-delta`       | `text`                                                                                  | Chain-of-thought fragment.                                                                                                                                                                                                                                                                                                       |
-| `tool-call`             | `toolCallId`, `toolName`, `args`, `annotations?`, `serverName?`                         | Tool invocation. `annotations` is the MCP-spec hints (`readOnlyHint`, `destructiveHint`, …) when the source declared them; `serverName` is set when the tool came from an MCP server.                                                                                                                                            |
-| `tool-approval-request` | `toolCall: ToolCallInfo`, `annotations?`, `serverName?`                                 | Engine requests approval before executing a tool. Same `annotations` / `serverName` semantics as `tool-call`.                                                                                                                                                                                                                    |
-| `tool-result`           | `toolCallId`, `toolName`, `result`, `isError?`, `error?`, `annotations?`, `serverName?` | Tool execution completed. `error` is present when `isError` is true (best-effort: harnesses may synthesize an `Error` from a string payload, so `error.stack` is not guaranteed to point at the tool's throw site; the field may be absent on empty error payloads). Same `annotations` / `serverName` semantics as `tool-call`. |
-| `step-start`            | `stepIndex`                                                                             | New LLM invocation step began.                                                                                                                                                                                                                                                                                                   |
-| `step-finish`           | `stepIndex`, `finishReason`, `usage?`                                                   | Step completed with per-step token usage.                                                                                                                                                                                                                                                                                        |
-| `error`                 | `error`, `code?`                                                                        | Mid-stream error (yielded, not thrown).                                                                                                                                                                                                                                                                                          |
-| `finish`                | `finishReason`, `usage?`                                                                | Stream completed with aggregate token usage.                                                                                                                                                                                                                                                                                     |
+| Type                    | Key Fields                                                                              | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
+| ----------------------- | --------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
+| `start`                 | —                                                                                       | Stream has begun.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
+| `text-delta`            | `text`                                                                                  | Incremental response text.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
+| `reasoning-delta`       | `text`                                                                                  | Chain-of-thought fragment.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
+| `tool-call`             | `toolCallId`, `toolName`, `args`, `annotations?`, `serverName?`                         | Tool invocation. `annotations` is the MCP-spec hints (`readOnlyHint`, `destructiveHint`, …) when the source declared them; `serverName` is set when the tool came from an MCP server.                                                                                                                                                                                                                                                                                                                                          |
+| `tool-call-delta`       | `toolCallId`, `toolName?`, `argsTextDelta`                                              | Incremental fragment of a tool call's args JSON, emitted while the model composes the call. Concatenate successive deltas for the same `toolCallId` to build the args text; the parsed result matches the terminal `tool-call.args`. Useful for live-typing tool inputs UI; consumers that don't need streaming-args can ignore this event and continue reading the parsed `args` on the terminal `tool-call`. `toolName` is optional (Claude's signal does not carry it on the wire).                                         |
+| `tool-approval-request` | `toolCall: ToolCallInfo`, `annotations?`, `serverName?`                                 | Engine requests approval before executing a tool. Same `annotations` / `serverName` semantics as `tool-call`.                                                                                                                                                                                                                                                                                                                                                                                                                  |
+| `tool-result`           | `toolCallId`, `toolName`, `result`, `isError?`, `error?`, `annotations?`, `serverName?` | Tool execution completed. `error` is present when `isError` is true (best-effort: harnesses may synthesize an `Error` from a string payload, so `error.stack` is not guaranteed to point at the tool's throw site; the field may be absent on empty error payloads). Same `annotations` / `serverName` semantics as `tool-call`.                                                                                                                                                                                               |
+| `tool-progress`         | `toolCallId`, `toolName`, `output?`, `parentToolCallId?`                                | Incremental progress signal from a long-running tool call. Distinct from `tool-result`: zero or more `tool-progress` events may be emitted before exactly one terminal `tool-result`. `output` and `parentToolCallId` are best-effort enrichment that depends on the tool — the event itself is the load-bearing "tool is still working" signal; consumers SHOULD NOT branch on which optional fields are present. Useful for "tool is working" UI on long-running tools (build, test, deploy, large search, sub-agent tasks). |
+| `step-start`            | `stepIndex`                                                                             | New LLM invocation step began.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
+| `step-finish`           | `stepIndex`, `finishReason`, `usage?`                                                   | Step completed with per-step token usage.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
+| `error`                 | `error`, `code?`                                                                        | Mid-stream error (yielded, not thrown).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
+| `finish`                | `finishReason`, `usage?`                                                                | Stream completed with aggregate token usage.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
 > **Diagnostic logging.** The `ChatEvent` union is the harness-agnostic public stream — it never carries
 > harness-internal chunk shapes. When a harness encounters a chunk type its adapter does not recognize (typically after

package/dist/index.d.ts CHANGED Viewed

@@ -1,5 +1,5 @@
 export type { Message, MessagePart, ImagePart, FilePart } from './types/messages.js';
-export type { ChatEvent, StartEvent, TextDeltaEvent, ReasoningDeltaEvent, ToolCallEvent, ToolApprovalRequestEvent, ToolResultEvent, StepStartEvent, StepFinishEvent, ErrorEvent, FinishEvent, ChatStreamResult, } from './types/events.js';
+export type { ChatEvent, StartEvent, TextDeltaEvent, ReasoningDeltaEvent, ToolCallEvent, ToolCallDeltaEvent, ToolApprovalRequestEvent, ToolResultEvent, ToolProgressEvent, StepStartEvent, StepFinishEvent, ErrorEvent, FinishEvent, ChatStreamResult, } from './types/events.js';
 export type { ToolDefinition, ToolCallInfo, ToolResultInfo } from './types/tools.js';
 export type { ContextUsage, FinishReason, UsageMetadata } from './types/usage.js';
 export type { AgentHooks, HooksForAgent, ToolResultRedactor, ToolResultRedactionInput, ToolResultRedactionResult, } from './types/redaction.js';

package/dist/types/events.d.ts CHANGED Viewed

@@ -8,7 +8,7 @@ import type { FinishReason, UsageMetadata } from './usage.js';
  * convention, with the addition of `tool-approval-request` for human-in-the-loop
  * tool approval flows.
  */
-export type ChatEvent = StartEvent | TextDeltaEvent | ReasoningDeltaEvent | ToolCallEvent | ToolApprovalRequestEvent | ToolResultEvent | StepStartEvent | StepFinishEvent | ErrorEvent | FinishEvent;
+export type ChatEvent = StartEvent | TextDeltaEvent | ReasoningDeltaEvent | ToolCallEvent | ToolCallDeltaEvent | ToolApprovalRequestEvent | ToolResultEvent | ToolProgressEvent | StepStartEvent | StepFinishEvent | ErrorEvent | FinishEvent;
 /**
  * The stream has begun. Symmetric counterpart to {@link FinishEvent}.
  *
@@ -68,6 +68,39 @@ export type ToolCallEvent = ToolCallInfo & {
      */
     serverName?: string;
 };
+/**
+ * An incremental fragment of a tool call's args JSON, emitted while the
+ * model is still composing the call. Concatenate successive
+ * `tool-call-delta` events for the same `toolCallId` to build the complete
+ * args JSON that lands on the terminal {@link ToolCallEvent}.
+ *
+ * Useful for UIs that want to render "live-typing" tool args as the model
+ * generates them, mirroring `text-delta` for chat responses. Consumers that
+ * don't need streaming-args UX can ignore this event and continue to read
+ * the parsed `args` object on the terminal {@link ToolCallEvent}, which is
+ * unchanged.
+ *
+ * Both harnesses produce the underlying signal natively (Claude:
+ * `input_json_delta`; Mastra: `tool-call-delta` chunk). The terminal
+ * {@link ToolCallEvent} still fires once the full args object has been
+ * parsed — `tool-call-delta` is purely additive UX enrichment.
+ */
+export type ToolCallDeltaEvent = {
+    type: 'tool-call-delta';
+    /** The id of the in-progress tool call. Matches the eventual {@link ToolCallEvent.toolCallId}. */
+    toolCallId: string;
+    /**
+     * The name of the tool. Optional because Mastra's `tool-call-delta`
+     * chunk types `toolName` as optional and may omit it; Claude always
+     * populates it from the per-`toolCallId` state captured on the prior
+     * `content_block_start`. Consumers SHOULD NOT branch on `toolName`
+     * presence — when absent, resolve it from the prior
+     * {@link ToolCallEvent} on the same `toolCallId`.
+     */
+    toolName?: string;
+    /** The args-JSON text fragment. */
+    argsTextDelta: string;
+};
 /**
  * The harness is requesting approval before executing a tool call.
  * The stream suspends until the consumer calls `approveToolCall()` or
@@ -119,6 +152,39 @@ export type ToolResultEvent = ToolResultInfo & {
      */
     serverName?: string;
 };
+/**
+ * An incremental tool-progress signal emitted while a long-running tool call is
+ * still in flight. Distinct from {@link ToolResultEvent} which marks terminal
+ * completion: a single tool call may emit zero or more `tool-progress` events,
+ * followed by exactly one `tool-result`.
+ *
+ * Useful for UIs that want to render a "tool is working" indicator for
+ * long-running tools (build, test, deploy, large search, sub-agent tasks)
+ * where the user benefits from seeing intermediate activity before the terminal
+ * `tool-result` lands.
+ *
+ * The event itself is the load-bearing "tool is still working" signal —
+ * `output` and `parentToolCallId` are best-effort enrichment that depends on
+ * the tool. Consumer code SHOULD NOT branch on which optional fields are
+ * present.
+ */
+export type ToolProgressEvent = {
+    type: 'tool-progress';
+    /** The id of the in-progress tool call. Matches a prior {@link ToolCallEvent.toolCallId}. */
+    toolCallId: string;
+    /** The name of the tool. Matches a prior {@link ToolCallEvent.toolName}. */
+    toolName: string;
+    /**
+     * The tool's incremental output, if the tool produces one. Shape is
+     * tool-defined.
+     */
+    output?: unknown;
+    /**
+     * The parent tool-call id when this progress is for a nested tool call
+     * (e.g. a sub-agent invoking another tool), if applicable.
+     */
+    parentToolCallId?: string;
+};
 /**
  * Marks the beginning of a new LLM invocation within a multi-step agentic loop.
  *

package/dist/types/usage.d.ts CHANGED Viewed

@@ -9,7 +9,25 @@ export type UsageMetadata = {
     outputTokens?: number;
     /** Sum of input and output tokens. */
     totalTokens?: number;
-    /** Tokens consumed by the model's reasoning/thinking phase. */
+    /**
+     * Tokens consumed by the model's reasoning/thinking phase.
+     *
+     * Provider-reported and possibly approximate: Mastra surfaces an
+     * actual billed count from the gateway, while Claude surfaces a live
+     * estimate digested from `SDKThinkingTokensMessage.estimated_tokens`
+     * (the SDK's own JSDoc flags this as "approximate progress for
+     * spinners/pills, not the authoritative billed output_tokens").
+     * Consumers reading this field see the best-available reasoning-token
+     * count regardless of harness — both surfaces populate the same slot
+     * with the same shape and semantics on `step-finish.usage` and
+     * `finish.usage`.
+     *
+     * Intentionally excluded from {@link ContextUsage.usedFraction} —
+     * reasoning tokens don't survive into the next turn's prompt, so they
+     * don't belong in a "should I compact?" denominator. See
+     * `packages/sfdx-agent-sdk/ARCHITECTURE.md` →
+     * "Context-window usage tracking" for the rationale.
+     */
     reasoningTokens?: number;
     /** Input tokens served from provider cache (reduces cost). */
     cachedInputTokens?: number;
@@ -74,6 +92,13 @@ export type ContextUsage = {
      * cache-hit paths. Mastra is unaffected because it does not populate the
      * cache fields, so the sum collapses to `inputTokens` alone.
      *
+     * `reasoningTokens` is intentionally NOT in the sum: this fraction
+     * answers "should I compact for the next turn?" and reasoning blocks
+     * are stripped from the transcript before the next turn's prompt is
+     * sent, so they don't occupy next-turn context. See
+     * `packages/sfdx-agent-sdk/ARCHITECTURE.md` →
+     * "Context-window usage tracking" for the rationale.
+     *
      * `undefined` when ALL three input-bearing fields are missing on the
      * latest reading (pre-first-turn, post-`clearHistory()`, or when a
      * harness emits a reading without any input-side counts). Consumers

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@salesforce/sfdx-agent-sdk",
-  "version": "0.18.0",
+  "version": "0.20.0",
   "description": "Harness-agnostic agentic infrastructure for Salesforce developer experience tooling",
   "type": "module",
   "main": "dist/index.js",
@@ -45,8 +45,8 @@
   },
   "devDependencies": {
     "@eslint/js": "^10.0.1",
-    "@salesforce/sfdx-agent-harness-claude": "0.14.0",
-    "@salesforce/sfdx-agent-harness-mastra": "0.17.0",
+    "@salesforce/sfdx-agent-harness-claude": "0.16.0",
+    "@salesforce/sfdx-agent-harness-mastra": "0.19.0",
     "@types/node": "^22.19.20",
     "@vitest/coverage-istanbul": "^4.1.8",
     "@vitest/eslint-plugin": "^1.6.19",