@wix/evalforge-evaluator 0.177.0 → 0.179.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -31,12 +31,25 @@ evaluator <project-id> <eval-run-id>
31
31
  | `AI_GATEWAY_HEADERS` | No | Custom headers for AI Gateway (newline-separated `key:value` pairs) |
32
32
  | `EVAL_API_PREFIX` | No | API path prefix (e.g., `/api/v1`) |
33
33
  | `EVALUATIONS_DIR` | No | Directory for evaluation working directories |
34
- | `TRACE_PUSH_URL` | No | URL for pushing trace events (remote job execution) |
34
+ | `TRACE_PUSH_URL` | No | Enables remote trace push when set (remote job execution). Events are pushed via the gRPC `PushTraceEvent` RPC; the URL value itself is legacy |
35
35
  | `EVAL_ROUTE_HEADER` | No | `x-wix-route` header for deploy preview routing |
36
- | `EVAL_AUTH_TOKEN` | No | Bearer token for public endpoint authentication |
36
+ | `EVAL_AUTH_TOKEN` | No | Bearer token for the remaining legacy REST public endpoints |
37
+ | `EVAL_GRPC_AUTH_TOKEN` | No | S2S-signed token for ambassador/gRPC calls (absent in local dev — calls go out unauthenticated) |
38
+
39
+ For OpenCode runs, the evaluator sets `lsp: false` in `OPENCODE_CONFIG_CONTENT` and `OPENCODE_DISABLE_LSP_DOWNLOAD` / `OPENCODE_DISABLE_FILETIME_CHECK` in the process environment (same as ditto `codegen`) to avoid LSP hangs after edit tools and spurious "file modified since last read" failures in automated evals.
37
40
 
38
41
  The evaluator is typically launched by the backend (locally or on a remote Dev Machine) with these variables pre-configured.
39
42
 
43
+ ## Backend API access
44
+
45
+ Backend calls go through the evalforge ambassador packages (gRPC via
46
+ `@wix/http-client`): all reads, plus `addResult` (`AddEvalRunResult`),
47
+ `clearResults` (`ClearEvalRunResults`), and trace-event push
48
+ (`PushTraceEvent`). The only call still on the legacy REST surface is
49
+ `updateEvalRun` — the gRPC `UpdateEvalRun` handler only forwards
50
+ user-editable fields, not the system state transitions
51
+ (`status`/`completedAt`/`jobError`/`jobStatus`) the evaluator writes.
52
+
40
53
  ## Scripts
41
54
 
42
55
  ```bash