observa-sdk 0.0.8 → 0.0.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -41,18 +41,58 @@ const observa = init({
 
  ## Quick Start
 
- ### JWT-based API Key (Recommended)
+ ### Auto-Capture with OpenAI (Recommended)
 
- After signing up, you'll receive a JWT-formatted API key that automatically encodes your tenant and project context:
+ The easiest way to track LLM calls is the `observeOpenAI()` wrapper; it automatically captures 90%+ of your LLM interactions:
 
  ```typescript
  import { init } from "observa-sdk";
+ import OpenAI from "openai";
 
- // Initialize with JWT API key from signup (automatically extracts tenant/project context)
+ // Initialize Observa
  const observa = init({
    apiKey: "eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9...", // Your API key from signup
  });
 
+ // Initialize OpenAI client
+ const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
+
+ // Wrap with Observa (automatic tracing)
+ const wrappedOpenAI = observa.observeOpenAI(openai, {
+   name: 'my-app',
+   userId: 'user_123',
+   redact: (data) => {
+     // Optional: scrub sensitive data before sending to Observa
+     if (data?.messages) {
+       return { ...data, messages: '[REDACTED]' };
+     }
+     return data;
+   }
+ });
+
+ // Use wrapped client - automatically tracked!
+ const response = await wrappedOpenAI.chat.completions.create({
+   model: 'gpt-4',
+   messages: [{ role: 'user', content: 'Hello!' }],
+ });
+
+ // Streaming also works automatically
+ const stream = await wrappedOpenAI.chat.completions.create({
+   model: 'gpt-4',
+   messages: [{ role: 'user', content: 'Hello!' }],
+   stream: true,
+ });
+
+ for await (const chunk of stream) {
+   process.stdout.write(chunk.choices[0]?.delta?.content || '');
+ }
+ ```
+
+ ### Legacy Manual Tracking
+
+ For more control, you can still use the manual `track()` method:
+
+ ```typescript
  // Track AI interactions with simple wrapping
  const response = await observa.track({ query: "What is the weather?" }, () =>
    fetch("https://api.openai.com/v1/chat/completions", {
@@ -67,18 +107,26 @@ const response = await observa.track({ query: "What is the weather?" }, () =>
  );
  ```
 
- ### Legacy API Key Format
+ ### Manual Tracking (Advanced)
+
+ For more control over what gets tracked, use the manual tracking methods:
 
  ```typescript
- // For backward compatibility, you can still provide tenantId/projectId explicitly
- const observa = init({
-   apiKey: "your-api-key",
-   tenantId: "acme_corp",
-   projectId: "prod_app",
-   environment: "prod", // optional, defaults to "dev"
+ // Use trackLLMCall for fine-grained control
+ const spanId = observa.trackLLMCall({
+   model: 'gpt-4',
+   input: 'Hello!',
+   output: 'Hi there!',
+   inputTokens: 10,
+   outputTokens: 5,
+   latencyMs: 1200,
+   operationName: 'chat',
+   providerName: 'openai',
  });
  ```
 
+ See the [API Reference](#api-reference) section for all available methods.
+
  ## Multi-Tenant Architecture
 
  Observa SDK uses a **multi-tenant shared runtime architecture** for optimal cost, scalability, and operational simplicity.
@@ -141,6 +189,9 @@ interface ObservaInitConfig {
    projectId?: string;
    environment?: "dev" | "prod";
 
+   // Observa backend URL (optional, defaults to https://api.observa.ai)
+   apiUrl?: string;
+
    // SDK behavior
    mode?: "development" | "production";
    sampleRate?: number; // 0..1, default: 1.0
@@ -153,6 +204,7 @@ interface ObservaInitConfig {
  - **apiKey**: Your Observa API key (JWT format recommended)
  - **tenantId** / **projectId**: Required only for legacy API keys
  - **environment**: `"dev"` or `"prod"` (defaults to `"dev"`)
+ - **apiUrl**: Observa backend URL (optional, defaults to `https://api.observa.ai`)
  - **mode**: SDK mode - `"development"` logs traces to console, `"production"` sends to Observa
  - **sampleRate**: Fraction of traces to record (0.0 to 1.0)
  - **maxResponseChars**: Maximum response size to capture (prevents huge payloads)
@@ -163,9 +215,246 @@ interface ObservaInitConfig {
 
  Initialize the Observa SDK instance.
 
+ **Example:**
+ ```typescript
+ import { init } from "observa-sdk";
+
+ const observa = init({
+   apiKey: "eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9...", // Your JWT API key
+   apiUrl: "https://api.observa.ai", // Optional, defaults to https://api.observa.ai
+   environment: "prod", // Optional, defaults to "dev"
+   mode: "production", // Optional, "development" or "production"
+   sampleRate: 1.0, // Optional, 0.0 to 1.0, default: 1.0
+   maxResponseChars: 50000, // Optional, default: 50000
+ });
+ ```
+
+ ### `observa.observeOpenAI(client, options?)`
+
+ Wrap an OpenAI client with automatic tracing. This is the **recommended** way to track LLM calls.
+
+ **Parameters:**
+ - `client` (required): OpenAI client instance
+ - `options` (optional):
+   - `name` (optional): Application/service name
+   - `tags` (optional): Array of tags
+   - `userId` (optional): User identifier
+   - `sessionId` (optional): Session identifier
+   - `redact` (optional): Function to sanitize data before sending to Observa
+
+ **Returns**: Wrapped OpenAI client (use it exactly like the original client)
+
+ **Example:**
+ ```typescript
+ import OpenAI from 'openai';
+
+ const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
+ const wrapped = observa.observeOpenAI(openai, {
+   name: 'my-app',
+   userId: 'user_123',
+   redact: (data) => {
+     // Sanitize sensitive data
+     if (data?.messages) {
+       return { ...data, messages: '[REDACTED]' };
+     }
+     return data;
+   }
+ });
+
+ // Use wrapped client - automatically tracked!
+ const response = await wrapped.chat.completions.create({
+   model: 'gpt-4',
+   messages: [{ role: 'user', content: 'Hello!' }],
+ });
+ ```
+
+ ### `observa.observeAnthropic(client, options?)`
+
+ Wrap an Anthropic client with automatic tracing. Same API as `observeOpenAI()`.
+
+ **Example:**
+ ```typescript
+ import Anthropic from '@anthropic-ai/sdk';
+
+ const anthropic = new Anthropic({ apiKey: process.env.ANTHROPIC_API_KEY });
+ const wrapped = observa.observeAnthropic(anthropic, {
+   name: 'my-app',
+   redact: (data) => ({ ...data, messages: '[REDACTED]' })
+ });
+
+ // Use wrapped client - automatically tracked!
+ const response = await wrapped.messages.create({
+   model: 'claude-3-opus-20240229',
+   max_tokens: 1024,
+   messages: [{ role: 'user', content: 'Hello!' }],
+ });
+ ```
+
+ ### `observa.startTrace(options)`
+
+ Start a new trace for manual trace management. Returns the trace ID.
+
+ **Parameters:**
+ - `options.name` (optional): Trace name
+ - `options.metadata` (optional): Custom metadata object
+ - `options.conversationId` (optional): Conversation identifier
+ - `options.sessionId` (optional): Session identifier
+ - `options.userId` (optional): User identifier
+
+ **Returns**: `string` - The trace ID
+
+ **Example:**
+ ```typescript
+ const traceId = observa.startTrace({
+   name: "RAG Query",
+   conversationId: "conv-123",
+   userId: "user-456",
+   metadata: { feature: "chat", version: "2.0" }
+ });
+ ```
+
+ ### `observa.endTrace(options)`
+
+ End the current trace and send all buffered events. Must be called after `startTrace()`.
+
+ **Parameters:**
+ - `options.outcome` (optional): `"success"` | `"error"` | `"timeout"` (default: `"success"`)
+
+ **Returns**: `Promise<string>` - The trace ID
+
+ **Example:**
+ ```typescript
+ await observa.endTrace({ outcome: "success" });
+ ```
+
+ ### `observa.trackLLMCall(options)` ⭐ NEW - Full OTEL Support
+
+ Track an LLM call with complete OTEL compliance. **This is the recommended method** for manually tracking LLM calls.
+
+ **Parameters:**
+ - `model` (required): Model name
+ - `input`, `output`: Input/output text
+ - `inputTokens`, `outputTokens`, `totalTokens`: Token counts
+ - `latencyMs` (required): Latency in milliseconds
+ - `operationName`: OTEL operation name ("chat", "text_completion", "generate_content")
+ - `providerName`: Provider name ("openai", "anthropic", etc.) - auto-inferred from model if not provided
+ - `responseModel`: Actual model used (vs. requested)
+ - `topK`, `topP`, `frequencyPenalty`, `presencePenalty`, `stopSequences`, `seed`: Sampling parameters
+ - `inputCost`, `outputCost`: Structured cost tracking
+ - `inputMessages`, `outputMessages`, `systemInstructions`: Structured message objects
+ - `serverAddress`, `serverPort`: Server metadata
+ - `conversationIdOtel`: OTEL conversation ID
+ - And more (see SDK_SOTA_IMPLEMENTATION.md for the complete list)
+
+ **Example:**
+ ```typescript
+ const spanId = observa.trackLLMCall({
+   model: "gpt-4-turbo",
+   input: "What is AI?",
+   output: "AI is...",
+   inputTokens: 10,
+   outputTokens: 50,
+   latencyMs: 1200,
+   operationName: "chat",
+   providerName: "openai", // Auto-inferred if not provided
+   temperature: 0.7,
+   topP: 0.9,
+   inputCost: 0.00245,
+   outputCost: 0.01024
+ });
+ ```
+
+ ### `observa.trackEmbedding(options)` ⭐ NEW
+
+ Track an embedding operation with full OTEL support.
+
+ **Example:**
+ ```typescript
+ const spanId = observa.trackEmbedding({
+   model: "text-embedding-ada-002",
+   dimensionCount: 1536,
+   inputTokens: 10,
+   outputTokens: 1536,
+   latencyMs: 45,
+   cost: 0.0001
+ });
+ ```
+
+ ### `observa.trackVectorDbOperation(options)` ⭐ NEW
+
+ Track vector database operations (Pinecone, Weaviate, Qdrant, etc.).
+
+ **Example:**
+ ```typescript
+ const spanId = observa.trackVectorDbOperation({
+   operationType: "vector_search",
+   indexName: "documents",
+   vectorDimensions: 1536,
+   resultsCount: 10,
+   latencyMs: 30,
+   cost: 0.0005,
+   providerName: "pinecone"
+ });
+ ```
+
+ ### `observa.trackCacheOperation(options)` ⭐ NEW
+
+ Track cache hit/miss operations.
+
+ **Example:**
+ ```typescript
+ const spanId = observa.trackCacheOperation({
+   cacheBackend: "redis",
+   hitStatus: "hit",
+   latencyMs: 2,
+   savedCost: 0.01269
+ });
+ ```
+
+ ### `observa.trackAgentCreate(options)` ⭐ NEW
+
+ Track agent creation.
+
+ **Example:**
+ ```typescript
+ const spanId = observa.trackAgentCreate({
+   agentName: "Customer Support Agent",
+   toolsBound: ["web_search", "database_query"],
+   modelConfig: { model: "gpt-4-turbo", temperature: 0.7 }
+ });
+ ```
+
+ ### `observa.trackToolCall(options)` - Enhanced
+
+ Track a tool call with OTEL standardization.
+
+ **New Parameters:**
+ - `toolType`: "function" | "extension" | "datastore"
+ - `toolDescription`: Tool description
+ - `toolCallId`: Unique tool invocation ID
+ - `errorType`, `errorCategory`: Structured error classification
+
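To make the enhanced options concrete, here is a minimal sketch of the option shape. The field names follow the parameter list above; `toolName` and `latencyMs` are assumed carry-overs from the base method, and the SDK's actual types may differ.

```typescript
// Illustrative sketch only: the enhanced trackToolCall option shape.
// toolName and latencyMs are assumed base parameters, not confirmed here.
type ToolType = 'function' | 'extension' | 'datastore';

interface ToolCallOptions {
  toolName: string;            // assumed base parameter
  latencyMs?: number;          // assumed base parameter
  toolType?: ToolType;
  toolDescription?: string;
  toolCallId?: string;         // unique tool invocation ID
  errorType?: string;          // structured error classification
  errorCategory?: string;
}

// A successful function-tool invocation
const toolCall: ToolCallOptions = {
  toolName: 'web_search',
  toolType: 'function',
  toolDescription: 'Searches the web for current information',
  toolCallId: 'call_abc123',
  latencyMs: 420,
};

// const spanId = observa.trackToolCall(toolCall);
```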
+ ### `observa.trackRetrieval(options)` - Enhanced
+
+ Track retrieval operations with vector metadata.
+
+ **New Parameters:**
+ - `embeddingModel`: Model used for embeddings
+ - `embeddingDimensions`: Vector dimensions
+ - `vectorMetric`: Similarity metric
+ - `rerankScore`, `fusionMethod`, `qualityScore`: Quality metrics
+
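As an illustration of the vector metadata, here is a minimal sketch of a retrieval payload. Field names follow the list above; `query` and `documentsRetrieved` are assumed base parameters and the actual SDK types may differ.

```typescript
// Illustrative sketch only: an enhanced trackRetrieval payload.
interface RetrievalOptions {
  query?: string;               // assumed base parameter
  documentsRetrieved?: number;  // assumed base parameter
  embeddingModel?: string;
  embeddingDimensions?: number;
  vectorMetric?: 'cosine' | 'dot_product' | 'euclidean';
  rerankScore?: number;
  fusionMethod?: string;
  qualityScore?: number;
}

// A cosine-similarity search over ada-002 embeddings
const retrieval: RetrievalOptions = {
  query: 'refund policy',
  documentsRetrieved: 5,
  embeddingModel: 'text-embedding-ada-002',
  embeddingDimensions: 1536,
  vectorMetric: 'cosine',
  rerankScore: 0.92,
};

// const spanId = observa.trackRetrieval(retrieval);
```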
+ ### `observa.trackError(options)` - Enhanced
+
+ Track errors with structured classification.
+
+ **New Parameters:**
+ - `errorCategory`: Error category
+ - `errorCode`: Error code
+
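A minimal sketch of a structured error payload under the classification above; the `message` field is an assumed base parameter (not confirmed by this document), and the category/code values are illustrative.

```typescript
// Illustrative sketch only: an enhanced trackError payload.
interface TrackErrorOptions {
  message: string;        // assumed base parameter
  errorCategory?: string; // e.g. 'timeout', 'rate_limit' (illustrative values)
  errorCode?: string;
}

const errorEvent: TrackErrorOptions = {
  message: 'OpenAI request timed out after 30s',
  errorCategory: 'timeout',
  errorCode: 'ETIMEDOUT',
};

// const spanId = observa.trackError(errorEvent);
```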
  ### `observa.track(event, action)`
 
- Track an AI interaction.
+ Track an AI interaction (legacy method, still supported).
 
  **Parameters**: