npm - @elsium-ai/core - Versions diffs - 0.2.0 → 0.2.2 - Mend

@elsium-ai/core 0.2.0 → 0.2.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2)

package/README.md +1117 -19
package/package.json +6 -2

package/README.md CHANGED

@@ -1,6 +1,6 @@
 # @elsium-ai/core
-Core types, schemas, errors, and utilities for [ElsiumAI](https://github.com/elsium-ai/elsium-ai).
+Core types, errors, result pattern, streaming, and infrastructure utilities for [ElsiumAI](https://github.com/elsium-ai/elsium-ai).
 [![npm](https://img.shields.io/npm/v/@elsium-ai/core.svg)](https://www.npmjs.com/package/@elsium-ai/core)
 [![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://github.com/elsium-ai/elsium-ai/blob/main/LICENSE)
@@ -13,38 +13,1136 @@ npm install @elsium-ai/core
 ## What's Inside
-- **Types** — `CompletionRequest`, `LLMResponse`, `Message`, `Middleware`, and all shared interfaces
-- **Circuit Breaker** — Detects failing providers, stops sending traffic, auto-recovers
-- **Request Dedup** — Identical in-flight calls coalesce into one API request
-- **Policy Engine** — Declarative rules to deny by model, cost, token count, or content pattern
-- **Graceful Shutdown** — Drains in-flight operations before process exit
-- **Retry with Backoff** — Exponential backoff with jitter
-- **Logger** — Structured logging with levels and context
-- **Config** — Type-safe environment variable access via `env()`
+| Category | Exports |
+|---|---|
+| **Types** | `Role`, `TextContent`, `ImageContent`, `ContentPart`, `ToolCall`, `ToolResult`, `Message`, `TokenUsage`, `CostBreakdown`, `StopReason`, `LLMResponse`, `StreamEvent`, `StreamCheckpoint`, `XRayData`, `ProviderConfig`, `CompletionRequest`, `ToolDefinition`, `MiddlewareContext`, `MiddlewareNext`, `Middleware` |
+| **Errors** | `ElsiumError`, `ErrorCode`, `ErrorDetails` |
+| **Result** | `Result`, `Ok`, `Err`, `ok()`, `err()`, `isOk()`, `isErr()`, `unwrap()`, `unwrapOr()`, `tryCatch()`, `tryCatchSync()` |
+| **Stream** | `ElsiumStream`, `createStream()`, `StreamTransformer`, `ResilientStreamOptions` |
+| **Logger** | `createLogger()`, `Logger`, `LogLevel`, `LogEntry`, `LoggerOptions` |
+| **Config** | `env()`, `envNumber()`, `envBool()` |
+| **Utilities** | `generateId()`, `generateTraceId()`, `extractText()`, `sleep()`, `retry()` |
+| **Circuit Breaker** | `createCircuitBreaker()`, `CircuitBreakerConfig`, `CircuitBreaker`, `CircuitState` |
+| **Request Dedup** | `createDedup()`, `dedupMiddleware()`, `DedupConfig`, `Dedup` |
+| **Policy Engine** | `createPolicySet()`, `policyMiddleware()`, `modelAccessPolicy()`, `tokenLimitPolicy()`, `costLimitPolicy()`, `contentPolicy()`, `PolicyDecision`, `PolicyResult`, `PolicyContext`, `PolicyRule`, `PolicyConfig`, `PolicySet` |
+| **Shutdown** | `createShutdownManager()`, `ShutdownConfig`, `ShutdownManager` |
-## Usage
+---
-```typescript
+## Types
+All type exports are interfaces and type aliases — no runtime cost.
+### Role
+```ts
+type Role = 'system' | 'user' | 'assistant' | 'tool'
+```
+### TextContent
+```ts
+interface TextContent {
+  type: 'text'
+  text: string
+}
+```
+### ImageContent
+```ts
+interface ImageContent {
+  type: 'image'
+  source:
+    | { type: 'base64'; mediaType: string; data: string }
+    | { type: 'url'; url: string }
+}
+```
+### ContentPart
+```ts
+type ContentPart = TextContent | ImageContent
+```
+### ToolCall
+```ts
+interface ToolCall {
+  id: string
+  name: string
+  arguments: Record<string, unknown>
+}
+```
+### ToolResult
+```ts
+interface ToolResult {
+  toolCallId: string
+  content: string
+  isError?: boolean
+}
+```
+### Message
+```ts
+interface Message {
+  role: Role
+  content: string | ContentPart[]
+  name?: string
+  toolCalls?: ToolCall[]
+  toolResults?: ToolResult[]
+  metadata?: Record<string, unknown>
+}
+```
+### TokenUsage
+```ts
+interface TokenUsage {
+  inputTokens: number
+  outputTokens: number
+  totalTokens: number
+  cacheReadTokens?: number
+  cacheWriteTokens?: number
+}
+```
+### CostBreakdown
+```ts
+interface CostBreakdown {
+  inputCost: number
+  outputCost: number
+  totalCost: number
+  currency: 'USD'
+}
+```
+### StopReason
+```ts
+type StopReason = 'end_turn' | 'max_tokens' | 'stop_sequence' | 'tool_use'
+```
+### LLMResponse
+The unified response shape returned by all providers after a completion.
+```ts
+interface LLMResponse {
+  id: string
+  message: Message
+  usage: TokenUsage
+  cost: CostBreakdown
+  model: string
+  provider: string
+  stopReason: StopReason
+  latencyMs: number
+  traceId: string
+}
+```
+### StreamEvent
+A discriminated union of all events emitted during streaming.
+```ts
+type StreamEvent =
+  | { type: 'text_delta'; text: string }
+  | { type: 'tool_call_start'; toolCall: { id: string; name: string } }
+  | { type: 'tool_call_delta'; toolCallId: string; arguments: string }
+  | { type: 'tool_call_end'; toolCallId: string }
+  | { type: 'message_start'; id: string; model: string }
+  | { type: 'message_end'; usage: TokenUsage; stopReason: StopReason }
+  | { type: 'error'; error: Error }
+  | { type: 'checkpoint'; checkpoint: StreamCheckpoint }
+  | { type: 'recovery'; partialText: string; error: Error }
+```
+### StreamCheckpoint
+```ts
+interface StreamCheckpoint {
+  id: string
+  timestamp: number
+  text: string
+  tokensSoFar: number
+  eventIndex: number
+}
+```
+### XRayData
+Full request/response trace data for observability.
+```ts
+interface XRayData {
+  traceId: string
+  timestamp: number
+  provider: string
+  model: string
+  latencyMs: number
+  request: {
+    url: string
+    method: string
+    headers: Record<string, string>
+    body: Record<string, unknown>
+  }
+  response: {
+    status: number
+    headers: Record<string, string>
+    body: Record<string, unknown>
+  }
+  usage: TokenUsage
+  cost: CostBreakdown
+}
+```
+### ProviderConfig
+```ts
+interface ProviderConfig {
+  apiKey: string
+  baseUrl?: string
+  timeout?: number
+  maxRetries?: number
+}
+```
+### CompletionRequest
+```ts
+interface CompletionRequest {
+  messages: Message[]
+  model?: string
+  system?: string
+  maxTokens?: number
+  temperature?: number
+  seed?: number
+  topP?: number
+  stopSequences?: string[]
+  tools?: ToolDefinition[]
+  schema?: z.ZodType
+  stream?: boolean
+  metadata?: Record<string, unknown>
+  signal?: AbortSignal
+}
+```
+### ToolDefinition
+```ts
+interface ToolDefinition {
+  name: string
+  description: string
+  inputSchema: Record<string, unknown>
+}
+```
+### Middleware types
+```ts
+interface MiddlewareContext {
+  request: CompletionRequest
+  provider: string
+  model: string
+  traceId: string
+  startTime: number
+  metadata: Record<string, unknown>
+}
+type MiddlewareNext = (ctx: MiddlewareContext) => Promise<LLMResponse>
+type Middleware = (ctx: MiddlewareContext, next: MiddlewareNext) => Promise<LLMResponse>
+```
+---
+## Errors
+### ErrorCode
+```ts
+type ErrorCode =
+  | 'PROVIDER_ERROR'
+  | 'RATE_LIMIT'
+  | 'AUTH_ERROR'
+  | 'INVALID_REQUEST'
+  | 'TIMEOUT'
+  | 'NETWORK_ERROR'
+  | 'PARSE_ERROR'
+  | 'VALIDATION_ERROR'
+  | 'TOOL_ERROR'
+  | 'BUDGET_EXCEEDED'
+  | 'MAX_ITERATIONS'
+  | 'STREAM_ERROR'
+  | 'CONFIG_ERROR'
+  | 'UNKNOWN'
+```
+### ErrorDetails
+```ts
+interface ErrorDetails {
+  code: ErrorCode
+  message: string
+  provider?: string
+  model?: string
+  statusCode?: number
+  retryable: boolean
+  retryAfterMs?: number
+  cause?: Error
+  metadata?: Record<string, unknown>
+}
+```
+### ElsiumError
+Structured error class used throughout the framework. Extends `Error` with machine-readable fields for error handling, retries, and observability.
+```ts
+class ElsiumError extends Error {
+  readonly code: ErrorCode
+  readonly provider?: string
+  readonly model?: string
+  readonly statusCode?: number
+  readonly retryable: boolean
+  readonly retryAfterMs?: number
+  readonly cause?: Error
+  readonly metadata?: Record<string, unknown>
+  constructor(details: ErrorDetails)
+  toJSON(): Record<string, unknown>
+  static providerError(message: string, opts: {
+    provider: string
+    statusCode?: number
+    retryable?: boolean
+    cause?: Error
+  }): ElsiumError
+  static rateLimit(provider: string, retryAfterMs?: number): ElsiumError
+  static authError(provider: string): ElsiumError
+  static timeout(provider: string, timeoutMs: number): ElsiumError
+  static validation(message: string, metadata?: Record<string, unknown>): ElsiumError
+  static budgetExceeded(spent: number, budget: number): ElsiumError
+}
+```
+#### Static factory methods
+**`ElsiumError.providerError(message, opts)`** — Generic provider failure.
+| Parameter | Type | Description |
+|---|---|---|
+| `message` | `string` | Error description |
+| `opts.provider` | `string` | Provider name |
+| `opts.statusCode` | `number?` | HTTP status code |
+| `opts.retryable` | `boolean?` | Whether to retry (default `false`) |
+| `opts.cause` | `Error?` | Underlying error |
+**`ElsiumError.rateLimit(provider, retryAfterMs?)`** — Rate limit (429). Always retryable.
+**`ElsiumError.authError(provider)`** — Authentication failure (401). Not retryable.
+**`ElsiumError.timeout(provider, timeoutMs)`** — Request timeout. Retryable.
+**`ElsiumError.validation(message, metadata?)`** — Validation failure. Not retryable.
+**`ElsiumError.budgetExceeded(spent, budget)`** — Token/cost budget exceeded. Not retryable.
+```ts
+import { ElsiumError } from '@elsium-ai/core'
+try {
+  await callProvider()
+} catch (e) {
+  if (e instanceof ElsiumError && e.retryable) {
+    // safe to retry
+  }
+}
+// Create specific errors
+const err = ElsiumError.rateLimit('anthropic', 5000)
+console.log(err.code)           // 'RATE_LIMIT'
+console.log(err.retryAfterMs)   // 5000
+```
+---
+## Result
+A type-safe Result pattern for representing success/failure without exceptions.
+### Types
+```ts
+type Result<T, E = Error> = Ok<T> | Err<E>
+interface Ok<T> {
+  readonly ok: true
+  readonly value: T
+}
+interface Err<E> {
+  readonly ok: false
+  readonly error: E
+}
+```
+### ok()
+Wraps a value in a success result.
+```ts
+function ok<T>(value: T): Ok<T>
+```
+### err()
+Wraps an error in a failure result.
+```ts
+function err<E>(error: E): Err<E>
+```
+### isOk()
+Type guard that narrows a `Result` to `Ok`.
+```ts
+function isOk<T, E>(result: Result<T, E>): result is Ok<T>
+```
+### isErr()
+Type guard that narrows a `Result` to `Err`.
+```ts
+function isErr<T, E>(result: Result<T, E>): result is Err<E>
+```
+### unwrap()
+Extracts the value from an `Ok`, or throws the error from an `Err`.
+```ts
+function unwrap<T, E>(result: Result<T, E>): T
+```
+### unwrapOr()
+Extracts the value from an `Ok`, or returns the fallback for an `Err`.
+```ts
+function unwrapOr<T, E>(result: Result<T, E>, fallback: T): T
+```
+### tryCatch()
+Wraps an async function in a `Result`. Caught errors are normalized to `Error`.
+```ts
+function tryCatch<T>(fn: () => Promise<T>): Promise<Result<T, Error>>
+```
+### tryCatchSync()
+Synchronous version of `tryCatch`.
+```ts
+function tryCatchSync<T>(fn: () => T): Result<T, Error>
+```
+```ts
+import { ok, err, isOk, unwrap, unwrapOr, tryCatch } from '@elsium-ai/core'
+// Manual construction
+const success = ok(42)
+const failure = err(new Error('boom'))
+if (isOk(success)) {
+  console.log(success.value) // 42
+}
+// Safe unwrap with fallback
+unwrapOr(failure, 0) // 0
+// Wrap async operations
+const result = await tryCatch(() => fetch('/api/data').then(r => r.json()))
+if (isOk(result)) {
+  console.log(result.value)
+}
+```
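Note (not part of the published diff): the Result helpers documented above are small enough to sketch locally. The block below is a minimal re-implementation that follows the documented signatures. It is an illustration under that assumption, not the package's source, and the `JSON.parse` inputs are made up for the example.

```typescript
// Sketch only: mirrors the documented signatures, not the package's actual code.
type Ok<T> = { readonly ok: true; readonly value: T }
type Err<E> = { readonly ok: false; readonly error: E }
type Result<T, E = Error> = Ok<T> | Err<E>

const ok = <T>(value: T): Ok<T> => ({ ok: true, value })
const err = <E>(error: E): Err<E> => ({ ok: false, error })

// Returns the wrapped value, or the fallback when the result is an Err
function unwrapOr<T, E>(result: Result<T, E>, fallback: T): T {
  return result.ok ? result.value : fallback
}

// Runs fn and converts a thrown value into an Err instead of letting it propagate
function tryCatchSync<T>(fn: () => T): Result<T, Error> {
  try {
    return ok(fn())
  } catch (e) {
    // Non-Error throwables are normalized to Error, as the README describes
    return err(e instanceof Error ? e : new Error(String(e)))
  }
}

const parsed = tryCatchSync(() => JSON.parse('{"n": 1}') as { n: number })
const broken = tryCatchSync(() => JSON.parse('not json') as { n: number })

const parsedN = unwrapOr(parsed, { n: 0 }).n    // value from the Ok branch
const fallbackN = unwrapOr(broken, { n: 0 }).n  // fallback from the Err branch
```

Because `ok` is a literal `true` or `false` discriminant, checking `result.ok` narrows the union without a separate type guard.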
+---
+## Stream
+### StreamTransformer
+A function that transforms a stream of events into another stream of events.
+```ts
+type StreamTransformer = (
+  source: AsyncIterable<StreamEvent>,
+) => AsyncIterable<StreamEvent>
+```
+### ResilientStreamOptions
+```ts
+interface ResilientStreamOptions {
+  checkpointIntervalMs?: number                          // default: 1000
+  maxRetries?: number
+  onCheckpoint?: (checkpoint: StreamCheckpoint) => void
+  onPartialRecovery?: (text: string, error: Error) => void
+}
+```
+### ElsiumStream
+An `AsyncIterable<StreamEvent>` wrapper with convenience methods for consuming and transforming LLM streams. Supports only a single consumer — iterating twice throws.
+```ts
+class ElsiumStream implements AsyncIterable<StreamEvent> {
+  constructor(source: AsyncIterable<StreamEvent>)
+}
+```
+#### `stream.text()`
+Returns an `AsyncIterable<string>` that yields only the text deltas.
+```ts
+text(): AsyncIterable<string>
+```
+#### `stream.toText()`
+Collects all text deltas and returns the full text.
+```ts
+async toText(): Promise<string>
+```
+#### `stream.toTextWithTimeout(timeoutMs)`
+Like `toText()` but stops collecting after `timeoutMs` milliseconds. Returns whatever text was collected before the deadline.
+```ts
+async toTextWithTimeout(timeoutMs: number): Promise<string>
+```
+#### `stream.toResponse()`
+Collects the full text, token usage, and stop reason from the stream.
+```ts
+async toResponse(): Promise<{
+  text: string
+  usage: TokenUsage | null
+  stopReason: StopReason | null
+}>
+```
+#### `stream.pipe(transform)`
+Creates a new `ElsiumStream` by piping events through a `StreamTransformer`.
+```ts
+pipe(transform: StreamTransformer): ElsiumStream
+```
+#### `stream.resilient(options?)`
+Wraps the stream with checkpoint and partial-recovery support. Periodically emits `checkpoint` events and, on error, emits a `recovery` event containing whatever text was received before the failure.
+```ts
+resilient(options?: ResilientStreamOptions): ElsiumStream
+```
+### createStream()
+Creates an `ElsiumStream` from an imperative callback. The `emit` function pushes events into a buffered async iterable (max 10,000 events).
+```ts
+function createStream(
+  executor: (emit: (event: StreamEvent) => void) => Promise<void>,
+): ElsiumStream
+```
+```ts
+import { ElsiumStream, createStream } from '@elsium-ai/core'
+// Create a stream from an imperative source
+const stream = createStream(async (emit) => {
+  emit({ type: 'message_start', id: 'msg_1', model: 'claude-sonnet-4-6' })
+  emit({ type: 'text_delta', text: 'Hello ' })
+  emit({ type: 'text_delta', text: 'world!' })
+  emit({ type: 'message_end', usage: { inputTokens: 10, outputTokens: 5, totalTokens: 15 }, stopReason: 'end_turn' })
+})
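+// Consume as full text. An ElsiumStream supports a single consumer, so the snippets below are alternatives, each needing its own fresh stream.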
+const text = await stream.toText() // "Hello world!"
+// Or iterate text deltas
+for await (const chunk of stream.text()) {
+  process.stdout.write(chunk)
+}
+// Add resilience with checkpoints
+const resilient = stream.resilient({
+  checkpointIntervalMs: 500,
+  onCheckpoint: (cp) => console.log('checkpoint:', cp.text.length, 'chars'),
+})
+// Pipe through a transform
+const filtered = stream.pipe(async function* (source) {
+  for await (const event of source) {
+    if (event.type !== 'checkpoint') yield event
+  }
+})
+```
+---
+## Logger
+### LogLevel
+```ts
+type LogLevel = 'debug' | 'info' | 'warn' | 'error'
+```
+### LogEntry
+```ts
+interface LogEntry {
+  level: LogLevel
+  message: string
+  timestamp: string
+  traceId?: string
+  data?: Record<string, unknown>
+}
+```
+### LoggerOptions
+```ts
+interface LoggerOptions {
+  level?: LogLevel                    // default: 'info'
+  pretty?: boolean                    // default: false (JSON single-line)
+  context?: Record<string, unknown>   // merged into every entry
+}
+```
+### Logger
+```ts
+interface Logger {
+  debug(message: string, data?: Record<string, unknown>): void
+  info(message: string, data?: Record<string, unknown>): void
+  warn(message: string, data?: Record<string, unknown>): void
+  error(message: string, data?: Record<string, unknown>): void
+  child(context: Record<string, unknown>): Logger
+}
+```
+### createLogger()
+Creates a structured JSON logger. Messages below the configured level are silently dropped. `error` and `warn` go to `console.error`/`console.warn`; everything else goes to `console.log`.
+```ts
+function createLogger(options?: LoggerOptions): Logger
+```
+```ts
+import { createLogger } from '@elsium-ai/core'
+const logger = createLogger({ level: 'debug', pretty: true })
+logger.info('server started', { port: 3000 })
+// {"level":"info","message":"server started","timestamp":"...","data":{"port":3000}}
+const child = logger.child({ traceId: 'trc_abc123' })
+child.warn('slow response', { latencyMs: 4200 })
+// includes traceId in every entry
+```
+---
+## Config
+Type-safe environment variable access. All three functions throw an `ElsiumError` with code `CONFIG_ERROR` when the variable is missing and no fallback is provided.
+### env()
+Returns a string environment variable, or the fallback, or throws.
+```ts
+function env(name: string, fallback?: string): string
+```
+### envNumber()
+Parses the variable as a finite number. Throws if the value is not a valid finite number.
+```ts
+function envNumber(name: string, fallback?: number): number
+```
+### envBool()
+Parses the variable as a boolean. `'true'`, `'1'`, and `'yes'` (case-insensitive) are truthy; everything else is falsy.
+```ts
+function envBool(name: string, fallback?: boolean): boolean
+```
+```ts
+import { env, envNumber, envBool } from '@elsium-ai/core'
+const apiKey = env('ANTHROPIC_API_KEY')               // throws if missing
+const port = envNumber('PORT', 3000)                  // 3000 if unset
+const debug = envBool('DEBUG', false)                 // false if unset
+```
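Note (not part of the published diff): the `envBool()` parsing rule described above can be sketched as a pure function. `parseEnvBool` and `sampleEnv` are hypothetical names, the helper reads from a plain record rather than `process.env`, and a plain `Error` stands in for the documented `ElsiumError` with code `CONFIG_ERROR`.

```typescript
// Hypothetical helper mirroring the documented rule; not the package's implementation.
function parseEnvBool(
  vars: Record<string, string | undefined>,
  name: string,
  fallback?: boolean,
): boolean {
  const raw = vars[name]
  if (raw === undefined) {
    if (fallback !== undefined) return fallback
    // Stand-in for the documented ElsiumError with code CONFIG_ERROR
    throw new Error(`Missing environment variable: ${name}`)
  }
  // 'true', '1' and 'yes' are truthy regardless of case; everything else is falsy
  return ['true', '1', 'yes'].includes(raw.toLowerCase())
}

const sampleEnv = { DEBUG: 'YES', VERBOSE: 'on' }

const debugOn = parseEnvBool(sampleEnv, 'DEBUG')             // 'YES' is truthy
const verboseOn = parseEnvBool(sampleEnv, 'VERBOSE')         // 'on' is not in the truthy list
const missingFlag = parseEnvBool(sampleEnv, 'MISSING', false) // falls back
```

Note that values such as `'on'` or `'enabled'` parse as `false` under this rule, which is easy to overlook when migrating existing configuration.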
+---
+## Utilities
+### generateId()
+Generates a unique ID with an optional prefix, using timestamp + 4 random bytes.
+```ts
+function generateId(prefix?: string): string   // default prefix: 'els'
+```
+Returns a string like `els_m1abc2d_8f3e1a2b`.
+### generateTraceId()
+Generates a trace ID using timestamp + 6 random bytes. Always prefixed with `trc_`.
+```ts
+function generateTraceId(): string
+```
+Returns a string like `trc_m1abc2d_8f3e1a2b4c5d`.
+### extractText()
+Extracts plain text from a `Message.content` field, handling both the `string` and `ContentPart[]` forms.
+```ts
+function extractText(content: string | { type: string; text?: string }[]): string
+```
+### sleep()
+Returns a promise that resolves after `ms` milliseconds.
+```ts
+function sleep(ms: number): Promise<void>
+```
+### retry()
+Retries an async function with exponential backoff and jitter. Respects `retryAfterMs` on errors (e.g., `ElsiumError` rate limits).
+```ts
+function retry<T>(
+  fn: () => Promise<T>,
+  options?: {
+    maxRetries?: number                         // default: 3
+    baseDelayMs?: number                        // default: 1000
+    maxDelayMs?: number                         // default: 30000
+    shouldRetry?: (error: unknown) => boolean   // default: checks error.retryable
+  },
+): Promise<T>
+```
+| Parameter | Type | Default | Description |
+|---|---|---|---|
+| `fn` | `() => Promise<T>` | — | The async operation to retry |
+| `options.maxRetries` | `number` | `3` | Maximum number of retry attempts |
+| `options.baseDelayMs` | `number` | `1000` | Base delay for exponential backoff |
+| `options.maxDelayMs` | `number` | `30000` | Maximum delay cap |
+| `options.shouldRetry` | `(error: unknown) => boolean` | checks `error.retryable` | Predicate to decide whether to retry |
+```ts
+import { retry, generateId, generateTraceId, extractText, sleep } from '@elsium-ai/core'
+const id = generateId()             // "els_m1abc2d_8f3e1a2b"
+const traceId = generateTraceId()   // "trc_m1abc2d_8f3e1a2b4c5d"
+// Extract text from either content format
+extractText('hello')                             // "hello"
+extractText([{ type: 'text', text: 'hello' }])  // "hello"
+// Retry with custom options (up to 5 retries, retrying on any Error)
+const data = await retry(() => fetchFromProvider(), {
+  maxRetries: 5,
+  shouldRetry: (err) => err instanceof Error,
+})
+```
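Note (not part of the published diff): the README states that `retry()` uses exponential backoff with jitter and respects `retryAfterMs`, but it does not give the formula. The sketch below shows one common way to compute such a delay. The "equal jitter" variant, the precedence of `retryAfterMs`, and the `random` injection point are all assumptions made for illustration.

```typescript
interface BackoffOptions {
  baseDelayMs?: number   // README default: 1000
  maxDelayMs?: number    // README default: 30000
  random?: () => number  // hypothetical hook so the jitter can be made deterministic
}

// attempt is zero-based: 0 for the first retry, 1 for the second, and so on
function backoffDelayMs(
  attempt: number,
  retryAfterMs?: number,
  opts: BackoffOptions = {},
): number {
  const { baseDelayMs = 1000, maxDelayMs = 30000, random = Math.random } = opts
  // Assumed behaviour: an explicit server hint replaces the computed delay
  if (retryAfterMs !== undefined) return retryAfterMs
  // Double the delay on each attempt, capped at maxDelayMs
  const exponential = Math.min(maxDelayMs, baseDelayMs * 2 ** attempt)
  // Equal jitter (assumed variant): keep half the delay, randomize the other half
  return exponential / 2 + random() * (exponential / 2)
}

// With random fixed at 0 the result is the lower bound of each jitter range
const lowerBounds = [0, 1, 2, 3].map((a) => backoffDelayMs(a, undefined, { random: () => 0 }))
// With random fixed at 1 the result is the upper bound, which hits the cap here
const capped = backoffDelayMs(10, undefined, { random: () => 1 })
const hinted = backoffDelayMs(0, 5000)
```

Jitter spreads retries from many clients over time so that they do not all hit a recovering provider at the same instant.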
+---
+## Circuit Breaker
+Monitors failures within a sliding time window and stops sending traffic to a failing service. Automatically recovers via the half-open state.
+### CircuitState
+```ts
+type CircuitState = 'closed' | 'open' | 'half-open'
+```
+State machine: **closed** (healthy) → **open** (tripped after `failureThreshold` failures within `windowMs`) → **half-open** (probing after `resetTimeoutMs`) → **closed** (on success) or back to **open** (on failure).
+### CircuitBreakerConfig
+```ts
+interface CircuitBreakerConfig {
+  failureThreshold?: number                                 // default: 5
+  resetTimeoutMs?: number                                   // default: 30000
+  halfOpenMaxAttempts?: number                               // default: 3
+  windowMs?: number                                          // default: 60000
+  onStateChange?: (from: CircuitState, to: CircuitState) => void
+  shouldCount?: (error: unknown) => boolean                  // default: checks error.retryable, or true for unknown errors
+}
+```
+| Parameter | Type | Default | Description |
+|---|---|---|---|
+| `failureThreshold` | `number` | `5` | Failures within `windowMs` before opening |
+| `resetTimeoutMs` | `number` | `30000` | Time in open state before probing (half-open) |
+| `halfOpenMaxAttempts` | `number` | `3` | Max concurrent probes in half-open state |
+| `windowMs` | `number` | `60000` | Sliding window for counting failures |
+| `onStateChange` | `(from, to) => void` | — | Callback on state transitions |
+| `shouldCount` | `(error) => boolean` | checks `error.retryable`; `true` for unknown errors | Predicate to decide if an error counts as a failure |
+### CircuitBreaker
+```ts
+interface CircuitBreaker {
+  execute<T>(fn: () => Promise<T>): Promise<T>
+  readonly state: CircuitState
+  readonly failureCount: number
+  reset(): void
+}
+```
+| Member | Description |
+|---|---|
+| `execute(fn)` | Runs `fn` if the circuit is closed or half-open. Throws `ElsiumError` with code `PROVIDER_ERROR` if open. |
+| `state` | Current state. Accessing this may trigger an open → half-open transition if `resetTimeoutMs` has elapsed. |
+| `failureCount` | Number of failures within the current window. |
+| `reset()` | Manually resets to the closed state and clears failure counts. |
+### createCircuitBreaker()
+```ts
+function createCircuitBreaker(config?: CircuitBreakerConfig): CircuitBreaker
+```
+```ts
+import { createCircuitBreaker } from '@elsium-ai/core'
+const breaker = createCircuitBreaker({
+  failureThreshold: 3,
+  resetTimeoutMs: 10_000,
+  onStateChange: (from, to) => console.log(`circuit: ${from} → ${to}`),
+})
+const result = await breaker.execute(() => callProvider())
+console.log(breaker.state)         // 'closed'
+console.log(breaker.failureCount)  // 0
+```
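Note (not part of the published diff): the closed, open, half-open state machine described above can be sketched with synchronous bookkeeping and an injectable clock. `SketchBreaker`, `recordSuccess`, `recordFailure`, and the `now` option are hypothetical names. The sketch leaves out `halfOpenMaxAttempts`, `shouldCount`, and `execute()`, so it is an illustration of the transitions rather than the package's implementation.

```typescript
type CircuitState = 'closed' | 'open' | 'half-open'

interface SketchBreakerOptions {
  failureThreshold?: number  // README default: 5
  resetTimeoutMs?: number    // README default: 30000
  windowMs?: number          // README default: 60000
  now?: () => number         // hypothetical clock injection for deterministic runs
}

class SketchBreaker {
  private current: CircuitState = 'closed'
  private failures: number[] = []  // timestamps of failures inside the window
  private openedAt = 0
  private readonly failureThreshold: number
  private readonly resetTimeoutMs: number
  private readonly windowMs: number
  private readonly now: () => number

  constructor(opts: SketchBreakerOptions = {}) {
    this.failureThreshold = opts.failureThreshold ?? 5
    this.resetTimeoutMs = opts.resetTimeoutMs ?? 30000
    this.windowMs = opts.windowMs ?? 60000
    this.now = opts.now ?? (() => Date.now())
  }

  get state(): CircuitState {
    // Reading the state moves open -> half-open once the reset timeout has elapsed
    if (this.current === 'open' && this.now() - this.openedAt >= this.resetTimeoutMs) {
      this.current = 'half-open'
    }
    return this.current
  }

  recordSuccess(): void {
    // A successful probe in half-open closes the circuit and clears the history
    if (this.state === 'half-open') {
      this.current = 'closed'
      this.failures = []
    }
  }

  recordFailure(): void {
    const t = this.now()
    const s = this.state
    if (s === 'open') return  // calls are rejected while open, nothing to count
    if (s === 'half-open') {
      // A failed probe re-opens the circuit immediately
      this.current = 'open'
      this.openedAt = t
      return
    }
    // Drop failures that fell out of the sliding window, then count this one
    this.failures = this.failures.filter((ts) => t - ts < this.windowMs)
    this.failures.push(t)
    if (this.failures.length >= this.failureThreshold) {
      this.current = 'open'
      this.openedAt = t
    }
  }
}

// Drive the breaker through one full cycle with a fake clock
let fakeNow = 0
const sketch = new SketchBreaker({ failureThreshold: 2, resetTimeoutMs: 1000, now: () => fakeNow })
const seenStates: CircuitState[] = [sketch.state]
sketch.recordFailure()
sketch.recordFailure()
seenStates.push(sketch.state)
fakeNow = 1000
seenStates.push(sketch.state)
sketch.recordSuccess()
seenStates.push(sketch.state)
```

Injecting the clock keeps the transitions testable without real timers, which is the main design choice in this sketch.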
+---
+## Request Dedup
+Coalesces identical in-flight requests into a single execution and caches results for a short TTL.
+### DedupConfig
+```ts
+interface DedupConfig {
+  ttlMs?: number       // default: 5000
+  maxEntries?: number  // default: 1000
+}
+```
+### Dedup
+```ts
+interface Dedup<T> {
+  deduplicate(key: string, fn: () => Promise<T>): Promise<T>
+  hashRequest(request: unknown): string
+  readonly size: number
+  clear(): void
+}
+```
+| Member | Description |
+|---|---|
+| `deduplicate(key, fn)` | Returns a cached result if within TTL, joins an in-flight request if one exists for `key`, or executes `fn`. |
+| `hashRequest(request)` | Deterministic SHA-256 hash (first 16 hex chars) of a JSON-serializable object. Handles key ordering. |
+| `size` | Number of cached + in-flight entries (expired entries are evicted on access). |
+| `clear()` | Clears all cached and in-flight entries. |
+### createDedup()
+```ts
+function createDedup<T>(config?: DedupConfig): Dedup<T>
+```
+### dedupMiddleware()
+Returns a `Middleware` that deduplicates LLM requests based on their messages, model, provider, and key completion parameters.
+```ts
+function dedupMiddleware(config?: DedupConfig): Middleware
+```
+```ts
+import { createDedup, dedupMiddleware } from '@elsium-ai/core'
+// Standalone usage
+const dedup = createDedup<string>({ ttlMs: 10_000 })
+const key = dedup.hashRequest({ model: 'claude-sonnet-4-6', messages: [...] })
+const result = await dedup.deduplicate(key, () => expensive())
+// As middleware — identical concurrent requests share one API call
+const middleware = dedupMiddleware({ ttlMs: 3000 })
+```
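Note (not part of the published diff): two ideas from this section can be sketched in isolation, namely a key that is stable under property reordering and the coalescing of in-flight calls. `stableStringify` and `createInFlightDedup` are hypothetical names. The sketch omits the SHA-256 hashing, the TTL cache, and `maxEntries`, so it only approximates the documented behaviour.

```typescript
// Serializes JSON-compatible values with object keys sorted, so key order does not matter
function stableStringify(value: unknown): string {
  if (Array.isArray(value)) return `[${value.map(stableStringify).join(',')}]`
  if (value !== null && typeof value === 'object') {
    const obj = value as Record<string, unknown>
    const body = Object.keys(obj)
      .sort()
      .map((k) => `${JSON.stringify(k)}:${stableStringify(obj[k])}`)
      .join(',')
    return `{${body}}`
  }
  return JSON.stringify(value)
}

// Coalesces concurrent calls that share a key into one underlying promise
function createInFlightDedup<T>() {
  const inFlight = new Map<string, Promise<T>>()
  return {
    deduplicate(key: string, fn: () => Promise<T>): Promise<T> {
      const existing = inFlight.get(key)
      if (existing) return existing  // join the call that is already running
      // Forget the entry once the call settles so later calls run again
      const pending = fn().finally(() => inFlight.delete(key))
      inFlight.set(key, pending)
      return pending
    },
    get size(): number {
      return inFlight.size
    },
  }
}

let providerCalls = 0
const work = () => {
  providerCalls += 1
  return Promise.resolve('done')
}

// Same request content, different property order
const keyA = stableStringify({ model: 'm', messages: [{ role: 'user', content: 'hi' }] })
const keyB = stableStringify({ messages: [{ content: 'hi', role: 'user' }], model: 'm' })

const inFlightDedup = createInFlightDedup<string>()
const firstCall = inFlightDedup.deduplicate(keyA, work)
const secondCall = inFlightDedup.deduplicate(keyB, work)  // joins firstCall, work is not run again
```

Returning the stored promise itself, rather than wrapping it, is what lets both callers observe the same result or the same rejection.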
+---
+## Policy Engine
+Declarative rules to allow or deny LLM requests before they reach a provider.
+### PolicyDecision
+```ts
+type PolicyDecision = 'allow' | 'deny'
+```
+### PolicyResult
+```ts
+interface PolicyResult {
+  decision: PolicyDecision
+  reason: string
+  policyName: string
+}
+```
+### PolicyContext
+The evaluation context passed to every policy rule.
+```ts
+interface PolicyContext {
+  model?: string
+  provider?: string
+  actor?: string
+  role?: string
+  tokenCount?: number
+  costEstimate?: number
+  requestContent?: string
+  metadata?: Record<string, unknown>
+}
+```
+### PolicyRule
+```ts
+type PolicyRule = (ctx: PolicyContext) => PolicyResult
+```
+### PolicyConfig
+```ts
+interface PolicyConfig {
+  name: string
+  description?: string
+  rules: PolicyRule[]
+  mode?: 'all-must-pass' | 'any-must-pass'   // default: 'all-must-pass'
+}
+```
+### PolicySet
+```ts
+interface PolicySet {
+  evaluate(ctx: PolicyContext): PolicyResult[]
+  addPolicy(policy: PolicyConfig): void
+  removePolicy(name: string): void
+  readonly policies: string[]
+}
+```
+| Member | Description |
+|---|---|
+| `evaluate(ctx)` | Runs all policy rules and returns an array of **denials only** (empty = all passed). |
+| `addPolicy(policy)` | Adds a policy at runtime. |
+| `removePolicy(name)` | Removes a policy by name. |
+| `policies` | List of currently registered policy names. |
+### createPolicySet()
+```ts
+function createPolicySet(policies: PolicyConfig[]): PolicySet
+```
+### policyMiddleware()
+Returns a `Middleware` that evaluates all policies before forwarding the request. Throws `ElsiumError` with code `VALIDATION_ERROR` if any policy denies.
+```ts
+function policyMiddleware(policySet: PolicySet): Middleware
+```
+### Built-in policy factories
+#### modelAccessPolicy()
+Restricts requests to a list of allowed models. Supports glob-style trailing wildcards (e.g., `'claude-*'`).
+```ts
+function modelAccessPolicy(allowedModels: string[]): PolicyConfig
+```
+#### tokenLimitPolicy()
+Denies requests whose estimated token count exceeds `maxTokens`.
+```ts
+function tokenLimitPolicy(maxTokens: number): PolicyConfig
+```
+#### costLimitPolicy()
+Denies requests whose estimated cost exceeds `maxCost`.
+```ts
+function costLimitPolicy(maxCost: number): PolicyConfig
+```
+#### contentPolicy()
+Denies requests whose content matches any of the provided regex patterns.
+```ts
+function contentPolicy(blockedPatterns: RegExp[]): PolicyConfig
+```
+```ts
 import {
-  createCircuitBreaker,
   createPolicySet,
+  policyMiddleware,
   modelAccessPolicy,
+  tokenLimitPolicy,
   costLimitPolicy,
-  policyMiddleware,
-  env,
+  contentPolicy,
 } from '@elsium-ai/core'
-// Circuit breaker
-const cb = createCircuitBreaker({ failureThreshold: 5, resetTimeoutMs: 30_000 })
-const result = await cb.execute(() => fetchFromProvider())
-// Policy engine
 const policies = createPolicySet([
   modelAccessPolicy(['claude-sonnet-4-6', 'gpt-4o']),
+  tokenLimitPolicy(100_000),
   costLimitPolicy(5.00),
+  contentPolicy([/password/i, /secret_key/i]),
 ])
+// Check manually
+const denials = policies.evaluate({ model: 'unknown-model' })
+// [{ decision: 'deny', reason: 'Model "unknown-model" is not in allowed list', policyName: 'model-access' }]
+// Or use as middleware
+const middleware = policyMiddleware(policies)
+```
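Note (not part of the published diff): the "denials only" contract of `evaluate()` and the trailing-wildcard model match can be sketched as follows. The helper names, the reason strings for the allow case, the handling of missing context fields, and the `any-must-pass` semantics are assumptions for illustration, since the README does not spell them out.

```typescript
type PolicyDecision = 'allow' | 'deny'
interface PolicyResult { decision: PolicyDecision; reason: string; policyName: string }
interface PolicyContext { model?: string; tokenCount?: number }
type PolicyRule = (ctx: PolicyContext) => PolicyResult
interface PolicyConfig {
  name: string
  rules: PolicyRule[]
  mode?: 'all-must-pass' | 'any-must-pass'
}

// Trailing-wildcard match, e.g. 'claude-*' matches 'claude-sonnet-4-6'
function matchesModel(pattern: string, model: string): boolean {
  return pattern.endsWith('*') ? model.startsWith(pattern.slice(0, -1)) : pattern === model
}

function sketchModelAccess(allowed: string[]): PolicyConfig {
  const rule: PolicyRule = (ctx) => {
    const model = ctx.model
    // Assumption: a context without a model is not denied by this rule
    const permitted = model === undefined || allowed.some((p) => matchesModel(p, model))
    return permitted
      ? { decision: 'allow', reason: 'model allowed', policyName: 'model-access' }
      : { decision: 'deny', reason: `Model "${model}" is not in allowed list`, policyName: 'model-access' }
  }
  return { name: 'model-access', rules: [rule] }
}

function sketchTokenLimit(maxTokens: number): PolicyConfig {
  const rule: PolicyRule = (ctx) =>
    (ctx.tokenCount ?? 0) > maxTokens
      ? { decision: 'deny', reason: `Token count exceeds ${maxTokens}`, policyName: 'token-limit' }
      : { decision: 'allow', reason: 'within token limit', policyName: 'token-limit' }
  return { name: 'token-limit', rules: [rule] }
}

// Returns only the denials; an empty array means every policy passed
function evaluatePolicies(policies: PolicyConfig[], ctx: PolicyContext): PolicyResult[] {
  const denials: PolicyResult[] = []
  for (const policy of policies) {
    const results = policy.rules.map((rule) => rule(ctx))
    const denied = results.filter((r) => r.decision === 'deny')
    const anyAllowed = denied.length < results.length
    // Assumed semantics: 'any-must-pass' tolerates denials as long as one rule allows
    if (policy.mode === 'any-must-pass' && anyAllowed) continue
    denials.push(...denied)
  }
  return denials
}

const sketchPolicies = [sketchModelAccess(['claude-*', 'gpt-4o']), sketchTokenLimit(100000)]
const allowedCase = evaluatePolicies(sketchPolicies, { model: 'claude-sonnet-4-6', tokenCount: 500 })
const deniedCase = evaluatePolicies(sketchPolicies, { model: 'unknown-model', tokenCount: 200000 })
```

A middleware built on this shape would only need to check whether the returned array is empty before forwarding the request.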
+---
+## Shutdown Manager
+Tracks in-flight operations and drains them before process exit. Automatically registers signal handlers for `SIGTERM` and `SIGINT`.
+### ShutdownConfig
+```ts
+interface ShutdownConfig {
+  drainTimeoutMs?: number         // default: 30000
+  signals?: string[]              // default: ['SIGTERM', 'SIGINT']
+  onDrainStart?: () => void
+  onDrainComplete?: () => void
+  onForceShutdown?: () => void
+}
+```
+| Parameter | Type | Default | Description |
+|---|---|---|---|
+| `drainTimeoutMs` | `number` | `30000` | Max time to wait for in-flight operations to finish |
+| `signals` | `string[]` | `['SIGTERM', 'SIGINT']` | OS signals that trigger shutdown |
+| `onDrainStart` | `() => void` | — | Called when drain begins |
+| `onDrainComplete` | `() => void` | — | Called when all operations finish within timeout |
+| `onForceShutdown` | `() => void` | — | Called when drain timeout expires |
+### ShutdownManager
+```ts
+interface ShutdownManager {
+  trackOperation<T>(fn: () => Promise<T>): Promise<T>
+  shutdown(): Promise<void>
+  dispose(): void
+  readonly inFlight: number
+  readonly isShuttingDown: boolean
+}
+```
+| Member | Description |
+|---|---|
+| `trackOperation(fn)` | Executes `fn` while tracking it as in-flight. Throws `ElsiumError` with code `VALIDATION_ERROR` if already shutting down. |
+| `shutdown()` | Initiates graceful shutdown. Waits for in-flight operations up to `drainTimeoutMs`. Idempotent — multiple calls return the same promise. |
+| `dispose()` | Removes all registered signal handlers. Call this in tests to prevent leaks. |
+| `inFlight` | Number of currently tracked operations. |
+| `isShuttingDown` | `true` after `shutdown()` is called. |
+### createShutdownManager()
+```ts
+function createShutdownManager(config?: ShutdownConfig): ShutdownManager
 ```
+```ts
+import { createShutdownManager } from '@elsium-ai/core'
+const shutdown = createShutdownManager({
+  drainTimeoutMs: 10_000,
+  onDrainStart: () => console.log('draining...'),
+  onDrainComplete: () => console.log('drained, exiting'),
+  onForceShutdown: () => console.log('force shutdown!'),
+})
+// Wrap every request
+const result = await shutdown.trackOperation(() => handleRequest())
+console.log(shutdown.inFlight) // 0 after completion
+// Cleanup in tests
+shutdown.dispose()
+```
+---
 ## Part of ElsiumAI
 This package is the foundation layer of the [ElsiumAI](https://github.com/elsium-ai/elsium-ai) framework. See the [full documentation](https://github.com/elsium-ai/elsium-ai) for guides and examples.

package/package.json CHANGED

@@ -1,12 +1,12 @@
 {
 	"name": "@elsium-ai/core",
-	"version": "0.2.0",
+	"version": "0.2.2",
 	"description": "Core types, schemas, errors, and utilities for ElsiumAI",
 	"license": "MIT",
 	"author": "Eric Utrera <ebutrera9103@gmail.com>",
 	"repository": {
 		"type": "git",
-		"url": "https://github.com/elsium-ai/elsium-ai",
+		"url": "git+https://github.com/elsium-ai/elsium-ai.git",
 		"directory": "packages/core"
 	},
 	"type": "module",
@@ -30,5 +30,9 @@
 	},
 	"devDependencies": {
 		"typescript": "^5.7.0"
+	},
+	"publishConfig": {
+		"registry": "https://registry.npmjs.org",
+		"access": "public"
 	}
 }