llm-cost-meter 0.1.0
- package/CHANGELOG.md +78 -0
- package/LICENSE +21 -0
- package/README.md +469 -0
- package/dashboard/index.html +544 -0
- package/dist/adapters/console.d.ts +5 -0
- package/dist/adapters/console.d.ts.map +1 -0
- package/dist/adapters/console.js +16 -0
- package/dist/adapters/console.js.map +1 -0
- package/dist/adapters/index.d.ts +9 -0
- package/dist/adapters/index.d.ts.map +1 -0
- package/dist/adapters/index.js +29 -0
- package/dist/adapters/index.js.map +1 -0
- package/dist/adapters/local.d.ts +11 -0
- package/dist/adapters/local.d.ts.map +1 -0
- package/dist/adapters/local.js +72 -0
- package/dist/adapters/local.js.map +1 -0
- package/dist/cli.d.ts +2 -0
- package/dist/cli.d.ts.map +1 -0
- package/dist/cli.js +200 -0
- package/dist/cli.js.map +1 -0
- package/dist/dashboard/server.js +135 -0
- package/dist/index.d.ts +65 -0
- package/dist/index.d.ts.map +1 -0
- package/dist/index.js +291 -0
- package/dist/index.js.map +1 -0
- package/dist/pricing/anthropic.json +47 -0
- package/dist/pricing/index.d.ts +28 -0
- package/dist/pricing/index.d.ts.map +1 -0
- package/dist/pricing/index.js +92 -0
- package/dist/pricing/index.js.map +1 -0
- package/dist/pricing/openai.json +67 -0
- package/dist/reporters/csv.d.ts +3 -0
- package/dist/reporters/csv.d.ts.map +1 -0
- package/dist/reporters/csv.js +14 -0
- package/dist/reporters/csv.js.map +1 -0
- package/dist/reporters/json.d.ts +3 -0
- package/dist/reporters/json.d.ts.map +1 -0
- package/dist/reporters/json.js +25 -0
- package/dist/reporters/json.js.map +1 -0
- package/dist/reporters/summary.d.ts +4 -0
- package/dist/reporters/summary.d.ts.map +1 -0
- package/dist/reporters/summary.js +85 -0
- package/dist/reporters/summary.js.map +1 -0
- package/dist/types.d.ts +89 -0
- package/dist/types.d.ts.map +1 -0
- package/dist/types.js +2 -0
- package/dist/types.js.map +1 -0
- package/package.json +82 -0
package/CHANGELOG.md
ADDED
@@ -0,0 +1,78 @@
# Changelog

## 0.1.0 (2026-04-05)

Initial release of llm-cost-meter.

### Core Features

- `meter()` wrapper function — wrap any LLM API call to track cost, tokens, and latency
- `CostMeter` class for instance-level configuration (multiple meters, separate pipelines)
- `configure()` / `resetConfig()` for global configuration with merge semantics
- Auto-detection of OpenAI and Anthropic response formats (no explicit provider needed)
- Tagging system: `feature`, `userId`, `sessionId`, `env`, and arbitrary custom `tags`
- `awaitWrites` option for guaranteed event persistence on critical paths
- `flush()` for draining pending writes before process shutdown

### Pricing

- Built-in pricing tables for 22 models:
  - **Anthropic**: Claude Opus 4, Sonnet 4, Haiku 4.5, Claude 3.5 Sonnet/Haiku, Claude 3 Opus/Sonnet/Haiku
  - **OpenAI**: GPT-4o, GPT-4o-mini, GPT-4-turbo, GPT-4, GPT-3.5-turbo, o1, o1-mini, o3, o3-mini (including dated variants)
- `configurePricing()` — add custom or fine-tuned model pricing at runtime
- `removePricing()` — remove a model's pricing entry
- `setPricingTable()` — set an entire provider's pricing at once
- Unknown model warnings with `warnOnMissingModel` flag (default: true)

### Adapters

- Console adapter — prints cost per call to stdout (ideal for development)
- Local file adapter — appends events as NDJSON with async write queue
  - Sequential write queue prevents file corruption from concurrent calls
  - Automatic retry (1 retry after 100ms) on write failure
  - Queue recovers from errors instead of stalling
- Bring-your-own adapter via `CostAdapter` interface (with optional `flush()`)

### Error Handling & Observability

- `onError` callback — notified when adapter writes fail (replaces silent swallowing)
- Failed LLM calls tracked as `status: 'error'` events with `errorMessage`
- `getMeterStats()` — monitor eventsTracked, eventsDropped, adapterErrors, unknownModels
- `resetStats()` — reset counters for test isolation
- Memory-safe: unknownModels and warnedModels Sets capped at 1000 entries
- `getAllPricing()` returns a deep copy (mutations don't affect internal state)

### CLI

- `llm-cost-meter report` command with:
  - `--group-by` (feature, userId, model, env, provider, sessionId)
  - `--feature`, `--env`, `--user` filters
  - `--from`, `--to` date range filtering
  - `--format` (table, csv, json)
  - `--top N` to limit results
  - `--file` for custom events file path
- Streaming file reader (handles large NDJSON files without loading into memory)
- Malformed line warnings logged to stderr
- Auto-generated insights (e.g., "chat drives 53% of cost but only 24% of calls")

### Dashboard

- Web dashboard at `http://localhost:3000` (no external services needed)
- Date range picker with from/to date inputs
- Feature, User, Model, Environment dropdown filters
- Active filter pills with one-click removal
- Click-to-drill-down on chart segments and table rows
- Export CSV / Export JSON buttons on every table
- 5 KPI cards: Total Spend, Avg Cost/Call, Total Tokens, Most Expensive Feature, Costliest User
- Charts: Daily Cost Trend, Cost by Feature (doughnut), Calls by Model (bar), Cost by User (bar), Cost vs Calls (bubble)
- Scrollable event log with full model names
- Runs on plain Node.js — no ts-node required

### Package Quality

- TypeScript-first with full type declarations
- 110 tests (unit, integration, CLI subprocess, error handling)
- Real API smoke test (`npm run test:smoke`) for Anthropic and OpenAI
- 24 KB published tarball (no source maps, no tests, no dev files)
- `"type": "commonjs"` + `"exports"` field for modern bundler support
- MIT license

package/LICENSE
ADDED
@@ -0,0 +1,21 @@
MIT License

Copyright (c) 2026 shmulikdav

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

package/README.md
ADDED
@@ -0,0 +1,469 @@
# llm-cost-meter

[](https://www.npmjs.com/package/llm-cost-meter)
[](https://github.com/shmulikdav/llmeter/blob/main/LICENSE)
[](https://nodejs.org)

**Per-feature, per-user LLM cost attribution for production AI apps.**

Every team running AI in production knows their monthly bill. No team knows which feature is responsible for which cost. `llm-cost-meter` wraps your LLM API calls, calculates actual cost from token usage, and tags every call by feature, user, and environment — so you can finally see where the money goes.

## Installation

```bash
npm install llm-cost-meter
```

Requires Node.js >= 18.

## Quick Start (60 seconds)

### 1. Wrap your LLM call

```typescript
import { meter, configure } from 'llm-cost-meter';
import Anthropic from '@anthropic-ai/sdk';

configure({ adapters: ['console'] });

const client = new Anthropic();

const response = await meter(
  () => client.messages.create({
    model: 'claude-sonnet-4-20250514',
    max_tokens: 1024,
    messages: [{ role: 'user', content: 'Summarize this article...' }]
  }),
  {
    feature: 'article-summarizer',
    userId: 'user_abc123',
  }
);

// Your response is unchanged
console.log(response.content);
// Console output: [llm-cost-meter] article-summarizer — $0.00396 (600 tokens, 1240ms)
```
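
The dollar figure in that output is plain token arithmetic against the pricing table further down in this README. A minimal sketch, assuming an illustrative split of the 600 tokens into 420 input and 180 output (claude-sonnet-4 is priced at $3 input / $15 output per million tokens):

```typescript
// Cost arithmetic behind the console line above. The 420/180 input/output
// split is illustrative; the per-million-token rates come from the
// Pricing Tables section of this README.
const inputTokens = 420;
const outputTokens = 180;
const costUSD = (inputTokens * 3 + outputTokens * 15) / 1_000_000;
console.log(costUSD); // 0.00396
```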

### 2. Enable local storage

```typescript
configure({
  adapters: ['console', 'local'],
  localPath: './.llm-costs/events.ndjson',
});
```

### 3. View your report

```bash
npx llm-cost-meter report
```

```
llm-cost-meter report — 2025-04-01 to 2025-04-05
Source: ./.llm-costs/events.ndjson (1,284 events)

By feature:
┌─────────────────────┬────────┬──────────────┬────────────────┬────────────────┐
│ Feature             │ Calls  │ Total Tokens │ Avg Cost/Call  │ Total Cost     │
├─────────────────────┼────────┼──────────────┼────────────────┼────────────────┤
│ article-summarizer  │ 843    │ 2,104,200    │ $0.0039        │ $3.29          │
│ chat                │ 312    │ 987,400      │ $0.0121        │ $3.78          │
│ tag-classifier      │ 129    │ 198,300      │ $0.0002        │ $0.02          │
├─────────────────────┼────────┼──────────────┼────────────────┼────────────────┤
│ TOTAL               │ 1,284  │ 3,289,900    │ —              │ $7.09          │
└─────────────────────┴────────┴──────────────┴────────────────┴────────────────┘

Insight: 'chat' drives 53% of cost but only 24% of calls.
```
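
The insight line is simple share arithmetic over the table's totals, which you can verify from the numbers above:

```typescript
// Reproduce the insight line from the report totals above.
const chatCost = 3.78, totalCost = 7.09;
const chatCalls = 312, totalCalls = 1284;
const costShare = Math.round((chatCost / totalCost) * 100);
const callShare = Math.round((chatCalls / totalCalls) * 100);
console.log(`'chat' drives ${costShare}% of cost but only ${callShare}% of calls.`);
```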

## Tagging Guide

Every `meter()` call accepts these tags:

| Tag | Purpose | Example |
|-----|---------|---------|
| `feature` | Which product feature made the call | `'article-summarizer'` |
| `userId` | Which user triggered it | `'user_abc123'` |
| `sessionId` | Group calls within a session | `'sess_xyz'` |
| `env` | Environment | `'production'` |
| `tags` | Arbitrary key-value pairs | `{ team: 'product', tier: 'pro' }` |

```typescript
await meter(llmCall, {
  feature: 'chat',
  userId: req.user.id,
  sessionId: req.sessionId,
  env: 'production',
  tags: { team: 'product', tier: 'pro' },
});
```

## Global Configuration

Call `configure()` once at app startup:

```typescript
import { configure } from 'llm-cost-meter';

configure({
  adapters: ['console', 'local'],          // Output destinations
  localPath: './.llm-costs/events.ndjson', // File path for local adapter
  defaultTags: {                           // Applied to every event
    env: process.env.NODE_ENV ?? 'development',
    service: 'my-app',
  },
  verbose: false,            // true = log adapter errors to console
  warnOnMissingModel: true,  // true = warn when model not in pricing table
  onError: (err, event) => { // Called when adapter writes fail
    myLogger.warn('Cost tracking error', { err, feature: event?.feature });
  },
});
```

`configure()` merges with the current config. To start fresh, call `resetConfig()` first:

```typescript
import { resetConfig, configure } from 'llm-cost-meter';

resetConfig(); // Back to defaults
configure({ adapters: ['local'] });
```

## Error Handling

By default, adapter errors are silent (your LLM calls are never affected). To catch them:

```typescript
// Option 1: onError callback (recommended for production)
configure({
  onError: (err, event) => {
    console.error('Failed to write cost event:', err.message);
    // Send to your error tracker, Sentry, DataDog, etc.
  },
});

// Option 2: verbose mode (logs to console.error)
configure({ verbose: true });

// Option 3: await writes to guarantee persistence
const response = await meter(llmCall, {
  feature: 'billing-critical',
  awaitWrites: true, // Will throw if adapter fails
});
```

### Monitoring meter health

```typescript
import { getMeterStats } from 'llm-cost-meter';

const stats = getMeterStats();
console.log(stats);
// {
//   eventsTracked: 1284,
//   eventsDropped: 0,
//   adapterErrors: 0,
//   unknownModels: ['openai/ft:gpt-4o-mini:my-org']
// }
```

## Custom Pricing

Add pricing for fine-tuned models, new models, or entirely new providers:

```typescript
import { configurePricing, setPricingTable } from 'llm-cost-meter';

// Add a single model
configurePricing('openai', 'ft:gpt-4o-mini:my-org', { input: 0.30, output: 1.20 });

// Add a new provider
configurePricing('mistral', 'mistral-large', { input: 2.00, output: 6.00 });

// Override existing pricing
configurePricing('anthropic', 'claude-sonnet-4-20250514', { input: 3.50, output: 16.00 });

// Set an entire provider at once
setPricingTable('deepseek', {
  'deepseek-chat': { input: 0.14, output: 0.28, unit: 'per_million_tokens' },
  'deepseek-coder': { input: 0.14, output: 0.28, unit: 'per_million_tokens' },
});
```

When a model isn't found in the pricing table, cost is reported as $0.00 and a warning is logged (disable with `warnOnMissingModel: false`).

## Guaranteed Write Mode

By default, `meter()` is fire-and-forget — adapter writes happen in the background so your LLM response is returned immediately. For billing-critical paths:

```typescript
// Wait for adapters to finish writing before continuing
const response = await meter(llmCall, {
  feature: 'billing',
  awaitWrites: true,
});

// Flush all pending writes before process exit
import { flush } from 'llm-cost-meter';

process.on('SIGTERM', async () => {
  await flush();
  process.exit(0);
});
```

## Advanced: CostMeter Class

Use `CostMeter` when you need multiple independent meters (e.g., different adapters for different teams, separate configs for billing vs analytics):

```typescript
import { CostMeter } from 'llm-cost-meter';

const billingMeter = new CostMeter({
  provider: 'anthropic',
  adapters: ['local'],
  localPath: './billing/events.ndjson',
  onError: (err) => alertOps('billing tracking failed', err),
});

const analyticsMeter = new CostMeter({
  adapters: [new DataDogAdapter()],
  defaultTags: { team: 'analytics' },
});

// Each meter writes to its own destination
const response = await billingMeter.track(
  () => client.messages.create({ ... }),
  { feature: 'chat', userId: req.user.id }
);

// Manual event recording
billingMeter.record({
  model: 'claude-sonnet-4-20250514',
  provider: 'anthropic',
  inputTokens: 450,
  outputTokens: 210,
  feature: 'classifier',
  userId: 'user_123',
});

// Flush before shutdown
await billingMeter.flush();
```

**When to use `meter()` vs `CostMeter`:**
- `meter()` — simple apps, single config, most use cases
- `CostMeter` — multi-team apps, separate billing/analytics pipelines, multiple output destinations

## CLI Reference

```bash
# Default report grouped by feature
npx llm-cost-meter report

# Group by different dimensions
npx llm-cost-meter report --group-by userId
npx llm-cost-meter report --group-by model
npx llm-cost-meter report --group-by env

# Filter by tag
npx llm-cost-meter report --feature article-summarizer
npx llm-cost-meter report --env production
npx llm-cost-meter report --user user_abc123

# Date range
npx llm-cost-meter report --from 2025-04-01 --to 2025-04-05

# Export formats
npx llm-cost-meter report --format csv > costs.csv
npx llm-cost-meter report --format json > costs.json

# Show top N most expensive
npx llm-cost-meter report --top 5

# Custom events file path
npx llm-cost-meter report --file ./path/to/events.ndjson
```

## Adapter Reference

### Console Adapter

Prints cost per call to stdout. Ideal for development.

```typescript
configure({ adapters: ['console'] });
// Output: [llm-cost-meter] article-summarizer — $0.00396 (600 tokens, 1240ms)
```

### Local File Adapter

Appends events as NDJSON (newline-delimited JSON) to a file. Uses an async write queue to prevent file corruption from concurrent calls.

```typescript
configure({
  adapters: ['local'],
  localPath: './.llm-costs/events.ndjson',
});
```
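
Because events are stored as plain NDJSON, the file is easy to post-process yourself. A sketch that totals cost per feature, assuming the `feature` and `totalCostUSD` field names used by the `CostEvent` examples elsewhere in this README (treat the exact schema as an assumption):

```typescript
// Sketch: aggregate total cost per feature from NDJSON event lines.
// Field names (feature, totalCostUSD) follow the CostEvent fields used
// in this README; verify them against your own events file.
function costByFeature(ndjson: string): Record<string, number> {
  const totals: Record<string, number> = {};
  for (const line of ndjson.split('\n')) {
    if (!line.trim()) continue; // skip blank lines
    let event: { feature?: string; totalCostUSD?: number } | undefined;
    try {
      event = JSON.parse(line);
    } catch {
      continue; // skip malformed lines, as the CLI does
    }
    if (!event) continue;
    const feature = event.feature ?? 'untagged';
    totals[feature] = (totals[feature] ?? 0) + (event.totalCostUSD ?? 0);
  }
  return totals;
}

const sample = [
  '{"feature":"chat","totalCostUSD":0.012}',
  '{"feature":"chat","totalCostUSD":0.011}',
  '{"feature":"tag-classifier","totalCostUSD":0.0002}',
].join('\n');

const totals = costByFeature(sample); // totals.chat ≈ 0.023
```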

### Bring Your Own Adapter

Implement the `CostAdapter` interface:

```typescript
import { CostAdapter, CostEvent, configure } from 'llm-cost-meter';

class DataDogAdapter implements CostAdapter {
  name = 'datadog';

  async write(event: CostEvent): Promise<void> {
    await fetch('https://api.datadoghq.com/api/v1/series', {
      method: 'POST',
      body: JSON.stringify({
        series: [{
          metric: 'llm.cost',
          points: [[Date.now() / 1000, event.totalCostUSD]],
          tags: [`feature:${event.feature}`, `model:${event.model}`],
        }],
      }),
    });
  }

  // Optional: called by flush() before shutdown
  async flush(): Promise<void> {
    // Drain any internal buffers
  }
}

configure({
  adapters: [new DataDogAdapter(), 'local'],
});
```

## Integration Recipes

### Express.js Middleware

```typescript
import { meter } from 'llm-cost-meter';

function withCostTracking(feature: string) {
  return (req, res, next) => {
    req.llmMeter = (fn) =>
      meter(fn, {
        feature,
        userId: req.user?.id,
        sessionId: req.sessionId,
        env: process.env.NODE_ENV,
      });
    next();
  };
}

router.post('/summarize', withCostTracking('article-summarizer'), async (req, res) => {
  const result = await req.llmMeter(() =>
    client.messages.create({ ... })
  );
  res.json(result);
});
```

### Next.js API Route

```typescript
import { meter, configure } from 'llm-cost-meter';

configure({ adapters: ['local'], defaultTags: { env: 'production' } });

export default async function handler(req, res) {
  const response = await meter(
    () => openai.chat.completions.create({ ... }),
    { feature: 'chat', userId: req.body.userId }
  );
  res.json(response);
}
```

## Pricing Tables

Built-in pricing for current models (USD per million tokens):

### Anthropic

| Model | Input | Output |
|-------|-------|--------|
| claude-opus-4-20250514 | $15.00 | $75.00 |
| claude-sonnet-4-20250514 | $3.00 | $15.00 |
| claude-haiku-4-5-20251001 | $0.80 | $4.00 |
| claude-3-5-sonnet-20241022 | $3.00 | $15.00 |
| claude-3-5-haiku-20241022 | $0.80 | $4.00 |
| claude-3-opus-20240229 | $15.00 | $75.00 |
| claude-3-sonnet-20240229 | $3.00 | $15.00 |
| claude-3-haiku-20240307 | $0.25 | $1.25 |

### OpenAI

| Model | Input | Output |
|-------|-------|--------|
| gpt-4o | $2.50 | $10.00 |
| gpt-4o-mini | $0.15 | $0.60 |
| gpt-4-turbo | $10.00 | $30.00 |
| gpt-4 | $30.00 | $60.00 |
| gpt-3.5-turbo | $0.50 | $1.50 |
| o1 | $15.00 | $60.00 |
| o1-mini | $3.00 | $12.00 |
| o3 | $10.00 | $40.00 |
| o3-mini | $1.10 | $4.40 |

Dated model variants (e.g., `gpt-4o-2024-08-06`) are also included. Use `configurePricing()` to add models not in this table.

## Testing Your Code

Use `resetConfig()` and `resetStats()` in test setup to avoid state leaking between tests:

```typescript
import { configure, resetConfig, resetStats, meter, CostAdapter, CostEvent } from 'llm-cost-meter';

class TestAdapter implements CostAdapter {
  name = 'test';
  events: CostEvent[] = [];
  async write(event: CostEvent) { this.events.push(event); }
}

beforeEach(() => {
  resetConfig();
  resetStats();
  const adapter = new TestAdapter();
  configure({ adapters: [adapter], warnOnMissingModel: false });
});
```

## FAQ

**Does it add latency to my API calls?**
No. By default, adapter writes are fire-and-forget. Your LLM response is returned immediately. Use `awaitWrites: true` only when you need guaranteed persistence.

**Does it send my data anywhere?**
No. All data stays local by default. The `console` adapter prints to stdout and the `local` adapter writes to a file on disk. No network calls are made unless you add a custom adapter.

**What happens if the pricing table doesn't have my model?**
A warning is logged (unless `warnOnMissingModel: false`) and cost is reported as $0.00. Use `configurePricing()` to add the model. Check `getMeterStats().unknownModels` to see which models are missing.

**What if an adapter fails?**
By default, errors are silent — your app is never affected. Use the `onError` callback or `verbose: true` to catch errors. Use `getMeterStats().adapterErrors` to monitor.

**Can I use it with streaming responses?**
V1 does not support streaming token counting. Streaming support is planned for V2. For now, check the final response's usage field after streaming completes.
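
As a sketch of that workaround, you can compute the cost from the final usage yourself and record it manually (see `record()` in the CostMeter section). The pricing constants below come from the tables above; the token counts are illustrative:

```typescript
// Streaming workaround sketch: once the stream ends, the provider reports
// final usage; compute cost yourself and record the event manually.
// Rates: claude-sonnet-4 at $3 input / $15 output per million tokens.
const PRICE = { input: 3.0, output: 15.0 };

function streamCostUSD(inputTokens: number, outputTokens: number): number {
  return (inputTokens * PRICE.input + outputTokens * PRICE.output) / 1_000_000;
}

// e.g. usage reported at the end of a stream: 450 input, 210 output tokens
const cost = streamCostUSD(450, 210);
// then: billingMeter.record({ model: 'claude-sonnet-4-20250514', provider: 'anthropic',
//                             inputTokens: 450, outputTokens: 210, feature: 'chat' });
```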

**Does it work with fine-tuned models?**
Yes. Use `configurePricing()` to set pricing for your fine-tuned model IDs.

**Is it safe for high-traffic apps?**
The local file adapter uses an async write queue that serializes writes — no file corruption from concurrent calls. For high-throughput workloads, consider a custom adapter that batches writes.
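
A minimal sketch of such a batching adapter, with an in-memory array standing in for the real sink. The `write()`/`flush()` shape follows the `CostAdapter` interface from the Bring Your Own Adapter section; the event fields are illustrative:

```typescript
// Batching adapter sketch: buffer events and emit them in groups, so a
// high-traffic app does one sink write per batch instead of per call.
interface CostEventLike { feature?: string; totalCostUSD: number }

class BatchingAdapter {
  name = 'batching';
  batches: CostEventLike[][] = [];          // stand-in sink (swap for file/HTTP writes)
  private buffer: CostEventLike[] = [];

  constructor(private maxBatch = 3) {}

  async write(event: CostEventLike): Promise<void> {
    this.buffer.push(event);
    if (this.buffer.length >= this.maxBatch) await this.flush();
  }

  // Also called by the library's flush() before shutdown.
  async flush(): Promise<void> {
    if (this.buffer.length === 0) return;
    this.batches.push(this.buffer);         // one sink write per batch
    this.buffer = [];
  }
}

const adapter = new BatchingAdapter();
for (let i = 0; i < 7; i++) {
  void adapter.write({ feature: 'chat', totalCostUSD: 0.001 });
}
void adapter.flush(); // drain the final partial batch
```

Swap the `batches` array for a real destination (append to a file, POST to your metrics backend) and register the adapter via `configure({ adapters: [adapter] })`.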

## License

MIT