@nextn/outbound-guard 0.1.1 → 0.1.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -1,479 +1,136 @@
1
-
2
- # outbound-guard
3
-
4
- A small, opinionated **Node.js HTTP client** that protects your service from slow or failing upstreams by enforcing:
5
-
6
- - concurrency limits (in-flight cap)
7
- - bounded queue (backpressure)
8
- - request timeouts
9
-
10
-
11
- This is a **library**, not a service.
12
- It is **process-local**, **in-memory**, and intentionally simple.
13
-
14
- ---
15
-
16
- ## Why this exists
17
-
18
- Most production outages don’t start inside your service.
19
- They start when you call **something you don’t control**:
20
-
21
- - partner APIs
22
- - payment gateways
23
- - internal services under load
24
- - flaky dependencies
25
-
26
- Without protection, outbound calls cause:
27
- - unbounded concurrency
28
- - growing queues
29
- - long tail latency
30
- - cascading failures
31
-
32
- `outbound-guard` puts **hard limits** around outbound HTTP calls so your Node.js process stays alive and predictable under stress.
33
-
34
- ---
35
-
36
- ## What this library does (in practice)
37
-
38
- outbound-guard is built around a single idea:
39
- collapse duplicate work first, then apply limits.
40
-
41
- Most failures don’t come from too many different requests —
42
- they come from too many identical requests hitting a slow dependency.
43
-
44
- This library solves that problem before it escalates.
45
-
46
- For every outbound HTTP request, it enforces:
47
-
48
- ### 1. GET micro-cache + request coalescing (core feature)
49
-
50
- This is the most important feature in `outbound-guard`.
51
-
52
- When enabled, identical GET requests are **collapsed into a single upstream call**.
53
-
54
- - One request becomes the **leader**
55
- - All others become **followers**
56
- - Only **one upstream request** is ever in flight
57
- - Followers either:
58
- - receive cached data immediately, or
59
- - fail fast when limits are reached
60
-
61
- This prevents the most common real-world failure mode:
62
- **thundering herds on slow but healthy upstreams**.
63
-
64
- This is not long-lived caching.
65
- It is **short-lived, in-process burst protection**.
66
-
67
-
68
- ### 2. Concurrency limits
69
- - At most `maxInFlight` requests execute at once.
70
- - Prevents connection exhaustion and event-loop overload.
71
-
72
- ### 3. Bounded queue (backpressure)
73
- - Excess requests wait in a FIFO queue (up to `maxQueue`).
74
- - If the queue is full, requests are **rejected immediately**.
75
- - If a request waits too long, it is **rejected with a timeout**.
76
-
77
- Failing early is a feature.
78
-
79
- ### 4. Request timeouts
80
- - Every request has a hard timeout via `AbortController`.
81
- - No hanging promises.
82
-
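- As a rough sketch, this implies the standard timer-plus-abort pattern (an illustration only, not the library's exact internals):
-
- ```ts
- // Hypothetical helper: abort the attempt once the deadline passes.
- async function fetchWithTimeout(url: string, requestTimeoutMs: number): Promise<Response> {
-   const ac = new AbortController();
-   const timer = setTimeout(() => ac.abort(), requestTimeoutMs);
-   try {
-     return await fetch(url, { signal: ac.signal });
-   } finally {
-     clearTimeout(timer); // no hanging timers either
-   }
- }
- ```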
83
-
84
- ### 5. Observability hooks
85
- - Emits lifecycle events (queueing, failures, breaker transitions)
86
- - Exposes a lightweight `snapshot()` for debugging
87
-
88
- No metrics backend required.
89
-
90
-
91
- ---
92
- ## The failure mode this library is designed to stop
93
-
94
- ### Thundering herd on slow GET requests
95
-
96
- This is how many production incidents start:
97
-
98
- - Traffic spikes
99
- - Many requests trigger the **same GET**
100
- - The upstream slows down (not down — just slow)
101
- - Node starts N identical outbound requests
102
- - Queues grow, retries multiply, latency explodes
103
- - Eventually the process collapses
104
-
105
- Timeouts and retries don’t fix this.
106
- They **amplify it**.
107
-
108
- ### What outbound-guard does differently
109
-
110
- With `microCache` enabled:
111
-
112
- - Only **one upstream GET** is ever in flight per key
113
- - All concurrent identical requests share it
114
- - While refreshing:
115
- - previous data is served (within bounds)
116
- - or failures surface quickly
117
- - Retries happen **once**, not per caller
118
-
119
- This keeps:
120
- - upstream traffic flat
121
- - latency predictable
122
- - failures visible instead of hidden
123
-
124
- This is **request coalescing**, not caching.
125
-
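- A minimal sketch of that behavior (assuming a `client` configured as in Basic usage below; the URL is illustrative):
-
- ```ts
- // 100 concurrent identical GETs: one leader hits the upstream,
- // the other 99 become followers and share its result.
- const results = await Promise.all(
-   Array.from({ length: 100 }, () =>
-     client.request({ method: "GET", url: "https://slow.example.com/config" })
-   )
- );
- console.log(results.length, "responses from one upstream call");
- ```
-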
126
- ## Why the micro-cache is intentionally different
127
-
128
- Most HTTP clients do one of two things:
129
-
130
- 1. Cache aggressively and risk serving bad data
131
- 2. Don’t cache at all and collapse under burst load
132
-
133
- `outbound-guard` does neither.
134
-
135
- Its micro-cache is:
136
- - GET-only
137
- - short-lived
138
- - bounded by time and memory
139
- - aware of in-flight requests
140
-
141
- The goal is **operational stability**, not freshness guarantees.
142
-
143
- If the upstream is:
144
- - slow → callers don’t pile up
145
- - failing → failures surface quickly
146
- - recovered → traffic resumes cleanly
147
-
148
- This design keeps failure behavior honest.
149
-
150
-
160
- ## How to tune micro-cache safely
161
-
162
- Most users only need to understand **two knobs**.
163
-
164
- ---
165
-
166
- ### `maxWaiters`
167
-
168
- Controls how many concurrent callers are allowed to wait for the leader.
169
-
170
- ```ts
171
- maxWaiters: 10
172
- ```
173
-
174
- * Low value → aggressive load shedding
175
- * High value → tolerate more fan-in
176
-
177
- If this fills quickly, it means:
178
-
179
- > “This upstream is too slow for current traffic.”
180
-
181
- That’s a signal, not a bug.
182
-
183
- ---
184
-
185
- ### `followerTimeoutMs`
186
-
187
- Controls how long followers are willing to wait for the leader's result.
188
-
189
- ```ts
190
- followerTimeoutMs: 5000
191
- ```
192
-
193
- * Followers wait at most once
194
- * No per-request retries
195
- * No silent backlog growth
196
-
197
- If this expires:
198
-
199
- * followers fail fast
200
- * queues drain
201
- * the system stays responsive
202
-
203
- This prevents **“slow death by waiting”**.
204
-
205
- ---
206
-
207
- ## Retry settings (leader-only)
208
-
209
- Retries apply **only to the leader**.
210
-
211
- ```ts
212
- retry: {
213
-   maxAttempts: 3,
214
-   baseDelayMs: 50,
215
-   maxDelayMs: 200,
216
-   retryOnStatus: [503],
217
- }
218
- ```
219
-
220
- ### What this means
221
-
222
- * **`maxAttempts`**
223
- total leader tries (including first)
224
-
225
- * **`baseDelayMs`**
226
- initial backoff
227
-
228
- * **`maxDelayMs`**
229
- cap on exponential backoff
230
-
231
- * **`retryOnStatus`**
232
- retry only when the upstream explicitly signals trouble
233
-
234
- Followers never retry.
235
- Retries never multiply under load.
236
-
239
-
240
-
241
- ### How outbound-guard helps
242
-
243
- When `microCache` is enabled:
244
-
245
- - **Only one real GET request** is sent upstream.
246
- - All concurrent identical GETs **share the same in-flight request**.
247
- - If the upstream takes 5 seconds, deduplication lasts for the full 5 seconds.
248
- - After success, the response is cached briefly (default: 1 second).
249
- - Requests during that window are served immediately — no new upstream calls.
250
-
251
- This dramatically reduces:
252
- - outbound request count
253
- - upstream pressure
254
- - cost
255
- - tail latency
256
- - failure amplification
257
-
258
- This is **request coalescing**, not long-lived caching.
259
-
260
- It is intentionally:
261
- - GET-only
262
- - short-lived
263
- - in-memory
264
- - process-local
265
-
266
- The goal is load shedding and cost reduction by collapsing duplicate work under concurrent load — not durability guarantees.
267
-
268
-
269
- ---
270
-
271
- ## What this library does NOT do (by design)
272
-
273
- - ❌ No persistence
274
- - ❌ No Redis / Kafka
275
- - ❌ No retries by default
276
- - ❌ No distributed coordination
277
- - ❌ No service discovery
278
-
279
- This library provides **resilience**, not **durability**.
280
-
281
- If you need guaranteed delivery, pair it with:
282
- - a database outbox
283
- - a job queue
284
- - a message broker
285
-
286
- ---
287
-
288
- ## Installation
289
-
290
- ```bash
291
- npm install @nextn/outbound-guard
292
- ```
293
-
294
- (Node.js ≥ 20)
295
-
296
- ---
297
-
298
- ## Basic usage
299
-
300
- ```ts
301
- import { ResilientHttpClient } from "@nextn/outbound-guard";
302
-
303
- const client = new ResilientHttpClient({
304
-   maxInFlight: 20,
305
-   maxQueue: 100,
306
-   enqueueTimeoutMs: 200,
307
-   requestTimeoutMs: 5000,
308
-
309
-   microCache: {
310
-     enabled: true,
311
-
312
-     // short-lived cache window
313
-     ttlMs: 1000,
314
-     maxStaleMs: 800,
315
-
316
-     // protect against fan-in explosions
317
-     maxWaiters: 10,
318
-     followerTimeoutMs: 5000,
319
-
320
-     // leader-only retries
321
-     retry: {
322
-       maxAttempts: 3,
323
-       baseDelayMs: 50,
324
-       maxDelayMs: 200,
325
-       retryOnStatus: [503],
326
-     },
327
-   },
328
- });
329
-
330
-
331
- // Use it instead of fetch/axios directly
332
- const res = await client.request({
333
-   method: "GET",
334
-   url: "https://third-party.example.com/config",
335
- });
336
-
337
- console.log(res.status, res.body);
338
- ```
339
-
340
- That’s it.
341
- Everything else happens automatically.
342
-
343
- ---
344
-
345
- ## Error handling
346
-
347
- Errors are **explicit and typed**:
348
-
349
- * `QueueFullError`
350
- * `QueueTimeoutError`
351
- * `RequestTimeoutError`
- * `CircuitOpenError`
352
-
353
- Example:
354
-
355
- ```ts
356
- try {
357
-   await client.request({ method: "GET", url });
358
- } catch (err) {
359
-   if (err instanceof CircuitOpenError) {
360
-     // upstream is unhealthy → fail fast
361
-   }
362
- }
363
- ```
364
-
365
- ---
366
-
367
- ## Observability
368
-
369
- ### Events
370
-
371
- ```ts
372
- client.on("request:failure", (e) => {
374
-   console.error("request failed", e.error);
375
- });
376
- ```
377
-
378
- ### Snapshot
379
-
380
- ```ts
381
- const snap = client.snapshot();
382
-
383
- console.log(snap.inFlight);
384
- console.log(snap.queueDepth);
385
- console.log(snap.breakers);
386
- ```
387
-
388
- Useful for logs, debugging, or ad-hoc metrics.
389
-
390
- ---
391
-
392
- ## Demo (local, no deployment)
393
-
394
- This repo includes a demo that visibly shows request coalescing, backpressure, and recovery under load.
395
-
396
- ### Terminal A — flaky upstream
397
-
398
- ```bash
399
- npm run demo:upstream
400
- ```
401
-
402
- ### Terminal B — load generator
403
-
404
- ```bash
405
- npm run demo:loadgen
406
- ```
407
-
408
- You will see patterns like:
409
-
- ```
410
- === burst: cold-start ===
411
- ok-1 ok-1 ok-1 ...
412
-
413
- === burst: cached ===
414
- ok-1 ok-1 ok-1 ...
415
-
416
- === burst: refresh-with-stale ===
417
- ok-1 ok-1 ok-2
418
-
419
- === burst: failure ===
420
- ok-2 ok-2 ok-2
421
-
422
- === burst: recovered ===
423
- ok-3 ok-3 ok-4
424
- ```
425
-
426
- This shows:
427
-
428
- - only one upstream hit per burst
429
- - cached responses during spikes
430
- - safe reuse during refresh
431
- - fast recovery without restart
435
-
436
- ---
437
-
438
- ## When should you use this?
439
-
440
- Good fit if you:
441
-
442
- * call external APIs from Node.js
443
- * run BFFs or API gateways
444
- * send webhooks
445
- * run background workers
446
- * want predictable failure under load
447
-
448
- ---
449
-
450
- ## When should you NOT use this?
451
-
452
- Not a good fit if you need:
453
-
454
- * durable delivery across restarts
455
- * distributed rate limiting
456
- * cross-process coordination
457
- * heavy retry orchestration
458
-
459
- This library is **not** a service mesh.
460
-
461
- ---
462
-
463
- ## Design philosophy
464
-
465
- * Explicit > clever
466
- * Fail fast > degrade silently
467
- * Small surface area > feature creep
468
- * In-process resilience first
469
-
470
- See `docs/DESIGN.md` for details.
471
-
472
- ---
473
-
474
- ## License
475
-
476
- MIT
477
-
478
-
1
+ # outbound-guard
2
+
3
+ Process-local Node.js HTTP client that collapses duplicate GETs and wraps every outbound call with per-host limits, bounded queueing, timeouts, and a small health gate. Its goal is simple: stop thundering herds and keep your service predictable when upstreams are slow or flaky.
4
+
5
+ ## Highlights
6
+ - Request coalescing + short-lived GET micro-cache (leader/followers, stale-while-refresh)
7
+ - Per-base-URL limiter with bounded FIFO queue (maxQueue = maxInFlight * 10)
8
+ - Lightweight per-host health gate (OPEN → CLOSED → HALF_OPEN probe; gate-style naming, so OPEN means healthy) that rejects queued waiters when it closes
9
+ - Hard per-attempt timeouts and leader-only retries (no retry amplification)
10
+ - One runtime dependency (`undici`); entirely in-memory and process-local
11
+
12
+ ## Install
13
+
14
+ ```bash
15
+ npm install @nextn/outbound-guard
16
+ # Node 18+ (tested on 20)
17
+ ```
18
+
19
+ ## Quick start
20
+
21
+ ```ts
22
+ import { ResilientHttpClient } from "@nextn/outbound-guard";
23
+
24
+ const client = new ResilientHttpClient({
25
+   // applied per base URL (protocol + host + port)
26
+   maxInFlight: 20,
27
+   requestTimeoutMs: 5_000,
28
+
29
+   microCache: {
30
+     enabled: true,
31
+     ttlMs: 1_000, // fresh window
32
+     maxStaleMs: 10_000, // serve stale while refreshing
33
+     maxEntries: 500,
34
+     maxWaiters: 1_000, // concurrent followers per key
35
+     followerTimeoutMs: 5_000,
36
+     retry: {
37
+       maxAttempts: 3,
38
+       baseDelayMs: 50,
39
+       maxDelayMs: 200,
40
+       retryOnStatus: [429, 502, 503, 504], // leader-only
41
+     },
42
+   },
43
+ });
44
+
45
+ const res = await client.request({
46
+   method: "GET",
47
+   url: "https://third-party.example.com/config",
48
+ });
49
+
50
+ console.log(res.status, Buffer.from(res.body).toString("utf8"));
51
+ ```
52
+
53
+ `res.body` is a `Uint8Array`; convert with `Buffer.from(res.body)` or `TextDecoder` as needed.
54
+
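+ For example, a minimal decoding sketch (assumes the upstream returns UTF-8 JSON; `client` is the instance from Quick start):
+
+ ```ts
+ const cfg = await client.request({ method: "GET", url: "https://third-party.example.com/config" });
+ const parsed = JSON.parse(new TextDecoder().decode(cfg.body));
+ ```
+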
55
+ ## How it works
56
+
57
+ ### Micro-cache and request coalescing (GET only)
58
+ - Keyed by `GET ${normalizedUrl}` by default (hostname lowercased, default ports stripped); override via `microCache.keyFn` (see the sketch after this list).
59
+ - One caller becomes **leader**; identical concurrent GETs become **followers** and wait for the leader result.
60
+ - Fresh window: successful 2xx responses are cached for `ttlMs`.
61
+ - Stale-while-refresh: after `ttlMs`, followers can be served from the previous value while a new leader refreshes, up to `maxStaleMs`.
62
+ - Follower guardrails: reject immediately when `maxWaiters` is exceeded or when waiting longer than `followerTimeoutMs`.
63
+ - Failure handling: if a refresh fails but stale data is within `maxStaleMs`, stale is served; otherwise the error is surfaced.
64
+ - Leader-only retry: optional exponential backoff for retryable statuses so retries do not multiply under fan-in.
65
+
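+ A minimal sketch of the key override (the exact `keyFn` signature is an assumption; check the package typings):
+
+ ```ts
+ import { ResilientHttpClient } from "@nextn/outbound-guard";
+
+ const coalescing = new ResilientHttpClient({
+   maxInFlight: 20,
+   requestTimeoutMs: 5_000,
+   microCache: {
+     enabled: true,
+     ttlMs: 1_000,
+     maxStaleMs: 10_000,
+     // Hypothetical keyFn: drop a cache-busting query param so bursts still coalesce.
+     keyFn: (req: { method: string; url: string }) => {
+       const u = new URL(req.url);
+       u.searchParams.delete("_t");
+       return `${req.method} ${u}`;
+     },
+   },
+ });
+ ```
+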
66
+ ### Concurrency and queueing
67
+ - Limits are per base URL (`protocol://host:port`).
68
+ - At most `maxInFlight` requests run at once; overflow enters a bounded FIFO queue sized at `maxInFlight * 10` (internal for now).
69
+ - If the queue is full, a `QueueFullError` is thrown immediately. Queued waiters are rejected if the upstream is marked unhealthy.
70
+
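+ A small sketch of the shedding behavior (`QueueFullError` is imported from the package root; POST is used because it is never coalesced):
+
+ ```ts
+ import { QueueFullError } from "@nextn/outbound-guard";
+
+ // Fire far more requests than maxInFlight plus queue capacity.
+ const settled = await Promise.allSettled(
+   Array.from({ length: 500 }, () =>
+     client.request({ method: "POST", url: "https://third-party.example.com/jobs", body: "{}" })
+   )
+ );
+ const shed = settled.filter(
+   (r) => r.status === "rejected" && r.reason instanceof QueueFullError
+ ).length;
+ console.log(`${shed} requests shed at the queue boundary`);
+ ```
+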
71
+ ### Health gate (tiny circuit breaker)
72
+ - Tracks outcomes per base URL. Hard failures (request timeouts or unknown errors) and soft failures (429, 502, 503, 504) feed the window.
73
+ - Closes immediately after 3 consecutive hard failures, or when (with ≥10 samples) hard-fail rate ≥30% or total fail rate ≥50%.
74
+ - CLOSED: new requests fail fast with `UpstreamUnhealthyError` (micro-cache can still serve stale if available).
75
+ - Cooldown uses exponential backoff: starts at ~1s with jitter, doubles up to 30s. When cooldown elapses, the circuit moves to HALF_OPEN.
76
+ - HALF_OPEN: exactly one probe is allowed; other calls get `HalfOpenRejectedError`. A successful probe reopens; a failing probe recloses.
77
+ - Per-host isolation: a bad upstream does not poison other hosts.
78
+
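+ A small sketch of watching gate transitions (event names from the Events section below; payload shapes are treated as assumptions, so none are read here):
+
+ ```ts
+ client.on("health:closed", () => console.warn("gate closed: failing fast"));
+ client.on("health:half_open", () => console.info("gate half-open: probing"));
+ client.on("health:open", () => console.info("gate open: traffic flowing"));
+ ```
+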
79
+ ### Timeouts and retries
80
+ - `requestTimeoutMs` is enforced per attempt with `AbortController`; hanging upstreams become `RequestTimeoutError`.
81
+ - Retries are opt-in and apply only to GET leaders via `microCache.retry`. Followers never retry, so retries cannot explode under load.
82
+
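+ A back-of-the-envelope sketch of worst-case leader latency under the Quick start settings (assumes doubling backoff capped at `maxDelayMs`; any jitter is internal):
+
+ ```ts
+ // 3 attempts at up to 5s each, plus waits of min(50 * 2**i, 200) ms between attempts.
+ const attempts = 3;
+ const perAttemptMs = 5_000;
+ const waits = [50, 100]; // i = 0, 1
+ const worstCaseMs = attempts * perAttemptMs + waits.reduce((a, b) => a + b, 0);
+ console.log(worstCaseMs); // 15150 ms worst case; budget followerTimeoutMs accordingly
+ ```
+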
83
+ ## API surface
84
+
85
+ ```ts
86
+ await client.request({
87
+ method: "GET" | "POST" | "PUT" | "PATCH" | "DELETE" | "HEAD" | "OPTIONS",
88
+ url: "https://example.com/resource",
89
+ headers?: Record<string, string>,
90
+ body?: string | Uint8Array | Buffer,
91
+ });
92
+
93
+ client.snapshot(); // { inFlight, queueDepth }
94
+ client.on(eventName, handler); // see below for event names
95
+ ```
96
+
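+ A quick saturation-logging sketch using the snapshot fields shown above:
+
+ ```ts
+ setInterval(() => {
+   const { inFlight, queueDepth } = client.snapshot();
+   if (queueDepth > 0) console.warn(`backpressure: inFlight=${inFlight} queueDepth=${queueDepth}`);
+ }, 1_000).unref(); // unref so the sampler never keeps the process alive
+ ```
+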
97
+ ### Errors
98
+ All exported errors extend `ResilientHttpError`:
99
+ - `QueueFullError`: queue capacity hit for that base URL.
100
+ - `RequestTimeoutError`: per-attempt timeout exceeded.
101
+ - `UpstreamUnhealthyError`: circuit is CLOSED for the base URL.
102
+ - `HalfOpenRejectedError`: circuit is HALF_OPEN and the call was not the probe.
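+
+ A minimal handling sketch (error classes imported from the package root):
+
+ ```ts
+ import {
+   QueueFullError,
+   RequestTimeoutError,
+   UpstreamUnhealthyError,
+   HalfOpenRejectedError,
+ } from "@nextn/outbound-guard";
+
+ try {
+   await client.request({ method: "GET", url: "https://third-party.example.com/config" });
+ } catch (err) {
+   if (err instanceof QueueFullError) {
+     // shed load: surface a 503 to your own caller
+   } else if (err instanceof RequestTimeoutError) {
+     // the attempt exceeded requestTimeoutMs
+   } else if (err instanceof UpstreamUnhealthyError || err instanceof HalfOpenRejectedError) {
+     // gate is closed or probing: fail fast or use a fallback
+   } else {
+     throw err;
+   }
+ }
+ ```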
103
+
104
+ ### Events
105
+ The client is an `EventEmitter`. Useful hooks:
106
+ - `request:start | request:success | request:failure | request:rejected`
107
+ - `health:closed | health:half_open | health:open`
108
+ - `microcache:retry | microcache:refresh_failed`
109
+
110
+ Event payloads include the request, requestId, status/duration when available, and error objects on failures.
111
+
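+ A small wiring sketch (the payload fields follow the description above and are assumptions, hence the loose typing):
+
+ ```ts
+ client.on("request:success", (e: any) => {
+   console.log("outbound ok", e.requestId, e.status);
+ });
+ client.on("request:failure", (e: any) => {
+   console.error("outbound failed", e.requestId, e.error);
+ });
+ ```
+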
112
+ ## Demo (local)
113
+
114
+ Visualize coalescing and backpressure without deploying anything:
115
+
116
+ ```bash
117
+ npm run demo:upstream # terminal A: flaky upstream
118
+ npm run demo:loadgen # terminal B: bursts against the client
119
+ ```
120
+
121
+ Watch how bursts collapse to a single upstream hit, stale responses are served during refresh, and failures recover cleanly.
122
+
123
+ ## When to use
124
+ - Calling external APIs or partner services from Node.js
125
+ - BFFs/API gateways that must isolate upstream slowness
126
+ - Webhook senders or background workers that need predictable failure behavior
127
+
128
+ ## When not to use
129
+ - If you need durable delivery across restarts (use queues/outbox)
130
+ - If you need cross-process coordination or distributed rate limiting
131
+ - If you need a service mesh or long-lived caching
132
+
133
+ ## Design stance
134
+ - Favor explicit limits over hidden buffers
135
+ - Fail fast instead of building invisible backlogs
136
+ - Keep the surface small; stay in-process and dependency-light