@dmop/puru 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/LICENSE ADDED
MIT License

Copyright (c) 2026 Danilo Pedrosa

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
package/README.md ADDED
# puru (プール)

Goroutine-style concurrency for JavaScript with an **M:N scheduler** — multiplexes thousands of tasks onto a small pool of OS threads, just like Go.

Works on **Node.js** and **Bun**. No separate worker files, no manual message passing.

*puru (プール) means "pool" in Japanese.*

## Install

```bash
npm install puru
```

## Quick Start

```typescript
import { spawn, chan, WaitGroup, select, after } from 'puru'

// CPU work — runs in a dedicated worker thread
const { result } = spawn(() => fibonacci(40))
console.log(await result)

// I/O work — many tasks share worker threads (M:N scheduling)
const wg = new WaitGroup()
for (const url of urls) {
  wg.spawn(() => fetch(url).then(r => r.json()), { concurrent: true })
}
const results = await wg.wait()
```

## How It Works

puru uses an **M:N scheduler** — M tasks are multiplexed onto N OS threads:

```text
         puru scheduler
┌──────────────────────────────┐
│                              │
│  Task 1 ─┐                   │
│  Task 2 ─┤──► Thread 1       │
│  Task 3 ─┘   (shared)        │
│                              │
│  Task 4 ────► Thread 2       │  N threads
│              (exclusive)     │  (os.availableParallelism)
│                              │
│  Task 5 ─┐                   │
│  Task 6 ─┤──► Thread 3       │
│  Task 7 ─┘   (shared)        │
│                              │
└──────────────────────────────┘
```

**Two modes:**

| Mode | Flag | Best for | How it works |
| --- | --- | --- | --- |
| **Exclusive** (default) | `spawn(fn)` | CPU-bound work | 1 task per thread, full core usage |
| **Concurrent** | `spawn(fn, { concurrent: true })` | I/O-bound / async work | Many tasks share a thread's event loop |

This is the same model Go uses: goroutines (M) are scheduled onto OS threads (N). CPU-bound work gets a dedicated thread. I/O-bound work shares threads efficiently.
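
The scheduling idea itself does not depend on puru. As a rough main-thread sketch in plain TypeScript (toy code, not puru's implementation; real puru workers are OS threads), M queued tasks drained by N concurrent workers looks like this:

```typescript
// Toy M:N multiplexer: M queued tasks drained by N concurrent "workers".
// Illustrates only the scheduling idea; puru's workers are real OS
// threads, which this main-thread sketch does not create.
async function runMN<T>(tasks: Array<() => Promise<T>>, n: number): Promise<T[]> {
  const results: T[] = new Array(tasks.length)
  let next = 0
  // Each worker repeatedly pulls the next queued task until none remain.
  const worker = async (): Promise<void> => {
    while (next < tasks.length) {
      const i = next++
      results[i] = await tasks[i]()
    }
  }
  await Promise.all(Array.from({ length: Math.max(1, n) }, worker))
  return results
}
```

With `n = 1` this degrades to sequential execution; with `n >= tasks.length` it behaves roughly like `Promise.all`.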

## Why puru

Same task, four ways — process 4 items in parallel:

**worker_threads** — 2 files, 15 lines, manual everything:

```typescript
// worker.js (separate file required)
const { parentPort } = require('worker_threads')
parentPort.on('message', (data) => {
  parentPort.postMessage(heavyWork(data))
})

// main.js
import { Worker } from 'worker_threads'
const results = await Promise.all(items.map(item =>
  new Promise((resolve, reject) => {
    const w = new Worker('./worker.js')
    w.postMessage(item)
    w.on('message', resolve)
    w.on('error', reject)
  })
))
```

**Tinypool** — still needs a separate file:

```typescript
// worker.js (separate file required)
export default function(data) { return heavyWork(data) }

// main.js
import Tinypool from 'tinypool'
const pool = new Tinypool({ filename: './worker.js' })
const results = await Promise.all(items.map(item => pool.run(item)))
```

**Piscina** — same pattern, separate file:

```typescript
// worker.js (separate file required)
module.exports = function(data) { return heavyWork(data) }

// main.js
import Piscina from 'piscina'
const pool = new Piscina({ filename: './worker.js' })
const results = await Promise.all(items.map(item => pool.run(item)))
```

**puru** — one file, 4 lines:

```typescript
import { WaitGroup } from 'puru'
const wg = new WaitGroup()
for (const item of items) wg.spawn(() => heavyWork(item))
const results = await wg.wait()
```

| Feature | worker_threads | Tinypool | Piscina | **puru** |
| --- | --- | --- | --- | --- |
| Separate worker file | Required | Required | Required | **Not needed** |
| Inline functions | No | No | No | **Yes** |
| M:N scheduler | No | No | No | **Yes** |
| Concurrent mode (I/O) | No | No | No | **Yes** |
| Channels (cross-thread) | No | No | No | **Yes** |
| Cancellation | No | No | No | **Yes** |
| WaitGroup / ErrGroup | No | No | No | **Yes** |
| select (with default) | No | No | No | **Yes** |
| Mutex / Once | No | No | No | **Yes** |
| Ticker | No | No | No | **Yes** |
| Backpressure | No | No | No | **Yes** |
| Priority scheduling | No | No | Yes | **Yes** |
| Pool management | Manual | Automatic | Automatic | **Automatic** |
| Bun support | No | No | No | **Yes** |

## API

### `spawn(fn, opts?)`

Run a function in a worker thread. Returns `{ result: Promise<T>, cancel: () => void }`.

```typescript
// CPU-bound — exclusive mode (default)
const { result } = spawn(() => fibonacci(40))

// I/O-bound — concurrent mode (many tasks per thread)
const { result } = spawn(() => fetch(url), { concurrent: true })

// With priority
const { result } = spawn(() => criticalWork(), { priority: 'high' })

// Cancel
const { result, cancel } = spawn(() => longTask())
setTimeout(cancel, 5000)
```

**Exclusive mode** (default): the function gets a dedicated thread. Use for CPU-heavy work.

**Concurrent mode** (`{ concurrent: true }`): multiple tasks share a thread's event loop. Use for async/I/O work where you want to run thousands of tasks without thousands of threads.

Functions must be self-contained — they cannot capture variables from the enclosing scope:

```typescript
const x = 42
spawn(() => x + 1) // ReferenceError: x is not defined
spawn(() => 42 + 1) // works
```
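
Why captures fail: a library that ships inline functions to worker threads has to serialize them, and serialized function source carries no closure. The snippet below demonstrates the effect with a hypothetical `rebuild` helper (this is an assumption about the mechanism; the README documents only the restriction itself):

```typescript
// Rebuild a function from its source text, as a worker would have to.
// The rebuilt copy evaluates in a fresh scope, so the original closure
// is gone. `rebuild` is a hypothetical illustration, not a puru API.
function rebuild<T extends (...args: any[]) => any>(fn: T): T {
  // Indirect eval runs in the global scope, not the defining scope.
  return (0, eval)(`(${fn.toString()})`) as T
}
```

Calling the rebuilt copy of a closure throws a `ReferenceError` for every captured variable, which is exactly the error shown above.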

### `chan(capacity?)`

Create a channel for communicating between async tasks — including across worker threads.

```typescript
const ch = chan<number>(10) // buffered, capacity 10
const ch = chan<string>() // unbuffered, capacity 0

await ch.send(42)
const value = await ch.recv() // 42

ch.close()
await ch.recv() // null (closed)

// Async iteration
for await (const value of ch) {
  process(value)
}
```
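
For intuition about the `recv()`/`close()` contract (buffered values drain first, then `null`), here is a toy main-thread channel in plain TypeScript. It is deliberately simplified: unbounded buffer, no send backpressure, no cross-thread transport, and not puru's implementation:

```typescript
// Toy async channel illustrating the recv semantics: recv() resolves with
// a queued value, or null once the channel is closed and drained.
class ToyChan<T> {
  private buf: T[] = []
  private waiters: Array<(v: T | null) => void> = []
  private closed = false

  async send(v: T): Promise<void> {
    const w = this.waiters.shift()
    if (w) w(v) // hand directly to a waiting receiver
    else this.buf.push(v)
  }

  async recv(): Promise<T | null> {
    if (this.buf.length > 0) return this.buf.shift()!
    if (this.closed) return null
    return new Promise(res => this.waiters.push(res))
  }

  close(): void {
    this.closed = true
    for (const w of this.waiters.splice(0)) w(null) // wake pending receivers
  }
}
```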

**Channels in workers** — pass channels to `spawn()` and use them from worker threads, just like Go:

```typescript
const ch = chan<number>(10)

// Producer worker
spawn(async ({ ch }) => {
  for (let i = 0; i < 100; i++) await ch.send(i)
  ch.close()
}, { channels: { ch } })

// Consumer worker
spawn(async ({ ch }) => {
  for await (const item of ch) process(item)
}, { channels: { ch } })

// Fan-out: multiple workers pulling from the same channel
const input = chan<Job>(50)
const output = chan<Result>(50)

for (let i = 0; i < 4; i++) {
  spawn(async ({ input, output }) => {
    for await (const job of input) {
      await output.send(processJob(job))
    }
  }, { channels: { input, output } })
}
```

### `WaitGroup`

Structured concurrency. Spawn multiple tasks, wait for all.

```typescript
const wg = new WaitGroup()
wg.spawn(() => cpuWork()) // exclusive
wg.spawn(() => fetchData(), { concurrent: true }) // concurrent

const results = await wg.wait() // like Promise.all
const settled = await wg.waitSettled() // like Promise.allSettled

wg.cancel() // cancel all tasks
```

### `ErrGroup`

Like `WaitGroup`, but cancels all remaining tasks on the first error. This is the pattern Go codifies in `golang.org/x/sync/errgroup` for production code.

```typescript
const eg = new ErrGroup()
eg.spawn(() => fetchUser(id))
eg.spawn(() => fetchOrders(id))
eg.spawn(() => fetchAnalytics(id))

try {
  const [user, orders, analytics] = await eg.wait()
} catch (err) {
  // First error — all other tasks were cancelled
  console.error('Failed:', err)
}
```

### `Mutex`

Async mutual exclusion. Serialize access to shared resources under concurrency.

```typescript
const mu = new Mutex()

// withLock — recommended (auto-unlocks on error)
const result = await mu.withLock(async () => {
  return await db.query('UPDATE ...')
})

// Manual lock/unlock
await mu.lock()
try { /* critical section */ }
finally { mu.unlock() }
```
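
The `withLock` contract (strictly serialized critical sections, unlock on both success and error paths) can be sketched with a classic promise-chaining mutex. A minimal toy version, not puru's implementation:

```typescript
// Minimal async mutex: lock() chains onto the previous holder's promise,
// so critical sections run strictly one at a time. withLock() releases
// on both success and error paths.
class ToyMutex {
  private tail: Promise<void> = Promise.resolve()

  lock(): Promise<() => void> {
    let release!: () => void
    const next = new Promise<void>(res => { release = res })
    // Acquire resolves (to the release function) once the previous
    // holder has unlocked.
    const acquired = this.tail.then(() => release)
    this.tail = next
    return acquired
  }

  async withLock<T>(fn: () => Promise<T> | T): Promise<T> {
    const unlock = await this.lock()
    try { return await fn() } finally { unlock() }
  }
}
```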

### `Once<T>`

Run a function exactly once, even if called concurrently. All callers get the same result.

```typescript
const once = new Once<DBConnection>()
const conn = await once.do(() => createExpensiveConnection())
// Subsequent calls return the cached result
```

### `select(cases, opts?)`

Wait for the first of multiple promises to resolve, like Go's `select`.

```typescript
// Blocking — waits for first ready
await select([
  [ch.recv(), (value) => console.log('received', value)],
  [after(5000), () => console.log('timeout')],
])

// Non-blocking — returns immediately if nothing is ready (Go's select with default)
await select(
  [[ch.recv(), (value) => process(value)]],
  { default: () => console.log('channel not ready') },
)
```
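
The blocking form of `select` maps naturally onto `Promise.race` plus handler dispatch. A toy sketch (the non-blocking `default` path is omitted, since it needs channel readiness information that bare promises do not expose):

```typescript
// Toy blocking select: race the case promises, then run the handler
// belonging to whichever settled first. Each case is [promise, handler].
type SelectCase<T, R> = [Promise<T>, (value: T) => R]

async function toySelect<R>(cases: Array<SelectCase<any, R>>): Promise<R> {
  // Tag each promise with its index so we know which handler to invoke.
  const tagged = cases.map(([p], i) => p.then(v => ({ i, v })))
  const first = await Promise.race(tagged)
  return cases[first.i][1](first.v)
}
```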

### `after(ms)` / `ticker(ms)`

Timers for use with `select` and async iteration.

```typescript
await after(1000) // one-shot: resolves after 1 second

// Repeating: tick every 500ms
const t = ticker(500)
for await (const _ of t) {
  console.log('tick')
  if (shouldStop) t.stop()
}
```
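
The ticker shape (an async iterable with a `stop()` method) is easy to picture as an async generator. A toy sketch, not puru's implementation:

```typescript
// Toy ticker: an async iterable that yields roughly every `ms` until
// stop() is called.
function toyTicker(ms: number) {
  let stopped = false
  return {
    stop() { stopped = true },
    async *[Symbol.asyncIterator]() {
      while (!stopped) {
        await new Promise(r => setTimeout(r, ms))
        if (!stopped) yield Date.now()
      }
    },
  }
}
```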

### `register(name, fn)` / `run(name, ...args)`

Named task registry. Register functions by name, call them by name.

```typescript
register('resize', (buffer, w, h) => sharp(buffer).resize(w, h).toBuffer())
const resized = await run('resize', imageBuffer, 800, 600)
```
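
The registry contract is essentially a name-to-function map. A main-thread toy version (puru additionally executes the registered function in a worker, which this sketch omits):

```typescript
// Toy task registry: toyRegister() stores a function under a name,
// toyRun() looks it up and invokes it with the given arguments.
const registry = new Map<string, (...args: any[]) => unknown>()

function toyRegister(name: string, fn: (...args: any[]) => unknown): void {
  registry.set(name, fn)
}

async function toyRun<T>(name: string, ...args: any[]): Promise<T> {
  const fn = registry.get(name)
  if (!fn) throw new Error(`no task registered under "${name}"`)
  return (await fn(...args)) as T
}
```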

### `configure(opts?)`

Optional global configuration. Must be called before the first `spawn()`.

```typescript
configure({
  maxThreads: 4, // default: os.availableParallelism()
  concurrency: 64, // max concurrent tasks per shared worker (default: 64)
  idleTimeout: 30_000, // kill idle workers after 30s (default)
  adapter: 'auto', // 'auto' | 'node' | 'bun' | 'inline'
})
```

### `stats()` / `resize(n)`

```typescript
const s = stats() // { totalWorkers, idleWorkers, busyWorkers, queuedTasks, ... }
resize(8) // scale pool up/down at runtime
```

### `detectRuntime()` / `detectCapability()`

```typescript
detectRuntime() // 'node' | 'bun' | 'deno' | 'browser'
detectCapability() // 'full-threads' | 'single-thread'
```

## Benchmarks

Apple M1 Pro (8 cores), 16 GB RAM. Median of 5 runs after warmup.

```bash
npm run bench # all benchmarks (Node.js)
npm run bench:bun # all benchmarks (Bun)
```

### CPU-Bound Parallelism (Node.js)

| Benchmark | Without puru | With puru | Speedup |
| --- | --: | --: | --: |
| Fibonacci (fib(38) x8) | 4,345 ms | 2,131 ms | **2.0x** |
| Prime counting (2M range) | 335 ms | 77 ms | **4.4x** |
| Matrix multiply (200x200 x8) | 140 ms | 39 ms | **3.6x** |
| Data processing (100K items x8) | 221 ms | 67 ms | **3.3x** |

### Channels Fan-Out Pipeline (Node.js)

200 items with CPU-heavy transform, 4 parallel transform workers:

| Approach | Time | vs Sequential |
| --- | --: | --: |
| Sequential (no channels) | 176 ms | baseline |
| Main-thread channels only | 174 ms | 1.0x |
| **puru fan-out (4 workers)** | **51 ms** | **3.4x faster** |

### M:N Concurrent Async (Node.js)

100 async tasks with simulated I/O + CPU:

| Approach | Time | vs Sequential |
| --- | --: | --: |
| Sequential | 1,140 ms | baseline |
| Promise.all (main thread) | 20 ms | 58x faster |
| **puru concurrent (M:N)** | **16 ms** | **73x faster** |

Both Promise.all and puru concurrent are fast — but puru runs everything **off the main thread**, keeping your server responsive under load.

> Spawn overhead is ~0.1-0.5 ms. Use `spawn` for tasks > 5ms. For trivial operations, call directly.
## Runtimes

| Runtime | Support | How |
| --- | --- | --- |
| Node.js >= 18 | Full | `worker_threads` |
| Bun | Full | Web Workers (file-based) |
| Cloudflare Workers | Error | No thread support |
| Vercel Edge | Error | No thread support |

## Testing

```typescript
import { configure } from 'puru'
configure({ adapter: 'inline' }) // runs tasks in main thread, no real workers
```

## Limitations

- Functions passed to `spawn()` cannot capture variables from the enclosing scope
- Channel values must be structured-cloneable (no functions, symbols, or WeakRefs)
- `null` cannot be sent through a channel (it's the "closed" sentinel)
- `register()`/`run()` args must be JSON-serializable
- Channel operations from workers have ~0.1-0.5ms RPC overhead per send/recv (fine for coarse-grained coordination, not for per-item micro-operations)
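
If you need to send nullable payloads despite the `null` close sentinel, wrapping values in a one-field object before they enter the channel is enough. `box`/`unbox` below are hypothetical helper names, not part of puru's API:

```typescript
// Wrap nullable values so a null payload survives a channel whose closed
// state is signalled by a bare null. Boxed<T> is a hypothetical helper
// type for illustration only.
type Boxed<T> = { value: T | null }

const box = <T>(value: T | null): Boxed<T> => ({ value })
const unbox = <T>(b: Boxed<T>): T | null => b.value

// A received bare null still means "channel closed"; a Boxed object with
// a null inside means "the payload itself was null".
```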

## License

MIT