pompelmi 0.34.1 → 0.34.2

Files changed (2)
  1. package/README.md +69 -627
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -1,677 +1,119 @@
1
- <div align="center">
2
-
3
- <picture>
4
- <source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/pompelmi/pompelmi/refs/heads/main/assets/logo.svg">
5
- <source media="(prefers-color-scheme: light)" srcset="https://raw.githubusercontent.com/pompelmi/pompelmi/refs/heads/main/assets/logo.svg">
6
- <img src="https://raw.githubusercontent.com/pompelmi/pompelmi/refs/heads/main/assets/logo.svg" alt="pompelmi" width="320" />
7
- </picture>
8
-
9
- <h1>pompelmi</h1>
10
-
11
- <p><strong>Secure file upload scanning for Node.js — private, in-process, zero cloud dependencies.</strong></p>
12
-
13
- <p>
14
- Scan files <em>before</em> they touch disk &nbsp;•&nbsp;
15
- No cloud APIs, no daemon &nbsp;•&nbsp;
16
- TypeScript-first &nbsp;•&nbsp;
17
- Drop-in framework adapters
18
- </p>
19
-
20
- <p>
21
- <a href="https://www.npmjs.com/package/pompelmi"><img alt="npm version" src="https://img.shields.io/npm/v/pompelmi?label=version&color=0a7ea4&logo=npm"></a>
22
- <a href="https://www.npmjs.com/package/pompelmi"><img alt="npm downloads" src="https://img.shields.io/npm/dm/pompelmi?label=downloads&color=6E9F18&logo=npm"></a>
23
- <a href="https://github.com/pompelmi/pompelmi/blob/main/LICENSE"><img alt="license" src="https://img.shields.io/npm/l/pompelmi?color=blue"></a>
24
- <img alt="node" src="https://img.shields.io/badge/node-%3E%3D18-339933?logo=node.js&logoColor=white">
25
- <a href="https://github.com/pompelmi/pompelmi/actions/workflows/ci.yml"><img alt="CI" src="https://img.shields.io/github/actions/workflow/status/pompelmi/pompelmi/ci.yml?branch=main&label=CI&logo=github"></a>
26
- <a href="https://codecov.io/gh/pompelmi/pompelmi"><img alt="codecov" src="https://codecov.io/gh/pompelmi/pompelmi/branch/main/graph/badge.svg?flag=core"/></a>
27
- <img alt="types" src="https://img.shields.io/badge/types-TypeScript-3178C6?logo=typescript&logoColor=white">
28
- <img alt="ESM" src="https://img.shields.io/badge/ESM%2FCJS-compatible-yellow">
29
- <a href="https://snyk.io/test/github/pompelmi/pompelmi"><img alt="Snyk" src="https://snyk.io/test/github/pompelmi/pompelmi/badge.svg"></a>
30
- <a href="https://securityscorecards.dev/viewer/?uri=github.com/pompelmi/pompelmi"><img alt="OpenSSF Scorecard" src="https://api.securityscorecards.dev/projects/github.com/pompelmi/pompelmi/badge"/></a>
31
- </p>
32
-
33
- <p>
34
- <a href="https://pompelmi.github.io/pompelmi/"><strong>📚 Docs</strong></a> &nbsp;•&nbsp;
35
- <a href="#-installation"><strong>💾 Install</strong></a> &nbsp;•&nbsp;
36
- <a href="#-quickstart"><strong>⚡ Quickstart</strong></a> &nbsp;•&nbsp;
37
- <a href="#-framework-adapters"><strong>🧩 Adapters</strong></a> &nbsp;•&nbsp;
38
- <a href="#-yara"><strong>🧬 YARA</strong></a> &nbsp;•&nbsp;
39
- <a href="#-github-action"><strong>🤖 CI/CD</strong></a> &nbsp;•&nbsp;
40
- <a href="./examples/"><strong>💡 Examples</strong></a>
41
- </p>
42
-
43
- </div>
44
-
45
- ---
46
-
47
- ## Why pompelmi?
48
-
49
- Most upload handlers check the file extension and content-type header — and stop there. Real threats arrive as ZIP bombs, polyglot files, macro-embedded documents, and files with spoofed MIME types.
50
-
51
- **pompelmi scans file bytes in-process, before anything is written to disk or stored**, blocking threats at the earliest possible point — with no cloud API and no daemon.
52
-
53
- | | pompelmi | ClamAV | Cloud AV APIs |
54
- |---|---|---|---|
55
- | **Setup** | `npm install` | Daemon + config | API keys + integration |
56
- | **Privacy** | ✅ In-process — data stays local | ✅ Local (separate daemon) | ❌ Files sent externally |
57
- | **Latency** | ✅ Zero (no IPC, no network) | IPC overhead | Network round-trip |
58
- | **Cost** | Free (MIT) | Free (GPL) | Per-scan billing |
59
- | **Framework adapters** | ✅ Express, Koa, Next.js, NestJS, Fastify | ❌ | ❌ |
60
- | **TypeScript** | ✅ First-class | community types | varies |
61
- | **YARA** | ✅ Built-in | manual setup | limited |
62
-
63
- ---
64
-
65
- ## 📦 Installation
1
+ ![Pompelmi banner](./assets/readme-banner.png)
66
2
 
67
- ```bash
68
- npm install pompelmi
69
- ```
3
+ # Pompelmi
70
4
 
71
- > Node.js 18+. No daemon, no config files, no API keys required.
5
+ [![GitHub stars](https://img.shields.io/github/stars/pompelmi/pompelmi?style=flat-square&logo=github)](https://github.com/pompelmi/pompelmi/stargazers)
6
+ [![npm version](https://img.shields.io/npm/v/pompelmi?label=version&logo=npm)](https://www.npmjs.com/package/pompelmi)
7
+ [![CI](https://img.shields.io/github/actions/workflow/status/pompelmi/pompelmi/ci.yml?branch=main&label=CI&logo=github)](https://github.com/pompelmi/pompelmi/actions/workflows/ci.yml)
72
8
 
73
- ---
9
+ In-process file upload security for Node.js.
74
10
 
75
- ## Quickstart
11
+ Your file upload endpoint is part of your attack surface.
76
12
 
77
- Scan a file and get a verdict in three lines:
13
+ Pompelmi is an open-source Node.js library that scans and blocks risky uploads before they hit storage or downstream processing. It runs in-process, with no cloud API, no daemon, and no required data egress.
78
14
 
79
- ```ts
80
- import { scanFile } from 'pompelmi';
15
+ Works with Express, Next.js, NestJS, Fastify, and Koa. The MIT-licensed core is the primary path in this repo.
81
16
 
82
- const result = await scanFile('path/to/upload.pdf');
83
- // result.verdict → "clean" | "suspicious" | "malicious"
17
+ ## Why this matters
84
18
 
85
- if (result.verdict !== 'clean') {
86
- throw new Error(`Blocked: ${result.verdict} — ${result.reasons}`);
87
- }
88
- ```
19
+ Most upload handlers stop at extension checks or client-provided MIME types. That leaves gaps for spoofed files, archive bombs, polyglots, and script-bearing documents.
89
20
 
90
- Works standalone in any Node.js context — no framework required.
21
+ Without Pompelmi:
91
22
 
92
- ---
23
+ `upload -> trust filename/MIME -> store -> parse or serve later`
93
24
 
94
- ## 🎬 Demo
95
-
96
- ![Pompelmi Demo](./assets/malware-detection-node-demo.gif)
97
-
98
- **Try it now:** browse the [examples/](./examples/) directory or run a sample locally:
99
-
100
- ```bash
101
- npx tsx examples/scan-one-file.ts
102
- ```
25
+ With Pompelmi:
103
26
 
104
- ---
27
+ `upload -> inspect bytes + structure -> allow | quarantine | reject -> store/process`
105
28
 
106
- ## Why developers choose pompelmi
29
+ ## Key protections
107
30
 
108
- - **Privacy-first** all scanning is in-process; no bytes leave your infrastructure, ever.
109
- - **No daemon, no sidecar** install like any npm package and start scanning immediately.
110
- - **Blocks early** runs before you write to disk, persist to storage, or pass files to other services.
111
- - **Defense-in-depth** magic-byte MIME sniffing, extension allow-lists, size caps, ZIP bomb guards, polyglot detection.
112
- - **Composable** — chain heuristics, YARA rules, and custom scanners with `composeScanners`. Set `stopOn` and per-scanner timeouts.
113
- - **Framework-friendly** — drop-in middleware for Express, Koa, Next.js, NestJS, Nuxt/Nitro, and Fastify.
114
- - **TypeScript-first** — complete types, modern ESM/CJS builds, tree-shakeable, minimal core dependencies.
115
- - **CI/CD ready** — GitHub Action to scan files and artifacts in pipelines.
31
+ - Extension, size, and declared MIME policy checks.
32
+ - Magic-byte validation for renamed or disguised files.
33
+ - Archive controls for ZIP bombs, traversal, and nesting depth.
34
+ - Heuristics for risky structures such as executables, polyglots, and script-bearing documents.
35
+ - Optional YARA matching when you need signature-based rules.
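Review note on the "Key protections" list added above: the magic-byte bullet is the one most easily shown in code. The following is a minimal sketch of the general technique, not pompelmi's implementation or API. The names `SIGNATURES`, `sniffMime`, and `declaredTypeMatches` are hypothetical, and the signature table covers only four well-known formats.

```typescript
// Illustrative sketch of magic-byte validation. All names here are
// hypothetical and are NOT part of pompelmi's API.
type Signature = { mime: string; bytes: number[] };

// Leading byte sequences for a few well-known formats.
const SIGNATURES: Signature[] = [
  { mime: 'image/png', bytes: [0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a] },
  { mime: 'image/jpeg', bytes: [0xff, 0xd8, 0xff] },
  { mime: 'application/pdf', bytes: [0x25, 0x50, 0x44, 0x46, 0x2d] }, // "%PDF-"
  { mime: 'application/zip', bytes: [0x50, 0x4b, 0x03, 0x04] }, // "PK\x03\x04"
];

// Return the MIME type whose signature prefixes the data, if any.
function sniffMime(data: Uint8Array): string | undefined {
  for (const sig of SIGNATURES) {
    const longEnough = data.length >= sig.bytes.length;
    if (longEnough && sig.bytes.every((b, i) => data[i] === b)) {
      return sig.mime;
    }
  }
  return undefined;
}

// True only when the sniffed type agrees with the client-declared type.
function declaredTypeMatches(data: Uint8Array, declaredMime: string): boolean {
  return sniffMime(data) === declaredMime;
}

// A PDF payload uploaded with a declared type of image/png is flagged
// as a mismatch, regardless of the filename or the client-sent header.
const pdfBytes = new Uint8Array([0x25, 0x50, 0x44, 0x46, 0x2d, 0x31, 0x2e, 0x37]);
const mismatch = !declaredTypeMatches(pdfBytes, 'image/png');
console.log(mismatch);
```

A prefix check like this only catches renamed or mislabeled files. Polyglots, which are valid under more than one format at once, need the structural heuristics that the list mentions separately.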
116
36
 
117
- ---
118
-
119
- ## 🧩 Framework adapters
120
-
121
- All adapters share the same policy options and scanning contract. Install only what you need.
122
-
123
- | Framework | Package | Status |
124
- |---|---|---|
125
- | **Express** | `@pompelmi/express-middleware` | ✅ Stable |
126
- | **Next.js** | `@pompelmi/next-upload` | ✅ Stable |
127
- | **Koa** | `@pompelmi/koa-middleware` | ✅ Stable |
128
- | **NestJS** | `@pompelmi/nestjs-integration` | ✅ Stable |
129
- | **Nuxt / Nitro** | built-in `pompelmi` | ✅ [Guide](https://pompelmi.github.io/pompelmi/how-to/nuxt-nitro/) |
130
- | **Fastify** | `@pompelmi/fastify-plugin` | 🔶 Alpha |
131
- | **Remix / SvelteKit / hapi** | — | 🔜 Planned |
37
+ ## Quick start
132
38
 
133
39
  ```bash
134
- npm i @pompelmi/express-middleware # Express
135
- npm i @pompelmi/next-upload # Next.js
136
- npm i @pompelmi/koa-middleware # Koa
137
- npm i @pompelmi/nestjs-integration # NestJS
138
- npm i @pompelmi/fastify-plugin # Fastify (alpha)
139
- npm i -g @pompelmi/cli # CLI / CI/CD
140
- ```
141
-
142
- ### Express
143
-
144
- ```ts
145
- import express from 'express';
146
- import multer from 'multer';
147
- import { createUploadGuard } from '@pompelmi/express-middleware';
148
- import { scanner, policy } from './lib/security';
149
-
150
- const app = express();
151
- app.post(
152
- '/upload',
153
- multer({ storage: multer.memoryStorage() }).any(),
154
- createUploadGuard({ ...policy, scanner }),
155
- (req, res) => res.json({ verdict: (req as any).pompelmi?.verdict })
156
- );
157
- ```
158
-
159
- ### Next.js App Router
160
-
161
- ```ts
162
- // app/api/upload/route.ts
163
- import { createNextUploadHandler } from '@pompelmi/next-upload';
164
- import { scanner, policy } from '@/lib/security';
165
-
166
- export const runtime = 'nodejs';
167
- export const POST = createNextUploadHandler({ ...policy, scanner });
40
+ npm install pompelmi
168
41
  ```
169
42
 
170
- ### NestJS
171
-
172
43
  ```ts
173
- // app.module.ts
174
- import { PompelmiModule } from '@pompelmi/nestjs-integration';
175
- import { CommonHeuristicsScanner } from 'pompelmi';
176
-
177
- @Module({
178
- imports: [
179
- PompelmiModule.forRoot({
180
- includeExtensions: ['pdf', 'zip', 'png', 'jpg'],
181
- maxFileSizeBytes: 10 * 1024 * 1024,
182
- scanners: [CommonHeuristicsScanner],
183
- }),
184
- ],
185
- })
186
- export class AppModule {}
187
- ```
188
-
189
- > 📖 **More examples:** Check the [examples/](./examples/) directory for complete working demos including Koa, Nuxt/Nitro, standalone, and more.
44
+ import { scanBytes, STRICT_PUBLIC_UPLOAD } from 'pompelmi';
190
45
 
191
- 👉 **[View all adapter docs →](https://pompelmi.github.io/pompelmi/)** &nbsp;&nbsp; **[Browse all examples →](./examples/)**
192
-
193
- ---
194
-
195
- ## 🧱 Composing scanners
196
-
197
- Build a layered scanner with heuristics, ZIP bomb protection, and optional YARA:
198
-
199
- ```ts
200
- import { CommonHeuristicsScanner, createZipBombGuard, composeScanners } from 'pompelmi';
201
-
202
- export const scanner = composeScanners(
203
- [
204
- ['zipGuard', createZipBombGuard({ maxEntries: 512, maxCompressionRatio: 12 })],
205
- ['heuristics', CommonHeuristicsScanner],
206
- // ['yara', YourYaraScanner],
207
- ],
208
- { parallel: false, stopOn: 'suspicious', timeoutMsPerScanner: 1500, tagSourceName: true }
209
- );
210
- ```
211
-
212
- `composeScanners` supports two call forms:
213
- - **Named array** *(recommended)*: `composeScanners([['name', scanner], ...], opts?)`
214
- - **Variadic** *(backward-compatible)*: `composeScanners(scannerA, scannerB, ...)`
215
-
216
- ### Upload flow
217
-
218
- ```mermaid
219
- flowchart TD
220
- A["Client uploads file(s)"] --> B["Web App Route"]
221
- B --> C{"Pre-filters (ext, size, MIME)"}
222
- C -- fail --> X["HTTP 4xx"]
223
- C -- pass --> D{"Is ZIP?"}
224
- D -- yes --> E["Iterate entries (limits & scan)"]
225
- E --> F{"Verdict?"}
226
- D -- no --> F{"Scan bytes"}
227
- F -- malicious/suspicious --> Y["HTTP 422 blocked"]
228
- F -- clean --> Z["HTTP 200 ok + results"]
229
- ```
230
-
231
- ---
232
-
233
- ## ⚙️ Configuration
234
-
235
- All adapters accept the same options:
236
-
237
- | Option | Type | Description |
238
- |---|---|---|
239
- | `scanner` | `{ scan(bytes: Uint8Array): Promise<Match[]> }` | Your scanning engine. Return `[]` for clean. |
240
- | `includeExtensions` | `string[]` | Allowed file extensions (case-insensitive). |
241
- | `allowedMimeTypes` | `string[]` | Allowed MIME types after magic-byte sniffing. |
242
- | `maxFileSizeBytes` | `number` | Per-file size cap; oversized files are rejected early. |
243
- | `timeoutMs` | `number` | Per-file scan timeout. |
244
- | `concurrency` | `number` | Max files scanned in parallel. |
245
- | `failClosed` | `boolean` | Block uploads on scanner errors or timeouts. |
246
- | `onScanEvent` | `(event) => void` | Hook for logging and metrics. |
247
-
248
- **Example — images only, 5 MB max:**
249
-
250
- ```ts
251
- {
252
- includeExtensions: ['png', 'jpg', 'jpeg', 'webp'],
253
- allowedMimeTypes: ['image/png', 'image/jpeg', 'image/webp'],
254
- maxFileSizeBytes: 5 * 1024 * 1024,
46
+ const report = await scanBytes(file.buffer, {
47
+ filename: file.originalname,
48
+ mimeType: file.mimetype,
49
+ policy: STRICT_PUBLIC_UPLOAD,
255
50
  failClosed: true,
256
- }
257
- ```
258
-
259
- ---
260
-
261
- ## 📦 Import entrypoints
262
-
263
- pompelmi ships multiple named entrypoints so you only bundle what you need:
264
-
265
- | Entrypoint | Import | Environment | What it includes |
266
- |---|---|---|---|
267
- | **Default (Node.js)** | `import ... from 'pompelmi'` | Node.js | Full API — HIPAA, cache, threat-intel, ZIP streaming, YARA |
268
- | **Browser-safe** | `import ... from 'pompelmi/browser'` | Browser / bundler | Core scan API, scanners, policy — no Node.js built-ins |
269
- | **React** | `import ... from 'pompelmi/react'` | Browser / React | All browser-safe + `useFileScanner` hook (peer: react ≥18) |
270
- | **Quarantine** | `import ... from 'pompelmi/quarantine'` | Node.js | Quarantine lifecycle — hold/review/promote/delete |
271
- | **Hooks** | `import ... from 'pompelmi/hooks'` | Both | `onScanStart`, `onScanComplete`, `onThreatDetected`, `onQuarantine` |
272
- | **Audit** | `import ... from 'pompelmi/audit'` | Node.js | Structured NDJSON audit trail for compliance/SIEM |
273
- | **Policy packs** | `import ... from 'pompelmi/policy-packs'` | Both | Named pre-configured policies (`documents-only`, `images-only`, …) |
274
-
275
- ---
276
-
277
- ## 🔒 Policy packs
278
-
279
- Named, pre-configured policies for common upload scenarios:
280
-
281
- ```ts
282
- import { POLICY_PACKS, getPolicyPack } from 'pompelmi/policy-packs';
283
-
284
- // Use a built-in pack:
285
- const policy = POLICY_PACKS['strict-public-upload'];
286
-
287
- // Or retrieve by name:
288
- const policy = getPolicyPack('documents-only');
289
- ```
290
-
291
- | Pack | Extensions | Max size | Best for |
292
- |---|---|---|---|
293
- | `documents-only` | PDF, Word, Excel, PowerPoint, CSV, TXT, MD | 25 MB | Document portals, data import |
294
- | `images-only` | JPEG, PNG, GIF, WebP, AVIF, TIFF | 10 MB | Avatars, product images (SVG excluded) |
295
- | `strict-public-upload` | JPEG, PNG, WebP, PDF only | 5 MB | Anonymous/untrusted upload surfaces |
296
- | `conservative-default` | ZIP, images, PDF, CSV, DOCX, XLSX | 10 MB | General hardened default |
297
- | `archives` | ZIP, tar, gz, 7z, rar | 100 MB | Archive endpoints (pair with `createZipBombGuard`) |
298
-
299
- All packs are built on `definePolicy` and are fully overridable.
300
-
301
- ---
302
-
303
- ## 🗄️ Quarantine workflow
304
-
305
- Hold suspicious files for manual review before accepting or permanently deleting them.
306
-
307
- ```ts
308
- import { scanBytes } from 'pompelmi';
309
- import { QuarantineManager, FilesystemQuarantineStorage } from 'pompelmi/quarantine';
310
-
311
- // One-time setup — store quarantined files locally.
312
- const quarantine = new QuarantineManager({
313
- storage: new FilesystemQuarantineStorage({ dir: './quarantine' }),
314
51
  });
315
52
 
316
- // In your upload handler:
317
- const report = await scanBytes(fileBytes, { ctx: { filename: 'upload.pdf' } });
318
-
319
53
  if (report.verdict !== 'clean') {
320
- const entry = await quarantine.quarantine(fileBytes, report, {
321
- originalName: 'upload.pdf',
322
- sizeBytes: fileBytes.length,
323
- uploadedBy: req.user?.id,
324
- });
325
- return res.status(202).json({ quarantineId: entry.id });
54
+ return res.status(422).json({ error: 'Upload blocked', reasons: report.reasons });
326
55
  }
327
56
  ```
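Review note on the new quick start: it returns 422 for any non-clean verdict, while the "With Pompelmi" flow earlier in the new README names three outcomes (`allow | quarantine | reject`). The sketch below shows one way an application might map verdicts to those outcomes. It assumes the verdict values `"clean" | "suspicious" | "malicious"` listed in the 0.34.1 README. The function `decideUpload`, the `'error'` outcome, and the choice of 202 for quarantine are hypothetical and are not part of pompelmi's API.

```typescript
// Hypothetical application-side mapping from scan outcome to upload decision.
// Not part of pompelmi; verdict names are taken from the 0.34.1 README.
type Verdict = 'clean' | 'suspicious' | 'malicious';
type Outcome = Verdict | 'error'; // 'error' stands for a scanner failure or timeout
type Decision = { action: 'allow' | 'quarantine' | 'reject'; status: number };

function decideUpload(outcome: Outcome, failClosed: boolean = true): Decision {
  switch (outcome) {
    case 'clean':
      return { action: 'allow', status: 200 };
    case 'suspicious':
      // Hold for review instead of storing or dropping silently.
      return { action: 'quarantine', status: 202 };
    case 'malicious':
      return { action: 'reject', status: 422 };
    case 'error':
      // Fail closed: treat an unscannable upload as blocked.
      return failClosed
        ? { action: 'reject', status: 422 }
        : { action: 'allow', status: 200 };
  }
}

// Example: a suspicious file is held rather than stored.
const decision = decideUpload('suspicious');
console.log(decision.action, decision.status);
```

Keeping the mapping in one pure function makes the policy easy to unit-test and to change, for example when an endpoint should reject suspicious files outright as the quick start does.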
328
57
 
329
- **Review API:**
330
-
331
- ```ts
332
- // List pending entries:
333
- const pending = await quarantine.listPending();
334
-
335
- // Approve (promote to storage):
336
- await quarantine.resolve(entryId, { decision: 'promote', reviewedBy: 'ops-team' });
337
-
338
- // Delete permanently:
339
- await quarantine.resolve(entryId, { decision: 'delete', reviewedBy: 'ops-team', reviewNote: 'Confirmed malware' });
340
-
341
- // Generate an audit report:
342
- const report = await quarantine.report({ status: 'pending' });
343
- ```
344
-
345
- The `QuarantineStorage` interface is pluggable — implement it for S3, GCS, a database, or any other backend. `FilesystemQuarantineStorage` is the local reference implementation.
346
-
347
- ---
58
+ Next steps:
348
59
 
349
- ## 🪝 Scan hooks
60
+ - [Getting started](https://pompelmi.github.io/pompelmi/getting-started/)
61
+ - [Framework guides](https://pompelmi.github.io/pompelmi/how-to/express/)
62
+ - [Threat model and architecture](https://pompelmi.github.io/pompelmi/explaination/architecture/)
350
63
 
351
- Observe the scan lifecycle without modifying the pipeline:
64
+ ## Framework support
352
65
 
353
- ```ts
354
- import { scanBytes } from 'pompelmi';
355
- import { createScanHooks, withHooks } from 'pompelmi/hooks';
356
-
357
- const hooks = createScanHooks({
358
- onScanComplete(ctx, report) {
359
- metrics.increment('scans.total');
360
- metrics.histogram('scan.duration_ms', report.durationMs ?? 0);
361
- },
362
- onThreatDetected(ctx, report) {
363
- alerting.notify({ file: ctx.filename, verdict: report.verdict });
364
- },
365
- onScanError(ctx, error) {
366
- logger.error({ file: ctx.filename, error });
367
- },
368
- });
369
-
370
- // Wrap your scan function once, then use it everywhere:
371
- const scan = withHooks(scanBytes, hooks);
372
- const report = await scan(fileBytes, { ctx: { filename: 'upload.zip' } });
373
- ```
374
-
375
- ---
66
+ | Framework | Package or guide |
67
+ | --- | --- |
68
+ | Express | `@pompelmi/express-middleware` |
69
+ | Next.js | `@pompelmi/next-upload` |
70
+ | NestJS | `@pompelmi/nestjs-integration` |
71
+ | Koa | `@pompelmi/koa-middleware` |
72
+ | Fastify | `@pompelmi/fastify-plugin` |
73
+ | Nuxt/Nitro | guide in docs |
376
74
 
377
- ## 🔍 Audit trail
75
+ ## Trust / production readiness
378
76
 
379
- Write a structured NDJSON audit record for every scan and quarantine event:
77
+ - MIT-licensed core, typed APIs, framework adapters, and composable policy packs.
78
+ - Structured verdicts, reasons, and rule matches for logging, quarantine, and review flows.
79
+ - Public docs, examples, changelog, tests, and a security disclosure policy.
80
+ - Local-first deployment model with no required cloud scanning dependency.
81
+ - Built as a defense-in-depth upload gate, not a full antivirus replacement.
380
82
 
381
- ```ts
382
- import { AuditTrail } from 'pompelmi/audit';
383
-
384
- const audit = new AuditTrail({
385
- output: { dest: 'file', path: './audit.jsonl' },
386
- });
83
+ Start here:
387
84
 
388
- // After each scan:
389
- audit.logScanComplete(report, { filename: 'upload.pdf', uploadedBy: req.user?.id });
85
+ - [Production readiness](https://pompelmi.github.io/pompelmi/production-readiness/)
86
+ - [Threat model and architecture](https://pompelmi.github.io/pompelmi/explaination/architecture/)
87
+ - [Examples directory](./examples)
88
+ - [Security policy](./SECURITY.md)
89
+ - [Tests](./tests)
390
90
 
391
- // After quarantine:
392
- audit.logQuarantine(entry);
393
-
394
- // After resolution:
395
- audit.logQuarantineResolved(entry);
396
- ```
91
+ ## FAQ
397
92
 
398
- Each record is a single JSON line with `timestamp`, `event`, `verdict`, `matchCount`, `durationMs`, `sha256`, and more — ready for your SIEM or compliance tools.
93
+ ### Does Pompelmi send files to a cloud API?
399
94
 
400
- ---
95
+ No. Scanning runs in-process by default. File bytes do not need to leave your infrastructure.
401
96
 
402
- ## Production checklist
403
- - [ ] Set `maxFileSizeBytes` — reject oversized files before scanning.
404
- - [ ] Restrict `includeExtensions` and `allowedMimeTypes` to what your app truly needs (or use a [policy pack](#-policy-packs)).
405
- - [ ] Set `failClosed: true` to block uploads on timeouts or scanner errors.
406
- - [ ] Enable deep ZIP inspection; keep nesting depth low.
407
- - [ ] Use `composeScanners` with `stopOn` to fail fast on early detections.
408
- - [ ] Log scan events with [scan hooks](#-scan-hooks) and monitor for anomaly spikes.
409
- - [ ] Wire up the [quarantine workflow](#-quarantine-workflow) for suspicious files rather than silently dropping them.
410
- - [ ] Write an [audit trail](#-audit-trail) for compliance and incident response.
411
- - [ ] Consider running scans in a separate process or container for defense-in-depth.
412
- - [ ] Sanitize file names and paths before persisting uploads.
413
- - [ ] Keep files in memory until policy passes — avoid writing untrusted bytes to disk first.
414
-
415
- ---
416
-
417
- ## 🧬 YARA
418
-
419
- YARA lets you write custom pattern-matching rules and use them as a scanner engine. pompelmi treats YARA matches as signals you map to verdicts (`suspicious`, `malicious`).
420
-
421
- > **Optional.** pompelmi works without YARA. Add it when you need custom detection rules.
422
-
423
- ### Minimal adapter
424
-
425
- ```ts
426
- export const MyYaraScanner = {
427
- async scan(bytes: Uint8Array) {
428
- const matches = await compiledRules.scan(bytes, { timeout: 1500 });
429
- return matches.map(m => ({ rule: m.rule, meta: m.meta ?? {}, tags: m.tags ?? [] }));
430
- }
431
- };
432
- ```
433
-
434
- Plug it into your composed scanner:
435
-
436
- ```ts
437
- import { composeScanners, CommonHeuristicsScanner } from 'pompelmi';
438
-
439
- export const scanner = composeScanners(
440
- [
441
- ['heuristics', CommonHeuristicsScanner],
442
- ['yara', MyYaraScanner],
443
- ],
444
- { parallel: false, stopOn: 'suspicious', timeoutMsPerScanner: 1500, tagSourceName: true }
445
- );
446
- ```
447
-
448
- Starter rules for common threats (EICAR, PDF-embedded JS, Office macros) are in [`rules/starter/`](./rules/).
449
-
450
- **Suggested verdict mapping:**
451
- - `malicious` — high-confidence rules (e.g., `EICAR_Test_File`)
452
- - `suspicious` — heuristic rules (e.g., PDF JavaScript, macro keywords)
453
- - `clean` — no matches
454
-
455
- ### Quick smoke test
456
-
457
- ```bash
458
- # Create a minimal PDF with risky embedded actions
459
- printf '%%PDF-1.7\n1 0 obj\n<< /OpenAction 1 0 R /AA << /JavaScript (alert(1)) >> >>\nendobj\n%%%%EOF\n' > risky.pdf
460
-
461
- # Send it to your endpoint — expect HTTP 422
462
- curl -F "file=@risky.pdf;type=application/pdf" http://localhost:3000/upload -i
463
- ```
464
-
465
- 👉 **[Full YARA guide in docs →](https://pompelmi.github.io/pompelmi/)**
466
-
467
- ---
468
-
469
- ## 🤖 GitHub Action
470
-
471
- Scan files or build artifacts in CI with a single step:
472
-
473
- ```yaml
474
- - uses: pompelmi/pompelmi/.github/actions/pompelmi-scan@v1
475
- with:
476
- path: .
477
- deep_zip: true
478
- fail_on_detect: true
479
- ```
480
-
481
- | Input | Default | Description |
482
- |---|---|---|
483
- | `path` | `.` | Directory to scan. |
484
- | `artifact` | `""` | Single file or archive to scan. |
485
- | `yara_rules` | `""` | Glob path to `.yar` rule files. |
486
- | `deep_zip` | `true` | Traverse nested archives. |
487
- | `max_depth` | `3` | Max nesting depth. |
488
- | `fail_on_detect` | `true` | Fail the job on any detection. |
489
-
490
- ---
491
-
492
- ## 💡 Use cases
493
-
494
- - **Document upload portals** — verify PDFs, DOCX files, and archives before storage.
495
- - **User-generated content platforms** — block malicious images, scripts, or embedded payloads.
496
- - **Internal tooling and wikis** — protect collaboration tools from lateral-movement attacks.
497
- - **Privacy-sensitive environments** — healthcare, legal, and finance platforms where files must stay on-prem.
498
- - **CI/CD pipelines** — catch malicious artifacts before they enter your build or release chain.
499
-
500
- ---
501
-
502
- ## 🏢 Pompelmi Enterprise
503
-
504
- > The open-source `pompelmi` core is **MIT-licensed and always will be** — actively maintained, freely available, no strings attached. `@pompelmi/enterprise` is an optional commercial plugin for teams that need compliance evidence, production observability, and operational tooling on top.
505
-
506
- ### What Enterprise adds
507
-
508
- | Feature | Core (Free, MIT) | Enterprise |
509
- |---|:---:|:---:|
510
- | File scanning, heuristics, YARA | ✅ | ✅ |
511
- | Framework adapters (Express, Next.js, NestJS…) | ✅ | ✅ |
512
- | Quarantine workflow & policy packs | ✅ | ✅ |
513
- | **Advanced Audit Logging (SIEM-compatible)** | — | ✅ |
514
- | **HMAC-signed tamper-evident log entries** | — | ✅ |
515
- | **File / Webhook / Console log sinks** | — | ✅ |
516
- | **On-disk audit log query API** | — | ✅ |
517
- | **Premium YARA Rules** (WannaCry, Cobalt Strike, XMRig, Mimikatz, LOLBAS) | — | ✅ |
518
- | **Prometheus Metrics endpoint** | — | ✅ |
519
- | **Embedded Web GUI Dashboard** | — | ✅ |
520
- | **Priority support & response SLA** | — | ✅ |
521
-
522
- ### Who it's for
523
-
524
- - **Compliance teams** — HMAC-signed NDJSON audit logs satisfy SOC 2, HIPAA, ISO 27001, and PCI-DSS evidence requirements. Routes to file, console, or a SIEM webhook — no file bytes ever leave your infrastructure.
525
- - **Security operations** — live Prometheus metrics (blocked files, YARA hits by category, p95 scan latency) feed directly into your existing Grafana dashboards, zero custom instrumentation required.
526
- - **Platform / DevSecOps teams** — zero-config embedded web GUI shows scan activity in real time. No build step, no SaaS, no data egress. Five curated premium YARA rules (ransomware, APT, miner, LOLBAS) loaded automatically.
527
-
528
- ### Drop-in integration (30 seconds)
529
-
530
- ```bash
531
- npm install @pompelmi/enterprise
532
- ```
533
-
534
- ```ts
535
- import Pompelmi from 'pompelmi';
536
- import { PompelmiEnterprise } from '@pompelmi/enterprise';
537
-
538
- const enterprise = await PompelmiEnterprise.create({
539
- licenseKey: process.env.POMPELMI_LICENSE_KEY,
540
- auditLogger: { sinks: ['file'], hmac: true, hmacSecret: process.env.AUDIT_HMAC_SECRET },
541
- dashboard: { enabled: true, port: 3742 },
542
- });
543
-
544
- const scanner = new Pompelmi();
545
- enterprise.injectInto(scanner); // loads premium YARA rules + hooks all scan events
546
-
547
- const results = await scanner.scan('/srv/uploads');
548
- // → audit log → ./pompelmi-audit/audit-YYYY-MM-DD.ndjson
549
- // → metrics → http://localhost:3742/metrics
550
- // → dashboard → http://localhost:3742
551
- ```
552
-
553
- <div align="center">
554
-
555
- [![Get Pompelmi Enterprise](https://img.shields.io/badge/Pompelmi%20Enterprise-Upgrade%20Now%20%E2%86%92-0a7ea4?style=for-the-badge)](https://buy.polar.sh/polar_cl_sTQdCkfdsz6D0lyLRIKKB7MJCnmBm6mfsOmTr2l2fqn)
556
-
557
- **[View full feature comparison and pricing →](https://pompelmi.github.io/pompelmi/enterprise)**
558
-
559
- </div>
560
-
561
- ---
562
-
563
- ## 🔒 Security
564
-
565
- - pompelmi **reads** bytes — it never executes uploaded files.
566
- - ZIP scanning enforces entry count, per-entry size, total uncompressed size, and nesting depth limits to guard against archive bombs.
567
- - YARA detection quality depends on the rules you provide; tune them to your threat model.
568
- - For defense-in-depth, consider running scans in a separate process or container.
569
- - **Changelog / releases:** [GitHub Releases](https://github.com/pompelmi/pompelmi/releases).
570
- - **Vulnerability disclosure:** [GitHub Security Advisories](https://github.com/pompelmi/pompelmi/security/advisories). We coordinate a fix before public disclosure.
571
-
572
- ---
573
-
574
- ## 🏆 Recognition
575
-
576
- Featured in:
577
-
578
- - [HelpNet Security](https://www.helpnetsecurity.com/2026/02/02/pompelmi-open-source-secure-file-upload-scanning-node-js/)
579
- - [Stack Overflow Blog](https://stackoverflow.blog/2026/02/23/defense-against-uploads-oss-file-scanner-pompelmi/)
580
- - [Node Weekly #594](https://nodeweekly.com/issues/594)
581
- - [Bytes Newsletter #429](https://bytes.dev/archives/429)
582
- - [Detection Engineering Weekly #124](https://www.detectionengineering.net/p/det-eng-weekly-issue-124-the-defcon)
583
- - [daily.dev](https://app.daily.dev/posts/pompelmi)
584
-
585
- <p align="center">
586
- <a href="https://github.com/sorrycc/awesome-javascript"><img src="https://awesome.re/mentioned-badge.svg" alt="Awesome JavaScript"/></a>
587
- <a href="https://github.com/dzharii/awesome-typescript"><img src="https://awesome.re/mentioned-badge.svg" alt="Awesome TypeScript"/></a>
588
- <a href="https://github.com/sbilly/awesome-security"><img src="https://awesome.re/mentioned-badge.svg" alt="Awesome Security"/></a>
589
- <a href="https://github.com/sindresorhus/awesome-nodejs"><img src="https://awesome.re/mentioned-badge.svg" alt="Awesome Node.js"/></a>
590
- </p>
591
-
592
- <!-- MENTIONS:START -->
593
- <!-- MENTIONS:END -->
594
-
595
- ---
596
-
597
- ## 💬 FAQ
598
-
599
- **Does pompelmi send files to third parties?**
600
- No. All scanning runs in-process inside your Node.js application. No bytes leave your infrastructure.
601
-
602
- **Does it require a daemon or external service?**
603
- No. Install it like any npm package — no daemon, no sidecar, no config files to write.
604
-
605
- **Can I use YARA rules?**
606
- Yes. Wrap your YARA engine behind the `{ scan(bytes) }` interface and pass it to `composeScanners`. Starter rules are in [`rules/starter/`](./rules/).
607
-
608
- **Does it work with my framework?**
609
- Stable adapters exist for Express, Koa, Next.js, and NestJS. A Fastify plugin is in alpha. The core library works standalone with any Node.js server.
610
-
611
- **Why 422 for blocked files?**
612
- It's a common convention that keeps policy violations distinct from transport errors. Use whatever HTTP status code fits your API contract.
613
-
614
- **Are ZIP bombs handled?**
615
- Yes. Archive scanning enforces limits on entry count, per-entry size, total uncompressed size, and nesting depth. Use `failClosed: true` in production.
616
-
617
- **Is commercial support available?**
618
- Yes. Limited async support for integration help, configuration review, and troubleshooting is available from the maintainer. Email [pompelmideveloper@yahoo.com](mailto:pompelmideveloper@yahoo.com).
619
-
620
- ---
621
-
622
- ## 💼 Commercial support
623
-
624
- Limited commercial support is available on a **private, asynchronous, best-effort basis** from the maintainer. This may include:
625
-
626
- - Integration assistance
627
- - Configuration and policy review
628
- - Prioritized troubleshooting
629
- - Upload security guidance
630
-
631
- Support is in writing only — no live calls or real-time support.
632
-
633
- **To inquire**, email [pompelmideveloper@yahoo.com](mailto:pompelmideveloper@yahoo.com) with your framework, Node.js version, pompelmi version, and a short description of your goal or issue.
634
-
635
- > Community support (GitHub Issues and Discussions) remains free and open. For vulnerability disclosure, see [SECURITY.md](./SECURITY.md).
636
-
637
- ---
638
-
639
- ## 🤝 Contributing
640
-
641
- PRs and issues are welcome.
642
-
643
- ```bash
644
- pnpm -r build
645
- pnpm -r lint
646
- pnpm vitest run --coverage --passWithNoTests
647
- ```
97
+ ### Does it require ClamAV, a sidecar, or another daemon?
648
98
 
649
- See [CONTRIBUTING.md](./CONTRIBUTING.md) for full guidelines.
99
+ No. Built-in heuristics work without a daemon. ClamAV and YARA integrations are optional.
650
100
 
651
- <p align="center">
652
- <a href="https://github.com/pompelmi/pompelmi/graphs/contributors">
653
- <img src="https://contrib.rocks/image?repo=pompelmi/pompelmi" alt="Contributors" />
654
- </a>
655
- </p>
101
+ ### What does it help block?
656
102
 
657
- <p align="center">
658
- <a href="https://github.com/sponsors/pompelmi">
659
- <img src="https://img.shields.io/badge/Sponsor-pompelmi-EA4AAA?style=for-the-badge&logo=githubsponsors&logoColor=white" alt="Sponsor pompelmi" />
660
- </a>
661
- </p>
103
+ It adds a layered upload gate before storage or downstream processing. That helps catch spoofed files, archive bombs, polyglots, and common risky document structures.
662
104
 
663
- ---
105
+ ### Is this a complete antivirus replacement?
664
106
 
665
- ## 🌍 Translations
107
+ No. Pompelmi is an upload security layer and risk-reduction control. It should sit inside a broader defense-in-depth design.
666
108
 
667
- [🇮🇹 Italian](docs/i18n/README.it.md) [🇫🇷 French](docs/i18n/README.fr.md) • [🇪🇸 Spanish](docs/i18n/README.es.md) • [🇩🇪 German](docs/i18n/README.de.md) • [🇯🇵 Japanese](docs/i18n/README.ja.md) • [🇨🇳 Chinese](docs/i18n/README.zh-CN.md) [🇰🇷 Korean](docs/i18n/README.ko.md) • [🇧🇷 Portuguese](docs/i18n/README.pt-BR.md) • [🇷🇺 Russian](docs/i18n/README.ru.md) • [🇹🇷 Turkish](docs/i18n/README.tr.md)
109
+ ### Can it help in privacy-sensitive or regulated environments?
668
110
 
669
- The English README is the authoritative source. Contributions to translations are welcome via PR.
111
+ It can support internal control objectives by reducing upload risk and producing structured scan outcomes. It is not itself a compliance certification.
670
112
 
671
- ---
113
+ ## Commercial / enterprise
672
114
 
673
- <p align="right"><a href="#pompelmi">↑ Back to top</a></p>
115
+ Commercial support and enterprise options are available for teams that need rollout help, advanced auditability, or additional operational features. The open-source MIT core remains the default path. See [Support options](https://pompelmi.github.io/pompelmi/support/) and [`@pompelmi/enterprise`](https://pompelmi.github.io/pompelmi/enterprise/).
674
116
 
675
- ## 📜 License
117
+ ## License
676
118
 
677
- [MIT](./LICENSE) © 2025–present pompelmi contributors
119
+ MIT. See [LICENSE](./LICENSE). Also: [Docs](https://pompelmi.github.io/pompelmi/), [GitHub](https://github.com/pompelmi/pompelmi), [npm](https://www.npmjs.com/package/pompelmi).
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "pompelmi",
3
- "version": "0.34.1",
3
+ "version": "0.34.2",
4
4
  "description": "In-process file upload security for Node.js — no cloud API, no daemon, no data egress. TypeScript-first library with Express, Next.js, NestJS, Fastify, Koa, and Nuxt/Nitro adapters. Features magic-byte MIME validation, ZIP bomb protection, YARA integration, and layered heuristic scanning. Built for privacy-sensitive and self-hosted environments.",
5
5
  "main": "./dist/pompelmi.cjs",
6
6
  "module": "./dist/pompelmi.esm.js",