npm - @intentsolutionsio/penetration-tester - Versions diffs - 2.0.0 → 3.0.4 - Mend

@intentsolutionsio/penetration-tester 2.0.0 → 3.0.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (112) hide show

package/skills/auditing-npm-dependencies/references/PLAYBOOK.md ADDED Viewed

@@ -0,0 +1,175 @@
+# PLAYBOOK — Remediating npm Findings
+## Decision flow
+```
+                npm audit finding
+                       |
+                       v
+            +-------------------+
+            | DIRECT or         |
+            | TRANSITIVE?       |
+            +---+-----------+---+
+                |           |
+        DIRECT  |           |  TRANSITIVE
+                v           v
+       +----------+   +-----------------+
+       | semver-  |   | parent has new  |
+       | minor    |   | version with    |
+       | fix?     |   | floor above fix?|
+       +-+----+---+   +--+----------+---+
+         |    |          |          |
+       YES   NO        YES        NO
+         |    |          |          |
+         v    v          v          v
+    audit  manual      bump      overrides
+    fix    review     parent     block
+```
+## Per-runtime remediation patterns
+### Frontend bundler (webpack / vite / next.js)
+CVEs in build-time-only deps (loaders, transformers, dev plugins)
+are lower priority than CVEs in runtime deps that ship in the
+bundle.
+- Build-time only: file ticket, fix in next sprint.
+- Runtime in bundle: block release.
+- Use `npm audit --omit=dev` to focus on runtime; this skill's
+  `--include-dev` flag toggles the inverse.
+### Node server (Express, Fastify, Hapi)
+Every advisory affects a long-running process. Treat all of them
+as runtime, including those marked dev-only — `eslint` etc. usually
+DON'T ship to production, but if they're in your Docker image at
+runtime, they do.
+Inspect your `Dockerfile`. If it copies `node_modules` whole, every
+dep ships. If it uses `npm ci --omit=dev`, devDeps are pruned.
+### Electron desktop app
+Electron apps bundle Chromium + Node into a binary distributed to
+end-users. A vulnerable dep ships to every user. Patches require a
+full app release; you cannot ship a runtime hotpatch. CRITICAL
+findings require an emergency release.
+### AWS Lambda / serverless function
+Each function invocation loads the dependency tree. The blast radius
+of a compromised dep is whatever the Lambda's IAM role grants. Audit
+the IAM role alongside the dep audit — a compromised `request` package
+in a Lambda with `s3:*` permissions is an exfiltration vector to
+every object in your S3 buckets.
+### Monorepo (Nx / Turborepo / pnpm workspaces)
+Audit each workspace separately. `npm audit --workspaces` exists in
+npm 7+ but produces output keyed by workspace and is awkward to
+parse. This skill currently scans one package.json at a time; for
+monorepos, iterate over the workspace globs in your top-level
+`package.json` and run the scanner per package.
+## Override-block templates
+### Single-package override
+```json
+{
+  "overrides": {
+    "minimist": "^1.2.6"
+  }
+}
+```
+Use when one transitive package needs a floor and the rest of the
+parent's deps are fine.
+### Nested override (specific parent only)
+```json
+{
+  "overrides": {
+    "express": {
+      "qs": "^6.10.3"
+    }
+  }
+}
+```
+Force `qs@^6.10.3` only when pulled in through `express` — leave
+other consumers of `qs` untouched. Use when an override breaks an
+unrelated package and you need surgical control.
+### Wildcard override (rare)
+```json
+{
+  "overrides": {
+    "lodash": "$lodash"
+  }
+}
+```
+`$lodash` resolves to the version declared in your `dependencies`.
+Use to enforce that every transitive use of `lodash` matches your
+declared version. Reserve for cases where transitive duplication is
+causing real bundle-size or behavior problems.
+## Provider rotation procedures
+Vulnerabilities sometimes overlap credential leaks: a CVE in a
+package that handles auth tokens may incidentally expose tokens.
+After fixing the dep, check whether the vulnerable version logged
+or transmitted secrets in a way that requires rotation.
+| Provider | Rotation surface |
+|---|---|
+| GitHub | Settings → Developer settings → Personal access tokens → Regenerate |
+| npm | `npm token revoke <id>` + `npm token create` |
+| AWS | IAM → Access keys → Deactivate → Delete → New access key |
+| Stripe | Dashboard → Developers → API keys → Roll key |
+| Sentry | Settings → Auth Tokens → Revoke + Create |
+## GitHub Dependabot integration
+Dependabot opens automatic PRs when GitHub's vulnerability database
+flags a finding. This skill is complementary, not redundant:
+- Dependabot is the long-running watcher.
+- This skill is the PR-time gate that ensures Dependabot's findings
+  haven't accumulated unreviewed.
+In CI:
+```yaml
+- name: npm audit
+  run: |
+    python3 plugins/security/penetration-tester/skills/auditing-npm-dependencies/scripts/audit_npm.py . \
+        --min-severity high --format markdown --output npm-audit.md
+- name: Comment on PR
+  if: github.event_name == 'pull_request'
+  uses: marocchino/sticky-pull-request-comment@v2
+  with:
+    path: npm-audit.md
+```
+## SOC2 evidence retention
+For Trust Service Category CC7 (System Operations) and CC8 (Change
+Management), retain npm audit output as evidence that dependency
+vulnerabilities are tracked:
+```bash
+mkdir -p evidence/CC7/
+python3 ./scripts/audit_npm.py . --include-dev --no-cache \
+    --format json \
+    --output evidence/CC7/npm-audit-$(date +%Y%m%d).json
+```
+Keep at least one audit per quarter. Auditors will ask for evidence
+that you ran the audit, that findings were triaged, and that
+remediation timing matched your published vulnerability-response
+policy.

package/skills/auditing-npm-dependencies/references/THEORY.md ADDED Viewed

@@ -0,0 +1,122 @@
+# THEORY — Why npm Dependency Audits Matter
+## The shape of the problem
+A modern Node application is mostly other people's code. A typical
+React + Next.js app installs ~30 direct dependencies and ends up
+with ~1,500 packages in `node_modules` after npm resolves the
+transitive closure. Every one of those packages can:
+1. Ship a known CVE that gets disclosed after you installed.
+2. Be hijacked through maintainer-account takeover (the npm registry
+   has had several published cases since 2017).
+3. Be replaced by a typosquatted near-name package that someone
+   slipped into a lockfile during a careless `npm install`.
+Because npm packages execute arbitrary JavaScript on install
+(`postinstall` scripts), a single compromised package in your
+transitive tree can read environment variables, exfiltrate secrets,
+and pivot to other systems with no user interaction beyond the
+install.
+## Historical npm supply-chain compromises
+These are the cases that drove the industry to treat dependency
+auditing as a CI gate, not optional polish.
+| Year | Event | Mechanism |
+|---|---|---|
+| 2018 | `event-stream` | Maintainer transferred to a new account that injected wallet-stealing code into a transitive dep (`flatmap-stream`). Affected Bitcoin wallet users. |
+| 2021 | `ua-parser-js` | Maintainer account compromise. Crypto miner shipped to ~8M weekly downloads. |
+| 2022 | `node-ipc` | Maintainer-installed protestware that targeted users in specific geographies. |
+| 2022 | `colors.js` / `faker.js` | Maintainer self-sabotage; broke production builds globally for the affected versions. |
+| 2024 | `lottie-player` | Cloudflare CDN compromise distributed wallet-drainer code. |
+Each case ended with the same remediation: roll back to a known-good
+version range, audit logs for exploitation evidence, publish an
+incident postmortem. The presence of an audit gate would not have
+prevented the original install (the malicious code was new), but
+would have surfaced the compromise within hours of disclosure.
+## Direct vs transitive: the remediation diff
+A CVE in a direct dependency you require in `package.json` is
+fixable by upgrading the version range. A CVE in a transitive
+dependency requires either (a) a parent bump that pulls in the fix
+or (b) an `overrides` block in your root `package.json`.
+The `overrides` mechanism was introduced in npm 8.3 (December 2021).
+Before that, only Yarn's `resolutions` field offered the equivalent
+escape hatch. `overrides` lets you force the resolved version of a
+package regardless of what the dependency graph requested:
+```json
+{
+  "overrides": {
+    "minimist": "^1.2.6"
+  }
+}
+```
+This is safe when the override pins to a SemVer-compatible version
+of the package the parent expected. It is RISKY when the override
+crosses a major-version boundary — the parent may rely on removed
+APIs and break at runtime. The audit script tags transitive findings
+with a `relationship: transitive` evidence field to flag them for
+the operator's manual review.
+## npm audit output schema diff (v1 vs v2)
+npm 6 emitted findings under an `advisories` key, one per advisory.
+npm 7+ rewrote the output to a `vulnerabilities` key, keyed by
+package name, with the per-package record summarizing all advisories
+affecting that package.
+| Aspect | v1 (npm 6) | v2 (npm 7+) |
+|---|---|---|
+| Top-level key | `advisories` | `vulnerabilities` |
+| Keyed by | Advisory ID | Package name |
+| Direct vs transitive | implicit | explicit via `via` field |
+| Fix metadata | `patched_versions` string | `fixAvailable` object or boolean |
+| CVE field | `cves` array | per-advisory `cve` field in `via[]` |
+This skill's parser handles both shapes. Practically, most modern
+projects run npm 8+; the v1 parser is kept for legacy CI runners
+and engineering laptops that haven't upgraded.
+## Why severity normalization matters
+Different tools use different severity vocabularies. npm uses
+`info/low/moderate/high/critical`. CVSS uses numeric scores.
+GitHub uses 4 levels. PyPA uses something else again. The
+penetration-tester `Severity` enum is the canonical mapping target
+so downstream consumers (SOC2 evidence packages, security
+dashboards, executive summary reports) see one vocabulary across
+every tool. The `Severity.from_npm_audit` classmethod in
+`lib/finding.py` does the npm-specific mapping.
+## When `npm audit fix` is dangerous
+`npm audit fix` rewrites `package-lock.json` to the closest
+non-breaking versions that resolve advisories. Two failure modes:
+1. **Semver-major fix required.** npm refuses to auto-apply and
+   warns "requires manual review." This is correct behavior; do not
+   pass `--force` casually. A major version bump can break the
+   parent's API contract.
+2. **Lockfile churn on shared CI runners.** If the lockfile rewrite
+   happens in a CI step that doesn't commit back, the next CI run
+   sees the same vulnerabilities and re-applies the fix, producing
+   a stable but undocumented divergence between developer workstations
+   and CI. Either commit the fix in CI (with appropriate guards) or
+   require the fix to land via human PR.
+## When to use the CISA KEV list
+NIST's NVD scoring (CVSS) reflects intrinsic severity. CISA's Known
+Exploited Vulnerabilities (KEV) catalog reflects observed exploitation
+in the wild. A high-CVSS finding with no KEV listing is concerning;
+a medium-CVSS finding WITH KEV listing is more concerning because
+attackers are actively using it. The skill currently emits CVSS-derived
+severity only; KEV enrichment is a planned addition via the
+`mcp__pen-tester-cve` MCP server when wired.

package/skills/auditing-npm-dependencies/scripts/audit_npm.py ADDED Viewed

@@ -0,0 +1,408 @@
+#!/usr/bin/env python3
+"""auditing-npm-dependencies — wrap `npm audit --json` into canonical Findings.
+Walks a Node.js project, runs `npm audit --json` in the target directory, parses
+both the v1 (npm 6) and v2 (npm 7+) audit output shapes, and emits Findings via
+lib/finding.py. Output formats (json/jsonl/markdown) and exit-code semantics are
+shared with the rest of the penetration-tester v3 pack via lib/report.py.
+The scanner deduplicates per-CVE across direct + transitive dependency paths,
+classifies each finding as direct vs transitive (impacts remediation strategy),
+and maps npm's severity vocabulary (info/low/moderate/high/critical) onto the
+shared Severity enum.
+Usage:
+    python3 audit_npm.py PATH [--output FILE] [--format json|jsonl|markdown]
+                              [--min-severity sev] [--include-dev] [--no-cache]
+                              [--json-only]
+"""
+from __future__ import annotations
+import argparse
+import json
+import shutil
+import subprocess
+import sys
+from pathlib import Path
+from typing import Any
+# --- Make lib/ importable regardless of CWD ----------------------------------
+_LIB_ROOT = Path(__file__).resolve().parents[3]
+sys.path.insert(0, str(_LIB_ROOT))
+from lib.finding import Finding, Severity  # noqa: E402
+from lib import report  # noqa: E402
+SKILL_ID = "auditing-npm-dependencies"
+CATEGORY = "dependency-vulnerability"
+CWE_DEFAULT = "CWE-1104"  # Use of unmaintained / vulnerable third-party component
+# --- npm invocation ----------------------------------------------------------
+def _npm_present() -> bool:
+    """Probe for an npm binary on PATH."""
+    return shutil.which("npm") is not None
+def _project_is_node(directory: Path) -> bool:
+    """A directory is a node project if it has package.json at the root."""
+    return (directory / "package.json").exists()
+def _run_npm_audit(directory: Path, include_dev: bool, no_cache: bool) -> tuple[dict[str, Any] | None, str]:
+    """Run `npm audit --json` in directory, return (parsed_json_or_None, raw_stdout).
+    Returns parsed dict on success, None when audit failed or output was not JSON.
+    The raw stdout is always returned so the caller can attach it to an INFO
+    Finding for debugging.
+    """
+    cmd: list[str] = ["npm", "audit", "--json"]
+    if not include_dev:
+        cmd.append("--omit=dev")
+    if no_cache:
+        # npm 8+ honors --no-audit; older npm ignored. Combined with cache flag.
+        cmd.append("--no-fund")
+    try:
+        proc = subprocess.run(  # noqa: S603 — npm is the audit tool
+            cmd,
+            cwd=str(directory),
+            capture_output=True,
+            text=True,
+            timeout=180,
+            check=False,
+        )
+    except subprocess.TimeoutExpired:
+        return None, "npm audit timed out after 180s"
+    except FileNotFoundError:
+        return None, "npm binary not found"
+    stdout = proc.stdout or ""
+    # npm audit exits non-zero when vulns are found — that's expected; we
+    # parse the JSON regardless.
+    try:
+        data = json.loads(stdout)
+    except json.JSONDecodeError:
+        return None, stdout
+    return data, stdout
+# --- npm v2 schema parser (npm 7+) -------------------------------------------
+def _parse_v2(data: dict[str, Any], target_label: str) -> list[Finding]:
+    """Parse npm 7+ audit output. Schema is `vulnerabilities.<pkg> → record`.
+    Direct vs transitive distinction comes from `record.via`: if `via` contains
+    string entries (other package names), this is a transitive vulnerability
+    surfaced through those parents. If `via` contains dict entries, the package
+    itself is the source (direct vuln in the named package).
+    """
+    findings: list[Finding] = []
+    vulns: dict[str, Any] = data.get("vulnerabilities", {}) or {}
+    for pkg_name, record in vulns.items():
+        severity_str = str(record.get("severity", "info"))
+        severity = Severity.from_npm_audit(severity_str)
+        via = record.get("via", [])
+        is_direct = any(isinstance(item, dict) for item in via)
+        source_label = "direct" if is_direct else "transitive"
+        # Extract CVE / GHSA from any dict entries in `via`.
+        cve_id: str | None = None
+        ghsa_id: str | None = None
+        title_summary: str | None = None
+        urls: list[str] = []
+        for item in via:
+            if not isinstance(item, dict):
+                continue
+            raw_source = item.get("source")
+            # npm audit v2 sometimes emits `source` as a list, sometimes as a
+            # scalar (string advisory ID or integer advisory ID). Normalize.
+            source_list: list[Any] = []
+            if isinstance(raw_source, list):
+                source_list = raw_source
+            elif raw_source is not None:
+                source_list = [raw_source]
+            for src in source_list:
+                if isinstance(src, str) and src.upper().startswith("CVE-"):
+                    cve_id = cve_id or src
+                if isinstance(src, str) and src.upper().startswith("GHSA-"):
+                    ghsa_id = ghsa_id or src
+            cve_id = cve_id or item.get("cve") or None
+            # When source is itself a GHSA string, surface it as ghsa_id.
+            source_str = raw_source if isinstance(raw_source, str) else None
+            ghsa_id = ghsa_id or item.get("ghsa") or source_str or None
+            title_summary = title_summary or item.get("title")
+            url = item.get("url")
+            if url:
+                urls.append(url)
+        affected_range = str(record.get("range", "unknown"))
+        fix_available = record.get("fixAvailable", False)
+        if isinstance(fix_available, dict):
+            fix_version_str = f"{fix_available.get('name', pkg_name)}@{fix_available.get('version', '?')}"
+        elif fix_available is True:
+            fix_version_str = "non-breaking upgrade available (npm audit fix)"
+        else:
+            fix_version_str = "no fix available"
+        title = title_summary or f"npm vulnerability in {pkg_name} ({severity_str})"
+        detail_lines = [
+            f"Affected package: {pkg_name}",
+            f"Affected range: {affected_range}",
+            f"Dependency relationship: {source_label}",
+            f"npm severity: {severity_str}",
+        ]
+        if cve_id:
+            detail_lines.append(f"CVE: {cve_id}")
+        if ghsa_id:
+            detail_lines.append(f"GHSA: {ghsa_id}")
+        if is_direct:
+            remediation = (
+                f"1. Run `npm audit fix` in {target_label}.\n"
+                f"2. If fix requires a semver-major bump, evaluate breaking changes "
+                f"and decide whether to upgrade.\n"
+                f"3. Commit the updated package-lock.json."
+            )
+        else:
+            remediation = (
+                f"1. Run `npm ls {pkg_name}` to identify which parent(s) pull in "
+                f"the vulnerable version.\n"
+                f"2. Check whether upgrading the parent picks up the fix.\n"
+                f"3. If not, add a root-level `overrides` block for {pkg_name} "
+                f"pinning to the fix version (requires npm 8.3+)."
+            )
+        if fix_version_str.startswith("no fix"):
+            remediation += (
+                "\n\nNO FIX AVAILABLE — subscribe to GHSA notifications and "
+                "consider vendoring + patching or replacing the package."
+            )
+        evidence_items: list[tuple[str, Any]] = [
+            ("package", pkg_name),
+            ("range", affected_range),
+            ("relationship", source_label),
+            ("fix", fix_version_str),
+        ]
+        if cve_id:
+            evidence_items.append(("cve", cve_id))
+        if ghsa_id:
+            evidence_items.append(("ghsa", ghsa_id))
+        # Bump severity to HIGH if no fix and originally moderate+ — operator
+        # has limited remediation surface.
+        if fix_version_str.startswith("no fix") and severity.numeric >= 3:
+            severity = max(severity, Severity.HIGH, key=lambda s: s.numeric)
+        findings.append(
+            Finding(
+                skill_id=SKILL_ID,
+                title=title,
+                severity=severity,
+                target=f"{target_label}::{pkg_name}",
+                detail="\n".join(detail_lines),
+                remediation=remediation,
+                cve_id=cve_id,
+                cwe_id=CWE_DEFAULT,
+                references=tuple(urls or []),
+                evidence=tuple(evidence_items),
+            )
+        )
+    return findings
+# --- npm v1 schema parser (npm 6) --------------------------------------------
+def _parse_v1(data: dict[str, Any], target_label: str) -> list[Finding]:
+    """Parse npm 6 audit output. Schema is `advisories.<id> → record`."""
+    findings: list[Finding] = []
+    advisories: dict[str, Any] = data.get("advisories", {}) or {}
+    for adv_id, record in advisories.items():
+        severity_str = str(record.get("severity", "info"))
+        severity = Severity.from_npm_audit(severity_str)
+        pkg_name = record.get("module_name", "<unknown>")
+        affected_range = record.get("vulnerable_versions", "unknown")
+        patched_versions = record.get("patched_versions", "")
+        cves = record.get("cves") or []
+        cve_id = cves[0] if cves else None
+        title = record.get("title") or f"npm advisory {adv_id} in {pkg_name}"
+        url = record.get("url", "")
+        detail_lines = [
+            f"Affected package: {pkg_name}",
+            f"Affected versions: {affected_range}",
+            f"npm severity: {severity_str}",
+            f"Advisory ID: {adv_id}",
+        ]
+        if cve_id:
+            detail_lines.append(f"CVE: {cve_id}")
+        remediation = (
+            f"1. Upgrade {pkg_name} to a version matching `{patched_versions}`.\n"
+            "2. Run `npm install` to refresh package-lock.json.\n"
+            "3. Re-run audit to confirm the finding is resolved."
+        )
+        findings.append(
+            Finding(
+                skill_id=SKILL_ID,
+                title=title,
+                severity=severity,
+                target=f"{target_label}::{pkg_name}",
+                detail="\n".join(detail_lines),
+                remediation=remediation,
+                cve_id=cve_id,
+                cwe_id=CWE_DEFAULT,
+                references=(url,) if url else (),
+                evidence=(
+                    ("package", pkg_name),
+                    ("affected", affected_range),
+                    ("patched", patched_versions),
+                    ("advisory_id", adv_id),
+                ),
+            )
+        )
+    return findings
+# --- Operational helpers -----------------------------------------------------
+def _info_finding(title: str, detail: str, target: str) -> Finding:
+    return Finding(
+        skill_id=SKILL_ID,
+        title=title,
+        severity=Severity.INFO,
+        target=target,
+        detail=detail,
+        remediation="Operational issue; no security action required.",
+        references=(),
+        evidence=(),
+    )
+def audit_directory(directory: Path, include_dev: bool, no_cache: bool, json_only: bool) -> tuple[list[Finding], str]:
+    """Run an audit, returning (findings, raw_stdout)."""
+    if not _npm_present():
+        return [
+            _info_finding(
+                "npm not installed",
+                "The npm binary was not found on PATH; cannot run audit.",
+                str(directory),
+            )
+        ], ""
+    if not _project_is_node(directory):
+        return [
+            _info_finding(
+                "target is not a Node project",
+                f"No package.json at {directory}; skipping npm audit.",
+                str(directory),
+            )
+        ], ""
+    data, raw = _run_npm_audit(directory, include_dev, no_cache)
+    if json_only:
+        sys.stdout.write(raw)
+        return [], raw
+    if data is None:
+        return [
+            _info_finding(
+                "npm audit returned non-JSON output",
+                f"Raw stdout (first 500 chars): {raw[:500]}",
+                str(directory),
+            )
+        ], raw
+    target_label = directory.name or str(directory)
+    if "vulnerabilities" in data:
+        findings = _parse_v2(data, target_label)
+    elif "advisories" in data:
+        findings = _parse_v1(data, target_label)
+    else:
+        findings = []
+    if not findings:
+        findings = [
+            _info_finding(
+                "no npm vulnerabilities found",
+                "npm audit reported a clean dependency tree.",
+                str(directory),
+            )
+        ]
+    return findings, raw
+# --- CLI ---------------------------------------------------------------------
+def _build_arg_parser() -> argparse.ArgumentParser:
+    p = argparse.ArgumentParser(description=__doc__.split("\n")[0])
+    p.add_argument("path", help="Path to Node project root (contains package.json)")
+    p.add_argument("--output", default=None, help="Write findings to FILE (default: stdout)")
+    p.add_argument(
+        "--format",
+        default="markdown",
+        choices=["json", "jsonl", "markdown"],
+        help="Output format (default: markdown)",
+    )
+    p.add_argument(
+        "--min-severity",
+        default="info",
+        choices=["info", "low", "medium", "high", "critical"],
+        help="Filter out findings below this severity (default: info — emit all)",
+    )
+    p.add_argument(
+        "--include-dev",
+        action="store_true",
+        help="Audit devDependencies too (default: prod only)",
+    )
+    p.add_argument(
+        "--no-cache",
+        action="store_true",
+        help="Disable npm fund cache hint (slower; fresher data)",
+    )
+    p.add_argument(
+        "--json-only",
+        action="store_true",
+        help="Print raw npm audit --json and exit (debug)",
+    )
+    return p
+def _filter_min_severity(findings: list[Finding], min_sev: str) -> list[Finding]:
+    floor = Severity(min_sev).numeric
+    return [f for f in findings if f.severity.numeric >= floor]
+def main(argv: list[str] | None = None) -> int:
+    args = _build_arg_parser().parse_args(argv)
+    directory = Path(args.path).resolve()
+    findings, _raw = audit_directory(
+        directory,
+        include_dev=args.include_dev,
+        no_cache=args.no_cache,
+        json_only=args.json_only,
+    )
+    if args.json_only:
+        return 0
+    findings = _filter_min_severity(findings, args.min_severity)
+    report.emit(findings, args.output, args.format, scan_target=str(directory))
+    return report.exit_code(findings)
+if __name__ == "__main__":
+    sys.exit(main())