npm - aiseo-audit - Versions diffs - 1.0.0 - Mend

aiseo-audit 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026 aiseo-audit contributors
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

package/README.md ADDED Viewed

@@ -0,0 +1,269 @@
+# aiseo-audit
+[![npm version](https://img.shields.io/npm/v/aiseo-audit.svg)](https://www.npmjs.com/package/aiseo-audit)
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
+[![Node.js](https://img.shields.io/badge/node-%3E%3D20-brightgreen.svg)](https://nodejs.org)
+Deterministic CLI that audits web pages for **AI search readiness**. Think Lighthouse, but for how well AI engines can fetch, extract, understand, and cite your content.
+**AI SEO measures how reusable your content is for generative engines, not traditional search rankings.**
+## What is AI SEO?
+Traditional SEO optimizes for ranking in a list of links. **AI SEO** optimizes for being **cited** in generated answers. Different goal, different signals.
+When someone asks ChatGPT, Claude, Perplexity, or Gemini a question, those engines fetch web content, extract the useful parts, and decide what to cite. AI SEO (also called Generative Engine Optimization or GEO) is the practice of structuring your content so that process works in your favor. The foundational research behind this field comes from [Princeton's GEO paper](https://arxiv.org/abs/2311.09735), which identified the specific content traits that increase generative engine citations.
+aiseo-audit measures those signals: can the content be extracted? Is it structured for reuse? Does it contain the patterns AI engines actually quote? It runs entirely locally with no AI API calls and no external services.
+## How aiseo-audit Is Different
+Most "AI readiness" audits check whether certain files and tags exist. Does the site have llms.txt? Is there a sitemap? Is JSON-LD present? Those are binary checks that tell you very little about whether AI engines will actually use your content.
+aiseo-audit goes deeper:
+- **Content analysis, not just tag detection.** NLP-based entity extraction, readability scoring, answer capsule detection, section length analysis, and boilerplate measurement. 30+ factors across 7 research-backed categories.
+- **Research-grounded scoring.** Thresholds and weights are derived from published research on what generative engines actually cite. See [Audit Breakdown](docs/AUDIT_BREAKDOWN.md) for the full methodology and [Research](docs/RESEARCH.md) for where the data comes from.
+- **Configurable weights.** Prioritize the categories that matter to your content via `aiseo.config.json`. Zero vendor lock-in.
+- **Four output formats.** Pretty terminal, JSON, Markdown, and self-contained HTML reports.
+- **Zero external dependencies at runtime.** No API keys, no network calls beyond fetching the target URL. Fully deterministic.
+## Quick Start
+Run a one-off audit without installing:
+```bash
+npx aiseo-audit https://example.com
+```
+## Install
+```bash
+# As a project dependency
+npm install aiseo-audit
+# As a dev dependency
+npm install --save-dev aiseo-audit
+# Globally
+npm install -g aiseo-audit
+```
+## Usage
+```bash
+# Pretty terminal output (default)
+aiseo-audit https://example.com
+# JSON output
+aiseo-audit https://example.com --json
+# Markdown output
+aiseo-audit https://example.com --md
+# HTML report (Lighthouse-style)
+aiseo-audit https://example.com --html
+# Write output to a file (uses the selected format)
+aiseo-audit https://example.com --html --out report.html
+aiseo-audit https://example.com --md --out report.md
+aiseo-audit https://example.com --json --out report.json
+# CI/CD: fail if score below threshold
+aiseo-audit https://example.com --fail-under 70
+# Custom timeout
+aiseo-audit https://example.com --timeout 30000
+# Custom user agent
+aiseo-audit https://example.com --user-agent "MyBot/1.0"
+# Use config file
+aiseo-audit https://example.com --config aiseo.config.json
+```
+## CLI Options
+| Option              | Description                           | Default                |
+| ------------------- | ------------------------------------- | ---------------------- |
+| `<url>`             | URL to audit (required)               | -                      |
+| `--json`            | Output as JSON                        | -                      |
+| `--md`              | Output as Markdown                    | -                      |
+| `--html`            | Output as HTML                        | -                      |
+| `--out <path>`      | Write rendered output to a file       | -                      |
+| `--fail-under <n>`  | Exit with code 1 if score < threshold | -                      |
+| `--timeout <ms>`    | Request timeout in ms                 | `45000`                |
+| `--user-agent <ua>` | Custom User-Agent string              | `AISEOAudit/<version>` |
+| `--config <path>`   | Path to config file                   | -                      |
+If no output flag is given, the default is `pretty` (color-coded terminal output). The default format can also be set in the config file.
+## CI/CD
+```yaml
+# .github/workflows/aiseo-audit.yml
+name: AI SEO Audit
+on:
+  pull_request:
+  push:
+    branches: [main]
+jobs:
+  audit:
+    runs-on: ubuntu-latest
+    steps:
+      - run: npx aiseo-audit https://yoursite.com --fail-under 70
+```
+## User Agent
+By default, all HTTP requests (page fetch, `robots.txt`, `llms.txt`) are sent with the header `User-Agent: AISEOAudit/<version>`. This is intentional. If a site blocks unknown bots, that is a meaningful negative signal for AI search readiness, and the audit should surface it as a failing "Fetch Success" score.
+The `--user-agent` flag exists as an escape hatch for cases where you want to bypass bot detection and test the content independently of access policy. It does not change the audit logic, only what the server sees in the request header.
+## Audit Categories
+The audit evaluates 7 categories of AI search readiness (_[Detailed Breakdown here](docs/AUDIT_BREAKDOWN.md)_):
+| Category                        | What It Measures                                                                         |
+| ------------------------------- | ---------------------------------------------------------------------------------------- |
+| **Content Extractability**      | Can AI engines successfully fetch and extract meaningful text from the page?             |
+| **Content Structure for Reuse** | Is the content organized with headings, lists, and tables that engines can segment?      |
+| **Answerability**               | Does the content provide clear definitions, direct answers, and step-by-step patterns?   |
+| **Entity Clarity**              | Are named entities (people, orgs, places) clearly present and consistent with the topic? |
+| **Grounding Signals**           | Does the content cite external sources, include statistics, and attribute claims?        |
+| **Authority Context**           | Is there author attribution, organization identity, publish dates, and structured data?  |
+| **Readability for Compression** | Is the content written at a readability level that compresses well for AI summarization? |
+## Output Formats
+### Pretty (default)
+Color-coded terminal output with scores, factor breakdowns, and top recommendations. Best for quick checks during development.
+### JSON
+Full structured output with all scores, factor details, raw data, and recommendations. Best for integrations, CI/CD pipelines, and programmatic consumption.
+### Markdown
+Structured report with category tables, factor details, and recommendations grouped by category. Best for documentation, PRs, and sharing.
+### HTML
+Self-contained single-file report with SVG score gauges, color-coded sections, and recommendations grouped by category. Best for stakeholder reports and visual review.
+## Config File
+Create a config file in your project root to customize behavior. The CLI automatically discovers your config by searching from the current directory up to the filesystem root, looking for (in order):
+- `aiseo.config.json`
+- `.aiseo.config.json`
+- `aiseo-audit.config.json`
+You can also pass an explicit path with `--config path/to/config.json`.
+```json
+{
+  "timeout": 45000,
+  "format": "pretty",
+  "failUnder": 50,
+  "weights": {
+    "contentExtractability": 1,
+    "contentStructure": 1,
+    "answerability": 1,
+    "entityClarity": 1,
+    "groundingSignals": 1,
+    "authorityContext": 1,
+    "readabilityForCompression": 1
+  }
+}
+```
+Weights are relative. Set a category to `2` to double its importance, or `0` to exclude it.
+## Programmatic API
+```typescript
+import { analyzeUrl, loadConfig, renderReport } from "aiseo-audit";
+const config = await loadConfig();
+const result = await analyzeUrl(
+  { url: "https://example.com", timeout: 45000, userAgent: "MyApp/1.0" },
+  config,
+);
+console.log(result.overallScore); // 72
+console.log(result.grade); // "B-"
+// Render in any format
+const html = renderReport(result, { format: "html" });
+const md = renderReport(result, { format: "md" });
+const json = renderReport(result, { format: "json" });
+```
+### Exported Types
+```typescript
+import type {
+  AnalyzerResultType,
+  AnalyzerOptionsType,
+  AuditResultType,
+  CategoryNameType,
+  CategoryResultType,
+  FactorResultType,
+  RecommendationType,
+  ReportFormatType,
+  AiseoConfigType,
+} from "aiseo-audit";
+```
+## Philosophy
+This tool measures **AI search reusability**: how well a page's content can be fetched, extracted, understood, and reused by AI engines like ChatGPT, Claude, Perplexity, and Gemini.
+It is:
+- **Deterministic**: No AI API calls. Same URL produces the same score.
+- **Engine-agnostic**: Not optimized for any specific AI platform.
+- **Content-focused**: Analyzes what's on the page, not external signals.
+- **Lightweight**: Fast CLI with minimal dependencies.
+## Exit Codes
+| Code | Meaning                                         |
+| ---- | ----------------------------------------------- |
+| `0`  | Success                                         |
+| `1`  | Score below `--fail-under` threshold            |
+| `2`  | Runtime error (fetch failed, invalid URL, etc.) |
+## Compatibility Notes
+**Node.js** -- Requires Node 20 or later. The `engines` field in `package.json` enforces this. Earlier versions will produce runtime errors.
+**Zod** -- Uses [Zod 4](https://zod.dev). If you consume the library API and also use Zod in your project, ensure you are on Zod 4+ to avoid type incompatibilities.
+**CJS bin entry** -- The `bin/aiseo-audit.js` executable uses `require()` (CommonJS). This is compatible with all Node 20+ environments regardless of your project's module system. The library exports support both ESM (`import`) and CJS (`require`).
+**Config discovery** -- When using the programmatic API, `loadConfig()` searches for config files starting from `process.cwd()`. If your application's working directory differs from where your config file lives, pass an explicit path:
+```typescript
+const config = await loadConfig("/path/to/aiseo.config.json");
+```
+## Documentation
+- [Audit Breakdown](docs/AUDIT_BREAKDOWN.md) - Full scoring methodology, every factor, every threshold, with research citations
+- [Research](docs/RESEARCH.md) - Sources and gap analysis
+## Contributing
+See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, project structure, and pull request guidelines.
+## Releases
+Release notes are published on the [GitHub Releases](https://github.com/agencyenterprise/aiseo-audit/releases) page. A separate `CHANGELOG.md` is not maintained.
+## License
+MIT

package/bin/aiseo-audit.js ADDED Viewed

	@@ -0,0 +1,2 @@
1	+ #!/usr/bin/env node
2	+ require("../dist/cli.js");

package/dist/cli.d.mts ADDED Viewed

	@@ -0,0 +1,2 @@
1	+
2	+ export { }

package/dist/cli.d.ts ADDED Viewed

	@@ -0,0 +1,2 @@
1	+
2	+ export { }