@just-every/mcp-read-website-fast 0.1.3 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (3) hide show
  1. package/README.md +44 -3
  2. package/package.json +8 -1
  3. package/tsconfig.json +0 -24
package/README.md CHANGED
@@ -1,12 +1,27 @@
1
- # MCP Read JustEvery Website
1
+ # @just-every/mcp-read-website-fast
2
+
3
+ Fast, token-efficient web content extraction for AI agents - converts websites to clean Markdown.
4
+
5
+ [![npm version](https://badge.fury.io/js/@just-every%2Fmcp-read-website-fast.svg)](https://www.npmjs.com/package/@just-every/mcp-read-website-fast)
6
+ [![GitHub Actions](https://github.com/just-every/mcp-read-website-fast/workflows/Release/badge.svg)](https://github.com/just-every/mcp-read-website-fast/actions)
7
+
8
+ ## Overview
2
9
 
3
10
  Existing MCP web crawlers are slow and consume large quantities of tokens. This pauses the development process and provides incomplete results as LLMs need to parse whole web pages.
4
11
 
5
12
  This MCP package fetches web pages locally, strips noise, and converts content to clean Markdown while preserving links. Designed for Claude Code, IDEs and LLM pipelines with minimal token footprint. Crawl sites locally with minimal dependencies.
6
13
 
7
- ## MCP Server Configuration
14
+ ## Features
8
15
 
9
- This tool can be used as an MCP (Model Context Protocol) server with Claude Desktop, Cursor, VS Code, and other compatible clients.
16
+ - **Fast startup** using official MCP SDK with lazy loading for optimal performance
17
+ - **Content extraction** using Mozilla Readability (same as Firefox Reader View)
18
+ - **HTML to Markdown** conversion with Turndown + GFM support
19
+ - **Smart caching** with SHA-256 hashed URLs
20
+ - **Polite crawling** with robots.txt support and rate limiting
21
+ - **Concurrent fetching** with configurable depth crawling
22
+ - **Stream-first design** for low memory usage
23
+ - **Link preservation** for knowledge graphs
24
+ - **Optional chunking** for downstream processing
10
25
 
11
26
  ## Installation
12
27
 
@@ -160,6 +175,32 @@ npm run typecheck
160
175
  npm run lint
161
176
  ```
162
177
 
178
+ ## Contributing
179
+
180
+ Contributions are welcome! Please:
181
+
182
+ 1. Fork the repository
183
+ 2. Create a feature branch
184
+ 3. Add tests for new functionality
185
+ 4. Submit a pull request
186
+
187
+ ## Troubleshooting
188
+
189
+ ### Cache Issues
190
+ ```bash
191
+ npm run dev clear-cache
192
+ ```
193
+
194
+ ### Timeout Errors
195
+ - Increase timeout with `-t` flag
196
+ - Check network connectivity
197
+ - Verify URL is accessible
198
+
199
+ ### Content Not Extracted
200
+ - Some sites block automated access
201
+ - Try custom user agent with `-u` flag
202
+ - Check if site requires JavaScript (not supported)
203
+
163
204
  ## License
164
205
 
165
206
  MIT
package/package.json CHANGED
@@ -1,11 +1,18 @@
1
1
  {
2
2
  "name": "@just-every/mcp-read-website-fast",
3
- "version": "0.1.3",
3
+ "version": "0.1.5",
4
4
  "description": "Markdown Content Preprocessor - Fetch web pages, extract content, convert to clean Markdown",
5
5
  "main": "dist/index.js",
6
6
  "bin": {
7
7
  "mcp-read-website-fast": "bin/mcp-read-website.js"
8
8
  },
9
+ "files": [
10
+ "dist/**/*",
11
+ "bin/**/*",
12
+ "README.md",
13
+ "LICENSE",
14
+ "package.json"
15
+ ],
9
16
  "scripts": {
10
17
  "build": "tsc -p tsconfig.prod.json",
11
18
  "build:dev": "tsc",
package/tsconfig.json DELETED
@@ -1,24 +0,0 @@
1
- {
2
- "compilerOptions": {
3
- "target": "ES2022",
4
- "module": "NodeNext",
5
- "moduleResolution": "NodeNext",
6
- "lib": ["ES2022"],
7
- "outDir": "./dist",
8
- "rootDir": "./src",
9
- "strict": true,
10
- "esModuleInterop": true,
11
- "skipLibCheck": true,
12
- "forceConsistentCasingInFileNames": true,
13
- "resolveJsonModule": true,
14
- "declaration": true,
15
- "declarationMap": true,
16
- "sourceMap": true,
17
- "noUnusedLocals": true,
18
- "noUnusedParameters": true,
19
- "noImplicitReturns": true,
20
- "noFallthroughCasesInSwitch": true
21
- },
22
- "include": ["src/**/*"],
23
- "exclude": ["node_modules", "dist", "src/test-*.ts", "src/serve-*.ts", "src/trace-*.ts"]
24
- }