@houtini/seo-crawler-mcp 2.1.4 → 2.1.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +26 -1
- package/package.json +1 -1
- package/server.json +36 -36
- package/houtini-logo.jpg +0 -0
package/README.md
CHANGED
|
@@ -5,12 +5,21 @@
|
|
|
5
5
|
[](https://github.com/houtini-ai/seo-crawler-mcp/actions)
|
|
6
6
|
[](https://www.typescriptlang.org/)
|
|
7
7
|
[](https://www.npmjs.com/package/@houtini/seo-crawler-mcp)
|
|
8
|
-
[](https://snyk.io/test/github/houtini-ai/seo-crawler-mcp)
|
|
9
8
|
[](https://registry.modelcontextprotocol.io)
|
|
10
9
|
[](https://opensource.org/licenses/Apache-2.0)
|
|
11
10
|
|
|
12
11
|
**Crawl and analyse your website for errors and issues that probably affect your site's SEO**
|
|
13
12
|
|
|
13
|
+
<p align="center">
|
|
14
|
+
<a href="https://glama.ai/mcp/servers/@houtini-ai/seo-crawler-mcp">
|
|
15
|
+
<img width="380" height="200" src="https://glama.ai/mcp/servers/@houtini-ai/seo-crawler-mcp/badge" alt="SEO Crawler MCP server" />
|
|
16
|
+
</a>
|
|
17
|
+
</p>
|
|
18
|
+
|
|
19
|
+
> **Quick Navigation**
|
|
20
|
+
>
|
|
21
|
+
> [Installation](#installation) | [CLI mode](#cli-mode-terminal-usage) | [How to use](#how-to-use-this) | [What gets detected](#what-gets-detected) | [Data storage](#data-storage) | [Performance](#performance) | [Tools reference](#mcp-tools-reference) | [Available queries](#available-queries)
|
|
22
|
+
|
|
14
23
|
I wanted to build on my experience working with the MCP protocol SDK to see just how far we can extend an AI assistant's capabilities. I decided that I'd quite like to build a crawler to check my site's "technical SEO" health and came across Crawlee - which seemed like the ideal library to base the crawl component of my MCP.
|
|
15
24
|
|
|
16
25
|
What's interesting is that MCP usually indicates a server connection of some sort. This is not so with SEO Crawler MCP. The MCP protocol is probably more powerful than I realised - this is a self-contained application wrapped in the MCP SDK that handles everything locally:
|
|
@@ -69,6 +78,22 @@ Restart Claude Desktop. Four tools will be available:
|
|
|
69
78
|
- `seo-crawler-mcp:query_seo_data`
|
|
70
79
|
- `seo-crawler-mcp:list_seo_queries`
|
|
71
80
|
|
|
81
|
+
### Claude Code (CLI)
|
|
82
|
+
|
|
83
|
+
Claude Code uses a different registration mechanism -- it doesn't read `claude_desktop_config.json`. Use `claude mcp add` instead:
|
|
84
|
+
|
|
85
|
+
```bash
|
|
86
|
+
claude mcp add -e OUTPUT_DIR=/path/to/seo-audits -s user seo-crawler-mcp -- npx -y @houtini/seo-crawler-mcp
|
|
87
|
+
```
|
|
88
|
+
|
|
89
|
+
Verify with:
|
|
90
|
+
|
|
91
|
+
```bash
|
|
92
|
+
claude mcp get seo-crawler-mcp
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
You should see `Status: Connected`.
|
|
96
|
+
|
|
72
97
|
### Development Install
|
|
73
98
|
|
|
74
99
|
```bash
|
package/package.json
CHANGED
package/server.json
CHANGED
|
@@ -1,36 +1,36 @@
|
|
|
1
|
-
{
|
|
2
|
-
"$schema": "https://static.modelcontextprotocol.io/schemas/2025-12-11/server.schema.json",
|
|
3
|
-
"name": "io.github.houtini-ai/seo-crawler-mcp",
|
|
4
|
-
"description": "Crawl and analyse websites for SEO errors using Crawlee with SQLite storage",
|
|
5
|
-
"repository": {
|
|
6
|
-
"url": "https://github.com/houtini-ai/seo-crawler-mcp",
|
|
7
|
-
"source": "github"
|
|
8
|
-
},
|
|
9
|
-
"version": "2.1.
|
|
10
|
-
"packages": [
|
|
11
|
-
{
|
|
12
|
-
"registryType": "npm",
|
|
13
|
-
"identifier": "@houtini/seo-crawler-mcp",
|
|
14
|
-
"version": "2.1.
|
|
15
|
-
"transport": {
|
|
16
|
-
"type": "stdio"
|
|
17
|
-
},
|
|
18
|
-
"environmentVariables": [
|
|
19
|
-
{
|
|
20
|
-
"name": "OUTPUT_DIR",
|
|
21
|
-
"description": "Directory where crawl results are saved",
|
|
22
|
-
"isRequired": true,
|
|
23
|
-
"format": "string",
|
|
24
|
-
"isSecret": false
|
|
25
|
-
},
|
|
26
|
-
{
|
|
27
|
-
"name": "DEBUG",
|
|
28
|
-
"description": "Enable verbose debug logging (set to 'true' to enable)",
|
|
29
|
-
"isRequired": false,
|
|
30
|
-
"format": "string",
|
|
31
|
-
"isSecret": false
|
|
32
|
-
}
|
|
33
|
-
]
|
|
34
|
-
}
|
|
35
|
-
]
|
|
36
|
-
}
|
|
1
|
+
{
|
|
2
|
+
"$schema": "https://static.modelcontextprotocol.io/schemas/2025-12-11/server.schema.json",
|
|
3
|
+
"name": "io.github.houtini-ai/seo-crawler-mcp",
|
|
4
|
+
"description": "Crawl and analyse websites for SEO errors using Crawlee with SQLite storage",
|
|
5
|
+
"repository": {
|
|
6
|
+
"url": "https://github.com/houtini-ai/seo-crawler-mcp",
|
|
7
|
+
"source": "github"
|
|
8
|
+
},
|
|
9
|
+
"version": "2.1.5",
|
|
10
|
+
"packages": [
|
|
11
|
+
{
|
|
12
|
+
"registryType": "npm",
|
|
13
|
+
"identifier": "@houtini/seo-crawler-mcp",
|
|
14
|
+
"version": "2.1.5",
|
|
15
|
+
"transport": {
|
|
16
|
+
"type": "stdio"
|
|
17
|
+
},
|
|
18
|
+
"environmentVariables": [
|
|
19
|
+
{
|
|
20
|
+
"name": "OUTPUT_DIR",
|
|
21
|
+
"description": "Directory where crawl results are saved",
|
|
22
|
+
"isRequired": true,
|
|
23
|
+
"format": "string",
|
|
24
|
+
"isSecret": false
|
|
25
|
+
},
|
|
26
|
+
{
|
|
27
|
+
"name": "DEBUG",
|
|
28
|
+
"description": "Enable verbose debug logging (set to 'true' to enable)",
|
|
29
|
+
"isRequired": false,
|
|
30
|
+
"format": "string",
|
|
31
|
+
"isSecret": false
|
|
32
|
+
}
|
|
33
|
+
]
|
|
34
|
+
}
|
|
35
|
+
]
|
|
36
|
+
}
|
package/houtini-logo.jpg
DELETED
|
Binary file
|