@kryptosai/mcp-observatory 0.22.0 → 0.24.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/COMMERCIAL.md +5 -3
- package/PRIVACY.md +5 -2
- package/README.md +28 -13
- package/dist/src/cli.js +1 -1
- package/dist/src/cli.js.map +1 -1
- package/dist/src/commands/init-ci.d.ts +3 -0
- package/dist/src/commands/init-ci.js +26 -9
- package/dist/src/commands/init-ci.js.map +1 -1
- package/dist/src/commercial.js +2 -2
- package/dist/src/commercial.js.map +1 -1
- package/dist/src/reporters/pr-comment.js +6 -2
- package/dist/src/reporters/pr-comment.js.map +1 -1
- package/dist/src/score.js +1 -1
- package/dist/src/score.js.map +1 -1
- package/dist/src/validate.js +58 -3
- package/dist/src/validate.js.map +1 -1
- package/docs/certification-campaign-template.md +10 -10
- package/docs/certification-distribution.md +16 -0
- package/docs/directory-listing-copy.md +12 -5
- package/docs/distribution-launch.md +5 -5
- package/docs/enterprise-outreach-playbook.md +2 -2
- package/docs/mcp-lock-files.md +63 -0
- package/docs/mcp-safety-report-latest.md +16 -7
- package/docs/mcp-security-field-guide.md +97 -0
- package/docs/mcp-server-safety-index.md +61 -0
- package/docs/methodology.md +90 -0
- package/docs/metrics-dashboard.md +105 -0
- package/docs/paid-pilot-offer.md +74 -0
- package/docs/project-case-study.md +77 -43
- package/docs/proof.md +42 -12
- package/docs/public-post-drafts.md +98 -0
- package/docs/publish-readiness.md +6 -4
- package/docs/reference-evaluations.md +134 -0
- package/docs/safety-index/artifacts/antv-chart-server.json +2765 -0
- package/docs/safety-index/artifacts/antv-chart-server.md +156 -0
- package/docs/safety-index/artifacts/browsermcp-server.json +416 -0
- package/docs/safety-index/artifacts/browsermcp-server.md +163 -0
- package/docs/safety-index/artifacts/context7-server.json +286 -0
- package/docs/safety-index/artifacts/context7-server.md +163 -0
- package/docs/safety-index/artifacts/everything-server.json +482 -0
- package/docs/safety-index/artifacts/everything-server.md +163 -0
- package/docs/safety-index/artifacts/executeautomation-playwright-server.json +955 -0
- package/docs/safety-index/artifacts/executeautomation-playwright-server.md +163 -0
- package/docs/safety-index/artifacts/filesystem-server.json +583 -0
- package/docs/safety-index/artifacts/filesystem-server.md +156 -0
- package/docs/safety-index/artifacts/memory-server.json +469 -0
- package/docs/safety-index/artifacts/memory-server.md +156 -0
- package/docs/safety-index/artifacts/opentofu-server.json +387 -0
- package/docs/safety-index/artifacts/opentofu-server.md +163 -0
- package/docs/safety-index/artifacts/playwright-mcp-server.json +919 -0
- package/docs/safety-index/artifacts/playwright-mcp-server.md +156 -0
- package/docs/safety-index/artifacts/promptopia-server.json +442 -0
- package/docs/safety-index/artifacts/promptopia-server.md +156 -0
- package/docs/safety-index/artifacts/puppeteer-server.json +377 -0
- package/docs/safety-index/artifacts/puppeteer-server.md +163 -0
- package/docs/safety-index/artifacts/ref-tools-server.json +262 -0
- package/docs/safety-index/artifacts/ref-tools-server.md +156 -0
- package/docs/safety-index/artifacts/sequential-thinking-server.json +286 -0
- package/docs/safety-index/artifacts/sequential-thinking-server.md +156 -0
- package/docs/safety-index/maintainer-note-template.md +25 -0
- package/docs/safety-index/targets.json +192 -0
- package/package.json +17 -13
|
@@ -0,0 +1,134 @@
|
|
|
1
|
+
# MCP Observatory Reference Evaluations
|
|
2
|
+
|
|
3
|
+
Reference evaluations show how MCP Observatory applies to common MCP server categories. These are public, safe examples intended to help maintainers and security reviewers understand what the tool checks and what kind of risk each category can expose.
|
|
4
|
+
|
|
5
|
+
The examples below are not customer claims. They are public evaluation targets, public pull requests, or category examples that can be reproduced with the CLI.
|
|
6
|
+
|
|
7
|
+
## Official MCP Reference Servers
|
|
8
|
+
|
|
9
|
+
Representative repo: [`modelcontextprotocol/servers`](https://github.com/modelcontextprotocol/servers)
|
|
10
|
+
|
|
11
|
+
Public proof:
|
|
12
|
+
|
|
13
|
+
- PR: [`modelcontextprotocol/servers#4392`](https://github.com/modelcontextprotocol/servers/pull/4392)
|
|
14
|
+
- Status: open, mergeable, with a passing MCP Observatory check as of June 19, 2026
|
|
15
|
+
|
|
16
|
+
What this represents:
|
|
17
|
+
|
|
18
|
+
- reference MCP implementations
|
|
19
|
+
- simple tools that should behave predictably in CI
|
|
20
|
+
- a good baseline for model context protocol testing
|
|
21
|
+
|
|
22
|
+
What Observatory checks:
|
|
23
|
+
|
|
24
|
+
- server startup in GitHub Actions
|
|
25
|
+
- tools list/respond correctly
|
|
26
|
+
- schema quality and security scan output
|
|
27
|
+
- report generation for maintainers
|
|
28
|
+
|
|
29
|
+
Adoption command:
|
|
30
|
+
|
|
31
|
+
```bash
|
|
32
|
+
npx @kryptosai/mcp-observatory init-ci --all --command "npx -y @modelcontextprotocol/server-sequential-thinking"
|
|
33
|
+
```
|
|
34
|
+
|
|
35
|
+
## Browser Automation MCP Servers
|
|
36
|
+
|
|
37
|
+
Representative public examples:
|
|
38
|
+
|
|
39
|
+
- [`microsoft/playwright-mcp`](https://github.com/microsoft/playwright-mcp)
|
|
40
|
+
- [`executeautomation/mcp-playwright`](https://github.com/executeautomation/mcp-playwright)
|
|
41
|
+
|
|
42
|
+
Public proof:
|
|
43
|
+
|
|
44
|
+
- PR: [`microsoft/playwright-mcp#1657`](https://github.com/microsoft/playwright-mcp/pull/1657)
|
|
45
|
+
- PR: [`executeautomation/mcp-playwright#225`](https://github.com/executeautomation/mcp-playwright/pull/225)
|
|
46
|
+
|
|
47
|
+
What this represents:
|
|
48
|
+
|
|
49
|
+
- high-capability browser tools
|
|
50
|
+
- agent access to pages, scripts, navigation, screenshots, and user-like actions
|
|
51
|
+
- a category where secure tool invocation and explicit trust boundaries matter
|
|
52
|
+
|
|
53
|
+
What Observatory checks:
|
|
54
|
+
|
|
55
|
+
- tool inventory
|
|
56
|
+
- schema quality
|
|
57
|
+
- risky browser/code-execution surfaces
|
|
58
|
+
- intentional suppressions for known acceptable findings
|
|
59
|
+
- whether deep invocation should be skipped for tools that can mutate browser state
|
|
60
|
+
|
|
61
|
+
Adoption command:
|
|
62
|
+
|
|
63
|
+
```bash
|
|
64
|
+
npx @kryptosai/mcp-observatory test --security npx -y @playwright/mcp
|
|
65
|
+
```
|
|
66
|
+
|
|
67
|
+
## Filesystem MCP Servers
|
|
68
|
+
|
|
69
|
+
Representative public category: filesystem-backed MCP servers.
|
|
70
|
+
|
|
71
|
+
Public proof:
|
|
72
|
+
|
|
73
|
+
- PR: [`cyanheads/filesystem-mcp-server#19`](https://github.com/cyanheads/filesystem-mcp-server/pull/19), closed unmerged by maintainer
|
|
74
|
+
|
|
75
|
+
What this represents:
|
|
76
|
+
|
|
77
|
+
- local file access exposed to agents
|
|
78
|
+
- read/write boundaries that should be explicit
|
|
79
|
+
- capability declarations that need to match observed MCP behavior
|
|
80
|
+
|
|
81
|
+
What Observatory checks:
|
|
82
|
+
|
|
83
|
+
- tools/resources capability consistency
|
|
84
|
+
- broad filesystem access findings
|
|
85
|
+
- schema quality for path-oriented tools
|
|
86
|
+
- safe sandbox target configuration for CI
|
|
87
|
+
|
|
88
|
+
Adoption command:
|
|
89
|
+
|
|
90
|
+
```bash
|
|
91
|
+
npx @kryptosai/mcp-observatory test --security npx -y filesystem-mcp-server .
|
|
92
|
+
```
|
|
93
|
+
|
|
94
|
+
Use a harmless temporary directory for CI checks when evaluating filesystem servers.
|
|
95
|
+
|
|
96
|
+
## Documentation And Search MCP Servers
|
|
97
|
+
|
|
98
|
+
Representative public example: [`upstash/context7`](https://github.com/upstash/context7)
|
|
99
|
+
|
|
100
|
+
Public proof:
|
|
101
|
+
|
|
102
|
+
- PR: [`upstash/context7#2800`](https://github.com/upstash/context7/pull/2800)
|
|
103
|
+
|
|
104
|
+
What this represents:
|
|
105
|
+
|
|
106
|
+
- documentation retrieval and search tools
|
|
107
|
+
- untrusted or fast-changing text entering an agent context
|
|
108
|
+
- a category where prompt-injection-aware review matters
|
|
109
|
+
|
|
110
|
+
What Observatory checks:
|
|
111
|
+
|
|
112
|
+
- tool inventory
|
|
113
|
+
- schema quality
|
|
114
|
+
- startup reliability
|
|
115
|
+
- security findings around broad retrieval or response behavior
|
|
116
|
+
- report artifacts that maintainers can review in pull requests
|
|
117
|
+
|
|
118
|
+
Adoption command:
|
|
119
|
+
|
|
120
|
+
```bash
|
|
121
|
+
npx @kryptosai/mcp-observatory init-ci --all --command "npx -y @upstash/context7-mcp"
|
|
122
|
+
```
|
|
123
|
+
|
|
124
|
+
## How To Read These Evaluations
|
|
125
|
+
|
|
126
|
+
Passing an Observatory check means the server passed the configured compatibility and security checks for that run. It does not mean the server is universally safe for every environment.
|
|
127
|
+
|
|
128
|
+
Use the results as an engineering control:
|
|
129
|
+
|
|
130
|
+
- add CI for repeatability
|
|
131
|
+
- compare artifacts between releases
|
|
132
|
+
- review security findings and suppressions
|
|
133
|
+
- document accepted risk for broad tools
|
|
134
|
+
- escalate production/private usage to hosted reporting, certification, or fleet visibility when the server becomes operationally important
|