@kryptosai/mcp-observatory 0.22.0 → 0.24.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (62) hide show
  1. package/COMMERCIAL.md +5 -3
  2. package/PRIVACY.md +5 -2
  3. package/README.md +28 -13
  4. package/dist/src/cli.js +1 -1
  5. package/dist/src/cli.js.map +1 -1
  6. package/dist/src/commands/init-ci.d.ts +3 -0
  7. package/dist/src/commands/init-ci.js +26 -9
  8. package/dist/src/commands/init-ci.js.map +1 -1
  9. package/dist/src/commercial.js +2 -2
  10. package/dist/src/commercial.js.map +1 -1
  11. package/dist/src/reporters/pr-comment.js +6 -2
  12. package/dist/src/reporters/pr-comment.js.map +1 -1
  13. package/dist/src/score.js +1 -1
  14. package/dist/src/score.js.map +1 -1
  15. package/dist/src/validate.js +58 -3
  16. package/dist/src/validate.js.map +1 -1
  17. package/docs/certification-campaign-template.md +10 -10
  18. package/docs/certification-distribution.md +16 -0
  19. package/docs/directory-listing-copy.md +12 -5
  20. package/docs/distribution-launch.md +5 -5
  21. package/docs/enterprise-outreach-playbook.md +2 -2
  22. package/docs/mcp-lock-files.md +63 -0
  23. package/docs/mcp-safety-report-latest.md +16 -7
  24. package/docs/mcp-security-field-guide.md +97 -0
  25. package/docs/mcp-server-safety-index.md +61 -0
  26. package/docs/methodology.md +90 -0
  27. package/docs/metrics-dashboard.md +105 -0
  28. package/docs/paid-pilot-offer.md +74 -0
  29. package/docs/project-case-study.md +77 -43
  30. package/docs/proof.md +42 -12
  31. package/docs/public-post-drafts.md +98 -0
  32. package/docs/publish-readiness.md +6 -4
  33. package/docs/reference-evaluations.md +134 -0
  34. package/docs/safety-index/artifacts/antv-chart-server.json +2765 -0
  35. package/docs/safety-index/artifacts/antv-chart-server.md +156 -0
  36. package/docs/safety-index/artifacts/browsermcp-server.json +416 -0
  37. package/docs/safety-index/artifacts/browsermcp-server.md +163 -0
  38. package/docs/safety-index/artifacts/context7-server.json +286 -0
  39. package/docs/safety-index/artifacts/context7-server.md +163 -0
  40. package/docs/safety-index/artifacts/everything-server.json +482 -0
  41. package/docs/safety-index/artifacts/everything-server.md +163 -0
  42. package/docs/safety-index/artifacts/executeautomation-playwright-server.json +955 -0
  43. package/docs/safety-index/artifacts/executeautomation-playwright-server.md +163 -0
  44. package/docs/safety-index/artifacts/filesystem-server.json +583 -0
  45. package/docs/safety-index/artifacts/filesystem-server.md +156 -0
  46. package/docs/safety-index/artifacts/memory-server.json +469 -0
  47. package/docs/safety-index/artifacts/memory-server.md +156 -0
  48. package/docs/safety-index/artifacts/opentofu-server.json +387 -0
  49. package/docs/safety-index/artifacts/opentofu-server.md +163 -0
  50. package/docs/safety-index/artifacts/playwright-mcp-server.json +919 -0
  51. package/docs/safety-index/artifacts/playwright-mcp-server.md +156 -0
  52. package/docs/safety-index/artifacts/promptopia-server.json +442 -0
  53. package/docs/safety-index/artifacts/promptopia-server.md +156 -0
  54. package/docs/safety-index/artifacts/puppeteer-server.json +377 -0
  55. package/docs/safety-index/artifacts/puppeteer-server.md +163 -0
  56. package/docs/safety-index/artifacts/ref-tools-server.json +262 -0
  57. package/docs/safety-index/artifacts/ref-tools-server.md +156 -0
  58. package/docs/safety-index/artifacts/sequential-thinking-server.json +286 -0
  59. package/docs/safety-index/artifacts/sequential-thinking-server.md +156 -0
  60. package/docs/safety-index/maintainer-note-template.md +25 -0
  61. package/docs/safety-index/targets.json +192 -0
  62. package/package.json +17 -13
@@ -0,0 +1,134 @@
1
+ # MCP Observatory Reference Evaluations
2
+
3
+ Reference evaluations show how MCP Observatory applies to common MCP server categories. These are public, safe examples intended to help maintainers and security reviewers understand what the tool checks and what kind of risk each category can expose.
4
+
5
+ The examples below are not customer claims. They are public evaluation targets, public pull requests, or category examples that can be reproduced with the CLI.
6
+
7
+ ## Official MCP Reference Servers
8
+
9
+ Representative repo: [`modelcontextprotocol/servers`](https://github.com/modelcontextprotocol/servers)
10
+
11
+ Public proof:
12
+
13
+ - PR: [`modelcontextprotocol/servers#4392`](https://github.com/modelcontextprotocol/servers/pull/4392)
14
+ - Status: open, mergeable, with a passing MCP Observatory check as of June 19, 2026
15
+
16
+ What this represents:
17
+
18
+ - reference MCP implementations
19
+ - simple tools that should behave predictably in CI
20
+ - a good baseline for model context protocol testing
21
+
22
+ What Observatory checks:
23
+
24
+ - server startup in GitHub Actions
25
+ - tools list/respond correctly
26
+ - schema quality and security scan output
27
+ - report generation for maintainers
28
+
29
+ Adoption command:
30
+
31
+ ```bash
32
+ npx @kryptosai/mcp-observatory init-ci --all --command "npx -y @modelcontextprotocol/server-sequential-thinking"
33
+ ```
34
+
35
+ ## Browser Automation MCP Servers
36
+
37
+ Representative public examples:
38
+
39
+ - [`microsoft/playwright-mcp`](https://github.com/microsoft/playwright-mcp)
40
+ - [`executeautomation/mcp-playwright`](https://github.com/executeautomation/mcp-playwright)
41
+
42
+ Public proof:
43
+
44
+ - PR: [`microsoft/playwright-mcp#1657`](https://github.com/microsoft/playwright-mcp/pull/1657)
45
+ - PR: [`executeautomation/mcp-playwright#225`](https://github.com/executeautomation/mcp-playwright/pull/225)
46
+
47
+ What this represents:
48
+
49
+ - high-capability browser tools
50
+ - agent access to pages, scripts, navigation, screenshots, and user-like actions
51
+ - a category where secure tool invocation and explicit trust boundaries matter
52
+
53
+ What Observatory checks:
54
+
55
+ - tool inventory
56
+ - schema quality
57
+ - risky browser/code-execution surfaces
58
+ - intentional suppressions for known acceptable findings
59
+ - whether deep invocation should be skipped for tools that can mutate browser state
60
+
61
+ Adoption command:
62
+
63
+ ```bash
64
+ npx @kryptosai/mcp-observatory test --security npx -y @playwright/mcp
65
+ ```
66
+
67
+ ## Filesystem MCP Servers
68
+
69
+ Representative public category: filesystem-backed MCP servers.
70
+
71
+ Public proof:
72
+
73
+ - PR: [`cyanheads/filesystem-mcp-server#19`](https://github.com/cyanheads/filesystem-mcp-server/pull/19), closed unmerged by maintainer
74
+
75
+ What this represents:
76
+
77
+ - local file access exposed to agents
78
+ - read/write boundaries that should be explicit
79
+ - capability declarations that need to match observed MCP behavior
80
+
81
+ What Observatory checks:
82
+
83
+ - tools/resources capability consistency
84
+ - broad filesystem access findings
85
+ - schema quality for path-oriented tools
86
+ - safe sandbox target configuration for CI
87
+
88
+ Adoption command:
89
+
90
+ ```bash
91
+ npx @kryptosai/mcp-observatory test --security npx -y filesystem-mcp-server .
92
+ ```
93
+
94
+ Use a harmless temporary directory for CI checks when evaluating filesystem servers.
95
+
96
+ ## Documentation And Search MCP Servers
97
+
98
+ Representative public example: [`upstash/context7`](https://github.com/upstash/context7)
99
+
100
+ Public proof:
101
+
102
+ - PR: [`upstash/context7#2800`](https://github.com/upstash/context7/pull/2800)
103
+
104
+ What this represents:
105
+
106
+ - documentation retrieval and search tools
107
+ - untrusted or fast-changing text entering an agent context
108
+ - a category where prompt-injection-aware review matters
109
+
110
+ What Observatory checks:
111
+
112
+ - tool inventory
113
+ - schema quality
114
+ - startup reliability
115
+ - security findings around broad retrieval or response behavior
116
+ - report artifacts that maintainers can review in pull requests
117
+
118
+ Adoption command:
119
+
120
+ ```bash
121
+ npx @kryptosai/mcp-observatory init-ci --all --command "npx -y @upstash/context7-mcp"
122
+ ```
123
+
124
+ ## How To Read These Evaluations
125
+
126
+ Passing an Observatory check means the server passed the configured compatibility and security checks for that run. It does not mean the server is universally safe for every environment.
127
+
128
+ Use the results as an engineering control:
129
+
130
+ - add CI for repeatability
131
+ - compare artifacts between releases
132
+ - review security findings and suppressions
133
+ - document accepted risk for broad tools
134
+ - escalate production/private usage to hosted reporting, certification, or fleet visibility when the server becomes operationally important