@kryptosai/mcp-observatory 0.23.0 → 0.24.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (46) hide show
  1. package/README.md +8 -7
  2. package/dist/src/commands/init-ci.d.ts +3 -0
  3. package/dist/src/commands/init-ci.js +24 -12
  4. package/dist/src/commands/init-ci.js.map +1 -1
  5. package/dist/src/reporters/pr-comment.js +6 -2
  6. package/dist/src/reporters/pr-comment.js.map +1 -1
  7. package/docs/certification-campaign-template.md +2 -2
  8. package/docs/mcp-safety-report-latest.md +12 -7
  9. package/docs/mcp-server-safety-index.md +56 -80
  10. package/docs/methodology.md +90 -0
  11. package/docs/metrics-dashboard.md +105 -0
  12. package/docs/paid-pilot-offer.md +21 -5
  13. package/docs/project-case-study.md +12 -8
  14. package/docs/proof.md +28 -15
  15. package/docs/public-post-drafts.md +18 -6
  16. package/docs/publish-readiness.md +1 -5
  17. package/docs/reference-evaluations.md +1 -1
  18. package/docs/safety-index/artifacts/antv-chart-server.json +2765 -0
  19. package/docs/safety-index/artifacts/antv-chart-server.md +156 -0
  20. package/docs/safety-index/artifacts/browsermcp-server.json +416 -0
  21. package/docs/safety-index/artifacts/browsermcp-server.md +163 -0
  22. package/docs/safety-index/artifacts/context7-server.json +286 -0
  23. package/docs/safety-index/artifacts/context7-server.md +163 -0
  24. package/docs/safety-index/artifacts/everything-server.json +482 -0
  25. package/docs/safety-index/artifacts/everything-server.md +163 -0
  26. package/docs/safety-index/artifacts/executeautomation-playwright-server.json +955 -0
  27. package/docs/safety-index/artifacts/executeautomation-playwright-server.md +163 -0
  28. package/docs/safety-index/artifacts/filesystem-server.json +583 -0
  29. package/docs/safety-index/artifacts/filesystem-server.md +156 -0
  30. package/docs/safety-index/artifacts/memory-server.json +469 -0
  31. package/docs/safety-index/artifacts/memory-server.md +156 -0
  32. package/docs/safety-index/artifacts/opentofu-server.json +387 -0
  33. package/docs/safety-index/artifacts/opentofu-server.md +163 -0
  34. package/docs/safety-index/artifacts/playwright-mcp-server.json +919 -0
  35. package/docs/safety-index/artifacts/playwright-mcp-server.md +156 -0
  36. package/docs/safety-index/artifacts/promptopia-server.json +442 -0
  37. package/docs/safety-index/artifacts/promptopia-server.md +156 -0
  38. package/docs/safety-index/artifacts/puppeteer-server.json +377 -0
  39. package/docs/safety-index/artifacts/puppeteer-server.md +163 -0
  40. package/docs/safety-index/artifacts/ref-tools-server.json +262 -0
  41. package/docs/safety-index/artifacts/ref-tools-server.md +156 -0
  42. package/docs/safety-index/artifacts/sequential-thinking-server.json +286 -0
  43. package/docs/safety-index/artifacts/sequential-thinking-server.md +156 -0
  44. package/docs/safety-index/maintainer-note-template.md +25 -0
  45. package/docs/safety-index/targets.json +192 -0
  46. package/package.json +12 -9
@@ -0,0 +1,163 @@
1
+ # MCP Observatory Run Report
2
+
3
+ Generated at 2026-06-24T02:07:44.894Z
4
+
5
+ ## Target and Environment Metadata
6
+
7
+ - Target: `executeautomation-playwright-server`
8
+ - Adapter: `local-process`
9
+ - Command: `npx -y @executeautomation/playwright-mcp-server`
10
+ - Server: `playwright-mcp 1.0.11`
11
+ - Platform: `darwin 25.5.0`
12
+ - Node: `v22.22.1`
13
+
14
+ ## Executive Summary
15
+
16
+ **Health Score: 69/100 (D)**
17
+
18
+ | Dimension | Score | Weight |
19
+ | --- | --- | --- |
20
+ | Protocol Compliance | 100/100 | 30% |
21
+ | Schema Quality | 60/100 | 20% |
22
+ | Security | 0/100 | 20% |
23
+ | Reliability | 83/100 | 20% |
24
+ | Performance | 100/100 | 10% |
25
+
26
+ | Gate | Total | Pass | Fail | Partial | Unsupported | Flaky | Skipped |
27
+ | --- | --- | --- | --- | --- | --- | --- | --- |
28
+ | fail | 7 | 3 | 2 | 1 | 1 | 0 | 0 |
29
+
30
+ ## At a Glance
31
+
32
+ - Safety verdict: **Blocked** — One or more checks can break agent dependence and should be fixed before production use.
33
+ - Top risks: schema-quality: Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info.; security: Found 1 security finding(s): 1 high, 0 medium, 0 low.; security-lite: Found 1 security finding(s): 1 high, 0 medium, 0 low.
34
+ - Regression/schema drift: Run `mcp-observatory diff <previous-run.json> <current-run.json>` to classify regressions and schema drift.
35
+ - Failing checks: security-lite, security
36
+ - Partial or flaky checks: schema-quality
37
+ - Skipped checks: none
38
+ - Unsupported checks: prompts
39
+ - Suggested next step: Start with the failing checks: security-lite, security.
40
+ - CI next step: `Add CI: npx @kryptosai/mcp-observatory init-ci --all --command "npx -y <server-package>"`
41
+
42
+ ## Regressions and Recoveries
43
+
44
+ _Use the `diff` command against another run artifact to classify regressions and recoveries over time._
45
+
46
+ ## Full Capability Status Table
47
+
48
+ | Focus | Check | Status | Duration (ms) | Message |
49
+ | --- | --- | --- | --- | --- |
50
+ | healthy | conformance | pass | 7.25 | All 7 conformance checks passed. |
51
+ | healthy | resources | pass | 1.65 | Advertised capability responded with the minimal expected shape, but one optional resource endpoint appears unsupported. |
52
+ | healthy | tools | pass | 3.17 | Advertised capability responded with the minimal expected shape (33 items). |
53
+ | review | schema-quality | partial | 1.86 | Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info. |
54
+ | confirm intent | prompts | unsupported | 0.00 | Prompts are not advertised by the target. |
55
+ | act now | security | fail | 0.87 | Found 1 security finding(s): 1 high, 0 medium, 0 low. |
56
+ | act now | security-lite | fail | 0.48 | Found 1 security finding(s): 1 high, 0 medium, 0 low. |
57
+
58
+ ## Evidence Snippets
59
+
60
+ ### conformance — pass
61
+
62
+ Summary: All 7 conformance checks passed.
63
+
64
+ - Endpoint: `conformance/check`
65
+ - Advertised: `true`
66
+ - Responded: `true`
67
+ - Minimal shape present: `true`
68
+ - Item count: `7`
69
+ - Identifiers: none
70
+ - Diagnostics: [pass] capabilities-present: Server returned capabilities object., [pass] server-info: Server provided initialization info., [pass] tools-capability-match: tools/list returned 33 tool(s). (+4 more)
71
+
72
+ ### resources — pass
73
+
74
+ Summary: Advertised capability responded with the minimal expected shape, but one optional resource endpoint appears unsupported.
75
+
76
+ - Endpoint: `resources/list`
77
+ - Advertised: `true`
78
+ - Responded: `true`
79
+ - Minimal shape present: `true`
80
+ - Item count: `1`
81
+ - Identifiers: console://logs
82
+ - Diagnostics: none
83
+ - Endpoint: `resources/templates/list`
84
+ - Advertised: `true`
85
+ - Responded: `false`
86
+ - Minimal shape present: `false`
87
+ - Item count: `0`
88
+ - Identifiers: none
89
+ - Diagnostics: MCP error -32601: Method not found
90
+
91
+ ### tools — pass
92
+
93
+ Summary: Advertised capability responded with the minimal expected shape (33 items).
94
+
95
+ - Endpoint: `tools/list`
96
+ - Advertised: `true`
97
+ - Responded: `true`
98
+ - Minimal shape present: `true`
99
+ - Item count: `33`
100
+ - Identifiers: start_codegen_session, end_codegen_session, get_codegen_session, clear_codegen_session, playwright_navigate (+28 more)
101
+ - Diagnostics: none
102
+
103
+ ### schema-quality — partial
104
+
105
+ Summary: Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info.
106
+
107
+ - Endpoint: `schema-quality/scan`
108
+ - Advertised: `true`
109
+ - Responded: `true`
110
+ - Minimal shape present: `true`
111
+ - Item count: `1`
112
+ - Identifiers: Browser console logs
113
+ - Diagnostics: [warning] resource "Browser console logs": Missing description
114
+
115
+ ### prompts — unsupported
116
+
117
+ Summary: Prompts are not advertised by the target.
118
+
119
+ - Endpoint: `prompts/list`
120
+ - Advertised: `false`
121
+ - Responded: `false`
122
+ - Minimal shape present: `false`
123
+ - Item count: `0`
124
+ - Identifiers: none
125
+ - Diagnostics: none
126
+
127
+ ### security — fail
128
+
129
+ Summary: Found 1 security finding(s): 1 high, 0 medium, 0 low.
130
+
131
+ - Endpoint: `security/scan`
132
+ - Advertised: `true`
133
+ - Responded: `true`
134
+ - Minimal shape present: `true`
135
+ - Item count: `1`
136
+ - Identifiers: playwright_evaluate
137
+ - Diagnostics: [high] Tool "playwright_evaluate" has parameter "script" which may allow arbitrary command execution.
138
+
139
+ ### security-lite — fail
140
+
141
+ Summary: Found 1 security finding(s): 1 high, 0 medium, 0 low.
142
+
143
+ - Endpoint: `security/scan-lite`
144
+ - Advertised: `true`
145
+ - Responded: `true`
146
+ - Minimal shape present: `true`
147
+ - Item count: `1`
148
+ - Identifiers: playwright_evaluate
149
+ - Diagnostics: [high] Tool "playwright_evaluate" has parameter "script" which may allow arbitrary command execution.
150
+
151
+ ## Reproduction Commands
152
+
153
+ ```bash
154
+ npm run cli -- run --target <path-to-target-config.json>
155
+ npm run cli -- report --run <path-to-run-artifact.json> --format markdown
156
+ ```
157
+
158
+ ## Artifact Provenance
159
+
160
+ - Artifact type: `run`
161
+ - Schema version: `1.0.0`
162
+ - Run ID: `run_2026-06-24T020744894Z_ce4e4a75`
163
+ - Gate: `fail`