npm - @kryptosai/mcp-observatory - Versions diffs - 0.23.0 → 0.24.0 - Mend

@kryptosai/mcp-observatory 0.23.0 → 0.24.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (46) hide show

package/README.md +8 -7
package/dist/src/commands/init-ci.d.ts +3 -0
package/dist/src/commands/init-ci.js +24 -12
package/dist/src/commands/init-ci.js.map +1 -1
package/dist/src/reporters/pr-comment.js +6 -2
package/dist/src/reporters/pr-comment.js.map +1 -1
package/docs/certification-campaign-template.md +2 -2
package/docs/mcp-safety-report-latest.md +12 -7
package/docs/mcp-server-safety-index.md +56 -80
package/docs/methodology.md +90 -0
package/docs/metrics-dashboard.md +105 -0
package/docs/paid-pilot-offer.md +21 -5
package/docs/project-case-study.md +12 -8
package/docs/proof.md +28 -15
package/docs/public-post-drafts.md +18 -6
package/docs/publish-readiness.md +1 -5
package/docs/reference-evaluations.md +1 -1
package/docs/safety-index/artifacts/antv-chart-server.json +2765 -0
package/docs/safety-index/artifacts/antv-chart-server.md +156 -0
package/docs/safety-index/artifacts/browsermcp-server.json +416 -0
package/docs/safety-index/artifacts/browsermcp-server.md +163 -0
package/docs/safety-index/artifacts/context7-server.json +286 -0
package/docs/safety-index/artifacts/context7-server.md +163 -0
package/docs/safety-index/artifacts/everything-server.json +482 -0
package/docs/safety-index/artifacts/everything-server.md +163 -0
package/docs/safety-index/artifacts/executeautomation-playwright-server.json +955 -0
package/docs/safety-index/artifacts/executeautomation-playwright-server.md +163 -0
package/docs/safety-index/artifacts/filesystem-server.json +583 -0
package/docs/safety-index/artifacts/filesystem-server.md +156 -0
package/docs/safety-index/artifacts/memory-server.json +469 -0
package/docs/safety-index/artifacts/memory-server.md +156 -0
package/docs/safety-index/artifacts/opentofu-server.json +387 -0
package/docs/safety-index/artifacts/opentofu-server.md +163 -0
package/docs/safety-index/artifacts/playwright-mcp-server.json +919 -0
package/docs/safety-index/artifacts/playwright-mcp-server.md +156 -0
package/docs/safety-index/artifacts/promptopia-server.json +442 -0
package/docs/safety-index/artifacts/promptopia-server.md +156 -0
package/docs/safety-index/artifacts/puppeteer-server.json +377 -0
package/docs/safety-index/artifacts/puppeteer-server.md +163 -0
package/docs/safety-index/artifacts/ref-tools-server.json +262 -0
package/docs/safety-index/artifacts/ref-tools-server.md +156 -0
package/docs/safety-index/artifacts/sequential-thinking-server.json +286 -0
package/docs/safety-index/artifacts/sequential-thinking-server.md +156 -0
package/docs/safety-index/maintainer-note-template.md +25 -0
package/docs/safety-index/targets.json +192 -0
package/package.json +12 -9

package/docs/safety-index/artifacts/executeautomation-playwright-server.md ADDED Viewed

@@ -0,0 +1,163 @@
+# MCP Observatory Run Report
+Generated at 2026-06-24T02:07:44.894Z
+## Target and Environment Metadata
+- Target: `executeautomation-playwright-server`
+- Adapter: `local-process`
+- Command: `npx -y @executeautomation/playwright-mcp-server`
+- Server: `playwright-mcp 1.0.11`
+- Platform: `darwin 25.5.0`
+- Node: `v22.22.1`
+## Executive Summary
+**Health Score: 69/100 (D)**
+| Dimension | Score | Weight |
+| --- | --- | --- |
+| Protocol Compliance | 100/100 | 30% |
+| Schema Quality | 60/100 | 20% |
+| Security | 0/100 | 20% |
+| Reliability | 83/100 | 20% |
+| Performance | 100/100 | 10% |
+| Gate | Total | Pass | Fail | Partial | Unsupported | Flaky | Skipped |
+| --- | --- | --- | --- | --- | --- | --- | --- |
+| fail | 7 | 3 | 2 | 1 | 1 | 0 | 0 |
+## At a Glance
+- Safety verdict: **Blocked** — One or more checks can break agent dependence and should be fixed before production use.
+- Top risks: schema-quality: Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info.; security: Found 1 security finding(s): 1 high, 0 medium, 0 low.; security-lite: Found 1 security finding(s): 1 high, 0 medium, 0 low.
+- Regression/schema drift: Run `mcp-observatory diff <previous-run.json> <current-run.json>` to classify regressions and schema drift.
+- Failing checks: security-lite, security
+- Partial or flaky checks: schema-quality
+- Skipped checks: none
+- Unsupported checks: prompts
+- Suggested next step: Start with the failing checks: security-lite, security.
+- CI next step: `Add CI: npx @kryptosai/mcp-observatory init-ci --all --command "npx -y <server-package>"`
+## Regressions and Recoveries
+_Use the `diff` command against another run artifact to classify regressions and recoveries over time._
+## Full Capability Status Table
+| Focus | Check | Status | Duration (ms) | Message |
+| --- | --- | --- | --- | --- |
+| healthy | conformance | pass | 7.25 | All 7 conformance checks passed. |
+| healthy | resources | pass | 1.65 | Advertised capability responded with the minimal expected shape, but one optional resource endpoint appears unsupported. |
+| healthy | tools | pass | 3.17 | Advertised capability responded with the minimal expected shape (33 items). |
+| review | schema-quality | partial | 1.86 | Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info. |
+| confirm intent | prompts | unsupported | 0.00 | Prompts are not advertised by the target. |
+| act now | security | fail | 0.87 | Found 1 security finding(s): 1 high, 0 medium, 0 low. |
+| act now | security-lite | fail | 0.48 | Found 1 security finding(s): 1 high, 0 medium, 0 low. |
+## Evidence Snippets
+### conformance — pass
+Summary: All 7 conformance checks passed.
+- Endpoint: `conformance/check`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `7`
+  - Identifiers: none
+  - Diagnostics: [pass] capabilities-present: Server returned capabilities object., [pass] server-info: Server provided initialization info., [pass] tools-capability-match: tools/list returned 33 tool(s). (+4 more)
+### resources — pass
+Summary: Advertised capability responded with the minimal expected shape, but one optional resource endpoint appears unsupported.
+- Endpoint: `resources/list`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `1`
+  - Identifiers: console://logs
+  - Diagnostics: none
+- Endpoint: `resources/templates/list`
+  - Advertised: `true`
+  - Responded: `false`
+  - Minimal shape present: `false`
+  - Item count: `0`
+  - Identifiers: none
+  - Diagnostics: MCP error -32601: Method not found
+### tools — pass
+Summary: Advertised capability responded with the minimal expected shape (33 items).
+- Endpoint: `tools/list`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `33`
+  - Identifiers: start_codegen_session, end_codegen_session, get_codegen_session, clear_codegen_session, playwright_navigate (+28 more)
+  - Diagnostics: none
+### schema-quality — partial
+Summary: Found 1 quality finding(s) across 34 item(s): 1 warnings, 0 info.
+- Endpoint: `schema-quality/scan`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `1`
+  - Identifiers: Browser console logs
+  - Diagnostics: [warning] resource "Browser console logs": Missing description
+### prompts — unsupported
+Summary: Prompts are not advertised by the target.
+- Endpoint: `prompts/list`
+  - Advertised: `false`
+  - Responded: `false`
+  - Minimal shape present: `false`
+  - Item count: `0`
+  - Identifiers: none
+  - Diagnostics: none
+### security — fail
+Summary: Found 1 security finding(s): 1 high, 0 medium, 0 low.
+- Endpoint: `security/scan`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `1`
+  - Identifiers: playwright_evaluate
+  - Diagnostics: [high] Tool "playwright_evaluate" has parameter "script" which may allow arbitrary command execution.
+### security-lite — fail
+Summary: Found 1 security finding(s): 1 high, 0 medium, 0 low.
+- Endpoint: `security/scan-lite`
+  - Advertised: `true`
+  - Responded: `true`
+  - Minimal shape present: `true`
+  - Item count: `1`
+  - Identifiers: playwright_evaluate
+  - Diagnostics: [high] Tool "playwright_evaluate" has parameter "script" which may allow arbitrary command execution.
+## Reproduction Commands
+```bash
+npm run cli -- run --target <path-to-target-config.json>
+npm run cli -- report --run <path-to-run-artifact.json> --format markdown
+```
+## Artifact Provenance
+- Artifact type: `run`
+- Schema version: `1.0.0`
+- Run ID: `run_2026-06-24T020744894Z_ce4e4a75`
+- Gate: `fail`