@cubis/foundry 0.3.34 → 0.3.36

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (594) hide show
  1. package/README.md +56 -11
  2. package/bin/cubis.js +2824 -966
  3. package/mcp/.dockerignore +5 -0
  4. package/mcp/Dockerfile +19 -0
  5. package/mcp/README.md +550 -11
  6. package/mcp/config.json +18 -0
  7. package/mcp/dist/index.d.ts +1 -0
  8. package/mcp/dist/index.js +1488 -0
  9. package/mcp/package.json +36 -0
  10. package/mcp/src/cbxConfig/index.ts +23 -0
  11. package/mcp/src/cbxConfig/paths.ts +45 -0
  12. package/mcp/src/cbxConfig/reader.ts +149 -0
  13. package/mcp/src/cbxConfig/readerWriter.test.ts +147 -0
  14. package/mcp/src/cbxConfig/serviceConfig.test.ts +42 -0
  15. package/mcp/src/cbxConfig/serviceConfig.ts +157 -0
  16. package/mcp/src/cbxConfig/types.ts +44 -0
  17. package/mcp/src/cbxConfig/writer.ts +90 -0
  18. package/mcp/src/config/index.test.ts +81 -0
  19. package/mcp/src/config/index.ts +45 -0
  20. package/mcp/src/config/schema.ts +55 -0
  21. package/mcp/src/gateway/catalog.ts +37 -0
  22. package/mcp/src/gateway/config.ts +247 -0
  23. package/mcp/src/gateway/manager.ts +255 -0
  24. package/mcp/src/gateway/types.ts +54 -0
  25. package/mcp/src/gateway/upstreamClient.ts +80 -0
  26. package/mcp/src/index.ts +211 -0
  27. package/mcp/src/server.ts +219 -0
  28. package/mcp/src/tools/configTools.test.ts +166 -0
  29. package/mcp/src/tools/future/README.md +21 -0
  30. package/mcp/src/tools/future/index.ts +8 -0
  31. package/mcp/src/tools/index.ts +76 -0
  32. package/mcp/src/tools/mcpGateway.ts +44 -0
  33. package/mcp/src/tools/postmanGetMode.ts +82 -0
  34. package/mcp/src/tools/postmanGetStatus.ts +65 -0
  35. package/mcp/src/tools/postmanModes.ts +28 -0
  36. package/mcp/src/tools/postmanSetMode.ts +71 -0
  37. package/mcp/src/tools/skillBrowseCategory.ts +54 -0
  38. package/mcp/src/tools/skillGet.ts +43 -0
  39. package/mcp/src/tools/skillListCategories.ts +40 -0
  40. package/mcp/src/tools/skillSearch.ts +72 -0
  41. package/mcp/src/tools/skillTools.test.ts +126 -0
  42. package/mcp/src/tools/stitchGetMode.ts +58 -0
  43. package/mcp/src/tools/stitchGetStatus.ts +70 -0
  44. package/mcp/src/tools/stitchSetProfile.ts +82 -0
  45. package/mcp/src/transports/stdio.ts +9 -0
  46. package/mcp/src/transports/streamableHttp.ts +43 -0
  47. package/mcp/src/upstream/passthrough.ts +273 -0
  48. package/mcp/src/utils/errors.ts +38 -0
  49. package/mcp/src/utils/logger.ts +63 -0
  50. package/mcp/src/vault/manifest.test.ts +126 -0
  51. package/mcp/src/vault/manifest.ts +99 -0
  52. package/mcp/src/vault/scanner.test.ts +70 -0
  53. package/mcp/src/vault/scanner.ts +145 -0
  54. package/mcp/src/vault/types.ts +21 -0
  55. package/package.json +9 -2
  56. package/workflows/scripts/generate-powers.mjs +27 -18
  57. package/workflows/skills/postman/SKILL.md +11 -12
  58. package/workflows/skills/skills_index.json +11 -93
  59. package/workflows/skills/stitch/SKILL.md +17 -0
  60. package/workflows/workflows/agent-environment-setup/manifest.json +0 -24
  61. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/nextjs-react-expert/SKILL.md +1 -1
  62. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/SKILL.md +9 -232
  63. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/skills_index.json +0 -102
  64. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/stitch/SKILL.md +17 -0
  65. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/nextjs-react-expert/SKILL.md +1 -1
  66. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/SKILL.md +9 -232
  67. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/skills_index.json +0 -102
  68. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/stitch/SKILL.md +17 -0
  69. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/nextjs-react-expert/SKILL.md +1 -1
  70. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/SKILL.md +9 -232
  71. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/skills_index.json +0 -102
  72. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/stitch/SKILL.md +17 -0
  73. package/mcp/catalogs/default.json +0 -14
  74. package/mcp/powers/atlassian-mcp/POWER.md +0 -91
  75. package/mcp/powers/atlassian-mcp/SKILL.md +0 -88
  76. package/mcp/powers/atlassian-mcp/steering/authentication-patterns.md +0 -520
  77. package/mcp/powers/atlassian-mcp/steering/common-workflows.md +0 -660
  78. package/mcp/powers/atlassian-mcp/steering/confluence-operations.md +0 -528
  79. package/mcp/powers/atlassian-mcp/steering/jira-queries.md +0 -421
  80. package/mcp/powers/atlassian-mcp/steering/mcp-server-setup.md +0 -292
  81. package/mcp/powers/building-mcp-server-on-cloudflare/POWER.md +0 -261
  82. package/mcp/powers/building-mcp-server-on-cloudflare/SKILL.md +0 -258
  83. package/mcp/powers/building-mcp-server-on-cloudflare/steering/examples.md +0 -115
  84. package/mcp/powers/building-mcp-server-on-cloudflare/steering/oauth-setup.md +0 -338
  85. package/mcp/powers/building-mcp-server-on-cloudflare/steering/troubleshooting.md +0 -317
  86. package/mcp/powers/datadog-automation/POWER.md +0 -238
  87. package/mcp/powers/datadog-automation/SKILL.md +0 -235
  88. package/mcp/powers/github-automation/POWER.md +0 -230
  89. package/mcp/powers/github-automation/SKILL.md +0 -227
  90. package/mcp/powers/mcp-builder/LICENSE.txt +0 -202
  91. package/mcp/powers/mcp-builder/POWER.md +0 -179
  92. package/mcp/powers/mcp-builder/SKILL.md +0 -176
  93. package/mcp/powers/mcp-builder/reference/evaluation.md +0 -602
  94. package/mcp/powers/mcp-builder/reference/mcp_best_practices.md +0 -915
  95. package/mcp/powers/mcp-builder/reference/node_mcp_server.md +0 -916
  96. package/mcp/powers/mcp-builder/reference/python_mcp_server.md +0 -752
  97. package/mcp/powers/mcp-builder/scripts/connections.py +0 -151
  98. package/mcp/powers/mcp-builder/scripts/evaluation.py +0 -373
  99. package/mcp/powers/mcp-builder/scripts/example_evaluation.xml +0 -22
  100. package/mcp/powers/mcp-builder/scripts/requirements.txt +0 -2
  101. package/mcp/powers/mcp-developer/POWER.md +0 -88
  102. package/mcp/powers/mcp-developer/SKILL.md +0 -85
  103. package/mcp/powers/mcp-developer/steering/protocol.md +0 -244
  104. package/mcp/powers/mcp-developer/steering/python-sdk.md +0 -367
  105. package/mcp/powers/mcp-developer/steering/resources.md +0 -554
  106. package/mcp/powers/mcp-developer/steering/tools.md +0 -480
  107. package/mcp/powers/mcp-developer/steering/typescript-sdk.md +0 -350
  108. package/mcp/powers/postman/POWER.md +0 -243
  109. package/mcp/powers/postman/SKILL.md +0 -240
  110. package/mcp/powers/postman/flutter-expert/POWER.md +0 -89
  111. package/mcp/powers/postman/flutter-expert/SKILL.md +0 -77
  112. package/mcp/powers/postman/flutter-expert/steering/bloc-state.md +0 -259
  113. package/mcp/powers/postman/flutter-expert/steering/engineering-principles.md +0 -82
  114. package/mcp/powers/postman/flutter-expert/steering/gorouter-navigation.md +0 -157
  115. package/mcp/powers/postman/flutter-expert/steering/performance.md +0 -99
  116. package/mcp/powers/postman/flutter-expert/steering/project-structure.md +0 -77
  117. package/mcp/powers/postman/flutter-expert/steering/riverpod-state.md +0 -404
  118. package/mcp/powers/postman/flutter-expert/steering/widget-patterns.md +0 -123
  119. package/mcp/powers/postman/flutter-security-reviewer/POWER.md +0 -204
  120. package/mcp/powers/postman/flutter-security-reviewer/SKILL.md +0 -180
  121. package/mcp/powers/postman/flutter-security-reviewer/steering/log_redaction.md +0 -17
  122. package/mcp/powers/postman/flutter-security-reviewer/steering/network_hardening.md +0 -13
  123. package/mcp/powers/postman/flutter-security-reviewer/steering/secret-scanning.md +0 -123
  124. package/mcp/powers/postman/flutter-security-reviewer/steering/secure_storage_policy.md +0 -16
  125. package/mcp/powers/postman/flutter-security-reviewer/steering/vulnerability-patterns.md +0 -189
  126. package/mcp/powers/postman/flutter-security-reviewer/templates/security_review_response.md +0 -31
  127. package/mcp/powers/postman/flutter-test-master/POWER.md +0 -527
  128. package/mcp/powers/postman/flutter-test-master/SKILL.md +0 -513
  129. package/mcp/powers/postman/flutter-test-master/steering/golden-testing.md +0 -444
  130. package/mcp/powers/postman/flutter-test-master/steering/integration-testing.md +0 -688
  131. package/mcp/powers/postman/flutter-test-master/steering/property-testing.md +0 -408
  132. package/mcp/powers/postman/flutter-test-master/steering/riverpod-testing.md +0 -606
  133. package/mcp/powers/postman/flutter-test-master/steering/test-utilities.md +0 -566
  134. package/mcp/powers/postman/flutter-test-master/steering/unit-testing.md +0 -511
  135. package/mcp/powers/postman/flutter-test-master/steering/widget-testing.md +0 -601
  136. package/mcp/powers/postman/git-commit/POWER.md +0 -47
  137. package/mcp/powers/postman/git-commit/SKILL.md +0 -46
  138. package/mcp/powers/postman/gorouter-restoration/POWER.md +0 -227
  139. package/mcp/powers/postman/gorouter-restoration/SKILL.md +0 -217
  140. package/mcp/powers/postman/gorouter-restoration/steering/guard_patterns.md +0 -14
  141. package/mcp/powers/postman/gorouter-restoration/steering/tab_preservation.md +0 -12
  142. package/mcp/powers/postman/gorouter-restoration/templates/guard_redirect.md +0 -30
  143. package/mcp/powers/postman/gorouter-restoration/templates/stateful_shell_skeleton.md +0 -40
  144. package/mcp/powers/postman/mcp.json +0 -11
  145. package/mcp/powers/postman/microservices-architect/POWER.md +0 -109
  146. package/mcp/powers/postman/microservices-architect/SKILL.md +0 -93
  147. package/mcp/powers/postman/microservices-architect/steering/communication.md +0 -499
  148. package/mcp/powers/postman/microservices-architect/steering/data.md +0 -721
  149. package/mcp/powers/postman/microservices-architect/steering/decomposition.md +0 -344
  150. package/mcp/powers/postman/microservices-architect/steering/observability.md +0 -805
  151. package/mcp/powers/postman/microservices-architect/steering/patterns.md +0 -603
  152. package/mcp/powers/postman/nestjs-expert/POWER.md +0 -130
  153. package/mcp/powers/postman/nestjs-expert/SKILL.md +0 -114
  154. package/mcp/powers/postman/nestjs-expert/steering/authentication.md +0 -461
  155. package/mcp/powers/postman/nestjs-expert/steering/controllers-routing.md +0 -114
  156. package/mcp/powers/postman/nestjs-expert/steering/dtos-validation.md +0 -156
  157. package/mcp/powers/postman/nestjs-expert/steering/migration-from-express.md +0 -1240
  158. package/mcp/powers/postman/nestjs-expert/steering/secure-sessions.md +0 -56
  159. package/mcp/powers/postman/nestjs-expert/steering/services-di.md +0 -143
  160. package/mcp/powers/postman/nestjs-expert/steering/strangler-fig-pattern.md +0 -47
  161. package/mcp/powers/postman/nestjs-expert/steering/testing-patterns.md +0 -189
  162. package/mcp/powers/postman/oneup-design/POWER.md +0 -72
  163. package/mcp/powers/postman/oneup-design/SKILL.md +0 -71
  164. package/mcp/powers/postman/oneup-design/steering/colors.md +0 -41
  165. package/mcp/powers/postman/oneup-design/steering/components.md +0 -53
  166. package/mcp/powers/postman/oneup-design/steering/spacing.md +0 -84
  167. package/mcp/powers/postman/openapi-docs/POWER.md +0 -78
  168. package/mcp/powers/postman/openapi-docs/SKILL.md +0 -68
  169. package/mcp/powers/postman/openapi-docs/steering/crud-templates.md +0 -43
  170. package/mcp/powers/postman/openapi-docs/steering/framework-express.md +0 -19
  171. package/mcp/powers/postman/openapi-docs/steering/framework-fastapi.md +0 -20
  172. package/mcp/powers/postman/openapi-docs/steering/framework-nestjs.md +0 -20
  173. package/mcp/powers/postman/openapi-docs/steering/framework-raw-openapi.md +0 -14
  174. package/mcp/powers/postman/openapi-docs/steering/openapi-checklist.md +0 -33
  175. package/mcp/powers/postman/openapi-docs/steering/stoplight-ui.md +0 -21
  176. package/mcp/powers/postman/steering/steering.md +0 -26
  177. package/mcp/powers/sentry-automation/POWER.md +0 -235
  178. package/mcp/powers/sentry-automation/SKILL.md +0 -232
  179. package/mcp/skills/atlassian-mcp/SKILL.md +0 -88
  180. package/mcp/skills/atlassian-mcp/references/authentication-patterns.md +0 -520
  181. package/mcp/skills/atlassian-mcp/references/common-workflows.md +0 -660
  182. package/mcp/skills/atlassian-mcp/references/confluence-operations.md +0 -528
  183. package/mcp/skills/atlassian-mcp/references/jira-queries.md +0 -421
  184. package/mcp/skills/atlassian-mcp/references/mcp-server-setup.md +0 -292
  185. package/mcp/skills/building-mcp-server-on-cloudflare/SKILL.md +0 -258
  186. package/mcp/skills/building-mcp-server-on-cloudflare/references/examples.md +0 -115
  187. package/mcp/skills/building-mcp-server-on-cloudflare/references/oauth-setup.md +0 -338
  188. package/mcp/skills/building-mcp-server-on-cloudflare/references/tool-patterns.md +0 -29
  189. package/mcp/skills/building-mcp-server-on-cloudflare/references/troubleshooting.md +0 -317
  190. package/mcp/skills/datadog-automation/SKILL.md +0 -235
  191. package/mcp/skills/github-automation/SKILL.md +0 -227
  192. package/mcp/skills/mcp-builder/LICENSE.txt +0 -202
  193. package/mcp/skills/mcp-builder/SKILL.md +0 -176
  194. package/mcp/skills/mcp-builder/reference/evaluation.md +0 -602
  195. package/mcp/skills/mcp-builder/reference/mcp_best_practices.md +0 -915
  196. package/mcp/skills/mcp-builder/reference/node_mcp_server.md +0 -916
  197. package/mcp/skills/mcp-builder/reference/python_mcp_server.md +0 -752
  198. package/mcp/skills/mcp-builder/scripts/connections.py +0 -151
  199. package/mcp/skills/mcp-builder/scripts/evaluation.py +0 -373
  200. package/mcp/skills/mcp-builder/scripts/example_evaluation.xml +0 -22
  201. package/mcp/skills/mcp-builder/scripts/requirements.txt +0 -2
  202. package/mcp/skills/mcp-developer/SKILL.md +0 -85
  203. package/mcp/skills/mcp-developer/references/protocol.md +0 -244
  204. package/mcp/skills/mcp-developer/references/python-sdk.md +0 -367
  205. package/mcp/skills/mcp-developer/references/resources.md +0 -554
  206. package/mcp/skills/mcp-developer/references/tools.md +0 -480
  207. package/mcp/skills/mcp-developer/references/typescript-sdk.md +0 -350
  208. package/mcp/skills/postman/POWER.md +0 -242
  209. package/mcp/skills/postman/SKILL.md +0 -240
  210. package/mcp/skills/postman/flutter-expert/POWER.md +0 -89
  211. package/mcp/skills/postman/flutter-expert/SKILL.md +0 -77
  212. package/mcp/skills/postman/flutter-expert/references/bloc-state.md +0 -259
  213. package/mcp/skills/postman/flutter-expert/references/engineering-principles.md +0 -82
  214. package/mcp/skills/postman/flutter-expert/references/gorouter-navigation.md +0 -157
  215. package/mcp/skills/postman/flutter-expert/references/performance.md +0 -99
  216. package/mcp/skills/postman/flutter-expert/references/project-structure.md +0 -77
  217. package/mcp/skills/postman/flutter-expert/references/riverpod-state.md +0 -404
  218. package/mcp/skills/postman/flutter-expert/references/widget-patterns.md +0 -123
  219. package/mcp/skills/postman/flutter-security-reviewer/POWER.md +0 -204
  220. package/mcp/skills/postman/flutter-security-reviewer/SKILL.md +0 -180
  221. package/mcp/skills/postman/flutter-security-reviewer/references/log_redaction.md +0 -17
  222. package/mcp/skills/postman/flutter-security-reviewer/references/network_hardening.md +0 -13
  223. package/mcp/skills/postman/flutter-security-reviewer/references/secret-scanning.md +0 -123
  224. package/mcp/skills/postman/flutter-security-reviewer/references/secure_storage_policy.md +0 -16
  225. package/mcp/skills/postman/flutter-security-reviewer/references/vulnerability-patterns.md +0 -189
  226. package/mcp/skills/postman/flutter-security-reviewer/templates/security_review_response.md +0 -31
  227. package/mcp/skills/postman/flutter-test-master/POWER.md +0 -527
  228. package/mcp/skills/postman/flutter-test-master/SKILL.md +0 -513
  229. package/mcp/skills/postman/flutter-test-master/references/golden-testing.md +0 -444
  230. package/mcp/skills/postman/flutter-test-master/references/integration-testing.md +0 -688
  231. package/mcp/skills/postman/flutter-test-master/references/property-testing.md +0 -408
  232. package/mcp/skills/postman/flutter-test-master/references/riverpod-testing.md +0 -606
  233. package/mcp/skills/postman/flutter-test-master/references/test-utilities.md +0 -566
  234. package/mcp/skills/postman/flutter-test-master/references/unit-testing.md +0 -511
  235. package/mcp/skills/postman/flutter-test-master/references/widget-testing.md +0 -601
  236. package/mcp/skills/postman/git-commit/POWER.md +0 -47
  237. package/mcp/skills/postman/git-commit/SKILL.md +0 -46
  238. package/mcp/skills/postman/gorouter-restoration/POWER.md +0 -227
  239. package/mcp/skills/postman/gorouter-restoration/SKILL.md +0 -217
  240. package/mcp/skills/postman/gorouter-restoration/references/guard_patterns.md +0 -14
  241. package/mcp/skills/postman/gorouter-restoration/references/tab_preservation.md +0 -12
  242. package/mcp/skills/postman/gorouter-restoration/templates/guard_redirect.md +0 -30
  243. package/mcp/skills/postman/gorouter-restoration/templates/stateful_shell_skeleton.md +0 -40
  244. package/mcp/skills/postman/mcp.json +0 -11
  245. package/mcp/skills/postman/microservices-architect/POWER.md +0 -109
  246. package/mcp/skills/postman/microservices-architect/SKILL.md +0 -93
  247. package/mcp/skills/postman/microservices-architect/references/communication.md +0 -499
  248. package/mcp/skills/postman/microservices-architect/references/data.md +0 -721
  249. package/mcp/skills/postman/microservices-architect/references/decomposition.md +0 -344
  250. package/mcp/skills/postman/microservices-architect/references/observability.md +0 -805
  251. package/mcp/skills/postman/microservices-architect/references/patterns.md +0 -603
  252. package/mcp/skills/postman/nestjs-expert/POWER.md +0 -130
  253. package/mcp/skills/postman/nestjs-expert/SKILL.md +0 -114
  254. package/mcp/skills/postman/nestjs-expert/references/authentication.md +0 -461
  255. package/mcp/skills/postman/nestjs-expert/references/controllers-routing.md +0 -114
  256. package/mcp/skills/postman/nestjs-expert/references/dtos-validation.md +0 -156
  257. package/mcp/skills/postman/nestjs-expert/references/migration-from-express.md +0 -1240
  258. package/mcp/skills/postman/nestjs-expert/references/secure-sessions.md +0 -56
  259. package/mcp/skills/postman/nestjs-expert/references/services-di.md +0 -143
  260. package/mcp/skills/postman/nestjs-expert/references/strangler-fig-pattern.md +0 -47
  261. package/mcp/skills/postman/nestjs-expert/references/testing-patterns.md +0 -189
  262. package/mcp/skills/postman/oneup-design/POWER.md +0 -72
  263. package/mcp/skills/postman/oneup-design/SKILL.md +0 -71
  264. package/mcp/skills/postman/oneup-design/references/colors.md +0 -41
  265. package/mcp/skills/postman/oneup-design/references/components.md +0 -53
  266. package/mcp/skills/postman/oneup-design/references/spacing.md +0 -84
  267. package/mcp/skills/postman/openapi-docs/POWER.md +0 -78
  268. package/mcp/skills/postman/openapi-docs/SKILL.md +0 -68
  269. package/mcp/skills/postman/openapi-docs/references/crud-templates.md +0 -43
  270. package/mcp/skills/postman/openapi-docs/references/framework-express.md +0 -19
  271. package/mcp/skills/postman/openapi-docs/references/framework-fastapi.md +0 -20
  272. package/mcp/skills/postman/openapi-docs/references/framework-nestjs.md +0 -20
  273. package/mcp/skills/postman/openapi-docs/references/framework-raw-openapi.md +0 -14
  274. package/mcp/skills/postman/openapi-docs/references/openapi-checklist.md +0 -33
  275. package/mcp/skills/postman/openapi-docs/references/stoplight-ui.md +0 -21
  276. package/mcp/skills/postman/references/steering.md +0 -26
  277. package/mcp/skills/sentry-automation/SKILL.md +0 -232
  278. package/workflows/powers/atlassian-mcp/POWER.md +0 -21
  279. package/workflows/powers/atlassian-mcp/SKILL.md +0 -18
  280. package/workflows/powers/building-mcp-server-on-cloudflare/POWER.md +0 -21
  281. package/workflows/powers/building-mcp-server-on-cloudflare/SKILL.md +0 -18
  282. package/workflows/powers/datadog-automation/POWER.md +0 -21
  283. package/workflows/powers/datadog-automation/SKILL.md +0 -18
  284. package/workflows/powers/github-automation/POWER.md +0 -21
  285. package/workflows/powers/github-automation/SKILL.md +0 -18
  286. package/workflows/powers/mcp-builder/POWER.md +0 -21
  287. package/workflows/powers/mcp-builder/SKILL.md +0 -18
  288. package/workflows/powers/mcp-developer/POWER.md +0 -21
  289. package/workflows/powers/mcp-developer/SKILL.md +0 -18
  290. package/workflows/powers/postman/POWER.md +0 -21
  291. package/workflows/powers/postman/SKILL.md +0 -18
  292. package/workflows/powers/sentry-automation/POWER.md +0 -21
  293. package/workflows/powers/sentry-automation/SKILL.md +0 -18
  294. package/workflows/skills/atlassian-mcp/SKILL.md +0 -18
  295. package/workflows/skills/building-mcp-server-on-cloudflare/SKILL.md +0 -18
  296. package/workflows/skills/datadog-automation/SKILL.md +0 -18
  297. package/workflows/skills/github-automation/SKILL.md +0 -18
  298. package/workflows/skills/mcp-builder/SKILL.md +0 -18
  299. package/workflows/skills/mcp-developer/SKILL.md +0 -18
  300. package/workflows/skills/sentry-automation/SKILL.md +0 -18
  301. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/SKILL.md +0 -88
  302. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/references/authentication-patterns.md +0 -520
  303. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/references/common-workflows.md +0 -660
  304. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/references/confluence-operations.md +0 -528
  305. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/references/jira-queries.md +0 -421
  306. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/atlassian-mcp/references/mcp-server-setup.md +0 -292
  307. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/building-mcp-server-on-cloudflare/SKILL.md +0 -258
  308. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/building-mcp-server-on-cloudflare/references/examples.md +0 -115
  309. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/building-mcp-server-on-cloudflare/references/oauth-setup.md +0 -338
  310. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/building-mcp-server-on-cloudflare/references/tool-patterns.md +0 -29
  311. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/building-mcp-server-on-cloudflare/references/troubleshooting.md +0 -317
  312. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/datadog-automation/SKILL.md +0 -235
  313. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/github-automation/SKILL.md +0 -227
  314. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/LICENSE.txt +0 -202
  315. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/SKILL.md +0 -176
  316. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/reference/evaluation.md +0 -602
  317. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/reference/mcp_best_practices.md +0 -915
  318. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/reference/node_mcp_server.md +0 -916
  319. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/reference/python_mcp_server.md +0 -752
  320. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/scripts/connections.py +0 -151
  321. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/scripts/evaluation.py +0 -373
  322. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/scripts/example_evaluation.xml +0 -22
  323. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-builder/scripts/requirements.txt +0 -2
  324. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/SKILL.md +0 -85
  325. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/references/protocol.md +0 -244
  326. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/references/python-sdk.md +0 -367
  327. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/references/resources.md +0 -554
  328. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/references/tools.md +0 -480
  329. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/mcp-developer/references/typescript-sdk.md +0 -350
  330. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/POWER.md +0 -242
  331. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/POWER.md +0 -89
  332. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/SKILL.md +0 -77
  333. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/bloc-state.md +0 -259
  334. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/engineering-principles.md +0 -82
  335. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/gorouter-navigation.md +0 -157
  336. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/performance.md +0 -99
  337. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/project-structure.md +0 -77
  338. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/riverpod-state.md +0 -404
  339. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-expert/references/widget-patterns.md +0 -123
  340. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/POWER.md +0 -204
  341. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/SKILL.md +0 -180
  342. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/references/log_redaction.md +0 -17
  343. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/references/network_hardening.md +0 -13
  344. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/references/secret-scanning.md +0 -123
  345. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/references/secure_storage_policy.md +0 -16
  346. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/references/vulnerability-patterns.md +0 -189
  347. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-security-reviewer/templates/security_review_response.md +0 -31
  348. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/POWER.md +0 -527
  349. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/SKILL.md +0 -513
  350. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/golden-testing.md +0 -444
  351. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/integration-testing.md +0 -688
  352. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/property-testing.md +0 -408
  353. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/riverpod-testing.md +0 -606
  354. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/test-utilities.md +0 -566
  355. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/unit-testing.md +0 -511
  356. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/flutter-test-master/references/widget-testing.md +0 -601
  357. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/git-commit/POWER.md +0 -47
  358. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/git-commit/SKILL.md +0 -46
  359. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/POWER.md +0 -227
  360. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/SKILL.md +0 -217
  361. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/references/guard_patterns.md +0 -14
  362. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/references/tab_preservation.md +0 -12
  363. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/templates/guard_redirect.md +0 -30
  364. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/gorouter-restoration/templates/stateful_shell_skeleton.md +0 -40
  365. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/mcp.json +0 -11
  366. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/POWER.md +0 -109
  367. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/SKILL.md +0 -93
  368. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/references/communication.md +0 -499
  369. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/references/data.md +0 -721
  370. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/references/decomposition.md +0 -344
  371. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/references/observability.md +0 -805
  372. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/microservices-architect/references/patterns.md +0 -603
  373. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/POWER.md +0 -130
  374. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/SKILL.md +0 -114
  375. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/authentication.md +0 -461
  376. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/controllers-routing.md +0 -114
  377. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/dtos-validation.md +0 -156
  378. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/migration-from-express.md +0 -1240
  379. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/secure-sessions.md +0 -56
  380. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/services-di.md +0 -143
  381. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/strangler-fig-pattern.md +0 -47
  382. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/nestjs-expert/references/testing-patterns.md +0 -189
  383. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/oneup-design/POWER.md +0 -72
  384. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/oneup-design/SKILL.md +0 -71
  385. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/oneup-design/references/colors.md +0 -41
  386. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/oneup-design/references/components.md +0 -53
  387. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/oneup-design/references/spacing.md +0 -84
  388. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/POWER.md +0 -78
  389. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/SKILL.md +0 -68
  390. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/crud-templates.md +0 -43
  391. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/framework-express.md +0 -19
  392. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/framework-fastapi.md +0 -20
  393. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/framework-nestjs.md +0 -20
  394. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/framework-raw-openapi.md +0 -14
  395. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/openapi-checklist.md +0 -33
  396. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/openapi-docs/references/stoplight-ui.md +0 -21
  397. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/postman/references/steering.md +0 -26
  398. package/workflows/workflows/agent-environment-setup/platforms/copilot/skills/sentry-automation/SKILL.md +0 -232
  399. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/SKILL.md +0 -88
  400. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/references/authentication-patterns.md +0 -520
  401. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/references/common-workflows.md +0 -660
  402. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/references/confluence-operations.md +0 -528
  403. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/references/jira-queries.md +0 -421
  404. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/atlassian-mcp/references/mcp-server-setup.md +0 -292
  405. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/building-mcp-server-on-cloudflare/SKILL.md +0 -258
  406. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/building-mcp-server-on-cloudflare/references/examples.md +0 -115
  407. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/building-mcp-server-on-cloudflare/references/oauth-setup.md +0 -338
  408. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/building-mcp-server-on-cloudflare/references/tool-patterns.md +0 -29
  409. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/building-mcp-server-on-cloudflare/references/troubleshooting.md +0 -317
  410. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/datadog-automation/SKILL.md +0 -235
  411. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/github-automation/SKILL.md +0 -227
  412. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/LICENSE.txt +0 -202
  413. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/SKILL.md +0 -176
  414. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/reference/evaluation.md +0 -602
  415. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/reference/mcp_best_practices.md +0 -915
  416. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/reference/node_mcp_server.md +0 -916
  417. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/reference/python_mcp_server.md +0 -752
  418. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/scripts/connections.py +0 -151
  419. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/scripts/evaluation.py +0 -373
  420. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/scripts/example_evaluation.xml +0 -22
  421. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-builder/scripts/requirements.txt +0 -2
  422. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/SKILL.md +0 -85
  423. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/references/protocol.md +0 -244
  424. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/references/python-sdk.md +0 -367
  425. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/references/resources.md +0 -554
  426. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/references/tools.md +0 -480
  427. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/mcp-developer/references/typescript-sdk.md +0 -350
  428. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/POWER.md +0 -242
  429. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/POWER.md +0 -89
  430. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/SKILL.md +0 -77
  431. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/bloc-state.md +0 -259
  432. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/engineering-principles.md +0 -82
  433. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/gorouter-navigation.md +0 -157
  434. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/performance.md +0 -99
  435. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/project-structure.md +0 -77
  436. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/riverpod-state.md +0 -404
  437. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-expert/references/widget-patterns.md +0 -123
  438. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/POWER.md +0 -204
  439. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/SKILL.md +0 -180
  440. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/references/log_redaction.md +0 -17
  441. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/references/network_hardening.md +0 -13
  442. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/references/secret-scanning.md +0 -123
  443. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/references/secure_storage_policy.md +0 -16
  444. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/references/vulnerability-patterns.md +0 -189
  445. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-security-reviewer/templates/security_review_response.md +0 -31
  446. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/POWER.md +0 -527
  447. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/SKILL.md +0 -513
  448. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/golden-testing.md +0 -444
  449. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/integration-testing.md +0 -688
  450. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/property-testing.md +0 -408
  451. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/riverpod-testing.md +0 -606
  452. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/test-utilities.md +0 -566
  453. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/unit-testing.md +0 -511
  454. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/flutter-test-master/references/widget-testing.md +0 -601
  455. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/git-commit/POWER.md +0 -47
  456. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/git-commit/SKILL.md +0 -46
  457. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/POWER.md +0 -227
  458. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/SKILL.md +0 -217
  459. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/references/guard_patterns.md +0 -14
  460. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/references/tab_preservation.md +0 -12
  461. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/templates/guard_redirect.md +0 -30
  462. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/gorouter-restoration/templates/stateful_shell_skeleton.md +0 -40
  463. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/mcp.json +0 -11
  464. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/POWER.md +0 -109
  465. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/SKILL.md +0 -93
  466. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/references/communication.md +0 -499
  467. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/references/data.md +0 -721
  468. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/references/decomposition.md +0 -344
  469. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/references/observability.md +0 -805
  470. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/microservices-architect/references/patterns.md +0 -603
  471. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/POWER.md +0 -130
  472. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/SKILL.md +0 -114
  473. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/authentication.md +0 -461
  474. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/controllers-routing.md +0 -114
  475. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/dtos-validation.md +0 -156
  476. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/migration-from-express.md +0 -1240
  477. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/secure-sessions.md +0 -56
  478. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/services-di.md +0 -143
  479. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/strangler-fig-pattern.md +0 -47
  480. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/nestjs-expert/references/testing-patterns.md +0 -189
  481. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/oneup-design/POWER.md +0 -72
  482. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/oneup-design/SKILL.md +0 -71
  483. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/oneup-design/references/colors.md +0 -41
  484. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/oneup-design/references/components.md +0 -53
  485. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/oneup-design/references/spacing.md +0 -84
  486. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/POWER.md +0 -78
  487. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/SKILL.md +0 -68
  488. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/crud-templates.md +0 -43
  489. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/framework-express.md +0 -19
  490. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/framework-fastapi.md +0 -20
  491. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/framework-nestjs.md +0 -20
  492. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/framework-raw-openapi.md +0 -14
  493. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/openapi-checklist.md +0 -33
  494. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/openapi-docs/references/stoplight-ui.md +0 -21
  495. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/postman/references/steering.md +0 -26
  496. package/workflows/workflows/agent-environment-setup/platforms/cursor/skills/sentry-automation/SKILL.md +0 -232
  497. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/SKILL.md +0 -88
  498. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/references/authentication-patterns.md +0 -520
  499. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/references/common-workflows.md +0 -660
  500. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/references/confluence-operations.md +0 -528
  501. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/references/jira-queries.md +0 -421
  502. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/atlassian-mcp/references/mcp-server-setup.md +0 -292
  503. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/building-mcp-server-on-cloudflare/SKILL.md +0 -258
  504. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/building-mcp-server-on-cloudflare/references/examples.md +0 -115
  505. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/building-mcp-server-on-cloudflare/references/oauth-setup.md +0 -338
  506. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/building-mcp-server-on-cloudflare/references/tool-patterns.md +0 -29
  507. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/building-mcp-server-on-cloudflare/references/troubleshooting.md +0 -317
  508. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/datadog-automation/SKILL.md +0 -235
  509. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/github-automation/SKILL.md +0 -227
  510. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/LICENSE.txt +0 -202
  511. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/SKILL.md +0 -176
  512. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/reference/evaluation.md +0 -602
  513. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/reference/mcp_best_practices.md +0 -915
  514. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/reference/node_mcp_server.md +0 -916
  515. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/reference/python_mcp_server.md +0 -752
  516. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/scripts/connections.py +0 -151
  517. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/scripts/evaluation.py +0 -373
  518. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/scripts/example_evaluation.xml +0 -22
  519. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-builder/scripts/requirements.txt +0 -2
  520. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/SKILL.md +0 -85
  521. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/references/protocol.md +0 -244
  522. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/references/python-sdk.md +0 -367
  523. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/references/resources.md +0 -554
  524. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/references/tools.md +0 -480
  525. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/mcp-developer/references/typescript-sdk.md +0 -350
  526. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/POWER.md +0 -242
  527. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/POWER.md +0 -89
  528. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/SKILL.md +0 -77
  529. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/bloc-state.md +0 -259
  530. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/engineering-principles.md +0 -82
  531. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/gorouter-navigation.md +0 -157
  532. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/performance.md +0 -99
  533. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/project-structure.md +0 -77
  534. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/riverpod-state.md +0 -404
  535. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-expert/references/widget-patterns.md +0 -123
  536. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/POWER.md +0 -204
  537. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/SKILL.md +0 -180
  538. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/references/log_redaction.md +0 -17
  539. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/references/network_hardening.md +0 -13
  540. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/references/secret-scanning.md +0 -123
  541. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/references/secure_storage_policy.md +0 -16
  542. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/references/vulnerability-patterns.md +0 -189
  543. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-security-reviewer/templates/security_review_response.md +0 -31
  544. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/POWER.md +0 -527
  545. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/SKILL.md +0 -513
  546. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/golden-testing.md +0 -444
  547. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/integration-testing.md +0 -688
  548. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/property-testing.md +0 -408
  549. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/riverpod-testing.md +0 -606
  550. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/test-utilities.md +0 -566
  551. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/unit-testing.md +0 -511
  552. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/flutter-test-master/references/widget-testing.md +0 -601
  553. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/git-commit/POWER.md +0 -47
  554. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/git-commit/SKILL.md +0 -46
  555. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/POWER.md +0 -227
  556. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/SKILL.md +0 -217
  557. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/references/guard_patterns.md +0 -14
  558. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/references/tab_preservation.md +0 -12
  559. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/templates/guard_redirect.md +0 -30
  560. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/gorouter-restoration/templates/stateful_shell_skeleton.md +0 -40
  561. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/mcp.json +0 -11
  562. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/POWER.md +0 -109
  563. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/SKILL.md +0 -93
  564. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/references/communication.md +0 -499
  565. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/references/data.md +0 -721
  566. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/references/decomposition.md +0 -344
  567. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/references/observability.md +0 -805
  568. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/microservices-architect/references/patterns.md +0 -603
  569. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/POWER.md +0 -130
  570. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/SKILL.md +0 -114
  571. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/authentication.md +0 -461
  572. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/controllers-routing.md +0 -114
  573. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/dtos-validation.md +0 -156
  574. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/migration-from-express.md +0 -1240
  575. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/secure-sessions.md +0 -56
  576. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/services-di.md +0 -143
  577. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/strangler-fig-pattern.md +0 -47
  578. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/nestjs-expert/references/testing-patterns.md +0 -189
  579. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/oneup-design/POWER.md +0 -72
  580. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/oneup-design/SKILL.md +0 -71
  581. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/oneup-design/references/colors.md +0 -41
  582. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/oneup-design/references/components.md +0 -53
  583. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/oneup-design/references/spacing.md +0 -84
  584. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/POWER.md +0 -78
  585. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/SKILL.md +0 -68
  586. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/crud-templates.md +0 -43
  587. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/framework-express.md +0 -19
  588. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/framework-fastapi.md +0 -20
  589. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/framework-nestjs.md +0 -20
  590. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/framework-raw-openapi.md +0 -14
  591. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/openapi-checklist.md +0 -33
  592. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/openapi-docs/references/stoplight-ui.md +0 -21
  593. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/postman/references/steering.md +0 -26
  594. package/workflows/workflows/agent-environment-setup/platforms/windsurf/skills/sentry-automation/SKILL.md +0 -232
@@ -1,602 +0,0 @@
1
- # MCP Server Evaluation Guide
2
-
3
- ## Overview
4
-
5
- This document provides guidance on creating comprehensive evaluations for MCP servers. Evaluations test whether LLMs can effectively use your MCP server to answer realistic, complex questions using only the tools provided.
6
-
7
- ---
8
-
9
- ## Quick Reference
10
-
11
- ### Evaluation Requirements
12
- - Create 10 human-readable questions
13
- - Questions must be READ-ONLY, INDEPENDENT, NON-DESTRUCTIVE
14
- - Each question requires multiple tool calls (potentially dozens)
15
- - Answers must be single, verifiable values
16
- - Answers must be STABLE (won't change over time)
17
-
18
- ### Output Format
19
- ```xml
20
- <evaluation>
21
- <qa_pair>
22
- <question>Your question here</question>
23
- <answer>Single verifiable answer</answer>
24
- </qa_pair>
25
- </evaluation>
26
- ```
27
-
28
- ---
29
-
30
- ## Purpose of Evaluations
31
-
32
- The measure of quality of an MCP server is NOT how well or comprehensively the server implements tools, but how well these implementations (input/output schemas, docstrings/descriptions, functionality) enable LLMs with no other context and access ONLY to the MCP servers to answer realistic and difficult questions.
33
-
34
- ## Evaluation Overview
35
-
36
- Create 10 human-readable questions requiring ONLY READ-ONLY, INDEPENDENT, NON-DESTRUCTIVE, and IDEMPOTENT operations to answer. Each question should be:
37
- - Realistic
38
- - Clear and concise
39
- - Unambiguous
40
- - Complex, requiring potentially dozens of tool calls or steps
41
- - Answerable with a single, verifiable value that you identify in advance
42
-
43
- ## Question Guidelines
44
-
45
- ### Core Requirements
46
-
47
- 1. **Questions MUST be independent**
48
- - Each question should NOT depend on the answer to any other question
49
- - Should not assume prior write operations from processing another question
50
-
51
- 2. **Questions MUST require ONLY NON-DESTRUCTIVE AND IDEMPOTENT tool use**
52
- - Should not instruct or require modifying state to arrive at the correct answer
53
-
54
- 3. **Questions must be REALISTIC, CLEAR, CONCISE, and COMPLEX**
55
- - Must require another LLM to use multiple (potentially dozens of) tools or steps to answer
56
-
57
- ### Complexity and Depth
58
-
59
- 4. **Questions must require deep exploration**
60
- - Consider multi-hop questions requiring multiple sub-questions and sequential tool calls
61
- - Each step should benefit from information found in previous questions
62
-
63
- 5. **Questions may require extensive paging**
64
- - May need paging through multiple pages of results
65
- - May require querying old data (1-2 years out-of-date) to find niche information
66
- - The questions must be DIFFICULT
67
-
68
- 6. **Questions must require deep understanding**
69
- - Rather than surface-level knowledge
70
- - May pose complex ideas as True/False questions requiring evidence
71
- - May use multiple-choice format where LLM must search different hypotheses
72
-
73
- 7. **Questions must not be solvable with straightforward keyword search**
74
- - Do not include specific keywords from the target content
75
- - Use synonyms, related concepts, or paraphrases
76
- - Require multiple searches, analyzing multiple related items, extracting context, then deriving the answer
77
-
78
- ### Tool Testing
79
-
80
- 8. **Questions should stress-test tool return values**
81
- - May elicit tools returning large JSON objects or lists, overwhelming the LLM
82
- - Should require understanding multiple modalities of data:
83
- - IDs and names
84
- - Timestamps and datetimes (months, days, years, seconds)
85
- - File IDs, names, extensions, and mimetypes
86
- - URLs, GIDs, etc.
87
- - Should probe the tool's ability to return all useful forms of data
88
-
89
- 9. **Questions should MOSTLY reflect real human use cases**
90
- - The kinds of information retrieval tasks that HUMANS assisted by an LLM would care about
91
-
92
- 10. **Questions may require dozens of tool calls**
93
- - This challenges LLMs with limited context
94
- - Encourages MCP server tools to reduce information returned
95
-
96
- 11. **Include ambiguous questions**
97
- - May be ambiguous OR require difficult decisions on which tools to call
98
- - Force the LLM to potentially make mistakes or misinterpret
99
- - Ensure that despite AMBIGUITY, there is STILL A SINGLE VERIFIABLE ANSWER
100
-
101
- ### Stability
102
-
103
- 12. **Questions must be designed so the answer DOES NOT CHANGE**
104
- - Do not ask questions that rely on "current state" which is dynamic
105
- - For example, do not count:
106
- - Number of reactions to a post
107
- - Number of replies to a thread
108
- - Number of members in a channel
109
-
110
- 13. **DO NOT let the MCP server RESTRICT the kinds of questions you create**
111
- - Create challenging and complex questions
112
- - Some may not be solvable with the available MCP server tools
113
- - Questions may require specific output formats (datetime vs. epoch time, JSON vs. MARKDOWN)
114
- - Questions may require dozens of tool calls to complete
115
-
116
- ## Answer Guidelines
117
-
118
- ### Verification
119
-
120
- 1. **Answers must be VERIFIABLE via direct string comparison**
121
- - If the answer can be re-written in many formats, clearly specify the output format in the QUESTION
122
- - Examples: "Use YYYY/MM/DD.", "Respond True or False.", "Answer A, B, C, or D and nothing else."
123
- - Answer should be a single VERIFIABLE value such as:
124
- - User ID, user name, display name, first name, last name
125
- - Channel ID, channel name
126
- - Message ID, string
127
- - URL, title
128
- - Numerical quantity
129
- - Timestamp, datetime
130
- - Boolean (for True/False questions)
131
- - Email address, phone number
132
- - File ID, file name, file extension
133
- - Multiple choice answer
134
- - Answers must not require special formatting or complex, structured output
135
- - Answer will be verified using DIRECT STRING COMPARISON
136
-
137
- ### Readability
138
-
139
- 2. **Answers should generally prefer HUMAN-READABLE formats**
140
- - Examples: names, first name, last name, datetime, file name, message string, URL, yes/no, true/false, a/b/c/d
141
- - Rather than opaque IDs (though IDs are acceptable)
142
- - The VAST MAJORITY of answers should be human-readable
143
-
144
- ### Stability
145
-
146
- 3. **Answers must be STABLE/STATIONARY**
147
- - Look at old content (e.g., conversations that have ended, projects that have launched, questions answered)
148
- - Create QUESTIONS based on "closed" concepts that will always return the same answer
149
- - Questions may ask to consider a fixed time window to insulate from non-stationary answers
150
- - Rely on context UNLIKELY to change
151
- - Example: if finding a paper name, be SPECIFIC enough so answer is not confused with papers published later
152
-
153
- 4. **Answers must be CLEAR and UNAMBIGUOUS**
154
- - Questions must be designed so there is a single, clear answer
155
- - Answer can be derived from using the MCP server tools
156
-
157
- ### Diversity
158
-
159
- 5. **Answers must be DIVERSE**
160
- - Answer should be a single VERIFIABLE value in diverse modalities and formats
161
- - User concept: user ID, user name, display name, first name, last name, email address, phone number
162
- - Channel concept: channel ID, channel name, channel topic
163
- - Message concept: message ID, message string, timestamp, month, day, year
164
-
165
- 6. **Answers must NOT be complex structures**
166
- - Not a list of values
167
- - Not a complex object
168
- - Not a list of IDs or strings
169
- - Not natural language text
170
- - UNLESS the answer can be straightforwardly verified using DIRECT STRING COMPARISON
171
- - And can be realistically reproduced
172
- - It should be unlikely that an LLM would return the same list in any other order or format
173
-
174
- ## Evaluation Process
175
-
176
- ### Step 1: Documentation Inspection
177
-
178
- Read the documentation of the target API to understand:
179
- - Available endpoints and functionality
180
- - If ambiguity exists, fetch additional information from the web
181
- - Parallelize this step AS MUCH AS POSSIBLE
182
- - Ensure each subagent is ONLY examining documentation from the file system or on the web
183
-
184
- ### Step 2: Tool Inspection
185
-
186
- List the tools available in the MCP server:
187
- - Inspect the MCP server directly
188
- - Understand input/output schemas, docstrings, and descriptions
189
- - WITHOUT calling the tools themselves at this stage
190
-
191
- ### Step 3: Developing Understanding
192
-
193
- Repeat steps 1 & 2 until you have a good understanding:
194
- - Iterate multiple times
195
- - Think about the kinds of tasks you want to create
196
- - Refine your understanding
197
- - At NO stage should you READ the code of the MCP server implementation itself
198
- - Use your intuition and understanding to create reasonable, realistic, but VERY challenging tasks
199
-
200
- ### Step 4: Read-Only Content Inspection
201
-
202
- After understanding the API and tools, USE the MCP server tools:
203
- - Inspect content using READ-ONLY and NON-DESTRUCTIVE operations ONLY
204
- - Goal: identify specific content (e.g., users, channels, messages, projects, tasks) for creating realistic questions
205
- - Should NOT call any tools that modify state
206
- - Will NOT read the code of the MCP server implementation itself
207
- - Parallelize this step with individual sub-agents pursuing independent explorations
208
- - Ensure each subagent is only performing READ-ONLY, NON-DESTRUCTIVE, and IDEMPOTENT operations
209
- - BE CAREFUL: SOME TOOLS may return LOTS OF DATA which would cause you to run out of CONTEXT
210
- - Make INCREMENTAL, SMALL, AND TARGETED tool calls for exploration
211
- - In all tool call requests, use the `limit` parameter to limit results (<10)
212
- - Use pagination
213
-
214
- ### Step 5: Task Generation
215
-
216
- After inspecting the content, create 10 human-readable questions:
217
- - An LLM should be able to answer these with the MCP server
218
- - Follow all question and answer guidelines above
219
-
220
- ## Output Format
221
-
222
- Each QA pair consists of a question and an answer. The output should be an XML file with this structure:
223
-
224
- ```xml
225
- <evaluation>
226
- <qa_pair>
227
- <question>Find the project created in Q2 2024 with the highest number of completed tasks. What is the project name?</question>
228
- <answer>Website Redesign</answer>
229
- </qa_pair>
230
- <qa_pair>
231
- <question>Search for issues labeled as "bug" that were closed in March 2024. Which user closed the most issues? Provide their username.</question>
232
- <answer>sarah_dev</answer>
233
- </qa_pair>
234
- <qa_pair>
235
- <question>Look for pull requests that modified files in the /api directory and were merged between January 1 and January 31, 2024. How many different contributors worked on these PRs?</question>
236
- <answer>7</answer>
237
- </qa_pair>
238
- <qa_pair>
239
- <question>Find the repository with the most stars that was created before 2023. What is the repository name?</question>
240
- <answer>data-pipeline</answer>
241
- </qa_pair>
242
- </evaluation>
243
- ```
244
-
245
- ## Evaluation Examples
246
-
247
- ### Good Questions
248
-
249
- **Example 1: Multi-hop question requiring deep exploration (GitHub MCP)**
250
- ```xml
251
- <qa_pair>
252
- <question>Find the repository that was archived in Q3 2023 and had previously been the most forked project in the organization. What was the primary programming language used in that repository?</question>
253
- <answer>Python</answer>
254
- </qa_pair>
255
- ```
256
-
257
- This question is good because:
258
- - Requires multiple searches to find archived repositories
259
- - Needs to identify which had the most forks before archival
260
- - Requires examining repository details for the language
261
- - Answer is a simple, verifiable value
262
- - Based on historical (closed) data that won't change
263
-
264
- **Example 2: Requires understanding context without keyword matching (Project Management MCP)**
265
- ```xml
266
- <qa_pair>
267
- <question>Locate the initiative focused on improving customer onboarding that was completed in late 2023. The project lead created a retrospective document after completion. What was the lead's role title at that time?</question>
268
- <answer>Product Manager</answer>
269
- </qa_pair>
270
- ```
271
-
272
- This question is good because:
273
- - Doesn't use specific project name ("initiative focused on improving customer onboarding")
274
- - Requires finding completed projects from specific timeframe
275
- - Needs to identify the project lead and their role
276
- - Requires understanding context from retrospective documents
277
- - Answer is human-readable and stable
278
- - Based on completed work (won't change)
279
-
280
- **Example 3: Complex aggregation requiring multiple steps (Issue Tracker MCP)**
281
- ```xml
282
- <qa_pair>
283
- <question>Among all bugs reported in January 2024 that were marked as critical priority, which assignee resolved the highest percentage of their assigned bugs within 48 hours? Provide the assignee's username.</question>
284
- <answer>alex_eng</answer>
285
- </qa_pair>
286
- ```
287
-
288
- This question is good because:
289
- - Requires filtering bugs by date, priority, and status
290
- - Needs to group by assignee and calculate resolution rates
291
- - Requires understanding timestamps to determine 48-hour windows
292
- - Tests pagination (potentially many bugs to process)
293
- - Answer is a single username
294
- - Based on historical data from specific time period
295
-
296
- **Example 4: Requires synthesis across multiple data types (CRM MCP)**
297
- ```xml
298
- <qa_pair>
299
- <question>Find the account that upgraded from the Starter to Enterprise plan in Q4 2023 and had the highest annual contract value. What industry does this account operate in?</question>
300
- <answer>Healthcare</answer>
301
- </qa_pair>
302
- ```
303
-
304
- This question is good because:
305
- - Requires understanding subscription tier changes
306
- - Needs to identify upgrade events in specific timeframe
307
- - Requires comparing contract values
308
- - Must access account industry information
309
- - Answer is simple and verifiable
310
- - Based on completed historical transactions
311
-
312
- ### Poor Questions
313
-
314
- **Example 1: Answer changes over time**
315
- ```xml
316
- <qa_pair>
317
- <question>How many open issues are currently assigned to the engineering team?</question>
318
- <answer>47</answer>
319
- </qa_pair>
320
- ```
321
-
322
- This question is poor because:
323
- - The answer will change as issues are created, closed, or reassigned
324
- - Not based on stable/stationary data
325
- - Relies on "current state" which is dynamic
326
-
327
- **Example 2: Too easy with keyword search**
328
- ```xml
329
- <qa_pair>
330
- <question>Find the pull request with title "Add authentication feature" and tell me who created it.</question>
331
- <answer>developer123</answer>
332
- </qa_pair>
333
- ```
334
-
335
- This question is poor because:
336
- - Can be solved with a straightforward keyword search for exact title
337
- - Doesn't require deep exploration or understanding
338
- - No synthesis or analysis needed
339
-
340
- **Example 3: Ambiguous answer format**
341
- ```xml
342
- <qa_pair>
343
- <question>List all the repositories that have Python as their primary language.</question>
344
- <answer>repo1, repo2, repo3, data-pipeline, ml-tools</answer>
345
- </qa_pair>
346
- ```
347
-
348
- This question is poor because:
349
- - Answer is a list that could be returned in any order
350
- - Difficult to verify with direct string comparison
351
- - LLM might format differently (JSON array, comma-separated, newline-separated)
352
- - Better to ask for a specific aggregate (count) or superlative (most stars)
353
-
354
- ## Verification Process
355
-
356
- After creating evaluations:
357
-
358
- 1. **Examine the XML file** to understand the schema
359
- 2. **Load each task instruction** and in parallel using the MCP server and tools, identify the correct answer by attempting to solve the task YOURSELF
360
- 3. **Flag any operations** that require WRITE or DESTRUCTIVE operations
361
- 4. **Accumulate all CORRECT answers** and replace any incorrect answers in the document
362
- 5. **Remove any `<qa_pair>`** that require WRITE or DESTRUCTIVE operations
363
-
364
- Remember to parallelize solving tasks to avoid running out of context, then accumulate all answers and make changes to the file at the end.
365
-
366
- ## Tips for Creating Quality Evaluations
367
-
368
- 1. **Think Hard and Plan Ahead** before generating tasks
369
- 2. **Parallelize Where Opportunity Arises** to speed up the process and manage context
370
- 3. **Focus on Realistic Use Cases** that humans would actually want to accomplish
371
- 4. **Create Challenging Questions** that test the limits of the MCP server's capabilities
372
- 5. **Ensure Stability** by using historical data and closed concepts
373
- 6. **Verify Answers** by solving the questions yourself using the MCP server tools
374
- 7. **Iterate and Refine** based on what you learn during the process
375
-
376
- ---
377
-
378
- # Running Evaluations
379
-
380
- After creating your evaluation file, you can use the provided evaluation harness to test your MCP server.
381
-
382
- ## Setup
383
-
384
- 1. **Install Dependencies**
385
-
386
- ```bash
387
- pip install -r scripts/requirements.txt
388
- ```
389
-
390
- Or install manually:
391
- ```bash
392
- pip install anthropic mcp
393
- ```
394
-
395
- 2. **Set API Key**
396
-
397
- ```bash
398
- export ANTHROPIC_API_KEY=your_api_key_here
399
- ```
400
-
401
- ## Evaluation File Format
402
-
403
- Evaluation files use XML format with `<qa_pair>` elements:
404
-
405
- ```xml
406
- <evaluation>
407
- <qa_pair>
408
- <question>Find the project created in Q2 2024 with the highest number of completed tasks. What is the project name?</question>
409
- <answer>Website Redesign</answer>
410
- </qa_pair>
411
- <qa_pair>
412
- <question>Search for issues labeled as "bug" that were closed in March 2024. Which user closed the most issues? Provide their username.</question>
413
- <answer>sarah_dev</answer>
414
- </qa_pair>
415
- </evaluation>
416
- ```
417
-
418
- ## Running Evaluations
419
-
420
- The evaluation script (`scripts/evaluation.py`) supports three transport types:
421
-
422
- **Important:**
423
- - **stdio transport**: The evaluation script automatically launches and manages the MCP server process for you. Do not run the server manually.
424
- - **sse/http transports**: You must start the MCP server separately before running the evaluation. The script connects to the already-running server at the specified URL.
425
-
426
- ### 1. Local STDIO Server
427
-
428
- For locally-run MCP servers (script launches the server automatically):
429
-
430
- ```bash
431
- python scripts/evaluation.py \
432
- -t stdio \
433
- -c python \
434
- -a my_mcp_server.py \
435
- evaluation.xml
436
- ```
437
-
438
- With environment variables:
439
- ```bash
440
- python scripts/evaluation.py \
441
- -t stdio \
442
- -c python \
443
- -a my_mcp_server.py \
444
- -e API_KEY=abc123 \
445
- -e DEBUG=true \
446
- evaluation.xml
447
- ```
448
-
449
- ### 2. Server-Sent Events (SSE)
450
-
451
- For SSE-based MCP servers (you must start the server first):
452
-
453
- ```bash
454
- python scripts/evaluation.py \
455
- -t sse \
456
- -u https://example.com/mcp \
457
- -H "Authorization: Bearer token123" \
458
- -H "X-Custom-Header: value" \
459
- evaluation.xml
460
- ```
461
-
462
- ### 3. HTTP (Streamable HTTP)
463
-
464
- For HTTP-based MCP servers (you must start the server first):
465
-
466
- ```bash
467
- python scripts/evaluation.py \
468
- -t http \
469
- -u https://example.com/mcp \
470
- -H "Authorization: Bearer token123" \
471
- evaluation.xml
472
- ```
473
-
474
- ## Command-Line Options
475
-
476
- ```
477
- usage: evaluation.py [-h] [-t {stdio,sse,http}] [-m MODEL] [-c COMMAND]
478
- [-a ARGS [ARGS ...]] [-e ENV [ENV ...]] [-u URL]
479
- [-H HEADERS [HEADERS ...]] [-o OUTPUT]
480
- eval_file
481
-
482
- positional arguments:
483
- eval_file Path to evaluation XML file
484
-
485
- optional arguments:
486
- -h, --help Show help message
487
- -t, --transport Transport type: stdio, sse, or http (default: stdio)
488
- -m, --model Claude model to use (default: claude-3-7-sonnet-20250219)
489
- -o, --output Output file for report (default: print to stdout)
490
-
491
- stdio options:
492
- -c, --command Command to run MCP server (e.g., python, node)
493
- -a, --args Arguments for the command (e.g., server.py)
494
- -e, --env Environment variables in KEY=VALUE format
495
-
496
- sse/http options:
497
- -u, --url MCP server URL
498
- -H, --header HTTP headers in 'Key: Value' format
499
- ```
500
-
501
- ## Output
502
-
503
- The evaluation script generates a detailed report including:
504
-
505
- - **Summary Statistics**:
506
- - Accuracy (correct/total)
507
- - Average task duration
508
- - Average tool calls per task
509
- - Total tool calls
510
-
511
- - **Per-Task Results**:
512
- - Prompt and expected response
513
- - Actual response from the agent
514
- - Whether the answer was correct (✅/❌)
515
- - Duration and tool call details
516
- - Agent's summary of its approach
517
- - Agent's feedback on the tools
518
-
519
- ### Save Report to File
520
-
521
- ```bash
522
- python scripts/evaluation.py \
523
- -t stdio \
524
- -c python \
525
- -a my_server.py \
526
- -o evaluation_report.md \
527
- evaluation.xml
528
- ```
529
-
530
- ## Complete Example Workflow
531
-
532
- Here's a complete example of creating and running an evaluation:
533
-
534
- 1. **Create your evaluation file** (`my_evaluation.xml`):
535
-
536
- ```xml
537
- <evaluation>
538
- <qa_pair>
539
- <question>Find the user who created the most issues in January 2024. What is their username?</question>
540
- <answer>alice_developer</answer>
541
- </qa_pair>
542
- <qa_pair>
543
- <question>Among all pull requests merged in Q1 2024, which repository had the highest number? Provide the repository name.</question>
544
- <answer>backend-api</answer>
545
- </qa_pair>
546
- <qa_pair>
547
- <question>Find the project that was completed in December 2023 and had the longest duration from start to finish. How many days did it take?</question>
548
- <answer>127</answer>
549
- </qa_pair>
550
- </evaluation>
551
- ```
552
-
553
- 2. **Install dependencies**:
554
-
555
- ```bash
556
- pip install -r scripts/requirements.txt
557
- export ANTHROPIC_API_KEY=your_api_key
558
- ```
559
-
560
- 3. **Run evaluation**:
561
-
562
- ```bash
563
- python scripts/evaluation.py \
564
- -t stdio \
565
- -c python \
566
- -a github_mcp_server.py \
567
- -e GITHUB_TOKEN=ghp_xxx \
568
- -o github_eval_report.md \
569
- my_evaluation.xml
570
- ```
571
-
572
- 4. **Review the report** in `github_eval_report.md` to:
573
- - See which questions passed/failed
574
- - Read the agent's feedback on your tools
575
- - Identify areas for improvement
576
- - Iterate on your MCP server design
577
-
578
- ## Troubleshooting
579
-
580
- ### Connection Errors
581
-
582
- If you get connection errors:
583
- - **STDIO**: Verify the command and arguments are correct
584
- - **SSE/HTTP**: Check the URL is accessible and headers are correct
585
- - Ensure any required API keys are set in environment variables or headers
586
-
587
- ### Low Accuracy
588
-
589
- If many evaluations fail:
590
- - Review the agent's feedback for each task
591
- - Check if tool descriptions are clear and comprehensive
592
- - Verify input parameters are well-documented
593
- - Consider whether tools return too much or too little data
594
- - Ensure error messages are actionable
595
-
596
- ### Timeout Issues
597
-
598
- If tasks are timing out:
599
- - Use a more capable model (e.g., `claude-3-7-sonnet-20250219`)
600
- - Check if tools are returning too much data
601
- - Verify pagination is working correctly
602
- - Consider simplifying complex questions