npm - @greynewell/mcpbr - Versions diffs - 0.9.1 → 0.10.2 - Mend

@greynewell/mcpbr 0.9.1 → 0.10.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +14 -1
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -54,6 +54,8 @@ mcpbr runs controlled experiments: same model, same tasks, same environment - th
 - **Real GitHub issues** from SWE-bench (not toy examples)
 - **Reproducible results** via Docker containers with pinned dependencies
+> Read the full origin story: **[Why I Built mcpbr](https://greynewell.com/blog/why-i-built-mcpbr/)** — the problem, the approach, and where the project is headed.
 ## Supported Benchmarks
 mcpbr supports 30+ benchmarks across 10 categories through a flexible abstraction layer:
@@ -727,6 +729,17 @@ Run SWE-bench evaluation with the configured MCP server.
 | `--smtp-port PORT` | | SMTP server port (default: 587) |
 | `--smtp-user USER` | | SMTP username for authentication |
 | `--smtp-password PASS` | | SMTP password for authentication |
+| `--sampling-strategy TEXT` | | Task sampling strategy (`sequential`, `random`, `stratified`) |
+| `--random-seed INT` | | Random seed for reproducible sampling |
+| `--stratify-field TEXT` | | Field to stratify by (requires `--sampling-strategy stratified`) |
+| `--notify-slack URL` | | Slack webhook URL for completion notifications |
+| `--notify-discord URL` | | Discord webhook URL for completion notifications |
+| `--notify-email JSON` | | Email config as JSON string |
+| `--slack-bot-token TOKEN` | | Slack bot token (`xoxb-...`) for uploading results.json to a channel |
+| `--slack-channel ID` | | Slack channel ID for file uploads (used with `--slack-bot-token`) |
+| `--github-token TOKEN` | | GitHub token for auto-creating a Gist with full results (linked in notifications) |
+| `--wandb/--no-wandb` | | Enable/disable Weights & Biases logging |
+| `--wandb-project TEXT` | | W&B project name |
 | `--profile` | | Enable comprehensive performance profiling (tool latency, memory, overhead) |
 | `--help` | `-h` | Show help message |
@@ -1489,4 +1502,4 @@ MIT - see [LICENSE](LICENSE) for details.
 ---
-Built by [Grey Newell](https://greynewell.com)
+Built by [Grey Newell](https://greynewell.com) | [Why I Built mcpbr](https://greynewell.com/blog/why-i-built-mcpbr/) | [About](https://mcpbr.org/about/)

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@greynewell/mcpbr",
-  "version": "0.9.1",
+  "version": "0.10.2",
   "description": "Model Context Protocol Benchmark Runner - CLI tool for evaluating MCP servers",
   "keywords": [
     "mcpbr",