npm - @openhands/extensions - Versions diffs - 0.0.1-alpha → 0.2.0 - Mend

@openhands/extensions 0.0.1-alpha → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (347)

package/.agents/skills/custom-codereview-guide.md +25 -0
package/.github/pull_request_template.md +38 -0
package/.github/release.yml +14 -0
package/.github/workflows/check-extensions.yml +72 -0
package/.github/workflows/npm-publish.yml +89 -0
package/.github/workflows/pr.yml +30 -0
package/.github/workflows/release.yml +24 -0
package/.github/workflows/tests.yml +25 -0
package/.github/workflows/vulnerability-scan.yml +87 -0
package/.release-please-manifest.json +3 -0
package/AGENTS.md +132 -0
package/README.md +10 -0
package/analysis_results.md +162 -0
package/marketplaces/large-codebase.json +66 -0
package/marketplaces/openhands-extensions.json +682 -0
package/package.json +4 -10
package/plugins/README.md +30 -0
package/plugins/city-weather/.plugin/plugin.json +13 -0
package/plugins/city-weather/README.md +145 -0
package/plugins/city-weather/commands/now.md +56 -0
package/plugins/cobol-modernization/.plugin/plugin.json +19 -0
package/plugins/cobol-modernization/README.md +201 -0
package/plugins/cobol-modernization/references/troubleshooting.md +18 -0
package/plugins/cobol-modernization/skills/build-setup/SKILL.md +78 -0
package/plugins/cobol-modernization/skills/build-setup/scripts/install-gnucobol.sh +32 -0
package/plugins/cobol-modernization/skills/cobol-modernization-overview/SKILL.md +113 -0
package/plugins/cobol-modernization/skills/mainfraime-removal/SKILL.md +62 -0
package/plugins/cobol-modernization/skills/mainfraime-removal/references/cics-transformation-examples.md +45 -0
package/plugins/cobol-modernization/skills/mainframe-planning/SKILL.md +78 -0
package/plugins/cobol-modernization/skills/to-java-migration/SKILL.md +59 -0
package/plugins/cobol-modernization/skills/to-java-migration/references/cobol-to-java-example.md +58 -0
package/plugins/cobol-modernization/skills/to-java-migration/references/datatype-mappings.md +19 -0
package/plugins/issue-duplicate-checker/.plugin/plugin.json +13 -0
package/plugins/issue-duplicate-checker/README.md +51 -0
package/plugins/issue-duplicate-checker/action.yml +349 -0
package/plugins/issue-duplicate-checker/scripts/auto_close_duplicate_issues.py +569 -0
package/plugins/issue-duplicate-checker/scripts/issue_duplicate_check_openhands.py +681 -0
package/plugins/issue-duplicate-checker/scripts/post_duplicate_notice.js +220 -0
package/plugins/issue-duplicate-checker/scripts/remove_duplicate_candidate_label.js +27 -0
package/plugins/magic-test/.plugin/plugin.json +13 -0
package/plugins/magic-test/skills/magic-word/SKILL.md +33 -0
package/plugins/migration-scoring/.plugin/plugin.json +19 -0
package/plugins/migration-scoring/README.md +244 -0
package/plugins/migration-scoring/skills/migration-mapping/SKILL.md +72 -0
package/plugins/migration-scoring/skills/migration-report/SKILL.md +118 -0
package/plugins/migration-scoring/skills/migration-scoring-overview/SKILL.md +126 -0
package/plugins/migration-scoring/skills/score-quality/SKILL.md +54 -0
package/plugins/migration-scoring/skills/score-quality/references/scoring-criteria.md +30 -0
package/plugins/migration-scoring/skills/score-style/SKILL.md +106 -0
package/plugins/onboarding/.plugin/plugin.json +20 -0
package/plugins/onboarding/README.md +30 -0
package/plugins/onboarding/references/criteria.md +144 -0
package/plugins/onboarding/skills/agent-readiness-report/README.md +23 -0
package/plugins/onboarding/skills/agent-readiness-report/SKILL.md +122 -0
package/plugins/onboarding/skills/agent-readiness-report/scripts/scan_agent_instructions.sh +88 -0
package/plugins/onboarding/skills/agent-readiness-report/scripts/scan_build_env.sh +114 -0
package/plugins/onboarding/skills/agent-readiness-report/scripts/scan_feedback_loops.sh +133 -0
package/plugins/onboarding/skills/agent-readiness-report/scripts/scan_policy.sh +113 -0
package/plugins/onboarding/skills/agent-readiness-report/scripts/scan_workflows.sh +127 -0
package/plugins/onboarding/skills/improve-agent-readiness/README.md +19 -0
package/plugins/onboarding/skills/improve-agent-readiness/SKILL.md +167 -0
package/plugins/onboarding/skills/setup-agents-md/README.md +15 -0
package/plugins/onboarding/skills/setup-agents-md/SKILL.md +150 -0
package/plugins/onboarding/skills/setup-openhands/README.md +20 -0
package/plugins/onboarding/skills/setup-openhands/SKILL.md +56 -0
package/plugins/onboarding/skills/setup-pr-review/README.md +23 -0
package/plugins/onboarding/skills/setup-pr-review/SKILL.md +72 -0
package/plugins/openhands/.plugin/plugin.json +13 -0
package/plugins/openhands/README.md +52 -0
package/plugins/openhands/SKILL.md +61 -0
package/plugins/openhands/commands/create.md +55 -0
package/plugins/openhands/commands/openhands-cloud.md +8 -0
package/plugins/openhands/scripts/run.sh +69 -0
package/plugins/pr-review/.plugin/plugin.json +13 -0
package/plugins/pr-review/README.md +393 -0
package/plugins/pr-review/action.yml +298 -0
package/plugins/pr-review/scripts/agent_script.py +1282 -0
package/plugins/pr-review/scripts/evaluate_review.py +655 -0
package/plugins/pr-review/scripts/prompt.py +260 -0
package/plugins/pr-review/workflows/pr-review-by-openhands.yml +51 -0
package/plugins/pr-review/workflows/pr-review-evaluation.yml +85 -0
package/plugins/qa-changes/.plugin/plugin.json +11 -0
package/plugins/qa-changes/README.md +185 -0
package/plugins/qa-changes/action.yml +181 -0
package/plugins/qa-changes/scripts/agent_script.py +406 -0
package/plugins/qa-changes/scripts/evaluate_qa_changes.py +385 -0
package/plugins/qa-changes/scripts/prompt.py +174 -0
package/plugins/qa-changes/workflows/qa-changes-by-openhands.yml +50 -0
package/plugins/qa-changes/workflows/qa-changes-evaluation.yml +85 -0
package/plugins/release-notes/.plugin/plugin.json +19 -0
package/plugins/release-notes/README.md +283 -0
package/plugins/release-notes/SKILL.md +83 -0
package/plugins/release-notes/action.yml +117 -0
package/plugins/release-notes/commands/release-notes.md +8 -0
package/plugins/release-notes/scripts/agent_script.py +292 -0
package/plugins/release-notes/scripts/generate_release_notes.py +733 -0
package/plugins/release-notes/scripts/prompt.py +90 -0
package/plugins/release-notes/scripts/validate_release_notes.py +328 -0
package/plugins/release-notes/workflows/release-notes.yml +76 -0
package/plugins/vulnerability-remediation/.plugin/plugin.json +19 -0
package/plugins/vulnerability-remediation/README.md +217 -0
package/plugins/vulnerability-remediation/action.yml +187 -0
package/plugins/vulnerability-remediation/scripts/scan_and_remediate.py +561 -0
package/plugins/vulnerability-remediation/workflows/vulnerability-scan.yml +87 -0
package/pyproject.toml +12 -0
package/release-please-config.json +16 -0
package/scripts/sync_extensions.py +494 -0
package/scripts/sync_openhands_sdk_skill.py +264 -0
package/skills/README.md +159 -0
package/skills/add-javadoc/.plugin/plugin.json +18 -0
package/skills/add-javadoc/README.md +40 -0
package/skills/add-javadoc/SKILL.md +35 -0
package/skills/add-javadoc/references/example.md +32 -0
package/skills/add-skill/.plugin/plugin.json +18 -0
package/skills/add-skill/README.md +67 -0
package/skills/add-skill/SKILL.md +47 -0
package/skills/add-skill/scripts/fetch_skill.py +259 -0
package/skills/agent-creator/.plugin/plugin.json +20 -0
package/skills/agent-creator/README.md +104 -0
package/skills/agent-creator/SKILL.md +190 -0
package/skills/agent-creator/commands/agent-creator.md +8 -0
package/skills/agent-creator/references/fallback.md +117 -0
package/skills/agent-memory/.plugin/plugin.json +18 -0
package/skills/agent-memory/README.md +35 -0
package/skills/agent-memory/SKILL.md +30 -0
package/skills/agent-memory/commands/remember.md +8 -0
package/skills/agent-sdk-builder/.plugin/plugin.json +18 -0
package/skills/agent-sdk-builder/README.md +40 -0
package/skills/agent-sdk-builder/SKILL.md +37 -0
package/skills/agent-sdk-builder/commands/agent-builder.md +8 -0
package/skills/azure-devops/.plugin/plugin.json +18 -0
package/skills/azure-devops/README.md +55 -0
package/skills/azure-devops/SKILL.md +50 -0
package/skills/bitbucket/.plugin/plugin.json +17 -0
package/skills/bitbucket/README.md +50 -0
package/skills/bitbucket/SKILL.md +45 -0
package/skills/code-review/.plugin/plugin.json +19 -0
package/skills/code-review/README.md +18 -0
package/skills/code-review/SKILL.md +208 -0
package/skills/code-review/commands/codereview-roasted.md +8 -0
package/skills/code-review/commands/codereview.md +8 -0
package/skills/code-review/references/risk-evaluation.md +41 -0
package/skills/code-review/references/supply-chain-security.md +31 -0
package/skills/code-simplifier/.plugin/plugin.json +21 -0
package/skills/code-simplifier/README.md +30 -0
package/skills/code-simplifier/SKILL.md +91 -0
package/skills/code-simplifier/commands/simplify.md +8 -0
package/skills/code-simplifier/references/code-quality-review.md +86 -0
package/skills/code-simplifier/references/code-reuse-review.md +63 -0
package/skills/code-simplifier/references/efficiency-review.md +81 -0
package/skills/datadog/.plugin/plugin.json +19 -0
package/skills/datadog/README.md +100 -0
package/skills/datadog/SKILL.md +95 -0
package/skills/deno/.plugin/plugin.json +18 -0
package/skills/deno/README.md +5 -0
package/skills/deno/SKILL.md +99 -0
package/skills/deno/references/README.md +6 -0
package/skills/discord/.plugin/plugin.json +18 -0
package/skills/discord/README.md +31 -0
package/skills/discord/SKILL.md +109 -0
package/skills/discord/__init__.py +0 -0
package/skills/discord/references/REFERENCE.md +78 -0
package/skills/discord/scripts/__init__.py +0 -0
package/skills/discord/scripts/_http.py +127 -0
package/skills/discord/scripts/post_webhook.py +106 -0
package/skills/discord/scripts/send_message.py +102 -0
package/skills/docker/.plugin/plugin.json +17 -0
package/skills/docker/README.md +34 -0
package/skills/docker/SKILL.md +29 -0
package/skills/evidence-based-citations/.plugin/plugin.json +20 -0
package/skills/evidence-based-citations/README.md +31 -0
package/skills/evidence-based-citations/SKILL.md +59 -0
package/skills/flarglebargle/.plugin/plugin.json +16 -0
package/skills/flarglebargle/README.md +14 -0
package/skills/flarglebargle/SKILL.md +9 -0
package/skills/frontend-design/.plugin/plugin.json +21 -0
package/skills/frontend-design/LICENSE.txt +177 -0
package/skills/frontend-design/README.md +42 -0
package/skills/frontend-design/SKILL.md +42 -0
package/skills/github/.plugin/plugin.json +19 -0
package/skills/github/README.md +42 -0
package/skills/github/SKILL.md +106 -0
package/skills/github-pr-review/.plugin/plugin.json +18 -0
package/skills/github-pr-review/README.md +145 -0
package/skills/github-pr-review/SKILL.md +148 -0
package/skills/github-pr-review/commands/github-pr-review.md +8 -0
package/skills/github-pr-reviewer/.plugin/plugin.json +20 -0
package/skills/github-pr-reviewer/README.md +34 -0
package/skills/github-pr-reviewer/SKILL.md +89 -0
package/skills/github-pr-reviewer/commands/pr-reviewer:setup.md +8 -0
package/skills/github-repo-monitor/.plugin/plugin.json +22 -0
package/skills/github-repo-monitor/README.md +70 -0
package/skills/github-repo-monitor/SKILL.md +316 -0
package/skills/github-repo-monitor/commands/github-monitor:poll.md +8 -0
package/skills/github-repo-monitor/references/github-api.md +241 -0
package/skills/github-repo-monitor/references/state-schema.md +160 -0
package/skills/github-repo-monitor/scripts/main.py +915 -0
package/skills/github-repo-monitor/tests/test_main.py +400 -0
package/skills/gitlab/.plugin/plugin.json +17 -0
package/skills/gitlab/README.md +37 -0
package/skills/gitlab/SKILL.md +32 -0
package/skills/incident-retrospective/.plugin/plugin.json +21 -0
package/skills/incident-retrospective/README.md +34 -0
package/skills/incident-retrospective/SKILL.md +98 -0
package/skills/incident-retrospective/commands/incident-retro:setup.md +8 -0
package/skills/iterate/.plugin/plugin.json +13 -0
package/skills/iterate/README.md +25 -0
package/skills/iterate/SKILL.md +399 -0
package/skills/iterate/commands/babysit.md +8 -0
package/skills/iterate/commands/iterate.md +8 -0
package/skills/iterate/commands/verify.md +8 -0
package/skills/iterate/references/heuristics.md +58 -0
package/skills/iterate/references/verification.md +96 -0
package/skills/jupyter/.plugin/plugin.json +18 -0
package/skills/jupyter/README.md +55 -0
package/skills/jupyter/SKILL.md +50 -0
package/skills/kubernetes/.plugin/plugin.json +18 -0
package/skills/kubernetes/README.md +53 -0
package/skills/kubernetes/SKILL.md +48 -0
package/skills/learn-from-code-review/.plugin/plugin.json +19 -0
package/skills/learn-from-code-review/README.md +64 -0
package/skills/learn-from-code-review/SKILL.md +186 -0
package/skills/learn-from-code-review/commands/learn-from-reviews.md +8 -0
package/skills/linear/.plugin/plugin.json +19 -0
package/skills/linear/README.md +58 -0
package/skills/linear/SKILL.md +213 -0
package/skills/linear-triage/.plugin/plugin.json +21 -0
package/skills/linear-triage/README.md +34 -0
package/skills/linear-triage/SKILL.md +91 -0
package/skills/linear-triage/commands/linear-triage:setup.md +8 -0
package/skills/notion/.plugin/plugin.json +17 -0
package/skills/notion/README.md +114 -0
package/skills/notion/SKILL.md +109 -0
package/skills/npm/.plugin/plugin.json +17 -0
package/skills/npm/README.md +14 -0
package/skills/npm/SKILL.md +9 -0
package/skills/openhands-api/.plugin/plugin.json +22 -0
package/skills/openhands-api/README.md +48 -0
package/skills/openhands-api/SKILL.md +399 -0
package/skills/openhands-api/references/README.md +33 -0
package/skills/openhands-api/references/TROUBLESHOOTING.md +81 -0
package/skills/openhands-api/references/example_prompt.md +12 -0
package/skills/openhands-api/scripts/openhands_api.py +606 -0
package/skills/openhands-api/scripts/openhands_api.ts +252 -0
package/skills/openhands-automation/.plugin/plugin.json +19 -0
package/skills/openhands-automation/README.md +89 -0
package/skills/openhands-automation/SKILL.md +875 -0
package/skills/openhands-automation/commands/automation:create.md +8 -0
package/skills/openhands-automation/references/ab-testing.md +185 -0
package/skills/openhands-automation/references/custom-automation.md +644 -0
package/skills/openhands-sdk/.plugin/plugin.json +20 -0
package/skills/openhands-sdk/README.md +22 -0
package/skills/openhands-sdk/SKILL.md +229 -0
package/skills/openhands-sdk/commands/sdk.md +8 -0
package/skills/pdflatex/.plugin/plugin.json +18 -0
package/skills/pdflatex/README.md +39 -0
package/skills/pdflatex/SKILL.md +34 -0
package/skills/prd/.plugin/plugin.json +19 -0
package/skills/prd/README.md +28 -0
package/skills/prd/SKILL.md +237 -0
package/skills/prd/commands/prd.md +8 -0
package/skills/qa-changes/README.md +18 -0
package/skills/qa-changes/SKILL.md +229 -0
package/skills/qa-changes/commands/qa-changes.md +8 -0
package/skills/release-notes/README.md +24 -0
package/skills/release-notes/SKILL.md +19 -0
package/skills/release-notes/commands/release-notes.md +8 -0
package/skills/research-brief/.plugin/plugin.json +20 -0
package/skills/research-brief/README.md +34 -0
package/skills/research-brief/SKILL.md +99 -0
package/skills/research-brief/commands/research-brief:setup.md +8 -0
package/skills/security/.plugin/plugin.json +18 -0
package/skills/security/README.md +38 -0
package/skills/security/SKILL.md +33 -0
package/skills/skill-creator/.plugin/plugin.json +17 -0
package/skills/skill-creator/LICENSE.txt +202 -0
package/skills/skill-creator/README.md +182 -0
package/skills/skill-creator/SKILL.md +545 -0
package/skills/skill-creator/references/output-patterns.md +82 -0
package/skills/skill-creator/references/workflows.md +28 -0
package/skills/skill-creator/scripts/init_skill.py +303 -0
package/skills/skill-creator/scripts/quick_validate.py +95 -0
package/skills/slack-channel-monitor/.plugin/plugin.json +21 -0
package/skills/slack-channel-monitor/README.md +91 -0
package/skills/slack-channel-monitor/SKILL.md +276 -0
package/skills/slack-channel-monitor/commands/slack-monitor:poll.md +8 -0
package/skills/slack-channel-monitor/references/slack-api.md +207 -0
package/skills/slack-channel-monitor/references/state-schema.md +180 -0
package/skills/slack-channel-monitor/scripts/main.py +962 -0
package/skills/slack-standup-digest/.plugin/plugin.json +21 -0
package/skills/slack-standup-digest/README.md +34 -0
package/skills/slack-standup-digest/SKILL.md +92 -0
package/skills/slack-standup-digest/commands/standup-digest:setup.md +8 -0
package/skills/spark-version-upgrade/.plugin/plugin.json +20 -0
package/skills/spark-version-upgrade/README.md +54 -0
package/skills/spark-version-upgrade/SKILL.md +233 -0
package/skills/ssh/.plugin/plugin.json +18 -0
package/skills/ssh/README.md +140 -0
package/skills/ssh/SKILL.md +135 -0
package/skills/swift-linux/.plugin/plugin.json +17 -0
package/skills/swift-linux/README.md +86 -0
package/skills/swift-linux/SKILL.md +81 -0
package/skills/theme-factory/.plugin/plugin.json +19 -0
package/skills/theme-factory/LICENSE.txt +202 -0
package/skills/theme-factory/README.md +58 -0
package/skills/theme-factory/SKILL.md +59 -0
package/skills/theme-factory/theme-showcase.pdf +0 -0
package/skills/theme-factory/themes/arctic-frost.md +19 -0
package/skills/theme-factory/themes/botanical-garden.md +19 -0
package/skills/theme-factory/themes/desert-rose.md +19 -0
package/skills/theme-factory/themes/forest-canopy.md +19 -0
package/skills/theme-factory/themes/golden-hour.md +19 -0
package/skills/theme-factory/themes/midnight-galaxy.md +19 -0
package/skills/theme-factory/themes/modern-minimalist.md +19 -0
package/skills/theme-factory/themes/ocean-depths.md +19 -0
package/skills/theme-factory/themes/sunset-boulevard.md +19 -0
package/skills/theme-factory/themes/tech-innovation.md +19 -0
package/skills/uv/.plugin/plugin.json +18 -0
package/skills/uv/README.md +5 -0
package/skills/uv/SKILL.md +95 -0
package/skills/uv/references/README.md +5 -0
package/skills/vercel/.plugin/plugin.json +18 -0
package/skills/vercel/README.md +108 -0
package/skills/vercel/SKILL.md +103 -0
package/tests/test_add_skill_installs_to_agents_dir.py +42 -0
package/tests/test_catalogs.py +109 -0
package/tests/test_code_review_risk_evaluation.py +94 -0
package/tests/test_issue_duplicate_checker.py +240 -0
package/tests/test_openhands_api_python.py +152 -0
package/tests/test_plugin_manifest.py +83 -0
package/tests/test_pr_review_diff_payload.py +202 -0
package/tests/test_pr_review_feedback.py +263 -0
package/tests/test_pr_review_prompt.py +152 -0
package/tests/test_pr_review_review_context.py +253 -0
package/tests/test_qa_changes.py +232 -0
package/tests/test_qa_changes_evaluation.py +259 -0
package/tests/test_release_notes_generator.py +990 -0
package/tests/test_sdk_loading.py +150 -0
package/tests/test_skill_plugin_loading.py +149 -0
package/tests/test_skills_have_readme.py +66 -0
package/tests/test_sync_extensions.py +292 -0
package/tests/test_workflow_sync.py +46 -0
package/utils/analysis/README.md +7 -0
package/utils/analysis/laminar_signals/README.md +211 -0
package/utils/analysis/laminar_signals/analyze.py +780 -0
package/utils/analysis/laminar_signals/templates/default.j2 +49 -0
package/utils/analysis/laminar_signals/templates/pr_review.j2 +61 -0

package/skills/openhands-automation/commands/automation:create.md ADDED

@@ -0,0 +1,8 @@
+---
+# auto-generated by sync_extensions.py
+description: This skill should be used when the user asks to "create an automation", "schedule a task", "set up a cron job", "webhook integration", "event-triggered automation", or mentions automations, scheduled tasks, cron scheduling, or webhook events in OpenHands Cloud.
+---
+Read and follow the complete instructions in the SKILL.md file located in this skill's directory.
+$ARGUMENTS

package/skills/openhands-automation/references/ab-testing.md ADDED

@@ -0,0 +1,185 @@
+# A/B Testing for Plugin Automations
+Run A/B tests on plugin automations by defining **variants** — each with its own plugin set and selection weight — instead of a single `plugins` list. The automation service generates a tarball with all variant configs; at runtime, the SDK script selects a variant via weighted random and loads its plugins.
+> **Scope:** A/B testing is currently supported on the **plugin preset** endpoint only (`POST /v1/preset/plugin`). See [OpenHands/automation#147](https://github.com/OpenHands/automation/issues/147) for the roadmap to server-level variant support across all automation types.
+---
+## Quick Start
+Replace `plugins` with `variants` and add an `experiment_id`:
+```bash
+curl -X POST "${OPENHANDS_HOST}/api/automation/v1/preset/plugin" \
+  -H "Authorization: Bearer ${OPENHANDS_API_KEY}" \
+  -H "Content-Type: application/json" \
+  -d '{
+    "name": "Code Review A/B Test",
+    "experiment_id": "review-model-comparison",
+    "variants": [
+      {
+        "name": "control",
+        "weight": 50,
+        "plugins": [{"source": "github:owner/review-plugin", "ref": "v1.0.0"}]
+      },
+      {
+        "name": "treatment",
+        "weight": 50,
+        "plugins": [{"source": "github:owner/review-plugin", "ref": "v2.0.0-beta"}]
+      }
+    ],
+    "prompt": "Review this pull request for code quality and potential bugs.",
+    "trigger": {
+      "type": "event",
+      "source": "github",
+      "on": "pull_request.opened"
+    }
+  }'
+```
+## How It Works
+1. **At creation time**, the service generates a single tarball containing an `experiment_config.json` with all variant definitions (names, weights, plugin configs) alongside the SDK entrypoint and prompt.
+2. **At runtime**, `sdk_main.py` reads `experiment_config.json`, performs weighted-random selection across variants, and loads the selected variant's plugins.
+3. **Experiment metadata** (`experiment_id` and `variant` name) is attached as conversation tags, allowing you to filter and compare runs by variant in the UI.
+## API Reference
+### Request Fields
+`plugins` and `variants` are **mutually exclusive** — provide exactly one.
+| Field | Required | Description |
+|-------|----------|-------------|
+| `variants` | Yes* | List of experiment variants (2–10). Replaces `plugins`. |
+| `experiment_id` | Yes* | Human-readable experiment identifier (1–200 chars). Required when using `variants`. |
+*Required only for A/B tests. Standard plugin automations use `plugins` instead.
+All other fields (`name`, `prompt`, `trigger`, `timeout`, `repos`, `model`) are identical to the standard plugin preset request.
+### Variant Object
+| Field | Required | Type | Description |
+|-------|----------|------|-------------|
+| `name` | Yes | string | Unique variant name (1–100 chars) |
+| `weight` | Yes | integer | Relative selection weight (> 0) |
+| `plugins` | Yes | array | Plugin source(s) for this variant (at least one) |
+### Validation Rules
+- Exactly **one of** `plugins` or `variants` must be provided (not both, not neither)
+- `experiment_id` is **required** with `variants`, **forbidden** with `plugins`
+- At least **2** variants, at most **10**
+- Variant **names must be unique** within an experiment
+- Each variant must have at least **1 plugin**
+- Weights are relative — `[50, 50]` and `[1, 1]` both give 50/50 selection
+## Variant Selection
+Selection uses Python's `random.choices` with the configured weights. The probability of selecting variant *i* is:
+```
+P(variant_i) = weight_i / sum(all_weights)
+```
+Examples:
+- `[50, 50]` → 50% / 50%
+- `[80, 20]` → 80% / 20%
+- `[1, 1, 1]` → 33.3% each
+- `[70, 20, 10]` → 70% / 20% / 10%
+Selection happens independently on every run — there is no cross-run state or session stickiness.
+## Examples
+### Compare two plugin versions
+```json
+{
+  "name": "Plugin v2 Rollout",
+  "experiment_id": "plugin-v2-rollout",
+  "variants": [
+    {
+      "name": "stable",
+      "weight": 80,
+      "plugins": [{"source": "github:myorg/my-plugin", "ref": "v1.4.2"}]
+    },
+    {
+      "name": "canary",
+      "weight": 20,
+      "plugins": [{"source": "github:myorg/my-plugin", "ref": "v2.0.0-rc1"}]
+    }
+  ],
+  "prompt": "Run the standard analysis workflow.",
+  "trigger": {"type": "cron", "schedule": "0 9 * * 1-5"}
+}
+```
+### Test different plugin combinations
+```json
+{
+  "name": "Review Pipeline Experiment",
+  "experiment_id": "review-pipeline-2026",
+  "variants": [
+    {
+      "name": "basic",
+      "weight": 1,
+      "plugins": [{"source": "github:myorg/code-review"}]
+    },
+    {
+      "name": "enhanced",
+      "weight": 1,
+      "plugins": [
+        {"source": "github:myorg/code-review"},
+        {"source": "github:myorg/security-scanner"}
+      ]
+    }
+  ],
+  "prompt": "Review the PR and report findings.",
+  "trigger": {
+    "type": "event",
+    "source": "github",
+    "on": "pull_request.opened",
+    "filter": "contains(pull_request.labels[].name, 'needs-review')"
+  }
+}
+```
+### Three-way comparison
+```json
+{
+  "name": "Scanner Comparison",
+  "experiment_id": "scanner-eval-q3",
+  "variants": [
+    {"name": "scanner-a", "weight": 1, "plugins": [{"source": "github:myorg/scanner-a"}]},
+    {"name": "scanner-b", "weight": 1, "plugins": [{"source": "github:myorg/scanner-b"}]},
+    {"name": "scanner-c", "weight": 1, "plugins": [{"source": "github:myorg/scanner-c"}]}
+  ],
+  "prompt": "Scan the repository and produce a findings report.",
+  "trigger": {"type": "cron", "schedule": "0 2 * * 0"}
+}
+```
+## Observability
+Each experiment run tags the conversation with:
+| Tag | Value |
+|-----|-------|
+| `experiment_id` | The `experiment_id` from the request |
+| `variant` | The name of the selected variant |
+Use these tags to filter runs in the OpenHands UI and compare outcomes across variants.
+## Limitations
+- **Plugin preset only** — A/B testing is not yet supported for prompt presets or custom automations. See [#147](https://github.com/OpenHands/automation/issues/147) for the server-level variant selection roadmap.
+- **No session stickiness** — each run independently selects a variant. There is no user- or session-based assignment.
+- **No built-in metrics** — the platform records which variant ran (via tags) but does not compute statistical significance. Export run data for external analysis.
+- **Single prompt** — all variants share the same prompt. To test different prompts, use separate automations.
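
Note on the hunk above (not part of the diffed file): the validation rules and the weighted selection that `ab-testing.md` describes can be illustrated with a minimal Python sketch. It assumes variants are plain dictionaries shaped like the JSON examples in the hunk. The function names `validate_variants`, `selection_probabilities`, and `select_variant` are hypothetical and are not taken from `sdk_main.py` or the automation service; only `random.choices` is named by the source.

```python
import random


def validate_variants(variants):
    """Check the documented rules: 2-10 variants, unique names,
    positive integer weights, at least one plugin per variant.
    (Hypothetical helper; the real service may validate differently.)"""
    if not 2 <= len(variants) <= 10:
        raise ValueError("an experiment needs between 2 and 10 variants")
    names = [v["name"] for v in variants]
    if len(set(names)) != len(names):
        raise ValueError("variant names must be unique")
    for v in variants:
        if not isinstance(v["weight"], int) or v["weight"] <= 0:
            raise ValueError(f"variant {v['name']!r}: weight must be an integer > 0")
        if not v["plugins"]:
            raise ValueError(f"variant {v['name']!r}: at least one plugin is required")


def selection_probabilities(variants):
    """P(variant_i) = weight_i / sum(all_weights), keyed by variant name."""
    total = sum(v["weight"] for v in variants)
    return {v["name"]: v["weight"] / total for v in variants}


def select_variant(variants, rng=random):
    """Pick one variant by weighted random selection.
    Each call is independent, so there is no stickiness across runs."""
    validate_variants(variants)
    weights = [v["weight"] for v in variants]
    return rng.choices(variants, weights=weights, k=1)[0]


if __name__ == "__main__":
    # Same shape as the "Compare two plugin versions" example.
    variants = [
        {"name": "stable", "weight": 80,
         "plugins": [{"source": "github:myorg/my-plugin", "ref": "v1.4.2"}]},
        {"name": "canary", "weight": 20,
         "plugins": [{"source": "github:myorg/my-plugin", "ref": "v2.0.0-rc1"}]},
    ]
    print(selection_probabilities(variants))
    print(select_variant(variants)["name"])
```

Passing a seeded `random.Random` instance as `rng` makes a run reproducible for testing. Because the weights are relative, `[80, 20]` and `[4, 1]` would yield the same probabilities in this sketch.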