screenhand 0.3.1 → 0.3.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/mcp-desktop.js +12 -2
- package/package.json +1 -1
package/dist/mcp-desktop.js
CHANGED
|
@@ -318,6 +318,16 @@ const server = new McpServer({ name: "screenhand", version: "3.0.0" }, {
|
|
|
318
318
|
- **supervisor_start()** — background daemon that detects stalled agents and recovers.
|
|
319
319
|
- Pattern: session_claim() → do work → session_heartbeat() periodically → session_release()
|
|
320
320
|
|
|
321
|
+
### Planning (let ScreenHand figure out the steps)
|
|
322
|
+
- **plan_goal("Export video as H.264")** — describe WHAT you want, ScreenHand generates a step-by-step plan. It searches playbooks, saved strategies, and reference knowledge to build the plan. Does NOT execute — returns the plan for review.
|
|
323
|
+
- **plan_execute(goalId)** — run the plan automatically. Deterministic steps (known selectors/flows) run internally. Pauses at LLM steps where your judgment is needed — resolve them with plan_step_resolve().
|
|
324
|
+
- **plan_step(goalId)** — execute one step at a time (for more control than plan_execute).
|
|
325
|
+
- **plan_step_resolve(goalId, tool, params)** — when a plan pauses at an LLM step, YOU decide which tool and params to use. The server executes it, verifies postconditions, and advances.
|
|
326
|
+
- **plan_status(goalId)** — check progress: which step you're on, what's done, what's left.
|
|
327
|
+
- **plan_list()** — see all goals (active, completed, failed).
|
|
328
|
+
- **plan_cancel(goalId)** — abort a goal.
|
|
329
|
+
- Pattern: plan_goal("do X") → review steps → plan_execute() → resolve LLM steps as they pause → on success, strategy auto-saved to memory
|
|
330
|
+
|
|
321
331
|
## Tool Selection Priority
|
|
322
332
|
1. **ui_tree + ui_press** for native app elements (fastest, most reliable)
|
|
323
333
|
2. **browser_* tools** for web content in Chrome/Electron
|
|
@@ -4889,7 +4899,7 @@ originalTool("worker_status", "Get the current status of the worker daemon (read
|
|
|
4889
4899
|
// ═══════════════════════════════════════════════
|
|
4890
4900
|
// PLANNER — goal-oriented planning
|
|
4891
4901
|
// ═══════════════════════════════════════════════
|
|
4892
|
-
originalTool("plan_goal", "
|
|
4902
|
+
originalTool("plan_goal", "Describe WHAT you want to achieve — ScreenHand builds a step-by-step plan by searching playbooks, saved strategies, and platform references. Returns steps with confidence scores. Does NOT execute — review the plan, then use plan_execute() or plan_step() to run it. Use for complex multi-step workflows instead of figuring out each step yourself.", {
|
|
4893
4903
|
goal: z.string().describe("What you want to achieve (e.g. 'Export Premiere Pro timeline as H.264')"),
|
|
4894
4904
|
}, async ({ goal: goalDescription }) => {
|
|
4895
4905
|
const goal = planner.createGoal(goalDescription);
|
|
@@ -4924,7 +4934,7 @@ originalTool("plan_goal", "Create a goal and generate an execution plan. Returns
|
|
|
4924
4934
|
_meta: { goalId: goal.id, plan },
|
|
4925
4935
|
};
|
|
4926
4936
|
});
|
|
4927
|
-
originalTool("plan_execute", "
|
|
4937
|
+
originalTool("plan_execute", "Run a plan automatically. Known steps (from playbooks/references) execute internally at full speed. Pauses at LLM steps where YOUR judgment is needed — call plan_step_resolve() to provide the tool+params. On completion, the successful strategy is auto-saved to memory for future reuse.", {
|
|
4928
4938
|
goalId: z.string().describe("Goal ID from plan_goal"),
|
|
4929
4939
|
}, async ({ goalId }) => {
|
|
4930
4940
|
const goal = goalStore.get(goalId);
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "screenhand",
|
|
3
|
-
"version": "0.3.
|
|
3
|
+
"version": "0.3.2",
|
|
4
4
|
"mcpName": "io.github.manushi4/screenhand",
|
|
5
5
|
"description": "Give AI eyes and hands on your desktop. ScreenHand is an open-source MCP server that lets Claude and other AI agents see your screen, click buttons, type text, and control any app on macOS and Windows.",
|
|
6
6
|
"homepage": "https://screenhand.com",
|