npm - @zibby/skills - Versions diffs - 0.1.11 → 0.1.13 - Mend

@zibby/skills 0.1.11 → 0.1.13

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29) hide show

package/dist/index.js +5 -5
package/dist/package.json +3 -3
package/dist/skill-installer.js +5 -5
package/docs/cli-reference.md +120 -256
package/docs/cloning-repositories.md +2 -2
package/docs/cloud/bundles.md +92 -0
package/docs/cloud/dedicated-egress.md +140 -0
package/docs/cloud/env-vars.md +144 -0
package/docs/cloud/limits.md +81 -0
package/docs/cloud/logs.md +104 -0
package/docs/cloud/triggering.md +114 -0
package/docs/concepts/agents.md +112 -0
package/docs/concepts/graph.md +83 -0
package/docs/concepts/sessions.md +70 -0
package/docs/concepts/skills.md +84 -0
package/docs/concepts/state.md +106 -0
package/docs/get-started/deploy.md +75 -0
package/docs/get-started/install.md +58 -0
package/docs/get-started/run-locally.md +94 -0
package/docs/get-started/trigger-and-logs.md +90 -0
package/docs/get-started/your-first-workflow.md +66 -0
package/docs/intro.md +37 -65
package/docs/legacy/test-automation.md +110 -0
package/docs/packages/agent-workflow.md +88 -0
package/docs/packages/cli.md +42 -207
package/docs/packages/core.md +40 -224
package/docs/recipes/index.md +62 -0
package/docs/recipes/test.md +154 -0
package/package.json +3 -3

package/docs/recipes/test.md ADDED Viewed

@@ -0,0 +1,154 @@
+---
+sidebar_position: 2
+title: Browser test recipe (zibby test)
+---
+# `zibby test` — browser test recipe
+The browser-test recipe takes a plain-English spec, drives a real browser via a coding agent (Cursor / Claude / Codex / Gemini), runs the assertions, and produces a Playwright script + verification video.
+It's a worked example of what the Zibby platform does — every step is a regular workflow node with Zod-validated handoff. You can read the source, fork it, or build your own variation.
+## Quick start
+```bash
+# Inline spec
+zibby test "Go to https://example.com and verify the title is 'Example Domain'"
+# Spec file
+zibby test test-specs/login.txt
+# With a specific agent
+zibby test test-specs/checkout.txt --agent claude
+```
+## What it produces
+```
+.zibby/output/sessions/<session-id>/
+├── execute_live/
+│   ├── result.json          ← Zod-validated assertions + agent reasoning
+│   └── browser-trace/       ← Playwright trace files
+├── generate_script/
+│   ├── result.json          ← parsed script + metadata
+│   └── generated.spec.js    ← reusable Playwright test
+└── video/
+    └── recording.webm       ← visual verification
+```
+Open the session in [Zibby Studio](https://zibby.app/studio) to scrub through the run, swap the prompt, re-execute any node.
+## The graph (this is just a Zibby workflow)
+Under the hood, `zibby test` is a 3-node graph:
+```
+   ┌──────────────┐    ┌──────────────────┐    ┌─────────────────┐
+   │  preflight   │ →  │   execute_live   │ →  │ generate_script │
+   │              │    │                  │    │                 │
+   │ extract      │    │ agent drives     │    │ produce         │
+   │ assertions   │    │ browser via MCP, │    │ Playwright      │
+   │ from spec    │    │ records video    │    │ test file       │
+   └──────────────┘    └──────────────────┘    └─────────────────┘
+        │                     │                       │
+     Zod out               Zod out                 Zod out
+   (Assertions)         (BrowserResult)         (PlaywrightScript)
+```
+Each node is a real `WorkflowGraph` node. The agent in `execute_live` does its own tool loop (browser navigation, click, assertion checking) — Zibby just defines the contract.
+## Customizing
+**Use a different agent per run:**
+```bash
+zibby test test-specs/checkout.txt --agent claude    # Claude Code
+zibby test test-specs/checkout.txt --agent cursor    # Cursor (default)
+zibby test test-specs/checkout.txt --agent codex     # OpenAI Codex
+```
+**Run only one node** (e.g. just regenerate the script from an existing run):
+```bash
+zibby test --session 1768974629717 --node generate_script
+```
+**Headless vs headed:**
+```bash
+zibby test test-specs/login.txt              # headed (default — see the browser)
+zibby test test-specs/login.txt --headless   # headless mode (for CI)
+```
+## Forking the recipe
+If the built-in recipe doesn't fit your case, scaffold a custom workflow and copy the structure:
+```bash
+zibby workflow new my-test-pipeline
+```
+Then in `graph.mjs`, define your own nodes:
+```js
+import { WorkflowGraph, z } from '@zibby/agent-workflow';
+const AssertionsSchema = z.object({
+  assertions: z.array(z.string()),
+  baseUrl: z.string().url(),
+});
+const BrowserResultSchema = z.object({
+  passed: z.boolean(),
+  details: z.array(z.object({ assertion: z.string(), passed: z.boolean() })),
+  videoPath: z.string().optional(),
+});
+const graph = new WorkflowGraph();
+graph.addNode('preflight', {
+  agent: 'claude',
+  prompt: ({ spec }) => `Extract assertions and base URL from: ${spec}`,
+  outputSchema: AssertionsSchema,
+});
+graph.addNode('execute_live', {
+  agent: 'cursor',
+  skills: ['browser'],
+  prompt: ({ preflight }) => `Navigate to ${preflight.baseUrl} and verify: ${preflight.assertions.join('; ')}`,
+  outputSchema: BrowserResultSchema,
+});
+graph.addEdge('preflight', 'execute_live');
+graph.setEntryPoint('preflight');
+export default graph;
+```
+That's the platform. The recipe is just a starter.
+## CI/CD
+```yaml
+- name: Run Zibby test
+  env:
+    ZIBBY_USER_TOKEN: ${{ secrets.ZIBBY_USER_TOKEN }}
+  run: |
+    npx @zibby/cli test test-specs/checkout.txt --headless
+```
+For workflows triggered remotely (rather than per-CI-run), use [`workflow trigger`](../cloud/triggering) on a deployed graph.
+## Why this is different from Playwright codegen / a basic LLM script
+| | Playwright codegen | LLM-only script | Zibby test recipe |
+|---|---|---|---|
+| Plain-English input | ❌ | ✅ | ✅ |
+| Real browser execution | ✅ | ❌ (just generates code) | ✅ |
+| Coding-agent driven | ❌ | partial | ✅ Cursor / Claude / Codex |
+| Multi-step verification | ❌ | ❌ | ✅ Zod-validated nodes |
+| Replayable + debuggable | ❌ | ❌ | ✅ Studio |
+| Vendor-neutral | N/A | locked to one LLM | swap agent per run |
+## See also
+- [Recipes overview](./index)
+- [Concepts: graph](../concepts/graph) — the primitives this recipe uses
+- [Cloud triggering](../cloud/triggering) — fire workflows from CI/CD

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@zibby/skills",
-  "version": "0.1.11",
+  "version": "0.1.13",
   "description": "Built-in skill definitions for Zibby test automation framework",
   "type": "module",
   "main": "dist/index.js",
@@ -28,7 +28,7 @@
   ],
   "author": "Zibby",
   "license": "MIT",
-  "homepage": "https://zibby.app",
+  "homepage": "https://zibby.dev",
   "repository": {
     "type": "git",
     "url": "https://github.com/ZibbyHQ/zibby-agent"
@@ -46,7 +46,7 @@
     "node": ">=18.0.0"
   },
   "dependencies": {
-    "@zibby/agent-workflow": "^0.1.2"
+    "@zibby/agent-workflow": "^0.3.0"
   },
   "peerDependencies": {
     "@zibby/core": ">=0.1.44"