openbuilder 0.1.1 → 0.1.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +54 -16
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -4,6 +4,10 @@
4
4
 
5
5
  Get meeting summaries, action items, key decisions, and speaker analytics — all from your terminal.
6
6
 
7
+ <p align="center">
8
+ <img src="docs/demo-telegram.jpg" alt="OpenBuilder in action — Telegram bot joining a Google Meet, taking screenshots, and reporting on the meeting in real time" width="500">
9
+ </p>
10
+
7
11
  ## Features
8
12
 
9
13
  - **Audio Capture + Whisper** — Captures browser audio via PulseAudio virtual sink, transcribes with OpenAI Whisper (no captions needed)
@@ -293,35 +297,68 @@ To set up OpenBuilder as an automated meeting bot (e.g. for OpenClaw agents):
293
297
  - Caption mode (`--captions`) is the most reliable on headless servers
294
298
  - Audio mode (`--audio`) requires PulseAudio + ffmpeg + OpenAI key + Xvfb (experimental on servers)
295
299
 
296
- ## Running on Your Own Hardware
300
+ ## Platform Compatibility
301
+
302
+ OpenBuilder works anywhere OpenClaw runs, as long as Playwright Chromium is available.
297
303
 
298
- OpenBuilder works on any machine running OpenClaw — Mac Mini, laptop, Raspberry Pi, VPS, etc.
304
+ | Platform | Status | Notes |
305
+ |----------|--------|-------|
306
+ | **macOS** (Mac Mini, MacBook — Intel & Apple Silicon) | ✅ Tested | Chromium installs automatically. Captions mode works great. No PulseAudio needed. |
307
+ | **Linux x64** (Ubuntu, Debian, VPS, EC2) | ✅ Tested | May need system deps: `npx playwright-core install-deps chromium`. Headless servers need Xvfb. |
308
+ | **Windows** (via WSL2) | ✅ Should work | Same as Linux x64. Not yet tested — feedback welcome. |
309
+ | **Docker / Podman** | ✅ Should work | Run `npx playwright-core install-deps chromium` in container. Not yet tested. |
310
+ | **Raspberry Pi** (ARM64, Pi 4/5) | ⚠️ Not tested | Playwright Chromium ARM builds can be unreliable. May need manual Chromium install. |
311
+ | **Raspberry Pi 3** (ARMv7) | ❌ Not supported | Playwright does not ship ARMv7 Chromium. |
299
312
 
300
313
  ### macOS (Mac Mini, MacBook)
314
+
301
315
  - `npx openbuilder` installs Chromium automatically
302
- - Audio capture mode is not available (no PulseAudio) — captions mode works great
316
+ - Audio capture mode is not available (no PulseAudio) — captions mode is the default and works great
303
317
  - Auth: `npx openbuilder auth` opens a browser window — sign in and press Enter
304
- - For headless operation: use `--auto` auth with `.env` credentials
318
+ - For headless/unattended operation: use `--auto` auth with `.env` credentials
319
+
320
+ ### Linux (Ubuntu, Debian, VPS, EC2)
321
+
322
+ - Install system dependencies: `npx playwright-core install-deps chromium`
323
+ - For headless servers (no display): start Xvfb first — `Xvfb :99 -screen 0 1280x720x24 &` and `export DISPLAY=:99`
324
+ - Audio capture mode available if PulseAudio + ffmpeg are installed (optional — captions mode works without them)
325
+
326
+ ### VPS Providers
305
327
 
306
- ### Linux (Ubuntu, Debian, etc.)
307
- - May need system deps: `npx playwright-core install-deps chromium`
308
- - For headless servers: ensure Xvfb is running (`Xvfb :99 -screen 0 1280x720x24 &`)
309
- - Audio mode available if PulseAudio + ffmpeg are installed
328
+ Works on any VPS that runs OpenClaw:
329
+
330
+ - **AWS** (EC2, Lightsail) tested
331
+ - **Oracle Cloud** (Always Free tier) should work
332
+ - **Hetzner**, **Fly.io**, **GCP**, **Railway**, **Render** — should work (Linux x64)
333
+ - **DigitalOcean** — should work
334
+
335
+ Minimum: 1 vCPU, 1GB RAM (Chromium is the main resource consumer).
336
+
337
+ ### Requirements
310
338
 
311
- ### What you need
312
339
  1. **Node.js 18+**
313
340
  2. **OpenClaw** running on the machine
314
- 3. **A Google account** for the bot (or join as guest)
315
- 4. **An AI API key** (Claude or OpenAI) for meeting reports
341
+ 3. **A Google account** for the bot (or join as guest with `--anon`)
342
+ 4. **An AI API key** Claude (Anthropic) or OpenAI for meeting reports (optional but recommended)
343
+
344
+ ### Quick Setup (any platform)
316
345
 
317
- ### Quick setup (any platform)
318
346
  ```bash
319
- npx openbuilder # Install + Chromium
347
+ npx openbuilder # Install skill + Chromium
320
348
  npx openbuilder auth # Sign into Google (one-time)
321
349
  npx openbuilder config set anthropicApiKey sk-ant-... # For AI reports
322
350
  ```
323
351
 
324
- Then tell your OpenClaw agent: "Join this meeting: \<url\>"
352
+ Then tell your OpenClaw agent: "Join this meeting: https://meet.google.com/..."
353
+
354
+ ### Known Limitations
355
+
356
+ - **Google Meet only** — Zoom and Teams support planned for a future release
357
+ - **Captions depend on Google Meet's CC feature** — if Meet changes their DOM structure, caption scraping may break
358
+ - **Audio capture mode is experimental** on headless servers — PulseAudio routing can be unreliable. Use `--captions` for reliability
359
+ - **Bot appears as a participant** — other meeting participants will see "Super Liang" (or your bot's Google account name) in the People panel
360
+ - **Google may block automated logins** — if Google flags the bot account, re-run `npx openbuilder auth` interactively or from a different IP
361
+ - **One meeting at a time** per bot instance
325
362
 
326
363
  ## OpenClaw Integration
327
364
 
@@ -329,8 +366,9 @@ OpenBuilder ships as an OpenClaw skill. After running `npx openbuilder`, it's av
329
366
 
330
367
  - Join meetings on your behalf
331
368
  - Capture and summarize transcripts
332
- - Generate full meeting reports
333
- - Send screenshots to your chat
369
+ - Generate full meeting reports with action items and decisions
370
+ - Send screenshots to your chat on demand
371
+ - Tell you what's being discussed in real time
334
372
 
335
373
  See [SKILL.md](./SKILL.md) for the full agent integration guide.
336
374
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "openbuilder",
3
- "version": "0.1.1",
3
+ "version": "0.1.3",
4
4
  "description": "Open-source AI meeting assistant — join Google Meet, capture transcripts, generate AI-powered meeting reports with summaries, action items, and speaker analytics",
5
5
  "type": "module",
6
6
  "bin": {