npm - pikiclaw - Versions diffs - 0.2.66 → 0.2.67 - Mend

pikiclaw 0.2.66 → 0.2.67

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md CHANGED Viewed

@@ -4,13 +4,13 @@
 **Put the world's smartest AI agents in your pocket. Command local Claude, Codex & Gemini via best IM.**
-*让最好用的 IM 变成你电脑上的顶级 Agent 控制台*
+*Let the best IM app become a top-tier Agent console on your computer*
-> npx pikiclaw@latest
-<img src="docs/promo-install.gif" alt="Quick install" width="700">
+```
+npx pikiclaw@latest
+```
-<p align="center">
+<p>
 <a href="https://www.npmjs.com/package/pikiclaw"><img src="https://img.shields.io/npm/v/pikiclaw" alt="npm"></a>
 <a href="LICENSE"><img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License: MIT"></a>
 <a href="https://nodejs.org"><img src="https://img.shields.io/badge/Node.js-18+-green.svg" alt="Node.js 18+"></a>
@@ -18,188 +18,191 @@
 </div>
+## Demo
+> Real task: ask pikiclaw to gather and summarize today's AI news — the agent reads, writes, and sends results back through Telegram, all from your phone.
+<video src="docs/promo-demo.mp4" width="700" controls muted></video>
+> Basic operations: send a message, watch the agent stream, receive files back.
+<img src="docs/promo-basic-ops.gif" alt="Basic operations" width="700">
 ---
 ## Why pikiclaw?
-很多“IM 接 Agent”的方案，本质上还是在绕路：
+Most "IM + Agent" solutions either reinvent the agent (worse than official CLIs), run in remote sandboxes (not your environment), or only support short conversations (unusable for real tasks).
-- 要么自己造 Agent，效果不如官方 CLI
-- 要么跑在远端沙盒里，不是你的环境
-- 要么只能短对话，不适合长任务
+pikiclaw takes a different approach:
-pikiclaw 的目标很直接：
-- 用官方 Agent CLI，而不是重新发明一套
-- 用你自己的电脑，而不是陌生沙盒
-- 用你已经在用的 IM，而不是再学一套远程控制方式
+- **Official Agent CLIs** — Claude Code, Codex, Gemini CLI as-is, not a home-grown wrapper
+- **Your own machine** — local files, local tools, local environment
+- **Your existing IM** — Telegram, Feishu, or WeChat, no new app to learn
 ```
-  你（Telegram / 飞书）
-          │
-          ▼
+  You (Telegram / Feishu / WeChat)
+          |
+          v
        pikiclaw
-          │
-          ▼
-  Claude Code / Codex / Gemini
-          │
-          ▼
-       你的电脑
+          |
+          v
+  Claude Code / Codex / Gemini CLI
+          |
+          v
+     Your Computer
 ```
-它适合的不是”演示一次 AI”，而是你离开电脑以后，Agent 还能继续在本机把事做完。
-### 在 Telegram 里长这样
-<table>
-<tr>
-<td align=”center”><b>命令与 Agent 切换</b><br><img src=”docs/promo-tg-commands.png” alt=”Commands” width=”320”></td>
-<td align=”center”><b>代码审查</b><br><img src=”docs/promo-tg-task.png” alt=”Code review” width=”320”></td>
-</tr>
-<tr>
-<td align=”center”><b>多轮编码 + 文件回传</b><br><img src=”docs/promo-tg-complex.png” alt=”Complex task” width=”320”></td>
-<td align=”center”><b>状态监控 + 会话管理</b><br><img src=”docs/promo-tg-sessions.png” alt=”Sessions” width=”320”></td>
-</tr>
-</table>
+It's designed for the moment you walk away from your desk — the agent keeps working locally, and you stay in control from your phone.
 ---
 ## Quick Start
-### 准备
+### Prerequisites
 - Node.js 18+
-- 本机已安装并登录任意一个 Agent CLI
-  - [`claude`](https://docs.anthropic.com/en/docs/claude-code)
-  - [`codex`](https://github.com/openai/codex)
-  - [`gemini`](https://github.com/google-gemini/gemini-cli)
-- Telegram Bot Token 或飞书应用凭证
+- At least one Agent CLI installed and logged in:
+  - [`claude`](https://docs.anthropic.com/en/docs/claude-code) (Claude Code)
+  - [`codex`](https://github.com/openai/codex) (Codex CLI)
+  - [`gemini`](https://github.com/google-gemini/gemini-cli) (Gemini CLI)
+- A bot token for your IM channel (Telegram Bot Token, Feishu app credentials, or WeChat account)
-### 启动
+### Install & Launch
 ```bash
 cd your-workspace
 npx pikiclaw@latest
 ```
-默认会打开 Web Dashboard：`http://localhost:3939`
+<img src="docs/promo-install.gif" alt="Quick install" width="700">
-你可以在 Dashboard 里完成：
+This opens the **Web Dashboard** at `http://localhost:3939`, where you can:
-- 渠道配置
-- 默认 Agent / 模型设置
-- 工作目录切换
-- 会话和运行状态查看
+- Connect IM channels (Telegram / Feishu / WeChat)
+- Configure agents and models
+- Manage macOS system permissions
+- Set up browser & desktop automation extensions
+- Monitor sessions and system resources
 <details>
-<summary>Dashboard 截图</summary>
+<summary>Alternative: terminal setup wizard</summary>
-**配置管理** — IM 接入、AI Agent、系统权限
+```bash
+npx pikiclaw@latest --setup   # interactive terminal wizard
+npx pikiclaw@latest --doctor  # check environment only
+```
-<img src="docs/promo-dashboard-config.png" alt="Config" width="700">
+</details>
-**会话管理** — 按 Agent 分组的会话泳道
+---
-<img src="docs/promo-dashboard-sessions.png" alt="Sessions" width="700">
+## Dashboard
-</details>
+<details>
+<summary>Expand to see all dashboard pages</summary>
-如果你更喜欢终端向导：
+**IM Access** — Telegram, Feishu, WeChat channel status and configuration
-```bash
-npx pikiclaw@latest --setup
-```
+<img src="docs/promo-dashboard-im.png" alt="IM Access" width="700">
-如果只是检查环境：
+**Agent Config** — Default agent / model / reasoning effort, available agents overview
-```bash
-npx pikiclaw@latest --doctor
-```
+<img src="docs/promo-dashboard-agents.png" alt="Agent Config" width="700">
+**System Permissions** — macOS accessibility, screen recording, disk access
+<img src="docs/promo-dashboard-permissions.png" alt="Permissions" width="700">
+**Extensions** — Managed browser & desktop automation (Appium Mac2)
+<img src="docs/promo-dashboard-extensions.png" alt="Extensions" width="700">
+**Sessions** — Per-agent session list and runtime status
+<img src="docs/promo-dashboard-sessions.png" alt="Sessions" width="700">
+**System Info** — Working directory, CPU / memory / disk monitoring
+<img src="docs/promo-dashboard-system.png" alt="System Info" width="700">
+</details>
 ---
-## Current Capabilities
+## Features
-### Channels And Agents
+### Channels & Agents
-- Telegram、飞书都可用，也可以同时启动
-- Claude Code、Codex CLI、Gemini CLI 都已接入
-- agent 通过统一 driver registry 管理，模型列表、session 列表、usage 展示走同一套接口
+- Telegram, Feishu, and WeChat — run one or all simultaneously
+- Claude Code, Codex CLI, and Gemini CLI via unified driver registry
+- Model listing, session management, and usage tracking through a single interface
 ### Runtime
-- 流式预览和持续消息更新
-- 会话切换、恢复和多轮续聊
-- 工作目录浏览与切换
-- 文件附件自动进入 session workspace
-- 长任务防休眠、watchdog 守护和自动重启
-- 长文本自动拆分，图片和文件可直接回传到 IM
-- Dashboard 可查看运行状态、sessions、usage、主机状态和 macOS 权限状态
+- Streaming preview with continuous message updates
+- Session switching, resume, and multi-turn conversations
+- Working directory browsing and switching
+- File attachments automatically enter the session workspace
+- Long-task sleep prevention, watchdog, and auto-restart
+- Long text auto-splitting; images and files sent back to IM directly
+- Light / dark theme and i18n (Chinese & English)
 ### Skills
-- 项目级 skills 以 `.pikiclaw/skills/*/SKILL.md` 为 canonical 入口
-- 兼容 `.claude/commands/*.md`
-- 兼容 legacy `.claude/skills` / `.agents/skills`，并可合并回 `.pikiclaw/skills`
-- IM 内可通过 `/skills` 和 `/sk_<name>` 触发
+- Project-level skills at `.pikiclaw/skills/*/SKILL.md`
+- Compatible with `.claude/commands/*.md`
+- Legacy `.claude/skills` / `.agents/skills` support with migration path
+- Trigger via `/skills` and `/sk_<name>` in chat
 ### Codex Human Loop
-当 Codex 在运行过程中请求额外用户输入时，pikiclaw 会把问题转成 Telegram / 飞书里的交互提示，用户回复后再继续当前任务。
+When Codex requests additional user input mid-task, pikiclaw surfaces the question as an interactive prompt in your IM. Reply there and the task continues.
-### MCP And GUI Automation
+### MCP & GUI Automation
-每次 Agent stream 都会启动一个会话级 MCP bridge，把本地工具按本次任务注入给 Agent。
+Each agent stream launches a session-scoped MCP bridge that injects local tools:
-当前内置工具：
+- `im_list_files` — list session workspace files
+- `im_send_file` — send files back to IM in real time
-- `im_list_files`：列出 session workspace 文件
-- `im_send_file`：把文件实时发回 IM
+Optional GUI capabilities:
-可选 GUI 能力：
-- 浏览器自动化：通过 `@playwright/mcp` 管理一个专用的持久化 Chrome profile；第一次使用时在这个自动化浏览器里登录需要的网站，后续任务会复用同一个 profile
-- macOS 桌面自动化：通过 Appium Mac2 提供 `desktop_open_app`、`desktop_snapshot`、`desktop_click`、`desktop_type`、`desktop_screenshot` 等工具
+- **Browser automation** — managed Chrome profile via `@playwright/mcp`; log in once, reuse across tasks
+- **macOS desktop automation** — Appium Mac2 with `desktop_open_app`, `desktop_snapshot`, `desktop_click`, `desktop_type`, `desktop_screenshot`
 ---
 ## Commands
-| 命令 | 说明 |
+| Command | Description |
 |---|---|
-| `/start` | 显示入口信息、当前 Agent、工作目录 |
-| `/sessions` | 查看、切换或新建会话 |
-| `/agents` | 切换 Agent |
-| `/models` | 查看并切换模型 / reasoning effort |
-| `/switch` | 浏览并切换工作目录 |
-| `/status` | 查看运行状态、tokens、usage、会话信息 |
-| `/host` | 查看主机 CPU / 内存 / 磁盘 / 电量 |
-| `/skills` | 浏览项目 skills |
-| `/restart` | 重启并重新拉起 bot |
-| `/sk_<name>` | 运行项目 skill |
-普通文本消息会直接转给当前 Agent。
-<details>
-<summary>Telegram 命令效果预览</summary>
-<img src="docs/promo-tg-commands.png" alt="Commands in Telegram" width="360">
-</details>
+| `/start` | Show entry info, current agent, working directory |
+| `/sessions` | View, switch, or create sessions |
+| `/agents` | Switch agent |
+| `/models` | View and switch model / reasoning effort |
+| `/switch` | Browse and switch working directory |
+| `/status` | Runtime status, tokens, usage, session info |
+| `/host` | Host CPU / memory / disk / battery |
+| `/skills` | Browse project skills |
+| `/restart` | Restart and re-launch bot |
+| `/sk_<name>` | Run a project skill |
+Plain text messages are forwarded directly to the current agent.
 ---
-## Config And Setup Notes
+## Configuration
-- 持久化配置在 `~/.pikiclaw/setting.json`
-- Dashboard 是主配置入口，其他运行时配置仍然可用
-- 桌面 GUI 相关常用变量：
-  - `PIKICLAW_DESKTOP_GUI`
-  - `PIKICLAW_DESKTOP_APPIUM_URL`
+- Persistent config lives in `~/.pikiclaw/setting.json`
+- The Dashboard is the primary configuration interface
-浏览器自动化由 dashboard 和本地运行时共同管理，会自动创建并复用专用的 Chrome profile 目录。你只需要在这个专用浏览器里登录需要自动化的网站账号一次。
+<details>
+<summary>GUI automation setup</summary>
-如果要启用 macOS 桌面自动化，需要先准备 Appium Mac2：
+**Browser automation** is managed by the dashboard and runtime together — a dedicated Chrome profile is created and reused automatically. Just log in to the sites you need once in that browser.
+**macOS desktop automation** requires Appium Mac2:
 ```bash
 npm install -g appium
@@ -207,15 +210,21 @@ appium driver install mac2
 appium
 ```
-然后给运行 `pikiclaw` 的终端应用授予 macOS 的辅助功能权限。
+Then grant macOS Accessibility permission to your terminal app.
+Relevant environment variables:
+- `PIKICLAW_DESKTOP_GUI`
+- `PIKICLAW_DESKTOP_APPIUM_URL`
+</details>
 ---
 ## Roadmap
-- 把当前会话级 MCP bridge 继续扩展成更完整的顶级工具接入层
-- 继续完善 GUI 自动化能力，尤其是浏览器与桌面工具的协同链路
-- 增加更多 IM 渠道，WhatsApp 仍在规划中
+- Expand session-scoped MCP bridge into a more complete top-level tool layer
+- Improve GUI automation, especially browser + desktop tool coordination
+- More IM channels (WhatsApp, etc.)
 ---
@@ -229,25 +238,15 @@ npm run build
 npm test
 ```
-常用命令：
 ```bash
-npm run dev
-npm run build
-npm test
-npm run test:e2e
-npx vitest run test/channel-feishu.unit.test.ts
-npx pikiclaw@latest --doctor
+npm run dev          # local dev (--no-daemon, logs to ~/.pikiclaw/dev/dev.log)
+npm run build        # production build
+npm test             # unit tests
+npm run test:e2e     # end-to-end tests
+npx pikiclaw@latest --doctor  # environment check
 ```
-`npm run dev` 只跑本地源码链路，会固定使用 `--no-daemon`，避免跳转到生产/自举用的 `npx pikiclaw@latest`。
-同时会把本次启动的全部日志写到 `~/.pikiclaw/dev/dev.log`，并在每次启动时先清空旧日志。
-更多实现细节见：
-- [ARCHITECTURE.md](ARCHITECTURE.md)
-- [INTEGRATION.md](INTEGRATION.md)
-- [TESTING.md](TESTING.md)
+See also: [ARCHITECTURE.md](ARCHITECTURE.md) · [INTEGRATION.md](INTEGRATION.md) · [TESTING.md](TESTING.md)
 ---