@quantumclaw/quantumclaw 2026.3.22
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +4601 -0
- package/LICENSE +21 -0
- package/README.md +559 -0
- package/assets/avatar-placeholder.svg +19 -0
- package/assets/chrome-extension/icons/icon128.png +0 -0
- package/assets/chrome-extension/icons/icon16.png +0 -0
- package/assets/chrome-extension/icons/icon32.png +0 -0
- package/assets/chrome-extension/icons/icon48.png +0 -0
- package/assets/dmg-background-small.png +0 -0
- package/assets/dmg-background.png +0 -0
- package/docs/.i18n/README.md +31 -0
- package/docs/.i18n/glossary.ja-JP.json +14 -0
- package/docs/.i18n/glossary.zh-CN.json +302 -0
- package/docs/.i18n/ja-JP.tm.jsonl +0 -0
- package/docs/assets/install-script.svg +1 -0
- package/docs/assets/macos-onboarding/01-macos-warning.jpeg +0 -0
- package/docs/assets/macos-onboarding/02-local-networks.jpeg +0 -0
- package/docs/assets/macos-onboarding/03-security-notice.png +0 -0
- package/docs/assets/macos-onboarding/04-choose-gateway.png +0 -0
- package/docs/assets/macos-onboarding/05-permissions.png +0 -0
- package/docs/assets/pixel-lobster.svg +60 -0
- package/docs/assets/quantumclaw-logo-text-dark.png +0 -0
- package/docs/assets/quantumclaw-logo-text-dark.svg +418 -0
- package/docs/assets/quantumclaw-logo-text.png +0 -0
- package/docs/assets/quantumclaw-logo-text.svg +418 -0
- package/docs/assets/showcase/agents-ui.jpg +0 -0
- package/docs/assets/showcase/bambu-cli.png +0 -0
- package/docs/assets/showcase/codexmonitor.png +0 -0
- package/docs/assets/showcase/gohome-grafana.png +0 -0
- package/docs/assets/showcase/ios-testflight.jpg +0 -0
- package/docs/assets/showcase/oura-health.png +0 -0
- package/docs/assets/showcase/padel-cli.svg +11 -0
- package/docs/assets/showcase/padel-screenshot.jpg +0 -0
- package/docs/assets/showcase/papla-tts.jpg +0 -0
- package/docs/assets/showcase/pr-review-telegram.jpg +0 -0
- package/docs/assets/showcase/roborock-screenshot.jpg +0 -0
- package/docs/assets/showcase/roborock-status.svg +13 -0
- package/docs/assets/showcase/roof-camera-sky.jpg +0 -0
- package/docs/assets/showcase/snag.png +0 -0
- package/docs/assets/showcase/tesco-shop.jpg +0 -0
- package/docs/assets/showcase/wienerlinien.png +0 -0
- package/docs/assets/showcase/wine-cellar-skill.jpg +0 -0
- package/docs/assets/showcase/winix-air-purifier.jpg +0 -0
- package/docs/assets/showcase/xuezh-pronunciation.jpeg +0 -0
- package/docs/assets/sponsors/blacksmith.svg +14 -0
- package/docs/assets/sponsors/convex.svg +16 -0
- package/docs/assets/sponsors/openai.svg +3 -0
- package/docs/assets/sponsors/vercel.svg +5 -0
- package/docs/auth-credential-semantics.md +53 -0
- package/docs/automation/auth-monitoring.md +44 -0
- package/docs/automation/cron-jobs.md +727 -0
- package/docs/automation/cron-vs-heartbeat.md +286 -0
- package/docs/automation/gmail-pubsub.md +256 -0
- package/docs/automation/hooks.md +1056 -0
- package/docs/automation/poll.md +86 -0
- package/docs/automation/standing-orders.md +251 -0
- package/docs/automation/troubleshooting.md +122 -0
- package/docs/automation/webhook.md +217 -0
- package/docs/brave-search.md +93 -0
- package/docs/channels/bluebubbles.md +347 -0
- package/docs/channels/broadcast-groups.md +442 -0
- package/docs/channels/channel-routing.md +139 -0
- package/docs/channels/discord.md +1229 -0
- package/docs/channels/feishu.md +747 -0
- package/docs/channels/googlechat.md +261 -0
- package/docs/channels/group-messages.md +84 -0
- package/docs/channels/groups.md +379 -0
- package/docs/channels/imessage.md +367 -0
- package/docs/channels/index.md +47 -0
- package/docs/channels/irc.md +242 -0
- package/docs/channels/line.md +194 -0
- package/docs/channels/location.md +56 -0
- package/docs/channels/matrix.md +677 -0
- package/docs/channels/mattermost.md +427 -0
- package/docs/channels/msteams.md +780 -0
- package/docs/channels/nextcloud-talk.md +138 -0
- package/docs/channels/nostr.md +249 -0
- package/docs/channels/pairing.md +114 -0
- package/docs/channels/signal.md +329 -0
- package/docs/channels/slack.md +603 -0
- package/docs/channels/synology-chat.md +134 -0
- package/docs/channels/telegram.md +987 -0
- package/docs/channels/tlon.md +276 -0
- package/docs/channels/troubleshooting.md +118 -0
- package/docs/channels/twitch.md +379 -0
- package/docs/channels/whatsapp.md +460 -0
- package/docs/channels/zalo.md +243 -0
- package/docs/channels/zalouser.md +181 -0
- package/docs/ci.md +55 -0
- package/docs/cli/acp.md +288 -0
- package/docs/cli/agent.md +29 -0
- package/docs/cli/agents.md +123 -0
- package/docs/cli/approvals.md +50 -0
- package/docs/cli/backup.md +76 -0
- package/docs/cli/browser.md +106 -0
- package/docs/cli/channels.md +102 -0
- package/docs/cli/clawbot.md +21 -0
- package/docs/cli/completion.md +35 -0
- package/docs/cli/config.md +295 -0
- package/docs/cli/configure.md +36 -0
- package/docs/cli/cron.md +77 -0
- package/docs/cli/daemon.md +53 -0
- package/docs/cli/dashboard.md +22 -0
- package/docs/cli/devices.md +139 -0
- package/docs/cli/directory.md +63 -0
- package/docs/cli/dns.md +23 -0
- package/docs/cli/docs.md +15 -0
- package/docs/cli/doctor.md +48 -0
- package/docs/cli/gateway.md +235 -0
- package/docs/cli/health.md +21 -0
- package/docs/cli/hooks.md +329 -0
- package/docs/cli/index.md +1150 -0
- package/docs/cli/logs.md +28 -0
- package/docs/cli/memory.md +66 -0
- package/docs/cli/message.md +278 -0
- package/docs/cli/models.md +81 -0
- package/docs/cli/node.md +127 -0
- package/docs/cli/nodes.md +75 -0
- package/docs/cli/onboard.md +157 -0
- package/docs/cli/pairing.md +32 -0
- package/docs/cli/plugins.md +210 -0
- package/docs/cli/qr.md +46 -0
- package/docs/cli/reset.md +20 -0
- package/docs/cli/sandbox.md +197 -0
- package/docs/cli/secrets.md +188 -0
- package/docs/cli/security.md +79 -0
- package/docs/cli/sessions.md +110 -0
- package/docs/cli/setup.md +29 -0
- package/docs/cli/skills.md +36 -0
- package/docs/cli/status.md +30 -0
- package/docs/cli/system.md +60 -0
- package/docs/cli/tui.md +30 -0
- package/docs/cli/uninstall.md +20 -0
- package/docs/cli/update.md +103 -0
- package/docs/cli/voicecall.md +34 -0
- package/docs/cli/webhooks.md +25 -0
- package/docs/concepts/agent-loop.md +148 -0
- package/docs/concepts/agent-workspace.md +236 -0
- package/docs/concepts/agent.md +122 -0
- package/docs/concepts/architecture.md +137 -0
- package/docs/concepts/compaction.md +123 -0
- package/docs/concepts/context-engine.md +268 -0
- package/docs/concepts/context.md +172 -0
- package/docs/concepts/delegate-architecture.md +296 -0
- package/docs/concepts/features.md +73 -0
- package/docs/concepts/markdown-formatting.md +130 -0
- package/docs/concepts/memory.md +108 -0
- package/docs/concepts/messages.md +154 -0
- package/docs/concepts/model-failover.md +152 -0
- package/docs/concepts/model-providers.md +607 -0
- package/docs/concepts/models.md +225 -0
- package/docs/concepts/multi-agent.md +552 -0
- package/docs/concepts/oauth.md +158 -0
- package/docs/concepts/presence.md +102 -0
- package/docs/concepts/queue.md +89 -0
- package/docs/concepts/retry.md +69 -0
- package/docs/concepts/session-pruning.md +121 -0
- package/docs/concepts/session-tool.md +242 -0
- package/docs/concepts/session.md +310 -0
- package/docs/concepts/streaming.md +155 -0
- package/docs/concepts/system-prompt.md +132 -0
- package/docs/concepts/timezone.md +91 -0
- package/docs/concepts/typebox.md +291 -0
- package/docs/concepts/typing-indicators.md +68 -0
- package/docs/concepts/usage-tracking.md +35 -0
- package/docs/date-time.md +128 -0
- package/docs/debug/node-issue.md +85 -0
- package/docs/diagnostics/flags.md +91 -0
- package/docs/docs.json +2078 -0
- package/docs/gateway/authentication.md +179 -0
- package/docs/gateway/background-process.md +97 -0
- package/docs/gateway/bonjour.md +177 -0
- package/docs/gateway/bridge-protocol.md +91 -0
- package/docs/gateway/cli-backends.md +225 -0
- package/docs/gateway/configuration-examples.md +651 -0
- package/docs/gateway/configuration-reference.md +3123 -0
- package/docs/gateway/configuration.md +633 -0
- package/docs/gateway/discovery.md +123 -0
- package/docs/gateway/doctor.md +362 -0
- package/docs/gateway/gateway-lock.md +34 -0
- package/docs/gateway/health.md +44 -0
- package/docs/gateway/heartbeat.md +393 -0
- package/docs/gateway/index.md +261 -0
- package/docs/gateway/local-models.md +152 -0
- package/docs/gateway/logging.md +113 -0
- package/docs/gateway/multiple-gateways.md +112 -0
- package/docs/gateway/network-model.md +22 -0
- package/docs/gateway/openai-http-api.md +132 -0
- package/docs/gateway/openresponses-http-api.md +295 -0
- package/docs/gateway/openshell.md +307 -0
- package/docs/gateway/pairing.md +99 -0
- package/docs/gateway/protocol.md +267 -0
- package/docs/gateway/remote-gateway-readme.md +158 -0
- package/docs/gateway/remote.md +153 -0
- package/docs/gateway/sandbox-vs-tool-policy-vs-elevated.md +134 -0
- package/docs/gateway/sandboxing.md +469 -0
- package/docs/gateway/secrets-plan-contract.md +116 -0
- package/docs/gateway/secrets.md +503 -0
- package/docs/gateway/security/index.md +1220 -0
- package/docs/gateway/tailscale.md +132 -0
- package/docs/gateway/tools-invoke-http-api.md +118 -0
- package/docs/gateway/troubleshooting.md +378 -0
- package/docs/gateway/trusted-proxy-auth.md +330 -0
- package/docs/help/debugging.md +168 -0
- package/docs/help/environment.md +163 -0
- package/docs/help/faq.md +2997 -0
- package/docs/help/index.md +28 -0
- package/docs/help/scripts.md +28 -0
- package/docs/help/testing.md +526 -0
- package/docs/help/troubleshooting.md +297 -0
- package/docs/images/configure-model-picker-unsearchable.png +0 -0
- package/docs/images/feishu-step2-create-app.png +0 -0
- package/docs/images/feishu-step3-credentials.png +0 -0
- package/docs/images/feishu-step4-permissions.png +0 -0
- package/docs/images/feishu-step5-bot-capability.png +0 -0
- package/docs/images/feishu-step6-event-subscription.png +0 -0
- package/docs/images/feishu-verification-token.png +0 -0
- package/docs/images/groups-flow.svg +52 -0
- package/docs/images/mobile-ui-screenshot.png +0 -0
- package/docs/index.md +196 -0
- package/docs/install/ansible.md +230 -0
- package/docs/install/azure.md +311 -0
- package/docs/install/bun.md +55 -0
- package/docs/install/development-channels.md +120 -0
- package/docs/install/digitalocean.md +129 -0
- package/docs/install/docker-vm-runtime.md +142 -0
- package/docs/install/docker.md +375 -0
- package/docs/install/exe-dev.md +126 -0
- package/docs/install/fly.md +501 -0
- package/docs/install/gcp.md +402 -0
- package/docs/install/hetzner.md +251 -0
- package/docs/install/index.md +183 -0
- package/docs/install/installer.md +415 -0
- package/docs/install/kubernetes.md +191 -0
- package/docs/install/macos-vm.md +281 -0
- package/docs/install/migrating-matrix.md +346 -0
- package/docs/install/migrating.md +110 -0
- package/docs/install/nix.md +89 -0
- package/docs/install/node.md +138 -0
- package/docs/install/northflank.mdx +54 -0
- package/docs/install/oracle.md +156 -0
- package/docs/install/podman.md +133 -0
- package/docs/install/railway.mdx +100 -0
- package/docs/install/raspberry-pi.md +159 -0
- package/docs/install/render.mdx +169 -0
- package/docs/install/uninstall.md +128 -0
- package/docs/install/updating.md +128 -0
- package/docs/ja-JP/index.md +186 -0
- package/docs/ja-JP/start/getting-started.md +125 -0
- package/docs/ja-JP/start/wizard.md +77 -0
- package/docs/logging.md +352 -0
- package/docs/nav-tabs-underline.js +100 -0
- package/docs/network.md +54 -0
- package/docs/nodes/audio.md +187 -0
- package/docs/nodes/camera.md +162 -0
- package/docs/nodes/images.md +72 -0
- package/docs/nodes/index.md +393 -0
- package/docs/nodes/location-command.md +98 -0
- package/docs/nodes/media-understanding.md +394 -0
- package/docs/nodes/talk.md +92 -0
- package/docs/nodes/troubleshooting.md +114 -0
- package/docs/nodes/voicewake.md +66 -0
- package/docs/perplexity.md +174 -0
- package/docs/pi-dev.md +80 -0
- package/docs/pi.md +567 -0
- package/docs/platforms/android.md +168 -0
- package/docs/platforms/digitalocean.md +266 -0
- package/docs/platforms/index.md +54 -0
- package/docs/platforms/ios.md +220 -0
- package/docs/platforms/linux.md +94 -0
- package/docs/platforms/mac/bundled-gateway.md +73 -0
- package/docs/platforms/mac/canvas.md +125 -0
- package/docs/platforms/mac/child-process.md +69 -0
- package/docs/platforms/mac/dev-setup.md +104 -0
- package/docs/platforms/mac/health.md +34 -0
- package/docs/platforms/mac/icon.md +31 -0
- package/docs/platforms/mac/logging.md +57 -0
- package/docs/platforms/mac/menu-bar.md +81 -0
- package/docs/platforms/mac/peekaboo.md +65 -0
- package/docs/platforms/mac/permissions.md +50 -0
- package/docs/platforms/mac/remote.md +84 -0
- package/docs/platforms/mac/signing.md +47 -0
- package/docs/platforms/mac/skills.md +33 -0
- package/docs/platforms/mac/voice-overlay.md +60 -0
- package/docs/platforms/mac/voicewake.md +67 -0
- package/docs/platforms/mac/webchat.md +43 -0
- package/docs/platforms/mac/xpc.md +61 -0
- package/docs/platforms/macos.md +226 -0
- package/docs/platforms/oracle.md +303 -0
- package/docs/platforms/raspberry-pi.md +412 -0
- package/docs/platforms/windows.md +241 -0
- package/docs/plugins/agent-tools.md +10 -0
- package/docs/plugins/architecture.md +1366 -0
- package/docs/plugins/building-extensions.md +10 -0
- package/docs/plugins/building-plugins.md +239 -0
- package/docs/plugins/bundles.md +181 -0
- package/docs/plugins/community.md +145 -0
- package/docs/plugins/manifest.md +241 -0
- package/docs/plugins/sdk-channel-plugins.md +370 -0
- package/docs/plugins/sdk-entrypoints.md +161 -0
- package/docs/plugins/sdk-migration.md +172 -0
- package/docs/plugins/sdk-overview.md +196 -0
- package/docs/plugins/sdk-provider-plugins.md +370 -0
- package/docs/plugins/sdk-runtime.md +345 -0
- package/docs/plugins/sdk-setup.md +331 -0
- package/docs/plugins/sdk-testing.md +263 -0
- package/docs/plugins/voice-call.md +380 -0
- package/docs/plugins/zalouser.md +77 -0
- package/docs/prose.md +134 -0
- package/docs/providers/anthropic.md +259 -0
- package/docs/providers/bedrock.md +176 -0
- package/docs/providers/claude-max-api-proxy.md +154 -0
- package/docs/providers/cloudflare-ai-gateway.md +71 -0
- package/docs/providers/deepgram.md +93 -0
- package/docs/providers/github-copilot.md +72 -0
- package/docs/providers/glm.md +43 -0
- package/docs/providers/google.md +78 -0
- package/docs/providers/groq.md +96 -0
- package/docs/providers/huggingface.md +209 -0
- package/docs/providers/index.md +69 -0
- package/docs/providers/kilocode.md +74 -0
- package/docs/providers/litellm.md +154 -0
- package/docs/providers/minimax.md +224 -0
- package/docs/providers/mistral.md +54 -0
- package/docs/providers/models.md +45 -0
- package/docs/providers/modelstudio.md +66 -0
- package/docs/providers/moonshot.md +175 -0
- package/docs/providers/nvidia.md +55 -0
- package/docs/providers/ollama.md +352 -0
- package/docs/providers/openai.md +303 -0
- package/docs/providers/opencode-go.md +45 -0
- package/docs/providers/opencode.md +64 -0
- package/docs/providers/openrouter.md +37 -0
- package/docs/providers/perplexity-provider.md +62 -0
- package/docs/providers/qianfan.md +38 -0
- package/docs/providers/qwen.md +53 -0
- package/docs/providers/sglang.md +104 -0
- package/docs/providers/synthetic.md +99 -0
- package/docs/providers/together.md +66 -0
- package/docs/providers/venice.md +282 -0
- package/docs/providers/vercel-ai-gateway.md +60 -0
- package/docs/providers/vllm.md +92 -0
- package/docs/providers/volcengine.md +74 -0
- package/docs/providers/xai.md +60 -0
- package/docs/providers/xiaomi.md +86 -0
- package/docs/providers/zai.md +46 -0
- package/docs/reference/AGENTS.default.md +126 -0
- package/docs/reference/RELEASING.md +42 -0
- package/docs/reference/api-usage-costs.md +144 -0
- package/docs/reference/credits.md +30 -0
- package/docs/reference/device-models.md +47 -0
- package/docs/reference/memory-config.md +711 -0
- package/docs/reference/prompt-caching.md +185 -0
- package/docs/reference/rpc.md +43 -0
- package/docs/reference/secretref-credential-surface.md +140 -0
- package/docs/reference/secretref-user-supplied-credentials-matrix.json +563 -0
- package/docs/reference/session-management-compaction.md +324 -0
- package/docs/reference/templates/AGENTS.dev.md +83 -0
- package/docs/reference/templates/AGENTS.md +219 -0
- package/docs/reference/templates/BOOT.md +11 -0
- package/docs/reference/templates/BOOTSTRAP.md +62 -0
- package/docs/reference/templates/HEARTBEAT.md +14 -0
- package/docs/reference/templates/IDENTITY.dev.md +47 -0
- package/docs/reference/templates/IDENTITY.md +29 -0
- package/docs/reference/templates/SOUL.dev.md +76 -0
- package/docs/reference/templates/SOUL.md +43 -0
- package/docs/reference/templates/TOOLS.dev.md +24 -0
- package/docs/reference/templates/TOOLS.md +47 -0
- package/docs/reference/templates/USER.dev.md +18 -0
- package/docs/reference/templates/USER.md +23 -0
- package/docs/reference/test.md +90 -0
- package/docs/reference/token-use.md +175 -0
- package/docs/reference/transcript-hygiene.md +151 -0
- package/docs/reference/wizard.md +235 -0
- package/docs/security/CONTRIBUTING-THREAT-MODEL.md +98 -0
- package/docs/security/THREAT-MODEL-ATLAS.md +611 -0
- package/docs/security/formal-verification.md +167 -0
- package/docs/start/bootstrapping.md +41 -0
- package/docs/start/docs-directory.md +66 -0
- package/docs/start/getting-started.md +116 -0
- package/docs/start/hubs.md +198 -0
- package/docs/start/lore.md +219 -0
- package/docs/start/onboarding-overview.md +67 -0
- package/docs/start/onboarding.md +91 -0
- package/docs/start/openclaw.md +221 -0
- package/docs/start/quickstart.md +22 -0
- package/docs/start/setup.md +164 -0
- package/docs/start/showcase.md +418 -0
- package/docs/start/wizard-cli-automation.md +215 -0
- package/docs/start/wizard-cli-reference.md +299 -0
- package/docs/start/wizard.md +125 -0
- package/docs/style.css +37 -0
- package/docs/tools/acp-agents.md +623 -0
- package/docs/tools/agent-send.md +100 -0
- package/docs/tools/apply-patch.md +51 -0
- package/docs/tools/brave-search.md +93 -0
- package/docs/tools/browser-linux-troubleshooting.md +138 -0
- package/docs/tools/browser-login.md +73 -0
- package/docs/tools/browser-wsl2-windows-remote-cdp-troubleshooting.md +211 -0
- package/docs/tools/browser.md +731 -0
- package/docs/tools/btw.md +142 -0
- package/docs/tools/capability-cookbook.md +119 -0
- package/docs/tools/clawhub.md +298 -0
- package/docs/tools/creating-skills.md +117 -0
- package/docs/tools/diffs.md +386 -0
- package/docs/tools/elevated.md +114 -0
- package/docs/tools/exec-approvals.md +430 -0
- package/docs/tools/exec.md +207 -0
- package/docs/tools/firecrawl.md +140 -0
- package/docs/tools/index.md +137 -0
- package/docs/tools/llm-task.md +119 -0
- package/docs/tools/lobster.md +340 -0
- package/docs/tools/loop-detection.md +100 -0
- package/docs/tools/multi-agent-sandbox-tools.md +364 -0
- package/docs/tools/pdf.md +156 -0
- package/docs/tools/perplexity-search.md +174 -0
- package/docs/tools/plugin.md +255 -0
- package/docs/tools/reactions.md +64 -0
- package/docs/tools/skills-config.md +86 -0
- package/docs/tools/skills.md +309 -0
- package/docs/tools/slash-commands.md +294 -0
- package/docs/tools/subagents.md +295 -0
- package/docs/tools/tavily.md +125 -0
- package/docs/tools/thinking.md +96 -0
- package/docs/tools/tts.md +406 -0
- package/docs/tools/web.md +516 -0
- package/docs/tts.md +406 -0
- package/docs/vps.md +112 -0
- package/docs/web/control-ui.md +275 -0
- package/docs/web/dashboard.md +54 -0
- package/docs/web/index.md +120 -0
- package/docs/web/tui.md +170 -0
- package/docs/web/webchat.md +61 -0
- package/docs/whatsapp-openclaw-ai-zh.jpg +0 -0
- package/docs/whatsapp-openclaw.jpg +0 -0
- package/docs/zh-CN/AGENTS.md +61 -0
- package/docs/zh-CN/automation/auth-monitoring.md +47 -0
- package/docs/zh-CN/automation/cron-jobs.md +435 -0
- package/docs/zh-CN/automation/cron-vs-heartbeat.md +286 -0
- package/docs/zh-CN/automation/gmail-pubsub.md +249 -0
- package/docs/zh-CN/automation/hooks.md +1051 -0
- package/docs/zh-CN/automation/poll.md +76 -0
- package/docs/zh-CN/automation/troubleshooting.md +8 -0
- package/docs/zh-CN/automation/webhook.md +163 -0
- package/docs/zh-CN/brave-search.md +60 -0
- package/docs/zh-CN/channels/bluebubbles.md +354 -0
- package/docs/zh-CN/channels/broadcast-groups.md +449 -0
- package/docs/zh-CN/channels/channel-routing.md +117 -0
- package/docs/zh-CN/channels/discord.md +468 -0
- package/docs/zh-CN/channels/feishu.md +728 -0
- package/docs/zh-CN/channels/googlechat.md +257 -0
- package/docs/zh-CN/channels/grammy.md +38 -0
- package/docs/zh-CN/channels/group-messages.md +91 -0
- package/docs/zh-CN/channels/groups.md +379 -0
- package/docs/zh-CN/channels/imessage.md +302 -0
- package/docs/zh-CN/channels/index.md +53 -0
- package/docs/zh-CN/channels/line.md +180 -0
- package/docs/zh-CN/channels/location.md +63 -0
- package/docs/zh-CN/channels/matrix.md +221 -0
- package/docs/zh-CN/channels/mattermost.md +144 -0
- package/docs/zh-CN/channels/msteams.md +775 -0
- package/docs/zh-CN/channels/nextcloud-talk.md +142 -0
- package/docs/zh-CN/channels/nostr.md +249 -0
- package/docs/zh-CN/channels/pairing.md +89 -0
- package/docs/zh-CN/channels/signal.md +209 -0
- package/docs/zh-CN/channels/slack.md +531 -0
- package/docs/zh-CN/channels/synology-chat.md +138 -0
- package/docs/zh-CN/channels/telegram.md +751 -0
- package/docs/zh-CN/channels/tlon.md +136 -0
- package/docs/zh-CN/channels/troubleshooting.md +36 -0
- package/docs/zh-CN/channels/twitch.md +385 -0
- package/docs/zh-CN/channels/whatsapp.md +411 -0
- package/docs/zh-CN/channels/zalo.md +196 -0
- package/docs/zh-CN/channels/zalouser.md +147 -0
- package/docs/zh-CN/cli/acp.md +173 -0
- package/docs/zh-CN/cli/agent.md +30 -0
- package/docs/zh-CN/cli/agents.md +82 -0
- package/docs/zh-CN/cli/approvals.md +57 -0
- package/docs/zh-CN/cli/browser.md +114 -0
- package/docs/zh-CN/cli/channels.md +86 -0
- package/docs/zh-CN/cli/config.md +57 -0
- package/docs/zh-CN/cli/configure.md +38 -0
- package/docs/zh-CN/cli/cron.md +43 -0
- package/docs/zh-CN/cli/dashboard.md +23 -0
- package/docs/zh-CN/cli/devices.md +74 -0
- package/docs/zh-CN/cli/directory.md +70 -0
- package/docs/zh-CN/cli/dns.md +30 -0
- package/docs/zh-CN/cli/docs.md +22 -0
- package/docs/zh-CN/cli/doctor.md +48 -0
- package/docs/zh-CN/cli/gateway.md +206 -0
- package/docs/zh-CN/cli/health.md +28 -0
- package/docs/zh-CN/cli/hooks.md +298 -0
- package/docs/zh-CN/cli/index.md +1143 -0
- package/docs/zh-CN/cli/logs.md +31 -0
- package/docs/zh-CN/cli/memory.md +52 -0
- package/docs/zh-CN/cli/message.md +246 -0
- package/docs/zh-CN/cli/models.md +85 -0
- package/docs/zh-CN/cli/node.md +115 -0
- package/docs/zh-CN/cli/nodes.md +80 -0
- package/docs/zh-CN/cli/onboard.md +164 -0
- package/docs/zh-CN/cli/pairing.md +28 -0
- package/docs/zh-CN/cli/plugins.md +66 -0
- package/docs/zh-CN/cli/reset.md +24 -0
- package/docs/zh-CN/cli/sandbox.md +158 -0
- package/docs/zh-CN/cli/security.md +33 -0
- package/docs/zh-CN/cli/sessions.md +23 -0
- package/docs/zh-CN/cli/setup.md +36 -0
- package/docs/zh-CN/cli/skills.md +33 -0
- package/docs/zh-CN/cli/status.md +33 -0
- package/docs/zh-CN/cli/system.md +63 -0
- package/docs/zh-CN/cli/tui.md +30 -0
- package/docs/zh-CN/cli/uninstall.md +24 -0
- package/docs/zh-CN/cli/update.md +101 -0
- package/docs/zh-CN/cli/voicecall.md +41 -0
- package/docs/zh-CN/cli/webhooks.md +32 -0
- package/docs/zh-CN/concepts/agent-loop.md +146 -0
- package/docs/zh-CN/concepts/agent-workspace.md +219 -0
- package/docs/zh-CN/concepts/agent.md +115 -0
- package/docs/zh-CN/concepts/architecture.md +123 -0
- package/docs/zh-CN/concepts/compaction.md +67 -0
- package/docs/zh-CN/concepts/context.md +168 -0
- package/docs/zh-CN/concepts/features.md +59 -0
- package/docs/zh-CN/concepts/markdown-formatting.md +117 -0
- package/docs/zh-CN/concepts/memory.md +412 -0
- package/docs/zh-CN/concepts/messages.md +141 -0
- package/docs/zh-CN/concepts/model-failover.md +145 -0
- package/docs/zh-CN/concepts/model-providers.md +606 -0
- package/docs/zh-CN/concepts/models.md +225 -0
- package/docs/zh-CN/concepts/multi-agent.md +372 -0
- package/docs/zh-CN/concepts/oauth.md +164 -0
- package/docs/zh-CN/concepts/presence.md +99 -0
- package/docs/zh-CN/concepts/queue.md +94 -0
- package/docs/zh-CN/concepts/retry.md +76 -0
- package/docs/zh-CN/concepts/session-pruning.md +129 -0
- package/docs/zh-CN/concepts/session-tool.md +200 -0
- package/docs/zh-CN/concepts/session.md +166 -0
- package/docs/zh-CN/concepts/streaming.md +133 -0
- package/docs/zh-CN/concepts/system-prompt.md +101 -0
- package/docs/zh-CN/concepts/timezone.md +96 -0
- package/docs/zh-CN/concepts/typebox.md +284 -0
- package/docs/zh-CN/concepts/typing-indicators.md +74 -0
- package/docs/zh-CN/concepts/usage-tracking.md +42 -0
- package/docs/zh-CN/date-time.md +129 -0
- package/docs/zh-CN/debug/node-issue.md +90 -0
- package/docs/zh-CN/diagnostics/flags.md +98 -0
- package/docs/zh-CN/gateway/authentication.md +184 -0
- package/docs/zh-CN/gateway/background-process.md +100 -0
- package/docs/zh-CN/gateway/bonjour.md +174 -0
- package/docs/zh-CN/gateway/bridge-protocol.md +86 -0
- package/docs/zh-CN/gateway/cli-backends.md +213 -0
- package/docs/zh-CN/gateway/configuration-examples.md +587 -0
- package/docs/zh-CN/gateway/configuration-reference.md +3103 -0
- package/docs/zh-CN/gateway/configuration.md +640 -0
- package/docs/zh-CN/gateway/discovery.md +123 -0
- package/docs/zh-CN/gateway/doctor.md +238 -0
- package/docs/zh-CN/gateway/gateway-lock.md +41 -0
- package/docs/zh-CN/gateway/health.md +42 -0
- package/docs/zh-CN/gateway/heartbeat.md +274 -0
- package/docs/zh-CN/gateway/index.md +335 -0
- package/docs/zh-CN/gateway/local-models.md +159 -0
- package/docs/zh-CN/gateway/logging.md +114 -0
- package/docs/zh-CN/gateway/multiple-gateways.md +119 -0
- package/docs/zh-CN/gateway/network-model.md +23 -0
- package/docs/zh-CN/gateway/openai-http-api.md +125 -0
- package/docs/zh-CN/gateway/openresponses-http-api.md +317 -0
- package/docs/zh-CN/gateway/pairing.md +99 -0
- package/docs/zh-CN/gateway/protocol.md +220 -0
- package/docs/zh-CN/gateway/remote-gateway-readme.md +164 -0
- package/docs/zh-CN/gateway/remote.md +133 -0
- package/docs/zh-CN/gateway/sandbox-vs-tool-policy-vs-elevated.md +135 -0
- package/docs/zh-CN/gateway/sandboxing.md +188 -0
- package/docs/zh-CN/gateway/security/index.md +777 -0
- package/docs/zh-CN/gateway/tailscale.md +124 -0
- package/docs/zh-CN/gateway/tools-invoke-http-api.md +92 -0
- package/docs/zh-CN/gateway/troubleshooting.md +771 -0
- package/docs/zh-CN/help/debugging.md +160 -0
- package/docs/zh-CN/help/environment.md +88 -0
- package/docs/zh-CN/help/faq.md +2640 -0
- package/docs/zh-CN/help/index.md +28 -0
- package/docs/zh-CN/help/scripts.md +35 -0
- package/docs/zh-CN/help/testing.md +375 -0
- package/docs/zh-CN/help/troubleshooting.md +104 -0
- package/docs/zh-CN/index.md +186 -0
- package/docs/zh-CN/install/ansible.md +215 -0
- package/docs/zh-CN/install/bun.md +65 -0
- package/docs/zh-CN/install/development-channels.md +81 -0
- package/docs/zh-CN/install/docker.md +532 -0
- package/docs/zh-CN/install/exe-dev.md +133 -0
- package/docs/zh-CN/install/fly.md +490 -0
- package/docs/zh-CN/install/gcp.md +510 -0
- package/docs/zh-CN/install/hetzner.md +337 -0
- package/docs/zh-CN/install/index.md +235 -0
- package/docs/zh-CN/install/installer.md +422 -0
- package/docs/zh-CN/install/macos-vm.md +288 -0
- package/docs/zh-CN/install/migrating.md +199 -0
- package/docs/zh-CN/install/nix.md +99 -0
- package/docs/zh-CN/install/node.md +8 -0
- package/docs/zh-CN/install/northflank.mdx +60 -0
- package/docs/zh-CN/install/railway.mdx +106 -0
- package/docs/zh-CN/install/render.mdx +169 -0
- package/docs/zh-CN/install/uninstall.md +135 -0
- package/docs/zh-CN/install/updating.md +233 -0
- package/docs/zh-CN/logging.md +329 -0
- package/docs/zh-CN/network.md +59 -0
- package/docs/zh-CN/nodes/audio.md +120 -0
- package/docs/zh-CN/nodes/camera.md +162 -0
- package/docs/zh-CN/nodes/images.md +79 -0
- package/docs/zh-CN/nodes/index.md +348 -0
- package/docs/zh-CN/nodes/location-command.md +120 -0
- package/docs/zh-CN/nodes/media-understanding.md +380 -0
- package/docs/zh-CN/nodes/talk.md +97 -0
- package/docs/zh-CN/nodes/troubleshooting.md +8 -0
- package/docs/zh-CN/nodes/voicewake.md +72 -0
- package/docs/zh-CN/perplexity.md +102 -0
- package/docs/zh-CN/pi-dev.md +77 -0
- package/docs/zh-CN/pi.md +619 -0
- package/docs/zh-CN/platforms/android.md +155 -0
- package/docs/zh-CN/platforms/digitalocean.md +273 -0
- package/docs/zh-CN/platforms/index.md +60 -0
- package/docs/zh-CN/platforms/ios.md +114 -0
- package/docs/zh-CN/platforms/linux.md +100 -0
- package/docs/zh-CN/platforms/mac/bundled-gateway.md +75 -0
- package/docs/zh-CN/platforms/mac/canvas.md +128 -0
- package/docs/zh-CN/platforms/mac/child-process.md +73 -0
- package/docs/zh-CN/platforms/mac/dev-setup.md +109 -0
- package/docs/zh-CN/platforms/mac/health.md +41 -0
- package/docs/zh-CN/platforms/mac/icon.md +38 -0
- package/docs/zh-CN/platforms/mac/logging.md +64 -0
- package/docs/zh-CN/platforms/mac/menu-bar.md +88 -0
- package/docs/zh-CN/platforms/mac/peekaboo.md +62 -0
- package/docs/zh-CN/platforms/mac/permissions.md +46 -0
- package/docs/zh-CN/platforms/mac/remote.md +90 -0
- package/docs/zh-CN/platforms/mac/signing.md +54 -0
- package/docs/zh-CN/platforms/mac/skills.md +40 -0
- package/docs/zh-CN/platforms/mac/voice-overlay.md +67 -0
- package/docs/zh-CN/platforms/mac/voicewake.md +74 -0
- package/docs/zh-CN/platforms/mac/webchat.md +43 -0
- package/docs/zh-CN/platforms/mac/xpc.md +68 -0
- package/docs/zh-CN/platforms/macos.md +193 -0
- package/docs/zh-CN/platforms/oracle.md +310 -0
- package/docs/zh-CN/platforms/raspberry-pi.md +416 -0
- package/docs/zh-CN/platforms/windows.md +247 -0
- package/docs/zh-CN/plugins/agent-tools.md +99 -0
- package/docs/zh-CN/plugins/manifest.md +68 -0
- package/docs/zh-CN/plugins/voice-call.md +250 -0
- package/docs/zh-CN/plugins/zalouser.md +88 -0
- package/docs/zh-CN/prose.md +141 -0
- package/docs/zh-CN/providers/anthropic.md +265 -0
- package/docs/zh-CN/providers/bedrock.md +170 -0
- package/docs/zh-CN/providers/claude-max-api-proxy.md +155 -0
- package/docs/zh-CN/providers/cloudflare-ai-gateway.md +78 -0
- package/docs/zh-CN/providers/deepgram.md +97 -0
- package/docs/zh-CN/providers/github-copilot.md +67 -0
- package/docs/zh-CN/providers/glm.md +50 -0
- package/docs/zh-CN/providers/huggingface.md +216 -0
- package/docs/zh-CN/providers/index.md +69 -0
- package/docs/zh-CN/providers/kilocode.md +80 -0
- package/docs/zh-CN/providers/litellm.md +160 -0
- package/docs/zh-CN/providers/minimax.md +222 -0
- package/docs/zh-CN/providers/mistral.md +61 -0
- package/docs/zh-CN/providers/models.md +51 -0
- package/docs/zh-CN/providers/moonshot.md +182 -0
- package/docs/zh-CN/providers/nvidia.md +62 -0
- package/docs/zh-CN/providers/ollama.md +359 -0
- package/docs/zh-CN/providers/openai.md +308 -0
- package/docs/zh-CN/providers/opencode-go.md +52 -0
- package/docs/zh-CN/providers/opencode.md +71 -0
- package/docs/zh-CN/providers/openrouter.md +44 -0
- package/docs/zh-CN/providers/qianfan.md +45 -0
- package/docs/zh-CN/providers/qwen.md +55 -0
- package/docs/zh-CN/providers/sglang.md +111 -0
- package/docs/zh-CN/providers/synthetic.md +106 -0
- package/docs/zh-CN/providers/together.md +72 -0
- package/docs/zh-CN/providers/venice.md +289 -0
- package/docs/zh-CN/providers/vercel-ai-gateway.md +66 -0
- package/docs/zh-CN/providers/xiaomi.md +93 -0
- package/docs/zh-CN/providers/zai.md +53 -0
- package/docs/zh-CN/reference/AGENTS.default.md +131 -0
- package/docs/zh-CN/reference/RELEASING.md +48 -0
- package/docs/zh-CN/reference/api-usage-costs.md +141 -0
- package/docs/zh-CN/reference/credits.md +34 -0
- package/docs/zh-CN/reference/device-models.md +54 -0
- package/docs/zh-CN/reference/rpc.md +48 -0
- package/docs/zh-CN/reference/session-management-compaction.md +287 -0
- package/docs/zh-CN/reference/templates/AGENTS.dev.md +89 -0
- package/docs/zh-CN/reference/templates/AGENTS.md +225 -0
- package/docs/zh-CN/reference/templates/BOOT.md +17 -0
- package/docs/zh-CN/reference/templates/BOOTSTRAP.md +68 -0
- package/docs/zh-CN/reference/templates/HEARTBEAT.md +18 -0
- package/docs/zh-CN/reference/templates/IDENTITY.dev.md +54 -0
- package/docs/zh-CN/reference/templates/IDENTITY.md +36 -0
- package/docs/zh-CN/reference/templates/SOUL.dev.md +83 -0
- package/docs/zh-CN/reference/templates/SOUL.md +49 -0
- package/docs/zh-CN/reference/templates/TOOLS.dev.md +31 -0
- package/docs/zh-CN/reference/templates/TOOLS.md +53 -0
- package/docs/zh-CN/reference/templates/USER.dev.md +25 -0
- package/docs/zh-CN/reference/templates/USER.md +30 -0
- package/docs/zh-CN/reference/test.md +57 -0
- package/docs/zh-CN/reference/token-use.md +119 -0
- package/docs/zh-CN/reference/transcript-hygiene.md +109 -0
- package/docs/zh-CN/reference/wizard.md +242 -0
- package/docs/zh-CN/security/formal-verification.md +171 -0
- package/docs/zh-CN/start/bootstrapping.md +9 -0
- package/docs/zh-CN/start/docs-directory.md +70 -0
- package/docs/zh-CN/start/getting-started.md +143 -0
- package/docs/zh-CN/start/hubs.md +194 -0
- package/docs/zh-CN/start/lore.md +226 -0
- package/docs/zh-CN/start/onboarding-overview.md +58 -0
- package/docs/zh-CN/start/onboarding.md +105 -0
- package/docs/zh-CN/start/openclaw.md +248 -0
- package/docs/zh-CN/start/quickstart.md +88 -0
- package/docs/zh-CN/start/setup.md +153 -0
- package/docs/zh-CN/start/showcase.md +423 -0
- package/docs/zh-CN/start/wizard-cli-automation.md +222 -0
- package/docs/zh-CN/start/wizard-cli-reference.md +306 -0
- package/docs/zh-CN/start/wizard.md +132 -0
- package/docs/zh-CN/tools/agent-send.md +59 -0
- package/docs/zh-CN/tools/apply-patch.md +57 -0
- package/docs/zh-CN/tools/browser-linux-troubleshooting.md +144 -0
- package/docs/zh-CN/tools/browser-login.md +75 -0
- package/docs/zh-CN/tools/browser.md +553 -0
- package/docs/zh-CN/tools/chrome-extension.md +183 -0
- package/docs/zh-CN/tools/clawhub.md +209 -0
- package/docs/zh-CN/tools/creating-skills.md +61 -0
- package/docs/zh-CN/tools/elevated.md +64 -0
- package/docs/zh-CN/tools/exec-approvals.md +234 -0
- package/docs/zh-CN/tools/exec.md +169 -0
- package/docs/zh-CN/tools/firecrawl.md +68 -0
- package/docs/zh-CN/tools/index.md +515 -0
- package/docs/zh-CN/tools/llm-task.md +117 -0
- package/docs/zh-CN/tools/lobster.md +349 -0
- package/docs/zh-CN/tools/multi-agent-sandbox-tools.md +401 -0
- package/docs/zh-CN/tools/plugin.md +1612 -0
- package/docs/zh-CN/tools/reactions.md +29 -0
- package/docs/zh-CN/tools/skills-config.md +78 -0
- package/docs/zh-CN/tools/skills.md +279 -0
- package/docs/zh-CN/tools/slash-commands.md +205 -0
- package/docs/zh-CN/tools/subagents.md +167 -0
- package/docs/zh-CN/tools/thinking.md +80 -0
- package/docs/zh-CN/tools/web.md +289 -0
- package/docs/zh-CN/tts.md +375 -0
- package/docs/zh-CN/vps.md +47 -0
- package/docs/zh-CN/web/control-ui.md +191 -0
- package/docs/zh-CN/web/dashboard.md +53 -0
- package/docs/zh-CN/web/index.md +118 -0
- package/docs/zh-CN/web/tui.md +166 -0
- package/docs/zh-CN/web/webchat.md +56 -0
- package/package.json +841 -0
- package/quantumclaw.mjs +135 -0
- package/skills/1password/SKILL.md +70 -0
- package/skills/1password/references/cli-examples.md +29 -0
- package/skills/1password/references/get-started.md +17 -0
- package/skills/apple-notes/SKILL.md +77 -0
- package/skills/apple-reminders/SKILL.md +118 -0
- package/skills/bear-notes/SKILL.md +107 -0
- package/skills/blogwatcher/SKILL.md +69 -0
- package/skills/blucli/SKILL.md +47 -0
- package/skills/bluebubbles/SKILL.md +131 -0
- package/skills/camsnap/SKILL.md +45 -0
- package/skills/canvas/SKILL.md +198 -0
- package/skills/clawhub/SKILL.md +77 -0
- package/skills/coding-agent/SKILL.md +295 -0
- package/skills/discord/SKILL.md +197 -0
- package/skills/eightctl/SKILL.md +50 -0
- package/skills/gemini/SKILL.md +43 -0
- package/skills/gh-issues/SKILL.md +865 -0
- package/skills/gifgrep/SKILL.md +79 -0
- package/skills/github/SKILL.md +163 -0
- package/skills/gog/SKILL.md +116 -0
- package/skills/goplaces/SKILL.md +52 -0
- package/skills/healthcheck/SKILL.md +245 -0
- package/skills/himalaya/SKILL.md +257 -0
- package/skills/himalaya/references/configuration.md +184 -0
- package/skills/himalaya/references/message-composition.md +199 -0
- package/skills/imsg/SKILL.md +122 -0
- package/skills/mcporter/SKILL.md +61 -0
- package/skills/model-usage/SKILL.md +69 -0
- package/skills/model-usage/references/codexbar-cli.md +33 -0
- package/skills/model-usage/scripts/model_usage.py +320 -0
- package/skills/model-usage/scripts/test_model_usage.py +40 -0
- package/skills/nano-pdf/SKILL.md +38 -0
- package/skills/node-connect/SKILL.md +142 -0
- package/skills/notion/SKILL.md +174 -0
- package/skills/obsidian/SKILL.md +81 -0
- package/skills/openai-image-gen/SKILL.md +92 -0
- package/skills/openai-image-gen/scripts/gen.py +328 -0
- package/skills/openai-image-gen/scripts/test_gen.py +140 -0
- package/skills/openai-whisper/SKILL.md +38 -0
- package/skills/openai-whisper-api/SKILL.md +52 -0
- package/skills/openai-whisper-api/scripts/transcribe.sh +85 -0
- package/skills/openhue/SKILL.md +112 -0
- package/skills/oracle/SKILL.md +125 -0
- package/skills/ordercli/SKILL.md +78 -0
- package/skills/peekaboo/SKILL.md +190 -0
- package/skills/sag/SKILL.md +87 -0
- package/skills/session-logs/SKILL.md +115 -0
- package/skills/sherpa-onnx-tts/SKILL.md +103 -0
- package/skills/sherpa-onnx-tts/bin/sherpa-onnx-tts +178 -0
- package/skills/skill-creator/SKILL.md +372 -0
- package/skills/skill-creator/license.txt +202 -0
- package/skills/skill-creator/scripts/init_skill.py +378 -0
- package/skills/skill-creator/scripts/package_skill.py +139 -0
- package/skills/skill-creator/scripts/quick_validate.py +159 -0
- package/skills/skill-creator/scripts/test_package_skill.py +160 -0
- package/skills/skill-creator/scripts/test_quick_validate.py +72 -0
- package/skills/slack/SKILL.md +144 -0
- package/skills/songsee/SKILL.md +49 -0
- package/skills/sonoscli/SKILL.md +65 -0
- package/skills/spotify-player/SKILL.md +64 -0
- package/skills/summarize/SKILL.md +87 -0
- package/skills/things-mac/SKILL.md +86 -0
- package/skills/tmux/SKILL.md +153 -0
- package/skills/tmux/scripts/find-sessions.sh +112 -0
- package/skills/tmux/scripts/wait-for-text.sh +83 -0
- package/skills/trello/SKILL.md +95 -0
- package/skills/video-frames/SKILL.md +46 -0
- package/skills/video-frames/scripts/frame.sh +81 -0
- package/skills/voice-call/SKILL.md +45 -0
- package/skills/wacli/SKILL.md +72 -0
- package/skills/weather/SKILL.md +112 -0
- package/skills/xurl/SKILL.md +461 -0
|
@@ -0,0 +1,394 @@
|
|
|
1
|
+
---
|
|
2
|
+
summary: "Inbound image/audio/video understanding (optional) with provider + CLI fallbacks"
|
|
3
|
+
read_when:
|
|
4
|
+
- Designing or refactoring media understanding
|
|
5
|
+
- Tuning inbound audio/video/image preprocessing
|
|
6
|
+
title: "Media Understanding"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Media Understanding - Inbound (2026-01-17)
|
|
10
|
+
|
|
11
|
+
QuantumClaw can **summarize inbound media** (image/audio/video) before the reply pipeline runs. It auto‑detects when local tools or provider keys are available, and can be disabled or customized. If understanding is off, models still receive the original files/URLs as usual.
|
|
12
|
+
|
|
13
|
+
Vendor-specific media behavior is registered by vendor plugins, while QuantumClaw
|
|
14
|
+
core owns the shared `tools.media` config, fallback order, and reply-pipeline
|
|
15
|
+
integration.
|
|
16
|
+
|
|
17
|
+
## Goals
|
|
18
|
+
|
|
19
|
+
- Optional: pre‑digest inbound media into short text for faster routing + better command parsing.
|
|
20
|
+
- Preserve original media delivery to the model (always).
|
|
21
|
+
- Support **provider APIs** and **CLI fallbacks**.
|
|
22
|
+
- Allow multiple models with ordered fallback (error/size/timeout).
|
|
23
|
+
|
|
24
|
+
## High-level behavior
|
|
25
|
+
|
|
26
|
+
1. Collect inbound attachments (`MediaPaths`, `MediaUrls`, `MediaTypes`).
|
|
27
|
+
2. For each enabled capability (image/audio/video), select attachments per policy (default: **first**).
|
|
28
|
+
3. Choose the first eligible model entry (size + capability + auth).
|
|
29
|
+
4. If a model fails or the media is too large, **fall back to the next entry**.
|
|
30
|
+
5. On success:
|
|
31
|
+
- `Body` becomes `[Image]`, `[Audio]`, or `[Video]` block.
|
|
32
|
+
- Audio sets `{{Transcript}}`; command parsing uses caption text when present,
|
|
33
|
+
otherwise the transcript.
|
|
34
|
+
- Captions are preserved as `User text:` inside the block.
|
|
35
|
+
|
|
36
|
+
If understanding fails or is disabled, **the reply flow continues** with the original body + attachments.
|
|
37
|
+
|
|
38
|
+
## Config overview
|
|
39
|
+
|
|
40
|
+
`tools.media` supports **shared models** plus per‑capability overrides:
|
|
41
|
+
|
|
42
|
+
- `tools.media.models`: shared model list (use `capabilities` to gate).
|
|
43
|
+
- `tools.media.image` / `tools.media.audio` / `tools.media.video`:
|
|
44
|
+
- defaults (`prompt`, `maxChars`, `maxBytes`, `timeoutSeconds`, `language`)
|
|
45
|
+
- provider overrides (`baseUrl`, `headers`, `providerOptions`)
|
|
46
|
+
- Deepgram audio options via `tools.media.audio.providerOptions.deepgram`
|
|
47
|
+
- audio transcript echo controls (`echoTranscript`, default `false`; `echoFormat`)
|
|
48
|
+
- optional **per‑capability `models` list** (preferred before shared models)
|
|
49
|
+
- `attachments` policy (`mode`, `maxAttachments`, `prefer`)
|
|
50
|
+
- `scope` (optional gating by channel/chatType/session key)
|
|
51
|
+
- `tools.media.concurrency`: max concurrent capability runs (default **2**).
|
|
52
|
+
|
|
53
|
+
```json5
|
|
54
|
+
{
|
|
55
|
+
tools: {
|
|
56
|
+
media: {
|
|
57
|
+
models: [
|
|
58
|
+
/* shared list */
|
|
59
|
+
],
|
|
60
|
+
image: {
|
|
61
|
+
/* optional overrides */
|
|
62
|
+
},
|
|
63
|
+
audio: {
|
|
64
|
+
/* optional overrides */
|
|
65
|
+
echoTranscript: true,
|
|
66
|
+
echoFormat: '📝 "{transcript}"',
|
|
67
|
+
},
|
|
68
|
+
video: {
|
|
69
|
+
/* optional overrides */
|
|
70
|
+
},
|
|
71
|
+
},
|
|
72
|
+
},
|
|
73
|
+
}
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
### Model entries
|
|
77
|
+
|
|
78
|
+
Each `models[]` entry can be **provider** or **CLI**:
|
|
79
|
+
|
|
80
|
+
```json5
|
|
81
|
+
{
|
|
82
|
+
type: "provider", // default if omitted
|
|
83
|
+
provider: "openai",
|
|
84
|
+
model: "gpt-5.2",
|
|
85
|
+
prompt: "Describe the image in <= 500 chars.",
|
|
86
|
+
maxChars: 500,
|
|
87
|
+
maxBytes: 10485760,
|
|
88
|
+
timeoutSeconds: 60,
|
|
89
|
+
capabilities: ["image"], // optional, used for multi‑modal entries
|
|
90
|
+
profile: "vision-profile",
|
|
91
|
+
preferredProfile: "vision-fallback",
|
|
92
|
+
}
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
```json5
|
|
96
|
+
{
|
|
97
|
+
type: "cli",
|
|
98
|
+
command: "gemini",
|
|
99
|
+
args: [
|
|
100
|
+
"-m",
|
|
101
|
+
"gemini-3-flash",
|
|
102
|
+
"--allowed-tools",
|
|
103
|
+
"read_file",
|
|
104
|
+
"Read the media at {{MediaPath}} and describe it in <= {{MaxChars}} characters.",
|
|
105
|
+
],
|
|
106
|
+
maxChars: 500,
|
|
107
|
+
maxBytes: 52428800,
|
|
108
|
+
timeoutSeconds: 120,
|
|
109
|
+
capabilities: ["video", "image"],
|
|
110
|
+
}
|
|
111
|
+
```
|
|
112
|
+
|
|
113
|
+
CLI templates can also use:
|
|
114
|
+
|
|
115
|
+
- `{{MediaDir}}` (directory containing the media file)
|
|
116
|
+
- `{{OutputDir}}` (scratch dir created for this run)
|
|
117
|
+
- `{{OutputBase}}` (scratch file base path, no extension)
|
|
118
|
+
|
|
119
|
+
## Defaults and limits
|
|
120
|
+
|
|
121
|
+
Recommended defaults:
|
|
122
|
+
|
|
123
|
+
- `maxChars`: **500** for image/video (short, command‑friendly)
|
|
124
|
+
- `maxChars`: **unset** for audio (full transcript unless you set a limit)
|
|
125
|
+
- `maxBytes`:
|
|
126
|
+
- image: **10MB**
|
|
127
|
+
- audio: **20MB**
|
|
128
|
+
- video: **50MB**
|
|
129
|
+
|
|
130
|
+
Rules:
|
|
131
|
+
|
|
132
|
+
- If media exceeds `maxBytes`, that model is skipped and the **next model is tried**.
|
|
133
|
+
- Audio files smaller than **1024 bytes** are treated as empty/corrupt and skipped before provider/CLI transcription.
|
|
134
|
+
- If the model returns more than `maxChars`, output is trimmed.
|
|
135
|
+
- `prompt` defaults to simple “Describe the {media}.” plus the `maxChars` guidance (image/video only).
|
|
136
|
+
- If `<capability>.enabled: true` but no models are configured, QuantumClaw tries the
|
|
137
|
+
**active reply model** when its provider supports the capability.
|
|
138
|
+
|
|
139
|
+
### Auto-detect media understanding (default)
|
|
140
|
+
|
|
141
|
+
If `tools.media.<capability>.enabled` is **not** set to `false` and you haven’t
|
|
142
|
+
configured models, QuantumClaw auto-detects in this order and **stops at the first
|
|
143
|
+
working option**:
|
|
144
|
+
|
|
145
|
+
1. **Local CLIs** (audio only; if installed)
|
|
146
|
+
- `sherpa-onnx-offline` (requires `SHERPA_ONNX_MODEL_DIR` with encoder/decoder/joiner/tokens)
|
|
147
|
+
- `whisper-cli` (`whisper-cpp`; uses `WHISPER_CPP_MODEL` or the bundled tiny model)
|
|
148
|
+
- `whisper` (Python CLI; downloads models automatically)
|
|
149
|
+
2. **Gemini CLI** (`gemini`) using `read_many_files`
|
|
150
|
+
3. **Provider keys**
|
|
151
|
+
- Audio: OpenAI → Groq → Deepgram → Google
|
|
152
|
+
- Image: OpenAI → Anthropic → Google → MiniMax
|
|
153
|
+
- Video: Google
|
|
154
|
+
|
|
155
|
+
To disable auto-detection, set:
|
|
156
|
+
|
|
157
|
+
```json5
|
|
158
|
+
{
|
|
159
|
+
tools: {
|
|
160
|
+
media: {
|
|
161
|
+
audio: {
|
|
162
|
+
enabled: false,
|
|
163
|
+
},
|
|
164
|
+
},
|
|
165
|
+
},
|
|
166
|
+
}
|
|
167
|
+
```
|
|
168
|
+
|
|
169
|
+
Note: Binary detection is best-effort across macOS/Linux/Windows; ensure the CLI is on `PATH` (we expand `~`), or set an explicit CLI model with a full command path.
|
|
170
|
+
|
|
171
|
+
### Proxy environment support (provider models)
|
|
172
|
+
|
|
173
|
+
When provider-based **audio** and **video** media understanding is enabled, QuantumClaw
|
|
174
|
+
honors standard outbound proxy environment variables for provider HTTP calls:
|
|
175
|
+
|
|
176
|
+
- `HTTPS_PROXY`
|
|
177
|
+
- `HTTP_PROXY`
|
|
178
|
+
- `https_proxy`
|
|
179
|
+
- `http_proxy`
|
|
180
|
+
|
|
181
|
+
If no proxy env vars are set, media understanding uses direct egress.
|
|
182
|
+
If the proxy value is malformed, QuantumClaw logs a warning and falls back to direct
|
|
183
|
+
fetch.
|
|
184
|
+
|
|
185
|
+
## Capabilities (optional)
|
|
186
|
+
|
|
187
|
+
If you set `capabilities`, the entry only runs for those media types. For shared
|
|
188
|
+
lists, QuantumClaw can infer defaults:
|
|
189
|
+
|
|
190
|
+
- `openai`, `anthropic`, `minimax`: **image**
|
|
191
|
+
- `moonshot`: **image + video**
|
|
192
|
+
- `google` (Gemini API): **image + audio + video**
|
|
193
|
+
- `mistral`: **audio**
|
|
194
|
+
- `zai`: **image**
|
|
195
|
+
- `groq`: **audio**
|
|
196
|
+
- `deepgram`: **audio**
|
|
197
|
+
|
|
198
|
+
For CLI entries, **set `capabilities` explicitly** to avoid surprising matches.
|
|
199
|
+
If you omit `capabilities`, the entry is eligible for the list it appears in.
|
|
200
|
+
|
|
201
|
+
## Provider support matrix (QuantumClaw integrations)
|
|
202
|
+
|
|
203
|
+
| Capability | Provider integration | Notes |
|
|
204
|
+
| ---------- | -------------------------------------------------- | ----------------------------------------------------------------------- |
|
|
205
|
+
| Image | OpenAI, Anthropic, Google, MiniMax, Moonshot, Z.AI | Vendor plugins register image support against core media understanding. |
|
|
206
|
+
| Audio | OpenAI, Groq, Deepgram, Google, Mistral | Provider transcription (Whisper/Deepgram/Gemini/Voxtral). |
|
|
207
|
+
| Video | Google, Moonshot | Provider video understanding via vendor plugins. |
|
|
208
|
+
|
|
209
|
+
## Model selection guidance
|
|
210
|
+
|
|
211
|
+
- Prefer the strongest latest-generation model available for each media capability when quality and safety matter.
|
|
212
|
+
- For tool-enabled agents handling untrusted inputs, avoid older/weaker media models.
|
|
213
|
+
- Keep at least one fallback per capability for availability (quality model + faster/cheaper model).
|
|
214
|
+
- CLI fallbacks (`whisper-cli`, `whisper`, `gemini`) are useful when provider APIs are unavailable.
|
|
215
|
+
- `parakeet-mlx` note: with `--output-dir`, QuantumClaw reads `<output-dir>/<media-basename>.txt` when output format is `txt` (or unspecified); non-`txt` formats fall back to stdout.
|
|
216
|
+
|
|
217
|
+
## Attachment policy
|
|
218
|
+
|
|
219
|
+
Per‑capability `attachments` controls which attachments are processed:
|
|
220
|
+
|
|
221
|
+
- `mode`: `first` (default) or `all`
|
|
222
|
+
- `maxAttachments`: cap the number processed (default **1**)
|
|
223
|
+
- `prefer`: `first`, `last`, `path`, `url`
|
|
224
|
+
|
|
225
|
+
When `mode: "all"`, outputs are labeled `[Image 1/2]`, `[Audio 2/2]`, etc.
|
|
226
|
+
|
|
227
|
+
## Config examples
|
|
228
|
+
|
|
229
|
+
### 1) Shared models list + overrides
|
|
230
|
+
|
|
231
|
+
```json5
|
|
232
|
+
{
|
|
233
|
+
tools: {
|
|
234
|
+
media: {
|
|
235
|
+
models: [
|
|
236
|
+
{ provider: "openai", model: "gpt-5.2", capabilities: ["image"] },
|
|
237
|
+
{
|
|
238
|
+
provider: "google",
|
|
239
|
+
model: "gemini-3-flash-preview",
|
|
240
|
+
capabilities: ["image", "audio", "video"],
|
|
241
|
+
},
|
|
242
|
+
{
|
|
243
|
+
type: "cli",
|
|
244
|
+
command: "gemini",
|
|
245
|
+
args: [
|
|
246
|
+
"-m",
|
|
247
|
+
"gemini-3-flash",
|
|
248
|
+
"--allowed-tools",
|
|
249
|
+
"read_file",
|
|
250
|
+
"Read the media at {{MediaPath}} and describe it in <= {{MaxChars}} characters.",
|
|
251
|
+
],
|
|
252
|
+
capabilities: ["image", "video"],
|
|
253
|
+
},
|
|
254
|
+
],
|
|
255
|
+
audio: {
|
|
256
|
+
attachments: { mode: "all", maxAttachments: 2 },
|
|
257
|
+
},
|
|
258
|
+
video: {
|
|
259
|
+
maxChars: 500,
|
|
260
|
+
},
|
|
261
|
+
},
|
|
262
|
+
},
|
|
263
|
+
}
|
|
264
|
+
```
|
|
265
|
+
|
|
266
|
+
### 2) Audio + Video only (image off)
|
|
267
|
+
|
|
268
|
+
```json5
|
|
269
|
+
{
|
|
270
|
+
tools: {
|
|
271
|
+
media: {
|
|
272
|
+
audio: {
|
|
273
|
+
enabled: true,
|
|
274
|
+
models: [
|
|
275
|
+
{ provider: "openai", model: "gpt-4o-mini-transcribe" },
|
|
276
|
+
{
|
|
277
|
+
type: "cli",
|
|
278
|
+
command: "whisper",
|
|
279
|
+
args: ["--model", "base", "{{MediaPath}}"],
|
|
280
|
+
},
|
|
281
|
+
],
|
|
282
|
+
},
|
|
283
|
+
video: {
|
|
284
|
+
enabled: true,
|
|
285
|
+
maxChars: 500,
|
|
286
|
+
models: [
|
|
287
|
+
{ provider: "google", model: "gemini-3-flash-preview" },
|
|
288
|
+
{
|
|
289
|
+
type: "cli",
|
|
290
|
+
command: "gemini",
|
|
291
|
+
args: [
|
|
292
|
+
"-m",
|
|
293
|
+
"gemini-3-flash",
|
|
294
|
+
"--allowed-tools",
|
|
295
|
+
"read_file",
|
|
296
|
+
"Read the media at {{MediaPath}} and describe it in <= {{MaxChars}} characters.",
|
|
297
|
+
],
|
|
298
|
+
},
|
|
299
|
+
],
|
|
300
|
+
},
|
|
301
|
+
},
|
|
302
|
+
},
|
|
303
|
+
}
|
|
304
|
+
```
|
|
305
|
+
|
|
306
|
+
### 3) Optional image understanding
|
|
307
|
+
|
|
308
|
+
```json5
|
|
309
|
+
{
|
|
310
|
+
tools: {
|
|
311
|
+
media: {
|
|
312
|
+
image: {
|
|
313
|
+
enabled: true,
|
|
314
|
+
maxBytes: 10485760,
|
|
315
|
+
maxChars: 500,
|
|
316
|
+
models: [
|
|
317
|
+
{ provider: "openai", model: "gpt-5.2" },
|
|
318
|
+
{ provider: "anthropic", model: "claude-opus-4-6" },
|
|
319
|
+
{
|
|
320
|
+
type: "cli",
|
|
321
|
+
command: "gemini",
|
|
322
|
+
args: [
|
|
323
|
+
"-m",
|
|
324
|
+
"gemini-3-flash",
|
|
325
|
+
"--allowed-tools",
|
|
326
|
+
"read_file",
|
|
327
|
+
"Read the media at {{MediaPath}} and describe it in <= {{MaxChars}} characters.",
|
|
328
|
+
],
|
|
329
|
+
},
|
|
330
|
+
],
|
|
331
|
+
},
|
|
332
|
+
},
|
|
333
|
+
},
|
|
334
|
+
}
|
|
335
|
+
```
|
|
336
|
+
|
|
337
|
+
### 4) Multi-modal single entry (explicit capabilities)
|
|
338
|
+
|
|
339
|
+
```json5
|
|
340
|
+
{
|
|
341
|
+
tools: {
|
|
342
|
+
media: {
|
|
343
|
+
image: {
|
|
344
|
+
models: [
|
|
345
|
+
{
|
|
346
|
+
provider: "google",
|
|
347
|
+
model: "gemini-3.1-pro-preview",
|
|
348
|
+
capabilities: ["image", "video", "audio"],
|
|
349
|
+
},
|
|
350
|
+
],
|
|
351
|
+
},
|
|
352
|
+
audio: {
|
|
353
|
+
models: [
|
|
354
|
+
{
|
|
355
|
+
provider: "google",
|
|
356
|
+
model: "gemini-3.1-pro-preview",
|
|
357
|
+
capabilities: ["image", "video", "audio"],
|
|
358
|
+
},
|
|
359
|
+
],
|
|
360
|
+
},
|
|
361
|
+
video: {
|
|
362
|
+
models: [
|
|
363
|
+
{
|
|
364
|
+
provider: "google",
|
|
365
|
+
model: "gemini-3.1-pro-preview",
|
|
366
|
+
capabilities: ["image", "video", "audio"],
|
|
367
|
+
},
|
|
368
|
+
],
|
|
369
|
+
},
|
|
370
|
+
},
|
|
371
|
+
},
|
|
372
|
+
}
|
|
373
|
+
```
|
|
374
|
+
|
|
375
|
+
## Status output
|
|
376
|
+
|
|
377
|
+
When media understanding runs, `/status` includes a short summary line:
|
|
378
|
+
|
|
379
|
+
```
|
|
380
|
+
📎 Media: image ok (openai/gpt-5.2) · audio skipped (maxBytes)
|
|
381
|
+
```
|
|
382
|
+
|
|
383
|
+
This shows per‑capability outcomes and the chosen provider/model when applicable.
|
|
384
|
+
|
|
385
|
+
## Notes
|
|
386
|
+
|
|
387
|
+
- Understanding is **best‑effort**. Errors do not block replies.
|
|
388
|
+
- Attachments are still passed to models even when understanding is disabled.
|
|
389
|
+
- Use `scope` to limit where understanding runs (e.g. only DMs).
|
|
390
|
+
|
|
391
|
+
## Related docs
|
|
392
|
+
|
|
393
|
+
- [Configuration](/gateway/configuration)
|
|
394
|
+
- [Image & Media Support](/nodes/images)
|
|
@@ -0,0 +1,92 @@
|
|
|
1
|
+
---
|
|
2
|
+
summary: "Talk mode: continuous speech conversations with ElevenLabs TTS"
|
|
3
|
+
read_when:
|
|
4
|
+
- Implementing Talk mode on macOS/iOS/Android
|
|
5
|
+
- Changing voice/TTS/interrupt behavior
|
|
6
|
+
title: "Talk Mode"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Talk Mode
|
|
10
|
+
|
|
11
|
+
Talk mode is a continuous voice conversation loop:
|
|
12
|
+
|
|
13
|
+
1. Listen for speech
|
|
14
|
+
2. Send transcript to the model (main session, chat.send)
|
|
15
|
+
3. Wait for the response
|
|
16
|
+
4. Speak it via ElevenLabs (streaming playback)
|
|
17
|
+
|
|
18
|
+
## Behavior (macOS)
|
|
19
|
+
|
|
20
|
+
- **Always-on overlay** while Talk mode is enabled.
|
|
21
|
+
- **Listening → Thinking → Speaking** phase transitions.
|
|
22
|
+
- On a **short pause** (silence window), the current transcript is sent.
|
|
23
|
+
- Replies are **written to WebChat** (same as typing).
|
|
24
|
+
- **Interrupt on speech** (default on): if the user starts talking while the assistant is speaking, we stop playback and note the interruption timestamp for the next prompt.
|
|
25
|
+
|
|
26
|
+
## Voice directives in replies
|
|
27
|
+
|
|
28
|
+
The assistant may prefix its reply with a **single JSON line** to control voice:
|
|
29
|
+
|
|
30
|
+
```json
|
|
31
|
+
{ "voice": "<voice-id>", "once": true }
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
Rules:
|
|
35
|
+
|
|
36
|
+
- First non-empty line only.
|
|
37
|
+
- Unknown keys are ignored.
|
|
38
|
+
- `once: true` applies to the current reply only.
|
|
39
|
+
- Without `once`, the voice becomes the new default for Talk mode.
|
|
40
|
+
- The JSON line is stripped before TTS playback.
|
|
41
|
+
|
|
42
|
+
Supported keys:
|
|
43
|
+
|
|
44
|
+
- `voice` / `voice_id` / `voiceId`
|
|
45
|
+
- `model` / `model_id` / `modelId`
|
|
46
|
+
- `speed`, `rate` (WPM), `stability`, `similarity`, `style`, `speakerBoost`
|
|
47
|
+
- `seed`, `normalize`, `lang`, `output_format`, `latency_tier`
|
|
48
|
+
- `once`
|
|
49
|
+
|
|
50
|
+
## Config (`~/.quantumclaw/quantumclaw.json`)
|
|
51
|
+
|
|
52
|
+
```json5
|
|
53
|
+
{
|
|
54
|
+
talk: {
|
|
55
|
+
voiceId: "elevenlabs_voice_id",
|
|
56
|
+
modelId: "eleven_v3",
|
|
57
|
+
outputFormat: "mp3_44100_128",
|
|
58
|
+
apiKey: "elevenlabs_api_key",
|
|
59
|
+
silenceTimeoutMs: 1500,
|
|
60
|
+
interruptOnSpeech: true,
|
|
61
|
+
},
|
|
62
|
+
}
|
|
63
|
+
```
|
|
64
|
+
|
|
65
|
+
Defaults:
|
|
66
|
+
|
|
67
|
+
- `interruptOnSpeech`: true
|
|
68
|
+
- `silenceTimeoutMs`: when unset, Talk keeps the platform default pause window before sending the transcript (`700 ms on macOS and Android, 900 ms on iOS`)
|
|
69
|
+
- `voiceId`: falls back to `ELEVENLABS_VOICE_ID` / `SAG_VOICE_ID` (or first ElevenLabs voice when API key is available)
|
|
70
|
+
- `modelId`: defaults to `eleven_v3` when unset
|
|
71
|
+
- `apiKey`: falls back to `ELEVENLABS_API_KEY` (or gateway shell profile if available)
|
|
72
|
+
- `outputFormat`: defaults to `pcm_44100` on macOS/iOS and `pcm_24000` on Android (set `mp3_*` to force MP3 streaming)
|
|
73
|
+
|
|
74
|
+
## macOS UI
|
|
75
|
+
|
|
76
|
+
- Menu bar toggle: **Talk**
|
|
77
|
+
- Config tab: **Talk Mode** group (voice id + interrupt toggle)
|
|
78
|
+
- Overlay:
|
|
79
|
+
- **Listening**: cloud pulses with mic level
|
|
80
|
+
- **Thinking**: sinking animation
|
|
81
|
+
- **Speaking**: radiating rings
|
|
82
|
+
- Click cloud: stop speaking
|
|
83
|
+
- Click X: exit Talk mode
|
|
84
|
+
|
|
85
|
+
## Notes
|
|
86
|
+
|
|
87
|
+
- Requires Speech + Microphone permissions.
|
|
88
|
+
- Uses `chat.send` against session key `main`.
|
|
89
|
+
- TTS uses ElevenLabs streaming API with `ELEVENLABS_API_KEY` and incremental playback on macOS/iOS/Android for lower latency.
|
|
90
|
+
- `stability` for `eleven_v3` is validated to `0.0`, `0.5`, or `1.0`; other models accept `0..1`.
|
|
91
|
+
- `latency_tier` is validated to `0..4` when set.
|
|
92
|
+
- Android supports `pcm_16000`, `pcm_22050`, `pcm_24000`, and `pcm_44100` output formats for low-latency AudioTrack streaming.
|
|
@@ -0,0 +1,114 @@
|
|
|
1
|
+
---
|
|
2
|
+
summary: "Troubleshoot node pairing, foreground requirements, permissions, and tool failures"
|
|
3
|
+
read_when:
|
|
4
|
+
- Node is connected but camera/canvas/screen/exec tools fail
|
|
5
|
+
- You need the node pairing versus approvals mental model
|
|
6
|
+
title: "Node Troubleshooting"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Node troubleshooting
|
|
10
|
+
|
|
11
|
+
Use this page when a node is visible in status but node tools fail.
|
|
12
|
+
|
|
13
|
+
## Command ladder
|
|
14
|
+
|
|
15
|
+
```bash
|
|
16
|
+
quantumclaw status
|
|
17
|
+
quantumclaw gateway status
|
|
18
|
+
quantumclaw logs --follow
|
|
19
|
+
quantumclaw doctor
|
|
20
|
+
quantumclaw channels status --probe
|
|
21
|
+
```
|
|
22
|
+
|
|
23
|
+
Then run node specific checks:
|
|
24
|
+
|
|
25
|
+
```bash
|
|
26
|
+
quantumclaw nodes status
|
|
27
|
+
quantumclaw nodes describe --node <idOrNameOrIp>
|
|
28
|
+
quantumclaw approvals get --node <idOrNameOrIp>
|
|
29
|
+
```
|
|
30
|
+
|
|
31
|
+
Healthy signals:
|
|
32
|
+
|
|
33
|
+
- Node is connected and paired for role `node`.
|
|
34
|
+
- `nodes describe` includes the capability you are calling.
|
|
35
|
+
- Exec approvals show expected mode/allowlist.
|
|
36
|
+
|
|
37
|
+
## Foreground requirements
|
|
38
|
+
|
|
39
|
+
`canvas.*`, `camera.*`, and `screen.*` are foreground only on iOS/Android nodes.
|
|
40
|
+
|
|
41
|
+
Quick check and fix:
|
|
42
|
+
|
|
43
|
+
```bash
|
|
44
|
+
quantumclaw nodes describe --node <idOrNameOrIp>
|
|
45
|
+
quantumclaw nodes canvas snapshot --node <idOrNameOrIp>
|
|
46
|
+
quantumclaw logs --follow
|
|
47
|
+
```
|
|
48
|
+
|
|
49
|
+
If you see `NODE_BACKGROUND_UNAVAILABLE`, bring the node app to the foreground and retry.
|
|
50
|
+
|
|
51
|
+
## Permissions matrix
|
|
52
|
+
|
|
53
|
+
| Capability | iOS | Android | macOS node app | Typical failure code |
|
|
54
|
+
| ---------------------------- | --------------------------------------- | -------------------------------------------- | ----------------------------- | ------------------------------ |
|
|
55
|
+
| `camera.snap`, `camera.clip` | Camera (+ mic for clip audio) | Camera (+ mic for clip audio) | Camera (+ mic for clip audio) | `*_PERMISSION_REQUIRED` |
|
|
56
|
+
| `screen.record` | Screen Recording (+ mic optional) | Screen capture prompt (+ mic optional) | Screen Recording | `*_PERMISSION_REQUIRED` |
|
|
57
|
+
| `location.get` | While Using or Always (depends on mode) | Foreground/Background location based on mode | Location permission | `LOCATION_PERMISSION_REQUIRED` |
|
|
58
|
+
| `system.run` | n/a (node host path) | n/a (node host path) | Exec approvals required | `SYSTEM_RUN_DENIED` |
|
|
59
|
+
|
|
60
|
+
## Pairing versus approvals
|
|
61
|
+
|
|
62
|
+
These are different gates:
|
|
63
|
+
|
|
64
|
+
1. **Device pairing**: can this node connect to the gateway?
|
|
65
|
+
2. **Exec approvals**: can this node run a specific shell command?
|
|
66
|
+
|
|
67
|
+
Quick checks:
|
|
68
|
+
|
|
69
|
+
```bash
|
|
70
|
+
quantumclaw devices list
|
|
71
|
+
quantumclaw nodes status
|
|
72
|
+
quantumclaw approvals get --node <idOrNameOrIp>
|
|
73
|
+
quantumclaw approvals allowlist add --node <idOrNameOrIp> "/usr/bin/uname"
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
If pairing is missing, approve the node device first.
|
|
77
|
+
If pairing is fine but `system.run` fails, fix exec approvals/allowlist.
|
|
78
|
+
|
|
79
|
+
## Common node error codes
|
|
80
|
+
|
|
81
|
+
- `NODE_BACKGROUND_UNAVAILABLE` → app is backgrounded; bring it foreground.
|
|
82
|
+
- `CAMERA_DISABLED` → camera toggle disabled in node settings.
|
|
83
|
+
- `*_PERMISSION_REQUIRED` → OS permission missing/denied.
|
|
84
|
+
- `LOCATION_DISABLED` → location mode is off.
|
|
85
|
+
- `LOCATION_PERMISSION_REQUIRED` → requested location mode not granted.
|
|
86
|
+
- `LOCATION_BACKGROUND_UNAVAILABLE` → app is backgrounded but only While Using permission exists.
|
|
87
|
+
- `SYSTEM_RUN_DENIED: approval required` → exec request needs explicit approval.
|
|
88
|
+
- `SYSTEM_RUN_DENIED: allowlist miss` → command blocked by allowlist mode.
|
|
89
|
+
On Windows node hosts, shell-wrapper forms like `cmd.exe /c ...` are treated as allowlist misses in
|
|
90
|
+
allowlist mode unless approved via ask flow.
|
|
91
|
+
|
|
92
|
+
## Fast recovery loop
|
|
93
|
+
|
|
94
|
+
```bash
|
|
95
|
+
quantumclaw nodes status
|
|
96
|
+
quantumclaw nodes describe --node <idOrNameOrIp>
|
|
97
|
+
quantumclaw approvals get --node <idOrNameOrIp>
|
|
98
|
+
quantumclaw logs --follow
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
If still stuck:
|
|
102
|
+
|
|
103
|
+
- Re-approve device pairing.
|
|
104
|
+
- Re-open node app (foreground).
|
|
105
|
+
- Re-grant OS permissions.
|
|
106
|
+
- Recreate/adjust exec approval policy.
|
|
107
|
+
|
|
108
|
+
Related:
|
|
109
|
+
|
|
110
|
+
- [/nodes/index](/nodes/index)
|
|
111
|
+
- [/nodes/camera](/nodes/camera)
|
|
112
|
+
- [/nodes/location-command](/nodes/location-command)
|
|
113
|
+
- [/tools/exec-approvals](/tools/exec-approvals)
|
|
114
|
+
- [/gateway/pairing](/gateway/pairing)
|
|
@@ -0,0 +1,66 @@
|
|
|
1
|
+
---
|
|
2
|
+
summary: "Global voice wake words (Gateway-owned) and how they sync across nodes"
|
|
3
|
+
read_when:
|
|
4
|
+
- Changing voice wake words behavior or defaults
|
|
5
|
+
- Adding new node platforms that need wake word sync
|
|
6
|
+
title: "Voice Wake"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Voice Wake (Global Wake Words)
|
|
10
|
+
|
|
11
|
+
QuantumClaw treats **wake words as a single global list** owned by the **Gateway**.
|
|
12
|
+
|
|
13
|
+
- There are **no per-node custom wake words**.
|
|
14
|
+
- **Any node/app UI may edit** the list; changes are persisted by the Gateway and broadcast to everyone.
|
|
15
|
+
- macOS and iOS keep local **Voice Wake enabled/disabled** toggles (local UX + permissions differ).
|
|
16
|
+
- Android currently keeps Voice Wake off and uses a manual mic flow in the Voice tab.
|
|
17
|
+
|
|
18
|
+
## Storage (Gateway host)
|
|
19
|
+
|
|
20
|
+
Wake words are stored on the gateway machine at:
|
|
21
|
+
|
|
22
|
+
- `~/.quantumclaw/settings/voicewake.json`
|
|
23
|
+
|
|
24
|
+
Shape:
|
|
25
|
+
|
|
26
|
+
```json
|
|
27
|
+
{ "triggers": ["quantumclaw", "claude", "computer"], "updatedAtMs": 1730000000000 }
|
|
28
|
+
```
|
|
29
|
+
|
|
30
|
+
## Protocol
|
|
31
|
+
|
|
32
|
+
### Methods
|
|
33
|
+
|
|
34
|
+
- `voicewake.get` → `{ triggers: string[] }`
|
|
35
|
+
- `voicewake.set` with params `{ triggers: string[] }` → `{ triggers: string[] }`
|
|
36
|
+
|
|
37
|
+
Notes:
|
|
38
|
+
|
|
39
|
+
- Triggers are normalized (trimmed, empties dropped). Empty lists fall back to defaults.
|
|
40
|
+
- Limits are enforced for safety (count/length caps).
|
|
41
|
+
|
|
42
|
+
### Events
|
|
43
|
+
|
|
44
|
+
- `voicewake.changed` payload `{ triggers: string[] }`
|
|
45
|
+
|
|
46
|
+
Who receives it:
|
|
47
|
+
|
|
48
|
+
- All WebSocket clients (macOS app, WebChat, etc.)
|
|
49
|
+
- All connected nodes (iOS/Android), and also on node connect as an initial “current state” push.
|
|
50
|
+
|
|
51
|
+
## Client behavior
|
|
52
|
+
|
|
53
|
+
### macOS app
|
|
54
|
+
|
|
55
|
+
- Uses the global list to gate `VoiceWakeRuntime` triggers.
|
|
56
|
+
- Editing “Trigger words” in Voice Wake settings calls `voicewake.set` and then relies on the broadcast to keep other clients in sync.
|
|
57
|
+
|
|
58
|
+
### iOS node
|
|
59
|
+
|
|
60
|
+
- Uses the global list for `VoiceWakeManager` trigger detection.
|
|
61
|
+
- Editing Wake Words in Settings calls `voicewake.set` (over the Gateway WS) and also keeps local wake-word detection responsive.
|
|
62
|
+
|
|
63
|
+
### Android node
|
|
64
|
+
|
|
65
|
+
- Voice Wake is currently disabled in Android runtime/Settings.
|
|
66
|
+
- Android voice uses manual mic capture in the Voice tab instead of wake-word triggers.
|