npm - @vellumai/assistant - Versions diffs - 0.5.1 → 0.5.3 - Mend

@vellumai/assistant 0.5.1 → 0.5.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (405) hide show

package/ARCHITECTURE.md +163 -54
package/docs/architecture/integrations.md +62 -67
package/docs/credential-execution-service.md +3 -3
package/docs/skills.md +100 -0
package/package.json +1 -1
package/src/__tests__/agent-loop.test.ts +111 -0
package/src/__tests__/always-loaded-tools-guard.test.ts +3 -4
package/src/__tests__/app-builder-tool-scripts.test.ts +13 -151
package/src/__tests__/app-dir-path-guard.test.ts +78 -0
package/src/__tests__/app-executors.test.ts +1 -291
package/src/__tests__/app-git-history.test.ts +4 -4
package/src/__tests__/app-routes-csp.test.ts +1 -0
package/src/__tests__/app-store-dir-names.test.ts +426 -0
package/src/__tests__/attachments-store.test.ts +169 -21
package/src/__tests__/attachments.test.ts +115 -1
package/src/__tests__/btw-routes.test.ts +1 -0
package/src/__tests__/canonical-guardian-store.test.ts +38 -0
package/src/__tests__/channel-reply-delivery.test.ts +55 -0
package/src/__tests__/checker.test.ts +54 -0
package/src/__tests__/claude-code-skill-regression.test.ts +2 -0
package/src/__tests__/claude-code-tool-profiles.test.ts +2 -0
package/src/__tests__/compaction.benchmark.test.ts +2 -1
package/src/__tests__/config-schema-cmd.test.ts +68 -21
package/src/__tests__/config-schema.test.ts +1 -1
package/src/__tests__/conversation-agent-loop-overflow.test.ts +156 -5
package/src/__tests__/conversation-agent-loop.test.ts +297 -2
package/src/__tests__/conversation-attachments.test.ts +17 -19
package/src/__tests__/conversation-disk-view-integration.test.ts +277 -0
package/src/__tests__/conversation-disk-view.test.ts +810 -0
package/src/__tests__/conversation-error.test.ts +1 -1
package/src/__tests__/conversation-fork-crud.test.ts +551 -0
package/src/__tests__/conversation-fork-route.test.ts +386 -0
package/src/__tests__/conversation-history-web-search.test.ts +1 -1
package/src/__tests__/conversation-key-store-disk-view.test.ts +130 -0
package/src/__tests__/conversation-media-retry.test.ts +8 -2
package/src/__tests__/conversation-memory-dirty-tail.test.ts +150 -0
package/src/__tests__/conversation-provider-retry-repair.test.ts +7 -0
package/src/__tests__/conversation-queue.test.ts +36 -1
package/src/__tests__/conversation-routes-disk-view.test.ts +439 -0
package/src/__tests__/conversation-routes-guardian-reply.test.ts +2 -2
package/src/__tests__/conversation-routes-slash-commands.test.ts +2 -7
package/src/__tests__/conversation-runtime-assembly.test.ts +17 -2
package/src/__tests__/conversation-skill-tools.test.ts +4 -9
package/src/__tests__/conversation-slash-commands.test.ts +149 -0
package/src/__tests__/conversation-store.test.ts +24 -21
package/src/__tests__/conversation-surfaces-state-update.test.ts +246 -0
package/src/__tests__/conversation-surfaces-task-progress.test.ts +1 -0
package/src/__tests__/conversation-title-service.test.ts +137 -0
package/src/__tests__/conversation-tool-setup-app-refresh.test.ts +25 -315
package/src/__tests__/conversation-tool-setup-memory-scope.test.ts +1 -0
package/src/__tests__/conversation-tool-setup-side-effect-flag.test.ts +1 -0
package/src/__tests__/conversation-wipe.test.ts +226 -0
package/src/__tests__/conversation-workspace-cache-state.test.ts +44 -2
package/src/__tests__/conversation-workspace-injection.test.ts +11 -0
package/src/__tests__/credential-security-invariants.test.ts +3 -0
package/src/__tests__/credential-vault-unit.test.ts +5 -10
package/src/__tests__/cu-unified-flow.test.ts +1 -0
package/src/__tests__/db-conversation-fork-lineage-migration.test.ts +241 -0
package/src/__tests__/db-llm-request-log-provider-migration.test.ts +214 -0
package/src/__tests__/db-memory-archive-migration.test.ts +372 -0
package/src/__tests__/db-memory-brief-state-migration.test.ts +213 -0
package/src/__tests__/db-memory-reducer-checkpoints.test.ts +273 -0
package/src/__tests__/diagnostics-export.test.ts +70 -1
package/src/__tests__/first-greeting.test.ts +80 -0
package/src/__tests__/gateway-only-guard.test.ts +1 -0
package/src/__tests__/handlers-user-message-approval-consumption.test.ts +3 -7
package/src/__tests__/history-repair.test.ts +32 -10
package/src/__tests__/http-conversation-lineage.test.ts +251 -0
package/src/__tests__/image-source-path-reinject.test.ts +136 -0
package/src/__tests__/inline-command-runner.test.ts +311 -0
package/src/__tests__/inline-skill-authoring-guard.test.ts +220 -0
package/src/__tests__/inline-skill-load-permissions.test.ts +435 -0
package/src/__tests__/list-messages-attachments.test.ts +96 -0
package/src/__tests__/llm-context-normalization.test.ts +1116 -0
package/src/__tests__/llm-context-route-provider.test.ts +217 -0
package/src/__tests__/llm-request-log-turn-query.test.ts +270 -0
package/src/__tests__/media-generate-image.test.ts +47 -94
package/src/__tests__/memory-brief-open-loops.test.ts +530 -0
package/src/__tests__/memory-brief-time.test.ts +285 -0
package/src/__tests__/memory-brief-wrapper.test.ts +311 -0
package/src/__tests__/memory-chunk-archive.test.ts +400 -0
package/src/__tests__/memory-chunk-dual-write.test.ts +453 -0
package/src/__tests__/memory-episode-archive.test.ts +370 -0
package/src/__tests__/memory-episode-dual-write.test.ts +626 -0
package/src/__tests__/memory-lifecycle-e2e.test.ts +3 -1
package/src/__tests__/memory-observation-archive.test.ts +375 -0
package/src/__tests__/memory-observation-dual-write.test.ts +318 -0
package/src/__tests__/memory-recall-quality.test.ts +7 -7
package/src/__tests__/memory-reducer-store.test.ts +728 -0
package/src/__tests__/memory-reducer-types.test.ts +699 -0
package/src/__tests__/memory-reducer.test.ts +698 -0
package/src/__tests__/memory-regressions.test.ts +6 -4
package/src/__tests__/memory-simplified-config.test.ts +281 -0
package/src/__tests__/migration-cross-version-compatibility.test.ts +4 -1
package/src/__tests__/migration-export-http.test.ts +3 -1
package/src/__tests__/migration-import-commit-http.test.ts +18 -4
package/src/__tests__/migration-import-preflight-http.test.ts +1 -3
package/src/__tests__/mime-builder.test.ts +3 -2
package/src/__tests__/non-member-access-request.test.ts +12 -1
package/src/__tests__/notification-decision-identity.test.ts +52 -0
package/src/__tests__/oauth-apps-routes.test.ts +103 -0
package/src/__tests__/oauth-store.test.ts +115 -0
package/src/__tests__/parse-identity-fields.test.ts +129 -0
package/src/__tests__/provider-error-scenarios.test.ts +1 -3
package/src/__tests__/provider-failover-actual-provider.test.ts +66 -0
package/src/__tests__/recording-handler.test.ts +17 -0
package/src/__tests__/registry.test.ts +3 -8
package/src/__tests__/relay-server.test.ts +1 -1
package/src/__tests__/runtime-attachment-metadata.test.ts +7 -3
package/src/__tests__/schema-transforms.test.ts +165 -5
package/src/__tests__/server-history-render.test.ts +2 -2
package/src/__tests__/skill-load-inline-command.test.ts +598 -0
package/src/__tests__/skill-load-inline-includes.test.ts +644 -0
package/src/__tests__/skills-inline-command-expansions.test.ts +301 -0
package/src/__tests__/skills-transitive-hash.test.ts +333 -0
package/src/__tests__/slack-app-setup-skill-regression.test.ts +3 -1
package/src/__tests__/slack-inbound-verification.test.ts +2 -2
package/src/__tests__/starter-task-flow.test.ts +1 -0
package/src/__tests__/suggestion-routes.test.ts +443 -0
package/src/__tests__/swarm-conversation-integration.test.ts +1 -0
package/src/__tests__/swarm-recursion.test.ts +1 -0
package/src/__tests__/swarm-tool.test.ts +1 -0
package/src/__tests__/tool-execution-abort-cleanup.test.ts +1 -0
package/src/__tests__/tool-preview-lifecycle.test.ts +32 -5
package/src/__tests__/top-level-renderer.test.ts +22 -0
package/src/__tests__/turn-boundary-resolution.test.ts +243 -0
package/src/__tests__/vellum-self-knowledge-inline-command.test.ts +320 -0
package/src/__tests__/web-fetch.test.ts +6 -2
package/src/__tests__/workspace-migration-006-services-config.test.ts +335 -0
package/src/__tests__/workspace-migration-007-web-search-provider-rename.test.ts +312 -0
package/src/__tests__/workspace-migration-009-backfill-conversation-disk-view.test.ts +278 -0
package/src/__tests__/workspace-migration-010-app-dir-rename.test.ts +275 -0
package/src/__tests__/workspace-migration-012-rename-conversation-disk-view-dirs.test.ts +77 -0
package/src/__tests__/workspace-migration-013-repair-conversation-disk-view.test.ts +401 -0
package/src/__tests__/workspace-migration-backfill-installation-id.test.ts +328 -0
package/src/__tests__/workspace-migration-seed-device-id.test.ts +6 -10
package/src/agent/attachments.ts +27 -1
package/src/agent/loop.ts +29 -1
package/src/avatar/traits-png-sync.ts +80 -25
package/src/bundler/app-bundler.ts +4 -4
package/src/calls/call-domain.ts +1 -0
package/src/calls/voice-session-bridge.ts +1 -0
package/src/cli/commands/auth.ts +92 -0
package/src/cli/commands/avatar.ts +7 -6
package/src/cli/commands/config.ts +2 -0
package/src/cli/commands/oauth/providers.ts +29 -0
package/src/cli/program.ts +12 -0
package/src/cli.ts +15 -48
package/src/config/bundled-skills/app-builder/SKILL.md +103 -28
package/src/config/bundled-skills/app-builder/TOOLS.json +5 -199
package/src/config/bundled-skills/app-builder/tools/{app-query.ts → app-refresh.ts} +2 -2
package/src/config/bundled-skills/contacts/tools/google-contacts.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-archive.ts +6 -9
package/src/config/bundled-skills/gmail/tools/gmail-attachments.ts +4 -6
package/src/config/bundled-skills/gmail/tools/gmail-draft.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-filters.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-follow-up.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-forward.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-label.ts +4 -6
package/src/config/bundled-skills/gmail/tools/gmail-outreach-scan.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-send-draft.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-sender-digest.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-trash.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-unsubscribe.ts +2 -3
package/src/config/bundled-skills/gmail/tools/gmail-vacation.ts +2 -3
package/src/config/bundled-skills/google-calendar/tools/shared.ts +1 -1
package/src/config/bundled-skills/image-studio/SKILL.md +2 -2
package/src/config/bundled-skills/image-studio/TOOLS.json +2 -2
package/src/config/bundled-skills/image-studio/tools/media-generate-image.ts +45 -72
package/src/config/bundled-skills/media-processing/tools/extract-keyframes.ts +2 -2
package/src/config/bundled-skills/messaging/tools/shared.ts +1 -1
package/src/config/bundled-skills/settings/tools/voice-config-update.ts +19 -3
package/src/config/bundled-skills/skill-management/SKILL.md +1 -1
package/src/config/bundled-skills/skill-management/TOOLS.json +2 -2
package/src/config/bundled-skills/slack/tools/shared.ts +19 -4
package/src/config/bundled-skills/slack/tools/slack-scan-digest.ts +2 -3
package/src/config/bundled-skills/transcribe/SKILL.md +1 -1
package/src/config/bundled-skills/transcribe/TOOLS.json +2 -6
package/src/config/bundled-skills/transcribe/tools/transcribe-media.ts +19 -83
package/src/config/bundled-tool-registry.ts +2 -14
package/src/config/feature-flag-registry.json +24 -0
package/src/config/loader.ts +65 -0
package/src/config/raw-config-utils.ts +58 -0
package/src/config/schema-utils.ts +28 -7
package/src/config/schema.ts +20 -0
package/src/config/schemas/elevenlabs.ts +18 -0
package/src/config/schemas/memory-lifecycle.ts +4 -2
package/src/config/schemas/memory-simplified.ts +101 -0
package/src/config/schemas/memory-storage.ts +1 -1
package/src/config/schemas/memory.ts +4 -0
package/src/config/schemas/services.ts +8 -6
package/src/config/skills.ts +50 -4
package/src/contacts/contact-store.ts +13 -6
package/src/contacts/contacts-write.ts +0 -1
package/src/context/window-manager.ts +13 -2
package/src/daemon/conversation-agent-loop-handlers.ts +54 -8
package/src/daemon/conversation-agent-loop.ts +127 -20
package/src/daemon/conversation-attachments.ts +18 -36
package/src/daemon/conversation-error.ts +2 -1
package/src/daemon/conversation-history.ts +18 -4
package/src/daemon/conversation-lifecycle.ts +50 -16
package/src/daemon/conversation-messaging.ts +70 -26
package/src/daemon/conversation-process.ts +58 -34
package/src/daemon/conversation-runtime-assembly.ts +22 -38
package/src/daemon/conversation-slash.ts +121 -256
package/src/daemon/conversation-surfaces.ts +170 -24
package/src/daemon/conversation-tool-setup.ts +0 -6
package/src/daemon/conversation-workspace.ts +21 -1
package/src/daemon/conversation.ts +69 -30
package/src/daemon/first-greeting.ts +35 -0
package/src/daemon/handlers/config-embeddings.ts +156 -0
package/src/daemon/handlers/config-model.ts +62 -26
package/src/daemon/handlers/conversations.ts +0 -23
package/src/daemon/handlers/identity.ts +12 -1
package/src/daemon/handlers/recording.ts +26 -21
package/src/daemon/host-cu-proxy.ts +2 -2
package/src/daemon/lifecycle.ts +115 -65
package/src/daemon/message-protocol.ts +3 -0
package/src/daemon/message-types/conversations.ts +18 -0
package/src/daemon/message-types/messages.ts +1 -0
package/src/daemon/message-types/shared.ts +2 -0
package/src/daemon/message-types/surfaces.ts +2 -0
package/src/daemon/message-types/upgrades.ts +23 -0
package/src/daemon/server.ts +83 -12
package/src/daemon/shutdown-handlers.ts +8 -5
package/src/daemon/startup-error.ts +9 -0
package/src/daemon/tool-side-effects.ts +11 -28
package/src/events/tool-permission-telemetry-listener.ts +1 -3
package/src/followups/followup-store.ts +47 -1
package/src/instrument.ts +0 -4
package/src/media/app-icon-generator.ts +2 -2
package/src/memory/app-git-service.ts +28 -16
package/src/memory/app-store.ts +230 -41
package/src/memory/archive-store.ts +400 -0
package/src/memory/attachments-store.ts +558 -130
package/src/memory/brief-formatting.ts +33 -0
package/src/memory/brief-open-loops.ts +266 -0
package/src/memory/brief-time.ts +161 -0
package/src/memory/brief.ts +75 -0
package/src/memory/conversation-attention-store.ts +70 -0
package/src/memory/conversation-crud.ts +591 -8
package/src/memory/conversation-directories.ts +125 -0
package/src/memory/conversation-disk-view.ts +390 -0
package/src/memory/conversation-key-store.ts +17 -5
package/src/memory/conversation-queries.ts +5 -1
package/src/memory/conversation-title-service.ts +21 -49
package/src/memory/db-init.ts +40 -0
package/src/memory/embedding-backend.ts +42 -53
package/src/memory/embedding-gemini.test.ts +4 -4
package/src/memory/embedding-local.ts +1 -3
package/src/memory/embedding-ollama.ts +1 -3
package/src/memory/embedding-openai.ts +1 -3
package/src/memory/indexer.ts +114 -21
package/src/memory/items-extractor.ts +42 -13
package/src/memory/job-handlers/conversation-starters.ts +6 -1
package/src/memory/job-handlers/embedding.test.ts +2 -4
package/src/memory/job-handlers/embedding.ts +83 -0
package/src/memory/job-utils.ts +1 -1
package/src/memory/jobs-store.ts +6 -0
package/src/memory/jobs-worker.ts +12 -0
package/src/memory/llm-request-log-store.ts +100 -1
package/src/memory/migrations/102-alter-table-columns.ts +5 -0
package/src/memory/migrations/146-schedule-oneshot-routing.ts +3 -3
package/src/memory/migrations/147-migrate-reminders-to-schedules.ts +66 -70
package/src/memory/migrations/148-drop-reminders-table.ts +5 -9
package/src/memory/migrations/160-drop-loopback-port-column.ts +1 -3
package/src/memory/migrations/174-rename-thread-starters-table.ts +0 -7
package/src/memory/migrations/178-oauth-providers-managed-service-config-key.ts +15 -0
package/src/memory/migrations/179-llm-request-log-message-id.ts +16 -0
package/src/memory/migrations/180-backfill-inline-attachments-to-disk.ts +66 -0
package/src/memory/migrations/181-rename-thread-starters-checkpoints.ts +46 -0
package/src/memory/migrations/182-oauth-providers-display-metadata.ts +20 -0
package/src/memory/migrations/183-add-conversation-fork-lineage.ts +22 -0
package/src/memory/migrations/184-llm-request-log-provider.ts +12 -0
package/src/memory/migrations/185-memory-brief-state.ts +52 -0
package/src/memory/migrations/186-memory-archive.ts +109 -0
package/src/memory/migrations/187-memory-reducer-checkpoints.ts +19 -0
package/src/memory/migrations/index.ts +10 -0
package/src/memory/migrations/registry.ts +13 -0
package/src/memory/qdrant-client.ts +23 -4
package/src/memory/reducer-store.ts +271 -0
package/src/memory/reducer-types.ts +99 -0
package/src/memory/reducer.ts +453 -0
package/src/memory/retriever.test.ts +601 -2
package/src/memory/retriever.ts +85 -9
package/src/memory/schema/conversations.ts +9 -0
package/src/memory/schema/index.ts +2 -0
package/src/memory/schema/infrastructure.ts +13 -7
package/src/memory/schema/memory-archive.ts +121 -0
package/src/memory/schema/memory-brief.ts +55 -0
package/src/memory/schema/oauth.ts +6 -0
package/src/memory/search/semantic.ts +17 -4
package/src/messaging/providers/gmail/mime-builder.ts +3 -1
package/src/notifications/copy-composer.ts +26 -0
package/src/notifications/decision-engine.ts +14 -1
package/src/notifications/emit-signal.ts +1 -1
package/src/notifications/signal.ts +36 -0
package/src/oauth/byo-connection.test.ts +1 -45
package/src/oauth/byo-connection.ts +2 -8
package/src/oauth/connect-orchestrator.ts +15 -11
package/src/oauth/connection-resolver.test.ts +191 -0
package/src/oauth/connection-resolver.ts +66 -38
package/src/oauth/connection.ts +0 -1
package/src/oauth/oauth-store.ts +99 -47
package/src/oauth/platform-connection.test.ts +0 -1
package/src/oauth/platform-connection.ts +11 -3
package/src/oauth/seed-providers.ts +78 -3
package/src/oauth/token-persistence.ts +16 -10
package/src/permissions/checker.ts +160 -14
package/src/permissions/defaults.ts +14 -0
package/src/prompts/templates/BOOTSTRAP.md +2 -0
package/src/providers/anthropic/client.ts +8 -1
package/src/providers/failover.ts +4 -1
package/src/providers/gemini/client.ts +50 -0
package/src/providers/model-catalog.ts +92 -0
package/src/providers/model-intents.ts +29 -20
package/src/providers/openai/client.ts +49 -0
package/src/providers/types.ts +2 -0
package/src/runtime/access-request-helper.ts +16 -7
package/src/runtime/auth/credential-service.ts +3 -1
package/src/runtime/auth/route-policy.ts +14 -1
package/src/runtime/btw-sidechain.ts +101 -0
package/src/runtime/channel-reply-delivery.ts +17 -1
package/src/runtime/http-router.ts +3 -1
package/src/runtime/http-server.ts +196 -141
package/src/runtime/http-types.ts +1 -0
package/src/runtime/migrations/vbundle-builder.ts +5 -1
package/src/runtime/routes/access-request-decision.ts +41 -0
package/src/runtime/routes/app-management-routes.ts +6 -3
package/src/runtime/routes/app-routes.ts +7 -3
package/src/runtime/routes/approval-routes.ts +1 -0
package/src/runtime/routes/approval-strategies/guardian-callback-strategy.ts +34 -2
package/src/runtime/routes/attachment-routes.ts +45 -15
package/src/runtime/routes/btw-routes.ts +21 -61
package/src/runtime/routes/conversation-management-routes.ts +74 -0
package/src/runtime/routes/conversation-query-routes.ts +187 -10
package/src/runtime/routes/conversation-routes.ts +269 -28
package/src/runtime/routes/conversation-starter-routes.ts +9 -11
package/src/runtime/routes/diagnostics-routes.ts +1 -0
package/src/runtime/routes/identity-routes.ts +2 -35
package/src/runtime/routes/inbound-stages/acl-enforcement.ts +2 -2
package/src/runtime/routes/llm-context-normalization.ts +1212 -0
package/src/runtime/routes/log-export-routes.ts +3 -0
package/src/runtime/routes/memory-item-routes.test.ts +34 -0
package/src/runtime/routes/memory-item-routes.ts +94 -5
package/src/runtime/routes/migration-routes.ts +4 -1
package/src/runtime/routes/oauth-apps.ts +291 -0
package/src/runtime/routes/secret-routes.ts +30 -1
package/src/runtime/routes/settings-routes.ts +14 -0
package/src/runtime/routes/surface-action-routes.ts +68 -1
package/src/runtime/routes/trace-event-routes.ts +4 -1
package/src/schedule/schedule-store.ts +30 -21
package/src/security/secure-keys.ts +21 -0
package/src/signals/bash.ts +1 -1
package/src/skills/inline-command-expansions.ts +204 -0
package/src/skills/inline-command-render.ts +127 -0
package/src/skills/inline-command-runner.ts +242 -0
package/src/skills/transitive-version-hash.ts +88 -0
package/src/swarm/backend-claude-code.ts +3 -6
package/src/tasks/task-store.ts +43 -1
package/src/telemetry/usage-telemetry-reporter.test.ts +3 -2
package/src/telemetry/usage-telemetry-reporter.ts +3 -1
package/src/tools/AGENTS.md +6 -10
package/src/tools/apps/executors.ts +17 -232
package/src/tools/claude-code/claude-code.ts +2 -3
package/src/tools/credentials/vault.ts +7 -12
package/src/tools/host-filesystem/read.ts +13 -10
package/src/tools/network/__tests__/web-search.test.ts +4 -2
package/src/tools/permission-checker.ts +8 -1
package/src/tools/schedule/list.ts +2 -7
package/src/tools/schema-transforms.ts +5 -0
package/src/tools/shared/filesystem/format-diff.ts +2 -7
package/src/tools/skills/execute.ts +1 -1
package/src/tools/skills/load.ts +140 -6
package/src/tools/tool-manifest.ts +0 -6
package/src/tools/ui-surface/definitions.ts +2 -2
package/src/util/device-id.ts +28 -5
package/src/util/platform.ts +24 -0
package/src/util/pricing.ts +1 -0
package/src/util/retry.ts +1 -3
package/src/workspace/migrations/003-seed-device-id.ts +3 -4
package/src/workspace/migrations/006-services-config.ts +5 -0
package/src/workspace/migrations/008-voice-timeout-and-max-steps.ts +12 -0
package/src/workspace/migrations/009-backfill-conversation-disk-view.ts +10 -0
package/src/workspace/migrations/010-app-dir-rename.ts +223 -0
package/src/workspace/migrations/{002-backfill-installation-id.ts → 011-backfill-installation-id.ts} +24 -13
package/src/workspace/migrations/012-rename-conversation-disk-view-dirs.ts +64 -0
package/src/workspace/migrations/013-repair-conversation-disk-view.ts +11 -0
package/src/workspace/migrations/rebuild-conversation-disk-view.ts +186 -0
package/src/workspace/migrations/registry.ts +11 -1
package/src/workspace/top-level-renderer.ts +12 -0
package/src/__tests__/asset-materialize-tool.test.ts +0 -523
package/src/__tests__/asset-search-tool.test.ts +0 -536
package/src/__tests__/fixtures/media-reuse-fixtures.ts +0 -56
package/src/__tests__/media-reuse-story.e2e.test.ts +0 -762
package/src/__tests__/media-visibility-policy.test.ts +0 -190
package/src/config/bundled-skills/app-builder/tools/app-file-edit.ts +0 -14
package/src/config/bundled-skills/app-builder/tools/app-file-list.ts +0 -13
package/src/config/bundled-skills/app-builder/tools/app-file-read.ts +0 -21
package/src/config/bundled-skills/app-builder/tools/app-file-write.ts +0 -14
package/src/config/bundled-skills/app-builder/tools/app-list.ts +0 -13
package/src/config/bundled-skills/app-builder/tools/app-update.ts +0 -23
package/src/daemon/media-visibility-policy.ts +0 -59
package/src/tools/assets/materialize.ts +0 -248
package/src/tools/assets/search.ts +0 -400

package/ARCHITECTURE.md CHANGED Viewed

@@ -783,26 +783,26 @@ All client-server communication uses HTTP for request/response operations and Se
 The daemon emits two distinct error message types via SSE:
-| Message type         | Scope               | Purpose                                                                                                        | Payload                                                                       |
-| -------------------- | ------------------- | -------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------- |
-| `conversation_error` | Conversation-scoped | Typed, actionable failures during conversation runtime (e.g., provider network error, rate limit, API failure) | `sessionId`, `code` (typed enum), `userMessage`, `retryable`, `debugDetails?` |
-| `error`              | Global              | Generic, non-session failures (e.g., daemon startup errors, unknown message types)                             | `message` (string)                                                            |
-**Design rationale:** `conversation_error` carries structured metadata (error code, retryable flag, debug details) so the client can present actionable UI — a toast with retry/dismiss buttons — rather than a generic error banner. The older `error` type is retained for backward compatibility with non-session contexts.
-### Session Error Codes
-| Code                        | Meaning                                                                 | Retryable |
-| --------------------------- | ----------------------------------------------------------------------- | --------- |
-| `PROVIDER_NETWORK`          | Unable to reach the LLM provider (connection refused, timeout, DNS)     | Yes       |
-| `PROVIDER_RATE_LIMIT`       | LLM provider rate-limited the request (HTTP 429)                        | Yes       |
-| `PROVIDER_API`              | Provider returned a server error (5xx) or retryable 4xx                 | Yes       |
-| `PROVIDER_BILLING`          | Invalid/expired API key or insufficient credits (HTTP 401, billing 4xx) | No        |
-| `CONTEXT_TOO_LARGE`         | Request exceeds the model's context window (HTTP 413, token limit)      | No        |
-| `SESSION_ABORTED`           | Non-user abort interrupted the request                                  | Yes       |
-| `SESSION_PROCESSING_FAILED` | Catch-all for unexpected processing failures                            | No        |
-| `REGENERATE_FAILED`         | Failed to regenerate a previous response                                | Yes       |
-| `UNKNOWN`                   | Unrecognized error that does not match any specific category            | No        |
+| Message type         | Scope               | Purpose                                                                                                        | Payload                                                                            |
+| -------------------- | ------------------- | -------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------- |
+| `conversation_error` | Conversation-scoped | Typed, actionable failures during conversation runtime (e.g., provider network error, rate limit, API failure) | `conversationId`, `code` (typed enum), `userMessage`, `retryable`, `debugDetails?` |
+| `error`              | Global              | Generic, non-conversation failures (e.g., daemon startup errors, unknown message types)                        | `message` (string)                                                                 |
+**Design rationale:** `conversation_error` carries structured metadata (error code, retryable flag, debug details) so the client can present actionable UI — a toast with retry/dismiss buttons — rather than a generic error banner. The older `error` type is retained for backward compatibility with non-conversation contexts.
+### Conversation Error Codes
+| Code                             | Meaning                                                                 | Retryable |
+| -------------------------------- | ----------------------------------------------------------------------- | --------- |
+| `PROVIDER_NETWORK`               | Unable to reach the LLM provider (connection refused, timeout, DNS)     | Yes       |
+| `PROVIDER_RATE_LIMIT`            | LLM provider rate-limited the request (HTTP 429)                        | Yes       |
+| `PROVIDER_API`                   | Provider returned a server error (5xx) or retryable 4xx                 | Yes       |
+| `PROVIDER_BILLING`               | Invalid/expired API key or insufficient credits (HTTP 401, billing 4xx) | No        |
+| `CONTEXT_TOO_LARGE`              | Request exceeds the model's context window (HTTP 413, token limit)      | No        |
+| `CONVERSATION_ABORTED`           | Non-user abort interrupted the request                                  | Yes       |
+| `CONVERSATION_PROCESSING_FAILED` | Catch-all for unexpected processing failures                            | No        |
+| `REGENERATE_FAILED`              | Failed to regenerate a previous response                                | Yes       |
+| `UNKNOWN`                        | Unrecognized error that does not match any specific category            | No        |
 ### Error Classification
@@ -826,7 +826,7 @@ sequenceDiagram
     Note over Daemon: LLM call fails or<br/>processing error occurs
     Daemon->>Daemon: classifyConversationError(error, ctx)
-    Daemon->>DC: conversation_error {sessionId, code,<br/>userMessage, retryable, debugDetails?}
+    Daemon->>DC: conversation_error {conversationId, code,<br/>userMessage, retryable, debugDetails?}
     DC->>DC: broadcast to all subscribers
     DC->>VM: subscribe() stream delivers message
     VM->>VM: set conversationError property<br/>clear isThinking / isCancelling
@@ -837,7 +837,7 @@ sequenceDiagram
     alt User taps Retry (retryable == true)
         UI->>VM: retryAfterConversationError()
         VM->>VM: dismissConversationError()<br/>+ regenerateLastMessage()
-        VM->>DC: regenerate {sessionId}
+        VM->>DC: regenerate {conversationId}
         DC->>Daemon: HTTP POST /v1/messages
     else User taps Dismiss
         UI->>VM: dismissConversationError()
@@ -939,7 +939,7 @@ graph TB
     end
     SUBMIT --> SLASH_CHECK
-    SLASH_CHECK -->|"Yes (/model, /status, etc.)"| QA_ROUTE
+    SLASH_CHECK -->|"Yes (/models, /status, etc.)"| QA_ROUTE
     SLASH_CHECK -->|"No"| VOICE_CHECK
     VOICE_CHECK -->|"Yes"| QA_ROUTE
     VOICE_CHECK -->|"No"| CLASSIFIER
@@ -1017,12 +1017,12 @@ graph TB
     INPUT --> RESOLVE
     RESOLVE -->|"kind: passthrough"| PASSTHROUGH
-    RESOLVE -->|"kind: unknown<br/>(/model, /status, /commands, /pair,<br/>/models, provider shortcuts)"| HANDLED
+    RESOLVE -->|"kind: unknown<br/>(/models, /status, /commands, /pair)"| HANDLED
 ```
 Key behaviors:
-- **Built-in commands**: `/model`, `/models`, `/status`, `/commands`, `/pair`, and provider shortcuts (`/opus`, `/sonnet`, `/gpt4`, etc.) are handled directly by `resolveSlash()`. A deterministic `assistant_text_delta` + `message_complete` is emitted. No message persistence or model call occurs.
+- **Built-in commands**: `/models`, `/status`, `/commands`, and `/pair` are handled directly by `resolveSlash()`. A deterministic `assistant_text_delta` + `message_complete` is emitted. No message persistence or model call occurs.
 - **Passthrough**: Any input that does not match a built-in command passes through to the normal agent loop, including slash-like tokens that are not recognized.
 - **Queue**: Queued messages receive the same slash resolution.
@@ -1183,7 +1183,7 @@ The following capabilities ship as bundled skills in `assistant/src/config/bundl
 | `claude-code`   | Claude Code tool                                                                                                                                                                                                                                                  | Delegate coding tasks to Claude Code subprocess                                                                                                                                                                                                                                                      |
 | `computer-use`  | `computer_use_observe`, `computer_use_click`, `computer_use_type_text`, `computer_use_key`, `computer_use_scroll`, `computer_use_drag`, `computer_use_wait`, `computer_use_open_app`, `computer_use_run_applescript`, `computer_use_done`, `computer_use_respond` | Computer-use proxy tools — preactivated via `preactivatedSkillIds` in desktop sessions. Each tool forwards actions to the connected macOS client via `HostCuProxy`, which handles request/resolve proxying, step counting, loop detection, and observation formatting within the unified agent loop. |
 | `weather`       | `get-weather`                                                                                                                                                                                                                                                     | Fetch current weather data                                                                                                                                                                                                                                                                           |
-| `app-builder`   | `app_create`, `app_list`, `app_query`, `app_update`, `app_delete`, `app_file_list`, `app_file_read`, `app_file_edit`, `app_file_write`                                                                                                                            | Dynamic app authoring — CRUD and file-level editing for persistent apps (activated via `skill_load app-builder`; `app_open` remains a core proxy tool)                                                                                                                                               |
+| `app-builder`   | `app_create`, `app_delete`, `app_refresh`, `app_generate_icon`                                                                                                                                                                                                    | Dynamic app authoring — create and manage persistent apps; file editing uses generic file tools plus `app_refresh` (activated via `skill_load app-builder`; `app_open` remains a core proxy tool)                                                                                                    |
 | `self-upgrade`  | (instruction-only)                                                                                                                                                                                                                                                | Self-improvement workflow                                                                                                                                                                                                                                                                            |
 | `start-the-day` | (instruction-only)                                                                                                                                                                                                                                                | Morning briefing routine                                                                                                                                                                                                                                                                             |
@@ -1261,6 +1261,115 @@ graph TB
     TRUST -->|"Deny rule matches"| DENY["Blocked"]
 ```
+### Inline Skill Command Expansion
+Skills can embed dynamic shell output in their SKILL.md body using `!`command``tokens. When`skill_load` processes a skill containing these tokens, the commands are executed at load time through a sandboxed runner and their output is substituted inline. This enables externally authored skills to include project-specific context (e.g., directory listings, config values) without requiring manual edits.
+**Feature flag:** `feature_flags.inline-skill-commands.enabled` (default: enabled). When disabled, loading a skill that contains `!`command`` tokens fails closed with an error rather than leaving raw tokens in the prompt.
+#### Syntax and Parsing
+The `!`command``syntax is parsed by`parseInlineCommandExpansions()` from the SKILL.md body after frontmatter extraction. The parser:
+- Extracts all `!`command`` tokens outside fenced code blocks (documentation examples in fenced blocks are ignored)
+- Assigns each token a stable `placeholderId` (0-indexed encounter order)
+- Rejects malformed tokens fail-closed: empty commands, nested backticks, and unmatched opening backticks produce `InlineCommandExpansionError` entries rather than best-effort expansions
+#### Transitive Version Hash
+When a skill contains inline command expansions, the permission system computes a **transitive version hash** (`tv1:<sha256>`) that covers the root skill and all its included children (DFS pre-order). The hash folds:
+1. Each visited skill ID (graph structure)
+2. Each visited skill's directory content hash (file changes)
+Editing any file in the root skill or any included child invalidates the transitive hash, which forces re-approval. The hash is computed by `computeTransitiveSkillVersionHash()` and fails closed (`TransitiveHashError`) on missing children or cycles in the include graph.
+#### Permission Gating (`skill_load_dynamic:*`)
+Skills containing inline command expansions use a separate permission candidate namespace (`skill_load_dynamic:*`) instead of the normal `skill_load:*` namespace. This prevents them from falling through to the permissive default `skill_load:*` allow rule. The permission checker emits candidates in specificity order:
+1. `skill_load_dynamic:<skill-id>@<transitive-hash>` — version-pinned approval (most specific)
+2. `skill_load_dynamic:<skill-id>` — any-version approval
+A default ask rule at priority 200 (`default:ask-skill_load_dynamic-global`) catches these candidates, ensuring the guardian is always prompted before inline commands execute. The user can create a pinned trust rule for a specific transitive hash to auto-approve known-good versions. Non-interactive sessions (no human present) deny dynamic skill loads rather than silently auto-approving.
+```mermaid
+graph TB
+    LOAD["skill_load(selector)"] --> PARSE["Parse SKILL.md body"]
+    PARSE --> CHECK{"Has !\x60command\x60<br/>tokens?"}
+    CHECK -->|"No"| NORMAL["Normal skill_load:* candidate<br/>(auto-allowed)"]
+    CHECK -->|"Yes"| FLAG{"inline-skill-commands<br/>flag enabled?"}
+    FLAG -->|"No"| FAIL_FLAG["Fail closed:<br/>error returned"]
+    FLAG -->|"Yes"| SOURCE{"Eligible source?<br/>(bundled/managed/workspace)"}
+    SOURCE -->|"No (extra)"| FAIL_SOURCE["Fail closed:<br/>source not eligible"]
+    SOURCE -->|"Yes"| HASH["Compute transitive hash"]
+    HASH --> DYN["skill_load_dynamic:id@hash<br/>candidate emitted"]
+    DYN --> PERM["PermissionChecker"]
+    PERM --> RULE{"Trust rule?"}
+    RULE -->|"Pinned allow"| RENDER["Execute + render"]
+    RULE -->|"No rule"| PROMPT["Prompt guardian"]
+    RULE -->|"Deny"| DENY["Blocked"]
+```
+#### Sandbox-Only Execution
+Inline commands are executed through `runInlineCommand()`, a purpose-built sandbox runner with strict security constraints:
+- **Sandbox enforced**: The sandbox is always enabled with `networkMode: "off"` — no outbound network connections
+- **Sanitized environment**: Uses `buildSanitizedEnv()` — no API keys, tokens, credentials, gateway URLs, or workspace paths in the environment
+- **No host fallback**: Unlike the general `bash` tool, there is no fallback to host execution when the sandbox is unavailable
+- **No credential proxy**: No CES client, no credential materialization
+- **Timeout**: 10-second wall-clock limit (killed with SIGKILL on timeout)
+- **Output cap**: 20,000 characters maximum (truncated with `[output truncated]` marker)
+- **Binary rejection**: Output with >10% non-printable characters (after ANSI stripping) is rejected
+- **Stdout only**: stderr is discarded; ANSI escape sequences are stripped from stdout
+The runner returns a deterministic `InlineCommandResult` with machine-readable failure reasons (`timeout`, `non_zero_exit`, `binary_output`, `spawn_failure`) — raw stderr is never surfaced.
+#### Rendering Flow
+The `renderInlineCommands()` function processes expansions sequentially (not in parallel) to maintain deterministic order. Each `!`command`` token is replaced with an XML-wrapped result:
+- **Success**: `<inline_skill_command index="N">...output...</inline_skill_command>`
+- **Failure**: `<inline_skill_command index="N">[inline command unavailable: <reason>]</inline_skill_command>`
+Rendering applies at two levels during `skill_load`:
+1. **Root skill**: If the loaded skill has inline expansions, they are rendered before the skill body is emitted. A root skill with inline commands that fail the feature-flag or source-eligibility check returns an error (fail closed, no `<loaded_skill>` marker).
+2. **Included children**: Each included child skill's body is rendered independently. A render failure in one child does not prevent sibling rendering — the failed child's body falls back to raw (unexpanded) text with a warning log.
+#### v1 Source Restriction
+In the initial release, only skills from **bundled**, **managed**, and **workspace** sources are eligible for inline command expansion. Skills from **extra** (third-party) roots are explicitly rejected with an error message. The `INLINE_COMMAND_ELIGIBLE_SOURCES` set in `load.ts` enforces this restriction. Unknown or future source types also fail closed.
+#### Fail-Closed Behavior Summary
+Every layer in the pipeline defaults to rejection rather than silent degradation:
+| Layer            | Failure mode                                         | Behavior                                               |
+| ---------------- | ---------------------------------------------------- | ------------------------------------------------------ |
+| Parser           | Malformed token (empty, nested backtick, unmatched)  | Logged as error, not expanded                          |
+| Feature flag     | Flag disabled                                        | `skill_load` returns error, no `<loaded_skill>` marker |
+| Source check     | `extra` or unknown source                            | `skill_load` returns error, no `<loaded_skill>` marker |
+| Transitive hash  | Missing child or cycle in include graph              | `TransitiveHashError` thrown, permission check fails   |
+| Permission       | No trust rule and non-interactive                    | Denied (never silently auto-approved)                  |
+| Sandbox runner   | Timeout, non-zero exit, binary output, spawn failure | Deterministic stub rendered, no raw stderr             |
+| Renderer (root)  | Feature flag off or ineligible source                | Error returned from `skill_load`                       |
+| Renderer (child) | Exception during render                              | Raw body used, sibling rendering continues             |
+#### Key Source Files
+| File                                                | Role                                                                             |
+| --------------------------------------------------- | -------------------------------------------------------------------------------- |
+| `assistant/src/skills/inline-command-expansions.ts` | `parseInlineCommandExpansions()` — parser for `!`command`` tokens                |
+| `assistant/src/skills/inline-command-runner.ts`     | `runInlineCommand()` — sandbox-only command executor                             |
+| `assistant/src/skills/inline-command-render.ts`     | `renderInlineCommands()` — token replacement and XML wrapping                    |
+| `assistant/src/skills/transitive-version-hash.ts`   | `computeTransitiveSkillVersionHash()` — hash covering root + included children   |
+| `assistant/src/tools/skills/load.ts`                | `skill_load` execute path — feature flag check, source check, render integration |
+| `assistant/src/permissions/checker.ts`              | `skill_load_dynamic:*` candidate emission and allowlist options                  |
+| `assistant/src/permissions/defaults.ts`             | `default:ask-skill_load_dynamic-global` rule (priority 200)                      |
+| `meta/feature-flags/feature-flag-registry.json`     | `inline-skill-commands` flag definition                                          |
 ### Key Source Files
 | File                                                | Role                                                                                       |
@@ -1525,7 +1634,7 @@ sequenceDiagram
 ### Key design decisions
-- **Recursion guard**: A module-level `Set<sessionId>` prevents concurrent swarms within the same session while allowing independent sessions to run their own swarms in parallel.
+- **Recursion guard**: A module-level `Set<conversationId>` prevents concurrent swarms within the same conversation while allowing independent conversations to run their own swarms in parallel.
 - **Abort signal**: The tool checks `context.signal?.aborted` before planning and before execution. The signal is also forwarded into `executeSwarm` and the worker backend, enabling cooperative cancellation of in-flight workers.
 - **DAG scheduling**: Tasks with dependencies are topologically ordered. Independent tasks run in parallel up to `maxWorkers`.
 - **Per-task retries**: Failed tasks retry up to `maxRetriesPerTask` before being marked failed. Dependents are transitively blocked.
@@ -1568,7 +1677,7 @@ sequenceDiagram
     Note over Daemon: Processing previous request...<br/>Reaches safe tool-loop checkpoint
-    Daemon-->>DC: generation_handoff (sessionId, queuedCount)
+    Daemon-->>DC: generation_handoff (conversationId, queuedCount)
     Note over Daemon: Daemon yields current generation
     Daemon-->>DC: message_dequeued
@@ -1585,7 +1694,7 @@ sequenceDiagram
 ## Trace System — Debug Panel Data Flow
-The trace system provides real-time observability of daemon session internals. Each session creates a `TraceEmitter` that emits structured `trace_event` SSE events as the session processes requests, makes LLM calls, and executes tools.
+The trace system provides real-time observability of daemon conversation internals. Each conversation creates a `TraceEmitter` that emits structured `trace_event` SSE events as the conversation processes requests, makes LLM calls, and executes tools.
 ```mermaid
 sequenceDiagram
@@ -1636,41 +1745,41 @@ sequenceDiagram
     TE-->>DC: trace_event (message_complete)
     DC-->>TS: ingest()
-    Note over TS: Events deduplicated by eventId,<br/>ordered by sequence + timestampMs,<br/>grouped by session and requestId,<br/>capped at 5000 per session
+    Note over TS: Events deduplicated by eventId,<br/>ordered by sequence + timestampMs,<br/>grouped by conversation and requestId,<br/>capped at 5000 per conversation
-    TS-->>DP: @Published eventsBySession
+    TS-->>DP: @Published eventsByConversation
     Note over DP: Metrics strip: requests, LLM calls,<br/>tokens (in/out), avg latency, failures<br/>Timeline: events grouped by requestId
 ```
 ### Trace Event Kinds
-Events emitted during a session lifecycle:
-| Kind                        | Emitted by         | When                                                                                            |
-| --------------------------- | ------------------ | ----------------------------------------------------------------------------------------------- |
-| `request_received`          | Handlers / Session | User message or surface action arrives                                                          |
-| `request_queued`            | Handlers / Session | Message queued while session is busy                                                            |
-| `request_dequeued`          | Session            | Queued message begins processing                                                                |
-| `llm_call_started`          | Session            | LLM API call initiated                                                                          |
-| `llm_call_finished`         | Session            | LLM API call completed (carries `inputTokens`, `outputTokens`, `latencyMs`)                     |
-| `assistant_message`         | Session            | Assistant response assembled (carries `toolUseCount`)                                           |
-| `tool_started`              | ToolTraceListener  | Tool execution begins                                                                           |
-| `tool_permission_requested` | ToolTraceListener  | Permission check needed (carries `riskLevel`)                                                   |
-| `tool_permission_decided`   | ToolTraceListener  | Permission granted or denied (carries `decision`)                                               |
-| `tool_finished`             | ToolTraceListener  | Tool execution completed (carries `durationMs`)                                                 |
-| `tool_failed`               | ToolTraceListener  | Tool execution failed (carries `durationMs`)                                                    |
-| `secret_detected`           | ToolTraceListener  | Secret found in tool output                                                                     |
-| `generation_handoff`        | Session            | Yielding to next queued message                                                                 |
-| `message_complete`          | Session            | Full request processing finished                                                                |
-| `generation_cancelled`      | Session            | User cancelled the generation                                                                   |
-| `request_error`             | Handlers / Session | Unrecoverable error during processing (includes queue-full rejection and persist-failure paths) |
+Events emitted during a conversation lifecycle:
+| Kind                        | Emitted by              | When                                                                                            |
+| --------------------------- | ----------------------- | ----------------------------------------------------------------------------------------------- |
+| `request_received`          | Handlers / Conversation | User message or surface action arrives                                                          |
+| `request_queued`            | Handlers / Conversation | Message queued while conversation is busy                                                       |
+| `request_dequeued`          | Conversation            | Queued message begins processing                                                                |
+| `llm_call_started`          | Conversation            | LLM API call initiated                                                                          |
+| `llm_call_finished`         | Conversation            | LLM API call completed (carries `inputTokens`, `outputTokens`, `latencyMs`)                     |
+| `assistant_message`         | Conversation            | Assistant response assembled (carries `toolUseCount`)                                           |
+| `tool_started`              | ToolTraceListener       | Tool execution begins                                                                           |
+| `tool_permission_requested` | ToolTraceListener       | Permission check needed (carries `riskLevel`)                                                   |
+| `tool_permission_decided`   | ToolTraceListener       | Permission granted or denied (carries `decision`)                                               |
+| `tool_finished`             | ToolTraceListener       | Tool execution completed (carries `durationMs`)                                                 |
+| `tool_failed`               | ToolTraceListener       | Tool execution failed (carries `durationMs`)                                                    |
+| `secret_detected`           | ToolTraceListener       | Secret found in tool output                                                                     |
+| `generation_handoff`        | Conversation            | Yielding to next queued message                                                                 |
+| `message_complete`          | Conversation            | Full request processing finished                                                                |
+| `generation_cancelled`      | Conversation            | User cancelled the generation                                                                   |
+| `request_error`             | Handlers / Conversation | Unrecoverable error during processing (includes queue-full rejection and persist-failure paths) |
 ### Architecture
-- **TraceEmitter** (daemon, per-session): Constructed with a `sessionId` and a `sendToClient` callback. Maintains a monotonic sequence counter for stable ordering. Truncates summaries to 200 chars and attribute values to 500 chars. Each call to `emit()` sends a `trace_event` SSE event to connected clients.
-- **ToolTraceListener** (daemon): Subscribes to the session's `EventBus` via `onAny()` and translates tool domain events (`tool.execution.started`, `tool.execution.finished`, `tool.execution.failed`, `tool.permission.requested`, `tool.permission.decided`, `tool.secret.detected`) into trace events through the `TraceEmitter`.
+- **TraceEmitter** (daemon, per-conversation): Constructed with a `conversationId` and a `sendToClient` callback. Maintains a monotonic sequence counter for stable ordering. Truncates summaries to 200 chars and attribute values to 500 chars. Each call to `emit()` sends a `trace_event` SSE event to connected clients.
+- **ToolTraceListener** (daemon): Subscribes to the conversation's `EventBus` via `onAny()` and translates tool domain events (`tool.execution.started`, `tool.execution.finished`, `tool.execution.failed`, `tool.permission.requested`, `tool.permission.decided`, `tool.secret.detected`) into trace events through the `TraceEmitter`.
 - **DaemonClient** (Swift, shared): Decodes `trace_event` SSE events into `TraceEventMessage` structs and invokes the `onTraceEvent` callback.
-- **TraceStore** (Swift, macOS): `@MainActor ObservableObject` that ingests `TraceEventMessage` structs. Deduplicates by `eventId`, maintains stable sort order (sequence, then timestampMs, then insertion order), groups events by session and requestId, and enforces a retention cap of 5,000 events per session. Each request group is classified with a terminal status: `completed` (via `message_complete`), `cancelled` (via `generation_cancelled`), `handedOff` (via `generation_handoff`), `error` (via `request_error` or any event with `status == "error"`), or `active` (no terminal event yet).
+- **TraceStore** (Swift, macOS): `@MainActor ObservableObject` that ingests `TraceEventMessage` structs. Deduplicates by `eventId`, maintains stable sort order (sequence, then timestampMs, then insertion order), groups events by conversation and requestId, and enforces a retention cap of 5,000 events per conversation. Each request group is classified with a terminal status: `completed` (via `message_complete`), `cancelled` (via `generation_cancelled`), `handedOff` (via `generation_handoff`), `error` (via `request_error` or any event with `status == "error"`), or `active` (no terminal event yet).
 - **DebugPanel** (Swift, macOS): SwiftUI view that observes `TraceStore`. Displays a metrics strip (request count, LLM calls, total tokens, average latency, tool failures) and a `TraceTimelineView` showing events grouped by requestId with color-coded status indicators. The timeline auto-scrolls to new events while the user is at the bottom; scrolling up pauses auto-scroll and shows a "Jump to bottom" button that resumes it.
 ---

package/docs/architecture/integrations.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Integrations Architecture
-OAuth, messaging adapters, script proxy, and asset-tool architecture.
+OAuth, messaging adapters, script proxy, and conversation disk view architecture.
 ## Integrations — OAuth2 + Unified Messaging
@@ -514,90 +514,85 @@ The proxy subsystem is fully wired, including credential injection. The session
 ---
-## Asset Search and Materialize — Cross-Conversation Media Reuse
+## Conversation Disk View — Filesystem-Based Conversation Access
-The `asset_search` and `asset_materialize` tools enable the assistant to discover and use previously uploaded media assets (images, documents, audio) across conversations. Assets are stored as base64-encoded blobs in the `attachments` table and linked to messages via the `message_attachments` join table.
+The conversation disk view projects conversation metadata, messages, and attachments to a browsable filesystem layout under `~/.vellum/workspace/conversations/`. This enables the assistant to search, read, and manipulate conversation data (including media attachments) using standard file tools (`read_file`, `glob`, `grep`) rather than dedicated asset search tools.
-### Asset Discovery and Materialization Flow
+### Directory Layout
-```mermaid
-sequenceDiagram
-    participant Model as LLM
-    participant Search as asset_search tool
-    participant DB as SQLite (attachments)
-    participant Visibility as media-visibility-policy
-    participant Materialize as asset_materialize tool
-    participant Sandbox as Sandbox filesystem
-    Model->>Search: search(mime_type: "image/*", recency: "last_7_days")
-    Search->>DB: query attachments (filters)
-    DB-->>Search: matching rows (metadata only, no base64)
-    Search->>Visibility: filterVisibleAttachments(results, currentContext)
-    Note over Visibility: Private-conversation attachments filtered out<br/>unless viewer is in the same conversation
-    Visibility-->>Search: visible results
-    Search-->>Model: metadata list (IDs, filenames, types, sizes)
-    Model->>Materialize: materialize(attachment_id, destination_path)
-    Materialize->>Materialize: sandboxPolicy(destination_path)
-    Materialize->>DB: load attachment (including base64 data)
-    Materialize->>Visibility: isAttachmentVisible(attachmentCtx, currentCtx)
-    Note over Visibility: Second visibility check at materialize time<br/>prevents TOCTOU between search and materialize
-    Materialize->>Materialize: size check (max 100 MB)
-    Materialize->>Sandbox: write decoded bytes to destination
-    Materialize-->>Model: "Materialized 'photo.jpg' to /workspace/media/photo.jpg"
-```
+Each conversation is projected to a directory named `{isoDate}_{id}`:
-### Private Conversation Visibility Gate
+```
+~/.vellum/workspace/conversations/
+  2025-01-15T10-30-00.000Z_abc123/
+    meta.json             # Conversation metadata (id, title, type, channel, timestamps)
+    messages.jsonl        # Flattened message log (one JSON object per line)
+    attachments/          # Decoded attachment files (original filenames, collision-safe)
+      photo.png
+      document.pdf
+```
-Attachments from private conversations are only visible to the same private conversation. Standard-conversation attachments are visible everywhere. The policy is enforced at both the search and materialize stages to prevent cross-conversation data leakage.
+### Write-Through Sync
-```mermaid
-graph TB
-    subgraph "Visibility Rules"
-        ATT_STD["Attachment from<br/>standard conversation"]
-        ATT_PVT["Attachment from<br/>private conversation"]
+The disk view is updated at the daemon level, not automatically by the DB CRUD layer. Conversation creation, metadata updates, and deletion are synced from `conversation-crud.ts`, but message sync (`syncMessageToDisk`) is only called from daemon-level code paths (e.g. `conversation-messaging.ts`) — not from the CRUD `addMessage()` function. This means `messages.jsonl` reflects messages processed through the daemon's messaging pipeline, not every message write. All disk writes are best-effort; failures are logged but never thrown, so the disk view cannot break DB operations.
-        VIEWER_ANY["Any conversation<br/>(standard or private)"]
-        VIEWER_SAME["Same private conversation<br/>(matching conversationId)"]
-        VIEWER_OTHER["Different private conversation<br/>or standard conversation"]
-    end
+> **Privacy note:** Conversation disk-view files live under `~/.vellum/workspace/conversations/` and are **excluded** from diagnostic log exports ("Send logs to Vellum") via the `WORKSPACE_SKIP_DIRS` filter in `log-export-routes.ts`. However, the SQLite database (`assistant.db`) is included in exports as a SQL dump, and it contains conversation messages and attachment data in its tables. The disk-view exclusion prevents the raw conversation files and decoded attachments from being exported, but conversation content stored in the database may still be present in the export.
-    ATT_STD -->|"always visible"| VIEWER_ANY
-    ATT_PVT -->|"visible"| VIEWER_SAME
-    ATT_PVT -->|"hidden"| VIEWER_OTHER
+```mermaid
+sequenceDiagram
+    participant CRUD as conversation-crud.ts
+    participant Daemon as conversation-messaging.ts
+    participant DiskView as conversation-disk-view.ts
+    participant FS as Filesystem
+    Note over CRUD,FS: Conversation creation (CRUD layer)
+    CRUD->>CRUD: INSERT conversation row
+    CRUD->>DiskView: initConversationDir(conv)
+    DiskView->>FS: mkdir + write meta.json
+    Note over Daemon,FS: Message insertion (daemon layer)
+    Daemon->>CRUD: addMessage(convId, role, content)
+    CRUD->>CRUD: INSERT message row
+    Daemon->>DiskView: syncMessageToDisk(convId, msgId, createdAtMs)
+    DiskView->>DiskView: flattenContentBlocks(content)
+    DiskView->>FS: append JSONL record to messages.jsonl
+    DiskView->>FS: reference files already materialized in attachments/
+    Note over CRUD,FS: Conversation update (CRUD layer)
+    CRUD->>CRUD: UPDATE conversation row
+    CRUD->>DiskView: updateMetaFile(conv)
+    DiskView->>FS: rewrite meta.json
+    Note over CRUD,FS: Conversation deletion (CRUD layer)
+    CRUD->>CRUD: DELETE conversation row
+    CRUD->>DiskView: removeConversationDir(id, createdAtMs)
+    DiskView->>FS: rm -rf conversation directory
 ```
-**Source conversation lookup**: The `getAttachmentSourceConversations()` function traces an attachment's lineage through `message_attachments` -> `messages` -> `conversations` to determine which conversations it belongs to and whether any of them are private.
+### Content Flattening
-**Mixed-source attachments**: If an attachment is linked to messages in both standard and private conversations (e.g., the user shared the same file in two conversations), the attachment is treated as globally visible because at least one source is non-private.
+Message content (stored as JSON `ContentBlock[]` in the DB) is flattened for the JSONL log:
-**Orphan attachments**: Attachments with no message linkage (orphans) are treated as universally visible rather than hidden, since they have no private-conversation provenance.
+- **Text blocks** are concatenated into a single `content` string.
+- **Tool use blocks** are extracted into a `toolCalls` array (`{ name, input }`).
+- **Tool result blocks** are extracted into a `toolResults` array.
+- **Image/file blocks** are skipped — they are represented via the `attachments/` subdirectory instead.
-### Search Capabilities
+### Attachment Projection
-| Parameter         | Type   | Description                                                                                    |
-| ----------------- | ------ | ---------------------------------------------------------------------------------------------- |
-| `mime_type`       | string | MIME type filter with wildcard support (`image/*`, `application/pdf`)                          |
-| `filename`        | string | Case-insensitive substring match on original filename                                          |
-| `recency`         | enum   | Time-based filter: `last_hour`, `last_24_hours`, `last_7_days`, `last_30_days`, `last_90_days` |
-| `conversation_id` | string | Scope results to attachments in a specific conversation                                        |
-| `limit`           | number | Maximum results (default 20, max 100)                                                          |
+Attachments are materialized into `conversations/<conversation>/attachments/` as soon as they are linked to a message. During disk-view sync, the JSONL record reuses those filenames directly and only falls back to materializing legacy rows that have not been projected yet. Filename collisions are still resolved by appending a numeric suffix (e.g., `photo-2.png`, `photo-3.png`).
-### Materialize Safeguards
+### Backfill Migration
-- **Sandbox path enforcement**: Destination path must resolve inside the sandbox working directory
-- **Size limit**: 100 MB ceiling prevents materializing excessively large attachments
-- **Double visibility check**: Both `asset_search` and `asset_materialize` independently verify visibility, preventing TOCTOU races between search and use
-- **Risk level**: Both tools are `RiskLevel.Low` since they read existing data and write only within the sandbox
+Existing conversations created before the disk view was introduced are backfilled by workspace migration `009-backfill-conversation-disk-view`, which replays all conversations and their messages through the disk-view sync functions.
 ### Key Source Files
-| File                                              | Role                                                                                          |
-| ------------------------------------------------- | --------------------------------------------------------------------------------------------- |
-| `assistant/src/tools/assets/search.ts`            | `asset_search` tool — cross-conversation attachment metadata search with visibility filtering |
-| `assistant/src/tools/assets/materialize.ts`       | `asset_materialize` tool — decode and write attachment to sandbox path                        |
-| `assistant/src/daemon/media-visibility-policy.ts` | Pure policy module — `isAttachmentVisible()`, `filterVisibleAttachments()`                    |
-| `assistant/src/memory/schema.ts`                  | `attachments` and `message_attachments` table schemas                                         |
-| `assistant/src/memory/conversation-crud.ts`       | `getConversationType()` — conversation type lookup for visibility context                     |
+| File                                                                        | Role                                                                                  |
+| --------------------------------------------------------------------------- | ------------------------------------------------------------------------------------- |
+| `assistant/src/memory/conversation-disk-view.ts`                            | Disk view module — init, update, sync, remove, content flattening                     |
+| `assistant/src/memory/conversation-crud.ts`                                 | DB CRUD layer — calls init, update, and remove disk-view functions (not message sync) |
+| `assistant/src/daemon/conversation-messaging.ts`                            | Daemon messaging pipeline — calls `syncMessageToDisk` after message insertion         |
+| `assistant/src/workspace/migrations/009-backfill-conversation-disk-view.ts` | Backfill migration for pre-existing conversations                                     |
 ---

package/docs/credential-execution-service.md CHANGED Viewed

@@ -46,7 +46,7 @@ CES exposes exactly three tools to the assistant, registered as a **deliberate e
 ### Tool registration
-CES tools use the standard `class ... implements Tool` registration pattern. This is explicitly approved as a deliberate exception to the no-new-tools policy because:
+CES tools use the standard `class ... implements Tool` registration pattern. These are justified exceptions to the general preference for skills because:
 - The security boundary requires that credential materialization happens in a separate process
 - Skill scripts run inside the assistant process and cannot enforce the hard isolation invariant
@@ -223,7 +223,7 @@ These invariants are enforced by guard tests and code review:
 1. **No cross-package source imports**: `assistant/` must not import from `credential-executor/` and vice versa. Communication is RPC only. Shared types flow through `packages/` only.
 2. **No credential values in assistant process memory**: The assistant sends credential handles (not values) to CES. CES materializes and uses them internally.
-3. **CES tools are the only approved exception to the no-new-tools policy** for credential-bearing execution. All other credential use continues through the existing broker for local deployments.
+3. **CES tools justify tool registrations over skills** for credential-bearing execution because of the hard process-boundary isolation requirement. All other credential use continues through the existing broker for local deployments.
 4. **Grants and audit logs are CES-internal**: The assistant cannot read CES grant tables or audit logs directly. CES exposes grant status and audit summaries via RPC responses.
 5. **No generic authenticated HTTP clients in secure commands**: `curl`, `wget`, `httpie`, interpreters, and shell trampolines are structurally denied as secure command entrypoints. This is checked at manifest validation and re-checked at execution time.
 6. **Managed CES container runs as non-root**: The CES Docker image runs as `uid 1001` (user `ces`). The CES data volume is owned by this user.
@@ -400,5 +400,5 @@ The following capabilities are intentionally deferred beyond v1:
 - [Security architecture](architecture/security.md) — existing credential broker and permission model
 - [AGENTS.md](../../AGENTS.md) — tooling direction and CES exception
-- [Tools AGENTS.md](../src/tools/AGENTS.md) — no-new-tools policy and CES exception
+- [Tools AGENTS.md](../src/tools/AGENTS.md) — tooling direction and CES exception
 - [Network traffic matrix](../../../vellum-assistant-platform/docs/network-traffic-matrix.md) — managed pod network policies