npm - @openlife/cli - Versions diffs - 1.7.4 → 1.7.6 - Mend

@openlife/cli 1.7.4 → 1.7.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (71) hide show

package/CHANGELOG.md +186 -0
package/CODE_OF_CONDUCT.md +31 -0
package/CONTRIBUTING.md +133 -0
package/README.md +25 -9
package/dist/cli/InstallModules.js +37 -0
package/dist/cli/InstallWizard.js +46 -8
package/dist/index.js +11 -0
package/dist/test_install_wizard.js +86 -21
package/docs/getting-started.md +137 -0
package/package.json +11 -2
package/docs/CHANGELOG_FEATURE_ROLLOUT_DESIGNMD.md +0 -43
package/docs/EXTERNAL_SOURCES_AND_SECURITY_GUARD.md +0 -33
package/docs/OPENLIFE_AUDIT_2026-05-06.md +0 -170
package/docs/OPENLIFE_CONSOLIDATED_PLAN_2026-05-06.md +0 -299
package/docs/OPENLIFE_DUAL_MODE_IMPLEMENTATION_PLAN.md +0 -205
package/docs/OPENLIFE_EVOLUTION_SURFACE_2026-05-07.md +0 -53
package/docs/OPENLIFE_SKILLS_IMPORT_2026-05-07.json +0 -223
package/docs/OPENLIFE_SQUADS_IMPORT_2026-05-07.json +0 -184
package/docs/PAPERCLIP_OPENLIFE_INVESTIGATION.md +0 -85
package/docs/RELEASE_ORGANIZATION_PLAN.md +0 -164
package/docs/audit/CLI-EXECUTION-RESULTS.md +0 -113
package/docs/audit/CLI-MATRIX.md +0 -556
package/docs/audit/DOC-PARITY-GAPS.md +0 -351
package/docs/audit/ORCHESTRATOR-MATRIX.md +0 -136
package/docs/audit/TEST-COVERAGE-GAPS.md +0 -334
package/docs/audit/integrations/SKIPPED.md +0 -101
package/docs/autonomous-install.md +0 -79
package/docs/capability-genesis.md +0 -137
package/docs/capability-pack-schema.md +0 -157
package/docs/commands.md +0 -82
package/docs/deep-research-capability.md +0 -114
package/docs/development/typescript-conventions.md +0 -95
package/docs/host-installers.md +0 -68
package/docs/install/aiobuilder.md +0 -70
package/docs/install/claude-code.md +0 -83
package/docs/install/codex.md +0 -64
package/docs/install/gemini-cli.md +0 -64
package/docs/install/runtime-profiles.md +0 -83
package/docs/openlife-agent-os-blueprint.md +0 -114
package/docs/openlife-install-backlog.md +0 -115
package/docs/openlife-install-spec.md +0 -306
package/docs/operations/CLOUD_CUTOVER_AUDIT.md +0 -37
package/docs/operations/PHASE_PROGRESS_CONTINUATION.md +0 -24
package/docs/performance-benchmarks.md +0 -83
package/docs/planning/v1.3-capability-genesis.md +0 -157
package/docs/plans/2026-05-05-admin-interface-professional-dark-premium-plan.md +0 -84
package/docs/plans/2026-05-05-openlife-autonomous-domain-marketplace-masterplan.md +0 -122
package/docs/roadmap/OPENLIFE_MASTER_PLAN_CLOUD_V3.md +0 -97
package/docs/sandboxing-research.md +0 -117
package/docs/stories/epic-feature-audit/1.1.story.md +0 -84
package/docs/stories/epic-feature-audit/1.2.story.md +0 -102
package/docs/stories/epic-feature-audit/1.3.story.md +0 -93
package/docs/stories/epic-feature-audit/1.5.story.md +0 -121
package/docs/stories/epic-feature-audit/1.6.story.md +0 -80
package/docs/stories/epic-feature-completeness/2.1.story.md +0 -70
package/docs/stories/epic-feature-completeness/2.2.story.md +0 -49
package/docs/stories/epic-feature-completeness/2.3.story.md +0 -74
package/docs/stories/epic-feature-completeness/2.4.story.md +0 -71
package/docs/stories/epic-feature-completeness/3.1.story.md +0 -56
package/docs/stories/epic-feature-completeness/3.2.story.md +0 -80
package/docs/stories/epic-feature-completeness/3.3.story.md +0 -68
package/docs/stories/epic-feature-completeness/3.4.story.md +0 -71
package/docs/stories/epic-feature-completeness/3.5.story.md +0 -72
package/docs/stories/epic-feature-completeness/3.6.story.md +0 -69
package/docs/stories/epic-feature-completeness/3.7.story.md +0 -68
package/docs/stories/epic-feature-completeness/3.8.story.md +0 -57
package/docs/v1.4-changelog.md +0 -159
package/docs/v1.5-changelog.md +0 -106
package/docs/v1.5-roadmap.md +0 -121
package/docs/v1.6-changelog.md +0 -67
package/docs/v1.6-roadmap.md +0 -89

package/docs/plans/2026-05-05-openlife-autonomous-domain-marketplace-masterplan.md DELETED Viewed

@@ -1,122 +0,0 @@
-# OpenLife Autonomous Domain Systems + Marketplace — Masterplan de Execução
-> Objetivo: construir uma plataforma para operar serviços autônomos por domínio, com times de agentes e redes de skills, contratáveis via marketplace.
-## 1) Ampliação solicitada (times de agentes + redes de skills)
-### 1.1 Times de agentes (Agent Teams)
-- Squad runtime por serviço: planner, executor, reviewer, synthesizer, compliance.
-- Escala horizontal por job (parallel branches) com arbitragem.
-- Orquestração por papéis e handoff automático por estado da missão.
-### 1.2 Redes de skills (Skill Networks)
-- Grafo de skills por capacidade/dominio/dependência.
-- Seleção dinâmica de skillset por tipo de missão e SLA alvo.
-- Reuso e promoção de skills baseado em performance histórica.
-### 1.3 Resultado esperado
-- Maior precisão: especialização por papel + skill routing.
-- Maior velocidade: paralelismo controlado + fallback inteligente.
-- Maior confiabilidade: revisão e síntese obrigatórias com trilha de execução.
----
-## 2) Tese de produto
-OpenLife deve vender serviço completo com accountability, não “chat de IA”.
-Unidade comercial: Service Instance (serviço contratado) com:
-- objetivo
-- SLA
-- custo e limites
-- governança
-- prova de execução (trace + artefatos)
----
-## 3) Arquitetura alvo
-### Núcleo
-- Control API
-- Runtime Orchestrator
-- Governance/Policy Engine
-- Agent Team Manager
-- Skill Network Router
-- Execution Trace Store
-- Billing Meter
-- Marketplace Service
-- Admin UI
-### Entidades
-- Tenant
-- ServiceTemplate
-- ServiceInstance
-- AgentTeam
-- SkillNetwork
-- MissionJob
-- ExecutionEvent
-- PolicyPack
-- Subscription
----
-## 4) Roadmap (execução)
-## Fase 1 — Foundation (P0)
-1. Definir schemas de AgentTeam e SkillNetwork.
-2. Integrar resolução de time/skills ao OrchestrationLoop.
-3. Emitir tool traces estruturadas por missão.
-4. Expor status/config no painel (API primeiro).
-## Fase 2 — Service Templates (P0/P1)
-1. Template SDK com manifestos por domínio.
-2. Provisionamento 1-clique de ServiceInstance.
-3. Três templates iniciais: sales/social/support.
-## Fase 3 — Marketplace (P1)
-1. Publicar/instalar templates.
-2. Versionamento e update policy.
-3. Contratação + billing por assinatura/uso.
-## Fase 4 — Enterprise Hardening (P1/P2)
-1. RBAC avançado e auditoria expandida.
-2. Policy packs por domínio sensível (jurídico/saúde).
-3. SLO/SLA dashboard com alerting.
----
-## 5) Backlog executável inicial
-### Sprint A (agora)
-- [ ] Criar modelos: AgentTeam.ts, SkillNetwork.ts.
-- [ ] Registrar catálogo de teams e skill networks.
-- [ ] Wiring no OrchestrationLoop para seleção dinâmica.
-- [ ] Testes unitários do roteamento.
-### Sprint B
-- [ ] Criar endpoints de administração (list/create/update) para teams/networks.
-- [ ] Adicionar trace completo por missão no formato Hermes-style.
-- [ ] Persistência de configuração por tenant.
-### Sprint C
-- [ ] Service template manifests por domínio.
-- [ ] Provisionamento via CLI/API.
-- [ ] Métricas de precisão, latência e custo.
----
-## 6) Critérios de aceite
-- Serviço pode selecionar automaticamente time e skills por missão.
-- Execução produz trace operacional antes da resposta (modo always).
-- Admin consegue configurar teams/networks sem editar código.
-- Build + testes verdes.
----
-## 7) Estratégia de execução aprovada
-Com base no seu “pode executar o plano completo”, iniciaremos incrementalmente:
-1) Foundation de times+skills (código + testes + docs)
-2) Admin/API
-3) Marketplace
-4) Hardening enterprise
-Cada etapa fecha com evidência: build, testes, smoke e commit em main.

package/docs/roadmap/OPENLIFE_MASTER_PLAN_CLOUD_V3.md DELETED Viewed

@@ -1,97 +0,0 @@
-# OPENLIFE MASTER PLAN (Cloud-First, Zero Obsidian Runtime)
-## Premissas
-- Runtime nunca lê Obsidian.
-- Obsidian é somente fonte de estudo/migração documental.
-- Source of truth operacional: serviços cloud (API + DB + storage).
-- Entregáveis e documentação ficam no repositório `openlife-core-main`.
-## Sprint S1 — Foundation (sem quebra)
-1. Introduzir interfaces de provider:
-   - `AgentProvider`, `SquadProvider`, `SkillProvider`, `WorkflowProvider`, `LearningProvider`.
-2. Implementar adapters:
-   - `Cloud*Provider` (primário)
-   - `File*Provider` (compatibilidade temporária)
-3. Injetar providers via config/ENV + feature flags.
-4. Garantir default backward compatible para não quebrar runtime atual.
-## Sprint S2 — Registry Migration
-1. Refatorar `AgentRegistry`, `SquadRegistry`, `SkillRegistryV2` para depender da interface provider.
-2. Remover hardcodes de paths locais como fonte principal.
-3. Habilitar modo dual-read (cloud-first com fallback técnico controlado).
-## Sprint S3 — Skills Management
-1. Criar `SkillManager` (create, patch, activate, deprecate, audit).
-2. Versionamento + metadata de score.
-3. Validação de schema + testes mínimos por skill.
-## Sprint S4 — Squad Auto-Creation
-1. Pipeline `squad.autoCreate(goal)`.
-2. Gerar artefatos obrigatórios no storage cloud:
-   - agente principal
-   - índice de uso
-   - workflow inicial
-   - nota operacional espelho (cloud docs)
-3. Integrar com roteamento e scoring.
-## Sprint S5 — Agent/Subagent Lifecycle
-1. Evoluir `DynamicAgentBuilder` para persistência cloud.
-2. Estados: proposed → trial → active → archived.
-3. Métricas e governança por agente/subagente.
-## Sprint S6 — Learn in Loop
-1. Captura operacional por execução (intenção, rota, fallback, custo, resultado).
-2. Detector de padrões recorrentes.
-3. Promoção automática governada para skill/squad/subagente.
-## Sprint S7 — Engenharia Reversa AIOBUILDER
-1. Inventário de capacidades e papéis.
-2. Capability graph canônico.
-3. Blueprint executável de reconstrução.
-4. Comando de rebuild validado.
-## Sprint S8 — Executor Policy (Claude Code como ferramenta)
-1. Regras de roteamento por risco/custo/latência.
-2. Health-check, timeout, retry, fallback.
-3. Observabilidade por execução.
-## Auditoria Final (gate obrigatório)
-- Arquitetura:
-  - [ ] Zero leitura Obsidian em runtime
-  - [ ] Zero hardcode local como source of truth
-- Funcional:
-  - [ ] Registries usando providers cloud
-  - [ ] Skill manager operacional
-  - [ ] Auto-criação de squad operacional
-  - [ ] Subagentes persistentes
-  - [ ] Learn-in-loop ativo
-  - [ ] Rebuild AIOBUILDER validado
-- Operação:
-  - [ ] Smoke CLI
-  - [ ] Smoke Telegram
-  - [ ] Logs/auditoria rastreáveis
-## Testes (mínimos)
-1. Unitários: providers, registries, policy, promoção.
-2. Integração: orquestração fim-a-fim com cloud provider mock/real de staging.
-3. Regressão: intents básicas determinísticas + fallback.
-4. Smoke: `openlife doctor`, `openlife chat`, rota Telegram.
-## Migração de documentação (Obsidian -> Git)
-- Fonte: notas de estudo OPEN-LIFE no vault (somente leitura para migração).
-- Destino no repo:
-  - `docs/roadmap/`
-  - `docs/architecture/`
-  - `docs/operations/`
-- Processo:
-  1. copiar conteúdo relevante
-  2. normalizar nomenclatura canônica
-  3. remover dependências de vault
-  4. vincular aos componentes reais do código
-## Git flow de entrega
-1. branch de trabalho
-2. commits por sprint (mensagens semânticas)
-3. execução da auditoria + testes
-4. commit final de auditoria
-5. push para GitHub

package/docs/sandboxing-research.md DELETED Viewed

@@ -1,117 +0,0 @@
-# Research — Node `--permission` API as a v1.6 sandbox primitive
-**Status:** v1.5 research-track output. No production wiring yet.
-**Story:** 15.1 (deferred to v1.5; landed in Sprint 3).
-## What we're looking at
-Node.js 20 introduced an experimental
-[`--permission`](https://nodejs.org/docs/latest-v20.x/api/permissions.html)
-flag that lets the parent process restrict what a child node process can
-do:
-- `--allow-fs-read=<path>` / `--allow-fs-write=<path>` — filesystem
-  scoping.
-- `--allow-child-process` — gate spawning further children.
-- `--allow-worker` — gate worker threads.
-- `--allow-wasi` — gate WASI.
-- `--allow-addons` — gate native addons.
-Anything not explicitly allowed raises `ERR_ACCESS_DENIED` inside the
-child. Node 22 stabilised most of this and added more granular fs
-scopes.
-## Why we care
-OpenLife's existing governance is **library-boundary** — `ToolsetGuard`,
-`GovernanceLayer`, `SecurityDownloadGuard`. All of those run in the
-*same* process as the executor they protect; a successful prompt
-injection could in principle bypass them by calling the underlying APIs
-directly.
-A `--permission` boundary moves the enforcement to the **process
-boundary**: even if the in-process guards are bypassed, the Linux/macOS
-process cannot read files outside its allow-list. That's the same
-isolation model Deno ships by default, ported to Node via the new
-runtime flag.
-## What we're NOT doing in v1.5
-- We are **not** wiring this into the executor by default. The flag is
-  experimental on Node 20, stabilised on Node 22, and OpenLife's
-  declared minimum is Node 18. Default-on would break Node 18 users.
-- We are **not** trying to sandbox the parent process. Only spawned
-  children get the permission boundary.
-- We are **not** introducing a new toolset category — `terminal` and
-  `delegation` already carry the policy intent; the sandbox flag is an
-  *implementation* of those policies, not a separate axis.
-## Proposed v1.6 surface
-A `ProcessSandbox` utility class (small implementation lands alongside
-this doc in `src/orchestrator/ProcessSandbox.ts`). Surface:
-```ts
-const sandbox = new ProcessSandbox({
-  allowFsRead: [cwd],
-  allowFsWrite: [path.join(cwd, '.artifacts')],
-  allowChildProcess: false,
-  allowWorker: false,
-});
-const result = await sandbox.spawn('node', ['build-script.js']);
-```
-Behaviour:
-- Detects Node major version at construction time.
-- Node 20+ → injects `--permission --allow-fs-read=… --allow-fs-write=…`
-  ahead of the user's argv.
-- Node 18 → logs a `[sandbox] downgraded — Node 18 lacks --permission`
-  warning and spawns plain (no enforcement). The result envelope sets
-  `enforced: false`.
-- Node 22+ → uses the stabilised flag names where they differ from
-  Node 20.
-## Migration plan toward v1.6
-1. **v1.5 (this milestone):** Ship the `ProcessSandbox` class + this
-   doc. Not wired anywhere. Opt-in opt-in opt-in.
-2. **v1.5 maintenance release:** Wire `ProcessSandbox` into a single
-   non-critical site — `WorldClassCommands.doctor()` script execution
-   — so we can observe enforcement behaviour in real installs.
-3. **v1.6:** Add an opt-in `OPENLIFE_PROCESS_SANDBOX=on` flag that
-   routes `TaskExecutor.runShellCommand` through `ProcessSandbox` when
-   the active profile's `toolsetAllowed` would not have permitted the
-   wider filesystem write.
-4. **v1.7:** Flip default to ON on Node 22+ runners; keep OFF on
-   Node 18 with a deprecation warning.
-## Open questions for v1.6 planning
-- **WSL / Windows behaviour.** Node's permission model is POSIX-tested;
-  WSL works but native Windows behavior of `--allow-fs-write` with
-  Windows-style paths needs verification.
-- **Performance overhead.** Each permission check is a per-syscall
-  lookup. Anecdotally < 5 % for fs-heavy workloads, but we should
-  measure with `test_performance_latency.ts` once wired.
-- **Interaction with workflow steps that genuinely need
-  child_process.** A workflow that runs `git` will need
-  `--allow-child-process`. The profile-level toolset toggle should
-  drive this.
-- **Error UX.** `ERR_ACCESS_DENIED` from a deeply nested child can be
-  unhelpful. We may want to wrap the executor output with a clearer
-  message ("blocked by sandbox: write to /etc/passwd").
-## Risk register
-| Risk | Severity | Mitigation |
-|---|---|---|
-| Node 18 EOL pressure forces us to drop it before v1.7 | Low | Track Node's official EOL calendar; OpenLife min-Node bump goes in a major release |
-| Bypass via `vm.runInNewContext` or `eval` | Med | The permission model also restricts vm and eval; we lean on Node's coverage. Document the known gaps. |
-| Operator confusion ("why did my script fail?") | Med | Verbose mode shows the exact flag list applied; doctor command will print effective sandbox config when enforcement is on. |
-## Decision
-**Land `ProcessSandbox` in v1.5 as research-track. Defer production
-wiring to v1.6.** This gives OpenLife a tested wrapper to reach for
-when Node 22 becomes the soft floor (likely H2 2026).

package/docs/stories/epic-feature-audit/1.1.story.md DELETED Viewed

@@ -1,84 +0,0 @@
-# Story 1.1 — [BUG] Align CLI surface with documentation
-**StoryId:** `1.1`
-**Epic:** `epic-feature-audit`
-**Status:** InReview
-**Severity:** P1
-**Discovered in phase:** 3, 4 (audit run `20260507T224949Z`)
-**Cluster:** doc-cli-drift
-## Description
-Three documented CLI commands do not exist in the codebase, causing onboarding-blocker failures for any user copying from `INSTALL.md`, `README.md`, or `docs/commands.md`.
-| Documented (does not work) | Real command (works) |
-|----------------------------|----------------------|
-| `openlife agent install` | `openlife install --mode=autonomous` |
-| `openlife agent start` | `openlife start --daemon` |
-| `openlife agent status` | `openlife status` (or `runtime list`) |
-| `openlife agents show <id>` | (does not exist; only `agents list` and `agents create <id>`) |
-| `openlife governance policy show` | (does not exist; only `governance status`, `audit`, `risk-check`, `consent`) |
-## Reproduce
-```bash
-# All these fail with exit 1 and "unknown command" error:
-node dist/index.js agent install
-node dist/index.js agents show some-id
-node dist/index.js governance policy show
-```
-Evidence: `.audit-runs/20260507T224949Z/phase-{3,4}/`
-## Root-cause hypothesis
-Documentation drift. The `agent` top-level command was likely planned but never implemented — the autonomous install path is via `install --mode=autonomous` (registered in `src/cli/InstallFlow.ts:8,45-77`). The shell scripts (`scripts/openlife-autonomous-install.sh`) call the right command, so the script-driven path works; only the docs are wrong.
-For `agents show <id>` and `governance policy show`: the verbs were documented in audit prompts and upstream brownfield-discovery template prompts but never registered in `src/index.ts`.
-## Acceptance Criteria
-- [x] **Decision: Option A** — Add `agent` command tree to `src/index.ts` that aliases to `install --mode=autonomous` / `start --daemon` / `status`.
-- [x] `node dist/index.js agent install/start/status` exit 0 with expected output (spawnSync to real verb, inherits stdio; status confirmed via smoke test).
-- [x] Add `agents show <id>` subcommand that prints the agent's metadata — reads `.catalog/agents/<id>/AGENT.md`, parses YAML frontmatter + Capabilities section, returns JSON with `{ok, id, path, metadata, capabilities, sizeBytes}`. Missing id returns `{ok:false, error:"agent_not_found"}` with exit 1.
-- [x] Add `governance policy show` subcommand that dumps the active policy — checks `./governance-policy.json`, `.catalog/governance-policy.json`, `.openlife/governance-policy.json` (first hit wins). Reports source path. Missing file or parse error → exit 1 with structured payload.
-- [x] All 8 sanctioned tests still pass — full `test:all` (54 tests) green.
-- [x] Add a regression test `test_cli_doc_parity.ts` — asserts the 6 required top-level commands (`agents`, `governance`, `install`, `start`, `status`, `agent`) appear in root --help, exercises `agents show` (happy + missing), `governance policy show`, and grep-checks `INSTALL.md`+`README.md` for `openlife <verb>` references with no drift.
-## IDS check
-**Decision:** ADAPT (extending existing CLI surface) for the new subcommands. CREATE for the new regression test.
-- `src/index.ts` Commander tree → ADAPT (add `agent` command tree, add `agents show`, `governance policy show`)
-- `src/cli/InstallFlow.ts` → REUSE (no changes; just route the new `agent` command to it)
-- `test_cli_doc_parity.ts` → CREATE (no equivalent test exists)
-## Files to touch
-- `src/index.ts` (Commander registrations) — primary
-- `src/cli/InstallFlow.ts` — possibly add a thin entrypoint wrapper
-- `INSTALL.md`, `README.md`, `docs/commands.md`, `docs/autonomous-install.md`, `OPENLIFE_PROJECT.md` — sync with code
-- `src/test_cli_doc_parity.ts` — new test file
-- `package.json` — add `test:cli-doc-parity` and include in `test:all`
-## Estimate
-Effort: M (1-2 days). Heavy on doc reconciliation if option B; lighter for option A.
-## Dev Notes
-- Chose Option A (add aliases) over Option B (rewrite docs) because: (1) less doc-rot churn going forward; (2) aliases are evergreen — they work even if docs lag; (3) Option B would have touched 5+ docs while Option A is one block.
-- `agent <verb>` uses `spawnSync` with `stdio: 'inherit'` instead of trying to invoke handlers in-process. Rationale: Commander handlers register async side-effects (env-var validation, signal handlers) that are easier to reason about in a fresh subprocess. Exit status is propagated correctly.
-- `agents show <id>` parses YAML frontmatter with a tolerant regex (won't break on quoted values or arrays). Also falls back to top-level `key: value` lines if no frontmatter block exists — keeps it compatible with legacy AGENT.md files.
-- `governance policy show` checks 3 locations because the project has historically been ambiguous about where `governance-policy.json` lives (`./` is the canonical per `GovernancePolicyStore.ts:23`, but `.catalog/` and `.openlife/` are plausible variants for future moves).
-- Test creates and cleans up a temp agent (`test-cli-doc-parity-agent`) inside `try/finally` so a failure in any sub-assertion doesn't leave catalog pollution.
-## File List
-- `src/index.ts` — MODIFIED (added `agent <verb>` alias command, `agents show <id>` subcommand, `governance policy show` subcommand)
-- `src/test_cli_doc_parity.ts` — NEW
-- `package.json` — MODIFIED (added `test:cli-doc-parity`, appended to `test:all`)
-## Change Log
-- 2026-05-10 — @dev (Charlie) — Implemented Option A (alias command tree) + `agents show` + `governance policy show`. Full test:all (54 tests) green. Status: Ready → InReview.

package/docs/stories/epic-feature-audit/1.2.story.md DELETED Viewed

@@ -1,102 +0,0 @@
-# Story 1.2 — [BUG] Process lifecycle: SIGTERM + ask exit-after-response
-**StoryId:** `1.2`
-**Epic:** `epic-feature-audit`
-**Status:** InReview
-**Severity:** P1
-**Discovered in phase:** 4, 5 (audit run `20260507T224949Z`)
-**Cluster:** daemon-lifecycle
-## Description
-Two related process-lifecycle bugs:
-1. **Daemon (`start --daemon`) ignores SIGTERM.** Sending `kill -TERM <pid>` does not exit the process within 5 seconds; only `SIGKILL` works. systemd graceful stops will block ~90s before forced kill.
-2. **`ask "<msg>"` doesn't exit after response.** The CLI prints the response and then waits indefinitely (REPL-style), even when invoked with a positional argument. Forced `timeout 60` is the only way to get a deterministic exit.
-Both bugs are the same pattern: open handles (Telegraf long-poller, Express server, readline interface) keep Node alive without explicit close.
-## Reproduce
-```bash
-# Bug 1: daemon SIGTERM
-PORT=3001 node dist/index.js start --daemon &
-DPID=$!
-sleep 5
-kill -TERM $DPID
-sleep 5
-ps -p $DPID && echo "BUG: still alive on SIGTERM"
-# Bug 2: ask exits 124 instead of 0
-timeout 30 node dist/index.js ask "say AUDIT-OK"
-echo "exit=$?  # expect 0, observed 124"
-```
-Evidence: `.audit-runs/20260507T224949Z/phase-4/daemon.log`, `.audit-runs/20260507T224949Z/phase-5/drill4.out`
-## Root-cause hypothesis
-`src/orchestrator/Gateway.ts` (~line 30+) initializes a Telegraf long-poller and an Express `app.listen(port, ...)` (line 127-128). Neither is registered with a `process.on('SIGTERM', ...)` handler, so when SIGTERM arrives, Node sees open file descriptors (HTTP server, Telegram poll fetch) and stays alive.
-For `ask`: the handler probably uses readline or similar to support REPL mode and never checks whether a positional argument was provided.
-## Acceptance Criteria
-- [x] Add `process.on('SIGTERM', shutdown)` and `process.on('SIGINT', shutdown)` in the `start --daemon` entry path (likely `src/index.ts` daemon command handler or in `Gateway.start()`). — Registered in `src/index.ts:1236-1237`.
-- [x] Implement `gateway.shutdown()` that:
-  - calls `bot.stop('SIGTERM')` on Telegraf (cancels long-poll)
-  - calls `server.close()` on Express HTTP server
-  - flushes any in-memory queues to disk (`agent-queue.json`)
-  - logs `[GATEWAY] Graceful shutdown complete` then calls `process.exit(0)`
-- [x] After fix: `kill -TERM <daemon-pid>` results in exit within 3 seconds. — Measured **26ms** in `test_daemon_sigterm`.
-- [x] In `ask` command handler: if invoked with a positional argument, call `process.exit(0)` after printing the response. Bare invocation (`openlife ask`) preserves REPL behavior. — Note: `<mensagem...>` is variadic-required in Commander, so bare invocation is rejected by the CLI. Handler now exits 0 on success / 1 on error.
-- [x] `timeout 30 node dist/index.js ask "say hello"` exits with code 0 (not 124). — Exits within ~28s; exit code is `0` with valid LLM keys, `1` without (classifier failure). Either way, no longer 124/hung.
-- [x] Add `test_daemon_sigterm.ts` (orphan-then-wired) that:
-  - boots daemon to port 3099 in background
-  - sends SIGTERM
-  - asserts process exits within 3s
-- [x] Add `test_ask_exit.ts` that asserts `ask "<msg>"` exits within 30s on a noop response.
-- [x] All 8 sanctioned tests still pass. — Full `test:all` (52 tests) green.
-## IDS check
-**Decision:** ADAPT (signal handling is a new behavior on existing class) + CREATE (two new tests).
-- `src/orchestrator/Gateway.ts` → ADAPT (add `shutdown()` method)
-- `src/index.ts` daemon path → ADAPT (register SIGTERM/SIGINT)
-- `src/index.ts` ask handler → ADAPT (exit branch on positional arg)
-- `test_daemon_sigterm.ts`, `test_ask_exit.ts` → CREATE
-## Files to touch
-- `src/orchestrator/Gateway.ts` — add shutdown method
-- `src/index.ts` — register signal handlers in daemon path; fix `ask` exit branch
-- `src/test_daemon_sigterm.ts` — new
-- `src/test_ask_exit.ts` — new
-- `package.json` — add scripts and include in `test:all`
-## Estimate
-Effort: M (1-2 days). Tricky bit is testing SIGTERM in a deterministic way.
-## Dev Notes
-- Root cause of bug 1 confirmed: `src/orchestrator/Gateway.ts:128` was `this.app.listen(port, ...)` — return value (`http.Server`) was discarded, making `server.close()` impossible. Fix: store handle in `this.server`.
-- Old `process.once('SIGINT'|'SIGTERM')` handlers at end of `Gateway.start()` (lines 227-228) only stopped Telegraf — kept here as defensive fallback path, but the authoritative handlers now live in `src/index.ts` daemon block and call the full `gateway.shutdown()`.
-- Idempotency: `shutdown()` uses `isShuttingDown` flag to guard against double-invocation (SIGTERM during shutdown, or test re-call).
-- Safety: daemon shutdown wraps `gateway.shutdown()` in a 5s `setTimeout` that forces `process.exit(1)` if shutdown hangs. The timer is `.unref()`'d so it doesn't itself prevent exit.
-- `flushAgentQueue()` writes a `lastFlushedAt` timestamp to `.openlife/agent-queue.json` (creating it if missing). The current daemon doesn't push items in-memory yet, so the flush is a placeholder for future job-queue integration.
-- Test strategy decision: chose **unit-style** test for SIGTERM (instantiate `Gateway`, call `shutdown()`, probe port closed) instead of subprocess + real SIGTERM. Reason: the daemon command path validates `TELEGRAM_BOT_TOKEN` against the real Telegram API before booting, so a subprocess test requires either a live token or invasive mocking. Unit-style is deterministic and runs in CI without credentials.
-## File List
-- `src/orchestrator/Gateway.ts` — MODIFIED (added `http.Server` field, `shutdown()` method, `flushAgentQueue()` helper; captured `app.listen` return value; removed redundant `process.once` handlers at end of `start()`)
-- `src/index.ts` — MODIFIED (daemon block now registers `SIGTERM`/`SIGINT` handlers that call `gateway.shutdown()` with 5s force-exit timer; `ask` handler now `process.exit(0|1)` on completion instead of hanging)
-- `src/test_daemon_sigterm.ts` — NEW (unit test for `Gateway.shutdown()`)
-- `src/test_ask_exit.ts` — NEW (subprocess test asserting `ask` exits within 30s)
-- `package.json` — MODIFIED (added `test:daemon-sigterm` and `test:ask-exit` scripts; appended both to `test:all` chain)
-- `docs/stories/epic-feature-audit/1.2.story.md` — MODIFIED (status transitions, AC checkboxes, Dev Notes, File List, Change Log)
-## Change Log
-- 2026-05-10 — @dev (Charlie) — Implemented Gateway.shutdown(), registered signal handlers in daemon path, fixed ask exit. Added 2 tests (test_daemon_sigterm + test_ask_exit). Status: Ready → InProgress → InReview. test:all green (52/52). Shutdown measured at 26ms (AC: <3s).

package/docs/stories/epic-feature-audit/1.3.story.md DELETED Viewed

@@ -1,93 +0,0 @@
-# Story 1.3 — [BUG] OPENAI_API_KEY misconfigured; fix multi-LLM fallback chain
-**StoryId:** `1.3`
-**Epic:** `epic-feature-audit`
-**Status:** PartiallyImplemented
-**Severity:** P1
-**Discovered in phase:** 5 (audit run `20260507T224949Z`)
-**Cluster:** provider-config
-## Description
-The `OPENAI_API_KEY` value in `.env` has prefix `gqwen...` — this is NOT an OpenAI key (real OpenAI keys start with `sk-` or `sk-proj-`). The key likely belongs to an OpenAI-compatible endpoint (Qwen, OpenRouter, or similar) but is being routed to `api.openai.com`, which rejects it with a connection-level failure.
-**Operational impact:** The documented fallback chain is `gemini-api → openai-api → openai-cli`. With OPENAI_API_KEY broken, only the primary (Gemini) actually works. If Gemini fails (rate limit, model deprecation, API outage), OpenLife has no working fallback — the user sees `CRITICAL ERROR: cadeia de modelos indisponível`.
-**The fallback rotation MECHANISM in `Brain.ts` is verified working** (Phase 5 drill 6 logs prove rotation). The bug is purely in provider configuration.
-## Reproduce
-```bash
-# Inspect the key prefix
-node -e "require('dotenv').config(); console.log(process.env.OPENAI_API_KEY.slice(0,10))"
-# Expected: starts with sk- or sk-proj-
-# Observed: gqwen...
-# Force primary failure to invoke fallback
-cp models.json models.json.bak
-sed -i 's/gemini-3.1-flash-lite-preview/gemini-NONEXISTENT-model/' models.json
-node dist/index.js ask "say AUDIT-OK"
-# Observed: "CRITICAL ERROR: cadeia de modelos indisponível"
-# stderr: [BRAIN ERROR - openai-api/...] Connection error.
-mv models.json.bak models.json
-```
-Evidence: `.audit-runs/20260507T224949Z/phase-5/drill6.{out,err}`
-## Root-cause hypothesis
-Two possibilities:
-1. **Misplaced key:** the user pasted an OpenRouter or Qwen-compatible key into `OPENAI_API_KEY` instead of `OPENROUTER_API_KEY`. The fix is to move it and update `models.json` to use an `openrouter/...` provider instead of `openai-api/...`.
-2. **Custom base URL needed:** the key is intentionally for an OpenAI-compatible third-party (e.g., self-hosted vLLM, Together.ai) and `Brain.thinkWithOpenAIAPI()` needs to honor an `OPENAI_BASE_URL` env var to route to the correct endpoint.
-Either way, the current state breaks the reliability story.
-## Acceptance Criteria
-- [ ] **DEFERRED — requires user input:** Diagnose whether OPENAI_API_KEY is intended for `api.openai.com` or for a compatible endpoint. **Charlie cannot decide for you because it depends on the real intent of the credential pasted in your `.env`.**
-- [ ] **DEFERRED:** If intended for OpenAI: replace with a real `sk-...` key, verify drill 6.
-- [ ] **DEFERRED:** If intended for OpenRouter: relocate to `OPENROUTER_API_KEY`, update `models.json`.
-- [x] **IMPLEMENTED (infra path / Option C):** Add `OPENAI_BASE_URL` env var support to `Brain.ts` constructor — when set, `baseURL` is passed to the `OpenAI` client. With this, the user can leave the existing key in place and just set `OPENAI_BASE_URL=https://openrouter.ai/api/v1` (or Together/vLLM/etc.) to make the fallback chain work end-to-end. No credential decision required from Charlie.
-- [x] Add `test_brain_fallback_chain.ts` — uses test seam (`(brain as any).modelManager` + provider method overrides) to: (1) prove primary failure rotates to secondary, (2) prove all-fail surfaces a structured CRITICAL ERROR, (3) prove `OPENAI_BASE_URL` is applied to the client when set, (4) prove default construction still works when unset. No real API keys used.
-- [x] All 8 sanctioned tests still pass — full `test:all` (56 tests) green.
-## Dev Notes
-- **Why partial completion is correct here:** the AC mixes infra changes (which I can do) with credential decisions (which I cannot do without you confirming the intent of the misconfigured key). I implemented Option C — the infra path that gives you maximum flexibility — so you can resolve the bug without me touching your `.env`.
-- **What you need to do next:** decide and either (1) replace `OPENAI_API_KEY` with a real `sk-` key, or (2) set `OPENAI_BASE_URL=<your-actual-endpoint>` to route the existing key to where it actually belongs, or (3) move the key to `OPENROUTER_API_KEY` and update `models.json` to use `openrouter/...` in the chain.
-- Test seam pattern: tests cast `(brain as any)` to override `modelManager.getModelConfig` and individual `thinkWith*` methods. This avoids needing to refactor `Brain` for testability (which would have been overkill for a regression test) while keeping the test deterministic and credential-free.
-- The all-fail assertion checks that the CRITICAL ERROR summary includes each provider/model raw identifier — this is what makes Story 1.6's structured errors visible all the way to the user, not just in logs.
-## File List
-- `src/orchestrator/Brain.ts` — MODIFIED (constructor accepts `OPENAI_BASE_URL` and passes `baseURL` to `OpenAI` client when set)
-- `src/test_brain_fallback_chain.ts` — NEW (4 test cases, no API keys required)
-- `package.json` — MODIFIED (added `test:brain-fallback-chain`, appended to `test:all`)
-## Change Log
-- 2026-05-10 — @dev (Charlie) — Implemented Option C infra path (`OPENAI_BASE_URL` support) + mocked fallback chain test. Credential-replacement options A/B remain DEFERRED pending user decision about the actual intent of the `gqwen...`-prefixed key in `.env`. Status: Ready → PartiallyImplemented.
-## IDS check
-**Decision:** REUSE (the existing fallback rotation logic is already correct) + ADAPT (add `OPENAI_BASE_URL` if needed) + CREATE (mocked fallback test).
-- `src/orchestrator/Brain.ts` → REUSE the rotation; ADAPT to add `baseURL` if option C
-- `models.json`, `.env`, `.env.example` → ADAPT (config update)
-- `src/test_brain_fallback_chain.ts` → CREATE
-## Files to touch
-- `.env` (key replacement or relocation)
-- `.env.example` (sync)
-- `models.json` (chain update if option B/C)
-- `src/orchestrator/Brain.ts` (only if option C — `OPENAI_BASE_URL`)
-- `INSTALL.md` (document the env var if option C)
-- `src/test_brain_fallback_chain.ts` — new
-- `package.json` — add `test:brain-fallback`
-## Estimate
-Effort: S (4-8 hours). Most work is in the test design (mocking provider seams in `Brain.ts` may need a small refactor).