npm - @jterrats/open-orchestra - Versions diffs - 1.0.15 → 1.0.17 - Mend

@jterrats/open-orchestra 1.0.15 → 1.0.17

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (191) hide show

package/AGENTS.md +3 -3
package/CLAUDE.md +26 -4
package/README.md +32 -9
package/dist/benchmark.js +65 -27
package/dist/benchmark.js.map +1 -1
package/dist/command-manifest.js +6 -1
package/dist/command-manifest.js.map +1 -1
package/dist/command-routes.js +8 -1
package/dist/command-routes.js.map +1 -1
package/dist/commands.d.ts +2 -1
package/dist/commands.js +2 -1
package/dist/commands.js.map +1 -1
package/dist/context-vault-commands.d.ts +5 -0
package/dist/context-vault-commands.js +79 -0
package/dist/context-vault-commands.js.map +1 -0
package/dist/context-vault-file-metadata.d.ts +3 -0
package/dist/context-vault-file-metadata.js +25 -0
package/dist/context-vault-file-metadata.js.map +1 -0
package/dist/context-vault-model.d.ts +79 -0
package/dist/context-vault-model.js +2 -0
package/dist/context-vault-model.js.map +1 -0
package/dist/context-vault-redaction.d.ts +5 -0
package/dist/context-vault-redaction.js +22 -0
package/dist/context-vault-redaction.js.map +1 -0
package/dist/context-vault-renderer.d.ts +3 -0
package/dist/context-vault-renderer.js +30 -0
package/dist/context-vault-renderer.js.map +1 -0
package/dist/context-vault-service.d.ts +17 -0
package/dist/context-vault-service.js +183 -0
package/dist/context-vault-service.js.map +1 -0
package/dist/delivery-commands.d.ts +1 -0
package/dist/delivery-commands.js +19 -1
package/dist/delivery-commands.js.map +1 -1
package/dist/diagram-schema.d.ts +2 -0
package/dist/diagram-schema.js +148 -0
package/dist/diagram-schema.js.map +1 -0
package/dist/diagrams/index.d.ts +2 -0
package/dist/diagrams/index.js +2 -0
package/dist/diagrams/index.js.map +1 -1
package/dist/diagrams/pipeline.js +7 -5
package/dist/diagrams/pipeline.js.map +1 -1
package/dist/evidence-compaction-model.d.ts +62 -0
package/dist/evidence-compaction-model.js +2 -0
package/dist/evidence-compaction-model.js.map +1 -0
package/dist/evidence-compaction-renderer.d.ts +2 -0
package/dist/evidence-compaction-renderer.js +35 -0
package/dist/evidence-compaction-renderer.js.map +1 -0
package/dist/evidence-compaction-service.d.ts +11 -0
package/dist/evidence-compaction-service.js +94 -0
package/dist/evidence-compaction-service.js.map +1 -0
package/dist/evidence-compaction-summary.d.ts +4 -0
package/dist/evidence-compaction-summary.js +160 -0
package/dist/evidence-compaction-summary.js.map +1 -0
package/dist/knowledge-base.js +7 -1
package/dist/knowledge-base.js.map +1 -1
package/dist/metrics-commands.js +3 -0
package/dist/metrics-commands.js.map +1 -1
package/dist/planning-commands.js +23 -1
package/dist/planning-commands.js.map +1 -1
package/dist/quality-contracts.js +12 -6
package/dist/quality-contracts.js.map +1 -1
package/dist/report-index.d.ts +3 -0
package/dist/report-index.js +3 -0
package/dist/report-index.js.map +1 -0
package/dist/report-model.d.ts +22 -0
package/dist/report-model.js +2 -0
package/dist/report-model.js.map +1 -0
package/dist/report-render-markdown.d.ts +1 -0
package/dist/report-render-markdown.js +40 -0
package/dist/report-render-markdown.js.map +1 -0
package/dist/report-schema.d.ts +2 -0
package/dist/report-schema.js +109 -0
package/dist/report-schema.js.map +1 -0
package/dist/rule-catalog.d.ts +33 -0
package/dist/rule-catalog.js +215 -0
package/dist/rule-catalog.js.map +1 -0
package/dist/runtime-bootstrap.js +33 -8
package/dist/runtime-bootstrap.js.map +1 -1
package/dist/runtime-capacity-policy.d.ts +38 -0
package/dist/runtime-capacity-policy.js +117 -0
package/dist/runtime-capacity-policy.js.map +1 -0
package/dist/runtime-capacity-scheduler-helpers.d.ts +40 -0
package/dist/runtime-capacity-scheduler-helpers.js +111 -0
package/dist/runtime-capacity-scheduler-helpers.js.map +1 -0
package/dist/runtime-capacity-scheduler-state.d.ts +44 -0
package/dist/runtime-capacity-scheduler-state.js +128 -0
package/dist/runtime-capacity-scheduler-state.js.map +1 -0
package/dist/runtime-capacity-scheduler.d.ts +34 -0
package/dist/runtime-capacity-scheduler.js +193 -0
package/dist/runtime-capacity-scheduler.js.map +1 -0
package/dist/runtime-capacity-snapshot.d.ts +14 -0
package/dist/runtime-capacity-snapshot.js +87 -0
package/dist/runtime-capacity-snapshot.js.map +1 -0
package/dist/runtime-child-prompt.d.ts +2 -1
package/dist/runtime-child-prompt.js +4 -1
package/dist/runtime-child-prompt.js.map +1 -1
package/dist/runtime-claude-native-bridge.js +2 -1
package/dist/runtime-claude-native-bridge.js.map +1 -1
package/dist/runtime-commands.js +6 -0
package/dist/runtime-commands.js.map +1 -1
package/dist/runtime-context-manifest.js +3 -24
package/dist/runtime-context-manifest.js.map +1 -1
package/dist/runtime-lifecycle-watch.d.ts +5 -2
package/dist/runtime-lifecycle-watch.js +19 -3
package/dist/runtime-lifecycle-watch.js.map +1 -1
package/dist/runtime-load-balancer.d.ts +12 -0
package/dist/runtime-load-balancer.js +106 -0
package/dist/runtime-load-balancer.js.map +1 -0
package/dist/runtime-spawn-bridge.js +23 -0
package/dist/runtime-spawn-bridge.js.map +1 -1
package/dist/runtime-spawn-guidance.js +15 -0
package/dist/runtime-spawn-guidance.js.map +1 -1
package/dist/runtime-worker-registry.d.ts +19 -0
package/dist/runtime-worker-registry.js +84 -0
package/dist/runtime-worker-registry.js.map +1 -0
package/dist/security/content-classifier.d.ts +2 -0
package/dist/security/content-classifier.js +147 -0
package/dist/security/content-classifier.js.map +1 -0
package/dist/security/operation-contract-types.d.ts +28 -0
package/dist/security/operation-contract-types.js +2 -0
package/dist/security/operation-contract-types.js.map +1 -0
package/dist/security/operation-contract.d.ts +2 -0
package/dist/security/operation-contract.js +169 -0
package/dist/security/operation-contract.js.map +1 -0
package/dist/security/policy-engine.d.ts +2 -0
package/dist/security/policy-engine.js +142 -0
package/dist/security/policy-engine.js.map +1 -0
package/dist/security/policy-types.d.ts +79 -0
package/dist/security/policy-types.js +7 -0
package/dist/security/policy-types.js.map +1 -0
package/dist/security/prompt-intake.d.ts +13 -0
package/dist/security/prompt-intake.js +33 -0
package/dist/security/prompt-intake.js.map +1 -0
package/dist/security/redaction.d.ts +3 -0
package/dist/security/redaction.js +64 -0
package/dist/security/redaction.js.map +1 -0
package/dist/security/sink-encoding.d.ts +6 -0
package/dist/security/sink-encoding.js +40 -0
package/dist/security/sink-encoding.js.map +1 -0
package/dist/sprint-commands.js +33 -22
package/dist/sprint-commands.js.map +1 -1
package/dist/structured-output-validation.d.ts +9 -0
package/dist/structured-output-validation.js +20 -0
package/dist/structured-output-validation.js.map +1 -0
package/dist/transcription-failures.d.ts +2 -0
package/dist/transcription-failures.js +4 -0
package/dist/transcription-failures.js.map +1 -0
package/dist/transcription-media-preflight.d.ts +9 -0
package/dist/transcription-media-preflight.js +147 -0
package/dist/transcription-media-preflight.js.map +1 -0
package/dist/transcription-request.d.ts +13 -0
package/dist/transcription-request.js +150 -0
package/dist/transcription-request.js.map +1 -0
package/dist/transcription-source-policy.d.ts +4 -0
package/dist/transcription-source-policy.js +43 -0
package/dist/transcription-source-policy.js.map +1 -0
package/dist/transcription-types.d.ts +161 -0
package/dist/transcription-types.js +2 -0
package/dist/transcription-types.js.map +1 -0
package/dist/types/runtime.d.ts +147 -0
package/dist/types.d.ts +3 -1
package/dist/types.js +1 -0
package/dist/types.js.map +1 -1
package/dist/web-api-read-routes.js +2 -0
package/dist/web-api-read-routes.js.map +1 -1
package/dist/web-console/assets/index-BJuVTqfQ.js +11 -0
package/dist/web-console/index.html +1 -1
package/dist/workflow-evidence-service.js +16 -0
package/dist/workflow-evidence-service.js.map +1 -1
package/dist/workflow-phase-planner.js +5 -3
package/dist/workflow-phase-planner.js.map +1 -1
package/dist/workflow-phases.js +5 -0
package/dist/workflow-phases.js.map +1 -1
package/dist/workflow-run-commands.js +89 -10
package/dist/workflow-run-commands.js.map +1 -1
package/dist/workflow-services.d.ts +1 -0
package/dist/workflow-services.js +8 -1
package/dist/workflow-services.js.map +1 -1
package/docs/audio-video-transcription-skill.md +102 -70
package/docs/autonomous-workflow.md +3 -3
package/docs/context-vault.md +34 -11
package/docs/diagrams/deterministic-pipeline/README.md +35 -1
package/docs/evidence-compaction.md +25 -0
package/docs/rule-loading-strategy.md +37 -0
package/docs/runtime-adapters.md +7 -0
package/docs/runtime-capacity.md +57 -0
package/docs/security-saas-orchestrator.md +368 -0
package/docs/sonar-quality-gates.md +1 -1
package/package.json +1 -1
package/rules/development/semantic-code.md +28 -0
package/dist/web-console/assets/index-Bis4CecA.js +0 -11

package/docs/diagrams/deterministic-pipeline/README.md CHANGED Viewed

@@ -17,7 +17,9 @@ diagrams:
    connector endpoints, connector-node overlaps, connector labels covering
    other lines, and unnecessary bends where the path is already straight.
 5. `generateDeterministicDiagram()` ties model, layout, rendering, and
-   validation together for a single pass.
+   validation together for a single pass. It first validates the runtime
+   payload against the structured diagram schema and fails with field-level
+   errors before any layout or SVG rendering occurs.
 6. `runDeterministicDiagramPipeline()` adds bounded iteration. It renders the
    first pass, applies deterministic text-fit repair when possible, regenerates
    the artifact, and retains only the final artifact unless
@@ -28,6 +30,38 @@ this does not widen `command-routes*` or `tool-commands` ownership in the same
 change. Consumers can call `runDeterministicDiagramPipeline()` directly to get a
 stable final SVG plus optional retained iteration artifacts.
+## Structured Output Contracts
+Agents that generate deterministic artifacts must return structured JSON-like
+payloads, not prose instructions for renderers to interpret.
+Diagram payloads must satisfy `DiagramModel`:
+- `id`, `title`, and `direction` are required.
+- `direction` must be `right` or `down`.
+- `nodes` must be a non-empty array. Each node needs `id`, `kind`, and
+  `text.label`; `kind` must be one of `actor`, `system`, `service`, `database`,
+  `queue`, or `boundary`.
+- `connectors` must be an array. Each connector needs `id`, `from`, `to`, and
+  `kind`; `from` and `to` must reference known node ids; `kind` must be one of
+  `sync`, `async`, `data`, or `control`.
+- Optional `groups` define labeled containers. Unused groups are allowed so
+  generated drafts can keep future grouping intent, but duplicate group ids are
+  rejected.
+Report payloads use `ReportDocument` through `renderReportMarkdown()`:
+- `id`, `title`, `summary`, and a non-empty `sections` array are required.
+- Section `kind` must be one of `summary`, `findings`, `decisions`, `risks`,
+  `evidence`, or `nextSteps`.
+- Section items require `id` and `text`; optional severity must be one of
+  `info`, `low`, `medium`, `high`, or `critical`.
+Validation errors include JSON-path-like locations, for example
+`$.connectors[0].to: references unknown node "api-gateway"`. Agent prompts
+should pass those messages back to the generating role unchanged so the payload
+can be corrected without guessing which field failed.
 ## Icon Policy
 Diagram nodes reference icons by semantic purpose and Iconify id. Rendering

package/docs/evidence-compaction.md ADDED Viewed

@@ -0,0 +1,25 @@
+# Evidence Compaction
+Use evidence compaction when a task has enough evidence artifacts to make
+workflow context noisy.
+```sh
+orchestra evidence compact --task GH-471 --threshold 20
+```
+The command writes Markdown and JSON summaries under
+`.agent-workflow/evidence-summaries/`. Raw evidence files are not modified or
+deleted; summary artifacts keep links to every raw evidence artifact.
+Compaction groups evidence by task, role, type, and result. It preserves failed
+evidence, unresolved or residual risk lines, and acceptance criteria references
+found through exact criterion text or `AC-<number>` mentions.
+Task context rendering uses an in-memory evidence summary when the evidence
+count reaches the configured threshold. Override the default threshold with:
+```sh
+orchestra context --task GH-471 --evidence-summary-threshold 10
+```
+For process-wide defaults, set `ORCHESTRA_EVIDENCE_SUMMARY_THRESHOLD`.

package/docs/rule-loading-strategy.md ADDED Viewed

@@ -0,0 +1,37 @@
+# Rule Loading Strategy
+Open Orchestra treats detailed delivery rules as neutral source material that can
+be rendered or referenced by each runtime. Cursor `.mdc` files are supported
+runtime outputs, but they are not the universal source of truth.
+## Source Model
+- Root files such as `AGENTS.md`, `CLAUDE.md`, and `ORCHESTRA.md` stay compact.
+- `src/rule-catalog.ts` owns rule metadata: id, title, canonical path, roles,
+  capabilities, triggers, and risk areas.
+- Detailed rule content lives under `rules/` using the format that best fits the
+  rule. Cursor-specific `.mdc` files remain valid rendered or legacy targets.
+- Runtime context manifests and quality contracts resolve rules by id instead of
+  hardcoding Cursor paths.
+## Runtime Behavior
+For a task or phase, Orchestra selects rules from:
+- active role and required roles;
+- capabilities needed by the work;
+- task title, goal, scope, paths, risks, and acceptance criteria;
+- phase-specific evidence and handoff requirements.
+The selected rules are injected as context references or excerpts for the active
+runtime. A runtime may render the same rule source differently: Codex receives
+compact markdown references, Claude can load markdown files, Cursor can receive
+`.mdc`, and VS Code-style integrations can consume structured JSON.
+## Semantic Code Rule
+Implementation roles should load `semantic-code` when writing or reviewing code,
+automation, scripts, tests, or architecture-sensitive refactors. The rule
+requires code to be readable by intent through domain naming, narrow types,
+focused helpers, and clear boundaries. Comments should explain why, trade-offs,
+or non-obvious constraints, not restate the code.

package/docs/runtime-adapters.md CHANGED Viewed

@@ -519,6 +519,13 @@ workflow after capacity is released. Manual `runtime spawn-request` calls follow
 the same guardrails: `queue` materializes a queued request artifact and session,
 while `reject` fails before creating a delegation artifact.
+Default local runtime-native guardrails allow 3 concurrent delegated sessions
+and 3 spawns per task. The separate SaaS-capacity scheduler defaults to 3 active
+runtime leases, 25 queued requests, 2 active requests per provider, and 3 active
+requests per runtime within the evaluated platform, tenant, and workspace
+policies. Hosted deployments should override those thresholds per tenant and
+workspace before enabling runtime-native dispatch.
 For multi-squad work, the parent renders one spawn request per independent
 squad/role/phase. Each detached session is tracked independently by `sessionId`;
 completion order is intentionally non-deterministic. Release aggregation,

package/docs/runtime-capacity.md ADDED Viewed

@@ -0,0 +1,57 @@
+# Runtime Capacity Model
+Open Orchestra runtime capacity is modeled as a deterministic local-first
+contract. Local runs use the implicit `local/local/local-workspace` scope; SaaS
+mode requires every request, queue item, lease, event, and snapshot to carry
+tenant and workspace scope.
+## Core Concepts
+- `RuntimeCapacityScope`: platform, tenant, and workspace identity.
+- `RuntimeWorkloadClass`: interactive, workflow phase, runtime-native spawn,
+  provider-backed phase, background maintenance, or evidence processing.
+- `RuntimeCapacityUnit`: weighted runtime demand, currently enforced by
+  `concurrencyUnits` with optional budget and resource hints.
+- `RuntimeQuotaPolicy`: platform, tenant, and workspace active/queued limits,
+  provider/runtime caps, and queue/reject behavior.
+- `RuntimeWorkerRecord`: registered worker capabilities, tenant affinity,
+  regions, supported providers/runtimes/workload classes, health, capacity, and
+  isolation metadata.
+## Scheduler Decisions
+`RuntimeCapacityScheduler.schedule()` returns one typed decision:
+- `admitted`: a `RuntimeLease` was granted for a specific worker.
+- `queued`: quota or worker capacity can recover and the request is accepted
+  into a scoped queue.
+- `rejected`: the request is invalid or the configured policy does not allow
+  queueing.
+- `deferred`: no eligible worker is available and queueing is disabled.
+Evaluation is fail-closed: request validation, SaaS scope, platform quota,
+tenant quota, workspace quota, provider/runtime caps, then worker selection.
+Queue limits are enforced at platform, tenant, and workspace levels before a
+queue decision is returned.
+## Load Balancing
+Worker selection is constraint-first and score-second. Eligibility checks tenant
+affinity, denied tenants, workload class, runtime/provider support, region and
+data residency, health, heartbeat freshness, open circuits, and available
+capacity. Scoring is deterministic: available capacity, queue depth, failure
+count, region preference, health, and tenant affinity are sorted with worker id
+as the final tie breaker.
+## Isolation
+SaaS mode rejects missing tenant/workspace scope. Snapshots can be filtered by
+scope so tenant-facing queue evidence does not expose other tenants. Decision
+messages use stable reason codes and user-safe summaries rather than worker
+internals, queue depths from other tenants, paths, or provider details.
+## Current Boundary
+This story intentionally keeps capacity state in memory. Hosted queues,
+transactional worker leases, tenant-secret routing, and data residency
+persistence remain follow-up architecture and security work before SaaS release.

package/docs/security-saas-orchestrator.md ADDED Viewed

@@ -0,0 +1,368 @@
+# SaaS And Orchestrator Security Definition
+## Purpose
+Open Orchestra is local-first workflow orchestration for humans and agent
+runtimes. The CLI owns the current source of truth in `.agent-workflow/`; the
+web console, runtime adapters, provider-backed phases, tracker integrations, and
+future SaaS surfaces must preserve that local trust model instead of turning
+agent automation into an implicit privileged service.
+This document defines the baseline security model for the local CLI and the
+future SaaS orchestrator. It intentionally avoids secrets, tenant identifiers,
+private hosts, and production endpoints.
+## Security Objectives
+- Keep local repositories, workflow state, secrets, and evidence under explicit
+  user or tenant control.
+- Treat prompts, issues, comments, model output, uploaded artifacts, generated
+  plans, tool metadata, and runtime handoffs as untrusted input.
+- Fail closed for cross-tenant access, secret exposure, unsafe writes, shell
+  execution, provider policy violations, and evidence integrity failures.
+- Make every privileged action reviewable through role gates, policy decisions,
+  and evidence records.
+- Support offline local development without weakening the SaaS security posture.
+## System View
+```mermaid
+flowchart LR
+  human["Human operator"]
+  cli["Local CLI"]
+  web["Local web console"]
+  api["Future SaaS API"]
+  workflow["Workflow core"]
+  state[".agent-workflow state"]
+  workers["SaaS workers"]
+  runtimes["Agent runtimes"]
+  tools["MCP and local tools"]
+  providers["Model providers"]
+  trackers["GitHub, Sonar, trackers"]
+  storage["Tenant storage and evidence ledger"]
+  human --> cli
+  human --> web
+  web --> cli
+  cli --> workflow
+  workflow --> state
+  workflow --> runtimes
+  workflow --> tools
+  workflow --> providers
+  workflow --> trackers
+  api --> workers
+  workers --> storage
+  workers --> providers
+  workers --> trackers
+  workers --> runtimes
+```
+The local CLI remains the default control plane. SaaS components may coordinate,
+store sanitized workflow metadata, and run isolated workers, but they must not
+receive raw secrets, raw repository contents, or direct runtime authority unless
+tenant policy and role gates explicitly allow it.
+## Trust Boundaries
+- Human to CLI: trust the installed CLI binary, local config, and explicit
+  flags. Treat terminal input, pasted prompts, and shell environment as
+  untrusted. Require argument validation, safe defaults, no secret echo, and
+  confirmation before writes outside known workflow paths.
+- CLI to workspace: trust the workspace root and allowlisted
+  `.agent-workflow/` paths. Treat user files, symlinks, generated paths, and
+  imported archives as untrusted. Require root containment, path traversal
+  rejection, symlink escape checks, and dry-run before broad writes.
+- Local web console to CLI/API: trust the loopback-only local service and
+  command contracts. Treat browser input, request bodies, and local plugins as
+  untrusted. Require CSRF-aware mutations, strict JSON validation, no arbitrary
+  command endpoint, and sanitized errors.
+- SaaS API to tenant workers: trust authenticated tenant context and policy.
+  Treat requests, uploaded artifacts, and webhook payloads as untrusted.
+  Require AuthN/AuthZ, tenant scoping, schema validation, rate limits, audit
+  logs, malware scanning, and secret scanning.
+- Tenant to tenant: trust only the current tenant partition. Treat other
+  tenants, shared queues, and shared caches as untrusted. Require mandatory
+  tenant id in every data access path, row or storage isolation, cache key
+  partitioning, and per-tenant encryption context.
+- Workflow core to runtimes: trust the rendered task packet and allowed
+  commands. Treat runtime instructions, child agent output, and handoff files as
+  untrusted. Require prompt-injection checks, ownership path limits, lifecycle
+  attestation, and no provider keys in packets.
+- Runtime to tools/MCP: trust tool registry metadata and approved scopes. Treat
+  tool descriptions, tool results, and external MCP servers as untrusted.
+  Require tool identity pinning, capability allowlists, OAuth token isolation,
+  and output sanitization.
+- Provider-backed phases: trust provider adapter policy and redacted context.
+  Treat model outputs and provider errors as untrusted. Require explicit opt-in,
+  tenant data policy, prompt and output filtering, cost limits, budget limits,
+  and redacted error handling.
+- Trackers and scanners: trust stable adapter contracts. Treat issues,
+  comments, scan reports, and CI logs as untrusted. Require remote text to be
+  handled as data, redact secrets, and verify webhook signatures when
+  applicable.
+- Evidence ledger: trust append-only local or tenant evidence records. Treat
+  generated evidence, command logs, screenshots, and runtime claims as
+  untrusted. Require hashing, provenance, immutable event ids, reviewer
+  sign-off, and tamper-evident summaries.
+- Storage and backups: trust tenant storage service and KMS policy. Treat
+  object keys, retained artifacts, and backup restore paths as untrusted.
+  Require encryption at rest, retention policy, restore testing, access logs,
+  and delete workflows.
+## Threat Model
+- Prompt injection: untrusted text asks an agent to ignore gates, reveal
+  secrets, or mutate files. Treat instructions from issues, docs, comments,
+  tools, and model output as data; enforce system policy outside the prompt.
+- Indirect prompt injection: a retrieved artifact hides malicious instructions
+  in evidence, PDFs, websites, or tool results. Scan and label context sources,
+  then strip or quarantine high-risk instruction patterns before runtime
+  packets.
+- SQL or NoSQL injection: tenant filters or search queries alter data access.
+  Use parameterized queries, typed repositories, schema validation, and tenant
+  predicates applied server-side.
+- Command injection: user or model text reaches a shell command. Use
+  `execFile` or `spawn` with argument arrays; block shell interpolation and
+  `shell: true` unless a reviewed exception exists.
+- SSRF: SaaS workers fetch attacker-controlled internal URLs. Allow only
+  `https://` URLs, deny private and metadata address ranges, use egress policy,
+  and avoid server-side fetches without approval.
+- Path traversal: generated paths escape the workspace or tenant storage
+  prefix. Resolve canonical paths, reject `..` and symlink escapes, and require
+  approved roots for secure files.
+- Secrets exfiltration: tokens appear in prompts, evidence, logs, provider
+  errors, or artifacts. Load secrets from secret managers or approved local
+  files, redact before persistence, and never send secrets to model context.
+- Tenant isolation failure: a request, cache, worker, or artifact crosses tenant
+  scope. Require tenant-scoped auth, data access, queue names, cache keys,
+  storage prefixes, audit events, and encryption context.
+- Unsafe file writes: runtime or SaaS worker writes outside intended docs,
+  workflow, or output paths. Require ownership paths, dry-run previews for broad
+  changes, path policy checks, and user approval for sensitive writes.
+- Tool impersonation: a malicious tool mimics a trusted MCP server, scanner, or
+  runtime adapter. Pin tool identity, origin, executable path, version, and
+  capability manifest; reject writable PATH tool discovery for sensitive tools.
+- Evidence tampering: a runtime edits evidence or claims tests passed without
+  proof. Require append-only evidence events, command metadata, hashes for large
+  artifacts, reviewer gates, and mismatch detection.
+- Cross-site request forgery: a browser triggers local web console mutations.
+  Keep local services loopback-only by default, require mutation tokens or
+  same-origin controls, and avoid ambient credentials.
+- Dependency compromise: a package or binary changes behavior after install.
+  Pin lockfiles, scan dependencies, verify sensitive binaries from trusted
+  paths, and keep dependency updates atomic.
+- Denial of wallet or quota: provider-backed phases consume unexpected tokens,
+  jobs, or storage. Enforce per-task and per-tenant budgets, rate limits,
+  cancellation, cost evidence, and fail-closed budget handling.
+## Secure-By-Default Controls
+### Deterministic Policy Engine
+The orchestration policy engine is a typed domain boundary, not prompt text.
+It should expose one deterministic decision contract used by CLI commands,
+runtime packet rendering, provider requests, tool calls, evidence writes,
+tracker/webhook adapters, and future SaaS workers. Every sensitive operation
+must pass a complete policy subject, action, resource, tenant/workspace scope,
+data classification, and sink before side effects begin. Missing, ambiguous, or
+schema-invalid input denies by default and records a sanitized reason.
+Recommended module boundaries:
+- `src/security/policy-types.ts`: discriminated unions for policy subjects,
+  actions, resources, sinks, decisions, redaction status, and denial reasons.
+- `src/security/policy-engine.ts`: pure decision engine and rule registry. It
+  performs no filesystem, network, shell, provider, or persistence I/O.
+- `src/security/prompt-intake.ts`: deserializes prompt/runtime packets into
+  typed segments and classifies each segment as instruction, data, tool input,
+  tool output, evidence, provider response, or unknown.
+- `src/security/content-classifier.ts`: deterministic detectors for
+  query-like and executable-like strings, prompt-injection patterns, path
+  traversal, SSRF candidates, shell metacharacters, SQL/NoSQL-like payloads,
+  and secret-shaped values.
+- `src/security/redaction.ts`: redacts restricted values, marks quarantined
+  segments, and returns a redaction report before persistence or model reuse.
+- `src/security/sink-encoding.ts`: sink-specific escaping and encoding for
+  Markdown, JSON, shell arguments, URLs, HTML/text UI, logs, evidence, and
+  provider messages.
+- `src/security/path-policy.ts`, `url-policy.ts`, `command-policy.ts`,
+  `tenant-policy.ts`, `tool-policy.ts`, `provider-policy.ts`,
+  `evidence-policy.ts`, and `runtime-packet-policy.ts`: focused rule modules
+  plugged into the pure engine.
+- Existing adapters such as CLI commands, runtime renderers, provider
+  adapters, MCP/tool adapters, and workflow evidence services stay thin: build
+  typed policy requests, call the engine, then execute or fail closed.
+Prompt/content intake pipeline:
+1. Deserialize all prompt packets, context packs, tool results, provider
+   responses, handoffs, issue text, and evidence snippets with strict schemas.
+   Unknown fields and malformed envelopes become `unknown` segments and are not
+   forwarded to sensitive sinks.
+2. Split content into typed segments with provenance, tenant/workspace/task
+   scope, source artifact, declared sink, and original byte length.
+3. Detect query-like strings (`SELECT`, GraphQL-like bodies, JSON filters,
+   search expressions), executable-like strings (shell fragments, command
+   substitutions, shebangs, PowerShell, SQL/NoSQL mutation verbs), and
+   instruction-like text asking agents to ignore policy or reveal secrets.
+4. Classify each segment as data, instruction, tool input, tool output,
+   evidence, provider response, or unknown. Remote text is data by default;
+   only trusted system-authored templates may become instruction segments.
+5. Redact restricted values before persistence, provider calls, logs, telemetry,
+   and evidence summaries. Quarantine segments when redaction confidence is
+   low, executable intent appears in a data segment, or the destination sink
+   cannot safely encode it.
+6. Encode for the exact sink immediately before use: argument arrays for
+   commands, canonicalized `https://` URLs for fetches, JSON string escaping
+   for packets, Markdown escaping for handoffs, `textContent`/HTML escaping for
+   UI, and provider-message wrapping that labels untrusted text as data.
+Policy decisions should be append-only evidence inputs with request id, task id,
+actor, action, resource summary, decision (`allow`, `deny`, `requiresApproval`,
+`quarantine`), matched rule ids, redaction status, and sanitized reasons. They
+must not include raw secrets, full prompt bodies, bearer headers, or internal
+stack traces.
+### Local CLI
+- Default to local-only operation; network calls require an explicit command,
+  configured adapter, or CI-owned workflow.
+- Keep `.agent-workflow/` as the auditable source of task, decision, evidence,
+  review, and release state.
+- Validate workspace roots before writes and reject unsafe roots without
+  explicit confirmation.
+- Use typed command contracts and JSON schemas for automation surfaces.
+- Never log secret values, bearer headers, raw provider errors, or raw stack
+  traces in user-facing output.
+- Preserve dry-run or evaluate modes for commands that alter config, tokens,
+  runtime adapters, generated files, or tracker state.
+### Future SaaS API And Web Console
+- Require tenant-authenticated sessions for every SaaS API request.
+- Enforce authorization server-side; UI role visibility is not authorization.
+- Bind every job, artifact, cache entry, evidence event, and storage object to a
+  tenant and workspace.
+- Validate request bodies with narrow schemas and reject unknown mutation fields.
+- Use short-lived worker credentials and scoped service identities.
+- Store only sanitized workflow metadata unless the tenant explicitly enables
+  managed artifact storage.
+- Apply tenant retention, deletion, export, and audit policies to every stored
+  artifact.
+### Workers, Runtimes, And Providers
+- Run workers with least privilege, no shared mutable workspace, and no default
+  access to tenant secrets.
+- Require explicit provider opt-in before direct model API calls.
+- Keep runtime-native delegation packets free of provider credentials and raw
+  secret material.
+- Pass bounded, redacted context packets to model providers.
+- Enforce allowed commands, ownership paths, and lifecycle recording for child
+  runtime work.
+- Treat model output as suggestions until validated by code review, tests, and
+  role gates.
+### Tools, MCP, And External Integrations
+- Require `https://` for remote MCP and integration endpoints.
+- Store OAuth and integration tokens only in tenant secret stores or approved
+  local secret paths; never in prompt, evidence, or generated runtime files.
+- Pin sensitive executable discovery to trusted paths and reject tools resolved
+  from user-writable PATH entries.
+- Redact tool results before persistence or model reuse.
+- Verify webhook signatures and replay windows before accepting remote events.
+- Keep scanner and tracker adapters narrow: one adapter owns I/O, policy checks,
+  retries, and sanitized errors for each integration.
+### Evidence Integrity
+- Record command evidence with command name, exit status, summary, and relevant
+  artifact paths, not raw secrets or full logs by default.
+- Use append-only event records for workflow lifecycle, runtime spawn state,
+  reviews, and evidence.
+- Hash large evidence artifacts and generated reports when they become release
+  inputs.
+- Require QA and Architect review when evidence does not map to acceptance
+  criteria or when technical contracts changed.
+- Preserve failed evidence and unresolved risk instead of overwriting it with a
+  later passing summary.
+## Role Gates
+- Product readiness: Product Owner and Analyst block when acceptance criteria,
+  non-goals, priority, or tenant impact are missing.
+- Architecture readiness: Architect and Security block when boundaries, data
+  flow, provider policy, storage ownership, or failure modes are unclear.
+- Security review: Security and Compliance/Privacy block sensitive work when
+  auth, secrets, PII, file paths, shell execution, network calls, dependencies,
+  TLS, cookies, sessions, CORS, webhooks, tenant isolation, or infrastructure
+  are touched without controls.
+- Implementation handoff: Developer and Tech Lead block when tests, typed
+  contracts, ownership paths, or migration/rollback notes are missing.
+- QA evidence: QA and Analyst block when evidence does not prove acceptance
+  criteria, edge cases, regression areas, or security controls.
+- Operational readiness: SRE, DevOps, and Release Manager block when
+  monitoring, alerting, rate limits, budgets, rollout, rollback, or incident
+  owner is missing for SaaS behavior.
+- Data readiness: DBA and Data Engineer/Analyst block when indexes, migrations,
+  retention, lineage, or tenant query patterns are not defined.
+- Release go/no-go: Product Owner, Release Manager, and Security block when
+  residual risk remains unresolved and is not explicitly risk-accepted.
+Security-sensitive tasks must include a threat model note, impacted boundaries,
+controls, validation evidence, residual risks, and a reviewer outcome before
+release.
+## Data Classification
+- Public: published docs and public command manifests. These may be indexed and
+  sent to providers when policy allows.
+- Internal: workflow metadata, task summaries, and sanitized evidence. These are
+  tenant/workspace scoped and redacted before external provider use unless
+  policy allows broader handling.
+- Confidential: private repo content, issue context, generated handoffs, and
+  logs. These are local-only or redacted-external by default; retention and
+  audit are required.
+- Restricted: secrets, tokens, credentials, regulated PII, and signing material.
+  These are never sent to prompts or persisted in evidence; use a secret
+  manager, tokenization, or approved local secure files.
+## Backlog Candidates
+1. Policy engine for tenant data classification, provider routing, network
+   access, tool capabilities, and fail-open/fail-closed behavior.
+2. Prompt-injection scanner for issues, comments, artifacts, tool outputs,
+   evidence, model responses, and context packs.
+3. Tenant isolation test suite covering SaaS API, workers, queues, caches,
+   evidence, object storage, and backup restore paths.
+4. Evidence integrity ledger with append-only events, artifact hashing,
+   reviewer attestations, and tamper detection.
+5. SSRF and URL validation library shared by SaaS workers, web console, MCP
+   proxy, and tracker adapters.
+6. Tool identity registry for MCP servers, local binaries, runtime adapters,
+   allowed commands, versions, and trusted executable paths.
+7. Secret redaction pipeline for runtime packets, provider errors, evidence,
+   logs, telemetry, imported artifacts, and generated summaries.
+8. SaaS audit log schema with tenant id, actor, action, target, policy decision,
+   evidence id, request id, and redaction status.
+9. Worker sandbox profile with filesystem, network, process, timeout, memory,
+   and budget limits.
+10. Release gate automation that blocks security-sensitive SaaS changes without
+    Security, QA, SRE, and Compliance/Privacy evidence.
+11. Tenant retention and deletion workflows with export, legal hold, backup
+    tombstone, and restore verification.
+12. Dependency and binary provenance checks for scanner tools, MCP proxies,
+    release automation, and runtime bridge helpers.
+## Validation Expectations
+- Documentation-only changes should run lightweight text checks and the
+  Orchestra evidence/review workflow.
+- Security-sensitive code changes should run format, lint, typecheck, unit
+  tests, secret scan, security audit, and targeted E2E or contract tests.
+- SaaS implementation stories should add tests for tenant isolation, prompt
+  injection handling, URL validation, path traversal, unsafe writes, command
+  execution, secret redaction, and evidence tampering.
+- Release evidence must name the acceptance criteria it proves or explicitly
+  record the deferred owner and rationale.

package/docs/sonar-quality-gates.md CHANGED Viewed

@@ -416,7 +416,7 @@ rules, and Orchestra review gates.
 Until Sonar directives are adopted, architecture violations are enforced through:
-- repo standards in `AGENTS.md` and `rules/*.mdc`;
+- repo standards in `AGENTS.md` and neutral rule sources selected by Orchestra;
 - architecture gate decisions and ADR-style records;
 - code review against domain boundaries;
 - tests that protect command contracts, workflow behavior, and generated

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@jterrats/open-orchestra",
-  "version": "1.0.15",
+  "version": "1.0.17",
   "type": "module",
   "workspaces": [
     "extensions/vscode-open-orchestra",

package/rules/development/semantic-code.md ADDED Viewed

@@ -0,0 +1,28 @@
+# Semantic Code
+Code must be readable by intent before it is explained by comments.
+## Naming
+- Use domain language for modules, functions, variables, types, and test names.
+- Prefer names that reveal purpose and observable behavior, such as `validateReleaseGateEvidence`, not vague names such as `processData`.
+- Boolean names must make the predicate clear: `isReady`, `hasEvidence`, `canRetry`, `shouldBlockRelease`.
+## Structure
+- Keep entry points thin. Move decisions and business rules into focused domain, service, or policy modules.
+- Extract helpers when a reader needs comments to understand a block of code.
+- Avoid generic containers in public APIs when narrow types or explicit models can describe the contract.
+- Avoid hardcoded command lists, statuses, roles, labels, or fixture values when a typed registry or catalog can be the source of truth.
+## Comments
+- Comments explain why, trade-offs, invariants, or external constraints.
+- Do not add comments that restate what the code already says.
+- If a function needs line-by-line comments to be understandable, refactor the names, types, or helper boundaries.
+## Review Checklist
+- A reviewer can identify the domain intent from names and file boundaries without tracing every line.
+- New code follows the existing project vocabulary and layering.
+- Tests read like behavior specifications and use meaningful scenario names.