RubyGems - phronomy - Versions diffs - 0.7.0 → 0.7.1 - Mend

phronomy 0.7.0 → 0.7.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (70) hide show

checksums.yaml +4 -4
data/.mutant.yml +8 -7
data/CHANGELOG.md +151 -1
data/README.md +155 -32
data/Rakefile +33 -0
data/benchmark/baseline.json +1 -1
data/benchmark/bench_regression.rb +1 -0
data/docs/decisions/004-invoke-timeout-is-not-cancellation.md +24 -0
data/docs/decisions/006-no-built-in-guardrails.md +20 -2
data/docs/decisions/010-cooperative-first-concurrency.md +248 -0
data/lib/phronomy/agent/base.rb +250 -65
data/lib/phronomy/agent/concerns/suspendable.rb +15 -0
data/lib/phronomy/agent/fsm.rb +41 -64
data/lib/phronomy/agent/orchestrator.rb +146 -121
data/lib/phronomy/agent/parallel_tool_chat.rb +79 -22
data/lib/phronomy/agent/react_agent.rb +8 -0
data/lib/phronomy/async_queue.rb +155 -0
data/lib/phronomy/blocking_adapter_pool.rb +435 -0
data/lib/phronomy/cancellation_scope.rb +123 -0
data/lib/phronomy/cancellation_token.rb +43 -2
data/lib/phronomy/concurrency_gate.rb +155 -0
data/lib/phronomy/configuration.rb +142 -0
data/lib/phronomy/deadline.rb +63 -0
data/lib/phronomy/diagnostics.rb +62 -0
data/lib/phronomy/embeddings/base.rb +17 -0
data/lib/phronomy/eval/runner.rb +9 -9
data/lib/phronomy/event_loop.rb +181 -43
data/lib/phronomy/fsm_session.rb +50 -4
data/lib/phronomy/guardrail/prompt_injection_guardrail.rb +58 -0
data/lib/phronomy/invocation_context.rb +152 -0
data/lib/phronomy/knowledge_source/base.rb +18 -0
data/lib/phronomy/llm_adapter/base.rb +104 -0
data/lib/phronomy/llm_adapter/ruby_llm.rb +41 -0
data/lib/phronomy/llm_adapter.rb +20 -0
data/lib/phronomy/metrics.rb +38 -0
data/lib/phronomy/runtime/deterministic_scheduler.rb +412 -0
data/lib/phronomy/runtime/fake_scheduler.rb +165 -0
data/lib/phronomy/runtime/gate_registry.rb +52 -0
data/lib/phronomy/runtime/pool_registry.rb +57 -0
data/lib/phronomy/runtime/runtime_metrics.rb +117 -0
data/lib/phronomy/runtime/scheduler.rb +98 -0
data/lib/phronomy/runtime/scheduler_timer_adapter.rb +79 -0
data/lib/phronomy/runtime/task_registry.rb +48 -0
data/lib/phronomy/runtime/thread_scheduler.rb +30 -0
data/lib/phronomy/runtime/timer_queue.rb +106 -0
data/lib/phronomy/runtime/timer_service.rb +42 -0
data/lib/phronomy/runtime.rb +374 -0
data/lib/phronomy/task/backend.rb +80 -0
data/lib/phronomy/task/fiber_backend.rb +157 -0
data/lib/phronomy/task/immediate_backend.rb +89 -0
data/lib/phronomy/task/thread_backend.rb +84 -0
data/lib/phronomy/task.rb +275 -0
data/lib/phronomy/task_group.rb +265 -0
data/lib/phronomy/testing/fake_clock.rb +109 -0
data/lib/phronomy/testing/fake_scheduler.rb +104 -0
data/lib/phronomy/testing/scheduler_helpers.rb +59 -0
data/lib/phronomy/testing.rb +12 -0
data/lib/phronomy/tool/base.rb +110 -2
data/lib/phronomy/tool/mcp_tool.rb +47 -16
data/lib/phronomy/tool/scope_policy.rb +50 -0
data/lib/phronomy/tool_executor.rb +106 -0
data/lib/phronomy/tracing/open_telemetry_tracer.rb +34 -0
data/lib/phronomy/vector_store/async_backend.rb +110 -0
data/lib/phronomy/vector_store/base.rb +7 -0
data/lib/phronomy/version.rb +1 -1
data/lib/phronomy/workflow.rb +52 -5
data/lib/phronomy/workflow_context.rb +29 -2
data/lib/phronomy/workflow_runner.rb +74 -3
data/lib/phronomy.rb +42 -0
metadata +40 -2

checksums.yaml CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: fbca82a7a23706719deda2e827af5a9b342c9b388d700929ca9eca19a531a2c9
-  data.tar.gz: b9727e2010acefc14738dbd5b71b5ea06b10f5ebd994a858dfbf1133e15ed003
+  metadata.gz: d9ae370d656048e38f700b6bced931fe249f731cea819ab94691eb4bcf6ef43c
+  data.tar.gz: 97d01ca3475f547a41397d1dad2ddb8ccaa10f6466d5a75c3f79e6875a7af0c6
 SHA512:
-  metadata.gz: c33ee2c26a4b6e3d0470f4d95e30b04e9bc228cc87bdd807db1569c707888817f2d904f085822f91e94441e5c21e95ab348f0d7bc56b1ef87c72770ed4976e1d
-  data.tar.gz: 706148e7047ab570ca7d69f735f5767c2a983cf35312a70732723595ea80ed3b5f3efaaddd59d393f5680ec91c36c6ca3e01a70dcb671faa940ae9211df59b5a
+  metadata.gz: d3ab9ebd145e1ed706ad1741a2e3184c412aa8fd0eac32c95eb0b4a1ef87af38ae73eb5b4205b7f2894dd228929130c9a7569d24a1d7a571a5aa3ec5a68a4172
+  data.tar.gz: efa88afdbaa2f3d8fc38ee7cbc7044711479490546a888d44540f3b6bae6da60a3a3e64cfbbef455d65f78bab64dd9a68056e4c9f7ac7a360d512179364c8b23

data/.mutant.yml CHANGED Viewed

@@ -12,10 +12,11 @@ includes:
 requires:
   - phronomy
-subjects:
-  - Phronomy::WorkflowContext
-  - Phronomy::WorkflowRunner
-  - Phronomy::Tool::Base
-  - Phronomy::Context::TokenBudget
-  - Phronomy::Context::TokenEstimator
-  - Phronomy::VectorStore::InMemory
+matcher:
+  subjects:
+    - Phronomy::WorkflowContext
+    - Phronomy::WorkflowRunner
+    - Phronomy::Tool::Base
+    - Phronomy::Context::TokenBudget
+    - Phronomy::Context::TokenEstimator
+    - Phronomy::VectorStore::InMemory

data/CHANGELOG.md CHANGED Viewed

@@ -11,6 +11,104 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 ### Added
+- **`Phronomy::Diagnostics` and `SchedulerReentrancyError`** (#278, #279):
+  `Phronomy::Diagnostics` exposes a snapshot of current scheduler state
+  (`pending_count`, `active_tasks`, `pool_utilization`, etc.) for debugging and
+  monitoring. `SchedulerReentrancyError` is raised when a scheduler operation is
+  attempted from within a scheduler callback, preventing deadlocks.
+  `Phronomy.configure { |c| c.scheduler_debug = true }` enables verbose scheduler
+  logging.
+- **`task_id` / `parent_task_id` on `InvocationContext`** (#277):
+  Every task spawned via `Task.spawn` now carries a `task_id` (a random UUID) and
+  an optional `parent_task_id`. These fields enable hierarchical task-tree tracing
+  and are forwarded automatically by `TaskGroup`.
+- **`Phronomy::Metrics` — task-centric observability snapshot** (#276):
+  `Phronomy::Metrics.snapshot` returns a hash with scheduler statistics:
+  `tasks_started`, `tasks_completed`, `tasks_failed`, `pool_queue_depth`, and
+  `pool_active_threads`. Intended for metrics export and health-check endpoints.
+- **`Phronomy::Testing::FakeClock` and `FakeScheduler`** (#273):
+  Two test helpers for deterministic concurrency testing.
+  `FakeClock` exposes `advance(seconds)` to control the passage of time without
+  sleeping. `FakeScheduler` replaces the real scheduler in specs, providing
+  synchronous execution and `flush` / `drain` helpers to drive task completion.
+- **`ScopePolicy` and approval gate integration** (#270):
+  `Phronomy::Tool::ScopePolicy` is a callable that maps `(tool_class, scope, agent)`
+  to `:allow`, `:approve`, or `:reject`. The default policy (`ScopePolicy::DEFAULT`)
+  automatically routes tools declaring high-risk scopes (`:write`, `:admin`,
+  `:external_network`, `:filesystem`, `:process`, `:external_process`) through the
+  existing approval gate; tools with `scope :read_only` or no scope are allowed
+  unconditionally. Per-agent policy overrides are available via
+  `agent.scope_policy = my_policy`.
+  **Behaviour change**: tools with the above scopes that previously executed without
+  an approval handler will now be **rejected** unless an approval handler is
+  registered or the agent uses a custom permissive policy.
+- **`PromptInjectionGuardrail`, `Tool::Base#redact_params`, and `#max_result_size`** (#271):
+  `Phronomy::Guardrail::PromptInjectionGuardrail` is a built-in `InputGuardrail`
+  subclass that detects prompt-injection patterns in user input.
+  `Tool::Base.redact_params(*names)` marks parameter names as sensitive; their
+  values are replaced with `"[REDACTED]"` in log and trace output.
+  `Tool::Base.max_result_size(n)` sets a per-tool character limit; results
+  exceeding the limit are truncated and a warning is logged. The global fallback is
+  `Phronomy.configure { |c| c.tool_result_max_size = n }` (default: no limit).
+- **`execution_mode` DSL on `Tool::Base`** (#263):
+  `Tool::Base.execution_mode` accepts `:cooperative`, `:blocking_io` (default),
+  `:cpu_bound`, or `:external_process`. Tools marked `:blocking_io` (the default)
+  are dispatched through `BlockingAdapterPool` when a `Runtime` is available,
+  keeping the scheduler thread unblocked. Tools marked `:cooperative` are called
+  directly on the scheduler thread (suitable for pure in-memory operations).
+- **`invoke_async` and `call_async` — async entry points** (#262):
+  `Agent::Base#invoke_async(input, **opts)` returns a `Phronomy::Task` wrapping
+  `#invoke`. `Workflow#invoke_async(input, config:)` does the same for workflows.
+  `Tool::Base#call_async(args, cancellation_token:)` returns a `Task` wrapping
+  `#call`. All three are backward-compatible with existing synchronous callers.
+- **`LLMAdapter` abstraction** (#266):
+  `Phronomy::LLMAdapter::Base` decouples the agent pipeline from RubyLLM.
+  `Phronomy::LLMAdapter::RubyLLM` (registered by default) wraps the existing
+  integration. Custom adapters can be registered via
+  `Phronomy.configure { |c| c.llm_adapter = MyAdapter }` for testing or
+  alternative LLM backends.
+- **`BlockingAdapterPool` backpressure limits** (#268):
+  `BlockingAdapterPool` now enforces configurable `pool_size` (default: 10) and
+  `queue_size` (default: 100) limits. Tasks submitted when the queue is full raise
+  `Phronomy::BackpressureError` immediately instead of growing the queue without
+  bound.
+- **Cooperative scheduler fairness** (#269):
+  The scheduler measures per-task lag and emits starvation and dispatch warnings
+  via `Phronomy.configuration.logger` when tasks wait longer than configured
+  thresholds. Configurable via `scheduler_starvation_warn_ms` and
+  `scheduler_dispatch_warn_ms`.
+- **Workflow entry actions awaitable with Task** (#264):
+  Entry action lambdas may now return a `Phronomy::Task`. The FSMSession awaits
+  the task on a background thread and posts `:action_completed` (with the resulting
+  `WorkflowContext`) or `:state_completed` back to the EventLoop without blocking
+  it. Backward-compatible: lambdas that return a `WorkflowContext` or `nil`
+  continue to work as before.
+- **`Task`, `TaskGroup`, `AsyncQueue`, `Deadline`, `InvocationContext`, `Runtime` concurrency abstractions** (#255):
+  Six new concurrency primitives form the foundation of the async execution layer.
+  `Task` wraps a callable with cancellation, timeout (`Deadline`), and context
+  propagation (`InvocationContext`). `TaskGroup` runs tasks concurrently and waits
+  for all to finish (or the first failure). `AsyncQueue` is a bounded, cancellable
+  queue. `Runtime` is the top-level façade that resolves a `BlockingAdapterPool`
+  and provides `blocking_io { }` and `cpu_bound { }` dispatch helpers.
+- **`BlockingAdapterPool`** (#256):
+  A bounded thread pool that isolates blocking I/O (LLM calls, database queries,
+  HTTP requests) from the cooperative scheduler thread. Default pool size is 10
+  threads with a queue depth of 100. Replaces direct `Thread.new` calls in core
+  agent and tool paths.
 - **`VectorStore#size` — document count for all backends, contract coverage for RedisSearch and Pgvector** (#240):
   `VectorStore::Base` gains `#size` as an abstract method; `InMemory`, `RedisSearch`,
   and `Pgvector` all implement it. `RedisSearch#size` queries `FT.INFO num_docs`;
@@ -128,9 +226,52 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
   `dispatch_parallel` and `fan_out` accept `cancellation_token:` and automatically
   inject it into every worker task's config unless the task already supplies its own.
+### Removed
+- **BREAKING: `Agent::Base#run_as_child` drops `&result_writer` block parameter** (#265):
+  The optional block form `run_as_child(input, ctx: ctx) { |r| ctx.answer = r[:output] }`
+  is no longer supported. The result is now delivered **exclusively** as the
+  `:child_completed` event payload `{ output:, messages:, usage: }`. The parent
+  Workflow task is the sole owner of the `WorkflowContext`; no background thread
+  writes to it directly. Callers that were using the block to write back into the
+  context must update their workflow design (e.g. read the result in the target
+  state's entry action after the transition, or store output through an external
+  shared resource if needed).
+- **BREAKING (internal): `AgentFSM#initialize` drops `result_writer:` keyword** (#265):
+  Direct callers of `AgentFSM.new(result_writer: ...)` must remove that keyword.
+  This class is considered internal; gem consumers should use `run_as_child` instead.
 ### Changed
-- **`CancellationToken` checked at granular checkpoints** (#223):
+- **`AgentFSM`, `ParallelToolChat`, and `Orchestrator` use `Task`/`TaskGroup` instead of bare `Thread.new`** (#257, #258, #259):
+  All three components now spawn async work through the `Task` and `TaskGroup`
+  abstractions. This enables cancellation propagation, context threading, and
+  `BlockingAdapterPool` routing. No public API changes; behaviour is equivalent.
+- **`Thread.current[:phronomy_*]` context propagation replaced with explicit `InvocationContext`** (#260):
+  Thread-local keys `phronomy_event_loop_thread`, `phronomy_cancellation_token`,
+  and `phronomy_context_version_caches` are no longer used as the primary
+  propagation channel. `InvocationContext` is threaded explicitly through call
+  stacks. Importantly, `Tool::Base#call` no longer falls back to
+  `Thread.current[:phronomy_cancellation_token]`; cancellation is only observed
+  when the caller passes `cancellation_token:` explicitly (or when
+  `ParallelToolChat` injects it). Tools that relied on the thread-local fallback
+  must be updated.
+- **`Timeout.timeout` removed from core paths; replaced with `CancellationScope`** (#261):
+  `Agent::Base#invoke` and `McpTool::StdioTransport` no longer use `Timeout.timeout`
+  (which is unsafe with `Thread.new` and `ensure` blocks). A `CancellationScope`
+  with `deadline_in(seconds)` provides equivalent semantics without the thread-
+  interruption hazards. `ScopeTimeoutError < TimeoutError` is raised on expiry.
+- **RAG/VectorStore blocking I/O placed behind `BlockingAdapterPool` async boundary** (#267):
+  `KnowledgeSource#fetch` and all three `VectorStore` backends now execute their
+  blocking I/O through `Runtime#blocking_io` when a `Runtime` is present. Callers
+  in a synchronous context see no change; callers in an EventLoop context benefit
+  from non-blocking scheduler behaviour.
   The cancellation token (passed via `config: { cancellation_token: token }`) is
   now checked at multiple additional points beyond the initial LLM call boundary:
   before each `KnowledgeSource#fetch` in `build_context` (RAG phase); after each
@@ -195,6 +336,15 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 ### Fixed
+- **`tool_name` preserved in `Orchestrator#prepare_tool_class` anonymous subclass wrapper**:
+  When `Orchestrator#prepare_tool_class` wrapped a subagent tool in an anonymous
+  subclass (`Class.new(prepared)`), the class-level instance variable `@tool_name`
+  was not inherited, causing the wrapper's `tool_name` to return `nil`. RubyLLM
+  then registered the tool under a `nil` key, making it unreachable when the LLM
+  called it by name. The fix captures the effective name before subclassing and
+  calls `tool_name effective_name` explicitly inside the anonymous class body —
+  the same pattern already used by the approval-gate wrapper.
 - **`EventLoop#start` is now idempotent; stale `:__stop__` sentinel race fixed** (#203):
   Calling `start` on an already-running `EventLoop` is now a no-op. Fixed a race condition
   where `stop` setting `@running = false` before the worker thread was scheduled left the

data/README.md CHANGED Viewed

@@ -22,31 +22,80 @@ It provides composable building blocks — Workflows, Agents, Tools, Guardrails,
 > **Note**: The `main` branch contains unreleased development work. Pin to a released gem
 > version (`gem "phronomy", "~> 0.x"`) for stability in production.
+**Core building blocks**
 | Feature | Stability |
 |---|---|
 | **Workflow** — Stateful, branching workflows with wait_state/send_event | Stable |
-| **Workflow EventLoop Mode** — Opt-in event-driven execution: `Phronomy.configure { \|c\| c.event_loop = true }` | Experimental |
-| **Agent EventLoop Mode** — `Agent#invoke` (non-blocking via EventLoop), `Agent#run_as_child` (child-FSM pattern for Workflow integration), parallel tool dispatch via `ParallelToolChat` | Experimental |
-| **Workflow Parallel Node** — Concurrent branches via application-level threads | Beta |
+| **Workflow action_timeout** — Per-state `action_timeout:` keyword on `state` DSL; cancels Task-returning entry actions that exceed the limit and raises `Phronomy::ActionTimeoutError` | Beta |
 | **Agent** — ReAct-style tool-calling agents with guardrails and conversation history | Stable |
 | **Before-Completion Hook** — Three-tier LLM parameter injection | Stable |
 | **Context Management** — Token budget calculation, estimation, and pruning | Stable |
-| **Knowledge/RAG** — Retrieval sources with pluggable loaders, splitters, and vector stores; `static_knowledge_refresh!` for runtime cache invalidation | Beta |
-| **`VectorStore#size`** — Returns document count for all three backends (InMemory, RedisSearch, Pgvector) | Beta |
-| **Multi-agent** — Agent-as-Tool pattern and hub-and-spoke handoff routing | Beta |
-| **GeneratorVerifier** — Generator-Verifier loop with injectable prompt builders/parsers | Beta |
-| **Agent::Orchestrator** — Parallel subagent dispatch, fan-out, and `subagent` DSL | Beta |
-| **Agent::TeamCoordinator** — Agent teams pattern: LLM coordinator + stateful workers with sequential task assignment (worker-local message history persisted across tasks) | Beta |
-| **Agent::SharedState** — Shared state pattern: peer agents collaborate via a shared KnowledgeStore; `member` DSL with per-agent instructions and `coordination` team protocol | Experimental |
 | **Guardrails** — Input/output validation with custom `InputGuardrail`/`OutputGuardrail` | Beta |
+| **`PromptInjectionGuardrail`** — Built-in `InputGuardrail` subclass that detects prompt-injection patterns; usable standalone or as part of a guardrail chain | Beta |
+| **`Tool::Base.redact_params` / `.max_result_size`** — Class-level DSL: `redact_params` masks parameter values in log/trace output; `max_result_size` truncates oversized tool results before they reach the LLM | Beta |
 | **Output Parser** — JSON and Struct-mapped parsers for structured LLM responses | Stable |
 | **Eval Framework** — Dataset-driven evaluation with multiple scorer types | Beta |
 | **Tracing** — Pluggable span-based observability | Stable |
-| **MCP Tool** — Model Context Protocol server integration | Beta |
 | **Error Taxonomy** — `RateLimitError`, `AuthenticationError`, `ContextLengthError`, `TransportError` (subclasses of `Phronomy::Error`) raised at the agent retry boundary | Beta |
-| **`Phronomy.with_configuration` / `Phronomy.reset_runtime!`** — Scoped configuration override and full runtime reset for test isolation | Beta |
+**Knowledge and integration**
+| Feature | Stability |
+|---|---|
+| **Knowledge/RAG** — Retrieval sources with pluggable loaders, splitters, and vector stores; `static_knowledge_refresh!` for runtime cache invalidation | Beta |
+| **`VectorStore#size`** — Returns document count for all three backends (InMemory, RedisSearch, Pgvector) | Beta |
+| **`VectorStore::AsyncBackend` mixin** — Pluggable async interface for `VectorStore`; default pool-backed implementations for `search_async`, `add_async`, `remove_async`, `clear_async`; backends with native async drivers override individual methods to bypass `BlockingAdapterPool` entirely; all existing backends remain unchanged | Beta |
+| **Parallel RAG multi-source fetch** — `Agent#build_context` fetches all `knowledge_sources` concurrently via `TaskGroup`; `config[:rag_failure_policy]` `:skip` (default) silently ignores failed sources so the agent answers with partial context, `:fail` surfaces the first error; per-source latency is emitted to `Phronomy.configuration.logger` at debug level | Beta |
+| **MCP Tool** — Model Context Protocol server integration | Beta |
+**Execution and reliability**
+| Feature | Stability |
+|---|---|
+| **Workflow EventLoop Mode** — Opt-in event-driven execution: `Phronomy.configure { \|c\| c.event_loop = true }` | Experimental |
+| **Agent EventLoop Mode** — `Agent#invoke` (non-blocking via EventLoop), `Agent#run_as_child` (child-FSM pattern for Workflow integration), parallel tool dispatch via `ParallelToolChat` | Experimental |
+| **`invoke_async` / `call_async`** — `Agent::Base#invoke_async` and `Workflow#invoke_async` return a `Task`; `Tool::Base#call_async` similarly; compatible with EventLoop and standalone contexts | Experimental |
 | **CancellationToken** — Cooperative cancellation via `cancel!`/`cancelled?`/`raise_if_cancelled!`; `timeout_after(seconds)` for monotonic-clock deadlines; optional `deadline:` (wall-clock) for backward compatibility; passed as `config: { cancellation_token: token }` to agents and `dispatch_parallel`; injected into `tool.execute` when the method declares a `cancellation_token:` keyword | Experimental |
 | **`dispatch_parallel` / `fan_out` `force_kill:` option** — `force_kill: false` (default) leaves timed-out workers running and raises `TimeoutError` immediately; `force_kill: true` restores the old `Thread#kill` behaviour with a `logger.warn` | Beta |
+| **`execution_mode` DSL on `Tool::Base`** — Declares how a tool's `execute` should be dispatched: `:cooperative` (same scheduler thread), `:blocking_io` (default; offloaded to `BlockingAdapterPool`), `:cpu_bound`, `:external_process` | Experimental |
+| **`invocation_context:` keyword on `Agent#invoke` / `Workflow#invoke`** — Pass a `Phronomy::InvocationContext` directly; `thread_id`, `cancellation_token`, and `deadline`-based timeout are derived from it; `task_id` / `parent_task_id` appear in trace spans automatically; `config:` keys remain supported as backward-compat aliases | Beta |
+| **`ConcurrencyGate` — unified backpressure** — Counting semaphore that enforces per-resource concurrency caps (`max_concurrent_agent_tasks`, `max_concurrent_tool_tasks`, `max_concurrent_workflow_tasks`, `max_concurrent_llm_calls`, `max_concurrent_rag_fetches`, `max_concurrent_vector_searches`); configured via `Phronomy.configure`; backpressure behaviour follows the global `backpressure` setting (`:wait`, `:raise`/`:reject`, `:timeout`); `nil` cap = unlimited (default) | Beta |
+| **Cooperative scheduler yield points** — `Runtime#yield` (cooperative yield; yields the current task's time slice); `Runtime#yield_if_needed(every: N)` (thread-local counter, yields every N calls); CPU-bound detection when `blocking_detect_threshold_ms` is set (warns and increments `non_yield_threshold_violation_count` when a task runs longer than the threshold without yielding); `starvation_threshold_ms` configuration field (default: 50ms) | Beta |
+| **`Phronomy::Metrics`** — `Phronomy::Metrics.snapshot` returns task-tree and pool counters; task-centric keys: `active_agent_tasks`, `active_tool_tasks`, `active_workflow_tasks`, `active_rag_tasks`, `active_llm_tasks`, `task_wait_time_p50_ms`, `task_wait_time_p95_ms`, `task_run_time_p50_ms`, `task_run_time_p95_ms`, `cancelled_tasks`, `failed_tasks`, `non_yield_threshold_violation_count`; pool/event-loop keys remain for backward compatibility; `Runtime#task_snapshot` exposes task-centric metrics directly | Beta |
+| **`Phronomy.with_configuration` / `Phronomy.reset_runtime!`** — Scoped configuration override and full runtime reset for test isolation | Beta |
+**Agent patterns**
+| Feature | Stability |
+|---|---|
+| **Workflow parallel pattern** — Concurrent branches via application-level threads (no built-in parallel primitive; see the Workflow section for the recommended pattern) | Beta |
+| **Multi-agent** — Agent-as-Tool pattern and hub-and-spoke handoff routing | Beta |
+| **GeneratorVerifier** — Generator-Verifier loop with injectable prompt builders/parsers | Beta |
+| **Agent::Orchestrator** — Parallel subagent dispatch, fan-out, and `subagent` DSL | Beta |
+| **Agent::TeamCoordinator** — Agent teams pattern: LLM coordinator + stateful workers with sequential task assignment (worker-local message history persisted across tasks) | Beta |
+| **Agent::SharedState** — Shared state pattern: peer agents collaborate via a shared KnowledgeStore; `member` DSL with per-agent instructions and `coordination` team protocol | Experimental |
+| **`ScopePolicy`** — Configurable policy callable that maps (tool, scope, agent) to `:allow`/`:approve`/`:reject`; default policy auto-routes high-risk scopes through the approval gate | Experimental |
+> **Public API boundary**: The tables above are the complete list of classes, modules, and features
+> intended for gem consumers. Every entry has an associated stability label.
+> All other classes, modules, and methods — including everything in the
+> [Advanced / Internal APIs](#advanced--internal-apis) section below — are
+> marked `@api private` in source and may change without notice. Do not
+> depend on internal APIs in application code.
+## Advanced / Internal APIs
+The APIs listed below are intended for advanced use cases, framework internals, and test infrastructure. Typical application code does not need to interact with them directly.
+> These APIs are subject to change without the same backwards-compatibility guarantees as the stable public API.
+| Feature | Stability |
+|---|---|
+| **`Phronomy::Diagnostics`** — Snapshot of scheduler internals for debug/monitoring; `SchedulerReentrancyError` raised on invalid re-entrant scheduler use; `Runtime.in_scheduler_context?` returns `true` when called from inside a scheduler task | Experimental |
+| **`Phronomy::Testing::FakeClock` / `FakeScheduler` / `SchedulerHelpers`** — Test helpers for deterministic concurrency specs: `FakeClock#advance(seconds)` controls time; `FakeScheduler` runs tasks synchronously and records `event_log`; `FakeScheduler#assert_order` / `#assert_cancelled` for ordering assertions; `FakeClock#advance_to_next_timer` fires the next pending callback; `Testing::SchedulerHelpers#with_fake_scheduler` replaces the global Runtime for the duration of a block | Beta |
+| **`Configuration#runtime_backend`** — `:thread` (default, one OS thread per task), `:immediate` (tests — tasks run synchronously, no extra threads), `:fiber` (**EXPERIMENTAL** — experimental validation backend only: runs tasks as Ruby Fibers on a cooperative scheduler to verify that framework components are truly non-blocking; **not for production use** and not a planned production replacement for `:thread`; no preemptive scheduling will be added). `:cooperative` is a **deprecated alias** for `:immediate` — do not use in new code | Beta |
+| **`Configuration#strict_runtime_guards`** — When `true`, calling `Agent#invoke` from inside a scheduler task raises `SchedulerReentrancyError`; when `false` (default) a warning is logged instead | Beta |
 ## Installation
@@ -150,13 +199,16 @@ puts "Approved: #{final.approved}"  # => true
 ```
 In EventLoop mode (`c.event_loop = true`), `Agent#run_as_child` spawns a child agent
-asynchronously. When the child succeeds, `:child_completed` is dispatched; when it fails,
-`:child_failed` is dispatched. Always declare both transitions to avoid a stuck workflow:
+asynchronously. When the child succeeds, `:child_completed` is dispatched with the result
+`{ output:, messages:, usage: }` as its payload; when it fails, `:child_failed` is
+dispatched. Always declare both transitions to avoid a stuck workflow:
 ```ruby
-# EventLoop mode: workflow that runs an agent as a child FSM
+# EventLoop mode: workflow that runs an agent as a child FSM.
+# The result { output:, messages:, usage: } arrives as the :child_completed event
+# payload — write it back to the context in the target state's entry action.
 entry :run_agent, ->(ctx) {
-  MyAgent.new.run_as_child(ctx.query, ctx: ctx) { |r| ctx.answer = r[:output] }
+  MyAgent.new.run_as_child(ctx.query, ctx: ctx)
 }
 transition from: :run_agent, on: :child_completed, to: :done
 transition from: :run_agent, on: :child_failed,    to: :handle_error
@@ -222,10 +274,11 @@ rescue Phronomy::GuardrailError => e
 end
 ```
-> **Limitations:** Phronomy ships no built-in guardrail implementations. There is no
-> built-in prompt injection detector, PII scanner, or content classifier. All guardrail
-> logic must be implemented by the application. Reference implementations for common
-> patterns are available in `phronomy-examples` (example 06).
+> **Note:** Phronomy includes `PromptInjectionGuardrail`, a built-in pattern-based
+> input guardrail that detects common injection patterns (see the feature table above).
+> PII scanning and content classification are **not** provided by the framework;
+> that logic must be implemented by the application. Reference implementations for
+> common patterns are available in `phronomy-examples` (example 06).
 ### Knowledge/RAG — Context injection and vector retrieval
@@ -407,9 +460,11 @@ class MyOrchestrator < Phronomy::Agent::Orchestrator
 end
 ```
-### Workflow Parallel Node — Concurrent branches
+### Workflow parallel pattern — Concurrent branches
-Phronomy does not provide a built-in parallel abstraction. Use application-level Ruby threads inside a `state` action:
+Phronomy does not provide a dedicated parallel-node primitive. The recommended
+pattern for concurrent branches is to use application-level Ruby threads inside
+a `state` action:
 ```ruby
 class EnrichContext
@@ -426,9 +481,9 @@ app = Phronomy::Workflow.define(EnrichContext) do
       summary: Thread.new { Summarizer.call(s) },
       tags:    Thread.new { Tagger.call(s) }
     }
-    # For production use, wrap with Timeout.timeout to avoid unbounded waits:
-    #   require "timeout"
-    #   Timeout.timeout(30) { threads.each_value(&:join) }
+    # For bounded waits, use Thread#join(timeout_seconds); nil means timed out — handle explicitly.
+    # Do not use Timeout.timeout or Thread#kill — both inject async exceptions that bypass cleanup.
+    # Prefer CancellationToken for cooperative cancellation of Phronomy-managed tasks.
     threads.each_value(&:join)
     s.merge(summary: threads[:summary].value, tags: Array(threads[:tags].value))
   end
@@ -535,6 +590,8 @@ Phronomy.configure do |c|
   c.trace_pii                       = false # default; set to true only when trace data contains no PII
   c.logger                          = nil   # optional; any object responding to #warn (e.g. Rails.logger)
   c.event_loop_stop_grace_seconds   = 5     # seconds to wait for sessions to drain on EventLoop#stop(drain: true)
+  c.runtime_backend                 = :thread   # :thread (default); :immediate (tests, synchronous); :fiber (experimental validation only); :cooperative (deprecated alias for :immediate)
+  c.strict_runtime_guards           = false          # when true, raises on invoke-inside-task
 end
 ```
@@ -546,6 +603,66 @@ end
 > The default is `false` (PII protection enabled). Set to `true` only when
 > trace data does not contain sensitive information.
+## Sync vs Async API
+Phronomy provides both synchronous and asynchronous invocation APIs.
+Understanding when to use each prevents scheduler stalls and hidden deadlocks.
+| Context | Recommended API |
+|---------|----------------|
+| Top-level application code, Rails controller, background job | `agent.invoke(input)` — blocks the calling thread until done |
+| Inside a `Runtime#spawn` block, `TaskGroup`, Workflow action, Tool `execute` | `agent.invoke_async(input).await` — non-blocking within the scheduler |
+### Why this matters
+`invoke` is a synchronous wrapper that calls `invoke_async` and then _blocks_ the calling
+thread until the task completes. When called from **inside** an active scheduler task, the
+calling task blocks the scheduler thread, preventing other tasks from making progress — a
+hidden deadlock when all scheduler threads are occupied.
+### Runtime guard
+Phronomy detects this pattern automatically:
+```ruby
+# Default (soft mode): logs a warning and continues
+Phronomy.configure { |c| c.strict_runtime_guards = false }
+# Strict mode: raises SchedulerReentrancyError immediately
+Phronomy.configure { |c| c.strict_runtime_guards = true }
+```
+You can also query the current context directly:
+```ruby
+Phronomy::Runtime.in_scheduler_context?  # => true if called from inside a task
+```
+### Migration: invoke → invoke_async
+```ruby
+# Before (blocks scheduler if called from inside a task)
+result = my_agent.invoke("Hello")
+# After (safe inside tasks and TaskGroups)
+result = my_agent.invoke_async("Hello").await
+```
+### :immediate backend (synchronous / test mode)
+The `:immediate` backend runs tasks synchronously using `FakeScheduler`
+(backed by `Task::ImmediateBackend`).  Blocking I/O is isolated in `BlockingAdapterPool`.
+To switch back to the default thread-per-task backend:
+```ruby
+Phronomy.configure { |c| c.runtime_backend = :thread }
+# or per-example using SchedulerHelpers:
+include Phronomy::Testing::SchedulerHelpers
+with_fake_scheduler do |sched|
+  # all spawns run synchronously; sched.event_log records every lifecycle event
+end
+```
 ## Context Management
 Phronomy includes a context window management layer. When model metadata is
@@ -583,7 +700,7 @@ class MyAgent < Phronomy::Agent::Base
   max_output_tokens 4096   # override max_output_tokens from registry
   context_overhead  600    # extra reservation for system prompt + tools
   invoke_timeout    30     # raise Phronomy::TimeoutError after 30 s (wait timeout, not cancellation)
-  max_parallel_tools 4     # cap concurrent tool-call threads (default: 10)
+  max_parallel_tools 4     # cap concurrent tool executions (default: 10)
 end
 ```
@@ -624,9 +741,13 @@ blocks always execute.
 > - Any external I/O (database query, vector search, HTTP request) inside those calls
 >
 > For deep in-flight safety, complement `CancellationToken` with per-source or
-> per-tool timeouts (e.g. `Net::HTTP#read_timeout`, `Timeout.timeout`, connection
-> pool limits). Ruby's GVL prevents fully preemptive cancellation without
-> `Thread#kill`, which Phronomy avoids by default due to resource safety concerns.
+> per-tool timeouts. Prefer library-native timeouts such as `Net::HTTP#read_timeout`,
+> database `statement_timeout`, or Redis client timeout — these signal the I/O layer
+> to abort cleanly. Avoid `Timeout.timeout` unless you understand its async-exception
+> risks: it injects `Timeout::Error` at an arbitrary execution point (the same
+> mechanism as `Thread#kill`), which Phronomy avoids by default due to resource
+> safety concerns. Ruby's GVL prevents fully preemptive cancellation without such
+> risky interruption.
 ```ruby
 token = Phronomy::CancellationToken.new
@@ -740,9 +861,11 @@ span attributes by default (`trace_pii: false`). To include full content in trac
 Phronomy configuration. Evaluate whether your tracing backend (OTLP collector, Jaeger,
 Honeycomb, etc.) meets your data-retention and privacy requirements.
-**Prompt injection** — Phronomy provides no built-in prompt injection detection.
-Applications that process untrusted user input should implement their own input
-guardrails (see the Guardrails section above).
+**Prompt injection** — Phronomy provides `PromptInjectionGuardrail`, a built-in
+pattern-based input guardrail that detects common injection patterns (ignore/override
+instructions, role-switching phrases, etc.). It is a useful starting point, not a
+comprehensive defence; applications processing untrusted input should layer additional
+custom guardrails as needed (see the Guardrails section above).
 **Tool and MCP security** — Tools can perform real-world side effects (database
 writes, API calls, file deletion). Treat tool execution as a privileged operation:

data/Rakefile CHANGED Viewed

@@ -7,4 +7,37 @@ RSpec::Core::RakeTask.new(:spec)
 require "standard/rake"
+# Verify that @api private classes do not leak into the public YARD output.
+# Any class or module without @api private that ends up in the public doc must
+# have a corresponding entry in the Features table in README.md.
+#
+# Usage: bundle exec rake yard_check
+desc "Build YARD docs excluding @api private items and check for undocumented public APIs"
+task :yard_check do
+  require "yard"
+  YARD::Registry.clear
+  YARD.parse(Dir["lib/**/*.rb"])
+  undocumented = []
+  YARD::Registry.all(:class, :module).each do |obj|
+    next if obj.visibility == :private
+    next if obj.tag(:api)&.name == "private"
+    next if obj.docstring.blank?
+    # Classes/modules with no docstring that are not @api private are worth
+    # noting, but only raise on truly undocumented public objects.
+    if obj.docstring.empty?
+      undocumented << obj.path
+    end
+  end
+  unless undocumented.empty?
+    warn "The following public classes/modules have no YARD documentation:\n" \
+         "  #{undocumented.join("\n  ")}\n" \
+         "Either add a docstring or mark them @api private."
+    exit 1
+  end
+  puts "yard_check passed — no undocumented public classes/modules found."
+end
 task default: %i[spec standard]

data/benchmark/baseline.json CHANGED Viewed

@@ -2,7 +2,7 @@
   "workflow_context_merge": 124364.81010472385,
   "workflow_define": 2179.945274115319,
   "tool_params_schema_definition": 19534379.159046534,
-  "dispatch_parallel_10": 1483.2255243486482,
+  "dispatch_parallel_10": 886.0,
   "cancellation_token_cancelled": 4335060.97443425,
   "cancellation_token_raise_if_cancelled_noop": 3566903.189098373,
   "trim_context_remove_2000": 1761.5700678986254

data/benchmark/bench_regression.rb CHANGED Viewed

@@ -91,6 +91,7 @@ stub_agent_class = Class.new(Phronomy::Agent::Base) do
   define_method(:invoke) do |_input, messages: [], thread_id: nil, config: {}|
     {output: "stub", messages: []}
   end
+  define_method(:invoke_async) { |input, **_kw| Phronomy::Runtime.instance.spawn(name: "bench-stub") { invoke(input) } }
 end
 orchestrator_class = Class.new(Phronomy::Agent::Orchestrator)

data/docs/decisions/004-invoke-timeout-is-not-cancellation.md CHANGED Viewed

@@ -49,3 +49,27 @@ transport layer participation.
 - Users who expect "cancel" semantics from a timeout will be surprised.
 - Proper cancellation requires the `CancellationToken` feature (#216), which
   has not yet been implemented.
+## Extension: PendingOperation#await cooperative cancellation semantics
+`BlockingAdapterPool::PendingOperation#await` also supports both `timeout:` and
+`cancellation_token:` parameters. The same non-preemptive rule applies here,
+consistent with ADR-010 (cooperative-first, non-preemptive concurrency model):
+1. **No forcible thread termination.** When a `cancellation_token` is cancelled,
+   `CancellationError` is raised to the `await` caller; when the timeout fires,
+   `TimeoutError` is raised instead. In both cases, the underlying worker thread
+   is **not** killed. The worker runs its block to natural completion.
+2. **Cooperative, not preemptive.** Cancellation takes effect only at `await`
+   call sites or at explicit `token.check!` checkpoints inside the submitted
+   block. Code that ignores the token will not be interrupted.
+3. **Timeout scope.** `timeout:` at `await` time is measured from the moment
+   `await` is called. If both submit-time and await-time timeouts are provided,
+   the earlier deadline wins.
+4. **Error propagation.** `CancellationError` (or `TimeoutError`) is raised to
+   the `await` caller; the submitter is responsible for handling it.
+These semantics are identical in spirit to the `invoke_timeout` decision above:
+the framework exposes a *wait* boundary, not a hard-kill boundary. Safe resource
+cleanup is the caller's responsibility.

data/docs/decisions/006-no-built-in-guardrails.md CHANGED Viewed

@@ -1,8 +1,8 @@
-# ADR-006: Built-in Guardrail Implementations Are Not Shipped
+# ADR-006: Minimal Built-in Guardrail Implementations
 ## Status
-Accepted
+Amended (see Amendment section below)
 ## Context
@@ -46,3 +46,21 @@ Users are responsible for implementing domain-specific guardrail logic.
 **Negative / Tradeoffs:**
 - Users must implement their own guardrails from scratch. Providing a cookbook
   of example patterns in the README partially mitigates this.
+## Amendment — `PromptInjectionGuardrail` Added
+After the original decision was accepted, `Guardrail::PromptInjectionGuardrail`
+was introduced as the **one exception** to the "no built-ins" rule.
+**Rationale for the exception:**
+- Prompt injection patterns are broadly applicable across almost all LLM
+  applications regardless of domain, unlike PII patterns which are locale-specific.
+- A lightweight, pure-regex implementation has no third-party dependency and
+  adds negligible gem weight.
+- It serves as a documented reference implementation that users can subclass with
+  `extra_patterns:` to extend.
+**Scope of the exception:**
+Only prompt-injection detection is provided as a built-in. PII scanning,
+content classification, and toxic-content filtering remain out of scope per the
+original decision.