RubyGems - language-operator - Versions diffs - 0.1.65 → 0.1.66 - Mend

language-operator 0.1.65 → 0.1.66

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (19)

checksums.yaml +4 -4
data/Gemfile.lock +1 -1
data/README.md +20 -1
data/components/agent/Gemfile +1 -1
data/docs/observability.md +208 -0
data/lib/language_operator/agent/task_executor.rb +11 -1
data/lib/language_operator/agent.rb +24 -14
data/lib/language_operator/cli/commands/agent/base.rb +140 -47
data/lib/language_operator/cli/commands/agent/code_operations.rb +157 -16
data/lib/language_operator/cli/errors/suggestions.rb +1 -1
data/lib/language_operator/constants.rb +1 -0
data/lib/language_operator/kubernetes/client.rb +1 -1
data/lib/language_operator/version.rb +1 -1
data/synth/003/Makefile +12 -2
data/synth/003/agent.txt +1 -1
metadata +4 -6
data/lib/language_operator/cli/commands/agent/learning.rb +0 -408
data/synth/003/agent.optimized.rb +0 -66
data/synth/003/agent.synthesized.rb +0 -41

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: d7c2d32e32603ef4e3f33c04ded923e5c24d207f544a70c146034fa0d07c1f38
-  data.tar.gz: 64d778197bbcd3af3e3071db8954287912274fc9117838a8a320a8e5745956b1
+  metadata.gz: f9121bdf48b4ee7bad4c9918d68ddf554966acd1c264cd8e1a1cadc13e5c6459
+  data.tar.gz: cf90195c887a60165e1aee1990e2dafe2b69a37c736362d1bcbf341dd21dfb3f
 SHA512:
-  metadata.gz: e97cd6b388ac0a8965e0be894b0e22089b3139a7cadff32979ab5920516579003e2d686933737ad55aaa4c6ff51e0ff747601702d653bbc1e755b69c307410a8
-  data.tar.gz: e28c7f2db22231bfb41c6daac57154905acd11d6ab92c683146ec0e1bbe22fd3a109cff5af501fb86f9cc80b623342b1fca956a4d3ebb51b7bc7ba618581020b
+  metadata.gz: f533c69e38c4bc604b939e54b743228a4640611292d5afbe7cbaa79f0b4f05436c8e1360cb64a8f1de623acddc08d291d39a4381817a530bd4b96f4408699913
+  data.tar.gz: 1511bba1d7471abaa63e1029fe9b229414dd9f2b745fb2f38b83d073ac529506b00a913fd56418c4a461c06bf061a0526973a5213574ed2cc70332a0249d8d00

data/Gemfile.lock CHANGED

@@ -1,7 +1,7 @@
 PATH
   remote: .
   specs:
-    language-operator (0.1.65)
+    language-operator (0.1.66)
       faraday (~> 2.0)
       k8s-ruby (~> 0.17)
       lru_redux (~> 1.1)

data/README.md CHANGED

@@ -2,4 +2,23 @@
 [![Gem Version](https://img.shields.io/gem/v/language-operator.svg)](https://rubygems.org/gems/language-operator)
-This gem is experimental, used by [language-operator](https://github.com/language-operator/language-operator), and not ready for production.
+This gem is experimental, used by [language-operator](https://github.com/language-operator/language-operator), and not ready for production.
+## Observability
+The gem includes comprehensive OpenTelemetry instrumentation for monitoring agent executions and enabling the learning system to optimize performance.
+**Span Hierarchy:**
+```
+agent_executor (parent span - overall agent run)
+  └── task_executor.execute_task (child span - task execution)
+      └── execute_tool #{tool_name} (grandchild span - tool calls)
+```
+**Key Features:**
+- Automatic trace generation following OpenTelemetry GenAI conventions
+- Learning system integration via standardized span names and attributes
+- Optional data capture with privacy controls
+- Performance monitoring and debugging support
+For detailed information, see [docs/observability.md](./docs/observability.md).

data/components/agent/Gemfile CHANGED

@@ -2,7 +2,7 @@
 source 'https://rubygems.org'
-gem 'language-operator', '~> 0.1.65', path: '../..'
+gem 'language-operator', '~> 0.1.66'
 # Agent-specific dependencies for autonomous execution
 gem 'concurrent-ruby', '~> 1.3'

data/docs/observability.md ADDED

@@ -0,0 +1,208 @@
+# Observability and Telemetry
+The Language Operator gem includes comprehensive OpenTelemetry instrumentation to enable observability, debugging, and optimization of agent executions.
+## OpenTelemetry Integration
+The gem automatically instruments agent executions with OpenTelemetry spans, following the [OpenTelemetry Semantic Conventions for GenAI](https://opentelemetry.io/docs/specs/semconv/gen-ai/).
+### Configuration
+Configure telemetry via environment variables:
+```bash
+# Basic telemetry (always enabled)
+OTEL_EXPORTER_OTLP_ENDPOINT=https://your-otel-collector:4317
+# Data capture controls (optional - defaults to metadata only)
+CAPTURE_TASK_INPUTS=true      # Capture full task inputs as JSON
+CAPTURE_TASK_OUTPUTS=true     # Capture full task outputs as JSON
+CAPTURE_TOOL_ARGS=true        # Capture tool call arguments
+CAPTURE_TOOL_RESULTS=true     # Capture tool call results
+```
+**Security Note:** Data capture is disabled by default to prevent sensitive information leakage. Only enable full data capture in secure environments.
+## Span Hierarchy
+The gem creates a hierarchical trace structure that enables the learning system to identify and analyze complete agent executions:
+```
+agent_executor (parent span - overall agent run)
+  └── task_executor.execute_task (child span - task 1)
+      └── execute_tool github (grandchild span - tool call 1)
+      └── execute_tool slack (grandchild span - tool call 2)
+  └── task_executor.execute_task (child span - task 2)
+  └── task_executor.execute_task (child span - task 3)
+```
+### Span Names
+| Span Name | Purpose | Created By |
+|-----------|---------|------------|
+| `agent_executor` | Overall agent execution | `LanguageOperator::Agent.execute_main_block()` |
+| `task_executor.execute_task` | Individual task execution | `TaskExecutor#execute_task()` |
+| `execute_tool #{tool_name}` | Tool calls from LLM responses | `TaskTracer#record_single_tool_call()` |
+| `execute_tool.#{tool_name}` | Direct tool calls from symbolic tasks | `Client::Base` tool wrapper |
+## Span Attributes
+### Agent Executor Span
+The top-level `agent_executor` span includes:
+```
+agent.name: "my-agent"           # Agent identifier
+agent.task_count: 5              # Number of tasks in agent
+agent.mode: "autonomous"         # Execution mode (autonomous/scheduled/interactive)
+```
+### Task Executor Span
+Each `task_executor.execute_task` span includes:
+```
+# Core identification (CRITICAL for learning system)
+task.name: "fetch_user_data"            # Task identifier
+gen_ai.operation.name: "execute_task"   # Operation type
+# Execution metadata
+task.max_retries: 3                     # Retry configuration
+task.timeout: 30000                     # Timeout in milliseconds
+task.type: "hybrid"                     # Task type (neural/symbolic/hybrid)
+task.has_neural: "true"                 # Has neural implementation
+task.has_symbolic: "false"              # Has symbolic implementation
+# Agent context
+agent.name: "my-agent"                  # Agent identifier (explicit for learning system)
+# Data capture (when enabled)
+task.inputs: '{"user_id": 123}'         # JSON-encoded inputs (CAPTURE_TASK_INPUTS=true)
+task.outputs: '{"user": {...}}'         # JSON-encoded outputs (CAPTURE_TASK_OUTPUTS=true)
+```
+### Tool Call Spans
+Tool calls create spans with names like `execute_tool #{tool_name}` and include:
+```
+# GenAI semantic attributes
+gen_ai.operation.name: "execute_tool"           # Operation type
+gen_ai.tool.name: "github"                      # Tool identifier
+gen_ai.tool.call.id: "call_123"                 # Call ID (if available)
+# Data capture (when enabled)
+gen_ai.tool.call.arguments: '{"repo": "..."}'   # JSON arguments (CAPTURE_TOOL_ARGS=true)
+gen_ai.tool.call.result: '{"status": "ok"}'     # JSON result (CAPTURE_TOOL_RESULTS=true)
+# Size metadata (always captured)
+gen_ai.tool.call.arguments.size: 45             # Arguments size in bytes
+gen_ai.tool.call.result.size: 1024              # Result size in bytes
+```
+## Learning System Integration
+This span naming convention enables the language-operator Kubernetes controller to:
+1. **Identify Task Executions**: Query traces by `task_executor.execute_task` spans
+2. **Group by Agent**: Filter by `agent.name` attribute
+3. **Analyze Patterns**: Extract execution patterns from span attributes
+4. **Build Optimizations**: Create optimized implementations based on trace analysis
+### Example OTLP Query
+To find all task executions for an agent:
+```sql
+SELECT * FROM spans
+WHERE name = 'task_executor.execute_task'
+  AND attributes['agent.name'] = 'my-agent'
+  AND start_time > NOW() - INTERVAL '1 hour'
+```
+## Data Privacy and Security
+### Default Behavior (Secure)
+By default, the gem captures:
+- ✅ Task names and metadata
+- ✅ Execution timing and counts
+- ✅ Tool names and call frequencies
+- ✅ Data sizes (bytes)
+- ❌ **NOT** actual data content
+### Full Data Capture (Optional)
+When explicitly enabled, the gem additionally captures:
+- ⚠️ Complete task inputs and outputs as JSON
+- ⚠️ Tool call arguments and results
+- ⚠️ LLM prompts and responses
+**Warning:** Only enable full data capture in development or secure production environments. Captured data may contain sensitive information.
+### Data Sanitization
+When full capture is enabled, the gem:
+- Truncates large payloads (>1000 chars for span attributes)
+- Converts complex objects to JSON automatically
+- Respects OpenTelemetry attribute limits
+## Performance Impact
+Telemetry overhead is minimal:
+- **Default mode**: <5% performance overhead
+- **Full capture mode**: ~10% performance overhead
+- **Span creation**: <1ms per span
+- **Data serialization**: 1-5ms for complex objects
+## Debugging with Traces
+### Common Queries
+**Find slow tasks:**
+```sql
+SELECT attributes['task.name'], duration_ms
+FROM spans
+WHERE name = 'task_executor.execute_task'
+  AND duration_ms > 5000
+ORDER BY duration_ms DESC
+```
+**Tool usage analysis:**
+```sql
+SELECT attributes['gen_ai.tool.name'], COUNT(*)
+FROM spans
+WHERE name LIKE 'execute_tool%'
+GROUP BY attributes['gen_ai.tool.name']
+```
+**Agent execution frequency:**
+```sql
+SELECT attributes['agent.name'], COUNT(*) as executions
+FROM spans
+WHERE name = 'agent_executor'
+  AND start_time > NOW() - INTERVAL '24 hours'
+GROUP BY attributes['agent.name']
+```
+### Trace Sampling
+For high-volume agents, consider trace sampling:
+```bash
+# Sample 10% of traces
+OTEL_TRACES_SAMPLER=parentbased_traceidratio
+OTEL_TRACES_SAMPLER_ARG=0.1
+```
+## Related Documentation
+- [Agent Runtime Architecture](./agent-internals.md) - How agents execute
+- [Best Practices](./best-practices.md) - Production deployment guidance
+- [Understanding Generated Code](./understanding-generated-code.md) - Agent code structure
+## External Resources
+- [OpenTelemetry Semantic Conventions](https://opentelemetry.io/docs/specs/semconv/gen-ai/)
+- [Language Operator Controller](https://github.com/language-operator/language-operator) - Learning system implementation
+- [OTLP Specification](https://opentelemetry.io/docs/specs/otlp/) - Wire format
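The new observability document describes opt-in capture flags and truncation of payloads longer than 1000 characters, but the helper that implements them is not part of this diff. The following is a minimal sketch of how such gating and truncation could look. The names `CAPTURE_ENV_VARS`, `MAX_ATTRIBUTE_CHARS`, and `sanitize_for_span` are hypothetical, and treating only the literal string `true` as enabled is an assumption, not confirmed behavior of the gem.

```ruby
require 'json'

# Hypothetical limit, taken from the "Data Sanitization" section (>1000 chars).
MAX_ATTRIBUTE_CHARS = 1000

# Hypothetical mapping from a capture kind to the documented environment variable.
CAPTURE_ENV_VARS = {
  inputs: 'CAPTURE_TASK_INPUTS',
  outputs: 'CAPTURE_TASK_OUTPUTS',
  tool_args: 'CAPTURE_TOOL_ARGS',
  tool_results: 'CAPTURE_TOOL_RESULTS'
}.freeze

# Assumption: a flag counts as enabled only when it is exactly "true".
# `env` is injectable so the check can be exercised without touching ENV.
def capture_enabled?(kind, env = ENV)
  env.fetch(CAPTURE_ENV_VARS.fetch(kind), 'false') == 'true'
end

# Serialize a value to JSON (strings pass through) and cut it to the limit.
def sanitize_for_span(value, limit = MAX_ATTRIBUTE_CHARS)
  json = value.is_a?(String) ? value : value.to_json
  json.length > limit ? json[0, limit] : json
end
```

With this sketch, `capture_enabled?(:inputs)` is false unless `CAPTURE_TASK_INPUTS=true` is set, which matches the documented metadata-only default.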

data/lib/language_operator/agent/task_executor.rb CHANGED

@@ -138,6 +138,10 @@ module LanguageOperator
           # Execute with retry logic
           result = execute_with_retry(task, task_name, inputs, timeout, max_retries, execution_start)
+          # Add task outputs to span for learning system (if enabled)
+          current_span = OpenTelemetry::Trace.current_span
+          current_span&.set_attribute('task.outputs', result.to_json) if current_span && capture_enabled?(:outputs)
           # Emit Kubernetes event for successful task completion
           emit_task_execution_event(task_name, success: true, execution_start: execution_start)
@@ -1023,13 +1027,19 @@ module LanguageOperator
         attributes = {
           # Core task identification (CRITICAL for learning system)
           'task.name' => task_name.to_s,
-          'task.inputs' => inputs.keys.map(&:to_s).join(','),
           'task.max_retries' => max_retries,
           # Semantic operation name for better trace organization
           'gen_ai.operation.name' => 'execute_task'
         }
+        # Add task inputs - JSON-encoded if capture enabled, else just keys
+        attributes['task.inputs'] = if capture_enabled?(:inputs)
+                                      inputs.to_json
+                                    else
+                                      inputs.keys.map(&:to_s).join(',')
+                                    end
         # Explicitly add agent name if available (redundant with resource attribute but ensures visibility)
         if (agent_name = ENV.fetch('AGENT_NAME', nil))
           attributes['agent.name'] = agent_name
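The second hunk above makes `task.inputs` carry one of two shapes depending on the capture flag. A small stand-alone sketch of that branch follows; `task_input_attribute` is a hypothetical name used only for illustration, and the boolean `capture:` argument stands in for the gem's `capture_enabled?(:inputs)` call.

```ruby
require 'json'

# Hypothetical stand-alone version of the attribute-building branch above.
def task_input_attribute(inputs, capture:)
  capture ? inputs.to_json : inputs.keys.map(&:to_s).join(',')
end

inputs = { user_id: 123, region: 'eu' }
keys_only = task_input_attribute(inputs, capture: false) # "user_id,region"
full_json = task_input_attribute(inputs, capture: true)  # '{"user_id":123,"region":"eu"}'
```

Because the same attribute name holds either a comma-separated key list or a JSON document, anything that reads `task.inputs` from traces probably needs to handle both formats.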

data/lib/language_operator/agent.rb CHANGED

@@ -4,6 +4,7 @@ require_relative 'agent/base'
 require_relative 'agent/executor'
 require_relative 'agent/task_executor'
 require_relative 'agent/web_server'
+require_relative 'agent/instrumentation'
 require_relative 'dsl'
 require_relative 'logger'
@@ -24,6 +25,8 @@ module LanguageOperator
   #   agent.execute_goal("Summarize daily news")
   # rubocop:disable Metrics/ModuleLength
   module Agent
+    extend LanguageOperator::Agent::Instrumentation
     # Module-level logger for Agent framework
     @logger = LanguageOperator::Logger.new(component: 'Agent')
@@ -215,22 +218,29 @@ module LanguageOperator
                   agent: agent_def.name,
                   task_count: agent_def.tasks.size)
-      # Get inputs from environment or default to empty hash
-      inputs = {}
-      # Execute main block with task executor as context
-      result = agent_def.main.call(inputs, task_executor)
-      logger.info('Main block execution completed',
-                  result: result)
+      # Execute main block within agent_executor span for learning system integration
+      with_span('agent_executor', attributes: {
+                  'agent.name' => agent_def.name,
+                  'agent.task_count' => agent_def.tasks.size,
+                  'agent.mode' => ENV.fetch('AGENT_MODE', 'unknown')
+                }) do
+        # Get inputs from environment or default to empty hash
+        inputs = {}
+        # Execute main block with task executor as context
+        result = agent_def.main.call(inputs, task_executor)
+        logger.info('Main block execution completed',
+                    result: result)
+        # Call output handler if defined
+        if agent_def.output
+          logger.debug('Executing output handler', outputs: result)
+          execute_output_handler(agent_def, result, task_executor)
+        end
-      # Call output handler if defined
-      if agent_def.output
-        logger.debug('Executing output handler', outputs: result)
-        execute_output_handler(agent_def, result, task_executor)
+        result
       end
-      result
     end
     # Execute main block (DSL v1) in persistent mode for autonomous agents
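The refactor above moves `result` inside the `with_span` block, which only works if `with_span` returns the block's value. The `Instrumentation` module itself is not shown in this diff, so the sketch below uses an in-memory stand-in (`FakeInstrumentation` and `DemoAgent` are hypothetical names) to illustrate that contract. It also shows the `'unknown'` fallback for `agent.mode` when `AGENT_MODE` is unset, a value the observability document does not list among its modes.

```ruby
# Stand-in tracer that records spans in memory. It is NOT the gem's
# Instrumentation module, whose implementation is not part of this diff.
module FakeInstrumentation
  def recorded_spans
    @recorded_spans ||= []
  end

  # Yields to the block and returns the block's value. The span is recorded
  # in the ensure clause, so it is kept even if the block raises.
  def with_span(name, attributes: {})
    started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
    yield
  ensure
    elapsed = Process.clock_gettime(Process::CLOCK_MONOTONIC) - started
    recorded_spans << { name: name, attributes: attributes, duration_s: elapsed }
  end
end

module DemoAgent
  extend FakeInstrumentation

  def self.run(agent_name, task_count)
    with_span('agent_executor', attributes: {
                'agent.name' => agent_name,
                'agent.task_count' => task_count,
                'agent.mode' => ENV.fetch('AGENT_MODE', 'unknown')
              }) do
      result = { status: 'ok' }
      result # last expression of the block becomes the method's return value
    end
  end
end
```

Calling `DemoAgent.run('my-agent', 3)` returns the hash built inside the block and leaves one recorded `agent_executor` span.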

data/lib/language_operator/cli/commands/agent/base.rb CHANGED

@@ -1,6 +1,7 @@
 # frozen_string_literal: true
 require 'thor'
+require 'json'
 require_relative '../../command_loader'
 require_relative '../../wizards/agent_wizard'
@@ -9,7 +10,6 @@ require_relative 'workspace'
 require_relative 'code_operations'
 require_relative 'logs'
 require_relative 'lifecycle'
-require_relative 'learning'
 # Include helper modules
 require_relative 'helpers/cluster_llm_client'
@@ -35,7 +35,6 @@ module LanguageOperator
           include CodeOperations
           include Logs
           include Lifecycle
-          include Learning
           # NOTE: Core commands (create, list, inspect, delete) will be added below
           # This file is a placeholder for the refactoring process
@@ -173,6 +172,9 @@ module LanguageOperator
               # Main agent information
               puts
               status = agent.dig('status', 'phase') || 'Unknown'
+              creation_timestamp = agent.dig('metadata', 'creationTimestamp')
+              formatted_created = creation_timestamp ? Formatters::ValueFormatter.time_ago(Time.parse(creation_timestamp)) : nil
               format_agent_details(
                 name: name,
                 namespace: ctx.namespace,
@@ -180,8 +182,8 @@ module LanguageOperator
                 status: format_status(status),
                 mode: agent.dig('spec', 'executionMode') || 'autonomous',
                 schedule: agent.dig('spec', 'schedule'),
-                persona: agent.dig('spec', 'persona'),
-                created: agent.dig('metadata', 'creationTimestamp')
+                persona: agent.dig('spec', 'persona') || 'None',
+                created: formatted_created
               )
               puts
@@ -191,7 +193,6 @@ module LanguageOperator
                 exec_data = get_execution_data(name, ctx)
                 exec_rows = {
-                  'Total Runs' => exec_data[:total_runs],
                   'Last Run' => exec_data[:last_run] || 'Never'
                 }
                 exec_rows['Next Run'] = exec_data[:next_run] || 'N/A' if agent.dig('spec', 'schedule')
@@ -200,6 +201,10 @@ module LanguageOperator
                 puts
               end
+              # Learning status
+              display_learning_section(agent, name, ctx)
+              puts
               # Resources
               resources = agent.dig('spec', 'resources')
               if resources
@@ -302,62 +307,71 @@ module LanguageOperator
               Formatters::ProgressFormatter.with_spinner("Deleting agent '#{name}'") do
                 ctx.client.delete_resource(RESOURCE_AGENT, name, ctx.namespace)
               end
+              # Verify deletion completed
+              verify_agent_deletion(ctx, name)
             end
           end
-          desc 'versions NAME', 'Show ConfigMap versions managed by operator'
-          long_desc <<-DESC
-            List the versioned ConfigMaps created by the operator for an agent.
+          private
-            Shows the automatic optimization history and available versions for rollback.
+          # Display learning status section in agent inspect
+          def display_learning_section(agent, _name, _ctx)
+            annotations = agent.dig('metadata', 'annotations')
+            annotations = annotations.respond_to?(:to_h) ? annotations.to_h : (annotations || {})
-            Examples:
-              aictl agent versions my-agent
-              aictl agent versions my-agent --cluster production
-          DESC
-          option :cluster, type: :string, desc: 'Override current cluster context'
-          def versions(name)
-            handle_command_error('list agent versions') do
-              ctx = CLI::Helpers::ClusterContext.from_options(options)
+            # Determine learning state
+            learning_enabled = !annotations.key?(Constants::KubernetesLabels::LEARNING_DISABLED_LABEL)
-              # Get agent to verify it exists
-              get_resource_or_exit(RESOURCE_AGENT, name)
+            # Get runs pending learning from agent status
+            runs_pending_learning = agent.dig('status', 'runsPendingLearning') || 0
+            learning_threshold = 10 # Standard threshold
-              # List all ConfigMaps with the agent label
-              config_maps = ctx.client.list_resources('ConfigMap', namespace: ctx.namespace)
+            # Calculate progress percentage
+            progress_percent = [(runs_pending_learning.to_f / learning_threshold * 100).round, 100].min
+            runs_display = if runs_pending_learning >= learning_threshold
+                             "#{runs_pending_learning}/#{learning_threshold} #{pastel.green('(Ready)')}"
+                           else
+                             "#{runs_pending_learning}/#{learning_threshold} (#{progress_percent}%)"
+                           end
-              # Filter for versioned ConfigMaps for this agent
-              agent_configs = config_maps.select do |cm|
-                labels = cm.dig('metadata', 'labels') || {}
-                labels['agent'] == name && labels['version']
-              end
-              # Sort by version (assuming numeric versions)
-              agent_configs.sort! do |a, b|
-                version_a = a.dig('metadata', 'labels', 'version').to_i
-                version_b = b.dig('metadata', 'labels', 'version').to_i
-                version_b <=> version_a # Reverse order (newest first)
-              end
+            status_color = learning_enabled ? :green : :yellow
+            status_text = learning_enabled ? 'Enabled' : 'Disabled'
-              display_agent_versions(agent_configs, name, ctx.name)
-            end
+            highlighted_box(
+              title: 'Learning',
+              color: :cyan,
+              rows: {
+                'Status' => pastel.send(status_color).bold(status_text),
+                'Threshold' => "#{pastel.cyan('10 successful runs')} (auto-learning trigger)",
+                'Confidence Target' => "#{pastel.cyan('85%')} (pattern detection)",
+                'Runs Recorded' => runs_display
+              }
+            )
           end
-          private
           # Shared helper methods that are used across multiple commands
           # These will be extracted from the original agent.rb
-          def handle_agent_not_found(name, ctx, error)
+          def handle_agent_not_found(name, ctx, error = nil)
             # Get available agents for fuzzy matching
             agents = ctx.client.list_resources(RESOURCE_AGENT, namespace: ctx.namespace)
             available_names = agents.map { |a| a.dig('metadata', 'name') }
-            CLI::Errors::Handler.handle_not_found(error,
-                                                  resource_type: RESOURCE_AGENT,
-                                                  resource_name: name,
-                                                  cluster: ctx.name,
-                                                  available_resources: available_names)
+            # Create error if not provided
+            error ||= K8s::Error::NotFound.new('GET', "/apis/langop.io/v1alpha1/namespaces/#{ctx.namespace}/languageagents/#{name}", 404, 'Not Found')
+            begin
+              CLI::Errors::Handler.handle_not_found(error, {
+                                                      resource_type: RESOURCE_AGENT,
+                                                      resource_name: name,
+                                                      cluster: ctx.name,
+                                                      available_resources: available_names
+                                                    })
+            rescue CLI::Errors::NotFoundError
+              # Error message already displayed by handler, just exit gracefully
+              exit 1
+            end
           end
           def display_agent_created(agent, ctx, _description, _synthesis_result)
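The "Runs Recorded" row in `display_learning_section` caps the percentage at 100 and switches to a "(Ready)" label once the threshold is reached. A plain-text sketch of that arithmetic, without the `pastel` coloring, might look like this; `runs_display` as a free-standing method and the `LEARNING_THRESHOLD` constant are illustrative names, not part of the gem.

```ruby
# Threshold hard-coded as 10 in display_learning_section.
LEARNING_THRESHOLD = 10

# Plain-text version of the "Runs Recorded" row. The real method colors
# "(Ready)" with pastel, which is omitted here.
def runs_display(runs_pending, threshold = LEARNING_THRESHOLD)
  percent = [(runs_pending.to_f / threshold * 100).round, 100].min
  if runs_pending >= threshold
    "#{runs_pending}/#{threshold} (Ready)"
  else
    "#{runs_pending}/#{threshold} (#{percent}%)"
  end
end

runs_display(3)  # "3/10 (30%)"
runs_display(12) # "12/10 (Ready)"
```

Note that the numerator is not capped, so an agent with 12 pending runs is shown as `12/10`.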
@@ -372,8 +386,8 @@ module LanguageOperator
               status: format_status(status),
               mode: agent.dig('spec', 'executionMode') || 'autonomous',
               schedule: agent.dig('spec', 'schedule'),
-              persona: agent.dig('spec', 'persona') || '(auto-selected)',
-              created: Time.now.strftime('%Y-%m-%dT%H:%M:%SZ')
+              persona: agent.dig('spec', 'persona') || 'None',
+              created: 'just now'
             )
             puts
@@ -526,11 +540,17 @@ module LanguageOperator
             end
             table_data = agents.map do |agent|
+              status = if agent.dig('metadata', 'deletionTimestamp')
+                         'Pending Deletion'
+                       else
+                         agent.dig('status', 'phase') || 'Unknown'
+                       end
               {
                 name: agent.dig('metadata', 'name'),
                 namespace: agent.dig('metadata', 'namespace') || context.namespace,
                 mode: agent.dig('spec', 'executionMode') || 'autonomous',
-                status: agent.dig('status', 'phase') || 'Unknown'
+                status: status
               }
             end
@@ -556,11 +576,17 @@ module LanguageOperator
               agents = ctx.client.list_resources(RESOURCE_AGENT, namespace: ctx.namespace)
               agents.each do |agent|
+                status = if agent.dig('metadata', 'deletionTimestamp')
+                           'Pending Deletion'
+                         else
+                           agent.dig('status', 'phase') || 'Unknown'
+                         end
                 all_agents << {
                   cluster: cluster[:name],
                   name: agent.dig('metadata', 'name'),
                   mode: agent.dig('spec', 'executionMode') || 'autonomous',
-                  status: agent.dig('status', 'phase') || 'Unknown',
+                  status: status,
                   next_run: agent.dig('status', 'nextRun') || 'N/A',
                   executions: agent.dig('status', 'executionCount') || 0
                 }
@@ -828,6 +854,73 @@ module LanguageOperator
           rescue StandardError
             schedule
           end
+          def verify_agent_deletion(ctx, name)
+            max_wait = 30  # Wait up to 30 seconds
+            interval = 2   # Check every 2 seconds
+            elapsed = 0
+            Formatters::ProgressFormatter.with_spinner('Verifying deletion') do
+              loop do
+                begin
+                  agent = ctx.client.get_resource(RESOURCE_AGENT, name, ctx.namespace)
+                  # Check if deletion is stuck on finalizers
+                  deletion_timestamp = agent.dig('metadata', 'deletionTimestamp')
+                  if deletion_timestamp
+                    finalizers = agent.dig('metadata', 'finalizers') || []
+                    if finalizers.any?
+                      if elapsed >= max_wait
+                        deletion_stuck_error(name, finalizers)
+                        return
+                      end
+                    end
+                  end
+                rescue K8s::Error::NotFound
+                  # Agent successfully deleted
+                  break
+                end
+                if elapsed >= max_wait
+                  deletion_timeout_error(name)
+                  return
+                end
+                sleep interval
+                elapsed += interval
+              end
+            end
+            # Deletion verified - no additional success message needed
+          end
+          def deletion_stuck_error(name, finalizers)
+            puts
+            Formatters::ProgressFormatter.error("Deletion of agent '#{name}' is stuck")
+            puts
+            puts "The agent has the following finalizers preventing deletion:"
+            finalizers.each { |f| puts "  - #{pastel.yellow(f)}" }
+            puts
+            puts "This usually indicates the operator is not running properly."
+            puts
+            puts "To diagnose:"
+            puts "  kubectl get pods -n kube-system | grep language-operator"
+            puts "  kubectl logs -n kube-system -l app.kubernetes.io/name=language-operator"
+            puts
+            puts "Emergency cleanup (advanced users only):"
+            puts "  kubectl patch languageagent #{name} -p '{\"metadata\":{\"finalizers\":null}}' --type=merge"
+          end
+          def deletion_timeout_error(name)
+            puts
+            Formatters::ProgressFormatter.warn("Could not verify deletion of agent '#{name}' within 30 seconds")
+            puts
+            puts "Check deletion status with:"
+            puts "  aictl agent list"
+            puts "  kubectl get languageagent #{name}"
+            puts
+            puts "If the agent shows 'Unknown' status, it may be pending deletion."
+          end
         end
       end
     end
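`verify_agent_deletion` polls the cluster every 2 seconds for up to 30 seconds. The core of that loop can be sketched without any Kubernetes client as a generic helper. `poll_until` and its injectable `sleeper` are hypothetical names introduced here so the timing logic can be exercised without real waiting; the block stands in for "the agent is gone", which the original detects by rescuing `K8s::Error::NotFound`.

```ruby
# Polls until the block returns a truthy value or the time budget is spent.
# Returns :done or :timeout. `sleeper` is injectable so the loop can be
# tested without actually sleeping.
def poll_until(max_wait:, interval:, sleeper: ->(seconds) { sleep(seconds) })
  elapsed = 0
  loop do
    return :done if yield
    return :timeout if elapsed >= max_wait

    sleeper.call(interval)
    elapsed += interval
  end
end

# Usage: a check that never succeeds runs once per interval, plus the
# initial attempt, before timing out.
attempts = 0
outcome = poll_until(max_wait: 30, interval: 2, sleeper: ->(_) {}) do
  attempts += 1
  false
end
```

With the same 30-second budget and 2-second interval as the original, a check that never succeeds is attempted 16 times (at 0, 2, ..., 30 seconds) before `:timeout` is returned.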