RubyGems - language-operator - Versions diffs - 0.1.36 → 0.1.37 - Mend

language-operator 0.1.36 → 0.1.37

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (11) hide show

checksums.yaml +4 -4
data/Gemfile.lock +2 -1
data/lib/language_operator/cli/commands/system.rb +230 -90
data/lib/language_operator/templates/agent_synthesis.tmpl +243 -0
data/lib/language_operator/templates/schema/agent_dsl_openapi.yaml +1 -1
data/lib/language_operator/templates/schema/agent_dsl_schema.json +1 -1
data/lib/language_operator/version.rb +1 -1
metadata +17 -4
data/CI_STATUS.md +0 -56
data/lib/language_operator/templates/examples/agent_synthesis.tmpl +0 -133
/data/lib/language_operator/templates/{examples/persona_distillation.tmpl → persona_distillation.tmpl} +0 -0

checksums.yaml CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: dab6721a2a81cadf3c5a252b082c77cbe3dc9262480355637c4971e0bde231f3
-  data.tar.gz: b11d038d3928f29fdb91e09e859e04112f4f46d846556e8582ff33ee37c074ae
+  metadata.gz: 8b57f326a81a7683900ef0700d54b97c296019f74e044dc886cc8aa02703e5a3
+  data.tar.gz: 7ba35359cc513dd5d53df60269f7052449453032c6ce749631e5dd6d80983a66
 SHA512:
-  metadata.gz: 1b0ec3258b68eeb7469f8e6679dfb6f691465c404bc3c2a99c9fd99156ed96bd0e06279d0100ed495f13b134128a875e57eb3d50013139c36bb9cff23edf398b
-  data.tar.gz: fcb46362bb7ef404b990ca17e061416f1da10a10d5e7c9b65188292797f8ea96b6a4f5e9d67f73587e891a225dae7d76bcb3c34500ec64b192e6d06c5d57b18b
+  metadata.gz: 62327e75b4357c55c1de51fbb56eebe4a16faede2b3ead5591a0dd2b8dc8760116130f9e9820ca47499aaa5ad3e285048862db7fef788d96416c4cc9fe123560
+  data.tar.gz: 0f63c9ca8e9523f6825a053cbd02a21ccd4103067b36ad190fb78316df051d0b6972a36917fd2a4ef9f26e449deaaaac487a83ae63e9f562fd9c317a8df2f490

data/Gemfile.lock CHANGED Viewed

@@ -1,7 +1,8 @@
 PATH
   remote: .
   specs:
-    language-operator (0.1.36)
+    language-operator (0.1.37)
+      faraday (~> 2.0)
       k8s-ruby (~> 0.17)
       mcp (~> 0.4)
       opentelemetry-exporter-otlp (~> 0.27)

data/lib/language_operator/cli/commands/system.rb CHANGED Viewed

@@ -198,49 +198,58 @@ module LanguageOperator
           end
         end
-        desc 'test-synthesis', 'Test agent synthesis from natural language instructions'
+        desc 'synthesize INSTRUCTIONS', 'Synthesize agent code from natural language instructions'
         long_desc <<-DESC
-          Test the agent synthesis process by converting natural language instructions
+          Synthesize agent code by converting natural language instructions
           into Ruby DSL code without creating an actual agent.
+          This command uses a LanguageModel resource from your cluster to generate
+          agent code. If --model is not specified, the first available model will
+          be auto-selected.
           This command helps you validate your instructions and understand how the
           synthesis engine interprets them. Use --dry-run to see the prompt that
           would be sent to the LLM, or run without it to generate actual code.
           Examples:
             # Test with dry-run (show prompt only)
-            aictl system test-synthesis --instructions "Monitor GitHub issues daily" --dry-run
+            aictl system synthesize "Monitor GitHub issues daily" --dry-run
+            # Generate code from instructions (auto-selects first available model)
+            aictl system synthesize "Send daily reports to Slack"
+            # Use a specific cluster model
+            aictl system synthesize "Process webhooks from GitHub" --model my-claude
-            # Generate code from instructions
-            aictl system test-synthesis --instructions "Send daily reports to Slack"
+            # Output raw code without formatting (useful for piping to files)
+            aictl system synthesize "Monitor logs" --raw > agent.rb
             # Specify custom agent name and tools
-            aictl system test-synthesis \\
-              --instructions "Process webhooks from GitHub" \\
+            aictl system synthesize "Process webhooks from GitHub" \\
               --agent-name github-processor \\
-              --tools github,slack
-            # Specify available models
-            aictl system test-synthesis \\
-              --instructions "Analyze logs every hour" \\
-              --models gpt-4,claude-3-5-sonnet
+              --tools github,slack \\
+              --model my-gpt4
         DESC
-        option :instructions, type: :string, required: true, desc: 'Natural language instructions for the agent'
         option :agent_name, type: :string, default: 'test-agent', desc: 'Name for the test agent'
         option :tools, type: :string, desc: 'Comma-separated list of available tools'
-        option :models, type: :string, desc: 'Comma-separated list of available models'
+        option :models, type: :string, desc: 'Comma-separated list of available models (from cluster)'
+        option :model, type: :string, desc: 'Model to use for synthesis (defaults to first available in cluster)'
         option :dry_run, type: :boolean, default: false, desc: 'Show prompt without calling LLM'
-        def test_synthesis
-          handle_command_error('test synthesis') do
+        option :raw, type: :boolean, default: false, desc: 'Output only the raw code without formatting'
+        def synthesize(instructions)
+          handle_command_error('synthesize agent') do
+            # Select model to use for synthesis
+            selected_model = select_synthesis_model
             # Load synthesis template
             template_content = load_bundled_template('agent')
             # Detect temporal intent from instructions
-            temporal_intent = detect_temporal_intent(options[:instructions])
+            temporal_intent = detect_temporal_intent(instructions)
             # Prepare template data
             template_data = {
-              'Instructions' => options[:instructions],
+              'Instructions' => instructions,
               'AgentName' => options[:agent_name],
               'ToolsList' => format_tools_list(options[:tools]),
               'ModelsList' => format_models_list(options[:models]),
@@ -267,11 +276,8 @@ module LanguageOperator
               return
             end
-            # Call LLM to generate code
-            puts 'Generating agent code from instructions...'
-            puts
-            llm_response = call_llm_for_synthesis(rendered_prompt)
+            # Call LLM to generate code (no output - just do it)
+            llm_response = call_llm_for_synthesis(rendered_prompt, selected_model)
             # Extract Ruby code from response
             generated_code = extract_ruby_code(llm_response)
@@ -284,34 +290,19 @@ module LanguageOperator
               exit 1
             end
-            # Display generated code
-            puts 'Generated Code:'
-            puts '=' * 80
-            puts generated_code
-            puts '=' * 80
-            puts
-            # Validate generated code
-            puts 'Validating generated code...'
-            validation_result = validate_code_against_schema(generated_code)
-            if validation_result[:valid] && validation_result[:warnings].empty?
-              Formatters::ProgressFormatter.success('✅ Code is valid - No issues found')
-            elsif validation_result[:valid]
-              Formatters::ProgressFormatter.success('✅ Code is valid - With warnings')
-              puts
-              validation_result[:warnings].each do |warn|
-                puts "  ⚠  #{warn[:message]}"
-              end
-            else
-              Formatters::ProgressFormatter.error('❌ Code validation failed')
-              puts
-              validation_result[:errors].each do |err|
-                puts "  ✗ #{err[:message]}"
-              end
+            # Handle raw output
+            if options[:raw]
+              puts generated_code
+              return
             end
-            puts
+            # Display formatted code
+            require 'rouge'
+            formatter = Rouge::Formatters::Terminal256.new
+            lexer = Rouge::Lexers::Ruby.new
+            highlighted_code = formatter.format(lexer.lex(generated_code))
+            puts highlighted_code
           end
         end
@@ -449,68 +440,217 @@ module LanguageOperator
         # Format models list for template
         def format_models_list(models_str)
-          # If not specified, try to detect from environment
+          # If not specified, try to detect from cluster
           if models_str.nil? || models_str.strip.empty?
             models = detect_available_models
             return models.map { |model| "- #{model}" }.join("\n") unless models.empty?
-            return 'No models specified (configure ANTHROPIC_API_KEY or OPENAI_API_KEY)'
+            return 'No models available (run: aictl model list)'
           end
           models = models_str.split(',').map(&:strip)
           models.map { |model| "- #{model}" }.join("\n")
         end
-        # Detect available models from environment
+        # Detect available models from cluster
         def detect_available_models
-          models = []
-          models << 'claude-3-5-sonnet-20241022' if ENV['ANTHROPIC_API_KEY']
-          models << 'gpt-4-turbo' if ENV['OPENAI_API_KEY']
-          models
+          models = ctx.client.list_resources('LanguageModel', namespace: ctx.namespace)
+          models.map { |m| m.dig('metadata', 'name') }
+        rescue StandardError => e
+          Formatters::ProgressFormatter.error("Failed to list models from cluster: #{e.message}")
+          []
         end
-        # Call LLM to generate code from synthesis prompt
-        def call_llm_for_synthesis(prompt)
-          require 'ruby_llm'
+        # Select model to use for synthesis
+        def select_synthesis_model
+          # If --model option specified, use it
+          return options[:model] if options[:model]
+          # Otherwise, auto-select from available cluster models
+          available_models = detect_available_models
-          # Check for API keys
-          unless ENV['ANTHROPIC_API_KEY'] || ENV['OPENAI_API_KEY']
-            Formatters::ProgressFormatter.error('No LLM credentials found')
+          if available_models.empty?
+            Formatters::ProgressFormatter.error('No models available in cluster')
             puts
-            puts 'Please set one of the following environment variables:'
-            puts '  - ANTHROPIC_API_KEY (for Claude models)'
-            puts '  - OPENAI_API_KEY (for GPT models)'
+            puts 'Please create a model first:'
+            puts '  aictl model create'
+            puts
+            puts 'Or list existing models:'
+            puts '  aictl model list'
             exit 1
           end
-          # Prefer Anthropic if available
-          if ENV['ANTHROPIC_API_KEY']
-            provider = :anthropic
-            api_key = ENV['ANTHROPIC_API_KEY']
-            model = 'claude-3-5-sonnet-20241022'
-          else
-            provider = :openai
-            api_key = ENV.fetch('OPENAI_API_KEY', nil)
-            model = 'gpt-4-turbo'
+          # Auto-select first available model (silently)
+          available_models.first
+        end
+        # Get endpoint for a cluster model
+        def get_model_endpoint(model_name)
+          # For cluster models, we use the service endpoint
+          # The service is typically named the same as the model and listens on port 4000
+          "http://#{model_name}.#{ctx.namespace}.svc.cluster.local:4000/v1"
+        end
+        # Call LLM to generate code from synthesis prompt using cluster model
+        def call_llm_for_synthesis(prompt, model_name)
+          require 'json'
+          require 'faraday'
+          # Get model resource
+          model = get_resource_or_exit('LanguageModel', model_name)
+          model_id = model.dig('spec', 'modelName')
+          # Get the model's pod
+          pod = get_model_pod(model_name)
+          pod_name = pod.dig('metadata', 'name')
+          # Set up port-forward to access the model pod
+          port_forward_pid = nil
+          local_port = find_available_port
+          begin
+            # Start kubectl port-forward in background
+            port_forward_pid = start_port_forward(pod_name, local_port, 4000)
+            # Wait for port-forward to be ready
+            wait_for_port(local_port)
+            # Build the JSON payload for the chat completion request
+            payload = {
+              model: model_id,
+              messages: [{ role: 'user', content: prompt }],
+              max_tokens: 4000,
+              temperature: 0.3
+            }
+            # Make HTTP request using Faraday
+            conn = Faraday.new(url: "http://localhost:#{local_port}") do |f|
+              f.request :json
+              f.response :json
+              f.adapter Faraday.default_adapter
+              f.options.timeout = 120
+              f.options.open_timeout = 10
+            end
+            response = conn.post('/v1/chat/completions', payload)
+            # Parse response
+            result = response.body
+            if result['error']
+              error_msg = result['error']['message'] || result['error']
+              raise "Model error: #{error_msg}"
+            elsif !result['choices'] || result['choices'].empty?
+              raise "Unexpected response format: #{result.inspect}"
+            end
+            # Extract the content from the first choice
+            result.dig('choices', 0, 'message', 'content')
+          rescue Faraday::TimeoutError
+            raise 'LLM request timed out after 120 seconds'
+          rescue Faraday::ConnectionFailed => e
+            raise "Failed to connect to model: #{e.message}"
+          rescue StandardError => e
+            Formatters::ProgressFormatter.error("LLM call failed: #{e.message}")
+            puts
+            puts "Make sure the model '#{model_name}' is running: kubectl get pods -n #{ctx.namespace}"
+            exit 1
+          ensure
+            # Clean up port-forward process
+            cleanup_port_forward(port_forward_pid) if port_forward_pid
           end
+        end
+        # Get the pod for a model
+        def get_model_pod(model_name)
+          # Get the deployment for the model
+          deployment = ctx.client.get_resource('Deployment', model_name, ctx.namespace)
+          labels = deployment.dig('spec', 'selector', 'matchLabels')
-          # Create client and call LLM
-          client = RubyLLM.new(provider: provider, api_key: api_key)
-          messages = [{ role: 'user', content: prompt }]
+          raise "Deployment '#{model_name}' has no selector labels" if labels.nil?
-          response = client.chat(messages, model: model, max_tokens: 4000, temperature: 0.3)
+          # Convert to hash if needed
+          labels_hash = labels.respond_to?(:to_h) ? labels.to_h : labels
+          raise "Deployment '#{model_name}' has empty selector labels" if labels_hash.empty?
-          # Extract content from response
-          if response.is_a?(Hash) && response.key?('content')
-            response['content']
-          elsif response.is_a?(String)
-            response
-          else
-            response.to_s
+          label_selector = labels_hash.map { |k, v| "#{k}=#{v}" }.join(',')
+          # Find a running pod
+          pods = ctx.client.list_resources('Pod', namespace: ctx.namespace, label_selector: label_selector)
+          raise "No pods found for model '#{model_name}'" if pods.empty?
+          running_pod = pods.find do |pod|
+            pod.dig('status', 'phase') == 'Running' &&
+              pod.dig('status', 'conditions')&.any? { |c| c['type'] == 'Ready' && c['status'] == 'True' }
+          end
+          if running_pod.nil?
+            pod_phases = pods.map { |p| p.dig('status', 'phase') }.join(', ')
+            raise "No running pods found. Pod phases: #{pod_phases}"
+          end
+          running_pod
+        rescue K8s::Error::NotFound
+          raise "Model deployment '#{model_name}' not found"
+        end
+        # Find an available local port for port-forwarding
+        def find_available_port
+          require 'socket'
+          # Try ports in the range 14000-14999
+          (14_000..14_999).each do |port|
+            server = TCPServer.new('127.0.0.1', port)
+            server.close
+            return port
+          rescue Errno::EADDRINUSE
+            # Port in use, try next
+            next
+          end
+          raise 'No available ports found in range 14000-14999'
+        end
+        # Start kubectl port-forward in background
+        def start_port_forward(pod_name, local_port, remote_port)
+          require 'English'
+          cmd = "kubectl port-forward -n #{ctx.namespace} #{pod_name} #{local_port}:#{remote_port}"
+          pid = spawn(cmd, out: '/dev/null', err: '/dev/null')
+          # Detach so it runs in background
+          Process.detach(pid)
+          pid
+        end
+        # Wait for port-forward to be ready
+        def wait_for_port(port, max_attempts: 30)
+          require 'socket'
+          max_attempts.times do
+            socket = TCPSocket.new('127.0.0.1', port)
+            socket.close
+            return true
+          rescue Errno::ECONNREFUSED, Errno::EHOSTUNREACH
+            sleep 0.1
+          end
+          raise "Port-forward to localhost:#{port} failed to become ready after #{max_attempts} attempts"
+        end
+        # Clean up port-forward process
+        def cleanup_port_forward(pid)
+          return unless pid
+          begin
+            Process.kill('TERM', pid)
+            Process.wait(pid, Process::WNOHANG)
+          rescue Errno::ESRCH
+            # Process already gone
+          rescue Errno::ECHILD
+            # Process already reaped
           end
-        rescue StandardError => e
-          Formatters::ProgressFormatter.error("LLM call failed: #{e.message}")
-          exit 1
         end
         # Extract Ruby code from LLM response
@@ -550,7 +690,7 @@ module LanguageOperator
         # Load bundled template from gem
         def load_bundled_template(type)
           filename = type == 'agent' ? 'agent_synthesis.tmpl' : 'persona_distillation.tmpl'
-          template_path = File.join(__dir__, '..', '..', 'templates', 'examples', filename)
+          template_path = File.join(__dir__, '..', '..', 'templates', filename)
           File.read(template_path)
         end

data/lib/language_operator/templates/agent_synthesis.tmpl ADDED Viewed

@@ -0,0 +1,243 @@
+You are generating Ruby DSL code for an autonomous agent in a Kubernetes operator.
+{{if .ErrorContext}}
+## IMPORTANT: Self-Healing Synthesis - Attempt {{.AttemptNumber}}
+The previous code synthesis encountered errors. Please analyze the errors below and generate CORRECTED code.
+### Previous Synthesis Failures
+{{if .ErrorContext.ValidationErrors}}
+**Validation Errors** (detected during code generation):
+{{range .ErrorContext.ValidationErrors}}
+- {{.}}
+{{end}}
+{{end}}
+{{if .ErrorContext.RuntimeErrors}}
+**Runtime Errors** (detected during execution):
+{{range .ErrorContext.RuntimeErrors}}
+- Time: {{.Timestamp}}
+- Type: {{.ErrorType}}
+- Message: {{.ErrorMessage}}
+{{if .StackTrace}}
+- Stack Trace:
+{{range .StackTrace}}
+  {{.}}
+{{end}}
+{{end}}
+- Exit Code: {{.ContainerExitCode}}
+{{end}}
+{{end}}
+{{if .ErrorContext.LastCrashLog}}
+**Last Container Logs** (before crash):
+```
+{{.ErrorContext.LastCrashLog}}
+```
+{{end}}
+### Your Task
+1. Carefully analyze each error above
+2. Identify the root cause of the failure
+3. Generate CORRECTED Ruby DSL code that addresses ALL errors
+4. Ensure the code:
+   - Fixes the specific errors mentioned
+   - Uses only available tools: {{.ToolsList}}
+   - Uses only available models: {{.ModelsList}}
+   - Follows the Language Operator DSL syntax exactly
+   - Does NOT use any dangerous Ruby methods (system, eval, etc.)
+This is attempt {{.AttemptNumber}} of {{.MaxAttempts}}. The user is counting on you to get it right!
+{{if .LastKnownGoodCode}}
+### Last Known Working Code (for reference)
+```ruby
+{{.LastKnownGoodCode}}
+```
+{{end}}
+{{else}}
+## Agent Synthesis Request
+{{end}}
+**User Instructions:**
+{{.Instructions}}
+**Available Tools:**
+{{.ToolsList}}
+**Available Models:**
+{{.ModelsList}}
+**Agent Name:** {{.AgentName}}
+**Detected Temporal Intent:** {{.TemporalIntent}}
+**Runtime Context:**
+- All agent messages and output are automatically logged to stdout
+- Agents have access to a workspace directory for file operations
+- LLM responses are captured and available in agent execution context
+## DSL v1 Reference Examples
+Study these examples to understand the task/main model with organic functions:
+### Example 1: Simple Scheduled Agent (Fully Neural)
+```ruby
+agent "daily-report" do
+  description "Generate daily sales report"
+  mode :scheduled
+  schedule "0 9 * * *"
+  task :fetch_sales,
+    instructions: "fetch yesterday's sales data from database",
+    inputs: {},
+    outputs: { sales: 'array', total: 'number' }
+  task :generate_report,
+    instructions: "create a markdown report summarizing the sales",
+    inputs: { sales: 'array', total: 'number' },
+    outputs: { report: 'string' }
+  main do |inputs|
+    sales_data = execute_task(:fetch_sales)
+    report = execute_task(:generate_report, inputs: sales_data)
+    report
+  end
+  output do |outputs|
+    puts outputs[:report]
+  end
+end
+```
+### Example 2: Hybrid Neural-Symbolic Agent
+```ruby
+agent "code-reviewer" do
+  description "Review pull requests"
+  # Symbolic task - deterministic tool call
+  task :fetch_pr_diff do |inputs|
+    {
+      diff: execute_tool('github', 'get_pr_diff', pr_number: inputs[:pr_number])
+    }
+  end
+  # Neural task - creative analysis
+  task :analyze_code,
+    instructions: "review code changes for bugs and improvements",
+    inputs: { diff: 'string' },
+    outputs: { issues: 'array', severity: 'string' }
+  # Symbolic task - deterministic logic
+  task :should_approve do |inputs|
+    critical = inputs[:issues].select { |i| i['severity'] == 'critical' }
+    { approve: critical.empty? }
+  end
+  main do |inputs|
+    pr_data = execute_task(:fetch_pr_diff, inputs: inputs)
+    analysis = execute_task(:analyze_code, inputs: pr_data)
+    decision = execute_task(:should_approve, inputs: analysis)
+    analysis.merge(decision)
+  end
+end
+```
+### Example 3: Multi-Step Agent with Tools
+```ruby
+agent "data-pipeline" do
+  description "ETL pipeline"
+  task :extract_data,
+    instructions: "extract data from the source database",
+    inputs: { source: 'string' },
+    outputs: { records: 'array', count: 'integer' }
+  task :transform_data,
+    instructions: "clean and normalize the records",
+    inputs: { records: 'array' },
+    outputs: { cleaned_records: 'array' }
+  task :load_data,
+    instructions: "load cleaned records into warehouse",
+    inputs: { cleaned_records: 'array' },
+    outputs: { success: 'boolean', loaded_count: 'integer' }
+  main do |inputs|
+    extracted = execute_task(:extract_data, inputs: inputs)
+    transformed = execute_task(:transform_data, inputs: extracted)
+    result = execute_task(:load_data, inputs: transformed)
+    result
+  end
+end
+```
+## Your Task: Generate DSL v1 Agent
+Using the examples above as reference, generate Ruby DSL code in this format:
+```ruby
+require 'language_operator'
+agent "{{.AgentName}}" do
+  description "Brief description extracted from instructions"
+{{.PersonaSection}}{{.ScheduleSection}}
+  # Break down instructions into tasks
+  # Each task needs:
+  # - instructions: what to do (for neural tasks)
+  # - inputs: hash with parameter types
+  # - outputs: hash with result types
+  task :task_name,
+    instructions: "clear description of what this task does",
+    inputs: { param_name: 'type' },
+    outputs: { result_name: 'type' }
+  # For symbolic tasks (when logic is simple/deterministic), use code blocks:
+  # task :task_name do |inputs|
+  #   { result: inputs[:param] * 2 }
+  # end
+  # REQUIRED: main block defines execution flow
+  main do |inputs|
+    result1 = execute_task(:task_name, inputs: { param_name: value })
+    # Chain tasks by passing outputs as inputs
+    result2 = execute_task(:another_task, inputs: result1)
+    result2  # Return final result
+  end
+{{.ConstraintsSection}}
+  # Output handling
+  output do |outputs|
+    # Save results, send notifications, etc.
+    puts "Agent completed: #{outputs.inspect}"
+  end
+end
+```
+**Type System:**
+- `'string'` - text values
+- `'integer'` - whole numbers
+- `'number'` - decimal numbers
+- `'boolean'` - true/false
+- `'array'` - lists
+- `'hash'` - key-value objects
+- `'any'` - any type
+**Rules:**
+1. Generate ONLY the Ruby code within triple-backticks, no explanations before or after
+{{.ScheduleRules}}
+3. Break down instructions into clear tasks with type-safe contracts
+4. REQUIRED: Always include a `main` block that calls tasks via `execute_task()`
+5. For simple deterministic tasks, use symbolic code blocks (do |inputs| ... end)
+6. For complex/creative tasks, use neural tasks with instructions
+7. Chain tasks by passing outputs as inputs
+8. Use available tools: {{.ToolsList}}
+9. Type schemas are REQUIRED for all tasks (inputs/outputs hashes)
+10. Use the agent name: "{{.AgentName}}"
+Generate the code now:

data/lib/language_operator/templates/schema/agent_dsl_openapi.yaml CHANGED Viewed

@@ -2,7 +2,7 @@
 :openapi: 3.0.3
 :info:
   :title: Language Operator Agent API
-  :version: 0.1.36
+  :version: 0.1.37
   :description: HTTP API endpoints exposed by Language Operator reactive agents
   :contact:
     :name: Language Operator

data/lib/language_operator/templates/schema/agent_dsl_schema.json CHANGED Viewed

@@ -3,7 +3,7 @@
   "$id": "https://github.com/language-operator/language-operator-gem/schema/agent-dsl.json",
   "title": "Language Operator Agent DSL",
   "description": "Schema for defining autonomous AI agents using the Language Operator DSL",
-  "version": "0.1.36",
+  "version": "0.1.37",
   "type": "object",
   "properties": {
     "name": {

data/lib/language_operator/version.rb CHANGED Viewed

@@ -1,5 +1,5 @@
 # frozen_string_literal: true
 module LanguageOperator
-  VERSION = '0.1.36'
+  VERSION = '0.1.37'
 end

metadata CHANGED Viewed

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: language-operator
 version: !ruby/object:Gem::Version
-  version: 0.1.36
+  version: 0.1.37
 platform: ruby
 authors:
 - James Ryan
@@ -135,6 +135,20 @@ dependencies:
     - - "~>"
       - !ruby/object:Gem::Version
         version: '3.9'
+- !ruby/object:Gem::Dependency
+  name: faraday
+  requirement: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '2.0'
+  type: :runtime
+  prerelease: false
+  version_requirements: !ruby/object:Gem::Requirement
+    requirements:
+    - - "~>"
+      - !ruby/object:Gem::Version
+        version: '2.0'
 - !ruby/object:Gem::Dependency
   name: k8s-ruby
   requirement: !ruby/object:Gem::Requirement
@@ -397,7 +411,6 @@ extra_rdoc_files: []
 files:
 - ".rubocop.yml"
 - CHANGELOG.md
-- CI_STATUS.md
 - Gemfile
 - Gemfile.lock
 - LICENSE
@@ -506,8 +519,8 @@ files:
 - lib/language_operator/retryable.rb
 - lib/language_operator/synthesis_test_harness.rb
 - lib/language_operator/templates/README.md
-- lib/language_operator/templates/examples/agent_synthesis.tmpl
-- lib/language_operator/templates/examples/persona_distillation.tmpl
+- lib/language_operator/templates/agent_synthesis.tmpl
+- lib/language_operator/templates/persona_distillation.tmpl
 - lib/language_operator/templates/schema/.gitkeep
 - lib/language_operator/templates/schema/CHANGELOG.md
 - lib/language_operator/templates/schema/agent_dsl_openapi.yaml

data/CI_STATUS.md DELETED Viewed

@@ -1,56 +0,0 @@
-# CI Integration Test Status
-## Summary
-The CI integration tests are significantly improved from their previous completely broken state.
-### Fixed Issues
-1. **Numeric Constant Error** ✅
-   - **Problem**: SafeExecutor sandbox was blocking access to Ruby type constants (Numeric, Integer, Float, etc.)
-   - **Solution**: Inject type constants into the evaluated code scope in SafeExecutor#eval
-   - **Impact**: All symbolic tasks using type checking now work correctly
-2. **Neural Task Connection Errors** ✅
-   - **Problem**: Agent tried to connect to real LLM when INTEGRATION_MOCK_LLM=true, failing with "Not connected"
-   - **Solution**: Create mock chat object in create_test_agent when mocking is enabled
-   - **Impact**: Neural tasks can now execute without real LLM connection
-3. **Deep Symbol Keys** ✅
-   - **Problem**: Nested hashes in neural task outputs had string keys, tests expected symbol keys
-   - **Solution**: Implement deep_symbolize_keys in TaskExecutor#parse_neural_response
-   - **Impact**: Nested hash structures now match test expectations
-4. **Multi-Provider LLM Support** ✅
-   - **Problem**: Tests only supported OpenAI
-   - **Solution**: Added support for SYNTHESIS_*, ANTHROPIC_*, and OPENAI_API_KEY env vars
-   - **Impact**: Tests can use local models, Claude, or OpenAI
-### Current Test Status
-**Passing Tests** (28/72, 39%):
-- ✅ Comprehensive DSL v1 Integration (all 4 scenarios)
-- ✅ Symbolic Task Execution (complete)
-- ✅ Error Handling (skipped DSL syntax issues)
-- ✅ Type Coercion (partial)
-**Failing Tests** (44/72, 61%):
-- ❌ Neural Task Execution - individual mocks don't match all output schemas
-- ❌ Hybrid Agent Execution - some neural tasks failing
-- ❌ Parallel Execution - some neural tasks failing
-**Pending Tests**: 20 (performance benchmarks disabled)
-### Recommendations
-For full CI coverage with mocked LLMs, consider:
-1. Use real LLM in CI (with API key secrets) instead of mocking
-2. Add schema-aware mock generation based on task output definitions
-3. Add individual mocks for each failing neural task (tedious but thorough)
-### Bottom Line
-**Before**: 100% failure rate - all tests broken
-**After**: 39% pass rate with core functionality working
-The most critical tests (comprehensive integration) now pass. The CI is in a MUCH better state than before.

data/lib/language_operator/templates/examples/agent_synthesis.tmpl DELETED Viewed

@@ -1,133 +0,0 @@
-You are generating Ruby DSL code for an autonomous agent in a Kubernetes operator.
-{{if .ErrorContext}}
-## IMPORTANT: Self-Healing Synthesis - Attempt {{.AttemptNumber}}
-The previous code synthesis encountered errors. Please analyze the errors below and generate CORRECTED code.
-### Previous Synthesis Failures
-{{if .ErrorContext.ValidationErrors}}
-**Validation Errors** (detected during code generation):
-{{range .ErrorContext.ValidationErrors}}
-- {{.}}
-{{end}}
-{{end}}
-{{if .ErrorContext.RuntimeErrors}}
-**Runtime Errors** (detected during execution):
-{{range .ErrorContext.RuntimeErrors}}
-- Time: {{.Timestamp}}
-- Type: {{.ErrorType}}
-- Message: {{.ErrorMessage}}
-{{if .StackTrace}}
-- Stack Trace:
-{{range .StackTrace}}
-  {{.}}
-{{end}}
-{{end}}
-- Exit Code: {{.ContainerExitCode}}
-{{end}}
-{{end}}
-{{if .ErrorContext.LastCrashLog}}
-**Last Container Logs** (before crash):
-```
-{{.ErrorContext.LastCrashLog}}
-```
-{{end}}
-### Your Task
-1. Carefully analyze each error above
-2. Identify the root cause of the failure
-3. Generate CORRECTED Ruby DSL code that addresses ALL errors
-4. Ensure the code:
-   - Fixes the specific errors mentioned
-   - Uses only available tools: {{.ToolsList}}
-   - Uses only available models: {{.ModelsList}}
-   - Follows the Language Operator DSL syntax exactly
-   - Does NOT use any dangerous Ruby methods (system, eval, etc.)
-This is attempt {{.AttemptNumber}} of {{.MaxAttempts}}. The user is counting on you to get it right!
-{{if .LastKnownGoodCode}}
-### Last Known Working Code (for reference)
-```ruby
-{{.LastKnownGoodCode}}
-```
-{{end}}
-{{else}}
-## Agent Synthesis Request
-{{end}}
-**User Instructions:**
-{{.Instructions}}
-**Available Tools:**
-{{.ToolsList}}
-**Available Models:**
-{{.ModelsList}}
-**Agent Name:** {{.AgentName}}
-**Detected Temporal Intent:** {{.TemporalIntent}}
-**Runtime Context:**
-- All agent messages and output are automatically logged to stdout
-- Agents have access to a workspace directory for file operations
-- LLM responses are captured and available in agent execution context
-Generate Ruby DSL code using this exact format (wrapped in triple-backticks with ruby):
-```ruby
-require 'language_operator'
-agent "{{.AgentName}}" do
-  description "Brief description extracted from instructions"
-{{.PersonaSection}}{{.ScheduleSection}}
-  # Extract objectives from instructions
-  objectives [
-    "First objective",
-    "Second objective"
-  ]
-  # REQUIRED: Define workflow with at least one step
-  workflow do
-    # Use tools when available
-    step :step_name, tool: "tool_name", params: {key: "value"}
-    # Or use execute blocks for custom Ruby code (simple logging, calculations, etc.)
-    step :custom_step do
-      execute do
-        puts "Custom output from Ruby code"
-        { result: "done" }
-      end
-    end
-    # Chain steps with dependencies if needed
-    step :another_step, depends_on: :step_name
-  end
-{{.ConstraintsSection}}
-  # Output configuration (if workspace enabled)
-  output do
-    workspace "results/output.txt"
-  end
-end
-```
-**Rules:**
-1. Generate ONLY the Ruby code within triple-backticks, no explanations before or after
-{{.ScheduleRules}}
-5. Break down instructions into clear, actionable objectives
-6. REQUIRED: Always include a workflow block with at least one step (even for simple single-action agents)
-7. For simple tasks (logging, calculations), use a single step with an execute block containing Ruby code
-8. For complex tasks, use multiple steps with tools or execute blocks
-9. Use available tools in workflow steps when tools are provided
-10. Use the agent name: "{{.AgentName}}"
-Generate the code now:

/data/lib/language_operator/templates/{examples/persona_distillation.tmpl → persona_distillation.tmpl} RENAMED Viewed

File without changes