rubita 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8)

checksums.yaml ADDED

@@ -0,0 +1,7 @@
+---
+SHA256:
+  metadata.gz: cc492f76a821f0b1b802564b280f4a99162f8d76855ffce00e644906003bdbe5
+  data.tar.gz: cec711f5a8c8802a40840b6d3bc5152115e84fe3e55e5d18f8a835e3184c2d2b
+SHA512:
+  metadata.gz: 676b956444b8497f5c885cb6ecb627df93ca489d7db63fbe587f3790afc92f8d185dfc2000155ab774ab9209fe3a12820c5ee3ced6ca8a7b801bf01a1a4d0d98
+  data.tar.gz: dffbb32c61c04ba730524c610aee6978157bdc0aab9725c50566c504252ec250c6f225d171d5e6ca4391f60949caeaecd9af7e4447922a4594af92e4deb41966
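
The two digest sets above cover `metadata.gz` and `data.tar.gz`, the members of the `.gem` archive. A minimal sketch of checking them after unpacking, assuming the members sit in a local directory and `checksums.yaml` has already been parsed into a Hash (for example with `YAML.safe_load`). `checksum_mismatches` is a hypothetical helper name, not a RubyGems API:

```ruby
require "digest/sha2"

# Hypothetical helper: compare digests of unpacked gem members against the
# entries of a parsed checksums.yaml. Returns a list of mismatch labels;
# an empty list means every listed file matched.
def checksum_mismatches(dir, checksums)
  algorithms = { "SHA256" => Digest::SHA256, "SHA512" => Digest::SHA512 }
  mismatches = []
  checksums.each do |algo_name, files|
    digest_class = algorithms.fetch(algo_name)
    files.each do |file_name, expected|
      actual = digest_class.file(File.join(dir, file_name)).hexdigest
      mismatches << "#{algo_name} #{file_name}" unless actual == expected
    end
  end
  mismatches
end
```

Called with the unpack directory and the parsed YAML, the helper returns `[]` when both files match under both algorithms, and labels such as `"SHA256 data.tar.gz"` otherwise.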

data/README.md ADDED

@@ -0,0 +1,124 @@
+# Rubita
+Rubita is a transpiler that converts a restricted Ruby DSL into BCC-compatible C code for eBPF programs.
+## Overview
+Rubita lets you write eBPF probes and kernel tracing programs in a Ruby-like syntax, which it transpiles into C code compatible with BCC (BPF Compiler Collection). This makes complex eBPF programs easier to write while keeping Ruby's expressiveness.
+### Supported Features
+- **Map Declarations**: Define eBPF hash maps with `BPF_HASH`
+- **Probe Definitions**: Support for `TRACEPOINT_PROBE`, `KFUNC_PROBE`, `KRETFUNC_PROBE`, and `LSM_PROBE`
+- **Method Definitions**: Define helper functions with `def`
+- **Field Access**: Convert Ruby dot notation (`obj.field`) to C pointer dereference (`obj->field`)
+- **String Literals**: Support format strings with proper escaping
+## Installation
+Add this line to your application's Gemfile:
+```ruby
+gem 'rubita', github: 'udzura/rubita'
+```
+And then execute:
+```bash
+bundle install
+```
+## Basic Conversion
+### Hash Map Declaration
+**Ruby DSL:**
+```ruby
+BPF_HASH :events, key: :u64, value: :u64, size: 1024
+```
+**Generated C:**
+```c
+BPF_HASH(events, u64, u64, 1024);
+```
+### Tracepoint Probe
+**Ruby DSL:**
+```ruby
+TRACEPOINT_PROBE :syscalls, :sys_enter_open do
+  bpf_trace_printk("open syscall\n")
+  0
+end
+```
+**Generated C:**
+```c
+TRACEPOINT_PROBE(syscalls, sys_enter_open) {
+  bpf_trace_printk("open syscall\n");
+  return 0;
+}
+```
+### Kernel Function Probe
+**Ruby DSL:**
+```ruby
+KFUNC_PROBE :vfs_read do
+  bpf_trace_printk("Reading file: %d\n", args.got_bits)
+  0
+end
+```
+**Generated C:**
+```c
+KFUNC_PROBE(vfs_read) {
+  bpf_trace_printk("Reading file: %d\n", args->got_bits);
+  return 0;
+}
+```
+### Helper Functions
+**Ruby DSL:**
+```ruby
+def print_event(_ctx)
+  bpf_trace_printk("Event occurred\n")
+  0
+end
+```
+**Generated C:**
+```c
+int print_event(void *_ctx) {
+  bpf_trace_printk("Event occurred\n");
+  return 0;
+}
+```
+## Usage
+```ruby
+require 'rubita'
+ruby_code = <<~'RUBY'
+  BPF_HASH :counts, key: :u64, value: :u64, size: 10
+  TRACEPOINT_PROBE :syscalls, :sys_enter_openat do
+    bpf_trace_printk("openat\n")
+    0
+  end
+RUBY
+c_code = Rubita.transpile(ruby_code)
+puts c_code
+```
+## Development
+After checking out the repo, run `bin/setup` to install dependencies. Then, run `rake test` to run the tests. You can also run `bin/console` for an interactive prompt that will allow you to experiment.
+## Contributing
+Bug reports and pull requests are welcome on GitHub at https://github.com/udzura/rubita.
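
The README's field-access rule (`obj.field` becomes `obj->field`) can be illustrated directly on Ruby's Ripper parse tree, which is what the transpiler in this package walks. The sketch below is a standalone illustration and not the gem's code; `field_access_to_c` is a hypothetical helper that handles only a single `receiver.field` expression:

```ruby
require "ripper"

# Hypothetical helper (not part of the gem): turn a Ruby `receiver.field`
# expression into the C form `receiver->field` using Ripper's parse tree.
def field_access_to_c(source)
  sexp = Ripper.sexp(source)
  raise ArgumentError, "could not parse: #{source}" if sexp.nil?

  node = sexp[1][0] # first top-level statement
  raise ArgumentError, "expected receiver.field" unless node[0] == :call

  receiver = node[1] # [:vcall, [:@ident, "args", [line, col]]]
  field = node[3]    # [:@ident, "got_bits", [line, col]]
  unless receiver[0] == :vcall && field[0] == :@ident
    raise ArgumentError, "expected receiver.field"
  end

  "#{receiver[1][1]}->#{field[1]}"
end

puts field_access_to_c("args.got_bits") # prints args->got_bits
```

Chained access such as `a.b.c` is rejected here because its receiver is itself a `:call` node and not a bare identifier.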

data/Rakefile ADDED

@@ -0,0 +1,12 @@
+# frozen_string_literal: true
+require "bundler/gem_tasks"
+require "rake/testtask"
+Rake::TestTask.new(:test) do |t|
+  t.libs << "test"
+  t.libs << "lib"
+  t.test_files = FileList["test/**/*_test.rb"]
+end
+task default: :test

data/lib/rubita/transpiler.rb ADDED

@@ -0,0 +1,321 @@
+# frozen_string_literal: true
+module Rubita
+  class Transpiler
+    def transpile(source)
+      sexp = Ripper.sexp(source)
+      raise Error, "failed to parse source" if sexp.nil?
+      nodes = extract_program_nodes(sexp)
+      converted = nodes.map do |node|
+        case node[0]
+        when :def
+          convert_definition(node)
+        when :command
+          convert_top_level_command(node)
+        when :method_add_block
+          convert_top_level_block(node)
+        else
+          raise Error, "unsupported top-level node: #{node[0]}"
+        end
+      end
+      converted.join("\n\n")
+    end
+    private
+    def extract_program_nodes(sexp)
+      return raise Error, "unexpected program structure" unless sexp[0] == :program
+      nodes = sexp[1]
+      return raise Error, "source must contain nodes" unless nodes.is_a?(Array) && !nodes.empty?
+      nodes
+    end
+    def convert_top_level_command(node)
+      ident = node[1]
+      return raise Error, "unsupported command format" unless [:@ident, :@const].include?(ident&.[](0))
+      case ident[1]
+      when "BPF_HASH"
+        convert_hashmap_command(node)
+      else
+        raise Error, "unsupported command: #{ident[1]}"
+      end
+    end
+    def convert_top_level_block(node)
+      call_node = node[1]
+      block_node = node[2]
+      return raise Error, "unsupported block call" unless call_node&.[](0) == :command
+      ident = call_node[1]
+      return raise Error, "unsupported block command format" unless ident&.[](0) == :@const
+      case ident[1]
+      when "TRACEPOINT_PROBE"
+        convert_probe_block(call_node, block_node, "TRACEPOINT_PROBE", 2)
+      when "KFUNC_PROBE"
+        convert_probe_block(call_node, block_node, "KFUNC_PROBE", 1)
+      when "KRETFUNC_PROBE"
+        convert_probe_block(call_node, block_node, "KRETFUNC_PROBE", 1)
+      when "LSM_PROBE"
+        convert_probe_block(call_node, block_node, "LSM_PROBE", 1)
+      else
+        raise Error, "unsupported block command: #{ident[1]}"
+      end
+    end
+    def convert_probe_block(call_node, block_node, macro_name, arg_count)
+      args_add_block = call_node[2]
+      return raise Error, "unsupported #{macro_name} args" unless args_add_block&.[](0) == :args_add_block
+      raw_args = args_add_block[1]
+      return raise Error, "#{macro_name} requires #{arg_count} arguments" unless raw_args.is_a?(Array) && raw_args.size == arg_count
+      macro_args = raw_args.map { |arg| convert_symbol_literal(arg) }
+      return raise Error, "#{macro_name} requires do ... end block" unless block_node&.[](0) == :do_block
+      bodystmt = block_node[2]
+      statements, return_value = extract_body_statements_and_return_from_bodystmt(bodystmt)
+      c_lines = ["#{macro_name}(#{macro_args.join(', ')}) {"]
+      statements.each do |statement|
+        c_lines << "  #{convert_statement(statement)}"
+      end
+      c_lines << "  return #{return_value};"
+      c_lines << "}"
+      c_lines.join("\n")
+    end
+    def convert_hashmap_command(node)
+      args_add_block = node[2]
+      return raise Error, "unsupported hashmap args" unless args_add_block[0] == :args_add_block
+      raw_args = args_add_block[1]
+      map_name = convert_symbol_literal(raw_args[0])
+      options_node = raw_args[1]
+      return raise Error, "hashmap options are required" unless options_node&.[](0) == :bare_assoc_hash
+      options = extract_assoc_hash(options_node)
+      key_type = options["key"] || (raise Error, "hashmap key is required")
+      value_type = options["value"] || (raise Error, "hashmap value is required")
+      size = options["size"]
+      if size
+        "BPF_HASH(#{map_name}, #{key_type}, #{value_type}, #{size});"
+      else
+        "BPF_HASH(#{map_name}, #{key_type}, #{value_type});"
+      end
+    end
+    def extract_assoc_hash(node)
+      pairs = node[1]
+      return raise Error, "invalid hashmap options" unless pairs.is_a?(Array)
+      pairs.each_with_object({}) do |pair, acc|
+        return raise Error, "invalid hashmap option pair" unless pair[0] == :assoc_new
+        label_node = pair[1]
+        value_node = pair[2]
+        return raise Error, "invalid hashmap option key" unless label_node[0] == :@label
+        key = label_node[1].delete_suffix(":")
+        value = convert_hashmap_option_value(value_node)
+        acc[key] = value
+      end
+    end
+    def convert_hashmap_option_value(node)
+      case node[0]
+      when :symbol_literal
+        convert_symbol_literal(node)
+      when :@int
+        node[1]
+      else
+        raise Error, "unsupported hashmap option value: #{node[0]}"
+      end
+    end
+    def convert_symbol_literal(node)
+      symbol_ident = node.dig(1, 1)
+      return raise Error, "unsupported symbol literal" unless symbol_ident&.[](0) == :@ident
+      symbol_ident[1]
+    end
+    def convert_definition(def_node)
+      function_name = extract_function_name(def_node)
+      statements, return_value = extract_body_statements_and_return(def_node)
+      c_lines = ["int #{function_name}(void *_ctx) {"]
+      statements.each do |statement|
+        c_lines << "  #{convert_statement(statement)}"
+      end
+      c_lines << "  return #{return_value};"
+      c_lines << "}"
+      c_lines.join("\n")
+    end
+    def extract_function_name(def_node)
+      ident = def_node[1]
+      return raise Error, "method name is missing" unless ident&.[](0) == :@ident
+      ident[1]
+    end
+    def extract_body_statements_and_return(def_node)
+      bodystmt = def_node[3]
+      extract_body_statements_and_return_from_bodystmt(bodystmt)
+    end
+    def extract_body_statements_and_return_from_bodystmt(bodystmt)
+      stmts = bodystmt[1]
+      return raise Error, "method body is missing" unless stmts.is_a?(Array) && !stmts.empty?
+      return_node = stmts.last
+      return raise Error, "last expression must be integer literal" unless return_node[0] == :@int
+      [stmts[0...-1], return_node[1]]
+    end
+    def convert_statement(statement)
+      case statement[0]
+      when :method_add_arg
+        convert_method_call_statement(statement)
+      else
+        raise Error, "unsupported statement: #{statement[0]}"
+      end
+    end
+    def convert_method_call_statement(statement)
+      call_target = statement[1]
+      arg_part = statement[2]
+      case call_target[0]
+      when :fcall
+        convert_fcall_statement(call_target, arg_part)
+      when :call
+        convert_call_statement(call_target, arg_part)
+      else
+        raise Error, "unsupported call target type: #{call_target[0]}"
+      end
+    end
+    def convert_fcall_statement(call_target, arg_part)
+      method_ident = call_target[1]
+      return raise Error, "unsupported method identifier" unless method_ident[0] == :@ident
+      args = extract_args(arg_part)
+      "#{method_ident[1]}(#{args.join(', ')});"
+    end
+    def convert_call_statement(call_target, arg_part)
+      receiver = call_target[1]
+      method_name_node = call_target[3]
+      return raise Error, "unsupported call method name" unless method_name_node&.[](0) == :@ident
+      method_name = method_name_node[1]
+      # Check if receiver is a global variable
+      if receiver&.[](0) == :var_ref && receiver[1]&.[](0) == :@gvar
+        gvar_name = receiver[1][1].delete_prefix("$")
+        args = extract_args_with_reference(arg_part)
+        "#{gvar_name}.#{method_name}(#{args.join(', ')});"
+      else
+        raise Error, "unsupported call receiver type: #{receiver&.[](0)}"
+      end
+    end
+    def convert_method_call(statement)
+      call_target = statement[1]
+      arg_part = statement[2]
+      return raise Error, "unsupported call target" unless call_target[0] == :fcall
+      method_ident = call_target[1]
+      return raise Error, "unsupported method identifier" unless method_ident[0] == :@ident
+      args = extract_args(arg_part)
+      "#{method_ident[1]}(#{args.join(', ')});"
+    end
+    def extract_args_with_reference(arg_part)
+      return raise Error, "unsupported arg format" unless arg_part[0] == :arg_paren
+      args_add_block = arg_part[1]
+      return raise Error, "unsupported arg list" unless args_add_block[0] == :args_add_block
+      raw_args = args_add_block[1]
+      raw_args.map { |arg| convert_arg_with_reference(arg) }
+    end
+    def convert_arg_with_reference(arg)
+      case arg[0]
+      when :vcall
+        arg_ident = arg[1]
+        return raise Error, "unsupported variable call" unless arg_ident&.[](0) == :@ident
+        "&#{arg_ident[1]}"
+      when :string_literal
+        string_content = arg.dig(1, 1)
+        return raise Error, "unsupported string format" unless string_content&.[](0) == :@tstring_content
+        escaped = escape_c_string(string_content[1])
+        %Q("#{escaped}")
+      when :call
+        convert_call_arg(arg)
+      else
+        raise Error, "unsupported argument type for reference: #{arg[0]}"
+      end
+    end
+    def extract_args(arg_part)
+      return raise Error, "unsupported arg format" unless arg_part[0] == :arg_paren
+      args_add_block = arg_part[1]
+      return raise Error, "unsupported arg list" unless args_add_block[0] == :args_add_block
+      raw_args = args_add_block[1]
+      raw_args.map { |arg| convert_arg(arg) }
+    end
+    def convert_arg(arg)
+      case arg[0]
+      when :string_literal
+        string_content = arg.dig(1, 1)
+        return raise Error, "unsupported string format" unless string_content&.[](0) == :@tstring_content
+        escaped = escape_c_string(string_content[1])
+        %Q("#{escaped}")
+      when :call
+        convert_call_arg(arg)
+      else
+        raise Error, "unsupported argument type: #{arg[0]}"
+      end
+    end
+    def convert_call_arg(call_node)
+      receiver = call_node[1]
+      method_name_node = call_node[3]
+      return raise Error, "unsupported call receiver" unless receiver&.[](0) == :vcall
+      receiver_ident = receiver[1]
+      return raise Error, "unsupported receiver identifier" unless receiver_ident&.[](0) == :@ident
+      return raise Error, "unsupported method name" unless method_name_node&.[](0) == :@ident
+      receiver_name = receiver_ident[1]
+      method_name = method_name_node[1]
+      "#{receiver_name}->#{method_name}"
+    end
+    def escape_c_string(value)
+      value.gsub("\\", "\\\\").gsub('"', '\\"')
+    end
+  end
+end
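
One detail of `escape_c_string` is worth checking. With a string replacement, `String#gsub` treats the backslash as an escape character, so the two-character replacement written `"\\\\"` inserts a single backslash. The sketch below copies the method body into a standalone function and suggests that the first `gsub` leaves backslashes unchanged. That is consistent with the README, where `\n` in the DSL reaches the generated C as `\n`:

```ruby
# Standalone copy of the escaping logic from Rubita::Transpiler#escape_c_string.
def escape_c_string(value)
  value.gsub("\\", "\\\\").gsub('"', '\\"')
end

# A single-quoted Ruby string keeps `\n` as two characters (backslash, n),
# which is how Ripper reports string content taken from source text.
raw = 'open syscall\n'
puts escape_c_string(raw) == raw # prints true: the backslash count is unchanged

# Double quotes do get a backslash prepended.
puts escape_c_string('say "hi"')

# A block replacement is not subject to backslash interpretation, so this
# variant really doubles each backslash.
puts raw.gsub("\\") { "\\\\" }
```

Switching the method to the block form would change the generated C for every format string that contains an escape sequence, so whether the current pass-through behavior is intended is a question for the author.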

data/lib/rubita/version.rb ADDED

@@ -0,0 +1,5 @@
+# frozen_string_literal: true
+module Rubita
+  VERSION = "0.1.0"
+end

data/lib/rubita.rb ADDED

@@ -0,0 +1,14 @@
+# frozen_string_literal: true
+require "ripper"
+require_relative "rubita/version"
+require_relative "rubita/transpiler"
+module Rubita
+  class Error < StandardError; end
+  def self.transpile(source)
+    Transpiler.new.transpile(source)
+  end
+end
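
`Rubita.transpile` hands the source to `Transpiler#transpile`, which dispatches on the first element of each top-level Ripper node (`:command`, `:method_add_block`, or `:def`). A small sketch of how the three DSL forms are classified, using only the standard library; `top_level_node_types` is a hypothetical helper, not part of the gem:

```ruby
require "ripper"

# Hypothetical helper: return the Ripper node type of each top-level statement.
def top_level_node_types(source)
  sexp = Ripper.sexp(source)
  raise ArgumentError, "failed to parse source" if sexp.nil?

  sexp[1].map { |node| node[0] }
end

# The quoted heredoc tag keeps `\n` as a literal backslash-n in the DSL source.
dsl = <<~'RUBY'
  BPF_HASH :counts, key: :u64, value: :u64, size: 10
  TRACEPOINT_PROBE :syscalls, :sys_enter_openat do
    bpf_trace_printk("openat\n")
    0
  end
  def print_event(_ctx)
    0
  end
RUBY

p top_level_node_types(dsl) # prints [:command, :method_add_block, :def]
```

Any other top-level node type, such as an assignment, falls through to the `unsupported top-level node` error in the transpiler.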

data/sig/rubita.rbs ADDED

@@ -0,0 +1,4 @@
+module Rubita
+  VERSION: String
+  # See the writing guide of rbs: https://github.com/ruby/rbs#guides
+end

metadata ADDED

@@ -0,0 +1,49 @@
+--- !ruby/object:Gem::Specification
+name: rubita
+version: !ruby/object:Gem::Version
+  version: 0.1.0
+platform: ruby
+authors:
+- Uchio Kondo
+bindir: exe
+cert_chain: []
+date: 1980-01-02 00:00:00.000000000 Z
+dependencies: []
+description: Rubita transpiles a restricted Ruby DSL into BCC-compatible C code for
+  eBPF use cases.
+email:
+- udzura@udzura.jp
+executables: []
+extensions: []
+extra_rdoc_files: []
+files:
+- README.md
+- Rakefile
+- lib/rubita.rb
+- lib/rubita/transpiler.rb
+- lib/rubita/version.rb
+- sig/rubita.rbs
+homepage: https://github.com/udzura/rubita
+licenses: []
+metadata:
+  allowed_push_host: https://rubygems.org
+  homepage_uri: https://github.com/udzura/rubita
+  source_code_uri: https://github.com/udzura/rubita
+rdoc_options: []
+require_paths:
+- lib
+required_ruby_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: 3.2.0
+required_rubygems_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: '0'
+requirements: []
+rubygems_version: 4.0.6
+specification_version: 4
+summary: Ruby to BCC-compatible C transpiler for eBPF programs
+test_files: []
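
For readers more used to gemspec files than to serialized metadata, the fields above correspond roughly to the following `Gem::Specification`. This is a reconstruction from the YAML only; the actual `rubita.gemspec` in the repository is not part of this diff and may be written differently:

```ruby
require "rubygems"

# Reconstructed from the metadata above; not the gem's real gemspec file.
spec = Gem::Specification.new do |s|
  s.name = "rubita"
  s.version = "0.1.0"
  s.authors = ["Uchio Kondo"]
  s.email = ["udzura@udzura.jp"]
  s.summary = "Ruby to BCC-compatible C transpiler for eBPF programs"
  s.description = "Rubita transpiles a restricted Ruby DSL into BCC-compatible C code for eBPF use cases."
  s.homepage = "https://github.com/udzura/rubita"
  s.required_ruby_version = ">= 3.2.0"
  s.bindir = "exe"
  s.require_paths = ["lib"]
  s.files = %w[
    README.md Rakefile lib/rubita.rb lib/rubita/transpiler.rb
    lib/rubita/version.rb sig/rubita.rbs
  ]
  s.metadata = {
    "allowed_push_host" => "https://rubygems.org",
    "homepage_uri" => "https://github.com/udzura/rubita",
    "source_code_uri" => "https://github.com/udzura/rubita"
  }
end

puts spec.full_name # prints rubita-0.1.0
```

The metadata lists no runtime dependencies and an empty `licenses` array, so the sketch sets neither.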