excadg 0.1.0 → 0.1.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (4) hide show
  1. checksums.yaml +4 -4
  2. data/README.md +189 -0
  3. data/bin/excadg +133 -0
  4. metadata +7 -5
checksums.yaml CHANGED
@@ -1,7 +1,7 @@
1
1
  ---
2
2
  SHA256:
3
- metadata.gz: e1d50ffd84523c400ae4733e67c53d54a791cb066cec4a49dea27b4e7072ecda
4
- data.tar.gz: 4b5bf03ce6c2e9190e5af55a9f740247281da6ed6522c37a7ee3dd3698352dc9
3
+ metadata.gz: 6eaa4affddc5100a387f6faa4006451eac4fe7ca15d03817fd962e25b25c8a79
4
+ data.tar.gz: 607ad2df4d09b404ab728d3397f6a82b41631a9a603412367f66dabbba1e2a1f
5
5
  SHA512:
6
- metadata.gz: 274e5395a4d9ed5da1c5aba9a368638757f1571ac38aede22b0f729a794813c4611e41dfb901a9d555c0a202d7b494b47e077b9c379383a7cb116c2f1dd32ecd
7
- data.tar.gz: e35cd2c39d26791742b1c9ff1255f3c2edb9247b046a09dad62013c6bd9e48fdae54be73e17ae4e1fb430a385a2ec8bc19cfba63a54deffb21b27d59a196da50
6
+ metadata.gz: 9b540b1c30fe6a6a1a124e55a29dd21d68d2cf27631a6bfce4e14fa41fc2708e57587eb1174ba1ec15ce75be66eb7d8b54da392dedfc6afca63fd35fd56bf3ce
7
+ data.tar.gz: c16ae1cf5e5c723cad55a44ff2e16881ab575145cabb2a6ff2d75ba5ab935249cfa79fd22ab4b7a1966adc02cbf03674a57fe1453b8add297b16a7be6ed93303
data/README.md ADDED
@@ -0,0 +1,189 @@
1
+ # Description
2
+
3
+ That's a library (framework) to execute a graph of dependent tasks (vertices).
4
+ Its main feature is to run all possible tasks independently in parallel once they're ready.
5
+ Another feature is that the graph is dynamic and any vertex could produce another vertice(s) to extend execution graph.
6
+
7
+ # Usage
8
+
9
+ ## Tool
10
+
11
+ There is a tool script in root folder called `excadg`. Run `./bin/excadg --help` for available options.
12
+ It allows to make and run basic payload graphs specified by a YAML config. See [config/](config/) folder for sample configs.
13
+
14
+ ## Framework
15
+
16
+ Main usage pattern for this project is a framework. It implies that you:
17
+ - [implement custom payload](#payload-implementation) for vertices
18
+ - write a script or extend your program to [construct a swarm vertices](#vertices-constructing) to execute
19
+
20
+
21
+ ### Payload implementation
22
+
23
+ Payload is a piece of code to execute in a single vertex. This code should be self-contained and use specific ways to communicate with other vertices.
24
+ There is a [{ExcADG::Payload}](lib/excadg/payload.rb) module to make a custom payload class. Read through its documentation to get familiar with its interface.
25
+
26
+ Here is a very basic example of a payload that echoes a message:
27
+
28
+ ```
29
+ class MyPayload
30
+ include ExcADG::Payload
31
+ def get
32
+ lambda { system 'echo here I am' }
33
+ end
34
+ end
35
+ ```
36
+ More payload examples could be found in [{ExcADG::Payload::Example}](lib/excadg/payload/example.rb).
37
+
38
+ Any payload has 3 parts:
39
+ - code
40
+ - arguments
41
+ - dependencies data
42
+
43
+ **Code** is defined in {Payload#get} method. E.g. `system 'echo here I am'` in the above example.
44
+ **Arguments** are defined during payload object contruction (see {Payload#initialize}) as `args:` parameter. E.g. `MyPayload.new args: 'my custom message'`.
45
+ **Dependencies data** is provided by the framework and has all [{ExcADG::VStateData}](lib/excadg/vstate_data.rb) returned by the dependencies.
46
+
47
+ ### Vertices constructing
48
+
49
+ Vertex is a execution graph's building block. It executes its payload when all its dependencies are finished.
50
+ Vertex can be created this way: `Vertex.new name: :myvertex, payload: MyPayload.new`. This vertex doesn't have any dependencies, so it starts execution immediately upon construction. Vertices without dependencies are always aneeded in the graph to start it.
51
+
52
+ Another "kind" of vertices are vertices with dependencies. Dependencies are other vertices that the vertex waits to finish successfully before running its own payload.
53
+ > Example: `Vertex.new name: :other_vertex, payload: MyPayload.new, deps: [:myvertex]` has exactly 1 dependency - vertex called `:myvertex`
54
+ > `:other_vertex` won't start until `:myvertex` finishes.
55
+ > Moreover, as described in [payload](#payload), `:other_vertex`'s payload will have access to data returned by `:myvertex`'s payload.
56
+ > In this case, it'd be what `system 'echo here I am'` returns - `true`.
57
+
58
+ Dependencies could be specified both - using {ExcADG::Vertex} objects and names. E.g.
59
+ ``` ruby
60
+ Broker.run
61
+
62
+ v1 = Vertex.new payload: MyPayload.new
63
+ Vertex.new name: :v2, payload: MyPayload.new
64
+
65
+ Vertex.new name: :final, payload: MyPayload.new, deps: [v1, :v2]
66
+
67
+ Broker.wait_all
68
+ ```
69
+
70
+ *See [Broker section](#broker) for `Broker.run` and `Broker.wait_all` usage.*
71
+
72
+ Using actual objects looks simpler, but it's less convenient, as it requires you to construct all vertices as they appear in the execution graph.
73
+ However, names allows you to spawn vertices in arbitrary order and expect framework to figure execution order on the fly. See [run tool](#tool) as an example of using names.
74
+
75
+ *There is no need to store {ExcADG::Vertex} objects, as it and its data are available through [Broker](#broker)'s [DataStore](#data-store) and there is no interface to communicate with a {ExcADG::Vertex} directly.*
76
+
77
+ # Internals
78
+
79
+ ## Overview
80
+
81
+ This framework allows to spawn vertices. Once created, a vertex with payload starts immediately.
82
+ This means that there is no central place where execution seqence is controlled besides vertex's own mechanism to wait for dependencies.
83
+
84
+ As Ractors doesn't allow to reliably exchange data between each other at the moment, the main Ractor (thread) has to spawn a broker to synchronize data exchange. See [broker](#broker) sections to learn more.
85
+
86
+ This framework is based on [Ractors](https://docs.ruby-lang.org/en/master/ractor_md.html). It could be useful to get familiar with ractors before reading further.
87
+
88
+ ## Vertice processing states
89
+
90
+ Internally, each vertice goes through a sequence of states. Now it's **new**, **ready**, **done** and **failed**. Stages are controlled by the [{ExcADG::StateMachine}](#statemachine).
91
+
92
+ {ExcADG::Vertex} starts as a **new** vertex and waits for its dependencies.
93
+ When vertex received all dependencies states and made sure they're **done**, it becomes **ready** and starts its payload.
94
+ When payload finishes, the vertex transitions to the **done** state.
95
+ If any of the stages fails, the vertex becomes **failed**. It could happen as due to a failed dependency (a stage failed) as well as any other error occurred (e.g. it receives an incorrect data from broker). A single failed vertex makes all vertices depending on it to fail. This way a failures cascading through the graph.
96
+
97
+ ## {ExcADG::Broker}
98
+
99
+ Broker is a central component meant to receive and transmit data between vertices. There are several [{ExcADG::Request}](lib/excadg/request.rb) types broker supports.
100
+
101
+ When a vertex changes its state, it (actually, state machine does that) notifies the broker of its state and sends data (results of the transition) ptp the broker. Broker stores this data in a map and could send it to other vertices by request. Vertices polls their dependencies through broker to know where all dependencies are done or some of them failed.
102
+
103
+ Broker is desired to be as thin as possible to keep most of the work for vertices.
104
+
105
+ Each application should invoke {Broker.run} to enable messages processing.
106
+ Its counterpart is {Broker.wait_all} which waits for all known vertices to reach one of the terminal states (**done** or **failed**) within specified timeout. Same as `Broker.run`, it spawns a thread and returns it. The main application could keep it in background and query or `.join` on it once all vertices are spawned. Once the thread finishes, the main app could lookup vertices execution results in {Broker.data_store} (see [DataStore](#data-store)).
107
+
108
+ > Beware that broker constantly uses main Ractor's ports - incoming and outgoing. Hence, `Ractor#take`, `Ractor.yield` or any other messaging in the main ractor conflict with broker.
109
+
110
+ ## {ExcADG::DataStore}
111
+
112
+ It's a mostly internal [Broker's](#broker) object that holds all vertice's [{ExcADG::VStateData}](lib/excadg/vstate_data.rb).
113
+
114
+ ## {ExcADG::StateMachine}
115
+
116
+ State machine is a small internal mechanism that helps vertices to
117
+ - do state transitions
118
+ - preserve state transition results locally
119
+
120
+ State machine has transitions graph that is common for all vertices. Vertices bind certain logic to transitions. E.g. "wait for dependencies" or "run the payload". State transition mechanism ensures to run workload associated to the currently possible transition, process errors (if any) and send results to [Broker](#broker).
121
+
122
+ ## {ExcADG::Payload}
123
+
124
+ Payload is a special module that carries convention for Vertex-es payload. Only classes that `include ExcADG::Payload` are expected to be providede as payload during vertex creation. _Although, obviously, there are dozens of ways to trick the code._
125
+
126
+ Payload existence is caused by that Ractors require all objects a Ractor uses to be moved (or copied) to it. A pure {ExcADG::Proc} can't be transferred, as it doesn't have allocator and can access outer scope (which has to be transferred too then), what could be impossible due to other non-shareable objects there.
127
+
128
+ To make the interface clearer and more reliable, Payload encloses a `Proc` within its `get` method. As isolated `Proc`s are not useful most of the time, Payload has 2 ways to provide/access the needful for payload to work.
129
+ First way is `args` parameter of the {Payload#initialize}. It can be used on payload creation time to customize payload's behavior. The paremeter is accessible within the labmda through `@args` attribute.
130
+ Second way is built-in. Vertex invokes payload with {ExcADG::Array} of dependencies {ExcADG::VStateData}, if payload's `Proc` is parametrized (has arity 1).
131
+
132
+ > Be mindful about data your payload receives (args) and returns (state data). It could appear incompatible with ractors and cause vertice to fail.
133
+ > Although these failures are hard to tell beforehand, [state machine](#statemachine) processes them gracefully.
134
+
135
+ # Development
136
+
137
+ The project is based on RVM => there is a .ruby-gemset file.
138
+ Bundler is configured to install gems for dev environment, use `bundle install --without dev` to install only modules needed to run the code.
139
+
140
+ ## Gem
141
+
142
+ There is a gem specification. Typical commands:
143
+ - build gem: `gem build excadg.gemspec`
144
+ - install / uninstall gem built locally: `gem install ./excadg*.gem` / `gem uninstall excadg`
145
+ - publish: `gem push excadg*.gem`
146
+
147
+ ## Testing
148
+
149
+ Test are located in [spec/](spec/) folder and are written using rspec.
150
+
151
+ Commands to run tests:
152
+ - `rspec` - run most of the tests
153
+ - `rspec spec/broker_spec.rb` - run tests from the file
154
+ - `rspec spec/broker_spec:32` - run tests that starts on line 32 of the file
155
+ - `rspec --tag perf` - run tests of a specific suite
156
+
157
+ > there is no consistent set of suites, tags are used to exclude mutually incompatible tests or e.g. perfomance tests
158
+ > search for `config.filter_run_excluding` in [spec/spec_helper.rb](spec/spec_helper.rb) to see what tests are disabled by default
159
+
160
+ ### Logging
161
+
162
+ Most of the tests have loggin stubbed to avoid noize. Comment out either `stub_loogging` or `Log.mute` in the respective spec file to enable logging for it.
163
+
164
+ # Docs
165
+
166
+ `yard doc lib/` generates a descent documentation.
167
+
168
+ `yard server --reload` starts a server with documentation available at http://localhost:8808
169
+
170
+ # Next
171
+
172
+ > here is a list of improvemets could be implementex next
173
+
174
+ - add timeouts
175
+ - to payload
176
+ - to whole vertex
177
+ - to request processing
178
+ - implement throttling (:suspended state)
179
+ - limit # of running vertices
180
+ - problem: can't find what to suspend
181
+ - make a loop payload template
182
+ - provide a mechanism to control # of children
183
+ - avoid initializing Log on require
184
+
185
+ ## Graph checks
186
+
187
+ - check for loops in the config
188
+ - check for unreachable islands - graph connectivity
189
+ - check that there are nodes to start from
data/bin/excadg ADDED
@@ -0,0 +1,133 @@
1
+ #!/usr/bin/env ruby
2
+
3
+ require 'yaml'
4
+ require 'optparse'
5
+
6
+ require 'rgl/adjacency'
7
+ require 'rgl/dot'
8
+
9
+ require_relative '../lib/excadg'
10
+
11
+ options = {
12
+ graph: 'config.yaml',
13
+ logfile: :stdout,
14
+ loglevel: Logger::INFO,
15
+ dump: nil,
16
+ gdump: nil,
17
+ timeout: nil
18
+ }
19
+ OptionParser.new { |opts|
20
+ opts.banner = <<~EOD
21
+ tool to run a graph of payloads specified in config
22
+ usage: #{$PROGRAM_NAME} [args]
23
+ EOD
24
+ opts.on('-g', '--graph FILENAME', 'config file name to load, see config/sample.yaml for format', "default: #{options[:graph]}") { |o|
25
+ options[:graph] = o
26
+ }
27
+ opts.on(
28
+ '-l', '--outmode <FILENAME|:silent|:stdout>',
29
+ 'output mode',
30
+ 'app draws TUI if log file is set',
31
+ 'reserved keywords for stdout and silent modes',
32
+ "default: #{options[:logfile]}"
33
+ ) { |o|
34
+ options[:logfile] = o
35
+ }
36
+ opts.on('--loglevel <DEBUG|INFO|WARN|ERROR|FATAL>',
37
+ "log level, default: #{options[:loglevel]}") { |o|
38
+ name = o.upcase.to_sym
39
+ raise "unknown log level #{o}" unless Logger.const_defined? name
40
+
41
+ options[:loglevel] = Logger.const_get name
42
+ }
43
+ opts.on('-d', '--dump FILENAME', 'dump all vertices state data to the file in the end') { |o|
44
+ options[:dump] = o
45
+ }
46
+ opts.on('--gdump FILENAME', 'dump initial execution graph to the file specified') { |o|
47
+ options[:gdump] = o
48
+ }
49
+ opts.on('-t', '--timeout SECONDS', 'for how long to wait for the vertices to execute') { |o|
50
+ options[:timeout] = o.to_i
51
+ }
52
+ }.parse!
53
+
54
+ logfile, ui_drawer =
55
+ case options[:logfile]
56
+ when ':stdout', $stdout, :stdout
57
+ [$stdout, nil]
58
+ when ':silent'
59
+ [nil, nil]
60
+ else # all other strings are considered log file names
61
+ [options[:logfile], ExcADG::Tui]
62
+ end
63
+
64
+ if logfile.nil?
65
+ ExcADG::Log.mute
66
+ else
67
+ ExcADG::Log.logger ExcADG::Log::RLogger.new dest: logfile, level: options[:loglevel]
68
+ end
69
+
70
+ raise "'#{options[:graph]}' config file is not readable" unless File.readable? options[:graph]
71
+
72
+ ExcADG::Log.info 'reading config'
73
+ config = YAML.safe_load_file options[:graph]
74
+
75
+ runnable_vertices = config.select { |k, v| v&.dig('dependencies').nil? }.keys
76
+ raise ArgumentError, 'at least one vertex should be ready to start' if runnable_vertices.empty?
77
+
78
+ ExcADG::Log.info 'collect execution graph from config'
79
+
80
+ all_vertex_names = config.keys.collect!(&:to_sym)
81
+ graph = RGL::DirectedAdjacencyGraph.new
82
+ config.each_pair { |id, vconfig|
83
+ name = id.to_sym
84
+ payload = if vconfig&.key?('command')
85
+ ExcADG::Payload::Wrapper::Bin.new args: vconfig['command']
86
+ elsif vconfig&.key?('sleep')
87
+ ExcADG::Payload::Example::Sleepy.new args: vconfig['sleep'].to_i
88
+ else
89
+ ExcADG::Payload::Example::Echo.new args: vconfig
90
+ end
91
+ deps_v_names = (vconfig&.dig('dependencies') || []).collect(&:to_sym)
92
+ deps_v_names.each { |dep|
93
+ raise "there is no dependency named '#{dep}' in the graph for vertex '#{id}'" unless all_vertex_names.include? dep
94
+
95
+ graph.add_edge name, dep
96
+ }
97
+
98
+ ExcADG::Vertex.new name:, payload:, deps: deps_v_names
99
+ }
100
+
101
+ graph.write_to_graphic_file(options[:gdump].split('.').last, options[:gdump].split('.')[...-1].join('.')) unless options[:gdump].nil?
102
+
103
+ ExcADG::Log.info 'starting state data broker'
104
+ ExcADG::Broker.run
105
+
106
+ ui_drawer&.run
107
+
108
+ ExcADG::Log.info 'watching for all vertices to complete'
109
+ timed_out = false
110
+ begin
111
+ ExcADG::Broker.wait_all(timeout: options[:timeout]).join
112
+ rescue Timeout::Error
113
+ ExcADG::Log.error 'execution timed out'
114
+ timed_out = true
115
+ end
116
+ vertice_by_state = ExcADG::Broker.data_store.to_a.group_by(&:state)
117
+
118
+ ExcADG::Log.info "#{vertice_by_state[:done]&.size || 0} done, #{vertice_by_state[:failed]&.size || 0} failed"
119
+ has_failed = !vertice_by_state[:failed].nil?
120
+ ui_drawer&.summarize has_failed, timed_out
121
+
122
+ unless options[:dump].nil?
123
+ ExcADG::Log.info "writing data to #{options[:dump]}"
124
+ File.open(options[:dump], 'w+') { |f|
125
+ f.write JSON.dump ExcADG::Broker.data_store.to_a
126
+ }
127
+ end
128
+
129
+ sleep 0.1 # let the logger to print message
130
+ exit_code = 0
131
+ exit_code |= 1 if has_failed
132
+ exit_code |= 2 if timed_out
133
+ exit exit_code
metadata CHANGED
@@ -1,7 +1,7 @@
1
1
  --- !ruby/object:Gem::Specification
2
2
  name: excadg
3
3
  version: !ruby/object:Gem::Version
4
- version: 0.1.0
4
+ version: 0.1.2
5
5
  platform: ruby
6
6
  authors:
7
7
  - skorobogatydmitry
@@ -10,15 +10,17 @@ bindir: bin
10
10
  cert_chain: []
11
11
  date: 2024-07-06 00:00:00.000000000 Z
12
12
  dependencies: []
13
- description: "# Description\n\nThat's a library (framework) to execute a graph of
14
- dependent tasks (vertices). \nIts main feature is to run all possible tasks independently
15
- in parallel once they're ready. \nAnother feature is that the graph is dynamic
16
- and any vertex could produce another vertice(s) to extend execution graph.\n"
13
+ description: "\n\nThat's a library (framework) to execute a graph of dependent tasks
14
+ (vertices). \nIts main feature is to run all possible tasks independently in parallel
15
+ once they're ready. \nAnother feature is that the graph is dynamic and any vertex
16
+ could produce another vertice(s) to extend execution graph.\n"
17
17
  email: skorobogaty.dmitry@gmail.com
18
18
  executables: []
19
19
  extensions: []
20
20
  extra_rdoc_files: []
21
21
  files:
22
+ - README.md
23
+ - bin/excadg
22
24
  - lib/excadg.rb
23
25
  - lib/excadg/assertions.rb
24
26
  - lib/excadg/broker.rb