excadg 0.1.0 → 0.1.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/README.md +189 -0
- data/bin/excadg +133 -0
- metadata +7 -5
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
---
|
2
2
|
SHA256:
|
3
|
-
metadata.gz:
|
4
|
-
data.tar.gz:
|
3
|
+
metadata.gz: 6eaa4affddc5100a387f6faa4006451eac4fe7ca15d03817fd962e25b25c8a79
|
4
|
+
data.tar.gz: 607ad2df4d09b404ab728d3397f6a82b41631a9a603412367f66dabbba1e2a1f
|
5
5
|
SHA512:
|
6
|
-
metadata.gz:
|
7
|
-
data.tar.gz:
|
6
|
+
metadata.gz: 9b540b1c30fe6a6a1a124e55a29dd21d68d2cf27631a6bfce4e14fa41fc2708e57587eb1174ba1ec15ce75be66eb7d8b54da392dedfc6afca63fd35fd56bf3ce
|
7
|
+
data.tar.gz: c16ae1cf5e5c723cad55a44ff2e16881ab575145cabb2a6ff2d75ba5ab935249cfa79fd22ab4b7a1966adc02cbf03674a57fe1453b8add297b16a7be6ed93303
|
data/README.md
ADDED
@@ -0,0 +1,189 @@
|
|
1
|
+
# Description
|
2
|
+
|
3
|
+
That's a library (framework) to execute a graph of dependent tasks (vertices).
|
4
|
+
Its main feature is to run all possible tasks independently in parallel once they're ready.
|
5
|
+
Another feature is that the graph is dynamic and any vertex could produce another vertice(s) to extend execution graph.
|
6
|
+
|
7
|
+
# Usage
|
8
|
+
|
9
|
+
## Tool
|
10
|
+
|
11
|
+
There is a tool script in root folder called `excadg`. Run `./bin/excadg --help` for available options.
|
12
|
+
It allows to make and run basic payload graphs specified by a YAML config. See [config/](config/) folder for sample configs.
|
13
|
+
|
14
|
+
## Framework
|
15
|
+
|
16
|
+
Main usage pattern for this project is a framework. It implies that you:
|
17
|
+
- [implement custom payload](#payload-implementation) for vertices
|
18
|
+
- write a script or extend your program to [construct a swarm vertices](#vertices-constructing) to execute
|
19
|
+
|
20
|
+
|
21
|
+
### Payload implementation
|
22
|
+
|
23
|
+
Payload is a piece of code to execute in a single vertex. This code should be self-contained and use specific ways to communicate with other vertices.
|
24
|
+
There is a [{ExcADG::Payload}](lib/excadg/payload.rb) module to make a custom payload class. Read through its documentation to get familiar with its interface.
|
25
|
+
|
26
|
+
Here is a very basic example of a payload that echoes a message:
|
27
|
+
|
28
|
+
```
|
29
|
+
class MyPayload
|
30
|
+
include ExcADG::Payload
|
31
|
+
def get
|
32
|
+
lambda { system 'echo here I am' }
|
33
|
+
end
|
34
|
+
end
|
35
|
+
```
|
36
|
+
More payload examples could be found in [{ExcADG::Payload::Example}](lib/excadg/payload/example.rb).
|
37
|
+
|
38
|
+
Any payload has 3 parts:
|
39
|
+
- code
|
40
|
+
- arguments
|
41
|
+
- dependencies data
|
42
|
+
|
43
|
+
**Code** is defined in {Payload#get} method. E.g. `system 'echo here I am'` in the above example.
|
44
|
+
**Arguments** are defined during payload object contruction (see {Payload#initialize}) as `args:` parameter. E.g. `MyPayload.new args: 'my custom message'`.
|
45
|
+
**Dependencies data** is provided by the framework and has all [{ExcADG::VStateData}](lib/excadg/vstate_data.rb) returned by the dependencies.
|
46
|
+
|
47
|
+
### Vertices constructing
|
48
|
+
|
49
|
+
Vertex is a execution graph's building block. It executes its payload when all its dependencies are finished.
|
50
|
+
Vertex can be created this way: `Vertex.new name: :myvertex, payload: MyPayload.new`. This vertex doesn't have any dependencies, so it starts execution immediately upon construction. Vertices without dependencies are always aneeded in the graph to start it.
|
51
|
+
|
52
|
+
Another "kind" of vertices are vertices with dependencies. Dependencies are other vertices that the vertex waits to finish successfully before running its own payload.
|
53
|
+
> Example: `Vertex.new name: :other_vertex, payload: MyPayload.new, deps: [:myvertex]` has exactly 1 dependency - vertex called `:myvertex`
|
54
|
+
> `:other_vertex` won't start until `:myvertex` finishes.
|
55
|
+
> Moreover, as described in [payload](#payload), `:other_vertex`'s payload will have access to data returned by `:myvertex`'s payload.
|
56
|
+
> In this case, it'd be what `system 'echo here I am'` returns - `true`.
|
57
|
+
|
58
|
+
Dependencies could be specified both - using {ExcADG::Vertex} objects and names. E.g.
|
59
|
+
``` ruby
|
60
|
+
Broker.run
|
61
|
+
|
62
|
+
v1 = Vertex.new payload: MyPayload.new
|
63
|
+
Vertex.new name: :v2, payload: MyPayload.new
|
64
|
+
|
65
|
+
Vertex.new name: :final, payload: MyPayload.new, deps: [v1, :v2]
|
66
|
+
|
67
|
+
Broker.wait_all
|
68
|
+
```
|
69
|
+
|
70
|
+
*See [Broker section](#broker) for `Broker.run` and `Broker.wait_all` usage.*
|
71
|
+
|
72
|
+
Using actual objects looks simpler, but it's less convenient, as it requires you to construct all vertices as they appear in the execution graph.
|
73
|
+
However, names allows you to spawn vertices in arbitrary order and expect framework to figure execution order on the fly. See [run tool](#tool) as an example of using names.
|
74
|
+
|
75
|
+
*There is no need to store {ExcADG::Vertex} objects, as it and its data are available through [Broker](#broker)'s [DataStore](#data-store) and there is no interface to communicate with a {ExcADG::Vertex} directly.*
|
76
|
+
|
77
|
+
# Internals
|
78
|
+
|
79
|
+
## Overview
|
80
|
+
|
81
|
+
This framework allows to spawn vertices. Once created, a vertex with payload starts immediately.
|
82
|
+
This means that there is no central place where execution seqence is controlled besides vertex's own mechanism to wait for dependencies.
|
83
|
+
|
84
|
+
As Ractors doesn't allow to reliably exchange data between each other at the moment, the main Ractor (thread) has to spawn a broker to synchronize data exchange. See [broker](#broker) sections to learn more.
|
85
|
+
|
86
|
+
This framework is based on [Ractors](https://docs.ruby-lang.org/en/master/ractor_md.html). It could be useful to get familiar with ractors before reading further.
|
87
|
+
|
88
|
+
## Vertice processing states
|
89
|
+
|
90
|
+
Internally, each vertice goes through a sequence of states. Now it's **new**, **ready**, **done** and **failed**. Stages are controlled by the [{ExcADG::StateMachine}](#statemachine).
|
91
|
+
|
92
|
+
{ExcADG::Vertex} starts as a **new** vertex and waits for its dependencies.
|
93
|
+
When vertex received all dependencies states and made sure they're **done**, it becomes **ready** and starts its payload.
|
94
|
+
When payload finishes, the vertex transitions to the **done** state.
|
95
|
+
If any of the stages fails, the vertex becomes **failed**. It could happen as due to a failed dependency (a stage failed) as well as any other error occurred (e.g. it receives an incorrect data from broker). A single failed vertex makes all vertices depending on it to fail. This way a failures cascading through the graph.
|
96
|
+
|
97
|
+
## {ExcADG::Broker}
|
98
|
+
|
99
|
+
Broker is a central component meant to receive and transmit data between vertices. There are several [{ExcADG::Request}](lib/excadg/request.rb) types broker supports.
|
100
|
+
|
101
|
+
When a vertex changes its state, it (actually, state machine does that) notifies the broker of its state and sends data (results of the transition) ptp the broker. Broker stores this data in a map and could send it to other vertices by request. Vertices polls their dependencies through broker to know where all dependencies are done or some of them failed.
|
102
|
+
|
103
|
+
Broker is desired to be as thin as possible to keep most of the work for vertices.
|
104
|
+
|
105
|
+
Each application should invoke {Broker.run} to enable messages processing.
|
106
|
+
Its counterpart is {Broker.wait_all} which waits for all known vertices to reach one of the terminal states (**done** or **failed**) within specified timeout. Same as `Broker.run`, it spawns a thread and returns it. The main application could keep it in background and query or `.join` on it once all vertices are spawned. Once the thread finishes, the main app could lookup vertices execution results in {Broker.data_store} (see [DataStore](#data-store)).
|
107
|
+
|
108
|
+
> Beware that broker constantly uses main Ractor's ports - incoming and outgoing. Hence, `Ractor#take`, `Ractor.yield` or any other messaging in the main ractor conflict with broker.
|
109
|
+
|
110
|
+
## {ExcADG::DataStore}
|
111
|
+
|
112
|
+
It's a mostly internal [Broker's](#broker) object that holds all vertice's [{ExcADG::VStateData}](lib/excadg/vstate_data.rb).
|
113
|
+
|
114
|
+
## {ExcADG::StateMachine}
|
115
|
+
|
116
|
+
State machine is a small internal mechanism that helps vertices to
|
117
|
+
- do state transitions
|
118
|
+
- preserve state transition results locally
|
119
|
+
|
120
|
+
State machine has transitions graph that is common for all vertices. Vertices bind certain logic to transitions. E.g. "wait for dependencies" or "run the payload". State transition mechanism ensures to run workload associated to the currently possible transition, process errors (if any) and send results to [Broker](#broker).
|
121
|
+
|
122
|
+
## {ExcADG::Payload}
|
123
|
+
|
124
|
+
Payload is a special module that carries convention for Vertex-es payload. Only classes that `include ExcADG::Payload` are expected to be providede as payload during vertex creation. _Although, obviously, there are dozens of ways to trick the code._
|
125
|
+
|
126
|
+
Payload existence is caused by that Ractors require all objects a Ractor uses to be moved (or copied) to it. A pure {ExcADG::Proc} can't be transferred, as it doesn't have allocator and can access outer scope (which has to be transferred too then), what could be impossible due to other non-shareable objects there.
|
127
|
+
|
128
|
+
To make the interface clearer and more reliable, Payload encloses a `Proc` within its `get` method. As isolated `Proc`s are not useful most of the time, Payload has 2 ways to provide/access the needful for payload to work.
|
129
|
+
First way is `args` parameter of the {Payload#initialize}. It can be used on payload creation time to customize payload's behavior. The paremeter is accessible within the labmda through `@args` attribute.
|
130
|
+
Second way is built-in. Vertex invokes payload with {ExcADG::Array} of dependencies {ExcADG::VStateData}, if payload's `Proc` is parametrized (has arity 1).
|
131
|
+
|
132
|
+
> Be mindful about data your payload receives (args) and returns (state data). It could appear incompatible with ractors and cause vertice to fail.
|
133
|
+
> Although these failures are hard to tell beforehand, [state machine](#statemachine) processes them gracefully.
|
134
|
+
|
135
|
+
# Development
|
136
|
+
|
137
|
+
The project is based on RVM => there is a .ruby-gemset file.
|
138
|
+
Bundler is configured to install gems for dev environment, use `bundle install --without dev` to install only modules needed to run the code.
|
139
|
+
|
140
|
+
## Gem
|
141
|
+
|
142
|
+
There is a gem specification. Typical commands:
|
143
|
+
- build gem: `gem build excadg.gemspec`
|
144
|
+
- install / uninstall gem built locally: `gem install ./excadg*.gem` / `gem uninstall excadg`
|
145
|
+
- publish: `gem push excadg*.gem`
|
146
|
+
|
147
|
+
## Testing
|
148
|
+
|
149
|
+
Test are located in [spec/](spec/) folder and are written using rspec.
|
150
|
+
|
151
|
+
Commands to run tests:
|
152
|
+
- `rspec` - run most of the tests
|
153
|
+
- `rspec spec/broker_spec.rb` - run tests from the file
|
154
|
+
- `rspec spec/broker_spec:32` - run tests that starts on line 32 of the file
|
155
|
+
- `rspec --tag perf` - run tests of a specific suite
|
156
|
+
|
157
|
+
> there is no consistent set of suites, tags are used to exclude mutually incompatible tests or e.g. perfomance tests
|
158
|
+
> search for `config.filter_run_excluding` in [spec/spec_helper.rb](spec/spec_helper.rb) to see what tests are disabled by default
|
159
|
+
|
160
|
+
### Logging
|
161
|
+
|
162
|
+
Most of the tests have loggin stubbed to avoid noize. Comment out either `stub_loogging` or `Log.mute` in the respective spec file to enable logging for it.
|
163
|
+
|
164
|
+
# Docs
|
165
|
+
|
166
|
+
`yard doc lib/` generates a descent documentation.
|
167
|
+
|
168
|
+
`yard server --reload` starts a server with documentation available at http://localhost:8808
|
169
|
+
|
170
|
+
# Next
|
171
|
+
|
172
|
+
> here is a list of improvemets could be implementex next
|
173
|
+
|
174
|
+
- add timeouts
|
175
|
+
- to payload
|
176
|
+
- to whole vertex
|
177
|
+
- to request processing
|
178
|
+
- implement throttling (:suspended state)
|
179
|
+
- limit # of running vertices
|
180
|
+
- problem: can't find what to suspend
|
181
|
+
- make a loop payload template
|
182
|
+
- provide a mechanism to control # of children
|
183
|
+
- avoid initializing Log on require
|
184
|
+
|
185
|
+
## Graph checks
|
186
|
+
|
187
|
+
- check for loops in the config
|
188
|
+
- check for unreachable islands - graph connectivity
|
189
|
+
- check that there are nodes to start from
|
data/bin/excadg
ADDED
@@ -0,0 +1,133 @@
|
|
1
|
+
#!/usr/bin/env ruby
|
2
|
+
|
3
|
+
require 'yaml'
|
4
|
+
require 'optparse'
|
5
|
+
|
6
|
+
require 'rgl/adjacency'
|
7
|
+
require 'rgl/dot'
|
8
|
+
|
9
|
+
require_relative '../lib/excadg'
|
10
|
+
|
11
|
+
options = {
|
12
|
+
graph: 'config.yaml',
|
13
|
+
logfile: :stdout,
|
14
|
+
loglevel: Logger::INFO,
|
15
|
+
dump: nil,
|
16
|
+
gdump: nil,
|
17
|
+
timeout: nil
|
18
|
+
}
|
19
|
+
OptionParser.new { |opts|
|
20
|
+
opts.banner = <<~EOD
|
21
|
+
tool to run a graph of payloads specified in config
|
22
|
+
usage: #{$PROGRAM_NAME} [args]
|
23
|
+
EOD
|
24
|
+
opts.on('-g', '--graph FILENAME', 'config file name to load, see config/sample.yaml for format', "default: #{options[:graph]}") { |o|
|
25
|
+
options[:graph] = o
|
26
|
+
}
|
27
|
+
opts.on(
|
28
|
+
'-l', '--outmode <FILENAME|:silent|:stdout>',
|
29
|
+
'output mode',
|
30
|
+
'app draws TUI if log file is set',
|
31
|
+
'reserved keywords for stdout and silent modes',
|
32
|
+
"default: #{options[:logfile]}"
|
33
|
+
) { |o|
|
34
|
+
options[:logfile] = o
|
35
|
+
}
|
36
|
+
opts.on('--loglevel <DEBUG|INFO|WARN|ERROR|FATAL>',
|
37
|
+
"log level, default: #{options[:loglevel]}") { |o|
|
38
|
+
name = o.upcase.to_sym
|
39
|
+
raise "unknown log level #{o}" unless Logger.const_defined? name
|
40
|
+
|
41
|
+
options[:loglevel] = Logger.const_get name
|
42
|
+
}
|
43
|
+
opts.on('-d', '--dump FILENAME', 'dump all vertices state data to the file in the end') { |o|
|
44
|
+
options[:dump] = o
|
45
|
+
}
|
46
|
+
opts.on('--gdump FILENAME', 'dump initial execution graph to the file specified') { |o|
|
47
|
+
options[:gdump] = o
|
48
|
+
}
|
49
|
+
opts.on('-t', '--timeout SECONDS', 'for how long to wait for the vertices to execute') { |o|
|
50
|
+
options[:timeout] = o.to_i
|
51
|
+
}
|
52
|
+
}.parse!
|
53
|
+
|
54
|
+
logfile, ui_drawer =
|
55
|
+
case options[:logfile]
|
56
|
+
when ':stdout', $stdout, :stdout
|
57
|
+
[$stdout, nil]
|
58
|
+
when ':silent'
|
59
|
+
[nil, nil]
|
60
|
+
else # all other strings are considered log file names
|
61
|
+
[options[:logfile], ExcADG::Tui]
|
62
|
+
end
|
63
|
+
|
64
|
+
if logfile.nil?
|
65
|
+
ExcADG::Log.mute
|
66
|
+
else
|
67
|
+
ExcADG::Log.logger ExcADG::Log::RLogger.new dest: logfile, level: options[:loglevel]
|
68
|
+
end
|
69
|
+
|
70
|
+
raise "'#{options[:graph]}' config file is not readable" unless File.readable? options[:graph]
|
71
|
+
|
72
|
+
ExcADG::Log.info 'reading config'
|
73
|
+
config = YAML.safe_load_file options[:graph]
|
74
|
+
|
75
|
+
runnable_vertices = config.select { |k, v| v&.dig('dependencies').nil? }.keys
|
76
|
+
raise ArgumentError, 'at least one vertex should be ready to start' if runnable_vertices.empty?
|
77
|
+
|
78
|
+
ExcADG::Log.info 'collect execution graph from config'
|
79
|
+
|
80
|
+
all_vertex_names = config.keys.collect!(&:to_sym)
|
81
|
+
graph = RGL::DirectedAdjacencyGraph.new
|
82
|
+
config.each_pair { |id, vconfig|
|
83
|
+
name = id.to_sym
|
84
|
+
payload = if vconfig&.key?('command')
|
85
|
+
ExcADG::Payload::Wrapper::Bin.new args: vconfig['command']
|
86
|
+
elsif vconfig&.key?('sleep')
|
87
|
+
ExcADG::Payload::Example::Sleepy.new args: vconfig['sleep'].to_i
|
88
|
+
else
|
89
|
+
ExcADG::Payload::Example::Echo.new args: vconfig
|
90
|
+
end
|
91
|
+
deps_v_names = (vconfig&.dig('dependencies') || []).collect(&:to_sym)
|
92
|
+
deps_v_names.each { |dep|
|
93
|
+
raise "there is no dependency named '#{dep}' in the graph for vertex '#{id}'" unless all_vertex_names.include? dep
|
94
|
+
|
95
|
+
graph.add_edge name, dep
|
96
|
+
}
|
97
|
+
|
98
|
+
ExcADG::Vertex.new name:, payload:, deps: deps_v_names
|
99
|
+
}
|
100
|
+
|
101
|
+
graph.write_to_graphic_file(options[:gdump].split('.').last, options[:gdump].split('.')[...-1].join('.')) unless options[:gdump].nil?
|
102
|
+
|
103
|
+
ExcADG::Log.info 'starting state data broker'
|
104
|
+
ExcADG::Broker.run
|
105
|
+
|
106
|
+
ui_drawer&.run
|
107
|
+
|
108
|
+
ExcADG::Log.info 'watching for all vertices to complete'
|
109
|
+
timed_out = false
|
110
|
+
begin
|
111
|
+
ExcADG::Broker.wait_all(timeout: options[:timeout]).join
|
112
|
+
rescue Timeout::Error
|
113
|
+
ExcADG::Log.error 'execution timed out'
|
114
|
+
timed_out = true
|
115
|
+
end
|
116
|
+
vertice_by_state = ExcADG::Broker.data_store.to_a.group_by(&:state)
|
117
|
+
|
118
|
+
ExcADG::Log.info "#{vertice_by_state[:done]&.size || 0} done, #{vertice_by_state[:failed]&.size || 0} failed"
|
119
|
+
has_failed = !vertice_by_state[:failed].nil?
|
120
|
+
ui_drawer&.summarize has_failed, timed_out
|
121
|
+
|
122
|
+
unless options[:dump].nil?
|
123
|
+
ExcADG::Log.info "writing data to #{options[:dump]}"
|
124
|
+
File.open(options[:dump], 'w+') { |f|
|
125
|
+
f.write JSON.dump ExcADG::Broker.data_store.to_a
|
126
|
+
}
|
127
|
+
end
|
128
|
+
|
129
|
+
sleep 0.1 # let the logger to print message
|
130
|
+
exit_code = 0
|
131
|
+
exit_code |= 1 if has_failed
|
132
|
+
exit_code |= 2 if timed_out
|
133
|
+
exit exit_code
|
metadata
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
2
2
|
name: excadg
|
3
3
|
version: !ruby/object:Gem::Version
|
4
|
-
version: 0.1.
|
4
|
+
version: 0.1.2
|
5
5
|
platform: ruby
|
6
6
|
authors:
|
7
7
|
- skorobogatydmitry
|
@@ -10,15 +10,17 @@ bindir: bin
|
|
10
10
|
cert_chain: []
|
11
11
|
date: 2024-07-06 00:00:00.000000000 Z
|
12
12
|
dependencies: []
|
13
|
-
description: "
|
14
|
-
|
15
|
-
|
16
|
-
|
13
|
+
description: "\n\nThat's a library (framework) to execute a graph of dependent tasks
|
14
|
+
(vertices). \nIts main feature is to run all possible tasks independently in parallel
|
15
|
+
once they're ready. \nAnother feature is that the graph is dynamic and any vertex
|
16
|
+
could produce another vertice(s) to extend execution graph.\n"
|
17
17
|
email: skorobogaty.dmitry@gmail.com
|
18
18
|
executables: []
|
19
19
|
extensions: []
|
20
20
|
extra_rdoc_files: []
|
21
21
|
files:
|
22
|
+
- README.md
|
23
|
+
- bin/excadg
|
22
24
|
- lib/excadg.rb
|
23
25
|
- lib/excadg/assertions.rb
|
24
26
|
- lib/excadg/broker.rb
|