rubex 0.0.1

Sign up to get free protection for your applications and to get access to all the features.
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA1:
3
+ metadata.gz: f3b95dd14dc7e9058f7f9ff1771fc0f715fdbcc6
4
+ data.tar.gz: b660d0b89243e6a676853738e89729f666c2c4a0
5
+ SHA512:
6
+ metadata.gz: 7dd59c3e0aa6b2e0320226c11c849681a168070d759db2a89d2d4d6012994d775051529cdf3293b94d65d7fc3ccf8e28d0996ab55465e348b50707adcf597bba
7
+ data.tar.gz: fe13cd37f6f736a872a6d2c8b48b10a9079580832c48323cec5e5a5f2f95bd8a3320ffac17a83ed5b45319973e905a563190fdc475895ddafa9c4b4df06eb7fc
@@ -0,0 +1,50 @@
1
+ *.gem
2
+ *.rbc
3
+ /.config
4
+ /coverage/
5
+ /InstalledFiles
6
+ /pkg/
7
+ /spec/reports/
8
+ /spec/examples.txt
9
+ /test/tmp/
10
+ /test/version_tmp/
11
+ /tmp/
12
+
13
+ # Used by dotenv library to load environment variables.
14
+ # .env
15
+
16
+ ## Specific to RubyMotion:
17
+ .dat*
18
+ .repl_history
19
+ build/
20
+ *.bridgesupport
21
+ build-iPhoneOS/
22
+ build-iPhoneSimulator/
23
+
24
+ ## Specific to RubyMotion (use of CocoaPods):
25
+ #
26
+ # We recommend against adding the Pods directory to your .gitignore. However
27
+ # you should judge for yourself, the pros and cons are mentioned at:
28
+ # https://guides.cocoapods.org/using/using-cocoapods.html#should-i-check-the-pods-directory-into-source-control
29
+ #
30
+ # vendor/Pods/
31
+
32
+ ## Documentation cache and generated files:
33
+ /.yardoc/
34
+ /_yardoc/
35
+ /doc/
36
+ /rdoc/
37
+
38
+ ## Environment normalization:
39
+ /.bundle/
40
+ /vendor/bundle
41
+ /lib/bundler/man/
42
+
43
+ # for a library or gem, you might want to ignore these files since the code is
44
+ # intended to run in multiple environments; otherwise, check them in:
45
+ Gemfile.lock
46
+ # .ruby-version
47
+ # .ruby-gemset
48
+
49
+ # unless supporting rvm < 1.11.0 or doing something fancy, ignore this:
50
+ .rvmrc
File without changes
data/Gemfile ADDED
@@ -0,0 +1,2 @@
1
+ source 'https://rubygems.org'
2
+ gemspec
File without changes
data/LICENSE ADDED
@@ -0,0 +1,23 @@
1
+ Copyright (c) 2016, Sameer Deshmukh
2
+ All rights reserved.
3
+
4
+ Redistribution and use in source and binary forms, with or without
5
+ modification, are permitted provided that the following conditions are met:
6
+
7
+ * Redistributions of source code must retain the above copyright notice, this
8
+ list of conditions and the following disclaimer.
9
+
10
+ * Redistributions in binary form must reproduce the above copyright notice,
11
+ this list of conditions and the following disclaimer in the documentation
12
+ and/or other materials provided with the distribution.
13
+
14
+ THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
15
+ AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
16
+ IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
17
+ DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
18
+ FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
19
+ DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
20
+ SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
21
+ CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
22
+ OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
23
+ OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
@@ -0,0 +1,343 @@
1
+ # rubex
2
+ rubex - A Crystal-inspired language for writing Ruby extensions.
3
+
4
+ # Background
5
+
6
+ Rubex aims to make writing C extensions as intuitive as writing Ruby code. A very simple example would be a recursive implementation of a function that computes the factorial of a given number. The method for this is called `factorial` and the class in which it resides is called `Fact`. The code in rubex would like this:
7
+ ``` ruby
8
+ class Fact
9
+ def factorial(i64 n)
10
+ return (n > 1 ? n*factorial(n-1) : 1)
11
+ end
12
+ end
13
+ ```
14
+
15
+ The rubex compiler will compile this code into the equivalent C code, and also make the appropriate calls to the CRuby C API which will perform interconversions between Ruby and C data types and register the class and it's instance method with the interpreter using the appropriate calls to `rb_define_class()` and `rb_define_method()`.
16
+
17
+ Making C extensions in this manner will GREATLY simply the process, and allow for more succint and readable extensions that look very similar to Ruby. To actually see exactly how simple writing extensions will become, the current way of writing the same `factorial` function in the `Fact` class would look something like this with pure C code:
18
+ ``` c
19
+ #include <ruby.h>
20
+
21
+ int
22
+ calc_factorial(int n)
23
+ {
24
+ return (n > 1 ? n*calc_factorial(n-1) : 1);
25
+ }
26
+
27
+ static VALUE
28
+ cfactorial(VALUE self, VALUE n)
29
+ {
30
+ return INT2NUM(calc_factorial(NUM2INT(n)));
31
+ }
32
+
33
+ void Init_factorial()
34
+ {
35
+ VALUE cFact = rb_define_class("Fact", rb_cObject);
36
+ rb_define_method(cFact, "factorial", cfactorial, 1);
37
+ }
38
+ ```
39
+
40
+ Now imagine growing this to solving a non-trivial problem, and the benefits imparted by rubex in terms of productivity and simplicity become increasingly apparent. Users will simply need to call a command or rake task that will generate the relevant C code and create a shared object binary, that can be then imported into any Ruby program with a call to `require`.
41
+
42
+ # Syntax specification
43
+
44
+ WARNING: Specification set to change!
45
+
46
+ #### File extensions
47
+
48
+ Rubex files will have a `.rubex` file extension.
49
+
50
+ #### Comments
51
+
52
+ Rubex will only support single line comments in lines that start with a `#` character, just like Ruby.
53
+
54
+ #### Line termination
55
+
56
+ A line of code can be terminated with a newline (`\n`) character.
57
+
58
+ #### Support for C data types
59
+
60
+ The above example demonstrated the factorial function being used with a 64 bit integer data type, and rubex will support many such data types. The keywords for these data types will be borrowed from Crystal, and they will be translated to the corresponding C types with the rubex compiler. For the first version (v0.1) the following data types will be supported:
61
+
62
+ |rubex keyword | C type | Description |
63
+ |:--- |:--- |:--- |
64
+ |char |char |Character |
65
+ |i8 |int8_t |8 bit integer |
66
+ |i16 |int16_t |16 bit integer |
67
+ |i32 |int32_t |32 bit integer |
68
+ |i64 |int64_t |64 bit integer |
69
+ |u8 |uint8_t |8 bit unsigned integer |
70
+ |u16 |uint16_t |16 bit unsigned integer |
71
+ |u32 |uint32_t |32 bit unsigned integer |
72
+ |u64 |uint64_t |64 bit unsigned integer |
73
+ |int |int | Integer >= 16 bits. |
74
+ |unsigned int |unsigned int| Unsigned integer >= 16 bits. |
75
+ |long int |long int| Integer >= 32 bits.|
76
+ |unsigned long int |unsigned long int|Unsigned Integer >= 32 bits. |
77
+ |long long int |long long int|Integer >= 64 bits.|
78
+ |unsigned long long int|unsigned long long int|Unsigned Integer >= 64 bits.|
79
+ |f32/float |float |32 bit floating point |
80
+ |f64/double |double |64 bit floating point |
81
+ |long f64/long double |long double|Long double >= 96 bits. |
82
+ |object |VALUE |Ruby object |
83
+
84
+ Variables with these data types can be declared by writing the data type keyword before the variable name, and will not follow the Crystal convention. So for example, to declare some integers and floats in rubex, you would do this:
85
+ ```
86
+ i32 int_number
87
+ f64 float_number
88
+ i8 u, i = 33
89
+ ```
90
+ I will use the `stdint.h` header file that provides support for declaring integer types of precise bit length.
91
+
92
+ #### Structs
93
+
94
+ You can define your own C structures using the `struct` keyword. It can contain any sort of data type inside it, just like structs in C. It can also contain references and pointers to itself. To create a struct called 'Node', you can use the following syntax:
95
+ ```
96
+ struct Node do
97
+ int data
98
+ struct Node* next
99
+ end
100
+ # C equivalent:
101
+ # struct Node {
102
+ # int data;
103
+ # struct Node* next;
104
+ # };
105
+ ```
106
+ Varibles of type `Node` can be declared by using `struct Node`, like `struct Node foo`, or pointers with `Pointer(struct Node)` or `struct Node*`.
107
+
108
+ #### Typedefs
109
+
110
+ The `struct` in the example above can also be aliased with the `alias` keyword. The function of `alias` is similar to `typedef` in C. So `alias Node = struct Node` will declarations in the form of `Node foo`. Once a type has been aliased, the original name (`struct Node`) can be used interchangably with the new name (`Node`).
111
+
112
+ #### Unions
113
+
114
+ A C union can be defined with the `union` keyword similar to the way a `struct` is declared. For example,
115
+ ```
116
+ union IntAndFloat do
117
+ i32 a
118
+ f32 b
119
+ end
120
+ ```
121
+
122
+ The `union` must either be aliased to some other user-defined type or must be reffered to by the `union` keyword. So a variable of the above union will be declared as `union IntAndFloat intfloat`.
123
+
124
+ #### Enums
125
+
126
+ A C `enum` can be declared with the `enum` keyword. Each element in the enum will be separated by newlines (`\n`) or commas (`,`), and default values starting from `0` will be assigned to successive enum elements. For example,
127
+ ```
128
+ enum Week do
129
+ monday
130
+ tuesday
131
+ wednesday
132
+ thursday
133
+ friday
134
+ saturday
135
+ sunday
136
+ end
137
+ ```
138
+
139
+ The default values can be changed with an assignment:
140
+ ```
141
+ enum SomeEnum do
142
+ one = 3,
143
+ two = 5,
144
+ end
145
+ ```
146
+
147
+ #### Functions
148
+
149
+ There can basically be two types of functions that will need to be defined in rubex:
150
+
151
+ * _Pure Ruby methods_ that take a Ruby object as the first argument (the `VALUE` in C extensions, see the `cfactorial` method in the above example) and return a Ruby object.
152
+ * _C methods_ that purely take C data types as input and return a C data type (for example the `calc_factorial` function in the above example).
153
+
154
+ These two kinds of methods will have slightly different syntax when it comes to defining methods. Let me elaborate on them both:
155
+
156
+ **Pure Ruby Methods**
157
+
158
+ These will be defined just like normal Ruby methods, but will support typed formal parameters. Internally they will be translated to functions that accept `VALUE` as the first argument and return `VALUE`.
159
+
160
+ To define a method of this kind, the user can use syntax that looks like this:
161
+ ``` ruby
162
+ def plus_one(f32 n)
163
+ n + 1
164
+ end
165
+ ```
166
+
167
+ When a parameter of a Ruby method is declared to have a C type, it passed a Ruby object, which is then converted to a C value if possible. If the data type is not speicified, the function will take a Ruby object as an argument.
168
+
169
+ ONLY THESE METHODS WILL BE CALLABLE FROM INTERPRETED RUBY CODE.
170
+
171
+ **Pure C Methods**
172
+
173
+ These are methods that are only callable from C code. They cannot be called directly from interpreted Ruby code.
174
+
175
+ They also look very similar to the pure Ruby methods, but have a caveat that they must be defined with `cdef` instead of `def` and should also specify a return data type. These methods must specify the type of their formal arguments. Therefore, the syntax for writing a simple addition function would look like this:
176
+ ``` ruby
177
+ cdef f32 plus_one(f32 n)
178
+ n + 1
179
+ end
180
+ ```
181
+ In above example, the function `plus_one` takes a 32 bit floating point number `n` as an argument and returns a 32 bit floating point number after adding `1` to it. If a return type is not specified for a C function, it is assumed to be `VALUE` (Ruby object).
182
+
183
+ These functions will also be compiled such that the last line is actually the return statement, as is the case in Ruby. Pure C methods will only be callable from rubex and cannot be called by external interpreted Ruby code.
184
+
185
+ **Passing functions to other functions**
186
+
187
+ C functions (i.e. those defined with the `cdef` keyword) can be passed to other functions, just like C function pointers. If a C method accepts a function of a particular type signature it can be specified with the `Function()` keyword. The last argument to the `Function()` keyword is the return type of that function. The functionality can also be realized by specifying the function as `input parameters type -> return type`.
188
+
189
+ For example, a function that accepts two arguments, one of type `i32` and the other of type `f64` and returns a varible of type `i32` can be specified by either `Function(i32,f64,i32)` or `i32, f64 -> i32`.
190
+
191
+ #### Variables
192
+
193
+ Any variable can have the `extern` or `static` keyword associated with it in order to declare it so.
194
+
195
+ #### Pointers
196
+
197
+ Rubex will allow declaring pointers with either the `*` operator or the `Pointer()` keyword. For example, an `i32` pointer can be declared as either `*i32` or `Pointer(i32)`. Notice that the 'P' of `Pointer()` is in capitals. This notation will be followed for all compiler directives that take arguments.
198
+
199
+ #### Literals
200
+
201
+ Several literals will be available for creating many basic types in the language. I have listed some of the major ones below:
202
+
203
+ * **nil** - This (`nil`) represent's Ruby's NilClass object, and will be translated to the `Qnil` Ruby constant according to the C API.
204
+ * **Boolean values** - `true` and `false` are the literals for boolean values and will be translated to `Qtrue` and `Qfalse` by the compiler.
205
+ * **Integers** - Rubex will not make any assumptions about variables without an associated type assigned to integers. Thus, in order to create C integers, users must specify the data type of the variable. Therefore, `i = 3` will lead to `i` being compiled as a Ruby `Fixnum` object, and `i32 i = 3` will compile to a C 32 bit integer as `int32_t i = 3`.
206
+ * **Floats** - The assumptions made for integers will apply to floats as well.
207
+ * **Character** - Character `char` literals can be specified by enclosing the character in single quotes (`'` and `'`). These are equivalent to character literals in C.
208
+ * **Strings** - String literals are enclosed inside double quotes (`"` and `"`). When assigning a string literal to a variable, the variable must be of type `char*`. If the type is not specified, it will be treated as a Ruby string.
209
+ * **Symbol** - Rubex symbols use the exact same syntax as Ruby symbols, and will be directly translated to the relevant C API function for creating symbols.
210
+
211
+ #### Arrays
212
+
213
+ C arrays of a definite size (static arrays) can be specified using square brackets after the variable name. For example, to declare an array of 10, 16-bit integers, the syntax would be `i16 arr[10]`. Static arrays can also be declared with the `StaticArray()` keyword. Thus, to declare a C static array of `i8` of size `8`, you can either use`StaticArray(i8, 8)` or `i8[8]`.
214
+
215
+ If a Ruby array is specified using Ruby's Array literal syntax (`[]`), it will be directly translated into a Ruby Array object. For example the statement `a = []` will store a Ruby Array object in the variable 'a'.
216
+
217
+ Static arrays can be initialized with Ruby's array literal syntax. So for example, you can initiazile an array of 10 `i16` integers like this:
218
+ ```
219
+ i16 a[10] = [1,2,3,4,5,6,7,8,9,10]
220
+ # C equivalent -> int16_t a[10] = {1,2,3,4,5,6,7,8,9,10};
221
+ ```
222
+
223
+ #### Loops
224
+
225
+ Rubex will itself define a `for` loop for performing integer iterations. The syntax for this will be as follows:
226
+ ```
227
+ i32 i, s = 0, m = 10
228
+ for s < i <= m do
229
+ # code
230
+ end
231
+ ```
232
+
233
+ If the loop variable `i` and lower and upper bounds are all C integer expressions, this loop will be directly compiled into a C for-loop and will be very fast. The direction of the iteration is determined by the relations. If they are from the `{<,<=}` the iteration is upwards, or if they are from the set `{>,>=}`, the iteration is downwards.
234
+
235
+ Rubex can also translate `.each` method called on `StaticArray` into the equivalent C `for` loops. The only caveat being that since there's no way to dynamically determine the length of C arrays, an argument (an integer) will need to be passed into the `.each` method that will tell Rubex the lenghth. To demonstrate:
236
+ ```
237
+ i16 a[10] = [1,2,3,4,5,6,7,8,9,10]
238
+
239
+ a.each(10) do |x|
240
+ # do something...
241
+ end
242
+
243
+ # C equivalent:
244
+ # int size = 10
245
+ # for(int i = 0; i < size; i++) {
246
+ # i16 x = a[i]
247
+ # // do something....
248
+ # }
249
+ ```
250
+
251
+ #### Wrapping C functions
252
+
253
+ This is the most important functionality of rubex. A lot of it will be borrowed from Crystal since Crystal's API for creating C bindings is very well thought out and simple.
254
+
255
+ **lib**
256
+
257
+ For interfacing any C library with Ruby, rubex will provide the `lib` keyword. The `lib` declaration will basically group C functions and types that belong to a particular library. For example,
258
+ ```
259
+ #@[Link("pcre")]
260
+ lib LibPCRE
261
+ end
262
+ ```
263
+
264
+ The syntax above the `lib` declaration is a special 'magic comments' syntax. The presence of `@[...]` after the `#` of the comments will allow the Rubex compiler to know that the comment is not a regular comment but is actually a directive for an operation that is to be performed by the compiler.
265
+
266
+ The `Link` keyword inside the `@[...]` syntax of the magic comment will ensure that appropriate flags are passed to the compiler to find the external libraries. So for example, it the above case, the `Link("pcre")` directive will `-lpcre` to the linker.
267
+
268
+ If `Link(ldflags: "...")` is passed into the magic comment, those flags will be passed directly to the linker, without any modification, for example `Link(ldflags: "-lpcre")`. Enclosing those commands inside backticks will execute those commands, for example `Link(ldflags: "`pkg-config libpcre --libs`")`.
269
+
270
+ **require_header**
271
+
272
+ The `require_header` keyword will include C headers in the generated C code. For example `require_header 'math'` will put a statement `#include <math.h>`.
273
+
274
+ **fun**
275
+
276
+ A `fun` declaration will bind a C function.
277
+ ```
278
+ require_header 'math'
279
+
280
+ class Maths
281
+ def cos(f32 v)
282
+ CMath.cos(v)
283
+ end
284
+ end
285
+
286
+ lib CMath
287
+ fun f32 cos(f32 value)
288
+ end
289
+ ```
290
+ The user can the call the `cos` function from Ruby with `Maths.new.cos(0.7)`. Calls to `fun` must be inside a `lib` block. This facilitates easy linking and namespacing of C bindings.
291
+
292
+ The parentheses can be omitted if the function does not accept arguments:
293
+ ```
294
+ # In rubex
295
+ lib C
296
+ fun i32 getch
297
+ end
298
+
299
+ # In Ruby
300
+ C.getch
301
+ ```
302
+ If the return type is `void` you can omit it:
303
+ ```
304
+ # In Rubex
305
+ lib C
306
+ fun srand(u32 seed)
307
+ end
308
+
309
+ # In Ruby
310
+ C.srand(1)
311
+ ```
312
+
313
+ Functions with variable arguments can also be bound:
314
+ ```
315
+ # In Rubex
316
+ lib X
317
+ fun i32 variadic(i32 value, ...)
318
+ end
319
+
320
+ # In Ruby
321
+ X.variadic(1, 2, 3, 4)
322
+ ```
323
+
324
+ If a function starts with a name that you don't want to use in your Ruby program, or if it contains characters that cannot be used in Ruby methods, you can assign it a different name with `=`. For example,
325
+ ```
326
+ lib C
327
+ fun cosine = f32 cos(f32 value)
328
+ end
329
+ ```
330
+ Or say the function starts with a capital letter (which is a constant in Ruby and is confusing to use as a method name) or contains an invalid character like `.`, in which case you can wrap it in a string:
331
+ ```
332
+ lib LibSDL
333
+ fun init = u32 SDL_Init(u32 flags)
334
+ end
335
+
336
+ lib LLVMIntrinsics
337
+ fun ceil_f32 = f32 "llvm.ceil.f32"(f32 value)
338
+ end
339
+ ```
340
+
341
+ #### Embedding C code
342
+
343
+ If you must write C, you can do that with a `%{ ... %}` block or a `BEGIN_C do ... end` block.
@@ -0,0 +1,18 @@
1
+ require 'rspec/core/rake_task'
2
+ require 'bundler/gem_tasks'
3
+
4
+ $:.unshift File.expand_path("../lib", __FILE__)
5
+
6
+ Rake.application.rake_require "oedipus_lex"
7
+
8
+ desc "Generate Lexer"
9
+ task :lexer => "lib/rubex/lexer.rex.rb"
10
+
11
+ desc "Generate Parser"
12
+ task :parser => :lexer do
13
+ `racc lib/rubex/parser.racc -o lib/rubex/parser.racc.rb`
14
+ end
15
+
16
+ RSpec::Core::RakeTask.new(:spec)
17
+
18
+ task :spec => :parser