pdftdx 1.0.1 → 1.0.2
Sign up to get free protection for your applications and to get access to all the features.
- checksums.yaml +4 -4
- data/README.md +36 -0
- data/lib/pdftdx/version.rb +1 -1
- metadata +1 -1
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
---
|
2
2
|
SHA1:
|
3
|
-
metadata.gz:
|
4
|
-
data.tar.gz:
|
3
|
+
metadata.gz: fe2edf26823ec9fca9d819ac30a5fc4b3cc9d9f5
|
4
|
+
data.tar.gz: 66972844f0e200198e9eca7bbdbbee8eb2d731a4
|
5
5
|
SHA512:
|
6
|
-
metadata.gz:
|
7
|
-
data.tar.gz:
|
6
|
+
metadata.gz: 7d7c255e8cadbaade03b8a9a3bbec4f15bd3bd7fef333e9e4dc49c90bc6efc455197454816a3f66850428623ca6e565655e2ce5968bf888474d81697d88e02a8
|
7
|
+
data.tar.gz: 267759c85ad5ef6df40bb141f5c7190eb1d09a104fcb5045ba8c0035ef0a48b77c8675a5e795e119e57defd1b58d6bd75eabb184e3dd57a3befa18a6e8d6d811
|
data/README.md
CHANGED
@@ -2,6 +2,42 @@
|
|
2
2
|
|
3
3
|
Simple tool to extract Table Data from PDFs
|
4
4
|
|
5
|
+
## Presentation
|
6
|
+
|
7
|
+
This library is able to understand stuff that looks like tables in PDF files:
|
8
|
+
|
9
|
+
- Table Headers
|
10
|
+
- Table Rows
|
11
|
+
- Sub-Table Names (Partial tables)
|
12
|
+
|
13
|
+
Also, a set of filters are included to ensure that the output produced by the library is "clean" and free of false-positives or unusable / garbage information.
|
14
|
+
|
15
|
+
## Installation
|
16
|
+
|
17
|
+
### Gemfile
|
18
|
+
```ruby
|
19
|
+
gem 'golden_ticket'
|
20
|
+
```
|
21
|
+
|
22
|
+
### Terminal
|
23
|
+
```bash
|
24
|
+
gem install -V pdftdx
|
25
|
+
```
|
26
|
+
|
27
|
+
## Usage example
|
28
|
+
|
29
|
+
Reading a PDF file:
|
30
|
+
```ruby
|
31
|
+
require 'pdftdx'
|
32
|
+
tables = PDFTDX::extract_data 'path to your PDF file'
|
33
|
+
puts tables.inspect
|
34
|
+
```
|
35
|
+
|
36
|
+
Output:
|
37
|
+
```
|
38
|
+
=> [{ head: ['trauma.eresse.net', 'durjaya.dooba.io', 'suessmost.eresse.net'], data: [{ name: 'System', data: [['Machine OS', 'Win32', 'Linux', 'MacOS'], ['IP Address', '10.0.232.48', '10.0.232.134', '10.0.232.108']] }, []] }, []]
|
39
|
+
```
|
40
|
+
|
5
41
|
## License
|
6
42
|
|
7
43
|
The gem is available as open source under the terms of the [MIT License](http://opensource.org/licenses/MIT).
|
data/lib/pdftdx/version.rb
CHANGED