pdftdx 1.0.1 → 1.0.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/README.md +36 -0
- data/lib/pdftdx/version.rb +1 -1
- metadata +1 -1
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
 ---
 SHA1:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: fe2edf26823ec9fca9d819ac30a5fc4b3cc9d9f5
+  data.tar.gz: 66972844f0e200198e9eca7bbdbbee8eb2d731a4
 SHA512:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: 7d7c255e8cadbaade03b8a9a3bbec4f15bd3bd7fef333e9e4dc49c90bc6efc455197454816a3f66850428623ca6e565655e2ce5968bf888474d81697d88e02a8
+  data.tar.gz: 267759c85ad5ef6df40bb141f5c7190eb1d09a104fcb5045ba8c0035ef0a48b77c8675a5e795e119e57defd1b58d6bd75eabb184e3dd57a3befa18a6e8d6d811
data/README.md
CHANGED
@@ -2,6 +2,42 @@
 
 Simple tool to extract Table Data from PDFs
 
+## Presentation
+
+This library is able to understand stuff that looks like tables in PDF files:
+
+- Table Headers
+- Table Rows
+- Sub-Table Names (Partial tables)
+
+Also, a set of filters are included to ensure that the output produced by the library is "clean" and free of false-positives or unusable / garbage information.
+
+## Installation
+
+### Gemfile
+```ruby
+gem 'golden_ticket'
+```
+
+### Terminal
+```bash
+gem install -V pdftdx
+```
+
+## Usage example
+
+Reading a PDF file:
+```ruby
+require 'pdftdx'
+tables = PDFTDX::extract_data 'path to your PDF file'
+puts tables.inspect
+```
+
+Output:
+```
+=> [{ head: ['trauma.eresse.net', 'durjaya.dooba.io', 'suessmost.eresse.net'], data: [{ name: 'System', data: [['Machine OS', 'Win32', 'Linux', 'MacOS'], ['IP Address', '10.0.232.48', '10.0.232.134', '10.0.232.108']] }, []] }, []]
+```
+
 ## License
 
 The gem is available as open source under the terms of the [MIT License](http://opensource.org/licenses/MIT).
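Note on the README's sample output: the nested result it prints can be hard to read at a glance. The sketch below shows one possible way to walk that structure, assuming the shape is exactly what the README's "Output" block shows (a list of tables, each with `head` and `data` keys, plus empty `[]` placeholders). The sample data is copied from the README rather than produced by calling pdftdx, and `flatten_tables` is a hypothetical helper, not part of the gem.

```ruby
# Sample result copied from the README's "Output" block.
# No PDF is read and pdftdx itself is not called here.
tables = [
  { head: ['trauma.eresse.net', 'durjaya.dooba.io', 'suessmost.eresse.net'],
    data: [
      { name: 'System',
        data: [['Machine OS', 'Win32', 'Linux', 'MacOS'],
               ['IP Address', '10.0.232.48', '10.0.232.134', '10.0.232.108']] },
      []
    ] },
  []
]

# Hypothetical helper (not part of pdftdx): flatten the nested result into
# [sub-table name, row label, column header, cell value] records.
def flatten_tables(tables)
  tables.each_with_object([]) do |table, records|
    next unless table.is_a?(Hash) # skip the empty [] placeholders
    table[:data].each do |sub|
      next unless sub.is_a?(Hash) # sub-table lists also contain empty []
      sub[:data].each do |label, *cells|
        # Pair each column header with the cell in the same position.
        table[:head].zip(cells).each do |header, value|
          records << [sub[:name], label, header, value]
        end
      end
    end
  end
end

records = flatten_tables(tables)
records.each { |record| puts record.join(' | ') }
```

Each row's first element is treated as the row label and the remaining cells are paired with the headers by position. That pairing is an assumption read off the sample, not documented behaviour.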
data/lib/pdftdx/version.rb
CHANGED