tolq-parsers-xlsx 0.1.0

checksums.yaml ADDED
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA1:
3
+ metadata.gz: d4914b8dce4cda89edc16b29f75c30b656a70811
4
+ data.tar.gz: 91e6d1876611d6a993e741879b4055811dafffa4
5
+ SHA512:
6
+ metadata.gz: f9b5d81c5532da4ce46fe55e8d6c39e77abb64487ff230ff7efcdef3a6586f3710e04c7cc7824e592bf122fce2cb509e3207f0f3729dd9b19431053b3f6c7d6c
7
+ data.tar.gz: a6a7c598f2ea09dc19b9b5ed3fa413a61e1290395ad08371a52201e0e2a842dc392df00ec3985b1a427a681908550551a4a119b2c8683d52acff47ff9c478f7b
data/.gitignore ADDED
@@ -0,0 +1,9 @@
1
+ /.bundle/
2
+ /.yardoc
3
+ /Gemfile.lock
4
+ /_yardoc/
5
+ /coverage/
6
+ /doc/
7
+ /pkg/
8
+ /spec/reports/
9
+ /tmp/
data/.travis.yml ADDED
@@ -0,0 +1,4 @@
1
+ language: ruby
2
+ rvm:
3
+ - 2.3.0
4
+ before_install: gem install bundler -v 1.11.2
data/CODE_OF_CONDUCT.md ADDED
@@ -0,0 +1,49 @@
1
+ # Contributor Code of Conduct
2
+
3
+ As contributors and maintainers of this project, and in the interest of
4
+ fostering an open and welcoming community, we pledge to respect all people who
5
+ contribute through reporting issues, posting feature requests, updating
6
+ documentation, submitting pull requests or patches, and other activities.
7
+
8
+ We are committed to making participation in this project a harassment-free
9
+ experience for everyone, regardless of level of experience, gender, gender
10
+ identity and expression, sexual orientation, disability, personal appearance,
11
+ body size, race, ethnicity, age, religion, or nationality.
12
+
13
+ Examples of unacceptable behavior by participants include:
14
+
15
+ * The use of sexualized language or imagery
16
+ * Personal attacks
17
+ * Trolling or insulting/derogatory comments
18
+ * Public or private harassment
19
+ * Publishing other's private information, such as physical or electronic
20
+ addresses, without explicit permission
21
+ * Other unethical or unprofessional conduct
22
+
23
+ Project maintainers have the right and responsibility to remove, edit, or
24
+ reject comments, commits, code, wiki edits, issues, and other contributions
25
+ that are not aligned to this Code of Conduct, or to ban temporarily or
26
+ permanently any contributor for other behaviors that they deem inappropriate,
27
+ threatening, offensive, or harmful.
28
+
29
+ By adopting this Code of Conduct, project maintainers commit themselves to
30
+ fairly and consistently applying these principles to every aspect of managing
31
+ this project. Project maintainers who do not follow or enforce the Code of
32
+ Conduct may be permanently removed from the project team.
33
+
34
+ This code of conduct applies both within project spaces and in public spaces
35
+ when an individual is representing the project or its community.
36
+
37
+ Instances of abusive, harassing, or otherwise unacceptable behavior may be
38
+ reported by contacting a project maintainer at timonv@gmail.com. All
39
+ complaints will be reviewed and investigated and will result in a response that
40
+ is deemed necessary and appropriate to the circumstances. Maintainers are
41
+ obligated to maintain confidentiality with regard to the reporter of an
42
+ incident.
43
+
44
+ This Code of Conduct is adapted from the [Contributor Covenant][homepage],
45
+ version 1.3.0, available at
46
+ [http://contributor-covenant.org/version/1/3/0/][version]
47
+
48
+ [homepage]: http://contributor-covenant.org
49
+ [version]: http://contributor-covenant.org/version/1/3/0/
data/Gemfile ADDED
@@ -0,0 +1,4 @@
1
+ source 'https://rubygems.org'
2
+
3
+ # Specify your gem's dependencies in tolq-parsers-xlsx.gemspec
4
+ gemspec
data/LICENSE.txt ADDED
@@ -0,0 +1,21 @@
1
+ The MIT License (MIT)
2
+
3
+ Copyright (c) 2016 Timon Vonk
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in
13
+ all copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
21
+ THE SOFTWARE.
data/README.md ADDED
@@ -0,0 +1,57 @@
1
+ # XLSX
2
+
3
+ This gem converts xlsx files to and from the Tolq API format.
4
+
5
+ ## Installation
6
+
7
+ Add this line to your application's Gemfile:
8
+
9
+ ```ruby
10
+ gem 'tolq-parsers-xlsx'
11
+ ```
12
+
13
+ And then execute:
14
+
15
+ ```shell
16
+ $ bundle
17
+ ```
18
+ Or install it yourself as:
19
+
20
+ ```shell
21
+ $ gem install tolq-parsers-xlsx
22
+ ```
23
+
24
+ ## Usage
25
+
26
+ To parse to the tolq format:
27
+
28
+ ```ruby
29
+ xlsx = File.open('myxlsx.xlsx', 'rb', &:read)
30
+ parser = Tolq::Parsers::XLSX::Parser.new(quality: 'standard', source_language_code: 'en', target_language_code: 'nl') # example language codes
31
+ parser.parse(xlsx)
32
+ ```
33
+
34
+ And to unparse from the tolq format back to xlsx:
35
+
36
+ ```ruby
37
+ unparser = Tolq::Parsers::XLSX::Unparser.new
38
+ unparser.from_tolq_response(tolq_api_response, original: xlsx)
39
+ ```
40
+ ## Development
41
+
42
+ Pointers:
43
+
44
+ * Keep in-code documentation up to date, document all public methods, and use YARD
45
+ * Write your tests
46
+ * Keep readme up to date
47
+ * Make PRs, not war
48
+
49
+ ## Contributing
50
+
51
+ Bug reports and pull requests are welcome on GitHub at https://github.com/tolq/parsers/xlsx. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the [Contributor Covenant](http://contributor-covenant.org) code of conduct.
52
+
53
+
54
+ ## License
55
+
56
+ The gem is available as open source under the terms of the [MIT License](http://opensource.org/licenses/MIT).
57
+
data/Rakefile ADDED
@@ -0,0 +1,10 @@
1
+ require "bundler/gem_tasks"
2
+ require "rake/testtask"
3
+
4
+ Rake::TestTask.new(:test) do |t|
5
+ t.libs << "test"
6
+ t.libs << "lib"
7
+ t.test_files = FileList['test/**/*_test.rb']
8
+ end
9
+
10
+ task :default => :test
data/lib/tolq-parsers-xlsx.rb ADDED
@@ -0,0 +1,10 @@
1
+ require 'rubyXL'
2
+
3
+ module Tolq
4
+ module Parsers
5
+ module XLSX
6
+ end
7
+ end
8
+ end
9
+
10
+ Dir[File.dirname(__FILE__) + '/xlsx/*.rb'].each {|file| require file }
data/lib/xlsx/column_helper.rb ADDED
@@ -0,0 +1,32 @@
1
+ module Tolq::Parsers::XLSX
2
+ module ColumnHelper
3
+ # Converts a zero-indexed column number to its letter name, e.g. 0 => "A", 26 => "AA"
4
+ def self.column_to_char(idx)
5
+ dividend = idx + 1
6
+ column_name = ""
7
+
8
+ while dividend > 0
9
+ modulo = (dividend - 1) % 26
10
+ column_name = (65 + modulo).chr + column_name
11
+ dividend = (dividend - modulo) / 26
12
+ end
13
+
14
+ column_name
15
+ end
16
+
17
+ # Converts a column letter name to its zero-indexed column number, e.g. "A" => 0, "AA" => 26
18
+ def self.char_to_column(char)
19
+ char
20
+ .split('')
21
+ .reverse
22
+ .map.with_index { |c,idx| (c.ord - 64) * (26**idx)}
23
+ .inject(&:+) - 1
24
+ end
25
+
26
+ def self.from_char_notation(key)
27
+ char, row = key.scan(/([A-Z]+)(\d+)/).first
28
+ return row.to_i - 1, ColumnHelper.char_to_column(char)
29
+ end
30
+ end
31
+ end
32
+
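The column naming used by `ColumnHelper` is a bijective base-26 scheme: there is no zero digit, so "Z" is followed by "AA". The following is a minimal standalone sketch of that conversion, not the gem's code. The module name `ColumnSketch` is hypothetical and exists only for this example, and the arithmetic is written slightly differently from the hunk above while aiming for the same results.

```ruby
# Standalone sketch of bijective base-26 column naming (A..Z, AA..AZ, ...).
# ColumnSketch is a hypothetical name used only for this example.
module ColumnSketch
  # Zero-indexed column number to letters.
  def self.column_to_char(idx)
    n = idx + 1
    name = +""
    while n > 0
      rem = (n - 1) % 26          # 0..25, maps to "A".."Z"
      name.prepend((65 + rem).chr)
      n = (n - 1) / 26            # integer division drops the digit just emitted
    end
    name
  end

  # Letters to zero-indexed column number.
  def self.char_to_column(chars)
    chars.each_char.reduce(0) { |acc, c| acc * 26 + (c.ord - 64) } - 1
  end

  # Cell key such as "B3" to [row, column], both zero indexed.
  def self.from_char_notation(key)
    chars, row = key.match(/\A([A-Z]+)(\d+)\z/).captures
    [row.to_i - 1, char_to_column(chars)]
  end
end

puts ColumnSketch.column_to_char(26)               # prints AA
puts ColumnSketch.from_char_notation("B3").inspect # prints [2, 1]
```

Note that the return order is row first, then column, which matches how `from_char_notation` is consumed elsewhere in this release.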
data/lib/xlsx/parser.rb ADDED
@@ -0,0 +1,93 @@
1
+ module Tolq::Parsers::XLSX
2
+ class ParseError < StandardError; end
3
+ class Parser
4
+
5
+ # Helper method to convert an XLSX into a plain hash of cell values. Note that
6
+ # the format differs from the Tolq API request: values are not wrapped in text nodes.
7
+ #
8
+ # @param xlsx_io [String, IO] String or IO of the xlsx
9
+ # @param parse_opts [Hash] Currently unused
10
+ # @return [Hash] Hash representation of the xlsx, e.g. { 'A1' => 'Hello World' }
11
+ def self.to_hash(xlsx_io, **parse_opts)
12
+ extract_request_data_from_xlsx(xlsx_io, text_node: false)
13
+ end
14
+
15
+ # Creates a new xlsx parser
16
+ #
17
+ # @param quality [Symbol] the Tolq quality to order in
18
+ # @param source_language_code [String] The two letter source language code
19
+ # @param target_language_code [String] The two letter target language code
20
+ # @return [Tolq::Parsers::XLSX::Parser]
21
+ def initialize(quality:, source_language_code:, target_language_code:)
22
+ @quality = quality
23
+ @source_language_code = source_language_code
24
+ @target_language_code = target_language_code
25
+ end
26
+
27
+ # Parses an xlsx
28
+ #
29
+ # @param xlsx_io [String, IO] A string or IO object to parse
30
+ # @param parse_opts [Hash] Currently unused
31
+ # @param exclude [Array<String>] Ranges to exclude in Excel notation, e.g. ["A1:A3"]
32
+ # @return [Hash, nil] A hash suitable for conversion to JSON for a Tolq API request, or nil if no cells were extracted
33
+ def parse(xlsx_io, exclude: [], **parse_opts)
34
+ exclude_ranges = to_numeric_exclude_ranges(exclude)
35
+ request_data = self.class.extract_request_data_from_xlsx(xlsx_io, exclude_ranges: exclude_ranges)
36
+ return nil if request_data.empty?
37
+
38
+ {
39
+ "request" => request_data,
40
+ "source_language_code" => @source_language_code,
41
+ "target_language_code" => @target_language_code,
42
+ "quality" => @quality.to_s
43
+ }
44
+ end
45
+
46
+ private
47
+
48
+ def self.extract_request_data_from_xlsx(xlsx_io, text_node: true, exclude_ranges: [])
49
+ workbook = RubyXL::Parser.parse_buffer(xlsx_io)
50
+ # TODO Validate single sheet
51
+ hash = {}
52
+ row_idx = 0
53
+ workbook.worksheets.first.each do |row|
54
+ cell_idx = 0
55
+ row && row.cells.each do |cell|
56
+ unless cell && cell.value # a row may contain nil entries for missing cells
57
+ cell_idx += 1
58
+ next
59
+ end
60
+ if excluded?(cell_idx, row_idx, exclude_ranges)
61
+ cell_idx += 1
62
+ next
63
+ end
64
+ cell_char = ColumnHelper.column_to_char(cell_idx)
65
+ if text_node
66
+ hash["#{cell_char}#{row_idx + 1}"] = { 'text' => cell.value }
67
+ else
68
+ hash["#{cell_char}#{row_idx + 1}"] = cell.value
69
+ end
70
+ cell_idx += 1
71
+ end
72
+ row_idx += 1
73
+ end
74
+ hash
75
+ end
76
+
77
+ def self.excluded?(colidx, rowidx, exclude_ranges)
78
+ !!exclude_ranges.find do |range|
79
+ from, to = range
80
+ colidx >= from.last && colidx <= to.last &&
81
+ rowidx >= from.first && rowidx <= to.first
82
+ end
83
+ end
84
+
85
+ def to_numeric_exclude_ranges(exclude_arr)
86
+ exclude_arr.map do |exrange|
87
+ exrange
88
+ .split(":")
89
+ .map { |k| ColumnHelper.from_char_notation(k) }
90
+ end
91
+ end
92
+ end
93
+ end
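The `exclude` option takes ranges in Excel notation, converts each endpoint to a zero-indexed `[row, column]` pair, and skips any cell that falls inside one of the resulting rectangles. Below is a minimal standalone sketch of that check. `RangeSketch` and its method names are hypothetical, and the fallback for a single-cell entry such as `"C3"` is an assumption added for this example; the hunk above only handles `"from:to"` pairs.

```ruby
# Hypothetical standalone sketch; RangeSketch is not part of the gem.
module RangeSketch
  # Cell key such as "B3" to [row, column], both zero indexed.
  def self.to_coords(key)
    chars, row = key.match(/\A([A-Z]+)(\d+)\z/).captures
    col = chars.each_char.reduce(0) { |acc, c| acc * 26 + (c.ord - 64) } - 1
    [row.to_i - 1, col]
  end

  # Each "from:to" string becomes a pair of [row, column] coordinates.
  def self.to_numeric_ranges(ranges)
    ranges.map { |range| range.split(":").map { |key| to_coords(key) } }
  end

  # True when the cell lies inside any range, bounds included.
  def self.excluded?(col, row, numeric_ranges)
    numeric_ranges.any? do |from, to|
      to ||= from # assumption for this sketch: "C3" alone means a one-cell range
      row.between?(from.first, to.first) && col.between?(from.last, to.last)
    end
  end
end

ranges = RangeSketch.to_numeric_ranges(["A1:B2", "C3"])
puts RangeSketch.excluded?(1, 1, ranges) # B2, prints true
puts RangeSketch.excluded?(2, 0, ranges) # C1, prints false
```

The argument order `(col, row)` mirrors `excluded?` in the parser, while the stored coordinates are `[row, column]`, so `first` is the row and `last` is the column.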
data/lib/xlsx/unparser.rb ADDED
@@ -0,0 +1,33 @@
1
+ module Tolq::Parsers::XLSX
2
+ # Converts a Tolq response back into an XLSX document
3
+ class Unparser
4
+ # Unparses the tolq response
5
+ #
6
+ # @param tolq_response [Hash] A parsed response from our api
7
+ # @return [String] A xlsx formatted string
8
+ def from_tolq_response(tolq_response, original:, **args)
9
+ translations = tolq_response['translations'].values.first
10
+ translate_xlsx(translations, original: original).stream.read
11
+ end
12
+
13
+ # Unparses a translations hash of key => translation
14
+ #
15
+ # @param translations [Hash] A hash of translations with key => translation
16
+ # @return [String] A xlsx formatted string
17
+ def from_hash(translations, original:, **args)
18
+ translate_xlsx(translations, original: original).stream.read
19
+ end
20
+
21
+ private
22
+
23
+ def translate_xlsx(translations, original:)
24
+ workbook = RubyXL::Parser.parse_buffer(original)
25
+ worksheet = workbook.worksheets.first
26
+ translations.each do |k,v|
27
+ row, col = ColumnHelper.from_char_notation(k)
28
+ worksheet[row][col].change_contents(v)
29
+ end
30
+ workbook
31
+ end
32
+ end
33
+ end
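`translate_xlsx` walks the translations hash, resolves each key such as `"A1"` to a row and column, and overwrites that cell in the original workbook. The sketch below illustrates the same idea without rubyXL. A nested Array stands in for the worksheet, and `apply_translations` is a hypothetical helper written for this example, not a method of the gem.

```ruby
# Hypothetical stand-in: a nested Array plays the role of the worksheet.
def apply_translations(grid, translations)
  translations.each do |key, text|
    chars, row = key.match(/\A([A-Z]+)(\d+)\z/).captures
    col = chars.each_char.reduce(0) { |acc, c| acc * 26 + (c.ord - 64) } - 1
    grid[row.to_i - 1][col] = text # overwrite the cell the key points at
  end
  grid
end

grid = [["Hello", "World"], ["Yes", nil]]
apply_translations(grid, "A1" => "Hallo", "B1" => "Wereld")
puts grid.inspect # prints [["Hallo", "Wereld"], ["Yes", nil]]
```

Cells without a matching key keep their original contents, which is why the unparser needs the original file passed in through `original:`.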
data/lib/xlsx/version.rb ADDED
@@ -0,0 +1,7 @@
1
+ module Tolq
2
+ module Parsers
3
+ module XLSX
4
+ VERSION = "0.1.0"
5
+ end
6
+ end
7
+ end
data/tolq-parsers-xlsx.gemspec ADDED
@@ -0,0 +1,29 @@
1
+ # coding: utf-8
2
+ lib = File.expand_path('../lib', __FILE__)
3
+ $LOAD_PATH.unshift(lib) unless $LOAD_PATH.include?(lib)
4
+ require 'xlsx/version'
5
+
6
+ Gem::Specification.new do |spec|
7
+ spec.name = "tolq-parsers-xlsx"
8
+ spec.version = Tolq::Parsers::XLSX::VERSION
9
+ spec.authors = ["Timon Vonk"]
10
+ spec.email = ["timon@tolq.com"]
11
+
12
+ spec.summary = %q{Parses and unparses between Tolq and xlsx}
13
+ spec.description = %q{Parses and unparses between Tolq and xlsx to be used with the Tolq api}
14
+ spec.homepage = "https://github.com/tolq/parsers/xlsx"
15
+ spec.license = "MIT"
16
+
17
+
18
+ spec.files = `git ls-files -z`.split("\x0").reject { |f| f.match(%r{^(test|spec|features)/}) }
19
+ spec.bindir = "bin"
20
+ spec.executables = spec.files.grep(%r{^bin/}) { |f| File.basename(f) }
21
+ spec.require_paths = ["lib"]
22
+
23
+ spec.add_dependency 'rubyXL'
24
+ spec.add_development_dependency "bundler", "~> 1.11"
25
+ spec.add_development_dependency "rake", "~> 10.0"
26
+ spec.add_development_dependency "minitest", "~> 5.0"
27
+ spec.add_development_dependency "pry"
28
+ spec.add_development_dependency "json"
29
+ end
metadata ADDED
@@ -0,0 +1,141 @@
1
+ --- !ruby/object:Gem::Specification
2
+ name: tolq-parsers-xlsx
3
+ version: !ruby/object:Gem::Version
4
+ version: 0.1.0
5
+ platform: ruby
6
+ authors:
7
+ - Timon Vonk
8
+ autorequire:
9
+ bindir: bin
10
+ cert_chain: []
11
+ date: 2017-02-24 00:00:00.000000000 Z
12
+ dependencies:
13
+ - !ruby/object:Gem::Dependency
14
+ name: rubyXL
15
+ requirement: !ruby/object:Gem::Requirement
16
+ requirements:
17
+ - - ">="
18
+ - !ruby/object:Gem::Version
19
+ version: '0'
20
+ type: :runtime
21
+ prerelease: false
22
+ version_requirements: !ruby/object:Gem::Requirement
23
+ requirements:
24
+ - - ">="
25
+ - !ruby/object:Gem::Version
26
+ version: '0'
27
+ - !ruby/object:Gem::Dependency
28
+ name: bundler
29
+ requirement: !ruby/object:Gem::Requirement
30
+ requirements:
31
+ - - "~>"
32
+ - !ruby/object:Gem::Version
33
+ version: '1.11'
34
+ type: :development
35
+ prerelease: false
36
+ version_requirements: !ruby/object:Gem::Requirement
37
+ requirements:
38
+ - - "~>"
39
+ - !ruby/object:Gem::Version
40
+ version: '1.11'
41
+ - !ruby/object:Gem::Dependency
42
+ name: rake
43
+ requirement: !ruby/object:Gem::Requirement
44
+ requirements:
45
+ - - "~>"
46
+ - !ruby/object:Gem::Version
47
+ version: '10.0'
48
+ type: :development
49
+ prerelease: false
50
+ version_requirements: !ruby/object:Gem::Requirement
51
+ requirements:
52
+ - - "~>"
53
+ - !ruby/object:Gem::Version
54
+ version: '10.0'
55
+ - !ruby/object:Gem::Dependency
56
+ name: minitest
57
+ requirement: !ruby/object:Gem::Requirement
58
+ requirements:
59
+ - - "~>"
60
+ - !ruby/object:Gem::Version
61
+ version: '5.0'
62
+ type: :development
63
+ prerelease: false
64
+ version_requirements: !ruby/object:Gem::Requirement
65
+ requirements:
66
+ - - "~>"
67
+ - !ruby/object:Gem::Version
68
+ version: '5.0'
69
+ - !ruby/object:Gem::Dependency
70
+ name: pry
71
+ requirement: !ruby/object:Gem::Requirement
72
+ requirements:
73
+ - - ">="
74
+ - !ruby/object:Gem::Version
75
+ version: '0'
76
+ type: :development
77
+ prerelease: false
78
+ version_requirements: !ruby/object:Gem::Requirement
79
+ requirements:
80
+ - - ">="
81
+ - !ruby/object:Gem::Version
82
+ version: '0'
83
+ - !ruby/object:Gem::Dependency
84
+ name: json
85
+ requirement: !ruby/object:Gem::Requirement
86
+ requirements:
87
+ - - ">="
88
+ - !ruby/object:Gem::Version
89
+ version: '0'
90
+ type: :development
91
+ prerelease: false
92
+ version_requirements: !ruby/object:Gem::Requirement
93
+ requirements:
94
+ - - ">="
95
+ - !ruby/object:Gem::Version
96
+ version: '0'
97
+ description: Parses and unparses between Tolq and xlsx to be used with the Tolq api
98
+ email:
99
+ - timon@tolq.com
100
+ executables: []
101
+ extensions: []
102
+ extra_rdoc_files: []
103
+ files:
104
+ - ".gitignore"
105
+ - ".travis.yml"
106
+ - CODE_OF_CONDUCT.md
107
+ - Gemfile
108
+ - LICENSE.txt
109
+ - README.md
110
+ - Rakefile
111
+ - lib/tolq-parsers-xlsx.rb
112
+ - lib/xlsx/column_helper.rb
113
+ - lib/xlsx/parser.rb
114
+ - lib/xlsx/unparser.rb
115
+ - lib/xlsx/version.rb
116
+ - tolq-parsers-xlsx.gemspec
117
+ homepage: https://github.com/tolq/parsers/xlsx
118
+ licenses:
119
+ - MIT
120
+ metadata: {}
121
+ post_install_message:
122
+ rdoc_options: []
123
+ require_paths:
124
+ - lib
125
+ required_ruby_version: !ruby/object:Gem::Requirement
126
+ requirements:
127
+ - - ">="
128
+ - !ruby/object:Gem::Version
129
+ version: '0'
130
+ required_rubygems_version: !ruby/object:Gem::Requirement
131
+ requirements:
132
+ - - ">="
133
+ - !ruby/object:Gem::Version
134
+ version: '0'
135
+ requirements: []
136
+ rubyforge_project:
137
+ rubygems_version: 2.5.1
138
+ signing_key:
139
+ specification_version: 4
140
+ summary: Parses and unparses between Tolq and xlsx
141
+ test_files: []