furirubi 0.1.0

@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA256:
3
+ metadata.gz: 8057d4efc4dbd9db38370ab4f60cc65d03b85b1d2c782dfaebf221108e743e98
4
+ data.tar.gz: f0881590ea5d0cb013c3f6532072cdb1b16e1ce66b2e730f099835763a7a705e
5
+ SHA512:
6
+ metadata.gz: 6620a8f25ef7945d3e74068ef6b8742f92c18a15f0765c459351680edeae5ef557e877463a15ae271dbbfa81c874594f3c3b972f5a6f1bf20bbe7e338c604fdd
7
+ data.tar.gz: 53a01214733509b9462fdd3e6507e09650204bfc8ea119e9d5ffc0b86c622a55c231ed487789910baa8dbdf6e828eac358441aa73fa7bd014a8899796d58714c
@@ -0,0 +1,11 @@
1
+ /.bundle/
2
+ /.yardoc
3
+ /_yardoc/
4
+ /coverage/
5
+ /doc/
6
+ /pkg/
7
+ /spec/reports/
8
+ /tmp/
9
+
10
+ # rspec failure tracking
11
+ .rspec_status
data/.rspec ADDED
@@ -0,0 +1,3 @@
1
+ --format documentation
2
+ --color
3
+ --require spec_helper
@@ -0,0 +1,6 @@
1
+ ---
2
+ language: ruby
3
+ cache: bundler
4
+ rvm:
5
+ - 2.4.1
6
+ before_install: gem install bundler -v 2.1.2
@@ -0,0 +1,74 @@
1
+ # Contributor Covenant Code of Conduct
2
+
3
+ ## Our Pledge
4
+
5
+ In the interest of fostering an open and welcoming environment, we as
6
+ contributors and maintainers pledge to making participation in our project and
7
+ our community a harassment-free experience for everyone, regardless of age, body
8
+ size, disability, ethnicity, gender identity and expression, level of experience,
9
+ nationality, personal appearance, race, religion, or sexual identity and
10
+ orientation.
11
+
12
+ ## Our Standards
13
+
14
+ Examples of behavior that contributes to creating a positive environment
15
+ include:
16
+
17
+ * Using welcoming and inclusive language
18
+ * Being respectful of differing viewpoints and experiences
19
+ * Gracefully accepting constructive criticism
20
+ * Focusing on what is best for the community
21
+ * Showing empathy towards other community members
22
+
23
+ Examples of unacceptable behavior by participants include:
24
+
25
+ * The use of sexualized language or imagery and unwelcome sexual attention or
26
+ advances
27
+ * Trolling, insulting/derogatory comments, and personal or political attacks
28
+ * Public or private harassment
29
+ * Publishing others' private information, such as a physical or electronic
30
+ address, without explicit permission
31
+ * Other conduct which could reasonably be considered inappropriate in a
32
+ professional setting
33
+
34
+ ## Our Responsibilities
35
+
36
+ Project maintainers are responsible for clarifying the standards of acceptable
37
+ behavior and are expected to take appropriate and fair corrective action in
38
+ response to any instances of unacceptable behavior.
39
+
40
+ Project maintainers have the right and responsibility to remove, edit, or
41
+ reject comments, commits, code, wiki edits, issues, and other contributions
42
+ that are not aligned to this Code of Conduct, or to ban temporarily or
43
+ permanently any contributor for other behaviors that they deem inappropriate,
44
+ threatening, offensive, or harmful.
45
+
46
+ ## Scope
47
+
48
+ This Code of Conduct applies both within project spaces and in public spaces
49
+ when an individual is representing the project or its community. Examples of
50
+ representing a project or community include using an official project e-mail
51
+ address, posting via an official social media account, or acting as an appointed
52
+ representative at an online or offline event. Representation of a project may be
53
+ further defined and clarified by project maintainers.
54
+
55
+ ## Enforcement
56
+
57
+ Instances of abusive, harassing, or otherwise unacceptable behavior may be
58
+ reported by contacting the project team at lddr99@gmail.com. All
59
+ complaints will be reviewed and investigated and will result in a response that
60
+ is deemed necessary and appropriate to the circumstances. The project team is
61
+ obligated to maintain confidentiality with regard to the reporter of an incident.
62
+ Further details of specific enforcement policies may be posted separately.
63
+
64
+ Project maintainers who do not follow or enforce the Code of Conduct in good
65
+ faith may face temporary or permanent repercussions as determined by other
66
+ members of the project's leadership.
67
+
68
+ ## Attribution
69
+
70
+ This Code of Conduct is adapted from the [Contributor Covenant][homepage], version 1.4,
71
+ available at [https://contributor-covenant.org/version/1/4][version]
72
+
73
+ [homepage]: https://contributor-covenant.org
74
+ [version]: https://contributor-covenant.org/version/1/4/
data/Gemfile ADDED
@@ -0,0 +1,10 @@
1
+ source "https://rubygems.org"
2
+
3
+ # Specify your gem's dependencies in furirubi.gemspec
4
+ gemspec
5
+
6
+ gem "rake", "~> 12.0"
7
+ gem "rspec", "~> 3.0"
8
+
9
+ gem "nokogiri", "~> 1.10"
10
+ gem "uri", "~> 0.10.0"
@@ -0,0 +1,40 @@
1
+ PATH
2
+ remote: .
3
+ specs:
4
+ furirubi (0.1.0)
5
+
6
+ GEM
7
+ remote: https://rubygems.org/
8
+ specs:
9
+ diff-lcs (1.3)
10
+ mini_portile2 (2.4.0)
11
+ nokogiri (1.10.9)
12
+ mini_portile2 (~> 2.4.0)
13
+ rake (12.3.3)
14
+ rspec (3.6.0)
15
+ rspec-core (~> 3.6.0)
16
+ rspec-expectations (~> 3.6.0)
17
+ rspec-mocks (~> 3.6.0)
18
+ rspec-core (3.6.0)
19
+ rspec-support (~> 3.6.0)
20
+ rspec-expectations (3.6.0)
21
+ diff-lcs (>= 1.2.0, < 2.0)
22
+ rspec-support (~> 3.6.0)
23
+ rspec-mocks (3.6.0)
24
+ diff-lcs (>= 1.2.0, < 2.0)
25
+ rspec-support (~> 3.6.0)
26
+ rspec-support (3.6.0)
27
+ uri (0.10.0)
28
+
29
+ PLATFORMS
30
+ ruby
31
+
32
+ DEPENDENCIES
33
+ furirubi!
34
+ nokogiri (~> 1.10)
35
+ rake (~> 12.0)
36
+ rspec (~> 3.0)
37
+ uri (~> 0.10.0)
38
+
39
+ BUNDLED WITH
40
+ 2.1.4
@@ -0,0 +1,21 @@
1
+ The MIT License (MIT)
2
+
3
+ Copyright (c) 2020 DaHung
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in
13
+ all copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
21
+ THE SOFTWARE.
@@ -0,0 +1,39 @@
1
+ # Furirubi
2
+ Annotate kanji with furigana and output the result as HTML `<ruby>` markup. Readings are looked up on https://jisho.org/.
3
+
4
+ e.g.
5
+ > 世界 経済 フォーラム は => <ruby>世界<rt>せかい</rt></ruby><ruby>経済<rt>けいざい</rt></ruby>フォーラムは
6
+ >
7
+ > Note: Results may be inaccurate. Verify them before using them for business purposes.
8
+
9
+ ## Installation
10
+
11
+ Add this line to your application's Gemfile:
12
+
13
+ ```ruby
14
+ gem 'furirubi'
15
+ ```
16
+
17
+ And then execute:
18
+
19
+ $ bundle install
20
+
21
+ Or install it yourself as:
22
+
23
+ $ gem install furirubi
24
+
25
+ ## Usage
26
+ ```ruby
27
+ Furirubi.parse('世界 経済 フォーラム は')
28
+
29
+ => "<ruby>世界<rt>せかい</rt></ruby><ruby>経済<rt>けいざい</rt></ruby>フォーラムは"
30
+ ```
31
+
32
+ ## Roadmap
33
+ - [ ] Testing
34
+ - [ ] Support other dictionaries
35
+
36
+ ## License
37
+
38
+ The gem is available as open source under the terms of the [MIT License](https://opensource.org/licenses/MIT).
39
+
@@ -0,0 +1,6 @@
1
+ require "bundler/gem_tasks"
2
+ require "rspec/core/rake_task"
3
+
4
+ RSpec::Core::RakeTask.new(:spec)
5
+
6
+ task :default => :spec
@@ -0,0 +1,14 @@
1
+ #!/usr/bin/env ruby
2
+
3
+ require "bundler/setup"
4
+ require "furirubi"
5
+
6
+ # You can add fixtures and/or initialization code here to make experimenting
7
+ # with your gem easier. You can also use a different console, if you like.
8
+
9
+ # (If you use this, don't forget to add pry to your Gemfile!)
10
+ # require "pry"
11
+ # Pry.start
12
+
13
+ require "irb"
14
+ IRB.start(__FILE__)
@@ -0,0 +1,8 @@
1
+ #!/usr/bin/env bash
2
+ set -euo pipefail
3
+ IFS=$'\n\t'
4
+ set -vx
5
+
6
+ bundle install
7
+
8
+ # Do any other automated setup that you need to do here
@@ -0,0 +1,25 @@
1
+ require_relative 'lib/furirubi/version'
2
+
3
+ Gem::Specification.new do |spec|
4
+ spec.name = "furirubi"
5
+ spec.version = Furirubi::VERSION
6
+ spec.authors = ["DaHung"]
7
+ spec.email = ["lddr99@gmail.com"]
8
+
9
+ spec.summary = %q{Translate kanji to furigana and with the ruby HTML format.}
10
+ spec.homepage = "https://github.com/lddr99/furirubi"
11
+ spec.license = "MIT"
12
+ spec.required_ruby_version = Gem::Requirement.new(">= 2.3.0")
13
+
14
+ spec.metadata["homepage_uri"] = spec.homepage
15
+ spec.metadata["source_code_uri"] = "https://github.com/lddr99/furirubi"
16
+
17
+ # Specify which files should be added to the gem when it is released.
18
+ # The `git ls-files -z` loads the files in the RubyGem that have been added into git.
19
+ spec.files = Dir.chdir(File.expand_path('..', __FILE__)) do
20
+ `git ls-files -z`.split("\x0").reject { |f| f.match(%r{^(test|spec|features)/}) }
21
+ end
22
+ spec.bindir = "exe"
23
+ spec.executables = spec.files.grep(%r{^exe/}) { |f| File.basename(f) }
24
+ spec.require_paths = ["lib"]
25
+ end
@@ -0,0 +1,28 @@
1
+ require 'furirubi/version'
2
+ require 'furirubi/translators/jisho_web_translator'
3
+ require 'furirubi/formators/ruby_html'
4
+
5
+ module Furirubi
6
+ def self.translator
7
+ @translator ||= Furirubi::Translators::JishoWebTranslator.new
8
+ end
9
+
10
+ def self.translator=(trans)
11
+ @translator = trans
12
+ end
13
+
14
+ def self.formator
15
+ @formator ||= Formators::RubyHtml.new
16
+ end
17
+
18
+ def self.formator=(format)
19
+ @formator = format
20
+ end
21
+
22
+ def self.parse(search_term)
23
+ words = translator.translate(search_term)
24
+ words[search_term] = '' if words.size.zero?
25
+
26
+ formator.format(words)
27
+ end
28
+ end
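Because `Furirubi.translator=` and `Furirubi.formator=` accept any object, the jisho.org scraper can presumably be swapped for an offline stand-in, for example in specs. The following is a minimal sketch, not part of the gem: `PluggableSketch`, `StaticTranslator`, and `PlainFormator` are hypothetical names, and the module only mirrors the accessor pattern above so that the example runs without the gem or network access.

```ruby
# Hypothetical stand-in that mirrors the accessor pattern of Furirubi above.
module PluggableSketch
  class << self
    attr_writer :translator, :formator

    def translator
      @translator || raise(ArgumentError, 'no translator configured')
    end

    def formator
      @formator || raise(ArgumentError, 'no formator configured')
    end

    # Same flow as Furirubi.parse: translate, fall back to the raw term, format.
    def parse(search_term)
      words = translator.translate(search_term)
      words[search_term] = '' if words.empty?
      formator.format(words)
    end
  end
end

# Assumption: an offline translator that reads from a fixed table
# instead of querying jisho.org.
class StaticTranslator
  def initialize(readings)
    @readings = readings
  end

  # Returns { word => reading }, using '' when no reading is known.
  def translate(search_term)
    search_term.split.each_with_object({}) do |word, words|
      words[word] = @readings.fetch(word, '')
    end
  end
end

# Assumption: a plain-text formatter that puts the reading in parentheses.
class PlainFormator
  def format(words)
    words.map { |word, reading| reading.empty? ? word : "#{word}(#{reading})" }.join
  end
end

PluggableSketch.translator = StaticTranslator.new('世界' => 'せかい', '経済' => 'けいざい')
PluggableSketch.formator = PlainFormator.new

result = PluggableSketch.parse('世界 経済 フォーラム は')
puts result
```

Any object that responds to `translate(search_term)` and returns a hash of word to reading, or to `format(words)` and returns a string, should fit the same slots.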
@@ -0,0 +1,11 @@
1
+ module Formators
2
+ class RubyHtml
3
+ def format(words)
4
+ ruby_elements = words.map do |key, value|
5
+ value.empty? ? key : "<ruby>#{key}<rt>#{value}</rt></ruby>"
6
+ end
7
+
8
+ ruby_elements.join
9
+ end
10
+ end
11
+ end
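The formatter receives the word-to-reading mapping and relies on the hash's insertion order to reproduce the sentence. One consequence, as far as the code above suggests, is that a word occurring twice collapses into a single key. The sketch below copies the formatting logic under the hypothetical name `RubyHtmlSketch` so that it runs standalone; the array-of-pairs variant is an assumption about a possible alternative, not something the gem does.

```ruby
# Copy of the formatting logic above under a hypothetical name.
class RubyHtmlSketch
  def format(words)
    words.map do |key, value|
      value.empty? ? key : "<ruby>#{key}<rt>#{value}</rt></ruby>"
    end.join
  end
end

formator = RubyHtmlSketch.new

# Distinct words: hash insertion order matches sentence order.
as_hash = { '世界' => 'せかい', '経済' => 'けいざい', 'フォーラム' => '', 'は' => '' }
html = formator.format(as_hash)

# Repeated word: the second occurrence overwrites the first key and is lost.
repeated_hash = {}
[['今日', 'きょう'], ['は', ''], ['今日', 'きょう']].each do |word, reading|
  repeated_hash[word] = reading
end
collapsed = formator.format(repeated_hash)

# An array of [word, reading] pairs keeps every occurrence and works with
# the same block, because map destructures each pair.
pairs = [['今日', 'きょう'], ['は', ''], ['今日', 'きょう']]
kept = formator.format(pairs)

puts html
puts collapsed
puts kept
```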
@@ -0,0 +1,64 @@
1
+ require 'erb'
2
+ require 'open-uri'
3
+ require 'uri'
4
+ require 'nokogiri'
5
+
6
+ module Furirubi
7
+ module Translators
8
+ class JishoWebTranslator
9
+ URL = 'https://jisho.org/search/'.freeze
10
+
11
+ # @tricky: the flag suffix is appended so that jisho.org treats a single word as a sentence
12
+ def translate(search_term, flag = 'flag')
13
+ words = {}
14
+ uri = URI.join(URL, ERB::Util.url_encode(search_term + flag))
15
+
16
+ elements = Nokogiri::HTML.parse(uri.open).css('.japanese_word')
17
+ elements.each do |element|
18
+ result = parse_element(element, flag)
19
+ words.merge!(result) unless result.nil?
20
+ end
21
+
22
+ words
23
+ end
24
+
25
+ private
26
+
27
+ def parse_element(element, flag)
28
+ # parse furigana
29
+ if element.search('.japanese_word__furigana').size.positive?
30
+ furigana_elements = element.search('.japanese_word__furigana')
31
+ return parse_furigana_elements(furigana_elements)
32
+ end
33
+
34
+ # parse kana or symbol
35
+ parse_general_text(element, flag)
36
+ end
37
+
38
+ def parse_furigana_elements(elements)
39
+ words = {}
40
+
41
+ elements.each do |el|
42
+ if el.attr('data-text').empty?
43
+ words[el.text] = ''
44
+ next
45
+ end
46
+
47
+ words[el.attr('data-text')] = el.text
48
+ end
49
+
50
+ words
51
+ end
52
+
53
+ def parse_general_text(element, flag)
54
+ # parse kana
55
+ text = element.search('.japanese_word__text_wrapper a').text
56
+
57
+ # parse symbol
58
+ text = element.search('.japanese_word__text_wrapper').text if text.empty?
59
+
60
+ { text => '' } if text != flag
61
+ end
62
+ end
63
+ end
64
+ end
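The request URL in `translate` is built by percent-encoding the search term together with the flag suffix and resolving the result against the base URL. The following minimal sketch isolates that step using only the standard library; `BASE_URL` and `search_uri` are hypothetical names introduced for illustration, and no request is sent.

```ruby
require 'erb'
require 'uri'

# Same base as the URL constant in the translator above.
BASE_URL = 'https://jisho.org/search/'.freeze

# Percent-encodes the term plus the flag suffix and joins it onto the base
# URL, mirroring the first step of JishoWebTranslator#translate.
def search_uri(search_term, flag = 'flag')
  URI.join(BASE_URL, ERB::Util.url_encode(search_term + flag))
end

uri = search_uri('世界 経済')
puts uri

# Decoding the last path segment recovers the original term plus the flag.
decoded = URI.decode_www_form_component(uri.path.split('/').last)
puts decoded
```

`ERB::Util.url_encode` encodes spaces as `%20` rather than `+`, so the whole term stays within a single path segment.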
@@ -0,0 +1,3 @@
1
+ module Furirubi
2
+ VERSION = "0.1.0"
3
+ end
metadata ADDED
@@ -0,0 +1,61 @@
1
+ --- !ruby/object:Gem::Specification
2
+ name: furirubi
3
+ version: !ruby/object:Gem::Version
4
+ version: 0.1.0
5
+ platform: ruby
6
+ authors:
7
+ - DaHung
8
+ autorequire:
9
+ bindir: exe
10
+ cert_chain: []
11
+ date: 2020-06-23 00:00:00.000000000 Z
12
+ dependencies: []
13
+ description:
14
+ email:
15
+ - lddr99@gmail.com
16
+ executables: []
17
+ extensions: []
18
+ extra_rdoc_files: []
19
+ files:
20
+ - ".gitignore"
21
+ - ".rspec"
22
+ - ".travis.yml"
23
+ - CODE_OF_CONDUCT.md
24
+ - Gemfile
25
+ - Gemfile.lock
26
+ - LICENSE.txt
27
+ - README.md
28
+ - Rakefile
29
+ - bin/console
30
+ - bin/setup
31
+ - furirubi.gemspec
32
+ - lib/furirubi.rb
33
+ - lib/furirubi/formators/ruby_html.rb
34
+ - lib/furirubi/translators/jisho_web_translator.rb
35
+ - lib/furirubi/version.rb
36
+ homepage: https://github.com/lddr99/furirubi
37
+ licenses:
38
+ - MIT
39
+ metadata:
40
+ homepage_uri: https://github.com/lddr99/furirubi
41
+ source_code_uri: https://github.com/lddr99/furirubi
42
+ post_install_message:
43
+ rdoc_options: []
44
+ require_paths:
45
+ - lib
46
+ required_ruby_version: !ruby/object:Gem::Requirement
47
+ requirements:
48
+ - - ">="
49
+ - !ruby/object:Gem::Version
50
+ version: 2.3.0
51
+ required_rubygems_version: !ruby/object:Gem::Requirement
52
+ requirements:
53
+ - - ">="
54
+ - !ruby/object:Gem::Version
55
+ version: '0'
56
+ requirements: []
57
+ rubygems_version: 3.1.2
58
+ signing_key:
59
+ specification_version: 4
60
+ summary: Translate kanji to furigana and with the ruby HTML format.
61
+ test_files: []