wgit 0.10.6 → 0.10.7
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/CHANGELOG.md +9 -0
- data/README.md +1 -1
- data/lib/wgit/document.rb +5 -5
- data/lib/wgit/version.rb +1 -1
- metadata +2 -2
checksums.yaml
CHANGED
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
---
|
|
2
2
|
SHA256:
|
|
3
|
-
metadata.gz:
|
|
4
|
-
data.tar.gz:
|
|
3
|
+
metadata.gz: cd1829ff2dda87e2b88fb738c18ba3a31765a8e3afbe82874e804b58b6c094fb
|
|
4
|
+
data.tar.gz: 8085a6ab3da61aea02fea4bb1f7e7c18caf3edca0d4d11161df3e3a0255293e6
|
|
5
5
|
SHA512:
|
|
6
|
-
metadata.gz:
|
|
7
|
-
data.tar.gz:
|
|
6
|
+
metadata.gz: add6099d1433baebf93b4ad9471a5a35fb0a551e28eded322c46dfa4cfc45eea059284c8645bc2eb8bf33dd91c55060969f15ce6b24567445ffb737ae8a9afc4
|
|
7
|
+
data.tar.gz: 787830ce4f6eea9c7270542c36718c54f48beb7f33e26feb06abbf72d0ad0750943542de051ff7b43caced904ea83550e877631b1ecb936c8f3b6ee211713282
|
data/CHANGELOG.md
CHANGED
|
@@ -9,6 +9,15 @@
|
|
|
9
9
|
- ...
|
|
10
10
|
---
|
|
11
11
|
|
|
12
|
+
## v0.10.7
|
|
13
|
+
### Added
|
|
14
|
+
- ...
|
|
15
|
+
### Changed/Removed
|
|
16
|
+
- ...
|
|
17
|
+
### Fixed
|
|
18
|
+
- Security vulnerabilities by updating gem dependencies.
|
|
19
|
+
---
|
|
20
|
+
|
|
12
21
|
## v0.10.6
|
|
13
22
|
### Added
|
|
14
23
|
- `Wgit::DSL` method `#crawl_url` (aliased to `#crawl`).
|
data/README.md
CHANGED
|
@@ -18,7 +18,7 @@ Wgit was primarily designed to crawl static HTML websites to index and search t
|
|
|
18
18
|
|
|
19
19
|
Wgit provides a high level, easy-to-use API and DSL that you can use in your own applications and scripts.
|
|
20
20
|
|
|
21
|
-
Check out this [demo search engine](https://search-engine
|
|
21
|
+
Check out this [demo search engine](https://wgit-search-engine.fly.dev) - [built](https://github.com/michaeltelford/search_engine) using Wgit and Sinatra - deployed to [fly.io](https://fly.io). Try searching for something that's Ruby related like "Matz" or "Rails".
|
|
22
22
|
|
|
23
23
|
## Table Of Contents
|
|
24
24
|
|
data/lib/wgit/document.rb
CHANGED
|
@@ -89,9 +89,9 @@ module Wgit
|
|
|
89
89
|
#
|
|
90
90
|
# @return [String] An xpath String to obtain a webpage's text elements.
|
|
91
91
|
def self.text_elements_xpath
|
|
92
|
-
Wgit::Document.text_elements.each_with_index.reduce(
|
|
93
|
-
xpath +=
|
|
94
|
-
xpath += format(
|
|
92
|
+
Wgit::Document.text_elements.each_with_index.reduce('') do |xpath, (el, i)|
|
|
93
|
+
xpath += ' | ' unless i.zero?
|
|
94
|
+
xpath += format('//%s/text()', el)
|
|
95
95
|
end
|
|
96
96
|
end
|
|
97
97
|
|
|
@@ -505,7 +505,7 @@ be relative"
|
|
|
505
505
|
# parameter.
|
|
506
506
|
#
|
|
507
507
|
# @param xpath [String, #call] Used to find the value/object in @html.
|
|
508
|
-
# @param singleton [Boolean] singleton ? results.first (single Object) :
|
|
508
|
+
# @param singleton [Boolean] singleton ? results.first (single Object) :
|
|
509
509
|
# results (Enumerable).
|
|
510
510
|
# @param text_content_only [Boolean] text_content_only ? result.content
|
|
511
511
|
# (String) : result (Nokogiri Object).
|
|
@@ -546,7 +546,7 @@ be relative"
|
|
|
546
546
|
# parameter.
|
|
547
547
|
#
|
|
548
548
|
# @param xpath [String, #call] Used to find the value/object in @html.
|
|
549
|
-
# @param singleton [Boolean] singleton ? results.first (single Object) :
|
|
549
|
+
# @param singleton [Boolean] singleton ? results.first (single Object) :
|
|
550
550
|
# results (Enumerable).
|
|
551
551
|
# @param text_content_only [Boolean] text_content_only ? result.content
|
|
552
552
|
# (String) : result (Nokogiri Object).
|
data/lib/wgit/version.rb
CHANGED
metadata
CHANGED
|
@@ -1,14 +1,14 @@
|
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
|
2
2
|
name: wgit
|
|
3
3
|
version: !ruby/object:Gem::Version
|
|
4
|
-
version: 0.10.
|
|
4
|
+
version: 0.10.7
|
|
5
5
|
platform: ruby
|
|
6
6
|
authors:
|
|
7
7
|
- Michael Telford
|
|
8
8
|
autorequire:
|
|
9
9
|
bindir: bin
|
|
10
10
|
cert_chain: []
|
|
11
|
-
date: 2022-
|
|
11
|
+
date: 2022-12-01 00:00:00.000000000 Z
|
|
12
12
|
dependencies:
|
|
13
13
|
- !ruby/object:Gem::Dependency
|
|
14
14
|
name: addressable
|