curlyq 0.0.4 → 0.0.5

checksums.yaml CHANGED
@@ -1,7 +1,7 @@
  ---
  SHA256:
- metadata.gz: '091e39001a4456eef85fa25e97281c6218e3383619d37a14a66ca9bd41fee9ab'
- data.tar.gz: c287e095d4f9525e924d08cd580c37ab19448ae93bb384d419526b60cf895493
+ metadata.gz: 2c5eb3f9a5444f19c44362545b302e3889c4e25dc34d9180452a736b1b80bc34
+ data.tar.gz: 3bf8d1009f493b60c31efb3636c64aa8871656dbcd9cebbeb01800d30fd0761c
  SHA512:
- metadata.gz: e51492325696e09319666ee29b753472f555129dc7664f34a08b1ae30dd319cdb5518727030d2d7bbaa1dfaefc6dd14ce4ccf2756243238dd885d37e2dbffbb4
- data.tar.gz: 27c006a7433cd9bd9208cc0aa69c0029974ca98524ecf3ee323bad00cc09490acfbfd5b4e61a65e22d5f750ebe05fa03d5d1bae1d1b977bf1028ed2aa21e2ee7
+ metadata.gz: 808d8122080450acee5e98e0a6338e887ba5b6e3306764dab79c713052c6e5f6749d8b4ef90f43fcdc2cc7da41766f40e6684e0e40d2de98055e2d71986ac0e8
+ data.tar.gz: d4e17b0cc425cbf7a704cdd188e36f734707cd885a097c6e99cb0f8bc0089e46ffdd99d1e15844a981bbfd9a205778178e45dcaa637cd8a7e761432f2610991e
data/.gitignore CHANGED
@@ -1 +1,2 @@
  html
+ *.bak
data/.irbrc ADDED
@@ -0,0 +1,4 @@
+ $LOAD_PATH.unshift File.join(__dir__, 'lib')
+ require_relative 'lib/curly'
+ include Curly
+
data/CHANGELOG.md CHANGED
@@ -1,3 +1,22 @@
+ ### 0.0.5
+
+ 2024-01-11 18:06
+
+ #### IMPROVED
+
+ - Add --query capabilities to images command
+ - Add --query to links command
+ - Allow hyphens in query syntax
+ - Allow any character other than comma, ampersand, or right square bracket in query value
+
+ #### FIXED
+
+ - Html --search returns a full Curl::Html object
+ - --query works better with --search and is consistent with other query functions
+ - Scrape command outputting malformed data
+ - Hash output when --query is used with scrape
+ - Nil match on tags command
+
  ### 0.0.4

  2024-01-10 13:54
data/Gemfile.lock CHANGED
@@ -1,7 +1,7 @@
  PATH
    remote: .
    specs:
-     curlyq (0.0.4)
+     curlyq (0.0.5)
      gli (~> 2.21.0)
      nokogiri (~> 1.16.0)
      selenium-webdriver (~> 4.16.0)
data/README.md CHANGED
@@ -10,7 +10,7 @@ _If you find this useful, feel free to [buy me some coffee][donate]._
  [donate]: https://brettterpstra.com/donate


- The current version of `curlyq` is 0.0.4
+ The current version of `curlyq` is 0.0.5
  .

  CurlyQ is a utility that provides a simple interface for curl, with additional features for things like extracting images and links, finding elements by CSS selector or XPath, getting detailed header info, and more. It's designed to be part of a scripting pipeline, outputting everything as structured data (JSON or YAML). It also has rudimentary support for making calls to JSON endpoints easier, but it's expected that you'll use something like `jq` to parse the output.
@@ -44,7 +44,7 @@ SYNOPSIS
  curlyq [global options] command [command options] [arguments...]

  VERSION
-     0.0.4
+     0.0.5

  GLOBAL OPTIONS
      --help - Show this message
@@ -65,12 +65,41 @@ COMMANDS
      tags - Extract all instances of a tag
  ```

+ ### Query and Search syntax
+
+ You can shape the results using `--search` (`-s`) and `--query` (`-q`) on some commands.
+
+ A search uses either CSS or XPath syntax to locate elements. For example, if you wanted to locate all of the `<article>` elements with a class of `post` inside of the div with an id of `main`, you would run `--search '#main article.post'`. Searches can target tags, ids, and classes, and can accept `>` to target direct descendants. You can also use XPaths, but I hate those so I'm not going to document them.
+
+ Queries are specifically for shaping CurlyQ output. If you're using the `html` command, it returns a key called `images`, so you can target just the images in the response with `-q 'images'`. The queries accept array syntax, so to get the first image, you would use `-q 'images[0]'`. Ranges are accepted as well, so `-q 'images[1..4]'` will return the 2nd through 5th images found on the page. You can also do comparisons, e.g. `-q 'images[rel=me]'`, to target only images with a `rel` attribute of `me`.
+
+ The comparisons for the query flag are:
+
+ - `<` less than
+ - `>` greater than
+ - `<=` less than or equal to
+ - `>=` greater than or equal to
+ - `=` or `==` is equal to
+ - `*=` contains text
+ - `^=` starts with text
+ - `$=` ends with text
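+
+ Comparisons can be combined with `&` to require that every condition matches, as the scrape example later in this README does. A sketch, assuming the target page marks identity links up with `rel="me"`:
+
+     curlyq links -q '[rel=me&content*=mastodon][0]' https://brettterpstra.com
+
+ This would return the first link whose `rel` contains `me` and whose text contains `mastodon`.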
  #### Commands

  curlyq makes use of subcommands, e.g. `curlyq html [options] URL` or `curlyq extract [options] URL`. Each subcommand takes its own options, but I've made an effort to standardize the choices between each command as much as possible.

  ##### extract

+ Example:
+
+     curlyq extract -i -b 'Adding' -a 'accessing the source.' 'https://stackoverflow.com/questions/52428409/get-fully-rendered-html-using-selenium-webdriver-and-python'
+
+     [
+       "Adding <code>time.sleep(10)</code> in various places in case the page had not fully loaded when I was accessing the source."
+     ]
+
+ This specifies a before and after string and includes them (`-i`) in the result.
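+
+ The new `-r` (`--regex`) switch treats the before and after strings as regular expressions rather than literal text. A sketch, assuming the target page wraps sections in `<h2>` headings:
+
+     curlyq extract -r -b '<h2[^>]*>' -a '</h2>' 'https://brettterpstra.com'
+
+ Without `-r`, both strings are escaped and matched literally.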
  ```
  NAME
      extract - Extract contents between two regular expressions
@@ -80,17 +109,32 @@ SYNOPSIS
  curlyq [global options] extract [command options] URL...

  COMMAND OPTIONS
-     -a, --after=arg - Text after extraction, parsed as regex (default: none)
-     -b, --before=arg - Text before extraction, parsed as regex (default: none)
+     -a, --after=arg - Text after extraction (default: none)
+     -b, --before=arg - Text before extraction (default: none)
      -c, --[no-]compressed - Expect compressed results
      --[no-]clean - Remove extra whitespace from results
      -h, --header=arg - Define a header to send as key=value (may be used more than once, default: none)
+     -i, --[no-]include - Include the before/after matches in the result
+     -r, --[no-]regex - Process before/after strings as regular expressions
      --[no-]strip - Strip HTML tags from results
  ```


  ##### headlinks

+ Example:
+
+     curlyq headlinks -q '[rel=stylesheet]' https://brettterpstra.com
+
+     {
+       "rel": "stylesheet",
+       "href": "https://cdn3.brettterpstra.com/stylesheets/screen.7261.css",
+       "type": "text/css",
+       "title": null
+     }
+
+ This pulls all `<link>` elements from the `<head>` of the page, and uses a query (`-q`) to show only links with `rel="stylesheet"`.
+
  ```
  NAME
      headlinks - Return all <head> links on URL's page
@@ -105,6 +149,61 @@ COMMAND OPTIONS

  ##### html

+ The html command (aliased as `curl`) gets the entire text of the web page and provides a JSON response with a breakdown of:
+
+ - URL, after any redirects
+ - Response code
+ - Response headers as a keyed hash
+ - Meta elements for the page as a keyed hash
+ - All meta links in the head as an array of objects containing (as available):
+     - rel
+     - href
+     - type
+     - title
+ - source of `<head>`
+ - source of `<body>`
+ - the page title (determined first by og:title, then by a title tag)
+ - description (using og:description first)
+ - All links on the page as an array of objects with:
+     - href
+     - title
+     - rel
+     - text content
+     - classes as array
+ - All images on the page as an array of objects containing:
+     - class
+     - all attributes as key/value pairs
+     - width and height (if specified)
+     - src
+     - alt and title
+
+ You can add a query (`-q`) to get only the information you need, e.g. `-q 'images[width>600]'`.
+
+ Example:
+
+     curlyq html -s '#main article .aligncenter' -q 'images[1]' 'https://brettterpstra.com'
+
+     [
+       {
+         "class": "aligncenter",
+         "original": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb_tw.jpg",
+         "at2x": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb@2x.jpg",
+         "width": "800",
+         "height": "226",
+         "src": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb.jpg",
+         "alt": "Giveaway Robot with Keyboard Maestro icon",
+         "title": "Giveaway Robot with Keyboard Maestro icon"
+       }
+     ]
+
+ The above example queries the full HTML of the page, narrows the elements with `--search`, and then takes the 2nd image from the results.
+
+     curlyq html -q 'meta.title' https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     Introducing CurlyQ, a pipeline-oriented curl helper - BrettTerpstra.com
+
+ The above example curls the page and returns the title found in its meta (`-q 'meta.title'`).
+
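+ The `-r` (`--raw`) flag maps each result to a single key, which pairs well with a query. A sketch, assuming you only want the image URLs:
+
+     curlyq html -q 'images' -r src 'https://brettterpstra.com'
+
+ This should print just the `src` value of each image object instead of the full hash.
+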
  ```
  NAME
      html - Curl URL and output its elements, multiple URLs allowed
@@ -124,12 +223,78 @@ COMMAND OPTIONS
      --[no-]ignore_relative - Ignore relative hrefs when gathering content links
      -q, --query, --filter=arg - Filter output using dot-syntax path (default: none)
      -r, --raw=arg - Output a raw value for a key (default: none)
-     --search=arg - Return an array of matches to a CSS or XPath query (default: none)
+     -s, --search=arg - Return an array of matches to a CSS or XPath query (default: none)
      -x, --external_links_only - Only gather external links
  ```

  ##### images

+ The images command returns only the images on the page as an array of objects. It can be queried to match certain requirements (see Query and Search syntax above).
+
+ The base command will return all images on the page, including OpenGraph images from the head, `<img>` tags from the body, and `<srcset>` tags along with their child images.
+
+ OpenGraph images will be returned with the structure:
+
+     {
+       "type": "opengraph",
+       "attrs": null,
+       "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg"
+     }
+
+ `img` tags will be returned with the structure:
+
+     {
+       "type": "img",
+       "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb.jpg",
+       "width": "800",
+       "height": "226",
+       "alt": "Banner image for CurlyQ",
+       "title": "CurlyQ, curl better",
+       "attrs": [
+         {
+           "key": "class",
+           "value": [
+             "aligncenter"
+           ]
+         } // all attributes included
+       ]
+     }
+
+ `srcset` images will be returned with the structure:
+
+     {
+       "type": "srcset",
+       "attrs": [
+         {
+           "key": "srcset",
+           "value": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg 1x, https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb@2x.jpg 2x"
+         }
+       ],
+       "images": [
+         {
+           "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg",
+           "media": "1x"
+         },
+         {
+           "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb@2x.jpg",
+           "media": "2x"
+         }
+       ]
+     }
+
+ Example:
+
+     curlyq images -t img -q '[alt$=screenshot]' https://brettterpstra.com
+
+ This will return an array of images that are `<img>` tags, showing only the ones whose `alt` attribute ends with `screenshot`.
+
+     curlyq images -q '[width>750]' https://brettterpstra.com
+
+ This example will only return images that have a width greater than 750 pixels, which depends on the images having proper `width` attributes set in the source.
+
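+ Comparisons can be chained with `&` here as well. A sketch combining the two examples above:
+
+     curlyq images -t img -q '[width>600&alt*=screenshot]' https://brettterpstra.com
+
+ This should return only `<img>` tags wider than 600 pixels whose `alt` text contains `screenshot`.
+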
  ```
  NAME
      images - Extract all images from a URL
@@ -139,14 +304,17 @@ SYNOPSIS
  curlyq [global options] images [command options] URL...

  COMMAND OPTIONS
-     -c, --[no-]compressed - Expect compressed results
-     --[no-]clean - Remove extra whitespace from results
-     -h, --header=arg - Define a header to send as key=value (may be used more than once, default: none)
-     -t, --type=arg - Type of images to return (img, srcset, opengraph, all) (may be used more than once, default: ["all"])
+     -c, --[no-]compressed - Expect compressed results
+     --[no-]clean - Remove extra whitespace from results
+     -h, --header=arg - Define a header to send as key=value (may be used more than once, default: none)
+     -q, --query, --filter=arg - Filter output using dot-syntax path (default: none)
+     -t, --type=arg - Type of images to return (img, srcset, opengraph, all) (may be used more than once, default: ["all"])
  ```

  ##### json

+ The `json` command returns an object containing the header/response info plus the contents of the JSON response, parsed by the Ruby JSON library. If fetching or parsing fails, it fails gracefully with an error code.
+
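+ A sketch, using a hypothetical endpoint (any URL that returns JSON will do):
+
+     curlyq json 'https://api.github.com/repos/ttscoff/curlyq'
+
+ From there, something like `jq` can pick individual fields out of the structured output.
+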
  ```
  NAME
      json - Get a JSON response from a URL, multiple URLs allowed
@@ -163,6 +331,12 @@ COMMAND OPTIONS

  ##### links

+ Returns all the links on the page, which can be queried on any attribute.
+
+ Example:
+
+     curlyq links -q '[rel=me]' https://brettterpstra.com
+
  ```
  NAME
      links - Return all links on a URL's page
@@ -181,6 +355,26 @@ COMMAND OPTIONS

  ##### scrape

+ Loads the page in a web browser, allowing scraping of dynamically loaded pages that return nothing but scripts when `curl`ed. The `-b` (`--browser`) option is required and should be 'chrome' or 'firefox' (or just 'c' or 'f'). The selected browser must be installed on your system.
+
+ Example:
+
+     curlyq scrape -b firefox -q 'links[rel=me&content*=mastodon][0]' https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     {
+       "href": "https://nojack.easydns.ca/@ttscoff",
+       "title": null,
+       "rel": [
+         "me"
+       ],
+       "content": "Mastodon",
+       "class": [
+         "u-url"
+       ]
+     }
+
+ This example scrapes the page using Firefox and finds the first link with a rel of 'me' and text containing 'mastodon'.
+
  ```
  NAME
      scrape - Scrape a page using a web browser, for dynamic (JS) pages. Be sure to have the selected --browser installed.
@@ -190,7 +384,7 @@ SYNOPSIS
  curlyq [global options] scrape [command options] URL...

  COMMAND OPTIONS
-     -b, --browser=arg - Browser to use (firefox, chrome) (default: none)
+     -b, --browser=arg - Browser to use (firefox, chrome) (required, default: none)
      --[no-]clean - Remove extra whitespace from results
      -h, --header=arg - Define a header to send as "key=value" (may be used more than once, default: none)
      -q, --query, --filter=arg - Filter output using dot-syntax path (default: none)
@@ -202,6 +396,17 @@ COMMAND OPTIONS

  Full-page screenshots require Firefox, installed and specified with `--browser firefox`.

+ The `full` type only works when `-b` is Firefox; with Chrome you must use a `--type` of 'visible' or 'print'. The default type is `visible`.
+
+ The `-o` (`--out`) flag is required. It should be a path to a target PNG file (or PDF for `-t print` output). The extension will be adjusted automatically; all you need is the base name.
+
+ Example:
+
+     curlyq screenshot -b f -o ~/Desktop/test https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     Screenshot saved to /Users/ttscoff/Desktop/test.png
+
  ```
  NAME
      screenshot - Save a screenshot of a URL
@@ -213,12 +418,14 @@ SYNOPSIS
  COMMAND OPTIONS
      -b, --browser=arg - Browser to use (firefox, chrome) (default: chrome)
      -h, --header=arg - Define a header to send as key=value (may be used more than once, default: none)
-     -o, --out, --file=arg - File destination (default: none)
-     -t, --type=arg - Type of screenshot to save (full (requires firefox), print, visible) (default: full)
+     -o, --out, --file=arg - File destination (required, default: none)
+     -t, --type=arg - Type of screenshot to save (full (requires firefox), print, visible) (default: visible)
  ```

  ##### tags

+ Return a hierarchy of all tags in a page. Use `-t` to limit to a specific tag.
+
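+ A sketch, assuming you only want headline tags (`-t` may be repeated):
+
+     curlyq tags -t h1 -t h2 'https://brettterpstra.com'
+
+ This collects every `h1` and `h2` tag on the page.
+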
  ```
  NAME
      tags - Extract all instances of a tag
@@ -231,7 +438,8 @@ COMMAND OPTIONS
      -c, --[no-]compressed - Expect compressed results
      --[no-]clean - Remove extra whitespace from results
      -h, --header=arg - Define a header to send as key=value (may be used more than once, default: none)
-     -q, --query, --search=arg - CSS/XPath query (default: none)
+     -q, --query, --filter=arg - CSS/XPath query (default: none)
+     --search=arg - Return an array of matches to a CSS or XPath query (default: none)
      -t, --tag=arg - Specify a tag to collect (may be used more than once, default: none)
  ```

data/bin/curlyq CHANGED
@@ -71,7 +71,7 @@ command %i[html curl] do |c|
    c.switch %i[I info], negatable: false

    c.desc 'Return an array of matches to a CSS or XPath query'
-   c.flag %i[search]
+   c.flag %i[s search]

    c.desc 'Define a header to send as "key=value"'
    c.flag %i[h header], multiple: true
@@ -110,25 +110,31 @@ command %i[html curl] do |c|
      output = []

      urls.each do |url|
-       res = Curl::Html.new(url, { browser: options[:browser], fallback: options[:fallback],
-                                   headers: headers, headers_only: options[:info],
-                                   compressed: options[:compressed], clean: options[:clean],
-                                   ignore_local_links: options[:ignore_relative],
-                                   ignore_fragment_links: options[:ignore_fragments],
-                                   external_links_only: options[:external_links_only] })
+       curl_settings = { browser: options[:browser], fallback: options[:fallback],
+                         headers: headers, headers_only: options[:info],
+                         compressed: options[:compressed], clean: options[:clean],
+                         ignore_local_links: options[:ignore_relative],
+                         ignore_fragment_links: options[:ignore_fragments],
+                         external_links_only: options[:external_links_only] }
+       res = Curl::Html.new(url, curl_settings)
        res.curl

        if options[:info]
          output.push(res.headers)
-         # print_out(res.headers, global_options[:yaml], raw: options[:raw], pretty: global_options[:pretty])
          next
        end

        if options[:search]
-         out = res.search(options[:search])
+         source = res.search(options[:search], return_source: true)

-         out = out.dot_query(options[:query]) if options[:query]
-         output.push(out)
+         out = res.parse(source)
+
+         if options[:query]
+           out = out.to_data(url: url, clean: options[:clean]).dot_query(options[:query])
+         else
+           out = out.to_data
+         end
+         output.push([out])
        elsif options[:query]
          queried = res.to_data.dot_query(options[:query])
          output.push(queried) if queried
@@ -136,7 +142,7 @@ command %i[html curl] do |c|
          output.push(res.to_data(url: url))
        end
      end
-
+     output.delete_if(&:nil?)
      output.delete_if(&:empty?)
      output = output[0] if output.count == 1
      output.map! { |o| o[options[:raw].to_sym] } if options[:raw]
@@ -149,13 +155,13 @@ desc 'Save a screenshot of a URL'
  arg_name 'URL', multiple: true
  command :screenshot do |c|
    c.desc 'Type of screenshot to save (full (requires firefox), print, visible)'
-   c.flag %i[t type], type: ScreenshotType, must_match: /^[fpv].*?$/, default_value: 'full'
+   c.flag %i[t type], type: ScreenshotType, must_match: /^[fpv].*?$/, default_value: 'visible'

    c.desc 'Browser to use (firefox, chrome)'
    c.flag %i[b browser], type: BrowserType, must_match: /^[fc].*?$/, default_value: 'chrome'

    c.desc 'File destination'
-   c.flag %i[o out file]
+   c.flag %i[o out file], required: true

    c.desc 'Define a header to send as key=value'
    c.flag %i[h header], multiple: true
@@ -164,11 +170,19 @@ command :screenshot do |c|
      urls = args.join(' ').split(/[, ]+/)
      headers = break_headers(options[:header])

+     type = options[:type]
+     browser = options[:browser]
+
+     type = type.is_a?(Symbol) ? type : type.normalize_screenshot_type
+     browser = browser.is_a?(Symbol) ? browser : browser.normalize_browser_type
+
+     raise 'Full page screen shots only available with Firefox' if type == :full_page && browser != :firefox
+
      urls.each do |url|
        c = Curl::Html.new(url)
        c.headers = headers
-       c.browser = options[:browser]
-       c.screenshot(options[:out], type: options[:type])
+       c.browser = browser
+       c.screenshot(options[:out], type: type)
      end
    end
  end
@@ -221,12 +235,18 @@ end
  desc 'Extract contents between two regular expressions'
  arg_name 'URL', multiple: true
  command :extract do |c|
-   c.desc 'Text before extraction, parsed as regex'
+   c.desc 'Text before extraction'
    c.flag %i[b before]

-   c.desc 'Text after extraction, parsed as regex'
+   c.desc 'Text after extraction'
    c.flag %i[a after]

+   c.desc 'Process before/after strings as regular expressions'
+   c.switch %i[r regex]
+
+   c.desc 'Include the before/after matches in the result'
+   c.switch %i[i include]
+
    c.desc 'Define a header to send as key=value'
    c.flag %i[h header], multiple: true

@@ -249,7 +269,15 @@ command :extract do |c|
      res = Curl::Html.new(url, { headers: headers, headers_only: false,
                                  compressed: options[:compressed], clean: options[:clean] })
      res.curl
-     extracted = res.extract(options[:before], options[:after])
+     if options[:regex]
+       before = Regexp.new(options[:before])
+       after = Regexp.new(options[:after])
+     else
+       before = /#{Regexp.escape(options[:before])}/
+       after = /#{Regexp.escape(options[:after])}/
+     end
+
+     extracted = res.extract(before, after, inclusive: options[:include])
      extracted.strip_tags! if options[:strip]
      output.concat(extracted)
    end
@@ -274,7 +302,10 @@ command :tags do |c|
    c.switch %i[clean]

    c.desc 'CSS/XPath query'
-   c.flag %i[q query search]
+   c.flag %i[q query filter]
+
+   c.desc 'Return an array of matches to a CSS or XPath query'
+   c.flag %i[search]

    c.action do |global_options, options, args|
      urls = args.join(' ').split(/[, ]+/)
@@ -286,9 +317,17 @@ command :tags do |c|
      res = Curl::Html.new(url, { headers: headers, headers_only: options[:headers],
                                  compressed: options[:compressed], clean: options[:clean] })
      res.curl
+
      output = []
      if options[:search]
-       output = res.tags.search(options[:search])
+       out = res.search(options[:search])
+
+       # out = out.dot_query(options[:query]) if options[:query]
+       output.push(out)
+     elsif options[:query]
+       query = options[:query] =~ /^links/ ? options[:query] : "links#{options[:query]}"
+
+       output = res.to_data.dot_query(query)
      elsif tags.count.positive?
        tags.each { |tag| output.concat(res.tags(tag)) }
      else
@@ -312,6 +351,9 @@ command :images do |c|
    c.desc 'Remove extra whitespace from results'
    c.switch %i[clean]

+   c.desc 'Filter output using dot-syntax path'
+   c.flag %i[q query filter]
+
    c.desc 'Define a header to send as key=value'
    c.flag %i[h header], multiple: true

@@ -326,7 +368,15 @@ command :images do |c|
      urls.each do |url|
        res = Curl::Html.new(url, { compressed: options[:compressed], clean: options[:clean] })
        res.curl
-       output.concat(res.images(types: types))
+
+       res = res.images(types: types)
+
+       if options[:query]
+         query = options[:query] =~ /^images/ ? options[:query] : "images#{options[:query]}"
+         res = { images: res }.dot_query(query)
+       end
+
+       output.concat(res)
      end

      print_out(output, global_options[:yaml], pretty: global_options[:pretty])
@@ -367,7 +417,7 @@ command :links do |c|

        if options[:query]
          query = options[:query] =~ /^links/ ? options[:query] : "links#{options[:query]}"
-         queried = { links: res.to_data[:links] }.dot_query(query)
+         queried = res.to_data.dot_query(query)
          output.concat(queried) if queried
        else
          output.concat(res.body_links)
@@ -414,7 +464,7 @@ desc %(Scrape a page using a web browser, for dynamic (JS) pages. Be sure to hav
  arg_name 'URL', multiple: true
  command :scrape do |c|
    c.desc 'Browser to use (firefox, chrome)'
-   c.flag %i[b browser], type: BrowserType
+   c.flag %i[b browser], type: BrowserType, required: true

    c.desc 'Return an array of matches to a CSS or XPath query'
    c.flag %i[search]
@@ -437,30 +487,19 @@ command :scrape do |c|
      output = []

      urls.each do |url|
-       driver = Selenium::WebDriver.for options[:browser]
-       begin
-         driver.get url
-         res = driver.page_source
-
-         res = Curl::Html.new(nil, { source: res, clean: options[:clean] })
-         res.curl
-         if options[:search]
-           out = res.search(options[:search])
-
-           out = out.dot_query(options[:query]) if options[:query]
-           output.push(out)
-         elsif options[:query]
-           queried = res.to_data(url: url).dot_query(options[:query])
-           output = queried if queried
-         else
-           output.push(res.to_data(url: url))
-         end
+       res = Curl::Html.new(url, { browser: options[:browser], clean: options[:clean] })
+       res.curl

-       # elements = driver.find_elements(css: options[:query])
+       if options[:search]
+         out = res.search(options[:search])

-       # elements.each { |e| output.push(e.text.strip) }
-       ensure
-         driver.quit
+         out = out.dot_query(options[:query]) if options[:query]
+         output.push(out)
+       elsif options[:query]
+         queried = res.to_data(url: url).dot_query(options[:query])
+         output.push(queried) if queried
+       else
+         output.push(res.to_data(url: url))
        end
      end

data/lib/curly/array.rb CHANGED
@@ -67,68 +67,69 @@ class ::Array
    end

    ##
-   ## Convert and execute a dot-syntax query on the array
-   ##
-   ## @param path [String] The dot-syntax path
-   ##
-   ## @return [Array] Matching elements
-   ##
-   def dot_query(path)
-     output = []
-     if path =~ /^\[([\d+.])\]\.?/
-       int = Regexp.last_match(1)
-       path.sub!(/^\[[\d.]+\]\.?/, '')
-       items = self[eval(int)]
-     else
-       items = self
-     end
+   ## Test if a tag contains an attribute matching filter queries
+   ##
+   ## @param tag_name   [String]  The tag name
+   ## @param classes    [String]  The classes to match
+   ## @param id         [String]  The id attribute to match
+   ## @param attribute  [String]  The attribute
+   ## @param operator   [String]  The operator, <>= *= $= ^=
+   ## @param value      [String]  The value to match
+   ## @param descendant [Boolean] Check descendant tags
+   ##
+   def tag_match(tag_name, classes, id, attribute, operator, value, descendant: false)
+     tag = self
+     keep = true
+
+     keep = false if tag_name && tag['tag'] !~ /^#{tag_name}$/i
+
+     if tag.key?('attrs') && tag['attrs']
+       if keep && id
+         tag_id = tag['attrs'].filter { |a| a['key'] == 'id' }.first['value']
+         keep = tag_id && tag_id =~ /#{id}/i
+       end

-     if items.is_a? Hash
-       output = items.dot_query(path)
-     else
-       items.each do |item|
-         res = item.is_a?(Hash) ? item.stringify_keys : item
-         out = []
-         q = path.split(/(?<![\d.])\./)
-         q.each do |pth|
-           el = Regexp.last_match(1) if pth =~ /\[([0-9,.]+)\]/
-           pth.sub!(/\[([0-9,.]+)\]/, '')
-           ats = []
-           at = []
-           while pth =~ /\[[+&,]?\w+ *[\^*$=<>]=? *\w+/
-             m = pth.match(/\[(?<com>[,+&])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>\w+) */)
-             comp = [m['key'], m['op'], m['val']]
-             case m['com']
-             when ','
-               ats.push(comp)
-               at = []
-             else
-               at.push(comp)
-             end
-
-             pth.sub!(/\[(?<com>[,&+])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>\w+)/, '[')
-           end
-           ats.push(at) unless at.empty?
-           pth.sub!(/\[\]/, '')
-
-           return false if el.nil? && ats.empty? && !res.key?(pth)
-
-           res = res[pth] unless pth.empty?
-
-           while ats.count.positive?
-             atr = ats.shift
-
-             keepers = res.filter do |r|
-               evaluate_comp(r, atr)
-             end
-             out.concat(keepers)
-           end
-
-           out = out[eval(el)] if out.is_a?(Array) && el =~ /^[\d.,]+$/
+       if keep && classes
+         cls = tag['attrs'].filter { |a| a['key'] == 'class' }.first
+         if cls
+           all = true
+           classes.each { |c| all = cls['value'].include?(c) }
+           keep = all
+         else
+           keep = false
          end
-         output.push(out)
        end
+
+       if keep && attribute
+         attributes = tag['attrs'].filter { |a| a['key'] =~ /^#{attribute}$/i }
+         any = false
+         attributes.each do |a|
+           break if any
+
+           any = case operator
+                 when /^\*/
+                   a['value'] =~ /#{value}/i
+                 when /^\^/
+                   a['value'] =~ /^#{value}/i
+                 when /^\$/
+                   a['value'] =~ /#{value}$/i
+                 else
+                   a['value'] =~ /^#{value}$/i
+                 end
+         end
+         keep = any
+       end
+     end
+
+     return false if descendant && !keep
+
+     if !descendant && tag.key?('tags')
+       tags = tag['tags'].filter { |t| t.tag_match(tag_name, classes, id, attribute, operator, value) }
+       tags.count.positive?
+     else
+       keep
      end
-     output
    end
  end
data/lib/curly/html.rb CHANGED
@@ -65,7 +65,13 @@ module Curl
    @external_links_only = options[:external_links_only]

    @curl = TTY::Which.which('curl')
-   @url = url
+   @url = url.nil? ? options[:url] : url
+ end
+
+ def parse(source)
+   @body = source
+   { url: @url, code: @code, headers: @headers, meta: @meta, links: @links, head: @head, body: source,
+     source: source.strip, body_links: content_links, body_images: content_images }
  end

  def curl
@@ -118,10 +124,15 @@ module Curl
  ##
  ## @return [Array] array of matches
  ##
- def extract(before, after)
-   before = /#{Regexp.escape(before)}/ unless before.instance_of?(Regexp)
-   after = /#{Regexp.escape(after)}/ unless after.instance_of?(Regexp)
-   rx = /(?<=#{before.source})(.*?)(?=#{after.source})/m
+ def extract(before, after, inclusive: false)
+   before = /#{Regexp.escape(before)}/ unless before.is_a?(Regexp)
+   after = /#{Regexp.escape(after)}/ unless after.is_a?(Regexp)
+
+   if inclusive
+     rx = /(#{before.source}.*?#{after.source})/m
+   else
+     rx = /(?<=#{before.source})(.*?)(?=#{after.source})/m
+   end
    @body.scan(rx).map { |r| @clean ? r[0].clean : r[0] }
  end

@@ -343,12 +354,16 @@ module Curl
  ##
  ## @return [Array] array of matched elements
  ##
- def search(path, source: @source)
+ def search(path, source: @source, return_source: false)
    doc = Nokogiri::HTML(source)
    output = []
-   doc.search(path).each do |el|
-     out = nokogiri_to_tag(el)
-     output.push(out)
+   if return_source
+     output = doc.search(path).to_html
+   else
+     doc.search(path).each do |el|
+       out = nokogiri_to_tag(el)
+       output.push(out)
+     end
    end
    output
  end
@@ -480,6 +495,7 @@ module Curl
  ##
  def content_links
    links = []
+
    link_tags = @body.to_enum(:scan, %r{<a ?(?<tag>.*?)>(?<text>.*?)</a>}).map { Regexp.last_match }
    link_tags.each do |m|
      href = m['tag'].match(/href=(["'])(.*?)\1/)
@@ -534,7 +550,7 @@ module Curl
  ## @return [String] page source
  ##
  def curl_dynamic_html
-   browser = @browser.normalize_browser_type if @browser.is_a?(String)
+   browser = @browser.is_a?(String) ? @browser.normalize_browser_type : @browser
    res = nil

    driver = Selenium::WebDriver.for browser
@@ -607,7 +623,7 @@ module Curl
  ##
  def curl_html(url = nil, source: nil, headers: nil,
                headers_only: false, compressed: false, fallback: false)
-   unless url.nil?
+   if !url.nil?
      flags = 'SsL'
      flags += @headers_only ? 'I' : 'i'
      agents = [
@@ -620,8 +636,8 @@ module Curl
      compress = @compressed ? '--compressed' : ''
      @source = `#{@curl} -#{flags} #{compress} #{headers} '#{@url}' 2>/dev/null`
      agent = 0
-     while source.nil? || source.empty?
-       source = `#{@curl} -#{flags} #{compress} -A "#{agents[agent]}" #{headers} '#{@url}' 2>/dev/null`
+     while @source.nil? || @source.empty?
+       @source = `#{@curl} -#{flags} #{compress} -A "#{agents[agent]}" #{headers} '#{@url}' 2>/dev/null`
        break if agent >= agents.count - 1
      end

@@ -630,49 +646,50 @@ module Curl
        Process.exit 1
      end

-     if @fallback && (@source.nil? || @source.empty?)
-       @source = curl_dynamic_html(@url, @fallback, @headers)
+     headers = { 'location' => @url }
+     lines = @source.split(/\r\n/)
+     code = lines[0].match(/(\d\d\d)/)[1]
+     lines.shift
+     lines.each_with_index do |line, idx|
+       if line =~ /^([\w-]+): (.*?)$/
+         m = Regexp.last_match
+         headers[m[1]] = m[2]
+       else
+         @source = lines[idx..].join("\n")
+         break
+       end
      end
-   end
-
-   return false if source.nil? || source.empty?

-   @source.strip!
+     if headers['content-encoding'] =~ /gzip/i && !compressed
+       warn 'Response is gzipped, you may need to try again with --compressed'
+     end

-   headers = { 'location' => @url }
-   lines = @source.split(/\r\n/)
-   code = lines[0].match(/(\d\d\d)/)[1]
-   lines.shift
-   lines.each_with_index do |line, idx|
-     if line =~ /^([\w-]+): (.*?)$/
-       m = Regexp.last_match
-       headers[m[1]] = m[2]
-     else
-       @source = lines[idx..].join("\n")
-       break
+     if headers['content-type'] =~ /json/
+       return { url: @url, code: code, headers: headers, meta: nil, links: nil,
+                head: nil, body: @source.strip, source: @source.strip, body_links: nil, body_images: nil }
      end
+   else
+     @source = source unless source.nil?
    end

-   if headers['content-encoding'] =~ /gzip/i && !compressed
-     warn 'Response is gzipped, you may need to try again with --compressed'
-   end
+   @source = curl_dynamic_html(@url, @fallback, @headers) if @fallback && (@source.nil? || @source.empty?)

-   if headers['content-type'] =~ /json/
-     return { url: @url, code: code, headers: headers, meta: nil, links: nil,
-              head: nil, body: @source.strip, source: @source.strip, body_links: nil, body_images: nil }
-   end
+   return false if @source.nil? || @source.empty?
+
+   @source.strip!

-   head = source.match(%r{(?<=<head>)(.*?)(?=</head>)}mi)
+   head = @source.match(%r{(?<=<head>)(.*?)(?=</head>)}mi)

    if head.nil?
      { url: @url, code: code, headers: headers, meta: nil, links: nil, head: nil, body: @source.strip,
        source: @source.strip, body_links: nil, body_images: nil }
    else
+     @body = @source.match(%r{<body.*?>(.*?)</body>}mi)[1]
      meta = meta_tags(head[1])
      links = link_tags(head[1])
-     body = @source.match(%r{<body.*?>(.*?)</body>}mi)[1]
-     { url: @url, code: code, headers: headers, meta: meta, links: links, head: head[1], body: body,
-       source: @source.strip, body_links: body_links, body_images: body_images }
+
+     { url: @url, code: code, headers: headers, meta: meta, links: links, head: head[1], body: @body,
+       source: @source.strip, body_links: nil, body_images: nil }
    end
  end

data/lib/curly/hash.rb CHANGED
@@ -2,6 +2,27 @@

  # Hash helpers
  class ::Hash
+   def to_data(url: nil, clean: false)
+     if key?(:body_links)
+       {
+         url: self[:url] || url,
+         code: self[:code],
+         headers: self[:headers],
+         meta: self[:meta],
+         meta_links: self[:links],
+         head: clean ? self[:head]&.strip&.clean : self[:head],
+         body: clean ? self[:body]&.strip&.clean : self[:body],
+         source: clean ? self[:source]&.strip&.clean : self[:source],
+         title: self[:title],
+         description: self[:description],
+         links: self[:body_links],
+         images: self[:body_images]
+       }
+     else
+       self
+     end
+   end
+
    # Extract data using a dot-syntax path
    #
    # @param path [String] The path
@@ -18,7 +39,7 @@ class ::Hash
        ats = []
        at = []
        while pth =~ /\[[+&,]?\w+ *[\^*$=<>]=? *\w+/
-         m = pth.match(/\[(?<com>[,+&])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>\w+) */)
+         m = pth.match(/\[(?<com>[,+&])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>[^,&\]]+) */)
          comp = [m['key'], m['op'], m['val']]
          case m['com']
          when ','
@@ -28,15 +49,16 @@ class ::Hash
            at.push(comp)
          end

-         pth.sub!(/\[(?<com>[,&+])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>\w+)/, '[')
+         pth.sub!(/\[(?<com>[,&+])? *(?<key>\w+) *(?<op>[\^*$=<>]{1,2}) *(?<val>[^,&\]]+)/, '[')
        ats.push(at) unless at.empty?
        pth.sub!(/\[\]/, '')

        return false if el.nil? && ats.empty? && !res.key?(pth)
-
        res = res[pth] unless pth.empty?

+       return false if res.nil?
+
        if ats.count.positive?
          while ats.count.positive?
            atr = ats.shift

@@ -60,7 +82,7 @@ class ::Hash
  ##
  ## @param r [Hash] hash of source elements and
  ##                 comparison operators
- ## @param atr [String] The attribute to compare
+ ## @param atr [Array] Array of arrays containing [attribute, comparator, value]
  ##
  ## @return [Boolean] whether the comparison passes or fails
  ##
@@ -118,7 +140,7 @@ class ::Hash
  end

  ##
- ## Test if a hash contains a tag matching filter queries
+ ## Test if a tag contains an attribute matching filter queries
  ##
  ## @param tag_name [String] The tag name
  ## @param classes [String] The classes to match
data/lib/curly/version.rb CHANGED
@@ -1,3 +1,3 @@
  module Curly
-   VERSION = '0.0.4'
+   VERSION = '0.0.5'
  end
data/src/_README.md CHANGED
@@ -10,7 +10,7 @@ _If you find this useful, feel free to [buy me some coffee][donate]._
  [donate]: https://brettterpstra.com/donate
  <!--END GITHUB-->

- The current version of `curlyq` is <!--VER-->0.0.3<!--END VER-->.
+ The current version of `curlyq` is <!--VER-->0.0.4<!--END VER-->.

  CurlyQ is a utility that provides a simple interface for curl, with additional features for things like extracting images and links, finding elements by CSS selector or XPath, getting detailed header info, and more. It's designed to be part of a scripting pipeline, outputting everything as structured data (JSON or YAML). It also has rudimentary support for making calls to JSON endpoints easier, but it's expected that you'll use something like `jq` to parse the output.

@@ -39,12 +39,41 @@ Run `curlyq help` for a list of subcommands. Run `curlyq help SUBCOMMAND` for de
  @cli(bundle exec bin/curlyq help)
  ```

+ ### Query and Search syntax
+
+ You can shape the results using `--search` (`-s`) and `--query` (`-q`) on some commands.
+
+ A search uses either CSS or XPath syntax to locate elements. For example, if you wanted to locate all of the `<article>` elements with a class of `post` inside of the div with an id of `main`, you would run `--search '#main article.post'`. Searches can target tags, ids, and classes, and can accept `>` to target direct descendants. You can also use XPaths, but I hate those so I'm not going to document them.
+
+ Queries are specifically for shaping CurlyQ output. If you're using the `html` command, it returns a key called `images`, so you can target just the images in the response with `-q 'images'`. The queries accept array syntax, so to get the first image, you would use `-q 'images[0]'`. Ranges are accepted as well, so `-q 'images[1..4]'` will return the 2nd through 5th images found on the page. You can also do comparisons, e.g. `-q 'images[rel=me]'`, to target only images with a `rel` attribute of `me`.
+
+ The comparisons for the query flag are:
+
+ - `<` less than
+ - `>` greater than
+ - `<=` less than or equal to
+ - `>=` greater than or equal to
+ - `=` or `==` is equal to
+ - `*=` contains text
+ - `^=` starts with text
+ - `$=` ends with text
+
  #### Commands

  curlyq makes use of subcommands, e.g. `curlyq html [options] URL` or `curlyq extract [options] URL`. Each subcommand takes its own options, but I've made an effort to standardize the choices between each command as much as possible.

  ##### extract

+ Example:
+
+     curlyq extract -i -b 'Adding' -a 'accessing the source.' 'https://stackoverflow.com/questions/52428409/get-fully-rendered-html-using-selenium-webdriver-and-python'
+
+     [
+       "Adding <code>time.sleep(10)</code> in various places in case the page had not fully loaded when I was accessing the source."
+     ]
+
+ This specifies a before and after string and includes them (`-i`) in the result.
+
  ```
  @cli(bundle exec bin/curlyq help extract)
  ```
@@ -52,36 +81,198 @@ curlyq makes use of subcommands, e.g. `curlyq html [options] URL` or `curlyq ext

  ##### headlinks

+ Example:
+
+     curlyq headlinks -q '[rel=stylesheet]' https://brettterpstra.com
+
+     {
+       "rel": "stylesheet",
+       "href": "https://cdn3.brettterpstra.com/stylesheets/screen.7261.css",
+       "type": "text/css",
+       "title": null
+     }
+
+ This pulls all `<link>` elements from the `<head>` of the page, and uses a query (`-q`) to show only links with `rel="stylesheet"`.
+
  ```
  @cli(bundle exec bin/curlyq help headlinks)
  ```

  ##### html

+ The html command (aliased as `curl`) gets the entire text of the web page and provides a JSON response with a breakdown of:
+
+ - URL, after any redirects
+ - Response code
+ - Response headers as a keyed hash
+ - Meta elements for the page as a keyed hash
+ - All meta links in the head as an array of objects containing (as available):
+     - rel
+     - href
+     - type
+     - title
+ - source of `<head>`
+ - source of `<body>`
+ - the page title (determined first by og:title, then by a title tag)
+ - description (using og:description first)
+ - All links on the page as an array of objects with:
+     - href
+     - title
+     - rel
+     - text content
+     - classes as array
+ - All images on the page as an array of objects containing:
+     - class
+     - all attributes as key/value pairs
+     - width and height (if specified)
+     - src
+     - alt and title
+
+ You can add a query (`-q`) to get only the information you need, e.g. `-q 'images[width>600]'`.
+
+ Example:
+
+     curlyq html -s '#main article .aligncenter' -q 'images[1]' 'https://brettterpstra.com'
+
+     [
+       {
+         "class": "aligncenter",
+         "original": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb_tw.jpg",
+         "at2x": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb@2x.jpg",
+         "width": "800",
+         "height": "226",
+         "src": "https://cdn3.brettterpstra.com/uploads/2023/09/giveaway-keyboardmaestro2024-rb.jpg",
+         "alt": "Giveaway Robot with Keyboard Maestro icon",
+         "title": "Giveaway Robot with Keyboard Maestro icon"
+       }
+     ]
+
+ The above example queries the full HTML of the page, narrows the elements with `--search`, and then takes the 2nd image from the results.
+
+     curlyq html -q 'meta.title' https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     Introducing CurlyQ, a pipeline-oriented curl helper - BrettTerpstra.com
+
+ The above example curls the page and returns the title found in its meta (`-q 'meta.title'`).
+
  ```
  @cli(bundle exec bin/curlyq help html)
  ```

  ##### images

+ The images command returns only the images on the page as an array of objects. It can be queried to match certain requirements (see Query and Search syntax above).
+
+ The base command will return all images on the page, including OpenGraph images from the head, `<img>` tags from the body, and `<srcset>` tags along with their child images.
+
+ OpenGraph images will be returned with the structure:
+
+     {
+       "type": "opengraph",
+       "attrs": null,
+       "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg"
+     }
+
+ `img` tags will be returned with the structure:
+
+     {
+       "type": "img",
+       "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb.jpg",
+       "width": "800",
+       "height": "226",
+       "alt": "Banner image for CurlyQ",
+       "title": "CurlyQ, curl better",
+       "attrs": [
+         {
+           "key": "class",
+           "value": [
+             "aligncenter"
+           ]
+         } // all attributes included
+       ]
+     }
+
+ `srcset` images will be returned with the structure:
+
+     {
+       "type": "srcset",
+       "attrs": [
+         {
+           "key": "srcset",
+           "value": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg 1x, https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb@2x.jpg 2x"
+         }
+       ],
+       "images": [
+         {
+           "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb_tw.jpg",
+           "media": "1x"
+         },
+         {
+           "src": "https://cdn3.brettterpstra.com/uploads/2024/01/curlyq_header-rb@2x.jpg",
+           "media": "2x"
+         }
+       ]
+     }
+
+ Example:
+
+     curlyq images -t img -q '[alt$=screenshot]' https://brettterpstra.com
+
+ This will return an array of images that are `<img>` tags, showing only the ones whose `alt` attribute ends with `screenshot`.
+
+     curlyq images -q '[width>750]' https://brettterpstra.com
+
+ This example will only return images that have a width greater than 750 pixels, which depends on the images having proper `width` attributes set in the source.
+
  ```
  @cli(bundle exec bin/curlyq help images)
  ```

  ##### json

+ The `json` command returns an object containing the header/response info plus the contents of the JSON response, parsed by the Ruby JSON library. If fetching or parsing fails, it fails gracefully with an error code.
+
  ```
  @cli(bundle exec bin/curlyq help json)
  ```

  ##### links

+ Returns all the links on the page, which can be queried on any attribute.
+
+ Example:
+
+     curlyq links -q '[rel=me]' https://brettterpstra.com
+
  ```
  @cli(bundle exec bin/curlyq help links)
  ```

  ##### scrape

+ Loads the page in a web browser, allowing scraping of dynamically loaded pages that return nothing but scripts when `curl`ed. The `-b` (`--browser`) option is required and should be 'chrome' or 'firefox' (or just 'c' or 'f'). The selected browser must be installed on your system.
+
+ Example:
+
+     curlyq scrape -b firefox -q 'links[rel=me&content*=mastodon][0]' https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     {
+       "href": "https://nojack.easydns.ca/@ttscoff",
+       "title": null,
+       "rel": [
+         "me"
+       ],
+       "content": "Mastodon",
+       "class": [
+         "u-url"
+       ]
+     }
+
+ This example scrapes the page using Firefox and finds the first link with a rel of 'me' and text containing 'mastodon'.
+
  ```
  @cli(bundle exec bin/curlyq help scrape)
  ```
@@ -90,12 +281,25 @@ curlyq makes use of subcommands, e.g. `curlyq html [options] URL` or `curlyq ext

  Full-page screenshots require Firefox, installed and specified with `--browser firefox`.

+ The `full` type only works when `-b` is Firefox; with Chrome you must use a `--type` of 'visible' or 'print'. The default type is `visible`.
+
+ The `-o` (`--out`) flag is required. It should be a path to a target PNG file (or PDF for `-t print` output). The extension will be adjusted automatically; all you need is the base name.
+
+ Example:
+
+     curlyq screenshot -b f -o ~/Desktop/test https://brettterpstra.com/2024/01/10/introducing-curlyq-a-pipeline-oriented-curl-helper/
+
+     Screenshot saved to /Users/ttscoff/Desktop/test.png
+
  ```
  @cli(bundle exec bin/curlyq help screenshot)
  ```

  ##### tags

+ Return a hierarchy of all tags in a page. Use `-t` to limit to a specific tag.
+
  ```
  @cli(bundle exec bin/curlyq help tags)
  ```
metadata CHANGED
@@ -1,14 +1,14 @@
  --- !ruby/object:Gem::Specification
  name: curlyq
  version: !ruby/object:Gem::Version
-   version: 0.0.4
+   version: 0.0.5
  platform: ruby
  authors:
  - Brett Terpstra
  autorequire:
  bindir: bin
  cert_chain: []
- date: 2024-01-10 00:00:00.000000000 Z
+ date: 2024-01-12 00:00:00.000000000 Z
  dependencies:
  - !ruby/object:Gem::Dependency
    name: rake
@@ -139,6 +139,7 @@ extra_rdoc_files:
  files:
  - ".github/FUNDING.yml"
  - ".gitignore"
+ - ".irbrc"
  - CHANGELOG.md
  - Gemfile
  - Gemfile.lock