asimov-brightdata-module 0.0.3-arm64-darwin

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
checksums.yaml ADDED
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA256:
3
+ metadata.gz: a13c95b4761bd73ac94667ed6a6f20c1a5859f1113a37791d4da71e46357fff8
4
+ data.tar.gz: 879135b8460f93e27934fb54cf938cb39c8e199543b408dcc091ef31b56ec93f
5
+ SHA512:
6
+ metadata.gz: 8fa6376a5d149a6a1c50578e061d30e3122634b685e5488b9650c9fca0911ede88aa05ffa97f2869adddfbbc741f4c33da088b7b7877f0862d31136fdbcae4a0
7
+ data.tar.gz: 9d7e42360f144400fd599e56cecbc6ceb460bb622874067e6bbf8a2dcedf330aeb5df517e2b9f0cfae71d4f4bb77522a0b6d9e17dfdc97f3c5bcf23b47371f8e
data/AUTHORS ADDED
@@ -0,0 +1 @@
1
+ * Arto Bendiken <arto@asimov.so>
data/CHANGES.md ADDED
@@ -0,0 +1,17 @@
1
+ # Changelog
2
+
3
+ All notable changes to this project will be documented in this file.
4
+
5
+ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
6
+ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
7
+
8
+ ## 0.0.3 - 2025-05-29
9
+ ### Added
10
+ - Implement JSON-LD output for X profiles
11
+ - Implement pretty-printed JSON output
12
+ ### Changed
13
+ - Bump the asimov-module dependency
14
+
15
+ ## 0.0.2 - 2025-05-22
16
+
17
+ ## 0.0.1 - 2025-05-21
data/README.md ADDED
@@ -0,0 +1,153 @@
1
+ # ASIMOV Bright Data Module
2
+
3
+ [![License](https://img.shields.io/badge/license-Public%20Domain-blue.svg)](https://unlicense.org)
4
+ [![Package on Crates.io](https://img.shields.io/crates/v/asimov-brightdata-module.svg)](https://crates.io/crates/asimov-brightdata-module)
5
+ [![Package on PyPI](https://img.shields.io/pypi/v/asimov-brightdata-module.svg)](https://pypi.org/project/asimov-brightdata-module)
6
+ [![Package on RubyGems](https://img.shields.io/gem/v/asimov-brightdata-module.svg)](https://rubygems.org/gems/asimov-brightdata-module)
7
+ [![Package on NPM](https://img.shields.io/npm/v/asimov-brightdata-module.svg)](https://npmjs.com/package/asimov-brightdata-module)
8
+
9
+ [ASIMOV] module for data import powered by the [Bright Data] web data platform.
10
+
11
+ ## ✨ Features
12
+
13
+ - Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook,
14
+ Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and
15
+ YouTube.
16
+ - Collects the raw JSON data via the Bright Data API (requires an API key).
17
+ - Constructs a semantic knowledge graph based on the [KNOW] ontology.
18
+ - Supports plain JSON output as well as [RDF] output in the form of JSON-LD.
19
+ - Distributed as a standalone static binary with zero runtime dependencies.
20
+
21
+ ## 🛠️ Prerequisites
22
+
23
+ - [Rust] 1.85+ (2024 edition) if building from source code
24
+
25
+ ## ⬇️ Installation
26
+
27
+ ### Installation from PyPI
28
+
29
+ ```bash
30
+ pip install -U asimov-brightdata-module
31
+ ```
32
+
33
+ ### Installation from RubyGems
34
+
35
+ ```bash
36
+ gem install asimov-brightdata-module
37
+ ```
38
+
39
+ ### Installation from NPM
40
+
41
+ ```bash
42
+ npm install -g asimov-brightdata-module
43
+ ```
44
+
45
+ ### Installation from Source Code
46
+
47
+ ```bash
48
+ cargo install asimov-brightdata-module
49
+ ```
50
+
51
+ ## 👉 Examples
52
+
53
+ ```bash
54
+ export BRIGHTDATA_API_KEY="..."
55
+ ```
56
+
57
+ ### Fetching X Profiles
58
+
59
+ ```bash
60
+ asimov-brightdata-fetcher https://x.com/bright_init # JSON
61
+ asimov-brightdata-importer https://x.com/bright_init # JSON-LD
62
+ ```
63
+
64
+ ### Fetching LinkedIn Profiles
65
+
66
+ ```bash
67
+ asimov-brightdata-fetcher https://www.linkedin.com/in/orlenchner/
68
+ asimov-brightdata-fetcher https://www.linkedin.com/company/bright-data/
69
+ ```
70
+
71
+ ### Fetching Crunchbase Profiles
72
+
73
+ ```bash
74
+ asimov-brightdata-fetcher https://www.crunchbase.com/organization/brightdata
75
+ ```
76
+
77
+ ### Fetching Amazon Products
78
+
79
+ ```bash
80
+ asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279
81
+ ```
82
+
83
+ ## ⚙ Configuration
84
+
85
+ ### Environment Variables
86
+
87
+ - `BRIGHTDATA_API_KEY`: (required) the [Bright Data API key] to use
88
+
89
+ ## 📚 Reference
90
+
91
+ ### Installed Binaries
92
+
93
+ - `asimov-brightdata-cataloger`: discovers entities via the Bright Data API
94
+ _(not implemented yet)_
95
+ - `asimov-brightdata-fetcher`: collects JSON data from the Bright Data API
96
+ - `asimov-brightdata-importer`: collects and transforms JSON into JSON-LD
97
+ _(not implemented yet)_
98
+
99
+ ### Supported Datasets
100
+
101
+ Dataset | URL Prefix | JSON | RDF
102
+ :------ | :--------- | :--: | :--:
103
+ Airbnb | `https://www.airbnb.com/rooms/` | ✅ | 🚧
104
+ Amazon | `https://www.amazon.com/` | ✅ | 🚧
105
+ &nbsp; | `https://www.amazon.com/sp?seller=` | ✅ | 🚧
106
+ Crunchbase | `https://www.crunchbase.com/organization/` | ✅ | 🚧
107
+ eBay | `https://www.ebay.com/itm/` | ✅ | 🚧
108
+ Facebook | `https://www.facebook.com/events/` | ✅ | 🚧
109
+ &nbsp; | `https://www.facebook.com/groups/` | ✅ | 🚧
110
+ &nbsp; | `https://www.facebook.com/marketplace/item/` | ✅ | 🚧
111
+ &nbsp; | `https://www.facebook.com/share/p/` | ✅ | 🚧
112
+ Google | `https://www.google.com/shopping/product/` | ✅ | 🚧
113
+ Indeed | `https://www.indeed.com/cmp/` | ✅ | 🚧
114
+ Instagram | `https://www.instagram.com/` | ✅ | 🚧
115
+ &nbsp; | `https://www.instagram.com/p/` | ✅ | 🚧
116
+ &nbsp; | `https://www.instagram.com/reel/` | ✅ | 🚧
117
+ LinkedIn | `https://www.linkedin.com/company/` | ✅ | 🚧
118
+ &nbsp; | `https://www.linkedin.com/in/` | ✅ | 🚧
119
+ &nbsp; | `https://www.linkedin.com/jobs/` | ✅ | 🚧
120
+ &nbsp; | `https://www.linkedin.com/posts/` | ✅ | 🚧
121
+ &nbsp; | `https://www.linkedin.com/pulse/` | ✅ | 🚧
122
+ Walmart | `https://www.walmart.com/global/seller/` | ✅ | 🚧
123
+ &nbsp; | `https://www.walmart.com/ip/` | ✅ | 🚧
124
+ X (Twitter) | `https://x.com/` | ✅ | ✅
125
+ Yahoo | `https://finance.yahoo.com/quote/` | ✅ | 🚧
126
+ YouTube | `https://www.youtube.com/@` | ✅ | 🚧
127
+ &nbsp; | `https://www.youtube.com/watch?v=` | ✅ | 🚧
128
+ <img width="100" height="1"/> | <img width="550" height="1"/> | <img width="50" height="1"/> | <img width="50" height="1"/>
129
+
130
+ ## 👨‍💻 Development
131
+
132
+ ```bash
133
+ git clone https://github.com/asimov-modules/asimov-brightdata-module.git
134
+ ```
135
+
136
+ ---
137
+
138
+ [![Share on X](https://img.shields.io/badge/share%20on-x-03A9F4?logo=x)](https://x.com/intent/post?url=https://github.com/asimov-modules/asimov-brightdata-module&text=asimov-brightdata-module)
139
+ [![Share on Reddit](https://img.shields.io/badge/share%20on-reddit-red?logo=reddit)](https://reddit.com/submit?url=https://github.com/asimov-modules/asimov-brightdata-module&title=asimov-brightdata-module)
140
+ [![Share on Hacker News](https://img.shields.io/badge/share%20on-hn-orange?logo=ycombinator)](https://news.ycombinator.com/submitlink?u=https://github.com/asimov-modules/asimov-brightdata-module&t=asimov-brightdata-module)
141
+ [![Share on Facebook](https://img.shields.io/badge/share%20on-fb-1976D2?logo=facebook)](https://www.facebook.com/sharer/sharer.php?u=https://github.com/asimov-modules/asimov-brightdata-module)
142
+ [![Share on LinkedIn](https://img.shields.io/badge/share%20on-linkedin-3949AB?logo=linkedin)](https://www.linkedin.com/sharing/share-offsite/?url=https://github.com/asimov-modules/asimov-brightdata-module)
143
+
144
+ [ASIMOV]: https://github.com/asimov-platform
145
+ [Bright Data]: https://brightdata.com/products/web-scraper
146
+ [Bright Data API key]: https://docs.brightdata.com/general/account/api-token
147
+ [JSON-LD]: https://json-ld.org
148
+ [KNOW]: https://github.com/know-ontology
149
+ [NPM]: https:/npmjs.org
150
+ [Python]: https://python.org
151
+ [RDF]: https://github.com/rust-rdf
152
+ [Ruby]: https://ruby-lang.org
153
+ [Rust]: https://rust-lang.org
data/UNLICENSE ADDED
@@ -0,0 +1,24 @@
1
+ This is free and unencumbered software released into the public domain.
2
+
3
+ Anyone is free to copy, modify, publish, use, compile, sell, or
4
+ distribute this software, either in source code form or as a compiled
5
+ binary, for any purpose, commercial or non-commercial, and by any
6
+ means.
7
+
8
+ In jurisdictions that recognize copyright laws, the author or authors
9
+ of this software dedicate any and all copyright interest in the
10
+ software to the public domain. We make this dedication for the benefit
11
+ of the public at large and to the detriment of our heirs and
12
+ successors. We intend this dedication to be an overt act of
13
+ relinquishment in perpetuity of all present and future rights to this
14
+ software under copyright law.
15
+
16
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
17
+ EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
18
+ MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.
19
+ IN NO EVENT SHALL THE AUTHORS BE LIABLE FOR ANY CLAIM, DAMAGES OR
20
+ OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
21
+ ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
22
+ OTHER DEALINGS IN THE SOFTWARE.
23
+
24
+ For more information, please refer to <https://unlicense.org/>
data/VERSION ADDED
@@ -0,0 +1 @@
1
+ 0.0.3
@@ -0,0 +1,8 @@
1
+ #!/usr/bin/env ruby
2
+ require_relative "../lib/asimov-brightdata-module"
3
+ argv0 = ::ASIMOV_Brightdata_Module::Fetcher.path
4
+ if ::Gem.win_platform?
5
+ ::Kernel.exit(::Kernel.system(argv0, *ARGV) ? 0 : $?.exitstatus)
6
+ else
7
+ ::Kernel.exec(argv0, *ARGV)
8
+ end
@@ -0,0 +1,8 @@
1
+ #!/usr/bin/env ruby
2
+ require_relative "../lib/asimov-brightdata-module"
3
+ argv0 = ::ASIMOV_Brightdata_Module::Importer.path
4
+ if ::Gem.win_platform?
5
+ ::Kernel.exit(::Kernel.system(argv0, *ARGV) ? 0 : $?.exitstatus)
6
+ else
7
+ ::Kernel.exec(argv0, *ARGV)
8
+ end
@@ -0,0 +1,23 @@
1
+ # This is free and unencumbered software released into the public domain.
2
+
3
+ module ASIMOV_Brightdata_Module; end
4
+
5
+ class ASIMOV_Brightdata_Module::Fetcher
6
+ def self.path
7
+ ::File.join(__dir__, "asimov-brightdata-fetcher")
8
+ end
9
+
10
+ def self.execute(...)
11
+ self.new(...).execute
12
+ end
13
+
14
+ def initialize(*args, env: {})
15
+ @argv0 = self.class.path.freeze
16
+ @args = args.map(&:to_s).freeze
17
+ @env = env.to_h.freeze
18
+ end
19
+
20
+ def execute(*args)
21
+ ::Kernel.system(@env, @argv0, *(@args + args.map(&:to_s))) ? 0 : $?.exitstatus
22
+ end
23
+ end
@@ -0,0 +1,23 @@
1
+ # This is free and unencumbered software released into the public domain.
2
+
3
+ module ASIMOV_Brightdata_Module; end
4
+
5
+ class ASIMOV_Brightdata_Module::Importer
6
+ def self.path
7
+ ::File.join(__dir__, "asimov-brightdata-importer")
8
+ end
9
+
10
+ def self.execute(...)
11
+ self.new(...).execute
12
+ end
13
+
14
+ def initialize(*args, env: {})
15
+ @argv0 = self.class.path.freeze
16
+ @args = args.map(&:to_s).freeze
17
+ @env = env.to_h.freeze
18
+ end
19
+
20
+ def execute(*args)
21
+ ::Kernel.system(@env, @argv0, *(@args + args.map(&:to_s))) ? 0 : $?.exitstatus
22
+ end
23
+ end
@@ -0,0 +1,8 @@
1
+ # This is free and unencumbered software released into the public domain.
2
+
3
+ module ASIMOV_Brightdata_Module
4
+ VERSION = "0.0.3".freeze
5
+ end
6
+
7
+ require_relative 'asimov-brightdata-module/fetcher'
8
+ require_relative 'asimov-brightdata-module/importer'
metadata ADDED
@@ -0,0 +1,58 @@
1
+ --- !ruby/object:Gem::Specification
2
+ name: asimov-brightdata-module
3
+ version: !ruby/object:Gem::Version
4
+ version: 0.0.3
5
+ platform: arm64-darwin
6
+ authors:
7
+ - ASIMOV Protocol
8
+ bindir: bin
9
+ cert_chain: []
10
+ date: 2025-05-29 00:00:00.000000000 Z
11
+ dependencies: []
12
+ description: ASIMOV module for data import powered by the Bright Data web data platform.
13
+ email: support@asimov.so
14
+ executables:
15
+ - asimov-brightdata-fetcher
16
+ - asimov-brightdata-importer
17
+ extensions: []
18
+ extra_rdoc_files: []
19
+ files:
20
+ - AUTHORS
21
+ - CHANGES.md
22
+ - README.md
23
+ - UNLICENSE
24
+ - VERSION
25
+ - bin/asimov-brightdata-fetcher
26
+ - bin/asimov-brightdata-importer
27
+ - lib/asimov-brightdata-module.rb
28
+ - lib/asimov-brightdata-module/asimov-brightdata-fetcher
29
+ - lib/asimov-brightdata-module/asimov-brightdata-importer
30
+ - lib/asimov-brightdata-module/fetcher.rb
31
+ - lib/asimov-brightdata-module/importer.rb
32
+ homepage: https://github.com/asimov-modules
33
+ licenses:
34
+ - Unlicense
35
+ metadata:
36
+ homepage_uri: https://github.com/asimov-modules
37
+ source_code_uri: https://github.com/asimov-modules/asimov-brightdata-module
38
+ changelog_uri: https://github.com/asimov-modules/asimov-brightdata-module/blob/master/CHANGES.md
39
+ bug_tracker_uri: https://github.com/asimov-modules/asimov-brightdata-module/issues
40
+ documentation_uri: https://github.com/asimov-modules/asimov-brightdata-module/blob/master/README.md
41
+ rdoc_options: []
42
+ require_paths:
43
+ - lib
44
+ required_ruby_version: !ruby/object:Gem::Requirement
45
+ requirements:
46
+ - - ">="
47
+ - !ruby/object:Gem::Version
48
+ version: '3.0'
49
+ required_rubygems_version: !ruby/object:Gem::Requirement
50
+ requirements:
51
+ - - ">="
52
+ - !ruby/object:Gem::Version
53
+ version: '0'
54
+ requirements: []
55
+ rubygems_version: 3.6.9
56
+ specification_version: 4
57
+ summary: ASIMOV module for data import powered by the Bright Data web data platform.
58
+ test_files: []