abstract-webtools 0.1.6.114__tar.gz → 0.1.6.116__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/PKG-INFO +1 -1
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/setup.py +1 -1
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/k2s_downloader.py +15 -4
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/PKG-INFO +1 -1
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/README.md +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/pyproject.toml +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/setup.cfg +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/abstract_usurpit.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/abstract_webtools.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/big_user_agent_list.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/domain_identifier.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/extention_list.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/find_dirs.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/main.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/allss//.py" +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/cipherManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/crawlManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/crawlmgr2.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/curlMgr.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/domainManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/dynamicRateLimiter.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/get_test.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/linkManager/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/linkManager/linkManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/mySocketClient.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/networkManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/requestManager/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/requestManager/requestManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/seleniumManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/soupManager/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/soupManager/asoueces.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/soupManager/soupManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/sslManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/tlsAdapter.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/urlManager/__init__.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/urlManager/urlManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/userAgentManager.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/videoDownloader.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/managers/videoDownloader2.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/soup_gui.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/url_grabber.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/url_grabber_new.py +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/SOURCES.txt +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/dependency_links.txt +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/requires.txt +0 -0
- {abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: abstract_webtools
|
3
|
-
Version: 0.1.6.
|
3
|
+
Version: 0.1.6.116
|
4
4
|
Summary: Abstract Web Tools is a Python package that provides various utility functions for web scraping tasks. It is built on top of popular libraries such as `requests`, `BeautifulSoup`, and `urllib3` to simplify the process of fetching and parsing web content.
|
5
5
|
Home-page: https://github.com/AbstractEndeavors/abstract_essentials/tree/main/abstract_webtools
|
6
6
|
Author: putkoff
|
@@ -4,7 +4,7 @@ with open("README.md", "r", encoding="utf-8") as fh:
|
|
4
4
|
long_description = fh.read()
|
5
5
|
setuptools.setup(
|
6
6
|
name='abstract_webtools',
|
7
|
-
version='0.1.6.
|
7
|
+
version='0.1.6.116',
|
8
8
|
author='putkoff',
|
9
9
|
author_email='partners@abstractendeavors.com',
|
10
10
|
description='Abstract Web Tools is a Python package that provides various utility functions for web scraping tasks. It is built on top of popular libraries such as `requests`, `BeautifulSoup`, and `urllib3` to simplify the process of fetching and parsing web content.',
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/k2s_downloader.py
RENAMED
@@ -32,6 +32,16 @@ class K2SDownloader:
|
|
32
32
|
options = webdriver.ChromeOptions()
|
33
33
|
options.add_argument("--disable-blink-features=AutomationControlled")
|
34
34
|
options.add_argument("--headless")
|
35
|
+
|
36
|
+
# Configure download preferences
|
37
|
+
prefs = {
|
38
|
+
"download.default_directory": self.download_dir, # Set custom download directory
|
39
|
+
"download.prompt_for_download": False, # Disable download prompt
|
40
|
+
"download.directory_upgrade": True, # Allow directory override
|
41
|
+
"safebrowsing.enabled": True # Enable safe browsing
|
42
|
+
}
|
43
|
+
options.add_experimental_option("prefs", prefs)
|
44
|
+
|
35
45
|
return webdriver.Chrome(options=options)
|
36
46
|
|
37
47
|
def login(self):
|
@@ -58,10 +68,10 @@ class K2SDownloader:
|
|
58
68
|
# with open('login_error.html', 'w', encoding='utf-8') as f:
|
59
69
|
# f.write(self.driver.page_source)
|
60
70
|
|
61
|
-
def download_file(self, url):
|
71
|
+
def download_file(self, url,download_dir=None):
|
62
72
|
if not self.logged_in:
|
63
73
|
self.login()
|
64
|
-
|
74
|
+
download_dir = download_dir or self.download_dir
|
65
75
|
print(f"Navigating to: {url}")
|
66
76
|
self.driver.get(url)
|
67
77
|
time.sleep(5)
|
@@ -80,7 +90,7 @@ class K2SDownloader:
|
|
80
90
|
if download_url:
|
81
91
|
response = self.session.get(download_url, stream=True)
|
82
92
|
file_name = self._extract_filename(response, download_url)
|
83
|
-
file_path = os.path.join(
|
93
|
+
file_path = os.path.join(download_dir, file_name)
|
84
94
|
if not os.path.isfile(file_path):
|
85
95
|
with open(file_path, 'wb') as f:
|
86
96
|
for chunk in response.iter_content(chunk_size=8192):
|
@@ -119,6 +129,7 @@ class dlsManager:
|
|
119
129
|
def __init__(self, downloader):
|
120
130
|
self.downloader = downloader
|
121
131
|
self.json_file_path = self.downloader.json_file_path
|
132
|
+
self.download_dir = self.downloader.download_dir
|
122
133
|
all_dls= None
|
123
134
|
if self.json_file_path:
|
124
135
|
all_dls = safe_read_from_json(self.json_file_path)
|
@@ -134,7 +145,7 @@ class dlsManager:
|
|
134
145
|
def dl_k2s_link(self, k2s_link):
|
135
146
|
if k2s_link:
|
136
147
|
print(f"Downloading: {k2s_link}")
|
137
|
-
self.downloader.download_file(k2s_link)
|
148
|
+
self.downloader.download_file(k2s_link,self.download_dir)
|
138
149
|
time.sleep(10)
|
139
150
|
if self.json_file_path:
|
140
151
|
self.all_dls.append(self.last_data)
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools.egg-info/PKG-INFO
RENAMED
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: abstract_webtools
|
3
|
-
Version: 0.1.6.
|
3
|
+
Version: 0.1.6.116
|
4
4
|
Summary: Abstract Web Tools is a Python package that provides various utility functions for web scraping tasks. It is built on top of popular libraries such as `requests`, `BeautifulSoup`, and `urllib3` to simplify the process of fetching and parsing web content.
|
5
5
|
Home-page: https://github.com/AbstractEndeavors/abstract_essentials/tree/main/abstract_webtools
|
6
6
|
Author: putkoff
|
File without changes
|
File without changes
|
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/__init__.py
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/extention_list.py
RENAMED
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/find_dirs.py
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/soup_gui.py
RENAMED
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/url_grabber.py
RENAMED
File without changes
|
{abstract_webtools-0.1.6.114 → abstract_webtools-0.1.6.116}/src/abstract_webtools/url_grabber_new.py
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|